BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 016970
(379 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255576671|ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B, putative [Ricinus communis]
Length = 489
Score = 556 bits (1434), Expect = e-156, Method: Compositional matrix adjust.
Identities = 288/420 (68%), Positives = 327/420 (77%), Gaps = 44/420 (10%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGS------------------------- 35
MKGFRE+ AS+C SK DTPNRSL S E GS
Sbjct: 1 MKGFRERV-ASRCSSKCPVDTPNRSLTSDCLESGSNFSTKGSLWSSFFASAFSVFETYRE 59
Query: 36 -----------------SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHK 78
+ VK++V+ GSMRRIHERVLGPSRTGISS+TSDIWLLGVC+K
Sbjct: 60 SPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGVCYK 119
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLV 138
I++DE+ G+A N LAEF D+SSRIL++YR+GFD IGDSK SDVGWGCMLRSSQMLV
Sbjct: 120 ISEDES-GNADTGNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQMLV 178
Query: 139 AQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
AQALLFH+LGR W KP QKP D+ YVEILHLFGDSE +PFSIHNL+QAGKAY LAAGSWV
Sbjct: 179 AQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGSWV 238
Query: 199 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
GPYAMCRSWE+LAR +R E L QSLPMA+YVVSGDEDGERGGAPVV I+DASRHC F
Sbjct: 239 GPYAMCRSWESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCLEF 298
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
S+GQADWTPILLLVPLVLGL+KVNPRYIP+L+ TFTF QSLGI+GGKPGASTYIVGVQ++
Sbjct: 299 SRGQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGVQDD 358
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+A YLDPH+VQ V+NIG+DD+EADTS+YHSD++RHI L SIDPSLAIGFYCRDK F
Sbjct: 359 NAFYLDPHEVQSVVNIGRDDIEADTSSYHSDIVRHIPLHSIDPSLAIGFYCRDKDDFDEF 418
>gi|359495820|ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
gi|296086874|emb|CBI33041.3| unnamed protein product [Vitis vinifera]
Length = 486
Score = 539 bits (1388), Expect = e-151, Method: Compositional matrix adjust.
Identities = 264/358 (73%), Positives = 305/358 (85%)
Query: 15 SKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 74
S+S+P + G G + V+++VT SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54 SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113
Query: 75 VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 134
+C+KI+Q+E+ A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173
Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
QMLVAQALL HR+GR WRK KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 254
GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293
Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
C FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353
Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
VQ+E A YLDPH+ Q V++I +++LEADTS+YH ++IRHI LDSIDPSLAIGFYCRDK
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNIIRHICLDSIDPSLAIGFYCRDK 411
>gi|224092798|ref|XP_002309707.1| predicted protein [Populus trichocarpa]
gi|222852610|gb|EEE90157.1| predicted protein [Populus trichocarpa]
Length = 481
Score = 536 bits (1381), Expect = e-150, Method: Compositional matrix adjust.
Identities = 261/345 (75%), Positives = 302/345 (87%)
Query: 34 GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
G + +VK++V G+MRRI ERVLG S+TGIS++TSDIWLLG +KI+QD++ G+A N
Sbjct: 78 GWTSSVKKIVAGGTMRRIQERVLGTSKTGISNTTSDIWLLGARYKISQDDSSGNADATNA 137
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
LA F++DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQMLVAQALLFHRLGR WRK
Sbjct: 138 LAAFHRDFSSRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRK 197
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P+ KP DR+YVEILHLFGDSE S FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE+LAR
Sbjct: 198 PVDKPLDRDYVEILHLFGDSEASAFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWESLARS 257
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
+R ET L Q+LPMA+YVVSG EDGERGGAPV+ I+DA+RHCS FSKG+ DWTPILLLVP
Sbjct: 258 KREETNLEYQTLPMAVYVVSGCEDGERGGAPVLSIEDAARHCSEFSKGREDWTPILLLVP 317
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
LVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGVQ+E+A YLDPH+VQPV+N
Sbjct: 318 LVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQPVVN 377
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+DD+EA+TS+YH DV+RHI LD IDPSLAIGFYCRDK F
Sbjct: 378 FSRDDVEANTSSYHCDVVRHIPLDLIDPSLAIGFYCRDKDDFDDF 422
>gi|224117658|ref|XP_002331599.1| predicted protein [Populus trichocarpa]
gi|222873995|gb|EEF11126.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 536 bits (1381), Expect = e-150, Method: Compositional matrix adjust.
Identities = 279/417 (66%), Positives = 322/417 (77%), Gaps = 45/417 (10%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSET---------------------- 38
MKGFRE+ + S ST ++PNRS S SELGS++T
Sbjct: 1 MKGFRERGFVASSKSSSTAESPNRSFTSDSSELGSADTKFSKPSLWSTFFASAFSVFDTH 60
Query: 39 -----------------------VKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGV 75
VK++V GSMRRI E VLG S+TGIS++T DIWLLG
Sbjct: 61 CDSSSTSEKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGA 120
Query: 76 CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 135
C+KI+QD + GDAA N LA FN DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQ
Sbjct: 121 CYKISQDNSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQ 180
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
MLVAQALLFHRLGR WRKPL KP DREYVEILHLFGDSE+S FSIHNLL+AGKAYGLAAG
Sbjct: 181 MLVAQALLFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAG 240
Query: 196 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 255
SWVGPYA+C SWE+L R +R ET L QSL MA+YVVSG EDGERGGAPV+CI++A+RHC
Sbjct: 241 SWVGPYAVCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHC 300
Query: 256 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
S FSKGQ DWTPILLLVPLVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGV
Sbjct: 301 SEFSKGQEDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGV 360
Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
Q+E+A YLDPH+VQPV+N+ +DD+EA+TS+YH +V+RH+ LD IDPSLAIGFYCRDK
Sbjct: 361 QDENAFYLDPHEVQPVVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAIGFYCRDK 417
>gi|147862867|emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
Length = 489
Score = 535 bits (1377), Expect = e-149, Method: Compositional matrix adjust.
Identities = 265/361 (73%), Positives = 305/361 (84%), Gaps = 3/361 (0%)
Query: 15 SKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 74
S+S+P + G G + V+++VT SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54 SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113
Query: 75 VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 134
+C+KI+Q+E+ A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173
Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
QMLVAQALL HR+GR WRK KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 254
GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293
Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
C FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353
Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYH---SDVIRHIHLDSIDPSLAIGFYCRD 371
VQ+E A YLDPH+ Q V++I +++LEADTS+YH S +IRHI LDSIDPSLAIGFYCRD
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRD 413
Query: 372 K 372
K
Sbjct: 414 K 414
>gi|449442361|ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
gi|449512710|ref|XP_004164121.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
Length = 483
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 262/341 (76%), Positives = 292/341 (85%)
Query: 38 TVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEF 97
TV++++T+GSMRRI ER+LG R+G+ SS DIWLLGVCHKI+QD DAA + G+A +
Sbjct: 78 TVRKVMTSGSMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKISQDHPPDDAASSPGVAGY 137
Query: 98 NQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 157
QDFSSRIL++YRKGF I DSK TSDV WGCMLRSSQMLVAQALLFHRLGR WRKP QK
Sbjct: 138 EQDFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPSQK 197
Query: 158 PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
P D+EYVEILHLFGDSETS FSIHNLLQAG+AY LAAGSWVGPYAMCRSWE L R +R
Sbjct: 198 PLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGPYAMCRSWETLVRSKRET 257
Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 277
L Q LPMAIY+VSGDEDGERGGAPV+ IDDASRHC FSKGQ DW+PILLLVPLVLG
Sbjct: 258 PILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSKGQHDWSPILLLVPLVLG 317
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
LEK+NPRYIP+LR TFTFPQSLGI+GGKPGASTYIVGVQ+E+A YLDPH+VQ V+NI KD
Sbjct: 318 LEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQQVVNIDKD 377
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
DLEADTS+YH +VIRHI L+SIDPSLAIGFYCRDK F
Sbjct: 378 DLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNF 418
>gi|356568569|ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
Length = 485
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 251/340 (73%), Positives = 280/340 (82%), Gaps = 4/340 (1%)
Query: 34 GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
G + V+++VT GSMRR ERVLG SRT ISSS DIWLLGVCHKI+Q E+ G +NG
Sbjct: 79 GWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQESTGGVDTSNG 138
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
LA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQMLVAQALLFH+LGR WRK
Sbjct: 139 LASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRK 198
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSWVGPYAMCR+WE LA
Sbjct: 199 PIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLA-- 256
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
R + LG LPMAIYVVSGDEDGERGGAPVVCI+DAS+ CS FS G A WTP+LLLVP
Sbjct: 257 -RKKNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCSEFSSGLAVWTPLLLLVP 315
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
LVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+GVQ E A YLDPHDVQ V+N
Sbjct: 316 LVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQNEKAFYLDPHDVQQVVN 375
Query: 334 IGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
I D E TS+YH +V+RHI LDSIDPSLAIGFYCRDK
Sbjct: 376 ISGDTQEPTGTSSYHCNVMRHIPLDSIDPSLAIGFYCRDK 415
>gi|356531828|ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
Length = 486
Score = 489 bits (1260), Expect = e-136, Method: Compositional matrix adjust.
Identities = 248/340 (72%), Positives = 278/340 (81%), Gaps = 4/340 (1%)
Query: 34 GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
G + V+++VT GSMRR ERVLG SRT ISSS DIWLLGVCHKI+Q E+ G +NG
Sbjct: 79 GWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQESSGGVDNSNG 138
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
LA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQMLVAQALLFH+LGR WRK
Sbjct: 139 LASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQMLVAQALLFHKLGRSWRK 198
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSWVGPYAMCR+WE LA
Sbjct: 199 PIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLA-- 256
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
R + LG LPMAIYVVSGDEDGERGGAPVVCI+DAS+ C FS G A WTP+LLLVP
Sbjct: 257 -RKKNDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCFEFSSGLAAWTPLLLLVP 315
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
LVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+G Q E A YLDPHDVQ V+N
Sbjct: 316 LVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGAQNEKAFYLDPHDVQQVVN 375
Query: 334 IGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
I D E TS+YH +++RHI LDSIDPSLAIGFYCRDK
Sbjct: 376 ISGDTQEPTSTSSYHCNIMRHIPLDSIDPSLAIGFYCRDK 415
>gi|357507987|ref|XP_003624282.1| Cysteine protease ATG4 [Medicago truncatula]
gi|147742964|sp|A2Q1V6.1|ATG4_MEDTR RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|124359485|gb|ABN05923.1| Peptidase C54 [Medicago truncatula]
gi|355499297|gb|AES80500.1| Cysteine protease ATG4 [Medicago truncatula]
Length = 487
Score = 481 bits (1237), Expect = e-133, Method: Compositional matrix adjust.
Identities = 240/339 (70%), Positives = 274/339 (80%)
Query: 34 GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
G + V+++V+ GSMRR ERVLG RT +SSS DIWLLGVCHKI+Q E+ GD N
Sbjct: 79 GWAAAVRKVVSGGSMRRFQERVLGSCRTDVSSSDGDIWLLGVCHKISQHESTGDVDIRNV 138
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
A F QDF SRILI+YRKGFD I DSK TSDV WGCMLRSSQMLVAQALLFH+LGR WRK
Sbjct: 139 FAAFEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRK 198
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
+ KP D+EY++IL LFGDSE + FSIHNLLQAGK YGLA GSWVGPYAMCR+WE LAR
Sbjct: 199 TVDKPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLARN 258
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
QR + G Q LPMAIYVVSGDEDGERGGAPVVCI+DA + C FS+G WTP+LLLVP
Sbjct: 259 QREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCIEDACKRCLEFSRGLVPWTPLLLLVP 318
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
LVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ + A YLDPH+V+PV+N
Sbjct: 319 LVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQNDKAFYLDPHEVKPVVN 378
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
I D E +TS+YH ++ RH+ LDSIDPSLAIGFYCRDK
Sbjct: 379 ITGDTQEPNTSSYHCNISRHMPLDSIDPSLAIGFYCRDK 417
>gi|30689628|ref|NP_850412.1| cysteine protease ATG4a [Arabidopsis thaliana]
gi|75160546|sp|Q8S929.1|ATG4A_ARATH RecName: Full=Cysteine protease ATG4a; AltName:
Full=Autophagy-related protein 4 homolog a;
Short=AtAPG4a; Short=Protein autophagy 4a
gi|19912143|dbj|BAB88383.1| autophagy 4a [Arabidopsis thaliana]
gi|110742303|dbj|BAE99076.1| hypothetical protein [Arabidopsis thaliana]
gi|330255286|gb|AEC10380.1| cysteine protease ATG4a [Arabidopsis thaliana]
Length = 467
Score = 463 bits (1192), Expect = e-128, Method: Compositional matrix adjust.
Identities = 215/340 (63%), Positives = 275/340 (80%), Gaps = 2/340 (0%)
Query: 34 GSSETVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
G + VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC+KI+ DE G+
Sbjct: 74 GWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGETDTGT 133
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQML AQALLFHRLGR W
Sbjct: 134 VLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWT 193
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
K + P ++EY+E L FGDSE S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA
Sbjct: 194 KKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLAC 252
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
+R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C FSKGQ++WTPI+LLV
Sbjct: 253 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPIILLV 312
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQE+ YLDPH+VQ V+
Sbjct: 313 PLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 372
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ K+ + DTS+YH +V+R++ L+S+DPSLA+GFYCRDK
Sbjct: 373 TVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDK 412
>gi|42571227|ref|NP_973687.1| cysteine protease ATG4a [Arabidopsis thaliana]
gi|330255287|gb|AEC10381.1| cysteine protease ATG4a [Arabidopsis thaliana]
Length = 422
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 215/340 (63%), Positives = 275/340 (80%), Gaps = 2/340 (0%)
Query: 34 GSSETVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
G + VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC+KI+ DE G+
Sbjct: 29 GWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGETDTGT 88
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQML AQALLFHRLGR W
Sbjct: 89 VLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWT 148
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
K + P ++EY+E L FGDSE S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA
Sbjct: 149 KKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLAC 207
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
+R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C FSKGQ++WTPI+LLV
Sbjct: 208 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPIILLV 267
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQE+ YLDPH+VQ V+
Sbjct: 268 PLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 327
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ K+ + DTS+YH +V+R++ L+S+DPSLA+GFYCRDK
Sbjct: 328 TVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDK 367
>gi|297828133|ref|XP_002881949.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
gi|297327788|gb|EFH58208.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
Length = 467
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 220/340 (64%), Positives = 279/340 (82%), Gaps = 2/340 (0%)
Query: 34 GSSETVKRLVTA-GSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
G + VKR+ A G++RR ERVLGP+RTG+ S+TSD+WLLGVC+KI++DEA G+
Sbjct: 74 GWTAFVKRVSMATGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISEDEASGETNTGC 133
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
LA F QDFSS+IL++YR+GF+P D+ TSDV WGCM+RSSQML AQALLFHRLGR W
Sbjct: 134 VLAAFQQDFSSKILMTYRRGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRSWT 193
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
K + P ++EY+E L FGDSE+S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA
Sbjct: 194 KKSELP-EQEYLETLEPFGDSESSAFSIHNLIIAGSSYGLAAGSWVGPYAICRAWESLAC 252
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
+R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C FSKGQ++WTPILLLV
Sbjct: 253 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPILLLV 312
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQE+ YLDPH+VQ V+
Sbjct: 313 PLVLGLDSVNPRYIPSLIATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 372
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ K+ + DTS+YH +VIR++ L+S+DPSLA+GFYCRDK
Sbjct: 373 TVNKETPDVDTSSYHCNVIRYVPLESLDPSLALGFYCRDK 412
>gi|297820846|ref|XP_002878306.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
gi|297324144|gb|EFH54565.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
Length = 476
Score = 442 bits (1138), Expect = e-122, Method: Compositional matrix adjust.
Identities = 214/330 (64%), Positives = 271/330 (82%)
Query: 43 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QDFS
Sbjct: 86 MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEAESFEEADAGRVLAAFRQDFS 145
Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
S IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P + +
Sbjct: 146 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPPNEK 205
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET +
Sbjct: 206 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDVKH 265
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G +W PILLLVPLVLGL+KVN
Sbjct: 266 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGDTEWPPILLLVPLVLGLDKVN 325
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
PRYIP+L TFTFPQSLGI+GGKPGASTYIVGVQE+ YLDPHDVQ V+ + K++ + D
Sbjct: 326 PRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVD 385
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
TS+YH + +R++ L+S+DPSLA+GFYC+DK
Sbjct: 386 TSSYHCNTLRYVPLESLDPSLALGFYCQDK 415
>gi|388514549|gb|AFK45336.1| unknown [Lotus japonicus]
Length = 489
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 241/416 (57%), Positives = 284/416 (68%), Gaps = 44/416 (10%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSETV-------KRLVTAG-SMRRIH 52
+K F ++ A+KC SKS+ +T + S S+ GSS++ T+G S+ +
Sbjct: 3 LKAFCDRIVAAKCSSKSSTETVDNSQVPACSKAGSSDSKFPKASLWSSFFTSGFSVIETY 62
Query: 53 ERVLGPSRTGISSSTSD----------IWL--------------------------LGVC 76
+ + + S S WL LGVC
Sbjct: 63 SKSPASEKKAVHSQNSGWGCCCEESCYCWLNEEIPRACTLGQAELTFQALMVIYGFLGVC 122
Query: 77 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 136
HK +Q E+ GD + A F QDFSS+IL++YRKGFD IGDSK TSDV WGCMLRSSQM
Sbjct: 123 HKFSQQESTGDVDNSTVFAAFEQDFSSKILLTYRKGFDAIGDSKYTSDVNWGCMLRSSQM 182
Query: 137 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 196
LVAQALLFH+LGR WRK KP D+EY++IL FGDSE S FSIHNLLQAGK YGLA GS
Sbjct: 183 LVAQALLFHKLGRMWRKTTDKPLDKEYLDILQHFGDSEASSFSIHNLLQAGKGYGLAVGS 242
Query: 197 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 256
WVGPYAMCRSWE LAR QR G Q LPMA+YVVSGDEDGERGGAPVVCI+DASR CS
Sbjct: 243 WVGPYAMCRSWEVLARNQRETNDHGEQPLPMALYVVSGDEDGERGGAPVVCIEDASRRCS 302
Query: 257 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 316
FS+G A WTP+LLLVPLVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ
Sbjct: 303 EFSRGLAAWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQ 362
Query: 317 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
E A YLDPHDVQPV++I D + +TS+YH +++R + LDSIDPSLAIGFYCRDK
Sbjct: 363 NEKAFYLDPHDVQPVVHINGDAQDPNTSSYHCNIVRQMPLDSIDPSLAIGFYCRDK 418
>gi|15232213|ref|NP_191554.1| cysteine protease ATG4b [Arabidopsis thaliana]
gi|75182325|sp|Q9M1Y0.1|ATG4B_ARATH RecName: Full=Cysteine protease ATG4b; AltName:
Full=Autophagy-related protein 4 homolog b;
Short=AtAPG4b; Short=Protein autophagy 4b
gi|7019689|emb|CAB75814.1| putative protein [Arabidopsis thaliana]
gi|19912145|dbj|BAB88384.1| autophagy 4b [Arabidopsis thaliana]
gi|110742150|dbj|BAE99003.1| hypothetical protein [Arabidopsis thaliana]
gi|332646468|gb|AEE79989.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 477
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 213/330 (64%), Positives = 270/330 (81%)
Query: 43 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QDFS
Sbjct: 87 MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 146
Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
S IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D +
Sbjct: 147 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 206
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 207 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 266
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 267 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 326
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
PRYIP+L TFTFPQSLGI+GGKPGASTYIVGVQE+ YLDPHDVQ V+ + K++ + D
Sbjct: 327 PRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVD 386
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
TS+YH + +R++ L+S+DPSLA+GFYC+ K
Sbjct: 387 TSSYHCNTLRYVPLESLDPSLALGFYCQHK 416
>gi|115461386|ref|NP_001054293.1| Os04g0682000 [Oryza sativa Japonica Group]
gi|75143803|sp|Q7XPW8.1|ATG4B_ORYSJ RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|32488637|emb|CAE03430.1| OSJNBa0032F06.13 [Oryza sativa Japonica Group]
gi|82470053|gb|ABB77259.1| autophagy 4 [Oryza sativa Indica Group]
gi|113565864|dbj|BAF16207.1| Os04g0682000 [Oryza sativa Japonica Group]
gi|215697216|dbj|BAG91210.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 478
Score = 430 bits (1106), Expect = e-118, Method: Compositional matrix adjust.
Identities = 221/361 (61%), Positives = 274/361 (75%), Gaps = 9/361 (2%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + N+S S S ++R+V +GSM R LG S+ SS D+W L
Sbjct: 56 FEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF----LGTSKVLTSS---DVWFL 108
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408
Query: 372 K 372
K
Sbjct: 409 K 409
>gi|3212867|gb|AAC23418.1| unknown protein [Arabidopsis thaliana]
Length = 451
Score = 430 bits (1105), Expect = e-118, Method: Compositional matrix adjust.
Identities = 203/340 (59%), Positives = 262/340 (77%), Gaps = 18/340 (5%)
Query: 34 GSSETVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
G + VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC+KI+ DE G+
Sbjct: 74 GWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGETDTGT 133
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQML AQ
Sbjct: 134 VLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQLP---------- 183
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
++EY+E L FGDSE S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA
Sbjct: 184 -------EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLAC 236
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
+R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C FSKGQ++WTPI+LLV
Sbjct: 237 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPIILLV 296
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQE+ YLDPH+VQ V+
Sbjct: 297 PLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 356
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ K+ + DTS+YH +V+R++ L+S+DPSLA+GFYCRDK
Sbjct: 357 TVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDK 396
>gi|222629790|gb|EEE61922.1| hypothetical protein OsJ_16662 [Oryza sativa Japonica Group]
Length = 892
Score = 430 bits (1105), Expect = e-118, Method: Compositional matrix adjust.
Identities = 220/361 (60%), Positives = 275/361 (76%), Gaps = 9/361 (2%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + N+S S S ++R+V +GSM R LG S+ ++SD+W L
Sbjct: 56 FEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF----LGTSKV---LTSSDVWFL 108
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408
Query: 372 K 372
K
Sbjct: 409 K 409
>gi|147742963|sp|Q2XPP4.2|ATG4B_ORYSI RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B;
Short=Protein autophagy 4; AltName: Full=OsAtg4
Length = 478
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 219/361 (60%), Positives = 272/361 (75%), Gaps = 9/361 (2%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + N+S S ++R+V +GSM R LG S+ SS D+W L
Sbjct: 56 FEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF----LGTSKVLTSS---DVWFL 108
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408
Query: 372 K 372
K
Sbjct: 409 K 409
>gi|357166768|ref|XP_003580841.1| PREDICTED: cysteine protease ATG4B-like [Brachypodium distachyon]
Length = 493
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/339 (63%), Positives = 261/339 (76%), Gaps = 9/339 (2%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 95
S ++R V GSM R LG ++ + D+W LG C+K + +E+ D ++G A
Sbjct: 90 SRALRRFVGGGSMWRF----LGCAKV---LTNGDVWFLGKCYKFSSEESSSDLDTDSGHA 142
Query: 96 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
F +DFSSRI ++YRKGFD I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AFLEDFSSRIWVTYRKGFDAISDSKFTSDVNWGCMVRSSQMLVAQALMFHHLGRSWRKPS 202
Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 215
QKP + EY+ ILHLFGDSE FS+HNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R R
Sbjct: 203 QKPCNPEYIRILHLFGDSEVCAFSVHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNR 262
Query: 216 A--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
E G +S PMA+YVVSGDEDGERGGAPVVCID A++ C F+K Q+ W+PILLLVP
Sbjct: 263 EQPEVSNGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKDQSTWSPILLLVP 322
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
LVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STYI G+Q++ A+YLDPHDVQ +N
Sbjct: 323 LVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTYIAGIQDDRALYLDPHDVQMAVN 382
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
I D+L+ADTS+YH +R + LD +DPSLAIGFYCRDK
Sbjct: 383 IASDNLDADTSSYHCSTVRDMALDLLDPSLAIGFYCRDK 421
>gi|218195841|gb|EEC78268.1| hypothetical protein OsI_17962 [Oryza sativa Indica Group]
Length = 912
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 218/361 (60%), Positives = 273/361 (75%), Gaps = 9/361 (2%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + N+S S ++R+V +GSM R LG S+ ++SD+W L
Sbjct: 56 FEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF----LGTSKV---LTSSDVWFL 108
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408
Query: 372 K 372
K
Sbjct: 409 K 409
>gi|315259988|gb|ADT92194.1| autophagy-related 4b [Zea mays]
Length = 595
Score = 422 bits (1086), Expect = e-116, Method: Compositional matrix adjust.
Identities = 217/341 (63%), Positives = 267/341 (78%), Gaps = 10/341 (2%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
S ++R V +GSM R+ LG +R S D+W LG C++++ ++E G + ++G
Sbjct: 90 SRILRRFVGSGSMWRL----LGCARVLTSG---DVWFLGKCYRVSPEEEESGGSDSDSGH 142
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
+KP+D +Y+ +LHLFGDSE FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R
Sbjct: 203 SEKPYDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTN 262
Query: 215 R--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
R A+ G ++ PMA+YVVSGDEDGERGGAPV CID A++ CS F+KGQ W+PILLL+
Sbjct: 263 REQADAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLI 322
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ +
Sbjct: 323 PLVLGLDKINPRYIPLLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAV 382
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
+I D+LEADTS+YH V+R + L+ IDPSLAIGFYCRDKG
Sbjct: 383 DIAPDNLEADTSSYHCSVVRDLALEQIDPSLAIGFYCRDKG 423
>gi|147742949|sp|A2XHJ5.1|ATG4A_ORYSI RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|125544166|gb|EAY90305.1| hypothetical protein OsI_11880 [Oryza sativa Indica Group]
Length = 473
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 219/361 (60%), Positives = 267/361 (73%), Gaps = 9/361 (2%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + +R L S ++R+ GSM R LG S+ + ++SD+W L
Sbjct: 52 FEAHQDSSAHRPLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 104
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E + +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 105 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 164
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE FSIHNLLQAGK+YGLA
Sbjct: 165 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLA 224
Query: 194 AGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R E G + PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 225 AGSWVGPYAMCRAWQTLVRTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVA 284
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 285 AQLCCDFNKGQSTWSPILLLVPLVLGLDKLNPRYIPLLKETFTFPQSLGILGGKPGTSTY 344
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
+ GVQ++ +YLDPH+VQ ++I D+LEADTS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 345 VAGVQDDRVLYLDPHEVQLAVDIAADNLEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 404
Query: 372 K 372
K
Sbjct: 405 K 405
>gi|224994902|gb|ACN76570.1| cysteine proteinase [Triticum aestivum]
Length = 484
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 221/355 (62%), Positives = 264/355 (74%), Gaps = 9/355 (2%)
Query: 20 DTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI 79
D RS S ++R V GSM R LG G + + D+W LG C+K+
Sbjct: 65 DQSGRSGGHASGSYAWSRVLRRFVGGGSMWRF----LG---CGKALTAGDVWFLGKCYKL 117
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
+ +E+ D+ G A F +DFSSR+ I+YRKGFD I DSK+TSDV WGCM+RSSQMLVA
Sbjct: 118 SSEESSSDSDSEGGHAAFLEDFSSRVWITYRKGFDVISDSKLTSDVNWGCMVRSSQMLVA 177
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVG 199
QAL+FH LGR WRKP Q P D E+ ILHLFGDSE FSIHNLLQAGK+YGLAAGSWVG
Sbjct: 178 QALIFHHLGRSWRKPAQNPSDPEHTRILHLFGDSEVCAFSIHNLLQAGKSYGLAAGSWVG 237
Query: 200 PYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 257
PYAMCR+W+ L R R + + +S PM +YVVSGDEDGERGGAPVVCID A++ C
Sbjct: 238 PYAMCRAWQTLIRTNREQPEVINRNESFPMVLYVVSGDEDGERGGAPVVCIDVAAQLCYD 297
Query: 258 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 317
F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPGASTYI GVQ+
Sbjct: 298 FNKGQSAWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGASTYIAGVQD 357
Query: 318 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ A+YLDPH+VQ +NI D+LEADTS+YH +R + LD IDPSLAIGFYCRDK
Sbjct: 358 DRALYLDPHEVQLAVNIASDNLEADTSSYHCSTVRDMPLDLIDPSLAIGFYCRDK 412
>gi|40539015|gb|AAR87272.1| putative autophagy protein (with alternative splicing) [Oryza
sativa Japonica Group]
gi|108708572|gb|ABF96367.1| Peptidase family C54 containing protein, expressed [Oryza sativa
Japonica Group]
Length = 505
Score = 419 bits (1078), Expect = e-115, Method: Compositional matrix adjust.
Identities = 221/365 (60%), Positives = 268/365 (73%), Gaps = 9/365 (2%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + NRSL S ++R+ GSM R LG S+ + ++SD+W L
Sbjct: 53 FEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 105
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E + +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 106 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 165
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE FSIHNLLQAGK+YGLA
Sbjct: 166 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLA 225
Query: 194 AGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R E G + PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 226 AGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVA 285
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T TFPQSLGI+GGKPG STY
Sbjct: 286 AQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETLTFPQSLGILGGKPGTSTY 345
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D+LEA TS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 346 IAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSLAIGFYCRD 405
Query: 372 KGLLV 376
KG L+
Sbjct: 406 KGELL 410
>gi|216963242|gb|ACJ73913.1| autophagy-related 4a variant 2 [Zea mays]
Length = 429
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 220/360 (61%), Positives = 274/360 (76%), Gaps = 10/360 (2%)
Query: 17 STPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 76
S+P RS S G S ++R V +GSM R+ LG R ++SD+W LG C
Sbjct: 74 SSPACDARSTKSSSGSYGLSRILRRFVGSGSMWRL----LGCGRV---LTSSDVWFLGKC 126
Query: 77 HKIAQDEALGDAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 135
+K++ +E + ++ A F +DFSSRI I+YRKGFD I DSK+TSDV WGCM+RSSQ
Sbjct: 127 YKVSPEEEESGDSESDSGHAAFLEDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQ 186
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
MLVAQAL+FH LGR WRKP +KP++ +Y+ +LHLFGDSE FSIHNLLQAG+ YGLAAG
Sbjct: 187 MLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLHLFGDSEACAFSIHNLLQAGRNYGLAAG 246
Query: 196 SWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
SW+GPYAMCR+W+ L R R A+ G ++ PMA+YVVSGDEDGERGGAPVVCID A++
Sbjct: 247 SWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGERGGAPVVCIDVAAQ 306
Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI
Sbjct: 307 LCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLGILGGKPGTSTYIA 366
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
GVQ++ A+YLDPH+VQ ++I D+LEADTS+YH V+R + L+ IDPSLAIGFYCRDKG
Sbjct: 367 GVQDDRALYLDPHEVQMTVDIALDNLEADTSSYHCSVVRALALEQIDPSLAIGFYCRDKG 426
>gi|221137006|ref|NP_001137489.1| autophagy-related 4b [Zea mays]
gi|194701156|gb|ACF84662.1| unknown [Zea mays]
gi|195657359|gb|ACG48147.1| cysteine protease ATG4B [Zea mays]
gi|216963250|gb|ACJ73914.1| autophagy-related 4b variant 1 [Zea mays]
gi|413920007|gb|AFW59939.1| autophagy 4b variant 1Cysteine protease ATG4B [Zea mays]
Length = 492
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 216/346 (62%), Positives = 268/346 (77%), Gaps = 10/346 (2%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
S ++R V +GSM R+ LG +R ++ D+W LG C++++ ++E G + ++G
Sbjct: 90 SRILRRFVGSGSMWRL----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGH 142
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
+KP+D +Y+ +LHLFGDSE FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R
Sbjct: 203 SEKPYDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTN 262
Query: 215 R--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
R A+ G ++ PMA+YVVSGDEDGERGGAPV CID A++ CS F+KGQ W+PILLL+
Sbjct: 263 REQADAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLI 322
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ +
Sbjct: 323 PLVLGLDKINPRYIPLLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAV 382
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+I D+LEADTS+YH V+R + L+ IDPSLAIGFYCRDK F
Sbjct: 383 DIAPDNLEADTSSYHCSVVRDLALEQIDPSLAIGFYCRDKDDFDDF 428
>gi|221137004|ref|NP_001137488.1| autophagy-related 4 [Zea mays]
gi|195620628|gb|ACG32144.1| cysteine protease ATG4B [Zea mays]
gi|216963236|gb|ACJ73912.1| autophagy-related 4 variant 1 [Zea mays]
gi|219886349|gb|ACL53549.1| unknown [Zea mays]
gi|414584729|tpg|DAA35300.1| TPA: autophagy 4a variant 2 isoform 1 [Zea mays]
gi|414584730|tpg|DAA35301.1| TPA: autophagy 4a variant 2 isoform 2 [Zea mays]
Length = 492
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 219/359 (61%), Positives = 273/359 (76%), Gaps = 10/359 (2%)
Query: 17 STPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 76
S+P RS S G S ++R V +GSM R+ LG R ++SD+W LG C
Sbjct: 74 SSPACDARSTKSSSGSYGLSRILRRFVGSGSMWRL----LGCGRV---LTSSDVWFLGKC 126
Query: 77 HKIAQDEALGDAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 135
+K++ +E + ++ A F +DFSSRI I+YRKGFD I DSK+TSDV WGCM+RSSQ
Sbjct: 127 YKVSPEEEESGDSESDSGHAAFLEDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQ 186
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
MLVAQAL+FH LGR WRKP +KP++ +Y+ +LHLFGDSE FSIHNLLQAG+ YGLAAG
Sbjct: 187 MLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLHLFGDSEACAFSIHNLLQAGRNYGLAAG 246
Query: 196 SWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
SW+GPYAMCR+W+ L R R A+ G ++ PMA+YVVSGDEDGERGGAPVVCID A++
Sbjct: 247 SWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGERGGAPVVCIDVAAQ 306
Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI
Sbjct: 307 LCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLGILGGKPGTSTYIA 366
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
GVQ++ A+YLDPH+VQ ++I D+LEADTS+YH V+R + L+ IDPSLAIGFYCRDK
Sbjct: 367 GVQDDRALYLDPHEVQMTVDIALDNLEADTSSYHCSVVRALALEQIDPSLAIGFYCRDK 425
>gi|224994904|gb|ACN76571.1| cysteine proteinase [Triticum aestivum]
Length = 486
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 221/355 (62%), Positives = 265/355 (74%), Gaps = 9/355 (2%)
Query: 20 DTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI 79
D RS S ++R V GSM R LG G + + +D+ LG C+K+
Sbjct: 67 DQSGRSGGHASGSYAWSRVLRRFVGGGSMWRF----LG---CGKALTAADVQFLGKCYKL 119
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
+ +E+ D+ G A F +DFSSRI I+YRKGFD I DSK+TSDV WGCM+RSSQMLVA
Sbjct: 120 SSEESSSDSDSEGGHAAFLEDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVA 179
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVG 199
QAL+FH LGR WRKP Q P + EY+ ILHLFGDSE FSIHNLLQAGK+YGLAAGSWVG
Sbjct: 180 QALIFHHLGRSWRKPAQNPSNPEYIRILHLFGDSEACAFSIHNLLQAGKSYGLAAGSWVG 239
Query: 200 PYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 257
PYAMCR+W+ L R R + + +S PMA+YVVSGDEDGERGGAPVVCID A++ C
Sbjct: 240 PYAMCRAWQTLIRTNREQPEVINRNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYD 299
Query: 258 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 317
F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPGASTYI GVQ+
Sbjct: 300 FNKDQSAWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGASTYIAGVQD 359
Query: 318 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ A+YLDPH+VQ +NI D+LEADTS+YH +R + LD IDPSLAIGFYCRDK
Sbjct: 360 DRALYLDPHEVQLAVNIASDNLEADTSSYHCSTVRDMPLDLIDPSLAIGFYCRDK 414
>gi|75138024|sp|Q75KP8.1|ATG4A_ORYSJ RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|40539014|gb|AAR87271.1| putative autophagy protein (with alternative splicing) [Oryza
sativa Japonica Group]
gi|108708571|gb|ABF96366.1| Peptidase family C54 containing protein, expressed [Oryza sativa
Japonica Group]
gi|125586519|gb|EAZ27183.1| hypothetical protein OsJ_11120 [Oryza sativa Japonica Group]
gi|215769128|dbj|BAH01357.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 474
Score = 416 bits (1070), Expect = e-114, Method: Compositional matrix adjust.
Identities = 219/361 (60%), Positives = 265/361 (73%), Gaps = 9/361 (2%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + NRSL S ++R+ GSM R LG S+ + ++SD+W L
Sbjct: 53 FEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 105
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E + +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 106 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 165
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE FSIHNLLQAGK+YGLA
Sbjct: 166 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLA 225
Query: 194 AGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R E G + PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 226 AGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVA 285
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T TFPQSLGI+GGKPG STY
Sbjct: 286 AQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETLTFPQSLGILGGKPGTSTY 345
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D+LEA TS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 346 IAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSLAIGFYCRD 405
Query: 372 K 372
K
Sbjct: 406 K 406
>gi|90399070|emb|CAJ86292.1| H0124B04.9 [Oryza sativa Indica Group]
Length = 1216
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 218/394 (55%), Positives = 273/394 (69%), Gaps = 42/394 (10%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + N+S S ++R+V +GSM R LG S+ ++SD+W L
Sbjct: 327 FEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF----LGTSKV---LTSSDVWFL 379
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 380 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 439
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE FSIHNLLQAG +YGLA
Sbjct: 440 SQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 499
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 500 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 559
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 560 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 619
Query: 312 IVGVQEESAIYLDPHDVQ---------------------------------PVINIGKDD 338
I GVQ++ A+YLDPH+VQ ++I D+
Sbjct: 620 IAGVQDDRALYLDPHEVQMSATVIIWLFLQYPFYAWNPFCYGSYSGVFSTSQAVDIAADN 679
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+EADTS+YH +R + LD IDPSLAIGFYCRDK
Sbjct: 680 IEADTSSYHCSTVRDLALDLIDPSLAIGFYCRDK 713
>gi|194696780|gb|ACF82474.1| unknown [Zea mays]
gi|413920008|gb|AFW59940.1| autophagy 4b variant 3 [Zea mays]
Length = 462
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 201/300 (67%), Positives = 241/300 (80%), Gaps = 2/300 (0%)
Query: 81 QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQ 140
++E G + ++G A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQ
Sbjct: 99 EEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQ 158
Query: 141 ALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
AL+FH LGR WRKP +KP+D +Y+ +LHLFGDSE FSIHNLLQAG+ YGLAAGSWVGP
Sbjct: 159 ALIFHHLGRSWRKPSEKPYDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGP 218
Query: 201 YAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
YAMCR+W+ L R R A+ G ++ PMA+YVVSGDEDGERGGAPV CID A++ CS F
Sbjct: 219 YAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNF 278
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
+KGQ W+PILLL+PLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI GVQE+
Sbjct: 279 NKGQCTWSPILLLIPLVLGLDKINPRYIPLLKETFKFPQSLGILGGKPGTSTYIAGVQED 338
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
A+YLDPHDVQ ++I D+LEADTS+YH V+R + L+ IDPSLAIGFYCRDK F
Sbjct: 339 RALYLDPHDVQMAVDIAPDNLEADTSSYHCSVVRDLALEQIDPSLAIGFYCRDKDDFDDF 398
>gi|168010849|ref|XP_001758116.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690572|gb|EDQ76938.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 356
Score = 390 bits (1001), Expect = e-106, Method: Compositional matrix adjust.
Identities = 194/331 (58%), Positives = 257/331 (77%), Gaps = 4/331 (1%)
Query: 46 GSMRRIHERVLGPSRTGISSST-SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR 104
GSMRR+ E +LGP T ++S+ S+IW+LG+C+K++ D + EF DF+SR
Sbjct: 1 GSMRRLQELLLGPRFTAANASSGSEIWVLGLCYKVSADPN-NETLSVQAFEEFISDFTSR 59
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
I I+YRKGF+ +G SK+TSDVGWGCMLRS QML+AQAL+ H LGR WR+ +P + Y+
Sbjct: 60 IWITYRKGFECVGQSKLTSDVGWGCMLRSGQMLLAQALVCHYLGRSWRREPGQPCSQAYL 119
Query: 165 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GC 222
+IL FGDSE+ PFSIHNLL+AG +GLAAGSW+GPYA+CR+ EALAR R ++ G
Sbjct: 120 QILQTFGDSESCPFSIHNLLEAGHPFGLAAGSWLGPYALCRTLEALARADREQSQKKGGK 179
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
++LP A+YVVSG+ +GERGGAPV+C++D + CS + + +WTP+L+LVPLVLGL+KVN
Sbjct: 180 RALPFAVYVVSGEAEGERGGAPVLCVEDVATLCSKWREPTEEWTPLLVLVPLVLGLDKVN 239
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
PRY+P+LR TFTFPQSLGI GGKPGASTY++GVQ+E A+YLDPH+ Q V+ + ++LE D
Sbjct: 240 PRYLPSLRATFTFPQSLGIAGGKPGASTYLIGVQDEQAMYLDPHENQQVVPVTPENLELD 299
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
TS+YH +R + LD+IDPSLAIGFYCRD+
Sbjct: 300 TSSYHCSTVRRLPLDTIDPSLAIGFYCRDRA 330
>gi|168036750|ref|XP_001770869.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677928|gb|EDQ64393.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 346
Score = 366 bits (940), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 182/313 (58%), Positives = 245/313 (78%), Gaps = 5/313 (1%)
Query: 64 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
SSS +IW+LG+C+K++ D A +A + EF DFSSRI I+YRKGF+ +G+SK+TS
Sbjct: 4 SSSGGEIWVLGICYKVSAD-ANDEAVSAHAFEEFLNDFSSRIWITYRKGFESLGESKLTS 62
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
DVGWGCMLRS Q+L+AQAL+ H LGR WR+ + +EY++IL FGDSE+ FSIHNL
Sbjct: 63 DVGWGCMLRSGQILLAQALVCHYLGRTWRRNACQECLQEYLQILQSFGDSESCSFSIHNL 122
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARC---QRAETGLGCQSLPMAIYVVSGDEDGER 240
L+AG+ +GLAAGSW+GPYA+CR+ EALA+ Q A+ G G ++LP A+YVVSG+ +G+R
Sbjct: 123 LEAGRPFGLAAGSWLGPYALCRTLEALAKADEDQNAKKG-GKRALPFAVYVVSGETEGDR 181
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
GGAPV C++DA+ CS + + +W+P+++LVPLVLGL+K+NPRY+P+LR TFT PQSLG
Sbjct: 182 GGAPVRCVEDAAVLCSKWGEATEEWSPLVVLVPLVLGLDKLNPRYLPSLRATFTLPQSLG 241
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
+ GGKPGAST+++GVQ + A+YLDPH+ Q V + ++LE DTS YH V+R + LDSID
Sbjct: 242 VAGGKPGASTHLIGVQGDQAMYLDPHENQQVFAVTPENLELDTSFYHCSVVRRLPLDSID 301
Query: 361 PSLAIGFYCRDKG 373
PSLAIGFYCRD+
Sbjct: 302 PSLAIGFYCRDRA 314
>gi|302783857|ref|XP_002973701.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
gi|300158739|gb|EFJ25361.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
Length = 358
Score = 354 bits (909), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 185/348 (53%), Positives = 245/348 (70%), Gaps = 29/348 (8%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDAA 89
+ V+R V G +RRI E ++G SS S IWLLG C+++ + DE ++
Sbjct: 2 TAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKEST 59
Query: 90 GNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
++ +A+F DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HRL
Sbjct: 60 SSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHRL 119
Query: 148 GRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
GR WR+ ++P+ REY+EILH F DS + PFSIHN ++AG YGLAAGSW+GPYA+C
Sbjct: 120 GRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALCH 178
Query: 206 SWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+ EALAR G G Q +A+YVVSGD GERGGAPV+ D + C
Sbjct: 179 AIEALAR----NDGRGRQGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC--------- 225
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YLD
Sbjct: 226 --PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYLD 283
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
PH+VQ V+++ + LE D+++YH V+R + LD+IDPSLA+GFYCR++
Sbjct: 284 PHEVQKVVSVSGESLEFDSASYHCSVVRKMPLDAIDPSLALGFYCRNR 331
>gi|302787965|ref|XP_002975752.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
gi|300156753|gb|EFJ23381.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
Length = 358
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 184/348 (52%), Positives = 245/348 (70%), Gaps = 29/348 (8%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDAA 89
+ V+R V G +RRI E ++G SS S IWLLG C+++ + DE ++
Sbjct: 2 TAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKEST 59
Query: 90 GNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
++ +A+F DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HRL
Sbjct: 60 SSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHRL 119
Query: 148 GRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
GR WR+ ++P+ REY+EILH F DS + PFSIHN ++AG YGLAAGSW+GPYA+C
Sbjct: 120 GRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALCH 178
Query: 206 SWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+ EALAR G G + +A+YVVSGD GERGGAPV+ D + C
Sbjct: 179 AIEALAR----NDGRGREGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC--------- 225
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YLD
Sbjct: 226 --PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYLD 283
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
PH+VQ V+++ + LE D+++YH V+R + LD+IDPSLA+GFYCR++
Sbjct: 284 PHEVQKVVSVSGESLEFDSASYHCSVVRKMLLDAIDPSLALGFYCRNR 331
>gi|186511209|ref|NP_001118859.1| cysteine protease ATG4b [Arabidopsis thaliana]
gi|62318602|dbj|BAD95023.1| hypothetical protein [Arabidopsis thaliana]
gi|332646469|gb|AEE79990.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 267
Score = 314 bits (804), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 156/244 (63%), Positives = 198/244 (81%)
Query: 43 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QDFS
Sbjct: 1 MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 60
Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
S IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D +
Sbjct: 61 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 120
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 121 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 180
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 181 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 240
Query: 283 PRYI 286
PR++
Sbjct: 241 PRFV 244
>gi|79597805|ref|NP_850722.3| cysteine protease ATG4b [Arabidopsis thaliana]
gi|332646467|gb|AEE79988.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 360
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 155/244 (63%), Positives = 196/244 (80%)
Query: 43 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QDFS
Sbjct: 87 MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 146
Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
S IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D +
Sbjct: 147 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 206
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 207 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 266
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 267 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 326
Query: 283 PRYI 286
P +
Sbjct: 327 PSHF 330
>gi|413917967|gb|AFW57899.1| hypothetical protein ZEAMMB73_419246 [Zea mays]
Length = 290
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 143/209 (68%), Positives = 172/209 (82%), Gaps = 2/209 (0%)
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
S G GCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +L LFGDSE FSIHN
Sbjct: 14 SLTGKGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLRLFGDSEACAFSIHN 73
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGER 240
LLQA + YGLAAGSW+GPYAMCR+W+ L R R A+ G ++ PMA+YVVSGDEDGER
Sbjct: 74 LLQARRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGER 133
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
GGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLG
Sbjct: 134 GGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLG 193
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
I+GGKPG STYI GVQ++ A+YLDPH+VQ
Sbjct: 194 ILGGKPGTSTYIAGVQDDRALYLDPHEVQ 222
>gi|414869447|tpg|DAA48004.1| TPA: hypothetical protein ZEAMMB73_510335 [Zea mays]
gi|414869466|tpg|DAA48023.1| TPA: hypothetical protein ZEAMMB73_786179 [Zea mays]
Length = 472
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 142/205 (69%), Positives = 168/205 (81%), Gaps = 2/205 (0%)
Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 172
FD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR RKP +KP++ +Y+ +LHLFGD
Sbjct: 34 FDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSCRKPPEKPYNPDYIGVLHLFGD 93
Query: 173 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIY 230
SE FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L R A+ G ++ PMA+Y
Sbjct: 94 SEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIHTNREQADAVDGKENFPMALY 153
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
VVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+
Sbjct: 154 VVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLK 213
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGV 315
TF FPQSL I+GGKPG STYI GV
Sbjct: 214 ETFMFPQSLCILGGKPGTSTYIAGV 238
>gi|353441084|gb|AEQ94126.1| putative cysteine protease [Elaeis guineensis]
Length = 169
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 113/165 (68%), Positives = 130/165 (78%)
Query: 53 ERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKG 112
+ +LG S T SSTSDIWLLG C+K++ +E+ G NG A F +DFSSRI I+YRKG
Sbjct: 2 QELLGTSSTDALSSTSDIWLLGKCYKLSPEESSGGTDHGNGSAAFLEDFSSRIWITYRKG 61
Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 172
FD IGDSK TSDV WGCM+RSSQMLVAQALLFH LGR WRKP QKP D +Y+EILHLFGD
Sbjct: 62 FDAIGDSKFTSDVRWGCMIRSSQMLVAQALLFHHLGRSWRKPSQKPHDSKYIEILHLFGD 121
Query: 173 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
SE FSIHNLL+AGKAYGLAA WVGPYAMCR+WE + R +R +
Sbjct: 122 SEACAFSIHNLLEAGKAYGLAAREWVGPYAMCRTWETITRAKREQ 166
>gi|413941968|gb|AFW74617.1| hypothetical protein ZEAMMB73_836919 [Zea mays]
Length = 416
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 134/263 (50%), Positives = 166/263 (63%), Gaps = 56/263 (21%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
L F +DFSSRI I+YRKGFD I D K+TSDV WGCM+RSSQMLVAQAL+FH LGR WRK
Sbjct: 29 LQVFLEDFSSRIWITYRKGFDAISDFKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRK 88
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P +K L++ +
Sbjct: 89 PPEK------------------------TLIRTNR------------------------- 99
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
++A+ G ++ PM +YVVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVP
Sbjct: 100 EQADAVDGKENFPMELYVVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVP 159
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI- 332
LVLGL+K+NPRYIP L+ TF FPQSLGI+G KPG STYI GVQ++ A+YLDPH+VQ V+
Sbjct: 160 LVLGLDKINPRYIPLLKETFMFPQSLGILGVKPGTSTYIAGVQDDRALYLDPHEVQMVLA 219
Query: 333 NIGKDDLEADTSTYHSDVIRHIH 355
NI + T +D I +IH
Sbjct: 220 NIKWPE------TLETDFIYNIH 236
>gi|457866467|dbj|BAM93578.1| autophagy related protein 4 [Vigna unguiculata]
Length = 219
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 117/153 (76%), Positives = 129/153 (84%), Gaps = 1/153 (0%)
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 286
MAIYVVSGDEDGERGGAPVVCI+DA +HCS FS+GQA WTP+LLLVPLVLGL+KVNPRYI
Sbjct: 1 MAIYVVSGDEDGERGGAPVVCIEDAFKHCSEFSRGQAAWTPLLLLVPLVLGLDKVNPRYI 60
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TST 345
P L TF FPQSLGI+GGKPGASTYI+GVQ E A YLDPHDVQ V+NI D E + TS+
Sbjct: 61 PLLHSTFKFPQSLGIMGGKPGASTYIIGVQSEKAFYLDPHDVQTVVNISGDTQEPNSTSS 120
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
YH +V+RHI LDSIDPSLAIGFYCRDK F
Sbjct: 121 YHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDF 153
>gi|145345840|ref|XP_001417407.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577634|gb|ABO95700.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 348
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 170/303 (56%), Gaps = 19/303 (6%)
Query: 72 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 131
+LGV + DE + ++ + +D+ SR ++YR+GF+ +G +K +D GWGC L
Sbjct: 1 MLGVTYWSKDDECNAEKY-DDARRAWERDWGSRCWMTYRRGFEALGRTKWRTDAGWGCTL 59
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAY 190
RS+QM+VA AL H GR WR+ ++ D E V+ +L +F D ++PFSIH++ + A+
Sbjct: 60 RSAQMMVANALSIHTRGRHWRRQVKAKEDDESVDHVLSMFIDDASAPFSIHSVCETTTAW 119
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG-DEDGERGGAPVVCID 249
G G W P MCR++ AL G +A++VV G +ED GG P ID
Sbjct: 120 GAPPGRWFEPSVMCRAFSALIEAN------GDLRNQIAVHVVGGQNEDDSAGGVPT--ID 171
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
D G+A +LL VPLVLG+ +N RYI LR F QS+G++GG+P A
Sbjct: 172 DGELRAKSADVGKA----LLLFVPLVLGVGRNINTRYISQLRSIIAFKQSIGVIGGRPNA 227
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
S Y+VG ++ YLDPH VQP + + D +Y+ + + +DP+LA+GFY
Sbjct: 228 SLYLVGHSDDVFFYLDPHTVQPANSFAE---AVDFDSYYCSTPLQMRGELLDPTLALGFY 284
Query: 369 CRD 371
CRD
Sbjct: 285 CRD 287
>gi|384253649|gb|EIE27123.1| peptidase C54 [Coccomyxa subellipsoidea C-169]
Length = 362
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 118/289 (40%), Positives = 161/289 (55%), Gaps = 39/289 (13%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D SRI ++YR+GF PI S ITSDVGWGC LRS QML+AQAL++H +GR WR+ L+ +
Sbjct: 23 DLMSRIWMTYRRGFPPICGSGITSDVGWGCTLRSGQMLLAQALVYHLVGRQWRRKLEAAY 82
Query: 160 DREYVEILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
E ++L FGD E PFSIHN+ G+ +G+ AG W+GP +C + + +
Sbjct: 83 PEEVAQVLQWFGDQACEQRPFSIHNMCTTGQTHGVKAGDWLGPSGLCHTLADMVN-KVQP 141
Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG------QADWTPILLL 271
GL C+ + G GGAPV+C SR + F G + +
Sbjct: 142 GGLQCR-----VVATFG------GGAPVLC---TSRLATAFEGGADRSGGEVGSSGSEES 187
Query: 272 VPLVLGLE-----------KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
P GL K+NPRY L+ T+PQS+GIVGG+P +S Y +G+Q++
Sbjct: 188 GPAGQGLLLLIPLMLGLNGKINPRYCAQLQQLLTWPQSVGIVGGRPSSSLYFIGLQDQHV 247
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+YLDPH+VQ V + AD TY +R + L +IDPSLAIGFYC
Sbjct: 248 LYLDPHEVQEVASEA-----ADLDTYFCSSLRLMPLANIDPSLAIGFYC 291
>gi|156396522|ref|XP_001637442.1| predicted protein [Nematostella vectensis]
gi|156224554|gb|EDO45379.1| predicted protein [Nematostella vectensis]
Length = 342
Score = 200 bits (509), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 116/327 (35%), Positives = 168/327 (51%), Gaps = 32/327 (9%)
Query: 58 PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN----GLAEFNQDFSSRILISYRKGF 113
P +T + S IWLLG C+ E + + L EF++ F+S I ++YR+ F
Sbjct: 12 PLKTNFNED-SPIWLLGRCYHAKNYEYTSEQSKQQCQILSLEEFHRHFTSLIWLTYRRSF 70
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---KPLQKPFDREYVEILHLF 170
+ S +TSD GWGCMLRS QM++A L+FH L + WR + + + Y IL F
Sbjct: 71 VQLNGSNLTSDCGWGCMLRSGQMMLASGLIFHFLKKDWRISGRCHSREQEHYYRVILQFF 130
Query: 171 GDS---ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
GD E SPFS+H L+ G+ G AG W GP ++ E +++
Sbjct: 131 GDQDDEERSPFSLHRLVTLGQHTGKQAGDWYGPASVAHILE--------------KAMIS 176
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVN 282
A + + D + V ID+ R C+ Q D W P+++LVP+ LG E +N
Sbjct: 177 ATHPLLHDINIYVAQDCTVYIDEVKRVCTHCRTHQRDCSSGKWRPVIILVPMRLGGEALN 236
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
P YIP ++ FT Q +GI+GG+P S Y VG Q+E I+LDPH QPV++ ++
Sbjct: 237 PIYIPCVKSLFTLDQCIGIIGGRPKHSLYFVGFQDEKMIHLDPHYCQPVVDTTQEKFP-- 294
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYC 369
T ++H R +DPS IGFYC
Sbjct: 295 TESFHCPNPRKTSFKKMDPSCTIGFYC 321
>gi|307174864|gb|EFN65142.1| Cysteine protease ATG4D [Camponotus floridanus]
Length = 477
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 115/342 (33%), Positives = 165/342 (48%), Gaps = 44/342 (12%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
S S +WLLG C+ ++ L A+ N + EF +DF SR
Sbjct: 86 SKESPVWLLGQCYLKKSEDPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFVSR 145
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF----- 159
I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR ++P
Sbjct: 146 IWLTYRREFQILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQPIETLQQ 205
Query: 160 ---DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
DR + I+ FGD SPFSIH L+ G + G AG W GP ++ C
Sbjct: 206 RLDDRNHRMIIKWFGDQSESPFSIHRLVLLGASAGKRAGDWYGPSSVAHLLSQAVECASK 265
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
++ L A+YV V + D C W ++LLVPL L
Sbjct: 266 QSNSNFDHL--AVYVAQD---------CAVYLQDVENICRT---PDGKWKALVLLVPLRL 311
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G +K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q +++ K
Sbjct: 312 GADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDVWK 371
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+D +++H R + L +DPS +GFY +K L F
Sbjct: 372 NDFSL--TSFHCTSPRKMLLSKMDPSCCVGFYFPNKEALTDF 411
>gi|189233733|ref|XP_971091.2| PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
gi|270015047|gb|EFA11495.1| hypothetical protein TcasGA2_TC014208 [Tribolium castaneum]
Length = 453
Score = 194 bits (492), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 123/336 (36%), Positives = 170/336 (50%), Gaps = 43/336 (12%)
Query: 55 VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 114
+LG I S +SD LG Q ++ ++ + G F +DF SR+ ++YR+ F
Sbjct: 70 LLGKCYRRIESPSSDSTELGTDVAAFQSQSEIASSDDEGFEGFKKDFISRLWLTYRREFP 129
Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDRE-YVE------I 166
+ S +SD GWGCMLRS QML+AQAL+ H LGR WR +P +P RE ++E I
Sbjct: 130 ILNGSNYSSDCGWGCMLRSGQMLIAQALVCHILGRDWRWQPDHQPTTRESFIEVVNHRKI 189
Query: 167 LHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 224
+ FGD S SPFSIH L+ G+A G AG W GP G
Sbjct: 190 IKWFGDKPSRNSPFSIHTLVALGEASGKKAGDWYGP------------------GFVAHL 231
Query: 225 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG--------QADWTPILLLVPLVL 276
A S ED + VC+ ++ C+V+ K W ++LL+P+ L
Sbjct: 232 FRQAFKRAS--EDNYEFDSLTVCV---AQDCAVYIKDVMEECTDKNGKWKSLILLIPVRL 286
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G EK N Y P L F+ Q +GI+GG+P S Y VG Q++ I+LDPH Q V+++
Sbjct: 287 GAEKFNSIYAPCLTTLFSLKQCIGIIGGRPKHSLYFVGYQDDKLIHLDPHYCQEVVDVWA 346
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
D +++H R IHL +DPS IGFYC K
Sbjct: 347 VDFP--LTSFHCRSPRKIHLSKMDPSCCIGFYCPTK 380
>gi|443684303|gb|ELT88258.1| hypothetical protein CAPTEDRAFT_225251 [Capitella teleta]
Length = 410
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 113/326 (34%), Positives = 172/326 (52%), Gaps = 34/326 (10%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
+ S +W+LG + + D LAE +D SR+ ++YRKGFDPIG S TSD
Sbjct: 30 TESPVWILGKQYSVLYD-----------LAELKKDVKSRLWLTYRKGFDPIGGSGPTSDQ 78
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM++AQ+L+ LGR WR K +D +Y EIL +F D ++ +S+ +
Sbjct: 79 GWGCMLRCGQMMLAQSLICRHLGRDWRWTKDK-YDPKYFEILRMFQDKRSAKYSLQVIAS 137
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD------EDGE 239
G + G A G W GP + + L C E + + V+ D +
Sbjct: 138 MGTSEGKAIGEWFGPNTISQVLRKL--CVSDEWSNLVVHVALDNTVIIDDVFCLCKSSKK 195
Query: 240 RGGAPVVCIDDASRHCSVFS-----------KGQAD-WTPILLLVPLVLGLEKVNPRYIP 287
P+ + A +F+ G+ D W P+LL+VPL LGL ++NP YIP
Sbjct: 196 ESNEPIPGVHAACASALLFNGHDPTAEGHDPSGEDDSWRPLLLIVPLRLGLSEINPVYIP 255
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 347
L+ TF QS+GI+GGKP + + +G E+ +Y+DPH QP +++ + E+D S YH
Sbjct: 256 FLKTCLTFKQSVGIIGGKPNHAHWFIGFLEDELVYMDPHTTQPFVDVTQPG-ESDAS-YH 313
Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKG 373
+ + +DPS+A+GF+C+ +
Sbjct: 314 CSYSCRMPVSYLDPSVAVGFFCQTEA 339
>gi|328707620|ref|XP_001947296.2| PREDICTED: cysteine protease ATG4B-like isoform 1 [Acyrthosiphon pisum]
gi|328707622|ref|XP_003243448.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Acyrthosiphon pisum]
gi|328707624|ref|XP_003243449.1| PREDICTED: cysteine protease ATG4B-like isoform 3 [Acyrthosiphon pisum]
gi|328707626|ref|XP_003243450.1| PREDICTED: cysteine protease ATG4B-like isoform 4 [Acyrthosiphon pisum]
Length = 402
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 103/317 (32%), Positives = 165/317 (52%), Gaps = 35/317 (11%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + +W+LG + D L + D SR+ +YRKGF IG++ T
Sbjct: 40 IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCMLR QM++ QAL+F LGR WR K D +Y++IL +F D ++P+SIH
Sbjct: 89 SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ G ++G G W GP + + + LA L ++ V+ D
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLATMDE---------LSSLVFHVALDN------ 192
Query: 243 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
+ I++ + C+V + + W P++L++PL LG+ +NP Y+ +++ FTFPQSL
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKMCFTFPQSL 250
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVIRHIHL 356
G++GG+P + Y +G I+LDPH Q + + D+E + +YH I + +
Sbjct: 251 GVIGGRPNHALYFIGFVGNDVIFLDPHTTQQIGMLPNKDIETEHKIDHSYHCQQINRLPI 310
Query: 357 DSIDPSLAIGFYCRDKG 373
++DPSLA F C+ +
Sbjct: 311 LNMDPSLAACFMCQTEN 327
>gi|281340990|gb|EFB16574.1| hypothetical protein PANDA_012287 [Ailuropoda melanoleuca]
Length = 369
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 115/326 (35%), Positives = 165/326 (50%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F PIG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 65
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 125
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 239
Q G G + G W GP + + + LA +A+++ + ED
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 177
Query: 240 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
R G P D+SRHC+ F G A W P++LL+PL LGL +N Y+
Sbjct: 178 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 237
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + AD S +
Sbjct: 238 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 297
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 298 CRHPPSRMSIGELDPSIAVGFFCKTE 323
>gi|301775535|ref|XP_002923195.1| PREDICTED: cysteine protease ATG4B-like [Ailuropoda melanoleuca]
Length = 405
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 164/324 (50%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F PIG + TS
Sbjct: 34 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 80
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 81 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 140
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 239
Q G G + G W GP + + + LA +A+++ + ED
Sbjct: 141 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 192
Query: 240 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
R G P D+SRHC+ F G A W P++LL+PL LGL +N Y+
Sbjct: 193 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 252
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + AD S +
Sbjct: 253 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 312
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 313 CRHPPSRMSIGELDPSIAVGFFCK 336
>gi|308802424|ref|XP_003078525.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
gi|116056978|emb|CAL51405.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
Length = 424
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 178/350 (50%), Gaps = 61/350 (17%)
Query: 72 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 131
+ GV H ++ + G+ + G E+ +D+ SR ++YR+GF+ +G +K +D GWGC L
Sbjct: 42 MFGVTH-WDRETSSGERSNEVGRREWERDWRSRCWMTYRRGFEALGRTKWCTDAGWGCTL 100
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDRE----------------------------- 162
RS+QM++A AL H GR WR+ +Q E
Sbjct: 101 RSAQMMLANALSIHSRGRHWRREVQLVAVHENETADDGSKSPAVSFLSGVVNKLKIPQSE 160
Query: 163 --------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
+IL LF D +PFSIH + + +G G W P MCR++EAL
Sbjct: 161 RTRAGSDAQEDILRLFADEVGAPFSIHRVCEKTTEWGAPPGRWFEPSVMCRAFEALV--- 217
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 274
AE LG + + ++VVSG E GE GG P V D+A G+A +LL VP+
Sbjct: 218 -AEHDLGSE---LTVHVVSGRE-GEDGGVPTV--DEAEVRAKSADVGKA----LLLFVPV 266
Query: 275 VLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
VLG+ + +N RY+ LR F QS+GIVGG+P +S Y+VG ++ YLDPH VQ +
Sbjct: 267 VLGVGRTINARYLSQLRSMMAFKQSVGIVGGRPNSSLYLVGHSDDVFFYLDPHTVQVASS 326
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD----KGLLVTFE 379
+ D E +Y+ H+ +DP+LA+GFYCRD LLV E
Sbjct: 327 MVTMDFE----SYYCPTPLHVCGGDLDPTLALGFYCRDGDDVASLLVDIE 372
>gi|427787309|gb|JAA59106.1| Putative peptidase family c54 [Rhipicephalus pulchellus]
Length = 517
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 111/289 (38%), Positives = 157/289 (54%), Gaps = 22/289 (7%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---K 153
F +DFSSR+ +YR+ F PI + ITSD GWGCMLRSSQM++AQA++ H LGR WR
Sbjct: 181 FLEDFSSRLWFTYRREFPPIPGTDITSDCGWGCMLRSSQMMLAQAVVTHVLGRQWRYRRN 240
Query: 154 PLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EAL 210
+ D + +++ LFGD + SPFS+H L+Q G G AG W GP + EAL
Sbjct: 241 NQTEASDYVHRQVVRLFGDRTASASPFSLHKLVQMGHESGKQAGDWYGPSSAAYILKEAL 300
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS-VFSKGQADWTPIL 269
+ E L L + IYV + ++D C S G W ++
Sbjct: 301 EGACQTEQLL----LDLRIYVAQD---------CTIYLEDVRALCRGTRSNGAPLWRSVI 347
Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
+LVP+ LG E++NP YIP ++ + P +G++GG+P S Y +G Q E IYLDPH VQ
Sbjct: 348 ILVPVRLGGEQLNPTYIPCVKGMLSHPNCIGVIGGRPRHSLYFLGWQGEKVIYLDPHYVQ 407
Query: 330 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+++G D D +YH R + +DPS +GFYC+ + F
Sbjct: 408 EAVDVGPQDFPLD--SYHCSWPRKMSFYKMDPSCTMGFYCKTEDEFEHF 454
>gi|432853687|ref|XP_004067831.1| PREDICTED: cysteine protease ATG4B-like [Oryzias latipes]
Length = 390
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 105/280 (37%), Positives = 148/280 (52%), Gaps = 12/280 (4%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D +SR+ +YRK F PIG + TSD GWGCMLR QM++A+AL+ LGR WR +
Sbjct: 45 DVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILAEALMCRHLGRDWRWARGRRQ 104
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
EYV IL+ F D + S +SIH + Q G G G W GP A+ +W L
Sbjct: 105 REEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRL 164
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
A + + + + D E G C++ A C++ + A W P++L
Sbjct: 165 AVHVAMDNTVIIEEIKRLCMPWLDIGDREEAGELNGCLEGA---CALVEEETALWKPLVL 221
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
L+PL LGL +N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP
Sbjct: 222 LIPLRLGLSDINEAYIDTLKQCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQP 281
Query: 331 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
+ +D D + + +H+ +DPS+A GF+CR
Sbjct: 282 AVEPSEDGQVPDETYHCQHPPCRMHICELDPSIAAGFFCR 321
>gi|66773074|ref|NP_001019605.1| cysteine protease ATG4A [Danio rerio]
gi|66267494|gb|AAH95617.1| Zgc:111958 [Danio rerio]
Length = 375
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/306 (33%), Positives = 157/306 (51%), Gaps = 33/306 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG C+ + ++ E D SR+ +YRK F PIG + +SD GWGC
Sbjct: 26 VWILGACYNVKTKKS-----------ELLSDVRSRLWFTYRKKFSPIGGTGPSSDAGWGC 74
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR WR +K +EY IL F D + S +SIH + Q G
Sbjct: 75 MLRCGQMILAQALICSHLGRDWRWDPEKHQPKEYQRILDCFLDKKDSCYSIHQMAQMGVG 134
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +++YV + V I+
Sbjct: 135 EGKSVGEWYGPNTVAQVLKKLALFDDWNS--------LSVYVSMDN---------TVVIE 177
Query: 250 DASRHC-----SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
D + C + S+ DW P+LL++PL +G+ +NP YI L+ F PQS G++GG
Sbjct: 178 DIKKLCVRADLQLQSQQPLDWRPLLLVIPLRMGINSINPVYIQALKECFKMPQSCGVLGG 237
Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
KP + Y +G ++ IYLDPH Q ++ D S + + + S+DPS+A
Sbjct: 238 KPNLAYYFIGFIDDELIYLDPHTTQQAVDTESGSAVDDQSFHCQRTPHRMKITSLDPSVA 297
Query: 365 IGFYCR 370
+GF+C+
Sbjct: 298 LGFFCK 303
>gi|291226947|ref|XP_002733451.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus kowalevskii]
Length = 356
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 157/301 (52%), Gaps = 13/301 (4%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + +D + E D SRI I+YRK F IG + TSD GWGC
Sbjct: 26 VWILGKAYHLIRDRS-----------ELLADIKSRIWITYRKNFSAIGGTGPTSDNGWGC 74
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQALL LGR WR ++ + Y +IL LF D + S +SIH + Q G
Sbjct: 75 MLRCGQMILAQALLCKHLGREWRWESREHQNETYCKILKLFLDRKDSCYSIHQIAQMGVG 134
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + L + S+ I VV R C
Sbjct: 135 EGKSIGQWFGPNTVAQVLRKLTLFDDWSSIAVHISMDNTI-VVEDIRKLCRTPLFTECAS 193
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
+ S+ + G W P++L +PL LGL ++NP Y+ L+ FT QSLG++GGKP +
Sbjct: 194 PKAASASLENGGTTYWKPLVLFIPLRLGLTEINPLYLDVLKKCFTLKQSLGMIGGKPNHA 253
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Y +G ++ +YLDPH QPV++I K D TYH +++ +DPS+A+GF+C
Sbjct: 254 HYFIGFYGKTLVYLDPHTTQPVVDINKWASIPD-DTYHCKHPSRMNIMHLDPSIALGFFC 312
Query: 370 R 370
Sbjct: 313 H 313
>gi|73994337|ref|XP_851977.1| PREDICTED: cysteine protease ATG4B [Canis lupus familiaris]
Length = 394
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 159/320 (49%), Gaps = 26/320 (8%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 129
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V+ RG
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 188
Query: 244 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P D+SRHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 189 PCAGAAALPADSSRHCNGFPAGAEVTNRLAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 248
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 249 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFIPDESFHCQHPPSR 308
Query: 354 IHLDSIDPSLAIGFYCRDKG 373
+ + +DPS+A+GF+C+ +G
Sbjct: 309 MSIGELDPSIAVGFFCKTEG 328
>gi|355669955|gb|AER94692.1| ATG4 autophagy related 4-like protein B [Mustela putorius furo]
Length = 390
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 112/317 (35%), Positives = 158/317 (49%), Gaps = 26/317 (8%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 65
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQSDSYFNVLNAFIDRKDSYYSIHQI 125
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V+ RG
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 184
Query: 244 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P D+SRHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 185 PCAGATALPTDSSRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 244
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 245 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCRHPPSR 304
Query: 354 IHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 305 MGISELDPSIAVGFFCK 321
>gi|332026942|gb|EGI67039.1| Cysteine protease ATG4D [Acromyrmex echinatior]
Length = 392
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 115/341 (33%), Positives = 169/341 (49%), Gaps = 46/341 (13%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
S S +WLLG+C+ + L A+ N + EF +DF SR
Sbjct: 6 SKESPVWLLGLCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 65
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREY 163
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P Q + +
Sbjct: 66 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQSTDESSH 125
Query: 164 VEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRAE 217
I+ FGD T SPFSIH L+ G + G AG W GP + +C++ E RA
Sbjct: 126 RMIIKWFGDQPTPESPFSIHKLVSLGASTGKRAGDWYGPSSVAHLLCQAME------RAS 179
Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 277
+ +A+YV + V C D R ++LLVPL LG
Sbjct: 180 EDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGGR------------KALILLVPLRLG 227
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
+K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q +++ +
Sbjct: 228 ADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDVEGN 287
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+ + +++H R + L +DPS +GFY DK L F
Sbjct: 288 E-KFPLTSFHCTSPRKMLLSKMDPSCCVGFYFPDKESLTDF 327
>gi|260795879|ref|XP_002592932.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
gi|229278156|gb|EEN48943.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
Length = 380
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 164/320 (51%), Gaps = 38/320 (11%)
Query: 71 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
W+LGV + +D E D SSR+ +YRK F PIG + SD GWGCM
Sbjct: 32 WILGVGYNTVKDRQ-----------ELQNDISSRLWFTYRKNFTPIGGTGPMSDQGWGCM 80
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
LR QM++ QAL+ LGR WR +D +Y +IL LF D + S +SIH + Q G +
Sbjct: 81 LRCGQMMLGQALICRHLGRDWRWK-SAVYDNDYTKILQLFLDKKDSCYSIHQIAQMGVSE 139
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE----------DGER 240
G + G W GP + + + LA + + +AI+V + R
Sbjct: 140 GKSVGQWFGPNTVAQVLKKLALFEDWSS--------LAIHVAMDNTVIIDDIKKLCRSAR 191
Query: 241 GGAP------VVCIDDASRHCSVFSKGQA-DWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P +C ++ S S+ A W P++L++PL LGL ++NP Y L+ F
Sbjct: 192 QPTPSQVTNSFLCNGVSAEQTSARSRSPALPWQPLMLIIPLRLGLSELNPVYTDCLKACF 251
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
T QSLG++GGKP + Y +G S +YLDPH QP + + + ++ S++H
Sbjct: 252 TLRQSLGMIGGKPNHAHYFIGYVGNSLVYLDPHTTQPAVEL-EGNVPIPDSSFHCTHPSR 310
Query: 354 IHLDSIDPSLAIGFYCRDKG 373
+++ +DPS+A+GF+C+D+
Sbjct: 311 MNIQDLDPSIALGFFCQDEA 330
>gi|195113543|ref|XP_002001327.1| GI10728 [Drosophila mojavensis]
gi|193917921|gb|EDW16788.1| GI10728 [Drosophila mojavensis]
Length = 682
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 113/314 (35%), Positives = 160/314 (50%), Gaps = 17/314 (5%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + L ++ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 262 AAENQLAESPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 321
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S+ SPFSIH L++ G+ G
Sbjct: 322 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 381
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDE---DGERGGAP 244
G W GP ++ + AL R S+ +A IY+ +E E P
Sbjct: 382 KPGDWYGPASVSYLLKHALEHAARENADFDNISVYVAKDCTIYIQDIEELCSIPEPAPKP 441
Query: 245 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
V A R S K W +++L+PL LG +K+NP Y L+L + LGI+GG
Sbjct: 442 HVPWQQAKRSTSDAPKPDQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEYCLGIIGG 501
Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
KP S Y VG QE+ I+LDPH Q ++++ ++ ++H R + +DPS
Sbjct: 502 KPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETFP--MHSFHCKSPRKLKSSKMDPSCC 559
Query: 365 IGFYCRDKGLLVTF 378
IGFYC K +F
Sbjct: 560 IGFYCPTKTDFDSF 573
>gi|410969807|ref|XP_003991383.1| PREDICTED: cysteine protease ATG4B [Felis catus]
Length = 445
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 111/319 (34%), Positives = 157/319 (49%), Gaps = 26/319 (8%)
Query: 66 STSDIWLLGVCHKIA--QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I+ +DE L D A SR+ +YRK F IG + TS
Sbjct: 74 TSEPVWILGRKYSISTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 120
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 121 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 180
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V+ R G
Sbjct: 181 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMEDIRRLCRAGL 239
Query: 244 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P D RHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 240 PCAGAAALPADPGRHCNGFPAGAEVSNRLAPWRPLVLLIPLRLGLTDINEAYVETLKHCF 299
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 300 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFADSCFIPDESFHCQHPPSR 359
Query: 354 IHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 360 MGVRELDPSIAVGFFCQTE 378
>gi|74136555|ref|NP_777364.3| cysteine protease ATG4A [Mus musculus]
gi|61211821|sp|Q8C9S8.2|ATG4A_MOUSE RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2 cysteine endopeptidase; AltName: Full=Autophagin-2; AltName: Full=Autophagy-related cysteine endopeptidase 2; AltName: Full=Autophagy-related protein 4 homolog A
gi|59809037|gb|AAH89500.1| Atg4a protein [Mus musculus]
gi|74193939|dbj|BAE36898.1| unnamed protein product [Mus musculus]
Length = 396
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 103/324 (31%), Positives = 161/324 (49%), Gaps = 49/324 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D + +
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + ++DPS+A+GF+C+++
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEE 324
>gi|405953478|gb|EKC21133.1| Leucine-rich repeat-containing protein 6 [Crassostrea gigas]
Length = 1114
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 113/334 (33%), Positives = 168/334 (50%), Gaps = 29/334 (8%)
Query: 68 SDIWLLGVCHKIAQDEALGDAAGNN-------GLAEFNQDFSSRILISYRKGFDPIGDSK 120
S +WLLG + I + + D + +F QDFSS + +YR+ F I +K
Sbjct: 226 SPVWLLGKFYHIKPSDLIDDDIQRGKRTRVVPNIEKFKQDFSSLLWFTYRQDFPAIPGTK 285
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV--EILHLFGD--SETS 176
+TSD GWGCMLRS QM++A+AL H LG W + ++E +I+ FGD + S
Sbjct: 286 LTSDCGWGCMLRSGQMMLAKALTLHYLGPEWNVFSDQTREQETYRKQIIRWFGDYLCDES 345
Query: 177 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGD 235
PFS+H L++ GK G G W GP ++ E + + Q+ +T L + +YV
Sbjct: 346 PFSMHRLVEVGKNLGKQPGEWFGPASVAHILKETMVKGQKTQTVLS----DLCVYVSQDC 401
Query: 236 EDGERGGAPVVCI----------DDASRHCSVFSKGQADWT-PILLLVPLVLGLEKVNPR 284
++ + C S H S DW +++L+P+ LG E++NP
Sbjct: 402 TVYKQDIYELCCTRPRADTKFTNSTESEHESSQDASSMDWKRAVVILIPVRLGGEQLNPV 461
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
YIP ++ + +GI+GGKP S Y VG QE+ IYLDPH Q V++ +
Sbjct: 462 YIPCVKGLLSQDSCIGIIGGKPKHSLYFVGWQEDKLIYLDPHYCQDVVDTRERHFP--IQ 519
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+YH R + +D IDPS IGFYCR++ F
Sbjct: 520 SYHCMSPRKVSIDKIDPSCTIGFYCRNQKEFEKF 553
>gi|328874598|gb|EGG22963.1| hypothetical protein DFA_05093 [Dictyostelium fasciculatum]
Length = 432
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 100/292 (34%), Positives = 157/292 (53%), Gaps = 10/292 (3%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR-LGRPWR 152
+ EF +DFS+++ SYR+GF+ IGDS +D GWGCMLRS QML+A LL + +G+ W+
Sbjct: 88 IEEFLEDFSNKLWCSYRQGFECIGDSLFENDCGWGCMLRSGQMLLANVLLLNSPIGKDWK 147
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALA 211
KP + ++ +++ LF D ++PFSIHN+ G+ + G + G W P + + AL
Sbjct: 148 KPQNGEYPEDFYKVVRLFLDRPSAPFSIHNIALHGRNHLGKSIGEWFAPSNISNAIRALV 207
Query: 212 -RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC----SVFSKGQADWT 266
+ G + + + V DD S + + + W
Sbjct: 208 YKYDNHLNGTSEEDSSDEEKEGKKKKGDNQCNLSVYVSDDGSLYIDQLLEIALRSDGSWM 267
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+L+L+P LG++ +N Y L +TFPQ+LGIVGGKP AS Y + Q+++ YLDPH
Sbjct: 268 PLLILIPTKLGIDTINEIYYRPLLDIYTFPQNLGIVGGKPRASLYFIASQDDNLFYLDPH 327
Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
VQ I + D + S+Y ++ + ++ +DPSL I F+C K + F
Sbjct: 328 TVQNSI---ESDSDFSLSSYFCNIPKKANISEVDPSLVIPFFCSTKESFLDF 376
>gi|149711769|ref|XP_001497815.1| PREDICTED: cysteine protease ATG4B [Equus caballus]
Length = 393
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 115/330 (34%), Positives = 160/330 (48%), Gaps = 52/330 (15%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 226
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 188
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEK 280
A G A D+ RHC+ F G A W P++LL+PL LGL
Sbjct: 189 CA------------GAAAFPA--DSDRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTD 234
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFI 294
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
D S + + + +DPS+A+GF+C+
Sbjct: 295 PDESFHCQHPPSRMSIGELDPSIAVGFFCK 324
>gi|27883848|ref|NP_777363.1| cysteine protease ATG4B [Mus musculus]
gi|26324650|dbj|BAC26079.1| unnamed protein product [Mus musculus]
gi|26327423|dbj|BAC27455.1| unnamed protein product [Mus musculus]
gi|26344632|dbj|BAC35965.1| unnamed protein product [Mus musculus]
gi|27763983|emb|CAD43220.1| autophagin-1 [Mus musculus]
Length = 393
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 109/325 (33%), Positives = 163/325 (50%), Gaps = 40/325 (12%)
Query: 65 SSTSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
++ +W+LG + I +DE L D A SR+ +YR+ F IG + T
Sbjct: 21 ETSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPT 67
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH
Sbjct: 68 SDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQ 127
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDED 237
+ Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 128 IAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEI 179
Query: 238 GERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
A + C+ D+ RHC+ F G + W P++LL+PL LGL +N Y
Sbjct: 180 RRLCRANLPCVGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAY 239
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S
Sbjct: 240 VETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESF 299
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCR 370
+ + + +DPS+A+GF+C+
Sbjct: 300 HCQHPPSRMGIGELDPSIAVGFFCK 324
>gi|449676306|ref|XP_002158689.2| PREDICTED: cysteine protease ATG4C-like [Hydra magnipapillata]
Length = 442
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 162/318 (50%), Gaps = 25/318 (7%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNN------GLAEFNQDFSSRILISYRKGFDPIGDS 119
S S IWLLG C+ Q E A N G+ F +DFSS I +SYRK F + +S
Sbjct: 63 SDSPIWLLGRCYYAKQAEYDSKNAVQNTQYKIHGIDCFFEDFSSLIYLSYRKHFSQLANS 122
Query: 120 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGD--SET 175
+TSD GWGCMLR+ QML+A ALL H L WR +K ++ Y+ IL F D S+
Sbjct: 123 NLTSDSGWGCMLRTGQMLLANALLIHMLKEGWRISERKYTEKNYIYRMILRFFNDENSDN 182
Query: 176 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 234
SPFS+H L++ G G W GP ++ + A + S P + + V
Sbjct: 183 SPFSLHELVRIGSK---KPGEWYGPTSVAHTLSA---------AVNLTSHPVLDTFRVYV 230
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
D V+ ++C+ + + W +L+LVP+ LG + +NP YIP L+ T
Sbjct: 231 ANDCTVYIKDVISTSTKCKNCTKKTCQEKFWRSMLILVPIRLGSDGLNPIYIPCLKALLT 290
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
+GI+GG+P S Y VG Q + I LDPH +Q +++ + ++ H + +
Sbjct: 291 LDYCVGIIGGRPKHSLYFVGFQGKKLINLDPHYLQEYVDMTTQEFPVESFRCH--YPKKM 348
Query: 355 HLDSIDPSLAIGFYCRDK 372
+DPS A+GFYCR +
Sbjct: 349 AFKKMDPSCAVGFYCRTR 366
>gi|417410362|gb|JAA51656.1| Putative cysteine protease required for autophagy, partial
[Desmodus rotundus]
Length = 396
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/314 (31%), Positives = 161/314 (51%), Gaps = 26/314 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADMPS 195
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
E P+ +A+ H S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 196 ESSHDPL----NATNHNKAISACCPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ + D + + + + + +
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQSPQRMSILN 311
Query: 359 IDPSLAIGFYCRDK 372
+DPS+A+GF+C+++
Sbjct: 312 LDPSVALGFFCKEE 325
>gi|338729393|ref|XP_001490718.3| PREDICTED: cysteine protease ATG4A [Equus caballus]
Length = 398
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 167/315 (53%), Gaps = 28/315 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDEDG 238
G + G W GP A+ W +LA + + + + I +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTAG 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
E +P ++ ++R S S G W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 E---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 357
LG +GGKP + Y +G + I+LDPH Q ++ +++ D T+H + +++
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSPQRMNIL 312
Query: 358 SIDPSLAIGFYCRDK 372
++DPS+A+GF+C+++
Sbjct: 313 NLDPSVALGFFCKEE 327
>gi|27763985|emb|CAD43221.1| autophagin-2 [Mus musculus]
gi|148675648|gb|EDL07595.1| mCG64870 [Mus musculus]
Length = 396
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 102/324 (31%), Positives = 159/324 (49%), Gaps = 49/324 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPFKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + L + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLTLFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTVSNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D + +
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + ++DPS+A+GF+C+++
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEE 324
>gi|20071131|gb|AAH27184.1| Autophagy-related 4B (yeast) [Mus musculus]
gi|26353914|dbj|BAC40587.1| unnamed protein product [Mus musculus]
gi|74188242|dbj|BAE25791.1| unnamed protein product [Mus musculus]
Length = 393
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 109/327 (33%), Positives = 163/327 (49%), Gaps = 40/327 (12%)
Query: 65 SSTSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
++ +W+LG + I +DE L D A SR+ +YR+ F IG + T
Sbjct: 21 ETSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPT 67
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH
Sbjct: 68 SDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQ 127
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDED 237
+ Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 128 IAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEI 179
Query: 238 GERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y
Sbjct: 180 RRLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAY 239
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S
Sbjct: 240 VETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESF 299
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + +DPS+A+GF+C+ +
Sbjct: 300 HCQHPPSRMGIGELDPSIAVGFFCKKE 326
>gi|348513452|ref|XP_003444256.1| PREDICTED: cysteine protease ATG4B-like [Oreochromis niloticus]
Length = 391
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 104/292 (35%), Positives = 154/292 (52%), Gaps = 16/292 (5%)
Query: 92 NGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
N L E ++ D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LG
Sbjct: 34 NALTEKDEILSDVTSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVCRHLG 93
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
R WR + EY+ +L+ F D + S +SIH + Q G G G W GP + + +
Sbjct: 94 RDWRWAKGQKQRDEYISLLNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLK 153
Query: 209 ALARCQRAETGLGCQSLPMAIYVVS--------GDEDGERGGAPVV--CIDDASRHCSVF 258
LA + ++ + + D GE G + C++ A C++
Sbjct: 154 KLAVFDTWSKVVVHVAMDNTVVIEEIKRLCMPWLDACGELEGVGELNGCLEGA---CAMA 210
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
+ A W P++LL+PL LGL +N YI TL+ F PQSLG++GGKP ++ Y +G E
Sbjct: 211 EEETALWRPLVLLIPLRLGLSDINDAYIETLKQCFMLPQSLGVIGGKPNSAHYFIGYVGE 270
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
IYLDPH QP + +D D + + +H+ +DPS+A GF+CR
Sbjct: 271 ELIYLDPHTTQPAVEPSEDSQVPDETYHCQHPPCRMHICELDPSIAAGFFCR 322
>gi|148707985|gb|EDL39932.1| autophagy-related 4B (yeast), isoform CRA_a [Mus musculus]
Length = 390
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 109/327 (33%), Positives = 163/327 (49%), Gaps = 40/327 (12%)
Query: 65 SSTSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
++ +W+LG + I +DE L D A SR+ +YR+ F IG + T
Sbjct: 18 ETSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPT 64
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH
Sbjct: 65 SDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQ 124
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDED 237
+ Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 125 IAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEI 176
Query: 238 GERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y
Sbjct: 177 RRLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAY 236
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S
Sbjct: 237 VETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESF 296
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + +DPS+A+GF+C+ +
Sbjct: 297 HCQHPPSRMGIGELDPSIAVGFFCKKE 323
>gi|298231123|ref|NP_001177212.1| cysteine protease ATG4B [Sus scrofa]
gi|296874484|gb|ADH81747.1| autophagy related 4-like protein B [Sus scrofa]
Length = 393
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 111/324 (34%), Positives = 156/324 (48%), Gaps = 36/324 (11%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQALL LGR WR + Y +LH F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALLCRHLGRGWRWTQWERQPDSYFSVLHAFMDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYVVSG 234
Q G G + G W GP + + +W ALA + + I +
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSALA----VHVAMDNTVVMEEIRRLCR 184
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPT 288
G A D+ RHC+ F W P++LL+PL LGL +N Y T
Sbjct: 185 SSLPRAGAAAFPA--DSDRHCNGFPAEAEVGPRPVPWRPLVLLIPLRLGLTDINAAYTET 242
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + L D S +
Sbjct: 243 LKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVQVTDSCLIPDESFHCQ 302
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 303 HPPHRMSIAELDPSIAVGFFCQTE 326
>gi|61211813|sp|Q8BGE6.2|ATG4B_MOUSE RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
cysteine endopeptidase; AltName: Full=Autophagin-1;
AltName: Full=Autophagy-related cysteine endopeptidase
1; AltName: Full=Autophagy-related protein 4 homolog B
Length = 393
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 162/324 (50%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCK 324
>gi|26334447|dbj|BAC30924.1| unnamed protein product [Mus musculus]
Length = 396
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 102/324 (31%), Positives = 160/324 (49%), Gaps = 49/324 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+Y + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYDSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D + +
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + ++DPS+A+GF+C+++
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEE 324
>gi|348563665|ref|XP_003467627.1| PREDICTED: cysteine protease ATG4A-like [Cavia porcellus]
Length = 398
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 160/313 (51%), Gaps = 24/313 (7%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
G + G W GP A+ W +LA + + + + V+ D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPFSADTAD 197
Query: 241 GGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
+P I + S+ S F W P+LL+VPL LG+ ++NP Y+ + F PQSL
Sbjct: 198 KSSPDSFITSNQSKDTSAFCPA---WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSL 254
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 359
G +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ ++
Sbjct: 255 GALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQQMNILNL 314
Query: 360 DPSLAIGFYCRDK 372
DPS+A+GF+C+++
Sbjct: 315 DPSVALGFFCKEE 327
>gi|354474222|ref|XP_003499330.1| PREDICTED: cysteine protease ATG4B-like [Cricetulus griseus]
Length = 479
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 109/327 (33%), Positives = 161/327 (49%), Gaps = 40/327 (12%)
Query: 65 SSTSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
++ +W+LG + I +DE L D A SR+ +YR+ F IG + T
Sbjct: 107 ETSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPT 153
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH
Sbjct: 154 SDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQ 213
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDED 237
+ Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 214 IAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEI 265
Query: 238 GERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRY 285
A + C D+ RHC+ F G W P++LL+PL LGL +N Y
Sbjct: 266 RRLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAY 325
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S
Sbjct: 326 VETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESF 385
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + +DPS+A+GF+C +
Sbjct: 386 HCQHPPCRMGIGELDPSIAVGFFCETE 412
>gi|197100863|ref|NP_001126588.1| cysteine protease ATG4A [Pongo abelii]
gi|61211744|sp|Q5R699.1|ATG4A_PONAB RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|55732020|emb|CAH92717.1| hypothetical protein [Pongo abelii]
Length = 398
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 163/327 (49%), Gaps = 52/327 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSV--------------------FSKGQ----ADWTPILLLVPLVLGLEKVNPRY 285
D + C V SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRVLPLGADTAGDRPPDSLTASNLSKGTSAYCSAWKPLLLIVPLRLGINQINPVY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ G++ D +
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTGENGTVNDQTF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +++ ++DPS+A+GF+C+++
Sbjct: 301 HCLQSPQRMNILNLDPSVALGFFCKEE 327
>gi|344286328|ref|XP_003414911.1| PREDICTED: cysteine protease ATG4A [Loxodonta africana]
Length = 411
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 163/327 (49%), Gaps = 52/327 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + + + + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 42 VWILGKQHLLKTERS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 90
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 91 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 150
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 151 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 193
Query: 250 DASRHCSVF--------------------SKGQA----DWTPILLLVPLVLGLEKVNPRY 285
D + C VF SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 194 DIKKMCCVFPLSAGAAGESPPAFPSASSQSKGTSACCPAWKPLLLIVPLRLGINQINPVY 253
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D +
Sbjct: 254 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTF 313
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +++ ++DPS+A+GF+C+++
Sbjct: 314 HCLQSPQRMNILNLDPSVALGFFCKEE 340
>gi|194381088|dbj|BAG64112.1| unnamed protein product [Homo sapiens]
Length = 510
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 161/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 139 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 185
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 186 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 245
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 246 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 297
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 298 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 357
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 358 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 417
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 418 CQHPPCRMSIAELDPSIAVGFFCKTE 443
>gi|402889930|ref|XP_003908250.1| PREDICTED: cysteine protease ATG4B [Papio anubis]
Length = 508
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 161/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 137 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 183
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 184 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 243
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 244 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 295
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 296 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 355
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 356 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 415
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 416 CQHPPCRMSIAELDPSIAVGFFCKTE 441
>gi|410036442|ref|XP_003950065.1| PREDICTED: cysteine protease ATG4B [Pan troglodytes]
Length = 521
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 161/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTE 442
>gi|397483833|ref|XP_003813095.1| PREDICTED: cysteine protease ATG4B isoform 2 [Pan paniscus]
Length = 468
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 161/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTE 414
>gi|146387686|pdb|2P82|A Chain A, Cysteine Protease Atg4a
gi|146387687|pdb|2P82|B Chain B, Cysteine Protease Atg4a
gi|146387688|pdb|2P82|C Chain C, Cysteine Protease Atg4a
gi|146387689|pdb|2P82|D Chain D, Cysteine Protease Atg4a
Length = 355
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 163/317 (51%), Gaps = 32/317 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 25 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 74 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 133
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 134 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 193
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 194 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 246
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 247 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 306
Query: 356 LDSIDPSLAIGFYCRDK 372
+ ++DPS+A+GF+C+++
Sbjct: 307 ILNLDPSVALGFFCKEE 323
>gi|344239232|gb|EGV95335.1| Cysteine protease ATG4B [Cricetulus griseus]
Length = 394
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 109/327 (33%), Positives = 161/327 (49%), Gaps = 40/327 (12%)
Query: 65 SSTSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
++ +W+LG + I +DE L D A SR+ +YR+ F IG + T
Sbjct: 22 ETSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPT 68
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH
Sbjct: 69 SDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQ 128
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDED 237
+ Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 IAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEI 180
Query: 238 GERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRY 285
A + C D+ RHC+ F G W P++LL+PL LGL +N Y
Sbjct: 181 RRLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S
Sbjct: 241 VETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + +DPS+A+GF+C +
Sbjct: 301 HCQHPPCRMGIGELDPSIAVGFFCETE 327
>gi|34531319|dbj|BAC86110.1| unnamed protein product [Homo sapiens]
Length = 468
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 161/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTE 414
>gi|119591684|gb|EAW71278.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_a
[Homo sapiens]
Length = 415
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 161/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTE 326
>gi|402911087|ref|XP_003918174.1| PREDICTED: cysteine protease ATG4A isoform 1 [Papio anubis]
Length = 398
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 163/317 (51%), Gaps = 32/317 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 197
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S HC W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D + + + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVNDQTFHCLQSPQRMN 310
Query: 356 LDSIDPSLAIGFYCRDK 372
+ ++DPS+A+GF+C+++
Sbjct: 311 ILNLDPSVALGFFCKEE 327
>gi|403289551|ref|XP_003935915.1| PREDICTED: cysteine protease ATG4A isoform 1 [Saimiri boliviensis
boliviensis]
Length = 422
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 100/314 (31%), Positives = 163/314 (51%), Gaps = 26/314 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTPG 221
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+R + ++ SR S + W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 222 DRPPDSLTASNE-SRGTSAYCPA---WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 277
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 278 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILN 337
Query: 359 IDPSLAIGFYCRDK 372
+DPS+A+GF+C+++
Sbjct: 338 LDPSVALGFFCKEE 351
>gi|332815902|ref|XP_001162556.2| PREDICTED: cysteine protease ATG4B isoform 1 [Pan troglodytes]
Length = 496
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 161/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTE 442
>gi|71891691|dbj|BAA76787.2| KIAA0943 protein [Homo sapiens]
Length = 396
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCK 327
>gi|291415044|ref|XP_002723769.1| PREDICTED: APG4 autophagy 4 homolog B [Oryctolagus cuniculus]
Length = 473
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 105/316 (33%), Positives = 155/316 (49%), Gaps = 21/316 (6%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
++ +W+LG + + ++ E D +SR+ +YRK F IG + TSD
Sbjct: 103 TSEPVWILGRKYSLLTEKN-----------EILSDVASRLWFTYRKNFPAIGGTGPTSDT 151
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM+ AQAL+ LGR WR QK Y+ +LH F D + S +SIH + Q
Sbjct: 152 GWGCMLRCGQMIFAQALVCRHLGRDWRWTQQKRQPDSYLSVLHAFMDRKDSYYSIHQIAQ 211
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G + G W GP + + + LA + L V+ R P
Sbjct: 212 MGVGEGKSVGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRSSHPC 270
Query: 246 VCIDDASR----HCSVFS-----KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
HC+ F ++ W P++LL+PL LGL +N Y+ TL+L F P
Sbjct: 271 AGAATPPAGADWHCNGFPASTEVTNRSPWRPLVLLIPLRLGLTDINEAYVETLKLCFRMP 330
Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHL 356
QSLG++GGKP ++ Y +G E IYLDPH QP + + D S + + +
Sbjct: 331 QSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDLCFIPDESFHCQHPPCRMSI 390
Query: 357 DSIDPSLAIGFYCRDK 372
+DPS+A+GF+C+ +
Sbjct: 391 GELDPSIAVGFFCKTE 406
>gi|5262636|emb|CAB45756.1| hypothetical protein [Homo sapiens]
gi|12653857|gb|AAH00719.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Homo sapiens]
gi|27763981|emb|CAD43219.1| autophagin-1 [Homo sapiens]
gi|117646318|emb|CAL38626.1| hypothetical protein [synthetic construct]
gi|119591687|gb|EAW71281.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_d
[Homo sapiens]
gi|123981932|gb|ABM82795.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [synthetic
construct]
gi|168273130|dbj|BAG10404.1| ATG4 autophagy related 4 homolog B [synthetic construct]
Length = 393
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCK 324
>gi|355565356|gb|EHH21845.1| hypothetical protein EGK_04999, partial [Macaca mulatta]
Length = 393
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCK 324
>gi|410036440|ref|XP_003309622.2| PREDICTED: cysteine protease ATG4B isoform 5 [Pan troglodytes]
Length = 509
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 161/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTE 442
>gi|119623100|gb|EAX02695.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_f
[Homo sapiens]
Length = 402
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 163/317 (51%), Gaps = 32/317 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 201
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 202 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 254
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 255 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 314
Query: 356 LDSIDPSLAIGFYCRDK 372
+ ++DPS+A+GF+C+++
Sbjct: 315 ILNLDPSVALGFFCKEE 331
>gi|380808290|gb|AFE76020.1| cysteine protease ATG4B isoform a [Macaca mulatta]
gi|383416899|gb|AFH31663.1| cysteine protease ATG4B isoform a [Macaca mulatta]
gi|384941198|gb|AFI34204.1| cysteine protease ATG4B isoform a [Macaca mulatta]
Length = 393
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCK 324
>gi|397483831|ref|XP_003813094.1| PREDICTED: cysteine protease ATG4B isoform 1 [Pan paniscus]
Length = 481
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 161/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTE 414
>gi|90077212|dbj|BAE88286.1| unnamed protein product [Macaca fascicularis]
Length = 393
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCK 324
>gi|391340875|ref|XP_003744760.1| PREDICTED: cysteine protease ATG4D-like [Metaseiulus occidentalis]
Length = 488
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 106/318 (33%), Positives = 156/318 (49%), Gaps = 32/318 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
I+LLG + + A F DFS+R+ +YR+ F P+ + TSD GWGC
Sbjct: 129 IYLLGHVYHNKNNSA--------SFKNFFADFSTRLWFTYRQDFQPMQSTGHTSDSGWGC 180
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV--EILHLFG---DSETSPFSIHNLL 184
MLRS+QM++A+A +FH LGR WR Q+ V +I+ F D+ +PFS+HN++
Sbjct: 181 MLRSAQMMLAEAFIFHLLGRQWRWCPQQQQQEHGVHRKIIKWFSDDPDTTEAPFSVHNMV 240
Query: 185 QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS---LPMAIYVVSGDEDGERG 241
+A G AG W GP L RC G+ MAIYV
Sbjct: 241 RAAAHCGKKAGDWFGPSTAAY---LLKRCLEEAAGVADSKEIFEQMAIYVAQD------- 290
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
+ D C+ S +W ++LL+P+ LG E+VN YI ++ + LGI
Sbjct: 291 --CTIYTQDVLDLCT--SDPNIEWKSVVLLIPVRLGGERVNVNYIHCIKEILAYQNCLGI 346
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y VG Q + +YLDPH +Q + + L +++H R + +DP
Sbjct: 347 IGGKPRHSLYFVGFQGKKLVYLDPHYLQKTTDTSR--LNFSVNSFHCTTARKVSFSKLDP 404
Query: 362 SLAIGFYCRDKGLLVTFE 379
S IGFYC+ + +F+
Sbjct: 405 SATIGFYCKTRRDFESFQ 422
>gi|397483835|ref|XP_003813096.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pan paniscus]
Length = 405
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCK 324
>gi|30410798|ref|NP_847896.1| cysteine protease ATG4B isoform b [Homo sapiens]
Length = 380
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 161/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTE 326
>gi|47132611|ref|NP_037457.3| cysteine protease ATG4B isoform a [Homo sapiens]
gi|296434400|sp|Q9Y4P1.2|ATG4B_HUMAN RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
cysteine endopeptidase; AltName: Full=Autophagin-1;
AltName: Full=Autophagy-related cysteine endopeptidase
1; AltName: Full=Autophagy-related protein 4 homolog B;
Short=hAPG4B
gi|62822370|gb|AAY14919.1| unknown [Homo sapiens]
Length = 393
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCK 324
>gi|355669953|gb|AER94691.1| ATG4 autophagy related 4-like protein A [Mustela putorius furo]
Length = 408
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/314 (31%), Positives = 161/314 (51%), Gaps = 26/314 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 39 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 87
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 88 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 147
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 148 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTVG 207
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
E + +AS G+ W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 208 ESPPDTL----NASNQSKGTPAGRPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 263
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 264 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 323
Query: 359 IDPSLAIGFYCRDK 372
+DPS+A+GF+C+++
Sbjct: 324 LDPSVALGFFCKEE 337
>gi|30795252|ref|NP_443168.2| cysteine protease ATG4A isoform a [Homo sapiens]
gi|426397036|ref|XP_004064734.1| PREDICTED: cysteine protease ATG4A isoform 1 [Gorilla gorilla
gorilla]
gi|61211859|sp|Q8WYN0.1|ATG4A_HUMAN RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
cysteine endopeptidase; AltName: Full=Autophagin-2;
AltName: Full=Autophagy-related cysteine endopeptidase
2; AltName: Full=Autophagy-related protein 4 homolog A;
Short=hAPG4A
gi|18181956|dbj|BAB83889.1| Apg4A [Homo sapiens]
gi|27763979|emb|CAD43218.1| autophagin-2 [Homo sapiens]
gi|38197608|gb|AAH61696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
gi|119623094|gb|EAX02689.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|189069378|dbj|BAG37044.1| unnamed protein product [Homo sapiens]
gi|312151352|gb|ADQ32188.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [synthetic
construct]
Length = 398
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 163/317 (51%), Gaps = 32/317 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 197
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310
Query: 356 LDSIDPSLAIGFYCRDK 372
+ ++DPS+A+GF+C+++
Sbjct: 311 ILNLDPSVALGFFCKEE 327
>gi|78101773|pdb|2CY7|A Chain A, The Crystal Structure Of Human Atg4b
Length = 396
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCK 327
>gi|88192732|pdb|2D1I|A Chain A, Structure Of Human Atg4b
gi|88192733|pdb|2D1I|B Chain B, Structure Of Human Atg4b
Length = 398
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 27 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 73
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 74 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 133
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 134 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 185
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 186 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 245
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 246 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 305
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 306 CQHPPCRMSIAELDPSIAVGFFCK 329
>gi|410989157|ref|XP_004000831.1| PREDICTED: cysteine protease ATG4A isoform 1 [Felis catus]
Length = 398
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/313 (31%), Positives = 163/313 (52%), Gaps = 24/313 (7%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
G + G W GP A+ W +LA + + + + V+ D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTVG 197
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
P ++ +++ F+ A W P+LL+VPL LG+ ++NP Y+ + F PQSLG
Sbjct: 198 ESTPGT-LNASNQSRGTFACCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSLG 255
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLDSI 359
+GGKP + Y +G + I+LDPH Q +N +++ D T+H + +++ ++
Sbjct: 256 ALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVNT-EENGTVDDQTFHCLQSPQRMNILNL 314
Query: 360 DPSLAIGFYCRDK 372
DPS+A+GF+C+++
Sbjct: 315 DPSVALGFFCKEE 327
>gi|410206608|gb|JAA00523.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410247746|gb|JAA11840.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410295834|gb|JAA26517.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410352839|gb|JAA43023.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
Length = 393
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCK 324
>gi|432107261|gb|ELK32675.1| Cysteine protease ATG4B [Myotis davidii]
Length = 394
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 111/325 (34%), Positives = 158/325 (48%), Gaps = 42/325 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQALL LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129
Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 226
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAIFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 189
Query: 227 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
A D +G G P + + + W P++LL+PL LGL +N Y
Sbjct: 190 CAEATAFPADSEGHCNGLPA---------GAEVTNRPSLWRPLVLLIPLRLGLTDINEAY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + L D S
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSFLIPDESF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCR 370
+ + + +DPS+A+GF+C+
Sbjct: 301 HCQHPPSRMSIGELDPSIAVGFFCK 325
>gi|198438023|ref|XP_002129793.1| PREDICTED: similar to CG6194 CG6194-PA [Ciona intestinalis]
Length = 517
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 109/339 (32%), Positives = 165/339 (48%), Gaps = 39/339 (11%)
Query: 68 SDIWLLGVCHKIAQDEALGDAAGN----------------NGLAEFNQDFSSRILISYRK 111
S +WLLG C+ + + D + N L F DF S++ +YRK
Sbjct: 67 SPLWLLGKCYHLKKPSLSSDTSENAEGSQQSTSESYNMLPKHLKLFLVDFHSKLWFTYRK 126
Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYV--EILH 168
GF + D+ +TSD GWGCMLR++QM++AQ+ + H LGR WR P + ++ + I+
Sbjct: 127 GFPTLNDTNLTSDTGWGCMLRTAQMMIAQSFIVHLLGRNWRWTPSRLSMEQSDIHRNIIT 186
Query: 169 LFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
F D + PFS+H L + G +Y G+W GP + C + +T L L
Sbjct: 187 WFLDEQNIRCPFSLHQLTEIGLSYRCKPGNWYGPNTAAYIMQDALECAKGKTEL----LN 242
Query: 227 MAIYVVSGDEDGERGGAPVVC-----IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ ++ D +C DA S S ++ +++L+P+ LG +
Sbjct: 243 NIMVYIAQDSTVYIDDVIEMCEWKNTASDADLKTSTTSSNRS----VIVLIPVRLGEATL 298
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG--KDDL 339
NP YIP ++ T QS+GI+GGKP S Y +G Q+E YLDPH Q + K+DL
Sbjct: 299 NPIYIPCIQSMLTLDQSVGIMGGKPKHSLYFIGFQDEYLFYLDPHYCQQADHPAAFKNDL 358
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
YH + R ++ +DPS +GFYCRD +F
Sbjct: 359 ---LQNYHCNSPRKTNISKMDPSCCLGFYCRDYKDFQSF 394
>gi|332226092|ref|XP_003262223.1| PREDICTED: cysteine protease ATG4A isoform 1 [Nomascus leucogenys]
Length = 398
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 163/317 (51%), Gaps = 32/317 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTAG 197
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310
Query: 356 LDSIDPSLAIGFYCRDK 372
+ ++DPS+A+GF+C+++
Sbjct: 311 ILNLDPSVALGFFCKEE 327
>gi|383860522|ref|XP_003705738.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like
[Megachile rotundata]
Length = 518
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 111/348 (31%), Positives = 162/348 (46%), Gaps = 55/348 (15%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
S S +WLLG ++ +E L A+ + + EF +DF+SR
Sbjct: 126 SKESPVWLLGKIYRKKPEEFLEKASEAEKTLDTGSEISLAMDAISFEDSIEEFKKDFTSR 185
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P E
Sbjct: 186 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWQPDQPIKTEQQ 245
Query: 165 E--------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
+ I+ FGD SPFSIH L+ G +G AG W GP ++A
Sbjct: 246 KLDESNHRFIIQSFGDLPERISPFSIHTLVSLGALWGKRAGDWYGP-------SSVAHLL 298
Query: 215 RAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
+ LP +A+YV V + D C + W ++L
Sbjct: 299 SQAVEHAAEHLPIFSNLAVYVAQD---------CAVYLQDVESVCQM---PDGKWKSLIL 346
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
VPL LG +K+NP Y L T +G++GG+P S Y +G QE+ I LDPH Q
Sbjct: 347 FVPLRLGTDKLNPVYTSCLTHLLTLDTCIGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE 406
Query: 331 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+++ KD+ +++H R + + +DPS +GFY DK F
Sbjct: 407 TVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHDKNQFTNF 452
>gi|397497900|ref|XP_003819741.1| PREDICTED: cysteine protease ATG4A isoform 1 [Pan paniscus]
Length = 398
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 163/317 (51%), Gaps = 32/317 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTPG 197
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310
Query: 356 LDSIDPSLAIGFYCRDK 372
+ ++DPS+A+GF+C+++
Sbjct: 311 ILNLDPSVALGFFCKEE 327
>gi|350537069|ref|NP_001233457.1| cysteine protease ATG4A [Pan troglodytes]
gi|343958112|dbj|BAK62911.1| cysteine protease ATG4A [Pan troglodytes]
gi|410207960|gb|JAA01199.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410248796|gb|JAA12365.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410290856|gb|JAA24028.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410329967|gb|JAA33930.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
Length = 398
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 165/322 (51%), Gaps = 42/322 (13%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
G + G W GP A+ W +LA + + C+ LP++I
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSI---- 193
Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
D G+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 194 -DTPGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 245
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305
Query: 351 IRHIHLDSIDPSLAIGFYCRDK 372
+ +++ ++DPS+A+GF+C+++
Sbjct: 306 PQRMNILNLDPSVALGFFCKEE 327
>gi|350426238|ref|XP_003494376.1| PREDICTED: cysteine protease ATG4D-like [Bombus impatiens]
Length = 486
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 154/305 (50%), Gaps = 27/305 (8%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191
Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
H LGR WR + +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 192 CHFLGREWRWQVDQPLKTEQQKLDEHNHRLIIKSFGDLPDSTSPFSIHTLVSLGALWGKR 251
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
AG W GP ++ Q AE +L A+YV V + D
Sbjct: 252 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 299
Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
C + W ++L VPL LG +K+NP Y L T +G++GG+P S Y +
Sbjct: 300 VCQM---PDGKWKSLILFVPLRLGADKLNPVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 356
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY +K
Sbjct: 357 GFQEDKLINLDPHYCQETVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHNKM 414
Query: 374 LLVTF 378
F
Sbjct: 415 QFTNF 419
>gi|343961553|dbj|BAK62366.1| cysteine protease ATG4B [Pan troglodytes]
Length = 393
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCK 324
>gi|195995623|ref|XP_002107680.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
gi|190588456|gb|EDV28478.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
Length = 385
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 110/320 (34%), Positives = 152/320 (47%), Gaps = 53/320 (16%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVG 126
+WLLG C+ N L EF++ D +S+ +YRK + PIG TSD G
Sbjct: 25 VWLLGCCY--------------NPLEEFDKLIADINSKFWFTYRKNYPPIGGIGPTSDKG 70
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCMLR QM++ QAL+ LGR WR K Y +IL LF DS+ S +SIH + Q
Sbjct: 71 WGCMLRCGQMILGQALVMRHLGRDWRWFKNKEQLANYWKILKLFLDSKDSLYSIHQIAQM 130
Query: 187 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
G + G W GP + + L M +YV + +V
Sbjct: 131 GVSEGKKISQWFGPNTAAQVLKKLIMFDEWSQ--------MGVYVAMDN---------IV 173
Query: 247 CIDDASR----HCSVFSKGQA--------------DWTPILLLVPLVLGLEKVNPRYIPT 288
IDD + H + S+G A W P+LL +PL LGL +NP Y
Sbjct: 174 VIDDIKKICHNHITRTSQGNAANSDAQGSSNEQSNAWKPLLLFIPLRLGLTDLNPIYKDK 233
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L F +LGI+GGKP ++ Y +G+Q + +YLDPH VQ + + K + TYH
Sbjct: 234 LNKCFRIKNTLGIIGGKPNSAHYFIGIQGDYLLYLDPHTVQETVKV-KPNCPFSDKTYHQ 292
Query: 349 DVIRHIHLDSIDPSLAIGFY 368
+H +DPS+A+GFY
Sbjct: 293 KGTNRLHFSYMDPSVALGFY 312
>gi|62860068|ref|NP_001016619.1| autophagy related 4A, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|89269917|emb|CAJ81691.1| APG4 autophagy 4 homolog A (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
gi|171846953|gb|AAI61565.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
gi|213625518|gb|AAI70776.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
gi|213627145|gb|AAI70802.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
Length = 395
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 105/288 (36%), Positives = 149/288 (51%), Gaps = 27/288 (9%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR WR K
Sbjct: 52 DIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWRWEKHKEH 111
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
EY +IL F D + +SIH + Q G G + G W GP + + + LA +
Sbjct: 112 PEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNS- 170
Query: 220 LGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSK-----GQAD-WT 266
+A+Y VV D P C + A+ + S +S+ GQ+ W
Sbjct: 171 -------LAVYVSMDNTVVIEDIKTMCKYQPHSCSMAQAASYQSTWSRCRDASGQSSGWR 223
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G + IYLDPH
Sbjct: 224 PLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEIIYLDPH 283
Query: 327 DVQPVINIGKDDLEADTSTYHSDV-IRHIHLDSIDPSLAIGFYCRDKG 373
Q + D E TYH + + ++DPS+A+GF+C+D+
Sbjct: 284 TTQTFV-----DTEDQDQTYHCQKGPNSMKVLNLDPSVALGFFCKDEN 326
>gi|355705060|gb|EHH30985.1| Cysteine protease ATG4A, partial [Macaca mulatta]
Length = 396
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 162/317 (51%), Gaps = 32/317 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 195
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S HC W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 196 DRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 248
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 249 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 308
Query: 356 LDSIDPSLAIGFYCRDK 372
+ ++DPS+A+GF+C+++
Sbjct: 309 ILNLDPSVALGFFCKEE 325
>gi|15487240|emb|CAC69076.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
Length = 398
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 99/314 (31%), Positives = 162/314 (51%), Gaps = 26/314 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+R + + S+ S + W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 DRPPDSLTA-SNQSKGTSAYCTA---WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILN 313
Query: 359 IDPSLAIGFYCRDK 372
+DPS+A+GF+C+++
Sbjct: 314 LDPSVALGFFCKEE 327
>gi|387762879|ref|NP_001248420.1| cysteine protease ATG4A [Macaca mulatta]
gi|380809390|gb|AFE76570.1| cysteine protease ATG4A isoform a [Macaca mulatta]
gi|383413573|gb|AFH30000.1| cysteine protease ATG4A isoform a [Macaca mulatta]
Length = 398
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 162/317 (51%), Gaps = 32/317 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 197
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S HC W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310
Query: 356 LDSIDPSLAIGFYCRDK 372
+ ++DPS+A+GF+C+++
Sbjct: 311 ILNLDPSVALGFFCKEE 327
>gi|307205961|gb|EFN84087.1| Cysteine protease ATG4D [Harpegnathos saltator]
Length = 456
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 167/345 (48%), Gaps = 48/345 (13%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
S S +WLLG C+ ++ L +A+ N + EF +DF+SR
Sbjct: 62 SKESPVWLLGQCYLKKSEDPLENASEALEPEGTGSQVSLAMDATNFENTIEEFKRDFASR 121
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------KPLQ 156
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR W+ Q
Sbjct: 122 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWKWRPEQSIENTQQ 181
Query: 157 KPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
D + I+ F D SPFSIH L+ G + G AG W GP ++ L++
Sbjct: 182 MRDDSNHRMIIKWFADQSKPESPFSIHRLVSLGASTGKRAGDWYGPNSVAH---LLSQAV 238
Query: 215 RAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
L L +A+YV V + D C G W ++LLVP
Sbjct: 239 ERTGELPNSKLSRLAVYVAQD---------CAVYMQDVEEVCRTSDGG---WKSLILLVP 286
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L+LG +K+NP Y P + T +G++GG+P S Y +G Q++ I+LDPH Q ++
Sbjct: 287 LMLGTDKLNPVYAPCVTSLLTLDACIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVD 346
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+ K++ +++H R + L +DPS +GFY ++ L F
Sbjct: 347 VSKENFPL--TSFHCTSPRKMLLSKMDPSCCVGFYFPNRESLTDF 389
>gi|14042685|dbj|BAB55353.1| unnamed protein product [Homo sapiens]
Length = 380
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 161/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ + PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCYMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTE 326
>gi|344299096|ref|XP_003421224.1| PREDICTED: cysteine protease ATG4B [Loxodonta africana]
Length = 420
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 105/326 (32%), Positives = 160/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 49 TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 95
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQALL LGR WR ++ Y +LH F D + S +SIH +
Sbjct: 96 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWAQRRRQPDSYFSVLHAFIDRKDSHYSIHQI 155
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD-------- 235
Q G G + G W GP + + + LA + +A+++ +
Sbjct: 156 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 207
Query: 236 ---EDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
+ C D S+HC+ G + W P++LL+PL LGL +N Y+
Sbjct: 208 RLCKSSTPCAGAAACPADPSQHCNGLPAGAEAAGRPSTWRPLVLLIPLRLGLTDINEAYV 267
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D + +
Sbjct: 268 ETLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELAGGFSIPDETFH 327
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+++ +DPS+A+GF+C+ +
Sbjct: 328 CQHPPCRMNIAELDPSIAVGFFCKTE 353
>gi|395854618|ref|XP_003799779.1| PREDICTED: cysteine protease ATG4A isoform 1 [Otolemur garnettii]
Length = 398
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 101/321 (31%), Positives = 164/321 (51%), Gaps = 28/321 (8%)
Query: 64 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
S + +W+LG H + +++ + D S+R+ +YR+ F PIG + +S
Sbjct: 23 SDTDELVWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSS 71
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM++AQAL+ LGR W QK +EY IL F D + +SIH +
Sbjct: 72 DAGWGCMLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQM 131
Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV-- 232
Q G G + G W GP A+ W +LA + + + + V+
Sbjct: 132 AQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPS 191
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
S D GE + ++ + S + W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 192 SADTAGESPPGSLTALNQSKGT----SACRPAWKPLLLIVPLRLGINQINPVYVDAFKEC 247
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVI 351
F PQSLG +GGKP + Y +G I+LDPH Q ++ +++ D T+H
Sbjct: 248 FKMPQSLGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSP 306
Query: 352 RHIHLDSIDPSLAIGFYCRDK 372
+ +++ ++DPS+A+GF+C+++
Sbjct: 307 QRMNILNLDPSVALGFFCKEE 327
>gi|395851538|ref|XP_003798310.1| PREDICTED: cysteine protease ATG4B [Otolemur garnettii]
Length = 393
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 160/323 (49%), Gaps = 38/323 (11%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
++ +W+LG + I ++ E D +SR+ +YRK F IG + TSD
Sbjct: 22 TSEPVWILGRKYSIFTEKE-----------ELLSDVASRLWFTYRKNFPAIGGTGPTSDT 70
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q
Sbjct: 71 GWGCMLRCGQMIFAQALVCQHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQ 130
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGER- 240
G G + G W GP + + + LA + +A+++ + E+ R
Sbjct: 131 MGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRL 182
Query: 241 -------GGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIP 287
G AP +HC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 183 CRTSLPCGTAPASSA-APDQHCNGFPAGAEVTTRLSPWRPLVLLIPLRLGLTDINAAYVE 241
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 347
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + L D S +
Sbjct: 242 TLKRCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEATDSCLVPDESFHC 301
Query: 348 SDVIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 302 QHPPCRMSIGELDPSIAVGFFCK 324
>gi|340369400|ref|XP_003383236.1| PREDICTED: cysteine protease ATG4A-like [Amphimedon queenslandica]
Length = 394
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 105/306 (34%), Positives = 155/306 (50%), Gaps = 40/306 (13%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
++LLGV + + +D A F +D SR +YRK F PIGD+ TSD GWGC
Sbjct: 45 VYLLGVKYDLPRDGA-----------SFVEDLQSRFWFTYRKNFRPIGDTGYTSDSGWGC 93
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
LR QML+ LL LGR WR D +Y +IL +F D S +SI + G
Sbjct: 94 TLRCGQMLLGHTLLLRHLGRDWRWSPSSSNDYKYQKILRMFLDYRDSEYSIQMIALQGAD 153
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
+G + G W GP + ++ + LA + Q +A+YV +V ID
Sbjct: 154 FGRSVGQWFGPNNVAQAIKRLA--------VHDQWSEVAVYVAMD---------MLVVID 196
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D S ++ P+L+ +PL LG E+ N Y ++ F QS+GI+GGKP +
Sbjct: 197 DIS-----------NFRPVLVFIPLRLGQERFNMEYKEAVKACFAVRQSVGIIGGKPRHA 245
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+ G ++ IYLDPH Q + + + +D STYH+ I +H+ +DPSLA+GF+C
Sbjct: 246 LWFTGYHDDYLIYLDPHKTQSCVTLPDAGIVSD-STYHTTQIERLHISELDPSLALGFFC 304
Query: 370 RDKGLL 375
+ + L
Sbjct: 305 QTEADL 310
>gi|50369556|gb|AAH76463.1| Atg4b protein, partial [Danio rerio]
Length = 393
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 147/285 (51%), Gaps = 17/285 (5%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 44 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 103
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 104 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 163
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 265
A + + + + D +RG P D C++ + A W
Sbjct: 164 AVHVAMDNTVVIEEIKRLCMPWL---DFDRGACAVSEEPREMNGDLEGACALAEEETALW 220
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 221 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 280
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
H QP ++ +D D S + +H+ +DPS+A GF+C+
Sbjct: 281 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQ 325
>gi|47564112|ref|NP_001001171.1| cysteine protease ATG4A [Bos taurus]
gi|61211781|sp|Q6PZ05.1|ATG4A_BOVIN RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related cysteine endopeptidase 2A;
Short=Autophagin-2A; AltName: Full=Autophagy-related
protein 4 homolog A; AltName: Full=bAut2A
gi|45861656|gb|AAS78581.1| Aut2a [Bos taurus]
Length = 398
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 161/315 (51%), Gaps = 28/315 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 357
LG +GGKP + Y +G + I+LDPH Q ++ +++ AD T+H + +++
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFHCLQPPQRMNIL 312
Query: 358 SIDPSLAIGFYCRDK 372
++DPS+A+GF+C+++
Sbjct: 313 NLDPSVALGFFCKEE 327
>gi|296470926|tpg|DAA13041.1| TPA: cysteine protease ATG4A [Bos taurus]
Length = 396
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 161/315 (51%), Gaps = 28/315 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 357
LG +GGKP + Y +G + I+LDPH Q ++ +++ AD T+H + +++
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFHCLQPPQRMNIL 312
Query: 358 SIDPSLAIGFYCRDK 372
++DPS+A+GF+C+++
Sbjct: 313 NLDPSVALGFFCKEE 327
>gi|61211768|sp|Q6DG88.2|ATG4B_DANRE RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
Length = 394
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 147/285 (51%), Gaps = 17/285 (5%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 45 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 265
A + + + + D +RG P D C++ + A W
Sbjct: 165 AVHVAMDNTVVIEEIKRLCMPWL---DFDRGACAVSEEPREMNGDLEGACALAEEETALW 221
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
H QP ++ +D D S + +H+ +DPS+A GF+C+
Sbjct: 282 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQ 326
>gi|380015613|ref|XP_003691794.1| PREDICTED: cysteine protease ATG4D-like [Apis florea]
Length = 486
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 105/309 (33%), Positives = 153/309 (49%), Gaps = 35/309 (11%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191
Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
H LGR WR +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 192 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 251
Query: 194 AGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
AG W GP + + ++ E A A L A+YV V +
Sbjct: 252 AGDWYGPSSVAHLLSQAVENAAERHPAFNNL-------AVYVAQD---------CAVYLQ 295
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D C W ++L VPL LG +K+NP Y L T +G++GG+P S
Sbjct: 296 DIENVCQT---PDGKWKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 352
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Y +G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY
Sbjct: 353 LYFIGFQEDKLINLDPHYCQETVDVLKDNFSL--TSFHCTSPRKMLISKMDPSCCVGFYF 410
Query: 370 RDKGLLVTF 378
+K F
Sbjct: 411 HNKMQFTNF 419
>gi|345807894|ref|XP_538136.3| PREDICTED: cysteine protease ATG4A [Canis lupus familiaris]
Length = 398
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 100/314 (31%), Positives = 165/314 (52%), Gaps = 26/314 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D +R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDIRARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK REY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPREYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAIYVSMDNTVVIEDIKKMCCVLPLSADTIG 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
E +P+ ++ +++ S + A W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 E---SPLNTLNASNQSKSAPASCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 313
Query: 359 IDPSLAIGFYCRDK 372
+DPS+A+GF+C+++
Sbjct: 314 LDPSVALGFFCKEE 327
>gi|148237097|ref|NP_001082821.1| cysteine protease ATG4B [Danio rerio]
gi|141795460|gb|AAI34887.1| Atg4b protein [Danio rerio]
Length = 394
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 147/285 (51%), Gaps = 17/285 (5%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 45 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 265
A + + + + D +RG P D C++ + A W
Sbjct: 165 AVHVAMDNTVVIEEIKRLCMPWL---DFDRGACAVSEEPREMNGDLEGACALAEEETALW 221
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
H QP ++ +D D S + +H+ +DPS+A GF+C+
Sbjct: 282 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQ 326
>gi|349605276|gb|AEQ00569.1| Cysteine protease ATG4A-like protein, partial [Equus caballus]
Length = 369
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 166/315 (52%), Gaps = 30/315 (9%)
Query: 71 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGCM
Sbjct: 1 WILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 49
Query: 131 LRSSQMLVAQALLFHRLGRP--WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
LR QM++AQAL+ LGR W K ++P +EY IL F D + +SIH + Q G
Sbjct: 50 LRCGQMMLAQALICRHLGRDLNWEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGV 107
Query: 189 AYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDED 237
G + G W GP A+ W +LA + + + + I +S D
Sbjct: 108 GEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTA 167
Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
GE +P ++ ++R S S G W P+LL+VPL LG+ ++NP Y+ + F PQ
Sbjct: 168 GE---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQ 223
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
SLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++
Sbjct: 224 SLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNIL 283
Query: 358 SIDPSLAIGFYCRDK 372
++DPS+A+GF+C+++
Sbjct: 284 NLDPSVALGFFCKEE 298
>gi|440790872|gb|ELR12135.1| autophagy protein 4, putative [Acanthamoeba castellanii str. Neff]
Length = 510
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 109/337 (32%), Positives = 162/337 (48%), Gaps = 62/337 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
F DF SR+ ++YR F IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR +
Sbjct: 115 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 174
Query: 157 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR- 212
+ Y E+L F D S SP+SIH + + G + + G W P + + L
Sbjct: 175 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLLVTE 233
Query: 213 ---------------CQRAETGLGC---------QSLPMAIYVV---------------S 233
R E C Q P+ + S
Sbjct: 234 HSPNGLKMYVPKDGIIYRKEVYQLCAVQPADGPAQHSPLRVDDDGGDTDHDGDTDGLESS 293
Query: 234 GDEDGERGGAP-----VVCIDDASRHCSVFSKGQAD------------WTPILLLVPLVL 276
D G P + D +S H + S +++ W P+++LVP+ L
Sbjct: 294 TDSMRHSHGNPGVPSTIEAGDYSSSHAELMSSAESECESLDDNFTELTWHPVIILVPVRL 353
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G++ +NP YIPTL+ F+FPQ LG++GGKP +S Y VG Q+ +Y+DPH VQP + +
Sbjct: 354 GIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMDPHFVQPTVKMDD 413
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
D L +Y ++ + + D IDPSLA+GF C +
Sbjct: 414 DPL-FPIESYRMEIPQAMSFDDIDPSLALGFLCSSQA 449
>gi|291407754|ref|XP_002720229.1| PREDICTED: autophagy-related cysteine endopeptidase 2 [Oryctolagus
cuniculus]
Length = 405
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 161/315 (51%), Gaps = 28/315 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 36 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 84
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 85 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 144
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S + G
Sbjct: 145 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSANTPG 204
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 205 ERLHDSLT----ASNQSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 260
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 357
LG +GGKP + Y +G I+LDPH Q ++ +++ D T+H + +++
Sbjct: 261 LGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSPQRMNIL 319
Query: 358 SIDPSLAIGFYCRDK 372
++DPS+A+GF+C+++
Sbjct: 320 NLDPSVALGFFCKEE 334
>gi|328786958|ref|XP_393739.4| PREDICTED: cysteine protease ATG4D-like [Apis mellifera]
Length = 525
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 107/309 (34%), Positives = 157/309 (50%), Gaps = 35/309 (11%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 171 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 230
Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
H LGR WR +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 231 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 290
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCID 249
AG W GP ++ A Q E + + P +A+YV V +
Sbjct: 291 AGDWYGPSSV-----AHLLSQAVENAV--ERHPAFNNLAVYVAQD---------CAVYLQ 334
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D C S G+ W ++L VPL LG +K+NP Y L T +G++GG+P S
Sbjct: 335 DIENVCQT-SDGK--WKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 391
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Y +G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY
Sbjct: 392 LYFIGFQEDKLINLDPHYCQETVDVLKDNFSL--TSFHCTSPRKMLISKMDPSCCVGFYF 449
Query: 370 RDKGLLVTF 378
+K F
Sbjct: 450 HNKMQFTNF 458
>gi|348577273|ref|XP_003474409.1| PREDICTED: cysteine protease ATG4B [Cavia porcellus]
Length = 412
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 157/320 (49%), Gaps = 28/320 (8%)
Query: 65 SSTSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
++ +W+LG + I +D+ L D A SR+ +YR+ F IG + T
Sbjct: 38 ETSEPVWILGRKYSIFTEKDDILSDVA-------------SRLWFTYRRNFPAIGGTGPT 84
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH
Sbjct: 85 SDTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQ 144
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ Q G G + G W GP + + + LA + L V+ R G
Sbjct: 145 IAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRTG 203
Query: 243 AP----VVCIDDASRHCSVF--------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
P DA RHC+ F + + W P++LL+PL LGL +N Y+ TL+
Sbjct: 204 LPCAGAAALPTDADRHCNGFPTQTEVTNRQSPSLWRPLVLLIPLRLGLTDINEAYVETLK 263
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D + +
Sbjct: 264 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDGCFIPDETFHCQHP 323
Query: 351 IRHIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 324 PCRMGIGELDPSIAVGFFCK 343
>gi|347971093|ref|XP_554420.4| AGAP004023-PA [Anopheles gambiae str. PEST]
gi|333469628|gb|EAL39379.4| AGAP004023-PA [Anopheles gambiae str. PEST]
Length = 606
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 113/318 (35%), Positives = 161/318 (50%), Gaps = 34/318 (10%)
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
G+ F +DF SRI ++YR+ F + DS TSD GWGCM+RS QML+AQ L+ H LGR WR
Sbjct: 195 GIDAFRRDFISRIWMTYRREFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLVAHFLGRSWR 254
Query: 153 KPLQKPFDRE---YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 207
+ E + +++ FGD S+TSPFSIH L+ GK G G W GP A+
Sbjct: 255 WDVSMFTAYEESIHRKVIRWFGDTSSKTSPFSIHTLVALGKESGKKPGDWYGPGAVAHLL 314
Query: 208 EALARCQRAET----GLGCQ-SLPMAIYV--------VSGDEDG---ERGGAPVVCIDDA 251
R E G+ + A+Y+ V G +R GAP +
Sbjct: 315 RQAVRLAAQEITDLDGINVYVAQDCAVYIQDILDECTVPATPAGAPWQRKGAPGGTNSSS 374
Query: 252 SRH------CSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
S + ++G D W ++LLVPL LG +K+NP Y L+ + +G
Sbjct: 375 STAHTERSGATSCAEGDEDVQSAHWKSLILLVPLRLGTDKLNPIYNECLKAMLSLDYCIG 434
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GG+P S Y VG QE+ I+LDPH Q ++++ +D+ +++H R + L +D
Sbjct: 435 IIGGRPKHSLYFVGYQEDKLIHLDPHYCQDMVDVNQDNFP--VASFHCKSPRKMKLSKMD 492
Query: 361 PSLAIGFYCRDKGLLVTF 378
PS IGFYC K F
Sbjct: 493 PSCCIGFYCETKKDFYKF 510
>gi|410920724|ref|XP_003973833.1| PREDICTED: cysteine protease ATG4B-like [Takifugu rubripes]
Length = 394
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 106/319 (33%), Positives = 160/319 (50%), Gaps = 27/319 (8%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+T +W+LG + + E D +SR+ +YRK F PIG + TSD
Sbjct: 21 ETTEPVWILG-----------NEYSALTEKEEILSDVTSRLWFTYRKSFPPIGGTGPTSD 69
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLL 184
GWGCMLR QM++ QAL+ LGR WR + +EY+ IL+ F D + S +SIH +
Sbjct: 70 TGWGCMLRCGQMILGQALMCRHLGRDWRWVRGQKQRQEYISILNAFIDKKDSYYSIHQIA 129
Query: 185 QAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 234
Q G G G W GP A+ +W L + + + + + + +
Sbjct: 130 QMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRLVVHVAMDNTVVIEEIKRLCMPWLDK 189
Query: 235 DE---DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
E + ER G C++ A C++ + A W P++LL+PL LGL +N YI TL+
Sbjct: 190 AEVFGEPERVGELNGCLEGA---CALSEEEVALWKPLVLLIPLRLGLSDINGAYIETLKK 246
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
F PQSLG++GGKP ++ Y +G IYLDPH Q + + D + +
Sbjct: 247 CFMLPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQTAVEPCEHGQFPDDTYHCQHPP 306
Query: 352 RHIHLDSIDPSLAIGFYCR 370
+H+ +DPS+A+GF+CR
Sbjct: 307 CRMHICELDPSIAVGFFCR 325
>gi|354500801|ref|XP_003512485.1| PREDICTED: cysteine protease ATG4A-like [Cricetulus griseus]
gi|344251116|gb|EGW07220.1| Cysteine protease ATG4A [Cricetulus griseus]
Length = 398
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 160/327 (48%), Gaps = 52/327 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLRTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKG--QAD----------------------WTPILLLVPLVLGLEKVNPRY 285
D + C V G AD W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCCVLPVGAHTADESPPDSLPASSQGKGPSATCPAWKPLLLIVPLRLGINQINPVY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G + I+LDPH Q ++ + + D +
Sbjct: 241 IEAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEESGIVDDETF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + + ++DPS+A+GF+C+++
Sbjct: 301 HCLQSPQRMSILNLDPSVALGFFCKEE 327
>gi|452977855|gb|EME77619.1| hypothetical protein MYCFIDRAFT_191078 [Pseudocercospora fijiensis
CIRAD86]
Length = 445
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 153/301 (50%), Gaps = 45/301 (14%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 132
+EF DF SR+ I+YR F PI S TSD GWGCM+R
Sbjct: 109 SEFLDDFESRVWITYRDAFPPIPKSSHPAAASKMSFTTKLRNFTNQAGFTSDTGWGCMIR 168
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
S Q L+A ++ HRLGR WRK + +RE+ +IL LF D+ +PFSIH ++ G +A G
Sbjct: 169 SGQSLLANTIVVHRLGRDWRKGQK---EREHKDILSLFADTPDAPFSIHKFVEHGAQACG 225
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A ARC RA T Q+ + +Y D D V ID A
Sbjct: 226 TYPGEWFGP-------NATARCLRALTDKYHQA-GLRVYARPNDSD--------VYID-A 268
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ ++ P L+++ + LG+EKV P Y L+ PQS+GI GG+P +S Y
Sbjct: 269 LTATATQKDANDEFQPTLIVLGIRLGIEKVTPAYHAALKAALELPQSMGIAGGRPSSSHY 328
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
VG Q ++ YLDPH +P+++ + DT H+ +R + L +DPS+ +GF R
Sbjct: 329 FVGHQGDNFFYLDPHTTRPMLSPQPSAEDVDTC--HTRRVRRLSLAEMDPSMLLGFLVRS 386
Query: 372 K 372
K
Sbjct: 387 K 387
>gi|428170513|gb|EKX39437.1| hypothetical protein GUITHDRAFT_143439 [Guillardia theta CCMP2712]
Length = 332
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 150/285 (52%), Gaps = 40/285 (14%)
Query: 70 IWLLGVCHKIA------------QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG 117
+WLLGV + +A ++ + D + N F D SR+ SYR F PI
Sbjct: 70 VWLLGVRYTLAPPPMGQRGEGRETEQTVVDESQN-----FKLDMWSRLWFSYRYNFHPIS 124
Query: 118 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR---EYVEILHLFGDSE 174
+++T+D GWGCM+RS QML+ QAL+ H LGR WR ++ +Y ++L +F D
Sbjct: 125 GTELTTDTGWGCMIRSGQMLIGQALVHHHLGRDWRLSHTSKYNELPSDYRKVLEMFLDHP 184
Query: 175 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC-QSLPMAIYVVS 233
+P SIH+ ++AG+ G AG+W GP +C ++ L A LG +L + Y
Sbjct: 185 CAPLSIHSFVRAGQQVGKKAGTWFGPNTVCSAFSKL----HAGGALGSDNNLQLLAY--- 237
Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
DG G D+ QA P+ +L+P LG+ V+P YIP + F
Sbjct: 238 ---DGNDG-------DNTIYKSEALELLQAG--PLFILLPTRLGVSSVDPSYIPKISHVF 285
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
+FPQSLG +GGKP ++ Y + Q E+ YLDPH QP+INI + +
Sbjct: 286 SFPQSLGFIGGKPSSAHYFIASQGEAVYYLDPHTPQPLINISEKE 330
>gi|417410350|gb|JAA51650.1| Putative cysteine protease required for autophagy, partial
[Desmodus rotundus]
Length = 394
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 110/325 (33%), Positives = 156/325 (48%), Gaps = 42/325 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRRYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQALL LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129
Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 226
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHVAMDNTVVMEDIRRLCRSSLP 189
Query: 227 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
A D +G G P + + + W P++LL+PL LGL +N Y
Sbjct: 190 CAGASAFPADSEGHCNGFPAR---------AEVTNRPSPWRPLVLLIPLRLGLTDINEAY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCSIPDESF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCR 370
+ + + +DPS+A+GF+C
Sbjct: 301 HCQHPPSRMSIGELDPSIAVGFFCE 325
>gi|156395764|ref|XP_001637280.1| predicted protein [Nematostella vectensis]
gi|156224391|gb|EDO45217.1| predicted protein [Nematostella vectensis]
Length = 368
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 105/307 (34%), Positives = 160/307 (52%), Gaps = 39/307 (12%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
+ D+W+LG + I Q GD + N D SRI ++YRK F IG + T+D
Sbjct: 26 TEEDVWILGKRYNILQ----GD------MGYLNTDVRSRIWLTYRKNFPKIGGTGPTTDS 75
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM++AQAL+ LGR W+ + EY++IL F D + S +SIH + Q
Sbjct: 76 GWGCMLRCGQMMLAQALVCRHLGRDWQWDPENNTTPEYMQILEAFLDKKDSLYSIHQIAQ 135
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G + G A GSW GP + + + L+ + + ++V +
Sbjct: 136 MGVSEGKAVGSWFGPNTVAQVLKKLSAFDDWSS--------LCLHVAMDN---------T 178
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
V I+D S +W P++L +PL LGL ++N Y L+ FTF QSLGI+GG+
Sbjct: 179 VIIEDIS-----------NWRPLVLFIPLRLGLTEMNVVYNEPLKACFTFKQSLGIIGGR 227
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +TY +G + +YLDPH Q +N + D S +H +++ +DPS+A+
Sbjct: 228 PNHATYFIGYFGNNLVYLDPHTTQQTVNPDELSRIPDGS-FHCVYPCRMNIADVDPSVAL 286
Query: 366 GFYCRDK 372
GF+C+ +
Sbjct: 287 GFFCKSE 293
>gi|440798079|gb|ELR19150.1| cysteine protease, putative [Acanthamoeba castellanii str. Neff]
Length = 434
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 107/288 (37%), Positives = 148/288 (51%), Gaps = 25/288 (8%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F F S + +YR F +G TSD+GWGCMLR+ QM++AQ L H LG WR+
Sbjct: 108 ASFLTHFRSVVWCTYRAAFPRLGSDSYTSDMGWGCMLRTGQMVLAQTLTRHLLGTEWRRQ 167
Query: 155 LQK--PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
+ P Y +++ F D PFS+H + AG YG G W GP M + E L +
Sbjct: 168 SDRSSPL---YAKMVQWFADDPKQPFSLHRIAHAGLKYGKNVGEWFGPSTMAQVLEELLK 224
Query: 213 CQRAETGLG---CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-WTPI 268
+ + +GL CQ +Y+ P+ DD +GQ W P+
Sbjct: 225 -EFSPSGLRAYVCQD--GCLYLDQLRRTATAAHWPLDEDDD---------EGQGKSWAPM 272
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
L+++PL LGL+++N Y P L+ TF PQS+GI GGKP AS Y VG Q++ YLDPH V
Sbjct: 273 LIMLPLRLGLDQLNEDYAPVLKETFRIPQSVGISGGKPRASLYFVGNQDDYVFYLDPHTV 332
Query: 329 QPV---INIGKDDLEADT-STYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
QP +G D T+H + + IDPSL + FYCR++
Sbjct: 333 QPAPRFPEVGDVPASEDVYDTFHCSAPLRLPIRDIDPSLCLAFYCRNR 380
>gi|281342750|gb|EFB18334.1| hypothetical protein PANDA_015152 [Ailuropoda melanoleuca]
Length = 373
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 103/328 (31%), Positives = 162/328 (49%), Gaps = 54/328 (16%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178
Query: 250 DASRHCSVF--------------------SKGQ----ADWTPILLLVPLVLGLEKVNPRY 285
D + C V SKG W P+LL+VPL LG+ ++NP Y
Sbjct: 179 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 238
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ +++ D T
Sbjct: 239 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQT 297
Query: 346 YHS-DVIRHIHLDSIDPSLAIGFYCRDK 372
+H + + + ++DPS+A+GF+C+++
Sbjct: 298 FHCLQSPQRMSILNLDPSVALGFFCKEE 325
>gi|151554833|gb|AAI47963.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Bos taurus]
Length = 398
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 99/314 (31%), Positives = 158/314 (50%), Gaps = 26/314 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILN 313
Query: 359 IDPSLAIGFYCRDK 372
+DPS+A+GF+C+++
Sbjct: 314 LDPSVALGFFCKEE 327
>gi|149244060|pdb|2Z0D|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-120) Complex
gi|149244062|pdb|2Z0E|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-124) Complex
Length = 357
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 160/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDP QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPATTQPAVEPTDGCFIPDESFH 303
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTE 329
>gi|348666332|gb|EGZ06159.1| hypothetical protein PHYSODRAFT_532364 [Phytophthora sojae]
Length = 398
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 111/300 (37%), Positives = 154/300 (51%), Gaps = 31/300 (10%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 154
+ + F + + +YR+ F + TSD GWGCMLRS+QML+ QAL LGR WR P
Sbjct: 41 YKRSFEAILWFTYRRDFPQMTPYDFTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPAL 100
Query: 155 ----LQKPFDREYVEILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
+ +YV +L F DS +SIH++++ G Y G W GP +
Sbjct: 101 FEAEIDARLPDKYVTLLRWFADSPDIECRYSIHHMVKLGMQYDKLPGEWYGPTTAAQVLR 160
Query: 209 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV-------FSKG 261
L R E G +A+YV ++G VV DD +R C ++
Sbjct: 161 DLVNLHRREFGG-----ELAMYV---PQEG------VVYTDDVTRLCFFDPLLHPPTAED 206
Query: 262 QADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
+DW T +L+L+PL LGL++VN RY+P L TF FPQS+GI+GGK G S Y VG Q++
Sbjct: 207 SSDWSTALLILIPLRLGLDQVNERYVPALEKTFAFPQSVGIIGGKKGHSVYFVGTQQDQL 266
Query: 321 IYLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
LDPHDV P + A T HS +++ IDPSLA+GF C ++ FE
Sbjct: 267 HLLDPHDVHPAPELNPAFPTATHLRTVHSSRPLVMNVTGIDPSLALGFLCDNRADYEDFE 326
>gi|301780424|ref|XP_002925628.1| PREDICTED: cysteine protease ATG4A-like [Ailuropoda melanoleuca]
Length = 429
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 160/327 (48%), Gaps = 52/327 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 60 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 108
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 109 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 168
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 169 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 211
Query: 250 DASRHCSVF--------------------SKGQ----ADWTPILLLVPLVLGLEKVNPRY 285
D + C V SKG W P+LL+VPL LG+ ++NP Y
Sbjct: 212 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 271
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D +
Sbjct: 272 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTF 331
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + + ++DPS+A+GF+C+++
Sbjct: 332 HCLQSPQRMSILNLDPSVALGFFCKEE 358
>gi|426257739|ref|XP_004022480.1| PREDICTED: cysteine protease ATG4A [Ovis aries]
Length = 398
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 162/327 (49%), Gaps = 52/327 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHC--------------------SVFSKGQA----DWTPILLLVPLVLGLEKVNPRY 285
D + C S SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRTLSLSADTPAERPLESLTASTQSKGPSACCTAWKPLLLIVPLRLGINQINPVY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D +
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +++ ++DPS+A+GF+C+++
Sbjct: 301 HCLQPPQRMNILNLDPSVALGFFCKEE 327
>gi|224510547|pdb|2ZZP|A Chain A, The Crystal Structure Of Human Atg4b(C74s)- Lc3(1-124)
Complex
Length = 357
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 160/326 (49%), Gaps = 40/326 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWG MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGSMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTE 329
>gi|91083193|ref|XP_972923.1| PREDICTED: similar to Autophagy-specific protein, putative
[Tribolium castaneum]
gi|270006970|gb|EFA03418.1| hypothetical protein TcasGA2_TC013405 [Tribolium castaneum]
Length = 366
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 107/290 (36%), Positives = 151/290 (52%), Gaps = 24/290 (8%)
Query: 92 NGLAEFN---QDFSSRILISYRKGFDPIG-DSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
N L E + QD S+I +YRK F PIG D +T+D GWGCMLR QM++AQAL+ L
Sbjct: 33 NALQELDTIRQDILSKIWFTYRKNFVPIGGDEGLTTDKGWGCMLRCGQMVLAQALVTLHL 92
Query: 148 GRPW-RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
GR W +P K D Y++IL F D +PFSIH + G + G W GP + +
Sbjct: 93 GRDWVWEPETK--DSTYLKILSKFVDKRQAPFSIHQIAMMGVSENKEVGQWFGPNTVAQV 150
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
+ L + +L + + E +C+ S CS DW
Sbjct: 151 LKKLVKYDEWSAIEMHIALDNTLIISDIRE---------LCLSQGSDGCS-----SGDWK 196
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+LL+VPL LGL+++NP Y L+ F F QSLG++GGKP + Y +G + IYLDPH
Sbjct: 197 PLLLIVPLRLGLQEINPIYASGLKKCFQFKQSLGVIGGKPNLALYFIGHVGDEVIYLDPH 256
Query: 327 DVQP---VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
Q V + ++ STYH I++ S+DPS+A+ F+C +G
Sbjct: 257 TTQKSGSVESKETEEEIELDSTYHCKYASRINILSMDPSVAVCFFCNTEG 306
>gi|225709006|gb|ACO10349.1| Cysteine protease ATG4B [Caligus rogercresseyi]
Length = 381
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 104/318 (32%), Positives = 163/318 (51%), Gaps = 44/318 (13%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
S S +W+LG + + + + E N + SR L +YRK F I DS TSD
Sbjct: 28 SDSPVWILG-----------NELSARDDVEELNSEVLSRFLFTYRKEFLEIEDSGYTSDS 76
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD----REYVEILHLFGDSETSPFSIH 181
GWGCMLR QM++A+AL LGR W+ Q+ D ++Y++IL LF DS+ +P+S+H
Sbjct: 77 GWGCMLRCGQMVLAEALQRVSLGREWKWSSQETLDNDQSQKYLQILKLFQDSKAAPYSLH 136
Query: 182 NLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
+ G++ G+W GP + + L + +ET + P+ ++V +
Sbjct: 137 QIALMGESIQSKKPVGTWFGPNTIA---QVLRKLSVSET-----TNPIRVHVAMDN---- 184
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
V +D+ C F + P+LL +PL LGL ++NP Y L+ F FPQ L
Sbjct: 185 -----TVIVDEIKESCG-FIGDPSQGKPLLLFIPLRLGLTEINPIYFQDLKECFEFPQIL 238
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPH-----DVQPVINIGKDDLEADTSTYHSDVIRHI 354
G++GG+P + Y +G + IYLDPH V+ +G ++ TYH+D +
Sbjct: 239 GVIGGRPNHALYFIGYMDNELIYLDPHVATQTSTPQVVTLGG----SEDKTYHTDRAYRM 294
Query: 355 HLDSIDPSLAIGFYCRDK 372
+DPSL++ F C+D+
Sbjct: 295 DFKDLDPSLSLCFLCKDE 312
>gi|195054945|ref|XP_001994383.1| GH16873 [Drosophila grimshawi]
gi|193892146|gb|EDV91012.1| GH16873 [Drosophila grimshawi]
Length = 673
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 110/318 (34%), Positives = 159/318 (50%), Gaps = 21/318 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + + + G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 253 AAENQVTECPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLIA 312
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S+ SPFSIH L++ G+ G
Sbjct: 313 QGLICHFLGRSWRYDPESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 372
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E
Sbjct: 373 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAQDCTIYMQDVEQQCSIPEPAPKQ 432
Query: 245 VVCIDDASRHCSVFSK----GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V A + S K Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 433 HVPWQHAKKSTSDAPKLDQPPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 492
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG QE+ I+LDPH Q ++++ ++ ++H R I +D
Sbjct: 493 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETFS--MHSFHCKSPRKIKSSKMD 550
Query: 361 PSLAIGFYCRDKGLLVTF 378
PS IGFYC K +F
Sbjct: 551 PSCCIGFYCATKTDFDSF 568
>gi|332375955|gb|AEE63118.1| unknown [Dendroctonus ponderosae]
Length = 370
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 120/343 (34%), Positives = 164/343 (47%), Gaps = 46/343 (13%)
Query: 44 TAGSMRRIHERVLGPSRT--GISSSTSDIWLLGV-CHKIAQDEALGDAAGNNGLAEFNQD 100
T M + E VL ++ I ST +WLLG H I N L QD
Sbjct: 5 TRDIMDCMFEAVLDSTQDPDDIPQSTEPVWLLGKKYHAI------------NELNTIRQD 52
Query: 101 FSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKP 158
S++ +YRK F PIG S TSD GWGCMLR QM++ QAL+ LGR W+ P +
Sbjct: 53 IVSKLWFTYRKDFVPIGGSDGKTSDKGWGCMLRCGQMVLGQALMSIHLGRDWQWNPTTR- 111
Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 218
D Y+ IL F DS +PFSIH + G + G G W GP + + + L +
Sbjct: 112 -DATYLSILKKFEDSRKAPFSIHQIASMGISEGKEVGQWFGPNTVAQVLKKLVKFDEGND 170
Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVP 273
+AI+V + VV I + C SK AD W P+LL+VP
Sbjct: 171 --------VAIHVALDN---------VVIISEIRDLC--LSKETADVSTPHWKPLLLIVP 211
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L LGL ++N Y+ L+ F F QSLGI+GGKP ++ Y +G IY DPH Q +
Sbjct: 212 LRLGLTQMNSIYLGGLKQCFQFKQSLGIIGGKPNSALYFIGYVGNEVIYFDPHTTQKAGS 271
Query: 334 IGKDDLEADTS---TYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
+G D + +YH + + +DPS+A+ F CR +
Sbjct: 272 VGNKDTSEEKDVDLSYHCKHASRMSMLGMDPSVAVCFLCRSEA 314
>gi|345329187|ref|XP_003431344.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4A-like
[Ornithorhynchus anatinus]
Length = 436
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 157/327 (48%), Gaps = 52/327 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 68 VWILGRQHHLKAEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 116
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W K EY +IL F D + +SIH + Q G
Sbjct: 117 MLRCGQMMLAQALICRHLGRDWCWEKHKKQPEEYHKILQCFLDRKDCCYSIHQMAQMGVG 176
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 177 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 219
Query: 250 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 285
D + C + +G A W P+LL+VPL LG+ +NP Y
Sbjct: 220 DIKKMCRLLPQGSGMAQDGPPLHLSALGRSKNASGYCAIWKPLLLIVPLRLGINHINPIY 279
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 280 IDAFKECFKTPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQTFVDTEENGQVDDHSF 339
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + + ++DPS+A+GF+C+++
Sbjct: 340 HCQQAPQRMKIMNLDPSVALGFFCKEE 366
>gi|440901286|gb|ELR52261.1| Cysteine protease ATG4B, partial [Bos grunniens mutus]
Length = 393
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 109/317 (34%), Positives = 153/317 (48%), Gaps = 26/317 (8%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V++ R
Sbjct: 129 AQMGVGEGKSVGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187
Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P + D+ RHC+ F A W P++LL+PL LGL VN Y TL+ F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307
Query: 354 IHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C
Sbjct: 308 MSIAELDPSIAVGFFCE 324
>gi|340722130|ref|XP_003399462.1| PREDICTED: cysteine protease ATG4D-like [Bombus terrestris]
Length = 485
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 151/305 (49%), Gaps = 27/305 (8%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
A+ + + + EF +DF+SR+ ++YR+ F + S TSD GWGCMLRS QM++AQAL+
Sbjct: 131 AMDAISFEDSIEEFKKDFTSRLWLTYRREFPILNGSTFTSDCGWGCMLRSGQMMLAQALV 190
Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
H LGR WR + +P E + I+ FGD TSPFSIH L+ G G
Sbjct: 191 CHFLGREWRWQVDQPLKTEQQKLDEYNHRLIIKSFGDLPDSTSPFSIHTLVSLGALSGKR 250
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
AG W GP ++ Q AE +L A+YV V + D
Sbjct: 251 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 298
Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
C + W ++L VPL LG +K+N Y L T +G++GG+P S Y +
Sbjct: 299 VCQM---PDGKWKSLILFVPLRLGADKLNLVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 355
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY DK
Sbjct: 356 GFQEDKLINLDPHYCQETVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHDKM 413
Query: 374 LLVTF 378
F
Sbjct: 414 QFTNF 418
>gi|148233205|ref|NP_001088025.1| cysteine protease ATG4B [Xenopus laevis]
gi|61211762|sp|Q640G7.1|ATG4B_XENLA RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|52221191|gb|AAH82660.1| LOC494717 protein [Xenopus laevis]
Length = 384
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 100/289 (34%), Positives = 144/289 (49%), Gaps = 36/289 (12%)
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
D +SR+ +YR+ F IG + TSD GWGCMLR QM+ AQAL+ +GR WR QKP
Sbjct: 44 NDITSRLWFTYRRNFQAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHVGRDWRWDKQKP 103
Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 218
EY+ IL F D + S +SIH + Q G G G W GP + + LA + +
Sbjct: 104 -KGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKYIGQWYGPNTVAQVLRKLAVFDQWSS 162
Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-------------- 264
+A+++ + V +D+ R C S +D
Sbjct: 163 --------IAVHIAMDN---------TVVVDEIRRLCRAGSGESSDAGALSNGYTGDSDP 205
Query: 265 ----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
W P++LL+PL LGL ++N YI TL+ F PQSLG++GG+P ++ Y +G +
Sbjct: 206 SCAQWKPLVLLIPLRLGLSEINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDEL 265
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
IYLDPH Q + D S + +H+ IDPS+A+GF+C
Sbjct: 266 IYLDPHTTQLSVEPSDCSFIEDESFHCQHPPCRMHVSEIDPSIAVGFFC 314
>gi|47564102|ref|NP_001001170.1| cysteine protease ATG4B [Bos taurus]
gi|61211780|sp|Q6PZ03.1|ATG4B_BOVIN RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related cysteine endopeptidase 2B;
Short=Autophagin-2B; AltName: Full=Autophagy-related
protein 4 homolog B; AltName: Full=bAut2B
gi|45861660|gb|AAS78583.1| Aut2b2 [Bos taurus]
Length = 393
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 109/317 (34%), Positives = 153/317 (48%), Gaps = 26/317 (8%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V++ R
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187
Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P + D+ RHC+ F A W P++LL+PL LGL VN Y TL+ F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307
Query: 354 IHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C
Sbjct: 308 MSIAELDPSIAVGFFCE 324
>gi|296488734|tpg|DAA30847.1| TPA: cysteine protease ATG4B [Bos taurus]
Length = 390
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 109/317 (34%), Positives = 153/317 (48%), Gaps = 26/317 (8%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V++ R
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187
Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P + D+ RHC+ F A W P++LL+PL LGL VN Y TL+ F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307
Query: 354 IHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C
Sbjct: 308 MSIAELDPSIAVGFFCE 324
>gi|213626921|gb|AAI70397.1| APG4A protein [Xenopus laevis]
Length = 395
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 97/293 (33%), Positives = 147/293 (50%), Gaps = 23/293 (7%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 43 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALICQHLGRDWQWE 102
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 262
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213
Query: 263 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
IYLDPH Q ++ + D + + + + +DPS+A+GF+C+D+
Sbjct: 274 IYLDPHTTQTFVDTEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDEN 326
>gi|187282046|ref|NP_001119770.1| uncharacterized protein LOC678769 [Rattus norvegicus]
gi|169642267|gb|AAI60890.1| LOC678769 protein [Rattus norvegicus]
Length = 406
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 103/335 (30%), Positives = 160/335 (47%), Gaps = 60/335 (17%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 284
D + C V G AD W P+LL+VPL LG+ ++NP
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
YI + F PQSLG +GGKP + Y +G + I+LDPH Q ++ + L D +
Sbjct: 241 YIEAFKECFKMPQSLGALGGKPNNAYYFIGSLGDELIFLDPHTTQTFVDTEESGLVDDHT 300
Query: 345 TYHSDVIRHIHLDSIDPSLAI-------GFYCRDK 372
+ + + + ++DPS+A+ GF+C+++
Sbjct: 301 FHCLQSPQRMSILNLDPSVALVGQGAFMGFFCKEE 335
>gi|18181958|dbj|BAB83890.1| Apg4B [Homo sapiens]
Length = 392
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 160/324 (49%), Gaps = 41/324 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEK-SIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 179
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 180 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 239
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 240 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 299
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
+ + ++DPS+A+GF+C+
Sbjct: 300 CQHPPCRMSIANLDPSIAVGFFCK 323
>gi|50417810|gb|AAH78135.1| APG4A protein, partial [Xenopus laevis]
Length = 392
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 97/293 (33%), Positives = 146/293 (49%), Gaps = 23/293 (7%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 40 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 99
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 100 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 159
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 262
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 160 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 210
Query: 263 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 211 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 270
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
IYLDPH Q + + D + + + + +DPS+A+GF+C+D+
Sbjct: 271 IYLDPHTTQTFVETEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDEN 323
>gi|328769729|gb|EGF79772.1| hypothetical protein BATDEDRAFT_35298 [Batrachochytrium
dendrobatidis JAM81]
Length = 441
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 107/283 (37%), Positives = 150/283 (53%), Gaps = 30/283 (10%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
F DF SR+ ++YRKGF I + T D GWGCMLRS QMLVA ALLFH LGR WR L
Sbjct: 137 HFLDDFHSRLWMTYRKGFAAIKPTGYTCDSGWGCMLRSGQMLVANALLFHELGRDWR--L 194
Query: 156 QKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 211
DR+ Y IL F D TSP+SI + G + G W GP + + + L
Sbjct: 195 GDSNDRDTWLTYCSILTKFLDVNTSPYSIQRIATLGIRFDKQIGEWFGPSTISQVLKVLV 254
Query: 212 RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILL 270
Q + + ++V DG + I A+R G+ TP +L+
Sbjct: 255 NDD--------QRISLKVHV---SNDGVVYKNEINTILSATR-----DDGK---TPAVLI 295
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
++PL LG+E +NP Y P ++ F +GI GG+P +S + +GV + IYLDPH ++P
Sbjct: 296 MIPLRLGVETMNPVYYPGVKHCFAMSHCVGIAGGRPNSSLFFLGVDGDHLIYLDPHHLRP 355
Query: 331 VI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
+ +I +E D +YH + +R + + S+DPSL IGFYC
Sbjct: 356 SVDSRDITSYKME-DLLSYHCEKVRLLPIASMDPSLVIGFYCH 397
>gi|163914473|ref|NP_001106295.1| APG4A protein [Xenopus laevis]
gi|161611704|gb|AAI55873.1| APG4A protein [Xenopus laevis]
Length = 395
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 97/293 (33%), Positives = 146/293 (49%), Gaps = 23/293 (7%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 43 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 102
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 262
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213
Query: 263 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
IYLDPH Q + + D + + + + +DPS+A+GF+C+D+
Sbjct: 274 IYLDPHTTQTFVETEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDEN 326
>gi|301104974|ref|XP_002901571.1| cysteine protease family C54, putative [Phytophthora infestans
T30-4]
gi|262100575|gb|EEY58627.1| cysteine protease family C54, putative [Phytophthora infestans
T30-4]
Length = 392
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 114/329 (34%), Positives = 165/329 (50%), Gaps = 26/329 (7%)
Query: 61 TGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK 120
T ++ ++ +WLLG K D A D + + F S + +YR+ + + +
Sbjct: 14 TPSAALSAPVWLLG---KRYDDVAAVD------FDAYKRSFESILWFTYRRDYPAMTPYE 64
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGDSE 174
TSD GWGCMLRS+QML+ QAL LGR WR P + YV++L F DS
Sbjct: 65 HTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPALFETEIDARLPETYVQLLRWFADSP 124
Query: 175 TSP--FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
+SIH +++ G Y G W GP + L R E G VV
Sbjct: 125 DVECRYSIHQMVKLGVQYDKLPGEWYGPTTAAQVLRDLVNLHRREFGGELSMYVPQEGVV 184
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADW-TPILLLVPLVLGLEKVNPRYIPTLRL 291
D+ + +C D H ++ ++DW T +L+L+PL LGL++VN RY+P ++
Sbjct: 185 YSDDVAK------LCFFDPLLHPPT-TEDKSDWSTALLILIPLRLGLDQVNERYVPAIQK 237
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA-DTSTYHSDV 350
+F FPQS+GI+GGK G S Y VG Q++ LDPHDV P + A T HS
Sbjct: 238 SFAFPQSVGIIGGKKGHSVYFVGTQQDQLHLLDPHDVHPAPELNTAFPTATHLRTVHSSR 297
Query: 351 IRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+++ +IDPSLA+GF C ++ FE
Sbjct: 298 PLVMNVTTIDPSLALGFLCENRVDYEDFE 326
>gi|325184648|emb|CCA19140.1| cysteine protease family C54 putative [Albugo laibachii Nc14]
Length = 459
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 118/351 (33%), Positives = 173/351 (49%), Gaps = 48/351 (13%)
Query: 62 GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI 121
S ++S +WLLG C+ QD D+ + ++ F S + +YR+ F+ +
Sbjct: 66 NTSQNSSKLWLLGDCYS-PQDFDNFDSMKD----AYHDAFESILWYTYRRDFETMVPYDF 120
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----LQKPFDREYVEILHLFGDSETS 176
TSD GWGCMLRS+QML+++A + LG W+ P L+ P + YV++L F DS +
Sbjct: 121 TSDAGWGCMLRSAQMLLSEAFKRNMLGIKWKIPARSEDLELP--KVYVKLLKWFVDSFDT 178
Query: 177 --PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
+SIHN+ + G Y G W GP A+ R L Q P V+
Sbjct: 179 ECKYSIHNITRIGMQYDKLPGEWYGP-------TTAAQALRDLVNLHAQESPECNLVMYV 231
Query: 235 DEDGERGGAPV--VCI---DDASRHCSVFSKGQADWT---------------------PI 268
+DG V +CI D + +V + Q+D T +
Sbjct: 232 PQDGVVYTKDVNELCISHLDQENTFVNVNEETQSDGTFPDPLLHPPTDRDNSEKMWQKSL 291
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
L+L+PL LGL+ +NPRY+P ++ F FPQ++GI+GGK G S Y VG + LDPHD+
Sbjct: 292 LILIPLRLGLDSINPRYLPAIQRVFEFPQNVGIIGGKKGHSVYFVGTFDSKLQLLDPHDI 351
Query: 329 QPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
P ++ A T HS + + L SIDPSLA+GFYC D+ + F
Sbjct: 352 HPTADLNTAFPTATHLRTVHSRLPLEMSLGSIDPSLALGFYCSDRKDYLDF 402
>gi|390365223|ref|XP_785967.3| PREDICTED: cysteine protease ATG4B-like [Strongylocentrotus
purpuratus]
Length = 390
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 164/343 (47%), Gaps = 50/343 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
IW+LG + ++Q + E D SR+ +YRKGF IG + T+D GWGC
Sbjct: 48 IWILGKKYDLSQHQL-----------EARLDVLSRLWFTYRKGFSNIGGTGPTTDQGWGC 96
Query: 130 MLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
MLR QM++AQAL++ LGR WR +P ++ D Y++IL LF D + S FSIH + Q G
Sbjct: 97 MLRCGQMMLAQALVYKHLGRDWRWRPQEQ--DETYLKILQLFLDKKDSCFSIHQIAQMGV 154
Query: 189 AYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
G G W GP + + SW LA + + + + V S E+
Sbjct: 155 GEGKKVGDWFGPNTVGQVIRKLSPFDSWSDLAVHVALDNTVVIEDIRKLCTVNSTTEETS 214
Query: 240 RGGAPV--------------------------VCIDDASRHCSVFSKGQADWTPILLLVP 273
G+ + + + + S G W + L++P
Sbjct: 215 SEGSKTGSERRKRTSSSENIRHKMQLSPENTNIQLPNGLMEGACVSPGGVSWRSLFLIIP 274
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L LGL ++N Y+ L+ FT PQSLG++GGKP + Y +GV + +YLDPH QP +
Sbjct: 275 LRLGLNEINTVYMQRLKRCFTLPQSLGVIGGKPNHAHYFIGVLGDEMVYLDPHTTQPAAD 334
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLV 376
I K D S +H + + + ++DPS+ + + KGL V
Sbjct: 335 IDKWAFLQDES-FHCEHASRMPIKNLDPSIGLVSTKKKKGLQV 376
>gi|296804856|ref|XP_002843276.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
gi|238845878|gb|EEQ35540.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
Length = 473
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 112/306 (36%), Positives = 156/306 (50%), Gaps = 47/306 (15%)
Query: 97 FNQDFSSRILISYRKGFDPI----GDSK------------------ITSDVGWGCMLRSS 134
F DF SR+ I+YR F PI G S TSD GWGCM+RS
Sbjct: 138 FLDDFESRLWITYRSHFPPIPKTGGSSSSSMPLGVRLRSQLIDTQGFTSDTGWGCMIRSG 197
Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLA 193
Q L+A LLF RLGR WR+ Q ++E E+L LF D +PFSIH +Q G A G
Sbjct: 198 QSLLANTLLFLRLGRGWRRGSQ---EQEESELLSLFADHPRAPFSIHRFVQHGATACGKC 254
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCIDDAS 252
G W GP A + +ALA G + +Y+ S G + ER + C
Sbjct: 255 PGEWFGPAAAAQCIQALAN--------GHPQAGLNVYITSDGSDIYERQFREIACR---- 302
Query: 253 RHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ G+ D P L+L+ + LG+++V P Y +L+ FPQS+GI GG+P +S Y
Sbjct: 303 ---GLGEDGEDDSIKPTLILLGVRLGIDRVTPVYWESLKEVIRFPQSVGIAGGRPSSSHY 359
Query: 312 IVGVQEESAIYLDPHDVQPVI---NIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+ Q ++ YLDPH +P + G+D + STYH+ +R +H+ +DPS+ IGF
Sbjct: 360 FIATQGDTFFYLDPHQTRPSLPPRTAGEDVYSPGELSTYHTRRLRRLHIREMDPSMLIGF 419
Query: 368 YCRDKG 373
RD+G
Sbjct: 420 LVRDEG 425
>gi|321472665|gb|EFX83634.1| hypothetical protein DAPPUDRAFT_194862 [Daphnia pulex]
Length = 389
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 115/336 (34%), Positives = 168/336 (50%), Gaps = 34/336 (10%)
Query: 38 TVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEF 97
KR++ A +E + R G + +W+LG + L E
Sbjct: 21 NTKRMLEACEAFVTYESGIILERQGFEVNDEPVWILG-----------REYDTKTKLDEL 69
Query: 98 NQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--WRKPL 155
N D SR+L++YR+ F PIGDS +TSD GWGCMLR QM+VAQAL+ LGR W
Sbjct: 70 NSDVKSRLLLTYRRNFPPIGDSGMTSDRGWGCMLRCGQMVVAQALINQHLGRQPFWPVGD 129
Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 215
+ Y +IL LF D +T+ +SIH L Q G + G G W GP + + + L+
Sbjct: 130 DQRTTESYKKILKLFEDKKTAVYSIHQLAQMGVSEGKEIGQWFGPNTVAQVLKKLSEYDE 189
Query: 216 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC--SVFSKGQADWTPILLLVP 273
+ I+V + V I++ + C + + W+P+LL+VP
Sbjct: 190 WSA--------LKIHVAMDN---------AVVIEEIEQLCHKKITPTETSTWSPLLLVVP 232
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L LGL +NP YI +L+ PQS+G++GGKP + Y +G + ++LDPH Q I+
Sbjct: 233 LRLGLLNINPIYIDSLKACLQMPQSIGMIGGKPSQALYFIGYVGDDVVFLDPHLTQNAID 292
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+ +D E D S+YH I S+DPSLA+ F C
Sbjct: 293 LDED--EFDDSSYHPATCARISFQSMDPSLAVCFSC 326
>gi|357612380|gb|EHJ67950.1| autophagy related protein Atg4-like protein [Danaus plexippus]
Length = 354
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 107/304 (35%), Positives = 154/304 (50%), Gaps = 44/304 (14%)
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
G+ F DF S+I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR
Sbjct: 8 GIEGFKSDFISKIWMTYRREFPTMSGSSFTTDCGWGCMLRSGQMMLAQALVCHFLGRSWR 67
Query: 153 ---KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 201
KP+Q RE+ E I+ FGD S SP SIH ++ G+A G G W GP
Sbjct: 68 WSEKPIQN--GREFQEDCLHRMIIKWFGDKSSVNSPLSIHQMVTLGEALGKKPGDWYGP- 124
Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV-------VCIDDASRH 254
++A C + ++ V + E+ E V + I D H
Sbjct: 125 ------ASVAHCLK------------SVMVEASKENYEFDKLEVYVAQDSTIYIQDVYTH 166
Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
C + W ++LLVP+ LG E++NP Y P L T +GI+GG+P S Y VG
Sbjct: 167 CRL---PNGCWKSLILLVPVKLGTERLNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVG 223
Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGL 374
Q++ I+LDPH Q ++++ + + T+H R + + +DPS IGFY +
Sbjct: 224 YQDDRLIHLDPHYCQEMVDVWQPNFSLQ--TFHCRSPRKMPISKMDPSCCIGFYLQTHHD 281
Query: 375 LVTF 378
TF
Sbjct: 282 FETF 285
>gi|118404310|ref|NP_001072464.1| autophagy related 4B, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|115291929|gb|AAI21871.1| cysteine endopeptidase AUT-like (1O128) [Xenopus (Silurana)
tropicalis]
Length = 384
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 99/292 (33%), Positives = 144/292 (49%), Gaps = 36/292 (12%)
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
D +SR+ +YR+ F IG + TSD GWGCMLR QM+ AQALL +GR WR QK
Sbjct: 44 NDITSRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHIGRDWRWDKQKS 103
Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 218
EY+ IL F D + S +SIH + Q G G G W GP + + LA + +
Sbjct: 104 -QGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKCIGQWYGPNTVAQVLRKLAVFDQWSS 162
Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-------------- 264
+A+++ + V +D+ R C + ++
Sbjct: 163 --------IAVHIAMDN---------TVVMDEIRRLCRAGTNESSEAGALCNGYTGVSDP 205
Query: 265 ----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
W P++LL+PL LGL +N YI TL+ F PQSLG++GG+P ++ Y +G +
Sbjct: 206 SCSLWKPLVLLIPLRLGLSDINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDEL 265
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
IYLDPH Q + D S + +H+ IDPS+A+GF+CR +
Sbjct: 266 IYLDPHTTQLAVEPSDCCFVEDESFHCQHPPCRMHVSEIDPSIAVGFFCRSQ 317
>gi|395528686|ref|XP_003766458.1| PREDICTED: cysteine protease ATG4B [Sarcophilus harrisii]
Length = 393
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 99/315 (31%), Positives = 153/315 (48%), Gaps = 20/315 (6%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+T +W+LG + I ++ E D +SR+ +YRK F IG + TSD
Sbjct: 21 ETTEPVWILGRKYTIFTEKE-----------EILSDVTSRLWFTYRKNFPAIGGTGPTSD 69
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLL 184
GWGCMLR QM+ AQAL+ LGR WR + Y +L+ F D + S +SIH +
Sbjct: 70 TGWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQIA 129
Query: 185 QAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
Q G G + G W GP A+ +W +LA + + + +
Sbjct: 130 QMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCKAGFPC 189
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
DG + + + + + W P++LL+PL LGL +N Y TL+ F
Sbjct: 190 ADGAAFPTDSELLSNGYPPAAEVTDRASPWRPLVLLIPLRLGLTDINEAYTETLKHCFMM 249
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG++GGKP ++ Y +G E IYLDPH QP + + + D + + ++
Sbjct: 250 PQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVESTEGGVFPDETFHCQHPPCRMN 309
Query: 356 LDSIDPSLAIGFYCR 370
+ +DPS+A+GF+C+
Sbjct: 310 IGELDPSIAVGFFCK 324
>gi|345307034|ref|XP_001513122.2| PREDICTED: cysteine protease ATG4B-like [Ornithorhynchus anatinus]
Length = 461
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 102/324 (31%), Positives = 154/324 (47%), Gaps = 37/324 (11%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
+T +W+LG + I ++ + D +SR+ +YRK F IG + TSD
Sbjct: 91 TTEPVWILGRKYTIFTEKE-----------DILSDVTSRLWFTYRKNFPAIGGTGPTSDT 139
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM+ AQALL LGR WR + Y +L+ F D + S +SIH + Q
Sbjct: 140 GWGCMLRCGQMIFAQALLCRHLGRDWRWKKGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQ 199
Query: 186 AGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQ--------SLPMA 228
G G + G W GP + + +W +LA + + + + P
Sbjct: 200 MGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSSLAVHIAMDNTVVIEEIRRLCKPNFPAG 259
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
D + G P + + W P++LL+PL LGL ++N YI T
Sbjct: 260 ASAFPTDSEFLLNGFP---------SGAEVTNRPTQWKPLVLLIPLRLGLTEINEAYIET 310
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L+ F PQSLG++GGKP ++ Y +G IYLDPH QP + I D S +
Sbjct: 311 LKHCFMMPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQPAVEISGSCFIPDESFHCQ 370
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDK 372
+++ +DPS+A+GF+C+ +
Sbjct: 371 HPPCRMNIVELDPSIAVGFFCKTE 394
>gi|339249735|ref|XP_003373855.1| cysteine protease ATG4B [Trichinella spiralis]
gi|316969943|gb|EFV53966.1| cysteine protease ATG4B [Trichinella spiralis]
Length = 410
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 111/334 (33%), Positives = 160/334 (47%), Gaps = 56/334 (16%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
S ++W++G ++ Q + D ++ SR+ +YRK F PIG + SD
Sbjct: 30 KSGGEVWIVG---RVWQTQDFDD---------IKKEIRSRMWFTYRKSFSPIGGTGPISD 77
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
GWGCMLR QML+AQAL+ LGR W+ P + D YV IL +F D + +SIH +
Sbjct: 78 SGWGCMLRCGQMLLAQALICRHLGREWQWSPSCR--DEAYVRILRMFQDKKNELYSIHMI 135
Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALA----------------RCQR--- 215
+ G++ G G W GP A+ W +LA C R
Sbjct: 136 AKMGESEGKEIGKWFGPSTIAHVIKKLAIYDDWSSLAVHVAMDNVIVQEDVKKLCSREVF 195
Query: 216 -AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 274
A Q P I V ED + V C + +S W P+LL++P+
Sbjct: 196 DALRKRLLQEEPSEI-VADWFEDARKDNKKVDCANLSS-----------PWKPLLLILPM 243
Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
LGL ++NP YIP L+ F ++G++GGKP + Y +G ++ +YLDPH Q +++
Sbjct: 244 RLGLSELNPCYIPALKEFFACKYNIGMIGGKPNHALYFIGAYKDRLVYLDPHWCQTFVDL 303
Query: 335 GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
D S+YHS I I + IDPSLAI FY
Sbjct: 304 DVSMDLFDDSSYHSAFILDISFNEIDPSLAIAFY 337
>gi|195401363|ref|XP_002059283.1| GJ16311 [Drosophila virilis]
gi|194156157|gb|EDW71341.1| GJ16311 [Drosophila virilis]
Length = 397
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 102/340 (30%), Positives = 162/340 (47%), Gaps = 47/340 (13%)
Query: 48 MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 98
M + E LGP I +D+WLLG + Q+ L
Sbjct: 13 MDSVFEAYLGPDSMLAGAVGEPEDIPKRNTDVWLLGKRYNAIQELEL-----------IR 61
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
+D SR+ +YR GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W P
Sbjct: 62 RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTP 118
Query: 159 --FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
D Y++I++ F D+ S +SIH + G++ A G W+GP + + + L R
Sbjct: 119 DCRDATYLKIVNRFEDTRKSFYSIHQIALTGESQNKAVGEWLGPNTVAQILKILVRFDDW 178
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
+ + ++V V +D+ C S + W P+LL+VPL L
Sbjct: 179 SS--------LVVHVAMDS---------TVVLDEIYTRCQEVSA--STWKPLLLIVPLRL 219
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G+ +NP YIP L+ S G++GG+P + Y +G ++ +YLDPH Q ++ +
Sbjct: 220 GISDINPMYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGSVAQ 279
Query: 337 DDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
A+ +YH + ++DPSLA+ F C+ +
Sbjct: 280 KTTAAEQELDESYHQKYAARLSFGAMDPSLAVCFLCKTRN 319
>gi|154300262|ref|XP_001550547.1| hypothetical protein BC1G_11320 [Botryotinia fuckeliana B05.10]
gi|166990615|sp|A6SDQ3.1|ATG4_BOTFB RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|347841273|emb|CCD55845.1| similar to cysteine protease atg4 [Botryotinia fuckeliana]
Length = 439
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 153/305 (50%), Gaps = 51/305 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF ++I ++YR F I S+ TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A ALL R+GR WR+ + +R+ IL LF D +P+SIH ++ G A G
Sbjct: 163 GQSLLANALLTLRMGREWRRGVSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +AL+ Q + +Y+ +GD G+ V
Sbjct: 220 HPGEWFGPSATARCIQALSNSQAKSE--------LRVYI-TGD------GSDVY----ED 260
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+ S+ +D+TP L+LV LGL+K+ P Y L+ + PQS+GI GG+P +S Y
Sbjct: 261 KFMSIAKPNHSDFTPTLILVGTRLGLDKITPVYWEALKYSLQMPQSVGIAGGRPSSSHYF 320
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLE----ADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+GVQE YLDPH +P + KD++E D + H+ +R +H+ +DPS+ I F
Sbjct: 321 IGVQESDFFYLDPHQTRPALPY-KDNVEDYTTEDIDSCHTRRLRRLHIKEMDPSMLIAFL 379
Query: 369 CRDKG 373
RD+
Sbjct: 380 IRDEN 384
>gi|322785465|gb|EFZ12136.1| hypothetical protein SINV_15051 [Solenopsis invicta]
Length = 505
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 119/362 (32%), Positives = 170/362 (46%), Gaps = 66/362 (18%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
S S +WLLG C+ + L A+ N + EF +DF SR
Sbjct: 80 SKESPVWLLGQCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 139
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR-PWR-KPLQKPFDRE 162
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P Q +
Sbjct: 140 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRGQWRWRPEQLTDESS 199
Query: 163 YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRA 216
+ I+ FGD T SPFSIH L+ G + G AG W GP + +C++ E RA
Sbjct: 200 HRMIIKWFGDQLTPESPFSIHKLVVLGASTGKRAGDWYGPSSVAHLLCQAME------RA 253
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
+ +A+YV + V C D R ++LLVPL L
Sbjct: 254 SEDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGRRKA------------LILLVPLRL 301
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ------- 329
G +K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q
Sbjct: 302 GADKLNPVYAPCLTALLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQNEFYFRI 361
Query: 330 --------PVINIGKD-DLEADT----STYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLV 376
P + I + D+E + +++H R + L +DPS +GFY DK L
Sbjct: 362 LLSITDSLPYLFIQETVDVEGNEKFPLTSFHCTSPRKMLLSKMDPSCCVGFYFPDKESLT 421
Query: 377 TF 378
F
Sbjct: 422 DF 423
>gi|357620505|gb|EHJ72670.1| putative Autophagy-specific protein [Danaus plexippus]
Length = 383
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 103/317 (32%), Positives = 159/317 (50%), Gaps = 29/317 (9%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + ++W+LG + QD L +D +S I +YRKGF PIGD +T
Sbjct: 22 IPETKDNVWVLGKKYSAIQD-----------LERIRRDITSVIWCTYRKGFVPIGDEGLT 70
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 180
SD GWGCMLR QM++ AL+ L W + P R+ Y++I+ + + +P+SI
Sbjct: 71 SDKGWGCMLRCGQMVLGVALIKVHLSADW---VWTPETRDPTYLKIVQRLEERKQAPYSI 127
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + G G G W GP + + + L + + + I+V + +
Sbjct: 128 HQVALMGACEGKEVGQWFGPNTVAQVLKKLVVYDKWSS--------LVIHVALDNTVVKE 179
Query: 241 GGAPVVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+++ CS G +DW P+LL+VPL LGL ++NP Y+ L++ F PQS
Sbjct: 180 DILQQCIVNNDRGDCSENVDGFVVSDWMPLLLIVPLRLGLSEINPIYMEGLKICFQSPQS 239
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQP---VINIGKDDLEADTSTYHSDVIRHIH 355
+G++GGKP + Y++G + IYLDPH Q V N D+ + TYH I
Sbjct: 240 IGVIGGKPNQALYLIGCVGDEVIYLDPHTTQKSGLVENKLTDEQKEMDCTYHCKYASRIP 299
Query: 356 LDSIDPSLAIGFYCRDK 372
+ S+DPS+A+ F CR +
Sbjct: 300 ILSMDPSVAVCFLCRTR 316
>gi|213390042|gb|ACJ46060.1| autophagy related protein Atg4-like protein [Bombyx mori]
Length = 355
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 105/297 (35%), Positives = 149/297 (50%), Gaps = 27/297 (9%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
G+ F DF S+I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR W
Sbjct: 15 EGIEGFKSDFVSKIWMTYRREFPTMTGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRSW 74
Query: 152 RKPLQKPFD--REYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 201
R +KP RE+ E I+ FGD S SP SIH ++ G+A G G W GP
Sbjct: 75 RWLPEKPIQNAREFQEDCLHRKIIKWFGDKSSVNSPLSIHQMVSLGEALGKKPGDWYGPA 134
Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 261
++ ++L E + + +YV V I D C +
Sbjct: 135 SVAHCLKSLIASASKENY---EFDHLEVYVAQDS---------TVYIQDIYSMCQLL--- 179
Query: 262 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
W ++LLVP+ LG EK NP Y P L T +GI+GG+P S Y VG Q++ I
Sbjct: 180 HGAWKSLILLVPVKLGTEKFNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVGYQDDKLI 239
Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+LDPH Q ++++ + + ++H R + L +DPS IGFY + TF
Sbjct: 240 HLDPHYCQEMVDVWQPNFS--LQSFHCRSPRKMPLAKMDPSCCIGFYLGTQHDFETF 294
>gi|119591686|gb|EAW71280.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_c
[Homo sapiens]
Length = 354
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 155/319 (48%), Gaps = 40/319 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAI 365
+ + +DPS+A+
Sbjct: 301 CQHPPCRMSIAELDPSIAV 319
>gi|47212536|emb|CAF90552.1| unnamed protein product [Tetraodon nigroviridis]
Length = 366
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 99/271 (36%), Positives = 137/271 (50%), Gaps = 41/271 (15%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D +SR+ +YRKGF PIG + TSD GWGCMLR QM++ QAL+ LGR WR +
Sbjct: 68 DVTSRLWFTYRKGFPPIGGTGPTSDTGWGCMLRCGQMILGQALMCRHLGRDWRWVSGEEQ 127
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
EYV IL+ F D + S +SIH + + +C W A A G
Sbjct: 128 RHEYVNILNAFIDKKDSYYSIHQIER-----------------LCMPWLDKAEACAASEG 170
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
+G + +G GA C+ + A W P++LL+PL LGL
Sbjct: 171 VG-------------ELNGYLEGA-----------CAFSEEETALWKPLVLLIPLRLGLT 206
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH Q ++ +D
Sbjct: 207 DINEAYIETLKKCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQTAVDPCEDGT 266
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
D S + +H+ +DPS+A GF+CR
Sbjct: 267 FTDDSYHCQHPPCRMHICELDPSIAAGFFCR 297
>gi|332266032|ref|XP_003282019.1| PREDICTED: cysteine protease ATG4B [Nomascus leucogenys]
Length = 518
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 155/319 (48%), Gaps = 40/319 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 145 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 191
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 192 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 251
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 252 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 303
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 304 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 363
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 364 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 423
Query: 347 HSDVIRHIHLDSIDPSLAI 365
+ + +DPS+A+
Sbjct: 424 CQHPPCRMSIAELDPSIAV 442
>gi|170032510|ref|XP_001844124.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167872594|gb|EDS35977.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 628
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 114/345 (33%), Positives = 166/345 (48%), Gaps = 59/345 (17%)
Query: 91 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
+ G+ F +DF SR+ ++YRK F + DS TSD GWGCM+RS QML+AQ L+ H LGR
Sbjct: 188 DEGIEAFKRDFISRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLITHFLGRG 247
Query: 151 WR-----KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSW 197
WR + L+ FD E I+ FGD S TSPFSIH L+ GK G G W
Sbjct: 248 WRWDPSQEGLRLNFDSLQYEDGIHRKIIRWFGDTSSRTSPFSIHTLVALGKEAGKKPGDW 307
Query: 198 VGPYAMCRSW-EALARCQRAETGLGCQSLPM----AIYVVSGDEDGERGGAPVV------ 246
GP ++ +A+ + T L ++ + A+Y+ ++ P V
Sbjct: 308 YGPGSVAHLLRQAVKLAAKEITDLDGINVYVAQDCAVYIQDILDECTVSTTPSVAPWQKK 367
Query: 247 ------CIDDASR------------------HCSVF---------SKGQADWTPILLLVP 273
C D S+ H + F S + W ++LLVP
Sbjct: 368 MSSAAACTDSPSQATTPRVGATASCSSSSSPHATGFVAPSDTADESAPGSHWKSLILLVP 427
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L LG EK+NP Y L+ + +GI+GG+P S + VG QE+ I+LDPH Q +++
Sbjct: 428 LRLGTEKLNPIYNDCLKAMLSLDNCIGIIGGRPKHSLFFVGYQEDKLIHLDPHYCQDMVD 487
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+ +++ S++H R + L +DPS IGFYC + F
Sbjct: 488 VNQENFPV--SSFHCKSPRKMKLSKMDPSCCIGFYCATRKDFFKF 530
>gi|195444549|ref|XP_002069918.1| GK11310 [Drosophila willistoni]
gi|194166003|gb|EDW80904.1| GK11310 [Drosophila willistoni]
Length = 676
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 153/322 (47%), Gaps = 43/322 (13%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 258 AVENQVGETPWEEGIEGFRRDFYSRLWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 317
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L+ G A G
Sbjct: 318 QGLIVHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVSLGTALGK 377
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP ++ L T +++YV + I D
Sbjct: 378 KPGDWYGPASVSY---LLKHALEHATQENADFDNISVYVAKD---------CTIYIQDIE 425
Query: 253 RHCSV----------------------FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
CS+ Q W +++L+PL LG +KVNP Y L+
Sbjct: 426 DQCSIPEPAPKQTHVPWQQMKRPSLNEHQPDQQHWKSVIILIPLRLGTDKVNPAYAHCLK 485
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
L + LGI+GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H
Sbjct: 486 LLLSTENCLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQENF--SMQSFHCKS 543
Query: 351 IRHIHLDSIDPSLAIGFYCRDK 372
R I +DPS IGFYC K
Sbjct: 544 PRKIKTSKMDPSCCIGFYCATK 565
>gi|126338580|ref|XP_001366892.1| PREDICTED: cysteine protease ATG4B-like [Monodelphis domestica]
Length = 396
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 105/335 (31%), Positives = 157/335 (46%), Gaps = 58/335 (17%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
+T +W+LG + I +DE L D +SR+ +YRK F IG + TS
Sbjct: 25 TTDPVWILGRKYTIFTEKDEILSDV-------------TSRLWFTYRKNFPAIGGTGPTS 71
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR + Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQI 131
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + +A+++ +
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDN-------- 175
Query: 244 PVVCIDDASRHCSV-FSKGQA-------------------------DWTPILLLVPLVLG 277
V ++D R C FS A W P++LL+PL LG
Sbjct: 176 -TVVMEDIRRLCKANFSHTDAAALPPDSDLLSNGYPPGAEVTDRLSQWRPLVLLIPLRLG 234
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
L +N Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH Q + +
Sbjct: 235 LTDINEAYTETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQAAVELSNG 294
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ D S + +++ +DPS+A+GF+C+ +
Sbjct: 295 GVIPDESFHCQHPPCRMNIGELDPSIAVGFFCKSE 329
>gi|449666316|ref|XP_002168183.2| PREDICTED: cysteine protease ATG4B-like [Hydra magnipapillata]
Length = 436
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 155/333 (46%), Gaps = 44/333 (13%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG K +D + +FN + ++ +YR+ F PIG + SD GWGC
Sbjct: 31 VWILGKHFKPDED-----------MEKFNAEILTKFWFTYRRNFHPIGGTGPMSDTGWGC 79
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQALL LGR W + + Y+ ILH F D + S +SIH + Q G
Sbjct: 80 MLRCGQMMLAQALLCRHLGRDWDWRSGRKDNEIYMMILHSFLDKKDSLYSIHQIAQMGVG 139
Query: 190 YGLAAGSWVGPYAMCRSWEALA-------------------------RCQRAETGLGCQS 224
G G W GP + + + L C+ + GC
Sbjct: 140 EGKQIGQWFGPNTVAQVIKKLVLFDDNADMAVHVAMDNTVVIEDIKKLCKSSINAWGCYG 199
Query: 225 LPMAIYVVSGDEDGERGGAPVVCIDDASRHC-------SVFSKGQADWTPILLLVPLVLG 277
I+ S + P C ++S+ S S+ W P+LL +PL LG
Sbjct: 200 ECSYIHDRSSLTGNQSVSKPPHCSCESSQKLKSNRKLKSFNSEELQSWRPLLLFIPLRLG 259
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
L ++N Y +L++ FT QSLG++GGKP + Y +G + +YLDPH Q I +
Sbjct: 260 LSEINSDYYNSLKIMFTLRQSLGVIGGKPNHAHYFIGFNGDRLLYLDPHTTQQTIEPERF 319
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
++ D S +H + S+DPS+A+GFYC
Sbjct: 320 NVIPDES-FHCVYPCFMSFQSLDPSVALGFYCH 351
>gi|241999098|ref|XP_002434192.1| cystein protease, putative [Ixodes scapularis]
gi|215495951|gb|EEC05592.1| cystein protease, putative [Ixodes scapularis]
Length = 382
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 100/295 (33%), Positives = 151/295 (51%), Gaps = 41/295 (13%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
L + D +S+I ++YRK F IG + TSD GWGCMLR QM++AQAL+ LGR WR
Sbjct: 35 LDDLRSDVTSKIWLTYRKNFPAIGGTGPTSDSGWGCMLRCGQMVLAQALMRRHLGREWRW 94
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
+P K +++Y+ IL +F D + FSIH + Q G + G G W GP + LA
Sbjct: 95 EPGTK--NKDYLYILRMFQDKKNCTFSIHQIAQMGVSEGKTVGEWFGPNTVAHVLRKLAI 152
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR-HCSVF------------- 258
+ + +AI+V + V I++ S+ C ++
Sbjct: 153 FDKWSS--------LAIHVAMDN---------TVIINEISKFRCHIWAAADGLVRNRTNS 195
Query: 259 -----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
+ + W P+LL +PL LGL ++N Y L+ TF QSLG++GGKP + Y +
Sbjct: 196 EPSRPANSEGSWKPLLLFIPLRLGLSEINRIYAFGLKRTFALKQSLGMIGGKPNHALYFI 255
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
GV E+ I+LDPH Q ++ D D +YH +++ +DPS+A+ FY
Sbjct: 256 GVVEDELIFLDPHTTQLACDLDVD--SPDDQSYHCAHASRMNISELDPSVALCFY 308
>gi|195118032|ref|XP_002003544.1| GI17971 [Drosophila mojavensis]
gi|193914119|gb|EDW12986.1| GI17971 [Drosophila mojavensis]
Length = 382
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 162/339 (47%), Gaps = 47/339 (13%)
Query: 48 MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 98
M + E LGP I +++WLLG + Q+ L
Sbjct: 13 MDSVFEAYLGPDGVLAGAVGEIEDIPKRNTNVWLLGKRYNAIQE-----------LEPIR 61
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
+D SR+ +YR GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W P
Sbjct: 62 RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTP 118
Query: 159 --FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
D Y++I++ F D+ S +SIH + G++ A G W+GP + + + L R
Sbjct: 119 DCRDATYLKIVNRFEDTRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKILVRFDDW 178
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
+ +A++V V +DD C ++ W P+LL+VPL L
Sbjct: 179 SS--------LAVHVAMDS---------TVVLDDIYTCCQ--ESSESSWKPLLLIVPLRL 219
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G+ +NP YIP L+ S G++GG+P + Y +G ++ +YLDPH Q + +
Sbjct: 220 GITDINPIYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGAVAQ 279
Query: 337 DDLEAD---TSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
A+ +YH + ++DPSLA+ F C+ +
Sbjct: 280 KTTAAERELDESYHQKYAARLSFGAMDPSLAVCFLCKTR 318
>gi|406042044|gb|AFS31124.1| autophagy related protein Atg4-like protein, partial [Spodoptera
litura]
Length = 365
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 103/316 (32%), Positives = 157/316 (49%), Gaps = 28/316 (8%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + +W+LG + QD L +D +S I +YRKGF PIGD +T
Sbjct: 5 IPQTKESVWILGKKYSAIQD-----------LDRIRRDITSIIWCTYRKGFIPIGDEGLT 53
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 180
SD GWGCMLR QM++ AL+ L W + P R+ Y++I+ F + + +P+SI
Sbjct: 54 SDKGWGCMLRCGQMVLGVALVRVHLSADW---VWTPETRDPTYLKIIQRFEERKQAPYSI 110
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + G + G G W GP + + + L + + + I+V + +
Sbjct: 111 HQVALMGASEGKQVGQWFGPNTVAQVLKKLTVYDKWSS--------LVIHVALDNTVVKE 162
Query: 241 GGAPVVCIDDASRHCSVFSKGQA-DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
+++ CS DW P+LL+VPL LGL ++NP YI L++ F PQS+
Sbjct: 163 DILQQCVVNNDRGDCSAAPDSLVTDWMPLLLIVPLRLGLSEINPIYIDGLKICFQCPQSI 222
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQP---VINIGKDDLEADTSTYHSDVIRHIHL 356
G++GGKP + Y+VG + IYLDPH Q V D+ + +YH I +
Sbjct: 223 GVIGGKPNQALYLVGCVGDEVIYLDPHTTQRSGLVETKTTDEQKEMDWSYHCKYASRIPM 282
Query: 357 DSIDPSLAIGFYCRDK 372
++DPS+A+ F CR K
Sbjct: 283 LAMDPSVAVCFLCRTK 298
>gi|224059752|ref|XP_002193231.1| PREDICTED: cysteine protease ATG4B [Taeniopygia guttata]
Length = 393
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 108/323 (33%), Positives = 154/323 (47%), Gaps = 39/323 (12%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQMDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 190 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 230
G + G W GP + + +W +LA E CQS A
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSHVPCAGAAA 193
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
+ + D G P +D + A W P++LL+PL LGL ++N YI TL+
Sbjct: 194 CPALESDVLYNGCP----EDVG-----LRERLALWKPLVLLIPLRLGLTEINEAYIETLK 244
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG++GGKP ++ Y +G E IYLDPH QP + G D S +
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPGDSGCLPDESFHCQHP 304
Query: 351 IRHIHLDSIDPSLAIGFYCRDKG 373
+ + +DPS+A+GF+C +
Sbjct: 305 PCRMSIAELDPSIAVGFFCNTEA 327
>gi|321472016|gb|EFX82987.1| hypothetical protein DAPPUDRAFT_302128 [Daphnia pulex]
Length = 405
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 110/330 (33%), Positives = 161/330 (48%), Gaps = 39/330 (11%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
S S IWLLG + + + N DF SRI ++YRK F + S TSD
Sbjct: 18 SKDSPIWLLGRIYHQSHKTDDSSSLPTNNFEALKSDFFSRIWLTYRKEFPVLNGSYYTSD 77
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPFDREYVEILHLFGD--S 173
GWGCMLRS QML+AQAL+ H LGR WR + LQ+ R I+ FGD S
Sbjct: 78 CGWGCMLRSGQMLLAQALVCHFLGRDWRWNESGAQEQQTLQESLHR---MIVQWFGDKPS 134
Query: 174 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
P SIH ++ G + G G W GP ++ S+ QRA T + + +Y+
Sbjct: 135 PACPLSIHQMVSQGHISAGKRPGDWYGPSSV--SYIIKQILQRA-TDTYPELDTLRVYIA 191
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQAD----------WTPILLLVPLVLGLEKVN 282
V +DD + CS + + W ++LL+PL LG E++N
Sbjct: 192 QD---------CTVYLDDVKQSCSKICNYECEETDYELIDDQWKSLILLIPLRLGGERMN 242
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
P Y L+ + Q +GI+GGKP S Y +G Q++ I+LDPH+ Q ++++ + +
Sbjct: 243 PTYDSCLKGLLSLEQCIGIIGGKPKHSQYFIGWQDDYLIHLDPHNCQEMVDVLIPNF--N 300
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
++H +R L +DPS +GFY R +
Sbjct: 301 LKSFHCHELRKTALKQVDPSCCVGFYLRSQ 330
>gi|194901010|ref|XP_001980048.1| GG20629 [Drosophila erecta]
gi|190651751|gb|EDV49006.1| GG20629 [Drosophila erecta]
Length = 708
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 111/316 (35%), Positives = 163/316 (51%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 289 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 348
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 349 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 408
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 409 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 468
Query: 245 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 469 HVPWQQAKRPQAETPKTEQQQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 528
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 529 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 586
Query: 363 LAIGFYCRDKGLLVTF 378
IGFYC K +F
Sbjct: 587 CCIGFYCATKSDFDSF 602
>gi|452837994|gb|EME39935.1| hypothetical protein DOTSEDRAFT_47435 [Dothistroma septosporum
NZE10]
Length = 442
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 149/303 (49%), Gaps = 49/303 (16%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 132
+EF +D S+I ++YR F PI S TSD GWGCM+R
Sbjct: 111 SEFLEDVESKIWLTYRNNFPPIPKSSEAAATSAMSFTTKLRNFANKDGFTSDTGWGCMIR 170
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
S Q L+A A+L HRLGR WR+ + +REY +IL LF D+ SP SIH ++ G +A G
Sbjct: 171 SGQSLLANAILIHRLGRDWRRGDK---EREYKDILSLFADTPESPLSIHKFVEHGAQACG 227
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA--IYVVSGDEDGERGGAPVVCID 249
G W GP A R AL + E GL S P +YV
Sbjct: 228 TYPGEWFGPNATARCIRALTE-KYHEAGLQVYSRPNDSDVYV------------------ 268
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D+ + + P L+++ + LG+EKV P Y L+ QS+GI GG+P +S
Sbjct: 269 DSLMQTAAQKDADDKFQPTLIVLGIRLGIEKVTPAYHAALKAALELSQSVGIAGGRPSSS 328
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Y +G Q ++ YLDPH +P+++ L D ++ H+ +R + + +DPS+ +GF
Sbjct: 329 HYFIGHQGDNFFYLDPHTTRPMLS--PQPLAEDINSCHTRRVRRLGIAEMDPSMLLGFLI 386
Query: 370 RDK 372
R K
Sbjct: 387 RSK 389
>gi|449266947|gb|EMC77925.1| Cysteine protease ATG4B, partial [Columba livia]
Length = 393
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 154/319 (48%), Gaps = 39/319 (12%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQVDNYFSVLNAFVDRKDSYYSIHQIAQMGVG 133
Query: 190 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 230
G + G W GP + + +W +LA E CQS A
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNAPCAGAAA 193
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
+ + DG G P ++A ++ W P++LL+PL LGL ++N YI TL+
Sbjct: 194 CPAVESDGLYNGCP----EEAG-----VRDRRSLWKPLVLLIPLRLGLTEINEAYIETLK 244
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEHNDSGCLPDESFHCQHP 304
Query: 351 IRHIHLDSIDPSLAIGFYC 369
+ + +DPS+A+GF+C
Sbjct: 305 PCRMSIAELDPSIAVGFFC 323
>gi|195394658|ref|XP_002055959.1| GJ10670 [Drosophila virilis]
gi|194142668|gb|EDW59071.1| GJ10670 [Drosophila virilis]
Length = 672
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 110/318 (34%), Positives = 161/318 (50%), Gaps = 21/318 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + + D+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 252 AAENQMADSPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 311
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 312 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGEQLGK 371
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ +E E P
Sbjct: 372 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEEQCSIPEPAPKP 431
Query: 245 VVCIDDASRH----CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V S+ + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 432 HVPWQMTSKKPASDAPKLDQPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 491
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG QE+ I+LDPH Q ++++ ++ ++H R + +D
Sbjct: 492 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETFS--MQSFHCKSPRKLKSSKMD 549
Query: 361 PSLAIGFYCRDKGLLVTF 378
PS IGFYC K +F
Sbjct: 550 PSCCIGFYCATKTDFDSF 567
>gi|157126425|ref|XP_001660889.1| hypothetical protein AaeL_AAEL010516 [Aedes aegypti]
gi|108873276|gb|EAT37501.1| AAEL010516-PA [Aedes aegypti]
Length = 583
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 109/335 (32%), Positives = 153/335 (45%), Gaps = 67/335 (20%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
+ F +DF +R+ ++YRK F + DS TSD GWGCM+RS QML+AQ LL H LGR WR
Sbjct: 167 IEAFKRDFVTRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLLVHFLGRNWRW 226
Query: 153 ----KPLQKPF------DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 200
+ L+ + D + +I+ FGD S TSPFSIH L+ GK G G W GP
Sbjct: 227 DATAESLRMNYHSLNYEDNVHRKIIRWFGDTSSRTSPFSIHTLVALGKETGKKPGDWYGP 286
Query: 201 YAMCRSWEALARCQRAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCIDDASRHCS 256
++A R L Q + + +YV V I D C+
Sbjct: 287 -------GSVAHLLRQAVKLAAQEISDLDGVNVYVAQDC---------AVYIQDIIDECT 330
Query: 257 VFS---------------------------------KGQADWTPILLLVPLVLGLEKVNP 283
V + W ++LLVPL LG EK+NP
Sbjct: 331 VSAGPTLAPWQKKSPGSSSSSTTSTSNSNPTTSSSTDSTDHWKSLILLVPLRLGAEKLNP 390
Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 343
Y L+ + +GI+GG+P S Y VG QE+ I+LDPH Q ++++ +
Sbjct: 391 IYSDCLKAMLSLDNCIGIIGGRPKHSLYFVGFQEDKLIHLDPHYCQDMVDVVNQE-NFPV 449
Query: 344 STYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+++H R + L +DPS IGFYC + F
Sbjct: 450 ASFHCKSPRKMKLSKMDPSCCIGFYCETRKDFFKF 484
>gi|326925776|ref|XP_003209085.1| PREDICTED: cysteine protease ATG4B-like [Meleagris gallopavo]
Length = 393
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 105/330 (31%), Positives = 151/330 (45%), Gaps = 59/330 (17%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 190 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 224
G + G W GP + + +W +LA CQ + G +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193
Query: 225 LPMA----IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
P +Y +E G R + W P++LL+PL LGL +
Sbjct: 194 CPTVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCL 294
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
D S + + + +DPS+A+GF+C
Sbjct: 295 PDESFHCQHPPCRMSIAELDPSIAVGFFCH 324
>gi|47087191|ref|NP_998738.1| cysteine protease ATG4B [Gallus gallus]
gi|61211779|sp|Q6PZ02.1|ATG4B_CHICK RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related cysteine endopeptidase 2B;
Short=Autophagin-2B; Short=cAut2B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|45861662|gb|AAS78584.1| AUT2B [Gallus gallus]
Length = 393
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 105/330 (31%), Positives = 151/330 (45%), Gaps = 59/330 (17%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 190 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 224
G + G W GP + + +W +LA CQ + G +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193
Query: 225 LPMA----IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
P +Y +E G R + W P++LL+PL LGL +
Sbjct: 194 CPAVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCL 294
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
D S + + + +DPS+A+GF+C
Sbjct: 295 PDESFHCQHPPCRMSIAELDPSIAVGFFCH 324
>gi|195570668|ref|XP_002103326.1| GD20357 [Drosophila simulans]
gi|194199253|gb|EDX12829.1| GD20357 [Drosophila simulans]
Length = 703
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/316 (35%), Positives = 161/316 (50%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463
Query: 245 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 464 HVPWQQAKRPQAETPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 581
Query: 363 LAIGFYCRDKGLLVTF 378
IGFYC K F
Sbjct: 582 CCIGFYCATKSDFDNF 597
>gi|355757609|gb|EHH61134.1| Cysteine protease ATG4A, partial [Macaca fascicularis]
Length = 396
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 97/310 (31%), Positives = 155/310 (50%), Gaps = 32/310 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 195
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S HC W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 196 DRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 248
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 249 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 308
Query: 356 LDSIDPSLAI 365
+ ++DPS+A+
Sbjct: 309 ILNLDPSVAL 318
>gi|194759168|ref|XP_001961821.1| GF15159 [Drosophila ananassae]
gi|190615518|gb|EDV31042.1| GF15159 [Drosophila ananassae]
Length = 402
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 160/324 (49%), Gaps = 38/324 (11%)
Query: 51 IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYR 110
+ + V G I +D+W+LG + Q+ L +D SR+ +YR
Sbjct: 31 VGQAVGGGESEDIPRRNTDVWVLGKRYNAIQELEL-----------IRRDIQSRLWCTYR 79
Query: 111 KGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W + D Y++I++ F
Sbjct: 80 CGFAPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-PECRDATYLKIVNRF 138
Query: 171 GDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 230
D + S +SIH + G++ A G W+GP + + + L R +A++
Sbjct: 139 EDVKNSCYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVRFD--------DWCSLAVH 190
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
V V +DD C + W P+LL++PL LG+ +NP Y+P L+
Sbjct: 191 VAMDS---------TVVLDDIYSLC----REGDSWKPLLLVIPLRLGITDINPMYVPALK 237
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----DLEADTSTY 346
S G++GG+P + Y +G ++ +YLDPH Q +G+ + E D TY
Sbjct: 238 RCLELDSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGTVGQKTGVGEQEYD-ETY 296
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCR 370
H ++ ++DPSLA+ F C+
Sbjct: 297 HQKHAARLNFSAMDPSLAVCFLCK 320
>gi|195501322|ref|XP_002097748.1| GE26385 [Drosophila yakuba]
gi|194183849|gb|EDW97460.1| GE26385 [Drosophila yakuba]
Length = 706
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 110/316 (34%), Positives = 163/316 (51%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 287 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 346
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 347 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 406
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 407 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 466
Query: 245 VVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + K + W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 467 HVPWQQAKRPQAETPKTEQHQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 526
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 527 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 584
Query: 363 LAIGFYCRDKGLLVTF 378
IGFYC K +F
Sbjct: 585 CCIGFYCATKSDFDSF 600
>gi|24647125|ref|NP_650452.1| CG6194 [Drosophila melanogaster]
gi|23171357|gb|AAF55180.2| CG6194 [Drosophila melanogaster]
gi|261490735|gb|ACX83596.1| RE44406p [Drosophila melanogaster]
Length = 668
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/316 (35%), Positives = 162/316 (51%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 249 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 308
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 309 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 368
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 369 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 428
Query: 245 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + +K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 429 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 488
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 489 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 546
Query: 363 LAIGFYCRDKGLLVTF 378
IGFYC K F
Sbjct: 547 CCIGFYCATKSDFDNF 562
>gi|195328749|ref|XP_002031074.1| GM25780 [Drosophila sechellia]
gi|194120017|gb|EDW42060.1| GM25780 [Drosophila sechellia]
Length = 703
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/316 (35%), Positives = 161/316 (50%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHASQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463
Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 464 HVPWQKAKRPQAENPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 581
Query: 363 LAIGFYCRDKGLLVTF 378
IGFYC K F
Sbjct: 582 CCIGFYCATKSDFDNF 597
>gi|449268268|gb|EMC79138.1| Cysteine protease ATG4C [Columba livia]
Length = 459
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/370 (30%), Positives = 165/370 (44%), Gaps = 80/370 (21%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 112
S S ++LLG C+ DE+ G+ + G+N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVFLLGKCYHFKTDES-GELSTDGSNFDKINTEISGNVEEFRKDFISRIWLTYREE 94
Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 151
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 95 FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIDSSDSESWTAHTV 154
Query: 152 --------------RKP----------LQKPFDRE-------YVEILHLFGDSETSPFSI 180
R+P L++ +D + +I+ FGDS + F +
Sbjct: 155 KKLTASFEASLTAEREPKILSNHHRGTLKRNWDESERRNEVYHRKIISWFGDSPLTAFGL 214
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H L++ GK G AG W GP + R G + IYV
Sbjct: 215 HQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTIYVAQD------ 263
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V D R CS G+AD +++LVP+ LG E+ N Y+ ++ + +G
Sbjct: 264 --CTVYSSDVIDRQCSFMDSGEADTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVG 321
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +D
Sbjct: 322 IIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMD 379
Query: 361 PSLAIGFYCR 370
PS IGFYCR
Sbjct: 380 PSCTIGFYCR 389
>gi|17862242|gb|AAL39598.1| LD17482p [Drosophila melanogaster]
Length = 653
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/316 (35%), Positives = 162/316 (51%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 234 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 293
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 294 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 353
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 354 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 413
Query: 245 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + +K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 414 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 473
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 474 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 531
Query: 363 LAIGFYCRDKGLLVTF 378
IGFYC K F
Sbjct: 532 CCIGFYCATKSDFDNF 547
>gi|225718596|gb|ACO15144.1| Cysteine protease ATG4B [Caligus clemensi]
Length = 390
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 162/321 (50%), Gaps = 49/321 (15%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
S S +W+LG + N +AE N + SR+L +YRK F I S TSD
Sbjct: 28 SDSPVWILG-----------NELCARNDIAELNSEVLSRLLFTYRKEFSEIDGSGYTSDS 76
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 179
GWGCMLR QM++ +AL LGR W+ + + +Y++IL+LF DS+ +P+S
Sbjct: 77 GWGCMLRCGQMVLGEALQRISLGRDWKWDHKVDNEVDEDLKGKYLKILNLFQDSKVAPYS 136
Query: 180 IHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
IH + G++ G+W GP + + + L+ ++ ++P+ ++V +
Sbjct: 137 IHQIALMGESIQSKKPVGTWFGPNTVAQVLKKLSFFEK--------TVPIRLHVAMDN-- 186
Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
V ID+ C F G ++ P+LL +PL LGL ++NP Y L+ F FPQ
Sbjct: 187 -------TVIIDEIKESCG-FVGGDSE-KPLLLFIPLRLGLTEINPIYFQDLKECFEFPQ 237
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT------STYHSDVI 351
LG++GG+P + Y +G + IYLDPH I+ DT T+H++
Sbjct: 238 ILGVIGGRPNHALYFIGYVDNELIYLDPH-----ISTQSASSTVDTFGGPQDQTHHTERA 292
Query: 352 RHIHLDSIDPSLAIGFYCRDK 372
+ +DPSL++ F CR++
Sbjct: 293 YRMDFKDLDPSLSLCFLCRNE 313
>gi|194764839|ref|XP_001964535.1| GF23235 [Drosophila ananassae]
gi|190614807|gb|EDV30331.1| GF23235 [Drosophila ananassae]
Length = 668
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 110/316 (34%), Positives = 163/316 (51%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 248 AVENQVGEHPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 307
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H +GR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 308 QGLICHFMGRTWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGENLGK 367
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 368 KPGDWYGPASVSYLLKHALEHAAQENADFDNISIYVAKDCTIYLQDIEDQCSVPEPAPKP 427
Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + SK Q W +++L+PL LG +K+N Y L+L + LGI+
Sbjct: 428 NVPWQQAKRPQAEVSKTEHQQHWKALIVLIPLRLGSDKLNLAYAHCLKLLLSTEHCLGII 487
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ + ++H R + +DPS
Sbjct: 488 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENFSLN--SFHCKSPRKLKSSKMDPS 545
Query: 363 LAIGFYCRDKGLLVTF 378
IGFYC K F
Sbjct: 546 CCIGFYCATKSDFDNF 561
>gi|351713264|gb|EHB16183.1| Cysteine protease ATG4B [Heterocephalus glaber]
Length = 475
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 109/330 (33%), Positives = 163/330 (49%), Gaps = 48/330 (14%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 100 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 146
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 147 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQI 206
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 239
Q G G + G W GP + + + LA + +A++V + E+
Sbjct: 207 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHVAMDNTVVMEEIR 258
Query: 240 R---------GGAPVVCIDDASRHCSVF----------SKGQADWTPILLLVPLVLGLEK 280
R G A + DA RHC+ F S + W P++LL+PL LGL
Sbjct: 259 RLCRSSLPCSGAAALPA--DADRHCNGFPAPMEVTSRPSPSPSPWRPLVLLIPLRLGLTD 316
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+N Y+ TL+ F PQSLG++GGKP ++ Y +G + IYLDPH QP + +
Sbjct: 317 INEAYVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGKELIYLDPHTTQPAVELTDGCFI 376
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
D + + + + +DPS+A+GF+C+
Sbjct: 377 PDETFHCQHPPCRMGIGELDPSIAVGFFCK 406
>gi|345564445|gb|EGX47408.1| hypothetical protein AOL_s00083g501 [Arthrobotrys oligospora ATCC
24927]
Length = 444
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 109/297 (36%), Positives = 158/297 (53%), Gaps = 45/297 (15%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-------------------ITSDVGWGCMLRSSQML 137
F DF ++ ++YR F PI S TSD GWGCM+RS Q +
Sbjct: 111 FLDDFDAKFWMTYRSAFPPIPLSTTSRNMTLATRIRSLADQEGFTSDTGWGCMIRSGQCV 170
Query: 138 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGS 196
+A A+ +LGR WR+ + P +E IL LF D +PFS+HN ++ G+A G+ G
Sbjct: 171 LANAISLLKLGRDWRRG-KSP--QEEQHILSLFADDPRAPFSLHNFVKYGEASCGVYPGE 227
Query: 197 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 256
W GP A R +ALA A+ G Q +Y+ +GD GG +DA R +
Sbjct: 228 WFGPSATARCIQALA----AQHDEGLQ-----VYI-TGD-----GGD---VYEDAFRKIA 269
Query: 257 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 316
+ G + P L+LV + LG+E+V P Y L+ + PQS+GI GG+P AS Y +GVQ
Sbjct: 270 ISDDGV--FHPTLVLVGIRLGIERVTPVYWEALKSSLMMPQSVGIAGGRPSASHYFIGVQ 327
Query: 317 EESAIYLDPHDVQPVINIGKD-DLEADTSTY-HSDVIRHIHLDSIDPSLAIGFYCRD 371
+S YLDPH+ +P++ KD D A+ + H+ +R +HL +DPS+ + F RD
Sbjct: 328 GQSLFYLDPHNTRPLLPYRKDSDYTAEEIEFCHTRKLRRLHLREMDPSMLLAFLIRD 384
>gi|354475125|ref|XP_003499780.1| PREDICTED: cysteine protease ATG4D [Cricetulus griseus]
gi|344240088|gb|EGV96191.1| Cysteine protease ATG4D [Cricetulus griseus]
Length = 474
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 110/354 (31%), Positives = 163/354 (46%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----VHLCGRRYHFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQKP------------- 158
+TSD GWGCMLRS QM++AQ LL H L R WR P + P
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLAPPEMPGPASPSRYRGPGR 193
Query: 159 --------------FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 HVPPRWTQGTLEMEQDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C +P + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-KCSEVPRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS IGFY ++ T
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTIGFYAGNRKEFETL 408
>gi|291202714|dbj|BAI82576.1| autophagy-related 4 [Haemaphysalis longicornis]
Length = 387
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/293 (33%), Positives = 147/293 (50%), Gaps = 39/293 (13%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
L + D +S+I ++YR+ F I + TSD GWGCMLR QM VA+AL+ L R W+
Sbjct: 41 LDDLRSDVTSKIWLTYRRNFPAISGTDYTSDTGWGCMLRCGQMAVAEALMRRHLRRGWQW 100
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
P + D Y+ +L +F D + FSIH + Q G + G A G W GP + LA
Sbjct: 101 APGIR--DESYLRVLRMFQDKKNCTFSIHQIAQMGVSEGKAVGQWFGPNTVAHVLRKLAA 158
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA--------- 263
+ + +AI+V + VV +DD + C + + ++
Sbjct: 159 FDKWSS--------LAIHVAMDN---------VVIMDDIRKVCRLEATAESGVRNRAEPA 201
Query: 264 --------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
W P+LL +PL LGL ++NP Y L+ TF QSLGI+GGKP + YI+GV
Sbjct: 202 GLAAAAAESWKPLLLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYIIGV 261
Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+ ++LDPH Q +++ D D +YH + + +DPS+A+ FY
Sbjct: 262 VGDDLVFLDPHTTQLAVDL--DTEFPDDESYHCAHASRMDIGQLDPSIALCFY 312
>gi|242007959|ref|XP_002424782.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
gi|212508305|gb|EEB12044.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
Length = 388
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 112/317 (35%), Positives = 168/317 (52%), Gaps = 26/317 (8%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + +W+LG + +D + D S++ +YRKGF PIGDS +T
Sbjct: 21 IPQTREPVWILGRKYDAGRD-----------VTAIRSDIKSKLWFTYRKGFVPIGDSGLT 69
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 180
SD GWGCMLR QM++AQAL+ LGR WR K ++P EY+ IL +F D++T+ +SI
Sbjct: 70 SDKGWGCMLRCGQMVLAQALVCLHLGRDWRWKKDSKEP---EYLRILKMFEDTKTATYSI 126
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + G + G G W GP + + + L+ + + + +L I V +R
Sbjct: 127 HQIALMGVSEGKDVGQWFGPNTVTQVLKKLSVYDKWSSIVIHVALDNTIIVNDIKSLCQR 186
Query: 241 GGAPVVCIDDASRHCS-----VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
V ID +++ S V+ W P+LL+VPL LGL ++NP Y+ L+ FTF
Sbjct: 187 NEQSV--IDSSAQKHSPLNEPVYFNSARKWKPLLLVVPLRLGLSEINPVYLNGLKTCFTF 244
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVIR 352
QSLG++GGKP + Y +G E IYLDPH QPV + +L + + +YH
Sbjct: 245 RQSLGVIGGKPNHALYFIGCVGEHVIYLDPHTTQPVSIVDGKELSYEKTADLSYHCPRAS 304
Query: 353 HIHLDSIDPSLAIGFYC 369
+ +DPS+A+ F+C
Sbjct: 305 RSRILDMDPSVAVCFFC 321
>gi|453080987|gb|EMF09037.1| putative cysteine protease atg4 [Mycosphaerella populorum SO2202]
Length = 447
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 105/301 (34%), Positives = 144/301 (47%), Gaps = 45/301 (14%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 132
++F DF SRI I+YR GF PI S TSD GWGCM+R
Sbjct: 110 SDFIDDFESRIWITYRDGFPPIAKSTDPAAGSKMSFTTKLRSLTNQQGFTSDTGWGCMIR 169
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
S Q L+A +L HRLGR WRK ++ E+ IL LF D+ +PFSIH ++ G +A G
Sbjct: 170 SGQSLLANTILLHRLGRDWRKGQKQ---EEHKNILSLFADTPEAPFSIHKFVEHGAQACG 226
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A ARC RA T + +Y D D DA
Sbjct: 227 TYPGEWFGP-------NATARCLRALTD-KYHGAGLRVYARPNDSD---------VYADA 269
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ + P L+++ + LG+EKV Y L+ PQS+GI GG+P +S Y
Sbjct: 270 LIETATQKDADDKFQPTLIVLGIRLGIEKVTSAYHVALKAALELPQSVGIAGGRPSSSHY 329
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
+G Q +S YLDPH + +++ D T H+ IR + L +DPS+ +GF R
Sbjct: 330 FLGHQGDSFFYLDPHTTRHMLSPQPS--AEDIETCHTRRIRKLPLSEMDPSMLLGFLVRS 387
Query: 372 K 372
+
Sbjct: 388 Q 388
>gi|398389911|ref|XP_003848416.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
gi|339468291|gb|EGP83392.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
Length = 440
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 101/301 (33%), Positives = 149/301 (49%), Gaps = 45/301 (14%)
Query: 95 AEFNQDFSSRILISYRKGFDPI----------------------GDSKITSDVGWGCMLR 132
++F DF SR+ ++YR F PI TSD GWGCM+R
Sbjct: 109 SQFLDDFESRVWMTYRNNFPPIQKASDPAATSNMSFATKLRSLANQGNFTSDTGWGCMIR 168
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
S Q L+A ++ RLGR WR+ + ++++ EIL +F D+ +PFSIH ++ G A G
Sbjct: 169 SGQSLLANTVVMLRLGRDWRRGQK---EKQHHEILSMFADTPEAPFSIHKFVEHGASACG 225
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A ARC RA T + + +Y D D V ID
Sbjct: 226 TYPGEWFGP-------SATARCIRALTE-KYHDVGLRVYARPNDSD--------VYIDTL 269
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ + S + ++P L+++ + LG+EKV P Y L+ PQS+GI GG+P +S Y
Sbjct: 270 TATTTQHSASET-FSPTLIVLGVRLGIEKVTPAYHAALKSILELPQSVGIAGGRPSSSHY 328
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
VG Q + YLDPH +P++ D + H+ IR + + +DPS+ +GF RD
Sbjct: 329 FVGHQGDHFFYLDPHTTRPMLTAQP--TAEDVESCHTRRIRRLSIAEMDPSMLLGFLVRD 386
Query: 372 K 372
K
Sbjct: 387 K 387
>gi|324506823|gb|ADY42901.1| Cysteine protease ATG4B [Ascaris suum]
Length = 433
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 162/359 (45%), Gaps = 62/359 (17%)
Query: 62 GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI 121
+ S + ++LLG HK A GD + + E+ +SR+ +YRK F PIG +
Sbjct: 19 SVFDSNTPVYLLG--HKFP---ARGDM---DSIKEY---VTSRLWFTYRKNFMPIGGTGP 67
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 181
TSD GWGCMLR QML+AQAL+ LG W + +Y IL +F D + PFS+H
Sbjct: 68 TSDQGWGCMLRCGQMLLAQALIVRHLGTEWMWDRDNK-EEDYKRILRMFQDKKCCPFSLH 126
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALA---RCQRAETGLGCQSLPMAIYVVS----- 233
+ Q G + G W GP + + L R + +L +A V +
Sbjct: 127 QIAQMGVSERKQIGEWFGPNTAAQVLKKLVVYDDWSRLAVHVALDNLLIASDVRTMAHTR 186
Query: 234 ---------------GDEDGERGGAPVVCIDDASRHCSVFS-----------KGQADWTP 267
+E G G +C + + C + S + + W P
Sbjct: 187 PPSRLSSRHTTENEQSEESGNASGGNSLCSFGSVKMCMLQSALMKECDENPVEDEEQWRP 246
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
+L++VPL LGL +N Y+P + F PQ GI+GG+P + Y +G+ E IYLDPH
Sbjct: 247 LLIIVPLRLGLTSINRCYLPAIEAFFQLPQCTGIIGGRPNHALYFIGIAGEQLIYLDPHV 306
Query: 328 VQPVINIG----------------KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
Q I++ K D S+YH + HI DS DPSLA+ F CR
Sbjct: 307 CQAAIDLDERCASLQQQDGFVEVVKSTDIFDDSSYHCPFLLHIAYDSADPSLALSFICR 365
>gi|405972565|gb|EKC37327.1| Cysteine protease ATG4B [Crassostrea gigas]
Length = 405
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 98/307 (31%), Positives = 144/307 (46%), Gaps = 47/307 (15%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--K 153
E DF S+I +YRK F IG + T D GWGCMLR QM++AQAL+ LGR W+ K
Sbjct: 46 ELKGDFLSKIWCTYRKNFPAIGGTGPTCDGGWGCMLRCGQMMLAQALVVRHLGRDWKWNK 105
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
Q D+ Y IL +F D +++ +SI + G + G GSW GP + + + LA
Sbjct: 106 NCQ---DQTYKRILQMFADKKSANYSIQQIASMGVSEGKPVGSWFGPNTVAQVLKKLAVY 162
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----------- 262
+ + ++ D VC DD C + Q
Sbjct: 163 DEWSS---------IVIHIAMDNTVIENDIKSVCKDDGKSTCDIIGVRQLKHESAATGRS 213
Query: 263 --------------------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
W P+LL++PL LGL ++N Y+ +L+ +FPQS+GI+
Sbjct: 214 KKSSQDSSKQDKNKQNAVDVKSWKPLLLVIPLRLGLTEINSVYVQSLKACLSFPQSVGII 273
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP + + VG + IYLDPH Q ++ D +YH +++ +DPS
Sbjct: 274 GGKPNHAHWFVGYMSDKLIYLDPHTTQLCEDL--DSPNFSDESYHCPYPSTMNVMELDPS 331
Query: 363 LAIGFYC 369
+A+GFYC
Sbjct: 332 IALGFYC 338
>gi|195051960|ref|XP_001993206.1| GH13687 [Drosophila grimshawi]
gi|193900265|gb|EDV99131.1| GH13687 [Drosophila grimshawi]
Length = 393
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 156/315 (49%), Gaps = 38/315 (12%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +++WLLG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNANVWLLGKRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFVPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 180
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D+ S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTPDCRDTTYLKIVNRFEDTRKSFYSI 148
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + G++ A G W+GP + + + L R SL + + + S
Sbjct: 149 HQIALMGESQNKAVGEWLGPNTVAQILKILVRFD------DWSSLNVHVAMDS------- 195
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V +DD C ++ W P+LL+VPL LG+ +NP Y+P L+ S G
Sbjct: 196 ----TVVLDDIFTLCQ--EPSESAWKPLLLIVPLRLGISDINPIYVPALKRCLELNSSCG 249
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 357
++GG+P + Y +G ++ +YLDPH Q + + A+ +YH +
Sbjct: 250 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGAVAQKTTAAEQELDESYHQKYAARLSFA 309
Query: 358 SIDPSLAIGFYCRDK 372
++DPSLA+ F C+ +
Sbjct: 310 AMDPSLAVCFLCKTR 324
>gi|340383455|ref|XP_003390233.1| PREDICTED: cysteine protease ATG4D-like [Amphimedon queenslandica]
Length = 437
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 154/326 (47%), Gaps = 46/326 (14%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
S+ S + +LG + +D + F F S ++YR GF PI S +T+D
Sbjct: 61 SNNSPVLVLGKLYIPERDTKPQSEGIPRHILMFMDHFYSLPWMTYRCGFSPILSSSLTTD 120
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR-----------KPLQKPFDREYVEILHLFGDS 173
GWGCM+RS QML+A L H LGR WR K ++ V IL FGDS
Sbjct: 121 CGWGCMVRSGQMLLATVLHLHFLGRDWRLSSSDVTGHKIHRQVKNWNNYVVLILSWFGDS 180
Query: 174 ETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIY 230
E+ PFSIH L++A +G G W GP + L R C R + IY
Sbjct: 181 ESELCPFSIHRLMEAAYYHGNKPGDWFGPSQV----SILIRDCVRRALREHINLQKLNIY 236
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW------TPILLLVPLVLGLEKVNPR 284
V S C+V+ K D +L+LVP+ LG E +NP
Sbjct: 237 V--------------------SHDCTVYIKDVQDIFESDLDQSLLVLVPVRLGSESLNPI 276
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
YIP ++ ++GI+GG+P S + +G Q+E+ I+LDPH Q +N+ + D D S
Sbjct: 277 YIPCVKALLALDHTVGIIGGRPKHSVFFIGFQDENLIHLDPHYSQTAVNMTRTDF--DVS 334
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCR 370
+YH + I + +DPS +GFYC
Sbjct: 335 SYHCRSPKKIPVTKMDPSCTLGFYCH 360
>gi|315047608|ref|XP_003173179.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
gi|311343565|gb|EFR02768.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
Length = 471
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 108/313 (34%), Positives = 154/313 (49%), Gaps = 58/313 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 131
+F DF SR+ I+YR F PI DS + TSD GWGCM+
Sbjct: 136 QFLDDFESRLWITYRSQFPPIPKMPKTGSSDSSMPLGVRLRSQLIDTQGFTSDTGWGCMI 195
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH +Q G A
Sbjct: 196 RSGQALLANTLLFLRLGRDWRRGSKI---QEESELVSLFADHPRAPFSIHRFVQHGATAC 252
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 249
G G W GP A + +AL + + GL +YV + G + ER V C +
Sbjct: 253 GKCPGEWFGPSAAAQCIQALVKSN-PQAGL-------RVYVTNDGSDIYERQFREVACDE 304
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
S P L+L+ + LG+++V P Y +L+ +PQS+GI GG+P +S
Sbjct: 305 SGS------------IKPTLILLGVRLGIDRVTPIYWDSLKALLHYPQSVGIAGGRPSSS 352
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---------ADTSTYHSDVIRHIHLDSID 360
Y + Q +S YLDPH +P + + E + STYH+ +R +H+ +D
Sbjct: 353 HYFIATQGDSFFYLDPHQTRPCLAPRSEPTEDEESHPYSPEELSTYHTRRLRRLHVREMD 412
Query: 361 PSLAIGFYCRDKG 373
PS+ IG RD+G
Sbjct: 413 PSMLIGLLVRDEG 425
>gi|45861658|gb|AAS78582.1| Aut2B1 [Bos taurus]
Length = 342
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 149/312 (47%), Gaps = 26/312 (8%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V++ R
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187
Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P + D+ RHC+ F A W P++LL+PL LGL VN Y TL+ F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307
Query: 354 IHLDSIDPSLAI 365
+ + +DPS+A+
Sbjct: 308 MSIAELDPSIAV 319
>gi|346466653|gb|AEO33171.1| hypothetical protein [Amblyomma maculatum]
Length = 401
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 98/282 (34%), Positives = 148/282 (52%), Gaps = 16/282 (5%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
L + D +S+I ++YRK F I + TSD GWGCMLR QM++A+AL+ LG+ W+
Sbjct: 54 LDDLRNDVTSKIWLTYRKNFPAISGTDHTSDTGWGCMLRCGQMVIAEALMRRHLGKGWQW 113
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
P + D Y+ +L +F D + +SIH + Q G + G A G W GP + L+
Sbjct: 114 APGIR--DENYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKAVGQWFGPNTIAHVLRKLSA 171
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA-----DWTP 267
+ + L + V+ R P V DD RH S G A W P
Sbjct: 172 FDKW-SSLAVHVAMDNVVVMDDIRKICRVETPAV--DDGVRH-RTQSHGLACASAVSWKP 227
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
+LL +PL LGL ++NP Y L+ TF QS+GI+GGKP + +I+GV + ++LDPH
Sbjct: 228 LLLFIPLRLGLNEINPVYYCGLKRTFALKQSVGIIGGKPNHALFIIGVVGDDLVFLDPHT 287
Query: 328 VQPVINIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
Q +++ D+E + +YH + + +DPS+A+ FY
Sbjct: 288 TQLAVDL---DVEFPEDESYHCAHASRMDIGQLDPSIALCFY 326
>gi|327306465|ref|XP_003237924.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
gi|326460922|gb|EGD86375.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
Length = 454
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 155/312 (49%), Gaps = 58/312 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 131
+F DF S++ I+YR F PI GDS I TSD GWGCM+
Sbjct: 119 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSISLGVRLRSQLIDTQGFTSDTGWGCMI 178
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G A
Sbjct: 179 RSGQALLANTLLFIRLGRDWRRGSKL---QEESELVSLFADHPRAPFSIHRFVHHGATAC 235
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 249
G G W GP A + +AL + + GL +Y+ S G + E+ V C +
Sbjct: 236 GKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVACDE 287
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
P L+L+ + LG+++V P Y +L+ FPQS+GI GG+P +S
Sbjct: 288 SGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGRPSSS 335
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHLDSID 360
Y + Q +S YLDPH +P + + D E+ + STYH+ +R +H+ +D
Sbjct: 336 HYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHIREMD 395
Query: 361 PSLAIGFYCRDK 372
PS+ IGF RD+
Sbjct: 396 PSMLIGFLVRDE 407
>gi|195158262|ref|XP_002020011.1| GL13755 [Drosophila persimilis]
gi|194116780|gb|EDW38823.1| GL13755 [Drosophila persimilis]
Length = 678
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 164/318 (51%), Gaps = 21/318 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310
Query: 140 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR L + D + +I+ FGD S++SPFSIH L++ G+ G
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430
Query: 245 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V A R + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG QE+ I+LDPH Q +++I ++ ++H R + + +D
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQEHF--SLHSFHCKSARKLKVSKMD 548
Query: 361 PSLAIGFYCRDKGLLVTF 378
PS IGFYC K +F
Sbjct: 549 PSCCIGFYCATKTDFDSF 566
>gi|427783027|gb|JAA56965.1| Putative cysteine protease required for autophagy [Rhipicephalus
pulchellus]
Length = 390
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 95/297 (31%), Positives = 150/297 (50%), Gaps = 44/297 (14%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
L + + +S+I ++YRK F I + TSD GWGCMLR QM+VA+A++ LG+ W+
Sbjct: 41 LDDLRSNITSKIWLTYRKNFPAISGTDYTSDTGWGCMLRCGQMVVAEAVMRRHLGKDWQW 100
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
P K D +Y+ +L +F D + +SIH + Q G + G G W GP + L+
Sbjct: 101 SPGTK--DEKYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKEVGQWFGPNTIAHVLRKLST 158
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV--------------- 257
+ + +A++V + VV +DD + C V
Sbjct: 159 FDKWSS--------LAMHVAMDN---------VVVMDDIRKICRVETTTDVEDGIRNRTQ 201
Query: 258 -----FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+ G W P++L +PL LGL ++NP Y L+ TF QSLGI+GGKP + YI
Sbjct: 202 SHGGPAAAGARSWKPLVLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYI 261
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+GV + ++LDPH Q +++ D+E + +YH + + +DPS+A+ FY
Sbjct: 262 IGVVGDDLVFLDPHTTQLAVDL---DVECPEDESYHCAHASRMDIGQLDPSIALCFY 315
>gi|390177147|ref|XP_001357920.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
gi|388858923|gb|EAL27056.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
Length = 676
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 164/318 (51%), Gaps = 21/318 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310
Query: 140 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR L + D + +I+ FGD S++SPFSIH L++ G+ G
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430
Query: 245 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V A R + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG QE+ I+LDPH Q +++I ++ ++H R + + +D
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQEHF--SLHSFHCKSARKLKVSKMD 548
Query: 361 PSLAIGFYCRDKGLLVTF 378
PS IGFYC K +F
Sbjct: 549 PSCCIGFYCATKTDFDSF 566
>gi|22658287|gb|AAH30861.1| Autophagy-related 4D (yeast) [Mus musculus]
gi|74152222|dbj|BAE32395.1| unnamed protein product [Mus musculus]
Length = 474
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 159/354 (44%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ G + +F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQQFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + + ++H R + +DPS +GFY ++ T
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETL 408
>gi|350631770|gb|EHA20141.1| hypothetical protein ASPNIDRAFT_178675 [Aspergillus niger ATCC
1015]
Length = 384
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 168/351 (47%), Gaps = 50/351 (14%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 108
+RI + + P S IW LG+ + +D A + F DF SRI ++
Sbjct: 11 KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKD-ANTRETQHAWPESFLLDFESRIWMT 69
Query: 109 YRKGFDPI----GDSK-------------------ITSDVGWGCMLRSSQMLVAQALLFH 145
YR F PI GD K TSD GWGCM+RS Q L+A AL
Sbjct: 70 YRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTGWGCMIRSGQSLLANALSML 129
Query: 146 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMC 204
LGR WR+ + F+ E ++L LF D+ T+PFS+H ++ G ++ G G W GP A
Sbjct: 130 VLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKYPGEWFGPSATA 186
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+ EAL+ C + + +YV + + + D +R+ S
Sbjct: 187 KCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK-----FMDIARNTS------GA 227
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
+ P L+L+ LG++ + P Y L+ FPQS+GI GG+P AS Y VG Q YLD
Sbjct: 228 FQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGRPSASHYFVGAQGSHLFYLD 287
Query: 325 PHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
PH +P + G+ + + TYH+ +R IH+ +DPS+ IGF R++
Sbjct: 288 PHYTRPALPDRQEGELYSKEEVDTYHTRRLRRIHVRDMDPSMLIGFLIRNQ 338
>gi|443730776|gb|ELU16134.1| hypothetical protein CAPTEDRAFT_228011 [Capitella teleta]
Length = 450
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/320 (35%), Positives = 156/320 (48%), Gaps = 47/320 (14%)
Query: 68 SDIWLLGVCHKIAQDEALGDAAGNNG------LAEFNQDFSSRILISYRKGFDPIGDSKI 121
S I LLG C+ ++ E N F +DFSS+I +YRK F + S +
Sbjct: 82 SPIILLGKCYCCSKSEKEDQRRQPNNSNILTTFDRFKRDFSSKIWFTYRKDFPKLYGSPL 141
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGD--SETS 176
TSDVGWGCMLR++QM++AQAL+ H LGR W + +E + +I+ LFGD S
Sbjct: 142 TSDVGWGCMLRTAQMIIAQALVMHYLGRDWTIHHTQQNRKETMLHRQIIRLFGDFPGNDS 201
Query: 177 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
PFSI L++ G +G G W GP ++ YVV
Sbjct: 202 PFSIQALVRIGVDHGKRPGDWYGPASVA-------------------------YVVRDAI 236
Query: 237 DGERGGAPV---VCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPT 288
+ P+ VC+ A C+V+ + D W +++LVP+ LG E +NP Y
Sbjct: 237 NQVPDFHPLLSQVCVYVAP-DCTVYIQDVIDLCTQHWKAVVILVPVRLGGEALNPIYSQC 295
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
++ LGI+GG+P S Y VG QEE +YLDPH Q ++ D TSTYH
Sbjct: 296 VQSLLAHELCLGIIGGRPKHSLYFVGWQEEKLLYLDPHFCQDTVDTRFRDFP--TSTYHC 353
Query: 349 DVIRHIHLDSIDPSLAIGFY 368
R + L +DPS +GFY
Sbjct: 354 LSPRKLALQKMDPSCTLGFY 373
>gi|166990662|sp|A7F045.2|ATG4_SCLS1 RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
Length = 439
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 102/304 (33%), Positives = 145/304 (47%), Gaps = 49/304 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF ++I ++YR F I S+ TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A ALL R+GR WR+ +R+ IL LF D +P+SIH ++ G A G
Sbjct: 163 GQSLLANALLTLRMGREWRRGSSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A ARC +A T +S + +Y+ D +D
Sbjct: 220 HPGEWFGP-------SAAARCIQALTNSQVES-ELRVYITGDGSD---------VYEDT- 261
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
S+ +TP L+LV LGL+K+ P Y L+ + PQS+GI GG+P +S Y
Sbjct: 262 -FMSIAKPNSTKFTPTLILVGTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYF 320
Query: 313 VGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+GVQE YLDPH +P + +D D + H+ +R +H+ +DPS+ I F
Sbjct: 321 IGVQESDFFYLDPHQTRPALPFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDPSMLIAFLI 380
Query: 370 RDKG 373
RD+
Sbjct: 381 RDEN 384
>gi|440891575|gb|ELR45180.1| Cysteine protease ATG4A, partial [Bos grunniens mutus]
Length = 408
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/326 (30%), Positives = 158/326 (48%), Gaps = 38/326 (11%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + +S D
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 195
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 196 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILN 311
Query: 359 IDPSLAI------------GFYCRDK 372
+DPS+A+ GF+C+++
Sbjct: 312 LDPSVALVVLSCLLLLPPKGFFCKEE 337
>gi|320169048|gb|EFW45947.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 918
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 115/345 (33%), Positives = 167/345 (48%), Gaps = 37/345 (10%)
Query: 64 SSSTSDIWLLGVCHKIAQDEALGDAAGNNG-----LAEFNQDFSSRILISYRKGFDPIGD 118
S S S IW+LG C+ + E G + + +F DF + + SYRK F+ I
Sbjct: 260 SISDSPIWMLGNCYSGKELECNGHTENKHNKRSRHICKFFADFQTLVCFSYRKDFERIPG 319
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQKPFDREYVEILHLFG 171
SK T+D GWGC LRS+QMLVA+AL+ GR WR PL + + I+ LF
Sbjct: 320 SKHTTDCGWGCTLRSAQMLVAEALVLQIFGRRWRIEDRSCPAPLSSSKEDQLRLIIRLFQ 379
Query: 172 DS--ETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
D SPFSIHN++Q G + + AG W GP ++ R + L A ++
Sbjct: 380 DQLRLDSPFSIHNIVQHGCQLFDKRAGDWFGPASVVRVFADLINQAYAMHQSPFRAYQAI 439
Query: 229 IYVVSGDEDGERGGAPVVCID-DASRHCSVFSKGQADWT-------------------PI 268
+++ D E P D + S S D T P+
Sbjct: 440 DHIIYRDLVAELCSGPDAVRDLEFSTPTSTSESVSTDETVTPSASTSQSPPVLPPPFIPL 499
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
L+L+PL LGL ++N YIP L+ Q +GI+GG+P S Y VG QE++ I+ DPH
Sbjct: 500 LILMPLRLGLNEINRMYIPCLKALLMCAQCVGIIGGRPRHSLYFVGYQEDNVIFADPHGC 559
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
+ +++ + T T+HS V I +DPS+AIGF C+++
Sbjct: 560 KRFVDMQQTSFP--TETFHSAVPNKIPFTHMDPSMAIGFLCQNQA 602
>gi|29135261|ref|NP_705811.8| cysteine protease ATG4D [Mus musculus]
gi|61211815|sp|Q8BGV9.1|ATG4D_MOUSE RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
cysteine endopeptidase; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related cysteine endopeptidase
4; AltName: Full=Autophagy-related protein 4 homolog D
gi|26331508|dbj|BAC29484.1| unnamed protein product [Mus musculus]
gi|26348941|dbj|BAC38110.1| unnamed protein product [Mus musculus]
gi|27763977|emb|CAC85952.1| APG4-D protein [Mus musculus]
gi|47125055|gb|AAH69851.1| Autophagy-related 4D (yeast) [Mus musculus]
gi|148693226|gb|EDL25173.1| autophagy-related 4D (yeast), isoform CRA_b [Mus musculus]
Length = 474
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 158/354 (44%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQD--------CTVYKADVARLLS-WPDPTAE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + + ++H R + +DPS +GFY ++ T
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETL 408
>gi|297669945|ref|XP_002813144.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pongo abelii]
Length = 378
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 96/277 (34%), Positives = 138/277 (49%), Gaps = 11/277 (3%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 36 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 95
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
+L+ F D + S +SIH + Q G G + G W GP + + + LA + L
Sbjct: 96 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIA 154
Query: 226 PMAIYVVSGDEDGERGGAPVVCID----DASRHCSVFSKGQ------ADWTPILLLVPLV 275
V+ R P D+ RHC+ F G + W P++LL+PL
Sbjct: 155 MDNTVVMEEIRRLCRNSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLR 214
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 215 LGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPT 274
Query: 336 KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
D S + + + +DPS+A+GF+C+ +
Sbjct: 275 DGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTE 311
>gi|426339171|ref|XP_004033533.1| PREDICTED: cysteine protease ATG4B isoform 3 [Gorilla gorilla
gorilla]
Length = 379
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/284 (34%), Positives = 143/284 (50%), Gaps = 25/284 (8%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 226 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 268
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
QP + D S + + + +DPS+A+GF+C+ +
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTE 312
>gi|26349259|dbj|BAC38269.1| unnamed protein product [Mus musculus]
Length = 474
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 158/354 (44%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + + ++H R + +DPS +GFY ++ T
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETL 408
>gi|397483837|ref|XP_003813097.1| PREDICTED: cysteine protease ATG4B isoform 4 [Pan paniscus]
Length = 379
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/284 (34%), Positives = 143/284 (50%), Gaps = 25/284 (8%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 226 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 268
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPL 208
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
QP + D S + + + +DPS+A+GF+C+ +
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTE 312
>gi|326925485|ref|XP_003208945.1| PREDICTED: cysteine protease ATG4C-like [Meleagris gallopavo]
Length = 458
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/368 (29%), Positives = 157/368 (42%), Gaps = 77/368 (20%)
Query: 65 SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 113
S S ++LLG C+ DE+ L N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155
Query: 152 -------------RKPLQKPFDREYV-----------EILH-----LFGDSETSPFSIHN 182
R+P +E + E+ H FGDS + F +H
Sbjct: 156 KLTASLEASLTAEREPRILSNHQERIRRNCGDGEMRDEVYHRKIISWFGDSPLAAFGLHQ 215
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
L++ GK G AG W GP + R G + +YV
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQ--------D 262
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V D R CS G+ D +++LVP+ LG E+ N Y+ ++ + +GI+
Sbjct: 263 CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGII 322
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DPS
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDPS 380
Query: 363 LAIGFYCR 370
IGFYCR
Sbjct: 381 CTIGFYCR 388
>gi|340931831|gb|EGS19364.1| cysteine protease-like protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 494
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 109/320 (34%), Positives = 156/320 (48%), Gaps = 54/320 (16%)
Query: 84 ALGDAAGNNG---LAEFNQDFSSRILISYRKGF-------DPIGDSKIT----------- 122
A GDA G F DF SRI ++YR GF DP S ++
Sbjct: 139 AYGDADGTTDGGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSALSFSMRLKTSFGA 198
Query: 123 ------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
SD GWGCM+RS Q L+A ALL RLGR WR+ +RE IL LF D +
Sbjct: 199 DQAGFSSDTGWGCMIRSGQSLLANALLISRLGREWRRGQNPKAERE---ILSLFADDPRA 255
Query: 177 PFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
P+S+HN ++ G +A G G W GP A R +ALA +E + +Y
Sbjct: 256 PYSLHNFVKHGAEACGKFPGEWFGPSATARCIQALANKHESE---------LRVYST--- 303
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
G P V D ++ + + P L+LV LG++K+N Y L T
Sbjct: 304 -----GDLPDVYEDS---FMAIANPDGQHFHPTLVLVCTRLGIDKINKVYEQALISTLQM 355
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIR 352
QS+GI GG+P S Y +GVQ++ YLDPH +P++ +D + + + H+ +R
Sbjct: 356 EQSIGIAGGRPSQSHYFIGVQDQWLFYLDPHYPRPMLPYRENPEDYTQEEVDSCHTRRLR 415
Query: 353 HIHLDSIDPSLAIGFYCRDK 372
H+H++ +DPS+ IGF +D+
Sbjct: 416 HLHVEDLDPSMLIGFLIKDE 435
>gi|145245643|ref|XP_001395089.1| cysteine protease atg4 [Aspergillus niger CBS 513.88]
gi|166990612|sp|A2QY50.1|ATG4_ASPNC RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|134079795|emb|CAK40930.1| unnamed protein product [Aspergillus niger]
Length = 404
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 115/370 (31%), Positives = 169/370 (45%), Gaps = 68/370 (18%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 96
+RI + + P S IW LG+ + +D + N E
Sbjct: 11 KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKDANTRETPDKNNTRENVMGTTNYRKPS 70
Query: 97 -------FNQDFSSRILISYRKGFDPI----GDSK-------------------ITSDVG 126
F DF SRI ++YR F PI GD K TSD G
Sbjct: 71 EHAWPESFLLDFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTG 130
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+RS Q L+A AL LGR WR+ + F+ E ++L LF D+ T+PFS+H ++
Sbjct: 131 WGCMIRSGQSLLANALSMLVLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKH 187
Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G ++ G G W GP A + EAL+ C + + +YV + + +
Sbjct: 188 GAESCGKYPGEWFGPSATAKCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK--- 236
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
D +R+ S + P L+L+ LG++ + P Y L+ FPQS+GI GG+
Sbjct: 237 --FMDIARNTS------GAFQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGR 288
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
P AS Y VG Q YLDPH +P + G+ + + TYH+ +R IH+ +DPS
Sbjct: 289 PSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVDTYHTRRLRRIHVRDMDPS 348
Query: 363 LAIGFYCRDK 372
+ IGF R++
Sbjct: 349 MLIGFLIRNQ 358
>gi|194213171|ref|XP_001491090.2| PREDICTED: cysteine protease ATG4D [Equus caballus]
Length = 424
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 103/355 (29%), Positives = 162/355 (45%), Gaps = 64/355 (18%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E+ GD + F +DF+SR+ ++YR+ F P+
Sbjct: 33 SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 82
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 83 GCLTSDCGWGCMLRSGQMMLAQGLLLHYLPRDWTWAEGAGLGPPEPVGLSSPNRYRGPAR 142
Query: 153 ---------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 203
P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 143 WMAPTLGPGAPPSWSRERRHRQIVSWFADHPRAPFGLHQLVELGQSSGKKAGDWYGP--- 199
Query: 204 CRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 263
+A R + + +YV + A +V D + A
Sbjct: 200 ----SLVAHILRKAVESCAEVTRLVVYVSQDCTVYKADVARLVARPDPT----------A 245
Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YL
Sbjct: 246 EWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYL 305
Query: 324 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
DPH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 306 DPHYCQPTVDVSRADFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETL 358
>gi|148228573|ref|NP_001085611.1| cysteine protease ATG4A [Xenopus laevis]
gi|61211771|sp|Q6GPU1.1|ATG4A_XENLA RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|49115669|gb|AAH73017.1| MGC82614 protein [Xenopus laevis]
Length = 397
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 100/292 (34%), Positives = 149/292 (51%), Gaps = 21/292 (7%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR WR
Sbjct: 45 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALVCQHLGRDWRWE 104
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 105 KHKNHPEEYQQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 164
Query: 215 RAETGLGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSKGQ----- 262
+ +A+Y VV D P C + A+ H S +S+ +
Sbjct: 165 EWNS--------LAVYVSMDNTVVVEDIKTMCKYQPQSCSMAQAASHQSTWSRCRDTSGH 216
Query: 263 -ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G + I
Sbjct: 217 CSGWRPLLLVVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEII 276
Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
YLDPH Q ++ + D + + + + ++DPS+A+GF+C+D+
Sbjct: 277 YLDPHTTQTFVDTEEAGTVQDQTYHCQKGPNSMKVLNLDPSVALGFFCKDEN 328
>gi|449508713|ref|XP_002198788.2| PREDICTED: cysteine protease ATG4C [Taeniopygia guttata]
Length = 456
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 108/367 (29%), Positives = 163/367 (44%), Gaps = 77/367 (20%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 112
S S ++LLG C+ +E+ G+ + G+N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVFLLGKCYHFKTEES-GELSTDGSNFDKISTEISGNVEEFRKDFISRIWLTYREE 94
Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 151
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 95 FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPEALDMESCDWESWTSSTV 154
Query: 152 ---------------------RKPLQKPFD----REYV---EILHLFGDSETSPFSIHNL 183
R P ++ +D R V +I+ FGDS + F +H L
Sbjct: 155 RKLTASLEASLTAERDPKVLARPPARRDWDGTEKRNEVYHRKIISWFGDSPLAAFGLHQL 214
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
++ GK G AG W GP + R G + +YV
Sbjct: 215 IEYGKKSGKMAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD--------C 261
Query: 244 PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
V D R CS+ G+A +++L P+ LG E+ N Y+ ++ + +GI+G
Sbjct: 262 TVYSSDVIDRQCSLVDSGKAGTKAVIILFPVRLGGERTNTDYLEFVKGILSLEYCVGIIG 321
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
GKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DPS
Sbjct: 322 GKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDPSC 379
Query: 364 AIGFYCR 370
IGFYCR
Sbjct: 380 TIGFYCR 386
>gi|291414155|ref|XP_002723329.1| PREDICTED: APG4 autophagy 4 homolog D [Oryctolagus cuniculus]
Length = 408
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 108/358 (30%), Positives = 165/358 (46%), Gaps = 64/358 (17%)
Query: 46 GSMRRIHE---RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
G RR E R SRT S +S + VC + + E GD + F +DF
Sbjct: 4 GGARRPREHGGRWAVKSRTSFSKISS----VHVCGRRYRFEGEGD------IQRFQRDFV 53
Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------- 151
SR+ ++YR+ F P+ +TSD GWGCMLRS QM++AQ+LL H L R W
Sbjct: 54 SRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQSLLLHFLPRDWTWAEGLGSAEP 113
Query: 152 ---------RKPL------------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
R P + +R + +I+ F D +PF +H L++ G++
Sbjct: 114 AGSASPSRYRGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPGAPFGLHRLVELGQSS 173
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G AG W GP +A R + + +YV + A +V D
Sbjct: 174 GKKAGDWYGP-------SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPD 226
Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
+ A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S
Sbjct: 227 PT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRLELCLGIMGGKPRHSL 276
Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
Y +G Q++ +YLDPH QP +++ + D + ++H R + +DPS +GFY
Sbjct: 277 YFIGYQDDFLLYLDPHYCQPTVDVSQTDFPLE--SFHCTSPRKMAFAKMDPSCTVGFY 332
>gi|210032083|ref|NP_001094483.2| autophagy-related 4D [Rattus norvegicus]
gi|149020504|gb|EDL78309.1| rCG31864, isoform CRA_b [Rattus norvegicus]
Length = 473
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 107/354 (30%), Positives = 160/354 (45%), Gaps = 64/354 (18%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 248
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C + + VS D V D +R S + A+
Sbjct: 249 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 295
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + + + ++H R + +DPS +GFY ++ T
Sbjct: 356 PHYCQPTVDVNQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETL 407
>gi|178057055|ref|NP_001116551.1| cysteine protease ATG4D [Sus scrofa]
gi|61211337|sp|Q684M2.1|ATG4D_PIG RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related protein 4 homolog D
gi|51870495|emb|CAG15153.1| AUT-like 4, cysteine endopeptidase [Sus scrofa]
Length = 469
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 104/350 (29%), Positives = 161/350 (46%), Gaps = 59/350 (16%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------KPLQKPF----------- 159
+TSD GWGCMLRS QM++AQ LL H L R W P P
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSQGVGLGPPESSPNRYRGPAHWMPP 192
Query: 160 -----------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 HWVQAAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------S 245
Query: 209 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 268
+A R + + +YV + A +V D + A+W +
Sbjct: 246 LVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKAV 295
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH
Sbjct: 296 VILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYC 355
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 356 QPTVDVSQADFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETL 403
>gi|296485832|tpg|DAA27947.1| TPA: APG4 autophagy 4 homolog D [Bos taurus]
Length = 472
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 106/353 (30%), Positives = 162/353 (45%), Gaps = 62/353 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192
Query: 156 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
Q P +R + +I+ F D +PF +H L++ G+ G AG W GP
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
+A R + + +YV + A +V D + A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
+++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
H QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 356 HYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 406
>gi|195539710|gb|AAI68141.1| Atg4d protein [Rattus norvegicus]
Length = 442
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 107/354 (30%), Positives = 160/354 (45%), Gaps = 64/354 (18%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 53 SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 102
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 103 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 161
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 162 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 217
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C + + VS D V D +R S + A+
Sbjct: 218 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 264
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + + + ++H R + +DPS +GFY ++ T
Sbjct: 325 PHYCQPTVDVNQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETL 376
>gi|149642765|ref|NP_001092616.1| cysteine protease ATG4D [Bos taurus]
gi|148744285|gb|AAI42400.1| ATG4D protein [Bos taurus]
Length = 472
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 106/353 (30%), Positives = 162/353 (45%), Gaps = 62/353 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192
Query: 156 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
Q P +R + +I+ F D +PF +H L++ G+ G AG W GP
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
+A R + + +YV + A +V D + A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
+++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
H QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 356 HYCQPTVDVSQADFPLE--SFHCTSPRRMAFAKMDPSCTVGFYAGDRKEFETL 406
>gi|226294409|gb|EEH49829.1| cysteine protease atg4 [Paracoccidioides brasiliensis Pb18]
Length = 513
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 103/310 (33%), Positives = 153/310 (49%), Gaps = 47/310 (15%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVG 126
G++ A F DF S+I ++YR GF DP S +T +D G
Sbjct: 143 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 202
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+RS Q L+A AL LGR WR+ + D+E +L LF D +PFSIH ++
Sbjct: 203 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 259
Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G A G G W GP A R +AL+ C+ + +YV S D
Sbjct: 260 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 303
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+D R + +A P L+L+ + LG+++V P Y L+ +PQS+GI GG+
Sbjct: 304 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 362
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 362
P +S Y +G Q YLDPH +P + G+ E + ++YH+ +R +H+ +DPS
Sbjct: 363 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 422
Query: 363 LAIGFYCRDK 372
+ IGF +D+
Sbjct: 423 MLIGFLIKDE 432
>gi|225685095|gb|EEH23379.1| peptidase family C54 [Paracoccidioides brasiliensis Pb03]
Length = 508
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 103/310 (33%), Positives = 153/310 (49%), Gaps = 47/310 (15%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVG 126
G++ A F DF S+I ++YR GF DP S +T +D G
Sbjct: 138 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 197
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+RS Q L+A AL LGR WR+ + D+E +L LF D +PFSIH ++
Sbjct: 198 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 254
Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G A G G W GP A R +AL+ C+ + +YV S D
Sbjct: 255 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 298
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+D R + +A P L+L+ + LG+++V P Y L+ +PQS+GI GG+
Sbjct: 299 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 357
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 362
P +S Y +G Q YLDPH +P + G+ E + ++YH+ +R +H+ +DPS
Sbjct: 358 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 417
Query: 363 LAIGFYCRDK 372
+ IGF +D+
Sbjct: 418 MLIGFLIKDE 427
>gi|320166566|gb|EFW43465.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 336
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 90/265 (33%), Positives = 136/265 (51%), Gaps = 25/265 (9%)
Query: 107 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 166
++YR F I DS +D GWGCMLR QML+A+A+ LG+ W +K +E
Sbjct: 36 MTYRNHFAQIADSYYNTDAGWGCMLRCGQMLLARAMTVQHLGKNWAPTSRKQRHQEMARF 95
Query: 167 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
L LF D+ +PFSIH + + G+A G G W GP + + + L QR+ + C
Sbjct: 96 LPLFFDTPAAPFSIHRIAERGEALGKTIGQWFGPNTVAQVLKNLVNSQRSSLIVHCA--- 152
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRY 285
V++ E + A + D +H +L+LVP+ LGL + +NP Y
Sbjct: 153 -MDGVLNRTEASTQLAA---ALSDGKKHS------------LLVLVPIRLGLNQSINPVY 196
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ-PVINIGKDDLEADTS 344
IP L+ T PQ LGI+GGKP A+ + VG E+ +YLDPH VQ + + D +E
Sbjct: 197 IPALKATLELPQCLGIIGGKPNAAHFFVGTVNENVLYLDPHVVQDAAMELTPDTVE---- 252
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYC 369
++ V+ + + +DPS+ + C
Sbjct: 253 SFSVAVLSKMAISDVDPSMCAAYLC 277
>gi|242814606|ref|XP_002486401.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
stipitatus ATCC 10500]
gi|218714740|gb|EED14163.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
stipitatus ATCC 10500]
Length = 454
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 102/305 (33%), Positives = 141/305 (46%), Gaps = 50/305 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 132
F DF RI ++YR GF PI S+ TSD GWGCM+R
Sbjct: 117 FLDDFECRIWMTYRSGFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 176
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 191
S Q L+A AL RLGR WR+ E +L LF D +PFSIH ++ G Y G
Sbjct: 177 SGQSLLANALAISRLGRDWRRGSNST---EENRLLSLFADDPAAPFSIHKFVRHGALYCG 233
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A +AL+ + + G M +YV S + V + +
Sbjct: 234 KHPGEWFGPSATATCIQALSD-EYKDAG-------MNVYVSSDNTYVYEDKFKAVAYNQS 285
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
R P L+L+ LG++++ P Y L PQ+LGI GG+P AS Y
Sbjct: 286 DRM-----------RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQALGIAGGRPSASHY 334
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+GVQ YLDPH +P + DL + + + H+ +R IH+D +DPS+ +GF
Sbjct: 335 FIGVQNSFFFYLDPHHTRPALPYKTGDLAYTQEEIDSCHTRRLRRIHIDDMDPSMLVGFL 394
Query: 369 CRDKG 373
RD+
Sbjct: 395 IRDEN 399
>gi|295657177|ref|XP_002789160.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226284504|gb|EEH40070.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 601
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 108/348 (31%), Positives = 167/348 (47%), Gaps = 51/348 (14%)
Query: 52 HERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRK 111
+ R+ R+G+S + L + + ++G++ A F DF S+I ++YR
Sbjct: 195 YHRLSTSDRSGLSPTRQ----LPFTNNTRPESTSSSSSGHDWPAPFLDDFESKIWLTYRS 250
Query: 112 GF-------DPIGDSKIT----------------SDVGWGCMLRSSQMLVAQALLFHRLG 148
GF DP S +T +D GWGCM+RS Q L+A AL LG
Sbjct: 251 GFPFIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLASALSILSLG 310
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSW 207
R WR+ + D+E +L LF D +PFSIH ++ G A G G W GP A R
Sbjct: 311 RDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEYGASACGKYPGEWFGPSATARCI 367
Query: 208 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 267
+AL+ C+ + +YV S D +D R + +A P
Sbjct: 368 QALSS--------ECKHAGLNVYVTSDGSD---------VYEDRFRTIASGGATEAGIHP 410
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q YLDPH
Sbjct: 411 TLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGRPSSSHYFIGAQGSYFFYLDPHH 470
Query: 328 VQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+P + G+ E + ++YH+ +R +H+ +DPS+ IGF +D+
Sbjct: 471 TRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPSMLIGFLIKDE 518
>gi|431918972|gb|ELK17839.1| Cysteine protease ATG4D [Pteropus alecto]
Length = 442
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 164/354 (46%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 52 SRTSFSKLSS----VHLCGRRYRFETEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 101
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR----------KP--LQKPF------- 159
+TSD GWGCMLRS QM++AQ LL H L R W +P L P+
Sbjct: 102 GYLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWVKGVGLDPPEPSRLASPYWHHGPAC 161
Query: 160 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 162 WIPPHWTQGSPELEQERRHRQIVSWFADHPKAPFGLHQLVELGQSSGKKAGDWYGP---- 217
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 218 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 264
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + + + ++H R + +DPS +GFY D+ T
Sbjct: 325 PHYCQPTVDVSQANFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETL 376
>gi|426218487|ref|XP_004003478.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4B [Ovis
aries]
Length = 454
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 148/319 (46%), Gaps = 42/319 (13%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 69 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 115
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +
Sbjct: 116 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYCRVPP--------------- 160
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA A + L V++ R G
Sbjct: 161 -QMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-AWSALAVHVAMDNTVVMADVRRLCRSGL 218
Query: 244 PVVCID----DASRHCSVFSKG------QADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P + D+ RHC+ F G A W P++LL+PL LGL VN Y TL+ F
Sbjct: 219 PCAGAEAFPADSERHCNGFPAGAEGGECTAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 278
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 279 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 338
Query: 354 IHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 339 MSITELDPSIAVGFFCKTE 357
>gi|425778592|gb|EKV16710.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
digitatum PHI26]
Length = 401
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 113/367 (30%), Positives = 163/367 (44%), Gaps = 68/367 (18%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 96
+RI + P T + S IW LG + A + D A NN +
Sbjct: 9 KRIVQYFWDPEPTNNVPAAS-IWCLG--KEYAPPQPFSDPATNNPHSSSGQPDASTLNDT 65
Query: 97 -----FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWG 128
F DF SRI I+YR F PI +K TSD GWG
Sbjct: 66 AWPNAFVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGFTSDTGWG 125
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 187
CM+RS Q L+A A LGR WR+ + + E +++ +F D +PFSIH + G
Sbjct: 126 CMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIHKFVNRGA 182
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
++ G G W GP A + + L+ A + +YV + D
Sbjct: 183 ESCGKYPGEWFGPSATAKCIQLLSTQSEAHR--------LRVYVTNDTSD---------V 225
Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
+D H S G P L+L+ LG+E V P Y LR T+PQS+GI GG+P
Sbjct: 226 YEDKFAHVSHDRSGCIQ--PTLILIGTRLGIENVTPAYWDGLRAALTYPQSVGIAGGRPS 283
Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAI 365
AS Y +G Q+ +LDPH +P D+L + + +Y++ +R IH+ +DPS+ I
Sbjct: 284 ASHYFLGAQDCHLFFLDPHTTRPATPYRPDELYTQEELDSYYTSRLRRIHIKDMDPSMLI 343
Query: 366 GFYCRDK 372
GF +D+
Sbjct: 344 GFLIKDE 350
>gi|351710014|gb|EHB12933.1| Cysteine protease ATG4D [Heterocephalus glaber]
Length = 607
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 162/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 216 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 265
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 154
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 266 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWIEGPGLAHPELPGSASSSQGRGPAR 325
Query: 155 ----------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L++ + + +I+ F D +P +H L++ G++ G AG W GP
Sbjct: 326 WMPPSCPWGALEREQELRHRQIVSWFADHPRAPLGLHRLVELGQSSGKKAGDWYGP---- 381
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + +A+YV + A +V D + A+
Sbjct: 382 ---SLVAHILRKAVESSSELTHLAVYVSQDCTVYKADVAHLVASPDPA----------AE 428
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 429 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 488
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY ++ L T
Sbjct: 489 PHYCQPTVDVSQADFSLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKELETL 540
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 39/93 (41%), Positives = 52/93 (55%), Gaps = 10/93 (10%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 130 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 179
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 180 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGW 212
>gi|406862068|gb|EKD15120.1| putative cysteine protease atg4 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 441
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 123/369 (33%), Positives = 173/369 (46%), Gaps = 39/369 (10%)
Query: 9 GASKCFSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTS 68
GA+ C S PD + S S S ++ + GS E V G + S
Sbjct: 50 GATACTPSSLPDLKSASAESSRSAQPATPPDSTASSLGSGVHEDEDVGGWPTPFLDDFES 109
Query: 69 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWG 128
IWL +Q A+ + L+ + R + + GF TSD GWG
Sbjct: 110 KIWLT----YRSQFPAIPKSQDPKALSSMSLSVRLRSQLVDQAGF--------TSDTGWG 157
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
CM+RS Q L+A AL+ R+GR WR+ +E I+ LF D+ T+P+SIHN ++ G
Sbjct: 158 CMIRSGQSLLANALVMLRMGRDWRR--GSSASQEERSIISLFADTPTAPYSIHNFVEHGA 215
Query: 189 AY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGDEDGERGGAPVV 246
A G G W GP A R +ALA G QS + +YV G E E +
Sbjct: 216 AACGKHPGEWFGPSATARCIQALAN--------GHQSPELRVYVTGDGLEVYEDSFMKIA 267
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
D GQA + P L+LV LGL+K+ P Y L+ + PQSLGI GG+P
Sbjct: 268 KPD-----------GQA-FIPTLILVGTRLGLDKITPVYWEALKSSLQIPQSLGIAGGQP 315
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSL 363
+S Y +GVQ YLDPH +P + + +D + D + H+ +R IH+ +DPS+
Sbjct: 316 SSSHYFIGVQGHHFFYLDPHQTRPALPLPDNIEDYSQEDIDSCHTRRLRRIHIKEMDPSM 375
Query: 364 AIGFYCRDK 372
I F RD+
Sbjct: 376 LIAFLIRDE 384
>gi|410950450|ref|XP_003981918.1| PREDICTED: cysteine protease ATG4D, partial [Felis catus]
Length = 423
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 163/354 (46%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E+ GD + F +DF SR+ ++YR+ F P+
Sbjct: 33 SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 82
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 83 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSEASGLGPSEPSGLASPNRYRGPAR 142
Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 143 WMPPRWAQGTPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 198
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 199 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 245
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 246 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 305
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 306 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 357
>gi|395840680|ref|XP_003793181.1| PREDICTED: cysteine protease ATG4C [Otolemur garnettii]
Length = 457
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/370 (30%), Positives = 163/370 (44%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ ++E L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENEMLSARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPGALNIENSDSESWTSHTVK 155
Query: 155 ---------------LQKP-------------FDREYVEILH-----LFGDSETSPFSIH 181
L+ P + EI H FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETMRKYSDYHETRNEIYHRKIVSWFGDSPLAFFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + + G + IYV +D
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEAKHPDLQG-----ITIYVA---QDCTVY 267
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
+ V+ ASR S+G D +++LVP+ LG E+ NP Y+ ++ + +GI
Sbjct: 268 NSDVIDTQSASRT----SEGAED-KAVIILVPVRLGGERTNPDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH QP +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQPFVDVSVKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|348550913|ref|XP_003461275.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like [Cavia
porcellus]
Length = 474
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 105/354 (29%), Positives = 162/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSK-LSSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWAEGPGLGSPELPGTASPSPGRSPAR 192
Query: 152 ----RKPLQKP-FDRE--YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P P ++E + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 WVPPRWPRGAPELEQELRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 248
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + +A+YV + A +V D + A+
Sbjct: 249 ---SLVAHILRKAVESSSEVTRLAVYVSQDCTVYKADVAHLVASRDPT----------AE 295
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY ++ T
Sbjct: 356 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETL 407
>gi|194389756|dbj|BAG60394.1| unnamed protein product [Homo sapiens]
Length = 379
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/284 (33%), Positives = 142/284 (50%), Gaps = 25/284 (8%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 226 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 268
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
QP + D S + + + +DPS+A+G +C+ +
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGSFCKTE 312
>gi|121704590|ref|XP_001270558.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
gi|166990611|sp|A1CJ08.1|ATG4_ASPCL RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|119398704|gb|EAW09132.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
Length = 400
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 144/304 (47%), Gaps = 49/304 (16%)
Query: 96 EFNQDFSSRILISYRKGFDPIG----------------------DSK-ITSDVGWGCMLR 132
EF D SRI I+YR F PI DS+ TSD GWGCM+R
Sbjct: 75 EFLDDVESRIWITYRSNFTPIPKPPNQEANPAMTLTVHLRSQLMDSQGFTSDTGWGCMIR 134
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 191
S Q L+A A+L LGR WR+ + + ++LH F D +PFSIH +Q G +
Sbjct: 135 SGQSLLANAMLILLLGRDWRRGTEAGKE---AQLLHQFADHPEAPFSIHRFVQHGAEFCN 191
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R +AL A+ G S + +Y+ D + D
Sbjct: 192 KYPGEWFGPSATARCIQALV----AQQG----SSELRVYITDDTAD--------IYEDKF 235
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+R + D+ P L+LV LG++ V P Y L+ PQS+GI GG+P AS Y
Sbjct: 236 AR---IAQAEHGDFIPTLILVGTRLGIDHVTPAYWDALKEALQLPQSVGIAGGRPSASHY 292
Query: 312 IVGVQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+GV + YLDPH +P ++ + +TYH+ +R IH+ +DPS+ IGF
Sbjct: 293 FIGVHGQYLFYLDPHHTRPASLHQDVNDTLTHEEVNTYHTRRLRRIHIKDMDPSMLIGFI 352
Query: 369 CRDK 372
R +
Sbjct: 353 IRSR 356
>gi|340709295|ref|XP_003393246.1| PREDICTED: cysteine protease ATG4B-like isoform 1 [Bombus
terrestris]
Length = 383
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 106/287 (36%), Positives = 152/287 (52%), Gaps = 16/287 (5%)
Query: 92 NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 146
N + E + +D S++ +YRK F PIG +S TSD GWGCMLR QM++ QAL+
Sbjct: 31 NAIRELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILH 90
Query: 147 LGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
LGR W+ + + Y++IL F D T+ FSIH + G + G G W GP + +
Sbjct: 91 LGRDWQWTAETR-NSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQV 149
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
+ L + +L + V + G V D A V K + W
Sbjct: 150 LKKLVVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGA-----VPLKAPSQWK 204
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+LLL+PL LGL ++NP YI L+ +F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 205 PLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLGVIGGKPNLALYFIGCVENEVIYLDPH 264
Query: 327 DVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q ++GK +++E D +TYH I + IDPS+A+ F+C
Sbjct: 265 TTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPITGIDPSVALCFFC 310
>gi|53132082|emb|CAG31871.1| hypothetical protein RCJMB04_12m14 [Gallus gallus]
Length = 343
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 100/303 (33%), Positives = 139/303 (45%), Gaps = 48/303 (15%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
E D +SR+ +YRK F IG + TSD GWGCMLR QM+ AQAL+ LGR WR
Sbjct: 40 EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWIK 99
Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR---------S 206
K Y +L+ F D + S +SIH + Q G G + G W GP + + +
Sbjct: 100 GKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFDT 159
Query: 207 WEALA----------------RCQRAETGLGCQSLPM----AIYVVSGDEDGERGGAPVV 246
W +LA CQ + G + P +Y +E G R +
Sbjct: 160 WSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPAVEADVLYNGYPEEAGVRDKLSL- 218
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
W P++LL+PL LGL ++N YI TL+ F PQSLG++GGKP
Sbjct: 219 ------------------WKPLVLLIPLRLGLTEINEAYIETLKHCFMMPQSLGVIGGKP 260
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
++ Y +G E IYLDPH QP + D S + + + +DPS+A+
Sbjct: 261 NSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCLPDESFHCQHPPCRMSIAELDPSIAVV 320
Query: 367 FYC 369
C
Sbjct: 321 CSC 323
>gi|449303631|gb|EMC99638.1| hypothetical protein BAUCODRAFT_344306 [Baudoinia compniacensis
UAMH 10762]
Length = 446
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 109/329 (33%), Positives = 164/329 (49%), Gaps = 65/329 (19%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------- 120
A++EALG AEF D +RI ++YR F PI S
Sbjct: 103 AEEEALG------WPAEFMDDMEARIWLTYRNNFPPIAKSSDPSAGSAMSFSTKLRNIGN 156
Query: 121 ---ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 177
TSD GWGCM+RS Q L+A +L +LGR WR+ + + +Y ++ LF D+ +P
Sbjct: 157 SGGFTSDAGWGCMIRSGQTLLANSLATLKLGRDWRRGQK---EDDYKHLISLFADTPEAP 213
Query: 178 FSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
FSIH ++ G +A G G W GP A RS +AL R + GL + P +
Sbjct: 214 FSIHKFVEHGAQACGKHPGEWFGPSATARSVQALTEKYR-DVGLRVYARP---------D 263
Query: 237 DGERGGAPVVCIDDASRHCSVF-SKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRL 291
DG+ V +D S+F + GQ D + P L+++ + LG++++ P Y L+
Sbjct: 264 DGD------VYVD------SLFATAGQMDANDEFQPTLIVLGIRLGIDRITPVYHAALKA 311
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI--NIGKDDLEADTSTYHSD 349
T PQS+GI GG+P +S Y VG Q ++ YLDPH + I N +DL ++ H+
Sbjct: 312 TLEMPQSVGIAGGRPSSSHYFVGHQGDNFFYLDPHTTRQAIPQNPSAEDL----ASCHTR 367
Query: 350 VIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+R + + +DPS+ +GF K V +
Sbjct: 368 RLRRLKIAEMDPSMLLGFLIHSKEEFVEW 396
>gi|431822417|ref|NP_001258916.1| cysteine protease ATG4A isoform 2 [Gallus gallus]
gi|61211756|sp|Q5ZIW7.1|ATG4A_CHICK RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|53134379|emb|CAG32326.1| hypothetical protein RCJMB04_23b20 [Gallus gallus]
Length = 380
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 157/327 (48%), Gaps = 52/327 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 12 VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 60
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W+ K EY ILH F D + +SIH + Q G
Sbjct: 61 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 120
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 121 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 163
Query: 250 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 164 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 223
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 224 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 283
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + ++DPS+A+GF+C+++
Sbjct: 284 HCQQAPHRMKIMNLDPSVALGFFCKEE 310
>gi|431822415|ref|NP_001258915.1| cysteine protease ATG4A isoform 1 [Gallus gallus]
Length = 397
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 157/327 (48%), Gaps = 52/327 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W+ K EY ILH F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 181 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 241 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + ++DPS+A+GF+C+++
Sbjct: 301 HCQQAPHRMKIMNLDPSVALGFFCKEE 327
>gi|350425106|ref|XP_003494013.1| PREDICTED: cysteine protease ATG4B-like [Bombus impatiens]
Length = 383
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 106/287 (36%), Positives = 152/287 (52%), Gaps = 16/287 (5%)
Query: 92 NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 146
N + E + +D S++ +YRK F PIG +S TSD GWGCMLR QM++ QAL+
Sbjct: 31 NAIRELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILH 90
Query: 147 LGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
LGR W+ + + Y++IL F D T+ FSIH + G + G G W GP + +
Sbjct: 91 LGRDWQWTAETR-NSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQV 149
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
+ L + +L + V + G V D A V K + W
Sbjct: 150 LKKLVVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGA-----VPLKAPSQWK 204
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+LLL+PL LGL ++NP YI L+ +F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 205 PLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLGVIGGKPNLALYFIGCVENEVIYLDPH 264
Query: 327 DVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q ++GK +++E D +TYH I + IDPS+A+ F+C
Sbjct: 265 TTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPITGIDPSVALCFFC 310
>gi|340709297|ref|XP_003393247.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Bombus
terrestris]
Length = 386
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 106/287 (36%), Positives = 152/287 (52%), Gaps = 16/287 (5%)
Query: 92 NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 146
N + E + +D S++ +YRK F PIG +S TSD GWGCMLR QM++ QAL+
Sbjct: 34 NAIRELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILH 93
Query: 147 LGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
LGR W+ + + Y++IL F D T+ FSIH + G + G G W GP + +
Sbjct: 94 LGRDWQWTAETR-NSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQV 152
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
+ L + +L + V + G V D A V K + W
Sbjct: 153 LKKLVVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGA-----VPLKAPSQWK 207
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+LLL+PL LGL ++NP YI L+ +F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 208 PLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLGVIGGKPNLALYFIGCVENEVIYLDPH 267
Query: 327 DVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q ++GK +++E D +TYH I + IDPS+A+ F+C
Sbjct: 268 TTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPITGIDPSVALCFFC 313
>gi|301772016|ref|XP_002921445.1| PREDICTED: cysteine protease ATG4D-like [Ailuropoda melanoleuca]
Length = 445
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 162/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 55 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 104
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 105 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSAPSPSEPSGLASPNRYRGPAR 164
Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 165 WMPPRWAQGTPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 220
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 221 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 267
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 268 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 327
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 328 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETL 379
>gi|209969827|ref|NP_001123274.2| autophagy-specific gene 4 [Nasonia vitripennis]
Length = 405
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 158/314 (50%), Gaps = 26/314 (8%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD--SK 120
I + + +W+LG + +D + +D SR+ +YRKGF PIG S
Sbjct: 46 IPQTENSVWVLGKKYNAKKD-----------IDAIRRDIRSRLWFTYRKGFVPIGGFGST 94
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPF 178
TSD GWGCMLR QM++ QAL+ LGR WR P R Y+ IL F D +P+
Sbjct: 95 FTSDKGWGCMLRCGQMVLGQALISLHLGRDWR---WTPETRSSTYLNILRRFEDRRAAPY 151
Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
SIH + G + G G W GP + + + L + +L + V +
Sbjct: 152 SIHQIALMGASEGKDVGQWFGPNTIAQVLKKLVVYDDWSSITIHVALDNTLVVNDVVQQC 211
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
GA +D K + W P+LLL+PL LGL ++NP YI L+ +F FPQS
Sbjct: 212 RVEGATTAEVDGEKPL-----KAPSQWKPLLLLIPLRLGLNEINPIYINGLKTSFQFPQS 266
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV--INIGKDDLEADT-STYHSDVIRHIH 355
LG++GGKP + Y +G + I+LDPH Q ++ DD EA+ +TYH + I
Sbjct: 267 LGLIGGKPSHALYFIGYVGDEVIFLDPHTTQRAGSVDQKSDDNEAEVDATYHCKIASRIP 326
Query: 356 LDSIDPSLAIGFYC 369
+ +DPS+A+ F+C
Sbjct: 327 ITGMDPSVALCFFC 340
>gi|281337397|gb|EFB12981.1| hypothetical protein PANDA_010312 [Ailuropoda melanoleuca]
Length = 428
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 162/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 38 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 87
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 88 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSAPSPSEPSGLASPNRYRGPAR 147
Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 148 WMPPRWAQGTPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 203
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 204 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 250
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 251 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 310
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 311 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETL 362
>gi|378731837|gb|EHY58296.1| autophagy-like protein 4 [Exophiala dermatitidis NIH/UT8656]
Length = 480
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 103/316 (32%), Positives = 145/316 (45%), Gaps = 56/316 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLRSS 134
F DF SRI ++YR F PI S+ TSD GWGCM+RS
Sbjct: 114 FLDDFESRIWMTYRSNFTPIPRSQEPSRASSMSFSVRLRNLTEREGFTSDTGWGCMIRSG 173
Query: 135 QMLVAQALLFHRLGRPWRK-------------PLQKPFDREYVEILHLFGDSETSPFSIH 181
Q L+A L+ LGR WR+ + EIL LF DS +PFSIH
Sbjct: 174 QSLLANTLMLLHLGRDWRRDHTHTPTTSDSKPSSSSSSTKREAEILSLFADSPDAPFSIH 233
Query: 182 NLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
+Q G A G G W GP A A C R E C + + +YV +
Sbjct: 234 RFVQHGASACGKHPGQWFGP-------SATASCIR-ELSTECAAAGLRVYVTPSASE--- 282
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
+D R + S P L+L + LGL+++ P Y L+ + T+PQS+G
Sbjct: 283 ------LYEDRFRSIAAASPSDPTIKPTLILFGIRLGLDRITPVYHEALKSSLTYPQSIG 336
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLD 357
I GG+P +S Y VG Q + YLDPH+ +P + D E + +T H+ +R + ++
Sbjct: 337 IAGGRPSSSHYFVGCQGDLFFYLDPHETRPALPHHASPADYSEEEIATCHTRRLRGLRIN 396
Query: 358 SIDPSLAIGFYCRDKG 373
+DPS+ IGF +D+
Sbjct: 397 EMDPSMLIGFLIKDEA 412
>gi|367047453|ref|XP_003654106.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
gi|347001369|gb|AEO67770.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
Length = 454
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 107/303 (35%), Positives = 148/303 (48%), Gaps = 50/303 (16%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 133
F DF SRI ++YR GF DP +S ++ SD GWGCM+RS
Sbjct: 118 FLDDFESRIWMTYRTGFELIPRSTDPRANSALSFAMRLKTSFGDQTGFSSDTGWGCMIRS 177
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A AL RLGR WR+ +RE IL LF D +P+S+HN ++ G A G
Sbjct: 178 GQSLLANALQISRLGRDWRRATDPDAERE---ILSLFADDPRAPYSLHNFVKHGAAACGK 234
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R EALA + E+ L S G P V D
Sbjct: 235 YPGEWFGPSATARCIEALA--NQHESSLRVYST---------------GDLPDVYEDS-- 275
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+V + + P L+LV LG++K+N Y L T QS+GI GG+P +S Y
Sbjct: 276 -FMAVANPDGEHFHPTLILVCTRLGIDKINQVYEEALISTLQMEQSIGIAGGRPSSSHYF 334
Query: 313 VGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VGVQ + YLDPH +P + +D + + H+ +RH+H++ +DPS+ IGF
Sbjct: 335 VGVQGQWLFYLDPHHPRPALPYREAPEDYTSEELGSCHTRRLRHLHVEDMDPSMLIGFLI 394
Query: 370 RDK 372
+D+
Sbjct: 395 KDE 397
>gi|238506146|ref|XP_002384275.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
NRRL3357]
gi|220690389|gb|EED46739.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
NRRL3357]
Length = 439
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 161/368 (43%), Gaps = 66/368 (17%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 91
+RI + + P + IW LGV + KI QDE + D +
Sbjct: 47 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 106
Query: 92 NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 128
F DF S+I ++YR F PI TSD GWG
Sbjct: 107 GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 166
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 187
CM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH ++ G
Sbjct: 167 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 223
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
++ G G W GP A R EAL+ C ++ +YV + D V
Sbjct: 224 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 267
Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
D R V G P L+L+ LG++ V P Y L+ PQS+GI GG+P
Sbjct: 268 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 324
Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLA 364
AS Y +G Q YLDPH +P + D + + STYH+ +R IH+ +DPS+
Sbjct: 325 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDMDPSML 384
Query: 365 IGFYCRDK 372
IGF R++
Sbjct: 385 IGFLVRNE 392
>gi|326470473|gb|EGD94482.1| hypothetical protein TESG_01998 [Trichophyton tonsurans CBS 112818]
Length = 469
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 104/316 (32%), Positives = 152/316 (48%), Gaps = 62/316 (19%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 127
+F DF S++ I+YR F PI + TSD GW
Sbjct: 130 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 189
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
GCM+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 190 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 246
Query: 188 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 245
A G G W GP A + +AL + + GL +Y+ S G + E+ V
Sbjct: 247 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 298
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
C + P L+L+ + LG+++V P Y +L+ FPQS+GI GG+
Sbjct: 299 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 346
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHL 356
P +S Y + Q +S YLDPH +P + + D E+ + STYH+ +R +H+
Sbjct: 347 PSSSHYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHI 406
Query: 357 DSIDPSLAIGFYCRDK 372
+DPS+ IGF RD+
Sbjct: 407 REMDPSMLIGFLVRDE 422
>gi|317151014|ref|XP_001824388.2| cysteine protease atg4 [Aspergillus oryzae RIB40]
Length = 402
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 161/368 (43%), Gaps = 66/368 (17%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 91
+RI + + P + IW LGV + KI QDE + D +
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 70
Query: 92 NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 128
F DF S+I ++YR F PI TSD GWG
Sbjct: 71 GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 130
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 187
CM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH ++ G
Sbjct: 131 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 187
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
++ G G W GP A R EAL+ C ++ +YV + D V
Sbjct: 188 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 231
Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
D R V G P L+L+ LG++ V P Y L+ PQS+GI GG+P
Sbjct: 232 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 288
Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLA 364
AS Y +G Q YLDPH +P + D + + STYH+ +R IH+ +DPS+
Sbjct: 289 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDMDPSML 348
Query: 365 IGFYCRDK 372
IGF R++
Sbjct: 349 IGFLVRNE 356
>gi|83773128|dbj|BAE63255.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|325504923|dbj|BAJ83603.1| cysteine protease Atg4 [Aspergillus oryzae]
Length = 356
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 107/328 (32%), Positives = 151/328 (46%), Gaps = 32/328 (9%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 108
+RI + + P + IW LGV + + + +N A + RI
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
DP G TSD GWGCM+RS Q L+A A+L LGR WR+ + E +L
Sbjct: 71 L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121
Query: 169 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
LF D +P SIH ++ G ++ G G W GP A R EAL+ C ++
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
+YV + D V D R V G P L+L+ LG++ V P Y
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 344
L+ PQS+GI GG+P AS Y +G Q YLDPH +P + D + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
TYH+ +R IH+ +DPS+ IGF R++
Sbjct: 283 TYHTRRLRRIHIQDMDPSMLIGFLVRNE 310
>gi|395850895|ref|XP_003798008.1| PREDICTED: cysteine protease ATG4D [Otolemur garnettii]
Length = 471
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 98/321 (30%), Positives = 148/321 (46%), Gaps = 51/321 (15%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
G + F +DF SR+ +YR+ F P+ +TSD GWGCMLRS QM++AQ LL H L R
Sbjct: 104 GEGDIQRFQRDFVSRLWFTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 163
Query: 150 PW--------------------RKPLQKPFDR------------EYVEILHLFGDSETSP 177
W R P + R ++ +I+ F D +P
Sbjct: 164 DWTWAEGRGLGPPELLASPSQYRVPARWMPPRWAQGTPELEQEHQHRQIVSWFADHPQAP 223
Query: 178 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
FS+H L++ G++ G AG W GP +A R + + +YV
Sbjct: 224 FSLHRLVELGQSLGKKAGDWYGP-------SVVAHILRKAVESCSEVTHLVVYVSQDCTV 276
Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
+ A +V D + A+W +++LVP+ LG E +NP Y+P ++
Sbjct: 277 YKADVARLVARPDPT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSEL 326
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
LGI+GGKP S Y +G Q++ +YLDPH QP ++I + D + ++H R +
Sbjct: 327 CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDISQADFPLE--SFHCTAPRKMAFT 384
Query: 358 SIDPSLAIGFYCRDKGLLVTF 378
+DPS +GFY K T
Sbjct: 385 KMDPSCTVGFYAGGKKEFETL 405
>gi|395512609|ref|XP_003760528.1| PREDICTED: cysteine protease ATG4D [Sarcophilus harrisii]
Length = 453
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 95/317 (29%), Positives = 147/317 (46%), Gaps = 53/317 (16%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
G + F +DF SR+ ++YR+ F P+ +TSD GWGCMLRS QML+AQ LL H R
Sbjct: 84 GEGDIQRFQRDFVSRLWLTYRRDFPPLEGGSLTSDCGWGCMLRSGQMLLAQGLLLHFFSR 143
Query: 150 PW-----------RKPL---------------------QKPFDRE--YVEILHLFGDSET 175
W R+P + F++E + I+ F D
Sbjct: 144 DWTWSEAVLHPGPREPELLRTMSPSRVGPPGPPAGALSPREFEQEEQHRRIVSWFADQPG 203
Query: 176 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
+PF +H L++ G++ G AG W GP +A R + + +YV
Sbjct: 204 APFGLHRLVELGRSSGKRAGDWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDC 256
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+ A +V D S +W I++LVP+ LG E +NP Y+P ++
Sbjct: 257 TVYKADVAQLVAQPDPS----------TEWKSIVILVPVRLGGETLNPVYVPCVKELLRL 306
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
+GI+GGKP S Y +G Q++ +YLDPH QP ++ ++ + ++H R +
Sbjct: 307 ELCIGIIGGKPRHSLYFIGYQDDFLLYLDPHYCQPFVDTSQESFPLE--SFHCTSPRKMA 364
Query: 356 LDSIDPSLAIGFYCRDK 372
+DPS IGFY ++
Sbjct: 365 FSRMDPSCTIGFYAGNR 381
>gi|326478657|gb|EGE02667.1| cysteine protease atg4 [Trichophyton equinum CBS 127.97]
Length = 454
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 104/316 (32%), Positives = 152/316 (48%), Gaps = 62/316 (19%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 127
+F DF S++ I+YR F PI + TSD GW
Sbjct: 115 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 174
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
GCM+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 175 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 231
Query: 188 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 245
A G G W GP A + +AL + + GL +Y+ S G + E+ V
Sbjct: 232 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 283
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
C + P L+L+ + LG+++V P Y +L+ FPQS+GI GG+
Sbjct: 284 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 331
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHL 356
P +S Y + Q +S YLDPH +P + + D E+ + STYH+ +R +H+
Sbjct: 332 PSSSHYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHI 391
Query: 357 DSIDPSLAIGFYCRDK 372
+DPS+ IGF RD+
Sbjct: 392 REMDPSMLIGFLVRDE 407
>gi|212545090|ref|XP_002152699.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
marneffei ATCC 18224]
gi|210065668|gb|EEA19762.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
marneffei ATCC 18224]
Length = 489
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 100/304 (32%), Positives = 139/304 (45%), Gaps = 49/304 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 132
F DF S+I ++YR F PI S+ TSD GWGCM+R
Sbjct: 153 FLDDFESKIWMTYRSNFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 212
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 191
S QML+A AL RLGR WR+ E ++L LF D +PFSIH ++ G Y G
Sbjct: 213 SGQMLLANALAISRLGRDWRRVSHT---TEENKLLSLFADDPAAPFSIHRFVRHGALYCG 269
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A +AL+ + M +YV S +D
Sbjct: 270 KHPGEWFGPSATATCIQALSEEYKVAG--------MNVYVSSDS---------TYVYEDK 312
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ + G P L+L+ LG++++ P Y L PQSLGI GG+P +S Y
Sbjct: 313 FKAVAYNQPGHM--RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQSLGIAGGRPSSSHY 370
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+GVQ YLDPH +P + D + + H+ +R IH+D +DPS+ +GF
Sbjct: 371 FIGVQNSFFFYLDPHHTRPALPHKVDSAYTQEQVDSCHTRRLRRIHIDDMDPSMLVGFLI 430
Query: 370 RDKG 373
RD+
Sbjct: 431 RDEN 434
>gi|57101974|ref|XP_542069.1| PREDICTED: cysteine protease ATG4D isoform 1 [Canis lupus
familiaris]
Length = 473
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 162/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGPGLGPSEPAGLASPNRYRGPAR 192
Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 WMPPRWAQGTPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 248
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 249 ---SLVAHILRKAVESCSEITRLVVYVSQDCTVYKADVARLVARPDPT----------AE 295
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 356 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETL 407
>gi|391868733|gb|EIT77943.1| cysteine protease required for autophagy - Apg4p/Aut2p [Aspergillus
oryzae 3.042]
Length = 357
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 107/328 (32%), Positives = 151/328 (46%), Gaps = 32/328 (9%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 108
+RI + + P + IW LGV + + + +N A + RI
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
DP G TSD GWGCM+RS Q L+A A+L LGR WR+ + E +L
Sbjct: 71 L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121
Query: 169 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
LF D +P SIH ++ G ++ G G W GP A R EAL+ C ++
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
+YV + D V D R V G P L+L+ LG++ V P Y
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 344
L+ PQS+GI GG+P AS Y +G Q YLDPH +P + D + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPYFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
TYH+ +R IH+ +DPS+ IGF R++
Sbjct: 283 TYHTRRLRRIHIQDMDPSMLIGFLVRNE 310
>gi|325091702|gb|EGC45012.1| cysteine protease [Ajellomyces capsulatus H88]
Length = 508
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 109/350 (31%), Positives = 163/350 (46%), Gaps = 57/350 (16%)
Query: 58 PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 113
P+R+ S++ LL H+ + LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144
Query: 114 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
DP + T+D GWGCM+RS Q L+A AL LGR WR+
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRR 204
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 212
+ +E ++L LF D +PFSIH ++ G A G G W GP A R +AL+
Sbjct: 205 GTKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATARCIQALSS 261
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG------QADWT 266
C+ + +YV S D +D R ++ S G D
Sbjct: 262 --------ECEHAGLNVYVTSDGSD---------VYEDRFR--AIASAGGTGAGTSTDVH 302
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q YLDPH
Sbjct: 303 PTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFYLDPH 362
Query: 327 DVQPVI---NIGKDDLEADT-STYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+P + + G + +TYH+ +R +H+ +DPS+ IGF RD+
Sbjct: 363 HTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDE 412
>gi|348529755|ref|XP_003452378.1| PREDICTED: cysteine protease ATG4C-like [Oreochromis niloticus]
Length = 478
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 110/381 (28%), Positives = 164/381 (43%), Gaps = 82/381 (21%)
Query: 65 SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLA-----EFNQDFSSRILISYRKGFDPIGD 118
S S + LLG C H A+DE A L F +DF+SR+ ++YR+ F P+
Sbjct: 36 SRNSPVLLLGKCYHFKAEDEESPTEASVEDLVMGDVDAFRRDFASRVWLTYREEFSPLPG 95
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE------------- 162
S +TSD GWGCMLR+ QM++AQ L+ H LGR W + L +P D E
Sbjct: 96 STLTSDCGWGCMLRAGQMMLAQGLMLHFLGRDWTWSEALTLQPLDTETWTTTAAKRLVAS 155
Query: 163 ---------------------------------------YVEILHLFGDSETSPFSIHNL 183
+ ++ FGDS ++P +H L
Sbjct: 156 LEASLQGVPGPSVRSSSPQAQALSLGSAEEADAHLKEMYHRTLVSWFGDSPSTPLGLHRL 215
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS--LPMAIYVVSGD------ 235
++ G G AG W GP + + + + GL C + + V S D
Sbjct: 216 VRLGLTMGKQAGDWYGPAVVAHILKKAVE-EAMDPGLACITAYVSQDCTVYSADVVDCHR 274
Query: 236 ------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 289
E AP + +D H S + +A +++LVP+ LG EK NP Y
Sbjct: 275 APRAERTSDETPDAPTLPQNDQPAHASTLPESRA----VIILVPVRLGGEKTNPEYFDFA 330
Query: 290 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSD 349
+ + +GI+GGKP + Y VG Q++S IY+DPH Q +++ D +YH
Sbjct: 331 KSILSLEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSYHCP 388
Query: 350 VIRHIHLDSIDPSLAIGFYCR 370
+ + +DPS +GFY R
Sbjct: 389 SPKKMPFSKMDPSCTVGFYSR 409
>gi|344282757|ref|XP_003413139.1| PREDICTED: cysteine protease ATG4D-like [Loxodonta africana]
Length = 473
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 164/354 (46%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S ++L G ++ E+ GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSK-ISSVYLCGHRYRF---ESEGD------IQRFQRDFMSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPLQ 156
+TSD GWGCMLRS QML+AQ LL H L R W R P +
Sbjct: 133 GCLTSDCGWGCMLRSGQMLLAQGLLLHFLPRDWTWAEGSGLGPPELSGSASPSRYRGPAR 192
Query: 157 K----------PFDREYV--EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ ++E+ +I+ F D +PF +H L+ G++ G AG W GP
Sbjct: 193 RVPPHWAQCTPELEQEHWHRQIVSWFADHPQAPFGLHRLVALGQSSGKKAGDWYGP---- 248
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D +A+
Sbjct: 249 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDP----------KAE 295
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 356 PHYCQPSVDVSQADFSLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETL 407
>gi|119195519|ref|XP_001248363.1| cysteine protease atg4 [Coccidioides immitis RS]
gi|303321428|ref|XP_003070708.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
SOWgp]
gi|121769827|sp|Q1E5M9.1|ATG4_COCIM RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|240110405|gb|EER28563.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
SOWgp]
gi|320040173|gb|EFW22106.1| cysteine protease atg4 [Coccidioides posadasii str. Silveira]
gi|392862420|gb|EAS36938.2| cysteine protease atg4 [Coccidioides immitis RS]
Length = 432
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 138/297 (46%), Gaps = 50/297 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF S+ +YR F I S+ T+D GWGCM+RS
Sbjct: 105 FLDDFESKFWFTYRSNFPAIPKSRDPDTPLALTLSVRLRSQFLDTHGFTADTGWGCMIRS 164
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A AL LGR WR+ + +E E+L LF D+ +PFSIH + G A G
Sbjct: 165 GQSLLANALSILNLGRDWRRGSKI---KEECELLSLFADNPQAPFSIHRFVDYGASACGK 221
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R EAL+ C+ + +YV+S D + D
Sbjct: 222 HPGEWFGPSATARCIEALSN--------ECKHTDLNVYVMSDGSDVHEDQFRQIAGPDGI 273
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
R P L+L+ + LG+E V P Y LR +PQS+GI GG+P +S Y
Sbjct: 274 R-------------PTLILLGVRLGIESVTPVYWEALRAIIRYPQSVGIAGGRPSSSLYF 320
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+GVQ YLDPH +P ++ D + TYH+ +R +H+ +DPS+ IGF
Sbjct: 321 IGVQGPYFFYLDPHHTRPAVSWNPDSTLSPENLDTYHTRRLRRLHIREMDPSMLIGF 377
>gi|417401539|gb|JAA47652.1| Putative cysteine protease required for autophagy [Desmodus
rotundus]
Length = 473
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 161/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + E+ GD + F +DF SR+ ++YR+ F P
Sbjct: 83 SRTRFSKISS----VHLCGRRYCFESEGD------IQRFQRDFVSRLWLTYRRDFPPFAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWARGASLSPPEPSGLASSNRYRGPAH 192
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
++ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 CMTPCWAQRAPELEQERRHRQIVSWFADHPQAPFGLHQLVELGQSSGKKAGDWYGP---- 248
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 249 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 295
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 356 PHYCQPAVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 407
>gi|296232881|ref|XP_002761778.1| PREDICTED: cysteine protease ATG4D [Callithrix jacchus]
Length = 474
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 161/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPF---------- 159
+TSD GWGCMLRS QM++AQ LL H L R W L P
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGPASPSWYHGPAR 193
Query: 160 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPCWAQGAPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D S A+
Sbjct: 250 ---SLVAHILRKAVESSSEVTRLLVYVSQDCTVYKADVARLVARPDPS----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WNSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + + + ++H R + +DPS +GFY D+ T
Sbjct: 357 PHYCQPTVDVSQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 408
>gi|37991904|gb|AAR06350.1| putative autophagy, 3'-partial [Oryza sativa Japonica Group]
Length = 207
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 84/162 (51%), Positives = 103/162 (63%), Gaps = 7/162 (4%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + NRSL S ++R+ GSM R LG S+ SS D+W L
Sbjct: 53 FEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF----LGASKALTSS---DVWFL 105
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E + +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 106 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 165
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 175
SQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE
Sbjct: 166 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEA 207
>gi|397476492|ref|XP_003809633.1| PREDICTED: cysteine protease ATG4D isoform 2 [Pan paniscus]
Length = 411
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 161/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 345
>gi|121934653|sp|Q0U199.1|ATG4_PHANO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
Length = 467
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 106/275 (38%), Positives = 139/275 (50%), Gaps = 42/275 (15%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
N + F DF SR+ ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
+RS Q ++A AL RLGR WR + D+++ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209
Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + LA R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVYV-SGD------GADVY--E 252
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + +V G W P L+LV LG++K+ P Y L+ + PQS+GI GG+P AS
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKASLQIPQSIGIAGGRPSAS 310
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
Y VGVQ + YLDPH +P++ L A TS
Sbjct: 311 HYFVGVQGNNFYYLDPHSTRPLLPFHPPSLAAATS 345
>gi|114675367|ref|XP_512373.2| PREDICTED: cysteine protease ATG4D [Pan troglodytes]
Length = 411
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 161/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 345
>gi|261195783|ref|XP_002624295.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
gi|239587428|gb|EEQ70071.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
Length = 494
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 105/338 (31%), Positives = 156/338 (46%), Gaps = 54/338 (15%)
Query: 67 TSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF-------DPIGDS 119
S + L H G + A F DF S+I ++YR F DP S
Sbjct: 89 NSQVPLFANHHGSTTANPSGQQGQQDWPAAFLDDFESKIWLTYRSSFPLIPKSSDPNAAS 148
Query: 120 KIT----------------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY 163
+T +D GWGCM+RS Q L+A AL LGR WR+ + +E
Sbjct: 149 AMTLGVRLRSQLVDPQGFTTDTGWGCMIRSGQSLLANALAILFLGREWRRGTKV---KEE 205
Query: 164 VEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+L LF D +PFSIH ++ G A G G W GP A R +AL+ C
Sbjct: 206 SNLLSLFADDPRAPFSIHRFVEHGASACGKYPGEWFGPSATARCIQALSS--------EC 257
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG----QADWTPILLLVPLVLGL 278
+ + +YV S D +D R ++ S G D P L+L+ + LG+
Sbjct: 258 KHAGLNVYVTSDGSD---------VYED--RFRAIASGGGTGTSTDIRPTLILLGIRLGI 306
Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV----INI 334
++V P Y L+ +PQ++GI GG+P +S Y +G Q YLDPH +P + +
Sbjct: 307 DRVTPVYWEALKAVLKYPQAVGIAGGRPSSSHYFIGAQGSHFFYLDPHHTRPALPYHVPV 366
Query: 335 GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + +TYH+ +R +H+ +DPS+ IGF RD+
Sbjct: 367 DQQYTDEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDE 404
>gi|402904208|ref|XP_003914939.1| PREDICTED: cysteine protease ATG4D isoform 2 [Papio anubis]
Length = 411
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 161/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 130
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 345
>gi|397476490|ref|XP_003809632.1| PREDICTED: cysteine protease ATG4D isoform 1 [Pan paniscus]
Length = 474
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 161/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 408
>gi|380796527|gb|AFE70139.1| cysteine protease ATG4D, partial [Macaca mulatta]
Length = 439
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 161/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 49 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 98
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 99 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 158
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 159 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 214
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 215 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 261
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 262 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 321
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 322 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 373
>gi|239614382|gb|EEQ91369.1| cysteine protease atg4 [Ajellomyces dermatitidis ER-3]
gi|327351393|gb|EGE80250.1| cysteine protease atg4 [Ajellomyces dermatitidis ATCC 18188]
Length = 494
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 105/338 (31%), Positives = 156/338 (46%), Gaps = 54/338 (15%)
Query: 67 TSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF-------DPIGDS 119
S + L H G + A F DF S+I ++YR F DP S
Sbjct: 89 NSQVPLFANHHGSTTANPPGQQGQQDWPAAFLDDFESKIWLTYRSSFPLIPKSSDPNAAS 148
Query: 120 KIT----------------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY 163
+T +D GWGCM+RS Q L+A AL LGR WR+ + +E
Sbjct: 149 AMTLGVRLRSQLVDPQGFTTDTGWGCMIRSGQSLLANALAILFLGREWRRGTKV---KEE 205
Query: 164 VEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+L LF D +PFSIH ++ G A G G W GP A R +AL+ C
Sbjct: 206 SNLLSLFADDPRAPFSIHRFVEHGASACGKYPGEWFGPSATARCIQALSS--------EC 257
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG----QADWTPILLLVPLVLGL 278
+ + +YV S D +D R ++ S G D P L+L+ + LG+
Sbjct: 258 KHAGLNVYVTSDGSD---------VYED--RFRAIASGGGTGTSTDIRPTLILLGIRLGI 306
Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV----INI 334
++V P Y L+ +PQ++GI GG+P +S Y +G Q YLDPH +P + +
Sbjct: 307 DRVTPVYWEALKAVLKYPQAVGIAGGRPSSSHYFIGAQGSHFFYLDPHHTRPALPYHVPV 366
Query: 335 GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + +TYH+ +R +H+ +DPS+ IGF RD+
Sbjct: 367 DQQYTDEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDE 404
>gi|402904206|ref|XP_003914938.1| PREDICTED: cysteine protease ATG4D isoform 1 [Papio anubis]
Length = 474
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 161/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 408
>gi|149709514|ref|XP_001500964.1| PREDICTED: cysteine protease ATG4C [Equus caballus]
Length = 458
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 159/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF+SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHIIAGNVEEFRKDFTSRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDFESWTSNTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSEERELKTPTISLKETIGRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCASMASDHADDKAVIILVPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|432099562|gb|ELK28703.1| Cysteine protease ATG4D, partial [Myotis davidii]
Length = 392
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 165/354 (46%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + E+ GD + F +DF+SR+ ++YR+ F P+
Sbjct: 5 SRTSFSKISS----VHLCGRRYCFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 54
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 55 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGAGLSPPEPSGLASPNRHHGLAH 114
Query: 153 -KPLQ-----KPFDREYV--EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
KP + ++E+ +I+ F D +PF +H L++ G+++G AG W GP
Sbjct: 115 WKPPRWAQGAPELEQEHWHRQIVSWFADHPQAPFGLHQLVELGQSWGKKAGDWYGP---- 170
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 171 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDCT----------AE 217
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++S +YLD
Sbjct: 218 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDSLLYLD 277
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + + ++H R + +DPS +GFY ++ T
Sbjct: 278 PHYCQPTVDVSQAGFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGNRKEFETL 329
>gi|341885317|gb|EGT41252.1| hypothetical protein CAEBREN_15768 [Caenorhabditis brenneri]
Length = 457
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 156/337 (46%), Gaps = 60/337 (17%)
Query: 84 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 142
ALG + +G+ + SSR +YRK F PIG + TSD GWGCMLR +QML+ + L
Sbjct: 34 ALGKEITEEDGIEAMKKYMSSRFWFTYRKDFSPIGGTGPTSDQGWGCMLRCAQMLLGEVL 93
Query: 143 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
L +GR + ++ Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 94 LRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIHQIAQMGVTEGKEISKWFGPNT 152
Query: 203 MCR---------SWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 246
+ W +A + L + +L MA S D E+G+
Sbjct: 153 AAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTYPSEDAVKLIMENGQ------- 205
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
+H + + + +W P+LL++PL LGL +N Y+P ++ F PQ +GI+GGKP
Sbjct: 206 ----VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCYLPAIQEFFKLPQCVGIIGGKP 261
Query: 307 GASTYIVGVQEESAIYLDPHDVQPV------------------INIGK-DDLE------- 340
+ Y VG+ YLDPH +P N + +DLE
Sbjct: 262 NLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTESEQHDTNFSELEDLEPLPSQTS 321
Query: 341 -----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
D STYH +++ + +SIDPSLA+ +C +
Sbjct: 322 DVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESR 358
>gi|410226434|gb|JAA10436.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410263516|gb|JAA19724.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410294648|gb|JAA25924.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410328737|gb|JAA33315.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
Length = 474
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 161/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 408
>gi|109123366|ref|XP_001101860.1| PREDICTED: cysteine protease ATG4D-like isoform 1 [Macaca mulatta]
Length = 474
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 161/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 408
>gi|367032280|ref|XP_003665423.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
42464]
gi|347012694|gb|AEO60178.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
42464]
Length = 456
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/317 (35%), Positives = 159/317 (50%), Gaps = 57/317 (17%)
Query: 87 DAAGNNGLAE-FNQDFSSRILISYRKGF-------DP---------------IGD-SKIT 122
+++G++G F DF SRI ++YR GF DP +GD + T
Sbjct: 111 ESSGDSGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSSFSIAMRLKTTLGDQTGFT 170
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCM+RS Q L+A ALL RLGR WR+ +R IL LF D +P+S+HN
Sbjct: 171 SDTGWGCMIRSGQSLLANALLISRLGRDWRRMTDPDAERP---ILALFADDSRAPYSLHN 227
Query: 183 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
++ G+ A G G W GP A R +ALA + E+ L S G
Sbjct: 228 FVKHGELACGKYPGEWFGPSATARCIQALA--NKHESSLRVYST---------------G 270
Query: 242 GAPVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
P V D S + + D + P L+LV LG++K+N Y+ L T QS
Sbjct: 271 DLPDVYED------SFMATAKPDGETFHPTLILVCTRLGIDKINQVYVEALISTLQMEQS 324
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI--GKDDLEADT-STYHSDVIRHIH 355
+GI GG+P +S Y VGVQ + YLDPH +P + DD ++ + H+ +R +H
Sbjct: 325 IGIAGGRPASSHYFVGVQGQWLFYLDPHHPRPKLPYRENPDDYTSEELDSCHTRRLRRLH 384
Query: 356 LDSIDPSLAIGFYCRDK 372
++ +DPS+ IGF +D+
Sbjct: 385 VEDMDPSMLIGFLIKDE 401
>gi|297276108|ref|XP_002801111.1| PREDICTED: cysteine protease ATG4D-like isoform 2 [Macaca mulatta]
Length = 497
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 161/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 107 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 156
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 157 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 216
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 217 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 272
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 273 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 319
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 320 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 379
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 380 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 431
>gi|298231125|ref|NP_001177213.1| cysteine protease ATG4C [Sus scrofa]
gi|296874486|gb|ADH81748.1| autophagy related 4-like protein C [Sus scrofa]
Length = 458
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 158/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKLLPARSGCTIKDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
+ S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQLEGSALTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPDALNIENSDSESWTSNTAK 155
Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGRYSDDREKQNEIYHRKIISWFGDSPLTLFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCASMAPDNTDDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|380485578|emb|CCF39271.1| cysteine protease atg4 [Colletotrichum higginsianum]
Length = 454
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 139/303 (45%), Gaps = 49/303 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF S+ ++YR F I S TSD GWGCM+RS
Sbjct: 118 FLDDFESKFWMTYRSEFQAIAKSTDPRASSTLSFSMRIKSQLVDQNGFTSDSGWGCMIRS 177
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 192
Q L+A A+ LGR WR+ Q P D ++L F D +P+SIH +Q G A G
Sbjct: 178 GQSLLANAMAAINLGRDWRR-GQNPEDER--KLLSWFADDPRAPYSIHQFVQHGAVACGK 234
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +ALA Q + P+ +Y G P V D
Sbjct: 235 YPGEWFGPSATARCIQALANAQEQQ--------PLRVYST--------GDGPDVYED--- 275
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+ + + + P L+LV LG++K+ P Y L PQS+GI GG+P +S Y
Sbjct: 276 KFMEIAKPDGSRFNPTLILVGTRLGIDKITPVYWEALIAALQMPQSVGIAGGRPASSHYF 335
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+G Q YLDPH +P + D EAD T H+ +R +H+ +DPS+ +GF
Sbjct: 336 IGAQGSYLFYLDPHHTRPALPFHTDPSHYSEADVDTVHTRRLRRLHVRELDPSMLVGFLI 395
Query: 370 RDK 372
RD+
Sbjct: 396 RDE 398
>gi|426215654|ref|XP_004002085.1| PREDICTED: cysteine protease ATG4C isoform 1 [Ovis aries]
gi|426215656|ref|XP_004002086.1| PREDICTED: cysteine protease ATG4C isoform 2 [Ovis aries]
Length = 458
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 159/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|332232054|ref|XP_003265216.1| PREDICTED: cysteine protease ATG4C isoform 1 [Nomascus leucogenys]
gi|332232056|ref|XP_003265217.1| PREDICTED: cysteine protease ATG4C isoform 2 [Nomascus leucogenys]
Length = 458
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 158/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIADHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +I+ FGDS +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLAPFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|296489147|tpg|DAA31260.1| TPA: APG4 autophagy 4 homolog C [Bos taurus]
Length = 458
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 158/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKMERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|383861144|ref|XP_003706046.1| PREDICTED: cysteine protease ATG4B-like [Megachile rotundata]
Length = 384
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/319 (33%), Positives = 159/319 (49%), Gaps = 50/319 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 127
+W+LG + ++ L +D S++ +YRKGF PIG S TSD GW
Sbjct: 23 VWILGKQYNAIKE-----------LDAIRRDIRSKLWFTYRKGFVPIGGYTSTFTSDKGW 71
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
GCMLR QM++ QAL+ LGR W+ P + + Y++IL F D T+PFSIH +
Sbjct: 72 GCMLRCGQMVLGQALIILHLGRDWQWTPETR--NSTYLKILERFEDRRTAPFSIHQIASM 129
Query: 187 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
G + G G W GP + + + L + + I+V + +
Sbjct: 130 GASEGKEVGQWFGPNTIAQVLKKLVVYDDWSS--------ITIHVALDN---------TL 172
Query: 247 CIDDASRHCSVFS------------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
++D R C V K + W P+LLL+PL LGL ++NP YI L+ +F
Sbjct: 173 IVNDILRQCRVEGGTTAEADGNIPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFK 232
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDV 350
PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 233 IPQSLGVIGGKPNLALYFIGCVGNEVIYLDPHTTQRSGSVDKKLEEEEIEMD-ATYHCKF 291
Query: 351 IRHIHLDSIDPSLAIGFYC 369
I + IDPS+A+ F+C
Sbjct: 292 ASRIPITGIDPSVALCFFC 310
>gi|66529516|ref|XP_624577.1| PREDICTED: cysteine protease ATG4B [Apis mellifera]
Length = 382
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 157/322 (48%), Gaps = 42/322 (13%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 120
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 16 IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
TSD GWGCMLR QM++ QAL+ LGR W+ L+ + Y++IL F D +PFSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWSLETR-NSTYLKILERFEDKRNAPFSI 123
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 231
H + G + G G W GP + + W ++ + L + V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183
Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
G G AP+ K + W P+LLL+PL LGL ++NP YI L+
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 347
+F PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288
Query: 348 SDVIRHIHLDSIDPSLAIGFYC 369
I + IDPS+A+ F+C
Sbjct: 289 CKFSGRIPIIEIDPSVALCFFC 310
>gi|194378178|dbj|BAG57839.1| unnamed protein product [Homo sapiens]
Length = 411
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 160/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRRMAFAKMDPSCTVGFYAGDRKEFETL 345
>gi|166990665|sp|Q2U5B0.2|ATG4_ASPOR RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
Length = 407
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 116/373 (31%), Positives = 161/373 (43%), Gaps = 71/373 (19%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA-----------QDE------ALG 86
+RI + + P + IW LGV + KI QDE +
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPGKLGNYQDELEAGTSKID 70
Query: 87 DAAGNNGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITS 123
D + F DF S+I ++YR F PI TS
Sbjct: 71 DVTAHGWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTS 130
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH
Sbjct: 131 DTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRF 187
Query: 184 LQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
++ G ++ G G W GP A R EAL+ C ++ +YV + D
Sbjct: 188 VKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD----- 234
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V D R V G P L+L+ LG++ V P Y L+ PQS+GI
Sbjct: 235 ---VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIA 288
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSI 359
GG+P AS Y +G Q YLDPH +P + D + + STYH+ +R IH+ +
Sbjct: 289 GGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDM 348
Query: 360 DPSLAIGFYCRDK 372
DPS+ IGF R++
Sbjct: 349 DPSMLIGFLVRNE 361
>gi|301764643|ref|XP_002917740.1| PREDICTED: cysteine protease ATG4C-like [Ailuropoda melanoleuca]
gi|281350282|gb|EFB25866.1| hypothetical protein PANDA_006093 [Ailuropoda melanoleuca]
Length = 458
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 155/370 (41%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 181
QK R Y + I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTVSQKETIRRYSDDHEMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEETRHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + D +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|380023311|ref|XP_003695467.1| PREDICTED: cysteine protease ATG4B-like [Apis florea]
Length = 382
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 157/322 (48%), Gaps = 42/322 (13%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 120
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 16 IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
TSD GWGCMLR QM++ QAL+ LGR W+ L+ + Y++IL F D +PFSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWNLETR-NSTYLKILERFEDKRNAPFSI 123
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 231
H + G + G G W GP + + W ++ + L + V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183
Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
G G AP+ K + W P+LLL+PL LGL ++NP YI L+
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 347
+F PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288
Query: 348 SDVIRHIHLDSIDPSLAIGFYC 369
I + IDPS+A+ F+C
Sbjct: 289 CKFSGRIPIIEIDPSVALCFFC 310
>gi|149507363|ref|XP_001514370.1| PREDICTED: cysteine protease ATG4C [Ornithorhynchus anatinus]
Length = 459
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNN-----------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +E G +N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKSEEDDGIPVRSNWAPEDPAVISGNVDEFRKDFVSRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
P+G S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PPMGASGLTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPAALDMENSDSESWTSHTVK 155
Query: 155 -LQKPFDREYV--------------------------------EILHLFGDSETSPFSIH 181
L F+ +V +I+ FGDS + F +H
Sbjct: 156 KLTASFEASWVGERDPRPPSASRNAPRGSGSVRDEMRNEGFHRKIISWFGDSPRTYFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L + GK G AG W GP + R G + +YV
Sbjct: 216 QLTEYGKKSGKTAGDWYGPAVVAHILRKAVEEVRHPDLQG-----LTVYVAQ-------- 262
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G+ D +L+LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 263 DCTVYNSDVTDKLRASTDSGKTDDKAVLILVPVRLGGERTNIDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S +GFYCR+
Sbjct: 381 SCTVGFYCRN 390
>gi|166990663|sp|Q2HH40.2|ATG4_CHAGB RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
Length = 448
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/306 (33%), Positives = 150/306 (49%), Gaps = 56/306 (18%)
Query: 97 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
F DF SRI ++YR GF+PI GD + +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 192
Q L+A ALL +LGR WR+ +R I+ LF D +P+S+ N ++ G A G
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAERN---IVALFADDARAPYSLQNFVKHGAIACGK 229
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +ALA + + IY G P V D
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269
Query: 253 RHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
S + + D + P L+LV LG++K+NP Y L T QS+GI GG+P +S
Sbjct: 270 ---SFLATARPDGETFHPTLILVCTRLGIDKINPVYEEALISTLQMEQSIGIAGGRPSSS 326
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIG 366
Y VGVQ + YLDPH +P + ++ L + + H+ +R++H++ +DPS+ IG
Sbjct: 327 HYFVGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSCHTRRLRYLHVEDMDPSMLIG 386
Query: 367 FYCRDK 372
F +D+
Sbjct: 387 FLIQDE 392
>gi|380092671|emb|CCC09424.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 515
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 146/303 (48%), Gaps = 50/303 (16%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 133
F DF SRI ++YR F DP + +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A A+L RLGR WR+ + D E +I+ LF D +PFS+HN ++ G A G
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYGATACGK 296
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +AL E+GL S G P V D
Sbjct: 297 YPGEWFGPLATARCIQALT--DEKESGLRVYST---------------GDLPDVYEDSFM 339
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+ +G + P L+LV LG++K+N Y L T PQS+GI GG+P +S Y
Sbjct: 340 AVANPDGRG---FQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 396
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+GVQ + YLDPH +P + +D + T H+ +R +H+D +DPS+ IGF
Sbjct: 397 IGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELDTCHTRRLRQLHIDDMDPSMLIGFLI 456
Query: 370 RDK 372
+D+
Sbjct: 457 KDE 459
>gi|62898327|dbj|BAD97103.1| APG4 autophagy 4 homolog D variant [Homo sapiens]
Length = 474
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 160/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WMSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 408
>gi|327277326|ref|XP_003223416.1| PREDICTED: cysteine protease ATG4A-like [Anolis carolinensis]
Length = 385
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 100/318 (31%), Positives = 158/318 (49%), Gaps = 35/318 (11%)
Query: 71 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
W+LG H++ +++ + D S+R+ +YR+ F PIG + +SD GWGCM
Sbjct: 17 WILGRQHQLKTEKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 65
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
LR QM++AQAL+ LGR W K EY IL F D + +SIH + Q G
Sbjct: 66 LRCGQMMLAQALICRHLGRDWHWEEHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGE 125
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGER------ 240
G + G W GP + + + LA + +A+YV + ED ++
Sbjct: 126 GKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCRLPN 177
Query: 241 GGAPVVCIDDASRHCSVFSKGQAD------WTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
P V H S+ S+ ++ W P+LL++PL LG+ +NP Y+ + F
Sbjct: 178 QNCPPVAHCSPLSHQSLLSRNRSPGGFCCGWKPLLLIIPLRLGINHINPVYVDAFKECFK 237
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S + +
Sbjct: 238 MPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQLFVDSEENSTVDDRSFHCQQAPHRM 297
Query: 355 HLDSIDPSLAIGFYCRDK 372
+ ++DPS+A+GF+C+++
Sbjct: 298 KIMNLDPSVALGFFCKEE 315
>gi|326924562|ref|XP_003208495.1| PREDICTED: cysteine protease ATG4A-like, partial [Meleagris
gallopavo]
Length = 421
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 100/327 (30%), Positives = 156/327 (47%), Gaps = 52/327 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGRRHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 161
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 250 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 205 DIKKMCWSPPQSSSTAHSSAHLHRSALGRNRNTAGLCTGWKPLLLIIPLRLGINHINPVY 264
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 265 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 324
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + ++DPS+A+GF+C+++
Sbjct: 325 HCQQAPHRMKIMNLDPSVALGFFCKEE 351
>gi|440902657|gb|ELR53425.1| Cysteine protease ATG4C [Bos grunniens mutus]
Length = 458
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 158/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIHHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|27903825|ref|NP_116274.3| cysteine protease ATG4D [Homo sapiens]
gi|61211809|sp|Q86TL0.1|ATG4D_HUMAN RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
cysteine endopeptidase; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related cysteine endopeptidase
4; AltName: Full=Autophagy-related protein 4 homolog D
gi|27763975|emb|CAC85951.1| APG4-D protein [Homo sapiens]
gi|46362497|gb|AAH68992.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [Homo sapiens]
gi|119604524|gb|EAW84118.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_c
[Homo sapiens]
gi|312151144|gb|ADQ32084.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
construct]
Length = 474
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 160/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 408
>gi|195470405|ref|XP_002087497.1| GE17286 [Drosophila yakuba]
gi|194173598|gb|EDW87209.1| GE17286 [Drosophila yakuba]
Length = 411
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 98/311 (31%), Positives = 153/311 (49%), Gaps = 36/311 (11%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D GWGCMLR QM++AQAL+ LGR W D Y++I++ F D S +SIH
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-SDCRDATYLKIVNRFEDVRNSYYSIHQ 150
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ Q G+ A G W+GP + + + L R + +AI+V
Sbjct: 151 IAQMGETQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELESSCGMI 249
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 359
GG+P + Y +G ++ +YLDPH Q +G+ A+ TYH + ++
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAAAEQDYDETYHQKHAARLSFSAM 309
Query: 360 DPSLAIGFYCR 370
DPSLA+ F C+
Sbjct: 310 DPSLAVCFLCK 320
>gi|449273759|gb|EMC83168.1| Cysteine protease ATG4A, partial [Columba livia]
Length = 395
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 100/327 (30%), Positives = 154/327 (47%), Gaps = 52/327 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178
Query: 250 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 285
D + C +G W P+LL++PL LG+ +NP Y
Sbjct: 179 DIKKMCWSPPQGSGAAHSSAHLHRSALGRTKNAAGFCTGWKPLLLIIPLRLGINHINPVY 238
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 239 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDESF 298
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + ++DPS+A+GF+C+++
Sbjct: 299 HCQQAPHRMKIMNLDPSVALGFFCKEE 325
>gi|417401291|gb|JAA47536.1| Putative cysteine protease required for autophagy [Desmodus
rotundus]
Length = 458
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 158/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C H +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----------------LQ 156
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P ++
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 157 K-----------------------------PFDRE------YVEILHLFGDSETSPFSIH 181
K P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGRYPDDREMQNEVYHRKIISWFGDSPVALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQRASMTSDNTDGKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|189091768|ref|XP_001929717.1| hypothetical protein [Podospora anserina S mat+]
gi|188219237|emb|CAP49217.1| unnamed protein product [Podospora anserina S mat+]
Length = 508
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 99/303 (32%), Positives = 148/303 (48%), Gaps = 50/303 (16%)
Query: 97 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
F DF SRI ++YR GF+ I GD + +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A A+L R GR WR+ +RE I+ LF D +P+SI N + G A G
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIERE---IVCLFADDPRAPYSIQNFVNHGAAACGK 289
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +ALA+ + + +Y+ P V D+
Sbjct: 290 YPGEWFGPSATARCIQALAKKHDSS---------LRVYLTRD--------LPEVYEDN-- 330
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
S + + P L+LV LG++K+NP Y L T PQ++GI GG+P +S Y
Sbjct: 331 -FMSTANPDGNHFHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSSHYF 389
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+G Q + YLDPH +P + ++ + + + H+ +RH+H++ +DPS+ IGF
Sbjct: 390 IGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSCHTRRLRHLHVEDMDPSMLIGFLI 449
Query: 370 RDK 372
+D+
Sbjct: 450 KDE 452
>gi|449498615|ref|XP_002197397.2| PREDICTED: cysteine protease ATG4A [Taeniopygia guttata]
Length = 412
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 100/327 (30%), Positives = 155/327 (47%), Gaps = 52/327 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHC------------------SVFSKGQAD------WTPILLLVPLVLGLEKVNPRY 285
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 181 DIKKMCWSPAQSSSVAHSSAHVHRSALGQNKNTAGLCPGWKPLLLIIPLRLGINHINPVY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 241 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDKSF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + + ++DPS+A+GF+C+++
Sbjct: 301 HCQQAPHRMKIMNLDPSVALGFFCKEE 327
>gi|166990618|sp|A7KAI3.1|ATG4_PICAN RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|129714817|gb|ABO31288.1| Atg4p [Ogataea angusta]
Length = 509
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 111/345 (32%), Positives = 161/345 (46%), Gaps = 70/345 (20%)
Query: 72 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 120
L + HK D+A A + EF +D SRI ++YR GF I ++
Sbjct: 51 LRTLFHKFKPDQAADTEA--SWPREFLRDVHSRIWLTYRSGFPLIKRAEDGPSPLSFGSL 108
Query: 121 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
T+D GWGCM+R+SQ L+A +LL RLGR WR + + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANSLLQLRLGRGWRYDQTRECAK-HAEIV 167
Query: 168 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
F D T+PFSIHN ++ G G G W GP A RS + L +TGL
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKTGLKV---- 223
Query: 227 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 282
+ SGD ED +F Q A+ P+L+L + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQQGAELRPVLILAGIRLGVKNVN 263
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----- 337
P Y L+ T +PQS+GI GG+P +S Y G Q + YLDPH Q + I +
Sbjct: 264 PLYWDFLKKTLGWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHES 323
Query: 338 -------DLEA--DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
++E+ D + H++ IR +HLD +DPS+ +G ++
Sbjct: 324 PDPNHYVEVESGLDLDSVHTNKIRKLHLDQMDPSMLVGLLVENRA 368
>gi|296208133|ref|XP_002750954.1| PREDICTED: cysteine protease ATG4C [Callithrix jacchus]
Length = 458
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 158/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +++ FGDS +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEIRNEIYHRKVISWFGDSPLAPFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|327267215|ref|XP_003218398.1| PREDICTED: cysteine protease ATG4B-like [Anolis carolinensis]
Length = 393
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 102/330 (30%), Positives = 152/330 (46%), Gaps = 55/330 (16%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVLTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALICRHLGRDWRWSKGKKQTDSYYNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + LA + +A+++ + V ++
Sbjct: 134 EGKSIGQWYGPNTVAQVLRKLASFDTWSS--------LAVHIAMDN---------TVVME 176
Query: 250 DASRHC---------SVFSKGQADW------------------TPILLLVPLVLGLEKVN 282
+ R C S F + D+ P++LL+PL LGL +N
Sbjct: 177 EIRRLCKPSCPCPGASAFPAAEPDFLSNGYPEGAECTDRLLLWKPLVLLIPLRLGLTDIN 236
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D
Sbjct: 237 EAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPMDSCYIPD 296
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
S + + + +DPS+A+GF+C +
Sbjct: 297 ESFHCQHPPCRMSIAELDPSIAVGFFCNSE 326
>gi|312073335|ref|XP_003139474.1| hypothetical protein LOAG_03889 [Loa loa]
gi|307765357|gb|EFO24591.1| hypothetical protein LOAG_03889 [Loa loa]
Length = 458
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 164/391 (41%), Gaps = 83/391 (21%)
Query: 50 RIHE---RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR-- 104
R+HE R+ + +S L +A++ AL D+ N + + F+SR
Sbjct: 11 RVHEEAKRLFADWKPAVSKMLETYLTLDPSFSVAENYALFDS--NLPIYLLGEKFTSRRD 68
Query: 105 -----------ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+ +YRK F PIG T+D GWGCMLR QML+A+ L+ LGR W
Sbjct: 69 MERIKDIMASLLWFTYRKNFQPIGGIGPTTDQGWGCMLRCGQMLLARVLIVRHLGRNWL- 127
Query: 154 PLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR--- 205
+DR EY IL +F D + S FSIH + G + G G W GP +
Sbjct: 128 -----WDRDIKLAEYKRILRMFQDKKNSLFSIHQIAHMGVSEGKNIGEWFGPNTTAQVLK 182
Query: 206 ------SWEALA--------------------------RCQRAETGLGCQSLPMAIYVVS 233
W LA R ETG A+
Sbjct: 183 KLVIYDQWSRLAVHVALDNVLITSDIRTMAFTRPPYRKSGSRRETGSDYNDNHDAVNPAE 242
Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
+ E +P + S + +W P+L+++PL LGL +N Y P ++ F
Sbjct: 243 AEIFPESTRSPT---RSETSSISSYGGNSEEWRPLLIIIPLRLGLSTINRCYFPAIQAFF 299
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG---------------KDD 338
PQ +GI+GG+P + Y G+ + + +YLDPH Q +++ K+D
Sbjct: 300 QLPQCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQDFVDLDETTATRDERDGYVEIKND 359
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
E STYH I +D +DPSLA+GF C
Sbjct: 360 -EFRDSTYHCPFILTTKIDKVDPSLALGFLC 389
>gi|198417051|ref|XP_002128504.1| PREDICTED: similar to autophagy-related cysteine endopeptidase 2
[Ciona intestinalis]
Length = 422
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 158/335 (47%), Gaps = 58/335 (17%)
Query: 69 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWG 128
+IW+LG + + AL F + S + +YRKG+ PIG + TSD GWG
Sbjct: 39 NIWVLGSRFHLPHERAL-----------FLEHIKSFLWFTYRKGYTPIGGTGPTSDSGWG 87
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
CMLR QML+A+AL + + W+ KP Y ILH D +S +SIH + Q G
Sbjct: 88 CMLRCGQMLLARALAELTMDKDWKWTEDKPQPPPYKRILHQLSDERSSCYSIHQIAQMGV 147
Query: 189 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
G G W GP + + L++ + +AI+V + VCI
Sbjct: 148 EEGKEVGQWFGPNTISQVLRRLSQFDQENV--------LAIHVAMDN---------TVCI 190
Query: 249 DDASRHCSVFSKGQAD----------------------------WTPILLLVPLVLGLEK 280
+D R CS Q + W P+LLL+PL LGL +
Sbjct: 191 EDIERLCSTTPTTQYEGACSSTCKPDRTKCNGDSPNVSPTSDDFWRPLLLLIPLRLGLSE 250
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK--DD 338
+NP Y L+ + +S+G++GGKP + Y +G E+S I+LDPH QP + + +
Sbjct: 251 INPVYFTHLKECLHWKESVGVIGGKPNHAYYFLGCSEDSMIFLDPHTTQPYVKLPDITSN 310
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
D +T+H D + L ++DPSLA+GF C +G
Sbjct: 311 ERYDDTTFHCDTPGRMLLTNLDPSLALGFICTTRG 345
>gi|164660504|ref|XP_001731375.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
gi|159105275|gb|EDP44161.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
Length = 651
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 127/405 (31%), Positives = 185/405 (45%), Gaps = 78/405 (19%)
Query: 15 SKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSST------- 67
+K TP P++ + S + + V L++ + E VLG S T +S T
Sbjct: 215 AKETPLCPSQ-MHSSQQPISDHQPVSTLLS------LVEAVLGSSDTLPTSVTWLAHQLK 267
Query: 68 SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 127
+ W L H + A + F + +++R F TSDVGW
Sbjct: 268 ARGWELLASHGVPYTSPTAHTAFPGVWHSVHAVFQHILSLTHRTCF--------TSDVGW 319
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD--SETSPFSIHNLLQ 185
GCMLRS Q ++A AL+ LGR WR+ ++ +Y IL F D S PFSIH L+
Sbjct: 320 GCMLRSVQSMLANALIRVHLGRHWRRRAKQKTHPQYARILSWFMDDPSLECPFSIHRLVD 379
Query: 186 AGKAYGLAAGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
G+ G+ AG W GP +A+C+ +A C GLG VV+ D G
Sbjct: 380 EGQRLGVQAGDWFGPSTAAFALCKLIQAYDAC-----GLGV--------VVTND--GMLY 424
Query: 242 GAPVVCIDDASRHCSVFSKGQAD-WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
VV + F+ G++D WT P+L+L+ LGL++V P Y P L+ +FT PQS+
Sbjct: 425 KEQVVA--------ASFAPGRSDPWTRPVLILLVQRLGLDQVPPHYRPALKQSFTMPQSV 476
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI------------GKDDLEADTSTYH 347
G+VGG+P +S Y VGVQ E + LDPH V+P + DL + S +
Sbjct: 477 GVVGGRPRSSLYFVGVQREHLLCLDPHHVRPCVPFRSPPRMTRASVGASTDLASTVSPWF 536
Query: 348 SDVIRHIHLDS-------------IDPSLAIGFYCRDKGLLVTFE 379
+ LDS +DPS+ +GF C L+ +
Sbjct: 537 EEAYTAEELDSFHTPHTSLLPISQMDPSMLLGFVCEQASDLIDLQ 581
>gi|194853882|ref|XP_001968241.1| GG24763 [Drosophila erecta]
gi|190660108|gb|EDV57300.1| GG24763 [Drosophila erecta]
Length = 411
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 98/311 (31%), Positives = 154/311 (49%), Gaps = 36/311 (11%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D GWGCMLR QM++AQAL+ LGR W D Y++I++ F D S +SIH
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-ADCRDATYLKIVNRFEDVRNSFYSIHQ 150
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 151 IAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCGMI 249
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 359
GG+P + Y +G ++ +YLDPH Q +G+ A+ TYH + ++
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAVAEQDYDETYHQKHAARLSFSAM 309
Query: 360 DPSLAIGFYCR 370
DPSLA+ F C+
Sbjct: 310 DPSLAVCFLCK 320
>gi|19920488|ref|NP_608563.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
gi|7296129|gb|AAF51423.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
gi|16198037|gb|AAL13802.1| LD26292p [Drosophila melanogaster]
gi|220945806|gb|ACL85446.1| Atg4-PA [synthetic construct]
gi|220955642|gb|ACL90364.1| Atg4-PA [synthetic construct]
Length = 411
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 155/313 (49%), Gaps = 40/313 (12%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 180
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 357
++GG+P + Y +G ++ +YLDPH Q + + A+ TYH ++
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307
Query: 358 SIDPSLAIGFYCR 370
++DPSLA+ F C+
Sbjct: 308 AMDPSLAVCFLCK 320
>gi|225543220|ref|NP_778194.3| cysteine protease ATG4C [Mus musculus]
gi|225543224|ref|NP_001139439.1| cysteine protease ATG4C [Mus musculus]
gi|341940254|sp|Q811C2.2|ATG4C_MOUSE RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
cysteine endopeptidase; AltName: Full=Autophagin-3;
AltName: Full=Autophagy-related cysteine endopeptidase
3; AltName: Full=Autophagy-related protein 4 homolog C
Length = 458
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|27763971|emb|CAC85555.1| Apg4-C protein [Mus musculus]
gi|148698944|gb|EDL30891.1| autophagy-related 4C (yeast), isoform CRA_a [Mus musculus]
Length = 458
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|334326299|ref|XP_001366933.2| PREDICTED: cysteine protease ATG4D [Monodelphis domestica]
Length = 482
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 155/357 (43%), Gaps = 74/357 (20%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S + + +C + Q E GD + F +DF+SR+ ++YR+ F P+ +TSD
Sbjct: 79 TSFSKLSTVHLCGRRYQFEGEGD------IQRFQKDFASRLWLTYRRDFPPLDGGSLTSD 132
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------------- 152
GWGCMLRS QML+AQ LL H R W
Sbjct: 133 CGWGCMLRSGQMLLAQGLLLHFFSRDWTWAEAVLPPSPRESELFRSMSPSRSGASWQRGS 192
Query: 153 -----------------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
P Q + ++ I+ F D +PF +H L++ G++ G AG
Sbjct: 193 STASGLGRATWSTGGTLSPRQLEQEEQHRRIVSWFADQPGAPFGLHRLVELGRSSGKRAG 252
Query: 196 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 255
W GP +A R + + +YV + A ++ D S
Sbjct: 253 DWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDCTVYKADVAQLMAQPDPS--- 302
Query: 256 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
+W +++LVP+ LG E +NP Y+P ++ +GI+GGKP S Y +G
Sbjct: 303 -------TEWKSVIILVPVRLGGETLNPVYVPCVKELLRLDLCIGIIGGKPRHSLYFIGY 355
Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
Q++ +YLDPH QP ++ ++ + ++H R + +DPS IGFY ++
Sbjct: 356 QDDFLLYLDPHYCQPCVDTSQERFPLE--SFHCTSPRKMAFSRMDPSCTIGFYAGNR 410
>gi|320581937|gb|EFW96156.1| cysteine protease ATG4, putative [Ogataea parapolymorpha DL-1]
Length = 509
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 111/345 (32%), Positives = 160/345 (46%), Gaps = 70/345 (20%)
Query: 72 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 120
L + HK QD+A A + EF D SRI ++YR GF I ++
Sbjct: 51 LRTLFHKFKQDQAAETEA--SWPREFLGDVHSRIWLTYRSGFPLIRRAEDGPSPLSFGSL 108
Query: 121 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
T+D GWGCM+R+SQ L+A LL RLGR WR + + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANGLLQLRLGRGWRYDQTRECAK-HAEIV 167
Query: 168 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
F D T+PFSIHN ++ G G G W GP A RS + L + GL
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKIGLKV---- 223
Query: 227 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 282
+ SGD ED +F Q A+ P+L+L + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQEGAELRPVLILAGIRLGVKNVN 263
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----- 337
P Y L+ T ++PQS+GI GG+P +S Y G Q + YLDPH Q + I +
Sbjct: 264 PLYWDFLKKTLSWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHES 323
Query: 338 -------DLEA--DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
++E+ D + H++ IR +HLD +DPS+ +G ++
Sbjct: 324 PDPNHYVEVESGLDLDSVHTNKIRKLHLDQMDPSMLVGLLVENRA 368
>gi|442625102|ref|NP_001259852.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
gi|440213106|gb|AGB92389.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
Length = 410
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 155/313 (49%), Gaps = 40/313 (12%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 180
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 357
++GG+P + Y +G ++ +YLDPH Q + + A+ TYH ++
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307
Query: 358 SIDPSLAIGFYCR 370
++DPSLA+ F C+
Sbjct: 308 AMDPSLAVCFLCK 320
>gi|148698945|gb|EDL30892.1| autophagy-related 4C (yeast), isoform CRA_b [Mus musculus]
Length = 466
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 44 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 103
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 104 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 163
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F DRE + +I+ FGDS + F +H
Sbjct: 164 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 223
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 224 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 271
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 272 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 330
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 331 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 388
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 389 SCTIGFYCRN 398
>gi|335774946|gb|AEH58408.1| cysteine protease ATG4C-like protein, partial [Equus caballus]
Length = 400
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 100/335 (29%), Positives = 149/335 (44%), Gaps = 69/335 (20%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
AGN + EF +DF+SRI ++YR+ F I S +T+D GWGC +R+ QML+AQ L+ H LG
Sbjct: 15 AGN--VEEFRKDFTSRIWLTYREEFPQIEGSTLTTDCGWGCTVRTGQMLLAQGLILHFLG 72
Query: 149 RPW----------------------------------RKPLQKPF------------DRE 162
R W + L+ P D E
Sbjct: 73 RAWTWPDALNIENSDFESWTSNTVKKFTASFEASLSEERELKTPTISLKETIGRYSDDHE 132
Query: 163 ------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
+ +I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 133 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 192
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
G + IYV V D + C+ + AD +++LVP+ L
Sbjct: 193 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCASMASDHADDKAVIILVPVRL 239
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G E+ N Y+ ++ + +GI+GGKP S Y G Q++S IY+DPH Q +++
Sbjct: 240 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 299
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
D + T+H + + +DPS IGFYCR+
Sbjct: 300 KDFPLE--TFHCPSPKKMSFRKMDPSCTIGFYCRN 332
>gi|348586836|ref|XP_003479174.1| PREDICTED: cysteine protease ATG4C-like [Cavia porcellus]
Length = 435
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 170/384 (44%), Gaps = 79/384 (20%)
Query: 51 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQ 99
+H R + ++T S + S + LLG C+ +DE A+ D + EF +
Sbjct: 1 MHTRWVLKTKTYFSRN-SPVLLLGKCYHFKYEDEHKMLTARSGCAIEDRVIAGNVDEFRK 59
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--------- 150
DF SRI ++YR+ F PI S +++D GWGC LR+ QML+AQ L+ H LGR
Sbjct: 60 DFISRIWLTYREEFPPIEGSALSTDCGWGCTLRTGQMLLAQGLVLHFLGRAWIWPDALNI 119
Query: 151 -------WRKPLQKPFD--------------------REYVE----------------IL 167
W K F +E +E I+
Sbjct: 120 ENLDSESWTSHTVKKFAASFEASLSGERQLGTPALSLKETMEKYPNPHEVRDEVYHRKII 179
Query: 168 HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
FGDS ++ F +H L++ G+ G AG W GP + R G +
Sbjct: 180 SWFGDSPSALFGLHQLIECGRRSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----I 234
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
+YV +D + V+ ASR G AD +++LVP+ LG E+ N Y+
Sbjct: 235 TVYVA---QDCTVYNSDVIDKQSASR-----PAGNADDKAVIILVPVRLGGERTNTDYLE 286
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 347
++ + +GI+GGKP S Y G Q++S IY+DPH Q +++ D + T+H
Sbjct: 287 FVKGVLSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFH 344
Query: 348 SDVIRHIHLDSIDPSLAIGFYCRD 371
+ + +DPS IGFYCR+
Sbjct: 345 CPSPKKMSFRKMDPSCTIGFYCRN 368
>gi|195575679|ref|XP_002077704.1| GD23066 [Drosophila simulans]
gi|194189713|gb|EDX03289.1| GD23066 [Drosophila simulans]
Length = 411
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 168/358 (46%), Gaps = 60/358 (16%)
Query: 33 LGSSETVKRLVTAGSMRRIHERVLGPSRT---------------GISSSTSDIWLLGVCH 77
+G S+ + R+ M + E LGP I +D+W+LG +
Sbjct: 3 VGLSDQLARI-----MESVFEAYLGPDSVLASAVGQAVGSGEPEDIPRRNTDVWVLGKKY 57
Query: 78 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 137
Q+ L +D SR+ +YR GF P+G+ ++T+D GWGCMLR QM+
Sbjct: 58 NAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWGCMLRCGQMV 106
Query: 138 VAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
+AQAL+ LGR W P D Y++I++ F D S +SIH + Q G++ A G
Sbjct: 107 LAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSIHQIAQMGESQNKAVG 163
Query: 196 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 255
W+GP + + + L R + +AI+V V +DD C
Sbjct: 164 EWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD---------STVVLDDVYASC 206
Query: 256 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
+ W P+LL++PL LG+ +NP Y+P L+ S G++GG+P + Y +G
Sbjct: 207 ----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCGMIGGRPNQALYFLGY 262
Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCR 370
++ +YLDPH Q + + A+ TYH ++ ++DPSLA+ F C+
Sbjct: 263 VDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFSAMDPSLAVCFLCK 320
>gi|126723748|ref|NP_001075911.1| cysteine protease ATG4C [Bos taurus]
gi|126010621|gb|AAI33599.1| ATG4C protein [Bos taurus]
Length = 458
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L+ GK G AG W GP + R G + IYV
Sbjct: 216 QLIAYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|355669957|gb|AER94693.1| ATG4 autophagy related 4-like protein C [Mustela putorius furo]
Length = 396
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 100/335 (29%), Positives = 145/335 (43%), Gaps = 69/335 (20%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
AGN + EF +DF SRI ++YR+ F I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 11 AGN--VEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 68
Query: 149 RPWRKP----------------------------------------LQKPFDREYVE--- 165
R W P QK R Y +
Sbjct: 69 RAWTWPDALNIENSDSESWTSNTVKKFTASFEASLSGEGELKTPTVSQKEAIRRYSDDHE 128
Query: 166 ---------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 129 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 188
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
G + IYV V D + C+ + D +++L+P+ L
Sbjct: 189 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRL 235
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G E+ N Y+ ++ + +GI+GGKP S Y G Q++S IY+DPH Q +++
Sbjct: 236 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 295
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
D + T+H + + +DPS IGFYCR+
Sbjct: 296 KDFPLE--TFHCPSPKKMSFRKMDPSCTIGFYCRN 328
>gi|355669960|gb|AER94694.1| ATG4 autophagy related 4-like protein D [Mustela putorius furo]
Length = 388
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 160/354 (45%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 50 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 99
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 100 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSGLGPSEPSGLASPNRYRGPAR 159
Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ +R + +I+ F D +PF +H L G++ G AG W GP
Sbjct: 160 WVPPRWAHGTPELEQERRHRQIVSWFADHPRAPFGLHRLGGLGQSSGKKAGDWYGP---- 215
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 216 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 262
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 263 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 322
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 323 PHYCQPTVDVTQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETL 374
>gi|334350077|ref|XP_001376474.2| PREDICTED: cysteine protease ATG4A-like [Monodelphis domestica]
Length = 417
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 107/344 (31%), Positives = 168/344 (48%), Gaps = 25/344 (7%)
Query: 39 VKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 98
V V G R I GP + + +W+LG + + A ++
Sbjct: 19 VTLCVFPGVKRHITILSDGPEE--LPETDEPVWILGKQYDLQ--------AVITEKSKLL 68
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
D S+R+ +YR+ F PIG + +SD GWGCMLR QM++AQAL+ LGR W +Q+
Sbjct: 69 SDISARLWFTYRRKFSPIGGTGPSSDSGWGCMLRCGQMMLAQALICKHLGRDWCWEMQQE 128
Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEA 209
EY IL F D + +SIH + Q G G + G W GP A+ W +
Sbjct: 129 QPEEYHRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNS 188
Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 269
LA + + + + ++ + +P +D S H S G W P+L
Sbjct: 189 LAVYVSMDNTVVIEDIKKLCHMCPSHLTHDSSPSPGNGLDQ-STHLPEPSPG---WKPLL 244
Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
L++PL LG+ ++NP YI + F PQSLG +GGKP ++ Y +G IYLDPH Q
Sbjct: 245 LIIPLRLGINQINPVYIDAFKECFKMPQSLGALGGKPNSAYYFIGFLGNELIYLDPHTTQ 304
Query: 330 PVINIGKDDLEADTSTYHSDVIRH-IHLDSIDPSLAIGFYCRDK 372
++ ++D D ++H H + + ++DPS+A+GF+ +++
Sbjct: 305 TFVD-SEEDGTVDDQSFHCQQSPHRMQILNLDPSVALGFFFKEE 347
>gi|402080175|gb|EJT75320.1| cysteine protease ATG4 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 468
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 104/309 (33%), Positives = 141/309 (45%), Gaps = 56/309 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF SRI +SYR GF PI S T+D GWGCM+R+
Sbjct: 128 FLDDFESRIWVSYRSGFPPIPRSTDPAATSRMSFAMRLKTMTDQQAAFTTDSGWGCMIRT 187
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A LL HRLGR WR+ + +R+ +L LF D +P+SIH ++ G A G
Sbjct: 188 GQSLLANTLLSHRLGRGWRRGEKSDEERK---LLSLFADDPRAPYSIHKFVEHGAAKCGK 244
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R EALA + +Y G P V D
Sbjct: 245 YPGEWFGPSATARCIEALANTNEKT---------LRVYST--------GDLPDVYEDS-- 285
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
V + P L+LV LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 286 -FMEVARPDGKTFHPTLILVSTRLGIDKINQVYWESLTATLQMPQSVGIAGGRPSSSHYF 344
Query: 313 VGVQE------ESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
VG Q + YLDPH +P + D +D + H+ +R +H+ +DPS+
Sbjct: 345 VGAQRSDEDQGSNLFYLDPHHTRPALPYFDDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 404
Query: 364 AIGFYCRDK 372
IGF D+
Sbjct: 405 LIGFLITDE 413
>gi|85067704|ref|XP_959438.1| hypothetical protein NCU02433 [Neurospora crassa OR74A]
gi|62899773|sp|Q7S3X7.1|ATG4_NEUCR RecName: Full=Probable cysteine protease atg-4; AltName:
Full=Autophagy-related protein 4
gi|28920860|gb|EAA30202.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 506
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 128/399 (32%), Positives = 177/399 (44%), Gaps = 87/399 (21%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSR 60
G R A A+ C S ++ S A GS+LGS +TV VT+G ++ L
Sbjct: 112 FNGVRTTATAT-CLSDTS-----MSAAPTGSQLGSFDTVPDSVTSG-----YDSALAYEE 160
Query: 61 TGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF------- 113
G QD A F DF SRI ++YR F
Sbjct: 161 PG------------------QDGGWPPA--------FLDDFESRIWMTYRTDFALIPRSS 194
Query: 114 DPIGDSKIT----------------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 157
DP S ++ SD GWGCM+RS Q L+A A+L RLGR WR+
Sbjct: 195 DPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQSLLANAILIARLGREWRRGTD- 253
Query: 158 PFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRA 216
D E +I+ LF D +P+S+HN ++ G A G G W GP A R +ALA
Sbjct: 254 -LDAE-KDIIALFADDPRAPYSLHNFVKYGATACGKYPGEWFGPSATARCIQALA--DEK 309
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
++GL S G P V D + +G + P L+LV L
Sbjct: 310 QSGLRVYST---------------GDLPDVYEDSFMAVANPDGRG---FQPTLILVCTRL 351
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G++K+N Y L T PQS+GI GG+P +S Y VGVQ + YLDPH +P + +
Sbjct: 352 GIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYFVGVQGQRLFYLDPHHPRPALPYRE 411
Query: 337 DD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
D + T H+ +R +H+ +DPS+ IGF +D+
Sbjct: 412 DPRGYTAEELDTCHTRRLRQLHIGDMDPSMLIGFLIKDE 450
>gi|37748391|gb|AAH58981.1| Autophagy-related 4C (yeast) [Mus musculus]
Length = 458
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE A+ D + EF +DF SR+ ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRLWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|213513159|ref|NP_001133247.1| cysteine protease ATG4B [Salmo salar]
gi|209147572|gb|ACI32896.1| Cysteine protease ATG4B [Salmo salar]
gi|223647372|gb|ACN10444.1| Cysteine protease ATG4B [Salmo salar]
Length = 397
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 103/284 (36%), Positives = 149/284 (52%), Gaps = 18/284 (6%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR WR +
Sbjct: 47 TSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVRRHLGRDWRWVRSQSQRE 106
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEALAR 212
+Y+ IL+ F D + +S+H + Q G G + G W GP A+ SW L
Sbjct: 107 DYISILNAFLDKKDGYYSLHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRLTV 166
Query: 213 CQRAETGLGCQS-----LPMAIYVVSGDEDGERG-GAPVVCIDDASRHCSVFSKGQADWT 266
+ + + +P Y + D + G P C++ A C++ + A W
Sbjct: 167 HVAMDNTVVIEEIKRLCMPWLDYGGAACVDLQGGMPEPNGCLEGA---CALAEEETALWK 223
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+LLL+PL LGL +N YI TL+ F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 224 PLLLLIPLRLGLSDINEAYIETLKQCFQLPQSLGVIGGKPNHAHYFIGYVGEELIYLDPH 283
Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
QP + +D D + + +H+ IDPS+A+GF+CR
Sbjct: 284 TTQPAVEPCEDSQVPDDTYHCQHPPCRMHICEIDPSIAVGFFCR 327
>gi|407917424|gb|EKG10733.1| Peptidase C54 [Macrophomina phaseolina MS6]
Length = 437
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 101/314 (32%), Positives = 145/314 (46%), Gaps = 50/314 (15%)
Query: 87 DAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSK-----------------------IT 122
D+ N G + F DF +R+ I+YR F I S+ +
Sbjct: 94 DSDANGGWPSPFLDDFEARVWITYRSNFAAIPKSQDPNATTAMSFSVRFRNQISNQGGFS 153
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCM+RS Q L+A AL RLGR WR+ +R IL LF D +PFSIH
Sbjct: 154 SDTGWGCMIRSGQSLLANALQVLRLGRAWRRGQDSQGERR---ILSLFADDPKAPFSIHR 210
Query: 183 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
++ G A G G W GP A R +AL+ G + + +Y+ D
Sbjct: 211 FVEHGAVACGKHPGEWFGPSATARCIQALSN--------GYEDAGLRVYITGDGSD---- 258
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
+D+ V + P L+LV + LG+++V P Y L+ + QS+GI
Sbjct: 259 -----VYEDS--FMKVAKDANNTFHPTLVLVGIRLGIDRVTPVYWEALKASLQLSQSIGI 311
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDS 358
GG+P AS Y VG Q YLDPH +P + + D + D + H+ +R +H+
Sbjct: 312 AGGRPSASHYFVGTQGSYFFYLDPHTTRPFLPLHSDLSDYTQEDIDSCHTRRLRRLHVKE 371
Query: 359 IDPSLAIGFYCRDK 372
+DPS+ I F RD+
Sbjct: 372 MDPSMLIAFLIRDE 385
>gi|440638438|gb|ELR08357.1| hypothetical protein GMDG_03152 [Geomyces destructans 20631-21]
Length = 448
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 100/303 (33%), Positives = 145/303 (47%), Gaps = 49/303 (16%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 133
F DF S++ SYR GF DP S ++ SD GWGCM+RS
Sbjct: 108 FLDDFESKLRFSYRTGFPVIPRSEDPKASSTMSFSVRLRSQLSDQGGFSSDTGWGCMIRS 167
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A +++ RL R WR+ + + +RE I+ LF D +P+SIH ++ G +A G
Sbjct: 168 GQSLLANSMVILRLSRGWRRGVGRDKERE---IVSLFADDPRAPYSIHKFVEHGAEACGK 224
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R + LA+ +S + +Y+ D + G
Sbjct: 225 YPGQWFGPSATARCIQELAKRH--------ESADVRVYITGDGSDVYKDG---------- 266
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
SV ++ P L+LV LG++KV P Y L+ + PQS+GI GG+P +S Y
Sbjct: 267 -FMSVAKPDGVNFKPTLILVGTRLGIDKVTPVYWEALKASLQMPQSVGIAGGRPSSSHYF 325
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VGVQ YLDPH I D E A+ + H+ +R + + +DPS+ IGF
Sbjct: 326 VGVQGSHFFYLDPHQTMAAIPFHTDVDEYTPAEIDSCHTRRLRRLDIKEMDPSMLIGFLI 385
Query: 370 RDK 372
RD+
Sbjct: 386 RDE 388
>gi|116283594|gb|AAH18678.1| ATG4C protein [Homo sapiens]
Length = 451
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGSVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGEERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|74147895|dbj|BAE22307.1| unnamed protein product [Mus musculus]
Length = 458
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F DRE + +I+ FG+S + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGNSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|313228003|emb|CBY23152.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 101/282 (35%), Positives = 140/282 (49%), Gaps = 29/282 (10%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
L + DF SR+ +YR+ F IG S TSD GWGCMLR+ QMLVA+ LL RLGR +
Sbjct: 39 LEDIQGDFQSRLWFTYRRNFASIGGSGPTSDQGWGCMLRAGQMLVAECLLRQRLGRNYVW 98
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
D Y EIL LF D+ ++ S+ + L A A G W GP M + L R
Sbjct: 99 SESSIEDERYTEILELFRDTHSAELSLQQIALTGATAEKRAVGEWFGPNTMA---QVLKR 155
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
++ +SL + V VV ++D S + + G+ TP++L++
Sbjct: 156 ITKS------RSLGFGVTVAMDS---------VVSVEDVS--AEIINGGKP--TPLVLMI 196
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA----IYLDPHDV 328
PL LGL VN Y+ L++ +GI+GGKP + Y VG QE +YLDPH
Sbjct: 197 PLRLGLNSVNEIYVNPLKIFLASKYCVGIMGGKPNQAHYFVGYQETVEDTWLLYLDPHTT 256
Query: 329 Q--PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
Q PV E + H+D + I +DPSLA+GF+
Sbjct: 257 QQSPVSVNNNMPFEQFDKSLHTDKLCWIKALKLDPSLAVGFF 298
>gi|397475554|ref|XP_003809200.1| PREDICTED: cysteine protease ATG4C isoform 1 [Pan paniscus]
gi|397475556|ref|XP_003809201.1| PREDICTED: cysteine protease ATG4C isoform 2 [Pan paniscus]
Length = 458
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|30410844|ref|NP_116241.2| cysteine protease ATG4C [Homo sapiens]
gi|30410846|ref|NP_835739.1| cysteine protease ATG4C [Homo sapiens]
gi|114556947|ref|XP_001159883.1| PREDICTED: cysteine protease ATG4C isoform 4 [Pan troglodytes]
gi|114556951|ref|XP_001159976.1| PREDICTED: cysteine protease ATG4C isoform 6 [Pan troglodytes]
gi|61211867|sp|Q96DT6.1|ATG4C_HUMAN RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
cysteine endopeptidase; AltName: Full=Autophagin-3;
AltName: Full=Autophagy-related cysteine endopeptidase
3; AltName: Full=Autophagy-related protein 4 homolog C
gi|14625875|emb|CAC43939.1| putative autophagy-related cysteine endopeptidase [Homo sapiens]
gi|21542522|gb|AAH33024.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [Homo sapiens]
gi|27763973|emb|CAC85556.1| Apg4-C protein [Homo sapiens]
gi|119626984|gb|EAX06579.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|119626985|gb|EAX06580.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|123983334|gb|ABM83408.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
construct]
gi|123998035|gb|ABM86619.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
construct]
gi|410220598|gb|JAA07518.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410220600|gb|JAA07519.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410267918|gb|JAA21925.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410291226|gb|JAA24213.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410291228|gb|JAA24214.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410335203|gb|JAA36548.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410335205|gb|JAA36549.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
Length = 458
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|14042698|dbj|BAB55356.1| unnamed protein product [Homo sapiens]
Length = 446
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 158/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|442757637|gb|JAA70977.1| Putative cysteine protease required for autophagy [Ixodes ricinus]
Length = 458
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 155/370 (41%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C H +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------------------ 155
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPYALSIENSDSESRTSHTVK 155
Query: 156 ----------------------------QKPFDRE------YVEILHLFGDSETSPFSIH 181
+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEAPLSGARELKSPTVSLKETIGRYPDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQRASMASDNTDDKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|336467357|gb|EGO55521.1| hypothetical protein NEUTE1DRAFT_85886 [Neurospora tetrasperma FGSC
2508]
gi|350288001|gb|EGZ69237.1| hypothetical protein NEUTE2DRAFT_94213 [Neurospora tetrasperma FGSC
2509]
Length = 506
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 146/303 (48%), Gaps = 50/303 (16%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 133
F DF SRI ++YR F DP S ++ SD GWGCM+RS
Sbjct: 171 FLDDFESRIWMTYRTDFAFIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 230
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A A+L RLGR WR+ D E +I+ LF D +P+S+HN ++ G A G
Sbjct: 231 GQSLLANAILIARLGREWRRGTD--LDAE-KDIIALFADDPRAPYSLHNFVKYGATACGK 287
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +ALA ++GL S G P V D
Sbjct: 288 YPGEWFGPSATARCIQALA--DEKQSGLRVYST---------------GDLPDVYEDSFM 330
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+ +G + P L+LV LG++K+N Y L T PQS+GI GG+P +S Y
Sbjct: 331 AVANPDGRG---FQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 387
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VGVQ + YLDPH +P + +D + T H+ +R +H+ +DPS+ IGF
Sbjct: 388 VGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELDTCHTRRLRQLHIGDMDPSMLIGFLI 447
Query: 370 RDK 372
+D+
Sbjct: 448 KDE 450
>gi|402854773|ref|XP_003892029.1| PREDICTED: cysteine protease ATG4C isoform 1 [Papio anubis]
gi|402854775|ref|XP_003892030.1| PREDICTED: cysteine protease ATG4C isoform 2 [Papio anubis]
Length = 458
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 158/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKSILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMAFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|147905876|ref|NP_001088249.1| cysteine protease ATG4C [Xenopus laevis]
gi|61211751|sp|Q5XH30.1|ATG4C_XENLA RecName: Full=Cysteine protease ATG4C; AltName:
Full=Autophagy-related protein 4 homolog C
gi|54038152|gb|AAH84245.1| LOC495080 protein [Xenopus laevis]
Length = 450
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 159/371 (42%), Gaps = 93/371 (25%)
Query: 67 TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 114
S ++LLG C+ +++ D N+G + EF +DF SRI ++YRK F
Sbjct: 38 NSPVFLLGKCYHFKYEDSGVTADDCSNSGSDSKEDLSGNVDEFRKDFISRIWLTYRKEFP 97
Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 151
I S T+D GWGC LR+ QML+AQ LL H LGR W
Sbjct: 98 QIESSSWTTDCGWGCTLRTGQMLLAQGLLVHFLGRDWTWTEALDIFCSESDFWTANTARK 157
Query: 152 -------------------RKPLQKPFDREYVEILH-----LFGDSETSPFSIHNLLQAG 187
++PLQ + Y E LH F D + F +H L++ G
Sbjct: 158 LDPSLEKSSPENEEYVSLGKQPLQNSEKKRYSEDLHRKIISWFADYPLAYFGLHQLVKLG 217
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
K G AG W GP + L R E+ D E G +
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256
Query: 248 IDDASRHCSVFSK-------GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
D C++++ + + +++LVP+ LG E+ N Y ++ + +G
Sbjct: 257 AQD----CTIYNADVYDLQCNKGNEKAVVILVPVRLGGERTNMEYFEYVKGILSLEFCIG 312
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG Q++S IY+DPH Q +++ + + ++H + + +D
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSIKNFPLE--SFHCPSPKKMSFKKMD 370
Query: 361 PSLAIGFYCRD 371
PS +GFYCR+
Sbjct: 371 PSCTVGFYCRN 381
>gi|297664749|ref|XP_002810790.1| PREDICTED: cysteine protease ATG4C [Pongo abelii]
Length = 458
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|403257906|ref|XP_003921531.1| PREDICTED: cysteine protease ATG4C [Saimiri boliviensis
boliviensis]
Length = 458
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEMYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|310801857|gb|EFQ36750.1| peptidase family C54 [Glomerella graminicola M1.001]
Length = 454
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 148/323 (45%), Gaps = 54/323 (16%)
Query: 79 IAQDEALGDAAGNNG--LAEFNQDFSSRILISYRKGFDPIGDSK---------------- 120
+A DE D +G +G F DF S+ ++YR F I S
Sbjct: 101 LAYDE---DYSGQDGGWPTAFLDDFESKFWMTYRSEFPAIAKSTDPRASSALSFSMRIKS 157
Query: 121 -------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 173
+SD GWGCM+RS Q L+A A+ LGR WR+ + +R+ +L LF D
Sbjct: 158 QLVDQNGFSSDSGWGCMIRSGQSLLANAMAVINLGRDWRRGQNQEEERK---LLSLFADD 214
Query: 174 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
+P+SIH +Q G A G G W GP A R +ALA Q + P+ +Y
Sbjct: 215 PRAPYSIHQFVQHGAVACGKYPGEWFGPSATARCIQALANAQMHQ--------PLRVYST 266
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
G P V D + + + + P L+LV LG++K+ P Y L
Sbjct: 267 --------GDGPDVYED---KFMKIAKPDGSRFHPTLILVGTRLGIDKITPVYWEALIAA 315
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSD 349
PQS+GI GG+P +S Y +G Q YLDPH +P + + EAD T H+
Sbjct: 316 LQMPQSVGIAGGRPSSSHYFIGAQGSYLFYLDPHHTRPALPFHMNPSLYSEADVDTVHTR 375
Query: 350 VIRHIHLDSIDPSLAIGFYCRDK 372
+R +H+ +DPS+ IGF D+
Sbjct: 376 RLRRLHVRELDPSMLIGFLILDE 398
>gi|291398772|ref|XP_002715996.1| PREDICTED: APG4 autophagy 4 homolog C [Oryctolagus cuniculus]
Length = 458
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 156/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTICLKETIGKCSEDHETENEICHRKIISWFGDSPLAAFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + +YV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITVYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQSASMTSDNTDDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|358336800|dbj|GAA27956.2| autophagy-related protein 4 [Clonorchis sinensis]
Length = 507
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 104/298 (34%), Positives = 147/298 (49%), Gaps = 52/298 (17%)
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-----KPLQKPFDREYVEILHLFGD--SE 174
TSD GWGCM+RS QML+AQ L+ H LGR WR P++ P D + +++ F D S+
Sbjct: 183 TSDSGWGCMIRSGQMLLAQTLMIHLLGRDWRAFRGTSPIKTPEDHLHRQLIRWFHDCWSQ 242
Query: 175 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMC-----------RSWEALARCQ--------- 214
SPFS+H L+QA G GSW GP +C R +E LAR
Sbjct: 243 ESPFSLHRLVQAS---GQLPGSWFGPATLCSALVKVMSDASRRFEELARVHIYWVRDRVI 299
Query: 215 -RAET-----GLGCQSLPMAIYVVSGDEDGERGGA-------PVVCIDD---ASRHCSVF 258
R E G + P + E+ + + P + D +S ++F
Sbjct: 300 YREEIMNLARGQPVRRKPGRLNFTDFSENFQHCCSQECSPPIPPTYLQDGIQSSPSTTLF 359
Query: 259 SKGQADWTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 317
++LL+P+ LGL+K ++ RY+P + P +GI+GG+P S YI+G Q
Sbjct: 360 PSHA-----VILLLPIRLGLDKRIDARYVPMVCRLVRDPCFVGIIGGRPRHSIYILGCQN 414
Query: 318 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLL 375
I+LDPH QPV+ D E + T+H V R I +DPS A+GFYCR +G L
Sbjct: 415 TQLIHLDPHFTQPVVRNVVDSEEFNVKTWHCLVPRVIEAAKLDPSCAVGFYCRSRGDL 472
>gi|344278625|ref|XP_003411094.1| PREDICTED: cysteine protease ATG4C [Loxodonta africana]
Length = 458
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 157/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE A+ D + + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPAISSCAIEDCVISGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 181
F D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGESELKTPSISLKKTIGKYSDDHEMRNEIYHRKIVSWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKAGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQCASMASDNPDNKAVIILVPVRLGGERTNVDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYC++
Sbjct: 381 SCTIGFYCQN 390
>gi|47222154|emb|CAG11580.1| unnamed protein product [Tetraodon nigroviridis]
Length = 440
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 162/379 (42%), Gaps = 79/379 (20%)
Query: 65 SSTSDIWLLGVCH--KIAQDEALGDAAGN--------NGLAEFNQDFSSRILISYRKGFD 114
S S + LLG C+ K+ +DE + +A + +F +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKVEEDEGVAEACCEASDEEDVVGNVEDFRRDFGSRIWLTYREEFP 95
Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDRE--------- 162
P+ S +TSD GWGCMLR+ QM++AQALL H +GR W R +P D E
Sbjct: 96 PLPGSTLTSDCGWGCMLRAGQMMLAQALLLHFMGRDWTWSRTMSLQPLDTETWTTSAAKR 155
Query: 163 ----------------------------------YVE-------ILHLFGDSETSPFSIH 181
+VE ++ FGDS ++ F +H
Sbjct: 156 LVASLESSLQGSPGPSDNRGPQNQAAGSAEEAGAHVEGEAFHRTLVSWFGDSPSAQFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSW-----EALARCQRAETGLGCQSLPMAIYVVSGDE 236
++ G G AG W GP + EAL T Q + V
Sbjct: 216 RMVHLGLEMGKQAGEWYGPAVVAHILKKAVEEALDPSLAGITAYVSQDCTVYSADVIDGH 275
Query: 237 DGERGGAP-----VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
+P V + ++ S +A +++LVP+ LG EK NP Y +
Sbjct: 276 KASTSASPESSDDVTLLSPNNQAASALPDSRA----VIILVPVRLGGEKTNPDYFNLAKS 331
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
+ +GI+GGKP + Y VG Q++S IY+DPH Q +++ D ++H
Sbjct: 332 ILSLDYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSP 389
Query: 352 RHIHLDSIDPSLAIGFYCR 370
+ + +DPS +GFY R
Sbjct: 390 KKMPFTKMDPSCTLGFYSR 408
>gi|148691993|gb|EDL23940.1| mCG3720 [Mus musculus]
Length = 318
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 87/266 (32%), Positives = 127/266 (47%), Gaps = 49/266 (18%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 77 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 125
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 126 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 185
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 186 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 228
Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 229 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 288
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVG 314
+ F PQSLG +GGKP + Y +G
Sbjct: 289 FKECFKMPQSLGALGGKPNNAYYFIG 314
>gi|387015378|gb|AFJ49808.1| Cysteine protease ATG4C-like [Crotalus adamanteus]
Length = 457
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 157/369 (42%), Gaps = 77/369 (20%)
Query: 65 SSTSDIWLLGVCHKIAQDEA-----------LGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S ++LLG C+ DE + D + + + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDEPSDQSPNGSCDDMTDESFSRNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQITGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWANAFVFENPESESWTSQTVK 155
Query: 152 -----------------------RKPLQKPFDREYVE------ILHLFGDSETSPFSIHN 182
+ P++ E VE I+ F DS + F +H
Sbjct: 156 KLTASLETSLIGEREFRSQSTHPKSPIRNQETEESVEEQYHRRIISWFADSPFANFGLHR 215
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
L++ GK G AG W GP + L R + E + + IYV +
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAH----LLR-KAVEKARDPELQGITIYVAQDCTVYKSDV 270
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
+C S SV S I++L+P+ LG E+ N Y ++ + +GI+
Sbjct: 271 IDALCPFTDSEKTSVKS--------IIILIPVRLGGERTNMEYFEFVKGILSLDYCIGII 322
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DPS
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSVKDFPLE--SFHCPSPKKMSFKKMDPS 380
Query: 363 LAIGFYCRD 371
IG YC D
Sbjct: 381 CTIGLYCPD 389
>gi|73956170|ref|XP_852273.1| PREDICTED: cysteine protease ATG4C isoform 2 [Canis lupus
familiaris]
gi|73956176|ref|XP_865426.1| PREDICTED: cysteine protease ATG4C isoform 4 [Canis lupus
familiaris]
Length = 458
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 154/370 (41%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKFEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
I S T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSAFTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSDSWTSNTVK 155
Query: 158 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 181
F D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGESELKTPTVSQKETIRRHSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + D +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|393247625|gb|EJD55132.1| hypothetical protein AURDEDRAFT_78065 [Auricularia delicata
TFB-10046 SS5]
Length = 989
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 134/284 (47%), Gaps = 47/284 (16%)
Query: 97 FNQDFSSRILISYRKGFDPI-----------------------------GDSKITSDVGW 127
F DF+SR+ ++YR F PI G+ TSD GW
Sbjct: 314 FYADFTSRVWLTYRSQFSPIHDCPLSACKGKDLESLDANPPKRTFWPGSGEKTWTSDAGW 373
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPL---QKPFDREYVEILHLFGDSET--SPFSIHN 182
GCMLR+ Q L+A L+ LGR WR+P P YV+IL F D+ + +PFS+H
Sbjct: 374 GCMLRTGQSLLANTLIHLHLGRDWRRPAINSASPEFATYVKILTWFFDAPSVHAPFSVHR 433
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIYVVSGDEDGERG 241
+ +GK +G G W GP + L RA+ G+ +A+ V + D
Sbjct: 434 MAMSGKDFGKDVGQWFGPSTAAGAIRTLVHDFPRAQLGVA-----IAVDGVLYETDIYSA 488
Query: 242 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
+ D +R S F + W +L+LV LGL+ VNP Y L+ FTFPQSL
Sbjct: 489 SHYPMSSADGARRASGFKRHPGRWGNRAVLVLVATRLGLDGVNPIYYENLKTIFTFPQSL 548
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI-----GKDD 338
GI GG+P +S Y VG Q S YLDPH +P + + G DD
Sbjct: 549 GIAGGRPSSSYYFVGSQGNSLFYLDPHHTRPAVPLRTPPPGDDD 592
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
D T+H D +R + L +DPS+ +GF CRD+
Sbjct: 699 DLKTFHCDRVRKMPLSGLDPSMLLGFLCRDE 729
>gi|342321655|gb|EGU13587.1| Cysteine protease ATG4 [Rhodotorula glutinis ATCC 204091]
Length = 1119
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 150/371 (40%), Gaps = 112/371 (30%)
Query: 91 NNGLAEFNQDFSSRILISYRKGF-----DPIGDSK------------------------- 120
N A F D SRI ++YR GF DP S
Sbjct: 644 NGWPAAFYHDSYSRIALTYRSGFPIIPCDPSSSSTGVVQGMLNNLSMSIGRGGHRGPSPT 703
Query: 121 -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-----------KPFDREYV 164
++SD GWGCMLR+ Q L+A AL+ LGR WR+PL P Y
Sbjct: 704 NAEGGLSSDTGWGCMLRTGQSLLANALVKVHLGRDWRRPLPLGDFITSSTSPVPSAATYA 763
Query: 165 EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
IL LF D S SPFS+H Q GK G G W GP + + L
Sbjct: 764 RILSLFLDDPSPISPFSVHRFAQQGKVLGKEIGEWFGPSTAAGAIKTLVNAYE------- 816
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---W-TPILLLVPLVLGL 278
P + VVS C+D V + D W TP+L+L+ + LG+
Sbjct: 817 ---PAGLKVVS-------------CVDGTVYESEVVAASTKDGEKWKTPVLVLINVRLGI 860
Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN--IGK 336
+ VNP Y ++ F PQS+GI GG+P +S Y VG Q S Y+DPH +P + +
Sbjct: 861 DGVNPIYYEAIKGIFRLPQSVGIAGGRPSSSYYFVGAQANSLFYIDPHHPRPAVPLVLPP 920
Query: 337 DD-------------LEADT----------------------STYHSDVIRHIHLDSIDP 361
DD ADT +TYH+D +R L S+DP
Sbjct: 921 DDSLVRAAQHLPLTPSTADTPAKESARQLDDFLLAAYPDAAWATYHTDKVRKCALSSLDP 980
Query: 362 SLAIGFYCRDK 372
S+ +GF D+
Sbjct: 981 SMLLGFLVEDE 991
>gi|389637385|ref|XP_003716330.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
gi|148887340|sp|Q523C3.2|ATG4_MAGO7 RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|351642149|gb|EHA50011.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
Length = 491
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 103/309 (33%), Positives = 140/309 (45%), Gaps = 56/309 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R ALA +Y G P V D
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367
Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 427
Query: 364 AIGFYCRDK 372
IGF D+
Sbjct: 428 LIGFLILDE 436
>gi|395530478|ref|XP_003767321.1| PREDICTED: cysteine protease ATG4C [Sarcophilus harrisii]
Length = 458
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 155/370 (41%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 111
S S + LLG C+ +E A G N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVLLLGKCYHFKSEEENDPAPVQPQWVGENEPVVVSGNVEEFRRDFISRIWLTYRE 95
Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR------------------- 152
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDVDNSDSESWTSHT 155
Query: 153 ---------------------KPLQKPFDRE----------YVEILHLFGDSETSPFSIH 181
P+++P R + +I+ F DS + F +H
Sbjct: 156 VKKLTASLEASLTGERAAQDPSPIKEPPRRGSDDGGGEESCHRKIVSWFADSPLACFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEHGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + CS + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYKADVIDKQCSSMDPENTEDKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S +GFYCR+
Sbjct: 381 SCTVGFYCRN 390
>gi|118094640|ref|XP_422520.2| PREDICTED: cysteine protease ATG4C [Gallus gallus]
Length = 459
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 152/369 (41%), Gaps = 78/369 (21%)
Query: 65 SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 113
S S ++LLG C+ DE+ L N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQAL------------------------------- 142
I S +T+D GWGC LR+ QML+AQ L
Sbjct: 96 PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155
Query: 143 ------------------LFHRLGRPWRKPLQKPFDREYV---EILHLFGDSETSPFSIH 181
L H R R+ R V +I+ FGDS + F +H
Sbjct: 156 KLTASLEASLTAEREPKILSHHQERTLRRDCGDSEMRNEVYHRKIISWFGDSPLAAFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + +YV
Sbjct: 216 QLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D R CS G+ D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDP 380
Query: 362 SLAIGFYCR 370
S IGFYCR
Sbjct: 381 SCTIGFYCR 389
>gi|440478911|gb|ELQ59709.1| cysteine protease atg4 [Magnaporthe oryzae P131]
Length = 572
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 103/309 (33%), Positives = 140/309 (45%), Gaps = 56/309 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R ALA +Y G P V D
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 390 -FMEVAKSDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448
Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 508
Query: 364 AIGFYCRDK 372
IGF D+
Sbjct: 509 LIGFLILDE 517
>gi|62899783|sp|Q86ZL5.1|ATG4_PODAS RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|27802993|emb|CAD60696.1| unnamed protein product [Podospora anserina]
Length = 500
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 97/306 (31%), Positives = 144/306 (47%), Gaps = 64/306 (20%)
Query: 97 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
F DF SRI ++YR GF+ I GD + +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A A+L R GR WR+ +RE I+ LF D +P+SI N + G A G
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIERE---IVCLFADDPRAPYSIQNFVNHGAAACGK 289
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI---YVVSGDEDGERGGAPVVCID 249
G W GP A ARC + + LP ++ + + DG
Sbjct: 290 YPGEWFGP-------SATARCIHSLRVYLTRDLPEVYEDNFMSTANPDGNH--------- 333
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
+ P L+LV LG++K+NP Y L T PQ++GI GG+P +S
Sbjct: 334 ---------------FHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSS 378
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIG 366
Y +G Q + YLDPH +P + ++ + + + H+ +RH+H++ +DPS+ IG
Sbjct: 379 HYFIGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSCHTRRLRHLHVEDMDPSMLIG 438
Query: 367 FYCRDK 372
F +D+
Sbjct: 439 FLIKDE 444
>gi|440467300|gb|ELQ36530.1| cysteine protease atg4 [Magnaporthe oryzae Y34]
Length = 572
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 103/309 (33%), Positives = 140/309 (45%), Gaps = 56/309 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R ALA +Y G P V D
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 390 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448
Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 508
Query: 364 AIGFYCRDK 372
IGF D+
Sbjct: 509 LIGFLILDE 517
>gi|50344862|ref|NP_001002103.1| cysteine protease ATG4C [Danio rerio]
gi|47938047|gb|AAH71514.1| Autophagy-related 4C (yeast) [Danio rerio]
Length = 463
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 111/379 (29%), Positives = 165/379 (43%), Gaps = 80/379 (21%)
Query: 59 SRTGISSSTSDIWLLGVCH--KIAQDE--------ALGDAAGNNGLAEFNQDFSSRILIS 108
S+T S + S ++LLG C+ K+ DE AL D + EF +DF+SR+ ++
Sbjct: 31 SKTAFSRN-SPVFLLGKCYHFKVVDDENPTESTAEALDDDVVTGNVDEFRKDFTSRVWLT 89
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQK---- 157
YR+ F + S TSD GWGC LR+ QM++AQALL H LGR W+ +PL
Sbjct: 90 YREEFPALPGSSFTSDCGWGCTLRAGQMILAQALLLHILGRDWKWSEALSLEPLDTETWT 149
Query: 158 ---------------------------PFDREYVE------------ILHLFGDSETSPF 178
P E E I+ FGD ++
Sbjct: 150 SSAARRLVATLEASIQGERAQASQPLCPVQGEAEEADSYLKETYHRTIVSWFGDGPSAQL 209
Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
I+ L++ G G AG W GP +A R ++ I V +D
Sbjct: 210 GIYKLVELGMTSGKQAGDWYGP-------AVVAHILRKAVDEAVDAMLKGIRVYVA-QDC 261
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQA-------DWTPILLLVPLVLGLEKVNPRYIPTLRL 291
A V ID S S Q D +++L+P+ LG EK+NP Y+ ++
Sbjct: 262 TVYSADV--IDSHSTRTESHSDPQGLDSGASPDSRAVVILIPVRLGGEKINPEYLNFVKS 319
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
+ +GI+GGKP + Y VG Q++S IY+DPH Q +++ D ++H
Sbjct: 320 ILSLEYCIGIIGGKPKQAYYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSP 377
Query: 352 RHIHLDSIDPSLAIGFYCR 370
+ + +DPS IGFY +
Sbjct: 378 KKMSFSKMDPSCTIGFYSK 396
>gi|391335597|ref|XP_003742176.1| PREDICTED: cysteine protease ATG4B-like [Metaseiulus occidentalis]
Length = 393
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 94/279 (33%), Positives = 146/279 (52%), Gaps = 23/279 (8%)
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
+ FSS + +YRK F IG TSD GWGCMLR+ QM++ QAL+ LGR W
Sbjct: 79 KSFSSMLWFTYRKNFAAIGGDGPTSDTGWGCMLRAGQMMLGQALIRKHLGRSWMWTSDDR 138
Query: 159 F-DRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
DRE Y+ IL +F D +++ FSIH + G + G A G W GP + ++ + L +
Sbjct: 139 LPDRENYLRILRMFQDKKSATFSIHQISLMGLSEGKAVGEWFGPNTVAQALKKLVQYDHW 198
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
M ++V + ++ + D C +K W P+LL+VPL L
Sbjct: 199 S--------EMKLHVAMDN---------IIILSDIKSLCC--AKESNKWRPLLLVVPLRL 239
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
GL ++N Y + +F SLGI+GG+P + Y +G+Q E ++LDPH +++
Sbjct: 240 GLSEINDIYTNAVLNSFKMKHSLGIIGGRPSHALYFIGIQREELVFLDPHTTHNYVDL-- 297
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLL 375
D+ + STYH + + + ++DPS+A+ FY D+ L
Sbjct: 298 DEEPYNDSTYHCQRAQRMKISNMDPSIAMCFYIGDEDEL 336
>gi|383872484|ref|NP_001244816.1| cysteine protease ATG4C [Macaca mulatta]
gi|355745338|gb|EHH49963.1| hypothetical protein EGM_00712 [Macaca fascicularis]
gi|380788509|gb|AFE66130.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
gi|383413101|gb|AFH29764.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
Length = 458
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 156/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|410921904|ref|XP_003974423.1| PREDICTED: cysteine protease ATG4C-like [Takifugu rubripes]
Length = 468
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 160/373 (42%), Gaps = 69/373 (18%)
Query: 65 SSTSDIWLLGVCHKI------AQDEALGDAAGNNGL----AEFNQDFSSRILISYRKGFD 114
S S + LLG C+ Q EA +A+ G+ +F +DF SRI ++YR+ F
Sbjct: 29 SRNSPVLLLGKCYHFKAEEDEGQTEACREASDEEGVMGNVEDFRRDFGSRIWLTYREEFP 88
Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE--------- 162
P+ S +TSD GWGCMLR+ QM++AQALL H LGR W + +P D E
Sbjct: 89 PLPGSSLTSDCGWGCMLRAGQMMLAQALLLHFLGRDWTWSGAMSLQPLDTETWTTSAAKR 148
Query: 163 ----------------------------------------YVEILHLFGDSETSPFSIHN 182
+ ++ FGDS ++ F +H
Sbjct: 149 LVASLESSLQASPGPSDPVVSQRQVAGSGEEAGVHTDGGFHRTLVSWFGDSPSAQFGLHR 208
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS-LPMAIYVVSGDEDGERG 241
+++ G A G AG W GP + + R G S + V S D
Sbjct: 209 MVRLGLAMGKRAGEWYGPAVVAHILKKAVEEARDPCLAGISSYVSQDCTVYSADVIDSHK 268
Query: 242 GAPVVCID----DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
+ + +S H S + D +++LVP+ LG EK NP Y + +
Sbjct: 269 ASASAAAERPDVTSSSHNSQPASASPDSRAVIILVPVRLGGEKTNPDYFNLAKSFLSLDY 328
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
+GI+GGKP + Y VG Q++S IY+DPH Q +++ D ++H + +
Sbjct: 329 CIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSPKKMPFT 386
Query: 358 SIDPSLAIGFYCR 370
+DPS GFY R
Sbjct: 387 KMDPSCTFGFYSR 399
>gi|255945233|ref|XP_002563384.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
gi|166990617|sp|A7KAL5.1|ATG4_PENCW RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|129561973|gb|ABO31075.1| Atg4p [Penicillium chrysogenum]
gi|211588119|emb|CAP86190.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 401
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 109/348 (31%), Positives = 157/348 (45%), Gaps = 71/348 (20%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ-----------------DFSSRILISYRKG 112
IW LG + A + D A NN + Q DF SRI I+YR
Sbjct: 29 IWCLG--REYAPSQPPSDPASNNPRSPSRQPNASTLNDTTWPKAFLSDFGSRIWITYRSN 86
Query: 113 FDPIGDSK-----------------------ITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
F PI +K TSD GWGCM+RS Q L+A LGR
Sbjct: 87 FTPIPRTKTPEATSSMTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQSLLANTFSVLLLGR 146
Query: 150 PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWE 208
WR+ + E +++ +F D +PFSIH + G ++ G G W GP
Sbjct: 147 DWRRGEKV---EEESKLISMFADHPEAPFSIHRFVNRGAESCGKYPGEWFGP-------S 196
Query: 209 ALARCQRAETGLGCQS-LP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
A A+C + L QS +P + +Y+ + D +D H + G+
Sbjct: 197 ATAKCIQL---LSTQSEVPQLRVYLTNDTSD---------VYEDKFAHVAHDESGRIQ-- 242
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P L+L+ LG++ V P Y LR T+PQS+GI GG+P AS Y VG Q+ +LDPH
Sbjct: 243 PTLILIGTRLGIDNVTPAYWDGLRAALTYPQSVGIAGGRPSASHYFVGAQDCHLFFLDPH 302
Query: 327 DVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+P D L + + +Y++ +R IH+ +DPS+ IGF +D+
Sbjct: 303 TTRPATLYRPDGLYTQEELDSYYTSRLRRIHIKDMDPSMLIGFLVKDE 350
>gi|342877133|gb|EGU78640.1| hypothetical protein FOXB_10826 [Fusarium oxysporum Fo5176]
Length = 449
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 109/321 (33%), Positives = 150/321 (46%), Gaps = 53/321 (16%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 116
+A D+ D +G F DF SRI ++YR FDPI
Sbjct: 99 LAYDDQSNDGGWPSG---FITDFESRIWMTYRSEFDPIPRSTNPQATSSLSLSMRLKSQL 155
Query: 117 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 175
GD S +SD GWGCM+RS Q L+A + RLGR WR Q E IL F D
Sbjct: 156 GDQSPFSSDSGWGCMIRSGQSLLANTIALVRLGRDWR---QGQSLEEECRILKDFADDPR 212
Query: 176 SPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
+P+SIH+ ++ G A G G W GP A R +ALA +I V S
Sbjct: 213 APYSIHSFVRHGASACGKYPGEWFGPSATARCIQALANSHEP-----------SIRVYS- 260
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
G P V DD + + G+A + P L+LV LGL+K+ P Y L
Sbjct: 261 -----TGDGPDVYEDDFMKIAN--PTGEA-FHPTLVLVGTRLGLDKITPVYWEALIAALQ 312
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVI 351
PQS+GI GG+P +S Y +G Q YLDPH +P + ++ ++ + + H+ +
Sbjct: 313 MPQSVGIAGGRPSSSHYFIGSQGSFLFYLDPHHTRPALPYHENPMDYTSEEIESCHTARL 372
Query: 352 RHIHLDSIDPSLAIGFYCRDK 372
R IH+ +DPS+ IGF R +
Sbjct: 373 RRIHVREMDPSMLIGFLIRSE 393
>gi|74665877|sp|Q4U3V5.1|ATG4_CRYPA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|66576169|gb|AAY51673.1| putative cysteine protease Atg4 [Cryphonectria parasitica]
Length = 459
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 148/328 (45%), Gaps = 60/328 (18%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 120
+A DE L DA F DF SR+ ++YR F+PI S
Sbjct: 109 LAYDELLEDAGWP---IAFLDDFESRVWMTYRSEFEPISKSNDPRASAALSFAMRLRTLA 165
Query: 121 ----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
+SD GWGCM+RS Q L+A L+ +LGR WR+ R+ EIL F D +
Sbjct: 166 DQGGFSSDTGWGCMIRSGQSLLANTLVICQLGRDWRRGKAA---RQEREILARFADDPRA 222
Query: 177 PFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
P+S+HN ++ G A G G W GP A R +ALA + + +Y
Sbjct: 223 PYSLHNFVRHGAVACGKFPGEWFGPSATARCIQALANSNESS---------LRVYST--- 270
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
G P V D +V + P L+LV LG++K+N Y L T
Sbjct: 271 -----GDLPDVYEDS---FMAVAKPDGETFHPTLILVGTRLGIDKINQVYWEALTATLQM 322
Query: 296 PQSLGIVGGKPGASTYIVGVQEES--------AIYLDPHDVQPVINIGKD---DLEADTS 344
PQS+GI GG+P AS Y +G Q YLDPH +P + +D D +
Sbjct: 323 PQSVGIAGGRPSASHYFIGAQRSGDAYEPGSYLFYLDPHCTRPALPFHEDVDQYTSDDIN 382
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
T H+ +R +H+ +DPS+ IGF +D+
Sbjct: 383 TCHTRRLRRLHVRDMDPSMLIGFLIKDE 410
>gi|355558068|gb|EHH14848.1| hypothetical protein EGK_00836 [Macaca mulatta]
Length = 458
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 156/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -FSVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|327264155|ref|XP_003216881.1| PREDICTED: cysteine protease ATG4D-like [Anolis carolinensis]
Length = 585
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 94/329 (28%), Positives = 142/329 (43%), Gaps = 66/329 (20%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----- 151
F +DF+SRI ++YR+ F + + T+D GWGCMLRS QML+AQ L+ H LG+ W
Sbjct: 198 FQKDFASRIWLTYRRDFQQLEGTMWTTDCGWGCMLRSGQMLLAQGLIVHFLGKDWTWPDA 257
Query: 152 ------------------------------------------------RKPLQKPFDREY 163
R P + +R +
Sbjct: 258 LHTPGLVEMEPMKATHLPYPSTSSSHQGPSIPTDRSRGPWELRAPRHTRSPDELEKERYH 317
Query: 164 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
+I+ F D + F IH L+ G + G AG W GP C C
Sbjct: 318 RKIISWFADRPQAHFGIHRLVSLGHSSGKKAGDWYGPSVAAHIIRKAVDC--------CS 369
Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 283
+ VS D +G V + + S + + G A W +++LVP+ LG E NP
Sbjct: 370 EAGNLVVYVSQDCTVYKGD--VANLANKSEDRTAWDPG-AVWKAVIILVPMRLGGEAFNP 426
Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 343
Y+ ++ +GI+GGKP S Y VG Q+++ +YLDPH QP ++ K++ +
Sbjct: 427 AYVDCVKELLKLEFCIGIIGGKPRHSLYFVGYQDDALLYLDPHYCQPFVDTTKENFPLE- 485
Query: 344 STYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
++H + R +DPS IGFY +
Sbjct: 486 -SFHCNSPRKTAFTKVDPSCTIGFYAHHR 513
>gi|210063823|gb|ACJ06587.1| ATG4 protein [Magnaporthe oryzae]
Length = 491
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 103/309 (33%), Positives = 141/309 (45%), Gaps = 56/309 (18%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGDSKI----------------TSDVGWGCMLRS 133
F DF SRI ++YR GF DP S++ T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFESIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R ALA +Y G P V D
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367
Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 427
Query: 364 AIGFYCRDK 372
IGF D+
Sbjct: 428 LIGFLILDE 436
>gi|50543736|ref|XP_500034.1| YALI0A13277p [Yarrowia lipolytica]
gi|62899740|sp|Q6CH28.1|ATG4_YARLI RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49645899|emb|CAG83963.1| YALI0A13277p [Yarrowia lipolytica CLIB122]
Length = 545
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 115/346 (33%), Positives = 149/346 (43%), Gaps = 97/346 (28%)
Query: 96 EFNQDFSSRILISYRKGF--------------------------DPIGDSKITSDVGWGC 129
+F D SRI +SYR GF DP G TSDVGWGC
Sbjct: 64 DFLADVQSRIWLSYRTGFPLIPKSDGSGTIHLGKLKNMIRGGGFDPRG---YTSDVGWGC 120
Query: 130 MLRSSQMLVAQALLFHRLGRPWR----------------------------KPLQKPFDR 161
M+R+SQ L+A ALLF LGR WR K +
Sbjct: 121 MIRTSQSLLANALLFRHLGRGWRWNKGDDFVYLSEGNTESRGGESRNGGANKEQETAVSE 180
Query: 162 EYV----EILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRA 216
E I+ F DS SPFSIH ++ G KA AG W GP A S AL
Sbjct: 181 ETAVSEETIISWFLDSPDSPFSIHKFVRHGEKACSTPAGDWFGPSAAGSSIYAL------ 234
Query: 217 ETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
C P + +Y +G GG V D+ + G P+L+L
Sbjct: 235 -----CNEFPDSGLKVYY-----NGNGGGD--VYEDE------LLETG----FPLLVLCG 272
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L LG++ VNP Y +LR + PQS+GI GG+P S Y G Q E YLDPH +P +
Sbjct: 273 LRLGIDNVNPIYWDSLRQMLSLPQSVGIAGGRPFTSHYFFGFQGEQLFYLDPHQPKPAVK 332
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+ DT+++HS I +HL +DPS+ +GFY + TF+
Sbjct: 333 T----TDKDTTSFHSSRIWKLHLKEMDPSMLVGFYITSEADWETFK 374
>gi|354470829|ref|XP_003497647.1| PREDICTED: cysteine protease ATG4C [Cricetulus griseus]
Length = 458
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 156/370 (42%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENSDSDSWTSNTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELRTTALSLKETIGKYSDDHAVQNEIYHRKIISWFGDSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTNSSTSGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|189194545|ref|XP_001933611.1| peptidase family C54 protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187979175|gb|EDU45801.1| peptidase family C54 protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 470
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 133/263 (50%), Gaps = 42/263 (15%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
N + F DF SRI ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
+RS Q ++A AL RLGR WR ++P +E+ +++ +F D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDVVAMFADDPRAPFSIHRFVEHGAAV 209
Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + L R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLVHKNR-EAGL-------KVYV-SGD------GADVY--E 252
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 310 TYIVGVQEESAIYLDPHDVQPVI 332
Y V Q + YLDPH +P++
Sbjct: 311 HYFVATQANNFFYLDPHSTRPLL 333
>gi|308491308|ref|XP_003107845.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
gi|308249792|gb|EFO93744.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
Length = 518
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 101/309 (32%), Positives = 151/309 (48%), Gaps = 49/309 (15%)
Query: 87 DAAG-NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
DA G ++G +F D+ SR+ I+YR F P+ ++ T+D GWGCM+R++QM+VAQA++ +
Sbjct: 159 DANGVSSGFEDFCSDYYSRLWITYRTDFAPLLNTDTTTDCGWGCMIRTTQMMVAQAIMLN 218
Query: 146 RLGRPWRKPLQKP-----------FDREYVE---ILHLFGDSETSPFSIHNLLQ--AGKA 189
R GR WR +K FDRE ++ IL LF D +SP IH +++ A +
Sbjct: 219 RFGREWRFVRRKKSYVTINGEETDFDREKIKEWMILKLFEDKPSSPLGIHRMVEISAKEK 278
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
A GSW P EA+ ++A L +I ++GD A + I
Sbjct: 279 GKKAVGSWYSPS------EAVFIMKKA--------LTESISPLTGD------TAMYLSI- 317
Query: 250 DASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
D H +W L+LV +V LG ++NP Y+P L F+ LG+ GG+P
Sbjct: 318 DGRVHIRDIEVETKNWMKTLILVIVVRLGAAELNPIYVPHLMRLFSMESCLGVTGGRPDH 377
Query: 309 STYIVGVQEESAIYLDPHDVQPVINI----------GKDDLEADTSTYHSDVIRHIHLDS 358
S + VG + IYLDPH I I K + +YH ++ +H
Sbjct: 378 SCWFVGFYGDQIIYLDPHVAHEYIPIDMNFNVNMTDNKKSKKCPERSYHCRLLSKMHFLD 437
Query: 359 IDPSLAIGF 367
+DPS A+ F
Sbjct: 438 MDPSCALCF 446
>gi|452004375|gb|EMD96831.1| hypothetical protein COCHEDRAFT_1123524 [Cochliobolus
heterostrophus C5]
Length = 471
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 101/259 (38%), Positives = 129/259 (49%), Gaps = 42/259 (16%)
Query: 92 NGLAEFNQDFSSRILISYRKGF-------DPIGDSKI--------------TSDVGWGCM 130
N + F DF SRI ++YR GF DP S + TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFMAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
+RS Q ++A AL RLGR WR KP +E+ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAV 209
Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + LA R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYV-SGD------GADVY--E 252
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + ++ GQ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEVAIDDDGQ--WQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 310 TYIVGVQEESAIYLDPHDV 328
Y V Q + YLDPH
Sbjct: 311 HYFVATQGNNFFYLDPHST 329
>gi|332029697|gb|EGI69576.1| Cysteine protease ATG4B [Acromyrmex echinatior]
Length = 383
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 100/299 (33%), Positives = 152/299 (50%), Gaps = 40/299 (13%)
Query: 92 NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 146
N + E + +D S++ +YRKGF PIG +S TSD GWGCMLR QM++AQAL+
Sbjct: 31 NAIKELDAIRRDIRSKLWFTYRKGFVPIGGCNSTFTSDKGWGCMLRCGQMVLAQALITLH 90
Query: 147 LGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
LG+ W+ + + + Y++IL F D + FSIH + G + G G W GP + +
Sbjct: 91 LGKDWQW-MPETKNNTYLKILRRFEDKRAAAFSIHQIALMGASEGKEVGQWFGPNTIAQV 149
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS------- 259
+ L + + I+V + + ++D R C V
Sbjct: 150 LKKLIVYDEWSS--------LTIHVALDN---------TLIVNDILRQCRVEGGVTAEAD 192
Query: 260 -----KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
+ + W P+LLL+PL LGL ++NP YI L+ +F QSLG++GGKP + Y +G
Sbjct: 193 GEIPLRAPSQWKPLLLLIPLRLGLSEINPVYINGLKTSFKISQSLGVIGGKPNLALYFIG 252
Query: 315 VQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+ IYLDPH Q I ++++E D S YH I + +DPS+A+ F+C
Sbjct: 253 CVGDEVIYLDPHTTQKSGSIEDKISEEEIEMDIS-YHCKSASRIPITGMDPSVALCFFC 310
>gi|330935035|ref|XP_003304808.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
gi|311318464|gb|EFQ87127.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
Length = 470
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 133/263 (50%), Gaps = 42/263 (15%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
N + F DF SRI ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
+RS Q ++A AL RLGR WR ++P +E+ +I+ +F D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDIVAMFADDPRAPFSIHRFVEHGAAV 209
Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + L + E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLVH-KNKEVGL-------KVYV-SGD------GADVY--E 252
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 310 TYIVGVQEESAIYLDPHDVQPVI 332
Y V Q + YLDPH +P++
Sbjct: 311 HYFVATQANNFFYLDPHSTRPLL 333
>gi|168693565|ref|NP_001108301.1| uncharacterized protein LOC100137698 [Xenopus laevis]
gi|163915830|gb|AAI57741.1| LOC100137698 protein [Xenopus laevis]
Length = 468
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 147/324 (45%), Gaps = 56/324 (17%)
Query: 91 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
++ + F +DF SR+ ++YR+ F + + +T+D GWGCM+RS QML+AQ LL H L R
Sbjct: 92 DDEIDRFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLMHLLSRE 151
Query: 151 W----------------------RKPL-------------------QKPF-DREYVEILH 168
W R PL + P ++ + I+
Sbjct: 152 WTWPEALYTHFVEMEPIRSSSPSRMPLSSLATSHSASDCWPHAHSSRAPHGNQVHRNIIR 211
Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
F D ++PF +H ++ G +G AG W GP +A + C+ ++
Sbjct: 212 WFSDHPSAPFGLHRMVALGSIFGKKAGDWYGP-------SIVAHIIKKAIETSCEVAELS 264
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
+YV S D + + D + G+A +++LVP LG E NP Y
Sbjct: 265 VYV-SQDCTVYKADIEQLFAGDVPHAETSRDAGKA----VIILVPARLGGETFNPVYKHC 319
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L+ P LGI+GGKP S Y +G Q+ +YLDPH Q I+ ++D + ++H
Sbjct: 320 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYSQSYIDTSRNDFPLE--SFHC 377
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDK 372
+ R I + +DPS FY +++
Sbjct: 378 NTPRKISITRMDPSCTFAFYAQNR 401
>gi|17544636|ref|NP_502208.1| Protein ATG-4.2 [Caenorhabditis elegans]
gi|5824904|emb|CAB54515.1| Protein ATG-4.2 [Caenorhabditis elegans]
Length = 521
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/330 (31%), Positives = 154/330 (46%), Gaps = 56/330 (16%)
Query: 68 SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 127
+D+ LG + + DE+ +G F D+ SR+ I+YR F + D+ T+D GW
Sbjct: 146 NDVVFLGRRYSTSVDES----GLRSGFENFCSDYYSRLWITYRTDFPALLDTDTTTDCGW 201
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQK-----------PFDREYVE---ILHLFGDS 173
GCM+R++QM+VAQA++ +R GR WR +K FDRE ++ IL LF D
Sbjct: 202 GCMIRTTQMMVAQAIMVNRFGRDWRFTRRKRSHVAAHGDEDDFDREKIQEWMILKLFEDK 261
Query: 174 ETSPFSIHNLL---QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 230
T+P IH ++ GK A GSW P EA+ ++A L S P+
Sbjct: 262 PTAPLGIHKMVGIAAMGKGKK-AVGSWYSPS------EAVFIMKKA---LTESSSPLT-- 309
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTL 289
G ++ D H +W L+LV +V LG ++NP Y+P L
Sbjct: 310 ----------GNTAMLLSIDGRVHIRDIEVETKNWMKKLILVIVVRLGAAELNPIYVPHL 359
Query: 290 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH--------DVQPVINI----GKD 337
F LGI GG+P S++ VG + IYLDPH D+ P N+ K
Sbjct: 360 MRLFAMESCLGITGGRPDHSSWFVGYYGDQIIYLDPHVAHEYIPIDINPNTNVVDSDSKK 419
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+ +YH ++ +H +DPS A+ F
Sbjct: 420 AKKCPEKSYHCRLLSKMHFFDMDPSCALCF 449
>gi|451855330|gb|EMD68622.1| hypothetical protein COCSADRAFT_79257 [Cochliobolus sativus ND90Pr]
Length = 473
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 96/259 (37%), Positives = 121/259 (46%), Gaps = 42/259 (16%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
N + F DF SRI ++YR GF I S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFTAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
+RS Q ++A AL RLGR WR KP +E+ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAV 209
Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + LA R E GL +YV D V ID
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYVSGDGADVYEDKLKEVAID 261
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D +W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 262 D-----------DGEWQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 310 TYIVGVQEESAIYLDPHDV 328
Y V Q + YLDPH
Sbjct: 311 HYFVATQGNNFFYLDPHST 329
>gi|432855098|ref|XP_004068071.1| PREDICTED: cysteine protease ATG4C-like [Oryzias latipes]
Length = 482
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 168/387 (43%), Gaps = 90/387 (23%)
Query: 65 SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAE---------FNQDFSSRILISYRKGFD 114
S S + LLG C H A D+ D A E F +DF+SR+ ++YR+ F
Sbjct: 36 SRNSPVLLLGRCYHFKADDDGSADEASCREPEEGFSMGNVEAFRKDFTSRVWLTYREEFP 95
Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQ----------- 156
P+ S +T+D GWGC+LR+ QM++AQAL+ H LGR W +PL
Sbjct: 96 PLPGSTLTTDCGWGCLLRAGQMMLAQALVLHFLGRDWTWSEALTLQPLDTETWTASAAKR 155
Query: 157 -------------KPFDREYVE-----------------------ILHLFGDSETSPFSI 180
K DR++ E I+ FGD+ ++ +
Sbjct: 156 LVASLEASLQGSPKNSDRQHSEPQSSSQGSAEEAEAHLKEMYHRTIISWFGDTSSALLGL 215
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG--CQSLPMAIYVVSGD-ED 237
H L++ G G AG+W GP + + + ++GL + V S D D
Sbjct: 216 HRLVRLGLTMGKNAGNWYGPAVVAHILKKAVE-EAMDSGLAGITAYVSQDCTVYSADVAD 274
Query: 238 GER--------------GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 283
+ GG P +D S+ QA +++L+P+ LG EK+NP
Sbjct: 275 CHKPPSARQASVSPPIAGGGP--SKEDQPGSASILPDSQA----VIILIPVRLGGEKINP 328
Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 343
Y ++ + +GI+GGKP + Y VG Q++S IY+DPH Q +++ D
Sbjct: 329 EYFEFVKNILSVEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSNGDFP--L 386
Query: 344 STYHSDVIRHIHLDSIDPSLAIGFYCR 370
++H + I +DPS IGFY R
Sbjct: 387 QSFHCPSPKKIPFTRMDPSCTIGFYSR 413
>gi|126305934|ref|XP_001364974.1| PREDICTED: cysteine protease ATG4C [Monodelphis domestica]
Length = 460
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/372 (29%), Positives = 162/372 (43%), Gaps = 80/372 (21%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 111
S S + LLG C+ +E A AG N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVLLLGKCYHFKSEEENDPAPVGSGWAGENEHVVIYGNVEEFRRDFISRIWLTYRE 95
Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------- 154
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDIENSDSASWTSHT 155
Query: 155 ------------------------LQKPF-----DRE------YVEILHLFGDSETSPFS 179
L++P D E + +I+ FGDS + F
Sbjct: 156 VKKLTASFEASLTGERTPKVPPSILKEPRRTGSEDEEGRNELCHRKIISWFGDSPLACFG 215
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
+H L++ GK G AG W GP + R G + IYV +D
Sbjct: 216 LHQLIEYGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVA---QDCT 267
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
A V+ S ++ +A I+LLVP+ LG E+ N Y+ ++ + +
Sbjct: 268 VYKADVIDKQGISAGLET-TEDKA----IILLVPVRLGGERTNMDYLDFVKGILSLEYCV 322
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 359
GI+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +
Sbjct: 323 GIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKM 380
Query: 360 DPSLAIGFYCRD 371
DPS +GFYCR+
Sbjct: 381 DPSCTVGFYCRN 392
>gi|355750993|gb|EHH55320.1| hypothetical protein EGM_04504, partial [Macaca fascicularis]
Length = 268
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 92/268 (34%), Positives = 133/268 (49%), Gaps = 40/268 (14%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVG 314
TL+ F PQSLG++GGKP ++ Y +G
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIG 268
>gi|358381369|gb|EHK19044.1| hypothetical protein TRIVIDRAFT_181799 [Trichoderma virens Gv29-8]
Length = 451
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 141/301 (46%), Gaps = 50/301 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 133
F +D +++ ++YR GFDPI S +SD GWGCM+RS
Sbjct: 117 FLEDMAAKFWMTYRSGFDPIAKSVDPRATSALSFAVRIKSTLSDPTGFSSDSGWGCMIRS 176
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A + +LGR WR+ +E +++ +F D +P+SIHN ++ G A G
Sbjct: 177 GQSLLATTIGILQLGRDWRR---GKCQQEERQLISMFADDPRAPYSIHNFVRHGATACGK 233
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A A+C +A T LP+ +Y + +D + D
Sbjct: 234 FPGEWFGP-------SATAQCIQALTS--ASGLPLKVYSPNDGQDVYEDSFMKIAKPD-- 282
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
GQ D+ P L+L+ LG++K+ P Y L PQS+GI GG+P +S Y
Sbjct: 283 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLLAALQMPQSVGIAGGRPSSSHYF 333
Query: 313 VGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VG Q YLDPH + I D E D + H+ +R +HL +DPS+ IGF
Sbjct: 334 VGSQGSYLFYLDPHHTRKAIPYHADVTKYTEEDIESCHTSRLRRLHLKEMDPSMLIGFLI 393
Query: 370 R 370
R
Sbjct: 394 R 394
>gi|358390472|gb|EHK39877.1| hypothetical protein TRIATDRAFT_208244 [Trichoderma atroviride IMI
206040]
Length = 452
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 101/308 (32%), Positives = 143/308 (46%), Gaps = 50/308 (16%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVG 126
G A F +D SS+ ++YR GF+PI S +SD G
Sbjct: 113 GTGWPAGFVEDMSSKFWMTYRSGFEPIPKSVDPKAASALSFSMRIKSTLSDSAGFSSDSG 172
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+RS Q L+A + RLGR WR+ + +R ++ +F D +P+SIHN ++
Sbjct: 173 WGCMIRSGQSLLATTIGILRLGRDWRRDQSQEEERH---LISMFADDPRAPYSIHNFVRH 229
Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G A G G W GP A A+C +A T L + IY + +D
Sbjct: 230 GATACGKYPGEWFGP-------SATAQCIQALTS--SSGLSLNIYSPNDGQD-------- 272
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ + S S GQ + P L+L+ LG++K+ P Y L PQS+GI GG+
Sbjct: 273 --VYEDSFMKIAKSDGQT-FNPTLILIRTRLGIDKITPIYWDALIAALHMPQSVGIAGGR 329
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPS 362
P +S Y VG Q YLDPH + I D E D + H+ +R IH+ +DPS
Sbjct: 330 PASSHYFVGSQGSYLFYLDPHHTRKAIPYHDDVTKYTEEDIESCHTSRLRRIHIKEMDPS 389
Query: 363 LAIGFYCR 370
+ IGF R
Sbjct: 390 MLIGFLIR 397
>gi|195159572|ref|XP_002020652.1| GL15485 [Drosophila persimilis]
gi|194117602|gb|EDW39645.1| GL15485 [Drosophila persimilis]
Length = 409
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 100/312 (32%), Positives = 155/312 (49%), Gaps = 38/312 (12%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D GWGCMLR QM++AQAL+ LGR W + D Y++I++ F D S +SIH
Sbjct: 92 TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ G++ A G W+GP + + + L L + ++V
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMD-------- 194
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V +DD C +G A W P+LL++PL LG+ +NP YIP L+ S G++
Sbjct: 195 -STVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----DLEADTSTYHSDVIRHIHLDS 358
GG+P + Y +G E+ +YLDPH Q +G+ + E D TYH + +
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQKTGVVGQKTSSGEQEHD-ETYHQKHAARLSFSA 308
Query: 359 IDPSLAIGFYCR 370
+DPSLA+ F C+
Sbjct: 309 MDPSLAVCFLCK 320
>gi|157818033|ref|NP_001101418.1| cysteine protease ATG4C [Rattus norvegicus]
gi|149044549|gb|EDL97808.1| similar to APG4 autophagy 4 homolog C [Rattus norvegicus]
Length = 458
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 155/370 (41%), Gaps = 78/370 (21%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKVLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG------------------------- 148
I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIESSDSDSWTSNTIH 155
Query: 149 -------------RPWRKPL--------QKPFDRE------YVEILHLFGDSETSPFSIH 181
R R P + P D + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELRTPAVSLKETSGKHPDDHAVQSEIYHRQIISWFGDSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNIDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRD 371
S IGFYCR+
Sbjct: 381 SCTIGFYCRN 390
>gi|346975631|gb|EGY19083.1| peptidase family C54 protein [Verticillium dahliae VdLs.17]
Length = 449
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 143/321 (44%), Gaps = 52/321 (16%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 120
+A DEA+ G + F DF S+ ++YR F+PI S
Sbjct: 98 LAYDEAMNQDGG--WPSAFLDDFESKFWMTYRSDFEPIAKSTDPRAASVLSLSMRIKSQF 155
Query: 121 -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 175
+SD GWGCM+RS Q L+A A+ LGR WR+ + +R+ +L F D
Sbjct: 156 MDQAGYSSDSGWGCMIRSGQSLLANAMAVLDLGRDWRRGVAAEKERQ---LLSKFADDPK 212
Query: 176 SPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
+P+SIH +Q G A G G W GP A R +AL + +Y
Sbjct: 213 APYSIHRFVQHGAVACGKYPGEWFGPSATARCIQALVNANEPH---------LRVYST-- 261
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
G P V D R + + P L+LV LG++K+ P Y L
Sbjct: 262 ------GDGPDVYED---RFFDIAKPSGETFHPTLILVGTRLGIDKITPVYWDALIAALQ 312
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVI 351
PQS+GI GG+P +S Y +G Q YLDPH + + +D +AD + H+ +
Sbjct: 313 MPQSIGIAGGRPSSSHYFIGAQGSFLFYLDPHHTRTALPYYQDPTLYAQADVDSVHTRRL 372
Query: 352 RHIHLDSIDPSLAIGFYCRDK 372
R +H+ +DPS+ IGF D+
Sbjct: 373 RRLHVREMDPSMLIGFVIHDE 393
>gi|125986465|ref|XP_001356996.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
gi|54645322|gb|EAL34062.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
Length = 409
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 98/311 (31%), Positives = 153/311 (49%), Gaps = 36/311 (11%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D GWGCMLR QM++AQAL+ LGR W + D Y++I++ F D S +SIH
Sbjct: 92 TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ G++ A G W+GP + + + L L + ++V
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMD-------- 194
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V +DD C +G A W P+LL++PL LG+ +NP YIP L+ S G++
Sbjct: 195 -STVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 359
GG+P + Y +G E+ +YLDPH Q +G+ + TYH + ++
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQRTGVVGQKTSSGEQEHDETYHQKHAARLSFSAM 309
Query: 360 DPSLAIGFYCR 370
DPSLA+ F C+
Sbjct: 310 DPSLAVCFLCK 320
>gi|296415785|ref|XP_002837566.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633439|emb|CAZ81757.1| unnamed protein product [Tuber melanosporum]
Length = 409
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 95/302 (31%), Positives = 140/302 (46%), Gaps = 49/302 (16%)
Query: 97 FNQDFSSRILISYRKGFDPI---------------------GDSKITSDVGWGCMLRSSQ 135
F +DF S + ++YR F PI TSD GWGCM+RS Q
Sbjct: 86 FLEDFESTLWMTYRSDFKPIPRVADYNDKLTFLTSIRSHLDKAEGFTSDSGWGCMIRSGQ 145
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAA 194
++A AL RLGR WR+ + KP E +L LF D +PFSIH ++ G+ G
Sbjct: 146 AVIANALAHLRLGRGWRRGM-KP--EEEKRLLALFADDPRAPFSIHKFVRHGEVECGKNP 202
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 254
G W GP A +AL + + +Y + ++ E V ++
Sbjct: 203 GEWFGPSAAAMCIQALTH--------AYEPAGLRVYQTNSNDLYEEDFRKVAVVN----- 249
Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
VF P L+L + LG+E++ Y L PQ++GI GG+P +S Y +
Sbjct: 250 -GVFK-------PTLVLAGIRLGIERITNIYYEPLAACLRMPQTVGIAGGRPSSSHYFIA 301
Query: 315 VQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
VQ E+ YLDPH +P++ +D E + T H+ IR +H+ +DPS+ I F RD
Sbjct: 302 VQGENFFYLDPHTCRPILPFKENPQDYTEEEVDTCHTRRIRRLHIREMDPSMLIAFLIRD 361
Query: 372 KG 373
+
Sbjct: 362 EA 363
>gi|353227348|emb|CCA77858.1| hypothetical protein PIIN_00505 [Piriformospora indica DSM 11827]
Length = 1257
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 94/294 (31%), Positives = 138/294 (46%), Gaps = 61/294 (20%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 121
F D++SR+ ++YR F PI D+ +
Sbjct: 317 FYSDYTSRVWLTYRNTFPPIRDTALSCLEPVASRSTHNNSSSTDISQPLPSPSKPRWPWS 376
Query: 122 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE---ILHLFGDS 173
TSD GWGCMLR+ Q L+A AL+ L R WR+P + +YV+ IL F D+
Sbjct: 377 GEKGWTSDAGWGCMLRTGQSLLANALIHLHLSRSWRRPTHPSYSPDYVQYVRILTWFLDN 436
Query: 174 ET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC--------- 222
+ +PF IH + AGK G GSW GP + + L + + GL
Sbjct: 437 PSPLAPFGIHRMALAGKELGKEVGSWFGPSTAAGAIKRLV-GEFEDAGLEVALAVDSVVY 495
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEK 280
QS A S +++G G + V + + +G W P+L+LV + LG++
Sbjct: 496 QSDVYAASAASRNQNGVEGDSKTVGTSKSRKKG----QGPPKWGNRPVLILVGIRLGIDG 551
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
VNP Y +++ FTFPQ++GI GG+P +S Y VG Q +S YLDPH +P I +
Sbjct: 552 VNPIYYESVKTLFTFPQTVGIAGGRPSSSYYFVGAQGDSLFYLDPHHTRPAIPL 605
>gi|268570274|ref|XP_002640735.1| Hypothetical protein CBG19805 [Caenorhabditis briggsae]
Length = 481
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 166/386 (43%), Gaps = 88/386 (22%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
IS T IW LG + + +G+ + +SR +YR+ F PIG + +
Sbjct: 25 ISIDTFPIWALG-----------KEISKEDGIDAMKKYMTSRFWFTYRRNFSPIGGTGPS 73
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D WGCMLR +QML+ + LL +GR + ++K D Y +IL +F D + + +SIH
Sbjct: 74 TDQYWGCMLRCAQMLLGEVLLRRHIGRHFEWDIEKTSDV-YEKILQMFFDEKDALYSIHQ 132
Query: 183 LLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQ-SLPMAIYVV 232
+ Q G + G W GP + W +A + L Q +L MA
Sbjct: 133 IAQMGVSEGKEVSEWFGPNTAAQVIKKLTIFDDWSNIAVHVALDNILVKQDALTMATTYP 192
Query: 233 SGDE----DGERG-------GAPVVCID-DASRHCSVFSKGQ-------------ADWTP 267
S D GE G + ++C++ D + F G +W P
Sbjct: 193 SEDAVKLIMGEFGFKSDRISSSHIICMNLDYFKKLLNFENGLVEKHYTSTVPANGTEWRP 252
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
+LL++PL LGL +N Y+ ++ F PQ +GI+GGKP + Y VG+ YLDPH
Sbjct: 253 LLLMIPLRLGLTSINSCYLSAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHH 312
Query: 328 VQP--------------------------VINIGKDDLE---------------ADTSTY 346
+P + + G +LE + STY
Sbjct: 313 CRPKTSKFFVEKEQQQQSSGDSTPEKVEKIDDNGFHELEDLEPLPSQTSDVYTKMNDSTY 372
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDK 372
H +++ + DSIDPSLA+ +C +
Sbjct: 373 HCQMMQWMEYDSIDPSLALALFCETR 398
>gi|327270876|ref|XP_003220214.1| PREDICTED: cysteine protease ATG4C-like [Anolis carolinensis]
Length = 459
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 156/368 (42%), Gaps = 78/368 (21%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDA-AGNN----------GLAEFNQDFSSRILISYRKGF 113
S S ++LLG C+ DE + G+N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKTDEPTEQSPNGSNYDVTEEEVSRNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------------------ 155
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIKGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWPDALVNENPESESWTSHTVK 155
Query: 156 ------------QKPFDREYV----------------------EILHLFGDSETSPFSIH 181
+K F + + +I+ FGDS + F +H
Sbjct: 156 KLTASFEASLIGEKEFKNQSIPPRQIRKRDWGKRESRDEHYHRKIVSWFGDSPLANFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ G G AG W GP + L R + E + + +YV
Sbjct: 216 RLIEYGNKSGKMAGDWYGPAVVAH----LLR-KAVEEAKDPELQGITVYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D CS+ + +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYKSDVVEMQCSLKDSEKPGAKSVIILIPVRLGGERTNMEYLEFVKGILSLEYCIGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
VGG+P S Y G Q++S IY+DPH Q +++ + + ++H + + +DP
Sbjct: 323 VGGRPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKNFPLE--SFHCPSPKKMSFKKMDP 380
Query: 362 SLAIGFYC 369
S IG YC
Sbjct: 381 SCTIGLYC 388
>gi|256071261|ref|XP_002571959.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
gi|353229490|emb|CCD75661.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
Length = 376
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 96/279 (34%), Positives = 142/279 (50%), Gaps = 37/279 (13%)
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFS 179
TSD GWGCM R QML+AQAL+ H LGR WR + ++I+ F DS + SP S
Sbjct: 67 TSDCGWGCMFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLS 126
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSG 234
+H L+Q G W GP ++C A+ R + L + + +Y V+
Sbjct: 127 LHRLVQMSDR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYR 180
Query: 235 DE--DGERG------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLG 277
+E D RG P + D H +++ + Q+D T ILLL+PL+ G
Sbjct: 181 EEIIDLARGLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFG 236
Query: 278 L-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
++NPRYI + F+ P +G++GG+ S+Y VG Q S IYLDPH QP N+
Sbjct: 237 KGNRINPRYIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNS 296
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLL 375
D ++H + + + +++PS A+GFYCR +G L
Sbjct: 297 PKFSVD--SWHCPIPKTMSAANLNPSCAVGFYCRTRGEL 333
>gi|403291503|ref|XP_003936827.1| PREDICTED: cysteine protease ATG4B [Saimiri boliviensis
boliviensis]
Length = 319
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 87/258 (33%), Positives = 128/258 (49%), Gaps = 25/258 (9%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C DA+RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADANRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDSCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 233 RMSIAELDPSIAVGFFCK 250
>gi|389750681|gb|EIM91754.1| hypothetical protein STEHIDRAFT_88418 [Stereum hirsutum FP-91666
SS1]
Length = 1286
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 102/312 (32%), Positives = 145/312 (46%), Gaps = 57/312 (18%)
Query: 91 NNGLAEFNQDFSSRILISYRKGFDPIGDSKIT---------------------------- 122
NN F DF+SR+ ++YR F PI DS +T
Sbjct: 333 NNWPPVFYSDFTSRVWLTYRSHFQPIRDSTLTALESEQANMAHAGPVIMASSPPTKKWGW 392
Query: 123 ---------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLF 170
SD GWGCMLR+ Q L+A AL+ LGR WR+P + +Y V++L F
Sbjct: 393 PGSGEKGWTSDAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQMLTWF 452
Query: 171 GDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
DS T PFS+H + AGK G G W GP + + L E GLG +A
Sbjct: 453 FDSPTPHCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVS---IA 508
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
D P + + + + G+A +L+L+ + LGL+ VNP Y T
Sbjct: 509 SDSQIFQSDVFAASHPPMDSPSSKKKLASTWGGRA----VLVLIGIRLGLDGVNPIYYET 564
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
++ +TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + L ST +
Sbjct: 565 IKALYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAV-----PLRPPPST--N 617
Query: 349 DVIRHIHLDSID 360
D++ I +SI+
Sbjct: 618 DIVLDISRESIE 629
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 25/39 (64%)
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
A+ T+H + +R + L +DPS+ +GF CRD+G F+
Sbjct: 836 AELKTFHCERVRKMPLSGLDPSMLVGFLCRDEGDWEDFK 874
>gi|56118282|ref|NP_001007883.1| cysteine protease ATG4C [Xenopus (Silurana) tropicalis]
gi|61211764|sp|Q68EP9.1|ATG4C_XENTR RecName: Full=Cysteine protease ATG4C; AltName:
Full=Autophagy-related protein 4 homolog C
gi|51258902|gb|AAH80152.1| apg4c protein [Xenopus (Silurana) tropicalis]
gi|89269108|emb|CAJ81923.1| APG4 autophagy 4 homolog C (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
Length = 450
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 155/371 (41%), Gaps = 93/371 (25%)
Query: 67 TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 114
S ++LLG C+ +++ D N+G + EF +DF SRI ++YR+ F
Sbjct: 38 NSPVFLLGKCYHFKYEDSSVTSDGGSNSGSESKEDLSGNVDEFRKDFISRIWLTYREEFP 97
Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 151
I S T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 98 QIETSSWTTDCGWGCTLRTGQMLLAQGLIVHFLGRDWTWTEALDIFSSESEFWTANTARK 157
Query: 152 -------------------RKPLQKPFDREYVEILH-----LFGDSETSPFSIHNLLQAG 187
++PL + E H F D + F +H L++ G
Sbjct: 158 LTPSLETSFSENNECVSSNKQPLHNCDKKSNSEDFHQKIISWFADYPLAYFGLHQLVKLG 217
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
K G AG W GP + L R E+ D E G +
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256
Query: 248 IDDASRHCSVFSKGQADW-------TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
D C+++S D +++LVP+ LG E+ N Y ++ + +G
Sbjct: 257 AQD----CTIYSADVYDLQCNKGTEKAVVILVPVRLGGERTNMEYFEFVKGILSLEFCIG 312
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG Q++S IY+DPH Q +++ + + ++H + + +D
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSVKNFPLE--SFHCPSPKKMSFKKMD 370
Query: 361 PSLAIGFYCRD 371
PS IGFYCR+
Sbjct: 371 PSCTIGFYCRN 381
>gi|296206033|ref|XP_002750034.1| PREDICTED: cysteine protease ATG4B isoform 2 [Callithrix jacchus]
Length = 319
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 87/258 (33%), Positives = 127/258 (49%), Gaps = 25/258 (9%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C DA RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADADRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDSCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 233 RMSIAELDPSIAVGFFCK 250
>gi|14042153|dbj|BAB55127.1| unnamed protein product [Homo sapiens]
Length = 331
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 86/258 (33%), Positives = 128/258 (49%), Gaps = 25/258 (9%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V+C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VLCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 233 RMSIAELDPSIAVGFFCK 250
>gi|254567087|ref|XP_002490654.1| Conserved cysteine protease required for autophagy [Komagataella
pastoris GS115]
gi|238030450|emb|CAY68374.1| Conserved cysteine protease required for autophagy [Komagataella
pastoris GS115]
Length = 531
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 124/265 (46%), Gaps = 49/265 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 135
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 194
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 252 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
+ G +D +TPIL+L+ + LG+EKVNP Y +LR + QS+GI GG+P +S
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281
Query: 311 YIVGVQEESAIYLDPHDVQPVINIG 335
Y G Q + YLDPH Q + G
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFG 306
>gi|328351041|emb|CCA37441.1| autophagy-related protein 4 [Komagataella pastoris CBS 7435]
Length = 758
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 124/265 (46%), Gaps = 49/265 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 135
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 194
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 252 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
+ G +D +TPIL+L+ + LG+EKVNP Y +LR + QS+GI GG+P +S
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281
Query: 311 YIVGVQEESAIYLDPHDVQPVINIG 335
Y G Q + YLDPH Q + G
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFG 306
>gi|308490628|ref|XP_003107506.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
gi|308251874|gb|EFO95826.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
Length = 478
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 160/354 (45%), Gaps = 80/354 (22%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
+GL + +SR+ +YR+ F PIG + ++D GWGCMLR +QML+ + LL +GR +
Sbjct: 47 DGLEAMKKYMTSRLWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLRRHIGRHF 106
Query: 152 RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR------ 205
++K Y +IL +F D + + +SIH + Q G G W GP +
Sbjct: 107 EWDIEKT-SEVYDKILQMFFDEKDALYSIHQIAQMGVTEGKKVSEWFGPNTAAQVIKKLT 165
Query: 206 ---SWEALARCQRAETGLGCQ-SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV---- 257
W +A + L + +L MA S + + + + + ++ ++
Sbjct: 166 IFDDWSNIAVHVALDNILVKEDALTMATTYPSDN------ASYIFAVHNFLKYFTLNLTF 219
Query: 258 --FSK-GQ-----------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
F++ GQ DW P+L+++PL LGL +NP Y+P ++ F PQ +GI+G
Sbjct: 220 PNFAENGQIEKPRPSSGCTTDWRPLLVMIPLRLGLTSINPCYLPAIQKFFELPQCVGIIG 279
Query: 304 GKPGASTYIVGVQEESAIYLDPH-----------------------------DVQPVINI 334
GKP + Y VG+ YLDPH D+Q I+
Sbjct: 280 GKPNLAHYFVGIAGTKLFYLDPHHCRAKTTKRDAGVTTNTMISSITTTDAQLDIQNQIDD 339
Query: 335 GK----DDLE------------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+DLE D STYH +++ + +SIDPSLA+ +C +
Sbjct: 340 SDFHKLEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEYESIDPSLALALFCETR 393
>gi|212645205|ref|NP_493375.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
gi|193247781|emb|CAB54483.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
Length = 454
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 94/333 (28%), Positives = 152/333 (45%), Gaps = 49/333 (14%)
Query: 84 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 142
ALG + + +G+ + +SR +YR+ F PIG + ++D GWGCMLR +QML+ + L
Sbjct: 39 ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 98
Query: 143 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
L +GR + ++K Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 99 LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 157
Query: 203 MCR---------SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
+ W +A A + + + + ED + +D +
Sbjct: 158 AAQVMKKLTIFDDWSNIA-VHVALDNILVKEDAITMATSYPSEDAVKLIMENGLVD---K 213
Query: 254 HCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ S G +W P+LL++PL LGL +NP Y+ ++ F PQ +GI+GG+P + Y
Sbjct: 214 NRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRPNHALY 273
Query: 312 IVGVQEESAIYLDPHDVQPVI-----------------NIGKDDLE-------------- 340
VG+ YLDPH +P ++G LE
Sbjct: 274 FVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDVGFSHLEELVPLPSQTADVYT 333
Query: 341 -ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
D STYH ++ I +++DPSLA+ +C +
Sbjct: 334 KMDDSTYHCQMMLWIEYENVDPSLALAMFCETR 366
>gi|453230621|ref|NP_001263575.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
gi|412974713|emb|CCO25637.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
Length = 481
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 94/333 (28%), Positives = 152/333 (45%), Gaps = 49/333 (14%)
Query: 84 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 142
ALG + + +G+ + +SR +YR+ F PIG + ++D GWGCMLR +QML+ + L
Sbjct: 66 ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 125
Query: 143 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
L +GR + ++K Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 126 LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 184
Query: 203 MCR---------SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
+ W +A A + + + + ED + +D +
Sbjct: 185 AAQVMKKLTIFDDWSNIA-VHVALDNILVKEDAITMATSYPSEDAVKLIMENGLVD---K 240
Query: 254 HCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ S G +W P+LL++PL LGL +NP Y+ ++ F PQ +GI+GG+P + Y
Sbjct: 241 NRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRPNHALY 300
Query: 312 IVGVQEESAIYLDPHDVQPVI-----------------NIGKDDLE-------------- 340
VG+ YLDPH +P ++G LE
Sbjct: 301 FVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDVGFSHLEELVPLPSQTADVYT 360
Query: 341 -ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
D STYH ++ I +++DPSLA+ +C +
Sbjct: 361 KMDDSTYHCQMMLWIEYENVDPSLALAMFCETR 393
>gi|195437827|ref|XP_002066841.1| GK24338 [Drosophila willistoni]
gi|194162926|gb|EDW77827.1| GK24338 [Drosophila willistoni]
Length = 400
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 91/287 (31%), Positives = 146/287 (50%), Gaps = 28/287 (9%)
Query: 92 NGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
N + E + +D SR+ +YR F P+G+ ++T+D GWGCMLR QM++AQAL+ LG
Sbjct: 52 NAIQELDLIRRDIQSRLWCTYRHSFVPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLG 111
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
R W + D Y++I++ F D+ S +S+H + G++ G W+GP + + +
Sbjct: 112 REWYWT-SECRDATYLKIVNRFEDARKSYYSLHQIALMGESQNKMVGEWLGPNTVAQILK 170
Query: 209 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 268
L C L I+V V +DD S+ W P+
Sbjct: 171 KLV-CFDDWCSL-------VIHVAMDS---------TVVLDDIYS----LSQDGESWKPL 209
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
LL++PL LG+ +NP Y+P L+ F S G++GG+P + Y VG ++ +YLDPH
Sbjct: 210 LLIIPLRLGITDINPIYVPALKRCFELESSCGMIGGRPNQALYFVGYVDDEVLYLDPHTT 269
Query: 329 QPVINIGKDDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
Q +G+ A+ TYH ++ ++DPSLA+ F C+ +
Sbjct: 270 QRTGAVGQKTTTAEQELDETYHQKYAARLNFSAMDPSLAVCFICKTQ 316
>gi|30109219|gb|AAH41862.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
gi|119623096|gb|EAX02691.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
[Homo sapiens]
gi|119623098|gb|EAX02693.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
[Homo sapiens]
Length = 321
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 81/257 (31%), Positives = 131/257 (50%), Gaps = 21/257 (8%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 1 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 61 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 120
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 121 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 173
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 174 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 233
Query: 356 LDSIDPSLAIGFYCRDK 372
+ ++DPS+A+GF+C+++
Sbjct: 234 ILNLDPSVALGFFCKEE 250
>gi|119591685|gb|EAW71279.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 331
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 86/258 (33%), Positives = 127/258 (49%), Gaps = 25/258 (9%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 233 RMSIAELDPSIAVGFFCK 250
>gi|340518098|gb|EGR48340.1| protease required for autophagy [Trichoderma reesei QM6a]
Length = 450
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 143/303 (47%), Gaps = 50/303 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 133
F +D +++ ++YR GF+PI S +SD GWGCM+RS
Sbjct: 115 FTEDMAAKFWMTYRSGFEPIPKSVDPRATSALSFSVRIKSTLTDPTGFSSDSGWGCMIRS 174
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A + +LGR WR+ + +E ++ +F D +PFSIHN ++ G A G
Sbjct: 175 GQSLLATTIATLQLGRDWRRGKNQ---QEERRLISMFADDPRAPFSIHNFVRHGATACGK 231
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A A+C +A T L + +Y + +D V D
Sbjct: 232 FPGEWFGP-------SATAQCIQALTS--SSDLDLHVYSPNDGQDVYEDSFMKVAKPD-- 280
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
GQ D+ P L+L+ LG++K+ P Y L T PQS+GI GG+P +S Y
Sbjct: 281 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLIATLQMPQSVGIAGGRPSSSHYF 331
Query: 313 VGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VG Q YLDPH + + +D + D + H+ +R +H+ +DPS+ IGF
Sbjct: 332 VGSQGSYLFYLDPHHTRKALPYHEDVANYTDEDIDSCHTSRLRRLHVKEMDPSMLIGFLI 391
Query: 370 RDK 372
R +
Sbjct: 392 RSE 394
>gi|426339167|ref|XP_004033531.1| PREDICTED: cysteine protease ATG4B isoform 1 [Gorilla gorilla
gorilla]
gi|426339169|ref|XP_004033532.1| PREDICTED: cysteine protease ATG4B isoform 2 [Gorilla gorilla
gorilla]
gi|221045722|dbj|BAH14538.1| unnamed protein product [Homo sapiens]
Length = 319
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 86/258 (33%), Positives = 127/258 (49%), Gaps = 25/258 (9%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 233 RMSIAELDPSIAVGFFCK 250
>gi|403356037|gb|EJY77606.1| Cysteine protease family C54 putative [Oxytricha trifallax]
gi|403376523|gb|EJY88241.1| Cysteine protease family C54 putative [Oxytricha trifallax]
Length = 480
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 149/294 (50%), Gaps = 39/294 (13%)
Query: 101 FSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFH----RLGRPWRKPL 155
F S +YR + PIG S SD GWGCM+R+ QML+ QA++ H L + + +
Sbjct: 154 FKSVTWFTYRNELELPIGSSTYHSDAGWGCMVRTGQMLLFQAMMRHVFEDNLKYEYIEKI 213
Query: 156 QKPFDREYVEILHLF---GDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
+ + EY+ +L LF G+ + SP+SI N+ G G W GP A+ + L +
Sbjct: 214 TE-YREEYLNLLRLFQDNGEGQFSPYSIQNIAFQGLKIDRKPGDWYGPQAISIVLKRLTK 272
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILLL 271
+ P+ + + VC++ + + +V + DWT + ++
Sbjct: 273 IYK----------PVKQFTM------------YVCLE-GNIYLNVIQEKSKDWTQSVFIV 309
Query: 272 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA--IYLDPHDVQ 329
+PL LGL + P Y+ +++ FTFPQ++GI GG+ ++ Y +G+ + S IYLDPH VQ
Sbjct: 310 IPLRLGLNYIEPEYLSSVKKVFTFPQNVGIAGGRENSALYFIGISDSSNNLIYLDPHLVQ 369
Query: 330 ---PVINIGKDD-LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
P N+ ++ S++H + + L+ + S+AIGFY RD + F+
Sbjct: 370 KSVPTCNMQTNEQFYQYESSFHCTKFKKMPLNRMCTSVAIGFYIRDYNDFLDFQ 423
>gi|395733089|ref|XP_002813143.2| PREDICTED: cysteine protease ATG4B isoform 2 [Pongo abelii]
Length = 331
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 85/251 (33%), Positives = 122/251 (48%), Gaps = 11/251 (4%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + L V+ R P
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRNSVPCAGAT 119
Query: 250 ----DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+ F PQSL
Sbjct: 120 AFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSL 179
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 359
G++GGKP ++ Y +G E IYLDPH QP + D S + + + +
Sbjct: 180 GVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPCRMSIAEL 239
Query: 360 DPSLAIGFYCR 370
DPS+A+GF+C+
Sbjct: 240 DPSIAVGFFCK 250
>gi|194384462|dbj|BAG59391.1| unnamed protein product [Homo sapiens]
Length = 319
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 86/258 (33%), Positives = 127/258 (49%), Gaps = 25/258 (9%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDSTVVMEEIRRLCRTS 112
Query: 245 VVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCR 370
+ + +DPS+A+GF+C+
Sbjct: 233 RMSIAELDPSIAVGFFCK 250
>gi|149037474|gb|EDL91905.1| autophagy-related 4B (yeast), isoform CRA_b [Rattus norvegicus]
Length = 319
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/260 (32%), Positives = 129/260 (49%), Gaps = 25/260 (9%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E + A
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEISKLCRAS 112
Query: 245 VVCIDDAS------RHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
+ C A+ RHC+ G W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 LPCAGAAALSMESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCRDK 372
+ + +DPS+A+GF+C+ +
Sbjct: 233 RMGIGELDPSIAVGFFCKTE 252
>gi|157115549|ref|XP_001658259.1| Autophagy-specific protein, putative [Aedes aegypti]
gi|108876876|gb|EAT41101.1| AAEL007228-PA [Aedes aegypti]
Length = 389
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 99/305 (32%), Positives = 153/305 (50%), Gaps = 36/305 (11%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + +D L +D +R+ +YR+GF PIG S++T+D GWGC
Sbjct: 28 VWILGKSYSATEDLDL-----------IRRDVQTRLWCTYRRGFVPIGGSQLTTDKGWGC 76
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL LGR W + + Y++I++ F DS+ +PFS+H + G++
Sbjct: 77 MLRCGQMVLAQALTQLHLGRDWSWTPETT-NETYLKIVNRFEDSKAAPFSLHQIALTGES 135
Query: 190 YGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
G W GP + + + L + + I+V + +
Sbjct: 136 SEEKRVGEWFGPNTVAQVLKKLVKFD--------DWCSLVIHVALDN---------TLAT 178
Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
D+ C V W P+LL++PL LGL ++NP Y+ L+ F + G+VGG+P
Sbjct: 179 DEVLELC-VDRSNPDSWKPLLLIIPLRLGLSEINPIYVDGLKKCFELAGNCGMVGGRPNQ 237
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLA 364
+ Y +G + A+YLDPH VQ IG D+ E D T+H R I+ +DPSLA
Sbjct: 238 ALYFIGYVADEALYLDPHTVQRSGTIGSKRDPDERELD-ETFHQKYARRINFKGMDPSLA 296
Query: 365 IGFYC 369
+ F C
Sbjct: 297 LCFLC 301
>gi|358369016|dbj|GAA85631.1| autophagy cysteine endopeptidase Atg4 [Aspergillus kawachii IFO
4308]
Length = 378
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 159/348 (45%), Gaps = 50/348 (14%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQD-EALGDAAGNNGLAE----------- 96
+RI + + P TS IW LG+ + +D G+ N +
Sbjct: 11 KRIVQYLWDPEPRNDEDPTSSIWCLGIEYHPEKDVSPRGETPDKNSARDNTTGTTNYRKP 70
Query: 97 --------FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
F DF SRI ++YR F PI ++ D M S L+A AL LG
Sbjct: 71 SEHAWPESFLLDFESRIWMTYRSNFPPI--PRVEGDDKSASMTLGS--LLANALSTLVLG 126
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSW 207
R WR+ + F+ E ++L LF D+ T+PFS+H ++ G ++ G G W GP A +
Sbjct: 127 RDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKFPGEWFGPSATAKCI 183
Query: 208 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 267
EAL+ C S + +YV + + + R +V + P
Sbjct: 184 EALSS--------QCGSPTLKVYVSNDTSEVYQ-----------DRFMNVARNSSGVFQP 224
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
L+L+ LG++ + P Y L+ T PQS+GI GG+P AS Y VG Q YLDPH
Sbjct: 225 TLILLGTRLGIDHITPVYWDGLKATLQLPQSVGIAGGRPSASHYFVGAQGSHLFYLDPHY 284
Query: 328 VQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+P + G+ + + TYH+ +R IH+ +DPS+ IGF RD+
Sbjct: 285 TRPALPDRQGGELYSKEEVDTYHTRRLRRIHVRDMDPSMLIGFLIRDQ 332
>gi|431912280|gb|ELK14417.1| Cysteine protease ATG4B [Pteropus alecto]
Length = 431
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 111/364 (30%), Positives = 164/364 (45%), Gaps = 43/364 (11%)
Query: 48 MRRIHERVLGPSRTGISSSTSDI--WLLGVCHKIAQDEALGDAAGNNGLA--EFNQDFSS 103
MR R P R+ +SS+ + W +++ L + E D +S
Sbjct: 1 MRPGPRRSCTPRRSALSSTLGEASDWCTAAAREVSAVSGLSQLQQDESYEKDEILSDVAS 60
Query: 104 RILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY 163
R+ +YRK F IG + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 61 RLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSY 120
Query: 164 VEILHLFGDSETSPFSIHNLL------QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
+L F D + S +SIH + + + S +GP +C+S+ A+ +R
Sbjct: 121 FSVLRAFMDRKDSYYSIHQIAPVHPQSRFWRQSASVRTSVLGP-QLCQSFAAVRLSRRRR 179
Query: 218 TGLGCQSLP--MAIYVVSGDEDGERGGAPVVCIDD--ASRHCSVFSKG--------QADW 265
L S P +A++ V ++D A RHC+ G W
Sbjct: 180 WELVTLSSPGKLAVFDTWSALAVHIAMDNTVVMEDISADRHCNGVPAGAEVTHRPPLPPW 239
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLT-------------------FTFPQSLGIVGGKP 306
P++LL+PL LGL +N Y+ TL+L F PQSLG++GGKP
Sbjct: 240 RPLVLLIPLRLGLTDINEAYVGTLKLASTLVGLCSAAASLPLRQHCFMMPQSLGVIGGKP 299
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
++ Y +G E IYLDPH QP + + D S + + + +DPS+A G
Sbjct: 300 NSAHYFIGYVGEELIYLDPHTTQPAVEVADRRSIPDESFHCQHPPSRMRIGELDPSIA-G 358
Query: 367 FYCR 370
F+C+
Sbjct: 359 FFCQ 362
>gi|410967384|ref|XP_003990200.1| PREDICTED: cysteine protease ATG4C [Felis catus]
Length = 459
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 153/374 (40%), Gaps = 85/374 (22%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 181
QK R Y + I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPAVSQKETIRRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQ-------- 262
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + D +++L+P+ LG E+ N Y+ ++ ++L I
Sbjct: 263 DCTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGIL---RALNI 319
Query: 302 VG----GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
V KP S Y G Q++S IY+DPH Q +++ D + T+H + +
Sbjct: 320 VWVLLVAKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFR 377
Query: 358 SIDPSLAIGFYCRD 371
+DPS IGFYCR+
Sbjct: 378 KMDPSCTIGFYCRN 391
>gi|396482697|ref|XP_003841525.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
gi|312218100|emb|CBX98046.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
Length = 462
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/265 (37%), Positives = 128/265 (48%), Gaps = 42/265 (15%)
Query: 88 AAGNNGLAEFNQDFSSRILISYRKGF-------DPIGDSKI--------------TSDVG 126
A N + F DF SRI ++YR GF DP S + TSD G
Sbjct: 87 AQYGNWPSAFLDDFESRIWMTYRSGFPVIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
+GCM+RS Q ++A AL RLGR WR P +E+ IL LF D +PFSIH ++
Sbjct: 147 FGCMIRSGQCILANALQTLRLGRDWRY-QDDPTAQEHCNILSLFADDPQAPFSIHRFVEH 205
Query: 187 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G A G G W GP A R + L + E GL +YV SGD GA V
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLVH-KYKEAGL-------RVYV-SGD------GADV 250
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+
Sbjct: 251 Y--EDKLKQVAVEEDGE--WIPTLILVGTRLGIDKITPVYWEALKASLQMKQSMGIAGGR 306
Query: 306 PGASTYIVGVQEESAIYLDPHDVQP 330
P AS Y V Q YLDPH +P
Sbjct: 307 PSASHYFVATQANHFFYLDPHSTRP 331
>gi|357528776|sp|Q5B7L0.2|ATG4_EMENI RecName: Full=Cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|259485537|tpe|CBF82642.1| TPA: Cysteine protease atg4 (EC 3.4.22.-)(Autophagy-related protein
4) [Source:UniProtKB/Swiss-Prot;Acc:Q5B7L0] [Aspergillus
nidulans FGSC A4]
Length = 402
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 164/365 (44%), Gaps = 64/365 (17%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGV-----CHKIAQDEALGDAAGNN--------GLA 95
+RI + + P S IW LG C + DE+ G G
Sbjct: 11 KRIIQYIWDPEPKNDEEPGSPIWCLGTRYPPQCVEETADESRNPDHGQQQNTNTSAPGWP 70
Query: 96 E-FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 131
E F DF S+I ++YR F PI TSD GWGCM+
Sbjct: 71 EAFLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMI 130
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 190
RS Q L+A ++ LGR WR+ + E ++L LF DS +PFSIH+ ++ G +
Sbjct: 131 RSGQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFC 187
Query: 191 GLAAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + LA R ++ + +Y+ + D + V D
Sbjct: 188 GKHPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRD 238
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
+ KG P L+L+ L LG++++ Y L+ PQS+GI GG+P AS
Sbjct: 239 E---------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSAS 287
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
Y V VQ YLDPH+ +P + + E + +TYH+ +R +++ +DPS+ IGF
Sbjct: 288 HYFVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDPSMLIGF 347
Query: 368 YCRDK 372
RD+
Sbjct: 348 LIRDE 352
>gi|339252578|ref|XP_003371512.1| cysteine protease ATG4B [Trichinella spiralis]
gi|316968242|gb|EFV52545.1| cysteine protease ATG4B [Trichinella spiralis]
Length = 414
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 173/368 (47%), Gaps = 63/368 (17%)
Query: 66 STSDIWLLGVCHKIAQDE-------ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
S S IWLLG + A++E + + L++F +DF +RI +YR GF I
Sbjct: 45 SHSPIWLLG--KQYAKNEPRPNLRRGFDENSAVGKLSDFLEDFRTRIWFTYRHGFPCIPG 102
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDREYVEILHLFGDSET 175
+K +D GWGC +RS QML+A+ +L H LGR W + L + + +++ LF D+ T
Sbjct: 103 TKFDNDCGWGCTIRSGQMLLAETMLRHYLGRDWLLGQSGLPEDEALMHRKVIGLFCDNLT 162
Query: 176 SPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
SPFS+HNL+Q G+ +G AGSW GP ++ + + +A E GL +A++V+
Sbjct: 163 SPFSLHNLVQVGQQLFGKQAGSWYGPVSVLQILQ-VAMNNAIERGL---VEGLAVHVIGD 218
Query: 235 DE----DGERGG-----APV----------------VCIDDASRHCSV------------ 257
E D ER G APV D R SV
Sbjct: 219 GELIIDDVERLGCGLTLAPVPRRGPENDLADRQPKSSSYLDLRRLTSVSNGDLLPSHDGE 278
Query: 258 ------FSKGQADWTP-ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
F W+ +L+L+PL LG+EK N Y L+ + +G++GG+
Sbjct: 279 SIGSTEFVDETRSWSRGVLVLLPLRLGVEKFNQLYSDHLKRVLSTKFCVGVIGGRHHKCY 338
Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
Y G + I LDPH QP ++ + + ++H + + IDP +IGFY R
Sbjct: 339 YFCGWHTDYLIRLDPHYSQPAVDATQPGVS--LHSFHCKYPKKTLIADIDPWCSIGFYIR 396
Query: 371 DKGLLVTF 378
++ L +F
Sbjct: 397 NRLELQSF 404
>gi|158296556|ref|XP_316946.4| AGAP008497-PA [Anopheles gambiae str. PEST]
gi|157014766|gb|EAA12240.4| AGAP008497-PA [Anopheles gambiae str. PEST]
Length = 389
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 156/311 (50%), Gaps = 34/311 (10%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + +W+LG + + D L QD SR+ +YR+GF PIG++++T
Sbjct: 21 IPKTNDTVWILGKQYNASDD-----------LEAIRQDVQSRLWCTYRRGFVPIGNTQLT 69
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D GWGCMLR QM++AQALL LGR W + D Y+ I++ F DS+ +PFS+H
Sbjct: 70 TDKGWGCMLRCGQMVLAQALLQLHLGRDWVWEAETR-DDIYLNIVNRFEDSKQAPFSLHQ 128
Query: 183 L-LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
+ L + G W GP + + + L + C+ + I+V +
Sbjct: 129 IALMGDSSEEKRIGEWFGPNTVAQVLKKLVKFDD-----WCR---LVIHVALDN------ 174
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D+ C V K W P+LL++PL LGL +VNP YI L+ F P S G+
Sbjct: 175 ---TVATDEIVELC-VDKKEPEAWKPLLLIIPLRLGLSEVNPIYIEGLKKCFQLPGSCGM 230
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDS 358
+GG+P + Y +G A+YLDPH VQ V +G A+ T+H I S
Sbjct: 231 IGGRPNQALYFIGYVGGEALYLDPHTVQRVGTVGSKQDPAEQELDETFHQRYASRISFTS 290
Query: 359 IDPSLAIGFYC 369
+DPSLA+ F C
Sbjct: 291 MDPSLAVCFLC 301
>gi|322795203|gb|EFZ18025.1| hypothetical protein SINV_08608 [Solenopsis invicta]
Length = 403
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 103/342 (30%), Positives = 157/342 (45%), Gaps = 62/342 (18%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 120
I + +W+LG + ++ L +D S++ +YRKGF PIG +S
Sbjct: 16 IPQTDEPVWILGRKYNAIKE-----------LDAIRRDIRSKLWFTYRKGFIPIGGCNST 64
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
TSD GWGCMLR QM++AQAL+ LG+ W+ + + + Y++IL F D + FSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLAQALITLHLGKDWQW-MPETKNNTYLKILSRFEDKRAAAFSI 123
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 231
H + G + G G W GP + + W +L + L + +
Sbjct: 124 HQIALTGASEGKEVGQWFGPNTIAQVLKKLIVYDEWSSLTIHVALDNTLIVNDILKQCRI 183
Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
G+ G P+ K + W P+LLL+PL LGL ++NP YI L++
Sbjct: 184 EGGETAEADGEVPL--------------KAPSQWKPLLLLIPLRLGLSEINPVYINGLKV 229
Query: 292 --------------------TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 331
+F QSLG++GGKP + Y +G + IYLDPH Q
Sbjct: 230 KFKILCMQKKKYICIQFFQTSFKISQSLGVIGGKPNLALYFIGCVGDEVIYLDPHTTQRS 289
Query: 332 ----INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
I ++++E D TYH I + +DPS+A+ F+C
Sbjct: 290 GSVEDKISEEEIEMDI-TYHCKSASRIPITGMDPSVALCFFC 330
>gi|393219109|gb|EJD04597.1| hypothetical protein FOMMEDRAFT_133827 [Fomitiporia mediterranea
MF3/22]
Length = 1147
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 135/298 (45%), Gaps = 62/298 (20%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 121
G N F DFSSR+ ++YR + PI D +
Sbjct: 335 GANWPPGFYSDFSSRVWLTYRSHYPPIRDQTLAQLEAEASGQIPLQPVSASPRKWHILGS 394
Query: 122 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDS 173
TSD GWGCMLR+ Q L+A AL+ LGR WR+P Q + + YV+IL F DS
Sbjct: 395 GEKGWTSDSGWGCMLRTGQSLLANALIHLHLGRDWRRPPQPVYTVDYATYVKILTWFFDS 454
Query: 174 ET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV 231
PFS+H + AGK G G W GP + + + AE GLG S+ V
Sbjct: 455 TDIHCPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTVVHA-FAEAGLGV-SVATDGVV 512
Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---------------W--TPILLLVPL 274
D P + RH + + + W P+L+LV +
Sbjct: 513 YETDVLAASNAGPYMY-----RHSRMATSSPSTRRRRSAQQQQSMMSIWGQRPVLVLVGI 567
Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
LG++ VNP Y ++ FTFPQS+GI GG+P +S Y VGVQ ++ YLDPH +P +
Sbjct: 568 RLGIDCVNPVYYDAVKALFTFPQSVGIAGGRPSSSYYFVGVQTDNLFYLDPHHSRPSV 625
Score = 39.3 bits (90), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
T+H D +R + L S+DPS+ IGF CRD+
Sbjct: 728 TFHCDRVRKMPLSSLDPSMLIGFLCRDE 755
>gi|355755452|gb|EHH59199.1| Cysteine protease ATG4D, partial [Macaca fascicularis]
Length = 427
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 152/354 (42%), Gaps = 63/354 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 37 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 86
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 87 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 146
Query: 152 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 147 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 202
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 203 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 249
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G
Sbjct: 250 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGXXXXXXXXXX 309
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 310 XXXCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 361
>gi|448112117|ref|XP_004202013.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
gi|359465002|emb|CCE88707.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
Length = 480
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 152/331 (45%), Gaps = 60/331 (18%)
Query: 85 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 116
LG G+ E ++D SRI +YR GF+PI
Sbjct: 69 LGRRYGSGSKEEMDKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128
Query: 117 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
+ T+DVGWGCM+R+SQML+A A+ LGR + ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAIQLLLLGRGFT--YADSSEKKHSDIIDMF 186
Query: 171 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
D +PFS+HN ++A L G W GP A S + L + Q E+ S P
Sbjct: 187 TDDPKAPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQFDES-----SSPRF 241
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
++S D DD + + + + IL+L+P+ LGL KV+P Y +
Sbjct: 242 RVIISESCD---------IYDD--KIGKLLQENEDAEGAILILLPVRLGLNKVSPYYHNS 290
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L F+ PQ +GI GGKP +S Y G + +YLDPH Q V + T+H+
Sbjct: 291 LSSLFSSPQLVGIAGGKPSSSYYFFGSHNGNLLYLDPHYPQSV------KASSIYDTFHT 344
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
++ + ++ +DPS+ IG + K +F+
Sbjct: 345 HNVQSLKIEDMDPSMLIGILIKSKEDYESFK 375
>gi|392572178|gb|EIW65350.1| hypothetical protein TRAVEDRAFT_33890 [Trametes versicolor
FP-101664 SS1]
Length = 997
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 94/282 (33%), Positives = 133/282 (47%), Gaps = 58/282 (20%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 123
F DF+SRI ++YR F PI D+ + T+
Sbjct: 298 FYADFTSRIWLTYRSQFFPIRDTTLAALDAELMDNPTGVPSSPPTKKWNWPLGGEKGWTT 357
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 178
D GWGCMLR+ Q L+A AL+ LGR WR+P + +Y V+I+ F D+ + PF
Sbjct: 358 DAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQIVTWFLDNPSPLCPF 417
Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
S+H + GK G G W GP + + L + P A V+ DG
Sbjct: 418 SVHRMALVGKDLGKDVGQWFGPSTAAGAIKTL-----------VHAFPEATLGVANAVDG 466
Query: 239 ERGGAPVVCIDDASRHC--SVFSKGQA--DW--TPILLLVPLVLGLEKVNPRYIPTLRLT 292
+ V ASR S G A DW +L+L+ + LG+E VNP Y T++
Sbjct: 467 TLYESDVYA---ASRSVMYSTRRHGHARMDWGDRAVLVLIGIRLGIEGVNPLYYNTIKTL 523
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
+TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + +
Sbjct: 524 YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPL 565
>gi|348511374|ref|XP_003443219.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
Length = 459
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 140/312 (44%), Gaps = 58/312 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-------------- 142
F + F+S + +YR+GF P+ S +T+D GWGC+LRSSQML+AQ L
Sbjct: 98 FRRCFASLLWFTYRRGFRPLPGSSLTTDSGWGCVLRSSQMLLAQGLLLHLMSPGWTWSGN 157
Query: 143 ---------LFHRLGR---------------PWRKPLQKPFDREYVEILHLFGDSETSPF 178
L H + W L +P + IL F D+ T+PF
Sbjct: 158 QRVVKDDMDLIHSVNDGFSSSERESKRSRHLSWGSILDRPTEGTPRRILRWFADNPTAPF 217
Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
IH L++ GK+ G AG W GP A R LP + V+ D
Sbjct: 218 GIHRLVELGKSSGKKAGDWYGP-------SIAAHILRKAVEASVVDLPNLVAYVAQD--- 267
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+ + D + C W +L+LVP+ LG + +NP YI +++
Sbjct: 268 -----CTIYLQDVRKLCE--RPLPQHWKSVLILVPVRLGGQDLNPSYITSVKKLLMLECC 320
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
+GI+GGKP S + VG Q++ +YLDPH QP +++ K+ ++H R +
Sbjct: 321 IGIIGGKPKHSLFFVGFQDDHLLYLDPHYCQPTVDVTKN---FPLESFHCKNPRKMPFSR 377
Query: 359 IDPSLAIGFYCR 370
+DPS IGFY +
Sbjct: 378 MDPSCTIGFYAK 389
>gi|443893810|dbj|GAC71266.1| cysteine protease [Pseudozyma antarctica T-34]
Length = 1509
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 100/327 (30%), Positives = 147/327 (44%), Gaps = 78/327 (23%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----QKPFDRE-------------- 162
+T+D GWGCMLR+ Q L+A AL+ LGR W++ Q F E
Sbjct: 776 LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRETAPKSQIEFFEELANASLDASAENQS 835
Query: 163 -------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 207
Y+ IL F D S PF +H + + GK G G W GP +
Sbjct: 836 LASWRERRARHATYIRILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAGAI 895
Query: 208 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----A 263
+ L E G+ + ++ + D R A SR + S + A
Sbjct: 896 KQLV-FDFPEAGIAVELAHDGVFYL----DEVRAAASAST--GKSRASGMLSGNRRAETA 948
Query: 264 DWT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
W P+L+L+ + LGLE VNP Y +++ TF+FPQS+GI GG+P +S Y +G Q S Y
Sbjct: 949 VWRRPVLILIGIRLGLETVNPIYYESVKATFSFPQSVGIAGGRPSSSYYFMGHQGNSLFY 1008
Query: 323 LDPHDVQPVINI------------------------GKDD---------LEADTSTYHSD 349
LDPH+V+P + + +DD EA TST+H +
Sbjct: 1009 LDPHNVRPAVPLRYPPTTFPAAAPSRFDVSHRYALEDRDDEDEWWSHAYTEAQTSTFHCE 1068
Query: 350 VIRHIHLDSIDPSLAIGFYCRDKGLLV 376
+R + + S+DPS+ +GF +D+ LV
Sbjct: 1069 KVRRMPIKSLDPSMLLGFLVKDEEALV 1095
>gi|426191859|gb|EKV41798.1| hypothetical protein AGABI2DRAFT_123279 [Agaricus bisporus var.
bisporus H97]
Length = 1261
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 98/292 (33%), Positives = 132/292 (45%), Gaps = 65/292 (22%)
Query: 97 FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 122
F DF SRI ++YR F PI DS +T
Sbjct: 247 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 306
Query: 123 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGD-- 172
SD GWGCMLR+ Q L+A AL+ LGR WRKP + +Y V+IL F D
Sbjct: 307 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 366
Query: 173 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
S +PFS+H + AGK +G G W GP + + L P + V
Sbjct: 367 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 415
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQA--------DW--TPILLLVPLVLGLEKVN 282
S +DG V A + + ++ W P+L+LV L LG++ VN
Sbjct: 416 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 475
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
P Y T++ FT PQS+GI GG+PG+S Y VG Q ++ YLDPH +P I +
Sbjct: 476 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 527
>gi|409077121|gb|EKM77488.1| hypothetical protein AGABI1DRAFT_108018 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1355
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 98/292 (33%), Positives = 132/292 (45%), Gaps = 65/292 (22%)
Query: 97 FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 122
F DF SRI ++YR F PI DS +T
Sbjct: 334 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 393
Query: 123 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGD-- 172
SD GWGCMLR+ Q L+A AL+ LGR WRKP + +Y V+IL F D
Sbjct: 394 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 453
Query: 173 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
S +PFS+H + AGK +G G W GP + + L P + V
Sbjct: 454 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 502
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQA--------DW--TPILLLVPLVLGLEKVN 282
S +DG V A + + ++ W P+L+LV L LG++ VN
Sbjct: 503 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 562
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
P Y T++ FT PQS+GI GG+PG+S Y VG Q ++ YLDPH +P I +
Sbjct: 563 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 614
>gi|170036509|ref|XP_001846106.1| Autophagy-specific protein [Culex quinquefasciatus]
gi|167879174|gb|EDS42557.1| Autophagy-specific protein [Culex quinquefasciatus]
Length = 379
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 100/298 (33%), Positives = 152/298 (51%), Gaps = 24/298 (8%)
Query: 77 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 136
H+I L +A L + +D SR+ +YR+GF PIG S+ TSD GWGCMLR QM
Sbjct: 13 HRIRCIFGLSNALETLDLDQIRRDVQSRLWCTYRRGFVPIGGSQHTSDKGWGCMLRCGQM 72
Query: 137 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA-AG 195
++AQALL LGR W + D Y+ I++ F D++ +PFS+H + G++ G
Sbjct: 73 VLAQALLQLHLGRDWEWTAETR-DETYLRIVNRFEDNKAAPFSLHQIALTGESSEEKRVG 131
Query: 196 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 255
W GP + + + L + + ++V + D+ C
Sbjct: 132 EWFGPNTVAQVLKKLVKFD--------DWCSVVVHVALD---------STLATDEVVELC 174
Query: 256 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
S W P+LL++PL LGL ++NP Y+ L+ F + G++GG+P + Y +G
Sbjct: 175 EDKSDAGTSWKPLLLIIPLRLGLSEINPIYVAGLKKCFELAGNCGMIGGRPNQALYFIGY 234
Query: 316 QEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+ A++LDPH VQ NIG D+ E D S +H R I+ ++DPSLA+ F C
Sbjct: 235 VGDEALFLDPHTVQRSGNIGDKTGLDEREMDES-FHQRYARRINFKAMDPSLALCFLC 291
>gi|302684483|ref|XP_003031922.1| hypothetical protein SCHCODRAFT_109321 [Schizophyllum commune H4-8]
gi|300105615|gb|EFI97019.1| hypothetical protein SCHCODRAFT_109321, partial [Schizophyllum
commune H4-8]
Length = 602
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 144/310 (46%), Gaps = 82/310 (26%)
Query: 69 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------- 121
+IWL+GVCH G +F DF++RI ++YR GF+ I D ++
Sbjct: 114 EIWLMGVCHA-------------PGAPDFYADFATRIWLTYRSGFELIRDRQLIDLPPPV 160
Query: 122 ------------------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 157
+SD GWGCMLR+ Q L+A ALL GR WR+ +
Sbjct: 161 ASLDGHLQGEWATDEAEPPGAYGFSSDSGWGCMLRTGQSLLANALLTAWFGRDWRRISEV 220
Query: 158 PFDRE--YVEILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
+ YV +L LF D+ T+PFSIH + AGK G G W GP + + L
Sbjct: 221 ETHQHSLYVHLLSLFLDTPHPTAPFSIHRMALAGKQLGKDIGQWFGPSTAAGAIKNL--- 277
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT------- 266
+ P+A G VV +D A VF+ ++W+
Sbjct: 278 --------VSAYPLA------------GIGVVVGMDGALSKSEVFTASHSEWSDEEAALD 317
Query: 267 ----PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
P+L+L+ L LGL++VNP Y T++ FTFPQS+GI GG+P +S + VG Q IY
Sbjct: 318 WGDRPVLILLNLRLGLDRVNPIYHDTIKALFTFPQSVGIAGGRPCSSYHFVGAQGSDLIY 377
Query: 323 LDPHDVQPVI 332
LDPH + +
Sbjct: 378 LDPHHTRNTV 387
>gi|302674653|ref|XP_003027011.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
gi|300100696|gb|EFI92108.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
Length = 858
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 94/275 (34%), Positives = 132/275 (48%), Gaps = 58/275 (21%)
Query: 88 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------------------------TS 123
AA + EF DF+SR+ ++YR GF PI D + TS
Sbjct: 148 AAASGWPQEFFSDFASRLWLTYRSGFAPIRDMALEELEPVRGGALSTLTSALTGRRGLTS 207
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIH 181
D GWGCMLR+ Q L+A AL+ +GR Y+ ++ LF DS + +PFS+H
Sbjct: 208 DAGWGCMLRTGQSLLANALVVAWMGRGALA--------LYIHLISLFLDSPSPSAPFSVH 259
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
+ AG+A G G W GP + +AL + GLG V+ EDG
Sbjct: 260 RMALAGRALGKDVGQWFGPSTAAGAIKALVNAY-PDAGLG----------VAIAEDG--- 305
Query: 242 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
V R + + +W P+L+L+ + LGL+ VNP Y T++ +TFPQSL
Sbjct: 306 ----VVYQTQRRQ----KEREREWGDQPVLVLLGIRLGLDGVNPIYYDTIKQLYTFPQSL 357
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
GI GG+P +S Y VG Q YLDPH +P + +
Sbjct: 358 GIAGGRPSSSYYFVGAQAGDLFYLDPHHARPTVPL 392
Score = 38.5 bits (88), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 13/33 (39%), Positives = 23/33 (69%)
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
A+T T+H + +R + + +DPS+ IGF C+D+
Sbjct: 537 AETRTFHCERVRKMPMSGLDPSMLIGFLCKDRA 569
>gi|444518589|gb|ELV12252.1| Cysteine protease ATG4B, partial [Tupaia chinensis]
Length = 324
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 87/300 (29%), Positives = 130/300 (43%), Gaps = 56/300 (18%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
++ +W+LG + + ++ E D +SR+ +YRK F IG + TSD
Sbjct: 26 TSEPVWILGRKYSVLTEKE-----------EILSDVASRLWFTYRKNFPAIGGTGPTSDT 74
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM+ AQAL+ LGR WR Y +L+ F D + S +SIH + Q
Sbjct: 75 GWGCMLRCGQMIFAQALVCRHLGRDWRWAQWTQQPDSYFNVLNAFIDRKDSYYSIHQIAQ 134
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G + G W GP + + + LA
Sbjct: 135 MGVGEGKSIGQWYGPNTVAQVLKKLA---------------------------------- 160
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
VF + I + +V G +N Y+ TL+ F PQSLG++GGK
Sbjct: 161 -----------VFDTWSSLAVHIAMDNTVVTGEININEAYVETLKHCFMMPQSLGVIGGK 209
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P ++ Y +G + IYLDPH QP + + L D S + + + +DPS+A+
Sbjct: 210 PNSAHYFIGYVGDELIYLDPHTTQPAVELTDSCLVPDESFHCQHPPSRMSIRELDPSIAV 269
>gi|67526025|ref|XP_661074.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
gi|40743824|gb|EAA63010.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
Length = 379
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/303 (32%), Positives = 145/303 (47%), Gaps = 50/303 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF S+I ++YR F PI TSD GWGCM+RS
Sbjct: 50 FLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMIRS 109
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A ++ LGR WR+ + E ++L LF DS +PFSIH+ ++ G + G
Sbjct: 110 GQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFCGK 166
Query: 193 AAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R + LA R ++ + +Y+ + D + V D+
Sbjct: 167 HPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRDE- 216
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
KG P L+L+ L LG++++ Y L+ PQS+GI GG+P AS Y
Sbjct: 217 --------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSASHY 266
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
V VQ YLDPH+ +P + + E + +TYH+ +R +++ +DPS+ IGF
Sbjct: 267 FVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDPSMLIGFLI 326
Query: 370 RDK 372
RD+
Sbjct: 327 RDE 329
>gi|449551395|gb|EMD42359.1| ATG4-like protein [Ceriporiopsis subvermispora B]
Length = 988
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/272 (34%), Positives = 125/272 (45%), Gaps = 57/272 (20%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 124
F DF+SRI ++YR F PI D+ + TSD
Sbjct: 305 FYSDFTSRIWVTYRSQFQPIRDTTLSALELELGESTAVATSPQPKKWNWPLGGEKGWTSD 364
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD------REYVEILHLFGDSETS-- 176
GWGCMLR+ Q L+A LL LGR WR+P P+ YV+IL F D+ +
Sbjct: 365 AGWGCMLRTGQSLLANTLLHLHLGRDWRRP---PYPICTADYATYVQILTWFFDNPSPLC 421
Query: 177 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
PFS+H + GK G G W GP + + L E GLG ++ S
Sbjct: 422 PFSVHRMALVGKELGKEVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVATDSVIYQSD-- 478
Query: 237 DGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
V S S G++ W +L+LV + LGL+ VNP Y T++ +T
Sbjct: 479 ---------VYTASRSNLGSPRRNGRSGWGDRAVLVLVGIRLGLDGVNPIYYDTIKALYT 529
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
FPQS+GI GG+P +S Y VG Q ++ YLDPH
Sbjct: 530 FPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 561
>gi|432871194|ref|XP_004071879.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
Length = 452
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/339 (26%), Positives = 153/339 (45%), Gaps = 64/339 (18%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
S +S + LLG +++ +DEA + F + F+S + ++YR+GF + S +T+D
Sbjct: 70 SKSSPLILLGKSYEL-KDEANKE--------RFRRSFASLLWLTYRRGFPQLAGSSLTTD 120
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----------------------------- 155
GWGC+LR+ QML+A+ LL H + W +
Sbjct: 121 SGWGCVLRTGQMLLARGLLTHLMPPGWMWSVWYRAVKDDLDLPHHADCTDCKSNMRCRYQ 180
Query: 156 ------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
+P + + +++ F D +PF IH L++ G + G AG W GP +
Sbjct: 181 SLGSLYDRPLEAMHRKVVSWFADHPKAPFGIHRLVELGASSGKKAGDWYGPSIVA---HI 237
Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 269
L + A LP + V+ D + + D C W ++
Sbjct: 238 LQKAVAASV-----DLPNLVVYVAQD--------CTIYLQDVRGLCE--RPPPHSWKSVI 282
Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
+LVP+ LG + +NP YI ++ +GI+GG+P S + VG Q++ +YLDPH Q
Sbjct: 283 ILVPVRLGGQDLNPSYISCVKKLLELQCCIGIIGGRPKHSLFFVGFQDDQLLYLDPHYCQ 342
Query: 330 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+N+ K++ + ++H R + +DPS IGFY
Sbjct: 343 LTVNVTKENFPLE--SFHCKYPRKMPFSRMDPSCTIGFY 379
>gi|341903727|gb|EGT59662.1| CBN-ATG-4.1 protein [Caenorhabditis brenneri]
Length = 433
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 136/298 (45%), Gaps = 59/298 (19%)
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 181
TSD GWGCMLR +QML+ + LL +GR + ++ Y +IL +F D + + +SIH
Sbjct: 49 TSDQGWGCMLRCAQMLLGEVLLRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIH 107
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQ-SLPMAIYV 231
+ Q G G W GP + W +A + L + +L MA
Sbjct: 108 QIAQMGVTEGKEISKWFGPNTAAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTY 167
Query: 232 VSGD------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
S D E+G+ +H + + + +W P+LL++PL LGL +N Y
Sbjct: 168 PSEDAVKLIMENGQ-----------VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCY 216
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV-------------- 331
+P ++ F PQ +GI+GGKP + Y VG+ YLDPH +P
Sbjct: 217 LPAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTES 276
Query: 332 ----INIGK-DDLE------------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
N + +DLE D STYH +++ + +SIDPSLA+ +C +
Sbjct: 277 EQHDTNFSELEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESR 334
>gi|71022117|ref|XP_761289.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
gi|46097783|gb|EAK83016.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
Length = 1541
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 101/334 (30%), Positives = 151/334 (45%), Gaps = 81/334 (24%)
Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK--PLQKPFD--------- 160
GF G +T+D GWGCMLR+ Q L+A ALL LGR W + P + D
Sbjct: 814 GFSRAG---LTTDSGWGCMLRTGQSLLANALLNVHLGRSWLREAPPMRQMDFLEQLASLS 870
Query: 161 -------------RE-------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWV 198
RE Y++IL F D S PF +H + + GK G G W
Sbjct: 871 LDSSVEMQSLQEWREKRARHAAYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWF 930
Query: 199 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
GP + + L + + G+ + ++ + DE GA R
Sbjct: 931 GPSTAAGAIKQLV-TEFPDAGIAVELAHDGVFYL--DEVRLAAGARSALQSGKGR----- 982
Query: 259 SKGQADWT---PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
+G A T P+++L+ + LGL+ VNP Y +++ TF+FP S+GI GG+P +S Y +G
Sbjct: 983 -QGDAAVTWRRPVVILIGIRLGLDSVNPIYYESVKETFSFPHSVGIAGGRPSSSYYFMGH 1041
Query: 316 QEESAIYLDPHDVQPVINI------------------------GKDD---------LEAD 342
Q S YLDPH+V+P + + KDD EA
Sbjct: 1042 QGNSLFYLDPHNVRPAVALRYPPSTFPTAVPHQLDVAHRFALEDKDDELEWWSHAYTEAQ 1101
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLV 376
TST+H + +R + + S+DPS+ +GF +D+ L+
Sbjct: 1102 TSTFHCEKVRRMPIKSLDPSMLLGFLVKDEEDLM 1135
>gi|343428793|emb|CBQ72338.1| related to ATG4-essential for autophagy [Sporisorium reilianum SRZ2]
Length = 1505
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 97/328 (29%), Positives = 147/328 (44%), Gaps = 77/328 (23%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE------------------ 162
+T+D GWGCMLR+ Q L+A AL+ LGR W + + P R+
Sbjct: 785 LTTDSGWGCMLRTGQSLLANALINVHLGRSWMR--EAPPARQLEFLQELANLSLDTSAEK 842
Query: 163 ---------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
Y++IL F D S PF +H + + GK G G W GP
Sbjct: 843 QSLLEWRQKRARHSTYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAG 902
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDE-DGERGGAPVVCIDDASRHCSVFSKGQAD 264
+ + L + + GL + ++ + DE G + + AS + KG
Sbjct: 903 AIKQLV-SEFPDAGLAVELAHDGVFYL--DEVRAAAGASRQLGKGRASATGTNGRKGDTA 959
Query: 265 WT---PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
T P+L+L+ + LGL+ VNP Y +++ TF+FP S+GI GG+P +S Y +G Q S
Sbjct: 960 LTWHKPVLILIGIRLGLDSVNPIYYESVKATFSFPHSVGIAGGRPSSSYYFMGHQGNSLF 1019
Query: 322 YLDPHDVQPVINI------------------------GKDD---------LEADTSTYHS 348
YLDPH+V+P + + DD EA TST+H
Sbjct: 1020 YLDPHNVRPAVALRFPPSTFPAAVPRQLDIAHRFAFEEHDDEDEWWSHAYTEAQTSTFHC 1079
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKGLLV 376
D +R + + S+DPS+ +GF +D+ L
Sbjct: 1080 DKVRRMPIKSLDPSMLLGFLVKDEEDLA 1107
>gi|336381646|gb|EGO22797.1| cysteine protease required for autophagy [Serpula lacrymans var.
lacrymans S7.9]
Length = 992
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/284 (34%), Positives = 134/284 (47%), Gaps = 49/284 (17%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 121
G+N F DF+SRI ++YR F PI DS +
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350
Query: 122 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGD 172
TSD GWGCMLR+ Q L+A ALL LGR WR+P +Y V+I+ F D
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPVHTTDYATYVQIITWFFD 410
Query: 173 --SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 230
S SPFS+H + AGK G G W GP + + L E GLG +
Sbjct: 411 TPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGVI 469
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
S + A I RH V G+A +++L+ + LGL+ VNP Y T++
Sbjct: 470 FQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTIK 520
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
+TFPQS+GI GG+P +S Y +G Q ++ YLDPH +P + +
Sbjct: 521 ALYTFPQSVGIAGGRPSSSYYFMGSQADNLFYLDPHHARPAVPL 564
>gi|260949671|ref|XP_002619132.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
gi|238846704|gb|EEQ36168.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
Length = 340
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/301 (31%), Positives = 139/301 (46%), Gaps = 61/301 (20%)
Query: 94 LAEFNQDFSSRILISYRKGFDPI------------------------------GDSKITS 123
L E +SR+ +YR GF+PI + ++
Sbjct: 52 LEEIYPVINSRLWFTYRAGFEPIQKAEDGPSPLAFLKSMIFNVRPSMALGGLFDNQNYST 111
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHN 182
DVGWGCM+R+SQ L+A AL LGR + P E VE I+ LFGD T PFS+HN
Sbjct: 112 DVGWGCMIRTSQSLLANALQMLILGRDHQSPQAIQSAPEKVEKIIQLFGDDYTCPFSLHN 171
Query: 183 LLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
++ A L G W GP A S + L C + E+ ++ ++I D E
Sbjct: 172 FIKVASASPLKVKPGEWFGPSAASLSIKRL--CAKFESN-EIPNINVSICESCNLYDEEI 228
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
G +F + + +P+L+L PL LG++K+N Y P+L QS+G
Sbjct: 229 RG--------------IFEESE---SPLLILFPLRLGIDKINSIYYPSLLQLLALKQSVG 271
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I GGKP +S Y G Q + +YLDPH++Q +D TYH+ + + + ++D
Sbjct: 272 IAGGKPSSSYYFFGFQGSNLLYLDPHNLQAA--------SSDPGTYHTSKFQTLSISNLD 323
Query: 361 P 361
P
Sbjct: 324 P 324
>gi|297265289|ref|XP_002799164.1| PREDICTED: cysteine protease ATG4B-like [Macaca mulatta]
Length = 358
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/301 (31%), Positives = 144/301 (47%), Gaps = 44/301 (14%)
Query: 94 LAEFNQDF---SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
AE+ +DF S + I RK G + TSD GWGCMLR QM+ AQAL+ LGR
Sbjct: 13 FAEY-EDFPETSEPVWILGRKYSIFTGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRD 71
Query: 151 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
WR +K Y +L+ F D + S +SIH + Q G G + G W GP + + + L
Sbjct: 72 WRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKL 131
Query: 211 ARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFS 259
A + +A+++ V +E V C D+ RHC+ F
Sbjct: 132 AVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFP 183
Query: 260 KGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +
Sbjct: 184 AGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFI 243
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP--SLAIGFYCRD 371
G ES+ + P + P+ + + H + ++P S A+GF+C+
Sbjct: 244 GYVGESSSHRVPVGLCPLRAF-------------CEQVPHARCNIVEPEGSRALGFFCKT 290
Query: 372 K 372
+
Sbjct: 291 E 291
>gi|14041938|dbj|BAB55042.1| unnamed protein product [Homo sapiens]
Length = 280
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 84/254 (33%), Positives = 123/254 (48%), Gaps = 25/254 (9%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEGLIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIG 366
+ + +DPS+A+G
Sbjct: 233 RMSIAELDPSIAVG 246
>gi|409050837|gb|EKM60313.1| hypothetical protein PHACADRAFT_179659 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1009
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 96/288 (33%), Positives = 131/288 (45%), Gaps = 57/288 (19%)
Query: 97 FNQDFSSRILISYRKGFDPI-------------------------------GDSKITSDV 125
F DF+SRI ++YR F PI GD +SD
Sbjct: 308 FYADFTSRIWLTYRSQFLPIRDMSLEELNAAPESAALSTGSQAKKWSWSLSGDKCWSSDA 367
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFSI 180
GWGCMLR+ Q L+A AL+ LGR WRKP +Y ++I+ F D + PFS+
Sbjct: 368 GWGCMLRTGQSLLANALIHVHLGRDWRKPPHPVPTSDYATYIQIITWFFDDPSLLCPFSV 427
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMC------RSWEALARCQRAETGLGCQSLPMA---IYV 231
H + GK G+ G W GP + ++ Q A L + P A IYV
Sbjct: 428 HRMALVGKQLGVKVGQWFGPSTAAGAIKYVSAHSSMVPNQPARRTL-VHAFPEAGLGIYV 486
Query: 232 VSGD---EDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYI 286
+ D E A I RH W P+L+L+ LG++ VNP Y
Sbjct: 487 AADGGTIYDSEVFAASHSGIGSPRRHTRRV------WGDRPVLILIGHRLGIDGVNPIYY 540
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
TL+ +T+PQS+GI GG+P +S Y VG Q ++ YLDPH +P I +
Sbjct: 541 DTLKTLYTWPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPTIPL 588
Score = 38.1 bits (87), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 14/28 (50%), Positives = 21/28 (75%)
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
T+H D +R + L S+DPS+ IGF C+D+
Sbjct: 728 TFHCDRVRKMPLSSLDPSMLIGFLCKDE 755
>gi|388856806|emb|CCF49593.1| related to ATG4-essential for autophagy [Ustilago hordei]
Length = 1572
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/353 (30%), Positives = 152/353 (43%), Gaps = 109/353 (30%)
Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK---PL-QKPFDRE----- 162
GF G +T+D GWGCMLR+ Q L+A AL+ LGR W++ PL Q+ F E
Sbjct: 824 GFSRAG---LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRDAPPLRQQQFLEELAGLS 880
Query: 163 ----------------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWV 198
Y++IL F D S PF +H + + GK G G W
Sbjct: 881 IADAAEKESLQEWRQKRARHATYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWF 940
Query: 199 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD-------A 251
GP + + L P A V DG V +D+ +
Sbjct: 941 GPSTASGAIKQL-----------VSEFPQAGIAVELARDG------VFYLDEVRAAASAS 983
Query: 252 SRHCSVFSKGQAD---------------WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+ SV S G+A W P+L+L+ + LGLE VNP Y +++ TF+F
Sbjct: 984 ASAASVQSGGKARSSGAASGSRKGEGLIWRRPVLILIGIRLGLESVNPIYYESVKATFSF 1043
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI--------------------- 334
P S+GI GG+P +S Y +G Q S YLDPH+V+P + +
Sbjct: 1044 PHSVGIAGGRPSSSYYFMGHQGNSLFYLDPHNVRPAVPLRYPPSTFPDAVPRHLGIAHRF 1103
Query: 335 ---GKDD---------LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLL 375
KDD E TST+H + +R + + S+DPS+ +GF +D+ L
Sbjct: 1104 VLEDKDDEDEWWSHAYSEVQTSTFHCEKVRRMPIKSLDPSMLLGFLVKDEESL 1156
>gi|19115683|ref|NP_594771.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe
972h-]
gi|62899818|sp|Q9P373.1|ATG4_SCHPO RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|9588465|emb|CAC00556.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe]
Length = 320
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 98/333 (29%), Positives = 140/333 (42%), Gaps = 53/333 (15%)
Query: 48 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 107
M R ER L + T + IW LG +KI + +F D S I I
Sbjct: 4 MARFLERYLHFAPTNTEPPGTLIWFLGHSYKIEDSQ---------WPEKFLYDSFSLITI 54
Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
+YR G + G +TSD GWGCM+RS+Q L+A L R+ P +++ EIL
Sbjct: 55 TYRSGIE--GLENMTSDTGWGCMIRSTQTLLANCL---RICYP---------EKQLKEIL 100
Query: 168 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
LF D ++PFSIH + GK + G W GP C +AR +P
Sbjct: 101 ALFADEPSAPFSIHQFVTMGKTLCDINPGQWFGPTTSC---SCVARLSDQNP-----DVP 152
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 286
+ +YV R V P+LLL+P LG++ +N Y
Sbjct: 153 LHVYVARNGNAIYRDQLSKVSF------------------PVLLLIPTRLGIDSINESYY 194
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
L F +GI GG+P ++ Y Q + YLDPH + A T+
Sbjct: 195 DQLLQVFEIRSFVGITGGRPRSAHYFYARQNQYFFYLDPHCTHFAHTTTQ---PASEETF 251
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
HS +R + + +DP + GF RD+ +FE
Sbjct: 252 HSATLRRVAIQDLDPCMIFGFLIRDEEEWHSFE 284
>gi|170109871|ref|XP_001886142.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
gi|164639072|gb|EDR03346.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
Length = 1039
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/278 (34%), Positives = 137/278 (49%), Gaps = 51/278 (18%)
Query: 97 FNQDFSSRILISYRKGFD-PIGDSKI-------------------------------TSD 124
F DF+SRI ++YR F PI D+++ +SD
Sbjct: 336 FYIDFTSRIWLTYRSHFPTPIKDTRLADLCGDAAPEIANSPTTVKTRPWNWGGEKTWSSD 395
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDREYVEILHLFGDSET--SPFS 179
GWGCMLR+ Q L+A AL+ LGR WR+P +Q YV+I+ F D+ +PFS
Sbjct: 396 TGWGCMLRTGQSLLANALVHMHLGRDWRRPPYPVQTADYATYVQIVTWFLDTPAPEAPFS 455
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
+H + AGK +G G W GP + + L E+GLG VS DG
Sbjct: 456 VHRMALAGKEFGTDVGQWFGPSVAAGAIKTLVNS-FPESGLG----------VSVATDGT 504
Query: 240 RGGAPVVCIDD---ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
+ V + +SR P+LLL+ + LG+E VNP Y T++L +TFP
Sbjct: 505 LFQSDVFAVSHGEMSSRSPRRIKTTTWGHRPVLLLLGIRLGIEGVNPIYYETIKLLYTFP 564
Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
QS+GI GG+P +S Y VG Q ++ YLDPH+ +P I +
Sbjct: 565 QSVGIAGGRPSSSYYFVGSQADNLFYLDPHNTRPAIPL 602
>gi|148226916|ref|NP_001087417.1| cysteine protease ATG4D [Xenopus laevis]
gi|61211765|sp|Q68FJ9.1|ATG4D_XENLA RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related protein 4 homolog D
gi|51260960|gb|AAH79754.1| MGC84754 protein [Xenopus laevis]
Length = 469
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 94/323 (29%), Positives = 145/323 (44%), Gaps = 56/323 (17%)
Query: 91 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
++ + F +DF SR+ ++YR+ F + + +T+D GWGCM+RS QML+AQ LL H L R
Sbjct: 93 DDEIERFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSRE 152
Query: 151 W--RKPLQKPF----------------------------------------DREYVEILH 168
W + L + F D+ + I+
Sbjct: 153 WTWSEALYRHFVEMEPIRSSSPPSMPLSSLATGHSAGDYQPHTQCSGAPHGDQVHRNIMR 212
Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
F D SPF +H L+ G +G AG W GP +A + + ++
Sbjct: 213 WFSDHPGSPFGLHQLVTLGSIFGKKAGDWYGP-------SIVAHIIKKAIETSSEVPELS 265
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
+YV S D + + D + G+A +++LVP+ LG E NP Y
Sbjct: 266 VYV-SQDCTVYKADIEQLFAGDVPHAETSRGAGKA----VIILVPVRLGGETFNPVYKHC 320
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L+ P LGI+GGKP S Y +G Q+ +YLDPH QP I+ K+D + ++H
Sbjct: 321 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQPYIDTSKNDFPLE--SFHC 378
Query: 349 DVIRHIHLDSIDPSLAIGFYCRD 371
+ R I + +DPS FY ++
Sbjct: 379 NSPRKISITRMDPSCTFAFYAKN 401
>gi|388581514|gb|EIM21822.1| hypothetical protein WALSEDRAFT_68740 [Wallemia sebi CBS 633.66]
Length = 603
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 137/310 (44%), Gaps = 63/310 (20%)
Query: 85 LGDAAGNNGLAEFNQDFSSRILISYRKGF------DPIGDS------------------- 119
LG+ NN ++ DF SRI +YR F DP+ D
Sbjct: 55 LGNLYDNN--SDLLDDFQSRIWCTYRSNFCQISLNDPMMDDLGLAKMQTLSSKPSHWLLR 112
Query: 120 --KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF--------DREYV---EI 166
+D GWGCMLR+SQ L+A L LGR WR+ PF +EYV ++
Sbjct: 113 ERTFNTDQGWGCMLRTSQSLLANTLQIMLLGRQWRR---NPFVDLTDYAKRKEYVNLIKL 169
Query: 167 LHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 224
L+LF D S SPFS+H + GK+ G G W GP + + L Q + L S
Sbjct: 170 LNLFMDNPSTLSPFSVHRMAVVGKSLGKEVGEWFGPSTAALAIKHLVNNQ-TDINLSV-S 227
Query: 225 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVN 282
+ + D GG ++W P+L+LV + LGL+ ++
Sbjct: 228 VASDSVIYKSDVYQASGGTSTT--------------ADSEWGNKPVLILVGVRLGLDGIH 273
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
PRY TL+ +GI GG+P +S Y G Q +S Y+DPH ++P INI E +
Sbjct: 274 PRYYETLKAFLRMQSCVGIAGGRPSSSYYFFGYQSDSLFYVDPHIMKPTINIKTPPTEGE 333
Query: 343 TSTYHSDVIR 352
T +++R
Sbjct: 334 LKTEIENLLR 343
>gi|358056752|dbj|GAA97415.1| hypothetical protein E5Q_04093 [Mixia osmundae IAM 14324]
Length = 1202
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 90/283 (31%), Positives = 126/283 (44%), Gaps = 53/283 (18%)
Query: 97 FNQDFSSRILISYRKGFDPI---------------------------GDSKITSDVGWGC 129
F +DF+SRI ++YR GF PI + +++D GWGC
Sbjct: 545 FYEDFTSRIQLTYRAGFPPIPTTVSNGPATTAFNAVLSSLTGRSPLQANDGLSTDAGWGC 604
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFD--------------REYVEILHLFGD--S 173
MLR+ Q L+A AL F LGR WR+ + Y +L F D S
Sbjct: 605 MLRTGQSLLANALAFVHLGRDWRRTCSSSDESPDIPEESRSLEHFETYARLLTWFLDDPS 664
Query: 174 ETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV- 231
PFS+H GK G G W GP + + LA + +L +A+ V
Sbjct: 665 PLCPFSVHRFAVVGKEQGGKEIGEWFGPSTAAGAIKHLA------SNFAPANLGVAVSVD 718
Query: 232 --VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 289
V + P A R S + P+L+L+ LGL+KVNP Y ++
Sbjct: 719 GTVYRSDVQAAANPPFSEPATAGRQDPAPSVRTSWQRPVLILINARLGLDKVNPLYYESI 778
Query: 290 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
+ +FPQS+GI GG+P +S Y VGVQ+ S Y+DPH +P I
Sbjct: 779 KAALSFPQSVGISGGRPSSSYYFVGVQQNSVYYIDPHHTKPAI 821
>gi|444525500|gb|ELV14047.1| Cysteine protease ATG4D [Tupaia chinensis]
Length = 431
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 153/360 (42%), Gaps = 106/360 (29%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 60 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 109
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 110 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSRSASPSRYHGPAH 169
Query: 152 -RKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
R P L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 170 WRPPRWAQGTPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-- 225
Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG- 261
+A R + + +YV S+ C+V+
Sbjct: 226 -----SLVAHILRKAVESCSEVTRLVVYV--------------------SQDCTVYKADV 260
Query: 262 ---------QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
A+W +++LVP+ LG E +NP Y+P ++L T P
Sbjct: 261 VRLVARPDPAAEWKSVVILVPVRLGGETLNPVYVPCVKLMPTPP---------------- 304
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
++ +YLDPH QP +++ + D + ++H R + +DPS +GFY D+
Sbjct: 305 ---TDDFLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDR 359
>gi|116179672|ref|XP_001219685.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
gi|88184761|gb|EAQ92229.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
Length = 425
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 138/303 (45%), Gaps = 73/303 (24%)
Query: 97 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
F DF SRI ++YR GF+PI GD + +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 192
Q L+A ALL +LGR WR+ +R I+ LF D +P+S+ N ++ G A G
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAER---NIVALFADDARAPYSLQNFVKHGAIACGK 229
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +ALA + + IY G P V D
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
S + + D + PTL L QS+GI GG+P +S Y
Sbjct: 270 ---SFLATARPD-----------------GETFHPTLIL---MEQSIGIAGGRPSSSHYF 306
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VGVQ + YLDPH +P + ++ L + + H+ +R++H++ +DPS+ IGF
Sbjct: 307 VGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSCHTRRLRYLHVEDMDPSMLIGFLI 366
Query: 370 RDK 372
+D+
Sbjct: 367 QDE 369
>gi|328722655|ref|XP_003247627.1| PREDICTED: cysteine protease ATG4B-like [Acyrthosiphon pisum]
Length = 252
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/233 (30%), Positives = 118/233 (50%), Gaps = 32/233 (13%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + +W+LG + D L + D SR+ +YRKGF IG++ T
Sbjct: 40 IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCMLR QM++ QAL+F LGR WR K D +Y++IL +F D ++P+SIH
Sbjct: 89 SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ G ++G G W GP + + + LA L ++ V+ D
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLA---------TMDELSSLVFHVALDN------ 192
Query: 243 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLT 292
+ I++ + C+V + + W P++L++PL LG+ +NP Y+ ++++
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKVS 243
>gi|322707969|gb|EFY99546.1| ATG4 protein [Metarhizium anisopliae ARSEF 23]
Length = 430
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/304 (31%), Positives = 135/304 (44%), Gaps = 50/304 (16%)
Query: 95 AEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVGWGCML 131
A F DF+SR ++YR F DP + S TSD GWGCM+
Sbjct: 121 AAFLDDFASRFWMTYRSNFEIIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSGWGCMI 180
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 190
RS Q L+A A+ LGR WR+ + DRE +L LF D +P+SIHN ++ G+ Y
Sbjct: 181 RSGQSLLANAMAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSIHNFVRHGEKYC 237
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G W GP A R + L ++ E + IY G P + D+
Sbjct: 238 SKYPGEWFGPSATARCIQDLVNSRKQE---------LRIYST--------GDGPDIYEDN 280
Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
+ + + P L+LV LG++K+ P Y L + QS+GI GG+P +S
Sbjct: 281 FMK---IAKPDGEVFHPTLVLVGTRLGIDKITPVYWEALIASVQMSQSVGIAGGRPSSSH 337
Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEA---DTSTYHSDVIRHIHLDSIDPSLAIGF 367
Y VG Q YLDPH + + D D + H+ +R IH+ +DP+
Sbjct: 338 YFVGSQGHFLFYLDPHHTRKALPYYSDVARYTIDDMDSCHTSRLRRIHVREMDPNCHPAN 397
Query: 368 YCRD 371
RD
Sbjct: 398 EIRD 401
>gi|113931596|ref|NP_001039246.1| autophagy related 4D, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|89273389|emb|CAJ82151.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
gi|114108226|gb|AAI22932.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
Length = 470
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 155/353 (43%), Gaps = 72/353 (20%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
S ++ ++LLG + D+ + F +DF SR+ ++YR+ F + + +T+D
Sbjct: 76 SRSAPVYLLGERYYFRLDDEID---------RFQKDFVSRVWLTYRRDFPALEGTALTTD 126
Query: 125 VGWGCMLRSSQMLV---------------AQALLFH------------------------ 145
GWGCM+RS QML+ ++AL H
Sbjct: 127 CGWGCMIRSGQMLLAQGLLLHLLSREWTWSEALYTHFVEMEPIRSSSPSSMPLSLATDHS 186
Query: 146 -RLGRPWRKPLQKPFDRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 203
R +P + P+ E + I+ F D ++PF +H ++ G +G AG W GP
Sbjct: 187 GRHSQPQTHCSRAPYGGEVHQNIVSWFSDHASAPFGLHRMVALGSIFGKRAGDWYGP--- 243
Query: 204 CRSWEALARCQRAETGLGCQSLPMAIYVVSG----DEDGERGGAPVVCIDDASRHCSVFS 259
+A + + +++YV D E+ A V D SR
Sbjct: 244 ----SIVAHIIKKAIESSSEVPDLSVYVSQDCTVYKADIEQLFAGEVPHTDTSR-----G 294
Query: 260 KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES 319
G+A +++LVP LG E NP Y L+ P LGI+GGKP S Y +G Q+
Sbjct: 295 AGKA----VIILVPARLGGETFNPVYKHCLKEFLRMPSCLGIIGGKPKHSLYFIGYQDNY 350
Query: 320 AIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+YLDPH QP I+ +D+ + ++H + R + + +DPS FY +++
Sbjct: 351 LLYLDPHYCQPYIDTSRDNFPLE--SFHCNAPRKLSITRMDPSCTFAFYAKNR 401
>gi|402219068|gb|EJT99143.1| hypothetical protein DACRYDRAFT_70366 [Dacryopinax sp. DJM-731 SS1]
Length = 1093
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/291 (32%), Positives = 137/291 (47%), Gaps = 39/291 (13%)
Query: 117 GDSKITSDVGWGCMLRSSQMLVAQALL-------------FHRLGRPWRKPLQKPFDRE- 162
G +TSD GWGCMLR+ QML+A +L+ + P P + DR+
Sbjct: 431 GRGDLTSDAGWGCMLRTGQMLLANSLVALHVPPLPPNPVYINNFPAPSLPPSET--DRQR 488
Query: 163 ---YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
YV+IL F D + PFS+H L AG G G W GP S + L A
Sbjct: 489 FEAYVKILVWFLDDPSIWCPFSVHRLALAGADMGREVGQWFGPSIAAGSIKKLVSAFPA- 547
Query: 218 TGLGCQSLP------MAIYVVSGDEDGERGGAPVVCIDD-ASRHCSVFSKGQADWTPILL 270
GLG P A++ S + + D +R + K + +L+
Sbjct: 548 CGLGVVVPPDQIIHETAVFTASHTPTLPSSASSLSNTRDREARERANRMKEEWGDRAVLI 607
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
L+ L LG+E V P Y +++ FTFPQ++GI GG+P +S Y VG Q + YLDPH +P
Sbjct: 608 LIGLRLGIEGVTPIYYDSVKALFTFPQTVGIAGGRPSSSYYFVGTQGDHLFYLDPHSTRP 667
Query: 331 VINI-----GKDDLE-----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
+ + G D ++ T+HSD +R +H+ +DPS+ GF R+
Sbjct: 668 AVPLRVPTDGPYDATGQFTLSEMKTFHSDKVRKMHISGLDPSMLCGFIVRN 718
>gi|426230580|ref|XP_004009345.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Ovis
aries]
Length = 438
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 97/314 (30%), Positives = 143/314 (45%), Gaps = 26/314 (8%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 85 TSFSKISSVHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAGGTLTSD 138
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLL 184
GWGCMLRS QM++AQ LL H L R W Q P
Sbjct: 139 CGWGCMLRSGQMMLAQGLLLHLLPRDWTWS-QGAGLGPAEPPGLGSPSPGPGPXXXXXXX 197
Query: 185 QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 244
G+A G AG W GP +A R C + + VS D
Sbjct: 198 SWGRAPGKKAGDWYGP-------SLVAHILRKAVE-SCSEVTRLVVYVSQDC-------- 241
Query: 245 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
V D +R + S A+W +++LVP+ LG E +NP Y+P ++ LGI+GG
Sbjct: 242 TVYKADVARLVAR-SDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGG 300
Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
P S Y +G Q++ +YLDPH QP +++ + D + ++H R + +DPS
Sbjct: 301 TPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCT 358
Query: 365 IGFYCRDKGLLVTF 378
+GFY D+ T
Sbjct: 359 VGFYAGDRKEFETL 372
>gi|448114689|ref|XP_004202639.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
gi|359383507|emb|CCE79423.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
Length = 480
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 145/339 (42%), Gaps = 76/339 (22%)
Query: 85 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 116
LG G++ E +D SRI +YR GF+PI
Sbjct: 69 LGRRYGSSSKEEMEKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128
Query: 117 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
+ T+DVGWGCM+R+SQML+A A LGR + ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAFQLLLLGRDF--AYVDGSEKKHSDIIDMF 186
Query: 171 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
D +PFS+HN ++A L G W GP A S + L + Q
Sbjct: 187 TDEPKTPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQF------------- 233
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG--------QADWTPILLLVPLVLGLEK 280
DG + V I S C ++ + IL+L+P+ LGL K
Sbjct: 234 --------DGSVSPSFRVII---SESCDIYDDKIGKLLQEIENSEDAILILLPVRLGLNK 282
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
V+P Y +L F Q +GI GGKP +S Y G +YLDPH Q + D
Sbjct: 283 VSPYYHDSLSSLFCSSQLVGIAGGKPSSSYYFFGSHNGHLLYLDPHYPQSMKASSIYD-- 340
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
T+H++ ++ + ++ +DPS+ IG + K +F+
Sbjct: 341 ----TFHTNKVQSLKIEDMDPSMLIGILIKSKEDYESFK 375
>gi|426329870|ref|XP_004025954.1| PREDICTED: cysteine protease ATG4C [Gorilla gorilla gorilla]
Length = 491
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 104/401 (25%), Positives = 157/401 (39%), Gaps = 107/401 (26%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAG--------NNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPTESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNYDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQE----------------ESAIYLDPHDVQPVINIGKDDLEADT-- 343
+GGKP S Y G QE ++ + L+ + +P + G +D +
Sbjct: 323 IGGKPKQSYYFAGFQENEVQRSSMNSLKQKSSKNNLKLEGSEKRPQMGFGSEDEFKNILL 382
Query: 344 -------------STYHSDVIRHIHLDSIDPSLAIGFYCRD 371
T+H + + +DPS IGFYCR+
Sbjct: 383 DHVQAFGPPSYPRLTFHCPSPKKMSFRKMDPSCTIGFYCRN 423
>gi|392586633|gb|EIW75969.1| hypothetical protein CONPUDRAFT_111807 [Coniophora puteana
RWD-64-598 SS2]
Length = 1038
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 102/318 (32%), Positives = 139/318 (43%), Gaps = 74/318 (23%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------------------ 121
+Q A G + EF DF+SRI ++YR F PI DS +
Sbjct: 271 SQSPASEKHPGQDWAPEFYADFTSRIWLTYRNQFAPIRDSTLSTLESDQTREPCTEMSSP 330
Query: 122 --------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YV 164
T+D GWGCMLR+ Q L+A ALL LGR WR+P + + YV
Sbjct: 331 SPKSRRWFGGEKGWTTDTGWGCMLRTGQTLLANALLHLHLGRDWRRPPYPLYTEDYATYV 390
Query: 165 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+I+ F DS +PFS+H + AGK G G W GP + + L + + GLG
Sbjct: 391 QIITWFLDSPLPQAPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKRLVQA-FPDAGLGV 449
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------W--TPIL 269
V D A V+S D W +L
Sbjct: 450 ----------------------AVASDGALYQTDVYSASYVDVGSPRNVRKLRWGGRAVL 487
Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
+L + LG+ VNP Y T++ F PQS+GI GG+P +S Y +GVQ ++ IYLDPH +
Sbjct: 488 VLFGIRLGINGVNPIYYDTIKGLFEIPQSVGIAGGRPSSSYYFMGVQGDNLIYLDPHHAR 547
Query: 330 PVINIGKDDLEADTSTYH 347
P I + + EAD H
Sbjct: 548 PAIPL-RPLPEADEGNQH 564
>gi|256071263|ref|XP_002571960.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
gi|353229491|emb|CCD75662.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
Length = 302
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 89/271 (32%), Positives = 135/271 (49%), Gaps = 37/271 (13%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAG 187
M R QML+AQAL+ H LGR WR + ++I+ F DS + SP S+H L+Q
Sbjct: 1 MFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLSLHRLVQMS 60
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGDE--DGER 240
G W GP ++C A+ R + L + + +Y V+ +E D R
Sbjct: 61 DR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYREEIIDLAR 114
Query: 241 G------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLGL-EKVNPR 284
G P + D H +++ + Q+D T ILLL+PL+ G ++NPR
Sbjct: 115 GLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFGKGNRINPR 170
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
YI + F+ P +G++GG+ S+Y VG Q S IYLDPH QP N+ D
Sbjct: 171 YIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNSPKFSVD-- 228
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKGLL 375
++H + + + +++PS A+GFYCR +G L
Sbjct: 229 SWHCPIPKTMSAANLNPSCAVGFYCRTRGEL 259
>gi|294654609|ref|XP_456671.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
gi|218511938|sp|Q6BYP8.2|ATG4_DEBHA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|199429011|emb|CAG84627.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
Length = 492
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 147/330 (44%), Gaps = 79/330 (23%)
Query: 87 DAAGNNGLAEFNQDFSSRILISYRKGFDPIG----------------------------- 117
D + ++G+ E QD S+I ++YR GF+PI
Sbjct: 77 DISVDDGVIE--QDIYSKIWLTYRTGFEPIAKCLDGPQPLSFVQSMVFNRNPISSTFNNF 134
Query: 118 -----DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW-----RKPLQKPFDREYVEIL 167
+ T+DVGWGCM+R+SQ L+A LGR + R P + EI+
Sbjct: 135 HGLLDNDNFTTDVGWGCMIRTSQALLANTYQLLFLGRGFSYGRDRSP-------RHDEII 187
Query: 168 HLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
+F D +PFS+HN ++ L G W GP A S + L C +
Sbjct: 188 DMFMDEPRAPFSLHNFIKVASESPLKVKPGQWFGPNAASLSIKRL-----------CDN- 235
Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP----ILLLVPLVLGLEKV 281
+Y +G G VV + ++ + + ++ P IL+L+P+ LG++KV
Sbjct: 236 ---VYESNG-----TGRVKVVISESSNLYDDIITQMFTTLNPVPDAILVLLPVRLGIDKV 287
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
NP Y ++ QS+GI GGKP +S Y G + +YLDPH Q V N +
Sbjct: 288 NPLYHASVLELLALRQSVGIAGGKPSSSFYFFGYKGNDLLYLDPHYPQFVRN-----KTS 342
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
TYH++ + + +D +DPS+ IG +D
Sbjct: 343 VYDTYHTNSYQKLSVDDMDPSMMIGILIKD 372
>gi|406606786|emb|CCH41822.1| putative cysteine protease atg4 [Wickerhamomyces ciferrii]
Length = 592
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 155/351 (44%), Gaps = 62/351 (17%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG------- 117
S DIW H A+D D N EF D +RI ++YR F PI
Sbjct: 75 SGLKDIWQTLRFH-TAEDNEKDDL--NKWPQEFIDDVYTRIWLTYRTKFSPIDRDPEGPS 131
Query: 118 ----------------DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+ T+D GWGCM+R+SQ L+A ALL +GR WR +
Sbjct: 132 PLSLNFFLRGQNYDLDNEHFTTDCGWGCMIRTSQSLLANALLNLHIGRDWR--YTGELNE 189
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGL 220
+ EI+ F D + PFSIH ++ GK G W GP A RS ++L
Sbjct: 190 MHNEIVSWFIDCPSHPFSIHKIVDKGKLLSNKKPGEWFGPSAAARSIQSL---------- 239
Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
C + V G + G+ V + A VF PIL+L+ L LG++
Sbjct: 240 -CNEFDSGVKVYIGSDSGDIYENDVFKV--AKDENGVFK-------PILILLGLRLGIDN 289
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+NP Y +L+ +S+GI GG+P S Y G Q + YLDPH QP + + D L+
Sbjct: 290 INPVYWDSLKAILNSKESIGIAGGRPSTSHYFFGFQGDHLFYLDPHLPQPAL-LHDDQLD 348
Query: 341 A------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
D ++ H+ +R IHL +DPS+ +GF +D+ + ++
Sbjct: 349 TSVSESTEIVSSLDVNSVHTKKLRKIHLSEVDPSMLLGFLIKDENEWIQWK 399
>gi|410918329|ref|XP_003972638.1| PREDICTED: cysteine protease ATG4D-like [Takifugu rubripes]
Length = 499
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 162/359 (45%), Gaps = 75/359 (20%)
Query: 77 HKIAQDEALGDAAGNNGLAE---FNQDFSSRILISYRKGFDPIGDSKITSDVGWGC---- 129
+KI+ LGD+ N E F F SRI ++YRK F + S T+D GWGC
Sbjct: 83 NKISPVTILGDSYLLNSEDEVERFRLAFVSRIWLTYRKEFPQLEGSTWTTDCGWGCMLRS 142
Query: 130 --MLRSSQMLV-----------AQAL------LFH-----RLG----------------- 148
ML + +LV AQ L +F R G
Sbjct: 143 GQMLLAQGLLVHLMPRGWTWPDAQPLTDVDLEVFRPRSPARAGGVPIPSFASPRGPSTPE 202
Query: 149 RPW----------RKPLQKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
RP +K L+ DR+ + +++ FGD T+PF IH L++ GK+ G A
Sbjct: 203 RPLLSEQATKCSRKKRLESVQDRQAEPTHQKLVFWFGDQPTAPFGIHQLVEIGKSAGKKA 262
Query: 195 GSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
G W GP + +A+AR + + +YV D + +C S+
Sbjct: 263 GDWYGPAIVAHILRKAVARASAVHS--------LVVYVAQ-DCTVYKEDVMHLCDPTPSQ 313
Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
S QA W +++LVP+ LG E +NP YI ++ +GI+GGKP S Y V
Sbjct: 314 TPSDPLSHQA-WKSVIILVPVRLGGECLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFV 372
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
G Q+E +YLDPH QPV+++ + + + ++H + + + + +DPS IGFY + K
Sbjct: 373 GFQDEQLLYLDPHYCQPVVDVSQ--VNSSLESFHCNAPKKMPFNRMDPSCTIGFYAKSK 429
>gi|216963276|gb|ACJ73918.1| autophagy-related 4b variant 6 [Zea mays]
Length = 271
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/123 (52%), Positives = 88/123 (71%), Gaps = 8/123 (6%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
S ++R V +GSM R+ LG +R ++ D+W LG C++++ ++E G + ++G
Sbjct: 90 SRILRRFVGSGSMWRL----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGH 142
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202
Query: 155 LQK 157
+K
Sbjct: 203 SEK 205
>gi|216963270|gb|ACJ73917.1| autophagy-related 4b variant 5 [Zea mays]
Length = 292
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/123 (52%), Positives = 88/123 (71%), Gaps = 8/123 (6%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
S ++R V +GSM R+ LG +R ++ D+W LG C++++ ++E G + ++G
Sbjct: 90 SRILRRFVGSGSMWRL----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGH 142
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202
Query: 155 LQK 157
+K
Sbjct: 203 SEK 205
>gi|216963264|gb|ACJ73916.1| autophagy-related 4b variant 4 [Zea mays]
Length = 208
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 66/123 (53%), Positives = 87/123 (70%), Gaps = 8/123 (6%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
S ++R V +GSM R+ LG +R S D+W LG C++++ ++E G + ++G
Sbjct: 90 SRILRRFVGSGSMWRL----LGCARVLTSG---DVWFLGKCYRVSPEEEESGGSDSDSGH 142
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202
Query: 155 LQK 157
+K
Sbjct: 203 SEK 205
>gi|156042330|ref|XP_001587722.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980]
gi|154695349|gb|EDN95087.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 414
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 132/284 (46%), Gaps = 34/284 (11%)
Query: 97 FNQDFSSRILISYRKGFDPIG---DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
F DF ++I ++YR F I D K S + LRS LV Q G W
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRS--QLVDQGGFTSDTG--WGC 158
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 212
E +IL LF D +P+SIH ++ G A G G W GP A AR
Sbjct: 159 SSSN----EERKILSLFADDPRAPYSIHKFVEHGASACGKHPGEWFGP-------SAAAR 207
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
C +A T +S + +Y+ +GD G+ V S+ +TP L+LV
Sbjct: 208 CIQALTNSQVES-ELRVYI-TGD------GSDVY----EDTFMSIAKPNSTKFTPTLILV 255
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
LGL+K+ P Y L+ + PQS+GI GG+P +S Y +GVQE YLDPH +P +
Sbjct: 256 GTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYFIGVQESDFFYLDPHQTRPAL 315
Query: 333 NIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
+D D + H+ +R +H+ +DPS+ I F RD+
Sbjct: 316 PFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDPSMLIAFLIRDEN 359
>gi|395323681|gb|EJF56143.1| hypothetical protein DICSQDRAFT_113447 [Dichomitus squalens
LYAD-421 SS1]
Length = 999
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 91/278 (32%), Positives = 133/278 (47%), Gaps = 50/278 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 123
F DF+SRI ++YR F PI D+ + TS
Sbjct: 303 FYADFTSRIWLTYRSQFFPIRDTTLAALEQEVHDSPTGLPSSPPSKRWNWPIGGEKGWTS 362
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 178
D GWGCMLR+ Q L+A ALL LGR WR+P + +Y V+I+ F D+ + PF
Sbjct: 363 DAGWGCMLRTGQSLLANALLHLHLGRDWRRPPHPVYTADYAMYVQIVTWFLDTPSPLCPF 422
Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
S+H + GK G G W GP + + L + GLG +A+ S +
Sbjct: 423 SVHRMALVGKDLGKEVGQWFGPSTAAGAIKTLVHS-FPDAGLG-----VAVASDSTLYES 476
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
+ A + RH + +W +L+L+ + LG+E VNP Y T++ +TFP
Sbjct: 477 DVYAASRSSVYSTRRH----GHPRMEWGDRAVLILIGIRLGIEGVNPLYYNTIKTLYTFP 532
Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
Q++GI GG+P +S Y VG Q ++ YLDPH +P I +
Sbjct: 533 QTVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAIPL 570
>gi|355703136|gb|EHH29627.1| Cysteine protease ATG4D [Macaca mulatta]
Length = 511
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 145/354 (40%), Gaps = 77/354 (21%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 129 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 182
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------------- 151
GWGCMLRS QM++AQ LL H L R W
Sbjct: 183 CGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRW 242
Query: 152 -RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP +
Sbjct: 243 AQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLV 295
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS------KGQAD 264
A R C + + VS D +PV + + + +
Sbjct: 296 AHILRKAVE-SCSEVTRLVVYVSQDCTAAEASSPVSDTPASGPLHLLPLLLGVLFQQRCR 354
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W + L+ L LGI+GGKP S Y +G Q++ +YLD
Sbjct: 355 WLFVCELLRCEL---------------------CLGIMGGKPRHSLYFIGYQDDFLLYLD 393
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
PH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 394 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 445
>gi|403296347|ref|XP_003939073.1| PREDICTED: cysteine protease ATG4D [Saimiri boliviensis
boliviensis]
Length = 463
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 144/348 (41%), Gaps = 93/348 (26%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 109 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 162
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPF---------------- 159
GWGCMLRS QM++AQ LL H L R W L P
Sbjct: 163 CGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGPASPSRYHGPARWMPPCW 222
Query: 160 ---------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
+R + +I+ F D +PF +H L++ G++ G AG W GP +
Sbjct: 223 AQGAPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-------SLV 275
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
A R + + +YV S+ C+ G+ TP L
Sbjct: 276 AHILRKAVESSSEVTRLVVYV--------------------SQDCT----GKGTCTPSLQ 311
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
+ LR LGI+GGKP S Y +G Q++ +YLDPH QP
Sbjct: 312 EL----------------LRCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP 351
Query: 331 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+++ + + + ++H R + +DPS +GFY D+ T
Sbjct: 352 TVDVSQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 397
>gi|302498547|ref|XP_003011271.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
benhamiae CBS 112371]
gi|291174820|gb|EFE30631.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
benhamiae CBS 112371]
Length = 437
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 133/304 (43%), Gaps = 85/304 (27%)
Query: 96 EFNQDFSSRILISYRKGFDPI--------GDSK-----------------ITSDVGWGCM 130
+F DF S++ I+YR F PI GDS TSD GWGCM
Sbjct: 145 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSISLGVRLRSQLIDTQGFTSDTGWGCM 204
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KA 189
+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G A
Sbjct: 205 IRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGATA 261
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCI 248
G G W GP A + +AL + + GL +Y+ S G + E+ V C
Sbjct: 262 CGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVACD 313
Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
+ P L+L+ + LG+++V P Y +L+ FPQS+GI G +
Sbjct: 314 ESG-----------GGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGPE--- 359
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+ STYH+ +R +H+ +DPS+ IGF
Sbjct: 360 ---------------------------------ELSTYHTRRLRRLHVREMDPSMLIGFL 386
Query: 369 CRDK 372
RD+
Sbjct: 387 VRDE 390
>gi|431905146|gb|ELK10197.1| Cysteine protease ATG4A [Pteropus alecto]
Length = 342
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/304 (27%), Positives = 138/304 (45%), Gaps = 68/304 (22%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS-SRILISYRKGFDPIGDSKITSDVGWG 128
+W+LG H + D L E F+ + L ++ G P +SD GWG
Sbjct: 35 VWILGKQHLLKTD----------SLPEIISHFTETSELTAHDGGTGP------SSDAGWG 78
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
CMLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + +
Sbjct: 79 CMLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKM-- 136
Query: 189 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
C LP++ + + + G+P
Sbjct: 137 ---------------------------------CCILPLSADIATENP----SGSP---- 155
Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
+AS H S W P+LL+VPL LG+ ++NP Y+ + SLG +GGKP
Sbjct: 156 -NASNHSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFK-------SLGALGGKPNN 207
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+ Y +G + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+
Sbjct: 208 AYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILNLDPSVALGFF 267
Query: 369 CRDK 372
C+++
Sbjct: 268 CKEE 271
>gi|149022064|gb|EDL78958.1| rCG26842 [Rattus norvegicus]
Length = 246
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/246 (30%), Positives = 112/246 (45%), Gaps = 53/246 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 284
D + C V G AD W P+LL+VPL LG+ ++NP
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240
Query: 285 YIPTLR 290
YI +
Sbjct: 241 YIEAFK 246
>gi|350595874|ref|XP_003484197.1| PREDICTED: cysteine protease ATG4A [Sus scrofa]
Length = 393
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 84/318 (26%), Positives = 144/318 (45%), Gaps = 67/318 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PI W
Sbjct: 57 VWILGKQHLLKTEKS-----------KLLADISARLWFTYRRKFSPID---------WN- 95
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
W K ++P +EY IL F D + +SIH + Q G
Sbjct: 96 ---------------------WEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGVG 132
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGERGGAPV 245
G + G W GP + + + LA + +A+YV + ED ++
Sbjct: 133 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCCAS 184
Query: 246 VCIDDA-------SRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
DA S + S SKG + W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 185 ALSADAAVESRRDSLNASTQSKGPSACRPAWKPLLLIVPLRLGINQINPVYVDAFKECFK 244
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D + + + +
Sbjct: 245 MPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQPPQRM 304
Query: 355 HLDSIDPSLAIGFYCRDK 372
++ ++DPS+A+GF+C+++
Sbjct: 305 NILNLDPSVALGFFCQEE 322
>gi|241729578|ref|XP_002404604.1| cysteine protease, putative [Ixodes scapularis]
gi|215505492|gb|EEC14986.1| cysteine protease, putative [Ixodes scapularis]
Length = 433
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 152/363 (41%), Gaps = 83/363 (22%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVG 126
IWLLGV + + G +A + A F+ +DFSSR+ +YR+ F I + I +D G
Sbjct: 36 IWLLGVIYHRKMTQFYGASAVVDDGASFDAFLEDFSSRLWFTYRREFPAIPGTDIRTDCG 95
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWR------------------KPLQKPF-----DREY 163
WGCMLRSSQM++AQA + H LGR WR PL++ F D
Sbjct: 96 WGCMLRSSQMILAQAFVMHLLGRQWRWQQVHTEAGEVRLPRHALWPLREGFRCTGGDGTA 155
Query: 164 VEIL----------HLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EAL 210
V + FGD ++PFS+HNL+Q G+ G AG W GP ++ +AL
Sbjct: 156 VLVRCSPKPVNDPPRWFGDKADASTPFSLHNLVQRGRESGKKAGDWYGPSSVAYILKDAL 215
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
+ L + IYV + +DD + CS S
Sbjct: 216 EDAAHRDQRLA----QLCIYVAQD---------CTIYMDDVTALCSAGSTEGV------- 255
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQS--------------LGIVGGKPGASTYI-VGV 315
+ PR + R F+ Q+ + K G S + +
Sbjct: 256 -------THRRLPRTVFARREMFSGGQTQRMCIHSSWLHLFVFFVCFLKYGISFLLQLSA 308
Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLL 375
EE IYLDPH Q ++++ D D ++H R + IDPS IGFYC+ K L
Sbjct: 309 AEEKVIYLDPHYCQEMVDVNSQDFPLD--SFHCSWPRKMSFSRIDPSCTIGFYCKTKHDL 366
Query: 376 VTF 378
F
Sbjct: 367 EDF 369
>gi|299738612|ref|XP_001834660.2| cysteine protease [Coprinopsis cinerea okayama7#130]
gi|298403389|gb|EAU87108.2| cysteine protease [Coprinopsis cinerea okayama7#130]
Length = 1034
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 94/272 (34%), Positives = 129/272 (47%), Gaps = 50/272 (18%)
Query: 97 FNQDFSSRILISYRKGF-DPIGDSKI-------------------------------TSD 124
F DF+SRI ++YR F PI D ++ +SD
Sbjct: 302 FYIDFTSRIWLTYRSHFPQPIKDGRLADLCGGPQPEPVASPVTKKSPWHWVGGEKSWSSD 361
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFS 179
GWGCMLR+ Q L+A AL+ LGR WRKP +Y V IL F D+ +PFS
Sbjct: 362 SGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVMTADYATYVHILTWFLDTPAPEAPFS 421
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
+H + AGK G G W GP + +AL E G+G +A+ V DG
Sbjct: 422 VHRMALAGKELGTDVGQWFGPSVAAGAIKALVNS-FPEAGIG-----VAVAV-----DGV 470
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
V + + W P+LLL+ + LG+E VNP Y T+++ +TFPQ
Sbjct: 471 LYQTDVHAASHGDHFGRTPRRHKRSWGDRPVLLLLGIRLGIEGVNPIYYDTIKMLYTFPQ 530
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
S+GI GG+P +S Y VG Q ++ YLDPH +
Sbjct: 531 SVGIAGGRPSSSYYFVGSQADNLFYLDPHHAR 562
>gi|410989159|ref|XP_004000832.1| PREDICTED: cysteine protease ATG4A isoform 2 [Felis catus]
Length = 336
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 134/305 (43%), Gaps = 70/305 (22%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG-GKPGA 308
D + C V P S VG PG
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTVGESTPG- 202
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLDSIDPSLAIGF 367
T Q + I+LDPH Q +N +++ D T+H + +++ ++DPS+A+GF
Sbjct: 203 -TLNASNQSDELIFLDPHTTQTFVNT-EENGTVDDQTFHCLQSPQRMNILNLDPSVALGF 260
Query: 368 YCRDK 372
+C+++
Sbjct: 261 FCKEE 265
>gi|354544955|emb|CCE41680.1| hypothetical protein CPAR2_802300 [Candida parapsilosis]
Length = 423
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 87/260 (33%), Positives = 129/260 (49%), Gaps = 44/260 (16%)
Query: 118 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 177
+ TSD GWGCM+R+SQ L+A ALL +L + Q ++IL LF D TSP
Sbjct: 138 NDNFTSDAGWGCMIRTSQNLLAIALL--KLSEEHNESAQ-------LDILKLFQDDPTSP 188
Query: 178 FSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSG 234
FS+HN ++ + L G W GP A S + L ++ ET P I V
Sbjct: 189 FSLHNFIRVASSSPLLVKPGQWFGPNAASLSIKKLTIEAKKLET-------PGEIPYVYI 241
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
E+ + DD +F++ Q P+LLL P+ LG+++VN Y ++ +
Sbjct: 242 SENAD-------LFDDEIE--DLFNEEQK---PLLLLFPVRLGIDQVNKYYYKSILQLLS 289
Query: 295 FPQSLGIVGGKPGASTYIVGVQEES-AIYLDPHDVQPV---INIGKDDLEADTSTYHSDV 350
P S+GI GGKP +S Y +G + E+ +Y DPH Q V INI +TYH+
Sbjct: 290 LPYSVGIAGGKPSSSFYFIGYENENHLLYFDPHLPQVVEAPINI---------TTYHTAN 340
Query: 351 IRHIHLDSIDPSLAIGFYCR 370
+ ++ +DPS+ IG +
Sbjct: 341 YNKLDIEMVDPSMMIGVLLK 360
>gi|403413274|emb|CCL99974.1| predicted protein [Fibroporia radiculosa]
Length = 994
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 90/277 (32%), Positives = 129/277 (46%), Gaps = 49/277 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 124
F DF+SRI ++YR F+PI D+ + TSD
Sbjct: 309 FYSDFTSRIWLTYRSQFEPIRDTSLSALNYDMDERAAPTSSPQPKRWNWGLGGEKGWTSD 368
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDSETS--PFS 179
GWGCMLR+ Q L+A ALL LGR WR+P + + YV+I+ F D + PFS
Sbjct: 369 SGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPIYTADFATYVQIISWFLDDPSPLCPFS 428
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
+H + GK G G W GP + + L E GLG +A+ V D
Sbjct: 429 VHRMALVGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVS---VAVDGVIYQSDVY 484
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
+ + +H G+ W +L+L+ + LG++ VNP Y ++ +T PQ
Sbjct: 485 AVSRSTMGLGSPRKH------GRPSWGDRAVLVLIGIRLGIDGVNPIYYDLIKALYTLPQ 538
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
+LGI GG+P +S Y VG Q + YLDPH +P I +
Sbjct: 539 TLGIAGGRPSSSYYFVGSQANNLFYLDPHHARPTIPL 575
>gi|50307871|ref|XP_453929.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|62899744|sp|Q6CQ60.1|ATG4_KLULA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49643063|emb|CAH01025.1| KLLA0D19536p [Kluyveromyces lactis]
Length = 450
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 159/378 (42%), Gaps = 67/378 (17%)
Query: 26 LASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEAL 85
L+ + LG E V R T + + + + SRT + + S A + +
Sbjct: 4 LSRISQHLGIVEDVDRDGTVFILGKEYAPLNNKSRTDVETDDS-----------ALESLI 52
Query: 86 GDAAGNNGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT------------------ 122
+ N GL D SR+ +YR F PI G S I
Sbjct: 53 NIVSLNPGLL---SDVHSRVFFTYRTQFTPIRRNENGPSPINFTLFFRDNPINTLENALT 109
Query: 123 ------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
SD+GWGCM+R+ Q L+A A+ +L R +R + D E + ++ F D
Sbjct: 110 DPDSFYSDIGWGCMIRTGQALLANAIQRVKLAREFRINASRIDDNE-LNLIRWFQDDVKY 168
Query: 177 PFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
P S+HN ++A K G+ G W GP A RS + L E C I S D
Sbjct: 169 PLSLHNFVKAEEKISGMKPGQWFGPSATARSIKTLI-----EGFPLCGIKNCIISTQSAD 223
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+ D+ +R +F K + +LLL + LG++K+N Y + +
Sbjct: 224 ----------IYEDEVTR---IFHKDRD--ANLLLLFAVRLGVDKINSLYWKDIFKILSS 268
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
P S+GI GGKP +S Y G Q E+ YLDPH+ Q ++ DDLE S H +H
Sbjct: 269 PYSVGIAGGKPSSSLYFFGYQNENLFYLDPHNTQQS-SLMMDDLEFYRSC-HGHKFNKLH 326
Query: 356 LDSIDPSLAIGFYCRDKG 373
+ DPS+ +G K
Sbjct: 327 ISETDPSMLLGMLISGKN 344
>gi|432845798|ref|XP_004065858.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
Length = 497
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 108/208 (51%), Gaps = 12/208 (5%)
Query: 165 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 224
+++ LFGD +PF +H L+ GK G AG W GP + + R A+T +G QS
Sbjct: 231 KLVTLFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGPSVVAH----ILRKAVAKTSVG-QS 285
Query: 225 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 284
L A+YV +D V+ + D S V W +++LVP+ LG E +NP
Sbjct: 286 L--AVYVA---QDCTVYKEDVLQLCDPSLSQRVADPSSQAWKSVIILVPVRLGGEALNPS 340
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
YI ++ + +GI+GGKP S Y +G Q+E +YLDPH QPV++ + + +
Sbjct: 341 YIECVKNILSLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDFTQANFSLE-- 398
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
++H + + +DPS IGFY R K
Sbjct: 399 SFHCSSPKKMPFSRMDPSCTIGFYARTK 426
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 11/73 (15%)
Query: 65 SSTSDIWLLGVCHKI-AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
+ TS I++LG + + ++DE + F DF SRI ++YR+ F + S +T+
Sbjct: 87 NKTSPIFVLGHAYLLNSEDE----------VERFRLDFVSRIWLTYRREFPQLEGSTLTT 136
Query: 124 DVGWGCMLRSSQM 136
D GWGCMLRS QM
Sbjct: 137 DCGWGCMLRSGQM 149
>gi|119623101|gb|EAX02696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_g
[Homo sapiens]
Length = 340
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 131/303 (43%), Gaps = 66/303 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 184
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 185 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 207
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 208 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 266
Query: 370 RDK 372
+++
Sbjct: 267 KEE 269
>gi|332226094|ref|XP_003262224.1| PREDICTED: cysteine protease ATG4A isoform 2 [Nomascus leucogenys]
Length = 336
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 131/303 (43%), Gaps = 66/303 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 181 DIKKMCCV-------------------------------------LPLSADTAGDRPPDS 203
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 370 RDK 372
+++
Sbjct: 263 KEE 265
>gi|395854620|ref|XP_003799780.1| PREDICTED: cysteine protease ATG4A isoform 2 [Otolemur garnettii]
Length = 336
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 132/304 (43%), Gaps = 68/304 (22%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G P S
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTAGESPPGS 203
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLDSIDPSLAIGFY 368
+ Q I+LDPH Q ++ +++ D T+H + +++ ++DPS+A+GF+
Sbjct: 204 LTALN-QSNELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSPQRMNILNLDPSVALGFF 261
Query: 369 CRDK 372
C+++
Sbjct: 262 CKEE 265
>gi|402911089|ref|XP_003918175.1| PREDICTED: cysteine protease ATG4A isoform 2 [Papio anubis]
Length = 336
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 131/303 (43%), Gaps = 66/303 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRP-LD 202
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ + D + + + +++ ++DPS+A+GF+C
Sbjct: 203 YLTASNQSDELIFLDPHTTQTFVDTEENGMVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 370 RDK 372
+++
Sbjct: 263 KEE 265
>gi|30795248|ref|NP_840054.1| cysteine protease ATG4A isoform b [Homo sapiens]
gi|426397038|ref|XP_004064735.1| PREDICTED: cysteine protease ATG4A isoform 2 [Gorilla gorilla
gorilla]
gi|15487242|emb|CAC69077.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
gi|119623095|gb|EAX02690.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 336
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 131/303 (43%), Gaps = 66/303 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 203
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 370 RDK 372
+++
Sbjct: 263 KEE 265
>gi|392574855|gb|EIW67990.1| hypothetical protein TREMEDRAFT_63874 [Tremella mesenterica DSM
1558]
Length = 1159
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 81/248 (32%), Positives = 112/248 (45%), Gaps = 51/248 (20%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------------REYVEIL 167
+T+D GWGCMLR+ Q L+A AL+ LGR WR P Q YV IL
Sbjct: 580 LTTDAGWGCMLRTGQSLLANALIHLHLGRDWRVPSQPQVPPTSAAHLAELEAYSSYVRIL 639
Query: 168 HLFGDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
F D + PFS+H + GK G G W GP + + L S
Sbjct: 640 SWFLDDPSPLCPFSVHRIALIGKELGKEVGEWFGPSTAAGALKTL-----------VNSF 688
Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------------W--T 266
P + V+ D +V D ++ S G +D W
Sbjct: 689 PPSGMAVATAVDS------IVYKSDVYSASNLQSTGWSDESAPPRRQSSSSRSSTSWGNR 742
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
+L+L+ + LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y VG Q S +YLDPH
Sbjct: 743 AVLVLIGIRLGLDGVNPLYYESIKALFTFPQSVGIAGGRPSSSYYFVGTQANSLVYLDPH 802
Query: 327 DVQPVINI 334
+P + +
Sbjct: 803 FTRPAVPL 810
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 23/39 (58%)
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+A T+H D +R I L +DPS+ +GF C+D+ F
Sbjct: 962 KAQLGTFHCDKVRKIPLSGLDPSMLLGFVCKDEADFEDF 1000
>gi|344304092|gb|EGW34341.1| hypothetical protein SPAPADRAFT_59751, partial [Spathaspora
passalidarum NRRL Y-27907]
Length = 363
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 131/281 (46%), Gaps = 43/281 (15%)
Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
F+ R+ + R FD SDVGWGCM+R+SQ L+A AL+ LQ +
Sbjct: 104 FNKRLFTTVRSLFD---SENFNSDVGWGCMIRTSQSLLANALM----------KLQPSAE 150
Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 218
E +++LF D+ S FS+HN ++ L G W GP A S + L + +T
Sbjct: 151 HE---VINLFQDNIASAFSLHNFIRVASESPLEVKPGQWFGPNAASLSTKKLLDGMKGKT 207
Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
G + + I S D E I++ SV L+L P+ LG+
Sbjct: 208 IQGVKYPHVFISENSDLYDEE--------IEELLVESSV-----------LILFPVRLGI 248
Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
+ VN Y ++ P ++GI GGKP +S Y +G Q++ +Y DPH Q N
Sbjct: 249 DNVNSYYYDSIFQLLACPFTVGISGGKPSSSFYFLGYQDQDLLYFDPHSPQLYEN----- 303
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+ +TYH++ + +H+ +DPS+ +G +DK F+
Sbjct: 304 -PINYTTYHTNNYQRLHIHMLDPSMMVGILVKDKSEYKEFK 343
>gi|397497902|ref|XP_003819742.1| PREDICTED: cysteine protease ATG4A isoform 2 [Pan paniscus]
Length = 336
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 131/303 (43%), Gaps = 66/303 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 203
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 370 RDK 372
+++
Sbjct: 263 KEE 265
>gi|444321667|ref|XP_004181489.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
gi|387514534|emb|CCH61970.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
Length = 577
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 91/314 (28%), Positives = 138/314 (43%), Gaps = 64/314 (20%)
Query: 95 AEFNQDFSSRILISYRKGFDPI-----------------------------GDSKITSDV 125
EF +D SR++ +YR F PI + T+D+
Sbjct: 127 VEFLEDCKSRLIFTYRTNFSPIERAPDGPSPINVSVLFRDTLFNTVNHVLNNPNSFTTDI 186
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 179
GWGCM+R+ Q L+ AL LGR +R P K E +I+ F D+ PFS
Sbjct: 187 GWGCMIRTGQSLLGNALQIINLGRNFRINNQSNNPNTKNIKEE--DIIEWFYDNPNKPFS 244
Query: 180 IHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
IH + G + G W GP C + ++L + E G+ + V SGD
Sbjct: 245 IHKFVDKGMRISDKKPGEWFGPSTTCTAIQSLIY-EFPECGID----ECILSVSSGD--- 296
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+ D+ + H F K + T IL+L+ + LG++K+N Y ++ S
Sbjct: 297 -------IYEDEINEH---FQKNEN--TIILILLGVKLGIDKINQCYFNDIKDILNSRYS 344
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
GI GG+P +S Y G E Y DPH +P + + +D + ST +S ++ +
Sbjct: 345 CGISGGRPSSSLYFFGHMNEYLYYFDPH--KPQLQLNEDFKNSCHSTDYSKIL----ISE 398
Query: 359 IDPSLAIGFYCRDK 372
IDPS+ IGFY + K
Sbjct: 399 IDPSMLIGFYLKGK 412
>gi|403289553|ref|XP_003935916.1| PREDICTED: cysteine protease ATG4A isoform 2 [Saimiri boliviensis
boliviensis]
Length = 360
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 131/303 (43%), Gaps = 66/303 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 205 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 227
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+ + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 228 -LTASNESDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 286
Query: 370 RDK 372
+++
Sbjct: 287 KEE 289
>gi|219129924|ref|XP_002185127.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403306|gb|EEC43259.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 557
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 94/323 (29%), Positives = 132/323 (40%), Gaps = 47/323 (14%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL---- 155
D S +YR F I ITSD GWGCMLRS+QM++ QAL H R WR P
Sbjct: 171 DERSLFWFTYRCDFPEIAPYNITSDAGWGCMLRSAQMMLGQALRLHFKSRDWRPPQLLAR 230
Query: 156 --QKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 212
Q F R + + S S +S+HN++ AG Y G W GP C L
Sbjct: 231 RRQDSFIRSVLTWFADYPSSSESVYSLHNMVAAGLSKYDKLPGEWYGPGTACYVMRDLVH 290
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
+ LG L I+ V G + + K +
Sbjct: 291 IHEKQQALGKTRLDRRIFRVYVAPQGTVYRDTIHAFMTTEARVRIEEKKKVKEQTQPQAH 350
Query: 273 PLVLGLEK---------------------------VNPRYIPTLRLTFTFPQSLGIVGGK 305
PL L E+ +N Y+ +L TF+ PQS+G++GG+
Sbjct: 351 PLDLEWEEELMESANTVEWDTALLLLVPLRLGLTSLNEEYVQSLAHTFSLPQSVGVLGGR 410
Query: 306 PGASTYIVGVQEE-SAIY-LDPHDVQ--PVINIGKDDLEADTSTYHS-DVIRHIHLD--- 357
P + + G Q++ S I+ LDPH VQ P + + +A + S D +R H
Sbjct: 411 PRGARWFYGAQKDGSKIFGLDPHTVQTAPGRQTARVNGQASSVVELSDDYLRSCHTTCPE 470
Query: 358 -----SIDPSLAIGFYCRDKGLL 375
+DPS+A+GFYCR + L
Sbjct: 471 MFPFCKMDPSIALGFYCRTRADL 493
>gi|156839152|ref|XP_001643270.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
70294]
gi|166990653|sp|A7TQN1.1|ATG4_VANPO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|156113873|gb|EDO15412.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
70294]
Length = 411
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 87/300 (29%), Positives = 137/300 (45%), Gaps = 57/300 (19%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 127
F D SRI +YR F PI S +D+GW
Sbjct: 74 FLSDVISRIHFTYRTKFIPIARSDDGPSPLRINFLIGDNPFNAIENAIYNPNCFNTDIGW 133
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
GCM+R+ Q L+A A+ LGR +R + + +I+ F D+ PFS+HN ++ G
Sbjct: 134 GCMIRTGQSLLANAIQIAILGREFRVN-DGDVNEQERKIISWFMDTPDEPFSLHNFVKKG 192
Query: 188 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
+ G W GP A RS ++L Q + G+ + ++ + DE
Sbjct: 193 CELSSKKPGEWFGPAATSRSIQSLVE-QFPDCGIDRCIVSVSSADIFKDE---------- 241
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
I+D +F + ++ ILLL+ + LG++KVN Y+ +R S+GI GG+P
Sbjct: 242 -IND------IFKNKR--YSNILLLMGVKLGVDKVNEYYLKDIRKILESRYSVGISGGRP 292
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
+S Y G Q+++ +Y DPH QP +E+ T H+D I++ +DPS+ IG
Sbjct: 293 SSSLYFFGYQDDTLLYFDPHKPQPST------IESLLETCHTDNFDKINISDMDPSMLIG 346
>gi|296236154|ref|XP_002763201.1| PREDICTED: uncharacterized protein LOC100409486 [Callithrix
jacchus]
Length = 360
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 131/303 (43%), Gaps = 66/303 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 205 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 227
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+E I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 228 LTASNRSDE-LIFLDPHTTQTFVDAEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 286
Query: 370 RDK 372
+++
Sbjct: 287 KEE 289
>gi|150864470|ref|XP_001383296.2| hypothetical protein PICST_30446 [Scheffersomyces stipitis CBS
6054]
gi|166990661|sp|A3LQU0.2|ATG4_PICST RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|149385726|gb|ABN65267.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 514
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 137/281 (48%), Gaps = 38/281 (13%)
Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
FS +L + + + I T+DVGWGCM+R+SQ L+A F RL L K D
Sbjct: 138 FSKSLLYNLQNFNNFIEKENFTTDVGWGCMIRTSQSLLANT--FVRL-------LDKQSD 188
Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 218
I+ LF D+ +PFS+HN ++ + L G W GP A S + L C
Sbjct: 189 -----IIALFNDTYLAPFSLHNFIRVASSSPLKVKPGEWFGPNAASLSIKRL--CDGYYD 241
Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
+++ I V+ + ++ ++ +KG +L+L+P+ LG+
Sbjct: 242 NSTSETILPRINVLISESTDLYDSQIAQLLEPSTE-----TKG------LLVLLPVRLGI 290
Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
+ +N Y +L + QS+GI GGKP +S Y G Q+ S IY+DPH Q I D
Sbjct: 291 DSINSYYFSSLLHLLSLEQSVGIAGGKPSSSFYFFGYQDNSLIYMDPHSAQ----IFSSD 346
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+ D STY++ + + + +DPS+ IG + RD L ++E
Sbjct: 347 I--DMSTYYATRYQRVDIGKLDPSMLIGVFIRD---LTSYE 382
>gi|348520913|ref|XP_003447971.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
Length = 500
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 104/210 (49%), Gaps = 12/210 (5%)
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ ++ FGD +PF +H L+ GK G AG W GP +A R
Sbjct: 232 HSRLVTWFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGP-------SVVAHILRKAVDKTS 284
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+A+YV +D VV + D S + + DW +++LVP+ LG E +N
Sbjct: 285 VVTNLAVYVA---QDCTVYKEDVVRLCDRSLNQTSSDPSSQDWKSVIILVPVRLGGEALN 341
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
P YI ++ +GI+GGKP S Y +G Q+E +YLDPH QPV+++ + + +
Sbjct: 342 PSYIDCVKNFLKLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDVSQINFSLE 401
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
++H + + + +DPS IGFY ++K
Sbjct: 402 --SFHCSSPKKMPFNRMDPSCTIGFYAKNK 429
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 38/61 (62%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+ F F SRI ++YR+ F + S T+D GWGCMLRS QML+AQ LL H + R W
Sbjct: 104 VERFRLAFVSRIWLTYRREFPQLEGSTWTTDCGWGCMLRSGQMLLAQGLLVHLMPRDWVW 163
Query: 154 P 154
P
Sbjct: 164 P 164
>gi|241958330|ref|XP_002421884.1| cysteine protease, putative [Candida dubliniensis CD36]
gi|223645229|emb|CAX39828.1| cysteine protease, putative [Candida dubliniensis CD36]
Length = 443
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 141/331 (42%), Gaps = 73/331 (22%)
Query: 85 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------- 119
LG N+ A N S++ +SYR GF+PI S
Sbjct: 69 LGQIFDNSNAA--NNYIESKLWLSYRCGFEPIPKSIDGPQPIHFFPSIIFNRTTIYSNFA 126
Query: 120 ---------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
TSD GWGCM+R+SQ L+A LL K + + EI+ LF
Sbjct: 127 NLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEQEIVKLF 173
Query: 171 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
D SPFSIHN ++ + L G W GP A S + L + + G P
Sbjct: 174 QDDTKSPFSIHNFIRVASSSPLHVKPGEWFGPNAASLSIKRLTNELQDQEINGIN--PPR 231
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
+++ + DD R VF+K +++ +++L P+ LG++KVN Y +
Sbjct: 232 VFISENSD----------LFDDEIR--DVFAKEKSN--SVIILFPIRLGIDKVNSYYYNS 277
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
+ + S GI GGKP +S Y +G ++ IY DPH Q V + + +YHS
Sbjct: 278 IFHLLSSKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQIV------ETPFNMDSYHS 331
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+++ +DPS+ IG + + F+
Sbjct: 332 TNYNTLNISLLDPSMMIGILVTNIDEYIDFK 362
>gi|58260832|ref|XP_567826.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134117209|ref|XP_772831.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338817600|sp|P0CQ11.1|ATG4_CRYNB RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|338817601|sp|P0CQ10.1|ATG4_CRYNJ RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|50255449|gb|EAL18184.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229907|gb|AAW46309.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1193
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 81/240 (33%), Positives = 110/240 (45%), Gaps = 28/240 (11%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 165
+TSD GWGCMLR+ Q L+ AL+ LGR WR P E Y +
Sbjct: 562 LTSDAGWGCMLRTGQSLLVNALIHIHLGRDWRVPSTPASFSEATTTQEIAALKDYAKYAQ 621
Query: 166 ILHLFGDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
+L F D + PFS+H + GK G G W GP + + LA A G+
Sbjct: 622 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 680
Query: 224 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPL 274
+ +I Y S D +P R +K + W +L+LV +
Sbjct: 681 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRRGDNEAK-EEKWGKRAVLILVGV 739
Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y VG Q YLDPH +P I +
Sbjct: 740 RLGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 799
>gi|189515077|ref|XP_001333093.2| PREDICTED: cysteine protease ATG4D-like [Danio rerio]
Length = 485
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/227 (31%), Positives = 110/227 (48%), Gaps = 26/227 (11%)
Query: 150 PWRKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
P R P P D + +++ FGD ++PF +H L++ GK G AG W GP +
Sbjct: 210 PARCPSASPDPQVDALHRKVVSCFGDHPSAPFGVHQLVELGKESGKRAGDWYGPSVVAHM 269
Query: 207 W-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
+A+AR E +A+YV V +D C G W
Sbjct: 270 LRKAVARAAEFED--------LAVYVAQD---------CTVYKEDVMSLCESSGVG---W 309
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
+++LVP+ LG E +NP YI ++ +GI+GGKP S + VG Q+E +YLDP
Sbjct: 310 KSVVILVPVRLGGESLNPSYIECVKNILKLKCCIGIIGGKPKHSLFFVGFQDEQLLYLDP 369
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
H QPV+++ + + + ++H + R ++ +DPS IG Y R K
Sbjct: 370 HYCQPVVDVTQANFSLE--SFHCNSPRKMNFSRMDPSCTIGLYARSK 414
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 40/63 (63%), Gaps = 1/63 (1%)
Query: 91 NNGLAE-FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
N G E F Q F S + ++YR+ F + S +T+D GWGCMLRS QM++AQ LL H +
Sbjct: 92 NEGEVERFRQTFVSCVWLTYRREFPQLDGSSLTTDCGWGCMLRSGQMMLAQGLLLHLMPT 151
Query: 150 PWR 152
WR
Sbjct: 152 DWR 154
>gi|390594065|gb|EIN03481.1| hypothetical protein PUNSTDRAFT_56214 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1093
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 93/274 (33%), Positives = 127/274 (46%), Gaps = 55/274 (20%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 121
F DF+SR+ ++YR F PI D+ +
Sbjct: 369 FYADFTSRVWVTYRSHFQPIRDTTLSALESDFGEQAQSANTSGNSVVSGSPSSGRRWWGG 428
Query: 122 ----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-LQKPFDR--EYVEILHLFGDSE 174
TSD GWGCMLR+ Q L+A ALL LGR WR+P +P YV++L F DS
Sbjct: 429 EKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPSYPQPTAAYASYVQLLTWFFDSP 488
Query: 175 TS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
+ PFS+H + AGK G G W GP + + L A G G VV
Sbjct: 489 SPLCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVH---AFPGGGLGVAVAVDGVV 545
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
+ +P D+ RH + G +L+L+ + LGL+ VNP Y T++
Sbjct: 546 YETDVFSASHSP-----DSRRHHRTSTWGDRG---VLILIGIRLGLDGVNPIYYDTIKEL 597
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
+T+PQS+GI GG+P +S Y VG Q +S YLDPH
Sbjct: 598 YTWPQSVGIAGGRPSSSYYFVGSQADSLFYLDPH 631
>gi|145481079|ref|XP_001426562.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124393637|emb|CAK59164.1| unnamed protein product [Paramecium tetraurelia]
Length = 391
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 142/309 (45%), Gaps = 42/309 (13%)
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
Q +S I +YRK F I +S+ TSD GWGCMLRS QM+ AQ L H R+ Q
Sbjct: 51 QIYSRTIWFTYRKNFPQILNSQQTSDAGWGCMLRSGQMIWAQILRVH-----IRQKKQHS 105
Query: 159 FDREYVEILHLFGDSET---------------SPFSIHNLLQAGK-AYGLAAGSWVGPYA 202
D +Y ++L F D + SP+SI + + + + W P
Sbjct: 106 KDYQY-KLLCAFSDDDDDEHKKMFTDNFKLCLSPYSIQKIEAISQIKFSMKPCQWYRPDQ 164
Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIY--VVSGDEDGERGGAPVVC-----------ID 249
+ + L + ++ E G + L + I ++ E G + C
Sbjct: 165 ILNALSLLHQQKQLE---GSEDLEITISDSLLYDRLYSEMYGLKMDCEHIVNEIKQDKNK 221
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
+ S+ C++ K I + + GL+++N Y+P L PQ GI+GG+ +
Sbjct: 222 EISKICNICQKKDPKALAIFFITRI--GLDEINKEYLPFLNDLIDLPQFQGIIGGRDDKA 279
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
YI+G + IYLDPH +Q IN G + D T+ +++I+ + + PS+A+GFYC
Sbjct: 280 YYILGRVNKRLIYLDPHYIQEHINRGNVVMLKD--TFFCKDVKYINEEQMSPSIALGFYC 337
Query: 370 RDKGLLVTF 378
+++ L F
Sbjct: 338 QNQSELDKF 346
>gi|395545675|ref|XP_003774724.1| PREDICTED: cysteine protease ATG4A [Sarcophilus harrisii]
Length = 431
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/231 (29%), Positives = 112/231 (48%), Gaps = 15/231 (6%)
Query: 151 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------Y 201
W K ++P EY IL F D + +SIH + Q G G + G W GP
Sbjct: 137 WEKHQEQP--EEYQRILKCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 194
Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 261
A+ W +LA + + + + ++ D + + +D + C + G
Sbjct: 195 ALFDEWNSLAVYVSMDNTVVIEDIKKMCHMCPSDLTHDSSSSSYNGLD-WNTDCPGQTSG 253
Query: 262 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
W P+LL+VPL LG+ ++NP Y + F PQSLG +GGKP ++ Y +G + I
Sbjct: 254 ---WKPLLLIVPLRLGINQINPIYADAFKECFKMPQSLGALGGKPNSAYYFIGFLGDELI 310
Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
YLDPH Q ++ ++ D S + + + ++DPS+A+GF+C+++
Sbjct: 311 YLDPHTTQTFVDTEENGTVNDQSFHCQQSPPRMKILNLDPSVALGFFCKEE 361
>gi|365988214|ref|XP_003670938.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
gi|343769709|emb|CCD25695.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
Length = 427
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 84/253 (33%), Positives = 119/253 (47%), Gaps = 30/253 (11%)
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 180
+D+GWGCM+R+ Q L+ AL LGR WR + EI F D+ PFS+
Sbjct: 55 TDIGWGCMIRTGQSLLGNALQLRNLGRDWRFDDNTDLKMTEKSNEIASWFMDTPEKPFSL 114
Query: 181 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD---- 235
H + G + G G W GP A RS ++L + E G+ I V SGD
Sbjct: 115 HRFISKGMQLSGKKPGEWFGPAATARSIQSLVH-EFPECGID----KCLISVSSGDIYKT 169
Query: 236 --EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
ED G H GQ D T IL+L+ + LG+E +N Y ++R
Sbjct: 170 EVEDVFNEG-----------HTGEARNGQKDKT-ILILLGVKLGIETINRCYWDSIRRIL 217
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
+ S+GI GG+P +S Y G Q + +Y DPH QP + K+DL +T H+
Sbjct: 218 SSEYSIGIAGGRPSSSLYFFGYQGDELLYFDPHSPQPSYD--KNDLFYETC--HTTNFGK 273
Query: 354 IHLDSIDPSLAIG 366
+ L +DPS+ +G
Sbjct: 274 LSLADMDPSMLLG 286
>gi|321263995|ref|XP_003196715.1| hypothetical protein CGB_K2500C [Cryptococcus gattii WM276]
gi|317463192|gb|ADV24928.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1188
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 79/239 (33%), Positives = 109/239 (45%), Gaps = 26/239 (10%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 165
+TSD GWGCMLR+ Q L+ AL+ LGR WR P E Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLINALIHVHLGRDWRLPSTPATFSEATTSQEIAALKDYAKYAQ 619
Query: 166 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
++ F D S PFS+H + GK G G W GP + + LA A G+
Sbjct: 620 MVSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGTLKTLANS-FAPCGIAVA 678
Query: 224 SLPMAI------YVVSG--DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 275
+ +I Y S +D R RH + +G+ +L+LV +
Sbjct: 679 TATDSIIYRSDVYAASNLPSDDWNRISPTFNPSRKKKRHNAEAKEGKWGERAVLILVGIR 738
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
LGL+ VNP Y +++ FTFPQ+ G GG+P +S Y VG Q YLDPH +P I +
Sbjct: 739 LGLDGVNPIYYDSIKALFTFPQAGGSAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 797
>gi|405119256|gb|AFR94029.1| peptidase family C54 protein [Cryptococcus neoformans var. grubii
H99]
Length = 1185
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 80/240 (33%), Positives = 113/240 (47%), Gaps = 28/240 (11%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------QKPFDRE---------YVE 165
+TSD GWGCMLR+ Q L+ AL+ LGR WR P + ++E Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLVNALIHVHLGRDWRVPSTPASFSEATTNQETAALKDYAKYAQ 619
Query: 166 ILHLFGDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
+L F D + PFS+H + GK G G W GP + + LA A G+
Sbjct: 620 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 678
Query: 224 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPL 274
+ +I Y S D +P R +K + W +L+LV +
Sbjct: 679 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRGGDNKAK-EGKWGKRAVLILVGI 737
Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y +G Q YLDPH +P I +
Sbjct: 738 RLGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFIGSQANHLFYLDPHLTRPAIPL 797
>gi|68485712|ref|XP_713234.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
gi|71152285|sp|Q59UG3.1|ATG4_CANAL RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|46434715|gb|EAK94117.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
Length = 446
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 132/318 (41%), Gaps = 70/318 (22%)
Query: 98 NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 123
N S++ +SYR GF+PI S TS
Sbjct: 80 NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCM+R+SQ L+A LL K + + EI+ LF D +SPFSIHN
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDDTSSPFSIHNF 186
Query: 184 LQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
++ L G W GP A S + LA + + +P + D
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLASELLQDQEIDGIKIPRVFISENSD------ 240
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
DD R VF+K + +L+L P+ LG++KVN Y ++ S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
GGKP +S Y +G ++ IY DPH Q V + + +YH+ +++ +DP
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYHTTNYNRLNISLLDP 345
Query: 362 SLAIGFYCRDKGLLVTFE 379
S+ IG + + F+
Sbjct: 346 SMMIGILVTNIDEYIDFK 363
>gi|68485607|ref|XP_713286.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
gi|46434768|gb|EAK94169.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
Length = 446
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 132/318 (41%), Gaps = 70/318 (22%)
Query: 98 NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 123
N S++ +SYR GF+PI S TS
Sbjct: 80 NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCM+R+SQ L+A LL K + + EI+ LF D +SPFSIHN
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDGTSSPFSIHNF 186
Query: 184 LQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
++ L G W GP A S + L + L +P + D
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLTNELLQDQELDGIRIPRVFISENSD------ 240
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
DD R VF+K ++ +L+L P+ LG++KVN Y ++ S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKS--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
GGKP +S Y +G ++ IY DPH Q V + + +YH+ +++ +DP
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYHTTNYNRLNISLLDP 345
Query: 362 SLAIGFYCRDKGLLVTFE 379
S+ IG + + F+
Sbjct: 346 SMMIGILVTNIDEYIDFK 363
>gi|268536436|ref|XP_002633353.1| Hypothetical protein CBG06097 [Caenorhabditis briggsae]
Length = 411
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 131/276 (47%), Gaps = 54/276 (19%)
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP-----------FDREYVE---IL 167
T+D GWGCM+R++QM+VAQA++ +R GR WR +K FD E ++ IL
Sbjct: 88 TTDCGWGCMIRTTQMMVAQAIMINRFGRNWRFVRRKKSHVTVNGEETEFDTEKMKEWMIL 147
Query: 168 HLFGDSETSPFSIHNLLQ-AGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
LF D ++P IH +++ A + G A G W P EA+ ++A T
Sbjct: 148 KLFEDKPSAPLGIHKMIEIAAREKGKRAVGCWYSPS------EAVFIMKKAITESASPLT 201
Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPR 284
+ +S D G + ++ ++H WT L+LV +V LG ++N
Sbjct: 202 GDTVMYLSID-----GRVHIRDLEVETKH----------WTKTLMLVIVVRLGAAELNRI 246
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
Y+P L F+ LGI GG+P S + VG + IYLDPH I I D++ +TS
Sbjct: 247 YVPHLMRLFSMDSCLGITGGRPDHSCWFVGYYGDQVIYLDPHVAHEYIPI---DMDFNTS 303
Query: 345 -------------TYHSDVIRHIHLDSIDPSLAIGF 367
+YH ++ +H +DPS A+ F
Sbjct: 304 QEDPKKPKKCPERSYHCRLLSKMHFLDMDPSCALCF 339
>gi|441628985|ref|XP_004093160.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Nomascus
leucogenys]
Length = 441
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 82/300 (27%), Positives = 133/300 (44%), Gaps = 33/300 (11%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW--GCMLRSSQML-VAQALLFHR 146
G F DF SR+ ++YR + I D W G L ++ A +H
Sbjct: 98 GEGEHTAFPADFVSRLWLTYRXXXHCLTMCSIPPDWTWAEGTGLGPPELSGSASPSRYHG 157
Query: 147 LGRPWRKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
R W P L++ +R + +I+ F D +PF +H L++ G++ G AG W
Sbjct: 158 PAR-WMPPRWAQGAPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWY 214
Query: 199 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
GP +A R + + +YV + A +V D +
Sbjct: 215 GP-------SLVAHILRKAVESCSEVTRLVVYVSQTCSMYKADVARLVARPDPT------ 261
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++
Sbjct: 262 ----AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCQLCLGIMGGKPRHSLYFIGYQDD 317
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+YLDPH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 318 FLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 375
>gi|344229797|gb|EGV61682.1| hypothetical protein CANTEDRAFT_115142 [Candida tenuis ATCC 10573]
Length = 408
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 126/266 (47%), Gaps = 37/266 (13%)
Query: 116 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 175
I + T+DVGWGCM+R+SQ L+A +++ + + +E +++L F DSE
Sbjct: 123 IDNENFTTDVGWGCMIRTSQSLLANT---------YKRMISEDAQQE-IQLLDQFKDSEA 172
Query: 176 SPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS 233
+PFS+HN ++ L G W GP A S + L ++ G LP ++S
Sbjct: 173 APFSLHNFIRVANESPLQVKPGQWFGPNAASLSIQRLCNLVNSKENFG---LPGLSVLIS 229
Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
+ D DD + + K Q+ +L+L+P+ LG++K N Y ++
Sbjct: 230 ENSD---------LYDDKVQEF-LDKKKQS----LLILLPIRLGIDKTNEFYYSSILQLL 275
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
QS+GI GGKP +S Y G + +YLDPH Q A ++YH+ +
Sbjct: 276 NCKQSVGIAGGKPSSSFYFFGYDNDELLYLDPHYPQ--------GTNAGYNSYHTPRYQR 327
Query: 354 IHLDSIDPSLAIGFYCRDKGLLVTFE 379
+ + +DPS+ IG D TF+
Sbjct: 328 LTISQLDPSMMIGILVDDLQDYNTFK 353
>gi|62899792|sp|Q8NJJ3.1|ATG4_PICPA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4; AltName:
Full=Pexophagy zeocin-resistant mutant protein 8
gi|21585563|gb|AAL25849.1| Paz8 [Komagataella pastoris]
Length = 533
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 117/267 (43%), Gaps = 50/267 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 135
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 194
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 252 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
+ G +D +TPIL+L+ + LG+EKVN LR + QS+GI G K
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNLYIGDLLRECLSLKQSVGISGRKTSFLA 281
Query: 311 YI-VGVQEESAIYLDPHDVQPVINIGK 336
+ +G Q + YL P + + GK
Sbjct: 282 LLSIGFQGDYLFYLIPTFPKKALTFGK 308
>gi|429850312|gb|ELA25600.1| cysteine protease atg4 [Colletotrichum gloeosporioides Nara gc5]
Length = 411
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 86/303 (28%), Positives = 119/303 (39%), Gaps = 83/303 (27%)
Query: 95 AEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVGWGCML 131
A F DF S+ ++YR F DP + S +SD GWGCM+
Sbjct: 109 AAFLDDFESKFWMTYRSEFELIAKSTDPRASSALSLSMRIKSQLVDQSGFSSDSGWGCMI 168
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYG 191
RS QML+A A+ LGR A G
Sbjct: 169 RSGQMLLANAMAITNLGR--------------------------------------VACG 190
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R ++L Q + + +Y G P V D
Sbjct: 191 KYPGEWFGPSATARCIQSLTNAQEQPS--------LRVYST--------GDGPDVYED-- 232
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ + + P L+LV LG++K+ P Y L PQS+GI GG+P AS Y
Sbjct: 233 -KFMKIAKPDGTRFHPTLILVGTRLGIDKITPVYWDALIAALQMPQSVGIAGGRPSASHY 291
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+G Q YLDPH +P + D +AD T H+ +R +H+ +DPS+ IGF
Sbjct: 292 FIGAQGSFLFYLDPHHTRPALPYHSDPSRYTDADIDTAHTRRLRRLHVREMDPSMLIGFL 351
Query: 369 CRD 371
+D
Sbjct: 352 IKD 354
>gi|238879782|gb|EEQ43420.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 446
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 92/332 (27%), Positives = 136/332 (40%), Gaps = 72/332 (21%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------ 119
LG N A N S++ +SYR GF+PI S
Sbjct: 68 VLGQTFDNFDTA--NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNF 125
Query: 120 ----------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 169
TSD GWGCM+R+SQ L+A LL K + + EI+ L
Sbjct: 126 ANLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKL 172
Query: 170 FGDSETSPFSIHNLLQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
F D +SPFSIHN ++ L +G W GP A S + L + + +P
Sbjct: 173 FQDGTSSPFSIHNFIRVASLSPLHVKSGEWFGPNAASLSIKRLTSELLQDQEIDGIKIPR 232
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
+ D DD R VF+K + +L+L P+ LG++KVN Y
Sbjct: 233 VFISENSD-----------LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYN 277
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 347
++ S GI GGKP +S Y +G ++ IY DPH Q V + + +YH
Sbjct: 278 SIFHLLASKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYH 331
Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+ +++ +DPS+ IG + + F+
Sbjct: 332 TTNYNRLNISLLDPSMMIGILVTNIDEYIDFK 363
>gi|213403524|ref|XP_002172534.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
gi|212000581|gb|EEB06241.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
Length = 314
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 96/335 (28%), Positives = 136/335 (40%), Gaps = 57/335 (17%)
Query: 48 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 107
M I ER L T S + IW LG H A + A F QD + +
Sbjct: 4 MSHILERYLRMFPTNHEPSGTFIWSLG--HSYATETGKWPEA-------FVQDTYDLLSL 54
Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
+YRK G +SD GWGCM+RS Q ++A L R +P P+ K IL
Sbjct: 55 TYRKCI--AGMECFSSDAGWGCMIRSMQTMLANCL---RRVQP-SLPVHK--------IL 100
Query: 168 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
H F D + S+H + AG + G+W GP + L C + P
Sbjct: 101 HYFADEANAYLSLHQFVDAGHTLCNITPGNWFGPATVSHCAAHL-----------CSTHP 149
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI--LLLVPLVLGLEKVNPR 284
V DG ++ + Q TP LLL L LG++ ++
Sbjct: 150 QVGLNVCVSHDG-----------------AIMYRDQLRNTPYPRLLLFTLRLGIDTIHTS 192
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
Y L T PQ++GIVGG+P A+ Y Q + YLDPH Q D A S
Sbjct: 193 YYEQLCHVLTIPQAIGIVGGRPRAAHYFYACQSQWFFYLDPHTTQTAHTF---DNPAPNS 249
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
++H +R + ++ +DP + +GF + FE
Sbjct: 250 SFHVTTLRRLRINELDPCMVLGFAITSEECQTDFE 284
>gi|16551551|dbj|BAB71121.1| unnamed protein product [Homo sapiens]
Length = 330
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/279 (27%), Positives = 121/279 (43%), Gaps = 57/279 (20%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWR------------------------------------K 153
MLRS QM++AQ LL H L R W
Sbjct: 1 MLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP +A
Sbjct: 61 ELER--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHI 111
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
R + +YV + A +V D + A+W +++LVP
Sbjct: 112 LRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVP 161
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP ++
Sbjct: 162 VRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVD 221
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+ + D + ++H R + DPS +GFY D+
Sbjct: 222 VSQADFPLE--SFHCTSPRKMAFAKTDPSCTVGFYAGDR 258
>gi|254584596|ref|XP_002497866.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
gi|238940759|emb|CAR28933.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
Length = 489
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/309 (30%), Positives = 139/309 (44%), Gaps = 53/309 (17%)
Query: 88 AAGNNGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT-------------------- 122
+ +N +F D SR+ +YR F PI G S ++
Sbjct: 69 SKNSNENPDFLSDVRSRLHFTYRTRFMPIPAVPGGPSPLSFHFLIRENPINAIENAINNP 128
Query: 123 ----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
+DVGWGCM+R+ Q L+ AL RLGR +R + E + I+ F D +PF
Sbjct: 129 ACFNTDVGWGCMIRTGQSLLGNALQIARLGRGYR--IGSELKPEEISIIDWFVDIPDAPF 186
Query: 179 SIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
SIHN + G + G W GP A RS ++L R + CQ I V SGD
Sbjct: 187 SIHNFVSKGMELSSKRPGEWFGPAATSRSIQSLIRGFKQCGIDDCQ-----ISVSSGD-- 239
Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
V +D + VF++ + + ILLL+ + LG+ VN Y ++
Sbjct: 240 --------VYEEDVMK---VFNESKD--SRILLLLGVKLGINAVNEFYWNDIKRLLGSKF 286
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
S+GI GG+P +S Y +G Q +YLDPH QP ++ + + + HS + +
Sbjct: 287 SVGIAGGRPSSSLYFIGYQGNELLYLDPHTAQPFLSPSHQE-RSFYDSCHSSNYGKLAIQ 345
Query: 358 SIDPSLAIG 366
+DPS+ IG
Sbjct: 346 DLDPSMLIG 354
>gi|145549650|ref|XP_001460504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428334|emb|CAK93107.1| unnamed protein product [Paramecium tetraurelia]
Length = 402
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 85/300 (28%), Positives = 130/300 (43%), Gaps = 46/300 (15%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPWRKP--LQKP 158
SS I SYRK S +TSD GWGCM+R +QM +AQ + +H +P + ++
Sbjct: 71 SSIIWFSYRKKIPQFQISSLTSDTGWGCMIRVAQMALAQVIRHYHSFTQPEQLIVLIRHF 130
Query: 159 FDREYVEILHLFGDSETS-------PFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEAL 210
D + E+++ + + PFSI ++ K + G W P + + L
Sbjct: 131 LDDDDDELINFIKQDQKNQVQYYHAPFSIQKIVYHAKVEFKKEPGDWYKPNEILETLNYL 190
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP--- 267
+ + SL M IY+ + DA + + KG +W
Sbjct: 191 FKYSQY-------SLNMQIYI---------NYQCAFILQDAIKQMFNYDKGNQEWLKECI 234
Query: 268 -------------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
I + +P +GL++VN Y+ L + T P GI+GG + YIVG
Sbjct: 235 KNNNQFISQHDKGIAIFLPARIGLQRVNQDYLEVLNILMTLPYFQGIIGGVTNRAFYIVG 294
Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGL 374
++ IYLDPH VQ N +DL ++Y I+ IH SIDPS+ + C GL
Sbjct: 295 RIQDYLIYLDPHFVQNAQNF--EDLSKTQASYTCQNIQLIHNKSIDPSIVVCL-CVRNGL 351
>gi|45185039|ref|NP_982756.1| ABL191Wp [Ashbya gossypii ATCC 10895]
gi|62899767|sp|Q75E61.1|ATG4_ASHGO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|44980675|gb|AAS50580.1| ABL191Wp [Ashbya gossypii ATCC 10895]
gi|374105958|gb|AEY94868.1| FABL191Wp [Ashbya gossypii FDAG1]
Length = 521
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 93/308 (30%), Positives = 135/308 (43%), Gaps = 54/308 (17%)
Query: 96 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 126
EF D +R+ +YR F PI G S ++ +D+G
Sbjct: 115 EFLADVHTRLHFTYRTRFVPIPRHPNGPSPMSISVMLRDNPLNVIENVLNNPDCFQTDIG 174
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+A AL LGR +R + E + I+ F D PFS+H +Q
Sbjct: 175 WGCMIRTGQSLLANALQRACLGRDFRIDDNAANEHE-LRIIKWFEDDPKYPFSLHKFVQE 233
Query: 187 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G + G G W GP A RS +AL A C I SGD
Sbjct: 234 GFSLSGKKPGEWFGPSATSRSIQALVAKFPA-----CGIAHCVISTDSGD---------- 278
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
V +D+ +F + +LLL+ + LG++ VN Y +R + S+GI GG+
Sbjct: 279 VYMDEVE---PLFRADPS--AAVLLLLCVRLGVDVVNEVYWEHIRHILSSEHSVGIAGGR 333
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT-STYHSDVIRHIHLDSIDPSLA 364
P +S Y G Q+E YLDPH +P +N+ + D + H+ +H+ IDPS+
Sbjct: 334 PSSSLYFFGYQDEHLFYLDPH--KPQLNLASYQQDLDLFRSVHTQRFNKVHMSDIDPSML 391
Query: 365 IGFYCRDK 372
IG K
Sbjct: 392 IGILLNGK 399
>gi|440789707|gb|ELR11008.1| cysteine protease atg4a, putative, partial [Acanthamoeba
castellanii str. Neff]
Length = 180
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 49/109 (44%), Positives = 73/109 (66%), Gaps = 1/109 (0%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W P+++LVP+ LG++ +NP YIPTL+ F+FPQ LG++GGKP +S Y VG Q+ +Y+D
Sbjct: 11 WHPVIILVPVRLGIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMD 70
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
PH VQP + + D L +Y ++ + + D IDPSLA+GF C +
Sbjct: 71 PHFVQPTVKMDDDPLFP-IESYRMEIPQAMSFDDIDPSLALGFLCSSQA 118
>gi|67967551|dbj|BAE00258.1| unnamed protein product [Macaca fascicularis]
Length = 330
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 122/283 (43%), Gaps = 53/283 (18%)
Query: 130 MLRSSQMLVAQALLFHRLGRPW----------------------------------RKPL 155
MLRS QM++AQ LL H L R W +
Sbjct: 1 MLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60
Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 215
+ +R + +I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 61 ELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILR 113
Query: 216 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 275
+ + +YV + A +V D + A+W +++LVP+
Sbjct: 114 KAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVPVR 163
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP +++
Sbjct: 164 LGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVS 223
Query: 336 KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+ D + ++H R + +DPS +G Y D+ T
Sbjct: 224 QADFPLE--SFHCTSPRKMAFAKMDPSCTVGSYAGDRKEFETL 264
>gi|167393590|ref|XP_001740639.1| cysteine protease atg4 [Entamoeba dispar SAW760]
gi|165895180|gb|EDR22930.1| cysteine protease atg4, putative [Entamoeba dispar SAW760]
Length = 332
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 83/272 (30%), Positives = 125/272 (45%), Gaps = 34/272 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 161
I I+YRK I + T+D GWGCM+RS QM++AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96
Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 220
+++ I++LFGDS S FSIH L+ G+ G W GP + A AE
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP--------SFASDIAAEHIN 148
Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
+ YV + G G + SK + + P ++ VPL LG E
Sbjct: 149 EMRVFRTRGYVA---KLGSIVGPKI----------EELSKDEVGFNPCIIFVPLRLGPES 195
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+ P L+ F PQ +G++GGKPG + Y + +LDPH Q I D++
Sbjct: 196 PENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAI-----DMK 250
Query: 341 ADTS--TYHSDVIRHIHLDSIDPSLAIGFYCR 370
D S +Y + ++ IDPS+++ F +
Sbjct: 251 GDWSYQSYFCKDNKSMNYSKIDPSISLVFLVK 282
>gi|385305819|gb|EIF49766.1| cysteine protease atg4 [Dekkera bruxellensis AWRI1499]
Length = 476
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 92/311 (29%), Positives = 144/311 (46%), Gaps = 54/311 (17%)
Query: 95 AEFNQDFSSRILISYRKGF-----DPIGDSKI-------------------TSDVGWGCM 130
++F D ++R+ +YR GF DP G S + T+D GWGCM
Sbjct: 91 SDFISDVATRLWFTYRSGFPVIKRDPDGPSPLSLGSLFRGTLDVKNASIGFTTDSGWGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRK-PLQKPF-DREYV-------EILHLFGDSETSPFSIH 181
+R+SQ L+A ALL +GR WR P + P + EY +I+ F D +PFSI
Sbjct: 151 IRTSQSLLANALLNLHVGRKWRYIPAENPNGETEYAKKYEKQWQIITWFADFPWAPFSIQ 210
Query: 182 NLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
+++ G + G W GP A RS L + ++ C+ + Y+ G+ D
Sbjct: 211 QIVRYGSEHCNKKPGEWFGPSAASRSIVYLCK----QSYKACK---LNTYLTEGNGD--- 260
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
+D S + + P L+L + LG+ VNP Y L+ + QS+G
Sbjct: 261 ------IYEDELLXVSCPEGTENGFRPTLILSGVRLGVXXVNPVYWAFLKKLLSIHQSVG 314
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEAD-TSTYHSDVIRHIHL 356
I GG+P +S Y G Q ++ Y+DPH Q + ++ D + ++ H+ IR + L
Sbjct: 315 IAGGRPSSSHYFFGYQGDNLFYMDPHTPQTALLADHVDDADYRXEYVASVHTKRIRKLGL 374
Query: 357 DSIDPSLAIGF 367
+DPS+ IG
Sbjct: 375 CEMDPSMLIGL 385
>gi|443917360|gb|ELU38094.1| peptidase family c54 domain-containing protein [Rhizoctonia solani
AG-1 IA]
Length = 808
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 92/286 (32%), Positives = 126/286 (44%), Gaps = 69/286 (24%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 121
F +DF+S I ++YR + PI D+ +
Sbjct: 142 FYEDFTSLIWLTYRSHYTPIRDTSLESLAPLGPCDMEMAPAHLVPASPRRWNWPGSADKS 201
Query: 122 -TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDSET-- 175
TSD GWGCMLR+ Q L+A AL+ LGR WR+P F E YV+IL F D+ +
Sbjct: 202 WTSDAGWGCMLRTGQSLLANALIHLHLGRNWRRPHYPMFAEEHAVYVKILTWFFDTPSPL 261
Query: 176 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ---SLPMAIYVV 232
+PF +H + AGKA G G+W GP S + LA CQ SL + V
Sbjct: 262 APFGVHRMALAGKALGKDVGTWFGPSTAAGSIKTLAHAFPE-----CQLSVSLAVDGTVF 316
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
+ D V + SK G+A +L+LV + LGL+ VNP Y L+
Sbjct: 317 ASDVYAASHMGMVTTSGRSISSRRSASKWGGRA----VLILVNIRLGLDNVNPIYYDALK 372
Query: 291 LTFTFPQSLGIVGGKP--GASTYIVGVQEESAIYLDPHDVQPVINI 334
+ G+P G+S Y VG Q +S YLDPH +P I +
Sbjct: 373 V------------GRPRQGSSYYFVGSQADSLFYLDPHHTRPYIPL 406
>gi|169622773|ref|XP_001804795.1| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
gi|160704853|gb|EAT78153.2| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
Length = 357
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 110/230 (47%), Gaps = 42/230 (18%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
N + F DF SR+ ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
+RS Q ++A AL RLGR WR + D+++ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209
Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + LA R E GL +Y VSGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVY-VSGD------GADV--YE 252
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
D + +V G W P L+LV LG++K+ P Y L++ P L
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKIREMDPSML 300
>gi|366995231|ref|XP_003677379.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
gi|342303248|emb|CCC71026.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
Length = 495
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 88/313 (28%), Positives = 138/313 (44%), Gaps = 74/313 (23%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 127
F +D +R+ +YR F PI S +D+GW
Sbjct: 75 FLKDVVTRLHFTYRTRFKPIMKSPEGPSPLNFSLVIRENPIDVIENAITNPDCFNTDIGW 134
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGDSETSPFSIHNLLQ 185
GCM+R+ Q L+ L RLGR +R P +++ E I+ F D+ PFS+H +
Sbjct: 135 GCMIRTGQSLLGNTLQIVRLGRDFR---YDPENKDISENRIIEWFIDAPEKPFSLHQFIT 191
Query: 186 AG-KAYGLAAGSWVGPYAMCRSWEALAR----CQRAETGLGCQSLPMAIYVVSGDEDGER 240
G + G G W GP A RS ++L R C AE + V SGD
Sbjct: 192 EGMELSGKNPGEWFGPAATARSIQSLIRKFPDCGIAEC---------LVSVSSGD----- 237
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
+ D+ + VF+ + + +L+L+ + LGL VN Y ++R + S+G
Sbjct: 238 -----IYSDEVKQ---VFADNKKN---LLILLGVKLGLNAVNECYWDSIRHILSSKYSVG 286
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY---HSDVIRHIHLD 357
I GG+P +S Y G + + +Y DPH QP LE + +Y H++ + ++
Sbjct: 287 ISGGRPSSSLYFFGYEGDELLYFDPHSPQP-------SLEENNVSYKSCHTNKYGKLLMN 339
Query: 358 SIDPSLAIGFYCR 370
+DPS+ +GF R
Sbjct: 340 DMDPSMLLGFLIR 352
>gi|340508502|gb|EGR34192.1| hypothetical protein IMG5_021070 [Ichthyophthirius multifiliis]
Length = 285
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 83/281 (29%), Positives = 122/281 (43%), Gaps = 44/281 (15%)
Query: 101 FSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
F S I I+YR+ F P+ + SD GWGCM+R QM +A+ L K
Sbjct: 2 FESIIWITYRRKFPPLKAPQYEYISDTGWGCMIRVGQMALAEGL--------------KR 47
Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAE 217
F + EI+ LF D + S FSI N+ +AGK + L AG W P +C + L +
Sbjct: 48 FQIKEDEIIDLFQDKKDSLFSIQNICEAGKEEFKLEAGDWFNPIRICYILQILNEKK--- 104
Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 277
G + L I +S D ++ +D S G ++L + LG
Sbjct: 105 ---GFKDLK--IRTISSDR--------ILIFEDLEMEFSSEKNG------LILFLVCKLG 145
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
LEK Y+ F + S+G++GGKP + + VG E+ IYLDPH VQ +
Sbjct: 146 LEKTEENYLKIALKIFDYKNSIGMIGGKPKKALFFVGRIEDQLIYLDPHYVQDF-----N 200
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
D ++Y + ID S+ + +K L F
Sbjct: 201 QNNVDQNSYFCKNYAVLDQKKIDSSIGNVLFFENKEELKMF 241
>gi|406698456|gb|EKD01693.1| hypothetical protein A1Q2_04064 [Trichosporon asahii var. asahii
CBS 8904]
Length = 1295
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 118/281 (41%), Gaps = 49/281 (17%)
Query: 82 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 141
D G A N GL+ SR G+ G+ +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559
Query: 142 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 185
L+ LGR WR P QKP YV +L F D S PFS+H
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
GK G G W GP + + LA P + VVS + G
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLANS----------FPPCGLSVVSAAD----GSVFR 665
Query: 246 VCIDDASRHCSVFSKGQADWTP------------ILLLVPLVLGLEKVNPRYIPTLRLTF 293
+ AS + ++ G P +L+++P LGL+ VNP Y ++
Sbjct: 666 SEVYQASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK--- 722
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
S+GI GG+P +S Y V Q S YLDPH +P + +
Sbjct: 723 ----SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759
>gi|401886473|gb|EJT50506.1| hypothetical protein A1Q1_00204 [Trichosporon asahii var. asahii
CBS 2479]
Length = 1295
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 118/281 (41%), Gaps = 49/281 (17%)
Query: 82 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 141
D G A N GL+ SR G+ G+ +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559
Query: 142 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 185
L+ LGR WR P QKP YV +L F D S PFS+H
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
GK G G W GP + + LA P + VVS + G
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLANS----------FPPCGLSVVSAAD----GSVFR 665
Query: 246 VCIDDASRHCSVFSKGQADWTP------------ILLLVPLVLGLEKVNPRYIPTLRLTF 293
+ AS + ++ G P +L+++P LGL+ VNP Y ++
Sbjct: 666 SEVYQASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK--- 722
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
S+GI GG+P +S Y V Q S YLDPH +P + +
Sbjct: 723 ----SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759
>gi|149020505|gb|EDL78310.1| rCG31864, isoform CRA_c [Rattus norvegicus]
Length = 337
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 105/219 (47%), Gaps = 19/219 (8%)
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
DR + I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 72 DRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-------SVVAHILRKAVE 124
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
C + + VS D V D +R S + A+W +++LVP+ LG E
Sbjct: 125 -SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAEWKSVVILVPVRLGGE 174
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
+NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP +++ + +
Sbjct: 175 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVNQANF 234
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+ ++H R + +DPS +GFY ++ T
Sbjct: 235 PLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETL 271
>gi|326430141|gb|EGD75711.1| pyruvate water dikinase [Salpingoeca sp. ATCC 50818]
Length = 1055
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 77/275 (28%), Positives = 128/275 (46%), Gaps = 41/275 (14%)
Query: 105 ILISYRKGFDPI-GDSKITSDVGWGCMLRSSQMLVAQALLFHRL--GRPWRKPLQKPFDR 161
+ ++YRKG+DPI GD+++TSD GWGC RS QML+AQAL+ + R R +P
Sbjct: 603 VWLTYRKGYDPIHGDAQLTSDTGWGCTYRSGQMLLAQALMSNAEPSARMQRLEGVRPSTW 662
Query: 162 EYVE----ILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 215
++ E +L +F DS + FSI ++ + G W+ P + + R
Sbjct: 663 QHEETKRAVLSMFQDSHDPAAFFSIQHMAETSFVVRKKPGQWLSPSEVAL---IIRRLNP 719
Query: 216 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 275
ETG+ V ++D G+ W P LL++PL
Sbjct: 720 PETGMR-----------------------VRIVNDTLLSTRRILAGEP-WMPTLLMIPLR 755
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE--SAIYLDPHDVQPVIN 333
GL+ + P +P F +P +G +GGKPG++ Y VG+ + +YLDPH + ++
Sbjct: 756 AGLDTLQPESVPAFVAFFDWPWCVGAIGGKPGSAYYYVGIDHDRRRVLYLDPHTTRSRLD 815
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+ +A T D ++ + + S+ +G +
Sbjct: 816 LSN---QAAEKTCVPDKLKSMDMSKSCSSICVGLF 847
>gi|149246610|ref|XP_001527730.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|166990616|sp|A5DSB4.1|ATG4_LODEL RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|146447684|gb|EDK42072.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 523
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 130/277 (46%), Gaps = 43/277 (15%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALL--FHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
TSD GWGCM+R+SQ L+A ALL FH G +P + +++ LF D+ ++PF
Sbjct: 179 FTSDAGWGCMIRTSQNLLANALLRLFHTTGG---QPQNFAVTKTEADVIELFQDTLSAPF 235
Query: 179 SIHNLLQAGKAYGL--AAGSWVGPYA-------MCRSWEALARCQRAETGLGCQS---LP 226
S+HN ++A + L G W GP A + + + + +R+E G S +P
Sbjct: 236 SLHNFIKAANSLSLNIKPGQWFGPSAASLSIKKLVNDYNLIQQERRSERDSGRDSGHKVP 295
Query: 227 M-----------AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG-----QADWTPILL 270
+ D +R P V + S +C ++ + + PIL
Sbjct: 296 TPNLKLHSKSADSDSDSDSDAISKRNSIPYVYV---SENCDLYDDEINAIFELEQRPILF 352
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPHDVQ 329
L P+ LG+E+VN Y ++ S+GI GGKP +S Y +G + E+ IY DPH Q
Sbjct: 353 LFPIRLGIEQVNKYYYSSILQILASKFSVGIAGGKPSSSFYFIGYEGEDDLIYFDPHLPQ 412
Query: 330 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
V + +YH+ + +D +DPS+ IG
Sbjct: 413 IV------QTPVNLESYHTSEYSKLKIDQLDPSMMIG 443
>gi|323335883|gb|EGA77161.1| Atg4p [Saccharomyces cerevisiae Vin13]
Length = 494
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 126/305 (41%), Gaps = 57/305 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCR 370
G +
Sbjct: 358 GILIK 362
>gi|37362688|ref|NP_014176.2| Atg4p [Saccharomyces cerevisiae S288c]
gi|61252248|sp|P53867.2|ATG4_YEAST RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|166990654|sp|A6ZRL7.1|ATG4_YEAS7 RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|1173491|gb|AAA86498.1| ORF494 [Saccharomyces cerevisiae]
gi|151944321|gb|EDN62599.1| cysteine protease [Saccharomyces cerevisiae YJM789]
gi|190409197|gb|EDV12462.1| anchor protein [Saccharomyces cerevisiae RM11-1a]
gi|285814439|tpg|DAA10333.1| TPA: Atg4p [Saccharomyces cerevisiae S288c]
gi|323352870|gb|EGA85172.1| Atg4p [Saccharomyces cerevisiae VL3]
gi|392297128|gb|EIW08229.1| Atg4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 126/305 (41%), Gaps = 57/305 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCR 370
G +
Sbjct: 358 GILIK 362
>gi|349580723|dbj|GAA25882.1| K7_Atg4p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 126/305 (41%), Gaps = 57/305 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCR 370
G +
Sbjct: 358 GILIK 362
>gi|323307493|gb|EGA60764.1| Atg4p [Saccharomyces cerevisiae FostersO]
Length = 494
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 126/305 (41%), Gaps = 57/305 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCR 370
G +
Sbjct: 358 GILIK 362
>gi|323303340|gb|EGA57136.1| Atg4p [Saccharomyces cerevisiae FostersB]
Length = 494
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 126/305 (41%), Gaps = 57/305 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGBIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCR 370
G +
Sbjct: 358 GILIK 362
>gi|256272398|gb|EEU07381.1| Atg4p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 126/305 (41%), Gaps = 57/305 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCR 370
G +
Sbjct: 358 GILIK 362
>gi|365763488|gb|EHN05016.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 494
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 126/305 (41%), Gaps = 57/305 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCR 370
G +
Sbjct: 358 GILIK 362
>gi|1183991|emb|CAA93375.1| N1274 [Saccharomyces cerevisiae]
gi|1302243|emb|CAA96126.1| unnamed protein product [Saccharomyces cerevisiae]
Length = 506
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 126/305 (41%), Gaps = 57/305 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 369
Query: 366 GFYCR 370
G +
Sbjct: 370 GILIK 374
>gi|323346814|gb|EGA81093.1| Atg4p [Saccharomyces cerevisiae Lalvin QA23]
Length = 494
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 126/305 (41%), Gaps = 57/305 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCR 370
G +
Sbjct: 358 GILIK 362
>gi|259149141|emb|CAY82383.1| Atg4p [Saccharomyces cerevisiae EC1118]
Length = 506
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 126/305 (41%), Gaps = 57/305 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 369
Query: 366 GFYCR 370
G +
Sbjct: 370 GILIK 374
>gi|255082892|ref|XP_002504432.1| predicted protein [Micromonas sp. RCC299]
gi|226519700|gb|ACO65690.1| predicted protein [Micromonas sp. RCC299]
Length = 196
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 73/119 (61%), Gaps = 11/119 (9%)
Query: 265 WTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
W P+++LVPLVLGL++ VNPRY+P + PQS+GI+GGKP AS Y VG Q+E YL
Sbjct: 75 WAPLVILVPLVLGLDRCVNPRYVPGIVRMLGLPQSVGILGGKPCASLYFVGAQDEELFYL 134
Query: 324 DPHDVQPVINIGK----------DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
DPH VQ + + + + T TYH + H++ +DPS+ +GFYCR +
Sbjct: 135 DPHTVQLAVPLEQIWGCAQTGSPESGPFPTETYHCRSVLHMNARELDPSMVLGFYCRTR 193
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/45 (57%), Positives = 31/45 (68%)
Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
F SR+ I+YR+GF IG T+D GWGC LRS QML+A AL H
Sbjct: 1 FHSRVWITYRRGFPQIGGGTYTTDAGWGCTLRSGQMLLANALQSH 45
>gi|258566559|ref|XP_002584024.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237907725|gb|EEP82126.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 377
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/233 (33%), Positives = 104/233 (44%), Gaps = 52/233 (22%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 131
A F DF SRI I+YR F I SK T+D GWGCM+
Sbjct: 90 AAFLDDFESRIWITYRSNFPAIPKSKDPNAQQALTFSVRLRSQLLDTRGFTTDTGWGCMI 149
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 190
RS Q L+A ALL +LGR WR+ + + + +L LF D +PFSIH ++ G A
Sbjct: 150 RSGQSLLANALLIQKLGRDWRRGSET---GKEIALLSLFADRPQAPFSIHRFVEHGAAAC 206
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G G W GP A ARC C+ + +YV S D +D
Sbjct: 207 GKHPGEWFGP-------SATARCIDE-----CEHAGLNVYVTSDGSD---------VHED 245
Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
R + G D P L+L+ + LG++ + P Y L+ +PQS+GI G
Sbjct: 246 KFRQIA----GLDDIKPTLILLGVRLGIDSITPVYWDALKAIIQYPQSVGIAG 294
>gi|216963257|gb|ACJ73915.1| autophagy-related 4b variant 3 [Zea mays]
Length = 178
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 50/77 (64%), Positives = 61/77 (79%)
Query: 81 QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQ 140
++E G + ++G A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQ
Sbjct: 99 EEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQ 158
Query: 141 ALLFHRLGRPWRKPLQK 157
AL+FH LGR WRKP +K
Sbjct: 159 ALIFHHLGRSWRKPSEK 175
>gi|390344344|ref|XP_786847.3| PREDICTED: uncharacterized protein LOC581768 [Strongylocentrotus
purpuratus]
Length = 1018
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/145 (39%), Positives = 81/145 (55%), Gaps = 10/145 (6%)
Query: 70 IWLLGVC-HKIAQDEALGDAAGNNGLAE-----FNQDFSSRILISYRKGFDPIGDSKITS 123
IW LG C H+ +D G + + F QDFSSR+ ++YR+ F + S TS
Sbjct: 346 IWFLGKCYHQRPEDPDPERPPGMDSVRSMVIEMFKQDFSSRLWMTYRREFPTLAGSNFTS 405
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDS--ETSPFS 179
D GWGCMLRS QM++A +L+ H LGR W KP + + + +I+ FGD + SPFS
Sbjct: 406 DCGWGCMLRSGQMMLAHSLILHFLGREWNIYKPQTQEMLQFHRQIVRWFGDQPLDMSPFS 465
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMC 204
+H L+ G+ G G W GP ++
Sbjct: 466 VHRLVGIGQNNGKKVGDWYGPSSVA 490
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 70/123 (56%), Gaps = 2/123 (1%)
Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
ID + S ++G W +++++P+ LG ++VNP YI ++ FT LGI+GGKP
Sbjct: 819 IDPSRSRTSTSTEGGKPWCAVVIMIPVRLGGDEVNPVYIRPIQSLFTLESCLGIIGGKPK 878
Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
S + VG QEE I+LDPH Q V+++ D ++H R + + +DPS IGF
Sbjct: 879 HSLFFVGFQEEKLIHLDPHYCQQVVDMKTRDFPL--WSFHCMSPRKMSISKMDPSCTIGF 936
Query: 368 YCR 370
Y R
Sbjct: 937 YIR 939
>gi|148693225|gb|EDL25172.1| autophagy-related 4D (yeast), isoform CRA_a [Mus musculus]
Length = 296
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/209 (30%), Positives = 101/209 (48%), Gaps = 19/209 (9%)
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
DR + I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 31 DRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP-------SVVAHILRKAVE 83
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
C + + VS D + D +R S + A+W +++LVP+ LG E
Sbjct: 84 -SCSEVSRLVVYVSQDCTVYKA--------DVARLLS-WPDPTAEWKSVVILVPVRLGGE 133
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
+NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP +++ +
Sbjct: 134 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQPSF 193
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+ ++H R + +DPS +GFY
Sbjct: 194 PLE--SFHCTSPRKMAFAKMDPSCTVGFY 220
>gi|37360148|dbj|BAC98052.1| mKIAA0943 protein [Mus musculus]
gi|148707989|gb|EDL39936.1| autophagy-related 4B (yeast), isoform CRA_d [Mus musculus]
Length = 266
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 74/129 (57%), Gaps = 6/129 (4%)
Query: 250 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 71 DSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 130
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
GKP ++ Y +G E IYLDPH QP + + D S + + + +DPS+
Sbjct: 131 GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPSRMGIGELDPSI 190
Query: 364 AIGFYCRDK 372
A+GF+C+ +
Sbjct: 191 AVGFFCKKE 199
>gi|167526339|ref|XP_001747503.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773949|gb|EDQ87583.1| predicted protein [Monosiga brevicollis MX1]
Length = 355
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 123/278 (44%), Gaps = 29/278 (10%)
Query: 102 SSRILISYRKGFDPIGDS-KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
S+ + +YR IGDS + +D GWGC LR QM+V +AL R + K L P +
Sbjct: 52 SAFLWFTYRNSEYAIGDSPRHKTDRGWGCTLRVGQMIVGEALQRCHCPRDYDK-LSYPSE 110
Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 220
+ IL F D S+H + K G AG W P + Q A +
Sbjct: 111 AARMSILKEFEDRPDRVLSVHAMAMQSKFVGKRAGQWHTPTDVAHVLRLAVNEQEA---M 167
Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
G Q ++V +V +DD + +F +A LL VPL LG++
Sbjct: 168 GLQ-----VHVAMD---------SMVVLDDLRK---LFRADRA----TLLFVPLRLGIDI 206
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
V IP ++ F P +LGI+GG+PGA+ Y +G + + + LDPH Q + G D
Sbjct: 207 VQAEMIPAVKRFFHSPSALGIMGGRPGAAHYFIGYMDHNLLLLDPHTTQDPLRAGSQDAL 266
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+ + LD +DP++ + F D+ L F
Sbjct: 267 VSCRCSRPML---LDLDKVDPTMCLAFLLTDEESLQRF 301
>gi|342186623|emb|CCC96110.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 388
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 73/230 (31%), Positives = 109/230 (47%), Gaps = 33/230 (14%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
+ SYR+ F+P+ + TSDVGWGC +R+ QM++A A + +R G D V
Sbjct: 94 LYFSYRRQFEPLRNGA-TSDVGWGCTIRACQMMLAWAFMRYRNGG------SVTMDDNVV 146
Query: 165 EIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
+ L LF D T+PF IH + G +G+ G W GP M + AL R+ G
Sbjct: 147 DSLKEFTQRLFYDVPTAPFGIHAMTNEGVRHGVTCGMWFGPTPMAKVIGALNEAYRSSGG 206
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
G + L + D + G VV S+H ++LL+P+ LG +
Sbjct: 207 EGPEVLVAS--------DRQIGVQDVVVRLQRSQH-------------VVLLIPVKLGPQ 245
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
V+ Y L+ F S+G VGG+ ++ + G Q + I+LDPH VQ
Sbjct: 246 TVSVTYANALKRFFEMGSSIGAVGGEKNSAYFFFGYQGDKIIHLDPHYVQ 295
>gi|401624007|gb|EJS42084.1| atg4p [Saccharomyces arboricola H-6]
Length = 494
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 128/305 (41%), Gaps = 57/305 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 126
EF D SR+ +YR F PI G S ++ +D+G
Sbjct: 85 EFLLDVRSRVNFTYRTRFIPIPRAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R +K RE +I+ F D+ +PFSIHN +
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVDNEKSLKRES-KIVTWFNDTPEAPFSIHNFVST 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L C + V SG D +
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIYGFPE-----CGITDCVVSVSSG--DIYQNEVEK 256
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ +++ + IL L+ + LG+ VN Y ++ +S+GI GG+
Sbjct: 257 IYVENPD-------------SIILFLLGVKLGINAVNESYRESICGILNSARSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q +Y DPH QP + E+ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNQFLYFDPHIPQPAVE------ESFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCR 370
G +
Sbjct: 358 GVLIK 362
>gi|403216261|emb|CCK70758.1| hypothetical protein KNAG_0F00890 [Kazachstania naganishii CBS
8797]
Length = 448
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 129/305 (42%), Gaps = 59/305 (19%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------------IT 122
N +F +D +R+ +YR F PI S
Sbjct: 38 NEKMQFYRDVCTRLNFTYRTKFVPISRSPDGPSPISFQLMIRDGPLSVIENALLHPDCFN 97
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D+GWGCM+R+ Q L+ AL R GR +R D +I+ F D+ +PFS+HN
Sbjct: 98 TDIGWGCMIRTGQSLLGNALQRLRHGREFRVTESTHDD----DIIQWFKDTPDAPFSLHN 153
Query: 183 LLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
++ G + + G W GP A RS ++L C + G+ I VS + ++
Sbjct: 154 FVKKGVELADMKPGQWFGPAATSRSIQSLI-CNFPQCGID-----HCIVSVSSADIYKQD 207
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
+ D S +L+L + LG+ VN Y +R S+GI
Sbjct: 208 VEDMFDADPDSN--------------LLILFGVKLGVSAVNASYWEDIRRLLNSKFSVGI 253
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
GG+P +S Y G Q + +Y DPH QP + DD A +T HS + L +DP
Sbjct: 254 AGGRPSSSLYFFGYQNQELLYFDPHTPQPSL---IDD--AAFNTCHSIEFGKLELRDMDP 308
Query: 362 SLAIG 366
S+ IG
Sbjct: 309 SMLIG 313
>gi|183230788|ref|XP_001913481.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169802747|gb|EDS89733.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449704540|gb|EMD44766.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 330
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 121/273 (44%), Gaps = 36/273 (13%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 161
I I+YRK I + T+D GWGCM+RS QM +AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCINTERNI 96
Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARCQRAETG 219
+++ I++LFGDS S FSIH L+ G+ G W GP +A + E + + T
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEMRVFRTR 156
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
L I G I D + P ++ VPL LG E
Sbjct: 157 GYVAKLGSII-----------GSKIEELIKDG-----------GGFNPCIIFVPLRLGPE 194
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
+ P L+ F PQ +G++GGKPG + Y + +LDPH Q I D+
Sbjct: 195 SPENEFKPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAI-----DM 249
Query: 340 EADTS--TYHSDVIRHIHLDSIDPSLAIGFYCR 370
+ D S +Y + + +DPS+++ F +
Sbjct: 250 KGDWSYQSYFCKDNKSMLYSKMDPSISLVFLVK 282
>gi|145553267|ref|XP_001462308.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430147|emb|CAK94935.1| unnamed protein product [Paramecium tetraurelia]
Length = 389
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 125/305 (40%), Gaps = 43/305 (14%)
Query: 87 DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-LFH 145
D A + + + F I SYR + S +TSD GWGCMLR QM + Q + F+
Sbjct: 47 DLAVDQKMEKLKSLFEGTIWFSYRSKILQLQYSTLTSDTGWGCMLRVGQMAMCQQIKYFY 106
Query: 146 RLGRPWRKPLQKPFDREYVEILHLFGDSE-------------------TSPFSIHNLL-Q 185
L +E E++ F D++ SPFSI ++ Q
Sbjct: 107 NLSSS----------QELTELIQQFADNDEEELSKFMDRNDGDQTIQYKSPFSIQKIVVQ 156
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED---GERGG 242
+ G W P + + L R + + L +++S + GG
Sbjct: 157 TKLELQKSPGEWYKPNDILFVLKYLFRYSKYQKNLRMHINHENAFILSDVISLMFNKNGG 216
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
D KGQ D + + + +GL+ N Y+ L T+PQ GI+
Sbjct: 217 -------DEEWLKEQIEKGQNDEFGVSIFILTRIGLDTCNQEYLKVLNDIMTYPQFQGIL 269
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GG P + YI+G IYLDPH VQ N ++E D S+Y I+ I + +DPS
Sbjct: 270 GGFPNKALYILGRVGNYYIYLDPHYVQNAQNY--QEMENDRSSYTCQSIQLIDSNQLDPS 327
Query: 363 LAIGF 367
+AI F
Sbjct: 328 MAISF 332
>gi|410075557|ref|XP_003955361.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
gi|372461943|emb|CCF56226.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
Length = 463
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 129/312 (41%), Gaps = 67/312 (21%)
Query: 92 NGLAEFNQDF----SSRILISYRKGFDPIGDSK--------------------------- 120
N + NQDF +SR+ +YR F PI S
Sbjct: 52 NRNSNLNQDFLSDVNSRLAFTYRTKFQPILRSSEGPSPLNFRMIFRDNPINTLENVINNP 111
Query: 121 --ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
+D+GWGCM+R+ Q L+ AL +LGR +R L + EI+ F D+ PF
Sbjct: 112 DCFNTDIGWGCMIRTGQSLLGNALQLAKLGRHFR--LDNKMGIKDDEIISWFRDTTQEPF 169
Query: 179 SIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
SIH ++ G K G W GP A S ++L + E G+ + V SGD
Sbjct: 170 SIHKFVEKGNKLANKKPGEWFGPAATSISIQSLIE-EFPECGID----KCLVSVSSGD-- 222
Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
+D R +F + + IL L+ + LGL+ VN Y +
Sbjct: 223 ---------IFEDDVRE--IFEENMD--SKILFLMGVKLGLDAVNSFYWEDILNILDSKF 269
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY---HSDVIRHI 354
S+GI GG+P +S Y G Q +Y DPH QP + D S Y H+ +
Sbjct: 270 SVGIAGGRPSSSLYFFGHQGNELLYFDPHRPQPSL--------VDPSVYETCHTTNFGKL 321
Query: 355 HLDSIDPSLAIG 366
+ +DPS+ IG
Sbjct: 322 DIKDMDPSMLIG 333
>gi|440297742|gb|ELP90383.1| cysteine protease atg4, putative [Entamoeba invadens IP1]
Length = 330
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 78/279 (27%), Positives = 124/279 (44%), Gaps = 29/279 (10%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDR 161
I ++YRK + + TSD GWGCM+RS QM +AQ+ + +G W + Q ++
Sbjct: 38 IWVTYRKNMKELPGGR-TSDSGWGCMIRSMQMALAQSFVSLVMGNSWKFTKTGFQVERNK 96
Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 220
++ I++LFGD S FSIHNL+ G+ G W GP S+ + T
Sbjct: 97 FHLRCIINLFGDGPGSLFSIHNLISRSTTRGVGDGKWWGP-----SFASEIAADHLNT-- 149
Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
I+V R G V S+ + P ++ VPL LG
Sbjct: 150 --------IHVFRTRGYVARLGRIV------KPDILDISEDNGNILPTIIFVPLRLGPVN 195
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+ P L+ F PQ +G+VGGKP + + YLDPH Q +++ D
Sbjct: 196 AEEDFRPILKKVFDIPQCVGMVGGKPNLAFFFHTFDGNLLYYLDPHTTQNAVSM---DGG 252
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+Y + ++ + ++DPS+++ F ++K FE
Sbjct: 253 WSAESYFCNDVKSMKYKNLDPSVSLLFLIKNKDDFNKFE 291
>gi|340059839|emb|CCC54236.1| putative peptidase [Trypanosoma vivax Y486]
Length = 354
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 108/226 (47%), Gaps = 25/226 (11%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA-LLFHRLGRPWRKPLQKPFDREY 163
+ SYR GF P+ + T+DV WGC++R++QML+AQA + F G + RE
Sbjct: 69 LYFSYRCGFTPLSNGS-TTDVAWGCVVRAAQMLLAQAHMRFFNSGHAFVDGSALQILREK 127
Query: 164 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
V+ LF D ++PF IH + + YG+A G W G ++ +L + G G
Sbjct: 128 VQ--PLFLDDPSAPFGIHAMTSEAEKYGVACGQWFGMTPAAKTIASLCQQHSLRGGNG-- 183
Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 283
P + V D E V + SR ++LL+P VLGL++++
Sbjct: 184 --PAVLVFV----DREVSALKVRDLLSHSRQ-------------VVLLIPAVLGLDRISV 224
Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
+Y L +G++GG+ ++ Y VG Q + IYLDPH Q
Sbjct: 225 KYSKMLIRCLEMESCIGVIGGRKSSALYFVGHQSNNIIYLDPHRAQ 270
>gi|223590151|sp|A5DEF7.2|ATG4_PICGU RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|190345638|gb|EDK37561.2| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
6260]
Length = 402
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 91/317 (28%), Positives = 133/317 (41%), Gaps = 89/317 (28%)
Query: 93 GLAEFNQDFSSRILISYRKGFDPI---------------------------------GDS 119
G +E + R +SYR GF+PI +
Sbjct: 75 GDSEVQKQVKKRYWMSYRSGFEPIKKHEDGPSPLSFVQSMIFNKNVGNTFANIHSLVDND 134
Query: 120 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 178
T+DVGWGCM+R+SQ ++A A+ DR E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177
Query: 179 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 235
S+HN ++ L G W GP A S + L + + T ++P+++ V SGD
Sbjct: 178 SLHNFVKVASDSPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
DD Q P+LLL+PL LG++ VN Y +L
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQS GI GGKP +S Y G Q S +YLDPH Q V A +YHS + +
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV--------SAGVGSYHSSSYQKLD 322
Query: 356 LDSIDPSLAIGFYCRDK 372
+ +DPS+ G ++
Sbjct: 323 ISDMDPSMMAGIVLKNN 339
>gi|90080692|dbj|BAE89827.1| unnamed protein product [Macaca fascicularis]
Length = 263
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/129 (38%), Positives = 73/129 (56%), Gaps = 6/129 (4%)
Query: 250 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 68 DSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 127
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
GKP ++ Y VG E IYLDPH QP + D S + + + +DPS+
Sbjct: 128 GKPNSAHYFVGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFHCQHPPCRMSIAELDPSI 187
Query: 364 AIGFYCRDK 372
A+GF+C+ +
Sbjct: 188 AVGFFCKTE 196
>gi|407408842|gb|EKF32115.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
Length = 357
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 73/278 (26%), Positives = 124/278 (44%), Gaps = 35/278 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 161
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGNERLQELRAR 132
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAITSEGTKHGVKCGEWFGPTPIAKTLNAL----------- 177
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
MA Y+ +G E G V+ + + ++LL+P++LG+ +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEQVKELLRQSMHVVLLIPVMLGIRVI 227
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP + E
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHRVQPAFTSSGNSGEL 287
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+ R + S D S+ +GFY FE
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSFAVFE 319
>gi|407043540|gb|EKE42005.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 330
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 83/279 (29%), Positives = 124/279 (44%), Gaps = 37/279 (13%)
Query: 100 DFSSR-ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---L 155
DF+ I I+YRK I + T+D GWGCM+RS QM +AQ L LG W+ +
Sbjct: 33 DFARHTIWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCI 90
Query: 156 QKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARC 213
+ +++ I++LFGDS S FSIH L+ G+ G W GP +A + E +
Sbjct: 91 NTERNIFHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEM 150
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
+ T L I G I D + P ++ VP
Sbjct: 151 RVFRTRGYVAKLGSII-----------GSKIEELIKDG-----------GGFNPCIIFVP 188
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L LG E + P L+ F PQ +G++GGKPG + Y + +LDPH Q I
Sbjct: 189 LRLGPESPENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGINLYFLDPHTTQNAI- 247
Query: 334 IGKDDLEADTS--TYHSDVIRHIHLDSIDPSLAIGFYCR 370
D++ D S +Y + + +DPS+++ F +
Sbjct: 248 ----DMKGDWSYQSYFCKDNKSMLYSKMDPSISLVFLVK 282
>gi|302657364|ref|XP_003020406.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
verrucosum HKI 0517]
gi|291184236|gb|EFE39788.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
verrucosum HKI 0517]
Length = 398
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 104/235 (44%), Gaps = 48/235 (20%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK--------------------------ITSDVGWGC 129
+F DF S++ I+YR F PI + TSD GWGC
Sbjct: 185 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSSISLGVRLRSQLIDTQGFTSDTGWGC 244
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-K 188
M+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 245 MIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGAT 301
Query: 189 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
A G G W GP A + +AL + + GL G + E+ V C
Sbjct: 302 ACGKCPGEWFGPSAASQCIQALVKSN-PQVGL------RVCITSDGSDIYEKQFKEVACD 354
Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
+ P L+L+ + LG+++V P Y +L+ FPQS+GI G
Sbjct: 355 ESG-----------GGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAG 398
>gi|146420060|ref|XP_001485988.1| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
6260]
Length = 402
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/317 (28%), Positives = 133/317 (41%), Gaps = 89/317 (28%)
Query: 93 GLAEFNQDFSSRILISYRKGFDPI---------------------------------GDS 119
G E + R +SYR GF+PI +
Sbjct: 75 GDLEVQKQVKKRYWMSYRLGFEPIKKHEDGPLPLSFVQSMIFNKNVGNTFANIHSLVDND 134
Query: 120 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 178
T+DVGWGCM+R+SQ ++A A+ DR E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177
Query: 179 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 235
S+HN ++ L G W GP A S + L + + T ++P+++ V SGD
Sbjct: 178 SLHNFVKVASDLPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
DD Q P+LLL+PL LG++ VN Y +L
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQS GI GGKP +S Y G Q S +YLDPH Q V A +YHS + + +
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV--------SAGVGSYHSSLYQKLD 322
Query: 356 LDSIDPSLAIGFYCRDK 372
+ +DPS+ G ++
Sbjct: 323 ISDMDPSMMAGIVLKNN 339
>gi|71415152|ref|XP_809652.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70874068|gb|EAN87801.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
Length = 357
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 125/278 (44%), Gaps = 35/278 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 161
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 132
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
MA Y+ +G E G V+ + + T ++LL+P++LG+ +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 227
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP E
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 287
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+ R + S D S+ +GFY L FE
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSLALFE 319
>gi|207341865|gb|EDZ69806.1| YNL223Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 371
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 124/302 (41%), Gaps = 57/302 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLI------YGFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DP ++
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPRCSL 369
Query: 366 GF 367
F
Sbjct: 370 VF 371
>gi|111154179|gb|ABH07411.1| autophagin-2 [Trypanosoma cruzi]
Length = 351
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 125/278 (44%), Gaps = 35/278 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 161
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 68 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 126
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 127 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 171
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
MA Y+ +G E G V+ + + T ++LL+P++LG+ +
Sbjct: 172 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 221
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP E
Sbjct: 222 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 281
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+ R + S D S+ +GFY L FE
Sbjct: 282 TCAR------RVLPTTSYDTSMTLGFYISSLDSLALFE 313
>gi|365758760|gb|EHN00587.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 485
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/301 (27%), Positives = 124/301 (41%), Gaps = 57/301 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 76 EFLLDVRSRVNFTYRTRFVPIARAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 135
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R F RE I++ F D+ +PFS+HN +
Sbjct: 136 WGCMIRTGQSLLGNALQILHLGRDFRVDEDDDFRRE-SRIVNWFNDTPEAPFSLHNFVST 194
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS + L E G+ + V SG D
Sbjct: 195 GTELSDKRPGEWFGPAATARSIQYLIY-GFPECGINA----CIVSVSSG--DIYENEVEE 247
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
V +D+ + + IL L+ + LG+ VN Y ++ S+GI GG+
Sbjct: 248 VFVDNPN-------------SSILFLLGVKLGINAVNESYRESICGILNSAWSVGIAGGR 294
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ ++ H+ + L +DPS+ I
Sbjct: 295 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVNSCHTSKFGRLQLSEMDPSMLI 348
Query: 366 G 366
G
Sbjct: 349 G 349
>gi|407848120|gb|EKG03593.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
Clan CA, family C54, putative [Trypanosoma cruzi]
Length = 357
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/278 (26%), Positives = 125/278 (44%), Gaps = 35/278 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG---RPWRKPLQKPFDR 161
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G R + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPRIGSERLQELRAR 132
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
MA Y+ +G E G ++ + + T ++LL+P++LG+ +
Sbjct: 178 -----MASYLATGGE-----GPVILAFPERQIFLEEVKELLRQSTHVVLLIPVMLGICVI 227
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP E
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 287
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+ R + S D S+ +GFY L FE
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSLSVFE 319
>gi|336368847|gb|EGN97189.1| cysteine protease required for autophagy [Serpula lacrymans var.
lacrymans S7.3]
Length = 873
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 105/350 (30%), Positives = 144/350 (41%), Gaps = 76/350 (21%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 121
G+N F DF+SRI ++YR F PI DS +
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350
Query: 122 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRP-WRKPLQKPFDRE---YVEILHLFG 171
TSD GWGCMLR+ Q L+A ALL LGR WR+P + YV+I+ F
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRADWRRPPYPVHTTDYATYVQIITWFF 410
Query: 172 D--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI 229
D S SPFS+H + AGK G G W GP + + L E GLG +
Sbjct: 411 DTPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGV 469
Query: 230 YVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 289
S + A I RH V G+A +++L+ + LGL+ VNP Y T+
Sbjct: 470 IFQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTI 520
Query: 290 RLT-----------FTFPQSLGIVGGKPGASTYIV----------GVQEESAIYLDPHDV 328
+++ T P + G P AS I G E + LDP
Sbjct: 521 KVSIRTLRPYRWILMTVPYTSGFNASLP-ASPEISSDMDVRELGWGDSEGAGEALDPMAE 579
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
V D L T+H D +R + + +DPS+ +GF C+D+ F
Sbjct: 580 HYVNAYSPDQLR----TFHCDRVRKMPMSGLDPSMLLGFLCKDENDWFDF 625
>gi|154419947|ref|XP_001582989.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121917228|gb|EAY22003.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 284
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 113/261 (43%), Gaps = 39/261 (14%)
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
YR + +S +T+D GWGC RS+Q L+ Q +L +L R +R + F + V L
Sbjct: 25 YRYNLSDLANSLLTTDKGWGCCFRSTQGLLCQYIL--KLHRKFRSLYDQVFGQN-VNPLD 81
Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
LF D ++PF I NL + A GL G W P M A + L C
Sbjct: 82 LFLDIPSAPFGIQNLTKNAFAIGLPVGEWAKPSIM----AATIKLIFDTLNLSC------ 131
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
++S D + +H P L+L+P + GL K++ Y+
Sbjct: 132 --IISQDLTLDSNDI---------KHTKY---------PALILIPSLFGLSKMDDSYLSF 171
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L L SLG V G+ ++ Y VG E Y DPH + + + ++
Sbjct: 172 LLLCLCIESSLGFVSGQNASAYYFVGFDLEDFYYFDPHVTKEAV------VSPPYDSFFD 225
Query: 349 DVIRHIHLDSIDPSLAIGFYC 369
++ + +SI+PS+ +GFYC
Sbjct: 226 LELKSMKKESINPSVLLGFYC 246
>gi|123407417|ref|XP_001303004.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121884346|gb|EAX90074.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 298
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 122/267 (45%), Gaps = 42/267 (15%)
Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVE 165
+Y K F P+ T+D WGC +RS+Q L+ Q + L+ LG R P + +Y
Sbjct: 28 TYHKNFAPL-QGGFTTDKNWGCCIRSAQGLIMQFITKLYKHLGDDIRNIF--PTNSKY-- 82
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
LF D SPF + ++ ++YG+ G WV P + + + R
Sbjct: 83 --ELFYDLPHSPFGLPHICAELQSYGVMPGEWVKPSLLAPVIKEIMNFFRI--------- 131
Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
PVV + V ++ + P+LLL L+LG E +Y
Sbjct: 132 ------------------PVVIAEHGCLSREVLNEALSHNIPVLLLFTLMLGYENFELKY 173
Query: 286 IPTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
+P L+LT + QS+G+VGG+ G + +IVG Q+E +Y DPHDV +I K D +
Sbjct: 174 LPFLKLTLSLIYQSVGVVGGQQGKAYFIVGHQKEKLLYFDPHDVNE--SITKID---QIN 228
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRD 371
++ + D++ S+ +GF+ +
Sbjct: 229 QLFKPPLKVMPADTLSSSMLVGFFITN 255
>gi|363754893|ref|XP_003647662.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891299|gb|AET40845.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
DBVPG#7215]
Length = 469
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 90/303 (29%), Positives = 133/303 (43%), Gaps = 52/303 (17%)
Query: 96 EFNQDFSSRILISYRKGFDPI-----GDSKI------------------------TSDVG 126
EF +D +SR+ +YR F PI G S + +D+G
Sbjct: 62 EFLKDVNSRLHFTYRTRFAPIPRHIDGPSPMRISILLRDNPLNVIENVLNNLDCFQTDIG 121
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
WGCM+R+ Q L+A AL LGR +R + ++I+ F D+ PFS+H +Q
Sbjct: 122 WGCMIRTGQSLLANALQLANLGRDFRISGSDSDINEVEMKIIRWFEDNPKHPFSLHKFVQ 181
Query: 186 AG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 244
G K G G W GP A+ RS +L C ++S D +
Sbjct: 182 EGYKLSGKKPGEWFGPSAISRSIRSLVMKFPGSGIDHC--------IISTD-------SA 226
Query: 245 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
V +D+ K LLL+ + LG++ N Y ++ + QS+GI GG
Sbjct: 227 DVYLDEIDPLFRANPKANV-----LLLLGVRLGVDFTNEYYWDDIKNILSSSQSVGISGG 281
Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
+P +S Y G Q + YLDPH VQ + + + D E + H IHL +IDPS+
Sbjct: 282 RPSSSLYFFGYQGDYLFYLDPHKVQLNLALYESD-EERFHSVHPQTFNKIHLSAIDPSML 340
Query: 365 IGF 367
+GF
Sbjct: 341 LGF 343
>gi|167521501|ref|XP_001745089.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776703|gb|EDQ90322.1| predicted protein [Monosiga brevicollis MX1]
Length = 392
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 79/280 (28%), Positives = 128/280 (45%), Gaps = 48/280 (17%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
+ D ++RI +YRK F P+ S+ T+DVGWGCMLR QM++A L+ +
Sbjct: 119 QLEDDVATRIWFTYRKDFPPLPSSRRTTDVGWGCMLRCGQMILATTLM----------AV 168
Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
+P + HL +++ N L+AG+ G ++ VG + + ALA+
Sbjct: 169 LQP------RVHHLLK------YTMENHHLKAGRFQGPSS---VGSALLHQVPSALAQLN 213
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 274
+ + + + Y S + I D R +GQA++ PI+L++PL
Sbjct: 214 QFRD----EEVKLRTYFASD----------TLVILDQLRP----EEGQAEFEPIMLVLPL 255
Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
LG+EK+ P+Y L+L P +G +GG + YI G Q LDPH +
Sbjct: 256 RLGIEKIGPQYHARLQLLLRQPWCMGFIGGHDKRAMYIFGYQGHQYFGLDPHRCSAAVAQ 315
Query: 335 GKDDLEAD----TSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
+L ++H+ + I D +DPSLA+ R
Sbjct: 316 STAELRDRWVEVRDSFHTSKLSGIERDDLDPSLAVFLLAR 355
>gi|145526665|ref|XP_001449138.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416715|emb|CAK81741.1| unnamed protein product [Paramecium tetraurelia]
Length = 406
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 84/324 (25%), Positives = 135/324 (41%), Gaps = 59/324 (18%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
N + + QD I I+YR+ F P+ S SD GWGCMLR QM +AQ L H
Sbjct: 57 NKIKQLVQD---TIWITYRRNFPPLYQSNYISDTGWGCMLRVGQMAMAQMLKKHLKNHGD 113
Query: 152 RKPLQKPFDREYVEILHLFGDSETS----------------------PFSIHNL-LQAGK 188
++ D +Y IL F D+++ PFSI + A K
Sbjct: 114 KR------DEDYDNILLAFADNDSQECKEFIEFQNKKEKQKVHNFICPFSIQKIAYLAKK 167
Query: 189 AYGLAAGSWVGPYAM------------CRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
+ L G W P + R+ E L ++ L L ++ + +
Sbjct: 168 EFNLDPGEWYKPNYILFLLEELHNTIPIRASENLKLSVFNDSCLFLDQLMNRMFDIKFET 227
Query: 237 DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
D + +++ + SK + + V +GL++ N +Y+ L P
Sbjct: 228 DKD--------LEEQLEKTQLKSKN-----SLAIFVLTRIGLDEPNQKYLKVLDELMELP 274
Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK--DDLEADTSTYHSDVIRHI 354
GIVGG P + YI+G + IYLDPH VQ N G+ ++ + ++Y I +
Sbjct: 275 YFQGIVGGTPKRAFYILGRINDHYIYLDPHYVQEAENKGQIIENKMFNRTSYSCKYIHLL 334
Query: 355 HLDSIDPSLAIGFYCRDKGLLVTF 378
+ +D S+ + +Y R+K L+ F
Sbjct: 335 NQKHVDTSMGLSYYIRNKSELLQF 358
>gi|330840249|ref|XP_003292131.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
gi|325077656|gb|EGC31355.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
Length = 603
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 74/126 (58%), Gaps = 1/126 (0%)
Query: 87 DAAGNNGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
D G + + EF +DF++R+L +YR+GF I +++ +D GWGCMLRS QML++ LL H
Sbjct: 129 DIPGQSFIKEFLEDFTTRVLWFTYRQGFPFIDNTQYDNDCGWGCMLRSGQMLLSNLLLHH 188
Query: 146 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
LG W+K Y I+ +F D ++PFSIHN+ G+ G G W P + +
Sbjct: 189 ALGDDWKKSSNSTHPDVYNNIISMFLDKPSAPFSIHNIALEGQTLGKNIGEWFAPSIISQ 248
Query: 206 SWEALA 211
+ ++L
Sbjct: 249 AIKSLV 254
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 69/149 (46%), Gaps = 37/149 (24%)
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+L+L+P+ LGL+ +N Y +L F FPQ+LG+VGGKP AS Y + VQ+++ YLDPH
Sbjct: 371 PLLILIPMRLGLDGLNSIYYQSLLEIFKFPQNLGVVGGKPRASLYFIAVQDDNLFYLDPH 430
Query: 327 DVQPVINIGKDDLEAD-------------------------------------TSTYHSD 349
VQ I+I + E +T+
Sbjct: 431 TVQNHIDINNSNGEPSNFSFSSSPSSSNINIINTNNNNNNNNNNDKNNNNSFPVNTFFCS 490
Query: 350 VIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+ H+ +DPSL + F+C+ + F
Sbjct: 491 QTKRTHVSEVDPSLVVAFFCKSRSDFDDF 519
>gi|119623099|gb|EAX02694.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_e
[Homo sapiens]
Length = 231
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 102/226 (45%), Gaps = 61/226 (26%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + +
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEK---- 133
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
MCR + +S D G+R + +
Sbjct: 134 -------------MCR-----------------------VLPLSADTAGDRPPDSLTASN 157
Query: 250 DA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
+ S +CS W P+LL+VPL LG+ ++NP Y+ ++T
Sbjct: 158 QSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKVT 196
>gi|145510316|ref|XP_001441091.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408330|emb|CAK73694.1| unnamed protein product [Paramecium tetraurelia]
Length = 392
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 134/311 (43%), Gaps = 41/311 (13%)
Query: 86 GDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
DA + + Q S I SYRK S +TSD GWGCM+R +QM +AQ +
Sbjct: 46 NDADIEQRIEKVKQTCSKIIWFSYRKNIPKFQVSSLTSDTGWGCMIRVAQMALAQII--- 102
Query: 146 RLGRPWRKPLQ-----KPF----DREYVEILHLFGDSET----SPFSIHNLLQAGKA-YG 191
R ++KP Q + F D E + + F ++ +PFSI ++ K
Sbjct: 103 RYYNYFKKPEQLIVLIRHFIDDDDNELTDFIQQFHKNQNQYYHAPFSIQKIVHYAKVELK 162
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----------VSGDEDGER 240
G W + ++ + L + + SL M IY+ + + +
Sbjct: 163 KEPGDWYKSDEILQTLDYLFKYSQY-------SLNMEIYINYDCAFILQDAIQQMFNQQE 215
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
G + + + +++ + F D I + +P +GL+ +N Y+ L P G
Sbjct: 216 GNE--IWLKERAKNNNQFDL--QDHKGICIFLPTRIGLQNINKDYLEVLNQIIALPYFQG 271
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
++GG + Y VG ++ IYLDPH VQ N DDL + ++Y I+ IH ID
Sbjct: 272 MIGGVSKRALYFVGRIQDYLIYLDPHFVQNAQNF--DDLSKNQASYTCQNIQLIHNSLID 329
Query: 361 PSLAIGFYCRD 371
PS+ + R+
Sbjct: 330 PSIVVCLCIRN 340
>gi|71043632|ref|NP_001020882.1| cysteine protease ATG4B [Rattus norvegicus]
gi|68533688|gb|AAH98833.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Rattus
norvegicus]
Length = 224
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 72/129 (55%), Gaps = 6/129 (4%)
Query: 250 DASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
++ RHC+ G W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 29 ESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 88
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
GKP ++ Y +G E IYLDPH QP + + D S + + + +DPS+
Sbjct: 89 GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPCRMGIGELDPSI 148
Query: 364 AIGFYCRDK 372
A+GF+C+ +
Sbjct: 149 AVGFFCKTE 157
>gi|123479730|ref|XP_001323022.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121905878|gb|EAY10799.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 284
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 69/263 (26%), Positives = 117/263 (44%), Gaps = 39/263 (14%)
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
YR F I +S ++ D GWGC RSSQ LV Q +L RL + + F + L
Sbjct: 25 YRNNFQAIENSTLSCDSGWGCCFRSSQGLVCQYIL--RLHKNFPDLYNSTFGID-KNPLD 81
Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
LF D +PF I N++ + GL G+W P + +++++ + L C
Sbjct: 82 LFLDIPEAPFGIQNIVTHANSLGLPIGNWAKPSIIASAYKSIFQ----SLHLNC------ 131
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
+V D ++ + ++ P+L+L+P + GLEK+ YI
Sbjct: 132 --IVPQDSTF------------------IYEELESTNYPVLILIPGLFGLEKIEKPYISF 171
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
+ L+ SLG V G ++ Y +G + Y DPH + + D +
Sbjct: 172 IFLSLCMNSSLGFVSGHNDSAFYFIGFDSDYFYYFDPHVTKQALTGPPYDSLFELK---- 227
Query: 349 DVIRHIHLDSIDPSLAIGFYCRD 371
++ + +++I+PS+ +GFYC D
Sbjct: 228 --LKSMKIENINPSVLLGFYCDD 248
>gi|367014015|ref|XP_003681507.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
gi|359749168|emb|CCE92296.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
Length = 460
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/247 (32%), Positives = 118/247 (47%), Gaps = 27/247 (10%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
+D+GWGCM+R+ Q L+ AL LGR +R + + D+E +I+ F D+ + FSI
Sbjct: 114 FNTDIGWGCMIRTGQSLLGNALQIANLGRDFR--VNQGKDQEEYKIIDWFADTPQAHFSI 171
Query: 181 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
HN + G K G W GP A RS + L Q + G+ I V SGD
Sbjct: 172 HNFVSQGLKLSNKKPGEWFGPAATSRSIQCLVE-QFPDCGID----KCLISVSSGD---- 222
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
+D R +F+ Q + ILLL+ + LG+ VN Y ++ T S+
Sbjct: 223 -------VFEDEVRE--IFA--QKPQSRILLLLGVKLGVNAVNEYYWDDVKKTLGSKFSV 271
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 359
GI GG+P +S Y +G Q IY DPH QP + + + T H+ + L +
Sbjct: 272 GIAGGRPSSSLYFMGFQGNELIYFDPHTPQPSLQTSANFYD----TCHALNFGKLLLSDL 327
Query: 360 DPSLAIG 366
DPS+ IG
Sbjct: 328 DPSMLIG 334
>gi|255722127|ref|XP_002545998.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240136487|gb|EER36040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 444
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 136/313 (43%), Gaps = 71/313 (22%)
Query: 103 SRILISYRKGFDPIGDSK----------------------------------ITSDVGWG 128
SR+ +SYR GFDPI ++ TSD GWG
Sbjct: 84 SRLWLSYRCGFDPIPKAEDGPQPIQFFPSIIFNKTTIYSNFANLKSLFDKENFTSDAGWG 143
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AG 187
CM+R+SQ L+A LL P D + +++ LF D+++SPFSIHN ++ AG
Sbjct: 144 CMIRTSQNLLANTLL-----------QLLPPDSKQ-DVIGLFQDNQSSPFSIHNFIKVAG 191
Query: 188 KA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
++ + G W GP A S + L + + G + + I S DGE
Sbjct: 192 ESPLQVKPGQWFGPNAASLSIKRLTDTLQDKEIKGVKYPKVFISENSDLYDGEINEI--- 248
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
+ +G++ +L+L P+ LG++KVN Y ++ S GI GGKP
Sbjct: 249 ----------LSEEGRS----VLVLFPIRLGIDKVNSYYYDSIFQVLKSKFSCGISGGKP 294
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
+S Y +G IY DPH Q V N + +YH+ +++ +DPS+ IG
Sbjct: 295 SSSFYFLGYDNSDLIYFDPHLPQLVEN------PINIESYHTRNYNRLNISLLDPSMMIG 348
Query: 367 FYCRDKGLLVTFE 379
R + F+
Sbjct: 349 ILLRSMDDYLEFK 361
>gi|330840629|ref|XP_003292315.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
gi|325077457|gb|EGC31168.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
Length = 465
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 70/109 (64%), Gaps = 3/109 (2%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++++PL LG++++N YI L+ + PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 217 WKSLIIMIPLKLGVDRINTSYIRKLKSILSIPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 276
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
PH VQ ++ ++ + T+ + + + +IDPSL++GFYC+DK
Sbjct: 277 PHFVQDTVDPSSNNY---SETFCGCIPQKMSFSNIDPSLSVGFYCKDKS 322
>gi|328868883|gb|EGG17261.1| autophagy protein 4 [Dictyostelium fasciculatum]
Length = 616
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 66/110 (60%), Gaps = 3/110 (2%)
Query: 261 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
Q++W +++LVP+ LGL+K+N Y ++ P S+G++GGKP S Y VG Q+E
Sbjct: 426 NQSNWKSLIILVPVKLGLDKLNEIYFSGIKAMLQMPSSIGLIGGKPKQSFYFVGFQDEHI 485
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
IYLDPH V I+ + ++YH + + +H IDPS+A GFYC
Sbjct: 486 IYLDPHFVHDTIHPFDSNF---LNSYHDCIPQKMHFSQIDPSMAFGFYCH 532
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 61/100 (61%), Gaps = 7/100 (7%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR---P 150
+ F +DF S + SYRK F I ++ IT+D+GWGCMLR+ QM++A+ALL H P
Sbjct: 194 VERFLEDFKSILWFSYRKDFPSIENTSITTDIGWGCMLRTGQMILARALLKHFYNNENIP 253
Query: 151 WRKPLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGK 188
+ + ++ + +Y +I+ F D S+ + +SIH ++ K
Sbjct: 254 YGEKIKT--NSKYKKIMSWFCDYPSKENFYSIHQIVHKNK 291
>gi|281210274|gb|EFA84441.1| autophagy protein 4 [Polysphondylium pallidum PN500]
Length = 734
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 53/144 (36%), Positives = 77/144 (53%), Gaps = 11/144 (7%)
Query: 238 GERGGA---PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
GE G+ P+ C D S C W I++LVP+ LGL+K+N Y ++
Sbjct: 515 GENSGSFKDPLTCSDFFSSSCI-----PQRWKSIIILVPIKLGLDKLNEVYFREIKSMLE 569
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
PQS+G++GGKP S Y VG Q+E IYLDPH V ++ + + +YH V + +
Sbjct: 570 LPQSIGLIGGKPKQSFYFVGYQDEHIIYLDPHFVHDTVSPNDINF---SDSYHHCVPQKM 626
Query: 355 HLDSIDPSLAIGFYCRDKGLLVTF 378
+ +DPS+AIGFYC + F
Sbjct: 627 LISQLDPSMAIGFYCHTQSDFEDF 650
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 31/47 (65%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 136
N + F DF + + SYRK F PI ++ IT+D+GWGCM+R+ QM
Sbjct: 269 ANQEIDRFIADFKNILWFSYRKDFAPIENTNITTDIGWGCMVRTGQM 315
>gi|123397031|ref|XP_001301012.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121882136|gb|EAX88082.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 297
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 109/223 (48%), Gaps = 33/223 (14%)
Query: 107 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 166
+Y KGF P+ T+D WGC +RS Q L+ Q + +L + + ++ F
Sbjct: 27 FTYHKGFSPLAGG-YTTDKNWGCCIRSGQGLLMQFV--SKLYQLYGDKIKNIFPNG--SK 81
Query: 167 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
LF D +PF IH + + + +G+ AG WV P + ++ L
Sbjct: 82 FELFFDHPQAPFGIHCICRELETFGVKAGEWVKPSMLAPVFKDLLSF------------- 128
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 286
I+VV E+G C+ S S G P+LLL L+LG + + +Y+
Sbjct: 129 FGIHVVIA-ENG--------CLSRESLR-EALSYGH----PVLLLFTLMLGYKDFDLKYL 174
Query: 287 PTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
P LRLT + QS+G+VGG+ G + Y+VG Q+E+ +Y DPH+V
Sbjct: 175 PFLRLTLSLIYQSVGVVGGQQGKAYYLVGHQKENLLYFDPHEV 217
>gi|66810578|ref|XP_638996.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
gi|60467622|gb|EAL65643.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
Length = 551
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 76/135 (56%), Gaps = 6/135 (4%)
Query: 248 IDDASRHCSVFSKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
IDD S+ + D W P+L+L+P+ LGL+ +N Y +L F FPQ+LG+VG
Sbjct: 363 IDDESKD-EISENNNKDNDETWEPLLILIPMRLGLDGLNSIYHSSLLEIFKFPQNLGVVG 421
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
GKP AS Y + Q+++ YLDPH VQ I + ++ + +T+ + H+ +DPSL
Sbjct: 422 GKPRASLYFIAAQDDNLFYLDPHTVQNHIEV-ENGSKFPLNTFFCSTTKRTHVSEVDPSL 480
Query: 364 AIGFYCRDKGLLVTF 378
+ F+C+ K F
Sbjct: 481 VVAFFCKTKDDFNDF 495
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 69/119 (57%), Gaps = 5/119 (4%)
Query: 94 LAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
+ EF DF++R+L +YR+GF I D+ +D GWGCMLRS QML++ LL + LG W+
Sbjct: 140 IKEFLNDFTTRVLWFTYRQGFPCIDDTMYDNDCGWGCMLRSGQMLLSNVLLHNILGDEWK 199
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 211
+ + +I+ +F D ++PFSIHN+ G+ G G W P + ++ + L
Sbjct: 200 RSSSAT----HPDIISMFLDKPSAPFSIHNIAMEGQNLGKNIGEWFAPSIISQTIKILV 254
>gi|159128081|gb|EDP53196.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
fumigatus A1163]
Length = 226
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 69/117 (58%), Gaps = 3/117 (2%)
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
+ G+ + P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ
Sbjct: 18 NDGRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGS 77
Query: 319 SAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
YLDPH +P + NI + + TYH+ +R IH+ +DPS+ IGF +D+
Sbjct: 78 HLFYLDPHQTRPALPQRNIDDPYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDR 134
>gi|71000771|ref|XP_755067.1| autophagy cysteine endopeptidase Atg4 [Aspergillus fumigatus Af293]
gi|66852704|gb|EAL93029.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
fumigatus Af293]
Length = 226
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 69/117 (58%), Gaps = 3/117 (2%)
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
+ G+ + P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ
Sbjct: 18 NDGRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGS 77
Query: 319 SAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
YLDPH +P + NI + + TYH+ +R IH+ +DPS+ IGF +D+
Sbjct: 78 HLFYLDPHQTRPALPQRNIDDPYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDR 134
>gi|444726263|gb|ELW66801.1| Cysteine protease ATG4C [Tupaia chinensis]
Length = 378
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 107/260 (41%), Gaps = 68/260 (26%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
AGN + EF +DF SRI ++YR+ F PI S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45 AGN--VEEFRRDFISRIWLTYREEFPPIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102
Query: 149 RPWRKP----------------------------------LQKPF------------DRE 162
R W P L+ P D E
Sbjct: 103 RAWTWPDALNIENSDSESWTSHTVKKFTASVEASLSGERELKTPTISLKETIEKYSDDHE 162
Query: 163 ------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
+ +I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 163 IRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
G + IYV V D + + + AD +++LVP+ L
Sbjct: 223 PDLQG-----ITIYVAQ--------DCTVYSSDVIDKQRTAMTADNADDKAVIILVPVRL 269
Query: 277 GLEKVNPRYIPTLRLTFTFP 296
G E+ N Y+ ++ TF P
Sbjct: 270 GGERTNTDYLEFVK-TFHCP 288
>gi|367008068|ref|XP_003688763.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
gi|357527073|emb|CCE66329.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
Length = 356
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 112/255 (43%), Gaps = 36/255 (14%)
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 181
TSD+GWGCM+R+ Q L+A AL G P EI+ LF D +PFSIH
Sbjct: 85 TSDIGWGCMIRTGQTLLANALQRTNKGTPCS------------EIIELFVDETKNPFSIH 132
Query: 182 NLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
N + GK L G W P + E L C + + SGD +
Sbjct: 133 NFITVGKDLNLVKVGEWFSPSITIQIIEKLIENNNDHGIKKC-----IVSISSGDIYEQ- 186
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN-PRYIPTLRLTFTFPQSL 299
+ +DD+ + +K Q ILLL + LG+ +N +Y ++ +
Sbjct: 187 --DVLDELDDSEPPAN--TKQQH----ILLLFGIKLGINTINIEKYGQDIKDITNNKYTC 238
Query: 300 GIVGGKPGASTYIVGVQE--ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
GI GG+P +S + G + +Y DPH N D+ D STYHS + +
Sbjct: 239 GISGGQPKSSLFFFGYNNTHDRILYFDPHKPN---NFTTDN---DYSTYHSTEFNELEMF 292
Query: 358 SIDPSLAIGFYCRDK 372
++DPS+ IGF ++
Sbjct: 293 NLDPSMIIGFLVKNN 307
>gi|159465677|ref|XP_001691049.1| autophagy protein [Chlamydomonas reinhardtii]
gi|158279735|gb|EDP05495.1| autophagy protein [Chlamydomonas reinhardtii]
Length = 484
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 60/93 (64%), Gaps = 11/93 (11%)
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G++K+NP YIP L+ ++PQS+GIVGG+P AS Y+ GVQ+ S IYLDPH+ Q +
Sbjct: 339 GMDKINPVYIPQLQQVLSWPQSVGIVGGRPSASLYVCGVQDASFIYLDPHEAQLALG--- 395
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
TY DV+R + +DPSLAIGF C
Sbjct: 396 --------TYFCDVVRVLPSAQLDPSLAIGFVC 420
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 65/117 (55%), Gaps = 6/117 (5%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
DF SR+ +YRK F +G S +TSDVGWGC LRS QML+A+ R G R L + +
Sbjct: 49 DFRSRMWCTYRKDFPALGPSLLTSDVGWGCTLRSGQMLLAEVRHGWRAGAMMRVALGRDW 108
Query: 160 DR-----EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
R E V ++ D +P SIH + AG G+ G W+GP+ +C+ EAL
Sbjct: 109 QRCSDNLEAVRPVVAALLDCAEAPLSIHRICDAGGPAGIVPGRWLGPWMLCKGLEAL 165
>gi|66822477|ref|XP_644593.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|66822607|ref|XP_644658.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|74857708|sp|Q557H7.1|ATG4_DICDI RecName: Full=Cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|60472726|gb|EAL70676.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|60472781|gb|EAL70731.1| autophagy protein 4 [Dictyostelium discoideum AX4]
Length = 745
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 47/109 (43%), Positives = 68/109 (62%), Gaps = 3/109 (2%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++++PL LG +K+N YI L+L PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
PH VQ +N D ++TY + + + +DPSL+IGFYCRD+
Sbjct: 563 PHFVQESVNPNSFDY---SNTYSGCIPQKMPFTQLDPSLSIGFYCRDQA 608
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 154
F D +S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H P
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289
Query: 155 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 189
+KP Y ++L F D S+ + IH ++ +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326
>gi|28395487|gb|AAO39081.1| autophagy protein 4 [Dictyostelium discoideum]
Length = 745
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 47/109 (43%), Positives = 68/109 (62%), Gaps = 3/109 (2%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++++PL LG +K+N YI L+L PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
PH VQ +N D ++TY + + + +DPSL+IGFYCRD+
Sbjct: 563 PHFVQESVNPNSFDY---SNTYSGCIPQKMPFTQLDPSLSIGFYCRDQA 608
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 154
F D +S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H P
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289
Query: 155 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 189
+KP Y ++L F D S+ + IH ++ +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326
>gi|119493442|ref|XP_001263911.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
gi|119412071|gb|EAW22014.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
Length = 179
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 66/109 (60%), Gaps = 3/109 (2%)
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ YLDPH
Sbjct: 26 PTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHLFYLDPH 85
Query: 327 DVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+P + NI + + + TYH+ +R IH+ +DPS+ IGF +D+
Sbjct: 86 QTRPALPQRNIDERYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDR 134
>gi|148693227|gb|EDL25174.1| autophagy-related 4D (yeast), isoform CRA_c [Mus musculus]
Length = 257
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 80/176 (45%), Gaps = 44/176 (25%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP 249
>gi|401425377|ref|XP_003877173.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322493418|emb|CBZ28705.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 394
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 116/266 (43%), Gaps = 33/266 (12%)
Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRAFKAEYWSPSQGC---EAIKRTMQG--AVKT 148
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLNEGPSAA 255
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGF 367
+ T +R +H +D SL + F
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAF 281
>gi|291238482|ref|XP_002739158.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
kowalevskii]
Length = 338
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 68/120 (56%), Gaps = 2/120 (1%)
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
S+ W +++L+P+ LG E++NP YI ++ FT +GI+GGKP S Y +G QE+
Sbjct: 156 SRSSQLWCSVIILIPVRLGGEELNPVYISCIKSLFTLKHCIGIIGGKPKHSLYFIGFQED 215
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
I+LDPH Q V+++ D ++H R + L +DPS IGFYC+ + F
Sbjct: 216 KLIHLDPHLCQDVVDMRSRDFPL--QSFHCMSPRKMSLMKMDPSCTIGFYCKTQDDFKEF 273
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 61/100 (61%), Gaps = 6/100 (6%)
Query: 59 SRTGISSSTSDIWLLGVC--HKIAQDEALGDAAGNNGLA---EFNQDFSSRILISYRKGF 113
S+T S T IWLLG C H+ +A ++ L F +DF+SR+ ++YR+ F
Sbjct: 42 SQTNFSYHTP-IWLLGECYHHRPDDPNETEQSAEDDCLTPMERFKRDFTSRLWLTYRREF 100
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+ + +T+D GWGCMLRS QM++AQ+ L H LGR +++
Sbjct: 101 QQLAGTSLTTDCGWGCMLRSGQMMLAQSFLTHFLGRVYKQ 140
>gi|157872135|ref|XP_001684616.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
gi|68127686|emb|CAJ05824.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
Length = 394
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 117/266 (43%), Gaps = 33/266 (12%)
Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ET+PFSIHN++++ + + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRVFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLCEGLSAA 255
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGF 367
+ T +R +H +D SL + F
Sbjct: 256 ASVTPSVADVRCVHWSRVDTSLFLAF 281
>gi|145500634|ref|XP_001436300.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403439|emb|CAK68903.1| unnamed protein product [Paramecium tetraurelia]
Length = 406
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 141/348 (40%), Gaps = 66/348 (18%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
I++LG H+I D+ + + + Q I I+YR+ + P+ S SD GWGC
Sbjct: 38 IYILG--HRIDIDQF----EIEDRINKIKQLVQETIWITYRRNYPPLYQSNYISDTGWGC 91
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS------------- 176
MLR QM +AQ L H ++ D +Y I+ F D+++
Sbjct: 92 MLRVGQMAMAQMLKKHLKNHGDKR------DEDYDNIILAFADNDSQENKEFIEFQNSKD 145
Query: 177 ---------PFSIHNL-LQAGKAYGLAAGSWVGPYAM------------CRSWEALARCQ 214
PFSI + A K + L G W P + R+ E L
Sbjct: 146 KQKAHNFICPFSIQKIAYLAKKEFNLDPGEWYRPNYILFLLELLHNTIPIRASENLKLSV 205
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 274
++ L L ++ + D + +++ + K + + V
Sbjct: 206 FNDSCLFLDQLMNRMFEAKFETDKD--------LEEQLEKTQLIGKN-----SLAIFVLT 252
Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
+GL++ N +Y+ L P GIVGG P + YI+G + +YLDPH VQ N
Sbjct: 253 RIGLDEPNQKYLKILDEIMELPYFQGIVGGTPKRAFYILGKINDHYLYLDPHYVQEAEN- 311
Query: 335 GKDDLEADT----STYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
KD + + ++Y I ++ +D S+ + FY R++ L+ F
Sbjct: 312 -KDQINENKMFNRTSYSCKNIHLLNQKHVDTSMGLSFYIRNQSELLQF 358
>gi|384493397|gb|EIE83888.1| hypothetical protein RO3G_08593 [Rhizopus delemar RA 99-880]
Length = 194
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 78/158 (49%), Gaps = 27/158 (17%)
Query: 70 IWLLGVCHKI--------AQDEALGDAAGNNGLA----------------EFNQDFSSRI 105
IWLLG + I A EA D N G + +F DF+SR+
Sbjct: 29 IWLLGCSYIIKPTDHIQQALLEAQRDLMFNKGSSENEEENNQNMHMLWPPDFYDDFTSRL 88
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-KPFDREYV 164
++YR + PI S +D+GWGC LRS Q L+A L+ H LGR WR+ Q + ++Y
Sbjct: 89 WMTYRHNYPPIRPSSHKTDIGWGCTLRSGQSLLANTLIIHFLGRDWRRQTQNQAAWKQYS 148
Query: 165 EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 200
I+H F D S +PFSIH + GK G G W GP
Sbjct: 149 RIVHWFLDELSPRAPFSIHRIALLGKQLGKNIGEWFGP 186
>gi|402593880|gb|EJW87807.1| hypothetical protein WUBG_01286, partial [Wuchereria bancrofti]
Length = 216
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 68/120 (56%), Gaps = 14/120 (11%)
Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
+W P+L+++PL LGL +N Y P ++ F PQ +GI+GG+P + Y G+ + + +YL
Sbjct: 28 EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 87
Query: 324 DPHDVQPVINIG--------KDDL------EADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
DPH Q +++ +DD E STYH I +D +DPSLA+GF+C
Sbjct: 88 DPHFCQNFVDLDETTTTRDERDDYVEIKNDEFKDSTYHCPFILSTKIDKVDPSLALGFFC 147
>gi|146093458|ref|XP_001466840.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
gi|134071204|emb|CAM69889.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
Length = 394
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 116/266 (43%), Gaps = 33/266 (12%)
Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLSEGPSAA 255
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGF 367
+ T +R +H +D SL + F
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAF 281
>gi|260823874|ref|XP_002606893.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
gi|229292238|gb|EEN62903.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
Length = 384
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 66/109 (60%), Gaps = 2/109 (1%)
Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
+W +++L+P+ LG E +NP Y P ++ FT LG++GG+P S Y VG QE+ I+L
Sbjct: 203 NWCSVIILIPVRLGGESLNPIYEPCIKGLFTMDHCLGVIGGRPKHSLYFVGFQEDKLIHL 262
Query: 324 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
DPH Q V+++ D + ++H R + + +DPS IGFYCR +
Sbjct: 263 DPHFCQEVVDMTPRDFPLE--SFHCMNPRKMSIARMDPSCTIGFYCRTR 309
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 7/91 (7%)
Query: 70 IWLLGVCHKIAQDE------ALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKIT 122
IWL GVC+ +E L D+ E F +DF+S++ ++YR+ F + S T
Sbjct: 88 IWLQGVCYHRRNEELTKELEPLTDSDRRLYTMELFKRDFASKVWLTYRREFPQLAGSMFT 147
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+D GWGCMLRS QML+A L+ H LGR +++
Sbjct: 148 TDCGWGCMLRSGQMLLAGGLVMHFLGRVYKQ 178
>gi|398019156|ref|XP_003862742.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
gi|322500973|emb|CBZ36050.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
Length = 394
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 116/266 (43%), Gaps = 33/266 (12%)
Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSANG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLSEGPSAA 255
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGF 367
+ T +R +H +D SL + F
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAF 281
>gi|170572866|ref|XP_001892265.1| Peptidase family C54 containing protein [Brugia malayi]
gi|158602497|gb|EDP38912.1| Peptidase family C54 containing protein [Brugia malayi]
Length = 440
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 68/121 (56%), Gaps = 16/121 (13%)
Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
+W P+L+++PL LGL +N Y P ++ F PQ +GI+GG+P + Y G+ + + +YL
Sbjct: 252 EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 311
Query: 324 DPHDVQPVINIG---------------KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
DPH Q +++ K+D E STYH I +D +DPSLA+GF+
Sbjct: 312 DPHFCQNFVDLDEATTTKDERGDYVEIKND-EFRDSTYHCPFILSTKIDKVDPSLALGFF 370
Query: 369 C 369
C
Sbjct: 371 C 371
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 57/122 (46%), Gaps = 28/122 (22%)
Query: 85 LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
LG+ + G +A + +S + +YRK F PIG + T+D GWGCMLR QML+A+ L+
Sbjct: 59 LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 118
Query: 144 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
LGR W +DR EY IL G SE G G W
Sbjct: 119 VRHLGRNWL------WDRDVMLTEYKRILPNMGVSE----------------GKEIGEWF 156
Query: 199 GP 200
GP
Sbjct: 157 GP 158
>gi|123497568|ref|XP_001327207.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121910133|gb|EAY14984.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 296
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 70/271 (25%), Positives = 117/271 (43%), Gaps = 54/271 (19%)
Query: 107 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY--- 163
+YR F I ITSD GWGC RS+Q L+A L + P D EY
Sbjct: 30 FTYRCNFQAIQPGNITSDSGWGCCYRSAQGLIASYFLNY-----------APVDAEYFFT 78
Query: 164 ----VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
+ + LF D PFSI NL+ + +G+ G+W P + + E++ +
Sbjct: 79 VFNEIPMFSLFEDRVEMPFSIQNLVYRSELFGVKPGTWAKPSQLAATIESIFK------- 131
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
L +++ ++S D + ++ D + +LG++
Sbjct: 132 ----DLKLSV-LISKDSN-------IIPEDVKTMRAPFLLLIPI-----------LLGMK 168
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE-ESAIYLDPHDVQPVINIGKDD 338
V ++IP ++ TF P+ LG V G S ++VG+ E ++ +Y DPH + +
Sbjct: 169 DVEQKFIPFIKYTFQRPEFLGAVSGSSDFSYFLVGLSEDQNVVYFDPHVTKQAVASS--- 225
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
D S + R I + S++PS +GF+C
Sbjct: 226 --FDHSEFFEVPPRGIKMKSLNPSFLLGFFC 254
>gi|448509127|ref|XP_003866066.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
gi|380350404|emb|CCG20626.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
Length = 419
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 117/260 (45%), Gaps = 39/260 (15%)
Query: 110 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 169
R FD + TSD GWGCM+R+SQ L+A AL K + +EIL L
Sbjct: 130 RSLFD---NENFTSDAGWGCMIRTSQNLLANAL---------LKLAGEANGNVQLEILKL 177
Query: 170 FGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
F D + FSIHN ++ A L+ G W GP A S L + Q P
Sbjct: 178 FQDDPNAAFSIHNFIRVASASPLSVKPGQWFGPNAASISIRQLT------IEMTDQESPT 231
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
+ V E+ + DD + K P+LLL P+ LG++ VN Y
Sbjct: 232 VVPFVYISENAD-------LYDDEIEETFLKEK-----RPLLLLFPVRLGIDHVNKYYYK 279
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPHDVQPVINIGKDDLEADTSTY 346
++ S+GI GGKP +S Y +G + +E+ IY DPH Q + + ++Y
Sbjct: 280 SILQLLASRFSVGIAGGKPSSSFYFIGYENDENLIYFDPHLPQVF------ESPINLASY 333
Query: 347 HSDVIRHIHLDSIDPSLAIG 366
H+ + ++ +DPS+ IG
Sbjct: 334 HTLNYNKLSIEMLDPSMMIG 353
>gi|400593108|gb|EJP61110.1| peptidase family C54 [Beauveria bassiana ARSEF 2860]
Length = 378
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 95/240 (39%), Gaps = 71/240 (29%)
Query: 91 NNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGW 127
N +F DF SR ++YR F PI SK +SD GW
Sbjct: 109 NGWPQQFITDFDSRFWMTYRNDFKPIPRSKDPKAASSMSFPMRIKYQLGDQGGFSSDSGW 168
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
GCM+RS Q L+A A RLGR WR+ QK E ++I+ +F D +P+SIHN + G
Sbjct: 169 GCMIRSGQSLLANATGIVRLGRDWRRGQQK---AEEIKIMRMFADDPAAPYSIHNFVDYG 225
Query: 188 KAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
+ G G W GP A +
Sbjct: 226 SSKCGKYPGEWFGPSATSQ----------------------------------------- 244
Query: 247 CIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
CI+ S + ++D + P L+L+ LG++K+ Y L PQS+GI G
Sbjct: 245 CINPDVYEDSFMATAKSDHGFFKPTLILISTRLGIDKITQVYWEALISALQMPQSVGIAG 304
>gi|261335715|emb|CBH18709.1| peptidase, putative [Trypanosoma brucei gambiense DAL972]
Length = 348
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 81/282 (28%), Positives = 125/282 (44%), Gaps = 40/282 (14%)
Query: 93 GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
G AE + + ++L SYR F+P+ + T+D+GWGC +R+ QM++A AL+ ++ G
Sbjct: 37 GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93
Query: 152 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
F+ V L HLF D ++PF IH + G +G GSW GP +
Sbjct: 94 ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
AL M Y+ SG + G V+ + D K
Sbjct: 150 MGAL----------------MEDYLSSGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
+LLL+P++LG ++ Y L+ ++G VGGK G++ + +G Q + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248
Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
Q +DT S + L S S+ +GFY
Sbjct: 249 YAQSAFTC------SDTQGKISGEWYTLPLTSCSTSVLLGFY 284
>gi|431896953|gb|ELK06217.1| Cysteine protease ATG4C [Pteropus alecto]
Length = 378
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 77/260 (29%), Positives = 105/260 (40%), Gaps = 68/260 (26%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
AGN + EF +DF SRI ++YR+ F I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45 AGN--VEEFRKDFISRIWLTYREEFPSIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102
Query: 149 RPWRKP----------------------------------LQKPF------------DRE 162
R W P L+ P D E
Sbjct: 103 RAWTWPDALNIDNSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGRYSDDHE 162
Query: 163 YV-EILH-----LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
EI H FGDS + F +H L++ GK G AG W GP + R
Sbjct: 163 MQNEIYHRKIISWFGDSPLALFGLHQLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
G + IYV V D + C+ + D +++LVP+ L
Sbjct: 223 PELQG-----ITIYVAQ--------DCTVYSSDVIDKQCASMAPDITDDKAVIILVPVRL 269
Query: 277 GLEKVNPRYIPTLRLTFTFP 296
G E+ N Y+ ++ TF P
Sbjct: 270 GGERTNIDYLEFVK-TFHCP 288
>gi|291059129|gb|ADD71908.1| autophagy protein 4 [Acanthamoeba castellanii]
Length = 373
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 68/117 (58%), Gaps = 4/117 (3%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
F DF SR+ ++YR F IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR +
Sbjct: 147 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 206
Query: 157 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEAL 210
+ Y E+L F D S SP+SIH + + G + + G W P + + L
Sbjct: 207 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLL 262
>gi|149422017|ref|XP_001518728.1| PREDICTED: cysteine protease ATG4D-like [Ornithorhynchus anatinus]
Length = 286
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 63/107 (58%), Gaps = 2/107 (1%)
Query: 262 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
+A+W I++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +
Sbjct: 109 EAEWKSIIILVPVRLGGETLNPAYMPCIKELLRMEPCLGIIGGKPKHSLYFIGYQDDFLL 168
Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
YLDPH QP ++ KD + ++H R + +DPS +GFY
Sbjct: 169 YLDPHYCQPCVDTMKDSFPLE--SFHCTAPRKLPFAKMDPSCTVGFY 213
>gi|302833489|ref|XP_002948308.1| autophagy protein [Volvox carteri f. nagariensis]
gi|300266528|gb|EFJ50715.1| autophagy protein [Volvox carteri f. nagariensis]
Length = 391
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 64/110 (58%), Gaps = 17/110 (15%)
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G++K+NP Y+P L+ T+PQS+GIVGG+P AS Y+ GVQ+ S ++LDPH+ QP + G
Sbjct: 216 GMDKINPVYLPQLQRILTWPQSVGIVGGRPSASLYLCGVQDSSFLFLDPHEAQPTVRWGI 275
Query: 337 DDLEADT-----------------STYHSDVIRHIHLDSIDPSLAIGFYC 369
T +TY D +R + ++DPS+AIGF C
Sbjct: 276 AGDAGHTKEAGNGGSAVVLPASSLATYFCDTVRLMPATALDPSMAIGFLC 325
>gi|149020503|gb|EDL78308.1| rCG31864, isoform CRA_a [Rattus norvegicus]
Length = 256
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 81/176 (46%), Gaps = 45/176 (25%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISSVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP 248
>gi|74026240|ref|XP_829686.1| peptidase [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|70835072|gb|EAN80574.1| peptidase, putative [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 348
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 80/282 (28%), Positives = 125/282 (44%), Gaps = 40/282 (14%)
Query: 93 GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
G AE + + ++L SYR F+P+ + T+D+GWGC +R+ QM++A AL+ ++ G
Sbjct: 37 GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93
Query: 152 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
F+ V L HLF D ++PF IH + G +G GSW GP +
Sbjct: 94 ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
AL M Y+ +G + G V+ + D K
Sbjct: 150 MGAL----------------MEDYLRNGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
+LLL+P++LG ++ Y L+ ++G VGGK G++ + +G Q + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248
Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
Q +DT S + L S S+ +GFY
Sbjct: 249 YAQSAFTC------SDTQGKISGEWYTLPLTSCSTSVLLGFY 284
>gi|255711728|ref|XP_002552147.1| KLTH0B08272p [Lachancea thermotolerans]
gi|238933525|emb|CAR21709.1| KLTH0B08272p [Lachancea thermotolerans CBS 6340]
Length = 483
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 82/245 (33%), Positives = 113/245 (46%), Gaps = 34/245 (13%)
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD+GWGCM+R+ Q L+ AL RL P P +K +++ F D ++PFS+HN
Sbjct: 146 SDIGWGCMIRTGQALLGNALA--RLRSP---PEEK-------QLIGWFEDRSSAPFSLHN 193
Query: 183 LLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
++ G A G W GP A RS ++L + GL I SGD E
Sbjct: 194 FVREGNALSRKPPGEWFGPSATSRSIQSLVHA-FPQCGLNH----CIISTDSGDVYEEDV 248
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
G P++ + ILLL+ + LGL VN RY P ++ S+GI
Sbjct: 249 G-PIL--------------EREPQATILLLLGVKLGLNNVNSRYWPDVKHILGSSFSVGI 293
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
GG+P +S Y G Q + YLDPH Q + D E S HS +H +DP
Sbjct: 294 AGGRPSSSLYFFGYQGDYLFYLDPHTSQLDLASCATDNEKYESV-HSARFNKVHFSELDP 352
Query: 362 SLAIG 366
S+ IG
Sbjct: 353 SMLIG 357
>gi|312378951|gb|EFR25375.1| hypothetical protein AND_09326 [Anopheles darlingi]
Length = 350
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 67/103 (65%), Gaps = 2/103 (1%)
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
QD SR+ +YR+GF PIG++++T+D GWGCMLR QM++A+AL LGR W+ ++
Sbjct: 72 QDVQSRLWCTYRRGFVPIGNTQLTTDKGWGCMLRCGQMVLAEALTELHLGRDWQWS-EET 130
Query: 159 FDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGP 200
D Y++I++ F D++ +PFS+H + L + G W GP
Sbjct: 131 RDATYLKIVNRFEDNKQAPFSLHQIALMGDSSEEKRIGEWFGP 173
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 56/95 (58%), Gaps = 3/95 (3%)
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG-- 335
L +VNP YI L+ F P S G++GG+P + Y +G E A+YLDPH VQ V IG
Sbjct: 180 LNEVNPIYIEGLKKCFQLPGSCGMIGGRPNQALYFIGYVGEEALYLDPHTVQRVGCIGEK 239
Query: 336 KDDLEADT-STYHSDVIRHIHLDSIDPSLAIGFYC 369
++ +E + +T+H I S+DPSLA+ F C
Sbjct: 240 QESVEQEQDATFHQRHASRIAFASMDPSLAVCFLC 274
>gi|358339268|dbj|GAA47364.1| autophagy-related protein 4 [Clonorchis sinensis]
Length = 700
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/109 (42%), Positives = 64/109 (58%), Gaps = 5/109 (4%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAI 321
A W P+LL +PL LGL + NP Y ++ P S+GI+GG+P + +IVG +E +
Sbjct: 259 ATWRPLLLFIPLRLGLHQPNPCYFNAIKAILQIPHSIGIMGGRPSHAVWIVGTAGDEDLL 318
Query: 322 YLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
LDPH QP +DDL A D T+H D + L+ +DPS+ IGF C
Sbjct: 319 CLDPHTTQPA---SQDDLTAEDDVTHHCDCPVRLPLERLDPSMVIGFVC 364
Score = 41.2 bits (95), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 3/71 (4%)
Query: 136 MLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
M++A+A+ LG+ WR P + D Y + +F D ++S +SI N+ G A
Sbjct: 1 MMLAEAITRIHLGKDWRWTPGCQ--DEAYCRLRRMFQDHKSSLYSIQNITMLGMALDKPI 58
Query: 195 GSWVGPYAMCR 205
GSW GP + +
Sbjct: 59 GSWFGPNTVAQ 69
>gi|323331874|gb|EGA73286.1| Atg4p [Saccharomyces cerevisiae AWRI796]
Length = 347
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/242 (28%), Positives = 106/242 (43%), Gaps = 28/242 (11%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
M+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + AG
Sbjct: 1 MIRTGQSLLGNALQILHLGRDFRVNGNESLERES-KFVNWFNDTPEAPFSLHNFVSAGTE 59
Query: 190 YG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
G W GP A RS ++L G + I VS + E V
Sbjct: 60 LSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKVFAE 113
Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+P +
Sbjct: 114 NPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGRPSS 159
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ IG
Sbjct: 160 SLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLIGIL 213
Query: 369 CR 370
+
Sbjct: 214 IK 215
>gi|119604525|gb|EAW84119.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_d
[Homo sapiens]
Length = 360
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 65/110 (59%), Gaps = 2/110 (1%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 181 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 240
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
LDPH QP +++ + D + ++H R + +DPS +GFY D+
Sbjct: 241 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDR 288
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 86 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 139
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR 152
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 140 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 167
>gi|426387285|ref|XP_004060104.1| PREDICTED: cysteine protease ATG4D [Gorilla gorilla gorilla]
Length = 362
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 65/110 (59%), Gaps = 2/110 (1%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 183 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 242
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
LDPH QP +++ + D + ++H R + +DPS +GFY D+
Sbjct: 243 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDR 290
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 88 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 141
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR 152
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 142 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 169
>gi|47213810|emb|CAF92583.1| unnamed protein product [Tetraodon nigroviridis]
Length = 265
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 64/112 (57%), Gaps = 2/112 (1%)
Query: 261 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
W +++LVP+ LG E +NP YI ++ +GI+GGKP S Y +G Q+E
Sbjct: 151 AHQSWQSVIILVPVRLGGESLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFIGFQDEQL 210
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
+YLDPH QPV+++ + + + ++H + + + +DPS IGFY + K
Sbjct: 211 LYLDPHYCQPVVDVSQVNFSLE--SFHCNSPKKMPFSRMDPSCTIGFYAKSK 260
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 30/60 (50%), Positives = 41/60 (68%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+ F F SRI ++YRK F P+ S +T+D GWGCMLRS QML+AQ LL H + R +++
Sbjct: 74 VERFRLAFVSRIWLTYRKDFPPLEGSTLTTDCGWGCMLRSGQMLLAQGLLVHLMHRVYKE 133
>gi|298712912|emb|CBJ33424.1| Autophagy-related protein 4 [Ectocarpus siliculosus]
Length = 546
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 70/138 (50%), Gaps = 23/138 (16%)
Query: 71 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
W++G+ + ++E E D S + I+YR GF + T D GWGCM
Sbjct: 38 WIMGIPYTELREE------------ERRLDVFSTMWITYRSGFPKMEPYGYTDDSGWGCM 85
Query: 131 LRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGD--SETSPFSIHN 182
LRS+QML+ QAL H LGR WR P L+ P EY ++ LF D E + FSIHN
Sbjct: 86 LRSAQMLMTQALQRHTLGRSWRVPRTLEERLRVP---EYRTLVRLFADHPGEANLFSIHN 142
Query: 183 LLQAGKAYGLAAGSWVGP 200
+ Q G Y G W GP
Sbjct: 143 MCQVGIRYDKLPGEWYGP 160
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 65/113 (57%), Gaps = 4/113 (3%)
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
++LLVPL LGL++++ YIP+L T PQSLG +GG+P + + +G Q + LDPH
Sbjct: 380 VVLLVPLRLGLDELSTGYIPSLLETLRVPQSLGFLGGRPNHAIFFIGAQGNTLTGLDPHT 439
Query: 328 VQPVINIGKD-DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
QP ++G+ E + H + + IDPSLA+ FY D+ TFE
Sbjct: 440 TQPAADMGEGFPSERYVHSLHCQSAVSMDVHRIDPSLALAFYLPDR---ATFE 489
>gi|151556001|gb|AAI49850.1| ATG4D protein [Bos taurus]
Length = 359
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 180 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 239
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
LDPH QP +++ + D + ++H R + +DPS +GFY D+ T
Sbjct: 240 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 293
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 85 TSFSKISSVHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAGGSLTSD 138
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR 152
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 139 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 166
>gi|146161894|ref|XP_001008187.2| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|146146576|gb|EAR87942.2| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 516
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 89/346 (25%), Positives = 140/346 (40%), Gaps = 70/346 (20%)
Query: 99 QDFSSRILISYRKGFDPIGD------------SKITSDVGWGCMLRSSQMLVAQALLFHR 146
++F + I I+YRK F + + S+ SD GWGCM+R QM A+ L H
Sbjct: 71 ENFYNIIWITYRKNFPALLNMIDKANLKNQKMSEYISDTGWGCMVRVGQMAFAEGLRRHL 130
Query: 147 LGRPWRKPLQKPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSWVGPY 201
+ +K + K + V I D + +P+SI + + A + L G W P
Sbjct: 131 VEN--KKLVVKKKEDLRVIIEGFLDDDQKCIDFAPYSIQKISKIALSDFNLLPGEWYTPI 188
Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGD-------EDGERGGAPVVCID 249
+C L ++A G + L +A++ +V D D +RG +C +
Sbjct: 189 RICYILGLLHNERKAIKG--TEDLKVAVFSSSRPIVFQDFLERMCKVDPQRGKHAQICPN 246
Query: 250 -------------DASRHCSVFSKGQ---------ADWTPILLLV-PL------------ 274
D H + + Q ++ TP L LV P+
Sbjct: 247 QCRIIKQDQKSKVDHDHHKDIKLEKQNSNSEILVVSEETPKLRLVCPIHHELQYSMIVYI 306
Query: 275 --VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
++GL+ P Y+ + F SLG++GGKP + Y VG E+ IYLDPH VQ
Sbjct: 307 VCLIGLDTPQPEYLELAKKMMDFKYSLGLIGGKPKKALYFVGRIEDEFIYLDPHYVQEFS 366
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
N + TY + +ID S ++ +Y +D L F
Sbjct: 367 NEKNFQSSSQLETYFCKKFQTYPSKNIDSSFSLMYYLKDLEQLEEF 412
>gi|345311182|ref|XP_001519565.2| PREDICTED: cysteine protease ATG4D-like, partial [Ornithorhynchus
anatinus]
Length = 147
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 32/136 (23%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----R 152
F +DF SR+ ++YR+ F P+ S TSD GWGCMLRS QML+AQ L+ H L R W
Sbjct: 5 FQRDFVSRLWLTYRRDFPPLEGSAWTSDCGWGCMLRSGQMLLAQGLVVHLLSRDWIWAEA 64
Query: 153 KPLQKP----------------------------FDREYVEILHLFGDSETSPFSIHNLL 184
P KP +R++ I+ F D +PFS+H L+
Sbjct: 65 GPAPKPGEHRLLKSDPGGPSRSPAPPPPAGVLQEQERQHRRIVSWFADHPQAPFSLHRLV 124
Query: 185 QAGKAYGLAAGSWVGP 200
+ G+ G AG W GP
Sbjct: 125 RLGQGSGKRAGDWYGP 140
>gi|154281231|ref|XP_001541428.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150411607|gb|EDN06995.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 463
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 4/114 (3%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
D P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q Y
Sbjct: 253 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQASHFFY 312
Query: 323 LDPHDVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
LDPH +P + + + +TYH+ +R +H+ +DPS+ IGF RD+
Sbjct: 313 LDPHHTRPALAYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDE 366
>gi|119623097|gb|EAX02692.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_d
[Homo sapiens]
Length = 172
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 69/127 (54%), Gaps = 11/127 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + + +
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKMCRV 141
Query: 190 YGLAAGS 196
L+A +
Sbjct: 142 LPLSADT 148
>gi|257205644|emb|CAX82473.1| autophagy-related cysteine endopeptidase 2 [Schistosoma japonicum]
Length = 632
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 62/108 (57%), Gaps = 4/108 (3%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
++W P+LL VPL LGL NP Y ++ F P +GI+GG P + +IVGV + I
Sbjct: 385 SNWRPLLLFVPLRLGLHNPNPCYFNAIKAVFRLPNCIGILGGSPCHAVWIVGVTGDDVIC 444
Query: 323 LDPHDVQPVINIGKDDLEAD-TSTYHSDVIRHIHLDSIDPSLAIGFYC 369
LDPH QP G+ +L+ D TYH + + L +DPS+ +GF C
Sbjct: 445 LDPHTTQPA---GRGNLKPDYDQTYHCENPIRMPLKRLDPSMVLGFLC 489
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 153
E SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR WR
Sbjct: 43 EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P Q+ EY +L +F D + +SI + G + G + GSW GP + + + L+
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160
Query: 214 QR 215
R
Sbjct: 161 DR 162
>gi|72389991|ref|XP_845290.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359288|gb|AAX79730.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei]
gi|70801825|gb|AAZ11731.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
Length = 327
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 128/272 (47%), Gaps = 38/272 (13%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+S L +YR+ FDP+ S +TSD GWGC+ R++QML+A +L R+ +
Sbjct: 41 NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 219
+Y L D + +PFS+H +++ + L G + P +A + EA++ C + T
Sbjct: 92 QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144
Query: 220 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
G S P+++ + V+G E V C SR+ +L+L PL G
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187
Query: 279 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGK 336
+ ++ + +L P+S+G+VGG P YI+G +E +YLDPH +
Sbjct: 188 SRYMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCKTQDALLSS 247
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+ E S +R + +D S +GF+
Sbjct: 248 EPGETGVVKPTSSNLRSVPYGQVDTSFFLGFF 279
>gi|118349810|ref|XP_001008186.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89289953|gb|EAR87941.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 343
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 72/293 (24%), Positives = 122/293 (41%), Gaps = 37/293 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
I SYR GF + I SD GWGCMLRS QM+ A LL H P +Q + +
Sbjct: 27 IYFSYRSGFSHQFQNHIFSDSGWGCMLRSGQMIFANGLLRHLKENP---QIQNQLKIQNI 83
Query: 165 E-----ILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 218
I+ F +++ PFSI + A + + L G W P + S + L + +
Sbjct: 84 NDILLFIIKFFIENKDQPFSIQQIAAVALEEFKLEMGFWYSPNRIAYSLKKLLNNFQTFS 143
Query: 219 GLGCQS------LPMAIYVVSGDEDGERGGAPV------VCIDDASRHCSVFSKGQADWT 266
+ S P+ G++ + + + I++ + + + +
Sbjct: 144 EMNIVSEVMYSDRPLYFSQCVTAMTGQKIDSTLPKQLLQILINNIEKQIKIMKQNSNKYQ 203
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
+++GL+ +Y+ L FT S+G ++G+ + YLDPH
Sbjct: 204 INKQNYKILIGLDYPEEKYLDILIKLFTHRLSIG-----------MIGLNNDKLTYLDPH 252
Query: 327 DVQPV-INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
VQ IN E + TY + ++ I+ ++ PS+ +GFY +D L F
Sbjct: 253 IVQHADINTN----EINLKTYFQEEVKQINKHALGPSVGLGFYLKDLNDLNEF 301
>gi|240274226|gb|EER37743.1| cysteine protease atg4 [Ajellomyces capsulatus H143]
Length = 454
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 66/114 (57%), Gaps = 4/114 (3%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
D P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q Y
Sbjct: 245 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFY 304
Query: 323 LDPHDVQPVI---NIGKDDLEADT-STYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
LDPH +P + + G + +TYH+ +R +H+ +DPS+ IGF RD+
Sbjct: 305 LDPHHTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDE 358
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 50/119 (42%), Gaps = 24/119 (20%)
Query: 58 PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 113
P+R+ S++ LL H+ + LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144
Query: 114 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
DP + T+D GWGCM+RS Q L+A AL LGR R
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRACR 203
>gi|261328682|emb|CBH11660.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
gambiense DAL972]
Length = 327
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 128/272 (47%), Gaps = 38/272 (13%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+S L +YR+ FDP+ S +TSD GWGC+ R++QML+A +L R+ +
Sbjct: 41 NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 219
+Y L D + +PFS+H +++ + L G + P +A + EA++ C + T
Sbjct: 92 QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144
Query: 220 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
G S P+++ + V+G E V C SR+ +L+L PL G
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187
Query: 279 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGK 336
+ ++ + +L P+S+G+VGG P YI+G +E +YLDPH +
Sbjct: 188 SRCMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCKTQDALLSG 247
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+ E S +R + +D S +GF+
Sbjct: 248 EPGETGVVKPTSSNLRSVPYGQVDTSFFLGFF 279
>gi|389602150|ref|XP_001566661.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322505338|emb|CAM40177.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 398
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 75/265 (28%), Positives = 112/265 (42%), Gaps = 31/265 (11%)
Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
++ +YR GF+ P I +D GWGC+LR+SQML+A L + GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWAY--GRPADRRLALFFDH- 102
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ET+PFSIHNL+++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNLIRSVWNQRAFKAEYWSPSQGC---EAIKRTM--QDAIKT 148
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+ L + VV+ C+ H F +G A+ +L V + +
Sbjct: 149 EQLQTRVTVVTSTNG---------CVYADEVH-HTFKQG-AEVVLVLASVRVSAAAQLTQ 197
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
Y+ +L PQ LGIVGG PG S Y + YLDPH +
Sbjct: 198 ESYLQIEKL-MEQPQCLGIVGGVPGRSYYFFAHNQTQLFYLDPHQRTTAALLSDGPSATV 256
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGF 367
+ T +R +H +D SL + F
Sbjct: 257 SVTPSVSDVRCVHWSRVDTSLFLAF 281
>gi|340054025|emb|CCC48319.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma vivax Y486]
Length = 326
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 75/287 (26%), Positives = 119/287 (41%), Gaps = 40/287 (13%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
L D N L E +S L++YR F+P+ S +TSD GWGC+ R+SQML+A L
Sbjct: 28 TLYDEDELNNLLE-----TSFYLLTYRMNFEPLPCSTLTSDRGWGCLARASQMLLAHVLR 82
Query: 144 FHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPY 201
H +++ D +PFS+H + +A +G A W P
Sbjct: 83 RHAASEC------------HLKFFCDMNDEHLAPFSLHCMTRAVIKHGTEFRADYW-APS 129
Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 261
C EA+ C + G + +++ V S ER +
Sbjct: 130 QGC---EAIRSCVESAVRQGLLTQKLSVVVSSSGTIPER---------------EIHEHL 171
Query: 262 QADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
+ D + +L+LVP+ G ++ L P +G+VGG P YIVG
Sbjct: 172 RGDGS-VLVLVPVRCGTSRRMTQTMFFALEHLLHIPSCMGVVGGVPNRGYYIVGTSGHRL 230
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+YLDPH + + + + T ++++R + D +D S GF
Sbjct: 231 LYLDPHCMTQNAMVSCELGKVGIVTPTTNLLRSVRWDHVDTSFFFGF 277
>gi|343472883|emb|CCD15086.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 327
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/270 (30%), Positives = 123/270 (45%), Gaps = 42/270 (15%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L +YRK F+P+ S IT+D GWGC+ R+SQML+A AL R+ + F +Y
Sbjct: 45 LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMTLDFSFQYFC 95
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
+ D +PFS+H ++++ G L W P C EA++ C R+ G
Sbjct: 96 DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRSAIHRGAL 148
Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 282
+ + V G A + + +RH G A L+LVP+ G ++
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGKDDLEA 341
+ +L P +G+VGG PG YIVG +E +YLDPH + + E+
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIVGTGGQELLLYLDPHCMTQEALVS---CES 249
Query: 342 DTSTYHSDVIRH---IHLDSIDPSLAIGFY 368
DT+ RH + D +D S IGF+
Sbjct: 250 DTAGVVRPTPRHLLCVPYDRVDTSFFIGFF 279
>gi|256078123|ref|XP_002575347.1| autophagin-1 (C54 family) [Schistosoma mansoni]
gi|360045353|emb|CCD82901.1| autophagin-1 (C54 family) [Schistosoma mansoni]
Length = 556
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 69/122 (56%), Gaps = 4/122 (3%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 153
E + +SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR W+
Sbjct: 37 EIARHLNSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRFHLGRSWKWS 96
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P Q+ EY +L +F D ++ +SI + G + G + GSW GP + + + L+
Sbjct: 97 PEQE--SPEYYRLLQMFQDRRSALYSIQTITLTGVSLGKSIGSWFGPNTVAQVLKKLSVY 154
Query: 214 QR 215
R
Sbjct: 155 DR 156
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 4/78 (5%)
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TSTYHSDVI 351
F P +GI+GG P + +IVGV ++ I LDPH QP G+ +L+ D TYH D
Sbjct: 351 FRLPHCVGILGGSPCHAVWIVGVTDDDVICLDPHTTQPA---GRGNLKPDYDQTYHCDNP 407
Query: 352 RHIHLDSIDPSLAIGFYC 369
I L +DPS+ +GF C
Sbjct: 408 IRIPLKRLDPSMVLGFLC 425
>gi|221046296|dbj|BAH14825.1| unnamed protein product [Homo sapiens]
Length = 280
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 81/176 (46%), Gaps = 44/176 (25%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 107 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 156
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 157 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 216
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 217 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP 272
>gi|351695136|gb|EHA98054.1| Cysteine protease ATG4A [Heterocephalus glaber]
Length = 356
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/257 (26%), Positives = 102/257 (39%), Gaps = 87/257 (33%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 79 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 127
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR Q G
Sbjct: 128 MLRCGQMMLAQALICRHLGRA----------------------------------QMGVG 153
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 154 EGKSVGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 196
Query: 250 DASRHCSV--FSKGQAD----------------------WTPILLLVPLVLGLEKVNPRY 285
D + C + FS AD W P+LL+VPL LG+ ++NP Y
Sbjct: 197 DIKKMCRILPFSADTADESPPDSFITSNQSKGTSAFCPAWKPLLLIVPLRLGINQINPVY 256
Query: 286 IPTLRLTFTFPQSLGIV 302
+ + TF + G V
Sbjct: 257 VDAFK-TFVDTEENGTV 272
>gi|76156435|gb|AAX27646.2| SJCHGC05841 protein [Schistosoma japonicum]
Length = 414
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 153
E SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR WR
Sbjct: 43 EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P Q+ EY +L +F D + +SI + G + G + GSW GP + + + L+
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160
Query: 214 QR 215
R
Sbjct: 161 DR 162
>gi|195350257|ref|XP_002041657.1| GM16788 [Drosophila sechellia]
gi|194123430|gb|EDW45473.1| GM16788 [Drosophila sechellia]
Length = 269
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 57/199 (28%), Positives = 94/199 (47%), Gaps = 24/199 (12%)
Query: 175 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
S +SIH + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 1 NSFYSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD 52
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
V +DD C + W P+LL++PL LG+ +NP Y+P L+
Sbjct: 53 ---------STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLE 99
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVI 351
S G++GG+P + Y +G ++ +YLDPH Q + + A+ TYH
Sbjct: 100 LDSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHA 159
Query: 352 RHIHLDSIDPSLAIGFYCR 370
++ ++DPSLA+ F C+
Sbjct: 160 ARLNFSAMDPSLAVCFLCK 178
>gi|402581511|gb|EJW75459.1| peptidase family C54 containing protein [Wuchereria bancrofti]
Length = 256
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 64/122 (52%), Gaps = 12/122 (9%)
Query: 85 LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
LG+ + G +A + +S + +YRK F PIG + T+D GWGCMLR QML+A+ L+
Sbjct: 30 LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 89
Query: 144 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
LG W +DR EY IL +F D + FSIH + G + G G W
Sbjct: 90 VRHLGHNWL------WDRDVKLTEYKRILRMFQDKKNCLFSIHQIANMGVSEGKEIGEWF 143
Query: 199 GP 200
GP
Sbjct: 144 GP 145
>gi|432110194|gb|ELK33968.1| Cysteine protease ATG4A, partial [Myotis davidii]
Length = 256
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 11/114 (9%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH +
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQM 129
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/62 (25%), Positives = 36/62 (58%)
Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
Y + + I+LDPH Q ++ +D D + + + +++ ++DPS+A+GF+C+
Sbjct: 124 YSIHQMGDELIFLDPHTTQTFVDTEEDGTVDDQTFHCLQSPQRMNILNLDPSVALGFFCK 183
Query: 371 DK 372
++
Sbjct: 184 EE 185
>gi|444730159|gb|ELW70550.1| Cysteine protease ATG4A [Tupaia chinensis]
Length = 364
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/322 (23%), Positives = 127/322 (39%), Gaps = 95/322 (29%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
L EF D + I ++ G + +SD GWGCMLR QM++AQAL+ LGR
Sbjct: 24 LEEF-PDTDELVWILGKQHLLKTGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRA--- 79
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
Q G G + G W GP + + + LA
Sbjct: 80 -------------------------------QMGVGEGKSIGEWFGPNTVAQVLKKLALF 108
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF--------------- 258
+ +A+YV + V I+D + C V
Sbjct: 109 DEWNS--------LAVYVSMDN---------TVVIEDIKKMCCVLPLSADTDTESPPDSP 151
Query: 259 -----SKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV------- 302
SKG + W P+LL+VPL LG+ ++NP Y+ +L + L +
Sbjct: 152 TASNQSKGPSACGSAWKPLLLIVPLRLGINQINPVYVDAFKLQASCHPILIVTKEGVRRT 211
Query: 303 ---------GGKPGASTYIVGVQEESA---IYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
G + S + V ++ I+LDPH Q ++ ++ + D + +
Sbjct: 212 RILPPKDSSGARASESLKVKHVSFKTGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQS 271
Query: 351 IRHIHLDSIDPSLAIGFYCRDK 372
+ +++ ++DPS+A+GF+C+++
Sbjct: 272 PQRMNILNLDPSVALGFFCKEE 293
>gi|342181415|emb|CCC90894.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma congolense
IL3000]
Length = 327
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 121/270 (44%), Gaps = 42/270 (15%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L +YRK F+P+ S IT+D GWGC+ R+SQML+A AL R+ + F +Y
Sbjct: 45 LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMALDFSFQYFC 95
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
+ D +PFS+H ++++ G L W P C EA++ C R G
Sbjct: 96 DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRRAIHRGAL 148
Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 282
+ + V G A + + +RH G A L+LVP+ G ++
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGKDDLEA 341
+ +L P +G+VGG PG YI+G +E +YLDPH + + E+
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIIGTGGQELLLYLDPHCMTQEALVS---CES 249
Query: 342 DTSTYHSDVIRH---IHLDSIDPSLAIGFY 368
DT RH + D +D S +GF+
Sbjct: 250 DTVGVVRPTPRHLLCVPYDRVDTSFFLGFF 279
>gi|407852207|gb|EKG05835.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
Clan CA, family C54, putative [Trypanosoma cruzi]
Length = 328
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 121/285 (42%), Gaps = 43/285 (15%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 205
WR + ++ D+E ++PFS+H +++A KA W
Sbjct: 82 --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 263
GC+++ + + +R P + + S+ C + +
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170
Query: 264 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
++ +L+L P+ G ++ +L +G+VGG P S YI+G + +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMKFFSLEHLLHSSTCIGVVGGVPQRSYYILGTSGQRLLY 230
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
LDPH + + +A T + +++ + D +D S +GF
Sbjct: 231 LDPHCMTQEALVSSHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|194374239|dbj|BAG57015.1| unnamed protein product [Homo sapiens]
Length = 259
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 99/243 (40%), Gaps = 55/243 (22%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 1 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 61 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 103
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 104 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 126
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 127 -LTASNQGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 185
Query: 370 RDK 372
+++
Sbjct: 186 KEE 188
>gi|71407017|ref|XP_806004.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70869620|gb|EAN84153.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
gi|111154177|gb|ABH07410.1| autophagin-1 [Trypanosoma cruzi]
Length = 328
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 121/287 (42%), Gaps = 47/287 (16%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSET---SPFSIHNLLQA--GKAYGLAAGSWVGPYAM 203
WR + + H F D +T +PFS+H +++A KA W
Sbjct: 82 --WR------YSANDCRLDH-FRDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWT----- 127
Query: 204 CRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--G 261
GC+++ + + +R P + + S+ C + +
Sbjct: 128 --------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICS 168
Query: 262 QADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
++ +L+L P+ G ++ +L +G+VGG P S YI+G +
Sbjct: 169 NLEFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRL 228
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+YLDPH + + +A T + +++ + D +D S +GF
Sbjct: 229 LYLDPHCMTQEALVSSHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|71425372|ref|XP_813094.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70877946|gb|EAN91243.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
Length = 328
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 121/285 (42%), Gaps = 43/285 (15%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 205
WR + ++ D+E ++PFS+H +++A KA W
Sbjct: 82 --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 263
GC+++ + + +R P + + S+ C + +
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170
Query: 264 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
++ +L+L P+ G ++ +L +G+VGG P S YI+G + +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLY 230
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
LDPH + + +A T + +++ + D +D S +GF
Sbjct: 231 LDPHCMTQEALVSGHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|403345460|gb|EJY72096.1| Cysteine protease family C54 putative [Oxytricha trifallax]
Length = 823
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 64/114 (56%), Gaps = 3/114 (2%)
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
IL+++P LGL KVN Y +++ F ++GI+GG+P + Y VG Q+ I LDPH
Sbjct: 611 ILVIIPTRLGLNKVNKEYYSSIKYVFQCRLNVGIMGGRPNQALYFVGTQKTDLICLDPHL 670
Query: 328 VQPVINIGKDDLEAD--TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
VQ + + +++L TYH D + + + +D SLA GFY +D F+
Sbjct: 671 VQDTV-LNQEELSNVELNQTYHCDQAKKLSMTKLDTSLAFGFYLKDYNDFEVFQ 723
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 10/99 (10%)
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLG-----RPWRKPLQKPFDREYVEILHLFGD---S 173
T+DVGWGC +R QM++ QAL+ H +G + QK + Y +I+ L D S
Sbjct: 394 TTDVGWGCTIRVGQMMICQALMRHLIGLDHSVKNLSSTEQKRLN--YAKIIQLIHDNDCS 451
Query: 174 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
+T FSI N+ + G + G W GP+A+ L R
Sbjct: 452 QTGAFSIQNIAKMGFCHDKLPGEWYGPHALTIMLRDLNR 490
>gi|50291183|ref|XP_448024.1| hypothetical protein [Candida glabrata CBS 138]
gi|62899752|sp|Q6FP20.1|ATG4_CANGA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49527335|emb|CAG60975.1| unnamed protein product [Candida glabrata]
Length = 483
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 73/250 (29%), Positives = 115/250 (46%), Gaps = 37/250 (14%)
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIH 181
+DVGWGCM+R+ Q L+ AL R+ + +P D + EI LF D+ S FS+
Sbjct: 135 TDVGWGCMIRTGQSLLGNAL--QRVKSTVKDQPYIYEMD-DTKEITDLFKDNTKSAFSLQ 191
Query: 182 NLLQAGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
N ++ G+ Y +A G W GP L + C I V SGD E
Sbjct: 192 NFVKCGRIYNKIAPGEWFGPATTATCIRYLIQENPCYGIEAC-----YISVSSGDIFKEN 246
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTP---ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
+G D P IL+L+ + LGL+ V+ RY ++ P
Sbjct: 247 ------------------IQGMIDRYPNGNILILLGIKLGLDSVHERYWGEIKTMLESPF 288
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
S+GI GG+P +S Y G +++ ++ DPH+ Q + DD + + H++ ++
Sbjct: 289 SVGIAGGRPSSSLYFFGYFDDTLLFFDPHNSQTAL---IDDFD---ESCHTENFGKLNFS 342
Query: 358 SIDPSLAIGF 367
+DPS+ +GF
Sbjct: 343 DLDPSMLLGF 352
>gi|407417199|gb|EKF38000.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
Length = 328
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 70/285 (24%), Positives = 120/285 (42%), Gaps = 43/285 (15%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDEELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 205
WR + +I D+E ++PFS+H +++A KA W
Sbjct: 82 --WRYSANDCRLDHFRDI-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 263
GC+++ + + +R P + + S+ C + +
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRRLIPPIRVVVCSQGCLLAREICSNL 170
Query: 264 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
++ +L+L P+ G ++ +L +G+VGG P S YI+G + +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLY 230
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
LDPH + + A T + +++ + D +D S +GF
Sbjct: 231 LDPHCMTQEALVSSHAERAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|148707987|gb|EDL39934.1| autophagy-related 4B (yeast), isoform CRA_c [Mus musculus]
Length = 128
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 64/120 (53%), Gaps = 15/120 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 65
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 125
>gi|425784144|gb|EKV21938.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
digitatum Pd1]
Length = 208
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 67/145 (46%), Gaps = 30/145 (20%)
Query: 85 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------I 121
L D A N F DF SRI I+YR F PI +K
Sbjct: 59 LNDTAWPNA---FVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGF 115
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 181
TSD GWGCM+RS Q L+A A LGR WR+ + + E +++ +F D +PFSIH
Sbjct: 116 TSDTGWGCMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIH 172
Query: 182 NLLQAG-KAYGLAAGSWVGPYAMCR 205
+ G ++ G G W GP A +
Sbjct: 173 KFVNRGAESCGKYPGEWFGPSATAK 197
>gi|322701885|gb|EFY93633.1| cysteine protease atg4 [Metarhizium acridum CQMa 102]
Length = 255
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 67/140 (47%), Gaps = 27/140 (19%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVG 126
G A F DF+SR ++YR F DP + S TSD G
Sbjct: 116 GTGWPAAFLDDFASRFWMTYRSNFELIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSG 175
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+RS Q L+A AL LGR WR+ + DRE +L LF D +P+S+HN ++
Sbjct: 176 WGCMIRSGQSLLANALAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSVHNFVRH 232
Query: 187 GKAY-GLAAGSWVGPYAMCR 205
G+ Y G W GP A R
Sbjct: 233 GEKYCSKYPGEWFGPSATAR 252
>gi|401427503|ref|XP_003878235.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322494482|emb|CBZ29784.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 388
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 70/283 (24%), Positives = 118/283 (41%), Gaps = 40/283 (14%)
Query: 92 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
+G EF + + ++L SYR F P+ + + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAAAKKLLYFSYRNCFPPLPN-RSTTDTRWGCLVRTTQMLVGSCLLRYHCKGA 112
Query: 151 WRKPLQKPFDREYVE----ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
+ P +R+ E I LF D ++P IH + + S + P
Sbjct: 113 YVLP-----ERDNAELKERISRLFMDVPSAPLGIHKVEDEAHKNSVKYASMLSP------ 161
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHCSVFSKGQADW 265
E G+ + +A + GD AP C ++ + S ++
Sbjct: 162 ---------TEAGMAIAAALIAFHAQGGD-------APFTFCCENRNIDESAVMAKLSEG 205
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
++L++P+VLG+ ++ +Y L GI GG AS Y+ G Q + ++DP
Sbjct: 206 QHVILIIPVVLGIAPMSGQYERMLLKILDMKACCGIAGGFKQASLYMFGHQGRNVFFMDP 265
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
H VQ G+ T + DP + +GFY
Sbjct: 266 HYVQRAYTSGR------TVGTLEGARGDLAARRFDPCMVLGFY 302
>gi|225554849|gb|EEH03143.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 425
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/172 (31%), Positives = 75/172 (43%), Gaps = 27/172 (15%)
Query: 58 PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF---- 113
P+R+ S++ LL LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLIP 144
Query: 114 ---DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
DP + T+D GWGCM+RS Q L+A AL LGR WR+
Sbjct: 145 KSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRRG 204
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCR 205
+ +E ++L LF D +PFSIH ++ G A G G W GP A R
Sbjct: 205 TKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATAR 253
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 43/76 (56%), Gaps = 6/76 (7%)
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD-----LEADTSTYHSDVIRHIHL 356
+ G+P +S Y +G Q YLDPH +P + + +D + +TYH+ +R +H+
Sbjct: 255 IHGRPSSSHYFIGAQGSHFFYLDPHHTRPAL-VYRDAGDRPYTTEELNTYHTRRLRRLHI 313
Query: 357 DSIDPSLAIGFYCRDK 372
+DPS+ IGF RD+
Sbjct: 314 KDMDPSMLIGFLIRDE 329
>gi|440300801|gb|ELP93248.1| hypothetical protein EIN_056230 [Entamoeba invadens IP1]
Length = 321
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 80/322 (24%), Positives = 119/322 (36%), Gaps = 91/322 (28%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
I+ L H + + DAA I I+YR+ + +G + +TSD GWGC
Sbjct: 38 IFGLSYTHDTPSELSFADAA---------HRIHDLITITYRQKYATLGHTYLTSDAGWGC 88
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH---------LFGDSETSPFSI 180
+RS QML+ +++ + L K F EY H L D E+S SI
Sbjct: 89 AIRSVQMLLVNSIVVY---------LDKSFHPEYTSHDHIAIKNNAKQLVFDKESSVLSI 139
Query: 181 HNL-LQAGKAYGLAAGSWVGPYAMCRS--------WEALARCQRAETGLGCQSLPMAIYV 231
HN+ +Q G+ P + C + WE +R L C
Sbjct: 140 HNIYIQDAIIKHNPTGTNFLPPSTCATAVADLYNFWE-----KRTFDVLMCTEY------ 188
Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
I + ++ P LL +P ++ + N ++
Sbjct: 189 ----------------IPEVTQ-------------PTLLFIPRIVTKSERN-----FIQT 214
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
T PQS G V G A+ Y GVQE+ +LDPH VQ +G Y + I
Sbjct: 215 TSFLPQSRGFVAGIGDAAIYCFGVQEKRVFFLDPHFVQDASEVG----------YFNRPI 264
Query: 352 RHIHLDSIDPSLAIGFYCRDKG 373
+ D +D S G C +K
Sbjct: 265 FEANFDELDNSFVFGMMCENKS 286
>gi|238594668|ref|XP_002393548.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
gi|215461192|gb|EEB94478.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
Length = 142
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 63/144 (43%), Gaps = 40/144 (27%)
Query: 83 EALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI--------------------- 121
+ +G +G N EF DF+S++ ++YR F PI D+ +
Sbjct: 3 DMVGTTSGANWPPEFTADFTSKVWLTYRSHFTPIRDTNLADLPLPSIFWKKWGWGLPGLG 62
Query: 122 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
TSD GWGCMLR+ Q L+A AL+F LGR WR+P Y
Sbjct: 63 GERGWTSDSGWGCMLRTGQSLLANALVFMWLGREWRRPPAPMPTESYA------------ 110
Query: 177 PFSIHNLLQAGKAYGLAAGSWVGP 200
S+H + AGK G G W GP
Sbjct: 111 --SVHRMALAGKELGKDVGQWFGP 132
>gi|118378678|ref|XP_001022513.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89304280|gb|EAS02268.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 649
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 119/292 (40%), Gaps = 29/292 (9%)
Query: 105 ILISYRKGFDPIGD-----SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
I SYR F I D +++D GWGCM+R SQML+A+AL H L + Q
Sbjct: 145 IWFSYRNNFPLIRDVADDNQSVSNDYGWGCMIRCSQMLLAEALKRHYLNDQNIQIEQLSQ 204
Query: 160 DRE---YVEILHLFGD--SETSPFS------------IHNLLQAGKAYGLAAGSWVGPYA 202
D E Y I+ LF D SE+ + + N Y L + A
Sbjct: 205 DDEKHFYSNIIKLFLDCTSESDVLNQPGSYQDIQSKMLLNEQNLNNIYSLFGIQNICQSA 264
Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---RHCSVFS 259
+ R ++ + T + + I S + + G ++ D + S
Sbjct: 265 ILRQYQQ--NVKNWYTSIQVSVILQEILEESQSKLNSKLGFHILNFTDQIIFLKELEEAS 322
Query: 260 KGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
+ Q D IL++V L G+ K ++ +G + G YI+G QE+
Sbjct: 323 RKQNDRLNNILVMVHLKFGINKFEMQHKDYFIELLKIKNFVGALSGTETKGMYIIGFQED 382
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
I LDPH +Q G+ L+ D TY + R I L+ + +++G++ +
Sbjct: 383 RLIVLDPHFIQKSTE-GEQGLDKDYCTYFNKTPRSISLECLSSDISLGYFIQ 433
>gi|384496645|gb|EIE87136.1| hypothetical protein RO3G_11847 [Rhizopus delemar RA 99-880]
Length = 224
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 5/101 (4%)
Query: 83 EALGDAAGNNGLA-----EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 137
E + + NN + +F DF+SR+ ++YR + PI S +D+GWGCMLRS Q L
Sbjct: 120 EEISEEEDNNNMYLRWPLDFYDDFTSRLWMTYRHNYPPIRPSNHKTDIGWGCMLRSGQSL 179
Query: 138 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
+A L+ H LGR WR+ Q R+ + I L + PF
Sbjct: 180 LANTLIIHFLGRDWRRQTQNQTTRKELCIGFLMSYHQEHPF 220
>gi|395750455|ref|XP_002828707.2| PREDICTED: cysteine protease ATG4D [Pongo abelii]
Length = 296
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 93/220 (42%), Gaps = 41/220 (18%)
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
+R + +I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 51 ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILRKAVE 103
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
+ + +YV S+ C+V L + L +
Sbjct: 104 SCSEVTRLVVYV--------------------SQDCTV-----------LHMRSLAIDPS 132
Query: 280 KVNPRYIPT-LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
K +P+ L+ LGI+GGKP S Y +G Q++ +YLDPH QP +++ + +
Sbjct: 133 KDRSTCLPSSLQELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQAN 192
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTF 378
+ ++H R + +DPS +GFY D+ T
Sbjct: 193 FPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 230
>gi|428184439|gb|EKX53294.1| hypothetical protein GUITHDRAFT_133035 [Guillardia theta CCMP2712]
Length = 567
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 67/121 (55%), Gaps = 9/121 (7%)
Query: 254 HCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+CS ++ + W P++++VP+ LG + L QSLG +GG+P S Y
Sbjct: 406 NCSRMAQAREPCSWRPLIVVVPVRLGARSEDQH----LSRIDKHLQSLGFIGGRPRHSYY 461
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
VGV+ +A YLDPH QP +I K+ + +++H + L IDPSLA+GFYC D
Sbjct: 462 FVGVRGYNAYYLDPHITQPYQSIRKN---INVASFHCAHPGKMSLAHIDPSLALGFYCDD 518
Query: 372 K 372
K
Sbjct: 519 K 519
>gi|167381603|ref|XP_001735783.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165902089|gb|EDR28003.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 359
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/347 (23%), Positives = 139/347 (40%), Gaps = 67/347 (19%)
Query: 40 KRLVTAGSMRRI--------HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGDAAG 90
++LV GS + HE + P G S ++LGV K Q D+ L +
Sbjct: 5 QKLVQHGSYNILSKFYNQIGHEDIQKPIFIGGCS----FYILGVEFKTKQMDKQLAEQPP 60
Query: 91 NNGL----AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL---- 142
L A S+ ++YR G++ + +S +T+DVGWGC +R+ QM++A A+
Sbjct: 61 EVYLQYSSAPAFFRISNLFWMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIV 120
Query: 143 ---LFHRLGRPWRKPLQKPFDREYVEILHLFGDS--ETSPFSIHNLLQAGKAY--GLAAG 195
+ P+ P E + +L F DS T+P SIH++ ++ +
Sbjct: 121 YSGALNNTQTPYI-----PTKEEIMNVLVPFIDSPNSTTPLSIHHVYESRFVVEKNKSGV 175
Query: 196 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 255
+++ P + +++ L + P+ C+ ++
Sbjct: 176 NYLAPSVVAKAYSGLVNSWKL--------------------------CPIRCVMCSNVSI 209
Query: 256 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
+ + P L+ +P+VL N L+ + GIVGG + ++ G
Sbjct: 210 PTHELSKLPFKPTLVFLPIVL-----NHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGF 264
Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
+YLDPH VQP K E DT +Y + +IDP+
Sbjct: 265 HALQFLYLDPHIVQPSF---KSFTEIDTKSYSPISTNRFSVHTIDPT 308
>gi|183230042|ref|XP_653798.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169803042|gb|EAL48412.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449708555|gb|EMD47997.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 359
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/310 (23%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 70 IWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRILISYRKGFDPIGDSKITS 123
++LGV K Q D+ L + L A F + S+ ++YR G++ + +S +T+
Sbjct: 39 FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAATFFR-ISNLFWMTYRSGYEKLPNSSLTT 97
Query: 124 DVGWGCMLRSSQMLVAQAL-------LFHRLGRPWRKPLQKPFDREYVEILHLFGDS--E 174
DVGWGC +R+ QM++A A+ + P+ P +E + +L F DS
Sbjct: 98 DVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYI-----PTKQEVMNVLIPFIDSPNS 152
Query: 175 TSPFSIHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
T+P SIH++ ++ + +++ P + +++ L +
Sbjct: 153 TTPLSIHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLVNSWKL---------------- 196
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
P+ C+ ++ + + P L+ +P+VL N L+
Sbjct: 197 ----------CPIRCVMCSNVSIPTHELSKLPFKPTLVFLPIVL-----NHLIHSKLQQI 241
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
+ GIVGG + ++ G +YLDPH VQP K E DT +Y
Sbjct: 242 YKSKLFAGIVGGMGDRAIFVFGFHALQFLYLDPHIVQPSF---KSFTEIDTKSYSPIGTN 298
Query: 353 HIHLDSIDPS 362
+ +IDP+
Sbjct: 299 RFSVHTIDPT 308
>gi|118378680|ref|XP_001022514.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89304281|gb|EAS02269.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 371
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/286 (25%), Positives = 121/286 (42%), Gaps = 40/286 (13%)
Query: 99 QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
++ SS + +SY+K + IT+D GWGC LR+SQM++AQ L H + + +
Sbjct: 52 EELSSLVFLSYKKNMKEFQYLSTTITTDNGWGCSLRTSQMMLAQGLKRHLYEKRVQSFIY 111
Query: 157 KPFDREYVEILHL---FGDSET------SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 207
D+ ++ HL F +S + SPF H+LL +A L Y +
Sbjct: 112 N--DKTKLDFQHLIMMFAESNSLENMDQSPFGFHSLL--TQAINLFQVPLKQQYTPVQGI 167
Query: 208 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 267
+AL + Q L ++ +V+ V+ +D + + K
Sbjct: 168 KALKQ------QFKQQKLVKSLKIVT-------SSTGVIFQEDIRQKMKNWEKS------ 208
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
+LL++ LG K+N Y+ ++ +G +GG S ++VG + + LDPH
Sbjct: 209 LLLILHFKLGTGKLNQIYVEQIKSLMDLEYFVGAIGGIKNKSLFMVGYMNDQFLSLDPHV 268
Query: 328 VQPVINIGKDDLEADTSTYHSDVIRHIHLDS---IDPSLAIGFYCR 370
Q N KD L + S + + DS + +I FY R
Sbjct: 269 QQ---NACKDPLNLNDEEMSSFFPKKVRADSCVKYEGDFSISFYIR 311
>gi|297601024|ref|NP_001050279.2| Os03g0391000 [Oryza sativa Japonica Group]
gi|255674556|dbj|BAF12193.2| Os03g0391000, partial [Oryza sativa Japonica Group]
Length = 81
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/48 (72%), Positives = 41/48 (85%)
Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 331
RYIP L+ T TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ V
Sbjct: 10 RYIPLLKETLTFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLV 57
>gi|213514936|ref|NP_001135074.1| Cysteine protease ATG4A [Salmo salar]
gi|209738482|gb|ACI70110.1| Cysteine protease ATG4A [Salmo salar]
Length = 102
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 49/83 (59%), Gaps = 11/83 (13%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG C+ + ++ E D SR+ +YRK F PIG + +SD GWGC
Sbjct: 29 VWVLGECYNVKTEKT-----------ELLSDVHSRLWFTYRKKFSPIGGTGPSSDTGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWR 152
MLR QM++AQAL+ +LGR WR
Sbjct: 78 MLRCGQMILAQALVCSQLGRAWR 100
>gi|167391747|ref|XP_001739914.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165896205|gb|EDR23684.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 325
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 67/313 (21%), Positives = 127/313 (40%), Gaps = 67/313 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+++LG C+ +E L N+ N I+ +YR+ + +G++ ++SD GWGC
Sbjct: 36 VYILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSCLGNTYLSSDAGWGC 91
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------REYVEILHLFGDSETSPFSIHN 182
+R++QM++ L+ ++ +Q+ D + ++ L D +S SIHN
Sbjct: 92 AIRATQMMIVNTLVI------FKDQMQQIIDYNSFEHQQNKLQAKELIYDKISSLLSIHN 145
Query: 183 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
+ + K + +++ P C + +L + E ++
Sbjct: 146 IYIQEIIKVHNPTGTNFLPPSICCIAISSLLQ-----------------------EWDKK 182
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
+ C+D +CS P L L+P ++ + + + T QS G
Sbjct: 183 LFNCITCLDHIP-NCSY---------PTLYLIPQIITFTEHQ-----LILDSLTLSQSRG 227
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
VGG ++ ++ G Q + +LDPH VQ + G Y + I L I
Sbjct: 228 FVGGIGESAIFVFGYQGTTLFFLDPHYVQNAGDFG----------YFNPPTYQIDLSLIS 277
Query: 361 PSLAIGFYCRDKG 373
PS+ F C ++
Sbjct: 278 PSIVFAFMCYNEN 290
>gi|440301471|gb|ELP93857.1| hypothetical protein EIN_176840 [Entamoeba invadens IP1]
Length = 362
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 70/323 (21%), Positives = 137/323 (42%), Gaps = 42/323 (13%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ-----DFSSRILISYRKGFDPIGDSKITSD 124
++LLG+ +K + + L +++ S+ + ++YR G++ + +S + +D
Sbjct: 39 LFLLGIEYKTTPLKKQAQELPQSSLLQYSSMAAYVRMSNLLWMTYRSGYEKLPNSSLNTD 98
Query: 125 VGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVEILHLFGD--SETSPFSI 180
VGWGC +R+ QM+++ A+ L ++ P E + ++ F D +T+P SI
Sbjct: 99 VGWGCTIRAVQMMISNAMQTLVYKHDLTSSTTPYIPKQNEILNVVIPFVDFFEQTTPLSI 158
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H++ +E+ ++ ++G+ + P + D
Sbjct: 159 HHV-----------------------YESRFVVEQNKSGVNYLA-PTIVAKAYSDLVNSW 194
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
+ C+ ++ + + + P L+ +P+++ + V R L+ + F G
Sbjct: 195 KMCALRCVMASNTSIPLCDIKKEPFKPTLVFLPIIMD-QLVKSR----LQQIYKFNMFAG 249
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVI-NIGKDDLEAD---TSTYHSDVIRHIHL 356
IV G + YI G ++LDPH VQP + K DL++ T + I I L
Sbjct: 250 IVSGIGDRAVYIFGFHVMRCLFLDPHTVQPAAESFTKIDLKSYAPINPTLNRFAIHSIEL 309
Query: 357 DSIDPSLAIGFYCRDKGLLVTFE 379
D ID GF + + FE
Sbjct: 310 DKIDQFCTFGFLIKSLEEVDAFE 332
>gi|146097214|ref|XP_001468076.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
gi|134072442|emb|CAM71152.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
Length = 388
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 118/285 (41%), Gaps = 44/285 (15%)
Query: 92 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
+G EF + + ++L SYR F P+ + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGT 112
Query: 151 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
+ P + E E I LF D ++P IH + S + P
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CID----DASRHCSVFSKGQAD 264
E G+ + +A + GD P C + D + S+GQ
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGD-------VPFTFCCESRNIDEPAVMAKLSEGQH- 207
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++D
Sbjct: 208 ---VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMD 264
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIR-HIHLDSIDPSLAIGFY 368
PH +Q + +D + + R + DP + +GFY
Sbjct: 265 PHYIQ-------NAYTSDKTVGTLEGARGELSARRFDPCMVLGFY 302
>gi|398021304|ref|XP_003863815.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
gi|322502048|emb|CBZ37132.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
Length = 388
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 118/285 (41%), Gaps = 44/285 (15%)
Query: 92 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
+G EF + + ++L SYR F P+ + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGA 112
Query: 151 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
+ P + E E I LF D ++P IH + S + P
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CID----DASRHCSVFSKGQAD 264
E G+ + +A + GD P C + D + S+GQ
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGD-------VPFTFCCESRNIDEPAVMAKLSEGQH- 207
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++D
Sbjct: 208 ---VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMD 264
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIR-HIHLDSIDPSLAIGFY 368
PH +Q + +D + + R + DP + +GFY
Sbjct: 265 PHYIQ-------NAYTSDRTVGTLEGARGELSARRFDPCMVLGFY 302
>gi|119604523|gb|EAW84117.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 228
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 12/96 (12%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRL--GRPWR 152
+TSD GWGCMLRS QM++AQ LL H L G+PWR
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGKPWR 169
>gi|302915349|ref|XP_003051485.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256732424|gb|EEU45772.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 355
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/228 (30%), Positives = 103/228 (45%), Gaps = 36/228 (15%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRSS- 134
+A DE D N +F DF SRI ++YR F+ I S + TS + L+S
Sbjct: 99 LAYDEPTKD---NGWPPQFMADFESRIWMTYRSEFEAIPRSTNPQATSSLSLSMRLKSQL 155
Query: 135 ---QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
+ +++ RLGR WR+ Q P E EI+ LF D +P+S+H+ ++ G A
Sbjct: 156 GDQSPFSSDSMI--RLGRDWRR-GQSP--HEEREIIKLFADHPNAPYSLHSFVRHGASAC 210
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G G W GP A R +ALA + + +Y G P V D+
Sbjct: 211 GKYPGEWFGPSATARCIQALANSHESS---------LRVYST--------GDGPDVYEDE 253
Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+ +G+A + P L+LV LG++K+ P Y L + PQS
Sbjct: 254 FMKIAK--PEGEA-FHPTLILVGTRLGIDKITPVYWEALIASLQMPQS 298
>gi|118390095|ref|XP_001028038.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89309808|gb|EAS07796.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 1216
Score = 73.9 bits (180), Expect = 1e-10, Method: Composition-based stats.
Identities = 34/101 (33%), Positives = 61/101 (60%), Gaps = 2/101 (1%)
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
+ LL+P LGL++++P +I L+ + QS+G++GGKP + Y +G + +YLDPH
Sbjct: 493 LFLLLPCRLGLDEISPIHIEILKKLLSLKQSVGMIGGKPNKAHYFLGFVGDDLLYLDPHY 552
Query: 328 VQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
++ + K+DL + S+Y + + + ++ I SL GFY
Sbjct: 553 IKECVR--KEDLMENISSYFEEDVFKMPINKISTSLVFGFY 591
Score = 51.2 bits (121), Expect = 9e-04, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 7/54 (12%)
Query: 99 QDFSSRILISYRKGFDPIGDSKI-------TSDVGWGCMLRSSQMLVAQALLFH 145
Q + + IL +YRK F P+ KI TSD GWGCM+R+ QM+ AQ + H
Sbjct: 257 QIYQNTILFTYRKNFYPLLKDKINDPQKNQTSDAGWGCMIRAGQMIFAQTIKRH 310
>gi|157874465|ref|XP_001685715.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
gi|68128787|emb|CAJ08920.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
Length = 388
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/287 (26%), Positives = 119/287 (41%), Gaps = 48/287 (16%)
Query: 92 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
+G EF + + ++L SYR F P+ S T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKVATKKLLYFSYRNCFPPL-PSGSTTDTHWGCLVRTTQMLVGTCLLRYHCKGA 112
Query: 151 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
+ P + + E E I LF D ++P IH + S + P
Sbjct: 113 YVLP--EADNAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHC---SVFSKGQADW 265
E G+ + +A GD P C + SRH +V +K +
Sbjct: 162 ------TEAGMAIAAALIAFRAQGGD-------VPFTFCCE--SRHIDEPAVMAK-LLEG 205
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++DP
Sbjct: 206 QHVVLIIPVVLGIAPMSDQYELVMLKILDVKACCGIAGGFKQASLYMFGHQGRSVFFMDP 265
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIR----HIHLDSIDPSLAIGFY 368
H VQ A TS+ + + DP + +GFY
Sbjct: 266 HYVQ----------NAYTSSRTVGTLEGSRGELRARRFDPCMVLGFY 302
>gi|336259147|ref|XP_003344378.1| hypothetical protein SMAC_08321 [Sordaria macrospora k-hell]
Length = 429
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 58/114 (50%), Gaps = 26/114 (22%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 133
F DF SRI ++YR F DP + +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
Q L+A A+L RLGR WR+ + D E +I+ LF D +PFS+HN ++ G
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYG 290
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 42/72 (58%), Gaps = 3/72 (4%)
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSID 360
G+P +S Y +GVQ + YLDPH +P + +D + T H+ +R +H+D +D
Sbjct: 302 GRPSSSHYFIGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELDTCHTRRLRQLHIDDMD 361
Query: 361 PSLAIGFYCRDK 372
PS+ IGF +D+
Sbjct: 362 PSMLIGFLIKDE 373
>gi|324519641|gb|ADY47439.1| Cysteine protease ATG4C, partial [Ascaris suum]
Length = 282
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 72/142 (50%), Gaps = 14/142 (9%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+WLLG + ++ + + F D+ SRI ++YR P+ S T+D GWGC
Sbjct: 116 LWLLGEFYFTSRPDEDDEVV----FRAFAIDYYSRIWLTYRTELSPLPGSSKTTDCGWGC 171
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE-----YVEILHLFGDSETSPFSIHNLL 184
LR+ QM++AQAL+ LGR WR + +R + +I+ LFGD + ++ L+
Sbjct: 172 TLRTCQMMLAQALVVLHLGREWRFWGDEEANRYRCGFGHYDIVSLFGDHLDADLGLYRLM 231
Query: 185 QAGKAYGL--AAGSWVGPYAMC 204
+ K A G+W Y+ C
Sbjct: 232 KIAKERNEHDAVGNW---YSAC 250
>gi|403364614|gb|EJY82073.1| hypothetical protein OXYTRI_20407 [Oxytricha trifallax]
Length = 806
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 62/114 (54%), Gaps = 2/114 (1%)
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
+++++ + LGLE + Y L+ F+ Q +GI+GGKP + Y VG Q++ I+LDPH
Sbjct: 641 LMIIMTIRLGLENIEQDYHKALKACFSLRQCVGILGGKPNFALYFVGYQQDHMIFLDPHY 700
Query: 328 VQPVINIGKD--DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
VQ + + D E + + I ++S+DP + +GF ++ L+ E
Sbjct: 701 VQQALTSDEQLKDQELKDTYQSQRSAKKIKMESLDPCIGVGFLIQNSKDLIAIE 754
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 13/72 (18%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV----EILHLFGDSETS 176
I SD GWGCM+R QM++A + L K LQ+ + + IL + D +
Sbjct: 393 INSDCGWGCMIRCQQMMLANSFL---------KLLQQNHNFHDILTHDSILSMILDQLDA 443
Query: 177 PFSIHNLLQAGK 188
PF IH + + G+
Sbjct: 444 PFGIHQITEEGR 455
>gi|238595999|ref|XP_002393933.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
gi|215462138|gb|EEB94863.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
Length = 158
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 46/68 (67%)
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
LGL+ VNP Y T+++ +TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + +
Sbjct: 1 LGLDGVNPIYYDTIKILYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPLR 60
Query: 336 KDDLEADT 343
LE ++
Sbjct: 61 PPTLEPES 68
>gi|145507452|ref|XP_001439681.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406876|emb|CAK72284.1| unnamed protein product [Paramecium tetraurelia]
Length = 312
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 67/284 (23%), Positives = 117/284 (41%), Gaps = 59/284 (20%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
FNQ + I YR G K SD GWGC++R QM++A AL+ R+
Sbjct: 49 FNQKKDTLIWFCYRANIQFEG--KAISDQGWGCLVRVGQMMLANALM--------RECKI 98
Query: 157 KPFDREYVEILHLFGDSE----TSPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 210
++ I+HLF D++ +PFSI +++ A + G W GP M
Sbjct: 99 LAINKTKAMIIHLFDDNQEYSTIAPFSIQQIIKRASINLNMKIGDWYTGPKIM------- 151
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT---P 267
S ED + + I+ + + Q D + P
Sbjct: 152 ----------------------SVIEDLNKNNMNIKQINLVNFLEQCVLESQIDLSFKKP 189
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
LL++ ++G + + I L+ Q G + GK + +++G Q+ +AI++DPH
Sbjct: 190 HLLIIHAIIGDKSLGQLEIQNLQSHMQISQFAGAIIGKNNKAFFLIGFQKNNAIFMDPHY 249
Query: 328 VQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
VQ K ++E + ++ L ++ ++A+ FY +
Sbjct: 250 VQES---NKIEMECN--------LKCQPLKQLNGTIALAFYISN 282
>gi|167394648|ref|XP_001741038.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165894548|gb|EDR22516.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 200
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 6/100 (6%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 161
I I+YRK I + T+D GWGCM+RS QM++AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96
Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
+++ I++LFGDS S FSIH L+ G+ G W GP
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP 136
>gi|154343631|ref|XP_001567761.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065093|emb|CAM43207.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 398
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 106/269 (39%), Gaps = 39/269 (14%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
+ SYR F P+ + T+D WGC+LR++QML+ LL + + P + +
Sbjct: 74 LYFSYRSCFPPLPNGS-TTDTRWGCLLRTTQMLIGTCLLRYHCKGAYVLPEADNAELK-A 131
Query: 165 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 224
I LF D ++P IH + S + P E G+
Sbjct: 132 NISRLFMDVPSAPLGIHRAEDEAHKNCVKYASMLSP---------------TEAGMA--- 173
Query: 225 LPMAIYVVSGDEDGERGGAPVV--CIDDASRHCSVFSK---GQADWTPILLLVPLVLGLE 279
MA +++ +G G P C + +V +K GQ ++L++P+VLGL
Sbjct: 174 --MAAALIACHAEG--GDVPFTFSCENRNIDEPAVVAKLLEGQH----VILIIPVVLGLA 225
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
++ +Y + GI GG AS Y+ G Q ++DPH +Q D
Sbjct: 226 PLSDKYESMMLKILDMKACCGIAGGFKQASFYMFGHQGRKVFFMDPHYIQKAYT--SDKT 283
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
D+ DP + +GFY
Sbjct: 284 AGTLYGARGDLTAR----KFDPCMVLGFY 308
>gi|320588376|gb|EFX00845.1| cysteine protease atg4 [Grosmannia clavigera kw1407]
Length = 348
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 62/124 (50%), Gaps = 28/124 (22%)
Query: 97 FNQDFSSRILISYRKGFDPI---------------------GD-SKITSDVGWGCMLRSS 134
F DF SR ++YR GF+PI GD S +SD GWGCM+RS
Sbjct: 120 FLDDFESRFWMTYRSGFEPIARSVDPKAPATLSFTMKLKALGDQSDFSSDSGWGCMIRSG 179
Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
Q L+A A+ + LGR WR ++ EI+ LF D +P+SIH + G +A
Sbjct: 180 QSLLANAMAMYELGRGWRLSDGGIAEK---EIISLFADDPRAPYSIHRFVGHG---AVAC 233
Query: 195 GSWV 198
GS++
Sbjct: 234 GSFL 237
Score = 38.1 bits (87), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 321 IYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
YLDPH +P + + E + + H+ +R +H+ +DPS+ IGF RD+
Sbjct: 238 FYLDPHHTRPGLPFHEHPSEYTQEEVGSCHTRRLRRLHIREMDPSMLIGFLIRDE 292
>gi|124088531|ref|XP_001347134.1| Cysteine protease required for autophagy-like [Paramecium
tetraurelia strain d4-2]
gi|145474259|ref|XP_001423152.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|50057523|emb|CAH03507.1| Cysteine protease required for autophagy-like [Paramecium
tetraurelia]
gi|124390212|emb|CAK55754.1| unnamed protein product [Paramecium tetraurelia]
Length = 277
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 104/242 (42%), Gaps = 48/242 (19%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
F Q + I SYR G + SD GWGC++R QM+VA +L+
Sbjct: 14 FLQLKETFIWFSYRANIQYEG--RAISDQGWGCLIRVGQMIVANSLIRESTNS------- 64
Query: 157 KPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 210
KP D + +I+ LF D++ +PFSI +++ A Y + G W GP MC + L
Sbjct: 65 KPNDLK-TKIICLFDDNQCFSTLAPFSIQQIIKRADLVYNIKIGDWYTGPKIMCLLEDLL 123
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW---TP 267
Q A+T + I + C + + Q D P
Sbjct: 124 ---QSAKT------------------------IKQLKIINFLEQCVI--EKQIDLQFKQP 154
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
LL++ ++G ++++ ++ L+ PQ G + GK + +++G Q I +DPH
Sbjct: 155 QLLIIHAIIGNKELDQYFVAELQKHMQIPQFAGAIVGKSKKAYFLIGYQNNQGIVMDPHY 214
Query: 328 VQ 329
VQ
Sbjct: 215 VQ 216
>gi|67470848|ref|XP_651386.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|56468115|gb|EAL46000.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
Length = 325
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/315 (22%), Positives = 124/315 (39%), Gaps = 71/315 (22%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+ +LG C+ +E L N+ N I+ +YR+ + +G++ ++SD GWGC
Sbjct: 36 VHILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSYLGNTYLSSDAGWGC 91
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHN 182
+R++QM++ AL+ ++ +Q+ D E L D +S SIHN
Sbjct: 92 AIRATQMMIVNALVI------FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHN 145
Query: 183 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
+ Q K + +++ P C + +L + E
Sbjct: 146 IYIQQVIKTHNPKGTNFLPPSVCCIAISSLLQ--------------------------EW 179
Query: 241 GGAPVVCIDDASR--HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
P CI + CS P L L+P ++ + + + +L L+ QS
Sbjct: 180 DKKPFNCITCLNHIPSCS---------CPTLYLIPRIITFTE-HQLILDSLALS----QS 225
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
G VGG ++ ++ G Q + +LDPH VQ + G Y + I +
Sbjct: 226 RGFVGGIGESAIFVFGCQGTTLFFLDPHYVQNAGDFG----------YFNPPTYQIDISL 275
Query: 359 IDPSLAIGFYCRDKG 373
I S+ F C ++
Sbjct: 276 ISSSVVFAFMCYEEN 290
>gi|66359342|ref|XP_626849.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
gi|46228139|gb|EAK89038.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
Length = 348
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 122/296 (41%), Gaps = 54/296 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK------------ITSDVGWGCMLRSSQMLVAQALLF 144
F ++F IL +YR F I ++ I SDVGWGCM R +QM +A +
Sbjct: 44 FLKEFHDIILFTYRNEFKNIIITRNTVQLTKNYSKNINSDVGWGCMYRVTQMSIAHGIC- 102
Query: 145 HRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAM 203
+ K + E +IL+ F D+E++ FSIHN++ G +G+ SW+GP
Sbjct: 103 -----QFMKRFLGNLNIE--KILNNFQDNESAKFSIHNMVNIGLSEFGIDPTSWIGPTTS 155
Query: 204 CRSWEALARCQRAETGLGCQSLPMA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 262
L R+ ++ +A I V G + D A +H FS+
Sbjct: 156 SMIANKLINDNRSIIS----NIQIASITYVEG----------TIYRDQAVKH---FSEVG 198
Query: 263 ADWTPILLLVPLVLGLEKVNPR-YIPTLRLTFTFPQSLGIVGGKPGAS--TYIVGVQEES 319
+D + L + LG K N Y T+ Q + I+GG +S IV
Sbjct: 199 SDSCTFVWLC-MKLGTSKFNINSYKKTVISMSNVSQFICIMGGNNYSSGALLIVAFSNSF 257
Query: 320 AIYLDPH-DVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
LDPH V P N +DD T I+ ++ SL++ + CR+
Sbjct: 258 LYCLDPHIKVLPSFSDKNFIRDDFIQKVPT-------RIYWGELNSSLSMVYICRN 306
>gi|440291586|gb|ELP84849.1| hypothetical protein EIN_284050 [Entamoeba invadens IP1]
Length = 352
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 65/268 (24%), Positives = 113/268 (42%), Gaps = 57/268 (21%)
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--- 165
YR F P+ ++ +TSD GWGC +RS+QMLVA A+ K FD V
Sbjct: 92 YRNNFQPLPNTTLTSDSGWGCTIRSTQMLVANAI---------GKLFTNDFDTGEVTDKM 142
Query: 166 ILHLFGD--SETSPFSIHNLL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
++ F D S PFSIHNL +A + S++ P A+ ++ + + + A G
Sbjct: 143 VIKFFLDFFSVECPFSIHNLFLTKAILQGNINGNSFLPPSAVAAAFVEINK-KLANPKFG 201
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ L + V+++ P ++L+P+ + +
Sbjct: 202 MEILT------------------------TTFTFRVYTQ------PTIVLIPISIP-DSF 230
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
N + + + F+F G+VGG + Y G+ + ++LDPH V+ N +
Sbjct: 231 NDK----IAVIFSFYLFSGMVGGSGRKAFYFFGIHHDQLLFLDPHTVR---NTVINSCSF 283
Query: 342 DTSTYHSDV--IRHIHLDSIDPSLAIGF 367
D YH + ++ + +D S + F
Sbjct: 284 DPQEYHPIIGDVKALSYSLLDRSAVLAF 311
>gi|326665689|ref|XP_002661113.2| PREDICTED: cysteine protease ATG4D-like, partial [Danio rerio]
Length = 149
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 11/82 (13%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKITS 123
S +S + LLG ++++ + G+ E F + FSS + +SYR+GF P+ S ++S
Sbjct: 74 SKSSPVCLLGQSYQLS----------STGVRESFRRVFSSLLWMSYRRGFRPLDGSTLSS 123
Query: 124 DVGWGCMLRSSQMLVAQALLFH 145
D GWGCMLRS+QML+AQ LL H
Sbjct: 124 DAGWGCMLRSAQMLLAQGLLLH 145
>gi|403354729|gb|EJY76927.1| hypothetical protein OXYTRI_01553 [Oxytricha trifallax]
Length = 564
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 74/315 (23%), Positives = 123/315 (39%), Gaps = 83/315 (26%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 176
+T+D WGC +RS+QM++A AL P IL LF D+ S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQQSTFMYPVNS------------ILKLFDDNIRECTES 261
Query: 177 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 211
FSI N+ LQ G+ YG+++ + + + +C +E +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGFIVFETIM 321
Query: 212 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--------- 260
+ CQ Q L V++ + E DD + FS+
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLT-----FSQMGLGCDRRI 375
Query: 261 ---------------GQADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
+W +L++V + LGL+K++P Y + PQ +G+VGG
Sbjct: 376 NYDKLPNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGG 435
Query: 305 KPGASTYIVG------VQEESAIYLDPHDVQP-VINIGKD-DLEA-DTSTYHSDVIRHIH 355
KP + Y G + ++LDPH VQ N+ DL+ + + +H+ R +
Sbjct: 436 KPNKAFYFFGHIIDQDTNKVKLMFLDPHKVQDYTYNVETSYDLDVKEQAKFHTTEARLLK 495
Query: 356 LDSIDPSLAIGFYCR 370
+ +D L GF +
Sbjct: 496 IKELDTCLGFGFLIK 510
>gi|281208441|gb|EFA82617.1| hypothetical protein PPL_04309 [Polysphondylium pallidum PN500]
Length = 646
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 57/120 (47%), Gaps = 22/120 (18%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+ EF +DFS++I +SYR+GF IGD+ +D GWG W+K
Sbjct: 409 INEFLEDFSNKIWMSYRQGFPYIGDTMFENDCGWGY---------------------WKK 447
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALAR 212
Q + I+ +F D T+PFSIHN+ G+ + G G W P + + ++L
Sbjct: 448 SGQNEYPELLYNIVRMFLDKPTAPFSIHNIALHGQNHLGKNVGEWFAPSNITHAIKSLVN 507
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 23/33 (69%)
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
IVGGKP AS Y + Q+++ YLDPH VQ I+
Sbjct: 541 IVGGKPRASLYFIAAQDDNLFYLDPHTVQQAID 573
>gi|403370248|gb|EJY84987.1| hypothetical protein OXYTRI_17161 [Oxytricha trifallax]
Length = 564
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/315 (22%), Positives = 120/315 (38%), Gaps = 83/315 (26%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 176
+T+D WGC +RS+QM++A AL P IL LF D+ S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQQSTFMYPVNS------------ILKLFDDNIRECTES 261
Query: 177 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 211
FSI N+ LQ G+ YG+++ + + + +C +E +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGYIVFETIM 321
Query: 212 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--------- 260
+ CQ Q L V++ + E DD + FS+
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLT-----FSQMGLGCDRRI 375
Query: 261 ---------------GQADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
+W +L++V + LGL+K++P Y + PQ +G+VGG
Sbjct: 376 NYDKLPNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGG 435
Query: 305 KPGASTYIVG------VQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHSDVIRHIH 355
KP + Y G + ++LDPH VQ + D + + +H+ R +
Sbjct: 436 KPNKAFYFFGHIIDLDTNKVKLMFLDPHKVQDYTYDVETSYDLDVKEQAKFHTTEARLLK 495
Query: 356 LDSIDPSLAIGFYCR 370
+ +D L GF +
Sbjct: 496 IKELDTCLGFGFLIK 510
>gi|330846267|ref|XP_003294964.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
gi|325074459|gb|EGC28510.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
Length = 266
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 40/57 (70%), Gaps = 2/57 (3%)
Query: 91 NNGLAE--FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
NN + + F D S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+ALL H
Sbjct: 205 NNNIIQSNFLDDVRSLIWFSYRKDFPPIENTTITTDIGWGCMLRTGQMILARALLKH 261
>gi|323450755|gb|EGB06635.1| hypothetical protein AURANDRAFT_65498 [Aureococcus anophagefferens]
Length = 426
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 54/117 (46%), Gaps = 15/117 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
+ +YR GF+ + T D GWGCMLRS+QML+ AL R G R +
Sbjct: 28 LWFTYRCGFEELAPYGFTDDAGWGCMLRSAQMLLGNAL--TRNGAAPR-----------L 74
Query: 165 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
LF D+ +++PF +HN + G Y + G W GP C L +R G
Sbjct: 75 ATAALFADAPGDSAPFGLHNFAKCGLRYDVLPGEWYGPGVACHVLRDLVDWRRNAPG 131
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 59/144 (40%), Gaps = 46/144 (31%)
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGA-----STYIVGVQEE---------------- 318
++ PRY LR PQS G++GG+P A +T + ++
Sbjct: 234 RLEPRYAEPLRAALRLPQSAGMLGGRPRANRIFNTTSMCASSDQNLQLCFENSTRAIDPS 293
Query: 319 ------SAIY---------------LDPHDVQPVINIGKDDL---EADTSTYHSDVIRHI 354
+A++ LDPH VQP + +G D A S D + +
Sbjct: 294 KSGRPRAALFFPGLAARDGGADVYGLDPHTVQPALAVGDDGALGPGAAASVAPRDA-KKL 352
Query: 355 HLDSIDPSLAIGFYCRDKGLLVTF 378
D++DPSLA+ FYC D+ + F
Sbjct: 353 AADALDPSLALAFYCADRDDFLDF 376
>gi|237837057|ref|XP_002367826.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
gi|211965490|gb|EEB00686.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
Length = 3559
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 73/138 (52%), Gaps = 13/138 (9%)
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 300
GA V C+ D S + +G LLL PL L EK+NP Y+ +L P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHD-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDS 358
+V G+ + Y +G Q+++ +YLDPH +QP L A T ++ + + + +
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQPPAL----QLPAATPSFFAGSCWKVSDVAA 3079
Query: 359 IDPSLAIGFYCRDKGLLV 376
++PSLA+ F+ R++ L+
Sbjct: 3080 LNPSLAVAFFVRNERQLL 3097
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 31/58 (53%), Gaps = 17/58 (29%)
Query: 107 ISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLVAQALLFHRL 147
+YR GF P+ G+ K I SDVGWGC +R++QML+ QAL H L
Sbjct: 1148 FTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLLMQALRRHFL 1205
>gi|221481944|gb|EEE20310.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 3562
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 73/138 (52%), Gaps = 13/138 (9%)
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 300
GA V C+ D S + +G LLL PL L EK+NP Y+ +L P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHD-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDS 358
+V G+ + Y +G Q+++ +YLDPH +QP L A T ++ + + + +
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQPPAL----QLPAATPSFFAGSCWKVSDVAA 3079
Query: 359 IDPSLAIGFYCRDKGLLV 376
++PSLA+ F+ R++ L+
Sbjct: 3080 LNPSLAVAFFVRNERQLL 3097
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 31/58 (53%), Gaps = 17/58 (29%)
Query: 107 ISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLVAQALLFHRL 147
+YR GF P+ G+ K I SDVGWGC +R++QML+ QAL H L
Sbjct: 1148 FTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLLMQALRRHFL 1205
>gi|340508254|gb|EGR34000.1| peptidase family c54 protein, putative [Ichthyophthirius
multifiliis]
Length = 209
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 20/143 (13%)
Query: 99 QDFSSRILISYRKGFDPI----GDSKIT---SDVGWGCMLRSSQMLVAQALLFHRLGR-- 149
++F + I ++YR+ F P+ D KI SD GWGCM+R QM +A+ L H +
Sbjct: 24 ENFKNIIWMTYRRNFFPLLHNTKDHKIQNYISDTGWGCMVRVGQMALAEGLRHHLQQKGI 83
Query: 150 -PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSW 207
++ +Q D + FGD +P+SI + + A K + L G W P +C
Sbjct: 84 YDNKRIIQAFLDND-------FGDDNIAPYSIQKICKIAYKEFQLVPGQWYSPVRICHVL 136
Query: 208 EALARCQRAETGLGCQSLPMAIY 230
L + L C+ L + ++
Sbjct: 137 SLLHN--DKKQILDCEDLKVGVF 157
>gi|46136685|ref|XP_390034.1| hypothetical protein FG09858.1 [Gibberella zeae PH-1]
Length = 360
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 96/228 (42%), Gaps = 35/228 (15%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRSS- 134
+A D+ + D +G F DF S+I ++YR F+PI S + TS + L+S
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158
Query: 135 --QMLVAQALLFHRLGR-PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
Q + + RLGR WR+ E +L F D +P+SIH+ ++ G A
Sbjct: 159 GDQSPFSSDTMV-RLGRGDWRRGESV---EEECRLLKDFADDPRAPYSIHSFVRHGASAC 214
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G G W GP A R +AL + +I V S G P V D+
Sbjct: 215 GKYPGEWFGPSATARCIQALTNSHES-----------SIRVYST------GDGPDVYEDE 257
Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+ D+ P L+LV LG++K+ P Y L PQS
Sbjct: 258 ---FMQIAKPPGEDFHPTLVLVGTRLGIDKITPVYWEALIAALQMPQS 302
>gi|67482849|ref|XP_656724.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|56473943|gb|EAL51338.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449705841|gb|EMD45804.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 348
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 118/290 (40%), Gaps = 67/290 (23%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+S I YR F + ++ +TSD GWGC +R+ QML+A A++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANAII-------------KLFGS 131
Query: 162 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQR 215
+ + ++H F D S P+SIH+L G GS P++
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFTTQIIVSGNPNGSSFLPFS------------- 178
Query: 216 AETGLGCQSLPMAIYVVSG--DEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILL 270
IY ++ ++D R C V + ++ P ++
Sbjct: 179 -----------SVIYALTELVNKDFNRAF-----------ECHVITNKFLLKSINKPTIV 216
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
+P + +K + R I F+F G+VGG + Y G+ ++LDPH V+P
Sbjct: 217 FIPFTIP-DKFDQRLIT----IFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRP 271
Query: 331 VI-NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+I K D E D SD I+ + ++ ++ S+ F L++ +
Sbjct: 272 CASSIMKFD-EKDYIAKLSD-IKSLRINELERSVVFSFVIHSFQELISLQ 319
>gi|307190834|gb|EFN74684.1| Cysteine protease ATG4B [Camponotus floridanus]
Length = 93
Score = 62.4 bits (150), Expect = 4e-07, Method: Composition-based stats.
Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 19/91 (20%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIG-- 117
I + IW+LG + N L E + +D S + +YRKGF PIG
Sbjct: 16 IPQTDEPIWILGKKY--------------NALKELDMIRRDIRSMLWFTYRKGFIPIGGC 61
Query: 118 DSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
+S TSD GWGCMLR QM++AQAL+ LG
Sbjct: 62 NSTFTSDKGWGCMLRCGQMVLAQALITLHLG 92
>gi|221505025|gb|EEE30679.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 3554
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 63/112 (56%), Gaps = 7/112 (6%)
Query: 268 ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
LLL PL L EK+NP Y+ +L P SLG+V G+ + Y +G Q+++ +YLDPH
Sbjct: 2988 CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLGMVAGRGQQAFYCIGTQQKALLYLDPH 3047
Query: 327 D-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDSIDPSLAIGFYCRDKGLLV 376
+QP L A T ++ + + + +++PSLA+ F+ R++ L+
Sbjct: 3048 SGIQPPAL----QLPAATPSFFAGSCWKVSDVAALNPSLAVAFFVRNERQLL 3095
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 31/58 (53%), Gaps = 17/58 (29%)
Query: 107 ISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLVAQALLFHRL 147
+YR GF P+ G+ K I SDVGWGC +R++QML+ QAL H L
Sbjct: 1148 FTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLLMQALRRHFL 1205
>gi|401403014|ref|XP_003881388.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325115800|emb|CBZ51355.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 3465
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 60/112 (53%), Gaps = 7/112 (6%)
Query: 268 ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
LLL PL L EK+NP Y+P+L P S+G+V G+ + Y +G Q+++ +YLDPH
Sbjct: 2955 CLLLFPLTLCSGEKINPVYVPSLLAYLELPWSVGMVAGRGQQAFYCIGTQQKALLYLDPH 3014
Query: 327 D-VQ-PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLV 376
+Q P + + A S + + + +++PSL++ F+ R L
Sbjct: 3015 SGIQPPALQL----PSATPSFFAGSCWKIADVAALNPSLSVAFFVRSGSQLA 3062
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 17/70 (24%)
Query: 96 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 138
+ +Q S +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 942 QLSQTVGSIARFTYRSGFSPMYKCCGEKKRRAGGGFEREWIAINSDVGWGCTVRAAQMLL 1001
Query: 139 AQALLFHRLG 148
QAL H LG
Sbjct: 1002 MQALRRHFLG 1011
>gi|167385012|ref|XP_001737178.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900129|gb|EDR26546.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 348
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 66/277 (23%), Positives = 116/277 (41%), Gaps = 65/277 (23%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+S I YR F + ++ + SD GWGC +R+ QML+A A++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLKSDGGWGCTIRACQMLLANAII-------------KLFGS 131
Query: 162 EYVE---ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
+ + ++H F D + P+SIH+L + +G+
Sbjct: 132 DNINRKTVIHWFLDFYNVECPYSIHSLFTTQI---IVSGN-------------------- 168
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR--HCSVFSKG---QADWTPILLL 271
G LP+++ + E + D +R C V + + P ++
Sbjct: 169 --PNGSSFLPLSVVTYALTELVNK---------DLNRIFECHVITNKFLLNSINKPTIIF 217
Query: 272 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 331
+P + ++ N R I F+F G+VGG + Y G+ + ++LDPH V+P
Sbjct: 218 IPFTIP-DEFNQRLIS----IFSFNLFAGMVGGCKQKAFYFFGIHHDQLLFLDPHFVRPC 272
Query: 332 I-NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+I K D E D SD I+ +H++ ++ S+ F
Sbjct: 273 ASSIMKFD-EKDYIAKLSD-IKSLHINELERSVVFSF 307
>gi|209880175|ref|XP_002141527.1| peptidase family C54 [Cryptosporidium muris RN66]
gi|209557133|gb|EEA07178.1| peptidase family C54, putative [Cryptosporidium muris RN66]
Length = 353
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 83/317 (26%), Positives = 125/317 (39%), Gaps = 47/317 (14%)
Query: 75 VCHKIAQ-DEALGDAAGNNGLAE----FNQDFSSRILISYRKGFDPIGD---------SK 120
+ + I Q D++L GN A+ F + F IL SYR F I S
Sbjct: 20 IIYNIDQHDDSLIFLFGNKYDADKYDSFLKSFHEIILFSYRYNFPTIRSEWDFSIETGSS 79
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
+T+D+GWGCMLR QM +A LL R K + IL F D E S FSI
Sbjct: 80 VTTDLGWGCMLRVIQMSLALGLL--------RYCKMKKYTYSLDYILQNFQDLEESLFSI 131
Query: 181 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
H ++ G + W GP + + L + P
Sbjct: 132 HQFVKVGCSIFNKKPKDWFGPTSASTIADYLVKNN-----------PFLFNNFRISSILF 180
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN-PRYIPTLRLTF-TFPQ 297
+ G I ++ S ++ ++ T + + LG +N +Y ++ F PQ
Sbjct: 181 KDGT----IYKSNLFQSFKNEEYSENTLTFVWLCTRLGSSALNIQKYKDSIFSIFKNVPQ 236
Query: 298 SLGIVGGKPGAST--YIVGVQEESAIYLDPH-DVQPVINIGKDDLEADTSTYHSDVIRHI 354
+ I GG +S+ IVG E+ LDPH +Q I + E + V I
Sbjct: 237 LICIAGGHNCSSSALLIVGASEKFLYCLDPHIKLQEAFVIKNFNREE----FIQQVPMRI 292
Query: 355 HLDSIDPSLAIGFYCRD 371
++++PSL+ F C D
Sbjct: 293 SWENLNPSLSFVFCCTD 309
>gi|407037690|gb|EKE38747.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 348
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/290 (20%), Positives = 113/290 (38%), Gaps = 67/290 (23%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+S I YR F + ++ +TSD GWGC +R+ QML+A +++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANSII-------------KLFGS 131
Query: 162 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
+ + ++H F D S P+SIH+L + +
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFT-------------------------TQIIVS 166
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILLLVP 273
+ G LP ++ + + E + + + C + + + P ++ +P
Sbjct: 167 KNPNGSSFLPFSVVIYALTELVNKDF-------NRAFECHIITNKFLLNSINKPTIVFIP 219
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP--- 330
+ E + L F+F G+VGG + Y G+ ++LDPH V+P
Sbjct: 220 FTIPDE-----FEQRLITIFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRPCAS 274
Query: 331 -VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLVTFE 379
+I + D A S I+ + ++ ++ S+ F L++ +
Sbjct: 275 SIIKFDEKDYIAKLSD-----IKSLRINELERSVVFSFVIHSFQELISLQ 319
>gi|14043289|gb|AAH07639.1| ATG4D protein [Homo sapiens]
gi|16877152|gb|AAH16845.1| ATG4D protein [Homo sapiens]
gi|119604522|gb|EAW84116.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|325464017|gb|ADZ15779.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
construct]
Length = 141
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y +G Q++ +YLDPH QP +++ + D + ++H R + +DP
Sbjct: 1 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDP 58
Query: 362 SLAIGFYCRDKGLLVT 377
S +GFY D+ T
Sbjct: 59 SCTVGFYAGDRKEFET 74
>gi|307201261|gb|EFN81130.1| Cysteine protease ATG4B [Harpegnathos saltator]
Length = 98
Score = 58.9 bits (141), Expect = 4e-06, Method: Composition-based stats.
Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 13/81 (16%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 127
+W+LG + ++ L +D S + +YRKGF PIG +S TSD GW
Sbjct: 23 VWILGRVYNAIKE-----------LDIIRRDIRSILWFTYRKGFVPIGGCNSTFTSDKGW 71
Query: 128 GCMLRSSQMLVAQALLFHRLG 148
GCMLR QM++A+AL+ LG
Sbjct: 72 GCMLRCGQMVLARALITLHLG 92
>gi|395756856|ref|XP_002834509.2| PREDICTED: cysteine protease ATG4D-like [Pongo abelii]
Length = 141
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y +G Q++ +YLDPH QP +++ + + + ++H R + +DP
Sbjct: 1 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE--SFHCTSPRKMAFAKMDP 58
Query: 362 SLAIGFYCRDKGLLVT 377
S +GFY D+ T
Sbjct: 59 SCTVGFYAGDRKEFET 74
>gi|294953189|ref|XP_002787639.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239902663|gb|EER19435.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 341
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 18/101 (17%)
Query: 105 ILISYRKGFDPI----GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
IL +YR F+PI G + + SD GWGC +R++QML+AQA+ G+ D
Sbjct: 67 ILFTYRCAFEPIEGCVGPTSV-SDKGWGCAIRATQMLLAQAV--KMAGK----------D 113
Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGK-AYGLAAGSWVGP 200
+ +L LF DS +P S+H +++ G+ G+W GP
Sbjct: 114 ADDSVVLSLFLDSPQAPLSLHRMVKMGQEVLAKRPGTWFGP 154
>gi|407037201|gb|EKE38550.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 193
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 74/150 (49%), Gaps = 25/150 (16%)
Query: 52 HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRI 105
HE V P G S ++LGV K Q D+ L + L A F + S+
Sbjct: 25 HEDVQKPIFVGGCS----FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAAAFFR-ISNLF 79
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-------LFHRLGRPWRKPLQKP 158
++YR G++ + +S +T+DVGWGC +R+ QM++A A+ + P+ P
Sbjct: 80 WMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYI-----P 134
Query: 159 FDREYVEILHLFGDS--ETSPFSIHNLLQA 186
+E + +L F DS T+P SIH++ ++
Sbjct: 135 TKQEVMNVLIPFIDSPNSTTPLSIHHVYES 164
>gi|407043625|gb|EKE42056.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 183
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 35/186 (18%)
Query: 28 SVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGD 87
++GS +S + KRL+ L P + + + +LG C+ +E L
Sbjct: 10 NIGSYFYNSMSSKRLIK-----------LQPF-----TQKNVVHILGNCYYPETNENLNH 53
Query: 88 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
N+ N I+ +YR+ + +G++ ++SD GWGC +R++QM+V AL+
Sbjct: 54 LTFNDA----NLKIHDLIVATYRQKYSYLGNTYLSSDAGWGCAIRATQMMVVNALVI--- 106
Query: 148 GRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHNLL--QAGKAYGLAAGSWV 198
++ +Q+ D E L D +S SIHN+ Q K + +++
Sbjct: 107 ---FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHNIYIQQVIKTHNPKGTNFL 163
Query: 199 GPYAMC 204
P C
Sbjct: 164 PPSICC 169
>gi|307108757|gb|EFN56996.1| hypothetical protein CHLNCDRAFT_143632 [Chlorella variabilis]
Length = 538
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 62/129 (48%), Gaps = 26/129 (20%)
Query: 136 MLVAQALLFHRLGRPWR----------------KPLQKPFDREYVEILHLFGDS--ETSP 177
M++AQ L+ H LGR WR +L LF D+ E +P
Sbjct: 1 MILAQGLVRHVLGREWRWPEAARQQQAAAAPALAAAPAEAPPRLARLLELFWDTPAERNP 60
Query: 178 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
FS+H+L +AG+A G+ AG W+GP+ MC++ A A R Q + + + V E
Sbjct: 61 FSLHSLCRAGQACGVVAGRWLGPWVMCKTLAAAAGAARR------QGVDLGLTVAVLAES 114
Query: 238 GERGGAPVV 246
G GGAP++
Sbjct: 115 G--GGAPLL 121
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 26/37 (70%)
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 316
K+NPRYIP L PQS+GIVGG+P +S Y VG Q
Sbjct: 215 KLNPRYIPQLEAVLAMPQSIGIVGGRPSSSLYFVGFQ 251
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 29/55 (52%), Gaps = 5/55 (9%)
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKG 373
S IYLDPH VQ D T+ + R + L SIDPSLA+GFYC G
Sbjct: 331 SVIYLDPHQVQEAAACPDD-----WRTFWCETPRSMPLPSIDPSLALGFYCSSLG 380
>gi|156085180|ref|XP_001610073.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797325|gb|EDO06505.1| hypothetical protein BBOV_II005540 [Babesia bovis]
Length = 206
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 30/135 (22%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFD-------------------PI-GDSKITS 123
A+ D L E +DF IL++YR+G P+ + I +
Sbjct: 17 AMCDQNPGPKLRERLKDF---ILLTYRRGLSIHLPRFYAGNIPKRFYGIWPLWQQTDIKT 73
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGC LR++QM +A+AL R PL + IL LF D+ +PFS+ NL
Sbjct: 74 DRGWGCALRATQMALAEAL------RDVLSPLDN-VQEQRSRILQLFYDTTEAPFSLENL 126
Query: 184 LQAGKAYGLAAGSWV 198
+ A +G +W+
Sbjct: 127 VMADVEHGANVVAWI 141
>gi|340500608|gb|EGR27474.1| peptidase family c54 protein, putative [Ichthyophthirius
multifiliis]
Length = 384
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 41/81 (50%), Gaps = 2/81 (2%)
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
S+G++GG PG + Y +G+ + IYLDPH +Q K D TY I +
Sbjct: 223 SIGMIGGVPGKAYYFLGIIDNDFIYLDPHYIQEAHQNEKTVQNID--TYFCKFINRVSQK 280
Query: 358 SIDPSLAIGFYCRDKGLLVTF 378
++ SLA GFY ++ L F
Sbjct: 281 KLESSLAFGFYIKNLQELEQF 301
>gi|328852471|gb|EGG01617.1| Hypothetical protein MELLADRAFT_92005 [Melampsora larici-populina
98AG31]
Length = 134
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 267 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDRVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|328859149|gb|EGG08259.1| Hypothetical protein MELLADRAFT_123247 [Melampsora larici-populina
98AG31]
Length = 134
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 267 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDQVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|408392897|gb|EKJ72185.1| hypothetical protein FPSE_07642 [Fusarium pseudograminearum CS3096]
Length = 389
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 26/94 (27%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 116
+A D+ + D +G F DF S+I ++YR F+PI
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158
Query: 117 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
GD S +SD GWGCM+RS Q ++A + RLGR
Sbjct: 159 GDQSPFSSDSGWGCMIRSGQSMLANTIAMVRLGR 192
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSID 360
G+P +S Y +G Q YLDPH + + +D +E + ++ H+ +R IH+ +D
Sbjct: 262 GRPSSSHYFIGAQGSFLFYLDPHHTRVALPYREDPIEYTSEEIASCHTPRLRRIHVREMD 321
Query: 361 PSLAIGFYCRDK 372
PS+ IGF +++
Sbjct: 322 PSMLIGFLIQNE 333
>gi|328852767|gb|EGG01910.1| Hypothetical protein MELLADRAFT_123246 [Melampsora larici-populina
98AG31]
Length = 134
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 267 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDRVNINPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|224010768|ref|XP_002294341.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969836|gb|EED88175.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 658
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 55/144 (38%), Gaps = 48/144 (33%)
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-----------------QEESAIY-LD 324
P Y TL +FPQS+G++GG P + + G QE Y LD
Sbjct: 418 PTYGSTLAKLLSFPQSVGMLGGTPRHALWFYGADEVDPPTFGDDGKALNGQECGGWYGLD 477
Query: 325 PHDVQ------PVINIGKDDLEADT------------------------STYHSDVIRHI 354
PH Q GKD++ +D +T H++ R I
Sbjct: 478 PHTTQVAPRGTRTTKYGKDEVSSDDIELNNCQWQVQLNDAYLRSLHFTPTTTHANHQRSI 537
Query: 355 HLDSIDPSLAIGFYCRDKGLLVTF 378
L +DPS A+GFY RD V F
Sbjct: 538 PLSKLDPSCALGFYIRDHSDFVQF 561
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFH 145
+ SD GWGCMLRS+QM++AQ + H
Sbjct: 133 LKSDAGWGCMLRSAQMMMAQTVRMH 157
>gi|440292697|gb|ELP85881.1| hypothetical protein EIN_133850 [Entamoeba invadens IP1]
Length = 348
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 74/298 (24%), Positives = 109/298 (36%), Gaps = 61/298 (20%)
Query: 95 AEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
++ + S+ ++YR GF + +T+D GWGC +RS QML +L+ R+ P
Sbjct: 62 SQIAKHLSTLFKVTYRNGFTYHLPHCSLTTDAGWGCTIRSVQMLFLNSLI--RIQEP--- 116
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR- 212
FD+ DS+T + G V P + R + L
Sbjct: 117 --DPGFDK----------DSQTK---------------MKKGFLVHPMDVRREYVQLIED 149
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS--------RHCSV-----FS 259
R E L + V ++ G +P C S R C V F
Sbjct: 150 TPRKEAVLSIHKMFDLEVVRKNNQKGTNYLSPSTCATAISVLMEQWDERPCHVMFVQTFP 209
Query: 260 KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES 319
K T +++L PL N R + +P G+V G + Y+VG
Sbjct: 210 KHVEPNTILMVLAPL-------NER----TQCCLDYPFVSGVVCGVETRAIYVVGHSGGV 258
Query: 320 AIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI-GFYCRDKGLLV 376
+ LDPH VQ G D+ D S D I+ + L + I F RD L V
Sbjct: 259 LLLLDPHHVQKAHEDGDFDI-TDYSVRTKD-IKMVGLSQLAFGNCIWSFLVRDNNLEV 314
>gi|312381461|gb|EFR27207.1| hypothetical protein AND_06241 [Anopheles darlingi]
Length = 307
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 26/38 (68%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 131
+ F +DF SRI ++YR+ F + DS TSD GWGCM+
Sbjct: 195 IEAFRRDFVSRIWMTYRREFQTMDDSNYTSDCGWGCMI 232
>gi|145521674|ref|XP_001446691.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414171|emb|CAK79294.1| unnamed protein product [Paramecium tetraurelia]
Length = 473
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 56/116 (48%), Gaps = 27/116 (23%)
Query: 105 ILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQAL-----LFHRLGRPWRKPLQK 157
I +YR+GF DS +T+D GWGC++R QM++A+ L F+++ PL +
Sbjct: 52 IRFTYRQGFQAYQCQDSALTTDSGWGCVIRVGQMMMAELLKRHLKCFYKVDLFSFPPLLQ 111
Query: 158 PFDREYVEILHLFGDSE--------TSP----FSIHNLLQ-AGKAYGLAAGSWVGP 200
++L +F D + + P FSI +++ A K +G G W P
Sbjct: 112 -------DVLQMFKDDDDMESQKGFSKPSKYGFSIQKIMRVAYKEWGKKPGEWYSP 160
Score = 41.2 bits (95), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 30/54 (55%)
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
+G ++ NP Y+ +R G++GG+P + +IVG + + LDPH VQ
Sbjct: 286 IGCDEPNPDYLQAIRQFMKKKYFAGMLGGRPKEANFIVGFVDNKFVVLDPHLVQ 339
>gi|145500036|ref|XP_001436002.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403139|emb|CAK68605.1| unnamed protein product [Paramecium tetraurelia]
Length = 469
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 27/116 (23%)
Query: 105 ILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQAL-----LFHRLGRPWRKPLQK 157
I +YR+GF +S +T+D GWGC++R QM++A+ L F+ + PL +
Sbjct: 52 IRFTYREGFQAYQCQNSTLTTDSGWGCVIRVGQMMMAELLKRHLKCFYNVNLFQFPPLMQ 111
Query: 158 PFDREYVEILHLFGDSETSP------------FSIHNLLQ-AGKAYGLAAGSWVGP 200
E+L LF D + FSI +++ A + +G G W P
Sbjct: 112 -------EVLQLFKDDDEMESLKVQGKPSKYGFSIQKIMRIAYEEWGKKPGEWYSP 160
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 12/105 (11%)
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
+G ++ NP YI +R G++GG+P + +IVG ++ + LDPH VQ N+
Sbjct: 286 IGCDEPNPDYIQAIRQFMKKKYFAGLLGGRPREANFIVGFVDDKFVVLDPHLVQQA-NMN 344
Query: 336 KDDLEAD----TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKGLLV 376
++ + + SD ID SL + FY +++ L+
Sbjct: 345 PEEYVKSCFPGEALFMSD-------KEIDCSLGLVFYLKNEEDLI 382
>gi|307108756|gb|EFN56995.1| hypothetical protein CHLNCDRAFT_143631 [Chlorella variabilis]
Length = 137
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 9/103 (8%)
Query: 33 LGSSETVKRLVTAGSMRRIHERVLGPSRTGIS-SSTSDIWLLGVCHKIAQDEALGDAAGN 91
LG S + L A + ++H+ + +G S + + +WLLG C+ + +A
Sbjct: 15 LGLSRSYYALARALRLNKLHDLLA----SGASITPDAPVWLLGQCYSCPPGAS--EAQQE 68
Query: 92 NGLAEFNQDFSSRILISYRKGFDPI--GDSKITSDVGWGCMLR 132
LA + S +SYR GF I G + + SD GWGC LR
Sbjct: 69 EALARMLHHYQSIPWMSYRTGFTSIAAGSAHLQSDAGWGCTLR 111
>gi|167386236|ref|XP_001737678.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165899448|gb|EDR26037.1| hypothetical protein EDI_014170 [Entamoeba dispar SAW760]
Length = 346
Score = 44.3 bits (103), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 20/94 (21%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
NN +A + S+ ++YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 57 TSNNNIA---KHLSTMFRVTYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111
Query: 148 GRP-------WRKPLQKPF-------DREYVEIL 167
P + +QK F REYV+++
Sbjct: 112 QEPNPGFGEDAAEKVQKNFIIHSMEERREYVQLI 145
>gi|195350255|ref|XP_002041656.1| GM16787 [Drosophila sechellia]
gi|194123429|gb|EDW45472.1| GM16787 [Drosophila sechellia]
Length = 135
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 34/66 (51%), Gaps = 11/66 (16%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +++W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTNVWVLGKKYNAIQELEL-----------IRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 123 SDVGWG 128
+D GWG
Sbjct: 92 TDKGWG 97
>gi|183234005|ref|XP_652043.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169801304|gb|EAL46674.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449707706|gb|EMD47317.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 346
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 6/63 (9%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
NN +A + S+ I+YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 57 TSNNNIA---KHLSTLFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111
Query: 148 GRP 150
P
Sbjct: 112 QEP 114
>gi|389585790|dbj|GAB68520.1| peptidase, partial [Plasmodium cynomolgi strain B]
Length = 894
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
F+ R Y KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 418 FTKRKRTKYTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 466
>gi|407038566|gb|EKE39191.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 346
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 6/62 (9%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
NN +A + S+ I+YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 58 SNNNVA---KHLSTMFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RLQ 112
Query: 149 RP 150
P
Sbjct: 113 EP 114
>gi|294877403|ref|XP_002767983.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
gi|239870083|gb|EER00701.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
Length = 133
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 30/42 (71%), Gaps = 5/42 (11%)
Query: 105 ILISYRKGFDPI----GDSKITSDVGWGCMLRSSQMLVAQAL 142
IL +YR F+PI G + + SD GWGC +R++QML+AQA+
Sbjct: 67 ILFTYRCAFEPIEGCVGPTSV-SDKGWGCAIRATQMLLAQAV 107
>gi|84994978|ref|XP_952211.1| autophagy-related peptidase [Theileria annulata strain Ankara]
gi|65302372|emb|CAI74479.1| autophagy-related peptidase, putative [Theileria annulata]
Length = 350
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 102/302 (33%), Gaps = 90/302 (29%)
Query: 85 LGDAAGNNGLAEFNQDFSSR--ILISYRKG-------------------FDPIGDSK--- 120
+ + N +N+ SR IL +YR G F P+ S
Sbjct: 1 MSNVVRENVNVLYNKRLESRFGILFTYRYGLEYKFPRPINFKRRRLFNIFSPLNLSNGIV 60
Query: 121 -ITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------RKPLQKPFDREYVE----- 165
I SD GWGC+LRS+QM ++QALL LG + R P + D+ +
Sbjct: 61 TIDSDKGWGCVLRSTQMAISQALLNLVLGPEFSVEQLEIRNRTPRNRKIDQSLLNIDTFE 120
Query: 166 -----------------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW-VGPY--AMCR 205
IL F D + FSI+N + A GP A+C
Sbjct: 121 KLLNGLLDLDGVSAVSVILAQFYDDLNAVFSIYNFVIADYVLKTCTKFLHFGPTSAALC- 179
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
A + +LP+ + D H S + +
Sbjct: 180 ----------ASKIINDLNLPIN----------------SIAFPDGVFHISDVREILEEK 213
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP-GASTYIVGVQEESAIYLD 324
+L+ V L+++ +R F Q GI+GG S YI G + Y D
Sbjct: 214 RNLLVWVSNKKKLDRIER---ECVRSMFRLSQFNGIIGGNLFNKSYYIFGTTNKRLYYND 270
Query: 325 PH 326
PH
Sbjct: 271 PH 272
>gi|221060360|ref|XP_002260825.1| peptidase [Plasmodium knowlesi strain H]
gi|193810899|emb|CAQ42797.1| peptidase, putative [Plasmodium knowlesi strain H]
Length = 1001
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
+F++R + KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 464 NFTNRRRTKHTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 513
>gi|156102174|ref|XP_001616780.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805654|gb|EDL47053.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1007
Score = 42.0 bits (97), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
F+ R Y KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 468 FAKRKRDRYSKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 516
>gi|193784751|dbj|BAG53904.1| unnamed protein product [Homo sapiens]
Length = 146
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 1/75 (1%)
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
P SL G T ++ EE IYLDPH QP + D S + +
Sbjct: 4 PLSLSSAGSATHLPTCLILPGEE-LIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPCRMS 62
Query: 356 LDSIDPSLAIGFYCR 370
+ +DPS+A+GF+C+
Sbjct: 63 IAELDPSIAVGFFCK 77
>gi|50303849|ref|XP_451871.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49641003|emb|CAH02264.1| KLLA0B07667p [Kluyveromyces lactis]
Length = 1999
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 74/173 (42%), Gaps = 20/173 (11%)
Query: 10 ASKCFS------KSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGI 63
+SKCF KS DT ++L S + S++VKRL T M I R+ G R
Sbjct: 1024 SSKCFEFLAKSVKSDDDTLLQALRDATSNVLFSKSVKRLQTLYKMDGI--RMDGHRRVSR 1081
Query: 64 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGL-AEFNQD----FSSRILISYRKGFDPIGD 118
S L + K DE +N + A F +D +LI R+ D + D
Sbjct: 1082 SQ------LTHILFKERTDEYDRSIIDSNSIYALFKKDNVNLTKKMVLIEERRLNDYLAD 1135
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLF-HRLGRPWRKPLQKPFDREYVEILHLF 170
+ + G+ C LR + + + A L + R W ++ R+ +++L +F
Sbjct: 1136 DRYQKEAGYACALRVIRKVASTAYLRDFKSTREWYLAARENVKRQRIQLLPVF 1188
>gi|426336111|ref|XP_004029547.1| PREDICTED: uncharacterized protein LOC101129491 [Gorilla gorilla
gorilla]
Length = 351
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 51 ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP 91
>gi|148682816|gb|EDL14763.1| mCG116861, isoform CRA_a [Mus musculus]
Length = 127
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 35/56 (62%), Gaps = 2/56 (3%)
Query: 318 ESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLDSIDPSLAIGFYCRDK 372
+ I+LDPH Q ++I + L D T+H + + + ++DPS+A+GF+C+++
Sbjct: 1 DELIFLDPHTTQTFVDIEESGL-VDDQTFHCLQSPQRMSILNLDPSVALGFFCKEE 55
>gi|294954843|ref|XP_002788322.1| hypothetical protein Pmar_PMAR026708 [Perkinsus marinus ATCC 50983]
gi|239903634|gb|EER20118.1| hypothetical protein Pmar_PMAR026708 [Perkinsus marinus ATCC 50983]
Length = 345
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 25/100 (25%), Positives = 46/100 (46%), Gaps = 26/100 (26%)
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESA-------------------IYLDPHDVQPVIN 333
P +G++GG+ + Y+VGV E+ + +DPH VQ +
Sbjct: 207 LKLPWCVGVIGGQSTRAHYVVGVAEKDTYLQSSTWGRSGYRQTRTDLLSIDPHFVQSAV- 265
Query: 334 IGKDDLEADTSTY-HSDVIRHIHLDSIDPSLAIGFYCRDK 372
+EA + ++ +SD + ++PSL +GFY +D+
Sbjct: 266 -----VEAQSISFKNSDEPSRLQPTKLNPSLGVGFYVKDE 300
>gi|124025328|ref|YP_001014444.1| acetyltransferase [Prochlorococcus marinus str. NATL1A]
gi|123960396|gb|ABM75179.1| possible acetyltransferase [Prochlorococcus marinus str. NATL1A]
Length = 180
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 8/73 (10%)
Query: 56 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 115
LG ++ G+S ++ K+ DE L + G FNQ SS + S+ K FD
Sbjct: 4 LGSTKIGMSGWKNE--------KLLSDETLKNIYGKQAFQYFNQTNSSLFVFSHSKSFDL 55
Query: 116 IGDSKITSDVGWG 128
I ++ VGWG
Sbjct: 56 IELEQLLQAVGWG 68
>gi|72383728|ref|YP_293083.1| acetyltransferase [Prochlorococcus marinus str. NATL2A]
gi|72003578|gb|AAZ59380.1| acetyltransferase, GNAT family [Prochlorococcus marinus str.
NATL2A]
Length = 180
Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 8/73 (10%)
Query: 56 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 115
LG ++ G+S ++ K+ DE L + G FNQ SS + S+ K FD
Sbjct: 4 LGSTKIGMSGWKNE--------KLLSDETLKNIYGKQAFQYFNQTNSSLFVFSHSKSFDL 55
Query: 116 IGDSKITSDVGWG 128
I ++ VGWG
Sbjct: 56 IELEQLLQAVGWG 68
>gi|427707351|ref|YP_007049728.1| hypothetical protein Nos7107_1953 [Nostoc sp. PCC 7107]
gi|427359856|gb|AFY42578.1| hypothetical protein Nos7107_1953 [Nostoc sp. PCC 7107]
Length = 129
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 14/82 (17%)
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQ VGGK G +TY VG + +N+G + DV++ I+
Sbjct: 31 PQPWVSVGGKDGDTTYAVGARA--------------LNLGVEVGNGPDGATGVDVLKFIN 76
Query: 356 LDSIDPSLAIGFYCRDKGLLVT 377
L I P + +G Y +DKG+ V+
Sbjct: 77 LPVISPYVGVGLYSQDKGVAVS 98
>gi|407037202|gb|EKE38551.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 157
Score = 38.9 bits (89), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 8/98 (8%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
+ P L+ +P+VL N L+ + GIVGG + ++ G +YLD
Sbjct: 17 FKPTLVFLPIVL-----NHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGFHALQFLYLD 71
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
PH VQP K E DT +Y + +IDP+
Sbjct: 72 PHIVQPSF---KSFTEIDTKSYSPIGSNRFSVHTIDPT 106
>gi|392343434|ref|XP_003754884.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
norvegicus]
gi|392355909|ref|XP_003752169.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
norvegicus]
Length = 126
Score = 38.5 bits (88), Expect = 6.0, Method: Composition-based stats.
Identities = 17/53 (32%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLDSIDPSLAIGFYCRDK 372
I+LDPH Q ++ + L D T+H + + + ++DPS+A+GF+C+++
Sbjct: 4 IFLDPHTTQTFVDTEESGL-VDDHTFHCLQSPQRMSILNLDPSVALGFFCKEE 55
>gi|427717569|ref|YP_007065563.1| hypothetical protein Cal7507_2294 [Calothrix sp. PCC 7507]
gi|427350005|gb|AFY32729.1| hypothetical protein Cal7507_2294 [Calothrix sp. PCC 7507]
Length = 129
Score = 37.7 bits (86), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 14/82 (17%)
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQ VGGK G +TY VG + +++G + + DV++ I
Sbjct: 31 PQPWVSVGGKDGDTTYAVGAKA--------------LDLGVEVGSGPKGSTGVDVLKFIS 76
Query: 356 LDSIDPSLAIGFYCRDKGLLVT 377
L I P + IG+Y DKG+ V+
Sbjct: 77 LPVISPYVGIGYYSEDKGVAVS 98
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.137 0.419
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,249,618,451
Number of Sequences: 23463169
Number of extensions: 269764957
Number of successful extensions: 567166
Number of sequences better than 100.0: 788
Number of HSP's better than 100.0 without gapping: 759
Number of HSP's successfully gapped in prelim test: 29
Number of HSP's that attempted gapping in prelim test: 564347
Number of HSP's gapped (non-prelim): 1356
length of query: 379
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 235
effective length of database: 8,980,499,031
effective search space: 2110417272285
effective search space used: 2110417272285
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)