BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 013409
(443 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255576671|ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B, putative [Ricinus communis]
Length = 489
Score = 630 bits (1624), Expect = e-178, Method: Compositional matrix adjust.
Identities = 332/491 (67%), Positives = 379/491 (77%), Gaps = 50/491 (10%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGS------------------------- 35
MKGFRE+ AS+C SK DTPNRSL S E GS
Sbjct: 1 MKGFRERV-ASRCSSKCPVDTPNRSLTSDCLESGSNFSTKGSLWSSFFASAFSVFETYRE 59
Query: 36 -----------------SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHK 78
+ VK++V+ GSMRRIHERVLGPSRTGISS+TSDIWLLGVC+K
Sbjct: 60 SPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGVCYK 119
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLV 138
I++DE+ G+A N LAEF D+SSRIL++YR+GFD IGDSK SDVGWGCMLRSSQMLV
Sbjct: 120 ISEDES-GNADTGNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQMLV 178
Query: 139 AQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
AQALLFH+LGR W KP QKP D+ YVEILHLFGDSE +PFSIHNL+QAGKAY LAAGSWV
Sbjct: 179 AQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGSWV 238
Query: 199 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
GPYAMCRSWE+LAR +R E L QSLPMA+YVVSGDEDGERGGAPVV I+DASRHC F
Sbjct: 239 GPYAMCRSWESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCLEF 298
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
S+GQADWTPILLLVPLVLGL+KVNPRYIP+L+ TFTF QSLGI+GGKPGASTYIVGVQ++
Sbjct: 299 SRGQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGVQDD 358
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+A YLDPH+VQ V+NIG+DD+EADTS+YHSD++RHI L SIDPSLAIGFYCRDKDDFD+F
Sbjct: 359 NAFYLDPHEVQSVVNIGRDDIEADTSSYHSDIVRHIPLHSIDPSLAIGFYCRDKDDFDEF 418
Query: 379 CARASKLAEESNGAPLFTVTQTHK--KPVNHSDVL-GETGGVPEDDSLGV-MSMNDAV-- 432
C ASKLA++S GAPLFTV HK KPV+H D+L E V EDDS+ V M +ND
Sbjct: 419 CLLASKLADDSQGAPLFTVAHCHKLPKPVSHGDMLNNEDDEVQEDDSVNVMMPVNDDAEG 478
Query: 433 GNAHEDDWQLL 443
G A ED+WQLL
Sbjct: 479 GGAQEDEWQLL 489
>gi|359495820|ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
gi|296086874|emb|CBI33041.3| unnamed protein product [Vitis vinifera]
Length = 486
Score = 619 bits (1596), Expect = e-175, Method: Compositional matrix adjust.
Identities = 308/433 (71%), Positives = 357/433 (82%), Gaps = 4/433 (0%)
Query: 15 SKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 74
S+S+P + G G + V+++VT SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54 SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113
Query: 75 VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 134
+C+KI+Q+E+ A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173
Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
QMLVAQALL HR+GR WRK KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 254
GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293
Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
C FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353
Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
VQ+E A YLDPH+ Q V++I +++LEADTS+YH ++IRHI LDSIDPSLAIGFYCRDKDD
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDD 413
Query: 375 FDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVLGETGGVPEDDSLGVMSMNDAV 432
FDDFC RASKLA++SNGAPLFTV H KP++ SD + + G EDDS V+S A
Sbjct: 414 FDDFCIRASKLADKSNGAPLFTVAHIHSLPKPISCSDGMDDCSGFREDDSFDVVSNKGAE 473
Query: 433 G--NAHEDDWQLL 443
G + HEDDWQLL
Sbjct: 474 GYEHEHEDDWQLL 486
>gi|224117658|ref|XP_002331599.1| predicted protein [Populus trichocarpa]
gi|222873995|gb|EEF11126.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 618 bits (1593), Expect = e-174, Method: Compositional matrix adjust.
Identities = 323/488 (66%), Positives = 373/488 (76%), Gaps = 51/488 (10%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSET---------------------- 38
MKGFRE+ + S ST ++PNRS S SELGS++T
Sbjct: 1 MKGFRERGFVASSKSSSTAESPNRSFTSDSSELGSADTKFSKPSLWSTFFASAFSVFDTH 60
Query: 39 -----------------------VKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGV 75
VK++V GSMRRI E VLG S+TGIS++T DIWLLG
Sbjct: 61 CDSSSTSEKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGA 120
Query: 76 CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 135
C+KI+QD + GDAA N LA FN DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQ
Sbjct: 121 CYKISQDNSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQ 180
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
MLVAQALLFHRLGR WRKPL KP DREYVEILHLFGDSE+S FSIHNLL+AGKAYGLAAG
Sbjct: 181 MLVAQALLFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAG 240
Query: 196 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 255
SWVGPYA+C SWE+L R +R ET L QSL MA+YVVSG EDGERGGAPV+CI++A+RHC
Sbjct: 241 SWVGPYAVCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHC 300
Query: 256 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
S FSKGQ DWTPILLLVPLVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGV
Sbjct: 301 SEFSKGQEDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGV 360
Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
Q+E+A YLDPH+VQPV+N+ +DD+EA+TS+YH +V+RH+ LD IDPSLAIGFYCRDKDDF
Sbjct: 361 QDENAFYLDPHEVQPVVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAIGFYCRDKDDF 420
Query: 376 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNA 435
DDFC ASKL +ESNGAPLFTV + +K + H ++G V DDSLGVM+MND G
Sbjct: 421 DDFCTLASKLTDESNGAPLFTVAHS-RKLLKH-----DSGEVRSDDSLGVMTMNDVEGCV 474
Query: 436 HEDDWQLL 443
HEDDWQLL
Sbjct: 475 HEDDWQLL 482
>gi|147862867|emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
Length = 489
Score = 617 bits (1591), Expect = e-174, Method: Compositional matrix adjust.
Identities = 310/436 (71%), Positives = 357/436 (81%), Gaps = 7/436 (1%)
Query: 15 SKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 74
S+S+P + G G + V+++VT SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54 SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113
Query: 75 VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 134
+C+KI+Q+E+ A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173
Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
QMLVAQALL HR+GR WRK KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 254
GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293
Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
C FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353
Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYH---SDVIRHIHLDSIDPSLAIGFYCRD 371
VQ+E A YLDPH+ Q V++I +++LEADTS+YH S +IRHI LDSIDPSLAIGFYCRD
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRD 413
Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVLGETGGVPEDDSLGVMSMN 429
KDDFDDFC RASKLA+ESNGAPLFTV H KP++ SD + + G EDDS V+S
Sbjct: 414 KDDFDDFCIRASKLADESNGAPLFTVAHIHSLPKPISCSDGMDDCSGFREDDSFDVVSNK 473
Query: 430 DAVG--NAHEDDWQLL 443
A G + HEDDWQLL
Sbjct: 474 GAEGYEHEHEDDWQLL 489
>gi|224092798|ref|XP_002309707.1| predicted protein [Populus trichocarpa]
gi|222852610|gb|EEE90157.1| predicted protein [Populus trichocarpa]
Length = 481
Score = 615 bits (1587), Expect = e-173, Method: Compositional matrix adjust.
Identities = 302/410 (73%), Positives = 353/410 (86%), Gaps = 6/410 (1%)
Query: 34 GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
G + +VK++V G+MRRI ERVLG S+TGIS++TSDIWLLG +KI+QD++ G+A N
Sbjct: 78 GWTSSVKKIVAGGTMRRIQERVLGTSKTGISNTTSDIWLLGARYKISQDDSSGNADATNA 137
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
LA F++DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQMLVAQALLFHRLGR WRK
Sbjct: 138 LAAFHRDFSSRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRK 197
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P+ KP DR+YVEILHLFGDSE S FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE+LAR
Sbjct: 198 PVDKPLDRDYVEILHLFGDSEASAFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWESLARS 257
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
+R ET L Q+LPMA+YVVSG EDGERGGAPV+ I+DA+RHCS FSKG+ DWTPILLLVP
Sbjct: 258 KREETNLEYQTLPMAVYVVSGCEDGERGGAPVLSIEDAARHCSEFSKGREDWTPILLLVP 317
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
LVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGVQ+E+A YLDPH+VQPV+N
Sbjct: 318 LVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQPVVN 377
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
+DD+EA+TS+YH DV+RHI LD IDPSLAIGFYCRDKDDFDDFC+ ASKLA+ESNGAP
Sbjct: 378 FSRDDVEANTSSYHCDVVRHIPLDLIDPSLAIGFYCRDKDDFDDFCSLASKLADESNGAP 437
Query: 394 LFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
LFTV ++K + ++ V +DD LGVM+MNDA G +EDDWQLL
Sbjct: 438 LFTVANSYKSSKH------DSSEVRDDDPLGVMTMNDAEGCLNEDDWQLL 481
>gi|449442361|ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
gi|449512710|ref|XP_004164121.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
Length = 483
Score = 597 bits (1538), Expect = e-168, Method: Compositional matrix adjust.
Identities = 301/407 (73%), Positives = 339/407 (83%), Gaps = 2/407 (0%)
Query: 38 TVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEF 97
TV++++T+GSMRRI ER+LG R+G+ SS DIWLLGVCHKI+QD DAA + G+A +
Sbjct: 78 TVRKVMTSGSMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKISQDHPPDDAASSPGVAGY 137
Query: 98 NQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 157
QDFSSRIL++YRKGF I DSK TSDV WGCMLRSSQMLVAQALLFHRLGR WRKP QK
Sbjct: 138 EQDFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPSQK 197
Query: 158 PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
P D+EYVEILHLFGDSETS FSIHNLLQAG+AY LAAGSWVGPYAMCRSWE L R +R
Sbjct: 198 PLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGPYAMCRSWETLVRSKRET 257
Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 277
L Q LPMAIY+VSGDEDGERGGAPV+ IDDASRHC FSKGQ DW+PILLLVPLVLG
Sbjct: 258 PILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSKGQHDWSPILLLVPLVLG 317
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
LEK+NPRYIP+LR TFTFPQSLGI+GGKPGASTYIVGVQ+E+A YLDPH+VQ V+NI KD
Sbjct: 318 LEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQQVVNIDKD 377
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
DLEADTS+YH +VIRHI L+SIDPSLAIGFYCRDKDDFD+FC RASKLAEES+GAPLFTV
Sbjct: 378 DLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCHRASKLAEESDGAPLFTV 437
Query: 398 TQTHK-KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
+TH P S L + + EDD GV+ M + +HEDDWQ L
Sbjct: 438 AETHSTNPGRQSSALNDHSRLVEDDGDGVVHMPNE-EESHEDDWQFL 483
>gi|356568569|ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
Length = 485
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 295/413 (71%), Positives = 333/413 (80%), Gaps = 9/413 (2%)
Query: 34 GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
G + V+++VT GSMRR ERVLG SRT ISSS DIWLLGVCHKI+Q E+ G +NG
Sbjct: 79 GWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQESTGGVDTSNG 138
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
LA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQMLVAQALLFH+LGR WRK
Sbjct: 139 LASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRK 198
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSWVGPYAMCR+WE LA
Sbjct: 199 PIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLA-- 256
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
R + LG LPMAIYVVSGDEDGERGGAPVVCI+DAS+ CS FS G A WTP+LLLVP
Sbjct: 257 -RKKNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCSEFSSGLAVWTPLLLLVP 315
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
LVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+GVQ E A YLDPHDVQ V+N
Sbjct: 316 LVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQNEKAFYLDPHDVQQVVN 375
Query: 334 IGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
I D E TS+YH +V+RHI LDSIDPSLAIGFYCRDKDDFDDFC++ASKLAEESNGA
Sbjct: 376 ISGDTQEPTGTSSYHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKLAEESNGA 435
Query: 393 PLFTV--TQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
PLFTV +++ K V++ DV G+ G EDD G+ ND V N EDDWQLL
Sbjct: 436 PLFTVAKSRSFSKQVSN-DVSGDNTGFQEDDFPGMDCGNDTVTN--EDDWQLL 485
>gi|356531828|ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
Length = 486
Score = 565 bits (1457), Expect = e-158, Method: Compositional matrix adjust.
Identities = 292/413 (70%), Positives = 329/413 (79%), Gaps = 8/413 (1%)
Query: 34 GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
G + V+++VT GSMRR ERVLG SRT ISSS DIWLLGVCHKI+Q E+ G +NG
Sbjct: 79 GWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQESSGGVDNSNG 138
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
LA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQMLVAQALLFH+LGR WRK
Sbjct: 139 LASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQMLVAQALLFHKLGRSWRK 198
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSWVGPYAMCR+WE LA
Sbjct: 199 PIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLA-- 256
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
R + LG LPMAIYVVSGDEDGERGGAPVVCI+DAS+ C FS G A WTP+LLLVP
Sbjct: 257 -RKKNDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCFEFSSGLAAWTPLLLLVP 315
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
LVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+G Q E A YLDPHDVQ V+N
Sbjct: 316 LVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGAQNEKAFYLDPHDVQQVVN 375
Query: 334 IGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
I D E TS+YH +++RHI LDSIDPSLAIGFYCRDKDDFDDFC++ASKLAEESNGA
Sbjct: 376 ISGDTQEPTSTSSYHCNIMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKLAEESNGA 435
Query: 393 PLFTVTQTH--KKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
PLFTVTQ+ K V +DV G+ G E+D G+ ND N EDDWQLL
Sbjct: 436 PLFTVTQSRSFSKQVTSNDVSGDNTGFQEEDFPGMDRGNDTGTN--EDDWQLL 486
>gi|357507987|ref|XP_003624282.1| Cysteine protease ATG4 [Medicago truncatula]
gi|147742964|sp|A2Q1V6.1|ATG4_MEDTR RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|124359485|gb|ABN05923.1| Peptidase C54 [Medicago truncatula]
gi|355499297|gb|AES80500.1| Cysteine protease ATG4 [Medicago truncatula]
Length = 487
Score = 556 bits (1432), Expect = e-155, Method: Compositional matrix adjust.
Identities = 282/412 (68%), Positives = 324/412 (78%), Gaps = 5/412 (1%)
Query: 34 GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
G + V+++V+ GSMRR ERVLG RT +SSS DIWLLGVCHKI+Q E+ GD N
Sbjct: 79 GWAAAVRKVVSGGSMRRFQERVLGSCRTDVSSSDGDIWLLGVCHKISQHESTGDVDIRNV 138
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
A F QDF SRILI+YRKGFD I DSK TSDV WGCMLRSSQMLVAQALLFH+LGR WRK
Sbjct: 139 FAAFEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRK 198
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
+ KP D+EY++IL LFGDSE + FSIHNLLQAGK YGLA GSWVGPYAMCR+WE LAR
Sbjct: 199 TVDKPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLARN 258
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
QR + G Q LPMAIYVVSGDEDGERGGAPVVCI+DA + C FS+G WTP+LLLVP
Sbjct: 259 QREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCIEDACKRCLEFSRGLVPWTPLLLLVP 318
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
LVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ + A YLDPH+V+PV+N
Sbjct: 319 LVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQNDKAFYLDPHEVKPVVN 378
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
I D E +TS+YH ++ RH+ LDSIDPSLAIGFYCRDKDDFDDFC+RA+KLAEESNGAP
Sbjct: 379 ITGDTQEPNTSSYHCNISRHMPLDSIDPSLAIGFYCRDKDDFDDFCSRATKLAEESNGAP 438
Query: 394 LFTVTQTHKKP--VNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
LFTV Q+ P V + V G+ EDDSL + +NDA +EDDWQ L
Sbjct: 439 LFTVAQSRSLPMQVTSNSVSGDDTRFEEDDSLSMNLVNDA---GNEDDWQFL 487
>gi|388514549|gb|AFK45336.1| unknown [Lotus japonicus]
Length = 489
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 288/489 (58%), Positives = 335/489 (68%), Gaps = 48/489 (9%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSETV-------KRLVTAG-SMRRIH 52
+K F ++ A+KC SKS+ +T + S S+ GSS++ T+G S+ +
Sbjct: 3 LKAFCDRIVAAKCSSKSSTETVDNSQVPACSKAGSSDSKFPKASLWSSFFTSGFSVIETY 62
Query: 53 ERVLGPSRTGISSSTSD----------IWL--------------------------LGVC 76
+ + + S S WL LGVC
Sbjct: 63 SKSPASEKKAVHSQNSGWGCCCEESCYCWLNEEIPRACTLGQAELTFQALMVIYGFLGVC 122
Query: 77 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 136
HK +Q E+ GD + A F QDFSS+IL++YRKGFD IGDSK TSDV WGCMLRSSQM
Sbjct: 123 HKFSQQESTGDVDNSTVFAAFEQDFSSKILLTYRKGFDAIGDSKYTSDVNWGCMLRSSQM 182
Query: 137 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 196
LVAQALLFH+LGR WRK KP D+EY++IL FGDSE S FSIHNLLQAGK YGLA GS
Sbjct: 183 LVAQALLFHKLGRMWRKTTDKPLDKEYLDILQHFGDSEASSFSIHNLLQAGKGYGLAVGS 242
Query: 197 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 256
WVGPYAMCRSWE LAR QR G Q LPMA+YVVSGDEDGERGGAPVVCI+DASR CS
Sbjct: 243 WVGPYAMCRSWEVLARNQRETNDHGEQPLPMALYVVSGDEDGERGGAPVVCIEDASRRCS 302
Query: 257 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 316
FS+G A WTP+LLLVPLVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ
Sbjct: 303 EFSRGLAAWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQ 362
Query: 317 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 376
E A YLDPHDVQPV++I D + +TS+YH +++R + LDSIDPSLAIGFYCRDKDDFD
Sbjct: 363 NEKAFYLDPHDVQPVVHINGDAQDPNTSSYHCNIVRQMPLDSIDPSLAIGFYCRDKDDFD 422
Query: 377 DFCARASKLAEESNGAPLFTVTQTHKKPVNHS--DVLGETGGVPEDDSLGVMSMNDAVGN 434
DFC+RASKLAEESNGAPLFTV Q P + DV G+ G EDDS GV +NDA N
Sbjct: 423 DFCSRASKLAEESNGAPLFTVAQFRSFPFQDAGYDVSGDNTGFQEDDSHGVDLLNDAGTN 482
Query: 435 AHEDDWQLL 443
EDDWQLL
Sbjct: 483 --EDDWQLL 489
>gi|30689628|ref|NP_850412.1| cysteine protease ATG4a [Arabidopsis thaliana]
gi|75160546|sp|Q8S929.1|ATG4A_ARATH RecName: Full=Cysteine protease ATG4a; AltName:
Full=Autophagy-related protein 4 homolog a;
Short=AtAPG4a; Short=Protein autophagy 4a
gi|19912143|dbj|BAB88383.1| autophagy 4a [Arabidopsis thaliana]
gi|110742303|dbj|BAE99076.1| hypothetical protein [Arabidopsis thaliana]
gi|330255286|gb|AEC10380.1| cysteine protease ATG4a [Arabidopsis thaliana]
Length = 467
Score = 520 bits (1340), Expect = e-145, Method: Compositional matrix adjust.
Identities = 244/377 (64%), Positives = 306/377 (81%), Gaps = 3/377 (0%)
Query: 34 GSSETVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
G + VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC+KI+ DE G+
Sbjct: 74 GWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGETDTGT 133
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQML AQALLFHRLGR W
Sbjct: 134 VLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWT 193
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
K + P ++EY+E L FGDSE S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA
Sbjct: 194 KKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLAC 252
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
+R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C FSKGQ++WTPI+LLV
Sbjct: 253 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPIILLV 312
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQE+ YLDPH+VQ V+
Sbjct: 313 PLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 372
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
+ K+ + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFDDFC RA KLAEESNGA
Sbjct: 373 TVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFCLRALKLAEESNGA 432
Query: 393 PLFTVTQTHKKPVNHSD 409
PLFTVTQTH +N S+
Sbjct: 433 PLFTVTQTHTA-INQSN 448
>gi|42571227|ref|NP_973687.1| cysteine protease ATG4a [Arabidopsis thaliana]
gi|330255287|gb|AEC10381.1| cysteine protease ATG4a [Arabidopsis thaliana]
Length = 422
Score = 520 bits (1338), Expect = e-145, Method: Compositional matrix adjust.
Identities = 244/377 (64%), Positives = 306/377 (81%), Gaps = 3/377 (0%)
Query: 34 GSSETVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
G + VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC+KI+ DE G+
Sbjct: 29 GWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGETDTGT 88
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQML AQALLFHRLGR W
Sbjct: 89 VLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWT 148
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
K + P ++EY+E L FGDSE S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA
Sbjct: 149 KKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLAC 207
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
+R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C FSKGQ++WTPI+LLV
Sbjct: 208 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPIILLV 267
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQE+ YLDPH+VQ V+
Sbjct: 268 PLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 327
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
+ K+ + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFDDFC RA KLAEESNGA
Sbjct: 328 TVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFCLRALKLAEESNGA 387
Query: 393 PLFTVTQTHKKPVNHSD 409
PLFTVTQTH +N S+
Sbjct: 388 PLFTVTQTHTA-INQSN 403
>gi|297820846|ref|XP_002878306.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
gi|297324144|gb|EFH54565.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
Length = 476
Score = 509 bits (1312), Expect = e-142, Method: Compositional matrix adjust.
Identities = 252/401 (62%), Positives = 316/401 (78%), Gaps = 10/401 (2%)
Query: 43 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QDFS
Sbjct: 86 MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEAESFEEADAGRVLAAFRQDFS 145
Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
S IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P + +
Sbjct: 146 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPPNEK 205
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET +
Sbjct: 206 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDVKH 265
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G +W PILLLVPLVLGL+KVN
Sbjct: 266 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGDTEWPPILLLVPLVLGLDKVN 325
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
PRYIP+L TFTFPQSLGI+GGKPGASTYIVGVQE+ YLDPHDVQ V+ + K++ + D
Sbjct: 326 PRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVD 385
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 402
TS+YH + +R++ L+S+DPSLA+GFYC+DKDDFDDFC RA+KLA +SNGAPLFTVTQ+H+
Sbjct: 386 TSSYHCNTLRYVPLESLDPSLALGFYCQDKDDFDDFCIRATKLAGDSNGAPLFTVTQSHR 445
Query: 403 KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
G+ E S V+S + G HEDDWQLL
Sbjct: 446 T---------NDCGIAETSSSTVIS-TEISGEEHEDDWQLL 476
>gi|297828133|ref|XP_002881949.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
gi|297327788|gb|EFH58208.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
Length = 467
Score = 507 bits (1306), Expect = e-141, Method: Compositional matrix adjust.
Identities = 246/369 (66%), Positives = 307/369 (83%), Gaps = 2/369 (0%)
Query: 34 GSSETVKRLVTA-GSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
G + VKR+ A G++RR ERVLGP+RTG+ S+TSD+WLLGVC+KI++DEA G+
Sbjct: 74 GWTAFVKRVSMATGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISEDEASGETNTGC 133
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
LA F QDFSS+IL++YR+GF+P D+ TSDV WGCM+RSSQML AQALLFHRLGR W
Sbjct: 134 VLAAFQQDFSSKILMTYRRGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRSWT 193
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
K + P ++EY+E L FGDSE+S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA
Sbjct: 194 KKSELP-EQEYLETLEPFGDSESSAFSIHNLIIAGSSYGLAAGSWVGPYAICRAWESLAC 252
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
+R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C FSKGQ++WTPILLLV
Sbjct: 253 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPILLLV 312
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQE+ YLDPH+VQ V+
Sbjct: 313 PLVLGLDSVNPRYIPSLIATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 372
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
+ K+ + DTS+YH +VIR++ L+S+DPSLA+GFYCRDKDDFDDFC RASKLAE+SNGA
Sbjct: 373 TVNKETPDVDTSSYHCNVIRYVPLESLDPSLALGFYCRDKDDFDDFCLRASKLAEDSNGA 432
Query: 393 PLFTVTQTH 401
PLFT+TQTH
Sbjct: 433 PLFTITQTH 441
>gi|15232213|ref|NP_191554.1| cysteine protease ATG4b [Arabidopsis thaliana]
gi|75182325|sp|Q9M1Y0.1|ATG4B_ARATH RecName: Full=Cysteine protease ATG4b; AltName:
Full=Autophagy-related protein 4 homolog b;
Short=AtAPG4b; Short=Protein autophagy 4b
gi|7019689|emb|CAB75814.1| putative protein [Arabidopsis thaliana]
gi|19912145|dbj|BAB88384.1| autophagy 4b [Arabidopsis thaliana]
gi|110742150|dbj|BAE99003.1| hypothetical protein [Arabidopsis thaliana]
gi|332646468|gb|AEE79989.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 477
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 247/401 (61%), Positives = 312/401 (77%), Gaps = 10/401 (2%)
Query: 43 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QDFS
Sbjct: 87 MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 146
Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
S IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D +
Sbjct: 147 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 206
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 207 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 266
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 267 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 326
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
PRYIP+L TFTFPQSLGI+GGKPGASTYIVGVQE+ YLDPHDVQ V+ + K++ + D
Sbjct: 327 PRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVD 386
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 402
TS+YH + +R++ L+S+DPSLA+GFYC+ KDDFDDFC RA+KLA +SNGAPLFTVTQ+H+
Sbjct: 387 TSSYHCNTLRYVPLESLDPSLALGFYCQHKDDFDDFCIRATKLAGDSNGAPLFTVTQSHR 446
Query: 403 KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
+ N + + + G HEDDWQLL
Sbjct: 447 R--NDCGIAETSSSTETSTEIS--------GEEHEDDWQLL 477
>gi|115461386|ref|NP_001054293.1| Os04g0682000 [Oryza sativa Japonica Group]
gi|75143803|sp|Q7XPW8.1|ATG4B_ORYSJ RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|32488637|emb|CAE03430.1| OSJNBa0032F06.13 [Oryza sativa Japonica Group]
gi|82470053|gb|ABB77259.1| autophagy 4 [Oryza sativa Indica Group]
gi|113565864|dbj|BAF16207.1| Os04g0682000 [Oryza sativa Japonica Group]
gi|215697216|dbj|BAG91210.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 478
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 256/435 (58%), Positives = 324/435 (74%), Gaps = 17/435 (3%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + N+S S S ++R+V +GSM R LG S+ SS D+W L
Sbjct: 56 FEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF----LGTSKVLTSS---DVWFL 108
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408
Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSM 428
KDDFDDFC+RA++L +++NGAPLFTV Q+ K+ N DVLG +G D ++ V +
Sbjct: 409 KDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG----DGNINVEDL 464
Query: 429 NDAVGNAHEDDWQLL 443
DA G E++WQ+L
Sbjct: 465 -DASGETGEEEWQIL 478
>gi|3212867|gb|AAC23418.1| unknown protein [Arabidopsis thaliana]
Length = 451
Score = 487 bits (1254), Expect = e-135, Method: Compositional matrix adjust.
Identities = 232/377 (61%), Positives = 293/377 (77%), Gaps = 19/377 (5%)
Query: 34 GSSETVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
G + VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC+KI+ DE G+
Sbjct: 74 GWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGETDTGT 133
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQML AQ
Sbjct: 134 VLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQLP---------- 183
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
++EY+E L FGDSE S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA
Sbjct: 184 -------EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLAC 236
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
+R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C FSKGQ++WTPI+LLV
Sbjct: 237 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPIILLV 296
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQE+ YLDPH+VQ V+
Sbjct: 297 PLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 356
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
+ K+ + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFDDFC RA KLAEESNGA
Sbjct: 357 TVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFCLRALKLAEESNGA 416
Query: 393 PLFTVTQTHKKPVNHSD 409
PLFTVTQTH +N S+
Sbjct: 417 PLFTVTQTHTA-INQSN 432
>gi|147742963|sp|Q2XPP4.2|ATG4B_ORYSI RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B;
Short=Protein autophagy 4; AltName: Full=OsAtg4
Length = 478
Score = 487 bits (1253), Expect = e-135, Method: Compositional matrix adjust.
Identities = 254/435 (58%), Positives = 322/435 (74%), Gaps = 17/435 (3%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + N+S S ++R+V +GSM R LG S+ SS D+W L
Sbjct: 56 FEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF----LGTSKVLTSS---DVWFL 108
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408
Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSM 428
KDDFDDFC+RA++L +++NGAPLFTV Q+ K+ N DVLG +G D ++ V +
Sbjct: 409 KDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG----DGNINVEDL 464
Query: 429 NDAVGNAHEDDWQLL 443
DA G E++WQ+L
Sbjct: 465 -DASGETGEEEWQIL 478
>gi|222629790|gb|EEE61922.1| hypothetical protein OsJ_16662 [Oryza sativa Japonica Group]
Length = 892
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 246/407 (60%), Positives = 310/407 (76%), Gaps = 12/407 (2%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + N+S S S ++R+V +GSM R LG S+ ++SD+W L
Sbjct: 56 FEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF----LGTSKV---LTSSDVWFL 108
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408
Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETG 415
KDDFDDFC+RA++L +++NGAPLFTV Q+ K+ N DVLG +G
Sbjct: 409 KDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG 455
>gi|224994902|gb|ACN76570.1| cysteine proteinase [Triticum aestivum]
Length = 484
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 258/430 (60%), Positives = 313/430 (72%), Gaps = 16/430 (3%)
Query: 20 DTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI 79
D RS S ++R V GSM R LG G + + D+W LG C+K+
Sbjct: 65 DQSGRSGGHASGSYAWSRVLRRFVGGGSMWRF----LG---CGKALTAGDVWFLGKCYKL 117
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
+ +E+ D+ G A F +DFSSR+ I+YRKGFD I DSK+TSDV WGCM+RSSQMLVA
Sbjct: 118 SSEESSSDSDSEGGHAAFLEDFSSRVWITYRKGFDVISDSKLTSDVNWGCMVRSSQMLVA 177
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVG 199
QAL+FH LGR WRKP Q P D E+ ILHLFGDSE FSIHNLLQAGK+YGLAAGSWVG
Sbjct: 178 QALIFHHLGRSWRKPAQNPSDPEHTRILHLFGDSEVCAFSIHNLLQAGKSYGLAAGSWVG 237
Query: 200 PYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 257
PYAMCR+W+ L R R + + +S PM +YVVSGDEDGERGGAPVVCID A++ C
Sbjct: 238 PYAMCRAWQTLIRTNREQPEVINRNESFPMVLYVVSGDEDGERGGAPVVCIDVAAQLCYD 297
Query: 258 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 317
F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPGASTYI GVQ+
Sbjct: 298 FNKGQSAWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGASTYIAGVQD 357
Query: 318 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 377
+ A+YLDPH+VQ +NI D+LEADTS+YH +R + LD IDPSLAIGFYCRDKDDFDD
Sbjct: 358 DRALYLDPHEVQLAVNIASDNLEADTSSYHCSTVRDMPLDLIDPSLAIGFYCRDKDDFDD 417
Query: 378 FCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETG-GVPEDDSLGVMSMNDAVG 433
FC+RAS+LAE++NGAPLFTV Q+ K+ N D G +G GV D++ + D G
Sbjct: 418 FCSRASELAEQANGAPLFTVVQSVQPSKQMYNQDDGSGCSGYGV--SDNIDTEDL-DGSG 474
Query: 434 NAHEDDWQLL 443
ED+WQ+L
Sbjct: 475 ETGEDEWQIL 484
>gi|218195841|gb|EEC78268.1| hypothetical protein OsI_17962 [Oryza sativa Indica Group]
Length = 912
Score = 476 bits (1225), Expect = e-132, Method: Compositional matrix adjust.
Identities = 244/407 (59%), Positives = 308/407 (75%), Gaps = 12/407 (2%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + N+S S ++R+V +GSM R LG S+ ++SD+W L
Sbjct: 56 FEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF----LGTSKV---LTSSDVWFL 108
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408
Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETG 415
KDDFDDFC+RA++L +++NGAPLFTV Q+ K+ N DVLG +G
Sbjct: 409 KDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG 455
>gi|221137004|ref|NP_001137488.1| autophagy-related 4 [Zea mays]
gi|195620628|gb|ACG32144.1| cysteine protease ATG4B [Zea mays]
gi|216963236|gb|ACJ73912.1| autophagy-related 4 variant 1 [Zea mays]
gi|219886349|gb|ACL53549.1| unknown [Zea mays]
gi|414584729|tpg|DAA35300.1| TPA: autophagy 4a variant 2 isoform 1 [Zea mays]
gi|414584730|tpg|DAA35301.1| TPA: autophagy 4a variant 2 isoform 2 [Zea mays]
Length = 492
Score = 475 bits (1223), Expect = e-131, Method: Compositional matrix adjust.
Identities = 252/433 (58%), Positives = 319/433 (73%), Gaps = 20/433 (4%)
Query: 17 STPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 76
S+P RS S G S ++R V +GSM R+ LG R ++SD+W LG C
Sbjct: 74 SSPACDARSTKSSSGSYGLSRILRRFVGSGSMWRL----LGCGRV---LTSSDVWFLGKC 126
Query: 77 HKIAQDEALGDAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 135
+K++ +E + ++ A F +DFSSRI I+YRKGFD I DSK+TSDV WGCM+RSSQ
Sbjct: 127 YKVSPEEEESGDSESDSGHAAFLEDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQ 186
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
MLVAQAL+FH LGR WRKP +KP++ +Y+ +LHLFGDSE FSIHNLLQAG+ YGLAAG
Sbjct: 187 MLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLHLFGDSEACAFSIHNLLQAGRNYGLAAG 246
Query: 196 SWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
SW+GPYAMCR+W+ L R R A+ G ++ PMA+YVVSGDEDGERGGAPVVCID A++
Sbjct: 247 SWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGERGGAPVVCIDVAAQ 306
Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI
Sbjct: 307 LCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLGILGGKPGTSTYIA 366
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
GVQ++ A+YLDPH+VQ ++I D+LEADTS+YH V+R + L+ IDPSLAIGFYCRDKD
Sbjct: 367 GVQDDRALYLDPHEVQMTVDIALDNLEADTSSYHCSVVRALALEQIDPSLAIGFYCRDKD 426
Query: 374 DFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSMND 430
DFDDFC+RAS+LAE++NGAPLFTV Q+ K+ D LG +G +D +D
Sbjct: 427 DFDDFCSRASELAEKANGAPLFTVVQSIEPSKQMYKQDDGLGCSGSSMAND-------DD 479
Query: 431 AVGNAHEDDWQLL 443
G+ ++WQ+L
Sbjct: 480 LDGSGEAEEWQIL 492
>gi|224994904|gb|ACN76571.1| cysteine proteinase [Triticum aestivum]
Length = 486
Score = 474 bits (1219), Expect = e-131, Method: Compositional matrix adjust.
Identities = 256/434 (58%), Positives = 311/434 (71%), Gaps = 24/434 (5%)
Query: 20 DTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI 79
D RS S ++R V GSM R LG G + + +D+ LG C+K+
Sbjct: 67 DQSGRSGGHASGSYAWSRVLRRFVGGGSMWRF----LG---CGKALTAADVQFLGKCYKL 119
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
+ +E+ D+ G A F +DFSSRI I+YRKGFD I DSK+TSDV WGCM+RSSQMLVA
Sbjct: 120 SSEESSSDSDSEGGHAAFLEDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVA 179
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVG 199
QAL+FH LGR WRKP Q P + EY+ ILHLFGDSE FSIHNLLQAGK+YGLAAGSWVG
Sbjct: 180 QALIFHHLGRSWRKPAQNPSNPEYIRILHLFGDSEACAFSIHNLLQAGKSYGLAAGSWVG 239
Query: 200 PYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 257
PYAMCR+W+ L R R + + +S PMA+YVVSGDEDGERGGAPVVCID A++ C
Sbjct: 240 PYAMCRAWQTLIRTNREQPEVINRNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYD 299
Query: 258 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 317
F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPGASTYI GVQ+
Sbjct: 300 FNKDQSAWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGASTYIAGVQD 359
Query: 318 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 377
+ A+YLDPH+VQ +NI D+LEADTS+YH +R + LD IDPSLAIGFYCRDKDDFDD
Sbjct: 360 DRALYLDPHEVQLAVNIASDNLEADTSSYHCSTVRDMPLDLIDPSLAIGFYCRDKDDFDD 419
Query: 378 FCARASKLAEESNGAPLFTVTQT---HKKPVNHSD-----VLGETGGVPEDDSLGVMSMN 429
FC+RAS+LAE++NGAPLFTV Q+ K+ N D G +G + +D
Sbjct: 420 FCSRASELAEQANGAPLFTVVQSVQPSKQMYNRDDGSGCSGYGVSGNIDAEDL------- 472
Query: 430 DAVGNAHEDDWQLL 443
D G ED+WQ+L
Sbjct: 473 DGSGETGEDEWQIL 486
>gi|357166768|ref|XP_003580841.1| PREDICTED: cysteine protease ATG4B-like [Brachypodium distachyon]
Length = 493
Score = 473 bits (1217), Expect = e-131, Method: Compositional matrix adjust.
Identities = 247/412 (59%), Positives = 308/412 (74%), Gaps = 12/412 (2%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 95
S ++R V GSM R LG ++ + D+W LG C+K + +E+ D ++G A
Sbjct: 90 SRALRRFVGGGSMWRF----LGCAKV---LTNGDVWFLGKCYKFSSEESSSDLDTDSGHA 142
Query: 96 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
F +DFSSRI ++YRKGFD I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AFLEDFSSRIWVTYRKGFDAISDSKFTSDVNWGCMVRSSQMLVAQALMFHHLGRSWRKPS 202
Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 215
QKP + EY+ ILHLFGDSE FS+HNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R R
Sbjct: 203 QKPCNPEYIRILHLFGDSEVCAFSVHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNR 262
Query: 216 A--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
E G +S PMA+YVVSGDEDGERGGAPVVCID A++ C F+K Q+ W+PILLLVP
Sbjct: 263 EQPEVSNGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKDQSTWSPILLLVP 322
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
LVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STYI G+Q++ A+YLDPHDVQ +N
Sbjct: 323 LVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTYIAGIQDDRALYLDPHDVQMAVN 382
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
I D+L+ADTS+YH +R + LD +DPSLAIGFYCRDKDDFDDFC+RAS+L ++NGAP
Sbjct: 383 IASDNLDADTSSYHCSTVRDMALDLLDPSLAIGFYCRDKDDFDDFCSRASELVVKANGAP 442
Query: 394 LFTVTQTHK--KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
LFTV Q+ + K + + D + G D++ + + D G A E++WQ+L
Sbjct: 443 LFTVVQSIQPSKQMYNQDDGSGSSGDGMADNINMEDL-DGSGEAGEEEWQIL 493
>gi|147742949|sp|A2XHJ5.1|ATG4A_ORYSI RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|125544166|gb|EAY90305.1| hypothetical protein OsI_11880 [Oryza sativa Indica Group]
Length = 473
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 247/434 (56%), Positives = 312/434 (71%), Gaps = 16/434 (3%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + +R L S ++R+ GSM R LG S+ + ++SD+W L
Sbjct: 52 FEAHQDSSAHRPLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 104
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E + +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 105 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 164
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE FSIHNLLQAGK+YGLA
Sbjct: 165 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLA 224
Query: 194 AGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R E G + PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 225 AGSWVGPYAMCRAWQTLVRTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVA 284
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 285 AQLCCDFNKGQSTWSPILLLVPLVLGLDKLNPRYIPLLKETFTFPQSLGILGGKPGTSTY 344
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
+ GVQ++ +YLDPH+VQ ++I D+LEADTS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 345 VAGVQDDRVLYLDPHEVQLAVDIAADNLEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 404
Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMN-- 429
KDDFDDFC+RAS+L +++NGAPLFTV Q+ + + +G D + ++++
Sbjct: 405 KDDFDDFCSRASELVDKANGAPLFTVMQSVQPSKQMYNEESSSG-----DGMDIINVEGL 459
Query: 430 DAVGNAHEDDWQLL 443
D G E++WQ+L
Sbjct: 460 DGSGETGEEEWQIL 473
>gi|221137006|ref|NP_001137489.1| autophagy-related 4b [Zea mays]
gi|194701156|gb|ACF84662.1| unknown [Zea mays]
gi|195657359|gb|ACG48147.1| cysteine protease ATG4B [Zea mays]
gi|216963250|gb|ACJ73914.1| autophagy-related 4b variant 1 [Zea mays]
gi|413920007|gb|AFW59939.1| autophagy 4b variant 1Cysteine protease ATG4B [Zea mays]
Length = 492
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 252/417 (60%), Positives = 312/417 (74%), Gaps = 23/417 (5%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
S ++R V +GSM R+ LG +R ++ D+W LG C++++ ++E G + ++G
Sbjct: 90 SRILRRFVGSGSMWRL----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGH 142
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
+KP+D +Y+ +LHLFGDSE FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R
Sbjct: 203 SEKPYDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTN 262
Query: 215 R--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
R A+ G ++ PMA+YVVSGDEDGERGGAPV CID A++ CS F+KGQ W+PILLL+
Sbjct: 263 REQADAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLI 322
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ +
Sbjct: 323 PLVLGLDKINPRYIPLLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAV 382
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
+I D+LEADTS+YH V+R + L+ IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGA
Sbjct: 383 DIAPDNLEADTSSYHCSVVRDLALEQIDPSLAIGFYCRDKDDFDDFCSRASELAEKANGA 442
Query: 393 PLFTVTQT---HKKPVNHSDVLGETGG---VPEDDSLGVMSMNDAVGNAHEDDWQLL 443
PLFTV Q+ K+ D L G ED L DA G A E +WQ+L
Sbjct: 443 PLFTVMQSVQPSKQMYKQDDGLCCCSGSSMANEDYDL------DASGEAGE-EWQIL 492
>gi|75138024|sp|Q75KP8.1|ATG4A_ORYSJ RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|40539014|gb|AAR87271.1| putative autophagy protein (with alternative splicing) [Oryza
sativa Japonica Group]
gi|108708571|gb|ABF96366.1| Peptidase family C54 containing protein, expressed [Oryza sativa
Japonica Group]
gi|125586519|gb|EAZ27183.1| hypothetical protein OsJ_11120 [Oryza sativa Japonica Group]
gi|215769128|dbj|BAH01357.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 474
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 252/435 (57%), Positives = 312/435 (71%), Gaps = 18/435 (4%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + NRSL S ++R+ GSM R LG S+ + ++SD+W L
Sbjct: 53 FEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 105
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E + +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 106 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 165
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE FSIHNLLQAGK+YGLA
Sbjct: 166 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLA 225
Query: 194 AGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R E G + PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 226 AGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVA 285
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T TFPQSLGI+GGKPG STY
Sbjct: 286 AQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETLTFPQSLGILGGKPGTSTY 345
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D+LEA TS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 346 IAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSLAIGFYCRD 405
Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSM 428
KDDFDDFC+RAS+L +++NGAPLFTV Q+ K+ N G+ G+ DS+ V +
Sbjct: 406 KDDFDDFCSRASELVDKANGAPLFTVVQSVQPSKQMYNEESSSGD--GM---DSINVEGL 460
Query: 429 NDAVGNAHEDDWQLL 443
D G E++WQ+L
Sbjct: 461 -DGSGETGEEEWQIL 474
>gi|90399070|emb|CAJ86292.1| H0124B04.9 [Oryza sativa Indica Group]
Length = 1216
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 244/440 (55%), Positives = 308/440 (70%), Gaps = 45/440 (10%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + N+S S ++R+V +GSM R LG S+ ++SD+W L
Sbjct: 327 FEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF----LGTSKV---LTSSDVWFL 379
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 380 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 439
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE FSIHNLLQAG +YGLA
Sbjct: 440 SQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 499
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 500 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 559
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 560 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 619
Query: 312 IVGVQEESAIYLDPHDVQ---------------------------------PVINIGKDD 338
I GVQ++ A+YLDPH+VQ ++I D+
Sbjct: 620 IAGVQDDRALYLDPHEVQMSATVIIWLFLQYPFYAWNPFCYGSYSGVFSTSQAVDIAADN 679
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 398
+EADTS+YH +R + LD IDPSLAIGFYCRDKDDFDDFC+RA++L +++NGAPLFTV
Sbjct: 680 IEADTSSYHCSTVRDLALDLIDPSLAIGFYCRDKDDFDDFCSRATELVDKANGAPLFTVV 739
Query: 399 QT---HKKPVNHSDVLGETG 415
Q+ K+ N DVLG +G
Sbjct: 740 QSVQPSKQMYNQDDVLGISG 759
>gi|40539015|gb|AAR87272.1| putative autophagy protein (with alternative splicing) [Oryza
sativa Japonica Group]
gi|108708572|gb|ABF96367.1| Peptidase family C54 containing protein, expressed [Oryza sativa
Japonica Group]
Length = 505
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 252/466 (54%), Positives = 312/466 (66%), Gaps = 49/466 (10%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + NRSL S ++R+ GSM R LG S+ + ++SD+W L
Sbjct: 53 FEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 105
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E + +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 106 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 165
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
SQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE FSIHNLLQAGK+YGLA
Sbjct: 166 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLA 225
Query: 194 AGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
AGSWVGPYAMCR+W+ L R E G + PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 226 AGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVA 285
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
++ C F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T TFPQSLGI+GGKPG STY
Sbjct: 286 AQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETLTFPQSLGILGGKPGTSTY 345
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
I GVQ++ A+YLDPH+VQ ++I D+LEA TS+YH +R + LD IDPSLAIGFYCRD
Sbjct: 346 IAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSLAIGFYCRD 405
Query: 372 K-------------------------------DDFDDFCARASKLAEESNGAPLFTVTQT 400
K DDFDDFC+RAS+L +++NGAPLFTV Q+
Sbjct: 406 KGELLLPDKMLGHHLSSLQSWFSYLLCLSAYVDDFDDFCSRASELVDKANGAPLFTVVQS 465
Query: 401 ---HKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
K+ N G+ G+ DS+ V + D G E++WQ+L
Sbjct: 466 VQPSKQMYNEESSSGD--GM---DSINVEGL-DGSGETGEEEWQIL 505
>gi|194696780|gb|ACF82474.1| unknown [Zea mays]
gi|413920008|gb|AFW59940.1| autophagy 4b variant 3 [Zea mays]
Length = 462
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 237/371 (63%), Positives = 285/371 (76%), Gaps = 15/371 (4%)
Query: 81 QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQ 140
++E G + ++G A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQ
Sbjct: 99 EEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQ 158
Query: 141 ALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
AL+FH LGR WRKP +KP+D +Y+ +LHLFGDSE FSIHNLLQAG+ YGLAAGSWVGP
Sbjct: 159 ALIFHHLGRSWRKPSEKPYDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGP 218
Query: 201 YAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
YAMCR+W+ L R R A+ G ++ PMA+YVVSGDEDGERGGAPV CID A++ CS F
Sbjct: 219 YAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNF 278
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
+KGQ W+PILLL+PLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI GVQE+
Sbjct: 279 NKGQCTWSPILLLIPLVLGLDKINPRYIPLLKETFKFPQSLGILGGKPGTSTYIAGVQED 338
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
A+YLDPHDVQ ++I D+LEADTS+YH V+R + L+ IDPSLAIGFYCRDKDDFDDF
Sbjct: 339 RALYLDPHDVQMAVDIAPDNLEADTSSYHCSVVRDLALEQIDPSLAIGFYCRDKDDFDDF 398
Query: 379 CARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGG---VPEDDSLGVMSMNDAV 432
C+RAS+LAE++NGAPLFTV Q+ K+ D L G ED L DA
Sbjct: 399 CSRASELAEKANGAPLFTVMQSVQPSKQMYKQDDGLCCCSGSSMANEDYDL------DAS 452
Query: 433 GNAHEDDWQLL 443
G A E +WQ+L
Sbjct: 453 GEAGE-EWQIL 462
>gi|168010849|ref|XP_001758116.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690572|gb|EDQ76938.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 356
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 211/357 (59%), Positives = 281/357 (78%), Gaps = 4/357 (1%)
Query: 46 GSMRRIHERVLGPSRTGISSST-SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR 104
GSMRR+ E +LGP T ++S+ S+IW+LG+C+K++ D + EF DF+SR
Sbjct: 1 GSMRRLQELLLGPRFTAANASSGSEIWVLGLCYKVSADPN-NETLSVQAFEEFISDFTSR 59
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
I I+YRKGF+ +G SK+TSDVGWGCMLRS QML+AQAL+ H LGR WR+ +P + Y+
Sbjct: 60 IWITYRKGFECVGQSKLTSDVGWGCMLRSGQMLLAQALVCHYLGRSWRREPGQPCSQAYL 119
Query: 165 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GC 222
+IL FGDSE+ PFSIHNLL+AG +GLAAGSW+GPYA+CR+ EALAR R ++ G
Sbjct: 120 QILQTFGDSESCPFSIHNLLEAGHPFGLAAGSWLGPYALCRTLEALARADREQSQKKGGK 179
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
++LP A+YVVSG+ +GERGGAPV+C++D + CS + + +WTP+L+LVPLVLGL+KVN
Sbjct: 180 RALPFAVYVVSGEAEGERGGAPVLCVEDVATLCSKWREPTEEWTPLLVLVPLVLGLDKVN 239
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
PRY+P+LR TFTFPQSLGI GGKPGASTY++GVQ+E A+YLDPH+ Q V+ + ++LE D
Sbjct: 240 PRYLPSLRATFTFPQSLGIAGGKPGASTYLIGVQDEQAMYLDPHENQQVVPVTPENLELD 299
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
TS+YH +R + LD+IDPSLAIGFYCRD+ +FDD CAR+S+LA++SNGAP+FTV +
Sbjct: 300 TSSYHCSTVRRLPLDTIDPSLAIGFYCRDRAEFDDLCARSSELAKQSNGAPMFTVAE 356
>gi|315259988|gb|ADT92194.1| autophagy-related 4b [Zea mays]
Length = 595
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 216/342 (63%), Positives = 268/342 (78%), Gaps = 10/342 (2%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
S ++R V +GSM R+ LG +R ++ D+W LG C++++ ++E G + ++G
Sbjct: 90 SRILRRFVGSGSMWRL----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGH 142
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
+KP+D +Y+ +LHLFGDSE FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R
Sbjct: 203 SEKPYDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTN 262
Query: 215 R--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
R A+ G ++ PMA+YVVSGDEDGERGGAPV CID A++ CS F+KGQ W+PILLL+
Sbjct: 263 REQADAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLI 322
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
PLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ +
Sbjct: 323 PLVLGLDKINPRYIPLLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAV 382
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
+I D+LEADTS+YH V+R + L+ IDPSLAIGFYCRDK D
Sbjct: 383 DIAPDNLEADTSSYHCSVVRDLALEQIDPSLAIGFYCRDKGD 424
>gi|216963242|gb|ACJ73913.1| autophagy-related 4a variant 2 [Zea mays]
Length = 429
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 219/359 (61%), Positives = 273/359 (76%), Gaps = 10/359 (2%)
Query: 17 STPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 76
S+P RS S G S ++R V +GSM R+ LG R ++SD+W LG C
Sbjct: 74 SSPACDARSTKSSSGSYGLSRILRRFVGSGSMWRL----LGCGRV---LTSSDVWFLGKC 126
Query: 77 HKIAQDEALGDAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 135
+K++ +E + ++ A F +DFSSRI I+YRKGFD I DSK+TSDV WGCM+RSSQ
Sbjct: 127 YKVSPEEEESGDSESDSGHAAFLEDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQ 186
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
MLVAQAL+FH LGR WRKP +KP++ +Y+ +LHLFGDSE FSIHNLLQAG+ YGLAAG
Sbjct: 187 MLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLHLFGDSEACAFSIHNLLQAGRNYGLAAG 246
Query: 196 SWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
SW+GPYAMCR+W+ L R R A+ G ++ PMA+YVVSGDEDGERGGAPVVCID A++
Sbjct: 247 SWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGERGGAPVVCIDVAAQ 306
Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI
Sbjct: 307 LCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLGILGGKPGTSTYIA 366
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
GVQ++ A+YLDPH+VQ ++I D+LEADTS+YH V+R + L+ IDPSLAIGFYCRDK
Sbjct: 367 GVQDDRALYLDPHEVQMTVDIALDNLEADTSSYHCSVVRALALEQIDPSLAIGFYCRDK 425
>gi|168036750|ref|XP_001770869.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677928|gb|EDQ64393.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 346
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 197/339 (58%), Positives = 267/339 (78%), Gaps = 5/339 (1%)
Query: 64 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
SSS +IW+LG+C+K++ D A +A + EF DFSSRI I+YRKGF+ +G+SK+TS
Sbjct: 4 SSSGGEIWVLGICYKVSAD-ANDEAVSAHAFEEFLNDFSSRIWITYRKGFESLGESKLTS 62
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
DVGWGCMLRS Q+L+AQAL+ H LGR WR+ + +EY++IL FGDSE+ FSIHNL
Sbjct: 63 DVGWGCMLRSGQILLAQALVCHYLGRTWRRNACQECLQEYLQILQSFGDSESCSFSIHNL 122
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARC---QRAETGLGCQSLPMAIYVVSGDEDGER 240
L+AG+ +GLAAGSW+GPYA+CR+ EALA+ Q A+ G G ++LP A+YVVSG+ +G+R
Sbjct: 123 LEAGRPFGLAAGSWLGPYALCRTLEALAKADEDQNAKKG-GKRALPFAVYVVSGETEGDR 181
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
GGAPV C++DA+ CS + + +W+P+++LVPLVLGL+K+NPRY+P+LR TFT PQSLG
Sbjct: 182 GGAPVRCVEDAAVLCSKWGEATEEWSPLVVLVPLVLGLDKLNPRYLPSLRATFTLPQSLG 241
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
+ GGKPGAST+++GVQ + A+YLDPH+ Q V + ++LE DTS YH V+R + LDSID
Sbjct: 242 VAGGKPGASTHLIGVQGDQAMYLDPHENQQVFAVTPENLELDTSFYHCSVVRRLPLDSID 301
Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
PSLAIGFYCRD+ +FDD CAR+S+L ++ NGAP+FTV +
Sbjct: 302 PSLAIGFYCRDRAEFDDLCARSSELVKQYNGAPIFTVAE 340
>gi|302787965|ref|XP_002975752.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
gi|300156753|gb|EFJ23381.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
Length = 358
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 202/375 (53%), Positives = 268/375 (71%), Gaps = 29/375 (7%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDAA 89
+ V+R V G +RRI E ++G SS S IWLLG C+++ + DE ++
Sbjct: 2 TAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKEST 59
Query: 90 GNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
++ +A+F DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HRL
Sbjct: 60 SSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHRL 119
Query: 148 GRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
GR WR+ ++P+ REY+EILH F DS + PFSIHN ++AG YGLAAGSW+GPYA+C
Sbjct: 120 GRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALCH 178
Query: 206 SWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+ EALAR G G + +A+YVVSGD GERGGAPV+ D + C
Sbjct: 179 AIEALAR----NDGRGREGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC--------- 225
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YLD
Sbjct: 226 --PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYLD 283
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH+VQ V+++ + LE D+++YH V+R + LD+IDPSLA+GFYCR+++D DD CARAS+
Sbjct: 284 PHEVQKVVSVSGESLEFDSASYHCSVVRKMLLDAIDPSLALGFYCRNREDLDDLCARASE 343
Query: 385 LAEESNGAPLFTVTQ 399
LA +SNGAP+FTV +
Sbjct: 344 LASQSNGAPMFTVAE 358
>gi|302783857|ref|XP_002973701.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
gi|300158739|gb|EFJ25361.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
Length = 358
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 202/375 (53%), Positives = 268/375 (71%), Gaps = 29/375 (7%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDAA 89
+ V+R V G +RRI E ++G SS S IWLLG C+++ + DE ++
Sbjct: 2 TAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKEST 59
Query: 90 GNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
++ +A+F DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HRL
Sbjct: 60 SSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHRL 119
Query: 148 GRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
GR WR+ ++P+ REY+EILH F DS + PFSIHN ++AG YGLAAGSW+GPYA+C
Sbjct: 120 GRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALCH 178
Query: 206 SWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+ EALAR G G Q +A+YVVSGD GERGGAPV+ D + C
Sbjct: 179 AIEALAR----NDGRGRQGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC--------- 225
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YLD
Sbjct: 226 --PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYLD 283
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH+VQ V+++ + LE D+++YH V+R + LD+IDPSLA+GFYCR++++ DD CARAS+
Sbjct: 284 PHEVQKVVSVSGESLEFDSASYHCSVVRKMPLDAIDPSLALGFYCRNREELDDLCARASE 343
Query: 385 LAEESNGAPLFTVTQ 399
LA +SNGAP+FTV +
Sbjct: 344 LASQSNGAPMFTVAE 358
>gi|186511209|ref|NP_001118859.1| cysteine protease ATG4b [Arabidopsis thaliana]
gi|62318602|dbj|BAD95023.1| hypothetical protein [Arabidopsis thaliana]
gi|332646469|gb|AEE79990.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 267
Score = 314 bits (804), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 156/244 (63%), Positives = 198/244 (81%)
Query: 43 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QDFS
Sbjct: 1 MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 60
Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
S IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D +
Sbjct: 61 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 120
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 121 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 180
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 181 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 240
Query: 283 PRYI 286
PR++
Sbjct: 241 PRFV 244
>gi|79597805|ref|NP_850722.3| cysteine protease ATG4b [Arabidopsis thaliana]
gi|332646467|gb|AEE79988.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 360
Score = 308 bits (790), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 155/244 (63%), Positives = 196/244 (80%)
Query: 43 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QDFS
Sbjct: 87 MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 146
Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
S IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D +
Sbjct: 147 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 206
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 207 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 266
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 267 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 326
Query: 283 PRYI 286
P +
Sbjct: 327 PSHF 330
>gi|457866467|dbj|BAM93578.1| autophagy related protein 4 [Vigna unguiculata]
Length = 219
Score = 287 bits (734), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 158/220 (71%), Positives = 178/220 (80%), Gaps = 4/220 (1%)
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 286
MAIYVVSGDEDGERGGAPVVCI+DA +HCS FS+GQA WTP+LLLVPLVLGL+KVNPRYI
Sbjct: 1 MAIYVVSGDEDGERGGAPVVCIEDAFKHCSEFSRGQAAWTPLLLLVPLVLGLDKVNPRYI 60
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TST 345
P L TF FPQSLGI+GGKPGASTYI+GVQ E A YLDPHDVQ V+NI D E + TS+
Sbjct: 61 PLLHSTFKFPQSLGIMGGKPGASTYIIGVQSEKAFYLDPHDVQTVVNISGDTQEPNSTSS 120
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH--KK 403
YH +V+RHI LDSIDPSLAIGFYCRDKDDFDDFC++ASKLAEESNGAPLFTV Q+ K
Sbjct: 121 YHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKLAEESNGAPLFTVAQSRSFSK 180
Query: 404 PVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
V+ +DV G+ G ED LG +D +EDDWQLL
Sbjct: 181 QVSGNDVSGDNTGFEEDAFLGT-DHDDNDAGTNEDDWQLL 219
>gi|413917967|gb|AFW57899.1| hypothetical protein ZEAMMB73_419246 [Zea mays]
Length = 290
Score = 278 bits (710), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 143/209 (68%), Positives = 172/209 (82%), Gaps = 2/209 (0%)
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
S G GCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +L LFGDSE FSIHN
Sbjct: 14 SLTGKGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLRLFGDSEACAFSIHN 73
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGER 240
LLQA + YGLAAGSW+GPYAMCR+W+ L R R A+ G ++ PMA+YVVSGDEDGER
Sbjct: 74 LLQARRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGER 133
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
GGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLG
Sbjct: 134 GGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLG 193
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
I+GGKPG STYI GVQ++ A+YLDPH+VQ
Sbjct: 194 ILGGKPGTSTYIAGVQDDRALYLDPHEVQ 222
>gi|414869447|tpg|DAA48004.1| TPA: hypothetical protein ZEAMMB73_510335 [Zea mays]
gi|414869466|tpg|DAA48023.1| TPA: hypothetical protein ZEAMMB73_786179 [Zea mays]
Length = 472
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 142/205 (69%), Positives = 168/205 (81%), Gaps = 2/205 (0%)
Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 172
FD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR RKP +KP++ +Y+ +LHLFGD
Sbjct: 34 FDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSCRKPPEKPYNPDYIGVLHLFGD 93
Query: 173 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIY 230
SE FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L R A+ G ++ PMA+Y
Sbjct: 94 SEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIHTNREQADAVDGKENFPMALY 153
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
VVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+
Sbjct: 154 VVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLK 213
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGV 315
TF FPQSL I+GGKPG STYI GV
Sbjct: 214 ETFMFPQSLCILGGKPGTSTYIAGV 238
>gi|353441084|gb|AEQ94126.1| putative cysteine protease [Elaeis guineensis]
Length = 169
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 113/165 (68%), Positives = 130/165 (78%)
Query: 53 ERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKG 112
+ +LG S T SSTSDIWLLG C+K++ +E+ G NG A F +DFSSRI I+YRKG
Sbjct: 2 QELLGTSSTDALSSTSDIWLLGKCYKLSPEESSGGTDHGNGSAAFLEDFSSRIWITYRKG 61
Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 172
FD IGDSK TSDV WGCM+RSSQMLVAQALLFH LGR WRKP QKP D +Y+EILHLFGD
Sbjct: 62 FDAIGDSKFTSDVRWGCMIRSSQMLVAQALLFHHLGRSWRKPSQKPHDSKYIEILHLFGD 121
Query: 173 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
SE FSIHNLL+AGKAYGLAA WVGPYAMCR+WE + R +R +
Sbjct: 122 SEACAFSIHNLLEAGKAYGLAAREWVGPYAMCRTWETITRAKREQ 166
>gi|384253649|gb|EIE27123.1| peptidase C54 [Coccomyxa subellipsoidea C-169]
Length = 362
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 139/364 (38%), Positives = 194/364 (53%), Gaps = 44/364 (12%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D SRI ++YR+GF PI S ITSDVGWGC LRS QML+AQAL++H +GR WR+ L+ +
Sbjct: 23 DLMSRIWMTYRRGFPPICGSGITSDVGWGCTLRSGQMLLAQALVYHLVGRQWRRKLEAAY 82
Query: 160 DREYVEILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
E ++L FGD E PFSIHN+ G+ +G+ AG W+GP +C + + +
Sbjct: 83 PEEVAQVLQWFGDQACEQRPFSIHNMCTTGQTHGVKAGDWLGPSGLCHTLADMVN-KVQP 141
Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG------QADWTPILLL 271
GL C+ + G GGAPV+C SR + F G + +
Sbjct: 142 GGLQCR-----VVATFG------GGAPVLC---TSRLATAFEGGADRSGGEVGSSGSEES 187
Query: 272 VPLVLGLE-----------KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
P GL K+NPRY L+ T+PQS+GIVGG+P +S Y +G+Q++
Sbjct: 188 GPAGQGLLLLIPLMLGLNGKINPRYCAQLQQLLTWPQSVGIVGGRPSSSLYFIGLQDQHV 247
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
+YLDPH+VQ V + AD TY +R + L +IDPSLAIGFYC DF+D C
Sbjct: 248 LYLDPHEVQEVASEA-----ADLDTYFCSSLRLMPLANIDPSLAIGFYCSSLSDFEDLCG 302
Query: 381 RASKLAEESNGAPLFT-VTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 439
R L E+ APL V + +P ++ + G+P D S G A+ D+
Sbjct: 303 RLRTLEAEAGCAPLVCMVDEDAGEPSWPAEEVLSDEGIPSDAD----SPAPPAGGANRDN 358
Query: 440 WQLL 443
W++L
Sbjct: 359 WEML 362
>gi|413941968|gb|AFW74617.1| hypothetical protein ZEAMMB73_836919 [Zea mays]
Length = 416
Score = 221 bits (563), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 134/263 (50%), Positives = 166/263 (63%), Gaps = 56/263 (21%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
L F +DFSSRI I+YRKGFD I D K+TSDV WGCM+RSSQMLVAQAL+FH LGR WRK
Sbjct: 29 LQVFLEDFSSRIWITYRKGFDAISDFKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRK 88
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P +K L++ +
Sbjct: 89 PPEK------------------------TLIRTNR------------------------- 99
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
++A+ G ++ PM +YVVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVP
Sbjct: 100 EQADAVDGKENFPMELYVVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVP 159
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI- 332
LVLGL+K+NPRYIP L+ TF FPQSLGI+G KPG STYI GVQ++ A+YLDPH+VQ V+
Sbjct: 160 LVLGLDKINPRYIPLLKETFMFPQSLGILGVKPGTSTYIAGVQDDRALYLDPHEVQMVLA 219
Query: 333 NIGKDDLEADTSTYHSDVIRHIH 355
NI + T +D I +IH
Sbjct: 220 NIKWPE------TLETDFIYNIH 236
>gi|145345840|ref|XP_001417407.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577634|gb|ABO95700.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 348
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 125/329 (37%), Positives = 182/329 (55%), Gaps = 19/329 (5%)
Query: 72 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 131
+LGV + DE + ++ + +D+ SR ++YR+GF+ +G +K +D GWGC L
Sbjct: 1 MLGVTYWSKDDECNAEKY-DDARRAWERDWGSRCWMTYRRGFEALGRTKWRTDAGWGCTL 59
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAY 190
RS+QM+VA AL H GR WR+ ++ D E V+ +L +F D ++PFSIH++ + A+
Sbjct: 60 RSAQMMVANALSIHTRGRHWRRQVKAKEDDESVDHVLSMFIDDASAPFSIHSVCETTTAW 119
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG-DEDGERGGAPVVCID 249
G G W P MCR++ AL G +A++VV G +ED GG P ID
Sbjct: 120 GAPPGRWFEPSVMCRAFSALIEAN------GDLRNQIAVHVVGGQNEDDSAGGVPT--ID 171
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
D G+A +LL VPLVLG+ +N RYI LR F QS+G++GG+P A
Sbjct: 172 DGELRAKSADVGKA----LLLFVPLVLGVGRNINTRYISQLRSIIAFKQSIGVIGGRPNA 227
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
S Y+VG ++ YLDPH VQP + + D +Y+ + + +DP+LA+GFY
Sbjct: 228 SLYLVGHSDDVFFYLDPHTVQPANSFAE---AVDFDSYYCSTPLQMRGELLDPTLALGFY 284
Query: 369 CRDKDDFDDFCARASKLAEESNGAPLFTV 397
CRD DD D A LAE + AP+ V
Sbjct: 285 CRDGDDLDSLFASVKALAEANATAPVLDV 313
>gi|281340990|gb|EFB16574.1| hypothetical protein PANDA_012287 [Ailuropoda melanoleuca]
Length = 369
Score = 217 bits (553), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 128/370 (34%), Positives = 187/370 (50%), Gaps = 40/370 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F PIG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 65
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 125
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 239
Q G G + G W GP + + + LA +A+++ + ED
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 177
Query: 240 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
R G P D+SRHC+ F G A W P++LL+PL LGL +N Y+
Sbjct: 178 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 237
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + AD S +
Sbjct: 238 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 297
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDFDD+C + +L+ P+F + + +
Sbjct: 298 CRHPPSRMSIGELDPSIAVGFFCKTEDDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLA 357
Query: 407 HSDVLGETGG 416
DVL + G
Sbjct: 358 CPDVLNVSLG 367
>gi|432853687|ref|XP_004067831.1| PREDICTED: cysteine protease ATG4B-like [Oryzias latipes]
Length = 390
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 120/333 (36%), Positives = 173/333 (51%), Gaps = 12/333 (3%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D +SR+ +YRK F PIG + TSD GWGCMLR QM++A+AL+ LGR WR +
Sbjct: 45 DVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILAEALMCRHLGRDWRWARGRRQ 104
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
EYV IL+ F D + S +SIH + Q G G G W GP A+ +W L
Sbjct: 105 REEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRL 164
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
A + + + + D E G C++ A C++ + A W P++L
Sbjct: 165 AVHVAMDNTVIIEEIKRLCMPWLDIGDREEAGELNGCLEGA---CALVEEETALWKPLVL 221
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
L+PL LGL +N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP
Sbjct: 222 LIPLRLGLSDINEAYIDTLKQCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQP 281
Query: 331 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 390
+ +D D + + +H+ +DPS+A GF+CR +D+FDD+C R L+ +
Sbjct: 282 AVEPSEDGQVPDETYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDWCMRIRGLSCKRG 341
Query: 391 GAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
G P+F + + + D L T + D L
Sbjct: 342 GLPMFELVDSQPTHMVSVDALNLTPDFSDSDRL 374
>gi|301775535|ref|XP_002923195.1| PREDICTED: cysteine protease ATG4B-like [Ailuropoda melanoleuca]
Length = 405
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/366 (34%), Positives = 185/366 (50%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F PIG + TS
Sbjct: 34 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 80
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 81 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 140
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 239
Q G G + G W GP + + + LA +A+++ + ED
Sbjct: 141 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 192
Query: 240 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
R G P D+SRHC+ F G A W P++LL+PL LGL +N Y+
Sbjct: 193 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 252
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + AD S +
Sbjct: 253 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 312
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDFDD+C + +L+ P+F + + +
Sbjct: 313 CRHPPSRMSIGELDPSIAVGFFCKTEDDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLA 372
Query: 407 HSDVLG 412
DVL
Sbjct: 373 CPDVLN 378
>gi|156396522|ref|XP_001637442.1| predicted protein [Nematostella vectensis]
gi|156224554|gb|EDO45379.1| predicted protein [Nematostella vectensis]
Length = 342
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 122/343 (35%), Positives = 178/343 (51%), Gaps = 32/343 (9%)
Query: 58 PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN----GLAEFNQDFSSRILISYRKGF 113
P +T + S IWLLG C+ E + + L EF++ F+S I ++YR+ F
Sbjct: 12 PLKTNFNED-SPIWLLGRCYHAKNYEYTSEQSKQQCQILSLEEFHRHFTSLIWLTYRRSF 70
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---KPLQKPFDREYVEILHLF 170
+ S +TSD GWGCMLRS QM++A L+FH L + WR + + + Y IL F
Sbjct: 71 VQLNGSNLTSDCGWGCMLRSGQMMLASGLIFHFLKKDWRISGRCHSREQEHYYRVILQFF 130
Query: 171 GDS---ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
GD E SPFS+H L+ G+ G AG W GP ++ E +++
Sbjct: 131 GDQDDEERSPFSLHRLVTLGQHTGKQAGDWYGPASVAHILE--------------KAMIS 176
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVN 282
A + + D + V ID+ R C+ Q D W P+++LVP+ LG E +N
Sbjct: 177 ATHPLLHDINIYVAQDCTVYIDEVKRVCTHCRTHQRDCSSGKWRPVIILVPMRLGGEALN 236
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
P YIP ++ FT Q +GI+GG+P S Y VG Q+E I+LDPH QPV++ ++
Sbjct: 237 PIYIPCVKSLFTLDQCIGIIGGRPKHSLYFVGFQDEKMIHLDPHYCQPVVDTTQEKFP-- 294
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
T ++H R +DPS IGFYC +DF+ FC AS++
Sbjct: 295 TESFHCPNPRKTSFKKMDPSCTIGFYCSSHEDFESFCQHASEV 337
>gi|355669955|gb|AER94692.1| ATG4 autophagy related 4-like protein B [Mustela putorius furo]
Length = 390
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 125/359 (34%), Positives = 179/359 (49%), Gaps = 26/359 (7%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 65
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQSDSYFNVLNAFIDRKDSYYSIHQI 125
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V+ RG
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 184
Query: 244 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P D+SRHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 185 PCAGATALPTDSSRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 244
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 245 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCRHPPSR 304
Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C+ +DDFDD+C R +L+ P+F + + + DVL
Sbjct: 305 MGISELDPSIAVGFFCKTEDDFDDWCQRVRQLSLLGGALPMFELVEQQPSHLACPDVLN 363
>gi|410969807|ref|XP_003991383.1| PREDICTED: cysteine protease ATG4B [Felis catus]
Length = 445
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 124/358 (34%), Positives = 177/358 (49%), Gaps = 26/358 (7%)
Query: 66 STSDIWLLGVCHKIA--QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I+ +DE L D A SR+ +YRK F IG + TS
Sbjct: 74 TSEPVWILGRKYSISTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 120
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 121 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 180
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V+ R G
Sbjct: 181 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMEDIRRLCRAGL 239
Query: 244 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P D RHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 240 PCAGAAALPADPGRHCNGFPAGAEVSNRLAPWRPLVLLIPLRLGLTDINEAYVETLKHCF 299
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 300 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFADSCFIPDESFHCQHPPSR 359
Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
+ + +DPS+A+GF+C+ ++DFDD+C R KL+ P+F + + + DVL
Sbjct: 360 MGVRELDPSIAVGFFCQTEEDFDDWCQRVRKLSLLGGALPMFELVEQQPSHLACPDVL 417
>gi|443684303|gb|ELT88258.1| hypothetical protein CAPTEDRAFT_225251 [Capitella teleta]
Length = 410
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 121/352 (34%), Positives = 183/352 (51%), Gaps = 34/352 (9%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
+ S +W+LG + + D LAE +D SR+ ++YRKGFDPIG S TSD
Sbjct: 30 TESPVWILGKQYSVLYD-----------LAELKKDVKSRLWLTYRKGFDPIGGSGPTSDQ 78
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM++AQ+L+ LGR WR K +D +Y EIL +F D ++ +S+ +
Sbjct: 79 GWGCMLRCGQMMLAQSLICRHLGRDWRWTKDK-YDPKYFEILRMFQDKRSAKYSLQVIAS 137
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD------EDGE 239
G + G A G W GP + + L C E + + V+ D +
Sbjct: 138 MGTSEGKAIGEWFGPNTISQVLRKL--CVSDEWSNLVVHVALDNTVIIDDVFCLCKSSKK 195
Query: 240 RGGAPVVCIDDASRHCSVFS-----------KGQAD-WTPILLLVPLVLGLEKVNPRYIP 287
P+ + A +F+ G+ D W P+LL+VPL LGL ++NP YIP
Sbjct: 196 ESNEPIPGVHAACASALLFNGHDPTAEGHDPSGEDDSWRPLLLIVPLRLGLSEINPVYIP 255
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 347
L+ TF QS+GI+GGKP + + +G E+ +Y+DPH QP +++ + E+D S YH
Sbjct: 256 FLKTCLTFKQSVGIIGGKPNHAHWFIGFLEDELVYMDPHTTQPFVDVTQPG-ESDAS-YH 313
Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
+ + +DPS+A+GF+C+ + DF+D C K P+F + Q
Sbjct: 314 CSYSCRMPVSYLDPSVAVGFFCQTEADFEDLCQCIRKYILHGQKTPMFELHQ 365
>gi|27883848|ref|NP_777363.1| cysteine protease ATG4B [Mus musculus]
gi|26324650|dbj|BAC26079.1| unnamed protein product [Mus musculus]
gi|26327423|dbj|BAC27455.1| unnamed protein product [Mus musculus]
gi|26344632|dbj|BAC35965.1| unnamed protein product [Mus musculus]
gi|27763983|emb|CAD43220.1| autophagin-1 [Mus musculus]
Length = 393
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 185/366 (50%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
A + C+ D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRANLPCVGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ ++DF+D+C + KL++ P+F + + +
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360
Query: 407 HSDVLG 412
DVL
Sbjct: 361 CQDVLN 366
>gi|73994337|ref|XP_851977.1| PREDICTED: cysteine protease ATG4B [Canis lupus familiaris]
Length = 394
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 123/359 (34%), Positives = 177/359 (49%), Gaps = 26/359 (7%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 129
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V+ RG
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 188
Query: 244 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P D+SRHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 189 PCAGAAALPADSSRHCNGFPAGAEVTNRLAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 248
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 249 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFIPDESFHCQHPPSR 308
Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C+ + DFDD+C + +L+ P+F + + + DVL
Sbjct: 309 MSIGELDPSIAVGFFCKTEGDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLACPDVLN 367
>gi|440798079|gb|ELR19150.1| cysteine protease, putative [Acanthamoeba castellanii str. Neff]
Length = 434
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 127/333 (38%), Positives = 175/333 (52%), Gaps = 33/333 (9%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F F S + +YR F +G TSD+GWGCMLR+ QM++AQ L H LG WR+
Sbjct: 108 ASFLTHFRSVVWCTYRAAFPRLGSDSYTSDMGWGCMLRTGQMVLAQTLTRHLLGTEWRRQ 167
Query: 155 LQK--PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
+ P Y +++ F D PFS+H + AG YG G W GP M + E L +
Sbjct: 168 SDRSSPL---YAKMVQWFADDPKQPFSLHRIAHAGLKYGKNVGEWFGPSTMAQVLEELLK 224
Query: 213 CQRAETGLG---CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-WTPI 268
+ + +GL CQ +Y+ P+ DD +GQ W P+
Sbjct: 225 -EFSPSGLRAYVCQD--GCLYLDQLRRTATAAHWPLDEDDD---------EGQGKSWAPM 272
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
L+++PL LGL+++N Y P L+ TF PQS+GI GGKP AS Y VG Q++ YLDPH V
Sbjct: 273 LIMLPLRLGLDQLNEDYAPVLKETFRIPQSVGISGGKPRASLYFVGNQDDYVFYLDPHTV 332
Query: 329 QPVINIGK-DDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
QP + D+ A T+H + + IDPSL + FYCR+++DFDDFCARA +
Sbjct: 333 QPAPRFPEVGDVPASEDVYDTFHCSAPLRLPIRDIDPSLCLAFYCRNREDFDDFCARAIQ 392
Query: 385 LAEESNGAPLFTVTQ------THKKPVNHSDVL 411
L+E P+FTV + KP HS+ L
Sbjct: 393 LSE--GPMPIFTVAERMPDYLVRPKPPKHSEKL 423
>gi|148707985|gb|EDL39932.1| autophagy-related 4B (yeast), isoform CRA_a [Mus musculus]
Length = 390
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 65
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 125
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 177
Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 178 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 237
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 238 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 297
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ ++DF+D+C + KL++ P+F + + +
Sbjct: 298 CQHPPSRMGIGELDPSIAVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 357
Query: 407 HSDVLG 412
DVL
Sbjct: 358 CQDVLN 363
>gi|20071131|gb|AAH27184.1| Autophagy-related 4B (yeast) [Mus musculus]
gi|26353914|dbj|BAC40587.1| unnamed protein product [Mus musculus]
gi|74188242|dbj|BAE25791.1| unnamed protein product [Mus musculus]
Length = 393
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ ++DF+D+C + KL++ P+F + + +
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360
Query: 407 HSDVLG 412
DVL
Sbjct: 361 CQDVLN 366
>gi|348513452|ref|XP_003444256.1| PREDICTED: cysteine protease ATG4B-like [Oreochromis niloticus]
Length = 391
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 118/345 (34%), Positives = 178/345 (51%), Gaps = 16/345 (4%)
Query: 92 NGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
N L E ++ D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LG
Sbjct: 34 NALTEKDEILSDVTSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVCRHLG 93
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
R WR + EY+ +L+ F D + S +SIH + Q G G G W GP + + +
Sbjct: 94 RDWRWAKGQKQRDEYISLLNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLK 153
Query: 209 ALARCQRAETGLGCQSLPMAIYVVS--------GDEDGERGGAPVV--CIDDASRHCSVF 258
LA + ++ + + D GE G + C++ A C++
Sbjct: 154 KLAVFDTWSKVVVHVAMDNTVVIEEIKRLCMPWLDACGELEGVGELNGCLEGA---CAMA 210
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
+ A W P++LL+PL LGL +N YI TL+ F PQSLG++GGKP ++ Y +G E
Sbjct: 211 EEETALWRPLVLLIPLRLGLSDINDAYIETLKQCFMLPQSLGVIGGKPNSAHYFIGYVGE 270
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
IYLDPH QP + +D D + + +H+ +DPS+A GF+CR +D+FDD+
Sbjct: 271 ELIYLDPHTTQPAVEPSEDSQVPDETYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDW 330
Query: 379 CARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
C R +L+ P+F + + + D L T + D L
Sbjct: 331 CMRIRRLSCNRGTLPMFELVDSQPSHMVSVDTLNLTPDFSDSDRL 375
>gi|61211813|sp|Q8BGE6.2|ATG4B_MOUSE RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
cysteine endopeptidase; AltName: Full=Autophagin-1;
AltName: Full=Autophagy-related cysteine endopeptidase
1; AltName: Full=Autophagy-related protein 4 homolog B
Length = 393
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ ++DF+D+C + KL++ P+F + + +
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360
Query: 407 HSDVLG 412
DVL
Sbjct: 361 CQDVLN 366
>gi|149711769|ref|XP_001497815.1| PREDICTED: cysteine protease ATG4B [Equus caballus]
Length = 393
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 126/372 (33%), Positives = 181/372 (48%), Gaps = 52/372 (13%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 226
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 188
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEK 280
A G A D+ RHC+ F G A W P++LL+PL LGL
Sbjct: 189 CA------------GAAAFPA--DSDRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTD 234
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFI 294
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
D S + + + +DPS+A+GF+C+ +DDF+D+C + + L+ P+F + +
Sbjct: 295 PDESFHCQHPPSRMSIGELDPSIAVGFFCKTEDDFNDWCQQVTMLSLLGGALPMFELVEQ 354
Query: 401 HKKPVNHSDVLG 412
+ DVL
Sbjct: 355 QPSHLACPDVLN 366
>gi|427787309|gb|JAA59106.1| Putative peptidase family c54 [Rhipicephalus pulchellus]
Length = 517
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/313 (38%), Positives = 172/313 (54%), Gaps = 25/313 (7%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---K 153
F +DFSSR+ +YR+ F PI + ITSD GWGCMLRSSQM++AQA++ H LGR WR
Sbjct: 181 FLEDFSSRLWFTYRREFPPIPGTDITSDCGWGCMLRSSQMMLAQAVVTHVLGRQWRYRRN 240
Query: 154 PLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EAL 210
+ D + +++ LFGD + SPFS+H L+Q G G AG W GP + EAL
Sbjct: 241 NQTEASDYVHRQVVRLFGDRTASASPFSLHKLVQMGHESGKQAGDWYGPSSAAYILKEAL 300
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS-VFSKGQADWTPIL 269
+ E L L + IYV + ++D C S G W ++
Sbjct: 301 EGACQTEQLL----LDLRIYVAQD---------CTIYLEDVRALCRGTRSNGAPLWRSVI 347
Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
+LVP+ LG E++NP YIP ++ + P +G++GG+P S Y +G Q E IYLDPH VQ
Sbjct: 348 ILVPVRLGGEQLNPTYIPCVKGMLSHPNCIGVIGGRPRHSLYFLGWQGEKVIYLDPHYVQ 407
Query: 330 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA--- 386
+++G D D +YH R + +DPS +GFYC+ +D+F+ F +LA
Sbjct: 408 EAVDVGPQDFPLD--SYHCSWPRKMSFYKMDPSCTMGFYCKTEDEFEHFVKDVKQLAVPT 465
Query: 387 EESNGAPLFTVTQ 399
E + P+F V++
Sbjct: 466 ESRHEYPVFLVSE 478
>gi|61211768|sp|Q6DG88.2|ATG4B_DANRE RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
Length = 394
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/338 (35%), Positives = 176/338 (52%), Gaps = 18/338 (5%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 45 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 265
A + + + + + D +RG P D C++ + A W
Sbjct: 165 AVHVAMDNTVVIEEIKR---LCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETALW 221
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
H QP ++ +D D S + +H+ +DPS+A GF+C+ +DDFDD+CA+ K+
Sbjct: 282 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 341
Query: 386 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
+ G P+F + + + +DVL T + D L
Sbjct: 342 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 378
>gi|50369556|gb|AAH76463.1| Atg4b protein, partial [Danio rerio]
Length = 393
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 119/338 (35%), Positives = 176/338 (52%), Gaps = 18/338 (5%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 44 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 103
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 104 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 163
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 265
A + + + + + D +RG P D C++ + A W
Sbjct: 164 AVHVAMDNTVVIEEIKR---LCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETALW 220
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 221 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 280
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
H QP ++ +D D S + +H+ +DPS+A GF+C+ +DDFDD+CA+ K+
Sbjct: 281 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 340
Query: 386 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
+ G P+F + + + +DVL T + D L
Sbjct: 341 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 377
>gi|148237097|ref|NP_001082821.1| cysteine protease ATG4B [Danio rerio]
gi|141795460|gb|AAI34887.1| Atg4b protein [Danio rerio]
Length = 394
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 119/338 (35%), Positives = 176/338 (52%), Gaps = 18/338 (5%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 45 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 265
A + + + + + D +RG P D C++ + A W
Sbjct: 165 AVHVAMDNTVVIEEIKR---LCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETALW 221
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
H QP ++ +D D S + +H+ +DPS+A GF+C+ +DDFDD+CA+ K+
Sbjct: 282 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 341
Query: 386 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
+ G P+F + + + +DVL T + D L
Sbjct: 342 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 378
>gi|397483835|ref|XP_003813096.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pan paniscus]
Length = 405
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 127/385 (32%), Positives = 190/385 (49%), Gaps = 42/385 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 407 HSDVLGETGGVPEDDSLGVMSMNDA 431
DVL + G E + V S+ D+
Sbjct: 361 CPDVLNLSLG--ESCQVQVGSLGDS 383
>gi|119591684|gb|EAW71278.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_a
[Homo sapiens]
Length = 415
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 407 HSDVLGETGG 416
DVL + G
Sbjct: 361 CPDVLNLSLG 370
>gi|298231123|ref|NP_001177212.1| cysteine protease ATG4B [Sus scrofa]
gi|296874484|gb|ADH81747.1| autophagy related 4-like protein B [Sus scrofa]
Length = 393
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 122/364 (33%), Positives = 176/364 (48%), Gaps = 36/364 (9%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQALL LGR WR + Y +LH F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALLCRHLGRGWRWTQWERQPDSYFSVLHAFMDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYVVSG 234
Q G G + G W GP + + +W ALA + + I +
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSALA----VHVAMDNTVVMEEIRRLCR 184
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPT 288
G A D+ RHC+ F W P++LL+PL LGL +N Y T
Sbjct: 185 SSLPRAGAAAFPA--DSDRHCNGFPAEAEVGPRPVPWRPLVLLIPLRLGLTDINAAYTET 242
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + L D S +
Sbjct: 243 LKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVQVTDSCLIPDESFHCQ 302
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS 408
+ + +DPS+A+GF+C+ ++DF+D+C + KL+ P+F + + +
Sbjct: 303 HPPHRMSIAELDPSIAVGFFCQTEEDFNDWCQQVRKLSLLGGALPMFELVEQQPSHLACP 362
Query: 409 DVLG 412
DVL
Sbjct: 363 DVLN 366
>gi|410036442|ref|XP_003950065.1| PREDICTED: cysteine protease ATG4B [Pan troglodytes]
Length = 521
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 127/385 (32%), Positives = 189/385 (49%), Gaps = 42/385 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + +
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476
Query: 407 HSDVLGETGGVPEDDSLGVMSMNDA 431
DVL + G E + V S+ D+
Sbjct: 477 CPDVLNLSLG--ESCQVQVGSLGDS 499
>gi|194381088|dbj|BAG64112.1| unnamed protein product [Homo sapiens]
Length = 510
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 139 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 185
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 186 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 245
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 246 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 297
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 298 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 357
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 358 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 417
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 418 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 477
Query: 407 HSDVL 411
DVL
Sbjct: 478 CPDVL 482
>gi|34531319|dbj|BAC86110.1| unnamed protein product [Homo sapiens]
Length = 468
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 448
Query: 407 HSDVLGETGG 416
DVL + G
Sbjct: 449 CPDVLNLSLG 458
>gi|397483833|ref|XP_003813095.1| PREDICTED: cysteine protease ATG4B isoform 2 [Pan paniscus]
Length = 468
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 448
Query: 407 HSDVLGETGG 416
DVL + G
Sbjct: 449 CPDVLNLSLG 458
>gi|397483831|ref|XP_003813094.1| PREDICTED: cysteine protease ATG4B isoform 1 [Pan paniscus]
Length = 481
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 448
Query: 407 HSDVLG 412
DVL
Sbjct: 449 CPDVLN 454
>gi|355565356|gb|EHH21845.1| hypothetical protein EGK_04999, partial [Macaca mulatta]
Length = 393
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 407 HSDVLG 412
DVL
Sbjct: 361 CPDVLN 366
>gi|90077212|dbj|BAE88286.1| unnamed protein product [Macaca fascicularis]
Length = 393
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 407 HSDVLG 412
DVL
Sbjct: 361 CPDVLN 366
>gi|30410798|ref|NP_847896.1| cysteine protease ATG4B isoform b [Homo sapiens]
Length = 380
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360
Query: 407 HSDVLGETGG 416
DVL + G
Sbjct: 361 CPDVLNLSLG 370
>gi|78101773|pdb|2CY7|A Chain A, The Crystal Structure Of Human Atg4b
Length = 396
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 363
Query: 407 HSDVLG 412
DVL
Sbjct: 364 CPDVLN 369
>gi|380808290|gb|AFE76020.1| cysteine protease ATG4B isoform a [Macaca mulatta]
gi|383416899|gb|AFH31663.1| cysteine protease ATG4B isoform a [Macaca mulatta]
gi|384941198|gb|AFI34204.1| cysteine protease ATG4B isoform a [Macaca mulatta]
Length = 393
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 407 HSDVLG 412
DVL
Sbjct: 361 CPDVLN 366
>gi|71891691|dbj|BAA76787.2| KIAA0943 protein [Homo sapiens]
Length = 396
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 363
Query: 407 HSDVLG 412
DVL
Sbjct: 364 CPDVLN 369
>gi|402889930|ref|XP_003908250.1| PREDICTED: cysteine protease ATG4B [Papio anubis]
Length = 508
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 137 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 183
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 184 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 243
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 244 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 295
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 296 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 355
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 356 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 415
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 416 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 475
Query: 407 HSDVL 411
DVL
Sbjct: 476 CPDVL 480
>gi|5262636|emb|CAB45756.1| hypothetical protein [Homo sapiens]
gi|12653857|gb|AAH00719.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Homo sapiens]
gi|27763981|emb|CAD43219.1| autophagin-1 [Homo sapiens]
gi|117646318|emb|CAL38626.1| hypothetical protein [synthetic construct]
gi|119591687|gb|EAW71281.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_d
[Homo sapiens]
gi|123981932|gb|ABM82795.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [synthetic
construct]
gi|168273130|dbj|BAG10404.1| ATG4 autophagy related 4 homolog B [synthetic construct]
Length = 393
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 407 HSDVLG 412
DVL
Sbjct: 361 CPDVLN 366
>gi|88192732|pdb|2D1I|A Chain A, Structure Of Human Atg4b
gi|88192733|pdb|2D1I|B Chain B, Structure Of Human Atg4b
Length = 398
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 27 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 73
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 74 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 133
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 134 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 185
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 186 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 245
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 246 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 305
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 306 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 365
Query: 407 HSDVLG 412
DVL
Sbjct: 366 CPDVLN 371
>gi|66773074|ref|NP_001019605.1| cysteine protease ATG4A [Danio rerio]
gi|66267494|gb|AAH95617.1| Zgc:111958 [Danio rerio]
Length = 375
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 109/337 (32%), Positives = 171/337 (50%), Gaps = 33/337 (9%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG C+ + ++ E D SR+ +YRK F PIG + +SD GWGC
Sbjct: 26 VWILGACYNVKTKKS-----------ELLSDVRSRLWFTYRKKFSPIGGTGPSSDAGWGC 74
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR WR +K +EY IL F D + S +SIH + Q G
Sbjct: 75 MLRCGQMILAQALICSHLGRDWRWDPEKHQPKEYQRILDCFLDKKDSCYSIHQMAQMGVG 134
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +++YV + V I+
Sbjct: 135 EGKSVGEWYGPNTVAQVLKKLALFDDWNS--------LSVYVSMDN---------TVVIE 177
Query: 250 DASRHC-----SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
D + C + S+ DW P+LL++PL +G+ +NP YI L+ F PQS G++GG
Sbjct: 178 DIKKLCVRADLQLQSQQPLDWRPLLLVIPLRMGINSINPVYIQALKECFKMPQSCGVLGG 237
Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
KP + Y +G ++ IYLDPH Q ++ D S + + + S+DPS+A
Sbjct: 238 KPNLAYYFIGFIDDELIYLDPHTTQQAVDTESGSAVDDQSFHCQRTPHRMKITSLDPSVA 297
Query: 365 IGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+GF+C+ ++DFD +C + + +F + + H
Sbjct: 298 LGFFCKSEEDFDSWCDLVQQELLKKRNLRMFELVEKH 334
>gi|47132611|ref|NP_037457.3| cysteine protease ATG4B isoform a [Homo sapiens]
gi|296434400|sp|Q9Y4P1.2|ATG4B_HUMAN RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
cysteine endopeptidase; AltName: Full=Autophagin-1;
AltName: Full=Autophagy-related cysteine endopeptidase
1; AltName: Full=Autophagy-related protein 4 homolog B;
Short=hAPG4B
gi|62822370|gb|AAY14919.1| unknown [Homo sapiens]
Length = 393
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360
Query: 407 HSDVLG 412
DVL
Sbjct: 361 CPDVLN 366
>gi|354474222|ref|XP_003499330.1| PREDICTED: cysteine protease ATG4B-like [Cricetulus griseus]
Length = 479
Score = 204 bits (518), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 181/365 (49%), Gaps = 40/365 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 108 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 154
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 155 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 214
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 215 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 266
Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYI 286
A + C D+ RHC+ F G W P++LL+PL LGL +N Y+
Sbjct: 267 RLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAYV 326
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 327 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 386
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C ++DF+D+C + KL+ P+F + + +
Sbjct: 387 CQHPPCRMGIGELDPSIAVGFFCETEEDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLV 446
Query: 407 HSDVL 411
DVL
Sbjct: 447 CQDVL 451
>gi|432107261|gb|ELK32675.1| Cysteine protease ATG4B [Myotis davidii]
Length = 394
Score = 204 bits (518), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 123/367 (33%), Positives = 179/367 (48%), Gaps = 42/367 (11%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQALL LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129
Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 226
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAIFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 189
Query: 227 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
A D +G G P + + + W P++LL+PL LGL +N Y
Sbjct: 190 CAEATAFPADSEGHCNGLPA---------GAEVTNRPSLWRPLVLLIPLRLGLTDINEAY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + L D S
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSFLIPDESF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 405
+ + + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 HCQHPPSRMSIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHL 360
Query: 406 NHSDVLG 412
DVL
Sbjct: 361 ACPDVLN 367
>gi|332815902|ref|XP_001162556.2| PREDICTED: cysteine protease ATG4B isoform 1 [Pan troglodytes]
Length = 496
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 182/370 (49%), Gaps = 40/370 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + +
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476
Query: 407 HSDVLGETGG 416
DVL + G
Sbjct: 477 CPDVLNLSLG 486
>gi|410036440|ref|XP_003309622.2| PREDICTED: cysteine protease ATG4B isoform 5 [Pan troglodytes]
Length = 509
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 180/365 (49%), Gaps = 40/365 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + +
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476
Query: 407 HSDVL 411
DVL
Sbjct: 477 CPDVL 481
>gi|328707620|ref|XP_001947296.2| PREDICTED: cysteine protease ATG4B-like isoform 1 [Acyrthosiphon
pisum]
gi|328707622|ref|XP_003243448.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Acyrthosiphon
pisum]
gi|328707624|ref|XP_003243449.1| PREDICTED: cysteine protease ATG4B-like isoform 3 [Acyrthosiphon
pisum]
gi|328707626|ref|XP_003243450.1| PREDICTED: cysteine protease ATG4B-like isoform 4 [Acyrthosiphon
pisum]
Length = 402
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 110/344 (31%), Positives = 178/344 (51%), Gaps = 38/344 (11%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + +W+LG + D L + D SR+ +YRKGF IG++ T
Sbjct: 40 IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCMLR QM++ QAL+F LGR WR K D +Y++IL +F D ++P+SIH
Sbjct: 89 SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ G ++G G W GP + + + LA L ++ V+ D
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLA---------TMDELSSLVFHVALDN------ 192
Query: 243 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
+ I++ + C+V + + W P++L++PL LG+ +NP Y+ +++ FTFPQSL
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKMCFTFPQSL 250
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVIRHIHL 356
G++GG+P + Y +G I+LDPH Q + + D+E + +YH I + +
Sbjct: 251 GVIGGRPNHALYFIGFVGNDVIFLDPHTTQQIGMLPNKDIETEHKIDHSYHCQQINRLPI 310
Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARAS---KLAEESNGAPLFTV 397
++DPSLA F C+ ++DF+ C +++S PL T+
Sbjct: 311 LNMDPSLAACFMCQTENDFNALCHELKVHLVQSDQSPSQPLITI 354
>gi|410206608|gb|JAA00523.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410247746|gb|JAA11840.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410295834|gb|JAA26517.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410352839|gb|JAA43023.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
Length = 393
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 180/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 407 HSDVLG 412
DVL
Sbjct: 361 CPDVLN 366
>gi|14042685|dbj|BAB55353.1| unnamed protein product [Homo sapiens]
Length = 380
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 122/370 (32%), Positives = 183/370 (49%), Gaps = 40/370 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ + PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCYMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360
Query: 407 HSDVLGETGG 416
DVL + G
Sbjct: 361 CPDVLNLSLG 370
>gi|344239232|gb|EGV95335.1| Cysteine protease ATG4B [Cricetulus griseus]
Length = 394
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 23 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 69
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 129
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 181
Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYI 286
A + C D+ RHC+ F G W P++LL+PL LGL +N Y+
Sbjct: 182 RLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAYV 241
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 242 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 301
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C ++DF+D+C + KL+ P+F + + +
Sbjct: 302 CQHPPCRMGIGELDPSIAVGFFCETEEDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLV 361
Query: 407 HSDVLG 412
DVL
Sbjct: 362 CQDVLN 367
>gi|343961553|dbj|BAK62366.1| cysteine protease ATG4B [Pan troglodytes]
Length = 393
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 180/366 (49%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGCALPMFELVEQQPSHLA 360
Query: 407 HSDVLG 412
DVL
Sbjct: 361 CPDVLN 366
>gi|291415044|ref|XP_002723769.1| PREDICTED: APG4 autophagy 4 homolog B [Oryctolagus cuniculus]
Length = 473
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 175/355 (49%), Gaps = 21/355 (5%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
++ +W+LG + + ++ E D +SR+ +YRK F IG + TSD
Sbjct: 103 TSEPVWILGRKYSLLTEKN-----------EILSDVASRLWFTYRKNFPAIGGTGPTSDT 151
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM+ AQAL+ LGR WR QK Y+ +LH F D + S +SIH + Q
Sbjct: 152 GWGCMLRCGQMIFAQALVCRHLGRDWRWTQQKRQPDSYLSVLHAFMDRKDSYYSIHQIAQ 211
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G + G W GP + + + LA + L V+ R P
Sbjct: 212 MGVGEGKSVGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRSSHPC 270
Query: 246 VCIDDASR----HCSVFS-----KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
HC+ F ++ W P++LL+PL LGL +N Y+ TL+L F P
Sbjct: 271 AGAATPPAGADWHCNGFPASTEVTNRSPWRPLVLLIPLRLGLTDINEAYVETLKLCFRMP 330
Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHL 356
QSLG++GGKP ++ Y +G E IYLDPH QP + + D S + + +
Sbjct: 331 QSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDLCFIPDESFHCQHPPCRMSI 390
Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
+DPS+A+GF+C+ ++DF+D+C + KL+ P+F + + + DVL
Sbjct: 391 GELDPSIAVGFFCKTEEDFNDWCQQVRKLSLLGGALPMFELVEQQPPHLACPDVL 445
>gi|344299096|ref|XP_003421224.1| PREDICTED: cysteine protease ATG4B [Loxodonta africana]
Length = 420
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/366 (32%), Positives = 183/366 (50%), Gaps = 40/366 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 49 TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 95
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQALL LGR WR ++ Y +LH F D + S +SIH +
Sbjct: 96 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWAQRRRQPDSYFSVLHAFIDRKDSHYSIHQI 155
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 239
Q G G + G W GP + + + LA + +A+++ + E+
Sbjct: 156 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 207
Query: 240 R-------GGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
R C D S+HC+ G + W P++LL+PL LGL +N Y+
Sbjct: 208 RLCKSSTPCAGAAACPADPSQHCNGLPAGAEAAGRPSTWRPLVLLIPLRLGLTDINEAYV 267
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D + +
Sbjct: 268 ETLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELAGGFSIPDETFH 327
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+++ +DPS+A+GF+C+ ++DF+D+C + KL+ S P+F + + +
Sbjct: 328 CQHPPCRMNIAELDPSIAVGFFCKTEEDFNDWCQQVKKLSLLSGALPMFELVEQQPSHLA 387
Query: 407 HSDVLG 412
DVL
Sbjct: 388 CPDVLN 393
>gi|307174864|gb|EFN65142.1| Cysteine protease ATG4D [Camponotus floridanus]
Length = 477
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 119/373 (31%), Positives = 175/373 (46%), Gaps = 46/373 (12%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
S S +WLLG C+ ++ L A+ N + EF +DF SR
Sbjct: 86 SKESPVWLLGQCYLKKSEDPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFVSR 145
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF----- 159
I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR ++P
Sbjct: 146 IWLTYRREFQILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQPIETLQQ 205
Query: 160 ---DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
DR + I+ FGD SPFSIH L+ G + G AG W GP ++ C
Sbjct: 206 RLDDRNHRMIIKWFGDQSESPFSIHRLVLLGASAGKRAGDWYGPSSVAHLLSQAVECASK 265
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
++ L A+YV V + D C W ++LLVPL L
Sbjct: 266 QSNSNFDHL--AVYVAQD---------CAVYLQDVENICRT---PDGKWKALVLLVPLRL 311
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G +K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q +++ K
Sbjct: 312 GADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDVWK 371
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA--EESNGAPL 394
+D +++H R + L +DPS +GFY +K+ DF + + P+
Sbjct: 372 NDFSL--TSFHCTSPRKMLLSKMDPSCCVGFYFPNKEALTDFMETIQRFVIPNQKTNYPM 429
Query: 395 FTVTQTHKKPVNH 407
F + K + H
Sbjct: 430 FLFCEGSGKDLQH 442
>gi|348577273|ref|XP_003474409.1| PREDICTED: cysteine protease ATG4B [Cavia porcellus]
Length = 412
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 119/361 (32%), Positives = 178/361 (49%), Gaps = 28/361 (7%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +D+ L D A SR+ +YR+ F IG + TS
Sbjct: 39 TSEPVWILGRKYSIFTEKDDILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 85
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 86 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQI 145
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V+ R G
Sbjct: 146 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRTGL 204
Query: 244 P----VVCIDDASRHCSVF--------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
P DA RHC+ F + + W P++LL+PL LGL +N Y+ TL+
Sbjct: 205 PCAGAAALPTDADRHCNGFPTQTEVTNRQSPSLWRPLVLLIPLRLGLTDINEAYVETLKH 264
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D + +
Sbjct: 265 CFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDGCFIPDETFHCQHPP 324
Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 325 CRMGIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVL 384
Query: 412 G 412
Sbjct: 385 N 385
>gi|291226947|ref|XP_002733451.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
kowalevskii]
Length = 356
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 114/311 (36%), Positives = 163/311 (52%), Gaps = 13/311 (4%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + +D + E D SRI I+YRK F IG + TSD GWGC
Sbjct: 26 VWILGKAYHLIRDRS-----------ELLADIKSRIWITYRKNFSAIGGTGPTSDNGWGC 74
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQALL LGR WR ++ + Y +IL LF D + S +SIH + Q G
Sbjct: 75 MLRCGQMILAQALLCKHLGREWRWESREHQNETYCKILKLFLDRKDSCYSIHQIAQMGVG 134
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + L + S+ I VV R C
Sbjct: 135 EGKSIGQWFGPNTVAQVLRKLTLFDDWSSIAVHISMDNTI-VVEDIRKLCRTPLFTECAS 193
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
+ S+ + G W P++L +PL LGL ++NP Y+ L+ FT QSLG++GGKP +
Sbjct: 194 PKAASASLENGGTTYWKPLVLFIPLRLGLTEINPLYLDVLKKCFTLKQSLGMIGGKPNHA 253
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Y +G ++ +YLDPH QPV++I K D TYH +++ +DPS+A+GF+C
Sbjct: 254 HYFIGFYGKTLVYLDPHTTQPVVDINKWASIPD-DTYHCKHPSRMNIMHLDPSIALGFFC 312
Query: 370 RDKDDFDDFCA 380
+ DFDD C
Sbjct: 313 HCESDFDDLCT 323
>gi|410920724|ref|XP_003973833.1| PREDICTED: cysteine protease ATG4B-like [Takifugu rubripes]
Length = 394
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 118/372 (31%), Positives = 185/372 (49%), Gaps = 27/372 (7%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+T +W+LG + + E D +SR+ +YRK F PIG + TSD
Sbjct: 21 ETTEPVWILG-----------NEYSALTEKEEILSDVTSRLWFTYRKSFPPIGGTGPTSD 69
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLL 184
GWGCMLR QM++ QAL+ LGR WR + +EY+ IL+ F D + S +SIH +
Sbjct: 70 TGWGCMLRCGQMILGQALMCRHLGRDWRWVRGQKQRQEYISILNAFIDKKDSYYSIHQIA 129
Query: 185 QAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 234
Q G G G W GP A+ +W L + + + + + + +
Sbjct: 130 QMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRLVVHVAMDNTVVIEEIKRLCMPWLDK 189
Query: 235 DE---DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
E + ER G C++ A C++ + A W P++LL+PL LGL +N YI TL+
Sbjct: 190 AEVFGEPERVGELNGCLEGA---CALSEEEVALWKPLVLLIPLRLGLSDINGAYIETLKK 246
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
F PQSLG++GGKP ++ Y +G IYLDPH Q + + D + +
Sbjct: 247 CFMLPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQTAVEPCEHGQFPDDTYHCQHPP 306
Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
+H+ +DPS+A+GF+CR +D+FDD+C R +L+ + P+F + + + D +
Sbjct: 307 CRMHICELDPSIAVGFFCRTEDEFDDWCMRIRRLSCNKDNLPMFELVDSQPSHLVGVDAI 366
Query: 412 GETGGVPEDDSL 423
T + + L
Sbjct: 367 NLTPDFSDSERL 378
>gi|395851538|ref|XP_003798310.1| PREDICTED: cysteine protease ATG4B [Otolemur garnettii]
Length = 393
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 122/375 (32%), Positives = 180/375 (48%), Gaps = 58/375 (15%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
++ +W+LG + I ++ E D +SR+ +YRK F IG + TSD
Sbjct: 22 TSEPVWILGRKYSIFTEKE-----------ELLSDVASRLWFTYRKNFPAIGGTGPTSDT 70
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q
Sbjct: 71 GWGCMLRCGQMIFAQALVCQHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQ 130
Query: 186 AGKAYGLAAGSWVGP---------YAMCRSWEALA------------RCQR-AETGLGCQ 223
G G + G W GP A+ +W +LA +R T L C
Sbjct: 131 MGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSLPCG 190
Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLG 277
+ P + AP +HC+ F G + W P++LL+PL LG
Sbjct: 191 TAPAS------------SAAP-------DQHCNGFPAGAEVTTRLSPWRPLVLLIPLRLG 231
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
L +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 232 LTDINAAYVETLKRCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEATDS 291
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
L D S + + + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F +
Sbjct: 292 CLVPDESFHCQHPPCRMSIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFEL 351
Query: 398 TQTHKKPVNHSDVLG 412
+ + DVL
Sbjct: 352 VEQPPSHLACPDVLN 366
>gi|74136555|ref|NP_777364.3| cysteine protease ATG4A [Mus musculus]
gi|61211821|sp|Q8C9S8.2|ATG4A_MOUSE RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
cysteine endopeptidase; AltName: Full=Autophagin-2;
AltName: Full=Autophagy-related cysteine endopeptidase
2; AltName: Full=Autophagy-related protein 4 homolog A
gi|59809037|gb|AAH89500.1| Atg4a protein [Mus musculus]
gi|74193939|dbj|BAE36898.1| unnamed protein product [Mus musculus]
Length = 396
Score = 201 bits (510), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 112/353 (31%), Positives = 176/353 (49%), Gaps = 50/353 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D + +
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + + ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352
>gi|296488734|tpg|DAA30847.1| TPA: cysteine protease ATG4B [Bos taurus]
Length = 390
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 122/359 (33%), Positives = 175/359 (48%), Gaps = 26/359 (7%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V++ R
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187
Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P + D+ RHC+ F A W P++LL+PL LGL VN Y TL+ F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307
Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C +DDF+D+C + SKL+ P+F + + + DVL
Sbjct: 308 MSIAELDPSIAVGFFCETEDDFNDWCQQVSKLSLLGGALPMFELVEQQPSHLACPDVLN 366
>gi|47564102|ref|NP_001001170.1| cysteine protease ATG4B [Bos taurus]
gi|61211780|sp|Q6PZ03.1|ATG4B_BOVIN RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related cysteine endopeptidase 2B;
Short=Autophagin-2B; AltName: Full=Autophagy-related
protein 4 homolog B; AltName: Full=bAut2B
gi|45861660|gb|AAS78583.1| Aut2b2 [Bos taurus]
Length = 393
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 122/359 (33%), Positives = 175/359 (48%), Gaps = 26/359 (7%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V++ R
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187
Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P + D+ RHC+ F A W P++LL+PL LGL VN Y TL+ F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307
Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C +DDF+D+C + SKL+ P+F + + + DVL
Sbjct: 308 MSIAELDPSIAVGFFCETEDDFNDWCQQVSKLSLLGGALPMFELVEQQPSHLACPDVLN 366
>gi|417410350|gb|JAA51650.1| Putative cysteine protease required for autophagy, partial
[Desmodus rotundus]
Length = 394
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 122/367 (33%), Positives = 176/367 (47%), Gaps = 42/367 (11%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRRYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQALL LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129
Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 226
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHVAMDNTVVMEDIRRLCRSSLP 189
Query: 227 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
A D +G G P + + + W P++LL+PL LGL +N Y
Sbjct: 190 CAGASAFPADSEGHCNGFPAR---------AEVTNRPSPWRPLVLLIPLRLGLTDINEAY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCSIPDESF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 405
+ + + +DPS+A+GF+C +DDF D+C + KL+ P+F + + +
Sbjct: 301 HCQHPPSRMSIGELDPSIAVGFFCETEDDFGDWCQQVKKLSLLGGALPMFELVEQQPSHL 360
Query: 406 NHSDVLG 412
DVL
Sbjct: 361 ACPDVLN 367
>gi|308802424|ref|XP_003078525.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
gi|116056978|emb|CAL51405.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
Length = 424
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 126/364 (34%), Positives = 182/364 (50%), Gaps = 57/364 (15%)
Query: 72 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 131
+ GV H ++ + G+ + G E+ +D+ SR ++YR+GF+ +G +K +D GWGC L
Sbjct: 42 MFGVTH-WDRETSSGERSNEVGRREWERDWRSRCWMTYRRGFEALGRTKWCTDAGWGCTL 100
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDRE----------------------------- 162
RS+QM++A AL H GR WR+ +Q E
Sbjct: 101 RSAQMMLANALSIHSRGRHWRREVQLVAVHENETADDGSKSPAVSFLSGVVNKLKIPQSE 160
Query: 163 --------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
+IL LF D +PFSIH + + +G G W P MCR++EAL
Sbjct: 161 RTRAGSDAQEDILRLFADEVGAPFSIHRVCEKTTEWGAPPGRWFEPSVMCRAFEALV--- 217
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 274
AE LG + + ++VVSG E GE GG P V D+A G+A +LL VP+
Sbjct: 218 -AEHDLGSE---LTVHVVSGRE-GEDGGVPTV--DEAEVRAKSADVGKA----LLLFVPV 266
Query: 275 VLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
VLG+ + +N RY+ LR F QS+GIVGG+P +S Y+VG ++ YLDPH VQ +
Sbjct: 267 VLGVGRTINARYLSQLRSMMAFKQSVGIVGGRPNSSLYLVGHSDDVFFYLDPHTVQVASS 326
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
+ D E +Y+ H+ +DP+LA+GFYCRD DD LA + AP
Sbjct: 327 MVTMDFE----SYYCPTPLHVCGGDLDPTLALGFYCRDGDDVASLLVDIEALARVNATAP 382
Query: 394 LFTV 397
+
Sbjct: 383 ALAI 386
>gi|148233205|ref|NP_001088025.1| cysteine protease ATG4B [Xenopus laevis]
gi|61211762|sp|Q640G7.1|ATG4B_XENLA RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|52221191|gb|AAH82660.1| LOC494717 protein [Xenopus laevis]
Length = 384
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 115/343 (33%), Positives = 169/343 (49%), Gaps = 36/343 (10%)
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
D +SR+ +YR+ F IG + TSD GWGCMLR QM+ AQAL+ +GR WR QKP
Sbjct: 44 NDITSRLWFTYRRNFQAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHVGRDWRWDKQKP 103
Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 218
EY+ IL F D + S +SIH + Q G G G W GP + + LA + +
Sbjct: 104 -KGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKYIGQWYGPNTVAQVLRKLAVFDQWSS 162
Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-------------- 264
+A+++ + V +D+ R C S +D
Sbjct: 163 --------IAVHIAMDN---------TVVVDEIRRLCRAGSGESSDAGALSNGYTGDSDP 205
Query: 265 ----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
W P++LL+PL LGL ++N YI TL+ F PQSLG++GG+P ++ Y +G +
Sbjct: 206 SCAQWKPLVLLIPLRLGLSEINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDEL 265
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
IYLDPH Q + D S + +H+ IDPS+A+GF+C ++DF+D+C
Sbjct: 266 IYLDPHTTQLSVEPSDCSFIEDESFHCQHPPCRMHVSEIDPSIAVGFFCSSQEDFEDWCQ 325
Query: 381 RASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
KL+ P+F V +++ DVL T + D L
Sbjct: 326 HIKKLSLSGGALPMFEVVDQLPLHLSNPDVLNLTPDSSDADRL 368
>gi|440901286|gb|ELR52261.1| Cysteine protease ATG4B, partial [Bos grunniens mutus]
Length = 393
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 121/359 (33%), Positives = 174/359 (48%), Gaps = 26/359 (7%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V++ R
Sbjct: 129 AQMGVGEGKSVGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187
Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P + D+ RHC+ F A W P++LL+PL LGL VN Y TL+ F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307
Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 308 MSIAELDPSIAVGFFCETEDDFNDWCQQVRKLSLLGGALPMFELVEQQPSHLACPDVLN 366
>gi|328874598|gb|EGG22963.1| hypothetical protein DFA_05093 [Dictyostelium fasciculatum]
Length = 432
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 109/313 (34%), Positives = 171/313 (54%), Gaps = 11/313 (3%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR-LGRPWR 152
+ EF +DFS+++ SYR+GF+ IGDS +D GWGCMLRS QML+A LL + +G+ W+
Sbjct: 88 IEEFLEDFSNKLWCSYRQGFECIGDSLFENDCGWGCMLRSGQMLLANVLLLNSPIGKDWK 147
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALA 211
KP + ++ +++ LF D ++PFSIHN+ G+ + G + G W P + + AL
Sbjct: 148 KPQNGEYPEDFYKVVRLFLDRPSAPFSIHNIALHGRNHLGKSIGEWFAPSNISNAIRALV 207
Query: 212 -RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC----SVFSKGQADWT 266
+ G + + + V DD S + + + W
Sbjct: 208 YKYDNHLNGTSEEDSSDEEKEGKKKKGDNQCNLSVYVSDDGSLYIDQLLEIALRSDGSWM 267
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+L+L+P LG++ +N Y L +TFPQ+LGIVGGKP AS Y + Q+++ YLDPH
Sbjct: 268 PLLILIPTKLGIDTINEIYYRPLLDIYTFPQNLGIVGGKPRASLYFIASQDDNLFYLDPH 327
Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
VQ I + D + S+Y ++ + ++ +DPSL I F+C K+ F DF R+ KL
Sbjct: 328 TVQNSI---ESDSDFSLSSYFCNIPKKANISEVDPSLVIPFFCSTKESFLDFLERSKKL- 383
Query: 387 EESNGAPLFTVTQ 399
E S+ PL+ + +
Sbjct: 384 ESSSEFPLYNIQE 396
>gi|27763985|emb|CAD43221.1| autophagin-2 [Mus musculus]
gi|148675648|gb|EDL07595.1| mCG64870 [Mus musculus]
Length = 396
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 111/353 (31%), Positives = 174/353 (49%), Gaps = 50/353 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPFKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + L + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLTLFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTVSNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D + +
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + + ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352
>gi|260795879|ref|XP_002592932.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
gi|229278156|gb|EEN48943.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
Length = 380
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 116/349 (33%), Positives = 179/349 (51%), Gaps = 41/349 (11%)
Query: 71 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
W+LGV + +D E D SSR+ +YRK F PIG + SD GWGCM
Sbjct: 32 WILGVGYNTVKDRQ-----------ELQNDISSRLWFTYRKNFTPIGGTGPMSDQGWGCM 80
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
LR QM++ QAL+ LGR WR +D +Y +IL LF D + S +SIH + Q G +
Sbjct: 81 LRCGQMMLGQALICRHLGRDWRWK-SAVYDNDYTKILQLFLDKKDSCYSIHQIAQMGVSE 139
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE----------DGER 240
G + G W GP + + + LA + + +AI+V + R
Sbjct: 140 GKSVGQWFGPNTVAQVLKKLALFEDWSS--------LAIHVAMDNTVIIDDIKKLCRSAR 191
Query: 241 GGAP------VVCIDDASRHCSVFSKGQA-DWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P +C ++ S S+ A W P++L++PL LGL ++NP Y L+ F
Sbjct: 192 QPTPSQVTNSFLCNGVSAEQTSARSRSPALPWQPLMLIIPLRLGLSELNPVYTDCLKACF 251
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
T QSLG++GGKP + Y +G S +YLDPH QP + + + ++ S++H
Sbjct: 252 TLRQSLGMIGGKPNHAHYFIGYVGNSLVYLDPHTTQPAVEL-EGNVPIPDSSFHCTHPSR 310
Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKL--AEESNGAPLFTVTQT 400
+++ +DPS+A+GF+C+D+ DF D C +L +++ A +F V Q+
Sbjct: 311 MNIQDLDPSIALGFFCQDEADFADLCENMRRLIIGQKTQNA-MFEVVQS 358
>gi|18181958|dbj|BAB83890.1| Apg4B [Homo sapiens]
Length = 392
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 121/366 (33%), Positives = 181/366 (49%), Gaps = 41/366 (11%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEK-SIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 179
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 180 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 239
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 240 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 299
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
+ + ++DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 300 CQHPPCRMSIANLDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 359
Query: 407 HSDVLG 412
DVL
Sbjct: 360 CPDVLN 365
>gi|417410362|gb|JAA51656.1| Putative cysteine protease required for autophagy, partial
[Desmodus rotundus]
Length = 396
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 175/343 (51%), Gaps = 27/343 (7%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
G + G W GP A+ W +LA + + + + V+ S D
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADMPS 195
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
E P+ +A+ H S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 196 ESSHDPL----NATNHNKAISACCPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ + D + + + + + +
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQSPQRMSILN 311
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 312 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 353
>gi|149244060|pdb|2Z0D|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-120) Complex
gi|149244062|pdb|2Z0E|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-124) Complex
Length = 357
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 176/353 (49%), Gaps = 40/353 (11%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDP QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPATTQPAVEPTDGCFIPDESFH 303
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVE 356
>gi|224510547|pdb|2ZZP|A Chain A, The Crystal Structure Of Human Atg4b(C74s)- Lc3(1-124)
Complex
Length = 357
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 176/353 (49%), Gaps = 40/353 (11%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWG MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGSMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVE 356
>gi|338729393|ref|XP_001490718.3| PREDICTED: cysteine protease ATG4A [Equus caballus]
Length = 398
Score = 197 bits (502), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 111/343 (32%), Positives = 179/343 (52%), Gaps = 27/343 (7%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDEDG 238
G + G W GP A+ W +LA + + + + I +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTAG 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
E +P ++ ++R S S G W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 E---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 313
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|26334447|dbj|BAC30924.1| unnamed protein product [Mus musculus]
Length = 396
Score = 197 bits (502), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 111/353 (31%), Positives = 175/353 (49%), Gaps = 50/353 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+Y + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYDSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D + +
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + + ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352
>gi|189233733|ref|XP_971091.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270015047|gb|EFA11495.1| hypothetical protein TcasGA2_TC014208 [Tribolium castaneum]
Length = 453
Score = 197 bits (502), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 172/346 (49%), Gaps = 60/346 (17%)
Query: 65 SSTSDIWLLGVCHK-----------------IAQDEALGDAAGNNGLAEFNQDFSSRILI 107
S S +WLLG C++ Q ++ ++ + G F +DF SR+ +
Sbjct: 63 SKESPVWLLGKCYRRIESPSSDSTELGTDVAAFQSQSEIASSDDEGFEGFKKDFISRLWL 122
Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDRE-YVE 165
+YR+ F + S +SD GWGCMLRS QML+AQAL+ H LGR WR +P +P RE ++E
Sbjct: 123 TYRREFPILNGSNYSSDCGWGCMLRSGQMLIAQALVCHILGRDWRWQPDHQPTTRESFIE 182
Query: 166 ------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
I+ FGD S SPFSIH L+ G+A G AG W GP
Sbjct: 183 VVNHRKIIKWFGDKPSRNSPFSIHTLVALGEASGKKAGDWYGP----------------- 225
Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--------GQADWTPIL 269
G A S ED + VC+ ++ C+V+ K W ++
Sbjct: 226 -GFVAHLFRQAFKRAS--EDNYEFDSLTVCV---AQDCAVYIKDVMEECTDKNGKWKSLI 279
Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
LL+P+ LG EK N Y P L F+ Q +GI+GG+P S Y VG Q++ I+LDPH Q
Sbjct: 280 LLIPVRLGAEKFNSIYAPCLTTLFSLKQCIGIIGGRPKHSLYFVGYQDDKLIHLDPHYCQ 339
Query: 330 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
V+++ D +++H R IHL +DPS IGFYC K+ F
Sbjct: 340 EVVDVWAVDFP--LTSFHCRSPRKIHLSKMDPSCCIGFYCPTKESF 383
>gi|348563665|ref|XP_003467627.1| PREDICTED: cysteine protease ATG4A-like [Cavia porcellus]
Length = 398
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/342 (31%), Positives = 175/342 (51%), Gaps = 25/342 (7%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
G + G W GP A+ W +LA + + + + V+ D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPFSADTAD 197
Query: 241 GGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
+P I + S+ S F W P+LL+VPL LG+ ++NP Y+ + F PQSL
Sbjct: 198 KSSPDSFITSNQSKDTSAFCPA---WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSL 254
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 359
G +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ ++
Sbjct: 255 GALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQQMNILNL 314
Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 315 DPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|118404310|ref|NP_001072464.1| autophagy related 4B, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|115291929|gb|AAI21871.1| cysteine endopeptidase AUT-like (1O128) [Xenopus (Silurana)
tropicalis]
Length = 384
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 118/335 (35%), Positives = 172/335 (51%), Gaps = 20/335 (5%)
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
D +SR+ +YR+ F IG + TSD GWGCMLR QM+ AQALL +GR WR QK
Sbjct: 44 NDITSRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHIGRDWRWDKQKS 103
Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 218
EY+ IL F D + S +SIH + Q G G G W GP + + LA + +
Sbjct: 104 -QGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKCIGQWYGPNTVAQVLRKLAVFDQWSS 162
Query: 219 GLGCQSLPMAIYV-----VSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPI 268
+A+++ V DE A +A C+ ++ G +D W P+
Sbjct: 163 --------IAVHIAMDNTVVMDEIRRLCRAGTNESSEAGALCNGYT-GVSDPSCSLWKPL 213
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
+LL+PL LGL +N YI TL+ F PQSLG++GG+P ++ Y +G + IYLDPH
Sbjct: 214 VLLIPLRLGLSDINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDELIYLDPHTT 273
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
Q + D S + +H+ IDPS+A+GF+CR ++DF+D+C + KL+
Sbjct: 274 QLAVEPSDCCFVEDESFHCQHPPCRMHVSEIDPSIAVGFFCRSQEDFEDWCQQIKKLSLS 333
Query: 389 SNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
P+F V +++ DVL T + D L
Sbjct: 334 GGALPMFEVVDQLPLHLSNPDVLNLTPDSSDADRL 368
>gi|449676306|ref|XP_002158689.2| PREDICTED: cysteine protease ATG4C-like [Hydra magnipapillata]
Length = 442
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 117/344 (34%), Positives = 176/344 (51%), Gaps = 28/344 (8%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNN------GLAEFNQDFSSRILISYRKGFDPIGDS 119
S S IWLLG C+ Q E A N G+ F +DFSS I +SYRK F + +S
Sbjct: 63 SDSPIWLLGRCYYAKQAEYDSKNAVQNTQYKIHGIDCFFEDFSSLIYLSYRKHFSQLANS 122
Query: 120 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGD--SET 175
+TSD GWGCMLR+ QML+A ALL H L WR +K ++ Y+ IL F D S+
Sbjct: 123 NLTSDSGWGCMLRTGQMLLANALLIHMLKEGWRISERKYTEKNYIYRMILRFFNDENSDN 182
Query: 176 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 234
SPFS+H L++ G G W GP ++ + A + S P + + V
Sbjct: 183 SPFSLHELVRIGSK---KPGEWYGPTSVAHTLSA---------AVNLTSHPVLDTFRVYV 230
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
D V+ ++C+ + + W +L+LVP+ LG + +NP YIP L+ T
Sbjct: 231 ANDCTVYIKDVISTSTKCKNCTKKTCQEKFWRSMLILVPIRLGSDGLNPIYIPCLKALLT 290
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
+GI+GG+P S Y VG Q + I LDPH +Q +++ + ++ H + +
Sbjct: 291 LDYCVGIIGGRPKHSLYFVGFQGKKLINLDPHYLQEYVDMTTQEFPVESFRCHYP--KKM 348
Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE---ESNGAPLF 395
+DPS A+GFYCR ++DF+ C +A ++ + + P+F
Sbjct: 349 AFKKMDPSCAVGFYCRTREDFESLCKQAVEMLKPPMQRTEYPMF 392
>gi|197100863|ref|NP_001126588.1| cysteine protease ATG4A [Pongo abelii]
gi|61211744|sp|Q5R699.1|ATG4A_PONAB RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|55732020|emb|CAH92717.1| hypothetical protein [Pongo abelii]
Length = 398
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 178/356 (50%), Gaps = 53/356 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 285
D + C V SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRVLPLGADTAGDRPPDSLTASNLSKGTSAYCSAWKPLLLIVPLRLGINQINPVY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ G++ D +
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTGENGTVNDQTF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 301 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|156395764|ref|XP_001637280.1| predicted protein [Nematostella vectensis]
gi|156224391|gb|EDO45217.1| predicted protein [Nematostella vectensis]
Length = 368
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 113/334 (33%), Positives = 174/334 (52%), Gaps = 39/334 (11%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
+ D+W+LG + I Q GD + N D SRI ++YRK F IG + T+D
Sbjct: 26 TEEDVWILGKRYNILQ----GD------MGYLNTDVRSRIWLTYRKNFPKIGGTGPTTDS 75
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM++AQAL+ LGR W+ + EY++IL F D + S +SIH + Q
Sbjct: 76 GWGCMLRCGQMMLAQALVCRHLGRDWQWDPENNTTPEYMQILEAFLDKKDSLYSIHQIAQ 135
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G + G A GSW GP + + + L+ + + ++V +
Sbjct: 136 MGVSEGKAVGSWFGPNTVAQVLKKLSAFDDWSS--------LCLHVAMDN---------T 178
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
V I+D S +W P++L +PL LGL ++N Y L+ FTF QSLGI+GG+
Sbjct: 179 VIIEDIS-----------NWRPLVLFIPLRLGLTEMNVVYNEPLKACFTFKQSLGIIGGR 227
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +TY +G + +YLDPH Q +N + D S +H +++ +DPS+A+
Sbjct: 228 PNHATYFIGYFGNNLVYLDPHTTQQTVNPDELSRIPDGS-FHCVYPCRMNIADVDPSVAL 286
Query: 366 GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
GF+C+ ++DFDD C + K + P+F + +
Sbjct: 287 GFFCKSEEDFDDLCQQIQKKIIDGKSRPMFEIAK 320
>gi|449666316|ref|XP_002168183.2| PREDICTED: cysteine protease ATG4B-like [Hydra magnipapillata]
Length = 436
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 174/366 (47%), Gaps = 44/366 (12%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG K +D + +FN + ++ +YR+ F PIG + SD GWGC
Sbjct: 31 VWILGKHFKPDED-----------MEKFNAEILTKFWFTYRRNFHPIGGTGPMSDTGWGC 79
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQALL LGR W + + Y+ ILH F D + S +SIH + Q G
Sbjct: 80 MLRCGQMMLAQALLCRHLGRDWDWRSGRKDNEIYMMILHSFLDKKDSLYSIHQIAQMGVG 139
Query: 190 YGLAAGSWVGPYAMCRSWEALAR-------------------------CQRAETGLGCQS 224
G G W GP + + + L C+ + GC
Sbjct: 140 EGKQIGQWFGPNTVAQVIKKLVLFDDNADMAVHVAMDNTVVIEDIKKLCKSSINAWGCYG 199
Query: 225 LPMAIYVVSGDEDGERGGAPVVCIDDASRHC-------SVFSKGQADWTPILLLVPLVLG 277
I+ S + P C ++S+ S S+ W P+LL +PL LG
Sbjct: 200 ECSYIHDRSSLTGNQSVSKPPHCSCESSQKLKSNRKLKSFNSEELQSWRPLLLFIPLRLG 259
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
L ++N Y +L++ FT QSLG++GGKP + Y +G + +YLDPH Q I +
Sbjct: 260 LSEINSDYYNSLKIMFTLRQSLGVIGGKPNHAHYFIGFNGDRLLYLDPHTTQQTIEPERF 319
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
++ D S +H + S+DPS+A+GFYC +DDFDD+C ++L + P+F +
Sbjct: 320 NVIPDES-FHCVYPCFMSFQSLDPSVALGFYCHTEDDFDDWCQAVNELVVQREKRPMFEI 378
Query: 398 TQTHKK 403
QT +
Sbjct: 379 NQTRPR 384
>gi|146387686|pdb|2P82|A Chain A, Cysteine Protease Atg4a
gi|146387687|pdb|2P82|B Chain B, Cysteine Protease Atg4a
gi|146387688|pdb|2P82|C Chain C, Cysteine Protease Atg4a
gi|146387689|pdb|2P82|D Chain D, Cysteine Protease Atg4a
Length = 355
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 25 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 74 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 133
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 134 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 193
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 194 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 246
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 247 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 306
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 307 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 351
>gi|402911087|ref|XP_003918174.1| PREDICTED: cysteine protease ATG4A isoform 1 [Papio anubis]
Length = 398
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 179/351 (50%), Gaps = 43/351 (12%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192
Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
D G+R + + + S HC W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 193 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 245
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D + +
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVNDQTFHCLQS 305
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|344286328|ref|XP_003414911.1| PREDICTED: cysteine protease ATG4A [Loxodonta africana]
Length = 411
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 177/356 (49%), Gaps = 53/356 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + + + + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 42 VWILGKQHLLKTERS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 90
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 91 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 150
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 151 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 193
Query: 250 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 285
D + C VF SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 194 DIKKMCCVFPLSAGAAGESPPAFPSASSQSKGTSACCPAWKPLLLIVPLRLGINQINPVY 253
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D +
Sbjct: 254 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTF 313
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + +++ ++DPS+A+GF+C+++ DFD++C K + N +F + Q H
Sbjct: 314 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCCLVQKEILKEN-LRMFELVQKH 368
>gi|403289551|ref|XP_003935915.1| PREDICTED: cysteine protease ATG4A isoform 1 [Saimiri boliviensis
boliviensis]
Length = 422
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 111/348 (31%), Positives = 179/348 (51%), Gaps = 37/348 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 216
Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
D G+R + ++ SR S + W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 217 ADTPGDRPPDSLTASNE-SRGTSAYCPA---WKPLLLIVPLRLGINQINPVYVDAFKECF 272
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + +
Sbjct: 273 KMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQR 332
Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 333 MNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 379
>gi|350537069|ref|NP_001233457.1| cysteine protease ATG4A [Pan troglodytes]
gi|343958112|dbj|BAK62911.1| cysteine protease ATG4A [Pan troglodytes]
gi|410207960|gb|JAA01199.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410248796|gb|JAA12365.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410290856|gb|JAA24028.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410329967|gb|JAA33930.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
Length = 398
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 180/351 (51%), Gaps = 43/351 (12%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
G + G W GP A+ W +LA + + C+ LP++I
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSI---- 193
Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
D G+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 194 -DTPGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 245
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|395528686|ref|XP_003766458.1| PREDICTED: cysteine protease ATG4B [Sarcophilus harrisii]
Length = 393
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 115/364 (31%), Positives = 179/364 (49%), Gaps = 36/364 (9%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
+T +W+LG + I ++ E D +SR+ +YRK F IG + TSD
Sbjct: 22 TTEPVWILGRKYTIFTEKE-----------EILSDVTSRLWFTYRKNFPAIGGTGPTSDT 70
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM+ AQAL+ LGR WR + Y +L+ F D + S +SIH + Q
Sbjct: 71 GWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQ 130
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGER 240
G G + G W GP + + + LA + +A+++ V +E
Sbjct: 131 MGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRL 182
Query: 241 GGAPVVCIDDAS--RHCSVFSKGQ----------ADWTPILLLVPLVLGLEKVNPRYIPT 288
A C D A+ + S G + W P++LL+PL LGL +N Y T
Sbjct: 183 CKAGFPCADGAAFPTDSELLSNGYPPAAEVTDRASPWRPLVLLIPLRLGLTDINEAYTET 242
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + + D + +
Sbjct: 243 LKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVESTEGGVFPDETFHCQ 302
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS 408
+++ +DPS+A+GF+C+ ++DF+D+C + KL+ P+F + + +
Sbjct: 303 HPPCRMNIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSRIPGALPMFELVERQPSHFSCP 362
Query: 409 DVLG 412
DVL
Sbjct: 363 DVLN 366
>gi|119623100|gb|EAX02695.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_f
[Homo sapiens]
Length = 402
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 201
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 202 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 254
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 255 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 314
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 315 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 359
>gi|62860068|ref|NP_001016619.1| autophagy related 4A, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|89269917|emb|CAJ81691.1| APG4 autophagy 4 homolog A (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
gi|171846953|gb|AAI61565.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
gi|213625518|gb|AAI70776.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
gi|213627145|gb|AAI70802.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
Length = 395
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 113/319 (35%), Positives = 163/319 (51%), Gaps = 27/319 (8%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR WR
Sbjct: 49 LQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWRWEKH 108
Query: 157 KPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 109 KEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 168
Query: 217 ETGLGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSK-----GQAD 264
+ +A+Y VV D P C + A+ + S +S+ GQ+
Sbjct: 169 NS--------LAVYVSMDNTVVIEDIKTMCKYQPHSCSMAQAASYQSTWSRCRDASGQSS 220
Query: 265 -WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G + IYL
Sbjct: 221 GWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEIIYL 280
Query: 324 DPHDVQPVINIGKDDLEADTSTYHSDV-IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
DPH Q + D E TYH + + ++DPS+A+GF+C+D++DFD++C
Sbjct: 281 DPHTTQTFV-----DTEDQDQTYHCQKGPNSMKVLNLDPSVALGFFCKDENDFDNWCEVI 335
Query: 383 SKLAEESNGAPLFTVTQTH 401
K + +F +T H
Sbjct: 336 EKEILKHQSLRMFELTPKH 354
>gi|47212536|emb|CAF90552.1| unnamed protein product [Tetraodon nigroviridis]
Length = 366
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 111/324 (34%), Positives = 163/324 (50%), Gaps = 41/324 (12%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
D +SR+ +YRKGF PIG + TSD GWGCMLR QM++ QAL+ LGR WR +
Sbjct: 68 DVTSRLWFTYRKGFPPIGGTGPTSDTGWGCMLRCGQMILGQALMCRHLGRDWRWVSGEEQ 127
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
EYV IL+ F D + S +SIH + + +C W A A G
Sbjct: 128 RHEYVNILNAFIDKKDSYYSIHQIER-----------------LCMPWLDKAEACAASEG 170
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
+G + +G GA C+ + A W P++LL+PL LGL
Sbjct: 171 VG-------------ELNGYLEGA-----------CAFSEEETALWKPLVLLIPLRLGLT 206
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH Q ++ +D
Sbjct: 207 DINEAYIETLKKCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQTAVDPCEDGT 266
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
D S + +H+ +DPS+A GF+CR +D+FDD+C R +L+ + P+F + +
Sbjct: 267 FTDDSYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDWCMRIRRLSCNRDNLPMFELVE 326
Query: 400 THKKPVNHSDVLGETGGVPEDDSL 423
+ + D + T + + L
Sbjct: 327 SQPSHMVSVDAINLTPDFSDSERL 350
>gi|195113543|ref|XP_002001327.1| GI10728 [Drosophila mojavensis]
gi|193917921|gb|EDW16788.1| GI10728 [Drosophila mojavensis]
Length = 682
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 162/314 (51%), Gaps = 17/314 (5%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + L ++ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 262 AAENQLAESPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 321
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S+ SPFSIH L++ G+ G
Sbjct: 322 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 381
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDE---DGERGGAP 244
G W GP ++ + AL R S+ +A IY+ +E E P
Sbjct: 382 KPGDWYGPASVSYLLKHALEHAARENADFDNISVYVAKDCTIYIQDIEELCSIPEPAPKP 441
Query: 245 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
V A R S K W +++L+PL LG +K+NP Y L+L + LGI+GG
Sbjct: 442 HVPWQQAKRSTSDAPKPDQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEYCLGIIGG 501
Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
KP S Y VG QE+ I+LDPH Q ++++ ++ ++H R + +DPS
Sbjct: 502 KPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETFP--MHSFHCKSPRKLKSSKMDPSCC 559
Query: 365 IGFYCRDKDDFDDF 378
IGFYC K DFD F
Sbjct: 560 IGFYCPTKTDFDSF 573
>gi|410989157|ref|XP_004000831.1| PREDICTED: cysteine protease ATG4A isoform 1 [Felis catus]
Length = 398
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 107/341 (31%), Positives = 176/341 (51%), Gaps = 23/341 (6%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
G + G W GP A+ W +LA + + + + V+ D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTVG 197
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
P ++ +++ F+ A W P+LL+VPL LG+ ++NP Y+ + F PQSLG
Sbjct: 198 ESTPGT-LNASNQSRGTFACCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSLG 255
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
+GGKP + Y +G + I+LDPH Q +N ++ D + + + +++ ++D
Sbjct: 256 ALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVNTEENGTVDDQTFHCLQSPQRMNILNLD 315
Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
PS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 316 PSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|397497900|ref|XP_003819741.1| PREDICTED: cysteine protease ATG4A isoform 1 [Pan paniscus]
Length = 398
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTPG 197
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 311 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|332226092|ref|XP_003262223.1| PREDICTED: cysteine protease ATG4A isoform 1 [Nomascus leucogenys]
Length = 398
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTAG 197
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 311 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|30795252|ref|NP_443168.2| cysteine protease ATG4A isoform a [Homo sapiens]
gi|426397036|ref|XP_004064734.1| PREDICTED: cysteine protease ATG4A isoform 1 [Gorilla gorilla
gorilla]
gi|61211859|sp|Q8WYN0.1|ATG4A_HUMAN RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
cysteine endopeptidase; AltName: Full=Autophagin-2;
AltName: Full=Autophagy-related cysteine endopeptidase
2; AltName: Full=Autophagy-related protein 4 homolog A;
Short=hAPG4A
gi|18181956|dbj|BAB83889.1| Apg4A [Homo sapiens]
gi|27763979|emb|CAD43218.1| autophagin-2 [Homo sapiens]
gi|38197608|gb|AAH61696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
gi|119623094|gb|EAX02689.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|189069378|dbj|BAG37044.1| unnamed protein product [Homo sapiens]
gi|312151352|gb|ADQ32188.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [synthetic
construct]
Length = 398
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 197
Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 311 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|332026942|gb|EGI67039.1| Cysteine protease ATG4D [Acromyrmex echinatior]
Length = 392
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 126/402 (31%), Positives = 190/402 (47%), Gaps = 49/402 (12%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
S S +WLLG+C+ + L A+ N + EF +DF SR
Sbjct: 6 SKESPVWLLGLCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 65
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREY 163
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P Q + +
Sbjct: 66 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQSTDESSH 125
Query: 164 VEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRAE 217
I+ FGD T SPFSIH L+ G + G AG W GP + +C++ E RA
Sbjct: 126 RMIIKWFGDQPTPESPFSIHKLVSLGASTGKRAGDWYGPSSVAHLLCQAME------RAS 179
Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 277
+ +A+YV + V C D R ++LLVPL LG
Sbjct: 180 EDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGGR------------KALILLVPLRLG 227
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
+K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q +++ +
Sbjct: 228 ADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDVEGN 287
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA--EESNGAPLF 395
+ + +++H R + L +DPS +GFY DK+ DF + ++ P+F
Sbjct: 288 E-KFPLTSFHCTSPRKMLLSKMDPSCCVGFYFPDKESLTDFMETIQQFVIPNQNMDYPMF 346
Query: 396 TVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHE 437
+ K + + E G +P G SM D + E
Sbjct: 347 LFCEGSGKDLQQGIEVVE-GLLPSSSRFGHESMEDDLFECEE 387
>gi|345307034|ref|XP_001513122.2| PREDICTED: cysteine protease ATG4B-like [Ornithorhynchus anatinus]
Length = 461
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 115/364 (31%), Positives = 174/364 (47%), Gaps = 37/364 (10%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
+T +W+LG + I ++ + D +SR+ +YRK F IG + TSD
Sbjct: 91 TTEPVWILGRKYTIFTEKE-----------DILSDVTSRLWFTYRKNFPAIGGTGPTSDT 139
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM+ AQALL LGR WR + Y +L+ F D + S +SIH + Q
Sbjct: 140 GWGCMLRCGQMIFAQALLCRHLGRDWRWKKGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQ 199
Query: 186 AGKAYGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQ-SLPMA 228
G G + G W GP + + +W +LA E C+ + P
Sbjct: 200 MGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSSLAVHIAMDNTVVIEEIRRLCKPNFPAG 259
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
D + G P + + W P++LL+PL LGL ++N YI T
Sbjct: 260 ASAFPTDSEFLLNGFP---------SGAEVTNRPTQWKPLVLLIPLRLGLTEINEAYIET 310
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L+ F PQSLG++GGKP ++ Y +G IYLDPH QP + I D S +
Sbjct: 311 LKHCFMMPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQPAVEISGSCFIPDESFHCQ 370
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS 408
+++ +DPS+A+GF+C+ ++DF+D+C + KL+ P+F + + +
Sbjct: 371 HPPCRMNIVELDPSIAVGFFCKTEEDFNDWCQQVKKLSLIRGALPMFELVEHQPSHFSSP 430
Query: 409 DVLG 412
DVL
Sbjct: 431 DVLN 434
>gi|355669953|gb|AER94691.1| ATG4 autophagy related 4-like protein A [Mustela putorius furo]
Length = 408
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 175/343 (51%), Gaps = 27/343 (7%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 39 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 87
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 88 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 147
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 148 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTVG 207
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
E + +AS G+ W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 208 ESPPDTL----NASNQSKGTPAGRPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 263
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 264 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 323
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 324 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 365
>gi|355705060|gb|EHH30985.1| Cysteine protease ATG4A, partial [Macaca mulatta]
Length = 396
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 178/351 (50%), Gaps = 43/351 (12%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 190
Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
D G+R + + + S HC W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 191 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 243
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 244 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 303
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 304 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 353
>gi|387762879|ref|NP_001248420.1| cysteine protease ATG4A [Macaca mulatta]
gi|380809390|gb|AFE76570.1| cysteine protease ATG4A isoform a [Macaca mulatta]
gi|383413573|gb|AFH30000.1| cysteine protease ATG4A isoform a [Macaca mulatta]
Length = 398
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 178/351 (50%), Gaps = 43/351 (12%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192
Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
D G+R + + + S HC W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 193 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 245
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|15487240|emb|CAC69076.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
Length = 398
Score = 194 bits (492), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 110/348 (31%), Positives = 178/348 (51%), Gaps = 37/348 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192
Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
D G+R + + S+ S + W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 193 ADTAGDRPPDSLTA-SNQSKGTSAYCTA---WKPLLLIVPLRLGINQINPVYVDAFKECF 248
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + +
Sbjct: 249 KMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQR 308
Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 309 MNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|328769729|gb|EGF79772.1| hypothetical protein BATDEDRAFT_35298 [Batrachochytrium
dendrobatidis JAM81]
Length = 441
Score = 194 bits (492), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 119/312 (38%), Positives = 168/312 (53%), Gaps = 32/312 (10%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
F DF SR+ ++YRKGF I + T D GWGCMLRS QMLVA ALLFH LGR WR L
Sbjct: 137 HFLDDFHSRLWMTYRKGFAAIKPTGYTCDSGWGCMLRSGQMLVANALLFHELGRDWR--L 194
Query: 156 QKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 211
DR+ Y IL F D TSP+SI + G + G W GP + + + L
Sbjct: 195 GDSNDRDTWLTYCSILTKFLDVNTSPYSIQRIATLGIRFDKQIGEWFGPSTISQVLKVLV 254
Query: 212 RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILL 270
Q + + ++V DG + I A+R G+ TP +L+
Sbjct: 255 NDD--------QRISLKVHV---SNDGVVYKNEINTILSATR-----DDGK---TPAVLI 295
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
++PL LG+E +NP Y P ++ F +GI GG+P +S + +GV + IYLDPH ++P
Sbjct: 296 MIPLRLGVETMNPVYYPGVKHCFAMSHCVGIAGGRPNSSLFFLGVDGDHLIYLDPHHLRP 355
Query: 331 VI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
+ +I +E D +YH + +R + + S+DPSL IGFYC DFD CA+ ++LA
Sbjct: 356 SVDSRDITSYKME-DLLSYHCEKVRLLPIASMDPSLVIGFYCHSLKDFDVLCAKMTELAT 414
Query: 388 ESNGAPLFTVTQ 399
S APLF++ +
Sbjct: 415 GS--APLFSIEE 424
>gi|405953478|gb|EKC21133.1| Leucine-rich repeat-containing protein 6 [Crassostrea gigas]
Length = 1114
Score = 194 bits (492), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 182/364 (50%), Gaps = 32/364 (8%)
Query: 68 SDIWLLGVCHKIAQDEALGDAAGNN-------GLAEFNQDFSSRILISYRKGFDPIGDSK 120
S +WLLG + I + + D + +F QDFSS + +YR+ F I +K
Sbjct: 226 SPVWLLGKFYHIKPSDLIDDDIQRGKRTRVVPNIEKFKQDFSSLLWFTYRQDFPAIPGTK 285
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV--EILHLFGD--SETS 176
+TSD GWGCMLRS QM++A+AL H LG W + ++E +I+ FGD + S
Sbjct: 286 LTSDCGWGCMLRSGQMMLAKALTLHYLGPEWNVFSDQTREQETYRKQIIRWFGDYLCDES 345
Query: 177 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGD 235
PFS+H L++ GK G G W GP ++ E + + Q+ +T L + +YV
Sbjct: 346 PFSMHRLVEVGKNLGKQPGEWFGPASVAHILKETMVKGQKTQTVLS----DLCVYVSQDC 401
Query: 236 EDGERGGAPVVCI----------DDASRHCSVFSKGQADWT-PILLLVPLVLGLEKVNPR 284
++ + C S H S DW +++L+P+ LG E++NP
Sbjct: 402 TVYKQDIYELCCTRPRADTKFTNSTESEHESSQDASSMDWKRAVVILIPVRLGGEQLNPV 461
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
YIP ++ + +GI+GGKP S Y VG QE+ IYLDPH Q V++ +
Sbjct: 462 YIPCVKGLLSQDSCIGIIGGKPKHSLYFVGWQEDKLIYLDPHYCQDVVDTRERHFP--IQ 519
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA---EESNGAPLFTVTQTH 401
+YH R + +D IDPS IGFYCR++ +F+ F + ++ ++ P+F + H
Sbjct: 520 SYHCMSPRKVSIDKIDPSCTIGFYCRNQKEFEKFVQQTEEMVAPPKQRLSYPMFVFSDGH 579
Query: 402 KKPV 405
V
Sbjct: 580 SNEV 583
>gi|395854618|ref|XP_003799779.1| PREDICTED: cysteine protease ATG4A isoform 1 [Otolemur garnettii]
Length = 398
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 107/343 (31%), Positives = 175/343 (51%), Gaps = 27/343 (7%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTAG 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
E + ++ + S + W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ESPPGSLTALNQSKGT----SACRPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 313
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|449266947|gb|EMC77925.1| Cysteine protease ATG4B, partial [Columba livia]
Length = 393
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 180/373 (48%), Gaps = 39/373 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQVDNYFSVLNAFVDRKDSYYSIHQIAQMGVG 133
Query: 190 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 230
G + G W GP + + +W +LA E CQS A
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNAPCAGAAA 193
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
+ + DG G P ++A ++ W P++LL+PL LGL ++N YI TL+
Sbjct: 194 CPAVESDGLYNGCP----EEAG-----VRDRRSLWKPLVLLIPLRLGLTEINEAYIETLK 244
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEHNDSGCLPDESFHCQHP 304
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDV 410
+ + +DPS+A+GF+C ++DF+D+C + KL+ P+F + + ++ DV
Sbjct: 305 PCRMSIAELDPSIAVGFFCNTEEDFNDWCQQIKKLSLVRAALPMFELVERQPSHFSNPDV 364
Query: 411 LGETGGVPEDDSL 423
L T + D L
Sbjct: 365 LNLTPDSSDADRL 377
>gi|126338580|ref|XP_001366892.1| PREDICTED: cysteine protease ATG4B-like [Monodelphis domestica]
Length = 396
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 116/375 (30%), Positives = 177/375 (47%), Gaps = 58/375 (15%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
+T +W+LG + I +DE L D +SR+ +YRK F IG + TS
Sbjct: 25 TTDPVWILGRKYTIFTEKDEILSDV-------------TSRLWFTYRKNFPAIGGTGPTS 71
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR + Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQI 131
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + +A+++ +
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDN-------- 175
Query: 244 PVVCIDDASRHCSV-FSKGQA-------------------------DWTPILLLVPLVLG 277
V ++D R C FS A W P++LL+PL LG
Sbjct: 176 -TVVMEDIRRLCKANFSHTDAAALPPDSDLLSNGYPPGAEVTDRLSQWRPLVLLIPLRLG 234
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
L +N Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH Q + +
Sbjct: 235 LTDINEAYTETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQAAVELSNG 294
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
+ D S + +++ +DPS+A+GF+C+ ++DF+D+C + KL+ P+F +
Sbjct: 295 GVIPDESFHCQHPPCRMNIGELDPSIAVGFFCKSEEDFNDWCQQVKKLSRIPGALPMFEL 354
Query: 398 TQTHKKPVNHSDVLG 412
+ + DVL
Sbjct: 355 VEHQPSHFSCPDVLN 369
>gi|291407754|ref|XP_002720229.1| PREDICTED: autophagy-related cysteine endopeptidase 2 [Oryctolagus
cuniculus]
Length = 405
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 174/343 (50%), Gaps = 27/343 (7%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 36 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 84
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 85 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 144
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
G + G W GP A+ W +LA + + + + V+ S + G
Sbjct: 145 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSANTPG 204
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 205 ERLHDSLT----ASNQSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 260
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 261 LGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 320
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 321 LDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 362
>gi|326925776|ref|XP_003209085.1| PREDICTED: cysteine protease ATG4B-like [Meleagris gallopavo]
Length = 393
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 119/383 (31%), Positives = 177/383 (46%), Gaps = 59/383 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 190 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 224
G + G W GP + + +W +LA CQ + G +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193
Query: 225 LPMA----IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
P +Y +E G R + W P++LL+PL LGL +
Sbjct: 194 CPTVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCL 294
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
D S + + + +DPS+A+GF+C ++DF+D+C + KL+ P+F + +
Sbjct: 295 PDESFHCQHPPCRMSIAELDPSIAVGFFCHTEEDFNDWCHQIKKLSLVRGALPMFELVER 354
Query: 401 HKKPVNHSDVLGETGGVPEDDSL 423
++ DVL T + D L
Sbjct: 355 QPSHFSNPDVLNLTPDSSDADRL 377
>gi|224059752|ref|XP_002193231.1| PREDICTED: cysteine protease ATG4B [Taeniopygia guttata]
Length = 393
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/373 (32%), Positives = 178/373 (47%), Gaps = 39/373 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQMDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 190 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 230
G + G W GP + + +W +LA E CQS A
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSHVPCAGAAA 193
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
+ + D G P +D + A W P++LL+PL LGL ++N YI TL+
Sbjct: 194 CPALESDVLYNGCP----EDVG-----LRERLALWKPLVLLIPLRLGLTEINEAYIETLK 244
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG++GGKP ++ Y +G E IYLDPH QP + G D S +
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPGDSGCLPDESFHCQHP 304
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDV 410
+ + +DPS+A+GF+C + DF+D+C + KL+ P+F + + ++ DV
Sbjct: 305 PCRMSIAELDPSIAVGFFCNTEADFNDWCQQIKKLSLVRGALPMFELVERQPSHFSNPDV 364
Query: 411 LGETGGVPEDDSL 423
L T + D L
Sbjct: 365 LNLTPDSSDADRL 377
>gi|47087191|ref|NP_998738.1| cysteine protease ATG4B [Gallus gallus]
gi|61211779|sp|Q6PZ02.1|ATG4B_CHICK RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related cysteine endopeptidase 2B;
Short=Autophagin-2B; Short=cAut2B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|45861662|gb|AAS78584.1| AUT2B [Gallus gallus]
Length = 393
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 119/383 (31%), Positives = 177/383 (46%), Gaps = 59/383 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 190 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 224
G + G W GP + + +W +LA CQ + G +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193
Query: 225 LPMA----IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
P +Y +E G R + W P++LL+PL LGL +
Sbjct: 194 CPAVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCL 294
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
D S + + + +DPS+A+GF+C ++DF+D+C + KL+ P+F + +
Sbjct: 295 PDESFHCQHPPCRMSIAELDPSIAVGFFCHTEEDFNDWCHQIKKLSLVRGALPMFELVER 354
Query: 401 HKKPVNHSDVLGETGGVPEDDSL 423
++ DVL T + D L
Sbjct: 355 QPSHFSNPDVLNLTPDSSDADRL 377
>gi|391340875|ref|XP_003744760.1| PREDICTED: cysteine protease ATG4D-like [Metaseiulus occidentalis]
Length = 488
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 173/353 (49%), Gaps = 35/353 (9%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
I+LLG + + A F DFS+R+ +YR+ F P+ + TSD GWGC
Sbjct: 129 IYLLGHVYHNKNNSA--------SFKNFFADFSTRLWFTYRQDFQPMQSTGHTSDSGWGC 180
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV--EILHLFG---DSETSPFSIHNLL 184
MLRS+QM++A+A +FH LGR WR Q+ V +I+ F D+ +PFS+HN++
Sbjct: 181 MLRSAQMMLAEAFIFHLLGRQWRWCPQQQQQEHGVHRKIIKWFSDDPDTTEAPFSVHNMV 240
Query: 185 QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS---LPMAIYVVSGDEDGERG 241
+A G AG W GP L RC G+ MAIYV
Sbjct: 241 RAAAHCGKKAGDWFGPSTAAY---LLKRCLEEAAGVADSKEIFEQMAIYVAQD------- 290
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
+ D C+ S +W ++LL+P+ LG E+VN YI ++ + LGI
Sbjct: 291 --CTIYTQDVLDLCT--SDPNIEWKSVVLLIPVRLGGERVNVNYIHCIKEILAYQNCLGI 346
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y VG Q + +YLDPH +Q + + L +++H R + +DP
Sbjct: 347 IGGKPRHSLYFVGFQGKKLVYLDPHYLQKTTDTSR--LNFSVNSFHCTTARKVSFSKLDP 404
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAE---ESNGAPLFTVTQTHKKPVNHSDVL 411
S IGFYC+ + DF+ F + + E ++ G P+F +++ VN + L
Sbjct: 405 SATIGFYCKTRRDFESFQSIMQSVTESCPQNQGYPVFIISEGSSALVNQLNPL 457
>gi|296470926|tpg|DAA13041.1| TPA: cysteine protease ATG4A [Bos taurus]
Length = 396
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 111/344 (32%), Positives = 175/344 (50%), Gaps = 29/344 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 238
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 357
LG +GGKP + Y +G + I+LDPH Q ++ +++ AD T+H + +++
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFHCLQPPQRMNIL 312
Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 313 NLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|349605276|gb|AEQ00569.1| Cysteine protease ATG4A-like protein, partial [Equus caballus]
Length = 369
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 113/345 (32%), Positives = 182/345 (52%), Gaps = 33/345 (9%)
Query: 71 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGCM
Sbjct: 1 WILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 49
Query: 131 LRSSQMLVAQALLFHRLGR--PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
LR QM++AQAL+ LGR W K ++P +EY IL F D + +SIH + Q G
Sbjct: 50 LRCGQMMLAQALICRHLGRDLNWEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGV 107
Query: 189 AYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDED 237
G + G W GP A+ W +LA + + + + I +S D
Sbjct: 108 GEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTA 167
Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
GE +P ++ ++R S S G W P+LL+VPL LG+ ++NP Y+ + F PQ
Sbjct: 168 GE---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQ 223
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHL 356
SLG +GGKP + Y +G + I+LDPH Q ++ +++ D T+H + +++
Sbjct: 224 SLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSPQRMNI 282
Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 283 LNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 326
>gi|354500801|ref|XP_003512485.1| PREDICTED: cysteine protease ATG4A-like [Cricetulus griseus]
gi|344251116|gb|EGW07220.1| Cysteine protease ATG4A [Cricetulus griseus]
Length = 398
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 175/356 (49%), Gaps = 53/356 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLRTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKG--QAD----------------------WTPILLLVPLVLGLEKVNPRY 285
D + C V G AD W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCCVLPVGAHTADESPPDSLPASSQGKGPSATCPAWKPLLLIVPLRLGINQINPVY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G + I+LDPH Q ++ + + D +
Sbjct: 241 IEAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEESGIVDDETF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + + + ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 301 HCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|345807894|ref|XP_538136.3| PREDICTED: cysteine protease ATG4A [Canis lupus familiaris]
Length = 398
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 179/343 (52%), Gaps = 27/343 (7%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D +R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDIRARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK REY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPREYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAIYVSMDNTVVIEDIKKMCCVLPLSADTIG 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
E +P+ ++ +++ S + A W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 E---SPLNTLNASNQSKSAPASCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 313
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|47564112|ref|NP_001001171.1| cysteine protease ATG4A [Bos taurus]
gi|61211781|sp|Q6PZ05.1|ATG4A_BOVIN RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related cysteine endopeptidase 2A;
Short=Autophagin-2A; AltName: Full=Autophagy-related
protein 4 homolog A; AltName: Full=bAut2A
gi|45861656|gb|AAS78581.1| Aut2a [Bos taurus]
Length = 398
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 111/344 (32%), Positives = 175/344 (50%), Gaps = 29/344 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 238
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 357
LG +GGKP + Y +G + I+LDPH Q ++ +++ AD T+H + +++
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFHCLQPPQRMNIL 312
Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 313 NLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|351713264|gb|EHB16183.1| Cysteine protease ATG4B [Heterocephalus glaber]
Length = 475
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 120/368 (32%), Positives = 180/368 (48%), Gaps = 40/368 (10%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 100 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 146
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 147 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQI 206
Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
Q G G + G W GP A+ +W +LA + + I +
Sbjct: 207 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLA----VHVAMDNTVVMEEIRRLCR 262
Query: 235 DEDGERGGAPVVCIDDASRHCSVF----------SKGQADWTPILLLVPLVLGLEKVNPR 284
G A + DA RHC+ F S + W P++LL+PL LGL +N
Sbjct: 263 SSLPCSGAAALPA--DADRHCNGFPAPMEVTSRPSPSPSPWRPLVLLIPLRLGLTDINEA 320
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
Y+ TL+ F PQSLG++GGKP ++ Y +G + IYLDPH QP + + D +
Sbjct: 321 YVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGKELIYLDPHTTQPAVELTDGCFIPDET 380
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
+ + + +DPS+A+GF+C+ +DDF D+C + KL+ + P+F + +
Sbjct: 381 FHCQHPPCRMGIGELDPSIAVGFFCKTEDDFRDWCQQVRKLSLQGGALPMFELVEQQPSH 440
Query: 405 VNHSDVLG 412
+ DVL
Sbjct: 441 LACPDVLN 448
>gi|440790872|gb|ELR12135.1| autophagy protein 4, putative [Acanthamoeba castellanii str. Neff]
Length = 510
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 115/346 (33%), Positives = 169/346 (48%), Gaps = 62/346 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
F DF SR+ ++YR F IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR +
Sbjct: 115 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 174
Query: 157 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR- 212
+ Y E+L F D S SP+SIH + + G + + G W P + + L
Sbjct: 175 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLLVTE 233
Query: 213 ---------------CQRAETGLGC---------QSLPMAIYVV---------------S 233
R E C Q P+ + S
Sbjct: 234 HSPNGLKMYVPKDGIIYRKEVYQLCAVQPADGPAQHSPLRVDDDGGDTDHDGDTDGLESS 293
Query: 234 GDEDGERGGAP-----VVCIDDASRHCSVFSKGQAD------------WTPILLLVPLVL 276
D G P + D +S H + S +++ W P+++LVP+ L
Sbjct: 294 TDSMRHSHGNPGVPSTIEAGDYSSSHAELMSSAESECESLDDNFTELTWHPVIILVPVRL 353
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G++ +NP YIPTL+ F+FPQ LG++GGKP +S Y VG Q+ +Y+DPH VQP + +
Sbjct: 354 GIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMDPHFVQPTVKMDD 413
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
D L +Y ++ + + D IDPSLA+GF C + +FDDFC A
Sbjct: 414 DPL-FPIESYRMEIPQAMSFDDIDPSLALGFLCSSQAEFDDFCLNA 458
>gi|332375955|gb|AEE63118.1| unknown [Dendroctonus ponderosae]
Length = 370
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 123/347 (35%), Positives = 168/347 (48%), Gaps = 44/347 (12%)
Query: 63 ISSSTSDIWLLGV-CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK- 120
I ST +WLLG H I N L QD S++ +YRK F PIG S
Sbjct: 26 IPQSTEPVWLLGKKYHAI------------NELNTIRQDIVSKLWFTYRKDFVPIGGSDG 73
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFS 179
TSD GWGCMLR QM++ QAL+ LGR W+ P + D Y+ IL F DS +PFS
Sbjct: 74 KTSDKGWGCMLRCGQMVLGQALMSIHLGRDWQWNPTTR--DATYLSILKKFEDSRKAPFS 131
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
IH + G + G G W GP + + + L + +AI+V +
Sbjct: 132 IHQIASMGISEGKEVGQWFGPNTVAQVLKKLVKFDEGND--------VAIHVALDN---- 179
Query: 240 RGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
VV I + C SK AD W P+LL+VPL LGL ++N Y+ L+ F
Sbjct: 180 -----VVIISEIRDLC--LSKETADVSTPHWKPLLLIVPLRLGLTQMNSIYLGGLKQCFQ 232
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVI 351
F QSLGI+GGKP ++ Y +G IY DPH Q ++G D + +YH
Sbjct: 233 FKQSLGIIGGKPNSALYFIGYVGNEVIYFDPHTTQKAGSVGNKDTSEEKDVDLSYHCKHA 292
Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 398
+ + +DPS+A+ F CR + DF+D C ++ PLF V+
Sbjct: 293 SRMSMLGMDPSVAVCFLCRSEADFNDLCQNIKDQLIKTESQPLFEVS 339
>gi|91083193|ref|XP_972923.1| PREDICTED: similar to Autophagy-specific protein, putative
[Tribolium castaneum]
gi|270006970|gb|EFA03418.1| hypothetical protein TcasGA2_TC013405 [Tribolium castaneum]
Length = 366
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 117/321 (36%), Positives = 166/321 (51%), Gaps = 26/321 (8%)
Query: 92 NGLAEFN---QDFSSRILISYRKGFDPIG-DSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
N L E + QD S+I +YRK F PIG D +T+D GWGCMLR QM++AQAL+ L
Sbjct: 33 NALQELDTIRQDILSKIWFTYRKNFVPIGGDEGLTTDKGWGCMLRCGQMVLAQALVTLHL 92
Query: 148 GRPW-RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
GR W +P K D Y++IL F D +PFSIH + G + G W GP + +
Sbjct: 93 GRDWVWEPETK--DSTYLKILSKFVDKRQAPFSIHQIAMMGVSENKEVGQWFGPNTVAQV 150
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
+ L + +L + + E +C+ S CS DW
Sbjct: 151 LKKLVKYDEWSAIEMHIALDNTLIISDIRE---------LCLSQGSDGCS-----SGDWK 196
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+LL+VPL LGL+++NP Y L+ F F QSLG++GGKP + Y +G + IYLDPH
Sbjct: 197 PLLLIVPLRLGLQEINPIYASGLKKCFQFKQSLGVIGGKPNLALYFIGHVGDEVIYLDPH 256
Query: 327 DVQP---VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 383
Q V + ++ STYH I++ S+DPS+A+ F+C + +F+D C
Sbjct: 257 TTQKSGSVESKETEEEIELDSTYHCKYASRINILSMDPSVAVCFFCNTEGEFNDLCHSIK 316
Query: 384 KLAEESNGAPLFTVTQTHKKP 404
K E PLF + T++KP
Sbjct: 317 KDLIEPEKQPLFEI--TYEKP 335
>gi|281342750|gb|EFB18334.1| hypothetical protein PANDA_015152 [Ailuropoda melanoleuca]
Length = 373
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 174/356 (48%), Gaps = 53/356 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178
Query: 250 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 285
D + C V SKG W P+LL+VPL LG+ ++NP Y
Sbjct: 179 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 238
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D +
Sbjct: 239 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTF 298
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + + + ++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 299 HCLQSPQRMSILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 353
>gi|301780424|ref|XP_002925628.1| PREDICTED: cysteine protease ATG4A-like [Ailuropoda melanoleuca]
Length = 429
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 174/356 (48%), Gaps = 53/356 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 60 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 108
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 109 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 168
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 169 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 211
Query: 250 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 285
D + C V SKG W P+LL+VPL LG+ ++NP Y
Sbjct: 212 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 271
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D +
Sbjct: 272 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTF 331
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + + + ++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 332 HCLQSPQRMSILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 386
>gi|151554833|gb|AAI47963.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Bos taurus]
Length = 398
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 108/343 (31%), Positives = 172/343 (50%), Gaps = 27/343 (7%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 238
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILN 313
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|426257739|ref|XP_004022480.1| PREDICTED: cysteine protease ATG4A [Ovis aries]
Length = 398
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 176/356 (49%), Gaps = 53/356 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHC--------------------SVFSKGQAD----WTPILLLVPLVLGLEKVNPRY 285
D + C S SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRTLSLSADTPAERPLESLTASTQSKGPSACCTAWKPLLLIVPLRLGINQINPVY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D +
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + +++ ++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 301 HCLQPPQRMNILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|187282046|ref|NP_001119770.1| uncharacterized protein LOC678769 [Rattus norvegicus]
gi|169642267|gb|AAI60890.1| LOC678769 protein [Rattus norvegicus]
Length = 406
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 175/364 (48%), Gaps = 61/364 (16%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 284
D + C V G AD W P+LL+VPL LG+ ++NP
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
YI + F PQSLG +GGKP + Y +G + I+LDPH Q ++ + L D +
Sbjct: 241 YIEAFKECFKMPQSLGALGGKPNNAYYFIGSLGDELIFLDPHTTQTFVDTEESGLVDDHT 300
Query: 345 TYHSDVIRHIHLDSIDPSLAI-------GFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
+ + + + ++DPS+A+ GF+C+++ DFD++C+ K + N +F +
Sbjct: 301 FHCLQSPQRMSILNLDPSVALVGQGAFMGFFCKEEKDFDNWCSLVQKEILKEN-LRMFEL 359
Query: 398 TQTH 401
Q H
Sbjct: 360 VQKH 363
>gi|225709006|gb|ACO10349.1| Cysteine protease ATG4B [Caligus rogercresseyi]
Length = 381
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 111/345 (32%), Positives = 176/345 (51%), Gaps = 44/345 (12%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
S S +W+LG + + + + E N + SR L +YRK F I DS TSD
Sbjct: 28 SDSPVWILG-----------NELSARDDVEELNSEVLSRFLFTYRKEFLEIEDSGYTSDS 76
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD----REYVEILHLFGDSETSPFSIH 181
GWGCMLR QM++A+AL LGR W+ Q+ D ++Y++IL LF DS+ +P+S+H
Sbjct: 77 GWGCMLRCGQMVLAEALQRVSLGREWKWSSQETLDNDQSQKYLQILKLFQDSKAAPYSLH 136
Query: 182 NLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
+ G++ G+W GP + + L + +ET + P+ ++V +
Sbjct: 137 QIALMGESIQSKKPVGTWFGPNTIA---QVLRKLSVSET-----TNPIRVHVAMDN---- 184
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
V +D+ C F + P+LL +PL LGL ++NP Y L+ F FPQ L
Sbjct: 185 -----TVIVDEIKESCG-FIGDPSQGKPLLLFIPLRLGLTEINPIYFQDLKECFEFPQIL 238
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPH-----DVQPVINIGKDDLEADTSTYHSDVIRHI 354
G++GG+P + Y +G + IYLDPH V+ +G ++ TYH+D +
Sbjct: 239 GVIGGRPNHALYFIGYMDNELIYLDPHVATQTSTPQVVTLGG----SEDKTYHTDRAYRM 294
Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
+DPSL++ F C+D+ +F+D C R + +PLF + +
Sbjct: 295 DFKDLDPSLSLCFLCKDESEFEDMCERFLFKLIRGHNSPLFEICR 339
>gi|345329187|ref|XP_003431344.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4A-like
[Ornithorhynchus anatinus]
Length = 436
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 172/359 (47%), Gaps = 54/359 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 68 VWILGRQHHLKAEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 116
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W K EY +IL F D + +SIH + Q G
Sbjct: 117 MLRCGQMMLAQALICRHLGRDWCWEKHKKQPEEYHKILQCFLDRKDCCYSIHQMAQMGVG 176
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 177 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 219
Query: 250 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 285
D + C + +G A W P+LL+VPL LG+ +NP Y
Sbjct: 220 DIKKMCRLLPQGSGMAQDGPPLHLSALGRSKNASGYCAIWKPLLLIVPLRLGINHINPIY 279
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 280 IDAFKECFKTPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQTFVDTEENGQVDDHSF 339
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
+ + + + ++DPS+A+GF+C+++ DFD++C+ K +F + Q K+P
Sbjct: 340 HCQQAPQRMKIMNLDPSVALGFFCKEEKDFDNWCSLVQKEILRQQSLRMFELVQ--KRP 396
>gi|198438023|ref|XP_002129793.1| PREDICTED: similar to CG6194 CG6194-PA [Ciona intestinalis]
Length = 517
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 113/346 (32%), Positives = 169/346 (48%), Gaps = 39/346 (11%)
Query: 68 SDIWLLGVCHKIAQDEALGDAAGN----------------NGLAEFNQDFSSRILISYRK 111
S +WLLG C+ + + D + N L F DF S++ +YRK
Sbjct: 67 SPLWLLGKCYHLKKPSLSSDTSENAEGSQQSTSESYNMLPKHLKLFLVDFHSKLWFTYRK 126
Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYV--EILH 168
GF + D+ +TSD GWGCMLR++QM++AQ+ + H LGR WR P + ++ + I+
Sbjct: 127 GFPTLNDTNLTSDTGWGCMLRTAQMMIAQSFIVHLLGRNWRWTPSRLSMEQSDIHRNIIT 186
Query: 169 LFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
F D + PFS+H L + G +Y G+W GP + C + +T L L
Sbjct: 187 WFLDEQNIRCPFSLHQLTEIGLSYRCKPGNWYGPNTAAYIMQDALECAKGKTEL----LN 242
Query: 227 MAIYVVSGDEDGERGGAPVVC-----IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ ++ D +C DA S S ++ +++L+P+ LG +
Sbjct: 243 NIMVYIAQDSTVYIDDVIEMCEWKNTASDADLKTSTTSSNRS----VIVLIPVRLGEATL 298
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG--KDDL 339
NP YIP ++ T QS+GI+GGKP S Y +G Q+E YLDPH Q + K+DL
Sbjct: 299 NPIYIPCIQSMLTLDQSVGIMGGKPKHSLYFIGFQDEYLFYLDPHYCQQADHPAAFKNDL 358
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
YH + R ++ +DPS +GFYCRD DF F A+K
Sbjct: 359 ---LQNYHCNSPRKTNISKMDPSCCLGFYCRDYKDFQSFVCEANKF 401
>gi|195995623|ref|XP_002107680.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
gi|190588456|gb|EDV28478.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
Length = 385
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 117/352 (33%), Positives = 167/352 (47%), Gaps = 54/352 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVG 126
+WLLG C+ N L EF++ D +S+ +YRK + PIG TSD G
Sbjct: 25 VWLLGCCY--------------NPLEEFDKLIADINSKFWFTYRKNYPPIGGIGPTSDKG 70
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCMLR QM++ QAL+ LGR WR K Y +IL LF DS+ S +SIH + Q
Sbjct: 71 WGCMLRCGQMILGQALVMRHLGRDWRWFKNKEQLANYWKILKLFLDSKDSLYSIHQIAQM 130
Query: 187 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
G + G W GP + + L M +YV + +V
Sbjct: 131 GVSEGKKISQWFGPNTAAQVLKKLIMFDEWSQ--------MGVYVAMDN---------IV 173
Query: 247 CIDDASR----HCSVFSKGQA--------------DWTPILLLVPLVLGLEKVNPRYIPT 288
IDD + H + S+G A W P+LL +PL LGL +NP Y
Sbjct: 174 VIDDIKKICHNHITRTSQGNAANSDAQGSSNEQSNAWKPLLLFIPLRLGLTDLNPIYKDK 233
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L F +LGI+GGKP ++ Y +G+Q + +YLDPH VQ + + K + TYH
Sbjct: 234 LNKCFRIKNTLGIIGGKPNSAHYFIGIQGDYLLYLDPHTVQETVKV-KPNCPFSDKTYHQ 292
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA-EESNGAPLFTVTQ 399
+H +DPS+A+GFY +++F++ C + + S PLF V +
Sbjct: 293 KGTNRLHFSYMDPSVALGFYSATEEEFNELCRDFTDVCILNSAQPPLFEVVE 344
>gi|357620505|gb|EHJ72670.1| putative Autophagy-specific protein [Danaus plexippus]
Length = 383
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 116/350 (33%), Positives = 177/350 (50%), Gaps = 34/350 (9%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + ++W+LG + QD L +D +S I +YRKGF PIGD +T
Sbjct: 22 IPETKDNVWVLGKKYSAIQD-----------LERIRRDITSVIWCTYRKGFVPIGDEGLT 70
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 180
SD GWGCMLR QM++ AL+ L W + P R+ Y++I+ + + +P+SI
Sbjct: 71 SDKGWGCMLRCGQMVLGVALIKVHLSADW---VWTPETRDPTYLKIVQRLEERKQAPYSI 127
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + G G G W GP + + + L + + + I+V + +
Sbjct: 128 HQVALMGACEGKEVGQWFGPNTVAQVLKKLVVYDKWSS--------LVIHVALDNTVVKE 179
Query: 241 GGAPVVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+++ CS G +DW P+LL+VPL LGL ++NP Y+ L++ F PQS
Sbjct: 180 DILQQCIVNNDRGDCSENVDGFVVSDWMPLLLIVPLRLGLSEINPIYMEGLKICFQSPQS 239
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQP---VINIGKDDLEADTSTYHSDVIRHIH 355
+G++GGKP + Y++G + IYLDPH Q V N D+ + TYH I
Sbjct: 240 IGVIGGKPNQALYLIGCVGDEVIYLDPHTTQKSGLVENKLTDEQKEMDCTYHCKYASRIP 299
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQTHKKP 404
+ S+DPS+A+ F CR + DFD+ C K L +ES PLF + + K+P
Sbjct: 300 ILSMDPSVAVCFLCRTRSDFDELCELIEKRLMQESQ--PLFEICE--KRP 345
>gi|426339171|ref|XP_004033533.1| PREDICTED: cysteine protease ATG4B isoform 3 [Gorilla gorilla
gorilla]
Length = 379
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 109/323 (33%), Positives = 163/323 (50%), Gaps = 25/323 (7%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 226 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 268
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
QP + D S + + + +DPS+A+GF+C+ +DDF+D+C + KL+
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLL 328
Query: 389 SNGAPLFTVTQTHKKPVNHSDVL 411
P+F + + + DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351
>gi|325184648|emb|CCA19140.1| cysteine protease family C54 putative [Albugo laibachii Nc14]
Length = 459
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 125/370 (33%), Positives = 185/370 (50%), Gaps = 48/370 (12%)
Query: 64 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
S ++S +WLLG C+ QD D+ + ++ F S + +YR+ F+ + TS
Sbjct: 68 SQNSSKLWLLGDCYS-PQDFDNFDSMKD----AYHDAFESILWYTYRRDFETMVPYDFTS 122
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----LQKPFDREYVEILHLFGDSETS-- 176
D GWGCMLRS+QML+++A + LG W+ P L+ P + YV++L F DS +
Sbjct: 123 DAGWGCMLRSAQMLLSEAFKRNMLGIKWKIPARSEDLELP--KVYVKLLKWFVDSFDTEC 180
Query: 177 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
+SIHN+ + G Y G W GP A+ R L Q P V+ +
Sbjct: 181 KYSIHNITRIGMQYDKLPGEWYGP-------TTAAQALRDLVNLHAQESPECNLVMYVPQ 233
Query: 237 DGERGGAPV--VCI---DDASRHCSVFSKGQADWT---------------------PILL 270
DG V +CI D + +V + Q+D T +L+
Sbjct: 234 DGVVYTKDVNELCISHLDQENTFVNVNEETQSDGTFPDPLLHPPTDRDNSEKMWQKSLLI 293
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
L+PL LGL+ +NPRY+P ++ F FPQ++GI+GGK G S Y VG + LDPHD+ P
Sbjct: 294 LIPLRLGLDSINPRYLPAIQRVFEFPQNVGIIGGKKGHSVYFVGTFDSKLQLLDPHDIHP 353
Query: 331 VINIGKDDLEAD-TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES 389
++ A T HS + + L SIDPSLA+GFYC D+ D+ DF R ++ E
Sbjct: 354 TADLNTAFPTATHLRTVHSRLPLEMSLGSIDPSLALGFYCSDRKDYLDFVDRVDRVQSEL 413
Query: 390 NGAPLFTVTQ 399
GA F++ +
Sbjct: 414 GGALPFSIAK 423
>gi|397483837|ref|XP_003813097.1| PREDICTED: cysteine protease ATG4B isoform 4 [Pan paniscus]
Length = 379
Score = 187 bits (476), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 109/323 (33%), Positives = 163/323 (50%), Gaps = 25/323 (7%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 226 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 268
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPL 208
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
QP + D S + + + +DPS+A+GF+C+ +DDF+D+C + KL+
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLL 328
Query: 389 SNGAPLFTVTQTHKKPVNHSDVL 411
P+F + + + DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351
>gi|339249735|ref|XP_003373855.1| cysteine protease ATG4B [Trichinella spiralis]
gi|316969943|gb|EFV53966.1| cysteine protease ATG4B [Trichinella spiralis]
Length = 410
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 123/381 (32%), Positives = 179/381 (46%), Gaps = 64/381 (16%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
+ S ++W++G ++ Q + D ++ SR+ +YRK F PIG +
Sbjct: 28 LFKSGGEVWIVG---RVWQTQDFDD---------IKKEIRSRMWFTYRKSFSPIGGTGPI 75
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIH 181
SD GWGCMLR QML+AQAL+ LGR W+ P + D YV IL +F D + +SIH
Sbjct: 76 SDSGWGCMLRCGQMLLAQALICRHLGREWQWSPSCR--DEAYVRILRMFQDKKNELYSIH 133
Query: 182 NLLQAGKAYGLAAGSWVGP---------YAMCRSWEALA----------------RCQR- 215
+ + G++ G G W GP A+ W +LA C R
Sbjct: 134 MIAKMGESEGKEIGKWFGPSTIAHVIKKLAIYDDWSSLAVHVAMDNVIVQEDVKKLCSRE 193
Query: 216 ---AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
A Q P I V ED + V C + +S W P+LL++
Sbjct: 194 VFDALRKRLLQEEPSEI-VADWFEDARKDNKKVDCANLSS-----------PWKPLLLIL 241
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
P+ LGL ++NP YIP L+ F ++G++GGKP + Y +G ++ +YLDPH Q +
Sbjct: 242 PMRLGLSELNPCYIPALKEFFACKYNIGMIGGKPNHALYFIGAYKDRLVYLDPHWCQTFV 301
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN-- 390
++ D S+YHS I I + IDPSLAI FY + +FDDFC A ++ N
Sbjct: 302 DLDVSMDLFDDSSYHSAFILDISFNEIDPSLAIAFYINTEAEFDDFCTFAKQVCLVGNFR 361
Query: 391 ------GAPLFTVTQTHKKPV 405
LF V Q + P+
Sbjct: 362 CFSSGSMVQLFQVLQKYPNPL 382
>gi|340369400|ref|XP_003383236.1| PREDICTED: cysteine protease ATG4A-like [Amphimedon queenslandica]
Length = 394
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 159/312 (50%), Gaps = 40/312 (12%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
++LLGV + + +D A F +D SR +YRK F PIGD+ TSD GWGC
Sbjct: 45 VYLLGVKYDLPRDGA-----------SFVEDLQSRFWFTYRKNFRPIGDTGYTSDSGWGC 93
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
LR QML+ LL LGR WR D +Y +IL +F D S +SI + G
Sbjct: 94 TLRCGQMLLGHTLLLRHLGRDWRWSPSSSNDYKYQKILRMFLDYRDSEYSIQMIALQGAD 153
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
+G + G W GP + ++ + LA + Q +A+YV +V ID
Sbjct: 154 FGRSVGQWFGPNNVAQAIKRLA--------VHDQWSEVAVYVAMD---------MLVVID 196
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D S ++ P+L+ +PL LG E+ N Y ++ F QS+GI+GGKP +
Sbjct: 197 DIS-----------NFRPVLVFIPLRLGQERFNMEYKEAVKACFAVRQSVGIIGGKPRHA 245
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+ G ++ IYLDPH Q + + + +D STYH+ I +H+ +DPSLA+GF+C
Sbjct: 246 LWFTGYHDDYLIYLDPHKTQSCVTLPDAGIVSD-STYHTTQIERLHISELDPSLALGFFC 304
Query: 370 RDKDDFDDFCAR 381
+ + D DD C +
Sbjct: 305 QTEADLDDLCDK 316
>gi|297669945|ref|XP_002813144.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pongo abelii]
Length = 378
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 109/323 (33%), Positives = 162/323 (50%), Gaps = 25/323 (7%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 36 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 95
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 96 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 148
Query: 226 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 268
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 149 -LAVHIAMDNTVVMEEIRRLCRNSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 207
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 208 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 267
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
QP + D S + + + +DPS+A+GF+C+ +DDF D+C + KL+
Sbjct: 268 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLL 327
Query: 389 SNGAPLFTVTQTHKKPVNHSDVL 411
P+F + + + DVL
Sbjct: 328 GGALPMFELVEQQPSHLACPDVL 350
>gi|241999098|ref|XP_002434192.1| cystein protease, putative [Ixodes scapularis]
gi|215495951|gb|EEC05592.1| cystein protease, putative [Ixodes scapularis]
Length = 382
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 164/326 (50%), Gaps = 41/326 (12%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
L + D +S+I ++YRK F IG + TSD GWGCMLR QM++AQAL+ LGR WR
Sbjct: 35 LDDLRSDVTSKIWLTYRKNFPAIGGTGPTSDSGWGCMLRCGQMVLAQALMRRHLGREWRW 94
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
+P K +++Y+ IL +F D + FSIH + Q G + G G W GP + LA
Sbjct: 95 EPGTK--NKDYLYILRMFQDKKNCTFSIHQIAQMGVSEGKTVGEWFGPNTVAHVLRKLAI 152
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR-HCSVFS------------ 259
+ + +AI+V + V I++ S+ C +++
Sbjct: 153 FDKWSS--------LAIHVAMDN---------TVIINEISKFRCHIWAAADGLVRNRTNS 195
Query: 260 ------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
+ W P+LL +PL LGL ++N Y L+ TF QSLG++GGKP + Y +
Sbjct: 196 EPSRPANSEGSWKPLLLFIPLRLGLSEINRIYAFGLKRTFALKQSLGMIGGKPNHALYFI 255
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
GV E+ I+LDPH Q ++ D D +YH +++ +DPS+A+ FY +
Sbjct: 256 GVVEDELIFLDPHTTQLACDLDVD--SPDDQSYHCAHASRMNISELDPSVALCFYMATES 313
Query: 374 DFDDFCARASKLAEESNGAPLFTVTQ 399
DFD +C K PLF +TQ
Sbjct: 314 DFDVWCNLVQKHLISRMQQPLFEITQ 339
>gi|213626921|gb|AAI70397.1| APG4A protein [Xenopus laevis]
Length = 395
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 160/321 (49%), Gaps = 23/321 (7%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 43 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALICQHLGRDWQWE 102
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 262
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213
Query: 263 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
IYLDPH Q ++ + D + + + + +DPS+A+GF+C+D+++F+++C
Sbjct: 274 IYLDPHTTQTFVDTEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 333
Query: 381 RASKLAEESNGAPLFTVTQTH 401
K + +F + H
Sbjct: 334 VIEKEILKHQSLRMFELIPKH 354
>gi|383860522|ref|XP_003705738.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like
[Megachile rotundata]
Length = 518
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 119/387 (30%), Positives = 178/387 (45%), Gaps = 58/387 (14%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
S S +WLLG ++ +E L A+ + + EF +DF+SR
Sbjct: 126 SKESPVWLLGKIYRKKPEEFLEKASEAEKTLDTGSEISLAMDAISFEDSIEEFKKDFTSR 185
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P E
Sbjct: 186 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWQPDQPIKTEQQ 245
Query: 165 E--------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
+ I+ FGD SPFSIH L+ G +G AG W GP ++A
Sbjct: 246 KLDESNHRFIIQSFGDLPERISPFSIHTLVSLGALWGKRAGDWYGP-------SSVAHLL 298
Query: 215 RAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
+ LP +A+YV V + D C + W ++L
Sbjct: 299 SQAVEHAAEHLPIFSNLAVYVAQD---------CAVYLQDVESVCQM---PDGKWKSLIL 346
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
VPL LG +K+NP Y L T +G++GG+P S Y +G QE+ I LDPH Q
Sbjct: 347 FVPLRLGTDKLNPVYTSCLTHLLTLDTCIGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE 406
Query: 331 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL---AE 387
+++ KD+ +++H R + + +DPS +GFY DK+ F +F A +
Sbjct: 407 TVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHDKNQFTNFMEIAPSYLVPED 464
Query: 388 ESNGAPLFTVTQTHKKPVNHSDVLGET 414
E P+F + K ++ + ET
Sbjct: 465 EKVDYPMFLFCEGSGKDLHQQIEIAET 491
>gi|348666332|gb|EGZ06159.1| hypothetical protein PHYSODRAFT_532364 [Phytophthora sojae]
Length = 398
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 115/309 (37%), Positives = 161/309 (52%), Gaps = 31/309 (10%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 154
+ + F + + +YR+ F + TSD GWGCMLRS+QML+ QAL LGR WR P
Sbjct: 41 YKRSFEAILWFTYRRDFPQMTPYDFTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPAL 100
Query: 155 ----LQKPFDREYVEILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
+ +YV +L F DS +SIH++++ G Y G W GP +
Sbjct: 101 FEAEIDARLPDKYVTLLRWFADSPDIECRYSIHHMVKLGMQYDKLPGEWYGPTTAAQVLR 160
Query: 209 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV-------FSKG 261
L R E G +A+YV ++G VV DD +R C ++
Sbjct: 161 DLVNLHRREFGG-----ELAMYV---PQEG------VVYTDDVTRLCFFDPLLHPPTAED 206
Query: 262 QADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
+DW T +L+L+PL LGL++VN RY+P L TF FPQS+GI+GGK G S Y VG Q++
Sbjct: 207 SSDWSTALLILIPLRLGLDQVNERYVPALEKTFAFPQSVGIIGGKKGHSVYFVGTQQDQL 266
Query: 321 IYLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
LDPHDV P + A T HS +++ IDPSLA+GF C ++ D++DF
Sbjct: 267 HLLDPHDVHPAPELNPAFPTATHLRTVHSSRPLVMNVTGIDPSLALGFLCDNRADYEDFE 326
Query: 380 ARASKLAEE 388
R L +E
Sbjct: 327 RRVRILHDE 335
>gi|50417810|gb|AAH78135.1| APG4A protein, partial [Xenopus laevis]
Length = 392
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 159/321 (49%), Gaps = 23/321 (7%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 40 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 99
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 100 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 159
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 262
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 160 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 210
Query: 263 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 211 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 270
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
IYLDPH Q + + D + + + + +DPS+A+GF+C+D+++F+++C
Sbjct: 271 IYLDPHTTQTFVETEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 330
Query: 381 RASKLAEESNGAPLFTVTQTH 401
K + +F + H
Sbjct: 331 VIEKEILKHQSLRMFELIPKH 351
>gi|163914473|ref|NP_001106295.1| APG4A protein [Xenopus laevis]
gi|161611704|gb|AAI55873.1| APG4A protein [Xenopus laevis]
Length = 395
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 159/321 (49%), Gaps = 23/321 (7%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 43 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 102
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 262
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213
Query: 263 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
IYLDPH Q + + D + + + + +DPS+A+GF+C+D+++F+++C
Sbjct: 274 IYLDPHTTQTFVETEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 333
Query: 381 RASKLAEESNGAPLFTVTQTH 401
K + +F + H
Sbjct: 334 VIEKEILKHQSLRMFELIPKH 354
>gi|194389756|dbj|BAG60394.1| unnamed protein product [Homo sapiens]
Length = 379
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 108/323 (33%), Positives = 162/323 (50%), Gaps = 25/323 (7%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 226 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 268
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
QP + D S + + + +DPS+A+G +C+ +DDF+D+C + KL+
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGSFCKTEDDFNDWCQQVKKLSLL 328
Query: 389 SNGAPLFTVTQTHKKPVNHSDVL 411
P+F + + + DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351
>gi|426218487|ref|XP_004003478.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4B [Ovis
aries]
Length = 454
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 117/358 (32%), Positives = 168/358 (46%), Gaps = 42/358 (11%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 69 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 115
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y + G E
Sbjct: 116 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYCRVPPQMGVGE--------- 166
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
G + G W GP + + + LA A + L V++ R G
Sbjct: 167 -------GKSIGQWYGPNTVAQVLKKLAVFD-AWSALAVHVAMDNTVVMADVRRLCRSGL 218
Query: 244 PVVCID----DASRHCSVFSKG------QADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P + D+ RHC+ F G A W P++LL+PL LGL VN Y TL+ F
Sbjct: 219 PCAGAEAFPADSERHCNGFPAGAEGGECTAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 278
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 279 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 338
Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 339 MSITELDPSIAVGFFCKTEDDFNDWCQQVRKLSLLGGALPMFELVEQQPSHLACPDVL 396
>gi|307205961|gb|EFN84087.1| Cysteine protease ATG4D [Harpegnathos saltator]
Length = 456
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 116/377 (30%), Positives = 177/377 (46%), Gaps = 51/377 (13%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
S S +WLLG C+ ++ L +A+ N + EF +DF+SR
Sbjct: 62 SKESPVWLLGQCYLKKSEDPLENASEALEPEGTGSQVSLAMDATNFENTIEEFKRDFASR 121
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------KPLQ 156
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR W+ Q
Sbjct: 122 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWKWRPEQSIENTQQ 181
Query: 157 KPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
D + I+ F D SPFSIH L+ G + G AG W GP ++ L++
Sbjct: 182 MRDDSNHRMIIKWFADQSKPESPFSIHRLVSLGASTGKRAGDWYGPNSVAH---LLSQAV 238
Query: 215 RAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
L L +A+YV V + D C G W ++LLVP
Sbjct: 239 ERTGELPNSKLSRLAVYVAQD---------CAVYMQDVEEVCRTSDGG---WKSLILLVP 286
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L+LG +K+NP Y P + T +G++GG+P S Y +G Q++ I+LDPH Q ++
Sbjct: 287 LMLGTDKLNPVYAPCVTSLLTLDACIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVD 346
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA- 392
+ K++ +++H R + L +DPS +GFY +++ DF SN
Sbjct: 347 VSKENFPL--TSFHCTSPRKMLLSKMDPSCCVGFYFPNRESLTDFMETIHSFVIPSNQKT 404
Query: 393 --PLFTVTQTHKKPVNH 407
P+F + KK +
Sbjct: 405 DYPMFLFCEGSKKDLQQ 421
>gi|406042044|gb|AFS31124.1| autophagy related protein Atg4-like protein, partial [Spodoptera
litura]
Length = 365
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 116/351 (33%), Positives = 176/351 (50%), Gaps = 33/351 (9%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + +W+LG + QD L +D +S I +YRKGF PIGD +T
Sbjct: 5 IPQTKESVWILGKKYSAIQD-----------LDRIRRDITSIIWCTYRKGFIPIGDEGLT 53
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 180
SD GWGCMLR QM++ AL+ L W + P R+ Y++I+ F + + +P+SI
Sbjct: 54 SDKGWGCMLRCGQMVLGVALVRVHLSADW---VWTPETRDPTYLKIIQRFEERKQAPYSI 110
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + G + G G W GP + + + L + + + I+V + +
Sbjct: 111 HQVALMGASEGKQVGQWFGPNTVAQVLKKLTVYDKWSS--------LVIHVALDNTVVKE 162
Query: 241 GGAPVVCIDDASRHCSVFSKGQA-DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
+++ CS DW P+LL+VPL LGL ++NP YI L++ F PQS+
Sbjct: 163 DILQQCVVNNDRGDCSAAPDSLVTDWMPLLLIVPLRLGLSEINPIYIDGLKICFQCPQSI 222
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQP---VINIGKDDLEADTSTYHSDVIRHIHL 356
G++GGKP + Y+VG + IYLDPH Q V D+ + +YH I +
Sbjct: 223 GVIGGKPNQALYLVGCVGDEVIYLDPHTTQRSGLVETKTTDEQKEMDWSYHCKYASRIPM 282
Query: 357 DSIDPSLAIGFYCRDKDDFDDFCAR-ASKLAEESNGAPLFTVTQTHKKPVN 406
++DPS+A+ F CR K DF++ CA +KL ES PLF + K+P +
Sbjct: 283 LAMDPSVAVCFLCRTKRDFEELCATIETKLMCESQ--PLFETCE--KRPAH 329
>gi|195054945|ref|XP_001994383.1| GH16873 [Drosophila grimshawi]
gi|193892146|gb|EDV91012.1| GH16873 [Drosophila grimshawi]
Length = 673
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 113/318 (35%), Positives = 161/318 (50%), Gaps = 21/318 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + + + G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 253 AAENQVTECPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLIA 312
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S+ SPFSIH L++ G+ G
Sbjct: 313 QGLICHFLGRSWRYDPESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 372
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E
Sbjct: 373 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAQDCTIYMQDVEQQCSIPEPAPKQ 432
Query: 245 VVCIDDASRHCSVFSK----GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V A + S K Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 433 HVPWQHAKKSTSDAPKLDQPPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 492
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG QE+ I+LDPH Q ++++ ++ ++H R I +D
Sbjct: 493 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETF--SMHSFHCKSPRKIKSSKMD 550
Query: 361 PSLAIGFYCRDKDDFDDF 378
PS IGFYC K DFD F
Sbjct: 551 PSCCIGFYCATKTDFDSF 568
>gi|452977855|gb|EME77619.1| hypothetical protein MYCFIDRAFT_191078 [Pseudocercospora fijiensis
CIRAD86]
Length = 445
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 111/307 (36%), Positives = 159/307 (51%), Gaps = 45/307 (14%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 132
+EF DF SR+ I+YR F PI S TSD GWGCM+R
Sbjct: 109 SEFLDDFESRVWITYRDAFPPIPKSSHPAAASKMSFTTKLRNFTNQAGFTSDTGWGCMIR 168
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
S Q L+A ++ HRLGR WRK + +RE+ +IL LF D+ +PFSIH ++ G +A G
Sbjct: 169 SGQSLLANTIVVHRLGRDWRKGQK---EREHKDILSLFADTPDAPFSIHKFVEHGAQACG 225
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A ARC RA T Q+ + +Y D D V ID A
Sbjct: 226 TYPGEWFGP-------NATARCLRALTDKYHQA-GLRVYARPNDSD--------VYID-A 268
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ ++ P L+++ + LG+EKV P Y L+ PQS+GI GG+P +S Y
Sbjct: 269 LTATATQKDANDEFQPTLIVLGIRLGIEKVTPAYHAALKAALELPQSMGIAGGRPSSSHY 328
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
VG Q ++ YLDPH +P+++ + DT H+ +R + L +DPS+ +GF R
Sbjct: 329 FVGHQGDNFFYLDPHTTRPMLSPQPSAEDVDTC--HTRRVRRLSLAEMDPSMLLGFLVRS 386
Query: 372 KDDFDDF 378
K+DF+++
Sbjct: 387 KEDFEEW 393
>gi|301104974|ref|XP_002901571.1| cysteine protease family C54, putative [Phytophthora infestans
T30-4]
gi|262100575|gb|EEY58627.1| cysteine protease family C54, putative [Phytophthora infestans
T30-4]
Length = 392
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 118/338 (34%), Positives = 172/338 (50%), Gaps = 26/338 (7%)
Query: 61 TGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK 120
T ++ ++ +WLLG K D A D + + F S + +YR+ + + +
Sbjct: 14 TPSAALSAPVWLLG---KRYDDVAAVD------FDAYKRSFESILWFTYRRDYPAMTPYE 64
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGDSE 174
TSD GWGCMLRS+QML+ QAL LGR WR P + YV++L F DS
Sbjct: 65 HTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPALFETEIDARLPETYVQLLRWFADSP 124
Query: 175 TSP--FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
+SIH +++ G Y G W GP + L R E G VV
Sbjct: 125 DVECRYSIHQMVKLGVQYDKLPGEWYGPTTAAQVLRDLVNLHRREFGGELSMYVPQEGVV 184
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADW-TPILLLVPLVLGLEKVNPRYIPTLRL 291
D+ + +C D H ++ ++DW T +L+L+PL LGL++VN RY+P ++
Sbjct: 185 YSDDVAK------LCFFDPLLHPPT-TEDKSDWSTALLILIPLRLGLDQVNERYVPAIQK 237
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA-DTSTYHSDV 350
+F FPQS+GI+GGK G S Y VG Q++ LDPHDV P + A T HS
Sbjct: 238 SFAFPQSVGIIGGKKGHSVYFVGTQQDQLHLLDPHDVHPAPELNTAFPTATHLRTVHSSR 297
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
+++ +IDPSLA+GF C ++ D++DF R L +E
Sbjct: 298 PLVMNVTTIDPSLALGFLCENRVDYEDFERRVRILHDE 335
>gi|291202714|dbj|BAI82576.1| autophagy-related 4 [Haemaphysalis longicornis]
Length = 387
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 167/338 (49%), Gaps = 41/338 (12%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
L + D +S+I ++YR+ F I + TSD GWGCMLR QM VA+AL+ L R W+
Sbjct: 41 LDDLRSDVTSKIWLTYRRNFPAISGTDYTSDTGWGCMLRCGQMAVAEALMRRHLRRGWQW 100
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
P + D Y+ +L +F D + FSIH + Q G + G A G W GP + LA
Sbjct: 101 APGIR--DESYLRVLRMFQDKKNCTFSIHQIAQMGVSEGKAVGQWFGPNTVAHVLRKLAA 158
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA--------- 263
+ + +AI+V + VV +DD + C + + ++
Sbjct: 159 FDKWSS--------LAIHVAMDN---------VVIMDDIRKVCRLEATAESGVRNRAEPA 201
Query: 264 --------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
W P+LL +PL LGL ++NP Y L+ TF QSLGI+GGKP + YI+GV
Sbjct: 202 GLAAAAAESWKPLLLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYIIGV 261
Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
+ ++LDPH Q +++ D D +YH + + +DPS+A+ FY + +F
Sbjct: 262 VGDDLVFLDPHTTQLAVDL--DTEFPDDESYHCAHASRMDIGQLDPSIALCFYLPTEAEF 319
Query: 376 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGE 413
D +C A K PLF +T+ +P+ D + E
Sbjct: 320 DSWCNLAHKHLISEMSQPLFEITE--HRPLGWPDFVDE 355
>gi|347971093|ref|XP_554420.4| AGAP004023-PA [Anopheles gambiae str. PEST]
gi|333469628|gb|EAL39379.4| AGAP004023-PA [Anopheles gambiae str. PEST]
Length = 606
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 115/318 (36%), Positives = 161/318 (50%), Gaps = 34/318 (10%)
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
G+ F +DF SRI ++YR+ F + DS TSD GWGCM+RS QML+AQ L+ H LGR WR
Sbjct: 195 GIDAFRRDFISRIWMTYRREFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLVAHFLGRSWR 254
Query: 153 KPLQKPFDRE---YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 207
+ E + +++ FGD S+TSPFSIH L+ GK G G W GP A+
Sbjct: 255 WDVSMFTAYEESIHRKVIRWFGDTSSKTSPFSIHTLVALGKESGKKPGDWYGPGAVAHLL 314
Query: 208 EALARCQRAET----GLGCQ-SLPMAIYV--------VSGDEDG---ERGGAPVVCIDDA 251
R E G+ + A+Y+ V G +R GAP +
Sbjct: 315 RQAVRLAAQEITDLDGINVYVAQDCAVYIQDILDECTVPATPAGAPWQRKGAPGGTNSSS 374
Query: 252 SRHCSVFSKG-----------QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
S + S A W ++LLVPL LG +K+NP Y L+ + +G
Sbjct: 375 STAHTERSGATSCAEGDEDVQSAHWKSLILLVPLRLGTDKLNPIYNECLKAMLSLDYCIG 434
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GG+P S Y VG QE+ I+LDPH Q ++++ +D+ +++H R + L +D
Sbjct: 435 IIGGRPKHSLYFVGYQEDKLIHLDPHYCQDMVDVNQDNFP--VASFHCKSPRKMKLSKMD 492
Query: 361 PSLAIGFYCRDKDDFDDF 378
PS IGFYC K DF F
Sbjct: 493 PSCCIGFYCETKKDFYKF 510
>gi|354475125|ref|XP_003499780.1| PREDICTED: cysteine protease ATG4D [Cricetulus griseus]
gi|344240088|gb|EGV96191.1| Cysteine protease ATG4D [Cricetulus griseus]
Length = 474
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 117/382 (30%), Positives = 179/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----VHLCGRRYHFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQKP------------- 158
+TSD GWGCMLRS QM++AQ LL H L R WR P + P
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLAPPEMPGPASPSRYRGPGR 193
Query: 159 --------------FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 HVPPRWTQGTLEMEQDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C +P + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-KCSEVPRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS IGFY ++ +F+ C+ +
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTIGFYAGNRKEFETLCSELMR 414
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436
>gi|225718596|gb|ACO15144.1| Cysteine protease ATG4B [Caligus clemensi]
Length = 390
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 116/374 (31%), Positives = 184/374 (49%), Gaps = 54/374 (14%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
S S +W+LG + N +AE N + SR+L +YRK F I S TSD
Sbjct: 28 SDSPVWILG-----------NELCARNDIAELNSEVLSRLLFTYRKEFSEIDGSGYTSDS 76
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 179
GWGCMLR QM++ +AL LGR W+ + + +Y++IL+LF DS+ +P+S
Sbjct: 77 GWGCMLRCGQMVLGEALQRISLGRDWKWDHKVDNEVDEDLKGKYLKILNLFQDSKVAPYS 136
Query: 180 IHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
IH + G++ G+W GP + + + L+ ++ ++P+ ++V +
Sbjct: 137 IHQIALMGESIQSKKPVGTWFGPNTVAQVLKKLSFFEK--------TVPIRLHVAMDN-- 186
Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
V ID+ C F G ++ P+LL +PL LGL ++NP Y L+ F FPQ
Sbjct: 187 -------TVIIDEIKESCG-FVGGDSE-KPLLLFIPLRLGLTEINPIYFQDLKECFEFPQ 237
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT------STYHSDVI 351
LG++GG+P + Y +G + IYLDPH I+ DT T+H++
Sbjct: 238 ILGVIGGRPNHALYFIGYVDNELIYLDPH-----ISTQSASSTVDTFGGPQDQTHHTERA 292
Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHS 408
+ +DPSL++ F CR++ +F+D C R + +PLF + + H P+ S
Sbjct: 293 YRMDFKDLDPSLSLCFLCRNESEFEDMCERFLFKLIRGHNSPLFEICRQRPEHLMPLPLS 352
Query: 409 DVLGET--GGVPED 420
L VPE+
Sbjct: 353 SSLNSDLPNAVPEE 366
>gi|242007959|ref|XP_002424782.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
gi|212508305|gb|EEB12044.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
Length = 388
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 128/389 (32%), Positives = 196/389 (50%), Gaps = 39/389 (10%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + +W+LG + +D + D S++ +YRKGF PIGDS +T
Sbjct: 21 IPQTREPVWILGRKYDAGRD-----------VTAIRSDIKSKLWFTYRKGFVPIGDSGLT 69
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 180
SD GWGCMLR QM++AQAL+ LGR WR K ++P EY+ IL +F D++T+ +SI
Sbjct: 70 SDKGWGCMLRCGQMVLAQALVCLHLGRDWRWKKDSKEP---EYLRILKMFEDTKTATYSI 126
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + G + G G W GP + + + L+ + + + +L I V +R
Sbjct: 127 HQIALMGVSEGKDVGQWFGPNTVTQVLKKLSVYDKWSSIVIHVALDNTIIVNDIKSLCQR 186
Query: 241 GGAPVVCIDDASRHCS-----VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
V ID +++ S V+ W P+LL+VPL LGL ++NP Y+ L+ FTF
Sbjct: 187 NEQSV--IDSSAQKHSPLNEPVYFNSARKWKPLLLVVPLRLGLSEINPVYLNGLKTCFTF 244
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVIR 352
QSLG++GGKP + Y +G E IYLDPH QPV + +L + + +YH
Sbjct: 245 RQSLGVIGGKPNHALYFIGCVGEHVIYLDPHTTQPVSIVDGKELSYEKTADLSYHCPRAS 304
Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSD 409
+ +DPS+A+ F+C + +FD C + + +S PLF +T H PV +
Sbjct: 305 RSRILDMDPSVAVCFFCSSEVEFDILCQQIQEKLIKSEKQPLFEITLNKPRHWIPVEN-- 362
Query: 410 VLGETGGVPEDDSLGVMSMNDAVGNAHED 438
P + +L + + N+ ED
Sbjct: 363 --------PVERTLNLQDYERSFENSDED 383
>gi|350426238|ref|XP_003494376.1| PREDICTED: cysteine protease ATG4D-like [Bombus impatiens]
Length = 486
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 112/344 (32%), Positives = 167/344 (48%), Gaps = 30/344 (8%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191
Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
H LGR WR + +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 192 CHFLGREWRWQVDQPLKTEQQKLDEHNHRLIIKSFGDLPDSTSPFSIHTLVSLGALWGKR 251
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
AG W GP ++ Q AE +L A+YV V + D
Sbjct: 252 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 299
Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
C + W ++L VPL LG +K+NP Y L T +G++GG+P S Y +
Sbjct: 300 VCQM---PDGKWKSLILFVPLRLGADKLNPVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 356
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY +K
Sbjct: 357 GFQEDKLINLDPHYCQETVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHNKM 414
Query: 374 DFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGET 414
F +F A +E P+F + K + + E
Sbjct: 415 QFTNFMEIAPSYLVPEDEKVDYPMFLFCEGSGKDLQQKIEIAEN 458
>gi|357612380|gb|EHJ67950.1| autophagy related protein Atg4-like protein [Danaus plexippus]
Length = 354
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 113/328 (34%), Positives = 164/328 (50%), Gaps = 49/328 (14%)
Query: 93 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
G+ F DF S+I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR
Sbjct: 8 GIEGFKSDFISKIWMTYRREFPTMSGSSFTTDCGWGCMLRSGQMMLAQALVCHFLGRSWR 67
Query: 153 ---KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 201
KP+Q RE+ E I+ FGD S SP SIH ++ G+A G G W GP
Sbjct: 68 WSEKPIQN--GREFQEDCLHRMIIKWFGDKSSVNSPLSIHQMVTLGEALGKKPGDWYGP- 124
Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV-------VCIDDASRH 254
++A C + ++ V + E+ E V + I D H
Sbjct: 125 ------ASVAHCLK------------SVMVEASKENYEFDKLEVYVAQDSTIYIQDVYTH 166
Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
C + W ++LLVP+ LG E++NP Y P L T +GI+GG+P S Y VG
Sbjct: 167 CRL---PNGCWKSLILLVPVKLGTERLNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVG 223
Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
Q++ I+LDPH Q ++++ + + T+H R + + +DPS IGFY + D
Sbjct: 224 YQDDRLIHLDPHYCQEMVDVWQPNFSLQ--TFHCRSPRKMPISKMDPSCCIGFYLQTHHD 281
Query: 375 FDDFCARASKL-----AEESNGAPLFTV 397
F+ F + SN P+FT+
Sbjct: 282 FETFVNVINTFLTPQGVSSSNEYPMFTL 309
>gi|327267215|ref|XP_003218398.1| PREDICTED: cysteine protease ATG4B-like [Anolis carolinensis]
Length = 393
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 119/401 (29%), Positives = 186/401 (46%), Gaps = 61/401 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVLTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALICRHLGRDWRWSKGKKQTDSYYNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + LA + +A+++ + V ++
Sbjct: 134 EGKSIGQWYGPNTVAQVLRKLASFDTWSS--------LAVHIAMDN---------TVVME 176
Query: 250 DASRHC---------SVFSKGQADW------------------TPILLLVPLVLGLEKVN 282
+ R C S F + D+ P++LL+PL LGL +N
Sbjct: 177 EIRRLCKPSCPCPGASAFPAAEPDFLSNGYPEGAECTDRLLLWKPLVLLIPLRLGLTDIN 236
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D
Sbjct: 237 EAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPMDSCYIPD 296
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 402
S + + + +DPS+A+GF+C ++DF+D+C R KL+ P+F + +
Sbjct: 297 ESFHCQHPPCRMSIAELDPSIAVGFFCNSEEDFNDWCQRIKKLSLIRGALPMFELVEHQP 356
Query: 403 KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
+ DVL T + D L + ++ ++D+++L
Sbjct: 357 SHFSSPDVLNLTPDSSDADRL------ERFFDSEDEDFEIL 391
>gi|380015613|ref|XP_003691794.1| PREDICTED: cysteine protease ATG4D-like [Apis florea]
Length = 486
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 112/348 (32%), Positives = 167/348 (47%), Gaps = 38/348 (10%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191
Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
H LGR WR +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 192 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 251
Query: 194 AGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
AG W GP + + ++ E A A L A+YV V +
Sbjct: 252 AGDWYGPSSVAHLLSQAVENAAERHPAFNNL-------AVYVAQD---------CAVYLQ 295
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D C W ++L VPL LG +K+NP Y L T +G++GG+P S
Sbjct: 296 DIENVCQT---PDGKWKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 352
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Y +G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY
Sbjct: 353 LYFIGFQEDKLINLDPHYCQETVDVLKDNFSL--TSFHCTSPRKMLISKMDPSCCVGFYF 410
Query: 370 RDKDDFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGET 414
+K F +F A +E P+F + K ++ + E
Sbjct: 411 HNKMQFTNFMEIAPSYLVPEDEKIDYPMFLFCEGSGKDLHQKIEIAEN 458
>gi|328786958|ref|XP_393739.4| PREDICTED: cysteine protease ATG4D-like [Apis mellifera]
Length = 525
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 114/348 (32%), Positives = 171/348 (49%), Gaps = 38/348 (10%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 171 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 230
Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
H LGR WR +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 231 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 290
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCID 249
AG W GP ++ A Q E + + P +A+YV V +
Sbjct: 291 AGDWYGPSSV-----AHLLSQAVENAV--ERHPAFNNLAVYVAQD---------CAVYLQ 334
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D C S G+ W ++L VPL LG +K+NP Y L T +G++GG+P S
Sbjct: 335 DIENVCQT-SDGK--WKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 391
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Y +G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY
Sbjct: 392 LYFIGFQEDKLINLDPHYCQETVDVLKDNFSL--TSFHCTSPRKMLISKMDPSCCVGFYF 449
Query: 370 RDKDDFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGET 414
+K F +F A +E P+F + K ++ + E
Sbjct: 450 HNKMQFTNFMEIAPSYLVPEDEKIDYPMFLFCEGSGKDLHQKIEIAEN 497
>gi|213513159|ref|NP_001133247.1| cysteine protease ATG4B [Salmo salar]
gi|209147572|gb|ACI32896.1| Cysteine protease ATG4B [Salmo salar]
gi|223647372|gb|ACN10444.1| Cysteine protease ATG4B [Salmo salar]
Length = 397
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 115/312 (36%), Positives = 166/312 (53%), Gaps = 18/312 (5%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR WR +
Sbjct: 47 TSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVRRHLGRDWRWVRSQSQRE 106
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEALAR 212
+Y+ IL+ F D + +S+H + Q G G + G W GP A+ SW L
Sbjct: 107 DYISILNAFLDKKDGYYSLHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRLTV 166
Query: 213 CQRAETGLGCQS-----LPMAIYVVSGDEDGERG-GAPVVCIDDASRHCSVFSKGQADWT 266
+ + + +P Y + D + G P C++ A C++ + A W
Sbjct: 167 HVAMDNTVVIEEIKRLCMPWLDYGGAACVDLQGGMPEPNGCLEGA---CALAEEETALWK 223
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+LLL+PL LGL +N YI TL+ F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 224 PLLLLIPLRLGLSDINEAYIETLKQCFQLPQSLGVIGGKPNHAHYFIGYVGEELIYLDPH 283
Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
QP + +D D + + +H+ IDPS+A+GF+CR +DDFDD+C R KL+
Sbjct: 284 TTQPAVEPCEDSQVPDDTYHCQHPPCRMHICEIDPSIAVGFFCRTEDDFDDWCMRFRKLS 343
Query: 387 EESNGAPLFTVT 398
G P+F +
Sbjct: 344 HTRAGLPMFELV 355
>gi|427783027|gb|JAA56965.1| Putative cysteine protease required for autophagy [Rhipicephalus
pulchellus]
Length = 390
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 164/328 (50%), Gaps = 44/328 (13%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
L + + +S+I ++YRK F I + TSD GWGCMLR QM+VA+A++ LG+ W+
Sbjct: 41 LDDLRSNITSKIWLTYRKNFPAISGTDYTSDTGWGCMLRCGQMVVAEAVMRRHLGKDWQW 100
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
P K D +Y+ +L +F D + +SIH + Q G + G G W GP + L+
Sbjct: 101 SPGTK--DEKYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKEVGQWFGPNTIAHVLRKLST 158
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV--------------- 257
+ + +A++V + VV +DD + C V
Sbjct: 159 FDKWSS--------LAMHVAMDN---------VVVMDDIRKICRVETTTDVEDGIRNRTQ 201
Query: 258 -----FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+ G W P++L +PL LGL ++NP Y L+ TF QSLGI+GGKP + YI
Sbjct: 202 SHGGPAAAGARSWKPLVLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYI 261
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
+GV + ++LDPH Q +++ D+E + +YH + + +DPS+A+ FY
Sbjct: 262 IGVVGDDLVFLDPHTTQLAVDL---DVECPEDESYHCAHASRMDIGQLDPSIALCFYMAT 318
Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQ 399
+ +FD +C A K PLF +T+
Sbjct: 319 EAEFDSWCNLAHKHLISQMKQPLFEITE 346
>gi|195401363|ref|XP_002059283.1| GJ16311 [Drosophila virilis]
gi|194156157|gb|EDW71341.1| GJ16311 [Drosophila virilis]
Length = 397
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 108/376 (28%), Positives = 177/376 (47%), Gaps = 47/376 (12%)
Query: 48 MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 98
M + E LGP I +D+WLLG + Q+ L
Sbjct: 13 MDSVFEAYLGPDSMLAGAVGEPEDIPKRNTDVWLLGKRYNAIQELEL-----------IR 61
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
+D SR+ +YR GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W P
Sbjct: 62 RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTP 118
Query: 159 --FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
D Y++I++ F D+ S +SIH + G++ A G W+GP + + + L R
Sbjct: 119 DCRDATYLKIVNRFEDTRKSFYSIHQIALTGESQNKAVGEWLGPNTVAQILKILVRFDDW 178
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
+ + ++V V +D+ C S + W P+LL+VPL L
Sbjct: 179 SS--------LVVHVAMDS---------TVVLDEIYTRCQEVSA--STWKPLLLIVPLRL 219
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G+ +NP YIP L+ S G++GG+P + Y +G ++ +YLDPH Q ++ +
Sbjct: 220 GISDINPMYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGSVAQ 279
Query: 337 DDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
A+ +YH + ++DPSLA+ F C+ ++ FD+ + +
Sbjct: 280 KTTAAEQELDESYHQKYAARLSFGAMDPSLAVCFLCKTRNSFDELLQQLRQEVLSLCTPA 339
Query: 394 LFTVTQTHKKPVNHSD 409
LF ++Q+ + +D
Sbjct: 340 LFEISQSRAVDWDTAD 355
>gi|346466653|gb|AEO33171.1| hypothetical protein [Amblyomma maculatum]
Length = 401
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 167/326 (51%), Gaps = 16/326 (4%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
L + D +S+I ++YRK F I + TSD GWGCMLR QM++A+AL+ LG+ W+
Sbjct: 54 LDDLRNDVTSKIWLTYRKNFPAISGTDHTSDTGWGCMLRCGQMVIAEALMRRHLGKGWQW 113
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
P + D Y+ +L +F D + +SIH + Q G + G A G W GP + L+
Sbjct: 114 APGIR--DENYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKAVGQWFGPNTIAHVLRKLSA 171
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS----VFSKGQADWTPI 268
+ + L + V+ R P V DD RH + + W P+
Sbjct: 172 FDKW-SSLAVHVAMDNVVVMDDIRKICRVETPAV--DDGVRHRTQSHGLACASAVSWKPL 228
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
LL +PL LGL ++NP Y L+ TF QS+GI+GGKP + +I+GV + ++LDPH
Sbjct: 229 LLFIPLRLGLNEINPVYYCGLKRTFALKQSVGIIGGKPNHALFIIGVVGDDLVFLDPHTT 288
Query: 329 QPVINIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
Q +++ D+E + +YH + + +DPS+A+ FY + +FD +C A K
Sbjct: 289 QLAVDL---DVEFPEDESYHCAHASRMDIGQLDPSIALCFYLPTECEFDSWCNLAHKHLI 345
Query: 388 ESNGAPLFTVTQTHKKPVNHSDVLGE 413
PLF +T+ ++P+ D E
Sbjct: 346 TQMKQPLFEITE--ERPLGWPDFTEE 369
>gi|22658287|gb|AAH30861.1| Autophagy-related 4D (yeast) [Mus musculus]
gi|74152222|dbj|BAE32395.1| unnamed protein product [Mus musculus]
Length = 474
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 175/382 (45%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ G + +F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQQFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + + ++H R + +DPS +GFY ++ +F+ C+ +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436
>gi|449268268|gb|EMC79138.1| Cysteine protease ATG4C [Columba livia]
Length = 459
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 120/405 (29%), Positives = 181/405 (44%), Gaps = 82/405 (20%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 112
S S ++LLG C+ DE+ G+ + G+N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVFLLGKCYHFKTDES-GELSTDGSNFDKINTEISGNVEEFRKDFISRIWLTYREE 94
Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 151
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 95 FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIDSSDSESWTAHTV 154
Query: 152 --------------RKP----------LQKPFDRE-------YVEILHLFGDSETSPFSI 180
R+P L++ +D + +I+ FGDS + F +
Sbjct: 155 KKLTASFEASLTAEREPKILSNHHRGTLKRNWDESERRNEVYHRKIISWFGDSPLTAFGL 214
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H L++ GK G AG W GP + R G + IYV
Sbjct: 215 HQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTIYVAQD------ 263
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V D R CS G+AD +++LVP+ LG E+ N Y+ ++ + +G
Sbjct: 264 --CTVYSSDVIDRQCSFMDSGEADTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVG 321
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +D
Sbjct: 322 IIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMD 379
Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
PS IGFYCR DF+ +K+ + S+ PLFT + H +
Sbjct: 380 PSCTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 424
>gi|405972565|gb|EKC37327.1| Cysteine protease ATG4B [Crassostrea gigas]
Length = 405
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 105/337 (31%), Positives = 157/337 (46%), Gaps = 47/337 (13%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--K 153
E DF S+I +YRK F IG + T D GWGCMLR QM++AQAL+ LGR W+ K
Sbjct: 46 ELKGDFLSKIWCTYRKNFPAIGGTGPTCDGGWGCMLRCGQMMLAQALVVRHLGRDWKWNK 105
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
Q D+ Y IL +F D +++ +SI + G + G GSW GP + + + LA
Sbjct: 106 NCQ---DQTYKRILQMFADKKSANYSIQQIASMGVSEGKPVGSWFGPNTVAQVLKKLAVY 162
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----------- 262
+ + ++ D VC DD C + Q
Sbjct: 163 DEWSS---------IVIHIAMDNTVIENDIKSVCKDDGKSTCDIIGVRQLKHESAATGRS 213
Query: 263 --------------------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
W P+LL++PL LGL ++N Y+ +L+ +FPQS+GI+
Sbjct: 214 KKSSQDSSKQDKNKQNAVDVKSWKPLLLVIPLRLGLTEINSVYVQSLKACLSFPQSVGII 273
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP + + VG + IYLDPH Q ++ D +YH +++ +DPS
Sbjct: 274 GGKPNHAHWFVGYMSDKLIYLDPHTTQLCEDL--DSPNFSDESYHCPYPSTMNVMELDPS 331
Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
+A+GFYC + +FDD K S+ P+F + +
Sbjct: 332 IALGFYCGTEKEFDDLTQSVQKFVVGSSKTPMFELYK 368
>gi|194213171|ref|XP_001491090.2| PREDICTED: cysteine protease ATG4D [Equus caballus]
Length = 424
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 179/383 (46%), Gaps = 67/383 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E+ GD + F +DF+SR+ ++YR+ F P+
Sbjct: 33 SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 82
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 83 GCLTSDCGWGCMLRSGQMMLAQGLLLHYLPRDWTWAEGAGLGPPEPVGLSSPNRYRGPAR 142
Query: 153 ---------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 203
P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 143 WMAPTLGPGAPPSWSRERRHRQIVSWFADHPRAPFGLHQLVELGQSSGKKAGDWYGP--- 199
Query: 204 CRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 263
+A R + + +YV + A +V D + A
Sbjct: 200 ----SLVAHILRKAVESCAEVTRLVVYVSQDCTVYKADVARLVARPDPT----------A 245
Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YL
Sbjct: 246 EWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYL 305
Query: 324 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 383
DPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ +
Sbjct: 306 DPHYCQPTVDVSRADFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELT 363
Query: 384 KLAEESNGA---PLFTVTQTHKK 403
++ S+ P+FT+ + H +
Sbjct: 364 RVLSSSSATERYPMFTLAEGHAQ 386
>gi|26349259|dbj|BAC38269.1| unnamed protein product [Mus musculus]
Length = 474
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 174/382 (45%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + + ++H R + +DPS +GFY ++ +F+ C+ +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436
>gi|195118032|ref|XP_002003544.1| GI17971 [Drosophila mojavensis]
gi|193914119|gb|EDW12986.1| GI17971 [Drosophila mojavensis]
Length = 382
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 184/388 (47%), Gaps = 48/388 (12%)
Query: 48 MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 98
M + E LGP I +++WLLG + Q+ L
Sbjct: 13 MDSVFEAYLGPDGVLAGAVGEIEDIPKRNTNVWLLGKRYNAIQE-----------LEPIR 61
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
+D SR+ +YR GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W P
Sbjct: 62 RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTP 118
Query: 159 --FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
D Y++I++ F D+ S +SIH + G++ A G W+GP + + + L R
Sbjct: 119 DCRDATYLKIVNRFEDTRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKILVRFDDW 178
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
+ +A++V V +DD C ++ W P+LL+VPL L
Sbjct: 179 SS--------LAVHVAMDS---------TVVLDDIYTCCQ--ESSESSWKPLLLIVPLRL 219
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G+ +NP YIP L+ S G++GG+P + Y +G ++ +YLDPH Q + +
Sbjct: 220 GITDINPIYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGAVAQ 279
Query: 337 DDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
A+ +YH + ++DPSLA+ F C+ +D F++ + + +
Sbjct: 280 KTTAAERELDESYHQKYAARLSFGAMDPSLAVCFLCKTRDSFEELLQQLRQDVLTLSTPA 339
Query: 394 LFTVTQTHKKPVNHSDVLGETGGVPEDD 421
LF ++Q+ + +D + E +P+ D
Sbjct: 340 LFEISQSRAVDWDTADDI-EWPAMPDID 366
>gi|29135261|ref|NP_705811.8| cysteine protease ATG4D [Mus musculus]
gi|61211815|sp|Q8BGV9.1|ATG4D_MOUSE RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
cysteine endopeptidase; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related cysteine endopeptidase
4; AltName: Full=Autophagy-related protein 4 homolog D
gi|26331508|dbj|BAC29484.1| unnamed protein product [Mus musculus]
gi|26348941|dbj|BAC38110.1| unnamed protein product [Mus musculus]
gi|27763977|emb|CAC85952.1| APG4-D protein [Mus musculus]
gi|47125055|gb|AAH69851.1| Autophagy-related 4D (yeast) [Mus musculus]
gi|148693226|gb|EDL25173.1| autophagy-related 4D (yeast), isoform CRA_b [Mus musculus]
Length = 474
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 174/382 (45%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + + ++H R + +DPS +GFY ++ +F+ C+ +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436
>gi|321472665|gb|EFX83634.1| hypothetical protein DAPPUDRAFT_194862 [Daphnia pulex]
Length = 389
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 178/362 (49%), Gaps = 34/362 (9%)
Query: 40 KRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQ 99
KR++ A +E + R G + +W+LG + L E N
Sbjct: 23 KRMLEACEAFVTYESGIILERQGFEVNDEPVWILG-----------REYDTKTKLDELNS 71
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--WRKPLQK 157
D SR+L++YR+ F PIGDS +TSD GWGCMLR QM+VAQAL+ LGR W +
Sbjct: 72 DVKSRLLLTYRRNFPPIGDSGMTSDRGWGCMLRCGQMVVAQALINQHLGRQPFWPVGDDQ 131
Query: 158 PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
Y +IL LF D +T+ +SIH L Q G + G G W GP + + + L+
Sbjct: 132 RTTESYKKILKLFEDKKTAVYSIHQLAQMGVSEGKEIGQWFGPNTVAQVLKKLSEYDEWS 191
Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC--SVFSKGQADWTPILLLVPLV 275
+ I+V + V I++ + C + + W+P+LL+VPL
Sbjct: 192 A--------LKIHVAMDN---------AVVIEEIEQLCHKKITPTETSTWSPLLLVVPLR 234
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
LGL +NP YI +L+ PQS+G++GGKP + Y +G + ++LDPH Q I++
Sbjct: 235 LGLLNINPIYIDSLKACLQMPQSIGMIGGKPSQALYFIGYVGDDVVFLDPHLTQNAIDLD 294
Query: 336 KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 395
+D E D S+YH I S+DPSLA+ F C ++ D + ++E LF
Sbjct: 295 ED--EFDDSSYHPATCARISFQSMDPSLAVCFSCTTHSEWKDLLRQFKDMSEIGKKQNLF 352
Query: 396 TV 397
V
Sbjct: 353 EV 354
>gi|296804856|ref|XP_002843276.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
gi|238845878|gb|EEQ35540.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
Length = 473
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 114/317 (35%), Positives = 161/317 (50%), Gaps = 47/317 (14%)
Query: 97 FNQDFSSRILISYRKGFDPI----GDSK------------------ITSDVGWGCMLRSS 134
F DF SR+ I+YR F PI G S TSD GWGCM+RS
Sbjct: 138 FLDDFESRLWITYRSHFPPIPKTGGSSSSSMPLGVRLRSQLIDTQGFTSDTGWGCMIRSG 197
Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLA 193
Q L+A LLF RLGR WR+ Q ++E E+L LF D +PFSIH +Q G A G
Sbjct: 198 QSLLANTLLFLRLGRGWRRGSQ---EQEESELLSLFADHPRAPFSIHRFVQHGATACGKC 254
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCIDDAS 252
G W GP A + +ALA G + +Y+ S G + ER + C
Sbjct: 255 PGEWFGPAAAAQCIQALAN--------GHPQAGLNVYITSDGSDIYERQFREIACR---- 302
Query: 253 RHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ G+ D P L+L+ + LG+++V P Y +L+ FPQS+GI GG+P +S Y
Sbjct: 303 ---GLGEDGEDDSIKPTLILLGVRLGIDRVTPVYWESLKEVIRFPQSVGIAGGRPSSSHY 359
Query: 312 IVGVQEESAIYLDPHDVQPVI---NIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+ Q ++ YLDPH +P + G+D + STYH+ +R +H+ +DPS+ IGF
Sbjct: 360 FIATQGDTFFYLDPHQTRPSLPPRTAGEDVYSPGELSTYHTRRLRRLHIREMDPSMLIGF 419
Query: 368 YCRDKDDFDDFCARASK 384
RD+ D++D R +
Sbjct: 420 LVRDEGDWEDLKGRIRR 436
>gi|440891575|gb|ELR45180.1| Cysteine protease ATG4A, partial [Bos grunniens mutus]
Length = 408
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 108/355 (30%), Positives = 172/355 (48%), Gaps = 39/355 (10%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
G + G W GP A+ W +LA + + + + +S D
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 195
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 196 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILN 311
Query: 359 IDPSLAI------------GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+DPS+A+ GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 312 LDPSVALVVLSCLLLLPPKGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 365
>gi|178057055|ref|NP_001116551.1| cysteine protease ATG4D [Sus scrofa]
gi|61211337|sp|Q684M2.1|ATG4D_PIG RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related protein 4 homolog D
gi|51870495|emb|CAG15153.1| AUT-like 4, cysteine endopeptidase [Sus scrofa]
Length = 469
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 110/378 (29%), Positives = 178/378 (47%), Gaps = 62/378 (16%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------KPLQKPF----------- 159
+TSD GWGCMLRS QM++AQ LL H L R W P P
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSQGVGLGPPESSPNRYRGPAHWMPP 192
Query: 160 -----------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 HWVQAAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------S 245
Query: 209 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 268
+A R + + +YV + A +V D + A+W +
Sbjct: 246 LVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKAV 295
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH
Sbjct: 296 VILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYC 355
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ +++
Sbjct: 356 QPTVDVSQADFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTRVLSS 413
Query: 389 SNGA---PLFTVTQTHKK 403
S+ P+FT+ + H +
Sbjct: 414 SSATERYPMFTLVEGHAQ 431
>gi|149642765|ref|NP_001092616.1| cysteine protease ATG4D [Bos taurus]
gi|148744285|gb|AAI42400.1| ATG4D protein [Bos taurus]
Length = 472
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 112/381 (29%), Positives = 179/381 (46%), Gaps = 65/381 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192
Query: 156 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
Q P +R + +I+ F D +PF +H L++ G+ G AG W GP
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
+A R + + +YV + A +V D + A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
+++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
H QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ +++
Sbjct: 356 HYCQPTVDVSQADFPLE--SFHCTSPRRMAFAKMDPSCTVGFYAGDRKEFETLCSELTRV 413
Query: 386 AEESNGA---PLFTVTQTHKK 403
S+ P+FT+ + H +
Sbjct: 414 LSSSSATERYPMFTLVEGHAQ 434
>gi|194901010|ref|XP_001980048.1| GG20629 [Drosophila erecta]
gi|190651751|gb|EDV49006.1| GG20629 [Drosophila erecta]
Length = 708
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 289 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 348
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 349 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 408
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 409 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 468
Query: 245 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 469 HVPWQQAKRPQAETPKTEQQQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 528
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 529 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 586
Query: 363 LAIGFYCRDKDDFDDF 378
IGFYC K DFD F
Sbjct: 587 CCIGFYCATKSDFDSF 602
>gi|431918972|gb|ELK17839.1| Cysteine protease ATG4D [Pteropus alecto]
Length = 442
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 182/385 (47%), Gaps = 66/385 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 52 SRTSFSKLSS----VHLCGRRYRFETEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 101
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR----------KP--LQKPF------- 159
+TSD GWGCMLRS QM++AQ LL H L R W +P L P+
Sbjct: 102 GYLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWVKGVGLDPPEPSRLASPYWHHGPAC 161
Query: 160 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 162 WIPPHWTQGSPELEQERRHRQIVSWFADHPKAPFGLHQLVELGQSSGKKAGDWYGP---- 217
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 218 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 264
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + + + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 325 PHYCQPTVDVSQANFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTR 382
Query: 385 LAEESNGA---PLFTVTQTHKKPVN 406
+ S+ P+FT+ + H + N
Sbjct: 383 VLSSSSATERYPMFTLAEGHAQDHN 407
>gi|195570668|ref|XP_002103326.1| GD20357 [Drosophila simulans]
gi|194199253|gb|EDX12829.1| GD20357 [Drosophila simulans]
Length = 703
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463
Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 464 HVPWQQAKRPQAETPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 581
Query: 363 LAIGFYCRDKDDFDDF 378
IGFYC K DFD+F
Sbjct: 582 CCIGFYCATKSDFDNF 597
>gi|195444549|ref|XP_002069918.1| GK11310 [Drosophila willistoni]
gi|194166003|gb|EDW80904.1| GK11310 [Drosophila willistoni]
Length = 676
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 156/328 (47%), Gaps = 43/328 (13%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 258 AVENQVGETPWEEGIEGFRRDFYSRLWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 317
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L+ G A G
Sbjct: 318 QGLIVHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVSLGTALGK 377
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP ++ L T +++YV + I D
Sbjct: 378 KPGDWYGPASVSY---LLKHALEHATQENADFDNISVYVAKD---------CTIYIQDIE 425
Query: 253 RHCSV----------------------FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
CS+ Q W +++L+PL LG +KVNP Y L+
Sbjct: 426 DQCSIPEPAPKQTHVPWQQMKRPSLNEHQPDQQHWKSVIILIPLRLGTDKVNPAYAHCLK 485
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
L + LGI+GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H
Sbjct: 486 LLLSTENCLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQENF--SMQSFHCKS 543
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
R I +DPS IGFYC K DFD
Sbjct: 544 PRKIKTSKMDPSCCIGFYCATKSDFDSL 571
>gi|296485832|tpg|DAA27947.1| TPA: APG4 autophagy 4 homolog D [Bos taurus]
Length = 472
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 112/381 (29%), Positives = 179/381 (46%), Gaps = 65/381 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192
Query: 156 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
Q P +R + +I+ F D +PF +H L++ G+ G AG W GP
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
+A R + + +YV + A +V D + A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
+++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
H QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ +++
Sbjct: 356 HYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRV 413
Query: 386 AEESNGA---PLFTVTQTHKK 403
S+ P+FT+ + H +
Sbjct: 414 LSSSSATERYPMFTLVEGHAQ 434
>gi|24647125|ref|NP_650452.1| CG6194 [Drosophila melanogaster]
gi|23171357|gb|AAF55180.2| CG6194 [Drosophila melanogaster]
gi|261490735|gb|ACX83596.1| RE44406p [Drosophila melanogaster]
Length = 668
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 166/316 (52%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 249 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 308
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 309 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 368
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 369 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 428
Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + +K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 429 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 488
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 489 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 546
Query: 363 LAIGFYCRDKDDFDDF 378
IGFYC K DFD+F
Sbjct: 547 CCIGFYCATKSDFDNF 562
>gi|340722130|ref|XP_003399462.1| PREDICTED: cysteine protease ATG4D-like [Bombus terrestris]
Length = 485
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 153/305 (50%), Gaps = 27/305 (8%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
A+ + + + EF +DF+SR+ ++YR+ F + S TSD GWGCMLRS QM++AQAL+
Sbjct: 131 AMDAISFEDSIEEFKKDFTSRLWLTYRREFPILNGSTFTSDCGWGCMLRSGQMMLAQALV 190
Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
H LGR WR + +P E + I+ FGD TSPFSIH L+ G G
Sbjct: 191 CHFLGREWRWQVDQPLKTEQQKLDEYNHRLIIKSFGDLPDSTSPFSIHTLVSLGALSGKR 250
Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
AG W GP ++ Q AE +L A+YV V + D
Sbjct: 251 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 298
Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
C + W ++L VPL LG +K+N Y L T +G++GG+P S Y +
Sbjct: 299 VCQM---PDGKWKSLILFVPLRLGADKLNLVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 355
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY DK
Sbjct: 356 GFQEDKLINLDPHYCQETVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHDKM 413
Query: 374 DFDDF 378
F +F
Sbjct: 414 QFTNF 418
>gi|195328749|ref|XP_002031074.1| GM25780 [Drosophila sechellia]
gi|194120017|gb|EDW42060.1| GM25780 [Drosophila sechellia]
Length = 703
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHASQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463
Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 464 HVPWQKAKRPQAENPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 581
Query: 363 LAIGFYCRDKDDFDDF 378
IGFYC K DFD+F
Sbjct: 582 CCIGFYCATKSDFDNF 597
>gi|17862242|gb|AAL39598.1| LD17482p [Drosophila melanogaster]
Length = 653
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 166/316 (52%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 234 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 293
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 294 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 353
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 354 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 413
Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + +K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 414 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 473
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 474 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 531
Query: 363 LAIGFYCRDKDDFDDF 378
IGFYC K DFD+F
Sbjct: 532 CCIGFYCATKSDFDNF 547
>gi|320169048|gb|EFW45947.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 918
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 120/351 (34%), Positives = 172/351 (49%), Gaps = 37/351 (10%)
Query: 64 SSSTSDIWLLGVCHKIAQDEALGDAAGNNG-----LAEFNQDFSSRILISYRKGFDPIGD 118
S S S IW+LG C+ + E G + + +F DF + + SYRK F+ I
Sbjct: 260 SISDSPIWMLGNCYSGKELECNGHTENKHNKRSRHICKFFADFQTLVCFSYRKDFERIPG 319
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQKPFDREYVEILHLFG 171
SK T+D GWGC LRS+QMLVA+AL+ GR WR PL + + I+ LF
Sbjct: 320 SKHTTDCGWGCTLRSAQMLVAEALVLQIFGRRWRIEDRSCPAPLSSSKEDQLRLIIRLFQ 379
Query: 172 DSET--SPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
D SPFSIHN++Q G + + AG W GP ++ R + L A ++
Sbjct: 380 DQLRLDSPFSIHNIVQHGCQLFDKRAGDWFGPASVVRVFADLINQAYAMHQSPFRAYQAI 439
Query: 229 IYVVSGDEDGERGGAPVVCID-DASRHCSVFSKGQADWT-------------------PI 268
+++ D E P D + S S D T P+
Sbjct: 440 DHIIYRDLVAELCSGPDAVRDLEFSTPTSTSESVSTDETVTPSASTSQSPPVLPPPFIPL 499
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
L+L+PL LGL ++N YIP L+ Q +GI+GG+P S Y VG QE++ I+ DPH
Sbjct: 500 LILMPLRLGLNEINRMYIPCLKALLMCAQCVGIIGGRPRHSLYFVGYQEDNVIFADPHGC 559
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
+ +++ + T T+HS V I +DPS+AIGF C+++ DFDD C
Sbjct: 560 KRFVDMQQTSFP--TETFHSAVPNKIPFTHMDPSMAIGFLCQNQADFDDLC 608
>gi|195394658|ref|XP_002055959.1| GJ10670 [Drosophila virilis]
gi|194142668|gb|EDW59071.1| GJ10670 [Drosophila virilis]
Length = 672
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/318 (35%), Positives = 163/318 (51%), Gaps = 21/318 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + + D+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 252 AAENQMADSPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 311
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 312 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGEQLGK 371
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ +E E P
Sbjct: 372 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEEQCSIPEPAPKP 431
Query: 245 VVCIDDASRH----CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V S+ + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 432 HVPWQMTSKKPASDAPKLDQPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 491
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG QE+ I+LDPH Q ++++ ++ ++H R + +D
Sbjct: 492 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETF--SMQSFHCKSPRKLKSSKMD 549
Query: 361 PSLAIGFYCRDKDDFDDF 378
PS IGFYC K DFD F
Sbjct: 550 PSCCIGFYCATKTDFDSF 567
>gi|194764839|ref|XP_001964535.1| GF23235 [Drosophila ananassae]
gi|190614807|gb|EDV30331.1| GF23235 [Drosophila ananassae]
Length = 668
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 167/316 (52%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 248 AVENQVGEHPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 307
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H +GR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 308 QGLICHFMGRTWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGENLGK 367
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 368 KPGDWYGPASVSYLLKHALEHAAQENADFDNISIYVAKDCTIYLQDIEDQCSVPEPAPKP 427
Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + SK Q W +++L+PL LG +K+N Y L+L + LGI+
Sbjct: 428 NVPWQQAKRPQAEVSKTEHQQHWKALIVLIPLRLGSDKLNLAYAHCLKLLLSTEHCLGII 487
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ + ++H R + +DPS
Sbjct: 488 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENFSLN--SFHCKSPRKLKSSKMDPS 545
Query: 363 LAIGFYCRDKDDFDDF 378
IGFYC K DFD+F
Sbjct: 546 CCIGFYCATKSDFDNF 561
>gi|154300262|ref|XP_001550547.1| hypothetical protein BC1G_11320 [Botryotinia fuckeliana B05.10]
gi|166990615|sp|A6SDQ3.1|ATG4_BOTFB RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|347841273|emb|CCD55845.1| similar to cysteine protease atg4 [Botryotinia fuckeliana]
Length = 439
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 106/310 (34%), Positives = 159/310 (51%), Gaps = 51/310 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF ++I ++YR F I S+ TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A ALL R+GR WR+ + +R+ IL LF D +P+SIH ++ G A G
Sbjct: 163 GQSLLANALLTLRMGREWRRGVSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +AL+ Q + +Y+ +GD G+ V
Sbjct: 220 HPGEWFGPSATARCIQALSNSQAKSE--------LRVYI-TGD------GSDVY----ED 260
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+ S+ +D+TP L+LV LGL+K+ P Y L+ + PQS+GI GG+P +S Y
Sbjct: 261 KFMSIAKPNHSDFTPTLILVGTRLGLDKITPVYWEALKYSLQMPQSVGIAGGRPSSSHYF 320
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLE----ADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+GVQE YLDPH +P + KD++E D + H+ +R +H+ +DPS+ I F
Sbjct: 321 IGVQESDFFYLDPHQTRPALPY-KDNVEDYTTEDIDSCHTRRLRRLHIKEMDPSMLIAFL 379
Query: 369 CRDKDDFDDF 378
RD++D++++
Sbjct: 380 IRDENDWNEW 389
>gi|321472016|gb|EFX82987.1| hypothetical protein DAPPUDRAFT_302128 [Daphnia pulex]
Length = 405
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/336 (33%), Positives = 166/336 (49%), Gaps = 39/336 (11%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
S S IWLLG + + + N DF SRI ++YRK F + S TSD
Sbjct: 18 SKDSPIWLLGRIYHQSHKTDDSSSLPTNNFEALKSDFFSRIWLTYRKEFPVLNGSYYTSD 77
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPFDREYVEILHLFGD--S 173
GWGCMLRS QML+AQAL+ H LGR WR + LQ+ R I+ FGD S
Sbjct: 78 CGWGCMLRSGQMLLAQALVCHFLGRDWRWNESGAQEQQTLQESLHR---MIVQWFGDKPS 134
Query: 174 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
P SIH ++ G + G G W GP ++ S+ QRA T + + +Y+
Sbjct: 135 PACPLSIHQMVSQGHISAGKRPGDWYGPSSV--SYIIKQILQRA-TDTYPELDTLRVYIA 191
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQAD----------WTPILLLVPLVLGLEKVN 282
V +DD + CS + + W ++LL+PL LG E++N
Sbjct: 192 QD---------CTVYLDDVKQSCSKICNYECEETDYELIDDQWKSLILLIPLRLGGERMN 242
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
P Y L+ + Q +GI+GGKP S Y +G Q++ I+LDPH+ Q ++++ + +
Sbjct: 243 PTYDSCLKGLLSLEQCIGIIGGKPKHSQYFIGWQDDYLIHLDPHNCQEMVDVLIPNF--N 300
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
++H +R L +DPS +GFY R + +FD+F
Sbjct: 301 LKSFHCHELRKTALKQVDPSCCVGFYLRSQREFDEF 336
>gi|213390042|gb|ACJ46060.1| autophagy related protein Atg4-like protein [Bombyx mori]
Length = 355
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 106/297 (35%), Positives = 151/297 (50%), Gaps = 27/297 (9%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
G+ F DF S+I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR W
Sbjct: 15 EGIEGFKSDFVSKIWMTYRREFPTMTGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRSW 74
Query: 152 RKPLQKPFD--REYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 201
R +KP RE+ E I+ FGD S SP SIH ++ G+A G G W GP
Sbjct: 75 RWLPEKPIQNAREFQEDCLHRKIIKWFGDKSSVNSPLSIHQMVSLGEALGKKPGDWYGPA 134
Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 261
++ ++L E + + +YV V I D C +
Sbjct: 135 SVAHCLKSLIASASKENY---EFDHLEVYVAQDS---------TVYIQDIYSMCQLL--- 179
Query: 262 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
W ++LLVP+ LG EK NP Y P L T +GI+GG+P S Y VG Q++ I
Sbjct: 180 HGAWKSLILLVPVKLGTEKFNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVGYQDDKLI 239
Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+LDPH Q ++++ + + ++H R + L +DPS IGFY + DF+ F
Sbjct: 240 HLDPHYCQEMVDVWQPNFSLQ--SFHCRSPRKMPLAKMDPSCCIGFYLGTQHDFETF 294
>gi|428170513|gb|EKX39437.1| hypothetical protein GUITHDRAFT_143439 [Guillardia theta CCMP2712]
Length = 332
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 150/285 (52%), Gaps = 40/285 (14%)
Query: 70 IWLLGVCHKIA------------QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG 117
+WLLGV + +A ++ + D + N F D SR+ SYR F PI
Sbjct: 70 VWLLGVRYTLAPPPMGQRGEGRETEQTVVDESQN-----FKLDMWSRLWFSYRYNFHPIS 124
Query: 118 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR---EYVEILHLFGDSE 174
+++T+D GWGCM+RS QML+ QAL+ H LGR WR ++ +Y ++L +F D
Sbjct: 125 GTELTTDTGWGCMIRSGQMLIGQALVHHHLGRDWRLSHTSKYNELPSDYRKVLEMFLDHP 184
Query: 175 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC-QSLPMAIYVVS 233
+P SIH+ ++AG+ G AG+W GP +C ++ L A LG +L + Y
Sbjct: 185 CAPLSIHSFVRAGQQVGKKAGTWFGPNTVCSAFSKL----HAGGALGSDNNLQLLAY--- 237
Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
DG G D+ QA P+ +L+P LG+ V+P YIP + F
Sbjct: 238 ---DGNDG-------DNTIYKSEALELLQAG--PLFILLPTRLGVSSVDPSYIPKISHVF 285
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
+FPQSLG +GGKP ++ Y + Q E+ YLDPH QP+INI + +
Sbjct: 286 SFPQSLGFIGGKPSSAHYFIASQGEAVYYLDPHTPQPLINISEKE 330
>gi|195539710|gb|AAI68141.1| Atg4d protein [Rattus norvegicus]
Length = 442
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 176/382 (46%), Gaps = 67/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 53 SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 102
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 103 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 161
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 162 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 217
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C + + VS D V D +R S + A+
Sbjct: 218 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 264
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + + + ++H R + +DPS +GFY ++ +F+ C+ +
Sbjct: 325 PHYCQPTVDVNQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 382
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FTV + H +
Sbjct: 383 ILSSSSVTERYPMFTVAEGHAQ 404
>gi|210032083|ref|NP_001094483.2| autophagy-related 4D [Rattus norvegicus]
gi|149020504|gb|EDL78309.1| rCG31864, isoform CRA_b [Rattus norvegicus]
Length = 473
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 176/382 (46%), Gaps = 67/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 248
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R C + + VS D V D +R S + A+
Sbjct: 249 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 295
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + + + ++H R + +DPS +GFY ++ +F+ C+ +
Sbjct: 356 PHYCQPTVDVNQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 413
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FTV + H +
Sbjct: 414 ILSSSSVTERYPMFTVAEGHAQ 435
>gi|348550913|ref|XP_003461275.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like [Cavia
porcellus]
Length = 474
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 179/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSK-LSSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWAEGPGLGSPELPGTASPSPGRSPAR 192
Query: 152 ----RKPLQKP-FDRE--YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P P ++E + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 WVPPRWPRGAPELEQELRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 248
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + +A+YV + A +V D + A+
Sbjct: 249 ---SLVAHILRKAVESSSEVTRLAVYVSQDCTVYKADVAHLVASRDPT----------AE 295
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY ++ +F+ CA ++
Sbjct: 356 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCAELTR 413
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 414 ILSCSSATERYPMFTLAEGHAQ 435
>gi|340383455|ref|XP_003390233.1| PREDICTED: cysteine protease ATG4D-like [Amphimedon queenslandica]
Length = 437
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 167/356 (46%), Gaps = 54/356 (15%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
S+ S + +LG + +D + F F S ++YR GF PI S +T+D
Sbjct: 61 SNNSPVLVLGKLYIPERDTKPQSEGIPRHILMFMDHFYSLPWMTYRCGFSPILSSSLTTD 120
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR-----------KPLQKPFDREYVEILHLFGDS 173
GWGCM+RS QML+A L H LGR WR K ++ V IL FGDS
Sbjct: 121 CGWGCMVRSGQMLLATVLHLHFLGRDWRLSSSDVTGHKIHRQVKNWNNYVVLILSWFGDS 180
Query: 174 ETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIY 230
E+ PFSIH L++A +G G W GP + L R C R + IY
Sbjct: 181 ESELCPFSIHRLMEAAYYHGNKPGDWFGPSQV----SILIRDCVRRALREHINLQKLNIY 236
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW------TPILLLVPLVLGLEKVNPR 284
V S C+V+ K D +L+LVP+ LG E +NP
Sbjct: 237 V--------------------SHDCTVYIKDVQDIFESDLDQSLLVLVPVRLGSESLNPI 276
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
YIP ++ ++GI+GG+P S + +G Q+E+ I+LDPH Q +N+ + D D S
Sbjct: 277 YIPCVKALLALDHTVGIIGGRPKHSVFFIGFQDENLIHLDPHYSQTAVNMTRTDF--DVS 334
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
+YH + I + +DPS +GFYC +DF+ F A K+ FTVT T
Sbjct: 335 SYHCRSPKKIPVTKMDPSCTLGFYCHTLEDFNHFRIEAEKVT--------FTVTPT 382
>gi|452837994|gb|EME39935.1| hypothetical protein DOTSEDRAFT_47435 [Dothistroma septosporum
NZE10]
Length = 442
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 113/354 (31%), Positives = 169/354 (47%), Gaps = 56/354 (15%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 132
+EF +D S+I ++YR F PI S TSD GWGCM+R
Sbjct: 111 SEFLEDVESKIWLTYRNNFPPIPKSSEAAATSAMSFTTKLRNFANKDGFTSDTGWGCMIR 170
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
S Q L+A A+L HRLGR WR+ + +REY +IL LF D+ SP SIH ++ G +A G
Sbjct: 171 SGQSLLANAILIHRLGRDWRRGDK---EREYKDILSLFADTPESPLSIHKFVEHGAQACG 227
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDEDGERGGAPVVCID 249
G W GP A R AL + E GL S P +YV
Sbjct: 228 TYPGEWFGPNATARCIRALTE-KYHEAGLQVYSRPNDSDVYV------------------ 268
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D+ + + P L+++ + LG+EKV P Y L+ QS+GI GG+P +S
Sbjct: 269 DSLMQTAAQKDADDKFQPTLIVLGIRLGIEKVTPAYHAALKAALELSQSVGIAGGRPSSS 328
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Y +G Q ++ YLDPH +P+++ L D ++ H+ +R + + +DPS+ +GF
Sbjct: 329 HYFIGHQGDNFFYLDPHTTRPMLS--PQPLAEDINSCHTRRVRRLGIAEMDPSMLLGFLI 386
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
R KD+F+ + S++ P + H+ +S G V E ++L
Sbjct: 387 RSKDEFEQWRKSISEI-------PGKAIIHIHETEPKYSTGTERAGAVDEVETL 433
>gi|148228573|ref|NP_001085611.1| cysteine protease ATG4A [Xenopus laevis]
gi|61211771|sp|Q6GPU1.1|ATG4A_XENLA RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|49115669|gb|AAH73017.1| MGC82614 protein [Xenopus laevis]
Length = 397
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 163/320 (50%), Gaps = 21/320 (6%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR WR
Sbjct: 45 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALVCQHLGRDWRWE 104
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 105 KHKNHPEEYQQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 164
Query: 215 RAETGLGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSKGQ----- 262
+ +A+Y VV D P C + A+ H S +S+ +
Sbjct: 165 EWNS--------LAVYVSMDNTVVVEDIKTMCKYQPQSCSMAQAASHQSTWSRCRDTSGH 216
Query: 263 -ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G + I
Sbjct: 217 CSGWRPLLLVVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEII 276
Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
YLDPH Q ++ + D + + + + ++DPS+A+GF+C+D++DF+++C
Sbjct: 277 YLDPHTTQTFVDTEEAGTVQDQTYHCQKGPNSMKVLNLDPSVALGFFCKDENDFNNWCEV 336
Query: 382 ASKLAEESNGAPLFTVTQTH 401
K + +F +T H
Sbjct: 337 IEKEILKHQSLRMFELTPKH 356
>gi|195501322|ref|XP_002097748.1| GE26385 [Drosophila yakuba]
gi|194183849|gb|EDW97460.1| GE26385 [Drosophila yakuba]
Length = 706
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 165/316 (52%), Gaps = 19/316 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 287 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 346
Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 347 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 406
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 407 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 466
Query: 245 VVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V A R + K + W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 467 HVPWQQAKRPQAETPKTEQHQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 526
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 527 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 584
Query: 363 LAIGFYCRDKDDFDDF 378
IGFYC K DFD F
Sbjct: 585 CCIGFYCATKSDFDSF 600
>gi|431822417|ref|NP_001258916.1| cysteine protease ATG4A isoform 2 [Gallus gallus]
gi|61211756|sp|Q5ZIW7.1|ATG4A_CHICK RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|53134379|emb|CAG32326.1| hypothetical protein RCJMB04_23b20 [Gallus gallus]
Length = 380
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 171/356 (48%), Gaps = 52/356 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 12 VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 60
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W+ K EY ILH F D + +SIH + Q G
Sbjct: 61 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 120
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 121 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 163
Query: 250 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 164 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 223
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 224 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 283
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + + ++DPS+A+GF+C+++ DFD++C+ K + +F + Q H
Sbjct: 284 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 339
>gi|431822415|ref|NP_001258915.1| cysteine protease ATG4A isoform 1 [Gallus gallus]
Length = 397
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 171/356 (48%), Gaps = 52/356 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W+ K EY ILH F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 181 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 241 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + + ++DPS+A+GF+C+++ DFD++C+ K + +F + Q H
Sbjct: 301 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 356
>gi|291414155|ref|XP_002723329.1| PREDICTED: APG4 autophagy 4 homolog D [Oryctolagus cuniculus]
Length = 408
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 115/396 (29%), Positives = 184/396 (46%), Gaps = 67/396 (16%)
Query: 46 GSMRRIHE---RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
G RR E R SRT S +S + VC + + E GD + F +DF
Sbjct: 4 GGARRPREHGGRWAVKSRTSFSKISS----VHVCGRRYRFEGEGD------IQRFQRDFV 53
Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------- 151
SR+ ++YR+ F P+ +TSD GWGCMLRS QM++AQ+LL H L R W
Sbjct: 54 SRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQSLLLHFLPRDWTWAEGLGSAEP 113
Query: 152 ---------RKPL------------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
R P + +R + +I+ F D +PF +H L++ G++
Sbjct: 114 AGSASPSRYRGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPGAPFGLHRLVELGQSS 173
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G AG W GP +A R + + +YV + A +V D
Sbjct: 174 GKKAGDWYGP-------SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPD 226
Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
+ A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S
Sbjct: 227 PT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRLELCLGIMGGKPRHSL 276
Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
Y +G Q++ +YLDPH QP +++ + D + ++H R + +DPS +GFY
Sbjct: 277 YFIGYQDDFLLYLDPHYCQPTVDVSQTDFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAG 334
Query: 371 DKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 403
+ +F+ C+ +++ S+ P+FT+ + H +
Sbjct: 335 GRKEFETLCSELTRVLGSSSATERYPMFTLAEGHAQ 370
>gi|324506823|gb|ADY42901.1| Cysteine protease ATG4B [Ascaris suum]
Length = 433
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 176/392 (44%), Gaps = 62/392 (15%)
Query: 62 GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI 121
+ S + ++LLG HK A GD + + E+ +SR+ +YRK F PIG +
Sbjct: 19 SVFDSNTPVYLLG--HKFP---ARGDM---DSIKEY---VTSRLWFTYRKNFMPIGGTGP 67
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 181
TSD GWGCMLR QML+AQAL+ LG W + +Y IL +F D + PFS+H
Sbjct: 68 TSDQGWGCMLRCGQMLLAQALIVRHLGTEWMWDRDNK-EEDYKRILRMFQDKKCCPFSLH 126
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALA---RCQRAETGLGCQSLPMAIYVVS----- 233
+ Q G + G W GP + + L R + +L +A V +
Sbjct: 127 QIAQMGVSERKQIGEWFGPNTAAQVLKKLVVYDDWSRLAVHVALDNLLIASDVRTMAHTR 186
Query: 234 ---------------GDEDGERGGAPVVCIDDASRHCSVFS-----------KGQADWTP 267
+E G G +C + + C + S + + W P
Sbjct: 187 PPSRLSSRHTTENEQSEESGNASGGNSLCSFGSVKMCMLQSALMKECDENPVEDEEQWRP 246
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
+L++VPL LGL +N Y+P + F PQ GI+GG+P + Y +G+ E IYLDPH
Sbjct: 247 LLIIVPLRLGLTSINRCYLPAIEAFFQLPQCTGIIGGRPNHALYFIGIAGEQLIYLDPHV 306
Query: 328 VQPVINIG----------------KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
Q I++ K D S+YH + HI DS DPSLA+ F CR
Sbjct: 307 CQAAIDLDERCASLQQQDGFVEVVKSTDIFDDSSYHCPFLLHIAYDSADPSLALSFICRT 366
Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQTHKK 403
+++++ ++ PLF + +T K
Sbjct: 367 EEEYEHLANNLKTKVLPASSPPLFELLETRPK 398
>gi|341885317|gb|EGT41252.1| hypothetical protein CAEBREN_15768 [Caenorhabditis brenneri]
Length = 457
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 111/368 (30%), Positives = 170/368 (46%), Gaps = 60/368 (16%)
Query: 84 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 142
ALG + +G+ + SSR +YRK F PIG + TSD GWGCMLR +QML+ + L
Sbjct: 34 ALGKEITEEDGIEAMKKYMSSRFWFTYRKDFSPIGGTGPTSDQGWGCMLRCAQMLLGEVL 93
Query: 143 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
L +GR + ++ Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 94 LRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIHQIAQMGVTEGKEISKWFGPNT 152
Query: 203 MCR---------SWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 246
+ W +A + L + +L MA S D E+G+
Sbjct: 153 AAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTYPSEDAVKLIMENGQ------- 205
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
+H + + + +W P+LL++PL LGL +N Y+P ++ F PQ +GI+GGKP
Sbjct: 206 ----VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCYLPAIQEFFKLPQCVGIIGGKP 261
Query: 307 GASTYIVGVQEESAIYLDPHDVQPV------------------INIGK-DDLE------- 340
+ Y VG+ YLDPH +P N + +DLE
Sbjct: 262 NLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTESEQHDTNFSELEDLEPLPSQTS 321
Query: 341 -----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 395
D STYH +++ + +SIDPSLA+ +C ++DFD+ C K ++ P+F
Sbjct: 322 DVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESREDFDNLCEELQKTTLPASKPPMF 381
Query: 396 TVTQTHKK 403
+ K
Sbjct: 382 EFLEKRPK 389
>gi|194759168|ref|XP_001961821.1| GF15159 [Drosophila ananassae]
gi|190615518|gb|EDV31042.1| GF15159 [Drosophila ananassae]
Length = 402
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 171/355 (48%), Gaps = 38/355 (10%)
Query: 51 IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYR 110
+ + V G I +D+W+LG + Q+ L +D SR+ +YR
Sbjct: 31 VGQAVGGGESEDIPRRNTDVWVLGKRYNAIQELEL-----------IRRDIQSRLWCTYR 79
Query: 111 KGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W + D Y++I++ F
Sbjct: 80 CGFAPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-PECRDATYLKIVNRF 138
Query: 171 GDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 230
D + S +SIH + G++ A G W+GP + + + L R +A++
Sbjct: 139 EDVKNSCYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVRFD--------DWCSLAVH 190
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
V V +DD C + W P+LL++PL LG+ +NP Y+P L+
Sbjct: 191 VAMDS---------TVVLDDIYSLC----REGDSWKPLLLVIPLRLGITDINPMYVPALK 237
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----DLEADTSTY 346
S G++GG+P + Y +G ++ +YLDPH Q +G+ + E D TY
Sbjct: 238 RCLELDSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGTVGQKTGVGEQEYD-ETY 296
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
H ++ ++DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 297 HQKHAARLNFSAMDPSLAVCFLCKTSDSFESLLTKFRQEVLGLCSPALFEISQTR 351
>gi|170032510|ref|XP_001844124.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167872594|gb|EDS35977.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 628
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 115/345 (33%), Positives = 164/345 (47%), Gaps = 59/345 (17%)
Query: 91 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
+ G+ F +DF SR+ ++YRK F + DS TSD GWGCM+RS QML+AQ L+ H LGR
Sbjct: 188 DEGIEAFKRDFISRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLITHFLGRG 247
Query: 151 WR-----KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSW 197
WR + L+ FD E I+ FGD S TSPFSIH L+ GK G G W
Sbjct: 248 WRWDPSQEGLRLNFDSLQYEDGIHRKIIRWFGDTSSRTSPFSIHTLVALGKEAGKKPGDW 307
Query: 198 VGPYAMCRSWEALARCQRAET----GLGCQ-SLPMAIYVVSGDEDGERGGAPVV------ 246
GP ++ + E G+ + A+Y+ ++ P V
Sbjct: 308 YGPGSVAHLLRQAVKLAAKEITDLDGINVYVAQDCAVYIQDILDECTVSTTPSVAPWQKK 367
Query: 247 ------CIDDASR------------------HCSVF---------SKGQADWTPILLLVP 273
C D S+ H + F S + W ++LLVP
Sbjct: 368 MSSAAACTDSPSQATTPRVGATASCSSSSSPHATGFVAPSDTADESAPGSHWKSLILLVP 427
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L LG EK+NP Y L+ + +GI+GG+P S + VG QE+ I+LDPH Q +++
Sbjct: 428 LRLGTEKLNPIYNDCLKAMLSLDNCIGIIGGRPKHSLFFVGYQEDKLIHLDPHYCQDMVD 487
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+ +++ S++H R + L +DPS IGFYC + DF F
Sbjct: 488 VNQENFPV--SSFHCKSPRKMKLSKMDPSCCIGFYCATRKDFFKF 530
>gi|326925485|ref|XP_003208945.1| PREDICTED: cysteine protease ATG4C-like [Meleagris gallopavo]
Length = 458
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 117/403 (29%), Positives = 173/403 (42%), Gaps = 79/403 (19%)
Query: 65 SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 113
S S ++LLG C+ DE+ L N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155
Query: 152 -------------RKPLQKPFDREYV-----------EILH-----LFGDSETSPFSIHN 182
R+P +E + E+ H FGDS + F +H
Sbjct: 156 KLTASLEASLTAEREPRILSNHQERIRRNCGDGEMRDEVYHRKIISWFGDSPLAAFGLHQ 215
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
L++ GK G AG W GP + R G + +YV
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQ--------D 262
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V D R CS G+ D +++LVP+ LG E+ N Y+ ++ + +GI+
Sbjct: 263 CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGII 322
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DPS
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDPS 380
Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
IGFYCR DF+ +K+ + S+ PLFT + H +
Sbjct: 381 CTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 423
>gi|301772016|ref|XP_002921445.1| PREDICTED: cysteine protease ATG4D-like [Ailuropoda melanoleuca]
Length = 445
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 179/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 55 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 104
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 105 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSAPSPSEPSGLASPNRYRGPAR 164
Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 165 WMPPRWAQGTPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 220
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 221 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 267
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 268 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 327
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 328 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSELTR 385
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 386 VLSSSSATERYPMFTLAEGHAQ 407
>gi|398389911|ref|XP_003848416.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
gi|339468291|gb|EGP83392.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
Length = 440
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 156/307 (50%), Gaps = 45/307 (14%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDS----------------------KITSDVGWGCMLR 132
++F DF SR+ ++YR F PI + TSD GWGCM+R
Sbjct: 109 SQFLDDFESRVWMTYRNNFPPIQKASDPAATSNMSFATKLRSLANQGNFTSDTGWGCMIR 168
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
S Q L+A ++ RLGR WR+ + ++++ EIL +F D+ +PFSIH ++ G A G
Sbjct: 169 SGQSLLANTVVMLRLGRDWRRGQK---EKQHHEILSMFADTPEAPFSIHKFVEHGASACG 225
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A ARC RA T + + +Y D D V ID
Sbjct: 226 TYPGEWFGP-------SATARCIRALTE-KYHDVGLRVYARPNDSD--------VYIDTL 269
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ + S + ++P L+++ + LG+EKV P Y L+ PQS+GI GG+P +S Y
Sbjct: 270 TATTTQHSASET-FSPTLIVLGVRLGIEKVTPAYHAALKSILELPQSVGIAGGRPSSSHY 328
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
VG Q + YLDPH +P++ D + H+ IR + + +DPS+ +GF RD
Sbjct: 329 FVGHQGDHFFYLDPHTTRPMLTAQP--TAEDVESCHTRRIRRLSIAEMDPSMLLGFLVRD 386
Query: 372 KDDFDDF 378
K+DF+D+
Sbjct: 387 KEDFEDW 393
>gi|281337397|gb|EFB12981.1| hypothetical protein PANDA_010312 [Ailuropoda melanoleuca]
Length = 428
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 179/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 38 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 87
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 88 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSAPSPSEPSGLASPNRYRGPAR 147
Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 148 WMPPRWAQGTPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 203
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 204 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 250
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 251 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 310
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 311 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSELTR 368
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 369 VLSSSSATERYPMFTLAEGHAQ 390
>gi|410950450|ref|XP_003981918.1| PREDICTED: cysteine protease ATG4D, partial [Felis catus]
Length = 423
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 180/382 (47%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E+ GD + F +DF SR+ ++YR+ F P+
Sbjct: 33 SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 82
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 83 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSEASGLGPSEPSGLASPNRYRGPAR 142
Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 143 WMPPRWAQGTPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 198
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 199 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 245
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 246 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 305
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 306 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 363
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 364 VLSCSSATERYPMFTLAEGHAQ 385
>gi|57101974|ref|XP_542069.1| PREDICTED: cysteine protease ATG4D isoform 1 [Canis lupus
familiaris]
Length = 473
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 181/384 (47%), Gaps = 70/384 (18%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 154
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGPGLGPSEPAGLASPNRYRGPAR 192
Query: 155 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 248
Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 262
+A R + + +YV + A +V D +
Sbjct: 249 -----SLVAHILRKAVESCSEITRLVVYVSQDCTVYKADVARLVARPDPT---------- 293
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 294 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 353
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 354 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSEL 411
Query: 383 SKLAEESNGA---PLFTVTQTHKK 403
+++ S+ P+FT+ + H +
Sbjct: 412 TRVLSSSSATERYPMFTLAEGHAQ 435
>gi|322785465|gb|EFZ12136.1| hypothetical protein SINV_15051 [Solenopsis invicta]
Length = 505
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 119/372 (31%), Positives = 174/372 (46%), Gaps = 66/372 (17%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
S S +WLLG C+ + L A+ N + EF +DF SR
Sbjct: 80 SKESPVWLLGQCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 139
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR-PWR-KPLQKPFDRE 162
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P Q +
Sbjct: 140 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRGQWRWRPEQLTDESS 199
Query: 163 YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRA 216
+ I+ FGD T SPFSIH L+ G + G AG W GP + +C++ E RA
Sbjct: 200 HRMIIKWFGDQLTPESPFSIHKLVVLGASTGKRAGDWYGPSSVAHLLCQAME------RA 253
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
+ +A+YV + V C D R ++LLVPL L
Sbjct: 254 SEDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGRRKA------------LILLVPLRL 301
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ------- 329
G +K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q
Sbjct: 302 GADKLNPVYAPCLTALLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQNEFYFRI 361
Query: 330 --------PVINIGKD-DLEAD----TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 376
P + I + D+E + +++H R + L +DPS +GFY DK+
Sbjct: 362 LLSITDSLPYLFIQETVDVEGNEKFPLTSFHCTSPRKMLLSKMDPSCCVGFYFPDKESLT 421
Query: 377 DFCARASKLAEE 388
DF ++ +
Sbjct: 422 DFMETIQRIKNK 433
>gi|395850895|ref|XP_003798008.1| PREDICTED: cysteine protease ATG4D [Otolemur garnettii]
Length = 471
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 104/349 (29%), Positives = 165/349 (47%), Gaps = 54/349 (15%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
G + F +DF SR+ +YR+ F P+ +TSD GWGCMLRS QM++AQ LL H L R
Sbjct: 104 GEGDIQRFQRDFVSRLWFTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 163
Query: 150 PW--------------------RKPLQKPFDR------------EYVEILHLFGDSETSP 177
W R P + R ++ +I+ F D +P
Sbjct: 164 DWTWAEGRGLGPPELLASPSQYRVPARWMPPRWAQGTPELEQEHQHRQIVSWFADHPQAP 223
Query: 178 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
FS+H L++ G++ G AG W GP +A R + + +YV
Sbjct: 224 FSLHRLVELGQSLGKKAGDWYGP-------SVVAHILRKAVESCSEVTHLVVYVSQDCTV 276
Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
+ A +V D + A+W +++LVP+ LG E +NP Y+P ++
Sbjct: 277 YKADVARLVARPDPT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSEL 326
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
LGI+GGKP S Y +G Q++ +YLDPH QP ++I + D + ++H R +
Sbjct: 327 CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDISQADFPLE--SFHCTAPRKMAFT 384
Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 403
+DPS +GFY K +F+ C+ +++ S+ P+FT+ + H +
Sbjct: 385 KMDPSCTVGFYAGGKKEFETLCSELTRVLSSSSAMERYPMFTLAEGHAQ 433
>gi|344282757|ref|XP_003413139.1| PREDICTED: cysteine protease ATG4D-like [Loxodonta africana]
Length = 473
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 181/382 (47%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S ++L G ++ E+ GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSK-ISSVYLCGHRYRF---ESEGD------IQRFQRDFMSRLWLTYRRDFPPLAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPLQ 156
+TSD GWGCMLRS QML+AQ LL H L R W R P +
Sbjct: 133 GCLTSDCGWGCMLRSGQMLLAQGLLLHFLPRDWTWAEGSGLGPPELSGSASPSRYRGPAR 192
Query: 157 K----------PFDREYV--EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ ++E+ +I+ F D +PF +H L+ G++ G AG W GP
Sbjct: 193 RVPPHWAQCTPELEQEHWHRQIVSWFADHPQAPFGLHRLVALGQSSGKKAGDWYGP---- 248
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D +A+
Sbjct: 249 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDP----------KAE 295
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 356 PHYCQPSVDVSQADFSLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTR 413
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 414 VLSSSSATERYPMFTLAEGHAQ 435
>gi|449508713|ref|XP_002198788.2| PREDICTED: cysteine protease ATG4C [Taeniopygia guttata]
Length = 456
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 117/402 (29%), Positives = 179/402 (44%), Gaps = 79/402 (19%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 112
S S ++LLG C+ +E+ G+ + G+N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVFLLGKCYHFKTEES-GELSTDGSNFDKISTEISGNVEEFRKDFISRIWLTYREE 94
Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 151
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 95 FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPEALDMESCDWESWTSSTV 154
Query: 152 ---------------------RKPLQKPFD----REYV---EILHLFGDSETSPFSIHNL 183
R P ++ +D R V +I+ FGDS + F +H L
Sbjct: 155 RKLTASLEASLTAERDPKVLARPPARRDWDGTEKRNEVYHRKIISWFGDSPLAAFGLHQL 214
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
++ GK G AG W GP + R G + +YV
Sbjct: 215 IEYGKKSGKMAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD--------C 261
Query: 244 PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
V D R CS+ G+A +++L P+ LG E+ N Y+ ++ + +GI+G
Sbjct: 262 TVYSSDVIDRQCSLVDSGKAGTKAVIILFPVRLGGERTNTDYLEFVKGILSLEYCVGIIG 321
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
GKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DPS
Sbjct: 322 GKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDPSC 379
Query: 364 AIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
IGFYCR DF+ +K+ + S+ PLFT + H +
Sbjct: 380 TIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 421
>gi|195051960|ref|XP_001993206.1| GH13687 [Drosophila grimshawi]
gi|193900265|gb|EDV99131.1| GH13687 [Drosophila grimshawi]
Length = 393
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 106/364 (29%), Positives = 177/364 (48%), Gaps = 39/364 (10%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +++WLLG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNANVWLLGKRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFVPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 180
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D+ S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTPDCRDTTYLKIVNRFEDTRKSFYSI 148
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + G++ A G W+GP + + + L R SL + + + S
Sbjct: 149 HQIALMGESQNKAVGEWLGPNTVAQILKILVRFD------DWSSLNVHVAMDS------- 195
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V +DD C ++ W P+LL+VPL LG+ +NP Y+P L+ S G
Sbjct: 196 ----TVVLDDIFTLCQ--EPSESAWKPLLLIVPLRLGISDINPIYVPALKRCLELNSSCG 249
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 357
++GG+P + Y +G ++ +YLDPH Q + + A+ +YH +
Sbjct: 250 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGAVAQKTTAAEQELDESYHQKYAARLSFA 309
Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGV 417
++DPSLA+ F C+ +D F++ + + LF ++Q+ + +D + E +
Sbjct: 310 AMDPSLAVCFLCKTRDSFNELLQQLRQEVLSLCTPALFEISQSRAVDWDTADDI-EWPAM 368
Query: 418 PEDD 421
P+ D
Sbjct: 369 PDID 372
>gi|395512609|ref|XP_003760528.1| PREDICTED: cysteine protease ATG4D [Sarcophilus harrisii]
Length = 453
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 168/361 (46%), Gaps = 56/361 (15%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
G + F +DF SR+ ++YR+ F P+ +TSD GWGCMLRS QML+AQ LL H R
Sbjct: 84 GEGDIQRFQRDFVSRLWLTYRRDFPPLEGGSLTSDCGWGCMLRSGQMLLAQGLLLHFFSR 143
Query: 150 PW-----------RKP---------------------LQKPFDRE--YVEILHLFGDSET 175
W R+P + F++E + I+ F D
Sbjct: 144 DWTWSEAVLHPGPREPELLRTMSPSRVGPPGPPAGALSPREFEQEEQHRRIVSWFADQPG 203
Query: 176 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
+PF +H L++ G++ G AG W GP +A R + + +YV
Sbjct: 204 APFGLHRLVELGRSSGKRAGDWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDC 256
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+ A +V D S +W I++LVP+ LG E +NP Y+P ++
Sbjct: 257 TVYKADVAQLVAQPDPS----------TEWKSIVILVPVRLGGETLNPVYVPCVKELLRL 306
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
+GI+GGKP S Y +G Q++ +YLDPH QP ++ ++ + ++H R +
Sbjct: 307 ELCIGIIGGKPRHSLYFIGYQDDFLLYLDPHYCQPFVDTSQESFPLE--SFHCTSPRKMA 364
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHSDVLG 412
+DPS IGFY ++ +F+ C +++ S+ P+FT+++ H + +V
Sbjct: 365 FSRMDPSCTIGFYAGNRKEFELLCLELTRVLNSSSATERYPMFTLSEGHAQEYGLEEVCS 424
Query: 413 E 413
+
Sbjct: 425 Q 425
>gi|410226434|gb|JAA10436.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410263516|gb|JAA19724.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410294648|gb|JAA25924.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410328737|gb|JAA33315.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
Length = 474
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436
>gi|402904208|ref|XP_003914939.1| PREDICTED: cysteine protease ATG4D isoform 2 [Papio anubis]
Length = 411
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 130
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ 373
>gi|402904206|ref|XP_003914938.1| PREDICTED: cysteine protease ATG4D isoform 1 [Papio anubis]
Length = 474
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436
>gi|397476492|ref|XP_003809633.1| PREDICTED: cysteine protease ATG4D isoform 2 [Pan paniscus]
Length = 411
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 352 VLSSSSAMERYPMFTLAEGHAQ 373
>gi|296232881|ref|XP_002761778.1| PREDICTED: cysteine protease ATG4D [Callithrix jacchus]
Length = 474
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPF---------- 159
+TSD GWGCMLRS QM++AQ LL H L R W L P
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGPASPSWYHGPAR 193
Query: 160 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPCWAQGAPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D S A+
Sbjct: 250 ---SLVAHILRKAVESSSEVTRLLVYVSQDCTVYKADVARLVARPDPS----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WNSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + + + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S P+FT+ + H +
Sbjct: 415 VLSSSAATERYPMFTLAEGHAQ 436
>gi|114675367|ref|XP_512373.2| PREDICTED: cysteine protease ATG4D [Pan troglodytes]
Length = 411
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ 373
>gi|397476490|ref|XP_003809632.1| PREDICTED: cysteine protease ATG4D isoform 1 [Pan paniscus]
Length = 474
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 415 VLSSSSAMERYPMFTLAEGHAQ 436
>gi|327277326|ref|XP_003223416.1| PREDICTED: cysteine protease ATG4A-like [Anolis carolinensis]
Length = 385
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 108/347 (31%), Positives = 171/347 (49%), Gaps = 35/347 (10%)
Query: 71 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
W+LG H++ +++ + D S+R+ +YR+ F PIG + +SD GWGCM
Sbjct: 17 WILGRQHQLKTEKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 65
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
LR QM++AQAL+ LGR W K EY IL F D + +SIH + Q G
Sbjct: 66 LRCGQMMLAQALICRHLGRDWHWEEHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGE 125
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGER------ 240
G + G W GP + + + LA + +A+YV + ED ++
Sbjct: 126 GKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCRLPN 177
Query: 241 GGAPVVCIDDASRHCSVFSKGQAD------WTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
P V H S+ S+ ++ W P+LL++PL LG+ +NP Y+ + F
Sbjct: 178 QNCPPVAHCSPLSHQSLLSRNRSPGGFCCGWKPLLLIIPLRLGINHINPVYVDAFKECFK 237
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S + +
Sbjct: 238 MPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQLFVDSEENSTVDDRSFHCQQAPHRM 297
Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ ++DPS+A+GF+C+++ DFD +C+ K + +F + Q H
Sbjct: 298 KIMNLDPSVALGFFCKEEKDFDTWCSLVQKEIHKQQSLRMFELIQKH 344
>gi|417401539|gb|JAA47652.1| Putative cysteine protease required for autophagy [Desmodus
rotundus]
Length = 473
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 180/384 (46%), Gaps = 70/384 (18%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + E+ GD + F +DF SR+ ++YR+ F P
Sbjct: 83 SRTRFSKISS----VHLCGRRYCFESEGD------IQRFQRDFVSRLWLTYRRDFPPFAG 132
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWARGASLSPPEPSGLASSNRYRGPAH 192
Query: 152 --------RKP-LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
R P L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 CMTPCWAQRAPELEQ--ERRHRQIVSWFADHPQAPFGLHQLVELGQSSGKKAGDWYGP-- 248
Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 262
+A R + + +YV + A +V D +
Sbjct: 249 -----SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT---------- 293
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 294 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 353
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 354 LDPHYCQPAVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 411
Query: 383 SKLAEESNGA---PLFTVTQTHKK 403
+++ S+ P+FT+ + H +
Sbjct: 412 TRVLSSSSTTERYPMFTLAEGHAQ 435
>gi|157126425|ref|XP_001660889.1| hypothetical protein AaeL_AAEL010516 [Aedes aegypti]
gi|108873276|gb|EAT37501.1| AAEL010516-PA [Aedes aegypti]
Length = 583
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/335 (33%), Positives = 155/335 (46%), Gaps = 67/335 (20%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
+ F +DF +R+ ++YRK F + DS TSD GWGCM+RS QML+AQ LL H LGR WR
Sbjct: 167 IEAFKRDFVTRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLLVHFLGRNWRW 226
Query: 153 ----KPLQKPF------DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 200
+ L+ + D + +I+ FGD S TSPFSIH L+ GK G G W GP
Sbjct: 227 DATAESLRMNYHSLNYEDNVHRKIIRWFGDTSSRTSPFSIHTLVALGKETGKKPGDWYGP 286
Query: 201 YAMCRSWEALARCQRAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCIDDASRHCS 256
++A R L Q + + +YV V I D C+
Sbjct: 287 -------GSVAHLLRQAVKLAAQEISDLDGVNVYVAQDC---------AVYIQDIIDECT 330
Query: 257 VFS---------------------------------KGQADWTPILLLVPLVLGLEKVNP 283
V + W ++LLVPL LG EK+NP
Sbjct: 331 VSAGPTLAPWQKKSPGSSSSSTTSTSNSNPTTSSSTDSTDHWKSLILLVPLRLGAEKLNP 390
Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 343
Y L+ + +GI+GG+P S Y VG QE+ I+LDPH Q ++++ +
Sbjct: 391 IYSDCLKAMLSLDNCIGIIGGRPKHSLYFVGFQEDKLIHLDPHYCQDMVDVVNQE-NFPV 449
Query: 344 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+++H R + L +DPS IGFYC + DF F
Sbjct: 450 ASFHCKSPRKMKLSKMDPSCCIGFYCETRKDFFKF 484
>gi|195158262|ref|XP_002020011.1| GL13755 [Drosophila persimilis]
gi|194116780|gb|EDW38823.1| GL13755 [Drosophila persimilis]
Length = 678
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 114/318 (35%), Positives = 166/318 (52%), Gaps = 21/318 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310
Query: 140 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR L + D + +I+ FGD S++SPFSIH L++ G+ G
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430
Query: 245 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V A R + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG QE+ I+LDPH Q +++I ++ ++H R + + +D
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQEHF--SLHSFHCKSARKLKVSKMD 548
Query: 361 PSLAIGFYCRDKDDFDDF 378
PS IGFYC K DFD F
Sbjct: 549 PSCCIGFYCATKTDFDSF 566
>gi|326924562|ref|XP_003208495.1| PREDICTED: cysteine protease ATG4A-like, partial [Meleagris
gallopavo]
Length = 421
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 170/356 (47%), Gaps = 52/356 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGRRHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 161
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 250 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 205 DIKKMCWSPPQSSSTAHSSAHLHRSALGRNRNTAGLCTGWKPLLLIIPLRLGINHINPVY 264
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 265 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 324
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + + ++DPS+A+GF+C+++ DFD++C+ K + +F + Q H
Sbjct: 325 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLQMFELVQKH 380
>gi|390177147|ref|XP_001357920.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
gi|388858923|gb|EAL27056.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
Length = 676
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 114/318 (35%), Positives = 166/318 (52%), Gaps = 21/318 (6%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310
Query: 140 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 192
Q L+ H LGR WR L + D + +I+ FGD S++SPFSIH L++ G+ G
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370
Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430
Query: 245 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V A R + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG QE+ I+LDPH Q +++I ++ ++H R + + +D
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQEHF--SLHSFHCKSARKLKVSKMD 548
Query: 361 PSLAIGFYCRDKDDFDDF 378
PS IGFYC K DFD F
Sbjct: 549 PSCCIGFYCATKTDFDSF 566
>gi|332266032|ref|XP_003282019.1| PREDICTED: cysteine protease ATG4B [Nomascus leucogenys]
Length = 518
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 172/367 (46%), Gaps = 42/367 (11%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 145 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 191
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 192 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 251
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 252 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 303
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 304 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 363
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 364 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 423
Query: 347 HSDVIRHIHLDSIDPSLAI--GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
+ + +DPS+A+ G + + + C +L+ P+F + +
Sbjct: 424 CQHPPCRMSIAELDPSIAVVRGGHRSTQAFCAECCLGMKQLSLLGGALPMFELVEQQPSH 483
Query: 405 VNHSDVL 411
+ DVL
Sbjct: 484 LACPDVL 490
>gi|380796527|gb|AFE70139.1| cysteine protease ATG4D, partial [Macaca mulatta]
Length = 439
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 49 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 98
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 99 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 158
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 159 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 214
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 215 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 261
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 262 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 321
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 322 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 379
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 380 VLGSSSATERYPMFTLAEGHAQ 401
>gi|109123366|ref|XP_001101860.1| PREDICTED: cysteine protease ATG4D-like isoform 1 [Macaca mulatta]
Length = 474
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 415 VLGSSSATERYPMFTLAEGHAQ 436
>gi|432099562|gb|ELK28703.1| Cysteine protease ATG4D, partial [Myotis davidii]
Length = 392
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 114/392 (29%), Positives = 187/392 (47%), Gaps = 69/392 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + E+ GD + F +DF+SR+ ++YR+ F P+
Sbjct: 5 SRTSFSKISS----VHLCGRRYCFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 54
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 55 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGAGLSPPEPSGLASPNRHHGLAH 114
Query: 153 -KPLQ-----KPFDREYV--EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
KP + ++E+ +I+ F D +PF +H L++ G+++G AG W GP
Sbjct: 115 WKPPRWAQGAPELEQEHWHRQIVSWFADHPQAPFGLHQLVELGQSWGKKAGDWYGP---- 170
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 171 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDCT----------AE 217
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++S +YLD
Sbjct: 218 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDSLLYLD 277
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + + ++H R + +DPS +GFY ++ +F+ C+ ++
Sbjct: 278 PHYCQPTVDVSQAGFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGNRKEFETLCSELTR 335
Query: 385 LAEESNGA---PLFTVTQTHKKPVNHS-DVLG 412
+ S P+FT+ + H + +HS D LG
Sbjct: 336 VLSSSAATQRYPMFTLAEGHAQ--DHSLDNLG 365
>gi|297276108|ref|XP_002801111.1| PREDICTED: cysteine protease ATG4D-like isoform 2 [Macaca mulatta]
Length = 497
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 107 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 156
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 157 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 216
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 217 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 272
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 273 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 319
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 320 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 379
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 380 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 437
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 438 VLGSSSATERYPMFTLAEGHAQ 459
>gi|351710014|gb|EHB12933.1| Cysteine protease ATG4D [Heterocephalus glaber]
Length = 607
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 216 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 265
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 154
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 266 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWIEGPGLAHPELPGSASSSQGRGPAR 325
Query: 155 ----------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
L++ + + +I+ F D +P +H L++ G++ G AG W GP
Sbjct: 326 WMPPSCPWGALEREQELRHRQIVSWFADHPRAPLGLHRLVELGQSSGKKAGDWYGP---- 381
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + +A+YV + A +V D + A+
Sbjct: 382 ---SLVAHILRKAVESSSELTHLAVYVSQDCTVYKADVAHLVASPDPA----------AE 428
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 429 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 488
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY ++ + + C+ ++
Sbjct: 489 PHYCQPTVDVSQADFSLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKELETLCSELTR 546
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 547 ILSSSSATERYPMFTLVEGHAQ 568
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 39/93 (41%), Positives = 52/93 (55%), Gaps = 10/93 (10%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 130 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 179
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 180 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGW 212
>gi|327306465|ref|XP_003237924.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
gi|326460922|gb|EGD86375.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
Length = 454
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 160/318 (50%), Gaps = 58/318 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 131
+F DF S++ I+YR F PI GDS I TSD GWGCM+
Sbjct: 119 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSISLGVRLRSQLIDTQGFTSDTGWGCMI 178
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G A
Sbjct: 179 RSGQALLANTLLFIRLGRDWRRGSKL---QEESELVSLFADHPRAPFSIHRFVHHGATAC 235
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 249
G G W GP A + +AL + + GL +Y+ S G + E+ V C +
Sbjct: 236 GKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVACDE 287
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
P L+L+ + LG+++V P Y +L+ FPQS+GI GG+P +S
Sbjct: 288 SGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGRPSSS 335
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHLDSID 360
Y + Q +S YLDPH +P + + D E+ + STYH+ +R +H+ +D
Sbjct: 336 HYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHIREMD 395
Query: 361 PSLAIGFYCRDKDDFDDF 378
PS+ IGF RD+DD++D
Sbjct: 396 PSMLIGFLVRDEDDWEDL 413
>gi|194378178|dbj|BAG57839.1| unnamed protein product [Homo sapiens]
Length = 411
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRRMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ 373
>gi|27903825|ref|NP_116274.3| cysteine protease ATG4D [Homo sapiens]
gi|61211809|sp|Q86TL0.1|ATG4D_HUMAN RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
cysteine endopeptidase; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related cysteine endopeptidase
4; AltName: Full=Autophagy-related protein 4 homolog D
gi|27763975|emb|CAC85951.1| APG4-D protein [Homo sapiens]
gi|46362497|gb|AAH68992.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [Homo sapiens]
gi|119604524|gb|EAW84118.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_c
[Homo sapiens]
gi|312151144|gb|ADQ32084.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
construct]
Length = 474
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436
>gi|62898327|dbj|BAD97103.1| APG4 autophagy 4 homolog D variant [Homo sapiens]
Length = 474
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WMSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436
>gi|449273759|gb|EMC83168.1| Cysteine protease ATG4A, partial [Columba livia]
Length = 395
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 168/356 (47%), Gaps = 52/356 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178
Query: 250 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 285
D + C +G W P+LL++PL LG+ +NP Y
Sbjct: 179 DIKKMCWSPPQGSGAAHSSAHLHRSALGRTKNAAGFCTGWKPLLLIIPLRLGINHINPVY 238
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 239 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDESF 298
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + + ++DPS+A+GF+C+++ DFD++C+ K + +F + Q H
Sbjct: 299 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 354
>gi|453080987|gb|EMF09037.1| putative cysteine protease atg4 [Mycosphaerella populorum SO2202]
Length = 447
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 158/332 (47%), Gaps = 49/332 (14%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 132
++F DF SRI I+YR GF PI S TSD GWGCM+R
Sbjct: 110 SDFIDDFESRIWITYRDGFPPIAKSTDPAAGSKMSFTTKLRSLTNQQGFTSDTGWGCMIR 169
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
S Q L+A +L HRLGR WRK ++ E+ IL LF D+ +PFSIH ++ G +A G
Sbjct: 170 SGQSLLANTILLHRLGRDWRKGQKQ---EEHKNILSLFADTPEAPFSIHKFVEHGAQACG 226
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A ARC RA T + +Y D D DA
Sbjct: 227 TYPGEWFGP-------NATARCLRALTD-KYHGAGLRVYARPNDSD---------VYADA 269
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ + P L+++ + LG+EKV Y L+ PQS+GI GG+P +S Y
Sbjct: 270 LIETATQKDADDKFQPTLIVLGIRLGIEKVTSAYHVALKAALELPQSVGIAGGRPSSSHY 329
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
+G Q +S YLDPH + +++ D T H+ IR + L +DPS+ +GF R
Sbjct: 330 FLGHQGDSFFYLDPHTTRHMLSPQPS--AEDIETCHTRRIRKLPLSEMDPSMLLGFLVRS 387
Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQTHKK 403
+++F+++ K E G + + +T K
Sbjct: 388 QEEFEEW----RKAVLEMPGKAIIHIHETEPK 415
>gi|350631770|gb|EHA20141.1| hypothetical protein ASPNIDRAFT_178675 [Aspergillus niger ATCC
1015]
Length = 384
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 121/378 (32%), Positives = 180/378 (47%), Gaps = 54/378 (14%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 108
+RI + + P S IW LG+ + +D A + F DF SRI ++
Sbjct: 11 KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKD-ANTRETQHAWPESFLLDFESRIWMT 69
Query: 109 YRKGFDPI----GDSK-------------------ITSDVGWGCMLRSSQMLVAQALLFH 145
YR F PI GD K TSD GWGCM+RS Q L+A AL
Sbjct: 70 YRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTGWGCMIRSGQSLLANALSML 129
Query: 146 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMC 204
LGR WR+ + F+ E ++L LF D+ T+PFS+H ++ G ++ G G W GP A
Sbjct: 130 VLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKYPGEWFGPSATA 186
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+ EAL+ C + + +YV + + + D +R+ S
Sbjct: 187 KCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK-----FMDIARNTS------GA 227
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
+ P L+L+ LG++ + P Y L+ FPQS+GI GG+P AS Y VG Q YLD
Sbjct: 228 FQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGRPSASHYFVGAQGSHLFYLD 287
Query: 325 PHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
PH +P + G+ + + TYH+ +R IH+ +DPS+ IGF R+++D+ D+ R
Sbjct: 288 PHYTRPALPDRQEGELYSKEEVDTYHTRRLRRIHVRDMDPSMLIGFLIRNQEDWADWLKR 347
Query: 382 ASKLAEESNGAPLFTVTQ 399
E G P+ V +
Sbjct: 348 ----IEAVKGRPIIHVLK 361
>gi|449498615|ref|XP_002197397.2| PREDICTED: cysteine protease ATG4A [Taeniopygia guttata]
Length = 412
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 169/356 (47%), Gaps = 52/356 (14%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHC------------------SVFSKGQAD------WTPILLLVPLVLGLEKVNPRY 285
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 181 DIKKMCWSPAQSSSVAHSSAHVHRSALGQNKNTAGLCPGWKPLLLIIPLRLGINHINPVY 240
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 241 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDKSF 300
Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ + + ++DPS+A+GF+C+++ DFD++C+ K + +F + Q H
Sbjct: 301 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 356
>gi|334326299|ref|XP_001366933.2| PREDICTED: cysteine protease ATG4D [Monodelphis domestica]
Length = 482
Score = 171 bits (433), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 105/398 (26%), Positives = 176/398 (44%), Gaps = 77/398 (19%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S + + +C + Q E GD + F +DF+SR+ ++YR+ F P+ +TSD
Sbjct: 79 TSFSKLSTVHLCGRRYQFEGEGD------IQRFQKDFASRLWLTYRRDFPPLDGGSLTSD 132
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------------- 152
GWGCMLRS QML+AQ LL H R W
Sbjct: 133 CGWGCMLRSGQMLLAQGLLLHFFSRDWTWAEAVLPPSPRESELFRSMSPSRSGASWQRGS 192
Query: 153 -----------------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
P Q + ++ I+ F D +PF +H L++ G++ G AG
Sbjct: 193 STASGLGRATWSTGGTLSPRQLEQEEQHRRIVSWFADQPGAPFGLHRLVELGRSSGKRAG 252
Query: 196 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 255
W GP +A R + + +YV + A ++ D S
Sbjct: 253 DWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDCTVYKADVAQLMAQPDPS--- 302
Query: 256 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
+W +++LVP+ LG E +NP Y+P ++ +GI+GGKP S Y +G
Sbjct: 303 -------TEWKSVIILVPVRLGGETLNPVYVPCVKELLRLDLCIGIIGGKPRHSLYFIGY 355
Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
Q++ +YLDPH QP ++ ++ + ++H R + +DPS IGFY ++ +F
Sbjct: 356 QDDFLLYLDPHYCQPCVDTSQERFPLE--SFHCTSPRKMAFSRMDPSCTIGFYAGNRKEF 413
Query: 376 DDFCARASKLAEESNGA---PLFTVTQTHKKPVNHSDV 410
+ C +++ S+ P+FT+++ H + + +V
Sbjct: 414 EMLCLELTRVLNSSSATERYPMFTLSEGHAQEYSLEEV 451
>gi|145245643|ref|XP_001395089.1| cysteine protease atg4 [Aspergillus niger CBS 513.88]
gi|166990612|sp|A2QY50.1|ATG4_ASPNC RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|134079795|emb|CAK40930.1| unnamed protein product [Aspergillus niger]
Length = 404
Score = 171 bits (433), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 122/397 (30%), Positives = 181/397 (45%), Gaps = 72/397 (18%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 96
+RI + + P S IW LG+ + +D + N E
Sbjct: 11 KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKDANTRETPDKNNTRENVMGTTNYRKPS 70
Query: 97 -------FNQDFSSRILISYRKGFDPI----GDSK-------------------ITSDVG 126
F DF SRI ++YR F PI GD K TSD G
Sbjct: 71 EHAWPESFLLDFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTG 130
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+RS Q L+A AL LGR WR+ + F+ E ++L LF D+ T+PFS+H ++
Sbjct: 131 WGCMIRSGQSLLANALSMLVLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKH 187
Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G ++ G G W GP A + EAL+ C + + +YV + + +
Sbjct: 188 GAESCGKYPGEWFGPSATAKCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK--- 236
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
D +R+ S + P L+L+ LG++ + P Y L+ FPQS+GI GG+
Sbjct: 237 --FMDIARNTS------GAFQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGR 288
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
P AS Y VG Q YLDPH +P + G+ + + TYH+ +R IH+ +DPS
Sbjct: 289 PSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVDTYHTRRLRRIHVRDMDPS 348
Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
+ IGF R+++D+ D+ R E G P+ V +
Sbjct: 349 MLIGFLIRNQEDWADWLKR----IEAVKGRPIIHVLK 381
>gi|334350077|ref|XP_001376474.2| PREDICTED: cysteine protease ATG4A-like [Monodelphis domestica]
Length = 417
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 182/373 (48%), Gaps = 25/373 (6%)
Query: 39 VKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 98
V V G R I GP + + +W+LG + + A ++
Sbjct: 19 VTLCVFPGVKRHITILSDGPEE--LPETDEPVWILGKQYDLQ--------AVITEKSKLL 68
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
D S+R+ +YR+ F PIG + +SD GWGCMLR QM++AQAL+ LGR W +Q+
Sbjct: 69 SDISARLWFTYRRKFSPIGGTGPSSDSGWGCMLRCGQMMLAQALICKHLGRDWCWEMQQE 128
Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEA 209
EY IL F D + +SIH + Q G G + G W GP A+ W +
Sbjct: 129 QPEEYHRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNS 188
Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 269
LA + + + + ++ + +P +D S H S G W P+L
Sbjct: 189 LAVYVSMDNTVVIEDIKKLCHMCPSHLTHDSSPSPGNGLDQ-STHLPEPSPG---WKPLL 244
Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
L++PL LG+ ++NP YI + F PQSLG +GGKP ++ Y +G IYLDPH Q
Sbjct: 245 LIIPLRLGINQINPVYIDAFKECFKMPQSLGALGGKPNSAYYFIGFLGNELIYLDPHTTQ 304
Query: 330 PVINIGKDDLEADTSTYHSDVIRH-IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
++ ++D D ++H H + + ++DPS+A+GF+ ++++DFD++C K +
Sbjct: 305 TFVD-SEEDGTVDDQSFHCQQSPHRMQILNLDPSVALGFFFKEEEDFDNWCRLVQKEILK 363
Query: 389 SNGAPLFTVTQTH 401
+F + Q H
Sbjct: 364 PQSLQMFELVQKH 376
>gi|395840680|ref|XP_003793181.1| PREDICTED: cysteine protease ATG4C [Otolemur garnettii]
Length = 457
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 123/422 (29%), Positives = 181/422 (42%), Gaps = 80/422 (18%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ ++E L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENEMLSARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPGALNIENSDSESWTSHTVK 155
Query: 155 ---------------LQKP-------------FDREYVEILH-----LFGDSETSPFSIH 181
L+ P + EI H FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETMRKYSDYHETRNEIYHRKIVSWFGDSPLAFFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + + G + IYV +D
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEAKHPDLQG-----ITIYVA---QDCTVY 267
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
+ V+ ASR S+G D +++LVP+ LG E+ NP Y+ ++ + +GI
Sbjct: 268 NSDVIDTQSASRT----SEGAED-KAVIILVPVRLGGERTNPDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH QP +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQPFVDVSVKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKKPVNHSDVLGETGGVPE 419
S IGFYCR+ DF+ +K+ S PLFT H + + + E
Sbjct: 381 SCTIGFYCRNVQDFERTSEEITKMLRISAKEKYPLFTFVNGHSRDYDFTSTTTNEDLFSE 440
Query: 420 DD 421
D+
Sbjct: 441 DE 442
>gi|225685095|gb|EEH23379.1| peptidase family C54 [Paracoccidioides brasiliensis Pb03]
Length = 508
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 164/345 (47%), Gaps = 51/345 (14%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVG 126
G++ A F DF S+I ++YR GF I S T+D G
Sbjct: 138 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 197
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+RS Q L+A AL LGR WR+ + D+E +L LF D +PFSIH ++
Sbjct: 198 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 254
Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G A G G W GP A R +AL+ C+ + +YV S D
Sbjct: 255 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 298
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+D R + +A P L+L+ + LG+++V P Y L+ +PQS+GI GG+
Sbjct: 299 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 357
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 362
P +S Y +G Q YLDPH +P + G+ E + ++YH+ +R +H+ +DPS
Sbjct: 358 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 417
Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNH 407
+ IGF +D+DD+ D+ +A G + V+ P H
Sbjct: 418 MLIGFLIKDEDDWADWKRNVGSVA----GKAIVHVSDKENSPFGH 458
>gi|340931831|gb|EGS19364.1| cysteine protease-like protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 494
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 120/363 (33%), Positives = 173/363 (47%), Gaps = 65/363 (17%)
Query: 84 ALGDAAGNNG---LAEFNQDFSSRILISYRKGF-------DPIGDSKIT----------- 122
A GDA G F DF SRI ++YR GF DP S ++
Sbjct: 139 AYGDADGTTDGGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSALSFSMRLKTSFGA 198
Query: 123 ------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
SD GWGCM+RS Q L+A ALL RLGR WR+ +RE IL LF D +
Sbjct: 199 DQAGFSSDTGWGCMIRSGQSLLANALLISRLGREWRRGQNPKAERE---ILSLFADDPRA 255
Query: 177 PFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
P+S+HN ++ G +A G G W GP A R +ALA +E + +Y
Sbjct: 256 PYSLHNFVKHGAEACGKFPGEWFGPSATARCIQALANKHESE---------LRVYST--- 303
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
G P V D ++ + + P L+LV LG++K+N Y L T
Sbjct: 304 -----GDLPDVYEDS---FMAIANPDGQHFHPTLVLVCTRLGIDKINKVYEQALISTLQM 355
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIR 352
QS+GI GG+P S Y +GVQ++ YLDPH +P++ +D + + + H+ +R
Sbjct: 356 EQSIGIAGGRPSQSHYFIGVQDQWLFYLDPHYPRPMLPYRENPEDYTQEEVDSCHTRRLR 415
Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
H+H++ +DPS+ IGF +D+DD+D + + + G + TV+ H LG
Sbjct: 416 HLHVEDLDPSMLIGFLIKDEDDWDTWKSAVKHV----QGKAIITVSP-------HDPALG 464
Query: 413 ETG 415
TG
Sbjct: 465 GTG 467
>gi|320166566|gb|EFW43465.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 336
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/295 (32%), Positives = 149/295 (50%), Gaps = 25/295 (8%)
Query: 107 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 166
++YR F I DS +D GWGCMLR QML+A+A+ LG+ W +K +E
Sbjct: 36 MTYRNHFAQIADSYYNTDAGWGCMLRCGQMLLARAMTVQHLGKNWAPTSRKQRHQEMARF 95
Query: 167 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
L LF D+ +PFSIH + + G+A G G W GP + + + L QR+ + C
Sbjct: 96 LPLFFDTPAAPFSIHRIAERGEALGKTIGQWFGPNTVAQVLKNLVNSQRSSLIVHCA--- 152
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRY 285
V++ E + A + D +H +L+LVP+ LGL + +NP Y
Sbjct: 153 -MDGVLNRTEASTQLAA---ALSDGKKHS------------LLVLVPIRLGLNQSINPVY 196
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ-PVINIGKDDLEADTS 344
IP L+ T PQ LGI+GGKP A+ + VG E+ +YLDPH VQ + + D +E
Sbjct: 197 IPALKATLELPQCLGIIGGKPNAAHFFVGTVNENVLYLDPHVVQDAAMELTPDTVE---- 252
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
++ V+ + + +DPS+ + C + +D R+ ++ + G LF V +
Sbjct: 253 SFSVAVLSKMAISDVDPSMCAAYLCSSVAELEDLGKRSKQITSQFRGYGLFDVIE 307
>gi|315047608|ref|XP_003173179.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
gi|311343565|gb|EFR02768.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
Length = 471
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 58/324 (17%)
Query: 96 EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 131
+F DF SR+ I+YR F PI DS + TSD GWGCM+
Sbjct: 136 QFLDDFESRLWITYRSQFPPIPKMPKTGSSDSSMPLGVRLRSQLIDTQGFTSDTGWGCMI 195
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH +Q G A
Sbjct: 196 RSGQALLANTLLFLRLGRDWRRGSKI---QEESELVSLFADHPRAPFSIHRFVQHGATAC 252
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 249
G G W GP A + +AL + + GL +YV + G + ER V C +
Sbjct: 253 GKCPGEWFGPSAAAQCIQALVKSN-PQAGL-------RVYVTNDGSDIYERQFREVACDE 304
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
S P L+L+ + LG+++V P Y +L+ +PQS+GI GG+P +S
Sbjct: 305 SGS------------IKPTLILLGVRLGIDRVTPIYWDSLKALLHYPQSVGIAGGRPSSS 352
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---------ADTSTYHSDVIRHIHLDSID 360
Y + Q +S YLDPH +P + + E + STYH+ +R +H+ +D
Sbjct: 353 HYFIATQGDSFFYLDPHQTRPCLAPRSEPTEDEESHPYSPEELSTYHTRRLRRLHVREMD 412
Query: 361 PSLAIGFYCRDKDDFDDFCARASK 384
PS+ IG RD+ D++D +R +
Sbjct: 413 PSMLIGLLVRDEGDWEDLKSRVKE 436
>gi|312073335|ref|XP_003139474.1| hypothetical protein LOAG_03889 [Loa loa]
gi|307765357|gb|EFO24591.1| hypothetical protein LOAG_03889 [Loa loa]
Length = 458
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 121/425 (28%), Positives = 180/425 (42%), Gaps = 83/425 (19%)
Query: 50 RIHE---RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR-- 104
R+HE R+ + +S L +A++ AL D+ N + + F+SR
Sbjct: 11 RVHEEAKRLFADWKPAVSKMLETYLTLDPSFSVAENYALFDS--NLPIYLLGEKFTSRRD 68
Query: 105 -----------ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+ +YRK F PIG T+D GWGCMLR QML+A+ L+ LGR W
Sbjct: 69 MERIKDIMASLLWFTYRKNFQPIGGIGPTTDQGWGCMLRCGQMLLARVLIVRHLGRNWL- 127
Query: 154 PLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR--- 205
+DR EY IL +F D + S FSIH + G + G G W GP +
Sbjct: 128 -----WDRDIKLAEYKRILRMFQDKKNSLFSIHQIAHMGVSEGKNIGEWFGPNTTAQVLK 182
Query: 206 ------SWEALA--------------------------RCQRAETGLGCQSLPMAIYVVS 233
W LA R ETG A+
Sbjct: 183 KLVIYDQWSRLAVHVALDNVLITSDIRTMAFTRPPYRKSGSRRETGSDYNDNHDAVNPAE 242
Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
+ E +P + S + +W P+L+++PL LGL +N Y P ++ F
Sbjct: 243 AEIFPESTRSPT---RSETSSISSYGGNSEEWRPLLIIIPLRLGLSTINRCYFPAIQAFF 299
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG---------------KDD 338
PQ +GI+GG+P + Y G+ + + +YLDPH Q +++ K+D
Sbjct: 300 QLPQCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQDFVDLDETTATRDERDGYVEIKND 359
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 398
E STYH I +D +DPSLA+GF C +DD+++ R ++ PLF +
Sbjct: 360 -EFRDSTYHCPFILTTKIDKVDPSLALGFLCHTEDDYNELAQRLRTHLLPASTPPLFEML 418
Query: 399 QTHKK 403
+T K
Sbjct: 419 ETRPK 423
>gi|119591686|gb|EAW71280.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_c
[Homo sapiens]
Length = 354
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 155/319 (48%), Gaps = 40/319 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 347 HSDVIRHIHLDSIDPSLAI 365
+ + +DPS+A+
Sbjct: 301 CQHPPCRMSIAELDPSIAV 319
>gi|348529755|ref|XP_003452378.1| PREDICTED: cysteine protease ATG4C-like [Oreochromis niloticus]
Length = 478
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 120/416 (28%), Positives = 179/416 (43%), Gaps = 84/416 (20%)
Query: 65 SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLA-----EFNQDFSSRILISYRKGFDPIGD 118
S S + LLG C H A+DE A L F +DF+SR+ ++YR+ F P+
Sbjct: 36 SRNSPVLLLGKCYHFKAEDEESPTEASVEDLVMGDVDAFRRDFASRVWLTYREEFSPLPG 95
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE------------- 162
S +TSD GWGCMLR+ QM++AQ L+ H LGR W + L +P D E
Sbjct: 96 STLTSDCGWGCMLRAGQMMLAQGLMLHFLGRDWTWSEALTLQPLDTETWTTTAAKRLVAS 155
Query: 163 ---------------------------------------YVEILHLFGDSETSPFSIHNL 183
+ ++ FGDS ++P +H L
Sbjct: 156 LEASLQGVPGPSVRSSSPQAQALSLGSAEEADAHLKEMYHRTLVSWFGDSPSTPLGLHRL 215
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA--IYVVSGD------ 235
++ G G AG W GP + + + + GL C + ++ V S D
Sbjct: 216 VRLGLTMGKQAGDWYGPAVVAHILKKAVE-EAMDPGLACITAYVSQDCTVYSADVVDCHR 274
Query: 236 ------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 289
E AP + +D H S + +A +++LVP+ LG EK NP Y
Sbjct: 275 APRAERTSDETPDAPTLPQNDQPAHASTLPESRA----VIILVPVRLGGEKTNPEYFDFA 330
Query: 290 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSD 349
+ + +GI+GGKP + Y VG Q++S IY+DPH Q +++ D +YH
Sbjct: 331 KSILSLEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSYHCP 388
Query: 350 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKK 403
+ + +DPS +GFY R D++ SKL + S P FT Q H +
Sbjct: 389 SPKKMPFSKMDPSCTVGFYSRSVQDYERISQELSKLLQPSAKEKYPAFTFVQGHGR 444
>gi|390365223|ref|XP_785967.3| PREDICTED: cysteine protease ATG4B-like [Strongylocentrotus
purpuratus]
Length = 390
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 105/332 (31%), Positives = 159/332 (47%), Gaps = 50/332 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
IW+LG + ++Q + E D SR+ +YRKGF IG + T+D GWGC
Sbjct: 48 IWILGKKYDLSQHQL-----------EARLDVLSRLWFTYRKGFSNIGGTGPTTDQGWGC 96
Query: 130 MLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
MLR QM++AQAL++ LGR WR +P ++ D Y++IL LF D + S FSIH + Q G
Sbjct: 97 MLRCGQMMLAQALVYKHLGRDWRWRPQEQ--DETYLKILQLFLDKKDSCFSIHQIAQMGV 154
Query: 189 AYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
G G W GP + + SW LA + + + + V S E+
Sbjct: 155 GEGKKVGDWFGPNTVGQVIRKLSPFDSWSDLAVHVALDNTVVIEDIRKLCTVNSTTEETS 214
Query: 240 RGGAPV--------------------------VCIDDASRHCSVFSKGQADWTPILLLVP 273
G+ + + + + S G W + L++P
Sbjct: 215 SEGSKTGSERRKRTSSSENIRHKMQLSPENTNIQLPNGLMEGACVSPGGVSWRSLFLIIP 274
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L LGL ++N Y+ L+ FT PQSLG++GGKP + Y +GV + +YLDPH QP +
Sbjct: 275 LRLGLNEINTVYMQRLKRCFTLPQSLGVIGGKPNHAHYFIGVLGDEMVYLDPHTTQPAAD 334
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
I K D S +H + + + ++DPS+ +
Sbjct: 335 IDKWAFLQDES-FHCEHASRMPIKNLDPSIGL 365
>gi|242814606|ref|XP_002486401.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
stipitatus ATCC 10500]
gi|218714740|gb|EED14163.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
stipitatus ATCC 10500]
Length = 454
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 110/337 (32%), Positives = 155/337 (45%), Gaps = 51/337 (15%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 132
F DF RI ++YR GF PI S+ TSD GWGCM+R
Sbjct: 117 FLDDFECRIWMTYRSGFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 176
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 191
S Q L+A AL RLGR WR+ E +L LF D +PFSIH ++ G Y G
Sbjct: 177 SGQSLLANALAISRLGRDWRRGSNST---EENRLLSLFADDPAAPFSIHKFVRHGALYCG 233
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A +AL+ + + G M +YV S + V + +
Sbjct: 234 KHPGEWFGPSATATCIQALSD-EYKDAG-------MNVYVSSDNTYVYEDKFKAVAYNQS 285
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
R P L+L+ LG++++ P Y L PQ+LGI GG+P AS Y
Sbjct: 286 DRM-----------RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQALGIAGGRPSASHY 334
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+GVQ YLDPH +P + DL + + + H+ +R IH+D +DPS+ +GF
Sbjct: 335 FIGVQNSFFFYLDPHHTRPALPYKTGDLAYTQEEIDSCHTRRLRRIHIDDMDPSMLVGFL 394
Query: 369 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 405
RD++D+ D+ R + E NG + + T P
Sbjct: 395 IRDENDWMDWKRRITSSRPE-NGKAIIHIVDTKNVPT 430
>gi|443730776|gb|ELU16134.1| hypothetical protein CAPTEDRAFT_228011 [Capitella teleta]
Length = 450
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 121/370 (32%), Positives = 171/370 (46%), Gaps = 50/370 (13%)
Query: 68 SDIWLLGVCHKIAQDEALGDAAGNNG------LAEFNQDFSSRILISYRKGFDPIGDSKI 121
S I LLG C+ ++ E N F +DFSS+I +YRK F + S +
Sbjct: 82 SPIILLGKCYCCSKSEKEDQRRQPNNSNILTTFDRFKRDFSSKIWFTYRKDFPKLYGSPL 141
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGD--SETS 176
TSDVGWGCMLR++QM++AQAL+ H LGR W + +E + +I+ LFGD S
Sbjct: 142 TSDVGWGCMLRTAQMIIAQALVMHYLGRDWTIHHTQQNRKETMLHRQIIRLFGDFPGNDS 201
Query: 177 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
PFSI L++ G +G G W GP ++ YVV
Sbjct: 202 PFSIQALVRIGVDHGKRPGDWYGPASVA-------------------------YVVRDAI 236
Query: 237 DGERGGAPV---VCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPT 288
+ P+ VC+ A C+V+ + D W +++LVP+ LG E +NP Y
Sbjct: 237 NQVPDFHPLLSQVCVYVAP-DCTVYIQDVIDLCTQHWKAVVILVPVRLGGEALNPIYSQC 295
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
++ LGI+GG+P S Y VG QEE +YLDPH Q ++ D TSTYH
Sbjct: 296 VQSLLAHELCLGIIGGRPKHSLYFVGWQEEKLLYLDPHFCQDTVDTRFRDFP--TSTYHC 353
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA---EESNGAPLFTVTQTHKKPV 405
R + L +DPS +GFY F+ KL ++ PLF +
Sbjct: 354 LSPRKLALQKMDPSCTLGFYIPTHAAFNRLVKDMQKLVTPPKDQGIYPLFVFQDGRSIDI 413
Query: 406 NHSDVLGETG 415
HS + E+
Sbjct: 414 EHSHIKPESN 423
>gi|166990662|sp|A7F045.2|ATG4_SCLS1 RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
Length = 439
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/309 (33%), Positives = 150/309 (48%), Gaps = 49/309 (15%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF ++I ++YR F I S+ TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A ALL R+GR WR+ +R+ IL LF D +P+SIH ++ G A G
Sbjct: 163 GQSLLANALLTLRMGREWRRGSSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A ARC +A T +S + +Y+ D +D
Sbjct: 220 HPGEWFGP-------SAAARCIQALTNSQVES-ELRVYITGDGSD---------VYEDT- 261
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
S+ +TP L+LV LGL+K+ P Y L+ + PQS+GI GG+P +S Y
Sbjct: 262 -FMSIAKPNSTKFTPTLILVGTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYF 320
Query: 313 VGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+GVQE YLDPH +P + +D D + H+ +R +H+ +DPS+ I F
Sbjct: 321 IGVQESDFFYLDPHQTRPALPFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDPSMLIAFLI 380
Query: 370 RDKDDFDDF 378
RD++D+ D+
Sbjct: 381 RDENDWKDW 389
>gi|212545090|ref|XP_002152699.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
marneffei ATCC 18224]
gi|210065668|gb|EEA19762.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
marneffei ATCC 18224]
Length = 489
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 112/355 (31%), Positives = 161/355 (45%), Gaps = 53/355 (14%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 132
F DF S+I ++YR F PI S+ TSD GWGCM+R
Sbjct: 153 FLDDFESKIWMTYRSNFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 212
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 191
S QML+A AL RLGR WR+ E ++L LF D +PFSIH ++ G Y G
Sbjct: 213 SGQMLLANALAISRLGRDWRRVSHT---TEENKLLSLFADDPAAPFSIHRFVRHGALYCG 269
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A +AL+ + M +YV S +D
Sbjct: 270 KHPGEWFGPSATATCIQALSEEYKVAG--------MNVYVSSDS---------TYVYEDK 312
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ + G P L+L+ LG++++ P Y L PQSLGI GG+P +S Y
Sbjct: 313 FKAVAYNQPGHM--RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQSLGIAGGRPSSSHY 370
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+GVQ YLDPH +P + D + + H+ +R IH+D +DPS+ +GF
Sbjct: 371 FIGVQNSFFFYLDPHHTRPALPHKVDSAYTQEQVDSCHTRRLRRIHIDDMDPSMLVGFLI 430
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP---VNHSDVLGETGGVPEDD 421
RD++D+ D+ R + + E NG + + T P + L E + +DD
Sbjct: 431 RDENDWIDWKRRIAS-SREGNGKAIIHIIDTESVPTPTMEREAALDEVEALDDDD 484
>gi|425778592|gb|EKV16710.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
digitatum PHI26]
Length = 401
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 123/421 (29%), Positives = 186/421 (44%), Gaps = 75/421 (17%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 96
+RI + P T + S IW LG + A + D A NN +
Sbjct: 9 KRIVQYFWDPEPTNNVPAAS-IWCLG--KEYAPPQPFSDPATNNPHSSSGQPDASTLNDT 65
Query: 97 -----FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWG 128
F DF SRI I+YR F PI +K TSD GWG
Sbjct: 66 AWPNAFVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGFTSDTGWG 125
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 187
CM+RS Q L+A A LGR WR+ + + E +++ +F D +PFSIH + G
Sbjct: 126 CMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIHKFVNRGA 182
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
++ G G W GP A + + L+ A + +YV + D
Sbjct: 183 ESCGKYPGEWFGPSATAKCIQLLSTQSEAHR--------LRVYVTNDTSD---------V 225
Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
+D H S G P L+L+ LG+E V P Y LR T+PQS+GI GG+P
Sbjct: 226 YEDKFAHVSHDRSGCIQ--PTLILIGTRLGIENVTPAYWDGLRAALTYPQSVGIAGGRPS 283
Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAI 365
AS Y +G Q+ +LDPH +P D+L + + +Y++ +R IH+ +DPS+ I
Sbjct: 284 ASHYFLGAQDCHLFFLDPHTTRPATPYRPDELYTQEELDSYYTSRLRRIHIKDMDPSMLI 343
Query: 366 GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN---HSDVLGETGGVPEDDS 422
GF +D++D+ D+ K + + G P+ + +P N ++ L E + + D
Sbjct: 344 GFLIKDEEDWADW----KKRVQSTPGQPIVHMLPCQHQPDNGQGRAEALDEVEALDDSDE 399
Query: 423 L 423
+
Sbjct: 400 I 400
>gi|14042153|dbj|BAB55127.1| unnamed protein product [Homo sapiens]
Length = 331
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 158/319 (49%), Gaps = 27/319 (8%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V+C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VLCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 292
Query: 413 ETGGVPEDDSLGVMSMNDA 431
+ G E + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309
>gi|403291503|ref|XP_003936827.1| PREDICTED: cysteine protease ATG4B [Saimiri boliviensis
boliviensis]
Length = 319
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 149/300 (49%), Gaps = 25/300 (8%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C DA+RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADANRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDSCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292
>gi|345564445|gb|EGX47408.1| hypothetical protein AOL_s00083g501 [Arthrobotrys oligospora ATCC
24927]
Length = 444
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 110/301 (36%), Positives = 160/301 (53%), Gaps = 45/301 (14%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-------------------ITSDVGWGCMLRSSQML 137
F DF ++ ++YR F PI S TSD GWGCM+RS Q +
Sbjct: 111 FLDDFDAKFWMTYRSAFPPIPLSTTSRNMTLATRIRSLADQEGFTSDTGWGCMIRSGQCV 170
Query: 138 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGS 196
+A A+ +LGR WR+ + P +E IL LF D +PFS+HN ++ G+A G+ G
Sbjct: 171 LANAISLLKLGRDWRRG-KSP--QEEQHILSLFADDPRAPFSLHNFVKYGEASCGVYPGE 227
Query: 197 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 256
W GP A R +ALA A+ G Q +Y+ +GD GG +DA R +
Sbjct: 228 WFGPSATARCIQALA----AQHDEGLQ-----VYI-TGD-----GGD---VYEDAFRKIA 269
Query: 257 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 316
+ G + P L+LV + LG+E+V P Y L+ + PQS+GI GG+P AS Y +GVQ
Sbjct: 270 ISDDGV--FHPTLVLVGIRLGIERVTPVYWEALKSSLMMPQSVGIAGGRPSASHYFIGVQ 327
Query: 317 EESAIYLDPHDVQPVINIGKD-DLEADTSTY-HSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
+S YLDPH+ +P++ KD D A+ + H+ +R +HL +DPS+ + F RD D
Sbjct: 328 GQSLFYLDPHNTRPLLPYRKDSDYTAEEIEFCHTRKLRRLHLREMDPSMLLAFLIRDDRD 387
Query: 375 F 375
+
Sbjct: 388 W 388
>gi|295657177|ref|XP_002789160.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226284504|gb|EEH40070.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 601
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 116/383 (30%), Positives = 179/383 (46%), Gaps = 55/383 (14%)
Query: 52 HERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRK 111
+ R+ R+G+S + L + + ++G++ A F DF S+I ++YR
Sbjct: 195 YHRLSTSDRSGLSPTRQ----LPFTNNTRPESTSSSSSGHDWPAPFLDDFESKIWLTYRS 250
Query: 112 GF-------DPIGDSKIT----------------SDVGWGCMLRSSQMLVAQALLFHRLG 148
GF DP S +T +D GWGCM+RS Q L+A AL LG
Sbjct: 251 GFPFIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLASALSILSLG 310
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSW 207
R WR+ + D+E +L LF D +PFSIH ++ G A G G W GP A R
Sbjct: 311 RDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEYGASACGKYPGEWFGPSATARCI 367
Query: 208 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 267
+AL+ C+ + +YV S D +D R + +A P
Sbjct: 368 QALSS--------ECKHAGLNVYVTSDGSD---------VYEDRFRTIASGGATEAGIHP 410
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q YLDPH
Sbjct: 411 TLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGRPSSSHYFIGAQGSYFFYLDPHH 470
Query: 328 VQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
+P + G+ E + ++YH+ +R +H+ +DPS+ IGF +D+DD+ D+
Sbjct: 471 TRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPSMLIGFLIKDEDDWADWKRNVGS 530
Query: 385 LAEESNGAPLFTVTQTHKKPVNH 407
+A G + V P H
Sbjct: 531 VA----GKAIVHVFDKENSPFGH 549
>gi|239614382|gb|EEQ91369.1| cysteine protease atg4 [Ajellomyces dermatitidis ER-3]
gi|327351393|gb|EGE80250.1| cysteine protease atg4 [Ajellomyces dermatitidis ATCC 18188]
Length = 494
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 109/342 (31%), Positives = 162/342 (47%), Gaps = 54/342 (15%)
Query: 95 AEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCML 131
A F DF S+I ++YR F DP S +T +D GWGCM+
Sbjct: 117 AAFLDDFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMI 176
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
RS Q L+A AL LGR WR+ + +E +L LF D +PFSIH ++ G A
Sbjct: 177 RSGQSLLANALAILFLGREWRRGTKV---KEESNLLSLFADDPRAPFSIHRFVEHGASAC 233
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G G W GP A R +AL+ C+ + +YV S D +D
Sbjct: 234 GKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD---------VYED 276
Query: 251 ASRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
R ++ S G D P L+L+ + LG+++V P Y L+ +PQ++GI GG+P
Sbjct: 277 RFR--AIASGGGTGTSTDIRPTLILLGIRLGIDRVTPVYWEALKAVLKYPQAVGIAGGRP 334
Query: 307 GASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
+S Y +G Q YLDPH +P + + + + + +TYH+ +R +H+ +DPS
Sbjct: 335 SSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHTRRLRRLHIKDMDPS 394
Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
+ IGF RD+DD+D++ A NG + V P
Sbjct: 395 MLIGFLIRDEDDWDNWKRNVRGGAVTGNGKAIIHVFDKETSP 436
>gi|355757609|gb|EHH61134.1| Cysteine protease ATG4A, partial [Macaca fascicularis]
Length = 396
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 106/351 (30%), Positives = 168/351 (47%), Gaps = 43/351 (12%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 190
Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
D G+R + + + S HC W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 191 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 243
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 244 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 303
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ +++ ++DPS+A+ R D C + + + N +F + Q H
Sbjct: 304 PQRMNILNLDPSVALVGIRRLSGPGDTMCTVSPQEILKEN-LRMFELVQKH 353
>gi|209969827|ref|NP_001123274.2| autophagy-specific gene 4 [Nasonia vitripennis]
Length = 405
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 122/364 (33%), Positives = 177/364 (48%), Gaps = 31/364 (8%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD--SK 120
I + + +W+LG + +D + +D SR+ +YRKGF PIG S
Sbjct: 46 IPQTENSVWVLGKKYNAKKD-----------IDAIRRDIRSRLWFTYRKGFVPIGGFGST 94
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPF 178
TSD GWGCMLR QM++ QAL+ LGR WR P R Y+ IL F D +P+
Sbjct: 95 FTSDKGWGCMLRCGQMVLGQALISLHLGRDWR---WTPETRSSTYLNILRRFEDRRAAPY 151
Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
SIH + G + G G W GP + + + L + +L + V +
Sbjct: 152 SIHQIALMGASEGKDVGQWFGPNTIAQVLKKLVVYDDWSSITIHVALDNTLVVNDVVQQC 211
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
GA +D K + W P+LLL+PL LGL ++NP YI L+ +F FPQS
Sbjct: 212 RVEGATTAEVDGEKPL-----KAPSQWKPLLLLIPLRLGLNEINPIYINGLKTSFQFPQS 266
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV--INIGKDDLEADT-STYHSDVIRHIH 355
LG++GGKP + Y +G + I+LDPH Q ++ DD EA+ +TYH + I
Sbjct: 267 LGLIGGKPSHALYFIGYVGDEVIFLDPHTTQRAGSVDQKSDDNEAEVDATYHCKIASRIP 326
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS---DVLG 412
+ +DPS+A+ F+C + DF C + PLF + Q ++P + S DV
Sbjct: 327 ITGMDPSVALCFFCATEKDFMSLCRLMQDELIGNEKQPLFELCQ--ERPASWSPAEDVAA 384
Query: 413 ETGG 416
E G
Sbjct: 385 EALG 388
>gi|261195783|ref|XP_002624295.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
gi|239587428|gb|EEQ70071.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
Length = 494
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 109/342 (31%), Positives = 162/342 (47%), Gaps = 54/342 (15%)
Query: 95 AEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCML 131
A F DF S+I ++YR F DP S +T +D GWGCM+
Sbjct: 117 AAFLDDFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMI 176
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
RS Q L+A AL LGR WR+ + +E +L LF D +PFSIH ++ G A
Sbjct: 177 RSGQSLLANALAILFLGREWRRGTKV---KEESNLLSLFADDPRAPFSIHRFVEHGASAC 233
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G G W GP A R +AL+ C+ + +YV S D +D
Sbjct: 234 GKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD---------VYED 276
Query: 251 ASRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
R ++ S G D P L+L+ + LG+++V P Y L+ +PQ++GI GG+P
Sbjct: 277 RFR--AIASGGGTGTSTDIRPTLILLGIRLGIDRVTPVYWEALKAVLKYPQAVGIAGGRP 334
Query: 307 GASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
+S Y +G Q YLDPH +P + + + + + +TYH+ +R +H+ +DPS
Sbjct: 335 SSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHTRRLRRLHIKDMDPS 394
Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
+ IGF RD+DD+D++ A NG + V P
Sbjct: 395 MLIGFLIRDEDDWDNWKRNLRGGAVTGNGKAIIHVFDKETSP 436
>gi|226294409|gb|EEH49829.1| cysteine protease atg4 [Paracoccidioides brasiliensis Pb18]
Length = 513
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 158/324 (48%), Gaps = 47/324 (14%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVG 126
G++ A F DF S+I ++YR GF I S T+D G
Sbjct: 143 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 202
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+RS Q L+A AL LGR WR+ + D+E +L LF D +PFSIH ++
Sbjct: 203 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 259
Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G A G G W GP A R +AL+ C+ + +YV S D
Sbjct: 260 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 303
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+D R + +A P L+L+ + LG+++V P Y L+ +PQS+GI GG+
Sbjct: 304 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 362
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 362
P +S Y +G Q YLDPH +P + G+ E + ++YH+ +R +H+ +DPS
Sbjct: 363 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 422
Query: 363 LAIGFYCRDKDDFDDFCARASKLA 386
+ IGF +D+DD+ D+ +A
Sbjct: 423 MLIGFLIKDEDDWADWKRNVGSVA 446
>gi|296206033|ref|XP_002750034.1| PREDICTED: cysteine protease ATG4B isoform 2 [Callithrix jacchus]
Length = 319
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 148/300 (49%), Gaps = 25/300 (8%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C DA RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADADRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDSCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292
>gi|298231125|ref|NP_001177213.1| cysteine protease ATG4C [Sus scrofa]
gi|296874486|gb|ADH81748.1| autophagy related 4-like protein C [Sus scrofa]
Length = 458
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKLLPARSGCTIKDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
+ S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQLEGSALTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPDALNIENSDSESWTSNTAK 155
Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGRYSDDREKQNEIYHRKIISWFGDSPLTLFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCASMAPDNTDDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKSSSKEKYPLFTFVNAHSRDYDFTSTTTNKEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|198417051|ref|XP_002128504.1| PREDICTED: similar to autophagy-related cysteine endopeptidase 2
[Ciona intestinalis]
Length = 422
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 168/359 (46%), Gaps = 58/359 (16%)
Query: 69 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWG 128
+IW+LG + + AL F + S + +YRKG+ PIG + TSD GWG
Sbjct: 39 NIWVLGSRFHLPHERAL-----------FLEHIKSFLWFTYRKGYTPIGGTGPTSDSGWG 87
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
CMLR QML+A+AL + + W+ KP Y ILH D +S +SIH + Q G
Sbjct: 88 CMLRCGQMLLARALAELTMDKDWKWTEDKPQPPPYKRILHQLSDERSSCYSIHQIAQMGV 147
Query: 189 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
G G W GP + + L++ + +AI+V + VCI
Sbjct: 148 EEGKEVGQWFGPNTISQVLRRLSQFDQENV--------LAIHVAMDN---------TVCI 190
Query: 249 DDASRHCSVFSKGQAD----------------------------WTPILLLVPLVLGLEK 280
+D R CS Q + W P+LLL+PL LGL +
Sbjct: 191 EDIERLCSTTPTTQYEGACSSTCKPDRTKCNGDSPNVSPTSDDFWRPLLLLIPLRLGLSE 250
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG--KDD 338
+NP Y L+ + +S+G++GGKP + Y +G E+S I+LDPH QP + + +
Sbjct: 251 INPVYFTHLKECLHWKESVGVIGGKPNHAYYFLGCSEDSMIFLDPHTTQPYVKLPDITSN 310
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
D +T+H D + L ++DPSLA+GF C + F D C + ++ + PLF V
Sbjct: 311 ERYDDTTFHCDTPGRMLLTNLDPSLALGFICTTRGSFCDLCHKVKQMVKTPTSFPLFEV 369
>gi|238506146|ref|XP_002384275.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
NRRL3357]
gi|220690389|gb|EED46739.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
NRRL3357]
Length = 439
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 169/382 (44%), Gaps = 66/382 (17%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 91
+RI + + P + IW LGV + KI QDE + D +
Sbjct: 47 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 106
Query: 92 NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 128
F DF S+I ++YR F PI TSD GWG
Sbjct: 107 GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 166
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 187
CM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH ++ G
Sbjct: 167 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 223
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
++ G G W GP A R EAL+ C ++ +YV + D V
Sbjct: 224 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 267
Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
D R V G P L+L+ LG++ V P Y L+ PQS+GI GG+P
Sbjct: 268 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 324
Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLA 364
AS Y +G Q YLDPH +P + D + + STYH+ +R IH+ +DPS+
Sbjct: 325 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDMDPSML 384
Query: 365 IGFYCRDKDDFDDFCARASKLA 386
IGF R++DD++D+ R +
Sbjct: 385 IGFLVRNEDDWEDWKGRVGSVV 406
>gi|317151014|ref|XP_001824388.2| cysteine protease atg4 [Aspergillus oryzae RIB40]
Length = 402
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 120/381 (31%), Positives = 169/381 (44%), Gaps = 66/381 (17%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 91
+RI + + P + IW LGV + KI QDE + D +
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 70
Query: 92 NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 128
F DF S+I ++YR F PI TSD GWG
Sbjct: 71 GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 130
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 187
CM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH ++ G
Sbjct: 131 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 187
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
++ G G W GP A R EAL+ C ++ +YV + D V
Sbjct: 188 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 231
Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
D R V G P L+L+ LG++ V P Y L+ PQS+GI GG+P
Sbjct: 232 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 288
Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLA 364
AS Y +G Q YLDPH +P + D + + STYH+ +R IH+ +DPS+
Sbjct: 289 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDMDPSML 348
Query: 365 IGFYCRDKDDFDDFCARASKL 385
IGF R++DD++D+ R +
Sbjct: 349 IGFLVRNEDDWEDWKGRVGSV 369
>gi|119591685|gb|EAW71279.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 331
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 157/319 (49%), Gaps = 27/319 (8%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292
Query: 413 ETGGVPEDDSLGVMSMNDA 431
+ G E + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309
>gi|348586836|ref|XP_003479174.1| PREDICTED: cysteine protease ATG4C-like [Cavia porcellus]
Length = 435
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 125/437 (28%), Positives = 190/437 (43%), Gaps = 82/437 (18%)
Query: 51 IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQ 99
+H R + ++T S + S + LLG C+ ++ A+ D + EF +
Sbjct: 1 MHTRWVLKTKTYFSRN-SPVLLLGKCYHFKYEDEHKMLTARSGCAIEDRVIAGNVDEFRK 59
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--------- 150
DF SRI ++YR+ F PI S +++D GWGC LR+ QML+AQ L+ H LGR
Sbjct: 60 DFISRIWLTYREEFPPIEGSALSTDCGWGCTLRTGQMLLAQGLVLHFLGRAWIWPDALNI 119
Query: 151 -------WRKPLQKPFD--------------------REYVE----------------IL 167
W K F +E +E I+
Sbjct: 120 ENLDSESWTSHTVKKFAASFEASLSGERQLGTPALSLKETMEKYPNPHEVRDEVYHRKII 179
Query: 168 HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
FGDS ++ F +H L++ G+ G AG W GP + R G +
Sbjct: 180 SWFGDSPSALFGLHQLIECGRRSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----I 234
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
+YV +D + V+ ASR G AD +++LVP+ LG E+ N Y+
Sbjct: 235 TVYVA---QDCTVYNSDVIDKQSASR-----PAGNADDKAVIILVPVRLGGERTNTDYLE 286
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 347
++ + +GI+GGKP S Y G Q++S IY+DPH Q +++ D + T+H
Sbjct: 287 FVKGVLSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFH 344
Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPV 405
+ + +DPS IGFYCR+ DF +K+ + S+ PLFT H K
Sbjct: 345 CPSPKKMSFRKMDPSCTIGFYCRNVQDFQRASEEITKMLKMSSKEKYPLFTFVHGHSKDY 404
Query: 406 NH-SDVLGETGGVPEDD 421
+ S V E +DD
Sbjct: 405 DFTSTVANEEDLFSQDD 421
>gi|149507363|ref|XP_001514370.1| PREDICTED: cysteine protease ATG4C [Ornithorhynchus anatinus]
Length = 459
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 114/404 (28%), Positives = 173/404 (42%), Gaps = 80/404 (19%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNN-----------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +E G +N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKSEEDDGIPVRSNWAPEDPAVISGNVDEFRKDFVSRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
P+G S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PPMGASGLTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPAALDMENSDSESWTSHTVK 155
Query: 155 -LQKPFDREYV--------------------------------EILHLFGDSETSPFSIH 181
L F+ +V +I+ FGDS + F +H
Sbjct: 156 KLTASFEASWVGERDPRPPSASRNAPRGSGSVRDEMRNEGFHRKIISWFGDSPRTYFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L + GK G AG W GP + R G + +YV
Sbjct: 216 QLTEYGKKSGKTAGDWYGPAVVAHILRKAVEEVRHPDLQG-----LTVYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G+ D +L+LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVTDKLRASTDSGKTDDKAVLILVPVRLGGERTNIDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
S +GFYCR+ DF+ +K+ + S+ PLFT + H +
Sbjct: 381 SCTVGFYCRNVQDFERASEEITKVLKASSKEKYPLFTFVKGHSR 424
>gi|431912280|gb|ELK14417.1| Cysteine protease ATG4B [Pteropus alecto]
Length = 431
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 124/406 (30%), Positives = 185/406 (45%), Gaps = 43/406 (10%)
Query: 48 MRRIHERVLGPSRTGISSSTSDI--WLLGVCHKIAQDEALGDAAGNNGLA--EFNQDFSS 103
MR R P R+ +SS+ + W +++ L + E D +S
Sbjct: 1 MRPGPRRSCTPRRSALSSTLGEASDWCTAAAREVSAVSGLSQLQQDESYEKDEILSDVAS 60
Query: 104 RILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY 163
R+ +YRK F IG + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 61 RLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSY 120
Query: 164 VEILHLFGDSETSPFSIHNLL------QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
+L F D + S +SIH + + + S +GP +C+S+ A+ +R
Sbjct: 121 FSVLRAFMDRKDSYYSIHQIAPVHPQSRFWRQSASVRTSVLGP-QLCQSFAAVRLSRRRR 179
Query: 218 TGLGCQSLP--MAIYVVSGDEDGERGGAPVVCIDD--ASRHCSVFSKG--------QADW 265
L S P +A++ V ++D A RHC+ G W
Sbjct: 180 WELVTLSSPGKLAVFDTWSALAVHIAMDNTVVMEDISADRHCNGVPAGAEVTHRPPLPPW 239
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLT-------------------FTFPQSLGIVGGKP 306
P++LL+PL LGL +N Y+ TL+L F PQSLG++GGKP
Sbjct: 240 RPLVLLIPLRLGLTDINEAYVGTLKLASTLVGLCSAAASLPLRQHCFMMPQSLGVIGGKP 299
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
++ Y +G E IYLDPH QP + + D S + + + +DPS+A G
Sbjct: 300 NSAHYFIGYVGEELIYLDPHTTQPAVEVADRRSIPDESFHCQHPPSRMRIGELDPSIA-G 358
Query: 367 FYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
F+C+ +DDFDD+C + KL+ P+F + + + DVL
Sbjct: 359 FFCQTEDDFDDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 404
>gi|326478657|gb|EGE02667.1| cysteine protease atg4 [Trichophyton equinum CBS 127.97]
Length = 454
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 108/328 (32%), Positives = 159/328 (48%), Gaps = 62/328 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 127
+F DF S++ I+YR F PI + TSD GW
Sbjct: 115 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 174
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
GCM+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 175 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 231
Query: 188 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 245
A G G W GP A + +AL + + GL +Y+ S G + E+ V
Sbjct: 232 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 283
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
C + P L+L+ + LG+++V P Y +L+ FPQS+GI GG+
Sbjct: 284 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 331
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHL 356
P +S Y + Q +S YLDPH +P + + D E+ + STYH+ +R +H+
Sbjct: 332 PSSSHYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHI 391
Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARASK 384
+DPS+ IGF RD+DD++D R +
Sbjct: 392 REMDPSMLIGFLVRDEDDWEDLKRRVRE 419
>gi|326470473|gb|EGD94482.1| hypothetical protein TESG_01998 [Trichophyton tonsurans CBS 112818]
Length = 469
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 108/328 (32%), Positives = 159/328 (48%), Gaps = 62/328 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 127
+F DF S++ I+YR F PI + TSD GW
Sbjct: 130 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 189
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
GCM+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 190 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 246
Query: 188 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 245
A G G W GP A + +AL + + GL +Y+ S G + E+ V
Sbjct: 247 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 298
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
C + P L+L+ + LG+++V P Y +L+ FPQS+GI GG+
Sbjct: 299 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 346
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHL 356
P +S Y + Q +S YLDPH +P + + D E+ + STYH+ +R +H+
Sbjct: 347 PSSSHYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHI 406
Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARASK 384
+DPS+ IGF RD+DD++D R +
Sbjct: 407 REMDPSMLIGFLVRDEDDWEDLKRRVRE 434
>gi|149037474|gb|EDL91905.1| autophagy-related 4B (yeast), isoform CRA_b [Rattus norvegicus]
Length = 319
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 150/300 (50%), Gaps = 25/300 (8%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E + A
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEISKLCRAS 112
Query: 245 VVCIDDAS------RHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
+ C A+ RHC+ G W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 LPCAGAAALSMESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C+ ++DF+D+C + KL++ P+F + + + DVL
Sbjct: 233 RMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 292
>gi|332232054|ref|XP_003265216.1| PREDICTED: cysteine protease ATG4C isoform 1 [Nomascus leucogenys]
gi|332232056|ref|XP_003265217.1| PREDICTED: cysteine protease ATG4C isoform 2 [Nomascus leucogenys]
Length = 458
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 117/421 (27%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ ++ + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIADHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +I+ FGDS +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLAPFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|83773128|dbj|BAE63255.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|325504923|dbj|BAJ83603.1| cysteine protease Atg4 [Aspergillus oryzae]
Length = 356
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/341 (32%), Positives = 159/341 (46%), Gaps = 32/341 (9%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 108
+RI + + P + IW LGV + + + +N A + RI
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
DP G TSD GWGCM+RS Q L+A A+L LGR WR+ + E +L
Sbjct: 71 L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121
Query: 169 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
LF D +P SIH ++ G ++ G G W GP A R EAL+ C ++
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
+YV + D V D R V G P L+L+ LG++ V P Y
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 344
L+ PQS+GI GG+P AS Y +G Q YLDPH +P + D + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
TYH+ +R IH+ +DPS+ IGF R++DD++D+ R +
Sbjct: 283 TYHTRRLRRIHIQDMDPSMLIGFLVRNEDDWEDWKGRVGSV 323
>gi|325091702|gb|EGC45012.1| cysteine protease [Ajellomyces capsulatus H88]
Length = 508
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 173/382 (45%), Gaps = 57/382 (14%)
Query: 58 PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 113
P+R+ S++ LL H+ + LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144
Query: 114 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
DP + T+D GWGCM+RS Q L+A AL LGR WR+
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRR 204
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 212
+ +E ++L LF D +PFSIH ++ G A G G W GP A R +AL+
Sbjct: 205 GTKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATARCIQALSS 261
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG------QADWT 266
C+ + +YV S D +D R ++ S G D
Sbjct: 262 --------ECEHAGLNVYVTSDGSD---------VYEDRFR--AIASAGGTGAGTSTDVH 302
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q YLDPH
Sbjct: 303 PTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFYLDPH 362
Query: 327 DVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
+P + + + +TYH+ +R +H+ +DPS+ IGF RD+DD++ +
Sbjct: 363 HTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSWKRSV 422
Query: 383 SKLAEESNGAPLFTVTQTHKKP 404
A G + V K P
Sbjct: 423 HNRAMIGTGKAIIHVFDKEKSP 444
>gi|391868733|gb|EIT77943.1| cysteine protease required for autophagy - Apg4p/Aut2p [Aspergillus
oryzae 3.042]
Length = 357
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/342 (32%), Positives = 159/342 (46%), Gaps = 32/342 (9%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 108
+RI + + P + IW LGV + + + +N A + RI
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
DP G TSD GWGCM+RS Q L+A A+L LGR WR+ + E +L
Sbjct: 71 L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121
Query: 169 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
LF D +P SIH ++ G ++ G G W GP A R EAL+ C ++
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
+YV + D V D R V G P L+L+ LG++ V P Y
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 344
L+ PQS+GI GG+P AS Y +G Q YLDPH +P + D + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPYFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
TYH+ +R IH+ +DPS+ IGF R++DD++D+ R +
Sbjct: 283 TYHTRRLRRIHIQDMDPSMLIGFLVRNEDDWEDWKGRVGSVV 324
>gi|426215654|ref|XP_004002085.1| PREDICTED: cysteine protease ATG4C isoform 1 [Ovis aries]
gi|426215656|ref|XP_004002086.1| PREDICTED: cysteine protease ATG4C isoform 2 [Ovis aries]
Length = 458
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 117/409 (28%), Positives = 175/409 (42%), Gaps = 80/409 (19%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
S IGFYCR+ DF +K+ + S+ PLFT H + + +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFT 429
>gi|149709514|ref|XP_001500964.1| PREDICTED: cysteine protease ATG4C [Equus caballus]
Length = 458
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 117/422 (27%), Positives = 179/422 (42%), Gaps = 80/422 (18%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF+SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHIIAGNVEEFRKDFTSRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDFESWTSNTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSEERELKTPTISLKETIGRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCASMASDHADDKAVIILVPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHSDVLGETGGVPE 419
S IGFYCR+ DF +K+ + S+ PLFT H + + + + +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTAAKEDDLFS 440
Query: 420 DD 421
+D
Sbjct: 441 ED 442
>gi|340709295|ref|XP_003393246.1| PREDICTED: cysteine protease ATG4B-like isoform 1 [Bombus
terrestris]
Length = 383
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 109/297 (36%), Positives = 156/297 (52%), Gaps = 16/297 (5%)
Query: 92 NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 146
N + E + +D S++ +YRK F PIG +S TSD GWGCMLR QM++ QAL+
Sbjct: 31 NAIRELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILH 90
Query: 147 LGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
LGR W+ + + Y++IL F D T+ FSIH + G + G G W GP + +
Sbjct: 91 LGRDWQWTAETR-NSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQV 149
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
+ L + +L + V + G V D A V K + W
Sbjct: 150 LKKLVVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGA-----VPLKAPSQWK 204
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+LLL+PL LGL ++NP YI L+ +F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 205 PLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLGVIGGKPNLALYFIGCVENEVIYLDPH 264
Query: 327 DVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
Q ++GK +++E D +TYH I + IDPS+A+ F+C + DF C
Sbjct: 265 TTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPITGIDPSVALCFFCATEKDFKSLC 320
>gi|194384462|dbj|BAG59391.1| unnamed protein product [Homo sapiens]
Length = 319
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 148/300 (49%), Gaps = 25/300 (8%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDSTVVMEEIRRLCRTS 112
Query: 245 VVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 292
>gi|426339167|ref|XP_004033531.1| PREDICTED: cysteine protease ATG4B isoform 1 [Gorilla gorilla
gorilla]
gi|426339169|ref|XP_004033532.1| PREDICTED: cysteine protease ATG4B isoform 2 [Gorilla gorilla
gorilla]
gi|221045722|dbj|BAH14538.1| unnamed protein product [Homo sapiens]
Length = 319
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 148/300 (49%), Gaps = 25/300 (8%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292
>gi|395733089|ref|XP_002813143.2| PREDICTED: cysteine protease ATG4B isoform 2 [Pongo abelii]
Length = 331
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 156/319 (48%), Gaps = 27/319 (8%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRNS 112
Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292
Query: 413 ETGGVPEDDSLGVMSMNDA 431
+ G E + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309
>gi|296208133|ref|XP_002750954.1| PREDICTED: cysteine protease ATG4C [Callithrix jacchus]
Length = 458
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 118/421 (28%), Positives = 178/421 (42%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +++ FGDS +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEIRNEIYHRKVISWFGDSPLAPFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLQFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|340709297|ref|XP_003393247.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Bombus
terrestris]
Length = 386
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 109/297 (36%), Positives = 156/297 (52%), Gaps = 16/297 (5%)
Query: 92 NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 146
N + E + +D S++ +YRK F PIG +S TSD GWGCMLR QM++ QAL+
Sbjct: 34 NAIRELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILH 93
Query: 147 LGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
LGR W+ + + Y++IL F D T+ FSIH + G + G G W GP + +
Sbjct: 94 LGRDWQWTAETR-NSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQV 152
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
+ L + +L + V + G V D A V K + W
Sbjct: 153 LKKLVVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGA-----VPLKAPSQWK 207
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+LLL+PL LGL ++NP YI L+ +F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 208 PLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLGVIGGKPNLALYFIGCVENEVIYLDPH 267
Query: 327 DVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
Q ++GK +++E D +TYH I + IDPS+A+ F+C + DF C
Sbjct: 268 TTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPITGIDPSVALCFFCATEKDFKSLC 323
>gi|27763971|emb|CAC85555.1| Apg4-C protein [Mus musculus]
gi|148698944|gb|EDL30891.1| autophagy-related 4C (yeast), isoform CRA_a [Mus musculus]
Length = 458
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)
Query: 65 SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ ++ A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
S IGFYCR+ DF+ +K+ + S+ PLFT H K + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429
>gi|342321655|gb|EGU13587.1| Cysteine protease ATG4 [Rhodotorula glutinis ATCC 204091]
Length = 1119
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 134/455 (29%), Positives = 185/455 (40%), Gaps = 131/455 (28%)
Query: 91 NNGLAEFNQDFSSRILISYRKGF-----DPIGDSK------------------------- 120
N A F D SRI ++YR GF DP S
Sbjct: 644 NGWPAAFYHDSYSRIALTYRSGFPIIPCDPSSSSTGVVQGMLNNLSMSIGRGGHRGPSPT 703
Query: 121 -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-----------KPFDREYV 164
++SD GWGCMLR+ Q L+A AL+ LGR WR+PL P Y
Sbjct: 704 NAEGGLSSDTGWGCMLRTGQSLLANALVKVHLGRDWRRPLPLGDFITSSTSPVPSAATYA 763
Query: 165 EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
IL LF D S SPFS+H Q GK G G W GP + + L
Sbjct: 764 RILSLFLDDPSPISPFSVHRFAQQGKVLGKEIGEWFGPSTAAGAIKTLVNAYE------- 816
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---W-TPILLLVPLVLGL 278
P + VVS C+D V + D W TP+L+L+ + LG+
Sbjct: 817 ---PAGLKVVS-------------CVDGTVYESEVVAASTKDGEKWKTPVLVLINVRLGI 860
Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN--IGK 336
+ VNP Y ++ F PQS+GI GG+P +S Y VG Q S Y+DPH +P + +
Sbjct: 861 DGVNPIYYEAIKGIFRLPQSVGIAGGRPSSSYYFVGAQANSLFYIDPHHPRPAVPLVLPP 920
Query: 337 DD-------------LEADT----------------------STYHSDVIRHIHLDSIDP 361
DD ADT +TYH+D +R L S+DP
Sbjct: 921 DDSLVRAAQHLPLTPSTADTPAKESARQLDDFLLAAYPDAAWATYHTDKVRKCALSSLDP 980
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS---------DVLG 412
S+ +GF D+ D+ DF R +L++ S+ P+F + + + S L
Sbjct: 981 SMLLGFLVEDERDWQDFRLRVQELSQASS--PIFAIAPSPPSWMRRSTSSAAPATVSALS 1038
Query: 413 ETGGVPEDDSLGVMSMN-----DAVGNAHEDDWQL 442
T G DDS ++ D+ G + +DW+L
Sbjct: 1039 PTIG---DDSFSEVAGEDVADADSAGFSEPEDWEL 1070
>gi|225543220|ref|NP_778194.3| cysteine protease ATG4C [Mus musculus]
gi|225543224|ref|NP_001139439.1| cysteine protease ATG4C [Mus musculus]
gi|341940254|sp|Q811C2.2|ATG4C_MOUSE RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
cysteine endopeptidase; AltName: Full=Autophagin-3;
AltName: Full=Autophagy-related cysteine endopeptidase
3; AltName: Full=Autophagy-related protein 4 homolog C
Length = 458
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)
Query: 65 SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ ++ A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
S IGFYCR+ DF+ +K+ + S+ PLFT H K + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429
>gi|350425106|ref|XP_003494013.1| PREDICTED: cysteine protease ATG4B-like [Bombus impatiens]
Length = 383
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 112/323 (34%), Positives = 163/323 (50%), Gaps = 24/323 (7%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 120
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 16 IPQTDEPVWILGKKYNAIRE-----------LDIIRRDIRSKLWFTYRKNFVPIGGYNST 64
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
TSD GWGCMLR QM++ QAL+ LGR W+ + + Y++IL F D T+ FSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWTAETR-NSTYLKILERFEDKRTAAFSI 123
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + G + G G W GP + + + L + +L + V +
Sbjct: 124 HQIASMGASEGKEVGQWFGPNTIAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
G V D A V K + W P+LLL+PL LGL ++NP YI L+ +F PQSLG
Sbjct: 184 EGGTTVEADGA-----VPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLG 238
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHL 356
++GGKP + Y +G E IYLDPH Q ++GK +++E D +TYH I +
Sbjct: 239 VIGGKPNLALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPI 297
Query: 357 DSIDPSLAIGFYCRDKDDFDDFC 379
IDPS+A+ F+C + DF C
Sbjct: 298 TGIDPSVALCFFCATEKDFKSLC 320
>gi|301764643|ref|XP_002917740.1| PREDICTED: cysteine protease ATG4C-like [Ailuropoda melanoleuca]
gi|281350282|gb|EFB25866.1| hypothetical protein PANDA_006093 [Ailuropoda melanoleuca]
Length = 458
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/423 (28%), Positives = 175/423 (41%), Gaps = 81/423 (19%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 181
QK R Y + I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTVSQKETIRRYSDDHEMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEETRHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + D +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNH-SDVLGETGGVP 418
S IGFYCR+ DF +K+ + S+ PLFT H + + S E
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDFFS 440
Query: 419 EDD 421
ED+
Sbjct: 441 EDE 443
>gi|121704590|ref|XP_001270558.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
gi|166990611|sp|A1CJ08.1|ATG4_ASPCL RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|119398704|gb|EAW09132.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
Length = 400
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 150/313 (47%), Gaps = 49/313 (15%)
Query: 96 EFNQDFSSRILISYRKGFDPIG----------------------DSK-ITSDVGWGCMLR 132
EF D SRI I+YR F PI DS+ TSD GWGCM+R
Sbjct: 75 EFLDDVESRIWITYRSNFTPIPKPPNQEANPAMTLTVHLRSQLMDSQGFTSDTGWGCMIR 134
Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 191
S Q L+A A+L LGR WR+ + + ++LH F D +PFSIH +Q G +
Sbjct: 135 SGQSLLANAMLILLLGRDWRRGTEAGKE---AQLLHQFADHPEAPFSIHRFVQHGAEFCN 191
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R +AL A+ G S + +Y+ D + D
Sbjct: 192 KYPGEWFGPSATARCIQALV----AQQG----SSELRVYITDDTAD--------IYEDKF 235
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+R + D+ P L+LV LG++ V P Y L+ PQS+GI GG+P AS Y
Sbjct: 236 AR---IAQAEHGDFIPTLILVGTRLGIDHVTPAYWDALKEALQLPQSVGIAGGRPSASHY 292
Query: 312 IVGVQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+GV + YLDPH +P ++ + +TYH+ +R IH+ +DPS+ IGF
Sbjct: 293 FIGVHGQYLFYLDPHHTRPASLHQDVNDTLTHEEVNTYHTRRLRRIHIKDMDPSMLIGFI 352
Query: 369 CRDKDDFDDFCAR 381
R ++D+ D+ R
Sbjct: 353 IRSREDWTDWKTR 365
>gi|148698945|gb|EDL30892.1| autophagy-related 4C (yeast), isoform CRA_b [Mus musculus]
Length = 466
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)
Query: 65 SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ ++ A+ D + EF +DF SRI ++YR+ F
Sbjct: 44 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 103
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 104 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 163
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F DRE + +I+ FGDS + F +H
Sbjct: 164 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 223
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 224 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 271
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 272 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 330
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 331 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 388
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
S IGFYCR+ DF+ +K+ + S+ PLFT H K + +
Sbjct: 389 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 437
>gi|296489147|tpg|DAA31260.1| TPA: APG4 autophagy 4 homolog C [Bos taurus]
Length = 458
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKMERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|166990665|sp|Q2U5B0.2|ATG4_ASPOR RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
Length = 407
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 120/386 (31%), Positives = 169/386 (43%), Gaps = 71/386 (18%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA-----------QDE------ALG 86
+RI + + P + IW LGV + KI QDE +
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPGKLGNYQDELEAGTSKID 70
Query: 87 DAAGNNGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITS 123
D + F DF S+I ++YR F PI TS
Sbjct: 71 DVTAHGWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTS 130
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH
Sbjct: 131 DTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRF 187
Query: 184 LQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
++ G ++ G G W GP A R EAL+ C ++ +YV + D
Sbjct: 188 VKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD----- 234
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V D R V G P L+L+ LG++ V P Y L+ PQS+GI
Sbjct: 235 ---VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIA 288
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSI 359
GG+P AS Y +G Q YLDPH +P + D + + STYH+ +R IH+ +
Sbjct: 289 GGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDM 348
Query: 360 DPSLAIGFYCRDKDDFDDFCARASKL 385
DPS+ IGF R++DD++D+ R +
Sbjct: 349 DPSMLIGFLVRNEDDWEDWKGRVGSV 374
>gi|440902657|gb|ELR53425.1| Cysteine protease ATG4C [Bos grunniens mutus]
Length = 458
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIHHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|391335597|ref|XP_003742176.1| PREDICTED: cysteine protease ATG4B-like [Metaseiulus occidentalis]
Length = 393
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 163/307 (53%), Gaps = 33/307 (10%)
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW------R 152
+ FSS + +YRK F IG TSD GWGCMLR+ QM++ QAL+ LGR W R
Sbjct: 79 KSFSSMLWFTYRKNFAAIGGDGPTSDTGWGCMLRAGQMMLGQALIRKHLGRSWMWTSDDR 138
Query: 153 KPLQKPFDRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 211
P DRE Y+ IL +F D +++ FSIH + G + G A G W GP + ++ + L
Sbjct: 139 LP-----DRENYLRILRMFQDKKSATFSIHQISLMGLSEGKAVGEWFGPNTVAQALKKLV 193
Query: 212 RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLL 271
+ M ++V + ++ + D C +K W P+LL+
Sbjct: 194 QYDHWS--------EMKLHVAMDN---------IIILSDIKSLCC--AKESNKWRPLLLV 234
Query: 272 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 331
VPL LGL ++N Y + +F SLGI+GG+P + Y +G+Q E ++LDPH
Sbjct: 235 VPLRLGLSEINDIYTNAVLNSFKMKHSLGIIGGRPSHALYFIGIQREELVFLDPHTTHNY 294
Query: 332 INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG 391
+++ D+ + STYH + + + ++DPS+A+ FY D+D+ D + +A +L +++G
Sbjct: 295 VDL--DEEPYNDSTYHCQRAQRMKISNMDPSIAMCFYIGDEDELDQWRVQAKELLVDNSG 352
Query: 392 APLFTVT 398
LF +T
Sbjct: 353 HMLFEIT 359
>gi|378731837|gb|EHY58296.1| autophagy-like protein 4 [Exophiala dermatitidis NIH/UT8656]
Length = 480
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 153/328 (46%), Gaps = 56/328 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLRSS 134
F DF SRI ++YR F PI S+ TSD GWGCM+RS
Sbjct: 114 FLDDFESRIWMTYRSNFTPIPRSQEPSRASSMSFSVRLRNLTEREGFTSDTGWGCMIRSG 173
Query: 135 QMLVAQALLFHRLGRPWRK-------------PLQKPFDREYVEILHLFGDSETSPFSIH 181
Q L+A L+ LGR WR+ + EIL LF DS +PFSIH
Sbjct: 174 QSLLANTLMLLHLGRDWRRDHTHTPTTSDSKPSSSSSSTKREAEILSLFADSPDAPFSIH 233
Query: 182 NLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
+Q G A G G W GP A A C R E C + + +YV +
Sbjct: 234 RFVQHGASACGKHPGQWFGP-------SATASCIR-ELSTECAAAGLRVYVTPSASE--- 282
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
+D R + S P L+L + LGL+++ P Y L+ + T+PQS+G
Sbjct: 283 ------LYEDRFRSIAAASPSDPTIKPTLILFGIRLGLDRITPVYHEALKSSLTYPQSIG 336
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLD 357
I GG+P +S Y VG Q + YLDPH+ +P + D E + +T H+ +R + ++
Sbjct: 337 IAGGRPSSSHYFVGCQGDLFFYLDPHETRPALPHHASPADYSEEEIATCHTRRLRGLRIN 396
Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKL 385
+DPS+ IGF +D+ D++D+ R ++
Sbjct: 397 EMDPSMLIGFLIKDEADWEDWKRRIKEV 424
>gi|37748391|gb|AAH58981.1| Autophagy-related 4C (yeast) [Mus musculus]
Length = 458
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 116/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)
Query: 65 SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ ++ A+ D + EF +DF SR+ ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRLWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
S IGFYCR+ DF+ +K+ + S+ PLFT H K + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429
>gi|417401291|gb|JAA47536.1| Putative cysteine protease required for autophagy [Desmodus
rotundus]
Length = 458
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 120/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C H +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----------------LQ 156
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P ++
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 157 K-----------------------------PFDRE------YVEILHLFGDSETSPFSIH 181
K P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGRYPDDREMQNEVYHRKIISWFGDSPVALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQRASMTSDNTDGKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFQRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNKEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|397475554|ref|XP_003809200.1| PREDICTED: cysteine protease ATG4C isoform 1 [Pan paniscus]
gi|397475556|ref|XP_003809201.1| PREDICTED: cysteine protease ATG4C isoform 2 [Pan paniscus]
Length = 458
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 120/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF +K+ E S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLEFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|367047453|ref|XP_003654106.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
gi|347001369|gb|AEO67770.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
Length = 454
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 153/309 (49%), Gaps = 50/309 (16%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 133
F DF SRI ++YR GF DP +S ++ SD GWGCM+RS
Sbjct: 118 FLDDFESRIWMTYRTGFELIPRSTDPRANSALSFAMRLKTSFGDQTGFSSDTGWGCMIRS 177
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A AL RLGR WR+ +RE IL LF D +P+S+HN ++ G A G
Sbjct: 178 GQSLLANALQISRLGRDWRRATDPDAERE---ILSLFADDPRAPYSLHNFVKHGAAACGK 234
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R EALA + E+ L S G P V D
Sbjct: 235 YPGEWFGPSATARCIEALA--NQHESSLRVYST---------------GDLPDVYEDS-- 275
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+V + + P L+LV LG++K+N Y L T QS+GI GG+P +S Y
Sbjct: 276 -FMAVANPDGEHFHPTLILVCTRLGIDKINQVYEEALISTLQMEQSIGIAGGRPSSSHYF 334
Query: 313 VGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VGVQ + YLDPH +P + +D + + H+ +RH+H++ +DPS+ IGF
Sbjct: 335 VGVQGQWLFYLDPHHPRPALPYREAPEDYTSEELGSCHTRRLRHLHVEDMDPSMLIGFLI 394
Query: 370 RDKDDFDDF 378
+D+DD+D +
Sbjct: 395 KDEDDWDTW 403
>gi|355669960|gb|AER94694.1| ATG4 autophagy related 4-like protein D [Mustela putorius furo]
Length = 388
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 169/366 (46%), Gaps = 63/366 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 50 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 99
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 100 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSGLGPSEPSGLASPNRYRGPAR 159
Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
+ +R + +I+ F D +PF +H L G++ G AG W GP
Sbjct: 160 WVPPRWAHGTPELEQERRHRQIVSWFADHPRAPFGLHRLGGLGQSSGKKAGDWYGP---- 215
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 216 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 262
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 263 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 322
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 323 PHYCQPTVDVTQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSELTR 380
Query: 385 LAEESN 390
+ S+
Sbjct: 381 VLSSSS 386
>gi|355669957|gb|AER94693.1| ATG4 autophagy related 4-like protein C [Mustela putorius furo]
Length = 396
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 112/386 (29%), Positives = 163/386 (42%), Gaps = 78/386 (20%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
AGN + EF +DF SRI ++YR+ F I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 11 AGN--VEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 68
Query: 149 RPWRKP----------------------------------------LQKPFDREYVE--- 165
R W P QK R Y +
Sbjct: 69 RAWTWPDALNIENSDSESWTSNTVKKFTASFEASLSGEGELKTPTVSQKEAIRRYSDDHE 128
Query: 166 ---------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 129 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 188
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
G + IYV V D + C+ + D +++L+P+ L
Sbjct: 189 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRL 235
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G E+ N Y+ ++ + +GI+GGKP S Y G Q++S IY+DPH Q +++
Sbjct: 236 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 295
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 394
D + T+H + + +DPS IGFYCR+ DF +K+ + S+ PL
Sbjct: 296 KDFPLE--TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPL 353
Query: 395 FTVTQTHKK-------PVNHSDVLGE 413
FT H + N D+ E
Sbjct: 354 FTFVNGHSRDYDFTSTTTNEEDLFSE 379
>gi|74147895|dbj|BAE22307.1| unnamed protein product [Mus musculus]
Length = 458
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 116/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)
Query: 65 SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ ++ A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F DRE + +I+ FG+S + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGNSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
S IGFYCR+ DF+ +K+ + S+ PLFT H K + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKILKISSKEKYPLFTFVNGHSKDFDFT 429
>gi|126723748|ref|NP_001075911.1| cysteine protease ATG4C [Bos taurus]
gi|126010621|gb|AAI33599.1| ATG4C protein [Bos taurus]
Length = 458
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 175/421 (41%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L+ GK G AG W GP + R G + IYV
Sbjct: 216 QLIAYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|395530478|ref|XP_003767321.1| PREDICTED: cysteine protease ATG4C [Sarcophilus harrisii]
Length = 458
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 116/421 (27%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 111
S S + LLG C+ +E A G N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVLLLGKCYHFKSEEENDPAPVQPQWVGENEPVVVSGNVEEFRRDFISRIWLTYRE 95
Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR------------------- 152
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDVDNSDSESWTSHT 155
Query: 153 ---------------------KPLQKPFDRE----------YVEILHLFGDSETSPFSIH 181
P+++P R + +I+ F DS + F +H
Sbjct: 156 VKKLTASLEASLTGERAAQDPSPIKEPPRRGSDDGGGEESCHRKIVSWFADSPLACFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEHGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + CS + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYKADVIDKQCSSMDPENTEDKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESN--GAPLFTVTQTHKK-------PVNHSDVLG 412
S +GFYCR+ DF+ +K+ + S+ PLFT + H + P N D+
Sbjct: 381 SCTVGFYCRNIQDFERASEEITKVLKASSREKYPLFTFVKGHARDYDFTCTPTNEDDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|406862068|gb|EKD15120.1| putative cysteine protease atg4 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 441
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 125/372 (33%), Positives = 176/372 (47%), Gaps = 39/372 (10%)
Query: 9 GASKCFSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTS 68
GA+ C S PD + S S S ++ + GS E V G + S
Sbjct: 50 GATACTPSSLPDLKSASAESSRSAQPATPPDSTASSLGSGVHEDEDVGGWPTPFLDDFES 109
Query: 69 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWG 128
IWL +Q A+ + L+ + R + + GF TSD GWG
Sbjct: 110 KIWLT----YRSQFPAIPKSQDPKALSSMSLSVRLRSQLVDQAGF--------TSDTGWG 157
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
CM+RS Q L+A AL+ R+GR WR+ +E I+ LF D+ T+P+SIHN ++ G
Sbjct: 158 CMIRSGQSLLANALVMLRMGRDWRR--GSSASQEERSIISLFADTPTAPYSIHNFVEHGA 215
Query: 189 AY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGDEDGERGGAPVV 246
A G G W GP A R +ALA G QS + +YV G E E +
Sbjct: 216 AACGKHPGEWFGPSATARCIQALAN--------GHQSPELRVYVTGDGLEVYEDSFMKIA 267
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
D GQA + P L+LV LGL+K+ P Y L+ + PQSLGI GG+P
Sbjct: 268 KPD-----------GQA-FIPTLILVGTRLGLDKITPVYWEALKSSLQIPQSLGIAGGQP 315
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSL 363
+S Y +GVQ YLDPH +P + + +D + D + H+ +R IH+ +DPS+
Sbjct: 316 SSSHYFIGVQGHHFFYLDPHQTRPALPLPDNIEDYSQEDIDSCHTRRLRRIHIKEMDPSM 375
Query: 364 AIGFYCRDKDDF 375
I F RD+DD+
Sbjct: 376 LIAFLIRDEDDW 387
>gi|45861658|gb|AAS78582.1| Aut2B1 [Bos taurus]
Length = 342
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 149/312 (47%), Gaps = 26/312 (8%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
Q G G + G W GP + + + LA + L V++ R
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187
Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
P + D+ RHC+ F A W P++LL+PL LGL VN Y TL+ F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307
Query: 354 IHLDSIDPSLAI 365
+ + +DPS+A+
Sbjct: 308 MSIAELDPSIAV 319
>gi|335774946|gb|AEH58408.1| cysteine protease ATG4C-like protein, partial [Equus caballus]
Length = 400
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 169/387 (43%), Gaps = 71/387 (18%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
AGN + EF +DF+SRI ++YR+ F I S +T+D GWGC +R+ QML+AQ L+ H LG
Sbjct: 15 AGN--VEEFRKDFTSRIWLTYREEFPQIEGSTLTTDCGWGCTVRTGQMLLAQGLILHFLG 72
Query: 149 RPW----------------------------------RKPLQKPF------------DRE 162
R W + L+ P D E
Sbjct: 73 RAWTWPDALNIENSDFESWTSNTVKKFTASFEASLSEERELKTPTISLKETIGRYSDDHE 132
Query: 163 ------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
+ +I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 133 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 192
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
G + IYV V D + C+ + AD +++LVP+ L
Sbjct: 193 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCASMASDHADDKAVIILVPVRL 239
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G E+ N Y+ ++ + +GI+GGKP S Y G Q++S IY+DPH Q +++
Sbjct: 240 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 299
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 394
D + T+H + + +DPS IGFYCR+ DF +K+ + S+ PL
Sbjct: 300 KDFPLE--TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPL 357
Query: 395 FTVTQTHKKPVNHSDVLGETGGVPEDD 421
FT H + + + + + +D
Sbjct: 358 FTFVNGHSRDYDFTSTAAKEDDLFSED 384
>gi|116283594|gb|AAH18678.1| ATG4C protein [Homo sapiens]
Length = 451
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGSVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGEERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|367032280|ref|XP_003665423.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
42464]
gi|347012694|gb|AEO60178.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
42464]
Length = 456
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/321 (35%), Positives = 163/321 (50%), Gaps = 57/321 (17%)
Query: 87 DAAGNNGLAE-FNQDFSSRILISYRKGF-------DP---------------IGD-SKIT 122
+++G++G F DF SRI ++YR GF DP +GD + T
Sbjct: 111 ESSGDSGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSSFSIAMRLKTTLGDQTGFT 170
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCM+RS Q L+A ALL RLGR WR+ +R IL LF D +P+S+HN
Sbjct: 171 SDTGWGCMIRSGQSLLANALLISRLGRDWRRMTDPDAERP---ILALFADDSRAPYSLHN 227
Query: 183 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
++ G+ A G G W GP A R +ALA + E+ L S G
Sbjct: 228 FVKHGELACGKYPGEWFGPSATARCIQALA--NKHESSLRVYST---------------G 270
Query: 242 GAPVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
P V D S + + D + P L+LV LG++K+N Y+ L T QS
Sbjct: 271 DLPDVYED------SFMATAKPDGETFHPTLILVCTRLGIDKINQVYVEALISTLQMEQS 324
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI--GKDDLEADT-STYHSDVIRHIH 355
+GI GG+P +S Y VGVQ + YLDPH +P + DD ++ + H+ +R +H
Sbjct: 325 IGIAGGRPASSHYFVGVQGQWLFYLDPHHPRPKLPYRENPDDYTSEELDSCHTRRLRRLH 384
Query: 356 LDSIDPSLAIGFYCRDKDDFD 376
++ +DPS+ IGF +D+DD+D
Sbjct: 385 VEDMDPSMLIGFLIKDEDDWD 405
>gi|194853882|ref|XP_001968241.1| GG24763 [Drosophila erecta]
gi|190660108|gb|EDV57300.1| GG24763 [Drosophila erecta]
Length = 411
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 104/342 (30%), Positives = 165/342 (48%), Gaps = 36/342 (10%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D GWGCMLR QM++AQAL+ LGR W D Y++I++ F D S +SIH
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-ADCRDATYLKIVNRFEDVRNSFYSIHQ 150
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 151 IAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCGMI 249
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 359
GG+P + Y +G ++ +YLDPH Q +G+ A+ TYH + ++
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAVAEQDYDETYHQKHAARLSFSAM 309
Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 310 DPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351
>gi|19920488|ref|NP_608563.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
gi|7296129|gb|AAF51423.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
gi|16198037|gb|AAL13802.1| LD26292p [Drosophila melanogaster]
gi|220945806|gb|ACL85446.1| Atg4-PA [synthetic construct]
gi|220955642|gb|ACL90364.1| Atg4-PA [synthetic construct]
Length = 411
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 180
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 357
++GG+P + Y +G ++ +YLDPH Q + + A+ TYH ++
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307
Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
++DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351
>gi|66529516|ref|XP_624577.1| PREDICTED: cysteine protease ATG4B [Apis mellifera]
Length = 382
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 161/332 (48%), Gaps = 42/332 (12%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 120
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 16 IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
TSD GWGCMLR QM++ QAL+ LGR W+ L+ + Y++IL F D +PFSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWSLETR-NSTYLKILERFEDKRNAPFSI 123
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 231
H + G + G G W GP + + W ++ + L + V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183
Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
G G AP+ K + W P+LLL+PL LGL ++NP YI L+
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 347
+F PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288
Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
I + IDPS+A+ F+C + DF C
Sbjct: 289 CKFSGRIPIIEIDPSVALCFFCATEKDFKSLC 320
>gi|442625102|ref|NP_001259852.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
gi|440213106|gb|AGB92389.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
Length = 410
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 180
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 357
++GG+P + Y +G ++ +YLDPH Q + + A+ TYH ++
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307
Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
++DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351
>gi|195470405|ref|XP_002087497.1| GE17286 [Drosophila yakuba]
gi|194173598|gb|EDW87209.1| GE17286 [Drosophila yakuba]
Length = 411
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/343 (32%), Positives = 170/343 (49%), Gaps = 42/343 (12%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQELEL-----------IRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D GWGCMLR QM++AQAL+ LGR W D Y++I++ F D S +SIH
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-SDCRDATYLKIVNRFEDVRNSYYSIHQ 150
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ Q G+ A G W+GP + + + L R + +AI+V
Sbjct: 151 IAQMGETQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELESSCGMI 249
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 359
GG+P + Y +G ++ +YLDPH Q +G+ A+ TYH + ++
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAAAEQDYDETYHQKHAARLSFSAM 309
Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEE--SNGAP-LFTVTQ 399
DPSLA+ F C+ D F+ A +KL EE S +P LF ++Q
Sbjct: 310 DPSLAVCFLCKTSDSFE---ALLTKLKEEVLSLCSPALFEISQ 349
>gi|383861144|ref|XP_003706046.1| PREDICTED: cysteine protease ATG4B-like [Megachile rotundata]
Length = 384
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 111/329 (33%), Positives = 163/329 (49%), Gaps = 50/329 (15%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 127
+W+LG + ++ L +D S++ +YRKGF PIG S TSD GW
Sbjct: 23 VWILGKQYNAIKE-----------LDAIRRDIRSKLWFTYRKGFVPIGGYTSTFTSDKGW 71
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
GCMLR QM++ QAL+ LGR W+ P + + Y++IL F D T+PFSIH +
Sbjct: 72 GCMLRCGQMVLGQALIILHLGRDWQWTPETR--NSTYLKILERFEDRRTAPFSIHQIASM 129
Query: 187 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
G + G G W GP + + + L + + I+V + +
Sbjct: 130 GASEGKEVGQWFGPNTIAQVLKKLVVYDDWSS--------ITIHVALDN---------TL 172
Query: 247 CIDDASRHCSVFS------------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
++D R C V K + W P+LLL+PL LGL ++NP YI L+ +F
Sbjct: 173 IVNDILRQCRVEGGTTAEADGNIPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFK 232
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDV 350
PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 233 IPQSLGVIGGKPNLALYFIGCVGNEVIYLDPHTTQRSGSVDKKLEEEEIEMD-ATYHCKF 291
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
I + IDPS+A+ F+C + DF C
Sbjct: 292 ASRIPITGIDPSVALCFFCATERDFKSLC 320
>gi|30410844|ref|NP_116241.2| cysteine protease ATG4C [Homo sapiens]
gi|30410846|ref|NP_835739.1| cysteine protease ATG4C [Homo sapiens]
gi|114556947|ref|XP_001159883.1| PREDICTED: cysteine protease ATG4C isoform 4 [Pan troglodytes]
gi|114556951|ref|XP_001159976.1| PREDICTED: cysteine protease ATG4C isoform 6 [Pan troglodytes]
gi|61211867|sp|Q96DT6.1|ATG4C_HUMAN RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
cysteine endopeptidase; AltName: Full=Autophagin-3;
AltName: Full=Autophagy-related cysteine endopeptidase
3; AltName: Full=Autophagy-related protein 4 homolog C
gi|14625875|emb|CAC43939.1| putative autophagy-related cysteine endopeptidase [Homo sapiens]
gi|21542522|gb|AAH33024.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [Homo sapiens]
gi|27763973|emb|CAC85556.1| Apg4-C protein [Homo sapiens]
gi|119626984|gb|EAX06579.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|119626985|gb|EAX06580.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|123983334|gb|ABM83408.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
construct]
gi|123998035|gb|ABM86619.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
construct]
gi|410220598|gb|JAA07518.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410220600|gb|JAA07519.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410267918|gb|JAA21925.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410291226|gb|JAA24213.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410291228|gb|JAA24214.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410335203|gb|JAA36548.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410335205|gb|JAA36549.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
Length = 458
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|195575679|ref|XP_002077704.1| GD23066 [Drosophila simulans]
gi|194189713|gb|EDX03289.1| GD23066 [Drosophila simulans]
Length = 411
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 180
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 357
++GG+P + Y +G ++ +YLDPH Q + + A+ TYH ++
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307
Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
++DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKFKEEVLSLCSPALFEISQTR 351
>gi|380023311|ref|XP_003695467.1| PREDICTED: cysteine protease ATG4B-like [Apis florea]
Length = 382
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 161/332 (48%), Gaps = 42/332 (12%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 120
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 16 IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
TSD GWGCMLR QM++ QAL+ LGR W+ L+ + Y++IL F D +PFSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWNLETR-NSTYLKILERFEDKRNAPFSI 123
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 231
H + G + G G W GP + + W ++ + L + V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183
Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
G G AP+ K + W P+LLL+PL LGL ++NP YI L+
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 347
+F PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288
Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
I + IDPS+A+ F+C + DF C
Sbjct: 289 CKFSGRIPIIEIDPSVALCFFCATEKDFKSLC 320
>gi|166990663|sp|Q2HH40.2|ATG4_CHAGB RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
Length = 448
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 154/310 (49%), Gaps = 56/310 (18%)
Query: 97 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
F DF SRI ++YR GF+PI GD + +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 192
Q L+A ALL +LGR WR+ +R I+ LF D +P+S+ N ++ G A G
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAERN---IVALFADDARAPYSLQNFVKHGAIACGK 229
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +ALA + + IY G P V D
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269
Query: 253 RHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
S + + D + P L+LV LG++K+NP Y L T QS+GI GG+P +S
Sbjct: 270 ---SFLATARPDGETFHPTLILVCTRLGIDKINPVYEEALISTLQMEQSIGIAGGRPSSS 326
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIG 366
Y VGVQ + YLDPH +P + ++ L + + H+ +R++H++ +DPS+ IG
Sbjct: 327 HYFVGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSCHTRRLRYLHVEDMDPSMLIG 386
Query: 367 FYCRDKDDFD 376
F +D+DD+D
Sbjct: 387 FLIQDEDDWD 396
>gi|380092671|emb|CCC09424.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 515
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/307 (35%), Positives = 150/307 (48%), Gaps = 50/307 (16%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 133
F DF SRI ++YR F DP + +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A A+L RLGR WR+ + D E +I+ LF D +PFS+HN ++ G A G
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYGATACGK 296
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +AL E+GL S G P V D
Sbjct: 297 YPGEWFGPLATARCIQALT--DEKESGLRVYST---------------GDLPDVYEDS-- 337
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+V + + P L+LV LG++K+N Y L T PQS+GI GG+P +S Y
Sbjct: 338 -FMAVANPDGRGFQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 396
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+GVQ + YLDPH +P + +D + T H+ +R +H+D +DPS+ IGF
Sbjct: 397 IGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELDTCHTRRLRQLHIDDMDPSMLIGFLI 456
Query: 370 RDKDDFD 376
+D+DD+D
Sbjct: 457 KDEDDWD 463
>gi|166990618|sp|A7KAI3.1|ATG4_PICAN RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|129714817|gb|ABO31288.1| Atg4p [Ogataea angusta]
Length = 509
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 126/404 (31%), Positives = 187/404 (46%), Gaps = 80/404 (19%)
Query: 72 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 120
L + HK D+A A + EF +D SRI ++YR GF I ++
Sbjct: 51 LRTLFHKFKPDQAADTEA--SWPREFLRDVHSRIWLTYRSGFPLIKRAEDGPSPLSFGSL 108
Query: 121 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
T+D GWGCM+R+SQ L+A +LL RLGR WR + + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANSLLQLRLGRGWRYDQTRECAK-HAEIV 167
Query: 168 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
F D T+PFSIHN ++ G G G W GP A RS + L +TGL
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKTGLKV---- 223
Query: 227 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 282
+ SGD ED +F Q A+ P+L+L + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQQGAELRPVLILAGIRLGVKNVN 263
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----- 337
P Y L+ T +PQS+GI GG+P +S Y G Q + YLDPH Q + I +
Sbjct: 264 PLYWDFLKKTLGWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHES 323
Query: 338 -------DLEA--DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
++E+ D + H++ IR +HLD +DPS+ +G ++ +D A +
Sbjct: 324 PDPNHYVEVESGLDLDSVHTNKIRKLHLDQMDPSMLVGLLVENRASYD---ALKHSINSH 380
Query: 389 SNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD--SLGVMSMND 430
G+ V + +PV + +GG+ E + LGV+SMN+
Sbjct: 381 DQGSRFLNVYDS--RPVLAAK---SSGGLEESEFVDLGVLSMNE 419
>gi|297664749|ref|XP_002810790.1| PREDICTED: cysteine protease ATG4C [Pongo abelii]
Length = 458
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 118/421 (28%), Positives = 177/421 (42%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|403257906|ref|XP_003921531.1| PREDICTED: cysteine protease ATG4C [Saimiri boliviensis
boliviensis]
Length = 458
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 118/421 (28%), Positives = 177/421 (42%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEMYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|320581937|gb|EFW96156.1| cysteine protease ATG4, putative [Ogataea parapolymorpha DL-1]
Length = 509
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 126/404 (31%), Positives = 186/404 (46%), Gaps = 80/404 (19%)
Query: 72 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 120
L + HK QD+A A + EF D SRI ++YR GF I ++
Sbjct: 51 LRTLFHKFKQDQAAETEA--SWPREFLGDVHSRIWLTYRSGFPLIRRAEDGPSPLSFGSL 108
Query: 121 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
T+D GWGCM+R+SQ L+A LL RLGR WR + + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANGLLQLRLGRGWRYDQTRECAK-HAEIV 167
Query: 168 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
F D T+PFSIHN ++ G G G W GP A RS + L + GL
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKIGLKV---- 223
Query: 227 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 282
+ SGD ED +F Q A+ P+L+L + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQEGAELRPVLILAGIRLGVKNVN 263
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----- 337
P Y L+ T ++PQS+GI GG+P +S Y G Q + YLDPH Q + I +
Sbjct: 264 PLYWDFLKKTLSWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHES 323
Query: 338 -------DLEA--DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
++E+ D + H++ IR +HLD +DPS+ +G ++ +D A +
Sbjct: 324 PDPNHYVEVESGLDLDSVHTNKIRKLHLDQMDPSMLVGLLVENRASYD---ALKHNINAH 380
Query: 389 SNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD--SLGVMSMND 430
G+ V + +PV + +GG+ E + LGV+SMN+
Sbjct: 381 DQGSRFLNVYDS--RPVLAAK---SSGGLEESEFVDLGVLSMNE 419
>gi|402854773|ref|XP_003892029.1| PREDICTED: cysteine protease ATG4C isoform 1 [Papio anubis]
gi|402854775|ref|XP_003892030.1| PREDICTED: cysteine protease ATG4C isoform 2 [Papio anubis]
Length = 458
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKSILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMAFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 ETGGVPEDDSLGVMSMNDAV 432
E E L SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456
>gi|157115549|ref|XP_001658259.1| Autophagy-specific protein, putative [Aedes aegypti]
gi|108876876|gb|EAT41101.1| AAEL007228-PA [Aedes aegypti]
Length = 389
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 114/343 (33%), Positives = 172/343 (50%), Gaps = 42/343 (12%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG + +D L +D +R+ +YR+GF PIG S++T+D GWGC
Sbjct: 28 VWILGKSYSATEDLDL-----------IRRDVQTRLWCTYRRGFVPIGGSQLTTDKGWGC 76
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL LGR W + + Y++I++ F DS+ +PFS+H + G++
Sbjct: 77 MLRCGQMVLAQALTQLHLGRDWSWTPETT-NETYLKIVNRFEDSKAAPFSLHQIALTGES 135
Query: 190 YGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
G W GP + + + L + + I+V + +
Sbjct: 136 SEEKRVGEWFGPNTVAQVLKKLVKFD--------DWCSLVIHVALDN---------TLAT 178
Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
D+ C V W P+LL++PL LGL ++NP Y+ L+ F + G+VGG+P
Sbjct: 179 DEVLELC-VDRSNPDSWKPLLLIIPLRLGLSEINPIYVDGLKKCFELAGNCGMVGGRPNQ 237
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLA 364
+ Y +G + A+YLDPH VQ IG D+ E D T+H R I+ +DPSLA
Sbjct: 238 ALYFIGYVADEALYLDPHTVQRSGTIGSKRDPDERELD-ETFHQKYARRINFKGMDPSLA 296
Query: 365 IGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKP 404
+ F C + DFDD R E+ NG PLF VT+T + P
Sbjct: 297 LCFLCATRKDFDDLIQR---FKEDLNGGGCQPLFEVTKTRQAP 336
>gi|449303631|gb|EMC99638.1| hypothetical protein BAUCODRAFT_344306 [Baudoinia compniacensis
UAMH 10762]
Length = 446
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 165/326 (50%), Gaps = 65/326 (19%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------- 120
A++EALG AEF D +RI ++YR F PI S
Sbjct: 103 AEEEALG------WPAEFMDDMEARIWLTYRNNFPPIAKSSDPSAGSAMSFSTKLRNIGN 156
Query: 121 ---ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 177
TSD GWGCM+RS Q L+A +L +LGR WR+ + + +Y ++ LF D+ +P
Sbjct: 157 SGGFTSDAGWGCMIRSGQTLLANSLATLKLGRDWRRGQK---EDDYKHLISLFADTPEAP 213
Query: 178 FSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
FSIH ++ G +A G G W GP A RS +AL R + GL + P +
Sbjct: 214 FSIHKFVEHGAQACGKHPGEWFGPSATARSVQALTEKYR-DVGLRVYARP---------D 263
Query: 237 DGERGGAPVVCIDDASRHCSVF-SKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRL 291
DG+ V +D S+F + GQ D + P L+++ + LG++++ P Y L+
Sbjct: 264 DGD------VYVD------SLFATAGQMDANDEFQPTLIVLGIRLGIDRITPVYHAALKA 311
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI--NIGKDDLEADTSTYHSD 349
T PQS+GI GG+P +S Y VG Q ++ YLDPH + I N +DL ++ H+
Sbjct: 312 TLEMPQSVGIAGGRPSSSHYFVGHQGDNFFYLDPHTTRQAIPQNPSAEDL----ASCHTR 367
Query: 350 VIRHIHLDSIDPSLAIGFYCRDKDDF 375
+R + + +DPS+ +GF K++F
Sbjct: 368 RLRRLKIAEMDPSMLLGFLIHSKEEF 393
>gi|189091768|ref|XP_001929717.1| hypothetical protein [Podospora anserina S mat+]
gi|188219237|emb|CAP49217.1| unnamed protein product [Podospora anserina S mat+]
Length = 508
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 151/307 (49%), Gaps = 50/307 (16%)
Query: 97 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
F DF SRI ++YR GF+ I GD + +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A A+L R GR WR+ +RE I+ LF D +P+SI N + G A G
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIERE---IVCLFADDPRAPYSIQNFVNHGAAACGK 289
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +ALA+ + + +Y+ P V D+
Sbjct: 290 YPGEWFGPSATARCIQALAKKHDSS---------LRVYLTRD--------LPEVYEDN-- 330
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
S + + P L+LV LG++K+NP Y L T PQ++GI GG+P +S Y
Sbjct: 331 -FMSTANPDGNHFHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSSHYF 389
Query: 313 VGVQEESAIYLDPHDVQPVINIGK---DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+G Q + YLDPH +P + + D + + H+ +RH+H++ +DPS+ IGF
Sbjct: 390 IGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSCHTRRLRHLHVEDMDPSMLIGFLI 449
Query: 370 RDKDDFD 376
+D+DD+D
Sbjct: 450 KDEDDWD 456
>gi|442757637|gb|JAA70977.1| Putative cysteine protease required for autophagy [Ixodes ricinus]
Length = 458
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 169/404 (41%), Gaps = 80/404 (19%)
Query: 65 SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C H +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------------------ 155
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPYALSIENSDSESRTSHTVK 155
Query: 156 ----------------------------QKPFDRE------YVEILHLFGDSETSPFSIH 181
+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEAPLSGARELKSPTVSLKETIGRYPDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQRASMASDNTDDKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
S IGFYCR+ DF +K+ + S+ PLFT H +
Sbjct: 381 SCTIGFYCRNIQDFKRASEEITKMLKISSKEKYPLFTFVNGHSR 424
>gi|383872484|ref|NP_001244816.1| cysteine protease ATG4C [Macaca mulatta]
gi|355745338|gb|EHH49963.1| hypothetical protein EGM_00712 [Macaca fascicularis]
gi|380788509|gb|AFE66130.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
gi|383413101|gb|AFH29764.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
Length = 458
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 ETGGVPEDDSLGVMSMNDAV 432
E E L SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456
>gi|344278625|ref|XP_003411094.1| PREDICTED: cysteine protease ATG4C [Loxodonta africana]
Length = 458
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 172/404 (42%), Gaps = 80/404 (19%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE A+ D + + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPAISSCAIEDCVISGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 181
F D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGESELKTPSISLKKTIGKYSDDHEMRNEIYHRKIVSWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKAGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQCASMASDNPDNKAVIILVPVRLGGERTNVDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
S IGFYC++ DF+ +K+ + S+ PLFT H +
Sbjct: 381 SCTIGFYCQNVQDFERASEEITKMLKVSSKEKYPLFTFVNGHSR 424
>gi|380485578|emb|CCF39271.1| cysteine protease atg4 [Colletotrichum higginsianum]
Length = 454
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 103/309 (33%), Positives = 144/309 (46%), Gaps = 49/309 (15%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF S+ ++YR F I S TSD GWGCM+RS
Sbjct: 118 FLDDFESKFWMTYRSEFQAIAKSTDPRASSTLSFSMRIKSQLVDQNGFTSDSGWGCMIRS 177
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 192
Q L+A A+ LGR WR+ Q P D ++L F D +P+SIH +Q G A G
Sbjct: 178 GQSLLANAMAAINLGRDWRRG-QNPEDER--KLLSWFADDPRAPYSIHQFVQHGAVACGK 234
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +ALA Q + P+ +Y G P V D
Sbjct: 235 YPGEWFGPSATARCIQALANAQEQQ--------PLRVYST--------GDGPDVYED--- 275
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+ + + + P L+LV LG++K+ P Y L PQS+GI GG+P +S Y
Sbjct: 276 KFMEIAKPDGSRFNPTLILVGTRLGIDKITPVYWEALIAALQMPQSVGIAGGRPASSHYF 335
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+G Q YLDPH +P + D EAD T H+ +R +H+ +DPS+ +GF
Sbjct: 336 IGAQGSYLFYLDPHHTRPALPFHTDPSHYSEADVDTVHTRRLRRLHVRELDPSMLVGFLI 395
Query: 370 RDKDDFDDF 378
RD+DD+ ++
Sbjct: 396 RDEDDWAEW 404
>gi|147905876|ref|NP_001088249.1| cysteine protease ATG4C [Xenopus laevis]
gi|61211751|sp|Q5XH30.1|ATG4C_XENLA RecName: Full=Cysteine protease ATG4C; AltName:
Full=Autophagy-related protein 4 homolog C
gi|54038152|gb|AAH84245.1| LOC495080 protein [Xenopus laevis]
Length = 450
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 112/403 (27%), Positives = 172/403 (42%), Gaps = 95/403 (23%)
Query: 67 TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 114
S ++LLG C+ +++ D N+G + EF +DF SRI ++YRK F
Sbjct: 38 NSPVFLLGKCYHFKYEDSGVTADDCSNSGSDSKEDLSGNVDEFRKDFISRIWLTYRKEFP 97
Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 151
I S T+D GWGC LR+ QML+AQ LL H LGR W
Sbjct: 98 QIESSSWTTDCGWGCTLRTGQMLLAQGLLVHFLGRDWTWTEALDIFCSESDFWTANTARK 157
Query: 152 -------------------RKPLQKPFDREYVEILH-----LFGDSETSPFSIHNLLQAG 187
++PLQ + Y E LH F D + F +H L++ G
Sbjct: 158 LDPSLEKSSPENEEYVSLGKQPLQNSEKKRYSEDLHRKIISWFADYPLAYFGLHQLVKLG 217
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
K G AG W GP + L R E+ D E G +
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256
Query: 248 IDDASRHCSVFSK-------GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
D C++++ + + +++LVP+ LG E+ N Y ++ + +G
Sbjct: 257 AQD----CTIYNADVYDLQCNKGNEKAVVILVPVRLGGERTNMEYFEYVKGILSLEFCIG 312
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG Q++S IY+DPH Q +++ + + ++H + + +D
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSIKNFPLE--SFHCPSPKKMSFKKMD 370
Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTH 401
PS +GFYCR+ +F+ +K+ + S PLFT H
Sbjct: 371 PSCTVGFYCRNAREFEKAAEELTKVLKSSTKQNYPLFTFVNGH 413
>gi|291398772|ref|XP_002715996.1| PREDICTED: APG4 autophagy 4 homolog C [Oryctolagus cuniculus]
Length = 458
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 113/404 (27%), Positives = 170/404 (42%), Gaps = 80/404 (19%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTICLKETIGKCSEDHETENEICHRKIISWFGDSPLAAFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + +YV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITVYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQSASMTSDNTDDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
S IGFYCR+ DF +K+ + S+ PLFT H +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKMSSKEKYPLFTFVNGHSR 424
>gi|355558068|gb|EHH14848.1| hypothetical protein EGK_00836 [Macaca mulatta]
Length = 458
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -FSVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 ETGGVPEDDSLGVMSMNDAV 432
E E L SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456
>gi|50344862|ref|NP_001002103.1| cysteine protease ATG4C [Danio rerio]
gi|47938047|gb|AAH71514.1| Autophagy-related 4C (yeast) [Danio rerio]
Length = 463
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 120/414 (28%), Positives = 184/414 (44%), Gaps = 82/414 (19%)
Query: 59 SRTGISSSTSDIWLLGVCH--KIAQDE--------ALGDAAGNNGLAEFNQDFSSRILIS 108
S+T S + S ++LLG C+ K+ DE AL D + EF +DF+SR+ ++
Sbjct: 31 SKTAFSRN-SPVFLLGKCYHFKVVDDENPTESTAEALDDDVVTGNVDEFRKDFTSRVWLT 89
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQ-KPFDRE--- 162
YR+ F + S TSD GWGC LR+ QM++AQALL H LGR W+ + L +P D E
Sbjct: 90 YREEFPALPGSSFTSDCGWGCTLRAGQMILAQALLLHILGRDWKWSEALSLEPLDTETWT 149
Query: 163 --------------------------------------YVE------ILHLFGDSETSPF 178
Y++ I+ FGD ++
Sbjct: 150 SSAARRLVATLEASIQGERAQASQPLCPVQGEAEEADSYLKETYHRTIVSWFGDGPSAQL 209
Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
I+ L++ G G AG W GP +A R ++ I V +D
Sbjct: 210 GIYKLVELGMTSGKQAGDWYGP-------AVVAHILRKAVDEAVDAMLKGIRVYVA-QDC 261
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQA-------DWTPILLLVPLVLGLEKVNPRYIPTLRL 291
A V ID S S Q D +++L+P+ LG EK+NP Y+ ++
Sbjct: 262 TVYSADV--IDSHSTRTESHSDPQGLDSGASPDSRAVVILIPVRLGGEKINPEYLNFVKS 319
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
+ +GI+GGKP + Y VG Q++S IY+DPH Q +++ D ++H
Sbjct: 320 ILSLEYCIGIIGGKPKQAYYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSP 377
Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
+ + +DPS IGFY + + F+ SK+ + S+ P FT+ + H K
Sbjct: 378 KKMSFSKMDPSCTIGFYSKSVEHFEKIANELSKILQPSSKEKYPAFTIMKGHGK 431
>gi|118094640|ref|XP_422520.2| PREDICTED: cysteine protease ATG4C [Gallus gallus]
Length = 459
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 113/404 (27%), Positives = 168/404 (41%), Gaps = 80/404 (19%)
Query: 65 SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 113
S S ++LLG C+ DE+ L N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQA-------------------------------- 141
I S +T+D GWGC LR+ QML+AQ
Sbjct: 96 PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155
Query: 142 -----------------LLFHRLGRPWRKPLQKPFDREYV---EILHLFGDSETSPFSIH 181
+L H R R+ R V +I+ FGDS + F +H
Sbjct: 156 KLTASLEASLTAEREPKILSHHQERTLRRDCGDSEMRNEVYHRKIISWFGDSPLAAFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + +YV
Sbjct: 216 QLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D R CS G+ D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
S IGFYCR DF+ +K+ + S+ PLFT + H +
Sbjct: 381 SCTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 424
>gi|327264155|ref|XP_003216881.1| PREDICTED: cysteine protease ATG4D-like [Anolis carolinensis]
Length = 585
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 157/359 (43%), Gaps = 69/359 (19%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----- 151
F +DF+SRI ++YR+ F + + T+D GWGCMLRS QML+AQ L+ H LG+ W
Sbjct: 198 FQKDFASRIWLTYRRDFQQLEGTMWTTDCGWGCMLRSGQMLLAQGLIVHFLGKDWTWPDA 257
Query: 152 ------------------------------------------------RKPLQKPFDREY 163
R P + +R +
Sbjct: 258 LHTPGLVEMEPMKATHLPYPSTSSSHQGPSIPTDRSRGPWELRAPRHTRSPDELEKERYH 317
Query: 164 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
+I+ F D + F IH L+ G + G AG W GP C C
Sbjct: 318 RKIISWFADRPQAHFGIHRLVSLGHSSGKKAGDWYGPSVAAHIIRKAVDC--------CS 369
Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 283
+ VS D +G V + + S + + G A W +++LVP+ LG E NP
Sbjct: 370 EAGNLVVYVSQDCTVYKGD--VANLANKSEDRTAWDPG-AVWKAVIILVPMRLGGEAFNP 426
Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 343
Y+ ++ +GI+GGKP S Y VG Q+++ +YLDPH QP ++ K++ +
Sbjct: 427 AYVDCVKELLKLEFCIGIIGGKPRHSLYFVGYQDDALLYLDPHYCQPFVDTTKENFPLE- 485
Query: 344 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQ 399
++H + R +DPS IGFY + +F++ C +++ S P+F++ +
Sbjct: 486 -SFHCNSPRKTAFTKVDPSCTIGFYAHHRTEFEELCLHLTQVLNSSTAKEKYPMFSIVE 543
>gi|164660504|ref|XP_001731375.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
gi|159105275|gb|EDP44161.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
Length = 651
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 143/475 (30%), Positives = 208/475 (43%), Gaps = 100/475 (21%)
Query: 15 SKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSST------- 67
+K TP P++ + S + + V L++ + E VLG S T +S T
Sbjct: 215 AKETPLCPSQ-MHSSQQPISDHQPVSTLLS------LVEAVLGSSDTLPTSVTWLAHQLK 267
Query: 68 SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 127
+ W L H + A + F + +++R F TSDVGW
Sbjct: 268 ARGWELLASHGVPYTSPTAHTAFPGVWHSVHAVFQHILSLTHRTCF--------TSDVGW 319
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD--SETSPFSIHNLLQ 185
GCMLRS Q ++A AL+ LGR WR+ ++ +Y IL F D S PFSIH L+
Sbjct: 320 GCMLRSVQSMLANALIRVHLGRHWRRRAKQKTHPQYARILSWFMDDPSLECPFSIHRLVD 379
Query: 186 AGKAYGLAAGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
G+ G+ AG W GP +A+C+ +A C GLG V DG
Sbjct: 380 EGQRLGVQAGDWFGPSTAAFALCKLIQAYDAC-----GLG----------VVVTNDGMLY 424
Query: 242 GAPVVCIDDASRHCSVFSKGQAD-WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
VV + F+ G++D WT P+L+L+ LGL++V P Y P L+ +FT PQS+
Sbjct: 425 KEQVVA--------ASFAPGRSDPWTRPVLILLVQRLGLDQVPPHYRPALKQSFTMPQSV 476
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI------------GKDDLEADTSTYH 347
G+VGG+P +S Y VGVQ E + LDPH V+P + DL + S +
Sbjct: 477 GVVGGRPRSSLYFVGVQREHLLCLDPHHVRPCVPFRSPPRMTRASVGASTDLASTVSPWF 536
Query: 348 SDVIRHIHLDS-------------IDPSLAIGFYCRDKDDFDDFCAR----ASKLAEESN 390
+ LDS +DPS+ +GF C D D AR ++L + ++
Sbjct: 537 EEAYTAEELDSFHTPHTSLLPISQMDPSMLLGFVCEQASDLIDLQARIESSETRLFDVAD 596
Query: 391 GAPLF----------------TVTQTHKKPVNHSDVLGETGGVPE--DDSLGVMS 427
P + +THK HSD + GV + DDS M+
Sbjct: 597 NMPSYYRLSMSMGGEGEGDDDDNHRTHKAEDGHSDRVAAHSGVGDNVDDSGWTMA 651
>gi|73956170|ref|XP_852273.1| PREDICTED: cysteine protease ATG4C isoform 2 [Canis lupus
familiaris]
gi|73956176|ref|XP_865426.1| PREDICTED: cysteine protease ATG4C isoform 4 [Canis lupus
familiaris]
Length = 458
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 117/421 (27%), Positives = 172/421 (40%), Gaps = 87/421 (20%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKFEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
I S T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSAFTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSDSWTSNTVK 155
Query: 158 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 181
F D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGESELKTPTVSQKETIRRHSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + D +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 413 E 413
E
Sbjct: 441 E 441
>gi|85067704|ref|XP_959438.1| hypothetical protein NCU02433 [Neurospora crassa OR74A]
gi|62899773|sp|Q7S3X7.1|ATG4_NEUCR RecName: Full=Probable cysteine protease atg-4; AltName:
Full=Autophagy-related protein 4
gi|28920860|gb|EAA30202.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 506
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 131/405 (32%), Positives = 182/405 (44%), Gaps = 87/405 (21%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSR 60
G R A A+ C S ++ S A GS+LGS +TV VT+G ++ L
Sbjct: 112 FNGVRTTATAT-CLSDTS-----MSAAPTGSQLGSFDTVPDSVTSG-----YDSALAYEE 160
Query: 61 TGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF------- 113
G QD A F DF SRI ++YR F
Sbjct: 161 PG------------------QDGGWPPA--------FLDDFESRIWMTYRTDFALIPRSS 194
Query: 114 DPIGDSKIT----------------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 157
DP S ++ SD GWGCM+RS Q L+A A+L RLGR WR+
Sbjct: 195 DPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQSLLANAILIARLGREWRRGTD- 253
Query: 158 PFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRA 216
D E +I+ LF D +P+S+HN ++ G A G G W GP A R +ALA
Sbjct: 254 -LDAE-KDIIALFADDPRAPYSLHNFVKYGATACGKYPGEWFGPSATARCIQALA--DEK 309
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
++GL S G P V D +V + + P L+LV L
Sbjct: 310 QSGLRVYST---------------GDLPDVYEDS---FMAVANPDGRGFQPTLILVCTRL 351
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G++K+N Y L T PQS+GI GG+P +S Y VGVQ + YLDPH +P + +
Sbjct: 352 GIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYFVGVQGQRLFYLDPHHPRPALPYRE 411
Query: 337 DD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
D + T H+ +R +H+ +DPS+ IGF +D+DD+D +
Sbjct: 412 DPRGYTAEELDTCHTRRLRQLHIGDMDPSMLIGFLIKDEDDWDTW 456
>gi|168693565|ref|NP_001108301.1| uncharacterized protein LOC100137698 [Xenopus laevis]
gi|163915830|gb|AAI57741.1| LOC100137698 protein [Xenopus laevis]
Length = 468
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 161/354 (45%), Gaps = 59/354 (16%)
Query: 91 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
++ + F +DF SR+ ++YR+ F + + +T+D GWGCM+RS QML+AQ LL H L R
Sbjct: 92 DDEIDRFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLMHLLSRE 151
Query: 151 W----------------------RKPL-------------------QKPF-DREYVEILH 168
W R PL + P ++ + I+
Sbjct: 152 WTWPEALYTHFVEMEPIRSSSPSRMPLSSLATSHSASDCWPHAHSSRAPHGNQVHRNIIR 211
Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
F D ++PF +H ++ G +G AG W GP +A + C+ ++
Sbjct: 212 WFSDHPSAPFGLHRMVALGSIFGKKAGDWYGP-------SIVAHIIKKAIETSCEVAELS 264
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
+YV S D + + D + G+A +++LVP LG E NP Y
Sbjct: 265 VYV-SQDCTVYKADIEQLFAGDVPHAETSRDAGKA----VIILVPARLGGETFNPVYKHC 319
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L+ P LGI+GGKP S Y +G Q+ +YLDPH Q I+ ++D + ++H
Sbjct: 320 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYSQSYIDTSRNDFPLE--SFHC 377
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQ 399
+ R I + +DPS FY +++DDF C K+ + P+F++++
Sbjct: 378 NTPRKISITRMDPSCTFAFYAQNRDDFGKLCDHLMKVLHSPHAEEKYPIFSISE 431
>gi|14042698|dbj|BAB55356.1| unnamed protein product [Homo sapiens]
Length = 446
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/397 (28%), Positives = 170/397 (42%), Gaps = 80/397 (20%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFT 396
S IGFYCR+ DF +K+ + S+ PLFT
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFT 417
>gi|310801857|gb|EFQ36750.1| peptidase family C54 [Glomerella graminicola M1.001]
Length = 454
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 107/329 (32%), Positives = 152/329 (46%), Gaps = 54/329 (16%)
Query: 79 IAQDEALGDAAGNNG--LAEFNQDFSSRILISYRKGFDPIGDSK---------------- 120
+A DE D +G +G F DF S+ ++YR F I S
Sbjct: 101 LAYDE---DYSGQDGGWPTAFLDDFESKFWMTYRSEFPAIAKSTDPRASSALSFSMRIKS 157
Query: 121 -------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 173
+SD GWGCM+RS Q L+A A+ LGR WR+ + +R+ +L LF D
Sbjct: 158 QLVDQNGFSSDSGWGCMIRSGQSLLANAMAVINLGRDWRRGQNQEEERK---LLSLFADD 214
Query: 174 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
+P+SIH +Q G A G G W GP A R +ALA Q + P+ +Y
Sbjct: 215 PRAPYSIHQFVQHGAVACGKYPGEWFGPSATARCIQALANAQMHQ--------PLRVYST 266
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
D + D SR + P L+LV LG++K+ P Y L
Sbjct: 267 GDGPDVYEDKFMKIAKPDGSR-----------FHPTLILVGTRLGIDKITPVYWEALIAA 315
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSD 349
PQS+GI GG+P +S Y +G Q YLDPH +P + + EAD T H+
Sbjct: 316 LQMPQSVGIAGGRPSSSHYFIGAQGSYLFYLDPHHTRPALPFHMNPSLYSEADVDTVHTR 375
Query: 350 VIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+R +H+ +DPS+ IGF D+DD+D++
Sbjct: 376 RLRRLHVRELDPSMLIGFLILDEDDWDEW 404
>gi|212645205|ref|NP_493375.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
gi|193247781|emb|CAB54483.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
Length = 454
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 168/369 (45%), Gaps = 59/369 (15%)
Query: 84 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 142
ALG + + +G+ + +SR +YR+ F PIG + ++D GWGCMLR +QML+ + L
Sbjct: 39 ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 98
Query: 143 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
L +GR + ++K Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 99 LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 157
Query: 203 MCR---------SWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 246
+ W +A + L + ++ MA S D E+G
Sbjct: 158 AAQVMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENG-------- 209
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
+ D +R +W P+LL++PL LGL +NP Y+ ++ F PQ +GI+GG+P
Sbjct: 210 -LVDKNRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRP 268
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVI-----------------NIGKDDLE--------- 340
+ Y VG+ YLDPH +P ++G LE
Sbjct: 269 NHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDVGFSHLEELVPLPSQT 328
Query: 341 ------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 394
D STYH ++ I +++DPSLA+ +C +D+F++ C K ++ P+
Sbjct: 329 ADVYTKMDDSTYHCQMMLWIEYENVDPSLALAMFCETRDEFENLCETLQKTTLPASQPPM 388
Query: 395 FTVTQTHKK 403
F Q K
Sbjct: 389 FEFLQRRPK 397
>gi|126305934|ref|XP_001364974.1| PREDICTED: cysteine protease ATG4C [Monodelphis domestica]
Length = 460
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 122/423 (28%), Positives = 183/423 (43%), Gaps = 89/423 (21%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 111
S S + LLG C+ +E A AG N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVLLLGKCYHFKSEEENDPAPVGSGWAGENEHVVIYGNVEEFRRDFISRIWLTYRE 95
Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------- 154
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDIENSDSASWTSHT 155
Query: 155 ------------------------LQKPF-----DRE------YVEILHLFGDSETSPFS 179
L++P D E + +I+ FGDS + F
Sbjct: 156 VKKLTASFEASLTGERTPKVPPSILKEPRRTGSEDEEGRNELCHRKIISWFGDSPLACFG 215
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
+H L++ GK G AG W GP + R G + IYV +D
Sbjct: 216 LHQLIEYGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVA---QDCT 267
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
A V+ S ++ +A I+LLVP+ LG E+ N Y+ ++ + +
Sbjct: 268 VYKADVIDKQGISAGLET-TEDKA----IILLVPVRLGGERTNMDYLDFVKGILSLEYCV 322
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 359
GI+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +
Sbjct: 323 GIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKM 380
Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEESN--GAPLFTVTQTHKK-------PVNHSDV 410
DPS +GFYCR+ DF+ +++ + S+ PLFT + H + P N D+
Sbjct: 381 DPSCTVGFYCRNAQDFERASEELTQVLKASSREKYPLFTFVKGHARDYDFTSTPTNEDDL 440
Query: 411 LGE 413
E
Sbjct: 441 FSE 443
>gi|453230621|ref|NP_001263575.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
gi|412974713|emb|CCO25637.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
Length = 481
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 168/369 (45%), Gaps = 59/369 (15%)
Query: 84 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 142
ALG + + +G+ + +SR +YR+ F PIG + ++D GWGCMLR +QML+ + L
Sbjct: 66 ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 125
Query: 143 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
L +GR + ++K Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 126 LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 184
Query: 203 MCR---------SWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 246
+ W +A + L + ++ MA S D E+G
Sbjct: 185 AAQVMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENG-------- 236
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
+ D +R +W P+LL++PL LGL +NP Y+ ++ F PQ +GI+GG+P
Sbjct: 237 -LVDKNRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRP 295
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVI-----------------NIGKDDLE--------- 340
+ Y VG+ YLDPH +P ++G LE
Sbjct: 296 NHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDVGFSHLEELVPLPSQT 355
Query: 341 ------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 394
D STYH ++ I +++DPSLA+ +C +D+F++ C K ++ P+
Sbjct: 356 ADVYTKMDDSTYHCQMMLWIEYENVDPSLALAMFCETRDEFENLCETLQKTTLPASQPPM 415
Query: 395 FTVTQTHKK 403
F Q K
Sbjct: 416 FEFLQRRPK 424
>gi|255945233|ref|XP_002563384.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
gi|166990617|sp|A7KAL5.1|ATG4_PENCW RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|129561973|gb|ABO31075.1| Atg4p [Penicillium chrysogenum]
gi|211588119|emb|CAP86190.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 401
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 116/380 (30%), Positives = 171/380 (45%), Gaps = 75/380 (19%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAE-----------------FNQDFSSRILISYRKG 112
IW LG + A + D A NN + F DF SRI I+YR
Sbjct: 29 IWCLG--REYAPSQPPSDPASNNPRSPSRQPNASTLNDTTWPKAFLSDFGSRIWITYRSN 86
Query: 113 FDPIGDSK-----------------------ITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
F PI +K TSD GWGCM+RS Q L+A LGR
Sbjct: 87 FTPIPRTKTPEATSSMTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQSLLANTFSVLLLGR 146
Query: 150 PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWE 208
WR+ + E +++ +F D +PFSIH + G ++ G G W GP
Sbjct: 147 DWRRGEKV---EEESKLISMFADHPEAPFSIHRFVNRGAESCGKYPGEWFGP-------S 196
Query: 209 ALARCQRAETGLGCQS-LP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
A A+C + L QS +P + +Y+ + D +D H + G+
Sbjct: 197 ATAKCIQL---LSTQSEVPQLRVYLTNDTSD---------VYEDKFAHVAHDESGRIQ-- 242
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P L+L+ LG++ V P Y LR T+PQS+GI GG+P AS Y VG Q+ +LDPH
Sbjct: 243 PTLILIGTRLGIDNVTPAYWDGLRAALTYPQSVGIAGGRPSASHYFVGAQDCHLFFLDPH 302
Query: 327 DVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
+P D L + + +Y++ +R IH+ +DPS+ IGF +D+DD+ D+ K
Sbjct: 303 TTRPATLYRPDGLYTQEELDSYYTSRLRRIHIKDMDPSMLIGFLVKDEDDWADW----KK 358
Query: 385 LAEESNGAPLFTVTQTHKKP 404
+ G P+ + + +P
Sbjct: 359 RIRSTPGQPIVHIFPSQHQP 378
>gi|387015378|gb|AFJ49808.1| Cysteine protease ATG4C-like [Crotalus adamanteus]
Length = 457
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 118/431 (27%), Positives = 180/431 (41%), Gaps = 80/431 (18%)
Query: 65 SSTSDIWLLGVCHKIAQDEA-----------LGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S ++LLG C+ DE + D + + + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDEPSDQSPNGSCDDMTDESFSRNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQITGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWANAFVFENPESESWTSQTVK 155
Query: 152 -----------------------RKPLQKPFDREYVE------ILHLFGDSETSPFSIHN 182
+ P++ E VE I+ F DS + F +H
Sbjct: 156 KLTASLETSLIGEREFRSQSTHPKSPIRNQETEESVEEQYHRRIISWFADSPFANFGLHR 215
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
L++ GK G AG W GP + L R + E + + IYV +
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAH----LLR-KAVEKARDPELQGITIYVAQDCTVYKSDV 270
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
+C S SV S I++L+P+ LG E+ N Y ++ + +GI+
Sbjct: 271 IDALCPFTDSEKTSVKS--------IIILIPVRLGGERTNMEYFEFVKGILSLDYCIGII 322
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DPS
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSVKDFPLE--SFHCPSPKKMSFKKMDPS 380
Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESN-GAPLFTVTQTHKKPVNH--SDVLGETGGVPE 419
IG YC D F+ +K+ + S PLFT H + + S V E E
Sbjct: 381 CTIGLYCPDMQGFERAAEEITKILKLSKEKYPLFTFVNGHSRDFDFVVSPVQEEKTMFSE 440
Query: 420 DDSLGVMSMND 430
++ + N+
Sbjct: 441 EEHKKLACFNN 451
>gi|268570274|ref|XP_002640735.1| Hypothetical protein CBG19805 [Caenorhabditis briggsae]
Length = 481
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 122/469 (26%), Positives = 195/469 (41%), Gaps = 104/469 (22%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
IS T IW LG + + +G+ + +SR +YR+ F PIG + +
Sbjct: 25 ISIDTFPIWALG-----------KEISKEDGIDAMKKYMTSRFWFTYRRNFSPIGGTGPS 73
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D WGCMLR +QML+ + LL +GR + ++K D Y +IL +F D + + +SIH
Sbjct: 74 TDQYWGCMLRCAQMLLGEVLLRRHIGRHFEWDIEKTSDV-YEKILQMFFDEKDALYSIHQ 132
Query: 183 LLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQ-SLPMAIYVV 232
+ Q G + G W GP + W +A + L Q +L MA
Sbjct: 133 IAQMGVSEGKEVSEWFGPNTAAQVIKKLTIFDDWSNIAVHVALDNILVKQDALTMATTYP 192
Query: 233 SGDE----DGERG-------GAPVVCID-DASRHCSVFSKGQAD-------------WTP 267
S D GE G + ++C++ D + F G + W P
Sbjct: 193 SEDAVKLIMGEFGFKSDRISSSHIICMNLDYFKKLLNFENGLVEKHYTSTVPANGTEWRP 252
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
+LL++PL LGL +N Y+ ++ F PQ +GI+GGKP + Y VG+ YLDPH
Sbjct: 253 LLLMIPLRLGLTSINSCYLSAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHH 312
Query: 328 VQP--------------------------VINIGKDDLE---------------ADTSTY 346
+P + + G +LE + STY
Sbjct: 313 CRPKTSKFFVEKEQQQQSSGDSTPEKVEKIDDNGFHELEDLEPLPSQTSDVYTKMNDSTY 372
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV- 405
H +++ + DSIDPSLA+ +C +++F++ C K ++ P+F + K +
Sbjct: 373 HCQMMQWMEYDSIDPSLALALFCETREEFENLCDELQKTTLTASNPPMFEFLEKRPKYLP 432
Query: 406 ---------------NHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 439
D+ + ED + +S+ DA A DD
Sbjct: 433 KFEPYTGVSMKIEMKEFDDIGAANSKIDEDFEVLDVSVEDAETGAEADD 481
>gi|47222154|emb|CAG11580.1| unnamed protein product [Tetraodon nigroviridis]
Length = 440
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 115/410 (28%), Positives = 174/410 (42%), Gaps = 81/410 (19%)
Query: 65 SSTSDIWLLGVCH--KIAQDEALGDAAGN--------NGLAEFNQDFSSRILISYRKGFD 114
S S + LLG C+ K+ +DE + +A + +F +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKVEEDEGVAEACCEASDEEDVVGNVEDFRRDFGSRIWLTYREEFP 95
Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDRE--------- 162
P+ S +TSD GWGCMLR+ QM++AQALL H +GR W R +P D E
Sbjct: 96 PLPGSTLTSDCGWGCMLRAGQMMLAQALLLHFMGRDWTWSRTMSLQPLDTETWTTSAAKR 155
Query: 163 ----------------------------------YVE-------ILHLFGDSETSPFSIH 181
+VE ++ FGDS ++ F +H
Sbjct: 156 LVASLESSLQGSPGPSDNRGPQNQAAGSAEEAGAHVEGEAFHRTLVSWFGDSPSAQFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSW-----EALARCQRAETGLGCQSLPMAIYVVSGDE 236
++ G G AG W GP + EAL T Q + V
Sbjct: 216 RMVHLGLEMGKQAGEWYGPAVVAHILKKAVEEALDPSLAGITAYVSQDCTVYSADVIDGH 275
Query: 237 DGERGGAP-----VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
+P V + ++ S +A +++LVP+ LG EK NP Y +
Sbjct: 276 KASTSASPESSDDVTLLSPNNQAASALPDSRA----VIILVPVRLGGEKTNPDYFNLAKS 331
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
+ +GI+GGKP + Y VG Q++S IY+DPH Q +++ D ++H
Sbjct: 332 ILSLDYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSP 389
Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQ 399
+ + +DPS +GFY R DF+ +KL + S+ P F Q
Sbjct: 390 KKMPFTKMDPSCTLGFYSRSAQDFEKIKQELTKLLQPSSKEKYPAFIFVQ 439
>gi|121934653|sp|Q0U199.1|ATG4_PHANO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
Length = 467
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 158/354 (44%), Gaps = 87/354 (24%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
N + F DF SR+ ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
+RS Q ++A AL RLGR WR + D+++ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209
Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + LA R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVYV-SGD------GADVY--E 252
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + +V G W P L+LV LG++K+ P Y L+ + PQS+GI GG+P AS
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKASLQIPQSIGIAGGRPSAS 310
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEA---------------------------- 341
Y VGVQ + YLDPH +P++ L A
Sbjct: 311 HYFVGVQGNNFYYLDPHSTRPLLPFHPPSLAAATSDTPNLTASTTSVSSTTSSTTIVPPA 370
Query: 342 -----------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
D ST H+ IR + + +DPS+ + F + D+ D+
Sbjct: 371 DSIPAPSDPRQSLYPPSDLSTCHTRRIRRLQIREMDPSMLLAFLVTSEADYQDW 424
>gi|407917424|gb|EKG10733.1| Peptidase C54 [Macrophomina phaseolina MS6]
Length = 437
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 107/342 (31%), Positives = 157/342 (45%), Gaps = 54/342 (15%)
Query: 87 DAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSK-----------------------IT 122
D+ N G + F DF +R+ I+YR F I S+ +
Sbjct: 94 DSDANGGWPSPFLDDFEARVWITYRSNFAAIPKSQDPNATTAMSFSVRFRNQISNQGGFS 153
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCM+RS Q L+A AL RLGR WR+ +R IL LF D +PFSIH
Sbjct: 154 SDTGWGCMIRSGQSLLANALQVLRLGRAWRRGQDSQGERR---ILSLFADDPKAPFSIHR 210
Query: 183 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
++ G A G G W GP A R +AL+ G + + +Y+ D
Sbjct: 211 FVEHGAVACGKHPGEWFGPSATARCIQALSN--------GYEDAGLRVYITGDGSD---- 258
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
+D+ V + P L+LV + LG+++V P Y L+ + QS+GI
Sbjct: 259 -----VYEDS--FMKVAKDANNTFHPTLVLVGIRLGIDRVTPVYWEALKASLQLSQSIGI 311
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDS 358
GG+P AS Y VG Q YLDPH +P + + D + D + H+ +R +H+
Sbjct: 312 AGGRPSASHYFVGTQGSYFFYLDPHTTRPFLPLHSDLSDYTQEDIDSCHTRRLRRLHVKE 371
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
+DPS+ I F RD+ D+ ++ K E +G P+ V +
Sbjct: 372 MDPSMLIAFLIRDETDWQNW----RKAVAEVHGKPVIHVADS 409
>gi|432855098|ref|XP_004068071.1| PREDICTED: cysteine protease ATG4C-like [Oryzias latipes]
Length = 482
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 120/424 (28%), Positives = 179/424 (42%), Gaps = 96/424 (22%)
Query: 65 SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAE---------FNQDFSSRILISYRKGFD 114
S S + LLG C H A D+ D A E F +DF+SR+ ++YR+ F
Sbjct: 36 SRNSPVLLLGRCYHFKADDDGSADEASCREPEEGFSMGNVEAFRKDFTSRVWLTYREEFP 95
Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQ----------- 156
P+ S +T+D GWGC+LR+ QM++AQAL+ H LGR W +PL
Sbjct: 96 PLPGSTLTTDCGWGCLLRAGQMMLAQALVLHFLGRDWTWSEALTLQPLDTETWTASAAKR 155
Query: 157 -------------KPFDREYVE-----------------------ILHLFGDSETSPFSI 180
K DR++ E I+ FGD+ ++ +
Sbjct: 156 LVASLEASLQGSPKNSDRQHSEPQSSSQGSAEEAEAHLKEMYHRTIISWFGDTSSALLGL 215
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG------------------- 221
H L++ G G AG+W GP + + + ++GL
Sbjct: 216 HRLVRLGLTMGKNAGNWYGPAVVAHILKKAVE-EAMDSGLAGITAYVSQDCTVYSADVAD 274
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
C P A GG P +D S+ QA +++L+P+ LG EK+
Sbjct: 275 CHKPPSARQASVSPPIA--GGGP--SKEDQPGSASILPDSQA----VIILIPVRLGGEKI 326
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
NP Y ++ + +GI+GGKP + Y VG Q++S IY+DPH Q +++ D
Sbjct: 327 NPEYFEFVKNILSVEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSNGDFP- 385
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQ 399
++H + I +DPS IGFY R D+D SKL + S P FT Q
Sbjct: 386 -LQSFHCPSPKKIPFTRMDPSCTIGFYSRSLQDYDRIREELSKLLQPSTKEKYPAFTFVQ 444
Query: 400 THKK 403
H +
Sbjct: 445 GHGR 448
>gi|354470829|ref|XP_003497647.1| PREDICTED: cysteine protease ATG4C [Cricetulus griseus]
Length = 458
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 115/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENSDSDSWTSNTVK 155
Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELRTTALSLKETIGKYSDDHAVQNEIYHRKIISWFGDSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTNSSTSGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
S IGFYCR+ DF+ +K+ + S+ PLFT H + + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKILKISSKEKYPLFTFVNGHSRDFDFT 429
>gi|336467357|gb|EGO55521.1| hypothetical protein NEUTE1DRAFT_85886 [Neurospora tetrasperma FGSC
2508]
gi|350288001|gb|EGZ69237.1| hypothetical protein NEUTE2DRAFT_94213 [Neurospora tetrasperma FGSC
2509]
Length = 506
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/309 (34%), Positives = 151/309 (48%), Gaps = 50/309 (16%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 133
F DF SRI ++YR F DP S ++ SD GWGCM+RS
Sbjct: 171 FLDDFESRIWMTYRTDFAFIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 230
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A A+L RLGR WR+ D E +I+ LF D +P+S+HN ++ G A G
Sbjct: 231 GQSLLANAILIARLGREWRRGTD--LDAE-KDIIALFADDPRAPYSLHNFVKYGATACGK 287
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +ALA ++GL S G P V D
Sbjct: 288 YPGEWFGPSATARCIQALA--DEKQSGLRVYST---------------GDLPDVYEDS-- 328
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+V + + P L+LV LG++K+N Y L T PQS+GI GG+P +S Y
Sbjct: 329 -FMAVANPDGRGFQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 387
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VGVQ + YLDPH +P + +D + T H+ +R +H+ +DPS+ IGF
Sbjct: 388 VGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELDTCHTRRLRQLHIGDMDPSMLIGFLI 447
Query: 370 RDKDDFDDF 378
+D+DD+D +
Sbjct: 448 KDEDDWDTW 456
>gi|308490628|ref|XP_003107506.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
gi|308251874|gb|EFO95826.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
Length = 478
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 174/386 (45%), Gaps = 82/386 (21%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
+GL + +SR+ +YR+ F PIG + ++D GWGCMLR +QML+ + LL +GR +
Sbjct: 47 DGLEAMKKYMTSRLWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLRRHIGRHF 106
Query: 152 RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR------ 205
++K Y +IL +F D + + +SIH + Q G G W GP +
Sbjct: 107 EWDIEKT-SEVYDKILQMFFDEKDALYSIHQIAQMGVTEGKKVSEWFGPNTAAQVIKKLT 165
Query: 206 ---SWEALARCQRAETGLGCQ-SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV---- 257
W +A + L + +L MA S + + + + + ++ ++
Sbjct: 166 IFDDWSNIAVHVALDNILVKEDALTMATTYPSDN------ASYIFAVHNFLKYFTLNLTF 219
Query: 258 --FSK-GQ-----------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
F++ GQ DW P+L+++PL LGL +NP Y+P ++ F PQ +GI+G
Sbjct: 220 PNFAENGQIEKPRPSSGCTTDWRPLLVMIPLRLGLTSINPCYLPAIQKFFELPQCVGIIG 279
Query: 304 GKPGASTYIVGVQEESAIYLDPH-----------------------------DVQPVINI 334
GKP + Y VG+ YLDPH D+Q I+
Sbjct: 280 GKPNLAHYFVGIAGTKLFYLDPHHCRAKTTKRDAGVTTNTMISSITTTDAQLDIQNQIDD 339
Query: 335 GK----DDLE------------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+DLE D STYH +++ + +SIDPSLA+ +C + DFD
Sbjct: 340 SDFHKLEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEYESIDPSLALALFCETRQDFDTL 399
Query: 379 CARASKLAEESNGAPLFTVTQTHKKP 404
C K S+ P+F + K+P
Sbjct: 400 CEELQKTTLPSSVPPMFEFLE--KRP 423
>gi|53132082|emb|CAG31871.1| hypothetical protein RCJMB04_12m14 [Gallus gallus]
Length = 343
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 100/303 (33%), Positives = 139/303 (45%), Gaps = 48/303 (15%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
E D +SR+ +YRK F IG + TSD GWGCMLR QM+ AQAL+ LGR WR
Sbjct: 40 EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWIK 99
Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR---------S 206
K Y +L+ F D + S +SIH + Q G G + G W GP + + +
Sbjct: 100 GKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFDT 159
Query: 207 WEALA----------------RCQRAETGLGCQSLPM----AIYVVSGDEDGERGGAPVV 246
W +LA CQ + G + P +Y +E G R +
Sbjct: 160 WSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPAVEADVLYNGYPEEAGVRDKLSL- 218
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
W P++LL+PL LGL ++N YI TL+ F PQSLG++GGKP
Sbjct: 219 ------------------WKPLVLLIPLRLGLTEINEAYIETLKHCFMMPQSLGVIGGKP 260
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
++ Y +G E IYLDPH QP + D S + + + +DPS+A+
Sbjct: 261 NSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCLPDESFHCQHPPCRMSIAELDPSIAVV 320
Query: 367 FYC 369
C
Sbjct: 321 CSC 323
>gi|297265289|ref|XP_002799164.1| PREDICTED: cysteine protease ATG4B-like [Macaca mulatta]
Length = 358
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 107/340 (31%), Positives = 164/340 (48%), Gaps = 44/340 (12%)
Query: 94 LAEFNQDF---SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
AE+ +DF S + I RK G + TSD GWGCMLR QM+ AQAL+ LGR
Sbjct: 13 FAEY-EDFPETSEPVWILGRKYSIFTGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRD 71
Query: 151 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
WR +K Y +L+ F D + S +SIH + Q G G + G W GP + + + L
Sbjct: 72 WRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKL 131
Query: 211 ARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFS 259
A + +A+++ V +E V C D+ RHC+ F
Sbjct: 132 AVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFP 183
Query: 260 KGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +
Sbjct: 184 AGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFI 243
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP--SLAIGFYCRD 371
G ES+ + P + P+ + + H + ++P S A+GF+C+
Sbjct: 244 GYVGESSSHRVPVGLCPLRAF-------------CEQVPHARCNIVEPEGSRALGFFCKT 290
Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
+DDF+D+C + KL+ P+F + + + DVL
Sbjct: 291 EDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVL 330
>gi|410921904|ref|XP_003974423.1| PREDICTED: cysteine protease ATG4C-like [Takifugu rubripes]
Length = 468
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 117/416 (28%), Positives = 176/416 (42%), Gaps = 71/416 (17%)
Query: 65 SSTSDIWLLGVCHKI------AQDEALGDAAGNNGLA----EFNQDFSSRILISYRKGFD 114
S S + LLG C+ Q EA +A+ G+ +F +DF SRI ++YR+ F
Sbjct: 29 SRNSPVLLLGKCYHFKAEEDEGQTEACREASDEEGVMGNVEDFRRDFGSRIWLTYREEFP 88
Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE--------- 162
P+ S +TSD GWGCMLR+ QM++AQALL H LGR W + +P D E
Sbjct: 89 PLPGSSLTSDCGWGCMLRAGQMMLAQALLLHFLGRDWTWSGAMSLQPLDTETWTTSAAKR 148
Query: 163 ----------------------------------------YVEILHLFGDSETSPFSIHN 182
+ ++ FGDS ++ F +H
Sbjct: 149 LVASLESSLQASPGPSDPVVSQRQVAGSGEEAGVHTDGGFHRTLVSWFGDSPSAQFGLHR 208
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS-LPMAIYVVSGDEDGERG 241
+++ G A G AG W GP + + R G S + V S D
Sbjct: 209 MVRLGLAMGKRAGEWYGPAVVAHILKKAVEEARDPCLAGISSYVSQDCTVYSADVIDSHK 268
Query: 242 GAPVVCID----DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
+ + +S H S + D +++LVP+ LG EK NP Y + +
Sbjct: 269 ASASAAAERPDVTSSSHNSQPASASPDSRAVIILVPVRLGGEKTNPDYFNLAKSFLSLDY 328
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
+GI+GGKP + Y VG Q++S IY+DPH Q +++ D ++H + +
Sbjct: 329 CIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSPKKMPFT 386
Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKKPVNHSDVL 411
+DPS GFY R DF+ ++L + S P F Q H + + S L
Sbjct: 387 KMDPSCTFGFYSRSAQDFERIKHELTELLQPSAKEKYPAFIFVQGHGRDYDLSASL 442
>gi|119195519|ref|XP_001248363.1| cysteine protease atg4 [Coccidioides immitis RS]
gi|303321428|ref|XP_003070708.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
SOWgp]
gi|121769827|sp|Q1E5M9.1|ATG4_COCIM RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|240110405|gb|EER28563.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
SOWgp]
gi|320040173|gb|EFW22106.1| cysteine protease atg4 [Coccidioides posadasii str. Silveira]
gi|392862420|gb|EAS36938.2| cysteine protease atg4 [Coccidioides immitis RS]
Length = 432
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 138/297 (46%), Gaps = 50/297 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF S+ +YR F I S+ T+D GWGCM+RS
Sbjct: 105 FLDDFESKFWFTYRSNFPAIPKSRDPDTPLALTLSVRLRSQFLDTHGFTADTGWGCMIRS 164
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A AL LGR WR+ + +E E+L LF D+ +PFSIH + G A G
Sbjct: 165 GQSLLANALSILNLGRDWRRGSKI---KEECELLSLFADNPQAPFSIHRFVDYGASACGK 221
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R EAL+ C+ + +YV+S D + D
Sbjct: 222 HPGEWFGPSATARCIEALSN--------ECKHTDLNVYVMSDGSDVHEDQFRQIAGPDGI 273
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
R P L+L+ + LG+E V P Y LR +PQS+GI GG+P +S Y
Sbjct: 274 R-------------PTLILLGVRLGIESVTPVYWEALRAIIRYPQSVGIAGGRPSSSLYF 320
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+GVQ YLDPH +P ++ D + TYH+ +R +H+ +DPS+ IGF
Sbjct: 321 IGVQGPYFFYLDPHHTRPAVSWNPDSTLSPENLDTYHTRRLRRLHIREMDPSMLIGF 377
>gi|402080175|gb|EJT75320.1| cysteine protease ATG4 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 468
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 105/313 (33%), Positives = 145/313 (46%), Gaps = 56/313 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF SRI +SYR GF PI S T+D GWGCM+R+
Sbjct: 128 FLDDFESRIWVSYRSGFPPIPRSTDPAATSRMSFAMRLKTMTDQQAAFTTDSGWGCMIRT 187
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A LL HRLGR WR+ + +R+ +L LF D +P+SIH ++ G A G
Sbjct: 188 GQSLLANTLLSHRLGRGWRRGEKSDEERK---LLSLFADDPRAPYSIHKFVEHGAAKCGK 244
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R EALA + +Y G P V D
Sbjct: 245 YPGEWFGPSATARCIEALANTNEKT---------LRVYST--------GDLPDVYEDS-- 285
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
V + P L+LV LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 286 -FMEVARPDGKTFHPTLILVSTRLGIDKINQVYWESLTATLQMPQSVGIAGGRPSSSHYF 344
Query: 313 VGVQE------ESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
VG Q + YLDPH +P + D +D + H+ +R +H+ +DPS+
Sbjct: 345 VGAQRSDEDQGSNLFYLDPHHTRPALPYFDDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 404
Query: 364 AIGFYCRDKDDFD 376
IGF D++D++
Sbjct: 405 LIGFLITDEEDWE 417
>gi|62899783|sp|Q86ZL5.1|ATG4_PODAS RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|27802993|emb|CAD60696.1| unnamed protein product [Podospora anserina]
Length = 500
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 64/310 (20%)
Query: 97 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
F DF SRI ++YR GF+ I GD + +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A A+L R GR WR+ +RE I+ LF D +P+SI N + G A G
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIERE---IVCLFADDPRAPYSIQNFVNHGAAACGK 289
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI---YVVSGDEDGERGGAPVVCID 249
G W GP A ARC + + LP ++ + + DG
Sbjct: 290 YPGEWFGP-------SATARCIHSLRVYLTRDLPEVYEDNFMSTANPDGNH--------- 333
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
+ P L+LV LG++K+NP Y L T PQ++GI GG+P +S
Sbjct: 334 ---------------FHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSS 378
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGK---DDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
Y +G Q + YLDPH +P + + D + + H+ +RH+H++ +DPS+ IG
Sbjct: 379 HYFIGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSCHTRRLRHLHVEDMDPSMLIG 438
Query: 367 FYCRDKDDFD 376
F +D+DD+D
Sbjct: 439 FLIKDEDDWD 448
>gi|30109219|gb|AAH41862.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
gi|119623096|gb|EAX02691.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
[Homo sapiens]
gi|119623098|gb|EAX02693.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
[Homo sapiens]
Length = 321
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 92/291 (31%), Positives = 147/291 (50%), Gaps = 32/291 (10%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 1 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60
Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 61 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 115
Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
D G+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 116 ADTAGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 168
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 169 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 228
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 229 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 278
>gi|37991904|gb|AAR06350.1| putative autophagy, 3'-partial [Oryza sativa Japonica Group]
Length = 207
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 83/161 (51%), Positives = 105/161 (65%), Gaps = 7/161 (4%)
Query: 14 FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
F + NRSL S ++R+ GSM R LG S+ + ++SD+W L
Sbjct: 53 FEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 105
Query: 74 GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
G C+K++ +E + +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 106 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 165
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 174
SQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE
Sbjct: 166 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSE 206
>gi|170036509|ref|XP_001846106.1| Autophagy-specific protein [Culex quinquefasciatus]
gi|167879174|gb|EDS42557.1| Autophagy-specific protein [Culex quinquefasciatus]
Length = 379
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 115/336 (34%), Positives = 172/336 (51%), Gaps = 30/336 (8%)
Query: 77 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 136
H+I L +A L + +D SR+ +YR+GF PIG S+ TSD GWGCMLR QM
Sbjct: 13 HRIRCIFGLSNALETLDLDQIRRDVQSRLWCTYRRGFVPIGGSQHTSDKGWGCMLRCGQM 72
Query: 137 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA-AG 195
++AQALL LGR W + D Y+ I++ F D++ +PFS+H + G++ G
Sbjct: 73 VLAQALLQLHLGRDWEWTAETR-DETYLRIVNRFEDNKAAPFSLHQIALTGESSEEKRVG 131
Query: 196 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 255
W GP + + + L + + ++V + D+ C
Sbjct: 132 EWFGPNTVAQVLKKLVKFD--------DWCSVVVHVALD---------STLATDEVVELC 174
Query: 256 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
S W P+LL++PL LGL ++NP Y+ L+ F + G++GG+P + Y +G
Sbjct: 175 EDKSDAGTSWKPLLLIIPLRLGLSEINPIYVAGLKKCFELAGNCGMIGGRPNQALYFIGY 234
Query: 316 QEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
+ A++LDPH VQ NIG D+ E D S +H R I+ ++DPSLA+ F C
Sbjct: 235 VGDEALFLDPHTVQRSGNIGDKTGLDEREMDES-FHQRYARRINFKAMDPSLALCFLCAT 293
Query: 372 KDDFDDFCARASKLAEESNGAP---LFTVTQTHKKP 404
+ +FDD AR AE+ NG LF VT+T + P
Sbjct: 294 RTEFDDLLAR---FAEDLNGGSCQGLFEVTKTRQAP 326
>gi|157818033|ref|NP_001101418.1| cysteine protease ATG4C [Rattus norvegicus]
gi|149044549|gb|EDL97808.1| similar to APG4 autophagy 4 homolog C [Rattus norvegicus]
Length = 458
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 115/409 (28%), Positives = 172/409 (42%), Gaps = 80/409 (19%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKVLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG------------------------- 148
I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIESSDSDSWTSNTIH 155
Query: 149 -------------RPWRKPL--------QKPFDRE------YVEILHLFGDSETSPFSIH 181
R R P + P D + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELRTPAVSLKETSGKHPDDHAVQSEIYHRQIISWFGDSPVAVFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNIDYLEFVKGVLSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
S IGFYCR+ DF+ +K+ + S+ PLFT H + + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSRDFDFT 429
>gi|342877133|gb|EGU78640.1| hypothetical protein FOXB_10826 [Fusarium oxysporum Fo5176]
Length = 449
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/372 (31%), Positives = 171/372 (45%), Gaps = 59/372 (15%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 116
+A D+ D +G F DF SRI ++YR FDPI
Sbjct: 99 LAYDDQSNDGGWPSG---FITDFESRIWMTYRSEFDPIPRSTNPQATSSLSLSMRLKSQL 155
Query: 117 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 175
GD S +SD GWGCM+RS Q L+A + RLGR WR Q E IL F D
Sbjct: 156 GDQSPFSSDSGWGCMIRSGQSLLANTIALVRLGRDWR---QGQSLEEECRILKDFADDPR 212
Query: 176 SPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
+P+SIH+ ++ G A G G W GP A R +ALA +I V S
Sbjct: 213 APYSIHSFVRHGASACGKYPGEWFGPSATARCIQALANSHEP-----------SIRVYST 261
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
G P V DD + + G+A + P L+LV LGL+K+ P Y L
Sbjct: 262 ------GDGPDVYEDDFMKIAN--PTGEA-FHPTLVLVGTRLGLDKITPVYWEALIAALQ 312
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVI 351
PQS+GI GG+P +S Y +G Q YLDPH +P + ++ ++ + + H+ +
Sbjct: 313 MPQSVGIAGGRPSSSHYFIGSQGSFLFYLDPHHTRPALPYHENPMDYTSEEIESCHTARL 372
Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
R IH+ +DPS+ IGF R ++D+ D+ + + G + V Q + V+
Sbjct: 373 RRIHVREMDPSMLIGFLIRSEEDWQDW----KRSVKHVQGKSIIHVAQ--RNAVHGGSSE 426
Query: 412 GETGGVPEDDSL 423
G G + E ++L
Sbjct: 427 GREGAIDEVETL 438
>gi|74665877|sp|Q4U3V5.1|ATG4_CRYPA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|66576169|gb|AAY51673.1| putative cysteine protease Atg4 [Cryphonectria parasitica]
Length = 459
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 152/332 (45%), Gaps = 60/332 (18%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 120
+A DE L DA F DF SR+ ++YR F+PI S
Sbjct: 109 LAYDELLEDAGWP---IAFLDDFESRVWMTYRSEFEPISKSNDPRASAALSFAMRLRTLA 165
Query: 121 ----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
+SD GWGCM+RS Q L+A L+ +LGR WR+ R+ EIL F D +
Sbjct: 166 DQGGFSSDTGWGCMIRSGQSLLANTLVICQLGRDWRRGKAA---RQEREILARFADDPRA 222
Query: 177 PFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
P+S+HN ++ G A G G W GP A R +ALA + + +Y
Sbjct: 223 PYSLHNFVRHGAVACGKFPGEWFGPSATARCIQALANSNESS---------LRVYST--- 270
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
G P V D +V + P L+LV LG++K+N Y L T
Sbjct: 271 -----GDLPDVYEDS---FMAVAKPDGETFHPTLILVGTRLGIDKINQVYWEALTATLQM 322
Query: 296 PQSLGIVGGKPGASTYIVGVQEES--------AIYLDPHDVQPVINIGKD---DLEADTS 344
PQS+GI GG+P AS Y +G Q YLDPH +P + +D D +
Sbjct: 323 PQSVGIAGGRPSASHYFIGAQRSGDAYEPGSYLFYLDPHCTRPALPFHEDVDQYTSDDIN 382
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 376
T H+ +R +H+ +DPS+ IGF +D+DD+D
Sbjct: 383 TCHTRRLRRLHVRDMDPSMLIGFLIKDEDDWD 414
>gi|440638438|gb|ELR08357.1| hypothetical protein GMDG_03152 [Geomyces destructans 20631-21]
Length = 448
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/307 (32%), Positives = 148/307 (48%), Gaps = 49/307 (15%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 133
F DF S++ SYR GF DP S ++ SD GWGCM+RS
Sbjct: 108 FLDDFESKLRFSYRTGFPVIPRSEDPKASSTMSFSVRLRSQLSDQGGFSSDTGWGCMIRS 167
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A +++ RL R WR+ + + +RE I+ LF D +P+SIH ++ G +A G
Sbjct: 168 GQSLLANSMVILRLSRGWRRGVGRDKERE---IVSLFADDPRAPYSIHKFVEHGAEACGK 224
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R + LA+ +S + +Y+ D + G V D
Sbjct: 225 YPGQWFGPSATARCIQELAKRH--------ESADVRVYITGDGSDVYKDGFMSVAKPDG- 275
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
++ P L+LV LG++KV P Y L+ + PQS+GI GG+P +S Y
Sbjct: 276 ----------VNFKPTLILVGTRLGIDKVTPVYWEALKASLQMPQSVGIAGGRPSSSHYF 325
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VGVQ YLDPH I D E A+ + H+ +R + + +DPS+ IGF
Sbjct: 326 VGVQGSHFFYLDPHQTMAAIPFHTDVDEYTPAEIDSCHTRRLRRLDIKEMDPSMLIGFLI 385
Query: 370 RDKDDFD 376
RD+ D++
Sbjct: 386 RDEKDWE 392
>gi|50543736|ref|XP_500034.1| YALI0A13277p [Yarrowia lipolytica]
gi|62899740|sp|Q6CH28.1|ATG4_YARLI RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49645899|emb|CAG83963.1| YALI0A13277p [Yarrowia lipolytica CLIB122]
Length = 545
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 124/393 (31%), Positives = 167/393 (42%), Gaps = 98/393 (24%)
Query: 96 EFNQDFSSRILISYRKGF--------------------------DPIGDSKITSDVGWGC 129
+F D SRI +SYR GF DP G TSDVGWGC
Sbjct: 64 DFLADVQSRIWLSYRTGFPLIPKSDGSGTIHLGKLKNMIRGGGFDPRG---YTSDVGWGC 120
Query: 130 MLRSSQMLVAQALLFHRLGRPWR----------------------------KPLQKPFDR 161
M+R+SQ L+A ALLF LGR WR K +
Sbjct: 121 MIRTSQSLLANALLFRHLGRGWRWNKGDDFVYLSEGNTESRGGESRNGGANKEQETAVSE 180
Query: 162 EYV----EILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRA 216
E I+ F DS SPFSIH ++ G KA AG W GP A S AL
Sbjct: 181 ETAVSEETIISWFLDSPDSPFSIHKFVRHGEKACSTPAGDWFGPSAAGSSIYAL------ 234
Query: 217 ETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
C P + +Y +G GG V D+ + G P+L+L
Sbjct: 235 -----CNEFPDSGLKVYY-----NGNGGGD--VYEDE------LLETG----FPLLVLCG 272
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L LG++ VNP Y +LR + PQS+GI GG+P S Y G Q E YLDPH +P +
Sbjct: 273 LRLGIDNVNPIYWDSLRQMLSLPQSVGIAGGRPFTSHYFFGFQGEQLFYLDPHQPKPAVK 332
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
+ DT+++HS I +HL +DPS+ +GFY + D++ F + E+++
Sbjct: 333 T----TDKDTTSFHSSRIWKLHLKEMDPSMLVGFYITSEADWETFKGSLTASKEKTSSQI 388
Query: 394 LFTVTQTHKKP-VNHSDVLGETGGVPEDDSLGV 425
+ H P + D GG +DD + V
Sbjct: 389 VHIHPSRHNIPSFDEEDEYVSIGGASDDDFVDV 421
>gi|158296556|ref|XP_316946.4| AGAP008497-PA [Anopheles gambiae str. PEST]
gi|157014766|gb|EAA12240.4| AGAP008497-PA [Anopheles gambiae str. PEST]
Length = 389
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 117/346 (33%), Positives = 170/346 (49%), Gaps = 34/346 (9%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + +W+LG + + D L QD SR+ +YR+GF PIG++++T
Sbjct: 21 IPKTNDTVWILGKQYNASDD-----------LEAIRQDVQSRLWCTYRRGFVPIGNTQLT 69
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D GWGCMLR QM++AQALL LGR W + D Y+ I++ F DS+ +PFS+H
Sbjct: 70 TDKGWGCMLRCGQMVLAQALLQLHLGRDWVWEAETR-DDIYLNIVNRFEDSKQAPFSLHQ 128
Query: 183 L-LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
+ L + G W GP + + + L + C+ + I+V +
Sbjct: 129 IALMGDSSEEKRIGEWFGPNTVAQVLKKLVKFDD-----WCR---LVIHVALDN------ 174
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D+ C V K W P+LL++PL LGL +VNP YI L+ F P S G+
Sbjct: 175 ---TVATDEIVELC-VDKKEPEAWKPLLLIIPLRLGLSEVNPIYIEGLKKCFQLPGSCGM 230
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDS 358
+GG+P + Y +G A+YLDPH VQ V +G A+ T+H I S
Sbjct: 231 IGGRPNQALYFIGYVGGEALYLDPHTVQRVGTVGSKQDPAEQELDETFHQRYASRISFTS 290
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
+DPSLA+ F C + FD AR + LF VT+T + P
Sbjct: 291 MDPSLAVCFLCVSRQQFDQLVARFNDSVNGGTSQALFEVTKTRQAP 336
>gi|332029697|gb|EGI69576.1| Cysteine protease ATG4B [Acromyrmex echinatior]
Length = 383
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 107/346 (30%), Positives = 166/346 (47%), Gaps = 48/346 (13%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 127
+W+LG + ++ L +D S++ +YRKGF PIG +S TSD GW
Sbjct: 23 VWILGRKYNAIKE-----------LDAIRRDIRSKLWFTYRKGFVPIGGCNSTFTSDKGW 71
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
GCMLR QM++AQAL+ LG+ W+ + + + Y++IL F D + FSIH + G
Sbjct: 72 GCMLRCGQMVLAQALITLHLGKDWQW-MPETKNNTYLKILRRFEDKRAAAFSIHQIALMG 130
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
+ G G W GP + + + L + + I+V + +
Sbjct: 131 ASEGKEVGQWFGPNTIAQVLKKLIVYDEWSS--------LTIHVALDN---------TLI 173
Query: 248 IDDASRHCSVFS------------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
++D R C V + + W P+LLL+PL LGL ++NP YI L+ +F
Sbjct: 174 VNDILRQCRVEGGVTAEADGEIPLRAPSQWKPLLLLIPLRLGLSEINPVYINGLKTSFKI 233
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHSDVI 351
QSLG++GGKP + Y +G + IYLDPH Q I ++++E D S YH
Sbjct: 234 SQSLGVIGGKPNLALYFIGCVGDEVIYLDPHTTQKSGSIEDKISEEEIEMDIS-YHCKSA 292
Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
I + +DPS+A+ F+C + +F C + PLF +
Sbjct: 293 SRIPITGMDPSVALCFFCATEKEFKSLCKSMQEELILPEKQPLFEL 338
>gi|313228003|emb|CBY23152.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 102/289 (35%), Positives = 143/289 (49%), Gaps = 29/289 (10%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
L + DF SR+ +YR+ F IG S TSD GWGCMLR+ QMLVA+ LL RLGR +
Sbjct: 39 LEDIQGDFQSRLWFTYRRNFASIGGSGPTSDQGWGCMLRAGQMLVAECLLRQRLGRNYVW 98
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
D Y EIL LF D+ ++ S+ + L A A G W GP M + L R
Sbjct: 99 SESSIEDERYTEILELFRDTHSAELSLQQIALTGATAEKRAVGEWFGPNTMA---QVLKR 155
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
++ +SL + V VV ++D S + + G+ TP++L++
Sbjct: 156 ITKS------RSLGFGVTVAMDS---------VVSVEDVS--AEIINGGKP--TPLVLMI 196
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA----IYLDPHDV 328
PL LGL VN Y+ L++ +GI+GGKP + Y VG QE +YLDPH
Sbjct: 197 PLRLGLNSVNEIYVNPLKIFLASKYCVGIMGGKPNQAHYFVGYQETVEDTWLLYLDPHTT 256
Query: 329 Q--PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
Q PV E + H+D + I +DPSLA+GF+ ++F
Sbjct: 257 QQSPVSVNNNMPFEQFDKSLHTDKLCWIKALKLDPSLAVGFFFNTVEEF 305
>gi|403356037|gb|EJY77606.1| Cysteine protease family C54 putative [Oxytricha trifallax]
gi|403376523|gb|EJY88241.1| Cysteine protease family C54 putative [Oxytricha trifallax]
Length = 480
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 159/312 (50%), Gaps = 41/312 (13%)
Query: 101 FSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFH----RLGRPWRKPL 155
F S +YR + PIG S SD GWGCM+R+ QML+ QA++ H L + + +
Sbjct: 154 FKSVTWFTYRNELELPIGSSTYHSDAGWGCMVRTGQMLLFQAMMRHVFEDNLKYEYIEKI 213
Query: 156 QKPFDREYVEILHLF---GDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
+ + EY+ +L LF G+ + SP+SI N+ G G W GP A+ + L +
Sbjct: 214 TE-YREEYLNLLRLFQDNGEGQFSPYSIQNIAFQGLKIDRKPGDWYGPQAISIVLKRLTK 272
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILLL 271
+ P+ + + VC++ + + +V + DWT + ++
Sbjct: 273 IYK----------PVKQFTM------------YVCLE-GNIYLNVIQEKSKDWTQSVFIV 309
Query: 272 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES--AIYLDPHDVQ 329
+PL LGL + P Y+ +++ FTFPQ++GI GG+ ++ Y +G+ + S IYLDPH VQ
Sbjct: 310 IPLRLGLNYIEPEYLSSVKKVFTFPQNVGIAGGRENSALYFIGISDSSNNLIYLDPHLVQ 369
Query: 330 ---PVINIGKDD-LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
P N+ ++ S++H + + L+ + S+AIGFY RD +DF DF R L
Sbjct: 370 KSVPTCNMQTNEQFYQYESSFHCTKFKKMPLNRMCTSVAIGFYIRDYNDFLDFQTRIKSL 429
Query: 386 AEESNGAPLFTV 397
+ N +FTV
Sbjct: 430 SSGENS--IFTV 439
>gi|327270876|ref|XP_003220214.1| PREDICTED: cysteine protease ATG4C-like [Anolis carolinensis]
Length = 459
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 118/437 (27%), Positives = 183/437 (41%), Gaps = 84/437 (19%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAA-GNN----------GLAEFNQDFSSRILISYRKGF 113
S S ++LLG C+ DE + G+N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKTDEPTEQSPNGSNYDVTEEEVSRNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------------------ 155
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIKGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWPDALVNENPESESWTSHTVK 155
Query: 156 ------------QKPFDREYV----------------------EILHLFGDSETSPFSIH 181
+K F + + +I+ FGDS + F +H
Sbjct: 156 KLTASFEASLIGEKEFKNQSIPPRQIRKRDWGKRESRDEHYHRKIVSWFGDSPLANFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ G G AG W GP + L R + E + + +YV
Sbjct: 216 RLIEYGNKSGKMAGDWYGPAVVAH----LLR-KAVEEAKDPELQGITVYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D CS+ + +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYKSDVVEMQCSLKDSEKPGAKSVIILIPVRLGGERTNMEYLEFVKGILSLEYCIGI 322
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
VGG+P S Y G Q++S IY+DPH Q +++ + + ++H + + +DP
Sbjct: 323 VGGRPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKNFPLE--SFHCPSPKKMSFKKMDP 380
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNH--SDVLGETGGV 417
S IG YC + F+ +K+ + S+ PLFT H K + S V E
Sbjct: 381 SCTIGLYCPNVQGFERASEEITKILKASSKEKYPLFTFVNGHSKDYDFMMSPVQEEKALF 440
Query: 418 PEDDS--LGVMSMNDAV 432
ED++ L S D V
Sbjct: 441 SEDENKKLKRFSTEDFV 457
>gi|355755452|gb|EHH59199.1| Cysteine protease ATG4D, partial [Macaca fascicularis]
Length = 427
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 104/382 (27%), Positives = 169/382 (44%), Gaps = 66/382 (17%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 37 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 86
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 87 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 146
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 147 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 202
Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
+A R + + +YV + A +V D + A+
Sbjct: 203 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 249
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G
Sbjct: 250 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGXXXXXXXXXX 309
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 310 XXXCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 367
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 368 VLGSSSATERYPMFTLAEGHAQ 389
>gi|357528776|sp|Q5B7L0.2|ATG4_EMENI RecName: Full=Cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|259485537|tpe|CBF82642.1| TPA: Cysteine protease atg4 (EC 3.4.22.-)(Autophagy-related protein
4) [Source:UniProtKB/Swiss-Prot;Acc:Q5B7L0] [Aspergillus
nidulans FGSC A4]
Length = 402
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 120/390 (30%), Positives = 178/390 (45%), Gaps = 68/390 (17%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGV-----CHKIAQDEALGDAAGNN--------GLA 95
+RI + + P S IW LG C + DE+ G G
Sbjct: 11 KRIIQYIWDPEPKNDEEPGSPIWCLGTRYPPQCVEETADESRNPDHGQQQNTNTSAPGWP 70
Query: 96 E-FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 131
E F DF S+I ++YR F PI TSD GWGCM+
Sbjct: 71 EAFLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMI 130
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 190
RS Q L+A ++ LGR WR+ + E ++L LF DS +PFSIH+ ++ G +
Sbjct: 131 RSGQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFC 187
Query: 191 GLAAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + LA R ++ + +Y+ + D + V D
Sbjct: 188 GKHPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRD 238
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
+ KG P L+L+ L LG++++ Y L+ PQS+GI GG+P AS
Sbjct: 239 E---------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSAS 287
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
Y V VQ YLDPH+ +P + + E + +TYH+ +R +++ +DPS+ IGF
Sbjct: 288 HYFVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDPSMLIGF 347
Query: 368 YCRDKDDFDDFCARASKLAEESNGAPLFTV 397
RD+DD++D+ AR L G P+ T+
Sbjct: 348 LIRDEDDWEDWKARIMSL----EGKPIITI 373
>gi|346975631|gb|EGY19083.1| peptidase family C54 protein [Verticillium dahliae VdLs.17]
Length = 449
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 149/327 (45%), Gaps = 52/327 (15%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 120
+A DEA+ G + F DF S+ ++YR F+PI S
Sbjct: 98 LAYDEAMNQDGG--WPSAFLDDFESKFWMTYRSDFEPIAKSTDPRAASVLSLSMRIKSQF 155
Query: 121 -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 175
+SD GWGCM+RS Q L+A A+ LGR WR+ + +R+ +L F D
Sbjct: 156 MDQAGYSSDSGWGCMIRSGQSLLANAMAVLDLGRDWRRGVAAEKERQ---LLSKFADDPK 212
Query: 176 SPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
+P+SIH +Q G A G G W GP A R +AL + +Y
Sbjct: 213 APYSIHRFVQHGAVACGKYPGEWFGPSATARCIQALVNANEPH---------LRVYST-- 261
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
G P V D R + + P L+LV LG++K+ P Y L
Sbjct: 262 ------GDGPDVYED---RFFDIAKPSGETFHPTLILVGTRLGIDKITPVYWDALIAALQ 312
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVI 351
PQS+GI GG+P +S Y +G Q YLDPH + + +D +AD + H+ +
Sbjct: 313 MPQSIGIAGGRPSSSHYFIGAQGSFLFYLDPHHTRTALPYYQDPTLYAQADVDSVHTRRL 372
Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDF 378
R +H+ +DPS+ IGF D+DD+D++
Sbjct: 373 RRLHVREMDPSMLIGFVIHDEDDWDEW 399
>gi|358336800|dbj|GAA27956.2| autophagy-related protein 4 [Clonorchis sinensis]
Length = 507
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 105/304 (34%), Positives = 148/304 (48%), Gaps = 52/304 (17%)
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-----KPLQKPFDREYVEILHLFGD--SE 174
TSD GWGCM+RS QML+AQ L+ H LGR WR P++ P D + +++ F D S+
Sbjct: 183 TSDSGWGCMIRSGQMLLAQTLMIHLLGRDWRAFRGTSPIKTPEDHLHRQLIRWFHDCWSQ 242
Query: 175 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMC-----------RSWEALARCQ--------- 214
SPFS+H L+QA G GSW GP +C R +E LAR
Sbjct: 243 ESPFSLHRLVQAS---GQLPGSWFGPATLCSALVKVMSDASRRFEELARVHIYWVRDRVI 299
Query: 215 -RAET-----GLGCQSLPMAIYVVSGDEDGERGGA-------PVVCIDD---ASRHCSVF 258
R E G + P + E+ + + P + D +S ++F
Sbjct: 300 YREEIMNLARGQPVRRKPGRLNFTDFSENFQHCCSQECSPPIPPTYLQDGIQSSPSTTLF 359
Query: 259 SKGQADWTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 317
++LL+P+ LGL+K ++ RY+P + P +GI+GG+P S YI+G Q
Sbjct: 360 PSHA-----VILLLPIRLGLDKRIDARYVPMVCRLVRDPCFVGIIGGRPRHSIYILGCQN 414
Query: 318 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 377
I+LDPH QPV+ D E + T+H V R I +DPS A+GFYCR + D D
Sbjct: 415 TQLIHLDPHFTQPVVRNVVDSEEFNVKTWHCLVPRVIEAAKLDPSCAVGFYCRSRGDLSD 474
Query: 378 FCAR 381
R
Sbjct: 475 LLER 478
>gi|56118282|ref|NP_001007883.1| cysteine protease ATG4C [Xenopus (Silurana) tropicalis]
gi|61211764|sp|Q68EP9.1|ATG4C_XENTR RecName: Full=Cysteine protease ATG4C; AltName:
Full=Autophagy-related protein 4 homolog C
gi|51258902|gb|AAH80152.1| apg4c protein [Xenopus (Silurana) tropicalis]
gi|89269108|emb|CAJ81923.1| APG4 autophagy 4 homolog C (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
Length = 450
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 111/403 (27%), Positives = 172/403 (42%), Gaps = 95/403 (23%)
Query: 67 TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 114
S ++LLG C+ +++ D N+G + EF +DF SRI ++YR+ F
Sbjct: 38 NSPVFLLGKCYHFKYEDSSVTSDGGSNSGSESKEDLSGNVDEFRKDFISRIWLTYREEFP 97
Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 151
I S T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 98 QIETSSWTTDCGWGCTLRTGQMLLAQGLIVHFLGRDWTWTEALDIFSSESEFWTANTARK 157
Query: 152 -------------------RKPLQ---KPFDRE--YVEILHLFGDSETSPFSIHNLLQAG 187
++PL K + E + +I+ F D + F +H L++ G
Sbjct: 158 LTPSLETSFSENNECVSSNKQPLHNCDKKSNSEDFHQKIISWFADYPLAYFGLHQLVKLG 217
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
K G AG W GP + L R E+ D E G +
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256
Query: 248 IDDASRHCSVFSKGQADW-------TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
D C+++S D +++LVP+ LG E+ N Y ++ + +G
Sbjct: 257 AQD----CTIYSADVYDLQCNKGTEKAVVILVPVRLGGERTNMEYFEFVKGILSLEFCIG 312
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I+GGKP S Y VG Q++S IY+DPH Q +++ + + ++H + + +D
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSVKNFPLE--SFHCPSPKKMSFKKMD 370
Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTH 401
PS IGFYCR+ +F+ +K+ + S PLFT H
Sbjct: 371 PSCTIGFYCRNAREFEKAAEELTKVLKSSTKQNYPLFTFVNGH 413
>gi|195159572|ref|XP_002020652.1| GL15485 [Drosophila persimilis]
gi|194117602|gb|EDW39645.1| GL15485 [Drosophila persimilis]
Length = 409
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 108/345 (31%), Positives = 166/345 (48%), Gaps = 42/345 (12%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D GWGCMLR QM++AQAL+ LGR W + D Y++I++ F D S +SIH
Sbjct: 92 TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ G++ A G W+GP + + + L L + ++V
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMDS------- 195
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V +DD C +G A W P+LL++PL LG+ +NP YIP L+ S G++
Sbjct: 196 --TVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 359
GG+P + Y +G E+ +YLDPH Q +G+ + TYH + ++
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQKTGVVGQKTSSGEQEHDETYHQKHAARLSFSAM 309
Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTH 401
DPSLA+ F C+ D F KL +E G LF ++QT
Sbjct: 310 DPSLAVCFLCKTSDSFQQL---LDKLRQEVLGMCSPALFEISQTR 351
>gi|125986465|ref|XP_001356996.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
gi|54645322|gb|EAL34062.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
Length = 409
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 104/342 (30%), Positives = 163/342 (47%), Gaps = 36/342 (10%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D GWGCMLR QM++AQAL+ LGR W + D Y++I++ F D S +SIH
Sbjct: 92 TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ G++ A G W+GP + + + L L + ++V
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMDS------- 195
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
V +DD C +G A W P+LL++PL LG+ +NP YIP L+ S G++
Sbjct: 196 --TVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 359
GG+P + Y +G E+ +YLDPH Q +G+ + TYH + ++
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQRTGVVGQKTSSGEQEHDETYHQKHAARLSFSAM 309
Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
DPSLA+ F C+ D F + + LF ++QT
Sbjct: 310 DPSLAVCFLCKTSDSFQQLLEKLRQEVLGMCSPALFEISQTR 351
>gi|341903727|gb|EGT59662.1| CBN-ATG-4.1 protein [Caenorhabditis brenneri]
Length = 433
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/329 (29%), Positives = 150/329 (45%), Gaps = 59/329 (17%)
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 181
TSD GWGCMLR +QML+ + LL +GR + ++ Y +IL +F D + + +SIH
Sbjct: 49 TSDQGWGCMLRCAQMLLGEVLLRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIH 107
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQ-SLPMAIYV 231
+ Q G G W GP + W +A + L + +L MA
Sbjct: 108 QIAQMGVTEGKEISKWFGPNTAAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTY 167
Query: 232 VSGD------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
S D E+G+ +H + + + +W P+LL++PL LGL +N Y
Sbjct: 168 PSEDAVKLIMENGQ-----------VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCY 216
Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV-------------- 331
+P ++ F PQ +GI+GGKP + Y VG+ YLDPH +P
Sbjct: 217 LPAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTES 276
Query: 332 ----INIGK-DDLE------------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
N + +DLE D STYH +++ + +SIDPSLA+ +C ++D
Sbjct: 277 EQHDTNFSELEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESRED 336
Query: 375 FDDFCARASKLAEESNGAPLFTVTQTHKK 403
FD+ C K ++ P+F + K
Sbjct: 337 FDNLCQELQKTTLPASKPPMFEFLEKRPK 365
>gi|296415785|ref|XP_002837566.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633439|emb|CAZ81757.1| unnamed protein product [Tuber melanosporum]
Length = 409
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 101/313 (32%), Positives = 150/313 (47%), Gaps = 49/313 (15%)
Query: 97 FNQDFSSRILISYRKGFDPI---------------------GDSKITSDVGWGCMLRSSQ 135
F +DF S + ++YR F PI TSD GWGCM+RS Q
Sbjct: 86 FLEDFESTLWMTYRSDFKPIPRVADYNDKLTFLTSIRSHLDKAEGFTSDSGWGCMIRSGQ 145
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAA 194
++A AL RLGR WR+ + KP E +L LF D +PFSIH ++ G+ G
Sbjct: 146 AVIANALAHLRLGRGWRRGM-KP--EEEKRLLALFADDPRAPFSIHKFVRHGEVECGKNP 202
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 254
G W GP A A C +A T + + +Y + ++ E V ++
Sbjct: 203 GEWFGP-------SAAAMCIQALTH-AYEPAGLRVYQTNSNDLYEEDFRKVAVVN----- 249
Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
VF P L+L + LG+E++ Y L PQ++GI GG+P +S Y +
Sbjct: 250 -GVFK-------PTLVLAGIRLGIERITNIYYEPLAACLRMPQTVGIAGGRPSSSHYFIA 301
Query: 315 VQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
VQ E+ YLDPH +P++ +D E + T H+ IR +H+ +DPS+ I F RD
Sbjct: 302 VQGENFFYLDPHTCRPILPFKENPQDYTEEEVDTCHTRRIRRLHIREMDPSMLIAFLIRD 361
Query: 372 KDDFDDFCARASK 384
+ D++D+ R S+
Sbjct: 362 EADWEDWQRRISE 374
>gi|308491308|ref|XP_003107845.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
gi|308249792|gb|EFO93744.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
Length = 518
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/318 (32%), Positives = 155/318 (48%), Gaps = 49/318 (15%)
Query: 87 DAAG-NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
DA G ++G +F D+ SR+ I+YR F P+ ++ T+D GWGCM+R++QM+VAQA++ +
Sbjct: 159 DANGVSSGFEDFCSDYYSRLWITYRTDFAPLLNTDTTTDCGWGCMIRTTQMMVAQAIMLN 218
Query: 146 RLGRPWRKPLQKP-----------FDREYVE---ILHLFGDSETSPFSIHNLLQ--AGKA 189
R GR WR +K FDRE ++ IL LF D +SP IH +++ A +
Sbjct: 219 RFGREWRFVRRKKSYVTINGEETDFDREKIKEWMILKLFEDKPSSPLGIHRMVEISAKEK 278
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
A GSW P EA+ ++A L +I ++GD A + I
Sbjct: 279 GKKAVGSWYSPS------EAVFIMKKA--------LTESISPLTGD------TAMYLSI- 317
Query: 250 DASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
D H +W L+LV +V LG ++NP Y+P L F+ LG+ GG+P
Sbjct: 318 DGRVHIRDIEVETKNWMKTLILVIVVRLGAAELNPIYVPHLMRLFSMESCLGVTGGRPDH 377
Query: 309 STYIVGVQEESAIYLDPHDVQPVINI----------GKDDLEADTSTYHSDVIRHIHLDS 358
S + VG + IYLDPH I I K + +YH ++ +H
Sbjct: 378 SCWFVGFYGDQIIYLDPHVAHEYIPIDMNFNVNMTDNKKSKKCPERSYHCRLLSKMHFLD 437
Query: 359 IDPSLAIGFYCRDKDDFD 376
+DPS A+ F ++ FD
Sbjct: 438 MDPSCALCFRFESREQFD 455
>gi|17544636|ref|NP_502208.1| Protein ATG-4.2 [Caenorhabditis elegans]
gi|5824904|emb|CAB54515.1| Protein ATG-4.2 [Caenorhabditis elegans]
Length = 521
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/340 (31%), Positives = 159/340 (46%), Gaps = 56/340 (16%)
Query: 68 SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 127
+D+ LG + + DE+ +G F D+ SR+ I+YR F + D+ T+D GW
Sbjct: 146 NDVVFLGRRYSTSVDES----GLRSGFENFCSDYYSRLWITYRTDFPALLDTDTTTDCGW 201
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQK-----------PFDREYVE---ILHLFGDS 173
GCM+R++QM+VAQA++ +R GR WR +K FDRE ++ IL LF D
Sbjct: 202 GCMIRTTQMMVAQAIMVNRFGRDWRFTRRKRSHVAAHGDEDDFDREKIQEWMILKLFEDK 261
Query: 174 ETSPFSIHNLL---QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 230
T+P IH ++ GK A GSW P EA+ ++A L S P+
Sbjct: 262 PTAPLGIHKMVGIAAMGKGKK-AVGSWYSPS------EAVFIMKKA---LTESSSPLT-- 309
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTL 289
G ++ D H +W L+LV +V LG ++NP Y+P L
Sbjct: 310 ----------GNTAMLLSIDGRVHIRDIEVETKNWMKKLILVIVVRLGAAELNPIYVPHL 359
Query: 290 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH--------DVQPVINI----GKD 337
F LGI GG+P S++ VG + IYLDPH D+ P N+ K
Sbjct: 360 MRLFAMESCLGITGGRPDHSSWFVGYYGDQIIYLDPHVAHEYIPIDINPNTNVVDSDSKK 419
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 377
+ +YH ++ +H +DPS A+ F ++ FD+
Sbjct: 420 AKKCPEKSYHCRLLSKMHFFDMDPSCALCFQFESREQFDN 459
>gi|389637385|ref|XP_003716330.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
gi|148887340|sp|Q523C3.2|ATG4_MAGO7 RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|351642149|gb|EHA50011.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
Length = 491
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R ALA +Y G P V D
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367
Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 427
Query: 364 AIGFYCRDKDDF 375
IGF D++++
Sbjct: 428 LIGFLILDEENW 439
>gi|393247625|gb|EJD55132.1| hypothetical protein AURDEDRAFT_78065 [Auricularia delicata
TFB-10046 SS5]
Length = 989
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 96/275 (34%), Positives = 131/275 (47%), Gaps = 42/275 (15%)
Query: 97 FNQDFSSRILISYRKGFDPI-----------------------------GDSKITSDVGW 127
F DF+SR+ ++YR F PI G+ TSD GW
Sbjct: 314 FYADFTSRVWLTYRSQFSPIHDCPLSACKGKDLESLDANPPKRTFWPGSGEKTWTSDAGW 373
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPL---QKPFDREYVEILHLFGDSET--SPFSIHN 182
GCMLR+ Q L+A L+ LGR WR+P P YV+IL F D+ + +PFS+H
Sbjct: 374 GCMLRTGQSLLANTLIHLHLGRDWRRPAINSASPEFATYVKILTWFFDAPSVHAPFSVHR 433
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIYVVSGDEDGERG 241
+ +GK +G G W GP + L RA+ G+ +A+ V + D
Sbjct: 434 MAMSGKDFGKDVGQWFGPSTAAGAIRTLVHDFPRAQLGVA-----IAVDGVLYETDIYSA 488
Query: 242 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
+ D +R S F + W +L+LV LGL+ VNP Y L+ FTFPQSL
Sbjct: 489 SHYPMSSADGARRASGFKRHPGRWGNRAVLVLVATRLGLDGVNPIYYENLKTIFTFPQSL 548
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
GI GG+P +S Y VG Q S YLDPH +P + +
Sbjct: 549 GIAGGRPSSSYYFVGSQGNSLFYLDPHHTRPAVPL 583
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 41/72 (56%), Gaps = 4/72 (5%)
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA-PLFTVTQT 400
D T+H D +R + L +DPS+ +GF CRD+ D+ DF R +AE S G LF++ +
Sbjct: 699 DLKTFHCDRVRKMPLSGLDPSMLLGFLCRDEQDWKDFRRR---MAEISKGRDTLFSIQEE 755
Query: 401 HKKPVNHSDVLG 412
+ SD +G
Sbjct: 756 PPSWPSDSDDMG 767
>gi|440478911|gb|ELQ59709.1| cysteine protease atg4 [Magnaporthe oryzae P131]
Length = 572
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R ALA +Y G P V D
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 390 -FMEVAKSDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448
Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 508
Query: 364 AIGFYCRDKDDF 375
IGF D++++
Sbjct: 509 LIGFLILDEENW 520
>gi|440467300|gb|ELQ36530.1| cysteine protease atg4 [Magnaporthe oryzae Y34]
Length = 572
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R ALA +Y G P V D
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 390 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448
Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 508
Query: 364 AIGFYCRDKDDF 375
IGF D++++
Sbjct: 509 LIGFLILDEENW 520
>gi|358369016|dbj|GAA85631.1| autophagy cysteine endopeptidase Atg4 [Aspergillus kawachii IFO
4308]
Length = 378
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 113/375 (30%), Positives = 172/375 (45%), Gaps = 54/375 (14%)
Query: 49 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQD-----------EALGDAAGNNGLAE- 96
+RI + + P TS IW LG+ + +D A + G +
Sbjct: 11 KRIVQYLWDPEPRNDEDPTSSIWCLGIEYHPEKDVSPRGETPDKNSARDNTTGTTNYRKP 70
Query: 97 --------FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
F DF SRI ++YR F PI ++ D M S L+A AL LG
Sbjct: 71 SEHAWPESFLLDFESRIWMTYRSNFPPI--PRVEGDDKSASMTLGS--LLANALSTLVLG 126
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSW 207
R WR+ + F+ E ++L LF D+ T+PFS+H ++ G ++ G G W GP A +
Sbjct: 127 RDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKFPGEWFGPSATAKCI 183
Query: 208 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 267
EAL+ C S + +YV + + + R +V + P
Sbjct: 184 EALSS--------QCGSPTLKVYVSNDTSEVYQ-----------DRFMNVARNSSGVFQP 224
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
L+L+ LG++ + P Y L+ T PQS+GI GG+P AS Y VG Q YLDPH
Sbjct: 225 TLILLGTRLGIDHITPVYWDGLKATLQLPQSVGIAGGRPSASHYFVGAQGSHLFYLDPHY 284
Query: 328 VQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
+P + G+ + + TYH+ +R IH+ +DPS+ IGF RD++D+DD+ R
Sbjct: 285 TRPALPDRQGGELYSKEEVDTYHTRRLRRIHVRDMDPSMLIGFLIRDQEDWDDWLNRIQA 344
Query: 385 LAEESNGAPLFTVTQ 399
+ G P+ V +
Sbjct: 345 V----KGRPIIHVLK 355
>gi|328351041|emb|CCA37441.1| autophagy-related protein 4 [Komagataella pastoris CBS 7435]
Length = 758
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 111/357 (31%), Positives = 150/357 (42%), Gaps = 98/357 (27%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 135
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 194
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 252 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
+ G +D +TPIL+L+ + LG+EKVNP Y +LR + QS+GI GG+P +S
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281
Query: 311 YIVGVQEESAIYLDPHDVQPVINIG------------KDDLEA----------------- 341
Y G Q + YLDPH Q + G K D A
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTKKTDENAAGQYPVSNTDSNNETNH 341
Query: 342 --------------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
D + H+ + +HL +DPS+ IGF +DDF+D+
Sbjct: 342 DDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSHMDPSMLIGFLITSEDDFNDW 398
>gi|410967384|ref|XP_003990200.1| PREDICTED: cysteine protease ATG4C [Felis catus]
Length = 459
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 117/425 (27%), Positives = 171/425 (40%), Gaps = 94/425 (22%)
Query: 65 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 155 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 181
QK R Y + I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPAVSQKETIRRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQ-------- 262
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + C+ + D +++L+P+ LG E+ N Y+ ++ ++L I
Sbjct: 263 DCTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGIL---RALNI 319
Query: 302 VG----GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
V KP S Y G Q++S IY+DPH Q +++ D + T+H + +
Sbjct: 320 VWVLLVAKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFR 377
Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHS 408
+DPS IGFYCR+ DF +K+ + S+ PLFT H + N
Sbjct: 378 KMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEE 437
Query: 409 DVLGE 413
D+ E
Sbjct: 438 DLFSE 442
>gi|148691993|gb|EDL23940.1| mCG3720 [Mus musculus]
Length = 318
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 87/266 (32%), Positives = 127/266 (47%), Gaps = 49/266 (18%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 77 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 125
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 126 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 185
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 186 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 228
Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 229 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 288
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVG 314
+ F PQSLG +GGKP + Y +G
Sbjct: 289 FKECFKMPQSLGALGGKPNNAYYFIG 314
>gi|254567087|ref|XP_002490654.1| Conserved cysteine protease required for autophagy [Komagataella
pastoris GS115]
gi|238030450|emb|CAY68374.1| Conserved cysteine protease required for autophagy [Komagataella
pastoris GS115]
Length = 531
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 111/357 (31%), Positives = 150/357 (42%), Gaps = 98/357 (27%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 135
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 194
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 252 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
+ G +D +TPIL+L+ + LG+EKVNP Y +LR + QS+GI GG+P +S
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281
Query: 311 YIVGVQEESAIYLDPHDVQPVINIG------------KDDLEA----------------- 341
Y G Q + YLDPH Q + G K D A
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTKKTDENAAGQYPVSNTDSNNETNH 341
Query: 342 --------------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
D + H+ + +HL +DPS+ IGF +DDF+D+
Sbjct: 342 DDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSHMDPSMLIGFLITSEDDFNDW 398
>gi|210063823|gb|ACJ06587.1| ATG4 protein [Magnaporthe oryzae]
Length = 491
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 144/312 (46%), Gaps = 56/312 (17%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGDSKI----------------TSDVGWGCMLRS 133
F DF SRI ++YR GF DP S++ T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFESIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R ALA +Y G P V D
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367
Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 427
Query: 364 AIGFYCRDKDDF 375
IGF D++++
Sbjct: 428 LIGFLILDEENW 439
>gi|67526025|ref|XP_661074.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
gi|40743824|gb|EAA63010.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
Length = 379
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 159/328 (48%), Gaps = 54/328 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
F DF S+I ++YR F PI TSD GWGCM+RS
Sbjct: 50 FLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMIRS 109
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
Q L+A ++ LGR WR+ + E ++L LF DS +PFSIH+ ++ G + G
Sbjct: 110 GQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFCGK 166
Query: 193 AAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R + LA R ++ + +Y+ + D + V D+
Sbjct: 167 HPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRDE- 216
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
KG P L+L+ L LG++++ Y L+ PQS+GI GG+P AS Y
Sbjct: 217 --------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSASHY 266
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
V VQ YLDPH+ +P + + E + +TYH+ +R +++ +DPS+ IGF
Sbjct: 267 FVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDPSMLIGFLI 326
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTV 397
RD+DD++D+ AR L G P+ T+
Sbjct: 327 RDEDDWEDWKARIMSL----EGKPIITI 350
>gi|358381369|gb|EHK19044.1| hypothetical protein TRIVIDRAFT_181799 [Trichoderma virens Gv29-8]
Length = 451
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 98/309 (31%), Positives = 146/309 (47%), Gaps = 50/309 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 133
F +D +++ ++YR GFDPI S +SD GWGCM+RS
Sbjct: 117 FLEDMAAKFWMTYRSGFDPIAKSVDPRATSALSFAVRIKSTLSDPTGFSSDSGWGCMIRS 176
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A + +LGR WR+ +E +++ +F D +P+SIHN ++ G A G
Sbjct: 177 GQSLLATTIGILQLGRDWRR---GKCQQEERQLISMFADDPRAPYSIHNFVRHGATACGK 233
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A A+C +A T LP+ +Y + +D + D
Sbjct: 234 FPGEWFGP-------SATAQCIQALTS--ASGLPLKVYSPNDGQDVYEDSFMKIAKPD-- 282
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
GQ D+ P L+L+ LG++K+ P Y L PQS+GI GG+P +S Y
Sbjct: 283 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLLAALQMPQSVGIAGGRPSSSHYF 333
Query: 313 VGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VG Q YLDPH + I D E D + H+ +R +HL +DPS+ IGF
Sbjct: 334 VGSQGSYLFYLDPHHTRKAIPYHADVTKYTEEDIESCHTSRLRRLHLKEMDPSMLIGFLI 393
Query: 370 RDKDDFDDF 378
R + D+ ++
Sbjct: 394 RTESDWSEW 402
>gi|358390472|gb|EHK39877.1| hypothetical protein TRIATDRAFT_208244 [Trichoderma atroviride IMI
206040]
Length = 452
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/316 (32%), Positives = 148/316 (46%), Gaps = 50/316 (15%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVG 126
G A F +D SS+ ++YR GF+PI S +SD G
Sbjct: 113 GTGWPAGFVEDMSSKFWMTYRSGFEPIPKSVDPKAASALSFSMRIKSTLSDSAGFSSDSG 172
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+RS Q L+A + RLGR WR+ + +R ++ +F D +P+SIHN ++
Sbjct: 173 WGCMIRSGQSLLATTIGILRLGRDWRRDQSQEEERH---LISMFADDPRAPYSIHNFVRH 229
Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G A G G W GP A A+C +A T L + IY + +D
Sbjct: 230 GATACGKYPGEWFGP-------SATAQCIQALTS--SSGLSLNIYSPNDGQD-------- 272
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ + S S GQ + P L+L+ LG++K+ P Y L PQS+GI GG+
Sbjct: 273 --VYEDSFMKIAKSDGQT-FNPTLILIRTRLGIDKITPIYWDALIAALHMPQSVGIAGGR 329
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPS 362
P +S Y VG Q YLDPH + I D E D + H+ +R IH+ +DPS
Sbjct: 330 PASSHYFVGSQGSYLFYLDPHHTRKAIPYHDDVTKYTEEDIESCHTSRLRRIHIKEMDPS 389
Query: 363 LAIGFYCRDKDDFDDF 378
+ IGF R + D+ ++
Sbjct: 390 MLIGFLIRTESDWTEW 405
>gi|452004375|gb|EMD96831.1| hypothetical protein COCHEDRAFT_1123524 [Cochliobolus
heterostrophus C5]
Length = 471
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 160/358 (44%), Gaps = 91/358 (25%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
N + F DF SRI ++YR GF I S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFMAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
+RS Q ++A AL RLGR WR KP +E+ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAV 209
Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + LA R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYV-SGD------GADVY--E 252
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + ++ GQ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEVAIDDDGQ--WQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 310 TYIVGVQEESAIYLDPHDVQPVI--------------NIGKDDLE--------------- 340
Y V Q + YLDPH +P++ N ++ L
Sbjct: 311 HYFVATQGNNFFYLDPHSTRPLLPYRPPPSSTENESQNQSQNQLAVPSSLDASATSNSSS 370
Query: 341 ------------ADTSTY--------HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+D +TY H+ IR + + +DPS+ I F DD++++
Sbjct: 371 TTIVPSATPTDGSDRTTYSEEELATCHTRRIRRLQIREMDPSMLIAFLITSADDYENW 428
>gi|343428793|emb|CBQ72338.1| related to ATG4-essential for autophagy [Sporisorium reilianum SRZ2]
Length = 1505
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 158/348 (45%), Gaps = 80/348 (22%)
Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--------- 162
GF G +T+D GWGCMLR+ Q L+A AL+ LGR W + + P R+
Sbjct: 779 GFSRAG---LTTDSGWGCMLRTGQSLLANALINVHLGRSWMR--EAPPARQLEFLQELAN 833
Query: 163 ------------------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGS 196
Y++IL F D S PF +H + + GK G G
Sbjct: 834 LSLDTSAEKQSLLEWRQKRARHSTYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGE 893
Query: 197 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE-DGERGGAPVVCIDDASRHC 255
W GP + + L + + GL + ++ + DE G + + AS
Sbjct: 894 WFGPSTAAGAIKQLV-SEFPDAGLAVELAHDGVFYL--DEVRAAAGASRQLGKGRASATG 950
Query: 256 SVFSKGQADWT---PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
+ KG T P+L+L+ + LGL+ VNP Y +++ TF+FP S+GI GG+P +S Y
Sbjct: 951 TNGRKGDTALTWHKPVLILIGIRLGLDSVNPIYYESVKATFSFPHSVGIAGGRPSSSYYF 1010
Query: 313 VGVQEESAIYLDPHDVQPVINI------------------------GKDD---------L 339
+G Q S YLDPH+V+P + + DD
Sbjct: 1011 MGHQGNSLFYLDPHNVRPAVALRFPPSTFPAAVPRQLDIAHRFAFEEHDDEDEWWSHAYT 1070
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
EA TST+H D +R + + S+DPS+ +GF +D++D D CAR L++
Sbjct: 1071 EAQTSTFHCDKVRRMPIKSLDPSMLLGFLVKDEEDLADLCARIKALSK 1118
>gi|330935035|ref|XP_003304808.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
gi|311318464|gb|EFQ87127.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
Length = 470
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 109/357 (30%), Positives = 157/357 (43%), Gaps = 90/357 (25%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
N + F DF SRI ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
+RS Q ++A AL RLGR WR ++P +E+ +I+ +F D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDIVAMFADDPRAPFSIHRFVEHGAAV 209
Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + L + E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLVH-KNKEVGL-------KVYV-SGD------GADVY--E 252
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD--------------------------- 342
Y V Q + YLDPH +P++ +
Sbjct: 311 HYFVATQANNFFYLDPHSTRPLLPYRPSSWSTEEQASAPSTLEASATSATSTSSSTTIVP 370
Query: 343 -------------TSTY--------HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
TS Y H+ IR + + +DPS+ + F +DD++D+
Sbjct: 371 SANEVTAPSDASRTSGYSPEELATCHTRRIRRLQIREMDPSMLLAFLITSEDDYEDW 427
>gi|189194545|ref|XP_001933611.1| peptidase family C54 protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187979175|gb|EDU45801.1| peptidase family C54 protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 470
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 112/357 (31%), Positives = 159/357 (44%), Gaps = 90/357 (25%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
N + F DF SRI ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
+RS Q ++A AL RLGR WR ++P +E+ +++ +F D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDVVAMFADDPRAPFSIHRFVEHGAAV 209
Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + L R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLVHKNR-EAGL-------KVYV-SGD------GADVY--E 252
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 310 TYIVGVQEESAIYLDPHDVQPVINI------------GKDDLE----------------- 340
Y V Q + YLDPH +P++ LE
Sbjct: 311 HYFVATQANNFFYLDPHSTRPLLPYRPSSSSTEEQVAAPSTLEASATSVTSTSSSTTIVP 370
Query: 341 -ADTSTYHSDV------------------IRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
A+ T SDV IR + + +DPS+ + F +DD++D+
Sbjct: 371 SANEVTAPSDVSKPSGYSLEELATCHTRRIRRLQIREMDPSMLLAFLITSEDDYEDW 427
>gi|443893810|dbj|GAC71266.1| cysteine protease [Pseudozyma antarctica T-34]
Length = 1509
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 115/392 (29%), Positives = 172/392 (43%), Gaps = 88/392 (22%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----QKPFDRE-------------- 162
+T+D GWGCMLR+ Q L+A AL+ LGR W++ Q F E
Sbjct: 776 LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRETAPKSQIEFFEELANASLDASAENQS 835
Query: 163 -------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 207
Y+ IL F D S PF +H + + GK G G W GP +
Sbjct: 836 LASWRERRARHATYIRILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAGAI 895
Query: 208 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----A 263
+ L E G+ + ++ + D R A SR + S + A
Sbjct: 896 KQLV-FDFPEAGIAVELAHDGVFYL----DEVRAAASAST--GKSRASGMLSGNRRAETA 948
Query: 264 DWT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
W P+L+L+ + LGLE VNP Y +++ TF+FPQS+GI GG+P +S Y +G Q S Y
Sbjct: 949 VWRRPVLILIGIRLGLETVNPIYYESVKATFSFPQSVGIAGGRPSSSYYFMGHQGNSLFY 1008
Query: 323 LDPHDVQPVINI------------------------GKDD---------LEADTSTYHSD 349
LDPH+V+P + + +DD EA TST+H +
Sbjct: 1009 LDPHNVRPAVPLRYPPTTFPAAAPSRFDVSHRYALEDRDDEDEWWSHAYTEAQTSTFHCE 1068
Query: 350 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSD 409
+R + + S+DPS+ +GF +D++ D CAR L + +F+ ++ K V+ D
Sbjct: 1069 KVRRMPIKSLDPSMLLGFLVKDEEALVDLCARIKALPKT-----IFSFAESAPKWVDDDD 1123
Query: 410 V--LGETGGVPEDDSLGVMSMNDAVGNAHEDD 439
E+ P D G +D VG + D
Sbjct: 1124 FDPSMESFSEPSADEAG---SDDDVGKGEDQD 1152
>gi|71022117|ref|XP_761289.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
gi|46097783|gb|EAK83016.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
Length = 1541
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/339 (30%), Positives = 154/339 (45%), Gaps = 81/339 (23%)
Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK--PLQKPFD--------- 160
GF G +T+D GWGCMLR+ Q L+A ALL LGR W + P + D
Sbjct: 814 GFSRAG---LTTDSGWGCMLRTGQSLLANALLNVHLGRSWLREAPPMRQMDFLEQLASLS 870
Query: 161 -------------RE-------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWV 198
RE Y++IL F D S PF +H + + GK G G W
Sbjct: 871 LDSSVEMQSLQEWREKRARHAAYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWF 930
Query: 199 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
GP + + L + + G+ + ++ + DE GA R
Sbjct: 931 GPSTAAGAIKQLV-TEFPDAGIAVELAHDGVFYL--DEVRLAAGARSALQSGKGR----- 982
Query: 259 SKGQADWT---PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
+G A T P+++L+ + LGL+ VNP Y +++ TF+FP S+GI GG+P +S Y +G
Sbjct: 983 -QGDAAVTWRRPVVILIGIRLGLDSVNPIYYESVKETFSFPHSVGIAGGRPSSSYYFMGH 1041
Query: 316 QEESAIYLDPHDVQPVINI------------------------GKDD---------LEAD 342
Q S YLDPH+V+P + + KDD EA
Sbjct: 1042 QGNSLFYLDPHNVRPAVALRYPPSTFPTAVPHQLDVAHRFALEDKDDELEWWSHAYTEAQ 1101
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
TST+H + +R + + S+DPS+ +GF +D++D D C R
Sbjct: 1102 TSTFHCEKVRRMPIKSLDPSMLLGFLVKDEEDLMDLCTR 1140
>gi|451855330|gb|EMD68622.1| hypothetical protein COCSADRAFT_79257 [Cochliobolus sativus ND90Pr]
Length = 473
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 97/263 (36%), Positives = 122/263 (46%), Gaps = 42/263 (15%)
Query: 88 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVG 126
A N + F DF SRI ++YR GF I S+ TSD G
Sbjct: 87 AQYGNWPSAFLDDFESRIWMTYRSGFTAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
+GCM+RS Q ++A AL RLGR WR KP +E+ EIL LF D +PFSIH ++
Sbjct: 147 FGCMIRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEH 205
Query: 187 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G A G G W GP A R + LA R E GL +YV D
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYVSGDGADVYEDKLKE 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
V IDD +W P L+LV LG++K+ P Y L+ + QS+GI GG+
Sbjct: 258 VAIDD-----------DGEWQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGR 306
Query: 306 PGASTYIVGVQEESAIYLDPHDV 328
P AS Y V Q + YLDPH
Sbjct: 307 PSASHYFVATQGNNFFYLDPHST 329
>gi|256071261|ref|XP_002571959.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
gi|353229490|emb|CCD75661.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
Length = 376
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/317 (32%), Positives = 155/317 (48%), Gaps = 42/317 (13%)
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFS 179
TSD GWGCM R QML+AQAL+ H LGR WR + ++I+ F DS + SP S
Sbjct: 67 TSDCGWGCMFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLS 126
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSG 234
+H L+Q G W GP ++C A+ R + L + + +Y V+
Sbjct: 127 LHRLVQMSDR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYR 180
Query: 235 DE--DGERG------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLG 277
+E D RG P + D H +++ + Q+D T ILLL+PL+ G
Sbjct: 181 EEIIDLARGLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFG 236
Query: 278 L-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
++NPRYI + F+ P +G++GG+ S+Y VG Q S IYLDPH QP N+
Sbjct: 237 KGNRINPRYIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNS 296
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 396
D ++H + + + +++PS A+GFYCR + + D R L S+
Sbjct: 297 PKFSVD--SWHCPIPKTMSAANLNPSCAVGFYCRTRGELSDLIDRLPILMSVSDNLQ--- 351
Query: 397 VTQTHKKPVNHS-DVLG 412
T +PV + +VLG
Sbjct: 352 -ASTRSRPVAFTVEVLG 367
>gi|402219068|gb|EJT99143.1| hypothetical protein DACRYDRAFT_70366 [Dacryopinax sp. DJM-731 SS1]
Length = 1093
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 153/319 (47%), Gaps = 42/319 (13%)
Query: 117 GDSKITSDVGWGCMLRSSQMLVAQALL-------------FHRLGRPWRKPLQKPFDRE- 162
G +TSD GWGCMLR+ QML+A +L+ + P P + DR+
Sbjct: 431 GRGDLTSDAGWGCMLRTGQMLLANSLVALHVPPLPPNPVYINNFPAPSLPPSET--DRQR 488
Query: 163 ---YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
YV+IL F D + PFS+H L AG G G W GP S + L A
Sbjct: 489 FEAYVKILVWFLDDPSIWCPFSVHRLALAGADMGREVGQWFGPSIAAGSIKKLVSAFPA- 547
Query: 218 TGLGCQSLP------MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPIL 269
GLG P A++ S + + D ++ + +W +L
Sbjct: 548 CGLGVVVPPDQIIHETAVFTASHTPTLPSSASSLSNTRDREARERA-NRMKEEWGDRAVL 606
Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
+L+ L LG+E V P Y +++ FTFPQ++GI GG+P +S Y VG Q + YLDPH +
Sbjct: 607 ILIGLRLGIEGVTPIYYDSVKALFTFPQTVGIAGGRPSSSYYFVGTQGDHLFYLDPHSTR 666
Query: 330 PVINI-----GKDDLE-----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
P + + G D ++ T+HSD +R +H+ +DPS+ GF R+ +++ D
Sbjct: 667 PAVPLRVPTDGPYDATGQFTLSEMKTFHSDKVRKMHISGLDPSMLCGFIVRNVEEWRDLR 726
Query: 380 ARASKLAEESNG-APLFTV 397
AR LA+ G AP+FT+
Sbjct: 727 ARVDALAKSKGGKAPIFTI 745
>gi|444525500|gb|ELV14047.1| Cysteine protease ATG4D [Tupaia chinensis]
Length = 431
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 104/384 (27%), Positives = 171/384 (44%), Gaps = 89/384 (23%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 60 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 109
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 110 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSRSASPSRYHGPAH 169
Query: 152 -RKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
R P L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 170 WRPPRWAQGTPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-- 225
Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 262
+A R + + +YV +D A VV +
Sbjct: 226 -----SLVAHILRKAVESCSEVTRLVVYV---SQDCTVYKADVVRL-------VARPDPA 270
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
A+W +++LVP+ LG E +NP Y+P ++L T P ++ +Y
Sbjct: 271 AEWKSVVILVPVRLGGETLNPVYVPCVKLMPTPP-------------------TDDFLLY 311
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 312 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFEMLCSEL 369
Query: 383 SKLAEESNGA---PLFTVTQTHKK 403
+++ S+ P+FT+ + H +
Sbjct: 370 TRVLSSSSATERYPMFTLAEGHAQ 393
>gi|353227348|emb|CCA77858.1| hypothetical protein PIIN_00505 [Piriformospora indica DSM 11827]
Length = 1257
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 95/294 (32%), Positives = 137/294 (46%), Gaps = 61/294 (20%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 121
F D++SR+ ++YR F PI D+ +
Sbjct: 317 FYSDYTSRVWLTYRNTFPPIRDTALSCLEPVASRSTHNNSSSTDISQPLPSPSKPRWPWS 376
Query: 122 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE---ILHLFGD- 172
TSD GWGCMLR+ Q L+A AL+ L R WR+P + +YV+ IL F D
Sbjct: 377 GEKGWTSDAGWGCMLRTGQSLLANALIHLHLSRSWRRPTHPSYSPDYVQYVRILTWFLDN 436
Query: 173 -SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC--------- 222
S +PF IH + AGK G GSW GP + + L + + GL
Sbjct: 437 PSPLAPFGIHRMALAGKELGKEVGSWFGPSTAAGAIKRLV-GEFEDAGLEVALAVDSVVY 495
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEK 280
QS A S +++G G + V + + +G W P+L+LV + LG++
Sbjct: 496 QSDVYAASAASRNQNGVEGDSKTVGTSKSRKKG----QGPPKWGNRPVLILVGIRLGIDG 551
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
VNP Y +++ FTFPQ++GI GG+P +S Y VG Q +S YLDPH +P I +
Sbjct: 552 VNPIYYESVKTLFTFPQTVGIAGGRPSSSYYFVGAQGDSLFYLDPHHTRPAIPL 605
Score = 42.4 bits (98), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 16/53 (30%), Positives = 34/53 (64%), Gaps = 2/53 (3%)
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
T+H + +R + L ++DPS+ +GF CR+++++ D R +++A +F+V
Sbjct: 794 TFHCERVRKMPLSALDPSMLLGFLCRNEEEWKDLRERLAEMARTKKA--IFSV 844
>gi|348511374|ref|XP_003443219.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
Length = 459
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 98/342 (28%), Positives = 154/342 (45%), Gaps = 59/342 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-------------- 142
F + F+S + +YR+GF P+ S +T+D GWGC+LRSSQML+AQ L
Sbjct: 98 FRRCFASLLWFTYRRGFRPLPGSSLTTDSGWGCVLRSSQMLLAQGLLLHLMSPGWTWSGN 157
Query: 143 ---------LFHRLGR---------------PWRKPLQKPFDREYVEILHLFGDSETSPF 178
L H + W L +P + IL F D+ T+PF
Sbjct: 158 QRVVKDDMDLIHSVNDGFSSSERESKRSRHLSWGSILDRPTEGTPRRILRWFADNPTAPF 217
Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
IH L++ GK+ G AG W GP A R LP + V+ D
Sbjct: 218 GIHRLVELGKSSGKKAGDWYGP-------SIAAHILRKAVEASVVDLPNLVAYVAQD--- 267
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+ + D + C W +L+LVP+ LG + +NP YI +++
Sbjct: 268 -----CTIYLQDVRKLCE--RPLPQHWKSVLILVPVRLGGQDLNPSYITSVKKLLMLECC 320
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
+GI+GGKP S + VG Q++ +YLDPH QP +++ K+ ++H R +
Sbjct: 321 IGIIGGKPKHSLFFVGFQDDHLLYLDPHYCQPTVDVTKN---FPLESFHCKNPRKMPFSR 377
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQ 399
+DPS IGFY + + +F+ C ++ ++ + P+F +
Sbjct: 378 MDPSCTIGFYAKGQMEFESLCTSVNEAVSASAETYPMFIFEE 419
>gi|340518098|gb|EGR48340.1| protease required for autophagy [Trichoderma reesei QM6a]
Length = 450
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 148/312 (47%), Gaps = 50/312 (16%)
Query: 97 FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 133
F +D +++ ++YR GF+PI S +SD GWGCM+RS
Sbjct: 115 FTEDMAAKFWMTYRSGFEPIPKSVDPRATSALSFSVRIKSTLTDPTGFSSDSGWGCMIRS 174
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
Q L+A + +LGR WR+ + +E ++ +F D +PFSIHN ++ G A G
Sbjct: 175 GQSLLATTIATLQLGRDWRRGKNQ---QEERRLISMFADDPRAPFSIHNFVRHGATACGK 231
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A A+C +A T L + +Y + +D V D
Sbjct: 232 FPGEWFGP-------SATAQCIQALTS--SSDLDLHVYSPNDGQDVYEDSFMKVAKPD-- 280
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
GQ D+ P L+L+ LG++K+ P Y L T PQS+GI GG+P +S Y
Sbjct: 281 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLIATLQMPQSVGIAGGRPSSSHYF 331
Query: 313 VGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VG Q YLDPH + + +D + D + H+ +R +H+ +DPS+ IGF
Sbjct: 332 VGSQGSYLFYLDPHHTRKALPYHEDVANYTDEDIDSCHTSRLRRLHVKEMDPSMLIGFLI 391
Query: 370 RDKDDFDDFCAR 381
R + D+ ++ R
Sbjct: 392 RSESDWAEWRQR 403
>gi|355750993|gb|EHH55320.1| hypothetical protein EGM_04504, partial [Macaca fascicularis]
Length = 268
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 92/268 (34%), Positives = 133/268 (49%), Gaps = 40/268 (14%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVG 314
TL+ F PQSLG++GGKP ++ Y +G
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIG 268
>gi|388856806|emb|CCF49593.1| related to ATG4-essential for autophagy [Ustilago hordei]
Length = 1572
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 167/387 (43%), Gaps = 114/387 (29%)
Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK---PL-QKPFDRE----- 162
GF G +T+D GWGCMLR+ Q L+A AL+ LGR W++ PL Q+ F E
Sbjct: 824 GFSRAG---LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRDAPPLRQQQFLEELAGLS 880
Query: 163 ----------------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWV 198
Y++IL F D S PF +H + + GK G G W
Sbjct: 881 IADAAEKESLQEWRQKRARHATYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWF 940
Query: 199 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD-------A 251
GP + + L P A V DG V +D+ +
Sbjct: 941 GPSTASGAIKQL-----------VSEFPQAGIAVELARDG------VFYLDEVRAAASAS 983
Query: 252 SRHCSVFSKGQAD---------------WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+ SV S G+A W P+L+L+ + LGLE VNP Y +++ TF+F
Sbjct: 984 ASAASVQSGGKARSSGAASGSRKGEGLIWRRPVLILIGIRLGLESVNPIYYESVKATFSF 1043
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI--------------------- 334
P S+GI GG+P +S Y +G Q S YLDPH+V+P + +
Sbjct: 1044 PHSVGIAGGRPSSSYYFMGHQGNSLFYLDPHNVRPAVPLRYPPSTFPDAVPRHLGIAHRF 1103
Query: 335 ---GKDD---------LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
KDD E TST+H + +R + + S+DPS+ +GF +D++ D CAR
Sbjct: 1104 VLEDKDDEDEWWSHAYSEVQTSTFHCEKVRRMPIKSLDPSMLLGFLVKDEESLQDLCARI 1163
Query: 383 SKLAEESNGAPLFTVTQTHKKPVNHSD 409
L + +F+ ++ K V+ D
Sbjct: 1164 KALPKT-----IFSFAESAPKWVDDDD 1185
>gi|432871194|ref|XP_004071879.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
Length = 452
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 167/371 (45%), Gaps = 65/371 (17%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
S +S + LLG +++ +DEA + F + F+S + ++YR+GF + S +T+D
Sbjct: 70 SKSSPLILLGKSYEL-KDEANKE--------RFRRSFASLLWLTYRRGFPQLAGSSLTTD 120
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----------------------------- 155
GWGC+LR+ QML+A+ LL H + W +
Sbjct: 121 SGWGCVLRTGQMLLARGLLTHLMPPGWMWSVWYRAVKDDLDLPHHADCTDCKSNMRCRYQ 180
Query: 156 ------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
+P + + +++ F D +PF IH L++ G + G AG W GP +
Sbjct: 181 SLGSLYDRPLEAMHRKVVSWFADHPKAPFGIHRLVELGASSGKKAGDWYGPSIVA---HI 237
Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 269
L + A LP + V+ D + + D C W ++
Sbjct: 238 LQKAVAASV-----DLPNLVVYVAQD--------CTIYLQDVRGLCE--RPPPHSWKSVI 282
Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
+LVP+ LG + +NP YI ++ +GI+GG+P S + VG Q++ +YLDPH Q
Sbjct: 283 ILVPVRLGGQDLNPSYISCVKKLLELQCCIGIIGGRPKHSLFFVGFQDDQLLYLDPHYCQ 342
Query: 330 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES 389
+N+ K++ + ++H R + +DPS IGFY + + + C +++ S
Sbjct: 343 LTVNVTKENFPLE--SFHCKYPRKMPFSRMDPSCTIGFYASGQQELELLCTNVNEVVSTS 400
Query: 390 -NGAPLFTVTQ 399
G P+F ++
Sbjct: 401 AEGYPMFIFSE 411
>gi|448112117|ref|XP_004202013.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
gi|359465002|emb|CCE88707.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
Length = 480
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 98/330 (29%), Positives = 154/330 (46%), Gaps = 60/330 (18%)
Query: 85 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 116
LG G+ E ++D SRI +YR GF+PI
Sbjct: 69 LGRRYGSGSKEEMDKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128
Query: 117 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
+ T+DVGWGCM+R+SQML+A A+ LGR + ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAIQLLLLGRGFT--YADSSEKKHSDIIDMF 186
Query: 171 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
D +PFS+HN ++A L G W GP A S + L + Q E+ S P
Sbjct: 187 TDDPKAPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQFDES-----SSPRF 241
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
++S D DD + + + + IL+L+P+ LGL KV+P Y +
Sbjct: 242 RVIISESCD---------IYDD--KIGKLLQENEDAEGAILILLPVRLGLNKVSPYYHNS 290
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L F+ PQ +GI GGKP +S Y G + +YLDPH Q V + T+H+
Sbjct: 291 LSSLFSSPQLVGIAGGKPSSSYYFFGSHNGNLLYLDPHYPQSV------KASSIYDTFHT 344
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
++ + ++ +DPS+ IG + K+D++ F
Sbjct: 345 HNVQSLKIEDMDPSMLIGILIKSKEDYESF 374
>gi|195437827|ref|XP_002066841.1| GK24338 [Drosophila willistoni]
gi|194162926|gb|EDW77827.1| GK24338 [Drosophila willistoni]
Length = 400
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 157/316 (49%), Gaps = 28/316 (8%)
Query: 92 NGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
N + E + +D SR+ +YR F P+G+ ++T+D GWGCMLR QM++AQAL+ LG
Sbjct: 52 NAIQELDLIRRDIQSRLWCTYRHSFVPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLG 111
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
R W + D Y++I++ F D+ S +S+H + G++ G W+GP + + +
Sbjct: 112 REWYWT-SECRDATYLKIVNRFEDARKSYYSLHQIALMGESQNKMVGEWLGPNTVAQILK 170
Query: 209 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 268
L C L I+V V +DD S+ W P+
Sbjct: 171 KLV-CFDDWCSL-------VIHVAMDS---------TVVLDDIYS----LSQDGESWKPL 209
Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
LL++PL LG+ +NP Y+P L+ F S G++GG+P + Y VG ++ +YLDPH
Sbjct: 210 LLIIPLRLGITDINPIYVPALKRCFELESSCGMIGGRPNQALYFVGYVDDEVLYLDPHTT 269
Query: 329 QPVINIGKDDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
Q +G+ A+ TYH ++ ++DPSLA+ F C+ + F+ + +
Sbjct: 270 QRTGAVGQKTTTAEQELDETYHQKYAARLNFSAMDPSLAVCFICKTQSSFELLLKQLREE 329
Query: 386 AEESNGAPLFTVTQTH 401
+ LF ++++
Sbjct: 330 VLTLSSPALFEISKSR 345
>gi|358056752|dbj|GAA97415.1| hypothetical protein E5Q_04093 [Mixia osmundae IAM 14324]
Length = 1202
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 112/390 (28%), Positives = 159/390 (40%), Gaps = 110/390 (28%)
Query: 97 FNQDFSSRILISYRKGFDPI---------------------------GDSKITSDVGWGC 129
F +DF+SRI ++YR GF PI + +++D GWGC
Sbjct: 545 FYEDFTSRIQLTYRAGFPPIPTTVSNGPATTAFNAVLSSLTGRSPLQANDGLSTDAGWGC 604
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFD--------------REYVEILHLFGD--S 173
MLR+ Q L+A AL F LGR WR+ + Y +L F D S
Sbjct: 605 MLRTGQSLLANALAFVHLGRDWRRTCSSSDESPDIPEESRSLEHFETYARLLTWFLDDPS 664
Query: 174 ETSPFSIHNLLQAGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV- 231
PFS+H GK G G W GP + + LA + +L +A+ V
Sbjct: 665 PLCPFSVHRFAVVGKEQGGKEIGEWFGPSTAAGAIKHLA------SNFAPANLGVAVSVD 718
Query: 232 --VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 289
V + P A R S + P+L+L+ LGL+KVNP Y ++
Sbjct: 719 GTVYRSDVQAAANPPFSEPATAGRQDPAPSVRTSWQRPVLILINARLGLDKVNPLYYESI 778
Query: 290 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH----------------------- 326
+ +FPQS+GI GG+P +S Y VGVQ+ S Y+DPH
Sbjct: 779 KAALSFPQSVGISGGRPSSSYYFVGVQQNSVYYIDPHHTKPAIPFRQPPPDIAALAAELP 838
Query: 327 -DVQPVINIGKDDL----------------EADTST-----------------YHSDVIR 352
D+ +N + L E D +T +H D +R
Sbjct: 839 LDIHSPLNAWQRSLGDSLPPTPGAEPPAPDECDDATRLRAWFANEYDETCFGSFHCDRVR 898
Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
+ L +DPS+ IGF CRD+ D+DD +RA
Sbjct: 899 KMPLSGLDPSMLIGFLCRDEADWDDLQSRA 928
>gi|389750681|gb|EIM91754.1| hypothetical protein STEHIDRAFT_88418 [Stereum hirsutum FP-91666
SS1]
Length = 1286
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 102/312 (32%), Positives = 145/312 (46%), Gaps = 57/312 (18%)
Query: 91 NNGLAEFNQDFSSRILISYRKGFDPIGDSKIT---------------------------- 122
NN F DF+SR+ ++YR F PI DS +T
Sbjct: 333 NNWPPVFYSDFTSRVWLTYRSHFQPIRDSTLTALESEQANMAHAGPVIMASSPPTKKWGW 392
Query: 123 ---------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLF 170
SD GWGCMLR+ Q L+A AL+ LGR WR+P + +Y V++L F
Sbjct: 393 PGSGEKGWTSDAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQMLTWF 452
Query: 171 GDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
DS T PFS+H + AGK G G W GP + + L E GLG +A
Sbjct: 453 FDSPTPHCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVS---IA 508
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
D P + + + + G+A +L+L+ + LGL+ VNP Y T
Sbjct: 509 SDSQIFQSDVFAASHPPMDSPSSKKKLASTWGGRA----VLVLIGIRLGLDGVNPIYYET 564
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
++ +TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + L ST +
Sbjct: 565 IKALYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAV-----PLRPPPST--N 617
Query: 349 DVIRHIHLDSID 360
D++ I +SI+
Sbjct: 618 DIVLDISRESIE 629
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 51/91 (56%), Gaps = 15/91 (16%)
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
A+ T+H + +R + L +DPS+ +GF CRD+ D++DF AR + L++ T+
Sbjct: 836 AELKTFHCERVRKMPLSGLDPSMLVGFLCRDEGDWEDFKARVADLSKTHK-----TIFSI 890
Query: 401 HKKPVNH-SDVLGETGGVPEDDSLGVMSMND 430
H +P ++ SD +D LG+ SM++
Sbjct: 891 HDEPPSYPSD---------SEDHLGLESMSE 912
>gi|426230580|ref|XP_004009345.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Ovis
aries]
Length = 438
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 105/347 (30%), Positives = 163/347 (46%), Gaps = 31/347 (8%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 85 TSFSKISSVHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAGGTLTSD 138
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLL 184
GWGCMLRS QM++AQ LL H L R W Q P
Sbjct: 139 CGWGCMLRSGQMMLAQGLLLHLLPRDWTWS-QGAGLGPAEPPGLGSPSPGPGPXXXXXXX 197
Query: 185 QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 244
G+A G AG W GP +A R C + + VS D
Sbjct: 198 SWGRAPGKKAGDWYGP-------SLVAHILRKAVE-SCSEVTRLVVYVSQDC-------- 241
Query: 245 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
V D +R + S A+W +++LVP+ LG E +NP Y+P ++ LGI+GG
Sbjct: 242 TVYKADVARLVAR-SDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGG 300
Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
P S Y +G Q++ +YLDPH QP +++ + D + ++H R + +DPS
Sbjct: 301 TPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCT 358
Query: 365 IGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHS 408
+GFY D+ +F+ C+ +++ S+ P+FT+ + H + +HS
Sbjct: 359 VGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLVEGHAQ--DHS 403
>gi|113931596|ref|NP_001039246.1| autophagy related 4D, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|89273389|emb|CAJ82151.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
gi|114108226|gb|AAI22932.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
Length = 470
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 172/385 (44%), Gaps = 79/385 (20%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
S ++ ++LLG + D+ + F +DF SR+ ++YR+ F + + +T+D
Sbjct: 76 SRSAPVYLLGERYYFRLDDEID---------RFQKDFVSRVWLTYRRDFPALEGTALTTD 126
Query: 125 VGWGCMLRSSQMLV---------------AQALLFH------------------------ 145
GWGCM+RS QML+ ++AL H
Sbjct: 127 CGWGCMIRSGQMLLAQGLLLHLLSREWTWSEALYTHFVEMEPIRSSSPSSMPLSLATDHS 186
Query: 146 -RLGRPWRKPLQKPFDRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 203
R +P + P+ E + I+ F D ++PF +H ++ G +G AG W GP
Sbjct: 187 GRHSQPQTHCSRAPYGGEVHQNIVSWFSDHASAPFGLHRMVALGSIFGKRAGDWYGP--- 243
Query: 204 CRSWEALARCQRAETGLGCQSLPMAIYVVSG----DEDGERGGAPVVCIDDASRHCSVFS 259
+A + + +++YV D E+ A V D SR
Sbjct: 244 ----SIVAHIIKKAIESSSEVPDLSVYVSQDCTVYKADIEQLFAGEVPHTDTSR-----G 294
Query: 260 KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES 319
G+A +++LVP LG E NP Y L+ P LGI+GGKP S Y +G Q+
Sbjct: 295 AGKA----VIILVPARLGGETFNPVYKHCLKEFLRMPSCLGIIGGKPKHSLYFIGYQDNY 350
Query: 320 AIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
+YLDPH QP I+ +D+ + ++H + R + + +DPS FY +++DDF C
Sbjct: 351 LLYLDPHYCQPYIDTSRDNFPLE--SFHCNAPRKLSITRMDPSCTFAFYAKNRDDFGKLC 408
Query: 380 ARASKL-----AEESNGAPLFTVTQ 399
SK+ AEE P+F++++
Sbjct: 409 EHLSKVLHSPQAEEK--YPIFSISE 431
>gi|322795203|gb|EFZ18025.1| hypothetical protein SINV_08608 [Solenopsis invicta]
Length = 403
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 110/371 (29%), Positives = 166/371 (44%), Gaps = 64/371 (17%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 120
I + +W+LG + ++ L +D S++ +YRKGF PIG +S
Sbjct: 16 IPQTDEPVWILGRKYNAIKE-----------LDAIRRDIRSKLWFTYRKGFIPIGGCNST 64
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFS 179
TSD GWGCMLR QM++AQAL+ LG+ W+ P K + Y++IL F D + FS
Sbjct: 65 FTSDKGWGCMLRCGQMVLAQALITLHLGKDWQWMPETK--NNTYLKILSRFEDKRAAAFS 122
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIY 230
IH + G + G G W GP + + W +L + L +
Sbjct: 123 IHQIALTGASEGKEVGQWFGPNTIAQVLKKLIVYDEWSSLTIHVALDNTLIVNDILKQCR 182
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
+ G+ G P+ K + W P+LLL+PL LGL ++NP YI L+
Sbjct: 183 IEGGETAEADGEVPL--------------KAPSQWKPLLLLIPLRLGLSEINPVYINGLK 228
Query: 291 L--------------------TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
+ +F QSLG++GGKP + Y +G + IYLDPH Q
Sbjct: 229 VKFKILCMQKKKYICIQFFQTSFKISQSLGVIGGKPNLALYFIGCVGDEVIYLDPHTTQR 288
Query: 331 V----INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
I ++++E D TYH I + +DPS+A+ F+C + +F C +
Sbjct: 289 SGSVEDKISEEEIEMDI-TYHCKSASRIPITGMDPSVALCFFCATEKEFMSLCKSMQEEL 347
Query: 387 EESNGAPLFTV 397
PLF +
Sbjct: 348 ILPEKQPLFEL 358
>gi|148226916|ref|NP_001087417.1| cysteine protease ATG4D [Xenopus laevis]
gi|61211765|sp|Q68FJ9.1|ATG4D_XENLA RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related protein 4 homolog D
gi|51260960|gb|AAH79754.1| MGC84754 protein [Xenopus laevis]
Length = 469
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 103/356 (28%), Positives = 161/356 (45%), Gaps = 63/356 (17%)
Query: 91 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
++ + F +DF SR+ ++YR+ F + + +T+D GWGCM+RS QML+AQ LL H L R
Sbjct: 93 DDEIERFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSRE 152
Query: 151 W--RKPLQKPF----------------------------------------DREYVEILH 168
W + L + F D+ + I+
Sbjct: 153 WTWSEALYRHFVEMEPIRSSSPPSMPLSSLATGHSAGDYQPHTQCSGAPHGDQVHRNIMR 212
Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
F D SPF +H L+ G +G AG W GP +A + + ++
Sbjct: 213 WFSDHPGSPFGLHQLVTLGSIFGKKAGDWYGP-------SIVAHIIKKAIETSSEVPELS 265
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
+YV S D + + D + G+A +++LVP+ LG E NP Y
Sbjct: 266 VYV-SQDCTVYKADIEQLFAGDVPHAETSRGAGKA----VIILVPVRLGGETFNPVYKHC 320
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L+ P LGI+GGKP S Y +G Q+ +YLDPH QP I+ K+D + ++H
Sbjct: 321 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQPYIDTSKNDFPLE--SFHC 378
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL-----AEESNGAPLFTVTQ 399
+ R I + +DPS FY ++ +DF C K+ AEE P+F++++
Sbjct: 379 NSPRKISITRMDPSCTFAFYAKNSEDFGKLCDHLMKVLHSPRAEEK--YPIFSISE 432
>gi|339252578|ref|XP_003371512.1| cysteine protease ATG4B [Trichinella spiralis]
gi|316968242|gb|EFV52545.1| cysteine protease ATG4B [Trichinella spiralis]
Length = 414
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 115/378 (30%), Positives = 177/378 (46%), Gaps = 63/378 (16%)
Query: 66 STSDIWLLGVCHKIAQDE-------ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
S S IWLLG + A++E + + L++F +DF +RI +YR GF I
Sbjct: 45 SHSPIWLLG--KQYAKNEPRPNLRRGFDENSAVGKLSDFLEDFRTRIWFTYRHGFPCIPG 102
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDREYVEILHLFGDSET 175
+K +D GWGC +RS QML+A+ +L H LGR W + L + + +++ LF D+ T
Sbjct: 103 TKFDNDCGWGCTIRSGQMLLAETMLRHYLGRDWLLGQSGLPEDEALMHRKVIGLFCDNLT 162
Query: 176 SPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
SPFS+HNL+Q G+ +G AGSW GP ++ + + +A E GL +A++V+
Sbjct: 163 SPFSLHNLVQVGQQLFGKQAGSWYGPVSVLQILQ-VAMNNAIERGL---VEGLAVHVIGD 218
Query: 235 DE----DGERGG-----APV----------------VCIDDASRHCSV------------ 257
E D ER G APV D R SV
Sbjct: 219 GELIIDDVERLGCGLTLAPVPRRGPENDLADRQPKSSSYLDLRRLTSVSNGDLLPSHDGE 278
Query: 258 ------FSKGQADWTP-ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
F W+ +L+L+PL LG+EK N Y L+ + +G++GG+
Sbjct: 279 SIGSTEFVDETRSWSRGVLVLLPLRLGVEKFNQLYSDHLKRVLSTKFCVGVIGGRHHKCY 338
Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
Y G + I LDPH QP ++ + + ++H + + IDP +IGFY R
Sbjct: 339 YFCGWHTDYLIRLDPHYSQPAVDATQPGVS--LHSFHCKYPKKTLIADIDPWCSIGFYIR 396
Query: 371 DKDDFDDFCARASKLAEE 388
++ + F A S++ E
Sbjct: 397 NRLELQSFLADISEVGFE 414
>gi|396482697|ref|XP_003841525.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
gi|312218100|emb|CBX98046.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
Length = 462
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 109/353 (30%), Positives = 152/353 (43%), Gaps = 82/353 (23%)
Query: 88 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVG 126
A N + F DF SRI ++YR GF I S+ TSD G
Sbjct: 87 AQYGNWPSAFLDDFESRIWMTYRSGFPVIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
+GCM+RS Q ++A AL RLGR WR P +E+ IL LF D +PFSIH ++
Sbjct: 147 FGCMIRSGQCILANALQTLRLGRDWRY-QDDPTAQEHCNILSLFADDPQAPFSIHRFVEH 205
Query: 187 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G A G G W GP A R + L + E GL +YV SGD GA V
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLVH-KYKEAGL-------RVYV-SGD------GADV 250
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+
Sbjct: 251 Y--EDKLKQVAVEEDGE--WIPTLILVGTRLGIDKITPVYWEALKASLQMKQSMGIAGGR 306
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVI----NIGKDDLEA-------------------- 341
P AS Y V Q YLDPH +P + D+
Sbjct: 307 PSASHYFVATQANHFFYLDPHSTRPHLPYRPPTSSDETTTQLASSITSTSSSTTIVPSAS 366
Query: 342 ----------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
D S+ H+ IR + + +DPS+ + F ++D++ +
Sbjct: 367 SLPPRSPPEPSTYTLDDISSCHTRRIRRLQIREMDPSMLLAFLVTSQEDYEKW 419
>gi|410918329|ref|XP_003972638.1| PREDICTED: cysteine protease ATG4D-like [Takifugu rubripes]
Length = 499
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 115/387 (29%), Positives = 175/387 (45%), Gaps = 76/387 (19%)
Query: 77 HKIAQDEALGDAAGNNG---LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC---- 129
+KI+ LGD+ N + F F SRI ++YRK F + S T+D GWGC
Sbjct: 83 NKISPVTILGDSYLLNSEDEVERFRLAFVSRIWLTYRKEFPQLEGSTWTTDCGWGCMLRS 142
Query: 130 --MLRSSQMLV-----------AQAL------LFH-----RLG----------------- 148
ML + +LV AQ L +F R G
Sbjct: 143 GQMLLAQGLLVHLMPRGWTWPDAQPLTDVDLEVFRPRSPARAGGVPIPSFASPRGPSTPE 202
Query: 149 RPW----------RKPLQKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
RP +K L+ DR+ + +++ FGD T+PF IH L++ GK+ G A
Sbjct: 203 RPLLSEQATKCSRKKRLESVQDRQAEPTHQKLVFWFGDQPTAPFGIHQLVEIGKSAGKKA 262
Query: 195 GSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
G W GP + +A+AR + + +YV D + +C S+
Sbjct: 263 GDWYGPAIVAHILRKAVARASAVHS--------LVVYVAQ-DCTVYKEDVMHLCDPTPSQ 313
Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
S QA W +++LVP+ LG E +NP YI ++ +GI+GGKP S Y V
Sbjct: 314 TPSDPLSHQA-WKSVIILVPVRLGGECLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFV 372
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
G Q+E +YLDPH QPV+++ + + + ++H + + + + +DPS IGFY + K
Sbjct: 373 GFQDEQLLYLDPHYCQPVVDVSQ--VNSSLESFHCNAPKKMPFNRMDPSCTIGFYAKSKK 430
Query: 374 DFDDFC-ARASKLAEESNGAPLFTVTQ 399
DF+ C A + L+ PLFT +
Sbjct: 431 DFESLCSAVGTALSSSKERYPLFTFIE 457
>gi|393219109|gb|EJD04597.1| hypothetical protein FOMMEDRAFT_133827 [Fomitiporia mediterranea
MF3/22]
Length = 1147
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 135/298 (45%), Gaps = 62/298 (20%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 121
G N F DFSSR+ ++YR + PI D +
Sbjct: 335 GANWPPGFYSDFSSRVWLTYRSHYPPIRDQTLAQLEAEASGQIPLQPVSASPRKWHILGS 394
Query: 122 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDS 173
TSD GWGCMLR+ Q L+A AL+ LGR WR+P Q + + YV+IL F DS
Sbjct: 395 GEKGWTSDSGWGCMLRTGQSLLANALIHLHLGRDWRRPPQPVYTVDYATYVKILTWFFDS 454
Query: 174 ET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV 231
PFS+H + AGK G G W GP + + + AE GLG S+ V
Sbjct: 455 TDIHCPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTVVHA-FAEAGLGV-SVATDGVV 512
Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---------------W--TPILLLVPL 274
D P + RH + + + W P+L+LV +
Sbjct: 513 YETDVLAASNAGPYMY-----RHSRMATSSPSTRRRRSAQQQQSMMSIWGQRPVLVLVGI 567
Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
LG++ VNP Y ++ FTFPQS+GI GG+P +S Y VGVQ ++ YLDPH +P +
Sbjct: 568 RLGIDCVNPVYYDAVKALFTFPQSVGIAGGRPSSSYYFVGVQTDNLFYLDPHHSRPSV 625
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 29/42 (69%)
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
T+H D +R + L S+DPS+ IGF CRD+ D+ D R ++++
Sbjct: 728 TFHCDRVRKMPLSSLDPSMLIGFLCRDERDWKDLRERVTEMS 769
>gi|403296347|ref|XP_003939073.1| PREDICTED: cysteine protease ATG4D [Saimiri boliviensis
boliviensis]
Length = 463
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 161/376 (42%), Gaps = 96/376 (25%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 109 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 162
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPF---------------- 159
GWGCMLRS QM++AQ LL H L R W L P
Sbjct: 163 CGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGPASPSRYHGPARWMPPCW 222
Query: 160 ---------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
+R + +I+ F D +PF +H L++ G++ G AG W GP +
Sbjct: 223 AQGAPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-------SLV 275
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
A R + + +YV S+ C+ G+ TP L
Sbjct: 276 AHILRKAVESSSEVTRLVVYV--------------------SQDCT----GKGTCTPSLQ 311
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
+ LR LGI+GGKP S Y +G Q++ +YLDPH QP
Sbjct: 312 EL----------------LRCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP 351
Query: 331 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 390
+++ + + + ++H R + +DPS +GFY D+ +F+ C+ +++ S+
Sbjct: 352 TVDVSQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSS 409
Query: 391 GA---PLFTVTQTHKK 403
P+FT+ + H +
Sbjct: 410 ATERYPMFTLAEGHAQ 425
>gi|355703136|gb|EHH29627.1| Cysteine protease ATG4D [Macaca mulatta]
Length = 511
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 162/382 (42%), Gaps = 80/382 (20%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 129 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 182
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPW-------------------------------RK 153
GWGCMLRS QM++AQ LL H L R W R
Sbjct: 183 CGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRW 242
Query: 154 PLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
P +R + +I+ F D +PF +H L++ G++ G AG W GP +
Sbjct: 243 AQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLV 295
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS------KGQAD 264
A R C + + VS D +PV + + + +
Sbjct: 296 AHILRKAVE-SCSEVTRLVVYVSQDCTAAEASSPVSDTPASGPLHLLPLLLGVLFQQRCR 354
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W + L+ L LGI+GGKP S Y +G Q++ +YLD
Sbjct: 355 WLFVCELLRCEL---------------------CLGIMGGKPRHSLYFIGYQDDFLLYLD 393
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 394 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 451
Query: 385 LAEESNGA---PLFTVTQTHKK 403
+ S+ P+FT+ + H +
Sbjct: 452 VLGSSSATERYPMFTLAEGHAQ 473
>gi|426329870|ref|XP_004025954.1| PREDICTED: cysteine protease ATG4C [Gorilla gorilla gorilla]
Length = 491
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 116/452 (25%), Positives = 176/452 (38%), Gaps = 116/452 (25%)
Query: 65 SSTSDIWLLGVCHKIA---QDEALGDAAG--------NNGLAEFNQDFSSRILISYRKGF 113
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPTESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNYDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 302 VGGKPGASTYIVGVQE----------------ESAIYLDPHDVQPVINIGKDDLEADT-- 343
+GGKP S Y G QE ++ + L+ + +P + G +D +
Sbjct: 323 IGGKPKQSYYFAGFQENEVQRSSMNSLKQKSSKNNLKLEGSEKRPQMGFGSEDEFKNILL 382
Query: 344 -------------STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 390
T+H + + +DPS IGFYCR+ DF+ +K+ + S+
Sbjct: 383 DHVQAFGPPSYPRLTFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFERASEEITKMLKFSS 442
Query: 391 GA--PLFTVTQTHKK-------PVNHSDVLGE 413
PLFT H + N D+ E
Sbjct: 443 KEKYPLFTFVNGHSRDYDFTSTTTNEEDLFSE 474
>gi|392572178|gb|EIW65350.1| hypothetical protein TRAVEDRAFT_33890 [Trametes versicolor
FP-101664 SS1]
Length = 997
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 91/282 (32%), Positives = 133/282 (47%), Gaps = 58/282 (20%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 123
F DF+SRI ++YR F PI D+ + T+
Sbjct: 298 FYADFTSRIWLTYRSQFFPIRDTTLAALDAELMDNPTGVPSSPPTKKWNWPLGGEKGWTT 357
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 178
D GWGCMLR+ Q L+A AL+ LGR WR+P + +Y V+I+ F D+ + PF
Sbjct: 358 DAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQIVTWFLDNPSPLCPF 417
Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
S+H + GK G G W GP + + L + P A V+ DG
Sbjct: 418 SVHRMALVGKDLGKDVGQWFGPSTAAGAIKTL-----------VHAFPEATLGVANAVDG 466
Query: 239 ERGGAPVVCIDDASRHCSVFSK----GQADW--TPILLLVPLVLGLEKVNPRYIPTLRLT 292
+ V ASR ++ + DW +L+L+ + LG+E VNP Y T++
Sbjct: 467 TLYESDVYA---ASRSVMYSTRRHGHARMDWGDRAVLVLIGIRLGIEGVNPLYYNTIKTL 523
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
+TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + +
Sbjct: 524 YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPL 565
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
+ + T+H D +R + L +DPS+ +GF C+D+ ++ D R ++L N +F++
Sbjct: 696 QTELKTFHCDRVRKMPLSGLDPSMLLGFLCKDEAEWLDLKDRIAELFR--NNKSIFSLAN 753
Query: 400 THKKPVNHSDVLG 412
+ + SD +G
Sbjct: 754 EPPQYPSDSDDMG 766
>gi|426191859|gb|EKV41798.1| hypothetical protein AGABI2DRAFT_123279 [Agaricus bisporus var.
bisporus H97]
Length = 1261
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 101/292 (34%), Positives = 134/292 (45%), Gaps = 65/292 (22%)
Query: 97 FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 122
F DF SRI ++YR F PI DS +T
Sbjct: 247 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 306
Query: 123 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSE 174
SD GWGCMLR+ Q L+A AL+ LGR WRKP + +Y V+IL F D+
Sbjct: 307 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 366
Query: 175 T--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
+ +PFS+H + AGK +G G W GP + + L P + V
Sbjct: 367 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 415
Query: 233 SGDEDGERGGAPVVCIDDA-------SRHCSVFSKGQA-DW--TPILLLVPLVLGLEKVN 282
S +DG V A + S S QA W P+L+LV L LG++ VN
Sbjct: 416 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 475
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
P Y T++ FT PQS+GI GG+PG+S Y VG Q ++ YLDPH +P I +
Sbjct: 476 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 527
>gi|409077121|gb|EKM77488.1| hypothetical protein AGABI1DRAFT_108018 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1355
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 97/292 (33%), Positives = 133/292 (45%), Gaps = 65/292 (22%)
Query: 97 FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 122
F DF SRI ++YR F PI DS +T
Sbjct: 334 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 393
Query: 123 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSE 174
SD GWGCMLR+ Q L+A AL+ LGR WRKP + +Y V+IL F D+
Sbjct: 394 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 453
Query: 175 T--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
+ +PFS+H + AGK +G G W GP + + L P + V
Sbjct: 454 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 502
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQAD--------W--TPILLLVPLVLGLEKVN 282
S +DG V A + + ++ W P+L+LV L LG++ VN
Sbjct: 503 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 562
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
P Y T++ FT PQS+GI GG+PG+S Y VG Q ++ YLDPH +P I +
Sbjct: 563 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 614
>gi|302674653|ref|XP_003027011.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
gi|300100696|gb|EFI92108.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
Length = 858
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 94/275 (34%), Positives = 132/275 (48%), Gaps = 58/275 (21%)
Query: 88 AAGNNGLAEFNQDFSSRILISYRKGFDPI------------------------GDSKITS 123
AA + EF DF+SR+ ++YR GF PI G +TS
Sbjct: 148 AAASGWPQEFFSDFASRLWLTYRSGFAPIRDMALEELEPVRGGALSTLTSALTGRRGLTS 207
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIH 181
D GWGCMLR+ Q L+A AL+ +GR Y+ ++ LF DS + +PFS+H
Sbjct: 208 DAGWGCMLRTGQSLLANALVVAWMGRGALA--------LYIHLISLFLDSPSPSAPFSVH 259
Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
+ AG+A G G W GP + +AL + GLG V+ EDG
Sbjct: 260 RMALAGRALGKDVGQWFGPSTAAGAIKALVNAY-PDAGLG----------VAIAEDG--- 305
Query: 242 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
V R + + +W P+L+L+ + LGL+ VNP Y T++ +TFPQSL
Sbjct: 306 ----VVYQTQRRQ----KEREREWGDQPVLVLLGIRLGLDGVNPIYYDTIKQLYTFPQSL 357
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
GI GG+P +S Y VG Q YLDPH +P + +
Sbjct: 358 GIAGGRPSSSYYFVGAQAGDLFYLDPHHARPTVPL 392
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 33/47 (70%)
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
A+T T+H + +R + + +DPS+ IGF C+D+ D++D+ R SKL +
Sbjct: 537 AETRTFHCERVRKMPMSGLDPSMLIGFLCKDRADWEDWRTRVSKLPK 583
>gi|410989159|ref|XP_004000832.1| PREDICTED: cysteine protease ATG4A isoform 2 [Felis catus]
Length = 336
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 92/333 (27%), Positives = 147/333 (44%), Gaps = 69/333 (20%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG-GKPGA 308
D + C V P S VG PG
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTVGESTPG- 202
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
T Q + I+LDPH Q +N ++ D + + + +++ ++DPS+A+GF+
Sbjct: 203 -TLNASNQSDELIFLDPHTTQTFVNTEENGTVDDQTFHCLQSPQRMNILNLDPSVALGFF 261
Query: 369 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
C+++ DFD++C+ K + N +F + Q H
Sbjct: 262 CKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|350595874|ref|XP_003484197.1| PREDICTED: cysteine protease ATG4A [Sus scrofa]
Length = 393
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 158/347 (45%), Gaps = 68/347 (19%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PI W
Sbjct: 57 VWILGKQHLLKTEKS-----------KLLADISARLWFTYRRKFSPID---------WN- 95
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
W K ++P +EY IL F D + +SIH + Q G
Sbjct: 96 ---------------------WEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGVG 132
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGERGGAPV 245
G + G W GP + + + LA + +A+YV + ED ++
Sbjct: 133 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCCAS 184
Query: 246 VCIDDA-------SRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
DA S + S SKG + W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 185 ALSADAAVESRRDSLNASTQSKGPSACRPAWKPLLLIVPLRLGINQINPVYVDAFKECFK 244
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D + + + +
Sbjct: 245 MPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQPPQRM 304
Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
++ ++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 305 NILNLDPSVALGFFCQEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 350
>gi|116179672|ref|XP_001219685.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
gi|88184761|gb|EAQ92229.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
Length = 425
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 142/307 (46%), Gaps = 73/307 (23%)
Query: 97 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
F DF SRI ++YR GF+PI GD + +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 192
Q L+A ALL +LGR WR+ +R I+ LF D +P+S+ N ++ G A G
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAER---NIVALFADDARAPYSLQNFVKHGAIACGK 229
Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
G W GP A R +ALA + + IY G P V D
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269
Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
S + + D + PTL L QS+GI GG+P +S Y
Sbjct: 270 ---SFLATARPD-----------------GETFHPTLIL---MEQSIGIAGGRPSSSHYF 306
Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
VGVQ + YLDPH +P + ++ L + + H+ +R++H++ +DPS+ IGF
Sbjct: 307 VGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSCHTRRLRYLHVEDMDPSMLIGFLI 366
Query: 370 RDKDDFD 376
+D+DD+D
Sbjct: 367 QDEDDWD 373
>gi|302684483|ref|XP_003031922.1| hypothetical protein SCHCODRAFT_109321 [Schizophyllum commune H4-8]
gi|300105615|gb|EFI97019.1| hypothetical protein SCHCODRAFT_109321, partial [Schizophyllum
commune H4-8]
Length = 602
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 144/310 (46%), Gaps = 82/310 (26%)
Query: 69 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------- 121
+IWL+GVCH G +F DF++RI ++YR GF+ I D ++
Sbjct: 114 EIWLMGVCHA-------------PGAPDFYADFATRIWLTYRSGFELIRDRQLIDLPPPV 160
Query: 122 ------------------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 157
+SD GWGCMLR+ Q L+A ALL GR WR+ +
Sbjct: 161 ASLDGHLQGEWATDEAEPPGAYGFSSDSGWGCMLRTGQSLLANALLTAWFGRDWRRISEV 220
Query: 158 PFDRE--YVEILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
+ YV +L LF D+ T+PFSIH + AGK G G W GP + + L
Sbjct: 221 ETHQHSLYVHLLSLFLDTPHPTAPFSIHRMALAGKQLGKDIGQWFGPSTAAGAIKNL--- 277
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT------- 266
+ P+A G VV +D A VF+ ++W+
Sbjct: 278 --------VSAYPLA------------GIGVVVGMDGALSKSEVFTASHSEWSDEEAALD 317
Query: 267 ----PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
P+L+L+ L LGL++VNP Y T++ FTFPQS+GI GG+P +S + VG Q IY
Sbjct: 318 WGDRPVLILLNLRLGLDRVNPIYHDTIKALFTFPQSVGIAGGRPCSSYHFVGAQGSDLIY 377
Query: 323 LDPHDVQPVI 332
LDPH + +
Sbjct: 378 LDPHHTRNTV 387
Score = 41.6 bits (96), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
AD +T+H + + + + DPS+ GF C+D D+DD+ AR S+L +
Sbjct: 524 HADLATFHCTNPKMMPISAQDPSMLAGFLCKDIADWDDWRARMSRLPNQ 572
>gi|449551395|gb|EMD42359.1| ATG4-like protein [Ceriporiopsis subvermispora B]
Length = 988
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 94/272 (34%), Positives = 126/272 (46%), Gaps = 57/272 (20%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 124
F DF+SRI ++YR F PI D+ + TSD
Sbjct: 305 FYSDFTSRIWVTYRSQFQPIRDTTLSALELELGESTAVATSPQPKKWNWPLGGEKGWTSD 364
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD------REYVEILHLFGDSETS-- 176
GWGCMLR+ Q L+A LL LGR WR+P P+ YV+IL F D+ +
Sbjct: 365 AGWGCMLRTGQSLLANTLLHLHLGRDWRRP---PYPICTADYATYVQILTWFFDNPSPLC 421
Query: 177 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
PFS+H + GK G G W GP + + L E GLG S+ + D
Sbjct: 422 PFSVHRMALVGKELGKEVGQWFGPSTAAGAIKTLVHA-FPEAGLGV-SVATDSVIYQSD- 478
Query: 237 DGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
V S S G++ W +L+LV + LGL+ VNP Y T++ +T
Sbjct: 479 ---------VYTASRSNLGSPRRNGRSGWGDRAVLVLVGIRLGLDGVNPIYYDTIKALYT 529
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
FPQS+GI GG+P +S Y VG Q ++ YLDPH
Sbjct: 530 FPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 561
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 49/96 (51%), Gaps = 15/96 (15%)
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
T+H + IR + L +DPS+ IGF C+D++D+ D R + L+ T+ +P
Sbjct: 693 TFHCERIRKMPLSGLDPSMLIGFLCKDEEDWLDLRKRITDLSRTHK-----TIFSIQDEP 747
Query: 405 VN-HSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 439
N SD DD++G+ S+++ + ED+
Sbjct: 748 PNWPSD---------SDDNMGLESISEPDIDMPEDE 774
>gi|431905146|gb|ELK10197.1| Cysteine protease ATG4A [Pteropus alecto]
Length = 342
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 93/333 (27%), Positives = 150/333 (45%), Gaps = 69/333 (20%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS-SRILISYRKGFDPIGDSKITSDVGWG 128
+W+LG H + D L E F+ + L ++ G P +SD GWG
Sbjct: 35 VWILGKQHLLKTD----------SLPEIISHFTETSELTAHDGGTGP------SSDAGWG 78
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
CMLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + +
Sbjct: 79 CMLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKM-- 136
Query: 189 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
C LP++ + + + G
Sbjct: 137 ---------------------------------CCILPLSADIATENPSGS--------- 154
Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
+AS H S W P+LL+VPL LG+ ++NP Y+ + SLG +GGKP
Sbjct: 155 PNASNHSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFK-------SLGALGGKPNN 207
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+ Y +G + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+
Sbjct: 208 AYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILNLDPSVALGFF 267
Query: 369 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
C+++ DFD +C+ K + N +F + Q H
Sbjct: 268 CKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 299
>gi|294654609|ref|XP_456671.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
gi|218511938|sp|Q6BYP8.2|ATG4_DEBHA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|199429011|emb|CAG84627.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
Length = 492
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/343 (28%), Positives = 156/343 (45%), Gaps = 79/343 (23%)
Query: 87 DAAGNNGLAEFNQDFSSRILISYRKGFDPIG----------------------------- 117
D + ++G+ E QD S+I ++YR GF+PI
Sbjct: 77 DISVDDGVIE--QDIYSKIWLTYRTGFEPIAKCLDGPQPLSFVQSMVFNRNPISSTFNNF 134
Query: 118 -----DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW-----RKPLQKPFDREYVEIL 167
+ T+DVGWGCM+R+SQ L+A LGR + R P + EI+
Sbjct: 135 HGLLDNDNFTTDVGWGCMIRTSQALLANTYQLLFLGRGFSYGRDRSP-------RHDEII 187
Query: 168 HLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
+F D +PFS+HN ++ L G W GP A S + L C +
Sbjct: 188 DMFMDEPRAPFSLHNFIKVASESPLKVKPGQWFGPNAASLSIKRL-----------CDN- 235
Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP----ILLLVPLVLGLEKV 281
+Y +G G VV + ++ + + ++ P IL+L+P+ LG++KV
Sbjct: 236 ---VYESNG-----TGRVKVVISESSNLYDDIITQMFTTLNPVPDAILVLLPVRLGIDKV 287
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
NP Y ++ QS+GI GGKP +S Y G + +YLDPH Q V N +
Sbjct: 288 NPLYHASVLELLALRQSVGIAGGKPSSSFYFFGYKGNDLLYLDPHYPQFVRN-----KTS 342
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
TYH++ + + +D +DPS+ IG +D +D++DF + +K
Sbjct: 343 VYDTYHTNSYQKLSVDDMDPSMMIGILIKDINDYEDFKSSCTK 385
>gi|19115683|ref|NP_594771.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe
972h-]
gi|62899818|sp|Q9P373.1|ATG4_SCHPO RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|9588465|emb|CAC00556.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe]
Length = 320
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/341 (29%), Positives = 144/341 (42%), Gaps = 53/341 (15%)
Query: 48 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 107
M R ER L + T + IW LG +KI + +F D S I I
Sbjct: 4 MARFLERYLHFAPTNTEPPGTLIWFLGHSYKIEDSQ---------WPEKFLYDSFSLITI 54
Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
+YR G + G +TSD GWGCM+RS+Q L+A L R+ P +++ EIL
Sbjct: 55 TYRSGIE--GLENMTSDTGWGCMIRSTQTLLANCL---RICYP---------EKQLKEIL 100
Query: 168 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
LF D ++PFSIH + GK + G W GP C +AR +P
Sbjct: 101 ALFADEPSAPFSIHQFVTMGKTLCDINPGQWFGPTTSC---SCVARLSDQNP-----DVP 152
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 286
+ +YV R V P+LLL+P LG++ +N Y
Sbjct: 153 LHVYVARNGNAIYRDQLSKVSF------------------PVLLLIPTRLGIDSINESYY 194
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
L F +GI GG+P ++ Y Q + YLDPH + A T+
Sbjct: 195 DQLLQVFEIRSFVGITGGRPRSAHYFYARQNQYFFYLDPHCTHFAHTTTQ---PASEETF 251
Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
HS +R + + +DP + GF RD++++ F A A+
Sbjct: 252 HSATLRRVAIQDLDPCMIFGFLIRDEEEWHSFEANQKYFAD 292
>gi|444518589|gb|ELV12252.1| Cysteine protease ATG4B, partial [Tupaia chinensis]
Length = 324
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 87/300 (29%), Positives = 130/300 (43%), Gaps = 56/300 (18%)
Query: 66 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
++ +W+LG + + ++ E D +SR+ +YRK F IG + TSD
Sbjct: 26 TSEPVWILGRKYSVLTEKE-----------EILSDVASRLWFTYRKNFPAIGGTGPTSDT 74
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
GWGCMLR QM+ AQAL+ LGR WR Y +L+ F D + S +SIH + Q
Sbjct: 75 GWGCMLRCGQMIFAQALVCRHLGRDWRWAQWTQQPDSYFNVLNAFIDRKDSYYSIHQIAQ 134
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G + G W GP + + + LA
Sbjct: 135 MGVGEGKSIGQWYGPNTVAQVLKKLA---------------------------------- 160
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
VF + I + +V G +N Y+ TL+ F PQSLG++GGK
Sbjct: 161 -----------VFDTWSSLAVHIAMDNTVVTGEININEAYVETLKHCFMMPQSLGVIGGK 209
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P ++ Y +G + IYLDPH QP + + L D S + + + +DPS+A+
Sbjct: 210 PNSAHYFIGYVGDELIYLDPHTTQPAVELTDSCLVPDESFHCQHPPSRMSIRELDPSIAV 269
>gi|395854620|ref|XP_003799780.1| PREDICTED: cysteine protease ATG4A isoform 2 [Otolemur garnettii]
Length = 336
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 145/332 (43%), Gaps = 67/332 (20%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G P S
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTAGESPPGS 203
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+ Q I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 204 LTALN-QSNELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+++ DFD++C+ K + N +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|402911089|ref|XP_003918175.1| PREDICTED: cysteine protease ATG4A isoform 2 [Papio anubis]
Length = 336
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRP-LD 202
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ + D + + + +++ ++DPS+A+GF+C
Sbjct: 203 YLTASNQSDELIFLDPHTTQTFVDTEENGMVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+++ DFD++C+ K + N +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|448114689|ref|XP_004202639.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
gi|359383507|emb|CCE79423.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
Length = 480
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/330 (29%), Positives = 151/330 (45%), Gaps = 60/330 (18%)
Query: 85 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 116
LG G++ E +D SRI +YR GF+PI
Sbjct: 69 LGRRYGSSSKEEMEKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128
Query: 117 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
+ T+DVGWGCM+R+SQML+A A LGR + ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAFQLLLLGRDF--AYVDGSEKKHSDIIDMF 186
Query: 171 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
D +PFS+HN ++A L G W GP A S + L C+ G S +
Sbjct: 187 TDEPKTPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRL--CKSQFDGSVSPSFRVI 244
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
I S D ++ G + I+++ IL+L+P+ LGL KV+P Y +
Sbjct: 245 I-SESCDIYDDKIGKLLQEIENSE-------------DAILILLPVRLGLNKVSPYYHDS 290
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L F Q +GI GGKP +S Y G +YLDPH Q + + T+H+
Sbjct: 291 LSSLFCSSQLVGIAGGKPSSSYYFFGSHNGHLLYLDPHYPQSM------KASSIYDTFHT 344
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+ ++ + ++ +DPS+ IG + K+D++ F
Sbjct: 345 NKVQSLKIEDMDPSMLIGILIKSKEDYESF 374
>gi|119623101|gb|EAX02696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_g
[Homo sapiens]
Length = 340
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 184
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 185 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 207
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 208 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 266
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+++ DFD++C+ K + N +F + Q H
Sbjct: 267 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 297
>gi|332226094|ref|XP_003262224.1| PREDICTED: cysteine protease ATG4A isoform 2 [Nomascus leucogenys]
Length = 336
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 181 DIKKMCCV-------------------------------------LPLSADTAGDRPPDS 203
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+++ DFD++C+ K + N +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|336381646|gb|EGO22797.1| cysteine protease required for autophagy [Serpula lacrymans var.
lacrymans S7.9]
Length = 992
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 98/284 (34%), Positives = 134/284 (47%), Gaps = 49/284 (17%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 121
G+N F DF+SRI ++YR F PI DS +
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350
Query: 122 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGD 172
TSD GWGCMLR+ Q L+A ALL LGR WR+P +Y V+I+ F D
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPVHTTDYATYVQIITWFFD 410
Query: 173 --SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 230
S SPFS+H + AGK G G W GP + + L E GLG +
Sbjct: 411 TPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGVI 469
Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
S + A I RH V G+A +++L+ + LGL+ VNP Y T++
Sbjct: 470 FQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTIK 520
Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
+TFPQS+GI GG+P +S Y +G Q ++ YLDPH +P + +
Sbjct: 521 ALYTFPQSVGIAGGRPSSSYYFMGSQADNLFYLDPHHARPAVPL 564
Score = 44.7 bits (104), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 41/84 (48%), Gaps = 6/84 (7%)
Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
G E + LDP V D L T+H D +R + + +DPS+ +GF C+D++
Sbjct: 681 GDSEGAGEALDPMAEHYVNAYSPDQLR----TFHCDRVRKMPMSGLDPSMLLGFLCKDEN 736
Query: 374 DFDDFCARASKLAEESNGAPLFTV 397
D+ DF R + L +FTV
Sbjct: 737 DWFDFRRRVNDLMHRHKT--IFTV 758
>gi|30795248|ref|NP_840054.1| cysteine protease ATG4A isoform b [Homo sapiens]
gi|426397038|ref|XP_004064735.1| PREDICTED: cysteine protease ATG4A isoform 2 [Gorilla gorilla
gorilla]
gi|15487242|emb|CAC69077.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
gi|119623095|gb|EAX02690.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 336
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 203
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+++ DFD++C+ K + N +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|409050837|gb|EKM60313.1| hypothetical protein PHACADRAFT_179659 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1009
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 96/288 (33%), Positives = 131/288 (45%), Gaps = 57/288 (19%)
Query: 97 FNQDFSSRILISYRKGFDPI-------------------------------GDSKITSDV 125
F DF+SRI ++YR F PI GD +SD
Sbjct: 308 FYADFTSRIWLTYRSQFLPIRDMSLEELNAAPESAALSTGSQAKKWSWSLSGDKCWSSDA 367
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFSI 180
GWGCMLR+ Q L+A AL+ LGR WRKP +Y ++I+ F D + PFS+
Sbjct: 368 GWGCMLRTGQSLLANALIHVHLGRDWRKPPHPVPTSDYATYIQIITWFFDDPSLLCPFSV 427
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMC------RSWEALARCQRAETGLGCQSLPMA---IYV 231
H + GK G+ G W GP + ++ Q A L + P A IYV
Sbjct: 428 HRMALVGKQLGVKVGQWFGPSTAAGAIKYVSAHSSMVPNQPARRTL-VHAFPEAGLGIYV 486
Query: 232 VSGD---EDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYI 286
+ D E A I RH W P+L+L+ LG++ VNP Y
Sbjct: 487 AADGGTIYDSEVFAASHSGIGSPRRHTRRV------WGDRPVLILIGHRLGIDGVNPIYY 540
Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
TL+ +T+PQS+GI GG+P +S Y VG Q ++ YLDPH +P I +
Sbjct: 541 DTLKTLYTWPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPTIPL 588
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 36/51 (70%), Gaps = 1/51 (1%)
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 395
T+H D +R + L S+DPS+ IGF C+D+ ++ D +R ++L+ +S +P+F
Sbjct: 728 TFHCDRVRKMPLSSLDPSMLIGFLCKDESEWQDLKSRINELSRKSK-SPVF 777
>gi|397497902|ref|XP_003819742.1| PREDICTED: cysteine protease ATG4A isoform 2 [Pan paniscus]
Length = 336
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 203
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+++ DFD++C+ K + N +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|348520913|ref|XP_003447971.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
Length = 500
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 120/242 (49%), Gaps = 13/242 (5%)
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ ++ FGD +PF +H L+ GK G AG W GP +A R
Sbjct: 232 HSRLVTWFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGP-------SVVAHILRKAVDKTS 284
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+A+YV +D VV + D S + + DW +++LVP+ LG E +N
Sbjct: 285 VVTNLAVYVA---QDCTVYKEDVVRLCDRSLNQTSSDPSSQDWKSVIILVPVRLGGEALN 341
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
P YI ++ +GI+GGKP S Y +G Q+E +YLDPH QPV+++ + + +
Sbjct: 342 PSYIDCVKNFLKLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDVSQINFSLE 401
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQTH 401
++H + + + +DPS IGFY ++K DF+ C+ S+ L+ P+FT + H
Sbjct: 402 --SFHCSSPKKMPFNRMDPSCTIGFYAKNKKDFESLCSAVSEALSSSKEKYPVFTFVEGH 459
Query: 402 KK 403
+
Sbjct: 460 SQ 461
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 38/61 (62%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+ F F SRI ++YR+ F + S T+D GWGCMLRS QML+AQ LL H + R W
Sbjct: 104 VERFRLAFVSRIWLTYRREFPQLEGSTWTTDCGWGCMLRSGQMLLAQGLLVHLMPRDWVW 163
Query: 154 P 154
P
Sbjct: 164 P 164
>gi|260949671|ref|XP_002619132.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
gi|238846704|gb|EEQ36168.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
Length = 340
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 147/318 (46%), Gaps = 63/318 (19%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIG------------------------------DSKITS 123
L E +SR+ +YR GF+PI + ++
Sbjct: 52 LEEIYPVINSRLWFTYRAGFEPIQKAEDGPSPLAFLKSMIFNVRPSMALGGLFDNQNYST 111
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHN 182
DVGWGCM+R+SQ L+A AL LGR + P E VE I+ LFGD T PFS+HN
Sbjct: 112 DVGWGCMIRTSQSLLANALQMLILGRDHQSPQAIQSAPEKVEKIIQLFGDDYTCPFSLHN 171
Query: 183 LLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
++ A L G W GP A S + L C + E+ ++ ++I D E
Sbjct: 172 FIKVASASPLKVKPGEWFGPSAASLSIKRL--CAKFESN-EIPNINVSICESCNLYDEEI 228
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
G +F + + +P+L+L PL LG++K+N Y P+L QS+G
Sbjct: 229 RG--------------IFEESE---SPLLILFPLRLGIDKINSIYYPSLLQLLALKQSVG 271
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
I GGKP +S Y G Q + +YLDPH++Q +D TYH+ + + + ++D
Sbjct: 272 IAGGKPSSSYYFFGFQGSNLLYLDPHNLQAA--------SSDPGTYHTSKFQTLSISNLD 323
Query: 361 PSLAIGFYCRDKDDFDDF 378
P A + ++ +DD+
Sbjct: 324 PLNAC--WSVNQMTYDDY 339
>gi|403289553|ref|XP_003935916.1| PREDICTED: cysteine protease ATG4A isoform 2 [Saimiri boliviensis
boliviensis]
Length = 360
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 205 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 227
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+ + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 228 -LTASNESDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 286
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+++ DFD++C+ K + N +F + Q H
Sbjct: 287 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 317
>gi|170109871|ref|XP_001886142.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
gi|164639072|gb|EDR03346.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
Length = 1039
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 97/278 (34%), Positives = 138/278 (49%), Gaps = 51/278 (18%)
Query: 97 FNQDFSSRILISYRKGFD-PIGDSKI-------------------------------TSD 124
F DF+SRI ++YR F PI D+++ +SD
Sbjct: 336 FYIDFTSRIWLTYRSHFPTPIKDTRLADLCGDAAPEIANSPTTVKTRPWNWGGEKTWSSD 395
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDREYVEILHLFGDSET--SPFS 179
GWGCMLR+ Q L+A AL+ LGR WR+P +Q YV+I+ F D+ +PFS
Sbjct: 396 TGWGCMLRTGQSLLANALVHMHLGRDWRRPPYPVQTADYATYVQIVTWFLDTPAPEAPFS 455
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
+H + AGK +G G W GP + + L E+GLG VS DG
Sbjct: 456 VHRMALAGKEFGTDVGQWFGPSVAAGAIKTLVNS-FPESGLG----------VSVATDGT 504
Query: 240 RGGAPVVCI---DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
+ V + + +SR P+LLL+ + LG+E VNP Y T++L +TFP
Sbjct: 505 LFQSDVFAVSHGEMSSRSPRRIKTTTWGHRPVLLLLGIRLGIEGVNPIYYETIKLLYTFP 564
Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
QS+GI GG+P +S Y VG Q ++ YLDPH+ +P I +
Sbjct: 565 QSVGIAGGRPSSSYYFVGSQADNLFYLDPHNTRPAIPL 602
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 29/43 (67%)
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
T+H + +R + L +DPS+ IGF CRD+ ++ DF R ++L +
Sbjct: 739 TFHCERVRKMPLSGLDPSMLIGFLCRDEAEWWDFKKRVAELPK 781
>gi|296236154|ref|XP_002763201.1| PREDICTED: uncharacterized protein LOC100409486 [Callithrix
jacchus]
Length = 360
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 205 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 227
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
+E I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 228 LTASNRSDE-LIFLDPHTTQTFVDAEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 286
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+++ DFD++C+ K + N +F + Q H
Sbjct: 287 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 317
>gi|14041938|dbj|BAB55042.1| unnamed protein product [Homo sapiens]
Length = 280
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 84/254 (33%), Positives = 123/254 (48%), Gaps = 25/254 (9%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEGLIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 353 HIHLDSIDPSLAIG 366
+ + +DPS+A+G
Sbjct: 233 RMSIAELDPSIAVG 246
>gi|432845798|ref|XP_004065858.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
Length = 497
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 120/233 (51%), Gaps = 13/233 (5%)
Query: 165 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 224
+++ LFGD +PF +H L+ GK G AG W GP + + R A+T +G QS
Sbjct: 231 KLVTLFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGPSVVAH----ILRKAVAKTSVG-QS 285
Query: 225 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 284
L A+YV +D V+ + D S V W +++LVP+ LG E +NP
Sbjct: 286 L--AVYVA---QDCTVYKEDVLQLCDPSLSQRVADPSSQAWKSVIILVPVRLGGEALNPS 340
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
YI ++ + +GI+GGKP S Y +G Q+E +YLDPH QPV++ + + +
Sbjct: 341 YIECVKNILSLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDFTQANFSLE-- 398
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFT 396
++H + + +DPS IGFY R K+DF+ C+ L+ P+FT
Sbjct: 399 SFHCSSPKKMPFSRMDPSCTIGFYARTKEDFESMCSVVGMVLSSSKEKYPIFT 451
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 11/73 (15%)
Query: 65 SSTSDIWLLGVCHKI-AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
+ TS I++LG + + ++DE + F DF SRI ++YR+ F + S +T+
Sbjct: 87 NKTSPIFVLGHAYLLNSEDE----------VERFRLDFVSRIWLTYRREFPQLEGSTLTT 136
Query: 124 DVGWGCMLRSSQM 136
D GWGCMLRS QM
Sbjct: 137 DCGWGCMLRSGQM 149
>gi|302498547|ref|XP_003011271.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
benhamiae CBS 112371]
gi|291174820|gb|EFE30631.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
benhamiae CBS 112371]
Length = 437
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 140/316 (44%), Gaps = 85/316 (26%)
Query: 96 EFNQDFSSRILISYRKGFDPI--------GDSK-----------------ITSDVGWGCM 130
+F DF S++ I+YR F PI GDS TSD GWGCM
Sbjct: 145 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSISLGVRLRSQLIDTQGFTSDTGWGCM 204
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KA 189
+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G A
Sbjct: 205 IRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGATA 261
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCI 248
G G W GP A + +AL + + GL +Y+ S G + E+ V C
Sbjct: 262 CGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVACD 313
Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
+ P L+L+ + LG+++V P Y +L+ FPQS+GI G +
Sbjct: 314 ESG-----------GGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGPE--- 359
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+ STYH+ +R +H+ +DPS+ IGF
Sbjct: 360 ---------------------------------ELSTYHTRRLRRLHVREMDPSMLIGFL 386
Query: 369 CRDKDDFDDFCARASK 384
RD+DD++D R +
Sbjct: 387 VRDEDDWEDLKQRVRE 402
>gi|395545675|ref|XP_003774724.1| PREDICTED: cysteine protease ATG4A [Sarcophilus harrisii]
Length = 431
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 74/260 (28%), Positives = 126/260 (48%), Gaps = 15/260 (5%)
Query: 151 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------Y 201
W K ++P EY IL F D + +SIH + Q G G + G W GP
Sbjct: 137 WEKHQEQP--EEYQRILKCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 194
Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 261
A+ W +LA + + + + ++ D + + +D + C + G
Sbjct: 195 ALFDEWNSLAVYVSMDNTVVIEDIKKMCHMCPSDLTHDSSSSSYNGLD-WNTDCPGQTSG 253
Query: 262 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
W P+LL+VPL LG+ ++NP Y + F PQSLG +GGKP ++ Y +G + I
Sbjct: 254 ---WKPLLLIVPLRLGINQINPIYADAFKECFKMPQSLGALGGKPNSAYYFIGFLGDELI 310
Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
YLDPH Q ++ ++ D S + + + ++DPS+A+GF+C++++DFD++C
Sbjct: 311 YLDPHTTQTFVDTEENGTVNDQSFHCQQSPPRMKILNLDPSVALGFFCKEEEDFDNWCGL 370
Query: 382 ASKLAEESNGAPLFTVTQTH 401
K + +F + + H
Sbjct: 371 VQKEILKPQSLQMFELVEKH 390
>gi|388581514|gb|EIM21822.1| hypothetical protein WALSEDRAFT_68740 [Wallemia sebi CBS 633.66]
Length = 603
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 137/310 (44%), Gaps = 63/310 (20%)
Query: 85 LGDAAGNNGLAEFNQDFSSRILISYRKGF------DPIGDS------------------- 119
LG+ NN ++ DF SRI +YR F DP+ D
Sbjct: 55 LGNLYDNN--SDLLDDFQSRIWCTYRSNFCQISLNDPMMDDLGLAKMQTLSSKPSHWLLR 112
Query: 120 --KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF--------DREYV---EI 166
+D GWGCMLR+SQ L+A L LGR WR+ PF +EYV ++
Sbjct: 113 ERTFNTDQGWGCMLRTSQSLLANTLQIMLLGRQWRR---NPFVDLTDYAKRKEYVNLIKL 169
Query: 167 LHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 224
L+LF D S SPFS+H + GK+ G G W GP + + L Q + L S
Sbjct: 170 LNLFMDNPSTLSPFSVHRMAVVGKSLGKEVGEWFGPSTAALAIKHLVNNQ-TDINLSV-S 227
Query: 225 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVN 282
+ + D GG ++W P+L+LV + LGL+ ++
Sbjct: 228 VASDSVIYKSDVYQASGGTSTT--------------ADSEWGNKPVLILVGVRLGLDGIH 273
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
PRY TL+ +GI GG+P +S Y G Q +S Y+DPH ++P INI E +
Sbjct: 274 PRYYETLKAFLRMQSCVGIAGGRPSSSYYFFGYQSDSLFYVDPHIMKPTINIKTPPTEGE 333
Query: 343 TSTYHSDVIR 352
T +++R
Sbjct: 334 LKTEIENLLR 343
Score = 42.0 bits (97), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 37/62 (59%), Gaps = 5/62 (8%)
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
A STY D R +++ +DPS+ IGF +D+++F +F + +L ++ +F+V +
Sbjct: 470 ASISTYFCDKPRKMNISQMDPSMLIGFLVKDENEFFEFVNQIKELPQQ-----VFSVADS 524
Query: 401 HK 402
H+
Sbjct: 525 HR 526
>gi|241729578|ref|XP_002404604.1| cysteine protease, putative [Ixodes scapularis]
gi|215505492|gb|EEC14986.1| cysteine protease, putative [Ixodes scapularis]
Length = 433
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 117/410 (28%), Positives = 170/410 (41%), Gaps = 94/410 (22%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVG 126
IWLLGV + + G +A + A F+ +DFSSR+ +YR+ F I + I +D G
Sbjct: 36 IWLLGVIYHRKMTQFYGASAVVDDGASFDAFLEDFSSRLWFTYRREFPAIPGTDIRTDCG 95
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWR------------------KPLQKPF-----DREY 163
WGCMLRSSQM++AQA + H LGR WR PL++ F D
Sbjct: 96 WGCMLRSSQMILAQAFVMHLLGRQWRWQQVHTEAGEVRLPRHALWPLREGFRCTGGDGTA 155
Query: 164 VEIL----------HLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EAL 210
V + FGD ++PFS+HNL+Q G+ G AG W GP ++ +AL
Sbjct: 156 VLVRCSPKPVNDPPRWFGDKADASTPFSLHNLVQRGRESGKKAGDWYGPSSVAYILKDAL 215
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
+ L + IYV + +DD + CS S
Sbjct: 216 EDAAHRDQRLA----QLCIYVAQD---------CTIYMDDVTALCSAGSTEGV------- 255
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQS--------------LGIVGGKPGASTYI-VGV 315
+ PR + R F+ Q+ + K G S + +
Sbjct: 256 -------THRRLPRTVFARREMFSGGQTQRMCIHSSWLHLFVFFVCFLKYGISFLLQLSA 308
Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
EE IYLDPH Q ++++ D D ++H R + IDPS IGFYC+ K D
Sbjct: 309 AEEKVIYLDPHYCQEMVDVNSQDFPLD--SFHCSWPRKMSFSRIDPSCTIGFYCKTKHDL 366
Query: 376 DDFCARASKLA---EESNGAPLFTV--------TQTHKKPVNHSDVLGET 414
+DF +L + + P+F + T T K+P VL +
Sbjct: 367 EDFTKNIRELTVPKQMRHEYPVFLISEGSCSDHTDTEKRPEEIVHVLQDV 416
>gi|156042330|ref|XP_001587722.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980]
gi|154695349|gb|EDN95087.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 414
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 93/289 (32%), Positives = 134/289 (46%), Gaps = 34/289 (11%)
Query: 97 FNQDFSSRILISYRKGFDPIG---DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
F DF ++I ++YR F I D K S + LRS LV Q G W
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRS--QLVDQGGFTSDTG--WGC 158
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 212
E +IL LF D +P+SIH ++ G A G G W GP A AR
Sbjct: 159 SSSN----EERKILSLFADDPRAPYSIHKFVEHGASACGKHPGEWFGP-------SAAAR 207
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
C +A T +S + +Y+ D +D S+ +TP L+LV
Sbjct: 208 CIQALTNSQVES-ELRVYITGDGSD---------VYEDT--FMSIAKPNSTKFTPTLILV 255
Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
LGL+K+ P Y L+ + PQS+GI GG+P +S Y +GVQE YLDPH +P +
Sbjct: 256 GTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYFIGVQESDFFYLDPHQTRPAL 315
Query: 333 NIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+D D + H+ +R +H+ +DPS+ I F RD++D+ D+
Sbjct: 316 PFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDPSMLIAFLIRDENDWKDW 364
>gi|37360148|dbj|BAC98052.1| mKIAA0943 protein [Mus musculus]
gi|148707989|gb|EDL39936.1| autophagy-related 4B (yeast), isoform CRA_d [Mus musculus]
Length = 266
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 60/169 (35%), Positives = 95/169 (56%), Gaps = 6/169 (3%)
Query: 250 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 71 DSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 130
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
GKP ++ Y +G E IYLDPH QP + + D S + + + +DPS+
Sbjct: 131 GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPSRMGIGELDPSI 190
Query: 364 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
A+GF+C+ ++DF+D+C + KL++ P+F + + + DVL
Sbjct: 191 AVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 239
>gi|189515077|ref|XP_001333093.2| PREDICTED: cysteine protease ATG4D-like [Danio rerio]
Length = 485
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 79/255 (30%), Positives = 123/255 (48%), Gaps = 27/255 (10%)
Query: 150 PWRKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
P R P P D + +++ FGD ++PF +H L++ GK G AG W GP +
Sbjct: 210 PARCPSASPDPQVDALHRKVVSCFGDHPSAPFGVHQLVELGKESGKRAGDWYGPSVVAHM 269
Query: 207 W-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
+A+AR E +A+YV V +D C G W
Sbjct: 270 LRKAVARAAEFED--------LAVYVAQDC---------TVYKEDVMSLCESSGVG---W 309
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
+++LVP+ LG E +NP YI ++ +GI+GGKP S + VG Q+E +YLDP
Sbjct: 310 KSVVILVPVRLGGESLNPSYIECVKNILKLKCCIGIIGGKPKHSLFFVGFQDEQLLYLDP 369
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK- 384
H QPV+++ + + + ++H + R ++ +DPS IG Y R K DF+ C S+
Sbjct: 370 HYCQPVVDVTQANFSLE--SFHCNSPRKMNFSRMDPSCTIGLYARSKTDFESLCTAVSEA 427
Query: 385 LAEESNGAPLFTVTQ 399
L+ P+FT +
Sbjct: 428 LSSSKEKYPIFTFVE 442
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 40/63 (63%), Gaps = 1/63 (1%)
Query: 91 NNGLAE-FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
N G E F Q F S + ++YR+ F + S +T+D GWGCMLRS QM++AQ LL H +
Sbjct: 92 NEGEVERFRQTFVSCVWLTYRREFPQLDGSSLTTDCGWGCMLRSGQMMLAQGLLLHLMPT 151
Query: 150 PWR 152
WR
Sbjct: 152 DWR 154
>gi|322707969|gb|EFY99546.1| ATG4 protein [Metarhizium anisopliae ARSEF 23]
Length = 430
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 153/355 (43%), Gaps = 73/355 (20%)
Query: 95 AEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVGWGCML 131
A F DF+SR ++YR F DP + S TSD GWGCM+
Sbjct: 121 AAFLDDFASRFWMTYRSNFEIIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSGWGCMI 180
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 190
RS Q L+A A+ LGR WR+ + DRE +L LF D +P+SIHN ++ G+ Y
Sbjct: 181 RSGQSLLANAMAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSIHNFVRHGEKYC 237
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G W GP A R + L ++ E + IY G P + D+
Sbjct: 238 SKYPGEWFGPSATARCIQDLVNSRKQE---------LRIYST--------GDGPDIYEDN 280
Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
+ + + P L+LV LG++K+ P Y L + QS+GI GG+P +S
Sbjct: 281 FMK---IAKPDGEVFHPTLVLVGTRLGIDKITPVYWEALIASVQMSQSVGIAGGRPSSSH 337
Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEA---DTSTYHSDVIRHIHLDSIDPSLAIGF 367
Y VG Q YLDPH + + D D + H+ +R IH+ +DP+
Sbjct: 338 YFVGSQGHFLFYLDPHHTRKALPYYSDVARYTIDDMDSCHTSRLRRIHVREMDPN----- 392
Query: 368 YCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDS 422
C A+++ + + + + V SD GE GG+P D S
Sbjct: 393 -----------CHPANEIRDATGRSVIDEVELL-------SDEDGEDGGIPHDKS 429
>gi|441628985|ref|XP_004093160.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Nomascus
leucogenys]
Length = 441
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 88/328 (26%), Positives = 150/328 (45%), Gaps = 36/328 (10%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW--GCMLRSSQML-VAQALLFHR 146
G F DF SR+ ++YR + I D W G L ++ A +H
Sbjct: 98 GEGEHTAFPADFVSRLWLTYRXXXHCLTMCSIPPDWTWAEGTGLGPPELSGSASPSRYHG 157
Query: 147 LGRPWRKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
R W P L++ +R + +I+ F D +PF +H L++ G++ G AG W
Sbjct: 158 PAR-WMPPRWAQGAPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWY 214
Query: 199 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
GP +A R + + +YV + A +V D +
Sbjct: 215 GP-------SLVAHILRKAVESCSEVTRLVVYVSQTCSMYKADVARLVARPDPT------ 261
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++
Sbjct: 262 ----AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCQLCLGIMGGKPRHSLYFIGYQDD 317
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+YLDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+
Sbjct: 318 FLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 375
Query: 379 CARASKLAEESNGA---PLFTVTQTHKK 403
C+ +++ S+ P+FT+ + H +
Sbjct: 376 CSELTRVLSSSSAMERYPMFTLAEGHAQ 403
>gi|256071263|ref|XP_002571960.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
gi|353229491|emb|CCD75662.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
Length = 302
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 97/309 (31%), Positives = 148/309 (47%), Gaps = 42/309 (13%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAG 187
M R QML+AQAL+ H LGR WR + ++I+ F DS + SP S+H L+Q
Sbjct: 1 MFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLSLHRLVQMS 60
Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGDE--DGER 240
G W GP ++C A+ R + L + + +Y V+ +E D R
Sbjct: 61 DR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYREEIIDLAR 114
Query: 241 G------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLGL-EKVNPR 284
G P + D H +++ + Q+D T ILLL+PL+ G ++NPR
Sbjct: 115 GLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFGKGNRINPR 170
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
YI + F+ P +G++GG+ S+Y VG Q S IYLDPH QP N+ D
Sbjct: 171 YIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNSPKFSVD-- 228
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
++H + + + +++PS A+GFYCR + + D R L S+ T +P
Sbjct: 229 SWHCPIPKTMSAANLNPSCAVGFYCRTRGELSDLIDRLPILMSVSDNLQ----ASTRSRP 284
Query: 405 VNHS-DVLG 412
V + +VLG
Sbjct: 285 VAFTVEVLG 293
>gi|328722655|ref|XP_003247627.1| PREDICTED: cysteine protease ATG4B-like [Acyrthosiphon pisum]
Length = 252
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/233 (30%), Positives = 118/233 (50%), Gaps = 32/233 (13%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I + +W+LG + D L + D SR+ +YRKGF IG++ T
Sbjct: 40 IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
SD GWGCMLR QM++ QAL+F LGR WR K D +Y++IL +F D ++P+SIH
Sbjct: 89 SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147
Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
+ G ++G G W GP + + + LA L ++ V+ D
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLA---------TMDELSSLVFHVALDN------ 192
Query: 243 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLT 292
+ I++ + C+V + + W P++L++PL LG+ +NP Y+ ++++
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKVS 243
>gi|16551551|dbj|BAB71121.1| unnamed protein product [Homo sapiens]
Length = 330
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 138/313 (44%), Gaps = 60/313 (19%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWR------------------------------------K 153
MLRS QM++AQ LL H L R W
Sbjct: 1 MLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP +A
Sbjct: 61 ELER--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHI 111
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
R + +YV + A +V D + A+W +++LVP
Sbjct: 112 LRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVP 161
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP ++
Sbjct: 162 VRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVD 221
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA- 392
+ + D + ++H R + DPS +GFY D+ +F C+ +++ S+
Sbjct: 222 VSQADFPLE--SFHCTSPRKMAFAKTDPSCTVGFYAGDRKEFGTLCSELTRVLSSSSATE 279
Query: 393 --PLFTVTQTHKK 403
P+FT+ + H +
Sbjct: 280 RYPMFTLAEGHAQ 292
>gi|392586633|gb|EIW75969.1| hypothetical protein CONPUDRAFT_111807 [Coniophora puteana
RWD-64-598 SS2]
Length = 1038
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 102/318 (32%), Positives = 139/318 (43%), Gaps = 74/318 (23%)
Query: 80 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------------------ 121
+Q A G + EF DF+SRI ++YR F PI DS +
Sbjct: 271 SQSPASEKHPGQDWAPEFYADFTSRIWLTYRNQFAPIRDSTLSTLESDQTREPCTEMSSP 330
Query: 122 --------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YV 164
T+D GWGCMLR+ Q L+A ALL LGR WR+P + + YV
Sbjct: 331 SPKSRRWFGGEKGWTTDTGWGCMLRTGQTLLANALLHLHLGRDWRRPPYPLYTEDYATYV 390
Query: 165 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+I+ F DS +PFS+H + AGK G G W GP + + L + + GLG
Sbjct: 391 QIITWFLDSPLPQAPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKRLVQA-FPDAGLGV 449
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------W--TPIL 269
V D A V+S D W +L
Sbjct: 450 ----------------------AVASDGALYQTDVYSASYVDVGSPRNVRKLRWGGRAVL 487
Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
+L + LG+ VNP Y T++ F PQS+GI GG+P +S Y +GVQ ++ IYLDPH +
Sbjct: 488 VLFGIRLGINGVNPIYYDTIKGLFEIPQSVGIAGGRPSSSYYFMGVQGDNLIYLDPHHAR 547
Query: 330 PVINIGKDDLEADTSTYH 347
P I + + EAD H
Sbjct: 548 PAIPL-RPLPEADEGNQH 564
Score = 45.1 bits (105), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 38/69 (55%), Gaps = 5/69 (7%)
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
A+ T+H D +R + L +DPS+ +GF C+D++D+ DF R + L + T+
Sbjct: 716 AELKTFHCDRVRKMPLSGLDPSMLLGFLCQDEEDWIDFRHRITDLMHRNK-----TIFAI 770
Query: 401 HKKPVNHSD 409
+P N S+
Sbjct: 771 QDEPPNWSE 779
>gi|90080692|dbj|BAE89827.1| unnamed protein product [Macaca fascicularis]
Length = 263
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 93/169 (55%), Gaps = 6/169 (3%)
Query: 250 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 68 DSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 127
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
GKP ++ Y VG E IYLDPH QP + D S + + + +DPS+
Sbjct: 128 GKPNSAHYFVGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFHCQHPPCRMSIAELDPSI 187
Query: 364 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 188 AVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 236
>gi|354544955|emb|CCE41680.1| hypothetical protein CPAR2_802300 [Candida parapsilosis]
Length = 423
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 90/274 (32%), Positives = 136/274 (49%), Gaps = 44/274 (16%)
Query: 118 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 177
+ TSD GWGCM+R+SQ L+A ALL +L + Q ++IL LF D TSP
Sbjct: 138 NDNFTSDAGWGCMIRTSQNLLAIALL--KLSEEHNESAQ-------LDILKLFQDDPTSP 188
Query: 178 FSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSG 234
FS+HN ++ + L G W GP A S + L ++ ET P I V
Sbjct: 189 FSLHNFIRVASSSPLLVKPGQWFGPNAASLSIKKLTIEAKKLET-------PGEIPYVYI 241
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
E+ + DD +F++ Q P+LLL P+ LG+++VN Y ++ +
Sbjct: 242 SENAD-------LFDDEIE--DLFNEEQK---PLLLLFPVRLGIDQVNKYYYKSILQLLS 289
Query: 295 FPQSLGIVGGKPGASTYIVGVQEES-AIYLDPHDVQPV---INIGKDDLEADTSTYHSDV 350
P S+GI GGKP +S Y +G + E+ +Y DPH Q V INI +TYH+
Sbjct: 290 LPYSVGIAGGKPSSSFYFIGYENENHLLYFDPHLPQVVEAPINI---------TTYHTAN 340
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
+ ++ +DPS+ IG + D++ +F S+
Sbjct: 341 YNKLDIEMVDPSMMIGVLLKSMDEYKEFKQDCSE 374
>gi|406606786|emb|CCH41822.1| putative cysteine protease atg4 [Wickerhamomyces ciferrii]
Length = 592
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 107/347 (30%), Positives = 155/347 (44%), Gaps = 62/347 (17%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG------- 117
S DIW H A+D D N EF D +RI ++YR F PI
Sbjct: 75 SGLKDIWQTLRFH-TAEDNEKDDL--NKWPQEFIDDVYTRIWLTYRTKFSPIDRDPEGPS 131
Query: 118 ----------------DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+ T+D GWGCM+R+SQ L+A ALL +GR WR +
Sbjct: 132 PLSLNFFLRGQNYDLDNEHFTTDCGWGCMIRTSQSLLANALLNLHIGRDWR--YTGELNE 189
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGL 220
+ EI+ F D + PFSIH ++ GK G W GP A RS ++L
Sbjct: 190 MHNEIVSWFIDCPSHPFSIHKIVDKGKLLSNKKPGEWFGPSAAARSIQSL---------- 239
Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
C + V G + G+ V + A VF PIL+L+ L LG++
Sbjct: 240 -CNEFDSGVKVYIGSDSGDIYENDVFKV--AKDENGVFK-------PILILLGLRLGIDN 289
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+NP Y +L+ +S+GI GG+P S Y G Q + YLDPH QP + + D L+
Sbjct: 290 INPVYWDSLKAILNSKESIGIAGGRPSTSHYFFGFQGDHLFYLDPHLPQPAL-LHDDQLD 348
Query: 341 A------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
D ++ H+ +R IHL +DPS+ +GF +D++++
Sbjct: 349 TSVSESTEIVSSLDVNSVHTKKLRKIHLSEVDPSMLLGFLIKDENEW 395
>gi|67967551|dbj|BAE00258.1| unnamed protein product [Macaca fascicularis]
Length = 330
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 139/311 (44%), Gaps = 56/311 (18%)
Query: 130 MLRSSQMLVAQALLFHRLGRPW----------------------------------RKPL 155
MLRS QM++AQ LL H L R W +
Sbjct: 1 MLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60
Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 215
+ +R + +I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 61 ELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILR 113
Query: 216 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 275
+ + +YV + A +V D + A+W +++LVP+
Sbjct: 114 KAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVPVR 163
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP +++
Sbjct: 164 LGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVS 223
Query: 336 KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--- 392
+ D + ++H R + +DPS +G Y D+ +F+ C+ +++ S+
Sbjct: 224 QADFPLE--SFHCTSPRKMAFAKMDPSCTVGSYAGDRKEFETLCSELTRVLGSSSATERY 281
Query: 393 PLFTVTQTHKK 403
P+FT+ + H +
Sbjct: 282 PMFTLAEGHAQ 292
>gi|395323681|gb|EJF56143.1| hypothetical protein DICSQDRAFT_113447 [Dichomitus squalens
LYAD-421 SS1]
Length = 999
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 91/278 (32%), Positives = 133/278 (47%), Gaps = 50/278 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 123
F DF+SRI ++YR F PI D+ + TS
Sbjct: 303 FYADFTSRIWLTYRSQFFPIRDTTLAALEQEVHDSPTGLPSSPPSKRWNWPIGGEKGWTS 362
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 178
D GWGCMLR+ Q L+A ALL LGR WR+P + +Y V+I+ F D+ + PF
Sbjct: 363 DAGWGCMLRTGQSLLANALLHLHLGRDWRRPPHPVYTADYAMYVQIVTWFLDTPSPLCPF 422
Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
S+H + GK G G W GP + + L + GLG +A+ S +
Sbjct: 423 SVHRMALVGKDLGKEVGQWFGPSTAAGAIKTLVHS-FPDAGLG-----VAVASDSTLYES 476
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
+ A + RH + +W +L+L+ + LG+E VNP Y T++ +TFP
Sbjct: 477 DVYAASRSSVYSTRRH----GHPRMEWGDRAVLILIGIRLGIEGVNPLYYNTIKTLYTFP 532
Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
Q++GI GG+P +S Y VG Q ++ YLDPH +P I +
Sbjct: 533 QTVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAIPL 570
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 15/46 (32%), Positives = 29/46 (63%)
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
+ + T+H D +R + L +DPS+ +GF C+D+ ++ D R ++L
Sbjct: 699 QTELKTFHCDRVRKMPLSGLDPSMLLGFLCKDEAEWLDLKERITEL 744
>gi|216963264|gb|ACJ73916.1| autophagy-related 4b variant 4 [Zea mays]
Length = 208
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 66/123 (53%), Positives = 87/123 (70%), Gaps = 8/123 (6%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
S ++R V +GSM R+ LG +R S D+W LG C++++ ++E G + ++G
Sbjct: 90 SRILRRFVGSGSMWRL----LGCARVLTSG---DVWFLGKCYRVSPEEEESGGSDSDSGH 142
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202
Query: 155 LQK 157
+K
Sbjct: 203 SEK 205
>gi|444321667|ref|XP_004181489.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
gi|387514534|emb|CCH61970.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
Length = 577
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 145/320 (45%), Gaps = 64/320 (20%)
Query: 95 AEFNQDFSSRILISYRKGFDPI-----GDSKI------------------------TSDV 125
EF +D SR++ +YR F PI G S I T+D+
Sbjct: 127 VEFLEDCKSRLIFTYRTNFSPIERAPDGPSPINVSVLFRDTLFNTVNHVLNNPNSFTTDI 186
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 179
GWGCM+R+ Q L+ AL LGR +R P K E +I+ F D+ PFS
Sbjct: 187 GWGCMIRTGQSLLGNALQIINLGRNFRINNQSNNPNTKNIKEE--DIIEWFYDNPNKPFS 244
Query: 180 IHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
IH + G + G W GP C + ++L + E G+ + V SGD
Sbjct: 245 IHKFVDKGMRISDKKPGEWFGPSTTCTAIQSLIY-EFPECGID----ECILSVSSGD--- 296
Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+ D+ + H F K + T IL+L+ + LG++K+N Y ++ S
Sbjct: 297 -------IYEDEINEH---FQKNEN--TIILILLGVKLGIDKINQCYFNDIKDILNSRYS 344
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
GI GG+P +S Y G E Y DPH +P + + +D + ST +S ++ +
Sbjct: 345 CGISGGRPSSSLYFFGHMNEYLYYFDPH--KPQLQLNEDFKNSCHSTDYSKIL----ISE 398
Query: 359 IDPSLAIGFYCRDKDDFDDF 378
IDPS+ IGFY + K D+D+F
Sbjct: 399 IDPSMLIGFYLKGKKDWDNF 418
>gi|216963270|gb|ACJ73917.1| autophagy-related 4b variant 5 [Zea mays]
Length = 292
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/123 (52%), Positives = 88/123 (71%), Gaps = 8/123 (6%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
S ++R V +GSM R+ LG +R ++ D+W LG C++++ ++E G + ++G
Sbjct: 90 SRILRRFVGSGSMWRL----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGH 142
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202
Query: 155 LQK 157
+K
Sbjct: 203 SEK 205
>gi|216963276|gb|ACJ73918.1| autophagy-related 4b variant 6 [Zea mays]
Length = 271
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/123 (52%), Positives = 88/123 (71%), Gaps = 8/123 (6%)
Query: 36 SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
S ++R V +GSM R+ LG +R ++ D+W LG C++++ ++E G + ++G
Sbjct: 90 SRILRRFVGSGSMWRL----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGH 142
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202
Query: 155 LQK 157
+K
Sbjct: 203 SEK 205
>gi|50307871|ref|XP_453929.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|62899744|sp|Q6CQ60.1|ATG4_KLULA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49643063|emb|CAH01025.1| KLLA0D19536p [Kluyveromyces lactis]
Length = 450
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 164/383 (42%), Gaps = 67/383 (17%)
Query: 26 LASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEAL 85
L+ + LG E V R T + + + + SRT + + S A + +
Sbjct: 4 LSRISQHLGIVEDVDRDGTVFILGKEYAPLNNKSRTDVETDDS-----------ALESLI 52
Query: 86 GDAAGNNGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT------------------ 122
+ N GL D SR+ +YR F PI G S I
Sbjct: 53 NIVSLNPGLL---SDVHSRVFFTYRTQFTPIRRNENGPSPINFTLFFRDNPINTLENALT 109
Query: 123 ------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
SD+GWGCM+R+ Q L+A A+ +L R +R + D E + ++ F D
Sbjct: 110 DPDSFYSDIGWGCMIRTGQALLANAIQRVKLAREFRINASRIDDNE-LNLIRWFQDDVKY 168
Query: 177 PFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
P S+HN ++A K G+ G W GP A RS + L E C I S D
Sbjct: 169 PLSLHNFVKAEEKISGMKPGQWFGPSATARSIKTLI-----EGFPLCGIKNCIISTQSAD 223
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
+ D+ +R +F K + +LLL + LG++K+N Y + +
Sbjct: 224 ----------IYEDEVTR---IFHKDRD--ANLLLLFAVRLGVDKINSLYWKDIFKILSS 268
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
P S+GI GGKP +S Y G Q E+ YLDPH+ Q ++ DDLE S H +H
Sbjct: 269 PYSVGIAGGKPSSSLYFFGYQNENLFYLDPHNTQQS-SLMMDDLEFYRSC-HGHKFNKLH 326
Query: 356 LDSIDPSLAIGFYCRDKDDFDDF 378
+ DPS+ +G K+++D F
Sbjct: 327 ISETDPSMLLGMLISGKNEWDQF 349
>gi|71043632|ref|NP_001020882.1| cysteine protease ATG4B [Rattus norvegicus]
gi|68533688|gb|AAH98833.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Rattus
norvegicus]
Length = 224
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 58/169 (34%), Positives = 93/169 (55%), Gaps = 6/169 (3%)
Query: 250 DASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
++ RHC+ G W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 29 ESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 88
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
GKP ++ Y +G E IYLDPH QP + + D S + + + +DPS+
Sbjct: 89 GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPCRMGIGELDPSI 148
Query: 364 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
A+GF+C+ ++DF+D+C + KL++ P+F + + + DVL
Sbjct: 149 AVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 197
>gi|440789707|gb|ELR11008.1| cysteine protease atg4a, putative, partial [Acanthamoeba
castellanii str. Neff]
Length = 180
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 55/118 (46%), Positives = 80/118 (67%), Gaps = 1/118 (0%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W P+++LVP+ LG++ +NP YIPTL+ F+FPQ LG++GGKP +S Y VG Q+ +Y+D
Sbjct: 11 WHPVIILVPVRLGIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMD 70
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
PH VQP + + D L +Y ++ + + D IDPSLA+GF C + +FDDFC A
Sbjct: 71 PHFVQPTVKMDDDPLFP-IESYRMEIPQAMSFDDIDPSLALGFLCSSQAEFDDFCLNA 127
>gi|344304092|gb|EGW34341.1| hypothetical protein SPAPADRAFT_59751, partial [Spathaspora
passalidarum NRRL Y-27907]
Length = 363
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 84/280 (30%), Positives = 133/280 (47%), Gaps = 43/280 (15%)
Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
F+ R+ + R FD SDVGWGCM+R+SQ L+A AL+ LQ +
Sbjct: 104 FNKRLFTTVRSLFD---SENFNSDVGWGCMIRTSQSLLANALM----------KLQPSAE 150
Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 218
E +++LF D+ S FS+HN ++ L G W GP A S + L + +T
Sbjct: 151 HE---VINLFQDNIASAFSLHNFIRVASESPLEVKPGQWFGPNAASLSTKKLLDGMKGKT 207
Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
G + + I S D E I++ SV L+L P+ LG+
Sbjct: 208 IQGVKYPHVFISENSDLYDEE--------IEELLVESSV-----------LILFPVRLGI 248
Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
+ VN Y ++ P ++GI GGKP +S Y +G Q++ +Y DPH Q N
Sbjct: 249 DNVNSYYYDSIFQLLACPFTVGISGGKPSSSFYFLGYQDQDLLYFDPHSPQLYEN----- 303
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+ +TYH++ + +H+ +DPS+ +G +DK ++ +F
Sbjct: 304 -PINYTTYHTNNYQRLHIHMLDPSMMVGILVKDKSEYKEF 342
>gi|299738612|ref|XP_001834660.2| cysteine protease [Coprinopsis cinerea okayama7#130]
gi|298403389|gb|EAU87108.2| cysteine protease [Coprinopsis cinerea okayama7#130]
Length = 1034
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/272 (34%), Positives = 129/272 (47%), Gaps = 50/272 (18%)
Query: 97 FNQDFSSRILISYRKGF-DPIGDSKI-------------------------------TSD 124
F DF+SRI ++YR F PI D ++ +SD
Sbjct: 302 FYIDFTSRIWLTYRSHFPQPIKDGRLADLCGGPQPEPVASPVTKKSPWHWVGGEKSWSSD 361
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFS 179
GWGCMLR+ Q L+A AL+ LGR WRKP +Y V IL F D+ +PFS
Sbjct: 362 SGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVMTADYATYVHILTWFLDTPAPEAPFS 421
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
+H + AGK G G W GP + +AL E G+G +A+ V DG
Sbjct: 422 VHRMALAGKELGTDVGQWFGPSVAAGAIKALVNS-FPEAGIG-----VAVAV-----DGV 470
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
V + + W P+LLL+ + LG+E VNP Y T+++ +TFPQ
Sbjct: 471 LYQTDVHAASHGDHFGRTPRRHKRSWGDRPVLLLLGIRLGIEGVNPIYYDTIKMLYTFPQ 530
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
S+GI GG+P +S Y VG Q ++ YLDPH +
Sbjct: 531 SVGIAGGRPSSSYYFVGSQADNLFYLDPHHAR 562
Score = 39.7 bits (91), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 28/45 (62%)
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
A+ T+H + +R + L +DPS+ +GF CRD+ ++ D R + L
Sbjct: 711 AELKTFHCERVRKMPLSGLDPSMLLGFLCRDEAEWVDLRKRVAGL 755
>gi|149020505|gb|EDL78310.1| rCG31864, isoform CRA_c [Rattus norvegicus]
Length = 337
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 121/247 (48%), Gaps = 22/247 (8%)
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
DR + I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 72 DRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-------SVVAHILRKAVE 124
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
C + + VS D + D +R S + A+W +++LVP+ LG E
Sbjct: 125 -SCSEVTRLVVYVSQDCTVYKA--------DVARLVS-WPDPTAEWKSVVILVPVRLGGE 174
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
+NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP +++ + +
Sbjct: 175 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVNQANF 234
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFT 396
+ ++H R + +DPS +GFY ++ +F+ C+ ++ S+ P+FT
Sbjct: 235 PLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMRILSSSSVTERYPMFT 292
Query: 397 VTQTHKK 403
V + H +
Sbjct: 293 VAEGHAQ 299
>gi|149022064|gb|EDL78958.1| rCG26842 [Rattus norvegicus]
Length = 246
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 76/246 (30%), Positives = 112/246 (45%), Gaps = 53/246 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 250 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 284
D + C V G AD W P+LL+VPL LG+ ++NP
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240
Query: 285 YIPTLR 290
YI +
Sbjct: 241 YIEAFK 246
>gi|403413274|emb|CCL99974.1| predicted protein [Fibroporia radiculosa]
Length = 994
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/277 (32%), Positives = 129/277 (46%), Gaps = 49/277 (17%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 124
F DF+SRI ++YR F+PI D+ + TSD
Sbjct: 309 FYSDFTSRIWLTYRSQFEPIRDTSLSALNYDMDERAAPTSSPQPKRWNWGLGGEKGWTSD 368
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGD--SETSPFS 179
GWGCMLR+ Q L+A ALL LGR WR+P + + YV+I+ F D S PFS
Sbjct: 369 SGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPIYTADFATYVQIISWFLDDPSPLCPFS 428
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
+H + GK G G W GP + + L E GLG +A+ V D
Sbjct: 429 VHRMALVGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVS---VAVDGVIYQSDVY 484
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
+ + +H G+ W +L+L+ + LG++ VNP Y ++ +T PQ
Sbjct: 485 AVSRSTMGLGSPRKH------GRPSWGDRAVLVLIGIRLGIDGVNPIYYDLIKALYTLPQ 538
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
+LGI GG+P +S Y VG Q + YLDPH +P I +
Sbjct: 539 TLGIAGGRPSSSYYFVGSQANNLFYLDPHHARPTIPL 575
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 52/100 (52%), Gaps = 16/100 (16%)
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
T+H D +R + L +DPS+ IGF C+D++D+ D R ++L THK+
Sbjct: 711 TFHCDRVRKMPLSGLDPSMLIGFLCKDENDWIDLRRRLTELF------------NTHKRH 758
Query: 405 VNHSDVLGETGGVPED--DSLGVMSMNDAVGNAHEDDWQL 442
+ + E P D D++G+ S+++ + E+D +L
Sbjct: 759 I--FSIQDEPPNWPSDSEDNIGLESISEPDIDLPEEDDEL 796
>gi|150864470|ref|XP_001383296.2| hypothetical protein PICST_30446 [Scheffersomyces stipitis CBS
6054]
gi|166990661|sp|A3LQU0.2|ATG4_PICST RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|149385726|gb|ABN65267.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 514
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 95/323 (29%), Positives = 153/323 (47%), Gaps = 43/323 (13%)
Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
FS +L + + + I T+DVGWGCM+R+SQ L+A F RL L K D
Sbjct: 138 FSKSLLYNLQNFNNFIEKENFTTDVGWGCMIRTSQSLLANT--FVRL-------LDKQSD 188
Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 218
I+ LF D+ +PFS+HN ++ + L G W GP A S + L C
Sbjct: 189 -----IIALFNDTYLAPFSLHNFIRVASSSPLKVKPGEWFGPNAASLSIKRL--CDGYYD 241
Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
+++ I V+ + ++ ++ +KG +L+L+P+ LG+
Sbjct: 242 NSTSETILPRINVLISESTDLYDSQIAQLLEPSTE-----TKG------LLVLLPVRLGI 290
Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
+ +N Y +L + QS+GI GGKP +S Y G Q+ S IY+DPH Q I D
Sbjct: 291 DSINSYYFSSLLHLLSLEQSVGIAGGKPSSSFYFFGYQDNSLIYMDPHSAQ----IFSSD 346
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF---CARASKLAEESNGAPLF 395
+ D STY++ + + + +DPS+ IG + RD +++F C A+ +
Sbjct: 347 I--DMSTYYATRYQRVDIGKLDPSMLIGVFIRDLTSYENFKKSCLDAANKIVHFHATERS 404
Query: 396 TVTQTHKK-----PVNHSDVLGE 413
TV ++ +K +N SD+ E
Sbjct: 405 TVPESRRKNSEFVNINRSDLKDE 427
>gi|219129924|ref|XP_002185127.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403306|gb|EEC43259.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 557
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/346 (28%), Positives = 141/346 (40%), Gaps = 48/346 (13%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL---- 155
D S +YR F I ITSD GWGCMLRS+QM++ QAL H R WR P
Sbjct: 171 DERSLFWFTYRCDFPEIAPYNITSDAGWGCMLRSAQMMLGQALRLHFKSRDWRPPQLLAR 230
Query: 156 --QKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 212
Q F R + + S S +S+HN++ AG Y G W GP C L
Sbjct: 231 RRQDSFIRSVLTWFADYPSSSESVYSLHNMVAAGLSKYDKLPGEWYGPGTACYVMRDLVH 290
Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
+ LG L I+ V G + + K +
Sbjct: 291 IHEKQQALGKTRLDRRIFRVYVAPQGTVYRDTIHAFMTTEARVRIEEKKKVKEQTQPQAH 350
Query: 273 PLVLGLEK---------------------------VNPRYIPTLRLTFTFPQSLGIVGGK 305
PL L E+ +N Y+ +L TF+ PQS+G++GG+
Sbjct: 351 PLDLEWEEELMESANTVEWDTALLLLVPLRLGLTSLNEEYVQSLAHTFSLPQSVGVLGGR 410
Query: 306 PGASTYIVGVQEE-SAIY-LDPHDVQ--PVINIGKDDLEADTSTYHS-DVIRHIHLD--- 357
P + + G Q++ S I+ LDPH VQ P + + +A + S D +R H
Sbjct: 411 PRGARWFYGAQKDGSKIFGLDPHTVQTAPGRQTARVNGQASSVVELSDDYLRSCHTTCPE 470
Query: 358 -----SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP-LFTV 397
+DPS+A+GFYCR + D + +E + P LF+V
Sbjct: 471 MFPFCKMDPSIALGFYCRTRADLNHVLNSMGAWQKEHSSIPELFSV 516
>gi|241958330|ref|XP_002421884.1| cysteine protease, putative [Candida dubliniensis CD36]
gi|223645229|emb|CAX39828.1| cysteine protease, putative [Candida dubliniensis CD36]
Length = 443
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 94/330 (28%), Positives = 143/330 (43%), Gaps = 73/330 (22%)
Query: 85 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------- 119
LG N+ A N S++ +SYR GF+PI S
Sbjct: 69 LGQIFDNSNAA--NNYIESKLWLSYRCGFEPIPKSIDGPQPIHFFPSIIFNRTTIYSNFA 126
Query: 120 ---------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
TSD GWGCM+R+SQ L+A LL K + + EI+ LF
Sbjct: 127 NLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEQEIVKLF 173
Query: 171 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
D SPFSIHN ++ + L G W GP A S + L + + G P
Sbjct: 174 QDDTKSPFSIHNFIRVASSSPLHVKPGEWFGPNAASLSIKRLTNELQDQEINGIN--PPR 231
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
+++ + DD R VF+K +++ +++L P+ LG++KVN Y +
Sbjct: 232 VFISENSD----------LFDDEIR--DVFAKEKSN--SVIILFPIRLGIDKVNSYYYNS 277
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
+ + S GI GGKP +S Y +G ++ IY DPH Q V + + +YHS
Sbjct: 278 IFHLLSSKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQIV------ETPFNMDSYHS 331
Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+++ +DPS+ IG + D++ DF
Sbjct: 332 TNYNTLNISLLDPSMMIGILVTNIDEYIDF 361
>gi|148693225|gb|EDL25172.1| autophagy-related 4D (yeast), isoform CRA_a [Mus musculus]
Length = 296
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 120/247 (48%), Gaps = 22/247 (8%)
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
DR + I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 31 DRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP-------SVVAHILRKAVE 83
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
C + + VS D + D +R S + A+W +++LVP+ LG E
Sbjct: 84 -SCSEVSRLVVYVSQDCTVYKA--------DVARLLS-WPDPTAEWKSVVILVPVRLGGE 133
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
+NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP +++ +
Sbjct: 134 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQPSF 193
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFT 396
+ ++H R + +DPS +GFY ++ +F+ C+ ++ S+ P+FT
Sbjct: 194 PLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMRILSSSSVTERYPMFT 251
Query: 397 VTQTHKK 403
V + H +
Sbjct: 252 VAEGHAQ 258
>gi|68485607|ref|XP_713286.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
gi|46434768|gb|EAK94169.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
Length = 446
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 98/353 (27%), Positives = 148/353 (41%), Gaps = 76/353 (21%)
Query: 98 NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 123
N S++ +SYR GF+PI S TS
Sbjct: 80 NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCM+R+SQ L+A LL K + + EI+ LF D +SPFSIHN
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDGTSSPFSIHNF 186
Query: 184 LQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
++ L G W GP A S + L + L +P +S + D
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLTNELLQDQELDGIRIPRVF--ISENSD---- 240
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
DD R VF+K ++ +L+L P+ LG++KVN Y ++ S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKS--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
GGKP +S Y +G ++ IY DPH Q V + + +YH+ +++ +DP
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYHTTNYNRLNISLLDP 345
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGET 414
S+ IG + D++ DF S + +N F H PV ++ ++
Sbjct: 346 SMMIGILVTNIDEYIDF---KSSCIDNNNKIVHF---HPHTLPVQQDSIINQS 392
>gi|392574855|gb|EIW67990.1| hypothetical protein TREMEDRAFT_63874 [Tremella mesenterica DSM
1558]
Length = 1159
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 82/248 (33%), Positives = 112/248 (45%), Gaps = 51/248 (20%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------------REYVEIL 167
+T+D GWGCMLR+ Q L+A AL+ LGR WR P Q YV IL
Sbjct: 580 LTTDAGWGCMLRTGQSLLANALIHLHLGRDWRVPSQPQVPPTSAAHLAELEAYSSYVRIL 639
Query: 168 HLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
F D S PFS+H + GK G G W GP + + L S
Sbjct: 640 SWFLDDPSPLCPFSVHRIALIGKELGKEVGEWFGPSTAAGALKTL-----------VNSF 688
Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------------W--T 266
P + V+ D +V D ++ S G +D W
Sbjct: 689 PPSGMAVATAVDS------IVYKSDVYSASNLQSTGWSDESAPPRRQSSSSRSSTSWGNR 742
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
+L+L+ + LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y VG Q S +YLDPH
Sbjct: 743 AVLVLIGIRLGLDGVNPLYYESIKALFTFPQSVGIAGGRPSSSYYFVGTQANSLVYLDPH 802
Query: 327 DVQPVINI 334
+P + +
Sbjct: 803 FTRPAVPL 810
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 40/60 (66%), Gaps = 5/60 (8%)
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
+A T+H D +R I L +DPS+ +GF C+D+ DF+DFC+R ++L ++ +FT+ +
Sbjct: 962 KAQLGTFHCDKVRKIPLSGLDPSMLLGFVCKDEADFEDFCSRVAQLPQK-----IFTIQE 1016
>gi|156839152|ref|XP_001643270.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
70294]
gi|166990653|sp|A7TQN1.1|ATG4_VANPO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|156113873|gb|EDO15412.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
70294]
Length = 411
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 89/309 (28%), Positives = 142/309 (45%), Gaps = 57/309 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 127
F D SRI +YR F PI S +D+GW
Sbjct: 74 FLSDVISRIHFTYRTKFIPIARSDDGPSPLRINFLIGDNPFNAIENAIYNPNCFNTDIGW 133
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
GCM+R+ Q L+A A+ LGR +R + + +I+ F D+ PFS+HN ++ G
Sbjct: 134 GCMIRTGQSLLANAIQIAILGREFRVN-DGDVNEQERKIISWFMDTPDEPFSLHNFVKKG 192
Query: 188 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
+ G W GP A RS ++L Q + G+ + ++ + DE
Sbjct: 193 CELSSKKPGEWFGPAATSRSIQSLVE-QFPDCGIDRCIVSVSSADIFKDE---------- 241
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
I+D +F + ++ ILLL+ + LG++KVN Y+ +R S+GI GG+P
Sbjct: 242 -IND------IFKNKR--YSNILLLMGVKLGVDKVNEYYLKDIRKILESRYSVGISGGRP 292
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
+S Y G Q+++ +Y DPH QP +E+ T H+D I++ +DPS+ IG
Sbjct: 293 SSSLYFFGYQDDTLLYFDPHKPQPST------IESLLETCHTDNFDKINISDMDPSMLIG 346
Query: 367 FYCRDKDDF 375
+ +DD+
Sbjct: 347 VLLQGEDDW 355
>gi|68485712|ref|XP_713234.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
gi|71152285|sp|Q59UG3.1|ATG4_CANAL RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|46434715|gb|EAK94117.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
Length = 446
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 136/317 (42%), Gaps = 70/317 (22%)
Query: 98 NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 123
N S++ +SYR GF+PI S TS
Sbjct: 80 NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCM+R+SQ L+A LL K + + EI+ LF D +SPFSIHN
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDDTSSPFSIHNF 186
Query: 184 LQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
++ L G W GP A S + LA + + +P +S + D
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLASELLQDQEIDGIKIPRVF--ISENSD---- 240
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
DD R VF+K + +L+L P+ LG++KVN Y ++ S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
GGKP +S Y +G ++ IY DPH Q V + + +YH+ +++ +DP
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYHTTNYNRLNISLLDP 345
Query: 362 SLAIGFYCRDKDDFDDF 378
S+ IG + D++ DF
Sbjct: 346 SMMIGILVTNIDEYIDF 362
>gi|145481079|ref|XP_001426562.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124393637|emb|CAK59164.1| unnamed protein product [Paramecium tetraurelia]
Length = 391
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 148/326 (45%), Gaps = 42/326 (12%)
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
Q +S I +YRK F I +S+ TSD GWGCMLRS QM+ AQ L H R+ Q
Sbjct: 51 QIYSRTIWFTYRKNFPQILNSQQTSDAGWGCMLRSGQMIWAQILRVH-----IRQKKQHS 105
Query: 159 FDREYVEILHLFGDSE---------------TSPFSIHNLLQAGK-AYGLAAGSWVGPYA 202
D +Y ++L F D + SP+SI + + + + W P
Sbjct: 106 KDYQY-KLLCAFSDDDDDEHKKMFTDNFKLCLSPYSIQKIEAISQIKFSMKPCQWYRPDQ 164
Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIY--VVSGDEDGERGGAPVVC-----------ID 249
+ + L + ++ E G + L + I ++ E G + C
Sbjct: 165 ILNALSLLHQQKQLE---GSEDLEITISDSLLYDRLYSEMYGLKMDCEHIVNEIKQDKNK 221
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
+ S+ C++ K I + + GL+++N Y+P L PQ GI+GG+ +
Sbjct: 222 EISKICNICQKKDPKALAIFFITRI--GLDEINKEYLPFLNDLIDLPQFQGIIGGRDDKA 279
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
YI+G + IYLDPH +Q IN G + D T+ +++I+ + + PS+A+GFYC
Sbjct: 280 YYILGRVNKRLIYLDPHYIQEHINRGNVVMLKD--TFFCKDVKYINEEQMSPSIALGFYC 337
Query: 370 RDKDDFDDFCARASKLAEESNGAPLF 395
+++ + D F ++ + + F
Sbjct: 338 QNQSELDKFFNSIEQIKKNYDNEKTF 363
>gi|238879782|gb|EEQ43420.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 446
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/331 (28%), Positives = 140/331 (42%), Gaps = 72/331 (21%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------ 119
LG N A N S++ +SYR GF+PI S
Sbjct: 68 VLGQTFDNFDTA--NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNF 125
Query: 120 ----------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 169
TSD GWGCM+R+SQ L+A LL K + + EI+ L
Sbjct: 126 ANLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKL 172
Query: 170 FGDSETSPFSIHNLLQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
F D +SPFSIHN ++ L +G W GP A S + L + + +P
Sbjct: 173 FQDGTSSPFSIHNFIRVASLSPLHVKSGEWFGPNAASLSIKRLTSELLQDQEIDGIKIPR 232
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
+S + D DD R VF+K + +L+L P+ LG++KVN Y
Sbjct: 233 VF--ISENSD---------LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYN 277
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 347
++ S GI GGKP +S Y +G ++ IY DPH Q V + + +YH
Sbjct: 278 SIFHLLASKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYH 331
Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+ +++ +DPS+ IG + D++ DF
Sbjct: 332 TTNYNRLNISLLDPSMMIGILVTNIDEYIDF 362
>gi|268536436|ref|XP_002633353.1| Hypothetical protein CBG06097 [Caenorhabditis briggsae]
Length = 411
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 88/285 (30%), Positives = 135/285 (47%), Gaps = 54/285 (18%)
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP-----------FDREYVE---IL 167
T+D GWGCM+R++QM+VAQA++ +R GR WR +K FD E ++ IL
Sbjct: 88 TTDCGWGCMIRTTQMMVAQAIMINRFGRNWRFVRRKKSHVTVNGEETEFDTEKMKEWMIL 147
Query: 168 HLFGDSETSPFSIHNLLQ-AGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
LF D ++P IH +++ A + G A G W P EA+ ++A T
Sbjct: 148 KLFEDKPSAPLGIHKMIEIAAREKGKRAVGCWYSPS------EAVFIMKKAITESASPLT 201
Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPR 284
+ +S D G + ++ ++H WT L+LV +V LG ++N
Sbjct: 202 GDTVMYLSID-----GRVHIRDLEVETKH----------WTKTLMLVIVVRLGAAELNRI 246
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
Y+P L F+ LGI GG+P S + VG + IYLDPH I I D++ +TS
Sbjct: 247 YVPHLMRLFSMDSCLGITGGRPDHSCWFVGYYGDQVIYLDPHVAHEYIPI---DMDFNTS 303
Query: 345 -------------TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 376
+YH ++ +H +DPS A+ F ++ FD
Sbjct: 304 QEDPKKPKKCPERSYHCRLLSKMHFLDMDPSCALCFRFESREQFD 348
>gi|344229797|gb|EGV61682.1| hypothetical protein CANTEDRAFT_115142 [Candida tenuis ATCC 10573]
Length = 408
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 128/268 (47%), Gaps = 37/268 (13%)
Query: 116 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 175
I + T+DVGWGCM+R+SQ L+A +++ + + +E +++L F DSE
Sbjct: 123 IDNENFTTDVGWGCMIRTSQSLLANT---------YKRMISEDAQQE-IQLLDQFKDSEA 172
Query: 176 SPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS 233
+PFS+HN ++ L G W GP A S + L ++ G LP ++S
Sbjct: 173 APFSLHNFIRVANESPLQVKPGQWFGPNAASLSIQRLCNLVNSKENFG---LPGLSVLIS 229
Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
+ D DD + + K Q+ +L+L+P+ LG++K N Y ++
Sbjct: 230 ENSD---------LYDDKVQEF-LDKKKQS----LLILLPIRLGIDKTNEFYYSSILQLL 275
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
QS+GI GGKP +S Y G + +YLDPH Q A ++YH+ +
Sbjct: 276 NCKQSVGIAGGKPSSSFYFFGYDNDELLYLDPHYPQ--------GTNAGYNSYHTPRYQR 327
Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
+ + +DPS+ IG D D++ F A
Sbjct: 328 LTISQLDPSMMIGILVDDLQDYNTFKAE 355
>gi|429850312|gb|ELA25600.1| cysteine protease atg4 [Colletotrichum gloeosporioides Nara gc5]
Length = 411
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 87/310 (28%), Positives = 123/310 (39%), Gaps = 83/310 (26%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 131
A F DF S+ ++YR F+ I S +SD GWGCM+
Sbjct: 109 AAFLDDFESKFWMTYRSEFELIAKSTDPRASSALSLSMRIKSQLVDQSGFSSDSGWGCMI 168
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYG 191
RS QML+A A+ LGR A G
Sbjct: 169 RSGQMLLANAMAITNLGR--------------------------------------VACG 190
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R ++L Q + + +Y G P V D
Sbjct: 191 KYPGEWFGPSATARCIQSLTNAQEQPS--------LRVYST--------GDGPDVYED-- 232
Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+ + + P L+LV LG++K+ P Y L PQS+GI GG+P AS Y
Sbjct: 233 -KFMKIAKPDGTRFHPTLILVGTRLGIDKITPVYWDALIAALQMPQSVGIAGGRPSASHY 291
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
+G Q YLDPH +P + D +AD T H+ +R +H+ +DPS+ IGF
Sbjct: 292 FIGAQGSFLFYLDPHHTRPALPYHSDPSRYTDADIDTAHTRRLRRLHVREMDPSMLIGFL 351
Query: 369 CRDKDDFDDF 378
+D DD+ ++
Sbjct: 352 IKDDDDWSEW 361
>gi|66810578|ref|XP_638996.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
gi|60467622|gb|EAL65643.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
Length = 551
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 59/161 (36%), Positives = 93/161 (57%), Gaps = 6/161 (3%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W P+L+L+P+ LGL+ +N Y +L F FPQ+LG+VGGKP AS Y + Q+++ YLD
Sbjct: 383 WEPLLILIPMRLGLDGLNSIYHSSLLEIFKFPQNLGVVGGKPRASLYFIAAQDDNLFYLD 442
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH VQ I + ++ + +T+ + H+ +DPSL + F+C+ KDDF+DF R+ K
Sbjct: 443 PHTVQNHIEV-ENGSKFPLNTFFCSTTKRTHVSEVDPSLVVAFFCKTKDDFNDFVERSKK 501
Query: 385 LAEESNGAPLFTVTQTHKKPVNHSDV----LGETGGVPEDD 421
+ + P+F++ + D + ETGG DD
Sbjct: 502 MTSQMEN-PIFSIFDNEPDYDSSRDYEYEEIDETGGETSDD 541
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 69/119 (57%), Gaps = 5/119 (4%)
Query: 94 LAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
+ EF DF++R+L +YR+GF I D+ +D GWGCMLRS QML++ LL + LG W+
Sbjct: 140 IKEFLNDFTTRVLWFTYRQGFPCIDDTMYDNDCGWGCMLRSGQMLLSNVLLHNILGDEWK 199
Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 211
+ + +I+ +F D ++PFSIHN+ G+ G G W P + ++ + L
Sbjct: 200 RSSSAT----HPDIISMFLDKPSAPFSIHNIAMEGQNLGKNIGEWFAPSIISQTIKILV 254
>gi|167393590|ref|XP_001740639.1| cysteine protease atg4 [Entamoeba dispar SAW760]
gi|165895180|gb|EDR22930.1| cysteine protease atg4, putative [Entamoeba dispar SAW760]
Length = 332
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 91/308 (29%), Positives = 139/308 (45%), Gaps = 38/308 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 161
I I+YRK I + T+D GWGCM+RS QM++AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96
Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 220
+++ I++LFGDS S FSIH L+ G+ G W GP + A AE
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP--------SFASDIAAEHIN 148
Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
+ YV + G G + SK + + P ++ VPL LG E
Sbjct: 149 EMRVFRTRGYVA---KLGSIVGPKI----------EELSKDEVGFNPCIIFVPLRLGPES 195
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+ P L+ F PQ +G++GGKPG + Y + +LDPH Q I D++
Sbjct: 196 PENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAI-----DMK 250
Query: 341 ADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 398
D S +Y + ++ IDPS+++ F + +D++ F K E + LF
Sbjct: 251 GDWSYQSYFCKDNKSMNYSKIDPSISLVFLVKHVNDYEHF----KKSFENKTFSKLFIFK 306
Query: 399 QTHKKPVN 406
+K +N
Sbjct: 307 NEIEKKLN 314
>gi|366995231|ref|XP_003677379.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
gi|342303248|emb|CCC71026.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
Length = 495
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 90/321 (28%), Positives = 145/321 (45%), Gaps = 74/321 (23%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 127
F +D +R+ +YR F PI S +D+GW
Sbjct: 75 FLKDVVTRLHFTYRTRFKPIMKSPEGPSPLNFSLVIRENPIDVIENAITNPDCFNTDIGW 134
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGDSETSPFSIHNLLQ 185
GCM+R+ Q L+ L RLGR +R P +++ E I+ F D+ PFS+H +
Sbjct: 135 GCMIRTGQSLLGNTLQIVRLGRDFR---YDPENKDISENRIIEWFIDAPEKPFSLHQFIT 191
Query: 186 AG-KAYGLAAGSWVGPYAMCRSWEALAR----CQRAETGLGCQSLPMAIYVVSGDEDGER 240
G + G G W GP A RS ++L R C AE + V SGD
Sbjct: 192 EGMELSGKNPGEWFGPAATARSIQSLIRKFPDCGIAEC---------LVSVSSGD----- 237
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
+ D+ + VF+ + + +L+L+ + LGL VN Y ++R + S+G
Sbjct: 238 -----IYSDEVKQ---VFADNKKN---LLILLGVKLGLNAVNECYWDSIRHILSSKYSVG 286
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY---HSDVIRHIHLD 357
I GG+P +S Y G + + +Y DPH QP LE + +Y H++ + ++
Sbjct: 287 ISGGRPSSSLYFFGYEGDELLYFDPHSPQP-------SLEENNVSYKSCHTNKYGKLLMN 339
Query: 358 SIDPSLAIGFYCRDKDDFDDF 378
+DPS+ +GF R ++D+++F
Sbjct: 340 DMDPSMLLGFLIRGQEDWENF 360
>gi|58260832|ref|XP_567826.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134117209|ref|XP_772831.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338817600|sp|P0CQ11.1|ATG4_CRYNB RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|338817601|sp|P0CQ10.1|ATG4_CRYNJ RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|50255449|gb|EAL18184.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229907|gb|AAW46309.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1193
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 82/240 (34%), Positives = 110/240 (45%), Gaps = 28/240 (11%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 165
+TSD GWGCMLR+ Q L+ AL+ LGR WR P E Y +
Sbjct: 562 LTSDAGWGCMLRTGQSLLVNALIHIHLGRDWRVPSTPASFSEATTTQEIAALKDYAKYAQ 621
Query: 166 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
+L F D S PFS+H + GK G G W GP + + LA A G+
Sbjct: 622 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 680
Query: 224 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPL 274
+ +I Y S D +P R +K + W +L+LV +
Sbjct: 681 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRRGDNEAK-EEKWGKRAVLILVGV 739
Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y VG Q YLDPH +P I +
Sbjct: 740 RLGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 799
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 5/53 (9%)
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
TYH + I+ + L +DPS+ +GF C+D+DDF+DF R ++L ++ +FTV
Sbjct: 952 TYHCEKIKKMPLSGLDPSMLLGFVCKDEDDFEDFVERVAQLPKK-----IFTV 999
>gi|390594065|gb|EIN03481.1| hypothetical protein PUNSTDRAFT_56214 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1093
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 93/274 (33%), Positives = 127/274 (46%), Gaps = 55/274 (20%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 121
F DF+SR+ ++YR F PI D+ +
Sbjct: 369 FYADFTSRVWVTYRSHFQPIRDTTLSALESDFGEQAQSANTSGNSVVSGSPSSGRRWWGG 428
Query: 122 ----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-LQKPFDR--EYVEILHLFGDSE 174
TSD GWGCMLR+ Q L+A ALL LGR WR+P +P YV++L F DS
Sbjct: 429 EKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPSYPQPTAAYASYVQLLTWFFDSP 488
Query: 175 TS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
+ PFS+H + AGK G G W GP + + L A G G VV
Sbjct: 489 SPLCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVH---AFPGGGLGVAVAVDGVV 545
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
+ +P D+ RH + G +L+L+ + LGL+ VNP Y T++
Sbjct: 546 YETDVFSASHSP-----DSRRHHRTSTWGDRG---VLILIGIRLGLDGVNPIYYDTIKEL 597
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
+T+PQS+GI GG+P +S Y VG Q +S YLDPH
Sbjct: 598 YTWPQSVGIAGGRPSSSYYFVGSQADSLFYLDPH 631
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 37/57 (64%), Gaps = 2/57 (3%)
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
A+ T+H + +R + L +DPS+ IGF CRD++++ D AR + +A++ P+F V
Sbjct: 779 AELRTFHCERVRKMPLSGLDPSMLIGFLCRDEEEWRDLRARIANMAKKFK--PIFAV 833
>gi|281210274|gb|EFA84441.1| autophagy protein 4 [Polysphondylium pallidum PN500]
Length = 734
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 62/163 (38%), Positives = 91/163 (55%), Gaps = 13/163 (7%)
Query: 238 GERGGA---PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
GE G+ P+ C D S C W I++LVP+ LGL+K+N Y ++
Sbjct: 515 GENSGSFKDPLTCSDFFSSSCI-----PQRWKSIIILVPIKLGLDKLNEVYFREIKSMLE 569
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
PQS+G++GGKP S Y VG Q+E IYLDPH V ++ + + +YH V + +
Sbjct: 570 LPQSIGLIGGKPKQSFYFVGYQDEHIIYLDPHFVHDTVSPNDINF---SDSYHHCVPQKM 626
Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
+ +DPS+AIGFYC + DF+DFC R ++ E G P+ +V
Sbjct: 627 LISQLDPSMAIGFYCHTQSDFEDFCVRIKEI--EKRGFPVVSV 667
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 31/47 (65%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 136
N + F DF + + SYRK F PI ++ IT+D+GWGCM+R+ QM
Sbjct: 269 ANQEIDRFIADFKNILWFSYRKDFAPIENTNITTDIGWGCMVRTGQM 315
>gi|45185039|ref|NP_982756.1| ABL191Wp [Ashbya gossypii ATCC 10895]
gi|62899767|sp|Q75E61.1|ATG4_ASHGO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|44980675|gb|AAS50580.1| ABL191Wp [Ashbya gossypii ATCC 10895]
gi|374105958|gb|AEY94868.1| FABL191Wp [Ashbya gossypii FDAG1]
Length = 521
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 137/310 (44%), Gaps = 52/310 (16%)
Query: 96 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 126
EF D +R+ +YR F PI G S ++ +D+G
Sbjct: 115 EFLADVHTRLHFTYRTRFVPIPRHPNGPSPMSISVMLRDNPLNVIENVLNNPDCFQTDIG 174
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+A AL LGR +R + E + I+ F D PFS+H +Q
Sbjct: 175 WGCMIRTGQSLLANALQRACLGRDFRIDDNAANEHE-LRIIKWFEDDPKYPFSLHKFVQE 233
Query: 187 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G + G G W GP A RS +AL A C I SGD
Sbjct: 234 GFSLSGKKPGEWFGPSATSRSIQALVAKFPA-----CGIAHCVISTDSGD---------- 278
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
V +D+ +F + +LLL+ + LG++ VN Y +R + S+GI GG+
Sbjct: 279 VYMDEVE---PLFRADPS--AAVLLLLCVRLGVDVVNEVYWEHIRHILSSEHSVGIAGGR 333
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q+E YLDPH Q + + DL+ S H+ +H+ IDPS+ I
Sbjct: 334 PSSSLYFFGYQDEHLFYLDPHKPQLNLASYQQDLDLFRSV-HTQRFNKVHMSDIDPSMLI 392
Query: 366 GFYCRDKDDF 375
G KDD+
Sbjct: 393 GILLNGKDDW 402
>gi|213403524|ref|XP_002172534.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
gi|212000581|gb|EEB06241.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
Length = 314
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 99/341 (29%), Positives = 140/341 (41%), Gaps = 57/341 (16%)
Query: 48 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 107
M I ER L T S + IW LG H A + A F QD + +
Sbjct: 4 MSHILERYLRMFPTNHEPSGTFIWSLG--HSYATETGKWPEA-------FVQDTYDLLSL 54
Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
+YRK G +SD GWGCM+RS Q ++A L R +P P+ K IL
Sbjct: 55 TYRKCI--AGMECFSSDAGWGCMIRSMQTMLANCL---RRVQP-SLPVHK--------IL 100
Query: 168 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
H F D + S+H + AG + G+W GP + L C + P
Sbjct: 101 HYFADEANAYLSLHQFVDAGHTLCNITPGNWFGPATVSHCAAHL-----------CSTHP 149
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI--LLLVPLVLGLEKVNPR 284
V DG ++ + Q TP LLL L LG++ ++
Sbjct: 150 QVGLNVCVSHDG-----------------AIMYRDQLRNTPYPRLLLFTLRLGIDTIHTS 192
Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
Y L T PQ++GIVGG+P A+ Y Q + YLDPH Q D A S
Sbjct: 193 YYEQLCHVLTIPQAIGIVGGRPRAAHYFYACQSQWFFYLDPHTTQTAHTF---DNPAPNS 249
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
++H +R + ++ +DP + +GF ++ DF R KL
Sbjct: 250 SFHVTTLRRLRINELDPCMVLGFAITSEECQTDFEQRIVKL 290
>gi|328868883|gb|EGG17261.1| autophagy protein 4 [Dictyostelium fasciculatum]
Length = 616
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 84/140 (60%), Gaps = 5/140 (3%)
Query: 261 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
Q++W +++LVP+ LGL+K+N Y ++ P S+G++GGKP S Y VG Q+E
Sbjct: 426 NQSNWKSLIILVPVKLGLDKLNEIYFSGIKAMLQMPSSIGLIGGKPKQSFYFVGFQDEHI 485
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
IYLDPH V I+ + ++YH + + +H IDPS+A GFYC DF+ FC
Sbjct: 486 IYLDPHFVHDTIHPFDSNF---LNSYHDCIPQKMHFSQIDPSMAFGFYCHTYKDFEQFCI 542
Query: 381 RASKLAEESNGAPLFTVTQT 400
R ++ E++G P+ ++ +T
Sbjct: 543 RIKEI--EASGFPILSIGET 560
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 61/100 (61%), Gaps = 7/100 (7%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR---P 150
+ F +DF S + SYRK F I ++ IT+D+GWGCMLR+ QM++A+ALL H P
Sbjct: 194 VERFLEDFKSILWFSYRKDFPSIENTSITTDIGWGCMLRTGQMILARALLKHFYNNENIP 253
Query: 151 WRKPLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGK 188
+ + ++ + +Y +I+ F D S+ + +SIH ++ K
Sbjct: 254 YGEKIKT--NSKYKKIMSWFCDYPSKENFYSIHQIVHKNK 291
>gi|330840629|ref|XP_003292315.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
gi|325077457|gb|EGC31168.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
Length = 465
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 86/134 (64%), Gaps = 5/134 (3%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++++PL LG++++N YI L+ + PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 217 WKSLIIMIPLKLGVDRINTSYIRKLKSILSIPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 276
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH VQ ++ ++ + T+ + + + +IDPSL++GFYC+DK FDD C R SK
Sbjct: 277 PHFVQDTVDPSSNNY---SETFCGCIPQKMSFSNIDPSLSVGFYCKDKSSFDDLCDRLSK 333
Query: 385 LAEESNGAPLFTVT 398
L E++ P+ +++
Sbjct: 334 L--ENDEFPIISIS 345
>gi|405119256|gb|AFR94029.1| peptidase family C54 protein [Cryptococcus neoformans var. grubii
H99]
Length = 1185
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 81/239 (33%), Positives = 110/239 (46%), Gaps = 26/239 (10%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 165
+TSD GWGCMLR+ Q L+ AL+ LGR WR P E Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLVNALIHVHLGRDWRVPSTPASFSEATTNQETAALKDYAKYAQ 619
Query: 166 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
+L F D S PFS+H + GK G G W GP + + LA A G+
Sbjct: 620 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 678
Query: 224 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSK-GQADWTPILLLVPLV 275
+ +I Y S D +P R +K G+ +L+LV +
Sbjct: 679 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRGGDNKAKEGKWGKRAVLILVGIR 738
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y +G Q YLDPH +P I +
Sbjct: 739 LGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFIGSQANHLFYLDPHLTRPAIPL 797
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 5/53 (9%)
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
TYH + I+ + L +DPS+ +GF C+D+DDF+DF R ++L ++ +FTV
Sbjct: 946 TYHCEKIKKMPLSGLDPSMLLGFVCKDEDDFEDFVERVAQLPKK-----IFTV 993
>gi|321263995|ref|XP_003196715.1| hypothetical protein CGB_K2500C [Cryptococcus gattii WM276]
gi|317463192|gb|ADV24928.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1188
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/239 (33%), Positives = 109/239 (45%), Gaps = 26/239 (10%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 165
+TSD GWGCMLR+ Q L+ AL+ LGR WR P E Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLINALIHVHLGRDWRLPSTPATFSEATTSQEIAALKDYAKYAQ 619
Query: 166 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
++ F D S PFS+H + GK G G W GP + + LA A G+
Sbjct: 620 MVSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGTLKTLANS-FAPCGIAVA 678
Query: 224 SLPMAI------YVVSG--DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 275
+ +I Y S +D R RH + +G+ +L+LV +
Sbjct: 679 TATDSIIYRSDVYAASNLPSDDWNRISPTFNPSRKKKRHNAEAKEGKWGERAVLILVGIR 738
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
LGL+ VNP Y +++ FTFPQ+ G GG+P +S Y VG Q YLDPH +P I +
Sbjct: 739 LGLDGVNPIYYDSIKALFTFPQAGGSAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 797
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 34/53 (64%), Gaps = 5/53 (9%)
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
TYH + I+ + L +DPS+ +GF C+ +DDF++F R + L ++ +FTV
Sbjct: 947 TYHCEKIKKMPLSGLDPSMLLGFVCKSEDDFENFVERVALLPKK-----IFTV 994
>gi|254584596|ref|XP_002497866.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
gi|238940759|emb|CAR28933.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
Length = 489
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 94/321 (29%), Positives = 145/321 (45%), Gaps = 53/321 (16%)
Query: 88 AAGNNGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT-------------------- 122
+ +N +F D SR+ +YR F PI G S ++
Sbjct: 69 SKNSNENPDFLSDVRSRLHFTYRTRFMPIPAVPGGPSPLSFHFLIRENPINAIENAINNP 128
Query: 123 ----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
+DVGWGCM+R+ Q L+ AL RLGR +R + E + I+ F D +PF
Sbjct: 129 ACFNTDVGWGCMIRTGQSLLGNALQIARLGRGYR--IGSELKPEEISIIDWFVDIPDAPF 186
Query: 179 SIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
SIHN + G + G W GP A RS ++L R + CQ I V SGD
Sbjct: 187 SIHNFVSKGMELSSKRPGEWFGPAATSRSIQSLIRGFKQCGIDDCQ-----ISVSSGD-- 239
Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
V +D + VF++ + + ILLL+ + LG+ VN Y ++
Sbjct: 240 --------VYEEDVMK---VFNESKD--SRILLLLGVKLGINAVNEFYWNDIKRLLGSKF 286
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
S+GI GG+P +S Y +G Q +YLDPH QP ++ + + + HS + +
Sbjct: 287 SVGIAGGRPSSSLYFIGYQGNELLYLDPHTAQPFLSPSHQE-RSFYDSCHSSNYGKLAIQ 345
Query: 358 SIDPSLAIGFYCRDKDDFDDF 378
+DPS+ IG +++F ++
Sbjct: 346 DLDPSMLIGILISGEEEFKEW 366
>gi|365988214|ref|XP_003670938.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
gi|343769709|emb|CCD25695.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
Length = 427
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 84/253 (33%), Positives = 119/253 (47%), Gaps = 30/253 (11%)
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 180
+D+GWGCM+R+ Q L+ AL LGR WR + EI F D+ PFS+
Sbjct: 55 TDIGWGCMIRTGQSLLGNALQLRNLGRDWRFDDNTDLKMTEKSNEIASWFMDTPEKPFSL 114
Query: 181 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD---- 235
H + G + G G W GP A RS ++L + E G+ I V SGD
Sbjct: 115 HRFISKGMQLSGKKPGEWFGPAATARSIQSLVH-EFPECGID----KCLISVSSGDIYKT 169
Query: 236 --EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
ED G H GQ D T IL+L+ + LG+E +N Y ++R
Sbjct: 170 EVEDVFNEG-----------HTGEARNGQKDKT-ILILLGVKLGIETINRCYWDSIRRIL 217
Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
+ S+GI GG+P +S Y G Q + +Y DPH QP + K+DL +T H+
Sbjct: 218 SSEYSIGIAGGRPSSSLYFFGYQGDELLYFDPHSPQPSYD--KNDLFYETC--HTTNFGK 273
Query: 354 IHLDSIDPSLAIG 366
+ L +DPS+ +G
Sbjct: 274 LSLADMDPSMLLG 286
>gi|159465677|ref|XP_001691049.1| autophagy protein [Chlamydomonas reinhardtii]
gi|158279735|gb|EDP05495.1| autophagy protein [Chlamydomonas reinhardtii]
Length = 484
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 62/167 (37%), Positives = 86/167 (51%), Gaps = 21/167 (12%)
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G++K+NP YIP L+ ++PQS+GIVGG+P AS Y+ GVQ+ S IYLDPH+ Q +
Sbjct: 339 GMDKINPVYIPQLQQVLSWPQSVGIVGGRPSASLYVCGVQDASFIYLDPHEAQLALG--- 395
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 396
TY DV+R + +DPSLAIGF C + +D AR LA + + APL T
Sbjct: 396 --------TYFCDVVRVLPSAQLDPSLAIGFVCTSSAELEDLFARLQALATQHSSAPLMT 447
Query: 397 VTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
+T V G D + G D+W+L+
Sbjct: 448 LTTGSGAAV----------GCGSDADFTDDVLEGGTGQQQLDEWELV 484
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 65/117 (55%), Gaps = 6/117 (5%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
DF SR+ +YRK F +G S +TSDVGWGC LRS QML+A+ R G R L + +
Sbjct: 49 DFRSRMWCTYRKDFPALGPSLLTSDVGWGCTLRSGQMLLAEVRHGWRAGAMMRVALGRDW 108
Query: 160 DR-----EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
R E V ++ D +P SIH + AG G+ G W+GP+ +C+ EAL
Sbjct: 109 QRCSDNLEAVRPVVAALLDCAEAPLSIHRICDAGGPAGIVPGRWLGPWMLCKGLEAL 165
>gi|62899792|sp|Q8NJJ3.1|ATG4_PICPA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4; AltName:
Full=Pexophagy zeocin-resistant mutant protein 8
gi|21585563|gb|AAL25849.1| Paz8 [Komagataella pastoris]
Length = 533
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 117/267 (43%), Gaps = 50/267 (18%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 135
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 194
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 251
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 252 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
+ G +D +TPIL+L+ + LG+EKVN LR + QS+GI G K
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNLYIGDLLRECLSLKQSVGISGRKTSFLA 281
Query: 311 YI-VGVQEESAIYLDPHDVQPVINIGK 336
+ +G Q + YL P + + GK
Sbjct: 282 LLSIGFQGDYLFYLIPTFPKKALTFGK 308
>gi|183230788|ref|XP_001913481.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169802747|gb|EDS89733.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449704540|gb|EMD44766.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 330
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 136/309 (44%), Gaps = 40/309 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 161
I I+YRK I + T+D GWGCM+RS QM +AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCINTERNI 96
Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARCQRAETG 219
+++ I++LFGDS S FSIH L+ G+ G W GP +A + E + + T
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEMRVFRTR 156
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
L I G I D + P ++ VPL LG E
Sbjct: 157 GYVAKLGSII-----------GSKIEELIKDG-----------GGFNPCIIFVPLRLGPE 194
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
+ P L+ F PQ +G++GGKPG + Y + +LDPH Q I D+
Sbjct: 195 SPENEFKPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAI-----DM 249
Query: 340 EADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
+ D S +Y + + +DPS+++ F + +D++ F K E + LFT
Sbjct: 250 KGDWSYQSYFCKDNKSMLYSKMDPSISLVFLVKHANDYEHF----KKSFENKTFSKLFTF 305
Query: 398 TQTHKKPVN 406
+K +N
Sbjct: 306 KDETEKELN 314
>gi|407043540|gb|EKE42005.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 330
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 141/315 (44%), Gaps = 41/315 (13%)
Query: 100 DFSSR-ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---L 155
DF+ I I+YRK I + T+D GWGCM+RS QM +AQ L LG W+ +
Sbjct: 33 DFARHTIWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCI 90
Query: 156 QKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARC 213
+ +++ I++LFGDS S FSIH L+ G+ G W GP +A + E +
Sbjct: 91 NTERNIFHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEM 150
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
+ T RG + S+ + G + P ++ VP
Sbjct: 151 RVFRT---------------------RGYVAKLGSIIGSKIEELIKDG-GGFNPCIIFVP 188
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
L LG E + P L+ F PQ +G++GGKPG + Y + +LDPH Q I
Sbjct: 189 LRLGPESPENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGINLYFLDPHTTQNAI- 247
Query: 334 IGKDDLEADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG 391
D++ D S +Y + + +DPS+++ F + +D++ F K E
Sbjct: 248 ----DMKGDWSYQSYFCKDNKSMLYSKMDPSISLVFLVKHANDYEHF----KKSFENKTF 299
Query: 392 APLFTVTQTHKKPVN 406
+ LFT +K +N
Sbjct: 300 SKLFTFKDETEKELN 314
>gi|385305819|gb|EIF49766.1| cysteine protease atg4 [Dekkera bruxellensis AWRI1499]
Length = 476
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 150/323 (46%), Gaps = 56/323 (17%)
Query: 95 AEFNQDFSSRILISYRKGF-----DPIGDSKI-------------------TSDVGWGCM 130
++F D ++R+ +YR GF DP G S + T+D GWGCM
Sbjct: 91 SDFISDVATRLWFTYRSGFPVIKRDPDGPSPLSLGSLFRGTLDVKNASIGFTTDSGWGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRK-PLQKP---------FDREYVEILHLFGDSETSPFSI 180
+R+SQ L+A ALL +GR WR P + P +++++ +I+ F D +PFSI
Sbjct: 151 IRTSQSLLANALLNLHVGRKWRYIPAENPNGETEYAKKYEKQW-QIITWFADFPWAPFSI 209
Query: 181 HNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
+++ G + G W GP A RS L + ++ C+ + Y+ G+ D
Sbjct: 210 QQIVRYGSEHCNKKPGEWFGPSAASRSIVYLCK----QSYKACK---LNTYLTEGNGD-- 260
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
+D S + + P L+L + LG+ VNP Y L+ + QS+
Sbjct: 261 -------IYEDELLXVSCPEGTENGFRPTLILSGVRLGVXXVNPVYWAFLKKLLSIHQSV 313
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEAD-TSTYHSDVIRHIH 355
GI GG+P +S Y G Q ++ Y+DPH Q + ++ D + ++ H+ IR +
Sbjct: 314 GIAGGRPSSSHYFFGYQGDNLFYMDPHTPQTALLADHVDDADYRXEYVASVHTKRIRKLG 373
Query: 356 LDSIDPSLAIGFYCRDKDDFDDF 378
L +DPS+ IG +D+ +
Sbjct: 374 LCEMDPSMLIGLLVTSLEDYKEL 396
>gi|340508502|gb|EGR34192.1| hypothetical protein IMG5_021070 [Ichthyophthirius multifiliis]
Length = 285
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 127/291 (43%), Gaps = 44/291 (15%)
Query: 101 FSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
F S I I+YR+ F P+ + SD GWGCM+R QM +A+ L K
Sbjct: 2 FESIIWITYRRKFPPLKAPQYEYISDTGWGCMIRVGQMALAEGL--------------KR 47
Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAE 217
F + EI+ LF D + S FSI N+ +AGK + L AG W P +C + L +
Sbjct: 48 FQIKEDEIIDLFQDKKDSLFSIQNICEAGKEEFKLEAGDWFNPIRICYILQILNEKK--- 104
Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 277
G + L I +S D ++ +D S G ++L + LG
Sbjct: 105 ---GFKDLK--IRTISSDR--------ILIFEDLEMEFSSEKNG------LILFLVCKLG 145
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
LEK Y+ F + S+G++GGKP + + VG E+ IYLDPH VQ +
Sbjct: 146 LEKTEENYLKIALKIFDYKNSIGMIGGKPKKALFFVGRIEDQLIYLDPHYVQDF-----N 200
Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
D ++Y + ID S+ + +K++ F +L EE
Sbjct: 201 QNNVDQNSYFCKNYAVLDQKKIDSSIGNVLFFENKEELKMFFQFLDQLKEE 251
>gi|145549650|ref|XP_001460504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428334|emb|CAK93107.1| unnamed protein product [Paramecium tetraurelia]
Length = 402
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/327 (26%), Positives = 139/327 (42%), Gaps = 45/327 (13%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPWRKP--LQKP 158
SS I SYRK S +TSD GWGCM+R +QM +AQ + +H +P + ++
Sbjct: 71 SSIIWFSYRKKIPQFQISSLTSDTGWGCMIRVAQMALAQVIRHYHSFTQPEQLIVLIRHF 130
Query: 159 FDREYVEILHLFGDSETS-------PFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEAL 210
D + E+++ + + PFSI ++ K + G W P + + L
Sbjct: 131 LDDDDDELINFIKQDQKNQVQYYHAPFSIQKIVYHAKVEFKKEPGDWYKPNEILETLNYL 190
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP--- 267
+ + SL M IY+ + DA + + KG +W
Sbjct: 191 FKYSQY-------SLNMQIYI---------NYQCAFILQDAIKQMFNYDKGNQEWLKECI 234
Query: 268 -------------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
I + +P +GL++VN Y+ L + T P GI+GG + YIVG
Sbjct: 235 KNNNQFISQHDKGIAIFLPARIGLQRVNQDYLEVLNILMTLPYFQGIIGGVTNRAFYIVG 294
Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
++ IYLDPH VQ N +DL ++Y I+ IH SIDPS+ + R+ +
Sbjct: 295 RIQDYLIYLDPHFVQNAQNF--EDLSKTQASYTCQNIQLIHNKSIDPSIVVCLCVRNGLE 352
Query: 375 FDDFCARASKLAEESNGAPLFTVTQTH 401
D + + +E ++ T+
Sbjct: 353 LLDLWHSLNHMKQEFQEFFFISILDTN 379
>gi|149246610|ref|XP_001527730.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|166990616|sp|A5DSB4.1|ATG4_LODEL RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|146447684|gb|EDK42072.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 523
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/289 (29%), Positives = 135/289 (46%), Gaps = 43/289 (14%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALL--FHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
TSD GWGCM+R+SQ L+A ALL FH G +P + +++ LF D+ ++PF
Sbjct: 179 FTSDAGWGCMIRTSQNLLANALLRLFHTTGG---QPQNFAVTKTEADVIELFQDTLSAPF 235
Query: 179 SIHNLLQAGKAYGL--AAGSWVGPYA-------MCRSWEALARCQRAETGLGCQS---LP 226
S+HN ++A + L G W GP A + + + + +R+E G S +P
Sbjct: 236 SLHNFIKAANSLSLNIKPGQWFGPSAASLSIKKLVNDYNLIQQERRSERDSGRDSGHKVP 295
Query: 227 M-----------AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG-----QADWTPILL 270
+ D +R P V + S +C ++ + + PIL
Sbjct: 296 TPNLKLHSKSADSDSDSDSDAISKRNSIPYVYV---SENCDLYDDEINAIFELEQRPILF 352
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPHDVQ 329
L P+ LG+E+VN Y ++ S+GI GGKP +S Y +G + E+ IY DPH Q
Sbjct: 353 LFPIRLGIEQVNKYYYSSILQILASKFSVGIAGGKPSSSFYFIGYEGEDDLIYFDPHLPQ 412
Query: 330 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
V + +YH+ + +D +DPS+ IG D++ +F
Sbjct: 413 IV------QTPVNLESYHTSEYSKLKIDQLDPSMMIGILIETIDEYQEF 455
>gi|326430141|gb|EGD75711.1| pyruvate water dikinase [Salpingoeca sp. ATCC 50818]
Length = 1055
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 85/308 (27%), Positives = 142/308 (46%), Gaps = 46/308 (14%)
Query: 105 ILISYRKGFDPI-GDSKITSDVGWGCMLRSSQMLVAQALLFHR--LGRPWRKPLQKPFDR 161
+ ++YRKG+DPI GD+++TSD GWGC RS QML+AQAL+ + R R +P
Sbjct: 603 VWLTYRKGYDPIHGDAQLTSDTGWGCTYRSGQMLLAQALMSNAEPSARMQRLEGVRPSTW 662
Query: 162 EYVE----ILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 215
++ E +L +F DS + FSI ++ + G W+ P
Sbjct: 663 QHEETKRAVLSMFQDSHDPAAFFSIQHMAETSFVVRKKPGQWLSP--------------- 707
Query: 216 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 275
+ + I ++ E G R V ++D G+ W P LL++PL
Sbjct: 708 -------SEVALIIRRLNPPETGMR----VRIVNDTLLSTRRILAGEP-WMPTLLMIPLR 755
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE--SAIYLDPHDVQPVIN 333
GL+ + P +P F +P +G +GGKPG++ Y VG+ + +YLDPH + ++
Sbjct: 756 AGLDTLQPESVPAFVAFFDWPWCVGAIGGKPGSAYYYVGIDHDRRRVLYLDPHTTRSRLD 815
Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG-- 391
+ +A T D ++ + + S+ +G + + D + R + E+ +G
Sbjct: 816 LSN---QAAEKTCVPDKLKSMDMSKSCSSICVGLFLPELRDLTELVQRYKR--EQLSGMW 870
Query: 392 -APLFTVT 398
PLF V
Sbjct: 871 STPLFHVV 878
>gi|440297742|gb|ELP90383.1| cysteine protease atg4, putative [Entamoeba invadens IP1]
Length = 330
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 127/278 (45%), Gaps = 29/278 (10%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDR 161
I ++YRK + + TSD GWGCM+RS QM +AQ+ + +G W + Q ++
Sbjct: 38 IWVTYRKNMKELPGGR-TSDSGWGCMIRSMQMALAQSFVSLVMGNSWKFTKTGFQVERNK 96
Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 220
++ I++LFGD S FSIHNL+ G+ G W GP S+ + T
Sbjct: 97 FHLRCIINLFGDGPGSLFSIHNLISRSTTRGVGDGKWWGP-----SFASEIAADHLNT-- 149
Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
I+V R G V S+ + P ++ VPL LG
Sbjct: 150 --------IHVFRTRGYVARLGRIV------KPDILDISEDNGNILPTIIFVPLRLGPVN 195
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
+ P L+ F PQ +G+VGGKP + + YLDPH Q +++ D
Sbjct: 196 AEEDFRPILKKVFDIPQCVGMVGGKPNLAFFFHTFDGNLLYYLDPHTTQNAVSM---DGG 252
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+Y + ++ + ++DPS+++ F ++KDDF+ F
Sbjct: 253 WSAESYFCNDVKSMKYKNLDPSVSLLFLIKNKDDFNKF 290
>gi|443917360|gb|ELU38094.1| peptidase family c54 domain-containing protein [Rhizoctonia solani
AG-1 IA]
Length = 808
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 92/286 (32%), Positives = 126/286 (44%), Gaps = 69/286 (24%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 121
F +DF+S I ++YR + PI D+ +
Sbjct: 142 FYEDFTSLIWLTYRSHYTPIRDTSLESLAPLGPCDMEMAPAHLVPASPRRWNWPGSADKS 201
Query: 122 -TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDSET-- 175
TSD GWGCMLR+ Q L+A AL+ LGR WR+P F E YV+IL F D+ +
Sbjct: 202 WTSDAGWGCMLRTGQSLLANALIHLHLGRNWRRPHYPMFAEEHAVYVKILTWFFDTPSPL 261
Query: 176 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ---SLPMAIYVV 232
+PF +H + AGKA G G+W GP S + LA CQ SL + V
Sbjct: 262 APFGVHRMALAGKALGKDVGTWFGPSTAAGSIKTLAHAFPE-----CQLSVSLAVDGTVF 316
Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
+ D V + SK G+A +L+LV + LGL+ VNP Y L+
Sbjct: 317 ASDVYAASHMGMVTTSGRSISSRRSASKWGGRA----VLILVNIRLGLDNVNPIYYDALK 372
Query: 291 LTFTFPQSLGIVGGKP--GASTYIVGVQEESAIYLDPHDVQPVINI 334
+ G+P G+S Y VG Q +S YLDPH +P I +
Sbjct: 373 V------------GRPRQGSSYYFVGSQADSLFYLDPHHTRPYIPL 406
Score = 45.1 bits (105), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 30/46 (65%)
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
A+ T+H D +R + + ++DPS+ +GF CRD D+ DF R + ++
Sbjct: 519 AELRTFHCDRVRKMPMSALDPSMLLGFLCRDDADWKDFRTRVADVS 564
>gi|302833489|ref|XP_002948308.1| autophagy protein [Volvox carteri f. nagariensis]
gi|300266528|gb|EFJ50715.1| autophagy protein [Volvox carteri f. nagariensis]
Length = 391
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 85/155 (54%), Gaps = 18/155 (11%)
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G++K+NP Y+P L+ T+PQS+GIVGG+P AS Y+ GVQ+ S ++LDPH+ QP + G
Sbjct: 216 GMDKINPVYLPQLQRILTWPQSVGIVGGRPSASLYLCGVQDSSFLFLDPHEAQPTVRWGI 275
Query: 337 DDLEADT-----------------STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
T +TY D +R + ++DPS+AIGF C D +D
Sbjct: 276 AGDAGHTKEAGNGGSAVVLPASSLATYFCDTVRLMPATALDPSMAIGFLCMGAADLEDLF 335
Query: 380 ARASKLAEESNGAPLFTVTQ-THKKPVNHSDVLGE 413
R LA+E + APL T+T T + V D GE
Sbjct: 336 TRLDALAKEHSLAPLMTLTSGTAQAGVGLEDDFGE 370
>gi|167526339|ref|XP_001747503.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773949|gb|EDQ87583.1| predicted protein [Monosiga brevicollis MX1]
Length = 355
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 142/323 (43%), Gaps = 32/323 (9%)
Query: 102 SSRILISYRKGFDPIGDS-KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
S+ + +YR IGDS + +D GWGC LR QM+V +AL R + K L P +
Sbjct: 52 SAFLWFTYRNSEYAIGDSPRHKTDRGWGCTLRVGQMIVGEALQRCHCPRDYDK-LSYPSE 110
Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 220
+ IL F D S+H + K G AG W P + Q A +
Sbjct: 111 AARMSILKEFEDRPDRVLSVHAMAMQSKFVGKRAGQWHTPTDVAHVLRLAVNEQEA---M 167
Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
G Q ++V +V +DD + +F +A LL VPL LG++
Sbjct: 168 GLQ-----VHVAMD---------SMVVLDDLRK---LFRADRA----TLLFVPLRLGIDI 206
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
V IP ++ F P +LGI+GG+PGA+ Y +G + + + LDPH Q + G D
Sbjct: 207 VQAEMIPAVKRFFHSPSALGIMGGRPGAAHYFIGYMDHNLLLLDPHTTQDPLRAGSQDAL 266
Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV--T 398
+ + LD +DP++ + F D++ F + EE+ G LF++ T
Sbjct: 267 VSCRCSRPML---LDLDKVDPTMCLAFLLTDEESLQRFADDYNASVEET-GVRLFSMLDT 322
Query: 399 QTHKKPVNHSDVLGETGGVPEDD 421
++ V + L E +DD
Sbjct: 323 KSFASSVAVASSLAEEEEFSDDD 345
>gi|255082892|ref|XP_002504432.1| predicted protein [Micromonas sp. RCC299]
gi|226519700|gb|ACO65690.1| predicted protein [Micromonas sp. RCC299]
Length = 196
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 75/122 (61%), Gaps = 11/122 (9%)
Query: 265 WTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
W P+++LVPLVLGL++ VNPRY+P + PQS+GI+GGKP AS Y VG Q+E YL
Sbjct: 75 WAPLVILVPLVLGLDRCVNPRYVPGIVRMLGLPQSVGILGGKPCASLYFVGAQDEELFYL 134
Query: 324 DPHDVQPVINIGK----------DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
DPH VQ + + + + T TYH + H++ +DPS+ +GFYCR +
Sbjct: 135 DPHTVQLAVPLEQIWGCAQTGSPESGPFPTETYHCRSVLHMNARELDPSMVLGFYCRTRA 194
Query: 374 DF 375
DF
Sbjct: 195 DF 196
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/45 (57%), Positives = 31/45 (68%)
Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
F SR+ I+YR+GF IG T+D GWGC LRS QML+A AL H
Sbjct: 1 FHSRVWITYRRGFPQIGGGTYTTDAGWGCTLRSGQMLLANALQSH 45
>gi|223590151|sp|A5DEF7.2|ATG4_PICGU RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|190345638|gb|EDK37561.2| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
6260]
Length = 402
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 94/328 (28%), Positives = 139/328 (42%), Gaps = 89/328 (27%)
Query: 93 GLAEFNQDFSSRILISYRKGFDPIG---------------------------------DS 119
G +E + R +SYR GF+PI +
Sbjct: 75 GDSEVQKQVKKRYWMSYRSGFEPIKKHEDGPSPLSFVQSMIFNKNVGNTFANIHSLVDND 134
Query: 120 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 178
T+DVGWGCM+R+SQ ++A A+ DR E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177
Query: 179 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 235
S+HN ++ L G W GP A S + L + + T ++P+++ V SGD
Sbjct: 178 SLHNFVKVASDSPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
DD Q P+LLL+PL LG++ VN Y +L
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQS GI GGKP +S Y G Q S +YLDPH Q V A +YHS + +
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV--------SAGVGSYHSSSYQKLD 322
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARAS 383
+ +DPS+ G ++ +D+ D R +
Sbjct: 323 ISDMDPSMMAGIVLKNNEDYTDLKRRTT 350
>gi|342186623|emb|CCC96110.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 388
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 127/279 (45%), Gaps = 39/279 (13%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
+ SYR+ F+P+ + TSDVGWGC +R+ QM++A A + +R G D V
Sbjct: 94 LYFSYRRQFEPLRNGA-TSDVGWGCTIRACQMMLAWAFMRYRNGG------SVTMDDNVV 146
Query: 165 EIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
+ L LF D T+PF IH + G +G+ G W GP M + AL R+ G
Sbjct: 147 DSLKEFTQRLFYDVPTAPFGIHAMTNEGVRHGVTCGMWFGPTPMAKVIGALNEAYRSSGG 206
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
G + L + D + G VV S+H ++LL+P+ LG +
Sbjct: 207 EGPEVLVAS--------DRQIGVQDVVVRLQRSQH-------------VVLLIPVKLGPQ 245
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
V+ Y L+ F S+G VGG+ ++ + G Q + I+LDPH VQ +
Sbjct: 246 TVSVTYANALKRFFEMGSSIGAVGGEKNSAYFFFGYQGDKIIHLDPHYVQCALT------ 299
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+++ + R + + + S +GFY D+ D F
Sbjct: 300 SPNSNGTLAGTWRSLPVMQCNTSALLGFYVSSCDELDQF 338
>gi|406698456|gb|EKD01693.1| hypothetical protein A1Q2_04064 [Trichosporon asahii var. asahii
CBS 8904]
Length = 1295
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 116/278 (41%), Gaps = 43/278 (15%)
Query: 82 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 141
D G A N GL+ SR G+ G+ +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559
Query: 142 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 185
L+ LGR WR P QKP YV +L F D S PFS+H
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
GK G G W GP + + LA S P V DG + V
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLA-----------NSFPPCGLSVVSAADGSVFRSEV 668
Query: 246 VCIDD-------ASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
+ ++ S + W +L+++P LGL+ VNP Y ++
Sbjct: 669 YQASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK------ 722
Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
S+GI GG+P +S Y V Q S YLDPH +P + +
Sbjct: 723 -SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 31/49 (63%)
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
E T+H D ++ + L +DPS+ +GF C ++ +F+DFC R S+L +
Sbjct: 931 ETALKTFHCDRVKKLPLSGLDPSMLLGFLCTNEAEFEDFCERVSRLPHK 979
>gi|146420060|ref|XP_001485988.1| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
6260]
Length = 402
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 94/328 (28%), Positives = 139/328 (42%), Gaps = 89/328 (27%)
Query: 93 GLAEFNQDFSSRILISYRKGFDPIG---------------------------------DS 119
G E + R +SYR GF+PI +
Sbjct: 75 GDLEVQKQVKKRYWMSYRLGFEPIKKHEDGPLPLSFVQSMIFNKNVGNTFANIHSLVDND 134
Query: 120 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 178
T+DVGWGCM+R+SQ ++A A+ DR E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177
Query: 179 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 235
S+HN ++ L G W GP A S + L + + T ++P+++ V SGD
Sbjct: 178 SLHNFVKVASDLPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232
Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
DD Q P+LLL+PL LG++ VN Y +L
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
PQS GI GGKP +S Y G Q S +YLDPH Q V A +YHS + + +
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV--------SAGVGSYHSSLYQKLD 322
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARAS 383
+ +DPS+ G ++ +D+ D R +
Sbjct: 323 ISDMDPSMMAGIVLKNNEDYTDLKRRTT 350
>gi|291238482|ref|XP_002739158.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
kowalevskii]
Length = 338
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 83/143 (58%), Gaps = 5/143 (3%)
Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
S+ W +++L+P+ LG E++NP YI ++ FT +GI+GGKP S Y +G QE+
Sbjct: 156 SRSSQLWCSVIILIPVRLGGEELNPVYISCIKSLFTLKHCIGIIGGKPKHSLYFIGFQED 215
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
I+LDPH Q V+++ D ++H R + L +DPS IGFYC+ +DDF +F
Sbjct: 216 KLIHLDPHLCQDVVDMRSRDFPL--QSFHCMSPRKMSLMKMDPSCTIGFYCKTQDDFKEF 273
Query: 379 CARASKLAEESNGA---PLFTVT 398
C+ A ++ + + P+F +
Sbjct: 274 CSYAQEVLDSTKHVGDYPMFIFS 296
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 62/100 (62%), Gaps = 6/100 (6%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDE--ALGDAAGNNGLA---EFNQDFSSRILISYRKGF 113
S+T S T IWLLG C+ D+ +A ++ L F +DF+SR+ ++YR+ F
Sbjct: 42 SQTNFSYHTP-IWLLGECYHHRPDDPNETEQSAEDDCLTPMERFKRDFTSRLWLTYRREF 100
Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+ + +T+D GWGCMLRS QM++AQ+ L H LGR +++
Sbjct: 101 QQLAGTSLTTDCGWGCMLRSGQMMLAQSFLTHFLGRVYKQ 140
>gi|401886473|gb|EJT50506.1| hypothetical protein A1Q1_00204 [Trichosporon asahii var. asahii
CBS 2479]
Length = 1295
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 116/278 (41%), Gaps = 43/278 (15%)
Query: 82 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 141
D G A N GL+ SR G+ G+ +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559
Query: 142 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 185
L+ LGR WR P QKP YV +L F D S PFS+H
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
GK G G W GP + + LA S P V DG + V
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLA-----------NSFPPCGLSVVSAADGSVFRSEV 668
Query: 246 VCIDD-------ASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
+ ++ S + W +L+++P LGL+ VNP Y ++
Sbjct: 669 YQASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK------ 722
Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
S+GI GG+P +S Y V Q S YLDPH +P + +
Sbjct: 723 -SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 31/49 (63%)
Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
E T+H D ++ + L +DPS+ +GF C ++ +F+DFC R S+L +
Sbjct: 931 ETALKTFHCDRVKKLPLSGLDPSMLLGFLCTNEAEFEDFCERVSRLPHK 979
>gi|169622773|ref|XP_001804795.1| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
gi|160704853|gb|EAT78153.2| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
Length = 357
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 110/230 (47%), Gaps = 42/230 (18%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
N + F DF SR+ ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150
Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
+RS Q ++A AL RLGR WR + D+++ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209
Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G G W GP A R + LA R E GL +Y VSGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVY-VSGD------GADV--YE 252
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
D + +V G W P L+LV LG++K+ P Y L++ P L
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKIREMDPSML 300
>gi|336368847|gb|EGN97189.1| cysteine protease required for autophagy [Serpula lacrymans var.
lacrymans S7.3]
Length = 873
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 108/353 (30%), Positives = 149/353 (42%), Gaps = 76/353 (21%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 121
G+N F DF+SRI ++YR F PI DS +
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350
Query: 122 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRP-WRKPLQKPFDRE---YVEILHLFG 171
TSD GWGCMLR+ Q L+A ALL LGR WR+P + YV+I+ F
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRADWRRPPYPVHTTDYATYVQIITWFF 410
Query: 172 D--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI 229
D S SPFS+H + AGK G G W GP + + L E GLG +
Sbjct: 411 DTPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGV 469
Query: 230 YVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 289
S + A I RH V G+A +++L+ + LGL+ VNP Y T+
Sbjct: 470 IFQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTI 520
Query: 290 RLT-----------FTFPQSLGIVGGKPGASTYIV----------GVQEESAIYLDPHDV 328
+++ T P + G P AS I G E + LDP
Sbjct: 521 KVSIRTLRPYRWILMTVPYTSGFNASLP-ASPEISSDMDVRELGWGDSEGAGEALDPMAE 579
Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
V D L T+H D +R + + +DPS+ +GF C+D++D+ DF R
Sbjct: 580 HYVNAYSPDQLR----TFHCDRVRKMPMSGLDPSMLLGFLCKDENDWFDFRRR 628
>gi|258566559|ref|XP_002584024.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237907725|gb|EEP82126.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 377
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/353 (27%), Positives = 136/353 (38%), Gaps = 107/353 (30%)
Query: 95 AEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 131
A F DF SRI I+YR F I SK T+D GWGCM+
Sbjct: 90 AAFLDDFESRIWITYRSNFPAIPKSKDPNAQQALTFSVRLRSQLLDTRGFTTDTGWGCMI 149
Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 190
RS Q L+A ALL +LGR WR+ + + + +L LF D +PFSIH ++ G A
Sbjct: 150 RSGQSLLANALLIQKLGRDWRRGSET---GKEIALLSLFADRPQAPFSIHRFVEHGAAAC 206
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G G W GP A ARC C+ + +YV S D +D
Sbjct: 207 GKHPGEWFGP-------SATARCIDE-----CEHAGLNVYVTSDGSD---------VHED 245
Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
R + G D P L+L+ + LG++ + P Y L+ +PQS+GI G
Sbjct: 246 KFRQIA----GLDDIKPTLILLGVRLGIDSITPVYWDALKAIIQYPQSVGIAG------- 294
Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
+H+ +DPS+ IGF +
Sbjct: 295 ------------------------------------------RLHIKEMDPSMLIGFLIK 312
Query: 371 DKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
+ DD+ D+ R + G P+ V P N G V E ++L
Sbjct: 313 NNDDWHDWKHR----VRSAPGKPIIHVFDG--GPPNFGRHFEREGAVDEVEAL 359
>gi|349580723|dbj|GAA25882.1| K7_Atg4p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCRDKDDF 375
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|323307493|gb|EGA60764.1| Atg4p [Saccharomyces cerevisiae FostersO]
Length = 494
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCRDKDDF 375
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|323335883|gb|EGA77161.1| Atg4p [Saccharomyces cerevisiae Vin13]
Length = 494
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCRDKDDF 375
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|37362688|ref|NP_014176.2| Atg4p [Saccharomyces cerevisiae S288c]
gi|61252248|sp|P53867.2|ATG4_YEAST RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|166990654|sp|A6ZRL7.1|ATG4_YEAS7 RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|1173491|gb|AAA86498.1| ORF494 [Saccharomyces cerevisiae]
gi|151944321|gb|EDN62599.1| cysteine protease [Saccharomyces cerevisiae YJM789]
gi|190409197|gb|EDV12462.1| anchor protein [Saccharomyces cerevisiae RM11-1a]
gi|285814439|tpg|DAA10333.1| TPA: Atg4p [Saccharomyces cerevisiae S288c]
gi|323352870|gb|EGA85172.1| Atg4p [Saccharomyces cerevisiae VL3]
gi|392297128|gb|EIW08229.1| Atg4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCRDKDDF 375
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|365763488|gb|EHN05016.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 494
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCRDKDDF 375
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|323346814|gb|EGA81093.1| Atg4p [Saccharomyces cerevisiae Lalvin QA23]
Length = 494
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCRDKDDF 375
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|1183991|emb|CAA93375.1| N1274 [Saccharomyces cerevisiae]
gi|1302243|emb|CAA96126.1| unnamed protein product [Saccharomyces cerevisiae]
Length = 506
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 369
Query: 366 GFYCRDKDDF 375
G + + D+
Sbjct: 370 GILIKGEKDW 379
>gi|256272398|gb|EEU07381.1| Atg4p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCRDKDDF 375
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|323303340|gb|EGA57136.1| Atg4p [Saccharomyces cerevisiae FostersB]
Length = 494
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGBIYENEVEKV 257
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCRDKDDF 375
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|259149141|emb|CAY82383.1| Atg4p [Saccharomyces cerevisiae EC1118]
Length = 506
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 369
Query: 366 GFYCRDKDDF 375
G + + D+
Sbjct: 370 GILIKGEKDW 379
>gi|145526665|ref|XP_001449138.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416715|emb|CAK81741.1| unnamed protein product [Paramecium tetraurelia]
Length = 406
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 158/373 (42%), Gaps = 60/373 (16%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
N + + QD I I+YR+ F P+ S SD GWGCMLR QM +AQ L H
Sbjct: 57 NKIKQLVQD---TIWITYRRNFPPLYQSNYISDTGWGCMLRVGQMAMAQMLKKHLKNHGD 113
Query: 152 RKPLQKPFDREYVEILHLFGDSETS----------------------PFSIHNL-LQAGK 188
++ D +Y IL F D+++ PFSI + A K
Sbjct: 114 KR------DEDYDNILLAFADNDSQECKEFIEFQNKKEKQKVHNFICPFSIQKIAYLAKK 167
Query: 189 AYGLAAGSWVGPYAM------------CRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
+ L G W P + R+ E L ++ L L ++ + +
Sbjct: 168 EFNLDPGEWYKPNYILFLLEELHNTIPIRASENLKLSVFNDSCLFLDQLMNRMFDIKFET 227
Query: 237 DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
D + +++ + SK + + V +GL++ N +Y+ L P
Sbjct: 228 DKD--------LEEQLEKTQLKSKN-----SLAIFVLTRIGLDEPNQKYLKVLDELMELP 274
Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK--DDLEADTSTYHSDVIRHI 354
GIVGG P + YI+G + IYLDPH VQ N G+ ++ + ++Y I +
Sbjct: 275 YFQGIVGGTPKRAFYILGRINDHYIYLDPHYVQEAENKGQIIENKMFNRTSYSCKYIHLL 334
Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGET 414
+ +D S+ + +Y R+K + F K+ ++S+ +F ++ T + V++S+ L E+
Sbjct: 335 NQKHVDTSMGLSYYIRNKSELLQFWRDMKKIKQKSDDFFIF-LSDTTPEYVDYSNQLEES 393
Query: 415 GGVPEDDSLGVMS 427
DD + +
Sbjct: 394 SNKLNDDDVVFLQ 406
>gi|401624007|gb|EJS42084.1| atg4p [Saccharomyces arboricola H-6]
Length = 494
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 86/310 (27%), Positives = 132/310 (42%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 126
EF D SR+ +YR F PI G S ++ +D+G
Sbjct: 85 EFLLDVRSRVNFTYRTRFIPIPRAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 144
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R +K RE +I+ F D+ +PFSIHN +
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVDNEKSLKRES-KIVTWFNDTPEAPFSIHNFVST 203
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L C + V SG D +
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIYGFPE-----CGITDCVVSVSSG--DIYQNEVEK 256
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ +++ + IL L+ + LG+ VN Y ++ +S+GI GG+
Sbjct: 257 IYVENPD-------------SIILFLLGVKLGINAVNESYRESICGILNSARSVGIAGGR 303
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q +Y DPH QP + E+ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNQFLYFDPHIPQPAVE------ESFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 366 GFYCRDKDDF 375
G + ++D+
Sbjct: 358 GVLIKGEEDW 367
>gi|167521501|ref|XP_001745089.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776703|gb|EDQ90322.1| predicted protein [Monosiga brevicollis MX1]
Length = 392
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 86/310 (27%), Positives = 142/310 (45%), Gaps = 49/310 (15%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
+ D ++RI +YRK F P+ S+ T+DVGWGCMLR QM++A L+ +
Sbjct: 119 QLEDDVATRIWFTYRKDFPPLPSSRRTTDVGWGCMLRCGQMILATTLM----------AV 168
Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
+P + HL +++ N L+AG+ G ++ VG + + ALA+
Sbjct: 169 LQP------RVHHLLK------YTMENHHLKAGRFQGPSS---VGSALLHQVPSALAQLN 213
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 274
+ + + + Y S + I D R +GQA++ PI+L++PL
Sbjct: 214 QFRD----EEVKLRTYFASD----------TLVILDQLRP----EEGQAEFEPIMLVLPL 255
Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
LG+EK+ P+Y L+L P +G +GG + YI G Q LDPH +
Sbjct: 256 RLGIEKIGPQYHARLQLLLRQPWCMGFIGGHDKRAMYIFGYQGHQYFGLDPHRCSAAVAQ 315
Query: 335 GKDDLE----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEES 389
+L ++H+ + I D +DPSLA+ R ++ DD + + +E+
Sbjct: 316 STAELRDRWVEVRDSFHTSKLSGIERDDLDPSLAVFLLARTAEELDDMLSVIGQPTSEDR 375
Query: 390 NGAPLFTVTQ 399
G L +V Q
Sbjct: 376 PGPALVSVVQ 385
>gi|260823874|ref|XP_002606893.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
gi|229292238|gb|EEN62903.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
Length = 384
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 80/139 (57%), Gaps = 6/139 (4%)
Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
+W +++L+P+ LG E +NP Y P ++ FT LG++GG+P S Y VG QE+ I+L
Sbjct: 203 NWCSVIILIPVRLGGESLNPIYEPCIKGLFTMDHCLGVIGGRPKHSLYFVGFQEDKLIHL 262
Query: 324 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 383
DPH Q V+++ D + ++H R + + +DPS IGFYCR +DDF+ FC +
Sbjct: 263 DPHFCQEVVDMTPRDFPLE--SFHCMNPRKMSIARMDPSCTIGFYCRTRDDFNKFCTTVT 320
Query: 384 KLAEESNGA----PLFTVT 398
+ G P+F V+
Sbjct: 321 EEMLRQPGPKADYPMFIVS 339
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 7/91 (7%)
Query: 70 IWLLGVCHKIAQDE------ALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKIT 122
IWL GVC+ +E L D+ E F +DF+S++ ++YR+ F + S T
Sbjct: 88 IWLQGVCYHRRNEELTKELEPLTDSDRRLYTMELFKRDFASKVWLTYRREFPQLAGSMFT 147
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+D GWGCMLRS QML+A L+ H LGR +++
Sbjct: 148 TDCGWGCMLRSGQMLLAGGLVMHFLGRVYKQ 178
>gi|402593880|gb|EJW87807.1| hypothetical protein WUBG_01286, partial [Wuchereria bancrofti]
Length = 216
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 84/154 (54%), Gaps = 14/154 (9%)
Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
+W P+L+++PL LGL +N Y P ++ F PQ +GI+GG+P + Y G+ + + +YL
Sbjct: 28 EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 87
Query: 324 DPHDVQPVINIG--------KDDL------EADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
DPH Q +++ +DD E STYH I +D +DPSLA+GF+C
Sbjct: 88 DPHFCQNFVDLDETTTTRDERDDYVEIKNDEFKDSTYHCPFILSTKIDKVDPSLALGFFC 147
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 403
+DD+++ R ++ PLF + +T K
Sbjct: 148 HTEDDYNELAKRLRTHLLPASTPPLFEMLETRPK 181
>gi|145553267|ref|XP_001462308.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430147|emb|CAK94935.1| unnamed protein product [Paramecium tetraurelia]
Length = 389
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 137/340 (40%), Gaps = 45/340 (13%)
Query: 87 DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-LFH 145
D A + + + F I SYR + S +TSD GWGCMLR QM + Q + F+
Sbjct: 47 DLAVDQKMEKLKSLFEGTIWFSYRSKILQLQYSTLTSDTGWGCMLRVGQMAMCQQIKYFY 106
Query: 146 RLGRPWRKPLQKPFDREYVEILHLFGDSE-------------------TSPFSIHNLL-Q 185
L +E E++ F D++ SPFSI ++ Q
Sbjct: 107 NLSSS----------QELTELIQQFADNDEEELSKFMDRNDGDQTIQYKSPFSIQKIVVQ 156
Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED---GERGG 242
+ G W P + + L R + + L +++S + GG
Sbjct: 157 TKLELQKSPGEWYKPNDILFVLKYLFRYSKYQKNLRMHINHENAFILSDVISLMFNKNGG 216
Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
D KGQ D + + + +GL+ N Y+ L T+PQ GI+
Sbjct: 217 -------DEEWLKEQIEKGQNDEFGVSIFILTRIGLDTCNQEYLKVLNDIMTYPQFQGIL 269
Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
GG P + YI+G IYLDPH VQ N ++E D S+Y I+ I + +DPS
Sbjct: 270 GGFPNKALYILGRVGNYYIYLDPHYVQNAQNY--QEMENDRSSYTCQSIQLIDSNQLDPS 327
Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT-VTQTH 401
+AI F C R K + NG F +T+TH
Sbjct: 328 MAISF-CVKNALDLLDLWRRLKQTKSENGESFFMALTETH 366
>gi|358339268|dbj|GAA47364.1| autophagy-related protein 4 [Clonorchis sinensis]
Length = 700
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 84/149 (56%), Gaps = 10/149 (6%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAI 321
A W P+LL +PL LGL + NP Y ++ P S+GI+GG+P + +IVG +E +
Sbjct: 259 ATWRPLLLFIPLRLGLHQPNPCYFNAIKAILQIPHSIGIMGGRPSHAVWIVGTAGDEDLL 318
Query: 322 YLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
LDPH QP +DDL A D T+H D + L+ +DPS+ IGF C +D+FD CA
Sbjct: 319 CLDPHTTQPA---SQDDLTAEDDVTHHCDCPVRLPLERLDPSMVIGFVCTTEDEFDQLCA 375
Query: 381 RASK---LAEESNGAPLFTVTQTHKKPVN 406
+ E + G PLF V ++ +P N
Sbjct: 376 HLERDVLSVETTCGHPLFEVHKS--RPSN 402
Score = 41.2 bits (95), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 35/66 (53%), Gaps = 3/66 (4%)
Query: 136 MLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
M++A+A+ LG+ WR P + D Y + +F D ++S +SI N+ G A
Sbjct: 1 MMLAEAITRIHLGKDWRWTPGCQ--DEAYCRLRRMFQDHKSSLYSIQNITMLGMALDKPI 58
Query: 195 GSWVGP 200
GSW GP
Sbjct: 59 GSWFGP 64
>gi|390344344|ref|XP_786847.3| PREDICTED: uncharacterized protein LOC581768 [Strongylocentrotus
purpuratus]
Length = 1018
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/145 (39%), Positives = 81/145 (55%), Gaps = 10/145 (6%)
Query: 70 IWLLGVC-HKIAQDEALGDAAGNNGLAE-----FNQDFSSRILISYRKGFDPIGDSKITS 123
IW LG C H+ +D G + + F QDFSSR+ ++YR+ F + S TS
Sbjct: 346 IWFLGKCYHQRPEDPDPERPPGMDSVRSMVIEMFKQDFSSRLWMTYRREFPTLAGSNFTS 405
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDS--ETSPFS 179
D GWGCMLRS QM++A +L+ H LGR W KP + + + +I+ FGD + SPFS
Sbjct: 406 DCGWGCMLRSGQMMLAHSLILHFLGREWNIYKPQTQEMLQFHRQIVRWFGDQPLDMSPFS 465
Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMC 204
+H L+ G+ G G W GP ++
Sbjct: 466 VHRLVGIGQNNGKKVGDWYGPSSVA 490
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 83/154 (53%), Gaps = 6/154 (3%)
Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
ID + S ++G W +++++P+ LG ++VNP YI ++ FT LGI+GGKP
Sbjct: 819 IDPSRSRTSTSTEGGKPWCAVVIMIPVRLGGDEVNPVYIRPIQSLFTLESCLGIIGGKPK 878
Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
S + VG QEE I+LDPH Q V+++ D ++H R + + +DPS IGF
Sbjct: 879 HSLFFVGFQEEKLIHLDPHYCQQVVDMKTRDFPL--WSFHCMSPRKMSISKMDPSCTIGF 936
Query: 368 YCRDKDDFDDFCAR----ASKLAEESNGAPLFTV 397
Y R ++ F+ C S L S+ P+F V
Sbjct: 937 YIRTEEQFEQLCKELPTVVSPLGSHSSDYPMFIV 970
>gi|403216261|emb|CCK70758.1| hypothetical protein KNAG_0F00890 [Kazachstania naganishii CBS
8797]
Length = 448
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 154/363 (42%), Gaps = 68/363 (18%)
Query: 92 NGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------------IT 122
N +F +D +R+ +YR F PI S
Sbjct: 38 NEKMQFYRDVCTRLNFTYRTKFVPISRSPDGPSPISFQLMIRDGPLSVIENALLHPDCFN 97
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
+D+GWGCM+R+ Q L+ AL R GR +R D +I+ F D+ +PFS+HN
Sbjct: 98 TDIGWGCMIRTGQSLLGNALQRLRHGREFRVTESTHDD----DIIQWFKDTPDAPFSLHN 153
Query: 183 LLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
++ G + + G W GP A RS ++L C + G+ I VS + ++
Sbjct: 154 FVKKGVELADMKPGQWFGPAATSRSIQSLI-CNFPQCGID-----HCIVSVSSADIYKQD 207
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
+ D S +L+L + LG+ VN Y +R S+GI
Sbjct: 208 VEDMFDADPDSN--------------LLILFGVKLGVSAVNASYWEDIRRLLNSKFSVGI 253
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
GG+P +S Y G Q + +Y DPH QP + DD A +T HS + L +DP
Sbjct: 254 AGGRPSSSLYFFGYQNQELLYFDPHTPQPSL---IDD--AAFNTCHSIEFGKLELRDMDP 308
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD 421
S+ IG + D++++ ++ E S +F + + + + DV + G D+
Sbjct: 309 SMLIGIMIEGERDWENW----ARFTETSK---IFNILEERSEDCINVDV--DIDGDENDE 359
Query: 422 SLG 424
++G
Sbjct: 360 NIG 362
>gi|407408842|gb|EKF32115.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
Length = 357
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 125/277 (45%), Gaps = 35/277 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 161
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGNERLQELRAR 132
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAITSEGTKHGVKCGEWFGPTPIAKTLNAL----------- 177
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
MA Y+ +G E G V+ + + ++LL+P++LG+ +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEQVKELLRQSMHVVLLIPVMLGIRVI 227
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP + E
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHRVQPAFTSSGNSGEL 287
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+ R + S D S+ +GFY D F F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSFAVF 318
>gi|149422017|ref|XP_001518728.1| PREDICTED: cysteine protease ATG4D-like [Ornithorhynchus anatinus]
Length = 286
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 50/141 (35%), Positives = 79/141 (56%), Gaps = 3/141 (2%)
Query: 262 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
+A+W I++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +
Sbjct: 109 EAEWKSIIILVPVRLGGETLNPAYMPCIKELLRMEPCLGIIGGKPKHSLYFIGYQDDFLL 168
Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
YLDPH QP ++ KD + ++H R + +DPS +GFY + DF+ C++
Sbjct: 169 YLDPHYCQPCVDTMKDSFPLE--SFHCTAPRKLPFAKMDPSCTVGFYAGTRKDFEALCSQ 226
Query: 382 -ASKLAEESNGAPLFTVTQTH 401
L + P+FTV + H
Sbjct: 227 LLQALNSTATRYPMFTVAEGH 247
>gi|340059839|emb|CCC54236.1| putative peptidase [Trypanosoma vivax Y486]
Length = 354
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 133/299 (44%), Gaps = 32/299 (10%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA-LLFHRLGRPWRKPLQKPFDREY 163
+ SYR GF P+ + T+DV WGC++R++QML+AQA + F G + RE
Sbjct: 69 LYFSYRCGFTPLSNGS-TTDVAWGCVVRAAQMLLAQAHMRFFNSGHAFVDGSALQILREK 127
Query: 164 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
V+ LF D ++PF IH + + YG+A G W G ++ +L + G G
Sbjct: 128 VQ--PLFLDDPSAPFGIHAMTSEAEKYGVACGQWFGMTPAAKTIASLCQQHSLRGGNG-- 183
Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 283
P + V D E V + SR ++LL+P VLGL++++
Sbjct: 184 --PAVLVFV----DREVSALKVRDLLSHSRQ-------------VVLLIPAVLGLDRISV 224
Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 343
+Y L +G++GG+ ++ Y VG Q + IYLDPH Q E T
Sbjct: 225 KYSKMLIRCLEMESCIGVIGGRKSSALYFVGHQSNNIIYLDPHRAQRAFTEVASPGEL-T 283
Query: 344 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 402
+H + + + S+ GFY + F F A + A + PL +V + +
Sbjct: 284 GAWHL-----LPVTACSTSILFGFYIDSLESFKQFEADMLE-ANSALAFPLISVATSER 336
>gi|170572866|ref|XP_001892265.1| Peptidase family C54 containing protein [Brugia malayi]
gi|158602497|gb|EDP38912.1| Peptidase family C54 containing protein [Brugia malayi]
Length = 440
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/155 (33%), Positives = 83/155 (53%), Gaps = 16/155 (10%)
Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
+W P+L+++PL LGL +N Y P ++ F PQ +GI+GG+P + Y G+ + + +YL
Sbjct: 252 EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 311
Query: 324 DPHDVQPVINIG---------------KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
DPH Q +++ K+D E STYH I +D +DPSLA+GF+
Sbjct: 312 DPHFCQNFVDLDEATTTKDERGDYVEIKND-EFRDSTYHCPFILSTKIDKVDPSLALGFF 370
Query: 369 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 403
C +DD+ + R ++ PLF + +T K
Sbjct: 371 CHTEDDYSELANRLRTHLLPASTPPLFEMLETRPK 405
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 57/122 (46%), Gaps = 28/122 (22%)
Query: 85 LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
LG+ + G +A + +S + +YRK F PIG + T+D GWGCMLR QML+A+ L+
Sbjct: 59 LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 118
Query: 144 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
LGR W +DR EY IL G SE G G W
Sbjct: 119 VRHLGRNWL------WDRDVMLTEYKRILPNMGVSE----------------GKEIGEWF 156
Query: 199 GP 200
GP
Sbjct: 157 GP 158
>gi|216963257|gb|ACJ73915.1| autophagy-related 4b variant 3 [Zea mays]
Length = 178
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/77 (64%), Positives = 61/77 (79%)
Query: 81 QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQ 140
++E G + ++G A F +DFSSRI I+YRKGFD I SK+TSDV WGCM+RSSQMLVAQ
Sbjct: 99 EEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQ 158
Query: 141 ALLFHRLGRPWRKPLQK 157
AL+FH LGR WRKP +K
Sbjct: 159 ALIFHHLGRSWRKPSEK 175
>gi|123407417|ref|XP_001303004.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121884346|gb|EAX90074.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 298
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 81/304 (26%), Positives = 135/304 (44%), Gaps = 46/304 (15%)
Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVE 165
+Y K F P+ T+D WGC +RS+Q L+ Q + L+ LG R P + +Y
Sbjct: 28 TYHKNFAPL-QGGFTTDKNWGCCIRSAQGLIMQFITKLYKHLGDDIRNIF--PTNSKY-- 82
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
LF D SPF + ++ ++YG+ G WV P + + + R
Sbjct: 83 --ELFYDLPHSPFGLPHICAELQSYGVMPGEWVKPSLLAPVIKEIMNFFRI--------- 131
Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
PVV + V ++ + P+LLL L+LG E +Y
Sbjct: 132 ------------------PVVIAEHGCLSREVLNEALSHNIPVLLLFTLMLGYENFELKY 173
Query: 286 IPTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
+P L+LT + QS+G+VGG+ G + +IVG Q+E +Y DPHDV +I K D +
Sbjct: 174 LPFLKLTLSLIYQSVGVVGGQQGKAYFIVGHQKEKLLYFDPHDVNE--SITKID---QIN 228
Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
++ + D++ S+ +GF+ + D ++ L +S P+ V + +
Sbjct: 229 QLFKPPLKVMPADTLSSSMLVGFFITNLQDAEEL----PMLLNQSGECPIHIVDKIEEAK 284
Query: 405 VNHS 408
H+
Sbjct: 285 ETHT 288
>gi|119604525|gb|EAW84119.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_d
[Homo sapiens]
Length = 360
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 86/149 (57%), Gaps = 7/149 (4%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 181 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 240
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 241 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 298
Query: 383 SKLAEESNGA---PLFTVTQTHKKPVNHS 408
+++ S+ P+FT+ + H + +HS
Sbjct: 299 TRVLSSSSATERYPMFTLAEGHAQ--DHS 325
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 86 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 139
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR 152
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 140 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 167
>gi|426387285|ref|XP_004060104.1| PREDICTED: cysteine protease ATG4D [Gorilla gorilla gorilla]
Length = 362
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 86/149 (57%), Gaps = 7/149 (4%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 183 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 242
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 243 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 300
Query: 383 SKLAEESNGA---PLFTVTQTHKKPVNHS 408
+++ S+ P+FT+ + H + +HS
Sbjct: 301 TRVLSSSSATERYPMFTLAEGHAQ--DHS 327
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 88 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 141
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR 152
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 142 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 169
>gi|151556001|gb|AAI49850.1| ATG4D protein [Bos taurus]
Length = 359
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 83/144 (57%), Gaps = 5/144 (3%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 180 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 239
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 240 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 297
Query: 383 SKLAEESNGA---PLFTVTQTHKK 403
+++ S+ P+FT+ + H +
Sbjct: 298 TRVLSSSSATERYPMFTLVEGHAQ 321
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 85 TSFSKISSVHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAGGSLTSD 138
Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR 152
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 139 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 166
>gi|28395487|gb|AAO39081.1| autophagy protein 4 [Dictyostelium discoideum]
Length = 745
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 55/135 (40%), Positives = 81/135 (60%), Gaps = 5/135 (3%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++++PL LG +K+N YI L+L PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH VQ +N D ++TY + + + +DPSL+IGFYCRD+ F+D C R S
Sbjct: 563 PHFVQESVNPNSFDY---SNTYSGCIPQKMPFTQLDPSLSIGFYCRDQASFEDLCDRLSV 619
Query: 385 LAEESNGAPLFTVTQ 399
+ + P+ +V Q
Sbjct: 620 I--NNCEFPIISVCQ 632
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 154
F D +S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H P
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289
Query: 155 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 189
+KP Y ++L F D S+ + IH ++ +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326
>gi|66822477|ref|XP_644593.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|66822607|ref|XP_644658.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|74857708|sp|Q557H7.1|ATG4_DICDI RecName: Full=Cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|60472726|gb|EAL70676.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|60472781|gb|EAL70731.1| autophagy protein 4 [Dictyostelium discoideum AX4]
Length = 745
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 55/135 (40%), Positives = 81/135 (60%), Gaps = 5/135 (3%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
W +++++PL LG +K+N YI L+L PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
PH VQ +N D ++TY + + + +DPSL+IGFYCRD+ F+D C R S
Sbjct: 563 PHFVQESVNPNSFDY---SNTYSGCIPQKMPFTQLDPSLSIGFYCRDQASFEDLCDRLSV 619
Query: 385 LAEESNGAPLFTVTQ 399
+ + P+ +V Q
Sbjct: 620 I--NNCEFPIISVCQ 632
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 154
F D +S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H P
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289
Query: 155 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 189
+KP Y ++L F D S+ + IH ++ +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326
>gi|363754893|ref|XP_003647662.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891299|gb|AET40845.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
DBVPG#7215]
Length = 469
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 96/332 (28%), Positives = 145/332 (43%), Gaps = 54/332 (16%)
Query: 96 EFNQDFSSRILISYRKGFDPI-----GDSKI------------------------TSDVG 126
EF +D +SR+ +YR F PI G S + +D+G
Sbjct: 62 EFLKDVNSRLHFTYRTRFAPIPRHIDGPSPMRISILLRDNPLNVIENVLNNLDCFQTDIG 121
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
WGCM+R+ Q L+A AL LGR +R + ++I+ F D+ PFS+H +Q
Sbjct: 122 WGCMIRTGQSLLANALQLANLGRDFRISGSDSDINEVEMKIIRWFEDNPKHPFSLHKFVQ 181
Query: 186 AG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 244
G K G G W GP A+ RS +L C ++S D +
Sbjct: 182 EGYKLSGKKPGEWFGPSAISRSIRSLVMKFPGSGIDHC--------IISTD-------SA 226
Query: 245 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
V +D+ K LLL+ + LG++ N Y ++ + QS+GI GG
Sbjct: 227 DVYLDEIDPLFRANPKANV-----LLLLGVRLGVDFTNEYYWDDIKNILSSSQSVGISGG 281
Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
+P +S Y G Q + YLDPH VQ + + + D E + H IHL +IDPS+
Sbjct: 282 RPSSSLYFFGYQGDYLFYLDPHKVQLNLALYESD-EERFHSVHPQTFNKIHLSAIDPSML 340
Query: 365 IGFYCRDKDDFDDF--CARASKLAEESNGAPL 394
+GF +DD+ + SK+ S+ P+
Sbjct: 341 LGFLLTGEDDWLSWKTTVLGSKIIHLSDSKPV 372
>gi|365758760|gb|EHN00587.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 485
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 128/310 (41%), Gaps = 57/310 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 76 EFLLDVRSRVNFTYRTRFVPIARAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 135
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R F RE I++ F D+ +PFS+HN +
Sbjct: 136 WGCMIRTGQSLLGNALQILHLGRDFRVDEDDDFRRE-SRIVNWFNDTPEAPFSLHNFVST 194
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS + L E G+ + V SG D
Sbjct: 195 GTELSDKRPGEWFGPAATARSIQYLIY-GFPECGINA----CIVSVSSG--DIYENEVEE 247
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
V +D+ + + IL L+ + LG+ VN Y ++ S+GI GG+
Sbjct: 248 VFVDNPN-------------SSILFLLGVKLGINAVNESYRESICGILNSAWSVGIAGGR 294
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ ++ H+ + L +DPS+ I
Sbjct: 295 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVNSCHTSKFGRLQLSEMDPSMLI 348
Query: 366 GFYCRDKDDF 375
G + + D+
Sbjct: 349 GVLIKGEKDW 358
>gi|255722127|ref|XP_002545998.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240136487|gb|EER36040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 444
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 91/312 (29%), Positives = 136/312 (43%), Gaps = 71/312 (22%)
Query: 103 SRILISYRKGFDPIGDSK----------------------------------ITSDVGWG 128
SR+ +SYR GFDPI ++ TSD GWG
Sbjct: 84 SRLWLSYRCGFDPIPKAEDGPQPIQFFPSIIFNKTTIYSNFANLKSLFDKENFTSDAGWG 143
Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AG 187
CM+R+SQ L+A LL P D + +++ LF D+++SPFSIHN ++ AG
Sbjct: 144 CMIRTSQNLLANTLL-----------QLLPPDSKQ-DVIGLFQDNQSSPFSIHNFIKVAG 191
Query: 188 KA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
++ + G W GP A S + L + + G + + I S DGE
Sbjct: 192 ESPLQVKPGQWFGPNAASLSIKRLTDTLQDKEIKGVKYPKVFISENSDLYDGEINE---- 247
Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
+ + R +L+L P+ LG++KVN Y ++ S GI GGKP
Sbjct: 248 ILSEEGRS-------------VLVLFPIRLGIDKVNSYYYDSIFQVLKSKFSCGISGGKP 294
Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
+S Y +G IY DPH Q V N + +YH+ +++ +DPS+ IG
Sbjct: 295 SSSFYFLGYDNSDLIYFDPHLPQLVEN------PINIESYHTRNYNRLNISLLDPSMMIG 348
Query: 367 FYCRDKDDFDDF 378
R DD+ +F
Sbjct: 349 ILLRSMDDYLEF 360
>gi|410075557|ref|XP_003955361.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
gi|372461943|emb|CCF56226.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
Length = 463
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 129/312 (41%), Gaps = 67/312 (21%)
Query: 92 NGLAEFNQDF----SSRILISYRKGFDPIGDSK--------------------------- 120
N + NQDF +SR+ +YR F PI S
Sbjct: 52 NRNSNLNQDFLSDVNSRLAFTYRTKFQPILRSSEGPSPLNFRMIFRDNPINTLENVINNP 111
Query: 121 --ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
+D+GWGCM+R+ Q L+ AL +LGR +R L + EI+ F D+ PF
Sbjct: 112 DCFNTDIGWGCMIRTGQSLLGNALQLAKLGRHFR--LDNKMGIKDDEIISWFRDTTQEPF 169
Query: 179 SIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
SIH ++ G K G W GP A S ++L + E G+ + V SGD
Sbjct: 170 SIHKFVEKGNKLANKKPGEWFGPAATSISIQSLIE-EFPECGID----KCLVSVSSGD-- 222
Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
+D R +F + + IL L+ + LGL+ VN Y +
Sbjct: 223 ---------IFEDDVRE--IFEENMD--SKILFLMGVKLGLDAVNSFYWEDILNILDSKF 269
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY---HSDVIRHI 354
S+GI GG+P +S Y G Q +Y DPH QP + D S Y H+ +
Sbjct: 270 SVGIAGGRPSSSLYFFGHQGNELLYFDPHRPQPSL--------VDPSVYETCHTTNFGKL 321
Query: 355 HLDSIDPSLAIG 366
+ +DPS+ IG
Sbjct: 322 DIKDMDPSMLIG 333
>gi|145510316|ref|XP_001441091.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408330|emb|CAK73694.1| unnamed protein product [Paramecium tetraurelia]
Length = 392
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 143/340 (42%), Gaps = 41/340 (12%)
Query: 86 GDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
DA + + Q S I SYRK S +TSD GWGCM+R +QM +AQ +
Sbjct: 46 NDADIEQRIEKVKQTCSKIIWFSYRKNIPKFQVSSLTSDTGWGCMIRVAQMALAQII--- 102
Query: 146 RLGRPWRKPLQ-----KPF----DREYVEILHLFGDSET----SPFSIHNLLQAGKA-YG 191
R ++KP Q + F D E + + F ++ +PFSI ++ K
Sbjct: 103 RYYNYFKKPEQLIVLIRHFIDDDDNELTDFIQQFHKNQNQYYHAPFSIQKIVHYAKVELK 162
Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----------VSGDEDGER 240
G W + ++ + L + + SL M IY+ + + +
Sbjct: 163 KEPGDWYKSDEILQTLDYLFKYSQY-------SLNMEIYINYDCAFILQDAIQQMFNQQE 215
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
G + + + +++ + F D I + +P +GL+ +N Y+ L P G
Sbjct: 216 GNE--IWLKERAKNNNQFDL--QDHKGICIFLPTRIGLQNINKDYLEVLNQIIALPYFQG 271
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
++GG + Y VG ++ IYLDPH VQ N DDL + ++Y I+ IH ID
Sbjct: 272 MIGGVSKRALYFVGRIQDYLIYLDPHFVQNAQNF--DDLSKNQASYTCQNIQLIHNSLID 329
Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
PS+ + R+ + D +E F++ +T
Sbjct: 330 PSIVVCLCIRNALELLDLWQIFQHFKQEYQDLFFFSLLET 369
>gi|118390095|ref|XP_001028038.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89309808|gb|EAS07796.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 1216
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/362 (24%), Positives = 154/362 (42%), Gaps = 79/362 (21%)
Query: 99 QDFSSRILISYRKGFDPIGDSKI-------TSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
Q + + IL +YRK F P+ KI TSD GWGCM+R+ QM+ AQ + H +
Sbjct: 257 QIYQNTILFTYRKNFYPLLKDKINDPQKNQTSDAGWGCMIRAGQMIFAQTIKRHLKKTDY 316
Query: 152 RKP----------LQKPFDRE----YVEILHLFGDSETSPFSIHNLL-QAGKAYGLAAGS 196
+ L++ +E Y+ + P+SIH + +A Y + G
Sbjct: 317 IEQHQLINIIIGFLEEEEVQEGGKGYIFNQQSYIQDRIRPYSIHQITNRAFCKYKIQPGQ 376
Query: 197 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED-----------GERGGAPV 245
W P + + L + + + G ++L + ++ S D+ G +G +
Sbjct: 377 WYTPNQIAIILKELHKKNKIK---GTENLKIDVH--SSDKPIIFEKILQTLLGRQGKINL 431
Query: 246 VC--------------IDDA------------SRHCSVFSKGQADWT------------- 266
C DD+ S + + + D T
Sbjct: 432 NCNHENQQSRNSINQDQDDSFEKIMPPNQQEIEEFSSQYEESKEDQTDNLCCKDCFKTDN 491
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
+ LL+P LGL++++P +I L+ + QS+G++GGKP + Y +G + +YLDPH
Sbjct: 492 KLFLLLPCRLGLDEISPIHIEILKKLLSLKQSVGMIGGKPNKAHYFLGFVGDDLLYLDPH 551
Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
++ + K+DL + S+Y + + + ++ I SL GFY D+ + F +L
Sbjct: 552 YIKECVR--KEDLMENISSYFEEDVFKMPINKISTSLVFGFYFSGVDELNKFYKFLRQLE 609
Query: 387 EE 388
+E
Sbjct: 610 KE 611
>gi|407848120|gb|EKG03593.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
Clan CA, family C54, putative [Trypanosoma cruzi]
Length = 357
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG---RPWRKPLQKPFDR 161
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G R + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPRIGSERLQELRAR 132
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
MA Y+ +G E G ++ + + T ++LL+P++LG+ +
Sbjct: 178 -----MASYLATGGE-----GPVILAFPERQIFLEEVKELLRQSTHVVLLIPVMLGICVI 227
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP E
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 287
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+ R + S D S+ +GFY D F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSLSVF 318
>gi|71415152|ref|XP_809652.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70874068|gb|EAN87801.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
Length = 357
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 161
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 132
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
MA Y+ +G E G V+ + + T ++LL+P++LG+ +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 227
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP E
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 287
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+ R + S D S+ +GFY D F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSLALF 318
>gi|302657364|ref|XP_003020406.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
verrucosum HKI 0517]
gi|291184236|gb|EFE39788.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
verrucosum HKI 0517]
Length = 398
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 104/235 (44%), Gaps = 48/235 (20%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK--------------------------ITSDVGWGC 129
+F DF S++ I+YR F PI + TSD GWGC
Sbjct: 185 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSSISLGVRLRSQLIDTQGFTSDTGWGC 244
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-K 188
M+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 245 MIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGAT 301
Query: 189 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
A G G W GP A + +AL + + GL G + E+ V C
Sbjct: 302 ACGKCPGEWFGPSAASQCIQALVKSN-PQVGL------RVCITSDGSDIYEKQFKEVACD 354
Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
+ P L+L+ + LG+++V P Y +L+ FPQS+GI G
Sbjct: 355 ESG-----------GGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAG 398
>gi|111154179|gb|ABH07411.1| autophagin-2 [Trypanosoma cruzi]
Length = 351
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 161
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 68 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 126
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 127 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 171
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
MA Y+ +G E G V+ + + T ++LL+P++LG+ +
Sbjct: 172 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 221
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP E
Sbjct: 222 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 281
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+ R + S D S+ +GFY D F
Sbjct: 282 TCAR------RVLPTTSYDTSMTLGFYISSLDSLALF 312
>gi|207341865|gb|EDZ69806.1| YNL223Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 371
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 124/302 (41%), Gaps = 57/302 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLI------YGFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y G Q ++ DPH QP + ++ + H+ + L +DP ++
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPRCSL 369
Query: 366 GF 367
F
Sbjct: 370 VF 371
>gi|330840249|ref|XP_003292131.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
gi|325077656|gb|EGC31355.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
Length = 603
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 83/168 (49%), Gaps = 38/168 (22%)
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
P+L+L+P+ LGL+ +N Y +L F FPQ+LG+VGGKP AS Y + VQ+++ YLDPH
Sbjct: 371 PLLILIPMRLGLDGLNSIYYQSLLEIFKFPQNLGVVGGKPRASLYFIAVQDDNLFYLDPH 430
Query: 327 DVQPVINIGKDDLEAD-------------------------------------TSTYHSD 349
VQ I+I + E +T+
Sbjct: 431 TVQNHIDINNSNGEPSNFSFSSSPSSSNINIINTNNNNNNNNNNDKNNNNSFPVNTFFCS 490
Query: 350 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
+ H+ +DPSL + F+C+ + DFDDF R+ +A + P+F++
Sbjct: 491 QTKRTHVSEVDPSLVVAFFCKSRSDFDDFVDRSKAMASQMEN-PIFSI 537
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 74/126 (58%), Gaps = 1/126 (0%)
Query: 87 DAAGNNGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
D G + + EF +DF++R+L +YR+GF I +++ +D GWGCMLRS QML++ LL H
Sbjct: 129 DIPGQSFIKEFLEDFTTRVLWFTYRQGFPFIDNTQYDNDCGWGCMLRSGQMLLSNLLLHH 188
Query: 146 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
LG W+K Y I+ +F D ++PFSIHN+ G+ G G W P + +
Sbjct: 189 ALGDDWKKSSNSTHPDVYNNIISMFLDKPSAPFSIHNIALEGQTLGKNIGEWFAPSIISQ 248
Query: 206 SWEALA 211
+ ++L
Sbjct: 249 AIKSLV 254
>gi|154419947|ref|XP_001582989.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121917228|gb|EAY22003.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 284
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 113/261 (43%), Gaps = 39/261 (14%)
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
YR + +S +T+D GWGC RS+Q L+ Q +L +L R +R + F + V L
Sbjct: 25 YRYNLSDLANSLLTTDKGWGCCFRSTQGLLCQYIL--KLHRKFRSLYDQVFGQN-VNPLD 81
Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
LF D ++PF I NL + A GL G W P M A + L C
Sbjct: 82 LFLDIPSAPFGIQNLTKNAFAIGLPVGEWAKPSIM----AATIKLIFDTLNLSC------ 131
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
++S D + +H P L+L+P + GL K++ Y+
Sbjct: 132 --IISQDLTLDSNDI---------KHTKY---------PALILIPSLFGLSKMDDSYLSF 171
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
L L SLG V G+ ++ Y VG E Y DPH + + + ++
Sbjct: 172 LLLCLCIESSLGFVSGQNASAYYFVGFDLEDFYYFDPHVTKEAV------VSPPYDSFFD 225
Query: 349 DVIRHIHLDSIDPSLAIGFYC 369
++ + +SI+PS+ +GFYC
Sbjct: 226 LELKSMKKESINPSVLLGFYC 246
>gi|119623099|gb|EAX02694.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_e
[Homo sapiens]
Length = 231
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 102/226 (45%), Gaps = 61/226 (26%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + +
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEK---- 133
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
MCR + +S D G+R + +
Sbjct: 134 -------------MCR-----------------------VLPLSADTAGDRPPDSLTASN 157
Query: 250 DA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
+ S +CS W P+LL+VPL LG+ ++NP Y+ ++T
Sbjct: 158 QSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKVT 196
>gi|123479730|ref|XP_001323022.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121905878|gb|EAY10799.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 284
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 69/263 (26%), Positives = 117/263 (44%), Gaps = 39/263 (14%)
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
YR F I +S ++ D GWGC RSSQ LV Q +L RL + + F + L
Sbjct: 25 YRNNFQAIENSTLSCDSGWGCCFRSSQGLVCQYIL--RLHKNFPDLYNSTFGID-KNPLD 81
Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
LF D +PF I N++ + GL G+W P + +++++ + L C
Sbjct: 82 LFLDIPEAPFGIQNIVTHANSLGLPIGNWAKPSIIASAYKSIFQ----SLHLNC------ 131
Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
+V D ++ + ++ P+L+L+P + GLEK+ YI
Sbjct: 132 --IVPQDSTF------------------IYEELESTNYPVLILIPGLFGLEKIEKPYISF 171
Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
+ L+ SLG V G ++ Y +G + Y DPH + + D +
Sbjct: 172 IFLSLCMNSSLGFVSGHNDSAFYFIGFDSDYFYYFDPHVTKQALTGPPYDSLFELK---- 227
Query: 349 DVIRHIHLDSIDPSLAIGFYCRD 371
++ + +++I+PS+ +GFYC D
Sbjct: 228 --LKSMKIENINPSVLLGFYCDD 248
>gi|159128081|gb|EDP53196.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
fumigatus A1163]
Length = 226
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 71/118 (60%), Gaps = 3/118 (2%)
Query: 261 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
G+ + P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ
Sbjct: 20 GRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHL 79
Query: 321 IYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
YLDPH +P + NI + + TYH+ +R IH+ +DPS+ IGF +D++D+
Sbjct: 80 FYLDPHQTRPALPQRNIDDPYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137
>gi|71000771|ref|XP_755067.1| autophagy cysteine endopeptidase Atg4 [Aspergillus fumigatus Af293]
gi|66852704|gb|EAL93029.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
fumigatus Af293]
Length = 226
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 71/118 (60%), Gaps = 3/118 (2%)
Query: 261 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
G+ + P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ
Sbjct: 20 GRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHL 79
Query: 321 IYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
YLDPH +P + NI + + TYH+ +R IH+ +DPS+ IGF +D++D+
Sbjct: 80 FYLDPHQTRPALPQRNIDDPYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137
>gi|367014015|ref|XP_003681507.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
gi|359749168|emb|CCE92296.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
Length = 460
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 97/334 (29%), Positives = 146/334 (43%), Gaps = 63/334 (18%)
Query: 96 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 126
+F D SR+ +YR F PI G S ++ +D+G
Sbjct: 60 QFLSDVHSRLHFTYRTKFVPIPRVSDGPSPLSFHFLIRENPLTTIENAIYNPDCFNTDIG 119
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+R+ Q L+ AL LGR +R + + D+E +I+ F D+ + FSIHN +
Sbjct: 120 WGCMIRTGQSLLGNALQIANLGRDFR--VNQGKDQEEYKIIDWFADTPQAHFSIHNFVSQ 177
Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
G K G W GP A RS + L Q + G+ I V SGD
Sbjct: 178 GLKLSNKKPGEWFGPAATSRSIQCLVE-QFPDCGID----KCLISVSSGD---------- 222
Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
+D R +F+ Q + ILLL+ + LG+ VN Y ++ T S+GI GG+
Sbjct: 223 -VFEDEVRE--IFA--QKPQSRILLLLGVKLGVNAVNEYYWDDVKKTLGSKFSVGIAGGR 277
Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
P +S Y +G Q IY DPH QP + + + T H+ + L +DPS+ I
Sbjct: 278 PSSSLYFMGFQGNELIYFDPHTPQPSLQTSANFYD----TCHALNFGKLLLSDLDPSMLI 333
Query: 366 GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
G ++ + + EE + +F V+Q
Sbjct: 334 GILISGEEAW-------LQWKEEVKDSKIFNVSQ 360
>gi|261335715|emb|CBH18709.1| peptidase, putative [Trypanosoma brucei gambiense DAL972]
Length = 348
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 148/344 (43%), Gaps = 49/344 (14%)
Query: 93 GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
G AE + + ++L SYR F+P+ + T+D+GWGC +R+ QM++A AL+ ++ G
Sbjct: 37 GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93
Query: 152 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
F+ V L HLF D ++PF IH + G +G GSW GP +
Sbjct: 94 ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
AL M Y+ SG + G V+ + D K
Sbjct: 150 MGAL----------------MEDYLSSGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
+LLL+P++LG ++ Y L+ ++G VGGK G++ + +G Q + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248
Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
Q +DT S + L S S+ +GFY D F F
Sbjct: 249 YAQSAFTC------SDTQGKISGEWYTLPLTSCSTSVLLGFYIHSPDSFSQFTGD----I 298
Query: 387 EESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 430
+++N + +F + + V SD +G + D ++S D
Sbjct: 299 KDANSSLIFPLIE-----VTTSDCVGHIFSEDDPDVCSLVSFGD 337
>gi|257205644|emb|CAX82473.1| autophagy-related cysteine endopeptidase 2 [Schistosoma japonicum]
Length = 632
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 68/118 (57%), Gaps = 4/118 (3%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
++W P+LL VPL LGL NP Y ++ F P +GI+GG P + +IVGV + I
Sbjct: 385 SNWRPLLLFVPLRLGLHNPNPCYFNAIKAVFRLPNCIGILGGSPCHAVWIVGVTGDDVIC 444
Query: 323 LDPHDVQPVINIGKDDLEAD-TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
LDPH QP G+ +L+ D TYH + + L +DPS+ +GF C + +FDD C
Sbjct: 445 LDPHTTQPA---GRGNLKPDYDQTYHCENPIRMPLKRLDPSMVLGFLCSTEKEFDDLC 499
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 153
E SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR WR
Sbjct: 43 EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P Q+ EY +L +F D + +SI + G + G + GSW GP + + + L+
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160
Query: 214 QR 215
R
Sbjct: 161 DR 162
>gi|444730159|gb|ELW70550.1| Cysteine protease ATG4A [Tupaia chinensis]
Length = 364
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 84/351 (23%), Positives = 142/351 (40%), Gaps = 96/351 (27%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
L EF D + I ++ G + +SD GWGCMLR QM++AQAL+ LGR
Sbjct: 24 LEEF-PDTDELVWILGKQHLLKTGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRA--- 79
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
Q G G + G W GP + + + LA
Sbjct: 80 -------------------------------QMGVGEGKSIGEWFGPNTVAQVLKKLALF 108
Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF--------------- 258
+ +A+YV + V I+D + C V
Sbjct: 109 DEWNS--------LAVYVSMDN---------TVVIEDIKKMCCVLPLSADTDTESPPDSP 151
Query: 259 -----SKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV------- 302
SKG + W P+LL+VPL LG+ ++NP Y+ +L + L +
Sbjct: 152 TASNQSKGPSACGSAWKPLLLIVPLRLGINQINPVYVDAFKLQASCHPILIVTKEGVRRT 211
Query: 303 ---------GGKPGASTYIVGVQEESA---IYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
G + S + V ++ I+LDPH Q ++ ++ + D + +
Sbjct: 212 RILPPKDSSGARASESLKVKHVSFKTGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQS 271
Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 272 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 321
>gi|74026240|ref|XP_829686.1| peptidase [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|70835072|gb|EAN80574.1| peptidase, putative [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 348
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 92/344 (26%), Positives = 148/344 (43%), Gaps = 49/344 (14%)
Query: 93 GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
G AE + + ++L SYR F+P+ + T+D+GWGC +R+ QM++A AL+ ++ G
Sbjct: 37 GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93
Query: 152 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
F+ V L HLF D ++PF IH + G +G GSW GP +
Sbjct: 94 ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
AL M Y+ +G + G V+ + D K
Sbjct: 150 MGAL----------------MEDYLRNGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
+LLL+P++LG ++ Y L+ ++G VGGK G++ + +G Q + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248
Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
Q +DT S + L S S+ +GFY D F F
Sbjct: 249 YAQSAFTC------SDTQGKISGEWYTLPLTSCSTSVLLGFYIHSPDSFSQFTGD----I 298
Query: 387 EESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 430
+++N + +F + + V SD +G + D ++S D
Sbjct: 299 KDANSSLIFPLIE-----VTTSDCVGHIFNEDDPDVCSLVSFGD 337
>gi|119493442|ref|XP_001263911.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
gi|119412071|gb|EAW22014.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
Length = 179
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 45/114 (39%), Positives = 70/114 (61%), Gaps = 3/114 (2%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
+ P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ YLD
Sbjct: 24 FRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHLFYLD 83
Query: 325 PHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
PH +P + NI + + + TYH+ +R IH+ +DPS+ IGF +D++D+
Sbjct: 84 PHQTRPALPQRNIDERYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137
>gi|444726263|gb|ELW66801.1| Cysteine protease ATG4C [Tupaia chinensis]
Length = 378
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 137/386 (35%), Gaps = 130/386 (33%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
AGN + EF +DF SRI ++YR+ F PI S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45 AGN--VEEFRRDFISRIWLTYREEFPPIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102
Query: 149 RPWRKP----------------------------------LQKPFD--REYVE------- 165
R W P L+ P +E +E
Sbjct: 103 RAWTWPDALNIENSDSESWTSHTVKKFTASVEASLSGERELKTPTISLKETIEKYSDDHE 162
Query: 166 ---------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 163 IRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
G + IYV V D + + + AD +++LVP+ L
Sbjct: 223 PDLQG-----ITIYVAQ--------DCTVYSSDVIDKQRTAMTADNADDKAVIILVPVRL 269
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G E+ N Y+ ++ TF P + K +DP
Sbjct: 270 GGERTNTDYLEFVK-TFHCPSPKKMSFRK-----------------MDP----------- 300
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 394
S IGFYCR+ DF +K+ S+ PL
Sbjct: 301 -------------------------SCTIGFYCRNIQDFKRASEEITKMLTISSKEKYPL 335
Query: 395 FTVTQTHKK-------PVNHSDVLGE 413
FT H + N D+ E
Sbjct: 336 FTFVNGHSRDYDFTSTTTNEEDLFSE 361
>gi|145500634|ref|XP_001436300.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403439|emb|CAK68903.1| unnamed protein product [Paramecium tetraurelia]
Length = 406
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 92/397 (23%), Positives = 162/397 (40%), Gaps = 67/397 (16%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
I++LG H+I D+ + + + Q I I+YR+ + P+ S SD GWGC
Sbjct: 38 IYILG--HRIDIDQF----EIEDRINKIKQLVQETIWITYRRNYPPLYQSNYISDTGWGC 91
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS------------- 176
MLR QM +AQ L H ++ D +Y I+ F D+++
Sbjct: 92 MLRVGQMAMAQMLKKHLKNHGDKR------DEDYDNIILAFADNDSQENKEFIEFQNSKD 145
Query: 177 ---------PFSIHNL-LQAGKAYGLAAGSWVGPYAM------------CRSWEALARCQ 214
PFSI + A K + L G W P + R+ E L
Sbjct: 146 KQKAHNFICPFSIQKIAYLAKKEFNLDPGEWYRPNYILFLLELLHNTIPIRASENLKLSV 205
Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 274
++ L L ++ + D + +++ + K + + V
Sbjct: 206 FNDSCLFLDQLMNRMFEAKFETDKD--------LEEQLEKTQLIGKN-----SLAIFVLT 252
Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
+GL++ N +Y+ L P GIVGG P + YI+G + +YLDPH VQ N
Sbjct: 253 RIGLDEPNQKYLKILDEIMELPYFQGIVGGTPKRAFYILGKINDHYLYLDPHYVQEAEN- 311
Query: 335 GKDDLEADT----STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 390
KD + + ++Y I ++ +D S+ + FY R++ + F ++ + S+
Sbjct: 312 -KDQINENKMFNRTSYSCKNIHLLNQKHVDTSMGLSFYIRNQSELLQFWRNMKQIKQSSD 370
Query: 391 GAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMS 427
+F ++ + + V++S L E+ DD + +
Sbjct: 371 DFFIF-LSDSAPEYVDYSGQLEESSNKLNDDDVVFLQ 406
>gi|194374239|dbj|BAG57015.1| unnamed protein product [Homo sapiens]
Length = 259
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 114/272 (41%), Gaps = 56/272 (20%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 1 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 61 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 103
Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
D + C V P S G +P S
Sbjct: 104 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 126
Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 127 -LTASNQGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 185
Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
+++ DFD++C+ K + N +F + Q H
Sbjct: 186 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 216
>gi|401425377|ref|XP_003877173.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322493418|emb|CBZ28705.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 394
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)
Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRAFKAEYWSPSQGC---EAIKRTMQG--AVKT 148
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLNEGPSAA 255
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
+ T +R +H +D SL + F +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289
>gi|240274226|gb|EER37743.1| cysteine protease atg4 [Ajellomyces capsulatus H143]
Length = 454
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 76/146 (52%), Gaps = 4/146 (2%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
D P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q Y
Sbjct: 245 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFY 304
Query: 323 LDPHDVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
LDPH +P + + + +TYH+ +R +H+ +DPS+ IGF RD+DD++ +
Sbjct: 305 LDPHHTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSW 364
Query: 379 CARASKLAEESNGAPLFTVTQTHKKP 404
A G + V K P
Sbjct: 365 KRSVHNRAMIGTGKAIIHVFDKEKSP 390
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 50/119 (42%), Gaps = 24/119 (20%)
Query: 58 PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 113
P+R+ S++ LL H+ + LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144
Query: 114 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
DP + T+D GWGCM+RS Q L+A AL LGR R
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRACR 203
>gi|123397031|ref|XP_001301012.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121882136|gb|EAX88082.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 297
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 109/223 (48%), Gaps = 33/223 (14%)
Query: 107 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 166
+Y KGF P+ T+D WGC +RS Q L+ Q + +L + + ++ F
Sbjct: 27 FTYHKGFSPLAGG-YTTDKNWGCCIRSGQGLLMQFV--SKLYQLYGDKIKNIFPNG--SK 81
Query: 167 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
LF D +PF IH + + + +G+ AG WV P + ++ L
Sbjct: 82 FELFFDHPQAPFGIHCICRELETFGVKAGEWVKPSMLAPVFKDLLSF------------- 128
Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 286
I+VV E+G C+ S S G P+LLL L+LG + + +Y+
Sbjct: 129 FGIHVVIA-ENG--------CLSRESLR-EALSYGH----PVLLLFTLMLGYKDFDLKYL 174
Query: 287 PTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
P LRLT + QS+G+VGG+ G + Y+VG Q+E+ +Y DPH+V
Sbjct: 175 PFLRLTLSLIYQSVGVVGGQQGKAYYLVGHQKENLLYFDPHEV 217
>gi|154281231|ref|XP_001541428.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150411607|gb|EDN06995.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 463
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 76/146 (52%), Gaps = 4/146 (2%)
Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
D P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q Y
Sbjct: 253 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQASHFFY 312
Query: 323 LDPHDVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
LDPH +P + + + +TYH+ +R +H+ +DPS+ IGF RD+DD++ +
Sbjct: 313 LDPHHTRPALAYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSW 372
Query: 379 CARASKLAEESNGAPLFTVTQTHKKP 404
A G + V K P
Sbjct: 373 KRSVHNGAMIGTGKAIIHVFDKEKSP 398
>gi|157872135|ref|XP_001684616.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
gi|68127686|emb|CAJ05824.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
Length = 394
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 121/274 (44%), Gaps = 33/274 (12%)
Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ET+PFSIHN++++ + + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRVFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLCEGLSAA 255
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
+ T +R +H +D SL + F +D++
Sbjct: 256 ASVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289
>gi|367008068|ref|XP_003688763.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
gi|357527073|emb|CCE66329.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
Length = 356
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/263 (30%), Positives = 117/263 (44%), Gaps = 37/263 (14%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
TSD+GWGCM+R+ Q L+A AL G P EI+ LF D +PFSI
Sbjct: 84 FTSDIGWGCMIRTGQTLLANALQRTNKGTPCS------------EIIELFVDETKNPFSI 131
Query: 181 HNLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
HN + GK L G W P + E L C + + SGD +
Sbjct: 132 HNFITVGKDLNLVKVGEWFSPSITIQIIEKLIENNNDHGIKKC-----IVSISSGDIYEQ 186
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN-PRYIPTLRLTFTFPQS 298
+ +DD+ + +K Q ILLL + LG+ +N +Y ++ +
Sbjct: 187 ---DVLDELDDSEPPAN--TKQQH----ILLLFGIKLGINTINIEKYGQDIKDITNNKYT 237
Query: 299 LGIVGGKPGASTYIVGVQE--ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHL 356
GI GG+P +S + G + +Y DPH N D+ D STYHS + +
Sbjct: 238 CGISGGQPKSSLFFFGYNNTHDRILYFDPHKPN---NFTTDN---DYSTYHSTEFNELEM 291
Query: 357 DSIDPSLAIGFYCR-DKDDFDDF 378
++DPS+ IGF + +K D++ F
Sbjct: 292 FNLDPSMIIGFLVKNNKADWNKF 314
>gi|146093458|ref|XP_001466840.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
gi|134071204|emb|CAM69889.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
Length = 394
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)
Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLSEGPSAA 255
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
+ T +R +H +D SL + F +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289
>gi|398019156|ref|XP_003862742.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
gi|322500973|emb|CBZ36050.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
Length = 394
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)
Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSANG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLSEGPSAA 255
Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
+ T +R +H +D SL + F +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289
>gi|255711728|ref|XP_002552147.1| KLTH0B08272p [Lachancea thermotolerans]
gi|238933525|emb|CAR21709.1| KLTH0B08272p [Lachancea thermotolerans CBS 6340]
Length = 483
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 85/257 (33%), Positives = 118/257 (45%), Gaps = 34/257 (13%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
SD+GWGCM+R+ Q L+ AL RL P P +K +++ F D ++PFS+
Sbjct: 144 FCSDIGWGCMIRTGQALLGNALA--RLRSP---PEEK-------QLIGWFEDRSSAPFSL 191
Query: 181 HNLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
HN ++ G A G W GP A RS ++L + GL I SGD E
Sbjct: 192 HNFVREGNALSRKPPGEWFGPSATSRSIQSLVHA-FPQCGLNH----CIISTDSGDVYEE 246
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
G P++ + ILLL+ + LGL VN RY P ++ S+
Sbjct: 247 DVG-PIL--------------EREPQATILLLLGVKLGLNNVNSRYWPDVKHILGSSFSV 291
Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 359
GI GG+P +S Y G Q + YLDPH Q + D E S HS +H +
Sbjct: 292 GIAGGRPSSSLYFFGYQGDYLFYLDPHTSQLDLASCATDNEKYESV-HSARFNKVHFSEL 350
Query: 360 DPSLAIGFYCRDKDDFD 376
DPS+ IG + DD+D
Sbjct: 351 DPSMLIGVLIQGLDDWD 367
>gi|400593108|gb|EJP61110.1| peptidase family C54 [Beauveria bassiana ARSEF 2860]
Length = 378
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 115/315 (36%), Gaps = 118/315 (37%)
Query: 91 NNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGW 127
N +F DF SR ++YR F PI SK +SD GW
Sbjct: 109 NGWPQQFITDFDSRFWMTYRNDFKPIPRSKDPKAASSMSFPMRIKYQLGDQGGFSSDSGW 168
Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
GCM+RS Q L+A A RLGR WR+ QK E ++I+ +F D +P+SIHN + G
Sbjct: 169 GCMIRSGQSLLANATGIVRLGRDWRRGQQK---AEEIKIMRMFADDPAAPYSIHNFVDYG 225
Query: 188 KAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
+ G G W GP A +
Sbjct: 226 SSKCGKYPGEWFGPSATSQ----------------------------------------- 244
Query: 247 CIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
CI+ S + ++D + P L+L+ LG++K+ Y L PQS+GI G
Sbjct: 245 CINPDVYEDSFMATAKSDHGFFKPTLILISTRLGIDKITQVYWEALISALQMPQSVGIAG 304
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
+R +H+ +DPS+
Sbjct: 305 -----------------------------------------------LRRLHVQQMDPSM 317
Query: 364 AIGFYCRDKDDFDDF 378
IGF R ++++ ++
Sbjct: 318 LIGFIIRSEEEWKEW 332
>gi|47213810|emb|CAF92583.1| unnamed protein product [Tetraodon nigroviridis]
Length = 265
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 261 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
W +++LVP+ LG E +NP YI ++ +GI+GGKP S Y +G Q+E
Sbjct: 151 AHQSWQSVIILVPVRLGGESLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFIGFQDEQL 210
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 376
+YLDPH QPV+++ + + + ++H + + + +DPS IGFY + K DF+
Sbjct: 211 LYLDPHYCQPVVDVSQVNFSLE--SFHCNSPKKMPFSRMDPSCTIGFYAKSKKDFE 264
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 30/60 (50%), Positives = 41/60 (68%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+ F F SRI ++YRK F P+ S +T+D GWGCMLRS QML+AQ LL H + R +++
Sbjct: 74 VERFRLAFVSRIWLTYRKDFPPLEGSTLTTDCGWGCMLRSGQMLLAQGLLVHLMHRVYKE 133
>gi|148693227|gb|EDL25174.1| autophagy-related 4D (yeast), isoform CRA_c [Mus musculus]
Length = 257
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 80/176 (45%), Gaps = 44/176 (25%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP 249
>gi|431896953|gb|ELK06217.1| Cysteine protease ATG4C [Pteropus alecto]
Length = 378
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 134/386 (34%), Gaps = 130/386 (33%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
AGN + EF +DF SRI ++YR+ F I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45 AGN--VEEFRKDFISRIWLTYREEFPSIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102
Query: 149 RPWRKP----------------------------------LQKPF------------DRE 162
R W P L+ P D E
Sbjct: 103 RAWTWPDALNIDNSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGRYSDDHE 162
Query: 163 YV-EILH-----LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
EI H FGDS + F +H L++ GK G AG W GP + R
Sbjct: 163 MQNEIYHRKIISWFGDSPLALFGLHQLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
G + IYV V D + C+ + D +++LVP+ L
Sbjct: 223 PELQG-----ITIYVAQ--------DCTVYSSDVIDKQCASMAPDITDDKAVIILVPVRL 269
Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
G E+ N Y+ ++ TF P + K +DP
Sbjct: 270 GGERTNIDYLEFVK-TFHCPSPKKMSFRK-----------------MDP----------- 300
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE--ESNGAPL 394
S IGFYCR+ DF +K+ + PL
Sbjct: 301 -------------------------SCTIGFYCRNVQDFKRASEEITKMLKVFSKEKYPL 335
Query: 395 FTVTQTHKK-------PVNHSDVLGE 413
FT H + N D+ E
Sbjct: 336 FTFVNGHSRDYDFTSTTTNEEDLFSE 361
>gi|118349810|ref|XP_001008186.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89289953|gb|EAR87941.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 343
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 72/293 (24%), Positives = 125/293 (42%), Gaps = 37/293 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
I SYR GF + I SD GWGCMLRS QM+ A LL H P +Q + +
Sbjct: 27 IYFSYRSGFSHQFQNHIFSDSGWGCMLRSGQMIFANGLLRHLKENP---QIQNQLKIQNI 83
Query: 165 E-----ILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 218
I+ F +++ PFSI + A + + L G W P + S + L + +
Sbjct: 84 NDILLFIIKFFIENKDQPFSIQQIAAVALEEFKLEMGFWYSPNRIAYSLKKLLNNFQTFS 143
Query: 219 GLGCQS------LPMAIYVVSGDEDGERGGAPV------VCIDDASRHCSVFSKGQADWT 266
+ S P+ G++ + + + I++ + + + +
Sbjct: 144 EMNIVSEVMYSDRPLYFSQCVTAMTGQKIDSTLPKQLLQILINNIEKQIKIMKQNSNKYQ 203
Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
+++GL+ +Y+ L FT S+G ++G+ + YLDPH
Sbjct: 204 INKQNYKILIGLDYPEEKYLDILIKLFTHRLSIG-----------MIGLNNDKLTYLDPH 252
Query: 327 DVQPV-INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
VQ IN E + TY + ++ I+ ++ PS+ +GFY +D +D ++F
Sbjct: 253 IVQHADINTN----EINLKTYFQEEVKQINKHALGPSVGLGFYLKDLNDLNEF 301
>gi|384493397|gb|EIE83888.1| hypothetical protein RO3G_08593 [Rhizopus delemar RA 99-880]
Length = 194
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 78/158 (49%), Gaps = 27/158 (17%)
Query: 70 IWLLGVCHKI--------AQDEALGDAAGNNGLA----------------EFNQDFSSRI 105
IWLLG + I A EA D N G + +F DF+SR+
Sbjct: 29 IWLLGCSYIIKPTDHIQQALLEAQRDLMFNKGSSENEEENNQNMHMLWPPDFYDDFTSRL 88
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-KPFDREYV 164
++YR + PI S +D+GWGC LRS Q L+A L+ H LGR WR+ Q + ++Y
Sbjct: 89 WMTYRHNYPPIRPSSHKTDIGWGCTLRSGQSLLANTLIIHFLGRDWRRQTQNQAAWKQYS 148
Query: 165 EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 200
I+H F D S +PFSIH + GK G G W GP
Sbjct: 149 RIVHWFLDELSPRAPFSIHRIALLGKQLGKNIGEWFGP 186
>gi|312378951|gb|EFR25375.1| hypothetical protein AND_09326 [Anopheles darlingi]
Length = 350
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 70/130 (53%), Gaps = 3/130 (2%)
Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG-- 335
L +VNP YI L+ F P S G++GG+P + Y +G E A+YLDPH VQ V IG
Sbjct: 180 LNEVNPIYIEGLKKCFQLPGSCGMIGGRPNQALYFIGYVGEEALYLDPHTVQRVGCIGEK 239
Query: 336 KDDLEADT-STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 394
++ +E + +T+H I S+DPSLA+ F C + FD A + PL
Sbjct: 240 QESVEQEQDATFHQRHASRIAFASMDPSLAVCFLCCSRAQFDQLVAHFKERLNGGGSQPL 299
Query: 395 FTVTQTHKKP 404
F VT+T + P
Sbjct: 300 FEVTKTRQAP 309
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 67/103 (65%), Gaps = 2/103 (1%)
Query: 99 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
QD SR+ +YR+GF PIG++++T+D GWGCMLR QM++A+AL LGR W+ ++
Sbjct: 72 QDVQSRLWCTYRRGFVPIGNTQLTTDKGWGCMLRCGQMVLAEALTELHLGRDWQWS-EET 130
Query: 159 FDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGP 200
D Y++I++ F D++ +PFS+H + L + G W GP
Sbjct: 131 RDATYLKIVNRFEDNKQAPFSLHQIALMGDSSEEKRIGEWFGP 173
>gi|123497568|ref|XP_001327207.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121910133|gb|EAY14984.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 296
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 119/276 (43%), Gaps = 54/276 (19%)
Query: 107 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY--- 163
+YR F I ITSD GWGC RS+Q L+A L + P D EY
Sbjct: 30 FTYRCNFQAIQPGNITSDSGWGCCYRSAQGLIASYFLNY-----------APVDAEYFFT 78
Query: 164 ----VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
+ + LF D PFSI NL+ + +G+ G+W P + + E++ +
Sbjct: 79 VFNEIPMFSLFEDRVEMPFSIQNLVYRSELFGVKPGTWAKPSQLAATIESIFK------- 131
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
L +++ ++S D + ++ D + +LG++
Sbjct: 132 ----DLKLSV-LISKDSN-------IIPEDVKTMRAPFLLLIPI-----------LLGMK 168
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE-ESAIYLDPHDVQPVINIGKDD 338
V ++IP ++ TF P+ LG V G S ++VG+ E ++ +Y DPH + +
Sbjct: 169 DVEQKFIPFIKYTFQRPEFLGAVSGSSDFSYFLVGLSEDQNVVYFDPHVTKQAVASS--- 225
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
D S + R I + S++PS +GF+C ++
Sbjct: 226 --FDHSEFFEVPPRGIKMKSLNPSFLLGFFCSSTEN 259
>gi|395750455|ref|XP_002828707.2| PREDICTED: cysteine protease ATG4D [Pongo abelii]
Length = 296
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 110/248 (44%), Gaps = 44/248 (17%)
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
+R + +I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 51 ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILRKAVE 103
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
+ + +YV S+ C+V L + L +
Sbjct: 104 SCSEVTRLVVYV--------------------SQDCTV-----------LHMRSLAIDPS 132
Query: 280 KVNPRYIPT-LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
K +P+ L+ LGI+GGKP S Y +G Q++ +YLDPH QP +++ + +
Sbjct: 133 KDRSTCLPSSLQELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQAN 192
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLF 395
+ ++H R + +DPS +GFY D+ +F+ C+ +++ S+ P+F
Sbjct: 193 FPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMF 250
Query: 396 TVTQTHKK 403
T+ + H +
Sbjct: 251 TLAEGHAQ 258
>gi|195350257|ref|XP_002041657.1| GM16788 [Drosophila sechellia]
gi|194123430|gb|EDW45473.1| GM16788 [Drosophila sechellia]
Length = 269
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/229 (27%), Positives = 105/229 (45%), Gaps = 24/229 (10%)
Query: 175 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
S +SIH + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 1 NSFYSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD 52
Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
V +DD C + W P+LL++PL LG+ +NP Y+P L+
Sbjct: 53 ---------STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLE 99
Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVI 351
S G++GG+P + Y +G ++ +YLDPH Q + + A+ TYH
Sbjct: 100 LDSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHA 159
Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
++ ++DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 160 ARLNFSAMDPSLAVCFLCKTSDSFESLLTQFKEEVLSLCSPALFEISQT 208
>gi|448509127|ref|XP_003866066.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
gi|380350404|emb|CCG20626.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
Length = 419
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 117/260 (45%), Gaps = 39/260 (15%)
Query: 110 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 169
R FD + TSD GWGCM+R+SQ L+A AL K + +EIL L
Sbjct: 130 RSLFD---NENFTSDAGWGCMIRTSQNLLANAL---------LKLAGEANGNVQLEILKL 177
Query: 170 FGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
F D + FSIHN ++ A L+ G W GP A S L + Q P
Sbjct: 178 FQDDPNAAFSIHNFIRVASASPLSVKPGQWFGPNAASISIRQLT------IEMTDQESPT 231
Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
+ V E+ + DD + K P+LLL P+ LG++ VN Y
Sbjct: 232 VVPFVYISENAD-------LYDDEIEETFLKEK-----RPLLLLFPVRLGIDHVNKYYYK 279
Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPHDVQPVINIGKDDLEADTSTY 346
++ S+GI GGKP +S Y +G + +E+ IY DPH Q + + ++Y
Sbjct: 280 SILQLLASRFSVGIAGGKPSSSFYFIGYENDENLIYFDPHLPQVF------ESPINLASY 333
Query: 347 HSDVIRHIHLDSIDPSLAIG 366
H+ + ++ +DPS+ IG
Sbjct: 334 HTLNYNKLSIEMLDPSMMIG 353
>gi|323331874|gb|EGA73286.1| Atg4p [Saccharomyces cerevisiae AWRI796]
Length = 347
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 109/247 (44%), Gaps = 28/247 (11%)
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
M+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + AG
Sbjct: 1 MIRTGQSLLGNALQILHLGRDFRVNGNESLERES-KFVNWFNDTPEAPFSLHNFVSAGTE 59
Query: 190 YG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
G W GP A RS ++L G + I VS + E V
Sbjct: 60 LSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKVFAE 113
Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+P +
Sbjct: 114 NPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGRPSS 159
Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ IG
Sbjct: 160 SLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLIGIL 213
Query: 369 CRDKDDF 375
+ + D+
Sbjct: 214 IKGEKDW 220
>gi|291059129|gb|ADD71908.1| autophagy protein 4 [Acanthamoeba castellanii]
Length = 373
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 68/117 (58%), Gaps = 4/117 (3%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
F DF SR+ ++YR F IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR +
Sbjct: 147 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 206
Query: 157 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEAL 210
+ Y E+L F D S SP+SIH + + G + + G W P + + L
Sbjct: 207 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLL 262
>gi|146161894|ref|XP_001008187.2| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|146146576|gb|EAR87942.2| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 516
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 98/400 (24%), Positives = 160/400 (40%), Gaps = 78/400 (19%)
Query: 99 QDFSSRILISYRKGFDPIGD------------SKITSDVGWGCMLRSSQMLVAQALLFHR 146
++F + I I+YRK F + + S+ SD GWGCM+R QM A+ L H
Sbjct: 71 ENFYNIIWITYRKNFPALLNMIDKANLKNQKMSEYISDTGWGCMVRVGQMAFAEGLRRHL 130
Query: 147 LGRPWRKPLQKPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSWVGPY 201
+ +K + K + V I D + +P+SI + + A + L G W P
Sbjct: 131 VEN--KKLVVKKKEDLRVIIEGFLDDDQKCIDFAPYSIQKISKIALSDFNLLPGEWYTPI 188
Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGD-------EDGERGGAPVVCID 249
+C L ++A G + L +A++ +V D D +RG +C +
Sbjct: 189 RICYILGLLHNERKAIKG--TEDLKVAVFSSSRPIVFQDFLERMCKVDPQRGKHAQICPN 246
Query: 250 -------------DASRHCSVFSKGQ---------ADWTPILLLV-PL------------ 274
D H + + Q ++ TP L LV P+
Sbjct: 247 QCRIIKQDQKSKVDHDHHKDIKLEKQNSNSEILVVSEETPKLRLVCPIHHELQYSMIVYI 306
Query: 275 --VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
++GL+ P Y+ + F SLG++GGKP + Y VG E+ IYLDPH VQ
Sbjct: 307 VCLIGLDTPQPEYLELAKKMMDFKYSLGLIGGKPKKALYFVGRIEDEFIYLDPHYVQEFS 366
Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
N + TY + +ID S ++ +Y +D + ++F L + N
Sbjct: 367 NEKNFQSSSQLETYFCKKFQTYPSKNIDSSFSLMYYLKDLEQLEEFYQFMMGLKRDYNEH 426
Query: 393 PLFTVTQTHKKPVNHSDVLG---ETGGVPEDDSLGVMSMN 429
+ T S LG E+ + D +L +++ N
Sbjct: 427 FFMMMEDTEP-----SFCLGDGKESSNLISDKNLNILADN 461
>gi|149020503|gb|EDL78308.1| rCG31864, isoform CRA_a [Rattus norvegicus]
Length = 256
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 81/176 (46%), Gaps = 45/176 (25%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S S + L G C+ E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISSVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192
Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP 248
>gi|298712912|emb|CBJ33424.1| Autophagy-related protein 4 [Ectocarpus siliculosus]
Length = 546
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 51/135 (37%), Positives = 75/135 (55%), Gaps = 6/135 (4%)
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
++LLVPL LGL++++ YIP+L T PQSLG +GG+P + + +G Q + LDPH
Sbjct: 380 VVLLVPLRLGLDELSTGYIPSLLETLRVPQSLGFLGGRPNHAIFFIGAQGNTLTGLDPHT 439
Query: 328 VQPVINIGKD-DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
QP ++G+ E + H + + IDPSLA+ FY D+ F+D R
Sbjct: 440 TQPAADMGEGFPSERYVHSLHCQSAVSMDVHRIDPSLALAFYLPDRATFEDLIKRIG--- 496
Query: 387 EESNGAPLFTVTQTH 401
E+N P F+V QT
Sbjct: 497 -ETN-PPPFSVEQTR 509
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 70/138 (50%), Gaps = 23/138 (16%)
Query: 71 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
W++G+ + ++E E D S + I+YR GF + T D GWGCM
Sbjct: 38 WIMGIPYTELREE------------ERRLDVFSTMWITYRSGFPKMEPYGYTDDSGWGCM 85
Query: 131 LRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGD--SETSPFSIHN 182
LRS+QML+ QAL H LGR WR P L+ P EY ++ LF D E + FSIHN
Sbjct: 86 LRSAQMLMTQALQRHTLGRSWRVPRTLEERLRVP---EYRTLVRLFADHPGEANLFSIHN 142
Query: 183 LLQAGKAYGLAAGSWVGP 200
+ Q G Y G W GP
Sbjct: 143 MCQVGIRYDKLPGEWYGP 160
>gi|72389991|ref|XP_845290.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359288|gb|AAX79730.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei]
gi|70801825|gb|AAZ11731.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
Length = 327
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 136/292 (46%), Gaps = 38/292 (13%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+S L +YR+ FDP+ S +TSD GWGC+ R++QML+A +L R+ +
Sbjct: 41 NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 219
+Y L D + +PFS+H +++ + L G + P +A + EA++ C + T
Sbjct: 92 QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144
Query: 220 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
G S P+++ + V+G E V C SR+ +L+L PL G
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187
Query: 279 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGK 336
+ ++ + +L P+S+G+VGG P YI+G +E +YLDPH +
Sbjct: 188 SRYMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCKTQDALLSS 247
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
+ E S +R + +D S +GF+ + ++ R L+++
Sbjct: 248 EPGETGVVKPTSSNLRSVPYGQVDTSFFLGFFVDSQSRWESLQKRIEGLSKQ 299
>gi|389602150|ref|XP_001566661.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322505338|emb|CAM40177.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 398
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 116/273 (42%), Gaps = 31/273 (11%)
Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
++ +YR GF+ P I +D GWGC+LR+SQML+A L + GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWAY--GRPADRRLALFFDH- 102
Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
+ET+PFSIHNL+++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNLIRSVWNQRAFKAEYWSPSQGC---EAIKRTM--QDAIKT 148
Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
+ L + VV+ C+ H F +G A+ +L V + +
Sbjct: 149 EQLQTRVTVVTSTNG---------CVYADEVH-HTFKQG-AEVVLVLASVRVSAAAQLTQ 197
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
Y+ +L PQ LGIVGG PG S Y + YLDPH +
Sbjct: 198 ESYLQIEKL-MEQPQCLGIVGGVPGRSYYFFAHNQTQLFYLDPHQRTTAALLSDGPSATV 256
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
+ T +R +H +D SL + F +D++
Sbjct: 257 SVTPSVSDVRCVHWSRVDTSLFLAFAVTTRDEW 289
>gi|261328682|emb|CBH11660.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
gambiense DAL972]
Length = 327
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 136/292 (46%), Gaps = 38/292 (13%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+S L +YR+ FDP+ S +TSD GWGC+ R++QML+A +L R+ +
Sbjct: 41 NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 219
+Y L D + +PFS+H +++ + L G + P +A + EA++ C + T
Sbjct: 92 QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144
Query: 220 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
G S P+++ + V+G E V C SR+ +L+L PL G
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187
Query: 279 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGK 336
+ ++ + +L P+S+G+VGG P YI+G +E +YLDPH +
Sbjct: 188 SRCMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCKTQDALLSG 247
Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
+ E S +R + +D S +GF+ + ++ R L+++
Sbjct: 248 EPGETGVVKPTSSNLRSVPYGQVDTSFFLGFFVDSQSRWESLQKRIEGLSKQ 299
>gi|340054025|emb|CCC48319.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma vivax Y486]
Length = 326
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 118/280 (42%), Gaps = 35/280 (12%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+S L++YR F+P+ S +TSD GWGC+ R+SQML+A L H
Sbjct: 41 TSFYLLTYRMNFEPLPCSTLTSDRGWGCLARASQMLLAHVLRRHAASEC----------- 89
Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETG 219
+++ D +PFS+H + +A +G A W P C EA+ C +
Sbjct: 90 -HLKFFCDMNDEHLAPFSLHCMTRAVIKHGTEFRADYW-APSQGC---EAIRSCVESAVR 144
Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL- 278
G + +++ V S ER + + D + +L+LVP+ G
Sbjct: 145 QGLLTQKLSVVVSSSGTIPER---------------EIHEHLRGDGS-VLVLVPVRCGTS 188
Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
++ L P +G+VGG P YIVG +YLDPH + + +
Sbjct: 189 RRMTQTMFFALEHLLHIPSCMGVVGGVPNRGYYIVGTSGHRLLYLDPHCMTQNAMVSCEL 248
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
+ T ++++R + D +D S GF D+++
Sbjct: 249 GKVGIVTPTTNLLRSVRWDHVDTSFFFGFLLDSLDEYEKL 288
>gi|428184439|gb|EKX53294.1| hypothetical protein GUITHDRAFT_133035 [Guillardia theta CCMP2712]
Length = 567
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 80/146 (54%), Gaps = 10/146 (6%)
Query: 254 HCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
+CS ++ + W P++++VP+ LG + L QSLG +GG+P S Y
Sbjct: 406 NCSRMAQAREPCSWRPLIVVVPVRLGARSEDQH----LSRIDKHLQSLGFIGGRPRHSYY 461
Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
VGV+ +A YLDPH QP +I K+ + +++H + L IDPSLA+GFYC D
Sbjct: 462 FVGVRGYNAYYLDPHITQPYQSIRKN---INVASFHCAHPGKMSLAHIDPSLALGFYCDD 518
Query: 372 KDDFDDFCARASKLAEESNGAPLFTV 397
K DF+D R +LA + P+ +V
Sbjct: 519 KSDFEDLIRRVEELA-AGDSHPILSV 543
>gi|345311182|ref|XP_001519565.2| PREDICTED: cysteine protease ATG4D-like, partial [Ornithorhynchus
anatinus]
Length = 147
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 32/136 (23%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----R 152
F +DF SR+ ++YR+ F P+ S TSD GWGCMLRS QML+AQ L+ H L R W
Sbjct: 5 FQRDFVSRLWLTYRRDFPPLEGSAWTSDCGWGCMLRSGQMLLAQGLVVHLLSRDWIWAEA 64
Query: 153 KPLQKP----------------------------FDREYVEILHLFGDSETSPFSIHNLL 184
P KP +R++ I+ F D +PFS+H L+
Sbjct: 65 GPAPKPGEHRLLKSDPGGPSRSPAPPPPAGVLQEQERQHRRIVSWFADHPQAPFSLHRLV 124
Query: 185 QAGKAYGLAAGSWVGP 200
+ G+ G AG W GP
Sbjct: 125 RLGQGSGKRAGDWYGP 140
>gi|119623097|gb|EAX02692.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_d
[Homo sapiens]
Length = 172
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 68/125 (54%), Gaps = 11/125 (8%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + + +
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKMCRV 141
Query: 190 YGLAA 194
L+A
Sbjct: 142 LPLSA 146
>gi|343472883|emb|CCD15086.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 327
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 83/290 (28%), Positives = 131/290 (45%), Gaps = 42/290 (14%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L +YRK F+P+ S IT+D GWGC+ R+SQML+A AL R+ + F +Y
Sbjct: 45 LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMTLDFSFQYFC 95
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
+ D +PFS+H ++++ G L W P C EA++ C R+ G
Sbjct: 96 DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRSAIHRGAL 148
Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 282
+ + V G A + + +RH G A L+LVP+ G ++
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGKDDLEA 341
+ +L P +G+VGG PG YIVG +E +YLDPH + + E+
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIVGTGGQELLLYLDPHCMTQEALVS---CES 249
Query: 342 DTSTYHSDVIRH---IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
DT+ RH + D +D S IGF+ + ++D + L+ +
Sbjct: 250 DTAGVVRPTPRHLLCVPYDRVDTSFFIGFFVDSFELWEDLQKKIEGLSRQ 299
>gi|167381603|ref|XP_001735783.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165902089|gb|EDR28003.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 359
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 86/361 (23%), Positives = 148/361 (40%), Gaps = 54/361 (14%)
Query: 52 HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGDAAGNNGL----AEFNQDFSSRIL 106
HE + P G S ++LGV K Q D+ L + L A S+
Sbjct: 25 HEDIQKPIFIGGCS----FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAPAFFRISNLFW 80
Query: 107 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYV 164
++YR G++ + +S +T+DVGWGC +R+ QM++A A+ + + + P E +
Sbjct: 81 MTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIPTKEEIM 140
Query: 165 EILHLFGDS--ETSPFSIHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGL 220
+L F DS T+P SIH++ ++ + +++ P + +++ L +
Sbjct: 141 NVLVPFIDSPNSTTPLSIHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLVNSWKL---- 196
Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
P+ C+ ++ + + P L+ +P+VL
Sbjct: 197 ----------------------CPIRCVMCSNVSIPTHELSKLPFKPTLVFLPIVL---- 230
Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
N L+ + GIVGG + ++ G +YLDPH VQP K E
Sbjct: 231 -NHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGFHALQFLYLDPHIVQPSF---KSFTE 286
Query: 341 ADTSTYHSDVIRHIHLDSIDPS-----LAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 395
DT +Y + +IDP+ GF ++ + DDF A ++ E SN L
Sbjct: 287 IDTKSYSPISTNRFSVHTIDPTKLDDFCTFGFLIKNFHEIDDFMKFAKEVFEISNDKELR 346
Query: 396 T 396
T
Sbjct: 347 T 347
>gi|256078123|ref|XP_002575347.1| autophagin-1 (C54 family) [Schistosoma mansoni]
gi|360045353|emb|CCD82901.1| autophagin-1 (C54 family) [Schistosoma mansoni]
Length = 556
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 69/122 (56%), Gaps = 4/122 (3%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 153
E + +SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR W+
Sbjct: 37 EIARHLNSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRFHLGRSWKWS 96
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P Q+ EY +L +F D ++ +SI + G + G + GSW GP + + + L+
Sbjct: 97 PEQE--SPEYYRLLQMFQDRRSALYSIQTITLTGVSLGKSIGSWFGPNTVAQVLKKLSVY 154
Query: 214 QR 215
R
Sbjct: 155 DR 156
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 63/121 (52%), Gaps = 15/121 (12%)
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TSTYHSDVI 351
F P +GI+GG P + +IVGV ++ I LDPH QP G+ +L+ D TYH D
Sbjct: 351 FRLPHCVGILGGSPCHAVWIVGVTDDDVICLDPHTTQPA---GRGNLKPDYDQTYHCDNP 407
Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE------SNGAPLFTVTQTHKKPV 405
I L +DPS+ +GF C + +FDD C L EE +N PL + T +P
Sbjct: 408 IRIPLKRLDPSMVLGFLCSTEKEFDDLC---HNLKEEVLHPSVANSWPLVEIHTT--RPS 462
Query: 406 N 406
N
Sbjct: 463 N 463
>gi|403345460|gb|EJY72096.1| Cysteine protease family C54 putative [Oxytricha trifallax]
Length = 823
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 67/113 (59%), Gaps = 3/113 (2%)
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
IL+++P LGL KVN Y +++ F ++GI+GG+P + Y VG Q+ I LDPH
Sbjct: 611 ILVIIPTRLGLNKVNKEYYSSIKYVFQCRLNVGIMGGRPNQALYFVGTQKTDLICLDPHL 670
Query: 328 VQPVINIGKDDLEAD--TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
VQ + + +++L TYH D + + + +D SLA GFY +D +DF+ F
Sbjct: 671 VQDTV-LNQEELSNVELNQTYHCDQAKKLSMTKLDTSLAFGFYLKDYNDFEVF 722
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 10/99 (10%)
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLG-----RPWRKPLQKPFDREYVEILHLFGD---S 173
T+DVGWGC +R QM++ QAL+ H +G + QK + Y +I+ L D S
Sbjct: 394 TTDVGWGCTIRVGQMMICQALMRHLIGLDHSVKNLSSTEQKRLN--YAKIIQLIHDNDCS 451
Query: 174 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
+T FSI N+ + G + G W GP+A+ L R
Sbjct: 452 QTGAFSIQNIAKMGFCHDKLPGEWYGPHALTIMLRDLNR 490
>gi|183230042|ref|XP_653798.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169803042|gb|EAL48412.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449708555|gb|EMD47997.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 359
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 144/344 (41%), Gaps = 52/344 (15%)
Query: 70 IWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRILISYRKGFDPIGDSKITS 123
++LGV K Q D+ L + L A F + S+ ++YR G++ + +S +T+
Sbjct: 39 FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAATFFR-ISNLFWMTYRSGYEKLPNSSLTT 97
Query: 124 DVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVEILHLFGDS--ETSPFS 179
DVGWGC +R+ QM++A A+ + + + P +E + +L F DS T+P S
Sbjct: 98 DVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIPTKQEVMNVLIPFIDSPNSTTPLS 157
Query: 180 IHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
IH++ ++ + +++ P + +++ L +
Sbjct: 158 IHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLVNSWKL--------------------- 196
Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
P+ C+ ++ + + P L+ +P+VL N L+ +
Sbjct: 197 -----CPIRCVMCSNVSIPTHELSKLPFKPTLVFLPIVL-----NHLIHSKLQQIYKSKL 246
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
GIVGG + ++ G +YLDPH VQP K E DT +Y +
Sbjct: 247 FAGIVGGMGDRAIFVFGFHALQFLYLDPHIVQPSF---KSFTEIDTKSYSPIGTNRFSVH 303
Query: 358 SIDPS-----LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 396
+IDP+ GF ++ + DDF A + E SN L T
Sbjct: 304 TIDPTKLDDFCTFGFLIKNLHEVDDFMKLAKDVFEISNDKELRT 347
>gi|221046296|dbj|BAH14825.1| unnamed protein product [Homo sapiens]
Length = 280
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 81/176 (46%), Gaps = 44/176 (25%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 107 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 156
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 157 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 216
Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 217 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP 272
>gi|50291183|ref|XP_448024.1| hypothetical protein [Candida glabrata CBS 138]
gi|62899752|sp|Q6FP20.1|ATG4_CANGA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49527335|emb|CAG60975.1| unnamed protein product [Candida glabrata]
Length = 483
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/265 (29%), Positives = 122/265 (46%), Gaps = 39/265 (14%)
Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIH 181
+DVGWGCM+R+ Q L+ AL R+ + +P D + EI LF D+ S FS+
Sbjct: 135 TDVGWGCMIRTGQSLLGNAL--QRVKSTVKDQPYIYEMD-DTKEITDLFKDNTKSAFSLQ 191
Query: 182 NLLQAGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
N ++ G+ Y +A G W GP L + C I V SGD E
Sbjct: 192 NFVKCGRIYNKIAPGEWFGPATTATCIRYLIQENPCYGIEAC-----YISVSSGDIFKEN 246
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTP---ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
+G D P IL+L+ + LGL+ V+ RY ++ P
Sbjct: 247 ------------------IQGMIDRYPNGNILILLGIKLGLDSVHERYWGEIKTMLESPF 288
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
S+GI GG+P +S Y G +++ ++ DPH+ Q + DD + + H++ ++
Sbjct: 289 SVGIAGGRPSSSLYFFGYFDDTLLFFDPHNSQTAL---IDDFD---ESCHTENFGKLNFS 342
Query: 358 SIDPSLAIGFY--CRDKDDFDDFCA 380
+DPS+ +GF C D+F +F +
Sbjct: 343 DLDPSMLLGFLLPCSKWDEFQEFTS 367
>gi|76156435|gb|AAX27646.2| SJCHGC05841 protein [Schistosoma japonicum]
Length = 414
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)
Query: 96 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 153
E SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR WR
Sbjct: 43 EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
P Q+ EY +L +F D + +SI + G + G + GSW GP + + + L+
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160
Query: 214 QR 215
R
Sbjct: 161 DR 162
>gi|351695136|gb|EHA98054.1| Cysteine protease ATG4A [Heterocephalus glaber]
Length = 356
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 67/257 (26%), Positives = 102/257 (39%), Gaps = 87/257 (33%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 79 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 127
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
MLR QM++AQAL+ LGR Q G
Sbjct: 128 MLRCGQMMLAQALICRHLGRA----------------------------------QMGVG 153
Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 154 EGKSVGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 196
Query: 250 DASRHCSV--FSKGQAD----------------------WTPILLLVPLVLGLEKVNPRY 285
D + C + FS AD W P+LL+VPL LG+ ++NP Y
Sbjct: 197 DIKKMCRILPFSADTADESPPDSFITSNQSKGTSAFCPAWKPLLLIVPLRLGINQINPVY 256
Query: 286 IPTLRLTFTFPQSLGIV 302
+ + TF + G V
Sbjct: 257 VDAFK-TFVDTEENGTV 272
>gi|401427503|ref|XP_003878235.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322494482|emb|CBZ29784.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 388
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 73/295 (24%), Positives = 123/295 (41%), Gaps = 44/295 (14%)
Query: 92 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
+G EF + + ++L SYR F P+ + + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAAAKKLLYFSYRNCFPPLPN-RSTTDTRWGCLVRTTQMLVGSCLLRYHCKGA 112
Query: 151 WRKPLQKPFDREYVE----ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
+ P +R+ E I LF D ++P IH + + S + P
Sbjct: 113 YVLP-----ERDNAELKERISRLFMDVPSAPLGIHKVEDEAHKNSVKYASMLSP------ 161
Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHCSVFSKGQADW 265
E G+ + +A + GD AP C ++ + S ++
Sbjct: 162 ---------TEAGMAIAAALIAFHAQGGD-------APFTFCCENRNIDESAVMAKLSEG 205
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
++L++P+VLG+ ++ +Y L GI GG AS Y+ G Q + ++DP
Sbjct: 206 QHVILIIPVVLGIAPMSGQYERMLLKILDMKACCGIAGGFKQASLYMFGHQGRNVFFMDP 265
Query: 326 HDVQPVINIGKD--DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
H VQ G+ LE + DP + +GFY D+ +F
Sbjct: 266 HYVQRAYTSGRTVGTLEGARG--------DLAARRFDPCMVLGFYLHTPADYCEF 312
>gi|402581511|gb|EJW75459.1| peptidase family C54 containing protein [Wuchereria bancrofti]
Length = 256
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 64/122 (52%), Gaps = 12/122 (9%)
Query: 85 LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
LG+ + G +A + +S + +YRK F PIG + T+D GWGCMLR QML+A+ L+
Sbjct: 30 LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 89
Query: 144 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
LG W +DR EY IL +F D + FSIH + G + G G W
Sbjct: 90 VRHLGHNWL------WDRDVKLTEYKRILRMFQDKKNCLFSIHQIANMGVSEGKEIGEWF 143
Query: 199 GP 200
GP
Sbjct: 144 GP 145
>gi|432110194|gb|ELK33968.1| Cysteine protease ATG4A, partial [Myotis davidii]
Length = 256
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 11/114 (9%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH +
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQM 129
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 51/91 (56%), Gaps = 1/91 (1%)
Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
Y + + I+LDPH Q ++ +D D + + + +++ ++DPS+A+GF+C+
Sbjct: 124 YSIHQMGDELIFLDPHTTQTFVDTEEDGTVDDQTFHCLQSPQRMNILNLDPSVALGFFCK 183
Query: 371 DKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
++ DFD++C+ K + N +F + Q H
Sbjct: 184 EEKDFDNWCSLVQKEILKEN-LRMFELVQKH 213
>gi|342181415|emb|CCC90894.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma congolense
IL3000]
Length = 327
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 129/290 (44%), Gaps = 42/290 (14%)
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
L +YRK F+P+ S IT+D GWGC+ R+SQML+A AL R+ + F +Y
Sbjct: 45 LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMALDFSFQYFC 95
Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
+ D +PFS+H ++++ G L W P C EA++ C R G
Sbjct: 96 DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRRAIHRGAL 148
Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 282
+ + V G A + + +RH G A L+LVP+ G ++
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGKDDLEA 341
+ +L P +G+VGG PG YI+G +E +YLDPH + + E+
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIIGTGGQELLLYLDPHCMTQEALVS---CES 249
Query: 342 DTSTYHSDVIRH---IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
DT RH + D +D S +GF+ + ++D + L+ +
Sbjct: 250 DTVGVVRPTPRHLLCVPYDRVDTSFFLGFFVDSFELWEDLQKKIEGLSRQ 299
>gi|118378678|ref|XP_001022513.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89304280|gb|EAS02268.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 649
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 85/358 (23%), Positives = 145/358 (40%), Gaps = 36/358 (10%)
Query: 105 ILISYRKGFDPIGD-----SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
I SYR F I D +++D GWGCM+R SQML+A+AL H L + Q
Sbjct: 145 IWFSYRNNFPLIRDVADDNQSVSNDYGWGCMIRCSQMLLAEALKRHYLNDQNIQIEQLSQ 204
Query: 160 DRE---YVEILHLFGD--SETSPFS------------IHNLLQAGKAYGLAAGSWVGPYA 202
D E Y I+ LF D SE+ + + N Y L + A
Sbjct: 205 DDEKHFYSNIIKLFLDCTSESDVLNQPGSYQDIQSKMLLNEQNLNNIYSLFGIQNICQSA 264
Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---RHCSVFS 259
+ R ++ + T + + I S + + G ++ D + S
Sbjct: 265 ILRQYQQ--NVKNWYTSIQVSVILQEILEESQSKLNSKLGFHILNFTDQIIFLKELEEAS 322
Query: 260 KGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
+ Q D IL++V L G+ K ++ +G + G YI+G QE+
Sbjct: 323 RKQNDRLNNILVMVHLKFGINKFEMQHKDYFIELLKIKNFVGALSGTETKGMYIIGFQED 382
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR--DKDDFD 376
I LDPH +Q G+ L+ D TY + R I L+ + +++G++ + ++ +
Sbjct: 383 RLIVLDPHFIQKSTE-GEQGLDKDYCTYFNKTPRSISLECLSSDISLGYFIQVNEEQSIN 441
Query: 377 DFCARASKLAEESNGAPLFTV----TQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 430
F + L E+ + PL ++ +T + + + E DS+ +S N+
Sbjct: 442 QFIDQILTLNEK-HKEPLLSILNDRIETDEMEIEEHQINKEVKDQENQDSVNNISQNE 498
>gi|407852207|gb|EKG05835.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
Clan CA, family C54, putative [Trypanosoma cruzi]
Length = 328
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 121/285 (42%), Gaps = 43/285 (15%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 205
WR + ++ D+E ++PFS+H +++A KA W
Sbjct: 82 --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWTPSQ---- 130
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 263
GC+++ + + +R P + + S+ C + +
Sbjct: 131 ---------------GCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170
Query: 264 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
++ +L+L P+ G ++ +L +G+VGG P S YI+G + +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMKFFSLEHLLHSSTCIGVVGGVPQRSYYILGTSGQRLLY 230
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
LDPH + + +A T + +++ + D +D S +GF
Sbjct: 231 LDPHCMTQEALVSSHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|66359342|ref|XP_626849.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
gi|46228139|gb|EAK89038.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
Length = 348
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 94/347 (27%), Positives = 146/347 (42%), Gaps = 68/347 (19%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSK------------ITSDVGWGCMLRSSQMLVAQALLF 144
F ++F IL +YR F I ++ I SDVGWGCM R +QM +A +
Sbjct: 44 FLKEFHDIILFTYRNEFKNIIITRNTVQLTKNYSKNINSDVGWGCMYRVTQMSIAHGIC- 102
Query: 145 HRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAGSWVGPYAM 203
+ K + E +IL+ F D+E++ FSIHN++ G + +G+ SW+GP
Sbjct: 103 -----QFMKRFLGNLNIE--KILNNFQDNESAKFSIHNMVNIGLSEFGIDPTSWIGPTTS 155
Query: 204 CRSWEALARCQRAETGLGCQSLPMA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 262
L R+ ++ +A I V G + D A +H FS+
Sbjct: 156 SMIANKLINDNRSIIS----NIQIASITYVEG----------TIYRDQAVKH---FSEVG 198
Query: 263 ADWTPILLLVPLVLGLEKVNPR-YIPTLRLTFTFPQSLGIVGGKPGAS--TYIVGVQEES 319
+D + L + LG K N Y T+ Q + I+GG +S IV
Sbjct: 199 SDSCTFVWLC-MKLGTSKFNINSYKKTVISMSNVSQFICIMGGNNYSSGALLIVAFSNSF 257
Query: 320 AIYLDPH-DVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
LDPH V P N +DD T I+ ++ SL++ + CR+ +DF
Sbjct: 258 LYCLDPHIKVLPSFSDKNFIRDDFIQKVPT-------RIYWGELNSSLSMVYICRNLEDF 310
Query: 376 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDS 422
DD C+ +++ + LF V +N+ D E + E DS
Sbjct: 311 DDLCSNLTRI-----NSDLFEV-------INNCDF--EVKSINELDS 343
>gi|440301471|gb|ELP93857.1| hypothetical protein EIN_176840 [Entamoeba invadens IP1]
Length = 362
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 73/335 (21%), Positives = 143/335 (42%), Gaps = 45/335 (13%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ-----DFSSRILISYRKGFDPIGDSKITSD 124
++LLG+ +K + + L +++ S+ + ++YR G++ + +S + +D
Sbjct: 39 LFLLGIEYKTTPLKKQAQELPQSSLLQYSSMAAYVRMSNLLWMTYRSGYEKLPNSSLNTD 98
Query: 125 VGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVEILHLFGD--SETSPFSI 180
VGWGC +R+ QM+++ A+ L ++ P E + ++ F D +T+P SI
Sbjct: 99 VGWGCTIRAVQMMISNAMQTLVYKHDLTSSTTPYIPKQNEILNVVIPFVDFFEQTTPLSI 158
Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
H++ +E+ ++ ++G+ + P + D
Sbjct: 159 HHV-----------------------YESRFVVEQNKSGVNYLA-PTIVAKAYSDLVNSW 194
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
+ C+ ++ + + + P L+ +P+++ + V R L+ + F G
Sbjct: 195 KMCALRCVMASNTSIPLCDIKKEPFKPTLVFLPIIMD-QLVKSR----LQQIYKFNMFAG 249
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVI-NIGKDDLEAD---TSTYHSDVIRHIHL 356
IV G + YI G ++LDPH VQP + K DL++ T + I I L
Sbjct: 250 IVSGIGDRAVYIFGFHVMRCLFLDPHTVQPAAESFTKIDLKSYAPINPTLNRFAIHSIEL 309
Query: 357 DSIDPSLAIGFYCR---DKDDFDDFCARASKLAEE 388
D ID GF + + D F+ FC ++ E
Sbjct: 310 DKIDQFCTFGFLIKSLEEVDAFEKFCTETFDISHE 344
>gi|71425372|ref|XP_813094.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70877946|gb|EAN91243.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
Length = 328
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 121/285 (42%), Gaps = 43/285 (15%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 205
WR + ++ D+E ++PFS+H +++A KA W
Sbjct: 82 --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 263
GC+++ + + +R P + + S+ C + +
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170
Query: 264 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
++ +L+L P+ G ++ +L +G+VGG P S YI+G + +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLY 230
Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
LDPH + + +A T + +++ + D +D S +GF
Sbjct: 231 LDPHCMTQEALVSGHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|71407017|ref|XP_806004.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70869620|gb|EAN84153.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
gi|111154177|gb|ABH07410.1| autophagin-1 [Trypanosoma cruzi]
Length = 328
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 121/287 (42%), Gaps = 47/287 (16%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 149 RPWRKPLQKPFDREYVEILHLFGDSET---SPFSIHNLLQA--GKAYGLAAGSWVGPYAM 203
WR + + H F D +T +PFS+H +++A KA W
Sbjct: 82 --WR------YSANDCRLDH-FRDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWT----- 127
Query: 204 CRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--G 261
GC+++ + + +R P + + S+ C + +
Sbjct: 128 --------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICS 168
Query: 262 QADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
++ +L+L P+ G ++ +L +G+VGG P S YI+G +
Sbjct: 169 NLEFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRL 228
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+YLDPH + + +A T + +++ + D +D S +GF
Sbjct: 229 LYLDPHCMTQEALVSSHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|407417199|gb|EKF38000.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
Length = 328
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 67/269 (24%), Positives = 116/269 (43%), Gaps = 43/269 (15%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
L++YR F P+ S +TSD GWGC++RSSQML+A AL WR +
Sbjct: 44 FLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSANDCRLDHFR 95
Query: 165 EILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
+I D+E ++PFS+H +++A KA W G
Sbjct: 96 DI-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT-------------------PSQG 131
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGL- 278
C+++ + + +R P + + S+ C + + ++ +L+L P+ G
Sbjct: 132 CEAIRCCV-----NNAVDRRLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCGAS 186
Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
++ +L +G+VGG P S YI+G + +YLDPH + +
Sbjct: 187 RRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLYLDPHCMTQEALVSSHA 246
Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
A T + +++ + D +D S +GF
Sbjct: 247 ERAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|118378680|ref|XP_001022514.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89304281|gb|EAS02269.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 371
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 75/304 (24%), Positives = 130/304 (42%), Gaps = 40/304 (13%)
Query: 99 QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
++ SS + +SY+K + IT+D GWGC LR+SQM++AQ L H + + +
Sbjct: 52 EELSSLVFLSYKKNMKEFQYLSTTITTDNGWGCSLRTSQMMLAQGLKRHLYEKRVQSFIY 111
Query: 157 KPFDREYVEILHL---FGDSET------SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 207
D+ ++ HL F +S + SPF H+LL +A L Y +
Sbjct: 112 N--DKTKLDFQHLIMMFAESNSLENMDQSPFGFHSLL--TQAINLFQVPLKQQYTPVQGI 167
Query: 208 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 267
+AL + Q L ++ +V+ V+ +D + + K
Sbjct: 168 KALKQ------QFKQQKLVKSLKIVT-------SSTGVIFQEDIRQKMKNWEKS------ 208
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
+LL++ LG K+N Y+ ++ +G +GG S ++VG + + LDPH
Sbjct: 209 LLLILHFKLGTGKLNQIYVEQIKSLMDLEYFVGAIGGIKNKSLFMVGYMNDQFLSLDPHV 268
Query: 328 VQPVINIGKDDLEADTSTYHSDVIRHIHLDS---IDPSLAIGFYCRDKDDFDDFCARASK 384
Q N KD L + S + + DS + +I FY R + ++ F + S
Sbjct: 269 QQ---NACKDPLNLNDEEMSSFFPKKVRADSCVKYEGDFSISFYIRSEKQYNIFLQKISN 325
Query: 385 LAEE 388
L ++
Sbjct: 326 LNKQ 329
>gi|398021304|ref|XP_003863815.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
gi|322502048|emb|CBZ37132.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
Length = 388
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 144/359 (40%), Gaps = 50/359 (13%)
Query: 92 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
+G EF + + ++L SYR F P+ + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGA 112
Query: 151 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
+ P + E E I LF D ++P IH + S + P
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID----DASRHCSVFSKGQADW 265
E G+ + +A + GD C + D + S+GQ
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGDVPF------TFCCESRNIDEPAVMAKLSEGQH-- 207
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++DP
Sbjct: 208 --VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMDP 265
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIR-HIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
H +Q + +D + + R + DP + +GFY +D+ F A
Sbjct: 266 HYIQ-------NAYTSDRTVGTLEGARGELSARRFDPCMVLGFYLHTLEDYRVF-AEELA 317
Query: 385 LAEESNGAPLFTVTQTHKKPVNHSD------VLGETGGVPEDDSLGVMSMND-AVGNAH 436
+A PL + Q ++ SD E G +P ++ +S N A G H
Sbjct: 318 VANSLVAFPLISFGQRPREGTTPSDNGVVSVAESEEGIMPHENEKSQLSPNPLAAGGGH 376
>gi|146097214|ref|XP_001468076.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
gi|134072442|emb|CAM71152.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
Length = 388
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 144/359 (40%), Gaps = 50/359 (13%)
Query: 92 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
+G EF + + ++L SYR F P+ + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGT 112
Query: 151 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
+ P + E E I LF D ++P IH + S + P
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID----DASRHCSVFSKGQADW 265
E G+ + +A + GD C + D + S+GQ
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGDVPF------TFCCESRNIDEPAVMAKLSEGQH-- 207
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++DP
Sbjct: 208 --VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMDP 265
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIR-HIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
H +Q + +D + + R + DP + +GFY +D+ F A
Sbjct: 266 HYIQ-------NAYTSDKTVGTLEGARGELSARRFDPCMVLGFYIHTLEDYRVF-AEELV 317
Query: 385 LAEESNGAPLFTVTQTHKKPVNHSD------VLGETGGVPEDDSLGVMSMND-AVGNAH 436
+A PL + Q ++ SD E G +P ++ +S N A G H
Sbjct: 318 VANSLVAFPLISFGQRPREGTTPSDNGVVSVAESEEGIMPHENEKSQLSPNPLAAGGGH 376
>gi|148707987|gb|EDL39934.1| autophagy-related 4B (yeast), isoform CRA_c [Mus musculus]
Length = 128
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 64/120 (53%), Gaps = 15/120 (12%)
Query: 66 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 65
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 125
>gi|425784144|gb|EKV21938.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
digitatum Pd1]
Length = 208
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 67/145 (46%), Gaps = 30/145 (20%)
Query: 85 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------I 121
L D A N F DF SRI I+YR F PI +K
Sbjct: 59 LNDTAWPNA---FVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGF 115
Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 181
TSD GWGCM+RS Q L+A A LGR WR+ + + E +++ +F D +PFSIH
Sbjct: 116 TSDTGWGCMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIH 172
Query: 182 NLLQAG-KAYGLAAGSWVGPYAMCR 205
+ G ++ G G W GP A +
Sbjct: 173 KFVNRGAESCGKYPGEWFGPSATAK 197
>gi|297601024|ref|NP_001050279.2| Os03g0391000 [Oryza sativa Japonica Group]
gi|255674556|dbj|BAF12193.2| Os03g0391000, partial [Oryza sativa Japonica Group]
Length = 81
Score = 81.6 bits (200), Expect = 8e-13, Method: Composition-based stats.
Identities = 35/48 (72%), Positives = 41/48 (85%)
Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 331
RYIP L+ T TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ V
Sbjct: 10 RYIPLLKETLTFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLV 57
>gi|322701885|gb|EFY93633.1| cysteine protease atg4 [Metarhizium acridum CQMa 102]
Length = 255
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 67/140 (47%), Gaps = 27/140 (19%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVG 126
G A F DF+SR ++YR F DP + S TSD G
Sbjct: 116 GTGWPAAFLDDFASRFWMTYRSNFELIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSG 175
Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
WGCM+RS Q L+A AL LGR WR+ + DRE +L LF D +P+S+HN ++
Sbjct: 176 WGCMIRSGQSLLANALAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSVHNFVRH 232
Query: 187 GKAY-GLAAGSWVGPYAMCR 205
G+ Y G W GP A R
Sbjct: 233 GEKYCSKYPGEWFGPSATAR 252
>gi|225554849|gb|EEH03143.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 425
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/172 (31%), Positives = 75/172 (43%), Gaps = 27/172 (15%)
Query: 58 PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF---- 113
P+R+ S++ LL LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLIP 144
Query: 114 ---DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
DP + T+D GWGCM+RS Q L+A AL LGR WR+
Sbjct: 145 KSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRRG 204
Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCR 205
+ +E ++L LF D +PFSIH ++ G A G G W GP A R
Sbjct: 205 TKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATAR 253
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 6/109 (5%)
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD-----LEADTSTYHSDVIRHIHL 356
+ G+P +S Y +G Q YLDPH +P + + +D + +TYH+ +R +H+
Sbjct: 255 IHGRPSSSHYFIGAQGSHFFYLDPHHTRPAL-VYRDAGDRPYTTEELNTYHTRRLRRLHI 313
Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 405
+DPS+ IGF RD+DD++ + A G + V K P
Sbjct: 314 KDMDPSMLIGFLIRDEDDWNSWKRSVHNGAMIGTGKAIIHVFDKEKSPF 362
>gi|440300801|gb|ELP93248.1| hypothetical protein EIN_056230 [Entamoeba invadens IP1]
Length = 321
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 81/323 (25%), Positives = 120/323 (37%), Gaps = 91/323 (28%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
I+ L H + + DAA I I+YR+ + +G + +TSD GWGC
Sbjct: 38 IFGLSYTHDTPSELSFADAA---------HRIHDLITITYRQKYATLGHTYLTSDAGWGC 88
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH---------LFGDSETSPFSI 180
+RS QML+ +++ + L K F EY H L D E+S SI
Sbjct: 89 AIRSVQMLLVNSIVVY---------LDKSFHPEYTSHDHIAIKNNAKQLVFDKESSVLSI 139
Query: 181 HNL-LQAGKAYGLAAGSWVGPYAMCRS--------WEALARCQRAETGLGCQSLPMAIYV 231
HN+ +Q G+ P + C + WE +R L C
Sbjct: 140 HNIYIQDAIIKHNPTGTNFLPPSTCATAVADLYNFWE-----KRTFDVLMCTEY------ 188
Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
I + ++ P LL +P ++ + N ++
Sbjct: 189 ----------------IPEVTQ-------------PTLLFIPRIVTKSERN-----FIQT 214
Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
T PQS G V G A+ Y GVQE+ +LDPH VQ +G Y + I
Sbjct: 215 TSFLPQSRGFVAGIGDAAIYCFGVQEKRVFFLDPHFVQDASEVG----------YFNRPI 264
Query: 352 RHIHLDSIDPSLAIGFYCRDKDD 374
+ D +D S G C +K D
Sbjct: 265 FEANFDELDNSFVFGMMCENKSD 287
>gi|238594668|ref|XP_002393548.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
gi|215461192|gb|EEB94478.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
Length = 142
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 65/144 (45%), Gaps = 40/144 (27%)
Query: 83 EALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI--------------------- 121
+ +G +G N EF DF+S++ ++YR F PI D+ +
Sbjct: 3 DMVGTTSGANWPPEFTADFTSKVWLTYRSHFTPIRDTNLADLPLPSIFWKKWGWGLPGLG 62
Query: 122 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
TSD GWGCMLR+ Q L+A AL+F LGR WR+P P E S
Sbjct: 63 GERGWTSDSGWGCMLRTGQSLLANALVFMWLGREWRRP-PAPMPTE-------------S 108
Query: 177 PFSIHNLLQAGKAYGLAAGSWVGP 200
S+H + AGK G G W GP
Sbjct: 109 YASVHRMALAGKELGKDVGQWFGP 132
>gi|157874465|ref|XP_001685715.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
gi|68128787|emb|CAJ08920.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
Length = 388
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 135/323 (41%), Gaps = 52/323 (16%)
Query: 92 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
+G EF + + ++L SYR F P+ S T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKVATKKLLYFSYRNCFPPL-PSGSTTDTHWGCLVRTTQMLVGTCLLRYHCKGA 112
Query: 151 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
+ P + + E E I LF D ++P IH + S + P
Sbjct: 113 YVLP--EADNAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHC---SVFSKGQADW 265
E G+ + +A GD P C + SRH +V +K +
Sbjct: 162 ------TEAGMAIAAALIAFRAQGGD-------VPFTFCCE--SRHIDEPAVMAK-LLEG 205
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++DP
Sbjct: 206 QHVVLIIPVVLGIAPMSDQYELVMLKILDVKACCGIAGGFKQASLYMFGHQGRSVFFMDP 265
Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIR----HIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
H VQ A TS+ + + DP + +GFY +D+ F
Sbjct: 266 HYVQ----------NAYTSSRTVGTLEGSRGELRARRFDPCMVLGFYLHTPEDYRVF--- 312
Query: 382 ASKLAEESNGAPLFTVTQTHKKP 404
A +LA +N +F + ++P
Sbjct: 313 AEELA-VANSLVVFPLISFGRRP 334
>gi|167391747|ref|XP_001739914.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165896205|gb|EDR23684.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 325
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 68/314 (21%), Positives = 129/314 (41%), Gaps = 67/314 (21%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+++LG C+ +E L N+ N I+ +YR+ + +G++ ++SD GWGC
Sbjct: 36 VYILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSCLGNTYLSSDAGWGC 91
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------REYVEILHLFGDSETSPFSIHN 182
+R++QM++ L+ ++ +Q+ D + ++ L D +S SIHN
Sbjct: 92 AIRATQMMIVNTLVI------FKDQMQQIIDYNSFEHQQNKLQAKELIYDKISSLLSIHN 145
Query: 183 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
+ + K + +++ P C + +L + E ++
Sbjct: 146 IYIQEIIKVHNPTGTNFLPPSICCIAISSLLQ-----------------------EWDKK 182
Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
+ C+D +CS P L L+P ++ + + + T QS G
Sbjct: 183 LFNCITCLDHIP-NCSY---------PTLYLIPQIITFTEHQ-----LILDSLTLSQSRG 227
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
VGG ++ ++ G Q + +LDPH VQ + G Y + I L I
Sbjct: 228 FVGGIGESAIFVFGYQGTTLFFLDPHYVQNAGDFG----------YFNPPTYQIDLSLIS 277
Query: 361 PSLAIGFYCRDKDD 374
PS+ F C +++D
Sbjct: 278 PSIVFAFMCYNEND 291
>gi|384496645|gb|EIE87136.1| hypothetical protein RO3G_11847 [Rhizopus delemar RA 99-880]
Length = 224
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 5/101 (4%)
Query: 83 EALGDAAGNNGLA-----EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 137
E + + NN + +F DF+SR+ ++YR + PI S +D+GWGCMLRS Q L
Sbjct: 120 EEISEEEDNNNMYLRWPLDFYDDFTSRLWMTYRHNYPPIRPSNHKTDIGWGCMLRSGQSL 179
Query: 138 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
+A L+ H LGR WR+ Q R+ + I L + PF
Sbjct: 180 LANTLIIHFLGRDWRRQTQNQTTRKELCIGFLMSYHQEHPF 220
>gi|145507452|ref|XP_001439681.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406876|emb|CAK72284.1| unnamed protein product [Paramecium tetraurelia]
Length = 312
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 125/298 (41%), Gaps = 59/298 (19%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
FNQ + I YR G K SD GWGC++R QM++A AL+ R+
Sbjct: 49 FNQKKDTLIWFCYRANIQFEG--KAISDQGWGCLVRVGQMMLANALM--------RECKI 98
Query: 157 KPFDREYVEILHLFGDSE----TSPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 210
++ I+HLF D++ +PFSI +++ A + G W GP M
Sbjct: 99 LAINKTKAMIIHLFDDNQEYSTIAPFSIQQIIKRASINLNMKIGDWYTGPKIM------- 151
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT---P 267
S ED + + I+ + + Q D + P
Sbjct: 152 ----------------------SVIEDLNKNNMNIKQINLVNFLEQCVLESQIDLSFKKP 189
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
LL++ ++G + + I L+ Q G + GK + +++G Q+ +AI++DPH
Sbjct: 190 HLLIIHAIIGDKSLGQLEIQNLQSHMQISQFAGAIIGKNNKAFFLIGFQKNNAIFMDPHY 249
Query: 328 VQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
VQ K ++E + ++ L ++ ++A+ FY + ++ +F + +KL
Sbjct: 250 VQES---NKIEMECN--------LKCQPLKQLNGTIALAFYISNYMEYLEFKKQVNKL 296
>gi|403354729|gb|EJY76927.1| hypothetical protein OXYTRI_01553 [Oxytricha trifallax]
Length = 564
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 83/357 (23%), Positives = 140/357 (39%), Gaps = 83/357 (23%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 176
+T+D WGC +RS+QM++A AL Q F IL LF D+ S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQ------------QSTFMYPVNSILKLFDDNIRECTES 261
Query: 177 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 211
FSI N+ LQ G+ YG+++ + + + +C +E +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGFIVFETIM 321
Query: 212 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--------- 260
+ CQ Q L V++ + E DD + FS+
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLT-----FSQMGLGCDRRI 375
Query: 261 ---------------GQADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
+W +L++V + LGL+K++P Y + PQ +G+VGG
Sbjct: 376 NYDKLPNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGG 435
Query: 305 KPGASTYIVG------VQEESAIYLDPHDVQP-VINIGKD-DLEA-DTSTYHSDVIRHIH 355
KP + Y G + ++LDPH VQ N+ DL+ + + +H+ R +
Sbjct: 436 KPNKAFYFFGHIIDQDTNKVKLMFLDPHKVQDYTYNVETSYDLDVKEQAKFHTTEARLLK 495
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ +D L GF + DF+ F +E +F++ Q + N+S +
Sbjct: 496 IKELDTCLGFGFLIKSLQDFNQFKTLLESNIQEDLDHSIFSLYQHESELDNNSQMFS 552
>gi|213514936|ref|NP_001135074.1| Cysteine protease ATG4A [Salmo salar]
gi|209738482|gb|ACI70110.1| Cysteine protease ATG4A [Salmo salar]
Length = 102
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 49/83 (59%), Gaps = 11/83 (13%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+W+LG C+ + ++ E D SR+ +YRK F PIG + +SD GWGC
Sbjct: 29 VWVLGECYNVKTEKT-----------ELLSDVHSRLWFTYRKKFSPIGGTGPSSDTGWGC 77
Query: 130 MLRSSQMLVAQALLFHRLGRPWR 152
MLR QM++AQAL+ +LGR WR
Sbjct: 78 MLRCGQMILAQALVCSQLGRAWR 100
>gi|403370248|gb|EJY84987.1| hypothetical protein OXYTRI_17161 [Oxytricha trifallax]
Length = 564
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 81/357 (22%), Positives = 137/357 (38%), Gaps = 83/357 (23%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 176
+T+D WGC +RS+QM++A AL Q F IL LF D+ S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQ------------QSTFMYPVNSILKLFDDNIRECTES 261
Query: 177 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 211
FSI N+ LQ G+ YG+++ + + + +C +E +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGYIVFETIM 321
Query: 212 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--------- 260
+ CQ Q L V++ + E DD + FS+
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLT-----FSQMGLGCDRRI 375
Query: 261 ---------------GQADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
+W +L++V + LGL+K++P Y + PQ +G+VGG
Sbjct: 376 NYDKLPNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGG 435
Query: 305 KPGASTYIVG------VQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHSDVIRHIH 355
KP + Y G + ++LDPH VQ + D + + +H+ R +
Sbjct: 436 KPNKAFYFFGHIIDLDTNKVKLMFLDPHKVQDYTYDVETSYDLDVKEQAKFHTTEARLLK 495
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ +D L GF + DF+ F +E +F++ Q + N+S +
Sbjct: 496 IKELDTCLGFGFLIKSLQDFNQFKTLLESNIQEDLDHSIFSLYQHESELDNNSQMFS 552
>gi|119604523|gb|EAW84117.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 228
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 12/96 (12%)
Query: 59 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRL--GRPWR 152
+TSD GWGCMLRS QM++AQ LL H L G+PWR
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGKPWR 169
>gi|154343631|ref|XP_001567761.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065093|emb|CAM43207.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 398
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 71/300 (23%), Positives = 120/300 (40%), Gaps = 36/300 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
+ SYR F P+ + T+D WGC+LR++QML+ LL + + P + +
Sbjct: 74 LYFSYRSCFPPLPNGS-TTDTRWGCLLRTTQMLIGTCLLRYHCKGAYVLPEADNAELK-A 131
Query: 165 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 224
I LF D ++P IH + S + P E G+
Sbjct: 132 NISRLFMDVPSAPLGIHRAEDEAHKNCVKYASMLSP---------------TEAGMA--- 173
Query: 225 LPMAIYVVSGDEDGERGGAPVV--CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
MA +++ +G G P C + +V +K + ++L++P+VLGL ++
Sbjct: 174 --MAAALIACHAEG--GDVPFTFSCENRNIDEPAVVAK-LLEGQHVILIIPVVLGLAPLS 228
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
+Y + GI GG AS Y+ G Q ++DPH +Q D
Sbjct: 229 DKYESMMLKILDMKACCGIAGGFKQASFYMFGHQGRKVFFMDPHYIQKAYT--SDKTAGT 286
Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 402
D+ DP + +GFY +D+ F A +LA ++ ++ +HK
Sbjct: 287 LYGARGDLTAR----KFDPCMVLGFYLHTLEDYRVF---AEELAVVNSLVTFPLISWSHK 339
>gi|302915349|ref|XP_003051485.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256732424|gb|EEU45772.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 355
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 69/228 (30%), Positives = 103/228 (45%), Gaps = 36/228 (15%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRSS- 134
+A DE D N +F DF SRI ++YR F+ I S + TS + L+S
Sbjct: 99 LAYDEPTKD---NGWPPQFMADFESRIWMTYRSEFEAIPRSTNPQATSSLSLSMRLKSQL 155
Query: 135 ---QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
+ +++ RLGR WR+ Q P E EI+ LF D +P+S+H+ ++ G A
Sbjct: 156 GDQSPFSSDSMI--RLGRDWRR-GQSP--HEEREIIKLFADHPNAPYSLHSFVRHGASAC 210
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G G W GP A R +ALA + + +Y G P V D+
Sbjct: 211 GKYPGEWFGPSATARCIQALANSHESS---------LRVYST--------GDGPDVYEDE 253
Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+ +G+A + P L+LV LG++K+ P Y L + PQS
Sbjct: 254 FMKIAK--PEGEA-FHPTLILVGTRLGIDKITPVYWEALIASLQMPQS 298
>gi|14043289|gb|AAH07639.1| ATG4D protein [Homo sapiens]
gi|16877152|gb|AAH16845.1| ATG4D protein [Homo sapiens]
gi|119604522|gb|EAW84116.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|325464017|gb|ADZ15779.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
construct]
Length = 141
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 62/110 (56%), Gaps = 7/110 (6%)
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y +G Q++ +YLDPH QP +++ + D + ++H R + +DP
Sbjct: 1 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDP 58
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHS 408
S +GFY D+ +F+ C+ +++ S+ P+FT+ + H + +HS
Sbjct: 59 SCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQ--DHS 106
>gi|395756856|ref|XP_002834509.2| PREDICTED: cysteine protease ATG4D-like [Pongo abelii]
Length = 141
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 59/105 (56%), Gaps = 5/105 (4%)
Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
+GGKP S Y +G Q++ +YLDPH QP +++ + + + ++H R + +DP
Sbjct: 1 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE--SFHCTSPRKMAFAKMDP 58
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 403
S +GFY D+ +F+ C+ +++ S+ P+FT+ + H +
Sbjct: 59 SCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQ 103
>gi|324519641|gb|ADY47439.1| Cysteine protease ATG4C, partial [Ascaris suum]
Length = 282
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 74/146 (50%), Gaps = 22/146 (15%)
Query: 70 IWLLGVCHKIAQ----DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
+WLLG + ++ DE + A F D+ SRI ++YR P+ S T+D
Sbjct: 116 LWLLGEFYFTSRPDEDDEVVFRA--------FAIDYYSRIWLTYRTELSPLPGSSKTTDC 167
Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE-----YVEILHLFGDSETSPFSI 180
GWGC LR+ QM++AQAL+ LGR WR + +R + +I+ LFGD + +
Sbjct: 168 GWGCTLRTCQMMLAQALVVLHLGREWRFWGDEEANRYRCGFGHYDIVSLFGDHLDADLGL 227
Query: 181 HNLLQAGKAYGL--AAGSWVGPYAMC 204
+ L++ K A G+W Y+ C
Sbjct: 228 YRLMKIAKERNEHDAVGNW---YSAC 250
>gi|336259147|ref|XP_003344378.1| hypothetical protein SMAC_08321 [Sordaria macrospora k-hell]
Length = 429
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 58/114 (50%), Gaps = 26/114 (22%)
Query: 97 FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 133
F DF SRI ++YR F DP + +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239
Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
Q L+A A+L RLGR WR+ + D E +I+ LF D +PFS+HN ++ G
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYG 290
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 46/76 (60%), Gaps = 3/76 (3%)
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSID 360
G+P +S Y +GVQ + YLDPH +P + +D + T H+ +R +H+D +D
Sbjct: 302 GRPSSSHYFIGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELDTCHTRRLRQLHIDDMD 361
Query: 361 PSLAIGFYCRDKDDFD 376
PS+ IGF +D+DD+D
Sbjct: 362 PSMLIGFLIKDEDDWD 377
>gi|238595999|ref|XP_002393933.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
gi|215462138|gb|EEB94863.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
Length = 158
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 46/68 (67%)
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
LGL+ VNP Y T+++ +TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + +
Sbjct: 1 LGLDGVNPIYYDTIKILYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPLR 60
Query: 336 KDDLEADT 343
LE ++
Sbjct: 61 PPTLEPES 68
>gi|440291586|gb|ELP84849.1| hypothetical protein EIN_284050 [Entamoeba invadens IP1]
Length = 352
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 117/278 (42%), Gaps = 57/278 (20%)
Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--- 165
YR F P+ ++ +TSD GWGC +RS+QMLVA A+ K FD V
Sbjct: 92 YRNNFQPLPNTTLTSDSGWGCTIRSTQMLVANAI---------GKLFTNDFDTGEVTDKM 142
Query: 166 ILHLFGD--SETSPFSIHNLL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
++ F D S PFSIHNL +A + S++ P A+ ++ + + + A G
Sbjct: 143 VIKFFLDFFSVECPFSIHNLFLTKAILQGNINGNSFLPPSAVAAAFVEINK-KLANPKFG 201
Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
+ L + V+++ P ++L+P+ + +
Sbjct: 202 MEIL------------------------TTTFTFRVYTQ------PTIVLIPISIP-DSF 230
Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
N + + + F+F G+VGG + Y G+ + ++LDPH V+ N +
Sbjct: 231 NDK----IAVIFSFYLFSGMVGGSGRKAFYFFGIHHDQLLFLDPHTVR---NTVINSCSF 283
Query: 342 DTSTYHSDV--IRHIHLDSIDPSLAIGFYCRDKDDFDD 377
D YH + ++ + +D S + F + + DD
Sbjct: 284 DPQEYHPIIGDVKALSYSLLDRSAVLAFVVTSQRELDD 321
>gi|403364614|gb|EJY82073.1| hypothetical protein OXYTRI_20407 [Oxytricha trifallax]
Length = 806
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 60/110 (54%), Gaps = 2/110 (1%)
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
+++++ + LGLE + Y L+ F+ Q +GI+GGKP + Y VG Q++ I+LDPH
Sbjct: 641 LMIIMTIRLGLENIEQDYHKALKACFSLRQCVGILGGKPNFALYFVGYQQDHMIFLDPHY 700
Query: 328 VQPVINIGKD--DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
VQ + + D E + + I ++S+DP + +GF ++ D
Sbjct: 701 VQQALTSDEQLKDQELKDTYQSQRSAKKIKMESLDPCIGVGFLIQNSKDL 750
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 13/72 (18%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV----EILHLFGDSETS 176
I SD GWGCM+R QM++A + L K LQ+ + + IL + D +
Sbjct: 393 INSDCGWGCMIRCQQMMLANSFL---------KLLQQNHNFHDILTHDSILSMILDQLDA 443
Query: 177 PFSIHNLLQAGK 188
PF IH + + G+
Sbjct: 444 PFGIHQITEEGR 455
>gi|167394648|ref|XP_001741038.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165894548|gb|EDR22516.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 200
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 6/100 (6%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 161
I I+YRK I + T+D GWGCM+RS QM++AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96
Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
+++ I++LFGDS S FSIH L+ G+ G W GP
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP 136
>gi|209880175|ref|XP_002141527.1| peptidase family C54 [Cryptosporidium muris RN66]
gi|209557133|gb|EEA07178.1| peptidase family C54, putative [Cryptosporidium muris RN66]
Length = 353
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 87/325 (26%), Positives = 130/325 (40%), Gaps = 47/325 (14%)
Query: 75 VCHKIAQ-DEALGDAAGNNGLAE----FNQDFSSRILISYRKGFDPIGD---------SK 120
+ + I Q D++L GN A+ F + F IL SYR F I S
Sbjct: 20 IIYNIDQHDDSLIFLFGNKYDADKYDSFLKSFHEIILFSYRYNFPTIRSEWDFSIETGSS 79
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
+T+D+GWGCMLR QM +A LL R K + IL F D E S FSI
Sbjct: 80 VTTDLGWGCMLRVIQMSLALGLL--------RYCKMKKYTYSLDYILQNFQDLEESLFSI 131
Query: 181 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
H ++ G + W GP + + L + P
Sbjct: 132 HQFVKVGCSIFNKKPKDWFGPTSASTIADYLVKNN-----------PFLFNNFRISSILF 180
Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN-PRYIPTLRLTF-TFPQ 297
+ G I ++ S ++ ++ T + + LG +N +Y ++ F PQ
Sbjct: 181 KDGT----IYKSNLFQSFKNEEYSENTLTFVWLCTRLGSSALNIQKYKDSIFSIFKNVPQ 236
Query: 298 SLGIVGGKPGAST--YIVGVQEESAIYLDPH-DVQPVINIGKDDLEADTSTYHSDVIRHI 354
+ I GG +S+ IVG E+ LDPH +Q I + E + V I
Sbjct: 237 LICIAGGHNCSSSALLIVGASEKFLYCLDPHIKLQEAFVIKNFNREE----FIQQVPMRI 292
Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFC 379
++++PSL+ F C D DDF+ C
Sbjct: 293 SWENLNPSLSFVFCCTDIDDFNHLC 317
>gi|67470848|ref|XP_651386.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|56468115|gb|EAL46000.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
Length = 325
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 71/316 (22%), Positives = 127/316 (40%), Gaps = 71/316 (22%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
+ +LG C+ +E L N+ N I+ +YR+ + +G++ ++SD GWGC
Sbjct: 36 VHILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSYLGNTYLSSDAGWGC 91
Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHN 182
+R++QM++ AL+ ++ +Q+ D E L D +S SIHN
Sbjct: 92 AIRATQMMIVNALVI------FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHN 145
Query: 183 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
+ Q K + +++ P C + +L + E
Sbjct: 146 IYIQQVIKTHNPKGTNFLPPSVCCIAISSLLQ--------------------------EW 179
Query: 241 GGAPVVCIDDASR--HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
P CI + CS P L L+P ++ + + + +L L+ QS
Sbjct: 180 DKKPFNCITCLNHIPSCS---------CPTLYLIPRIITFTE-HQLILDSLALS----QS 225
Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
G VGG ++ ++ G Q + +LDPH VQ + G + TY D+
Sbjct: 226 RGFVGGIGESAIFVFGCQGTTLFFLDPHYVQNAGDFGY----FNPPTYQIDI------SL 275
Query: 359 IDPSLAIGFYCRDKDD 374
I S+ F C ++++
Sbjct: 276 ISSSVVFAFMCYEENE 291
>gi|323450755|gb|EGB06635.1| hypothetical protein AURANDRAFT_65498 [Aureococcus anophagefferens]
Length = 426
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 72/169 (42%), Gaps = 50/169 (29%)
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGA-----STYIVGVQEE---------------- 318
++ PRY LR PQS G++GG+P A +T + ++
Sbjct: 234 RLEPRYAEPLRAALRLPQSAGMLGGRPRANRIFNTTSMCASSDQNLQLCFENSTRAIDPS 293
Query: 319 ------SAIY---------------LDPHDVQPVINIGKDDL---EADTSTYHSDVIRHI 354
+A++ LDPH VQP + +G D A S D + +
Sbjct: 294 KSGRPRAALFFPGLAARDGGADVYGLDPHTVQPALAVGDDGALGPGAAASVAPRDA-KKL 352
Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 403
D++DPSLA+ FYC D+DDF DF RA L GAPLF V +
Sbjct: 353 AADALDPSLALAFYCADRDDFLDFVGRARALP----GAPLFEVVDAAPR 397
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 54/117 (46%), Gaps = 15/117 (12%)
Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
+ +YR GF+ + T D GWGCMLRS+QML+ AL R G R +
Sbjct: 28 LWFTYRCGFEELAPYGFTDDAGWGCMLRSAQMLLGNAL--TRNGAAPR-----------L 74
Query: 165 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
LF D+ +++PF +HN + G Y + G W GP C L +R G
Sbjct: 75 ATAALFADAPGDSAPFGLHNFAKCGLRYDVLPGEWYGPGVACHVLRDLVDWRRNAPG 131
>gi|320588376|gb|EFX00845.1| cysteine protease atg4 [Grosmannia clavigera kw1407]
Length = 348
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 62/124 (50%), Gaps = 28/124 (22%)
Query: 97 FNQDFSSRILISYRKGFDPI---------------------GD-SKITSDVGWGCMLRSS 134
F DF SR ++YR GF+PI GD S +SD GWGCM+RS
Sbjct: 120 FLDDFESRFWMTYRSGFEPIARSVDPKAPATLSFTMKLKALGDQSDFSSDSGWGCMIRSG 179
Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
Q L+A A+ + LGR WR ++ EI+ LF D +P+SIH + G +A
Sbjct: 180 QSLLANAMAMYELGRGWRLSDGGIAEK---EIISLFADDPRAPYSIHRFVGHG---AVAC 233
Query: 195 GSWV 198
GS++
Sbjct: 234 GSFL 237
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 3/61 (4%)
Query: 321 IYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 377
YLDPH +P + + E + + H+ +R +H+ +DPS+ IGF RD+DD+D+
Sbjct: 238 FYLDPHHTRPGLPFHEHPSEYTQEEVGSCHTRRLRRLHIREMDPSMLIGFLIRDEDDWDN 297
Query: 378 F 378
+
Sbjct: 298 W 298
>gi|124088531|ref|XP_001347134.1| Cysteine protease required for autophagy-like [Paramecium
tetraurelia strain d4-2]
gi|145474259|ref|XP_001423152.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|50057523|emb|CAH03507.1| Cysteine protease required for autophagy-like [Paramecium
tetraurelia]
gi|124390212|emb|CAK55754.1| unnamed protein product [Paramecium tetraurelia]
Length = 277
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 71/291 (24%), Positives = 122/291 (41%), Gaps = 59/291 (20%)
Query: 97 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
F Q + I SYR G + SD GWGC++R QM+VA +L+
Sbjct: 14 FLQLKETFIWFSYRANIQYEGRA--ISDQGWGCLIRVGQMIVANSLIRESTNS------- 64
Query: 157 KPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 210
KP D + +I+ LF D++ +PFSI +++ A Y + G W GP MC + L
Sbjct: 65 KPNDLK-TKIICLFDDNQCFSTLAPFSIQQIIKRADLVYNIKIGDWYTGPKIMCLLEDLL 123
Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW---TP 267
Q A+T + I + C + + Q D P
Sbjct: 124 ---QSAKT------------------------IKQLKIINFLEQCVI--EKQIDLQFKQP 154
Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
LL++ ++G ++++ ++ L+ PQ G + GK + +++G Q I +DPH
Sbjct: 155 QLLIIHAIIGNKELDQYFVAELQKHMQIPQFAGAIVGKSKKAYFLIGYQNNQGIVMDPHY 214
Query: 328 VQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
VQ E++ +S ++ I L ++A+ +Y + D+
Sbjct: 215 VQ----------ESNLLQLNSQ-LKCIPLKEFSGTIALCYYISNSYDYQQL 254
>gi|326665689|ref|XP_002661113.2| PREDICTED: cysteine protease ATG4D-like, partial [Danio rerio]
Length = 149
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 11/82 (13%)
Query: 65 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKITS 123
S +S + LLG ++++ + G+ E F + FSS + +SYR+GF P+ S ++S
Sbjct: 74 SKSSPVCLLGQSYQLS----------STGVRESFRRVFSSLLWMSYRRGFRPLDGSTLSS 123
Query: 124 DVGWGCMLRSSQMLVAQALLFH 145
D GWGCMLRS+QML+AQ LL H
Sbjct: 124 DAGWGCMLRSAQMLLAQGLLLH 145
>gi|237837057|ref|XP_002367826.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
gi|211965490|gb|EEB00686.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
Length = 3559
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 87/175 (49%), Gaps = 16/175 (9%)
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 300
GA V C+ D S + +G LLL PL L EK+NP Y+ +L P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHD-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDS 358
+V G+ + Y +G Q+++ +YLDPH +QP L A T ++ + + + +
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQPPAL----QLPAATPSFFAGSCWKVSDVAA 3079
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVL 411
++PSLA+ F+ R++ A KL EE + + V + + P++ DVL
Sbjct: 3080 LNPSLAVAFFVRNERQLLGLAAALKKL-EEVDSFSMLQVVERRRPFSPLDLDDVL 3133
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)
Query: 43 VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 95
+TA SM R+ V G S R IS D W G ++ D A + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139
Query: 96 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 138
E + + +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196
Query: 139 AQALLFHRL 147
QAL H L
Sbjct: 1197 MQALRRHFL 1205
>gi|281208441|gb|EFA82617.1| hypothetical protein PPL_04309 [Polysphondylium pallidum PN500]
Length = 646
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 57/120 (47%), Gaps = 22/120 (18%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
+ EF +DFS++I +SYR+GF IGD+ +D GWG W+K
Sbjct: 409 INEFLEDFSNKIWMSYRQGFPYIGDTMFENDCGWGY---------------------WKK 447
Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALAR 212
Q + I+ +F D T+PFSIHN+ G+ + G G W P + + ++L
Sbjct: 448 SGQNEYPELLYNIVRMFLDKPTAPFSIHNIALHGQNHLGKNVGEWFAPSNITHAIKSLVN 507
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 26/99 (26%)
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
IVGGKP AS Y + Q+++ YLDPH VQ I+ + ++
Sbjct: 541 IVGGKPRASLYFIAAQDDNLFYLDPHTVQQAID-----------------------NEVE 577
Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
SL++ K+DF DF R+ KL +S PL+ + +
Sbjct: 578 FSLSVS--VETKEDFLDFLERSKKLVSKSE-FPLYNIAE 613
>gi|221481944|gb|EEE20310.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 3562
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 75/149 (50%), Gaps = 13/149 (8%)
Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 300
GA V C+ D S + +G LLL PL L EK+NP Y+ +L P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023
Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHD-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDS 358
+V G+ + Y +G Q+++ +YLDPH +QP L A T ++ + + + +
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQPPAL----QLPAATPSFFAGSCWKVSDVAA 3079
Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAE 387
++PSLA+ F+ R++ A KL E
Sbjct: 3080 LNPSLAVAFFVRNERQLLGLAAALKKLEE 3108
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)
Query: 43 VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 95
+TA SM R+ V G S R IS D W G ++ D A + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139
Query: 96 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 138
E + + +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196
Query: 139 AQALLFHRL 147
QAL H L
Sbjct: 1197 MQALRRHFL 1205
>gi|307108757|gb|EFN56996.1| hypothetical protein CHLNCDRAFT_143632 [Chlorella variabilis]
Length = 538
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 41/79 (51%), Gaps = 5/79 (6%)
Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
S IYLDPH VQ D T+ + R + L SIDPSLA+GFYC ++ D
Sbjct: 331 SVIYLDPHQVQEAAACPDD-----WRTFWCETPRSMPLPSIDPSLALGFYCSSLGEYRDL 385
Query: 379 CARASKLAEESNGAPLFTV 397
C+R L S GAPL V
Sbjct: 386 CSRLEALERRSGGAPLVCV 404
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 62/129 (48%), Gaps = 26/129 (20%)
Query: 136 MLVAQALLFHRLGRPWR----------------KPLQKPFDREYVEILHLFGDS--ETSP 177
M++AQ L+ H LGR WR +L LF D+ E +P
Sbjct: 1 MILAQGLVRHVLGREWRWPEAARQQQAAAAPALAAAPAEAPPRLARLLELFWDTPAERNP 60
Query: 178 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
FS+H+L +AG+A G+ AG W+GP+ MC++ A A R Q + + + V E
Sbjct: 61 FSLHSLCRAGQACGVVAGRWLGPWVMCKTLAAAAGAARR------QGVDLGLTVAVLAES 114
Query: 238 GERGGAPVV 246
G GGAP++
Sbjct: 115 G--GGAPLL 121
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 26/37 (70%)
Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 316
K+NPRYIP L PQS+GIVGG+P +S Y VG Q
Sbjct: 215 KLNPRYIPQLEAVLAMPQSIGIVGGRPSSSLYFVGFQ 251
>gi|221505025|gb|EEE30679.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 3554
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 65/123 (52%), Gaps = 7/123 (5%)
Query: 268 ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
LLL PL L EK+NP Y+ +L P SLG+V G+ + Y +G Q+++ +YLDPH
Sbjct: 2988 CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLGMVAGRGQQAFYCIGTQQKALLYLDPH 3047
Query: 327 D-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDSIDPSLAIGFYCRDKDDFDDFCARASK 384
+QP L A T ++ + + + +++PSLA+ F+ R++ A K
Sbjct: 3048 SGIQPPAL----QLPAATPSFFAGSCWKVSDVAALNPSLAVAFFVRNERQLLGLAAALKK 3103
Query: 385 LAE 387
L E
Sbjct: 3104 LEE 3106
Score = 44.7 bits (104), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)
Query: 43 VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 95
+TA SM R+ V G S R IS D W G ++ D A + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139
Query: 96 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 138
E + + +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196
Query: 139 AQALLFHRL 147
QAL H L
Sbjct: 1197 MQALRRHFL 1205
>gi|193784751|dbj|BAG53904.1| unnamed protein product [Homo sapiens]
Length = 146
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 56/117 (47%), Gaps = 1/117 (0%)
Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
P SL G T ++ EE IYLDPH QP + D S + +
Sbjct: 4 PLSLSSAGSATHLPTCLILPGEE-LIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPCRMS 62
Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
+ +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 63 IAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 119
>gi|330846267|ref|XP_003294964.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
gi|325074459|gb|EGC28510.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
Length = 266
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 40/57 (70%), Gaps = 2/57 (3%)
Query: 91 NNGLAE--FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
NN + + F D S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+ALL H
Sbjct: 205 NNNIIQSNFLDDVRSLIWFSYRKDFPPIENTTITTDIGWGCMLRTGQMILARALLKH 261
>gi|46136685|ref|XP_390034.1| hypothetical protein FG09858.1 [Gibberella zeae PH-1]
Length = 360
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 96/228 (42%), Gaps = 35/228 (15%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRSS- 134
+A D+ + D +G F DF S+I ++YR F+PI S + TS + L+S
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158
Query: 135 --QMLVAQALLFHRLGR-PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
Q + + RLGR WR+ E +L F D +P+SIH+ ++ G A
Sbjct: 159 GDQSPFSSDTMV-RLGRGDWRRGESV---EEECRLLKDFADDPRAPYSIHSFVRHGASAC 214
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
G G W GP A R +AL + +I V S G P V D+
Sbjct: 215 GKYPGEWFGPSATARCIQALTNSHES-----------SIRVYST------GDGPDVYEDE 257
Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
+ D+ P L+LV LG++K+ P Y L PQS
Sbjct: 258 ---FMQIAKPPGEDFHPTLVLVGTRLGIDKITPVYWEALIAALQMPQS 302
>gi|401403014|ref|XP_003881388.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325115800|emb|CBZ51355.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 3465
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 59/106 (55%), Gaps = 7/106 (6%)
Query: 268 ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
LLL PL L EK+NP Y+P+L P S+G+V G+ + Y +G Q+++ +YLDPH
Sbjct: 2955 CLLLFPLTLCSGEKINPVYVPSLLAYLELPWSVGMVAGRGQQAFYCIGTQQKALLYLDPH 3014
Query: 327 D-VQ-PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
+Q P + + A S + + + +++PSL++ F+ R
Sbjct: 3015 SGIQPPALQL----PSATPSFFAGSCWKIADVAALNPSLSVAFFVR 3056
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 17/70 (24%)
Query: 96 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 138
+ +Q S +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 942 QLSQTVGSIARFTYRSGFSPMYKCCGEKKRRAGGGFEREWIAINSDVGWGCTVRAAQMLL 1001
Query: 139 AQALLFHRLG 148
QAL H LG
Sbjct: 1002 MQALRRHFLG 1011
>gi|340508254|gb|EGR34000.1| peptidase family c54 protein, putative [Ichthyophthirius
multifiliis]
Length = 209
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 20/143 (13%)
Query: 99 QDFSSRILISYRKGFDPI----GDSKI---TSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
++F + I ++YR+ F P+ D KI SD GWGCM+R QM +A+ L H +
Sbjct: 24 ENFKNIIWMTYRRNFFPLLHNTKDHKIQNYISDTGWGCMVRVGQMALAEGLRHHLQQKGI 83
Query: 152 ---RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSW 207
++ +Q D + FGD +P+SI + + A K + L G W P +C
Sbjct: 84 YDNKRIIQAFLDND-------FGDDNIAPYSIQKICKIAYKEFQLVPGQWYSPVRICHVL 136
Query: 208 EALARCQRAETGLGCQSLPMAIY 230
L + L C+ L + ++
Sbjct: 137 SLLHN--DKKQILDCEDLKVGVF 157
>gi|67482849|ref|XP_656724.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|56473943|gb|EAL51338.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449705841|gb|EMD45804.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 348
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 114/278 (41%), Gaps = 67/278 (24%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+S I YR F + ++ +TSD GWGC +R+ QML+A A++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANAII-------------KLFGS 131
Query: 162 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQR 215
+ + ++H F D S P+SIH+L G GS P++
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFTTQIIVSGNPNGSSFLPFS------------- 178
Query: 216 AETGLGCQSLPMAIYVVSG--DEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILL 270
IY ++ ++D R C V + ++ P ++
Sbjct: 179 -----------SVIYALTELVNKDFNRAF-----------ECHVITNKFLLKSINKPTIV 216
Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
+P + +K + R I F+F G+VGG + Y G+ ++LDPH V+P
Sbjct: 217 FIPFTIP-DKFDQRLIT----IFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRP 271
Query: 331 VI-NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+I K D E D SD I+ + ++ ++ S+ F
Sbjct: 272 CASSIMKFD-EKDYIAKLSD-IKSLRINELERSVVFSF 307
>gi|307190834|gb|EFN74684.1| Cysteine protease ATG4B [Camponotus floridanus]
Length = 93
Score = 62.4 bits (150), Expect = 5e-07, Method: Composition-based stats.
Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 19/91 (20%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIG-- 117
I + IW+LG + N L E + +D S + +YRKGF PIG
Sbjct: 16 IPQTDEPIWILGKKY--------------NALKELDMIRRDIRSMLWFTYRKGFIPIGGC 61
Query: 118 DSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
+S TSD GWGCMLR QM++AQAL+ LG
Sbjct: 62 NSTFTSDKGWGCMLRCGQMVLAQALITLHLG 92
>gi|167385012|ref|XP_001737178.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900129|gb|EDR26546.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 348
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 66/277 (23%), Positives = 116/277 (41%), Gaps = 65/277 (23%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+S I YR F + ++ + SD GWGC +R+ QML+A A++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLKSDGGWGCTIRACQMLLANAII-------------KLFGS 131
Query: 162 EYVE---ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
+ + ++H F D + P+SIH+L + +G+
Sbjct: 132 DNINRKTVIHWFLDFYNVECPYSIHSLFTTQI---IVSGN-------------------- 168
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR--HCSVFSKG---QADWTPILLL 271
G LP+++ + E + D +R C V + + P ++
Sbjct: 169 --PNGSSFLPLSVVTYALTELVNK---------DLNRIFECHVITNKFLLNSINKPTIIF 217
Query: 272 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 331
+P + ++ N R I F+F G+VGG + Y G+ + ++LDPH V+P
Sbjct: 218 IPFTIP-DEFNQRLIS----IFSFNLFAGMVGGCKQKAFYFFGIHHDQLLFLDPHFVRPC 272
Query: 332 I-NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+I K D E D SD I+ +H++ ++ S+ F
Sbjct: 273 ASSIMKFD-EKDYIAKLSD-IKSLHINELERSVVFSF 307
>gi|145521674|ref|XP_001446691.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414171|emb|CAK79294.1| unnamed protein product [Paramecium tetraurelia]
Length = 473
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 80/340 (23%), Positives = 135/340 (39%), Gaps = 81/340 (23%)
Query: 105 ILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQAL-----LFHRLGRPWRKPLQK 157
I +YR+GF DS +T+D GWGC++R QM++A+ L F+++ PL +
Sbjct: 52 IRFTYRQGFQAYQCQDSALTTDSGWGCVIRVGQMMMAELLKRHLKCFYKVDLFSFPPLLQ 111
Query: 158 PFDREYVEILHLFGDSE--------TSP----FSIHNLLQ-AGKAYGLAAGSWVGPYAMC 204
++L +F D + + P FSI +++ A K +G G W P +
Sbjct: 112 -------DVLQMFKDDDDMESQKGFSKPSKYGFSIQKIMRVAYKEWGKKPGEWYSPNQIV 164
Query: 205 RS-WEALARCQRAET-GLG-------------------------CQ----SLPMAIYVVS 233
++ ++ L GLG CQ S+ + +
Sbjct: 165 QAIYKILQEINIPYCYGLGFVPFYESQIDLRAIFQEMCMMEDCVCQKKVFSIEQFLKSLE 224
Query: 234 GDEDGERGGAPV---------VCIDDASRHC-----SVFSK--GQADWTPILLLVPLVL- 276
E G+ V VC +D S ++ K Q + P+ + +L
Sbjct: 225 KLEIGKEEMVQVMHGNDSISDVCCEDQSEQNKKEIGNLLKKYICQKCFVPVRAVAVCLLS 284
Query: 277 --GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
G ++ NP Y+ +R G++GG+P + +IVG + + LDPH VQ
Sbjct: 285 RIGCDEPNPDYLQAIRQFMKKKYFAGMLGGRPKEANFIVGFVDNKFVVLDPHLVQE---- 340
Query: 335 GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
K + E + + ID SL + FY ++ DD
Sbjct: 341 AKMNPEEYIKSCFPGEALFMSDKEIDCSLGLVFYLKNLDD 380
>gi|407037690|gb|EKE38747.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 348
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/278 (20%), Positives = 109/278 (39%), Gaps = 67/278 (24%)
Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
+S I YR F + ++ +TSD GWGC +R+ QML+A +++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANSII-------------KLFGS 131
Query: 162 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
+ + ++H F D S P+SIH+L + +
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFT-------------------------TQIIVS 166
Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILLLVP 273
+ G LP ++ + + E + + + C + + + P ++ +P
Sbjct: 167 KNPNGSSFLPFSVVIYALTELVNKDF-------NRAFECHIITNKFLLNSINKPTIVFIP 219
Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP--- 330
+ E + L F+F G+VGG + Y G+ ++LDPH V+P
Sbjct: 220 FTIPDE-----FEQRLITIFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRPCAS 274
Query: 331 -VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
+I + D A S I+ + ++ ++ S+ F
Sbjct: 275 SIIKFDEKDYIAKLSD-----IKSLRINELERSVVFSF 307
>gi|307201261|gb|EFN81130.1| Cysteine protease ATG4B [Harpegnathos saltator]
Length = 98
Score = 58.9 bits (141), Expect = 5e-06, Method: Composition-based stats.
Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 13/81 (16%)
Query: 70 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 127
+W+LG + ++ L +D S + +YRKGF PIG +S TSD GW
Sbjct: 23 VWILGRVYNAIKE-----------LDIIRRDIRSILWFTYRKGFVPIGGCNSTFTSDKGW 71
Query: 128 GCMLRSSQMLVAQALLFHRLG 148
GCMLR QM++A+AL+ LG
Sbjct: 72 GCMLRCGQMVLARALITLHLG 92
>gi|412989956|emb|CCO20598.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Bathycoccus prasinos]
Length = 532
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 96/267 (35%), Gaps = 74/267 (27%)
Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
L G W+ P +C+ + + + + C L DG GG P +
Sbjct: 234 ALCPGQWMAPSEICKRYGKMMNRLDSFQNVRCLILG----------DGCGGGVPEFYPER 283
Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
K AD +L+LVPL G + +NP Y+ +L+ + + +GIVGGK AS
Sbjct: 284 VREEM----KTHAD-KDVLILVPLRCGASDAINPEYVKSLQKFLSVRECVGIVGGKKTAS 338
Query: 310 TYIVGVQE--------------------------------------ESAIYLDPHDVQPV 331
YIVG AIYLDPH +
Sbjct: 339 YYIVGFTSGKKSSDSYSGGEKEEEEEEKEEEENEEDEEEEEEEEEETRAIYLDPHVAKAY 398
Query: 332 INIGKDDLEADT-STYHSDV--------IRHIHLDSIDPSLAIGFYCRDKDDFDD----- 377
++ + + T S Y+ I + ++DPSL +GF + ++D+
Sbjct: 399 VSPRERSRDESTESAYYRSFFGSASEHGILYTPFHALDPSLVVGFLVGNDTNYDEMNNAS 458
Query: 378 ------FCARASKLAEESNGAPLFTVT 398
F + + ES PL TV
Sbjct: 459 SSSLDAFVDVLTNIERESGSTPLITVV 485
>gi|294953189|ref|XP_002787639.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239902663|gb|EER19435.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 341
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 18/101 (17%)
Query: 105 ILISYRKGFDPI----GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
IL +YR F+PI G + + SD GWGC +R++QML+AQA+ G+ D
Sbjct: 67 ILFTYRCAFEPIEGCVGPTSV-SDKGWGCAIRATQMLLAQAV--KMAGK----------D 113
Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGK-AYGLAAGSWVGP 200
+ +L LF DS +P S+H +++ G+ G+W GP
Sbjct: 114 ADDSVVLSLFLDSPQAPLSLHRMVKMGQEVLAKRPGTWFGP 154
>gi|224010768|ref|XP_002294341.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969836|gb|EED88175.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 658
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 69/179 (38%), Gaps = 52/179 (29%)
Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-----------------QEESAIY-LD 324
P Y TL +FPQS+G++GG P + + G QE Y LD
Sbjct: 418 PTYGSTLAKLLSFPQSVGMLGGTPRHALWFYGADEVDPPTFGDDGKALNGQECGGWYGLD 477
Query: 325 PHDVQ------PVINIGKDDLEADT------------------------STYHSDVIRHI 354
PH Q GKD++ +D +T H++ R I
Sbjct: 478 PHTTQVAPRGTRTTKYGKDEVSSDDIELNNCQWQVQLNDAYLRSLHFTPTTTHANHQRSI 537
Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES---NGAPLFTVTQTHKKPVNHSDV 410
L +DPS A+GFY RD DF F L++E N P VT T K P DV
Sbjct: 538 PLSKLDPSCALGFYIRDHSDFVQFTNAIDALSKEHCRPNKLPDI-VTVTEKTPNYEVDV 595
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 121 ITSDVGWGCMLRSSQMLVAQALLFH 145
+ SD GWGCMLRS+QM++AQ + H
Sbjct: 133 LKSDAGWGCMLRSAQMMMAQTVRMH 157
>gi|407037201|gb|EKE38550.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 193
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 74/150 (49%), Gaps = 25/150 (16%)
Query: 52 HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRI 105
HE V P G S ++LGV K Q D+ L + L A F + S+
Sbjct: 25 HEDVQKPIFVGGCS----FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAAAFFR-ISNLF 79
Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-------LFHRLGRPWRKPLQKP 158
++YR G++ + +S +T+DVGWGC +R+ QM++A A+ + P+ P
Sbjct: 80 WMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYI-----P 134
Query: 159 FDREYVEILHLFGDS--ETSPFSIHNLLQA 186
+E + +L F DS T+P SIH++ ++
Sbjct: 135 TKQEVMNVLIPFIDSPNSTTPLSIHHVYES 164
>gi|407043625|gb|EKE42056.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 183
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 35/186 (18%)
Query: 28 SVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGD 87
++GS +S + KRL+ L P + + + +LG C+ +E L
Sbjct: 10 NIGSYFYNSMSSKRLIK-----------LQPF-----TQKNVVHILGNCYYPETNENLNH 53
Query: 88 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
N+ N I+ +YR+ + +G++ ++SD GWGC +R++QM+V AL+
Sbjct: 54 LTFNDA----NLKIHDLIVATYRQKYSYLGNTYLSSDAGWGCAIRATQMMVVNALVI--- 106
Query: 148 GRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHNLL--QAGKAYGLAAGSWV 198
++ +Q+ D E L D +S SIHN+ Q K + +++
Sbjct: 107 ---FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHNIYIQQVIKTHNPKGTNFL 163
Query: 199 GPYAMC 204
P C
Sbjct: 164 PPSICC 169
>gi|148682816|gb|EDL14763.1| mCG116861, isoform CRA_a [Mus musculus]
Length = 127
Score = 55.1 bits (131), Expect = 8e-05, Method: Composition-based stats.
Identities = 25/81 (30%), Positives = 48/81 (59%), Gaps = 1/81 (1%)
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
I+LDPH Q ++I + L D + + + + + ++DPS+A+GF+C+++ DFD++C+
Sbjct: 4 IFLDPHTTQTFVDIEESGLVDDQTFHCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCS 63
Query: 381 RASKLAEESNGAPLFTVTQTH 401
K + N +F + Q H
Sbjct: 64 LVQKEILKEN-LRMFELVQKH 83
>gi|78070455|gb|AAI07651.1| Atg4d protein [Rattus norvegicus]
Length = 168
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
+YLDPH QP +++ + + ++ +H R + +DPS +GFY ++ +F+ C+
Sbjct: 47 LYLDPHYCQPTVDVNQANFPLES--FHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCS 104
Query: 381 RASKLAEESNGA---PLFTVTQTHKK 403
++ S+ P+FTV + H +
Sbjct: 105 ELMRILSSSSVTERYPMFTVAEGHAQ 130
>gi|340500608|gb|EGR27474.1| peptidase family c54 protein, putative [Ichthyophthirius
multifiliis]
Length = 384
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 42/81 (51%), Gaps = 2/81 (2%)
Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
S+G++GG PG + Y +G+ + IYLDPH +Q K D TY I +
Sbjct: 223 SIGMIGGVPGKAYYFLGIIDNDFIYLDPHYIQEAHQNEKTVQNID--TYFCKFINRVSQK 280
Query: 358 SIDPSLAIGFYCRDKDDFDDF 378
++ SLA GFY ++ + + F
Sbjct: 281 KLESSLAFGFYIKNLQELEQF 301
>gi|392343434|ref|XP_003754884.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
norvegicus]
gi|392355909|ref|XP_003752169.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
norvegicus]
Length = 126
Score = 52.4 bits (124), Expect = 5e-04, Method: Composition-based stats.
Identities = 24/81 (29%), Positives = 47/81 (58%), Gaps = 1/81 (1%)
Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
I+LDPH Q ++ + L D + + + + + ++DPS+A+GF+C+++ DFD++C+
Sbjct: 4 IFLDPHTTQTFVDTEESGLVDDHTFHCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCS 63
Query: 381 RASKLAEESNGAPLFTVTQTH 401
K + N +F + Q H
Sbjct: 64 LVQKEILKEN-LRMFELVQKH 83
>gi|156085180|ref|XP_001610073.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797325|gb|EDO06505.1| hypothetical protein BBOV_II005540 [Babesia bovis]
Length = 206
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 30/135 (22%)
Query: 84 ALGDAAGNNGLAEFNQDFSSRILISYRKGFD-------------------PI-GDSKITS 123
A+ D L E +DF IL++YR+G P+ + I +
Sbjct: 17 AMCDQNPGPKLRERLKDF---ILLTYRRGLSIHLPRFYAGNIPKRFYGIWPLWQQTDIKT 73
Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
D GWGC LR++QM +A+AL R PL + IL LF D+ +PFS+ NL
Sbjct: 74 DRGWGCALRATQMALAEAL------RDVLSPLDN-VQEQRSRILQLFYDTTEAPFSLENL 126
Query: 184 LQAGKAYGLAAGSWV 198
+ A +G +W+
Sbjct: 127 VMADVEHGANVVAWI 141
>gi|328852767|gb|EGG01910.1| Hypothetical protein MELLADRAFT_123246 [Melampsora larici-populina
98AG31]
Length = 134
Score = 51.6 bits (122), Expect = 7e-04, Method: Composition-based stats.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 267 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDRVNINPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|328852471|gb|EGG01617.1| Hypothetical protein MELLADRAFT_92005 [Melampsora larici-populina
98AG31]
Length = 134
Score = 51.2 bits (121), Expect = 0.001, Method: Composition-based stats.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 267 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDRVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|408392897|gb|EKJ72185.1| hypothetical protein FPSE_07642 [Fusarium pseudograminearum CS3096]
Length = 389
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSID 360
G+P +S Y +G Q YLDPH + + +D +E + ++ H+ +R IH+ +D
Sbjct: 262 GRPSSSHYFIGAQGSFLFYLDPHHTRVALPYREDPIEYTSEEIASCHTPRLRRIHVREMD 321
Query: 361 PSLAIGFYCRDKDDFDDF 378
PS+ IGF +++ D+ +
Sbjct: 322 PSMLIGFLIQNEVDWQEL 339
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 26/94 (27%)
Query: 79 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 116
+A D+ + D +G F DF S+I ++YR F+PI
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158
Query: 117 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
GD S +SD GWGCM+RS Q ++A + RLGR
Sbjct: 159 GDQSPFSSDSGWGCMIRSGQSMLANTIAMVRLGR 192
>gi|328859149|gb|EGG08259.1| Hypothetical protein MELLADRAFT_123247 [Melampsora larici-populina
98AG31]
Length = 134
Score = 51.2 bits (121), Expect = 0.001, Method: Composition-based stats.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 267 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDQVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|84994978|ref|XP_952211.1| autophagy-related peptidase [Theileria annulata strain Ankara]
gi|65302372|emb|CAI74479.1| autophagy-related peptidase, putative [Theileria annulata]
Length = 350
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 81/356 (22%), Positives = 126/356 (35%), Gaps = 95/356 (26%)
Query: 85 LGDAAGNNGLAEFNQDFSSR--ILISYRKG-------------------FDPIGDSK--- 120
+ + N +N+ SR IL +YR G F P+ S
Sbjct: 1 MSNVVRENVNVLYNKRLESRFGILFTYRYGLEYKFPRPINFKRRRLFNIFSPLNLSNGIV 60
Query: 121 -ITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------RKPLQKPFDREYVE----- 165
I SD GWGC+LRS+QM ++QALL LG + R P + D+ +
Sbjct: 61 TIDSDKGWGCVLRSTQMAISQALLNLVLGPEFSVEQLEIRNRTPRNRKIDQSLLNIDTFE 120
Query: 166 -----------------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW-VGPY--AMCR 205
IL F D + FSI+N + A GP A+C
Sbjct: 121 KLLNGLLDLDGVSAVSVILAQFYDDLNAVFSIYNFVIADYVLKTCTKFLHFGPTSAALC- 179
Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
A + +LP+ + D H S + +
Sbjct: 180 ----------ASKIINDLNLPIN----------------SIAFPDGVFHISDVREILEEK 213
Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP-GASTYIVGVQEESAIYLD 324
+L+ V L+++ +R F Q GI+GG S YI G + Y D
Sbjct: 214 RNLLVWVSNKKKLDRIER---ECVRSMFRLSQFNGIIGGNLFNKSYYIFGTTNKRLYYND 270
Query: 325 PHDV--QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
PH + ++ D+ D + S ++ ++ + S + F +D+DDF DF
Sbjct: 271 PHLYCKKAFRSLEYVDIFRD---FTSRRVKSMNWRYFNASFTLLFLFKDRDDFQDF 323
>gi|407037202|gb|EKE38551.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 157
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 58/137 (42%), Gaps = 13/137 (9%)
Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
+ P L+ +P+VL N L+ + GIVGG + ++ G +YLD
Sbjct: 17 FKPTLVFLPIVL-----NHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGFHALQFLYLD 71
Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS-----LAIGFYCRDKDDFDDFC 379
PH VQP K E DT +Y + +IDP+ GF ++ + DDF
Sbjct: 72 PHIVQPSF---KSFTEIDTKSYSPIGSNRFSVHTIDPTKLDDFCTFGFLIKNLHEVDDFM 128
Query: 380 ARASKLAEESNGAPLFT 396
A + E SN L T
Sbjct: 129 KLAKDVFEISNDKELRT 145
>gi|312381461|gb|EFR27207.1| hypothetical protein AND_06241 [Anopheles darlingi]
Length = 307
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 26/38 (68%)
Query: 94 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 131
+ F +DF SRI ++YR+ F + DS TSD GWGCM+
Sbjct: 195 IEAFRRDFVSRIWMTYRREFQTMDDSNYTSDCGWGCMI 232
>gi|145500036|ref|XP_001436002.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403139|emb|CAK68605.1| unnamed protein product [Paramecium tetraurelia]
Length = 469
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 27/116 (23%)
Query: 105 ILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQAL-----LFHRLGRPWRKPLQK 157
I +YR+GF +S +T+D GWGC++R QM++A+ L F+ + PL +
Sbjct: 52 IRFTYREGFQAYQCQNSTLTTDSGWGCVIRVGQMMMAELLKRHLKCFYNVNLFQFPPLMQ 111
Query: 158 PFDREYVEILHLFGDSETSP------------FSIHNLLQ-AGKAYGLAAGSWVGP 200
E+L LF D + FSI +++ A + +G G W P
Sbjct: 112 -------EVLQLFKDDDEMESLKVQGKPSKYGFSIQKIMRIAYEEWGKKPGEWYSP 160
Score = 45.8 bits (107), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 51/103 (49%), Gaps = 12/103 (11%)
Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
+G ++ NP YI +R G++GG+P + +IVG ++ + LDPH VQ N+
Sbjct: 286 IGCDEPNPDYIQAIRQFMKKKYFAGLLGGRPREANFIVGFVDDKFVVLDPHLVQQA-NMN 344
Query: 336 KDDLEADT----STYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
++ + + SD ID SL + FY ++++D
Sbjct: 345 PEEYVKSCFPGEALFMSD-------KEIDCSLGLVFYLKNEED 380
>gi|307108756|gb|EFN56995.1| hypothetical protein CHLNCDRAFT_143631 [Chlorella variabilis]
Length = 137
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 9/103 (8%)
Query: 33 LGSSETVKRLVTAGSMRRIHERVLGPSRTGIS-SSTSDIWLLGVCHKIAQDEALGDAAGN 91
LG S + L A + ++H+ + +G S + + +WLLG C+ + +A
Sbjct: 15 LGLSRSYYALARALRLNKLHDLLA----SGASITPDAPVWLLGQCYSCPPGAS--EAQQE 68
Query: 92 NGLAEFNQDFSSRILISYRKGFDPI--GDSKITSDVGWGCMLR 132
LA + S +SYR GF I G + + SD GWGC LR
Sbjct: 69 EALARMLHHYQSIPWMSYRTGFTSIAAGSAHLQSDAGWGCTLR 111
>gi|294954843|ref|XP_002788322.1| hypothetical protein Pmar_PMAR026708 [Perkinsus marinus ATCC 50983]
gi|239903634|gb|EER20118.1| hypothetical protein Pmar_PMAR026708 [Perkinsus marinus ATCC 50983]
Length = 345
Score = 45.1 bits (105), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 27/113 (23%), Positives = 52/113 (46%), Gaps = 26/113 (23%)
Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESA-------------------IYLDPHDVQPVIN 333
P +G++GG+ + Y+VGV E+ + +DPH VQ +
Sbjct: 207 LKLPWCVGVIGGQSTRAHYVVGVAEKDTYLQSSTWGRSGYRQTRTDLLSIDPHFVQSAV- 265
Query: 334 IGKDDLEADTSTY-HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
+EA + ++ +SD + ++PSL +GFY +D+ D ++ A ++
Sbjct: 266 -----VEAQSISFKNSDEPSRLQPTKLNPSLGVGFYVKDETDLEELSAELDRV 313
>gi|167386236|ref|XP_001737678.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165899448|gb|EDR26037.1| hypothetical protein EDI_014170 [Entamoeba dispar SAW760]
Length = 346
Score = 44.7 bits (104), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 20/94 (21%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
NN +A + S+ ++YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 57 TSNNNIA---KHLSTMFRVTYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111
Query: 148 GRP-------WRKPLQKPF-------DREYVEIL 167
P + +QK F REYV+++
Sbjct: 112 QEPNPGFGEDAAEKVQKNFIIHSMEERREYVQLI 145
>gi|195350255|ref|XP_002041656.1| GM16787 [Drosophila sechellia]
gi|194123429|gb|EDW45472.1| GM16787 [Drosophila sechellia]
Length = 135
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 11/67 (16%)
Query: 63 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
I +++W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTNVWVLGKKYNAIQELEL-----------IRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 123 SDVGWGC 129
+D GWG
Sbjct: 92 TDKGWGL 98
>gi|389585790|dbj|GAB68520.1| peptidase, partial [Plasmodium cynomolgi strain B]
Length = 894
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
F+ R Y KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 418 FTKRKRTKYTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 466
>gi|183234005|ref|XP_652043.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169801304|gb|EAL46674.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449707706|gb|EMD47317.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 346
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 6/63 (9%)
Query: 89 AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
NN +A + S+ I+YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 57 TSNNNIA---KHLSTLFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111
Query: 148 GRP 150
P
Sbjct: 112 QEP 114
>gi|294877403|ref|XP_002767983.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
gi|239870083|gb|EER00701.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
Length = 133
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 30/42 (71%), Gaps = 5/42 (11%)
Query: 105 ILISYRKGFDPI----GDSKITSDVGWGCMLRSSQMLVAQAL 142
IL +YR F+PI G + + SD GWGC +R++QML+AQA+
Sbjct: 67 ILFTYRCAFEPIEGCVGPTSV-SDKGWGCAIRATQMLLAQAV 107
>gi|407038566|gb|EKE39191.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 346
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 6/62 (9%)
Query: 90 GNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
NN +A + S+ I+YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 58 SNNNVA---KHLSTMFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RLQ 112
Query: 149 RP 150
P
Sbjct: 113 EP 114
>gi|221060360|ref|XP_002260825.1| peptidase [Plasmodium knowlesi strain H]
gi|193810899|emb|CAQ42797.1| peptidase, putative [Plasmodium knowlesi strain H]
Length = 1001
Score = 42.4 bits (98), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
+F++R + KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 464 NFTNRRRTKHTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 513
>gi|156102174|ref|XP_001616780.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805654|gb|EDL47053.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1007
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
F+ R Y KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 468 FAKRKRDRYSKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 516
>gi|149030140|gb|EDL85217.1| rCG23129 [Rattus norvegicus]
Length = 90
Score = 41.6 bits (96), Expect = 0.89, Method: Composition-based stats.
Identities = 16/44 (36%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 5 NLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 47
>gi|50303849|ref|XP_451871.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49641003|emb|CAH02264.1| KLLA0B07667p [Kluyveromyces lactis]
Length = 1999
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 74/173 (42%), Gaps = 20/173 (11%)
Query: 10 ASKCFS------KSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGI 63
+SKCF KS DT ++L S + S++VKRL T M I R+ G R
Sbjct: 1024 SSKCFEFLAKSVKSDDDTLLQALRDATSNVLFSKSVKRLQTLYKMDGI--RMDGHRRVSR 1081
Query: 64 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGL-AEFNQD----FSSRILISYRKGFDPIGD 118
S L + K DE +N + A F +D +LI R+ D + D
Sbjct: 1082 SQ------LTHILFKERTDEYDRSIIDSNSIYALFKKDNVNLTKKMVLIEERRLNDYLAD 1135
Query: 119 SKITSDVGWGCMLRSSQMLVAQALLF-HRLGRPWRKPLQKPFDREYVEILHLF 170
+ + G+ C LR + + + A L + R W ++ R+ +++L +F
Sbjct: 1136 DRYQKEAGYACALRVIRKVASTAYLRDFKSTREWYLAARENVKRQRIQLLPVF 1188
>gi|390457789|ref|XP_003732004.1| PREDICTED: cysteine protease ATG4B-like [Callithrix jacchus]
Length = 102
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 29/51 (56%)
Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
S+++GF+C+ +DDF+D C + KL+ P+F + + + DVL
Sbjct: 25 SISVGFFCKTEDDFNDRCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 75
>gi|426336111|ref|XP_004029547.1| PREDICTED: uncharacterized protein LOC101129491 [Gorilla gorilla
gorilla]
Length = 351
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 51 ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP 91
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.136 0.415
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,450,998,062
Number of Sequences: 23463169
Number of extensions: 325879431
Number of successful extensions: 652308
Number of sequences better than 100.0: 783
Number of HSP's better than 100.0 without gapping: 762
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 649429
Number of HSP's gapped (non-prelim): 1361
length of query: 443
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 297
effective length of database: 8,933,572,693
effective search space: 2653271089821
effective search space used: 2653271089821
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)