BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 016269
(392 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255576671|ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B, putative [Ricinus communis]
Length = 489
Score = 583 bits (1502), Expect = e-164, Method: Compositional matrix adjust.
Identities = 297/393 (75%), Positives = 339/393 (86%), Gaps = 5/393 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
         MKGFRE+  AS+C SK   DTPNRSL S   E GS+   S+KGSL SS F SAFSVFETY
Sbjct: 1 MKGFRERV-ASRCSSKCPVDTPNRSLTSDCLESGSN--FSTKGSLWSSFFASAFSVFETY 57
Query: 61 SESS-ASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
           ES  ASEKK  H++ NGWT+AVK++V+ GSMRRIHERVLGPSRTGISS+TSDIWLLGVC
Sbjct: 58 RESPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGVC 117
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
           +KI++DE+ G+A   N LAEF  D+SSRIL++YR+GFD IGDSK  SDVGWGCMLRSSQM
Sbjct: 118 YKISEDES-GNADTGNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQM 176
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
           LVAQALLFH+LGR W KP QKP D+ YVEILHLFGDSE +PFSIHNL+QAGKAY LAAGS
Sbjct: 177 LVAQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGS 236
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
           WVGPYAMCRSWE+LAR +R E  L  QSLPMA+YVVSGDEDGERGGAPVV I+DASRHC
Sbjct: 237 WVGPYAMCRSWESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCL 296
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
            FS+GQADWTPILLLVPLVLGL+KVNPRYIP+L+ TFTF QSLGI+GGKPGASTYIVGVQ
Sbjct: 297 EFSRGQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGVQ 356
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
           +++A YLDPH+VQ V+NIG+DD+EADTS+YHS+
Sbjct: 357 DDNAFYLDPHEVQSVVNIGRDDIEADTSSYHSD 389
>gi|224117658|ref|XP_002331599.1| predicted protein [Populus trichocarpa]
gi|222873995|gb|EEF11126.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 286/392 (72%), Positives = 334/392 (85%), Gaps = 2/392 (0%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
         MKGFRE+   +   S ST ++PNRS  S  SELGS+++K SK SL S+ F SAFSVF+T+
Sbjct: 1 MKGFRERGFVASSKSSSTAESPNRSFTSDSSELGSADTKFSKPSLWSTFFASAFSVFDTH 60
Query: 61 SESSA-SEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGV 118
           +SS+ SEKKA H +  NGWT+AVK++V  GSMRRI E VLG S+TGIS++T DIWLLG
Sbjct: 61 CDSSSTSEKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGA 120
Query: 119 CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 178
           C+KI+QD + GDAA  N LA FN DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQ
Sbjct: 121 CYKISQDNSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQ 180
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 238
           MLVAQALLFHRLGR WRKPL KP DREYVEILHLFGDSE+S FSIHNLL+AGKAYGLAAG
Sbjct: 181 MLVAQALLFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAG 240
Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
           SWVGPYA+C SWE+L R +R ET L  QSL MA+YVVSG EDGERGGAPV+CI++A+RHC
Sbjct: 241 SWVGPYAVCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHC 300
Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
           S FSKGQ DWTPILLLVPLVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGV
Sbjct: 301 SEFSKGQEDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGV 360
Query: 359 QEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
           Q+E+A YLDPH+VQPV+N+ +DD+EA+TS+YH
Sbjct: 361 QDENAFYLDPHEVQPVVNVSRDDVEANTSSYH 392
>gi|147862867|emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
Length = 489
Score = 556 bits (1434), Expect = e-156, Method: Compositional matrix adjust.
Identities = 287/393 (73%), Positives = 330/393 (83%), Gaps = 10/393 (2%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MKGF EKA ASK K+ D+ N SE SS++K SK SL SS+F SAFSVFET
Sbjct: 1 MKGFCEKAVASKFSCKTKSDSSN-------SEPQSSDTKLSKVSLWSSVFASAFSVFETN 53
Query: 61 SESS--ASEKKAVHN-KSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 117
SESS ASEKKA+ N ++NGWT AV+++VT SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54 SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113
Query: 118 VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 177
+C+KI+Q+E+ A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173
Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
QMLVAQALL HR+GR WRK KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 297
GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293
Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
C FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353
Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
VQ+E A YLDPH+ Q V++I +++LEADTS+YH
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYH 386
>gi|359495820|ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
gi|296086874|emb|CBI33041.3| unnamed protein product [Vitis vinifera]
Length = 486
Score = 556 bits (1433), Expect = e-156, Method: Compositional matrix adjust.
Identities = 287/393 (73%), Positives = 330/393 (83%), Gaps = 10/393 (2%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MKGF EKA ASK K+ D+ N SE SS++K SK SL SS+F SAFSVFET
Sbjct: 1 MKGFCEKAVASKFSCKTKSDSSN-------SEPQSSDTKLSKVSLWSSVFASAFSVFETN 53
Query: 61 SESS--ASEKKAVHN-KSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 117
SESS ASEKKA+ N ++NGWT AV+++VT SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54 SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113
Query: 118 VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 177
+C+KI+Q+E+ A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173
Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
QMLVAQALL HR+GR WRK KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 297
GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293
Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
C FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353
Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
VQ+E A YLDPH+ Q V++I +++LEADTS+YH
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYH 386
>gi|224092798|ref|XP_002309707.1| predicted protein [Populus trichocarpa]
gi|222852610|gb|EEE90157.1| predicted protein [Populus trichocarpa]
Length = 481
Score = 551 bits (1421), Expect = e-154, Method: Compositional matrix adjust.
Identities = 279/394 (70%), Positives = 331/394 (84%), Gaps = 3/394 (0%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MK FR++ GA +T DTP S S SE GS+++K SK SL SS F SAFSVF+ Y
Sbjct: 1 MKVFRDR-GAVSPSKTTTTDTPKSSFISDSSEPGSTDTKVSKPSLWSSFFASAFSVFDIY 59
Query: 61 SESSA-SEKKAVHNK-SNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGV 118
+SS+ S +A H + SNGWT++VK++V G+MRRI ERVLG S+TGIS++TSDIWLLG
Sbjct: 60 RDSSSTSHNEAPHIRHSNGWTSSVKKIVAGGTMRRIQERVLGTSKTGISNTTSDIWLLGA 119
Query: 119 CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 178
+KI+QD++ G+A N LA F++DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQ
Sbjct: 120 RYKISQDDSSGNADATNALAAFHRDFSSRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQ 179
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 238
MLVAQALLFHRLGR WRKP+ KP DR+YVEILHLFGDSE S FSIHNLLQAGKAYGLAAG
Sbjct: 180 MLVAQALLFHRLGRSWRKPVDKPLDRDYVEILHLFGDSEASAFSIHNLLQAGKAYGLAAG 239
Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
SWVGPYAMCRSWE+LAR +R ET L Q+LPMA+YVVSG EDGERGGAPV+ I+DA+RHC
Sbjct: 240 SWVGPYAMCRSWESLARSKREETNLEYQTLPMAVYVVSGCEDGERGGAPVLSIEDAARHC 299
Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
S FSKG+ DWTPILLLVPLVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGV
Sbjct: 300 SEFSKGREDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGV 359
Query: 359 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
Q+E+A YLDPH+VQPV+N +DD+EA+TS+YH +
Sbjct: 360 QDENAFYLDPHEVQPVVNFSRDDVEANTSSYHCD 393
>gi|449442361|ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
gi|449512710|ref|XP_004164121.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
Length = 483
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 271/386 (70%), Positives = 308/386 (79%), Gaps = 1/386 (0%)
Query: 5 REKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESS 64
R K S C + D +R+ SV ELGS SSK S S F+S FS+FE + +SS
Sbjct: 3 RGKDLKSTCSPEPAADAIDRTHRSVYPELGSKNHISSKASSWSGFFSSNFSIFEHHKDSS 62
Query: 65 ASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQ 124
+EKK H + N W A V++++T+GSMRRI ER+LG R+G+ SS DIWLLGVCHKI+Q
Sbjct: 63 VTEKKVFHPRHNVW-ATVRKVMTSGSMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKISQ 121
Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
D DAA + G+A + QDFSSRIL++YRKGF I DSK TSDV WGCMLRSSQMLVAQA
Sbjct: 122 DHPPDDAASSPGVAGYEQDFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQA 181
Query: 185 LLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPY 244
LLFHRLGR WRKP QKP D+EYVEILHLFGDSETS FSIHNLLQAG+AY LAAGSWVGPY
Sbjct: 182 LLFHRLGRSWRKPSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGPY 241
Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 304
AMCRSWE L R +R L Q LPMAIY+VSGDEDGERGGAPV+ IDDASRHC FSKG
Sbjct: 242 AMCRSWETLVRSKRETPILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSKG 301
Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
Q DW+PILLLVPLVLGLEK+NPRYIP+LR TFTFPQSLGI+GGKPGASTYIVGVQ+E+A
Sbjct: 302 QHDWSPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENAF 361
Query: 365 YLDPHDVQPVINIGKDDLEADTSTYH 390
YLDPH+VQ V+NI KDDLEADTS+YH
Sbjct: 362 YLDPHEVQQVVNIDKDDLEADTSSYH 387
>gi|356568569|ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
Length = 485
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 274/391 (70%), Positives = 313/391 (80%), Gaps = 4/391 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
+KG E+ +SKC SKS+ +T + + V S+ GSS+ K K SL SS+F S FSV ETY
Sbjct: 3 LKGLCERIVSSKCSSKSSTETVDNTQVPVYSKAGSSDCKFPKASLWSSIFTSGFSVVETY 62
Query: 61 SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
SESSASEKKAV ++S+GW AAV+++VT GSMRR ERVLG SRT ISSS DIWLLGVCH
Sbjct: 63 SESSASEKKAVPSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCH 122
Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
KI+Q E+ G +NGLA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQML
Sbjct: 123 KISQQESTGGVDTSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQML 182
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
VAQALLFH+LGR WRKP+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSW
Sbjct: 183 VAQALLFHKLGRSWRKPIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSW 242
Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
VGPYAMCR+WE LA R + LG LPMAIYVVSGDEDGERGGAPVVCI+DAS+ CS
Sbjct: 243 VGPYAMCRTWEVLA---RKKNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCSE 299
Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
FS G A WTP+LLLVPLVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+GVQ
Sbjct: 300 FSSGLAVWTPLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQN 359
Query: 361 ESAIYLDPHDVQPVINIGKDDLE-ADTSTYH 390
E A YLDPHDVQ V+NI D E TS+YH
Sbjct: 360 EKAFYLDPHDVQQVVNISGDTQEPTGTSSYH 390
>gi|356531828|ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
Length = 486
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 273/391 (69%), Positives = 312/391 (79%), Gaps = 4/391 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
+KG E+ +SKC SKS+ +T + + V S+ GSS SK K SL S++F S FSV ETY
Sbjct: 3 LKGLCERIVSSKCSSKSSTETVDNTQVPVYSKAGSSNSKFPKASLWSNIFTSGFSVVETY 62
Query: 61 SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
SESSASEKKAVH++S+GW AAV+++VT GSMRR ERVLG SRT ISSS DIWLLGVCH
Sbjct: 63 SESSASEKKAVHSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCH 122
Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
KI+Q E+ G +NGLA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQML
Sbjct: 123 KISQQESSGGVDNSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQML 182
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
VAQALLFH+LGR WRKP+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSW
Sbjct: 183 VAQALLFHKLGRSWRKPIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSW 242
Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
VGPYAMCR+WE LA R + LG LPMAIYVVSGDEDGERGGAPVVCI+DAS+ C
Sbjct: 243 VGPYAMCRTWEVLA---RKKNDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCFE 299
Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
FS G A WTP+LLLVPLVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+G Q
Sbjct: 300 FSSGLAAWTPLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGAQN 359
Query: 361 ESAIYLDPHDVQPVINIGKDDLE-ADTSTYH 390
E A YLDPHDVQ V+NI D E TS+YH
Sbjct: 360 EKAFYLDPHDVQQVVNISGDTQEPTSTSSYH 390
>gi|357507987|ref|XP_003624282.1| Cysteine protease ATG4 [Medicago truncatula]
gi|147742964|sp|A2Q1V6.1|ATG4_MEDTR RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|124359485|gb|ABN05923.1| Peptidase C54 [Medicago truncatula]
gi|355499297|gb|AES80500.1| Cysteine protease ATG4 [Medicago truncatula]
Length = 487
Score = 494 bits (1272), Expect = e-137, Method: Compositional matrix adjust.
Identities = 259/390 (66%), Positives = 305/390 (78%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
+K ++ A+KC SKS+ + + + S+ GSS+SK K SL S+ F S FSV ETY
Sbjct: 3 LKDLCDRIVAAKCSSKSSTEIVDNTQVPASSKAGSSDSKFPKASLWSTFFTSGFSVDETY 62
Query: 61 SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
SESS+SEKK VH++++GW AAV+++V+ GSMRR ERVLG RT +SSS DIWLLGVCH
Sbjct: 63 SESSSSEKKTVHSRNSGWAAAVRKVVSGGSMRRFQERVLGSCRTDVSSSDGDIWLLGVCH 122
Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
KI+Q E+ GD N A F QDF SRILI+YRKGFD I DSK TSDV WGCMLRSSQML
Sbjct: 123 KISQHESTGDVDIRNVFAAFEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQML 182
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
VAQALLFH+LGR WRK + KP D+EY++IL LFGDSE + FSIHNLLQAGK YGLA GSW
Sbjct: 183 VAQALLFHKLGRSWRKTVDKPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSW 242
Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
VGPYAMCR+WE LAR QR + G Q LPMAIYVVSGDEDGERGGAPVVCI+DA + C
Sbjct: 243 VGPYAMCRTWEVLARNQREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCIEDACKRCLE 302
Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
FS+G WTP+LLLVPLVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ
Sbjct: 303 FSRGLVPWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQN 362
Query: 361 ESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+ A YLDPH+V+PV+NI D E +TS+YH
Sbjct: 363 DKAFYLDPHEVKPVVNITGDTQEPNTSSYH 392
>gi|30689628|ref|NP_850412.1| cysteine protease ATG4a [Arabidopsis thaliana]
gi|75160546|sp|Q8S929.1|ATG4A_ARATH RecName: Full=Cysteine protease ATG4a; AltName:
Full=Autophagy-related protein 4 homolog a;
Short=AtAPG4a; Short=Protein autophagy 4a
gi|19912143|dbj|BAB88383.1| autophagy 4a [Arabidopsis thaliana]
gi|110742303|dbj|BAE99076.1| hypothetical protein [Arabidopsis thaliana]
gi|330255286|gb|AEC10380.1| cysteine protease ATG4a [Arabidopsis thaliana]
Length = 467
Score = 466 bits (1200), Expect = e-129, Method: Compositional matrix adjust.
Identities = 230/391 (58%), Positives = 296/391 (75%), Gaps = 5/391 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MK ++ +C S S DT ++S + S+ G S++KS K +L S++F S+ SV + Y
Sbjct: 1 MKALCDRFVPQQCSSSSKSDTHDKS--PLVSDSGPSDNKS-KFTLWSNVFTSSSSVSQPY 57
Query: 61 SESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
ESS S K V NGWTA VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC
Sbjct: 58 RESSTSGHKQVCTTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVC 117
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
+KI+ DE G+ LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQM
Sbjct: 118 YKISADENSGETDTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQM 177
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
L AQALLFHRLGR W K + P ++EY+E L FGDSE S FSIHNL+ AG +YGLAAGS
Sbjct: 178 LFAQALLFHRLGRAWTKKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGS 236
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
WVGPYA+CR+WE+LA +R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C
Sbjct: 237 WVGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCL 296
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
FSKGQ++WTPI+LLVPLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQ
Sbjct: 297 EFSKGQSEWTPIILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQ 356
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
E+ YLDPH+VQ V+ + K+ + DTS+YH
Sbjct: 357 EDKGFYLDPHEVQQVVTVNKETPDVDTSSYH 387
>gi|388514549|gb|AFK45336.1| unknown [Lotus japonicus]
Length = 489
Score = 463 bits (1191), Expect = e-128, Method: Compositional matrix adjust.
Identities = 246/391 (62%), Positives = 286/391 (73%), Gaps = 1/391 (0%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
+K F ++ A+KC SKS+ +T + S S+ GSS+SK K SL SS F S FSV ETY
Sbjct: 3 LKAFCDRIVAAKCSSKSSTETVDNSQVPACSKAGSSDSKFPKASLWSSFFTSGFSVIETY 62
Query: 61 SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRI-HERVLGPSRTGISSSTSDIWLLGVC 119
S+S ASEKKAVH++++GW + I LG + + LGVC
Sbjct: 63 SKSPASEKKAVHSQNSGWGCCCEESCYCWLNEEIPRACTLGQAELTFQALMVIYGFLGVC 122
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
HK +Q E+ GD + A F QDFSS+IL++YRKGFD IGDSK TSDV WGCMLRSSQM
Sbjct: 123 HKFSQQESTGDVDNSTVFAAFEQDFSSKILLTYRKGFDAIGDSKYTSDVNWGCMLRSSQM 182
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
LVAQALLFH+LGR WRK KP D+EY++IL FGDSE S FSIHNLLQAGK YGLA GS
Sbjct: 183 LVAQALLFHKLGRMWRKTTDKPLDKEYLDILQHFGDSEASSFSIHNLLQAGKGYGLAVGS 242
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
WVGPYAMCRSWE LAR QR G Q LPMA+YVVSGDEDGERGGAPVVCI+DASR CS
Sbjct: 243 WVGPYAMCRSWEVLARNQRETNDHGEQPLPMALYVVSGDEDGERGGAPVVCIEDASRRCS 302
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
FS+G A WTP+LLLVPLVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ
Sbjct: 303 EFSRGLAAWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQ 362
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
E A YLDPHDVQPV++I D + +TS+YH
Sbjct: 363 NEKAFYLDPHDVQPVVHINGDAQDPNTSSYH 393
>gi|297828133|ref|XP_002881949.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
gi|297327788|gb|EFH58208.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
Length = 467
Score = 453 bits (1165), Expect = e-125, Method: Compositional matrix adjust.
Identities = 235/391 (60%), Positives = 300/391 (76%), Gaps = 5/391 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MK ++ +C S S DT ++S V S+ G S++KS K +L S++F S+ SV + Y
Sbjct: 1 MKALCDRFVPQQCSSSSKSDTHDKS--PVVSDSGPSDNKS-KFTLWSNVFTSSSSVSQPY 57
Query: 61 SESSASEKKAVHNKSNGWTAAVKRLVTA-GSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
ESS S K V NGWTA VKR+ A G++RR ERVLGP+RTG+ S+TSD+WLLGVC
Sbjct: 58 RESSTSGHKQVCTTRNGWTAFVKRVSMATGAIRRFQERVLGPNRTGLPSTTSDVWLLGVC 117
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
+KI++DEA G+ LA F QDFSS+IL++YR+GF+P D+ TSDV WGCM+RSSQM
Sbjct: 118 YKISEDEASGETNTGCVLAAFQQDFSSKILMTYRRGFEPFRDTTYTSDVNWGCMIRSSQM 177
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
L AQALLFHRLGR W K + P ++EY+E L FGDSE+S FSIHNL+ AG +YGLAAGS
Sbjct: 178 LFAQALLFHRLGRSWTKKSELP-EQEYLETLEPFGDSESSAFSIHNLIIAGSSYGLAAGS 236
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
WVGPYA+CR+WE+LA +R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C
Sbjct: 237 WVGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCL 296
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
FSKGQ++WTPILLLVPLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQ
Sbjct: 297 EFSKGQSEWTPILLLVPLVLGLDSVNPRYIPSLIATFTFPQSVGILGGKPGASTYIVGVQ 356
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
E+ YLDPH+VQ V+ + K+ + DTS+YH
Sbjct: 357 EDKGFYLDPHEVQQVVTVNKETPDVDTSSYH 387
>gi|42571227|ref|NP_973687.1| cysteine protease ATG4a [Arabidopsis thaliana]
gi|330255287|gb|AEC10381.1| cysteine protease ATG4a [Arabidopsis thaliana]
Length = 422
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 209/330 (63%), Positives = 261/330 (79%), Gaps = 2/330 (0%)
Query: 62 ESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
ESS S K V NGWTA VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC+
Sbjct: 14 ESSTSGHKQVCTTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCY 73
Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
KI+ DE G+ LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQML
Sbjct: 74 KISADENSGETDTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQML 133
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
AQALLFHRLGR W K + P ++EY+E L FGDSE S FSIHNL+ AG +YGLAAGSW
Sbjct: 134 FAQALLFHRLGRAWTKKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSW 192
Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
VGPYA+CR+WE+LA +R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C
Sbjct: 193 VGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLE 252
Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
FSKGQ++WTPI+LLVPLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQE
Sbjct: 253 FSKGQSEWTPIILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQE 312
Query: 361 ESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+ YLDPH+VQ V+ + K+ + DTS+YH
Sbjct: 313 DKGFYLDPHEVQQVVTVNKETPDVDTSSYH 342
>gi|3212867|gb|AAC23418.1| unknown protein [Arabidopsis thaliana]
Length = 451
Score = 433 bits (1114), Expect = e-119, Method: Compositional matrix adjust.
Identities = 218/391 (55%), Positives = 283/391 (72%), Gaps = 21/391 (5%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MK ++ +C S S DT ++S + S+ G S++KS K +L S++F S+ SV + Y
Sbjct: 1 MKALCDRFVPQQCSSSSKSDTHDKS--PLVSDSGPSDNKS-KFTLWSNVFTSSSSVSQPY 57
Query: 61 SESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
ESS S K V NGWTA VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC
Sbjct: 58 RESSTSGHKQVCTTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVC 117
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
+KI+ DE G+ LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQM
Sbjct: 118 YKISADENSGETDTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQM 177
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
L AQ ++EY+E L FGDSE S FSIHNL+ AG +YGLAAGS
Sbjct: 178 LFAQLP-----------------EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGS 220
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
WVGPYA+CR+WE+LA +R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C
Sbjct: 221 WVGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCL 280
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
FSKGQ++WTPI+LLVPLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQ
Sbjct: 281 EFSKGQSEWTPIILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQ 340
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
E+ YLDPH+VQ V+ + K+ + DTS+YH
Sbjct: 341 EDKGFYLDPHEVQQVVTVNKETPDVDTSSYH 371
>gi|222629790|gb|EEE61922.1| hypothetical protein OsJ_16662 [Oryza sativa Japonica Group]
Length = 892
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 215/354 (60%), Positives = 276/354 (77%), Gaps = 9/354 (2%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +FNS F++FE + +SSA++ + S W+ ++R+V +GSM R
Sbjct: 38 KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF---- 93
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ ++SD+W LG C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD
Sbjct: 94 LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEA 210
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D++EADTS+YH
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYH 384
>gi|297820846|ref|XP_002878306.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
gi|297324144|gb|EFH54565.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
Length = 476
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 224/391 (57%), Positives = 284/391 (72%), Gaps = 2/391 (0%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MK ++ SKC S T + + S + S S +L S + S+ V +
Sbjct: 1 MKAICDRFVPSKCSSSCTSEKRDIS-PTSLVSDSPSSDDKSNLTLCSDVVESSSPVSQPC 59
Query: 61 SESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
E+S SE K V N WT +K + +G++RR +RVLGPSRTGISSSTS+IWLLGVC
Sbjct: 60 REASTSEHKQVCTTHNSWTVILKTASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVC 119
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
+KI++ E+ +A LA F QDFSS IL++YR+GF+PIGD+ TSDV WGCMLRS QM
Sbjct: 120 YKISEAESFEEADAGRVLAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQM 179
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
L AQALLF RLGR WRK +P + +Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGS
Sbjct: 180 LFAQALLFQRLGRSWRKKDSEPPNEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGS 239
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
WVGPYA+CRSWE+LAR + ET + +S MA+++VSG EDGERGGAP++CI+D ++ C
Sbjct: 240 WVGPYAVCRSWESLARKNKEETDVKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCL 299
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
FS+G +W PILLLVPLVLGL+KVNPRYIP+L TFTFPQSLGI+GGKPGASTYIVGVQ
Sbjct: 300 EFSEGDTEWPPILLLVPLVLGLDKVNPRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQ 359
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
E+ YLDPHDVQ V+ + K++ + DTS+YH
Sbjct: 360 EDKGFYLDPHDVQQVVTVKKENQDVDTSSYH 390
>gi|115461386|ref|NP_001054293.1| Os04g0682000 [Oryza sativa Japonica Group]
gi|75143803|sp|Q7XPW8.1|ATG4B_ORYSJ RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|32488637|emb|CAE03430.1| OSJNBa0032F06.13 [Oryza sativa Japonica Group]
gi|82470053|gb|ABB77259.1| autophagy 4 [Oryza sativa Indica Group]
gi|113565864|dbj|BAF16207.1| Os04g0682000 [Oryza sativa Japonica Group]
gi|215697216|dbj|BAG91210.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 478
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 215/354 (60%), Positives = 276/354 (77%), Gaps = 9/354 (2%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +FNS F++FE + +SSA++ + S W+ ++R+V +GSM R
Sbjct: 38 KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF---- 93
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ ++SD+W LG C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD
Sbjct: 94 LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEA 210
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D++EADTS+YH
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYH 384
>gi|147742963|sp|Q2XPP4.2|ATG4B_ORYSI RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B;
Short=Protein autophagy 4; AltName: Full=OsAtg4
Length = 478
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 214/354 (60%), Positives = 274/354 (77%), Gaps = 9/354 (2%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +FNS F++FE + +SSA++ + S W ++R+V +GSM R
Sbjct: 38 KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF---- 93
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ ++SD+W LG C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD
Sbjct: 94 LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEA 210
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D++EADTS+YH
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYH 384
>gi|218195841|gb|EEC78268.1| hypothetical protein OsI_17962 [Oryza sativa Indica Group]
Length = 912
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 214/354 (60%), Positives = 274/354 (77%), Gaps = 9/354 (2%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +FNS F++FE + +SSA++ + S W ++R+V +GSM R
Sbjct: 38 KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF---- 93
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ ++SD+W LG C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD
Sbjct: 94 LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEA 210
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D++EADTS+YH
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYH 384
>gi|147742949|sp|A2XHJ5.1|ATG4A_ORYSI RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|125544166|gb|EAY90305.1| hypothetical protein OsI_11880 [Oryza sativa Indica Group]
Length = 473
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 216/354 (61%), Positives = 268/354 (75%), Gaps = 9/354 (2%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +F+S FS+FE + +SSA H+ S W+ ++R+ GSM R
Sbjct: 34 KQSKNSILSCVFSSPFSIFEAHQDSSAHRPLKPHSGSYAWSRFLRRIACTGSMWRF---- 89
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ + ++SD+W LG C+K++ +E + +G A F +DFSSRI I+YRKGFD
Sbjct: 90 LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 146
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE
Sbjct: 147 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEA 206
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVS 276
FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R R E G + PMA+YVVS
Sbjct: 207 CAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVRTNREHHEAVDGNGNFPMALYVVS 266
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 267 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKLNPRYIPLLKETF 326
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
TFPQSLGI+GGKPG STY+ GVQ++ +YLDPH+VQ ++I D+LEADTS+YH
Sbjct: 327 TFPQSLGILGGKPGTSTYVAGVQDDRVLYLDPHEVQLAVDIAADNLEADTSSYH 380
>gi|15232213|ref|NP_191554.1| cysteine protease ATG4b [Arabidopsis thaliana]
gi|75182325|sp|Q9M1Y0.1|ATG4B_ARATH RecName: Full=Cysteine protease ATG4b; AltName:
Full=Autophagy-related protein 4 homolog b;
Short=AtAPG4b; Short=Protein autophagy 4b
gi|7019689|emb|CAB75814.1| putative protein [Arabidopsis thaliana]
gi|19912145|dbj|BAB88384.1| autophagy 4b [Arabidopsis thaliana]
gi|110742150|dbj|BAE99003.1| hypothetical protein [Arabidopsis thaliana]
gi|332646468|gb|AEE79989.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 477
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 218/367 (59%), Positives = 279/367 (76%), Gaps = 2/367 (0%)
Query: 25 SLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKR 84
S S+ S+ SS++KS+ +L S + S+ V + E+S S V + WT +K
Sbjct: 26 SPTSLVSDSASSDNKSNL-TLCSDVVASSSPVSQLCREASTSGHNPVCTTHSSWTVILKT 84
Query: 85 L-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQD 143
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QD
Sbjct: 85 ASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQD 144
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
FSS IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D
Sbjct: 145 FSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPAD 204
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
+Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 205 EKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDD 264
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++
Sbjct: 265 KHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDR 324
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
VNPRYIP+L TFTFPQSLGI+GGKPGASTYIVGVQE+ YLDPHDVQ V+ + K++ +
Sbjct: 325 VNPRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQD 384
Query: 384 ADTSTYH 390
DTS+YH
Sbjct: 385 VDTSSYH 391
>gi|40539015|gb|AAR87272.1| putative autophagy protein (with alternative splicing) [Oryza
sativa Japonica Group]
gi|108708572|gb|ABF96367.1| Peptidase family C54 containing protein, expressed [Oryza sativa
Japonica Group]
Length = 505
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 213/354 (60%), Positives = 265/354 (74%), Gaps = 9/354 (2%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K K S+LS +F+S FS+FE + +SSA+ H+ S W+ ++R+ GSM R
Sbjct: 35 KQLKNSILSCVFSSPFSIFEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF---- 90
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ + ++SD+W LG C+K++ +E + +G A F +DFSSRI I+YRKGFD
Sbjct: 91 LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 147
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE
Sbjct: 148 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEA 207
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVS 276
FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R E G + PMA+YVVS
Sbjct: 208 CAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVS 267
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T
Sbjct: 268 GDEDGERGGAPVVCIDVAAQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETL 327
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D+LEA TS+YH
Sbjct: 328 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYH 381
>gi|75138024|sp|Q75KP8.1|ATG4A_ORYSJ RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|40539014|gb|AAR87271.1| putative autophagy protein (with alternative splicing) [Oryza
sativa Japonica Group]
gi|108708571|gb|ABF96366.1| Peptidase family C54 containing protein, expressed [Oryza sativa
Japonica Group]
gi|125586519|gb|EAZ27183.1| hypothetical protein OsJ_11120 [Oryza sativa Japonica Group]
gi|215769128|dbj|BAH01357.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 474
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 213/354 (60%), Positives = 265/354 (74%), Gaps = 9/354 (2%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K K S+LS +F+S FS+FE + +SSA+ H+ S W+ ++R+ GSM R
Sbjct: 35 KQLKNSILSCVFSSPFSIFEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF---- 90
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ + ++SD+W LG C+K++ +E + +G A F +DFSSRI I+YRKGFD
Sbjct: 91 LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 147
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE
Sbjct: 148 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEA 207
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVS 276
FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R E G + PMA+YVVS
Sbjct: 208 CAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVS 267
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T
Sbjct: 268 GDEDGERGGAPVVCIDVAAQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETL 327
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D+LEA TS+YH
Sbjct: 328 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYH 381
>gi|357166768|ref|XP_003580841.1| PREDICTED: cysteine protease ATG4B-like [Brachypodium distachyon]
Length = 493
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 220/367 (59%), Positives = 269/367 (73%), Gaps = 25/367 (6%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKA--VHNKSNG----------WTAAVKRL 85
SK KGS+LSS+F ++FE +SS+S A NKS G W+ A++R
Sbjct: 41 SKHCKGSILSSVF----TIFEAQQDSSSSVAAAAACENKSPGHSSGPSYGGAWSRALRRF 96
Query: 86 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 145
V GSM R LG ++ + D+W LG C+K + +E+ D ++G A F +DFS
Sbjct: 97 VGGGSMWRF----LGCAKV---LTNGDVWFLGKCYKFSSEESSSDLDTDSGHAAFLEDFS 149
Query: 146 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
SRI ++YRKGFD I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP + E
Sbjct: 150 SRIWVTYRKGFDAISDSKFTSDVNWGCMVRSSQMLVAQALMFHHLGRSWRKPSQKPCNPE 209
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGL 263
Y+ ILHLFGDSE FS+HNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R R E
Sbjct: 210 YIRILHLFGDSEVCAFSVHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQPEVSN 269
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
G +S PMA+YVVSGDEDGERGGAPVVCID A++ C F+K Q+ W+PILLLVPLVLGL+K
Sbjct: 270 GNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKDQSTWSPILLLVPLVLGLDK 329
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+NPRYIP L+ TFTFPQSLGI+GGKPG STYI G+Q++ A+YLDPHDVQ +NI D+L+
Sbjct: 330 INPRYIPLLKETFTFPQSLGILGGKPGTSTYIAGIQDDRALYLDPHDVQMAVNIASDNLD 389
Query: 384 ADTSTYH 390
ADTS+YH
Sbjct: 390 ADTSSYH 396
>gi|90399070|emb|CAJ86292.1| H0124B04.9 [Oryza sativa Indica Group]
Length = 1216
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 214/387 (55%), Positives = 274/387 (70%), Gaps = 42/387 (10%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +FNS F++FE + +SSA++ + S W ++R+V +GSM R
Sbjct: 309 KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF---- 364
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ ++SD+W LG C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD
Sbjct: 365 LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 421
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE
Sbjct: 422 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEA 481
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVS
Sbjct: 482 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 541
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 542 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 601
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ------------------------ 372
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ
Sbjct: 602 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMSATVIIWLFLQYPFYAWNPFCYG 661
Query: 373 ---------PVINIGKDDLEADTSTYH 390
++I D++EADTS+YH
Sbjct: 662 SYSGVFSTSQAVDIAADNIEADTSSYH 688
>gi|315259988|gb|ADT92194.1| autophagy-related 4b [Zea mays]
Length = 595
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 210/360 (58%), Positives = 268/360 (74%), Gaps = 14/360 (3%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
S+ K S+LS +F ++FE + S++ A K S W+ ++R V +GSM R
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
+ LG +R ++ D+W LG C++++ ++E G + ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
RKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP+D +Y+ +LHL
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKPYDPDYIRVLHL 217
Query: 213 FGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPM 270
FGDSE FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R R A+ G ++ PM
Sbjct: 218 FGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQADAVDGKENFPM 277
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
A+YVVSGDEDGERGGAPV CID A++ CS F+KGQ W+PILLL+PLVLGL+K+NPRYIP
Sbjct: 278 ALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLIPLVLGLDKINPRYIP 337
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ ++I D+LEADTS+YH
Sbjct: 338 LLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAVDIAPDNLEADTSSYH 397
>gi|221137006|ref|NP_001137489.1| autophagy-related 4b [Zea mays]
gi|194701156|gb|ACF84662.1| unknown [Zea mays]
gi|195657359|gb|ACG48147.1| cysteine protease ATG4B [Zea mays]
gi|216963250|gb|ACJ73914.1| autophagy-related 4b variant 1 [Zea mays]
gi|413920007|gb|AFW59939.1| autophagy 4b variant 1Cysteine protease ATG4B [Zea mays]
Length = 492
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 210/360 (58%), Positives = 268/360 (74%), Gaps = 14/360 (3%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
S+ K S+LS +F ++FE + S++ A K S W+ ++R V +GSM R
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
+ LG +R ++ D+W LG C++++ ++E G + ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
RKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP+D +Y+ +LHL
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKPYDPDYIRVLHL 217
Query: 213 FGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPM 270
FGDSE FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R R A+ G ++ PM
Sbjct: 218 FGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQADAVDGKENFPM 277
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
A+YVVSGDEDGERGGAPV CID A++ CS F+KGQ W+PILLL+PLVLGL+K+NPRYIP
Sbjct: 278 ALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLIPLVLGLDKINPRYIP 337
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ ++I D+LEADTS+YH
Sbjct: 338 LLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAVDIAPDNLEADTSSYH 397
>gi|224994902|gb|ACN76570.1| cysteine proteinase [Triticum aestivum]
Length = 484
Score = 393 bits (1009), Expect = e-107, Method: Compositional matrix adjust.
Identities = 212/355 (59%), Positives = 261/355 (73%), Gaps = 16/355 (4%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVH-NKSNGWTAAVKRLVTAGSMRRIHER 97
K K S+LSS+ ++FE + S + H + S W+ ++R V GSM R
Sbjct: 46 KQCKASILSSVL----TIFEPDQDQSG--RSGGHASGSYAWSRVLRRFVGGGSMWRF--- 96
Query: 98 VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
LG G + + D+W LG C+K++ +E+ D+ G A F +DFSSR+ I+YRKGFD
Sbjct: 97 -LG---CGKALTAGDVWFLGKCYKLSSEESSSDSDSEGGHAAFLEDFSSRVWITYRKGFD 152
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP Q P D E+ ILHLFGDSE
Sbjct: 153 VISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPAQNPSDPEHTRILHLFGDSE 212
Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVV 275
FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R R + + +S PM +YVV
Sbjct: 213 VCAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQPEVINRNESFPMVLYVV 272
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
SGDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ T
Sbjct: 273 SGDEDGERGGAPVVCIDVAAQLCYDFNKGQSAWSPILLLVPLVLGLDKINPRYIPLLKET 332
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
FTFPQSLGI+GGKPGASTYI GVQ++ A+YLDPH+VQ +NI D+LEADTS+YH
Sbjct: 333 FTFPQSLGILGGKPGASTYIAGVQDDRALYLDPHEVQLAVNIASDNLEADTSSYH 387
>gi|221137004|ref|NP_001137488.1| autophagy-related 4 [Zea mays]
gi|195620628|gb|ACG32144.1| cysteine protease ATG4B [Zea mays]
gi|216963236|gb|ACJ73912.1| autophagy-related 4 variant 1 [Zea mays]
gi|219886349|gb|ACL53549.1| unknown [Zea mays]
gi|414584729|tpg|DAA35300.1| TPA: autophagy 4a variant 2 isoform 1 [Zea mays]
gi|414584730|tpg|DAA35301.1| TPA: autophagy 4a variant 2 isoform 2 [Zea mays]
Length = 492
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 211/363 (58%), Positives = 271/363 (74%), Gaps = 15/363 (4%)
Query: 36 SESKSSKGSLLSSLFNSAFSVFETYSESSASEK-KAVHNKSN----GWTAAVKRLVTAGS 90
S S+ K S+LS +F+ F++FE + S+S A KS+ G + ++R V +GS
Sbjct: 45 SGSRQPKASILSGVFSPPFAIFEGQQQGSSSPACDARSTKSSSGSYGLSRILRRFVGSGS 104
Query: 91 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGL-AEFNQDFSSRIL 149
M R+ LG R ++SD+W LG C+K++ +E + ++ A F +DFSSRI
Sbjct: 105 MWRL----LGCGRV---LTSSDVWFLGKCYKVSPEEEESGDSESDSGHAAFLEDFSSRIW 157
Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
I+YRKGFD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +
Sbjct: 158 ITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGV 217
Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQS 267
LHLFGDSE FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L R R A+ G ++
Sbjct: 218 LHLFGDSEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKEN 277
Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 327
PMA+YVVSGDEDGERGGAPVVCID A++ CS F+KG + W+PILLLVPLVLGL+K+NPR
Sbjct: 278 FPMALYVVSGDEDGERGGAPVVCIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPR 337
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
YIP L+ TF FPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D+LEADTS
Sbjct: 338 YIPLLKETFMFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMTVDIALDNLEADTS 397
Query: 388 TYH 390
+YH
Sbjct: 398 SYH 400
>gi|216963242|gb|ACJ73913.1| autophagy-related 4a variant 2 [Zea mays]
Length = 429
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 211/363 (58%), Positives = 271/363 (74%), Gaps = 15/363 (4%)
Query: 36 SESKSSKGSLLSSLFNSAFSVFETYSESSASEK-KAVHNKSN----GWTAAVKRLVTAGS 90
S S+ K S+LS +F+ F++FE + S+S A KS+ G + ++R V +GS
Sbjct: 45 SGSRQPKASILSGVFSPPFAIFEGQQQGSSSPACDARSTKSSSGSYGLSRILRRFVGSGS 104
Query: 91 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGL-AEFNQDFSSRIL 149
M R+ LG R ++SD+W LG C+K++ +E + ++ A F +DFSSRI
Sbjct: 105 MWRL----LGCGRV---LTSSDVWFLGKCYKVSPEEEESGDSESDSGHAAFLEDFSSRIW 157
Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
I+YRKGFD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +
Sbjct: 158 ITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGV 217
Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQS 267
LHLFGDSE FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L R R A+ G ++
Sbjct: 218 LHLFGDSEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKEN 277
Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 327
PMA+YVVSGDEDGERGGAPVVCID A++ CS F+KG + W+PILLLVPLVLGL+K+NPR
Sbjct: 278 FPMALYVVSGDEDGERGGAPVVCIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPR 337
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
YIP L+ TF FPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D+LEADTS
Sbjct: 338 YIPLLKETFMFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMTVDIALDNLEADTS 397
Query: 388 TYH 390
+YH
Sbjct: 398 SYH 400
>gi|224994904|gb|ACN76571.1| cysteine proteinase [Triticum aestivum]
Length = 486
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 212/355 (59%), Positives = 262/355 (73%), Gaps = 16/355 (4%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVH-NKSNGWTAAVKRLVTAGSMRRIHER 97
K K S+LSS+ ++FE + S + H + S W+ ++R V GSM R
Sbjct: 48 KQCKASILSSVL----TIFEPDQDQSG--RSGGHASGSYAWSRVLRRFVGGGSMWRF--- 98
Query: 98 VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
LG G + + +D+ LG C+K++ +E+ D+ G A F +DFSSRI I+YRKGFD
Sbjct: 99 -LG---CGKALTAADVQFLGKCYKLSSEESSSDSDSEGGHAAFLEDFSSRIWITYRKGFD 154
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP Q P + EY+ ILHLFGDSE
Sbjct: 155 AISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPAQNPSNPEYIRILHLFGDSE 214
Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVV 275
FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R R + + +S PMA+YVV
Sbjct: 215 ACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQPEVINRNESFPMALYVV 274
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
SGDEDGERGGAPVVCID A++ C F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T
Sbjct: 275 SGDEDGERGGAPVVCIDVAAQLCYDFNKDQSAWSPILLLVPLVLGLDKINPRYIPLLKET 334
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
FTFPQSLGI+GGKPGASTYI GVQ++ A+YLDPH+VQ +NI D+LEADTS+YH
Sbjct: 335 FTFPQSLGILGGKPGASTYIAGVQDDRALYLDPHEVQLAVNIASDNLEADTSSYH 389
>gi|194696780|gb|ACF82474.1| unknown [Zea mays]
gi|413920008|gb|AFW59940.1| autophagy 4b variant 3 [Zea mays]
Length = 462
Score = 368 bits (944), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 198/355 (55%), Positives = 247/355 (69%), Gaps = 34/355 (9%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHER 97
S+ K S+LS +F ++FE + S++ A K + A R+ +RR+
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRI-----LRRVS-- 97
Query: 98 VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
++E G + ++G A F +DFSSRI I+YRKGFD
Sbjct: 98 -------------------------PEEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFD 132
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP+D +Y+ +LHLFGDSE
Sbjct: 133 AIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKPYDPDYIRVLHLFGDSE 192
Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVV 275
FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R R A+ G ++ PMA+YVV
Sbjct: 193 ACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVV 252
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
SGDEDGERGGAPV CID A++ CS F+KGQ W+PILLL+PLVLGL+K+NPRYIP L+ T
Sbjct: 253 SGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLIPLVLGLDKINPRYIPLLKET 312
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
F FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ ++I D+LEADTS+YH
Sbjct: 313 FKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAVDIAPDNLEADTSSYH 367
>gi|168010849|ref|XP_001758116.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690572|gb|EDQ76938.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 356
Score = 353 bits (907), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 178/305 (58%), Positives = 237/305 (77%), Gaps = 4/305 (1%)
Query: 89 GSMRRIHERVLGPSRTGISSST-SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR 147
GSMRR+ E +LGP T ++S+ S+IW+LG+C+K++ D + EF DF+SR
Sbjct: 1 GSMRRLQELLLGPRFTAANASSGSEIWVLGLCYKVSADPN-NETLSVQAFEEFISDFTSR 59
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
I I+YRKGF+ +G SK+TSDVGWGCMLRS QML+AQAL+ H LGR WR+ +P + Y+
Sbjct: 60 IWITYRKGFECVGQSKLTSDVGWGCMLRSGQMLLAQALVCHYLGRSWRREPGQPCSQAYL 119
Query: 208 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GC 265
+IL FGDSE+ PFSIHNLL+AG +GLAAGSW+GPYA+CR+ EALAR R ++ G
Sbjct: 120 QILQTFGDSESCPFSIHNLLEAGHPFGLAAGSWLGPYALCRTLEALARADREQSQKKGGK 179
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 325
++LP A+YVVSG+ +GERGGAPV+C++D + CS + + +WTP+L+LVPLVLGL+KVN
Sbjct: 180 RALPFAVYVVSGEAEGERGGAPVLCVEDVATLCSKWREPTEEWTPLLVLVPLVLGLDKVN 239
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
PRY+P+LR TFTFPQSLGI GGKPGASTY++GVQ+E A+YLDPH+ Q V+ + ++LE D
Sbjct: 240 PRYLPSLRATFTFPQSLGIAGGKPGASTYLIGVQDEQAMYLDPHENQQVVPVTPENLELD 299
Query: 386 TSTYH 390
TS+YH
Sbjct: 300 TSSYH 304
>gi|302783857|ref|XP_002973701.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
gi|300158739|gb|EFJ25361.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
Length = 358
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 174/324 (53%), Positives = 227/324 (70%), Gaps = 29/324 (8%)
Query: 78 WTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDA 131
WTAAV+R V G +RRI E ++G SS S IWLLG C+++ + DE ++
Sbjct: 1 WTAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKES 58
Query: 132 AGNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR 189
++ +A+F DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HR
Sbjct: 59 TSSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHR 118
Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
LGR WR+ ++P+ REY+EILH F DS + PFSIHN ++AG YGLAAGSW+GPYA+C
Sbjct: 119 LGRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALC 177
Query: 248 RSWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 306
+ EALAR G G Q +A+YVVSGD GERGGAPV+ D + C
Sbjct: 178 HAIEALAR----NDGRGRQGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC-------- 225
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YL
Sbjct: 226 ---PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYL 282
Query: 367 DPHDVQPVINIGKDDLEADTSTYH 390
DPH+VQ V+++ + LE D+++YH
Sbjct: 283 DPHEVQKVVSVSGESLEFDSASYH 306
>gi|168036750|ref|XP_001770869.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677928|gb|EDQ64393.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 346
Score = 328 bits (840), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 164/287 (57%), Positives = 224/287 (78%), Gaps = 5/287 (1%)
Query: 107 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
SSS +IW+LG+C+K++ D A +A + EF DFSSRI I+YRKGF+ +G+SK+TS
Sbjct: 4 SSSGGEIWVLGICYKVSAD-ANDEAVSAHAFEEFLNDFSSRIWITYRKGFESLGESKLTS 62
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
DVGWGCMLRS Q+L+AQAL+ H LGR WR+ + +EY++IL FGDSE+ FSIHNL
Sbjct: 63 DVGWGCMLRSGQILLAQALVCHYLGRTWRRNACQECLQEYLQILQSFGDSESCSFSIHNL 122
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARC---QRAETGLGCQSLPMAIYVVSGDEDGER 283
L+AG+ +GLAAGSW+GPYA+CR+ EALA+ Q A+ G G ++LP A+YVVSG+ +G+R
Sbjct: 123 LEAGRPFGLAAGSWLGPYALCRTLEALAKADEDQNAKKG-GKRALPFAVYVVSGETEGDR 181
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
GGAPV C++DA+ CS + + +W+P+++LVPLVLGL+K+NPRY+P+LR TFT PQSLG
Sbjct: 182 GGAPVRCVEDAAVLCSKWGEATEEWSPLVVLVPLVLGLDKLNPRYLPSLRATFTLPQSLG 241
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+ GGKPGAST+++GVQ + A+YLDPH+ Q V + ++LE DTS YH
Sbjct: 242 VAGGKPGASTHLIGVQGDQAMYLDPHENQQVFAVTPENLELDTSFYH 288
>gi|302787965|ref|XP_002975752.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
gi|300156753|gb|EFJ23381.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
Length = 358
Score = 328 bits (840), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 173/324 (53%), Positives = 227/324 (70%), Gaps = 29/324 (8%)
Query: 78 WTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDA 131
WTAAV+R V G +RRI E ++G SS S IWLLG C+++ + DE ++
Sbjct: 1 WTAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKES 58
Query: 132 AGNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR 189
++ +A+F DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HR
Sbjct: 59 TSSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHR 118
Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
LGR WR+ ++P+ REY+EILH F DS + PFSIHN ++AG YGLAAGSW+GPYA+C
Sbjct: 119 LGRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALC 177
Query: 248 RSWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 306
+ EALAR G G + +A+YVVSGD GERGGAPV+ D + C
Sbjct: 178 HAIEALAR----NDGRGREGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC-------- 225
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YL
Sbjct: 226 ---PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYL 282
Query: 367 DPHDVQPVINIGKDDLEADTSTYH 390
DPH+VQ V+++ + LE D+++YH
Sbjct: 283 DPHEVQKVVSVSGESLEFDSASYH 306
>gi|79597805|ref|NP_850722.3| cysteine protease ATG4b [Arabidopsis thaliana]
gi|332646467|gb|AEE79988.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 360
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 173/306 (56%), Positives = 226/306 (73%), Gaps = 2/306 (0%)
Query: 25 SLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKR 84
S S+ S+ SS++KS+ +L S + S+ V + E+S S V + WT +K
Sbjct: 26 SPTSLVSDSASSDNKSNL-TLCSDVVASSSPVSQLCREASTSGHNPVCTTHSSWTVILKT 84
Query: 85 L-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQD 143
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QD
Sbjct: 85 ASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQD 144
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
FSS IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D
Sbjct: 145 FSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPAD 204
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
+Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 205 EKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDD 264
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++
Sbjct: 265 KHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDR 324
Query: 324 VNPRYI 329
VNP +
Sbjct: 325 VNPSHF 330
>gi|186511209|ref|NP_001118859.1| cysteine protease ATG4b [Arabidopsis thaliana]
gi|62318602|dbj|BAD95023.1| hypothetical protein [Arabidopsis thaliana]
gi|332646469|gb|AEE79990.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 267
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 156/244 (63%), Positives = 198/244 (81%)
Query: 86 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 145
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QDFS
Sbjct: 1 MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 60
Query: 146 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
S IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D +
Sbjct: 61 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 120
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 121 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 180
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 325
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 181 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 240
Query: 326 PRYI 329
PR++
Sbjct: 241 PRFV 244
>gi|413917967|gb|AFW57899.1| hypothetical protein ZEAMMB73_419246 [Zea mays]
Length = 290
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 143/209 (68%), Positives = 172/209 (82%), Gaps = 2/209 (0%)
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
S G GCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +L LFGDSE FSIHN
Sbjct: 14 SLTGKGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLRLFGDSEACAFSIHN 73
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGER 283
LLQA + YGLAAGSW+GPYAMCR+W+ L R R A+ G ++ PMA+YVVSGDEDGER
Sbjct: 74 LLQARRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGER 133
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
GGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLG
Sbjct: 134 GGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLG 193
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
I+GGKPG STYI GVQ++ A+YLDPH+VQ
Sbjct: 194 ILGGKPGTSTYIAGVQDDRALYLDPHEVQ 222
>gi|414869447|tpg|DAA48004.1| TPA: hypothetical protein ZEAMMB73_510335 [Zea mays]
gi|414869466|tpg|DAA48023.1| TPA: hypothetical protein ZEAMMB73_786179 [Zea mays]
Length = 472
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 142/205 (69%), Positives = 168/205 (81%), Gaps = 2/205 (0%)
Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 215
FD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR RKP +KP++ +Y+ +LHLFGD
Sbjct: 34 FDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSCRKPPEKPYNPDYIGVLHLFGD 93
Query: 216 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIY 273
SE FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L R A+ G ++ PMA+Y
Sbjct: 94 SEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIHTNREQADAVDGKENFPMALY 153
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
VVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+
Sbjct: 154 VVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLK 213
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGV 358
TF FPQSL I+GGKPG STYI GV
Sbjct: 214 ETFMFPQSLCILGGKPGTSTYIAGV 238
>gi|353441084|gb|AEQ94126.1| putative cysteine protease [Elaeis guineensis]
Length = 169
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 113/165 (68%), Positives = 130/165 (78%)
Query: 96 ERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKG 155
+ +LG S T SSTSDIWLLG C+K++ +E+ G NG A F +DFSSRI I+YRKG
Sbjct: 2 QELLGTSSTDALSSTSDIWLLGKCYKLSPEESSGGTDHGNGSAAFLEDFSSRIWITYRKG 61
Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 215
FD IGDSK TSDV WGCM+RSSQMLVAQALLFH LGR WRKP QKP D +Y+EILHLFGD
Sbjct: 62 FDAIGDSKFTSDVRWGCMIRSSQMLVAQALLFHHLGRSWRKPSQKPHDSKYIEILHLFGD 121
Query: 216 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
SE FSIHNLL+AGKAYGLAA WVGPYAMCR+WE + R +R +
Sbjct: 122 SEACAFSIHNLLEAGKAYGLAAREWVGPYAMCRTWETITRAKREQ 166
>gi|413941968|gb|AFW74617.1| hypothetical protein ZEAMMB73_836919 [Zea mays]
Length = 416
Score = 221 bits (562), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 132/251 (52%), Positives = 162/251 (64%), Gaps = 51/251 (20%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
L F +DFSSRI I+YRKGFD I D K+TSDV WGCM+RSSQMLVAQAL+FH LGR WRK
Sbjct: 29 LQVFLEDFSSRIWITYRKGFDAISDFKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRK 88
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
P +K L++ +
Sbjct: 89 PPEK------------------------TLIRTNR------------------------- 99
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
++A+ G ++ PM +YVVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVP
Sbjct: 100 EQADAVDGKENFPMELYVVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVP 159
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI- 375
LVLGL+K+NPRYIP L+ TF FPQSLGI+G KPG STYI GVQ++ A+YLDPH+VQ V+
Sbjct: 160 LVLGLDKINPRYIPLLKETFMFPQSLGILGVKPGTSTYIAGVQDDRALYLDPHEVQMVLA 219
Query: 376 NIG-KDDLEAD 385
NI + LE D
Sbjct: 220 NIKWPETLETD 230
>gi|156396522|ref|XP_001637442.1| predicted protein [Nematostella vectensis]
gi|156224554|gb|EDO45379.1| predicted protein [Nematostella vectensis]
Length = 342
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 105/301 (34%), Positives = 156/301 (51%), Gaps = 30/301 (9%)
Query: 101 PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN----GLAEFNQDFSSRILISYRKGF 156
P +T + S IWLLG C+ E + + L EF++ F+S I ++YR+ F
Sbjct: 12 PLKTNFNED-SPIWLLGRCYHAKNYEYTSEQSKQQCQILSLEEFHRHFTSLIWLTYRRSF 70
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---KPLQKPFDREYVEILHLF 213
+ S +TSD GWGCMLRS QM++A L+FH L + WR + + + Y IL F
Sbjct: 71 VQLNGSNLTSDCGWGCMLRSGQMMLASGLIFHFLKKDWRISGRCHSREQEHYYRVILQFF 130
Query: 214 GDS---ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
GD E SPFS+H L+ G+ G AG W GP ++ E +++
Sbjct: 131 GDQDDEERSPFSLHRLVTLGQHTGKQAGDWYGPASVAHILE--------------KAMIS 176
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVN 325
A + + D + V ID+ R C+ Q D W P+++LVP+ LG E +N
Sbjct: 177 ATHPLLHDINIYVAQDCTVYIDEVKRVCTHCRTHQRDCSSGKWRPVIILVPMRLGGEALN 236
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
P YIP ++ FT Q +GI+GG+P S Y VG Q+E I+LDPH QPV++ ++ +
Sbjct: 237 PIYIPCVKSLFTLDQCIGIIGGRPKHSLYFVGFQDEKMIHLDPHYCQPVVDTTQEKFPTE 296
Query: 386 T 386
+
Sbjct: 297 S 297
>gi|145345840|ref|XP_001417407.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577634|gb|ABO95700.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 348
Score = 184 bits (467), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 103/262 (39%), Positives = 149/262 (56%), Gaps = 16/262 (6%)
Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 174
+LGV + DE + ++ + +D+ SR ++YR+GF+ +G +K +D GWGC L
Sbjct: 1 MLGVTYWSKDDECNAEKY-DDARRAWERDWGSRCWMTYRRGFEALGRTKWRTDAGWGCTL 59
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAY 233
RS+QM+VA AL H GR WR+ ++ D E V+ +L +F D ++PFSIH++ + A+
Sbjct: 60 RSAQMMVANALSIHTRGRHWRRQVKAKEDDESVDHVLSMFIDDASAPFSIHSVCETTTAW 119
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG-DEDGERGGAPVVCID 292
G G W P MCR++ AL G +A++VV G +ED GG P ID
Sbjct: 120 GAPPGRWFEPSVMCRAFSALIEAN------GDLRNQIAVHVVGGQNEDDSAGGVPT--ID 171
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
D G+A +LL VPLVLG+ +N RYI LR F QS+G++GG+P A
Sbjct: 172 DGELRAKSADVGKA----LLLFVPLVLGVGRNINTRYISQLRSIIAFKQSIGVIGGRPNA 227
Query: 352 STYIVGVQEESAIYLDPHDVQP 373
S Y+VG ++ YLDPH VQP
Sbjct: 228 SLYLVGHSDDVFFYLDPHTVQP 249
>gi|37991904|gb|AAR06350.1| putative autophagy, 3'-partial [Oryza sativa Japonica Group]
Length = 207
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 94/179 (52%), Positives = 124/179 (69%), Gaps = 7/179 (3%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K K S+LS +F+S FS+FE + +SSA+ H+ S W+ ++R+ GSM R
Sbjct: 35 KQLKNSILSCVFSSPFSIFEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF---- 90
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ + ++SD+W LG C+K++ +E + +G A F +DFSSRI I+YRKGFD
Sbjct: 91 LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 147
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE
Sbjct: 148 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSE 206
>gi|328707620|ref|XP_001947296.2| PREDICTED: cysteine protease ATG4B-like isoform 1 [Acyrthosiphon
pisum]
gi|328707622|ref|XP_003243448.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Acyrthosiphon
pisum]
gi|328707624|ref|XP_003243449.1| PREDICTED: cysteine protease ATG4B-like isoform 3 [Acyrthosiphon
pisum]
gi|328707626|ref|XP_003243450.1| PREDICTED: cysteine protease ATG4B-like isoform 4 [Acyrthosiphon
pisum]
Length = 402
Score = 181 bits (458), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 150/289 (51%), Gaps = 32/289 (11%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + +W+LG + D L + D SR+ +YRKGF IG++ T
Sbjct: 40 IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
SD GWGCMLR QM++ QAL+F LGR WR K D +Y++IL +F D ++P+SIH
Sbjct: 89 SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ G ++G G W GP + + + LA L ++ V+ D
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLATMDE---------LSSLVFHVALDN------ 192
Query: 286 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
+ I++ + C+V + + W P++L++PL LG+ +NP Y+ +++ FTFPQSL
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKMCFTFPQSL 250
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
G++GG+P + Y +G I+LDPH Q + + D+E + HS
Sbjct: 251 GVIGGRPNHALYFIGFVGNDVIFLDPHTTQQIGMLPNKDIETEHKIDHS 299
>gi|307174864|gb|EFN65142.1| Cysteine protease ATG4D [Camponotus floridanus]
Length = 477
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 146/303 (48%), Gaps = 42/303 (13%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
S S +WLLG C+ ++ L A+ N + EF +DF SR
Sbjct: 86 SKESPVWLLGQCYLKKSEDPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFVSR 145
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF----- 202
I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR ++P
Sbjct: 146 IWLTYRREFQILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQPIETLQQ 205
Query: 203 ---DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
DR + I+ FGD SPFSIH L+ G + G AG W GP ++ C
Sbjct: 206 RLDDRNHRMIIKWFGDQSESPFSIHRLVLLGASAGKRAGDWYGPSSVAHLLSQAVECASK 265
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
++ L A+YV V + D C W ++LLVPL L
Sbjct: 266 QSNSNFDHL--AVYVAQD---------CAVYLQDVENICRT---PDGKWKALVLLVPLRL 311
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G +K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q +++ K
Sbjct: 312 GADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDVWK 371
Query: 380 DDL 382
+D
Sbjct: 372 NDF 374
>gi|384253649|gb|EIE27123.1| peptidase C54 [Coccomyxa subellipsoidea C-169]
Length = 362
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 104/258 (40%), Positives = 144/258 (55%), Gaps = 36/258 (13%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D SRI ++YR+GF PI S ITSDVGWGC LRS QML+AQAL++H +GR WR+ L+ +
Sbjct: 23 DLMSRIWMTYRRGFPPICGSGITSDVGWGCTLRSGQMLLAQALVYHLVGRQWRRKLEAAY 82
Query: 203 DREYVEILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
E ++L FGD E PFSIHN+ G+ +G+ AG W+GP +C + + +
Sbjct: 83 PEEVAQVLQWFGDQACEQRPFSIHNMCTTGQTHGVKAGDWLGPSGLCHTLADMVN-KVQP 141
Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-------WTPILL 313
GL C+ + G GGAPV+C SR + F +G AD +
Sbjct: 142 GGLQCR-----VVATFG------GGAPVLC---TSRLATAF-EGGADRSGGEVGSSGSEE 186
Query: 314 LVPLVLGLE-----------KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES 362
P GL K+NPRY L+ T+PQS+GIVGG+P +S Y +G+Q++
Sbjct: 187 SGPAGQGLLLLIPLMLGLNGKINPRYCAQLQQLLTWPQSVGIVGGRPSSSLYFIGLQDQH 246
Query: 363 AIYLDPHDVQPVINIGKD 380
+YLDPH+VQ V + D
Sbjct: 247 VLYLDPHEVQEVASEAAD 264
>gi|443684303|gb|ELT88258.1| hypothetical protein CAPTEDRAFT_225251 [Capitella teleta]
Length = 410
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 106/300 (35%), Positives = 157/300 (52%), Gaps = 34/300 (11%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
+ S +W+LG + + D LAE +D SR+ ++YRKGFDPIG S TSD
Sbjct: 30 TESPVWILGKQYSVLYD-----------LAELKKDVKSRLWLTYRKGFDPIGGSGPTSDQ 78
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM++AQ+L+ LGR WR K +D +Y EIL +F D ++ +S+ +
Sbjct: 79 GWGCMLRCGQMMLAQSLICRHLGRDWRWTKDK-YDPKYFEILRMFQDKRSAKYSLQVIAS 137
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD------EDGE 282
G + G A G W GP + + L C E + + V+ D +
Sbjct: 138 MGTSEGKAIGEWFGPNTISQVLRKL--CVSDEWSNLVVHVALDNTVIIDDVFCLCKSSKK 195
Query: 283 RGGAPVVCIDDASRHCSVFS-----------KGQAD-WTPILLLVPLVLGLEKVNPRYIP 330
P+ + A +F+ G+ D W P+LL+VPL LGL ++NP YIP
Sbjct: 196 ESNEPIPGVHAACASALLFNGHDPTAEGHDPSGEDDSWRPLLLIVPLRLGLSEINPVYIP 255
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
L+ TF QS+GI+GGKP + + +G E+ +Y+DPH QP +++ + E+D S YH
Sbjct: 256 FLKTCLTFKQSVGIIGGKPNHAHWFIGFLEDELVYMDPHTTQPFVDVTQPG-ESDAS-YH 313
>gi|428170513|gb|EKX39437.1| hypothetical protein GUITHDRAFT_143439 [Guillardia theta CCMP2712]
Length = 332
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 150/285 (52%), Gaps = 40/285 (14%)
Query: 113 IWLLGVCHKIA------------QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG 160
+WLLGV + +A ++ + D + N F D SR+ SYR F PI
Sbjct: 70 VWLLGVRYTLAPPPMGQRGEGRETEQTVVDESQN-----FKLDMWSRLWFSYRYNFHPIS 124
Query: 161 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR---EYVEILHLFGDSE 217
+++T+D GWGCM+RS QML+ QAL+ H LGR WR ++ +Y ++L +F D
Sbjct: 125 GTELTTDTGWGCMIRSGQMLIGQALVHHHLGRDWRLSHTSKYNELPSDYRKVLEMFLDHP 184
Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC-QSLPMAIYVVS 276
+P SIH+ ++AG+ G AG+W GP +C ++ L A LG +L + Y
Sbjct: 185 CAPLSIHSFVRAGQQVGKKAGTWFGPNTVCSAFSKL----HAGGALGSDNNLQLLAY--- 237
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
DG G D+ QA P+ +L+P LG+ V+P YIP + F
Sbjct: 238 ---DGNDG-------DNTIYKSEALELLQAG--PLFILLPTRLGVSSVDPSYIPKISHVF 285
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
+FPQSLG +GGKP ++ Y + Q E+ YLDPH QP+INI + +
Sbjct: 286 SFPQSLGFIGGKPSSAHYFIASQGEAVYYLDPHTPQPLINISEKE 330
>gi|301775535|ref|XP_002923195.1| PREDICTED: cysteine protease ATG4B-like [Ailuropoda melanoleuca]
Length = 405
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 108/300 (36%), Positives = 150/300 (50%), Gaps = 40/300 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F PIG + TS
Sbjct: 34 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 80
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 81 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 140
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 282
Q G G + G W GP + + + LA +A+++ + ED
Sbjct: 141 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 192
Query: 283 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
R G P D+SRHC+ F G A W P++LL+PL LGL +N Y+
Sbjct: 193 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 252
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + AD S +
Sbjct: 253 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 312
>gi|281340990|gb|EFB16574.1| hypothetical protein PANDA_012287 [Ailuropoda melanoleuca]
Length = 369
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 108/300 (36%), Positives = 150/300 (50%), Gaps = 40/300 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F PIG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 65
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 125
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 282
Q G G + G W GP + + + LA +A+++ + ED
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 177
Query: 283 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
R G P D+SRHC+ F G A W P++LL+PL LGL +N Y+
Sbjct: 178 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 237
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + AD S +
Sbjct: 238 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 297
>gi|427787309|gb|JAA59106.1| Putative peptidase family c54 [Rhipicephalus pulchellus]
Length = 517
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 102/258 (39%), Positives = 143/258 (55%), Gaps = 22/258 (8%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---K 196
F +DFSSR+ +YR+ F PI + ITSD GWGCMLRSSQM++AQA++ H LGR WR
Sbjct: 181 FLEDFSSRLWFTYRREFPPIPGTDITSDCGWGCMLRSSQMMLAQAVVTHVLGRQWRYRRN 240
Query: 197 PLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EAL 253
+ D + +++ LFGD + SPFS+H L+Q G G AG W GP + EAL
Sbjct: 241 NQTEASDYVHRQVVRLFGDRTASASPFSLHKLVQMGHESGKQAGDWYGPSSAAYILKEAL 300
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS-VFSKGQADWTPIL 312
+ E L L + IYV + ++D C S G W ++
Sbjct: 301 EGACQTEQLL----LDLRIYVAQD---------CTIYLEDVRALCRGTRSNGAPLWRSVI 347
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
+LVP+ LG E++NP YIP ++ + P +G++GG+P S Y +G Q E IYLDPH VQ
Sbjct: 348 ILVPVRLGGEQLNPTYIPCVKGMLSHPNCIGVIGGRPRHSLYFLGWQGEKVIYLDPHYVQ 407
Query: 373 PVINIGKDDLEADTSTYH 390
+++G D D +YH
Sbjct: 408 EAVDVGPQDFPLD--SYH 423
>gi|432853687|ref|XP_004067831.1| PREDICTED: cysteine protease ATG4B-like [Oryzias latipes]
Length = 390
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 99/259 (38%), Positives = 136/259 (52%), Gaps = 13/259 (5%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YRK F PIG + TSD GWGCMLR QM++A+AL+ LGR WR +
Sbjct: 45 DVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILAEALMCRHLGRDWRWARGRRQ 104
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
EYV IL+ F D + S +SIH + Q G G G W GP A+ +W L
Sbjct: 105 REEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRL 164
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 313
A + + + + D E G C++ A C++ + A W P++L
Sbjct: 165 AVHVAMDNTVIIEEIKRLCMPWLDIGDREEAGELNGCLEGA---CALVEEETALWKPLVL 221
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
L+PL LGL +N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP
Sbjct: 222 LIPLRLGLSDINEAYIDTLKQCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQP 281
Query: 374 VINIGKDDLEADTSTYHSE 392
+ +D D TYH +
Sbjct: 282 AVEPSEDGQVPD-ETYHCQ 299
>gi|291226947|ref|XP_002733451.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
kowalevskii]
Length = 356
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 98/267 (36%), Positives = 139/267 (52%), Gaps = 12/267 (4%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + +D + E D SRI I+YRK F IG + TSD GWGC
Sbjct: 26 VWILGKAYHLIRDRS-----------ELLADIKSRIWITYRKNFSAIGGTGPTSDNGWGC 74
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQALL LGR WR ++ + Y +IL LF D + S +SIH + Q G
Sbjct: 75 MLRCGQMILAQALLCKHLGREWRWESREHQNETYCKILKLFLDRKDSCYSIHQIAQMGVG 134
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + L + S+ I VV R C
Sbjct: 135 EGKSIGQWFGPNTVAQVLRKLTLFDDWSSIAVHISMDNTI-VVEDIRKLCRTPLFTECAS 193
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
+ S+ + G W P++L +PL LGL ++NP Y+ L+ FT QSLG++GGKP +
Sbjct: 194 PKAASASLENGGTTYWKPLVLFIPLRLGLTEINPLYLDVLKKCFTLKQSLGMIGGKPNHA 253
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGK 379
Y +G ++ +YLDPH QPV++I K
Sbjct: 254 HYFIGFYGKTLVYLDPHTTQPVVDINK 280
>gi|457866467|dbj|BAM93578.1| autophagy related protein 4 [Vigna unguiculata]
Length = 219
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/122 (77%), Positives = 105/122 (86%), Gaps = 1/122 (0%)
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 329
MAIYVVSGDEDGERGGAPVVCI+DA +HCS FS+GQA WTP+LLLVPLVLGL+KVNPRYI
Sbjct: 1 MAIYVVSGDEDGERGGAPVVCIEDAFKHCSEFSRGQAAWTPLLLLVPLVLGLDKVNPRYI 60
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TST 388
P L TF FPQSLGI+GGKPGASTYI+GVQ E A YLDPHDVQ V+NI D E + TS+
Sbjct: 61 PLLHSTFKFPQSLGIMGGKPGASTYIIGVQSEKAFYLDPHDVQTVVNISGDTQEPNSTSS 120
Query: 389 YH 390
YH
Sbjct: 121 YH 122
>gi|66773074|ref|NP_001019605.1| cysteine protease ATG4A [Danio rerio]
gi|66267494|gb|AAH95617.1| Zgc:111958 [Danio rerio]
Length = 375
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 93/269 (34%), Positives = 139/269 (51%), Gaps = 33/269 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG C+ + ++ E D SR+ +YRK F PIG + +SD GWGC
Sbjct: 26 VWILGACYNVKTKKS-----------ELLSDVRSRLWFTYRKKFSPIGGTGPSSDAGWGC 74
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR WR +K +EY IL F D + S +SIH + Q G
Sbjct: 75 MLRCGQMILAQALICSHLGRDWRWDPEKHQPKEYQRILDCFLDKKDSCYSIHQMAQMGVG 134
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +++YV + V I+
Sbjct: 135 EGKSVGEWYGPNTVAQVLKKLALFDDWNS--------LSVYVSMDN---------TVVIE 177
Query: 293 DASRHC-----SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 347
D + C + S+ DW P+LL++PL +G+ +NP YI L+ F PQS G++GG
Sbjct: 178 DIKKLCVRADLQLQSQQPLDWRPLLLVIPLRMGINSINPVYIQALKECFKMPQSCGVLGG 237
Query: 348 KPGASTYIVGVQEESAIYLDPHDVQPVIN 376
KP + Y +G ++ IYLDPH Q ++
Sbjct: 238 KPNLAYYFIGFIDDELIYLDPHTTQQAVD 266
>gi|355669955|gb|AER94692.1| ATG4 autophagy related 4-like protein B [Mustela putorius furo]
Length = 390
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 103/281 (36%), Positives = 141/281 (50%), Gaps = 26/281 (9%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 65
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQSDSYFNVLNAFIDRKDSYYSIHQI 125
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA + L V+ RG
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 184
Query: 287 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
P D+SRHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 185 PCAGATALPTDSSRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 244
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
PQSLG++GGKP ++ Y +G E IYLDPH QP + +
Sbjct: 245 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEL 285
>gi|73994337|ref|XP_851977.1| PREDICTED: cysteine protease ATG4B [Canis lupus familiaris]
Length = 394
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 106/296 (35%), Positives = 145/296 (48%), Gaps = 27/296 (9%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 129
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA + L V+ RG
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 188
Query: 287 PV----VCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
P D+SRHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 189 PCAGAAALPADSSRHCNGFPAGAEVTNRLAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 248
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +H +
Sbjct: 249 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFIPDES-FHCQ 303
>gi|189233733|ref|XP_971091.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270015047|gb|EFA11495.1| hypothetical protein TcasGA2_TC014208 [Tribolium castaneum]
Length = 453
Score = 170 bits (431), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 152/303 (50%), Gaps = 41/303 (13%)
Query: 98 VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
+LG I S +SD LG Q ++ ++ + G F +DF SR+ ++YR+ F
Sbjct: 70 LLGKCYRRIESPSSDSTELGTDVAAFQSQSEIASSDDEGFEGFKKDFISRLWLTYRREFP 129
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDRE-YVE------I 209
+ S +SD GWGCMLRS QML+AQAL+ H LGR WR +P +P RE ++E I
Sbjct: 130 ILNGSNYSSDCGWGCMLRSGQMLIAQALVCHILGRDWRWQPDHQPTTRESFIEVVNHRKI 189
Query: 210 LHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 267
+ FGD S SPFSIH L+ G+A G AG W GP G
Sbjct: 190 IKWFGDKPSRNSPFSIHTLVALGEASGKKAGDWYGP------------------GFVAHL 231
Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG--------QADWTPILLLVPLVL 319
A S ED + VC+ ++ C+V+ K W ++LL+P+ L
Sbjct: 232 FRQAFKRAS--EDNYEFDSLTVCV---AQDCAVYIKDVMEECTDKNGKWKSLILLIPVRL 286
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G EK N Y P L F+ Q +GI+GG+P S Y VG Q++ I+LDPH Q V+++
Sbjct: 287 GAEKFNSIYAPCLTTLFSLKQCIGIIGGRPKHSLYFVGYQDDKLIHLDPHYCQEVVDVWA 346
Query: 380 DDL 382
D
Sbjct: 347 VDF 349
>gi|308802424|ref|XP_003078525.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
gi|116056978|emb|CAL51405.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
Length = 424
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 108/308 (35%), Positives = 157/308 (50%), Gaps = 53/308 (17%)
Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 174
+ GV H ++ + G+ + G E+ +D+ SR ++YR+GF+ +G +K +D GWGC L
Sbjct: 42 MFGVTH-WDRETSSGERSNEVGRREWERDWRSRCWMTYRRGFEALGRTKWCTDAGWGCTL 100
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDRE----------------------------- 205
RS+QM++A AL H GR WR+ +Q E
Sbjct: 101 RSAQMMLANALSIHSRGRHWRREVQLVAVHENETADDGSKSPAVSFLSGVVNKLKIPQSE 160
Query: 206 --------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
+IL LF D +PFSIH + + +G G W P MCR++EAL
Sbjct: 161 RTRAGSDAQEDILRLFADEVGAPFSIHRVCEKTTEWGAPPGRWFEPSVMCRAFEALV--- 217
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 317
AE LG + + ++VVSG E GE GG P V D+A G+A +LL VP+
Sbjct: 218 -AEHDLGSE---LTVHVVSGRE-GEDGGVPTV--DEAEVRAKSADVGKA----LLLFVPV 266
Query: 318 VLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
VLG+ + +N RY+ LR F QS+GIVGG+P +S Y+VG ++ YLDPH VQ +
Sbjct: 267 VLGVGRTINARYLSQLRSMMAFKQSVGIVGGRPNSSLYLVGHSDDVFFYLDPHTVQVASS 326
Query: 377 IGKDDLEA 384
+ D E+
Sbjct: 327 MVTMDFES 334
>gi|410969807|ref|XP_003991383.1| PREDICTED: cysteine protease ATG4B [Felis catus]
Length = 445
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 105/296 (35%), Positives = 144/296 (48%), Gaps = 27/296 (9%)
Query: 109 STSDIWLLGVCHKIA--QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I+ +DE L D A SR+ +YRK F IG + TS
Sbjct: 74 TSEPVWILGRKYSISTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 120
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 121 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 180
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA + L V+ R G
Sbjct: 181 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMEDIRRLCRAGL 239
Query: 287 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
P D RHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 240 PCAGAAALPADPGRHCNGFPAGAEVSNRLAPWRPLVLLIPLRLGLTDINEAYVETLKHCF 299
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +H +
Sbjct: 300 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFADSCFIPDES-FHCQ 354
>gi|74136555|ref|NP_777364.3| cysteine protease ATG4A [Mus musculus]
gi|61211821|sp|Q8C9S8.2|ATG4A_MOUSE RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
cysteine endopeptidase; AltName: Full=Autophagin-2;
AltName: Full=Autophagy-related cysteine endopeptidase
2; AltName: Full=Autophagy-related protein 4 homolog A
gi|59809037|gb|AAH89500.1| Atg4a protein [Mus musculus]
gi|74193939|dbj|BAE36898.1| unnamed protein product [Mus musculus]
Length = 396
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 98/299 (32%), Positives = 144/299 (48%), Gaps = 50/299 (16%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D T+H
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGL-VDDQTFH 298
>gi|195113543|ref|XP_002001327.1| GI10728 [Drosophila mojavensis]
gi|193917921|gb|EDW16788.1| GI10728 [Drosophila mojavensis]
Length = 682
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 101/273 (36%), Positives = 143/273 (52%), Gaps = 15/273 (5%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + L ++ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 262 AAENQLAESPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 321
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S+ SPFSIH L++ G+ G
Sbjct: 322 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 381
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDE---DGERGGAP 287
G W GP ++ + AL R S+ +A IY+ +E E P
Sbjct: 382 KPGDWYGPASVSYLLKHALEHAARENADFDNISVYVAKDCTIYIQDIEELCSIPEPAPKP 441
Query: 288 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 347
V A R S K W +++L+PL LG +K+NP Y L+L + LGI+GG
Sbjct: 442 HVPWQQAKRSTSDAPKPDQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEYCLGIIGG 501
Query: 348 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
KP S Y VG QE+ I+LDPH Q ++++ ++
Sbjct: 502 KPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQE 534
>gi|417410362|gb|JAA51656.1| Putative cysteine protease required for autophagy, partial
[Desmodus rotundus]
Length = 396
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 144/289 (49%), Gaps = 27/289 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADMPS 195
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
E P+ +A+ H S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 196 ESSHDPL----NATNHNKAISACCPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
LG +GGKP + Y +G + I+LDPH Q ++ ++ + D T+H
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGM-VDDQTFH 299
>gi|332026942|gb|EGI67039.1| Cysteine protease ATG4D [Acromyrmex echinatior]
Length = 392
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 102/297 (34%), Positives = 147/297 (49%), Gaps = 45/297 (15%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
S S +WLLG+C+ + L A+ N + EF +DF SR
Sbjct: 6 SKESPVWLLGLCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 65
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREY 206
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P Q + +
Sbjct: 66 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQSTDESSH 125
Query: 207 VEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRAE 260
I+ FGD T SPFSIH L+ G + G AG W GP + +C++ E RA
Sbjct: 126 RMIIKWFGDQPTPESPFSIHKLVSLGASTGKRAGDWYGPSSVAHLLCQAME------RAS 179
Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 320
+ +A+YV + V C D R ++LLVPL LG
Sbjct: 180 EDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGGR------------KALILLVPLRLG 227
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
+K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q +++
Sbjct: 228 ADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDV 284
>gi|27883848|ref|NP_777363.1| cysteine protease ATG4B [Mus musculus]
gi|26324650|dbj|BAC26079.1| unnamed protein product [Mus musculus]
gi|26327423|dbj|BAC27455.1| unnamed protein product [Mus musculus]
gi|26344632|dbj|BAC35965.1| unnamed protein product [Mus musculus]
gi|27763983|emb|CAD43220.1| autophagin-1 [Mus musculus]
Length = 393
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 151/303 (49%), Gaps = 41/303 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
A + C+ D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRANLPCVGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDES-F 299
Query: 390 HSE 392
H +
Sbjct: 300 HCQ 302
>gi|27763985|emb|CAD43221.1| autophagin-2 [Mus musculus]
gi|148675648|gb|EDL07595.1| mCG64870 [Mus musculus]
Length = 396
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 97/299 (32%), Positives = 142/299 (47%), Gaps = 50/299 (16%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPFKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + L + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLTLFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTVSNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D T+H
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGL-VDDQTFH 298
>gi|149711769|ref|XP_001497815.1| PREDICTED: cysteine protease ATG4B [Equus caballus]
Length = 393
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 109/309 (35%), Positives = 148/309 (47%), Gaps = 53/309 (17%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 269
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 188
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEK 323
A G A D+ RHC+ F G A W P++LL+PL LGL
Sbjct: 189 CA------------GAAAFPA--DSDRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTD 234
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFI 294
Query: 384 ADTSTYHSE 392
D S +H +
Sbjct: 295 PDES-FHCQ 302
>gi|26334447|dbj|BAC30924.1| unnamed protein product [Mus musculus]
Length = 396
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 97/299 (32%), Positives = 143/299 (47%), Gaps = 50/299 (16%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+Y + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYDSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D T+H
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGL-VDDQTFH 298
>gi|449676306|ref|XP_002158689.2| PREDICTED: cysteine protease ATG4C-like [Hydra magnipapillata]
Length = 442
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 102/293 (34%), Positives = 148/293 (50%), Gaps = 23/293 (7%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNN------GLAEFNQDFSSRILISYRKGFDPIGDS 162
S S IWLLG C+ Q E A N G+ F +DFSS I +SYRK F + +S
Sbjct: 63 SDSPIWLLGRCYYAKQAEYDSKNAVQNTQYKIHGIDCFFEDFSSLIYLSYRKHFSQLANS 122
Query: 163 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGD--SET 218
+TSD GWGCMLR+ QML+A ALL H L WR +K ++ Y+ IL F D S+
Sbjct: 123 NLTSDSGWGCMLRTGQMLLANALLIHMLKEGWRISERKYTEKNYIYRMILRFFNDENSDN 182
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 277
SPFS+H L++ G G W GP ++ + A + S P + + V
Sbjct: 183 SPFSLHELVRIGSK---KPGEWYGPTSVAHTLSA---------AVNLTSHPVLDTFRVYV 230
Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
D V+ ++C+ + + W +L+LVP+ LG + +NP YIP L+ T
Sbjct: 231 ANDCTVYIKDVISTSTKCKNCTKKTCQEKFWRSMLILVPIRLGSDGLNPIYIPCLKALLT 290
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+GI+GG+P S Y VG Q + I LDPH +Q +++ + ++ H
Sbjct: 291 LDYCVGIIGGRPKHSLYFVGFQGKKLINLDPHYLQEYVDMTTQEFPVESFRCH 343
>gi|338729393|ref|XP_001490718.3| PREDICTED: cysteine protease ATG4A [Equus caballus]
Length = 398
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 148/289 (51%), Gaps = 27/289 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDEDG 281
G + G W GP A+ W +LA + + + + I +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTAG 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
E +P ++ ++R S S G W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 E---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
LG +GGKP + Y +G + I+LDPH Q ++ +++ D T+H
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFH 301
>gi|348513452|ref|XP_003444256.1| PREDICTED: cysteine protease ATG4B-like [Oreochromis niloticus]
Length = 391
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 98/271 (36%), Positives = 142/271 (52%), Gaps = 17/271 (6%)
Query: 135 NGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
N L E ++ D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LG
Sbjct: 34 NALTEKDEILSDVTSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVCRHLG 93
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
R WR + EY+ +L+ F D + S +SIH + Q G G G W GP + + +
Sbjct: 94 RDWRWAKGQKQRDEYISLLNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLK 153
Query: 252 ALARCQRAETGLGCQSLPMAIYVVS--------GDEDGERGGAPVV--CIDDASRHCSVF 301
LA + ++ + + D GE G + C++ A C++
Sbjct: 154 KLAVFDTWSKVVVHVAMDNTVVIEEIKRLCMPWLDACGELEGVGELNGCLEGA---CAMA 210
Query: 302 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
+ A W P++LL+PL LGL +N YI TL+ F PQSLG++GGKP ++ Y +G E
Sbjct: 211 EEETALWRPLVLLIPLRLGLSDINDAYIETLKQCFMLPQSLGVIGGKPNSAHYFIGYVGE 270
Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
IYLDPH QP + +D D TYH +
Sbjct: 271 ELIYLDPHTTQPAVEPSEDSQVPD-ETYHCQ 300
>gi|148707985|gb|EDL39932.1| autophagy-related 4B (yeast), isoform CRA_a [Mus musculus]
Length = 390
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 150/303 (49%), Gaps = 41/303 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 65
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 125
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 177
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 178 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 237
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 238 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDES-F 296
Query: 390 HSE 392
H +
Sbjct: 297 HCQ 299
>gi|20071131|gb|AAH27184.1| Autophagy-related 4B (yeast) [Mus musculus]
gi|26353914|dbj|BAC40587.1| unnamed protein product [Mus musculus]
gi|74188242|dbj|BAE25791.1| unnamed protein product [Mus musculus]
Length = 393
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 150/303 (49%), Gaps = 41/303 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDES-F 299
Query: 390 HSE 392
H +
Sbjct: 300 HCQ 302
>gi|61211813|sp|Q8BGE6.2|ATG4B_MOUSE RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
cysteine endopeptidase; AltName: Full=Autophagin-1;
AltName: Full=Autophagy-related cysteine endopeptidase
1; AltName: Full=Autophagy-related protein 4 homolog B
Length = 393
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 150/303 (49%), Gaps = 41/303 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDES-F 299
Query: 390 HSE 392
H +
Sbjct: 300 HCQ 302
>gi|354474222|ref|XP_003499330.1| PREDICTED: cysteine protease ATG4B-like [Cricetulus griseus]
Length = 479
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 149/303 (49%), Gaps = 41/303 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 108 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 154
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 155 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 214
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 215 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 266
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYI 329
A + C D+ RHC+ F G W P++LL+PL LGL +N Y+
Sbjct: 267 RLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAYV 326
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 327 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDES-F 385
Query: 390 HSE 392
H +
Sbjct: 386 HCQ 388
>gi|187282046|ref|NP_001119770.1| uncharacterized protein LOC678769 [Rattus norvegicus]
gi|169642267|gb|AAI60890.1| LOC678769 protein [Rattus norvegicus]
Length = 406
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 143/303 (47%), Gaps = 54/303 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 327
D + C V G AD W P+LL+VPL LG+ ++NP
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
YI + F PQSLG +GGKP + Y +G + I+LDPH Q ++ + L D
Sbjct: 241 YIEAFKECFKMPQSLGALGGKPNNAYYFIGSLGDELIFLDPHTTQTFVDTEESGL-VDDH 299
Query: 388 TYH 390
T+H
Sbjct: 300 TFH 302
>gi|344239232|gb|EGV95335.1| Cysteine protease ATG4B [Cricetulus griseus]
Length = 394
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 149/303 (49%), Gaps = 41/303 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 23 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 69
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 129
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 181
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYI 329
A + C D+ RHC+ F G W P++LL+PL LGL +N Y+
Sbjct: 182 RLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAYV 241
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 242 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDES-F 300
Query: 390 HSE 392
H +
Sbjct: 301 HCQ 303
>gi|298231123|ref|NP_001177212.1| cysteine protease ATG4B [Sus scrofa]
gi|296874484|gb|ADH81747.1| autophagy related 4-like protein B [Sus scrofa]
Length = 393
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 146/309 (47%), Gaps = 53/309 (17%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQALL LGR WR + Y +LH F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALLCRHLGRGWRWTQWERQPDSYFSVLHAFMDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQ-SLP 269
Q G G + G W GP + + +W ALA E C+ SLP
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSALAVHVAMDNTVVMEEIRRLCRSSLP 188
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEK 323
R GA D+ RHC+ F W P++LL+PL LGL
Sbjct: 189 -------------RAGAAAFPA-DSDRHCNGFPAEAEVGPRPVPWRPLVLLIPLRLGLTD 234
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+N Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + L
Sbjct: 235 INAAYTETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVQVTDSCLI 294
Query: 384 ADTSTYHSE 392
D S +H +
Sbjct: 295 PDES-FHCQ 302
>gi|260795879|ref|XP_002592932.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
gi|229278156|gb|EEN48943.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
Length = 380
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 147/303 (48%), Gaps = 56/303 (18%)
Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
W+LGV + +D E D SSR+ +YRK F PIG + SD GWGCM
Sbjct: 32 WILGVGYNTVKDRQ-----------ELQNDISSRLWFTYRKNFTPIGGTGPMSDQGWGCM 80
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
LR QM++ QAL+ LGR WR +D +Y +IL LF D + S +SIH + Q G +
Sbjct: 81 LRCGQMMLGQALICRHLGRDWRWK-SAVYDNDYTKILQLFLDKKDSCYSIHQIAQMGVSE 139
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G + G W GP + + + LA + + +AI+V + V IDD
Sbjct: 140 GKSVGQWFGPNTVAQVLKKLALFEDWSS--------LAIHVAMDN---------TVIIDD 182
Query: 294 ASRHC-------------------------SVFSKGQA-DWTPILLLVPLVLGLEKVNPR 327
+ C S S+ A W P++L++PL LGL ++NP
Sbjct: 183 IKKLCRSARQPTPSQVTNSFLCNGVSAEQTSARSRSPALPWQPLMLIIPLRLGLSELNPV 242
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
Y L+ FT QSLG++GGKP + Y +G S +YLDPH QP + + + ++ S
Sbjct: 243 YTDCLKACFTLRQSLGMIGGKPNHAHYFIGYVGNSLVYLDPHTTQPAVEL-EGNVPIPDS 301
Query: 388 TYH 390
++H
Sbjct: 302 SFH 304
>gi|194381088|dbj|BAG64112.1| unnamed protein product [Homo sapiens]
Length = 510
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 139 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 185
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 186 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 245
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 246 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 297
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 298 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 357
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 358 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 404
>gi|119591686|gb|EAW71280.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_c
[Homo sapiens]
Length = 354
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|410206608|gb|JAA00523.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410247746|gb|JAA11840.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410295834|gb|JAA26517.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410352839|gb|JAA43023.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
Length = 393
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|397483835|ref|XP_003813096.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pan paniscus]
Length = 405
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|355565356|gb|EHH21845.1| hypothetical protein EGK_04999, partial [Macaca mulatta]
Length = 393
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|343961553|dbj|BAK62366.1| cysteine protease ATG4B [Pan troglodytes]
Length = 393
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|410036440|ref|XP_003309622.2| PREDICTED: cysteine protease ATG4B isoform 5 [Pan troglodytes]
Length = 509
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 403
>gi|90077212|dbj|BAE88286.1| unnamed protein product [Macaca fascicularis]
Length = 393
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|328874598|gb|EGG22963.1| hypothetical protein DFA_05093 [Dictyostelium fasciculatum]
Length = 432
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 89/251 (35%), Positives = 135/251 (53%), Gaps = 7/251 (2%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR-LGRPWR 195
+ EF +DFS+++ SYR+GF+ IGDS +D GWGCMLRS QML+A LL + +G+ W+
Sbjct: 88 IEEFLEDFSNKLWCSYRQGFECIGDSLFENDCGWGCMLRSGQMLLANVLLLNSPIGKDWK 147
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALA 254
KP + ++ +++ LF D ++PFSIHN+ G+ + G + G W P + + AL
Sbjct: 148 KPQNGEYPEDFYKVVRLFLDRPSAPFSIHNIALHGRNHLGKSIGEWFAPSNISNAIRALV 207
Query: 255 -RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS----VFSKGQADWT 309
+ G + + + V DD S + + + W
Sbjct: 208 YKYDNHLNGTSEEDSSDEEKEGKKKKGDNQCNLSVYVSDDGSLYIDQLLEIALRSDGSWM 267
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+L+L+P LG++ +N Y L +TFPQ+LGIVGGKP AS Y + Q+++ YLDPH
Sbjct: 268 PLLILIPTKLGIDTINEIYYRPLLDIYTFPQNLGIVGGKPRASLYFIASQDDNLFYLDPH 327
Query: 370 DVQPVINIGKD 380
VQ I D
Sbjct: 328 TVQNSIESDSD 338
>gi|332266032|ref|XP_003282019.1| PREDICTED: cysteine protease ATG4B [Nomascus leucogenys]
Length = 518
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 145 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 191
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 192 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 251
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 252 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 303
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 304 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 363
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 364 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 410
>gi|88192732|pdb|2D1I|A Chain A, Structure Of Human Atg4b
gi|88192733|pdb|2D1I|B Chain B, Structure Of Human Atg4b
Length = 398
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 27 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 73
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 74 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 133
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 134 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 185
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 186 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 245
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 246 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 292
>gi|380808290|gb|AFE76020.1| cysteine protease ATG4B isoform a [Macaca mulatta]
gi|383416899|gb|AFH31663.1| cysteine protease ATG4B isoform a [Macaca mulatta]
gi|384941198|gb|AFI34204.1| cysteine protease ATG4B isoform a [Macaca mulatta]
Length = 393
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|71891691|dbj|BAA76787.2| KIAA0943 protein [Homo sapiens]
Length = 396
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 290
>gi|47132611|ref|NP_037457.3| cysteine protease ATG4B isoform a [Homo sapiens]
gi|296434400|sp|Q9Y4P1.2|ATG4B_HUMAN RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
cysteine endopeptidase; AltName: Full=Autophagin-1;
AltName: Full=Autophagy-related cysteine endopeptidase
1; AltName: Full=Autophagy-related protein 4 homolog B;
Short=hAPG4B
gi|62822370|gb|AAY14919.1| unknown [Homo sapiens]
Length = 393
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|397483831|ref|XP_003813094.1| PREDICTED: cysteine protease ATG4B isoform 1 [Pan paniscus]
Length = 481
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 375
>gi|5262636|emb|CAB45756.1| hypothetical protein [Homo sapiens]
gi|12653857|gb|AAH00719.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Homo sapiens]
gi|27763981|emb|CAD43219.1| autophagin-1 [Homo sapiens]
gi|117646318|emb|CAL38626.1| hypothetical protein [synthetic construct]
gi|119591687|gb|EAW71281.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_d
[Homo sapiens]
gi|123981932|gb|ABM82795.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [synthetic
construct]
gi|168273130|dbj|BAG10404.1| ATG4 autophagy related 4 homolog B [synthetic construct]
Length = 393
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|410036442|ref|XP_003950065.1| PREDICTED: cysteine protease ATG4B [Pan troglodytes]
Length = 521
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 403
>gi|397483833|ref|XP_003813095.1| PREDICTED: cysteine protease ATG4B isoform 2 [Pan paniscus]
Length = 468
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 375
>gi|390365223|ref|XP_785967.3| PREDICTED: cysteine protease ATG4B-like [Strongylocentrotus
purpuratus]
Length = 390
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 103/316 (32%), Positives = 150/316 (47%), Gaps = 50/316 (15%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
IW+LG + ++Q + E D SR+ +YRKGF IG + T+D GWGC
Sbjct: 48 IWILGKKYDLSQHQL-----------EARLDVLSRLWFTYRKGFSNIGGTGPTTDQGWGC 96
Query: 173 MLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
MLR QM++AQAL++ LGR WR +P ++ D Y++IL LF D + S FSIH + Q G
Sbjct: 97 MLRCGQMMLAQALVYKHLGRDWRWRPQEQ--DETYLKILQLFLDKKDSCFSIHQIAQMGV 154
Query: 232 AYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
G G W GP + + SW LA + + + + V S E+
Sbjct: 155 GEGKKVGDWFGPNTVGQVIRKLSPFDSWSDLAVHVALDNTVVIEDIRKLCTVNSTTEETS 214
Query: 283 RGGAPV--------------------------VCIDDASRHCSVFSKGQADWTPILLLVP 316
G+ + + + + S G W + L++P
Sbjct: 215 SEGSKTGSERRKRTSSSENIRHKMQLSPENTNIQLPNGLMEGACVSPGGVSWRSLFLIIP 274
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
L LGL ++N Y+ L+ FT PQSLG++GGKP + Y +GV + +YLDPH QP +
Sbjct: 275 LRLGLNEINTVYMQRLKRCFTLPQSLGVIGGKPNHAHYFIGVLGDEMVYLDPHTTQPAAD 334
Query: 377 IGKDDLEADTSTYHSE 392
I K D S +H E
Sbjct: 335 IDKWAFLQDES-FHCE 349
>gi|30410798|ref|NP_847896.1| cysteine protease ATG4B isoform b [Homo sapiens]
Length = 380
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|402889930|ref|XP_003908250.1| PREDICTED: cysteine protease ATG4B [Papio anubis]
Length = 508
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 137 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 183
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 184 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 243
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 244 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 295
Query: 282 ERGGAPVVCID------DASRHCSVFSKG------QADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 296 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 355
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 356 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 402
>gi|78101773|pdb|2CY7|A Chain A, The Crystal Structure Of Human Atg4b
Length = 396
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 290
>gi|332815902|ref|XP_001162556.2| PREDICTED: cysteine protease ATG4B isoform 1 [Pan troglodytes]
Length = 496
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 403
>gi|119591684|gb|EAW71278.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_a
[Homo sapiens]
Length = 415
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|34531319|dbj|BAC86110.1| unnamed protein product [Homo sapiens]
Length = 468
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 282 ERGGAPVVCID------DASRHCSVFSKG------QADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 375
>gi|62860068|ref|NP_001016619.1| autophagy related 4A, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|89269917|emb|CAJ81691.1| APG4 autophagy 4 homolog A (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
gi|171846953|gb|AAI61565.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
gi|213625518|gb|AAI70776.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
gi|213627145|gb|AAI70802.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
Length = 395
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 97/263 (36%), Positives = 133/263 (50%), Gaps = 26/263 (9%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR WR K
Sbjct: 52 DIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWRWEKHKEH 111
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
EY +IL F D + +SIH + Q G G + G W GP + + + LA +
Sbjct: 112 PEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNS- 170
Query: 263 LGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSK-----GQAD-WT 309
+A+Y VV D P C + A+ + S +S+ GQ+ W
Sbjct: 171 -------LAVYVSMDNTVVIEDIKTMCKYQPHSCSMAQAASYQSTWSRCRDASGQSSGWR 223
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G + IYLDPH
Sbjct: 224 PLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEIIYLDPH 283
Query: 370 DVQPVINIGKDDLEADTSTYHSE 392
Q + D E TYH +
Sbjct: 284 TTQTFV-----DTEDQDQTYHCQ 301
>gi|348563665|ref|XP_003467627.1| PREDICTED: cysteine protease ATG4A-like [Cavia porcellus]
Length = 398
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 94/288 (32%), Positives = 142/288 (49%), Gaps = 25/288 (8%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
G + G W GP A+ W +LA + + + + V+ D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPFSADTAD 197
Query: 284 GGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
+P I + S+ S F W P+LL+VPL LG+ ++NP Y+ + F PQSL
Sbjct: 198 KSSPDSFITSNQSKDTSAFCPA---WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSL 254
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
G +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 255 GALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 301
>gi|344286328|ref|XP_003414911.1| PREDICTED: cysteine protease ATG4A [Loxodonta africana]
Length = 411
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 97/302 (32%), Positives = 145/302 (48%), Gaps = 53/302 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + + + + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 42 VWILGKQHLLKTERS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 90
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 91 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 150
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 151 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 193
Query: 293 DASRHCSVF--------------------SKGQA----DWTPILLLVPLVLGLEKVNPRY 328
D + C VF SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 194 DIKKMCCVFPLSAGAAGESPPAFPSASSQSKGTSACCPAWKPLLLIVPLRLGINQINPVY 253
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D T
Sbjct: 254 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGM-VDDQT 312
Query: 389 YH 390
+H
Sbjct: 313 FH 314
>gi|344299096|ref|XP_003421224.1| PREDICTED: cysteine protease ATG4B [Loxodonta africana]
Length = 420
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 143/289 (49%), Gaps = 40/289 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 49 TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 95
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQALL LGR WR ++ Y +LH F D + S +SIH +
Sbjct: 96 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWAQRRRQPDSYFSVLHAFIDRKDSHYSIHQI 155
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 282
Q G G + G W GP + + + LA + +A+++ + E+
Sbjct: 156 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 207
Query: 283 R-------GGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
R C D S+HC+ G + W P++LL+PL LGL +N Y+
Sbjct: 208 RLCKSSTPCAGAAACPADPSQHCNGLPAGAEAAGRPSTWRPLVLLIPLRLGLTDINEAYV 267
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + +
Sbjct: 268 ETLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELA 316
>gi|355669953|gb|AER94691.1| ATG4 autophagy related 4-like protein A [Mustela putorius furo]
Length = 408
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 144/289 (49%), Gaps = 27/289 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 39 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 87
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 88 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 147
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 148 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTVG 207
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
E + +AS G+ W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 208 ESPPDTL----NASNQSKGTPAGRPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 263
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
LG +GGKP + Y +G + I+LDPH Q ++ +++ D T+H
Sbjct: 264 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFH 311
>gi|291415044|ref|XP_002723769.1| PREDICTED: APG4 autophagy 4 homolog B [Oryctolagus cuniculus]
Length = 473
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/248 (37%), Positives = 126/248 (50%), Gaps = 10/248 (4%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 198
E D +SR+ +YRK F IG + TSD GWGCMLR QM+ AQAL+ LGR WR
Sbjct: 122 EILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQ 181
Query: 199 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 258
QK Y+ +LH F D + S +SIH + Q G G + G W GP + + + LA
Sbjct: 182 QKRQPDSYLSVLHAFMDRKDSYYSIHQIAQMGVGEGKSVGQWYGPNTVAQVLKKLAVFD- 240
Query: 259 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR----HCSVFS-----KGQADWT 309
+ L V+ R P HC+ F ++ W
Sbjct: 241 TWSSLAVHIAMDNTVVMEEIRRLCRSSHPCAGAATPPAGADWHCNGFPASTEVTNRSPWR 300
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P++LL+PL LGL +N Y+ TL+L F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 301 PLVLLIPLRLGLTDINEAYVETLKLCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPH 360
Query: 370 DVQPVINI 377
QP + +
Sbjct: 361 TTQPAVEL 368
>gi|14042685|dbj|BAB55353.1| unnamed protein product [Homo sapiens]
Length = 380
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 100/287 (34%), Positives = 143/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ + PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 241 ETLKHCYMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|348577273|ref|XP_003474409.1| PREDICTED: cysteine protease ATG4B [Cavia porcellus]
Length = 412
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 102/298 (34%), Positives = 145/298 (48%), Gaps = 29/298 (9%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +D+ L D A SR+ +YR+ F IG + TS
Sbjct: 39 TSEPVWILGRKYSIFTEKDDILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 85
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 86 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQI 145
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA + L V+ R G
Sbjct: 146 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRTGL 204
Query: 287 PV----VCIDDASRHCSVF--------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
P DA RHC+ F + + W P++LL+PL LGL +N Y+ TL+
Sbjct: 205 PCAGAAALPTDADRHCNGFPTQTEVTNRQSPSLWRPLVLLIPLRLGLTDINEAYVETLKH 264
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D T+H +
Sbjct: 265 CFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDGCFIPD-ETFHCQ 321
>gi|410989157|ref|XP_004000831.1| PREDICTED: cysteine protease ATG4A isoform 1 [Felis catus]
Length = 398
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/287 (32%), Positives = 144/287 (50%), Gaps = 23/287 (8%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
G + G W GP A+ W +LA + + + + V+ D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTVG 197
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
P ++ +++ F+ A W P+LL+VPL LG+ ++NP Y+ + F PQSLG
Sbjct: 198 ESTPGT-LNASNQSRGTFACCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSLG 255
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+GGKP + Y +G + I+LDPH Q +N +++ D T+H
Sbjct: 256 ALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVNT-EENGTVDDQTFH 301
>gi|350426238|ref|XP_003494376.1| PREDICTED: cysteine protease ATG4D-like [Bombus impatiens]
Length = 486
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 95/266 (35%), Positives = 136/266 (51%), Gaps = 25/266 (9%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191
Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
H LGR WR + +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 192 CHFLGREWRWQVDQPLKTEQQKLDEHNHRLIIKSFGDLPDSTSPFSIHTLVSLGALWGKR 251
Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 296
AG W GP ++ Q AE +L A+YV V + D
Sbjct: 252 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 299
Query: 297 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
C + W ++L VPL LG +K+NP Y L T +G++GG+P S Y +
Sbjct: 300 VCQM---PDGKWKSLILFVPLRLGADKLNPVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 356
Query: 357 GVQEESAIYLDPHDVQPVINIGKDDL 382
G QE+ I LDPH Q +++ KD+
Sbjct: 357 GFQEDKLINLDPHYCQETVDVLKDNF 382
>gi|197100863|ref|NP_001126588.1| cysteine protease ATG4A [Pongo abelii]
gi|61211744|sp|Q5R699.1|ATG4A_PONAB RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|55732020|emb|CAH92717.1| hypothetical protein [Pongo abelii]
Length = 398
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/302 (32%), Positives = 145/302 (48%), Gaps = 53/302 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSV--------------------FSKGQ----ADWTPILLLVPLVLGLEKVNPRY 328
D + C V SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRVLPLGADTAGDRPPDSLTASNLSKGTSAYCSAWKPLLLIVPLRLGINQINPVY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ G++ D T
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTGENGTVND-QT 299
Query: 389 YH 390
+H
Sbjct: 300 FH 301
>gi|432107261|gb|ELK32675.1| Cysteine protease ATG4B [Myotis davidii]
Length = 394
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 105/304 (34%), Positives = 146/304 (48%), Gaps = 43/304 (14%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQALL LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 269
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAIFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 189
Query: 270 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
A D +G G P + + + W P++LL+PL LGL +N Y
Sbjct: 190 CAEATAFPADSEGHCNGLPA---------GAEVTNRPSLWRPLVLLIPLRLGLTDINEAY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + L D S
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSFLIPDES- 299
Query: 389 YHSE 392
+H +
Sbjct: 300 FHCQ 303
>gi|146387686|pdb|2P82|A Chain A, Cysteine Protease Atg4a
gi|146387687|pdb|2P82|B Chain B, Cysteine Protease Atg4a
gi|146387688|pdb|2P82|C Chain C, Cysteine Protease Atg4a
gi|146387689|pdb|2P82|D Chain D, Cysteine Protease Atg4a
Length = 355
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 95/292 (32%), Positives = 145/292 (49%), Gaps = 33/292 (11%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 25 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 74 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 133
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 134 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 193
Query: 282 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 194 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 246
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 247 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 297
>gi|403289551|ref|XP_003935915.1| PREDICTED: cysteine protease ATG4A isoform 1 [Saimiri boliviensis
boliviensis]
Length = 422
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 145/289 (50%), Gaps = 27/289 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTPG 221
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
+R + ++ SR S + W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 222 DRPPDSLTASNE-SRGTSAYCPA---WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 277
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
LG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 278 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 325
>gi|50369556|gb|AAH76463.1| Atg4b protein, partial [Danio rerio]
Length = 393
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 96/264 (36%), Positives = 135/264 (51%), Gaps = 18/264 (6%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 44 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 103
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 104 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 163
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 308
A + + + + D +RG P D C++ + A W
Sbjct: 164 AVHVAMDNTVVIEEIKRLCMPWL---DFDRGACAVSEEPREMNGDLEGACALAEEETALW 220
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 221 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 280
Query: 369 HDVQPVINIGKDDLEADTSTYHSE 392
H QP ++ +D D S YH +
Sbjct: 281 HTTQPAVDPSEDGHFPDDS-YHCQ 303
>gi|402911087|ref|XP_003918174.1| PREDICTED: cysteine protease ATG4A isoform 1 [Papio anubis]
Length = 398
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/297 (32%), Positives = 146/297 (49%), Gaps = 43/297 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S HC W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 193 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 245
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D T+H
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVND-QTFH 301
>gi|61211768|sp|Q6DG88.2|ATG4B_DANRE RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
Length = 394
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 96/264 (36%), Positives = 135/264 (51%), Gaps = 18/264 (6%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 45 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 308
A + + + + D +RG P D C++ + A W
Sbjct: 165 AVHVAMDNTVVIEEIKRLCMPWL---DFDRGACAVSEEPREMNGDLEGACALAEEETALW 221
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281
Query: 369 HDVQPVINIGKDDLEADTSTYHSE 392
H QP ++ +D D S YH +
Sbjct: 282 HTTQPAVDPSEDGHFPDDS-YHCQ 304
>gi|148237097|ref|NP_001082821.1| cysteine protease ATG4B [Danio rerio]
gi|141795460|gb|AAI34887.1| Atg4b protein [Danio rerio]
Length = 394
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 96/264 (36%), Positives = 135/264 (51%), Gaps = 18/264 (6%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 45 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 308
A + + + + D +RG P D C++ + A W
Sbjct: 165 AVHVAMDNTVVIEEIKRLCMPWL---DFDRGACAVSEEPREMNGDLEGACALAEEETALW 221
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281
Query: 369 HDVQPVINIGKDDLEADTSTYHSE 392
H QP ++ +D D S YH +
Sbjct: 282 HTTQPAVDPSEDGHFPDDS-YHCQ 304
>gi|395854618|ref|XP_003799779.1| PREDICTED: cysteine protease ATG4A isoform 1 [Otolemur garnettii]
Length = 398
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 145/295 (49%), Gaps = 27/295 (9%)
Query: 107 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
S + +W+LG H + +++ + D S+R+ +YR+ F PIG + +S
Sbjct: 23 SDTDELVWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSS 71
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM++AQAL+ LGR W QK +EY IL F D + +SIH +
Sbjct: 72 DAGWGCMLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQM 131
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV-- 275
Q G G + G W GP A+ W +LA + + + + V+
Sbjct: 132 AQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPS 191
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
S D GE + ++ + S + W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 192 SADTAGESPPGSLTALNQSKGT----SACRPAWKPLLLIVPLRLGINQINPVYVDAFKEC 247
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
F PQSLG +GGKP + Y +G I+LDPH Q ++ +++ D T+H
Sbjct: 248 FKMPQSLGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDT-EENGTVDDQTFH 301
>gi|119623100|gb|EAX02695.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_f
[Homo sapiens]
Length = 402
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 95/292 (32%), Positives = 145/292 (49%), Gaps = 33/292 (11%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 201
Query: 282 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 202 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 254
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 255 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 305
>gi|350537069|ref|NP_001233457.1| cysteine protease ATG4A [Pan troglodytes]
gi|343958112|dbj|BAK62911.1| cysteine protease ATG4A [Pan troglodytes]
gi|410207960|gb|JAA01199.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410248796|gb|JAA12365.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410290856|gb|JAA24028.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410329967|gb|JAA33930.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
Length = 398
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 97/297 (32%), Positives = 147/297 (49%), Gaps = 43/297 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP++I
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSI---- 193
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 194 -DTPGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 245
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 301
>gi|307205961|gb|EFN84087.1| Cysteine protease ATG4D [Harpegnathos saltator]
Length = 456
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 99/306 (32%), Positives = 148/306 (48%), Gaps = 46/306 (15%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
S S +WLLG C+ ++ L +A+ N + EF +DF+SR
Sbjct: 62 SKESPVWLLGQCYLKKSEDPLENASEALEPEGTGSQVSLAMDATNFENTIEEFKRDFASR 121
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------KPLQ 199
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR W+ Q
Sbjct: 122 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWKWRPEQSIENTQQ 181
Query: 200 KPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
D + I+ F D SPFSIH L+ G + G AG W GP ++ L++
Sbjct: 182 MRDDSNHRMIIKWFADQSKPESPFSIHRLVSLGASTGKRAGDWYGPNSVAH---LLSQAV 238
Query: 258 RAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
L L +A+YV V + D C G W ++LLVP
Sbjct: 239 ERTGELPNSKLSRLAVYVAQD---------CAVYMQDVEEVCRTSDGG---WKSLILLVP 286
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
L+LG +K+NP Y P + T +G++GG+P S Y +G Q++ I+LDPH Q ++
Sbjct: 287 LMLGTDKLNPVYAPCVTSLLTLDACIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVD 346
Query: 377 IGKDDL 382
+ K++
Sbjct: 347 VSKENF 352
>gi|332375955|gb|AEE63118.1| unknown [Dendroctonus ponderosae]
Length = 370
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/319 (36%), Positives = 153/319 (47%), Gaps = 46/319 (14%)
Query: 87 TAGSMRRIHERVLGPSRT--GISSSTSDIWLLGV-CHKIAQDEALGDAAGNNGLAEFNQD 143
T M + E VL ++ I ST +WLLG H I N L QD
Sbjct: 5 TRDIMDCMFEAVLDSTQDPDDIPQSTEPVWLLGKKYHAI------------NELNTIRQD 52
Query: 144 FSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKP 201
S++ +YRK F PIG S TSD GWGCMLR QM++ QAL+ LGR W+ P +
Sbjct: 53 IVSKLWFTYRKDFVPIGGSDGKTSDKGWGCMLRCGQMVLGQALMSIHLGRDWQWNPTTR- 111
Query: 202 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 261
D Y+ IL F DS +PFSIH + G + G G W GP + + + L +
Sbjct: 112 -DATYLSILKKFEDSRKAPFSIHQIASMGISEGKEVGQWFGPNTVAQVLKKLVKFDEGND 170
Query: 262 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVP 316
+AI+V + VV I + C SK AD W P+LL+VP
Sbjct: 171 --------VAIHVALDN---------VVIISEIRDLC--LSKETADVSTPHWKPLLLIVP 211
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
L LGL ++N Y+ L+ F F QSLGI+GGKP ++ Y +G IY DPH Q +
Sbjct: 212 LRLGLTQMNSIYLGGLKQCFQFKQSLGIIGGKPNSALYFIGYVGNEVIYFDPHTTQKAGS 271
Query: 377 IGKDDL--EADTS-TYHSE 392
+G D E D +YH +
Sbjct: 272 VGNKDTSEEKDVDLSYHCK 290
>gi|354500801|ref|XP_003512485.1| PREDICTED: cysteine protease ATG4A-like [Cricetulus griseus]
gi|344251116|gb|EGW07220.1| Cysteine protease ATG4A [Cricetulus griseus]
Length = 398
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 97/302 (32%), Positives = 143/302 (47%), Gaps = 53/302 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLRTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKG--QAD----------------------WTPILLLVPLVLGLEKVNPRY 328
D + C V G AD W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCCVLPVGAHTADESPPDSLPASSQGKGPSATCPAWKPLLLIVPLRLGINQINPVY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G + I+LDPH Q ++ + + D T
Sbjct: 241 IEAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEESGI-VDDET 299
Query: 389 YH 390
+H
Sbjct: 300 FH 301
>gi|417410350|gb|JAA51650.1| Putative cysteine protease required for autophagy, partial
[Desmodus rotundus]
Length = 394
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 145/304 (47%), Gaps = 43/304 (14%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRRYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQALL LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 269
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHVAMDNTVVMEDIRRLCRSSLP 189
Query: 270 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
A D +G G P + + + W P++LL+PL LGL +N Y
Sbjct: 190 CAGASAFPADSEGHCNGFPAR---------AEVTNRPSPWRPLVLLIPLRLGLTDINEAY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D +
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEF-TDSCSIPDES 299
Query: 389 YHSE 392
+H +
Sbjct: 300 FHCQ 303
>gi|397497900|ref|XP_003819741.1| PREDICTED: cysteine protease ATG4A isoform 1 [Pan paniscus]
Length = 398
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 95/292 (32%), Positives = 145/292 (49%), Gaps = 33/292 (11%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTPG 197
Query: 282 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 301
>gi|156395764|ref|XP_001637280.1| predicted protein [Nematostella vectensis]
gi|156224391|gb|EDO45217.1| predicted protein [Nematostella vectensis]
Length = 368
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 95/268 (35%), Positives = 139/268 (51%), Gaps = 38/268 (14%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
+ D+W+LG + I Q GD + N D SRI ++YRK F IG + T+D
Sbjct: 26 TEEDVWILGKRYNILQ----GD------MGYLNTDVRSRIWLTYRKNFPKIGGTGPTTDS 75
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM++AQAL+ LGR W+ + EY++IL F D + S +SIH + Q
Sbjct: 76 GWGCMLRCGQMMLAQALVCRHLGRDWQWDPENNTTPEYMQILEAFLDKKDSLYSIHQIAQ 135
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G + G A GSW GP + + + L+ + + ++V +
Sbjct: 136 MGVSEGKAVGSWFGPNTVAQVLKKLSAFDDWSS--------LCLHVAMDN---------T 178
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
V I+D S +W P++L +PL LGL ++N Y L+ FTF QSLGI+GG+
Sbjct: 179 VIIEDIS-----------NWRPLVLFIPLRLGLTEMNVVYNEPLKACFTFKQSLGIIGGR 227
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVIN 376
P +TY +G + +YLDPH Q +N
Sbjct: 228 PNHATYFIGYFGNNLVYLDPHTTQQTVN 255
>gi|30795252|ref|NP_443168.2| cysteine protease ATG4A isoform a [Homo sapiens]
gi|426397036|ref|XP_004064734.1| PREDICTED: cysteine protease ATG4A isoform 1 [Gorilla gorilla
gorilla]
gi|61211859|sp|Q8WYN0.1|ATG4A_HUMAN RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
cysteine endopeptidase; AltName: Full=Autophagin-2;
AltName: Full=Autophagy-related cysteine endopeptidase
2; AltName: Full=Autophagy-related protein 4 homolog A;
Short=hAPG4A
gi|18181956|dbj|BAB83889.1| Apg4A [Homo sapiens]
gi|27763979|emb|CAD43218.1| autophagin-2 [Homo sapiens]
gi|38197608|gb|AAH61696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
gi|119623094|gb|EAX02689.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|189069378|dbj|BAG37044.1| unnamed protein product [Homo sapiens]
gi|312151352|gb|ADQ32188.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [synthetic
construct]
Length = 398
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 95/292 (32%), Positives = 145/292 (49%), Gaps = 33/292 (11%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 197
Query: 282 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 301
>gi|345807894|ref|XP_538136.3| PREDICTED: cysteine protease ATG4A [Canis lupus familiaris]
Length = 398
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 148/289 (51%), Gaps = 27/289 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D +R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDIRARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK REY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPREYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAIYVSMDNTVVIEDIKKMCCVLPLSADTIG 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
E +P+ ++ +++ S + A W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 E---SPLNTLNASNQSKSAPASCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
LG +GGKP + Y +G + I+LDPH Q ++ +++ D T+H
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFH 301
>gi|195995623|ref|XP_002107680.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
gi|190588456|gb|EDV28478.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
Length = 385
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 102/301 (33%), Positives = 141/301 (46%), Gaps = 53/301 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVG 169
+WLLG C+ N L EF++ D +S+ +YRK + PIG TSD G
Sbjct: 25 VWLLGCCY--------------NPLEEFDKLIADINSKFWFTYRKNYPPIGGIGPTSDKG 70
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCMLR QM++ QAL+ LGR WR K Y +IL LF DS+ S +SIH + Q
Sbjct: 71 WGCMLRCGQMILGQALVMRHLGRDWRWFKNKEQLANYWKILKLFLDSKDSLYSIHQIAQM 130
Query: 230 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
G + G W GP + + L M +YV + +V
Sbjct: 131 GVSEGKKISQWFGPNTAAQVLKKLIMFDEWSQ--------MGVYVAMDN---------IV 173
Query: 290 CIDDASR----HCSVFSKGQA--------------DWTPILLLVPLVLGLEKVNPRYIPT 331
IDD + H + S+G A W P+LL +PL LGL +NP Y
Sbjct: 174 VIDDIKKICHNHITRTSQGNAANSDAQGSSNEQSNAWKPLLLFIPLRLGLTDLNPIYKDK 233
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
L F +LGI+GGKP ++ Y +G+Q + +YLDPH VQ + + K + TYH
Sbjct: 234 LNKCFRIKNTLGIIGGKPNSAHYFIGIQGDYLLYLDPHTVQETVKV-KPNCPFSDKTYHQ 292
Query: 392 E 392
+
Sbjct: 293 K 293
>gi|332226092|ref|XP_003262223.1| PREDICTED: cysteine protease ATG4A isoform 1 [Nomascus leucogenys]
Length = 398
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 95/292 (32%), Positives = 145/292 (49%), Gaps = 33/292 (11%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTAG 197
Query: 282 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 301
>gi|296470926|tpg|DAA13041.1| TPA: cysteine protease ATG4A [Bos taurus]
Length = 396
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 142/289 (49%), Gaps = 27/289 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
LG +GGKP + Y +G + I+LDPH Q ++ +++ AD T+H
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFH 301
>gi|47564112|ref|NP_001001171.1| cysteine protease ATG4A [Bos taurus]
gi|61211781|sp|Q6PZ05.1|ATG4A_BOVIN RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related cysteine endopeptidase 2A;
Short=Autophagin-2A; AltName: Full=Autophagy-related
protein 4 homolog A; AltName: Full=bAut2A
gi|45861656|gb|AAS78581.1| Aut2a [Bos taurus]
Length = 398
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 142/289 (49%), Gaps = 27/289 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
LG +GGKP + Y +G + I+LDPH Q ++ +++ AD T+H
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFH 301
>gi|395851538|ref|XP_003798310.1| PREDICTED: cysteine protease ATG4B [Otolemur garnettii]
Length = 393
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 147/312 (47%), Gaps = 59/312 (18%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
++ +W+LG + I ++ E D +SR+ +YRK F IG + TSD
Sbjct: 22 TSEPVWILGRKYSIFTEKE-----------ELLSDVASRLWFTYRKNFPAIGGTGPTSDT 70
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q
Sbjct: 71 GWGCMLRCGQMIFAQALVCQHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQ 130
Query: 229 AGKAYGLAAGSWVGP---------YAMCRSWEALA------------RCQR-AETGLGCQ 266
G G + G W GP A+ +W +LA +R T L C
Sbjct: 131 MGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSLPCG 190
Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLG 320
+ P + AP +HC+ F G + W P++LL+PL LG
Sbjct: 191 TAPAS------------SAAP-------DQHCNGFPAGAEVTTRLSPWRPLVLLIPLRLG 231
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
L +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 232 LTDINAAYVETLKRCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEATDS 291
Query: 381 DLEADTSTYHSE 392
L D S +H +
Sbjct: 292 CLVPDES-FHCQ 302
>gi|349605276|gb|AEQ00569.1| Cysteine protease ATG4A-like protein, partial [Equus caballus]
Length = 369
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 97/290 (33%), Positives = 149/290 (51%), Gaps = 31/290 (10%)
Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGCM
Sbjct: 1 WILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 49
Query: 174 LRSSQMLVAQALLFHRLGRP--WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
LR QM++AQAL+ LGR W K ++P +EY IL F D + +SIH + Q G
Sbjct: 50 LRCGQMMLAQALICRHLGRDLNWEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGV 107
Query: 232 AYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDED 280
G + G W GP A+ W +LA + + + + I +S D
Sbjct: 108 GEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTA 167
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
GE +P ++ ++R S S G W P+LL+VPL LG+ ++NP Y+ + F PQ
Sbjct: 168 GE---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQ 223
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
SLG +GGKP + Y +G + I+LDPH Q ++ +++ D T+H
Sbjct: 224 SLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFH 272
>gi|348666332|gb|EGZ06159.1| hypothetical protein PHYSODRAFT_532364 [Phytophthora sojae]
Length = 398
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 96/250 (38%), Positives = 132/250 (52%), Gaps = 30/250 (12%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 197
+ + F + + +YR+ F + TSD GWGCMLRS+QML+ QAL LGR WR P
Sbjct: 41 YKRSFEAILWFTYRRDFPQMTPYDFTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPAL 100
Query: 198 ----LQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
+ +YV +L F DS +SIH++++ G Y G W GP +
Sbjct: 101 FEAEIDARLPDKYVTLLRWFADSPDIECRYSIHHMVKLGMQYDKLPGEWYGPTTAAQVLR 160
Query: 252 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV-------FSKG 304
L R E G +A+YV ++G VV DD +R C ++
Sbjct: 161 DLVNLHRREFGG-----ELAMYV---PQEG------VVYTDDVTRLCFFDPLLHPPTAED 206
Query: 305 QADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
+DW T +L+L+PL LGL++VN RY+P L TF FPQS+GI+GGK G S Y VG Q++
Sbjct: 207 SSDWSTALLILIPLRLGLDQVNERYVPALEKTFAFPQSVGIIGGKKGHSVYFVGTQQDQL 266
Query: 364 IYLDPHDVQP 373
LDPHDV P
Sbjct: 267 HLLDPHDVHP 276
>gi|281342750|gb|EFB18334.1| hypothetical protein PANDA_015152 [Ailuropoda melanoleuca]
Length = 373
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 96/302 (31%), Positives = 144/302 (47%), Gaps = 53/302 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178
Query: 293 DASRHCSVF--------------------SKGQ----ADWTPILLLVPLVLGLEKVNPRY 328
D + C V SKG W P+LL+VPL LG+ ++NP Y
Sbjct: 179 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 238
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ +++ D T
Sbjct: 239 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQT 297
Query: 389 YH 390
+H
Sbjct: 298 FH 299
>gi|291407754|ref|XP_002720229.1| PREDICTED: autophagy-related cysteine endopeptidase 2 [Oryctolagus
cuniculus]
Length = 405
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 142/289 (49%), Gaps = 27/289 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 36 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 84
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 85 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 144
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S + G
Sbjct: 145 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSANTPG 204
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 205 ERLHDSLT----ASNQSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 260
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
LG +GGKP + Y +G I+LDPH Q ++ +++ D T+H
Sbjct: 261 LGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDT-EENGTVDDQTFH 308
>gi|224510547|pdb|2ZZP|A Chain A, The Crystal Structure Of Human Atg4b(C74s)- Lc3(1-124)
Complex
Length = 357
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 100/287 (34%), Positives = 142/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWG MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGSMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 290
>gi|452977855|gb|EME77619.1| hypothetical protein MYCFIDRAFT_191078 [Pseudocercospora fijiensis
CIRAD86]
Length = 445
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/272 (36%), Positives = 137/272 (50%), Gaps = 43/272 (15%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 175
+EF DF SR+ I+YR F PI S TSD GWGCM+R
Sbjct: 109 SEFLDDFESRVWITYRDAFPPIPKSSHPAAASKMSFTTKLRNFTNQAGFTSDTGWGCMIR 168
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
S Q L+A ++ HRLGR WRK + +RE+ +IL LF D+ +PFSIH ++ G +A G
Sbjct: 169 SGQSLLANTIVVHRLGRDWRKGQK---EREHKDILSLFADTPDAPFSIHKFVEHGAQACG 225
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A ARC RA T Q+ + +Y D D V ID A
Sbjct: 226 TYPGEWFGP-------NATARCLRALTDKYHQA-GLRVYARPNDSD--------VYID-A 268
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ ++ P L+++ + LG+EKV P Y L+ PQS+GI GG+P +S Y
Sbjct: 269 LTATATQKDANDEFQPTLIVLGIRLGIEKVTPAYHAALKAALELPQSMGIAGGRPSSSHY 328
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
VG Q ++ YLDPH +P+++ + DT
Sbjct: 329 FVGHQGDNFFYLDPHTTRPMLSPQPSAEDVDT 360
>gi|149244060|pdb|2Z0D|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-120) Complex
gi|149244062|pdb|2Z0E|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-124) Complex
Length = 357
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 100/287 (34%), Positives = 142/287 (49%), Gaps = 40/287 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDP QP +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPATTQPAVE 290
>gi|355705060|gb|EHH30985.1| Cysteine protease ATG4A, partial [Macaca mulatta]
Length = 396
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 95/292 (32%), Positives = 144/292 (49%), Gaps = 33/292 (11%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 195
Query: 282 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+R + + + S HC W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 196 DRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 248
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 249 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 299
>gi|387762879|ref|NP_001248420.1| cysteine protease ATG4A [Macaca mulatta]
gi|380809390|gb|AFE76570.1| cysteine protease ATG4A isoform a [Macaca mulatta]
gi|383413573|gb|AFH30000.1| cysteine protease ATG4A isoform a [Macaca mulatta]
Length = 398
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 95/292 (32%), Positives = 144/292 (49%), Gaps = 33/292 (11%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 197
Query: 282 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+R + + + S HC W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 301
>gi|440901286|gb|ELR52261.1| Cysteine protease ATG4B, partial [Bos grunniens mutus]
Length = 393
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 105/289 (36%), Positives = 139/289 (48%), Gaps = 46/289 (15%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
Q G G + G W GP A+ +W ALA + M VV
Sbjct: 129 AQMGVGEGKSVGQWYGPNTVAQVLKKLAVFDTWSALA-----------VHVAMDNTVVMA 177
Query: 278 D-EDGERGGAPVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNP 326
D R P + D+ RHC+ F A W P++LL+PL LGL VN
Sbjct: 178 DIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNA 237
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 238 AYAGTLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV 286
>gi|380015613|ref|XP_003691794.1| PREDICTED: cysteine protease ATG4D-like [Apis florea]
Length = 486
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 95/270 (35%), Positives = 135/270 (50%), Gaps = 33/270 (12%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191
Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
H LGR WR +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 192 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 251
Query: 237 AGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
AG W GP + + ++ E A A L A+YV V +
Sbjct: 252 AGDWYGPSSVAHLLSQAVENAAERHPAFNNL-------AVYVAQD---------CAVYLQ 295
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D C W ++L VPL LG +K+NP Y L T +G++GG+P S
Sbjct: 296 DIENVCQT---PDGKWKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 352
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
Y +G QE+ I LDPH Q +++ KD+
Sbjct: 353 LYFIGFQEDKLINLDPHYCQETVDVLKDNF 382
>gi|355757609|gb|EHH61134.1| Cysteine protease ATG4A, partial [Macaca fascicularis]
Length = 396
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 97/297 (32%), Positives = 145/297 (48%), Gaps = 43/297 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 190
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S HC W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 191 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 243
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 244 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 299
>gi|296488734|tpg|DAA30847.1| TPA: cysteine protease ATG4B [Bos taurus]
Length = 390
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 105/289 (36%), Positives = 139/289 (48%), Gaps = 46/289 (15%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
Q G G + G W GP A+ +W ALA + M VV
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALA-----------VHVAMDNTVVMA 177
Query: 278 D-EDGERGGAPVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNP 326
D R P + D+ RHC+ F A W P++LL+PL LGL VN
Sbjct: 178 DIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNA 237
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 238 AYAGTLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV 286
>gi|301780424|ref|XP_002925628.1| PREDICTED: cysteine protease ATG4A-like [Ailuropoda melanoleuca]
Length = 429
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 96/302 (31%), Positives = 144/302 (47%), Gaps = 53/302 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 60 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 108
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 109 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 168
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 169 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 211
Query: 293 DASRHCSVF--------------------SKGQ----ADWTPILLLVPLVLGLEKVNPRY 328
D + C V SKG W P+LL+VPL LG+ ++NP Y
Sbjct: 212 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 271
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ +++ D T
Sbjct: 272 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQT 330
Query: 389 YH 390
+H
Sbjct: 331 FH 332
>gi|15487240|emb|CAC69076.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
Length = 398
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 144/289 (49%), Gaps = 27/289 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + V +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
+R + + S+ S + W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 DRPPDSLTA-SNQSKGTSAYCTA---WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
LG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 301
>gi|328786958|ref|XP_393739.4| PREDICTED: cysteine protease ATG4D-like [Apis mellifera]
Length = 525
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 97/270 (35%), Positives = 139/270 (51%), Gaps = 33/270 (12%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 171 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 230
Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
H LGR WR +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 231 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 290
Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCID 292
AG W GP ++ A Q E + + P +A+YV V +
Sbjct: 291 AGDWYGPSSV-----AHLLSQAVENAV--ERHPAFNNLAVYVAQD---------CAVYLQ 334
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D C S G+ W ++L VPL LG +K+NP Y L T +G++GG+P S
Sbjct: 335 DIENVCQT-SDGK--WKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 391
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
Y +G QE+ I LDPH Q +++ KD+
Sbjct: 392 LYFIGFQEDKLINLDPHYCQETVDVLKDNF 421
>gi|47564102|ref|NP_001001170.1| cysteine protease ATG4B [Bos taurus]
gi|61211780|sp|Q6PZ03.1|ATG4B_BOVIN RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related cysteine endopeptidase 2B;
Short=Autophagin-2B; AltName: Full=Autophagy-related
protein 4 homolog B; AltName: Full=bAut2B
gi|45861660|gb|AAS78583.1| Aut2b2 [Bos taurus]
Length = 393
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 105/289 (36%), Positives = 139/289 (48%), Gaps = 46/289 (15%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
Q G G + G W GP A+ +W ALA + M VV
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALA-----------VHVAMDNTVVMA 177
Query: 278 D-EDGERGGAPVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNP 326
D R P + D+ RHC+ F A W P++LL+PL LGL VN
Sbjct: 178 DIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNA 237
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 238 AYAGTLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV 286
>gi|405953478|gb|EKC21133.1| Leucine-rich repeat-containing protein 6 [Crassostrea gigas]
Length = 1114
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 98/289 (33%), Positives = 147/289 (50%), Gaps = 27/289 (9%)
Query: 111 SDIWLLGVCHKIAQDEALGDAAGNN-------GLAEFNQDFSSRILISYRKGFDPIGDSK 163
S +WLLG + I + + D + +F QDFSS + +YR+ F I +K
Sbjct: 226 SPVWLLGKFYHIKPSDLIDDDIQRGKRTRVVPNIEKFKQDFSSLLWFTYRQDFPAIPGTK 285
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV--EILHLFGD--SETS 219
+TSD GWGCMLRS QM++A+AL H LG W + ++E +I+ FGD + S
Sbjct: 286 LTSDCGWGCMLRSGQMMLAKALTLHYLGPEWNVFSDQTREQETYRKQIIRWFGDYLCDES 345
Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGD 278
PFS+H L++ GK G G W GP ++ E + + Q+ +T L + +YV
Sbjct: 346 PFSMHRLVEVGKNLGKQPGEWFGPASVAHILKETMVKGQKTQTVLS----DLCVYVSQDC 401
Query: 279 EDGERGGAPVVCI----------DDASRHCSVFSKGQADWT-PILLLVPLVLGLEKVNPR 327
++ + C S H S DW +++L+P+ LG E++NP
Sbjct: 402 TVYKQDIYELCCTRPRADTKFTNSTESEHESSQDASSMDWKRAVVILIPVRLGGEQLNPV 461
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
YIP ++ + +GI+GGKP S Y VG QE+ IYLDPH Q V++
Sbjct: 462 YIPCVKGLLSQDSCIGIIGGKPKHSLYFVGWQEDKLIYLDPHYCQDVVD 510
>gi|383860522|ref|XP_003705738.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like
[Megachile rotundata]
Length = 518
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/309 (32%), Positives = 144/309 (46%), Gaps = 53/309 (17%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
S S +WLLG ++ +E L A+ + + EF +DF+SR
Sbjct: 126 SKESPVWLLGKIYRKKPEEFLEKASEAEKTLDTGSEISLAMDAISFEDSIEEFKKDFTSR 185
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P E
Sbjct: 186 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWQPDQPIKTEQQ 245
Query: 208 E--------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
+ I+ FGD SPFSIH L+ G +G AG W GP ++A
Sbjct: 246 KLDESNHRFIIQSFGDLPERISPFSIHTLVSLGALWGKRAGDWYGP-------SSVAHLL 298
Query: 258 RAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 313
+ LP +A+YV V + D C + W ++L
Sbjct: 299 SQAVEHAAEHLPIFSNLAVYVAQD---------CAVYLQDVESVCQM---PDGKWKSLIL 346
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
VPL LG +K+NP Y L T +G++GG+P S Y +G QE+ I LDPH Q
Sbjct: 347 FVPLRLGTDKLNPVYTSCLTHLLTLDTCIGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE 406
Query: 374 VINIGKDDL 382
+++ KD+
Sbjct: 407 TVDVLKDNF 415
>gi|151554833|gb|AAI47963.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Bos taurus]
Length = 398
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 141/289 (48%), Gaps = 27/289 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
LG +GGKP + Y +G + I+LDPH Q ++ +++ D T+H
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFH 301
>gi|440891575|gb|ELR45180.1| Cysteine protease ATG4A, partial [Bos grunniens mutus]
Length = 408
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 141/289 (48%), Gaps = 27/289 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + +S D
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 195
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 196 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
LG +GGKP + Y +G + I+LDPH Q ++ +++ D T+H
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFH 299
>gi|45861658|gb|AAS78582.1| Aut2B1 [Bos taurus]
Length = 342
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 144/306 (47%), Gaps = 47/306 (15%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
Q G G + G W GP A+ +W ALA + M VV
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALA-----------VHVAMDNTVVMA 177
Query: 278 D-EDGERGGAPVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNP 326
D R P + D+ RHC+ F A W P++LL+PL LGL VN
Sbjct: 178 DIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNA 237
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D
Sbjct: 238 AYAGTLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVA-AADRCPVPD 296
Query: 387 STYHSE 392
++H +
Sbjct: 297 ESFHCQ 302
>gi|426257739|ref|XP_004022480.1| PREDICTED: cysteine protease ATG4A [Ovis aries]
Length = 398
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 96/302 (31%), Positives = 145/302 (48%), Gaps = 53/302 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHC--------------------SVFSKGQA----DWTPILLLVPLVLGLEKVNPRY 328
D + C S SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRTLSLSADTPAERPLESLTASTQSKGPSACCTAWKPLLLIVPLRLGINQINPVY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ +++ D T
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQT 299
Query: 389 YH 390
+H
Sbjct: 300 FH 301
>gi|213626921|gb|AAI70397.1| APG4A protein [Xenopus laevis]
Length = 395
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 92/269 (34%), Positives = 133/269 (49%), Gaps = 24/269 (8%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 43 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALICQHLGRDWQWE 102
Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 305
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213
Query: 306 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSE 392
IYLDPH Q ++ + D TYH +
Sbjct: 274 IYLDPHTTQTFVDTEEAGTVQD-QTYHCQ 301
>gi|391340875|ref|XP_003744760.1| PREDICTED: cysteine protease ATG4D-like [Metaseiulus occidentalis]
Length = 488
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 107/336 (31%), Positives = 158/336 (47%), Gaps = 45/336 (13%)
Query: 66 SEKKAVHNKSNGWTAAVK---RLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI 122
S +AV N+ GW A +K + +G+ I + S I+LLG +
Sbjct: 89 STSEAVKNRVRGWWANMKYGWNAMNSGAQIDISDL----------SGADPIYLLGHVYHN 138
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
+ A F DFS+R+ +YR+ F P+ + TSD GWGCMLRS+QM++A
Sbjct: 139 KNNSA--------SFKNFFADFSTRLWFTYRQDFQPMQSTGHTSDSGWGCMLRSAQMMLA 190
Query: 183 QALLFHRLGRPWRKPLQKPFDREYV--EILHLFG---DSETSPFSIHNLLQAGKAYGLAA 237
+A +FH LGR WR Q+ V +I+ F D+ +PFS+HN+++A G A
Sbjct: 191 EAFIFHLLGRQWRWCPQQQQQEHGVHRKIIKWFSDDPDTTEAPFSVHNMVRAAAHCGKKA 250
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQS---LPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP L RC G+ MAIYV + D
Sbjct: 251 GDWFGPSTAAY---LLKRCLEEAAGVADSKEIFEQMAIYVAQD---------CTIYTQDV 298
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
C+ S +W ++LL+P+ LG E+VN YI ++ + LGI+GGKP S Y
Sbjct: 299 LDLCT--SDPNIEWKSVVLLIPVRLGGERVNVNYIHCIKEILAYQNCLGIIGGKPRHSLY 356
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
VG Q + +YLDPH +Q + + L +++H
Sbjct: 357 FVGFQGKKLVYLDPHYLQKTTDTSR--LNFSVNSFH 390
>gi|410920724|ref|XP_003973833.1| PREDICTED: cysteine protease ATG4B-like [Takifugu rubripes]
Length = 394
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 100/298 (33%), Positives = 147/298 (49%), Gaps = 28/298 (9%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+T +W+LG + + E D +SR+ +YRK F PIG + TSD
Sbjct: 21 ETTEPVWILG-----------NEYSALTEKEEILSDVTSRLWFTYRKSFPPIGGTGPTSD 69
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLL 227
GWGCMLR QM++ QAL+ LGR WR + +EY+ IL+ F D + S +SIH +
Sbjct: 70 TGWGCMLRCGQMILGQALMCRHLGRDWRWVRGQKQRQEYISILNAFIDKKDSYYSIHQIA 129
Query: 228 QAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 277
Q G G G W GP A+ +W L + + + + + + +
Sbjct: 130 QMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRLVVHVAMDNTVVIEEIKRLCMPWLDK 189
Query: 278 DE---DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
E + ER G C++ A C++ + A W P++LL+PL LGL +N YI TL+
Sbjct: 190 AEVFGEPERVGELNGCLEGA---CALSEEEVALWKPLVLLIPLRLGLSDINGAYIETLKK 246
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
F PQSLG++GGKP ++ Y +G IYLDPH Q + + D TYH +
Sbjct: 247 CFMLPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQTAVEPCEHGQFPD-DTYHCQ 303
>gi|440798079|gb|ELR19150.1| cysteine protease, putative [Acanthamoeba castellanii str. Neff]
Length = 434
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 94/242 (38%), Positives = 128/242 (52%), Gaps = 21/242 (8%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
A F F S + +YR F +G TSD+GWGCMLR+ QM++AQ L H LG WR+
Sbjct: 108 ASFLTHFRSVVWCTYRAAFPRLGSDSYTSDMGWGCMLRTGQMVLAQTLTRHLLGTEWRRQ 167
Query: 198 LQK--PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
+ P Y +++ F D PFS+H + AG YG G W GP M + E L +
Sbjct: 168 SDRSSPL---YAKMVQWFADDPKQPFSLHRIAHAGLKYGKNVGEWFGPSTMAQVLEELLK 224
Query: 256 CQRAETGLG---CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-WTPI 311
+ + +GL CQ +Y+ P+ DD +GQ W P+
Sbjct: 225 -EFSPSGLRAYVCQD--GCLYLDQLRRTATAAHWPLDEDDD---------EGQGKSWAPM 272
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
L+++PL LGL+++N Y P L+ TF PQS+GI GGKP AS Y VG Q++ YLDPH V
Sbjct: 273 LIMLPLRLGLDQLNEDYAPVLKETFRIPQSVGISGGKPRASLYFVGNQDDYVFYLDPHTV 332
Query: 372 QP 373
QP
Sbjct: 333 QP 334
>gi|163914473|ref|NP_001106295.1| APG4A protein [Xenopus laevis]
gi|161611704|gb|AAI55873.1| APG4A protein [Xenopus laevis]
Length = 395
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/269 (34%), Positives = 132/269 (49%), Gaps = 24/269 (8%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 43 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 102
Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 305
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213
Query: 306 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSE 392
IYLDPH Q + + D TYH +
Sbjct: 274 IYLDPHTTQTFVETEEAGTVQD-QTYHCQ 301
>gi|50417810|gb|AAH78135.1| APG4A protein, partial [Xenopus laevis]
Length = 392
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/269 (34%), Positives = 132/269 (49%), Gaps = 24/269 (8%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 40 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 99
Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 100 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 159
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 305
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 160 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 210
Query: 306 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 211 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 270
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSE 392
IYLDPH Q + + D TYH +
Sbjct: 271 IYLDPHTTQTFVETEEAGTVQD-QTYHCQ 298
>gi|18181958|dbj|BAB83890.1| Apg4B [Homo sapiens]
Length = 392
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 100/287 (34%), Positives = 142/287 (49%), Gaps = 41/287 (14%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEK-SIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 179
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 180 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 239
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 240 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 286
>gi|440790872|gb|ELR12135.1| autophagy protein 4, putative [Acanthamoeba castellanii str. Neff]
Length = 510
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 144/303 (47%), Gaps = 61/303 (20%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
F DF SR+ ++YR F IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR +
Sbjct: 115 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 174
Query: 200 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR- 255
+ Y E+L F D S SP+SIH + + G + + G W P + + L
Sbjct: 175 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLLVTE 233
Query: 256 ---------------CQRAETGLGC---------QSLPMAIYVV---------------S 276
R E C Q P+ + S
Sbjct: 234 HSPNGLKMYVPKDGIIYRKEVYQLCAVQPADGPAQHSPLRVDDDGGDTDHDGDTDGLESS 293
Query: 277 GDEDGERGGAP-----VVCIDDASRHCSVFSKGQAD------------WTPILLLVPLVL 319
D G P + D +S H + S +++ W P+++LVP+ L
Sbjct: 294 TDSMRHSHGNPGVPSTIEAGDYSSSHAELMSSAESECESLDDNFTELTWHPVIILVPVRL 353
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G++ +NP YIPTL+ F+FPQ LG++GGKP +S Y VG Q+ +Y+DPH VQP + +
Sbjct: 354 GIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMDPHFVQPTVKMDD 413
Query: 380 DDL 382
D L
Sbjct: 414 DPL 416
>gi|322785465|gb|EFZ12136.1| hypothetical protein SINV_15051 [Solenopsis invicta]
Length = 505
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 102/293 (34%), Positives = 143/293 (48%), Gaps = 46/293 (15%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
S S +WLLG C+ + L A+ N + EF +DF SR
Sbjct: 80 SKESPVWLLGQCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 139
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR-PWR-KPLQKPFDRE 205
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P Q +
Sbjct: 140 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRGQWRWRPEQLTDESS 199
Query: 206 YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRA 259
+ I+ FGD T SPFSIH L+ G + G AG W GP + +C++ E RA
Sbjct: 200 HRMIIKWFGDQLTPESPFSIHKLVVLGASTGKRAGDWYGPSSVAHLLCQAME------RA 253
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
+ +A+YV + V C D R ++LLVPL L
Sbjct: 254 SEDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGRRKA------------LILLVPLRL 301
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
G +K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q
Sbjct: 302 GADKLNPVYAPCLTALLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQ 354
>gi|301104974|ref|XP_002901571.1| cysteine protease family C54, putative [Phytophthora infestans
T30-4]
gi|262100575|gb|EEY58627.1| cysteine protease family C54, putative [Phytophthora infestans
T30-4]
Length = 392
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 101/272 (37%), Positives = 139/272 (51%), Gaps = 29/272 (10%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+WLLG K D A D + + F S + +YR+ + + + TSD GWGC
Sbjct: 23 VWLLG---KRYDDVAAVD------FDAYKRSFESILWFTYRRDYPAMTPYEHTSDAGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGDSETSP--FSIH 224
MLRS+QML+ QAL LGR WR P + YV++L F DS +SIH
Sbjct: 74 MLRSAQMLLGQALQRRLLGRDWRLPALFETEIDARLPETYVQLLRWFADSPDVECRYSIH 133
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL--PMAIYVVSGDEDGE 282
+++ G Y G W GP + L R E G G S+ P V S D
Sbjct: 134 QMVKLGVQYDKLPGEWYGPTTAAQVLRDLVNLHRREFG-GELSMYVPQEGVVYSDD---- 188
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
+C D H ++ ++DW T +L+L+PL LGL++VN RY+P ++ +F FPQS
Sbjct: 189 ---VAKLCFFDPLLHPPT-TEDKSDWSTALLILIPLRLGLDQVNERYVPAIQKSFAFPQS 244
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
+GI+GGK G S Y VG Q++ LDPHDV P
Sbjct: 245 VGIIGGKKGHSVYFVGTQQDQLHLLDPHDVHP 276
>gi|357612380|gb|EHJ67950.1| autophagy related protein Atg4-like protein [Danaus plexippus]
Length = 354
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 96/269 (35%), Positives = 138/269 (51%), Gaps = 42/269 (15%)
Query: 136 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
G+ F DF S+I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR
Sbjct: 8 GIEGFKSDFISKIWMTYRREFPTMSGSSFTTDCGWGCMLRSGQMMLAQALVCHFLGRSWR 67
Query: 196 ---KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 244
KP+Q RE+ E I+ FGD S SP SIH ++ G+A G G W GP
Sbjct: 68 WSEKPIQN--GREFQEDCLHRMIIKWFGDKSSVNSPLSIHQMVTLGEALGKKPGDWYGP- 124
Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV-------VCIDDASRH 297
++A C + ++ V + E+ E V + I D H
Sbjct: 125 ------ASVAHCLK------------SVMVEASKENYEFDKLEVYVAQDSTIYIQDVYTH 166
Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
C + W ++LLVP+ LG E++NP Y P L T +GI+GG+P S Y VG
Sbjct: 167 CRL---PNGCWKSLILLVPVKLGTERLNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVG 223
Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADT 386
Q++ I+LDPH Q ++++ + + T
Sbjct: 224 YQDDRLIHLDPHYCQEMVDVWQPNFSLQT 252
>gi|198438023|ref|XP_002129793.1| PREDICTED: similar to CG6194 CG6194-PA [Ciona intestinalis]
Length = 517
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 93/288 (32%), Positives = 141/288 (48%), Gaps = 34/288 (11%)
Query: 111 SDIWLLGVCHKIAQDEALGDAAGN----------------NGLAEFNQDFSSRILISYRK 154
S +WLLG C+ + + D + N L F DF S++ +YRK
Sbjct: 67 SPLWLLGKCYHLKKPSLSSDTSENAEGSQQSTSESYNMLPKHLKLFLVDFHSKLWFTYRK 126
Query: 155 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVE--ILH 211
GF + D+ +TSD GWGCMLR++QM++AQ+ + H LGR WR P + ++ + I+
Sbjct: 127 GFPTLNDTNLTSDTGWGCMLRTAQMMIAQSFIVHLLGRNWRWTPSRLSMEQSDIHRNIIT 186
Query: 212 LFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
F D + PFS+H L + G +Y G+W GP + C + +T L L
Sbjct: 187 WFLDEQNIRCPFSLHQLTEIGLSYRCKPGNWYGPNTAAYIMQDALECAKGKTEL----LN 242
Query: 270 MAIYVVSGDEDGERGGAPVVC-----IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ ++ D +C DA S S ++ +++L+P+ LG +
Sbjct: 243 NIMVYIAQDSTVYIDDVIEMCEWKNTASDADLKTSTTSSNRS----VIVLIPVRLGEATL 298
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
NP YIP ++ T QS+GI+GGKP S Y +G Q+E YLDPH Q
Sbjct: 299 NPIYIPCIQSMLTLDQSVGIMGGKPKHSLYFIGFQDEYLFYLDPHYCQ 346
>gi|121934653|sp|Q0U199.1|ATG4_PHANO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
Length = 467
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 106/275 (38%), Positives = 139/275 (50%), Gaps = 42/275 (15%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
N + F DF SR+ ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
+RS Q ++A AL RLGR WR + D+++ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209
Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + LA R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVYV-SGD------GADVY--E 252
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + +V G W P L+LV LG++K+ P Y L+ + PQS+GI GG+P AS
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKASLQIPQSIGIAGGRPSAS 310
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
Y VGVQ + YLDPH +P++ L A TS
Sbjct: 311 HYFVGVQGNNFYYLDPHSTRPLLPFHPPSLAAATS 345
>gi|345329187|ref|XP_003431344.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4A-like
[Ornithorhynchus anatinus]
Length = 436
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 143/304 (47%), Gaps = 53/304 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 68 VWILGRQHHLKAEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 116
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W K EY +IL F D + +SIH + Q G
Sbjct: 117 MLRCGQMMLAQALICRHLGRDWCWEKHKKQPEEYHKILQCFLDRKDCCYSIHQMAQMGVG 176
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 177 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 219
Query: 293 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 328
D + C + +G A W P+LL+VPL LG+ +NP Y
Sbjct: 220 DIKKMCRLLPQGSGMAQDGPPLHLSALGRSKNASGYCAIWKPLLLIVPLRLGINHINPIY 279
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ +++ + D +
Sbjct: 280 IDAFKECFKTPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQTFVDT-EENGQVDDHS 338
Query: 389 YHSE 392
+H +
Sbjct: 339 FHCQ 342
>gi|225709006|gb|ACO10349.1| Cysteine protease ATG4B [Caligus rogercresseyi]
Length = 381
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 96/295 (32%), Positives = 150/295 (50%), Gaps = 44/295 (14%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
S S +W+LG + + + + E N + SR L +YRK F I DS TSD
Sbjct: 28 SDSPVWILG-----------NELSARDDVEELNSEVLSRFLFTYRKEFLEIEDSGYTSDS 76
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD----REYVEILHLFGDSETSPFSIH 224
GWGCMLR QM++A+AL LGR W+ Q+ D ++Y++IL LF DS+ +P+S+H
Sbjct: 77 GWGCMLRCGQMVLAEALQRVSLGREWKWSSQETLDNDQSQKYLQILKLFQDSKAAPYSLH 136
Query: 225 NLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
+ G++ G+W GP + + L + +ET + P+ ++V +
Sbjct: 137 QIALMGESIQSKKPVGTWFGPNTIA---QVLRKLSVSET-----TNPIRVHVAMDN---- 184
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
V +D+ C F + P+LL +PL LGL ++NP Y L+ F FPQ L
Sbjct: 185 -----TVIVDEIKESCG-FIGDPSQGKPLLLFIPLRLGLTEINPIYFQDLKECFEFPQIL 238
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPH-----DVQPVINIGKDDLEADTSTYHSE 392
G++GG+P + Y +G + IYLDPH V+ +G ++ TYH++
Sbjct: 239 GVIGGRPNHALYFIGYMDNELIYLDPHVATQTSTPQVVTLGG----SEDKTYHTD 289
>gi|91083193|ref|XP_972923.1| PREDICTED: similar to Autophagy-specific protein, putative
[Tribolium castaneum]
gi|270006970|gb|EFA03418.1| hypothetical protein TcasGA2_TC013405 [Tribolium castaneum]
Length = 366
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 93/243 (38%), Positives = 127/243 (52%), Gaps = 21/243 (8%)
Query: 135 NGLAEFN---QDFSSRILISYRKGFDPIG-DSKITSDVGWGCMLRSSQMLVAQALLFHRL 190
N L E + QD S+I +YRK F PIG D +T+D GWGCMLR QM++AQAL+ L
Sbjct: 33 NALQELDTIRQDILSKIWFTYRKNFVPIGGDEGLTTDKGWGCMLRCGQMVLAQALVTLHL 92
Query: 191 GRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
GR W +P K D Y++IL F D +PFSIH + G + G W GP + +
Sbjct: 93 GRDWVWEPETK--DSTYLKILSKFVDKRQAPFSIHQIAMMGVSENKEVGQWFGPNTVAQV 150
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
+ L + +L + + E +C+ S CS DW
Sbjct: 151 LKKLVKYDEWSAIEMHIALDNTLIISDIRE---------LCLSQGSDGCS-----SGDWK 196
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+LL+VPL LGL+++NP Y L+ F F QSLG++GGKP + Y +G + IYLDPH
Sbjct: 197 PLLLIVPLRLGLQEINPIYASGLKKCFQFKQSLGVIGGKPNLALYFIGHVGDEVIYLDPH 256
Query: 370 DVQ 372
Q
Sbjct: 257 TTQ 259
>gi|241999098|ref|XP_002434192.1| cystein protease, putative [Ixodes scapularis]
gi|215495951|gb|EEC05592.1| cystein protease, putative [Ixodes scapularis]
Length = 382
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 94/274 (34%), Positives = 139/274 (50%), Gaps = 41/274 (14%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
L + D +S+I ++YRK F IG + TSD GWGCMLR QM++AQAL+ LGR WR
Sbjct: 35 LDDLRSDVTSKIWLTYRKNFPAIGGTGPTSDSGWGCMLRCGQMVLAQALMRRHLGREWRW 94
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
+P K +++Y+ IL +F D + FSIH + Q G + G G W GP + LA
Sbjct: 95 EPGTK--NKDYLYILRMFQDKKNCTFSIHQIAQMGVSEGKTVGEWFGPNTVAHVLRKLAI 152
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR-HCSVF------------- 301
+ + +AI+V + V I++ S+ C ++
Sbjct: 153 FDKWSS--------LAIHVAMDN---------TVIINEISKFRCHIWAAADGLVRNRTNS 195
Query: 302 -----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
+ + W P+LL +PL LGL ++N Y L+ TF QSLG++GGKP + Y +
Sbjct: 196 EPSRPANSEGSWKPLLLFIPLRLGLSEINRIYAFGLKRTFALKQSLGMIGGKPNHALYFI 255
Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
GV E+ I+LDPH Q ++ D D +YH
Sbjct: 256 GVVEDELIFLDPHTTQLACDLDVD--SPDDQSYH 287
>gi|148233205|ref|NP_001088025.1| cysteine protease ATG4B [Xenopus laevis]
gi|61211762|sp|Q640G7.1|ATG4B_XENLA RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|52221191|gb|AAH82660.1| LOC494717 protein [Xenopus laevis]
Length = 384
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/249 (35%), Positives = 126/249 (50%), Gaps = 36/249 (14%)
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
D +SR+ +YR+ F IG + TSD GWGCMLR QM+ AQAL+ +GR WR QKP
Sbjct: 44 NDITSRLWFTYRRNFQAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHVGRDWRWDKQKP 103
Query: 202 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 261
EY+ IL F D + S +SIH + Q G G G W GP + + LA + +
Sbjct: 104 -KGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKYIGQWYGPNTVAQVLRKLAVFDQWSS 162
Query: 262 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-------------- 307
+A+++ + V +D+ R C S +D
Sbjct: 163 --------IAVHIAMDN---------TVVVDEIRRLCRAGSGESSDAGALSNGYTGDSDP 205
Query: 308 ----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
W P++LL+PL LGL ++N YI TL+ F PQSLG++GG+P ++ Y +G +
Sbjct: 206 SCAQWKPLVLLIPLRLGLSEINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDEL 265
Query: 364 IYLDPHDVQ 372
IYLDPH Q
Sbjct: 266 IYLDPHTTQ 274
>gi|321472665|gb|EFX83634.1| hypothetical protein DAPPUDRAFT_194862 [Daphnia pulex]
Length = 389
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 157/312 (50%), Gaps = 34/312 (10%)
Query: 83 KRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQ 142
KR++ A +E + R G + +W+LG + L E N
Sbjct: 23 KRMLEACEAFVTYESGIILERQGFEVNDEPVWILG-----------REYDTKTKLDELNS 71
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--WRKPLQK 200
D SR+L++YR+ F PIGDS +TSD GWGCMLR QM+VAQAL+ LGR W +
Sbjct: 72 DVKSRLLLTYRRNFPPIGDSGMTSDRGWGCMLRCGQMVVAQALINQHLGRQPFWPVGDDQ 131
Query: 201 PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
Y +IL LF D +T+ +SIH L Q G + G G W GP + + + L+
Sbjct: 132 RTTESYKKILKLFEDKKTAVYSIHQLAQMGVSEGKEIGQWFGPNTVAQVLKKLSEYDEWS 191
Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC--SVFSKGQADWTPILLLVPLV 318
+ I+V + V I++ + C + + W+P+LL+VPL
Sbjct: 192 A--------LKIHVAMDN---------AVVIEEIEQLCHKKITPTETSTWSPLLLVVPLR 234
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
LGL +NP YI +L+ PQS+G++GGKP + Y +G + ++LDPH Q I++
Sbjct: 235 LGLLNINPIYIDSLKACLQMPQSIGMIGGKPSQALYFIGYVGDDVVFLDPHLTQNAIDLD 294
Query: 379 KDDLEADTSTYH 390
+D E D S+YH
Sbjct: 295 ED--EFDDSSYH 304
>gi|195054945|ref|XP_001994383.1| GH16873 [Drosophila grimshawi]
gi|193892146|gb|EDV91012.1| GH16873 [Drosophila grimshawi]
Length = 673
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 97/277 (35%), Positives = 142/277 (51%), Gaps = 19/277 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + + + G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 253 AAENQVTECPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLIA 312
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S+ SPFSIH L++ G+ G
Sbjct: 313 QGLICHFLGRSWRYDPESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 372
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E
Sbjct: 373 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAQDCTIYMQDVEQQCSIPEPAPKQ 432
Query: 288 VVCIDDASRHCSVFSK----GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V A + S K Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 433 HVPWQHAKKSTSDAPKLDQPPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 492
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
I+GGKP S Y VG QE+ I+LDPH Q ++++ ++
Sbjct: 493 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQE 529
>gi|340722130|ref|XP_003399462.1| PREDICTED: cysteine protease ATG4D-like [Bombus terrestris]
Length = 485
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 94/266 (35%), Positives = 133/266 (50%), Gaps = 25/266 (9%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
A+ + + + EF +DF+SR+ ++YR+ F + S TSD GWGCMLRS QM++AQAL+
Sbjct: 131 AMDAISFEDSIEEFKKDFTSRLWLTYRREFPILNGSTFTSDCGWGCMLRSGQMMLAQALV 190
Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
H LGR WR + +P E + I+ FGD TSPFSIH L+ G G
Sbjct: 191 CHFLGREWRWQVDQPLKTEQQKLDEYNHRLIIKSFGDLPDSTSPFSIHTLVSLGALSGKR 250
Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 296
AG W GP ++ Q AE +L A+YV V + D
Sbjct: 251 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 298
Query: 297 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
C + W ++L VPL LG +K+N Y L T +G++GG+P S Y +
Sbjct: 299 VCQM---PDGKWKSLILFVPLRLGADKLNLVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 355
Query: 357 GVQEESAIYLDPHDVQPVINIGKDDL 382
G QE+ I LDPH Q +++ KD+
Sbjct: 356 GFQEDKLINLDPHYCQETVDVLKDNF 381
>gi|213390042|gb|ACJ46060.1| autophagy related protein Atg4-like protein [Bombyx mori]
Length = 355
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 93/258 (36%), Positives = 132/258 (51%), Gaps = 25/258 (9%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
G+ F DF S+I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR W
Sbjct: 15 EGIEGFKSDFVSKIWMTYRREFPTMTGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRSW 74
Query: 195 RKPLQKPFD--REYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 244
R +KP RE+ E I+ FGD S SP SIH ++ G+A G G W GP
Sbjct: 75 RWLPEKPIQNAREFQEDCLHRKIIKWFGDKSSVNSPLSIHQMVSLGEALGKKPGDWYGPA 134
Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 304
++ ++L E + + +YV V I D C +
Sbjct: 135 SVAHCLKSLIASASKENY---EFDHLEVYVAQDS---------TVYIQDIYSMCQLL--- 179
Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
W ++LLVP+ LG EK NP Y P L T +GI+GG+P S Y VG Q++ I
Sbjct: 180 HGAWKSLILLVPVKLGTEKFNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVGYQDDKLI 239
Query: 365 YLDPHDVQPVINIGKDDL 382
+LDPH Q ++++ + +
Sbjct: 240 HLDPHYCQEMVDVWQPNF 257
>gi|225718596|gb|ACO15144.1| Cysteine protease ATG4B [Caligus clemensi]
Length = 390
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 90/269 (33%), Positives = 140/269 (52%), Gaps = 38/269 (14%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
S S +W+LG + N +AE N + SR+L +YRK F I S TSD
Sbjct: 28 SDSPVWILG-----------NELCARNDIAELNSEVLSRLLFTYRKEFSEIDGSGYTSDS 76
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 222
GWGCMLR QM++ +AL LGR W+ + + +Y++IL+LF DS+ +P+S
Sbjct: 77 GWGCMLRCGQMVLGEALQRISLGRDWKWDHKVDNEVDEDLKGKYLKILNLFQDSKVAPYS 136
Query: 223 IHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
IH + G++ G+W GP + + + L+ ++ ++P+ ++V +
Sbjct: 137 IHQIALMGESIQSKKPVGTWFGPNTVAQVLKKLSFFEK--------TVPIRLHVAMDN-- 186
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
V ID+ C F G ++ P+LL +PL LGL ++NP Y L+ F FPQ
Sbjct: 187 -------TVIIDEIKESCG-FVGGDSE-KPLLLFIPLRLGLTEINPIYFQDLKECFEFPQ 237
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPH 369
LG++GG+P + Y +G + IYLDPH
Sbjct: 238 ILGVIGGRPNHALYFIGYVDNELIYLDPH 266
>gi|291202714|dbj|BAI82576.1| autophagy-related 4 [Haemaphysalis longicornis]
Length = 387
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 92/272 (33%), Positives = 136/272 (50%), Gaps = 39/272 (14%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
L + D +S+I ++YR+ F I + TSD GWGCMLR QM VA+AL+ L R W+
Sbjct: 41 LDDLRSDVTSKIWLTYRRNFPAISGTDYTSDTGWGCMLRCGQMAVAEALMRRHLRRGWQW 100
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
P + D Y+ +L +F D + FSIH + Q G + G A G W GP + LA
Sbjct: 101 APGIR--DESYLRVLRMFQDKKNCTFSIHQIAQMGVSEGKAVGQWFGPNTVAHVLRKLAA 158
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA--------- 306
+ + +AI+V + VV +DD + C + + ++
Sbjct: 159 FDKWSS--------LAIHVAMDN---------VVIMDDIRKVCRLEATAESGVRNRAEPA 201
Query: 307 --------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
W P+LL +PL LGL ++NP Y L+ TF QSLGI+GGKP + YI+GV
Sbjct: 202 GLAAAAAESWKPLLLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYIIGV 261
Query: 359 QEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+ ++LDPH Q +++ D D +YH
Sbjct: 262 VGDDLVFLDPHTTQLAVDL--DTEFPDDESYH 291
>gi|395528686|ref|XP_003766458.1| PREDICTED: cysteine protease ATG4B [Sarcophilus harrisii]
Length = 393
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 94/293 (32%), Positives = 140/293 (47%), Gaps = 21/293 (7%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
+T +W+LG + I ++ E D +SR+ +YRK F IG + TSD
Sbjct: 22 TTEPVWILGRKYTIFTEKE-----------EILSDVTSRLWFTYRKNFPAIGGTGPTSDT 70
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM+ AQAL+ LGR WR + Y +L+ F D + S +SIH + Q
Sbjct: 71 GWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQ 130
Query: 229 AGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
G G + G W GP A+ +W +LA + + + +
Sbjct: 131 MGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCKAGFPCA 190
Query: 280 DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
DG + + + + + W P++LL+PL LGL +N Y TL+ F P
Sbjct: 191 DGAAFPTDSELLSNGYPPAAEVTDRASPWRPLVLLIPLRLGLTDINEAYTETLKHCFMMP 250
Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
QSLG++GGKP ++ Y +G E IYLDPH QP + + + D T+H +
Sbjct: 251 QSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVESTEGGVFPD-ETFHCQ 302
>gi|195401363|ref|XP_002059283.1| GJ16311 [Drosophila virilis]
gi|194156157|gb|EDW71341.1| GJ16311 [Drosophila virilis]
Length = 397
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 151/316 (47%), Gaps = 47/316 (14%)
Query: 91 MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 141
M + E LGP I +D+WLLG + Q+ L
Sbjct: 13 MDSVFEAYLGPDSMLAGAVGEPEDIPKRNTDVWLLGKRYNAIQELEL-----------IR 61
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
+D SR+ +YR GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W P
Sbjct: 62 RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTP 118
Query: 202 FDRE--YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
R+ Y++I++ F D+ S +SIH + G++ A G W+GP + + + L R
Sbjct: 119 DCRDATYLKIVNRFEDTRKSFYSIHQIALTGESQNKAVGEWLGPNTVAQILKILVRFDDW 178
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
+ + ++V V +D+ C S + W P+LL+VPL L
Sbjct: 179 SS--------LVVHVAMDS---------TVVLDEIYTRCQEVSA--STWKPLLLIVPLRL 219
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G+ +NP YIP L+ S G++GG+P + Y +G ++ +YLDPH Q ++ +
Sbjct: 220 GISDINPMYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGSVAQ 279
Query: 380 DDLEADT---STYHSE 392
A+ +YH +
Sbjct: 280 KTTAAEQELDESYHQK 295
>gi|347971093|ref|XP_554420.4| AGAP004023-PA [Anopheles gambiae str. PEST]
gi|333469628|gb|EAL39379.4| AGAP004023-PA [Anopheles gambiae str. PEST]
Length = 606
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 143/279 (51%), Gaps = 32/279 (11%)
Query: 136 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
G+ F +DF SRI ++YR+ F + DS TSD GWGCM+RS QML+AQ L+ H LGR WR
Sbjct: 195 GIDAFRRDFISRIWMTYRREFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLVAHFLGRSWR 254
Query: 196 KPLQKPFDRE---YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 250
+ E + +++ FGD S+TSPFSIH L+ GK G G W GP A+
Sbjct: 255 WDVSMFTAYEESIHRKVIRWFGDTSSKTSPFSIHTLVALGKESGKKPGDWYGPGAVAHLL 314
Query: 251 EALARCQRAET----GLGCQ-SLPMAIYV--------VSGDEDG---ERGGAPVVCIDDA 294
R E G+ + A+Y+ V G +R GAP +
Sbjct: 315 RQAVRLAAQEITDLDGINVYVAQDCAVYIQDILDECTVPATPAGAPWQRKGAPGGTNSSS 374
Query: 295 SRH------CSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
S + ++G D W ++LLVPL LG +K+NP Y L+ + +G
Sbjct: 375 STAHTERSGATSCAEGDEDVQSAHWKSLILLVPLRLGTDKLNPIYNECLKAMLSLDYCIG 434
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
I+GG+P S Y VG QE+ I+LDPH Q ++++ +D+
Sbjct: 435 IIGGRPKHSLYFVGYQEDKLIHLDPHYCQDMVDVNQDNF 473
>gi|345307034|ref|XP_001513122.2| PREDICTED: cysteine protease ATG4B-like [Ornithorhynchus anatinus]
Length = 461
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 98/309 (31%), Positives = 143/309 (46%), Gaps = 54/309 (17%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
+T +W+LG + I ++ + D +SR+ +YRK F IG + TSD
Sbjct: 91 TTEPVWILGRKYTIFTEKE-----------DILSDVTSRLWFTYRKNFPAIGGTGPTSDT 139
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM+ AQALL LGR WR + Y +L+ F D + S +SIH + Q
Sbjct: 140 GWGCMLRCGQMIFAQALLCRHLGRDWRWKKGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQ 199
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G + G W GP + + + LA + +A+++ +
Sbjct: 200 MGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSS--------LAVHIAMDN---------T 242
Query: 289 VCIDDASRHC--------SVF-----------------SKGQADWTPILLLVPLVLGLEK 323
V I++ R C S F + W P++LL+PL LGL +
Sbjct: 243 VVIEEIRRLCKPNFPAGASAFPTDSEFLLNGFPSGAEVTNRPTQWKPLVLLIPLRLGLTE 302
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+N YI TL+ F PQSLG++GGKP ++ Y +G IYLDPH QP + I
Sbjct: 303 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQPAVEISGSCFI 362
Query: 384 ADTSTYHSE 392
D S +H +
Sbjct: 363 PDES-FHCQ 370
>gi|195118032|ref|XP_002003544.1| GI17971 [Drosophila mojavensis]
gi|193914119|gb|EDW12986.1| GI17971 [Drosophila mojavensis]
Length = 382
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 92/293 (31%), Positives = 143/293 (48%), Gaps = 44/293 (15%)
Query: 91 MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 141
M + E LGP I +++WLLG + Q+ L
Sbjct: 13 MDSVFEAYLGPDGVLAGAVGEIEDIPKRNTNVWLLGKRYNAIQE-----------LEPIR 61
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
+D SR+ +YR GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W P
Sbjct: 62 RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTP 118
Query: 202 FDRE--YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
R+ Y++I++ F D+ S +SIH + G++ A G W+GP + + + L R
Sbjct: 119 DCRDATYLKIVNRFEDTRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKILVRFDDW 178
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
+ +A++V V +DD C ++ W P+LL+VPL L
Sbjct: 179 SS--------LAVHVAMDS---------TVVLDDIYTCCQ--ESSESSWKPLLLIVPLRL 219
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
G+ +NP YIP L+ S G++GG+P + Y +G ++ +YLDPH Q
Sbjct: 220 GITDINPIYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQ 272
>gi|242007959|ref|XP_002424782.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
gi|212508305|gb|EEB12044.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
Length = 388
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 106/295 (35%), Positives = 157/295 (53%), Gaps = 26/295 (8%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + +W+LG + +D + D S++ +YRKGF PIGDS +T
Sbjct: 21 IPQTREPVWILGRKYDAGRD-----------VTAIRSDIKSKLWFTYRKGFVPIGDSGLT 69
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 223
SD GWGCMLR QM++AQAL+ LGR WR K ++P EY+ IL +F D++T+ +SI
Sbjct: 70 SDKGWGCMLRCGQMVLAQALVCLHLGRDWRWKKDSKEP---EYLRILKMFEDTKTATYSI 126
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G + G G W GP + + + L+ + + + +L I V +R
Sbjct: 127 HQIALMGVSEGKDVGQWFGPNTVTQVLKKLSVYDKWSSIVIHVALDNTIIVNDIKSLCQR 186
Query: 284 GGAPVVCIDDASRHCS-----VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
V ID +++ S V+ W P+LL+VPL LGL ++NP Y+ L+ FTF
Sbjct: 187 NEQSV--IDSSAQKHSPLNEPVYFNSARKWKPLLLVVPLRLGLSEINPVYLNGLKTCFTF 244
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYH 390
QSLG++GGKP + Y +G E IYLDPH QPV + +L + + +YH
Sbjct: 245 RQSLGVIGGKPNHALYFIGCVGEHVIYLDPHTTQPVSIVDGKELSYEKTADLSYH 299
>gi|406042044|gb|AFS31124.1| autophagy related protein Atg4-like protein, partial [Spodoptera
litura]
Length = 365
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 90/270 (33%), Positives = 136/270 (50%), Gaps = 25/270 (9%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + +W+LG + QD L +D +S I +YRKGF PIGD +T
Sbjct: 5 IPQTKESVWILGKKYSAIQD-----------LDRIRRDITSIIWCTYRKGFIPIGDEGLT 53
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 223
SD GWGCMLR QM++ AL+ L W + P R+ Y++I+ F + + +P+SI
Sbjct: 54 SDKGWGCMLRCGQMVLGVALVRVHLSADW---VWTPETRDPTYLKIIQRFEERKQAPYSI 110
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G + G G W GP + + + L + + + I+V + +
Sbjct: 111 HQVALMGASEGKQVGQWFGPNTVAQVLKKLTVYDKWSS--------LVIHVALDNTVVKE 162
Query: 284 GGAPVVCIDDASRHCSVFSKGQA-DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
+++ CS DW P+LL+VPL LGL ++NP YI L++ F PQS+
Sbjct: 163 DILQQCVVNNDRGDCSAAPDSLVTDWMPLLLIVPLRLGLSEINPIYIDGLKICFQCPQSI 222
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
G++GGKP + Y+VG + IYLDPH Q
Sbjct: 223 GVIGGKPNQALYLVGCVGDEVIYLDPHTTQ 252
>gi|346466653|gb|AEO33171.1| hypothetical protein [Amblyomma maculatum]
Length = 401
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 92/261 (35%), Positives = 137/261 (52%), Gaps = 16/261 (6%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
L + D +S+I ++YRK F I + TSD GWGCMLR QM++A+AL+ LG+ W+
Sbjct: 54 LDDLRNDVTSKIWLTYRKNFPAISGTDHTSDTGWGCMLRCGQMVIAEALMRRHLGKGWQW 113
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
P + D Y+ +L +F D + +SIH + Q G + G A G W GP + L+
Sbjct: 114 APGIR--DENYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKAVGQWFGPNTIAHVLRKLSA 171
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA-----DWTP 310
+ + L + V+ R P V DD RH S G A W P
Sbjct: 172 FDKW-SSLAVHVAMDNVVVMDDIRKICRVETPAV--DDGVRH-RTQSHGLACASAVSWKP 227
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
+LL +PL LGL ++NP Y L+ TF QS+GI+GGKP + +I+GV + ++LDPH
Sbjct: 228 LLLFIPLRLGLNEINPVYYCGLKRTFALKQSVGIIGGKPNHALFIIGVVGDDLVFLDPHT 287
Query: 371 VQPVINIGKDDLE-ADTSTYH 390
Q +++ D+E + +YH
Sbjct: 288 TQLAVDL---DVEFPEDESYH 305
>gi|357620505|gb|EHJ72670.1| putative Autophagy-specific protein [Danaus plexippus]
Length = 383
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 98/295 (33%), Positives = 148/295 (50%), Gaps = 31/295 (10%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + ++W+LG + QD L +D +S I +YRKGF PIGD +T
Sbjct: 22 IPETKDNVWVLGKKYSAIQD-----------LERIRRDITSVIWCTYRKGFVPIGDEGLT 70
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 223
SD GWGCMLR QM++ AL+ L W + P R+ Y++I+ + + +P+SI
Sbjct: 71 SDKGWGCMLRCGQMVLGVALIKVHLSADW---VWTPETRDPTYLKIVQRLEERKQAPYSI 127
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G G G W GP + + + L + + + I+V + D +
Sbjct: 128 HQVALMGACEGKEVGQWFGPNTVAQVLKKLVVYDKWSS--------LVIHV-ALDNTVVK 178
Query: 284 GGAPVVCIDDASR-HCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
CI + R CS G +DW P+LL+VPL LGL ++NP Y+ L++ F PQ
Sbjct: 179 EDILQQCIVNNDRGDCSENVDGFVVSDWMPLLLIVPLRLGLSEINPIYMEGLKICFQSPQ 238
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP---VINIGKDDLEADTSTYHSE 392
S+G++GGKP + Y++G + IYLDPH Q V N D+ + TYH +
Sbjct: 239 SIGVIGGKPNQALYLIGCVGDEVIYLDPHTTQKSGLVENKLTDEQKEMDCTYHCK 293
>gi|427783027|gb|JAA56965.1| Putative cysteine protease required for autophagy [Rhipicephalus
pulchellus]
Length = 390
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 89/276 (32%), Positives = 139/276 (50%), Gaps = 44/276 (15%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
L + + +S+I ++YRK F I + TSD GWGCMLR QM+VA+A++ LG+ W+
Sbjct: 41 LDDLRSNITSKIWLTYRKNFPAISGTDYTSDTGWGCMLRCGQMVVAEAVMRRHLGKDWQW 100
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
P K D +Y+ +L +F D + +SIH + Q G + G G W GP + L+
Sbjct: 101 SPGTK--DEKYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKEVGQWFGPNTIAHVLRKLST 158
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK------------ 303
+ + +A++V + VV +DD + C V +
Sbjct: 159 FDKWSS--------LAMHVAMDN---------VVVMDDIRKICRVETTTDVEDGIRNRTQ 201
Query: 304 --------GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
G W P++L +PL LGL ++NP Y L+ TF QSLGI+GGKP + YI
Sbjct: 202 SHGGPAAAGARSWKPLVLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYI 261
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLEA-DTSTYH 390
+GV + ++LDPH Q +++ D+E + +YH
Sbjct: 262 IGVVGDDLVFLDPHTTQLAVDL---DVECPEDESYH 294
>gi|449266947|gb|EMC77925.1| Cysteine protease ATG4B, partial [Columba livia]
Length = 393
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 98/282 (34%), Positives = 138/282 (48%), Gaps = 39/282 (13%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQVDNYFSVLNAFVDRKDSYYSIHQIAQMGVG 133
Query: 233 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 273
G + G W GP + + +W +LA E CQS A
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNAPCAGAAA 193
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
+ + DG G P ++A ++ W P++LL+PL LGL ++N YI TL+
Sbjct: 194 CPAVESDGLYNGCP----EEAG-----VRDRRSLWKPLVLLIPLRLGLTEINEAYIETLK 244
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV 286
>gi|164660504|ref|XP_001731375.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
gi|159105275|gb|EDP44161.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
Length = 651
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 119/369 (32%), Positives = 176/369 (47%), Gaps = 51/369 (13%)
Query: 22 PNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAA 81
P+ AS SE+ S+ S S ++ +S+ S+ SA E ++ +
Sbjct: 177 PSEDTASAASEVLSTSSYSPDTPSTATAVDSSHQ-----SDPSAKETPLCPSQMHSSQQP 231
Query: 82 VKRLVTAGSMRRIHERVLGPSRTGISSST-------SDIWLLGVCHKIAQDEALGDAAGN 134
+ ++ + E VLG S T +S T + W L H + A
Sbjct: 232 ISDHQPVSTLLSLVEAVLGSSDTLPTSVTWLAHQLKARGWELLASHGVPYTSPTAHTAFP 291
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
+ F + +++R F TSDVGWGCMLRS Q ++A AL+ LGR W
Sbjct: 292 GVWHSVHAVFQHILSLTHRTCF--------TSDVGWGCMLRSVQSMLANALIRVHLGRHW 343
Query: 195 RKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCR 248
R+ ++ +Y IL F D + PFSIH L+ G+ G+ AG W GP +A+C+
Sbjct: 344 RRRAKQKTHPQYARILSWFMDDPSLECPFSIHRLVDEGQRLGVQAGDWFGPSTAAFALCK 403
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD- 307
+A C GLG VV+ D G VV + F+ G++D
Sbjct: 404 LIQAYDAC-----GLGV--------VVTND--GMLYKEQVVA--------ASFAPGRSDP 440
Query: 308 WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
WT P+L+L+ LGL++V P Y P L+ +FT PQS+G+VGG+P +S Y VGVQ E + L
Sbjct: 441 WTRPVLILLVQRLGLDQVPPHYRPALKQSFTMPQSVGVVGGRPRSSLYFVGVQREHLLCL 500
Query: 367 DPHDVQPVI 375
DPH V+P +
Sbjct: 501 DPHHVRPCV 509
>gi|47212536|emb|CAF90552.1| unnamed protein product [Tetraodon nigroviridis]
Length = 366
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 92/250 (36%), Positives = 125/250 (50%), Gaps = 42/250 (16%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YRKGF PIG + TSD GWGCMLR QM++ QAL+ LGR WR +
Sbjct: 68 DVTSRLWFTYRKGFPPIGGTGPTSDTGWGCMLRCGQMILGQALMCRHLGRDWRWVSGEEQ 127
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
EYV IL+ F D + S +SIH + + +C W A A G
Sbjct: 128 RHEYVNILNAFIDKKDSYYSIHQIER-----------------LCMPWLDKAEACAASEG 170
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
+G + +G GA C+ + A W P++LL+PL LGL
Sbjct: 171 VG-------------ELNGYLEGA-----------CAFSEEETALWKPLVLLIPLRLGLT 206
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH Q ++ +D
Sbjct: 207 DINEAYIETLKKCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQTAVDPCEDGT 266
Query: 383 EADTSTYHSE 392
D S YH +
Sbjct: 267 FTDDS-YHCQ 275
>gi|354475125|ref|XP_003499780.1| PREDICTED: cysteine protease ATG4D [Cricetulus griseus]
gi|344240088|gb|EGV96191.1| Cysteine protease ATG4D [Cricetulus griseus]
Length = 474
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 92/288 (31%), Positives = 136/288 (47%), Gaps = 51/288 (17%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
G + F +DF SR+ ++YR+ F P+ +TSD GWGCMLRS QM++AQ LL H L R
Sbjct: 105 GEGDIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 164
Query: 193 PWR-------KPLQKP---------------------------FDREYVEILHLFGDSET 218
WR P + P DR + I+ F D
Sbjct: 165 DWRWVEGTGLAPPEMPGPASPSRYRGPGRHVPPRWTQGTLEMEQDRWHRRIVSWFADHPQ 224
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 278
+PF +H L++ G++ G AG W GP +A R C +P + VS D
Sbjct: 225 APFGLHRLVELGQSSGKKAGDWYGP-------SVVAHILRKAVE-KCSEVPRLVVYVSQD 276
Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
V D +R S + A+W +++LVP+ LG E +NP Y+P ++
Sbjct: 277 C--------TVYKADVARLVS-WPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELLRS 327
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
LGI+GGKP S Y +G Q++ +YLDPH QP +++ + D ++
Sbjct: 328 ELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLES 375
>gi|224059752|ref|XP_002193231.1| PREDICTED: cysteine protease ATG4B [Taeniopygia guttata]
Length = 393
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/299 (34%), Positives = 142/299 (47%), Gaps = 40/299 (13%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQMDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 233 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 273
G + G W GP + + +W +LA E CQS A
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSHVPCAGAAA 193
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
+ + D G P +D + A W P++LL+PL LGL ++N YI TL+
Sbjct: 194 CPALESDVLYNGCP----EDVG-----LRERLALWKPLVLLIPLRLGLTEINEAYIETLK 244
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
F PQSLG++GGKP ++ Y +G E IYLDPH QP + G D S +H +
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPGDSGCLPDES-FHCQ 302
>gi|126338580|ref|XP_001366892.1| PREDICTED: cysteine protease ATG4B-like [Monodelphis domestica]
Length = 396
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 99/312 (31%), Positives = 143/312 (45%), Gaps = 59/312 (18%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
+T +W+LG + I +DE L D +SR+ +YRK F IG + TS
Sbjct: 25 TTDPVWILGRKYTIFTEKDEILSDV-------------TSRLWFTYRKNFPAIGGTGPTS 71
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR + Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQI 131
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA + +A+++ +
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDN-------- 175
Query: 287 PVVCIDDASRHCSV-FSKGQA-------------------------DWTPILLLVPLVLG 320
V ++D R C FS A W P++LL+PL LG
Sbjct: 176 -TVVMEDIRRLCKANFSHTDAAALPPDSDLLSNGYPPGAEVTDRLSQWRPLVLLIPLRLG 234
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
L +N Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH Q + +
Sbjct: 235 LTDINEAYTETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQAAVELSNG 294
Query: 381 DLEADTSTYHSE 392
+ D S +H +
Sbjct: 295 GVIPDES-FHCQ 305
>gi|53132082|emb|CAG31871.1| hypothetical protein RCJMB04_12m14 [Gallus gallus]
Length = 343
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 93/267 (34%), Positives = 126/267 (47%), Gaps = 48/267 (17%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 198
E D +SR+ +YRK F IG + TSD GWGCMLR QM+ AQAL+ LGR WR
Sbjct: 40 EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWIK 99
Query: 199 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR---------S 249
K Y +L+ F D + S +SIH + Q G G + G W GP + + +
Sbjct: 100 GKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFDT 159
Query: 250 WEALA----------------RCQRAETGLGCQSLPM----AIYVVSGDEDGERGGAPVV 289
W +LA CQ + G + P +Y +E G R +
Sbjct: 160 WSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPAVEADVLYNGYPEEAGVRDKLSL- 218
Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
W P++LL+PL LGL ++N YI TL+ F PQSLG++GGKP
Sbjct: 219 ------------------WKPLVLLIPLRLGLTEINEAYIETLKHCFMMPQSLGVIGGKP 260
Query: 350 GASTYIVGVQEESAIYLDPHDVQPVIN 376
++ Y +G E IYLDPH QP +
Sbjct: 261 NSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|328769729|gb|EGF79772.1| hypothetical protein BATDEDRAFT_35298 [Batrachochytrium
dendrobatidis JAM81]
Length = 441
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/262 (37%), Positives = 135/262 (51%), Gaps = 30/262 (11%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 198
F DF SR+ ++YRKGF I + T D GWGCMLRS QMLVA ALLFH LGR WR L
Sbjct: 137 HFLDDFHSRLWMTYRKGFAAIKPTGYTCDSGWGCMLRSGQMLVANALLFHELGRDWR--L 194
Query: 199 QKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 254
DR+ Y IL F D TSP+SI + G + G W GP + + + L
Sbjct: 195 GDSNDRDTWLTYCSILTKFLDVNTSPYSIQRIATLGIRFDKQIGEWFGPSTISQVLKVLV 254
Query: 255 RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILL 313
Q + + ++V DG + I A+R G+ TP +L+
Sbjct: 255 NDD--------QRISLKVHV---SNDGVVYKNEINTILSATR-----DDGK---TPAVLI 295
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
++PL LG+E +NP Y P ++ F +GI GG+P +S + +GV + IYLDPH ++P
Sbjct: 296 MIPLRLGVETMNPVYYPGVKHCFAMSHCVGIAGGRPNSSLFFLGVDGDHLIYLDPHHLRP 355
Query: 374 VI---NIGKDDLEADTSTYHSE 392
+ +I +E D +YH E
Sbjct: 356 SVDSRDITSYKME-DLLSYHCE 376
>gi|326925776|ref|XP_003209085.1| PREDICTED: cysteine protease ATG4B-like [Meleagris gallopavo]
Length = 393
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 96/292 (32%), Positives = 135/292 (46%), Gaps = 59/292 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 233 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 267
G + G W GP + + +W +LA CQ + G +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193
Query: 268 LPM----AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
P +Y +E G R + W P++LL+PL LGL +
Sbjct: 194 CPTVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV 286
>gi|351713264|gb|EHB16183.1| Cysteine protease ATG4B [Heterocephalus glaber]
Length = 475
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/305 (33%), Positives = 147/305 (48%), Gaps = 41/305 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 100 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 146
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 147 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQI 206
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
Q G G + G W GP A+ +W +LA + + I +
Sbjct: 207 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLA----VHVAMDNTVVMEEIRRLCR 262
Query: 278 DEDGERGGAPVVCIDDASRHCSVF----------SKGQADWTPILLLVPLVLGLEKVNPR 327
G A + DA RHC+ F S + W P++LL+PL LGL +N
Sbjct: 263 SSLPCSGAAALPA--DADRHCNGFPAPMEVTSRPSPSPSPWRPLVLLIPLRLGLTDINEA 320
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
Y+ TL+ F PQSLG++GGKP ++ Y +G + IYLDPH QP + + D
Sbjct: 321 YVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGKELIYLDPHTTQPAVELTDGCFIPD-E 379
Query: 388 TYHSE 392
T+H +
Sbjct: 380 TFHCQ 384
>gi|340369400|ref|XP_003383236.1| PREDICTED: cysteine protease ATG4A-like [Amphimedon queenslandica]
Length = 394
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 94/279 (33%), Positives = 137/279 (49%), Gaps = 40/279 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
++LLGV + + +D A F +D SR +YRK F PIGD+ TSD GWGC
Sbjct: 45 VYLLGVKYDLPRDGA-----------SFVEDLQSRFWFTYRKNFRPIGDTGYTSDSGWGC 93
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
LR QML+ LL LGR WR D +Y +IL +F D S +SI + G
Sbjct: 94 TLRCGQMLLGHTLLLRHLGRDWRWSPSSSNDYKYQKILRMFLDYRDSEYSIQMIALQGAD 153
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
+G + G W GP + ++ + LA + Q +A+YV +V ID
Sbjct: 154 FGRSVGQWFGPNNVAQAIKRLA--------VHDQWSEVAVYVAMD---------MLVVID 196
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D S ++ P+L+ +PL LG E+ N Y ++ F QS+GI+GGKP +
Sbjct: 197 DIS-----------NFRPVLVFIPLRLGQERFNMEYKEAVKACFAVRQSVGIIGGKPRHA 245
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
+ G ++ IYLDPH Q + + + +D STYH+
Sbjct: 246 LWFTGYHDDYLIYLDPHKTQSCVTLPDAGIVSD-STYHT 283
>gi|47087191|ref|NP_998738.1| cysteine protease ATG4B [Gallus gallus]
gi|61211779|sp|Q6PZ02.1|ATG4B_CHICK RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related cysteine endopeptidase 2B;
Short=Autophagin-2B; Short=cAut2B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|45861662|gb|AAS78584.1| AUT2B [Gallus gallus]
Length = 393
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 96/292 (32%), Positives = 135/292 (46%), Gaps = 59/292 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 233 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 267
G + G W GP + + +W +LA CQ + G +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193
Query: 268 LPM----AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
P +Y +E G R + W P++LL+PL LGL +
Sbjct: 194 CPAVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV 286
>gi|321472016|gb|EFX82987.1| hypothetical protein DAPPUDRAFT_302128 [Daphnia pulex]
Length = 405
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 100/292 (34%), Positives = 143/292 (48%), Gaps = 37/292 (12%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
S S IWLLG + + + N DF SRI ++YRK F + S TSD
Sbjct: 18 SKDSPIWLLGRIYHQSHKTDDSSSLPTNNFEALKSDFFSRIWLTYRKEFPVLNGSYYTSD 77
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPFDREYVEILHLFGD--S 216
GWGCMLRS QML+AQAL+ H LGR WR + LQ+ R I+ FGD S
Sbjct: 78 CGWGCMLRSGQMLLAQALVCHFLGRDWRWNESGAQEQQTLQESLHR---MIVQWFGDKPS 134
Query: 217 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
P SIH ++ G + G G W GP ++ S+ QRA T + + +Y+
Sbjct: 135 PACPLSIHQMVSQGHISAGKRPGDWYGPSSV--SYIIKQILQRA-TDTYPELDTLRVYIA 191
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQAD----------WTPILLLVPLVLGLEKVN 325
V +DD + CS + + W ++LL+PL LG E++N
Sbjct: 192 QD---------CTVYLDDVKQSCSKICNYECEETDYELIDDQWKSLILLIPLRLGGERMN 242
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
P Y L+ + Q +GI+GGKP S Y +G Q++ I+LDPH+ Q ++++
Sbjct: 243 PTYDSCLKGLLSLEQCIGIIGGKPKHSQYFIGWQDDYLIHLDPHNCQEMVDV 294
>gi|194759168|ref|XP_001961821.1| GF15159 [Drosophila ananassae]
gi|190615518|gb|EDV31042.1| GF15159 [Drosophila ananassae]
Length = 402
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 151/305 (49%), Gaps = 42/305 (13%)
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYR 153
+ + V G I +D+W+LG + Q+ L +D SR+ +YR
Sbjct: 31 VGQAVGGGESEDIPRRNTDVWVLGKRYNAIQELEL-----------IRRDIQSRLWCTYR 79
Query: 154 KGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILH 211
GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W P R+ Y++I++
Sbjct: 80 CGFAPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPECRDATYLKIVN 136
Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
F D + S +SIH + G++ A G W+GP + + + L R +A
Sbjct: 137 RFEDVKNSCYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVRFD--------DWCSLA 188
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
++V V +DD C +G + W P+LL++PL LG+ +NP Y+P
Sbjct: 189 VHVAMDS---------TVVLDDIYSLCR---EGDS-WKPLLLVIPLRLGITDINPMYVPA 235
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----DLEADTS 387
L+ S G++GG+P + Y +G ++ +YLDPH Q +G+ + E D
Sbjct: 236 LKRCLELDSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGTVGQKTGVGEQEYD-E 294
Query: 388 TYHSE 392
TYH +
Sbjct: 295 TYHQK 299
>gi|452837994|gb|EME39935.1| hypothetical protein DOTSEDRAFT_47435 [Dothistroma septosporum
NZE10]
Length = 442
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 92/264 (34%), Positives = 129/264 (48%), Gaps = 47/264 (17%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 175
+EF +D S+I ++YR F PI S TSD GWGCM+R
Sbjct: 111 SEFLEDVESKIWLTYRNNFPPIPKSSEAAATSAMSFTTKLRNFANKDGFTSDTGWGCMIR 170
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
S Q L+A A+L HRLGR WR+ + +REY +IL LF D+ SP SIH ++ G +A G
Sbjct: 171 SGQSLLANAILIHRLGRDWRRGDK---EREYKDILSLFADTPESPLSIHKFVEHGAQACG 227
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA--IYVVSGDEDGERGGAPVVCID 292
G W GP A R AL + E GL S P +YV
Sbjct: 228 TYPGEWFGPNATARCIRALTE-KYHEAGLQVYSRPNDSDVYV------------------ 268
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D+ + + P L+++ + LG+EKV P Y L+ QS+GI GG+P +S
Sbjct: 269 DSLMQTAAQKDADDKFQPTLIVLGIRLGIEKVTPAYHAALKAALELSQSVGIAGGRPSSS 328
Query: 353 TYIVGVQEESAIYLDPHDVQPVIN 376
Y +G Q ++ YLDPH +P+++
Sbjct: 329 HYFIGHQGDNFFYLDPHTTRPMLS 352
>gi|393247625|gb|EJD55132.1| hypothetical protein AURDEDRAFT_78065 [Auricularia delicata
TFB-10046 SS5]
Length = 989
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 134/284 (47%), Gaps = 47/284 (16%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI-----------------------------TSDVGW 170
F DF+SR+ ++YR F PI D + TSD GW
Sbjct: 314 FYADFTSRVWLTYRSQFSPIHDCPLSACKGKDLESLDANPPKRTFWPGSGEKTWTSDAGW 373
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPL---QKPFDREYVEILHLFGDSET--SPFSIHN 225
GCMLR+ Q L+A L+ LGR WR+P P YV+IL F D+ + +PFS+H
Sbjct: 374 GCMLRTGQSLLANTLIHLHLGRDWRRPAINSASPEFATYVKILTWFFDAPSVHAPFSVHR 433
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIYVVSGDEDGERG 284
+ +GK +G G W GP + L RA+ G+ +A+ V + D
Sbjct: 434 MAMSGKDFGKDVGQWFGPSTAAGAIRTLVHDFPRAQLGVA-----IAVDGVLYETDIYSA 488
Query: 285 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
+ D +R S F + W +L+LV LGL+ VNP Y L+ FTFPQSL
Sbjct: 489 SHYPMSSADGARRASGFKRHPGRWGNRAVLVLVATRLGLDGVNPIYYENLKTIFTFPQSL 548
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI-----GKDD 381
GI GG+P +S Y VG Q S YLDPH +P + + G DD
Sbjct: 549 GIAGGRPSSSYYFVGSQGNSLFYLDPHHTRPAVPLRTPPPGDDD 592
>gi|320166566|gb|EFW43465.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 336
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 83/224 (37%), Positives = 118/224 (52%), Gaps = 20/224 (8%)
Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
++YR F I DS +D GWGCMLR QML+A+A+ LG+ W +K +E
Sbjct: 36 MTYRNHFAQIADSYYNTDAGWGCMLRCGQMLLARAMTVQHLGKNWAPTSRKQRHQEMARF 95
Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
L LF D+ +PFSIH + + G+A G G W GP + + + L QR+ + C
Sbjct: 96 LPLFFDTPAAPFSIHRIAERGEALGKTIGQWFGPNTVAQVLKNLVNSQRSSLIVHCA--- 152
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK-VNPRY 328
V++ E + A + D +H +L+LVP+ LGL + +NP Y
Sbjct: 153 -MDGVLNRTEASTQLAA---ALSDGKKHS------------LLVLVPIRLGLNQSINPVY 196
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
IP L+ T PQ LGI+GGKP A+ + VG E+ +YLDPH VQ
Sbjct: 197 IPALKATLELPQCLGIIGGKPNAAHFFVGTVNENVLYLDPHVVQ 240
>gi|118404310|ref|NP_001072464.1| autophagy related 4B, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|115291929|gb|AAI21871.1| cysteine endopeptidase AUT-like (1O128) [Xenopus (Silurana)
tropicalis]
Length = 384
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 87/249 (34%), Positives = 124/249 (49%), Gaps = 36/249 (14%)
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
D +SR+ +YR+ F IG + TSD GWGCMLR QM+ AQALL +GR WR QK
Sbjct: 44 NDITSRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHIGRDWRWDKQKS 103
Query: 202 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 261
EY+ IL F D + S +SIH + Q G G G W GP + + LA + +
Sbjct: 104 -QGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKCIGQWYGPNTVAQVLRKLAVFDQWSS 162
Query: 262 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-------------- 307
+A+++ + V +D+ R C + ++
Sbjct: 163 --------IAVHIAMDN---------TVVMDEIRRLCRAGTNESSEAGALCNGYTGVSDP 205
Query: 308 ----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
W P++LL+PL LGL +N YI TL+ F PQSLG++GG+P ++ Y +G +
Sbjct: 206 SCSLWKPLVLLIPLRLGLSDINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDEL 265
Query: 364 IYLDPHDVQ 372
IYLDPH Q
Sbjct: 266 IYLDPHTTQ 274
>gi|154300262|ref|XP_001550547.1| hypothetical protein BC1G_11320 [Botryotinia fuckeliana B05.10]
gi|166990615|sp|A6SDQ3.1|ATG4_BOTFB RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|347841273|emb|CCD55845.1| similar to cysteine protease atg4 [Botryotinia fuckeliana]
Length = 439
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 95/272 (34%), Positives = 136/272 (50%), Gaps = 47/272 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF ++I ++YR F I S+ TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A ALL R+GR WR+ + +R+ IL LF D +P+SIH ++ G A G
Sbjct: 163 GQSLLANALLTLRMGREWRRGVSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +AL+ Q + +Y+ +GD G+ V
Sbjct: 220 HPGEWFGPSATARCIQALSNSQAKSE--------LRVYI-TGD------GSDVY----ED 260
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+ S+ +D+TP L+LV LGL+K+ P Y L+ + PQS+GI GG+P +S Y
Sbjct: 261 KFMSIAKPNHSDFTPTLILVGTRLGLDKITPVYWEALKYSLQMPQSVGIAGGRPSSSHYF 320
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
+GVQE YLDPH +P + KD++E T+
Sbjct: 321 IGVQESDFFYLDPHQTRPALPY-KDNVEDYTT 351
>gi|453080987|gb|EMF09037.1| putative cysteine protease atg4 [Mycosphaerella populorum SO2202]
Length = 447
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 93/262 (35%), Positives = 126/262 (48%), Gaps = 43/262 (16%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 175
++F DF SRI I+YR GF PI S TSD GWGCM+R
Sbjct: 110 SDFIDDFESRIWITYRDGFPPIAKSTDPAAGSKMSFTTKLRSLTNQQGFTSDTGWGCMIR 169
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
S Q L+A +L HRLGR WRK ++ E+ IL LF D+ +PFSIH ++ G +A G
Sbjct: 170 SGQSLLANTILLHRLGRDWRKGQKQ---EEHKNILSLFADTPEAPFSIHKFVEHGAQACG 226
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A ARC RA T + +Y D D DA
Sbjct: 227 TYPGEWFGP-------NATARCLRALTD-KYHGAGLRVYARPNDSD---------VYADA 269
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ + P L+++ + LG+EKV Y L+ PQS+GI GG+P +S Y
Sbjct: 270 LIETATQKDADDKFQPTLIVLGIRLGIEKVTSAYHVALKAALELPQSVGIAGGRPSSSHY 329
Query: 355 IVGVQEESAIYLDPHDVQPVIN 376
+G Q +S YLDPH + +++
Sbjct: 330 FLGHQGDSFFYLDPHTTRHMLS 351
>gi|22658287|gb|AAH30861.1| Autophagy-related 4D (yeast) [Mus musculus]
gi|74152222|dbj|BAE32395.1| unnamed protein product [Mus musculus]
Length = 474
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 142/312 (45%), Gaps = 61/312 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ G + +F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQQFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGK 379
PH QP +++ +
Sbjct: 357 PHYCQPTVDVSQ 368
>gi|148691993|gb|EDL23940.1| mCG3720 [Mus musculus]
Length = 318
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 87/266 (32%), Positives = 127/266 (47%), Gaps = 49/266 (18%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 77 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 125
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 126 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 185
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 186 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 228
Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 229 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 288
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVG 357
+ F PQSLG +GGKP + Y +G
Sbjct: 289 FKECFKMPQSLGALGGKPNNAYYFIG 314
>gi|449666316|ref|XP_002168183.2| PREDICTED: cysteine protease ATG4B-like [Hydra magnipapillata]
Length = 436
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 94/309 (30%), Positives = 141/309 (45%), Gaps = 43/309 (13%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG K +D + +FN + ++ +YR+ F PIG + SD GWGC
Sbjct: 31 VWILGKHFKPDED-----------MEKFNAEILTKFWFTYRRNFHPIGGTGPMSDTGWGC 79
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQALL LGR W + + Y+ ILH F D + S +SIH + Q G
Sbjct: 80 MLRCGQMMLAQALLCRHLGRDWDWRSGRKDNEIYMMILHSFLDKKDSLYSIHQIAQMGVG 139
Query: 233 YGLAAGSWVGPYAMCRSWEALA-------------------------RCQRAETGLGCQS 267
G G W GP + + + L C+ + GC
Sbjct: 140 EGKQIGQWFGPNTVAQVIKKLVLFDDNADMAVHVAMDNTVVIEDIKKLCKSSINAWGCYG 199
Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHC-------SVFSKGQADWTPILLLVPLVLG 320
I+ S + P C ++S+ S S+ W P+LL +PL LG
Sbjct: 200 ECSYIHDRSSLTGNQSVSKPPHCSCESSQKLKSNRKLKSFNSEELQSWRPLLLFIPLRLG 259
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
L ++N Y +L++ FT QSLG++GGKP + Y +G + +YLDPH Q I +
Sbjct: 260 LSEINSDYYNSLKIMFTLRQSLGVIGGKPNHAHYFIGFNGDRLLYLDPHTTQQTIEPERF 319
Query: 381 DLEADTSTY 389
++ D S +
Sbjct: 320 NVIPDESFH 328
>gi|195444549|ref|XP_002069918.1| GK11310 [Drosophila willistoni]
gi|194166003|gb|EDW80904.1| GK11310 [Drosophila willistoni]
Length = 676
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 138/289 (47%), Gaps = 41/289 (14%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 258 AVENQVGETPWEEGIEGFRRDFYSRLWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 317
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L+ G A G
Sbjct: 318 QGLIVHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVSLGTALGK 377
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP ++ L T +++YV + I D
Sbjct: 378 KPGDWYGPASVSY---LLKHALEHATQENADFDNISVYVAKD---------CTIYIQDIE 425
Query: 296 RHCSV----------------------FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
CS+ Q W +++L+PL LG +KVNP Y L+
Sbjct: 426 DQCSIPEPAPKQTHVPWQQMKRPSLNEHQPDQQHWKSVIILIPLRLGTDKVNPAYAHCLK 485
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
L + LGI+GGKP S Y VG QE+ I+LDPH Q ++++ +++
Sbjct: 486 LLLSTENCLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQENF 534
>gi|443730776|gb|ELU16134.1| hypothetical protein CAPTEDRAFT_228011 [Capitella teleta]
Length = 450
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 104/299 (34%), Positives = 145/299 (48%), Gaps = 47/299 (15%)
Query: 111 SDIWLLGVCHKIAQDEALGDAAGNNG------LAEFNQDFSSRILISYRKGFDPIGDSKI 164
S I LLG C+ ++ E N F +DFSS+I +YRK F + S +
Sbjct: 82 SPIILLGKCYCCSKSEKEDQRRQPNNSNILTTFDRFKRDFSSKIWFTYRKDFPKLYGSPL 141
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGD--SETS 219
TSDVGWGCMLR++QM++AQAL+ H LGR W + +E + +I+ LFGD S
Sbjct: 142 TSDVGWGCMLRTAQMIIAQALVMHYLGRDWTIHHTQQNRKETMLHRQIIRLFGDFPGNDS 201
Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
PFSI L++ G +G G W GP ++ YVV
Sbjct: 202 PFSIQALVRIGVDHGKRPGDWYGPASVA-------------------------YVVRDAI 236
Query: 280 DGERGGAPV---VCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPT 331
+ P+ VC+ A C+V+ + D W +++LVP+ LG E +NP Y
Sbjct: 237 NQVPDFHPLLSQVCVYVAP-DCTVYIQDVIDLCTQHWKAVVILVPVRLGGEALNPIYSQC 295
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
++ LGI+GG+P S Y VG QEE +YLDPH Q ++ D TSTYH
Sbjct: 296 VQSLLAHELCLGIIGGRPKHSLYFVGWQEEKLLYLDPHFCQDTVDTRFRDFP--TSTYH 352
>gi|339249735|ref|XP_003373855.1| cysteine protease ATG4B [Trichinella spiralis]
gi|316969943|gb|EFV53966.1| cysteine protease ATG4B [Trichinella spiralis]
Length = 410
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 100/316 (31%), Positives = 149/316 (47%), Gaps = 56/316 (17%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
+ S ++W++G ++ Q + D ++ SR+ +YRK F PIG +
Sbjct: 28 LFKSGGEVWIVG---RVWQTQDFDD---------IKKEIRSRMWFTYRKSFSPIGGTGPI 75
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIH 224
SD GWGCMLR QML+AQAL+ LGR W+ P + D YV IL +F D + +SIH
Sbjct: 76 SDSGWGCMLRCGQMLLAQALICRHLGREWQWSPSCR--DEAYVRILRMFQDKKNELYSIH 133
Query: 225 NLLQAGKAYGLAAGSWVGP---------YAMCRSWEALA----------------RCQR- 258
+ + G++ G G W GP A+ W +LA C R
Sbjct: 134 MIAKMGESEGKEIGKWFGPSTIAHVIKKLAIYDDWSSLAVHVAMDNVIVQEDVKKLCSRE 193
Query: 259 ---AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
A Q P I V ED + V C + +S W P+LL++
Sbjct: 194 VFDALRKRLLQEEPSEI-VADWFEDARKDNKKVDCANLSS-----------PWKPLLLIL 241
Query: 316 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
P+ LGL ++NP YIP L+ F ++G++GGKP + Y +G ++ +YLDPH Q +
Sbjct: 242 PMRLGLSELNPCYIPALKEFFACKYNIGMIGGKPNHALYFIGAYKDRLVYLDPHWCQTFV 301
Query: 376 NIGKDDLEADTSTYHS 391
++ D S+YHS
Sbjct: 302 DLDVSMDLFDDSSYHS 317
>gi|29135261|ref|NP_705811.8| cysteine protease ATG4D [Mus musculus]
gi|61211815|sp|Q8BGV9.1|ATG4D_MOUSE RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
cysteine endopeptidase; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related cysteine endopeptidase
4; AltName: Full=Autophagy-related protein 4 homolog D
gi|26331508|dbj|BAC29484.1| unnamed protein product [Mus musculus]
gi|26348941|dbj|BAC38110.1| unnamed protein product [Mus musculus]
gi|27763977|emb|CAC85952.1| APG4-D protein [Mus musculus]
gi|47125055|gb|AAH69851.1| Autophagy-related 4D (yeast) [Mus musculus]
gi|148693226|gb|EDL25173.1| autophagy-related 4D (yeast), isoform CRA_b [Mus musculus]
Length = 474
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 141/312 (45%), Gaps = 61/312 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGK 379
PH QP +++ +
Sbjct: 357 PHYCQPTVDVSQ 368
>gi|340383455|ref|XP_003390233.1| PREDICTED: cysteine protease ATG4D-like [Amphimedon queenslandica]
Length = 437
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 142/303 (46%), Gaps = 46/303 (15%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
S+ S + +LG + +D + F F S ++YR GF PI S +T+D
Sbjct: 61 SNNSPVLVLGKLYIPERDTKPQSEGIPRHILMFMDHFYSLPWMTYRCGFSPILSSSLTTD 120
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR-----------KPLQKPFDREYVEILHLFGDS 216
GWGCM+RS QML+A L H LGR WR K ++ V IL FGDS
Sbjct: 121 CGWGCMVRSGQMLLATVLHLHFLGRDWRLSSSDVTGHKIHRQVKNWNNYVVLILSWFGDS 180
Query: 217 ETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIY 273
E+ PFSIH L++A +G G W GP + L R C R + IY
Sbjct: 181 ESELCPFSIHRLMEAAYYHGNKPGDWFGPSQV----SILIRDCVRRALREHINLQKLNIY 236
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW------TPILLLVPLVLGLEKVNPR 327
V S C+V+ K D +L+LVP+ LG E +NP
Sbjct: 237 V--------------------SHDCTVYIKDVQDIFESDLDQSLLVLVPVRLGSESLNPI 276
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
YIP ++ ++GI+GG+P S + +G Q+E+ I+LDPH Q +N+ + D D S
Sbjct: 277 YIPCVKALLALDHTVGIIGGRPKHSVFFIGFQDENLIHLDPHYSQTAVNMTRTDF--DVS 334
Query: 388 TYH 390
+YH
Sbjct: 335 SYH 337
>gi|26349259|dbj|BAC38269.1| unnamed protein product [Mus musculus]
Length = 474
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 141/312 (45%), Gaps = 61/312 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGK 379
PH QP +++ +
Sbjct: 357 PHYCQPTVDVSQ 368
>gi|325184648|emb|CCA19140.1| cysteine protease family C54 putative [Albugo laibachii Nc14]
Length = 459
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 100/305 (32%), Positives = 150/305 (49%), Gaps = 47/305 (15%)
Query: 107 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
S ++S +WLLG C+ QD D+ + ++ F S + +YR+ F+ + TS
Sbjct: 68 SQNSSKLWLLGDCYS-PQDFDNFDSMKD----AYHDAFESILWYTYRRDFETMVPYDFTS 122
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----LQKPFDREYVEILHLFGDSETS-- 219
D GWGCMLRS+QML+++A + LG W+ P L+ P + YV++L F DS +
Sbjct: 123 DAGWGCMLRSAQMLLSEAFKRNMLGIKWKIPARSEDLELP--KVYVKLLKWFVDSFDTEC 180
Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
+SIHN+ + G Y G W GP A+ R L Q P V+ +
Sbjct: 181 KYSIHNITRIGMQYDKLPGEWYGP-------TTAAQALRDLVNLHAQESPECNLVMYVPQ 233
Query: 280 DGERGGAPV--VCI---DDASRHCSVFSKGQADWT---------------------PILL 313
DG V +CI D + +V + Q+D T +L+
Sbjct: 234 DGVVYTKDVNELCISHLDQENTFVNVNEETQSDGTFPDPLLHPPTDRDNSEKMWQKSLLI 293
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
L+PL LGL+ +NPRY+P ++ F FPQ++GI+GGK G S Y VG + LDPHD+ P
Sbjct: 294 LIPLRLGLDSINPRYLPAIQRVFEFPQNVGIIGGKKGHSVYFVGTFDSKLQLLDPHDIHP 353
Query: 374 VINIG 378
++
Sbjct: 354 TADLN 358
>gi|195394658|ref|XP_002055959.1| GJ10670 [Drosophila virilis]
gi|194142668|gb|EDW59071.1| GJ10670 [Drosophila virilis]
Length = 672
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 98/277 (35%), Positives = 144/277 (51%), Gaps = 19/277 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + + D+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 252 AAENQMADSPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 311
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 312 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGEQLGK 371
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ +E E P
Sbjct: 372 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEEQCSIPEPAPKP 431
Query: 288 VVCIDDASRH----CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V S+ + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 432 HVPWQMTSKKPASDAPKLDQPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 491
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
I+GGKP S Y VG QE+ I+LDPH Q ++++ ++
Sbjct: 492 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQE 528
>gi|195051960|ref|XP_001993206.1| GH13687 [Drosophila grimshawi]
gi|193900265|gb|EDV99131.1| GH13687 [Drosophila grimshawi]
Length = 393
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 87/269 (32%), Positives = 136/269 (50%), Gaps = 35/269 (13%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +++WLLG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNANVWLLGKRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFVPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 223
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D+ S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTPDCRDTTYLKIVNRFEDTRKSFYSI 148
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G++ A G W+GP + + + L R SL + + + S
Sbjct: 149 HQIALMGESQNKAVGEWLGPNTVAQILKILVRFD------DWSSLNVHVAMDS------- 195
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V +DD C ++ W P+LL+VPL LG+ +NP Y+P L+ S G
Sbjct: 196 ----TVVLDDIFTLCQ--EPSESAWKPLLLIVPLRLGISDINPIYVPALKRCLELNSSCG 249
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
++GG+P + Y +G ++ +YLDPH Q
Sbjct: 250 MIGGRPNQALYFLGYVDDEVLYLDPHTTQ 278
>gi|194901010|ref|XP_001980048.1| GG20629 [Drosophila erecta]
gi|190651751|gb|EDV49006.1| GG20629 [Drosophila erecta]
Length = 708
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 98/277 (35%), Positives = 141/277 (50%), Gaps = 17/277 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 289 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 348
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 349 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 408
Query: 236 AAGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG----ERGGAP 287
G W GP Y + + E A+ + + ED E P
Sbjct: 409 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 468
Query: 288 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 469 HVPWQQAKRPQAETPKTEQQQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 528
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
GGKP S Y VG QE+ I+LDPH Q ++++ +++
Sbjct: 529 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF 565
>gi|170032510|ref|XP_001844124.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167872594|gb|EDS35977.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 628
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 103/314 (32%), Positives = 152/314 (48%), Gaps = 59/314 (18%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
+ G+ F +DF SR+ ++YRK F + DS TSD GWGCM+RS QML+AQ L+ H LGR
Sbjct: 188 DEGIEAFKRDFISRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLITHFLGRG 247
Query: 194 WR-----KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSW 240
WR + L+ FD E I+ FGD S TSPFSIH L+ GK G G W
Sbjct: 248 WRWDPSQEGLRLNFDSLQYEDGIHRKIIRWFGDTSSRTSPFSIHTLVALGKEAGKKPGDW 307
Query: 241 VGPYAMCRSW-EALARCQRAETGLGCQSLPM----AIYVVSGDEDGERGGAPVV------ 289
GP ++ +A+ + T L ++ + A+Y+ ++ P V
Sbjct: 308 YGPGSVAHLLRQAVKLAAKEITDLDGINVYVAQDCAVYIQDILDECTVSTTPSVAPWQKK 367
Query: 290 ------CIDDASR------------------HCSVF---------SKGQADWTPILLLVP 316
C D S+ H + F S + W ++LLVP
Sbjct: 368 MSSAAACTDSPSQATTPRVGATASCSSSSSPHATGFVAPSDTADESAPGSHWKSLILLVP 427
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
L LG EK+NP Y L+ + +GI+GG+P S + VG QE+ I+LDPH Q +++
Sbjct: 428 LRLGTEKLNPIYNDCLKAMLSLDNCIGIIGGRPKHSLFFVGYQEDKLIHLDPHYCQDMVD 487
Query: 377 IGKDDLEADTSTYH 390
+ +++ S++H
Sbjct: 488 VNQENFPV--SSFH 499
>gi|449268268|gb|EMC79138.1| Cysteine protease ATG4C [Columba livia]
Length = 459
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/339 (29%), Positives = 149/339 (43%), Gaps = 78/339 (23%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 155
S S ++LLG C+ DE+ G+ + G+N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVFLLGKCYHFKTDES-GELSTDGSNFDKINTEISGNVEEFRKDFISRIWLTYREE 94
Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 194
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 95 FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIDSSDSESWTAHTV 154
Query: 195 --------------RKP----------LQKPFDRE-------YVEILHLFGDSETSPFSI 223
R+P L++ +D + +I+ FGDS + F +
Sbjct: 155 KKLTASFEASLTAEREPKILSNHHRGTLKRNWDESERRNEVYHRKIISWFGDSPLTAFGL 214
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H L++ GK G AG W GP + R G + IYV
Sbjct: 215 HQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTIYVAQD------ 263
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V D R CS G+AD +++LVP+ LG E+ N Y+ ++ + +G
Sbjct: 264 --CTVYSSDVIDRQCSFMDSGEADTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVG 321
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
I+GGKP S Y G Q++S IY+DPH Q +++ D
Sbjct: 322 IIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDF 360
>gi|195570668|ref|XP_002103326.1| GD20357 [Drosophila simulans]
gi|194199253|gb|EDX12829.1| GD20357 [Drosophila simulans]
Length = 703
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 98/277 (35%), Positives = 140/277 (50%), Gaps = 17/277 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403
Query: 236 AAGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG----ERGGAP 287
G W GP Y + + E A+ + + ED E P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463
Query: 288 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 464 HVPWQQAKRPQAETPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
GGKP S Y VG QE+ I+LDPH Q ++++ +++
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF 560
>gi|345564445|gb|EGX47408.1| hypothetical protein AOL_s00083g501 [Arthrobotrys oligospora ATCC
24927]
Length = 444
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 138/261 (52%), Gaps = 43/261 (16%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-------------------ITSDVGWGCMLRSSQML 180
F DF ++ ++YR F PI S TSD GWGCM+RS Q +
Sbjct: 111 FLDDFDAKFWMTYRSAFPPIPLSTTSRNMTLATRIRSLADQEGFTSDTGWGCMIRSGQCV 170
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGS 239
+A A+ +LGR WR+ + P +E IL LF D +PFS+HN ++ G+A G+ G
Sbjct: 171 LANAISLLKLGRDWRRG-KSP--QEEQHILSLFADDPRAPFSLHNFVKYGEASCGVYPGE 227
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
W GP A R +ALA A+ G Q +Y+ +GD GG +DA R +
Sbjct: 228 WFGPSATARCIQALA----AQHDEGLQ-----VYI-TGD-----GGD---VYEDAFRKIA 269
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
+ G + P L+LV + LG+E+V P Y L+ + PQS+GI GG+P AS Y +GVQ
Sbjct: 270 ISDDGV--FHPTLVLVGIRLGIERVTPVYWEALKSSLMMPQSVGIAGGRPSASHYFIGVQ 327
Query: 360 EESAIYLDPHDVQPVINIGKD 380
+S YLDPH+ +P++ KD
Sbjct: 328 GQSLFYLDPHNTRPLLPYRKD 348
>gi|195501322|ref|XP_002097748.1| GE26385 [Drosophila yakuba]
gi|194183849|gb|EDW97460.1| GE26385 [Drosophila yakuba]
Length = 706
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 97/277 (35%), Positives = 141/277 (50%), Gaps = 17/277 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 287 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 346
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 347 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 406
Query: 236 AAGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG----ERGGAP 287
G W GP Y + + E A+ + + ED E P
Sbjct: 407 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 466
Query: 288 VVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + K + W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 467 HVPWQQAKRPQAETPKTEQHQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 526
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
GGKP S Y VG QE+ I+LDPH Q ++++ +++
Sbjct: 527 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF 563
>gi|17862242|gb|AAL39598.1| LD17482p [Drosophila melanogaster]
Length = 653
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 98/277 (35%), Positives = 141/277 (50%), Gaps = 17/277 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 234 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 293
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 294 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 353
Query: 236 AAGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG----ERGGAP 287
G W GP Y + + E A+ + + ED E P
Sbjct: 354 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 413
Query: 288 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + +K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 414 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 473
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
GGKP S Y VG QE+ I+LDPH Q ++++ +++
Sbjct: 474 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF 510
>gi|189194545|ref|XP_001933611.1| peptidase family C54 protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187979175|gb|EDU45801.1| peptidase family C54 protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 470
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 133/263 (50%), Gaps = 42/263 (15%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
N + F DF SRI ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
+RS Q ++A AL RLGR WR ++P +E+ +++ +F D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDVVAMFADDPRAPFSIHRFVEHGAAV 209
Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + L R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLVHKNR-EAGL-------KVYV-SGD------GADVY--E 252
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 353 TYIVGVQEESAIYLDPHDVQPVI 375
Y V Q + YLDPH +P++
Sbjct: 311 HYFVATQANNFFYLDPHSTRPLL 333
>gi|195328749|ref|XP_002031074.1| GM25780 [Drosophila sechellia]
gi|194120017|gb|EDW42060.1| GM25780 [Drosophila sechellia]
Length = 703
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/277 (35%), Positives = 145/277 (52%), Gaps = 17/277 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHASQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463
Query: 288 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 464 HVPWQKAKRPQAENPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
GGKP S Y VG QE+ I+LDPH Q ++++ +++
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF 560
>gi|24647125|ref|NP_650452.1| CG6194 [Drosophila melanogaster]
gi|23171357|gb|AAF55180.2| CG6194 [Drosophila melanogaster]
gi|261490735|gb|ACX83596.1| RE44406p [Drosophila melanogaster]
Length = 668
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 98/277 (35%), Positives = 141/277 (50%), Gaps = 17/277 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 249 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 308
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 309 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 368
Query: 236 AAGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG----ERGGAP 287
G W GP Y + + E A+ + + ED E P
Sbjct: 369 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 428
Query: 288 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + +K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 429 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 488
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
GGKP S Y VG QE+ I+LDPH Q ++++ +++
Sbjct: 489 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF 525
>gi|157126425|ref|XP_001660889.1| hypothetical protein AaeL_AAEL010516 [Aedes aegypti]
gi|108873276|gb|EAT37501.1| AAEL010516-PA [Aedes aegypti]
Length = 583
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 133/290 (45%), Gaps = 64/290 (22%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
+ F +DF +R+ ++YRK F + DS TSD GWGCM+RS QML+AQ LL H LGR WR
Sbjct: 167 IEAFKRDFVTRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLLVHFLGRNWRW 226
Query: 196 ----KPLQKPF------DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 243
+ L+ + D + +I+ FGD S TSPFSIH L+ GK G G W GP
Sbjct: 227 DATAESLRMNYHSLNYEDNVHRKIIRWFGDTSSRTSPFSIHTLVALGKETGKKPGDWYGP 286
Query: 244 YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC---IDDASRHCSV 300
++A R L Q + D DG C I D C+V
Sbjct: 287 -------GSVAHLLRQAVKLAAQEI--------SDLDGVNVYVAQDCAVYIQDIIDECTV 331
Query: 301 ---------------------------------FSKGQADWTPILLLVPLVLGLEKVNPR 327
+ W ++LLVPL LG EK+NP
Sbjct: 332 SAGPTLAPWQKKSPGSSSSSTTSTSNSNPTTSSSTDSTDHWKSLILLVPLRLGAEKLNPI 391
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
Y L+ + +GI+GG+P S Y VG QE+ I+LDPH Q ++++
Sbjct: 392 YSDCLKAMLSLDNCIGIIGGRPKHSLYFVGFQEDKLIHLDPHYCQDMVDV 441
>gi|330935035|ref|XP_003304808.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
gi|311318464|gb|EFQ87127.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
Length = 470
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 133/263 (50%), Gaps = 42/263 (15%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
N + F DF SRI ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
+RS Q ++A AL RLGR WR ++P +E+ +I+ +F D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDIVAMFADDPRAPFSIHRFVEHGAAV 209
Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + L + E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLVH-KNKEVGL-------KVYV-SGD------GADVY--E 252
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 353 TYIVGVQEESAIYLDPHDVQPVI 375
Y V Q + YLDPH +P++
Sbjct: 311 HYFVATQANNFFYLDPHSTRPLL 333
>gi|452004375|gb|EMD96831.1| hypothetical protein COCHEDRAFT_1123524 [Cochliobolus
heterostrophus C5]
Length = 471
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 101/259 (38%), Positives = 129/259 (49%), Gaps = 42/259 (16%)
Query: 135 NGLAEFNQDFSSRILISYRKGF-------DPIGDSKI--------------TSDVGWGCM 173
N + F DF SRI ++YR GF DP S + TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFMAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
+RS Q ++A AL RLGR WR KP +E+ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAV 209
Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + LA R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYV-SGD------GADVY--E 252
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + ++ GQ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEVAIDDDGQ--WQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 353 TYIVGVQEESAIYLDPHDV 371
Y V Q + YLDPH
Sbjct: 311 HYFVATQGNNFFYLDPHST 329
>gi|405972565|gb|EKC37327.1| Cysteine protease ATG4B [Crassostrea gigas]
Length = 405
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 124/267 (46%), Gaps = 45/267 (16%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--K 196
E DF S+I +YRK F IG + T D GWGCMLR QM++AQAL+ LGR W+ K
Sbjct: 46 ELKGDFLSKIWCTYRKNFPAIGGTGPTCDGGWGCMLRCGQMMLAQALVVRHLGRDWKWNK 105
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
Q D+ Y IL +F D +++ +SI + G + G GSW GP + + + LA
Sbjct: 106 NCQ---DQTYKRILQMFADKKSANYSIQQIASMGVSEGKPVGSWFGPNTVAQVLKKLAVY 162
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----------- 305
+ + ++ D VC DD C + Q
Sbjct: 163 DEWSS---------IVIHIAMDNTVIENDIKSVCKDDGKSTCDIIGVRQLKHESAATGRS 213
Query: 306 --------------------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
W P+LL++PL LGL ++N Y+ +L+ +FPQS+GI+
Sbjct: 214 KKSSQDSSKQDKNKQNAVDVKSWKPLLLVIPLRLGLTEINSVYVQSLKACLSFPQSVGII 273
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQ 372
GGKP + + VG + IYLDPH Q
Sbjct: 274 GGKPNHAHWFVGYMSDKLIYLDPHTTQ 300
>gi|398389911|ref|XP_003848416.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
gi|339468291|gb|EGP83392.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
Length = 440
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 89/262 (33%), Positives = 130/262 (49%), Gaps = 43/262 (16%)
Query: 138 AEFNQDFSSRILISYRKGFDPI----------------------GDSKITSDVGWGCMLR 175
++F DF SR+ ++YR F PI TSD GWGCM+R
Sbjct: 109 SQFLDDFESRVWMTYRNNFPPIQKASDPAATSNMSFATKLRSLANQGNFTSDTGWGCMIR 168
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
S Q L+A ++ RLGR WR+ + ++++ EIL +F D+ +PFSIH ++ G A G
Sbjct: 169 SGQSLLANTVVMLRLGRDWRRGQK---EKQHHEILSMFADTPEAPFSIHKFVEHGASACG 225
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A ARC RA T + + +Y D D V ID
Sbjct: 226 TYPGEWFGP-------SATARCIRALTE-KYHDVGLRVYARPNDSD--------VYIDTL 269
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ + S + ++P L+++ + LG+EKV P Y L+ PQS+GI GG+P +S Y
Sbjct: 270 TATTTQHSASET-FSPTLIVLGVRLGIEKVTPAYHAALKSILELPQSVGIAGGRPSSSHY 328
Query: 355 IVGVQEESAIYLDPHDVQPVIN 376
VG Q + YLDPH +P++
Sbjct: 329 FVGHQGDHFFYLDPHTTRPMLT 350
>gi|291414155|ref|XP_002723329.1| PREDICTED: APG4 autophagy 4 homolog D [Oryctolagus cuniculus]
Length = 408
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 95/317 (29%), Positives = 148/317 (46%), Gaps = 59/317 (18%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + VC + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 20 SRTSFSKISS----VHVCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 69
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------RKPL--- 198
+TSD GWGCMLRS QM++AQ+LL H L R W R P
Sbjct: 70 GCLTSDCGWGCMLRSGQMMLAQSLLLHFLPRDWTWAEGLGSAEPAGSASPSRYRGPARWM 129
Query: 199 ---------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
+ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 130 PPRWAQGAPELEQERRHRQIVSWFADHPGAPFGLHRLVELGQSSGKKAGDWYGP------ 183
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
+A R + + +YV + A +V D + A+W
Sbjct: 184 -SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWK 232
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH
Sbjct: 233 SVVILVPVRLGGETLNPVYVPCVKELLRLELCLGIMGGKPRHSLYFIGYQDDFLLYLDPH 292
Query: 370 DVQPVINIGKDDLEADT 386
QP +++ + D ++
Sbjct: 293 YCQPTVDVSQTDFPLES 309
>gi|194764839|ref|XP_001964535.1| GF23235 [Drosophila ananassae]
gi|190614807|gb|EDV30331.1| GF23235 [Drosophila ananassae]
Length = 668
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 98/281 (34%), Positives = 148/281 (52%), Gaps = 17/281 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 248 AVENQVGEHPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 307
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H +GR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 308 QGLICHFMGRTWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGENLGK 367
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 368 KPGDWYGPASVSYLLKHALEHAAQENADFDNISIYVAKDCTIYLQDIEDQCSVPEPAPKP 427
Query: 288 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + SK Q W +++L+PL LG +K+N Y L+L + LGI+
Sbjct: 428 NVPWQQAKRPQAEVSKTEHQQHWKALIVLIPLRLGSDKLNLAYAHCLKLLLSTEHCLGII 487
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++
Sbjct: 488 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENFSLNS 528
>gi|426339171|ref|XP_004033533.1| PREDICTED: cysteine protease ATG4B isoform 3 [Gorilla gorilla
gorilla]
Length = 379
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 125/245 (51%), Gaps = 25/245 (10%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 372 QPVIN 376
QP +
Sbjct: 269 QPAVE 273
>gi|194389756|dbj|BAG60394.1| unnamed protein product [Homo sapiens]
Length = 379
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 125/245 (51%), Gaps = 25/245 (10%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 372 QPVIN 376
QP +
Sbjct: 269 QPAVE 273
>gi|296804856|ref|XP_002843276.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
gi|238845878|gb|EEQ35540.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
Length = 473
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 99/281 (35%), Positives = 138/281 (49%), Gaps = 47/281 (16%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLRSS 177
F DF SR+ I+YR F PI + TSD GWGCM+RS
Sbjct: 138 FLDDFESRLWITYRSHFPPIPKTGGSSSSSMPLGVRLRSQLIDTQGFTSDTGWGCMIRSG 197
Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLA 236
Q L+A LLF RLGR WR+ Q ++E E+L LF D +PFSIH +Q G A G
Sbjct: 198 QSLLANTLLFLRLGRGWRRGSQ---EQEESELLSLFADHPRAPFSIHRFVQHGATACGKC 254
Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCIDDAS 295
G W GP A + +ALA G + +Y+ S G + ER + C
Sbjct: 255 PGEWFGPAAAAQCIQALAN--------GHPQAGLNVYITSDGSDIYERQFREIACR---- 302
Query: 296 RHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ G+ D P L+L+ + LG+++V P Y +L+ FPQS+GI GG+P +S Y
Sbjct: 303 ---GLGEDGEDDSIKPTLILLGVRLGIDRVTPVYWESLKEVIRFPQSVGIAGGRPSSSHY 359
Query: 355 IVGVQEESAIYLDPHDVQPVI---NIGKDDLE-ADTSTYHS 391
+ Q ++ YLDPH +P + G+D + STYH+
Sbjct: 360 FIATQGDTFFYLDPHQTRPSLPPRTAGEDVYSPGELSTYHT 400
>gi|397483837|ref|XP_003813097.1| PREDICTED: cysteine protease ATG4B isoform 4 [Pan paniscus]
Length = 379
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 125/245 (51%), Gaps = 25/245 (10%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPL 208
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 372 QPVIN 376
QP +
Sbjct: 269 QPAVE 273
>gi|195539710|gb|AAI68141.1| Atg4d protein [Rattus norvegicus]
Length = 442
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 98/319 (30%), Positives = 145/319 (45%), Gaps = 62/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 53 SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 102
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 103 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 161
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 162 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 217
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R C + + VS D V D +R S + A+
Sbjct: 218 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 264
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + + ++
Sbjct: 325 PHYCQPTVDVNQANFPLES 343
>gi|451855330|gb|EMD68622.1| hypothetical protein COCSADRAFT_79257 [Cochliobolus sativus ND90Pr]
Length = 473
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 99/259 (38%), Positives = 128/259 (49%), Gaps = 42/259 (16%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
N + F DF SRI ++YR GF I S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFTAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
+RS Q ++A AL RLGR WR KP +E+ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAV 209
Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + LA R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYV-SGD------GADVY--E 252
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + ++ G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEVAIDDDGE--WQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 353 TYIVGVQEESAIYLDPHDV 371
Y V Q + YLDPH
Sbjct: 311 HYFVATQGNNFFYLDPHST 329
>gi|210032083|ref|NP_001094483.2| autophagy-related 4D [Rattus norvegicus]
gi|149020504|gb|EDL78309.1| rCG31864, isoform CRA_b [Rattus norvegicus]
Length = 473
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 98/319 (30%), Positives = 145/319 (45%), Gaps = 62/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 248
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R C + + VS D V D +R S + A+
Sbjct: 249 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 295
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + + ++
Sbjct: 356 PHYCQPTVDVNQANFPLES 374
>gi|297669945|ref|XP_002813144.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pongo abelii]
Length = 378
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 125/245 (51%), Gaps = 25/245 (10%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 36 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 95
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 96 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 148
Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 149 -LAVHIAMDNTVVMEEIRRLCRNSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 207
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 208 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 267
Query: 372 QPVIN 376
QP +
Sbjct: 268 QPAVE 272
>gi|194213171|ref|XP_001491090.2| PREDICTED: cysteine protease ATG4D [Equus caballus]
Length = 424
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 147/320 (45%), Gaps = 62/320 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E+ GD + F +DF+SR+ ++YR+ F P+
Sbjct: 33 SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 82
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 83 GCLTSDCGWGCMLRSGQMMLAQGLLLHYLPRDWTWAEGAGLGPPEPVGLSSPNRYRGPAR 142
Query: 196 ---------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 246
P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 143 WMAPTLGPGAPPSWSRERRHRQIVSWFADHPRAPFGLHQLVELGQSSGKKAGDWYGP--- 199
Query: 247 CRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 306
+A R + + +YV + A +V D + A
Sbjct: 200 ----SLVAHILRKAVESCAEVTRLVVYVSQDCTVYKADVARLVARPDPT----------A 245
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YL
Sbjct: 246 EWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYL 305
Query: 367 DPHDVQPVINIGKDDLEADT 386
DPH QP +++ + D ++
Sbjct: 306 DPHYCQPTVDVSRADFPLES 325
>gi|355750993|gb|EHH55320.1| hypothetical protein EGM_04504, partial [Macaca fascicularis]
Length = 268
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 92/268 (34%), Positives = 133/268 (49%), Gaps = 40/268 (14%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVG 357
TL+ F PQSLG++GGKP ++ Y +G
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIG 268
>gi|353227348|emb|CCA77858.1| hypothetical protein PIIN_00505 [Piriformospora indica DSM 11827]
Length = 1257
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 95/294 (32%), Positives = 137/294 (46%), Gaps = 61/294 (20%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 164
F D++SR+ ++YR F PI D+ +
Sbjct: 317 FYSDYTSRVWLTYRNTFPPIRDTALSCLEPVASRSTHNNSSSTDISQPLPSPSKPRWPWS 376
Query: 165 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE---ILHLFGD- 215
TSD GWGCMLR+ Q L+A AL+ L R WR+P + +YV+ IL F D
Sbjct: 377 GEKGWTSDAGWGCMLRTGQSLLANALIHLHLSRSWRRPTHPSYSPDYVQYVRILTWFLDN 436
Query: 216 -SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC--------- 265
S +PF IH + AGK G GSW GP + + L + + GL
Sbjct: 437 PSPLAPFGIHRMALAGKELGKEVGSWFGPSTAAGAIKRLV-GEFEDAGLEVALAVDSVVY 495
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEK 323
QS A S +++G G + V + + +G W P+L+LV + LG++
Sbjct: 496 QSDVYAASAASRNQNGVEGDSKTVGTSKSRKKG----QGPPKWGNRPVLILVGIRLGIDG 551
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
VNP Y +++ FTFPQ++GI GG+P +S Y VG Q +S YLDPH +P I +
Sbjct: 552 VNPIYYESVKTLFTFPQTVGIAGGRPSSSYYFVGAQGDSLFYLDPHHTRPAIPL 605
>gi|148228573|ref|NP_001085611.1| cysteine protease ATG4A [Xenopus laevis]
gi|61211771|sp|Q6GPU1.1|ATG4A_XENLA RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|49115669|gb|AAH73017.1| MGC82614 protein [Xenopus laevis]
Length = 397
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 95/268 (35%), Positives = 134/268 (50%), Gaps = 22/268 (8%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR WR
Sbjct: 45 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALVCQHLGRDWRWE 104
Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 105 KHKNHPEEYQQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 164
Query: 258 RAETGLGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSKGQ----- 305
+ +A+Y VV D P C + A+ H S +S+ +
Sbjct: 165 EWNS--------LAVYVSMDNTVVVEDIKTMCKYQPQSCSMAQAASHQSTWSRCRDTSGH 216
Query: 306 -ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G + I
Sbjct: 217 CSGWRPLLLVVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEII 276
Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSE 392
YLDPH Q ++ + D TYH +
Sbjct: 277 YLDPHTTQTFVDTEEAGTVQD-QTYHCQ 303
>gi|178057055|ref|NP_001116551.1| cysteine protease ATG4D [Sus scrofa]
gi|61211337|sp|Q684M2.1|ATG4D_PIG RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related protein 4 homolog D
gi|51870495|emb|CAG15153.1| AUT-like 4, cysteine endopeptidase [Sus scrofa]
Length = 469
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 94/315 (29%), Positives = 146/315 (46%), Gaps = 57/315 (18%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------KPLQKPF----------- 202
+TSD GWGCMLRS QM++AQ LL H L R W P P
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSQGVGLGPPESSPNRYRGPAHWMPP 192
Query: 203 -----------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 HWVQAAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------S 245
Query: 252 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 311
+A R + + +YV + A +V D + A+W +
Sbjct: 246 LVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKAV 295
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH
Sbjct: 296 VILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYC 355
Query: 372 QPVINIGKDDLEADT 386
QP +++ + D ++
Sbjct: 356 QPTVDVSQADFPLES 370
>gi|449303631|gb|EMC99638.1| hypothetical protein BAUCODRAFT_344306 [Baudoinia compniacensis
UAMH 10762]
Length = 446
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/290 (35%), Positives = 145/290 (50%), Gaps = 61/290 (21%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------- 163
A++EALG AEF D +RI ++YR F PI S
Sbjct: 103 AEEEALG------WPAEFMDDMEARIWLTYRNNFPPIAKSSDPSAGSAMSFSTKLRNIGN 156
Query: 164 ---ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 220
TSD GWGCM+RS Q L+A +L +LGR WR+ QK D Y ++ LF D+ +P
Sbjct: 157 SGGFTSDAGWGCMIRSGQTLLANSLATLKLGRDWRRG-QKEDD--YKHLISLFADTPEAP 213
Query: 221 FSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
FSIH ++ G +A G G W GP A RS +AL R + GL + P +
Sbjct: 214 FSIHKFVEHGAQACGKHPGEWFGPSATARSVQALTEKYR-DVGLRVYARP---------D 263
Query: 280 DGERGGAPVVCIDDASRHCSVF-SKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRL 334
DG+ V +D S+F + GQ D + P L+++ + LG++++ P Y L+
Sbjct: 264 DGD------VYVD------SLFATAGQMDANDEFQPTLIVLGIRLGIDRITPVYHAALKA 311
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI--NIGKDDL 382
T PQS+GI GG+P +S Y VG Q ++ YLDPH + I N +DL
Sbjct: 312 TLEMPQSVGIAGGRPSSSHYFVGHQGDNFFYLDPHTTRQAIPQNPSAEDL 361
>gi|348550913|ref|XP_003461275.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like [Cavia
porcellus]
Length = 474
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/319 (30%), Positives = 148/319 (46%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKLSS-IYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWAEGPGLGSPELPGTASPSPGRSPAR 192
Query: 195 ----RKPLQKP-FDRE--YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P P ++E + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 WVPPRWPRGAPELEQELRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 248
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + +A+YV + A +V D + A+
Sbjct: 249 ---SLVAHILRKAVESSSEVTRLAVYVSQDCTVYKADVAHLVASRDPT----------AE 295
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 356 PHYCQPTVDVSQADFPLES 374
>gi|324506823|gb|ADY42901.1| Cysteine protease ATG4B [Ascaris suum]
Length = 433
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/308 (31%), Positives = 143/308 (46%), Gaps = 46/308 (14%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
+ S + ++LLG HK A GD + + E+ +SR+ +YRK F PIG + T
Sbjct: 20 VFDSNTPVYLLG--HKFP---ARGDM---DSIKEY---VTSRLWFTYRKNFMPIGGTGPT 68
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
SD GWGCMLR QML+AQAL+ LG W + +Y IL +F D + PFS+H
Sbjct: 69 SDQGWGCMLRCGQMLLAQALIVRHLGTEWMWDRDNK-EEDYKRILRMFQDKKCCPFSLHQ 127
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ---RAETGLGCQSLPMAIYVVS------ 276
+ Q G + G W GP + + L R + +L +A V +
Sbjct: 128 IAQMGVSERKQIGEWFGPNTAAQVLKKLVVYDDWSRLAVHVALDNLLIASDVRTMAHTRP 187
Query: 277 --------------GDEDGERGGAPVVCIDDASRHCSVFS-----------KGQADWTPI 311
+E G G +C + + C + S + + W P+
Sbjct: 188 PSRLSSRHTTENEQSEESGNASGGNSLCSFGSVKMCMLQSALMKECDENPVEDEEQWRPL 247
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
L++VPL LGL +N Y+P + F PQ GI+GG+P + Y +G+ E IYLDPH
Sbjct: 248 LIIVPLRLGLTSINRCYLPAIEAFFQLPQCTGIIGGRPNHALYFIGIAGEQLIYLDPHVC 307
Query: 372 QPVINIGK 379
Q I++ +
Sbjct: 308 QAAIDLDE 315
>gi|149642765|ref|NP_001092616.1| cysteine protease ATG4D [Bos taurus]
gi|148744285|gb|AAI42400.1| ATG4D protein [Bos taurus]
Length = 472
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 96/318 (30%), Positives = 147/318 (46%), Gaps = 60/318 (18%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 198
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192
Query: 199 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
Q P +R + +I+ F D +PF +H L++ G+ G AG W GP
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
+A R + + +YV + A +V D + A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
+++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355
Query: 369 HDVQPVINIGKDDLEADT 386
H QP +++ + D ++
Sbjct: 356 HYCQPTVDVSQADFPLES 373
>gi|431918972|gb|ELK17839.1| Cysteine protease ATG4D [Pteropus alecto]
Length = 442
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 94/319 (29%), Positives = 149/319 (46%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 52 SRTSFSKLSS----VHLCGRRYRFETEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 101
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR----------KP--LQKPF------- 202
+TSD GWGCMLRS QM++AQ LL H L R W +P L P+
Sbjct: 102 GYLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWVKGVGLDPPEPSRLASPYWHHGPAC 161
Query: 203 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 162 WIPPHWTQGSPELEQERRHRQIVSWFADHPKAPFGLHQLVELGQSSGKKAGDWYGP---- 217
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 218 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 264
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + + ++
Sbjct: 325 PHYCQPTVDVSQANFPLES 343
>gi|296485832|tpg|DAA27947.1| TPA: APG4 autophagy 4 homolog D [Bos taurus]
Length = 472
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 96/318 (30%), Positives = 147/318 (46%), Gaps = 60/318 (18%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 198
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192
Query: 199 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
Q P +R + +I+ F D +PF +H L++ G+ G AG W GP
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
+A R + + +YV + A +V D + A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
+++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355
Query: 369 HDVQPVINIGKDDLEADT 386
H QP +++ + D ++
Sbjct: 356 HYCQPTVDVSQADFPLES 373
>gi|326925485|ref|XP_003208945.1| PREDICTED: cysteine protease ATG4C-like [Meleagris gallopavo]
Length = 458
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 98/337 (29%), Positives = 141/337 (41%), Gaps = 75/337 (22%)
Query: 108 SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 156
S S ++LLG C+ DE+ L N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155
Query: 195 -------------RKPLQKPFDREYV-----------EILH-----LFGDSETSPFSIHN 225
R+P +E + E+ H FGDS + F +H
Sbjct: 156 KLTASLEASLTAEREPRILSNHQERIRRNCGDGEMRDEVYHRKIISWFGDSPLAAFGLHQ 215
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
L++ GK G AG W GP + R G + +YV
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQ--------D 262
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V D R CS G+ D +++LVP+ LG E+ N Y+ ++ + +GI+
Sbjct: 263 CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGII 322
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
GGKP S Y G Q++S IY+DPH Q +++ D
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDF 359
>gi|410950450|ref|XP_003981918.1| PREDICTED: cysteine protease ATG4D, partial [Felis catus]
Length = 423
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 150/321 (46%), Gaps = 65/321 (20%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E+ GD + F +DF SR+ ++YR+ F P+
Sbjct: 33 SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 82
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 83 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSEASGLGPSEPSGLASPNRYRGPAR 142
Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 143 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 198
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 199 -----SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT---------- 243
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 244 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 303
Query: 366 LDPHDVQPVINIGKDDLEADT 386
LDPH QP +++ + D ++
Sbjct: 304 LDPHYCQPTVDVSQADFPLES 324
>gi|351710014|gb|EHB12933.1| Cysteine protease ATG4D [Heterocephalus glaber]
Length = 607
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 94/319 (29%), Positives = 146/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 216 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 265
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 266 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWIEGPGLAHPELPGSASSSQGRGPAR 325
Query: 198 ----------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L++ + + +I+ F D +P +H L++ G++ G AG W GP
Sbjct: 326 WMPPSCPWGALEREQELRHRQIVSWFADHPRAPLGLHRLVELGQSSGKKAGDWYGP---- 381
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + +A+YV + A +V D + A+
Sbjct: 382 ---SLVAHILRKAVESSSELTHLAVYVSQDCTVYKADVAHLVASPDPA----------AE 428
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 429 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 488
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 489 PHYCQPTVDVSQADFSLES 507
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/93 (41%), Positives = 52/93 (55%), Gaps = 10/93 (10%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 130 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 179
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 180 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGW 212
>gi|389750681|gb|EIM91754.1| hypothetical protein STEHIDRAFT_88418 [Stereum hirsutum FP-91666
SS1]
Length = 1286
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 137/288 (47%), Gaps = 54/288 (18%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKIT---------------------------- 165
NN F DF+SR+ ++YR F PI DS +T
Sbjct: 333 NNWPPVFYSDFTSRVWLTYRSHFQPIRDSTLTALESEQANMAHAGPVIMASSPPTKKWGW 392
Query: 166 ---------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLF 213
SD GWGCMLR+ Q L+A AL+ LGR WR+P + +Y V++L F
Sbjct: 393 PGSGEKGWTSDAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQMLTWF 452
Query: 214 GDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
DS T PFS+H + AGK G G W GP + + L E GLG ++
Sbjct: 453 FDSPTPHCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLG-----VS 506
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYI 329
I S + A +D S + S G+A +L+L+ + LGL+ VNP Y
Sbjct: 507 IASDSQIFQSDVFAASHPPMDSPSSKKKLASTWGGRA----VLVLIGIRLGLDGVNPIYY 562
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
T++ +TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + +
Sbjct: 563 ETIKALYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPL 610
>gi|301772016|ref|XP_002921445.1| PREDICTED: cysteine protease ATG4D-like [Ailuropoda melanoleuca]
Length = 445
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 149/321 (46%), Gaps = 65/321 (20%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 55 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 104
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 105 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSAPSPSEPSGLASPNRYRGPAR 164
Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 165 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 220
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 221 -----SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT---------- 265
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 266 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 325
Query: 366 LDPHDVQPVINIGKDDLEADT 386
LDPH QP +++ + D ++
Sbjct: 326 LDPHYCQPTVDVSQADFPLES 346
>gi|390177147|ref|XP_001357920.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
gi|388858923|gb|EAL27056.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
Length = 676
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 98/277 (35%), Positives = 141/277 (50%), Gaps = 19/277 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310
Query: 183 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR L + D + +I+ FGD S++SPFSIH L++ G+ G
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370
Query: 236 AAGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG----ERGGAP 287
G W GP Y + + E A+ + + ED E P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430
Query: 288 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V A R + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
I+GGKP S Y VG QE+ I+LDPH Q +++I ++
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQE 527
>gi|195158262|ref|XP_002020011.1| GL13755 [Drosophila persimilis]
gi|194116780|gb|EDW38823.1| GL13755 [Drosophila persimilis]
Length = 678
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 98/277 (35%), Positives = 141/277 (50%), Gaps = 19/277 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310
Query: 183 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR L + D + +I+ FGD S++SPFSIH L++ G+ G
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370
Query: 236 AAGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG----ERGGAP 287
G W GP Y + + E A+ + + ED E P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430
Query: 288 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V A R + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
I+GGKP S Y VG QE+ I+LDPH Q +++I ++
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQE 527
>gi|395850895|ref|XP_003798008.1| PREDICTED: cysteine protease ATG4D [Otolemur garnettii]
Length = 471
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 134/286 (46%), Gaps = 49/286 (17%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
G + F +DF SR+ +YR+ F P+ +TSD GWGCMLRS QM++AQ LL H L R
Sbjct: 104 GEGDIQRFQRDFVSRLWFTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 163
Query: 193 PW--------------------RKPLQKPFDR------------EYVEILHLFGDSETSP 220
W R P + R ++ +I+ F D +P
Sbjct: 164 DWTWAEGRGLGPPELLASPSQYRVPARWMPPRWAQGTPELEQEHQHRQIVSWFADHPQAP 223
Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
FS+H L++ G++ G AG W GP +A R + + +YV
Sbjct: 224 FSLHRLVELGQSLGKKAGDWYGP-------SVVAHILRKAVESCSEVTHLVVYVSQDCTV 276
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
+ A +V D + A+W +++LVP+ LG E +NP Y+P ++
Sbjct: 277 YKADVARLVARPDPT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSEL 326
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
LGI+GGKP S Y +G Q++ +YLDPH QP ++I + D ++
Sbjct: 327 CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDISQADFPLES 372
>gi|281337397|gb|EFB12981.1| hypothetical protein PANDA_010312 [Ailuropoda melanoleuca]
Length = 428
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 87/290 (30%), Positives = 136/290 (46%), Gaps = 55/290 (18%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
G + F +DF SR+ ++YR+ F P+ +TSD GWGCMLRS QM++AQ LL H L R
Sbjct: 59 GEGDIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 118
Query: 193 PW----------------------RKP--------------LQKPFDREYVEILHLFGDS 216
W R P L++ +R + +I+ F D
Sbjct: 119 DWTWAEGSAPSPSEPSGLASPNRYRGPARWMPPRWAQGTPELEQ--ERRHRQIVSWFADH 176
Query: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS 276
+PF +H L++ G++ G AG W GP +A R + + +YV
Sbjct: 177 PQAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILRKAVESCSEVTHLVVYVSQ 229
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
+ A +V D + A+W +++LVP+ LG E +NP Y+P ++
Sbjct: 230 DCTVYKADVARLVARPDPT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELL 279
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
LGI+GGKP S Y +G Q++ +YLDPH QP +++ + D ++
Sbjct: 280 RSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLES 329
>gi|432099562|gb|ELK28703.1| Cysteine protease ATG4D, partial [Myotis davidii]
Length = 392
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 148/312 (47%), Gaps = 61/312 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + E+ GD + F +DF+SR+ ++YR+ F P+
Sbjct: 5 SRTSFSKISS----VHLCGRRYCFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 54
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 55 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGAGLSPPEPSGLASPNRHHGLAH 114
Query: 196 -KPLQ-----KPFDREYV--EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
KP + ++E+ +I+ F D +PF +H L++ G+++G AG W GP
Sbjct: 115 WKPPRWAQGAPELEQEHWHRQIVSWFADHPQAPFGLHQLVELGQSWGKKAGDWYGP---- 170
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 171 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDCT----------AE 217
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++S +YLD
Sbjct: 218 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDSLLYLD 277
Query: 368 PHDVQPVINIGK 379
PH QP +++ +
Sbjct: 278 PHYCQPTVDVSQ 289
>gi|320169048|gb|EFW45947.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 918
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 105/320 (32%), Positives = 152/320 (47%), Gaps = 37/320 (11%)
Query: 107 SSSTSDIWLLGVCHKIAQDEALGDAAGNNG-----LAEFNQDFSSRILISYRKGFDPIGD 161
S S S IW+LG C+ + E G + + +F DF + + SYRK F+ I
Sbjct: 260 SISDSPIWMLGNCYSGKELECNGHTENKHNKRSRHICKFFADFQTLVCFSYRKDFERIPG 319
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQKPFDREYVEILHLFG 214
SK T+D GWGC LRS+QMLVA+AL+ GR WR PL + + I+ LF
Sbjct: 320 SKHTTDCGWGCTLRSAQMLVAEALVLQIFGRRWRIEDRSCPAPLSSSKEDQLRLIIRLFQ 379
Query: 215 DS--ETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
D SPFSIHN++Q G + + AG W GP ++ R + L A ++
Sbjct: 380 DQLRLDSPFSIHNIVQHGCQLFDKRAGDWFGPASVVRVFADLINQAYAMHQSPFRAYQAI 439
Query: 272 IYVVSGDEDGERGGAPVVCID-DASRHCSVFSKGQADWT-------------------PI 311
+++ D E P D + S S D T P+
Sbjct: 440 DHIIYRDLVAELCSGPDAVRDLEFSTPTSTSESVSTDETVTPSASTSQSPPVLPPPFIPL 499
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
L+L+PL LGL ++N YIP L+ Q +GI+GG+P S Y VG QE++ I+ DPH
Sbjct: 500 LILMPLRLGLNEINRMYIPCLKALLMCAQCVGIIGGRPRHSLYFVGYQEDNVIFADPHGC 559
Query: 372 QPVINIGKDDLEADTSTYHS 391
+ +++ + T T+HS
Sbjct: 560 KRFVDMQQTSFP--TETFHS 577
>gi|254567087|ref|XP_002490654.1| Conserved cysteine protease required for autophagy [Komagataella
pastoris GS115]
gi|238030450|emb|CAY68374.1| Conserved cysteine protease required for autophagy [Komagataella
pastoris GS115]
Length = 531
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 124/265 (46%), Gaps = 49/265 (18%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 178
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 237
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 295 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
+ G +D +TPIL+L+ + LG+EKVNP Y +LR + QS+GI GG+P +S
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281
Query: 354 YIVGVQEESAIYLDPHDVQPVINIG 378
Y G Q + YLDPH Q + G
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFG 306
>gi|395512609|ref|XP_003760528.1| PREDICTED: cysteine protease ATG4D [Sarcophilus harrisii]
Length = 453
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 133/288 (46%), Gaps = 51/288 (17%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
G + F +DF SR+ ++YR+ F P+ +TSD GWGCMLRS QML+AQ LL H R
Sbjct: 84 GEGDIQRFQRDFVSRLWLTYRRDFPPLEGGSLTSDCGWGCMLRSGQMLLAQGLLLHFFSR 143
Query: 193 PW-----------RKPL---------------------QKPFDRE--YVEILHLFGDSET 218
W R+P + F++E + I+ F D
Sbjct: 144 DWTWSEAVLHPGPREPELLRTMSPSRVGPPGPPAGALSPREFEQEEQHRRIVSWFADQPG 203
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 278
+PF +H L++ G++ G AG W GP +A R + + +YV
Sbjct: 204 APFGLHRLVELGRSSGKRAGDWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDC 256
Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+ A +V D S +W I++LVP+ LG E +NP Y+P ++
Sbjct: 257 TVYKADVAQLVAQPDPS----------TEWKSIVILVPVRLGGETLNPVYVPCVKELLRL 306
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GI+GGKP S Y +G Q++ +YLDPH QP ++ ++ ++
Sbjct: 307 ELCIGIIGGKPRHSLYFIGYQDDFLLYLDPHYCQPFVDTSQESFPLES 354
>gi|328351041|emb|CCA37441.1| autophagy-related protein 4 [Komagataella pastoris CBS 7435]
Length = 758
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 124/265 (46%), Gaps = 49/265 (18%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 178
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 237
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 295 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
+ G +D +TPIL+L+ + LG+EKVNP Y +LR + QS+GI GG+P +S
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281
Query: 354 YIVGVQEESAIYLDPHDVQPVINIG 378
Y G Q + YLDPH Q + G
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFG 306
>gi|426218487|ref|XP_004003478.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4B [Ovis
aries]
Length = 454
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 98/296 (33%), Positives = 135/296 (45%), Gaps = 43/296 (14%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 69 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 115
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +
Sbjct: 116 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYCRVPP--------------- 160
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA A + L V++ R G
Sbjct: 161 -QMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-AWSALAVHVAMDNTVVMADVRRLCRSGL 218
Query: 287 PVVCID----DASRHCSVFSKG------QADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
P + D+ RHC+ F G A W P++LL+PL LGL VN Y TL+ F
Sbjct: 219 PCAGAEAFPADSERHCNGFPAGAEGGECTAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 278
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
PQSLG++GGKP ++ Y +G E IYLDPH QP + D ++H +
Sbjct: 279 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV-AAADRCPVPDESFHCQ 333
>gi|449508713|ref|XP_002198788.2| PREDICTED: cysteine protease ATG4C [Taeniopygia guttata]
Length = 456
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 98/336 (29%), Positives = 147/336 (43%), Gaps = 75/336 (22%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 155
S S ++LLG C+ +E+ G+ + G+N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVFLLGKCYHFKTEES-GELSTDGSNFDKISTEISGNVEEFRKDFISRIWLTYREE 94
Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 194
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 95 FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPEALDMESCDWESWTSSTV 154
Query: 195 ---------------------RKPLQKPFD----REYV---EILHLFGDSETSPFSIHNL 226
R P ++ +D R V +I+ FGDS + F +H L
Sbjct: 155 RKLTASLEASLTAERDPKVLARPPARRDWDGTEKRNEVYHRKIISWFGDSPLAAFGLHQL 214
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
++ GK G AG W GP + R G + +YV
Sbjct: 215 IEYGKKSGKMAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD--------C 261
Query: 287 PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
V D R CS+ G+A +++L P+ LG E+ N Y+ ++ + +GI+G
Sbjct: 262 TVYSSDVIDRQCSLVDSGKAGTKAVIILFPVRLGGERTNTDYLEFVKGILSLEYCVGIIG 321
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
GKP S Y G Q++S IY+DPH Q +++ D
Sbjct: 322 GKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDF 357
>gi|348529755|ref|XP_003452378.1| PREDICTED: cysteine protease ATG4C-like [Oreochromis niloticus]
Length = 478
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 153/358 (42%), Gaps = 82/358 (22%)
Query: 108 SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLA-----EFNQDFSSRILISYRKGFDPIGD 161
S S + LLG C H A+DE A L F +DF+SR+ ++YR+ F P+
Sbjct: 36 SRNSPVLLLGKCYHFKAEDEESPTEASVEDLVMGDVDAFRRDFASRVWLTYREEFSPLPG 95
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE------------- 205
S +TSD GWGCMLR+ QM++AQ L+ H LGR W + L +P D E
Sbjct: 96 STLTSDCGWGCMLRAGQMMLAQGLMLHFLGRDWTWSEALTLQPLDTETWTTTAAKRLVAS 155
Query: 206 ---------------------------------------YVEILHLFGDSETSPFSIHNL 226
+ ++ FGDS ++P +H L
Sbjct: 156 LEASLQGVPGPSVRSSSPQAQALSLGSAEEADAHLKEMYHRTLVSWFGDSPSTPLGLHRL 215
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS--LPMAIYVVSGD------ 278
++ G G AG W GP + + + + GL C + + V S D
Sbjct: 216 VRLGLTMGKQAGDWYGPAVVAHILKKAVE-EAMDPGLACITAYVSQDCTVYSADVVDCHR 274
Query: 279 ------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
E AP + +D H S + +A +++LVP+ LG EK NP Y
Sbjct: 275 APRAERTSDETPDAPTLPQNDQPAHASTLPESRA----VIILVPVRLGGEKTNPEYFDFA 330
Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+ + +GI+GGKP + Y VG Q++S IY+DPH Q +++ D +YH
Sbjct: 331 KSILSLEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSYH 386
>gi|57101974|ref|XP_542069.1| PREDICTED: cysteine protease ATG4D isoform 1 [Canis lupus
familiaris]
Length = 473
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 149/321 (46%), Gaps = 65/321 (20%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGPGLGPSEPAGLASPNRYRGPAR 192
Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 248
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 249 -----SLVAHILRKAVESCSEITRLVVYVSQDCTVYKADVARLVARPDPT---------- 293
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 294 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 353
Query: 366 LDPHDVQPVINIGKDDLEADT 386
LDPH QP +++ + D ++
Sbjct: 354 LDPHYCQPTVDVSQADFPLES 374
>gi|406862068|gb|EKD15120.1| putative cysteine protease atg4 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 441
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 98/269 (36%), Positives = 132/269 (49%), Gaps = 48/269 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF S+I ++YR F I S+ TSD GWGCM+RS
Sbjct: 103 FLDDFESKIWLTYRSQFPAIPKSQDPKALSSMSLSVRLRSQLVDQAGFTSDTGWGCMIRS 162
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A AL+ R+GR WR+ +E I+ LF D+ T+P+SIHN ++ G A G
Sbjct: 163 GQSLLANALVMLRMGRDWRR--GSSASQEERSIISLFADTPTAPYSIHNFVEHGAAACGK 220
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGDEDGERGGAPVVCIDDA 294
G W GP A R +ALA G QS + +YV G E E + D
Sbjct: 221 HPGEWFGPSATARCIQALAN--------GHQSPELRVYVTGDGLEVYEDSFMKIAKPD-- 270
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
GQA + P L+LV LGL+K+ P Y L+ + PQSLGI GG+P +S Y
Sbjct: 271 ---------GQA-FIPTLILVGTRLGLDKITPVYWEALKSSLQIPQSLGIAGGQPSSSHY 320
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+GVQ YLDPH +P + + D++E
Sbjct: 321 FIGVQGHHFFYLDPHQTRPALPL-PDNIE 348
>gi|417401539|gb|JAA47652.1| Putative cysteine protease required for autophagy [Desmodus
rotundus]
Length = 473
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 148/321 (46%), Gaps = 65/321 (20%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + E+ GD + F +DF SR+ ++YR+ F P
Sbjct: 83 SRTRFSKISS----VHLCGRRYCFESEGD------IQRFQRDFVSRLWLTYRRDFPPFAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWARGASLSPPEPSGLASSNRYRGPAH 192
Query: 195 --------RKP-LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
R P L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 CMTPCWAQRAPELEQ--ERRHRQIVSWFADHPQAPFGLHQLVELGQSSGKKAGDWYGP-- 248
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 249 -----SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT---------- 293
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 294 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 353
Query: 366 LDPHDVQPVINIGKDDLEADT 386
LDPH QP +++ + D ++
Sbjct: 354 LDPHYCQPAVDVSQADFPLES 374
>gi|296232881|ref|XP_002761778.1| PREDICTED: cysteine protease ATG4D [Callithrix jacchus]
Length = 474
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 94/319 (29%), Positives = 146/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPF---------- 202
+TSD GWGCMLRS QM++AQ LL H L R W L P
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGPASPSWYHGPAR 193
Query: 203 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPCWAQGAPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D S A+
Sbjct: 250 ---SLVAHILRKAVESSSEVTRLLVYVSQDCTVYKADVARLVARPDPS----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WNSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + + ++
Sbjct: 357 PHYCQPTVDVSQANFPLES 375
>gi|340709295|ref|XP_003393246.1| PREDICTED: cysteine protease ATG4B-like isoform 1 [Bombus
terrestris]
Length = 383
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/265 (36%), Positives = 140/265 (52%), Gaps = 16/265 (6%)
Query: 135 NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 189
N + E + +D S++ +YRK F PIG +S TSD GWGCMLR QM++ QAL+
Sbjct: 31 NAIRELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILH 90
Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
LGR W+ + + Y++IL F D T+ FSIH + G + G G W GP + +
Sbjct: 91 LGRDWQWTAETR-NSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQV 149
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
+ L + +L + V + G V D A V K + W
Sbjct: 150 LKKLVVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGA-----VPLKAPSQWK 204
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+LLL+PL LGL ++NP YI L+ +F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 205 PLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLGVIGGKPNLALYFIGCVENEVIYLDPH 264
Query: 370 DVQPVINIGK----DDLEADTSTYH 390
Q ++GK +++E D +TYH
Sbjct: 265 TTQRSGSVGKKLEEEEIEMD-ATYH 288
>gi|402904208|ref|XP_003914939.1| PREDICTED: cysteine protease ATG4D isoform 2 [Papio anubis]
Length = 411
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 146/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 130
Query: 195 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 294 PHYCQPTVDVSQADFPLES 312
>gi|350425106|ref|XP_003494013.1| PREDICTED: cysteine protease ATG4B-like [Bombus impatiens]
Length = 383
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/265 (36%), Positives = 140/265 (52%), Gaps = 16/265 (6%)
Query: 135 NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 189
N + E + +D S++ +YRK F PIG +S TSD GWGCMLR QM++ QAL+
Sbjct: 31 NAIRELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILH 90
Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
LGR W+ + + Y++IL F D T+ FSIH + G + G G W GP + +
Sbjct: 91 LGRDWQWTAETR-NSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQV 149
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
+ L + +L + V + G V D A V K + W
Sbjct: 150 LKKLVVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGA-----VPLKAPSQWK 204
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+LLL+PL LGL ++NP YI L+ +F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 205 PLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLGVIGGKPNLALYFIGCVENEVIYLDPH 264
Query: 370 DVQPVINIGK----DDLEADTSTYH 390
Q ++GK +++E D +TYH
Sbjct: 265 TTQRSGSVGKKLEEEEIEMD-ATYH 288
>gi|397476492|ref|XP_003809633.1| PREDICTED: cysteine protease ATG4D isoform 2 [Pan paniscus]
Length = 411
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 146/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 195 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 294 PHYCQPTVDVSQADFPLES 312
>gi|397476490|ref|XP_003809632.1| PREDICTED: cysteine protease ATG4D isoform 1 [Pan paniscus]
Length = 474
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 146/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 195 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 357 PHYCQPTVDVSQADFPLES 375
>gi|344282757|ref|XP_003413139.1| PREDICTED: cysteine protease ATG4D-like [Loxodonta africana]
Length = 473
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 96/319 (30%), Positives = 149/319 (46%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S ++L G ++ E+ GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSK-ISSVYLCGHRYRF---ESEGD------IQRFQRDFMSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPLQ 199
+TSD GWGCMLRS QML+AQ LL H L R W R P +
Sbjct: 133 GCLTSDCGWGCMLRSGQMLLAQGLLLHFLPRDWTWAEGSGLGPPELSGSASPSRYRGPAR 192
Query: 200 K----------PFDREY--VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ ++E+ +I+ F D +PF +H L+ G++ G AG W GP
Sbjct: 193 RVPPHWAQCTPELEQEHWHRQIVSWFADHPQAPFGLHRLVALGQSSGKKAGDWYGP---- 248
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D +A+
Sbjct: 249 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDP----------KAE 295
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 356 PHYCQPSVDVSQADFSLES 374
>gi|340709297|ref|XP_003393247.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Bombus
terrestris]
Length = 386
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/265 (36%), Positives = 140/265 (52%), Gaps = 16/265 (6%)
Query: 135 NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 189
N + E + +D S++ +YRK F PIG +S TSD GWGCMLR QM++ QAL+
Sbjct: 34 NAIRELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILH 93
Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
LGR W+ + + Y++IL F D T+ FSIH + G + G G W GP + +
Sbjct: 94 LGRDWQWTAETR-NSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQV 152
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
+ L + +L + V + G V D A V K + W
Sbjct: 153 LKKLVVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGA-----VPLKAPSQWK 207
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+LLL+PL LGL ++NP YI L+ +F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 208 PLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLGVIGGKPNLALYFIGCVENEVIYLDPH 267
Query: 370 DVQPVINIGK----DDLEADTSTYH 390
Q ++GK +++E D +TYH
Sbjct: 268 TTQRSGSVGKKLEEEEIEMD-ATYH 291
>gi|166990662|sp|A7F045.2|ATG4_SCLS1 RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
Length = 439
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 94/268 (35%), Positives = 131/268 (48%), Gaps = 47/268 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF ++I ++YR F I S+ TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A ALL R+GR WR+ +R+ IL LF D +P+SIH ++ G A G
Sbjct: 163 GQSLLANALLTLRMGREWRRGSSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A ARC +A T +S + +Y+ +GD G+ V
Sbjct: 220 HPGEWFGP-------SAAARCIQALTNSQVES-ELRVYI-TGD------GSDVY----ED 260
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
S+ +TP L+LV LGL+K+ P Y L+ + PQS+GI GG+P +S Y
Sbjct: 261 TFMSIAKPNSTKFTPTLILVGTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYF 320
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLE 383
+GVQE YLDPH +P + D++E
Sbjct: 321 IGVQESDFFYLDPHQTRPALPF-NDNVE 347
>gi|402904206|ref|XP_003914938.1| PREDICTED: cysteine protease ATG4D isoform 1 [Papio anubis]
Length = 474
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 146/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193
Query: 195 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 357 PHYCQPTVDVSQADFPLES 375
>gi|410226434|gb|JAA10436.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410263516|gb|JAA19724.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410294648|gb|JAA25924.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410328737|gb|JAA33315.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
Length = 474
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 146/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 195 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 357 PHYCQPTVDVSQADFPLES 375
>gi|109123366|ref|XP_001101860.1| PREDICTED: cysteine protease ATG4D-like isoform 1 [Macaca mulatta]
Length = 474
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 146/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193
Query: 195 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 357 PHYCQPTVDVSQADFPLES 375
>gi|114675367|ref|XP_512373.2| PREDICTED: cysteine protease ATG4D [Pan troglodytes]
Length = 411
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 93/321 (28%), Positives = 147/321 (45%), Gaps = 65/321 (20%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 196 ----------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-- 186
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 187 -----SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT---------- 231
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 232 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 291
Query: 366 LDPHDVQPVINIGKDDLEADT 386
LDPH QP +++ + D ++
Sbjct: 292 LDPHYCQPTVDVSQADFPLES 312
>gi|380796527|gb|AFE70139.1| cysteine protease ATG4D, partial [Macaca mulatta]
Length = 439
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 146/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 49 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 98
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 99 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 158
Query: 195 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 159 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 214
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 215 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 261
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 262 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 321
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 322 PHYCQPTVDVSQADFPLES 340
>gi|315047608|ref|XP_003173179.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
gi|311343565|gb|EFR02768.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
Length = 471
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 138/288 (47%), Gaps = 58/288 (20%)
Query: 139 EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 174
+F DF SR+ I+YR F PI DS + TSD GWGCM+
Sbjct: 136 QFLDDFESRLWITYRSQFPPIPKMPKTGSSDSSMPLGVRLRSQLIDTQGFTSDTGWGCMI 195
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH +Q G A
Sbjct: 196 RSGQALLANTLLFLRLGRDWRRGSKI---QEESELVSLFADHPRAPFSIHRFVQHGATAC 252
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 292
G G W GP A + +AL + + GL +YV + G + ER V C +
Sbjct: 253 GKCPGEWFGPSAAAQCIQALVKSN-PQAGL-------RVYVTNDGSDIYERQFREVACDE 304
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
S P L+L+ + LG+++V P Y +L+ +PQS+GI GG+P +S
Sbjct: 305 SGS------------IKPTLILLGVRLGIDRVTPIYWDSLKALLHYPQSVGIAGGRPSSS 352
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---------ADTSTYHS 391
Y + Q +S YLDPH +P + + E + STYH+
Sbjct: 353 HYFIATQGDSFFYLDPHQTRPCLAPRSEPTEDEESHPYSPEELSTYHT 400
>gi|297276108|ref|XP_002801111.1| PREDICTED: cysteine protease ATG4D-like isoform 2 [Macaca mulatta]
Length = 497
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 146/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 107 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 156
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 157 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 216
Query: 195 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 217 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 272
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 273 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 319
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 320 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 379
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 380 PHYCQPTVDVSQADFPLES 398
>gi|395840680|ref|XP_003793181.1| PREDICTED: cysteine protease ATG4C [Otolemur garnettii]
Length = 457
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 102/342 (29%), Positives = 147/342 (42%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENEMLSARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPGALNIENSDSESWTSHTVK 155
Query: 198 ---------------LQKP-------------FDREYVEILH-----LFGDSETSPFSIH 224
L+ P + EI H FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETMRKYSDYHETRNEIYHRKIVSWFGDSPLAFFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + + G + IYV +D
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEAKHPDLQG-----ITIYVA---QDCTVY 267
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
+ V+ ASR S+G D +++LVP+ LG E+ NP Y+ ++ + +GI
Sbjct: 268 NSDVIDTQSASRT----SEGAED-KAVIILVPVRLGGERTNPDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH QP +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQPFVDVSVKDFPLET 364
>gi|327306465|ref|XP_003237924.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
gi|326460922|gb|EGD86375.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
Length = 454
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 92/264 (34%), Positives = 129/264 (48%), Gaps = 49/264 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 174
+F DF S++ I+YR F PI GDS I TSD GWGCM+
Sbjct: 119 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSISLGVRLRSQLIDTQGFTSDTGWGCMI 178
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G A
Sbjct: 179 RSGQALLANTLLFIRLGRDWRRGSKL---QEESELVSLFADHPRAPFSIHRFVHHGATAC 235
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 292
G G W GP A + +AL + + GL +Y+ S G + E+ V C +
Sbjct: 236 GKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVACDE 287
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
P L+L+ + LG+++V P Y +L+ FPQS+GI GG+P +S
Sbjct: 288 SGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGRPSSS 335
Query: 353 TYIVGVQEESAIYLDPHDVQPVIN 376
Y + Q +S YLDPH +P +
Sbjct: 336 HYFIATQGDSFFYLDPHQTRPCLT 359
>gi|341885317|gb|EGT41252.1| hypothetical protein CAEBREN_15768 [Caenorhabditis brenneri]
Length = 457
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 86/264 (32%), Positives = 129/264 (48%), Gaps = 29/264 (10%)
Query: 127 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 185
ALG + +G+ + SSR +YRK F PIG + TSD GWGCMLR +QML+ + L
Sbjct: 34 ALGKEITEEDGIEAMKKYMSSRFWFTYRKDFSPIGGTGPTSDQGWGCMLRCAQMLLGEVL 93
Query: 186 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-- 243
L +GR + ++ Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 94 LRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIHQIAQMGVTEGKEISKWFGPNT 152
Query: 244 -------YAMCRSWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 289
+ W +A + L + +L MA S D E+G+
Sbjct: 153 AAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTYPSEDAVKLIMENGQ------- 205
Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
+H + + + +W P+LL++PL LGL +N Y+P ++ F PQ +GI+GGKP
Sbjct: 206 ----VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCYLPAIQEFFKLPQCVGIIGGKP 261
Query: 350 GASTYIVGVQEESAIYLDPHDVQP 373
+ Y VG+ YLDPH +P
Sbjct: 262 NLAHYFVGIAGTKLFYLDPHYCRP 285
>gi|209969827|ref|NP_001123274.2| autophagy-specific gene 4 [Nasonia vitripennis]
Length = 405
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 102/294 (34%), Positives = 146/294 (49%), Gaps = 26/294 (8%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD--SK 163
I + + +W+LG + +D + +D SR+ +YRKGF PIG S
Sbjct: 46 IPQTENSVWVLGKKYNAKKD-----------IDAIRRDIRSRLWFTYRKGFVPIGGFGST 94
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPF 221
TSD GWGCMLR QM++ QAL+ LGR WR P R Y+ IL F D +P+
Sbjct: 95 FTSDKGWGCMLRCGQMVLGQALISLHLGRDWR---WTPETRSSTYLNILRRFEDRRAAPY 151
Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
SIH + G + G G W GP + + + L + +L + V +
Sbjct: 152 SIHQIALMGASEGKDVGQWFGPNTIAQVLKKLVVYDDWSSITIHVALDNTLVVNDVVQQC 211
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
GA +D K + W P+LLL+PL LGL ++NP YI L+ +F FPQS
Sbjct: 212 RVEGATTAEVDGEKPL-----KAPSQWKPLLLLIPLRLGLNEINPIYINGLKTSFQFPQS 266
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV--INIGKDDLEADT-STYHSE 392
LG++GGKP + Y +G + I+LDPH Q ++ DD EA+ +TYH +
Sbjct: 267 LGLIGGKPSHALYFIGYVGDEVIFLDPHTTQRAGSVDQKSDDNEAEVDATYHCK 320
>gi|431822417|ref|NP_001258916.1| cysteine protease ATG4A isoform 2 [Gallus gallus]
gi|61211756|sp|Q5ZIW7.1|ATG4A_CHICK RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|53134379|emb|CAG32326.1| hypothetical protein RCJMB04_23b20 [Gallus gallus]
Length = 380
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 143/304 (47%), Gaps = 53/304 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 12 VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 60
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W+ K EY ILH F D + +SIH + Q G
Sbjct: 61 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 120
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 121 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 163
Query: 293 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 328
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 164 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 223
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ +++ D +
Sbjct: 224 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVD-SEENGTVDDQS 282
Query: 389 YHSE 392
+H +
Sbjct: 283 FHCQ 286
>gi|431822415|ref|NP_001258915.1| cysteine protease ATG4A isoform 1 [Gallus gallus]
Length = 397
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 143/304 (47%), Gaps = 53/304 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W+ K EY ILH F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 328
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 181 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ +++ D +
Sbjct: 241 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVD-SEENGTVDDQS 299
Query: 389 YHSE 392
+H +
Sbjct: 300 FHCQ 303
>gi|327267215|ref|XP_003218398.1| PREDICTED: cysteine protease ATG4B-like [Anolis carolinensis]
Length = 393
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 93/291 (31%), Positives = 135/291 (46%), Gaps = 55/291 (18%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVLTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALICRHLGRDWRWSKGKKQTDSYYNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + LA + +A+++ + V ++
Sbjct: 134 EGKSIGQWYGPNTVAQVLRKLASFDTWSS--------LAVHIAMDN---------TVVME 176
Query: 293 DASRHC---------SVFSKGQADW------------------TPILLLVPLVLGLEKVN 325
+ R C S F + D+ P++LL+PL LGL +N
Sbjct: 177 EIRRLCKPSCPCPGASAFPAAEPDFLSNGYPEGAECTDRLLLWKPLVLLIPLRLGLTDIN 236
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 237 EAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 287
>gi|216963264|gb|ACJ73916.1| autophagy-related 4b variant 4 [Zea mays]
Length = 208
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/168 (45%), Positives = 109/168 (64%), Gaps = 12/168 (7%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
S+ K S+LS +F ++FE + S++ A K S W+ ++R V +GSM R
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
+ LG +R ++ D+W LG C++++ ++E G + ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
RKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 205
>gi|378731837|gb|EHY58296.1| autophagy-like protein 4 [Exophiala dermatitidis NIH/UT8656]
Length = 480
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 91/272 (33%), Positives = 122/272 (44%), Gaps = 53/272 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLRSS 177
F DF SRI ++YR F PI S+ TSD GWGCM+RS
Sbjct: 114 FLDDFESRIWMTYRSNFTPIPRSQEPSRASSMSFSVRLRNLTEREGFTSDTGWGCMIRSG 173
Query: 178 QMLVAQALLFHRLGRPWRK-------------PLQKPFDREYVEILHLFGDSETSPFSIH 224
Q L+A L+ LGR WR+ + EIL LF DS +PFSIH
Sbjct: 174 QSLLANTLMLLHLGRDWRRDHTHTPTTSDSKPSSSSSSTKREAEILSLFADSPDAPFSIH 233
Query: 225 NLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
+Q G A G G W GP A A C R E C + + +YV +
Sbjct: 234 RFVQHGASACGKHPGQWFGP-------SATASCIR-ELSTECAAAGLRVYVTPSASE--- 282
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
+D R + S P L+L + LGL+++ P Y L+ + T+PQS+G
Sbjct: 283 ------LYEDRFRSIAAASPSDPTIKPTLILFGIRLGLDRITPVYHEALKSSLTYPQSIG 336
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
I GG+P +S Y VG Q + YLDPH+ +P +
Sbjct: 337 IAGGRPSSSHYFVGCQGDLFFYLDPHETRPAL 368
>gi|194378178|dbj|BAG57839.1| unnamed protein product [Homo sapiens]
Length = 411
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 93/321 (28%), Positives = 146/321 (45%), Gaps = 65/321 (20%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 196 ----------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-- 186
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + +YV + A +V D +
Sbjct: 187 -----SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT---------- 231
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 232 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 291
Query: 366 LDPHDVQPVINIGKDDLEADT 386
LDPH QP +++ + D ++
Sbjct: 292 LDPHYCQPTVDVSQADFPLES 312
>gi|62898327|dbj|BAD97103.1| APG4 autophagy 4 homolog D variant [Homo sapiens]
Length = 474
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 145/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 195 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WMSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 357 PHYCQPTVDVSQADFPLES 375
>gi|334326299|ref|XP_001366933.2| PREDICTED: cysteine protease ATG4D [Monodelphis domestica]
Length = 482
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 139/322 (43%), Gaps = 72/322 (22%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+S S + + +C + Q E GD + F +DF+SR+ ++YR+ F P+ +TSD
Sbjct: 79 TSFSKLSTVHLCGRRYQFEGEGD------IQRFQKDFASRLWLTYRRDFPPLDGGSLTSD 132
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------------- 195
GWGCMLRS QML+AQ LL H R W
Sbjct: 133 CGWGCMLRSGQMLLAQGLLLHFFSRDWTWAEAVLPPSPRESELFRSMSPSRSGASWQRGS 192
Query: 196 -----------------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 238
P Q + ++ I+ F D +PF +H L++ G++ G AG
Sbjct: 193 STASGLGRATWSTGGTLSPRQLEQEEQHRRIVSWFADQPGAPFGLHRLVELGRSSGKRAG 252
Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
W GP +A R + + +YV + A ++ D S
Sbjct: 253 DWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDCTVYKADVAQLMAQPDPS--- 302
Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
+W +++LVP+ LG E +NP Y+P ++ +GI+GGKP S Y +G
Sbjct: 303 -------TEWKSVIILVPVRLGGETLNPVYVPCVKELLRLDLCIGIIGGKPRHSLYFIGY 355
Query: 359 QEESAIYLDPHDVQPVINIGKD 380
Q++ +YLDPH QP ++ ++
Sbjct: 356 QDDFLLYLDPHYCQPCVDTSQE 377
>gi|27903825|ref|NP_116274.3| cysteine protease ATG4D [Homo sapiens]
gi|61211809|sp|Q86TL0.1|ATG4D_HUMAN RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
cysteine endopeptidase; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related cysteine endopeptidase
4; AltName: Full=Autophagy-related protein 4 homolog D
gi|27763975|emb|CAC85951.1| APG4-D protein [Homo sapiens]
gi|46362497|gb|AAH68992.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [Homo sapiens]
gi|119604524|gb|EAW84118.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_c
[Homo sapiens]
gi|312151144|gb|ADQ32084.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
construct]
Length = 474
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 145/319 (45%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 195 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 357 PHYCQPTVDVSQADFPLES 375
>gi|216963276|gb|ACJ73918.1| autophagy-related 4b variant 6 [Zea mays]
Length = 271
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/168 (45%), Positives = 109/168 (64%), Gaps = 12/168 (7%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
S+ K S+LS +F ++FE + S++ A K S W+ ++R V +GSM R
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
+ LG +R ++ D+W LG C++++ ++E G + ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
RKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 205
>gi|195470405|ref|XP_002087497.1| GE17286 [Drosophila yakuba]
gi|194173598|gb|EDW87209.1| GE17286 [Drosophila yakuba]
Length = 411
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 142/290 (48%), Gaps = 36/290 (12%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D GWGCMLR QM++AQAL+ LGR W D Y++I++ F D S +SIH
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-SDCRDATYLKIVNRFEDVRNSYYSIHQ 150
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ Q G+ A G W+GP + + + L R + +AI+V
Sbjct: 151 IAQMGETQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELESSCGMI 249
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSE 392
GG+P + Y +G ++ +YLDPH Q +G+ A+ TYH +
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAAAEQDYDETYHQK 299
>gi|216963270|gb|ACJ73917.1| autophagy-related 4b variant 5 [Zea mays]
Length = 292
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 77/168 (45%), Positives = 109/168 (64%), Gaps = 12/168 (7%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
S+ K S+LS +F ++FE + S++ A K S W+ ++R V +GSM R
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
+ LG +R ++ D+W LG C++++ ++E G + ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
RKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 205
>gi|334350077|ref|XP_001376474.2| PREDICTED: cysteine protease ATG4A-like [Monodelphis domestica]
Length = 417
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 100/320 (31%), Positives = 152/320 (47%), Gaps = 24/320 (7%)
Query: 82 VKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 141
V V G R I GP + + +W+LG + + A ++
Sbjct: 19 VTLCVFPGVKRHITILSDGPEE--LPETDEPVWILGKQYDLQ--------AVITEKSKLL 68
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
D S+R+ +YR+ F PIG + +SD GWGCMLR QM++AQAL+ LGR W +Q+
Sbjct: 69 SDISARLWFTYRRKFSPIGGTGPSSDSGWGCMLRCGQMMLAQALICKHLGRDWCWEMQQE 128
Query: 202 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEA 252
EY IL F D + +SIH + Q G G + G W GP A+ W +
Sbjct: 129 QPEEYHRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNS 188
Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 312
LA + + + + ++ + +P + D S H S G W P+L
Sbjct: 189 LAVYVSMDNTVVIEDIKKLCHMCPSHLTHDSSPSPGNGL-DQSTHLPEPSPG---WKPLL 244
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
L++PL LG+ ++NP YI + F PQSLG +GGKP ++ Y +G IYLDPH Q
Sbjct: 245 LIIPLRLGINQINPVYIDAFKECFKMPQSLGALGGKPNSAYYFIGFLGNELIYLDPHTTQ 304
Query: 373 PVINIGKDDLEADTSTYHSE 392
++ ++D D ++H +
Sbjct: 305 TFVD-SEEDGTVDDQSFHCQ 323
>gi|66529516|ref|XP_624577.1| PREDICTED: cysteine protease ATG4B [Apis mellifera]
Length = 382
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 145/300 (48%), Gaps = 42/300 (14%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 16 IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
TSD GWGCMLR QM++ QAL+ LGR W+ L+ + Y++IL F D +PFSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWSLETR-NSTYLKILERFEDKRNAPFSI 123
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 274
H + G + G G W GP + + W ++ + L + V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183
Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
G G AP+ K + W P+LLL+PL LGL ++NP YI L+
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 390
+F PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288
>gi|396482697|ref|XP_003841525.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
gi|312218100|emb|CBX98046.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
Length = 462
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 98/265 (36%), Positives = 127/265 (47%), Gaps = 42/265 (15%)
Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVG 169
A N + F DF SRI ++YR GF I S+ TSD G
Sbjct: 87 AQYGNWPSAFLDDFESRIWMTYRSGFPVIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
+GCM+RS Q ++A AL RLGR WR P +E+ IL LF D +PFSIH ++
Sbjct: 147 FGCMIRSGQCILANALQTLRLGRDWRY-QDDPTAQEHCNILSLFADDPQAPFSIHRFVEH 205
Query: 230 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A R + L + E GL +YV SGD GA V
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLVH-KYKEAGL-------RVYV-SGD------GADV 250
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+
Sbjct: 251 Y--EDKLKQVAVEEDGE--WIPTLILVGTRLGIDKITPVYWEALKASLQMKQSMGIAGGR 306
Query: 349 PGASTYIVGVQEESAIYLDPHDVQP 373
P AS Y V Q YLDPH +P
Sbjct: 307 PSASHYFVATQANHFFYLDPHSTRP 331
>gi|380023311|ref|XP_003695467.1| PREDICTED: cysteine protease ATG4B-like [Apis florea]
Length = 382
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 145/300 (48%), Gaps = 42/300 (14%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 16 IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
TSD GWGCMLR QM++ QAL+ LGR W+ L+ + Y++IL F D +PFSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWNLETR-NSTYLKILERFEDKRNAPFSI 123
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 274
H + G + G G W GP + + W ++ + L + V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183
Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
G G AP+ K + W P+LLL+PL LGL ++NP YI L+
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 390
+F PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288
>gi|393219109|gb|EJD04597.1| hypothetical protein FOMMEDRAFT_133827 [Fomitiporia mediterranea
MF3/22]
Length = 1147
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 135/298 (45%), Gaps = 62/298 (20%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 164
G N F DFSSR+ ++YR + PI D +
Sbjct: 335 GANWPPGFYSDFSSRVWLTYRSHYPPIRDQTLAQLEAEASGQIPLQPVSASPRKWHILGS 394
Query: 165 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDS 216
TSD GWGCMLR+ Q L+A AL+ LGR WR+P Q + + YV+IL F DS
Sbjct: 395 GEKGWTSDSGWGCMLRTGQSLLANALIHLHLGRDWRRPPQPVYTVDYATYVKILTWFFDS 454
Query: 217 ET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV 274
PFS+H + AGK G G W GP + + + AE GLG S+ V
Sbjct: 455 TDIHCPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTVVHA-FAEAGLGV-SVATDGVV 512
Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---------------W--TPILLLVPL 317
D P + RH + + + W P+L+LV +
Sbjct: 513 YETDVLAASNAGPYMY-----RHSRMATSSPSTRRRRSAQQQQSMMSIWGQRPVLVLVGI 567
Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
LG++ VNP Y ++ FTFPQS+GI GG+P +S Y VGVQ ++ YLDPH +P +
Sbjct: 568 RLGIDCVNPVYYDAVKALFTFPQSVGIAGGRPSSSYYFVGVQTDNLFYLDPHHSRPSV 625
>gi|350631770|gb|EHA20141.1| hypothetical protein ASPNIDRAFT_178675 [Aspergillus niger ATCC
1015]
Length = 384
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/327 (31%), Positives = 152/327 (46%), Gaps = 50/327 (15%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 151
+RI + + P S IW LG+ + +D A + F DF SRI ++
Sbjct: 11 KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKD-ANTRETQHAWPESFLLDFESRIWMT 69
Query: 152 YRKGFDPI----GDSK-------------------ITSDVGWGCMLRSSQMLVAQALLFH 188
YR F PI GD K TSD GWGCM+RS Q L+A AL
Sbjct: 70 YRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTGWGCMIRSGQSLLANALSML 129
Query: 189 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMC 247
LGR WR+ + F+ E ++L LF D+ T+PFS+H ++ G ++ G G W GP A
Sbjct: 130 VLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKYPGEWFGPSATA 186
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+ EAL+ C + + +YV + + + D +R+ S
Sbjct: 187 KCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK-----FMDIARNTS------GA 227
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
+ P L+L+ LG++ + P Y L+ FPQS+GI GG+P AS Y VG Q YLD
Sbjct: 228 FQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGRPSASHYFVGAQGSHLFYLD 287
Query: 368 PHDVQPVI---NIGKDDLEADTSTYHS 391
PH +P + G+ + + TYH+
Sbjct: 288 PHYTRPALPDRQEGELYSKEEVDTYHT 314
>gi|194853882|ref|XP_001968241.1| GG24763 [Drosophila erecta]
gi|190660108|gb|EDV57300.1| GG24763 [Drosophila erecta]
Length = 411
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 143/290 (49%), Gaps = 36/290 (12%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D GWGCMLR QM++AQAL+ LGR W D Y++I++ F D S +SIH
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-ADCRDATYLKIVNRFEDVRNSFYSIHQ 150
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 151 IAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCGMI 249
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSE 392
GG+P + Y +G ++ +YLDPH Q +G+ A+ TYH +
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAVAEQDYDETYHQK 299
>gi|355669960|gb|AER94694.1| ATG4 autophagy related 4-like protein D [Mustela putorius furo]
Length = 388
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 147/321 (45%), Gaps = 65/321 (20%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 50 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 99
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 100 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSGLGPSEPSGLASPNRYRGPAR 159
Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L++ +R + +I+ F D +PF +H L G++ G AG W GP
Sbjct: 160 WVPPRWAHGTPELEQ--ERRHRQIVSWFADHPRAPFGLHRLGGLGQSSGKKAGDWYGP-- 215
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 216 -----SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT---------- 260
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 261 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 320
Query: 366 LDPHDVQPVINIGKDDLEADT 386
LDPH QP +++ + D ++
Sbjct: 321 LDPHYCQPTVDVTQADFPLES 341
>gi|149709514|ref|XP_001500964.1| PREDICTED: cysteine protease ATG4C [Equus caballus]
Length = 458
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 144/342 (42%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF+SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHIIAGNVEEFRKDFTSRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDFESWTSNTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSEERELKTPTISLKETIGRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCASMASDHADDKAVIILVPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|298231125|ref|NP_001177213.1| cysteine protease ATG4C [Sus scrofa]
gi|296874486|gb|ADH81748.1| autophagy related 4-like protein C [Sus scrofa]
Length = 458
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 143/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKLLPARSGCTIKDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
+ S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQLEGSALTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPDALNIENSDSESWTSNTAK 155
Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGRYSDDREKQNEIYHRKIISWFGDSPLTLFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCASMAPDNTDDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|383861144|ref|XP_003706046.1| PREDICTED: cysteine protease ATG4B-like [Megachile rotundata]
Length = 384
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 147/297 (49%), Gaps = 50/297 (16%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 170
+W+LG + ++ L +D S++ +YRKGF PIG S TSD GW
Sbjct: 23 VWILGKQYNAIKE-----------LDAIRRDIRSKLWFTYRKGFVPIGGYTSTFTSDKGW 71
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
GCMLR QM++ QAL+ LGR W+ P + + Y++IL F D T+PFSIH +
Sbjct: 72 GCMLRCGQMVLGQALIILHLGRDWQWTPETR--NSTYLKILERFEDRRTAPFSIHQIASM 129
Query: 230 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
G + G G W GP + + + L + + I+V + +
Sbjct: 130 GASEGKEVGQWFGPNTIAQVLKKLVVYDDWSS--------ITIHVALDN---------TL 172
Query: 290 CIDDASRHCSVFS------------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
++D R C V K + W P+LLL+PL LGL ++NP YI L+ +F
Sbjct: 173 IVNDILRQCRVEGGTTAEADGNIPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFK 232
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 390
PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 233 IPQSLGVIGGKPNLALYFIGCVGNEVIYLDPHTTQRSGSVDKKLEEEEIEMD-ATYH 288
>gi|426191859|gb|EKV41798.1| hypothetical protein AGABI2DRAFT_123279 [Agaricus bisporus var.
bisporus H97]
Length = 1261
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/292 (33%), Positives = 132/292 (45%), Gaps = 65/292 (22%)
Query: 140 FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 165
F DF SRI ++YR F PI DS +T
Sbjct: 247 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 306
Query: 166 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGD-- 215
SD GWGCMLR+ Q L+A AL+ LGR WRKP + +Y V+IL F D
Sbjct: 307 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 366
Query: 216 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
S +PFS+H + AGK +G G W GP + + L P + V
Sbjct: 367 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 415
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQA--------DW--TPILLLVPLVLGLEKVN 325
S +DG V A + + ++ W P+L+LV L LG++ VN
Sbjct: 416 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 475
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
P Y T++ FT PQS+GI GG+PG+S Y VG Q ++ YLDPH +P I +
Sbjct: 476 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 527
>gi|145245643|ref|XP_001395089.1| cysteine protease atg4 [Aspergillus niger CBS 513.88]
gi|166990612|sp|A2QY50.1|ATG4_ASPNC RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|134079795|emb|CAK40930.1| unnamed protein product [Aspergillus niger]
Length = 404
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/346 (30%), Positives = 153/346 (44%), Gaps = 68/346 (19%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 139
+RI + + P S IW LG+ + +D + N E
Sbjct: 11 KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKDANTRETPDKNNTRENVMGTTNYRKPS 70
Query: 140 -------FNQDFSSRILISYRKGFDPI----GDSK-------------------ITSDVG 169
F DF SRI ++YR F PI GD K TSD G
Sbjct: 71 EHAWPESFLLDFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTG 130
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A AL LGR WR+ + F+ E ++L LF D+ T+PFS+H ++
Sbjct: 131 WGCMIRSGQSLLANALSMLVLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKH 187
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G ++ G G W GP A + EAL+ C + + +YV + + +
Sbjct: 188 GAESCGKYPGEWFGPSATAKCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK--- 236
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
D +R+ S + P L+L+ LG++ + P Y L+ FPQS+GI GG+
Sbjct: 237 --FMDIARNTS------GAFQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGR 288
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHS 391
P AS Y VG Q YLDPH +P + G+ + + TYH+
Sbjct: 289 PSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVDTYHT 334
>gi|409077121|gb|EKM77488.1| hypothetical protein AGABI1DRAFT_108018 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1355
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/292 (33%), Positives = 132/292 (45%), Gaps = 65/292 (22%)
Query: 140 FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 165
F DF SRI ++YR F PI DS +T
Sbjct: 334 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 393
Query: 166 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGD-- 215
SD GWGCMLR+ Q L+A AL+ LGR WRKP + +Y V+IL F D
Sbjct: 394 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 453
Query: 216 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
S +PFS+H + AGK +G G W GP + + L P + V
Sbjct: 454 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 502
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQA--------DW--TPILLLVPLVLGLEKVN 325
S +DG V A + + ++ W P+L+LV L LG++ VN
Sbjct: 503 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 562
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
P Y T++ FT PQS+GI GG+PG+S Y VG Q ++ YLDPH +P I +
Sbjct: 563 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 614
>gi|340931831|gb|EGS19364.1| cysteine protease-like protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 494
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/277 (35%), Positives = 131/277 (47%), Gaps = 51/277 (18%)
Query: 127 ALGDAAGNNG---LAEFNQDFSSRILISYRKGF-------DPIGDSKIT----------- 165
A GDA G F DF SRI ++YR GF DP S ++
Sbjct: 139 AYGDADGTTDGGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSALSFSMRLKTSFGA 198
Query: 166 ------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 219
SD GWGCM+RS Q L+A ALL RLGR WR+ +RE IL LF D +
Sbjct: 199 DQAGFSSDTGWGCMIRSGQSLLANALLISRLGREWRRGQNPKAERE---ILSLFADDPRA 255
Query: 220 PFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 278
P+S+HN ++ G +A G G W GP A R +ALA +E + +Y
Sbjct: 256 PYSLHNFVKHGAEACGKFPGEWFGPSATARCIQALANKHESE---------LRVYST--- 303
Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
G P V D ++ + + P L+LV LG++K+N Y L T
Sbjct: 304 -----GDLPDVYEDS---FMAIANPDGQHFHPTLVLVCTRLGIDKINKVYEQALISTLQM 355
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
QS+GI GG+P S Y +GVQ++ YLDPH +P++
Sbjct: 356 EQSIGIAGGRPSQSHYFIGVQDQWLFYLDPHYPRPML 392
>gi|342321655|gb|EGU13587.1| Cysteine protease ATG4 [Rhodotorula glutinis ATCC 204091]
Length = 1119
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 96/296 (32%), Positives = 125/296 (42%), Gaps = 75/296 (25%)
Query: 134 NNGLAEFNQDFSSRILISYRKGF-----DPIGDSK------------------------- 163
N A F D SRI ++YR GF DP S
Sbjct: 644 NGWPAAFYHDSYSRIALTYRSGFPIIPCDPSSSSTGVVQGMLNNLSMSIGRGGHRGPSPT 703
Query: 164 -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-----------KPFDREYV 207
++SD GWGCMLR+ Q L+A AL+ LGR WR+PL P Y
Sbjct: 704 NAEGGLSSDTGWGCMLRTGQSLLANALVKVHLGRDWRRPLPLGDFITSSTSPVPSAATYA 763
Query: 208 EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
IL LF D S SPFS+H Q GK G G W GP + + L
Sbjct: 764 RILSLFLDDPSPISPFSVHRFAQQGKVLGKEIGEWFGPSTAAGAIKTLVNAYE------- 816
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---W-TPILLLVPLVLGL 321
P + VVS C+D V + D W TP+L+L+ + LG+
Sbjct: 817 ---PAGLKVVS-------------CVDGTVYESEVVAASTKDGEKWKTPVLVLINVRLGI 860
Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
+ VNP Y ++ F PQS+GI GG+P +S Y VG Q S Y+DPH +P + +
Sbjct: 861 DGVNPIYYEAIKGIFRLPQSVGIAGGRPSSSYYFVGAQANSLFYIDPHHPRPAVPL 916
>gi|332232054|ref|XP_003265216.1| PREDICTED: cysteine protease ATG4C isoform 1 [Nomascus leucogenys]
gi|332232056|ref|XP_003265217.1| PREDICTED: cysteine protease ATG4C isoform 2 [Nomascus leucogenys]
Length = 458
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 143/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIADHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +I+ FGDS +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLAPFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|308491308|ref|XP_003107845.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
gi|308249792|gb|EFO93744.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
Length = 518
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 92/266 (34%), Positives = 135/266 (50%), Gaps = 39/266 (14%)
Query: 130 DAAG-NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
DA G ++G +F D+ SR+ I+YR F P+ ++ T+D GWGCM+R++QM+VAQA++ +
Sbjct: 159 DANGVSSGFEDFCSDYYSRLWITYRTDFAPLLNTDTTTDCGWGCMIRTTQMMVAQAIMLN 218
Query: 189 RLGRPWRKPLQKP-----------FDREYVE---ILHLFGDSETSPFSIHNLLQ--AGKA 232
R GR WR +K FDRE ++ IL LF D +SP IH +++ A +
Sbjct: 219 RFGREWRFVRRKKSYVTINGEETDFDREKIKEWMILKLFEDKPSSPLGIHRMVEISAKEK 278
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
A GSW P EA+ ++A L +I ++GD A + I
Sbjct: 279 GKKAVGSWYSPS------EAVFIMKKA--------LTESISPLTGD------TAMYLSI- 317
Query: 293 DASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
D H +W L+LV +V LG ++NP Y+P L F+ LG+ GG+P
Sbjct: 318 DGRVHIRDIEVETKNWMKTLILVIVVRLGAAELNPIYVPHLMRLFSMESCLGVTGGRPDH 377
Query: 352 STYIVGVQEESAIYLDPHDVQPVINI 377
S + VG + IYLDPH I I
Sbjct: 378 SCWFVGFYGDQIIYLDPHVAHEYIPI 403
>gi|301764643|ref|XP_002917740.1| PREDICTED: cysteine protease ATG4C-like [Ailuropoda melanoleuca]
gi|281350282|gb|EFB25866.1| hypothetical protein PANDA_006093 [Ailuropoda melanoleuca]
Length = 458
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 140/342 (40%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 224
QK R Y + I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTVSQKETIRRYSDDHEMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEETRHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + D +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|195575679|ref|XP_002077704.1| GD23066 [Drosophila simulans]
gi|194189713|gb|EDX03289.1| GD23066 [Drosophila simulans]
Length = 411
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 144/292 (49%), Gaps = 40/292 (13%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 223
+D GWGCMLR QM++AQAL+ LGR W P R+ Y++I++ F D S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSE 392
++GG+P + Y +G ++ +YLDPH Q + + A+ TYH +
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQK 299
>gi|426215654|ref|XP_004002085.1| PREDICTED: cysteine protease ATG4C isoform 1 [Ovis aries]
gi|426215656|ref|XP_004002086.1| PREDICTED: cysteine protease ATG4C isoform 2 [Ovis aries]
Length = 458
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/342 (28%), Positives = 144/342 (42%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|296489147|tpg|DAA31260.1| TPA: APG4 autophagy 4 homolog C [Bos taurus]
Length = 458
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 143/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKMERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|19920488|ref|NP_608563.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
gi|7296129|gb|AAF51423.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
gi|16198037|gb|AAL13802.1| LD26292p [Drosophila melanogaster]
gi|220945806|gb|ACL85446.1| Atg4-PA [synthetic construct]
gi|220955642|gb|ACL90364.1| Atg4-PA [synthetic construct]
Length = 411
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 144/292 (49%), Gaps = 40/292 (13%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 223
+D GWGCMLR QM++AQAL+ LGR W P R+ Y++I++ F D S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSE 392
++GG+P + Y +G ++ +YLDPH Q + + A+ TYH +
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQK 299
>gi|442625102|ref|NP_001259852.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
gi|440213106|gb|AGB92389.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
Length = 410
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 144/292 (49%), Gaps = 40/292 (13%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 223
+D GWGCMLR QM++AQAL+ LGR W P R+ Y++I++ F D S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSE 392
++GG+P + Y +G ++ +YLDPH Q + + A+ TYH +
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQK 299
>gi|225685095|gb|EEH23379.1| peptidase family C54 [Paracoccidioides brasiliensis Pb03]
Length = 508
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 136/286 (47%), Gaps = 47/286 (16%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVG 169
G++ A F DF S+I ++YR GF DP + T+D G
Sbjct: 138 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 197
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A AL LGR WR+ + D+E +L LF D +PFSIH ++
Sbjct: 198 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 254
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A R +AL+ C+ + +YV S D
Sbjct: 255 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 298
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+D R + +A P L+L+ + LG+++V P Y L+ +PQS+GI GG+
Sbjct: 299 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 357
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHS 391
P +S Y +G Q YLDPH +P + G+ E + ++YH+
Sbjct: 358 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHT 403
>gi|47222154|emb|CAG11580.1| unnamed protein product [Tetraodon nigroviridis]
Length = 440
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 101/350 (28%), Positives = 149/350 (42%), Gaps = 81/350 (23%)
Query: 108 SSTSDIWLLGVCH--KIAQDEALG----------DAAGNNGLAEFNQDFSSRILISYRKG 155
S S + LLG C+ K+ +DE + D GN + +F +DF SRI ++YR+
Sbjct: 36 SRNSPVLLLGKCYHFKVEEDEGVAEACCEASDEEDVVGN--VEDFRRDFGSRIWLTYREE 93
Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDRE------- 205
F P+ S +TSD GWGCMLR+ QM++AQALL H +GR W R +P D E
Sbjct: 94 FPPLPGSTLTSDCGWGCMLRAGQMMLAQALLLHFMGRDWTWSRTMSLQPLDTETWTTSAA 153
Query: 206 ------------------------------------YVE-------ILHLFGDSETSPFS 222
+VE ++ FGDS ++ F
Sbjct: 154 KRLVASLESSLQGSPGPSDNRGPQNQAAGSAEEAGAHVEGEAFHRTLVSWFGDSPSAQFG 213
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSW-----EALARCQRAETGLGCQSLPMAIYVVSG 277
+H ++ G G AG W GP + EAL T Q + V
Sbjct: 214 LHRMVHLGLEMGKQAGEWYGPAVVAHILKKAVEEALDPSLAGITAYVSQDCTVYSADVID 273
Query: 278 DEDGERGGAP-----VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
+P V + ++ S +A +++LVP+ LG EK NP Y
Sbjct: 274 GHKASTSASPESSDDVTLLSPNNQAASALPDSRA----VIILVPVRLGGEKTNPDYFNLA 329
Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+ + +GI+GGKP + Y VG Q++S IY+DPH Q +++ D
Sbjct: 330 KSILSLDYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDF 379
>gi|392572178|gb|EIW65350.1| hypothetical protein TRAVEDRAFT_33890 [Trametes versicolor
FP-101664 SS1]
Length = 997
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 94/282 (33%), Positives = 133/282 (47%), Gaps = 58/282 (20%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 166
F DF+SRI ++YR F PI D+ + T+
Sbjct: 298 FYADFTSRIWLTYRSQFFPIRDTTLAALDAELMDNPTGVPSSPPTKKWNWPLGGEKGWTT 357
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 221
D GWGCMLR+ Q L+A AL+ LGR WR+P + +Y V+I+ F D+ + PF
Sbjct: 358 DAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQIVTWFLDNPSPLCPF 417
Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
S+H + GK G G W GP + + L + P A V+ DG
Sbjct: 418 SVHRMALVGKDLGKDVGQWFGPSTAAGAIKTL-----------VHAFPEATLGVANAVDG 466
Query: 282 ERGGAPVVCIDDASRHC--SVFSKGQA--DW--TPILLLVPLVLGLEKVNPRYIPTLRLT 335
+ V ASR S G A DW +L+L+ + LG+E VNP Y T++
Sbjct: 467 TLYESDVYA---ASRSVMYSTRRHGHARMDWGDRAVLVLIGIRLGIEGVNPLYYNTIKTL 523
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
+TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + +
Sbjct: 524 YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPL 565
>gi|226294409|gb|EEH49829.1| cysteine protease atg4 [Paracoccidioides brasiliensis Pb18]
Length = 513
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 136/286 (47%), Gaps = 47/286 (16%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVG 169
G++ A F DF S+I ++YR GF DP + T+D G
Sbjct: 143 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 202
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A AL LGR WR+ + D+E +L LF D +PFSIH ++
Sbjct: 203 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 259
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A R +AL+ C+ + +YV S D
Sbjct: 260 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 303
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+D R + +A P L+L+ + LG+++V P Y L+ +PQS+GI GG+
Sbjct: 304 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 362
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHS 391
P +S Y +G Q YLDPH +P + G+ E + ++YH+
Sbjct: 363 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHT 408
>gi|50344862|ref|NP_001002103.1| cysteine protease ATG4C [Danio rerio]
gi|47938047|gb|AAH71514.1| Autophagy-related 4C (yeast) [Danio rerio]
Length = 463
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 101/342 (29%), Positives = 147/342 (42%), Gaps = 77/342 (22%)
Query: 108 SSTSDIWLLGVCH--KIAQDE--------ALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
S S ++LLG C+ K+ DE AL D + EF +DF+SR+ ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKVVDDENPTESTAEALDDDVVTGNVDEFRKDFTSRVWLTYREEFP 95
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQK---------- 200
+ S TSD GWGC LR+ QM++AQALL H LGR W+ +PL
Sbjct: 96 ALPGSSFTSDCGWGCTLRAGQMILAQALLLHILGRDWKWSEALSLEPLDTETWTSSAARR 155
Query: 201 ---------------------PFDREYVE------------ILHLFGDSETSPFSIHNLL 227
P E E I+ FGD ++ I+ L+
Sbjct: 156 LVATLEASIQGERAQASQPLCPVQGEAEEADSYLKETYHRTIVSWFGDGPSAQLGIYKLV 215
Query: 228 QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 287
+ G G AG W GP +A R ++ I V +D A
Sbjct: 216 ELGMTSGKQAGDWYGP-------AVVAHILRKAVDEAVDAMLKGIRVYVA-QDCTVYSAD 267
Query: 288 VVCIDDASRHCSVFSKGQA-------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
V ID S S Q D +++L+P+ LG EK+NP Y+ ++ +
Sbjct: 268 V--IDSHSTRTESHSDPQGLDSGASPDSRAVVILIPVRLGGEKINPEYLNFVKSILSLEY 325
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+GI+GGKP + Y VG Q++S IY+DPH Q +++ D
Sbjct: 326 CIGIIGGKPKQAYYFVGFQDDSLIYMDPHYCQSFVDVSTSDF 367
>gi|367032280|ref|XP_003665423.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
42464]
gi|347012694|gb|AEO60178.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
42464]
Length = 456
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/274 (36%), Positives = 135/274 (49%), Gaps = 54/274 (19%)
Query: 130 DAAGNNGLAE-FNQDFSSRILISYRKGF-------DP---------------IGD-SKIT 165
+++G++G F DF SRI ++YR GF DP +GD + T
Sbjct: 111 ESSGDSGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSSFSIAMRLKTTLGDQTGFT 170
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
SD GWGCM+RS Q L+A ALL RLGR WR+ +R IL LF D +P+S+HN
Sbjct: 171 SDTGWGCMIRSGQSLLANALLISRLGRDWRRMTDPDAERP---ILALFADDSRAPYSLHN 227
Query: 226 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
++ G+ A G G W GP A R +ALA + E+ L S G
Sbjct: 228 FVKHGELACGKYPGEWFGPSATARCIQALA--NKHESSLRVYST---------------G 270
Query: 285 GAPVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
P V D S + + D + P L+LV LG++K+N Y+ L T QS
Sbjct: 271 DLPDVYED------SFMATAKPDGETFHPTLILVCTRLGIDKINQVYVEALISTLQMEQS 324
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
+GI GG+P +S Y VGVQ + YLDPH +P +
Sbjct: 325 IGIAGGRPASSHYFVGVQGQWLFYLDPHHPRPKL 358
>gi|449273759|gb|EMC83168.1| Cysteine protease ATG4A, partial [Columba livia]
Length = 395
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 140/304 (46%), Gaps = 53/304 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178
Query: 293 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 328
D + C +G W P+LL++PL LG+ +NP Y
Sbjct: 179 DIKKMCWSPPQGSGAAHSSAHLHRSALGRTKNAAGFCTGWKPLLLIIPLRLGINHINPVY 238
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ +++ D +
Sbjct: 239 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVD-SEENGTVDDES 297
Query: 389 YHSE 392
+H +
Sbjct: 298 FHCQ 301
>gi|149507363|ref|XP_001514370.1| PREDICTED: cysteine protease ATG4C [Ornithorhynchus anatinus]
Length = 459
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 96/338 (28%), Positives = 140/338 (41%), Gaps = 76/338 (22%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNN-----------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +E G +N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKSEEDDGIPVRSNWAPEDPAVISGNVDEFRKDFVSRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
P+G S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PPMGASGLTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPAALDMENSDSESWTSHTVK 155
Query: 198 -LQKPFDREYV--------------------------------EILHLFGDSETSPFSIH 224
L F+ +V +I+ FGDS + F +H
Sbjct: 156 KLTASFEASWVGERDPRPPSASRNAPRGSGSVRDEMRNEGFHRKIISWFGDSPRTYFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L + GK G AG W GP + R G + +YV
Sbjct: 216 QLTEYGKKSGKTAGDWYGPAVVAHILRKAVEEVRHPDLQG-----LTVYVAQ-------- 262
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G+ D +L+LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 263 DCTVYNSDVTDKLRASTDSGKTDDKAVLILVPVRLGGERTNIDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+GGKP S Y G Q++S IY+DPH Q +++ D
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDF 360
>gi|119195519|ref|XP_001248363.1| cysteine protease atg4 [Coccidioides immitis RS]
gi|303321428|ref|XP_003070708.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
SOWgp]
gi|121769827|sp|Q1E5M9.1|ATG4_COCIM RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|240110405|gb|EER28563.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
SOWgp]
gi|320040173|gb|EFW22106.1| cysteine protease atg4 [Coccidioides posadasii str. Silveira]
gi|392862420|gb|EAS36938.2| cysteine protease atg4 [Coccidioides immitis RS]
Length = 432
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 125/278 (44%), Gaps = 50/278 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF S+ +YR F I S+ T+D GWGCM+RS
Sbjct: 105 FLDDFESKFWFTYRSNFPAIPKSRDPDTPLALTLSVRLRSQFLDTHGFTADTGWGCMIRS 164
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A AL LGR WR+ + +E E+L LF D+ +PFSIH + G A G
Sbjct: 165 GQSLLANALSILNLGRDWRRGSKI---KEECELLSLFADNPQAPFSIHRFVDYGASACGK 221
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R EAL+ C+ + +YV+S D + D
Sbjct: 222 HPGEWFGPSATARCIEALSN--------ECKHTDLNVYVMSDGSDVHEDQFRQIAGPDGI 273
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
R P L+L+ + LG+E V P Y LR +PQS+GI GG+P +S Y
Sbjct: 274 R-------------PTLILLGVRLGIESVTPVYWEALRAIIRYPQSVGIAGGRPSSSLYF 320
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHS 391
+GVQ YLDPH +P ++ D + TYH+
Sbjct: 321 IGVQGPYFFYLDPHHTRPAVSWNPDSTLSPENLDTYHT 358
>gi|326924562|ref|XP_003208495.1| PREDICTED: cysteine protease ATG4A-like, partial [Meleagris
gallopavo]
Length = 421
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 142/304 (46%), Gaps = 53/304 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGRRHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 161
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 293 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 328
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 205 DIKKMCWSPPQSSSTAHSSAHLHRSALGRNRNTAGLCTGWKPLLLIIPLRLGINHINPVY 264
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ +++ D +
Sbjct: 265 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVD-SEENGTVDDQS 323
Query: 389 YHSE 392
+H +
Sbjct: 324 FHCQ 327
>gi|440902657|gb|ELR53425.1| Cysteine protease ATG4C [Bos grunniens mutus]
Length = 458
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 143/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIHHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|295657177|ref|XP_002789160.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226284504|gb|EEH40070.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 601
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 136/286 (47%), Gaps = 47/286 (16%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVG 169
G++ A F DF S+I ++YR GF DP + T+D G
Sbjct: 229 GHDWPAPFLDDFESKIWLTYRSGFPFIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 288
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A AL LGR WR+ + D+E +L LF D +PFSIH ++
Sbjct: 289 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 345
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A R +AL+ C+ + +YV S D
Sbjct: 346 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 389
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+D R + +A P L+L+ + LG+++V P Y L+ +PQS+GI GG+
Sbjct: 390 -VYEDRFRTIASGGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 448
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHS 391
P +S Y +G Q YLDPH +P + G+ E + ++YH+
Sbjct: 449 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHT 494
>gi|302674653|ref|XP_003027011.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
gi|300100696|gb|EFI92108.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
Length = 858
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 94/275 (34%), Positives = 132/275 (48%), Gaps = 58/275 (21%)
Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------------------------TS 166
AA + EF DF+SR+ ++YR GF PI D + TS
Sbjct: 148 AAASGWPQEFFSDFASRLWLTYRSGFAPIRDMALEELEPVRGGALSTLTSALTGRRGLTS 207
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIH 224
D GWGCMLR+ Q L+A AL+ +GR Y+ ++ LF DS + +PFS+H
Sbjct: 208 DAGWGCMLRTGQSLLANALVVAWMGRGALA--------LYIHLISLFLDSPSPSAPFSVH 259
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
+ AG+A G G W GP + +AL + GLG V+ EDG
Sbjct: 260 RMALAGRALGKDVGQWFGPSTAAGAIKALVNAY-PDAGLG----------VAIAEDG--- 305
Query: 285 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
V R + + +W P+L+L+ + LGL+ VNP Y T++ +TFPQSL
Sbjct: 306 ----VVYQTQRRQ----KEREREWGDQPVLVLLGIRLGLDGVNPIYYDTIKQLYTFPQSL 357
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
GI GG+P +S Y VG Q YLDPH +P + +
Sbjct: 358 GIAGGRPSSSYYFVGAQAGDLFYLDPHHARPTVPL 392
>gi|296208133|ref|XP_002750954.1| PREDICTED: cysteine protease ATG4C [Callithrix jacchus]
Length = 458
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 143/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +++ FGDS +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEIRNEIYHRKVISWFGDSPLAPFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|327277326|ref|XP_003223416.1| PREDICTED: cysteine protease ATG4A-like [Anolis carolinensis]
Length = 385
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 93/295 (31%), Positives = 144/295 (48%), Gaps = 36/295 (12%)
Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
W+LG H++ +++ + D S+R+ +YR+ F PIG + +SD GWGCM
Sbjct: 17 WILGRQHQLKTEKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 65
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
LR QM++AQAL+ LGR W K EY IL F D + +SIH + Q G
Sbjct: 66 LRCGQMMLAQALICRHLGRDWHWEEHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGE 125
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGER------ 283
G + G W GP + + + LA + +A+YV + ED ++
Sbjct: 126 GKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCRLPN 177
Query: 284 GGAPVVCIDDASRHCSVFSKGQAD------WTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
P V H S+ S+ ++ W P+LL++PL LG+ +NP Y+ + F
Sbjct: 178 QNCPPVAHCSPLSHQSLLSRNRSPGGFCCGWKPLLLIIPLRLGINHINPVYVDAFKECFK 237
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
PQSLG +GGKP + Y +G IYLDPH Q ++ +++ D ++H +
Sbjct: 238 MPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQLFVD-SEENSTVDDRSFHCQ 291
>gi|417401291|gb|JAA47536.1| Putative cysteine protease required for autophagy [Desmodus
rotundus]
Length = 458
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 98/342 (28%), Positives = 143/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C H +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----------------LQ 199
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P ++
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 200 K-----------------------------PFDRE------YVEILHLFGDSETSPFSIH 224
K P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGRYPDDREMQNEVYHRKIISWFGDSPVALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQRASMTSDNTDGKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|449498615|ref|XP_002197397.2| PREDICTED: cysteine protease ATG4A [Taeniopygia guttata]
Length = 412
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 141/304 (46%), Gaps = 53/304 (17%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHC------------------SVFSKGQAD------WTPILLLVPLVLGLEKVNPRY 328
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 181 DIKKMCWSPAQSSSVAHSSAHVHRSALGQNKNTAGLCPGWKPLLLIIPLRLGINHINPVY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ +++ D +
Sbjct: 241 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVD-SEENGTVDDKS 299
Query: 389 YHSE 392
+H +
Sbjct: 300 FHCQ 303
>gi|302684483|ref|XP_003031922.1| hypothetical protein SCHCODRAFT_109321 [Schizophyllum commune H4-8]
gi|300105615|gb|EFI97019.1| hypothetical protein SCHCODRAFT_109321, partial [Schizophyllum
commune H4-8]
Length = 602
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 144/310 (46%), Gaps = 82/310 (26%)
Query: 112 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------- 164
+IWL+GVCH G +F DF++RI ++YR GF+ I D ++
Sbjct: 114 EIWLMGVCHA-------------PGAPDFYADFATRIWLTYRSGFELIRDRQLIDLPPPV 160
Query: 165 ------------------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
+SD GWGCMLR+ Q L+A ALL GR WR+ +
Sbjct: 161 ASLDGHLQGEWATDEAEPPGAYGFSSDSGWGCMLRTGQSLLANALLTAWFGRDWRRISEV 220
Query: 201 PFDRE--YVEILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
+ YV +L LF D+ T+PFSIH + AGK G G W GP + + L
Sbjct: 221 ETHQHSLYVHLLSLFLDTPHPTAPFSIHRMALAGKQLGKDIGQWFGPSTAAGAIKNL--- 277
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT------- 309
+ P+A G VV +D A VF+ ++W+
Sbjct: 278 --------VSAYPLA------------GIGVVVGMDGALSKSEVFTASHSEWSDEEAALD 317
Query: 310 ----PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
P+L+L+ L LGL++VNP Y T++ FTFPQS+GI GG+P +S + VG Q IY
Sbjct: 318 WGDRPVLILLNLRLGLDRVNPIYHDTIKALFTFPQSVGIAGGRPCSSYHFVGAQGSDLIY 377
Query: 366 LDPHDVQPVI 375
LDPH + +
Sbjct: 378 LDPHHTRNTV 387
>gi|410921904|ref|XP_003974423.1| PREDICTED: cysteine protease ATG4C-like [Takifugu rubripes]
Length = 468
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 99/342 (28%), Positives = 147/342 (42%), Gaps = 67/342 (19%)
Query: 108 SSTSDIWLLGVCHKI------AQDEALGDAAGNNGLA----EFNQDFSSRILISYRKGFD 157
S S + LLG C+ Q EA +A+ G+ +F +DF SRI ++YR+ F
Sbjct: 29 SRNSPVLLLGKCYHFKAEEDEGQTEACREASDEEGVMGNVEDFRRDFGSRIWLTYREEFP 88
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE--------- 205
P+ S +TSD GWGCMLR+ QM++AQALL H LGR W + +P D E
Sbjct: 89 PLPGSSLTSDCGWGCMLRAGQMMLAQALLLHFLGRDWTWSGAMSLQPLDTETWTTSAAKR 148
Query: 206 ----------------------------------------YVEILHLFGDSETSPFSIHN 225
+ ++ FGDS ++ F +H
Sbjct: 149 LVASLESSLQASPGPSDPVVSQRQVAGSGEEAGVHTDGGFHRTLVSWFGDSPSAQFGLHR 208
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS-LPMAIYVVSGDEDGERG 284
+++ G A G AG W GP + + R G S + V S D
Sbjct: 209 MVRLGLAMGKRAGEWYGPAVVAHILKKAVEEARDPCLAGISSYVSQDCTVYSADVIDSHK 268
Query: 285 GAPVVCID----DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
+ + +S H S + D +++LVP+ LG EK NP Y + +
Sbjct: 269 ASASAAAERPDVTSSSHNSQPASASPDSRAVIILVPVRLGGEKTNPDYFNLAKSFLSLDY 328
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+GI+GGKP + Y VG Q++S IY+DPH Q +++ D
Sbjct: 329 CIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDF 370
>gi|27763971|emb|CAC85555.1| Apg4-C protein [Mus musculus]
gi|148698944|gb|EDL30891.1| autophagy-related 4C (yeast), isoform CRA_a [Mus musculus]
Length = 458
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 99/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|225543220|ref|NP_778194.3| cysteine protease ATG4C [Mus musculus]
gi|225543224|ref|NP_001139439.1| cysteine protease ATG4C [Mus musculus]
gi|341940254|sp|Q811C2.2|ATG4C_MOUSE RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
cysteine endopeptidase; AltName: Full=Autophagin-3;
AltName: Full=Autophagy-related cysteine endopeptidase
3; AltName: Full=Autophagy-related protein 4 homolog C
Length = 458
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 99/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|121704590|ref|XP_001270558.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
gi|166990611|sp|A1CJ08.1|ATG4_ASPCL RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|119398704|gb|EAW09132.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
Length = 400
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 93/280 (33%), Positives = 129/280 (46%), Gaps = 49/280 (17%)
Query: 139 EFNQDFSSRILISYRKGFDPIG----------------------DSK-ITSDVGWGCMLR 175
EF D SRI I+YR F PI DS+ TSD GWGCM+R
Sbjct: 75 EFLDDVESRIWITYRSNFTPIPKPPNQEANPAMTLTVHLRSQLMDSQGFTSDTGWGCMIR 134
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 234
S Q L+A A+L LGR WR+ + + ++LH F D +PFSIH +Q G +
Sbjct: 135 SGQSLLANAMLILLLGRDWRRGTEA---GKEAQLLHQFADHPEAPFSIHRFVQHGAEFCN 191
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R +AL A+ G S + +Y+ D + D
Sbjct: 192 KYPGEWFGPSATARCIQALV----AQQG----SSELRVYITDDTAD--------IYEDKF 235
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+R + D+ P L+LV LG++ V P Y L+ PQS+GI GG+P AS Y
Sbjct: 236 AR---IAQAEHGDFIPTLILVGTRLGIDHVTPAYWDALKEALQLPQSVGIAGGRPSASHY 292
Query: 355 IVGVQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHS 391
+GV + YLDPH +P ++ + +TYH+
Sbjct: 293 FIGVHGQYLFYLDPHHTRPASLHQDVNDTLTHEEVNTYHT 332
>gi|166990618|sp|A7KAI3.1|ATG4_PICAN RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|129714817|gb|ABO31288.1| Atg4p [Ogataea angusta]
Length = 509
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 99/295 (33%), Positives = 137/295 (46%), Gaps = 56/295 (18%)
Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 163
L + HK D+A A + EF +D SRI ++YR GF I ++
Sbjct: 51 LRTLFHKFKPDQAADTEA--SWPREFLRDVHSRIWLTYRSGFPLIKRAEDGPSPLSFGSL 108
Query: 164 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
T+D GWGCM+R+SQ L+A +LL RLGR WR + + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANSLLQLRLGRGWRYDQTRECAK-HAEIV 167
Query: 211 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
F D T+PFSIHN ++ G G G W GP A RS + L +TGL
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKTGLKV---- 223
Query: 270 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 325
+ SGD ED +F Q A+ P+L+L + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQQGAELRPVLILAGIRLGVKNVN 263
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
P Y L+ T +PQS+GI GG+P +S Y G Q + YLDPH Q + I +
Sbjct: 264 PLYWDFLKKTLGWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASE 318
>gi|449551395|gb|EMD42359.1| ATG4-like protein [Ceriporiopsis subvermispora B]
Length = 988
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 93/272 (34%), Positives = 125/272 (45%), Gaps = 57/272 (20%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 167
F DF+SRI ++YR F PI D+ + TSD
Sbjct: 305 FYSDFTSRIWVTYRSQFQPIRDTTLSALELELGESTAVATSPQPKKWNWPLGGEKGWTSD 364
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD------REYVEILHLFGDSETS-- 219
GWGCMLR+ Q L+A LL LGR WR+P P+ YV+IL F D+ +
Sbjct: 365 AGWGCMLRTGQSLLANTLLHLHLGRDWRRP---PYPICTADYATYVQILTWFFDNPSPLC 421
Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
PFS+H + GK G G W GP + + L E GLG ++ S
Sbjct: 422 PFSVHRMALVGKELGKEVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVATDSVIYQSD-- 478
Query: 280 DGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
V S S G++ W +L+LV + LGL+ VNP Y T++ +T
Sbjct: 479 ---------VYTASRSNLGSPRRNGRSGWGDRAVLVLVGIRLGLDGVNPIYYDTIKALYT 529
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
FPQS+GI GG+P +S Y VG Q ++ YLDPH
Sbjct: 530 FPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 561
>gi|126723748|ref|NP_001075911.1| cysteine protease ATG4C [Bos taurus]
gi|126010621|gb|AAI33599.1| ATG4C protein [Bos taurus]
Length = 458
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L+ GK G AG W GP + R G + IYV
Sbjct: 216 QLIAYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|313228003|emb|CBY23152.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 88/241 (36%), Positives = 121/241 (50%), Gaps = 27/241 (11%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
L + DF SR+ +YR+ F IG S TSD GWGCMLR+ QMLVA+ LL RLGR +
Sbjct: 39 LEDIQGDFQSRLWFTYRRNFASIGGSGPTSDQGWGCMLRAGQMLVAECLLRQRLGRNYVW 98
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
D Y EIL LF D+ ++ S+ + L A A G W GP M + L R
Sbjct: 99 SESSIEDERYTEILELFRDTHSAELSLQQIALTGATAEKRAVGEWFGPNTMA---QVLKR 155
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
++ +SL + V VV ++D S + + G+ TP++L++
Sbjct: 156 ITKS------RSLGFGVTVAMDS---------VVSVEDVS--AEIINGGKP--TPLVLMI 196
Query: 316 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA----IYLDPHDV 371
PL LGL VN Y+ L++ +GI+GGKP + Y VG QE +YLDPH
Sbjct: 197 PLRLGLNSVNEIYVNPLKIFLASKYCVGIMGGKPNQAHYFVGYQETVEDTWLLYLDPHTT 256
Query: 372 Q 372
Q
Sbjct: 257 Q 257
>gi|17544636|ref|NP_502208.1| Protein ATG-4.2 [Caenorhabditis elegans]
gi|5824904|emb|CAB54515.1| Protein ATG-4.2 [Caenorhabditis elegans]
Length = 521
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 96/297 (32%), Positives = 139/297 (46%), Gaps = 52/297 (17%)
Query: 111 SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 170
+D+ LG + + DE+ +G F D+ SR+ I+YR F + D+ T+D GW
Sbjct: 146 NDVVFLGRRYSTSVDES----GLRSGFENFCSDYYSRLWITYRTDFPALLDTDTTTDCGW 201
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQK-----------PFDREYVE---ILHLFGDS 216
GCM+R++QM+VAQA++ +R GR WR +K FDRE ++ IL LF D
Sbjct: 202 GCMIRTTQMMVAQAIMVNRFGRDWRFTRRKRSHVAAHGDEDDFDREKIQEWMILKLFEDK 261
Query: 217 ETSPFSIHNLL---QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 273
T+P IH ++ GK A GSW P EA+ ++A L S P+
Sbjct: 262 PTAPLGIHKMVGIAAMGKGKK-AVGSWYSPS------EAVFIMKKA---LTESSSPLT-- 309
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTL 332
G ++ D H +W L+LV +V LG ++NP Y+P L
Sbjct: 310 ----------GNTAMLLSIDGRVHIRDIEVETKNWMKKLILVIVVRLGAAELNPIYVPHL 359
Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH--------DVQPVINIGKDD 381
F LGI GG+P S++ VG + IYLDPH D+ P N+ D
Sbjct: 360 MRLFAMESCLGITGGRPDHSSWFVGYYGDQIIYLDPHVAHEYIPIDINPNTNVVDSD 416
>gi|148698945|gb|EDL30892.1| autophagy-related 4C (yeast), isoform CRA_b [Mus musculus]
Length = 466
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 99/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 44 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 103
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 104 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 163
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F DRE + +I+ FGDS + F +H
Sbjct: 164 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 223
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 224 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 271
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 272 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 330
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 331 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 372
>gi|335774946|gb|AEH58408.1| cysteine protease ATG4C-like protein, partial [Equus caballus]
Length = 400
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 134/307 (43%), Gaps = 67/307 (21%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
AGN + EF +DF+SRI ++YR+ F I S +T+D GWGC +R+ QML+AQ L+ H LG
Sbjct: 15 AGN--VEEFRKDFTSRIWLTYREEFPQIEGSTLTTDCGWGCTVRTGQMLLAQGLILHFLG 72
Query: 192 RPW----------------------------------RKPLQKPF------------DRE 205
R W + L+ P D E
Sbjct: 73 RAWTWPDALNIENSDFESWTSNTVKKFTASFEASLSEERELKTPTISLKETIGRYSDDHE 132
Query: 206 ------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
+ +I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 133 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 192
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
G + IYV V D + C+ + AD +++LVP+ L
Sbjct: 193 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCASMASDHADDKAVIILVPVRL 239
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G E+ N Y+ ++ + +GI+GGKP S Y G Q++S IY+DPH Q +++
Sbjct: 240 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 299
Query: 380 DDLEADT 386
D +T
Sbjct: 300 KDFPLET 306
>gi|355669957|gb|AER94693.1| ATG4 autophagy related 4-like protein C [Mustela putorius furo]
Length = 396
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 130/307 (42%), Gaps = 67/307 (21%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
AGN + EF +DF SRI ++YR+ F I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 11 AGN--VEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 68
Query: 192 RPWRKP----------------------------------------LQKPFDREYVE--- 208
R W P QK R Y +
Sbjct: 69 RAWTWPDALNIENSDSESWTSNTVKKFTASFEASLSGEGELKTPTVSQKEAIRRYSDDHE 128
Query: 209 ---------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 129 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 188
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
G + IYV V D + C+ + D +++L+P+ L
Sbjct: 189 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRL 235
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G E+ N Y+ ++ + +GI+GGKP S Y G Q++S IY+DPH Q +++
Sbjct: 236 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 295
Query: 380 DDLEADT 386
D +T
Sbjct: 296 KDFPLET 302
>gi|320581937|gb|EFW96156.1| cysteine protease ATG4, putative [Ogataea parapolymorpha DL-1]
Length = 509
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/295 (33%), Positives = 136/295 (46%), Gaps = 56/295 (18%)
Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 163
L + HK QD+A A + EF D SRI ++YR GF I ++
Sbjct: 51 LRTLFHKFKQDQAAETEA--SWPREFLGDVHSRIWLTYRSGFPLIRRAEDGPSPLSFGSL 108
Query: 164 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
T+D GWGCM+R+SQ L+A LL RLGR WR + + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANGLLQLRLGRGWRYDQTRECAK-HAEIV 167
Query: 211 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
F D T+PFSIHN ++ G G G W GP A RS + L + GL
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKIGLKV---- 223
Query: 270 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 325
+ SGD ED +F Q A+ P+L+L + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQEGAELRPVLILAGIRLGVKNVN 263
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
P Y L+ T ++PQS+GI GG+P +S Y G Q + YLDPH Q + I +
Sbjct: 264 PLYWDFLKKTLSWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASE 318
>gi|348586836|ref|XP_003479174.1| PREDICTED: cysteine protease ATG4C-like [Cavia porcellus]
Length = 435
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 101/356 (28%), Positives = 156/356 (43%), Gaps = 77/356 (21%)
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQ 142
+H R + ++T S + S + LLG C+ +DE A+ D + EF +
Sbjct: 1 MHTRWVLKTKTYFSRN-SPVLLLGKCYHFKYEDEHKMLTARSGCAIEDRVIAGNVDEFRK 59
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--------- 193
DF SRI ++YR+ F PI S +++D GWGC LR+ QML+AQ L+ H LGR
Sbjct: 60 DFISRIWLTYREEFPPIEGSALSTDCGWGCTLRTGQMLLAQGLVLHFLGRAWIWPDALNI 119
Query: 194 -------WRKPLQKPF----------DRE--------------------------YVEIL 210
W K F +R+ + +I+
Sbjct: 120 ENLDSESWTSHTVKKFAASFEASLSGERQLGTPALSLKETMEKYPNPHEVRDEVYHRKII 179
Query: 211 HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
FGDS ++ F +H L++ G+ G AG W GP + R G +
Sbjct: 180 SWFGDSPSALFGLHQLIECGRRSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----I 234
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
+YV +D + V+ ASR G AD +++LVP+ LG E+ N Y+
Sbjct: 235 TVYVA---QDCTVYNSDVIDKQSASR-----PAGNADDKAVIILVPVRLGGERTNTDYLE 286
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
++ + +GI+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 287 FVKGVLSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 342
>gi|37748391|gb|AAH58981.1| Autophagy-related 4C (yeast) [Mus musculus]
Length = 458
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 98/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SR+ ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRLWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|387015378|gb|AFJ49808.1| Cysteine protease ATG4C-like [Crotalus adamanteus]
Length = 457
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/337 (28%), Positives = 142/337 (42%), Gaps = 75/337 (22%)
Query: 108 SSTSDIWLLGVCHKIAQDEA-----------LGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S ++LLG C+ DE + D + + + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDEPSDQSPNGSCDDMTDESFSRNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQITGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWANAFVFENPESESWTSQTVK 155
Query: 195 -----------------------RKPLQKPFDREYVE------ILHLFGDSETSPFSIHN 225
+ P++ E VE I+ F DS + F +H
Sbjct: 156 KLTASLETSLIGEREFRSQSTHPKSPIRNQETEESVEEQYHRRIISWFADSPFANFGLHR 215
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
L++ GK G AG W GP + L R + E + + IYV +
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAH----LLR-KAVEKARDPELQGITIYVAQDCTVYKSDV 270
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
+C S SV S I++L+P+ LG E+ N Y ++ + +GI+
Sbjct: 271 IDALCPFTDSEKTSVKS--------IIILIPVRLGGERTNMEYFEFVKGILSLDYCIGII 322
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
GGKP S Y G Q++S IY+DPH Q +++ D
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSVKDF 359
>gi|325091702|gb|EGC45012.1| cysteine protease [Ajellomyces capsulatus H88]
Length = 508
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 139/306 (45%), Gaps = 53/306 (17%)
Query: 101 PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 156
P+R+ S++ LL H+ + LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144
Query: 157 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
DP + T+D GWGCM+RS Q L+A AL LGR WR+
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRR 204
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 255
+ +E ++L LF D +PFSIH ++ G A G G W GP A R +AL+
Sbjct: 205 GTKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATARCIQALSS 261
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG------QADWT 309
C+ + +YV S D +D R ++ S G D
Sbjct: 262 --------ECEHAGLNVYVTSDGSD---------VYEDRFR--AIASAGGTGAGTSTDVH 302
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q YLDPH
Sbjct: 303 PTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFYLDPH 362
Query: 370 DVQPVI 375
+P +
Sbjct: 363 HTRPAL 368
>gi|425778592|gb|EKV16710.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
digitatum PHI26]
Length = 401
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/332 (30%), Positives = 141/332 (42%), Gaps = 66/332 (19%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 139
+RI + P T + S IW LG + A + D A NN +
Sbjct: 9 KRIVQYFWDPEPTNNVPAAS-IWCLG--KEYAPPQPFSDPATNNPHSSSGQPDASTLNDT 65
Query: 140 -----FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWG 171
F DF SRI I+YR F PI +K TSD GWG
Sbjct: 66 AWPNAFVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGFTSDTGWG 125
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 230
CM+RS Q L+A A LGR WR+ + + E +++ +F D +PFSIH + G
Sbjct: 126 CMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIHKFVNRGA 182
Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
++ G G W GP A + + L+ A + +YV + D
Sbjct: 183 ESCGKYPGEWFGPSATAKCIQLLSTQSEAHR--------LRVYVTNDTSD---------V 225
Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
+D H S G P L+L+ LG+E V P Y LR T+PQS+GI GG+P
Sbjct: 226 YEDKFAHVSHDRSGCIQ--PTLILIGTRLGIENVTPAYWDGLRAALTYPQSVGIAGGRPS 283
Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
AS Y +G Q+ +LDPH +P D+L
Sbjct: 284 ASHYFLGAQDCHLFFLDPHTTRPATPYRPDEL 315
>gi|166990663|sp|Q2HH40.2|ATG4_CHAGB RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
Length = 448
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/270 (34%), Positives = 128/270 (47%), Gaps = 53/270 (19%)
Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
F DF SRI ++YR GF+PI GD + +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 235
Q L+A ALL +LGR WR+ +R I+ LF D +P+S+ N ++ G A G
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAERN---IVALFADDARAPYSLQNFVKHGAIACGK 229
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA + + IY G P V D
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269
Query: 296 RHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
S + + D + P L+LV LG++K+NP Y L T QS+GI GG+P +S
Sbjct: 270 ---SFLATARPDGETFHPTLILVCTRLGIDKINPVYEEALISTLQMEQSIGIAGGRPSSS 326
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
Y VGVQ + YLDPH +P + ++ L
Sbjct: 327 HYFVGVQRQWLFYLDPHHPRPALQYRENPL 356
>gi|212545090|ref|XP_002152699.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
marneffei ATCC 18224]
gi|210065668|gb|EEA19762.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
marneffei ATCC 18224]
Length = 489
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 87/261 (33%), Positives = 117/261 (44%), Gaps = 47/261 (18%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 175
F DF S+I ++YR F PI S+ TSD GWGCM+R
Sbjct: 153 FLDDFESKIWMTYRSNFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 212
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 234
S QML+A AL RLGR WR+ E ++L LF D +PFSIH ++ G Y G
Sbjct: 213 SGQMLLANALAISRLGRDWRRVSHT---TEENKLLSLFADDPAAPFSIHRFVRHGALYCG 269
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A +AL+ + M +YV S +D
Sbjct: 270 KHPGEWFGPSATATCIQALSEEYKVAG--------MNVYVSSDS---------TYVYEDK 312
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ + G P L+L+ LG++++ P Y L PQSLGI GG+P +S Y
Sbjct: 313 FKAVAYNQPGHM--RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQSLGIAGGRPSSSHY 370
Query: 355 IVGVQEESAIYLDPHDVQPVI 375
+GVQ YLDPH +P +
Sbjct: 371 FIGVQNSFFFYLDPHHTRPAL 391
>gi|367047453|ref|XP_003654106.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
gi|347001369|gb|AEO67770.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
Length = 454
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 95/260 (36%), Positives = 124/260 (47%), Gaps = 47/260 (18%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
F DF SRI ++YR GF DP +S ++ SD GWGCM+RS
Sbjct: 118 FLDDFESRIWMTYRTGFELIPRSTDPRANSALSFAMRLKTSFGDQTGFSSDTGWGCMIRS 177
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A AL RLGR WR+ +RE IL LF D +P+S+HN ++ G A G
Sbjct: 178 GQSLLANALQISRLGRDWRRATDPDAERE---ILSLFADDPRAPYSLHNFVKHGAAACGK 234
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R EALA + E+ L S G P V D
Sbjct: 235 YPGEWFGPSATARCIEALA--NQHESSLRVYST---------------GDLPDVYEDS-- 275
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+V + + P L+LV LG++K+N Y L T QS+GI GG+P +S Y
Sbjct: 276 -FMAVANPDGEHFHPTLILVCTRLGIDKINQVYEEALISTLQMEQSIGIAGGRPSSSHYF 334
Query: 356 VGVQEESAIYLDPHDVQPVI 375
VGVQ + YLDPH +P +
Sbjct: 335 VGVQGQWLFYLDPHHPRPAL 354
>gi|242814606|ref|XP_002486401.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
stipitatus ATCC 10500]
gi|218714740|gb|EED14163.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
stipitatus ATCC 10500]
Length = 454
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 90/268 (33%), Positives = 119/268 (44%), Gaps = 47/268 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 175
F DF RI ++YR GF PI S+ TSD GWGCM+R
Sbjct: 117 FLDDFECRIWMTYRSGFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 176
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 234
S Q L+A AL RLGR WR+ E +L LF D +PFSIH ++ G Y G
Sbjct: 177 SGQSLLANALAISRLGRDWRRGSNS---TEENRLLSLFADDPAAPFSIHKFVRHGALYCG 233
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A +AL+ + + G M +YV S + V + +
Sbjct: 234 KHPGEWFGPSATATCIQALSD-EYKDAG-------MNVYVSSDNTYVYEDKFKAVAYNQS 285
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
R P L+L+ LG++++ P Y L PQ+LGI GG+P AS Y
Sbjct: 286 DRM-----------RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQALGIAGGRPSASHY 334
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDL 382
+GVQ YLDPH +P + DL
Sbjct: 335 FIGVQNSFFFYLDPHHTRPALPYKTGDL 362
>gi|116283594|gb|AAH18678.1| ATG4C protein [Homo sapiens]
Length = 451
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGSVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGEERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|30410844|ref|NP_116241.2| cysteine protease ATG4C [Homo sapiens]
gi|30410846|ref|NP_835739.1| cysteine protease ATG4C [Homo sapiens]
gi|114556947|ref|XP_001159883.1| PREDICTED: cysteine protease ATG4C isoform 4 [Pan troglodytes]
gi|114556951|ref|XP_001159976.1| PREDICTED: cysteine protease ATG4C isoform 6 [Pan troglodytes]
gi|61211867|sp|Q96DT6.1|ATG4C_HUMAN RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
cysteine endopeptidase; AltName: Full=Autophagin-3;
AltName: Full=Autophagy-related cysteine endopeptidase
3; AltName: Full=Autophagy-related protein 4 homolog C
gi|14625875|emb|CAC43939.1| putative autophagy-related cysteine endopeptidase [Homo sapiens]
gi|21542522|gb|AAH33024.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [Homo sapiens]
gi|27763973|emb|CAC85556.1| Apg4-C protein [Homo sapiens]
gi|119626984|gb|EAX06579.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|119626985|gb|EAX06580.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|123983334|gb|ABM83408.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
construct]
gi|123998035|gb|ABM86619.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
construct]
gi|410220598|gb|JAA07518.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410220600|gb|JAA07519.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410267918|gb|JAA21925.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410291226|gb|JAA24213.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410291228|gb|JAA24214.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410335203|gb|JAA36548.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410335205|gb|JAA36549.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
Length = 458
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|74147895|dbj|BAE22307.1| unnamed protein product [Mus musculus]
Length = 458
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 98/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F DRE + +I+ FG+S + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGNSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|397475554|ref|XP_003809200.1| PREDICTED: cysteine protease ATG4C isoform 1 [Pan paniscus]
gi|397475556|ref|XP_003809201.1| PREDICTED: cysteine protease ATG4C isoform 2 [Pan paniscus]
Length = 458
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|440638438|gb|ELR08357.1| hypothetical protein GMDG_03152 [Geomyces destructans 20631-21]
Length = 448
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/265 (33%), Positives = 124/265 (46%), Gaps = 46/265 (17%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
F DF S++ SYR GF DP S ++ SD GWGCM+RS
Sbjct: 108 FLDDFESKLRFSYRTGFPVIPRSEDPKASSTMSFSVRLRSQLSDQGGFSSDTGWGCMIRS 167
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A +++ RL R WR+ + + +RE I+ LF D +P+SIH ++ G +A G
Sbjct: 168 GQSLLANSMVILRLSRGWRRGVGRDKERE---IVSLFADDPRAPYSIHKFVEHGAEACGK 224
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R + LA+ +S + +Y+ D + G
Sbjct: 225 YPGQWFGPSATARCIQELAKRH--------ESADVRVYITGDGSDVYKDG---------- 266
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
SV ++ P L+LV LG++KV P Y L+ + PQS+GI GG+P +S Y
Sbjct: 267 -FMSVAKPDGVNFKPTLILVGTRLGIDKVTPVYWEALKASLQMPQSVGIAGGRPSSSHYF 325
Query: 356 VGVQEESAIYLDPHDVQPVINIGKD 380
VGVQ YLDPH I D
Sbjct: 326 VGVQGSHFFYLDPHQTMAAIPFHTD 350
>gi|442757637|gb|JAA70977.1| Putative cysteine protease required for autophagy [Ixodes ricinus]
Length = 458
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 140/342 (40%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C H +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------------------ 198
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPYALSIENSDSESRTSHTVK 155
Query: 199 ----------------------------QKPFDRE------YVEILHLFGDSETSPFSIH 224
+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEAPLSGARELKSPTVSLKETIGRYPDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQRASMASDNTDDKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|326478657|gb|EGE02667.1| cysteine protease atg4 [Trichophyton equinum CBS 127.97]
Length = 454
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/268 (32%), Positives = 126/268 (47%), Gaps = 53/268 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 170
+F DF S++ I+YR F PI + TSD GW
Sbjct: 115 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 174
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
GCM+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 175 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 231
Query: 231 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 288
A G G W GP A + +AL + + GL +Y+ S G + E+ V
Sbjct: 232 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 283
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
C + P L+L+ + LG+++V P Y +L+ FPQS+GI GG+
Sbjct: 284 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 331
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVIN 376
P +S Y + Q +S YLDPH +P +
Sbjct: 332 PSSSHYFIATQGDSFFYLDPHQTRPCLT 359
>gi|326470473|gb|EGD94482.1| hypothetical protein TESG_01998 [Trichophyton tonsurans CBS 112818]
Length = 469
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/268 (32%), Positives = 126/268 (47%), Gaps = 53/268 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 170
+F DF S++ I+YR F PI + TSD GW
Sbjct: 130 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 189
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
GCM+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 190 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 246
Query: 231 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 288
A G G W GP A + +AL + + GL +Y+ S G + E+ V
Sbjct: 247 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 298
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
C + P L+L+ + LG+++V P Y +L+ FPQS+GI GG+
Sbjct: 299 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 346
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVIN 376
P +S Y + Q +S YLDPH +P +
Sbjct: 347 PSSSHYFIATQGDSFFYLDPHQTRPCLT 374
>gi|344278625|ref|XP_003411094.1| PREDICTED: cysteine protease ATG4C [Loxodonta africana]
Length = 458
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPAISSCAIEDCVISGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 224
F D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGESELKTPSISLKKTIGKYSDDHEMRNEIYHRKIVSWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKAGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQCASMASDNPDNKAVIILVPVRLGGERTNVDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|238506146|ref|XP_002384275.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
NRRL3357]
gi|220690389|gb|EED46739.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
NRRL3357]
Length = 439
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 145/344 (42%), Gaps = 66/344 (19%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 134
+RI + + P + IW LGV + KI QDE + D +
Sbjct: 47 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 106
Query: 135 NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 171
F DF S+I ++YR F PI TSD GWG
Sbjct: 107 GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 166
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 230
CM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH ++ G
Sbjct: 167 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 223
Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
++ G G W GP A R EAL+ C ++ +YV + D V
Sbjct: 224 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 267
Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
D R V G P L+L+ LG++ V P Y L+ PQS+GI GG+P
Sbjct: 268 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 324
Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHS 391
AS Y +G Q YLDPH +P + D + + STYH+
Sbjct: 325 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHT 368
>gi|391335597|ref|XP_003742176.1| PREDICTED: cysteine protease ATG4B-like [Metaseiulus occidentalis]
Length = 393
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 88/256 (34%), Positives = 132/256 (51%), Gaps = 33/256 (12%)
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW------RKP 197
FSS + +YRK F IG TSD GWGCMLR+ QM++ QAL+ LGR W R P
Sbjct: 81 FSSMLWFTYRKNFAAIGGDGPTSDTGWGCMLRAGQMMLGQALIRKHLGRSWMWTSDDRLP 140
Query: 198 LQKPFDRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
DRE Y+ IL +F D +++ FSIH + G + G A G W GP + ++ + L +
Sbjct: 141 -----DRENYLRILRMFQDKKSATFSIHQISLMGLSEGKAVGEWFGPNTVAQALKKLVQY 195
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
M ++V + ++ + D C +K W P+LL+VP
Sbjct: 196 DHWS--------EMKLHVAMDN---------IIILSDIKSLCC--AKESNKWRPLLLVVP 236
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
L LGL ++N Y + +F SLGI+GG+P + Y +G+Q E ++LDPH ++
Sbjct: 237 LRLGLSEINDIYTNAVLNSFKMKHSLGIIGGRPSHALYFIGIQREELVFLDPHTTHNYVD 296
Query: 377 IGKDDLEADTSTYHSE 392
+ D+ + STYH +
Sbjct: 297 L--DEEPYNDSTYHCQ 310
>gi|147905876|ref|NP_001088249.1| cysteine protease ATG4C [Xenopus laevis]
gi|61211751|sp|Q5XH30.1|ATG4C_XENLA RecName: Full=Cysteine protease ATG4C; AltName:
Full=Autophagy-related protein 4 homolog C
gi|54038152|gb|AAH84245.1| LOC495080 protein [Xenopus laevis]
Length = 450
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 96/334 (28%), Positives = 140/334 (41%), Gaps = 91/334 (27%)
Query: 110 TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 157
S ++LLG C+ +++ D N+G + EF +DF SRI ++YRK F
Sbjct: 38 NSPVFLLGKCYHFKYEDSGVTADDCSNSGSDSKEDLSGNVDEFRKDFISRIWLTYRKEFP 97
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 194
I S T+D GWGC LR+ QML+AQ LL H LGR W
Sbjct: 98 QIESSSWTTDCGWGCTLRTGQMLLAQGLLVHFLGRDWTWTEALDIFCSESDFWTANTARK 157
Query: 195 -------------------RKPLQKPFDREYVEILH-----LFGDSETSPFSIHNLLQAG 230
++PLQ + Y E LH F D + F +H L++ G
Sbjct: 158 LDPSLEKSSPENEEYVSLGKQPLQNSEKKRYSEDLHRKIISWFADYPLAYFGLHQLVKLG 217
Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
K G AG W GP + L R E+ D E G +
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256
Query: 291 IDDASRHCSVFSKGQADW-------TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
D C++++ D +++LVP+ LG E+ N Y ++ + +G
Sbjct: 257 AQD----CTIYNADVYDLQCNKGNEKAVVILVPVRLGGERTNMEYFEYVKGILSLEFCIG 312
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
I+GGKP S Y VG Q++S IY+DPH Q +++
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDV 346
>gi|336381646|gb|EGO22797.1| cysteine protease required for autophagy [Serpula lacrymans var.
lacrymans S7.9]
Length = 992
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 98/284 (34%), Positives = 134/284 (47%), Gaps = 49/284 (17%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 164
G+N F DF+SRI ++YR F PI DS +
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350
Query: 165 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGD 215
TSD GWGCMLR+ Q L+A ALL LGR WR+P +Y V+I+ F D
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPVHTTDYATYVQIITWFFD 410
Query: 216 --SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 273
S SPFS+H + AGK G G W GP + + L E GLG +
Sbjct: 411 TPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGVI 469
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
S + A I RH V G+A +++L+ + LGL+ VNP Y T++
Sbjct: 470 FQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTIK 520
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
+TFPQS+GI GG+P +S Y +G Q ++ YLDPH +P + +
Sbjct: 521 ALYTFPQSVGIAGGRPSSSYYFMGSQADNLFYLDPHHARPAVPL 564
>gi|312073335|ref|XP_003139474.1| hypothetical protein LOAG_03889 [Loa loa]
gi|307765357|gb|EFO24591.1| hypothetical protein LOAG_03889 [Loa loa]
Length = 458
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/349 (29%), Positives = 150/349 (42%), Gaps = 66/349 (18%)
Query: 93 RIHE---RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR-- 147
R+HE R+ + +S L +A++ AL D+ N + + F+SR
Sbjct: 11 RVHEEAKRLFADWKPAVSKMLETYLTLDPSFSVAENYALFDS--NLPIYLLGEKFTSRRD 68
Query: 148 -----------ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
+ +YRK F PIG T+D GWGCMLR QML+A+ L+ LGR W
Sbjct: 69 MERIKDIMASLLWFTYRKNFQPIGGIGPTTDQGWGCMLRCGQMLLARVLIVRHLGRNWL- 127
Query: 197 PLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR--- 248
+DR EY IL +F D + S FSIH + G + G G W GP +
Sbjct: 128 -----WDRDIKLAEYKRILRMFQDKKNSLFSIHQIAHMGVSEGKNIGEWFGPNTTAQVLK 182
Query: 249 ------SWEALARCQRAETGLGCQSL-PMAI----YVVSG----------DEDGERGGAP 287
W LA + L + MA Y SG D A
Sbjct: 183 KLVIYDQWSRLAVHVALDNVLITSDIRTMAFTRPPYRKSGSRRETGSDYNDNHDAVNPAE 242
Query: 288 VVCIDDASR--------HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
+++R S + +W P+L+++PL LGL +N Y P ++ F P
Sbjct: 243 AEIFPESTRSPTRSETSSISSYGGNSEEWRPLLIIIPLRLGLSTINRCYFPAIQAFFQLP 302
Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
Q +GI+GG+P + Y G+ + + +YLDPH Q + DL+ T+T
Sbjct: 303 QCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQDFV-----DLDETTAT 346
>gi|402080175|gb|EJT75320.1| cysteine protease ATG4 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 468
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 93/271 (34%), Positives = 121/271 (44%), Gaps = 53/271 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF SRI +SYR GF PI S T+D GWGCM+R+
Sbjct: 128 FLDDFESRIWVSYRSGFPPIPRSTDPAATSRMSFAMRLKTMTDQQAAFTTDSGWGCMIRT 187
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A LL HRLGR WR+ + +R+ +L LF D +P+SIH ++ G A G
Sbjct: 188 GQSLLANTLLSHRLGRGWRRGEKSDEERK---LLSLFADDPRAPYSIHKFVEHGAAKCGK 244
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R EALA + +Y G P V D
Sbjct: 245 YPGEWFGPSATARCIEALANTNEKT---------LRVYST--------GDLPDVYEDS-- 285
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
V + P L+LV LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 286 -FMEVARPDGKTFHPTLILVSTRLGIDKINQVYWESLTATLQMPQSVGIAGGRPSSSHYF 344
Query: 356 VGVQE------ESAIYLDPHDVQPVINIGKD 380
VG Q + YLDPH +P + D
Sbjct: 345 VGAQRSDEDQGSNLFYLDPHHTRPALPYFDD 375
>gi|409050837|gb|EKM60313.1| hypothetical protein PHACADRAFT_179659 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1009
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 132/290 (45%), Gaps = 61/290 (21%)
Query: 140 FNQDFSSRILISYRKGFDPI-------------------------------GDSKITSDV 168
F DF+SRI ++YR F PI GD +SD
Sbjct: 308 FYADFTSRIWLTYRSQFLPIRDMSLEELNAAPESAALSTGSQAKKWSWSLSGDKCWSSDA 367
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFSI 223
GWGCMLR+ Q L+A AL+ LGR WRKP +Y ++I+ F D + PFS+
Sbjct: 368 GWGCMLRTGQSLLANALIHVHLGRDWRKPPHPVPTSDYATYIQIITWFFDDPSLLCPFSV 427
Query: 224 HNLLQAGKAYGLAAGSWVGP--------YAMCRSWEALARCQRAETGLGCQSLPMA---I 272
H + GK G+ G W GP Y S ++ Q A L + P A I
Sbjct: 428 HRMALVGKQLGVKVGQWFGPSTAAGAIKYVSAHS--SMVPNQPARRTL-VHAFPEAGLGI 484
Query: 273 YVVSGD---EDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPR 327
YV + D E A I RH W P+L+L+ LG++ VNP
Sbjct: 485 YVAADGGTIYDSEVFAASHSGIGSPRRHTRRV------WGDRPVLILIGHRLGIDGVNPI 538
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
Y TL+ +T+PQS+GI GG+P +S Y VG Q ++ YLDPH +P I +
Sbjct: 539 YYDTLKTLYTWPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPTIPL 588
>gi|317151014|ref|XP_001824388.2| cysteine protease atg4 [Aspergillus oryzae RIB40]
Length = 402
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 145/344 (42%), Gaps = 66/344 (19%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 134
+RI + + P + IW LGV + KI QDE + D +
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 70
Query: 135 NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 171
F DF S+I ++YR F PI TSD GWG
Sbjct: 71 GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 130
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 230
CM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH ++ G
Sbjct: 131 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 187
Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
++ G G W GP A R EAL+ C ++ +YV + D V
Sbjct: 188 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 231
Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
D R V G P L+L+ LG++ V P Y L+ PQS+GI GG+P
Sbjct: 232 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 288
Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHS 391
AS Y +G Q YLDPH +P + D + + STYH+
Sbjct: 289 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHT 332
>gi|403257906|ref|XP_003921531.1| PREDICTED: cysteine protease ATG4C [Saimiri boliviensis
boliviensis]
Length = 458
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEMYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|189091768|ref|XP_001929717.1| hypothetical protein [Podospora anserina S mat+]
gi|188219237|emb|CAP49217.1| unnamed protein product [Podospora anserina S mat+]
Length = 508
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 88/260 (33%), Positives = 123/260 (47%), Gaps = 47/260 (18%)
Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
F DF SRI ++YR GF+ I GD + +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A A+L R GR WR+ +R EI+ LF D +P+SI N + G A G
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIER---EIVCLFADDPRAPYSIQNFVNHGAAACGK 289
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA+ + + +Y+ P V D+
Sbjct: 290 YPGEWFGPSATARCIQALAKKHDSS---------LRVYLTR--------DLPEVYEDN-- 330
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
S + + P L+LV LG++K+NP Y L T PQ++GI GG+P +S Y
Sbjct: 331 -FMSTANPDGNHFHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSSHYF 389
Query: 356 VGVQEESAIYLDPHDVQPVI 375
+G Q + YLDPH +P +
Sbjct: 390 IGAQGQWLFYLDPHHPRPAL 409
>gi|297664749|ref|XP_002810790.1| PREDICTED: cysteine protease ATG4C [Pongo abelii]
Length = 458
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|327264155|ref|XP_003216881.1| PREDICTED: cysteine protease ATG4D-like [Anolis carolinensis]
Length = 585
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 85/296 (28%), Positives = 127/296 (42%), Gaps = 64/296 (21%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----- 194
F +DF+SRI ++YR+ F + + T+D GWGCMLRS QML+AQ L+ H LG+ W
Sbjct: 198 FQKDFASRIWLTYRRDFQQLEGTMWTTDCGWGCMLRSGQMLLAQGLIVHFLGKDWTWPDA 257
Query: 195 ------------------------------------------------RKPLQKPFDREY 206
R P + +R +
Sbjct: 258 LHTPGLVEMEPMKATHLPYPSTSSSHQGPSIPTDRSRGPWELRAPRHTRSPDELEKERYH 317
Query: 207 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
+I+ F D + F IH L+ G + G AG W GP C C
Sbjct: 318 RKIISWFADRPQAHFGIHRLVSLGHSSGKKAGDWYGPSVAAHIIRKAVDC--------CS 369
Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 326
+ VS D +G V + + S + + G A W +++LVP+ LG E NP
Sbjct: 370 EAGNLVVYVSQDCTVYKGD--VANLANKSEDRTAWDPG-AVWKAVIILVPMRLGGEAFNP 426
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
Y+ ++ +GI+GGKP S Y VG Q+++ +YLDPH QP ++ K++
Sbjct: 427 AYVDCVKELLKLEFCIGIIGGKPRHSLYFVGYQDDALLYLDPHYCQPFVDTTKENF 482
>gi|380092671|emb|CCC09424.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 515
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 93/265 (35%), Positives = 125/265 (47%), Gaps = 47/265 (17%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 176
F DF SRI ++YR F DP + +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A A+L RLGR WR+ + D E +I+ LF D +PFS+HN ++ G A G
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYGATACGK 296
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +AL E+GL S G P V D
Sbjct: 297 YPGEWFGPLATARCIQALT--DEKESGLRVYST---------------GDLPDVYEDSFM 339
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+ +G + P L+LV LG++K+N Y L T PQS+GI GG+P +S Y
Sbjct: 340 AVANPDGRG---FQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 396
Query: 356 VGVQEESAIYLDPHDVQPVINIGKD 380
+GVQ + YLDPH +P + +D
Sbjct: 397 IGVQGQRLFYLDPHHPRPALPYRED 421
>gi|83773128|dbj|BAE63255.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|325504923|dbj|BAJ83603.1| cysteine protease Atg4 [Aspergillus oryzae]
Length = 356
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 97/304 (31%), Positives = 135/304 (44%), Gaps = 32/304 (10%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 151
+RI + + P + IW LGV + + + +N A + RI
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70
Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 211
DP G TSD GWGCM+RS Q L+A A+L LGR WR+ + E +L
Sbjct: 71 L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121
Query: 212 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
LF D +P SIH ++ G ++ G G W GP A R EAL+ C ++
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
+YV + D V D R V G P L+L+ LG++ V P Y
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 387
L+ PQS+GI GG+P AS Y +G Q YLDPH +P + D + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282
Query: 388 TYHS 391
TYH+
Sbjct: 283 TYHT 286
>gi|213513159|ref|NP_001133247.1| cysteine protease ATG4B [Salmo salar]
gi|209147572|gb|ACI32896.1| Cysteine protease ATG4B [Salmo salar]
gi|223647372|gb|ACN10444.1| Cysteine protease ATG4B [Salmo salar]
Length = 397
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 136/263 (51%), Gaps = 19/263 (7%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR WR +
Sbjct: 47 TSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVRRHLGRDWRWVRSQSQRE 106
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEALAR 255
+Y+ IL+ F D + +S+H + Q G G + G W GP A+ SW L
Sbjct: 107 DYISILNAFLDKKDGYYSLHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRLTV 166
Query: 256 CQRAETGLGCQS-----LPMAIYVVSGDEDGERG-GAPVVCIDDASRHCSVFSKGQADWT 309
+ + + +P Y + D + G P C++ A C++ + A W
Sbjct: 167 HVAMDNTVVIEEIKRLCMPWLDYGGAACVDLQGGMPEPNGCLEGA---CALAEEETALWK 223
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+LLL+PL LGL +N YI TL+ F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 224 PLLLLIPLRLGLSDINEAYIETLKQCFQLPQSLGVIGGKPNHAHYFIGYVGEELIYLDPH 283
Query: 370 DVQPVINIGKDDLEADTSTYHSE 392
QP + +D D TYH +
Sbjct: 284 TTQPAVEPCEDSQVPD-DTYHCQ 305
>gi|407917424|gb|EKG10733.1| Peptidase C54 [Macrophomina phaseolina MS6]
Length = 437
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 94/288 (32%), Positives = 129/288 (44%), Gaps = 52/288 (18%)
Query: 130 DAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSK-----------------------IT 165
D+ N G + F DF +R+ I+YR F I S+ +
Sbjct: 94 DSDANGGWPSPFLDDFEARVWITYRSNFAAIPKSQDPNATTAMSFSVRFRNQISNQGGFS 153
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
SD GWGCM+RS Q L+A AL RLGR WR+ +R IL LF D +PFSIH
Sbjct: 154 SDTGWGCMIRSGQSLLANALQVLRLGRAWRRGQDSQGERR---ILSLFADDPKAPFSIHR 210
Query: 226 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
++ G A G G W GP A R +AL+ G + + +Y+ D
Sbjct: 211 FVEHGAVACGKHPGEWFGPSATARCIQALSN--------GYEDAGLRVYITGDGSD---- 258
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
+D+ V + P L+LV + LG+++V P Y L+ + QS+GI
Sbjct: 259 -----VYEDS--FMKVAKDANNTFHPTLVLVGIRLGIDRVTPVYWEALKASLQLSQSIGI 311
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
GG+P AS Y VG Q YLDPH +P + L +D S Y E
Sbjct: 312 AGGRPSASHYFVGTQGSYFFYLDPHTTRPFLP-----LHSDLSDYTQE 354
>gi|85067704|ref|XP_959438.1| hypothetical protein NCU02433 [Neurospora crassa OR74A]
gi|62899773|sp|Q7S3X7.1|ATG4_NEUCR RecName: Full=Probable cysteine protease atg-4; AltName:
Full=Autophagy-related protein 4
gi|28920860|gb|EAA30202.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 506
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 126/265 (47%), Gaps = 47/265 (17%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
F DF SRI ++YR F DP S ++ SD GWGCM+RS
Sbjct: 171 FLDDFESRIWMTYRTDFALIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 230
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A A+L RLGR WR+ D E +I+ LF D +P+S+HN ++ G A G
Sbjct: 231 GQSLLANAILIARLGREWRRGTD--LDAEK-DIIALFADDPRAPYSLHNFVKYGATACGK 287
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA ++GL S G P V D
Sbjct: 288 YPGEWFGPSATARCIQALA--DEKQSGLRVYST---------------GDLPDVYEDSFM 330
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+ +G + P L+LV LG++K+N Y L T PQS+GI GG+P +S Y
Sbjct: 331 AVANPDGRG---FQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 387
Query: 356 VGVQEESAIYLDPHDVQPVINIGKD 380
VGVQ + YLDPH +P + +D
Sbjct: 388 VGVQGQRLFYLDPHHPRPALPYRED 412
>gi|268570274|ref|XP_002640735.1| Hypothetical protein CBG19805 [Caenorhabditis briggsae]
Length = 481
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 139/303 (45%), Gaps = 47/303 (15%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
IS T IW LG + + +G+ + +SR +YR+ F PIG + +
Sbjct: 25 ISIDTFPIWALG-----------KEISKEDGIDAMKKYMTSRFWFTYRRNFSPIGGTGPS 73
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D WGCMLR +QML+ + LL +GR + ++K D Y +IL +F D + + +SIH
Sbjct: 74 TDQYWGCMLRCAQMLLGEVLLRRHIGRHFEWDIEKTSDV-YEKILQMFFDEKDALYSIHQ 132
Query: 226 LLQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ-SLPMAIYVV 275
+ Q G + G W GP + W +A + L Q +L MA
Sbjct: 133 IAQMGVSEGKEVSEWFGPNTAAQVIKKLTIFDDWSNIAVHVALDNILVKQDALTMATTYP 192
Query: 276 SGDE----DGERG-------GAPVVCID-DASRHCSVFSKGQA-------------DWTP 310
S D GE G + ++C++ D + F G +W P
Sbjct: 193 SEDAVKLIMGEFGFKSDRISSSHIICMNLDYFKKLLNFENGLVEKHYTSTVPANGTEWRP 252
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
+LL++PL LGL +N Y+ ++ F PQ +GI+GGKP + Y VG+ YLDPH
Sbjct: 253 LLLMIPLRLGLTSINSCYLSAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHH 312
Query: 371 VQP 373
+P
Sbjct: 313 CRP 315
>gi|336467357|gb|EGO55521.1| hypothetical protein NEUTE1DRAFT_85886 [Neurospora tetrasperma FGSC
2508]
gi|350288001|gb|EGZ69237.1| hypothetical protein NEUTE2DRAFT_94213 [Neurospora tetrasperma FGSC
2509]
Length = 506
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 126/265 (47%), Gaps = 47/265 (17%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
F DF SRI ++YR F DP S ++ SD GWGCM+RS
Sbjct: 171 FLDDFESRIWMTYRTDFAFIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 230
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A A+L RLGR WR+ D E +I+ LF D +P+S+HN ++ G A G
Sbjct: 231 GQSLLANAILIARLGREWRRGTD--LDAEK-DIIALFADDPRAPYSLHNFVKYGATACGK 287
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA ++GL S G P V D
Sbjct: 288 YPGEWFGPSATARCIQALA--DEKQSGLRVYST---------------GDLPDVYEDSFM 330
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+ +G + P L+LV LG++K+N Y L T PQS+GI GG+P +S Y
Sbjct: 331 AVANPDGRG---FQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 387
Query: 356 VGVQEESAIYLDPHDVQPVINIGKD 380
VGVQ + YLDPH +P + +D
Sbjct: 388 VGVQGQRLFYLDPHHPRPALPYRED 412
>gi|391868733|gb|EIT77943.1| cysteine protease required for autophagy - Apg4p/Aut2p [Aspergillus
oryzae 3.042]
Length = 357
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 97/304 (31%), Positives = 135/304 (44%), Gaps = 32/304 (10%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 151
+RI + + P + IW LGV + + + +N A + RI
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70
Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 211
DP G TSD GWGCM+RS Q L+A A+L LGR WR+ + E +L
Sbjct: 71 L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121
Query: 212 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
LF D +P SIH ++ G ++ G G W GP A R EAL+ C ++
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
+YV + D V D R V G P L+L+ LG++ V P Y
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 387
L+ PQS+GI GG+P AS Y +G Q YLDPH +P + D + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPYFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282
Query: 388 TYHS 391
TYH+
Sbjct: 283 TYHT 286
>gi|380485578|emb|CCF39271.1| cysteine protease atg4 [Colletotrichum higginsianum]
Length = 454
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 92/279 (32%), Positives = 123/279 (44%), Gaps = 49/279 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF S+ ++YR F I S TSD GWGCM+RS
Sbjct: 118 FLDDFESKFWMTYRSEFQAIAKSTDPRASSTLSFSMRIKSQLVDQNGFTSDSGWGCMIRS 177
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 235
Q L+A A+ LGR WR+ Q P D ++L F D +P+SIH +Q G A G
Sbjct: 178 GQSLLANAMAAINLGRDWRR-GQNPEDER--KLLSWFADDPRAPYSIHQFVQHGAVACGK 234
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA Q + P+ +Y G P V D
Sbjct: 235 YPGEWFGPSATARCIQALANAQEQQ--------PLRVYST--------GDGPDVYED--- 275
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+ + + + P L+LV LG++K+ P Y L PQS+GI GG+P +S Y
Sbjct: 276 KFMEIAKPDGSRFNPTLILVGTRLGIDKITPVYWEALIAALQMPQSVGIAGGRPASSHYF 335
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHS 391
+G Q YLDPH +P + D EAD T H+
Sbjct: 336 IGAQGSYLFYLDPHHTRPALPFHTDPSHYSEADVDTVHT 374
>gi|291398772|ref|XP_002715996.1| PREDICTED: APG4 autophagy 4 homolog C [Oryctolagus cuniculus]
Length = 458
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 141/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTICLKETIGKCSEDHETENEICHRKIISWFGDSPLAAFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + +YV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITVYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQSASMTSDNTDDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|261195783|ref|XP_002624295.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
gi|239587428|gb|EEQ70071.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
Length = 494
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 134/286 (46%), Gaps = 54/286 (18%)
Query: 138 AEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCML 174
A F DF S+I ++YR F DP S +T +D GWGCM+
Sbjct: 117 AAFLDDFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMI 176
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
RS Q L+A AL LGR WR+ + +E +L LF D +PFSIH ++ G A
Sbjct: 177 RSGQSLLANALAILFLGREWRRGTKV---KEESNLLSLFADDPRAPFSIHRFVEHGASAC 233
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G G W GP A R +AL+ C+ + +YV S D +D
Sbjct: 234 GKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD---------VYED 276
Query: 294 ASRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
R ++ S G D P L+L+ + LG+++V P Y L+ +PQ++GI GG+P
Sbjct: 277 RFR--AIASGGGTGTSTDIRPTLILLGIRLGIDRVTPVYWEALKAVLKYPQAVGIAGGRP 334
Query: 350 GASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHS 391
+S Y +G Q YLDPH +P + + + + + +TYH+
Sbjct: 335 SSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHT 380
>gi|198417051|ref|XP_002128504.1| PREDICTED: similar to autophagy-related cysteine endopeptidase 2
[Ciona intestinalis]
Length = 422
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 96/311 (30%), Positives = 143/311 (45%), Gaps = 58/311 (18%)
Query: 112 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWG 171
+IW+LG + + AL F + S + +YRKG+ PIG + TSD GWG
Sbjct: 39 NIWVLGSRFHLPHERAL-----------FLEHIKSFLWFTYRKGYTPIGGTGPTSDSGWG 87
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
CMLR QML+A+AL + + W+ KP Y ILH D +S +SIH + Q G
Sbjct: 88 CMLRCGQMLLARALAELTMDKDWKWTEDKPQPPPYKRILHQLSDERSSCYSIHQIAQMGV 147
Query: 232 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
G G W GP + + L++ + +AI+V + VCI
Sbjct: 148 EEGKEVGQWFGPNTISQVLRRLSQFDQENV--------LAIHVAMDN---------TVCI 190
Query: 292 DDASRHCSVFSKGQAD----------------------------WTPILLLVPLVLGLEK 323
+D R CS Q + W P+LLL+PL LGL +
Sbjct: 191 EDIERLCSTTPTTQYEGACSSTCKPDRTKCNGDSPNVSPTSDDFWRPLLLLIPLRLGLSE 250
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK--DD 381
+NP Y L+ + +S+G++GGKP + Y +G E+S I+LDPH QP + + +
Sbjct: 251 INPVYFTHLKECLHWKESVGVIGGKPNHAYYFLGCSEDSMIFLDPHTTQPYVKLPDITSN 310
Query: 382 LEADTSTYHSE 392
D +T+H +
Sbjct: 311 ERYDDTTFHCD 321
>gi|168693565|ref|NP_001108301.1| uncharacterized protein LOC100137698 [Xenopus laevis]
gi|163915830|gb|AAI57741.1| LOC100137698 protein [Xenopus laevis]
Length = 468
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 132/295 (44%), Gaps = 54/295 (18%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
++ + F +DF SR+ ++YR+ F + + +T+D GWGCM+RS QML+AQ LL H L R
Sbjct: 92 DDEIDRFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLMHLLSRE 151
Query: 194 W----------------------RKPL-------------------QKPF-DREYVEILH 211
W R PL + P ++ + I+
Sbjct: 152 WTWPEALYTHFVEMEPIRSSSPSRMPLSSLATSHSASDCWPHAHSSRAPHGNQVHRNIIR 211
Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
F D ++PF +H ++ G +G AG W GP +A + C+ ++
Sbjct: 212 WFSDHPSAPFGLHRMVALGSIFGKKAGDWYGP-------SIVAHIIKKAIETSCEVAELS 264
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
+YV S D + + D + G+A +++LVP LG E NP Y
Sbjct: 265 VYV-SQDCTVYKADIEQLFAGDVPHAETSRDAGKA----VIILVPARLGGETFNPVYKHC 319
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
L+ P LGI+GGKP S Y +G Q+ +YLDPH Q I+ ++D ++
Sbjct: 320 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYSQSYIDTSRNDFPLES 374
>gi|239614382|gb|EEQ91369.1| cysteine protease atg4 [Ajellomyces dermatitidis ER-3]
gi|327351393|gb|EGE80250.1| cysteine protease atg4 [Ajellomyces dermatitidis ATCC 18188]
Length = 494
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 134/286 (46%), Gaps = 54/286 (18%)
Query: 138 AEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCML 174
A F DF S+I ++YR F DP S +T +D GWGCM+
Sbjct: 117 AAFLDDFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMI 176
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
RS Q L+A AL LGR WR+ + +E +L LF D +PFSIH ++ G A
Sbjct: 177 RSGQSLLANALAILFLGREWRRGTKV---KEESNLLSLFADDPRAPFSIHRFVEHGASAC 233
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G G W GP A R +AL+ C+ + +YV S D +D
Sbjct: 234 GKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD---------VYED 276
Query: 294 ASRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
R ++ S G D P L+L+ + LG+++V P Y L+ +PQ++GI GG+P
Sbjct: 277 RFR--AIASGGGTGTSTDIRPTLILLGIRLGIDRVTPVYWEALKAVLKYPQAVGIAGGRP 334
Query: 350 GASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHS 391
+S Y +G Q YLDPH +P + + + + + +TYH+
Sbjct: 335 SSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHT 380
>gi|73956170|ref|XP_852273.1| PREDICTED: cysteine protease ATG4C isoform 2 [Canis lupus
familiaris]
gi|73956176|ref|XP_865426.1| PREDICTED: cysteine protease ATG4C isoform 4 [Canis lupus
familiaris]
Length = 458
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/342 (27%), Positives = 139/342 (40%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKFEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSAFTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSDSWTSNTVK 155
Query: 201 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 224
F D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGESELKTPTVSQKETIRRHSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + D +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|166990665|sp|Q2U5B0.2|ATG4_ASPOR RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
Length = 407
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/349 (30%), Positives = 145/349 (41%), Gaps = 71/349 (20%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA-----------QDE------ALG 129
+RI + + P + IW LGV + KI QDE +
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPGKLGNYQDELEAGTSKID 70
Query: 130 DAAGNNGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITS 166
D + F DF S+I ++YR F PI TS
Sbjct: 71 DVTAHGWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTS 130
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH
Sbjct: 131 DTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRF 187
Query: 227 LQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
++ G ++ G G W GP A R EAL+ C ++ +YV + D
Sbjct: 188 VKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD----- 234
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V D R V G P L+L+ LG++ V P Y L+ PQS+GI
Sbjct: 235 ---VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIA 288
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHS 391
GG+P AS Y +G Q YLDPH +P + D + + STYH+
Sbjct: 289 GGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHT 337
>gi|402854773|ref|XP_003892029.1| PREDICTED: cysteine protease ATG4C isoform 1 [Papio anubis]
gi|402854775|ref|XP_003892030.1| PREDICTED: cysteine protease ATG4C isoform 2 [Papio anubis]
Length = 458
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 141/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKSILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|383872484|ref|NP_001244816.1| cysteine protease ATG4C [Macaca mulatta]
gi|355745338|gb|EHH49963.1| hypothetical protein EGM_00712 [Macaca fascicularis]
gi|380788509|gb|AFE66130.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
gi|383413101|gb|AFH29764.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
Length = 458
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 141/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|118094640|ref|XP_422520.2| PREDICTED: cysteine protease ATG4C [Gallus gallus]
Length = 459
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/338 (28%), Positives = 136/338 (40%), Gaps = 76/338 (22%)
Query: 108 SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 156
S S ++LLG C+ DE+ L N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQAL------------------------------- 185
I S +T+D GWGC LR+ QML+AQ L
Sbjct: 96 PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155
Query: 186 ------------------LFHRLGRPWRKPLQKPFDREYV---EILHLFGDSETSPFSIH 224
L H R R+ R V +I+ FGDS + F +H
Sbjct: 156 KLTASLEASLTAEREPKILSHHQERTLRRDCGDSEMRNEVYHRKIISWFGDSPLAAFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + +YV
Sbjct: 216 QLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D R CS G+ D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+GGKP S Y G Q++S IY+DPH Q +++ D
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDF 360
>gi|432855098|ref|XP_004068071.1| PREDICTED: cysteine protease ATG4C-like [Oryzias latipes]
Length = 482
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 154/356 (43%), Gaps = 88/356 (24%)
Query: 108 SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAE---------FNQDFSSRILISYRKGFD 157
S S + LLG C H A D+ D A E F +DF+SR+ ++YR+ F
Sbjct: 36 SRNSPVLLLGRCYHFKADDDGSADEASCREPEEGFSMGNVEAFRKDFTSRVWLTYREEFP 95
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQ----------- 199
P+ S +T+D GWGC+LR+ QM++AQAL+ H LGR W +PL
Sbjct: 96 PLPGSTLTTDCGWGCLLRAGQMMLAQALVLHFLGRDWTWSEALTLQPLDTETWTASAAKR 155
Query: 200 -------------KPFDREYVE-----------------------ILHLFGDSETSPFSI 223
K DR++ E I+ FGD+ ++ +
Sbjct: 156 LVASLEASLQGSPKNSDRQHSEPQSSSQGSAEEAEAHLKEMYHRTIISWFGDTSSALLGL 215
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG--CQSLPMAIYVVSGD-ED 280
H L++ G G AG+W GP + + + ++GL + V S D D
Sbjct: 216 HRLVRLGLTMGKNAGNWYGPAVVAHILKKAVE-EAMDSGLAGITAYVSQDCTVYSADVAD 274
Query: 281 GER--------------GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 326
+ GG P +D S+ QA +++L+P+ LG EK+NP
Sbjct: 275 CHKPPSARQASVSPPIAGGGP--SKEDQPGSASILPDSQA----VIILIPVRLGGEKINP 328
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
Y ++ + +GI+GGKP + Y VG Q++S IY+DPH Q +++ D
Sbjct: 329 EYFEFVKNILSVEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSNGDF 384
>gi|395530478|ref|XP_003767321.1| PREDICTED: cysteine protease ATG4C [Sarcophilus harrisii]
Length = 458
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/338 (27%), Positives = 138/338 (40%), Gaps = 76/338 (22%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 154
S S + LLG C+ +E A G N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVLLLGKCYHFKSEEENDPAPVQPQWVGENEPVVVSGNVEEFRRDFISRIWLTYRE 95
Query: 155 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR------------------- 195
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDVDNSDSESWTSHT 155
Query: 196 ---------------------KPLQKPFDRE----------YVEILHLFGDSETSPFSIH 224
P+++P R + +I+ F DS + F +H
Sbjct: 156 VKKLTASLEASLTGERAAQDPSPIKEPPRRGSDDGGGEESCHRKIVSWFADSPLACFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEHGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + CS + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYKADVIDKQCSSMDPENTEDKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+GGKP S Y G Q++S IY+DPH Q +++ D
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDF 360
>gi|332029697|gb|EGI69576.1| Cysteine protease ATG4B [Acromyrmex echinatior]
Length = 383
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/278 (34%), Positives = 140/278 (50%), Gaps = 42/278 (15%)
Query: 135 NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 189
N + E + +D S++ +YRKGF PIG +S TSD GWGCMLR QM++AQAL+
Sbjct: 31 NAIKELDAIRRDIRSKLWFTYRKGFVPIGGCNSTFTSDKGWGCMLRCGQMVLAQALITLH 90
Query: 190 LGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
LG+ W+ P K + Y++IL F D + FSIH + G + G G W GP + +
Sbjct: 91 LGKDWQWMPETK--NNTYLKILRRFEDKRAAAFSIHQIALMGASEGKEVGQWFGPNTIAQ 148
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS------ 302
+ L + + I+V + + ++D R C V
Sbjct: 149 VLKKLIVYDEWSS--------LTIHVALDN---------TLIVNDILRQCRVEGGVTAEA 191
Query: 303 ------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
+ + W P+LLL+PL LGL ++NP YI L+ +F QSLG++GGKP + Y +
Sbjct: 192 DGEIPLRAPSQWKPLLLLIPLRLGLSEINPVYINGLKTSFKISQSLGVIGGKPNLALYFI 251
Query: 357 GVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYH 390
G + IYLDPH Q I ++++E D S YH
Sbjct: 252 GCVGDEVIYLDPHTTQKSGSIEDKISEEEIEMDIS-YH 288
>gi|170109871|ref|XP_001886142.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
gi|164639072|gb|EDR03346.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
Length = 1039
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/278 (34%), Positives = 138/278 (49%), Gaps = 51/278 (18%)
Query: 140 FNQDFSSRILISYRKGFD-PIGDSKI-------------------------------TSD 167
F DF+SRI ++YR F PI D+++ +SD
Sbjct: 336 FYIDFTSRIWLTYRSHFPTPIKDTRLADLCGDAAPEIANSPTTVKTRPWNWGGEKTWSSD 395
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDREYVEILHLFGDSET--SPFS 222
GWGCMLR+ Q L+A AL+ LGR WR+P +Q YV+I+ F D+ +PFS
Sbjct: 396 TGWGCMLRTGQSLLANALVHMHLGRDWRRPPYPVQTADYATYVQIVTWFLDTPAPEAPFS 455
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
+H + AGK +G G W GP + + L E+GLG VS DG
Sbjct: 456 VHRMALAGKEFGTDVGQWFGPSVAAGAIKTLVNS-FPESGLG----------VSVATDGT 504
Query: 283 RGGAPVVCI---DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
+ V + + +SR P+LLL+ + LG+E VNP Y T++L +TFP
Sbjct: 505 LFQSDVFAVSHGEMSSRSPRRIKTTTWGHRPVLLLLGIRLGIEGVNPIYYETIKLLYTFP 564
Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
QS+GI GG+P +S Y VG Q ++ YLDPH+ +P I +
Sbjct: 565 QSVGIAGGRPSSSYYFVGSQADNLFYLDPHNTRPAIPL 602
>gi|342877133|gb|EGU78640.1| hypothetical protein FOXB_10826 [Fusarium oxysporum Fo5176]
Length = 449
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 98/286 (34%), Positives = 131/286 (45%), Gaps = 50/286 (17%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 159
+A D+ D +G F DF SRI ++YR FDPI
Sbjct: 99 LAYDDQSNDGGWPSG---FITDFESRIWMTYRSEFDPIPRSTNPQATSSLSLSMRLKSQL 155
Query: 160 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
GD S +SD GWGCM+RS Q L+A + RLGR WR Q E IL F D
Sbjct: 156 GDQSPFSSDSGWGCMIRSGQSLLANTIALVRLGRDWR---QGQSLEEECRILKDFADDPR 212
Query: 219 SPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
+P+SIH+ ++ G A G G W GP A R +ALA +I V S
Sbjct: 213 APYSIHSFVRHGASACGKYPGEWFGPSATARCIQALANSHEP-----------SIRVYST 261
Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
G P V DD + + G+A + P L+LV LGL+K+ P Y L
Sbjct: 262 ------GDGPDVYEDDFMKIAN--PTGEA-FHPTLVLVGTRLGLDKITPVYWEALIAALQ 312
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
PQS+GI GG+P +S Y +G Q YLDPH +P + ++ ++
Sbjct: 313 MPQSVGIAGGRPSSSHYFIGSQGSFLFYLDPHHTRPALPYHENPMD 358
>gi|355558068|gb|EHH14848.1| hypothetical protein EGK_00836 [Macaca mulatta]
Length = 458
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 141/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -FSVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|389637385|ref|XP_003716330.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
gi|148887340|sp|Q523C3.2|ATG4_MAGO7 RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|351642149|gb|EHA50011.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
Length = 491
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 92/271 (33%), Positives = 120/271 (44%), Gaps = 53/271 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R ALA +Y G P V D
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367
Query: 356 VGVQEESA------IYLDPHDVQPVINIGKD 380
VG Q YLDPH +P + +D
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHED 398
>gi|14042698|dbj|BAB55356.1| unnamed protein product [Homo sapiens]
Length = 446
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 141/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|195159572|ref|XP_002020652.1| GL15485 [Drosophila persimilis]
gi|194117602|gb|EDW39645.1| GL15485 [Drosophila persimilis]
Length = 409
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 93/291 (31%), Positives = 144/291 (49%), Gaps = 38/291 (13%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D GWGCMLR QM++AQAL+ LGR W + D Y++I++ F D S +SIH
Sbjct: 92 TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ G++ A G W+GP + + + L L + ++V
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMD-------- 194
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V +DD C +G A W P+LL++PL LG+ +NP YIP L+ S G++
Sbjct: 195 -STVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----DLEADTSTYHSE 392
GG+P + Y +G E+ +YLDPH Q +G+ + E D TYH +
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQKTGVVGQKTSSGEQEHD-ETYHQK 299
>gi|310801857|gb|EFQ36750.1| peptidase family C54 [Glomerella graminicola M1.001]
Length = 454
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 133/299 (44%), Gaps = 54/299 (18%)
Query: 122 IAQDEALGDAAGNNG--LAEFNQDFSSRILISYRKGFDPIGDSK---------------- 163
+A DE D +G +G F DF S+ ++YR F I S
Sbjct: 101 LAYDE---DYSGQDGGWPTAFLDDFESKFWMTYRSEFPAIAKSTDPRASSALSFSMRIKS 157
Query: 164 -------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216
+SD GWGCM+RS Q L+A A+ LGR WR+ + +R+ +L LF D
Sbjct: 158 QLVDQNGFSSDSGWGCMIRSGQSLLANAMAVINLGRDWRRGQNQEEERK---LLSLFADD 214
Query: 217 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
+P+SIH +Q G A G G W GP A R +ALA Q + P+ +Y
Sbjct: 215 PRAPYSIHQFVQHGAVACGKYPGEWFGPSATARCIQALANAQMHQ--------PLRVYST 266
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
G P V D + + + + P L+LV LG++K+ P Y L
Sbjct: 267 --------GDGPDVYED---KFMKIAKPDGSRFHPTLILVGTRLGIDKITPVYWEALIAA 315
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHS 391
PQS+GI GG+P +S Y +G Q YLDPH +P + + EAD T H+
Sbjct: 316 LQMPQSVGIAGGRPSSSHYFIGAQGSYLFYLDPHHTRPALPFHMNPSLYSEADVDTVHT 374
>gi|327270876|ref|XP_003220214.1| PREDICTED: cysteine protease ATG4C-like [Anolis carolinensis]
Length = 459
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 92/333 (27%), Positives = 141/333 (42%), Gaps = 76/333 (22%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAA-GNN----------GLAEFNQDFSSRILISYRKGF 156
S S ++LLG C+ DE + G+N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKTDEPTEQSPNGSNYDVTEEEVSRNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------------------ 198
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIKGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWPDALVNENPESESWTSHTVK 155
Query: 199 ------------QKPFDREYV----------------------EILHLFGDSETSPFSIH 224
+K F + + +I+ FGDS + F +H
Sbjct: 156 KLTASFEASLIGEKEFKNQSIPPRQIRKRDWGKRESRDEHYHRKIVSWFGDSPLANFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ G G AG W GP + L R + E + + +YV
Sbjct: 216 RLIEYGNKSGKMAGDWYGPAVVAH----LLR-KAVEEAKDPELQGITVYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D CS+ + +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYKSDVVEMQCSLKDSEKPGAKSVIILIPVRLGGERTNMEYLEFVKGILSLEYCIGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
VGG+P S Y G Q++S IY+DPH Q +++
Sbjct: 323 VGGRPKQSYYFAGFQDDSLIYMDPHYCQSFVDV 355
>gi|308490628|ref|XP_003107506.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
gi|308251874|gb|EFO95826.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
Length = 478
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 80/263 (30%), Positives = 130/263 (49%), Gaps = 35/263 (13%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
+GL + +SR+ +YR+ F PIG + ++D GWGCMLR +QML+ + LL +GR +
Sbjct: 47 DGLEAMKKYMTSRLWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLRRHIGRHF 106
Query: 195 RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YA 245
++K Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 107 EWDIEKT-SEVYDKILQMFFDEKDALYSIHQIAQMGVTEGKKVSEWFGPNTAAQVIKKLT 165
Query: 246 MCRSWEALARCQRAETGLGCQ-SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV---- 300
+ W +A + L + +L MA S + + + + + ++ ++
Sbjct: 166 IFDDWSNIAVHVALDNILVKEDALTMATTYPSDN------ASYIFAVHNFLKYFTLNLTF 219
Query: 301 --FSK-GQ-----------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
F++ GQ DW P+L+++PL LGL +NP Y+P ++ F PQ +GI+G
Sbjct: 220 PNFAENGQIEKPRPSSGCTTDWRPLLVMIPLRLGLTSINPCYLPAIQKFFELPQCVGIIG 279
Query: 347 GKPGASTYIVGVQEESAIYLDPH 369
GKP + Y VG+ YLDPH
Sbjct: 280 GKPNLAHYFVGIAGTKLFYLDPH 302
>gi|125986465|ref|XP_001356996.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
gi|54645322|gb|EAL34062.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
Length = 409
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 93/291 (31%), Positives = 144/291 (49%), Gaps = 38/291 (13%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D GWGCMLR QM++AQAL+ LGR W + D Y++I++ F D S +SIH
Sbjct: 92 TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ G++ A G W+GP + + + L L + ++V
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMD-------- 194
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V +DD C +G A W P+LL++PL LG+ +NP YIP L+ S G++
Sbjct: 195 -STVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----DLEADTSTYHSE 392
GG+P + Y +G E+ +YLDPH Q +G+ + E D TYH +
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQRTGVVGQKTSSGEQEHD-ETYHQK 299
>gi|440478911|gb|ELQ59709.1| cysteine protease atg4 [Magnaporthe oryzae P131]
Length = 572
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 92/271 (33%), Positives = 120/271 (44%), Gaps = 53/271 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R ALA +Y G P V D
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 390 -FMEVAKSDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448
Query: 356 VGVQEESA------IYLDPHDVQPVINIGKD 380
VG Q YLDPH +P + +D
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHED 479
>gi|358056752|dbj|GAA97415.1| hypothetical protein E5Q_04093 [Mixia osmundae IAM 14324]
Length = 1202
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 92/283 (32%), Positives = 128/283 (45%), Gaps = 53/283 (18%)
Query: 140 FNQDFSSRILISYRKGFDPI---------------------------GDSKITSDVGWGC 172
F +DF+SRI ++YR GF PI + +++D GWGC
Sbjct: 545 FYEDFTSRIQLTYRAGFPPIPTTVSNGPATTAFNAVLSSLTGRSPLQANDGLSTDAGWGC 604
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPL----QKPFDRE----------YVEILHLFGD--S 216
MLR+ Q L+A AL F LGR WR+ + P E Y +L F D S
Sbjct: 605 MLRTGQSLLANALAFVHLGRDWRRTCSSSDESPDIPEESRSLEHFETYARLLTWFLDDPS 664
Query: 217 ETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV- 274
PFS+H GK G G W GP + + LA + +L +A+ V
Sbjct: 665 PLCPFSVHRFAVVGKEQGGKEIGEWFGPSTAAGAIKHLA------SNFAPANLGVAVSVD 718
Query: 275 --VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
V + P A R S + P+L+L+ LGL+KVNP Y ++
Sbjct: 719 GTVYRSDVQAAANPPFSEPATAGRQDPAPSVRTSWQRPVLILINARLGLDKVNPLYYESI 778
Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
+ +FPQS+GI GG+P +S Y VGVQ+ S Y+DPH +P I
Sbjct: 779 KAALSFPQSVGISGGRPSSSYYFVGVQQNSVYYIDPHHTKPAI 821
>gi|440467300|gb|ELQ36530.1| cysteine protease atg4 [Magnaporthe oryzae Y34]
Length = 572
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 92/271 (33%), Positives = 120/271 (44%), Gaps = 53/271 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R ALA +Y G P V D
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 390 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448
Query: 356 VGVQEESA------IYLDPHDVQPVINIGKD 380
VG Q YLDPH +P + +D
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHED 479
>gi|354470829|ref|XP_003497647.1| PREDICTED: cysteine protease ATG4C [Cricetulus griseus]
Length = 458
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 141/342 (41%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENSDSDSWTSNTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELRTTALSLKETIGKYSDDHAVQNEIYHRKIISWFGDSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTNSSTSGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|210063823|gb|ACJ06587.1| ATG4 protein [Magnaporthe oryzae]
Length = 491
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 92/271 (33%), Positives = 121/271 (44%), Gaps = 53/271 (19%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKI----------------TSDVGWGCMLRS 176
F DF SRI ++YR GF DP S++ T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFESIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R ALA +Y G P V D
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367
Query: 356 VGVQEESA------IYLDPHDVQPVINIGKD 380
VG Q YLDPH +P + +D
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHED 398
>gi|388581514|gb|EIM21822.1| hypothetical protein WALSEDRAFT_68740 [Wallemia sebi CBS 633.66]
Length = 603
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 95/300 (31%), Positives = 133/300 (44%), Gaps = 57/300 (19%)
Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGF------DPIGDS------------------- 162
LG+ NN ++ DF SRI +YR F DP+ D
Sbjct: 55 LGNLYDNN--SDLLDDFQSRIWCTYRSNFCQISLNDPMMDDLGLAKMQTLSSKPSHWLLR 112
Query: 163 --KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK-PLQKPFD----REYV---EILHL 212
+D GWGCMLR+SQ L+A L LGR WR+ P D +EYV ++L+L
Sbjct: 113 ERTFNTDQGWGCMLRTSQSLLANTLQIMLLGRQWRRNPFVDLTDYAKRKEYVNLIKLLNL 172
Query: 213 FGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
F D S SPFS+H + GK+ G G W GP + + L Q + L S+
Sbjct: 173 FMDNPSTLSPFSVHRMAVVGKSLGKEVGEWFGPSTAALAIKHLVNNQ-TDINLSV-SVAS 230
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRY 328
+ D GG ++W P+L+LV + LGL+ ++PRY
Sbjct: 231 DSVIYKSDVYQASGGTSTT--------------ADSEWGNKPVLILVGVRLGLDGIHPRY 276
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
TL+ +GI GG+P +S Y G Q +S Y+DPH ++P INI E + T
Sbjct: 277 YETLKAFLRMQSCVGIAGGRPSSSYYFFGYQSDSLFYVDPHIMKPTINIKTPPTEGELKT 336
>gi|212645205|ref|NP_493375.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
gi|193247781|emb|CAB54483.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
Length = 454
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 129/264 (48%), Gaps = 27/264 (10%)
Query: 127 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 185
ALG + + +G+ + +SR +YR+ F PIG + ++D GWGCMLR +QML+ + L
Sbjct: 39 ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 98
Query: 186 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-- 243
L +GR + ++K Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 99 LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 157
Query: 244 -------YAMCRSWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 289
+ W +A + L + ++ MA S D E+G
Sbjct: 158 AAQVMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENG-------- 209
Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
+ D +R +W P+LL++PL LGL +NP Y+ ++ F PQ +GI+GG+P
Sbjct: 210 -LVDKNRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRP 268
Query: 350 GASTYIVGVQEESAIYLDPHDVQP 373
+ Y VG+ YLDPH +P
Sbjct: 269 NHALYFVGMSGSKLFYLDPHYCRP 292
>gi|453230621|ref|NP_001263575.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
gi|412974713|emb|CCO25637.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
Length = 481
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 79/259 (30%), Positives = 127/259 (49%), Gaps = 17/259 (6%)
Query: 127 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 185
ALG + + +G+ + +SR +YR+ F PIG + ++D GWGCMLR +QML+ + L
Sbjct: 66 ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 125
Query: 186 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-- 243
L +GR + ++K Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 126 LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 184
Query: 244 -------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 296
+ W +A A + + + + ED + +D +
Sbjct: 185 AAQVMKKLTIFDDWSNIA-VHVALDNILVKEDAITMATSYPSEDAVKLIMENGLVD---K 240
Query: 297 HCSVFSKGQA--DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ S G +W P+LL++PL LGL +NP Y+ ++ F PQ +GI+GG+P + Y
Sbjct: 241 NRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRPNHALY 300
Query: 355 IVGVQEESAIYLDPHDVQP 373
VG+ YLDPH +P
Sbjct: 301 FVGMSGSKLFYLDPHYCRP 319
>gi|62899783|sp|Q86ZL5.1|ATG4_PODAS RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|27802993|emb|CAD60696.1| unnamed protein product [Podospora anserina]
Length = 500
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 86/263 (32%), Positives = 119/263 (45%), Gaps = 61/263 (23%)
Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
F DF SRI ++YR GF+ I GD + +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A A+L R GR WR+ +R EI+ LF D +P+SI N + G A G
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIER---EIVCLFADDPRAPYSIQNFVNHGAAACGK 289
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI---YVVSGDEDGERGGAPVVCID 292
G W GP A ARC + + LP ++ + + DG
Sbjct: 290 YPGEWFGP-------SATARCIHSLRVYLTRDLPEVYEDNFMSTANPDGNH--------- 333
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
+ P L+LV LG++K+NP Y L T PQ++GI GG+P +S
Sbjct: 334 ---------------FHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSS 378
Query: 353 TYIVGVQEESAIYLDPHDVQPVI 375
Y +G Q + YLDPH +P +
Sbjct: 379 HYFIGAQGQWLFYLDPHHPRPAL 401
>gi|444518589|gb|ELV12252.1| Cysteine protease ATG4B, partial [Tupaia chinensis]
Length = 324
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 127/286 (44%), Gaps = 61/286 (21%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
++ +W+LG + + ++ E D +SR+ +YRK F IG + TSD
Sbjct: 26 TSEPVWILGRKYSVLTEKE-----------EILSDVASRLWFTYRKNFPAIGGTGPTSDT 74
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
GWGCMLR QM+ AQAL+ LGR WR + Q+P Y +L+ F D + S +SIH +
Sbjct: 75 GWGCMLRCGQMIFAQALVCRHLGRDWRWAQWTQQP--DSYFNVLNAFIDRKDSYYSIHQI 132
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA
Sbjct: 133 AQMGVGEGKSIGQWYGPNTVAQVLKKLA-------------------------------- 160
Query: 287 PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
VF + I + +V G +N Y+ TL+ F PQSLG++G
Sbjct: 161 -------------VFDTWSSLAVHIAMDNTVVTGEININEAYVETLKHCFMMPQSLGVIG 207
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
GKP ++ Y +G + IYLDPH QP + + L D S +H +
Sbjct: 208 GKPNSAHYFIGYVGDELIYLDPHTTQPAVELTDSCLVPDES-FHCQ 252
>gi|431912280|gb|ELK14417.1| Cysteine protease ATG4B [Pteropus alecto]
Length = 431
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/329 (31%), Positives = 147/329 (44%), Gaps = 42/329 (12%)
Query: 91 MRRIHERVLGPSRTGISSSTSDI--WLLGVCHKIAQDEALGDAAGNNGLA--EFNQDFSS 146
MR R P R+ +SS+ + W +++ L + E D +S
Sbjct: 1 MRPGPRRSCTPRRSALSSTLGEASDWCTAAAREVSAVSGLSQLQQDESYEKDEILSDVAS 60
Query: 147 RILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY 206
R+ +YRK F IG + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 61 RLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSY 120
Query: 207 VEILHLFGDSETSPFSIHNLLQAG------KAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
+L F D + S +SIH + + S +GP +C+S+ A+ +R
Sbjct: 121 FSVLRAFMDRKDSYYSIHQIAPVHPQSRFWRQSASVRTSVLGP-QLCQSFAAVRLSRRRR 179
Query: 261 TGLGCQSLP--MAIYVVSGDEDGERGGAPVVCIDD--ASRHCSVFSKG--------QADW 308
L S P +A++ V ++D A RHC+ G W
Sbjct: 180 WELVTLSSPGKLAVFDTWSALAVHIAMDNTVVMEDISADRHCNGVPAGAEVTHRPPLPPW 239
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLT-------------------FTFPQSLGIVGGKP 349
P++LL+PL LGL +N Y+ TL+L F PQSLG++GGKP
Sbjct: 240 RPLVLLIPLRLGLTDINEAYVGTLKLASTLVGLCSAAASLPLRQHCFMMPQSLGVIGGKP 299
Query: 350 GASTYIVGVQEESAIYLDPHDVQPVINIG 378
++ Y +G E IYLDPH QP + +
Sbjct: 300 NSAHYFIGYVGEELIYLDPHTTQPAVEVA 328
>gi|297265289|ref|XP_002799164.1| PREDICTED: cysteine protease ATG4B-like [Macaca mulatta]
Length = 358
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 88/258 (34%), Positives = 128/258 (49%), Gaps = 29/258 (11%)
Query: 137 LAEFNQDF---SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
AE+ +DF S + I RK G + TSD GWGCMLR QM+ AQAL+ LGR
Sbjct: 13 FAEY-EDFPETSEPVWILGRKYSIFTGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRD 71
Query: 194 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 253
WR +K Y +L+ F D + S +SIH + Q G G + G W GP + + + L
Sbjct: 72 WRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKL 131
Query: 254 ARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFS 302
A + +A+++ V +E V C D+ RHC+ F
Sbjct: 132 AVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFP 183
Query: 303 KGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +
Sbjct: 184 AGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFI 243
Query: 357 GVQEESAIYLDPHDVQPV 374
G ES+ + P + P+
Sbjct: 244 GYVGESSSHRVPVGLCPL 261
>gi|328722655|ref|XP_003247627.1| PREDICTED: cysteine protease ATG4B-like [Acyrthosiphon pisum]
Length = 252
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/233 (30%), Positives = 118/233 (50%), Gaps = 32/233 (13%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + +W+LG + D L + D SR+ +YRKGF IG++ T
Sbjct: 40 IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
SD GWGCMLR QM++ QAL+F LGR WR K D +Y++IL +F D ++P+SIH
Sbjct: 89 SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ G ++G G W GP + + + LA L ++ V+ D
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLA---------TMDELSSLVFHVALDN------ 192
Query: 286 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLT 335
+ I++ + C+V + + W P++L++PL LG+ +NP Y+ ++++
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKVS 243
>gi|126305934|ref|XP_001364974.1| PREDICTED: cysteine protease ATG4C [Monodelphis domestica]
Length = 460
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 101/340 (29%), Positives = 145/340 (42%), Gaps = 78/340 (22%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 154
S S + LLG C+ +E A AG N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVLLLGKCYHFKSEEENDPAPVGSGWAGENEHVVIYGNVEEFRRDFISRIWLTYRE 95
Query: 155 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------- 197
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDIENSDSASWTSHT 155
Query: 198 ------------------------LQKPF-----DRE------YVEILHLFGDSETSPFS 222
L++P D E + +I+ FGDS + F
Sbjct: 156 VKKLTASFEASLTGERTPKVPPSILKEPRRTGSEDEEGRNELCHRKIISWFGDSPLACFG 215
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
+H L++ GK G AG W GP + R G + IYV +D
Sbjct: 216 LHQLIEYGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVA---QDCT 267
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
A V+ S ++ +A I+LLVP+ LG E+ N Y+ ++ + +
Sbjct: 268 VYKADVIDKQGISAGLET-TEDKA----IILLVPVRLGGERTNMDYLDFVKGILSLEYCV 322
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
GI+GGKP S Y G Q++S IY+DPH Q +++ D
Sbjct: 323 GIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDF 362
>gi|14042153|dbj|BAB55127.1| unnamed protein product [Homo sapiens]
Length = 331
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 111/221 (50%), Gaps = 25/221 (11%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V+C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VLCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 213
>gi|157818033|ref|NP_001101418.1| cysteine protease ATG4C [Rattus norvegicus]
gi|149044549|gb|EDL97808.1| similar to APG4 autophagy 4 homolog C [Rattus norvegicus]
Length = 458
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 140/342 (40%), Gaps = 76/342 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKVLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG------------------------- 191
I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIESSDSDSWTSNTIH 155
Query: 192 -------------RPWRKPL--------QKPFDRE------YVEILHLFGDSETSPFSIH 224
R R P + P D + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELRTPAVSLKETSGKHPDDHAVQSEIYHRQIISWFGDSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNIDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+GGKP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 364
>gi|403291503|ref|XP_003936827.1| PREDICTED: cysteine protease ATG4B [Saimiri boliviensis
boliviensis]
Length = 319
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 78/221 (35%), Positives = 111/221 (50%), Gaps = 25/221 (11%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C DA+RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADANRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 213
>gi|74665877|sp|Q4U3V5.1|ATG4_CRYPA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|66576169|gb|AAY51673.1| putative cysteine protease Atg4 [Cryphonectria parasitica]
Length = 459
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/290 (32%), Positives = 127/290 (43%), Gaps = 57/290 (19%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 163
+A DE L DA F DF SR+ ++YR F+PI S
Sbjct: 109 LAYDELLEDAGWP---IAFLDDFESRVWMTYRSEFEPISKSNDPRASAALSFAMRLRTLA 165
Query: 164 ----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 219
+SD GWGCM+RS Q L+A L+ +LGR WR+ R+ EIL F D +
Sbjct: 166 DQGGFSSDTGWGCMIRSGQSLLANTLVICQLGRDWRRGKAA---RQEREILARFADDPRA 222
Query: 220 PFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 278
P+S+HN ++ G A G G W GP A R +ALA + + +Y
Sbjct: 223 PYSLHNFVRHGAVACGKFPGEWFGPSATARCIQALANSNESS---------LRVYST--- 270
Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
G P V D +V + P L+LV LG++K+N Y L T
Sbjct: 271 -----GDLPDVYEDS---FMAVAKPDGETFHPTLILVGTRLGIDKINQVYWEALTATLQM 322
Query: 339 PQSLGIVGGKPGASTYIVGVQEES--------AIYLDPHDVQPVINIGKD 380
PQS+GI GG+P AS Y +G Q YLDPH +P + +D
Sbjct: 323 PQSVGIAGGRPSASHYFIGAQRSGDAYEPGSYLFYLDPHCTRPALPFHED 372
>gi|392586633|gb|EIW75969.1| hypothetical protein CONPUDRAFT_111807 [Coniophora puteana
RWD-64-598 SS2]
Length = 1038
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 102/318 (32%), Positives = 139/318 (43%), Gaps = 74/318 (23%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------------------ 164
+Q A G + EF DF+SRI ++YR F PI DS +
Sbjct: 271 SQSPASEKHPGQDWAPEFYADFTSRIWLTYRNQFAPIRDSTLSTLESDQTREPCTEMSSP 330
Query: 165 --------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YV 207
T+D GWGCMLR+ Q L+A ALL LGR WR+P + + YV
Sbjct: 331 SPKSRRWFGGEKGWTTDTGWGCMLRTGQTLLANALLHLHLGRDWRRPPYPLYTEDYATYV 390
Query: 208 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+I+ F DS +PFS+H + AGK G G W GP + + L + + GLG
Sbjct: 391 QIITWFLDSPLPQAPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKRLVQA-FPDAGLGV 449
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------W--TPIL 312
V D A V+S D W +L
Sbjct: 450 ----------------------AVASDGALYQTDVYSASYVDVGSPRNVRKLRWGGRAVL 487
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
+L + LG+ VNP Y T++ F PQS+GI GG+P +S Y +GVQ ++ IYLDPH +
Sbjct: 488 VLFGIRLGINGVNPIYYDTIKGLFEIPQSVGIAGGRPSSSYYFMGVQGDNLIYLDPHHAR 547
Query: 373 PVINIGKDDLEADTSTYH 390
P I + + EAD H
Sbjct: 548 PAIPL-RPLPEADEGNQH 564
>gi|296206033|ref|XP_002750034.1| PREDICTED: cysteine protease ATG4B isoform 2 [Callithrix jacchus]
Length = 319
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 78/221 (35%), Positives = 110/221 (49%), Gaps = 25/221 (11%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C DA RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADADRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 213
>gi|119591685|gb|EAW71279.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 331
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 110/221 (49%), Gaps = 25/221 (11%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 213
>gi|260949671|ref|XP_002619132.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
gi|238846704|gb|EEQ36168.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
Length = 340
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 92/288 (31%), Positives = 132/288 (45%), Gaps = 61/288 (21%)
Query: 137 LAEFNQDFSSRILISYRKGFDPI------------------------------GDSKITS 166
L E +SR+ +YR GF+PI + ++
Sbjct: 52 LEEIYPVINSRLWFTYRAGFEPIQKAEDGPSPLAFLKSMIFNVRPSMALGGLFDNQNYST 111
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHN 225
DVGWGCM+R+SQ L+A AL LGR + P E VE I+ LFGD T PFS+HN
Sbjct: 112 DVGWGCMIRTSQSLLANALQMLILGRDHQSPQAIQSAPEKVEKIIQLFGDDYTCPFSLHN 171
Query: 226 LLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
++ A L G W GP A S + L C + E+ ++ ++I D E
Sbjct: 172 FIKVASASPLKVKPGEWFGPSAASLSIKRL--CAKFESN-EIPNINVSICESCNLYDEEI 228
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
G +F + + +P+L+L PL LG++K+N Y P+L QS+G
Sbjct: 229 RG--------------IFEESE---SPLLILFPLRLGIDKINSIYYPSLLQLLALKQSVG 271
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
I GGKP +S Y G Q + +YLDPH++Q +D TYH+
Sbjct: 272 IAGGKPSSSYYFFGFQGSNLLYLDPHNLQAA--------SSDPGTYHT 311
>gi|426339167|ref|XP_004033531.1| PREDICTED: cysteine protease ATG4B isoform 1 [Gorilla gorilla
gorilla]
gi|426339169|ref|XP_004033532.1| PREDICTED: cysteine protease ATG4B isoform 2 [Gorilla gorilla
gorilla]
gi|221045722|dbj|BAH14538.1| unnamed protein product [Homo sapiens]
Length = 319
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 110/221 (49%), Gaps = 25/221 (11%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 213
>gi|194384462|dbj|BAG59391.1| unnamed protein product [Homo sapiens]
Length = 319
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 110/221 (49%), Gaps = 25/221 (11%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDSTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 213
>gi|395733089|ref|XP_002813143.2| PREDICTED: cysteine protease ATG4B isoform 2 [Pongo abelii]
Length = 331
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 110/221 (49%), Gaps = 25/221 (11%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRNS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVE 213
>gi|448112117|ref|XP_004202013.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
gi|359465002|emb|CCE88707.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
Length = 480
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/283 (31%), Positives = 130/283 (45%), Gaps = 54/283 (19%)
Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 159
LG G+ E ++D SRI +YR GF+PI
Sbjct: 69 LGRRYGSGSKEEMDKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128
Query: 160 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 213
+ T+DVGWGCM+R+SQML+A A+ LGR + ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAIQLLLLGRGFT--YADSSEKKHSDIIDMF 186
Query: 214 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
D +PFS+HN ++A L G W GP A S + L + Q E+ S P
Sbjct: 187 TDDPKAPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQFDES-----SSPRF 241
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
++S D DD + + + + IL+L+P+ LGL KV+P Y +
Sbjct: 242 RVIISESCD---------IYDD--KIGKLLQENEDAEGAILILLPVRLGLNKVSPYYHNS 290
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
L F+ PQ +GI GGKP +S Y G + +YLDPH Q V
Sbjct: 291 LSSLFSSPQLVGIAGGKPSSSYYFFGSHNGNLLYLDPHYPQSV 333
>gi|50543736|ref|XP_500034.1| YALI0A13277p [Yarrowia lipolytica]
gi|62899740|sp|Q6CH28.1|ATG4_YARLI RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49645899|emb|CAG83963.1| YALI0A13277p [Yarrowia lipolytica CLIB122]
Length = 545
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 105/317 (33%), Positives = 133/317 (41%), Gaps = 101/317 (31%)
Query: 139 EFNQDFSSRILISYRKGF--------------------------DPIGDSKITSDVGWGC 172
+F D SRI +SYR GF DP G TSDVGWGC
Sbjct: 64 DFLADVQSRIWLSYRTGFPLIPKSDGSGTIHLGKLKNMIRGGGFDPRG---YTSDVGWGC 120
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE------------------------ 208
M+R+SQ L+A ALLF LGR WR K D Y+
Sbjct: 121 MIRTSQSLLANALLFRHLGRGWR--WNKGDDFVYLSEGNTESRGGESRNGGANKEQETAV 178
Query: 209 ----------ILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQ 257
I+ F DS SPFSIH ++ G KA AG W GP A S AL
Sbjct: 179 SEETAVSEETIISWFLDSPDSPFSIHKFVRHGEKACSTPAGDWFGPSAAGSSIYAL---- 234
Query: 258 RAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLL 314
C P + +Y +G GG V D+ + G P+L+L
Sbjct: 235 -------CNEFPDSGLKVYY-----NGNGGGD--VYEDE------LLETG----FPLLVL 270
Query: 315 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
L LG++ VNP Y +LR + PQS+GI GG+P S Y G Q E YLDPH +P
Sbjct: 271 CGLRLGIDNVNPIYWDSLRQMLSLPQSVGIAGGRPFTSHYFFGFQGEQLFYLDPHQPKPA 330
Query: 375 INIGKDDLEADTSTYHS 391
+ + DT+++HS
Sbjct: 331 VKT----TDKDTTSFHS 343
>gi|255945233|ref|XP_002563384.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
gi|166990617|sp|A7KAL5.1|ATG4_PENCW RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|129561973|gb|ABO31075.1| Atg4p [Penicillium chrysogenum]
gi|211588119|emb|CAP86190.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 401
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 135/313 (43%), Gaps = 69/313 (22%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAE-----------------FNQDFSSRILISYRKG 155
IW LG + A + D A NN + F DF SRI I+YR
Sbjct: 29 IWCLG--REYAPSQPPSDPASNNPRSPSRQPNASTLNDTTWPKAFLSDFGSRIWITYRSN 86
Query: 156 FDPIGDSK-----------------------ITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
F PI +K TSD GWGCM+RS Q L+A LGR
Sbjct: 87 FTPIPRTKTPEATSSMTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQSLLANTFSVLLLGR 146
Query: 193 PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWE 251
WR+ + E +++ +F D +PFSIH + G ++ G G W GP
Sbjct: 147 DWRRGEKV---EEESKLISMFADHPEAPFSIHRFVNRGAESCGKYPGEWFGP-------S 196
Query: 252 ALARCQRAETGLGCQS-LP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
A A+C + L QS +P + +Y+ + D +D H + G+
Sbjct: 197 ATAKCIQL---LSTQSEVPQLRVYLTNDTSD---------VYEDKFAHVAHDESGRIQ-- 242
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P L+L+ LG++ V P Y LR T+PQS+GI GG+P AS Y VG Q+ +LDPH
Sbjct: 243 PTLILIGTRLGIDNVTPAYWDGLRAALTYPQSVGIAGGRPSASHYFVGAQDCHLFFLDPH 302
Query: 370 DVQPVINIGKDDL 382
+P D L
Sbjct: 303 TTRPATLYRPDGL 315
>gi|56118282|ref|NP_001007883.1| cysteine protease ATG4C [Xenopus (Silurana) tropicalis]
gi|61211764|sp|Q68EP9.1|ATG4C_XENTR RecName: Full=Cysteine protease ATG4C; AltName:
Full=Autophagy-related protein 4 homolog C
gi|51258902|gb|AAH80152.1| apg4c protein [Xenopus (Silurana) tropicalis]
gi|89269108|emb|CAJ81923.1| APG4 autophagy 4 homolog C (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
Length = 450
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 92/334 (27%), Positives = 137/334 (41%), Gaps = 91/334 (27%)
Query: 110 TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 157
S ++LLG C+ +++ D N+G + EF +DF SRI ++YR+ F
Sbjct: 38 NSPVFLLGKCYHFKYEDSSVTSDGGSNSGSESKEDLSGNVDEFRKDFISRIWLTYREEFP 97
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 194
I S T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 98 QIETSSWTTDCGWGCTLRTGQMLLAQGLIVHFLGRDWTWTEALDIFSSESEFWTANTARK 157
Query: 195 -------------------RKPLQKPFDREYVEILH-----LFGDSETSPFSIHNLLQAG 230
++PL + E H F D + F +H L++ G
Sbjct: 158 LTPSLETSFSENNECVSSNKQPLHNCDKKSNSEDFHQKIISWFADYPLAYFGLHQLVKLG 217
Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
K G AG W GP + L R E+ D E G +
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256
Query: 291 IDDASRHCSVFSKGQADW-------TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
D C+++S D +++LVP+ LG E+ N Y ++ + +G
Sbjct: 257 AQD----CTIYSADVYDLQCNKGTEKAVVILVPVRLGGERTNMEYFEFVKGILSLEFCIG 312
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
I+GGKP S Y VG Q++S IY+DPH Q +++
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDV 346
>gi|358336800|dbj|GAA27956.2| autophagy-related protein 4 [Clonorchis sinensis]
Length = 507
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 90/270 (33%), Positives = 130/270 (48%), Gaps = 52/270 (19%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-----KPLQKPFDREYVEILHLFGD--SE 217
TSD GWGCM+RS QML+AQ L+ H LGR WR P++ P D + +++ F D S+
Sbjct: 183 TSDSGWGCMIRSGQMLLAQTLMIHLLGRDWRAFRGTSPIKTPEDHLHRQLIRWFHDCWSQ 242
Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMC-----------RSWEALARCQ--------- 257
SPFS+H L+QA G GSW GP +C R +E LAR
Sbjct: 243 ESPFSLHRLVQAS---GQLPGSWFGPATLCSALVKVMSDASRRFEELARVHIYWVRDRVI 299
Query: 258 -RAET-----GLGCQSLPMAIYVVSGDEDGERGGA-------PVVCIDD---ASRHCSVF 301
R E G + P + E+ + + P + D +S ++F
Sbjct: 300 YREEIMNLARGQPVRRKPGRLNFTDFSENFQHCCSQECSPPIPPTYLQDGIQSSPSTTLF 359
Query: 302 SKGQADWTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
++LL+P+ LGL+K ++ RY+P + P +GI+GG+P S YI+G Q
Sbjct: 360 PSHA-----VILLLPIRLGLDKRIDARYVPMVCRLVRDPCFVGIIGGRPRHSIYILGCQN 414
Query: 361 ESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
I+LDPH QPV+ D E + T+H
Sbjct: 415 TQLIHLDPHFTQPVVRNVVDSEEFNVKTWH 444
>gi|14041938|dbj|BAB55042.1| unnamed protein product [Homo sapiens]
Length = 280
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 110/221 (49%), Gaps = 25/221 (11%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEGLIYLDPHTTQPAVE 213
>gi|195437827|ref|XP_002066841.1| GK24338 [Drosophila willistoni]
gi|194162926|gb|EDW77827.1| GK24338 [Drosophila willistoni]
Length = 400
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 84/264 (31%), Positives = 133/264 (50%), Gaps = 28/264 (10%)
Query: 135 NGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
N + E + +D SR+ +YR F P+G+ ++T+D GWGCMLR QM++AQAL+ LG
Sbjct: 52 NAIQELDLIRRDIQSRLWCTYRHSFVPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLG 111
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
R W + D Y++I++ F D+ S +S+H + G++ G W+GP + + +
Sbjct: 112 REWYWT-SECRDATYLKIVNRFEDARKSYYSLHQIALMGESQNKMVGEWLGPNTVAQILK 170
Query: 252 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 311
L C L I+V V +DD S+ W P+
Sbjct: 171 KLV-CFDDWCSL-------VIHVAMDS---------TVVLDDIYS----LSQDGESWKPL 209
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
LL++PL LG+ +NP Y+P L+ F S G++GG+P + Y VG ++ +YLDPH
Sbjct: 210 LLIIPLRLGITDINPIYVPALKRCFELESSCGMIGGRPNQALYFVGYVDDEVLYLDPHTT 269
Query: 372 QPVINIGKDDLEADT---STYHSE 392
Q +G+ A+ TYH +
Sbjct: 270 QRTGAVGQKTTTAEQELDETYHQK 293
>gi|348511374|ref|XP_003443219.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
Length = 459
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 126/280 (45%), Gaps = 55/280 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL------------- 185
F + F+S + +YR+GF P+ S +T+D GWGC+LRSSQML+AQ L
Sbjct: 97 HFRRCFASLLWFTYRRGFRPLPGSSLTTDSGWGCVLRSSQMLLAQGLLLHLMSPGWTWSG 156
Query: 186 ----------LFHRLGR---------------PWRKPLQKPFDREYVEILHLFGDSETSP 220
L H + W L +P + IL F D+ T+P
Sbjct: 157 NQRVVKDDMDLIHSVNDGFSSSERESKRSRHLSWGSILDRPTEGTPRRILRWFADNPTAP 216
Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
F IH L++ GK+ G AG W GP A R LP + V+ D
Sbjct: 217 FGIHRLVELGKSSGKKAGDWYGP-------SIAAHILRKAVEASVVDLPNLVAYVAQD-- 267
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
+ + D + C W +L+LVP+ LG + +NP YI +++
Sbjct: 268 ------CTIYLQDVRKLCE--RPLPQHWKSVLILVPVRLGGQDLNPSYITSVKKLLMLEC 319
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
+GI+GGKP S + VG Q++ +YLDPH QP +++ K+
Sbjct: 320 CIGIIGGKPKHSLFFVGFQDDHLLYLDPHYCQPTVDVTKN 359
>gi|149037474|gb|EDL91905.1| autophagy-related 4B (yeast), isoform CRA_b [Rattus norvegicus]
Length = 319
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 79/237 (33%), Positives = 116/237 (48%), Gaps = 26/237 (10%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E + A
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEISKLCRAS 112
Query: 288 VVCIDDAS------RHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
+ C A+ RHC+ G W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 LPCAGAAALSMESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +H +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDES-FHCQ 228
>gi|395323681|gb|EJF56143.1| hypothetical protein DICSQDRAFT_113447 [Dichomitus squalens
LYAD-421 SS1]
Length = 999
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 91/278 (32%), Positives = 133/278 (47%), Gaps = 50/278 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 166
F DF+SRI ++YR F PI D+ + TS
Sbjct: 303 FYADFTSRIWLTYRSQFFPIRDTTLAALEQEVHDSPTGLPSSPPSKRWNWPIGGEKGWTS 362
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 221
D GWGCMLR+ Q L+A ALL LGR WR+P + +Y V+I+ F D+ + PF
Sbjct: 363 DAGWGCMLRTGQSLLANALLHLHLGRDWRRPPHPVYTADYAMYVQIVTWFLDTPSPLCPF 422
Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
S+H + GK G G W GP + + L + GLG +A+ S +
Sbjct: 423 SVHRMALVGKDLGKEVGQWFGPSTAAGAIKTLVHS-FPDAGLG-----VAVASDSTLYES 476
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
+ A + RH + +W +L+L+ + LG+E VNP Y T++ +TFP
Sbjct: 477 DVYAASRSSVYSTRRH----GHPRMEWGDRAVLILIGIRLGIEGVNPLYYNTIKTLYTFP 532
Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
Q++GI GG+P +S Y VG Q ++ YLDPH +P I +
Sbjct: 533 QTVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAIPL 570
>gi|339252578|ref|XP_003371512.1| cysteine protease ATG4B [Trichinella spiralis]
gi|316968242|gb|EFV52545.1| cysteine protease ATG4B [Trichinella spiralis]
Length = 414
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 152/323 (47%), Gaps = 61/323 (18%)
Query: 109 STSDIWLLGVCHKIAQDE-------ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
S S IWLLG + A++E + + L++F +DF +RI +YR GF I
Sbjct: 45 SHSPIWLLG--KQYAKNEPRPNLRRGFDENSAVGKLSDFLEDFRTRIWFTYRHGFPCIPG 102
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDREYVEILHLFGDSET 218
+K +D GWGC +RS QML+A+ +L H LGR W + L + + +++ LF D+ T
Sbjct: 103 TKFDNDCGWGCTIRSGQMLLAETMLRHYLGRDWLLGQSGLPEDEALMHRKVIGLFCDNLT 162
Query: 219 SPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
SPFS+HNL+Q G+ +G AGSW GP ++ + + +A E GL +A++V+
Sbjct: 163 SPFSLHNLVQVGQQLFGKQAGSWYGPVSVLQILQ-VAMNNAIERGL---VEGLAVHVIGD 218
Query: 278 DE----DGERGG-----APV----------------VCIDDASRHCSV------------ 300
E D ER G APV D R SV
Sbjct: 219 GELIIDDVERLGCGLTLAPVPRRGPENDLADRQPKSSSYLDLRRLTSVSNGDLLPSHDGE 278
Query: 301 ------FSKGQADWTP-ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
F W+ +L+L+PL LG+EK N Y L+ + +G++GG+
Sbjct: 279 SIGSTEFVDETRSWSRGVLVLLPLRLGVEKFNQLYSDHLKRVLSTKFCVGVIGGRHHKCY 338
Query: 354 YIVGVQEESAIYLDPHDVQPVIN 376
Y G + I LDPH QP ++
Sbjct: 339 YFCGWHTDYLIRLDPHYSQPAVD 361
>gi|358381369|gb|EHK19044.1| hypothetical protein TRIVIDRAFT_181799 [Trichoderma virens Gv29-8]
Length = 451
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 126/277 (45%), Gaps = 52/277 (18%)
Query: 140 FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 176
F +D +++ ++YR GFDPI S +SD GWGCM+RS
Sbjct: 117 FLEDMAAKFWMTYRSGFDPIAKSVDPRATSALSFAVRIKSTLSDPTGFSSDSGWGCMIRS 176
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A + +LGR WR+ +E +++ +F D +P+SIHN ++ G A G
Sbjct: 177 GQSLLATTIGILQLGRDWRR---GKCQQEERQLISMFADDPRAPYSIHNFVRHGATACGK 233
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A A+C +A T LP+ +Y + +D + D
Sbjct: 234 FPGEWFGP-------SATAQCIQALTS--ASGLPLKVYSPNDGQDVYEDSFMKIAKPD-- 282
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
GQ D+ P L+L+ LG++K+ P Y L PQS+GI GG+P +S Y
Sbjct: 283 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLLAALQMPQSVGIAGGRPSSSHYF 333
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
VG Q YLDPH + I AD + Y E
Sbjct: 334 VGSQGSYLFYLDPHHTRKAIP-----YHADVTKYTEE 365
>gi|346975631|gb|EGY19083.1| peptidase family C54 protein [Verticillium dahliae VdLs.17]
Length = 449
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 128/297 (43%), Gaps = 52/297 (17%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 163
+A DEA+ G + F DF S+ ++YR F+PI S
Sbjct: 98 LAYDEAMNQDGG--WPSAFLDDFESKFWMTYRSDFEPIAKSTDPRAASVLSLSMRIKSQF 155
Query: 164 -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
+SD GWGCM+RS Q L+A A+ LGR WR+ + +R+ +L F D
Sbjct: 156 MDQAGYSSDSGWGCMIRSGQSLLANAMAVLDLGRDWRRGVAAEKERQ---LLSKFADDPK 212
Query: 219 SPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
+P+SIH +Q G A G G W GP A R +AL + +Y
Sbjct: 213 APYSIHRFVQHGAVACGKYPGEWFGPSATARCIQALVNANEPH---------LRVYST-- 261
Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
G P V D R + + P L+LV LG++K+ P Y L
Sbjct: 262 ------GDGPDVYED---RFFDIAKPSGETFHPTLILVGTRLGIDKITPVYWDALIAALQ 312
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHS 391
PQS+GI GG+P +S Y +G Q YLDPH + + +D +AD + H+
Sbjct: 313 MPQSIGIAGGRPSSSHYFIGAQGSFLFYLDPHHTRTALPYYQDPTLYAQADVDSVHT 369
>gi|358390472|gb|EHK39877.1| hypothetical protein TRIATDRAFT_208244 [Trichoderma atroviride IMI
206040]
Length = 452
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 124/272 (45%), Gaps = 47/272 (17%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVG 169
G A F +D SS+ ++YR GF+PI S +SD G
Sbjct: 113 GTGWPAGFVEDMSSKFWMTYRSGFEPIPKSVDPKAASALSFSMRIKSTLSDSAGFSSDSG 172
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A + RLGR WR+ + +R ++ +F D +P+SIHN ++
Sbjct: 173 WGCMIRSGQSLLATTIGILRLGRDWRRDQSQEEERH---LISMFADDPRAPYSIHNFVRH 229
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A A+C +A T L + IY + +D
Sbjct: 230 GATACGKYPGEWFGP-------SATAQCIQALTS--SSGLSLNIYSPNDGQD-------- 272
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ + S S GQ + P L+L+ LG++K+ P Y L PQS+GI GG+
Sbjct: 273 --VYEDSFMKIAKSDGQT-FNPTLILIRTRLGIDKITPIYWDALIAALHMPQSVGIAGGR 329
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
P +S Y VG Q YLDPH + I D
Sbjct: 330 PASSHYFVGSQGSYLFYLDPHHTRKAIPYHDD 361
>gi|30109219|gb|AAH41862.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
gi|119623096|gb|EAX02691.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
[Homo sapiens]
gi|119623098|gb|EAX02693.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
[Homo sapiens]
Length = 321
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 78/237 (32%), Positives = 114/237 (48%), Gaps = 32/237 (13%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 1 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 61 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 115
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 116 ADTAGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 168
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D T+H
Sbjct: 169 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND-QTFH 224
>gi|403356037|gb|EJY77606.1| Cysteine protease family C54 putative [Oxytricha trifallax]
gi|403376523|gb|EJY88241.1| Cysteine protease family C54 putative [Oxytricha trifallax]
Length = 480
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 77/252 (30%), Positives = 128/252 (50%), Gaps = 38/252 (15%)
Query: 144 FSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFH----RLGRPWRKPL 198
F S +YR + PIG S SD GWGCM+R+ QML+ QA++ H L + + +
Sbjct: 154 FKSVTWFTYRNELELPIGSSTYHSDAGWGCMVRTGQMLLFQAMMRHVFEDNLKYEYIEKI 213
Query: 199 QKPFDREYVEILHLFGDS---ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
+ + EY+ +L LF D+ + SP+SI N+ G G W GP A+ + L +
Sbjct: 214 TE-YREEYLNLLRLFQDNGEGQFSPYSIQNIAFQGLKIDRKPGDWYGPQAISIVLKRLTK 272
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILLL 314
+ P+ + + VC++ + + +V + DWT + ++
Sbjct: 273 IYK----------PVKQFTM------------YVCLE-GNIYLNVIQEKSKDWTQSVFIV 309
Query: 315 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES--AIYLDPHDVQ 372
+PL LGL + P Y+ +++ FTFPQ++GI GG+ ++ Y +G+ + S IYLDPH VQ
Sbjct: 310 IPLRLGLNYIEPEYLSSVKKVFTFPQNVGIAGGRENSALYFIGISDSSNNLIYLDPHLVQ 369
Query: 373 ---PVINIGKDD 381
P N+ ++
Sbjct: 370 KSVPTCNMQTNE 381
>gi|299738612|ref|XP_001834660.2| cysteine protease [Coprinopsis cinerea okayama7#130]
gi|298403389|gb|EAU87108.2| cysteine protease [Coprinopsis cinerea okayama7#130]
Length = 1034
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 94/269 (34%), Positives = 128/269 (47%), Gaps = 50/269 (18%)
Query: 140 FNQDFSSRILISYRKGF-DPIGDSKI-------------------------------TSD 167
F DF+SRI ++YR F PI D ++ +SD
Sbjct: 302 FYIDFTSRIWLTYRSHFPQPIKDGRLADLCGGPQPEPVASPVTKKSPWHWVGGEKSWSSD 361
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFS 222
GWGCMLR+ Q L+A AL+ LGR WRKP +Y V IL F D+ +PFS
Sbjct: 362 SGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVMTADYATYVHILTWFLDTPAPEAPFS 421
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
+H + AGK G G W GP + +AL E G+G +A+ V DG
Sbjct: 422 VHRMALAGKELGTDVGQWFGPSVAAGAIKALVNS-FPEAGIG-----VAVAV-----DGV 470
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
V + + W P+LLL+ + LG+E VNP Y T+++ +TFPQ
Sbjct: 471 LYQTDVHAASHGDHFGRTPRRHKRSWGDRPVLLLLGIRLGIEGVNPIYYDTIKMLYTFPQ 530
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPH 369
S+GI GG+P +S Y VG Q ++ YLDPH
Sbjct: 531 SVGIAGGRPSSSYYFVGSQADNLFYLDPH 559
>gi|340518098|gb|EGR48340.1| protease required for autophagy [Trichoderma reesei QM6a]
Length = 450
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 84/265 (31%), Positives = 123/265 (46%), Gaps = 47/265 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 176
F +D +++ ++YR GF+PI S +SD GWGCM+RS
Sbjct: 115 FTEDMAAKFWMTYRSGFEPIPKSVDPRATSALSFSVRIKSTLTDPTGFSSDSGWGCMIRS 174
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A + +LGR WR+ + +E ++ +F D +PFSIHN ++ G A G
Sbjct: 175 GQSLLATTIATLQLGRDWRRGKNQ---QEERRLISMFADDPRAPFSIHNFVRHGATACGK 231
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A A+C +A T L + +Y + +D V D
Sbjct: 232 FPGEWFGP-------SATAQCIQALTS--SSDLDLHVYSPNDGQDVYEDSFMKVAKPD-- 280
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
GQ D+ P L+L+ LG++K+ P Y L T PQS+GI GG+P +S Y
Sbjct: 281 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLIATLQMPQSVGIAGGRPSSSHYF 331
Query: 356 VGVQEESAIYLDPHDVQPVINIGKD 380
VG Q YLDPH + + +D
Sbjct: 332 VGSQGSYLFYLDPHHTRKALPYHED 356
>gi|149022064|gb|EDL78958.1| rCG26842 [Rattus norvegicus]
Length = 246
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/246 (30%), Positives = 112/246 (45%), Gaps = 53/246 (21%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 327
D + C V G AD W P+LL+VPL LG+ ++NP
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240
Query: 328 YIPTLR 333
YI +
Sbjct: 241 YIEAFK 246
>gi|410967384|ref|XP_003990200.1| PREDICTED: cysteine protease ATG4C [Felis catus]
Length = 459
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 95/346 (27%), Positives = 138/346 (39%), Gaps = 83/346 (23%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 224
QK R Y + I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPAVSQKETIRRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQ-------- 262
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + D +++L+P+ LG E+ N Y+ ++ ++L I
Sbjct: 263 DCTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGIL---RALNI 319
Query: 345 VG----GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
V KP S Y G Q++S IY+DPH Q +++ D +T
Sbjct: 320 VWVLLVAKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLET 365
>gi|256071261|ref|XP_002571959.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
gi|353229490|emb|CCD75661.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
Length = 376
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/247 (34%), Positives = 122/247 (49%), Gaps = 35/247 (14%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFS 222
TSD GWGCM R QML+AQAL+ H LGR WR + ++I+ F DS + SP S
Sbjct: 67 TSDCGWGCMFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLS 126
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSG 277
+H L+Q G W GP ++C A+ R + L + + +Y V+
Sbjct: 127 LHRLVQMSDR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYR 180
Query: 278 DE--DGERG------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLG 320
+E D RG P + D H +++ + Q+D T ILLL+PL+ G
Sbjct: 181 EEIIDLARGLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFG 236
Query: 321 L-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
++NPRYI + F+ P +G++GG+ S+Y VG Q S IYLDPH QP N+
Sbjct: 237 KGNRINPRYIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNS 296
Query: 380 DDLEADT 386
D+
Sbjct: 297 PKFSVDS 303
>gi|322795203|gb|EFZ18025.1| hypothetical protein SINV_08608 [Solenopsis invicta]
Length = 403
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 98/323 (30%), Positives = 146/323 (45%), Gaps = 64/323 (19%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
I + +W+LG + ++ L +D S++ +YRKGF PIG +S
Sbjct: 16 IPQTDEPVWILGRKYNAIKE-----------LDAIRRDIRSKLWFTYRKGFIPIGGCNST 64
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFS 222
TSD GWGCMLR QM++AQAL+ LG+ W+ P K + Y++IL F D + FS
Sbjct: 65 FTSDKGWGCMLRCGQMVLAQALITLHLGKDWQWMPETK--NNTYLKILSRFEDKRAAAFS 122
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIY 273
IH + G + G G W GP + + W +L + L +
Sbjct: 123 IHQIALTGASEGKEVGQWFGPNTIAQVLKKLIVYDEWSSLTIHVALDNTLIVNDILKQCR 182
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
+ G+ G P+ K + W P+LLL+PL LGL ++NP YI L+
Sbjct: 183 IEGGETAEADGEVPL--------------KAPSQWKPLLLLIPLRLGLSEINPVYINGLK 228
Query: 334 L--------------------TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
+ +F QSLG++GGKP + Y +G + IYLDPH Q
Sbjct: 229 VKFKILCMQKKKYICIQFFQTSFKISQSLGVIGGKPNLALYFIGCVGDEVIYLDPHTTQR 288
Query: 374 V----INIGKDDLEADTSTYHSE 392
I ++++E D TYH +
Sbjct: 289 SGSVEDKISEEEIEMDI-TYHCK 310
>gi|355755452|gb|EHH59199.1| Cysteine protease ATG4D, partial [Macaca fascicularis]
Length = 427
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 86/319 (26%), Positives = 137/319 (42%), Gaps = 61/319 (19%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 37 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 86
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 87 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 146
Query: 195 -------RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 147 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 202
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 203 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 249
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G
Sbjct: 250 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGXXXXXXXXXX 309
Query: 368 PHDVQPVINIGKDDLEADT 386
QP +++ + D ++
Sbjct: 310 XXXCQPTVDVSQADFPLES 328
>gi|296415785|ref|XP_002837566.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633439|emb|CAZ81757.1| unnamed protein product [Tuber melanosporum]
Length = 409
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 88/277 (31%), Positives = 130/277 (46%), Gaps = 49/277 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIG-----DSKIT----------------SDVGWGCMLRSSQ 178
F +DF S + ++YR F PI + K+T SD GWGCM+RS Q
Sbjct: 86 FLEDFESTLWMTYRSDFKPIPRVADYNDKLTFLTSIRSHLDKAEGFTSDSGWGCMIRSGQ 145
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAA 237
++A AL RLGR WR+ + KP E +L LF D +PFSIH ++ G+ G
Sbjct: 146 AVIANALAHLRLGRGWRRGM-KP--EEEKRLLALFADDPRAPFSIHKFVRHGEVECGKNP 202
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 297
G W GP A A C +A T + + +Y + ++ E V ++
Sbjct: 203 GEWFGP-------SAAAMCIQALTH-AYEPAGLRVYQTNSNDLYEEDFRKVAVVN----- 249
Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
VF P L+L + LG+E++ Y L PQ++GI GG+P +S Y +
Sbjct: 250 -GVFK-------PTLVLAGIRLGIERITNIYYEPLAACLRMPQTVGIAGGRPSSSHYFIA 301
Query: 358 VQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHS 391
VQ E+ YLDPH +P++ +D E + T H+
Sbjct: 302 VQGENFFYLDPHTCRPILPFKENPQDYTEEEVDTCHT 338
>gi|157115549|ref|XP_001658259.1| Autophagy-specific protein, putative [Aedes aegypti]
gi|108876876|gb|EAT41101.1| AAEL007228-PA [Aedes aegypti]
Length = 389
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 145/292 (49%), Gaps = 36/292 (12%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + +W+LG + +D L +D +R+ +YR+GF PIG S++T
Sbjct: 21 IPKTDDVVWILGKSYSATEDLDL-----------IRRDVQTRLWCTYRRGFVPIGGSQLT 69
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D GWGCMLR QM++AQAL LGR W + + Y++I++ F DS+ +PFS+H
Sbjct: 70 TDKGWGCMLRCGQMVLAQALTQLHLGRDWSWTPETT-NETYLKIVNRFEDSKAAPFSLHQ 128
Query: 226 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
+ G+ + G W GP + + + L + + I+V +
Sbjct: 129 IALTGESSEEKRVGEWFGPNTVAQVLKKLVKFD--------DWCSLVIHVALDN------ 174
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
+ D+ C S + W P+LL++PL LGL ++NP Y+ L+ F + G+
Sbjct: 175 ---TLATDEVLELCVDRSNPDS-WKPLLLIIPLRLGLSEINPIYVDGLKKCFELAGNCGM 230
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSE 392
VGG+P + Y +G + A+YLDPH VQ IG D+ E D T+H +
Sbjct: 231 VGGRPNQALYFIGYVADEALYLDPHTVQRSGTIGSKRDPDERELD-ETFHQK 281
>gi|158296556|ref|XP_316946.4| AGAP008497-PA [Anopheles gambiae str. PEST]
gi|157014766|gb|EAA12240.4| AGAP008497-PA [Anopheles gambiae str. PEST]
Length = 389
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 95/274 (34%), Positives = 140/274 (51%), Gaps = 31/274 (11%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + +W+LG + + D L QD SR+ +YR+GF PIG++++T
Sbjct: 21 IPKTNDTVWILGKQYNASDD-----------LEAIRQDVQSRLWCTYRRGFVPIGNTQLT 69
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D GWGCMLR QM++AQALL LGR W + D Y+ I++ F DS+ +PFS+H
Sbjct: 70 TDKGWGCMLRCGQMVLAQALLQLHLGRDWVWEAETR-DDIYLNIVNRFEDSKQAPFSLHQ 128
Query: 226 L-LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
+ L + G W GP + + + L + C+ + I+V +
Sbjct: 129 IALMGDSSEEKRIGEWFGPNTVAQVLKKLVKFDD-----WCR---LVIHVALDN------ 174
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D+ C V K W P+LL++PL LGL +VNP YI L+ F P S G+
Sbjct: 175 ---TVATDEIVELC-VDKKEPEAWKPLLLIIPLRLGLSEVNPIYIEGLKKCFQLPGSCGM 230
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
+GG+P + Y +G A+YLDPH VQ V +G
Sbjct: 231 IGGRPNQALYFIGYVGGEALYLDPHTVQRVGTVG 264
>gi|403413274|emb|CCL99974.1| predicted protein [Fibroporia radiculosa]
Length = 994
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 90/277 (32%), Positives = 129/277 (46%), Gaps = 49/277 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 167
F DF+SRI ++YR F+PI D+ + TSD
Sbjct: 309 FYSDFTSRIWLTYRSQFEPIRDTSLSALNYDMDERAAPTSSPQPKRWNWGLGGEKGWTSD 368
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDSETS--PFS 222
GWGCMLR+ Q L+A ALL LGR WR+P + + YV+I+ F D + PFS
Sbjct: 369 SGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPIYTADFATYVQIISWFLDDPSPLCPFS 428
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
+H + GK G G W GP + + L E GLG +A+ V D
Sbjct: 429 VHRMALVGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVS---VAVDGVIYQSDVY 484
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
+ + +H G+ W +L+L+ + LG++ VNP Y ++ +T PQ
Sbjct: 485 AVSRSTMGLGSPRKH------GRPSWGDRAVLVLIGIRLGIDGVNPIYYDLIKALYTLPQ 538
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
+LGI GG+P +S Y VG Q + YLDPH +P I +
Sbjct: 539 TLGIAGGRPSSSYYFVGSQANNLFYLDPHHARPTIPL 575
>gi|432871194|ref|XP_004071879.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
Length = 452
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 141/314 (44%), Gaps = 62/314 (19%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
S +S + LLG +++ +DEA + F + F+S + ++YR+GF + S +T+D
Sbjct: 70 SKSSPLILLGKSYEL-KDEANKE--------RFRRSFASLLWLTYRRGFPQLAGSSLTTD 120
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----------------------------- 198
GWGC+LR+ QML+A+ LL H + W +
Sbjct: 121 SGWGCVLRTGQMLLARGLLTHLMPPGWMWSVWYRAVKDDLDLPHHADCTDCKSNMRCRYQ 180
Query: 199 ------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
+P + + +++ F D +PF IH L++ G + G AG W GP +
Sbjct: 181 SLGSLYDRPLEAMHRKVVSWFADHPKAPFGIHRLVELGASSGKKAGDWYGPSIVA---HI 237
Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 312
L + A LP + V+ D + + D C W ++
Sbjct: 238 LQKAVAASV-----DLPNLVVYVAQD--------CTIYLQDVRGLCE--RPPPHSWKSVI 282
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
+LVP+ LG + +NP YI ++ +GI+GG+P S + VG Q++ +YLDPH Q
Sbjct: 283 ILVPVRLGGQDLNPSYISCVKKLLELQCCIGIIGGRPKHSLFFVGFQDDQLLYLDPHYCQ 342
Query: 373 PVINIGKDDLEADT 386
+N+ K++ ++
Sbjct: 343 LTVNVTKENFPLES 356
>gi|322707969|gb|EFY99546.1| ATG4 protein [Metarhizium anisopliae ARSEF 23]
Length = 430
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 85/256 (33%), Positives = 117/256 (45%), Gaps = 47/256 (18%)
Query: 138 AEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVGWGCML 174
A F DF+SR ++YR F DP + S TSD GWGCM+
Sbjct: 121 AAFLDDFASRFWMTYRSNFEIIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSGWGCMI 180
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 233
RS Q L+A A+ LGR WR+ + DRE +L LF D +P+SIHN ++ G+ Y
Sbjct: 181 RSGQSLLANAMAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSIHNFVRHGEKYC 237
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G W GP A R + L ++ E + IY G P + D+
Sbjct: 238 SKYPGEWFGPSATARCIQDLVNSRKQE---------LRIYST--------GDGPDIYEDN 280
Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
+ + + P L+LV LG++K+ P Y L + QS+GI GG+P +S
Sbjct: 281 FMK---IAKPDGEVFHPTLVLVGTRLGIDKITPVYWEALIASVQMSQSVGIAGGRPSSSH 337
Query: 354 YIVGVQEESAIYLDPH 369
Y VG Q YLDPH
Sbjct: 338 YFVGSQGHFLFYLDPH 353
>gi|392574855|gb|EIW67990.1| hypothetical protein TREMEDRAFT_63874 [Tremella mesenterica DSM
1558]
Length = 1159
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 82/248 (33%), Positives = 112/248 (45%), Gaps = 51/248 (20%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------------REYVEIL 210
+T+D GWGCMLR+ Q L+A AL+ LGR WR P Q YV IL
Sbjct: 580 LTTDAGWGCMLRTGQSLLANALIHLHLGRDWRVPSQPQVPPTSAAHLAELEAYSSYVRIL 639
Query: 211 HLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
F D S PFS+H + GK G G W GP + + L S
Sbjct: 640 SWFLDDPSPLCPFSVHRIALIGKELGKEVGEWFGPSTAAGALKTL-----------VNSF 688
Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------------W--T 309
P + V+ D +V D ++ S G +D W
Sbjct: 689 PPSGMAVATAVDS------IVYKSDVYSASNLQSTGWSDESAPPRRQSSSSRSSTSWGNR 742
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+L+L+ + LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y VG Q S +YLDPH
Sbjct: 743 AVLVLIGIRLGLDGVNPLYYESIKALFTFPQSVGIAGGRPSSSYYFVGTQANSLVYLDPH 802
Query: 370 DVQPVINI 377
+P + +
Sbjct: 803 FTRPAVPL 810
>gi|341903727|gb|EGT59662.1| CBN-ATG-4.1 protein [Caenorhabditis brenneri]
Length = 433
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 72/225 (32%), Positives = 109/225 (48%), Gaps = 28/225 (12%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 224
TSD GWGCMLR +QML+ + LL +GR + ++ Y +IL +F D + + +SIH
Sbjct: 49 TSDQGWGCMLRCAQMLLGEVLLRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIH 107
Query: 225 NLLQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ-SLPMAIYV 274
+ Q G G W GP + W +A + L + +L MA
Sbjct: 108 QIAQMGVTEGKEISKWFGPNTAAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTY 167
Query: 275 VSGD------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
S D E+G+ +H + + + +W P+LL++PL LGL +N Y
Sbjct: 168 PSEDAVKLIMENGQ-----------VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCY 216
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
+P ++ F PQ +GI+GGKP + Y VG+ YLDPH +P
Sbjct: 217 LPAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHYCRP 261
>gi|443893810|dbj|GAC71266.1| cysteine protease [Pseudozyma antarctica T-34]
Length = 1509
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/252 (32%), Positives = 118/252 (46%), Gaps = 45/252 (17%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----QKPFDRE-------------- 205
+T+D GWGCMLR+ Q L+A AL+ LGR W++ Q F E
Sbjct: 776 LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRETAPKSQIEFFEELANASLDASAENQS 835
Query: 206 -------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 250
Y+ IL F D S PF +H + + GK G G W GP +
Sbjct: 836 LASWRERRARHATYIRILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAGAI 895
Query: 251 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----A 306
+ L E G+ + ++ + D R A SR + S + A
Sbjct: 896 KQLV-FDFPEAGIAVELAHDGVFYL----DEVRAAASAST--GKSRASGMLSGNRRAETA 948
Query: 307 DWT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
W P+L+L+ + LGLE VNP Y +++ TF+FPQS+GI GG+P +S Y +G Q S Y
Sbjct: 949 VWRRPVLILIGIRLGLETVNPIYYESVKATFSFPQSVGIAGGRPSSSYYFMGHQGNSLFY 1008
Query: 366 LDPHDVQPVINI 377
LDPH+V+P + +
Sbjct: 1009 LDPHNVRPAVPL 1020
>gi|71022117|ref|XP_761289.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
gi|46097783|gb|EAK83016.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
Length = 1541
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 80/250 (32%), Positives = 119/250 (47%), Gaps = 45/250 (18%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK--PLQKPFD------------------ 203
+T+D GWGCMLR+ Q L+A ALL LGR W + P + D
Sbjct: 820 LTTDSGWGCMLRTGQSLLANALLNVHLGRSWLREAPPMRQMDFLEQLASLSLDSSVEMQS 879
Query: 204 ----RE-------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 250
RE Y++IL F D S PF +H + + GK G G W GP +
Sbjct: 880 LQEWREKRARHAAYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAGAI 939
Query: 251 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT- 309
+ L + + G+ + ++ + DE GA R +G A T
Sbjct: 940 KQLV-TEFPDAGIAVELAHDGVFYL--DEVRLAAGARSALQSGKGR------QGDAAVTW 990
Query: 310 --PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
P+++L+ + LGL+ VNP Y +++ TF+FP S+GI GG+P +S Y +G Q S YLD
Sbjct: 991 RRPVVILIGIRLGLDSVNPIYYESVKETFSFPHSVGIAGGRPSSSYYFMGHQGNSLFYLD 1050
Query: 368 PHDVQPVINI 377
PH+V+P + +
Sbjct: 1051 PHNVRPAVAL 1060
>gi|357528776|sp|Q5B7L0.2|ATG4_EMENI RecName: Full=Cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|259485537|tpe|CBF82642.1| TPA: Cysteine protease atg4 (EC 3.4.22.-)(Autophagy-related protein
4) [Source:UniProtKB/Swiss-Prot;Acc:Q5B7L0] [Aspergillus
nidulans FGSC A4]
Length = 402
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/341 (29%), Positives = 148/341 (43%), Gaps = 64/341 (18%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGV-----CHKIAQDEALGDAAGNN--------GLA 138
+RI + + P S IW LG C + DE+ G G
Sbjct: 11 KRIIQYIWDPEPKNDEEPGSPIWCLGTRYPPQCVEETADESRNPDHGQQQNTNTSAPGWP 70
Query: 139 E-FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 174
E F DF S+I ++YR F PI TSD GWGCM+
Sbjct: 71 EAFLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMI 130
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 233
RS Q L+A ++ LGR WR+ + E ++L LF DS +PFSIH+ ++ G +
Sbjct: 131 RSGQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFC 187
Query: 234 GLAAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + LA R ++ + +Y+ + D + V D
Sbjct: 188 GKHPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRD 238
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
+ KG P L+L+ L LG++++ Y L+ PQS+GI GG+P AS
Sbjct: 239 E---------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSAS 287
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHS 391
Y V VQ YLDPH+ +P + + E + +TYH+
Sbjct: 288 HYFVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHT 328
>gi|170036509|ref|XP_001846106.1| Autophagy-specific protein [Culex quinquefasciatus]
gi|167879174|gb|EDS42557.1| Autophagy-specific protein [Culex quinquefasciatus]
Length = 379
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 138/276 (50%), Gaps = 23/276 (8%)
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
H+I L +A L + +D SR+ +YR+GF PIG S+ TSD GWGCMLR QM
Sbjct: 13 HRIRCIFGLSNALETLDLDQIRRDVQSRLWCTYRRGFVPIGGSQHTSDKGWGCMLRCGQM 72
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGLAAG 238
++AQALL LGR W + D Y+ I++ F D++ +PFS+H + G+ + G
Sbjct: 73 VLAQALLQLHLGRDWEWTAETR-DETYLRIVNRFEDNKAAPFSLHQIALTGESSEEKRVG 131
Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
W GP + + + L + + ++V + D+ C
Sbjct: 132 EWFGPNTVAQVLKKLVKFD--------DWCSVVVHVALD---------STLATDEVVELC 174
Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
S W P+LL++PL LGL ++NP Y+ L+ F + G++GG+P + Y +G
Sbjct: 175 EDKSDAGTSWKPLLLIIPLRLGLSEINPIYVAGLKKCFELAGNCGMIGGRPNQALYFIGY 234
Query: 359 QEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 390
+ A++LDPH VQ NIG D+ E D S +
Sbjct: 235 VGDEALFLDPHTVQRSGNIGDKTGLDEREMDESFHQ 270
>gi|148226916|ref|NP_001087417.1| cysteine protease ATG4D [Xenopus laevis]
gi|61211765|sp|Q68FJ9.1|ATG4D_XENLA RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related protein 4 homolog D
gi|51260960|gb|AAH79754.1| MGC84754 protein [Xenopus laevis]
Length = 469
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 86/295 (29%), Positives = 131/295 (44%), Gaps = 54/295 (18%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
++ + F +DF SR+ ++YR+ F + + +T+D GWGCM+RS QML+AQ LL H L R
Sbjct: 93 DDEIERFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSRE 152
Query: 194 WR--KPLQKPF----------------------------------------DREYVEILH 211
W + L + F D+ + I+
Sbjct: 153 WTWSEALYRHFVEMEPIRSSSPPSMPLSSLATGHSAGDYQPHTQCSGAPHGDQVHRNIMR 212
Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
F D SPF +H L+ G +G AG W GP +A + + ++
Sbjct: 213 WFSDHPGSPFGLHQLVTLGSIFGKKAGDWYGP-------SIVAHIIKKAIETSSEVPELS 265
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
+Y VS D + + D + G+A +++LVP+ LG E NP Y
Sbjct: 266 VY-VSQDCTVYKADIEQLFAGDVPHAETSRGAGKA----VIILVPVRLGGETFNPVYKHC 320
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
L+ P LGI+GGKP S Y +G Q+ +YLDPH QP I+ K+D ++
Sbjct: 321 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQPYIDTSKNDFPLES 375
>gi|426329870|ref|XP_004025954.1| PREDICTED: cysteine protease ATG4C [Gorilla gorilla gorilla]
Length = 491
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 128/316 (40%), Gaps = 76/316 (24%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPTESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQ-------- 262
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 263 DCTVYNYDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQE 360
+GGKP S Y G QE
Sbjct: 323 IGGKPKQSYYFAGFQE 338
>gi|388856806|emb|CCF49593.1| related to ATG4-essential for autophagy [Ustilago hordei]
Length = 1572
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 86/270 (31%), Positives = 122/270 (45%), Gaps = 73/270 (27%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK---PL-QKPFDRE-------------- 205
+T+D GWGCMLR+ Q L+A AL+ LGR W++ PL Q+ F E
Sbjct: 830 LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRDAPPLRQQQFLEELAGLSIADAAEKES 889
Query: 206 -------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 250
Y++IL F D S PF +H + + GK G G W GP +
Sbjct: 890 LQEWRQKRARHATYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTASGAI 949
Query: 251 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD-------ASRHCSVFSK 303
+ L P A V DG V +D+ ++ SV S
Sbjct: 950 KQL-----------VSEFPQAGIAVELARDG------VFYLDEVRAAASASASAASVQSG 992
Query: 304 GQAD---------------WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 347
G+A W P+L+L+ + LGLE VNP Y +++ TF+FP S+GI GG
Sbjct: 993 GKARSSGAASGSRKGEGLIWRRPVLILIGIRLGLESVNPIYYESVKATFSFPHSVGIAGG 1052
Query: 348 KPGASTYIVGVQEESAIYLDPHDVQPVINI 377
+P +S Y +G Q S YLDPH+V+P + +
Sbjct: 1053 RPSSSYYFMGHQGNSLFYLDPHNVRPAVPL 1082
>gi|390594065|gb|EIN03481.1| hypothetical protein PUNSTDRAFT_56214 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1093
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 93/274 (33%), Positives = 127/274 (46%), Gaps = 55/274 (20%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 164
F DF+SR+ ++YR F PI D+ +
Sbjct: 369 FYADFTSRVWVTYRSHFQPIRDTTLSALESDFGEQAQSANTSGNSVVSGSPSSGRRWWGG 428
Query: 165 ----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-LQKPFDR--EYVEILHLFGDSE 217
TSD GWGCMLR+ Q L+A ALL LGR WR+P +P YV++L F DS
Sbjct: 429 EKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPSYPQPTAAYASYVQLLTWFFDSP 488
Query: 218 TS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
+ PFS+H + AGK G G W GP + + L A G G VV
Sbjct: 489 SPLCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVH---AFPGGGLGVAVAVDGVV 545
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
+ +P D+ RH + G +L+L+ + LGL+ VNP Y T++
Sbjct: 546 YETDVFSASHSP-----DSRRHHRTSTWGDRG---VLILIGIRLGLDGVNPIYYDTIKEL 597
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+T+PQS+GI GG+P +S Y VG Q +S YLDPH
Sbjct: 598 YTWPQSVGIAGGRPSSSYYFVGSQADSLFYLDPH 631
>gi|343428793|emb|CBQ72338.1| related to ATG4-essential for autophagy [Sporisorium reilianum SRZ2]
Length = 1505
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 79/253 (31%), Positives = 120/253 (47%), Gaps = 44/253 (17%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE------------------ 205
+T+D GWGCMLR+ Q L+A AL+ LGR W + + P R+
Sbjct: 785 LTTDSGWGCMLRTGQSLLANALINVHLGRSWMR--EAPPARQLEFLQELANLSLDTSAEK 842
Query: 206 ---------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
Y++IL F D S PF +H + + GK G G W GP
Sbjct: 843 QSLLEWRQKRARHSTYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAG 902
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDE-DGERGGAPVVCIDDASRHCSVFSKGQAD 307
+ + L + + GL + ++ + DE G + + AS + KG
Sbjct: 903 AIKQLV-SEFPDAGLAVELAHDGVFYL--DEVRAAAGASRQLGKGRASATGTNGRKGDTA 959
Query: 308 WT---PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
T P+L+L+ + LGL+ VNP Y +++ TF+FP S+GI GG+P +S Y +G Q S
Sbjct: 960 LTWHKPVLILIGIRLGLDSVNPIYYESVKATFSFPHSVGIAGGRPSSSYYFMGHQGNSLF 1019
Query: 365 YLDPHDVQPVINI 377
YLDPH+V+P + +
Sbjct: 1020 YLDPHNVRPAVAL 1032
>gi|58260832|ref|XP_567826.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134117209|ref|XP_772831.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338817600|sp|P0CQ11.1|ATG4_CRYNB RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|338817601|sp|P0CQ10.1|ATG4_CRYNJ RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|50255449|gb|EAL18184.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229907|gb|AAW46309.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1193
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 82/240 (34%), Positives = 110/240 (45%), Gaps = 28/240 (11%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 208
+TSD GWGCMLR+ Q L+ AL+ LGR WR P E Y +
Sbjct: 562 LTSDAGWGCMLRTGQSLLVNALIHIHLGRDWRVPSTPASFSEATTTQEIAALKDYAKYAQ 621
Query: 209 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
+L F D S PFS+H + GK G G W GP + + LA A G+
Sbjct: 622 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 680
Query: 267 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPL 317
+ +I Y S D +P R +K + W +L+LV +
Sbjct: 681 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRRGDNEAK-EEKWGKRAVLILVGV 739
Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y VG Q YLDPH +P I +
Sbjct: 740 RLGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 799
>gi|448114689|ref|XP_004202639.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
gi|359383507|emb|CCE79423.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
Length = 480
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 120/286 (41%), Gaps = 70/286 (24%)
Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 159
LG G++ E +D SRI +YR GF+PI
Sbjct: 69 LGRRYGSSSKEEMEKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128
Query: 160 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 213
+ T+DVGWGCM+R+SQML+A A LGR + ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAFQLLLLGRDF--AYVDGSEKKHSDIIDMF 186
Query: 214 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
D +PFS+HN ++A L G W GP A S + L + Q
Sbjct: 187 TDEPKTPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQF------------- 233
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG--------QADWTPILLLVPLVLGLEK 323
DG + V I S C ++ + IL+L+P+ LGL K
Sbjct: 234 --------DGSVSPSFRVII---SESCDIYDDKIGKLLQEIENSEDAILILLPVRLGLNK 282
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
V+P Y +L F Q +GI GGKP +S Y G +YLDPH
Sbjct: 283 VSPYYHDSLSSLFCSSQLVGIAGGKPSSSYYFFGSHNGHLLYLDPH 328
>gi|358369016|dbj|GAA85631.1| autophagy cysteine endopeptidase Atg4 [Aspergillus kawachii IFO
4308]
Length = 378
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/324 (29%), Positives = 143/324 (44%), Gaps = 50/324 (15%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQD-EALGDAAGNNGLAE----------- 139
+RI + + P TS IW LG+ + +D G+ N +
Sbjct: 11 KRIVQYLWDPEPRNDEDPTSSIWCLGIEYHPEKDVSPRGETPDKNSARDNTTGTTNYRKP 70
Query: 140 --------FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
F DF SRI ++YR F PI ++ D M S L+A AL LG
Sbjct: 71 SEHAWPESFLLDFESRIWMTYRSNFPPI--PRVEGDDKSASMTLGS--LLANALSTLVLG 126
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSW 250
R WR+ + F+ E ++L LF D+ T+PFS+H ++ G ++ G G W GP A +
Sbjct: 127 RDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKFPGEWFGPSATAKCI 183
Query: 251 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 310
EAL+ C S + +YV + + + R +V + P
Sbjct: 184 EALSS--------QCGSPTLKVYVSNDTSEVYQ-----------DRFMNVARNSSGVFQP 224
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
L+L+ LG++ + P Y L+ T PQS+GI GG+P AS Y VG Q YLDPH
Sbjct: 225 TLILLGTRLGIDHITPVYWDGLKATLQLPQSVGIAGGRPSASHYFVGAQGSHLFYLDPHY 284
Query: 371 VQPVI---NIGKDDLEADTSTYHS 391
+P + G+ + + TYH+
Sbjct: 285 TRPALPDRQGGELYSKEEVDTYHT 308
>gi|321263995|ref|XP_003196715.1| hypothetical protein CGB_K2500C [Cryptococcus gattii WM276]
gi|317463192|gb|ADV24928.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1188
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/239 (33%), Positives = 109/239 (45%), Gaps = 26/239 (10%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 208
+TSD GWGCMLR+ Q L+ AL+ LGR WR P E Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLINALIHVHLGRDWRLPSTPATFSEATTSQEIAALKDYAKYAQ 619
Query: 209 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
++ F D S PFS+H + GK G G W GP + + LA A G+
Sbjct: 620 MVSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGTLKTLANS-FAPCGIAVA 678
Query: 267 SLPMAI------YVVSG--DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 318
+ +I Y S +D R RH + +G+ +L+LV +
Sbjct: 679 TATDSIIYRSDVYAASNLPSDDWNRISPTFNPSRKKKRHNAEAKEGKWGERAVLILVGIR 738
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
LGL+ VNP Y +++ FTFPQ+ G GG+P +S Y VG Q YLDPH +P I +
Sbjct: 739 LGLDGVNPIYYDSIKALFTFPQAGGSAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 797
>gi|405119256|gb|AFR94029.1| peptidase family C54 protein [Cryptococcus neoformans var. grubii
H99]
Length = 1185
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 81/240 (33%), Positives = 113/240 (47%), Gaps = 28/240 (11%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------QKPFDRE---------YVE 208
+TSD GWGCMLR+ Q L+ AL+ LGR WR P + ++E Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLVNALIHVHLGRDWRVPSTPASFSEATTNQETAALKDYAKYAQ 619
Query: 209 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
+L F D S PFS+H + GK G G W GP + + LA A G+
Sbjct: 620 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 678
Query: 267 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPL 317
+ +I Y S D +P R +K + W +L+LV +
Sbjct: 679 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRGGDNKAK-EGKWGKRAVLILVGI 737
Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y +G Q YLDPH +P I +
Sbjct: 738 RLGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFIGSQANHLFYLDPHLTRPAIPL 797
>gi|444525500|gb|ELV14047.1| Cysteine protease ATG4D [Tupaia chinensis]
Length = 431
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/321 (27%), Positives = 139/321 (43%), Gaps = 84/321 (26%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 60 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 109
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 110 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSRSASPSRYHGPAH 169
Query: 195 -RKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
R P L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 170 WRPPRWAQGTPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-- 225
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV +D A VV +
Sbjct: 226 -----SLVAHILRKAVESCSEVTRLVVYV---SQDCTVYKADVVRL-------VARPDPA 270
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++L T P ++ +Y
Sbjct: 271 AEWKSVVILVPVRLGGETLNPVYVPCVKLMPTPP-------------------TDDFLLY 311
Query: 366 LDPHDVQPVINIGKDDLEADT 386
LDPH QP +++ + D ++
Sbjct: 312 LDPHYCQPTVDVSQADFPLES 332
>gi|113931596|ref|NP_001039246.1| autophagy related 4D, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|89273389|emb|CAJ82151.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
gi|114108226|gb|AAI22932.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
Length = 470
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/320 (27%), Positives = 138/320 (43%), Gaps = 70/320 (21%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
S ++ ++LLG + D+ + F +DF SR+ ++YR+ F + + +T+D
Sbjct: 76 SRSAPVYLLGERYYFRLDDEID---------RFQKDFVSRVWLTYRRDFPALEGTALTTD 126
Query: 168 VGWGCMLRSSQMLV---------------AQALLFH------------------------ 188
GWGCM+RS QML+ ++AL H
Sbjct: 127 CGWGCMIRSGQMLLAQGLLLHLLSREWTWSEALYTHFVEMEPIRSSSPSSMPLSLATDHS 186
Query: 189 -RLGRPWRKPLQKPFDRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 246
R +P + P+ E + I+ F D ++PF +H ++ G +G AG W GP
Sbjct: 187 GRHSQPQTHCSRAPYGGEVHQNIVSWFSDHASAPFGLHRMVALGSIFGKRAGDWYGP--- 243
Query: 247 CRSWEALARCQRAETGLGCQSLPMAIYVVSG----DEDGERGGAPVVCIDDASRHCSVFS 302
+A + + +++YV D E+ A V D SR
Sbjct: 244 ----SIVAHIIKKAIESSSEVPDLSVYVSQDCTVYKADIEQLFAGEVPHTDTSR-----G 294
Query: 303 KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES 362
G+A +++LVP LG E NP Y L+ P LGI+GGKP S Y +G Q+
Sbjct: 295 AGKA----VIILVPARLGGETFNPVYKHCLKEFLRMPSCLGIIGGKPKHSLYFIGYQDNY 350
Query: 363 AIYLDPHDVQPVINIGKDDL 382
+YLDPH QP I+ +D+
Sbjct: 351 LLYLDPHYCQPYIDTSRDNF 370
>gi|62899792|sp|Q8NJJ3.1|ATG4_PICPA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4; AltName:
Full=Pexophagy zeocin-resistant mutant protein 8
gi|21585563|gb|AAL25849.1| Paz8 [Komagataella pastoris]
Length = 533
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 117/267 (43%), Gaps = 50/267 (18%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 178
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 237
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 295 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
+ G +D +TPIL+L+ + LG+EKVN LR + QS+GI G K
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNLYIGDLLRECLSLKQSVGISGRKTSFLA 281
Query: 354 YI-VGVQEESAIYLDPHDVQPVINIGK 379
+ +G Q + YL P + + GK
Sbjct: 282 LLSIGFQGDYLFYLIPTFPKKALTFGK 308
>gi|67526025|ref|XP_661074.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
gi|40743824|gb|EAA63010.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
Length = 379
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 88/279 (31%), Positives = 129/279 (46%), Gaps = 50/279 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF S+I ++YR F PI TSD GWGCM+RS
Sbjct: 50 FLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMIRS 109
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A ++ LGR WR+ + E ++L LF DS +PFSIH+ ++ G + G
Sbjct: 110 GQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFCGK 166
Query: 236 AAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R + LA R ++ + +Y+ + D + V D+
Sbjct: 167 HPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRDE- 216
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
KG P L+L+ L LG++++ Y L+ PQS+GI GG+P AS Y
Sbjct: 217 --------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSASHY 266
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHS 391
V VQ YLDPH+ +P + + E + +TYH+
Sbjct: 267 FVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHT 305
>gi|19115683|ref|NP_594771.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe
972h-]
gi|62899818|sp|Q9P373.1|ATG4_SCHPO RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|9588465|emb|CAC00556.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe]
Length = 320
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 118/280 (42%), Gaps = 50/280 (17%)
Query: 91 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 150
M R ER L + T + IW LG +KI + +F D S I I
Sbjct: 4 MARFLERYLHFAPTNTEPPGTLIWFLGHSYKIEDSQ---------WPEKFLYDSFSLITI 54
Query: 151 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
+YR G + G +TSD GWGCM+RS+Q L+A L R+ P +++ EIL
Sbjct: 55 TYRSGIE--GLENMTSDTGWGCMIRSTQTLLANCL---RICYP---------EKQLKEIL 100
Query: 211 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
LF D ++PFSIH + GK + G W GP C +AR +P
Sbjct: 101 ALFADEPSAPFSIHQFVTMGKTLCDINPGQWFGPTTSC---SCVARLSDQNP-----DVP 152
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 329
+ +YV R V P+LLL+P LG++ +N Y
Sbjct: 153 LHVYVARNGNAIYRDQLSKVSF------------------PVLLLIPTRLGIDSINESYY 194
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
L F +GI GG+P ++ Y Q + YLDPH
Sbjct: 195 DQLLQVFEIRSFVGITGGRPRSAHYFYARQNQYFFYLDPH 234
>gi|426230580|ref|XP_004009345.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Ovis
aries]
Length = 438
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 117/254 (46%), Gaps = 18/254 (7%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
G + F +DF SR+ ++YR+ F P+ +TSD GWGCMLRS QM++AQ LL H L R
Sbjct: 104 GEGDIQRFQRDFVSRLWLTYRRDFPPLAGGTLTSDCGWGCMLRSGQMMLAQGLLLHLLPR 163
Query: 193 PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
W Q P G+A G AG W GP
Sbjct: 164 DWTWS-QGAGLGPAEPPGLGSPSPGPGPXXXXXXXSWGRAPGKKAGDWYGP-------SL 215
Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 312
+A R C + + VS D V D +R + S A+W ++
Sbjct: 216 VAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVAR-SDPTAEWKSVV 265
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
+LVP+ LG E +NP Y+P ++ LGI+GG P S Y +G Q++ +YLDPH Q
Sbjct: 266 ILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGTPRHSLYFIGYQDDFLLYLDPHYCQ 325
Query: 373 PVINIGKDDLEADT 386
P +++ + D ++
Sbjct: 326 PTVDVSQADFPLES 339
>gi|294654609|ref|XP_456671.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
gi|218511938|sp|Q6BYP8.2|ATG4_DEBHA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|199429011|emb|CAG84627.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
Length = 492
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 128/292 (43%), Gaps = 74/292 (25%)
Query: 130 DAAGNNGLAEFNQDFSSRILISYRKGFDPIG----------------------------- 160
D + ++G+ E QD S+I ++YR GF+PI
Sbjct: 77 DISVDDGVIE--QDIYSKIWLTYRTGFEPIAKCLDGPQPLSFVQSMVFNRNPISSTFNNF 134
Query: 161 -----DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW-----RKPLQKPFDREYVEIL 210
+ T+DVGWGCM+R+SQ L+A LGR + R P + EI+
Sbjct: 135 HGLLDNDNFTTDVGWGCMIRTSQALLANTYQLLFLGRGFSYGRDRSP-------RHDEII 187
Query: 211 HLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
+F D +PFS+HN ++ L G W GP A S + L C +
Sbjct: 188 DMFMDEPRAPFSLHNFIKVASESPLKVKPGQWFGPNAASLSIKRL-----------CDN- 235
Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP----ILLLVPLVLGLEKV 324
+Y +G G VV + ++ + + ++ P IL+L+P+ LG++KV
Sbjct: 236 ---VYESNG-----TGRVKVVISESSNLYDDIITQMFTTLNPVPDAILVLLPVRLGIDKV 287
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
NP Y ++ QS+GI GGKP +S Y G + +YLDPH Q V N
Sbjct: 288 NPLYHASVLELLALRQSVGIAGGKPSSSFYFFGYKGNDLLYLDPHYPQFVRN 339
>gi|443917360|gb|ELU38094.1| peptidase family c54 domain-containing protein [Rhizoctonia solani
AG-1 IA]
Length = 808
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 92/287 (32%), Positives = 126/287 (43%), Gaps = 71/287 (24%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 164
F +DF+S I ++YR + PI D+ +
Sbjct: 142 FYEDFTSLIWLTYRSHYTPIRDTSLESLAPLGPCDMEMAPAHLVPASPRRWNWPGSADKS 201
Query: 165 -TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDSET-- 218
TSD GWGCMLR+ Q L+A AL+ LGR WR+P F E YV+IL F D+ +
Sbjct: 202 WTSDAGWGCMLRTGQSLLANALIHLHLGRNWRRPHYPMFAEEHAVYVKILTWFFDTPSPL 261
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR----CQRAETGLGCQSLPMAIYV 274
+PF +H + AGKA G G+W GP S + LA CQ + L A V
Sbjct: 262 APFGVHRMALAGKALGKDVGTWFGPSTAAGSIKTLAHAFPECQLS-VSLAVDGTVFASDV 320
Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTL 332
+ G V + SK G+A +L+LV + LGL+ VNP Y L
Sbjct: 321 YAASHMGM-----VTTSGRSISSRRSASKWGGRA----VLILVNIRLGLDNVNPIYYDAL 371
Query: 333 RLTFTFPQSLGIVGGKP--GASTYIVGVQEESAIYLDPHDVQPVINI 377
++ G+P G+S Y VG Q +S YLDPH +P I +
Sbjct: 372 KV------------GRPRQGSSYYFVGSQADSLFYLDPHHTRPYIPL 406
>gi|354544955|emb|CCE41680.1| hypothetical protein CPAR2_802300 [Candida parapsilosis]
Length = 423
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 82/238 (34%), Positives = 118/238 (49%), Gaps = 44/238 (18%)
Query: 161 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 220
+ TSD GWGCM+R+SQ L+A ALL +L + Q ++IL LF D TSP
Sbjct: 138 NDNFTSDAGWGCMIRTSQNLLAIALL--KLSEEHNESAQ-------LDILKLFQDDPTSP 188
Query: 221 FSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSG 277
FS+HN ++ + L G W GP A S + L ++ ET P I V
Sbjct: 189 FSLHNFIRVASSSPLLVKPGQWFGPNAASLSIKKLTIEAKKLET-------PGEIPYVYI 241
Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
E+ + DD +F++ Q P+LLL P+ LG+++VN Y ++ +
Sbjct: 242 SENAD-------LFDDEIE--DLFNEEQK---PLLLLFPVRLGIDQVNKYYYKSILQLLS 289
Query: 338 FPQSLGIVGGKPGASTYIVGVQEES-AIYLDPHDVQPV---INIGKDDLEADTSTYHS 391
P S+GI GGKP +S Y +G + E+ +Y DPH Q V INI +TYH+
Sbjct: 290 LPYSVGIAGGKPSSSFYFIGYENENHLLYFDPHLPQVVEAPINI---------TTYHT 338
>gi|169622773|ref|XP_001804795.1| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
gi|160704853|gb|EAT78153.2| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
Length = 357
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 110/230 (47%), Gaps = 42/230 (18%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
N + F DF SR+ ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
+RS Q ++A AL RLGR WR + D+++ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209
Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + LA R E GL +Y VSGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVY-VSGD------GADV--YE 252
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
D + +V G W P L+LV LG++K+ P Y L++ P L
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKIREMDPSML 300
>gi|116179672|ref|XP_001219685.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
gi|88184761|gb|EAQ92229.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
Length = 425
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 86/267 (32%), Positives = 116/267 (43%), Gaps = 70/267 (26%)
Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
F DF SRI ++YR GF+PI GD + +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 235
Q L+A ALL +LGR WR+ +R I+ LF D +P+S+ N ++ G A G
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAER---NIVALFADDARAPYSLQNFVKHGAIACGK 229
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA + + IY G P V D
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
S + + D + PTL L QS+GI GG+P +S Y
Sbjct: 270 ---SFLATARPD-----------------GETFHPTLIL---MEQSIGIAGGRPSSSHYF 306
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDL 382
VGVQ + YLDPH +P + ++ L
Sbjct: 307 VGVQRQWLFYLDPHHPRPALQYRENPL 333
>gi|216963257|gb|ACJ73915.1| autophagy-related 4b variant 3 [Zea mays]
Length = 178
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 65/163 (39%), Positives = 88/163 (53%), Gaps = 32/163 (19%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHER 97
S+ K S+LS +F ++FE + S++ A K + A R+ +RR+
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRI-----LRRVS-- 97
Query: 98 VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
++E G + ++G A F +DFSSRI I+YRKGFD
Sbjct: 98 -------------------------PEEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFD 132
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 133 AIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 175
>gi|302498547|ref|XP_003011271.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
benhamiae CBS 112371]
gi|291174820|gb|EFE30631.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
benhamiae CBS 112371]
Length = 437
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 84/272 (30%), Positives = 125/272 (45%), Gaps = 49/272 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPI--------GDSK-----------------ITSDVGWGCM 173
+F DF S++ I+YR F PI GDS TSD GWGCM
Sbjct: 145 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSISLGVRLRSQLIDTQGFTSDTGWGCM 204
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KA 232
+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G A
Sbjct: 205 IRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGATA 261
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCI 291
G G W GP A + +AL + + GL +Y+ S G + E+ V C
Sbjct: 262 CGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVAC- 312
Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
+ P L+L+ + LG+++V P Y +L+ FPQS+GI G + +
Sbjct: 313 ----------DESGGGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGPEELS 362
Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+ + ++ +DP + + +DD E
Sbjct: 363 TYHTRRLRRLHVREMDPSMLIGFLVRDEDDWE 394
>gi|403296347|ref|XP_003939073.1| PREDICTED: cysteine protease ATG4D [Saimiri boliviensis
boliviensis]
Length = 463
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 93/346 (26%), Positives = 141/346 (40%), Gaps = 96/346 (27%)
Query: 76 NGWTAAVKRLV-TAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGN 134
NG AV R++ AG + SRT S +S + +C + + E GD
Sbjct: 80 NGIAVAVMRVLHLAGRCPHVSPGWAVKSRTSFSKISS----IHLCGRRYRFEGEGD---- 131
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
+ F +DF SR+ ++YR+ F P+ +TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 132 --IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDW 189
Query: 195 R---------KPLQKPF-------------------------DREYVEILHLFGDSETSP 220
L P +R + +I+ F D +P
Sbjct: 190 TWAEGTGLGPPELSGPASPSRYHGPARWMPPCWAQGAPELEQERRHRQIVSWFADHPQAP 249
Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
F +H L++ G++ G AG W GP +A R + + +YV
Sbjct: 250 FGLHRLVELGQSSGKKAGDWYGP-------SLVAHILRKAVESSSEVTRLVVYV------ 296
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
S+ C+ G+ TP L + LR
Sbjct: 297 --------------SQDCT----GKGTCTPSLQEL----------------LRCELC--- 319
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
LGI+GGKP S Y +G Q++ +YLDPH QP +++ + + ++
Sbjct: 320 -LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLES 364
>gi|402219068|gb|EJT99143.1| hypothetical protein DACRYDRAFT_70366 [Dacryopinax sp. DJM-731 SS1]
Length = 1093
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/244 (32%), Positives = 114/244 (46%), Gaps = 29/244 (11%)
Query: 160 GDSKITSDVGWGCMLRSSQMLVAQALL-------------FHRLGRPWRKPLQKPFDRE- 205
G +TSD GWGCMLR+ QML+A +L+ + P P + DR+
Sbjct: 431 GRGDLTSDAGWGCMLRTGQMLLANSLVALHVPPLPPNPVYINNFPAPSLPPSET--DRQR 488
Query: 206 ---YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
YV+IL F D + PFS+H L AG G G W GP S + L A
Sbjct: 489 FEAYVKILVWFLDDPSIWCPFSVHRLALAGADMGREVGQWFGPSIAAGSIKKLVSAFPA- 547
Query: 261 TGLGCQSLP------MAIYVVSGDEDGERGGAPVVCIDD-ASRHCSVFSKGQADWTPILL 313
GLG P A++ S + + D +R + K + +L+
Sbjct: 548 CGLGVVVPPDQIIHETAVFTASHTPTLPSSASSLSNTRDREARERANRMKEEWGDRAVLI 607
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
L+ L LG+E V P Y +++ FTFPQ++GI GG+P +S Y VG Q + YLDPH +P
Sbjct: 608 LIGLRLGIEGVTPIYYDSVKALFTFPQTVGIAGGRPSSSYYFVGTQGDHLFYLDPHSTRP 667
Query: 374 VINI 377
+ +
Sbjct: 668 AVPL 671
>gi|406698456|gb|EKD01693.1| hypothetical protein A1Q2_04064 [Trichosporon asahii var. asahii
CBS 8904]
Length = 1295
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 118/281 (41%), Gaps = 49/281 (17%)
Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
D G A N GL+ SR G+ G+ +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559
Query: 185 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 228
L+ LGR WR P QKP YV +L F D S PFS+H
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
GK G G W GP + + LA P + VVS + G
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLANS----------FPPCGLSVVSAAD----GSVFR 665
Query: 289 VCIDDASRHCSVFSKGQADWTP------------ILLLVPLVLGLEKVNPRYIPTLRLTF 336
+ AS + ++ G P +L+++P LGL+ VNP Y ++
Sbjct: 666 SEVYQASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK--- 722
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
S+GI GG+P +S Y V Q S YLDPH +P + +
Sbjct: 723 ----SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759
>gi|340508502|gb|EGR34192.1| hypothetical protein IMG5_021070 [Ichthyophthirius multifiliis]
Length = 285
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 75/232 (32%), Positives = 107/232 (46%), Gaps = 39/232 (16%)
Query: 144 FSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
F S I I+YR+ F P+ + SD GWGCM+R QM +A+ L K
Sbjct: 2 FESIIWITYRRKFPPLKAPQYEYISDTGWGCMIRVGQMALAEGL--------------KR 47
Query: 202 FDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAE 260
F + EI+ LF D + S FSI N+ +AGK + L AG W P +C + L +
Sbjct: 48 FQIKEDEIIDLFQDKKDSLFSIQNICEAGKEEFKLEAGDWFNPIRICYILQILNEKK--- 104
Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 320
G + L I +S D ++ +D S G ++L + LG
Sbjct: 105 ---GFKDL--KIRTISSDR--------ILIFEDLEMEFSSEKNG------LILFLVCKLG 145
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
LEK Y+ F + S+G++GGKP + + VG E+ IYLDPH VQ
Sbjct: 146 LEKTEENYLKIALKIFDYKNSIGMIGGKPKKALFFVGRIEDQLIYLDPHYVQ 197
>gi|401886473|gb|EJT50506.1| hypothetical protein A1Q1_00204 [Trichosporon asahii var. asahii
CBS 2479]
Length = 1295
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 118/281 (41%), Gaps = 49/281 (17%)
Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
D G A N GL+ SR G+ G+ +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559
Query: 185 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 228
L+ LGR WR P QKP YV +L F D S PFS+H
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
GK G G W GP + + LA P + VVS + G
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLANS----------FPPCGLSVVSAAD----GSVFR 665
Query: 289 VCIDDASRHCSVFSKGQADWTP------------ILLLVPLVLGLEKVNPRYIPTLRLTF 336
+ AS + ++ G P +L+++P LGL+ VNP Y ++
Sbjct: 666 SEVYQASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK--- 722
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
S+GI GG+P +S Y V Q S YLDPH +P + +
Sbjct: 723 ----SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759
>gi|50307871|ref|XP_453929.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|62899744|sp|Q6CQ60.1|ATG4_KLULA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49643063|emb|CAH01025.1| KLLA0D19536p [Kluyveromyces lactis]
Length = 450
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 121/271 (44%), Gaps = 52/271 (19%)
Query: 143 DFSSRILISYRKGFDPI-----GDSKIT------------------------SDVGWGCM 173
D SR+ +YR F PI G S I SD+GWGCM
Sbjct: 64 DVHSRVFFTYRTQFTPIRRNENGPSPINFTLFFRDNPINTLENALTDPDSFYSDIGWGCM 123
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA-GKA 232
+R+ Q L+A A+ +L R +R + D E + ++ F D P S+HN ++A K
Sbjct: 124 IRTGQALLANAIQRVKLAREFRINASRIDDNE-LNLIRWFQDDVKYPLSLHNFVKAEEKI 182
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G+ G W GP A RS + L E C I S D + D
Sbjct: 183 SGMKPGQWFGPSATARSIKTL-----IEGFPLCGIKNCIISTQSAD----------IYED 227
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
+ +R +F K + +LLL + LG++K+N Y + + P S+GI GGKP +S
Sbjct: 228 EVTR---IFHKDRD--ANLLLLFAVRLGVDKINSLYWKDIFKILSSPYSVGIAGGKPSSS 282
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
Y G Q E+ YLDPH+ Q ++ DDLE
Sbjct: 283 LYFFGYQNENLFYLDPHNTQQS-SLMMDDLE 312
>gi|410918329|ref|XP_003972638.1| PREDICTED: cysteine protease ATG4D-like [Takifugu rubripes]
Length = 499
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 98/323 (30%), Positives = 143/323 (44%), Gaps = 73/323 (22%)
Query: 120 HKIAQDEALGDAAGNNGLAE---FNQDFSSRILISYRKGFDPIGDSKITSDVGWGC---- 172
+KI+ LGD+ N E F F SRI ++YRK F + S T+D GWGC
Sbjct: 83 NKISPVTILGDSYLLNSEDEVERFRLAFVSRIWLTYRKEFPQLEGSTWTTDCGWGCMLRS 142
Query: 173 --MLRSSQMLV-----------AQAL------LFH-----RLG----------------- 191
ML + +LV AQ L +F R G
Sbjct: 143 GQMLLAQGLLVHLMPRGWTWPDAQPLTDVDLEVFRPRSPARAGGVPIPSFASPRGPSTPE 202
Query: 192 RPW----------RKPLQKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
RP +K L+ DR+ + +++ FGD T+PF IH L++ GK+ G A
Sbjct: 203 RPLLSEQATKCSRKKRLESVQDRQAEPTHQKLVFWFGDQPTAPFGIHQLVEIGKSAGKKA 262
Query: 238 GSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 296
G W GP + +A+AR + + +YV D + +C S+
Sbjct: 263 GDWYGPAIVAHILRKAVARASAVHS--------LVVYVAQ-DCTVYKEDVMHLCDPTPSQ 313
Query: 297 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
S QA W +++LVP+ LG E +NP YI ++ +GI+GGKP S Y V
Sbjct: 314 TPSDPLSHQA-WKSVIILVPVRLGGECLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFV 372
Query: 357 GVQEESAIYLDPHDVQPVINIGK 379
G Q+E +YLDPH QPV+++ +
Sbjct: 373 GFQDEQLLYLDPHYCQPVVDVSQ 395
>gi|355703136|gb|EHH29627.1| Cysteine protease ATG4D [Macaca mulatta]
Length = 511
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/319 (26%), Positives = 131/319 (41%), Gaps = 75/319 (23%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 129 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 182
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------------- 194
GWGCMLRS QM++AQ LL H L R W
Sbjct: 183 CGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRW 242
Query: 195 -RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 253
+ + +R + +I+ F D +PF +H L++ G++ G AG W GP +
Sbjct: 243 AQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLV 295
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS------KGQAD 307
A R C + + VS D +PV + + + +
Sbjct: 296 AHILRKAVE-SCSEVTRLVVYVSQDCTAAEASSPVSDTPASGPLHLLPLLLGVLFQQRCR 354
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W L V +L E LGI+GGKP S Y +G Q++ +YLD
Sbjct: 355 W----LFVCELLRCELC-----------------LGIMGGKPRHSLYFIGYQDDFLLYLD 393
Query: 368 PHDVQPVINIGKDDLEADT 386
PH QP +++ + D ++
Sbjct: 394 PHYCQPTVDVSQADFPLES 412
>gi|365988214|ref|XP_003670938.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
gi|343769709|emb|CCD25695.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
Length = 427
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 108/230 (46%), Gaps = 28/230 (12%)
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 223
+D+GWGCM+R+ Q L+ AL LGR WR + EI F D+ PFS+
Sbjct: 55 TDIGWGCMIRTGQSLLGNALQLRNLGRDWRFDDNTDLKMTEKSNEIASWFMDTPEKPFSL 114
Query: 224 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD---- 278
H + G + G G W GP A RS ++L + E G+ I V SGD
Sbjct: 115 HRFISKGMQLSGKKPGEWFGPAATARSIQSLVH-EFPECGID----KCLISVSSGDIYKT 169
Query: 279 --EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
ED G H GQ D T IL+L+ + LG+E +N Y ++R
Sbjct: 170 EVEDVFNEG-----------HTGEARNGQKDKT-ILILLGVKLGIETINRCYWDSIRRIL 217
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+ S+GI GG+P +S Y G Q + +Y DPH QP + K+DL +T
Sbjct: 218 SSEYSIGIAGGRPSSSLYFFGYQGDELLYFDPHSPQPSYD--KNDLFYET 265
>gi|350595874|ref|XP_003484197.1| PREDICTED: cysteine protease ATG4A [Sus scrofa]
Length = 393
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 126/293 (43%), Gaps = 68/293 (23%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PI W
Sbjct: 57 VWILGKQHLLKTEKS-----------KLLADISARLWFTYRRKFSPID---------WN- 95
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
W K ++P +EY IL F D + +SIH + Q G
Sbjct: 96 ---------------------WEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGVG 132
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGERGGAPV 288
G + G W GP + + + LA + +A+YV + ED ++
Sbjct: 133 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCCAS 184
Query: 289 VCIDDA-------SRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
DA S + S SKG + W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 185 ALSADAAVESRRDSLNASTQSKGPSACRPAWKPLLLIVPLRLGINQINPVYVDAFKECFK 244
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D T+H
Sbjct: 245 MPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGM-VDDQTFH 296
>gi|254584596|ref|XP_002497866.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
gi|238940759|emb|CAR28933.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
Length = 489
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 86/275 (31%), Positives = 123/275 (44%), Gaps = 52/275 (18%)
Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT-------------------- 165
+ +N +F D SR+ +YR F PI G S ++
Sbjct: 69 SKNSNENPDFLSDVRSRLHFTYRTRFMPIPAVPGGPSPLSFHFLIRENPINAIENAINNP 128
Query: 166 ----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
+DVGWGCM+R+ Q L+ AL RLGR +R + E + I+ F D +PF
Sbjct: 129 ACFNTDVGWGCMIRTGQSLLGNALQIARLGRGYR--IGSELKPEEISIIDWFVDIPDAPF 186
Query: 222 SIHNLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
SIHN + G G W GP A RS ++L R + CQ I V SGD
Sbjct: 187 SIHNFVSKGMELSSKRPGEWFGPAATSRSIQSLIRGFKQCGIDDCQ-----ISVSSGD-- 239
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
V +D + VF++ + + ILLL+ + LG+ VN Y ++
Sbjct: 240 --------VYEEDVMK---VFNESKD--SRILLLLGVKLGINAVNEFYWNDIKRLLGSKF 286
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
S+GI GG+P +S Y +G Q +YLDPH QP +
Sbjct: 287 SVGIAGGRPSSSLYFIGYQGNELLYLDPHTAQPFL 321
>gi|431905146|gb|ELK10197.1| Cysteine protease ATG4A [Pteropus alecto]
Length = 342
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 121/279 (43%), Gaps = 69/279 (24%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS-SRILISYRKGFDPIGDSKITSDVGWG 171
+W+LG H + D L E F+ + L ++ G P +SD GWG
Sbjct: 35 VWILGKQHLLKTD----------SLPEIISHFTETSELTAHDGGTGP------SSDAGWG 78
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
CMLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + +
Sbjct: 79 CMLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKM-- 136
Query: 232 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
C LP++ + + + G+P
Sbjct: 137 ---------------------------------CCILPLSADIATENP----SGSP---- 155
Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
+AS H S W P+LL+VPL LG+ ++NP Y+ + SLG +GGKP
Sbjct: 156 -NASNHSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFK-------SLGALGGKPNN 207
Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+ Y +G + I+LDPH Q ++ +++ D T+H
Sbjct: 208 AYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFH 245
>gi|258566559|ref|XP_002584024.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237907725|gb|EEP82126.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 377
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/233 (33%), Positives = 104/233 (44%), Gaps = 52/233 (22%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 174
A F DF SRI I+YR F I SK T+D GWGCM+
Sbjct: 90 AAFLDDFESRIWITYRSNFPAIPKSKDPNAQQALTFSVRLRSQLLDTRGFTTDTGWGCMI 149
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 233
RS Q L+A ALL +LGR WR+ + + + +L LF D +PFSIH ++ G A
Sbjct: 150 RSGQSLLANALLIQKLGRDWRRGSET---GKEIALLSLFADRPQAPFSIHRFVEHGAAAC 206
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G G W GP A ARC C+ + +YV S D +D
Sbjct: 207 GKHPGEWFGP-------SATARCIDE-----CEHAGLNVYVTSDGSD---------VHED 245
Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
R + G D P L+L+ + LG++ + P Y L+ +PQS+GI G
Sbjct: 246 KFRQIA----GLDDIKPTLILLGVRLGIDSITPVYWDALKAIIQYPQSVGIAG 294
>gi|390344344|ref|XP_786847.3| PREDICTED: uncharacterized protein LOC581768 [Strongylocentrotus
purpuratus]
Length = 1018
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 81/144 (56%), Gaps = 10/144 (6%)
Query: 113 IWLLGVC-HKIAQDEALGDAAGNNGLAE-----FNQDFSSRILISYRKGFDPIGDSKITS 166
IW LG C H+ +D G + + F QDFSSR+ ++YR+ F + S TS
Sbjct: 346 IWFLGKCYHQRPEDPDPERPPGMDSVRSMVIEMFKQDFSSRLWMTYRREFPTLAGSNFTS 405
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDS--ETSPFS 222
D GWGCMLRS QM++A +L+ H LGR W KP + + + +I+ FGD + SPFS
Sbjct: 406 DCGWGCMLRSGQMMLAHSLILHFLGREWNIYKPQTQEMLQFHRQIVRWFGDQPLDMSPFS 465
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAM 246
+H L+ G+ G G W GP ++
Sbjct: 466 VHRLVGIGQNNGKKVGDWYGPSSV 489
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 55/92 (59%)
Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
ID + S ++G W +++++P+ LG ++VNP YI ++ FT LGI+GGKP
Sbjct: 819 IDPSRSRTSTSTEGGKPWCAVVIMIPVRLGGDEVNPVYIRPIQSLFTLESCLGIIGGKPK 878
Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
S + VG QEE I+LDPH Q V+++ D
Sbjct: 879 HSLFFVGFQEEKLIHLDPHYCQQVVDMKTRDF 910
>gi|241958330|ref|XP_002421884.1| cysteine protease, putative [Candida dubliniensis CD36]
gi|223645229|emb|CAX39828.1| cysteine protease, putative [Candida dubliniensis CD36]
Length = 443
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 119/278 (42%), Gaps = 67/278 (24%)
Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------- 162
LG N+ A N S++ +SYR GF+PI S
Sbjct: 69 LGQIFDNSNAA--NNYIESKLWLSYRCGFEPIPKSIDGPQPIHFFPSIIFNRTTIYSNFA 126
Query: 163 ---------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 213
TSD GWGCM+R+SQ L+A LL K + + EI+ LF
Sbjct: 127 NLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEQEIVKLF 173
Query: 214 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
D SPFSIHN ++ + L G W GP A S + L + + G P
Sbjct: 174 QDDTKSPFSIHNFIRVASSSPLHVKPGEWFGPNAASLSIKRLTNELQDQEINGIN--PPR 231
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
+++ + DD R VF+K +++ +++L P+ LG++KVN Y +
Sbjct: 232 VFISENSD----------LFDDEIR--DVFAKEKSN--SVIILFPIRLGIDKVNSYYYNS 277
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+ + S GI GGKP +S Y +G ++ IY DPH
Sbjct: 278 IFHLLSSKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPH 315
>gi|410989159|ref|XP_004000832.1| PREDICTED: cysteine protease ATG4A isoform 2 [Felis catus]
Length = 336
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 78/279 (27%), Positives = 115/279 (41%), Gaps = 69/279 (24%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG-GKPGA 351
D + C V P S VG PG
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTVGESTPG- 202
Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
T Q + I+LDPH Q +N +++ D T+H
Sbjct: 203 -TLNASNQSDELIFLDPHTTQTFVNT-EENGTVDDQTFH 239
>gi|268536436|ref|XP_002633353.1| Hypothetical protein CBG06097 [Caenorhabditis briggsae]
Length = 411
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 76/233 (32%), Positives = 112/233 (48%), Gaps = 38/233 (16%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP-----------FDREYVE---IL 210
T+D GWGCM+R++QM+VAQA++ +R GR WR +K FD E ++ IL
Sbjct: 88 TTDCGWGCMIRTTQMMVAQAIMINRFGRNWRFVRRKKSHVTVNGEETEFDTEKMKEWMIL 147
Query: 211 HLFGDSETSPFSIHNLLQ-AGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
LF D ++P IH +++ A + G A G W P EA+ ++A T
Sbjct: 148 KLFEDKPSAPLGIHKMIEIAAREKGKRAVGCWYSPS------EAVFIMKKAITESASPLT 201
Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPR 327
+ +S D G + ++ ++H WT L+LV +V LG ++N
Sbjct: 202 GDTVMYLSID-----GRVHIRDLEVETKH----------WTKTLMLVIVVRLGAAELNRI 246
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
Y+P L F+ LGI GG+P S + VG + IYLDPH I I D
Sbjct: 247 YVPHLMRLFSMDSCLGITGGRPDHSCWFVGYYGDQVIYLDPHVAHEYIPIDMD 299
>gi|406606786|emb|CCH41822.1| putative cysteine protease atg4 [Wickerhamomyces ciferrii]
Length = 592
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/292 (31%), Positives = 127/292 (43%), Gaps = 49/292 (16%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG------- 160
S DIW H A+D D N EF D +RI ++YR F PI
Sbjct: 75 SGLKDIWQTLRFH-TAEDNEKDDL--NKWPQEFIDDVYTRIWLTYRTKFSPIDRDPEGPS 131
Query: 161 ----------------DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+ T+D GWGCM+R+SQ L+A ALL +GR WR +
Sbjct: 132 PLSLNFFLRGQNYDLDNEHFTTDCGWGCMIRTSQSLLANALLNLHIGRDWR--YTGELNE 189
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGL 263
+ EI+ F D + PFSIH ++ GK G W GP A RS ++L
Sbjct: 190 MHNEIVSWFIDCPSHPFSIHKIVDKGKLLSNKKPGEWFGPSAAARSIQSL---------- 239
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
C + V G + G+ V + A VF PIL+L+ L LG++
Sbjct: 240 -CNEFDSGVKVYIGSDSGDIYENDVFKV--AKDENGVFK-------PILILLGLRLGIDN 289
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
+NP Y +L+ +S+GI GG+P S Y G Q + YLDPH QP +
Sbjct: 290 INPVYWDSLKAILNSKESIGIAGGRPSTSHYFFGFQGDHLFYLDPHLPQPAL 341
>gi|342186623|emb|CCC96110.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 388
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 73/230 (31%), Positives = 109/230 (47%), Gaps = 33/230 (14%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
+ SYR+ F+P+ + TSDVGWGC +R+ QM++A A + +R G D V
Sbjct: 94 LYFSYRRQFEPLRNGA-TSDVGWGCTIRACQMMLAWAFMRYRNGG------SVTMDDNVV 146
Query: 208 EIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
+ L LF D T+PF IH + G +G+ G W GP M + AL R+ G
Sbjct: 147 DSLKEFTQRLFYDVPTAPFGIHAMTNEGVRHGVTCGMWFGPTPMAKVIGALNEAYRSSGG 206
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
G + L + D + G VV S+H ++LL+P+ LG +
Sbjct: 207 EGPEVLVAS--------DRQIGVQDVVVRLQRSQH-------------VVLLIPVKLGPQ 245
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
V+ Y L+ F S+G VGG+ ++ + G Q + I+LDPH VQ
Sbjct: 246 TVSVTYANALKRFFEMGSSIGAVGGEKNSAYFFFGYQGDKIIHLDPHYVQ 295
>gi|167393590|ref|XP_001740639.1| cysteine protease atg4 [Entamoeba dispar SAW760]
gi|165895180|gb|EDR22930.1| cysteine protease atg4, putative [Entamoeba dispar SAW760]
Length = 332
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 75/237 (31%), Positives = 109/237 (45%), Gaps = 27/237 (11%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 204
I I+YRK I + T+D GWGCM+RS QM++AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96
Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
+++ I++LFGDS S FSIH L+ G+ G W GP + A AE
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP--------SFASDIAAEHIN 148
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
+ YV + G G + SK + + P ++ VPL LG E
Sbjct: 149 EMRVFRTRGYVA---KLGSIVGPKI----------EELSKDEVGFNPCIIFVPLRLGPES 195
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
+ P L+ F PQ +G++GGKPG + Y + +LDPH Q I++ D
Sbjct: 196 PENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAIDMKGD 252
>gi|256071263|ref|XP_002571960.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
gi|353229491|emb|CCD75662.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
Length = 302
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 78/239 (32%), Positives = 115/239 (48%), Gaps = 35/239 (14%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAG 230
M R QML+AQAL+ H LGR WR + ++I+ F DS + SP S+H L+Q
Sbjct: 1 MFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLSLHRLVQMS 60
Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGDE--DGER 283
G W GP ++C A+ R + L + + +Y V+ +E D R
Sbjct: 61 DR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYREEIIDLAR 114
Query: 284 G------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLGL-EKVNPR 327
G P + D H +++ + Q+D T ILLL+PL+ G ++NPR
Sbjct: 115 GLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFGKGNRINPR 170
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
YI + F+ P +G++GG+ S+Y VG Q S IYLDPH QP N+ D+
Sbjct: 171 YIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNSPKFSVDS 229
>gi|326430141|gb|EGD75711.1| pyruvate water dikinase [Salpingoeca sp. ATCC 50818]
Length = 1055
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 116/247 (46%), Gaps = 38/247 (15%)
Query: 148 ILISYRKGFDPI-GDSKITSDVGWGCMLRSSQMLVAQALLFHRL--GRPWRKPLQKPFDR 204
+ ++YRKG+DPI GD+++TSD GWGC RS QML+AQAL+ + R R +P
Sbjct: 603 VWLTYRKGYDPIHGDAQLTSDTGWGCTYRSGQMLLAQALMSNAEPSARMQRLEGVRPSTW 662
Query: 205 EYVE----ILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 258
++ E +L +F DS + FSI ++ + G W+ P + + R
Sbjct: 663 QHEETKRAVLSMFQDSHDPAAFFSIQHMAETSFVVRKKPGQWLSPSEVAL---IIRRLNP 719
Query: 259 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 318
ETG+ V ++D G+ W P LL++PL
Sbjct: 720 PETGMR-----------------------VRIVNDTLLSTRRILAGEP-WMPTLLMIPLR 755
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE--SAIYLDPHDVQPVIN 376
GL+ + P +P F +P +G +GGKPG++ Y VG+ + +YLDPH + ++
Sbjct: 756 AGLDTLQPESVPAFVAFFDWPWCVGAIGGKPGSAYYYVGIDHDRRRVLYLDPHTTRSRLD 815
Query: 377 IGKDDLE 383
+ E
Sbjct: 816 LSNQAAE 822
>gi|156839152|ref|XP_001643270.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
70294]
gi|166990653|sp|A7TQN1.1|ATG4_VANPO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|156113873|gb|EDO15412.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
70294]
Length = 411
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 120/264 (45%), Gaps = 51/264 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 170
F D SRI +YR F PI S +D+GW
Sbjct: 74 FLSDVISRIHFTYRTKFIPIARSDDGPSPLRINFLIGDNPFNAIENAIYNPNCFNTDIGW 133
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
GCM+R+ Q L+A A+ LGR +R + + +I+ F D+ PFS+HN ++ G
Sbjct: 134 GCMIRTGQSLLANAIQIAILGREFRVN-DGDVNEQERKIISWFMDTPDEPFSLHNFVKKG 192
Query: 231 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
+ G W GP A RS ++L Q + G+ + ++ + DE
Sbjct: 193 CELSSKKPGEWFGPAATSRSIQSLVE-QFPDCGIDRCIVSVSSADIFKDE---------- 241
Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
I+D +F + ++ ILLL+ + LG++KVN Y+ +R S+GI GG+P
Sbjct: 242 -IND------IFKNKR--YSNILLLMGVKLGVDKVNEYYLKDIRKILESRYSVGISGGRP 292
Query: 350 GASTYIVGVQEESAIYLDPHDVQP 373
+S Y G Q+++ +Y DPH QP
Sbjct: 293 SSSLYFFGYQDDTLLYFDPHKPQP 316
>gi|156042330|ref|XP_001587722.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980]
gi|154695349|gb|EDN95087.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 414
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 83/248 (33%), Positives = 115/248 (46%), Gaps = 32/248 (12%)
Query: 140 FNQDFSSRILISYRKGFDPIG---DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
F DF ++I ++YR F I D K S + LRS LV Q G W
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRS--QLVDQGGFTSDTG--WGC 158
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 255
E +IL LF D +P+SIH ++ G A G G W GP A AR
Sbjct: 159 SSSN----EERKILSLFADDPRAPYSIHKFVEHGASACGKHPGEWFGP-------SAAAR 207
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
C +A T +S + +Y+ +GD G+ V S+ +TP L+LV
Sbjct: 208 CIQALTNSQVES-ELRVYI-TGD------GSDVY----EDTFMSIAKPNSTKFTPTLILV 255
Query: 316 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
LGL+K+ P Y L+ + PQS+GI GG+P +S Y +GVQE YLDPH +P +
Sbjct: 256 GTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYFIGVQESDFFYLDPHQTRPAL 315
Query: 376 NIGKDDLE 383
D++E
Sbjct: 316 PF-NDNVE 322
>gi|68485712|ref|XP_713234.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
gi|71152285|sp|Q59UG3.1|ATG4_CANAL RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|46434715|gb|EAK94117.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
Length = 446
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 82/270 (30%), Positives = 114/270 (42%), Gaps = 64/270 (23%)
Query: 141 NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 166
N S++ +SYR GF+PI S TS
Sbjct: 80 NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCM+R+SQ L+A LL K + + EI+ LF D +SPFSIHN
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDDTSSPFSIHNF 186
Query: 227 LQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
++ L G W GP A S + LA + + +P +S + D
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLASELLQDQEIDGIKIPRVF--ISENSD---- 240
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
DD R VF+K + +L+L P+ LG++KVN Y ++ S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
GGKP +S Y +G ++ IY DPH Q V
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV 321
>gi|68485607|ref|XP_713286.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
gi|46434768|gb|EAK94169.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
Length = 446
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 82/270 (30%), Positives = 114/270 (42%), Gaps = 64/270 (23%)
Query: 141 NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 166
N S++ +SYR GF+PI S TS
Sbjct: 80 NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCM+R+SQ L+A LL K + + EI+ LF D +SPFSIHN
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDGTSSPFSIHNF 186
Query: 227 LQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
++ L G W GP A S + L + L +P +S + D
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLTNELLQDQELDGIRIPRVF--ISENSD---- 240
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
DD R VF+K ++ +L+L P+ LG++KVN Y ++ S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKS--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
GGKP +S Y +G ++ IY DPH Q V
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV 321
>gi|444321667|ref|XP_004181489.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
gi|387514534|emb|CCH61970.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
Length = 577
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 82/275 (29%), Positives = 118/275 (42%), Gaps = 58/275 (21%)
Query: 138 AEFNQDFSSRILISYRKGFDPI-----GDSKI------------------------TSDV 168
EF +D SR++ +YR F PI G S I T+D+
Sbjct: 127 VEFLEDCKSRLIFTYRTNFSPIERAPDGPSPINVSVLFRDTLFNTVNHVLNNPNSFTTDI 186
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 222
GWGCM+R+ Q L+ AL LGR +R P K E +I+ F D+ PFS
Sbjct: 187 GWGCMIRTGQSLLGNALQIINLGRNFRINNQSNNPNTKNIKEE--DIIEWFYDNPNKPFS 244
Query: 223 IHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
IH + G + G W GP C + ++L + E G+ + V SGD
Sbjct: 245 IHKFVDKGMRISDKKPGEWFGPSTTCTAIQSLIY-EFPECGID----ECILSVSSGD--- 296
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
+ D+ + H F K + T IL+L+ + LG++K+N Y ++ S
Sbjct: 297 -------IYEDEINEH---FQKNEN--TIILILLGVKLGIDKINQCYFNDIKDILNSRYS 344
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
GI GG+P +S Y G E Y DPH Q +N
Sbjct: 345 CGISGGRPSSSLYFFGHMNEYLYYFDPHKPQLQLN 379
>gi|395854620|ref|XP_003799780.1| PREDICTED: cysteine protease ATG4A isoform 2 [Otolemur garnettii]
Length = 336
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 113/278 (40%), Gaps = 67/278 (24%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G P S
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTAGESPPGS 203
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
+ Q I+LDPH Q ++ +++ D T+H
Sbjct: 204 LTALN-QSNELIFLDPHTTQTFVDT-EENGTVDDQTFH 239
>gi|432845798|ref|XP_004065858.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
Length = 497
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 94/179 (52%), Gaps = 10/179 (5%)
Query: 208 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 267
+++ LFGD +PF +H L+ GK G AG W GP + + R A+T +G QS
Sbjct: 231 KLVTLFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGPSVVAH----ILRKAVAKTSVG-QS 285
Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 327
L A+YV +D V+ + D S V W +++LVP+ LG E +NP
Sbjct: 286 L--AVYVA---QDCTVYKEDVLQLCDPSLSQRVADPSSQAWKSVIILVPVRLGGEALNPS 340
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
YI ++ + +GI+GGKP S Y +G Q+E +YLDPH QPV++ + + ++
Sbjct: 341 YIECVKNILSLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDFTQANFSLES 399
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 11/73 (15%)
Query: 108 SSTSDIWLLGVCHKI-AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
+ TS I++LG + + ++DE + F DF SRI ++YR+ F + S +T+
Sbjct: 87 NKTSPIFVLGHAYLLNSEDE----------VERFRLDFVSRIWLTYRREFPQLEGSTLTT 136
Query: 167 DVGWGCMLRSSQM 179
D GWGCMLRS QM
Sbjct: 137 DCGWGCMLRSGQM 149
>gi|219129924|ref|XP_002185127.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403306|gb|EEC43259.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 557
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 107/266 (40%), Gaps = 36/266 (13%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL---- 198
D S +YR F I ITSD GWGCMLRS+QM++ QAL H R WR P
Sbjct: 171 DERSLFWFTYRCDFPEIAPYNITSDAGWGCMLRSAQMMLGQALRLHFKSRDWRPPQLLAR 230
Query: 199 --QKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 255
Q F R + + S S +S+HN++ AG Y G W GP C L
Sbjct: 231 RRQDSFIRSVLTWFADYPSSSESVYSLHNMVAAGLSKYDKLPGEWYGPGTACYVMRDLVH 290
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
+ LG L I+ V G + + K +
Sbjct: 291 IHEKQQALGKTRLDRRIFRVYVAPQGTVYRDTIHAFMTTEARVRIEEKKKVKEQTQPQAH 350
Query: 316 PLVLGLEK---------------------------VNPRYIPTLRLTFTFPQSLGIVGGK 348
PL L E+ +N Y+ +L TF+ PQS+G++GG+
Sbjct: 351 PLDLEWEEELMESANTVEWDTALLLLVPLRLGLTSLNEEYVQSLAHTFSLPQSVGVLGGR 410
Query: 349 PGASTYIVGVQEE-SAIY-LDPHDVQ 372
P + + G Q++ S I+ LDPH VQ
Sbjct: 411 PRGARWFYGAQKDGSKIFGLDPHTVQ 436
>gi|340059839|emb|CCC54236.1| putative peptidase [Trypanosoma vivax Y486]
Length = 354
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 108/226 (47%), Gaps = 25/226 (11%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA-LLFHRLGRPWRKPLQKPFDREY 206
+ SYR GF P+ + T+DV WGC++R++QML+AQA + F G + RE
Sbjct: 69 LYFSYRCGFTPLSNGS-TTDVAWGCVVRAAQMLLAQAHMRFFNSGHAFVDGSALQILREK 127
Query: 207 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
V+ LF D ++PF IH + + YG+A G W G ++ +L + G G
Sbjct: 128 VQ--PLFLDDPSAPFGIHAMTSEAEKYGVACGQWFGMTPAAKTIASLCQQHSLRGGNG-- 183
Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 326
P + V D E V + SR ++LL+P VLGL++++
Sbjct: 184 --PAVLVFV----DREVSALKVRDLLSHSRQ-------------VVLLIPAVLGLDRISV 224
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
+Y L +G++GG+ ++ Y VG Q + IYLDPH Q
Sbjct: 225 KYSKMLIRCLEMESCIGVIGGRKSSALYFVGHQSNNIIYLDPHRAQ 270
>gi|238879782|gb|EEQ43420.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 446
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 118/284 (41%), Gaps = 66/284 (23%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------ 162
LG N A N S++ +SYR GF+PI S
Sbjct: 68 VLGQTFDNFDTA--NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNF 125
Query: 163 ----------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
TSD GWGCM+R+SQ L+A LL K + + EI+ L
Sbjct: 126 ANLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKL 172
Query: 213 FGDSETSPFSIHNLLQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
F D +SPFSIHN ++ L +G W GP A S + L + + +P
Sbjct: 173 FQDGTSSPFSIHNFIRVASLSPLHVKSGEWFGPNAASLSIKRLTSELLQDQEIDGIKIPR 232
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
+S + D DD R VF+K + +L+L P+ LG++KVN Y
Sbjct: 233 VF--ISENSD---------LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYN 277
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
++ S GI GGKP +S Y +G ++ IY DPH Q V
Sbjct: 278 SIFHLLASKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV 321
>gi|302657364|ref|XP_003020406.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
verrucosum HKI 0517]
gi|291184236|gb|EFE39788.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
verrucosum HKI 0517]
Length = 398
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 104/235 (44%), Gaps = 48/235 (20%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK--------------------------ITSDVGWGC 172
+F DF S++ I+YR F PI + TSD GWGC
Sbjct: 185 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSSISLGVRLRSQLIDTQGFTSDTGWGC 244
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-K 231
M+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 245 MIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGAT 301
Query: 232 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
A G G W GP A + +AL + + GL G + E+ V C
Sbjct: 302 ACGKCPGEWFGPSAASQCIQALVK-SNPQVGL------RVCITSDGSDIYEKQFKEVACD 354
Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
+ P L+L+ + LG+++V P Y +L+ FPQS+GI G
Sbjct: 355 ESG-----------GGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAG 398
>gi|119623101|gb|EAX02696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_g
[Homo sapiens]
Length = 340
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 113/278 (40%), Gaps = 67/278 (24%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 184
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 185 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 207
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
Q + I+LDPH Q ++ ++ D T+H
Sbjct: 208 -LTASNQSDELIFLDPHTTQTFVDTEENGTVND-QTFH 243
>gi|344229797|gb|EGV61682.1| hypothetical protein CANTEDRAFT_115142 [Candida tenuis ATCC 10573]
Length = 408
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 105/213 (49%), Gaps = 29/213 (13%)
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I + T+DVGWGCM+R+SQ L+A +++ + + +E +++L F DSE
Sbjct: 123 IDNENFTTDVGWGCMIRTSQSLLANT---------YKRMISEDAQQE-IQLLDQFKDSEA 172
Query: 219 SPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS 276
+PFS+HN ++ L G W GP A S + L ++ G LP ++S
Sbjct: 173 APFSLHNFIRVANESPLQVKPGQWFGPNAASLSIQRLCNLVNSKENFG---LPGLSVLIS 229
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
+ D DD + + K Q+ +L+L+P+ LG++K N Y ++
Sbjct: 230 ENSD---------LYDDKVQEF-LDKKKQS----LLILLPIRLGIDKTNEFYYSSILQLL 275
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
QS+GI GGKP +S Y G + +YLDPH
Sbjct: 276 NCKQSVGIAGGKPSSSFYFFGYDNDELLYLDPH 308
>gi|402911089|ref|XP_003918175.1| PREDICTED: cysteine protease ATG4A isoform 2 [Papio anubis]
Length = 336
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/278 (26%), Positives = 113/278 (40%), Gaps = 67/278 (24%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRP-LD 202
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
Q + I+LDPH Q ++ ++ + D T+H
Sbjct: 203 YLTASNQSDELIFLDPHTTQTFVDTEENGMVND-QTFH 239
>gi|332226094|ref|XP_003262224.1| PREDICTED: cysteine protease ATG4A isoform 2 [Nomascus leucogenys]
Length = 336
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 113/278 (40%), Gaps = 67/278 (24%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 181 DIKKMCCV-------------------------------------LPLSADTAGDRPPDS 203
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
Q + I+LDPH Q ++ ++ D T+H
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVND-QTFH 239
>gi|30795248|ref|NP_840054.1| cysteine protease ATG4A isoform b [Homo sapiens]
gi|426397038|ref|XP_004064735.1| PREDICTED: cysteine protease ATG4A isoform 2 [Gorilla gorilla
gorilla]
gi|15487242|emb|CAC69077.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
gi|119623095|gb|EAX02690.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 336
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 113/278 (40%), Gaps = 67/278 (24%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 203
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
Q + I+LDPH Q ++ ++ D T+H
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVND-QTFH 239
>gi|330840249|ref|XP_003292131.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
gi|325077656|gb|EGC31355.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
Length = 603
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 74/126 (58%), Gaps = 1/126 (0%)
Query: 130 DAAGNNGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
D G + + EF +DF++R+L +YR+GF I +++ +D GWGCMLRS QML++ LL H
Sbjct: 129 DIPGQSFIKEFLEDFTTRVLWFTYRQGFPFIDNTQYDNDCGWGCMLRSGQMLLSNLLLHH 188
Query: 189 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
LG W+K Y I+ +F D ++PFSIHN+ G+ G G W P + +
Sbjct: 189 ALGDDWKKSSNSTHPDVYNNIISMFLDKPSAPFSIHNIALEGQTLGKNIGEWFAPSIISQ 248
Query: 249 SWEALA 254
+ ++L
Sbjct: 249 AIKSLV 254
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 35/70 (50%), Positives = 49/70 (70%)
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+L+L+P+ LGL+ +N Y +L F FPQ+LG+VGGKP AS Y + VQ+++ YLDPH
Sbjct: 371 PLLILIPMRLGLDGLNSIYYQSLLEIFKFPQNLGVVGGKPRASLYFIAVQDDNLFYLDPH 430
Query: 370 DVQPVINIGK 379
VQ I+I
Sbjct: 431 TVQNHIDINN 440
>gi|150864470|ref|XP_001383296.2| hypothetical protein PICST_30446 [Scheffersomyces stipitis CBS
6054]
gi|166990661|sp|A3LQU0.2|ATG4_PICST RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|149385726|gb|ABN65267.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 514
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 78/250 (31%), Positives = 120/250 (48%), Gaps = 35/250 (14%)
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
FS +L + + + I T+DVGWGCM+R+SQ L+A F RL L K D
Sbjct: 138 FSKSLLYNLQNFNNFIEKENFTTDVGWGCMIRTSQSLLANT--FVRL-------LDKQSD 188
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 261
I+ LF D+ +PFS+HN ++ + L G W GP A S + L C
Sbjct: 189 -----IIALFNDTYLAPFSLHNFIRVASSSPLKVKPGEWFGPNAASLSIKRL--CDGYYD 241
Query: 262 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
+++ I V+ + ++ ++ +KG +L+L+P+ LG+
Sbjct: 242 NSTSETILPRINVLISESTDLYDSQIAQLLEPSTE-----TKG------LLVLLPVRLGI 290
Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
+ +N Y +L + QS+GI GGKP +S Y G Q+ S IY+DPH Q I D
Sbjct: 291 DSINSYYFSSLLHLLSLEQSVGIAGGKPSSSFYFFGYQDNSLIYMDPHSAQ----IFSSD 346
Query: 382 LEADTSTYHS 391
+ D STY++
Sbjct: 347 I--DMSTYYA 354
>gi|397497902|ref|XP_003819742.1| PREDICTED: cysteine protease ATG4A isoform 2 [Pan paniscus]
Length = 336
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 113/278 (40%), Gaps = 67/278 (24%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 203
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
Q + I+LDPH Q ++ ++ D T+H
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVND-QTFH 239
>gi|183230788|ref|XP_001913481.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169802747|gb|EDS89733.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449704540|gb|EMD44766.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 330
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 73/238 (30%), Positives = 108/238 (45%), Gaps = 29/238 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 204
I I+YRK I + T+D GWGCM+RS QM +AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCINTERNI 96
Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARCQRAETG 262
+++ I++LFGDS S FSIH L+ G+ G W GP +A + E + + T
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEMRVFRT- 155
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
RG + S+ + G + P ++ VPL LG E
Sbjct: 156 --------------------RGYVAKLGSIIGSKIEELIKDG-GGFNPCIIFVPLRLGPE 194
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
+ P L+ F PQ +G++GGKPG + Y + +LDPH Q I++ D
Sbjct: 195 SPENEFKPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAIDMKGD 252
>gi|296236154|ref|XP_002763201.1| PREDICTED: uncharacterized protein LOC100409486 [Callithrix
jacchus]
Length = 360
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 90/189 (47%), Gaps = 28/189 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 293 DASRHCSVF 301
D + C V
Sbjct: 205 DIKKMCRVL 213
>gi|403289553|ref|XP_003935916.1| PREDICTED: cysteine protease ATG4A isoform 2 [Saimiri boliviensis
boliviensis]
Length = 360
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 90/189 (47%), Gaps = 28/189 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 293 DASRHCSVF 301
D + C V
Sbjct: 205 DIKKMCRVL 213
>gi|241729578|ref|XP_002404604.1| cysteine protease, putative [Ixodes scapularis]
gi|215505492|gb|EEC14986.1| cysteine protease, putative [Ixodes scapularis]
Length = 433
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 96/332 (28%), Positives = 136/332 (40%), Gaps = 89/332 (26%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVG 169
IWLLGV + + G +A + A F+ +DFSSR+ +YR+ F I + I +D G
Sbjct: 36 IWLLGVIYHRKMTQFYGASAVVDDGASFDAFLEDFSSRLWFTYRREFPAIPGTDIRTDCG 95
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWR------------------KPLQKPF-----DREY 206
WGCMLRSSQM++AQA + H LGR WR PL++ F D
Sbjct: 96 WGCMLRSSQMILAQAFVMHLLGRQWRWQQVHTEAGEVRLPRHALWPLREGFRCTGGDGTA 155
Query: 207 VEIL----------HLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSW 250
V + FGD ++PFS+HNL+Q G+ G AG W GP Y + +
Sbjct: 156 VLVRCSPKPVNDPPRWFGDKADASTPFSLHNLVQRGRESGKKAGDWYGPSSVAYILKDAL 215
Query: 251 E-ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
E A R QR + IYV + +DD + CS S
Sbjct: 216 EDAAHRDQRLAQ--------LCIYVAQD---------CTIYMDDVTALCSAGSTEGV--- 255
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS--------------LGIVGGKPGASTYI 355
+ PR + R F+ Q+ + K G S +
Sbjct: 256 -----------THRRLPRTVFARREMFSGGQTQRMCIHSSWLHLFVFFVCFLKYGISFLL 304
Query: 356 -VGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+ EE IYLDPH Q ++++ D D+
Sbjct: 305 QLSAAEEKVIYLDPHYCQEMVDVNSQDFPLDS 336
>gi|119623099|gb|EAX02694.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_e
[Homo sapiens]
Length = 231
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 102/226 (45%), Gaps = 61/226 (26%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + +
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEK---- 133
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
MCR + +S D G+R + +
Sbjct: 134 -------------MCR-----------------------VLPLSADTAGDRPPDSLTASN 157
Query: 293 DA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
+ S +CS W P+LL+VPL LG+ ++NP Y+ ++T
Sbjct: 158 QSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKVT 196
>gi|407043540|gb|EKE42005.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 330
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 75/244 (30%), Positives = 111/244 (45%), Gaps = 30/244 (12%)
Query: 143 DFSSR-ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---L 198
DF+ I I+YRK I + T+D GWGCM+RS QM +AQ L LG W+ +
Sbjct: 33 DFARHTIWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCI 90
Query: 199 QKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARC 256
+ +++ I++LFGDS S FSIH L+ G+ G W GP +A + E +
Sbjct: 91 NTERNIFHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEM 150
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
+ T RG + S+ + G + P ++ VP
Sbjct: 151 RVFRT---------------------RGYVAKLGSIIGSKIEELIKDG-GGFNPCIIFVP 188
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
L LG E + P L+ F PQ +G++GGKPG + Y + +LDPH Q I+
Sbjct: 189 LRLGPESPENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGINLYFLDPHTTQNAID 248
Query: 377 IGKD 380
+ D
Sbjct: 249 MKGD 252
>gi|385305819|gb|EIF49766.1| cysteine protease atg4 [Dekkera bruxellensis AWRI1499]
Length = 476
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 125/272 (45%), Gaps = 50/272 (18%)
Query: 138 AEFNQDFSSRILISYRKGF-----DPIGDSKI-------------------TSDVGWGCM 173
++F D ++R+ +YR GF DP G S + T+D GWGCM
Sbjct: 91 SDFISDVATRLWFTYRSGFPVIKRDPDGPSPLSLGSLFRGTLDVKNASIGFTTDSGWGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRK-PLQKPF-DREYV-------EILHLFGDSETSPFSIH 224
+R+SQ L+A ALL +GR WR P + P + EY +I+ F D +PFSI
Sbjct: 151 IRTSQSLLANALLNLHVGRKWRYIPAENPNGETEYAKKYEKQWQIITWFADFPWAPFSIQ 210
Query: 225 NLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
+++ G + G W GP A RS L + ++ C+ + Y+ G+ D
Sbjct: 211 QIVRYGSEHCNKKPGEWFGPSAASRSIVYLCK----QSYKACK---LNTYLTEGNGD--- 260
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
+D S + + P L+L + LG+ VNP Y L+ + QS+G
Sbjct: 261 ------IYEDELLXVSCPEGTENGFRPTLILSGVRLGVXXVNPVYWAFLKKLLSIHQSVG 314
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
I GG+P +S Y G Q ++ Y+DPH Q +
Sbjct: 315 IAGGRPSSSHYFFGYQGDNLFYMDPHTPQTAL 346
>gi|348520913|ref|XP_003447971.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
Length = 500
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 88/181 (48%), Gaps = 10/181 (5%)
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+ ++ FGD +PF +H L+ GK G AG W GP +A R
Sbjct: 232 HSRLVTWFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGP-------SVVAHILRKAVDKTS 284
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 325
+A+YV +D VV + D S + + DW +++LVP+ LG E +N
Sbjct: 285 VVTNLAVYVA---QDCTVYKEDVVRLCDRSLNQTSSDPSSQDWKSVIILVPVRLGGEALN 341
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
P YI ++ +GI+GGKP S Y +G Q+E +YLDPH QPV+++ + + +
Sbjct: 342 PSYIDCVKNFLKLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDVSQINFSLE 401
Query: 386 T 386
+
Sbjct: 402 S 402
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 38/61 (62%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
+ F F SRI ++YR+ F + S T+D GWGCMLRS QML+AQ LL H + R W
Sbjct: 104 VERFRLAFVSRIWLTYRREFPQLEGSTWTTDCGWGCMLRSGQMLLAQGLLVHLMPRDWVW 163
Query: 197 P 197
P
Sbjct: 164 P 164
>gi|123407417|ref|XP_001303004.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121884346|gb|EAX90074.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 298
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 104/224 (46%), Gaps = 37/224 (16%)
Query: 151 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVE 208
+Y K F P+ T+D WGC +RS+Q L+ Q + L+ LG R P + +Y
Sbjct: 28 TYHKNFAPL-QGGFTTDKNWGCCIRSAQGLIMQFITKLYKHLGDDIRNIF--PTNSKY-- 82
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
LF D SPF + ++ ++YG+ G WV P + + + R
Sbjct: 83 --ELFYDLPHSPFGLPHICAELQSYGVMPGEWVKPSLLAPVIKEIMNFFRI--------- 131
Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
PVV + V ++ + P+LLL L+LG E +Y
Sbjct: 132 ------------------PVVIAEHGCLSREVLNEALSHNIPVLLLFTLMLGYENFELKY 173
Query: 329 IPTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
+P L+LT + QS+G+VGG+ G + +IVG Q+E +Y DPHDV
Sbjct: 174 LPFLKLTLSLIYQSVGVVGGQQGKAYFIVGHQKEKLLYFDPHDV 217
>gi|366995231|ref|XP_003677379.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
gi|342303248|emb|CCC71026.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
Length = 495
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 124/288 (43%), Gaps = 71/288 (24%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 170
F +D +R+ +YR F PI S +D+GW
Sbjct: 75 FLKDVVTRLHFTYRTRFKPIMKSPEGPSPLNFSLVIRENPIDVIENAITNPDCFNTDIGW 134
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGDSETSPFSIHNLLQ 228
GCM+R+ Q L+ L RLGR +R P +++ E I+ F D+ PFS+H +
Sbjct: 135 GCMIRTGQSLLGNTLQIVRLGRDFR---YDPENKDISENRIIEWFIDAPEKPFSLHQFIT 191
Query: 229 AG-KAYGLAAGSWVGPYAMCRSWEALAR----CQRAETGLGCQSLPMAIYVVSGDEDGER 283
G + G G W GP A RS ++L R C AE + V SGD
Sbjct: 192 EGMELSGKNPGEWFGPAATARSIQSLIRKFPDCGIAEC---------LVSVSSGD----- 237
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
+ D+ + VF+ + + +L+L+ + LGL VN Y ++R + S+G
Sbjct: 238 -----IYSDEVKQ---VFADNKKN---LLILLGVKLGLNAVNECYWDSIRHILSSKYSVG 286
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
I GG+P +S Y G + + +Y DPH QP LE + +Y S
Sbjct: 287 ISGGRPSSSLYFFGYEGDELLYFDPHSPQP-------SLEENNVSYKS 327
>gi|344304092|gb|EGW34341.1| hypothetical protein SPAPADRAFT_59751, partial [Spathaspora
passalidarum NRRL Y-27907]
Length = 363
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/250 (30%), Positives = 114/250 (45%), Gaps = 43/250 (17%)
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
F+ R+ + R FD SDVGWGCM+R+SQ L+A AL+ LQ +
Sbjct: 104 FNKRLFTTVRSLFD---SENFNSDVGWGCMIRTSQSLLANALM----------KLQPSAE 150
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 261
E +++LF D+ S FS+HN ++ L G W GP A S + L + +T
Sbjct: 151 HE---VINLFQDNIASAFSLHNFIRVASESPLEVKPGQWFGPNAASLSTKKLLDGMKGKT 207
Query: 262 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
G + + I S D E I++ SV L+L P+ LG+
Sbjct: 208 IQGVKYPHVFISENSDLYDEE--------IEELLVESSV-----------LILFPVRLGI 248
Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
+ VN Y ++ P ++GI GGKP +S Y +G Q++ +Y DPH Q N
Sbjct: 249 DNVNSYYYDSIFQLLACPFTVGISGGKPSSSFYFLGYQDQDLLYFDPHSPQLYEN----- 303
Query: 382 LEADTSTYHS 391
+ +TYH+
Sbjct: 304 -PINYTTYHT 312
>gi|67967551|dbj|BAE00258.1| unnamed protein product [Macaca fascicularis]
Length = 330
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 109/250 (43%), Gaps = 55/250 (22%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWR------------------------------------K 196
MLRS QM++AQ LL H L R W
Sbjct: 1 MLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP +A
Sbjct: 61 ELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHI 111
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
R + + +YV + A +V D + A+W +++LVP
Sbjct: 112 LRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVP 161
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP ++
Sbjct: 162 VRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVD 221
Query: 377 IGKDDLEADT 386
+ + D ++
Sbjct: 222 VSQADFPLES 231
>gi|167526339|ref|XP_001747503.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773949|gb|EDQ87583.1| predicted protein [Monosiga brevicollis MX1]
Length = 355
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/238 (31%), Positives = 108/238 (45%), Gaps = 26/238 (10%)
Query: 145 SSRILISYRKGFDPIGDS-KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
S+ + +YR IGDS + +D GWGC LR QM+V +AL R + K L P +
Sbjct: 52 SAFLWFTYRNSEYAIGDSPRHKTDRGWGCTLRVGQMIVGEALQRCHCPRDYDK-LSYPSE 110
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
+ IL F D S+H + K G AG W P + Q A +
Sbjct: 111 AARMSILKEFEDRPDRVLSVHAMAMQSKFVGKRAGQWHTPTDVAHVLRLAVNEQEA---M 167
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
G Q ++V +V +DD + +F +A LL VPL LG++
Sbjct: 168 GLQ-----VHVAMD---------SMVVLDDLRK---LFRADRA----TLLFVPLRLGIDI 206
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
V IP ++ F P +LGI+GG+PGA+ Y +G + + + LDPH Q + G D
Sbjct: 207 VQAEMIPAVKRFFHSPSALGIMGGRPGAAHYFIGYMDHNLLLLDPHTTQDPLRAGSQD 264
>gi|441628985|ref|XP_004093160.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Nomascus
leucogenys]
Length = 441
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 118/265 (44%), Gaps = 31/265 (11%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW--GCMLRSSQML-VAQALLFHR 189
G F DF SR+ ++YR + I D W G L ++ A +H
Sbjct: 98 GEGEHTAFPADFVSRLWLTYRXXXHCLTMCSIPPDWTWAEGTGLGPPELSGSASPSRYHG 157
Query: 190 LGRPWRKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 241
R W P L++ +R + +I+ F D +PF +H L++ G++ G AG W
Sbjct: 158 PAR-WMPPRWAQGAPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWY 214
Query: 242 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 301
GP +A R + + +YV + A +V D +
Sbjct: 215 GP-------SLVAHILRKAVESCSEVTRLVVYVSQTCSMYKADVARLVARPDPT------ 261
Query: 302 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++
Sbjct: 262 ----AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCQLCLGIMGGKPRHSLYFIGYQDD 317
Query: 362 SAIYLDPHDVQPVINIGKDDLEADT 386
+YLDPH QP +++ + D ++
Sbjct: 318 FLLYLDPHYCQPTVDVSQADFPLES 342
>gi|16551551|dbj|BAB71121.1| unnamed protein product [Homo sapiens]
Length = 330
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 108/250 (43%), Gaps = 55/250 (22%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWR------------------------------------K 196
MLRS QM++AQ LL H L R W
Sbjct: 1 MLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP +A
Sbjct: 61 ELER--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHI 111
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
R + +YV + A +V D + A+W +++LVP
Sbjct: 112 LRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVP 161
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP ++
Sbjct: 162 VRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVD 221
Query: 377 IGKDDLEADT 386
+ + D ++
Sbjct: 222 VSQADFPLES 231
>gi|349580723|dbj|GAA25882.1| K7_Atg4p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAV 330
>gi|323346814|gb|EGA81093.1| Atg4p [Saccharomyces cerevisiae Lalvin QA23]
Length = 494
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAV 330
>gi|37362688|ref|NP_014176.2| Atg4p [Saccharomyces cerevisiae S288c]
gi|61252248|sp|P53867.2|ATG4_YEAST RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|166990654|sp|A6ZRL7.1|ATG4_YEAS7 RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|1173491|gb|AAA86498.1| ORF494 [Saccharomyces cerevisiae]
gi|151944321|gb|EDN62599.1| cysteine protease [Saccharomyces cerevisiae YJM789]
gi|190409197|gb|EDV12462.1| anchor protein [Saccharomyces cerevisiae RM11-1a]
gi|285814439|tpg|DAA10333.1| TPA: Atg4p [Saccharomyces cerevisiae S288c]
gi|323352870|gb|EGA85172.1| Atg4p [Saccharomyces cerevisiae VL3]
gi|392297128|gb|EIW08229.1| Atg4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAV 330
>gi|365763488|gb|EHN05016.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 494
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAV 330
>gi|323335883|gb|EGA77161.1| Atg4p [Saccharomyces cerevisiae Vin13]
Length = 494
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAV 330
>gi|256272398|gb|EEU07381.1| Atg4p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAV 330
>gi|189515077|ref|XP_001333093.2| PREDICTED: cysteine protease ATG4D-like [Danio rerio]
Length = 485
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 95/198 (47%), Gaps = 24/198 (12%)
Query: 193 PWRKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
P R P P D + +++ FGD ++PF +H L++ GK G AG W GP +
Sbjct: 210 PARCPSASPDPQVDALHRKVVSCFGDHPSAPFGVHQLVELGKESGKRAGDWYGPSVVAHM 269
Query: 250 W-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
+A+AR E +A+YV V +D C G W
Sbjct: 270 LRKAVARAAEFED--------LAVYVAQDC---------TVYKEDVMSLCESSGVG---W 309
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
+++LVP+ LG E +NP YI ++ +GI+GGKP S + VG Q+E +YLDP
Sbjct: 310 KSVVILVPVRLGGESLNPSYIECVKNILKLKCCIGIIGGKPKHSLFFVGFQDEQLLYLDP 369
Query: 369 HDVQPVINIGKDDLEADT 386
H QPV+++ + + ++
Sbjct: 370 HYCQPVVDVTQANFSLES 387
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 40/63 (63%), Gaps = 1/63 (1%)
Query: 134 NNGLAE-FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
N G E F Q F S + ++YR+ F + S +T+D GWGCMLRS QM++AQ LL H +
Sbjct: 92 NEGEVERFRQTFVSCVWLTYRREFPQLDGSSLTTDCGWGCMLRSGQMMLAQGLLLHLMPT 151
Query: 193 PWR 195
WR
Sbjct: 152 DWR 154
>gi|259149141|emb|CAY82383.1| Atg4p [Saccharomyces cerevisiae EC1118]
Length = 506
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAV 342
>gi|1183991|emb|CAA93375.1| N1274 [Saccharomyces cerevisiae]
gi|1302243|emb|CAA96126.1| unnamed protein product [Saccharomyces cerevisiae]
Length = 506
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAV 342
>gi|323307493|gb|EGA60764.1| Atg4p [Saccharomyces cerevisiae FostersO]
Length = 494
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAV 330
>gi|323303340|gb|EGA57136.1| Atg4p [Saccharomyces cerevisiae FostersB]
Length = 494
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAV 330
>gi|395545675|ref|XP_003774724.1| PREDICTED: cysteine protease ATG4A [Sarcophilus harrisii]
Length = 431
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 95/205 (46%), Gaps = 15/205 (7%)
Query: 194 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------Y 244
W K ++P EY IL F D + +SIH + Q G G + G W GP
Sbjct: 137 WEKHQEQP--EEYQRILKCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 194
Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 304
A+ W +LA + + + + ++ D + + +D + C + G
Sbjct: 195 ALFDEWNSLAVYVSMDNTVVIEDIKKMCHMCPSDLTHDSSSSSYNGLD-WNTDCPGQTSG 253
Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
W P+LL+VPL LG+ ++NP Y + F PQSLG +GGKP ++ Y +G + I
Sbjct: 254 ---WKPLLLIVPLRLGINQINPIYADAFKECFKMPQSLGALGGKPNSAYYFIGFLGDELI 310
Query: 365 YLDPHDVQPVINIGKDDLEADTSTY 389
YLDPH Q ++ ++ D S +
Sbjct: 311 YLDPHTTQTFVDTEENGTVNDQSFH 335
>gi|45185039|ref|NP_982756.1| ABL191Wp [Ashbya gossypii ATCC 10895]
gi|62899767|sp|Q75E61.1|ATG4_ASHGO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|44980675|gb|AAS50580.1| ABL191Wp [Ashbya gossypii ATCC 10895]
gi|374105958|gb|AEY94868.1| FABL191Wp [Ashbya gossypii FDAG1]
Length = 521
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 85/281 (30%), Positives = 122/281 (43%), Gaps = 51/281 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 169
EF D +R+ +YR F PI G S ++ +D+G
Sbjct: 115 EFLADVHTRLHFTYRTRFVPIPRHPNGPSPMSISVMLRDNPLNVIENVLNNPDCFQTDIG 174
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+A AL LGR +R + E + I+ F D PFS+H +Q
Sbjct: 175 WGCMIRTGQSLLANALQRACLGRDFRIDDNAANEHE-LRIIKWFEDDPKYPFSLHKFVQE 233
Query: 230 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G + G G W GP A RS +AL A C I SGD
Sbjct: 234 GFSLSGKKPGEWFGPSATSRSIQALVAKFPA-----CGIAHCVISTDSGD---------- 278
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
V +D+ +F + +LLL+ + LG++ VN Y +R + S+GI GG+
Sbjct: 279 VYMDEVE---PLFRADPS--AAVLLLLCVRLGVDVVNEVYWEHIRHILSSEHSVGIAGGR 333
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
P +S Y G Q+E YLDPH Q + + DL+ S +
Sbjct: 334 PSSSLYFFGYQDEHLFYLDPHKPQLNLASYQQDLDLFRSVH 374
>gi|407408842|gb|EKF32115.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
Length = 357
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 63/232 (27%), Positives = 109/232 (46%), Gaps = 29/232 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 204
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGNERLQELRAR 132
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAITSEGTKHGVKCGEWFGPTPIAKTLNAL----------- 177
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
MA Y+ +G E G V+ + + ++LL+P++LG+ +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEQVKELLRQSMHVVLLIPVMLGIRVI 227
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHRVQPAFT 279
>gi|207341865|gb|EDZ69806.1| YNL223Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 371
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLI------YGFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAV 342
>gi|410075557|ref|XP_003955361.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
gi|372461943|emb|CCF56226.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
Length = 463
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 114/273 (41%), Gaps = 56/273 (20%)
Query: 135 NGLAEFNQDF----SSRILISYRKGFDPIGDSK--------------------------- 163
N + NQDF +SR+ +YR F PI S
Sbjct: 52 NRNSNLNQDFLSDVNSRLAFTYRTKFQPILRSSEGPSPLNFRMIFRDNPINTLENVINNP 111
Query: 164 --ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
+D+GWGCM+R+ Q L+ AL +LGR +R L + EI+ F D+ PF
Sbjct: 112 DCFNTDIGWGCMIRTGQSLLGNALQLAKLGRHFR--LDNKMGIKDDEIISWFRDTTQEPF 169
Query: 222 SIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
SIH ++ G K G W GP A S ++L + E G+ + V SGD
Sbjct: 170 SIHKFVEKGNKLANKKPGEWFGPAATSISIQSLIE-EFPECGID----KCLVSVSSGD-- 222
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
+D R +F + + IL L+ + LGL+ VN Y +
Sbjct: 223 ---------IFEDDVRE--IFEENMD--SKILFLMGVKLGLDAVNSFYWEDILNILDSKF 269
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
S+GI GG+P +S Y G Q +Y DPH QP
Sbjct: 270 SVGIAGGRPSSSLYFFGHQGNELLYFDPHRPQP 302
>gi|145549650|ref|XP_001460504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428334|emb|CAK93107.1| unnamed protein product [Paramecium tetraurelia]
Length = 402
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 116/272 (42%), Gaps = 45/272 (16%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPWRKP--LQKP 201
SS I SYRK S +TSD GWGCM+R +QM +AQ + +H +P + ++
Sbjct: 71 SSIIWFSYRKKIPQFQISSLTSDTGWGCMIRVAQMALAQVIRHYHSFTQPEQLIVLIRHF 130
Query: 202 FDREYVEILHLFGDSETS-------PFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEAL 253
D + E+++ + + PFSI ++ K + G W P + + L
Sbjct: 131 LDDDDDELINFIKQDQKNQVQYYHAPFSIQKIVYHAKVEFKKEPGDWYKPNEILETLNYL 190
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP--- 310
+ + SL M IY+ + DA + + KG +W
Sbjct: 191 FKYSQY-------SLNMQIYI---------NYQCAFILQDAIKQMFNYDKGNQEWLKECI 234
Query: 311 -------------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
I + +P +GL++VN Y+ L + T P GI+GG + YIVG
Sbjct: 235 KNNNQFISQHDKGIAIFLPARIGLQRVNQDYLEVLNILMTLPYFQGIIGGVTNRAFYIVG 294
Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
++ IYLDPH VQ N +DL ++Y
Sbjct: 295 RIQDYLIYLDPHFVQNAQNF--EDLSKTQASY 324
>gi|71415152|ref|XP_809652.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70874068|gb|EAN87801.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
Length = 357
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 110/232 (47%), Gaps = 29/232 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 204
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 132
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
MA Y+ +G E G V+ + + T ++LL+P++LG+ +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 227
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFT 279
>gi|407848120|gb|EKG03593.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
Clan CA, family C54, putative [Trypanosoma cruzi]
Length = 357
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 63/232 (27%), Positives = 110/232 (47%), Gaps = 29/232 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG---RPWRKPLQKPFDR 204
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G R + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPRIGSERLQELRAR 132
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
MA Y+ +G E G ++ + + T ++LL+P++LG+ +
Sbjct: 178 -----MASYLATGGE-----GPVILAFPERQIFLEEVKELLRQSTHVVLLIPVMLGICVI 227
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFT 279
>gi|111154179|gb|ABH07411.1| autophagin-2 [Trypanosoma cruzi]
Length = 351
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 110/232 (47%), Gaps = 29/232 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 204
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 68 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 126
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 127 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 171
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
MA Y+ +G E G V+ + + T ++LL+P++LG+ +
Sbjct: 172 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 221
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP
Sbjct: 222 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFT 273
>gi|213403524|ref|XP_002172534.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
gi|212000581|gb|EEB06241.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
Length = 314
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 85/285 (29%), Positives = 115/285 (40%), Gaps = 54/285 (18%)
Query: 91 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 150
M I ER L T S + IW LG H A + A F QD + +
Sbjct: 4 MSHILERYLRMFPTNHEPSGTFIWSLG--HSYATETGKWPEA-------FVQDTYDLLSL 54
Query: 151 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
+YRK G +SD GWGCM+RS Q ++A L R +P P+ K IL
Sbjct: 55 TYRKCI--AGMECFSSDAGWGCMIRSMQTMLANCL---RRVQP-SLPVHK--------IL 100
Query: 211 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
H F D + S+H + AG + G+W GP + L C + P
Sbjct: 101 HYFADEANAYLSLHQFVDAGHTLCNITPGNWFGPATVSHCAAHL-----------CSTHP 149
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI--LLLVPLVLGLEKVNPR 327
V DG ++ + Q TP LLL L LG++ ++
Sbjct: 150 QVGLNVCVSHDG-----------------AIMYRDQLRNTPYPRLLLFTLRLGIDTIHTS 192
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
Y L T PQ++GIVGG+P A+ Y Q + YLDPH Q
Sbjct: 193 YYEQLCHVLTIPQAIGIVGGRPRAAHYFYACQSQWFFYLDPHTTQ 237
>gi|444726263|gb|ELW66801.1| Cysteine protease ATG4C [Tupaia chinensis]
Length = 378
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 107/260 (41%), Gaps = 68/260 (26%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
AGN + EF +DF SRI ++YR+ F PI S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45 AGN--VEEFRRDFISRIWLTYREEFPPIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102
Query: 192 RPWRKP----------------------------------LQKPF------------DRE 205
R W P L+ P D E
Sbjct: 103 RAWTWPDALNIENSDSESWTSHTVKKFTASVEASLSGERELKTPTISLKETIEKYSDDHE 162
Query: 206 ------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
+ +I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 163 IRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
G + IYV V D + + + AD +++LVP+ L
Sbjct: 223 PDLQG-----ITIYVAQ--------DCTVYSSDVIDKQRTAMTADNADDKAVIILVPVRL 269
Query: 320 GLEKVNPRYIPTLRLTFTFP 339
G E+ N Y+ ++ TF P
Sbjct: 270 GGERTNTDYLEFVK-TFHCP 288
>gi|145481079|ref|XP_001426562.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124393637|emb|CAK59164.1| unnamed protein product [Paramecium tetraurelia]
Length = 391
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 120/274 (43%), Gaps = 40/274 (14%)
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
Q +S I +YRK F I +S+ TSD GWGCMLRS QM+ AQ L H R+ Q
Sbjct: 51 QIYSRTIWFTYRKNFPQILNSQQTSDAGWGCMLRSGQMIWAQILRVH-----IRQKKQHS 105
Query: 202 FDREYVEILHLFGDSET---------------SPFSIHNLLQAGK-AYGLAAGSWVGPYA 245
D +Y ++L F D + SP+SI + + + + W P
Sbjct: 106 KDYQY-KLLCAFSDDDDDEHKKMFTDNFKLCLSPYSIQKIEAISQIKFSMKPCQWYRPDQ 164
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIY--VVSGDEDGERGGAPVVC-----------ID 292
+ + L + ++ E G + L + I ++ E G + C
Sbjct: 165 ILNALSLLHQQKQLE---GSEDLEITISDSLLYDRLYSEMYGLKMDCEHIVNEIKQDKNK 221
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
+ S+ C++ K I + + GL+++N Y+P L PQ GI+GG+ +
Sbjct: 222 EISKICNICQKKDPKALAIFFITRI--GLDEINKEYLPFLNDLIDLPQFQGIIGGRDDKA 279
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
YI+G + IYLDPH +Q IN G + DT
Sbjct: 280 YYILGRVNKRLIYLDPHYIQEHINRGNVVMLKDT 313
>gi|123397031|ref|XP_001301012.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121882136|gb|EAX88082.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 297
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 109/223 (48%), Gaps = 33/223 (14%)
Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
+Y KGF P+ T+D WGC +RS Q L+ Q + +L + + ++ F
Sbjct: 27 FTYHKGFSPLAGG-YTTDKNWGCCIRSGQGLLMQFV--SKLYQLYGDKIKNIFPNG--SK 81
Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
LF D +PF IH + + + +G+ AG WV P + ++ L
Sbjct: 82 FELFFDHPQAPFGIHCICRELETFGVKAGEWVKPSMLAPVFKDLLSF------------- 128
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 329
I+VV E+G C+ S S G P+LLL L+LG + + +Y+
Sbjct: 129 FGIHVVIA-ENG--------CLSRESLR-EALSYGH----PVLLLFTLMLGYKDFDLKYL 174
Query: 330 PTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
P LRLT + QS+G+VGG+ G + Y+VG Q+E+ +Y DPH+V
Sbjct: 175 PFLRLTLSLIYQSVGVVGGQQGKAYYLVGHQKENLLYFDPHEV 217
>gi|401624007|gb|EJS42084.1| atg4p [Saccharomyces arboricola H-6]
Length = 494
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 113/267 (42%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 169
EF D SR+ +YR F PI G S ++ +D+G
Sbjct: 85 EFLLDVRSRVNFTYRTRFIPIPRAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R +K RE +I+ F D+ +PFSIHN +
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVDNEKSLKRES-KIVTWFNDTPEAPFSIHNFVST 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L C + V SG D +
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIYGFPE-----CGITDCVVSVSSG--DIYQNEVEK 256
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ +++ + IL L+ + LG+ VN Y ++ +S+GI GG+
Sbjct: 257 IYVENPD-------------SIILFLLGVKLGINAVNESYRESICGILNSARSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q +Y DPH QP +
Sbjct: 304 PSSSLYFFGYQGNQFLYFDPHIPQPAV 330
>gi|149246610|ref|XP_001527730.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|166990616|sp|A5DSB4.1|ATG4_LODEL RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|146447684|gb|EDK42072.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 523
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 115/242 (47%), Gaps = 37/242 (15%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALL--FHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
TSD GWGCM+R+SQ L+A ALL FH G +P + +++ LF D+ ++PF
Sbjct: 179 FTSDAGWGCMIRTSQNLLANALLRLFHTTGG---QPQNFAVTKTEADVIELFQDTLSAPF 235
Query: 222 SIHNLLQAGKAYGL--AAGSWVGPYA-------MCRSWEALARCQRAETGLGCQS---LP 269
S+HN ++A + L G W GP A + + + + +R+E G S +P
Sbjct: 236 SLHNFIKAANSLSLNIKPGQWFGPSAASLSIKKLVNDYNLIQQERRSERDSGRDSGHKVP 295
Query: 270 M-----------AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG-----QADWTPILL 313
+ D +R P V + S +C ++ + + PIL
Sbjct: 296 TPNLKLHSKSADSDSDSDSDAISKRNSIPYVYV---SENCDLYDDEINAIFELEQRPILF 352
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPHDVQ 372
L P+ LG+E+VN Y ++ S+GI GGKP +S Y +G + E+ IY DPH Q
Sbjct: 353 LFPIRLGIEQVNKYYYSSILQILASKFSVGIAGGKPSSSFYFIGYEGEDDLIYFDPHLPQ 412
Query: 373 PV 374
V
Sbjct: 413 IV 414
>gi|403216261|emb|CCK70758.1| hypothetical protein KNAG_0F00890 [Kazachstania naganishii CBS
8797]
Length = 448
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 76/269 (28%), Positives = 115/269 (42%), Gaps = 54/269 (20%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------ 165
N +F +D +R+ +YR F PI G S I+
Sbjct: 38 NEKMQFYRDVCTRLNFTYRTKFVPISRSPDGPSPISFQLMIRDGPLSVIENALLHPDCFN 97
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D+GWGCM+R+ Q L+ AL R GR +R D +I+ F D+ +PFS+HN
Sbjct: 98 TDIGWGCMIRTGQSLLGNALQRLRHGREFRVTESTHDD----DIIQWFKDTPDAPFSLHN 153
Query: 226 LLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
++ G + + G W GP A RS ++L C + G+ I VS + ++
Sbjct: 154 FVKKGVELADMKPGQWFGPAATSRSIQSLI-CNFPQCGID-----HCIVSVSSADIYKQD 207
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
+ D S +L+L + LG+ VN Y +R S+GI
Sbjct: 208 VEDMFDADPDSN--------------LLILFGVKLGVSAVNASYWEDIRRLLNSKFSVGI 253
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQP 373
GG+P +S Y G Q + +Y DPH QP
Sbjct: 254 AGGRPSSSLYFFGYQNQELLYFDPHTPQP 282
>gi|148693227|gb|EDL25174.1| autophagy-related 4D (yeast), isoform CRA_c [Mus musculus]
Length = 257
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 80/176 (45%), Gaps = 44/176 (25%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP 249
>gi|365758760|gb|EHN00587.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 485
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 109/267 (40%), Gaps = 51/267 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 76 EFLLDVRSRVNFTYRTRFVPIARAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 135
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R F RE I++ F D+ +PFS+HN +
Sbjct: 136 WGCMIRTGQSLLGNALQILHLGRDFRVDEDDDFRRE-SRIVNWFNDTPEAPFSLHNFVST 194
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS + L E G+ + V SG D
Sbjct: 195 GTELSDKRPGEWFGPAATARSIQYLIY-GFPECGINA----CIVSVSSG--DIYENEVEE 247
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
V +D+ + + IL L+ + LG+ VN Y ++ S+GI GG+
Sbjct: 248 VFVDNPN-------------SSILFLLGVKLGINAVNESYRESICGILNSAWSVGIAGGR 294
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI 375
P +S Y G Q ++ DPH QP +
Sbjct: 295 PSSSLYFFGYQGNEFLHFDPHIPQPAV 321
>gi|149020505|gb|EDL78310.1| rCG31864, isoform CRA_c [Rattus norvegicus]
Length = 337
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 90/184 (48%), Gaps = 17/184 (9%)
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
DR + I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 72 DRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-------SVVAHILRKAVE 124
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
C + + VS D + D +R S + A+W +++LVP+ LG E
Sbjct: 125 -SCSEVTRLVVYVSQDCTVYKA--------DVARLVS-WPDPTAEWKSVVILVPVRLGGE 174
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP +++ + +
Sbjct: 175 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVNQANF 234
Query: 383 EADT 386
++
Sbjct: 235 PLES 238
>gi|223590151|sp|A5DEF7.2|ATG4_PICGU RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|190345638|gb|EDK37561.2| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
6260]
Length = 402
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 117/276 (42%), Gaps = 81/276 (29%)
Query: 136 GLAEFNQDFSSRILISYRKGFDPI---------------------------------GDS 162
G +E + R +SYR GF+PI +
Sbjct: 75 GDSEVQKQVKKRYWMSYRSGFEPIKKHEDGPSPLSFVQSMIFNKNVGNTFANIHSLVDND 134
Query: 163 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 221
T+DVGWGCM+R+SQ ++A A+ DR E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177
Query: 222 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 278
S+HN ++ L G W GP A S + L + + T ++P+++ V SGD
Sbjct: 178 SLHNFVKVASDSPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232
Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
DD Q P+LLL+PL LG++ VN Y +L
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
PQS GI GGKP +S Y G Q S +YLDPH Q V
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV 306
>gi|431896953|gb|ELK06217.1| Cysteine protease ATG4C [Pteropus alecto]
Length = 378
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 75/260 (28%), Positives = 106/260 (40%), Gaps = 68/260 (26%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
AGN + EF +DF SRI ++YR+ F I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45 AGN--VEEFRKDFISRIWLTYREEFPSIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102
Query: 192 RPWRKP----------------------------------LQKPF------------DRE 205
R W P L+ P D E
Sbjct: 103 RAWTWPDALNIDNSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGRYSDDHE 162
Query: 206 ------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
+ +I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 163 MQNEIYHRKIISWFGDSPLALFGLHQLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
G + IYV V D + C+ + D +++LVP+ L
Sbjct: 223 PELQG-----ITIYVAQ--------DCTVYSSDVIDKQCASMAPDITDDKAVIILVPVRL 269
Query: 320 GLEKVNPRYIPTLRLTFTFP 339
G E+ N Y+ ++ TF P
Sbjct: 270 GGERTNIDYLEFVK-TFHCP 288
>gi|440789707|gb|ELR11008.1| cysteine protease atg4a, putative, partial [Acanthamoeba
castellanii str. Neff]
Length = 180
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 38/75 (50%), Positives = 55/75 (73%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W P+++LVP+ LG++ +NP YIPTL+ F+FPQ LG++GGKP +S Y VG Q+ +Y+D
Sbjct: 11 WHPVIILVPVRLGIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMD 70
Query: 368 PHDVQPVINIGKDDL 382
PH VQP + + D L
Sbjct: 71 PHFVQPTVKMDDDPL 85
>gi|384493397|gb|EIE83888.1| hypothetical protein RO3G_08593 [Rhizopus delemar RA 99-880]
Length = 194
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 78/158 (49%), Gaps = 27/158 (17%)
Query: 113 IWLLGVCHKI--------AQDEALGDAAGNNGLA----------------EFNQDFSSRI 148
IWLLG + I A EA D N G + +F DF+SR+
Sbjct: 29 IWLLGCSYIIKPTDHIQQALLEAQRDLMFNKGSSENEEENNQNMHMLWPPDFYDDFTSRL 88
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-KPFDREYV 207
++YR + PI S +D+GWGC LRS Q L+A L+ H LGR WR+ Q + ++Y
Sbjct: 89 WMTYRHNYPPIRPSSHKTDIGWGCTLRSGQSLLANTLIIHFLGRDWRRQTQNQAAWKQYS 148
Query: 208 EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 243
I+H F D S +PFSIH + GK G G W GP
Sbjct: 149 RIVHWFLDELSPRAPFSIHRIALLGKQLGKNIGEWFGP 186
>gi|148693225|gb|EDL25172.1| autophagy-related 4D (yeast), isoform CRA_a [Mus musculus]
Length = 296
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 87/177 (49%), Gaps = 17/177 (9%)
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
DR + I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 31 DRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP-------SVVAHILRKAVE 83
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
C + + VS D + D +R S + A+W +++LVP+ LG E
Sbjct: 84 -SCSEVSRLVVYVSQDCTVYKA--------DVARLLS-WPDPTAEWKSVVILVPVRLGGE 133
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
+NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP +++ +
Sbjct: 134 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQ 190
>gi|440297742|gb|ELP90383.1| cysteine protease atg4, putative [Entamoeba invadens IP1]
Length = 330
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/234 (29%), Positives = 103/234 (44%), Gaps = 26/234 (11%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDR 204
I ++YRK + + TSD GWGCM+RS QM +AQ+ + +G W + Q ++
Sbjct: 38 IWVTYRKNMKELPGGR-TSDSGWGCMIRSMQMALAQSFVSLVMGNSWKFTKTGFQVERNK 96
Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
++ I++LFGD S FSIHNL+ G+ G W GP S+ + T
Sbjct: 97 FHLRCIINLFGDGPGSLFSIHNLISRSTTRGVGDGKWWGP-----SFASEIAADHLNT-- 149
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
I+V R G V S+ + P ++ VPL LG
Sbjct: 150 --------IHVFRTRGYVARLGRIV------KPDILDISEDNGNILPTIIFVPLRLGPVN 195
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
+ P L+ F PQ +G+VGGKP + + YLDPH Q +++
Sbjct: 196 AEEDFRPILKKVFDIPQCVGMVGGKPNLAFFFHTFDGNLLYYLDPHTTQNAVSM 249
>gi|429850312|gb|ELA25600.1| cysteine protease atg4 [Colletotrichum gloeosporioides Nara gc5]
Length = 411
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 77/280 (27%), Positives = 104/280 (37%), Gaps = 83/280 (29%)
Query: 138 AEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVGWGCML 174
A F DF S+ ++YR F DP + S +SD GWGCM+
Sbjct: 109 AAFLDDFESKFWMTYRSEFELIAKSTDPRASSALSLSMRIKSQLVDQSGFSSDSGWGCMI 168
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYG 234
RS QML+A A+ LGR A G
Sbjct: 169 RSGQMLLANAMAITNLGR--------------------------------------VACG 190
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R ++L Q + + +Y G P V D
Sbjct: 191 KYPGEWFGPSATARCIQSLTNAQEQPS--------LRVYST--------GDGPDVYED-- 232
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ + + P L+LV LG++K+ P Y L PQS+GI GG+P AS Y
Sbjct: 233 -KFMKIAKPDGTRFHPTLILVGTRLGIDKITPVYWDALIAALQMPQSVGIAGGRPSASHY 291
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHS 391
+G Q YLDPH +P + D +AD T H+
Sbjct: 292 FIGAQGSFLFYLDPHHTRPALPYHSDPSRYTDADIDTAHT 331
>gi|367014015|ref|XP_003681507.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
gi|359749168|emb|CCE92296.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
Length = 460
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/265 (31%), Positives = 119/265 (44%), Gaps = 52/265 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 169
+F D SR+ +YR F PI G S ++ +D+G
Sbjct: 60 QFLSDVHSRLHFTYRTKFVPIPRVSDGPSPLSFHFLIRENPLTTIENAIYNPDCFNTDIG 119
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + + D+E +I+ F D+ + FSIHN +
Sbjct: 120 WGCMIRTGQSLLGNALQIANLGRDFR--VNQGKDQEEYKIIDWFADTPQAHFSIHNFVSQ 177
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G K G W GP A RS + L Q + G+ I V SGD
Sbjct: 178 GLKLSNKKPGEWFGPAATSRSIQCLVE-QFPDCGID----KCLISVSSGD---------- 222
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+D R +F+ Q + ILLL+ + LG+ VN Y ++ T S+GI GG+
Sbjct: 223 -VFEDEVRE--IFA--QKPQSRILLLLGVKLGVNAVNEYYWDDVKKTLGSKFSVGIAGGR 277
Query: 349 PGASTYIVGVQEESAIYLDPHDVQP 373
P +S Y +G Q IY DPH QP
Sbjct: 278 PSSSLYFMGFQGNELIYFDPHTPQP 302
>gi|146420060|ref|XP_001485988.1| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
6260]
Length = 402
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 116/276 (42%), Gaps = 81/276 (29%)
Query: 136 GLAEFNQDFSSRILISYRKGFDPI---------------------------------GDS 162
G E + R +SYR GF+PI +
Sbjct: 75 GDLEVQKQVKKRYWMSYRLGFEPIKKHEDGPLPLSFVQSMIFNKNVGNTFANIHSLVDND 134
Query: 163 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 221
T+DVGWGCM+R+SQ ++A A+ DR E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177
Query: 222 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 278
S+HN ++ L G W GP A S + L + + T ++P+++ V SGD
Sbjct: 178 SLHNFVKVASDLPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232
Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
DD Q P+LLL+PL LG++ VN Y +L
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
PQS GI GGKP +S Y G Q S +YLDPH Q V
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV 306
>gi|66810578|ref|XP_638996.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
gi|60467622|gb|EAL65643.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
Length = 551
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 69/119 (57%), Gaps = 5/119 (4%)
Query: 137 LAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
+ EF DF++R+L +YR+GF I D+ +D GWGCMLRS QML++ LL + LG W+
Sbjct: 140 IKEFLNDFTTRVLWFTYRQGFPCIDDTMYDNDCGWGCMLRSGQMLLSNVLLHNILGDEWK 199
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 254
+ + +I+ +F D ++PFSIHN+ G+ G G W P + ++ + L
Sbjct: 200 RSSSAT----HPDIISMFLDKPSAPFSIHNIAMEGQNLGKNIGEWFAPSIISQTIKILV 254
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 48/70 (68%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W P+L+L+P+ LGL+ +N Y +L F FPQ+LG+VGGKP AS Y + Q+++ YLD
Sbjct: 383 WEPLLILIPMRLGLDGLNSIYHSSLLEIFKFPQNLGVVGGKPRASLYFIAAQDDNLFYLD 442
Query: 368 PHDVQPVINI 377
PH VQ I +
Sbjct: 443 PHTVQNHIEV 452
>gi|291059129|gb|ADD71908.1| autophagy protein 4 [Acanthamoeba castellanii]
Length = 373
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 68/117 (58%), Gaps = 4/117 (3%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
F DF SR+ ++YR F IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR +
Sbjct: 147 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 206
Query: 200 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEAL 253
+ Y E+L F D S SP+SIH + + G + + G W P + + L
Sbjct: 207 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLL 262
>gi|400593108|gb|EJP61110.1| peptidase family C54 [Beauveria bassiana ARSEF 2860]
Length = 378
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 95/240 (39%), Gaps = 71/240 (29%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGW 170
N +F DF SR ++YR F PI SK +SD GW
Sbjct: 109 NGWPQQFITDFDSRFWMTYRNDFKPIPRSKDPKAASSMSFPMRIKYQLGDQGGFSSDSGW 168
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
GCM+RS Q L+A A RLGR WR+ QK E ++I+ +F D +P+SIHN + G
Sbjct: 169 GCMIRSGQSLLANATGIVRLGRDWRRGQQK---AEEIKIMRMFADDPAAPYSIHNFVDYG 225
Query: 231 KAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
+ G G W GP A +
Sbjct: 226 SSKCGKYPGEWFGPSATSQ----------------------------------------- 244
Query: 290 CIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
CI+ S + ++D + P L+L+ LG++K+ Y L PQS+GI G
Sbjct: 245 CINPDVYEDSFMATAKSDHGFFKPTLILISTRLGIDKITQVYWEALISALQMPQSVGIAG 304
>gi|149020503|gb|EDL78308.1| rCG31864, isoform CRA_a [Rattus norvegicus]
Length = 256
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 81/176 (46%), Gaps = 45/176 (25%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISSVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP 248
>gi|154419947|ref|XP_001582989.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121917228|gb|EAY22003.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 284
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 93/218 (42%), Gaps = 33/218 (15%)
Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 211
YR + +S +T+D GWGC RS+Q L+ Q +L +L R +R + F + V L
Sbjct: 25 YRYNLSDLANSLLTTDKGWGCCFRSTQGLLCQYIL--KLHRKFRSLYDQVFGQN-VNPLD 81
Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
LF D ++PF I NL + A GL G W P M A + L C
Sbjct: 82 LFLDIPSAPFGIQNLTKNAFAIGLPVGEWAKPSIM----AATIKLIFDTLNLSC------ 131
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
++S D + +H P L+L+P + GL K++ Y+
Sbjct: 132 --IISQDLTLDSNDI---------KHTKY---------PALILIPSLFGLSKMDDSYLSF 171
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
L L SLG V G+ ++ Y VG E Y DPH
Sbjct: 172 LLLCLCIESSLGFVSGQNASAYYFVGFDLEDFYYFDPH 209
>gi|167521501|ref|XP_001745089.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776703|gb|EDQ90322.1| predicted protein [Monosiga brevicollis MX1]
Length = 392
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 107/237 (45%), Gaps = 54/237 (22%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 198
+ D ++RI +YRK F P+ S+ T+DVGWGCMLR QM++A L+ +
Sbjct: 119 QLEDDVATRIWFTYRKDFPPLPSSRRTTDVGWGCMLRCGQMILATTLM----------AV 168
Query: 199 QKPFDREYVEILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
+P +H+LL+ + + L AG + GP ++ +
Sbjct: 169 LQP--------------------RVHHLLKYTMENHHLKAGRFQGPSSVGSAL------- 201
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERG-----GAPVVCIDDASRHCSVFSKGQADWTPIL 312
+P A+ ++ D E + + I D R +GQA++ PI+
Sbjct: 202 -------LHQVPSALAQLNQFRDEEVKLRTYFASDTLVILDQLRP----EEGQAEFEPIM 250
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
L++PL LG+EK+ P+Y L+L P +G +GG + YI G Q LDPH
Sbjct: 251 LVLPLRLGIEKIGPQYHARLQLLLRQPWCMGFIGGHDKRAMYIFGYQGHQYFGLDPH 307
>gi|37360148|dbj|BAC98052.1| mKIAA0943 protein [Mus musculus]
gi|148707989|gb|EDL39936.1| autophagy-related 4B (yeast), isoform CRA_d [Mus musculus]
Length = 266
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 43/106 (40%), Positives = 61/106 (57%), Gaps = 7/106 (6%)
Query: 293 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 71 DSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 130
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
GKP ++ Y +G E IYLDPH QP + + D S +H +
Sbjct: 131 GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDES-FHCQ 175
>gi|312378951|gb|EFR25375.1| hypothetical protein AND_09326 [Anopheles darlingi]
Length = 350
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 67/103 (65%), Gaps = 2/103 (1%)
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
QD SR+ +YR+GF PIG++++T+D GWGCMLR QM++A+AL LGR W+ ++
Sbjct: 72 QDVQSRLWCTYRRGFVPIGNTQLTTDKGWGCMLRCGQMVLAEALTELHLGRDWQWS-EET 130
Query: 202 FDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGP 243
D Y++I++ F D++ +PFS+H + L + G W GP
Sbjct: 131 RDATYLKIVNRFEDNKQAPFSLHQIALMGDSSEEKRIGEWFGP 173
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/59 (47%), Positives = 37/59 (62%)
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
L +VNP YI L+ F P S G++GG+P + Y +G E A+YLDPH VQ V IG+
Sbjct: 180 LNEVNPIYIEGLKKCFQLPGSCGMIGGRPNQALYFIGYVGEEALYLDPHTVQRVGCIGE 238
>gi|298712912|emb|CBJ33424.1| Autophagy-related protein 4 [Ectocarpus siliculosus]
Length = 546
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 70/138 (50%), Gaps = 23/138 (16%)
Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
W++G+ + ++E E D S + I+YR GF + T D GWGCM
Sbjct: 38 WIMGIPYTELREE------------ERRLDVFSTMWITYRSGFPKMEPYGYTDDSGWGCM 85
Query: 174 LRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGD--SETSPFSIHN 225
LRS+QML+ QAL H LGR WR P L+ P EY ++ LF D E + FSIHN
Sbjct: 86 LRSAQMLMTQALQRHTLGRSWRVPRTLEERLRVP---EYRTLVRLFADHPGEANLFSIHN 142
Query: 226 LLQAGKAYGLAAGSWVGP 243
+ Q G Y G W GP
Sbjct: 143 MCQVGIRYDKLPGEWYGP 160
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 46/69 (66%)
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
++LLVPL LGL++++ YIP+L T PQSLG +GG+P + + +G Q + LDPH
Sbjct: 380 VVLLVPLRLGLDELSTGYIPSLLETLRVPQSLGFLGGRPNHAIFFIGAQGNTLTGLDPHT 439
Query: 371 VQPVINIGK 379
QP ++G+
Sbjct: 440 TQPAADMGE 448
>gi|123479730|ref|XP_001323022.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121905878|gb|EAY10799.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 284
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 96/218 (44%), Gaps = 33/218 (15%)
Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 211
YR F I +S ++ D GWGC RSSQ LV Q +L RL + + F + L
Sbjct: 25 YRNNFQAIENSTLSCDSGWGCCFRSSQGLVCQYIL--RLHKNFPDLYNSTFGID-KNPLD 81
Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
LF D +PF I N++ + GL G+W P + +++++ + L C
Sbjct: 82 LFLDIPEAPFGIQNIVTHANSLGLPIGNWAKPSIIASAYKSIFQ----SLHLNC------ 131
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
+V D ++ + ++ P+L+L+P + GLEK+ YI
Sbjct: 132 --IVPQDSTF------------------IYEELESTNYPVLILIPGLFGLEKIEKPYISF 171
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+ L+ SLG V G ++ Y +G + Y DPH
Sbjct: 172 IFLSLCMNSSLGFVSGHNDSAFYFIGFDSDYFYYFDPH 209
>gi|145553267|ref|XP_001462308.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430147|emb|CAK94935.1| unnamed protein product [Paramecium tetraurelia]
Length = 389
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 76/284 (26%), Positives = 113/284 (39%), Gaps = 43/284 (15%)
Query: 130 DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-LFH 188
D A + + + F I SYR + S +TSD GWGCMLR QM + Q + F+
Sbjct: 47 DLAVDQKMEKLKSLFEGTIWFSYRSKILQLQYSTLTSDTGWGCMLRVGQMAMCQQIKYFY 106
Query: 189 RLGRPWRKPLQKPFDREYVEILHLFGDSE-------------------TSPFSIHNLL-Q 228
L +E E++ F D++ SPFSI ++ Q
Sbjct: 107 NLSSS----------QELTELIQQFADNDEEELSKFMDRNDGDQTIQYKSPFSIQKIVVQ 156
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED---GERGG 285
+ G W P + + L R + + L +++S + GG
Sbjct: 157 TKLELQKSPGEWYKPNDILFVLKYLFRYSKYQKNLRMHINHENAFILSDVISLMFNKNGG 216
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
D KGQ D + + + +GL+ N Y+ L T+PQ GI+
Sbjct: 217 -------DEEWLKEQIEKGQNDEFGVSIFILTRIGLDTCNQEYLKVLNDIMTYPQFQGIL 269
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
GG P + YI+G IYLDPH VQ N ++E D S+Y
Sbjct: 270 GGFPNKALYILGRVGNYYIYLDPHYVQNAQNY--QEMENDRSSY 311
>gi|90080692|dbj|BAE89827.1| unnamed protein product [Macaca fascicularis]
Length = 263
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/89 (46%), Positives = 55/89 (61%), Gaps = 6/89 (6%)
Query: 293 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 68 DSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 127
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVI 375
GKP ++ Y VG E IYLDPH QP +
Sbjct: 128 GKPNSAHYFVGYVGEELIYLDPHTTQPAV 156
>gi|261335715|emb|CBH18709.1| peptidase, putative [Trypanosoma brucei gambiense DAL972]
Length = 348
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 112/243 (46%), Gaps = 34/243 (13%)
Query: 136 GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
G AE + + ++L SYR F+P+ + T+D+GWGC +R+ QM++A AL+ ++ G
Sbjct: 37 GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93
Query: 195 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
F+ V L HLF D ++PF IH + G +G GSW GP +
Sbjct: 94 ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
AL M Y+ SG + G V+ + D K
Sbjct: 150 MGAL----------------MEDYLSSGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+LLL+P++LG ++ Y L+ ++G VGGK G++ + +G Q + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248
Query: 370 DVQ 372
Q
Sbjct: 249 YAQ 251
>gi|145510316|ref|XP_001441091.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408330|emb|CAK73694.1| unnamed protein product [Paramecium tetraurelia]
Length = 392
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 122/286 (42%), Gaps = 41/286 (14%)
Query: 129 GDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
DA + + Q S I SYRK S +TSD GWGCM+R +QM +AQ +
Sbjct: 46 NDADIEQRIEKVKQTCSKIIWFSYRKNIPKFQVSSLTSDTGWGCMIRVAQMALAQII--- 102
Query: 189 RLGRPWRKPLQ-----KPF----DREYVEILHLFGDSET----SPFSIHNLLQAGKA-YG 234
R ++KP Q + F D E + + F ++ +PFSI ++ K
Sbjct: 103 RYYNYFKKPEQLIVLIRHFIDDDDNELTDFIQQFHKNQNQYYHAPFSIQKIVHYAKVELK 162
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----------VSGDEDGER 283
G W + ++ + L + + SL M IY+ + + +
Sbjct: 163 KEPGDWYKSDEILQTLDYLFKYSQY-------SLNMEIYINYDCAFILQDAIQQMFNQQE 215
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
G + + + +++ + F D I + +P +GL+ +N Y+ L P G
Sbjct: 216 GNE--IWLKERAKNNNQFDL--QDHKGICIFLPTRIGLQNINKDYLEVLNQIIALPYFQG 271
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
++GG + Y VG ++ IYLDPH VQ N DDL + ++Y
Sbjct: 272 MIGGVSKRALYFVGRIQDYLIYLDPHFVQNAQNF--DDLSKNQASY 315
>gi|345311182|ref|XP_001519565.2| PREDICTED: cysteine protease ATG4D-like, partial [Ornithorhynchus
anatinus]
Length = 147
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 32/136 (23%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----R 195
F +DF SR+ ++YR+ F P+ S TSD GWGCMLRS QML+AQ L+ H L R W
Sbjct: 5 FQRDFVSRLWLTYRRDFPPLEGSAWTSDCGWGCMLRSGQMLLAQGLVVHLLSRDWIWAEA 64
Query: 196 KPLQKP----------------------------FDREYVEILHLFGDSETSPFSIHNLL 227
P KP +R++ I+ F D +PFS+H L+
Sbjct: 65 GPAPKPGEHRLLKSDPGGPSRSPAPPPPAGVLQEQERQHRRIVSWFADHPQAPFSLHRLV 124
Query: 228 QAGKAYGLAAGSWVGP 243
+ G+ G AG W GP
Sbjct: 125 RLGQGSGKRAGDWYGP 140
>gi|119623097|gb|EAX02692.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_d
[Homo sapiens]
Length = 172
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 69/127 (54%), Gaps = 11/127 (8%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + + +
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKMCRV 141
Query: 233 YGLAAGS 239
L+A +
Sbjct: 142 LPLSADT 148
>gi|145526665|ref|XP_001449138.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416715|emb|CAK81741.1| unnamed protein product [Paramecium tetraurelia]
Length = 406
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 76/280 (27%), Positives = 114/280 (40%), Gaps = 57/280 (20%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
N + + QD I I+YR+ F P+ S SD GWGCMLR QM +AQ L H
Sbjct: 57 NKIKQLVQD---TIWITYRRNFPPLYQSNYISDTGWGCMLRVGQMAMAQMLKKHLKNHGD 113
Query: 195 RKPLQKPFDREYVEILHLFGDSETS----------------------PFSIHNL-LQAGK 231
++ D +Y IL F D+++ PFSI + A K
Sbjct: 114 KR------DEDYDNILLAFADNDSQECKEFIEFQNKKEKQKVHNFICPFSIQKIAYLAKK 167
Query: 232 AYGLAAGSWVGP-YAM-----------CRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
+ L G W P Y + R+ E L ++ L L ++ + +
Sbjct: 168 EFNLDPGEWYKPNYILFLLEELHNTIPIRASENLKLSVFNDSCLFLDQLMNRMFDIKFET 227
Query: 280 DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
D + +++ + SK + + V +GL++ N +Y+ L P
Sbjct: 228 DKD--------LEEQLEKTQLKSKN-----SLAIFVLTRIGLDEPNQKYLKVLDELMELP 274
Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
GIVGG P + YI+G + IYLDPH VQ N G+
Sbjct: 275 YFQGIVGGTPKRAFYILGRINDHYIYLDPHYVQEAENKGQ 314
>gi|159465677|ref|XP_001691049.1| autophagy protein [Chlamydomonas reinhardtii]
gi|158279735|gb|EDP05495.1| autophagy protein [Chlamydomonas reinhardtii]
Length = 484
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 68/120 (56%), Gaps = 12/120 (10%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQ--------ALLFHRLGRPW 194
DF SR+ +YRK F +G S +TSDVGWGC LRS QML+A+ A++ LGR W
Sbjct: 49 DFRSRMWCTYRKDFPALGPSLLTSDVGWGCTLRSGQMLLAEVRHGWRAGAMMRVALGRDW 108
Query: 195 RKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 253
++ + E V ++ D +P SIH + AG G+ G W+GP+ +C+ EAL
Sbjct: 109 QRCSD---NLEAVRPVVAALLDCAEAPLSIHRICDAGGPAGIVPGRWLGPWMLCKGLEAL 165
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 31/53 (58%), Positives = 42/53 (79%)
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
G++K+NP YIP L+ ++PQS+GIVGG+P AS Y+ GVQ+ S IYLDPH+ Q
Sbjct: 339 GMDKINPVYIPQLQQVLSWPQSVGIVGGRPSASLYVCGVQDASFIYLDPHEAQ 391
>gi|157872135|ref|XP_001684616.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
gi|68127686|emb|CAJ05824.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
Length = 394
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 70/225 (31%), Positives = 102/225 (45%), Gaps = 33/225 (14%)
Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+ET+PFSIHN++++ + + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRVFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
Y+ +L PQ LG+VGG PG S Y + YLDPH
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPH 240
>gi|74026240|ref|XP_829686.1| peptidase [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|70835072|gb|EAN80574.1| peptidase, putative [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 348
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 112/243 (46%), Gaps = 34/243 (13%)
Query: 136 GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
G AE + + ++L SYR F+P+ + T+D+GWGC +R+ QM++A AL+ ++ G
Sbjct: 37 GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93
Query: 195 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
F+ V L HLF D ++PF IH + G +G GSW GP +
Sbjct: 94 ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
AL M Y+ +G + G V+ + D K
Sbjct: 150 MGAL----------------MEDYLRNGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+LLL+P++LG ++ Y L+ ++G VGGK G++ + +G Q + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248
Query: 370 DVQ 372
Q
Sbjct: 249 YAQ 251
>gi|401425377|ref|XP_003877173.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322493418|emb|CBZ28705.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 394
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 70/225 (31%), Positives = 101/225 (44%), Gaps = 33/225 (14%)
Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRAFKAEYWSPSQGC---EAIKRTMQG--AVKT 148
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
Y+ +L PQ LG+VGG PG S Y + YLDPH
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPH 240
>gi|221046296|dbj|BAH14825.1| unnamed protein product [Homo sapiens]
Length = 280
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 60/199 (30%), Positives = 89/199 (44%), Gaps = 44/199 (22%)
Query: 79 TAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
A ++ L AG + + SRT S +S + +C + + E GD +
Sbjct: 84 VAVMQVLHLAGRCPYVSPGWVVKSRTSFSKISS----IHLCGRRYRFEGEGD------IQ 133
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---- 194
F +DF SR+ ++YR+ F P+ +TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 RFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAE 193
Query: 195 ---------------------------RKPLQKP---FDREYVEILHLFGDSETSPFSIH 224
R P +R + +I+ F D +PF +H
Sbjct: 194 GMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLH 253
Query: 225 NLLQAGKAYGLAAGSWVGP 243
L++ G++ G AG W GP
Sbjct: 254 RLVELGQSSGKKAGDWYGP 272
>gi|146093458|ref|XP_001466840.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
gi|134071204|emb|CAM69889.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
Length = 394
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/225 (31%), Positives = 101/225 (44%), Gaps = 33/225 (14%)
Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
Y+ +L PQ LG+VGG PG S Y + YLDPH
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPH 240
>gi|255722127|ref|XP_002545998.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240136487|gb|EER36040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 444
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 117/267 (43%), Gaps = 65/267 (24%)
Query: 146 SRILISYRKGFDPIGDSK----------------------------------ITSDVGWG 171
SR+ +SYR GFDPI ++ TSD GWG
Sbjct: 84 SRLWLSYRCGFDPIPKAEDGPQPIQFFPSIIFNKTTIYSNFANLKSLFDKENFTSDAGWG 143
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AG 230
CM+R+SQ L+A LL P D + +++ LF D+++SPFSIHN ++ AG
Sbjct: 144 CMIRTSQNLLANTLL-----------QLLPPDSKQ-DVIGLFQDNQSSPFSIHNFIKVAG 191
Query: 231 KA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
++ + G W GP A S + L + + G + + I S DGE
Sbjct: 192 ESPLQVKPGQWFGPNAASLSIKRLTDTLQDKEIKGVKYPKVFISENSDLYDGEINEI--- 248
Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
+ +G++ +L+L P+ LG++KVN Y ++ S GI GGKP
Sbjct: 249 ----------LSEEGRS----VLVLFPIRLGIDKVNSYYYDSIFQVLKSKFSCGISGGKP 294
Query: 350 GASTYIVGVQEESAIYLDPHDVQPVIN 376
+S Y +G IY DPH Q V N
Sbjct: 295 SSSFYFLGYDNSDLIYFDPHLPQLVEN 321
>gi|398019156|ref|XP_003862742.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
gi|322500973|emb|CBZ36050.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
Length = 394
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/225 (31%), Positives = 101/225 (44%), Gaps = 33/225 (14%)
Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSANG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
Y+ +L PQ LG+VGG PG S Y + YLDPH
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPH 240
>gi|256078123|ref|XP_002575347.1| autophagin-1 (C54 family) [Schistosoma mansoni]
gi|360045353|emb|CCD82901.1| autophagin-1 (C54 family) [Schistosoma mansoni]
Length = 556
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 69/122 (56%), Gaps = 4/122 (3%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 196
E + +SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR W+
Sbjct: 37 EIARHLNSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRFHLGRSWKWS 96
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
P Q+ EY +L +F D ++ +SI + G + G + GSW GP + + + L+
Sbjct: 97 PEQE--SPEYYRLLQMFQDRRSALYSIQTITLTGVSLGKSIGSWFGPNTVAQVLKKLSVY 154
Query: 257 QR 258
R
Sbjct: 155 DR 156
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 34/58 (58%), Gaps = 4/58 (6%)
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TSTYHSE 392
F P +GI+GG P + +IVGV ++ I LDPH QP G+ +L+ D TYH +
Sbjct: 351 FRLPHCVGILGGSPCHAVWIVGVTDDDVICLDPHTTQPA---GRGNLKPDYDQTYHCD 405
>gi|363754893|ref|XP_003647662.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891299|gb|AET40845.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
DBVPG#7215]
Length = 469
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 123/284 (43%), Gaps = 55/284 (19%)
Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKI------------------------TSDVG 169
EF +D +SR+ +YR F PI G S + +D+G
Sbjct: 62 EFLKDVNSRLHFTYRTRFAPIPRHIDGPSPMRISILLRDNPLNVIENVLNNLDCFQTDIG 121
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
WGCM+R+ Q L+A AL LGR +R + ++I+ F D+ PFS+H +Q
Sbjct: 122 WGCMIRTGQSLLANALQLANLGRDFRISGSDSDINEVEMKIIRWFEDNPKHPFSLHKFVQ 181
Query: 229 AG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 287
G K G G W GP A+ RS +L C ++S D +
Sbjct: 182 EGYKLSGKKPGEWFGPSAISRSIRSLVMKFPGSGIDHC--------IISTD-------SA 226
Query: 288 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 347
V +D+ K LLL+ + LG++ N Y ++ + QS+GI GG
Sbjct: 227 DVYLDEIDPLFRANPKANV-----LLLLGVRLGVDFTNEYYWDDIKNILSSSQSVGISGG 281
Query: 348 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
+P +S Y G Q + YLDPH VQ +N+ E+D +HS
Sbjct: 282 RPSSSLYFFGYQGDYLFYLDPHKVQ--LNLAL--YESDEERFHS 321
>gi|71043632|ref|NP_001020882.1| cysteine protease ATG4B [Rattus norvegicus]
gi|68533688|gb|AAH98833.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Rattus
norvegicus]
Length = 224
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 59/106 (55%), Gaps = 7/106 (6%)
Query: 293 DASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
++ RHC+ G W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 29 ESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 88
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSE 392
GKP ++ Y +G E IYLDPH QP + + D S +H +
Sbjct: 89 GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDES-FHCQ 133
>gi|351695136|gb|EHA98054.1| Cysteine protease ATG4A [Heterocephalus glaber]
Length = 356
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 111/280 (39%), Gaps = 89/280 (31%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 79 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 127
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR Q G
Sbjct: 128 MLRCGQMMLAQALICRHLGRA----------------------------------QMGVG 153
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 154 EGKSVGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 196
Query: 293 DASRHCSV--FSKGQAD----------------------WTPILLLVPLVLGLEKVNPRY 328
D + C + FS AD W P+LL+VPL LG+ ++NP Y
Sbjct: 197 DIKKMCRILPFSADTADESPPDSFITSNQSKGTSAFCPAWKPLLLIVPLRLGINQINPVY 256
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
+ + TF + G V + + Q+ + + LDP
Sbjct: 257 VDAFK-TFVDTEENGTVDDQ--TFHCLQSPQQMNILNLDP 293
>gi|257205644|emb|CAX82473.1| autophagy-related cysteine endopeptidase 2 [Schistosoma japonicum]
Length = 632
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 196
E SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR WR
Sbjct: 43 EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
P Q+ EY +L +F D + +SI + G + G + GSW GP + + + L+
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160
Query: 257 QR 258
R
Sbjct: 161 DR 162
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 51/88 (57%), Gaps = 4/88 (4%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
++W P+LL VPL LGL NP Y ++ F P +GI+GG P + +IVGV + I
Sbjct: 385 SNWRPLLLFVPLRLGLHNPNPCYFNAIKAVFRLPNCIGILGGSPCHAVWIVGVTGDDVIC 444
Query: 366 LDPHDVQPVINIGKDDLEAD-TSTYHSE 392
LDPH QP G+ +L+ D TYH E
Sbjct: 445 LDPHTTQPA---GRGNLKPDYDQTYHCE 469
>gi|76156435|gb|AAX27646.2| SJCHGC05841 protein [Schistosoma japonicum]
Length = 414
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 196
E SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR WR
Sbjct: 43 EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
P Q+ EY +L +F D + +SI + G + G + GSW GP + + + L+
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160
Query: 257 QR 258
R
Sbjct: 161 DR 162
>gi|255082892|ref|XP_002504432.1| predicted protein [Micromonas sp. RCC299]
gi|226519700|gb|ACO65690.1| predicted protein [Micromonas sp. RCC299]
Length = 196
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/66 (59%), Positives = 49/66 (74%), Gaps = 1/66 (1%)
Query: 308 WTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
W P+++LVPLVLGL++ VNPRY+P + PQS+GI+GGKP AS Y VG Q+E YL
Sbjct: 75 WAPLVILVPLVLGLDRCVNPRYVPGIVRMLGLPQSVGILGGKPCASLYFVGAQDEELFYL 134
Query: 367 DPHDVQ 372
DPH VQ
Sbjct: 135 DPHTVQ 140
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/45 (57%), Positives = 31/45 (68%)
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
F SR+ I+YR+GF IG T+D GWGC LRS QML+A AL H
Sbjct: 1 FHSRVWITYRRGFPQIGGGTYTTDAGWGCTLRSGQMLLANALQSH 45
>gi|448509127|ref|XP_003866066.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
gi|380350404|emb|CCG20626.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
Length = 419
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 72/220 (32%), Positives = 99/220 (45%), Gaps = 33/220 (15%)
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
R FD + TSD GWGCM+R+SQ L+A AL K + +EIL L
Sbjct: 130 RSLFD---NENFTSDAGWGCMIRTSQNLLANAL---------LKLAGEANGNVQLEILKL 177
Query: 213 FGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
F D + FSIHN ++ A L+ G W GP A S L + Q P
Sbjct: 178 FQDDPNAAFSIHNFIRVASASPLSVKPGQWFGPNAASISIRQLT------IEMTDQESPT 231
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
+ V E+ + DD + K P+LLL P+ LG++ VN Y
Sbjct: 232 VVPFVYISENAD-------LYDDEIEETFLKEK-----RPLLLLFPVRLGIDHVNKYYYK 279
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPH 369
++ S+GI GGKP +S Y +G + +E+ IY DPH
Sbjct: 280 SILQLLASRFSVGIAGGKPSSSFYFIGYENDENLIYFDPH 319
>gi|402581511|gb|EJW75459.1| peptidase family C54 containing protein [Wuchereria bancrofti]
Length = 256
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 64/122 (52%), Gaps = 12/122 (9%)
Query: 128 LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
LG+ + G +A + +S + +YRK F PIG + T+D GWGCMLR QML+A+ L+
Sbjct: 30 LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 89
Query: 187 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 241
LG W +DR EY IL +F D + FSIH + G + G G W
Sbjct: 90 VRHLGHNWL------WDRDVKLTEYKRILRMFQDKKNCLFSIHQIANMGVSEGKEIGEWF 143
Query: 242 GP 243
GP
Sbjct: 144 GP 145
>gi|432110194|gb|ELK33968.1| Cysteine protease ATG4A, partial [Myotis davidii]
Length = 256
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 11/114 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH +
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQM 129
>gi|336368847|gb|EGN97189.1| cysteine protease required for autophagy [Serpula lacrymans var.
lacrymans S7.3]
Length = 873
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/244 (31%), Positives = 106/244 (43%), Gaps = 50/244 (20%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 164
G+N F DF+SRI ++YR F PI DS +
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350
Query: 165 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRP-WRKPLQKPFDRE---YVEILHLFG 214
TSD GWGCMLR+ Q L+A ALL LGR WR+P + YV+I+ F
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRADWRRPPYPVHTTDYATYVQIITWFF 410
Query: 215 D--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI 272
D S SPFS+H + AGK G G W GP + + L E GLG +
Sbjct: 411 DTPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGV 469
Query: 273 YVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
S + A I RH V G+A +++L+ + LGL+ VNP Y T+
Sbjct: 470 IFQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTI 520
Query: 333 RLTF 336
+++
Sbjct: 521 KVSI 524
>gi|72389991|ref|XP_845290.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359288|gb|AAX79730.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei]
gi|70801825|gb|AAZ11731.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
Length = 327
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 114/230 (49%), Gaps = 38/230 (16%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+S L +YR+ FDP+ S +TSD GWGC+ R++QML+A +L R+ +
Sbjct: 41 NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 262
+Y L D + +PFS+H +++ + L G + P +A + EA++ C + T
Sbjct: 92 QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144
Query: 263 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
G S P+++ + V+G E V C SR+ +L+L PL G
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187
Query: 322 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPH 369
+ ++ + +L P+S+G+VGG P YI+G +E +YLDPH
Sbjct: 188 SRYMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPH 237
>gi|389602150|ref|XP_001566661.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322505338|emb|CAM40177.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 398
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 99/224 (44%), Gaps = 31/224 (13%)
Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
++ +YR GF+ P I +D GWGC+LR+SQML+A L + GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWAY--GRPADRRLALFFDH- 102
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+ET+PFSIHNL+++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNLIRSVWNQRAFKAEYWSPSQGC---EAIKRTM--QDAIKT 148
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 325
+ L + VV+ C+ H F +G A+ +L V + +
Sbjct: 149 EQLQTRVTVVTSTNG---------CVYADEVH-HTFKQG-AEVVLVLASVRVSAAAQLTQ 197
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
Y+ +L PQ LGIVGG PG S Y + YLDPH
Sbjct: 198 ESYLQIEKL-MEQPQCLGIVGGVPGRSYYFFAHNQTQLFYLDPH 240
>gi|261328682|emb|CBH11660.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
gambiense DAL972]
Length = 327
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 114/230 (49%), Gaps = 38/230 (16%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+S L +YR+ FDP+ S +TSD GWGC+ R++QML+A +L R+ +
Sbjct: 41 NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 262
+Y L D + +PFS+H +++ + L G + P +A + EA++ C + T
Sbjct: 92 QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144
Query: 263 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
G S P+++ + V+G E V C SR+ +L+L PL G
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187
Query: 322 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPH 369
+ ++ + +L P+S+G+VGG P YI+G +E +YLDPH
Sbjct: 188 SRCMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPH 237
>gi|146161894|ref|XP_001008187.2| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|146146576|gb|EAR87942.2| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 516
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 80/301 (26%), Positives = 124/301 (41%), Gaps = 70/301 (23%)
Query: 142 QDFSSRILISYRKGFDPIGD------------SKITSDVGWGCMLRSSQMLVAQALLFHR 189
++F + I I+YRK F + + S+ SD GWGCM+R QM A+ L H
Sbjct: 71 ENFYNIIWITYRKNFPALLNMIDKANLKNQKMSEYISDTGWGCMVRVGQMAFAEGLRRHL 130
Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSWVGPY 244
+ +K + K + V I D + +P+SI + + A + L G W P
Sbjct: 131 VEN--KKLVVKKKEDLRVIIEGFLDDDQKCIDFAPYSIQKISKIALSDFNLLPGEWYTPI 188
Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGD-------EDGERGGAPVVCID 292
+C L ++A G + L +A++ +V D D +RG +C +
Sbjct: 189 RICYILGLLHNERKAIKG--TEDLKVAVFSSSRPIVFQDFLERMCKVDPQRGKHAQICPN 246
Query: 293 -------------DASRHCSVFSKGQ---------ADWTPILLLV-PL------------ 317
D H + + Q ++ TP L LV P+
Sbjct: 247 QCRIIKQDQKSKVDHDHHKDIKLEKQNSNSEILVVSEETPKLRLVCPIHHELQYSMIVYI 306
Query: 318 --VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
++GL+ P Y+ + F SLG++GGKP + Y VG E+ IYLDPH VQ
Sbjct: 307 VCLIGLDTPQPEYLELAKKMMDFKYSLGLIGGKPKKALYFVGRIEDEFIYLDPHYVQEFS 366
Query: 376 N 376
N
Sbjct: 367 N 367
>gi|323331874|gb|EGA73286.1| Atg4p [Saccharomyces cerevisiae AWRI796]
Length = 347
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 91/204 (44%), Gaps = 22/204 (10%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
M+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + AG
Sbjct: 1 MIRTGQSLLGNALQILHLGRDFRVNGNESLERES-KFVNWFNDTPEAPFSLHNFVSAGTE 59
Query: 233 YG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
G W GP A RS ++L G + I VS + E V
Sbjct: 60 LSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKVFAE 113
Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+P +
Sbjct: 114 NPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGRPSS 159
Query: 352 STYIVGVQEESAIYLDPHDVQPVI 375
S Y G Q ++ DPH QP +
Sbjct: 160 SLYFFGYQGNEFLHFDPHIPQPAV 183
>gi|367008068|ref|XP_003688763.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
gi|357527073|emb|CCE66329.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
Length = 356
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 99/231 (42%), Gaps = 36/231 (15%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 224
TSD+GWGCM+R+ Q L+A AL G P EI+ LF D +PFSIH
Sbjct: 85 TSDIGWGCMIRTGQTLLANALQRTNKGTPCS------------EIIELFVDETKNPFSIH 132
Query: 225 NLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
N + GK L G W P + E L C + + SGD +
Sbjct: 133 NFITVGKDLNLVKVGEWFSPSITIQIIEKLIENNNDHGIKKC-----IVSISSGDIYEQ- 186
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN-PRYIPTLRLTFTFPQSL 342
+ +DD+ + +K Q ILLL + LG+ +N +Y ++ +
Sbjct: 187 --DVLDELDDSEPPAN--TKQQH----ILLLFGIKLGINTINIEKYGQDIKDITNNKYTC 238
Query: 343 GIVGGKPGASTYIVGVQE--ESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
GI GG+P +S + G + +Y DPH N D+ D STYHS
Sbjct: 239 GISGGQPKSSLFFFGYNNTHDRILYFDPHKPN---NFTTDN---DYSTYHS 283
>gi|145500634|ref|XP_001436300.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403439|emb|CAK68903.1| unnamed protein product [Paramecium tetraurelia]
Length = 406
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 75/299 (25%), Positives = 119/299 (39%), Gaps = 60/299 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
I++LG H+I D+ + + + Q I I+YR+ + P+ S SD GWGC
Sbjct: 38 IYILG--HRIDIDQF----EIEDRINKIKQLVQETIWITYRRNYPPLYQSNYISDTGWGC 91
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS------------- 219
MLR QM +AQ L H ++ D +Y I+ F D+++
Sbjct: 92 MLRVGQMAMAQMLKKHLKNHGDKR------DEDYDNIILAFADNDSQENKEFIEFQNSKD 145
Query: 220 ---------PFSIHNL-LQAGKAYGLAAGSWVGP-YAM-----------CRSWEALARCQ 257
PFSI + A K + L G W P Y + R+ E L
Sbjct: 146 KQKAHNFICPFSIQKIAYLAKKEFNLDPGEWYRPNYILFLLELLHNTIPIRASENLKLSV 205
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 317
++ L L ++ + D + +++ + K + + V
Sbjct: 206 FNDSCLFLDQLMNRMFEAKFETDKD--------LEEQLEKTQLIGKN-----SLAIFVLT 252
Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
+GL++ N +Y+ L P GIVGG P + YI+G + +YLDPH VQ N
Sbjct: 253 RIGLDEPNQKYLKILDEIMELPYFQGIVGGTPKRAFYILGKINDHYLYLDPHYVQEAEN 311
>gi|148707987|gb|EDL39934.1| autophagy-related 4B (yeast), isoform CRA_c [Mus musculus]
Length = 128
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 62/116 (53%), Gaps = 15/116 (12%)
Query: 113 IWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 170
+W+LG + I +DE L D A SR+ +YR+ F IG + TSD GW
Sbjct: 23 VWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTSDTGW 69
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
GCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 GCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 125
>gi|297601024|ref|NP_001050279.2| Os03g0391000 [Oryza sativa Japonica Group]
gi|255674556|dbj|BAF12193.2| Os03g0391000, partial [Oryza sativa Japonica Group]
Length = 81
Score = 82.0 bits (201), Expect = 5e-13, Method: Composition-based stats.
Identities = 35/48 (72%), Positives = 41/48 (85%)
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
RYIP L+ T TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ V
Sbjct: 10 RYIPLLKETLTFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLV 57
>gi|322701885|gb|EFY93633.1| cysteine protease atg4 [Metarhizium acridum CQMa 102]
Length = 255
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 67/140 (47%), Gaps = 27/140 (19%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVG 169
G A F DF+SR ++YR F DP + S TSD G
Sbjct: 116 GTGWPAAFLDDFASRFWMTYRSNFELIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSG 175
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A AL LGR WR+ + DRE +L LF D +P+S+HN ++
Sbjct: 176 WGCMIRSGQSLLANALAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSVHNFVRH 232
Query: 230 GKAY-GLAAGSWVGPYAMCR 248
G+ Y G W GP A R
Sbjct: 233 GEKYCSKYPGEWFGPSATAR 252
>gi|425784144|gb|EKV21938.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
digitatum Pd1]
Length = 208
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 67/145 (46%), Gaps = 30/145 (20%)
Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------I 164
L D A N F DF SRI I+YR F PI +K
Sbjct: 59 LNDTAWPNA---FVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGF 115
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 224
TSD GWGCM+RS Q L+A A LGR WR+ + + E +++ +F D +PFSIH
Sbjct: 116 TSDTGWGCMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIH 172
Query: 225 NLLQAG-KAYGLAAGSWVGPYAMCR 248
+ G ++ G G W GP A +
Sbjct: 173 KFVNRGAESCGKYPGEWFGPSATAK 197
>gi|123497568|ref|XP_001327207.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121910133|gb|EAY14984.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 296
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 60/228 (26%), Positives = 99/228 (43%), Gaps = 49/228 (21%)
Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY--- 206
+YR F I ITSD GWGC RS+Q L+A L + P D EY
Sbjct: 30 FTYRCNFQAIQPGNITSDSGWGCCYRSAQGLIASYFLNY-----------APVDAEYFFT 78
Query: 207 ----VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
+ + LF D PFSI NL+ + +G+ G+W P + + E++ +
Sbjct: 79 VFNEIPMFSLFEDRVEMPFSIQNLVYRSELFGVKPGTWAKPSQLAATIESIFK------- 131
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
L +++ ++S D + ++ D + +LG++
Sbjct: 132 ----DLKLSV-LISKDSN-------IIPEDVKTMRAPFLLLIPI-----------LLGMK 168
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE-ESAIYLDPH 369
V ++IP ++ TF P+ LG V G S ++VG+ E ++ +Y DPH
Sbjct: 169 DVEQKFIPFIKYTFQRPEFLGAVSGSSDFSYFLVGLSEDQNVVYFDPH 216
>gi|343472883|emb|CCD15086.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 327
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/225 (31%), Positives = 105/225 (46%), Gaps = 36/225 (16%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L +YRK F+P+ S IT+D GWGC+ R+SQML+A AL R+ + F +Y
Sbjct: 45 LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMTLDFSFQYFC 95
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
+ D +PFS+H ++++ G L W P C EA++ C R+ G
Sbjct: 96 DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRSAIHRGAL 148
Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 325
+ + V G A + + +RH G A L+LVP+ G ++
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPH 369
+ +L P +G+VGG PG YIVG +E +YLDPH
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIVGTGGQELLLYLDPH 237
>gi|225554849|gb|EEH03143.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 425
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/172 (31%), Positives = 75/172 (43%), Gaps = 27/172 (15%)
Query: 101 PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF---- 156
P+R+ S++ LL LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLIP 144
Query: 157 ---DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
DP + T+D GWGCM+RS Q L+A AL LGR WR+
Sbjct: 145 KSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRRG 204
Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCR 248
+ +E ++L LF D +PFSIH ++ G A G G W GP A R
Sbjct: 205 TKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATAR 253
>gi|401427503|ref|XP_003878235.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322494482|emb|CBZ29784.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 388
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/251 (25%), Positives = 109/251 (43%), Gaps = 34/251 (13%)
Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
+G EF + + ++L SYR F P+ + + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAAAKKLLYFSYRNCFPPLPN-RSTTDTRWGCLVRTTQMLVGSCLLRYHCKGA 112
Query: 194 WRKPLQKPFDREYVE----ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
+ P +R+ E I LF D ++P IH + + S + P
Sbjct: 113 YVLP-----ERDNAELKERISRLFMDVPSAPLGIHKVEDEAHKNSVKYASMLSP------ 161
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHCSVFSKGQADW 308
E G+ + +A + GD AP C ++ + S ++
Sbjct: 162 ---------TEAGMAIAAALIAFHAQGGD-------APFTFCCENRNIDESAVMAKLSEG 205
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
++L++P+VLG+ ++ +Y L GI GG AS Y+ G Q + ++DP
Sbjct: 206 QHVILIIPVVLGIAPMSGQYERMLLKILDMKACCGIAGGFKQASLYMFGHQGRNVFFMDP 265
Query: 369 HDVQPVINIGK 379
H VQ G+
Sbjct: 266 HYVQRAYTSGR 276
>gi|281210274|gb|EFA84441.1| autophagy protein 4 [Polysphondylium pallidum PN500]
Length = 734
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 60/113 (53%), Gaps = 10/113 (8%)
Query: 281 GERGGA---PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
GE G+ P+ C D S C W I++LVP+ LGL+K+N Y ++
Sbjct: 515 GENSGSFKDPLTCSDFFSSSCI-----PQRWKSIIILVPIKLGLDKLNEVYFREIKSMLE 569
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
PQS+G++GGKP S Y VG Q+E IYLDPH V + +D+ S +H
Sbjct: 570 LPQSIGLIGGKPKQSFYFVGYQDEHIIYLDPHFVHDT--VSPNDINFSDSYHH 620
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 31/47 (65%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
N + F DF + + SYRK F PI ++ IT+D+GWGCM+R+ QM
Sbjct: 269 ANQEIDRFIADFKNILWFSYRKDFAPIENTNITTDIGWGCMVRTGQM 315
>gi|340054025|emb|CCC48319.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma vivax Y486]
Length = 326
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 102/246 (41%), Gaps = 40/246 (16%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
L D N L E +S L++YR F+P+ S +TSD GWGC+ R+SQML+A L
Sbjct: 28 TLYDEDELNNLLE-----TSFYLLTYRMNFEPLPCSTLTSDRGWGCLARASQMLLAHVLR 82
Query: 187 FHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPY 244
H +++ D +PFS+H + +A +G A W P
Sbjct: 83 RHAASEC------------HLKFFCDMNDEHLAPFSLHCMTRAVIKHGTEFRADYW-APS 129
Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 304
C EA+ C + G + +++ V S ER +
Sbjct: 130 QGC---EAIRSCVESAVRQGLLTQKLSVVVSSSGTIPER---------------EIHEHL 171
Query: 305 QADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
+ D + +L+LVP+ G ++ L P +G+VGG P YIVG
Sbjct: 172 RGDGS-VLVLVPVRCGTSRRMTQTMFFALEHLLHIPSCMGVVGGVPNRGYYIVGTSGHRL 230
Query: 364 IYLDPH 369
+YLDPH
Sbjct: 231 LYLDPH 236
>gi|238594668|ref|XP_002393548.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
gi|215461192|gb|EEB94478.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
Length = 142
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 65/144 (45%), Gaps = 40/144 (27%)
Query: 126 EALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI--------------------- 164
+ +G +G N EF DF+S++ ++YR F PI D+ +
Sbjct: 3 DMVGTTSGANWPPEFTADFTSKVWLTYRSHFTPIRDTNLADLPLPSIFWKKWGWGLPGLG 62
Query: 165 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 219
TSD GWGCMLR+ Q L+A AL+F LGR WR+P P E S
Sbjct: 63 GERGWTSDSGWGCMLRTGQSLLANALVFMWLGREWRRP-PAPMPTE-------------S 108
Query: 220 PFSIHNLLQAGKAYGLAAGSWVGP 243
S+H + AGK G G W GP
Sbjct: 109 YASVHRMALAGKELGKDVGQWFGP 132
>gi|255711728|ref|XP_002552147.1| KLTH0B08272p [Lachancea thermotolerans]
gi|238933525|emb|CAR21709.1| KLTH0B08272p [Lachancea thermotolerans CBS 6340]
Length = 483
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 71/210 (33%), Positives = 98/210 (46%), Gaps = 33/210 (15%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
SD+GWGCM+R+ Q L+ AL RL P P +K +++ F D ++PFS+
Sbjct: 144 FCSDIGWGCMIRTGQALLGNALA--RLRSP---PEEK-------QLIGWFEDRSSAPFSL 191
Query: 224 HNLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
HN ++ G A G W GP A RS ++L + GL I SGD E
Sbjct: 192 HNFVREGNALSRKPPGEWFGPSATSRSIQSLVHA-FPQCGLNH----CIISTDSGDVYEE 246
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
G P++ + ILLL+ + LGL VN RY P ++ S+
Sbjct: 247 DVG-PIL--------------EREPQATILLLLGVKLGLNNVNSRYWPDVKHILGSSFSV 291
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
GI GG+P +S Y G Q + YLDPH Q
Sbjct: 292 GIAGGRPSSSLYFFGYQGDYLFYLDPHTSQ 321
>gi|342181415|emb|CCC90894.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma congolense
IL3000]
Length = 327
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 69/225 (30%), Positives = 104/225 (46%), Gaps = 36/225 (16%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L +YRK F+P+ S IT+D GWGC+ R+SQML+A AL R+ + F +Y
Sbjct: 45 LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMALDFSFQYFC 95
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
+ D +PFS+H ++++ G L W P C EA++ C R G
Sbjct: 96 DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRRAIHRGAL 148
Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 325
+ + V G A + + +RH G A L+LVP+ G ++
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPH 369
+ +L P +G+VGG PG YI+G +E +YLDPH
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIIGTGGQELLLYLDPH 237
>gi|384496645|gb|EIE87136.1| hypothetical protein RO3G_11847 [Rhizopus delemar RA 99-880]
Length = 224
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 5/101 (4%)
Query: 126 EALGDAAGNNGL-----AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
E + + NN + +F DF+SR+ ++YR + PI S +D+GWGCMLRS Q L
Sbjct: 120 EEISEEEDNNNMYLRWPLDFYDDFTSRLWMTYRHNYPPIRPSNHKTDIGWGCMLRSGQSL 179
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
+A L+ H LGR WR+ Q R+ + I L + PF
Sbjct: 180 LANTLIIHFLGRDWRRQTQNQTTRKELCIGFLMSYHQEHPF 220
>gi|291238482|ref|XP_002739158.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
kowalevskii]
Length = 338
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 62/100 (62%), Gaps = 6/100 (6%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDE--ALGDAAGNNGLA---EFNQDFSSRILISYRKGF 156
S+T S T IWLLG C+ D+ +A ++ L F +DF+SR+ ++YR+ F
Sbjct: 42 SQTNFSYHTP-IWLLGECYHHRPDDPNETEQSAEDDCLTPMERFKRDFTSRLWLTYRREF 100
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
+ + +T+D GWGCMLRS QM++AQ+ L H LGR +++
Sbjct: 101 QQLAGTSLTTDCGWGCMLRSGQMMLAQSFLTHFLGRVYKQ 140
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 50/81 (61%)
Query: 302 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
S+ W +++L+P+ LG E++NP YI ++ FT +GI+GGKP S Y +G QE+
Sbjct: 156 SRSSQLWCSVIILIPVRLGGEELNPVYISCIKSLFTLKHCIGIIGGKPKHSLYFIGFQED 215
Query: 362 SAIYLDPHDVQPVINIGKDDL 382
I+LDPH Q V+++ D
Sbjct: 216 KLIHLDPHLCQDVVDMRSRDF 236
>gi|330840629|ref|XP_003292315.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
gi|325077457|gb|EGC31168.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
Length = 465
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 47/69 (68%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++++PL LG++++N YI L+ + PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 217 WKSLIIMIPLKLGVDRINTSYIRKLKSILSIPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 276
Query: 368 PHDVQPVIN 376
PH VQ ++
Sbjct: 277 PHFVQDTVD 285
>gi|302833489|ref|XP_002948308.1| autophagy protein [Volvox carteri f. nagariensis]
gi|300266528|gb|EFJ50715.1| autophagy protein [Volvox carteri f. nagariensis]
Length = 391
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 31/59 (52%), Positives = 45/59 (76%)
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
G++K+NP Y+P L+ T+PQS+GIVGG+P AS Y+ GVQ+ S ++LDPH+ QP + G
Sbjct: 216 GMDKINPVYLPQLQRILTWPQSVGIVGGRPSASLYLCGVQDSSFLFLDPHEAQPTVRWG 274
>gi|149422017|ref|XP_001518728.1| PREDICTED: cysteine protease ATG4D-like [Ornithorhynchus anatinus]
Length = 286
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 51/82 (62%)
Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
+A+W I++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +
Sbjct: 109 EAEWKSIIILVPVRLGGETLNPAYMPCIKELLRMEPCLGIIGGKPKHSLYFIGYQDDFLL 168
Query: 365 YLDPHDVQPVINIGKDDLEADT 386
YLDPH QP ++ KD ++
Sbjct: 169 YLDPHYCQPCVDTMKDSFPLES 190
>gi|119604523|gb|EAW84117.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 228
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 12/96 (12%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRL--GRPWR 195
+TSD GWGCMLRS QM++AQ LL H L G+PWR
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGKPWR 169
>gi|328868883|gb|EGG17261.1| autophagy protein 4 [Dictyostelium fasciculatum]
Length = 616
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 47/73 (64%)
Query: 304 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
Q++W +++LVP+ LGL+K+N Y ++ P S+G++GGKP S Y VG Q+E
Sbjct: 426 NQSNWKSLIILVPVKLGLDKLNEIYFSGIKAMLQMPSSIGLIGGKPKQSFYFVGFQDEHI 485
Query: 364 IYLDPHDVQPVIN 376
IYLDPH V I+
Sbjct: 486 IYLDPHFVHDTIH 498
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 61/100 (61%), Gaps = 7/100 (7%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR---P 193
+ F +DF S + SYRK F I ++ IT+D+GWGCMLR+ QM++A+ALL H P
Sbjct: 194 VERFLEDFKSILWFSYRKDFPSIENTSITTDIGWGCMLRTGQMILARALLKHFYNNENIP 253
Query: 194 WRKPLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGK 231
+ + ++ + +Y +I+ F D S+ + +SIH ++ K
Sbjct: 254 YGEKIKT--NSKYKKIMSWFCDYPSKENFYSIHQIVHKNK 291
>gi|213514936|ref|NP_001135074.1| Cysteine protease ATG4A [Salmo salar]
gi|209738482|gb|ACI70110.1| Cysteine protease ATG4A [Salmo salar]
Length = 102
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 49/83 (59%), Gaps = 11/83 (13%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG C+ + ++ E D SR+ +YRK F PIG + +SD GWGC
Sbjct: 29 VWVLGECYNVKTEKT-----------ELLSDVHSRLWFTYRKKFSPIGGTGPSSDTGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWR 195
MLR QM++AQAL+ +LGR WR
Sbjct: 78 MLRCGQMILAQALVCSQLGRAWR 100
>gi|119604525|gb|EAW84119.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_d
[Homo sapiens]
Length = 360
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 51/81 (62%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 181 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 240
Query: 366 LDPHDVQPVINIGKDDLEADT 386
LDPH QP +++ + D ++
Sbjct: 241 LDPHYCQPTVDVSQADFPLES 261
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 86 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 139
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR 195
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 140 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 167
>gi|426387285|ref|XP_004060104.1| PREDICTED: cysteine protease ATG4D [Gorilla gorilla gorilla]
Length = 362
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 51/81 (62%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 183 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 242
Query: 366 LDPHDVQPVINIGKDDLEADT 386
LDPH QP +++ + D ++
Sbjct: 243 LDPHYCQPTVDVSQADFPLES 263
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 88 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 141
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR 195
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 142 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 169
>gi|151556001|gb|AAI49850.1| ATG4D protein [Bos taurus]
Length = 359
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 51/81 (62%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 180 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 239
Query: 366 LDPHDVQPVINIGKDDLEADT 386
LDPH QP +++ + D ++
Sbjct: 240 LDPHYCQPTVDVSQADFPLES 260
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 85 TSFSKISSVHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAGGSLTSD 138
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR 195
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 139 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 166
>gi|159128081|gb|EDP53196.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
fumigatus A1163]
Length = 226
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 53/93 (56%), Gaps = 3/93 (3%)
Query: 302 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
+ G+ + P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ
Sbjct: 18 NDGRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGS 77
Query: 362 SAIYLDPHDVQPVI---NIGKDDLEADTSTYHS 391
YLDPH +P + NI + + TYH+
Sbjct: 78 HLFYLDPHQTRPALPQRNIDDPYTDEEIETYHT 110
>gi|260823874|ref|XP_002606893.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
gi|229292238|gb|EEN62903.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
Length = 384
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 50/80 (62%)
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
+W +++L+P+ LG E +NP Y P ++ FT LG++GG+P S Y VG QE+ I+L
Sbjct: 203 NWCSVIILIPVRLGGESLNPIYEPCIKGLFTMDHCLGVIGGRPKHSLYFVGFQEDKLIHL 262
Query: 367 DPHDVQPVINIGKDDLEADT 386
DPH Q V+++ D ++
Sbjct: 263 DPHFCQEVVDMTPRDFPLES 282
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 7/91 (7%)
Query: 113 IWLLGVCHKIAQDE------ALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKIT 165
IWL GVC+ +E L D+ E F +DF+S++ ++YR+ F + S T
Sbjct: 88 IWLQGVCYHRRNEELTKELEPLTDSDRRLYTMELFKRDFASKVWLTYRREFPQLAGSMFT 147
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
+D GWGCMLRS QML+A L+ H LGR +++
Sbjct: 148 TDCGWGCMLRSGQMLLAGGLVMHFLGRVYKQ 178
>gi|47213810|emb|CAF92583.1| unnamed protein product [Tetraodon nigroviridis]
Length = 265
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 49/83 (59%)
Query: 304 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
W +++LVP+ LG E +NP YI ++ +GI+GGKP S Y +G Q+E
Sbjct: 151 AHQSWQSVIILVPVRLGGESLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFIGFQDEQL 210
Query: 364 IYLDPHDVQPVINIGKDDLEADT 386
+YLDPH QPV+++ + + ++
Sbjct: 211 LYLDPHYCQPVVDVSQVNFSLES 233
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 65/127 (51%), Gaps = 12/127 (9%)
Query: 70 AVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALG 129
A +N GWT VK T + + +LG S S T L +C ++ L
Sbjct: 19 AWNNVKYGWT--VKSKTTFNKLSPV--TILGHSYLLNSEGT----LFFICLILSSFCCLN 70
Query: 130 DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR 189
+ + F F SRI ++YRK F P+ S +T+D GWGCMLRS QML+AQ LL H
Sbjct: 71 ----LDEVERFRLAFVSRIWLTYRKDFPPLEGSTLTTDCGWGCMLRSGQMLLAQGLLVHL 126
Query: 190 LGRPWRK 196
+ R +++
Sbjct: 127 MHRVYKE 133
>gi|398021304|ref|XP_003863815.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
gi|322502048|emb|CBZ37132.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
Length = 388
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 104/245 (42%), Gaps = 36/245 (14%)
Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
+G EF + + ++L SYR F P+ + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGA 112
Query: 194 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
+ P + E E I LF D ++P IH + S + P
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CID----DASRHCSVFSKGQAD 307
E G+ + +A + GD P C + D + S+GQ
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGD-------VPFTFCCESRNIDEPAVMAKLSEGQH- 207
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++D
Sbjct: 208 ---VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMD 264
Query: 368 PHDVQ 372
PH +Q
Sbjct: 265 PHYIQ 269
>gi|71000771|ref|XP_755067.1| autophagy cysteine endopeptidase Atg4 [Aspergillus fumigatus Af293]
gi|66852704|gb|EAL93029.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
fumigatus Af293]
Length = 226
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 53/93 (56%), Gaps = 3/93 (3%)
Query: 302 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
+ G+ + P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ
Sbjct: 18 NDGRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGS 77
Query: 362 SAIYLDPHDVQPVI---NIGKDDLEADTSTYHS 391
YLDPH +P + NI + + TYH+
Sbjct: 78 HLFYLDPHQTRPALPQRNIDDPYTDEEIETYHT 110
>gi|146097214|ref|XP_001468076.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
gi|134072442|emb|CAM71152.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
Length = 388
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 104/245 (42%), Gaps = 36/245 (14%)
Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
+G EF + + ++L SYR F P+ + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGT 112
Query: 194 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
+ P + E E I LF D ++P IH + S + P
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CID----DASRHCSVFSKGQAD 307
E G+ + +A + GD P C + D + S+GQ
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGD-------VPFTFCCESRNIDEPAVMAKLSEGQH- 207
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++D
Sbjct: 208 ---VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMD 264
Query: 368 PHDVQ 372
PH +Q
Sbjct: 265 PHYIQ 269
>gi|324519641|gb|ADY47439.1| Cysteine protease ATG4C, partial [Ascaris suum]
Length = 282
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/246 (28%), Positives = 112/246 (45%), Gaps = 25/246 (10%)
Query: 20 DTPNRSLASVGS-ELGSSESKSSKGSLLSSLFNSAFSVFETYS---ESSASEKKAVHNKS 75
D +R ++G E+ + SK S G+LLSS N+ S S S S
Sbjct: 12 DGSDREQLTIGDCEVCDTTSKYSVGALLSSAANATSSKISRASINLRSLLSGSATKKTND 71
Query: 76 NGWTAAVKRLVTAGSMRRIHERV---LGPSRTGISSST----SDIWLLGVCHKIAQDEAL 128
+ + + + + S+R+ + V R IS S + +WLLG + ++ +
Sbjct: 72 DDVSTSESDIAISSSVRQKFDNVWFSFVYGRWRISRSKYKKKAPLWLLGEFYFTSRPDED 131
Query: 129 GDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
+ F D+ SRI ++YR P+ S T+D GWGC LR+ QM++AQAL+
Sbjct: 132 DEVV----FRAFAIDYYSRIWLTYRTELSPLPGSSKTTDCGWGCTLRTCQMMLAQALVVL 187
Query: 189 RLGRPWRKPLQKPFDRE-----YVEILHLFGDSETSPFSIHNLLQAGKAYGL--AAGSWV 241
LGR WR + +R + +I+ LFGD + ++ L++ K A G+W
Sbjct: 188 HLGREWRFWGDEEANRYRCGFGHYDIVSLFGDHLDADLGLYRLMKIAKERNEHDAVGNW- 246
Query: 242 GPYAMC 247
Y+ C
Sbjct: 247 --YSAC 250
>gi|302915349|ref|XP_003051485.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256732424|gb|EEU45772.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 355
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/228 (30%), Positives = 103/228 (45%), Gaps = 36/228 (15%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRSS- 177
+A DE D N +F DF SRI ++YR F+ I S + TS + L+S
Sbjct: 99 LAYDEPTKD---NGWPPQFMADFESRIWMTYRSEFEAIPRSTNPQATSSLSLSMRLKSQL 155
Query: 178 ---QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
+ +++ RLGR WR+ Q P E EI+ LF D +P+S+H+ ++ G A
Sbjct: 156 GDQSPFSSDSMI--RLGRDWRR-GQSP--HEEREIIKLFADHPNAPYSLHSFVRHGASAC 210
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G G W GP A R +ALA + + +Y G P V D+
Sbjct: 211 GKYPGEWFGPSATARCIQALANSHESS---------LRVYST--------GDGPDVYEDE 253
Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
+ +G+A + P L+LV LG++K+ P Y L + PQS
Sbjct: 254 FMKIAK--PEGEA-FHPTLILVGTRLGIDKITPVYWEALIASLQMPQS 298
>gi|167381603|ref|XP_001735783.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165902089|gb|EDR28003.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 359
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/334 (23%), Positives = 134/334 (40%), Gaps = 67/334 (20%)
Query: 80 AAVKRLVTAGSMRRI--------HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGD 130
A ++LV GS + HE + P G S ++LGV K Q D+ L +
Sbjct: 2 AYFQKLVQHGSYNILSKFYNQIGHEDIQKPIFIGGCS----FYILGVEFKTKQMDKQLAE 57
Query: 131 AAGNNGL----AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL- 185
L A S+ ++YR G++ + +S +T+DVGWGC +R+ QM++A A+
Sbjct: 58 QPPEVYLQYSSAPAFFRISNLFWMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAME 117
Query: 186 ------LFHRLGRPWRKPLQKPFDREYVEILHLFGDS--ETSPFSIHNLLQAGKAY--GL 235
+ P+ P E + +L F DS T+P SIH++ ++
Sbjct: 118 TIVYSGALNNTQTPYI-----PTKEEIMNVLVPFIDSPNSTTPLSIHHVYESRFVVEKNK 172
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
+ +++ P + +++ L + P+ C+ ++
Sbjct: 173 SGVNYLAPSVVAKAYSGLVNSWKL--------------------------CPIRCVMCSN 206
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+ + P L+ +P+VL N L+ + GIVGG + ++
Sbjct: 207 VSIPTHELSKLPFKPTLVFLPIVL-----NHLIHSKLQQIYKSKLFAGIVGGMGDRAIFV 261
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
G +YLDPH VQP K E DT +Y
Sbjct: 262 FGFHALQFLYLDPHIVQPSF---KSFTEIDTKSY 292
>gi|336259147|ref|XP_003344378.1| hypothetical protein SMAC_08321 [Sordaria macrospora k-hell]
Length = 429
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 58/114 (50%), Gaps = 26/114 (22%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 176
F DF SRI ++YR F DP + +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
Q L+A A+L RLGR WR+ + D E +I+ LF D +PFS+HN ++ G
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYG 290
>gi|71425372|ref|XP_813094.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70877946|gb|EAN91243.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
Length = 328
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 103/244 (42%), Gaps = 43/244 (17%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETS-PFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 248
WR + ++ D+E S PFS+H +++A KA W
Sbjct: 82 --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 306
GC+++ + + +R P + + S+ C + +
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170
Query: 307 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
++ +L+L P+ G ++ +L +G+VGG P S YI+G + +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLY 230
Query: 366 LDPH 369
LDPH
Sbjct: 231 LDPH 234
>gi|238595999|ref|XP_002393933.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
gi|215462138|gb|EEB94863.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
Length = 158
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 46/68 (67%)
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
LGL+ VNP Y T+++ +TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + +
Sbjct: 1 LGLDGVNPIYYDTIKILYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPLR 60
Query: 379 KDDLEADT 386
LE ++
Sbjct: 61 PPTLEPES 68
>gi|407852207|gb|EKG05835.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
Clan CA, family C54, putative [Trypanosoma cruzi]
Length = 328
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 103/244 (42%), Gaps = 43/244 (17%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETS-PFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 248
WR + ++ D+E S PFS+H +++A KA W
Sbjct: 82 --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 306
GC+++ + + +R P + + S+ C + +
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170
Query: 307 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
++ +L+L P+ G ++ +L +G+VGG P S YI+G + +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMKFFSLEHLLHSSTCIGVVGGVPQRSYYILGTSGQRLLY 230
Query: 366 LDPH 369
LDPH
Sbjct: 231 LDPH 234
>gi|407417199|gb|EKF38000.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
Length = 328
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 103/244 (42%), Gaps = 43/244 (17%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDEELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETS-PFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 248
WR + +I D+E S PFS+H +++A KA W
Sbjct: 82 --WRYSANDCRLDHFRDI-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 306
GC+++ + + +R P + + S+ C + +
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRRLIPPIRVVVCSQGCLLAREICSNL 170
Query: 307 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
++ +L+L P+ G ++ +L +G+VGG P S YI+G + +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLY 230
Query: 366 LDPH 369
LDPH
Sbjct: 231 LDPH 234
>gi|183230042|ref|XP_653798.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169803042|gb|EAL48412.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449708555|gb|EMD47997.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 359
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/294 (23%), Positives = 123/294 (41%), Gaps = 57/294 (19%)
Query: 113 IWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRILISYRKGFDPIGDSKITS 166
++LGV K Q D+ L + L A F + S+ ++YR G++ + +S +T+
Sbjct: 39 FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAATFFR-ISNLFWMTYRSGYEKLPNSSLTT 97
Query: 167 DVGWGCMLRSSQMLVAQAL-------LFHRLGRPWRKPLQKPFDREYVEILHLFGDS--E 217
DVGWGC +R+ QM++A A+ + P+ P +E + +L F DS
Sbjct: 98 DVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYI-----PTKQEVMNVLIPFIDSPNS 152
Query: 218 TSPFSIHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
T+P SIH++ ++ + +++ P + +++ L +
Sbjct: 153 TTPLSIHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLVNSWKL---------------- 196
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
P+ C+ ++ + + P L+ +P+VL N L+
Sbjct: 197 ----------CPIRCVMCSNVSIPTHELSKLPFKPTLVFLPIVL-----NHLIHSKLQQI 241
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
+ GIVGG + ++ G +YLDPH VQP K E DT +Y
Sbjct: 242 YKSKLFAGIVGGMGDRAIFVFGFHALQFLYLDPHIVQPSF---KSFTEIDTKSY 292
>gi|71407017|ref|XP_806004.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70869620|gb|EAN84153.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
gi|111154177|gb|ABH07410.1| autophagin-1 [Trypanosoma cruzi]
Length = 328
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/246 (26%), Positives = 104/246 (42%), Gaps = 47/246 (19%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSET---SPFSIHNLLQA--GKAYGLAAGSWVGPYAM 246
WR + + H F D +T +PFS+H +++A KA W
Sbjct: 82 --WR------YSANDCRLDH-FRDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWT----- 127
Query: 247 CRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--G 304
GC+++ + + +R P + + S+ C + +
Sbjct: 128 --------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICS 168
Query: 305 QADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
++ +L+L P+ G ++ +L +G+VGG P S YI+G +
Sbjct: 169 NLEFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRL 228
Query: 364 IYLDPH 369
+YLDPH
Sbjct: 229 LYLDPH 234
>gi|440300801|gb|ELP93248.1| hypothetical protein EIN_056230 [Entamoeba invadens IP1]
Length = 321
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 107/284 (37%), Gaps = 81/284 (28%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
I+ L H + + DAA I I+YR+ + +G + +TSD GWGC
Sbjct: 38 IFGLSYTHDTPSELSFADAA---------HRIHDLITITYRQKYATLGHTYLTSDAGWGC 88
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH---------LFGDSETSPFSI 223
+RS QML+ +++ + L K F EY H L D E+S SI
Sbjct: 89 AIRSVQMLLVNSIVVY---------LDKSFHPEYTSHDHIAIKNNAKQLVFDKESSVLSI 139
Query: 224 HNL-LQAGKAYGLAAGSWVGPYAMCRS--------WEALARCQRAETGLGCQSLPMAIYV 274
HN+ +Q G+ P + C + WE +R L C
Sbjct: 140 HNIYIQDAIIKHNPTGTNFLPPSTCATAVADLYNFWE-----KRTFDVLMCTEY------ 188
Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
I + ++ P LL +P ++ + N ++
Sbjct: 189 ----------------IPEVTQ-------------PTLLFIPRIVTKSERN-----FIQT 214
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
T PQS G V G A+ Y GVQE+ +LDPH VQ +G
Sbjct: 215 TSFLPQSRGFVAGIGDAAIYCFGVQEKRVFFLDPHFVQDASEVG 258
>gi|358339268|dbj|GAA47364.1| autophagy-related protein 4 [Clonorchis sinensis]
Length = 700
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 52/89 (58%), Gaps = 5/89 (5%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAI 364
A W P+LL +PL LGL + NP Y ++ P S+GI+GG+P + +IVG +E +
Sbjct: 259 ATWRPLLLFIPLRLGLHQPNPCYFNAIKAILQIPHSIGIMGGRPSHAVWIVGTAGDEDLL 318
Query: 365 YLDPHDVQPVINIGKDDLEA-DTSTYHSE 392
LDPH QP +DDL A D T+H +
Sbjct: 319 CLDPHTTQPA---SQDDLTAEDDVTHHCD 344
Score = 41.6 bits (96), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 15/85 (17%)
Query: 179 MLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
M++A+A+ LG+ WR P + D Y + +F D ++S +SI N+ G A
Sbjct: 1 MMLAEAITRIHLGKDWRWTPGCQ--DEAYCRLRRMFQDHKSSLYSIQNITMLGMALDKPI 58
Query: 238 GSWVGP------------YAMCRSW 250
GSW GP Y C +W
Sbjct: 59 GSWFGPNTVAQVIKKLCAYDPCTNW 83
>gi|170572866|ref|XP_001892265.1| Peptidase family C54 containing protein [Brugia malayi]
gi|158602497|gb|EDP38912.1| Peptidase family C54 containing protein [Brugia malayi]
Length = 440
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 47/73 (64%)
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
+W P+L+++PL LGL +N Y P ++ F PQ +GI+GG+P + Y G+ + + +YL
Sbjct: 252 EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 311
Query: 367 DPHDVQPVINIGK 379
DPH Q +++ +
Sbjct: 312 DPHFCQNFVDLDE 324
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 57/122 (46%), Gaps = 28/122 (22%)
Query: 128 LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
LG+ + G +A + +S + +YRK F PIG + T+D GWGCMLR QML+A+ L+
Sbjct: 59 LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 118
Query: 187 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 241
LGR W +DR EY IL G SE G G W
Sbjct: 119 VRHLGRNWL------WDRDVMLTEYKRILPNMGVSE----------------GKEIGEWF 156
Query: 242 GP 243
GP
Sbjct: 157 GP 158
>gi|119493442|ref|XP_001263911.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
gi|119412071|gb|EAW22014.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
Length = 179
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 51/87 (58%), Gaps = 3/87 (3%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
+ P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ YLD
Sbjct: 24 FRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHLFYLD 83
Query: 368 PHDVQPVI---NIGKDDLEADTSTYHS 391
PH +P + NI + + + TYH+
Sbjct: 84 PHQTRPALPQRNIDERYTDEEIETYHT 110
>gi|50291183|ref|XP_448024.1| hypothetical protein [Candida glabrata CBS 138]
gi|62899752|sp|Q6FP20.1|ATG4_CANGA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49527335|emb|CAG60975.1| unnamed protein product [Candida glabrata]
Length = 483
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 97/212 (45%), Gaps = 31/212 (14%)
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIH 224
+DVGWGCM+R+ Q L+ AL R+ + +P D + EI LF D+ S FS+
Sbjct: 135 TDVGWGCMIRTGQSLLGNAL--QRVKSTVKDQPYIYEMD-DTKEITDLFKDNTKSAFSLQ 191
Query: 225 NLLQAGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
N ++ G+ Y +A G W GP L + C I V SGD E
Sbjct: 192 NFVKCGRIYNKIAPGEWFGPATTATCIRYLIQENPCYGIEAC-----YISVSSGDIFKEN 246
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTP---ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
+G D P IL+L+ + LGL+ V+ RY ++ P
Sbjct: 247 ------------------IQGMIDRYPNGNILILLGIKLGLDSVHERYWGEIKTMLESPF 288
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
S+GI GG+P +S Y G +++ ++ DPH+ Q
Sbjct: 289 SVGIAGGRPSSSLYFFGYFDDTLLFFDPHNSQ 320
>gi|402593880|gb|EJW87807.1| hypothetical protein WUBG_01286, partial [Wuchereria bancrofti]
Length = 216
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 54/98 (55%), Gaps = 14/98 (14%)
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
+W P+L+++PL LGL +N Y P ++ F PQ +GI+GG+P + Y G+ + + +YL
Sbjct: 28 EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 87
Query: 367 DPHDVQPVINIG--------KDDL------EADTSTYH 390
DPH Q +++ +DD E STYH
Sbjct: 88 DPHFCQNFVDLDETTTTRDERDDYVEIKNDEFKDSTYH 125
>gi|157874465|ref|XP_001685715.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
gi|68128787|emb|CAJ08920.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
Length = 388
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 67/244 (27%), Positives = 106/244 (43%), Gaps = 34/244 (13%)
Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
+G EF + + ++L SYR F P+ S T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKVATKKLLYFSYRNCFPPL-PSGSTTDTHWGCLVRTTQMLVGTCLLRYHCKGA 112
Query: 194 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
+ P + + E E I LF D ++P IH + S + P
Sbjct: 113 YVLP--EADNAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHC---SVFSKGQADW 308
E G+ + +A GD P C + SRH +V +K +
Sbjct: 162 ------TEAGMAIAAALIAFRAQGGD-------VPFTFCCE--SRHIDEPAVMAK-LLEG 205
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++DP
Sbjct: 206 QHVVLIIPVVLGIAPMSDQYELVMLKILDVKACCGIAGGFKQASLYMFGHQGRSVFFMDP 265
Query: 369 HDVQ 372
H VQ
Sbjct: 266 HYVQ 269
>gi|66822477|ref|XP_644593.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|66822607|ref|XP_644658.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|74857708|sp|Q557H7.1|ATG4_DICDI RecName: Full=Cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|60472726|gb|EAL70676.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|60472781|gb|EAL70731.1| autophagy protein 4 [Dictyostelium discoideum AX4]
Length = 745
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 46/69 (66%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++++PL LG +K+N YI L+L PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562
Query: 368 PHDVQPVIN 376
PH VQ +N
Sbjct: 563 PHFVQESVN 571
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 197
F D +S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H P
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289
Query: 198 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 232
+KP Y ++L F D S+ + IH ++ +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326
>gi|28395487|gb|AAO39081.1| autophagy protein 4 [Dictyostelium discoideum]
Length = 745
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 46/69 (66%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++++PL LG +K+N YI L+L PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562
Query: 368 PHDVQPVIN 376
PH VQ +N
Sbjct: 563 PHFVQESVN 571
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 197
F D +S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H P
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289
Query: 198 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 232
+KP Y ++L F D S+ + IH ++ +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326
>gi|118378678|ref|XP_001022513.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89304280|gb|EAS02268.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 649
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 70/268 (26%), Positives = 106/268 (39%), Gaps = 29/268 (10%)
Query: 148 ILISYRKGFDPIGD-----SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
I SYR F I D +++D GWGCM+R SQML+A+AL H L + Q
Sbjct: 145 IWFSYRNNFPLIRDVADDNQSVSNDYGWGCMIRCSQMLLAEALKRHYLNDQNIQIEQLSQ 204
Query: 203 DRE---YVEILHLFGD--SETSPFS------------IHNLLQAGKAYGLAAGSWVGPYA 245
D E Y I+ LF D SE+ + + N Y L + A
Sbjct: 205 DDEKHFYSNIIKLFLDCTSESDVLNQPGSYQDIQSKMLLNEQNLNNIYSLFGIQNICQSA 264
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---RHCSVFS 302
+ R ++ + T + + I S + + G ++ D + S
Sbjct: 265 ILRQYQ--QNVKNWYTSIQVSVILQEILEESQSKLNSKLGFHILNFTDQIIFLKELEEAS 322
Query: 303 KGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
+ Q D IL++V L G+ K ++ +G + G YI+G QE+
Sbjct: 323 RKQNDRLNNILVMVHLKFGINKFEMQHKDYFIELLKIKNFVGALSGTETKGMYIIGFQED 382
Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTY 389
I LDPH +Q G+ L+ D TY
Sbjct: 383 RLIVLDPHFIQKSTE-GEQGLDKDYCTY 409
>gi|145507452|ref|XP_001439681.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406876|emb|CAK72284.1| unnamed protein product [Paramecium tetraurelia]
Length = 312
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 100/242 (41%), Gaps = 48/242 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
FNQ + I YR G K SD GWGC++R QM++A AL+ R+
Sbjct: 49 FNQKKDTLIWFCYRANIQFEG--KAISDQGWGCLVRVGQMMLANALM--------RECKI 98
Query: 200 KPFDREYVEILHLFGDSE----TSPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 253
++ I+HLF D++ +PFSI +++ A + G W GP M
Sbjct: 99 LAINKTKAMIIHLFDDNQEYSTIAPFSIQQIIKRASINLNMKIGDWYTGPKIM------- 151
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT---P 310
S ED + + I+ + + Q D + P
Sbjct: 152 ----------------------SVIEDLNKNNMNIKQINLVNFLEQCVLESQIDLSFKKP 189
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
LL++ ++G + + I L+ Q G + GK + +++G Q+ +AI++DPH
Sbjct: 190 HLLIIHAIIGDKSLGQLEIQNLQSHMQISQFAGAIIGKNNKAFFLIGFQKNNAIFMDPHY 249
Query: 371 VQ 372
VQ
Sbjct: 250 VQ 251
>gi|167394648|ref|XP_001741038.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165894548|gb|EDR22516.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 200
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 6/100 (6%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 204
I I+YRK I + T+D GWGCM+RS QM++AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96
Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
+++ I++LFGDS S FSIH L+ G+ G W GP
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP 136
>gi|118378680|ref|XP_001022514.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89304281|gb|EAS02269.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 371
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 65/255 (25%), Positives = 110/255 (43%), Gaps = 37/255 (14%)
Query: 142 QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
++ SS + +SY+K + IT+D GWGC LR+SQM++AQ L H + + +
Sbjct: 52 EELSSLVFLSYKKNMKEFQYLSTTITTDNGWGCSLRTSQMMLAQGLKRHLYEKRVQSFIY 111
Query: 200 KPFDREYVEILHL---FGDS------ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 250
D+ ++ HL F +S + SPF H+LL +A L Y +
Sbjct: 112 N--DKTKLDFQHLIMMFAESNSLENMDQSPFGFHSLL--TQAINLFQVPLKQQYTPVQGI 167
Query: 251 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 310
+AL + Q L ++ +V+ V+ +D + + K
Sbjct: 168 KALKQ------QFKQQKLVKSLKIVT-------SSTGVIFQEDIRQKMKNWEKS------ 208
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
+LL++ LG K+N Y+ ++ +G +GG S ++VG + + LDPH
Sbjct: 209 LLLILHFKLGTGKLNQIYVEQIKSLMDLEYFVGAIGGIKNKSLFMVGYMNDQFLSLDPHV 268
Query: 371 VQPVINIGKDDLEAD 385
Q N KD L +
Sbjct: 269 QQ---NACKDPLNLN 280
>gi|118349810|ref|XP_001008186.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89289953|gb|EAR87941.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 343
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 64/264 (24%), Positives = 105/264 (39%), Gaps = 37/264 (14%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
I SYR GF + I SD GWGCMLRS QM+ A LL H P +Q + +
Sbjct: 27 IYFSYRSGFSHQFQNHIFSDSGWGCMLRSGQMIFANGLLRHLKENP---QIQNQLKIQNI 83
Query: 208 E-----ILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 261
I+ F +++ PFSI + A + + L G W P + S + L + +
Sbjct: 84 NDILLFIIKFFIENKDQPFSIQQIAAVALEEFKLEMGFWYSPNRIAYSLKKLLNNFQTFS 143
Query: 262 GLGCQS------LPMAIYVVSGDEDGERGGAPV------VCIDDASRHCSVFSKGQADWT 309
+ S P+ G++ + + + I++ + + + +
Sbjct: 144 EMNIVSEVMYSDRPLYFSQCVTAMTGQKIDSTLPKQLLQILINNIEKQIKIMKQNSNKYQ 203
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+++GL+ +Y+ L FT S+G ++G+ + YLDPH
Sbjct: 204 INKQNYKILIGLDYPEEKYLDILIKLFTHRLSIG-----------MIGLNNDKLTYLDPH 252
Query: 370 DVQPV-INIGKDDLEADTSTYHSE 392
VQ IN E + TY E
Sbjct: 253 IVQHADINTN----EINLKTYFQE 272
>gi|320588376|gb|EFX00845.1| cysteine protease atg4 [Grosmannia clavigera kw1407]
Length = 348
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 62/124 (50%), Gaps = 28/124 (22%)
Query: 140 FNQDFSSRILISYRKGFDPI---------------------GD-SKITSDVGWGCMLRSS 177
F DF SR ++YR GF+PI GD S +SD GWGCM+RS
Sbjct: 120 FLDDFESRFWMTYRSGFEPIARSVDPKAPATLSFTMKLKALGDQSDFSSDSGWGCMIRSG 179
Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
Q L+A A+ + LGR WR ++ EI+ LF D +P+SIH + G +A
Sbjct: 180 QSLLANAMAMYELGRGWRLSDGGIAEK---EIISLFADDPRAPYSIHRFVGHG---AVAC 233
Query: 238 GSWV 241
GS++
Sbjct: 234 GSFL 237
>gi|124088531|ref|XP_001347134.1| Cysteine protease required for autophagy-like [Paramecium
tetraurelia strain d4-2]
gi|145474259|ref|XP_001423152.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|50057523|emb|CAH03507.1| Cysteine protease required for autophagy-like [Paramecium
tetraurelia]
gi|124390212|emb|CAK55754.1| unnamed protein product [Paramecium tetraurelia]
Length = 277
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 104/242 (42%), Gaps = 48/242 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
F Q + I SYR G + SD GWGC++R QM+VA +L+
Sbjct: 14 FLQLKETFIWFSYRANIQYEG--RAISDQGWGCLIRVGQMIVANSLIRESTNS------- 64
Query: 200 KPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 253
KP D + +I+ LF D++ +PFSI +++ A Y + G W GP MC + L
Sbjct: 65 KPNDLK-TKIICLFDDNQCFSTLAPFSIQQIIKRADLVYNIKIGDWYTGPKIMCLLEDLL 123
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW---TP 310
Q A+T + I + C + + Q D P
Sbjct: 124 ---QSAKT------------------------IKQLKIINFLEQCVI--EKQIDLQFKQP 154
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
LL++ ++G ++++ ++ L+ PQ G + GK + +++G Q I +DPH
Sbjct: 155 QLLIIHAIIGNKELDQYFVAELQKHMQIPQFAGAIVGKSKKAYFLIGYQNNQGIVMDPHY 214
Query: 371 VQ 372
VQ
Sbjct: 215 VQ 216
>gi|154343631|ref|XP_001567761.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065093|emb|CAM43207.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 398
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/230 (25%), Positives = 96/230 (41%), Gaps = 33/230 (14%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
+ SYR F P+ + T+D WGC+LR++QML+ LL + + P + +
Sbjct: 74 LYFSYRSCFPPLPNGS-TTDTRWGCLLRTTQMLIGTCLLRYHCKGAYVLPEADNAELK-A 131
Query: 208 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 267
I LF D ++P IH + S + P E G+
Sbjct: 132 NISRLFMDVPSAPLGIHRAEDEAHKNCVKYASMLSP---------------TEAGMA--- 173
Query: 268 LPMAIYVVSGDEDGERGGAPVV--CIDDASRHCSVFSK---GQADWTPILLLVPLVLGLE 322
MA +++ +G G P C + +V +K GQ ++L++P+VLGL
Sbjct: 174 --MAAALIACHAEG--GDVPFTFSCENRNIDEPAVVAKLLEGQH----VILIIPVVLGLA 225
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
++ +Y + GI GG AS Y+ G Q ++DPH +Q
Sbjct: 226 PLSDKYESMMLKILDMKACCGIAGGFKQASFYMFGHQGRKVFFMDPHYIQ 275
>gi|195350257|ref|XP_002041657.1| GM16788 [Drosophila sechellia]
gi|194123430|gb|EDW45473.1| GM16788 [Drosophila sechellia]
Length = 269
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 81/175 (46%), Gaps = 24/175 (13%)
Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
+SIH + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 4 YSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD--- 52
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
V +DD C + W P+LL++PL LG+ +NP Y+P L+
Sbjct: 53 ------STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDS 102
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSE 392
S G++GG+P + Y +G ++ +YLDPH Q + + A+ TYH +
Sbjct: 103 SCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQK 157
>gi|167391747|ref|XP_001739914.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165896205|gb|EDR23684.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 325
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 58/269 (21%), Positives = 113/269 (42%), Gaps = 57/269 (21%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+++LG C+ +E L N+ N I+ +YR+ + +G++ ++SD GWGC
Sbjct: 36 VYILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSCLGNTYLSSDAGWGC 91
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------REYVEILHLFGDSETSPFSIHN 225
+R++QM++ L+ ++ +Q+ D + ++ L D +S SIHN
Sbjct: 92 AIRATQMMIVNTLVI------FKDQMQQIIDYNSFEHQQNKLQAKELIYDKISSLLSIHN 145
Query: 226 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
+ + K + +++ P C + +L + E ++
Sbjct: 146 IYIQEIIKVHNPTGTNFLPPSICCIAISSLLQ-----------------------EWDKK 182
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
+ C+D +CS P L L+P ++ + + + T QS G
Sbjct: 183 LFNCITCLDHIP-NCSY---------PTLYLIPQIITFTEHQ-----LILDSLTLSQSRG 227
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
VGG ++ ++ G Q + +LDPH VQ
Sbjct: 228 FVGGIGESAIFVFGYQGTTLFFLDPHYVQ 256
>gi|326665689|ref|XP_002661113.2| PREDICTED: cysteine protease ATG4D-like, partial [Danio rerio]
Length = 149
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 11/82 (13%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKITS 166
S +S + LLG ++++ + G+ E F + FSS + +SYR+GF P+ S ++S
Sbjct: 74 SKSSPVCLLGQSYQLS----------STGVRESFRRVFSSLLWMSYRRGFRPLDGSTLSS 123
Query: 167 DVGWGCMLRSSQMLVAQALLFH 188
D GWGCMLRS+QML+AQ LL H
Sbjct: 124 DAGWGCMLRSAQMLLAQGLLLH 145
>gi|154281231|ref|XP_001541428.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150411607|gb|EDN06995.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 463
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 42/70 (60%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
D P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q Y
Sbjct: 253 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQASHFFY 312
Query: 366 LDPHDVQPVI 375
LDPH +P +
Sbjct: 313 LDPHHTRPAL 322
>gi|440301471|gb|ELP93857.1| hypothetical protein EIN_176840 [Entamoeba invadens IP1]
Length = 362
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 65/281 (23%), Positives = 120/281 (42%), Gaps = 60/281 (21%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ-----DFSSRILISYRKGFDPIGDSKITSD 167
++LLG+ +K + + L +++ S+ + ++YR G++ + +S + +D
Sbjct: 39 LFLLGIEYKTTPLKKQAQELPQSSLLQYSSMAAYVRMSNLLWMTYRSGYEKLPNSSLNTD 98
Query: 168 VGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVEILHLFGD--SETSPFSI 223
VGWGC +R+ QM+++ A+ L ++ P E + ++ F D +T+P SI
Sbjct: 99 VGWGCTIRAVQMMISNAMQTLVYKHDLTSSTTPYIPKQNEILNVVIPFVDFFEQTTPLSI 158
Query: 224 HNLLQ---------AGKAYGLAAGSWVGPYA-MCRSWEALA-RCQRAETGLGCQSLPMAI 272
H++ + +G Y LA Y+ + SW+ A RC A S+P+
Sbjct: 159 HHVYESRFVVEQNKSGVNY-LAPTIVAKAYSDLVNSWKMCALRCVMASNT----SIPL-- 211
Query: 273 YVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
C + + + P L+ +P+++ + V R L
Sbjct: 212 -------------------------CDI---KKEPFKPTLVFLPIIMD-QLVKSR----L 238
Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
+ + F GIV G + YI G ++LDPH VQP
Sbjct: 239 QQIYKFNMFAGIVSGIGDRAVYIFGFHVMRCLFLDPHTVQP 279
>gi|67470848|ref|XP_651386.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|56468115|gb|EAL46000.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
Length = 325
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 63/271 (23%), Positives = 111/271 (40%), Gaps = 61/271 (22%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+ +LG C+ +E L N+ N I+ +YR+ + +G++ ++SD GWGC
Sbjct: 36 VHILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSYLGNTYLSSDAGWGC 91
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHN 225
+R++QM++ AL+ ++ +Q+ D E L D +S SIHN
Sbjct: 92 AIRATQMMIVNALVI------FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHN 145
Query: 226 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
+ Q K + +++ P C + +L + E
Sbjct: 146 IYIQQVIKTHNPKGTNFLPPSVCCIAISSLLQ--------------------------EW 179
Query: 284 GGAPVVCIDDASR--HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
P CI + CS P L L+P ++ + + + +L L+ QS
Sbjct: 180 DKKPFNCITCLNHIPSCS---------CPTLYLIPRIITFTE-HQLILDSLALS----QS 225
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
G VGG ++ ++ G Q + +LDPH VQ
Sbjct: 226 RGFVGGIGESAIFVFGCQGTTLFFLDPHYVQ 256
>gi|444730159|gb|ELW70550.1| Cysteine protease ATG4A [Tupaia chinensis]
Length = 364
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 71/298 (23%), Positives = 110/298 (36%), Gaps = 98/298 (32%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
L EF D + I ++ G + +SD GWGCMLR QM++AQAL+ LGR
Sbjct: 24 LEEF-PDTDELVWILGKQHLLKTGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRA--- 79
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
Q G G + G W GP + + + LA
Sbjct: 80 -------------------------------QMGVGEGKSIGEWFGPNTVAQVLKKLALF 108
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF--------------- 301
+ +A+YV + V I+D + C V
Sbjct: 109 DEWNS--------LAVYVSMDN---------TVVIEDIKKMCCVLPLSADTDTESPPDSP 151
Query: 302 -----SKGQ----ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
SKG + W P+LL+VPL LG+ ++NP Y+ +L + + IV +
Sbjct: 152 TASNQSKGPSACGSAWKPLLLIVPLRLGINQINPVYVDAFKLQASC-HPILIVTKEGVRR 210
Query: 353 TYIVGVQEESA--------------------IYLDPHDVQPVINIGKDDLEADTSTYH 390
T I+ ++ S I+LDPH Q ++ ++ + D T+H
Sbjct: 211 TRILPPKDSSGARASESLKVKHVSFKTGDELIFLDPHTTQTFVDTEENGM-VDDQTFH 267
>gi|440291586|gb|ELP84849.1| hypothetical protein EIN_284050 [Entamoeba invadens IP1]
Length = 352
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 102/233 (43%), Gaps = 53/233 (22%)
Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--- 208
YR F P+ ++ +TSD GWGC +RS+QMLVA A+ K FD V
Sbjct: 92 YRNNFQPLPNTTLTSDSGWGCTIRSTQMLVANAI---------GKLFTNDFDTGEVTDKM 142
Query: 209 ILHLFGD--SETSPFSIHNLL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
++ F D S PFSIHNL +A + S++ P A+ ++ + + + A G
Sbjct: 143 VIKFFLDFFSVECPFSIHNLFLTKAILQGNINGNSFLPPSAVAAAFVEINK-KLANPKFG 201
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ L + V+++ P ++L+P+ + +
Sbjct: 202 MEILT------------------------TTFTFRVYTQ------PTIVLIPISIP-DSF 230
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ-PVIN 376
N + + + F+F G+VGG + Y G+ + ++LDPH V+ VIN
Sbjct: 231 NDK----IAVIFSFYLFSGMVGGSGRKAFYFFGIHHDQLLFLDPHTVRNTVIN 279
>gi|240274226|gb|EER37743.1| cysteine protease atg4 [Ajellomyces capsulatus H143]
Length = 454
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 42/70 (60%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
D P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q Y
Sbjct: 245 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFY 304
Query: 366 LDPHDVQPVI 375
LDPH +P +
Sbjct: 305 LDPHHTRPAL 314
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 50/119 (42%), Gaps = 24/119 (20%)
Query: 101 PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 156
P+R+ S++ LL H+ + LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144
Query: 157 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
DP + T+D GWGCM+RS Q L+A AL LGR R
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRACR 203
>gi|281208441|gb|EFA82617.1| hypothetical protein PPL_04309 [Polysphondylium pallidum PN500]
Length = 646
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 57/120 (47%), Gaps = 22/120 (18%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
+ EF +DFS++I +SYR+GF IGD+ +D GWG W+K
Sbjct: 409 INEFLEDFSNKIWMSYRQGFPYIGDTMFENDCGWGY---------------------WKK 447
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALAR 255
Q + I+ +F D T+PFSIHN+ G+ + G G W P + + ++L
Sbjct: 448 SGQNEYPELLYNIVRMFLDKPTAPFSIHNIALHGQNHLGKNVGEWFAPSNITHAIKSLVN 507
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 23/33 (69%)
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
IVGGKP AS Y + Q+++ YLDPH VQ I+
Sbjct: 541 IVGGKPRASLYFIAAQDDNLFYLDPHTVQQAID 573
>gi|194374239|dbj|BAG57015.1| unnamed protein product [Homo sapiens]
Length = 259
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 56/218 (25%), Positives = 81/218 (37%), Gaps = 56/218 (25%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 1 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 61 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 103
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 104 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 126
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
Q + I+LDPH Q ++ ++ D T+H
Sbjct: 127 -LTASNQGDELIFLDPHTTQTFVDTEENGTVND-QTFH 162
>gi|330846267|ref|XP_003294964.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
gi|325074459|gb|EGC28510.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
Length = 266
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 40/57 (70%), Gaps = 2/57 (3%)
Query: 134 NNGLAE--FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
NN + + F D S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+ALL H
Sbjct: 205 NNNIIQSNFLDDVRSLIWFSYRKDFPPIENTTITTDIGWGCMLRTGQMILARALLKH 261
>gi|66359342|ref|XP_626849.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
gi|46228139|gb|EAK89038.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
Length = 348
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 60/117 (51%), Gaps = 21/117 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK------------ITSDVGWGCMLRSSQMLVAQALLF 187
F ++F IL +YR F I ++ I SDVGWGCM R +QM +A +
Sbjct: 44 FLKEFHDIILFTYRNEFKNIIITRNTVQLTKNYSKNINSDVGWGCMYRVTQMSIAHGIC- 102
Query: 188 HRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAGSWVGP 243
+ K + E +IL+ F D+E++ FSIHN++ G + +G+ SW+GP
Sbjct: 103 -----QFMKRFLGNLNIE--KILNNFQDNESAKFSIHNMVNIGLSEFGIDPTSWIGP 152
>gi|118390095|ref|XP_001028038.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89309808|gb|EAS07796.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 1216
Score = 64.7 bits (156), Expect = 8e-08, Method: Composition-based stats.
Identities = 29/82 (35%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
+ LL+P LGL++++P +I L+ + QS+G++GGKP + Y +G + +YLDPH
Sbjct: 493 LFLLLPCRLGLDEISPIHIEILKKLLSLKQSVGMIGGKPNKAHYFLGFVGDDLLYLDPHY 552
Query: 371 VQPVINIGKDDLEADTSTYHSE 392
++ + K+DL + S+Y E
Sbjct: 553 IKECVR--KEDLMENISSYFEE 572
Score = 51.2 bits (121), Expect = 0.001, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 7/54 (12%)
Query: 142 QDFSSRILISYRKGFDPIGDSKI-------TSDVGWGCMLRSSQMLVAQALLFH 188
Q + + IL +YRK F P+ KI TSD GWGCM+R+ QM+ AQ + H
Sbjct: 257 QIYQNTILFTYRKNFYPLLKDKINDPQKNQTSDAGWGCMIRAGQMIFAQTIKRH 310
>gi|323450755|gb|EGB06635.1| hypothetical protein AURANDRAFT_65498 [Aureococcus anophagefferens]
Length = 426
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 54/117 (46%), Gaps = 15/117 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
+ +YR GF+ + T D GWGCMLRS+QML+ AL R G R +
Sbjct: 28 LWFTYRCGFEELAPYGFTDDAGWGCMLRSAQMLLGNAL--TRNGAAPR-----------L 74
Query: 208 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
LF D+ +++PF +HN + G Y + G W GP C L +R G
Sbjct: 75 ATAALFADAPGDSAPFGLHNFAKCGLRYDVLPGEWYGPGVACHVLRDLVDWRRNAPG 131
>gi|340508254|gb|EGR34000.1| peptidase family c54 protein, putative [Ichthyophthirius
multifiliis]
Length = 209
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 20/143 (13%)
Query: 142 QDFSSRILISYRKGFDPI----GDSKI---TSDVGWGCMLRSSQMLVAQALLFHRLGR-- 192
++F + I ++YR+ F P+ D KI SD GWGCM+R QM +A+ L H +
Sbjct: 24 ENFKNIIWMTYRRNFFPLLHNTKDHKIQNYISDTGWGCMVRVGQMALAEGLRHHLQQKGI 83
Query: 193 -PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSW 250
++ +Q D + FGD +P+SI + + A K + L G W P +C
Sbjct: 84 YDNKRIIQAFLDND-------FGDDNIAPYSIQKICKIAYKEFQLVPGQWYSPVRICHVL 136
Query: 251 EALARCQRAETGLGCQSLPMAIY 273
L + L C+ L + ++
Sbjct: 137 SLLHN--DKKQILDCEDLKVGVF 157
>gi|403345460|gb|EJY72096.1| Cysteine protease family C54 putative [Oxytricha trifallax]
Length = 823
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 10/99 (10%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLG-----RPWRKPLQKPFDREYVEILHLFGD---S 216
T+DVGWGC +R QM++ QAL+ H +G + QK + Y +I+ L D S
Sbjct: 394 TTDVGWGCTIRVGQMMICQALMRHLIGLDHSVKNLSSTEQKRLN--YAKIIQLIHDNDCS 451
Query: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
+T FSI N+ + G + G W GP+A+ L R
Sbjct: 452 QTGAFSIQNIAKMGFCHDKLPGEWYGPHALTIMLRDLNR 490
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 48/82 (58%), Gaps = 3/82 (3%)
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
IL+++P LGL KVN Y +++ F ++GI+GG+P + Y VG Q+ I LDPH
Sbjct: 611 ILVIIPTRLGLNKVNKEYYSSIKYVFQCRLNVGIMGGRPNQALYFVGTQKTDLICLDPHL 670
Query: 371 VQPVINIGKDDLEAD--TSTYH 390
VQ + + +++L TYH
Sbjct: 671 VQDTV-LNQEELSNVELNQTYH 691
>gi|403354729|gb|EJY76927.1| hypothetical protein OXYTRI_01553 [Oxytricha trifallax]
Length = 564
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/266 (24%), Positives = 103/266 (38%), Gaps = 70/266 (26%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 219
+T+D WGC +RS+QM++A AL P IL LF D+ S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQQSTFMYPVNS------------ILKLFDDNIRECTES 261
Query: 220 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 254
FSI N+ LQ G+ YG+++ + + + +C +E +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGFIVFETIM 321
Query: 255 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---------RHCSV--F 301
+ CQ Q L V++ + E DD + R +
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLTFSQMGLGCDRRINYDKL 380
Query: 302 SKGQADWTP---------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D P +L++V + LGL+K++P Y + PQ +G+VGGKP +
Sbjct: 381 PNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGGKPNKA 440
Query: 353 TYIVG------VQEESAIYLDPHDVQ 372
Y G + ++LDPH VQ
Sbjct: 441 FYFFGHIIDQDTNKVKLMFLDPHKVQ 466
>gi|403370248|gb|EJY84987.1| hypothetical protein OXYTRI_17161 [Oxytricha trifallax]
Length = 564
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/266 (24%), Positives = 103/266 (38%), Gaps = 70/266 (26%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 219
+T+D WGC +RS+QM++A AL P IL LF D+ S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQQSTFMYPVNS------------ILKLFDDNIRECTES 261
Query: 220 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 254
FSI N+ LQ G+ YG+++ + + + +C +E +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGYIVFETIM 321
Query: 255 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---------RHCSV--F 301
+ CQ Q L V++ + E DD + R +
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLTFSQMGLGCDRRINYDKL 380
Query: 302 SKGQADWTP---------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D P +L++V + LGL+K++P Y + PQ +G+VGGKP +
Sbjct: 381 PNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGGKPNKA 440
Query: 353 TYIVG------VQEESAIYLDPHDVQ 372
Y G + ++LDPH VQ
Sbjct: 441 FYFFGHIIDLDTNKVKLMFLDPHKVQ 466
>gi|307190834|gb|EFN74684.1| Cysteine protease ATG4B [Camponotus floridanus]
Length = 93
Score = 62.4 bits (150), Expect = 3e-07, Method: Composition-based stats.
Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 19/91 (20%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIG-- 160
I + IW+LG + N L E + +D S + +YRKGF PIG
Sbjct: 16 IPQTDEPIWILGKKY--------------NALKELDMIRRDIRSMLWFTYRKGFIPIGGC 61
Query: 161 DSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
+S TSD GWGCMLR QM++AQAL+ LG
Sbjct: 62 NSTFTSDKGWGCMLRCGQMVLAQALITLHLG 92
>gi|46136685|ref|XP_390034.1| hypothetical protein FG09858.1 [Gibberella zeae PH-1]
Length = 360
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 96/228 (42%), Gaps = 35/228 (15%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRS-- 176
+A D+ + D +G F DF S+I ++YR F+PI S + TS + L+S
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158
Query: 177 -SQMLVAQALLFHRLGR-PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
Q + + RLGR WR+ E +L F D +P+SIH+ ++ G A
Sbjct: 159 GDQSPFSSDTMV-RLGRGDWRRGESV---EEECRLLKDFADDPRAPYSIHSFVRHGASAC 214
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G G W GP A R +AL + +I V S G P V D+
Sbjct: 215 GKYPGEWFGPSATARCIQALTNSHES-----------SIRVYST------GDGPDVYEDE 257
Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
+ D+ P L+LV LG++K+ P Y L PQS
Sbjct: 258 ---FMQIAKPPGEDFHPTLVLVGTRLGIDKITPVYWEALIAALQMPQS 302
>gi|395750455|ref|XP_002828707.2| PREDICTED: cysteine protease ATG4D [Pongo abelii]
Length = 296
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 78/185 (42%), Gaps = 39/185 (21%)
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
+R + +I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 51 ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILRKAVE 103
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
+ + +YV S+ C+V L + L +
Sbjct: 104 SCSEVTRLVVYV--------------------SQDCTV-----------LHMRSLAIDPS 132
Query: 323 KVNPRYIPT-LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
K +P+ L+ LGI+GGKP S Y +G Q++ +YLDPH QP +++ + +
Sbjct: 133 KDRSTCLPSSLQELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQAN 192
Query: 382 LEADT 386
++
Sbjct: 193 FPLES 197
>gi|403364614|gb|EJY82073.1| hypothetical protein OXYTRI_20407 [Oxytricha trifallax]
Length = 806
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 41/65 (63%)
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
+++++ + LGLE + Y L+ F+ Q +GI+GGKP + Y VG Q++ I+LDPH
Sbjct: 641 LMIIMTIRLGLENIEQDYHKALKACFSLRQCVGILGGKPNFALYFVGYQQDHMIFLDPHY 700
Query: 371 VQPVI 375
VQ +
Sbjct: 701 VQQAL 705
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 13/72 (18%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV----EILHLFGDSETS 219
I SD GWGCM+R QM++A + L K LQ+ + + IL + D +
Sbjct: 393 INSDCGWGCMIRCQQMMLANSFL---------KLLQQNHNFHDILTHDSILSMILDQLDA 443
Query: 220 PFSIHNLLQAGK 231
PF IH + + G+
Sbjct: 444 PFGIHQITEEGR 455
>gi|67482849|ref|XP_656724.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|56473943|gb|EAL51338.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449705841|gb|EMD45804.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 348
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/240 (24%), Positives = 96/240 (40%), Gaps = 64/240 (26%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+S I YR F + ++ +TSD GWGC +R+ QML+A A++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANAII-------------KLFGS 131
Query: 205 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQR 258
+ + ++H F D S P+SIH+L G GS P++
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFTTQIIVSGNPNGSSFLPFS------------- 178
Query: 259 AETGLGCQSLPMAIYVVSG--DEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILL 313
IY ++ ++D R C V + ++ P ++
Sbjct: 179 -----------SVIYALTELVNKDFNRAF-----------ECHVITNKFLLKSINKPTIV 216
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
+P + +K + R I F+F G+VGG + Y G+ ++LDPH V+P
Sbjct: 217 FIPFTIP-DKFDQRLIT----IFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRP 271
>gi|209880175|ref|XP_002141527.1| peptidase family C54 [Cryptosporidium muris RN66]
gi|209557133|gb|EEA07178.1| peptidase family C54, putative [Cryptosporidium muris RN66]
Length = 353
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 61/141 (43%), Gaps = 23/141 (16%)
Query: 118 VCHKIAQ-DEALGDAAGNNGLAE----FNQDFSSRILISYRKGFDPIGD---------SK 163
+ + I Q D++L GN A+ F + F IL SYR F I S
Sbjct: 20 IIYNIDQHDDSLIFLFGNKYDADKYDSFLKSFHEIILFSYRYNFPTIRSEWDFSIETGSS 79
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
+T+D+GWGCMLR QM +A LL R K + IL F D E S FSI
Sbjct: 80 VTTDLGWGCMLRVIQMSLALGLL--------RYCKMKKYTYSLDYILQNFQDLEESLFSI 131
Query: 224 HNLLQAG-KAYGLAAGSWVGP 243
H ++ G + W GP
Sbjct: 132 HQFVKVGCSIFNKKPKDWFGP 152
>gi|307201261|gb|EFN81130.1| Cysteine protease ATG4B [Harpegnathos saltator]
Length = 98
Score = 58.9 bits (141), Expect = 4e-06, Method: Composition-based stats.
Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 13/81 (16%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 170
+W+LG + ++ L +D S + +YRKGF PIG +S TSD GW
Sbjct: 23 VWILGRVYNAIKE-----------LDIIRRDIRSILWFTYRKGFVPIGGCNSTFTSDKGW 71
Query: 171 GCMLRSSQMLVAQALLFHRLG 191
GCMLR QM++A+AL+ LG
Sbjct: 72 GCMLRCGQMVLARALITLHLG 92
>gi|407037690|gb|EKE38747.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 348
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 55/255 (21%), Positives = 99/255 (38%), Gaps = 62/255 (24%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+S I YR F + ++ +TSD GWGC +R+ QML+A +++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANSII-------------KLFGS 131
Query: 205 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
+ + ++H F D S P+SIH+L + +
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFT-------------------------TQIIVS 166
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILLLVP 316
+ G LP ++ + + E + + + C + + + P ++ +P
Sbjct: 167 KNPNGSSFLPFSVVIYALTELVNKDF-------NRAFECHIITNKFLLNSINKPTIVFIP 219
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP--- 373
+ E + L F+F G+VGG + Y G+ ++LDPH V+P
Sbjct: 220 FTIPDE-----FEQRLITIFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRPCAS 274
Query: 374 -VINIGKDDLEADTS 387
+I + D A S
Sbjct: 275 SIIKFDEKDYIAKLS 289
>gi|401403014|ref|XP_003881388.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325115800|emb|CBZ51355.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 3465
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 41/65 (63%), Gaps = 2/65 (3%)
Query: 311 ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
LLL PL L EK+NP Y+P+L P S+G+V G+ + Y +G Q+++ +YLDPH
Sbjct: 2955 CLLLFPLTLCSGEKINPVYVPSLLAYLELPWSVGMVAGRGQQAFYCIGTQQKALLYLDPH 3014
Query: 370 D-VQP 373
+QP
Sbjct: 3015 SGIQP 3019
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 17/70 (24%)
Query: 139 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
+ +Q S +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 942 QLSQTVGSIARFTYRSGFSPMYKCCGEKKRRAGGGFEREWIAINSDVGWGCTVRAAQMLL 1001
Query: 182 AQALLFHRLG 191
QAL H LG
Sbjct: 1002 MQALRRHFLG 1011
>gi|294953189|ref|XP_002787639.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239902663|gb|EER19435.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 341
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 18/101 (17%)
Query: 148 ILISYRKGFDPI----GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
IL +YR F+PI G + + SD GWGC +R++QML+AQA+ G+ D
Sbjct: 67 ILFTYRCAFEPIEGCVGPTSV-SDKGWGCAIRATQMLLAQAV--KMAGK----------D 113
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGK-AYGLAAGSWVGP 243
+ +L LF DS +P S+H +++ G+ G+W GP
Sbjct: 114 ADDSVVLSLFLDSPQAPLSLHRMVKMGQEVLAKRPGTWFGP 154
>gi|167385012|ref|XP_001737178.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900129|gb|EDR26546.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 348
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 97/239 (40%), Gaps = 62/239 (25%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+S I YR F + ++ + SD GWGC +R+ QML+A A++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLKSDGGWGCTIRACQMLLANAII-------------KLFGS 131
Query: 205 EYVE---ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
+ + ++H F D + P+SIH+L + +G+
Sbjct: 132 DNINRKTVIHWFLDFYNVECPYSIHSLFTTQI---IVSGN-------------------- 168
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR--HCSVFSKG---QADWTPILLL 314
G LP+++ + E + D +R C V + + P ++
Sbjct: 169 --PNGSSFLPLSVVTYALTELVNK---------DLNRIFECHVITNKFLLNSINKPTIIF 217
Query: 315 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
+P + ++ N R I F+F G+VGG + Y G+ + ++LDPH V+P
Sbjct: 218 IPFTIP-DEFNQRLIS----IFSFNLFAGMVGGCKQKAFYFFGIHHDQLLFLDPHFVRP 271
>gi|237837057|ref|XP_002367826.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
gi|211965490|gb|EEB00686.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
Length = 3559
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 50/91 (54%), Gaps = 8/91 (8%)
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 343
GA V C+ D S + +G LLL PL L EK+NP Y+ +L P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHD-VQP 373
+V G+ + Y +G Q+++ +YLDPH +QP
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQP 3054
Score = 44.7 bits (104), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)
Query: 86 VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
+TA SM R+ V G S R IS D W G ++ D A + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139
Query: 139 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
E + + +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196
Query: 182 AQALLFHRL 190
QAL H L
Sbjct: 1197 MQALRRHFL 1205
>gi|221481944|gb|EEE20310.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 3562
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 50/91 (54%), Gaps = 8/91 (8%)
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 343
GA V C+ D S + +G LLL PL L EK+NP Y+ +L P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHD-VQP 373
+V G+ + Y +G Q+++ +YLDPH +QP
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQP 3054
Score = 44.7 bits (104), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)
Query: 86 VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
+TA SM R+ V G S R IS D W G ++ D A + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139
Query: 139 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
E + + +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196
Query: 182 AQALLFHRL 190
QAL H L
Sbjct: 1197 MQALRRHFL 1205
>gi|407037201|gb|EKE38550.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 193
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 74/150 (49%), Gaps = 25/150 (16%)
Query: 95 HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRI 148
HE V P G S ++LGV K Q D+ L + L A F + S+
Sbjct: 25 HEDVQKPIFVGGCS----FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAAAFFR-ISNLF 79
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-------LFHRLGRPWRKPLQKP 201
++YR G++ + +S +T+DVGWGC +R+ QM++A A+ + P+ P
Sbjct: 80 WMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYI-----P 134
Query: 202 FDREYVEILHLFGDS--ETSPFSIHNLLQA 229
+E + +L F DS T+P SIH++ ++
Sbjct: 135 TKQEVMNVLIPFIDSPNSTTPLSIHHVYES 164
>gi|307108757|gb|EFN56996.1| hypothetical protein CHLNCDRAFT_143632 [Chlorella variabilis]
Length = 538
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 62/129 (48%), Gaps = 26/129 (20%)
Query: 179 MLVAQALLFHRLGRPWR----------------KPLQKPFDREYVEILHLFGDS--ETSP 220
M++AQ L+ H LGR WR +L LF D+ E +P
Sbjct: 1 MILAQGLVRHVLGREWRWPEAARQQQAAAAPALAAAPAEAPPRLARLLELFWDTPAERNP 60
Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
FS+H+L +AG+A G+ AG W+GP+ MC++ A A R Q + + + V E
Sbjct: 61 FSLHSLCRAGQACGVVAGRWLGPWVMCKTLAAAAGAARR------QGVDLGLTVAVLAES 114
Query: 281 GERGGAPVV 289
G GGAP++
Sbjct: 115 G--GGAPLL 121
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 26/37 (70%)
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
K+NPRYIP L PQS+GIVGG+P +S Y VG Q
Sbjct: 215 KLNPRYIPQLEAVLAMPQSIGIVGGRPSSSLYFVGFQ 251
>gi|407043625|gb|EKE42056.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 183
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/144 (25%), Positives = 66/144 (45%), Gaps = 19/144 (13%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+ +LG C+ +E L N+ N I+ +YR+ + +G++ ++SD GWGC
Sbjct: 36 VHILGNCYYPETNENLNHLTFNDA----NLKIHDLIVATYRQKYSYLGNTYLSSDAGWGC 91
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHN 225
+R++QM+V AL+ ++ +Q+ D E L D +S SIHN
Sbjct: 92 AIRATQMMVVNALVI------FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHN 145
Query: 226 LL--QAGKAYGLAAGSWVGPYAMC 247
+ Q K + +++ P C
Sbjct: 146 IYIQQVIKTHNPKGTNFLPPSICC 169
>gi|221505025|gb|EEE30679.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 3554
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 40/65 (61%), Gaps = 2/65 (3%)
Query: 311 ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
LLL PL L EK+NP Y+ +L P SLG+V G+ + Y +G Q+++ +YLDPH
Sbjct: 2988 CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLGMVAGRGQQAFYCIGTQQKALLYLDPH 3047
Query: 370 D-VQP 373
+QP
Sbjct: 3048 SGIQP 3052
Score = 44.7 bits (104), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)
Query: 86 VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
+TA SM R+ V G S R IS D W G ++ D A + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139
Query: 139 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
E + + +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIAR---FTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196
Query: 182 AQALLFHRL 190
QAL H L
Sbjct: 1197 MQALRRHFL 1205
>gi|428184439|gb|EKX53294.1| hypothetical protein GUITHDRAFT_133035 [Guillardia theta CCMP2712]
Length = 567
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 6/86 (6%)
Query: 297 HCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+CS ++ + W P++++VP+ LG + L QSLG +GG+P S Y
Sbjct: 406 NCSRMAQAREPCSWRPLIVVVPVRLGARSEDQH----LSRIDKHLQSLGFIGGRPRHSYY 461
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKD 380
VGV+ +A YLDPH QP +I K+
Sbjct: 462 FVGVRGYNAYYLDPHITQPYQSIRKN 487
>gi|156085180|ref|XP_001610073.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797325|gb|EDO06505.1| hypothetical protein BBOV_II005540 [Babesia bovis]
Length = 206
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 30/135 (22%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFD-------------------PI-GDSKITS 166
A+ D L E +DF IL++YR+G P+ + I +
Sbjct: 17 AMCDQNPGPKLRERLKDF---ILLTYRRGLSIHLPRFYAGNIPKRFYGIWPLWQQTDIKT 73
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGC LR++QM +A+AL R PL + IL LF D+ +PFS+ NL
Sbjct: 74 DRGWGCALRATQMALAEAL------RDVLSPLDN-VQEQRSRILQLFYDTTEAPFSLENL 126
Query: 227 LQAGKAYGLAAGSWV 241
+ A +G +W+
Sbjct: 127 VMADVEHGANVVAWI 141
>gi|328852471|gb|EGG01617.1| Hypothetical protein MELLADRAFT_92005 [Melampsora larici-populina
98AG31]
Length = 134
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 310 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDRVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|328859149|gb|EGG08259.1| Hypothetical protein MELLADRAFT_123247 [Melampsora larici-populina
98AG31]
Length = 134
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 310 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDQVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|408392897|gb|EKJ72185.1| hypothetical protein FPSE_07642 [Fusarium pseudograminearum CS3096]
Length = 389
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 26/94 (27%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 159
+A D+ + D +G F DF S+I ++YR F+PI
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158
Query: 160 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
GD S +SD GWGCM+RS Q ++A + RLGR
Sbjct: 159 GDQSPFSSDSGWGCMIRSGQSMLANTIAMVRLGR 192
>gi|328852767|gb|EGG01910.1| Hypothetical protein MELLADRAFT_123246 [Melampsora larici-populina
98AG31]
Length = 134
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 310 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDRVNINPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|440292697|gb|ELP85881.1| hypothetical protein EIN_133850 [Entamoeba invadens IP1]
Length = 348
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 96/260 (36%), Gaps = 58/260 (22%)
Query: 138 AEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
++ + S+ ++YR GF + +T+D GWGC +RS QML +L+ R+ P
Sbjct: 62 SQIAKHLSTLFKVTYRNGFTYHLPHCSLTTDAGWGCTIRSVQMLFLNSLI--RIQEP--- 116
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EALAR 255
FD+ DS+T + G V P + R + + +
Sbjct: 117 --DPGFDK----------DSQTK---------------MKKGFLVHPMDVRREYVQLIED 149
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS--------RHCSV-----FS 302
R E L + V ++ G +P C S R C V F
Sbjct: 150 TPRKEAVLSIHKMFDLEVVRKNNQKGTNYLSPSTCATAISVLMEQWDERPCHVMFVQTFP 209
Query: 303 KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES 362
K T +++L PL N R + +P G+V G + Y+VG
Sbjct: 210 KHVEPNTILMVLAPL-------NER----TQCCLDYPFVSGVVCGVETRAIYVVGHSGGV 258
Query: 363 AIYLDPHDVQPVINIGKDDL 382
+ LDPH VQ G D+
Sbjct: 259 LLLLDPHHVQKAHEDGDFDI 278
>gi|312381461|gb|EFR27207.1| hypothetical protein AND_06241 [Anopheles darlingi]
Length = 307
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 26/38 (68%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 174
+ F +DF SRI ++YR+ F + DS TSD GWGCM+
Sbjct: 195 IEAFRRDFVSRIWMTYRREFQTMDDSNYTSDCGWGCMI 232
>gi|145521674|ref|XP_001446691.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414171|emb|CAK79294.1| unnamed protein product [Paramecium tetraurelia]
Length = 473
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 62/131 (47%), Gaps = 31/131 (23%)
Query: 137 LAEFNQDFSSRIL----ISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQAL----- 185
L E N+D ++ +YR+GF DS +T+D GWGC++R QM++A+ L
Sbjct: 37 LIERNEDILDVVVHTIRFTYRQGFQAYQCQDSALTTDSGWGCVIRVGQMMMAELLKRHLK 96
Query: 186 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSE--------TSP----FSIHNLLQ-AGKA 232
F+++ PL + ++L +F D + + P FSI +++ A K
Sbjct: 97 CFYKVDLFSFPPLLQ-------DVLQMFKDDDDMESQKGFSKPSKYGFSIQKIMRVAYKE 149
Query: 233 YGLAAGSWVGP 243
+G G W P
Sbjct: 150 WGKKPGEWYSP 160
Score = 41.6 bits (96), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 30/54 (55%)
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
+G ++ NP Y+ +R G++GG+P + +IVG + + LDPH VQ
Sbjct: 286 IGCDEPNPDYLQAIRQFMKKKYFAGMLGGRPKEANFIVGFVDNKFVVLDPHLVQ 339
>gi|145500036|ref|XP_001436002.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403139|emb|CAK68605.1| unnamed protein product [Paramecium tetraurelia]
Length = 469
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 27/116 (23%)
Query: 148 ILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQAL-----LFHRLGRPWRKPLQK 200
I +YR+GF +S +T+D GWGC++R QM++A+ L F+ + PL +
Sbjct: 52 IRFTYREGFQAYQCQNSTLTTDSGWGCVIRVGQMMMAELLKRHLKCFYNVNLFQFPPLMQ 111
Query: 201 PFDREYVEILHLFGDSETSP------------FSIHNLLQ-AGKAYGLAAGSWVGP 243
E+L LF D + FSI +++ A + +G G W P
Sbjct: 112 -------EVLQLFKDDDEMESLKVQGKPSKYGFSIQKIMRIAYEEWGKKPGEWYSP 160
Score = 42.7 bits (99), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 31/54 (57%)
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
+G ++ NP YI +R G++GG+P + +IVG ++ + LDPH VQ
Sbjct: 286 IGCDEPNPDYIQAIRQFMKKKYFAGLLGGRPREANFIVGFVDDKFVVLDPHLVQ 339
>gi|307108756|gb|EFN56995.1| hypothetical protein CHLNCDRAFT_143631 [Chlorella variabilis]
Length = 137
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 9/113 (7%)
Query: 66 SEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGIS-SSTSDIWLLGVCHKIAQ 124
+E AV S G + + L A + ++H+ + +G S + + +WLLG C+
Sbjct: 5 AELSAVDKLSLGLSRSYYALARALRLNKLHDLLA----SGASITPDAPVWLLGQCYSCPP 60
Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI--GDSKITSDVGWGCMLR 175
+ +A LA + S +SYR GF I G + + SD GWGC LR
Sbjct: 61 GAS--EAQQEEALARMLHHYQSIPWMSYRTGFTSIAAGSAHLQSDAGWGCTLR 111
>gi|14043289|gb|AAH07639.1| ATG4D protein [Homo sapiens]
gi|16877152|gb|AAH16845.1| ATG4D protein [Homo sapiens]
gi|119604522|gb|EAW84116.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|325464017|gb|ADZ15779.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
construct]
Length = 141
Score = 45.8 bits (107), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 25/38 (65%)
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+GGKP S Y +G Q++ +YLDPH QP +++ + D
Sbjct: 1 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADF 38
>gi|167386236|ref|XP_001737678.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165899448|gb|EDR26037.1| hypothetical protein EDI_014170 [Entamoeba dispar SAW760]
Length = 346
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 20/94 (21%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 190
NN +A + S+ ++YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 57 TSNNNIA---KHLSTMFRVTYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111
Query: 191 GRP-------WRKPLQKPF-------DREYVEIL 210
P + +QK F REYV+++
Sbjct: 112 QEPNPGFGEDAAEKVQKNFIIHSMEERREYVQLI 145
>gi|195350255|ref|XP_002041656.1| GM16787 [Drosophila sechellia]
gi|194123429|gb|EDW45472.1| GM16787 [Drosophila sechellia]
Length = 135
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 11/67 (16%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +++W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTNVWVLGKKYNAIQELEL-----------IRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGC 172
+D GWG
Sbjct: 92 TDKGWGL 98
>gi|389585790|dbj|GAB68520.1| peptidase, partial [Plasmodium cynomolgi strain B]
Length = 894
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
F+ R Y KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 418 FTKRKRTKYTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 466
>gi|395756856|ref|XP_002834509.2| PREDICTED: cysteine protease ATG4D-like [Pongo abelii]
Length = 141
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 24/35 (68%)
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
+GGKP S Y +G Q++ +YLDPH QP +++ +
Sbjct: 1 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQ 35
>gi|294877403|ref|XP_002767983.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
gi|239870083|gb|EER00701.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
Length = 133
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 112 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI----GDSKITSD 167
D+ +LG + +A A + + + IL +YR F+PI G + + SD
Sbjct: 31 DVHMLGRTYPPPVVDAKCSTAPPPEDSPLYRAYVDIILFTYRCAFEPIEGCVGPTSV-SD 89
Query: 168 VGWGCMLRSSQMLVAQAL 185
GWGC +R++QML+AQA+
Sbjct: 90 KGWGCAIRATQMLLAQAV 107
>gi|183234005|ref|XP_652043.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169801304|gb|EAL46674.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449707706|gb|EMD47317.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 346
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 6/63 (9%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 190
NN +A + S+ I+YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 57 TSNNNIA---KHLSTLFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111
Query: 191 GRP 193
P
Sbjct: 112 QEP 114
>gi|407038566|gb|EKE39191.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 346
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 34/61 (55%), Gaps = 6/61 (9%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
NN +A + S+ I+YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 59 NNNVA---KHLSTMFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RLQE 113
Query: 193 P 193
P
Sbjct: 114 P 114
>gi|221060360|ref|XP_002260825.1| peptidase [Plasmodium knowlesi strain H]
gi|193810899|emb|CAQ42797.1| peptidase, putative [Plasmodium knowlesi strain H]
Length = 1001
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
+F++R + KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 464 NFTNRRRTKHTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 513
>gi|340500608|gb|EGR27474.1| peptidase family c54 protein, putative [Ichthyophthirius
multifiliis]
Length = 384
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 23/33 (69%)
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
S+G++GG PG + Y +G+ + IYLDPH +Q
Sbjct: 223 SIGMIGGVPGKAYYFLGIIDNDFIYLDPHYIQE 255
>gi|156102174|ref|XP_001616780.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805654|gb|EDL47053.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1007
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
F+ R Y KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 468 FAKRKRDRYSKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 516
>gi|84994978|ref|XP_952211.1| autophagy-related peptidase [Theileria annulata strain Ankara]
gi|65302372|emb|CAI74479.1| autophagy-related peptidase, putative [Theileria annulata]
Length = 350
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 102/302 (33%), Gaps = 90/302 (29%)
Query: 128 LGDAAGNNGLAEFNQDFSSR--ILISYRKG-------------------FDPIGDSK--- 163
+ + N +N+ SR IL +YR G F P+ S
Sbjct: 1 MSNVVRENVNVLYNKRLESRFGILFTYRYGLEYKFPRPINFKRRRLFNIFSPLNLSNGIV 60
Query: 164 -ITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------RKPLQKPFDREYVE----- 208
I SD GWGC+LRS+QM ++QALL LG + R P + D+ +
Sbjct: 61 TIDSDKGWGCVLRSTQMAISQALLNLVLGPEFSVEQLEIRNRTPRNRKIDQSLLNIDTFE 120
Query: 209 -----------------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW-VGPY--AMCR 248
IL F D + FSI+N + A GP A+C
Sbjct: 121 KLLNGLLDLDGVSAVSVILAQFYDDLNAVFSIYNFVIADYVLKTCTKFLHFGPTSAALC- 179
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
A + +LP+ + D H S + +
Sbjct: 180 ----------ASKIINDLNLPIN----------------SIAFPDGVFHISDVREILEEK 213
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP-GASTYIVGVQEESAIYLD 367
+L+ V L+++ +R F Q GI+GG S YI G + Y D
Sbjct: 214 RNLLVWVSNKKKLDRIER---ECVRSMFRLSQFNGIIGGNLFNKSYYIFGTTNKRLYYND 270
Query: 368 PH 369
PH
Sbjct: 271 PH 272
>gi|224010768|ref|XP_002294341.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969836|gb|EED88175.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 658
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFH 188
+ SD GWGCMLRS+QM++AQ + H
Sbjct: 133 LKSDAGWGCMLRSAQMMMAQTVRMH 157
>gi|426336111|ref|XP_004029547.1| PREDICTED: uncharacterized protein LOC101129491 [Gorilla gorilla
gorilla]
Length = 351
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 51 ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP 91
>gi|124025328|ref|YP_001014444.1| acetyltransferase [Prochlorococcus marinus str. NATL1A]
gi|123960396|gb|ABM75179.1| possible acetyltransferase [Prochlorococcus marinus str. NATL1A]
Length = 180
Score = 38.9 bits (89), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 8/73 (10%)
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG ++ G+S ++ K+ DE L + G FNQ SS + S+ K FD
Sbjct: 4 LGSTKIGMSGWKNE--------KLLSDETLKNIYGKQAFQYFNQTNSSLFVFSHSKSFDL 55
Query: 159 IGDSKITSDVGWG 171
I ++ VGWG
Sbjct: 56 IELEQLLQAVGWG 68
>gi|72383728|ref|YP_293083.1| acetyltransferase [Prochlorococcus marinus str. NATL2A]
gi|72003578|gb|AAZ59380.1| acetyltransferase, GNAT family [Prochlorococcus marinus str.
NATL2A]
Length = 180
Score = 38.9 bits (89), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 8/73 (10%)
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG ++ G+S ++ K+ DE L + G FNQ SS + S+ K FD
Sbjct: 4 LGSTKIGMSGWKNE--------KLLSDETLKNIYGKQAFQYFNQTNSSLFVFSHSKSFDL 55
Query: 159 IGDSKITSDVGWG 171
I ++ VGWG
Sbjct: 56 IELEQLLQAVGWG 68
>gi|412989956|emb|CCO20598.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Bathycoccus prasinos]
Length = 532
Score = 37.7 bits (86), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 14/22 (63%), Positives = 18/22 (81%)
Query: 165 TSDVGWGCMLRSSQMLVAQALL 186
T+D GWGC LRS+QML +AL+
Sbjct: 135 TTDCGWGCTLRSAQMLFGEALM 156
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.133 0.399
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,261,943,171
Number of Sequences: 23463169
Number of extensions: 261399863
Number of successful extensions: 570674
Number of sequences better than 100.0: 791
Number of HSP's better than 100.0 without gapping: 760
Number of HSP's successfully gapped in prelim test: 31
Number of HSP's that attempted gapping in prelim test: 568316
Number of HSP's gapped (non-prelim): 1322
length of query: 392
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 248
effective length of database: 8,980,499,031
effective search space: 2227163759688
effective search space used: 2227163759688
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 78 (34.7 bits)
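
Note (not part of the BLAST output): the bit scores and Expect values in the hit list can be related to the parameters in the summary block above. The following is a minimal Python sketch, assuming the standard Karlin-Altschul relations (bit score = (Lambda * raw - ln K) / ln 2, and E = effective search space * 2^(-bit score)). It also assumes that the second "Lambda K H" row holds the gapped parameters for BLOSUM62 with gap penalties 11/1. The function and constant names are illustrative only and do not come from BLAST.

```python
import math

# Values copied from the summary block above. The second Lambda/K row is
# ASSUMED to be the gapped one (BLOSUM62, existence 11, extension 1).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 2227163759688


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Estimate the E-value of a raw score over the effective search space."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from the "Score = ... bits (raw)" lines in the report.
    for raw in (135, 133, 86):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.2g}")
```

For the raw score 135 this gives about 56.6 bits and an E-value near 2e-05, in line with the reported "Score = 56.6 bits (135), Expect = 2e-05". Reported E-values can differ somewhat from this estimate, since compositional matrix adjustment and the rounding of printed parameters are not modeled here.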