BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011418
(486 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255576671|ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B, putative [Ricinus communis]
Length = 489
Score = 699 bits (1805), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/493 (73%), Positives = 413/493 (83%), Gaps = 11/493 (2%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MKGFRE+ AS+C SK DTPNRSL S E GS+ S+KGSL SS F SAFSVFETY
Sbjct: 1 MKGFRERV-ASRCSSKCPVDTPNRSLTSDCLESGSN--FSTKGSLWSSFFASAFSVFETY 57
Query: 61 SESS-ASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
ES ASEKK H++ NGWT+AVK++V+ GSMRRIHERVLGPSRTGISS+TSDIWLLGVC
Sbjct: 58 RESPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGVC 117
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
+KI++DE+ G+A N LAEF D+SSRIL++YR+GFD IGDSK SDVGWGCMLRSSQM
Sbjct: 118 YKISEDES-GNADTGNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQM 176
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
LVAQALLFH+LGR W KP QKP D+ YVEILHLFGDSE +PFSIHNL+QAGKAY LAAGS
Sbjct: 177 LVAQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGS 236
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
WVGPYAMCRSWE+LAR +R E L QSLPMA+YVVSGDEDGERGGAPVV I+DASRHC
Sbjct: 237 WVGPYAMCRSWESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCL 296
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
FS+GQADWTPILLLVPLVLGL+KVNPRYIP+L+ TFTF QSLGI+GGKPGASTYIVGVQ
Sbjct: 297 EFSRGQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGVQ 356
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
+++A YLDPH+VQ V+NIG+DD+EADTS+YHSD++RHI L SIDPSLAIGFYCRDKDDFD
Sbjct: 357 DDNAFYLDPHEVQSVVNIGRDDIEADTSSYHSDIVRHIPLHSIDPSLAIGFYCRDKDDFD 416
Query: 420 DFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVL-GETGGVPEDDSLGV-MSMNDAV 475
+FC ASKLA++S GAPLFTV HK KPV+H D+L E V EDDS+ V M +ND
Sbjct: 417 EFCLLASKLADDSQGAPLFTVAHCHKLPKPVSHGDMLNNEDDEVQEDDSVNVMMPVNDDA 476
Query: 476 --GNAHEDDWQLL 486
G A ED+WQLL
Sbjct: 477 EGGGAQEDEWQLL 489
>gi|359495820|ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
gi|296086874|emb|CBI33041.3| unnamed protein product [Vitis vinifera]
Length = 486
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/493 (71%), Positives = 405/493 (82%), Gaps = 14/493 (2%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MKGF EKA ASK K+ D+ N SE SS++K SK SL SS+F SAFSVFET
Sbjct: 1 MKGFCEKAVASKFSCKTKSDSSN-------SEPQSSDTKLSKVSLWSSVFASAFSVFETN 53
Query: 61 SESS--ASEKKAVHN-KSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 117
SESS ASEKKA+ N ++NGWT AV+++VT SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54 SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113
Query: 118 VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 177
+C+KI+Q+E+ A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173
Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
QMLVAQALL HR+GR WRK KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 297
GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293
Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
C FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353
Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
VQ+E A YLDPH+ Q V++I +++LEADTS+YH ++IRHI LDSIDPSLAIGFYCRDKDD
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDD 413
Query: 418 FDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVLGETGGVPEDDSLGVMSMNDAV 475
FDDFC RASKLA++SNGAPLFTV H KP++ SD + + G EDDS V+S A
Sbjct: 414 FDDFCIRASKLADKSNGAPLFTVAHIHSLPKPISCSDGMDDCSGFREDDSFDVVSNKGAE 473
Query: 476 G--NAHEDDWQLL 486
G + HEDDWQLL
Sbjct: 474 GYEHEHEDDWQLL 486
>gi|224117658|ref|XP_002331599.1| predicted protein [Populus trichocarpa]
gi|222873995|gb|EEF11126.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 680 bits (1754), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/488 (71%), Positives = 407/488 (83%), Gaps = 8/488 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MKGFRE+ + S ST ++PNRS S SELGS+++K SK SL S+ F SAFSVF+T+
Sbjct: 1 MKGFRERGFVASSKSSSTAESPNRSFTSDSSELGSADTKFSKPSLWSTFFASAFSVFDTH 60
Query: 61 SESSA-SEKKAVHNK-SNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGV 118
+SS+ SEKKA H + NGWT+AVK++V GSMRRI E VLG S+TGIS++T DIWLLG
Sbjct: 61 CDSSSTSEKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGA 120
Query: 119 CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 178
C+KI+QD + GDAA N LA FN DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQ
Sbjct: 121 CYKISQDNSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQ 180
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 238
MLVAQALLFHRLGR WRKPL KP DREYVEILHLFGDSE+S FSIHNLL+AGKAYGLAAG
Sbjct: 181 MLVAQALLFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAG 240
Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
SWVGPYA+C SWE+L R +R ET L QSL MA+YVVSG EDGERGGAPV+CI++A+RHC
Sbjct: 241 SWVGPYAVCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHC 300
Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
S FSKGQ DWTPILLLVPLVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGV
Sbjct: 301 SEFSKGQEDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGV 360
Query: 359 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
Q+E+A YLDPH+VQPV+N+ +DD+EA+TS+YH +V+RH+ LD IDPSLAIGFYCRDKDDF
Sbjct: 361 QDENAFYLDPHEVQPVVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAIGFYCRDKDDF 420
Query: 419 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNA 478
DDFC ASKL +ESNGAPLFTV + +K + H ++G V DDSLGVM+MND G
Sbjct: 421 DDFCTLASKLTDESNGAPLFTVAHS-RKLLKH-----DSGEVRSDDSLGVMTMNDVEGCV 474
Query: 479 HEDDWQLL 486
HEDDWQLL
Sbjct: 475 HEDDWQLL 482
>gi|147862867|emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
Length = 489
Score = 678 bits (1750), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/496 (71%), Positives = 405/496 (81%), Gaps = 17/496 (3%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MKGF EKA ASK K+ D+ N SE SS++K SK SL SS+F SAFSVFET
Sbjct: 1 MKGFCEKAVASKFSCKTKSDSSN-------SEPQSSDTKLSKVSLWSSVFASAFSVFETN 53
Query: 61 SESS--ASEKKAVHN-KSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 117
SESS ASEKKA+ N ++NGWT AV+++VT SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54 SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113
Query: 118 VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 177
+C+KI+Q+E+ A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173
Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
QMLVAQALL HR+GR WRK KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 297
GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293
Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
C FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353
Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYH---SDVIRHIHLDSIDPSLAIGFYCRD 414
VQ+E A YLDPH+ Q V++I +++LEADTS+YH S +IRHI LDSIDPSLAIGFYCRD
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRD 413
Query: 415 KDDFDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVLGETGGVPEDDSLGVMSMN 472
KDDFDDFC RASKLA+ESNGAPLFTV H KP++ SD + + G EDDS V+S
Sbjct: 414 KDDFDDFCIRASKLADESNGAPLFTVAHIHSLPKPISCSDGMDDCSGFREDDSFDVVSNK 473
Query: 473 DAVG--NAHEDDWQLL 486
A G + HEDDWQLL
Sbjct: 474 GAEGYEHEHEDDWQLL 489
>gi|224092798|ref|XP_002309707.1| predicted protein [Populus trichocarpa]
gi|222852610|gb|EEE90157.1| predicted protein [Populus trichocarpa]
Length = 481
Score = 674 bits (1738), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/488 (70%), Positives = 404/488 (82%), Gaps = 9/488 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MK FR++ GA +T DTP S S SE GS+++K SK SL SS F SAFSVF+ Y
Sbjct: 1 MKVFRDR-GAVSPSKTTTTDTPKSSFISDSSEPGSTDTKVSKPSLWSSFFASAFSVFDIY 59
Query: 61 SESSA-SEKKAVHNK-SNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGV 118
+SS+ S +A H + SNGWT++VK++V G+MRRI ERVLG S+TGIS++TSDIWLLG
Sbjct: 60 RDSSSTSHNEAPHIRHSNGWTSSVKKIVAGGTMRRIQERVLGTSKTGISNTTSDIWLLGA 119
Query: 119 CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 178
+KI+QD++ G+A N LA F++DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQ
Sbjct: 120 RYKISQDDSSGNADATNALAAFHRDFSSRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQ 179
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 238
MLVAQALLFHRLGR WRKP+ KP DR+YVEILHLFGDSE S FSIHNLLQAGKAYGLAAG
Sbjct: 180 MLVAQALLFHRLGRSWRKPVDKPLDRDYVEILHLFGDSEASAFSIHNLLQAGKAYGLAAG 239
Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
SWVGPYAMCRSWE+LAR +R ET L Q+LPMA+YVVSG EDGERGGAPV+ I+DA+RHC
Sbjct: 240 SWVGPYAMCRSWESLARSKREETNLEYQTLPMAVYVVSGCEDGERGGAPVLSIEDAARHC 299
Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
S FSKG+ DWTPILLLVPLVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGV
Sbjct: 300 SEFSKGREDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGV 359
Query: 359 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
Q+E+A YLDPH+VQPV+N +DD+EA+TS+YH DV+RHI LD IDPSLAIGFYCRDKDDF
Sbjct: 360 QDENAFYLDPHEVQPVVNFSRDDVEANTSSYHCDVVRHIPLDLIDPSLAIGFYCRDKDDF 419
Query: 419 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNA 478
DDFC+ ASKLA+ESNGAPLFTV ++K + ++ V +DD LGVM+MNDA G
Sbjct: 420 DDFCSLASKLADESNGAPLFTVANSYKSSKH------DSSEVRDDDPLGVMTMNDAEGCL 473
Query: 479 HEDDWQLL 486
+EDDWQLL
Sbjct: 474 NEDDWQLL 481
>gi|449442361|ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
gi|449512710|ref|XP_004164121.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
Length = 483
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/483 (68%), Positives = 379/483 (78%), Gaps = 3/483 (0%)
Query: 5 REKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESS 64
R K S C + D +R+ SV ELGS SSK S S F+S FS+FE + +SS
Sbjct: 3 RGKDLKSTCSPEPAADAIDRTHRSVYPELGSKNHISSKASSWSGFFSSNFSIFEHHKDSS 62
Query: 65 ASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQ 124
+EKK H + N W A V++++T+GSMRRI ER+LG R+G+ SS DIWLLGVCHKI+Q
Sbjct: 63 VTEKKVFHPRHNVW-ATVRKVMTSGSMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKISQ 121
Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
D DAA + G+A + QDFSSRIL++YRKGF I DSK TSDV WGCMLRSSQMLVAQA
Sbjct: 122 DHPPDDAASSPGVAGYEQDFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQA 181
Query: 185 LLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPY 244
LLFHRLGR WRKP QKP D+EYVEILHLFGDSETS FSIHNLLQAG+AY LAAGSWVGPY
Sbjct: 182 LLFHRLGRSWRKPSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGPY 241
Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 304
AMCRSWE L R +R L Q LPMAIY+VSGDEDGERGGAPV+ IDDASRHC FSKG
Sbjct: 242 AMCRSWETLVRSKRETPILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSKG 301
Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
Q DW+PILLLVPLVLGLEK+NPRYIP+LR TFTFPQSLGI+GGKPGASTYIVGVQ+E+A
Sbjct: 302 QHDWSPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENAF 361
Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
YLDPH+VQ V+NI KDDLEADTS+YH +VIRHI L+SIDPSLAIGFYCRDKDDFD+FC R
Sbjct: 362 YLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCHR 421
Query: 425 ASKLAEESNGAPLFTVTQTHK-KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDW 483
ASKLAEES+GAPLFTV +TH P S L + + EDD GV+ M + +HEDDW
Sbjct: 422 ASKLAEESDGAPLFTVAETHSTNPGRQSSALNDHSRLVEDDGDGVVHMPNE-EESHEDDW 480
Query: 484 QLL 486
Q L
Sbjct: 481 QFL 483
>gi|356531828|ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
Length = 486
Score = 633 bits (1632), Expect = e-179, Method: Compositional matrix adjust.
Identities = 337/489 (68%), Positives = 386/489 (78%), Gaps = 8/489 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
+KG E+ +SKC SKS+ +T + + V S+ GSS SK K SL S++F S FSV ETY
Sbjct: 3 LKGLCERIVSSKCSSKSSTETVDNTQVPVYSKAGSSNSKFPKASLWSNIFTSGFSVVETY 62
Query: 61 SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
SESSASEKKAVH++S+GW AAV+++VT GSMRR ERVLG SRT ISSS DIWLLGVCH
Sbjct: 63 SESSASEKKAVHSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCH 122
Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
KI+Q E+ G +NGLA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQML
Sbjct: 123 KISQQESSGGVDNSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQML 182
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
VAQALLFH+LGR WRKP+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSW
Sbjct: 183 VAQALLFHKLGRSWRKPIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSW 242
Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
VGPYAMCR+WE LA R + LG LPMAIYVVSGDEDGERGGAPVVCI+DAS+ C
Sbjct: 243 VGPYAMCRTWEVLA---RKKNDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCFE 299
Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
FS G A WTP+LLLVPLVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+G Q
Sbjct: 300 FSSGLAAWTPLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGAQN 359
Query: 361 ESAIYLDPHDVQPVINIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
E A YLDPHDVQ V+NI D E TS+YH +++RHI LDSIDPSLAIGFYCRDKDDFD
Sbjct: 360 EKAFYLDPHDVQQVVNISGDTQEPTSTSSYHCNIMRHIPLDSIDPSLAIGFYCRDKDDFD 419
Query: 420 DFCARASKLAEESNGAPLFTVTQTH--KKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGN 477
DFC++ASKLAEESNGAPLFTVTQ+ K V +DV G+ G E+D G+ ND N
Sbjct: 420 DFCSQASKLAEESNGAPLFTVTQSRSFSKQVTSNDVSGDNTGFQEEDFPGMDRGNDTGTN 479
Query: 478 AHEDDWQLL 486
EDDWQLL
Sbjct: 480 --EDDWQLL 486
>gi|356568569|ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
Length = 485
Score = 632 bits (1629), Expect = e-178, Method: Compositional matrix adjust.
Identities = 339/489 (69%), Positives = 389/489 (79%), Gaps = 9/489 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
+KG E+ +SKC SKS+ +T + + V S+ GSS+ K K SL SS+F S FSV ETY
Sbjct: 3 LKGLCERIVSSKCSSKSSTETVDNTQVPVYSKAGSSDCKFPKASLWSSIFTSGFSVVETY 62
Query: 61 SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
SESSASEKKAV ++S+GW AAV+++VT GSMRR ERVLG SRT ISSS DIWLLGVCH
Sbjct: 63 SESSASEKKAVPSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCH 122
Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
KI+Q E+ G +NGLA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQML
Sbjct: 123 KISQQESTGGVDTSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQML 182
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
VAQALLFH+LGR WRKP+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSW
Sbjct: 183 VAQALLFHKLGRSWRKPIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSW 242
Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
VGPYAMCR+WE LA R + LG LPMAIYVVSGDEDGERGGAPVVCI+DAS+ CS
Sbjct: 243 VGPYAMCRTWEVLA---RKKNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCSE 299
Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
FS G A WTP+LLLVPLVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+GVQ
Sbjct: 300 FSSGLAVWTPLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQN 359
Query: 361 ESAIYLDPHDVQPVINIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
E A YLDPHDVQ V+NI D E TS+YH +V+RHI LDSIDPSLAIGFYCRDKDDFD
Sbjct: 360 EKAFYLDPHDVQQVVNISGDTQEPTGTSSYHCNVMRHIPLDSIDPSLAIGFYCRDKDDFD 419
Query: 420 DFCARASKLAEESNGAPLFTV--TQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGN 477
DFC++ASKLAEESNGAPLFTV +++ K V++ DV G+ G EDD G+ ND V N
Sbjct: 420 DFCSQASKLAEESNGAPLFTVAKSRSFSKQVSN-DVSGDNTGFQEDDFPGMDCGNDTVTN 478
Query: 478 AHEDDWQLL 486
EDDWQLL
Sbjct: 479 --EDDWQLL 485
>gi|357507987|ref|XP_003624282.1| Cysteine protease ATG4 [Medicago truncatula]
gi|147742964|sp|A2Q1V6.1|ATG4_MEDTR RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|124359485|gb|ABN05923.1| Peptidase C54 [Medicago truncatula]
gi|355499297|gb|AES80500.1| Cysteine protease ATG4 [Medicago truncatula]
Length = 487
Score = 610 bits (1573), Expect = e-172, Method: Compositional matrix adjust.
Identities = 320/488 (65%), Positives = 377/488 (77%), Gaps = 5/488 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
+K ++ A+KC SKS+ + + + S+ GSS+SK K SL S+ F S FSV ETY
Sbjct: 3 LKDLCDRIVAAKCSSKSSTEIVDNTQVPASSKAGSSDSKFPKASLWSTFFTSGFSVDETY 62
Query: 61 SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
SESS+SEKK VH++++GW AAV+++V+ GSMRR ERVLG RT +SSS DIWLLGVCH
Sbjct: 63 SESSSSEKKTVHSRNSGWAAAVRKVVSGGSMRRFQERVLGSCRTDVSSSDGDIWLLGVCH 122
Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
KI+Q E+ GD N A F QDF SRILI+YRKGFD I DSK TSDV WGCMLRSSQML
Sbjct: 123 KISQHESTGDVDIRNVFAAFEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQML 182
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
VAQALLFH+LGR WRK + KP D+EY++IL LFGDSE + FSIHNLLQAGK YGLA GSW
Sbjct: 183 VAQALLFHKLGRSWRKTVDKPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSW 242
Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
VGPYAMCR+WE LAR QR + G Q LPMAIYVVSGDEDGERGGAPVVCI+DA + C
Sbjct: 243 VGPYAMCRTWEVLARNQREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCIEDACKRCLE 302
Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
FS+G WTP+LLLVPLVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ
Sbjct: 303 FSRGLVPWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQN 362
Query: 361 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
+ A YLDPH+V+PV+NI D E +TS+YH ++ RH+ LDSIDPSLAIGFYCRDKDDFDD
Sbjct: 363 DKAFYLDPHEVKPVVNITGDTQEPNTSSYHCNISRHMPLDSIDPSLAIGFYCRDKDDFDD 422
Query: 421 FCARASKLAEESNGAPLFTVTQTHKKP--VNHSDVLGETGGVPEDDSLGVMSMNDAVGNA 478
FC+RA+KLAEESNGAPLFTV Q+ P V + V G+ EDDSL + +NDA
Sbjct: 423 FCSRATKLAEESNGAPLFTVAQSRSLPMQVTSNSVSGDDTRFEEDDSLSMNLVNDA---G 479
Query: 479 HEDDWQLL 486
+EDDWQ L
Sbjct: 480 NEDDWQFL 487
>gi|388514549|gb|AFK45336.1| unknown [Lotus japonicus]
Length = 489
Score = 585 bits (1507), Expect = e-164, Method: Compositional matrix adjust.
Identities = 311/489 (63%), Positives = 359/489 (73%), Gaps = 5/489 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
+K F ++ A+KC SKS+ +T + S S+ GSS+SK K SL SS F S FSV ETY
Sbjct: 3 LKAFCDRIVAAKCSSKSSTETVDNSQVPACSKAGSSDSKFPKASLWSSFFTSGFSVIETY 62
Query: 61 SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRI-HERVLGPSRTGISSSTSDIWLLGVC 119
S+S ASEKKAVH++++GW + I LG + + LGVC
Sbjct: 63 SKSPASEKKAVHSQNSGWGCCCEESCYCWLNEEIPRACTLGQAELTFQALMVIYGFLGVC 122
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
HK +Q E+ GD + A F QDFSS+IL++YRKGFD IGDSK TSDV WGCMLRSSQM
Sbjct: 123 HKFSQQESTGDVDNSTVFAAFEQDFSSKILLTYRKGFDAIGDSKYTSDVNWGCMLRSSQM 182
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
LVAQALLFH+LGR WRK KP D+EY++IL FGDSE S FSIHNLLQAGK YGLA GS
Sbjct: 183 LVAQALLFHKLGRMWRKTTDKPLDKEYLDILQHFGDSEASSFSIHNLLQAGKGYGLAVGS 242
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
WVGPYAMCRSWE LAR QR G Q LPMA+YVVSGDEDGERGGAPVVCI+DASR CS
Sbjct: 243 WVGPYAMCRSWEVLARNQRETNDHGEQPLPMALYVVSGDEDGERGGAPVVCIEDASRRCS 302
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
FS+G A WTP+LLLVPLVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ
Sbjct: 303 EFSRGLAAWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQ 362
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
E A YLDPHDVQPV++I D + +TS+YH +++R + LDSIDPSLAIGFYCRDKDDFD
Sbjct: 363 NEKAFYLDPHDVQPVVHINGDAQDPNTSSYHCNIVRQMPLDSIDPSLAIGFYCRDKDDFD 422
Query: 420 DFCARASKLAEESNGAPLFTVTQTHKKPVNHS--DVLGETGGVPEDDSLGVMSMNDAVGN 477
DFC+RASKLAEESNGAPLFTV Q P + DV G+ G EDDS GV +NDA N
Sbjct: 423 DFCSRASKLAEESNGAPLFTVAQFRSFPFQDAGYDVSGDNTGFQEDDSHGVDLLNDAGTN 482
Query: 478 AHEDDWQLL 486
EDDWQLL
Sbjct: 483 --EDDWQLL 489
>gi|30689628|ref|NP_850412.1| cysteine protease ATG4a [Arabidopsis thaliana]
gi|75160546|sp|Q8S929.1|ATG4A_ARATH RecName: Full=Cysteine protease ATG4a; AltName:
Full=Autophagy-related protein 4 homolog a;
Short=AtAPG4a; Short=Protein autophagy 4a
gi|19912143|dbj|BAB88383.1| autophagy 4a [Arabidopsis thaliana]
gi|110742303|dbj|BAE99076.1| hypothetical protein [Arabidopsis thaliana]
gi|330255286|gb|AEC10380.1| cysteine protease ATG4a [Arabidopsis thaliana]
Length = 467
Score = 561 bits (1447), Expect = e-157, Method: Compositional matrix adjust.
Identities = 275/453 (60%), Positives = 350/453 (77%), Gaps = 6/453 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MK ++ +C S S DT ++S + S+ G S++KS K +L S++F S+ SV + Y
Sbjct: 1 MKALCDRFVPQQCSSSSKSDTHDKS--PLVSDSGPSDNKS-KFTLWSNVFTSSSSVSQPY 57
Query: 61 SESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
ESS S K V NGWTA VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC
Sbjct: 58 RESSTSGHKQVCTTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVC 117
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
+KI+ DE G+ LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQM
Sbjct: 118 YKISADENSGETDTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQM 177
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
L AQALLFHRLGR W K + P ++EY+E L FGDSE S FSIHNL+ AG +YGLAAGS
Sbjct: 178 LFAQALLFHRLGRAWTKKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGS 236
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
WVGPYA+CR+WE+LA +R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C
Sbjct: 237 WVGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCL 296
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
FSKGQ++WTPI+LLVPLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQ
Sbjct: 297 EFSKGQSEWTPIILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQ 356
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
E+ YLDPH+VQ V+ + K+ + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFD
Sbjct: 357 EDKGFYLDPHEVQQVVTVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFD 416
Query: 420 DFCARASKLAEESNGAPLFTVTQTHKKPVNHSD 452
DFC RA KLAEESNGAPLFTVTQTH +N S+
Sbjct: 417 DFCLRALKLAEESNGAPLFTVTQTHTA-INQSN 448
>gi|297828133|ref|XP_002881949.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
gi|297327788|gb|EFH58208.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
Length = 467
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 278/445 (62%), Positives = 351/445 (78%), Gaps = 5/445 (1%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MK ++ +C S S DT ++S V S+ G S++KS K +L S++F S+ SV + Y
Sbjct: 1 MKALCDRFVPQQCSSSSKSDTHDKS--PVVSDSGPSDNKS-KFTLWSNVFTSSSSVSQPY 57
Query: 61 SESSASEKKAVHNKSNGWTAAVKRLVTA-GSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
ESS S K V NGWTA VKR+ A G++RR ERVLGP+RTG+ S+TSD+WLLGVC
Sbjct: 58 RESSTSGHKQVCTTRNGWTAFVKRVSMATGAIRRFQERVLGPNRTGLPSTTSDVWLLGVC 117
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
+KI++DEA G+ LA F QDFSS+IL++YR+GF+P D+ TSDV WGCM+RSSQM
Sbjct: 118 YKISEDEASGETNTGCVLAAFQQDFSSKILMTYRRGFEPFRDTTYTSDVNWGCMIRSSQM 177
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
L AQALLFHRLGR W K + P ++EY+E L FGDSE+S FSIHNL+ AG +YGLAAGS
Sbjct: 178 LFAQALLFHRLGRSWTKKSELP-EQEYLETLEPFGDSESSAFSIHNLIIAGSSYGLAAGS 236
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
WVGPYA+CR+WE+LA +R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C
Sbjct: 237 WVGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCL 296
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
FSKGQ++WTPILLLVPLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQ
Sbjct: 297 EFSKGQSEWTPILLLVPLVLGLDSVNPRYIPSLIATFTFPQSVGILGGKPGASTYIVGVQ 356
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
E+ YLDPH+VQ V+ + K+ + DTS+YH +VIR++ L+S+DPSLA+GFYCRDKDDFD
Sbjct: 357 EDKGFYLDPHEVQQVVTVNKETPDVDTSSYHCNVIRYVPLESLDPSLALGFYCRDKDDFD 416
Query: 420 DFCARASKLAEESNGAPLFTVTQTH 444
DFC RASKLAE+SNGAPLFT+TQTH
Sbjct: 417 DFCLRASKLAEDSNGAPLFTITQTH 441
>gi|42571227|ref|NP_973687.1| cysteine protease ATG4a [Arabidopsis thaliana]
gi|330255287|gb|AEC10381.1| cysteine protease ATG4a [Arabidopsis thaliana]
Length = 422
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 254/392 (64%), Positives = 315/392 (80%), Gaps = 3/392 (0%)
Query: 62 ESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
ESS S K V NGWTA VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC+
Sbjct: 14 ESSTSGHKQVCTTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCY 73
Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
KI+ DE G+ LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQML
Sbjct: 74 KISADENSGETDTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQML 133
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
AQALLFHRLGR W K + P ++EY+E L FGDSE S FSIHNL+ AG +YGLAAGSW
Sbjct: 134 FAQALLFHRLGRAWTKKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSW 192
Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
VGPYA+CR+WE+LA +R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C
Sbjct: 193 VGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLE 252
Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
FSKGQ++WTPI+LLVPLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQE
Sbjct: 253 FSKGQSEWTPIILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQE 312
Query: 361 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
+ YLDPH+VQ V+ + K+ + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFDD
Sbjct: 313 DKGFYLDPHEVQQVVTVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDD 372
Query: 421 FCARASKLAEESNGAPLFTVTQTHKKPVNHSD 452
FC RA KLAEESNGAPLFTVTQTH +N S+
Sbjct: 373 FCLRALKLAEESNGAPLFTVTQTHTA-INQSN 403
>gi|3212867|gb|AAC23418.1| unknown protein [Arabidopsis thaliana]
Length = 451
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 263/453 (58%), Positives = 337/453 (74%), Gaps = 22/453 (4%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MK ++ +C S S DT ++S + S+ G S++KS K +L S++F S+ SV + Y
Sbjct: 1 MKALCDRFVPQQCSSSSKSDTHDKS--PLVSDSGPSDNKS-KFTLWSNVFTSSSSVSQPY 57
Query: 61 SESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
ESS S K V NGWTA VKR+ + +G++RR ERVLGP+RTG+ S+TSD+WLLGVC
Sbjct: 58 RESSTSGHKQVCTTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVC 117
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
+KI+ DE G+ LA DFSS+IL++YRKGF+P D+ TSDV WGCM+RSSQM
Sbjct: 118 YKISADENSGETDTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQM 177
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
L AQ ++EY+E L FGDSE S FSIHNL+ AG +YGLAAGS
Sbjct: 178 LFAQLP-----------------EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGS 220
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
WVGPYA+CR+WE+LA +R +T Q+LPMA+++VSG EDGERGGAP++CI+DA++ C
Sbjct: 221 WVGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCL 280
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
FSKGQ++WTPI+LLVPLVLGL+ VNPRYIP+L TFTFPQS+GI+GGKPGASTYIVGVQ
Sbjct: 281 EFSKGQSEWTPIILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQ 340
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
E+ YLDPH+VQ V+ + K+ + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFD
Sbjct: 341 EDKGFYLDPHEVQQVVTVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFD 400
Query: 420 DFCARASKLAEESNGAPLFTVTQTHKKPVNHSD 452
DFC RA KLAEESNGAPLFTVTQTH +N S+
Sbjct: 401 DFCLRALKLAEESNGAPLFTVTQTHTA-INQSN 432
>gi|297820846|ref|XP_002878306.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
gi|297324144|gb|EFH54565.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
Length = 476
Score = 526 bits (1356), Expect = e-147, Method: Compositional matrix adjust.
Identities = 276/487 (56%), Positives = 351/487 (72%), Gaps = 12/487 (2%)
Query: 1 MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
MK ++ SKC S T + + S + S S +L S + S+ V +
Sbjct: 1 MKAICDRFVPSKCSSSCTSEKRDIS-PTSLVSDSPSSDDKSNLTLCSDVVESSSPVSQPC 59
Query: 61 SESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
E+S SE K V N WT +K + +G++RR +RVLGPSRTGISSSTS+IWLLGVC
Sbjct: 60 REASTSEHKQVCTTHNSWTVILKTASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVC 119
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
+KI++ E+ +A LA F QDFSS IL++YR+GF+PIGD+ TSDV WGCMLRS QM
Sbjct: 120 YKISEAESFEEADAGRVLAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQM 179
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
L AQALLF RLGR WRK +P + +Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGS
Sbjct: 180 LFAQALLFQRLGRSWRKKDSEPPNEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGS 239
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
WVGPYA+CRSWE+LAR + ET + +S MA+++VSG EDGERGGAP++CI+D ++ C
Sbjct: 240 WVGPYAVCRSWESLARKNKEETDVKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCL 299
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
FS+G +W PILLLVPLVLGL+KVNPRYIP+L TFTFPQSLGI+GGKPGASTYIVGVQ
Sbjct: 300 EFSEGDTEWPPILLLVPLVLGLDKVNPRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQ 359
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
E+ YLDPHDVQ V+ + K++ + DTS+YH + +R++ L+S+DPSLA+GFYC+DKDDFD
Sbjct: 360 EDKGFYLDPHDVQQVVTVKKENQDVDTSSYHCNTLRYVPLESLDPSLALGFYCQDKDDFD 419
Query: 420 DFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAH 479
DFC RA+KLA +SNGAPLFTVTQ+H+ G+ E S V+S + G H
Sbjct: 420 DFCIRATKLAGDSNGAPLFTVTQSHRT---------NDCGIAETSSSTVIS-TEISGEEH 469
Query: 480 EDDWQLL 486
EDDWQLL
Sbjct: 470 EDDWQLL 476
>gi|115461386|ref|NP_001054293.1| Os04g0682000 [Oryza sativa Japonica Group]
gi|75143803|sp|Q7XPW8.1|ATG4B_ORYSJ RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|32488637|emb|CAE03430.1| OSJNBa0032F06.13 [Oryza sativa Japonica Group]
gi|82470053|gb|ABB77259.1| autophagy 4 [Oryza sativa Indica Group]
gi|113565864|dbj|BAF16207.1| Os04g0682000 [Oryza sativa Japonica Group]
gi|215697216|dbj|BAG91210.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 478
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 267/453 (58%), Positives = 345/453 (76%), Gaps = 17/453 (3%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +FNS F++FE + +SSA++ + S W+ ++R+V +GSM R
Sbjct: 38 KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF---- 93
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ ++SD+W LG C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD
Sbjct: 94 LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEA 210
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRD 390
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDV 453
+ LD IDPSLAIGFYCRDKDDFDDFC+RA++L +++NGAPLFTV Q+ K+ N DV
Sbjct: 391 LALDLIDPSLAIGFYCRDKDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDV 450
Query: 454 LGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
LG +G D ++ V + DA G E++WQ+L
Sbjct: 451 LGISG----DGNINVEDL-DASGETGEEEWQIL 478
>gi|147742963|sp|Q2XPP4.2|ATG4B_ORYSI RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B;
Short=Protein autophagy 4; AltName: Full=OsAtg4
Length = 478
Score = 518 bits (1334), Expect = e-144, Method: Compositional matrix adjust.
Identities = 266/453 (58%), Positives = 343/453 (75%), Gaps = 17/453 (3%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +FNS F++FE + +SSA++ + S W ++R+V +GSM R
Sbjct: 38 KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF---- 93
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ ++SD+W LG C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD
Sbjct: 94 LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEA 210
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRD 390
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDV 453
+ LD IDPSLAIGFYCRDKDDFDDFC+RA++L +++NGAPLFTV Q+ K+ N DV
Sbjct: 391 LALDLIDPSLAIGFYCRDKDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDV 450
Query: 454 LGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
LG +G D ++ V + DA G E++WQ+L
Sbjct: 451 LGISG----DGNINVEDL-DASGETGEEEWQIL 478
>gi|222629790|gb|EEE61922.1| hypothetical protein OsJ_16662 [Oryza sativa Japonica Group]
Length = 892
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 258/425 (60%), Positives = 330/425 (77%), Gaps = 12/425 (2%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +FNS F++FE + +SSA++ + S W+ ++R+V +GSM R
Sbjct: 38 KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF---- 93
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ ++SD+W LG C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD
Sbjct: 94 LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEA 210
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRD 390
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDV 453
+ LD IDPSLAIGFYCRDKDDFDDFC+RA++L +++NGAPLFTV Q+ K+ N DV
Sbjct: 391 LALDLIDPSLAIGFYCRDKDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDV 450
Query: 454 LGETG 458
LG +G
Sbjct: 451 LGISG 455
>gi|15232213|ref|NP_191554.1| cysteine protease ATG4b [Arabidopsis thaliana]
gi|75182325|sp|Q9M1Y0.1|ATG4B_ARATH RecName: Full=Cysteine protease ATG4b; AltName:
Full=Autophagy-related protein 4 homolog b;
Short=AtAPG4b; Short=Protein autophagy 4b
gi|7019689|emb|CAB75814.1| putative protein [Arabidopsis thaliana]
gi|19912145|dbj|BAB88384.1| autophagy 4b [Arabidopsis thaliana]
gi|110742150|dbj|BAE99003.1| hypothetical protein [Arabidopsis thaliana]
gi|332646468|gb|AEE79989.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 477
Score = 509 bits (1311), Expect = e-141, Method: Compositional matrix adjust.
Identities = 265/463 (57%), Positives = 342/463 (73%), Gaps = 12/463 (2%)
Query: 25 SLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKR 84
S S+ S+ SS++KS+ +L S + S+ V + E+S S V + WT +K
Sbjct: 26 SPTSLVSDSASSDNKSNL-TLCSDVVASSSPVSQLCREASTSGHNPVCTTHSSWTVILKT 84
Query: 85 L-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQD 143
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QD
Sbjct: 85 ASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQD 144
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
FSS IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D
Sbjct: 145 FSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPAD 204
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
+Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 205 EKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDD 264
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++
Sbjct: 265 KHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDR 324
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
VNPRYIP+L TFTFPQSLGI+GGKPGASTYIVGVQE+ YLDPHDVQ V+ + K++ +
Sbjct: 325 VNPRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQD 384
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
DTS+YH + +R++ L+S+DPSLA+GFYC+ KDDFDDFC RA+KLA +SNGAPLFTVTQ+
Sbjct: 385 VDTSSYHCNTLRYVPLESLDPSLALGFYCQHKDDFDDFCIRATKLAGDSNGAPLFTVTQS 444
Query: 444 HKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
H++ N + + + G HEDDWQLL
Sbjct: 445 HRR--NDCGIAETSSSTETSTEIS--------GEEHEDDWQLL 477
>gi|218195841|gb|EEC78268.1| hypothetical protein OsI_17962 [Oryza sativa Indica Group]
Length = 912
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 257/425 (60%), Positives = 328/425 (77%), Gaps = 12/425 (2%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +FNS F++FE + +SSA++ + S W ++R+V +GSM R
Sbjct: 38 KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF---- 93
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ ++SD+W LG C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD
Sbjct: 94 LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEA 210
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D++EADTS+YH +R
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRD 390
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDV 453
+ LD IDPSLAIGFYCRDKDDFDDFC+RA++L +++NGAPLFTV Q+ K+ N DV
Sbjct: 391 LALDLIDPSLAIGFYCRDKDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDV 450
Query: 454 LGETG 458
LG +G
Sbjct: 451 LGISG 455
>gi|147742949|sp|A2XHJ5.1|ATG4A_ORYSI RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|125544166|gb|EAY90305.1| hypothetical protein OsI_11880 [Oryza sativa Indica Group]
Length = 473
Score = 504 bits (1297), Expect = e-140, Method: Compositional matrix adjust.
Identities = 261/452 (57%), Positives = 332/452 (73%), Gaps = 16/452 (3%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +F+S FS+FE + +SSA H+ S W+ ++R+ GSM R
Sbjct: 34 KQSKNSILSCVFSSPFSIFEAHQDSSAHRPLKPHSGSYAWSRFLRRIACTGSMWRF---- 89
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ + ++SD+W LG C+K++ +E + +G A F +DFSSRI I+YRKGFD
Sbjct: 90 LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 146
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE
Sbjct: 147 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEA 206
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVS 276
FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R R E G + PMA+YVVS
Sbjct: 207 CAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVRTNREHHEAVDGNGNFPMALYVVS 266
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 267 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKLNPRYIPLLKETF 326
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
TFPQSLGI+GGKPG STY+ GVQ++ +YLDPH+VQ ++I D+LEADTS+YH +R
Sbjct: 327 TFPQSLGILGGKPGTSTYVAGVQDDRVLYLDPHEVQLAVDIAADNLEADTSSYHCSTVRD 386
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGE 456
+ LD IDPSLAIGFYCRDKDDFDDFC+RAS+L +++NGAPLFTV Q+ + +
Sbjct: 387 LALDLIDPSLAIGFYCRDKDDFDDFCSRASELVDKANGAPLFTVMQSVQPSKQMYNEESS 446
Query: 457 TGGVPEDDSLGVMSMN--DAVGNAHEDDWQLL 486
+G D + ++++ D G E++WQ+L
Sbjct: 447 SG-----DGMDIINVEGLDGSGETGEEEWQIL 473
>gi|75138024|sp|Q75KP8.1|ATG4A_ORYSJ RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|40539014|gb|AAR87271.1| putative autophagy protein (with alternative splicing) [Oryza
sativa Japonica Group]
gi|108708571|gb|ABF96366.1| Peptidase family C54 containing protein, expressed [Oryza sativa
Japonica Group]
gi|125586519|gb|EAZ27183.1| hypothetical protein OsJ_11120 [Oryza sativa Japonica Group]
gi|215769128|dbj|BAH01357.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 474
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 263/453 (58%), Positives = 331/453 (73%), Gaps = 18/453 (3%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K K S+LS +F+S FS+FE + +SSA+ H+ S W+ ++R+ GSM R
Sbjct: 35 KQLKNSILSCVFSSPFSIFEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF---- 90
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ + ++SD+W LG C+K++ +E + +G A F +DFSSRI I+YRKGFD
Sbjct: 91 LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 147
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE
Sbjct: 148 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEA 207
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVS 276
FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R E G + PMA+YVVS
Sbjct: 208 CAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVS 267
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T
Sbjct: 268 GDEDGERGGAPVVCIDVAAQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETL 327
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D+LEA TS+YH +R
Sbjct: 328 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYHCSTVRD 387
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDV 453
+ LD IDPSLAIGFYCRDKDDFDDFC+RAS+L +++NGAPLFTV Q+ K+ N
Sbjct: 388 LALDLIDPSLAIGFYCRDKDDFDDFCSRASELVDKANGAPLFTVVQSVQPSKQMYNEESS 447
Query: 454 LGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
G+ G+ DS+ V + D G E++WQ+L
Sbjct: 448 SGD--GM---DSINVEGL-DGSGETGEEEWQIL 474
>gi|90399070|emb|CAJ86292.1| H0124B04.9 [Oryza sativa Indica Group]
Length = 1216
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 257/458 (56%), Positives = 328/458 (71%), Gaps = 45/458 (9%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K SK S+LS +FNS F++FE + +SSA++ + S W ++R+V +GSM R
Sbjct: 309 KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF---- 364
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ ++SD+W LG C+K++ +E+ D+ +G A F +DFSSRI I+YR+GFD
Sbjct: 365 LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 421
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE
Sbjct: 422 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEA 481
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R R + + G +S PMA+YVVS
Sbjct: 482 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 541
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 542 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 601
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ------------------------ 372
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ
Sbjct: 602 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMSATVIIWLFLQYPFYAWNPFCYG 661
Query: 373 ---------PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
++I D++EADTS+YH +R + LD IDPSLAIGFYCRDKDDFDDFC+
Sbjct: 662 SYSGVFSTSQAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRDKDDFDDFCS 721
Query: 424 RASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETG 458
RA++L +++NGAPLFTV Q+ K+ N DVLG +G
Sbjct: 722 RATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG 759
>gi|357166768|ref|XP_003580841.1| PREDICTED: cysteine protease ATG4B-like [Brachypodium distachyon]
Length = 493
Score = 487 bits (1254), Expect = e-135, Method: Compositional matrix adjust.
Identities = 267/465 (57%), Positives = 335/465 (72%), Gaps = 28/465 (6%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKA--VHNKSNG----------WTAAVKRL 85
SK KGS+LSS+F ++FE +SS+S A NKS G W+ A++R
Sbjct: 41 SKHCKGSILSSVF----TIFEAQQDSSSSVAAAAACENKSPGHSSGPSYGGAWSRALRRF 96
Query: 86 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 145
V GSM R LG ++ + D+W LG C+K + +E+ D ++G A F +DFS
Sbjct: 97 VGGGSMWRF----LGCAKV---LTNGDVWFLGKCYKFSSEESSSDLDTDSGHAAFLEDFS 149
Query: 146 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
SRI ++YRKGFD I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP + E
Sbjct: 150 SRIWVTYRKGFDAISDSKFTSDVNWGCMVRSSQMLVAQALMFHHLGRSWRKPSQKPCNPE 209
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGL 263
Y+ ILHLFGDSE FS+HNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R R E
Sbjct: 210 YIRILHLFGDSEVCAFSVHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQPEVSN 269
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
G +S PMA+YVVSGDEDGERGGAPVVCID A++ C F+K Q+ W+PILLLVPLVLGL+K
Sbjct: 270 GNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKDQSTWSPILLLVPLVLGLDK 329
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+NPRYIP L+ TFTFPQSLGI+GGKPG STYI G+Q++ A+YLDPHDVQ +NI D+L+
Sbjct: 330 INPRYIPLLKETFTFPQSLGILGGKPGTSTYIAGIQDDRALYLDPHDVQMAVNIASDNLD 389
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
ADTS+YH +R + LD +DPSLAIGFYCRDKDDFDDFC+RAS+L ++NGAPLFTV Q+
Sbjct: 390 ADTSSYHCSTVRDMALDLLDPSLAIGFYCRDKDDFDDFCSRASELVVKANGAPLFTVVQS 449
Query: 444 HK--KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
+ K + + D + G D++ + + D G A E++WQ+L
Sbjct: 450 IQPSKQMYNQDDGSGSSGDGMADNINMEDL-DGSGEAGEEEWQIL 493
>gi|221137006|ref|NP_001137489.1| autophagy-related 4b [Zea mays]
gi|194701156|gb|ACF84662.1| unknown [Zea mays]
gi|195657359|gb|ACG48147.1| cysteine protease ATG4B [Zea mays]
gi|216963250|gb|ACJ73914.1| autophagy-related 4b variant 1 [Zea mays]
gi|413920007|gb|AFW59939.1| autophagy 4b variant 1Cysteine protease ATG4B [Zea mays]
Length = 492
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 264/462 (57%), Positives = 333/462 (72%), Gaps = 27/462 (5%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
S+ K S+LS +F ++FE + S++ A K S W+ ++R V +GSM R
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
+ LG +R ++ D+W LG C++++ ++E G + ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
RKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP+D +Y+ +LHL
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKPYDPDYIRVLHL 217
Query: 213 FGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPM 270
FGDSE FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R R A+ G ++ PM
Sbjct: 218 FGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQADAVDGKENFPM 277
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
A+YVVSGDEDGERGGAPV CID A++ CS F+KGQ W+PILLL+PLVLGL+K+NPRYIP
Sbjct: 278 ALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLIPLVLGLDKINPRYIP 337
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ ++I D+LEADTS+YH
Sbjct: 338 LLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAVDIAPDNLEADTSSYH 397
Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKP 447
V+R + L+ IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGAPLFTV Q+ K+
Sbjct: 398 CSVVRDLALEQIDPSLAIGFYCRDKDDFDDFCSRASELAEKANGAPLFTVMQSVQPSKQM 457
Query: 448 VNHSDVLGETGG---VPEDDSLGVMSMNDAVGNAHEDDWQLL 486
D L G ED L DA G A E +WQ+L
Sbjct: 458 YKQDDGLCCCSGSSMANEDYDL------DASGEAGE-EWQIL 492
>gi|224994902|gb|ACN76570.1| cysteine proteinase [Triticum aestivum]
Length = 484
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 266/455 (58%), Positives = 329/455 (72%), Gaps = 23/455 (5%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVH-NKSNGWTAAVKRLVTAGSMRRIHER 97
K K S+LSS+ ++FE + S + H + S W+ ++R V GSM R
Sbjct: 46 KQCKASILSSVL----TIFEPDQDQSG--RSGGHASGSYAWSRVLRRFVGGGSMWRF--- 96
Query: 98 VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
LG G + + D+W LG C+K++ +E+ D+ G A F +DFSSR+ I+YRKGFD
Sbjct: 97 -LG---CGKALTAGDVWFLGKCYKLSSEESSSDSDSEGGHAAFLEDFSSRVWITYRKGFD 152
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP Q P D E+ ILHLFGDSE
Sbjct: 153 VISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPAQNPSDPEHTRILHLFGDSE 212
Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVV 275
FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R R + + +S PM +YVV
Sbjct: 213 VCAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQPEVINRNESFPMVLYVV 272
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
SGDEDGERGGAPVVCID A++ C F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ T
Sbjct: 273 SGDEDGERGGAPVVCIDVAAQLCYDFNKGQSAWSPILLLVPLVLGLDKINPRYIPLLKET 332
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
FTFPQSLGI+GGKPGASTYI GVQ++ A+YLDPH+VQ +NI D+LEADTS+YH +R
Sbjct: 333 FTFPQSLGILGGKPGASTYIAGVQDDRALYLDPHEVQLAVNIASDNLEADTSSYHCSTVR 392
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSD 452
+ LD IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGAPLFTV Q+ K+ N D
Sbjct: 393 DMPLDLIDPSLAIGFYCRDKDDFDDFCSRASELAEQANGAPLFTVVQSVQPSKQMYNQDD 452
Query: 453 VLGETG-GVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
G +G GV D++ + D G ED+WQ+L
Sbjct: 453 GSGCSGYGV--SDNIDTEDL-DGSGETGEDEWQIL 484
>gi|221137004|ref|NP_001137488.1| autophagy-related 4 [Zea mays]
gi|195620628|gb|ACG32144.1| cysteine protease ATG4B [Zea mays]
gi|216963236|gb|ACJ73912.1| autophagy-related 4 variant 1 [Zea mays]
gi|219886349|gb|ACL53549.1| unknown [Zea mays]
gi|414584729|tpg|DAA35300.1| TPA: autophagy 4a variant 2 isoform 1 [Zea mays]
gi|414584730|tpg|DAA35301.1| TPA: autophagy 4a variant 2 isoform 2 [Zea mays]
Length = 492
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 261/462 (56%), Positives = 337/462 (72%), Gaps = 25/462 (5%)
Query: 36 SESKSSKGSLLSSLFNSAFSVFETYSESSASEK-KAVHNKSN----GWTAAVKRLVTAGS 90
S S+ K S+LS +F+ F++FE + S+S A KS+ G + ++R V +GS
Sbjct: 45 SGSRQPKASILSGVFSPPFAIFEGQQQGSSSPACDARSTKSSSGSYGLSRILRRFVGSGS 104
Query: 91 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGL-AEFNQDFSSRIL 149
M R+ LG R ++SD+W LG C+K++ +E + ++ A F +DFSSRI
Sbjct: 105 MWRL----LGCGRV---LTSSDVWFLGKCYKVSPEEEESGDSESDSGHAAFLEDFSSRIW 157
Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
I+YRKGFD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +
Sbjct: 158 ITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGV 217
Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQS 267
LHLFGDSE FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L R R A+ G ++
Sbjct: 218 LHLFGDSEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKEN 277
Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 327
PMA+YVVSGDEDGERGGAPVVCID A++ CS F+KG + W+PILLLVPLVLGL+K+NPR
Sbjct: 278 FPMALYVVSGDEDGERGGAPVVCIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPR 337
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
YIP L+ TF FPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D+LEADTS
Sbjct: 338 YIPLLKETFMFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMTVDIALDNLEADTS 397
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---H 444
+YH V+R + L+ IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGAPLFTV Q+
Sbjct: 398 SYHCSVVRALALEQIDPSLAIGFYCRDKDDFDDFCSRASELAEKANGAPLFTVVQSIEPS 457
Query: 445 KKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
K+ D LG +G +D +D G+ ++WQ+L
Sbjct: 458 KQMYKQDDGLGCSGSSMAND-------DDLDGSGEAEEWQIL 492
>gi|224994904|gb|ACN76571.1| cysteine proteinase [Triticum aestivum]
Length = 486
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 264/459 (57%), Positives = 327/459 (71%), Gaps = 31/459 (6%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVH-NKSNGWTAAVKRLVTAGSMRRIHER 97
K K S+LSS+ ++FE + S + H + S W+ ++R V GSM R
Sbjct: 48 KQCKASILSSVL----TIFEPDQDQSG--RSGGHASGSYAWSRVLRRFVGGGSMWRF--- 98
Query: 98 VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
LG G + + +D+ LG C+K++ +E+ D+ G A F +DFSSRI I+YRKGFD
Sbjct: 99 -LG---CGKALTAADVQFLGKCYKLSSEESSSDSDSEGGHAAFLEDFSSRIWITYRKGFD 154
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP Q P + EY+ ILHLFGDSE
Sbjct: 155 AISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPAQNPSNPEYIRILHLFGDSE 214
Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVV 275
FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R R + + +S PMA+YVV
Sbjct: 215 ACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQPEVINRNESFPMALYVV 274
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
SGDEDGERGGAPVVCID A++ C F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T
Sbjct: 275 SGDEDGERGGAPVVCIDVAAQLCYDFNKDQSAWSPILLLVPLVLGLDKINPRYIPLLKET 334
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
FTFPQSLGI+GGKPGASTYI GVQ++ A+YLDPH+VQ +NI D+LEADTS+YH +R
Sbjct: 335 FTFPQSLGILGGKPGASTYIAGVQDDRALYLDPHEVQLAVNIASDNLEADTSSYHCSTVR 394
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSD 452
+ LD IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGAPLFTV Q+ K+ N D
Sbjct: 395 DMPLDLIDPSLAIGFYCRDKDDFDDFCSRASELAEQANGAPLFTVVQSVQPSKQMYNRDD 454
Query: 453 -----VLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
G +G + +D D G ED+WQ+L
Sbjct: 455 GSGCSGYGVSGNIDAEDL-------DGSGETGEDEWQIL 486
>gi|40539015|gb|AAR87272.1| putative autophagy protein (with alternative splicing) [Oryza
sativa Japonica Group]
gi|108708572|gb|ABF96367.1| Peptidase family C54 containing protein, expressed [Oryza sativa
Japonica Group]
Length = 505
Score = 479 bits (1232), Expect = e-132, Method: Compositional matrix adjust.
Identities = 263/484 (54%), Positives = 331/484 (68%), Gaps = 49/484 (10%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K K S+LS +F+S FS+FE + +SSA+ H+ S W+ ++R+ GSM R
Sbjct: 35 KQLKNSILSCVFSSPFSIFEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF---- 90
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ + ++SD+W LG C+K++ +E + +G A F +DFSSRI I+YRKGFD
Sbjct: 91 LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 147
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE
Sbjct: 148 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEA 207
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVS 276
FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R E G + PMA+YVVS
Sbjct: 208 CAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVS 267
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
GDEDGERGGAPVVCID A++ C F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T
Sbjct: 268 GDEDGERGGAPVVCIDVAAQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETL 327
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D+LEA TS+YH +R
Sbjct: 328 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYHCSTVRD 387
Query: 397 IHLDSIDPSLAIGFYCRDK-------------------------------DDFDDFCARA 425
+ LD IDPSLAIGFYCRDK DDFDDFC+RA
Sbjct: 388 LALDLIDPSLAIGFYCRDKGELLLPDKMLGHHLSSLQSWFSYLLCLSAYVDDFDDFCSRA 447
Query: 426 SKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 482
S+L +++NGAPLFTV Q+ K+ N G+ G+ DS+ V + D G E++
Sbjct: 448 SELVDKANGAPLFTVVQSVQPSKQMYNEESSSGD--GM---DSINVEGL-DGSGETGEEE 501
Query: 483 WQLL 486
WQ+L
Sbjct: 502 WQIL 505
>gi|194696780|gb|ACF82474.1| unknown [Zea mays]
gi|413920008|gb|AFW59940.1| autophagy 4b variant 3 [Zea mays]
Length = 462
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 252/457 (55%), Positives = 312/457 (68%), Gaps = 47/457 (10%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHER 97
S+ K S+LS +F ++FE + S++ A K + A R+ +RR+
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRI-----LRRVS-- 97
Query: 98 VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
++E G + ++G A F +DFSSRI I+YRKGFD
Sbjct: 98 -------------------------PEEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFD 132
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP+D +Y+ +LHLFGDSE
Sbjct: 133 AIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKPYDPDYIRVLHLFGDSE 192
Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVV 275
FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R R A+ G ++ PMA+YVV
Sbjct: 193 ACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVV 252
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
SGDEDGERGGAPV CID A++ CS F+KGQ W+PILLL+PLVLGL+K+NPRYIP L+ T
Sbjct: 253 SGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLIPLVLGLDKINPRYIPLLKET 312
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
F FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ ++I D+LEADTS+YH V+R
Sbjct: 313 FKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAVDIAPDNLEADTSSYHCSVVR 372
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSD 452
+ L+ IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGAPLFTV Q+ K+ D
Sbjct: 373 DLALEQIDPSLAIGFYCRDKDDFDDFCSRASELAEKANGAPLFTVMQSVQPSKQMYKQDD 432
Query: 453 VLGETGG---VPEDDSLGVMSMNDAVGNAHEDDWQLL 486
L G ED L DA G A E +WQ+L
Sbjct: 433 GLCCCSGSSMANEDYDL------DASGEAGE-EWQIL 462
>gi|315259988|gb|ADT92194.1| autophagy-related 4b [Zea mays]
Length = 595
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 228/387 (58%), Positives = 289/387 (74%), Gaps = 14/387 (3%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
S+ K S+LS +F ++FE + S++ A K S W+ ++R V +GSM R
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
+ LG +R ++ D+W LG C++++ ++E G + ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
RKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP+D +Y+ +LHL
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKPYDPDYIRVLHL 217
Query: 213 FGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPM 270
FGDSE FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R R A+ G ++ PM
Sbjct: 218 FGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQADAVDGKENFPM 277
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
A+YVVSGDEDGERGGAPV CID A++ CS F+KGQ W+PILLL+PLVLGL+K+NPRYIP
Sbjct: 278 ALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLIPLVLGLDKINPRYIP 337
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ ++I D+LEADTS+YH
Sbjct: 338 LLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAVDIAPDNLEADTSSYH 397
Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDD 417
V+R + L+ IDPSLAIGFYCRDK D
Sbjct: 398 CSVVRDLALEQIDPSLAIGFYCRDKGD 424
>gi|168010849|ref|XP_001758116.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690572|gb|EDQ76938.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 356
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 211/357 (59%), Positives = 281/357 (78%), Gaps = 4/357 (1%)
Query: 89 GSMRRIHERVLGPSRTGISSST-SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR 147
GSMRR+ E +LGP T ++S+ S+IW+LG+C+K++ D + EF DF+SR
Sbjct: 1 GSMRRLQELLLGPRFTAANASSGSEIWVLGLCYKVSADPN-NETLSVQAFEEFISDFTSR 59
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
I I+YRKGF+ +G SK+TSDVGWGCMLRS QML+AQAL+ H LGR WR+ +P + Y+
Sbjct: 60 IWITYRKGFECVGQSKLTSDVGWGCMLRSGQMLLAQALVCHYLGRSWRREPGQPCSQAYL 119
Query: 208 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GC 265
+IL FGDSE+ PFSIHNLL+AG +GLAAGSW+GPYA+CR+ EALAR R ++ G
Sbjct: 120 QILQTFGDSESCPFSIHNLLEAGHPFGLAAGSWLGPYALCRTLEALARADREQSQKKGGK 179
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 325
++LP A+YVVSG+ +GERGGAPV+C++D + CS + + +WTP+L+LVPLVLGL+KVN
Sbjct: 180 RALPFAVYVVSGEAEGERGGAPVLCVEDVATLCSKWREPTEEWTPLLVLVPLVLGLDKVN 239
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
PRY+P+LR TFTFPQSLGI GGKPGASTY++GVQ+E A+YLDPH+ Q V+ + ++LE D
Sbjct: 240 PRYLPSLRATFTFPQSLGIAGGKPGASTYLIGVQDEQAMYLDPHENQQVVPVTPENLELD 299
Query: 386 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
TS+YH +R + LD+IDPSLAIGFYCRD+ +FDD CAR+S+LA++SNGAP+FTV +
Sbjct: 300 TSSYHCSTVRRLPLDTIDPSLAIGFYCRDRAEFDDLCARSSELAKQSNGAPMFTVAE 356
>gi|216963242|gb|ACJ73913.1| autophagy-related 4a variant 2 [Zea mays]
Length = 429
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 226/388 (58%), Positives = 289/388 (74%), Gaps = 15/388 (3%)
Query: 36 SESKSSKGSLLSSLFNSAFSVFETYSESSAS-----EKKAVHNKSNGWTAAVKRLVTAGS 90
S S+ K S+LS +F+ F++FE + S+S + S G + ++R V +GS
Sbjct: 45 SGSRQPKASILSGVFSPPFAIFEGQQQGSSSPACDARSTKSSSGSYGLSRILRRFVGSGS 104
Query: 91 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGL-AEFNQDFSSRIL 149
M R+ LG R ++SD+W LG C+K++ +E + ++ A F +DFSSRI
Sbjct: 105 MWRL----LGCGRV---LTSSDVWFLGKCYKVSPEEEESGDSESDSGHAAFLEDFSSRIW 157
Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
I+YRKGFD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +
Sbjct: 158 ITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGV 217
Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQS 267
LHLFGDSE FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L R R A+ G ++
Sbjct: 218 LHLFGDSEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKEN 277
Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 327
PMA+YVVSGDEDGERGGAPVVCID A++ CS F+KG + W+PILLLVPLVLGL+K+NPR
Sbjct: 278 FPMALYVVSGDEDGERGGAPVVCIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPR 337
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
YIP L+ TF FPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ ++I D+LEADTS
Sbjct: 338 YIPLLKETFMFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMTVDIALDNLEADTS 397
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDK 415
+YH V+R + L+ IDPSLAIGFYCRDK
Sbjct: 398 SYHCSVVRALALEQIDPSLAIGFYCRDK 425
>gi|302783857|ref|XP_002973701.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
gi|300158739|gb|EFJ25361.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
Length = 358
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 206/376 (54%), Positives = 271/376 (72%), Gaps = 29/376 (7%)
Query: 78 WTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDA 131
WTAAV+R V G +RRI E ++G SS S IWLLG C+++ + DE ++
Sbjct: 1 WTAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKES 58
Query: 132 AGNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR 189
++ +A+F DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HR
Sbjct: 59 TSSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHR 118
Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
LGR WR+ ++P+ REY+EILH F DS + PFSIHN ++AG YGLAAGSW+GPYA+C
Sbjct: 119 LGRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALC 177
Query: 248 RSWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 306
+ EALAR G G Q +A+YVVSGD GERGGAPV+ D + C
Sbjct: 178 HAIEALAR----NDGRGRQGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC-------- 225
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YL
Sbjct: 226 ---PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYL 282
Query: 367 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
DPH+VQ V+++ + LE D+++YH V+R + LD+IDPSLA+GFYCR++++ DD CARAS
Sbjct: 283 DPHEVQKVVSVSGESLEFDSASYHCSVVRKMPLDAIDPSLALGFYCRNREELDDLCARAS 342
Query: 427 KLAEESNGAPLFTVTQ 442
+LA +SNGAP+FTV +
Sbjct: 343 ELASQSNGAPMFTVAE 358
>gi|302787965|ref|XP_002975752.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
gi|300156753|gb|EFJ23381.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
Length = 358
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 206/376 (54%), Positives = 271/376 (72%), Gaps = 29/376 (7%)
Query: 78 WTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDA 131
WTAAV+R V G +RRI E ++G SS S IWLLG C+++ + DE ++
Sbjct: 1 WTAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKES 58
Query: 132 AGNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR 189
++ +A+F DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HR
Sbjct: 59 TSSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHR 118
Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
LGR WR+ ++P+ REY+EILH F DS + PFSIHN ++AG YGLAAGSW+GPYA+C
Sbjct: 119 LGRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALC 177
Query: 248 RSWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 306
+ EALAR G G + +A+YVVSGD GERGGAPV+ D + C
Sbjct: 178 HAIEALAR----NDGRGREGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC-------- 225
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YL
Sbjct: 226 ---PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYL 282
Query: 367 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
DPH+VQ V+++ + LE D+++YH V+R + LD+IDPSLA+GFYCR+++D DD CARAS
Sbjct: 283 DPHEVQKVVSVSGESLEFDSASYHCSVVRKMLLDAIDPSLALGFYCRNREDLDDLCARAS 342
Query: 427 KLAEESNGAPLFTVTQ 442
+LA +SNGAP+FTV +
Sbjct: 343 ELASQSNGAPMFTVAE 358
>gi|168036750|ref|XP_001770869.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677928|gb|EDQ64393.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 346
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 197/339 (58%), Positives = 267/339 (78%), Gaps = 5/339 (1%)
Query: 107 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
SSS +IW+LG+C+K++ D A +A + EF DFSSRI I+YRKGF+ +G+SK+TS
Sbjct: 4 SSSGGEIWVLGICYKVSAD-ANDEAVSAHAFEEFLNDFSSRIWITYRKGFESLGESKLTS 62
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
DVGWGCMLRS Q+L+AQAL+ H LGR WR+ + +EY++IL FGDSE+ FSIHNL
Sbjct: 63 DVGWGCMLRSGQILLAQALVCHYLGRTWRRNACQECLQEYLQILQSFGDSESCSFSIHNL 122
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARC---QRAETGLGCQSLPMAIYVVSGDEDGER 283
L+AG+ +GLAAGSW+GPYA+CR+ EALA+ Q A+ G G ++LP A+YVVSG+ +G+R
Sbjct: 123 LEAGRPFGLAAGSWLGPYALCRTLEALAKADEDQNAKKG-GKRALPFAVYVVSGETEGDR 181
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
GGAPV C++DA+ CS + + +W+P+++LVPLVLGL+K+NPRY+P+LR TFT PQSLG
Sbjct: 182 GGAPVRCVEDAAVLCSKWGEATEEWSPLVVLVPLVLGLDKLNPRYLPSLRATFTLPQSLG 241
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
+ GGKPGAST+++GVQ + A+YLDPH+ Q V + ++LE DTS YH V+R + LDSID
Sbjct: 242 VAGGKPGASTHLIGVQGDQAMYLDPHENQQVFAVTPENLELDTSFYHCSVVRRLPLDSID 301
Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
PSLAIGFYCRD+ +FDD CAR+S+L ++ NGAP+FTV +
Sbjct: 302 PSLAIGFYCRDRAEFDDLCARSSELVKQYNGAPIFTVAE 340
>gi|79597805|ref|NP_850722.3| cysteine protease ATG4b [Arabidopsis thaliana]
gi|332646467|gb|AEE79988.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 360
Score = 318 bits (814), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 173/306 (56%), Positives = 226/306 (73%), Gaps = 2/306 (0%)
Query: 25 SLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKR 84
S S+ S+ SS++KS+ +L S + S+ V + E+S S V + WT +K
Sbjct: 26 SPTSLVSDSASSDNKSNL-TLCSDVVASSSPVSQLCREASTSGHNPVCTTHSSWTVILKT 84
Query: 85 L-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQD 143
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QD
Sbjct: 85 ASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQD 144
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
FSS IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D
Sbjct: 145 FSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPAD 204
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
+Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 205 EKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDD 264
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++
Sbjct: 265 KHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDR 324
Query: 324 VNPRYI 329
VNP +
Sbjct: 325 VNPSHF 330
>gi|186511209|ref|NP_001118859.1| cysteine protease ATG4b [Arabidopsis thaliana]
gi|62318602|dbj|BAD95023.1| hypothetical protein [Arabidopsis thaliana]
gi|332646469|gb|AEE79990.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 267
Score = 314 bits (805), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 156/244 (63%), Positives = 198/244 (81%)
Query: 86 VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 145
+ +G++RR +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+ +A LA F QDFS
Sbjct: 1 MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 60
Query: 146 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
S IL++YR+GF+PIGD+ TSDV WGCMLRS QML AQALLF RLGR WRK +P D +
Sbjct: 61 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 120
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR + ET
Sbjct: 121 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 180
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 325
+S MA+++VSG EDGERGGAP++CI+D ++ C FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 181 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 240
Query: 326 PRYI 329
PR++
Sbjct: 241 PRFV 244
>gi|457866467|dbj|BAM93578.1| autophagy related protein 4 [Vigna unguiculata]
Length = 219
Score = 288 bits (736), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 158/220 (71%), Positives = 178/220 (80%), Gaps = 4/220 (1%)
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 329
MAIYVVSGDEDGERGGAPVVCI+DA +HCS FS+GQA WTP+LLLVPLVLGL+KVNPRYI
Sbjct: 1 MAIYVVSGDEDGERGGAPVVCIEDAFKHCSEFSRGQAAWTPLLLLVPLVLGLDKVNPRYI 60
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TST 388
P L TF FPQSLGI+GGKPGASTYI+GVQ E A YLDPHDVQ V+NI D E + TS+
Sbjct: 61 PLLHSTFKFPQSLGIMGGKPGASTYIIGVQSEKAFYLDPHDVQTVVNISGDTQEPNSTSS 120
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH--KK 446
YH +V+RHI LDSIDPSLAIGFYCRDKDDFDDFC++ASKLAEESNGAPLFTV Q+ K
Sbjct: 121 YHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKLAEESNGAPLFTVAQSRSFSK 180
Query: 447 PVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
V+ +DV G+ G ED LG +D +EDDWQLL
Sbjct: 181 QVSGNDVSGDNTGFEEDAFLGT-DHDDNDAGTNEDDWQLL 219
>gi|413917967|gb|AFW57899.1| hypothetical protein ZEAMMB73_419246 [Zea mays]
Length = 290
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 143/209 (68%), Positives = 172/209 (82%), Gaps = 2/209 (0%)
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
S G GCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +L LFGDSE FSIHN
Sbjct: 14 SLTGKGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLRLFGDSEACAFSIHN 73
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGER 283
LLQA + YGLAAGSW+GPYAMCR+W+ L R R A+ G ++ PMA+YVVSGDEDGER
Sbjct: 74 LLQARRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGER 133
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
GGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLG
Sbjct: 134 GGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLG 193
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
I+GGKPG STYI GVQ++ A+YLDPH+VQ
Sbjct: 194 ILGGKPGTSTYIAGVQDDRALYLDPHEVQ 222
>gi|414869447|tpg|DAA48004.1| TPA: hypothetical protein ZEAMMB73_510335 [Zea mays]
gi|414869466|tpg|DAA48023.1| TPA: hypothetical protein ZEAMMB73_786179 [Zea mays]
Length = 472
Score = 277 bits (708), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 142/205 (69%), Positives = 168/205 (81%), Gaps = 2/205 (0%)
Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 215
FD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR RKP +KP++ +Y+ +LHLFGD
Sbjct: 34 FDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSCRKPPEKPYNPDYIGVLHLFGD 93
Query: 216 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIY 273
SE FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L R A+ G ++ PMA+Y
Sbjct: 94 SEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIHTNREQADAVDGKENFPMALY 153
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
VVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+
Sbjct: 154 VVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLK 213
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGV 358
TF FPQSL I+GGKPG STYI GV
Sbjct: 214 ETFMFPQSLCILGGKPGTSTYIAGV 238
>gi|353441084|gb|AEQ94126.1| putative cysteine protease [Elaeis guineensis]
Length = 169
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 113/165 (68%), Positives = 130/165 (78%)
Query: 96 ERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKG 155
+ +LG S T SSTSDIWLLG C+K++ +E+ G NG A F +DFSSRI I+YRKG
Sbjct: 2 QELLGTSSTDALSSTSDIWLLGKCYKLSPEESSGGTDHGNGSAAFLEDFSSRIWITYRKG 61
Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 215
FD IGDSK TSDV WGCM+RSSQMLVAQALLFH LGR WRKP QKP D +Y+EILHLFGD
Sbjct: 62 FDAIGDSKFTSDVRWGCMIRSSQMLVAQALLFHHLGRSWRKPSQKPHDSKYIEILHLFGD 121
Query: 216 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
SE FSIHNLL+AGKAYGLAA WVGPYAMCR+WE + R +R +
Sbjct: 122 SEACAFSIHNLLEAGKAYGLAAREWVGPYAMCRTWETITRAKREQ 166
>gi|384253649|gb|EIE27123.1| peptidase C54 [Coccomyxa subellipsoidea C-169]
Length = 362
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 139/364 (38%), Positives = 194/364 (53%), Gaps = 44/364 (12%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D SRI ++YR+GF PI S ITSDVGWGC LRS QML+AQAL++H +GR WR+ L+ +
Sbjct: 23 DLMSRIWMTYRRGFPPICGSGITSDVGWGCTLRSGQMLLAQALVYHLVGRQWRRKLEAAY 82
Query: 203 DREYVEILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
E ++L FGD E PFSIHN+ G+ +G+ AG W+GP +C + + +
Sbjct: 83 PEEVAQVLQWFGDQACEQRPFSIHNMCTTGQTHGVKAGDWLGPSGLCHTLADMVN-KVQP 141
Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG------QADWTPILLL 314
GL C+ + G GGAPV+C SR + F G + +
Sbjct: 142 GGLQCR-----VVATFG------GGAPVLC---TSRLATAFEGGADRSGGEVGSSGSEES 187
Query: 315 VPLVLGLE-----------KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
P GL K+NPRY L+ T+PQS+GIVGG+P +S Y +G+Q++
Sbjct: 188 GPAGQGLLLLIPLMLGLNGKINPRYCAQLQQLLTWPQSVGIVGGRPSSSLYFIGLQDQHV 247
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
+YLDPH+VQ V + AD TY +R + L +IDPSLAIGFYC DF+D C
Sbjct: 248 LYLDPHEVQEVASEA-----ADLDTYFCSSLRLMPLANIDPSLAIGFYCSSLSDFEDLCG 302
Query: 424 RASKLAEESNGAPLFT-VTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 482
R L E+ APL V + +P ++ + G+P D S G A+ D+
Sbjct: 303 RLRTLEAEAGCAPLVCMVDEDAGEPSWPAEEVLSDEGIPSDAD----SPAPPAGGANRDN 358
Query: 483 WQLL 486
W++L
Sbjct: 359 WEML 362
>gi|413941968|gb|AFW74617.1| hypothetical protein ZEAMMB73_836919 [Zea mays]
Length = 416
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 134/263 (50%), Positives = 166/263 (63%), Gaps = 56/263 (21%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
L F +DFSSRI I+YRKGFD I D K+TSDV WGCM+RSSQMLVAQAL+FH LGR WRK
Sbjct: 29 LQVFLEDFSSRIWITYRKGFDAISDFKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRK 88
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
P +K L++ +
Sbjct: 89 PPEK------------------------TLIRTNR------------------------- 99
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
++A+ G ++ PM +YVVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVP
Sbjct: 100 EQADAVDGKENFPMELYVVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVP 159
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI- 375
LVLGL+K+NPRYIP L+ TF FPQSLGI+G KPG STYI GVQ++ A+YLDPH+VQ V+
Sbjct: 160 LVLGLDKINPRYIPLLKETFMFPQSLGILGVKPGTSTYIAGVQDDRALYLDPHEVQMVLA 219
Query: 376 NIGKDDLEADTSTYHSDVIRHIH 398
NI + T +D I +IH
Sbjct: 220 NIKWPE------TLETDFIYNIH 236
>gi|145345840|ref|XP_001417407.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577634|gb|ABO95700.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 348
Score = 221 bits (562), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 125/329 (37%), Positives = 182/329 (55%), Gaps = 19/329 (5%)
Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 174
+LGV + DE + ++ + +D+ SR ++YR+GF+ +G +K +D GWGC L
Sbjct: 1 MLGVTYWSKDDECNAEKY-DDARRAWERDWGSRCWMTYRRGFEALGRTKWRTDAGWGCTL 59
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAY 233
RS+QM+VA AL H GR WR+ ++ D E V+ +L +F D ++PFSIH++ + A+
Sbjct: 60 RSAQMMVANALSIHTRGRHWRRQVKAKEDDESVDHVLSMFIDDASAPFSIHSVCETTTAW 119
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG-DEDGERGGAPVVCID 292
G G W P MCR++ AL G +A++VV G +ED GG P ID
Sbjct: 120 GAPPGRWFEPSVMCRAFSALIEAN------GDLRNQIAVHVVGGQNEDDSAGGVPT--ID 171
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
D G+A +LL VPLVLG+ +N RYI LR F QS+G++GG+P A
Sbjct: 172 DGELRAKSADVGKA----LLLFVPLVLGVGRNINTRYISQLRSIIAFKQSIGVIGGRPNA 227
Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
S Y+VG ++ YLDPH VQP + + D +Y+ + + +DP+LA+GFY
Sbjct: 228 SLYLVGHSDDVFFYLDPHTVQPANSFAE---AVDFDSYYCSTPLQMRGELLDPTLALGFY 284
Query: 412 CRDKDDFDDFCARASKLAEESNGAPLFTV 440
CRD DD D A LAE + AP+ V
Sbjct: 285 CRDGDDLDSLFASVKALAEANATAPVLDV 313
>gi|281340990|gb|EFB16574.1| hypothetical protein PANDA_012287 [Ailuropoda melanoleuca]
Length = 369
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 128/370 (34%), Positives = 187/370 (50%), Gaps = 40/370 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F PIG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 65
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 125
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 282
Q G G + G W GP + + + LA +A+++ + ED
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 177
Query: 283 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
R G P D+SRHC+ F G A W P++LL+PL LGL +N Y+
Sbjct: 178 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 237
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + AD S +
Sbjct: 238 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 297
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDFDD+C + +L+ P+F + + +
Sbjct: 298 CRHPPSRMSIGELDPSIAVGFFCKTEDDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLA 357
Query: 450 HSDVLGETGG 459
DVL + G
Sbjct: 358 CPDVLNVSLG 367
>gi|432853687|ref|XP_004067831.1| PREDICTED: cysteine protease ATG4B-like [Oryzias latipes]
Length = 390
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 120/333 (36%), Positives = 173/333 (51%), Gaps = 12/333 (3%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YRK F PIG + TSD GWGCMLR QM++A+AL+ LGR WR +
Sbjct: 45 DVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILAEALMCRHLGRDWRWARGRRQ 104
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
EYV IL+ F D + S +SIH + Q G G G W GP A+ +W L
Sbjct: 105 REEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRL 164
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 313
A + + + + D E G C++ A C++ + A W P++L
Sbjct: 165 AVHVAMDNTVIIEEIKRLCMPWLDIGDREEAGELNGCLEGA---CALVEEETALWKPLVL 221
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
L+PL LGL +N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP
Sbjct: 222 LIPLRLGLSDINEAYIDTLKQCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQP 281
Query: 374 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 433
+ +D D + + +H+ +DPS+A GF+CR +D+FDD+C R L+ +
Sbjct: 282 AVEPSEDGQVPDETYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDWCMRIRGLSCKRG 341
Query: 434 GAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
G P+F + + + D L T + D L
Sbjct: 342 GLPMFELVDSQPTHMVSVDALNLTPDFSDSDRL 374
>gi|301775535|ref|XP_002923195.1| PREDICTED: cysteine protease ATG4B-like [Ailuropoda melanoleuca]
Length = 405
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 127/365 (34%), Positives = 185/365 (50%), Gaps = 40/365 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F PIG + TS
Sbjct: 34 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 80
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 81 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 140
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 282
Q G G + G W GP + + + LA +A+++ + ED
Sbjct: 141 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 192
Query: 283 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
R G P D+SRHC+ F G A W P++LL+PL LGL +N Y+
Sbjct: 193 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 252
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + AD S +
Sbjct: 253 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 312
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDFDD+C + +L+ P+F + + +
Sbjct: 313 CRHPPSRMSIGELDPSIAVGFFCKTEDDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLA 372
Query: 450 HSDVL 454
DVL
Sbjct: 373 CPDVL 377
>gi|156396522|ref|XP_001637442.1| predicted protein [Nematostella vectensis]
gi|156224554|gb|EDO45379.1| predicted protein [Nematostella vectensis]
Length = 342
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 122/343 (35%), Positives = 178/343 (51%), Gaps = 32/343 (9%)
Query: 101 PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN----GLAEFNQDFSSRILISYRKGF 156
P +T + S IWLLG C+ E + + L EF++ F+S I ++YR+ F
Sbjct: 12 PLKTNFNED-SPIWLLGRCYHAKNYEYTSEQSKQQCQILSLEEFHRHFTSLIWLTYRRSF 70
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---KPLQKPFDREYVEILHLF 213
+ S +TSD GWGCMLRS QM++A L+FH L + WR + + + Y IL F
Sbjct: 71 VQLNGSNLTSDCGWGCMLRSGQMMLASGLIFHFLKKDWRISGRCHSREQEHYYRVILQFF 130
Query: 214 GDS---ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
GD E SPFS+H L+ G+ G AG W GP ++ E +++
Sbjct: 131 GDQDDEERSPFSLHRLVTLGQHTGKQAGDWYGPASVAHILE--------------KAMIS 176
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVN 325
A + + D + V ID+ R C+ Q D W P+++LVP+ LG E +N
Sbjct: 177 ATHPLLHDINIYVAQDCTVYIDEVKRVCTHCRTHQRDCSSGKWRPVIILVPMRLGGEALN 236
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
P YIP ++ FT Q +GI+GG+P S Y VG Q+E I+LDPH QPV++ ++
Sbjct: 237 PIYIPCVKSLFTLDQCIGIIGGRPKHSLYFVGFQDEKMIHLDPHYCQPVVDTTQEKFP-- 294
Query: 386 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
T ++H R +DPS IGFYC +DF+ FC AS++
Sbjct: 295 TESFHCPNPRKTSFKKMDPSCTIGFYCSSHEDFESFCQHASEV 337
>gi|355669955|gb|AER94692.1| ATG4 autophagy related 4-like protein B [Mustela putorius furo]
Length = 390
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 125/359 (34%), Positives = 179/359 (49%), Gaps = 26/359 (7%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 65
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQSDSYFNVLNAFIDRKDSYYSIHQI 125
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA + L V+ RG
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 184
Query: 287 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
P D+SRHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 185 PCAGATALPTDSSRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 244
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 245 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCRHPPSR 304
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
+ + +DPS+A+GF+C+ +DDFDD+C R +L+ P+F + + + DVL
Sbjct: 305 MGISELDPSIAVGFFCKTEDDFDDWCQRVRQLSLLGGALPMFELVEQQPSHLACPDVLN 363
>gi|410969807|ref|XP_003991383.1| PREDICTED: cysteine protease ATG4B [Felis catus]
Length = 445
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 124/358 (34%), Positives = 177/358 (49%), Gaps = 26/358 (7%)
Query: 109 STSDIWLLGVCHKIA--QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I+ +DE L D A SR+ +YRK F IG + TS
Sbjct: 74 TSEPVWILGRKYSISTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 120
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 121 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 180
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA + L V+ R G
Sbjct: 181 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMEDIRRLCRAGL 239
Query: 287 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
P D RHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 240 PCAGAAALPADPGRHCNGFPAGAEVSNRLAPWRPLVLLIPLRLGLTDINEAYVETLKHCF 299
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 300 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFADSCFIPDESFHCQHPPSR 359
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
+ + +DPS+A+GF+C+ ++DFDD+C R KL+ P+F + + + DVL
Sbjct: 360 MGVRELDPSIAVGFFCQTEEDFDDWCQRVRKLSLLGGALPMFELVEQQPSHLACPDVL 417
>gi|27883848|ref|NP_777363.1| cysteine protease ATG4B [Mus musculus]
gi|26324650|dbj|BAC26079.1| unnamed protein product [Mus musculus]
gi|26327423|dbj|BAC27455.1| unnamed protein product [Mus musculus]
gi|26344632|dbj|BAC35965.1| unnamed protein product [Mus musculus]
gi|27763983|emb|CAD43220.1| autophagin-1 [Mus musculus]
Length = 393
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 185/366 (50%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
A + C+ D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRANLPCVGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ ++DF+D+C + KL++ P+F + + +
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360
Query: 450 HSDVLG 455
DVL
Sbjct: 361 CQDVLN 366
>gi|443684303|gb|ELT88258.1| hypothetical protein CAPTEDRAFT_225251 [Capitella teleta]
Length = 410
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 121/356 (33%), Positives = 184/356 (51%), Gaps = 34/356 (9%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
+ S +W+LG + + D LAE +D SR+ ++YRKGFDPIG S TSD
Sbjct: 30 TESPVWILGKQYSVLYD-----------LAELKKDVKSRLWLTYRKGFDPIGGSGPTSDQ 78
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM++AQ+L+ LGR WR K +D +Y EIL +F D ++ +S+ +
Sbjct: 79 GWGCMLRCGQMMLAQSLICRHLGRDWRWTKDK-YDPKYFEILRMFQDKRSAKYSLQVIAS 137
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD------EDGE 282
G + G A G W GP + + L C E + + V+ D +
Sbjct: 138 MGTSEGKAIGEWFGPNTISQVLRKL--CVSDEWSNLVVHVALDNTVIIDDVFCLCKSSKK 195
Query: 283 RGGAPVVCIDDASRHCSVFS-----------KGQAD-WTPILLLVPLVLGLEKVNPRYIP 330
P+ + A +F+ G+ D W P+LL+VPL LGL ++NP YIP
Sbjct: 196 ESNEPIPGVHAACASALLFNGHDPTAEGHDPSGEDDSWRPLLLIVPLRLGLSEINPVYIP 255
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
L+ TF QS+GI+GGKP + + +G E+ +Y+DPH QP +++ + E+D S YH
Sbjct: 256 FLKTCLTFKQSVGIIGGKPNHAHWFIGFLEDELVYMDPHTTQPFVDVTQPG-ESDAS-YH 313
Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
+ + +DPS+A+GF+C+ + DF+D C K P+F + Q +
Sbjct: 314 CSYSCRMPVSYLDPSVAVGFFCQTEADFEDLCQCIRKYILHGQKTPMFELHQRRPR 369
>gi|73994337|ref|XP_851977.1| PREDICTED: cysteine protease ATG4B [Canis lupus familiaris]
Length = 394
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/358 (34%), Positives = 177/358 (49%), Gaps = 26/358 (7%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 129
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA + L V+ RG
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 188
Query: 287 PV----VCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
P D+SRHC+ F G A W P++LL+PL LGL +N Y+ TL+ F
Sbjct: 189 PCAGAAALPADSSRHCNGFPAGAEVTNRLAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 248
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 249 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFIPDESFHCQHPPSR 308
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
+ + +DPS+A+GF+C+ + DFDD+C + +L+ P+F + + + DVL
Sbjct: 309 MSIGELDPSIAVGFFCKTEGDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLACPDVL 366
>gi|440798079|gb|ELR19150.1| cysteine protease, putative [Acanthamoeba castellanii str. Neff]
Length = 434
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 127/333 (38%), Positives = 174/333 (52%), Gaps = 33/333 (9%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
A F F S + +YR F +G TSD+GWGCMLR+ QM++AQ L H LG WR+
Sbjct: 108 ASFLTHFRSVVWCTYRAAFPRLGSDSYTSDMGWGCMLRTGQMVLAQTLTRHLLGTEWRRQ 167
Query: 198 LQK--PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
+ P Y +++ F D PFS+H + AG YG G W GP M + E L +
Sbjct: 168 SDRSSPL---YAKMVQWFADDPKQPFSLHRIAHAGLKYGKNVGEWFGPSTMAQVLEELLK 224
Query: 256 CQRAETGLG---CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-WTPI 311
+ + +GL CQ +Y+ P+ DD +GQ W P+
Sbjct: 225 -EFSPSGLRAYVCQD--GCLYLDQLRRTATAAHWPLDEDDD---------EGQGKSWAPM 272
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
L+++PL LGL+++N Y P L+ TF PQS+GI GGKP AS Y VG Q++ YLDPH V
Sbjct: 273 LIMLPLRLGLDQLNEDYAPVLKETFRIPQSVGISGGKPRASLYFVGNQDDYVFYLDPHTV 332
Query: 372 QPV---INIGKDDLEADT-STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
QP +G D T+H + + IDPSL + FYCR+++DFDDFCARA +
Sbjct: 333 QPAPRFPEVGDVPASEDVYDTFHCSAPLRLPIRDIDPSLCLAFYCRNREDFDDFCARAIQ 392
Query: 428 LAEESNGAPLFTVTQ------THKKPVNHSDVL 454
L+E P+FTV + KP HS+ L
Sbjct: 393 LSE--GPMPIFTVAERMPDYLVRPKPPKHSEKL 423
>gi|20071131|gb|AAH27184.1| Autophagy-related 4B (yeast) [Mus musculus]
gi|26353914|dbj|BAC40587.1| unnamed protein product [Mus musculus]
gi|74188242|dbj|BAE25791.1| unnamed protein product [Mus musculus]
Length = 393
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ ++DF+D+C + KL++ P+F + + +
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360
Query: 450 HSDVLG 455
DVL
Sbjct: 361 CQDVLN 366
>gi|148707985|gb|EDL39932.1| autophagy-related 4B (yeast), isoform CRA_a [Mus musculus]
Length = 390
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 19 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 65
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 66 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 125
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 177
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 178 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 237
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 238 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 297
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ ++DF+D+C + KL++ P+F + + +
Sbjct: 298 CQHPPSRMGIGELDPSIAVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 357
Query: 450 HSDVLG 455
DVL
Sbjct: 358 CQDVLN 363
>gi|61211813|sp|Q8BGE6.2|ATG4B_MOUSE RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
cysteine endopeptidase; AltName: Full=Autophagin-1;
AltName: Full=Autophagy-related cysteine endopeptidase
1; AltName: Full=Autophagy-related protein 4 homolog B
Length = 393
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
A + C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ ++DF+D+C + KL++ P+F + + +
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360
Query: 450 HSDVLG 455
DVL
Sbjct: 361 CQDVLN 366
>gi|348513452|ref|XP_003444256.1| PREDICTED: cysteine protease ATG4B-like [Oreochromis niloticus]
Length = 391
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 118/345 (34%), Positives = 178/345 (51%), Gaps = 16/345 (4%)
Query: 135 NGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
N L E ++ D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LG
Sbjct: 34 NALTEKDEILSDVTSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVCRHLG 93
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
R WR + EY+ +L+ F D + S +SIH + Q G G G W GP + + +
Sbjct: 94 RDWRWAKGQKQRDEYISLLNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLK 153
Query: 252 ALARCQRAETGLGCQSLPMAIYVVS--------GDEDGERGGAPVV--CIDDASRHCSVF 301
LA + ++ + + D GE G + C++ A C++
Sbjct: 154 KLAVFDTWSKVVVHVAMDNTVVIEEIKRLCMPWLDACGELEGVGELNGCLEGA---CAMA 210
Query: 302 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
+ A W P++LL+PL LGL +N YI TL+ F PQSLG++GGKP ++ Y +G E
Sbjct: 211 EEETALWRPLVLLIPLRLGLSDINDAYIETLKQCFMLPQSLGVIGGKPNSAHYFIGYVGE 270
Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
IYLDPH QP + +D D + + +H+ +DPS+A GF+CR +D+FDD+
Sbjct: 271 ELIYLDPHTTQPAVEPSEDSQVPDETYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDW 330
Query: 422 CARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
C R +L+ P+F + + + D L T + D L
Sbjct: 331 CMRIRRLSCNRGTLPMFELVDSQPSHMVSVDTLNLTPDFSDSDRL 375
>gi|427787309|gb|JAA59106.1| Putative peptidase family c54 [Rhipicephalus pulchellus]
Length = 517
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 119/313 (38%), Positives = 172/313 (54%), Gaps = 25/313 (7%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---K 196
F +DFSSR+ +YR+ F PI + ITSD GWGCMLRSSQM++AQA++ H LGR WR
Sbjct: 181 FLEDFSSRLWFTYRREFPPIPGTDITSDCGWGCMLRSSQMMLAQAVVTHVLGRQWRYRRN 240
Query: 197 PLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EAL 253
+ D + +++ LFGD + SPFS+H L+Q G G AG W GP + EAL
Sbjct: 241 NQTEASDYVHRQVVRLFGDRTASASPFSLHKLVQMGHESGKQAGDWYGPSSAAYILKEAL 300
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS-VFSKGQADWTPIL 312
+ E L L + IYV + ++D C S G W ++
Sbjct: 301 EGACQTEQLL----LDLRIYVAQD---------CTIYLEDVRALCRGTRSNGAPLWRSVI 347
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
+LVP+ LG E++NP YIP ++ + P +G++GG+P S Y +G Q E IYLDPH VQ
Sbjct: 348 ILVPVRLGGEQLNPTYIPCVKGMLSHPNCIGVIGGRPRHSLYFLGWQGEKVIYLDPHYVQ 407
Query: 373 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA--- 429
+++G D D +YH R + +DPS +GFYC+ +D+F+ F +LA
Sbjct: 408 EAVDVGPQDFPLD--SYHCSWPRKMSFYKMDPSCTMGFYCKTEDEFEHFVKDVKQLAVPT 465
Query: 430 EESNGAPLFTVTQ 442
E + P+F V++
Sbjct: 466 ESRHEYPVFLVSE 478
>gi|149711769|ref|XP_001497815.1| PREDICTED: cysteine protease ATG4B [Equus caballus]
Length = 393
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 126/372 (33%), Positives = 181/372 (48%), Gaps = 52/372 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 269
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 188
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEK 323
A G A D+ RHC+ F G A W P++LL+PL LGL
Sbjct: 189 CA------------GAAAFPA--DSDRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTD 234
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFI 294
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
D S + + + +DPS+A+GF+C+ +DDF+D+C + + L+ P+F + +
Sbjct: 295 PDESFHCQHPPSRMSIGELDPSIAVGFFCKTEDDFNDWCQQVTMLSLLGGALPMFELVEQ 354
Query: 444 HKKPVNHSDVLG 455
+ DVL
Sbjct: 355 QPSHLACPDVLN 366
>gi|61211768|sp|Q6DG88.2|ATG4B_DANRE RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
Length = 394
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/338 (35%), Positives = 176/338 (52%), Gaps = 18/338 (5%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 45 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 308
A + + + + + D +RG P D C++ + A W
Sbjct: 165 AVHVAMDNTVVIEEIKR---LCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETALW 221
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281
Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
H QP ++ +D D S + +H+ +DPS+A GF+C+ +DDFDD+CA+ K+
Sbjct: 282 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 341
Query: 429 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
+ G P+F + + + +DVL T + D L
Sbjct: 342 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 378
>gi|148237097|ref|NP_001082821.1| cysteine protease ATG4B [Danio rerio]
gi|141795460|gb|AAI34887.1| Atg4b protein [Danio rerio]
Length = 394
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/338 (35%), Positives = 176/338 (52%), Gaps = 18/338 (5%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 45 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 308
A + + + + + D +RG P D C++ + A W
Sbjct: 165 AVHVAMDNTVVIEEIKR---LCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETALW 221
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281
Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
H QP ++ +D D S + +H+ +DPS+A GF+C+ +DDFDD+CA+ K+
Sbjct: 282 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 341
Query: 429 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
+ G P+F + + + +DVL T + D L
Sbjct: 342 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 378
>gi|50369556|gb|AAH76463.1| Atg4b protein, partial [Danio rerio]
Length = 393
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/338 (35%), Positives = 175/338 (51%), Gaps = 18/338 (5%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR W+ +
Sbjct: 44 DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 103
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
EYV IL+ F D + S +SIH + Q G G + G W GP A+ SW L
Sbjct: 104 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 163
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 308
A + + + + D +RG P D C++ + A W
Sbjct: 164 AVHVAMDNTVVIEEIKRLCMPWL---DFDRGACAVSEEPREMNGDLEGACALAEEETALW 220
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
P++LL+PL LGL +N YI L+ F PQSLG++GGKP ++ Y +G + IYLDP
Sbjct: 221 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 280
Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
H QP ++ +D D S + +H+ +DPS+A GF+C+ +DDFDD+CA+ K+
Sbjct: 281 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 340
Query: 429 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
+ G P+F + + + +DVL T + D L
Sbjct: 341 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 377
>gi|397483835|ref|XP_003813096.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pan paniscus]
Length = 405
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/385 (32%), Positives = 190/385 (49%), Gaps = 42/385 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 450 HSDVLGETGGVPEDDSLGVMSMNDA 474
DVL + G E + V S+ D+
Sbjct: 361 CPDVLNLSLG--ESCQVQVGSLGDS 383
>gi|298231123|ref|NP_001177212.1| cysteine protease ATG4B [Sus scrofa]
gi|296874484|gb|ADH81747.1| autophagy related 4-like protein B [Sus scrofa]
Length = 393
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/372 (34%), Positives = 179/372 (48%), Gaps = 52/372 (13%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQALL LGR WR + Y +LH F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALLCRHLGRGWRWTQWERQPDSYFSVLHAFMDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQ-SLP 269
Q G G + G W GP + + +W ALA E C+ SLP
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSALAVHVAMDNTVVMEEIRRLCRSSLP 188
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEK 323
R GA D+ RHC+ F W P++LL+PL LGL
Sbjct: 189 -------------RAGAAAFPA-DSDRHCNGFPAEAEVGPRPVPWRPLVLLIPLRLGLTD 234
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+N Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + L
Sbjct: 235 INAAYTETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVQVTDSCLI 294
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
D S + + + +DPS+A+GF+C+ ++DF+D+C + KL+ P+F + +
Sbjct: 295 PDESFHCQHPPHRMSIAELDPSIAVGFFCQTEEDFNDWCQQVRKLSLLGGALPMFELVEQ 354
Query: 444 HKKPVNHSDVLG 455
+ DVL
Sbjct: 355 QPSHLACPDVLN 366
>gi|194381088|dbj|BAG64112.1| unnamed protein product [Homo sapiens]
Length = 510
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 139 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 185
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 186 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 245
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 246 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 297
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 298 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 357
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 358 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 417
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 418 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 477
Query: 450 HSDVL 454
DVL
Sbjct: 478 CPDVL 482
>gi|119591684|gb|EAW71278.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_a
[Homo sapiens]
Length = 415
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 450 HSDVLGETGG 459
DVL + G
Sbjct: 361 CPDVLNLSLG 370
>gi|410036442|ref|XP_003950065.1| PREDICTED: cysteine protease ATG4B [Pan troglodytes]
Length = 521
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 127/385 (32%), Positives = 189/385 (49%), Gaps = 42/385 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + +
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476
Query: 450 HSDVLGETGGVPEDDSLGVMSMNDA 474
DVL + G E + V S+ D+
Sbjct: 477 CPDVLNLSLG--ESCQVQVGSLGDS 499
>gi|397483833|ref|XP_003813095.1| PREDICTED: cysteine protease ATG4B isoform 2 [Pan paniscus]
Length = 468
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 448
Query: 450 HSDVLGETGG 459
DVL + G
Sbjct: 449 CPDVLNLSLG 458
>gi|355565356|gb|EHH21845.1| hypothetical protein EGK_04999, partial [Macaca mulatta]
Length = 393
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 450 HSDVLG 455
DVL
Sbjct: 361 CPDVLN 366
>gi|30410798|ref|NP_847896.1| cysteine protease ATG4B isoform b [Homo sapiens]
Length = 380
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360
Query: 450 HSDVLGETGG 459
DVL + G
Sbjct: 361 CPDVLNLSLG 370
>gi|397483831|ref|XP_003813094.1| PREDICTED: cysteine protease ATG4B isoform 1 [Pan paniscus]
Length = 481
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 448
Query: 450 HSDVL 454
DVL
Sbjct: 449 CPDVL 453
>gi|402889930|ref|XP_003908250.1| PREDICTED: cysteine protease ATG4B [Papio anubis]
Length = 508
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 137 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 183
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 184 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 243
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 244 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 295
Query: 282 ERGGAPVVCID------DASRHCSVFSKG------QADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 296 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 355
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 356 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 415
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 416 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 475
Query: 450 HSDVL 454
DVL
Sbjct: 476 CPDVL 480
>gi|380808290|gb|AFE76020.1| cysteine protease ATG4B isoform a [Macaca mulatta]
gi|383416899|gb|AFH31663.1| cysteine protease ATG4B isoform a [Macaca mulatta]
gi|384941198|gb|AFI34204.1| cysteine protease ATG4B isoform a [Macaca mulatta]
Length = 393
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 450 HSDVLG 455
DVL
Sbjct: 361 CPDVLN 366
>gi|88192732|pdb|2D1I|A Chain A, Structure Of Human Atg4b
gi|88192733|pdb|2D1I|B Chain B, Structure Of Human Atg4b
Length = 398
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 27 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 73
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 74 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 133
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 134 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 185
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 186 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 245
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 246 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 305
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 306 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 365
Query: 450 HSDVL 454
DVL
Sbjct: 366 CPDVL 370
>gi|90077212|dbj|BAE88286.1| unnamed protein product [Macaca fascicularis]
Length = 393
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 450 HSDVLG 455
DVL
Sbjct: 361 CPDVLN 366
>gi|71891691|dbj|BAA76787.2| KIAA0943 protein [Homo sapiens]
Length = 396
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 363
Query: 450 HSDVLG 455
DVL
Sbjct: 364 CPDVLN 369
>gi|34531319|dbj|BAC86110.1| unnamed protein product [Homo sapiens]
Length = 468
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 328
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 448
Query: 450 HSDVLGETGG 459
DVL + G
Sbjct: 449 CPDVLNLSLG 458
>gi|354474222|ref|XP_003499330.1| PREDICTED: cysteine protease ATG4B-like [Cricetulus griseus]
Length = 479
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 181/365 (49%), Gaps = 40/365 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 108 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 154
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 155 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 214
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 215 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 266
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYI 329
A + C D+ RHC+ F G W P++LL+PL LGL +N Y+
Sbjct: 267 RLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAYV 326
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 327 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 386
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C ++DF+D+C + KL+ P+F + + +
Sbjct: 387 CQHPPCRMGIGELDPSIAVGFFCETEEDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLV 446
Query: 450 HSDVL 454
DVL
Sbjct: 447 CQDVL 451
>gi|5262636|emb|CAB45756.1| hypothetical protein [Homo sapiens]
gi|12653857|gb|AAH00719.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Homo sapiens]
gi|27763981|emb|CAD43219.1| autophagin-1 [Homo sapiens]
gi|117646318|emb|CAL38626.1| hypothetical protein [synthetic construct]
gi|119591687|gb|EAW71281.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_d
[Homo sapiens]
gi|123981932|gb|ABM82795.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [synthetic
construct]
gi|168273130|dbj|BAG10404.1| ATG4 autophagy related 4 homolog B [synthetic construct]
Length = 393
Score = 204 bits (520), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 450 HSDVLG 455
DVL
Sbjct: 361 CPDVLN 366
>gi|47132611|ref|NP_037457.3| cysteine protease ATG4B isoform a [Homo sapiens]
gi|296434400|sp|Q9Y4P1.2|ATG4B_HUMAN RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
cysteine endopeptidase; AltName: Full=Autophagin-1;
AltName: Full=Autophagy-related cysteine endopeptidase
1; AltName: Full=Autophagy-related protein 4 homolog B;
Short=hAPG4B
gi|62822370|gb|AAY14919.1| unknown [Homo sapiens]
Length = 393
Score = 204 bits (520), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360
Query: 450 HSDVLG 455
DVL
Sbjct: 361 CPDVLN 366
>gi|78101773|pdb|2CY7|A Chain A, The Crystal Structure Of Human Atg4b
Length = 396
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 363
Query: 450 HSDVLG 455
DVL
Sbjct: 364 CPDVLN 369
>gi|66773074|ref|NP_001019605.1| cysteine protease ATG4A [Danio rerio]
gi|66267494|gb|AAH95617.1| Zgc:111958 [Danio rerio]
Length = 375
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 109/337 (32%), Positives = 171/337 (50%), Gaps = 33/337 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG C+ + ++ E D SR+ +YRK F PIG + +SD GWGC
Sbjct: 26 VWILGACYNVKTKKS-----------ELLSDVRSRLWFTYRKKFSPIGGTGPSSDAGWGC 74
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR WR +K +EY IL F D + S +SIH + Q G
Sbjct: 75 MLRCGQMILAQALICSHLGRDWRWDPEKHQPKEYQRILDCFLDKKDSCYSIHQMAQMGVG 134
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +++YV + V I+
Sbjct: 135 EGKSVGEWYGPNTVAQVLKKLALFDDWNS--------LSVYVSMDN---------TVVIE 177
Query: 293 DASRHC-----SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 347
D + C + S+ DW P+LL++PL +G+ +NP YI L+ F PQS G++GG
Sbjct: 178 DIKKLCVRADLQLQSQQPLDWRPLLLVIPLRMGINSINPVYIQALKECFKMPQSCGVLGG 237
Query: 348 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 407
KP + Y +G ++ IYLDPH Q ++ D S + + + S+DPS+A
Sbjct: 238 KPNLAYYFIGFIDDELIYLDPHTTQQAVDTESGSAVDDQSFHCQRTPHRMKITSLDPSVA 297
Query: 408 IGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+GF+C+ ++DFD +C + + +F + + H
Sbjct: 298 LGFFCKSEEDFDSWCDLVQQELLKKRNLRMFELVEKH 334
>gi|332815902|ref|XP_001162556.2| PREDICTED: cysteine protease ATG4B isoform 1 [Pan troglodytes]
Length = 496
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 182/370 (49%), Gaps = 40/370 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + +
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476
Query: 450 HSDVLGETGG 459
DVL + G
Sbjct: 477 CPDVLNLSLG 486
>gi|410036440|ref|XP_003309622.2| PREDICTED: cysteine protease ATG4B isoform 5 [Pan troglodytes]
Length = 509
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 180/365 (49%), Gaps = 40/365 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + +
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476
Query: 450 HSDVL 454
DVL
Sbjct: 477 CPDVL 481
>gi|432107261|gb|ELK32675.1| Cysteine protease ATG4B [Myotis davidii]
Length = 394
Score = 204 bits (519), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 123/367 (33%), Positives = 179/367 (48%), Gaps = 42/367 (11%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQALL LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 269
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAIFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 189
Query: 270 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
A D +G G P + + + W P++LL+PL LGL +N Y
Sbjct: 190 CAEATAFPADSEGHCNGLPA---------GAEVTNRPSLWRPLVLLIPLRLGLTDINEAY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + L D S
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSFLIPDESF 300
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 448
+ + + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 HCQHPPSRMSIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHL 360
Query: 449 NHSDVLG 455
DVL
Sbjct: 361 ACPDVLN 367
>gi|410206608|gb|JAA00523.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410247746|gb|JAA11840.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410295834|gb|JAA26517.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410352839|gb|JAA43023.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
Length = 393
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 180/366 (49%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360
Query: 450 HSDVLG 455
DVL
Sbjct: 361 CPDVLN 366
>gi|328707620|ref|XP_001947296.2| PREDICTED: cysteine protease ATG4B-like isoform 1 [Acyrthosiphon
pisum]
gi|328707622|ref|XP_003243448.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Acyrthosiphon
pisum]
gi|328707624|ref|XP_003243449.1| PREDICTED: cysteine protease ATG4B-like isoform 3 [Acyrthosiphon
pisum]
gi|328707626|ref|XP_003243450.1| PREDICTED: cysteine protease ATG4B-like isoform 4 [Acyrthosiphon
pisum]
Length = 402
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 110/344 (31%), Positives = 178/344 (51%), Gaps = 38/344 (11%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + +W+LG + D L + D SR+ +YRKGF IG++ T
Sbjct: 40 IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
SD GWGCMLR QM++ QAL+F LGR WR K D +Y++IL +F D ++P+SIH
Sbjct: 89 SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ G ++G G W GP + + + LA L ++ V+ D
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLA---------TMDELSSLVFHVALDN------ 192
Query: 286 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
+ I++ + C+V + + W P++L++PL LG+ +NP Y+ +++ FTFPQSL
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKMCFTFPQSL 250
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVIRHIHL 399
G++GG+P + Y +G I+LDPH Q + + D+E + +YH I + +
Sbjct: 251 GVIGGRPNHALYFIGFVGNDVIFLDPHTTQQIGMLPNKDIETEHKIDHSYHCQQINRLPI 310
Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARAS---KLAEESNGAPLFTV 440
++DPSLA F C+ ++DF+ C +++S PL T+
Sbjct: 311 LNMDPSLAACFMCQTENDFNALCHELKVHLVQSDQSPSQPLITI 354
>gi|344239232|gb|EGV95335.1| Cysteine protease ATG4B [Cricetulus griseus]
Length = 394
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 181/366 (49%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 23 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 69
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 129
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 181
Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYI 329
A + C D+ RHC+ F G W P++LL+PL LGL +N Y+
Sbjct: 182 RLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAYV 241
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 242 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 301
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C ++DF+D+C + KL+ P+F + + +
Sbjct: 302 CQHPPCRMGIGELDPSIAVGFFCETEEDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLV 361
Query: 450 HSDVLG 455
DVL
Sbjct: 362 CQDVLN 367
>gi|14042685|dbj|BAB55353.1| unnamed protein product [Homo sapiens]
Length = 380
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 122/370 (32%), Positives = 183/370 (49%), Gaps = 40/370 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ + PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCYMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360
Query: 450 HSDVLGETGG 459
DVL + G
Sbjct: 361 CPDVLNLSLG 370
>gi|343961553|dbj|BAK62366.1| cysteine protease ATG4B [Pan troglodytes]
Length = 393
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 180/366 (49%), Gaps = 40/366 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + +
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGCALPMFELVEQQPSHLA 360
Query: 450 HSDVLG 455
DVL
Sbjct: 361 CPDVLN 366
>gi|344299096|ref|XP_003421224.1| PREDICTED: cysteine protease ATG4B [Loxodonta africana]
Length = 420
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/365 (32%), Positives = 183/365 (50%), Gaps = 40/365 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 49 TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 95
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQALL LGR WR ++ Y +LH F D + S +SIH +
Sbjct: 96 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWAQRRRQPDSYFSVLHAFIDRKDSHYSIHQI 155
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 282
Q G G + G W GP + + + LA + +A+++ + E+
Sbjct: 156 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 207
Query: 283 R-------GGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
R C D S+HC+ G + W P++LL+PL LGL +N Y+
Sbjct: 208 RLCKSSTPCAGAAACPADPSQHCNGLPAGAEAAGRPSTWRPLVLLIPLRLGLTDINEAYV 267
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D + +
Sbjct: 268 ETLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELAGGFSIPDETFH 327
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+++ +DPS+A+GF+C+ ++DF+D+C + KL+ S P+F + + +
Sbjct: 328 CQHPPCRMNIAELDPSIAVGFFCKTEEDFNDWCQQVKKLSLLSGALPMFELVEQQPSHLA 387
Query: 450 HSDVL 454
DVL
Sbjct: 388 CPDVL 392
>gi|291415044|ref|XP_002723769.1| PREDICTED: APG4 autophagy 4 homolog B [Oryctolagus cuniculus]
Length = 473
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 175/355 (49%), Gaps = 21/355 (5%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
++ +W+LG + + ++ E D +SR+ +YRK F IG + TSD
Sbjct: 103 TSEPVWILGRKYSLLTEKN-----------EILSDVASRLWFTYRKNFPAIGGTGPTSDT 151
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM+ AQAL+ LGR WR QK Y+ +LH F D + S +SIH + Q
Sbjct: 152 GWGCMLRCGQMIFAQALVCRHLGRDWRWTQQKRQPDSYLSVLHAFMDRKDSYYSIHQIAQ 211
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G + G W GP + + + LA + L V+ R P
Sbjct: 212 MGVGEGKSVGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRSSHPC 270
Query: 289 VCIDDASR----HCSVFS-----KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
HC+ F ++ W P++LL+PL LGL +N Y+ TL+L F P
Sbjct: 271 AGAATPPAGADWHCNGFPASTEVTNRSPWRPLVLLIPLRLGLTDINEAYVETLKLCFRMP 330
Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHL 399
QSLG++GGKP ++ Y +G E IYLDPH QP + + D S + + +
Sbjct: 331 QSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDLCFIPDESFHCQHPPCRMSI 390
Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
+DPS+A+GF+C+ ++DF+D+C + KL+ P+F + + + DVL
Sbjct: 391 GELDPSIAVGFFCKTEEDFNDWCQQVRKLSLLGGALPMFELVEQQPPHLACPDVL 445
>gi|307174864|gb|EFN65142.1| Cysteine protease ATG4D [Camponotus floridanus]
Length = 477
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 119/373 (31%), Positives = 176/373 (47%), Gaps = 46/373 (12%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
S S +WLLG C+ ++ L A+ N + EF +DF SR
Sbjct: 86 SKESPVWLLGQCYLKKSEDPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFVSR 145
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF----- 202
I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR ++P
Sbjct: 146 IWLTYRREFQILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQPIETLQQ 205
Query: 203 ---DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
DR + I+ FGD SPFSIH L+ G + G AG W GP ++ C
Sbjct: 206 RLDDRNHRMIIKWFGDQSESPFSIHRLVLLGASAGKRAGDWYGPSSVAHLLSQAVECASK 265
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
++ L A+YV V + D C W ++LLVPL L
Sbjct: 266 QSNSNFDHL--AVYVAQD---------CAVYLQDVENICRT---PDGKWKALVLLVPLRL 311
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G +K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q +++ K
Sbjct: 312 GADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDVWK 371
Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK--LAEESNGAPL 437
+D +++H R + L +DPS +GFY +K+ DF + + + P+
Sbjct: 372 NDFSL--TSFHCTSPRKMLLSKMDPSCCVGFYFPNKEALTDFMETIQRFVIPNQKTNYPM 429
Query: 438 FTVTQTHKKPVNH 450
F + K + H
Sbjct: 430 FLFCEGSGKDLQH 442
>gi|348577273|ref|XP_003474409.1| PREDICTED: cysteine protease ATG4B [Cavia porcellus]
Length = 412
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 178/360 (49%), Gaps = 28/360 (7%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +D+ L D A SR+ +YR+ F IG + TS
Sbjct: 39 TSEPVWILGRKYSIFTEKDDILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 85
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 86 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQI 145
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA + L V+ R G
Sbjct: 146 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRTGL 204
Query: 287 PVV----CIDDASRHCSVF--------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
P DA RHC+ F + + W P++LL+PL LGL +N Y+ TL+
Sbjct: 205 PCAGAAALPTDADRHCNGFPTQTEVTNRQSPSLWRPLVLLIPLRLGLTDINEAYVETLKH 264
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 394
F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D + +
Sbjct: 265 CFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDGCFIPDETFHCQHPP 324
Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 325 CRMGIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVL 384
>gi|395851538|ref|XP_003798310.1| PREDICTED: cysteine protease ATG4B [Otolemur garnettii]
Length = 393
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 122/375 (32%), Positives = 180/375 (48%), Gaps = 58/375 (15%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
++ +W+LG + I ++ E D +SR+ +YRK F IG + TSD
Sbjct: 22 TSEPVWILGRKYSIFTEKE-----------ELLSDVASRLWFTYRKNFPAIGGTGPTSDT 70
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q
Sbjct: 71 GWGCMLRCGQMIFAQALVCQHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQ 130
Query: 229 AGKAYGLAAGSWVGP---------YAMCRSWEALA------------RCQR-AETGLGCQ 266
G G + G W GP A+ +W +LA +R T L C
Sbjct: 131 MGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSLPCG 190
Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLG 320
+ P + AP +HC+ F G + W P++LL+PL LG
Sbjct: 191 TAPAS------------SAAP-------DQHCNGFPAGAEVTTRLSPWRPLVLLIPLRLG 231
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
L +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 232 LTDINAAYVETLKRCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEATDS 291
Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
L D S + + + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F +
Sbjct: 292 CLVPDESFHCQHPPCRMSIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFEL 351
Query: 441 TQTHKKPVNHSDVLG 455
+ + DVL
Sbjct: 352 VEQPPSHLACPDVLN 366
>gi|291226947|ref|XP_002733451.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
kowalevskii]
Length = 356
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 114/311 (36%), Positives = 163/311 (52%), Gaps = 13/311 (4%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + +D + E D SRI I+YRK F IG + TSD GWGC
Sbjct: 26 VWILGKAYHLIRDRS-----------ELLADIKSRIWITYRKNFSAIGGTGPTSDNGWGC 74
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQALL LGR WR ++ + Y +IL LF D + S +SIH + Q G
Sbjct: 75 MLRCGQMILAQALLCKHLGREWRWESREHQNETYCKILKLFLDRKDSCYSIHQIAQMGVG 134
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + L + S+ I VV R C
Sbjct: 135 EGKSIGQWFGPNTVAQVLRKLTLFDDWSSIAVHISMDNTI-VVEDIRKLCRTPLFTECAS 193
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
+ S+ + G W P++L +PL LGL ++NP Y+ L+ FT QSLG++GGKP +
Sbjct: 194 PKAASASLENGGTTYWKPLVLFIPLRLGLTEINPLYLDVLKKCFTLKQSLGMIGGKPNHA 253
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
Y +G ++ +YLDPH QPV++I K D TYH +++ +DPS+A+GF+C
Sbjct: 254 HYFIGFYGKTLVYLDPHTTQPVVDINKWASIPD-DTYHCKHPSRMNIMHLDPSIALGFFC 312
Query: 413 RDKDDFDDFCA 423
+ DFDD C
Sbjct: 313 HCESDFDDLCT 323
>gi|410920724|ref|XP_003973833.1| PREDICTED: cysteine protease ATG4B-like [Takifugu rubripes]
Length = 394
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 117/363 (32%), Positives = 182/363 (50%), Gaps = 27/363 (7%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+T +W+LG + + E D +SR+ +YRK F PIG + TSD
Sbjct: 21 ETTEPVWILG-----------NEYSALTEKEEILSDVTSRLWFTYRKSFPPIGGTGPTSD 69
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLL 227
GWGCMLR QM++ QAL+ LGR WR + +EY+ IL+ F D + S +SIH +
Sbjct: 70 TGWGCMLRCGQMILGQALMCRHLGRDWRWVRGQKQRQEYISILNAFIDKKDSYYSIHQIA 129
Query: 228 QAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 277
Q G G G W GP A+ +W L + + + + + + +
Sbjct: 130 QMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRLVVHVAMDNTVVIEEIKRLCMPWLDK 189
Query: 278 DE---DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
E + ER G C++ A C++ + A W P++LL+PL LGL +N YI TL+
Sbjct: 190 AEVFGEPERVGELNGCLEGA---CALSEEEVALWKPLVLLIPLRLGLSDINGAYIETLKK 246
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 394
F PQSLG++GGKP ++ Y +G IYLDPH Q + + D + +
Sbjct: 247 CFMLPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQTAVEPCEHGQFPDDTYHCQHPP 306
Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
+H+ +DPS+A+GF+CR +D+FDD+C R +L+ + P+F + + + D +
Sbjct: 307 CRMHICELDPSIAVGFFCRTEDEFDDWCMRIRRLSCNKDNLPMFELVDSQPSHLVGVDAI 366
Query: 455 GET 457
T
Sbjct: 367 NLT 369
>gi|74136555|ref|NP_777364.3| cysteine protease ATG4A [Mus musculus]
gi|61211821|sp|Q8C9S8.2|ATG4A_MOUSE RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
cysteine endopeptidase; AltName: Full=Autophagin-2;
AltName: Full=Autophagy-related cysteine endopeptidase
2; AltName: Full=Autophagy-related protein 4 homolog A
gi|59809037|gb|AAH89500.1| Atg4a protein [Mus musculus]
gi|74193939|dbj|BAE36898.1| unnamed protein product [Mus musculus]
Length = 396
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 112/353 (31%), Positives = 176/353 (49%), Gaps = 50/353 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D + +
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + + ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352
>gi|296488734|tpg|DAA30847.1| TPA: cysteine protease ATG4B [Bos taurus]
Length = 390
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 127/368 (34%), Positives = 177/368 (48%), Gaps = 46/368 (12%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
Q G G + G W GP A+ +W ALA + M VV
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALA-----------VHVAMDNTVVMA 177
Query: 278 D-EDGERGGAPVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNP 326
D R P + D+ RHC+ F A W P++LL+PL LGL VN
Sbjct: 178 DIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNA 237
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D
Sbjct: 238 AYAGTLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDE 297
Query: 387 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
S + + + +DPS+A+GF+C +DDF+D+C + SKL+ P+F + +
Sbjct: 298 SFHCQHPPGRMSIAELDPSIAVGFFCETEDDFNDWCQQVSKLSLLGGALPMFELVEQQPS 357
Query: 447 PVNHSDVL 454
+ DVL
Sbjct: 358 HLACPDVL 365
>gi|47564102|ref|NP_001001170.1| cysteine protease ATG4B [Bos taurus]
gi|61211780|sp|Q6PZ03.1|ATG4B_BOVIN RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related cysteine endopeptidase 2B;
Short=Autophagin-2B; AltName: Full=Autophagy-related
protein 4 homolog B; AltName: Full=bAut2B
gi|45861660|gb|AAS78583.1| Aut2b2 [Bos taurus]
Length = 393
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 127/368 (34%), Positives = 177/368 (48%), Gaps = 46/368 (12%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
Q G G + G W GP A+ +W ALA + M VV
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALA-----------VHVAMDNTVVMA 177
Query: 278 D-EDGERGGAPVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNP 326
D R P + D+ RHC+ F A W P++LL+PL LGL VN
Sbjct: 178 DIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNA 237
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D
Sbjct: 238 AYAGTLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDE 297
Query: 387 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
S + + + +DPS+A+GF+C +DDF+D+C + SKL+ P+F + +
Sbjct: 298 SFHCQHPPGRMSIAELDPSIAVGFFCETEDDFNDWCQQVSKLSLLGGALPMFELVEQQPS 357
Query: 447 PVNHSDVL 454
+ DVL
Sbjct: 358 HLACPDVL 365
>gi|417410350|gb|JAA51650.1| Putative cysteine protease required for autophagy, partial
[Desmodus rotundus]
Length = 394
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 176/366 (48%), Gaps = 42/366 (11%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 23 TSEPVWILGRRYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQALL LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 269
Q G G + G W GP A+ +W ALA + + + SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHVAMDNTVVMEDIRRLCRSSLP 189
Query: 270 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
A D +G G P + + + W P++LL+PL LGL +N Y
Sbjct: 190 CAGASAFPADSEGHCNGFPAR---------AEVTNRPSPWRPLVLLIPLRLGLTDINEAY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCSIPDESF 300
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 448
+ + + +DPS+A+GF+C +DDF D+C + KL+ P+F + + +
Sbjct: 301 HCQHPPSRMSIGELDPSIAVGFFCETEDDFGDWCQQVKKLSLLGGALPMFELVEQQPSHL 360
Query: 449 NHSDVL 454
DVL
Sbjct: 361 ACPDVL 366
>gi|308802424|ref|XP_003078525.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
gi|116056978|emb|CAL51405.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
Length = 424
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 126/364 (34%), Positives = 182/364 (50%), Gaps = 57/364 (15%)
Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 174
+ GV H ++ + G+ + G E+ +D+ SR ++YR+GF+ +G +K +D GWGC L
Sbjct: 42 MFGVTH-WDRETSSGERSNEVGRREWERDWRSRCWMTYRRGFEALGRTKWCTDAGWGCTL 100
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDRE----------------------------- 205
RS+QM++A AL H GR WR+ +Q E
Sbjct: 101 RSAQMMLANALSIHSRGRHWRREVQLVAVHENETADDGSKSPAVSFLSGVVNKLKIPQSE 160
Query: 206 --------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
+IL LF D +PFSIH + + +G G W P MCR++EAL
Sbjct: 161 RTRAGSDAQEDILRLFADEVGAPFSIHRVCEKTTEWGAPPGRWFEPSVMCRAFEALV--- 217
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 317
AE LG + + ++VVSG E GE GG P V D+A G+A +LL VP+
Sbjct: 218 -AEHDLGSE---LTVHVVSGRE-GEDGGVPTV--DEAEVRAKSADVGKA----LLLFVPV 266
Query: 318 VLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
VLG+ + +N RY+ LR F QS+GIVGG+P +S Y+VG ++ YLDPH VQ +
Sbjct: 267 VLGVGRTINARYLSQLRSMMAFKQSVGIVGGRPNSSLYLVGHSDDVFFYLDPHTVQVASS 326
Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 436
+ D E +Y+ H+ +DP+LA+GFYCRD DD LA + AP
Sbjct: 327 MVTMDFE----SYYCPTPLHVCGGDLDPTLALGFYCRDGDDVASLLVDIEALARVNATAP 382
Query: 437 LFTV 440
+
Sbjct: 383 ALAI 386
>gi|440901286|gb|ELR52261.1| Cysteine protease ATG4B, partial [Bos grunniens mutus]
Length = 393
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 121/358 (33%), Positives = 174/358 (48%), Gaps = 26/358 (7%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA + L V++ R
Sbjct: 129 AQMGVGEGKSVGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187
Query: 287 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
P + D+ RHC+ F A W P++LL+PL LGL VN Y TL+ F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
+ + +DPS+A+GF+C +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 308 MSIAELDPSIAVGFFCETEDDFNDWCQQVRKLSLLGGALPMFELVEQQPSHLACPDVL 365
>gi|148233205|ref|NP_001088025.1| cysteine protease ATG4B [Xenopus laevis]
gi|61211762|sp|Q640G7.1|ATG4B_XENLA RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|52221191|gb|AAH82660.1| LOC494717 protein [Xenopus laevis]
Length = 384
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 115/342 (33%), Positives = 169/342 (49%), Gaps = 36/342 (10%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YR+ F IG + TSD GWGCMLR QM+ AQAL+ +GR WR QKP
Sbjct: 45 DITSRLWFTYRRNFQAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHVGRDWRWDKQKP- 103
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
EY+ IL F D + S +SIH + Q G G G W GP + + LA + +
Sbjct: 104 KGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKYIGQWYGPNTVAQVLRKLAVFDQWSS- 162
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD--------------- 307
+A+++ + V +D+ R C S +D
Sbjct: 163 -------IAVHIAMDN---------TVVVDEIRRLCRAGSGESSDAGALSNGYTGDSDPS 206
Query: 308 ---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
W P++LL+PL LGL ++N YI TL+ F PQSLG++GG+P ++ Y +G + I
Sbjct: 207 CAQWKPLVLLIPLRLGLSEINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDELI 266
Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
YLDPH Q + D S + +H+ IDPS+A+GF+C ++DF+D+C
Sbjct: 267 YLDPHTTQLSVEPSDCSFIEDESFHCQHPPCRMHVSEIDPSIAVGFFCSSQEDFEDWCQH 326
Query: 425 ASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
KL+ P+F V +++ DVL T + D L
Sbjct: 327 IKKLSLSGGALPMFEVVDQLPLHLSNPDVLNLTPDSSDADRL 368
>gi|27763985|emb|CAD43221.1| autophagin-2 [Mus musculus]
gi|148675648|gb|EDL07595.1| mCG64870 [Mus musculus]
Length = 396
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 111/353 (31%), Positives = 174/353 (49%), Gaps = 50/353 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPFKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + L + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLTLFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTVSNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D + +
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + + ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352
>gi|149244060|pdb|2Z0D|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-120) Complex
gi|149244062|pdb|2Z0E|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-124) Complex
Length = 357
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 176/353 (49%), Gaps = 40/353 (11%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDP QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPATTQPAVEPTDGCFIPDESFH 303
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVE 356
>gi|18181958|dbj|BAB83890.1| Apg4B [Homo sapiens]
Length = 392
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 121/366 (33%), Positives = 181/366 (49%), Gaps = 41/366 (11%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEK-SIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 179
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 180 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 239
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 240 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 299
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
+ + ++DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + +
Sbjct: 300 CQHPPCRMSIANLDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 359
Query: 450 HSDVLG 455
DVL
Sbjct: 360 CPDVLN 365
>gi|328874598|gb|EGG22963.1| hypothetical protein DFA_05093 [Dictyostelium fasciculatum]
Length = 432
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 109/313 (34%), Positives = 171/313 (54%), Gaps = 11/313 (3%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR-LGRPWR 195
+ EF +DFS+++ SYR+GF+ IGDS +D GWGCMLRS QML+A LL + +G+ W+
Sbjct: 88 IEEFLEDFSNKLWCSYRQGFECIGDSLFENDCGWGCMLRSGQMLLANVLLLNSPIGKDWK 147
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALA 254
KP + ++ +++ LF D ++PFSIHN+ G+ + G + G W P + + AL
Sbjct: 148 KPQNGEYPEDFYKVVRLFLDRPSAPFSIHNIALHGRNHLGKSIGEWFAPSNISNAIRALV 207
Query: 255 -RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC----SVFSKGQADWT 309
+ G + + + V DD S + + + W
Sbjct: 208 YKYDNHLNGTSEEDSSDEEKEGKKKKGDNQCNLSVYVSDDGSLYIDQLLEIALRSDGSWM 267
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+L+L+P LG++ +N Y L +TFPQ+LGIVGGKP AS Y + Q+++ YLDPH
Sbjct: 268 PLLILIPTKLGIDTINEIYYRPLLDIYTFPQNLGIVGGKPRASLYFIASQDDNLFYLDPH 327
Query: 370 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
VQ I + D + S+Y ++ + ++ +DPSL I F+C K+ F DF R+ KL
Sbjct: 328 TVQNSI---ESDSDFSLSSYFCNIPKKANISEVDPSLVIPFFCSTKESFLDFLERSKKL- 383
Query: 430 EESNGAPLFTVTQ 442
E S+ PL+ + +
Sbjct: 384 ESSSEFPLYNIQE 396
>gi|260795879|ref|XP_002592932.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
gi|229278156|gb|EEN48943.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
Length = 380
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 116/349 (33%), Positives = 179/349 (51%), Gaps = 41/349 (11%)
Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
W+LGV + +D E D SSR+ +YRK F PIG + SD GWGCM
Sbjct: 32 WILGVGYNTVKDRQ-----------ELQNDISSRLWFTYRKNFTPIGGTGPMSDQGWGCM 80
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
LR QM++ QAL+ LGR WR +D +Y +IL LF D + S +SIH + Q G +
Sbjct: 81 LRCGQMMLGQALICRHLGRDWRWK-SAVYDNDYTKILQLFLDKKDSCYSIHQIAQMGVSE 139
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE----------DGER 283
G + G W GP + + + LA + + +AI+V + R
Sbjct: 140 GKSVGQWFGPNTVAQVLKKLALFEDWSS--------LAIHVAMDNTVIIDDIKKLCRSAR 191
Query: 284 GGAP------VVCIDDASRHCSVFSKGQA-DWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
P +C ++ S S+ A W P++L++PL LGL ++NP Y L+ F
Sbjct: 192 QPTPSQVTNSFLCNGVSAEQTSARSRSPALPWQPLMLIIPLRLGLSELNPVYTDCLKACF 251
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
T QSLG++GGKP + Y +G S +YLDPH QP + + + ++ S++H
Sbjct: 252 TLRQSLGMIGGKPNHAHYFIGYVGNSLVYLDPHTTQPAVEL-EGNVPIPDSSFHCTHPSR 310
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKL--AEESNGAPLFTVTQT 443
+++ +DPS+A+GF+C+D+ DF D C +L +++ A +F V Q+
Sbjct: 311 MNIQDLDPSIALGFFCQDEADFADLCENMRRLIIGQKTQNA-MFEVVQS 358
>gi|224510547|pdb|2ZZP|A Chain A, The Crystal Structure Of Human Atg4b(C74s)- Lc3(1-124)
Complex
Length = 357
Score = 198 bits (504), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 176/353 (49%), Gaps = 40/353 (11%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 25 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWG MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGSMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVE 356
>gi|26334447|dbj|BAC30924.1| unnamed protein product [Mus musculus]
Length = 396
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 111/353 (31%), Positives = 175/353 (49%), Gaps = 50/353 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+Y + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYDSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
+ F PQSLG +GGKP + Y +G + I+LDPH Q ++I + L D + +
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + + ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352
>gi|417410362|gb|JAA51656.1| Putative cysteine protease required for autophagy, partial
[Desmodus rotundus]
Length = 396
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 175/343 (51%), Gaps = 27/343 (7%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
G + G W GP A+ W +LA + + + + V+ S D
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADMPS 195
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
E P+ +A+ H S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 196 ESSHDPL----NATNHNKAISACCPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
LG +GGKP + Y +G + I+LDPH Q ++ ++ + D + + + + + +
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQSPQRMSILN 311
Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 312 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 353
>gi|338729393|ref|XP_001490718.3| PREDICTED: cysteine protease ATG4A [Equus caballus]
Length = 398
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 111/343 (32%), Positives = 179/343 (52%), Gaps = 27/343 (7%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDEDG 281
G + G W GP A+ W +LA + + + + I +S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTAG 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
E +P ++ ++R S S G W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 E---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 313
Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|189233733|ref|XP_971091.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270015047|gb|EFA11495.1| hypothetical protein TcasGA2_TC014208 [Tribolium castaneum]
Length = 453
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 172/346 (49%), Gaps = 60/346 (17%)
Query: 108 SSTSDIWLLGVCHK-----------------IAQDEALGDAAGNNGLAEFNQDFSSRILI 150
S S +WLLG C++ Q ++ ++ + G F +DF SR+ +
Sbjct: 63 SKESPVWLLGKCYRRIESPSSDSTELGTDVAAFQSQSEIASSDDEGFEGFKKDFISRLWL 122
Query: 151 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDRE-YVE 208
+YR+ F + S +SD GWGCMLRS QML+AQAL+ H LGR WR +P +P RE ++E
Sbjct: 123 TYRREFPILNGSNYSSDCGWGCMLRSGQMLIAQALVCHILGRDWRWQPDHQPTTRESFIE 182
Query: 209 ------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
I+ FGD S SPFSIH L+ G+A G AG W GP
Sbjct: 183 VVNHRKIIKWFGDKPSRNSPFSIHTLVALGEASGKKAGDWYGP----------------- 225
Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--------GQADWTPIL 312
G A S ED + VC+ ++ C+V+ K W ++
Sbjct: 226 -GFVAHLFRQAFKRAS--EDNYEFDSLTVCV---AQDCAVYIKDVMEECTDKNGKWKSLI 279
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
LL+P+ LG EK N Y P L F+ Q +GI+GG+P S Y VG Q++ I+LDPH Q
Sbjct: 280 LLIPVRLGAEKFNSIYAPCLTTLFSLKQCIGIIGGRPKHSLYFVGYQDDKLIHLDPHYCQ 339
Query: 373 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
V+++ D +++H R IHL +DPS IGFYC K+ F
Sbjct: 340 EVVDVWAVDFP--LTSFHCRSPRKIHLSKMDPSCCIGFYCPTKESF 383
>gi|118404310|ref|NP_001072464.1| autophagy related 4B, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|115291929|gb|AAI21871.1| cysteine endopeptidase AUT-like (1O128) [Xenopus (Silurana)
tropicalis]
Length = 384
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 118/334 (35%), Positives = 172/334 (51%), Gaps = 20/334 (5%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YR+ F IG + TSD GWGCMLR QM+ AQALL +GR WR QK
Sbjct: 45 DITSRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHIGRDWRWDKQKS- 103
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
EY+ IL F D + S +SIH + Q G G G W GP + + LA + +
Sbjct: 104 QGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKCIGQWYGPNTVAQVLRKLAVFDQWSS- 162
Query: 263 LGCQSLPMAIYV-----VSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPIL 312
+A+++ V DE A +A C+ ++ G +D W P++
Sbjct: 163 -------IAVHIAMDNTVVMDEIRRLCRAGTNESSEAGALCNGYT-GVSDPSCSLWKPLV 214
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
LL+PL LGL +N YI TL+ F PQSLG++GG+P ++ Y +G + IYLDPH Q
Sbjct: 215 LLIPLRLGLSDINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDELIYLDPHTTQ 274
Query: 373 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES 432
+ D S + +H+ IDPS+A+GF+CR ++DF+D+C + KL+
Sbjct: 275 LAVEPSDCCFVEDESFHCQHPPCRMHVSEIDPSIAVGFFCRSQEDFEDWCQQIKKLSLSG 334
Query: 433 NGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
P+F V +++ DVL T + D L
Sbjct: 335 GALPMFEVVDQLPLHLSNPDVLNLTPDSSDADRL 368
>gi|348563665|ref|XP_003467627.1| PREDICTED: cysteine protease ATG4A-like [Cavia porcellus]
Length = 398
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/342 (31%), Positives = 175/342 (51%), Gaps = 25/342 (7%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
G + G W GP A+ W +LA + + + + V+ D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPFSADTAD 197
Query: 284 GGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
+P I + S+ S F W P+LL+VPL LG+ ++NP Y+ + F PQSL
Sbjct: 198 KSSPDSFITSNQSKDTSAFCPA---WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSL 254
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 402
G +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ ++
Sbjct: 255 GALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQQMNILNL 314
Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 315 DPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|449676306|ref|XP_002158689.2| PREDICTED: cysteine protease ATG4C-like [Hydra magnipapillata]
Length = 442
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 117/344 (34%), Positives = 176/344 (51%), Gaps = 28/344 (8%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNN------GLAEFNQDFSSRILISYRKGFDPIGDS 162
S S IWLLG C+ Q E A N G+ F +DFSS I +SYRK F + +S
Sbjct: 63 SDSPIWLLGRCYYAKQAEYDSKNAVQNTQYKIHGIDCFFEDFSSLIYLSYRKHFSQLANS 122
Query: 163 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGD--SET 218
+TSD GWGCMLR+ QML+A ALL H L WR +K ++ Y+ IL F D S+
Sbjct: 123 NLTSDSGWGCMLRTGQMLLANALLIHMLKEGWRISERKYTEKNYIYRMILRFFNDENSDN 182
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 277
SPFS+H L++ G G W GP ++ + A + S P + + V
Sbjct: 183 SPFSLHELVRIGSK---KPGEWYGPTSVAHTLSA---------AVNLTSHPVLDTFRVYV 230
Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
D V+ ++C+ + + W +L+LVP+ LG + +NP YIP L+ T
Sbjct: 231 ANDCTVYIKDVISTSTKCKNCTKKTCQEKFWRSMLILVPIRLGSDGLNPIYIPCLKALLT 290
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 397
+GI+GG+P S Y VG Q + I LDPH +Q +++ + ++ H + +
Sbjct: 291 LDYCVGIIGGRPKHSLYFVGFQGKKLINLDPHYLQEYVDMTTQEFPVESFRCHYP--KKM 348
Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE---ESNGAPLF 438
+DPS A+GFYCR ++DF+ C +A ++ + + P+F
Sbjct: 349 AFKKMDPSCAVGFYCRTREDFESLCKQAVEMLKPPMQRTEYPMF 392
>gi|197100863|ref|NP_001126588.1| cysteine protease ATG4A [Pongo abelii]
gi|61211744|sp|Q5R699.1|ATG4A_PONAB RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|55732020|emb|CAH92717.1| hypothetical protein [Pongo abelii]
Length = 398
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 178/356 (50%), Gaps = 53/356 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 328
D + C V SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRVLPLGADTAGDRPPDSLTASNLSKGTSAYCSAWKPLLLIVPLRLGINQINPVY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ G++ D +
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTGENGTVNDQTF 300
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 301 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|156395764|ref|XP_001637280.1| predicted protein [Nematostella vectensis]
gi|156224391|gb|EDO45217.1| predicted protein [Nematostella vectensis]
Length = 368
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 175/335 (52%), Gaps = 41/335 (12%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
+ D+W+LG + I Q GD + N D SRI ++YRK F IG + T+D
Sbjct: 26 TEEDVWILGKRYNILQ----GD------MGYLNTDVRSRIWLTYRKNFPKIGGTGPTTDS 75
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM++AQAL+ LGR W+ + EY++IL F D + S +SIH + Q
Sbjct: 76 GWGCMLRCGQMMLAQALVCRHLGRDWQWDPENNTTPEYMQILEAFLDKKDSLYSIHQIAQ 135
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G + G A GSW GP + + + L+ + + ++V +
Sbjct: 136 MGVSEGKAVGSWFGPNTVAQVLKKLSAFDDWSS--------LCLHVAMDN---------T 178
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
V I+D S +W P++L +PL LGL ++N Y L+ FTF QSLGI+GG+
Sbjct: 179 VIIEDIS-----------NWRPLVLFIPLRLGLTEMNVVYNEPLKACFTFKQSLGIIGGR 227
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL-EADTSTYHSDVIRHIHLDSIDPSLA 407
P +TY +G + +YLDPH Q +N D+L ++H +++ +DPS+A
Sbjct: 228 PNHATYFIGYFGNNLVYLDPHTTQQTVN--PDELSRIPDGSFHCVYPCRMNIADVDPSVA 285
Query: 408 IGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
+GF+C+ ++DFDD C + K + P+F + +
Sbjct: 286 LGFFCKSEEDFDDLCQQIQKKIIDGKSRPMFEIAK 320
>gi|402911087|ref|XP_003918174.1| PREDICTED: cysteine protease ATG4A isoform 1 [Papio anubis]
Length = 398
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 179/351 (50%), Gaps = 43/351 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S HC W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 193 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 245
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D + +
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVNDQTFHCLQS 305
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|449666316|ref|XP_002168183.2| PREDICTED: cysteine protease ATG4B-like [Hydra magnipapillata]
Length = 436
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 174/366 (47%), Gaps = 44/366 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG K +D + +FN + ++ +YR+ F PIG + SD GWGC
Sbjct: 31 VWILGKHFKPDED-----------MEKFNAEILTKFWFTYRRNFHPIGGTGPMSDTGWGC 79
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQALL LGR W + + Y+ ILH F D + S +SIH + Q G
Sbjct: 80 MLRCGQMMLAQALLCRHLGRDWDWRSGRKDNEIYMMILHSFLDKKDSLYSIHQIAQMGVG 139
Query: 233 YGLAAGSWVGPYAMCRSWEALAR-------------------------CQRAETGLGCQS 267
G G W GP + + + L C+ + GC
Sbjct: 140 EGKQIGQWFGPNTVAQVIKKLVLFDDNADMAVHVAMDNTVVIEDIKKLCKSSINAWGCYG 199
Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHC-------SVFSKGQADWTPILLLVPLVLG 320
I+ S + P C ++S+ S S+ W P+LL +PL LG
Sbjct: 200 ECSYIHDRSSLTGNQSVSKPPHCSCESSQKLKSNRKLKSFNSEELQSWRPLLLFIPLRLG 259
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
L ++N Y +L++ FT QSLG++GGKP + Y +G + +YLDPH Q I +
Sbjct: 260 LSEINSDYYNSLKIMFTLRQSLGVIGGKPNHAHYFIGFNGDRLLYLDPHTTQQTIEPERF 319
Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
++ D S +H + S+DPS+A+GFYC +DDFDD+C ++L + P+F +
Sbjct: 320 NVIPDES-FHCVYPCFMSFQSLDPSVALGFYCHTEDDFDDWCQAVNELVVQREKRPMFEI 378
Query: 441 TQTHKK 446
QT +
Sbjct: 379 NQTRPR 384
>gi|403289551|ref|XP_003935915.1| PREDICTED: cysteine protease ATG4A isoform 1 [Saimiri boliviensis
boliviensis]
Length = 422
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 111/348 (31%), Positives = 179/348 (51%), Gaps = 37/348 (10%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 216
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
D G+R + ++ SR S + W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 217 ADTPGDRPPDSLTASNE-SRGTSAYCPA---WKPLLLIVPLRLGINQINPVYVDAFKECF 272
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + +
Sbjct: 273 KMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQR 332
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 333 MNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 379
>gi|344286328|ref|XP_003414911.1| PREDICTED: cysteine protease ATG4A [Loxodonta africana]
Length = 411
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 177/356 (49%), Gaps = 53/356 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + + + + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 42 VWILGKQHLLKTERS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 90
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 91 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 150
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 151 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 193
Query: 293 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 328
D + C VF SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 194 DIKKMCCVFPLSAGAAGESPPAFPSASSQSKGTSACCPAWKPLLLIVPLRLGINQINPVY 253
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D +
Sbjct: 254 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTF 313
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + +++ ++DPS+A+GF+C+++ DFD++C K + N +F + Q H
Sbjct: 314 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCCLVQKEILKEN-LRMFELVQKH 368
>gi|350537069|ref|NP_001233457.1| cysteine protease ATG4A [Pan troglodytes]
gi|343958112|dbj|BAK62911.1| cysteine protease ATG4A [Pan troglodytes]
gi|410207960|gb|JAA01199.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410248796|gb|JAA12365.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410290856|gb|JAA24028.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410329967|gb|JAA33930.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
Length = 398
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 180/351 (51%), Gaps = 43/351 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP++I
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSI---- 193
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 194 -DTPGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 245
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|395528686|ref|XP_003766458.1| PREDICTED: cysteine protease ATG4B [Sarcophilus harrisii]
Length = 393
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 115/364 (31%), Positives = 179/364 (49%), Gaps = 36/364 (9%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
+T +W+LG + I ++ E D +SR+ +YRK F IG + TSD
Sbjct: 22 TTEPVWILGRKYTIFTEKE-----------EILSDVTSRLWFTYRKNFPAIGGTGPTSDT 70
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM+ AQAL+ LGR WR + Y +L+ F D + S +SIH + Q
Sbjct: 71 GWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQ 130
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGER 283
G G + G W GP + + + LA + +A+++ V +E
Sbjct: 131 MGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRL 182
Query: 284 GGAPVVCIDDAS--RHCSVFSKGQ----------ADWTPILLLVPLVLGLEKVNPRYIPT 331
A C D A+ + S G + W P++LL+PL LGL +N Y T
Sbjct: 183 CKAGFPCADGAAFPTDSELLSNGYPPAAEVTDRASPWRPLVLLIPLRLGLTDINEAYTET 242
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
L+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + + + D + +
Sbjct: 243 LKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVESTEGGVFPDETFHCQ 302
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS 451
+++ +DPS+A+GF+C+ ++DF+D+C + KL+ P+F + + +
Sbjct: 303 HPPCRMNIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSRIPGALPMFELVERQPSHFSCP 362
Query: 452 DVLG 455
DVL
Sbjct: 363 DVLN 366
>gi|146387686|pdb|2P82|A Chain A, Cysteine Protease Atg4a
gi|146387687|pdb|2P82|B Chain B, Cysteine Protease Atg4a
gi|146387688|pdb|2P82|C Chain C, Cysteine Protease Atg4a
gi|146387689|pdb|2P82|D Chain D, Cysteine Protease Atg4a
Length = 355
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 25 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 74 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 133
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 134 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 193
Query: 282 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 194 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 246
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 247 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 306
Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 307 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 351
>gi|47212536|emb|CAF90552.1| unnamed protein product [Tetraodon nigroviridis]
Length = 366
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 111/324 (34%), Positives = 163/324 (50%), Gaps = 41/324 (12%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D +SR+ +YRKGF PIG + TSD GWGCMLR QM++ QAL+ LGR WR +
Sbjct: 68 DVTSRLWFTYRKGFPPIGGTGPTSDTGWGCMLRCGQMILGQALMCRHLGRDWRWVSGEEQ 127
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
EYV IL+ F D + S +SIH + + +C W A A G
Sbjct: 128 RHEYVNILNAFIDKKDSYYSIHQIER-----------------LCMPWLDKAEACAASEG 170
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
+G + +G GA C+ + A W P++LL+PL LGL
Sbjct: 171 VG-------------ELNGYLEGA-----------CAFSEEETALWKPLVLLIPLRLGLT 206
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH Q ++ +D
Sbjct: 207 DINEAYIETLKKCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQTAVDPCEDGT 266
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
D S + +H+ +DPS+A GF+CR +D+FDD+C R +L+ + P+F + +
Sbjct: 267 FTDDSYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDWCMRIRRLSCNRDNLPMFELVE 326
Query: 443 THKKPVNHSDVLGETGGVPEDDSL 466
+ + D + T + + L
Sbjct: 327 SQPSHMVSVDAINLTPDFSDSERL 350
>gi|119623100|gb|EAX02695.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_f
[Homo sapiens]
Length = 402
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 179/351 (50%), Gaps = 43/351 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 196
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 197 ADTAGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 249
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 250 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 309
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 310 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 359
>gi|195113543|ref|XP_002001327.1| GI10728 [Drosophila mojavensis]
gi|193917921|gb|EDW16788.1| GI10728 [Drosophila mojavensis]
Length = 682
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 162/314 (51%), Gaps = 17/314 (5%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + L ++ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 262 AAENQLAESPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 321
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S+ SPFSIH L++ G+ G
Sbjct: 322 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 381
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDE---DGERGGAP 287
G W GP ++ + AL R S+ +A IY+ +E E P
Sbjct: 382 KPGDWYGPASVSYLLKHALEHAARENADFDNISVYVAKDCTIYIQDIEELCSIPEPAPKP 441
Query: 288 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 347
V A R S K W +++L+PL LG +K+NP Y L+L + LGI+GG
Sbjct: 442 HVPWQQAKRSTSDAPKPDQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEYCLGIIGG 501
Query: 348 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 407
KP S Y VG QE+ I+LDPH Q ++++ ++ ++H R + +DPS
Sbjct: 502 KPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETFP--MHSFHCKSPRKLKSSKMDPSCC 559
Query: 408 IGFYCRDKDDFDDF 421
IGFYC K DFD F
Sbjct: 560 IGFYCPTKTDFDSF 573
>gi|397497900|ref|XP_003819741.1| PREDICTED: cysteine protease ATG4A isoform 1 [Pan paniscus]
Length = 398
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 179/351 (50%), Gaps = 43/351 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 193 ADTPGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 245
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|30795252|ref|NP_443168.2| cysteine protease ATG4A isoform a [Homo sapiens]
gi|426397036|ref|XP_004064734.1| PREDICTED: cysteine protease ATG4A isoform 1 [Gorilla gorilla
gorilla]
gi|61211859|sp|Q8WYN0.1|ATG4A_HUMAN RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
cysteine endopeptidase; AltName: Full=Autophagin-2;
AltName: Full=Autophagy-related cysteine endopeptidase
2; AltName: Full=Autophagy-related protein 4 homolog A;
Short=hAPG4A
gi|18181956|dbj|BAB83889.1| Apg4A [Homo sapiens]
gi|27763979|emb|CAD43218.1| autophagin-2 [Homo sapiens]
gi|38197608|gb|AAH61696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
gi|119623094|gb|EAX02689.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|189069378|dbj|BAG37044.1| unnamed protein product [Homo sapiens]
gi|312151352|gb|ADQ32188.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [synthetic
construct]
Length = 398
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 179/351 (50%), Gaps = 43/351 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 193 ADTAGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 245
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|328769729|gb|EGF79772.1| hypothetical protein BATDEDRAFT_35298 [Batrachochytrium
dendrobatidis JAM81]
Length = 441
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 119/312 (38%), Positives = 168/312 (53%), Gaps = 32/312 (10%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 198
F DF SR+ ++YRKGF I + T D GWGCMLRS QMLVA ALLFH LGR WR L
Sbjct: 137 HFLDDFHSRLWMTYRKGFAAIKPTGYTCDSGWGCMLRSGQMLVANALLFHELGRDWR--L 194
Query: 199 QKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 254
DR+ Y IL F D TSP+SI + G + G W GP + + + L
Sbjct: 195 GDSNDRDTWLTYCSILTKFLDVNTSPYSIQRIATLGIRFDKQIGEWFGPSTISQVLKVLV 254
Query: 255 RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILL 313
Q + + ++V DG + I A+R G+ TP +L+
Sbjct: 255 NDD--------QRISLKVHV---SNDGVVYKNEINTILSATR-----DDGK---TPAVLI 295
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
++PL LG+E +NP Y P ++ F +GI GG+P +S + +GV + IYLDPH ++P
Sbjct: 296 MIPLRLGVETMNPVYYPGVKHCFAMSHCVGIAGGRPNSSLFFLGVDGDHLIYLDPHHLRP 355
Query: 374 VI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430
+ +I +E D +YH + +R + + S+DPSL IGFYC DFD CA+ ++LA
Sbjct: 356 SVDSRDITSYKME-DLLSYHCEKVRLLPIASMDPSLVIGFYCHSLKDFDVLCAKMTELAT 414
Query: 431 ESNGAPLFTVTQ 442
S APLF++ +
Sbjct: 415 GS--APLFSIEE 424
>gi|62860068|ref|NP_001016619.1| autophagy related 4A, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|89269917|emb|CAJ81691.1| APG4 autophagy 4 homolog A (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
gi|171846953|gb|AAI61565.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
gi|213625518|gb|AAI70776.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
gi|213627145|gb|AAI70802.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
Length = 395
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 163/316 (51%), Gaps = 27/316 (8%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR WR K
Sbjct: 52 DIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWRWEKHKEH 111
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
EY +IL F D + +SIH + Q G G + G W GP + + + LA +
Sbjct: 112 PEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNS- 170
Query: 263 LGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSK-----GQAD-WT 309
+A+Y VV D P C + A+ + S +S+ GQ+ W
Sbjct: 171 -------LAVYVSMDNTVVIEDIKTMCKYQPHSCSMAQAASYQSTWSRCRDASGQSSGWR 223
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G + IYLDPH
Sbjct: 224 PLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEIIYLDPH 283
Query: 370 DVQPVINIGKDDLEADTSTYHSDV-IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
Q + D E TYH + + ++DPS+A+GF+C+D++DFD++C K
Sbjct: 284 TTQTFV-----DTEDQDQTYHCQKGPNSMKVLNLDPSVALGFFCKDENDFDNWCEVIEKE 338
Query: 429 AEESNGAPLFTVTQTH 444
+ +F +T H
Sbjct: 339 ILKHQSLRMFELTPKH 354
>gi|391340875|ref|XP_003744760.1| PREDICTED: cysteine protease ATG4D-like [Metaseiulus occidentalis]
Length = 488
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 126/403 (31%), Positives = 191/403 (47%), Gaps = 48/403 (11%)
Query: 66 SEKKAVHNKSNGWTAAVK---RLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI 122
S +AV N+ GW A +K + +G+ I + S I+LLG +
Sbjct: 89 STSEAVKNRVRGWWANMKYGWNAMNSGAQIDISDL----------SGADPIYLLGHVYHN 138
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
+ A F DFS+R+ +YR+ F P+ + TSD GWGCMLRS+QM++A
Sbjct: 139 KNNSA--------SFKNFFADFSTRLWFTYRQDFQPMQSTGHTSDSGWGCMLRSAQMMLA 190
Query: 183 QALLFHRLGRPWRKPLQKPFDREYV--EILHLFG---DSETSPFSIHNLLQAGKAYGLAA 237
+A +FH LGR WR Q+ V +I+ F D+ +PFS+HN+++A G A
Sbjct: 191 EAFIFHLLGRQWRWCPQQQQQEHGVHRKIIKWFSDDPDTTEAPFSVHNMVRAAAHCGKKA 250
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQS---LPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP L RC G+ MAIYV + D
Sbjct: 251 GDWFGPSTAAY---LLKRCLEEAAGVADSKEIFEQMAIYVAQD---------CTIYTQDV 298
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
C+ S +W ++LL+P+ LG E+VN YI ++ + LGI+GGKP S Y
Sbjct: 299 LDLCT--SDPNIEWKSVVLLIPVRLGGERVNVNYIHCIKEILAYQNCLGIIGGKPRHSLY 356
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
VG Q + +YLDPH +Q + + L +++H R + +DPS IGFYC+
Sbjct: 357 FVGFQGKKLVYLDPHYLQKTTDTSR--LNFSVNSFHCTTARKVSFSKLDPSATIGFYCKT 414
Query: 415 KDDFDDFCARASKLAE---ESNGAPLFTVTQTHKKPVNHSDVL 454
+ DF+ F + + E ++ G P+F +++ VN + L
Sbjct: 415 RRDFESFQSIMQSVTESCPQNQGYPVFIISEGSSALVNQLNPL 457
>gi|410989157|ref|XP_004000831.1| PREDICTED: cysteine protease ATG4A isoform 1 [Felis catus]
Length = 398
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 109/342 (31%), Positives = 178/342 (52%), Gaps = 25/342 (7%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
G + G W GP A+ W +LA + + + + V+ D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTVG 197
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
P ++ +++ F+ A W P+LL+VPL LG+ ++NP Y+ + F PQSLG
Sbjct: 198 ESTPGT-LNASNQSRGTFACCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSLG 255
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLDSI 402
+GGKP + Y +G + I+LDPH Q +N +++ D T+H + +++ ++
Sbjct: 256 ALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVNT-EENGTVDDQTFHCLQSPQRMNILNL 314
Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 315 DPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|405953478|gb|EKC21133.1| Leucine-rich repeat-containing protein 6 [Crassostrea gigas]
Length = 1114
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 182/364 (50%), Gaps = 32/364 (8%)
Query: 111 SDIWLLGVCHKIAQDEALGDAAGNN-------GLAEFNQDFSSRILISYRKGFDPIGDSK 163
S +WLLG + I + + D + +F QDFSS + +YR+ F I +K
Sbjct: 226 SPVWLLGKFYHIKPSDLIDDDIQRGKRTRVVPNIEKFKQDFSSLLWFTYRQDFPAIPGTK 285
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV--EILHLFGD--SETS 219
+TSD GWGCMLRS QM++A+AL H LG W + ++E +I+ FGD + S
Sbjct: 286 LTSDCGWGCMLRSGQMMLAKALTLHYLGPEWNVFSDQTREQETYRKQIIRWFGDYLCDES 345
Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGD 278
PFS+H L++ GK G G W GP ++ E + + Q+ +T L + +YV
Sbjct: 346 PFSMHRLVEVGKNLGKQPGEWFGPASVAHILKETMVKGQKTQTVLS----DLCVYVSQDC 401
Query: 279 EDGERGGAPVVCI----------DDASRHCSVFSKGQADWT-PILLLVPLVLGLEKVNPR 327
++ + C S H S DW +++L+P+ LG E++NP
Sbjct: 402 TVYKQDIYELCCTRPRADTKFTNSTESEHESSQDASSMDWKRAVVILIPVRLGGEQLNPV 461
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
YIP ++ + +GI+GGKP S Y VG QE+ IYLDPH Q V++ +
Sbjct: 462 YIPCVKGLLSQDSCIGIIGGKPKHSLYFVGWQEDKLIYLDPHYCQDVVDTRERHFP--IQ 519
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA---EESNGAPLFTVTQTH 444
+YH R + +D IDPS IGFYCR++ +F+ F + ++ ++ P+F + H
Sbjct: 520 SYHCMSPRKVSIDKIDPSCTIGFYCRNQKEFEKFVQQTEEMVAPPKQRLSYPMFVFSDGH 579
Query: 445 KKPV 448
V
Sbjct: 580 SNEV 583
>gi|332226092|ref|XP_003262223.1| PREDICTED: cysteine protease ATG4A isoform 1 [Nomascus leucogenys]
Length = 398
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTAG 197
Query: 282 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310
Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 311 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|332026942|gb|EGI67039.1| Cysteine protease ATG4D [Acromyrmex echinatior]
Length = 392
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 126/402 (31%), Positives = 191/402 (47%), Gaps = 49/402 (12%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
S S +WLLG+C+ + L A+ N + EF +DF SR
Sbjct: 6 SKESPVWLLGLCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 65
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREY 206
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P Q + +
Sbjct: 66 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQSTDESSH 125
Query: 207 VEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRAE 260
I+ FGD T SPFSIH L+ G + G AG W GP + +C++ E RA
Sbjct: 126 RMIIKWFGDQPTPESPFSIHKLVSLGASTGKRAGDWYGPSSVAHLLCQAME------RAS 179
Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 320
+ +A+YV + V C D R ++LLVPL LG
Sbjct: 180 EDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGGR------------KALILLVPLRLG 227
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
+K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q +++ +
Sbjct: 228 ADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDVEGN 287
Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK--LAEESNGAPLF 438
+ + +++H R + L +DPS +GFY DK+ DF + + ++ P+F
Sbjct: 288 E-KFPLTSFHCTSPRKMLLSKMDPSCCVGFYFPDKESLTDFMETIQQFVIPNQNMDYPMF 346
Query: 439 TVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHE 480
+ K + + E G +P G SM D + E
Sbjct: 347 LFCEGSGKDLQQGIEVVE-GLLPSSSRFGHESMEDDLFECEE 387
>gi|345307034|ref|XP_001513122.2| PREDICTED: cysteine protease ATG4B-like [Ornithorhynchus anatinus]
Length = 461
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 177/372 (47%), Gaps = 53/372 (14%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
+T +W+LG + I ++ + D +SR+ +YRK F IG + TSD
Sbjct: 91 TTEPVWILGRKYTIFTEKE-----------DILSDVTSRLWFTYRKNFPAIGGTGPTSDT 139
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM+ AQALL LGR WR + Y +L+ F D + S +SIH + Q
Sbjct: 140 GWGCMLRCGQMIFAQALLCRHLGRDWRWKKGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQ 199
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G + G W GP + + + LA + +A+++ +
Sbjct: 200 MGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSS--------LAVHIAMDN---------T 242
Query: 289 VCIDDASRHC--------SVF-----------------SKGQADWTPILLLVPLVLGLEK 323
V I++ R C S F + W P++LL+PL LGL +
Sbjct: 243 VVIEEIRRLCKPNFPAGASAFPTDSEFLLNGFPSGAEVTNRPTQWKPLVLLIPLRLGLTE 302
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+N YI TL+ F PQSLG++GGKP ++ Y +G IYLDPH QP + I
Sbjct: 303 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQPAVEISGSCFI 362
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
D S + +++ +DPS+A+GF+C+ ++DF+D+C + KL+ P+F + +
Sbjct: 363 PDESFHCQHPPCRMNIVELDPSIAVGFFCKTEEDFNDWCQQVKKLSLIRGALPMFELVEH 422
Query: 444 HKKPVNHSDVLG 455
+ DVL
Sbjct: 423 QPSHFSSPDVLN 434
>gi|15487240|emb|CAC69076.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
Length = 398
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 110/348 (31%), Positives = 178/348 (51%), Gaps = 37/348 (10%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
D G+R + + S+ S + W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 193 ADTAGDRPPDSLTA-SNQSKGTSAYCTA---WKPLLLIVPLRLGINQINPVYVDAFKECF 248
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + + +
Sbjct: 249 KMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQR 308
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 309 MNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|387762879|ref|NP_001248420.1| cysteine protease ATG4A [Macaca mulatta]
gi|380809390|gb|AFE76570.1| cysteine protease ATG4A isoform a [Macaca mulatta]
gi|383413573|gb|AFH30000.1| cysteine protease ATG4A isoform a [Macaca mulatta]
Length = 398
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 178/351 (50%), Gaps = 43/351 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S HC W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 193 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 245
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|355705060|gb|EHH30985.1| Cysteine protease ATG4A, partial [Macaca mulatta]
Length = 396
Score = 194 bits (493), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 178/351 (50%), Gaps = 43/351 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 190
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S HC W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 191 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 243
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 244 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 303
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 304 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 353
>gi|355669953|gb|AER94691.1| ATG4 autophagy related 4-like protein A [Mustela putorius furo]
Length = 408
Score = 194 bits (493), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 175/343 (51%), Gaps = 27/343 (7%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 39 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 87
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 88 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 147
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 148 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTVG 207
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
E + +AS G+ W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 208 ESPPDTL----NASNQSKGTPAGRPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 263
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 264 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 323
Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 324 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 365
>gi|395854618|ref|XP_003799779.1| PREDICTED: cysteine protease ATG4A isoform 1 [Otolemur garnettii]
Length = 398
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 109/344 (31%), Positives = 177/344 (51%), Gaps = 29/344 (8%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTAG 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
E + ++ + S + W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ESPPGSLTALNQSKGT----SACRPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 400
LG +GGKP + Y +G I+LDPH Q ++ +++ D T+H + +++
Sbjct: 254 LGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSPQRMNIL 312
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 313 NLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|449266947|gb|EMC77925.1| Cysteine protease ATG4B, partial [Columba livia]
Length = 393
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 180/373 (48%), Gaps = 39/373 (10%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQVDNYFSVLNAFVDRKDSYYSIHQIAQMGVG 133
Query: 233 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 273
G + G W GP + + +W +LA E CQS A
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNAPCAGAAA 193
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
+ + DG G P ++A ++ W P++LL+PL LGL ++N YI TL+
Sbjct: 194 CPAVESDGLYNGCP----EEAG-----VRDRRSLWKPLVLLIPLRLGLTEINEAYIETLK 244
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEHNDSGCLPDESFHCQHP 304
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDV 453
+ + +DPS+A+GF+C ++DF+D+C + KL+ P+F + + ++ DV
Sbjct: 305 PCRMSIAELDPSIAVGFFCNTEEDFNDWCQQIKKLSLVRAALPMFELVERQPSHFSNPDV 364
Query: 454 LGETGGVPEDDSL 466
L T + D L
Sbjct: 365 LNLTPDSSDADRL 377
>gi|126338580|ref|XP_001366892.1| PREDICTED: cysteine protease ATG4B-like [Monodelphis domestica]
Length = 396
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 116/375 (30%), Positives = 177/375 (47%), Gaps = 58/375 (15%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
+T +W+LG + I +DE L D +SR+ +YRK F IG + TS
Sbjct: 25 TTDPVWILGRKYTIFTEKDEILSDV-------------TSRLWFTYRKNFPAIGGTGPTS 71
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR + Y +L+ F D + S +SIH +
Sbjct: 72 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQI 131
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA + +A+++ +
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDN-------- 175
Query: 287 PVVCIDDASRHCSV-FSKGQA-------------------------DWTPILLLVPLVLG 320
V ++D R C FS A W P++LL+PL LG
Sbjct: 176 -TVVMEDIRRLCKANFSHTDAAALPPDSDLLSNGYPPGAEVTDRLSQWRPLVLLIPLRLG 234
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
L +N Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH Q + +
Sbjct: 235 LTDINEAYTETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQAAVELSNG 294
Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
+ D S + +++ +DPS+A+GF+C+ ++DF+D+C + KL+ P+F +
Sbjct: 295 GVIPDESFHCQHPPCRMNIGELDPSIAVGFFCKSEEDFNDWCQQVKKLSRIPGALPMFEL 354
Query: 441 TQTHKKPVNHSDVLG 455
+ + DVL
Sbjct: 355 VEHQPSHFSCPDVLN 369
>gi|291407754|ref|XP_002720229.1| PREDICTED: autophagy-related cysteine endopeptidase 2 [Oryctolagus
cuniculus]
Length = 405
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 176/356 (49%), Gaps = 53/356 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 36 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 84
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 85 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 144
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 145 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 187
Query: 293 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 328
D + C V SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 188 DIKKMCCVLPLSANTPGERLHDSLTASNQSKGTSACCPAWKPLLLIVPLRLGINQINPVY 247
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ + F PQSLG +GGKP + Y +G I+LDPH Q ++ ++ D +
Sbjct: 248 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDTEENGTVDDQTF 307
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 308 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 362
>gi|224059752|ref|XP_002193231.1| PREDICTED: cysteine protease ATG4B [Taeniopygia guttata]
Length = 393
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/373 (32%), Positives = 178/373 (47%), Gaps = 39/373 (10%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQMDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 233 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 273
G + G W GP + + +W +LA E CQS A
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSHVPCAGAAA 193
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
+ + D G P +D + A W P++LL+PL LGL ++N YI TL+
Sbjct: 194 CPALESDVLYNGCP----EDVG-----LRERLALWKPLVLLIPLRLGLTEINEAYIETLK 244
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
F PQSLG++GGKP ++ Y +G E IYLDPH QP + G D S +
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPGDSGCLPDESFHCQHP 304
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDV 453
+ + +DPS+A+GF+C + DF+D+C + KL+ P+F + + ++ DV
Sbjct: 305 PCRMSIAELDPSIAVGFFCNTEADFNDWCQQIKKLSLVRGALPMFELVERQPSHFSNPDV 364
Query: 454 LGETGGVPEDDSL 466
L T + D L
Sbjct: 365 LNLTPDSSDADRL 377
>gi|326925776|ref|XP_003209085.1| PREDICTED: cysteine protease ATG4B-like [Meleagris gallopavo]
Length = 393
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 119/383 (31%), Positives = 177/383 (46%), Gaps = 59/383 (15%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 233 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 267
G + G W GP + + +W +LA CQ + G +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193
Query: 268 LPMA----IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
P +Y +E G R + W P++LL+PL LGL +
Sbjct: 194 CPTVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCL 294
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
D S + + + +DPS+A+GF+C ++DF+D+C + KL+ P+F + +
Sbjct: 295 PDESFHCQHPPCRMSIAELDPSIAVGFFCHTEEDFNDWCHQIKKLSLVRGALPMFELVER 354
Query: 444 HKKPVNHSDVLGETGGVPEDDSL 466
++ DVL T + D L
Sbjct: 355 QPSHFSNPDVLNLTPDSSDADRL 377
>gi|354500801|ref|XP_003512485.1| PREDICTED: cysteine protease ATG4A-like [Cricetulus griseus]
gi|344251116|gb|EGW07220.1| Cysteine protease ATG4A [Cricetulus griseus]
Length = 398
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 175/356 (49%), Gaps = 53/356 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLRTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKG--QAD----------------------WTPILLLVPLVLGLEKVNPRY 328
D + C V G AD W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCCVLPVGAHTADESPPDSLPASSQGKGPSATCPAWKPLLLIVPLRLGINQINPVY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G + I+LDPH Q ++ + + D +
Sbjct: 241 IEAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEESGIVDDETF 300
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + + + ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 301 HCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355
>gi|47087191|ref|NP_998738.1| cysteine protease ATG4B [Gallus gallus]
gi|61211779|sp|Q6PZ02.1|ATG4B_CHICK RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related cysteine endopeptidase 2B;
Short=Autophagin-2B; Short=cAut2B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|45861662|gb|AAS78584.1| AUT2B [Gallus gallus]
Length = 393
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 119/383 (31%), Positives = 177/383 (46%), Gaps = 59/383 (15%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 233 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 267
G + G W GP + + +W +LA CQ + G +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193
Query: 268 LPMA----IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
P +Y +E G R + W P++LL+PL LGL +
Sbjct: 194 CPAVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+N YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP +
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCL 294
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
D S + + + +DPS+A+GF+C ++DF+D+C + KL+ P+F + +
Sbjct: 295 PDESFHCQHPPCRMSIAELDPSIAVGFFCHTEEDFNDWCHQIKKLSLVRGALPMFELVER 354
Query: 444 HKKPVNHSDVLGETGGVPEDDSL 466
++ DVL T + D L
Sbjct: 355 QPSHFSNPDVLNLTPDSSDADRL 377
>gi|47564112|ref|NP_001001171.1| cysteine protease ATG4A [Bos taurus]
gi|61211781|sp|Q6PZ05.1|ATG4A_BOVIN RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related cysteine endopeptidase 2A;
Short=Autophagin-2A; AltName: Full=Autophagy-related
protein 4 homolog A; AltName: Full=bAut2A
gi|45861656|gb|AAS78581.1| Aut2a [Bos taurus]
Length = 398
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 111/344 (32%), Positives = 175/344 (50%), Gaps = 29/344 (8%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 281
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 400
LG +GGKP + Y +G + I+LDPH Q ++ +++ AD T+H + +++
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFHCLQPPQRMNIL 312
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 313 NLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|349605276|gb|AEQ00569.1| Cysteine protease ATG4A-like protein, partial [Equus caballus]
Length = 369
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 113/345 (32%), Positives = 182/345 (52%), Gaps = 33/345 (9%)
Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGCM
Sbjct: 1 WILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 49
Query: 174 LRSSQMLVAQALLFHRLGRP--WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
LR QM++AQAL+ LGR W K ++P +EY IL F D + +SIH + Q G
Sbjct: 50 LRCGQMMLAQALICRHLGRDLNWEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGV 107
Query: 232 AYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDED 280
G + G W GP A+ W +LA + + + + I +S D
Sbjct: 108 GEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTA 167
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
GE +P ++ ++R S S G W P+LL+VPL LG+ ++NP Y+ + F PQ
Sbjct: 168 GE---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQ 223
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHL 399
SLG +GGKP + Y +G + I+LDPH Q ++ +++ D T+H + +++
Sbjct: 224 SLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSPQRMNI 282
Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 283 LNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 326
>gi|345807894|ref|XP_538136.3| PREDICTED: cysteine protease ATG4A [Canis lupus familiaris]
Length = 398
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 179/343 (52%), Gaps = 27/343 (7%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D +R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDIRARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK REY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPREYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
G + G W GP A+ W +LA + + + + V+ S D G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAIYVSMDNTVVIEDIKKMCCVLPLSADTIG 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
E +P+ ++ +++ S + A W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 E---SPLNTLNASNQSKSAPASCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 313
Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|296470926|tpg|DAA13041.1| TPA: cysteine protease ATG4A [Bos taurus]
Length = 396
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 111/344 (32%), Positives = 175/344 (50%), Gaps = 29/344 (8%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 281
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 400
LG +GGKP + Y +G + I+LDPH Q ++ +++ AD T+H + +++
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFHCLQPPQRMNIL 312
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 313 NLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|351713264|gb|EHB16183.1| Cysteine protease ATG4B [Heterocephalus glaber]
Length = 475
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 120/367 (32%), Positives = 180/367 (49%), Gaps = 40/367 (10%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YR+ F IG + TS
Sbjct: 100 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 146
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 147 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQI 206
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
Q G G + G W GP A+ +W +LA + + I +
Sbjct: 207 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLA----VHVAMDNTVVMEEIRRLCR 262
Query: 278 DEDGERGGAPVVCIDDASRHCSVF----------SKGQADWTPILLLVPLVLGLEKVNPR 327
G A + DA RHC+ F S + W P++LL+PL LGL +N
Sbjct: 263 SSLPCSGAAALPA--DADRHCNGFPAPMEVTSRPSPSPSPWRPLVLLIPLRLGLTDINEA 320
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
Y+ TL+ F PQSLG++GGKP ++ Y +G + IYLDPH QP + + D +
Sbjct: 321 YVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGKELIYLDPHTTQPAVELTDGCFIPDET 380
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
+ + + +DPS+A+GF+C+ +DDF D+C + KL+ + P+F + +
Sbjct: 381 FHCQHPPCRMGIGELDPSIAVGFFCKTEDDFRDWCQQVRKLSLQGGALPMFELVEQQPSH 440
Query: 448 VNHSDVL 454
+ DVL
Sbjct: 441 LACPDVL 447
>gi|440790872|gb|ELR12135.1| autophagy protein 4, putative [Acanthamoeba castellanii str. Neff]
Length = 510
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 115/346 (33%), Positives = 169/346 (48%), Gaps = 62/346 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
F DF SR+ ++YR F IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR +
Sbjct: 115 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 174
Query: 200 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR- 255
+ Y E+L F D S SP+SIH + + G + + G W P + + L
Sbjct: 175 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLLVTE 233
Query: 256 ---------------CQRAETGLGC---------QSLPMAIYVV---------------S 276
R E C Q P+ + S
Sbjct: 234 HSPNGLKMYVPKDGIIYRKEVYQLCAVQPADGPAQHSPLRVDDDGGDTDHDGDTDGLESS 293
Query: 277 GDEDGERGGAP-----VVCIDDASRHCSVFSKGQAD------------WTPILLLVPLVL 319
D G P + D +S H + S +++ W P+++LVP+ L
Sbjct: 294 TDSMRHSHGNPGVPSTIEAGDYSSSHAELMSSAESECESLDDNFTELTWHPVIILVPVRL 353
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G++ +NP YIPTL+ F+FPQ LG++GGKP +S Y VG Q+ +Y+DPH VQP + +
Sbjct: 354 GIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMDPHFVQPTVKMDD 413
Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
D L +Y ++ + + D IDPSLA+GF C + +FDDFC A
Sbjct: 414 DPL-FPIESYRMEIPQAMSFDDIDPSLALGFLCSSQAEFDDFCLNA 458
>gi|332375955|gb|AEE63118.1| unknown [Dendroctonus ponderosae]
Length = 370
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 125/347 (36%), Positives = 169/347 (48%), Gaps = 44/347 (12%)
Query: 106 ISSSTSDIWLLGV-CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK- 163
I ST +WLLG H I N L QD S++ +YRK F PIG S
Sbjct: 26 IPQSTEPVWLLGKKYHAI------------NELNTIRQDIVSKLWFTYRKDFVPIGGSDG 73
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFS 222
TSD GWGCMLR QM++ QAL+ LGR W+ P + D Y+ IL F DS +PFS
Sbjct: 74 KTSDKGWGCMLRCGQMVLGQALMSIHLGRDWQWNPTTR--DATYLSILKKFEDSRKAPFS 131
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
IH + G + G G W GP + + + L + +AI+V +
Sbjct: 132 IHQIASMGISEGKEVGQWFGPNTVAQVLKKLVKFDEGND--------VAIHVALDN---- 179
Query: 283 RGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
VV I + C SK AD W P+LL+VPL LGL ++N Y+ L+ F
Sbjct: 180 -----VVIISEIRDLC--LSKETADVSTPHWKPLLLIVPLRLGLTQMNSIYLGGLKQCFQ 232
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL--EADTS-TYHSDVI 394
F QSLGI+GGKP ++ Y +G IY DPH Q ++G D E D +YH
Sbjct: 233 FKQSLGIIGGKPNSALYFIGYVGNEVIYFDPHTTQKAGSVGNKDTSEEKDVDLSYHCKHA 292
Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 441
+ + +DPS+A+ F CR + DF+D C ++ PLF V+
Sbjct: 293 SRMSMLGMDPSVAVCFLCRSEADFNDLCQNIKDQLIKTESQPLFEVS 339
>gi|281342750|gb|EFB18334.1| hypothetical protein PANDA_015152 [Ailuropoda melanoleuca]
Length = 373
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 112/357 (31%), Positives = 176/357 (49%), Gaps = 55/357 (15%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178
Query: 293 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 328
D + C V SKG W P+LL+VPL LG+ ++NP Y
Sbjct: 179 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 238
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ +++ D T
Sbjct: 239 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQT 297
Query: 389 YHS-DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+H + + + ++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 298 FHCLQSPQRMSILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 353
>gi|91083193|ref|XP_972923.1| PREDICTED: similar to Autophagy-specific protein, putative
[Tribolium castaneum]
gi|270006970|gb|EFA03418.1| hypothetical protein TcasGA2_TC013405 [Tribolium castaneum]
Length = 366
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 117/321 (36%), Positives = 166/321 (51%), Gaps = 26/321 (8%)
Query: 135 NGLAEFN---QDFSSRILISYRKGFDPIG-DSKITSDVGWGCMLRSSQMLVAQALLFHRL 190
N L E + QD S+I +YRK F PIG D +T+D GWGCMLR QM++AQAL+ L
Sbjct: 33 NALQELDTIRQDILSKIWFTYRKNFVPIGGDEGLTTDKGWGCMLRCGQMVLAQALVTLHL 92
Query: 191 GRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
GR W +P K D Y++IL F D +PFSIH + G + G W GP + +
Sbjct: 93 GRDWVWEPETK--DSTYLKILSKFVDKRQAPFSIHQIAMMGVSENKEVGQWFGPNTVAQV 150
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
+ L + +L + + E +C+ S CS DW
Sbjct: 151 LKKLVKYDEWSAIEMHIALDNTLIISDIRE---------LCLSQGSDGCS-----SGDWK 196
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+LL+VPL LGL+++NP Y L+ F F QSLG++GGKP + Y +G + IYLDPH
Sbjct: 197 PLLLIVPLRLGLQEINPIYASGLKKCFQFKQSLGVIGGKPNLALYFIGHVGDEVIYLDPH 256
Query: 370 DVQP---VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
Q V + ++ STYH I++ S+DPS+A+ F+C + +F+D C
Sbjct: 257 TTQKSGSVESKETEEEIELDSTYHCKYASRINILSMDPSVAVCFFCNTEGEFNDLCHSIK 316
Query: 427 KLAEESNGAPLFTVTQTHKKP 447
K E PLF + T++KP
Sbjct: 317 KDLIEPEKQPLFEI--TYEKP 335
>gi|301780424|ref|XP_002925628.1| PREDICTED: cysteine protease ATG4A-like [Ailuropoda melanoleuca]
Length = 429
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 174/356 (48%), Gaps = 53/356 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 60 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 108
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 109 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 168
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 169 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 211
Query: 293 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 328
D + C V SKG W P+LL+VPL LG+ ++NP Y
Sbjct: 212 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 271
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D +
Sbjct: 272 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTF 331
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + + + ++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 332 HCLQSPQRMSILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 386
>gi|426257739|ref|XP_004022480.1| PREDICTED: cysteine protease ATG4A [Ovis aries]
Length = 398
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 112/357 (31%), Positives = 178/357 (49%), Gaps = 55/357 (15%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHC--------------------SVFSKGQAD----WTPILLLVPLVLGLEKVNPRY 328
D + C S SKG + W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRTLSLSADTPAERPLESLTASTQSKGPSACCTAWKPLLLIVPLRLGINQINPVY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
+ + F PQSLG +GGKP + Y +G + I+LDPH Q ++ +++ D T
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQT 299
Query: 389 YHS-DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+H + +++ ++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 300 FHCLQPPQRMNILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|151554833|gb|AAI47963.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Bos taurus]
Length = 398
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 108/343 (31%), Positives = 172/343 (50%), Gaps = 27/343 (7%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 281
G + G W GP A+ W +LA + + + + +S D
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILN 313
Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355
>gi|225709006|gb|ACO10349.1| Cysteine protease ATG4B [Caligus rogercresseyi]
Length = 381
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 111/345 (32%), Positives = 176/345 (51%), Gaps = 44/345 (12%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
S S +W+LG + + + + E N + SR L +YRK F I DS TSD
Sbjct: 28 SDSPVWILG-----------NELSARDDVEELNSEVLSRFLFTYRKEFLEIEDSGYTSDS 76
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD----REYVEILHLFGDSETSPFSIH 224
GWGCMLR QM++A+AL LGR W+ Q+ D ++Y++IL LF DS+ +P+S+H
Sbjct: 77 GWGCMLRCGQMVLAEALQRVSLGREWKWSSQETLDNDQSQKYLQILKLFQDSKAAPYSLH 136
Query: 225 NLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
+ G++ G+W GP + + L + +ET + P+ ++V +
Sbjct: 137 QIALMGESIQSKKPVGTWFGPNTIA---QVLRKLSVSET-----TNPIRVHVAMDN---- 184
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
V +D+ C F + P+LL +PL LGL ++NP Y L+ F FPQ L
Sbjct: 185 -----TVIVDEIKESCG-FIGDPSQGKPLLLFIPLRLGLTEINPIYFQDLKECFEFPQIL 238
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPH-----DVQPVINIGKDDLEADTSTYHSDVIRHI 397
G++GG+P + Y +G + IYLDPH V+ +G ++ TYH+D +
Sbjct: 239 GVIGGRPNHALYFIGYMDNELIYLDPHVATQTSTPQVVTLGG----SEDKTYHTDRAYRM 294
Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
+DPSL++ F C+D+ +F+D C R + +PLF + +
Sbjct: 295 DFKDLDPSLSLCFLCKDESEFEDMCERFLFKLIRGHNSPLFEICR 339
>gi|187282046|ref|NP_001119770.1| uncharacterized protein LOC678769 [Rattus norvegicus]
gi|169642267|gb|AAI60890.1| LOC678769 protein [Rattus norvegicus]
Length = 406
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 175/364 (48%), Gaps = 61/364 (16%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 327
D + C V G AD W P+LL+VPL LG+ ++NP
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
YI + F PQSLG +GGKP + Y +G + I+LDPH Q ++ + L D +
Sbjct: 241 YIEAFKECFKMPQSLGALGGKPNNAYYFIGSLGDELIFLDPHTTQTFVDTEESGLVDDHT 300
Query: 388 TYHSDVIRHIHLDSIDPSLAI-------GFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
+ + + + ++DPS+A+ GF+C+++ DFD++C+ K + N +F +
Sbjct: 301 FHCLQSPQRMSILNLDPSVALVGQGAFMGFFCKEEKDFDNWCSLVQKEILKEN-LRMFEL 359
Query: 441 TQTH 444
Q H
Sbjct: 360 VQKH 363
>gi|345329187|ref|XP_003431344.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4A-like
[Ornithorhynchus anatinus]
Length = 436
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 172/359 (47%), Gaps = 54/359 (15%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 68 VWILGRQHHLKAEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 116
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W K EY +IL F D + +SIH + Q G
Sbjct: 117 MLRCGQMMLAQALICRHLGRDWCWEKHKKQPEEYHKILQCFLDRKDCCYSIHQMAQMGVG 176
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 177 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 219
Query: 293 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 328
D + C + +G A W P+LL+VPL LG+ +NP Y
Sbjct: 220 DIKKMCRLLPQGSGMAQDGPPLHLSALGRSKNASGYCAIWKPLLLIVPLRLGINHINPIY 279
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 280 IDAFKECFKTPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQTFVDTEENGQVDDHSF 339
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
+ + + + ++DPS+A+GF+C+++ DFD++C+ K +F + Q K+P
Sbjct: 340 HCQQAPQRMKIMNLDPSVALGFFCKEEKDFDNWCSLVQKEILRQQSLRMFELVQ--KRP 396
>gi|198438023|ref|XP_002129793.1| PREDICTED: similar to CG6194 CG6194-PA [Ciona intestinalis]
Length = 517
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 113/346 (32%), Positives = 169/346 (48%), Gaps = 39/346 (11%)
Query: 111 SDIWLLGVCHKIAQDEALGDAAGN----------------NGLAEFNQDFSSRILISYRK 154
S +WLLG C+ + + D + N L F DF S++ +YRK
Sbjct: 67 SPLWLLGKCYHLKKPSLSSDTSENAEGSQQSTSESYNMLPKHLKLFLVDFHSKLWFTYRK 126
Query: 155 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYV--EILH 211
GF + D+ +TSD GWGCMLR++QM++AQ+ + H LGR WR P + ++ + I+
Sbjct: 127 GFPTLNDTNLTSDTGWGCMLRTAQMMIAQSFIVHLLGRNWRWTPSRLSMEQSDIHRNIIT 186
Query: 212 LFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
F D + PFS+H L + G +Y G+W GP + C + +T L L
Sbjct: 187 WFLDEQNIRCPFSLHQLTEIGLSYRCKPGNWYGPNTAAYIMQDALECAKGKTEL----LN 242
Query: 270 MAIYVVSGDEDGERGGAPVVC-----IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ ++ D +C DA S S ++ +++L+P+ LG +
Sbjct: 243 NIMVYIAQDSTVYIDDVIEMCEWKNTASDADLKTSTTSSNRS----VIVLIPVRLGEATL 298
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG--KDDL 382
NP YIP ++ T QS+GI+GGKP S Y +G Q+E YLDPH Q + K+DL
Sbjct: 299 NPIYIPCIQSMLTLDQSVGIMGGKPKHSLYFIGFQDEYLFYLDPHYCQQADHPAAFKNDL 358
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
YH + R ++ +DPS +GFYCRD DF F A+K
Sbjct: 359 ---LQNYHCNSPRKTNISKMDPSCCLGFYCRDYKDFQSFVCEANKF 401
>gi|195995623|ref|XP_002107680.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
gi|190588456|gb|EDV28478.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
Length = 385
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 117/352 (33%), Positives = 167/352 (47%), Gaps = 54/352 (15%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVG 169
+WLLG C+ N L EF++ D +S+ +YRK + PIG TSD G
Sbjct: 25 VWLLGCCY--------------NPLEEFDKLIADINSKFWFTYRKNYPPIGGIGPTSDKG 70
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCMLR QM++ QAL+ LGR WR K Y +IL LF DS+ S +SIH + Q
Sbjct: 71 WGCMLRCGQMILGQALVMRHLGRDWRWFKNKEQLANYWKILKLFLDSKDSLYSIHQIAQM 130
Query: 230 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
G + G W GP + + L M +YV + +V
Sbjct: 131 GVSEGKKISQWFGPNTAAQVLKKLIMFDEWSQ--------MGVYVAMDN---------IV 173
Query: 290 CIDDASR----HCSVFSKGQA--------------DWTPILLLVPLVLGLEKVNPRYIPT 331
IDD + H + S+G A W P+LL +PL LGL +NP Y
Sbjct: 174 VIDDIKKICHNHITRTSQGNAANSDAQGSSNEQSNAWKPLLLFIPLRLGLTDLNPIYKDK 233
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
L F +LGI+GGKP ++ Y +G+Q + +YLDPH VQ + + K + TYH
Sbjct: 234 LNKCFRIKNTLGIIGGKPNSAHYFIGIQGDYLLYLDPHTVQETVKV-KPNCPFSDKTYHQ 292
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA-EESNGAPLFTVTQ 442
+H +DPS+A+GFY +++F++ C + + S PLF V +
Sbjct: 293 KGTNRLHFSYMDPSVALGFYSATEEEFNELCRDFTDVCILNSAQPPLFEVVE 344
>gi|426339171|ref|XP_004033533.1| PREDICTED: cysteine protease ATG4B isoform 3 [Gorilla gorilla
gorilla]
Length = 379
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 109/323 (33%), Positives = 163/323 (50%), Gaps = 25/323 (7%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
QP + D S + + + +DPS+A+GF+C+ +DDF+D+C + KL+
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLL 328
Query: 432 SNGAPLFTVTQTHKKPVNHSDVL 454
P+F + + + DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351
>gi|357620505|gb|EHJ72670.1| putative Autophagy-specific protein [Danaus plexippus]
Length = 383
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 116/350 (33%), Positives = 177/350 (50%), Gaps = 34/350 (9%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + ++W+LG + QD L +D +S I +YRKGF PIGD +T
Sbjct: 22 IPETKDNVWVLGKKYSAIQD-----------LERIRRDITSVIWCTYRKGFVPIGDEGLT 70
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 223
SD GWGCMLR QM++ AL+ L W + P R+ Y++I+ + + +P+SI
Sbjct: 71 SDKGWGCMLRCGQMVLGVALIKVHLSADW---VWTPETRDPTYLKIVQRLEERKQAPYSI 127
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G G G W GP + + + L + + + I+V + +
Sbjct: 128 HQVALMGACEGKEVGQWFGPNTVAQVLKKLVVYDKWSS--------LVIHVALDNTVVKE 179
Query: 284 GGAPVVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
+++ CS G +DW P+LL+VPL LGL ++NP Y+ L++ F PQS
Sbjct: 180 DILQQCIVNNDRGDCSENVDGFVVSDWMPLLLIVPLRLGLSEINPIYMEGLKICFQSPQS 239
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQP---VINIGKDDLEADTSTYHSDVIRHIH 398
+G++GGKP + Y++G + IYLDPH Q V N D+ + TYH I
Sbjct: 240 IGVIGGKPNQALYLIGCVGDEVIYLDPHTTQKSGLVENKLTDEQKEMDCTYHCKYASRIP 299
Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQTHKKP 447
+ S+DPS+A+ F CR + DFD+ C K L +ES PLF + + K+P
Sbjct: 300 ILSMDPSVAVCFLCRTRSDFDELCELIEKRLMQESQ--PLFEICE--KRP 345
>gi|397483837|ref|XP_003813097.1| PREDICTED: cysteine protease ATG4B isoform 4 [Pan paniscus]
Length = 379
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 109/323 (33%), Positives = 163/323 (50%), Gaps = 25/323 (7%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPL 208
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
QP + D S + + + +DPS+A+GF+C+ +DDF+D+C + KL+
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLL 328
Query: 432 SNGAPLFTVTQTHKKPVNHSDVL 454
P+F + + + DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351
>gi|325184648|emb|CCA19140.1| cysteine protease family C54 putative [Albugo laibachii Nc14]
Length = 459
Score = 187 bits (476), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 125/370 (33%), Positives = 184/370 (49%), Gaps = 48/370 (12%)
Query: 107 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
S ++S +WLLG C+ QD D+ + ++ F S + +YR+ F+ + TS
Sbjct: 68 SQNSSKLWLLGDCYS-PQDFDNFDSMKD----AYHDAFESILWYTYRRDFETMVPYDFTS 122
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----LQKPFDREYVEILHLFGDS--ETS 219
D GWGCMLRS+QML+++A + LG W+ P L+ P + YV++L F DS
Sbjct: 123 DAGWGCMLRSAQMLLSEAFKRNMLGIKWKIPARSEDLELP--KVYVKLLKWFVDSFDTEC 180
Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
+SIHN+ + G Y G W GP A+ R L Q P V+ +
Sbjct: 181 KYSIHNITRIGMQYDKLPGEWYGP-------TTAAQALRDLVNLHAQESPECNLVMYVPQ 233
Query: 280 DGERGGAPV--VCI---DDASRHCSVFSKGQADWT---------------------PILL 313
DG V +CI D + +V + Q+D T +L+
Sbjct: 234 DGVVYTKDVNELCISHLDQENTFVNVNEETQSDGTFPDPLLHPPTDRDNSEKMWQKSLLI 293
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
L+PL LGL+ +NPRY+P ++ F FPQ++GI+GGK G S Y VG + LDPHD+ P
Sbjct: 294 LIPLRLGLDSINPRYLPAIQRVFEFPQNVGIIGGKKGHSVYFVGTFDSKLQLLDPHDIHP 353
Query: 374 VINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES 432
++ A T HS + + L SIDPSLA+GFYC D+ D+ DF R ++ E
Sbjct: 354 TADLNTAFPTATHLRTVHSRLPLEMSLGSIDPSLALGFYCSDRKDYLDFVDRVDRVQSEL 413
Query: 433 NGAPLFTVTQ 442
GA F++ +
Sbjct: 414 GGALPFSIAK 423
>gi|340369400|ref|XP_003383236.1| PREDICTED: cysteine protease ATG4A-like [Amphimedon queenslandica]
Length = 394
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 159/312 (50%), Gaps = 40/312 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
++LLGV + + +D A F +D SR +YRK F PIGD+ TSD GWGC
Sbjct: 45 VYLLGVKYDLPRDGA-----------SFVEDLQSRFWFTYRKNFRPIGDTGYTSDSGWGC 93
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
LR QML+ LL LGR WR D +Y +IL +F D S +SI + G
Sbjct: 94 TLRCGQMLLGHTLLLRHLGRDWRWSPSSSNDYKYQKILRMFLDYRDSEYSIQMIALQGAD 153
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
+G + G W GP + ++ + LA + Q +A+YV +V ID
Sbjct: 154 FGRSVGQWFGPNNVAQAIKRLA--------VHDQWSEVAVYVAMD---------MLVVID 196
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D S ++ P+L+ +PL LG E+ N Y ++ F QS+GI+GGKP +
Sbjct: 197 DIS-----------NFRPVLVFIPLRLGQERFNMEYKEAVKACFAVRQSVGIIGGKPRHA 245
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
+ G ++ IYLDPH Q + + + +D STYH+ I +H+ +DPSLA+GF+C
Sbjct: 246 LWFTGYHDDYLIYLDPHKTQSCVTLPDAGIVSD-STYHTTQIERLHISELDPSLALGFFC 304
Query: 413 RDKDDFDDFCAR 424
+ + D DD C +
Sbjct: 305 QTEADLDDLCDK 316
>gi|339249735|ref|XP_003373855.1| cysteine protease ATG4B [Trichinella spiralis]
gi|316969943|gb|EFV53966.1| cysteine protease ATG4B [Trichinella spiralis]
Length = 410
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 123/381 (32%), Positives = 179/381 (46%), Gaps = 64/381 (16%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
+ S ++W++G ++ Q + D ++ SR+ +YRK F PIG +
Sbjct: 28 LFKSGGEVWIVG---RVWQTQDFDD---------IKKEIRSRMWFTYRKSFSPIGGTGPI 75
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIH 224
SD GWGCMLR QML+AQAL+ LGR W+ P + D YV IL +F D + +SIH
Sbjct: 76 SDSGWGCMLRCGQMLLAQALICRHLGREWQWSPSCR--DEAYVRILRMFQDKKNELYSIH 133
Query: 225 NLLQAGKAYGLAAGSWVGP---------YAMCRSWEALA----------------RCQR- 258
+ + G++ G G W GP A+ W +LA C R
Sbjct: 134 MIAKMGESEGKEIGKWFGPSTIAHVIKKLAIYDDWSSLAVHVAMDNVIVQEDVKKLCSRE 193
Query: 259 ---AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
A Q P I V ED + V C + +S W P+LL++
Sbjct: 194 VFDALRKRLLQEEPSEI-VADWFEDARKDNKKVDCANLSS-----------PWKPLLLIL 241
Query: 316 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
P+ LGL ++NP YIP L+ F ++G++GGKP + Y +G ++ +YLDPH Q +
Sbjct: 242 PMRLGLSELNPCYIPALKEFFACKYNIGMIGGKPNHALYFIGAYKDRLVYLDPHWCQTFV 301
Query: 376 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN-- 433
++ D S+YHS I I + IDPSLAI FY + +FDDFC A ++ N
Sbjct: 302 DLDVSMDLFDDSSYHSAFILDISFNEIDPSLAIAFYINTEAEFDDFCTFAKQVCLVGNFR 361
Query: 434 ------GAPLFTVTQTHKKPV 448
LF V Q + P+
Sbjct: 362 CFSSGSMVQLFQVLQKYPNPL 382
>gi|297669945|ref|XP_002813144.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pongo abelii]
Length = 378
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 109/323 (33%), Positives = 162/323 (50%), Gaps = 25/323 (7%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 36 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 95
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 96 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 148
Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 149 -LAVHIAMDNTVVMEEIRRLCRNSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 207
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 208 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 267
Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
QP + D S + + + +DPS+A+GF+C+ +DDF D+C + KL+
Sbjct: 268 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLL 327
Query: 432 SNGAPLFTVTQTHKKPVNHSDVL 454
P+F + + + DVL
Sbjct: 328 GGALPMFELVEQQPSHLACPDVL 350
>gi|241999098|ref|XP_002434192.1| cystein protease, putative [Ixodes scapularis]
gi|215495951|gb|EEC05592.1| cystein protease, putative [Ixodes scapularis]
Length = 382
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 167/332 (50%), Gaps = 43/332 (12%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
L + D +S+I ++YRK F IG + TSD GWGCMLR QM++AQAL+ LGR WR
Sbjct: 35 LDDLRSDVTSKIWLTYRKNFPAIGGTGPTSDSGWGCMLRCGQMVLAQALMRRHLGREWRW 94
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
+P K +++Y+ IL +F D + FSIH + Q G + G G W GP + LA
Sbjct: 95 EPGTK--NKDYLYILRMFQDKKNCTFSIHQIAQMGVSEGKTVGEWFGPNTVAHVLRKLAI 152
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR-HCSVFS------------ 302
+ + +AI+V + V I++ S+ C +++
Sbjct: 153 FDKWSS--------LAIHVAMDN---------TVIINEISKFRCHIWAAADGLVRNRTNS 195
Query: 303 ------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
+ W P+LL +PL LGL ++N Y L+ TF QSLG++GGKP + Y +
Sbjct: 196 EPSRPANSEGSWKPLLLFIPLRLGLSEINRIYAFGLKRTFALKQSLGMIGGKPNHALYFI 255
Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
GV E+ I+LDPH Q ++ D D +YH +++ +DPS+A+ FY +
Sbjct: 256 GVVEDELIFLDPHTTQLACDLDVD--SPDDQSYHCAHASRMNISELDPSVALCFYMATES 313
Query: 417 DFDDFCARASKLAEESNGAPLFTVTQTHKKPV 448
DFD +C K PLF +TQ +PV
Sbjct: 314 DFDVWCNLVQKHLISRMQQPLFEITQ--DRPV 343
>gi|348666332|gb|EGZ06159.1| hypothetical protein PHYSODRAFT_532364 [Phytophthora sojae]
Length = 398
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 115/309 (37%), Positives = 161/309 (52%), Gaps = 31/309 (10%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 197
+ + F + + +YR+ F + TSD GWGCMLRS+QML+ QAL LGR WR P
Sbjct: 41 YKRSFEAILWFTYRRDFPQMTPYDFTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPAL 100
Query: 198 ----LQKPFDREYVEILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
+ +YV +L F DS +SIH++++ G Y G W GP +
Sbjct: 101 FEAEIDARLPDKYVTLLRWFADSPDIECRYSIHHMVKLGMQYDKLPGEWYGPTTAAQVLR 160
Query: 252 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV-------FSKG 304
L R E G +A+YV ++G VV DD +R C ++
Sbjct: 161 DLVNLHRREFGG-----ELAMYV---PQEG------VVYTDDVTRLCFFDPLLHPPTAED 206
Query: 305 QADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
+DW T +L+L+PL LGL++VN RY+P L TF FPQS+GI+GGK G S Y VG Q++
Sbjct: 207 SSDWSTALLILIPLRLGLDQVNERYVPALEKTFAFPQSVGIIGGKKGHSVYFVGTQQDQL 266
Query: 364 IYLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
LDPHDV P + A T HS +++ IDPSLA+GF C ++ D++DF
Sbjct: 267 HLLDPHDVHPAPELNPAFPTATHLRTVHSSRPLVMNVTGIDPSLALGFLCDNRADYEDFE 326
Query: 423 ARASKLAEE 431
R L +E
Sbjct: 327 RRVRILHDE 335
>gi|383860522|ref|XP_003705738.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like
[Megachile rotundata]
Length = 518
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 119/387 (30%), Positives = 178/387 (45%), Gaps = 58/387 (14%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
S S +WLLG ++ +E L A+ + + EF +DF+SR
Sbjct: 126 SKESPVWLLGKIYRKKPEEFLEKASEAEKTLDTGSEISLAMDAISFEDSIEEFKKDFTSR 185
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P E
Sbjct: 186 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWQPDQPIKTEQQ 245
Query: 208 E--------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
+ I+ FGD SPFSIH L+ G +G AG W GP ++A
Sbjct: 246 KLDESNHRFIIQSFGDLPERISPFSIHTLVSLGALWGKRAGDWYGP-------SSVAHLL 298
Query: 258 RAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 313
+ LP +A+YV V + D C + W ++L
Sbjct: 299 SQAVEHAAEHLPIFSNLAVYVAQD---------CAVYLQDVESVCQM---PDGKWKSLIL 346
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
VPL LG +K+NP Y L T +G++GG+P S Y +G QE+ I LDPH Q
Sbjct: 347 FVPLRLGTDKLNPVYTSCLTHLLTLDTCIGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE 406
Query: 374 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL---AE 430
+++ KD+ +++H R + + +DPS +GFY DK+ F +F A +
Sbjct: 407 TVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHDKNQFTNFMEIAPSYLVPED 464
Query: 431 ESNGAPLFTVTQTHKKPVNHSDVLGET 457
E P+F + K ++ + ET
Sbjct: 465 EKVDYPMFLFCEGSGKDLHQQIEIAET 491
>gi|213626921|gb|AAI70397.1| APG4A protein [Xenopus laevis]
Length = 395
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 160/321 (49%), Gaps = 23/321 (7%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 43 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALICQHLGRDWQWE 102
Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 305
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213
Query: 306 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
IYLDPH Q ++ + D + + + + +DPS+A+GF+C+D+++F+++C
Sbjct: 274 IYLDPHTTQTFVDTEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 333
Query: 424 RASKLAEESNGAPLFTVTQTH 444
K + +F + H
Sbjct: 334 VIEKEILKHQSLRMFELIPKH 354
>gi|163914473|ref|NP_001106295.1| APG4A protein [Xenopus laevis]
gi|161611704|gb|AAI55873.1| APG4A protein [Xenopus laevis]
Length = 395
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 159/321 (49%), Gaps = 23/321 (7%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 43 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 102
Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 305
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213
Query: 306 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
IYLDPH Q + + D + + + + +DPS+A+GF+C+D+++F+++C
Sbjct: 274 IYLDPHTTQTFVETEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 333
Query: 424 RASKLAEESNGAPLFTVTQTH 444
K + +F + H
Sbjct: 334 VIEKEILKHQSLRMFELIPKH 354
>gi|50417810|gb|AAH78135.1| APG4A protein, partial [Xenopus laevis]
Length = 392
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 159/321 (49%), Gaps = 23/321 (7%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR W+
Sbjct: 40 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 99
Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 100 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 159
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 305
+ +A+Y VS D +C + A+ H +S+ +
Sbjct: 160 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 210
Query: 306 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G +
Sbjct: 211 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 270
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
IYLDPH Q + + D + + + + +DPS+A+GF+C+D+++F+++C
Sbjct: 271 IYLDPHTTQTFVETEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 330
Query: 424 RASKLAEESNGAPLFTVTQTH 444
K + +F + H
Sbjct: 331 VIEKEILKHQSLRMFELIPKH 351
>gi|452977855|gb|EME77619.1| hypothetical protein MYCFIDRAFT_191078 [Pseudocercospora fijiensis
CIRAD86]
Length = 445
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 111/307 (36%), Positives = 159/307 (51%), Gaps = 45/307 (14%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 175
+EF DF SR+ I+YR F PI S TSD GWGCM+R
Sbjct: 109 SEFLDDFESRVWITYRDAFPPIPKSSHPAAASKMSFTTKLRNFTNQAGFTSDTGWGCMIR 168
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
S Q L+A ++ HRLGR WRK + +RE+ +IL LF D+ +PFSIH ++ G +A G
Sbjct: 169 SGQSLLANTIVVHRLGRDWRKGQK---EREHKDILSLFADTPDAPFSIHKFVEHGAQACG 225
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A ARC RA T Q+ + +Y D D V ID A
Sbjct: 226 TYPGEWFGP-------NATARCLRALTDKYHQA-GLRVYARPNDSD--------VYID-A 268
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ ++ P L+++ + LG+EKV P Y L+ PQS+GI GG+P +S Y
Sbjct: 269 LTATATQKDANDEFQPTLIVLGIRLGIEKVTPAYHAALKAALELPQSMGIAGGRPSSSHY 328
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
VG Q ++ YLDPH +P+++ + DT H+ +R + L +DPS+ +GF R
Sbjct: 329 FVGHQGDNFFYLDPHTTRPMLSPQPSAEDVDTC--HTRRVRRLSLAEMDPSMLLGFLVRS 386
Query: 415 KDDFDDF 421
K+DF+++
Sbjct: 387 KEDFEEW 393
>gi|194389756|dbj|BAG60394.1| unnamed protein product [Homo sapiens]
Length = 379
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 108/323 (33%), Positives = 162/323 (50%), Gaps = 25/323 (7%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L S+R+ + G + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 37 LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
+L+ F D + S +SIH + Q G G + G W GP + + + LA +
Sbjct: 97 VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149
Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
+A+++ V +E V C D+ RHC+ F G + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
+LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +G E IYLDPH
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268
Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
QP + D S + + + +DPS+A+G +C+ +DDF+D+C + KL+
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGSFCKTEDDFNDWCQQVKKLSLL 328
Query: 432 SNGAPLFTVTQTHKKPVNHSDVL 454
P+F + + + DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351
>gi|406042044|gb|AFS31124.1| autophagy related protein Atg4-like protein, partial [Spodoptera
litura]
Length = 365
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 116/351 (33%), Positives = 176/351 (50%), Gaps = 33/351 (9%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + +W+LG + QD L +D +S I +YRKGF PIGD +T
Sbjct: 5 IPQTKESVWILGKKYSAIQD-----------LDRIRRDITSIIWCTYRKGFIPIGDEGLT 53
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 223
SD GWGCMLR QM++ AL+ L W + P R+ Y++I+ F + + +P+SI
Sbjct: 54 SDKGWGCMLRCGQMVLGVALVRVHLSADW---VWTPETRDPTYLKIIQRFEERKQAPYSI 110
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G + G G W GP + + + L + + + I+V + +
Sbjct: 111 HQVALMGASEGKQVGQWFGPNTVAQVLKKLTVYDKWSS--------LVIHVALDNTVVKE 162
Query: 284 GGAPVVCIDDASRHCSVFSKGQA-DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
+++ CS DW P+LL+VPL LGL ++NP YI L++ F PQS+
Sbjct: 163 DILQQCVVNNDRGDCSAAPDSLVTDWMPLLLIVPLRLGLSEINPIYIDGLKICFQCPQSI 222
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQP---VINIGKDDLEADTSTYHSDVIRHIHL 399
G++GGKP + Y+VG + IYLDPH Q V D+ + +YH I +
Sbjct: 223 GVIGGKPNQALYLVGCVGDEVIYLDPHTTQRSGLVETKTTDEQKEMDWSYHCKYASRIPM 282
Query: 400 DSIDPSLAIGFYCRDKDDFDDFCAR-ASKLAEESNGAPLFTVTQTHKKPVN 449
++DPS+A+ F CR K DF++ CA +KL ES PLF + K+P +
Sbjct: 283 LAMDPSVAVCFLCRTKRDFEELCATIETKLMCESQ--PLFETCE--KRPAH 329
>gi|426218487|ref|XP_004003478.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4B [Ovis
aries]
Length = 454
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 117/358 (32%), Positives = 168/358 (46%), Gaps = 42/358 (11%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 69 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 115
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +
Sbjct: 116 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYCRVPP--------------- 160
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
Q G G + G W GP + + + LA A + L V++ R G
Sbjct: 161 -QMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-AWSALAVHVAMDNTVVMADVRRLCRSGL 218
Query: 287 PVVCID----DASRHCSVFSKG------QADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
P + D+ RHC+ F G A W P++LL+PL LGL VN Y TL+ F
Sbjct: 219 PCAGAEAFPADSERHCNGFPAGAEGGECTAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 278
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 279 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 338
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 339 MSITELDPSIAVGFFCKTEDDFNDWCQQVRKLSLLGGALPMFELVEQQPSHLACPDVL 396
>gi|307205961|gb|EFN84087.1| Cysteine protease ATG4D [Harpegnathos saltator]
Length = 456
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 117/377 (31%), Positives = 179/377 (47%), Gaps = 51/377 (13%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
S S +WLLG C+ ++ L +A+ N + EF +DF+SR
Sbjct: 62 SKESPVWLLGQCYLKKSEDPLENASEALEPEGTGSQVSLAMDATNFENTIEEFKRDFASR 121
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KP-------LQ 199
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR W+ +P Q
Sbjct: 122 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWKWRPEQSIENTQQ 181
Query: 200 KPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
D + I+ F D SPFSIH L+ G + G AG W GP ++ L++
Sbjct: 182 MRDDSNHRMIIKWFADQSKPESPFSIHRLVSLGASTGKRAGDWYGPNSVAH---LLSQAV 238
Query: 258 RAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
L L +A+YV V + D C G W ++LLVP
Sbjct: 239 ERTGELPNSKLSRLAVYVAQD---------CAVYMQDVEEVCRTSDGG---WKSLILLVP 286
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
L+LG +K+NP Y P + T +G++GG+P S Y +G Q++ I+LDPH Q ++
Sbjct: 287 LMLGTDKLNPVYAPCVTSLLTLDACIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVD 346
Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA- 435
+ K++ +++H R + L +DPS +GFY +++ DF SN
Sbjct: 347 VSKENFPL--TSFHCTSPRKMLLSKMDPSCCVGFYFPNRESLTDFMETIHSFVIPSNQKT 404
Query: 436 --PLFTVTQTHKKPVNH 450
P+F + KK +
Sbjct: 405 DYPMFLFCEGSKKDLQQ 421
>gi|195054945|ref|XP_001994383.1| GH16873 [Drosophila grimshawi]
gi|193892146|gb|EDV91012.1| GH16873 [Drosophila grimshawi]
Length = 673
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 113/318 (35%), Positives = 161/318 (50%), Gaps = 21/318 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + + + G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 253 AAENQVTECPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLIA 312
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S+ SPFSIH L++ G+ G
Sbjct: 313 QGLICHFLGRSWRYDPESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 372
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E
Sbjct: 373 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAQDCTIYMQDVEQQCSIPEPAPKQ 432
Query: 288 VVCIDDASRHCSVFSK----GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V A + S K Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 433 HVPWQHAKKSTSDAPKLDQPPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 492
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
I+GGKP S Y VG QE+ I+LDPH Q ++++ ++ ++H R I +D
Sbjct: 493 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETF--SMHSFHCKSPRKIKSSKMD 550
Query: 404 PSLAIGFYCRDKDDFDDF 421
PS IGFYC K DFD F
Sbjct: 551 PSCCIGFYCATKTDFDSF 568
>gi|301104974|ref|XP_002901571.1| cysteine protease family C54, putative [Phytophthora infestans
T30-4]
gi|262100575|gb|EEY58627.1| cysteine protease family C54, putative [Phytophthora infestans
T30-4]
Length = 392
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 118/338 (34%), Positives = 172/338 (50%), Gaps = 26/338 (7%)
Query: 104 TGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK 163
T ++ ++ +WLLG K D A D + + F S + +YR+ + + +
Sbjct: 14 TPSAALSAPVWLLG---KRYDDVAAVD------FDAYKRSFESILWFTYRRDYPAMTPYE 64
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGDSE 217
TSD GWGCMLRS+QML+ QAL LGR WR P + YV++L F DS
Sbjct: 65 HTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPALFETEIDARLPETYVQLLRWFADSP 124
Query: 218 TSP--FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
+SIH +++ G Y G W GP + L R E G VV
Sbjct: 125 DVECRYSIHQMVKLGVQYDKLPGEWYGPTTAAQVLRDLVNLHRREFGGELSMYVPQEGVV 184
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADW-TPILLLVPLVLGLEKVNPRYIPTLRL 334
D+ + +C D H ++ ++DW T +L+L+PL LGL++VN RY+P ++
Sbjct: 185 YSDDVAK------LCFFDPLLHPPT-TEDKSDWSTALLILIPLRLGLDQVNERYVPAIQK 237
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA-DTSTYHSDV 393
+F FPQS+GI+GGK G S Y VG Q++ LDPHDV P + A T HS
Sbjct: 238 SFAFPQSVGIIGGKKGHSVYFVGTQQDQLHLLDPHDVHPAPELNTAFPTATHLRTVHSSR 297
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
+++ +IDPSLA+GF C ++ D++DF R L +E
Sbjct: 298 PLVMNVTTIDPSLALGFLCENRVDYEDFERRVRILHDE 335
>gi|291202714|dbj|BAI82576.1| autophagy-related 4 [Haemaphysalis longicornis]
Length = 387
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 167/338 (49%), Gaps = 41/338 (12%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
L + D +S+I ++YR+ F I + TSD GWGCMLR QM VA+AL+ L R W+
Sbjct: 41 LDDLRSDVTSKIWLTYRRNFPAISGTDYTSDTGWGCMLRCGQMAVAEALMRRHLRRGWQW 100
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
P + D Y+ +L +F D + FSIH + Q G + G A G W GP + LA
Sbjct: 101 APGIR--DESYLRVLRMFQDKKNCTFSIHQIAQMGVSEGKAVGQWFGPNTVAHVLRKLAA 158
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA--------- 306
+ + +AI+V + VV +DD + C + + ++
Sbjct: 159 FDKWSS--------LAIHVAMDN---------VVIMDDIRKVCRLEATAESGVRNRAEPA 201
Query: 307 --------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
W P+LL +PL LGL ++NP Y L+ TF QSLGI+GGKP + YI+GV
Sbjct: 202 GLAAAAAESWKPLLLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYIIGV 261
Query: 359 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
+ ++LDPH Q +++ D D +YH + + +DPS+A+ FY + +F
Sbjct: 262 VGDDLVFLDPHTTQLAVDL--DTEFPDDESYHCAHASRMDIGQLDPSIALCFYLPTEAEF 319
Query: 419 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGE 456
D +C A K PLF +T+ +P+ D + E
Sbjct: 320 DSWCNLAHKHLISEMSQPLFEITE--HRPLGWPDFVDE 355
>gi|354475125|ref|XP_003499780.1| PREDICTED: cysteine protease ATG4D [Cricetulus griseus]
gi|344240088|gb|EGV96191.1| Cysteine protease ATG4D [Cricetulus griseus]
Length = 474
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 117/382 (30%), Positives = 179/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----VHLCGRRYHFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQKP------------- 201
+TSD GWGCMLRS QM++AQ LL H L R WR P + P
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLAPPEMPGPASPSRYRGPGR 193
Query: 202 --------------FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 HVPPRWTQGTLEMEQDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R C +P + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-KCSEVPRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS IGFY ++ +F+ C+ +
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTIGFYAGNRKEFETLCSELMR 414
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436
>gi|225718596|gb|ACO15144.1| Cysteine protease ATG4B [Caligus clemensi]
Length = 390
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/374 (31%), Positives = 184/374 (49%), Gaps = 54/374 (14%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
S S +W+LG + N +AE N + SR+L +YRK F I S TSD
Sbjct: 28 SDSPVWILG-----------NELCARNDIAELNSEVLSRLLFTYRKEFSEIDGSGYTSDS 76
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 222
GWGCMLR QM++ +AL LGR W+ + + +Y++IL+LF DS+ +P+S
Sbjct: 77 GWGCMLRCGQMVLGEALQRISLGRDWKWDHKVDNEVDEDLKGKYLKILNLFQDSKVAPYS 136
Query: 223 IHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
IH + G++ G+W GP + + + L+ ++ ++P+ ++V +
Sbjct: 137 IHQIALMGESIQSKKPVGTWFGPNTVAQVLKKLSFFEK--------TVPIRLHVAMDN-- 186
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
V ID+ C F G ++ P+LL +PL LGL ++NP Y L+ F FPQ
Sbjct: 187 -------TVIIDEIKESCG-FVGGDSE-KPLLLFIPLRLGLTEINPIYFQDLKECFEFPQ 237
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT------STYHSDVI 394
LG++GG+P + Y +G + IYLDPH I+ DT T+H++
Sbjct: 238 ILGVIGGRPNHALYFIGYVDNELIYLDPH-----ISTQSASSTVDTFGGPQDQTHHTERA 292
Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHS 451
+ +DPSL++ F CR++ +F+D C R + +PLF + + H P+ S
Sbjct: 293 YRMDFKDLDPSLSLCFLCRNESEFEDMCERFLFKLIRGHNSPLFEICRQRPEHLMPLPLS 352
Query: 452 DVLGE--TGGVPED 463
L VPE+
Sbjct: 353 SSLNSDLPNAVPEE 366
>gi|347971093|ref|XP_554420.4| AGAP004023-PA [Anopheles gambiae str. PEST]
gi|333469628|gb|EAL39379.4| AGAP004023-PA [Anopheles gambiae str. PEST]
Length = 606
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 115/318 (36%), Positives = 161/318 (50%), Gaps = 34/318 (10%)
Query: 136 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
G+ F +DF SRI ++YR+ F + DS TSD GWGCM+RS QML+AQ L+ H LGR WR
Sbjct: 195 GIDAFRRDFISRIWMTYRREFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLVAHFLGRSWR 254
Query: 196 KPLQKPFDRE---YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 250
+ E + +++ FGD S+TSPFSIH L+ GK G G W GP A+
Sbjct: 255 WDVSMFTAYEESIHRKVIRWFGDTSSKTSPFSIHTLVALGKESGKKPGDWYGPGAVAHLL 314
Query: 251 EALARCQRAET----GLGCQ-SLPMAIYV--------VSGDEDG---ERGGAPVVCIDDA 294
R E G+ + A+Y+ V G +R GAP +
Sbjct: 315 RQAVRLAAQEITDLDGINVYVAQDCAVYIQDILDECTVPATPAGAPWQRKGAPGGTNSSS 374
Query: 295 SRHCSVFSKG-----------QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
S + S A W ++LLVPL LG +K+NP Y L+ + +G
Sbjct: 375 STAHTERSGATSCAEGDEDVQSAHWKSLILLVPLRLGTDKLNPIYNECLKAMLSLDYCIG 434
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
I+GG+P S Y VG QE+ I+LDPH Q ++++ +D+ +++H R + L +D
Sbjct: 435 IIGGRPKHSLYFVGYQEDKLIHLDPHYCQDMVDVNQDNFP--VASFHCKSPRKMKLSKMD 492
Query: 404 PSLAIGFYCRDKDDFDDF 421
PS IGFYC K DF F
Sbjct: 493 PSCCIGFYCETKKDFYKF 510
>gi|350426238|ref|XP_003494376.1| PREDICTED: cysteine protease ATG4D-like [Bombus impatiens]
Length = 486
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 112/343 (32%), Positives = 167/343 (48%), Gaps = 30/343 (8%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191
Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
H LGR WR + +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 192 CHFLGREWRWQVDQPLKTEQQKLDEHNHRLIIKSFGDLPDSTSPFSIHTLVSLGALWGKR 251
Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 296
AG W GP ++ Q AE +L A+YV V + D
Sbjct: 252 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 299
Query: 297 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
C + W ++L VPL LG +K+NP Y L T +G++GG+P S Y +
Sbjct: 300 VCQM---PDGKWKSLILFVPLRLGADKLNPVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 356
Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY +K
Sbjct: 357 GFQEDKLINLDPHYCQETVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHNKM 414
Query: 417 DFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGE 456
F +F A +E P+F + K + + E
Sbjct: 415 QFTNFMEIAPSYLVPEDEKVDYPMFLFCEGSGKDLQQKIEIAE 457
>gi|357612380|gb|EHJ67950.1| autophagy related protein Atg4-like protein [Danaus plexippus]
Length = 354
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 113/328 (34%), Positives = 164/328 (50%), Gaps = 49/328 (14%)
Query: 136 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
G+ F DF S+I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR
Sbjct: 8 GIEGFKSDFISKIWMTYRREFPTMSGSSFTTDCGWGCMLRSGQMMLAQALVCHFLGRSWR 67
Query: 196 ---KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 244
KP+Q RE+ E I+ FGD S SP SIH ++ G+A G G W GP
Sbjct: 68 WSEKPIQN--GREFQEDCLHRMIIKWFGDKSSVNSPLSIHQMVTLGEALGKKPGDWYGP- 124
Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV-------VCIDDASRH 297
++A C + ++ V + E+ E V + I D H
Sbjct: 125 ------ASVAHCLK------------SVMVEASKENYEFDKLEVYVAQDSTIYIQDVYTH 166
Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
C + W ++LLVP+ LG E++NP Y P L T +GI+GG+P S Y VG
Sbjct: 167 CRL---PNGCWKSLILLVPVKLGTERLNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVG 223
Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
Q++ I+LDPH Q ++++ + + T+H R + + +DPS IGFY + D
Sbjct: 224 YQDDRLIHLDPHYCQEMVDVWQPNFSLQ--TFHCRSPRKMPISKMDPSCCIGFYLQTHHD 281
Query: 418 FDDFCARASKL-----AEESNGAPLFTV 440
F+ F + SN P+FT+
Sbjct: 282 FETFVNVINTFLTPQGVSSSNEYPMFTL 309
>gi|37991904|gb|AAR06350.1| putative autophagy, 3'-partial [Oryza sativa Japonica Group]
Length = 207
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 94/179 (52%), Positives = 124/179 (69%), Gaps = 7/179 (3%)
Query: 39 KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
K K S+LS +F+S FS+FE + +SSA+ H+ S W+ ++R+ GSM R
Sbjct: 35 KQLKNSILSCVFSSPFSIFEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF---- 90
Query: 99 LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
LG S+ + ++SD+W LG C+K++ +E + +G A F +DFSSRI I+YRKGFD
Sbjct: 91 LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 147
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+ EY+ ILH+FGDSE
Sbjct: 148 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSE 206
>gi|242007959|ref|XP_002424782.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
gi|212508305|gb|EEB12044.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
Length = 388
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 128/389 (32%), Positives = 196/389 (50%), Gaps = 39/389 (10%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + +W+LG + +D + D S++ +YRKGF PIGDS +T
Sbjct: 21 IPQTREPVWILGRKYDAGRD-----------VTAIRSDIKSKLWFTYRKGFVPIGDSGLT 69
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 223
SD GWGCMLR QM++AQAL+ LGR WR K ++P EY+ IL +F D++T+ +SI
Sbjct: 70 SDKGWGCMLRCGQMVLAQALVCLHLGRDWRWKKDSKEP---EYLRILKMFEDTKTATYSI 126
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G + G G W GP + + + L+ + + + +L I V +R
Sbjct: 127 HQIALMGVSEGKDVGQWFGPNTVTQVLKKLSVYDKWSSIVIHVALDNTIIVNDIKSLCQR 186
Query: 284 GGAPVVCIDDASRHCS-----VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
V ID +++ S V+ W P+LL+VPL LGL ++NP Y+ L+ FTF
Sbjct: 187 NEQSV--IDSSAQKHSPLNEPVYFNSARKWKPLLLVVPLRLGLSEINPVYLNGLKTCFTF 244
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVIR 395
QSLG++GGKP + Y +G E IYLDPH QPV + +L + + +YH
Sbjct: 245 RQSLGVIGGKPNHALYFIGCVGEHVIYLDPHTTQPVSIVDGKELSYEKTADLSYHCPRAS 304
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSD 452
+ +DPS+A+ F+C + +FD C + + +S PLF +T H PV +
Sbjct: 305 RSRILDMDPSVAVCFFCSSEVEFDILCQQIQEKLIKSEKQPLFEITLNKPRHWIPVEN-- 362
Query: 453 VLGETGGVPEDDSLGVMSMNDAVGNAHED 481
P + +L + + N+ ED
Sbjct: 363 --------PVERTLNLQDYERSFENSDED 383
>gi|380015613|ref|XP_003691794.1| PREDICTED: cysteine protease ATG4D-like [Apis florea]
Length = 486
Score = 181 bits (458), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 112/347 (32%), Positives = 167/347 (48%), Gaps = 38/347 (10%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191
Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
H LGR WR +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 192 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 251
Query: 237 AGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
AG W GP + + ++ E A A L A+YV V +
Sbjct: 252 AGDWYGPSSVAHLLSQAVENAAERHPAFNNL-------AVYVAQD---------CAVYLQ 295
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D C W ++L VPL LG +K+NP Y L T +G++GG+P S
Sbjct: 296 DIENVCQT---PDGKWKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 352
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
Y +G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY
Sbjct: 353 LYFIGFQEDKLINLDPHYCQETVDVLKDNFSL--TSFHCTSPRKMLISKMDPSCCVGFYF 410
Query: 413 RDKDDFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGE 456
+K F +F A +E P+F + K ++ + E
Sbjct: 411 HNKMQFTNFMEIAPSYLVPEDEKIDYPMFLFCEGSGKDLHQKIEIAE 457
>gi|327267215|ref|XP_003218398.1| PREDICTED: cysteine protease ATG4B-like [Anolis carolinensis]
Length = 393
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 119/401 (29%), Positives = 186/401 (46%), Gaps = 61/401 (15%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + + ++ E D +SR+ +YRK F IG + TSD GWGC
Sbjct: 25 VWILGRKYSVLTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR K Y +L+ F D + S +SIH + Q G
Sbjct: 74 MLRCGQMIFAQALICRHLGRDWRWSKGKKQTDSYYNVLNAFIDKKDSYYSIHQIAQMGVG 133
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + LA + +A+++ + V ++
Sbjct: 134 EGKSIGQWYGPNTVAQVLRKLASFDTWSS--------LAVHIAMDN---------TVVME 176
Query: 293 DASRHC---------SVFSKGQADW------------------TPILLLVPLVLGLEKVN 325
+ R C S F + D+ P++LL+PL LGL +N
Sbjct: 177 EIRRLCKPSCPCPGASAFPAAEPDFLSNGYPEGAECTDRLLLWKPLVLLIPLRLGLTDIN 236
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
YI TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D
Sbjct: 237 EAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPMDSCYIPD 296
Query: 386 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 445
S + + + +DPS+A+GF+C ++DF+D+C R KL+ P+F + +
Sbjct: 297 ESFHCQHPPCRMSIAELDPSIAVGFFCNSEEDFNDWCQRIKKLSLIRGALPMFELVEHQP 356
Query: 446 KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
+ DVL T + D L + ++ ++D+++L
Sbjct: 357 SHFSSPDVLNLTPDSSDADRL------ERFFDSEDEDFEIL 391
>gi|328786958|ref|XP_393739.4| PREDICTED: cysteine protease ATG4D-like [Apis mellifera]
Length = 525
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 114/347 (32%), Positives = 171/347 (49%), Gaps = 38/347 (10%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
A+ + +G+ EF +DF+SR+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+
Sbjct: 171 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 230
Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
H LGR WR +P E + I+ FGD TSPFSIH L+ G +G
Sbjct: 231 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 290
Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCID 292
AG W GP ++ A Q E + + P +A+YV V +
Sbjct: 291 AGDWYGPSSV-----AHLLSQAVENAV--ERHPAFNNLAVYVAQD---------CAVYLQ 334
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D C S G+ W ++L VPL LG +K+NP Y L T +G++GG+P S
Sbjct: 335 DIENVCQT-SDGK--WKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 391
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
Y +G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY
Sbjct: 392 LYFIGFQEDKLINLDPHYCQETVDVLKDNFSL--TSFHCTSPRKMLISKMDPSCCVGFYF 449
Query: 413 RDKDDFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGE 456
+K F +F A +E P+F + K ++ + E
Sbjct: 450 HNKMQFTNFMEIAPSYLVPEDEKIDYPMFLFCEGSGKDLHQKIEIAE 496
>gi|213513159|ref|NP_001133247.1| cysteine protease ATG4B [Salmo salar]
gi|209147572|gb|ACI32896.1| Cysteine protease ATG4B [Salmo salar]
gi|223647372|gb|ACN10444.1| Cysteine protease ATG4B [Salmo salar]
Length = 397
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 115/312 (36%), Positives = 166/312 (53%), Gaps = 18/312 (5%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+SR+ +YRK F PIG + TSD GWGCMLR QM++ +AL+ LGR WR +
Sbjct: 47 TSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVRRHLGRDWRWVRSQSQRE 106
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEALAR 255
+Y+ IL+ F D + +S+H + Q G G + G W GP A+ SW L
Sbjct: 107 DYISILNAFLDKKDGYYSLHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRLTV 166
Query: 256 CQRAETGLGCQS-----LPMAIYVVSGDEDGERG-GAPVVCIDDASRHCSVFSKGQADWT 309
+ + + +P Y + D + G P C++ A C++ + A W
Sbjct: 167 HVAMDNTVVIEEIKRLCMPWLDYGGAACVDLQGGMPEPNGCLEGA---CALAEEETALWK 223
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+LLL+PL LGL +N YI TL+ F PQSLG++GGKP + Y +G E IYLDPH
Sbjct: 224 PLLLLIPLRLGLSDINEAYIETLKQCFQLPQSLGVIGGKPNHAHYFIGYVGEELIYLDPH 283
Query: 370 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
QP + +D D + + +H+ IDPS+A+GF+CR +DDFDD+C R KL+
Sbjct: 284 TTQPAVEPCEDSQVPDDTYHCQHPPCRMHICEIDPSIAVGFFCRTEDDFDDWCMRFRKLS 343
Query: 430 EESNGAPLFTVT 441
G P+F +
Sbjct: 344 HTRAGLPMFELV 355
>gi|449268268|gb|EMC79138.1| Cysteine protease ATG4C [Columba livia]
Length = 459
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 120/405 (29%), Positives = 181/405 (44%), Gaps = 82/405 (20%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 155
S S ++LLG C+ DE+ G+ + G+N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVFLLGKCYHFKTDES-GELSTDGSNFDKINTEISGNVEEFRKDFISRIWLTYREE 94
Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 194
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 95 FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIDSSDSESWTAHTV 154
Query: 195 --------------RKP----------LQKPFDRE-------YVEILHLFGDSETSPFSI 223
R+P L++ +D + +I+ FGDS + F +
Sbjct: 155 KKLTASFEASLTAEREPKILSNHHRGTLKRNWDESERRNEVYHRKIISWFGDSPLTAFGL 214
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H L++ GK G AG W GP + R G + IYV
Sbjct: 215 HQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTIYVAQD------ 263
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V D R CS G+AD +++LVP+ LG E+ N Y+ ++ + +G
Sbjct: 264 --CTVYSSDVIDRQCSFMDSGEADTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVG 321
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
I+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +D
Sbjct: 322 IIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMD 379
Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
PS IGFYCR DF+ +K+ + S+ PLFT + H +
Sbjct: 380 PSCTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 424
>gi|22658287|gb|AAH30861.1| Autophagy-related 4D (yeast) [Mus musculus]
gi|74152222|dbj|BAE32395.1| unnamed protein product [Mus musculus]
Length = 474
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 175/382 (45%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ G + +F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQQFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + + ++H R + +DPS +GFY ++ +F+ C+ +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436
>gi|195401363|ref|XP_002059283.1| GJ16311 [Drosophila virilis]
gi|194156157|gb|EDW71341.1| GJ16311 [Drosophila virilis]
Length = 397
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 183/388 (47%), Gaps = 48/388 (12%)
Query: 91 MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 141
M + E LGP I +D+WLLG + Q+ L
Sbjct: 13 MDSVFEAYLGPDSMLAGAVGEPEDIPKRNTDVWLLGKRYNAIQELEL-----------IR 61
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
+D SR+ +YR GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W P
Sbjct: 62 RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTP 118
Query: 202 --FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
D Y++I++ F D+ S +SIH + G++ A G W+GP + + + L R
Sbjct: 119 DCRDATYLKIVNRFEDTRKSFYSIHQIALTGESQNKAVGEWLGPNTVAQILKILVRFDDW 178
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
+ + ++V V +D+ C S + W P+LL+VPL L
Sbjct: 179 SS--------LVVHVAMDS---------TVVLDEIYTRCQEVSA--STWKPLLLIVPLRL 219
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G+ +NP YIP L+ S G++GG+P + Y +G ++ +YLDPH Q ++ +
Sbjct: 220 GISDINPMYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGSVAQ 279
Query: 380 DDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 436
A+ +YH + ++DPSLA+ F C+ ++ FD+ + +
Sbjct: 280 KTTAAEQELDESYHQKYAARLSFGAMDPSLAVCFLCKTRNSFDELLQQLRQEVLSLCTPA 339
Query: 437 LFTVTQTHKKPVNHSDVLGETGGVPEDD 464
LF ++Q+ + +D + E +P+ D
Sbjct: 340 LFEISQSRAVDWDTADDI-EWPAMPDID 366
>gi|346466653|gb|AEO33171.1| hypothetical protein [Amblyomma maculatum]
Length = 401
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/327 (33%), Positives = 168/327 (51%), Gaps = 18/327 (5%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
L + D +S+I ++YRK F I + TSD GWGCMLR QM++A+AL+ LG+ W+
Sbjct: 54 LDDLRNDVTSKIWLTYRKNFPAISGTDHTSDTGWGCMLRCGQMVIAEALMRRHLGKGWQW 113
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
P + D Y+ +L +F D + +SIH + Q G + G A G W GP + L+
Sbjct: 114 APGIR--DENYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKAVGQWFGPNTIAHVLRKLSA 171
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA-----DWTP 310
+ + L + V+ R P V DD RH S G A W P
Sbjct: 172 FDKW-SSLAVHVAMDNVVVMDDIRKICRVETPAV--DDGVRH-RTQSHGLACASAVSWKP 227
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
+LL +PL LGL ++NP Y L+ TF QS+GI+GGKP + +I+GV + ++LDPH
Sbjct: 228 LLLFIPLRLGLNEINPVYYCGLKRTFALKQSVGIIGGKPNHALFIIGVVGDDLVFLDPHT 287
Query: 371 VQPVINIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
Q +++ D+E + +YH + + +DPS+A+ FY + +FD +C A K
Sbjct: 288 TQLAVDL---DVEFPEDESYHCAHASRMDIGQLDPSIALCFYLPTECEFDSWCNLAHKHL 344
Query: 430 EESNGAPLFTVTQTHKKPVNHSDVLGE 456
PLF +T+ ++P+ D E
Sbjct: 345 ITQMKQPLFEITE--ERPLGWPDFTEE 369
>gi|427783027|gb|JAA56965.1| Putative cysteine protease required for autophagy [Rhipicephalus
pulchellus]
Length = 390
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 164/328 (50%), Gaps = 44/328 (13%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
L + + +S+I ++YRK F I + TSD GWGCMLR QM+VA+A++ LG+ W+
Sbjct: 41 LDDLRSNITSKIWLTYRKNFPAISGTDYTSDTGWGCMLRCGQMVVAEAVMRRHLGKDWQW 100
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
P K D +Y+ +L +F D + +SIH + Q G + G G W GP + L+
Sbjct: 101 SPGTK--DEKYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKEVGQWFGPNTIAHVLRKLST 158
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK------------ 303
+ + +A++V + VV +DD + C V +
Sbjct: 159 FDKWSS--------LAMHVAMDN---------VVVMDDIRKICRVETTTDVEDGIRNRTQ 201
Query: 304 --------GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
G W P++L +PL LGL ++NP Y L+ TF QSLGI+GGKP + YI
Sbjct: 202 SHGGPAAAGARSWKPLVLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYI 261
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
+GV + ++LDPH Q +++ D+E + +YH + + +DPS+A+ FY
Sbjct: 262 IGVVGDDLVFLDPHTTQLAVDL---DVECPEDESYHCAHASRMDIGQLDPSIALCFYMAT 318
Query: 415 KDDFDDFCARASKLAEESNGAPLFTVTQ 442
+ +FD +C A K PLF +T+
Sbjct: 319 EAEFDSWCNLAHKHLISQMKQPLFEITE 346
>gi|29135261|ref|NP_705811.8| cysteine protease ATG4D [Mus musculus]
gi|61211815|sp|Q8BGV9.1|ATG4D_MOUSE RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
cysteine endopeptidase; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related cysteine endopeptidase
4; AltName: Full=Autophagy-related protein 4 homolog D
gi|26331508|dbj|BAC29484.1| unnamed protein product [Mus musculus]
gi|26348941|dbj|BAC38110.1| unnamed protein product [Mus musculus]
gi|27763977|emb|CAC85952.1| APG4-D protein [Mus musculus]
gi|47125055|gb|AAH69851.1| Autophagy-related 4D (yeast) [Mus musculus]
gi|148693226|gb|EDL25173.1| autophagy-related 4D (yeast), isoform CRA_b [Mus musculus]
Length = 474
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 174/382 (45%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + + ++H R + +DPS +GFY ++ +F+ C+ +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436
>gi|26349259|dbj|BAC38269.1| unnamed protein product [Mus musculus]
Length = 474
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 174/382 (45%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R C + + VS D V D +R S + A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + + ++H R + +DPS +GFY ++ +F+ C+ +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436
>gi|405972565|gb|EKC37327.1| Cysteine protease ATG4B [Crassostrea gigas]
Length = 405
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 105/335 (31%), Positives = 156/335 (46%), Gaps = 47/335 (14%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--K 196
E DF S+I +YRK F IG + T D GWGCMLR QM++AQAL+ LGR W+ K
Sbjct: 46 ELKGDFLSKIWCTYRKNFPAIGGTGPTCDGGWGCMLRCGQMMLAQALVVRHLGRDWKWNK 105
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
Q D+ Y IL +F D +++ +SI + G + G GSW GP + + + LA
Sbjct: 106 NCQ---DQTYKRILQMFADKKSANYSIQQIASMGVSEGKPVGSWFGPNTVAQVLKKLAVY 162
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----------- 305
+ + ++ D VC DD C + Q
Sbjct: 163 DEWSS---------IVIHIAMDNTVIENDIKSVCKDDGKSTCDIIGVRQLKHESAATGRS 213
Query: 306 --------------------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
W P+LL++PL LGL ++N Y+ +L+ +FPQS+GI+
Sbjct: 214 KKSSQDSSKQDKNKQNAVDVKSWKPLLLVIPLRLGLTEINSVYVQSLKACLSFPQSVGII 273
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
GGKP + + VG + IYLDPH Q ++ D +YH +++ +DPS
Sbjct: 274 GGKPNHAHWFVGYMSDKLIYLDPHTTQLCEDL--DSPNFSDESYHCPYPSTMNVMELDPS 331
Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
+A+GFYC + +FDD K S+ P+F +
Sbjct: 332 IALGFYCGTEKEFDDLTQSVQKFVVGSSKTPMFEL 366
>gi|194213171|ref|XP_001491090.2| PREDICTED: cysteine protease ATG4D [Equus caballus]
Length = 424
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 179/383 (46%), Gaps = 67/383 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E+ GD + F +DF+SR+ ++YR+ F P+
Sbjct: 33 SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 82
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 83 GCLTSDCGWGCMLRSGQMMLAQGLLLHYLPRDWTWAEGAGLGPPEPVGLSSPNRYRGPAR 142
Query: 196 ---------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 246
P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 143 WMAPTLGPGAPPSWSRERRHRQIVSWFADHPRAPFGLHQLVELGQSSGKKAGDWYGP--- 199
Query: 247 CRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 306
+A R + + +YV + A +V D + A
Sbjct: 200 ----SLVAHILRKAVESCAEVTRLVVYVSQDCTVYKADVARLVARPDPT----------A 245
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YL
Sbjct: 246 EWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYL 305
Query: 367 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
DPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ +
Sbjct: 306 DPHYCQPTVDVSRADFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELT 363
Query: 427 KLAEESNGA---PLFTVTQTHKK 446
++ S+ P+FT+ + H +
Sbjct: 364 RVLSSSSATERYPMFTLAEGHAQ 386
>gi|296804856|ref|XP_002843276.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
gi|238845878|gb|EEQ35540.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
Length = 473
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 112/317 (35%), Positives = 160/317 (50%), Gaps = 47/317 (14%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLRSS 177
F DF SR+ I+YR F PI + TSD GWGCM+RS
Sbjct: 138 FLDDFESRLWITYRSHFPPIPKTGGSSSSSMPLGVRLRSQLIDTQGFTSDTGWGCMIRSG 197
Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLA 236
Q L+A LLF RLGR WR+ Q ++E E+L LF D +PFSIH +Q G A G
Sbjct: 198 QSLLANTLLFLRLGRGWRRGSQ---EQEESELLSLFADHPRAPFSIHRFVQHGATACGKC 254
Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCIDDAS 295
G W GP A + +ALA G + +Y+ S G + ER + C
Sbjct: 255 PGEWFGPAAAAQCIQALAN--------GHPQAGLNVYITSDGSDIYERQFREIACR---- 302
Query: 296 RHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ G+ D P L+L+ + LG+++V P Y +L+ FPQS+GI GG+P +S Y
Sbjct: 303 ---GLGEDGEDDSIKPTLILLGVRLGIDRVTPVYWESLKEVIRFPQSVGIAGGRPSSSHY 359
Query: 355 IVGVQEESAIYLDPHDVQPVI---NIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGF 410
+ Q ++ YLDPH +P + G+D + STYH+ +R +H+ +DPS+ IGF
Sbjct: 360 FIATQGDTFFYLDPHQTRPSLPPRTAGEDVYSPGELSTYHTRRLRRLHIREMDPSMLIGF 419
Query: 411 YCRDKDDFDDFCARASK 427
RD+ D++D R +
Sbjct: 420 LVRDEGDWEDLKGRIRR 436
>gi|321472665|gb|EFX83634.1| hypothetical protein DAPPUDRAFT_194862 [Daphnia pulex]
Length = 389
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 178/362 (49%), Gaps = 34/362 (9%)
Query: 83 KRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQ 142
KR++ A +E + R G + +W+LG + L E N
Sbjct: 23 KRMLEACEAFVTYESGIILERQGFEVNDEPVWILG-----------REYDTKTKLDELNS 71
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--WRKPLQK 200
D SR+L++YR+ F PIGDS +TSD GWGCMLR QM+VAQAL+ LGR W +
Sbjct: 72 DVKSRLLLTYRRNFPPIGDSGMTSDRGWGCMLRCGQMVVAQALINQHLGRQPFWPVGDDQ 131
Query: 201 PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
Y +IL LF D +T+ +SIH L Q G + G G W GP + + + L+
Sbjct: 132 RTTESYKKILKLFEDKKTAVYSIHQLAQMGVSEGKEIGQWFGPNTVAQVLKKLSEYDEWS 191
Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC--SVFSKGQADWTPILLLVPLV 318
+ I+V + V I++ + C + + W+P+LL+VPL
Sbjct: 192 A--------LKIHVAMDN---------AVVIEEIEQLCHKKITPTETSTWSPLLLVVPLR 234
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
LGL +NP YI +L+ PQS+G++GGKP + Y +G + ++LDPH Q I++
Sbjct: 235 LGLLNINPIYIDSLKACLQMPQSIGMIGGKPSQALYFIGYVGDDVVFLDPHLTQNAIDLD 294
Query: 379 KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 438
+D E D S+YH I S+DPSLA+ F C ++ D + ++E LF
Sbjct: 295 ED--EFDDSSYHPATCARISFQSMDPSLAVCFSCTTHSEWKDLLRQFKDMSEIGKKQNLF 352
Query: 439 TV 440
V
Sbjct: 353 EV 354
>gi|195118032|ref|XP_002003544.1| GI17971 [Drosophila mojavensis]
gi|193914119|gb|EDW12986.1| GI17971 [Drosophila mojavensis]
Length = 382
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 184/388 (47%), Gaps = 48/388 (12%)
Query: 91 MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 141
M + E LGP I +++WLLG + Q+ L
Sbjct: 13 MDSVFEAYLGPDGVLAGAVGEIEDIPKRNTNVWLLGKRYNAIQE-----------LEPIR 61
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
+D SR+ +YR GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W P
Sbjct: 62 RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTP 118
Query: 202 --FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
D Y++I++ F D+ S +SIH + G++ A G W+GP + + + L R
Sbjct: 119 DCRDATYLKIVNRFEDTRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKILVRFDDW 178
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
+ +A++V V +DD C ++ W P+LL+VPL L
Sbjct: 179 SS--------LAVHVAMDS---------TVVLDDIYTCCQ--ESSESSWKPLLLIVPLRL 219
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G+ +NP YIP L+ S G++GG+P + Y +G ++ +YLDPH Q + +
Sbjct: 220 GITDINPIYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGAVAQ 279
Query: 380 DDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 436
A+ +YH + ++DPSLA+ F C+ +D F++ + + +
Sbjct: 280 KTTAAERELDESYHQKYAARLSFGAMDPSLAVCFLCKTRDSFEELLQQLRQDVLTLSTPA 339
Query: 437 LFTVTQTHKKPVNHSDVLGETGGVPEDD 464
LF ++Q+ + +D + E +P+ D
Sbjct: 340 LFEISQSRAVDWDTADDI-EWPAMPDID 366
>gi|178057055|ref|NP_001116551.1| cysteine protease ATG4D [Sus scrofa]
gi|61211337|sp|Q684M2.1|ATG4D_PIG RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related protein 4 homolog D
gi|51870495|emb|CAG15153.1| AUT-like 4, cysteine endopeptidase [Sus scrofa]
Length = 469
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 110/378 (29%), Positives = 178/378 (47%), Gaps = 62/378 (16%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------KPLQKPF----------- 202
+TSD GWGCMLRS QM++AQ LL H L R W P P
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSQGVGLGPPESSPNRYRGPAHWMPP 192
Query: 203 -----------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 HWVQAAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------S 245
Query: 252 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 311
+A R + + +YV + A +V D + A+W +
Sbjct: 246 LVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKAV 295
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH
Sbjct: 296 VILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYC 355
Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ +++
Sbjct: 356 QPTVDVSQADFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTRVLSS 413
Query: 432 SNGA---PLFTVTQTHKK 446
S+ P+FT+ + H +
Sbjct: 414 SSATERYPMFTLVEGHAQ 431
>gi|440891575|gb|ELR45180.1| Cysteine protease ATG4A, partial [Bos grunniens mutus]
Length = 408
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 108/355 (30%), Positives = 172/355 (48%), Gaps = 39/355 (10%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
G + G W GP A+ W +LA + + + + +S D
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 195
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
ER + AS S W P+LL+VPL LG+ ++NP Y+ + F PQS
Sbjct: 196 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
LG +GGKP + Y +G + I+LDPH Q ++ ++ D + + + +++ +
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILN 311
Query: 402 IDPSLAI------------GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+DPS+A+ GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 312 LDPSVALVVLSCLLLLPPKGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 365
>gi|149642765|ref|NP_001092616.1| cysteine protease ATG4D [Bos taurus]
gi|148744285|gb|AAI42400.1| ATG4D protein [Bos taurus]
Length = 472
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 112/381 (29%), Positives = 179/381 (46%), Gaps = 65/381 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 198
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192
Query: 199 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
Q P +R + +I+ F D +PF +H L++ G+ G AG W GP
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
+A R + + +YV + A +V D + A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
+++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355
Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
H QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ +++
Sbjct: 356 HYCQPTVDVSQADFPLE--SFHCTSPRRMAFAKMDPSCTVGFYAGDRKEFETLCSELTRV 413
Query: 429 AEESNGA---PLFTVTQTHKK 446
S+ P+FT+ + H +
Sbjct: 414 LSSSSATERYPMFTLVEGHAQ 434
>gi|195570668|ref|XP_002103326.1| GD20357 [Drosophila simulans]
gi|194199253|gb|EDX12829.1| GD20357 [Drosophila simulans]
Length = 703
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463
Query: 288 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 464 HVPWQQAKRPQAETPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 581
Query: 406 LAIGFYCRDKDDFDDF 421
IGFYC K DFD+F
Sbjct: 582 CCIGFYCATKSDFDNF 597
>gi|320169048|gb|EFW45947.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 918
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 120/351 (34%), Positives = 172/351 (49%), Gaps = 37/351 (10%)
Query: 107 SSSTSDIWLLGVCHKIAQDEALGDAAGNNG-----LAEFNQDFSSRILISYRKGFDPIGD 161
S S S IW+LG C+ + E G + + +F DF + + SYRK F+ I
Sbjct: 260 SISDSPIWMLGNCYSGKELECNGHTENKHNKRSRHICKFFADFQTLVCFSYRKDFERIPG 319
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQKPFDREYVEILHLFG 214
SK T+D GWGC LRS+QMLVA+AL+ GR WR PL + + I+ LF
Sbjct: 320 SKHTTDCGWGCTLRSAQMLVAEALVLQIFGRRWRIEDRSCPAPLSSSKEDQLRLIIRLFQ 379
Query: 215 DSET--SPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
D SPFSIHN++Q G + + AG W GP ++ R + L A ++
Sbjct: 380 DQLRLDSPFSIHNIVQHGCQLFDKRAGDWFGPASVVRVFADLINQAYAMHQSPFRAYQAI 439
Query: 272 IYVVSGDEDGERGGAPVVCID-DASRHCSVFSKGQADWT-------------------PI 311
+++ D E P D + S S D T P+
Sbjct: 440 DHIIYRDLVAELCSGPDAVRDLEFSTPTSTSESVSTDETVTPSASTSQSPPVLPPPFIPL 499
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
L+L+PL LGL ++N YIP L+ Q +GI+GG+P S Y VG QE++ I+ DPH
Sbjct: 500 LILMPLRLGLNEINRMYIPCLKALLMCAQCVGIIGGRPRHSLYFVGYQEDNVIFADPHGC 559
Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
+ +++ + T T+HS V I +DPS+AIGF C+++ DFDD C
Sbjct: 560 KRFVDMQQTSFP--TETFHSAVPNKIPFTHMDPSMAIGFLCQNQADFDDLC 608
>gi|431918972|gb|ELK17839.1| Cysteine protease ATG4D [Pteropus alecto]
Length = 442
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 182/385 (47%), Gaps = 66/385 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 52 SRTSFSKLSS----VHLCGRRYRFETEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 101
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR----------KP--LQKPF------- 202
+TSD GWGCMLRS QM++AQ LL H L R W +P L P+
Sbjct: 102 GYLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWVKGVGLDPPEPSRLASPYWHHGPAC 161
Query: 203 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 162 WIPPHWTQGSPELEQERRHRQIVSWFADHPKAPFGLHQLVELGQSSGKKAGDWYGP---- 217
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 218 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 264
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + + + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 325 PHYCQPTVDVSQANFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTR 382
Query: 428 LAEESNGA---PLFTVTQTHKKPVN 449
+ S+ P+FT+ + H + N
Sbjct: 383 VLSSSSATERYPMFTLAEGHAQDHN 407
>gi|195328749|ref|XP_002031074.1| GM25780 [Drosophila sechellia]
gi|194120017|gb|EDW42060.1| GM25780 [Drosophila sechellia]
Length = 703
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHASQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463
Query: 288 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 464 HVPWQKAKRPQAENPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 581
Query: 406 LAIGFYCRDKDDFDDF 421
IGFYC K DFD+F
Sbjct: 582 CCIGFYCATKSDFDNF 597
>gi|24647125|ref|NP_650452.1| CG6194 [Drosophila melanogaster]
gi|23171357|gb|AAF55180.2| CG6194 [Drosophila melanogaster]
gi|261490735|gb|ACX83596.1| RE44406p [Drosophila melanogaster]
Length = 668
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 166/316 (52%), Gaps = 19/316 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 249 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 308
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 309 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 368
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 369 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 428
Query: 288 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + +K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 429 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 488
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 489 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 546
Query: 406 LAIGFYCRDKDDFDDF 421
IGFYC K DFD+F
Sbjct: 547 CCIGFYCATKSDFDNF 562
>gi|296485832|tpg|DAA27947.1| TPA: APG4 autophagy 4 homolog D [Bos taurus]
Length = 472
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 112/381 (29%), Positives = 179/381 (46%), Gaps = 65/381 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 198
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192
Query: 199 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
Q P +R + +I+ F D +PF +H L++ G+ G AG W GP
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
+A R + + +YV + A +V D + A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
+++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355
Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
H QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ +++
Sbjct: 356 HYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRV 413
Query: 429 AEESNGA---PLFTVTQTHKK 446
S+ P+FT+ + H +
Sbjct: 414 LSSSSATERYPMFTLVEGHAQ 434
>gi|340722130|ref|XP_003399462.1| PREDICTED: cysteine protease ATG4D-like [Bombus terrestris]
Length = 485
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 153/305 (50%), Gaps = 27/305 (8%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
A+ + + + EF +DF+SR+ ++YR+ F + S TSD GWGCMLRS QM++AQAL+
Sbjct: 131 AMDAISFEDSIEEFKKDFTSRLWLTYRREFPILNGSTFTSDCGWGCMLRSGQMMLAQALV 190
Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
H LGR WR + +P E + I+ FGD TSPFSIH L+ G G
Sbjct: 191 CHFLGREWRWQVDQPLKTEQQKLDEYNHRLIIKSFGDLPDSTSPFSIHTLVSLGALSGKR 250
Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 296
AG W GP ++ Q AE +L A+YV V + D
Sbjct: 251 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 298
Query: 297 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
C + W ++L VPL LG +K+N Y L T +G++GG+P S Y +
Sbjct: 299 VCQM---PDGKWKSLILFVPLRLGADKLNLVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 355
Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
G QE+ I LDPH Q +++ KD+ +++H R + + +DPS +GFY DK
Sbjct: 356 GFQEDKLINLDPHYCQETVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHDKM 413
Query: 417 DFDDF 421
F +F
Sbjct: 414 QFTNF 418
>gi|17862242|gb|AAL39598.1| LD17482p [Drosophila melanogaster]
Length = 653
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 166/316 (52%), Gaps = 19/316 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML A
Sbjct: 234 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 293
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 294 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 353
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 354 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 413
Query: 288 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + +K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 414 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 473
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 474 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 531
Query: 406 LAIGFYCRDKDDFDDF 421
IGFYC K DFD+F
Sbjct: 532 CCIGFYCATKSDFDNF 547
>gi|195444549|ref|XP_002069918.1| GK11310 [Drosophila willistoni]
gi|194166003|gb|EDW80904.1| GK11310 [Drosophila willistoni]
Length = 676
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 156/328 (47%), Gaps = 43/328 (13%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 258 AVENQVGETPWEEGIEGFRRDFYSRLWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 317
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L+ G A G
Sbjct: 318 QGLIVHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVSLGTALGK 377
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP ++ L T +++YV + I D
Sbjct: 378 KPGDWYGPASVSY---LLKHALEHATQENADFDNISVYVAKD---------CTIYIQDIE 425
Query: 296 RHCSV----------------------FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
CS+ Q W +++L+PL LG +KVNP Y L+
Sbjct: 426 DQCSIPEPAPKQTHVPWQQMKRPSLNEHQPDQQHWKSVIILIPLRLGTDKVNPAYAHCLK 485
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
L + LGI+GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H
Sbjct: 486 LLLSTENCLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQENF--SMQSFHCKS 543
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
R I +DPS IGFYC K DFD
Sbjct: 544 PRKIKTSKMDPSCCIGFYCATKSDFDSL 571
>gi|194901010|ref|XP_001980048.1| GG20629 [Drosophila erecta]
gi|190651751|gb|EDV49006.1| GG20629 [Drosophila erecta]
Length = 708
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 289 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 348
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 349 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 408
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 409 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 468
Query: 288 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + K Q W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 469 HVPWQQAKRPQAETPKTEQQQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 528
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 529 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 586
Query: 406 LAIGFYCRDKDDFDDF 421
IGFYC K DFD F
Sbjct: 587 CCIGFYCATKSDFDSF 602
>gi|210032083|ref|NP_001094483.2| autophagy-related 4D [Rattus norvegicus]
gi|149020504|gb|EDL78309.1| rCG31864, isoform CRA_b [Rattus norvegicus]
Length = 473
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 176/382 (46%), Gaps = 67/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 248
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R C + + VS D V D +R S + A+
Sbjct: 249 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 295
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + + + ++H R + +DPS +GFY ++ +F+ C+ +
Sbjct: 356 PHYCQPTVDVNQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 413
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FTV + H +
Sbjct: 414 ILSSSSVTERYPMFTVAEGHAQ 435
>gi|340383455|ref|XP_003390233.1| PREDICTED: cysteine protease ATG4D-like [Amphimedon queenslandica]
Length = 437
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 167/356 (46%), Gaps = 54/356 (15%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
S+ S + +LG + +D + F F S ++YR GF PI S +T+D
Sbjct: 61 SNNSPVLVLGKLYIPERDTKPQSEGIPRHILMFMDHFYSLPWMTYRCGFSPILSSSLTTD 120
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR-----------KPLQKPFDREYVEILHLFGDS 216
GWGCM+RS QML+A L H LGR WR K ++ V IL FGDS
Sbjct: 121 CGWGCMVRSGQMLLATVLHLHFLGRDWRLSSSDVTGHKIHRQVKNWNNYVVLILSWFGDS 180
Query: 217 ETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIY 273
E+ PFSIH L++A +G G W GP + L R C R + IY
Sbjct: 181 ESELCPFSIHRLMEAAYYHGNKPGDWFGPSQV----SILIRDCVRRALREHINLQKLNIY 236
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW------TPILLLVPLVLGLEKVNPR 327
V S C+V+ K D +L+LVP+ LG E +NP
Sbjct: 237 V--------------------SHDCTVYIKDVQDIFESDLDQSLLVLVPVRLGSESLNPI 276
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
YIP ++ ++GI+GG+P S + +G Q+E+ I+LDPH Q +N+ + D D S
Sbjct: 277 YIPCVKALLALDHTVGIIGGRPKHSVFFIGFQDENLIHLDPHYSQTAVNMTRTDF--DVS 334
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
+YH + I + +DPS +GFYC +DF+ F A K+ FTVT T
Sbjct: 335 SYHCRSPKKIPVTKMDPSCTLGFYCHTLEDFNHFRIEAEKVT--------FTVTPT 382
>gi|452837994|gb|EME39935.1| hypothetical protein DOTSEDRAFT_47435 [Dothistroma septosporum
NZE10]
Length = 442
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/354 (31%), Positives = 169/354 (47%), Gaps = 56/354 (15%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 175
+EF +D S+I ++YR F PI S TSD GWGCM+R
Sbjct: 111 SEFLEDVESKIWLTYRNNFPPIPKSSEAAATSAMSFTTKLRNFANKDGFTSDTGWGCMIR 170
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
S Q L+A A+L HRLGR WR+ + +REY +IL LF D+ SP SIH ++ G +A G
Sbjct: 171 SGQSLLANAILIHRLGRDWRRGDK---EREYKDILSLFADTPESPLSIHKFVEHGAQACG 227
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDEDGERGGAPVVCID 292
G W GP A R AL + E GL S P +YV
Sbjct: 228 TYPGEWFGPNATARCIRALTE-KYHEAGLQVYSRPNDSDVYV------------------ 268
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D+ + + P L+++ + LG+EKV P Y L+ QS+GI GG+P +S
Sbjct: 269 DSLMQTAAQKDADDKFQPTLIVLGIRLGIEKVTPAYHAALKAALELSQSVGIAGGRPSSS 328
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
Y +G Q ++ YLDPH +P+++ L D ++ H+ +R + + +DPS+ +GF
Sbjct: 329 HYFIGHQGDNFFYLDPHTTRPMLS--PQPLAEDINSCHTRRVRRLGIAEMDPSMLLGFLI 386
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
R KD+F+ + S++ P + H+ +S G V E ++L
Sbjct: 387 RSKDEFEQWRKSISEI-------PGKAIIHIHETEPKYSTGTERAGAVDEVETL 433
>gi|321472016|gb|EFX82987.1| hypothetical protein DAPPUDRAFT_302128 [Daphnia pulex]
Length = 405
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/336 (33%), Positives = 166/336 (49%), Gaps = 39/336 (11%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
S S IWLLG + + + N DF SRI ++YRK F + S TSD
Sbjct: 18 SKDSPIWLLGRIYHQSHKTDDSSSLPTNNFEALKSDFFSRIWLTYRKEFPVLNGSYYTSD 77
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPFDREYVEILHLFGD--S 216
GWGCMLRS QML+AQAL+ H LGR WR + LQ+ R I+ FGD S
Sbjct: 78 CGWGCMLRSGQMLLAQALVCHFLGRDWRWNESGAQEQQTLQESLHR---MIVQWFGDKPS 134
Query: 217 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
P SIH ++ G + G G W GP ++ S+ QRA T + + +Y+
Sbjct: 135 PACPLSIHQMVSQGHISAGKRPGDWYGPSSV--SYIIKQILQRA-TDTYPELDTLRVYIA 191
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQAD----------WTPILLLVPLVLGLEKVN 325
V +DD + CS + + W ++LL+PL LG E++N
Sbjct: 192 QD---------CTVYLDDVKQSCSKICNYECEETDYELIDDQWKSLILLIPLRLGGERMN 242
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
P Y L+ + Q +GI+GGKP S Y +G Q++ I+LDPH+ Q ++++ + +
Sbjct: 243 PTYDSCLKGLLSLEQCIGIIGGKPKHSQYFIGWQDDYLIHLDPHNCQEMVDVLIPNF--N 300
Query: 386 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
++H +R L +DPS +GFY R + +FD+F
Sbjct: 301 LKSFHCHELRKTALKQVDPSCCVGFYLRSQREFDEF 336
>gi|194764839|ref|XP_001964535.1| GF23235 [Drosophila ananassae]
gi|190614807|gb|EDV30331.1| GF23235 [Drosophila ananassae]
Length = 668
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 167/316 (52%), Gaps = 19/316 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 248 AVENQVGEHPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 307
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H +GR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 308 QGLICHFMGRTWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGENLGK 367
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 368 KPGDWYGPASVSYLLKHALEHAAQENADFDNISIYVAKDCTIYLQDIEDQCSVPEPAPKP 427
Query: 288 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + SK Q W +++L+PL LG +K+N Y L+L + LGI+
Sbjct: 428 NVPWQQAKRPQAEVSKTEHQQHWKALIVLIPLRLGSDKLNLAYAHCLKLLLSTEHCLGII 487
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
GGKP S Y VG QE+ I+LDPH Q ++++ +++ + ++H R + +DPS
Sbjct: 488 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENFSLN--SFHCKSPRKLKSSKMDPS 545
Query: 406 LAIGFYCRDKDDFDDF 421
IGFYC K DFD+F
Sbjct: 546 CCIGFYCATKSDFDNF 561
>gi|195539710|gb|AAI68141.1| Atg4d protein [Rattus norvegicus]
Length = 442
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 176/382 (46%), Gaps = 67/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ G + F +DF SR+ ++YR+ F P+
Sbjct: 53 SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 102
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 103 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 161
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 162 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 217
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R C + + VS D V D +R S + A+
Sbjct: 218 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 264
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + + + ++H R + +DPS +GFY ++ +F+ C+ +
Sbjct: 325 PHYCQPTVDVNQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 382
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FTV + H +
Sbjct: 383 ILSSSSVTERYPMFTVAEGHAQ 404
>gi|213390042|gb|ACJ46060.1| autophagy related protein Atg4-like protein [Bombyx mori]
Length = 355
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 106/297 (35%), Positives = 151/297 (50%), Gaps = 27/297 (9%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
G+ F DF S+I ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR W
Sbjct: 15 EGIEGFKSDFVSKIWMTYRREFPTMTGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRSW 74
Query: 195 RKPLQKPFD--REYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 244
R +KP RE+ E I+ FGD S SP SIH ++ G+A G G W GP
Sbjct: 75 RWLPEKPIQNAREFQEDCLHRKIIKWFGDKSSVNSPLSIHQMVSLGEALGKKPGDWYGPA 134
Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 304
++ ++L E + + +YV V I D C +
Sbjct: 135 SVAHCLKSLIASASKENY---EFDHLEVYVAQDS---------TVYIQDIYSMCQLL--- 179
Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
W ++LLVP+ LG EK NP Y P L T +GI+GG+P S Y VG Q++ I
Sbjct: 180 HGAWKSLILLVPVKLGTEKFNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVGYQDDKLI 239
Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+LDPH Q ++++ + + ++H R + L +DPS IGFY + DF+ F
Sbjct: 240 HLDPHYCQEMVDVWQPNFSL--QSFHCRSPRKMPLAKMDPSCCIGFYLGTQHDFETF 294
>gi|348550913|ref|XP_003461275.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like [Cavia
porcellus]
Length = 474
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 179/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSK-LSSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWAEGPGLGSPELPGTASPSPGRSPAR 192
Query: 195 ----RKPLQKP-FDRE--YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P P ++E + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 WVPPRWPRGAPELEQELRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 248
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + +A+YV + A +V D + A+
Sbjct: 249 ---SLVAHILRKAVESSSEVTRLAVYVSQDCTVYKADVAHLVASRDPT----------AE 295
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY ++ +F+ CA ++
Sbjct: 356 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCAELTR 413
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 414 ILSCSSATERYPMFTLAEGHAQ 435
>gi|195394658|ref|XP_002055959.1| GJ10670 [Drosophila virilis]
gi|194142668|gb|EDW59071.1| GJ10670 [Drosophila virilis]
Length = 672
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/318 (35%), Positives = 163/318 (51%), Gaps = 21/318 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + + D+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 252 AAENQMADSPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 311
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 312 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGEQLGK 371
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ +E E P
Sbjct: 372 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEEQCSIPEPAPKP 431
Query: 288 VVCIDDASRH----CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V S+ + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 432 HVPWQMTSKKPASDAPKLDQPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 491
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
I+GGKP S Y VG QE+ I+LDPH Q ++++ ++ ++H R + +D
Sbjct: 492 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETF--SMQSFHCKSPRKLKSSKMD 549
Query: 404 PSLAIGFYCRDKDDFDDF 421
PS IGFYC K DFD F
Sbjct: 550 PSCCIGFYCATKTDFDSF 567
>gi|154300262|ref|XP_001550547.1| hypothetical protein BC1G_11320 [Botryotinia fuckeliana B05.10]
gi|166990615|sp|A6SDQ3.1|ATG4_BOTFB RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|347841273|emb|CCD55845.1| similar to cysteine protease atg4 [Botryotinia fuckeliana]
Length = 439
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 106/310 (34%), Positives = 159/310 (51%), Gaps = 51/310 (16%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF ++I ++YR F I S+ TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A ALL R+GR WR+ + +R+ IL LF D +P+SIH ++ G A G
Sbjct: 163 GQSLLANALLTLRMGREWRRGVSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +AL+ Q + +Y+ +GD G+ V
Sbjct: 220 HPGEWFGPSATARCIQALSNSQAKSE--------LRVYI-TGD------GSDVY----ED 260
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+ S+ +D+TP L+LV LGL+K+ P Y L+ + PQS+GI GG+P +S Y
Sbjct: 261 KFMSIAKPNHSDFTPTLILVGTRLGLDKITPVYWEALKYSLQMPQSVGIAGGRPSSSHYF 320
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLE----ADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
+GVQE YLDPH +P + KD++E D + H+ +R +H+ +DPS+ I F
Sbjct: 321 IGVQESDFFYLDPHQTRPALPY-KDNVEDYTTEDIDSCHTRRLRRLHIKEMDPSMLIAFL 379
Query: 412 CRDKDDFDDF 421
RD++D++++
Sbjct: 380 IRDENDWNEW 389
>gi|195501322|ref|XP_002097748.1| GE26385 [Drosophila yakuba]
gi|194183849|gb|EDW97460.1| GE26385 [Drosophila yakuba]
Length = 706
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 165/316 (52%), Gaps = 19/316 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SRI ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 287 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 346
Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR + Y + +H FGD S++SPFSIH L++ G+ G
Sbjct: 347 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 406
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 407 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 466
Query: 288 VVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V A R + K + W +++L+PL LG +K+NP Y L+L + LGI+
Sbjct: 467 HVPWQQAKRPQAETPKTEQHQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 526
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
GGKP S Y VG QE+ I+LDPH Q ++++ +++ ++H R + +DPS
Sbjct: 527 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 584
Query: 406 LAIGFYCRDKDDFDDF 421
IGFYC K DFD F
Sbjct: 585 CCIGFYCATKSDFDSF 600
>gi|291414155|ref|XP_002723329.1| PREDICTED: APG4 autophagy 4 homolog D [Oryctolagus cuniculus]
Length = 408
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 117/401 (29%), Positives = 187/401 (46%), Gaps = 69/401 (17%)
Query: 89 GSMRRIHE---RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 145
G RR E R SRT S +S + VC + + E GD + F +DF
Sbjct: 4 GGARRPREHGGRWAVKSRTSFSKISS----VHVCGRRYRFEGEGD------IQRFQRDFV 53
Query: 146 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------- 194
SR+ ++YR+ F P+ +TSD GWGCMLRS QM++AQ+LL H L R W
Sbjct: 54 SRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQSLLLHFLPRDWTWAEGLGSAEP 113
Query: 195 ---------RKPL------------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
R P + +R + +I+ F D +PF +H L++ G++
Sbjct: 114 AGSASPSRYRGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPGAPFGLHRLVELGQSS 173
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G AG W GP +A R + + +YV + A +V D
Sbjct: 174 GKKAGDWYGP-------SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPD 226
Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
+ A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S
Sbjct: 227 PT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRLELCLGIMGGKPRHSL 276
Query: 354 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 413
Y +G Q++ +YLDPH QP +++ + D + ++H R + +DPS +GFY
Sbjct: 277 YFIGYQDDFLLYLDPHYCQPTVDVSQTDFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAG 334
Query: 414 DKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHS 451
+ +F+ C+ +++ S+ P+FT+ + H + +HS
Sbjct: 335 GRKEFETLCSELTRVLGSSSATERYPMFTLAEGHAQ--DHS 373
>gi|148228573|ref|NP_001085611.1| cysteine protease ATG4A [Xenopus laevis]
gi|61211771|sp|Q6GPU1.1|ATG4A_XENLA RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|49115669|gb|AAH73017.1| MGC82614 protein [Xenopus laevis]
Length = 397
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 163/320 (50%), Gaps = 21/320 (6%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
+ D SR+ +YRK F PIG + +SD GWGCMLR QM++AQAL+ LGR WR
Sbjct: 45 CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALVCQHLGRDWRWE 104
Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
K EY +IL F D + +SIH + Q G G + G W GP + + + LA
Sbjct: 105 KHKNHPEEYQQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 164
Query: 258 RAETGLGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSKGQ----- 305
+ +A+Y VV D P C + A+ H S +S+ +
Sbjct: 165 EWNS--------LAVYVSMDNTVVVEDIKTMCKYQPQSCSMAQAASHQSTWSRCRDTSGH 216
Query: 306 -ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
+ W P+LL+VPL LG+ +NP Y+ + F PQSLG +GGKP + Y +G + I
Sbjct: 217 CSGWRPLLLVVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEII 276
Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
YLDPH Q ++ + D + + + + ++DPS+A+GF+C+D++DF+++C
Sbjct: 277 YLDPHTTQTFVDTEEAGTVQDQTYHCQKGPNSMKVLNLDPSVALGFFCKDENDFNNWCEV 336
Query: 425 ASKLAEESNGAPLFTVTQTH 444
K + +F +T H
Sbjct: 337 IEKEILKHQSLRMFELTPKH 356
>gi|428170513|gb|EKX39437.1| hypothetical protein GUITHDRAFT_143439 [Guillardia theta CCMP2712]
Length = 332
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 150/285 (52%), Gaps = 40/285 (14%)
Query: 113 IWLLGVCHKIA------------QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG 160
+WLLGV + +A ++ + D + N F D SR+ SYR F PI
Sbjct: 70 VWLLGVRYTLAPPPMGQRGEGRETEQTVVDESQN-----FKLDMWSRLWFSYRYNFHPIS 124
Query: 161 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR---EYVEILHLFGDSE 217
+++T+D GWGCM+RS QML+ QAL+ H LGR WR ++ +Y ++L +F D
Sbjct: 125 GTELTTDTGWGCMIRSGQMLIGQALVHHHLGRDWRLSHTSKYNELPSDYRKVLEMFLDHP 184
Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC-QSLPMAIYVVS 276
+P SIH+ ++AG+ G AG+W GP +C ++ L A LG +L + Y
Sbjct: 185 CAPLSIHSFVRAGQQVGKKAGTWFGPNTVCSAFSKL----HAGGALGSDNNLQLLAY--- 237
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
DG G D+ QA P+ +L+P LG+ V+P YIP + F
Sbjct: 238 ---DGNDG-------DNTIYKSEALELLQAG--PLFILLPTRLGVSSVDPSYIPKISHVF 285
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
+FPQSLG +GGKP ++ Y + Q E+ YLDPH QP+INI + +
Sbjct: 286 SFPQSLGFIGGKPSSAHYFIASQGEAVYYLDPHTPQPLINISEKE 330
>gi|170032510|ref|XP_001844124.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167872594|gb|EDS35977.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 628
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 115/345 (33%), Positives = 164/345 (47%), Gaps = 59/345 (17%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
+ G+ F +DF SR+ ++YRK F + DS TSD GWGCM+RS QML+AQ L+ H LGR
Sbjct: 188 DEGIEAFKRDFISRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLITHFLGRG 247
Query: 194 WR-----KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSW 240
WR + L+ FD E I+ FGD S TSPFSIH L+ GK G G W
Sbjct: 248 WRWDPSQEGLRLNFDSLQYEDGIHRKIIRWFGDTSSRTSPFSIHTLVALGKEAGKKPGDW 307
Query: 241 VGPYAMCRSWEALARCQRAET----GLGCQ-SLPMAIYVVSGDEDGERGGAPVV------ 289
GP ++ + E G+ + A+Y+ ++ P V
Sbjct: 308 YGPGSVAHLLRQAVKLAAKEITDLDGINVYVAQDCAVYIQDILDECTVSTTPSVAPWQKK 367
Query: 290 ------CIDDASR------------------HCSVF---------SKGQADWTPILLLVP 316
C D S+ H + F S + W ++LLVP
Sbjct: 368 MSSAAACTDSPSQATTPRVGATASCSSSSSPHATGFVAPSDTADESAPGSHWKSLILLVP 427
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
L LG EK+NP Y L+ + +GI+GG+P S + VG QE+ I+LDPH Q +++
Sbjct: 428 LRLGTEKLNPIYNDCLKAMLSLDNCIGIIGGRPKHSLFFVGYQEDKLIHLDPHYCQDMVD 487
Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+ +++ S++H R + L +DPS IGFYC + DF F
Sbjct: 488 VNQENFPV--SSFHCKSPRKMKLSKMDPSCCIGFYCATRKDFFKF 530
>gi|194759168|ref|XP_001961821.1| GF15159 [Drosophila ananassae]
gi|190615518|gb|EDV31042.1| GF15159 [Drosophila ananassae]
Length = 402
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 106/357 (29%), Positives = 172/357 (48%), Gaps = 42/357 (11%)
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYR 153
+ + V G I +D+W+LG + Q+ L +D SR+ +YR
Sbjct: 31 VGQAVGGGESEDIPRRNTDVWVLGKRYNAIQELEL-----------IRRDIQSRLWCTYR 79
Query: 154 KGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILH 211
GF P+G+ ++T+D GWGCMLR QM++AQAL+ LGR W P R+ Y++I++
Sbjct: 80 CGFAPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPECRDATYLKIVN 136
Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
F D + S +SIH + G++ A G W+GP + + + L R +A
Sbjct: 137 RFEDVKNSCYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVRFD--------DWCSLA 188
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
++V V +DD C + W P+LL++PL LG+ +NP Y+P
Sbjct: 189 VHVAMDS---------TVVLDDIYSLC----REGDSWKPLLLVIPLRLGITDINPMYVPA 235
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----DLEADTS 387
L+ S G++GG+P + Y +G ++ +YLDPH Q +G+ + E D
Sbjct: 236 LKRCLELDSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGTVGQKTGVGEQEYD-E 294
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
TYH ++ ++DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 295 TYHQKHAARLNFSAMDPSLAVCFLCKTSDSFESLLTKFRQEVLGLCSPALFEISQTR 351
>gi|431822415|ref|NP_001258915.1| cysteine protease ATG4A isoform 1 [Gallus gallus]
Length = 397
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 171/356 (48%), Gaps = 52/356 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W+ K EY ILH F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 328
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 181 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 240
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 241 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 300
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + + ++DPS+A+GF+C+++ DFD++C+ K + +F + Q H
Sbjct: 301 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 356
>gi|398389911|ref|XP_003848416.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
gi|339468291|gb|EGP83392.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
Length = 440
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 155/307 (50%), Gaps = 45/307 (14%)
Query: 138 AEFNQDFSSRILISYRKGFDPI----------------------GDSKITSDVGWGCMLR 175
++F DF SR+ ++YR F PI TSD GWGCM+R
Sbjct: 109 SQFLDDFESRVWMTYRNNFPPIQKASDPAATSNMSFATKLRSLANQGNFTSDTGWGCMIR 168
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
S Q L+A ++ RLGR WR+ + ++++ EIL +F D+ +PFSIH ++ G A G
Sbjct: 169 SGQSLLANTVVMLRLGRDWRRGQK---EKQHHEILSMFADTPEAPFSIHKFVEHGASACG 225
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A ARC RA T + + +Y D D V ID
Sbjct: 226 TYPGEWFGP-------SATARCIRALTE-KYHDVGLRVYARPNDSD--------VYIDTL 269
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ + S + ++P L+++ + LG+EKV P Y L+ PQS+GI GG+P +S Y
Sbjct: 270 TATTTQHSASET-FSPTLIVLGVRLGIEKVTPAYHAALKSILELPQSVGIAGGRPSSSHY 328
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
VG Q + YLDPH +P++ D + H+ IR + + +DPS+ +GF RD
Sbjct: 329 FVGHQGDHFFYLDPHTTRPMLTAQP--TAEDVESCHTRRIRRLSIAEMDPSMLLGFLVRD 386
Query: 415 KDDFDDF 421
K+DF+D+
Sbjct: 387 KEDFEDW 393
>gi|341885317|gb|EGT41252.1| hypothetical protein CAEBREN_15768 [Caenorhabditis brenneri]
Length = 457
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 111/368 (30%), Positives = 170/368 (46%), Gaps = 60/368 (16%)
Query: 127 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 185
ALG + +G+ + SSR +YRK F PIG + TSD GWGCMLR +QML+ + L
Sbjct: 34 ALGKEITEEDGIEAMKKYMSSRFWFTYRKDFSPIGGTGPTSDQGWGCMLRCAQMLLGEVL 93
Query: 186 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L +GR + ++ Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 94 LRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIHQIAQMGVTEGKEISKWFGPNT 152
Query: 246 MCR---------SWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 289
+ W +A + L + +L MA S D E+G+
Sbjct: 153 AAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTYPSEDAVKLIMENGQ------- 205
Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
+H + + + +W P+LL++PL LGL +N Y+P ++ F PQ +GI+GGKP
Sbjct: 206 ----VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCYLPAIQEFFKLPQCVGIIGGKP 261
Query: 350 GASTYIVGVQEESAIYLDPHDVQPV------------------INIGK-DDLE------- 383
+ Y VG+ YLDPH +P N + +DLE
Sbjct: 262 NLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTESEQHDTNFSELEDLEPLPSQTS 321
Query: 384 -----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 438
D STYH +++ + +SIDPSLA+ +C ++DFD+ C K ++ P+F
Sbjct: 322 DVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESREDFDNLCEELQKTTLPASKPPMF 381
Query: 439 TVTQTHKK 446
+ K
Sbjct: 382 EFLEKRPK 389
>gi|326925485|ref|XP_003208945.1| PREDICTED: cysteine protease ATG4C-like [Meleagris gallopavo]
Length = 458
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 117/403 (29%), Positives = 173/403 (42%), Gaps = 79/403 (19%)
Query: 108 SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 156
S S ++LLG C+ DE+ L N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155
Query: 195 -------------RKPLQKPFDREYV-----------EILH-----LFGDSETSPFSIHN 225
R+P +E + E+ H FGDS + F +H
Sbjct: 156 KLTASLEASLTAEREPRILSNHQERIRRNCGDGEMRDEVYHRKIISWFGDSPLAAFGLHQ 215
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
L++ GK G AG W GP + R G + +YV
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQ--------D 262
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V D R CS G+ D +++LVP+ LG E+ N Y+ ++ + +GI+
Sbjct: 263 CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGII 322
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DPS
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDPS 380
Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
IGFYCR DF+ +K+ + S+ PLFT + H +
Sbjct: 381 CTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 423
>gi|431822417|ref|NP_001258916.1| cysteine protease ATG4A isoform 2 [Gallus gallus]
gi|61211756|sp|Q5ZIW7.1|ATG4A_CHICK RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|53134379|emb|CAG32326.1| hypothetical protein RCJMB04_23b20 [Gallus gallus]
Length = 380
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 171/356 (48%), Gaps = 52/356 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 12 VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 60
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W+ K EY ILH F D + +SIH + Q G
Sbjct: 61 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 120
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 121 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 163
Query: 293 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 328
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 164 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 223
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 224 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 283
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + + ++DPS+A+GF+C+++ DFD++C+ K + +F + Q H
Sbjct: 284 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 339
>gi|301772016|ref|XP_002921445.1| PREDICTED: cysteine protease ATG4D-like [Ailuropoda melanoleuca]
Length = 445
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 181/384 (47%), Gaps = 70/384 (18%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 55 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 104
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 105 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSAPSPSEPSGLASPNRYRGPAR 164
Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 165 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 220
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 221 -----SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT---------- 265
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 266 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 325
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 326 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSEL 383
Query: 426 SKLAEESNGA---PLFTVTQTHKK 446
+++ S+ P+FT+ + H +
Sbjct: 384 TRVLSSSSATERYPMFTLAEGHAQ 407
>gi|281337397|gb|EFB12981.1| hypothetical protein PANDA_010312 [Ailuropoda melanoleuca]
Length = 428
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 181/384 (47%), Gaps = 70/384 (18%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 38 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 87
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 88 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSAPSPSEPSGLASPNRYRGPAR 147
Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 148 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 203
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 204 -----SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT---------- 248
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 249 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 308
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 309 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSEL 366
Query: 426 SKLAEESNGA---PLFTVTQTHKK 446
+++ S+ P+FT+ + H +
Sbjct: 367 TRVLSSSSATERYPMFTLAEGHAQ 390
>gi|324506823|gb|ADY42901.1| Cysteine protease ATG4B [Ascaris suum]
Length = 433
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 118/391 (30%), Positives = 176/391 (45%), Gaps = 62/391 (15%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
+ S + ++LLG HK A GD + + E+ +SR+ +YRK F PIG + T
Sbjct: 20 VFDSNTPVYLLG--HKFP---ARGDM---DSIKEY---VTSRLWFTYRKNFMPIGGTGPT 68
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
SD GWGCMLR QML+AQAL+ LG W + +Y IL +F D + PFS+H
Sbjct: 69 SDQGWGCMLRCGQMLLAQALIVRHLGTEWMWDRDNK-EEDYKRILRMFQDKKCCPFSLHQ 127
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALA---RCQRAETGLGCQSLPMAIYVVS------ 276
+ Q G + G W GP + + L R + +L +A V +
Sbjct: 128 IAQMGVSERKQIGEWFGPNTAAQVLKKLVVYDDWSRLAVHVALDNLLIASDVRTMAHTRP 187
Query: 277 --------------GDEDGERGGAPVVCIDDASRHCSVFS-----------KGQADWTPI 311
+E G G +C + + C + S + + W P+
Sbjct: 188 PSRLSSRHTTENEQSEESGNASGGNSLCSFGSVKMCMLQSALMKECDENPVEDEEQWRPL 247
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
L++VPL LGL +N Y+P + F PQ GI+GG+P + Y +G+ E IYLDPH
Sbjct: 248 LIIVPLRLGLTSINRCYLPAIEAFFQLPQCTGIIGGRPNHALYFIGIAGEQLIYLDPHVC 307
Query: 372 QPVINIG----------------KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 415
Q I++ K D S+YH + HI DS DPSLA+ F CR +
Sbjct: 308 QAAIDLDERCASLQQQDGFVEVVKSTDIFDDSSYHCPFLLHIAYDSADPSLALSFICRTE 367
Query: 416 DDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
++++ ++ PLF + +T K
Sbjct: 368 EEYEHLANNLKTKVLPASSPPLFELLETRPK 398
>gi|410950450|ref|XP_003981918.1| PREDICTED: cysteine protease ATG4D, partial [Felis catus]
Length = 423
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 185/389 (47%), Gaps = 72/389 (18%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E+ GD + F +DF SR+ ++YR+ F P+
Sbjct: 33 SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 82
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 83 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSEASGLGPSEPSGLASPNRYRGPAR 142
Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 143 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 198
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 199 -----SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT---------- 243
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 244 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 303
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 304 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 361
Query: 426 SKLAEESNGA---PLFTVTQTHKKPVNHS 451
+++ S+ P+FT+ + H + +HS
Sbjct: 362 TRVLSCSSATERYPMFTLAEGHAQ--DHS 388
>gi|395850895|ref|XP_003798008.1| PREDICTED: cysteine protease ATG4D [Otolemur garnettii]
Length = 471
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 104/349 (29%), Positives = 165/349 (47%), Gaps = 54/349 (15%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
G + F +DF SR+ +YR+ F P+ +TSD GWGCMLRS QM++AQ LL H L R
Sbjct: 104 GEGDIQRFQRDFVSRLWFTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 163
Query: 193 PW--------------------RKPLQKPFDR------------EYVEILHLFGDSETSP 220
W R P + R ++ +I+ F D +P
Sbjct: 164 DWTWAEGRGLGPPELLASPSQYRVPARWMPPRWAQGTPELEQEHQHRQIVSWFADHPQAP 223
Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
FS+H L++ G++ G AG W GP +A R + + +YV
Sbjct: 224 FSLHRLVELGQSLGKKAGDWYGP-------SVVAHILRKAVESCSEVTHLVVYVSQDCTV 276
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
+ A +V D + A+W +++LVP+ LG E +NP Y+P ++
Sbjct: 277 YKADVARLVARPDPT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSEL 326
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
LGI+GGKP S Y +G Q++ +YLDPH QP ++I + D + ++H R +
Sbjct: 327 CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDISQADFPLE--SFHCTAPRKMAFT 384
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 446
+DPS +GFY K +F+ C+ +++ S+ P+FT+ + H +
Sbjct: 385 KMDPSCTVGFYAGGKKEFETLCSELTRVLSSSSAMERYPMFTLAEGHAQ 433
>gi|449508713|ref|XP_002198788.2| PREDICTED: cysteine protease ATG4C [Taeniopygia guttata]
Length = 456
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 117/402 (29%), Positives = 179/402 (44%), Gaps = 79/402 (19%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 155
S S ++LLG C+ +E+ G+ + G+N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVFLLGKCYHFKTEES-GELSTDGSNFDKISTEISGNVEEFRKDFISRIWLTYREE 94
Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 194
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 95 FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPEALDMESCDWESWTSSTV 154
Query: 195 ---------------------RKPLQKPFD----REYV---EILHLFGDSETSPFSIHNL 226
R P ++ +D R V +I+ FGDS + F +H L
Sbjct: 155 RKLTASLEASLTAERDPKVLARPPARRDWDGTEKRNEVYHRKIISWFGDSPLAAFGLHQL 214
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
++ GK G AG W GP + R G + +YV
Sbjct: 215 IEYGKKSGKMAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD--------C 261
Query: 287 PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
V D R CS+ G+A +++L P+ LG E+ N Y+ ++ + +GI+G
Sbjct: 262 TVYSSDVIDRQCSLVDSGKAGTKAVIILFPVRLGGERTNTDYLEFVKGILSLEYCVGIIG 321
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 406
GKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DPS
Sbjct: 322 GKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDPSC 379
Query: 407 AIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
IGFYCR DF+ +K+ + S+ PLFT + H +
Sbjct: 380 TIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 421
>gi|322785465|gb|EFZ12136.1| hypothetical protein SINV_15051 [Solenopsis invicta]
Length = 505
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 119/369 (32%), Positives = 173/369 (46%), Gaps = 66/369 (17%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
S S +WLLG C+ + L A+ N + EF +DF SR
Sbjct: 80 SKESPVWLLGQCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 139
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR-PWR-KPLQKPFDRE 205
+ ++YR+ F + S T+D GWGCMLRS QM++AQAL+ H LGR WR +P Q +
Sbjct: 140 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRGQWRWRPEQLTDESS 199
Query: 206 YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRA 259
+ I+ FGD T SPFSIH L+ G + G AG W GP + +C++ E RA
Sbjct: 200 HRMIIKWFGDQLTPESPFSIHKLVVLGASTGKRAGDWYGPSSVAHLLCQAME------RA 253
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
+ +A+YV + V C D R ++LLVPL L
Sbjct: 254 SEDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGRRKA------------LILLVPLRL 301
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ------- 372
G +K+NP Y P L T +G++GG+P S Y +G Q++ I+LDPH Q
Sbjct: 302 GADKLNPVYAPCLTALLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQNEFYFRI 361
Query: 373 --------PVINIGKD-DLEAD----TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
P + I + D+E + +++H R + L +DPS +GFY DK+
Sbjct: 362 LLSITDSLPYLFIQETVDVEGNEKFPLTSFHCTSPRKMLLSKMDPSCCVGFYFPDKESLT 421
Query: 420 DFCARASKL 428
DF ++
Sbjct: 422 DFMETIQRI 430
>gi|57101974|ref|XP_542069.1| PREDICTED: cysteine protease ATG4D isoform 1 [Canis lupus
familiaris]
Length = 473
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 181/384 (47%), Gaps = 70/384 (18%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGPGLGPSEPAGLASPNRYRGPAR 192
Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 248
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 249 -----SLVAHILRKAVESCSEITRLVVYVSQDCTVYKADVARLVARPDPT---------- 293
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 294 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 353
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 354 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSEL 411
Query: 426 SKLAEESNGA---PLFTVTQTHKK 446
+++ S+ P+FT+ + H +
Sbjct: 412 TRVLSSSSATERYPMFTLAEGHAQ 435
>gi|344282757|ref|XP_003413139.1| PREDICTED: cysteine protease ATG4D-like [Loxodonta africana]
Length = 473
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 181/382 (47%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S ++L G ++ E+ GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSK-ISSVYLCGHRYRF---ESEGD------IQRFQRDFMSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPLQ 199
+TSD GWGCMLRS QML+AQ LL H L R W R P +
Sbjct: 133 GCLTSDCGWGCMLRSGQMLLAQGLLLHFLPRDWTWAEGSGLGPPELSGSASPSRYRGPAR 192
Query: 200 K----------PFDREY--VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+ ++E+ +I+ F D +PF +H L+ G++ G AG W GP
Sbjct: 193 RVPPHWAQCTPELEQEHWHRQIVSWFADHPQAPFGLHRLVALGQSSGKKAGDWYGP---- 248
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D +A+
Sbjct: 249 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDP----------KAE 295
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 356 PHYCQPSVDVSQADFSLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTR 413
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 414 VLSSSSATERYPMFTLAEGHAQ 435
>gi|410226434|gb|JAA10436.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410263516|gb|JAA19724.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410294648|gb|JAA25924.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410328737|gb|JAA33315.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
Length = 474
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436
>gi|402904206|ref|XP_003914938.1| PREDICTED: cysteine protease ATG4D isoform 1 [Papio anubis]
Length = 474
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436
>gi|157126425|ref|XP_001660889.1| hypothetical protein AaeL_AAEL010516 [Aedes aegypti]
gi|108873276|gb|EAT37501.1| AAEL010516-PA [Aedes aegypti]
Length = 583
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 112/334 (33%), Positives = 154/334 (46%), Gaps = 65/334 (19%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
+ F +DF +R+ ++YRK F + DS TSD GWGCM+RS QML+AQ LL H LGR WR
Sbjct: 167 IEAFKRDFVTRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLLVHFLGRNWRW 226
Query: 196 ----KPLQKPF------DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 243
+ L+ + D + +I+ FGD S TSPFSIH L+ GK G G W GP
Sbjct: 227 DATAESLRMNYHSLNYEDNVHRKIIRWFGDTSSRTSPFSIHTLVALGKETGKKPGDWYGP 286
Query: 244 YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC---IDDASRHCSV 300
++A R L Q + D DG C I D C+V
Sbjct: 287 -------GSVAHLLRQAVKLAAQEI--------SDLDGVNVYVAQDCAVYIQDIIDECTV 331
Query: 301 FS---------------------------------KGQADWTPILLLVPLVLGLEKVNPR 327
+ W ++LLVPL LG EK+NP
Sbjct: 332 SAGPTLAPWQKKSPGSSSSSTTSTSNSNPTTSSSTDSTDHWKSLILLVPLRLGAEKLNPI 391
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
Y L+ + +GI+GG+P S Y VG QE+ I+LDPH Q ++++ + +
Sbjct: 392 YSDCLKAMLSLDNCIGIIGGRPKHSLYFVGFQEDKLIHLDPHYCQDMVDVVNQE-NFPVA 450
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
++H R + L +DPS IGFYC + DF F
Sbjct: 451 SFHCKSPRKMKLSKMDPSCCIGFYCETRKDFFKF 484
>gi|397476490|ref|XP_003809632.1| PREDICTED: cysteine protease ATG4D isoform 1 [Pan paniscus]
Length = 474
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 415 VLSSSSAMERYPMFTLAEGHAQ 436
>gi|296232881|ref|XP_002761778.1| PREDICTED: cysteine protease ATG4D [Callithrix jacchus]
Length = 474
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPF---------- 202
+TSD GWGCMLRS QM++AQ LL H L R W L P
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGPASPSWYHGPAR 193
Query: 203 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPCWAQGAPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D S A+
Sbjct: 250 ---SLVAHILRKAVESSSEVTRLLVYVSQDCTVYKADVARLVARPDPS----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WNSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + + + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S P+FT+ + H +
Sbjct: 415 VLSSSAATERYPMFTLAEGHAQ 436
>gi|395512609|ref|XP_003760528.1| PREDICTED: cysteine protease ATG4D [Sarcophilus harrisii]
Length = 453
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 168/361 (46%), Gaps = 56/361 (15%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
G + F +DF SR+ ++YR+ F P+ +TSD GWGCMLRS QML+AQ LL H R
Sbjct: 84 GEGDIQRFQRDFVSRLWLTYRRDFPPLEGGSLTSDCGWGCMLRSGQMLLAQGLLLHFFSR 143
Query: 193 PW-----------RKPL---------------------QKPFDRE--YVEILHLFGDSET 218
W R+P + F++E + I+ F D
Sbjct: 144 DWTWSEAVLHPGPREPELLRTMSPSRVGPPGPPAGALSPREFEQEEQHRRIVSWFADQPG 203
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 278
+PF +H L++ G++ G AG W GP +A R + + +YV
Sbjct: 204 APFGLHRLVELGRSSGKRAGDWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDC 256
Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+ A +V D S +W I++LVP+ LG E +NP Y+P ++
Sbjct: 257 TVYKADVAQLVAQPDPS----------TEWKSIVILVPVRLGGETLNPVYVPCVKELLRL 306
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
+GI+GGKP S Y +G Q++ +YLDPH QP ++ ++ + ++H R +
Sbjct: 307 ELCIGIIGGKPRHSLYFIGYQDDFLLYLDPHYCQPFVDTSQESFPLE--SFHCTSPRKMA 364
Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHSDVLG 455
+DPS IGFY ++ +F+ C +++ S+ P+FT+++ H + +V
Sbjct: 365 FSRMDPSCTIGFYAGNRKEFELLCLELTRVLNSSSATERYPMFTLSEGHAQEYGLEEVCS 424
Query: 456 E 456
+
Sbjct: 425 Q 425
>gi|417401539|gb|JAA47652.1| Putative cysteine protease required for autophagy [Desmodus
rotundus]
Length = 473
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 180/384 (46%), Gaps = 70/384 (18%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + E+ GD + F +DF SR+ ++YR+ F P
Sbjct: 83 SRTRFSKISS----VHLCGRRYCFESEGD------IQRFQRDFVSRLWLTYRRDFPPFAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWARGASLSPPEPSGLASSNRYRGPAH 192
Query: 195 --------RKP-LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
R P L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 CMTPCWAQRAPELEQ--ERRHRQIVSWFADHPQAPFGLHQLVELGQSSGKKAGDWYGP-- 248
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 249 -----SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT---------- 293
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 294 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 353
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 354 LDPHYCQPAVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 411
Query: 426 SKLAEESNGA---PLFTVTQTHKK 446
+++ S+ P+FT+ + H +
Sbjct: 412 TRVLSSSSTTERYPMFTLAEGHAQ 435
>gi|397476492|ref|XP_003809633.1| PREDICTED: cysteine protease ATG4D isoform 2 [Pan paniscus]
Length = 411
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 181/387 (46%), Gaps = 68/387 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351
Query: 428 LAEESNGA---PLFTVTQTHKKPVNHS 451
+ S+ P+FT+ + H + +HS
Sbjct: 352 VLSSSSAMERYPMFTLAEGHAQ--DHS 376
>gi|402904208|ref|XP_003914939.1| PREDICTED: cysteine protease ATG4D isoform 2 [Papio anubis]
Length = 411
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 130
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ 373
>gi|114675367|ref|XP_512373.2| PREDICTED: cysteine protease ATG4D [Pan troglodytes]
Length = 411
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ 373
>gi|195051960|ref|XP_001993206.1| GH13687 [Drosophila grimshawi]
gi|193900265|gb|EDV99131.1| GH13687 [Drosophila grimshawi]
Length = 393
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 106/364 (29%), Positives = 177/364 (48%), Gaps = 39/364 (10%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +++WLLG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNANVWLLGKRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFVPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 223
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D+ S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTPDCRDTTYLKIVNRFEDTRKSFYSI 148
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G++ A G W+GP + + + L R SL + + + S
Sbjct: 149 HQIALMGESQNKAVGEWLGPNTVAQILKILVRFD------DWSSLNVHVAMDS------- 195
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V +DD C ++ W P+LL+VPL LG+ +NP Y+P L+ S G
Sbjct: 196 ----TVVLDDIFTLCQ--EPSESAWKPLLLIVPLRLGISDINPIYVPALKRCLELNSSCG 249
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 400
++GG+P + Y +G ++ +YLDPH Q + + A+ +YH +
Sbjct: 250 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGAVAQKTTAAEQELDESYHQKYAARLSFA 309
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGV 460
++DPSLA+ F C+ +D F++ + + LF ++Q+ + +D + E +
Sbjct: 310 AMDPSLAVCFLCKTRDSFNELLQQLRQEVLSLCTPALFEISQSRAVDWDTADDI-EWPAM 368
Query: 461 PEDD 464
P+ D
Sbjct: 369 PDID 372
>gi|297276108|ref|XP_002801111.1| PREDICTED: cysteine protease ATG4D-like isoform 2 [Macaca mulatta]
Length = 497
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 107 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 156
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 157 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 216
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 217 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 272
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 273 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 319
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 320 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 379
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 380 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 437
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 438 VLGSSSATERYPMFTLAEGHAQ 459
>gi|327306465|ref|XP_003237924.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
gi|326460922|gb|EGD86375.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
Length = 454
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 160/318 (50%), Gaps = 58/318 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 174
+F DF S++ I+YR F PI GDS I TSD GWGCM+
Sbjct: 119 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSISLGVRLRSQLIDTQGFTSDTGWGCMI 178
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G A
Sbjct: 179 RSGQALLANTLLFIRLGRDWRRGSKL---QEESELVSLFADHPRAPFSIHRFVHHGATAC 235
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 292
G G W GP A + +AL + + GL +Y+ S G + E+ V C +
Sbjct: 236 GKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVACDE 287
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
P L+L+ + LG+++V P Y +L+ FPQS+GI GG+P +S
Sbjct: 288 SGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGRPSSS 335
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHLDSID 403
Y + Q +S YLDPH +P + + D E+ + STYH+ +R +H+ +D
Sbjct: 336 HYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHIREMD 395
Query: 404 PSLAIGFYCRDKDDFDDF 421
PS+ IGF RD+DD++D
Sbjct: 396 PSMLIGFLVRDEDDWEDL 413
>gi|109123366|ref|XP_001101860.1| PREDICTED: cysteine protease ATG4D-like isoform 1 [Macaca mulatta]
Length = 474
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 415 VLGSSSATERYPMFTLAEGHAQ 436
>gi|432099562|gb|ELK28703.1| Cysteine protease ATG4D, partial [Myotis davidii]
Length = 392
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 114/392 (29%), Positives = 187/392 (47%), Gaps = 69/392 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + E+ GD + F +DF+SR+ ++YR+ F P+
Sbjct: 5 SRTSFSKISS----VHLCGRRYCFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 54
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 55 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGAGLSPPEPSGLASPNRHHGLAH 114
Query: 196 -KPLQ-----KPFDREYV--EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
KP + ++E+ +I+ F D +PF +H L++ G+++G AG W GP
Sbjct: 115 WKPPRWAQGAPELEQEHWHRQIVSWFADHPQAPFGLHQLVELGQSWGKKAGDWYGP---- 170
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 171 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDCT----------AE 217
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++S +YLD
Sbjct: 218 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDSLLYLD 277
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + + ++H R + +DPS +GFY ++ +F+ C+ ++
Sbjct: 278 PHYCQPTVDVSQAGFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGNRKEFETLCSELTR 335
Query: 428 LAEESNGA---PLFTVTQTHKKPVNHS-DVLG 455
+ S P+FT+ + H + +HS D LG
Sbjct: 336 VLSSSAATQRYPMFTLAEGHAQ--DHSLDNLG 365
>gi|380796527|gb|AFE70139.1| cysteine protease ATG4D, partial [Macaca mulatta]
Length = 439
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 49 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 98
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 99 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 158
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 159 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 214
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 215 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 261
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 262 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 321
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 322 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 379
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 380 VLGSSSATERYPMFTLAEGHAQ 401
>gi|195158262|ref|XP_002020011.1| GL13755 [Drosophila persimilis]
gi|194116780|gb|EDW38823.1| GL13755 [Drosophila persimilis]
Length = 678
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 114/318 (35%), Positives = 166/318 (52%), Gaps = 21/318 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310
Query: 183 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR L + D + +I+ FGD S++SPFSIH L++ G+ G
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430
Query: 288 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V A R + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
I+GGKP S Y VG QE+ I+LDPH Q +++I ++ ++H R + + +D
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQEHF--SLHSFHCKSARKLKVSKMD 548
Query: 404 PSLAIGFYCRDKDDFDDF 421
PS IGFYC K DFD F
Sbjct: 549 PSCCIGFYCATKTDFDSF 566
>gi|332266032|ref|XP_003282019.1| PREDICTED: cysteine protease ATG4B [Nomascus leucogenys]
Length = 518
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 172/367 (46%), Gaps = 42/367 (11%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 145 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 191
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 192 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 251
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 252 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 303
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 304 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 363
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 364 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 423
Query: 390 HSDVIRHIHLDSIDPSLAI--GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
+ + +DPS+A+ G + + + C +L+ P+F + +
Sbjct: 424 CQHPPCRMSIAELDPSIAVVRGGHRSTQAFCAECCLGMKQLSLLGGALPMFELVEQQPSH 483
Query: 448 VNHSDVL 454
+ DVL
Sbjct: 484 LACPDVL 490
>gi|390177147|ref|XP_001357920.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
gi|388858923|gb|EAL27056.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
Length = 676
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 114/318 (35%), Positives = 166/318 (52%), Gaps = 21/318 (6%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
A + +G+ G+ F +DF SR+ ++YR+ F + S TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310
Query: 183 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 235
Q L+ H LGR WR L + D + +I+ FGD S++SPFSIH L++ G+ G
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370
Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
G W GP ++ + AL + S+ +A IY+ ++ E P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430
Query: 288 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V A R + Q W +++L+PL LG +K+NP Y L+L + LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
I+GGKP S Y VG QE+ I+LDPH Q +++I ++ ++H R + + +D
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQEHF--SLHSFHCKSARKLKVSKMD 548
Query: 404 PSLAIGFYCRDKDDFDDF 421
PS IGFYC K DFD F
Sbjct: 549 PSCCIGFYCATKTDFDSF 566
>gi|27903825|ref|NP_116274.3| cysteine protease ATG4D [Homo sapiens]
gi|61211809|sp|Q86TL0.1|ATG4D_HUMAN RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
cysteine endopeptidase; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related cysteine endopeptidase
4; AltName: Full=Autophagy-related protein 4 homolog D
gi|27763975|emb|CAC85951.1| APG4-D protein [Homo sapiens]
gi|46362497|gb|AAH68992.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [Homo sapiens]
gi|119604524|gb|EAW84118.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_c
[Homo sapiens]
gi|312151144|gb|ADQ32084.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
construct]
Length = 474
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436
>gi|194378178|dbj|BAG57839.1| unnamed protein product [Homo sapiens]
Length = 411
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 180/387 (46%), Gaps = 68/387 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 21 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 71 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + +YV + A +V D + A+
Sbjct: 187 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRRMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351
Query: 428 LAEESNGA---PLFTVTQTHKKPVNHS 451
+ S+ P+FT+ + H + +HS
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ--DHS 376
>gi|62898327|dbj|BAD97103.1| APG4 autophagy 4 homolog D variant [Homo sapiens]
Length = 474
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + +YV + A +V D + A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 297 WMSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436
>gi|326924562|ref|XP_003208495.1| PREDICTED: cysteine protease ATG4A-like, partial [Meleagris
gallopavo]
Length = 421
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 170/356 (47%), Gaps = 52/356 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGRRHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 161
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 293 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 328
D + C S + + W P+LL++PL LG+ +NP Y
Sbjct: 205 DIKKMCWSPPQSSSTAHSSAHLHRSALGRNRNTAGLCTGWKPLLLIIPLRLGINHINPVY 264
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 265 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 324
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + + ++DPS+A+GF+C+++ DFD++C+ K + +F + Q H
Sbjct: 325 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLQMFELVQKH 380
>gi|327277326|ref|XP_003223416.1| PREDICTED: cysteine protease ATG4A-like [Anolis carolinensis]
Length = 385
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 108/347 (31%), Positives = 171/347 (49%), Gaps = 35/347 (10%)
Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
W+LG H++ +++ + D S+R+ +YR+ F PIG + +SD GWGCM
Sbjct: 17 WILGRQHQLKTEKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 65
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
LR QM++AQAL+ LGR W K EY IL F D + +SIH + Q G
Sbjct: 66 LRCGQMMLAQALICRHLGRDWHWEEHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGE 125
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGER------ 283
G + G W GP + + + LA + +A+YV + ED ++
Sbjct: 126 GKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCRLPN 177
Query: 284 GGAPVVCIDDASRHCSVFSKGQAD------WTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
P V H S+ S+ ++ W P+LL++PL LG+ +NP Y+ + F
Sbjct: 178 QNCPPVAHCSPLSHQSLLSRNRSPGGFCCGWKPLLLIIPLRLGINHINPVYVDAFKECFK 237
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 397
PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S + +
Sbjct: 238 MPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQLFVDSEENSTVDDRSFHCQQAPHRM 297
Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ ++DPS+A+GF+C+++ DFD +C+ K + +F + Q H
Sbjct: 298 KIMNLDPSVALGFFCKEEKDFDTWCSLVQKEIHKQQSLRMFELIQKH 344
>gi|351710014|gb|EHB12933.1| Cysteine protease ATG4D [Heterocephalus glaber]
Length = 607
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 216 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 265
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 266 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWIEGPGLAHPELPGSASSSQGRGPAR 325
Query: 198 ----------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
L++ + + +I+ F D +P +H L++ G++ G AG W GP
Sbjct: 326 WMPPSCPWGALEREQELRHRQIVSWFADHPRAPLGLHRLVELGQSSGKKAGDWYGP---- 381
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + +A+YV + A +V D + A+
Sbjct: 382 ---SLVAHILRKAVESSSELTHLAVYVSQDCTVYKADVAHLVASPDPA----------AE 428
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLD
Sbjct: 429 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 488
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY ++ + + C+ ++
Sbjct: 489 PHYCQPTVDVSQADFSLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKELETLCSELTR 546
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 547 ILSSSSATERYPMFTLVEGHAQ 568
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 39/93 (41%), Positives = 52/93 (55%), Gaps = 10/93 (10%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S I+L G ++ G + F +DF SR+ ++YR+ F P+
Sbjct: 130 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 179
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 180 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGW 212
>gi|453080987|gb|EMF09037.1| putative cysteine protease atg4 [Mycosphaerella populorum SO2202]
Length = 447
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 158/332 (47%), Gaps = 49/332 (14%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 175
++F DF SRI I+YR GF PI S TSD GWGCM+R
Sbjct: 110 SDFIDDFESRIWITYRDGFPPIAKSTDPAAGSKMSFTTKLRSLTNQQGFTSDTGWGCMIR 169
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
S Q L+A +L HRLGR WRK ++ E+ IL LF D+ +PFSIH ++ G +A G
Sbjct: 170 SGQSLLANTILLHRLGRDWRKGQKQ---EEHKNILSLFADTPEAPFSIHKFVEHGAQACG 226
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A ARC RA T + +Y D D DA
Sbjct: 227 TYPGEWFGP-------NATARCLRALTD-KYHGAGLRVYARPNDSD---------VYADA 269
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ + P L+++ + LG+EKV Y L+ PQS+GI GG+P +S Y
Sbjct: 270 LIETATQKDADDKFQPTLIVLGIRLGIEKVTSAYHVALKAALELPQSVGIAGGRPSSSHY 329
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
+G Q +S YLDPH + +++ D T H+ IR + L +DPS+ +GF R
Sbjct: 330 FLGHQGDSFFYLDPHTTRHMLSPQPS--AEDIETCHTRRIRKLPLSEMDPSMLLGFLVRS 387
Query: 415 KDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
+++F+++ K E G + + +T K
Sbjct: 388 QEEFEEW----RKAVLEMPGKAIIHIHETEPK 415
>gi|449273759|gb|EMC83168.1| Cysteine protease ATG4A, partial [Columba livia]
Length = 395
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 168/356 (47%), Gaps = 52/356 (14%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178
Query: 293 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 328
D + C +G W P+LL++PL LG+ +NP Y
Sbjct: 179 DIKKMCWSPPQGSGAAHSSAHLHRSALGRTKNAAGFCTGWKPLLLIIPLRLGINHINPVY 238
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D S
Sbjct: 239 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDESF 298
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ + + ++DPS+A+GF+C+++ DFD++C+ K + +F + Q H
Sbjct: 299 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 354
>gi|449498615|ref|XP_002197397.2| PREDICTED: cysteine protease ATG4A [Taeniopygia guttata]
Length = 412
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 109/358 (30%), Positives = 170/358 (47%), Gaps = 56/358 (15%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + D++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W+ K EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQAD--------------------------WTPILLLVPLVLGLEKVNP 326
D + C +S Q+ W P+LL++PL LG+ +NP
Sbjct: 181 DIKKMC--WSPAQSSSVAHSSAHVHRSALGQNKNTAGLCPGWKPLLLIIPLRLGINHINP 238
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
YI + F PQSLG +GGKP + Y +G IYLDPH Q ++ ++ D
Sbjct: 239 VYIDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDK 298
Query: 387 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
S + + + ++DPS+A+GF+C+++ DFD++C+ K + +F + Q H
Sbjct: 299 SFHCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 356
>gi|350631770|gb|EHA20141.1| hypothetical protein ASPNIDRAFT_178675 [Aspergillus niger ATCC
1015]
Length = 384
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 121/378 (32%), Positives = 180/378 (47%), Gaps = 54/378 (14%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 151
+RI + + P S IW LG+ + +D A + F DF SRI ++
Sbjct: 11 KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKD-ANTRETQHAWPESFLLDFESRIWMT 69
Query: 152 YRKGFDPI----GDSK-------------------ITSDVGWGCMLRSSQMLVAQALLFH 188
YR F PI GD K TSD GWGCM+RS Q L+A AL
Sbjct: 70 YRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTGWGCMIRSGQSLLANALSML 129
Query: 189 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMC 247
LGR WR+ + F+ E ++L LF D+ T+PFS+H ++ G ++ G G W GP A
Sbjct: 130 VLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKYPGEWFGPSATA 186
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+ EAL+ C + + +YV + + + D +R+ S
Sbjct: 187 KCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK-----FMDIARNTS------GA 227
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
+ P L+L+ LG++ + P Y L+ FPQS+GI GG+P AS Y VG Q YLD
Sbjct: 228 FQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGRPSASHYFVGAQGSHLFYLD 287
Query: 368 PHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
PH +P + G+ + + TYH+ +R IH+ +DPS+ IGF R+++D+ D+ R
Sbjct: 288 PHYTRPALPDRQEGELYSKEEVDTYHTRRLRRIHVRDMDPSMLIGFLIRNQEDWADWLKR 347
Query: 425 ASKLAEESNGAPLFTVTQ 442
E G P+ V +
Sbjct: 348 ----IEAVKGRPIIHVLK 361
>gi|334326299|ref|XP_001366933.2| PREDICTED: cysteine protease ATG4D [Monodelphis domestica]
Length = 482
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 105/398 (26%), Positives = 176/398 (44%), Gaps = 77/398 (19%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+S S + + +C + Q E GD + F +DF+SR+ ++YR+ F P+ +TSD
Sbjct: 79 TSFSKLSTVHLCGRRYQFEGEGD------IQRFQKDFASRLWLTYRRDFPPLDGGSLTSD 132
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------------- 195
GWGCMLRS QML+AQ LL H R W
Sbjct: 133 CGWGCMLRSGQMLLAQGLLLHFFSRDWTWAEAVLPPSPRESELFRSMSPSRSGASWQRGS 192
Query: 196 -----------------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 238
P Q + ++ I+ F D +PF +H L++ G++ G AG
Sbjct: 193 STASGLGRATWSTGGTLSPRQLEQEEQHRRIVSWFADQPGAPFGLHRLVELGRSSGKRAG 252
Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
W GP +A R + + +YV + A ++ D S
Sbjct: 253 DWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDCTVYKADVAQLMAQPDPS--- 302
Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
+W +++LVP+ LG E +NP Y+P ++ +GI+GGKP S Y +G
Sbjct: 303 -------TEWKSVIILVPVRLGGETLNPVYVPCVKELLRLDLCIGIIGGKPRHSLYFIGY 355
Query: 359 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
Q++ +YLDPH QP ++ ++ + ++H R + +DPS IGFY ++ +F
Sbjct: 356 QDDFLLYLDPHYCQPCVDTSQERFPLE--SFHCTSPRKMAFSRMDPSCTIGFYAGNRKEF 413
Query: 419 DDFCARASKLAEESNGA---PLFTVTQTHKKPVNHSDV 453
+ C +++ S+ P+FT+++ H + + +V
Sbjct: 414 EMLCLELTRVLNSSSATERYPMFTLSEGHAQEYSLEEV 451
>gi|395840680|ref|XP_003793181.1| PREDICTED: cysteine protease ATG4C [Otolemur garnettii]
Length = 457
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 123/422 (29%), Positives = 181/422 (42%), Gaps = 80/422 (18%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ ++E L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENEMLSARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPGALNIENSDSESWTSHTVK 155
Query: 198 ---------------LQKP-------------FDREYVEILH-----LFGDSETSPFSIH 224
L+ P + EI H FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETMRKYSDYHETRNEIYHRKIVSWFGDSPLAFFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + + G + IYV +D
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEAKHPDLQG-----ITIYVA---QDCTVY 267
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
+ V+ ASR S+G D +++LVP+ LG E+ NP Y+ ++ + +GI
Sbjct: 268 NSDVIDTQSASRT----SEGAED-KAVIILVPVRLGGERTNPDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH QP +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQPFVDVSVKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKKPVNHSDVLGETGGVPE 462
S IGFYCR+ DF+ +K+ S PLFT H + + + E
Sbjct: 381 SCTIGFYCRNVQDFERTSEEITKMLRISAKEKYPLFTFVNGHSRDYDFTSTTTNEDLFSE 440
Query: 463 DD 464
D+
Sbjct: 441 DE 442
>gi|340931831|gb|EGS19364.1| cysteine protease-like protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 494
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 120/363 (33%), Positives = 173/363 (47%), Gaps = 65/363 (17%)
Query: 127 ALGDAAGNNG---LAEFNQDFSSRILISYRKGF-------DPIGDSKIT----------- 165
A GDA G F DF SRI ++YR GF DP S ++
Sbjct: 139 AYGDADGTTDGGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSALSFSMRLKTSFGA 198
Query: 166 ------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 219
SD GWGCM+RS Q L+A ALL RLGR WR+ +RE IL LF D +
Sbjct: 199 DQAGFSSDTGWGCMIRSGQSLLANALLISRLGREWRRGQNPKAERE---ILSLFADDPRA 255
Query: 220 PFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 278
P+S+HN ++ G +A G G W GP A R +ALA +E + +Y
Sbjct: 256 PYSLHNFVKHGAEACGKFPGEWFGPSATARCIQALANKHESE---------LRVYST--- 303
Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
G P V D ++ + + P L+LV LG++K+N Y L T
Sbjct: 304 -----GDLPDVYEDS---FMAIANPDGQHFHPTLVLVCTRLGIDKINKVYEQALISTLQM 355
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIR 395
QS+GI GG+P S Y +GVQ++ YLDPH +P++ +D + + + H+ +R
Sbjct: 356 EQSIGIAGGRPSQSHYFIGVQDQWLFYLDPHYPRPMLPYRENPEDYTQEEVDSCHTRRLR 415
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
H+H++ +DPS+ IGF +D+DD+D + + + G + TV+ H LG
Sbjct: 416 HLHVEDLDPSMLIGFLIKDEDDWDTWKSAVKHV----QGKAIITVSP-------HDPALG 464
Query: 456 ETG 458
TG
Sbjct: 465 GTG 467
>gi|334350077|ref|XP_001376474.2| PREDICTED: cysteine protease ATG4A-like [Monodelphis domestica]
Length = 417
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 182/373 (48%), Gaps = 25/373 (6%)
Query: 82 VKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 141
V V G R I GP + + +W+LG + + A ++
Sbjct: 19 VTLCVFPGVKRHITILSDGPEE--LPETDEPVWILGKQYDLQ--------AVITEKSKLL 68
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
D S+R+ +YR+ F PIG + +SD GWGCMLR QM++AQAL+ LGR W +Q+
Sbjct: 69 SDISARLWFTYRRKFSPIGGTGPSSDSGWGCMLRCGQMMLAQALICKHLGRDWCWEMQQE 128
Query: 202 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEA 252
EY IL F D + +SIH + Q G G + G W GP A+ W +
Sbjct: 129 QPEEYHRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNS 188
Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 312
LA + + + + ++ + +P +D S H S G W P+L
Sbjct: 189 LAVYVSMDNTVVIEDIKKLCHMCPSHLTHDSSPSPGNGLDQ-STHLPEPSPG---WKPLL 244
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
L++PL LG+ ++NP YI + F PQSLG +GGKP ++ Y +G IYLDPH Q
Sbjct: 245 LIIPLRLGINQINPVYIDAFKECFKMPQSLGALGGKPNSAYYFIGFLGNELIYLDPHTTQ 304
Query: 373 PVINIGKDDLEADTSTYHSDVIRH-IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
++ ++D D ++H H + + ++DPS+A+GF+ ++++DFD++C K +
Sbjct: 305 TFVD-SEEDGTVDDQSFHCQQSPHRMQILNLDPSVALGFFFKEEEDFDNWCRLVQKEILK 363
Query: 432 SNGAPLFTVTQTH 444
+F + Q H
Sbjct: 364 PQSLQMFELVQKH 376
>gi|225685095|gb|EEH23379.1| peptidase family C54 [Paracoccidioides brasiliensis Pb03]
Length = 508
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 164/345 (47%), Gaps = 51/345 (14%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVG 169
G++ A F DF S+I ++YR GF I S T+D G
Sbjct: 138 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 197
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A AL LGR WR+ + D+E +L LF D +PFSIH ++
Sbjct: 198 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 254
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A R +AL+ C+ + +YV S D
Sbjct: 255 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 298
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+D R + +A P L+L+ + LG+++V P Y L+ +PQS+GI GG+
Sbjct: 299 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 357
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 405
P +S Y +G Q YLDPH +P + G+ E + ++YH+ +R +H+ +DPS
Sbjct: 358 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 417
Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNH 450
+ IGF +D+DD+ D+ +A G + V+ P H
Sbjct: 418 MLIGFLIKDEDDWADWKRNVGSVA----GKAIVHVSDKENSPFGH 458
>gi|406862068|gb|EKD15120.1| putative cysteine protease atg4 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 441
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 111/307 (36%), Positives = 153/307 (49%), Gaps = 50/307 (16%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF S+I ++YR F I S+ TSD GWGCM+RS
Sbjct: 103 FLDDFESKIWLTYRSQFPAIPKSQDPKALSSMSLSVRLRSQLVDQAGFTSDTGWGCMIRS 162
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A AL+ R+GR WR+ +E I+ LF D+ T+P+SIHN ++ G A G
Sbjct: 163 GQSLLANALVMLRMGRDWRR--GSSASQEERSIISLFADTPTAPYSIHNFVEHGAAACGK 220
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGDEDGERGGAPVVCIDDA 294
G W GP A R +ALA G QS + +YV G E E + D
Sbjct: 221 HPGEWFGPSATARCIQALAN--------GHQSPELRVYVTGDGLEVYEDSFMKIAKPD-- 270
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
GQA + P L+LV LGL+K+ P Y L+ + PQSLGI GG+P +S Y
Sbjct: 271 ---------GQA-FIPTLILVGTRLGLDKITPVYWEALKSSLQIPQSLGIAGGQPSSSHY 320
Query: 355 IVGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
+GVQ YLDPH +P + + +D + D + H+ +R IH+ +DPS+ I F
Sbjct: 321 FIGVQGHHFFYLDPHQTRPALPLPDNIEDYSQEDIDSCHTRRLRRIHIKEMDPSMLIAFL 380
Query: 412 CRDKDDF 418
RD+DD+
Sbjct: 381 IRDEDDW 387
>gi|315047608|ref|XP_003173179.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
gi|311343565|gb|EFR02768.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
Length = 471
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 58/324 (17%)
Query: 139 EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 174
+F DF SR+ I+YR F PI DS + TSD GWGCM+
Sbjct: 136 QFLDDFESRLWITYRSQFPPIPKMPKTGSSDSSMPLGVRLRSQLIDTQGFTSDTGWGCMI 195
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH +Q G A
Sbjct: 196 RSGQALLANTLLFLRLGRDWRRGSKI---QEESELVSLFADHPRAPFSIHRFVQHGATAC 252
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 292
G G W GP A + +AL + + GL +YV + G + ER V C +
Sbjct: 253 GKCPGEWFGPSAAAQCIQALVKSN-PQAGL-------RVYVTNDGSDIYERQFREVACDE 304
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
S P L+L+ + LG+++V P Y +L+ +PQS+GI GG+P +S
Sbjct: 305 SGS------------IKPTLILLGVRLGIDRVTPIYWDSLKALLHYPQSVGIAGGRPSSS 352
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---------ADTSTYHSDVIRHIHLDSID 403
Y + Q +S YLDPH +P + + E + STYH+ +R +H+ +D
Sbjct: 353 HYFIATQGDSFFYLDPHQTRPCLAPRSEPTEDEESHPYSPEELSTYHTRRLRRLHVREMD 412
Query: 404 PSLAIGFYCRDKDDFDDFCARASK 427
PS+ IG RD+ D++D +R +
Sbjct: 413 PSMLIGLLVRDEGDWEDLKSRVKE 436
>gi|145245643|ref|XP_001395089.1| cysteine protease atg4 [Aspergillus niger CBS 513.88]
gi|166990612|sp|A2QY50.1|ATG4_ASPNC RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|134079795|emb|CAK40930.1| unnamed protein product [Aspergillus niger]
Length = 404
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 122/397 (30%), Positives = 181/397 (45%), Gaps = 72/397 (18%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 139
+RI + + P S IW LG+ + +D + N E
Sbjct: 11 KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKDANTRETPDKNNTRENVMGTTNYRKPS 70
Query: 140 -------FNQDFSSRILISYRKGFDPI----GDSK-------------------ITSDVG 169
F DF SRI ++YR F PI GD K TSD G
Sbjct: 71 EHAWPESFLLDFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTG 130
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A AL LGR WR+ + F+ E ++L LF D+ T+PFS+H ++
Sbjct: 131 WGCMIRSGQSLLANALSMLVLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKH 187
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G ++ G G W GP A + EAL+ C + + +YV + + +
Sbjct: 188 GAESCGKYPGEWFGPSATAKCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK--- 236
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
D +R+ S + P L+L+ LG++ + P Y L+ FPQS+GI GG+
Sbjct: 237 --FMDIARNTS------GAFQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGR 288
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
P AS Y VG Q YLDPH +P + G+ + + TYH+ +R IH+ +DPS
Sbjct: 289 PSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVDTYHTRRLRRIHVRDMDPS 348
Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
+ IGF R+++D+ D+ R E G P+ V +
Sbjct: 349 MLIGFLIRNQEDWADWLKR----IEAVKGRPIIHVLK 381
>gi|119591686|gb|EAW71280.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_c
[Homo sapiens]
Length = 354
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 155/319 (48%), Gaps = 40/319 (12%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300
Query: 390 HSDVIRHIHLDSIDPSLAI 408
+ + +DPS+A+
Sbjct: 301 CQHPPCRMSIAELDPSIAV 319
>gi|348529755|ref|XP_003452378.1| PREDICTED: cysteine protease ATG4C-like [Oreochromis niloticus]
Length = 478
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 120/416 (28%), Positives = 179/416 (43%), Gaps = 84/416 (20%)
Query: 108 SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLA-----EFNQDFSSRILISYRKGFDPIGD 161
S S + LLG C H A+DE A L F +DF+SR+ ++YR+ F P+
Sbjct: 36 SRNSPVLLLGKCYHFKAEDEESPTEASVEDLVMGDVDAFRRDFASRVWLTYREEFSPLPG 95
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE------------- 205
S +TSD GWGCMLR+ QM++AQ L+ H LGR W + L +P D E
Sbjct: 96 STLTSDCGWGCMLRAGQMMLAQGLMLHFLGRDWTWSEALTLQPLDTETWTTTAAKRLVAS 155
Query: 206 ---------------------------------------YVEILHLFGDSETSPFSIHNL 226
+ ++ FGDS ++P +H L
Sbjct: 156 LEASLQGVPGPSVRSSSPQAQALSLGSAEEADAHLKEMYHRTLVSWFGDSPSTPLGLHRL 215
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA--IYVVSGD------ 278
++ G G AG W GP + + + + GL C + ++ V S D
Sbjct: 216 VRLGLTMGKQAGDWYGPAVVAHILKKAVE-EAMDPGLACITAYVSQDCTVYSADVVDCHR 274
Query: 279 ------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
E AP + +D H S + +A +++LVP+ LG EK NP Y
Sbjct: 275 APRAERTSDETPDAPTLPQNDQPAHASTLPESRA----VIILVPVRLGGEKTNPEYFDFA 330
Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSD 392
+ + +GI+GGKP + Y VG Q++S IY+DPH Q +++ D +YH
Sbjct: 331 KSILSLEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSYHCP 388
Query: 393 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKK 446
+ + +DPS +GFY R D++ SKL + S P FT Q H +
Sbjct: 389 SPKKMPFSKMDPSCTVGFYSRSVQDYERISQELSKLLQPSAKEKYPAFTFVQGHGR 444
>gi|320166566|gb|EFW43465.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 336
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/295 (32%), Positives = 149/295 (50%), Gaps = 25/295 (8%)
Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
++YR F I DS +D GWGCMLR QML+A+A+ LG+ W +K +E
Sbjct: 36 MTYRNHFAQIADSYYNTDAGWGCMLRCGQMLLARAMTVQHLGKNWAPTSRKQRHQEMARF 95
Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
L LF D+ +PFSIH + + G+A G G W GP + + + L QR+ + C
Sbjct: 96 LPLFFDTPAAPFSIHRIAERGEALGKTIGQWFGPNTVAQVLKNLVNSQRSSLIVHCA--- 152
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRY 328
V++ E + A + D +H +L+LVP+ LGL + +NP Y
Sbjct: 153 -MDGVLNRTEASTQLAA---ALSDGKKHS------------LLVLVPIRLGLNQSINPVY 196
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ-PVINIGKDDLEADTS 387
IP L+ T PQ LGI+GGKP A+ + VG E+ +YLDPH VQ + + D +E
Sbjct: 197 IPALKATLELPQCLGIIGGKPNAAHFFVGTVNENVLYLDPHVVQDAAMELTPDTVE---- 252
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
++ V+ + + +DPS+ + C + +D R+ ++ + G LF V +
Sbjct: 253 SFSVAVLSKMAISDVDPSMCAAYLCSSVAELEDLGKRSKQITSQFRGYGLFDVIE 307
>gi|242814606|ref|XP_002486401.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
stipitatus ATCC 10500]
gi|218714740|gb|EED14163.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
stipitatus ATCC 10500]
Length = 454
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 110/337 (32%), Positives = 155/337 (45%), Gaps = 51/337 (15%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 175
F DF RI ++YR GF PI S+ TSD GWGCM+R
Sbjct: 117 FLDDFECRIWMTYRSGFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 176
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 234
S Q L+A AL RLGR WR+ E +L LF D +PFSIH ++ G Y G
Sbjct: 177 SGQSLLANALAISRLGRDWRRGSNS---TEENRLLSLFADDPAAPFSIHKFVRHGALYCG 233
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A +AL+ + + G M +YV S + V + +
Sbjct: 234 KHPGEWFGPSATATCIQALSD-EYKDAG-------MNVYVSSDNTYVYEDKFKAVAYNQS 285
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
R P L+L+ LG++++ P Y L PQ+LGI GG+P AS Y
Sbjct: 286 DRM-----------RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQALGIAGGRPSASHY 334
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
+GVQ YLDPH +P + DL + + + H+ +R IH+D +DPS+ +GF
Sbjct: 335 FIGVQNSFFFYLDPHHTRPALPYKTGDLAYTQEEIDSCHTRRLRRIHIDDMDPSMLVGFL 394
Query: 412 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 448
RD++D+ D+ R + E NG + + T P
Sbjct: 395 IRDENDWMDWKRRITSSRPE-NGKAIIHIVDTKNVPT 430
>gi|443730776|gb|ELU16134.1| hypothetical protein CAPTEDRAFT_228011 [Capitella teleta]
Length = 450
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 121/370 (32%), Positives = 171/370 (46%), Gaps = 50/370 (13%)
Query: 111 SDIWLLGVCHKIAQDEALGDAAGNNG------LAEFNQDFSSRILISYRKGFDPIGDSKI 164
S I LLG C+ ++ E N F +DFSS+I +YRK F + S +
Sbjct: 82 SPIILLGKCYCCSKSEKEDQRRQPNNSNILTTFDRFKRDFSSKIWFTYRKDFPKLYGSPL 141
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGD--SETS 219
TSDVGWGCMLR++QM++AQAL+ H LGR W + +E + +I+ LFGD S
Sbjct: 142 TSDVGWGCMLRTAQMIIAQALVMHYLGRDWTIHHTQQNRKETMLHRQIIRLFGDFPGNDS 201
Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
PFSI L++ G +G G W GP ++ YVV
Sbjct: 202 PFSIQALVRIGVDHGKRPGDWYGPASVA-------------------------YVVRDAI 236
Query: 280 DGERGGAPV---VCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPT 331
+ P+ VC+ A C+V+ + D W +++LVP+ LG E +NP Y
Sbjct: 237 NQVPDFHPLLSQVCVYVAP-DCTVYIQDVIDLCTQHWKAVVILVPVRLGGEALNPIYSQC 295
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
++ LGI+GG+P S Y VG QEE +YLDPH Q ++ D TSTYH
Sbjct: 296 VQSLLAHELCLGIIGGRPKHSLYFVGWQEEKLLYLDPHFCQDTVDTRFRDFP--TSTYHC 353
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA---EESNGAPLFTVTQTHKKPV 448
R + L +DPS +GFY F+ KL ++ PLF +
Sbjct: 354 LSPRKLALQKMDPSCTLGFYIPTHAAFNRLVKDMQKLVTPPKDQGIYPLFVFQDGRSIDI 413
Query: 449 NHSDVLGETG 458
HS + E+
Sbjct: 414 EHSHIKPESN 423
>gi|390365223|ref|XP_785967.3| PREDICTED: cysteine protease ATG4B-like [Strongylocentrotus
purpuratus]
Length = 390
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 105/332 (31%), Positives = 159/332 (47%), Gaps = 50/332 (15%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
IW+LG + ++Q + E D SR+ +YRKGF IG + T+D GWGC
Sbjct: 48 IWILGKKYDLSQHQL-----------EARLDVLSRLWFTYRKGFSNIGGTGPTTDQGWGC 96
Query: 173 MLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
MLR QM++AQAL++ LGR WR +P ++ D Y++IL LF D + S FSIH + Q G
Sbjct: 97 MLRCGQMMLAQALVYKHLGRDWRWRPQEQ--DETYLKILQLFLDKKDSCFSIHQIAQMGV 154
Query: 232 AYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
G G W GP + + SW LA + + + + V S E+
Sbjct: 155 GEGKKVGDWFGPNTVGQVIRKLSPFDSWSDLAVHVALDNTVVIEDIRKLCTVNSTTEETS 214
Query: 283 RGGAPV--------------------------VCIDDASRHCSVFSKGQADWTPILLLVP 316
G+ + + + + S G W + L++P
Sbjct: 215 SEGSKTGSERRKRTSSSENIRHKMQLSPENTNIQLPNGLMEGACVSPGGVSWRSLFLIIP 274
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
L LGL ++N Y+ L+ FT PQSLG++GGKP + Y +GV + +YLDPH QP +
Sbjct: 275 LRLGLNEINTVYMQRLKRCFTLPQSLGVIGGKPNHAHYFIGVLGDEMVYLDPHTTQPAAD 334
Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
I K D S +H + + + ++DPS+ +
Sbjct: 335 IDKWAFLQDES-FHCEHASRMPIKNLDPSIGL 365
>gi|212545090|ref|XP_002152699.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
marneffei ATCC 18224]
gi|210065668|gb|EEA19762.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
marneffei ATCC 18224]
Length = 489
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 112/355 (31%), Positives = 161/355 (45%), Gaps = 53/355 (14%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 175
F DF S+I ++YR F PI S+ TSD GWGCM+R
Sbjct: 153 FLDDFESKIWMTYRSNFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 212
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 234
S QML+A AL RLGR WR+ E ++L LF D +PFSIH ++ G Y G
Sbjct: 213 SGQMLLANALAISRLGRDWRRVSHT---TEENKLLSLFADDPAAPFSIHRFVRHGALYCG 269
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A +AL+ + M +YV S +D
Sbjct: 270 KHPGEWFGPSATATCIQALSEEYKVAG--------MNVYVSSDS---------TYVYEDK 312
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ + G P L+L+ LG++++ P Y L PQSLGI GG+P +S Y
Sbjct: 313 FKAVAYNQPGHM--RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQSLGIAGGRPSSSHY 370
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
+GVQ YLDPH +P + D + + H+ +R IH+D +DPS+ +GF
Sbjct: 371 FIGVQNSFFFYLDPHHTRPALPHKVDSAYTQEQVDSCHTRRLRRIHIDDMDPSMLVGFLI 430
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP---VNHSDVLGETGGVPEDD 464
RD++D+ D+ R + + E NG + + T P + L E + +DD
Sbjct: 431 RDENDWIDWKRRIAS-SREGNGKAIIHIIDTESVPTPTMEREAALDEVEALDDDD 484
>gi|312073335|ref|XP_003139474.1| hypothetical protein LOAG_03889 [Loa loa]
gi|307765357|gb|EFO24591.1| hypothetical protein LOAG_03889 [Loa loa]
Length = 458
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 123/422 (29%), Positives = 183/422 (43%), Gaps = 77/422 (18%)
Query: 93 RIHE---RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR-- 147
R+HE R+ + +S L +A++ AL D+ N + + F+SR
Sbjct: 11 RVHEEAKRLFADWKPAVSKMLETYLTLDPSFSVAENYALFDS--NLPIYLLGEKFTSRRD 68
Query: 148 -----------ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
+ +YRK F PIG T+D GWGCMLR QML+A+ L+ LGR W
Sbjct: 69 MERIKDIMASLLWFTYRKNFQPIGGIGPTTDQGWGCMLRCGQMLLARVLIVRHLGRNWL- 127
Query: 197 PLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR--- 248
+DR EY IL +F D + S FSIH + G + G G W GP +
Sbjct: 128 -----WDRDIKLAEYKRILRMFQDKKNSLFSIHQIAHMGVSEGKNIGEWFGPNTTAQVLK 182
Query: 249 ------SWEALARCQRAETGLGCQSL-PMAI----YVVSG----------DEDGERGGAP 287
W LA + L + MA Y SG D A
Sbjct: 183 KLVIYDQWSRLAVHVALDNVLITSDIRTMAFTRPPYRKSGSRRETGSDYNDNHDAVNPAE 242
Query: 288 VVCIDDASRH--------CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
+++R S + +W P+L+++PL LGL +N Y P ++ F P
Sbjct: 243 AEIFPESTRSPTRSETSSISSYGGNSEEWRPLLIIIPLRLGLSTINRCYFPAIQAFFQLP 302
Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG---------------KDDLEA 384
Q +GI+GG+P + Y G+ + + +YLDPH Q +++ K+D E
Sbjct: 303 QCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQDFVDLDETTATRDERDGYVEIKND-EF 361
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
STYH I +D +DPSLA+GF C +DD+++ R ++ PLF + +T
Sbjct: 362 RDSTYHCPFILTTKIDKVDPSLALGFLCHTEDDYNELAQRLRTHLLPASTPPLFEMLETR 421
Query: 445 KK 446
K
Sbjct: 422 PK 423
>gi|425778592|gb|EKV16710.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
digitatum PHI26]
Length = 401
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 123/421 (29%), Positives = 186/421 (44%), Gaps = 75/421 (17%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 139
+RI + P T + S IW LG + A + D A NN +
Sbjct: 9 KRIVQYFWDPEPTNNVPAAS-IWCLG--KEYAPPQPFSDPATNNPHSSSGQPDASTLNDT 65
Query: 140 -----FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWG 171
F DF SRI I+YR F PI +K TSD GWG
Sbjct: 66 AWPNAFVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGFTSDTGWG 125
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 230
CM+RS Q L+A A LGR WR+ + + E +++ +F D +PFSIH + G
Sbjct: 126 CMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIHKFVNRGA 182
Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
++ G G W GP A + + L+ A + +YV + D
Sbjct: 183 ESCGKYPGEWFGPSATAKCIQLLSTQSEAHR--------LRVYVTNDTSD---------V 225
Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
+D H S G P L+L+ LG+E V P Y LR T+PQS+GI GG+P
Sbjct: 226 YEDKFAHVSHDRSGCIQ--PTLILIGTRLGIENVTPAYWDGLRAALTYPQSVGIAGGRPS 283
Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAI 408
AS Y +G Q+ +LDPH +P D+L + + +Y++ +R IH+ +DPS+ I
Sbjct: 284 ASHYFLGAQDCHLFFLDPHTTRPATPYRPDELYTQEELDSYYTSRLRRIHIKDMDPSMLI 343
Query: 409 GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN---HSDVLGETGGVPEDDS 465
GF +D++D+ D+ K + + G P+ + +P N ++ L E + + D
Sbjct: 344 GFLIKDEEDWADW----KKRVQSTPGQPIVHMLPCQHQPDNGQGRAEALDEVEALDDSDE 399
Query: 466 L 466
+
Sbjct: 400 I 400
>gi|166990662|sp|A7F045.2|ATG4_SCLS1 RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
Length = 439
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/309 (33%), Positives = 150/309 (48%), Gaps = 49/309 (15%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF ++I ++YR F I S+ TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A ALL R+GR WR+ +R+ IL LF D +P+SIH ++ G A G
Sbjct: 163 GQSLLANALLTLRMGREWRRGSSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A ARC +A T +S + +Y+ D +D
Sbjct: 220 HPGEWFGP-------SAAARCIQALTNSQVES-ELRVYITGDGSD---------VYEDT- 261
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
S+ +TP L+LV LGL+K+ P Y L+ + PQS+GI GG+P +S Y
Sbjct: 262 -FMSIAKPNSTKFTPTLILVGTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYF 320
Query: 356 VGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
+GVQE YLDPH +P + +D D + H+ +R +H+ +DPS+ I F
Sbjct: 321 IGVQESDFFYLDPHQTRPALPFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDPSMLIAFLI 380
Query: 413 RDKDDFDDF 421
RD++D+ D+
Sbjct: 381 RDENDWKDW 389
>gi|14042153|dbj|BAB55127.1| unnamed protein product [Homo sapiens]
Length = 331
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 158/319 (49%), Gaps = 27/319 (8%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V+C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VLCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 292
Query: 456 ETGGVPEDDSLGVMSMNDA 474
+ G E + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309
>gi|403291503|ref|XP_003936827.1| PREDICTED: cysteine protease ATG4B [Saimiri boliviensis
boliviensis]
Length = 319
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 149/300 (49%), Gaps = 25/300 (8%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C DA+RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADANRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDSCFIPDESFHCQHPPC 232
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292
>gi|345564445|gb|EGX47408.1| hypothetical protein AOL_s00083g501 [Arthrobotrys oligospora ATCC
24927]
Length = 444
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 110/301 (36%), Positives = 160/301 (53%), Gaps = 45/301 (14%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-------------------ITSDVGWGCMLRSSQML 180
F DF ++ ++YR F PI S TSD GWGCM+RS Q +
Sbjct: 111 FLDDFDAKFWMTYRSAFPPIPLSTTSRNMTLATRIRSLADQEGFTSDTGWGCMIRSGQCV 170
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGS 239
+A A+ +LGR WR+ + P +E IL LF D +PFS+HN ++ G+A G+ G
Sbjct: 171 LANAISLLKLGRDWRRG-KSP--QEEQHILSLFADDPRAPFSLHNFVKYGEASCGVYPGE 227
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
W GP A R +ALA A+ G Q +Y+ +GD GG +DA R +
Sbjct: 228 WFGPSATARCIQALA----AQHDEGLQ-----VYI-TGD-----GGD---VYEDAFRKIA 269
Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
+ G + P L+LV + LG+E+V P Y L+ + PQS+GI GG+P AS Y +GVQ
Sbjct: 270 ISDDGV--FHPTLVLVGIRLGIERVTPVYWEALKSSLMMPQSVGIAGGRPSASHYFIGVQ 327
Query: 360 EESAIYLDPHDVQPVINIGKD-DLEADTSTY-HSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
+S YLDPH+ +P++ KD D A+ + H+ +R +HL +DPS+ + F RD D
Sbjct: 328 GQSLFYLDPHNTRPLLPYRKDSDYTAEEIEFCHTRKLRRLHLREMDPSMLLAFLIRDDRD 387
Query: 418 F 418
+
Sbjct: 388 W 388
>gi|298231125|ref|NP_001177213.1| cysteine protease ATG4C [Sus scrofa]
gi|296874486|gb|ADH81748.1| autophagy related 4-like protein C [Sus scrofa]
Length = 458
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 116/409 (28%), Positives = 174/409 (42%), Gaps = 80/409 (19%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKLLPARSGCTIKDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
+ S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQLEGSALTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPDALNIENSDSESWTSNTAK 155
Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGRYSDDREKQNEIYHRKIISWFGDSPLTLFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCASMAPDNTDDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
S IGFYCR+ DF +K+ + S+ PLFT H + + +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKSSSKEKYPLFTFVNAHSRDYDFT 429
>gi|296206033|ref|XP_002750034.1| PREDICTED: cysteine protease ATG4B isoform 2 [Callithrix jacchus]
Length = 319
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 148/300 (49%), Gaps = 25/300 (8%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C DA RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADADRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDSCFIPDESFHCQHPPC 232
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292
>gi|355757609|gb|EHH61134.1| Cysteine protease ATG4A, partial [Macaca fascicularis]
Length = 396
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 106/351 (30%), Positives = 168/351 (47%), Gaps = 43/351 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 190
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S HC W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 191 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 243
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 244 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 303
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ +++ ++DPS+A+ R D C + + + N +F + Q H
Sbjct: 304 PQRMNILNLDPSVALVGIRRLSGPGDTMCTVSPQEILKEN-LRMFELVQKH 353
>gi|209969827|ref|NP_001123274.2| autophagy-specific gene 4 [Nasonia vitripennis]
Length = 405
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 122/364 (33%), Positives = 177/364 (48%), Gaps = 31/364 (8%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD--SK 163
I + + +W+LG + +D + +D SR+ +YRKGF PIG S
Sbjct: 46 IPQTENSVWVLGKKYNAKKD-----------IDAIRRDIRSRLWFTYRKGFVPIGGFGST 94
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPF 221
TSD GWGCMLR QM++ QAL+ LGR WR P R Y+ IL F D +P+
Sbjct: 95 FTSDKGWGCMLRCGQMVLGQALISLHLGRDWR---WTPETRSSTYLNILRRFEDRRAAPY 151
Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
SIH + G + G G W GP + + + L + +L + V +
Sbjct: 152 SIHQIALMGASEGKDVGQWFGPNTIAQVLKKLVVYDDWSSITIHVALDNTLVVNDVVQQC 211
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
GA +D K + W P+LLL+PL LGL ++NP YI L+ +F FPQS
Sbjct: 212 RVEGATTAEVDGEKPL-----KAPSQWKPLLLLIPLRLGLNEINPIYINGLKTSFQFPQS 266
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV--INIGKDDLEADT-STYHSDVIRHIH 398
LG++GGKP + Y +G + I+LDPH Q ++ DD EA+ +TYH + I
Sbjct: 267 LGLIGGKPSHALYFIGYVGDEVIFLDPHTTQRAGSVDQKSDDNEAEVDATYHCKIASRIP 326
Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS---DVLG 455
+ +DPS+A+ F+C + DF C + PLF + Q ++P + S DV
Sbjct: 327 ITGMDPSVALCFFCATEKDFMSLCRLMQDELIGNEKQPLFELCQ--ERPASWSPAEDVAA 384
Query: 456 ETGG 459
E G
Sbjct: 385 EALG 388
>gi|239614382|gb|EEQ91369.1| cysteine protease atg4 [Ajellomyces dermatitidis ER-3]
gi|327351393|gb|EGE80250.1| cysteine protease atg4 [Ajellomyces dermatitidis ATCC 18188]
Length = 494
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 109/342 (31%), Positives = 162/342 (47%), Gaps = 54/342 (15%)
Query: 138 AEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCML 174
A F DF S+I ++YR F DP S +T +D GWGCM+
Sbjct: 117 AAFLDDFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMI 176
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
RS Q L+A AL LGR WR+ + +E +L LF D +PFSIH ++ G A
Sbjct: 177 RSGQSLLANALAILFLGREWRRGTKV---KEESNLLSLFADDPRAPFSIHRFVEHGASAC 233
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G G W GP A R +AL+ C+ + +YV S D +D
Sbjct: 234 GKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD---------VYED 276
Query: 294 ASRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
R ++ S G D P L+L+ + LG+++V P Y L+ +PQ++GI GG+P
Sbjct: 277 RFR--AIASGGGTGTSTDIRPTLILLGIRLGIDRVTPVYWEALKAVLKYPQAVGIAGGRP 334
Query: 350 GASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
+S Y +G Q YLDPH +P + + + + + +TYH+ +R +H+ +DPS
Sbjct: 335 SSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHTRRLRRLHIKDMDPS 394
Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
+ IGF RD+DD+D++ A NG + V P
Sbjct: 395 MLIGFLIRDEDDWDNWKRNVRGGAVTGNGKAIIHVFDKETSP 436
>gi|332232054|ref|XP_003265216.1| PREDICTED: cysteine protease ATG4C isoform 1 [Nomascus leucogenys]
gi|332232056|ref|XP_003265217.1| PREDICTED: cysteine protease ATG4C isoform 2 [Nomascus leucogenys]
Length = 458
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 177/421 (42%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIADHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +I+ FGDS +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLAPFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|149507363|ref|XP_001514370.1| PREDICTED: cysteine protease ATG4C [Ornithorhynchus anatinus]
Length = 459
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 114/404 (28%), Positives = 173/404 (42%), Gaps = 80/404 (19%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNN-----------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +E G +N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKSEEDDGIPVRSNWAPEDPAVISGNVDEFRKDFVSRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
P+G S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PPMGASGLTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPAALDMENSDSESWTSHTVK 155
Query: 198 -LQKPFDREYV--------------------------------EILHLFGDSETSPFSIH 224
L F+ +V +I+ FGDS + F +H
Sbjct: 156 KLTASFEASWVGERDPRPPSASRNAPRGSGSVRDEMRNEGFHRKIISWFGDSPRTYFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L + GK G AG W GP + R G + +YV
Sbjct: 216 QLTEYGKKSGKTAGDWYGPAVVAHILRKAVEEVRHPDLQG-----LTVYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G+ D +L+LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVTDKLRASTDSGKTDDKAVLILVPVRLGGERTNIDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
S +GFYCR+ DF+ +K+ + S+ PLFT + H +
Sbjct: 381 SCTVGFYCRNVQDFERASEEITKVLKASSKEKYPLFTFVKGHSR 424
>gi|261195783|ref|XP_002624295.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
gi|239587428|gb|EEQ70071.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
Length = 494
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 109/342 (31%), Positives = 162/342 (47%), Gaps = 54/342 (15%)
Query: 138 AEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCML 174
A F DF S+I ++YR F DP S +T +D GWGCM+
Sbjct: 117 AAFLDDFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMI 176
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
RS Q L+A AL LGR WR+ + +E +L LF D +PFSIH ++ G A
Sbjct: 177 RSGQSLLANALAILFLGREWRRGTKV---KEESNLLSLFADDPRAPFSIHRFVEHGASAC 233
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G G W GP A R +AL+ C+ + +YV S D +D
Sbjct: 234 GKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD---------VYED 276
Query: 294 ASRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
R ++ S G D P L+L+ + LG+++V P Y L+ +PQ++GI GG+P
Sbjct: 277 RFR--AIASGGGTGTSTDIRPTLILLGIRLGIDRVTPVYWEALKAVLKYPQAVGIAGGRP 334
Query: 350 GASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
+S Y +G Q YLDPH +P + + + + + +TYH+ +R +H+ +DPS
Sbjct: 335 SSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHTRRLRRLHIKDMDPS 394
Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
+ IGF RD+DD+D++ A NG + V P
Sbjct: 395 MLIGFLIRDEDDWDNWKRNLRGGAVTGNGKAIIHVFDKETSP 436
>gi|317151014|ref|XP_001824388.2| cysteine protease atg4 [Aspergillus oryzae RIB40]
Length = 402
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 169/382 (44%), Gaps = 66/382 (17%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 134
+RI + + P + IW LGV + KI QDE + D +
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 70
Query: 135 NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 171
F DF S+I ++YR F PI TSD GWG
Sbjct: 71 GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 130
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 230
CM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH ++ G
Sbjct: 131 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 187
Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
++ G G W GP A R EAL+ C ++ +YV + D V
Sbjct: 188 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 231
Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
D R V G P L+L+ LG++ V P Y L+ PQS+GI GG+P
Sbjct: 232 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 288
Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLA 407
AS Y +G Q YLDPH +P + D + + STYH+ +R IH+ +DPS+
Sbjct: 289 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDMDPSML 348
Query: 408 IGFYCRDKDDFDDFCARASKLA 429
IGF R++DD++D+ R +
Sbjct: 349 IGFLVRNEDDWEDWKGRVGSVV 370
>gi|295657177|ref|XP_002789160.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226284504|gb|EEH40070.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 601
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 111/345 (32%), Positives = 165/345 (47%), Gaps = 51/345 (14%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVG 169
G++ A F DF S+I ++YR GF DP S +T +D G
Sbjct: 229 GHDWPAPFLDDFESKIWLTYRSGFPFIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 288
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A AL LGR WR+ + D+E +L LF D +PFSIH ++
Sbjct: 289 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 345
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A R +AL+ C+ + +YV S D
Sbjct: 346 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 389
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+D R + +A P L+L+ + LG+++V P Y L+ +PQS+GI GG+
Sbjct: 390 -VYEDRFRTIASGGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 448
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 405
P +S Y +G Q YLDPH +P + G+ E + ++YH+ +R +H+ +DPS
Sbjct: 449 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 508
Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNH 450
+ IGF +D+DD+ D+ +A G + V P H
Sbjct: 509 MLIGFLIKDEDDWADWKRNVGSVA----GKAIVHVFDKENSPFGH 549
>gi|198417051|ref|XP_002128504.1| PREDICTED: similar to autophagy-related cysteine endopeptidase 2
[Ciona intestinalis]
Length = 422
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 168/359 (46%), Gaps = 58/359 (16%)
Query: 112 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWG 171
+IW+LG + + AL F + S + +YRKG+ PIG + TSD GWG
Sbjct: 39 NIWVLGSRFHLPHERAL-----------FLEHIKSFLWFTYRKGYTPIGGTGPTSDSGWG 87
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
CMLR QML+A+AL + + W+ KP Y ILH D +S +SIH + Q G
Sbjct: 88 CMLRCGQMLLARALAELTMDKDWKWTEDKPQPPPYKRILHQLSDERSSCYSIHQIAQMGV 147
Query: 232 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
G G W GP + + L++ + +AI+V + VCI
Sbjct: 148 EEGKEVGQWFGPNTISQVLRRLSQFDQENV--------LAIHVAMDN---------TVCI 190
Query: 292 DDASRHCSVFSKGQAD----------------------------WTPILLLVPLVLGLEK 323
+D R CS Q + W P+LLL+PL LGL +
Sbjct: 191 EDIERLCSTTPTTQYEGACSSTCKPDRTKCNGDSPNVSPTSDDFWRPLLLLIPLRLGLSE 250
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG--KDD 381
+NP Y L+ + +S+G++GGKP + Y +G E+S I+LDPH QP + + +
Sbjct: 251 INPVYFTHLKECLHWKESVGVIGGKPNHAYYFLGCSEDSMIFLDPHTTQPYVKLPDITSN 310
Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
D +T+H D + L ++DPSLA+GF C + F D C + ++ + PLF V
Sbjct: 311 ERYDDTTFHCDTPGRMLLTNLDPSLALGFICTTRGSFCDLCHKVKQMVKTPTSFPLFEV 369
>gi|348586836|ref|XP_003479174.1| PREDICTED: cysteine protease ATG4C-like [Cavia porcellus]
Length = 435
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 125/437 (28%), Positives = 190/437 (43%), Gaps = 82/437 (18%)
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQ 142
+H R + ++T S + S + LLG C+ ++ A+ D + EF +
Sbjct: 1 MHTRWVLKTKTYFSRN-SPVLLLGKCYHFKYEDEHKMLTARSGCAIEDRVIAGNVDEFRK 59
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--------- 193
DF SRI ++YR+ F PI S +++D GWGC LR+ QML+AQ L+ H LGR
Sbjct: 60 DFISRIWLTYREEFPPIEGSALSTDCGWGCTLRTGQMLLAQGLVLHFLGRAWIWPDALNI 119
Query: 194 -------WRKPLQKPFD--------------------REYVE----------------IL 210
W K F +E +E I+
Sbjct: 120 ENLDSESWTSHTVKKFAASFEASLSGERQLGTPALSLKETMEKYPNPHEVRDEVYHRKII 179
Query: 211 HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
FGDS ++ F +H L++ G+ G AG W GP + R G +
Sbjct: 180 SWFGDSPSALFGLHQLIECGRRSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----I 234
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
+YV +D + V+ ASR G AD +++LVP+ LG E+ N Y+
Sbjct: 235 TVYVA---QDCTVYNSDVIDKQSASR-----PAGNADDKAVIILVPVRLGGERTNTDYLE 286
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
++ + +GI+GGKP S Y G Q++S IY+DPH Q +++ D + T+H
Sbjct: 287 FVKGVLSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFH 344
Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPV 448
+ + +DPS IGFYCR+ DF +K+ + S+ PLFT H K
Sbjct: 345 CPSPKKMSFRKMDPSCTIGFYCRNVQDFQRASEEITKMLKMSSKEKYPLFTFVHGHSKDY 404
Query: 449 NH-SDVLGETGGVPEDD 464
+ S V E +DD
Sbjct: 405 DFTSTVANEEDLFSQDD 421
>gi|238506146|ref|XP_002384275.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
NRRL3357]
gi|220690389|gb|EED46739.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
NRRL3357]
Length = 439
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 169/382 (44%), Gaps = 66/382 (17%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 134
+RI + + P + IW LGV + KI QDE + D +
Sbjct: 47 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 106
Query: 135 NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 171
F DF S+I ++YR F PI TSD GWG
Sbjct: 107 GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 166
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 230
CM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH ++ G
Sbjct: 167 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 223
Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
++ G G W GP A R EAL+ C ++ +YV + D V
Sbjct: 224 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 267
Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
D R V G P L+L+ LG++ V P Y L+ PQS+GI GG+P
Sbjct: 268 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 324
Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLA 407
AS Y +G Q YLDPH +P + D + + STYH+ +R IH+ +DPS+
Sbjct: 325 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDMDPSML 384
Query: 408 IGFYCRDKDDFDDFCARASKLA 429
IGF R++DD++D+ R +
Sbjct: 385 IGFLVRNEDDWEDWKGRVGSVV 406
>gi|226294409|gb|EEH49829.1| cysteine protease atg4 [Paracoccidioides brasiliensis Pb18]
Length = 513
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 158/324 (48%), Gaps = 47/324 (14%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVG 169
G++ A F DF S+I ++YR GF I S T+D G
Sbjct: 143 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 202
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A AL LGR WR+ + D+E +L LF D +PFSIH ++
Sbjct: 203 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 259
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A R +AL+ C+ + +YV S D
Sbjct: 260 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 303
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+D R + +A P L+L+ + LG+++V P Y L+ +PQS+GI GG+
Sbjct: 304 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 362
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 405
P +S Y +G Q YLDPH +P + G+ E + ++YH+ +R +H+ +DPS
Sbjct: 363 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 422
Query: 406 LAIGFYCRDKDDFDDFCARASKLA 429
+ IGF +D+DD+ D+ +A
Sbjct: 423 MLIGFLIKDEDDWADWKRNVGSVA 446
>gi|119591685|gb|EAW71279.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 331
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 157/319 (49%), Gaps = 27/319 (8%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292
Query: 456 ETGGVPEDDSLGVMSMNDA 474
+ G E + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309
>gi|149709514|ref|XP_001500964.1| PREDICTED: cysteine protease ATG4C [Equus caballus]
Length = 458
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 117/422 (27%), Positives = 179/422 (42%), Gaps = 80/422 (18%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF+SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHIIAGNVEEFRKDFTSRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDFESWTSNTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSEERELKTPTISLKETIGRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCASMASDHADDKAVIILVPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHSDVLGETGGVPE 462
S IGFYCR+ DF +K+ + S+ PLFT H + + + + +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTAAKEDDLFS 440
Query: 463 DD 464
+D
Sbjct: 441 ED 442
>gi|326470473|gb|EGD94482.1| hypothetical protein TESG_01998 [Trichophyton tonsurans CBS 112818]
Length = 469
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 108/328 (32%), Positives = 159/328 (48%), Gaps = 62/328 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 170
+F DF S++ I+YR F PI + TSD GW
Sbjct: 130 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 189
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
GCM+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 190 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 246
Query: 231 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 288
A G G W GP A + +AL + + GL +Y+ S G + E+ V
Sbjct: 247 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 298
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
C + P L+L+ + LG+++V P Y +L+ FPQS+GI GG+
Sbjct: 299 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 346
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHL 399
P +S Y + Q +S YLDPH +P + + D E+ + STYH+ +R +H+
Sbjct: 347 PSSSHYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHI 406
Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARASK 427
+DPS+ IGF RD+DD++D R +
Sbjct: 407 REMDPSMLIGFLVRDEDDWEDLKRRVRE 434
>gi|326478657|gb|EGE02667.1| cysteine protease atg4 [Trichophyton equinum CBS 127.97]
Length = 454
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 108/328 (32%), Positives = 159/328 (48%), Gaps = 62/328 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 170
+F DF S++ I+YR F PI + TSD GW
Sbjct: 115 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 174
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
GCM+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 175 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 231
Query: 231 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 288
A G G W GP A + +AL + + GL +Y+ S G + E+ V
Sbjct: 232 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 283
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
C + P L+L+ + LG+++V P Y +L+ FPQS+GI GG+
Sbjct: 284 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 331
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHL 399
P +S Y + Q +S YLDPH +P + + D E+ + STYH+ +R +H+
Sbjct: 332 PSSSHYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHI 391
Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARASK 427
+DPS+ IGF RD+DD++D R +
Sbjct: 392 REMDPSMLIGFLVRDEDDWEDLKRRVRE 419
>gi|426215654|ref|XP_004002085.1| PREDICTED: cysteine protease ATG4C isoform 1 [Ovis aries]
gi|426215656|ref|XP_004002086.1| PREDICTED: cysteine protease ATG4C isoform 2 [Ovis aries]
Length = 458
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 117/409 (28%), Positives = 175/409 (42%), Gaps = 80/409 (19%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
S IGFYCR+ DF +K+ + S+ PLFT H + + +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFT 429
>gi|83773128|dbj|BAE63255.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|325504923|dbj|BAJ83603.1| cysteine protease Atg4 [Aspergillus oryzae]
Length = 356
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 111/342 (32%), Positives = 159/342 (46%), Gaps = 32/342 (9%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 151
+RI + + P + IW LGV + + + +N A + RI
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70
Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 211
DP G TSD GWGCM+RS Q L+A A+L LGR WR+ + E +L
Sbjct: 71 L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121
Query: 212 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
LF D +P SIH ++ G ++ G G W GP A R EAL+ C ++
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
+YV + D V D R V G P L+L+ LG++ V P Y
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 387
L+ PQS+GI GG+P AS Y +G Q YLDPH +P + D + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
TYH+ +R IH+ +DPS+ IGF R++DD++D+ R +
Sbjct: 283 TYHTRRLRRIHIQDMDPSMLIGFLVRNEDDWEDWKGRVGSVV 324
>gi|149037474|gb|EDL91905.1| autophagy-related 4B (yeast), isoform CRA_b [Rattus norvegicus]
Length = 319
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 150/300 (50%), Gaps = 25/300 (8%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E + A
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEISKLCRAS 112
Query: 288 VVCIDDAS------RHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
+ C A+ RHC+ G W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 LPCAGAAALSMESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
F PQSLG++GGKP ++ Y +G E IYLDPH QP + + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPC 232
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
+ + +DPS+A+GF+C+ ++DF+D+C + KL++ P+F + + + DVL
Sbjct: 233 RMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 292
>gi|148698945|gb|EDL30892.1| autophagy-related 4C (yeast), isoform CRA_b [Mus musculus]
Length = 466
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/409 (29%), Positives = 174/409 (42%), Gaps = 80/409 (19%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 44 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 103
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 104 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 163
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F DRE + +I+ FGDS + F +H
Sbjct: 164 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 223
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 224 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 271
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 272 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 330
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 331 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 388
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
S IGFYCR+ DF+ +K+ + S+ PLFT H K + +
Sbjct: 389 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 437
>gi|225543220|ref|NP_778194.3| cysteine protease ATG4C [Mus musculus]
gi|225543224|ref|NP_001139439.1| cysteine protease ATG4C [Mus musculus]
gi|341940254|sp|Q811C2.2|ATG4C_MOUSE RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
cysteine endopeptidase; AltName: Full=Autophagin-3;
AltName: Full=Autophagy-related cysteine endopeptidase
3; AltName: Full=Autophagy-related protein 4 homolog C
Length = 458
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/409 (29%), Positives = 174/409 (42%), Gaps = 80/409 (19%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
S IGFYCR+ DF+ +K+ + S+ PLFT H K + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429
>gi|431912280|gb|ELK14417.1| Cysteine protease ATG4B [Pteropus alecto]
Length = 431
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 124/406 (30%), Positives = 185/406 (45%), Gaps = 43/406 (10%)
Query: 91 MRRIHERVLGPSRTGISSSTSDI--WLLGVCHKIAQDEALGDAAGNNGLA--EFNQDFSS 146
MR R P R+ +SS+ + W +++ L + E D +S
Sbjct: 1 MRPGPRRSCTPRRSALSSTLGEASDWCTAAAREVSAVSGLSQLQQDESYEKDEILSDVAS 60
Query: 147 RILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY 206
R+ +YRK F IG + TSD GWGCMLR QM+ AQAL+ LGR WR +K Y
Sbjct: 61 RLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSY 120
Query: 207 VEILHLFGDSETSPFSIHNLL------QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
+L F D + S +SIH + + + S +GP +C+S+ A+ +R
Sbjct: 121 FSVLRAFMDRKDSYYSIHQIAPVHPQSRFWRQSASVRTSVLGP-QLCQSFAAVRLSRRRR 179
Query: 261 TGLGCQSLP--MAIYVVSGDEDGERGGAPVVCIDD--ASRHCSVFSKG--------QADW 308
L S P +A++ V ++D A RHC+ G W
Sbjct: 180 WELVTLSSPGKLAVFDTWSALAVHIAMDNTVVMEDISADRHCNGVPAGAEVTHRPPLPPW 239
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLT-------------------FTFPQSLGIVGGKP 349
P++LL+PL LGL +N Y+ TL+L F PQSLG++GGKP
Sbjct: 240 RPLVLLIPLRLGLTDINEAYVGTLKLASTLVGLCSAAASLPLRQHCFMMPQSLGVIGGKP 299
Query: 350 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 409
++ Y +G E IYLDPH QP + + D S + + + +DPS+A G
Sbjct: 300 NSAHYFIGYVGEELIYLDPHTTQPAVEVADRRSIPDESFHCQHPPSRMRIGELDPSIA-G 358
Query: 410 FYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
F+C+ +DDFDD+C + KL+ P+F + + + DVL
Sbjct: 359 FFCQTEDDFDDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 404
>gi|296208133|ref|XP_002750954.1| PREDICTED: cysteine protease ATG4C [Callithrix jacchus]
Length = 458
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 118/421 (28%), Positives = 178/421 (42%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +++ FGDS +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEIRNEIYHRKVISWFGDSPLAPFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLQFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|391868733|gb|EIT77943.1| cysteine protease required for autophagy - Apg4p/Aut2p [Aspergillus
oryzae 3.042]
Length = 357
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/342 (32%), Positives = 159/342 (46%), Gaps = 32/342 (9%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 151
+RI + + P + IW LGV + + + +N A + RI
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70
Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 211
DP G TSD GWGCM+RS Q L+A A+L LGR WR+ + E +L
Sbjct: 71 L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121
Query: 212 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
LF D +P SIH ++ G ++ G G W GP A R EAL+ C ++
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
+YV + D V D R V G P L+L+ LG++ V P Y
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 387
L+ PQS+GI GG+P AS Y +G Q YLDPH +P + D + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPYFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
TYH+ +R IH+ +DPS+ IGF R++DD++D+ R +
Sbjct: 283 TYHTRRLRRIHIQDMDPSMLIGFLVRNEDDWEDWKGRVGSVV 324
>gi|27763971|emb|CAC85555.1| Apg4-C protein [Mus musculus]
gi|148698944|gb|EDL30891.1| autophagy-related 4C (yeast), isoform CRA_a [Mus musculus]
Length = 458
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/409 (29%), Positives = 174/409 (42%), Gaps = 80/409 (19%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
S IGFYCR+ DF+ +K+ + S+ PLFT H K + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429
>gi|296489147|tpg|DAA31260.1| TPA: APG4 autophagy 4 homolog C [Bos taurus]
Length = 458
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKMERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|301764643|ref|XP_002917740.1| PREDICTED: cysteine protease ATG4C-like [Ailuropoda melanoleuca]
gi|281350282|gb|EFB25866.1| hypothetical protein PANDA_006093 [Ailuropoda melanoleuca]
Length = 458
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/409 (28%), Positives = 171/409 (41%), Gaps = 80/409 (19%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 224
QK R Y + I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTVSQKETIRRYSDDHEMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEETRHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + D +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
S IGFYCR+ DF +K+ + S+ PLFT H + + +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFT 429
>gi|194384462|dbj|BAG59391.1| unnamed protein product [Homo sapiens]
Length = 319
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 148/300 (49%), Gaps = 25/300 (8%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDSTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 292
>gi|325091702|gb|EGC45012.1| cysteine protease [Ajellomyces capsulatus H88]
Length = 508
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 173/382 (45%), Gaps = 57/382 (14%)
Query: 101 PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 156
P+R+ S++ LL H+ + LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144
Query: 157 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
DP + T+D GWGCM+RS Q L+A AL LGR WR+
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRR 204
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 255
+ +E ++L LF D +PFSIH ++ G A G G W GP A R +AL+
Sbjct: 205 GTKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATARCIQALSS 261
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG------QADWT 309
C+ + +YV S D +D R ++ S G D
Sbjct: 262 --------ECEHAGLNVYVTSDGSD---------VYEDRFR--AIASAGGTGAGTSTDVH 302
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q YLDPH
Sbjct: 303 PTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFYLDPH 362
Query: 370 DVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
+P + + + +TYH+ +R +H+ +DPS+ IGF RD+DD++ +
Sbjct: 363 HTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSWKRSV 422
Query: 426 SKLAEESNGAPLFTVTQTHKKP 447
A G + V K P
Sbjct: 423 HNRAMIGTGKAIIHVFDKEKSP 444
>gi|440902657|gb|ELR53425.1| Cysteine protease ATG4C [Bos grunniens mutus]
Length = 458
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIHHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|426339167|ref|XP_004033531.1| PREDICTED: cysteine protease ATG4B isoform 1 [Gorilla gorilla
gorilla]
gi|426339169|ref|XP_004033532.1| PREDICTED: cysteine protease ATG4B isoform 2 [Gorilla gorilla
gorilla]
gi|221045722|dbj|BAH14538.1| unnamed protein product [Homo sapiens]
Length = 319
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 148/300 (49%), Gaps = 25/300 (8%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
+ + +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292
>gi|342321655|gb|EGU13587.1| Cysteine protease ATG4 [Rhodotorula glutinis ATCC 204091]
Length = 1119
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 134/455 (29%), Positives = 185/455 (40%), Gaps = 131/455 (28%)
Query: 134 NNGLAEFNQDFSSRILISYRKGF-----DPIGDSK------------------------- 163
N A F D SRI ++YR GF DP S
Sbjct: 644 NGWPAAFYHDSYSRIALTYRSGFPIIPCDPSSSSTGVVQGMLNNLSMSIGRGGHRGPSPT 703
Query: 164 -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-----------KPFDREYV 207
++SD GWGCMLR+ Q L+A AL+ LGR WR+PL P Y
Sbjct: 704 NAEGGLSSDTGWGCMLRTGQSLLANALVKVHLGRDWRRPLPLGDFITSSTSPVPSAATYA 763
Query: 208 EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
IL LF D S SPFS+H Q GK G G W GP + + L
Sbjct: 764 RILSLFLDDPSPISPFSVHRFAQQGKVLGKEIGEWFGPSTAAGAIKTLVNAYE------- 816
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---W-TPILLLVPLVLGL 321
P + VVS C+D V + D W TP+L+L+ + LG+
Sbjct: 817 ---PAGLKVVS-------------CVDGTVYESEVVAASTKDGEKWKTPVLVLINVRLGI 860
Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN--IGK 379
+ VNP Y ++ F PQS+GI GG+P +S Y VG Q S Y+DPH +P + +
Sbjct: 861 DGVNPIYYEAIKGIFRLPQSVGIAGGRPSSSYYFVGAQANSLFYIDPHHPRPAVPLVLPP 920
Query: 380 DD-------------LEADT----------------------STYHSDVIRHIHLDSIDP 404
DD ADT +TYH+D +R L S+DP
Sbjct: 921 DDSLVRAAQHLPLTPSTADTPAKESARQLDDFLLAAYPDAAWATYHTDKVRKCALSSLDP 980
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS---------DVLG 455
S+ +GF D+ D+ DF R +L++ S+ P+F + + + S L
Sbjct: 981 SMLLGFLVEDERDWQDFRLRVQELSQASS--PIFAIAPSPPSWMRRSTSSAAPATVSALS 1038
Query: 456 ETGGVPEDDSLGVMSMN-----DAVGNAHEDDWQL 485
T G DDS ++ D+ G + +DW+L
Sbjct: 1039 PTIG---DDSFSEVAGEDVADADSAGFSEPEDWEL 1070
>gi|395733089|ref|XP_002813143.2| PREDICTED: cysteine protease ATG4B isoform 2 [Pongo abelii]
Length = 331
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 156/319 (48%), Gaps = 27/319 (8%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRNS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
+ + +DPS+A+GF+C+ +DDF D+C + KL+ P+F + + + DVL
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292
Query: 456 ETGGVPEDDSLGVMSMNDA 474
+ G E + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309
>gi|121704590|ref|XP_001270558.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
gi|166990611|sp|A1CJ08.1|ATG4_ASPCL RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|119398704|gb|EAW09132.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
Length = 400
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 150/313 (47%), Gaps = 49/313 (15%)
Query: 139 EFNQDFSSRILISYRKGFDPIG----------------------DSK-ITSDVGWGCMLR 175
EF D SRI I+YR F PI DS+ TSD GWGCM+R
Sbjct: 75 EFLDDVESRIWITYRSNFTPIPKPPNQEANPAMTLTVHLRSQLMDSQGFTSDTGWGCMIR 134
Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 234
S Q L+A A+L LGR WR+ + + ++LH F D +PFSIH +Q G +
Sbjct: 135 SGQSLLANAMLILLLGRDWRRGTEA---GKEAQLLHQFADHPEAPFSIHRFVQHGAEFCN 191
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R +AL A+ G S + +Y+ D + D
Sbjct: 192 KYPGEWFGPSATARCIQALV----AQQG----SSELRVYITDDTAD--------IYEDKF 235
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+R + D+ P L+LV LG++ V P Y L+ PQS+GI GG+P AS Y
Sbjct: 236 AR---IAQAEHGDFIPTLILVGTRLGIDHVTPAYWDALKEALQLPQSVGIAGGRPSASHY 292
Query: 355 IVGVQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
+GV + YLDPH +P ++ + +TYH+ +R IH+ +DPS+ IGF
Sbjct: 293 FIGVHGQYLFYLDPHHTRPASLHQDVNDTLTHEEVNTYHTRRLRRIHIKDMDPSMLIGFI 352
Query: 412 CRDKDDFDDFCAR 424
R ++D+ D+ R
Sbjct: 353 IRSREDWTDWKTR 365
>gi|37748391|gb|AAH58981.1| Autophagy-related 4C (yeast) [Mus musculus]
Length = 458
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 118/409 (28%), Positives = 174/409 (42%), Gaps = 80/409 (19%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SR+ ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRLWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
S IGFYCR+ DF+ +K+ + S+ PLFT H K + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429
>gi|166990665|sp|Q2U5B0.2|ATG4_ASPOR RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
Length = 407
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 120/387 (31%), Positives = 169/387 (43%), Gaps = 71/387 (18%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA-----------QDE------ALG 129
+RI + + P + IW LGV + KI QDE +
Sbjct: 11 KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPGKLGNYQDELEAGTSKID 70
Query: 130 DAAGNNGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITS 166
D + F DF S+I ++YR F PI TS
Sbjct: 71 DVTAHGWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTS 130
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCM+RS Q L+A A+L LGR WR+ + E +L LF D +P SIH
Sbjct: 131 DTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRF 187
Query: 227 LQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
++ G ++ G G W GP A R EAL+ C ++ +YV + D
Sbjct: 188 VKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD----- 234
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V D R V G P L+L+ LG++ V P Y L+ PQS+GI
Sbjct: 235 ---VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIA 288
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSI 402
GG+P AS Y +G Q YLDPH +P + D + + STYH+ +R IH+ +
Sbjct: 289 GGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDM 348
Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLA 429
DPS+ IGF R++DD++D+ R +
Sbjct: 349 DPSMLIGFLVRNEDDWEDWKGRVGSVV 375
>gi|417401291|gb|JAA47536.1| Putative cysteine protease required for autophagy [Desmodus
rotundus]
Length = 458
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 117/404 (28%), Positives = 172/404 (42%), Gaps = 80/404 (19%)
Query: 108 SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C H +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----------------LQ 199
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P ++
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 200 K-----------------------------PFDRE------YVEILHLFGDSETSPFSIH 224
K P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGRYPDDREMQNEVYHRKIISWFGDSPVALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQRASMTSDNTDGKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
S IGFYCR+ DF +K+ + S+ PLFT H +
Sbjct: 381 SCTIGFYCRNVQDFQRASEEITKMLKISSKEKYPLFTFVNGHSR 424
>gi|340709295|ref|XP_003393246.1| PREDICTED: cysteine protease ATG4B-like isoform 1 [Bombus
terrestris]
Length = 383
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 112/323 (34%), Positives = 163/323 (50%), Gaps = 24/323 (7%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 16 IPQTDEPVWILGKKYNAIRE-----------LDIIRRDIRSKLWFTYRKNFVPIGGYNST 64
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
TSD GWGCMLR QM++ QAL+ LGR W+ + + Y++IL F D T+ FSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWTAETR-NSTYLKILERFEDKRTAAFSI 123
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G + G G W GP + + + L + +L + V +
Sbjct: 124 HQIASMGASEGKEVGQWFGPNTIAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
G V D A V K + W P+LLL+PL LGL ++NP YI L+ +F PQSLG
Sbjct: 184 EGGTTVEADGA-----VPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLG 238
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHL 399
++GGKP + Y +G E IYLDPH Q ++GK +++E D +TYH I +
Sbjct: 239 VIGGKPNLALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPI 297
Query: 400 DSIDPSLAIGFYCRDKDDFDDFC 422
IDPS+A+ F+C + DF C
Sbjct: 298 TGIDPSVALCFFCATEKDFKSLC 320
>gi|397475554|ref|XP_003809200.1| PREDICTED: cysteine protease ATG4C isoform 1 [Pan paniscus]
gi|397475556|ref|XP_003809201.1| PREDICTED: cysteine protease ATG4C isoform 2 [Pan paniscus]
Length = 458
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 120/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF +K+ E S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLEFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|391335597|ref|XP_003742176.1| PREDICTED: cysteine protease ATG4B-like [Metaseiulus occidentalis]
Length = 393
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 163/307 (53%), Gaps = 33/307 (10%)
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW------R 195
+ FSS + +YRK F IG TSD GWGCMLR+ QM++ QAL+ LGR W R
Sbjct: 79 KSFSSMLWFTYRKNFAAIGGDGPTSDTGWGCMLRAGQMMLGQALIRKHLGRSWMWTSDDR 138
Query: 196 KPLQKPFDRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 254
P DRE Y+ IL +F D +++ FSIH + G + G A G W GP + ++ + L
Sbjct: 139 LP-----DRENYLRILRMFQDKKSATFSIHQISLMGLSEGKAVGEWFGPNTVAQALKKLV 193
Query: 255 RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLL 314
+ M ++V + ++ + D C +K W P+LL+
Sbjct: 194 QYDHWS--------EMKLHVAMDN---------IIILSDIKSLCC--AKESNKWRPLLLV 234
Query: 315 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
VPL LGL ++N Y + +F SLGI+GG+P + Y +G+Q E ++LDPH
Sbjct: 235 VPLRLGLSEINDIYTNAVLNSFKMKHSLGIIGGRPSHALYFIGIQREELVFLDPHTTHNY 294
Query: 375 INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG 434
+++ D+ + STYH + + + ++DPS+A+ FY D+D+ D + +A +L +++G
Sbjct: 295 VDL--DEEPYNDSTYHCQRAQRMKISNMDPSIAMCFYIGDEDELDQWRVQAKELLVDNSG 352
Query: 435 APLFTVT 441
LF +T
Sbjct: 353 HMLFEIT 359
>gi|350425106|ref|XP_003494013.1| PREDICTED: cysteine protease ATG4B-like [Bombus impatiens]
Length = 383
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 112/323 (34%), Positives = 163/323 (50%), Gaps = 24/323 (7%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 16 IPQTDEPVWILGKKYNAIRE-----------LDIIRRDIRSKLWFTYRKNFVPIGGYNST 64
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
TSD GWGCMLR QM++ QAL+ LGR W+ + + Y++IL F D T+ FSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWTAETR-NSTYLKILERFEDKRTAAFSI 123
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G + G G W GP + + + L + +L + V +
Sbjct: 124 HQIASMGASEGKEVGQWFGPNTIAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
G V D A V K + W P+LLL+PL LGL ++NP YI L+ +F PQSLG
Sbjct: 184 EGGTTVEADGA-----VPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLG 238
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHL 399
++GGKP + Y +G E IYLDPH Q ++GK +++E D +TYH I +
Sbjct: 239 VIGGKPNLALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPI 297
Query: 400 DSIDPSLAIGFYCRDKDDFDDFC 422
IDPS+A+ F+C + DF C
Sbjct: 298 TGIDPSVALCFFCATEKDFKSLC 320
>gi|340709297|ref|XP_003393247.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Bombus
terrestris]
Length = 386
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 112/323 (34%), Positives = 163/323 (50%), Gaps = 24/323 (7%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 19 IPQTDEPVWILGKKYNAIRE-----------LDIIRRDIRSKLWFTYRKNFVPIGGYNST 67
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
TSD GWGCMLR QM++ QAL+ LGR W+ + + Y++IL F D T+ FSI
Sbjct: 68 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWTAETR-NSTYLKILERFEDKRTAAFSI 126
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G + G G W GP + + + L + +L + V +
Sbjct: 127 HQIASMGASEGKEVGQWFGPNTIAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 186
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
G V D A V K + W P+LLL+PL LGL ++NP YI L+ +F PQSLG
Sbjct: 187 EGGTTVEADGA-----VPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLG 241
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHL 399
++GGKP + Y +G E IYLDPH Q ++GK +++E D +TYH I +
Sbjct: 242 VIGGKPNLALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPI 300
Query: 400 DSIDPSLAIGFYCRDKDDFDDFC 422
IDPS+A+ F+C + DF C
Sbjct: 301 TGIDPSVALCFFCATEKDFKSLC 323
>gi|74147895|dbj|BAE22307.1| unnamed protein product [Mus musculus]
Length = 458
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 118/409 (28%), Positives = 174/409 (42%), Gaps = 80/409 (19%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F DRE + +I+ FG+S + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGNSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
S IGFYCR+ DF+ +K+ + S+ PLFT H K + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKILKISSKEKYPLFTFVNGHSKDFDFT 429
>gi|378731837|gb|EHY58296.1| autophagy-like protein 4 [Exophiala dermatitidis NIH/UT8656]
Length = 480
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 153/328 (46%), Gaps = 56/328 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLRSS 177
F DF SRI ++YR F PI S+ TSD GWGCM+RS
Sbjct: 114 FLDDFESRIWMTYRSNFTPIPRSQEPSRASSMSFSVRLRNLTEREGFTSDTGWGCMIRSG 173
Query: 178 QMLVAQALLFHRLGRPWRK-------------PLQKPFDREYVEILHLFGDSETSPFSIH 224
Q L+A L+ LGR WR+ + EIL LF DS +PFSIH
Sbjct: 174 QSLLANTLMLLHLGRDWRRDHTHTPTTSDSKPSSSSSSTKREAEILSLFADSPDAPFSIH 233
Query: 225 NLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
+Q G A G G W GP A A C R E C + + +YV +
Sbjct: 234 RFVQHGASACGKHPGQWFGP-------SATASCIR-ELSTECAAAGLRVYVTPSASE--- 282
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
+D R + S P L+L + LGL+++ P Y L+ + T+PQS+G
Sbjct: 283 ------LYEDRFRSIAAASPSDPTIKPTLILFGIRLGLDRITPVYHEALKSSLTYPQSIG 336
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLD 400
I GG+P +S Y VG Q + YLDPH+ +P + D E + +T H+ +R + ++
Sbjct: 337 IAGGRPSSSHYFVGCQGDLFFYLDPHETRPALPHHASPADYSEEEIATCHTRRLRGLRIN 396
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKL 428
+DPS+ IGF +D+ D++D+ R ++
Sbjct: 397 EMDPSMLIGFLIKDEADWEDWKRRIKEV 424
>gi|126723748|ref|NP_001075911.1| cysteine protease ATG4C [Bos taurus]
gi|126010621|gb|AAI33599.1| ATG4C protein [Bos taurus]
Length = 458
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 175/421 (41%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
L+ P DRE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L+ GK G AG W GP + R G + IYV
Sbjct: 216 QLIAYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|355669957|gb|AER94693.1| ATG4 autophagy related 4-like protein C [Mustela putorius furo]
Length = 396
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 112/386 (29%), Positives = 163/386 (42%), Gaps = 78/386 (20%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
AGN + EF +DF SRI ++YR+ F I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 11 AGN--VEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 68
Query: 192 RPWRKP----------------------------------------LQKPFDREYVE--- 208
R W P QK R Y +
Sbjct: 69 RAWTWPDALNIENSDSESWTSNTVKKFTASFEASLSGEGELKTPTVSQKEAIRRYSDDHE 128
Query: 209 ---------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 129 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 188
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
G + IYV V D + C+ + D +++L+P+ L
Sbjct: 189 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRL 235
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G E+ N Y+ ++ + +GI+GGKP S Y G Q++S IY+DPH Q +++
Sbjct: 236 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 295
Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 437
D + T+H + + +DPS IGFYCR+ DF +K+ + S+ PL
Sbjct: 296 KDFPLE--TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPL 353
Query: 438 FTVTQTHKK-------PVNHSDVLGE 456
FT H + N D+ E
Sbjct: 354 FTFVNGHSRDYDFTSTTTNEEDLFSE 379
>gi|164660504|ref|XP_001731375.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
gi|159105275|gb|EDP44161.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
Length = 651
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 149/511 (29%), Positives = 220/511 (43%), Gaps = 98/511 (19%)
Query: 22 PNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAA 81
P+ AS SE+ S+ S S ++ +S+ S+ SA E ++ +
Sbjct: 177 PSEDTASAASEVLSTSSYSPDTPSTATAVDSSHQ-----SDPSAKETPLCPSQMHSSQQP 231
Query: 82 VKRLVTAGSMRRIHERVLGPSRTGISSST-------SDIWLLGVCHKIAQDEALGDAAGN 134
+ ++ + E VLG S T +S T + W L H + A
Sbjct: 232 ISDHQPVSTLLSLVEAVLGSSDTLPTSVTWLAHQLKARGWELLASHGVPYTSPTAHTAFP 291
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
+ F + +++R F TSDVGWGCMLRS Q ++A AL+ LGR W
Sbjct: 292 GVWHSVHAVFQHILSLTHRTCF--------TSDVGWGCMLRSVQSMLANALIRVHLGRHW 343
Query: 195 RKPLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP----YAMCR 248
R+ ++ +Y IL F D S PFSIH L+ G+ G+ AG W GP +A+C+
Sbjct: 344 RRRAKQKTHPQYARILSWFMDDPSLECPFSIHRLVDEGQRLGVQAGDWFGPSTAAFALCK 403
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD- 307
+A C GLG V DG VV + F+ G++D
Sbjct: 404 LIQAYDAC-----GLG----------VVVTNDGMLYKEQVVA--------ASFAPGRSDP 440
Query: 308 WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
WT P+L+L+ LGL++V P Y P L+ +FT PQS+G+VGG+P +S Y VGVQ E + L
Sbjct: 441 WTRPVLILLVQRLGLDQVPPHYRPALKQSFTMPQSVGVVGGRPRSSLYFVGVQREHLLCL 500
Query: 367 DPHDVQPVINI------------GKDDLEADTSTYHSDVIRHIHLDS------------- 401
DPH V+P + DL + S + + LDS
Sbjct: 501 DPHHVRPCVPFRSPPRMTRASVGASTDLASTVSPWFEEAYTAEELDSFHTPHTSLLPISQ 560
Query: 402 IDPSLAIGFYCRDKDDFDDFCAR----ASKLAEESNGAPLF----------------TVT 441
+DPS+ +GF C D D AR ++L + ++ P +
Sbjct: 561 MDPSMLLGFVCEQASDLIDLQARIESSETRLFDVADNMPSYYRLSMSMGGEGEGDDDDNH 620
Query: 442 QTHKKPVNHSDVLGETGGVPE--DDSLGVMS 470
+THK HSD + GV + DDS M+
Sbjct: 621 RTHKAEDGHSDRVAAHSGVGDNVDDSGWTMA 651
>gi|367047453|ref|XP_003654106.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
gi|347001369|gb|AEO67770.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
Length = 454
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 153/309 (49%), Gaps = 50/309 (16%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
F DF SRI ++YR GF DP +S ++ SD GWGCM+RS
Sbjct: 118 FLDDFESRIWMTYRTGFELIPRSTDPRANSALSFAMRLKTSFGDQTGFSSDTGWGCMIRS 177
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A AL RLGR WR+ +RE IL LF D +P+S+HN ++ G A G
Sbjct: 178 GQSLLANALQISRLGRDWRRATDPDAERE---ILSLFADDPRAPYSLHNFVKHGAAACGK 234
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R EALA + E+ L S G P V D
Sbjct: 235 YPGEWFGPSATARCIEALA--NQHESSLRVYST---------------GDLPDVYEDS-- 275
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+V + + P L+LV LG++K+N Y L T QS+GI GG+P +S Y
Sbjct: 276 -FMAVANPDGEHFHPTLILVCTRLGIDKINQVYEEALISTLQMEQSIGIAGGRPSSSHYF 334
Query: 356 VGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
VGVQ + YLDPH +P + +D + + H+ +RH+H++ +DPS+ IGF
Sbjct: 335 VGVQGQWLFYLDPHHPRPALPYREAPEDYTSEELGSCHTRRLRHLHVEDMDPSMLIGFLI 394
Query: 413 RDKDDFDDF 421
+D+DD+D +
Sbjct: 395 KDEDDWDTW 403
>gi|395530478|ref|XP_003767321.1| PREDICTED: cysteine protease ATG4C [Sarcophilus harrisii]
Length = 458
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 116/421 (27%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 154
S S + LLG C+ +E A G N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVLLLGKCYHFKSEEENDPAPVQPQWVGENEPVVVSGNVEEFRRDFISRIWLTYRE 95
Query: 155 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR------------------- 195
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDVDNSDSESWTSHT 155
Query: 196 ---------------------KPLQKPFDRE----------YVEILHLFGDSETSPFSIH 224
P+++P R + +I+ F DS + F +H
Sbjct: 156 VKKLTASLEASLTGERAAQDPSPIKEPPRRGSDDGGGEESCHRKIVSWFADSPLACFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEHGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + CS + +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYKADVIDKQCSSMDPENTEDKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESN--GAPLFTVTQTHKK-------PVNHSDVLG 455
S +GFYCR+ DF+ +K+ + S+ PLFT + H + P N D+
Sbjct: 381 SCTVGFYCRNIQDFERASEEITKVLKASSREKYPLFTFVKGHARDYDFTCTPTNEDDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|355669960|gb|AER94694.1| ATG4 autophagy related 4-like protein D [Mustela putorius furo]
Length = 388
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 171/368 (46%), Gaps = 67/368 (18%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 50 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 99
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
+TSD GWGCMLRS QM++AQ LL H L R W R P
Sbjct: 100 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSGLGPSEPSGLASPNRYRGPAR 159
Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L++ +R + +I+ F D +PF +H L G++ G AG W GP
Sbjct: 160 WVPPRWAHGTPELEQ--ERRHRQIVSWFADHPRAPFGLHRLGGLGQSSGKKAGDWYGP-- 215
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV + A +V D +
Sbjct: 216 -----SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT---------- 260
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 261 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 320
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 321 LDPHYCQPTVDVTQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSEL 378
Query: 426 SKLAEESN 433
+++ S+
Sbjct: 379 TRVLSSSS 386
>gi|335774946|gb|AEH58408.1| cysteine protease ATG4C-like protein, partial [Equus caballus]
Length = 400
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 165/374 (44%), Gaps = 71/374 (18%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
AGN + EF +DF+SRI ++YR+ F I S +T+D GWGC +R+ QML+AQ L+ H LG
Sbjct: 15 AGN--VEEFRKDFTSRIWLTYREEFPQIEGSTLTTDCGWGCTVRTGQMLLAQGLILHFLG 72
Query: 192 RPW----------------------------------RKPLQKPF------------DRE 205
R W + L+ P D E
Sbjct: 73 RAWTWPDALNIENSDFESWTSNTVKKFTASFEASLSEERELKTPTISLKETIGRYSDDHE 132
Query: 206 ------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
+ +I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 133 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 192
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
G + IYV V D + C+ + AD +++LVP+ L
Sbjct: 193 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCASMASDHADDKAVIILVPVRL 239
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G E+ N Y+ ++ + +GI+GGKP S Y G Q++S IY+DPH Q +++
Sbjct: 240 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 299
Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 437
D + T+H + + +DPS IGFYCR+ DF +K+ + S+ PL
Sbjct: 300 KDFPLE--TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPL 357
Query: 438 FTVTQTHKKPVNHS 451
FT H + + +
Sbjct: 358 FTFVNGHSRDYDFT 371
>gi|116283594|gb|AAH18678.1| ATG4C protein [Homo sapiens]
Length = 451
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGSVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGEERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|367032280|ref|XP_003665423.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
42464]
gi|347012694|gb|AEO60178.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
42464]
Length = 456
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 114/321 (35%), Positives = 163/321 (50%), Gaps = 57/321 (17%)
Query: 130 DAAGNNGLAE-FNQDFSSRILISYRKGF-------DP---------------IGD-SKIT 165
+++G++G F DF SRI ++YR GF DP +GD + T
Sbjct: 111 ESSGDSGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSSFSIAMRLKTTLGDQTGFT 170
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
SD GWGCM+RS Q L+A ALL RLGR WR+ +R IL LF D +P+S+HN
Sbjct: 171 SDTGWGCMIRSGQSLLANALLISRLGRDWRRMTDPDAERP---ILALFADDSRAPYSLHN 227
Query: 226 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
++ G+ A G G W GP A R +ALA + E+ L S G
Sbjct: 228 FVKHGELACGKYPGEWFGPSATARCIQALA--NKHESSLRVYST---------------G 270
Query: 285 GAPVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
P V D S + + D + P L+LV LG++K+N Y+ L T QS
Sbjct: 271 DLPDVYED------SFMATAKPDGETFHPTLILVCTRLGIDKINQVYVEALISTLQMEQS 324
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI--GKDDLEADT-STYHSDVIRHIH 398
+GI GG+P +S Y VGVQ + YLDPH +P + DD ++ + H+ +R +H
Sbjct: 325 IGIAGGRPASSHYFVGVQGQWLFYLDPHHPRPKLPYRENPDDYTSEELDSCHTRRLRRLH 384
Query: 399 LDSIDPSLAIGFYCRDKDDFD 419
++ +DPS+ IGF +D+DD+D
Sbjct: 385 VEDMDPSMLIGFLIKDEDDWD 405
>gi|30410844|ref|NP_116241.2| cysteine protease ATG4C [Homo sapiens]
gi|30410846|ref|NP_835739.1| cysteine protease ATG4C [Homo sapiens]
gi|114556947|ref|XP_001159883.1| PREDICTED: cysteine protease ATG4C isoform 4 [Pan troglodytes]
gi|114556951|ref|XP_001159976.1| PREDICTED: cysteine protease ATG4C isoform 6 [Pan troglodytes]
gi|61211867|sp|Q96DT6.1|ATG4C_HUMAN RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
cysteine endopeptidase; AltName: Full=Autophagin-3;
AltName: Full=Autophagy-related cysteine endopeptidase
3; AltName: Full=Autophagy-related protein 4 homolog C
gi|14625875|emb|CAC43939.1| putative autophagy-related cysteine endopeptidase [Homo sapiens]
gi|21542522|gb|AAH33024.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [Homo sapiens]
gi|27763973|emb|CAC85556.1| Apg4-C protein [Homo sapiens]
gi|119626984|gb|EAX06579.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|119626985|gb|EAX06580.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|123983334|gb|ABM83408.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
construct]
gi|123998035|gb|ABM86619.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
construct]
gi|410220598|gb|JAA07518.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410220600|gb|JAA07519.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410267918|gb|JAA21925.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410291226|gb|JAA24213.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410291228|gb|JAA24214.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410335203|gb|JAA36548.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410335205|gb|JAA36549.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
Length = 458
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|45861658|gb|AAS78582.1| Aut2B1 [Bos taurus]
Length = 342
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 111/322 (34%), Positives = 151/322 (46%), Gaps = 46/322 (14%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + + +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
Q G G + G W GP A+ +W ALA + M VV
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALA-----------VHVAMDNTVVMA 177
Query: 278 D-EDGERGGAPVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNP 326
D R P + D+ RHC+ F A W P++LL+PL LGL VN
Sbjct: 178 DIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNA 237
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
Y TL+ F PQSLG++GGKP ++ Y +G E IYLDPH QP + D
Sbjct: 238 AYAGTLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDE 297
Query: 387 STYHSDVIRHIHLDSIDPSLAI 408
S + + + +DPS+A+
Sbjct: 298 SFHCQHPPGRMSIAELDPSIAV 319
>gi|194853882|ref|XP_001968241.1| GG24763 [Drosophila erecta]
gi|190660108|gb|EDV57300.1| GG24763 [Drosophila erecta]
Length = 411
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 104/342 (30%), Positives = 165/342 (48%), Gaps = 36/342 (10%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D GWGCMLR QM++AQAL+ LGR W D Y++I++ F D S +SIH
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-ADCRDATYLKIVNRFEDVRNSFYSIHQ 150
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 151 IAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCGMI 249
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 402
GG+P + Y +G ++ +YLDPH Q +G+ A+ TYH + ++
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAVAEQDYDETYHQKHAARLSFSAM 309
Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 310 DPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351
>gi|297664749|ref|XP_002810790.1| PREDICTED: cysteine protease ATG4C [Pongo abelii]
Length = 458
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 118/421 (28%), Positives = 177/421 (42%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|195470405|ref|XP_002087497.1| GE17286 [Drosophila yakuba]
gi|194173598|gb|EDW87209.1| GE17286 [Drosophila yakuba]
Length = 411
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/343 (32%), Positives = 170/343 (49%), Gaps = 42/343 (12%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQELEL-----------IRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D GWGCMLR QM++AQAL+ LGR W D Y++I++ F D S +SIH
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-SDCRDATYLKIVNRFEDVRNSYYSIHQ 150
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ Q G+ A G W+GP + + + L R + +AI+V
Sbjct: 151 IAQMGETQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELESSCGMI 249
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 402
GG+P + Y +G ++ +YLDPH Q +G+ A+ TYH + ++
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAAAEQDYDETYHQKHAARLSFSAM 309
Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEE--SNGAP-LFTVTQ 442
DPSLA+ F C+ D F+ A +KL EE S +P LF ++Q
Sbjct: 310 DPSLAVCFLCKTSDSFE---ALLTKLKEEVLSLCSPALFEISQ 349
>gi|380092671|emb|CCC09424.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 515
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/307 (35%), Positives = 150/307 (48%), Gaps = 50/307 (16%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 176
F DF SRI ++YR F DP + +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A A+L RLGR WR+ + D E +I+ LF D +PFS+HN ++ G A G
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYGATACGK 296
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +AL E+GL S G P V D
Sbjct: 297 YPGEWFGPLATARCIQALT--DEKESGLRVYST---------------GDLPDVYEDS-- 337
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+V + + P L+LV LG++K+N Y L T PQS+GI GG+P +S Y
Sbjct: 338 -FMAVANPDGRGFQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 396
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
+GVQ + YLDPH +P + +D + T H+ +R +H+D +DPS+ IGF
Sbjct: 397 IGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELDTCHTRRLRQLHIDDMDPSMLIGFLI 456
Query: 413 RDKDDFD 419
+D+DD+D
Sbjct: 457 KDEDDWD 463
>gi|166990663|sp|Q2HH40.2|ATG4_CHAGB RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
Length = 448
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 154/310 (49%), Gaps = 56/310 (18%)
Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
F DF SRI ++YR GF+PI GD + +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 235
Q L+A ALL +LGR WR+ +R I+ LF D +P+S+ N ++ G A G
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAERN---IVALFADDARAPYSLQNFVKHGAIACGK 229
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA + + IY G P V D
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269
Query: 296 RHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
S + + D + P L+LV LG++K+NP Y L T QS+GI GG+P +S
Sbjct: 270 ---SFLATARPDGETFHPTLILVCTRLGIDKINPVYEEALISTLQMEQSIGIAGGRPSSS 326
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIG 409
Y VGVQ + YLDPH +P + ++ L + + H+ +R++H++ +DPS+ IG
Sbjct: 327 HYFVGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSCHTRRLRYLHVEDMDPSMLIG 386
Query: 410 FYCRDKDDFD 419
F +D+DD+D
Sbjct: 387 FLIQDEDDWD 396
>gi|403257906|ref|XP_003921531.1| PREDICTED: cysteine protease ATG4C [Saimiri boliviensis
boliviensis]
Length = 458
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 118/421 (28%), Positives = 177/421 (42%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEMYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|442625102|ref|NP_001259852.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
gi|440213106|gb|AGB92389.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
Length = 410
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 223
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 400
++GG+P + Y +G ++ +YLDPH Q + + A+ TYH ++
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
++DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351
>gi|383861144|ref|XP_003706046.1| PREDICTED: cysteine protease ATG4B-like [Megachile rotundata]
Length = 384
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 111/329 (33%), Positives = 163/329 (49%), Gaps = 50/329 (15%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 170
+W+LG + ++ L +D S++ +YRKGF PIG S TSD GW
Sbjct: 23 VWILGKQYNAIKE-----------LDAIRRDIRSKLWFTYRKGFVPIGGYTSTFTSDKGW 71
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
GCMLR QM++ QAL+ LGR W+ P + + Y++IL F D T+PFSIH +
Sbjct: 72 GCMLRCGQMVLGQALIILHLGRDWQWTPETR--NSTYLKILERFEDRRTAPFSIHQIASM 129
Query: 230 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
G + G G W GP + + + L + + I+V + +
Sbjct: 130 GASEGKEVGQWFGPNTIAQVLKKLVVYDDWSS--------ITIHVALDN---------TL 172
Query: 290 CIDDASRHCSVFS------------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
++D R C V K + W P+LLL+PL LGL ++NP YI L+ +F
Sbjct: 173 IVNDILRQCRVEGGTTAEADGNIPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFK 232
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDV 393
PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 233 IPQSLGVIGGKPNLALYFIGCVGNEVIYLDPHTTQRSGSVDKKLEEEEIEMD-ATYHCKF 291
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
I + IDPS+A+ F+C + DF C
Sbjct: 292 ASRIPITGIDPSVALCFFCATERDFKSLC 320
>gi|19920488|ref|NP_608563.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
gi|7296129|gb|AAF51423.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
gi|16198037|gb|AAL13802.1| LD26292p [Drosophila melanogaster]
gi|220945806|gb|ACL85446.1| Atg4-PA [synthetic construct]
gi|220955642|gb|ACL90364.1| Atg4-PA [synthetic construct]
Length = 411
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 223
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 400
++GG+P + Y +G ++ +YLDPH Q + + A+ TYH ++
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
++DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351
>gi|66529516|ref|XP_624577.1| PREDICTED: cysteine protease ATG4B [Apis mellifera]
Length = 382
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 161/332 (48%), Gaps = 42/332 (12%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 16 IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
TSD GWGCMLR QM++ QAL+ LGR W+ L+ + Y++IL F D +PFSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWSLETR-NSTYLKILERFEDKRNAPFSI 123
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 274
H + G + G G W GP + + W ++ + L + V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183
Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
G G AP+ K + W P+LLL+PL LGL ++NP YI L+
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 390
+F PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288
Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
I + IDPS+A+ F+C + DF C
Sbjct: 289 CKFSGRIPIIEIDPSVALCFFCATEKDFKSLC 320
>gi|442757637|gb|JAA70977.1| Putative cysteine protease required for autophagy [Ixodes ricinus]
Length = 458
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 169/404 (41%), Gaps = 80/404 (19%)
Query: 108 SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C H +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------------------ 198
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPYALSIENSDSESRTSHTVK 155
Query: 199 ----------------------------QKPFDRE------YVEILHLFGDSETSPFSIH 224
+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEAPLSGARELKSPTVSLKETIGRYPDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQRASMASDNTDDKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
S IGFYCR+ DF +K+ + S+ PLFT H +
Sbjct: 381 SCTIGFYCRNIQDFKRASEEITKMLKISSKEKYPLFTFVNGHSR 424
>gi|195575679|ref|XP_002077704.1| GD23066 [Drosophila simulans]
gi|194189713|gb|EDX03289.1| GD23066 [Drosophila simulans]
Length = 411
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 223
+D GWGCMLR QM++AQAL+ LGR W P D Y++I++ F D S +SI
Sbjct: 92 TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
V +DD C + W P+LL++PL LG+ +NP Y+P L+ S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 400
++GG+P + Y +G ++ +YLDPH Q + + A+ TYH ++
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
++DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKFKEEVLSLCSPALFEISQTR 351
>gi|402854773|ref|XP_003892029.1| PREDICTED: cysteine protease ATG4C isoform 1 [Papio anubis]
gi|402854775|ref|XP_003892030.1| PREDICTED: cysteine protease ATG4C isoform 2 [Papio anubis]
Length = 458
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKSILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMAFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 ETGGVPEDDSLGVMSMNDAV 475
E E L SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456
>gi|380023311|ref|XP_003695467.1| PREDICTED: cysteine protease ATG4B-like [Apis florea]
Length = 382
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 161/332 (48%), Gaps = 42/332 (12%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
I + +W+LG + ++ L +D S++ +YRK F PIG +S
Sbjct: 16 IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
TSD GWGCMLR QM++ QAL+ LGR W+ L+ + Y++IL F D +PFSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWNLETR-NSTYLKILERFEDKRNAPFSI 123
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 274
H + G + G G W GP + + W ++ + L + V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183
Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
G G AP+ K + W P+LLL+PL LGL ++NP YI L+
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 390
+F PQSLG++GGKP + Y +G IYLDPH Q ++ K +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288
Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
I + IDPS+A+ F+C + DF C
Sbjct: 289 CKFSGRIPIIEIDPSVALCFFCATEKDFKSLC 320
>gi|189091768|ref|XP_001929717.1| hypothetical protein [Podospora anserina S mat+]
gi|188219237|emb|CAP49217.1| unnamed protein product [Podospora anserina S mat+]
Length = 508
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 151/307 (49%), Gaps = 50/307 (16%)
Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
F DF SRI ++YR GF+ I GD + +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A A+L R GR WR+ +RE I+ LF D +P+SI N + G A G
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIERE---IVCLFADDPRAPYSIQNFVNHGAAACGK 289
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA+ + + +Y+ P V D+
Sbjct: 290 YPGEWFGPSATARCIQALAKKHDSS---------LRVYLTRD--------LPEVYEDN-- 330
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
S + + P L+LV LG++K+NP Y L T PQ++GI GG+P +S Y
Sbjct: 331 -FMSTANPDGNHFHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSSHYF 389
Query: 356 VGVQEESAIYLDPHDVQPVINIGK---DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
+G Q + YLDPH +P + + D + + H+ +RH+H++ +DPS+ IGF
Sbjct: 390 IGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSCHTRRLRHLHVEDMDPSMLIGFLI 449
Query: 413 RDKDDFD 419
+D+DD+D
Sbjct: 450 KDEDDWD 456
>gi|344278625|ref|XP_003411094.1| PREDICTED: cysteine protease ATG4C [Loxodonta africana]
Length = 458
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 172/404 (42%), Gaps = 80/404 (19%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPAISSCAIEDCVISGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 224
F D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGESELKTPSISLKKTIGKYSDDHEMRNEIYHRKIVSWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKAGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQCASMASDNPDNKAVIILVPVRLGGERTNVDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
S IGFYC++ DF+ +K+ + S+ PLFT H +
Sbjct: 381 SCTIGFYCQNVQDFERASEEITKMLKVSSKEKYPLFTFVNGHSR 424
>gi|383872484|ref|NP_001244816.1| cysteine protease ATG4C [Macaca mulatta]
gi|355745338|gb|EHH49963.1| hypothetical protein EGM_00712 [Macaca fascicularis]
gi|380788509|gb|AFE66130.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
gi|383413101|gb|AFH29764.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
Length = 458
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 ETGGVPEDDSLGVMSMNDAV 475
E E L SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456
>gi|449303631|gb|EMC99638.1| hypothetical protein BAUCODRAFT_344306 [Baudoinia compniacensis
UAMH 10762]
Length = 446
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 165/326 (50%), Gaps = 65/326 (19%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------- 163
A++EALG AEF D +RI ++YR F PI S
Sbjct: 103 AEEEALG------WPAEFMDDMEARIWLTYRNNFPPIAKSSDPSAGSAMSFSTKLRNIGN 156
Query: 164 ---ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 220
TSD GWGCM+RS Q L+A +L +LGR WR+ + + +Y ++ LF D+ +P
Sbjct: 157 SGGFTSDAGWGCMIRSGQTLLANSLATLKLGRDWRRGQK---EDDYKHLISLFADTPEAP 213
Query: 221 FSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
FSIH ++ G +A G G W GP A RS +AL R + GL + P +
Sbjct: 214 FSIHKFVEHGAQACGKHPGEWFGPSATARSVQALTEKYR-DVGLRVYARP---------D 263
Query: 280 DGERGGAPVVCIDDASRHCSVF-SKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRL 334
DG+ V +D S+F + GQ D + P L+++ + LG++++ P Y L+
Sbjct: 264 DGD------VYVD------SLFATAGQMDANDEFQPTLIVLGIRLGIDRITPVYHAALKA 311
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI--NIGKDDLEADTSTYHSD 392
T PQS+GI GG+P +S Y VG Q ++ YLDPH + I N +DL ++ H+
Sbjct: 312 TLEMPQSVGIAGGRPSSSHYFVGHQGDNFFYLDPHTTRQAIPQNPSAEDL----ASCHTR 367
Query: 393 VIRHIHLDSIDPSLAIGFYCRDKDDF 418
+R + + +DPS+ +GF K++F
Sbjct: 368 RLRRLKIAEMDPSMLLGFLIHSKEEF 393
>gi|166990618|sp|A7KAI3.1|ATG4_PICAN RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|129714817|gb|ABO31288.1| Atg4p [Ogataea angusta]
Length = 509
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 126/404 (31%), Positives = 187/404 (46%), Gaps = 80/404 (19%)
Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 163
L + HK D+A A + EF +D SRI ++YR GF I ++
Sbjct: 51 LRTLFHKFKPDQAADTEA--SWPREFLRDVHSRIWLTYRSGFPLIKRAEDGPSPLSFGSL 108
Query: 164 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
T+D GWGCM+R+SQ L+A +LL RLGR WR + + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANSLLQLRLGRGWRYDQTRECAK-HAEIV 167
Query: 211 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
F D T+PFSIHN ++ G G G W GP A RS + L +TGL
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKTGLKV---- 223
Query: 270 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 325
+ SGD ED +F Q A+ P+L+L + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQQGAELRPVLILAGIRLGVKNVN 263
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----- 380
P Y L+ T +PQS+GI GG+P +S Y G Q + YLDPH Q + I +
Sbjct: 264 PLYWDFLKKTLGWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHES 323
Query: 381 -------DLEA--DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
++E+ D + H++ IR +HLD +DPS+ +G ++ +D A +
Sbjct: 324 PDPNHYVEVESGLDLDSVHTNKIRKLHLDQMDPSMLVGLLVENRASYD---ALKHSINSH 380
Query: 432 SNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD--SLGVMSMND 473
G+ V + +PV + +GG+ E + LGV+SMN+
Sbjct: 381 DQGSRFLNVYDS--RPVLAAK---SSGGLEESEFVDLGVLSMNE 419
>gi|320581937|gb|EFW96156.1| cysteine protease ATG4, putative [Ogataea parapolymorpha DL-1]
Length = 509
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 126/404 (31%), Positives = 186/404 (46%), Gaps = 80/404 (19%)
Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 163
L + HK QD+A A + EF D SRI ++YR GF I ++
Sbjct: 51 LRTLFHKFKQDQAAETEA--SWPREFLGDVHSRIWLTYRSGFPLIRRAEDGPSPLSFGSL 108
Query: 164 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
T+D GWGCM+R+SQ L+A LL RLGR WR + + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANGLLQLRLGRGWRYDQTRECAK-HAEIV 167
Query: 211 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
F D T+PFSIHN ++ G G G W GP A RS + L + GL
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKIGLKV---- 223
Query: 270 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 325
+ SGD ED +F Q A+ P+L+L + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQEGAELRPVLILAGIRLGVKNVN 263
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----- 380
P Y L+ T ++PQS+GI GG+P +S Y G Q + YLDPH Q + I +
Sbjct: 264 PLYWDFLKKTLSWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHES 323
Query: 381 -------DLEA--DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
++E+ D + H++ IR +HLD +DPS+ +G ++ +D A +
Sbjct: 324 PDPNHYVEVESGLDLDSVHTNKIRKLHLDQMDPSMLVGLLVENRASYD---ALKHNINAH 380
Query: 432 SNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD--SLGVMSMND 473
G+ V + +PV + +GG+ E + LGV+SMN+
Sbjct: 381 DQGSRFLNVYDS--RPVLAAK---SSGGLEESEFVDLGVLSMNE 419
>gi|157115549|ref|XP_001658259.1| Autophagy-specific protein, putative [Aedes aegypti]
gi|108876876|gb|EAT41101.1| AAEL007228-PA [Aedes aegypti]
Length = 389
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 114/343 (33%), Positives = 172/343 (50%), Gaps = 42/343 (12%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG + +D L +D +R+ +YR+GF PIG S++T+D GWGC
Sbjct: 28 VWILGKSYSATEDLDL-----------IRRDVQTRLWCTYRRGFVPIGGSQLTTDKGWGC 76
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL LGR W + + Y++I++ F DS+ +PFS+H + G++
Sbjct: 77 MLRCGQMVLAQALTQLHLGRDWSWTPETT-NETYLKIVNRFEDSKAAPFSLHQIALTGES 135
Query: 233 -YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
G W GP + + + L + + I+V + +
Sbjct: 136 SEEKRVGEWFGPNTVAQVLKKLVKFD--------DWCSLVIHVALDN---------TLAT 178
Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
D+ C V W P+LL++PL LGL ++NP Y+ L+ F + G+VGG+P
Sbjct: 179 DEVLELC-VDRSNPDSWKPLLLIIPLRLGLSEINPIYVDGLKKCFELAGNCGMVGGRPNQ 237
Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLA 407
+ Y +G + A+YLDPH VQ IG D+ E D T+H R I+ +DPSLA
Sbjct: 238 ALYFIGYVADEALYLDPHTVQRSGTIGSKRDPDERELD-ETFHQKYARRINFKGMDPSLA 296
Query: 408 IGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKP 447
+ F C + DFDD R E+ NG PLF VT+T + P
Sbjct: 297 LCFLCATRKDFDDLIQR---FKEDLNGGGCQPLFEVTKTRQAP 336
>gi|147905876|ref|NP_001088249.1| cysteine protease ATG4C [Xenopus laevis]
gi|61211751|sp|Q5XH30.1|ATG4C_XENLA RecName: Full=Cysteine protease ATG4C; AltName:
Full=Autophagy-related protein 4 homolog C
gi|54038152|gb|AAH84245.1| LOC495080 protein [Xenopus laevis]
Length = 450
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 113/403 (28%), Positives = 171/403 (42%), Gaps = 95/403 (23%)
Query: 110 TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 157
S ++LLG C+ +++ D N+G + EF +DF SRI ++YRK F
Sbjct: 38 NSPVFLLGKCYHFKYEDSGVTADDCSNSGSDSKEDLSGNVDEFRKDFISRIWLTYRKEFP 97
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 194
I S T+D GWGC LR+ QML+AQ LL H LGR W
Sbjct: 98 QIESSSWTTDCGWGCTLRTGQMLLAQGLLVHFLGRDWTWTEALDIFCSESDFWTANTARK 157
Query: 195 -------------------RKPLQKPFDREYVEILH-----LFGDSETSPFSIHNLLQAG 230
++PLQ + Y E LH F D + F +H L++ G
Sbjct: 158 LDPSLEKSSPENEEYVSLGKQPLQNSEKKRYSEDLHRKIISWFADYPLAYFGLHQLVKLG 217
Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
K G AG W GP + L R E+ D E G +
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256
Query: 291 IDDASRHCSVFSKGQADW-------TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
D C++++ D +++LVP+ LG E+ N Y ++ + +G
Sbjct: 257 AQD----CTIYNADVYDLQCNKGNEKAVVILVPVRLGGERTNMEYFEYVKGILSLEFCIG 312
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
I+GGKP S Y VG Q++S IY+DPH Q +++ + + ++H + + +D
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSIKNFPLE--SFHCPSPKKMSFKKMD 370
Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTH 444
PS +GFYCR+ +F+ +K+ + S PLFT H
Sbjct: 371 PSCTVGFYCRNAREFEKAAEELTKVLKSSTKQNYPLFTFVNGH 413
>gi|291398772|ref|XP_002715996.1| PREDICTED: APG4 autophagy 4 homolog C [Oryctolagus cuniculus]
Length = 458
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 113/404 (27%), Positives = 170/404 (42%), Gaps = 80/404 (19%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
+ L+ P D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTICLKETIGKCSEDHETENEICHRKIISWFGDSPLAAFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + +YV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITVYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQSASMTSDNTDDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
S IGFYCR+ DF +K+ + S+ PLFT H +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKMSSKEKYPLFTFVNGHSR 424
>gi|118094640|ref|XP_422520.2| PREDICTED: cysteine protease ATG4C [Gallus gallus]
Length = 459
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 113/404 (27%), Positives = 168/404 (41%), Gaps = 80/404 (19%)
Query: 108 SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 156
S S ++LLG C+ DE+ L N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQA-------------------------------- 184
I S +T+D GWGC LR+ QML+AQ
Sbjct: 96 PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155
Query: 185 -----------------LLFHRLGRPWRKPLQKPFDREYV---EILHLFGDSETSPFSIH 224
+L H R R+ R V +I+ FGDS + F +H
Sbjct: 156 KLTASLEASLTAEREPKILSHHQERTLRRDCGDSEMRNEVYHRKIISWFGDSPLAAFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + +YV
Sbjct: 216 QLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D R CS G+ D +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
S IGFYCR DF+ +K+ + S+ PLFT + H +
Sbjct: 381 SCTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 424
>gi|355558068|gb|EHH14848.1| hypothetical protein EGK_00836 [Macaca mulatta]
Length = 458
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -FSVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF+ +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 ETGGVPEDDSLGVMSMNDAV 475
E E L SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456
>gi|50344862|ref|NP_001002103.1| cysteine protease ATG4C [Danio rerio]
gi|47938047|gb|AAH71514.1| Autophagy-related 4C (yeast) [Danio rerio]
Length = 463
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 118/408 (28%), Positives = 177/408 (43%), Gaps = 81/408 (19%)
Query: 108 SSTSDIWLLGVCH--KIAQDE--------ALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
S S ++LLG C+ K+ DE AL D + EF +DF+SR+ ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKVVDDENPTESTAEALDDDVVTGNVDEFRKDFTSRVWLTYREEFP 95
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQK---------- 200
+ S TSD GWGC LR+ QM++AQALL H LGR W+ +PL
Sbjct: 96 ALPGSSFTSDCGWGCTLRAGQMILAQALLLHILGRDWKWSEALSLEPLDTETWTSSAARR 155
Query: 201 ---------------------PFDREYVE------------ILHLFGDSETSPFSIHNLL 227
P E E I+ FGD ++ I+ L+
Sbjct: 156 LVATLEASIQGERAQASQPLCPVQGEAEEADSYLKETYHRTIVSWFGDGPSAQLGIYKLV 215
Query: 228 QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 287
+ G G AG W GP +A R ++ I V +D A
Sbjct: 216 ELGMTSGKQAGDWYGP-------AVVAHILRKAVDEAVDAMLKGIRVYVA-QDCTVYSAD 267
Query: 288 VVCIDDASRHCSVFSKGQA-------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
V ID S S Q D +++L+P+ LG EK+NP Y+ ++ +
Sbjct: 268 V--IDSHSTRTESHSDPQGLDSGASPDSRAVVILIPVRLGGEKINPEYLNFVKSILSLEY 325
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
+GI+GGKP + Y VG Q++S IY+DPH Q +++ D ++H + +
Sbjct: 326 CIGIIGGKPKQAYYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSPKKMSFS 383
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
+DPS IGFY + + F+ SK+ + S+ P FT+ + H K
Sbjct: 384 KMDPSCTIGFYSKSVEHFEKIANELSKILQPSSKEKYPAFTIMKGHGK 431
>gi|380485578|emb|CCF39271.1| cysteine protease atg4 [Colletotrichum higginsianum]
Length = 454
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 103/306 (33%), Positives = 142/306 (46%), Gaps = 49/306 (16%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF S+ ++YR F I S TSD GWGCM+RS
Sbjct: 118 FLDDFESKFWMTYRSEFQAIAKSTDPRASSTLSFSMRIKSQLVDQNGFTSDSGWGCMIRS 177
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 235
Q L+A A+ LGR WR+ Q P D ++L F D +P+SIH +Q G A G
Sbjct: 178 GQSLLANAMAAINLGRDWRR-GQNPEDER--KLLSWFADDPRAPYSIHQFVQHGAVACGK 234
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA Q + P+ +Y G P V D
Sbjct: 235 YPGEWFGPSATARCIQALANAQEQQ--------PLRVYST--------GDGPDVYED--- 275
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+ + + + P L+LV LG++K+ P Y L PQS+GI GG+P +S Y
Sbjct: 276 KFMEIAKPDGSRFNPTLILVGTRLGIDKITPVYWEALIAALQMPQSVGIAGGRPASSHYF 335
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
+G Q YLDPH +P + D EAD T H+ +R +H+ +DPS+ +GF
Sbjct: 336 IGAQGSYLFYLDPHHTRPALPFHTDPSHYSEADVDTVHTRRLRRLHVRELDPSMLVGFLI 395
Query: 413 RDKDDF 418
RD+DD+
Sbjct: 396 RDEDDW 401
>gi|168693565|ref|NP_001108301.1| uncharacterized protein LOC100137698 [Xenopus laevis]
gi|163915830|gb|AAI57741.1| LOC100137698 protein [Xenopus laevis]
Length = 468
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 161/354 (45%), Gaps = 59/354 (16%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
++ + F +DF SR+ ++YR+ F + + +T+D GWGCM+RS QML+AQ LL H L R
Sbjct: 92 DDEIDRFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLMHLLSRE 151
Query: 194 W----------------------RKPL-------------------QKPF-DREYVEILH 211
W R PL + P ++ + I+
Sbjct: 152 WTWPEALYTHFVEMEPIRSSSPSRMPLSSLATSHSASDCWPHAHSSRAPHGNQVHRNIIR 211
Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
F D ++PF +H ++ G +G AG W GP +A + C+ ++
Sbjct: 212 WFSDHPSAPFGLHRMVALGSIFGKKAGDWYGP-------SIVAHIIKKAIETSCEVAELS 264
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
+YV S D + + D + G+A +++LVP LG E NP Y
Sbjct: 265 VYV-SQDCTVYKADIEQLFAGDVPHAETSRDAGKA----VIILVPARLGGETFNPVYKHC 319
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
L+ P LGI+GGKP S Y +G Q+ +YLDPH Q I+ ++D + ++H
Sbjct: 320 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYSQSYIDTSRNDFPLE--SFHC 377
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQ 442
+ R I + +DPS FY +++DDF C K+ + P+F++++
Sbjct: 378 NTPRKISITRMDPSCTFAFYAQNRDDFGKLCDHLMKVLHSPHAEEKYPIFSISE 431
>gi|327264155|ref|XP_003216881.1| PREDICTED: cysteine protease ATG4D-like [Anolis carolinensis]
Length = 585
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 157/359 (43%), Gaps = 69/359 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----- 194
F +DF+SRI ++YR+ F + + T+D GWGCMLRS QML+AQ L+ H LG+ W
Sbjct: 198 FQKDFASRIWLTYRRDFQQLEGTMWTTDCGWGCMLRSGQMLLAQGLIVHFLGKDWTWPDA 257
Query: 195 ------------------------------------------------RKPLQKPFDREY 206
R P + +R +
Sbjct: 258 LHTPGLVEMEPMKATHLPYPSTSSSHQGPSIPTDRSRGPWELRAPRHTRSPDELEKERYH 317
Query: 207 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
+I+ F D + F IH L+ G + G AG W GP C C
Sbjct: 318 RKIISWFADRPQAHFGIHRLVSLGHSSGKKAGDWYGPSVAAHIIRKAVDC--------CS 369
Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 326
+ VS D +G V + + S + + G A W +++LVP+ LG E NP
Sbjct: 370 EAGNLVVYVSQDCTVYKGD--VANLANKSEDRTAWDPG-AVWKAVIILVPMRLGGEAFNP 426
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
Y+ ++ +GI+GGKP S Y VG Q+++ +YLDPH QP ++ K++ +
Sbjct: 427 AYVDCVKELLKLEFCIGIIGGKPRHSLYFVGYQDDALLYLDPHYCQPFVDTTKENFPLE- 485
Query: 387 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQ 442
++H + R +DPS IGFY + +F++ C +++ S P+F++ +
Sbjct: 486 -SFHCNSPRKTAFTKVDPSCTIGFYAHHRTEFEELCLHLTQVLNSSTAKEKYPMFSIVE 543
>gi|73956170|ref|XP_852273.1| PREDICTED: cysteine protease ATG4C isoform 2 [Canis lupus
familiaris]
gi|73956176|ref|XP_865426.1| PREDICTED: cysteine protease ATG4C isoform 4 [Canis lupus
familiaris]
Length = 458
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 117/421 (27%), Positives = 172/421 (40%), Gaps = 87/421 (20%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKFEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEGSAFTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSDSWTSNTVK 155
Query: 201 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 224
F D E + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGESELKTPTVSQKETIRRHSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + D +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
S IGFYCR+ DF +K+ + S+ PLFT H + N D+
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440
Query: 456 E 456
E
Sbjct: 441 E 441
>gi|14042698|dbj|BAB55356.1| unnamed protein product [Homo sapiens]
Length = 446
Score = 161 bits (407), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 115/397 (28%), Positives = 170/397 (42%), Gaps = 80/397 (20%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFT 439
S IGFYCR+ DF +K+ + S+ PLFT
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFT 417
>gi|432855098|ref|XP_004068071.1| PREDICTED: cysteine protease ATG4C-like [Oryzias latipes]
Length = 482
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 120/424 (28%), Positives = 179/424 (42%), Gaps = 96/424 (22%)
Query: 108 SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAE---------FNQDFSSRILISYRKGFD 157
S S + LLG C H A D+ D A E F +DF+SR+ ++YR+ F
Sbjct: 36 SRNSPVLLLGRCYHFKADDDGSADEASCREPEEGFSMGNVEAFRKDFTSRVWLTYREEFP 95
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQ----------- 199
P+ S +T+D GWGC+LR+ QM++AQAL+ H LGR W +PL
Sbjct: 96 PLPGSTLTTDCGWGCLLRAGQMMLAQALVLHFLGRDWTWSEALTLQPLDTETWTASAAKR 155
Query: 200 -------------KPFDREYVE-----------------------ILHLFGDSETSPFSI 223
K DR++ E I+ FGD+ ++ +
Sbjct: 156 LVASLEASLQGSPKNSDRQHSEPQSSSQGSAEEAEAHLKEMYHRTIISWFGDTSSALLGL 215
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG------------------- 264
H L++ G G AG+W GP + + + ++GL
Sbjct: 216 HRLVRLGLTMGKNAGNWYGPAVVAHILKKAVE-EAMDSGLAGITAYVSQDCTVYSADVAD 274
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
C P A GG P +D S+ QA +++L+P+ LG EK+
Sbjct: 275 CHKPPSARQASVSPPIA--GGGP--SKEDQPGSASILPDSQA----VIILIPVRLGGEKI 326
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
NP Y ++ + +GI+GGKP + Y VG Q++S IY+DPH Q +++ D
Sbjct: 327 NPEYFEFVKNILSVEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSNGDFP- 385
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQ 442
++H + I +DPS IGFY R D+D SKL + S P FT Q
Sbjct: 386 -LQSFHCPSPKKIPFTRMDPSCTIGFYSRSLQDYDRIREELSKLLQPSTKEKYPAFTFVQ 444
Query: 443 THKK 446
H +
Sbjct: 445 GHGR 448
>gi|212645205|ref|NP_493375.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
gi|193247781|emb|CAB54483.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
Length = 454
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 168/369 (45%), Gaps = 59/369 (15%)
Query: 127 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 185
ALG + + +G+ + +SR +YR+ F PIG + ++D GWGCMLR +QML+ + L
Sbjct: 39 ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 98
Query: 186 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
L +GR + ++K Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 99 LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 157
Query: 246 MCR---------SWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 289
+ W +A + L + ++ MA S D E+G
Sbjct: 158 AAQVMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENG-------- 209
Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
+ D +R +W P+LL++PL LGL +NP Y+ ++ F PQ +GI+GG+P
Sbjct: 210 -LVDKNRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRP 268
Query: 350 GASTYIVGVQEESAIYLDPHDVQPVI-----------------NIGKDDLE--------- 383
+ Y VG+ YLDPH +P ++G LE
Sbjct: 269 NHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDVGFSHLEELVPLPSQT 328
Query: 384 ------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 437
D STYH ++ I +++DPSLA+ +C +D+F++ C K ++ P+
Sbjct: 329 ADVYTKMDDSTYHCQMMLWIEYENVDPSLALAMFCETRDEFENLCETLQKTTLPASQPPM 388
Query: 438 FTVTQTHKK 446
F Q K
Sbjct: 389 FEFLQRRPK 397
>gi|126305934|ref|XP_001364974.1| PREDICTED: cysteine protease ATG4C [Monodelphis domestica]
Length = 460
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 122/423 (28%), Positives = 183/423 (43%), Gaps = 89/423 (21%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 154
S S + LLG C+ +E A AG N + EF +DF SRI ++YR+
Sbjct: 36 SRNSPVLLLGKCYHFKSEEENDPAPVGSGWAGENEHVVIYGNVEEFRRDFISRIWLTYRE 95
Query: 155 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------- 197
F I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDIENSDSASWTSHT 155
Query: 198 ------------------------LQKPF-----DRE------YVEILHLFGDSETSPFS 222
L++P D E + +I+ FGDS + F
Sbjct: 156 VKKLTASFEASLTGERTPKVPPSILKEPRRTGSEDEEGRNELCHRKIISWFGDSPLACFG 215
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
+H L++ GK G AG W GP + R G + IYV +D
Sbjct: 216 LHQLIEYGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVA---QDCT 267
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
A V+ S ++ +A I+LLVP+ LG E+ N Y+ ++ + +
Sbjct: 268 VYKADVIDKQGISAGLET-TEDKA----IILLVPVRLGGERTNMDYLDFVKGILSLEYCV 322
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 402
GI+GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +
Sbjct: 323 GIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKM 380
Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESN--GAPLFTVTQTHKK-------PVNHSDV 453
DPS +GFYCR+ DF+ +++ + S+ PLFT + H + P N D+
Sbjct: 381 DPSCTVGFYCRNAQDFERASEELTQVLKASSREKYPLFTFVKGHARDYDFTSTPTNEDDL 440
Query: 454 LGE 456
E
Sbjct: 441 FSE 443
>gi|47222154|emb|CAG11580.1| unnamed protein product [Tetraodon nigroviridis]
Length = 440
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/410 (28%), Positives = 174/410 (42%), Gaps = 81/410 (19%)
Query: 108 SSTSDIWLLGVCH--KIAQDEALGDAAGN--------NGLAEFNQDFSSRILISYRKGFD 157
S S + LLG C+ K+ +DE + +A + +F +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKVEEDEGVAEACCEASDEEDVVGNVEDFRRDFGSRIWLTYREEFP 95
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDRE--------- 205
P+ S +TSD GWGCMLR+ QM++AQALL H +GR W R +P D E
Sbjct: 96 PLPGSTLTSDCGWGCMLRAGQMMLAQALLLHFMGRDWTWSRTMSLQPLDTETWTTSAAKR 155
Query: 206 ----------------------------------YVE-------ILHLFGDSETSPFSIH 224
+VE ++ FGDS ++ F +H
Sbjct: 156 LVASLESSLQGSPGPSDNRGPQNQAAGSAEEAGAHVEGEAFHRTLVSWFGDSPSAQFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSW-----EALARCQRAETGLGCQSLPMAIYVVSGDE 279
++ G G AG W GP + EAL T Q + V
Sbjct: 216 RMVHLGLEMGKQAGEWYGPAVVAHILKKAVEEALDPSLAGITAYVSQDCTVYSADVIDGH 275
Query: 280 DGERGGAP-----VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
+P V + ++ S +A +++LVP+ LG EK NP Y +
Sbjct: 276 KASTSASPESSDDVTLLSPNNQAASALPDSRA----VIILVPVRLGGEKTNPDYFNLAKS 331
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 394
+ +GI+GGKP + Y VG Q++S IY+DPH Q +++ D ++H
Sbjct: 332 ILSLDYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSP 389
Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQ 442
+ + +DPS +GFY R DF+ +KL + S+ P F Q
Sbjct: 390 KKMPFTKMDPSCTLGFYSRSAQDFEKIKQELTKLLQPSSKEKYPAFIFVQ 439
>gi|310801857|gb|EFQ36750.1| peptidase family C54 [Glomerella graminicola M1.001]
Length = 454
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 107/329 (32%), Positives = 152/329 (46%), Gaps = 54/329 (16%)
Query: 122 IAQDEALGDAAGNNG--LAEFNQDFSSRILISYRKGFDPIGDSK---------------- 163
+A DE D +G +G F DF S+ ++YR F I S
Sbjct: 101 LAYDE---DYSGQDGGWPTAFLDDFESKFWMTYRSEFPAIAKSTDPRASSALSFSMRIKS 157
Query: 164 -------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216
+SD GWGCM+RS Q L+A A+ LGR WR+ + +R+ +L LF D
Sbjct: 158 QLVDQNGFSSDSGWGCMIRSGQSLLANAMAVINLGRDWRRGQNQEEERK---LLSLFADD 214
Query: 217 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
+P+SIH +Q G A G G W GP A R +ALA Q + P+ +Y
Sbjct: 215 PRAPYSIHQFVQHGAVACGKYPGEWFGPSATARCIQALANAQMHQ--------PLRVYST 266
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
D + D SR + P L+LV LG++K+ P Y L
Sbjct: 267 GDGPDVYEDKFMKIAKPDGSR-----------FHPTLILVGTRLGIDKITPVYWEALIAA 315
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSD 392
PQS+GI GG+P +S Y +G Q YLDPH +P + + EAD T H+
Sbjct: 316 LQMPQSVGIAGGRPSSSHYFIGAQGSYLFYLDPHHTRPALPFHMNPSLYSEADVDTVHTR 375
Query: 393 VIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+R +H+ +DPS+ IGF D+DD+D++
Sbjct: 376 RLRRLHVRELDPSMLIGFLILDEDDWDEW 404
>gi|336467357|gb|EGO55521.1| hypothetical protein NEUTE1DRAFT_85886 [Neurospora tetrasperma FGSC
2508]
gi|350288001|gb|EGZ69237.1| hypothetical protein NEUTE2DRAFT_94213 [Neurospora tetrasperma FGSC
2509]
Length = 506
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 108/309 (34%), Positives = 151/309 (48%), Gaps = 50/309 (16%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
F DF SRI ++YR F DP S ++ SD GWGCM+RS
Sbjct: 171 FLDDFESRIWMTYRTDFAFIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 230
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A A+L RLGR WR+ D E +I+ LF D +P+S+HN ++ G A G
Sbjct: 231 GQSLLANAILIARLGREWRRGTD--LDAE-KDIIALFADDPRAPYSLHNFVKYGATACGK 287
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA ++GL S G P V D
Sbjct: 288 YPGEWFGPSATARCIQALA--DEKQSGLRVYST---------------GDLPDVYEDS-- 328
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+V + + P L+LV LG++K+N Y L T PQS+GI GG+P +S Y
Sbjct: 329 -FMAVANPDGRGFQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 387
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
VGVQ + YLDPH +P + +D + T H+ +R +H+ +DPS+ IGF
Sbjct: 388 VGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELDTCHTRRLRQLHIGDMDPSMLIGFLI 447
Query: 413 RDKDDFDDF 421
+D+DD+D +
Sbjct: 448 KDEDDWDTW 456
>gi|453230621|ref|NP_001263575.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
gi|412974713|emb|CCO25637.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
Length = 481
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 168/369 (45%), Gaps = 59/369 (15%)
Query: 127 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 185
ALG + + +G+ + +SR +YR+ F PIG + ++D GWGCMLR +QML+ + L
Sbjct: 66 ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 125
Query: 186 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-- 243
L +GR + ++K Y +IL +F D + + +SIH + Q G G W GP
Sbjct: 126 LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 184
Query: 244 -------YAMCRSWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 289
+ W +A + L + ++ MA S D E+G
Sbjct: 185 AAQVMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENG-------- 236
Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
+ D +R +W P+LL++PL LGL +NP Y+ ++ F PQ +GI+GG+P
Sbjct: 237 -LVDKNRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRP 295
Query: 350 GASTYIVGVQEESAIYLDPHDVQPVI-----------------NIGKDDLE--------- 383
+ Y VG+ YLDPH +P ++G LE
Sbjct: 296 NHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDVGFSHLEELVPLPSQT 355
Query: 384 ------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 437
D STYH ++ I +++DPSLA+ +C +D+F++ C K ++ P+
Sbjct: 356 ADVYTKMDDSTYHCQMMLWIEYENVDPSLALAMFCETRDEFENLCETLQKTTLPASQPPM 415
Query: 438 FTVTQTHKK 446
F Q K
Sbjct: 416 FEFLQRRPK 424
>gi|255945233|ref|XP_002563384.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
gi|166990617|sp|A7KAL5.1|ATG4_PENCW RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|129561973|gb|ABO31075.1| Atg4p [Penicillium chrysogenum]
gi|211588119|emb|CAP86190.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 401
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 116/380 (30%), Positives = 171/380 (45%), Gaps = 75/380 (19%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAE-----------------FNQDFSSRILISYRKG 155
IW LG + A + D A NN + F DF SRI I+YR
Sbjct: 29 IWCLG--REYAPSQPPSDPASNNPRSPSRQPNASTLNDTTWPKAFLSDFGSRIWITYRSN 86
Query: 156 FDPIGDSK-----------------------ITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
F PI +K TSD GWGCM+RS Q L+A LGR
Sbjct: 87 FTPIPRTKTPEATSSMTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQSLLANTFSVLLLGR 146
Query: 193 PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWE 251
WR+ + E +++ +F D +PFSIH + G ++ G G W GP
Sbjct: 147 DWRRGEKV---EEESKLISMFADHPEAPFSIHRFVNRGAESCGKYPGEWFGP-------S 196
Query: 252 ALARCQRAETGLGCQS-LP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
A A+C + L QS +P + +Y+ + D +D H + G+
Sbjct: 197 ATAKCIQL---LSTQSEVPQLRVYLTNDTSD---------VYEDKFAHVAHDESGRIQ-- 242
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P L+L+ LG++ V P Y LR T+PQS+GI GG+P AS Y VG Q+ +LDPH
Sbjct: 243 PTLILIGTRLGIDNVTPAYWDGLRAALTYPQSVGIAGGRPSASHYFVGAQDCHLFFLDPH 302
Query: 370 DVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
+P D L + + +Y++ +R IH+ +DPS+ IGF +D+DD+ D+ K
Sbjct: 303 TTRPATLYRPDGLYTQEELDSYYTSRLRRIHIKDMDPSMLIGFLVKDEDDWADW----KK 358
Query: 428 LAEESNGAPLFTVTQTHKKP 447
+ G P+ + + +P
Sbjct: 359 RIRSTPGQPIVHIFPSQHQP 378
>gi|85067704|ref|XP_959438.1| hypothetical protein NCU02433 [Neurospora crassa OR74A]
gi|62899773|sp|Q7S3X7.1|ATG4_NEUCR RecName: Full=Probable cysteine protease atg-4; AltName:
Full=Autophagy-related protein 4
gi|28920860|gb|EAA30202.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 506
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/309 (34%), Positives = 151/309 (48%), Gaps = 50/309 (16%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
F DF SRI ++YR F DP S ++ SD GWGCM+RS
Sbjct: 171 FLDDFESRIWMTYRTDFALIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 230
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A A+L RLGR WR+ D E +I+ LF D +P+S+HN ++ G A G
Sbjct: 231 GQSLLANAILIARLGREWRRGTD--LDAEK-DIIALFADDPRAPYSLHNFVKYGATACGK 287
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA ++GL S G P V D
Sbjct: 288 YPGEWFGPSATARCIQALA--DEKQSGLRVYST---------------GDLPDVYEDS-- 328
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+V + + P L+LV LG++K+N Y L T PQS+GI GG+P +S Y
Sbjct: 329 -FMAVANPDGRGFQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 387
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
VGVQ + YLDPH +P + +D + T H+ +R +H+ +DPS+ IGF
Sbjct: 388 VGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELDTCHTRRLRQLHIGDMDPSMLIGFLI 447
Query: 413 RDKDDFDDF 421
+D+DD+D +
Sbjct: 448 KDEDDWDTW 456
>gi|387015378|gb|AFJ49808.1| Cysteine protease ATG4C-like [Crotalus adamanteus]
Length = 457
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 113/402 (28%), Positives = 170/402 (42%), Gaps = 78/402 (19%)
Query: 108 SSTSDIWLLGVCHKIAQDEA-----------LGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S ++LLG C+ DE + D + + + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKSDEPSDQSPNGSCDDMTDESFSRNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
I S +T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 96 PQITGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWANAFVFENPESESWTSQTVK 155
Query: 195 -----------------------RKPLQKPFDREYVE------ILHLFGDSETSPFSIHN 225
+ P++ E VE I+ F DS + F +H
Sbjct: 156 KLTASLETSLIGEREFRSQSTHPKSPIRNQETEESVEEQYHRRIISWFADSPFANFGLHR 215
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
L++ GK G AG W GP + L R + E + + IYV +
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAH----LLR-KAVEKARDPELQGITIYVAQDCTVYKSDV 270
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
+C S SV S I++L+P+ LG E+ N Y ++ + +GI+
Sbjct: 271 IDALCPFTDSEKTSVKS--------IIILIPVRLGGERTNMEYFEFVKGILSLDYCIGII 322
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
GGKP S Y G Q++S IY+DPH Q +++ D + ++H + + +DPS
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSVKDFPLE--SFHCPSPKKMSFKKMDPS 380
Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESN-GAPLFTVTQTHKK 446
IG YC D F+ +K+ + S PLFT H +
Sbjct: 381 CTIGLYCPDMQGFERAAEEITKILKLSKEKYPLFTFVNGHSR 422
>gi|354470829|ref|XP_003497647.1| PREDICTED: cysteine protease ATG4C [Cricetulus griseus]
Length = 458
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 115/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENSDSDSWTSNTVK 155
Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
F +RE + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELRTTALSLKETIGKYSDDHAVQNEIYHRKIISWFGDSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTNSSTSGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
S IGFYCR+ DF+ +K+ + S+ PLFT H + + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKILKISSKEKYPLFTFVNGHSRDFDFT 429
>gi|268570274|ref|XP_002640735.1| Hypothetical protein CBG19805 [Caenorhabditis briggsae]
Length = 481
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 122/469 (26%), Positives = 195/469 (41%), Gaps = 104/469 (22%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
IS T IW LG + + +G+ + +SR +YR+ F PIG + +
Sbjct: 25 ISIDTFPIWALG-----------KEISKEDGIDAMKKYMTSRFWFTYRRNFSPIGGTGPS 73
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D WGCMLR +QML+ + LL +GR + ++K D Y +IL +F D + + +SIH
Sbjct: 74 TDQYWGCMLRCAQMLLGEVLLRRHIGRHFEWDIEKTSDV-YEKILQMFFDEKDALYSIHQ 132
Query: 226 LLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQ-SLPMAIYVV 275
+ Q G + G W GP + W +A + L Q +L MA
Sbjct: 133 IAQMGVSEGKEVSEWFGPNTAAQVIKKLTIFDDWSNIAVHVALDNILVKQDALTMATTYP 192
Query: 276 SGDE----DGERG-------GAPVVCID-DASRHCSVFSKGQAD-------------WTP 310
S D GE G + ++C++ D + F G + W P
Sbjct: 193 SEDAVKLIMGEFGFKSDRISSSHIICMNLDYFKKLLNFENGLVEKHYTSTVPANGTEWRP 252
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
+LL++PL LGL +N Y+ ++ F PQ +GI+GGKP + Y VG+ YLDPH
Sbjct: 253 LLLMIPLRLGLTSINSCYLSAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHH 312
Query: 371 VQP--------------------------VINIGKDDLE---------------ADTSTY 389
+P + + G +LE + STY
Sbjct: 313 CRPKTSKFFVEKEQQQQSSGDSTPEKVEKIDDNGFHELEDLEPLPSQTSDVYTKMNDSTY 372
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV- 448
H +++ + DSIDPSLA+ +C +++F++ C K ++ P+F + K +
Sbjct: 373 HCQMMQWMEYDSIDPSLALALFCETREEFENLCDELQKTTLTASNPPMFEFLEKRPKYLP 432
Query: 449 ---------------NHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 482
D+ + ED + +S+ DA A DD
Sbjct: 433 KFEPYTGVSMKIEMKEFDDIGAANSKIDEDFEVLDVSVEDAETGAEADD 481
>gi|121934653|sp|Q0U199.1|ATG4_PHANO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
Length = 467
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 158/354 (44%), Gaps = 87/354 (24%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
N + F DF SR+ ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
+RS Q ++A AL RLGR WR + D+++ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209
Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + LA R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVYV-SGD------GADVY--E 252
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + +V G W P L+LV LG++K+ P Y L+ + PQS+GI GG+P AS
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKASLQIPQSIGIAGGRPSAS 310
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEA---------------------------- 384
Y VGVQ + YLDPH +P++ L A
Sbjct: 311 HYFVGVQGNNFYYLDPHSTRPLLPFHPPSLAAATSDTPNLTASTTSVSSTTSSTTIVPPA 370
Query: 385 -----------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
D ST H+ IR + + +DPS+ + F + D+ D+
Sbjct: 371 DSIPAPSDPRQSLYPPSDLSTCHTRRIRRLQIREMDPSMLLAFLVTSEADYQDW 424
>gi|407917424|gb|EKG10733.1| Peptidase C54 [Macrophomina phaseolina MS6]
Length = 437
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 107/342 (31%), Positives = 157/342 (45%), Gaps = 54/342 (15%)
Query: 130 DAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSK-----------------------IT 165
D+ N G + F DF +R+ I+YR F I S+ +
Sbjct: 94 DSDANGGWPSPFLDDFEARVWITYRSNFAAIPKSQDPNATTAMSFSVRFRNQISNQGGFS 153
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
SD GWGCM+RS Q L+A AL RLGR WR+ +R IL LF D +PFSIH
Sbjct: 154 SDTGWGCMIRSGQSLLANALQVLRLGRAWRRGQDSQGERR---ILSLFADDPKAPFSIHR 210
Query: 226 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
++ G A G G W GP A R +AL+ G + + +Y+ D
Sbjct: 211 FVEHGAVACGKHPGEWFGPSATARCIQALSN--------GYEDAGLRVYITGDGSD---- 258
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
+D+ V + P L+LV + LG+++V P Y L+ + QS+GI
Sbjct: 259 -----VYEDS--FMKVAKDANNTFHPTLVLVGIRLGIDRVTPVYWEALKASLQLSQSIGI 311
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDS 401
GG+P AS Y VG Q YLDPH +P + + D + D + H+ +R +H+
Sbjct: 312 AGGRPSASHYFVGTQGSYFFYLDPHTTRPFLPLHSDLSDYTQEDIDSCHTRRLRRLHVKE 371
Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
+DPS+ I F RD+ D+ ++ K E +G P+ V +
Sbjct: 372 MDPSMLIAFLIRDETDWQNW----RKAVAEVHGKPVIHVADS 409
>gi|308490628|ref|XP_003107506.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
gi|308251874|gb|EFO95826.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
Length = 478
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 174/386 (45%), Gaps = 82/386 (21%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
+GL + +SR+ +YR+ F PIG + ++D GWGCMLR +QML+ + LL +GR +
Sbjct: 47 DGLEAMKKYMTSRLWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLRRHIGRHF 106
Query: 195 RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR------ 248
++K Y +IL +F D + + +SIH + Q G G W GP +
Sbjct: 107 EWDIEKT-SEVYDKILQMFFDEKDALYSIHQIAQMGVTEGKKVSEWFGPNTAAQVIKKLT 165
Query: 249 ---SWEALARCQRAETGLGCQ-SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV---- 300
W +A + L + +L MA S + + + + + ++ ++
Sbjct: 166 IFDDWSNIAVHVALDNILVKEDALTMATTYPSDN------ASYIFAVHNFLKYFTLNLTF 219
Query: 301 --FSK-GQ-----------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
F++ GQ DW P+L+++PL LGL +NP Y+P ++ F PQ +GI+G
Sbjct: 220 PNFAENGQIEKPRPSSGCTTDWRPLLVMIPLRLGLTSINPCYLPAIQKFFELPQCVGIIG 279
Query: 347 GKPGASTYIVGVQEESAIYLDPH-----------------------------DVQPVINI 377
GKP + Y VG+ YLDPH D+Q I+
Sbjct: 280 GKPNLAHYFVGIAGTKLFYLDPHHCRAKTTKRDAGVTTNTMISSITTTDAQLDIQNQIDD 339
Query: 378 GK----DDLE------------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+DLE D STYH +++ + +SIDPSLA+ +C + DFD
Sbjct: 340 SDFHKLEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEYESIDPSLALALFCETRQDFDTL 399
Query: 422 CARASKLAEESNGAPLFTVTQTHKKP 447
C K S+ P+F + K+P
Sbjct: 400 CEELQKTTLPSSVPPMFEFLE--KRP 423
>gi|53132082|emb|CAG31871.1| hypothetical protein RCJMB04_12m14 [Gallus gallus]
Length = 343
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 100/303 (33%), Positives = 139/303 (45%), Gaps = 48/303 (15%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 198
E D +SR+ +YRK F IG + TSD GWGCMLR QM+ AQAL+ LGR WR
Sbjct: 40 EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWIK 99
Query: 199 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR---------S 249
K Y +L+ F D + S +SIH + Q G G + G W GP + + +
Sbjct: 100 GKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFDT 159
Query: 250 WEALA----------------RCQRAETGLGCQSLPM----AIYVVSGDEDGERGGAPVV 289
W +LA CQ + G + P +Y +E G R +
Sbjct: 160 WSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPAVEADVLYNGYPEEAGVRDKLSL- 218
Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
W P++LL+PL LGL ++N YI TL+ F PQSLG++GGKP
Sbjct: 219 ------------------WKPLVLLIPLRLGLTEINEAYIETLKHCFMMPQSLGVIGGKP 260
Query: 350 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 409
++ Y +G E IYLDPH QP + D S + + + +DPS+A+
Sbjct: 261 NSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCLPDESFHCQHPPCRMSIAELDPSIAVV 320
Query: 410 FYC 412
C
Sbjct: 321 CSC 323
>gi|297265289|ref|XP_002799164.1| PREDICTED: cysteine protease ATG4B-like [Macaca mulatta]
Length = 358
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 109/340 (32%), Positives = 165/340 (48%), Gaps = 44/340 (12%)
Query: 137 LAEFNQDF---SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
AE+ +DF S + I RK G + TSD GWGCMLR QM+ AQAL+ LGR
Sbjct: 13 FAEY-EDFPETSEPVWILGRKYSIFTGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRD 71
Query: 194 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 253
WR +K Y +L+ F D + S +SIH + Q G G + G W GP + + + L
Sbjct: 72 WRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKL 131
Query: 254 ARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFS 302
A + +A+++ V +E V C D+ RHC+ F
Sbjct: 132 AVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFP 183
Query: 303 KGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++GGKP ++ Y +
Sbjct: 184 AGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFI 243
Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP--SLAIGFYCRD 414
G ES+ + P +G L A + + H + ++P S A+GF+C+
Sbjct: 244 GYVGESSSHRVP--------VGLCPLRA-----FCEQVPHARCNIVEPEGSRALGFFCKT 290
Query: 415 KDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
+DDF+D+C + KL+ P+F + + + DVL
Sbjct: 291 EDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVL 330
>gi|410921904|ref|XP_003974423.1| PREDICTED: cysteine protease ATG4C-like [Takifugu rubripes]
Length = 468
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 115/408 (28%), Positives = 173/408 (42%), Gaps = 71/408 (17%)
Query: 108 SSTSDIWLLGVCHKI------AQDEALGDAAGNNGLA----EFNQDFSSRILISYRKGFD 157
S S + LLG C+ Q EA +A+ G+ +F +DF SRI ++YR+ F
Sbjct: 29 SRNSPVLLLGKCYHFKAEEDEGQTEACREASDEEGVMGNVEDFRRDFGSRIWLTYREEFP 88
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE--------- 205
P+ S +TSD GWGCMLR+ QM++AQALL H LGR W + +P D E
Sbjct: 89 PLPGSSLTSDCGWGCMLRAGQMMLAQALLLHFLGRDWTWSGAMSLQPLDTETWTTSAAKR 148
Query: 206 ----------------------------------------YVEILHLFGDSETSPFSIHN 225
+ ++ FGDS ++ F +H
Sbjct: 149 LVASLESSLQASPGPSDPVVSQRQVAGSGEEAGVHTDGGFHRTLVSWFGDSPSAQFGLHR 208
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS-LPMAIYVVSGDEDGERG 284
+++ G A G AG W GP + + R G S + V S D
Sbjct: 209 MVRLGLAMGKRAGEWYGPAVVAHILKKAVEEARDPCLAGISSYVSQDCTVYSADVIDSHK 268
Query: 285 GAPVVCID----DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
+ + +S H S + D +++LVP+ LG EK NP Y + +
Sbjct: 269 ASASAAAERPDVTSSSHNSQPASASPDSRAVIILVPVRLGGEKTNPDYFNLAKSFLSLDY 328
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
+GI+GGKP + Y VG Q++S IY+DPH Q +++ D ++H + +
Sbjct: 329 CIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSPKKMPFT 386
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKK 446
+DPS GFY R DF+ ++L + S P F Q H +
Sbjct: 387 KMDPSCTFGFYSRSAQDFERIKHELTELLQPSAKEKYPAFIFVQGHGR 434
>gi|62899783|sp|Q86ZL5.1|ATG4_PODAS RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|27802993|emb|CAD60696.1| unnamed protein product [Podospora anserina]
Length = 500
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 64/310 (20%)
Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
F DF SRI ++YR GF+ I GD + +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A A+L R GR WR+ +RE I+ LF D +P+SI N + G A G
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIERE---IVCLFADDPRAPYSIQNFVNHGAAACGK 289
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI---YVVSGDEDGERGGAPVVCID 292
G W GP A ARC + + LP ++ + + DG
Sbjct: 290 YPGEWFGP-------SATARCIHSLRVYLTRDLPEVYEDNFMSTANPDGNH--------- 333
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
+ P L+LV LG++K+NP Y L T PQ++GI GG+P +S
Sbjct: 334 ---------------FHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSS 378
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGK---DDLEADTSTYHSDVIRHIHLDSIDPSLAIG 409
Y +G Q + YLDPH +P + + D + + H+ +RH+H++ +DPS+ IG
Sbjct: 379 HYFIGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSCHTRRLRHLHVEDMDPSMLIG 438
Query: 410 FYCRDKDDFD 419
F +D+DD+D
Sbjct: 439 FLIKDEDDWD 448
>gi|402080175|gb|EJT75320.1| cysteine protease ATG4 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 468
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 105/313 (33%), Positives = 145/313 (46%), Gaps = 56/313 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF SRI +SYR GF PI S T+D GWGCM+R+
Sbjct: 128 FLDDFESRIWVSYRSGFPPIPRSTDPAATSRMSFAMRLKTMTDQQAAFTTDSGWGCMIRT 187
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A LL HRLGR WR+ + +R+ +L LF D +P+SIH ++ G A G
Sbjct: 188 GQSLLANTLLSHRLGRGWRRGEKSDEERK---LLSLFADDPRAPYSIHKFVEHGAAKCGK 244
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R EALA + +Y G P V D
Sbjct: 245 YPGEWFGPSATARCIEALANTNEKT---------LRVYST--------GDLPDVYEDS-- 285
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
V + P L+LV LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 286 -FMEVARPDGKTFHPTLILVSTRLGIDKINQVYWESLTATLQMPQSVGIAGGRPSSSHYF 344
Query: 356 VGVQE------ESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 406
VG Q + YLDPH +P + D +D + H+ +R +H+ +DPS+
Sbjct: 345 VGAQRSDEDQGSNLFYLDPHHTRPALPYFDDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 404
Query: 407 AIGFYCRDKDDFD 419
IGF D++D++
Sbjct: 405 LIGFLITDEEDWE 417
>gi|119195519|ref|XP_001248363.1| cysteine protease atg4 [Coccidioides immitis RS]
gi|303321428|ref|XP_003070708.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
SOWgp]
gi|121769827|sp|Q1E5M9.1|ATG4_COCIM RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|240110405|gb|EER28563.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
SOWgp]
gi|320040173|gb|EFW22106.1| cysteine protease atg4 [Coccidioides posadasii str. Silveira]
gi|392862420|gb|EAS36938.2| cysteine protease atg4 [Coccidioides immitis RS]
Length = 432
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 138/297 (46%), Gaps = 50/297 (16%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF S+ +YR F I S+ T+D GWGCM+RS
Sbjct: 105 FLDDFESKFWFTYRSNFPAIPKSRDPDTPLALTLSVRLRSQFLDTHGFTADTGWGCMIRS 164
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A AL LGR WR+ + +E E+L LF D+ +PFSIH + G A G
Sbjct: 165 GQSLLANALSILNLGRDWRRGSKI---KEECELLSLFADNPQAPFSIHRFVDYGASACGK 221
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R EAL+ C+ + +YV+S D + D
Sbjct: 222 HPGEWFGPSATARCIEALSN--------ECKHTDLNVYVMSDGSDVHEDQFRQIAGPDGI 273
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
R P L+L+ + LG+E V P Y LR +PQS+GI GG+P +S Y
Sbjct: 274 R-------------PTLILLGVRLGIESVTPVYWEALRAIIRYPQSVGIAGGRPSSSLYF 320
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGF 410
+GVQ YLDPH +P ++ D + TYH+ +R +H+ +DPS+ IGF
Sbjct: 321 IGVQGPYFFYLDPHHTRPAVSWNPDSTLSPENLDTYHTRRLRRLHIREMDPSMLIGF 377
>gi|157818033|ref|NP_001101418.1| cysteine protease ATG4C [Rattus norvegicus]
gi|149044549|gb|EDL97808.1| similar to APG4 autophagy 4 homolog C [Rattus norvegicus]
Length = 458
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 115/409 (28%), Positives = 172/409 (42%), Gaps = 80/409 (19%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE A+ D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDESKVLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG------------------------- 191
I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 96 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIESSDSDSWTSNTIH 155
Query: 192 -------------RPWRKPL--------QKPFDRE------YVEILHLFGDSETSPFSIH 224
R R P + P D + +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELRTPAVSLKETSGKHPDDHAVQSEIYHRQIISWFGDSPVAVFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + G A +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNIDYLEFVKGVLSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y G Q++S IY+DPH Q +++ D + T+H + + +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
S IGFYCR+ DF+ +K+ + S+ PLFT H + + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSRDFDFT 429
>gi|30109219|gb|AAH41862.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
gi|119623096|gb|EAX02691.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
[Homo sapiens]
gi|119623098|gb|EAX02693.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
[Homo sapiens]
Length = 321
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 92/291 (31%), Positives = 147/291 (50%), Gaps = 32/291 (10%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 1 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60
Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
G + G W GP A+ W +LA + + C+ LP+ S
Sbjct: 61 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 115
Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
D G+R + + + S +CS W P+LL+VPL LG+ ++NP Y+ +
Sbjct: 116 ADTAGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 168
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
F PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ D + +
Sbjct: 169 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 228
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 229 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 278
>gi|74665877|sp|Q4U3V5.1|ATG4_CRYPA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|66576169|gb|AAY51673.1| putative cysteine protease Atg4 [Cryphonectria parasitica]
Length = 459
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 153/332 (46%), Gaps = 60/332 (18%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 163
+A DE L DA F DF SR+ ++YR F+PI S
Sbjct: 109 LAYDELLEDAGWP---IAFLDDFESRVWMTYRSEFEPISKSNDPRASAALSFAMRLRTLA 165
Query: 164 ----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 219
+SD GWGCM+RS Q L+A L+ +LGR WR+ R+ EIL F D +
Sbjct: 166 DQGGFSSDTGWGCMIRSGQSLLANTLVICQLGRDWRRGKAA---RQEREILARFADDPRA 222
Query: 220 PFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 278
P+S+HN ++ G A G G W GP A R +ALA + + +Y
Sbjct: 223 PYSLHNFVRHGAVACGKFPGEWFGPSATARCIQALANSNESS---------LRVYST--- 270
Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
G P V D +V + P L+LV LG++K+N Y L T
Sbjct: 271 -----GDLPDVYEDS---FMAVAKPDGETFHPTLILVGTRLGIDKINQVYWEALTATLQM 322
Query: 339 PQSLGIVGGKPGASTYIVGVQEES--------AIYLDPHDVQPVINIGKDDLEA---DTS 387
PQS+GI GG+P AS Y +G Q YLDPH +P + +D + D +
Sbjct: 323 PQSVGIAGGRPSASHYFIGAQRSGDAYEPGSYLFYLDPHCTRPALPFHEDVDQYTSDDIN 382
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
T H+ +R +H+ +DPS+ IGF +D+DD+D
Sbjct: 383 TCHTRRLRRLHVRDMDPSMLIGFLIKDEDDWD 414
>gi|170036509|ref|XP_001846106.1| Autophagy-specific protein [Culex quinquefasciatus]
gi|167879174|gb|EDS42557.1| Autophagy-specific protein [Culex quinquefasciatus]
Length = 379
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 115/336 (34%), Positives = 172/336 (51%), Gaps = 30/336 (8%)
Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
H+I L +A L + +D SR+ +YR+GF PIG S+ TSD GWGCMLR QM
Sbjct: 13 HRIRCIFGLSNALETLDLDQIRRDVQSRLWCTYRRGFVPIGGSQHTSDKGWGCMLRCGQM 72
Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAG 238
++AQALL LGR W + D Y+ I++ F D++ +PFS+H + G++ G
Sbjct: 73 VLAQALLQLHLGRDWEWTAETR-DETYLRIVNRFEDNKAAPFSLHQIALTGESSEEKRVG 131
Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
W GP + + + L + + ++V + D+ C
Sbjct: 132 EWFGPNTVAQVLKKLVKFD--------DWCSVVVHVALD---------STLATDEVVELC 174
Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
S W P+LL++PL LGL ++NP Y+ L+ F + G++GG+P + Y +G
Sbjct: 175 EDKSDAGTSWKPLLLIIPLRLGLSEINPIYVAGLKKCFELAGNCGMIGGRPNQALYFIGY 234
Query: 359 QEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
+ A++LDPH VQ NIG D+ E D S +H R I+ ++DPSLA+ F C
Sbjct: 235 VGDEALFLDPHTVQRSGNIGDKTGLDEREMDES-FHQRYARRINFKAMDPSLALCFLCAT 293
Query: 415 KDDFDDFCARASKLAEESNGAP---LFTVTQTHKKP 447
+ +FDD AR AE+ NG LF VT+T + P
Sbjct: 294 RTEFDDLLAR---FAEDLNGGSCQGLFEVTKTRQAP 326
>gi|342877133|gb|EGU78640.1| hypothetical protein FOXB_10826 [Fusarium oxysporum Fo5176]
Length = 449
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/327 (33%), Positives = 155/327 (47%), Gaps = 53/327 (16%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 159
+A D+ D +G F DF SRI ++YR FDPI
Sbjct: 99 LAYDDQSNDGGWPSG---FITDFESRIWMTYRSEFDPIPRSTNPQATSSLSLSMRLKSQL 155
Query: 160 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
GD S +SD GWGCM+RS Q L+A + RLGR WR Q E IL F D
Sbjct: 156 GDQSPFSSDSGWGCMIRSGQSLLANTIALVRLGRDWR---QGQSLEEECRILKDFADDPR 212
Query: 219 SPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
+P+SIH+ ++ G A G G W GP A R +ALA +I V S
Sbjct: 213 APYSIHSFVRHGASACGKYPGEWFGPSATARCIQALANSHEP-----------SIRVYST 261
Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
G P V DD + + G+A + P L+LV LGL+K+ P Y L
Sbjct: 262 ------GDGPDVYEDDFMKIAN--PTGEA-FHPTLVLVGTRLGLDKITPVYWEALIAALQ 312
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVI 394
PQS+GI GG+P +S Y +G Q YLDPH +P + ++ ++ + + H+ +
Sbjct: 313 MPQSVGIAGGRPSSSHYFIGSQGSFLFYLDPHHTRPALPYHENPMDYTSEEIESCHTARL 372
Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDF 421
R IH+ +DPS+ IGF R ++D+ D+
Sbjct: 373 RRIHVREMDPSMLIGFLIRSEEDWQDW 399
>gi|440638438|gb|ELR08357.1| hypothetical protein GMDG_03152 [Geomyces destructans 20631-21]
Length = 448
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/307 (32%), Positives = 148/307 (48%), Gaps = 49/307 (15%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
F DF S++ SYR GF DP S ++ SD GWGCM+RS
Sbjct: 108 FLDDFESKLRFSYRTGFPVIPRSEDPKASSTMSFSVRLRSQLSDQGGFSSDTGWGCMIRS 167
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A +++ RL R WR+ + + +RE I+ LF D +P+SIH ++ G +A G
Sbjct: 168 GQSLLANSMVILRLSRGWRRGVGRDKERE---IVSLFADDPRAPYSIHKFVEHGAEACGK 224
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R + LA+ +S + +Y+ D + G
Sbjct: 225 YPGQWFGPSATARCIQELAKRH--------ESADVRVYITGDGSDVYKDG---------- 266
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
SV ++ P L+LV LG++KV P Y L+ + PQS+GI GG+P +S Y
Sbjct: 267 -FMSVAKPDGVNFKPTLILVGTRLGIDKVTPVYWEALKASLQMPQSVGIAGGRPSSSHYF 325
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
VGVQ YLDPH I D E A+ + H+ +R + + +DPS+ IGF
Sbjct: 326 VGVQGSHFFYLDPHQTMAAIPFHTDVDEYTPAEIDSCHTRRLRRLDIKEMDPSMLIGFLI 385
Query: 413 RDKDDFD 419
RD+ D++
Sbjct: 386 RDEKDWE 392
>gi|50543736|ref|XP_500034.1| YALI0A13277p [Yarrowia lipolytica]
gi|62899740|sp|Q6CH28.1|ATG4_YARLI RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49645899|emb|CAG83963.1| YALI0A13277p [Yarrowia lipolytica CLIB122]
Length = 545
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 124/393 (31%), Positives = 167/393 (42%), Gaps = 98/393 (24%)
Query: 139 EFNQDFSSRILISYRKGF--------------------------DPIGDSKITSDVGWGC 172
+F D SRI +SYR GF DP G TSDVGWGC
Sbjct: 64 DFLADVQSRIWLSYRTGFPLIPKSDGSGTIHLGKLKNMIRGGGFDPRG---YTSDVGWGC 120
Query: 173 MLRSSQMLVAQALLFHRLGRPWR----------------------------KPLQKPFDR 204
M+R+SQ L+A ALLF LGR WR K +
Sbjct: 121 MIRTSQSLLANALLFRHLGRGWRWNKGDDFVYLSEGNTESRGGESRNGGANKEQETAVSE 180
Query: 205 EYV----EILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRA 259
E I+ F DS SPFSIH ++ G KA AG W GP A S AL
Sbjct: 181 ETAVSEETIISWFLDSPDSPFSIHKFVRHGEKACSTPAGDWFGPSAAGSSIYAL------ 234
Query: 260 ETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
C P + +Y +G GG V D+ + G P+L+L
Sbjct: 235 -----CNEFPDSGLKVYY-----NGNGGGD--VYEDE------LLETG----FPLLVLCG 272
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
L LG++ VNP Y +LR + PQS+GI GG+P S Y G Q E YLDPH +P +
Sbjct: 273 LRLGIDNVNPIYWDSLRQMLSLPQSVGIAGGRPFTSHYFFGFQGEQLFYLDPHQPKPAVK 332
Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 436
+ DT+++HS I +HL +DPS+ +GFY + D++ F + E+++
Sbjct: 333 T----TDKDTTSFHSSRIWKLHLKEMDPSMLVGFYITSEADWETFKGSLTASKEKTSSQI 388
Query: 437 LFTVTQTHKKP-VNHSDVLGETGGVPEDDSLGV 468
+ H P + D GG +DD + V
Sbjct: 389 VHIHPSRHNIPSFDEEDEYVSIGGASDDDFVDV 421
>gi|158296556|ref|XP_316946.4| AGAP008497-PA [Anopheles gambiae str. PEST]
gi|157014766|gb|EAA12240.4| AGAP008497-PA [Anopheles gambiae str. PEST]
Length = 389
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 117/346 (33%), Positives = 170/346 (49%), Gaps = 34/346 (9%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + +W+LG + + D L QD SR+ +YR+GF PIG++++T
Sbjct: 21 IPKTNDTVWILGKQYNASDD-----------LEAIRQDVQSRLWCTYRRGFVPIGNTQLT 69
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D GWGCMLR QM++AQALL LGR W + D Y+ I++ F DS+ +PFS+H
Sbjct: 70 TDKGWGCMLRCGQMVLAQALLQLHLGRDWVWEAETR-DDIYLNIVNRFEDSKQAPFSLHQ 128
Query: 226 L-LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
+ L + G W GP + + + L + C+ + I+V +
Sbjct: 129 IALMGDSSEEKRIGEWFGPNTVAQVLKKLVKFDD-----WCR---LVIHVALDN------ 174
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D+ C V K W P+LL++PL LGL +VNP YI L+ F P S G+
Sbjct: 175 ---TVATDEIVELC-VDKKEPEAWKPLLLIIPLRLGLSEVNPIYIEGLKKCFQLPGSCGM 230
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDS 401
+GG+P + Y +G A+YLDPH VQ V +G A+ T+H I S
Sbjct: 231 IGGRPNQALYFIGYVGGEALYLDPHTVQRVGTVGSKQDPAEQELDETFHQRYASRISFTS 290
Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
+DPSLA+ F C + FD AR + LF VT+T + P
Sbjct: 291 MDPSLAVCFLCVSRQQFDQLVARFNDSVNGGTSQALFEVTKTRQAP 336
>gi|332029697|gb|EGI69576.1| Cysteine protease ATG4B [Acromyrmex echinatior]
Length = 383
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 108/353 (30%), Positives = 168/353 (47%), Gaps = 48/353 (13%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
I + +W+LG + ++ L +D S++ +YRKGF PIG +S
Sbjct: 16 IPPTDEPVWILGRKYNAIKE-----------LDAIRRDIRSKLWFTYRKGFVPIGGCNST 64
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
TSD GWGCMLR QM++AQAL+ LG+ W+ + + + Y++IL F D + FSI
Sbjct: 65 FTSDKGWGCMLRCGQMVLAQALITLHLGKDWQW-MPETKNNTYLKILRRFEDKRAAAFSI 123
Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
H + G + G G W GP + + + L + + I+V +
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTIAQVLKKLIVYDEWSS--------LTIHVALDN----- 170
Query: 284 GGAPVVCIDDASRHCSVFS------------KGQADWTPILLLVPLVLGLEKVNPRYIPT 331
+ ++D R C V + + W P+LLL+PL LGL ++NP YI
Sbjct: 171 ----TLIVNDILRQCRVEGGVTAEADGEIPLRAPSQWKPLLLLIPLRLGLSEINPVYING 226
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTS 387
L+ +F QSLG++GGKP + Y +G + IYLDPH Q I ++++E D S
Sbjct: 227 LKTSFKISQSLGVIGGKPNLALYFIGCVGDEVIYLDPHTTQKSGSIEDKISEEEIEMDIS 286
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
YH I + +DPS+A+ F+C + +F C + PLF +
Sbjct: 287 -YHCKSASRIPITGMDPSVALCFFCATEKEFKSLCKSMQEELILPEKQPLFEL 338
>gi|403356037|gb|EJY77606.1| Cysteine protease family C54 putative [Oxytricha trifallax]
gi|403376523|gb|EJY88241.1| Cysteine protease family C54 putative [Oxytricha trifallax]
Length = 480
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 159/312 (50%), Gaps = 41/312 (13%)
Query: 144 FSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFH----RLGRPWRKPL 198
F S +YR + PIG S SD GWGCM+R+ QML+ QA++ H L + + +
Sbjct: 154 FKSVTWFTYRNELELPIGSSTYHSDAGWGCMVRTGQMLLFQAMMRHVFEDNLKYEYIEKI 213
Query: 199 QKPFDREYVEILHLF---GDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
+ + EY+ +L LF G+ + SP+SI N+ G G W GP A+ + L +
Sbjct: 214 TE-YREEYLNLLRLFQDNGEGQFSPYSIQNIAFQGLKIDRKPGDWYGPQAISIVLKRLTK 272
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILLL 314
+ P+ + + VC++ + + +V + DWT + ++
Sbjct: 273 IYK----------PVKQFTM------------YVCLE-GNIYLNVIQEKSKDWTQSVFIV 309
Query: 315 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES--AIYLDPHDVQ 372
+PL LGL + P Y+ +++ FTFPQ++GI GG+ ++ Y +G+ + S IYLDPH VQ
Sbjct: 310 IPLRLGLNYIEPEYLSSVKKVFTFPQNVGIAGGRENSALYFIGISDSSNNLIYLDPHLVQ 369
Query: 373 ---PVINIGKDD-LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
P N+ ++ S++H + + L+ + S+AIGFY RD +DF DF R L
Sbjct: 370 KSVPTCNMQTNEQFYQYESSFHCTKFKKMPLNRMCTSVAIGFYIRDYNDFLDFQTRIKSL 429
Query: 429 AEESNGAPLFTV 440
+ N +FTV
Sbjct: 430 SSGENS--IFTV 439
>gi|358336800|dbj|GAA27956.2| autophagy-related protein 4 [Clonorchis sinensis]
Length = 507
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 105/304 (34%), Positives = 148/304 (48%), Gaps = 52/304 (17%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-----KPLQKPFDREYVEILHLFGD--SE 217
TSD GWGCM+RS QML+AQ L+ H LGR WR P++ P D + +++ F D S+
Sbjct: 183 TSDSGWGCMIRSGQMLLAQTLMIHLLGRDWRAFRGTSPIKTPEDHLHRQLIRWFHDCWSQ 242
Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMC-----------RSWEALARCQ--------- 257
SPFS+H L+QA G GSW GP +C R +E LAR
Sbjct: 243 ESPFSLHRLVQAS---GQLPGSWFGPATLCSALVKVMSDASRRFEELARVHIYWVRDRVI 299
Query: 258 -RAET-----GLGCQSLPMAIYVVSGDEDGERGGA-------PVVCIDD---ASRHCSVF 301
R E G + P + E+ + + P + D +S ++F
Sbjct: 300 YREEIMNLARGQPVRRKPGRLNFTDFSENFQHCCSQECSPPIPPTYLQDGIQSSPSTTLF 359
Query: 302 SKGQADWTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
++LL+P+ LGL+K ++ RY+P + P +GI+GG+P S YI+G Q
Sbjct: 360 PSHA-----VILLLPIRLGLDKRIDARYVPMVCRLVRDPCFVGIIGGRPRHSIYILGCQN 414
Query: 361 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
I+LDPH QPV+ D E + T+H V R I +DPS A+GFYCR + D D
Sbjct: 415 TQLIHLDPHFTQPVVRNVVDSEEFNVKTWHCLVPRVIEAAKLDPSCAVGFYCRSRGDLSD 474
Query: 421 FCAR 424
R
Sbjct: 475 LLER 478
>gi|313228003|emb|CBY23152.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 102/289 (35%), Positives = 143/289 (49%), Gaps = 29/289 (10%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
L + DF SR+ +YR+ F IG S TSD GWGCMLR+ QMLVA+ LL RLGR +
Sbjct: 39 LEDIQGDFQSRLWFTYRRNFASIGGSGPTSDQGWGCMLRAGQMLVAECLLRQRLGRNYVW 98
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
D Y EIL LF D+ ++ S+ + L A A G W GP M + L R
Sbjct: 99 SESSIEDERYTEILELFRDTHSAELSLQQIALTGATAEKRAVGEWFGPNTMA---QVLKR 155
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
++ +SL + V VV ++D S + + G+ TP++L++
Sbjct: 156 ITKS------RSLGFGVTVAMDS---------VVSVEDVS--AEIINGGKP--TPLVLMI 196
Query: 316 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA----IYLDPHDV 371
PL LGL VN Y+ L++ +GI+GGKP + Y VG QE +YLDPH
Sbjct: 197 PLRLGLNSVNEIYVNPLKIFLASKYCVGIMGGKPNQAHYFVGYQETVEDTWLLYLDPHTT 256
Query: 372 Q--PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
Q PV E + H+D + I +DPSLA+GF+ ++F
Sbjct: 257 QQSPVSVNNNMPFEQFDKSLHTDKLCWIKALKLDPSLAVGFFFNTVEEF 305
>gi|355755452|gb|EHH59199.1| Cysteine protease ATG4D, partial [Macaca fascicularis]
Length = 427
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 172/387 (44%), Gaps = 68/387 (17%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 37 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 86
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 87 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 146
Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
R P +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 147 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 202
Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
+A R + + +YV + A +V D + A+
Sbjct: 203 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 249
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G
Sbjct: 250 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGXXXXXXXXXX 309
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 310 XXXCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 367
Query: 428 LAEESNGA---PLFTVTQTHKKPVNHS 451
+ S+ P+FT+ + H + +HS
Sbjct: 368 VLGSSSATERYPMFTLAEGHAQ--DHS 392
>gi|56118282|ref|NP_001007883.1| cysteine protease ATG4C [Xenopus (Silurana) tropicalis]
gi|61211764|sp|Q68EP9.1|ATG4C_XENTR RecName: Full=Cysteine protease ATG4C; AltName:
Full=Autophagy-related protein 4 homolog C
gi|51258902|gb|AAH80152.1| apg4c protein [Xenopus (Silurana) tropicalis]
gi|89269108|emb|CAJ81923.1| APG4 autophagy 4 homolog C (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
Length = 450
Score = 154 bits (390), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 110/402 (27%), Positives = 168/402 (41%), Gaps = 95/402 (23%)
Query: 111 SDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFDP 158
S ++LLG C+ +++ D N+G + EF +DF SRI ++YR+ F
Sbjct: 39 SPVFLLGKCYHFKYEDSSVTSDGGSNSGSESKEDLSGNVDEFRKDFISRIWLTYREEFPQ 98
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW------------------------ 194
I S T+D GWGC LR+ QML+AQ L+ H LGR W
Sbjct: 99 IETSSWTTDCGWGCTLRTGQMLLAQGLIVHFLGRDWTWTEALDIFSSESEFWTANTARKL 158
Query: 195 ------------------RKPLQKPFDREYVEILH-----LFGDSETSPFSIHNLLQAGK 231
++PL + E H F D + F +H L++ GK
Sbjct: 159 TPSLETSFSENNECVSSNKQPLHNCDKKSNSEDFHQKIISWFADYPLAYFGLHQLVKLGK 218
Query: 232 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
G AG W GP + L R E+ D E G +
Sbjct: 219 NSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYVA 257
Query: 292 DDASRHCSVFSKGQADW-------TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
D C+++S D +++LVP+ LG E+ N Y ++ + +GI
Sbjct: 258 QD----CTIYSADVYDLQCNKGTEKAVVILVPVRLGGERTNMEYFEFVKGILSLEFCIGI 313
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y VG Q++S IY+DPH Q +++ + + ++H + + +DP
Sbjct: 314 IGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSVKNFPLE--SFHCPSPKKMSFKKMDP 371
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTH 444
S IGFYCR+ +F+ +K+ + S PLFT H
Sbjct: 372 SCTIGFYCRNAREFEKAAEELTKVLKSSTKQNYPLFTFVNGH 413
>gi|327270876|ref|XP_003220214.1| PREDICTED: cysteine protease ATG4C-like [Anolis carolinensis]
Length = 459
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 118/437 (27%), Positives = 182/437 (41%), Gaps = 84/437 (19%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAA-GNN----------GLAEFNQDFSSRILISYRKGF 156
S S ++LLG C+ DE + G+N + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVFLLGKCYHFKTDEPTEQSPNGSNYDVTEEEVSRNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------------------K 196
I S +T+D GWGC LR+ QML+AQ L+ H LGR W K
Sbjct: 96 PQIKGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWPDALVNENPESESWTSHTVK 155
Query: 197 PLQKPFDREYV--------------------------------EILHLFGDSETSPFSIH 224
L F+ + +I+ FGDS + F +H
Sbjct: 156 KLTASFEASLIGEKEFKNQSIPPRQIRKRDWGKRESRDEHYHRKIVSWFGDSPLANFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ G G AG W GP + L R + E + + +YV
Sbjct: 216 RLIEYGNKSGKMAGDWYGPAVVAH----LLR-KAVEEAKDPELQGITVYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D CS+ + +++L+P+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYKSDVVEMQCSLKDSEKPGAKSVIILIPVRLGGERTNMEYLEFVKGILSLEYCIGI 322
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
VGG+P S Y G Q++S IY+DPH Q +++ + + ++H + + +DP
Sbjct: 323 VGGRPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKNFPLE--SFHCPSPKKMSFKKMDP 380
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNH--SDVLGETGGV 460
S IG YC + F+ +K+ + S+ PLFT H K + S V E
Sbjct: 381 SCTIGLYCPNVQGFERASEEITKILKASSKEKYPLFTFVNGHSKDYDFMMSPVQEEKALF 440
Query: 461 PEDDS--LGVMSMNDAV 475
ED++ L S D V
Sbjct: 441 SEDENKKLKRFSTEDFV 457
>gi|357528776|sp|Q5B7L0.2|ATG4_EMENI RecName: Full=Cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|259485537|tpe|CBF82642.1| TPA: Cysteine protease atg4 (EC 3.4.22.-)(Autophagy-related protein
4) [Source:UniProtKB/Swiss-Prot;Acc:Q5B7L0] [Aspergillus
nidulans FGSC A4]
Length = 402
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/390 (30%), Positives = 178/390 (45%), Gaps = 68/390 (17%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGV-----CHKIAQDEALGDAAGNN--------GLA 138
+RI + + P S IW LG C + DE+ G G
Sbjct: 11 KRIIQYIWDPEPKNDEEPGSPIWCLGTRYPPQCVEETADESRNPDHGQQQNTNTSAPGWP 70
Query: 139 E-FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 174
E F DF S+I ++YR F PI TSD GWGCM+
Sbjct: 71 EAFLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMI 130
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 233
RS Q L+A ++ LGR WR+ + E ++L LF DS +PFSIH+ ++ G +
Sbjct: 131 RSGQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFC 187
Query: 234 GLAAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + LA R ++ + +Y+ + D + V D
Sbjct: 188 GKHPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRD 238
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
+ KG P L+L+ L LG++++ Y L+ PQS+GI GG+P AS
Sbjct: 239 E---------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSAS 287
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
Y V VQ YLDPH+ +P + + E + +TYH+ +R +++ +DPS+ IGF
Sbjct: 288 HYFVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDPSMLIGF 347
Query: 411 YCRDKDDFDDFCARASKLAEESNGAPLFTV 440
RD+DD++D+ AR L G P+ T+
Sbjct: 348 LIRDEDDWEDWKARIMSL----EGKPIITI 373
>gi|195159572|ref|XP_002020652.1| GL15485 [Drosophila persimilis]
gi|194117602|gb|EDW39645.1| GL15485 [Drosophila persimilis]
Length = 409
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 108/345 (31%), Positives = 166/345 (48%), Gaps = 42/345 (12%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D GWGCMLR QM++AQAL+ LGR W + D Y++I++ F D S +SIH
Sbjct: 92 TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ G++ A G W+GP + + + L L + ++V
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMDS------- 195
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V +DD C +G A W P+LL++PL LG+ +NP YIP L+ S G++
Sbjct: 196 --TVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 402
GG+P + Y +G E+ +YLDPH Q +G+ + TYH + ++
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQKTGVVGQKTSSGEQEHDETYHQKHAARLSFSAM 309
Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTH 444
DPSLA+ F C+ D F KL +E G LF ++QT
Sbjct: 310 DPSLAVCFLCKTSDSFQQL---LDKLRQEVLGMCSPALFEISQTR 351
>gi|125986465|ref|XP_001356996.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
gi|54645322|gb|EAL34062.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
Length = 409
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 104/342 (30%), Positives = 163/342 (47%), Gaps = 36/342 (10%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +D+W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D GWGCMLR QM++AQAL+ LGR W + D Y++I++ F D S +SIH
Sbjct: 92 TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ G++ A G W+GP + + + L L + ++V
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMDS------- 195
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
V +DD C +G A W P+LL++PL LG+ +NP YIP L+ S G++
Sbjct: 196 --TVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 402
GG+P + Y +G E+ +YLDPH Q +G+ + TYH + ++
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQRTGVVGQKTSSGEQEHDETYHQKHAARLSFSAM 309
Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
DPSLA+ F C+ D F + + LF ++QT
Sbjct: 310 DPSLAVCFLCKTSDSFQQLLEKLRQEVLGMCSPALFEISQTR 351
>gi|341903727|gb|EGT59662.1| CBN-ATG-4.1 protein [Caenorhabditis brenneri]
Length = 433
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/329 (29%), Positives = 150/329 (45%), Gaps = 59/329 (17%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 224
TSD GWGCMLR +QML+ + LL +GR + ++ Y +IL +F D + + +SIH
Sbjct: 49 TSDQGWGCMLRCAQMLLGEVLLRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIH 107
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQ-SLPMAIYV 274
+ Q G G W GP + W +A + L + +L MA
Sbjct: 108 QIAQMGVTEGKEISKWFGPNTAAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTY 167
Query: 275 VSGD------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
S D E+G+ +H + + + +W P+LL++PL LGL +N Y
Sbjct: 168 PSEDAVKLIMENGQ-----------VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCY 216
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV-------------- 374
+P ++ F PQ +GI+GGKP + Y VG+ YLDPH +P
Sbjct: 217 LPAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTES 276
Query: 375 ----INIGK-DDLE------------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
N + +DLE D STYH +++ + +SIDPSLA+ +C ++D
Sbjct: 277 EQHDTNFSELEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESRED 336
Query: 418 FDDFCARASKLAEESNGAPLFTVTQTHKK 446
FD+ C K ++ P+F + K
Sbjct: 337 FDNLCQELQKTTLPASKPPMFEFLEKRPK 365
>gi|346975631|gb|EGY19083.1| peptidase family C54 protein [Verticillium dahliae VdLs.17]
Length = 449
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 149/327 (45%), Gaps = 52/327 (15%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 163
+A DEA+ G + F DF S+ ++YR F+PI S
Sbjct: 98 LAYDEAMNQDGG--WPSAFLDDFESKFWMTYRSDFEPIAKSTDPRAASVLSLSMRIKSQF 155
Query: 164 -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
+SD GWGCM+RS Q L+A A+ LGR WR+ + +R+ +L F D
Sbjct: 156 MDQAGYSSDSGWGCMIRSGQSLLANAMAVLDLGRDWRRGVAAEKERQ---LLSKFADDPK 212
Query: 219 SPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
+P+SIH +Q G A G G W GP A R +AL + +Y
Sbjct: 213 APYSIHRFVQHGAVACGKYPGEWFGPSATARCIQALVNANEPH---------LRVYST-- 261
Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
G P V D R + + P L+LV LG++K+ P Y L
Sbjct: 262 ------GDGPDVYED---RFFDIAKPSGETFHPTLILVGTRLGIDKITPVYWDALIAALQ 312
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVI 394
PQS+GI GG+P +S Y +G Q YLDPH + + +D +AD + H+ +
Sbjct: 313 MPQSIGIAGGRPSSSHYFIGAQGSFLFYLDPHHTRTALPYYQDPTLYAQADVDSVHTRRL 372
Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDF 421
R +H+ +DPS+ IGF D+DD+D++
Sbjct: 373 RRLHVREMDPSMLIGFVIHDEDDWDEW 399
>gi|308491308|ref|XP_003107845.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
gi|308249792|gb|EFO93744.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
Length = 518
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/318 (32%), Positives = 155/318 (48%), Gaps = 49/318 (15%)
Query: 130 DAAG-NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
DA G ++G +F D+ SR+ I+YR F P+ ++ T+D GWGCM+R++QM+VAQA++ +
Sbjct: 159 DANGVSSGFEDFCSDYYSRLWITYRTDFAPLLNTDTTTDCGWGCMIRTTQMMVAQAIMLN 218
Query: 189 RLGRPWRKPLQKP-----------FDREYVE---ILHLFGDSETSPFSIHNLLQ--AGKA 232
R GR WR +K FDRE ++ IL LF D +SP IH +++ A +
Sbjct: 219 RFGREWRFVRRKKSYVTINGEETDFDREKIKEWMILKLFEDKPSSPLGIHRMVEISAKEK 278
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
A GSW P EA+ ++A L +I ++GD A + I
Sbjct: 279 GKKAVGSWYSPS------EAVFIMKKA--------LTESISPLTGD------TAMYLSI- 317
Query: 293 DASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
D H +W L+LV +V LG ++NP Y+P L F+ LG+ GG+P
Sbjct: 318 DGRVHIRDIEVETKNWMKTLILVIVVRLGAAELNPIYVPHLMRLFSMESCLGVTGGRPDH 377
Query: 352 STYIVGVQEESAIYLDPHDVQPVINI----------GKDDLEADTSTYHSDVIRHIHLDS 401
S + VG + IYLDPH I I K + +YH ++ +H
Sbjct: 378 SCWFVGFYGDQIIYLDPHVAHEYIPIDMNFNVNMTDNKKSKKCPERSYHCRLLSKMHFLD 437
Query: 402 IDPSLAIGFYCRDKDDFD 419
+DPS A+ F ++ FD
Sbjct: 438 MDPSCALCFRFESREQFD 455
>gi|296415785|ref|XP_002837566.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633439|emb|CAZ81757.1| unnamed protein product [Tuber melanosporum]
Length = 409
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 101/313 (32%), Positives = 150/313 (47%), Gaps = 49/313 (15%)
Query: 140 FNQDFSSRILISYRKGFDPI---------------------GDSKITSDVGWGCMLRSSQ 178
F +DF S + ++YR F PI TSD GWGCM+RS Q
Sbjct: 86 FLEDFESTLWMTYRSDFKPIPRVADYNDKLTFLTSIRSHLDKAEGFTSDSGWGCMIRSGQ 145
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAA 237
++A AL RLGR WR+ + KP E +L LF D +PFSIH ++ G+ G
Sbjct: 146 AVIANALAHLRLGRGWRRGM-KP--EEEKRLLALFADDPRAPFSIHKFVRHGEVECGKNP 202
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 297
G W GP A A C +A T + + +Y + ++ E V ++
Sbjct: 203 GEWFGP-------SAAAMCIQALTH-AYEPAGLRVYQTNSNDLYEEDFRKVAVVN----- 249
Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
VF P L+L + LG+E++ Y L PQ++GI GG+P +S Y +
Sbjct: 250 -GVFK-------PTLVLAGIRLGIERITNIYYEPLAACLRMPQTVGIAGGRPSSSHYFIA 301
Query: 358 VQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
VQ E+ YLDPH +P++ +D E + T H+ IR +H+ +DPS+ I F RD
Sbjct: 302 VQGENFFYLDPHTCRPILPFKENPQDYTEEEVDTCHTRRIRRLHIREMDPSMLIAFLIRD 361
Query: 415 KDDFDDFCARASK 427
+ D++D+ R S+
Sbjct: 362 EADWEDWQRRISE 374
>gi|17544636|ref|NP_502208.1| Protein ATG-4.2 [Caenorhabditis elegans]
gi|5824904|emb|CAB54515.1| Protein ATG-4.2 [Caenorhabditis elegans]
Length = 521
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/340 (31%), Positives = 159/340 (46%), Gaps = 56/340 (16%)
Query: 111 SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 170
+D+ LG + + DE+ +G F D+ SR+ I+YR F + D+ T+D GW
Sbjct: 146 NDVVFLGRRYSTSVDES----GLRSGFENFCSDYYSRLWITYRTDFPALLDTDTTTDCGW 201
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQK-----------PFDREYVE---ILHLFGDS 216
GCM+R++QM+VAQA++ +R GR WR +K FDRE ++ IL LF D
Sbjct: 202 GCMIRTTQMMVAQAIMVNRFGRDWRFTRRKRSHVAAHGDEDDFDREKIQEWMILKLFEDK 261
Query: 217 ETSPFSIHNLL---QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 273
T+P IH ++ GK A GSW P EA+ ++A L S P+
Sbjct: 262 PTAPLGIHKMVGIAAMGKGKK-AVGSWYSPS------EAVFIMKKA---LTESSSPLT-- 309
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTL 332
G ++ D H +W L+LV +V LG ++NP Y+P L
Sbjct: 310 ----------GNTAMLLSIDGRVHIRDIEVETKNWMKKLILVIVVRLGAAELNPIYVPHL 359
Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH--------DVQPVINI----GKD 380
F LGI GG+P S++ VG + IYLDPH D+ P N+ K
Sbjct: 360 MRLFAMESCLGITGGRPDHSSWFVGYYGDQIIYLDPHVAHEYIPIDINPNTNVVDSDSKK 419
Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
+ +YH ++ +H +DPS A+ F ++ FD+
Sbjct: 420 AKKCPEKSYHCRLLSKMHFFDMDPSCALCFQFESREQFDN 459
>gi|389637385|ref|XP_003716330.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
gi|148887340|sp|Q523C3.2|ATG4_MAGO7 RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|351642149|gb|EHA50011.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
Length = 491
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R ALA +Y G P V D
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367
Query: 356 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 406
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 427
Query: 407 AIGFYCRDKDDF 418
IGF D++++
Sbjct: 428 LIGFLILDEENW 439
>gi|393247625|gb|EJD55132.1| hypothetical protein AURDEDRAFT_78065 [Auricularia delicata
TFB-10046 SS5]
Length = 989
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 134/284 (47%), Gaps = 47/284 (16%)
Query: 140 FNQDFSSRILISYRKGFDPI-----------------------------GDSKITSDVGW 170
F DF+SR+ ++YR F PI G+ TSD GW
Sbjct: 314 FYADFTSRVWLTYRSQFSPIHDCPLSACKGKDLESLDANPPKRTFWPGSGEKTWTSDAGW 373
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPL---QKPFDREYVEILHLFGDSET--SPFSIHN 225
GCMLR+ Q L+A L+ LGR WR+P P YV+IL F D+ + +PFS+H
Sbjct: 374 GCMLRTGQSLLANTLIHLHLGRDWRRPAINSASPEFATYVKILTWFFDAPSVHAPFSVHR 433
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIYVVSGDEDGERG 284
+ +GK +G G W GP + L RA+ G+ +A+ V + D
Sbjct: 434 MAMSGKDFGKDVGQWFGPSTAAGAIRTLVHDFPRAQLGVA-----IAVDGVLYETDIYSA 488
Query: 285 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
+ D +R S F + W +L+LV LGL+ VNP Y L+ FTFPQSL
Sbjct: 489 SHYPMSSADGARRASGFKRHPGRWGNRAVLVLVATRLGLDGVNPIYYENLKTIFTFPQSL 548
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI-----GKDD 381
GI GG+P +S Y VG Q S YLDPH +P + + G DD
Sbjct: 549 GIAGGRPSSSYYFVGSQGNSLFYLDPHHTRPAVPLRTPPPGDDD 592
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 42/71 (59%), Gaps = 2/71 (2%)
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
D T+H D +R + L +DPS+ +GF CRD+ D+ DF R +++++ + LF++ +
Sbjct: 699 DLKTFHCDRVRKMPLSGLDPSMLLGFLCRDEQDWKDFRRRMAEISKGRDT--LFSIQEEP 756
Query: 445 KKPVNHSDVLG 455
+ SD +G
Sbjct: 757 PSWPSDSDDMG 767
>gi|440478911|gb|ELQ59709.1| cysteine protease atg4 [Magnaporthe oryzae P131]
Length = 572
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R ALA +Y G P V D
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 390 -FMEVAKSDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448
Query: 356 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 406
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 508
Query: 407 AIGFYCRDKDDF 418
IGF D++++
Sbjct: 509 LIGFLILDEENW 520
>gi|440467300|gb|ELQ36530.1| cysteine protease atg4 [Magnaporthe oryzae Y34]
Length = 572
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF SRI ++YR GF+PI S T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R ALA +Y G P V D
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 390 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448
Query: 356 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 406
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 508
Query: 407 AIGFYCRDKDDF 418
IGF D++++
Sbjct: 509 LIGFLILDEENW 520
>gi|358369016|dbj|GAA85631.1| autophagy cysteine endopeptidase Atg4 [Aspergillus kawachii IFO
4308]
Length = 378
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 113/375 (30%), Positives = 172/375 (45%), Gaps = 54/375 (14%)
Query: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQD-EALGDAAGNNGLAE----------- 139
+RI + + P TS IW LG+ + +D G+ N +
Sbjct: 11 KRIVQYLWDPEPRNDEDPTSSIWCLGIEYHPEKDVSPRGETPDKNSARDNTTGTTNYRKP 70
Query: 140 --------FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
F DF SRI ++YR F PI ++ D M S L+A AL LG
Sbjct: 71 SEHAWPESFLLDFESRIWMTYRSNFPPI--PRVEGDDKSASMTLGS--LLANALSTLVLG 126
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSW 250
R WR+ + F+ E ++L LF D+ T+PFS+H ++ G ++ G G W GP A +
Sbjct: 127 RDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKFPGEWFGPSATAKCI 183
Query: 251 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 310
EAL+ C S + +YV + + + R +V + P
Sbjct: 184 EALSS--------QCGSPTLKVYVSNDTSEVYQ-----------DRFMNVARNSSGVFQP 224
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
L+L+ LG++ + P Y L+ T PQS+GI GG+P AS Y VG Q YLDPH
Sbjct: 225 TLILLGTRLGIDHITPVYWDGLKATLQLPQSVGIAGGRPSASHYFVGAQGSHLFYLDPHY 284
Query: 371 VQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
+P + G+ + + TYH+ +R IH+ +DPS+ IGF RD++D+DD+ R
Sbjct: 285 TRPALPDRQGGELYSKEEVDTYHTRRLRRIHVRDMDPSMLIGFLIRDQEDWDDWLNRIQA 344
Query: 428 LAEESNGAPLFTVTQ 442
+ G P+ V +
Sbjct: 345 V----KGRPIIHVLK 355
>gi|328351041|emb|CCA37441.1| autophagy-related protein 4 [Komagataella pastoris CBS 7435]
Length = 758
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 111/357 (31%), Positives = 150/357 (42%), Gaps = 98/357 (27%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 178
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 237
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 295 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
+ G +D +TPIL+L+ + LG+EKVNP Y +LR + QS+GI GG+P +S
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281
Query: 354 YIVGVQEESAIYLDPHDVQPVINIG------------KDDLEA----------------- 384
Y G Q + YLDPH Q + G K D A
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTKKTDENAAGQYPVSNTDSNNETNH 341
Query: 385 --------------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
D + H+ + +HL +DPS+ IGF +DDF+D+
Sbjct: 342 DDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSHMDPSMLIGFLITSEDDFNDW 398
>gi|410967384|ref|XP_003990200.1| PREDICTED: cysteine protease ATG4C [Felis catus]
Length = 459
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 117/425 (27%), Positives = 171/425 (40%), Gaps = 94/425 (22%)
Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +DE + D + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155
Query: 198 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 224
QK R Y + I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGERELKTPAVSQKETIRRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQ-------- 262
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + C+ + D +++L+P+ LG E+ N Y+ ++ ++L I
Sbjct: 263 DCTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGIL---RALNI 319
Query: 345 VG----GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
V KP S Y G Q++S IY+DPH Q +++ D + T+H + +
Sbjct: 320 VWVLLVAKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFR 377
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHS 451
+DPS IGFYCR+ DF +K+ + S+ PLFT H + N
Sbjct: 378 KMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEE 437
Query: 452 DVLGE 456
D+ E
Sbjct: 438 DLFSE 442
>gi|148691993|gb|EDL23940.1| mCG3720 [Mus musculus]
Length = 318
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 87/266 (32%), Positives = 127/266 (47%), Gaps = 49/266 (18%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 77 VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 125
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 126 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 185
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 186 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 228
Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
D + C V G AD W P+LL+VPL LG+ ++NP Y+
Sbjct: 229 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 288
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVG 357
+ F PQSLG +GGKP + Y +G
Sbjct: 289 FKECFKMPQSLGALGGKPNNAYYFIG 314
>gi|254567087|ref|XP_002490654.1| Conserved cysteine protease required for autophagy [Komagataella
pastoris GS115]
gi|238030450|emb|CAY68374.1| Conserved cysteine protease required for autophagy [Komagataella
pastoris GS115]
Length = 531
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 111/357 (31%), Positives = 150/357 (42%), Gaps = 98/357 (27%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 178
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 237
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 295 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
+ G +D +TPIL+L+ + LG+EKVNP Y +LR + QS+GI GG+P +S
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281
Query: 354 YIVGVQEESAIYLDPHDVQPVINIG------------KDDLEA----------------- 384
Y G Q + YLDPH Q + G K D A
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTKKTDENAAGQYPVSNTDSNNETNH 341
Query: 385 --------------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
D + H+ + +HL +DPS+ IGF +DDF+D+
Sbjct: 342 DDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSHMDPSMLIGFLITSEDDFNDW 398
>gi|210063823|gb|ACJ06587.1| ATG4 protein [Magnaporthe oryzae]
Length = 491
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 144/312 (46%), Gaps = 56/312 (17%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKI----------------TSDVGWGCMLRS 176
F DF SRI ++YR GF DP S++ T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFESIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A +LL RLGR WR+ Q P E ++L LF D +P+SIHN + G A G
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R ALA +Y G P V D
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
V + P L+L+ LG++K+N Y +L T PQS+GI GG+P +S Y
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367
Query: 356 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 406
VG Q YLDPH +P + +D +D + H+ +R +H+ +DPS+
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 427
Query: 407 AIGFYCRDKDDF 418
IGF D++++
Sbjct: 428 LIGFLILDEENW 439
>gi|67526025|ref|XP_661074.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
gi|40743824|gb|EAA63010.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
Length = 379
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 159/328 (48%), Gaps = 54/328 (16%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
F DF S+I ++YR F PI TSD GWGCM+RS
Sbjct: 50 FLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMIRS 109
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
Q L+A ++ LGR WR+ + E ++L LF DS +PFSIH+ ++ G + G
Sbjct: 110 GQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFCGK 166
Query: 236 AAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R + LA R ++ + +Y+ + D + V D+
Sbjct: 167 HPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRDE- 216
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
KG P L+L+ L LG++++ Y L+ PQS+GI GG+P AS Y
Sbjct: 217 --------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSASHY 266
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
V VQ YLDPH+ +P + + E + +TYH+ +R +++ +DPS+ IGF
Sbjct: 267 FVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDPSMLIGFLI 326
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTV 440
RD+DD++D+ AR L G P+ T+
Sbjct: 327 RDEDDWEDWKARIMSL----EGKPIITI 350
>gi|358381369|gb|EHK19044.1| hypothetical protein TRIVIDRAFT_181799 [Trichoderma virens Gv29-8]
Length = 451
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 98/309 (31%), Positives = 146/309 (47%), Gaps = 50/309 (16%)
Query: 140 FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 176
F +D +++ ++YR GFDPI S +SD GWGCM+RS
Sbjct: 117 FLEDMAAKFWMTYRSGFDPIAKSVDPRATSALSFAVRIKSTLSDPTGFSSDSGWGCMIRS 176
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A + +LGR WR+ +E +++ +F D +P+SIHN ++ G A G
Sbjct: 177 GQSLLATTIGILQLGRDWRR---GKCQQEERQLISMFADDPRAPYSIHNFVRHGATACGK 233
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A A+C +A T LP+ +Y + +D + D
Sbjct: 234 FPGEWFGP-------SATAQCIQALTS--ASGLPLKVYSPNDGQDVYEDSFMKIAKPD-- 282
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
GQ D+ P L+L+ LG++K+ P Y L PQS+GI GG+P +S Y
Sbjct: 283 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLLAALQMPQSVGIAGGRPSSSHYF 333
Query: 356 VGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
VG Q YLDPH + I D E D + H+ +R +HL +DPS+ IGF
Sbjct: 334 VGSQGSYLFYLDPHHTRKAIPYHADVTKYTEEDIESCHTSRLRRLHLKEMDPSMLIGFLI 393
Query: 413 RDKDDFDDF 421
R + D+ ++
Sbjct: 394 RTESDWSEW 402
>gi|343428793|emb|CBQ72338.1| related to ATG4-essential for autophagy [Sporisorium reilianum SRZ2]
Length = 1505
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 155/339 (45%), Gaps = 77/339 (22%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE------------------ 205
+T+D GWGCMLR+ Q L+A AL+ LGR W + + P R+
Sbjct: 785 LTTDSGWGCMLRTGQSLLANALINVHLGRSWMR--EAPPARQLEFLQELANLSLDTSAEK 842
Query: 206 ---------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
Y++IL F D S PF +H + + GK G G W GP
Sbjct: 843 QSLLEWRQKRARHSTYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAG 902
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDE-DGERGGAPVVCIDDASRHCSVFSKGQAD 307
+ + L + + GL + ++ + DE G + + AS + KG
Sbjct: 903 AIKQLV-SEFPDAGLAVELAHDGVFYL--DEVRAAAGASRQLGKGRASATGTNGRKGDTA 959
Query: 308 WT---PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
T P+L+L+ + LGL+ VNP Y +++ TF+FP S+GI GG+P +S Y +G Q S
Sbjct: 960 LTWHKPVLILIGIRLGLDSVNPIYYESVKATFSFPHSVGIAGGRPSSSYYFMGHQGNSLF 1019
Query: 365 YLDPHDVQPVINI------------------------GKDD---------LEADTSTYHS 391
YLDPH+V+P + + DD EA TST+H
Sbjct: 1020 YLDPHNVRPAVALRFPPSTFPAAVPRQLDIAHRFAFEEHDDEDEWWSHAYTEAQTSTFHC 1079
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430
D +R + + S+DPS+ +GF +D++D D CAR L++
Sbjct: 1080 DKVRRMPIKSLDPSMLLGFLVKDEEDLADLCARIKALSK 1118
>gi|358390472|gb|EHK39877.1| hypothetical protein TRIATDRAFT_208244 [Trichoderma atroviride IMI
206040]
Length = 452
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 103/317 (32%), Positives = 150/317 (47%), Gaps = 52/317 (16%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVG 169
G A F +D SS+ ++YR GF+PI S +SD G
Sbjct: 113 GTGWPAGFVEDMSSKFWMTYRSGFEPIPKSVDPKAASALSFSMRIKSTLSDSAGFSSDSG 172
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A + RLGR WR+ + +R ++ +F D +P+SIHN ++
Sbjct: 173 WGCMIRSGQSLLATTIGILRLGRDWRRDQSQEEERH---LISMFADDPRAPYSIHNFVRH 229
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A A+C +A T L + IY + +D
Sbjct: 230 GATACGKYPGEWFGP-------SATAQCIQALTS--SSGLSLNIYSPNDGQD-------- 272
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ + S S GQ + P L+L+ LG++K+ P Y L PQS+GI GG+
Sbjct: 273 --VYEDSFMKIAKSDGQT-FNPTLILIRTRLGIDKITPIYWDALIAALHMPQSVGIAGGR 329
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL----EADTSTYHSDVIRHIHLDSIDP 404
P +S Y VG Q YLDPH + I DD+ E D + H+ +R IH+ +DP
Sbjct: 330 PASSHYFVGSQGSYLFYLDPHHTRKAIPY-HDDVTKYTEEDIESCHTSRLRRIHIKEMDP 388
Query: 405 SLAIGFYCRDKDDFDDF 421
S+ IGF R + D+ ++
Sbjct: 389 SMLIGFLIRTESDWTEW 405
>gi|452004375|gb|EMD96831.1| hypothetical protein COCHEDRAFT_1123524 [Cochliobolus
heterostrophus C5]
Length = 471
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 160/358 (44%), Gaps = 91/358 (25%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
N + F DF SRI ++YR GF I S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFMAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
+RS Q ++A AL RLGR WR KP +E+ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAV 209
Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + LA R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYV-SGD------GADVY--E 252
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + ++ GQ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEVAIDDDGQ--WQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 353 TYIVGVQEESAIYLDPHDVQPVI--------------NIGKDDLE--------------- 383
Y V Q + YLDPH +P++ N ++ L
Sbjct: 311 HYFVATQGNNFFYLDPHSTRPLLPYRPPPSSTENESQNQSQNQLAVPSSLDASATSNSSS 370
Query: 384 ------------ADTSTY--------HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+D +TY H+ IR + + +DPS+ I F DD++++
Sbjct: 371 TTIVPSATPTDGSDRTTYSEEELATCHTRRIRRLQIREMDPSMLIAFLITSADDYENW 428
>gi|189194545|ref|XP_001933611.1| peptidase family C54 protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187979175|gb|EDU45801.1| peptidase family C54 protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 470
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 112/357 (31%), Positives = 159/357 (44%), Gaps = 90/357 (25%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
N + F DF SRI ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
+RS Q ++A AL RLGR WR ++P +E+ +++ +F D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDVVAMFADDPRAPFSIHRFVEHGAAV 209
Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + L R E GL +YV SGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLVHKNR-EAGL-------KVYV-SGD------GADVY--E 252
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+P AS
Sbjct: 253 DKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310
Query: 353 TYIVGVQEESAIYLDPHDVQPVINI------------GKDDLE----------------- 383
Y V Q + YLDPH +P++ LE
Sbjct: 311 HYFVATQANNFFYLDPHSTRPLLPYRPSSSSTEEQVAAPSTLEASATSVTSTSSSTTIVP 370
Query: 384 -ADTSTYHSDV------------------IRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
A+ T SDV IR + + +DPS+ + F +DD++D+
Sbjct: 371 SANEVTAPSDVSKPSGYSLEELATCHTRRIRRLQIREMDPSMLLAFLITSEDDYEDW 427
>gi|330935035|ref|XP_003304808.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
gi|311318464|gb|EFQ87127.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
Length = 470
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 110/361 (30%), Positives = 158/361 (43%), Gaps = 90/361 (24%)
Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVG 169
A N + F DF SRI ++YR GF PI S+ TSD G
Sbjct: 87 AQYGNWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
+GCM+RS Q ++A AL RLGR WR ++P +E+ +I+ +F D +PFSIH ++
Sbjct: 147 FGCMIRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDIVAMFADDPRAPFSIHRFVEH 205
Query: 230 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A R + L + E GL +YV SGD GA V
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLVH-KNKEVGL-------KVYV-SGD------GADV 250
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+
Sbjct: 251 Y--EDKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGR 306
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD----------------------- 385
P AS Y V Q + YLDPH +P++ +
Sbjct: 307 PSASHYFVATQANNFFYLDPHSTRPLLPYRPSSWSTEEQASAPSTLEASATSATSTSSST 366
Query: 386 -----------------TSTY--------HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
TS Y H+ IR + + +DPS+ + F +DD++D
Sbjct: 367 TIVPSANEVTAPSDASRTSGYSPEELATCHTRRIRRLQIREMDPSMLLAFLITSEDDYED 426
Query: 421 F 421
+
Sbjct: 427 W 427
>gi|71022117|ref|XP_761289.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
gi|46097783|gb|EAK83016.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
Length = 1541
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 109/367 (29%), Positives = 165/367 (44%), Gaps = 86/367 (23%)
Query: 155 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK--PLQKPFD--------- 203
GF G +T+D GWGCMLR+ Q L+A ALL LGR W + P + D
Sbjct: 814 GFSRAG---LTTDSGWGCMLRTGQSLLANALLNVHLGRSWLREAPPMRQMDFLEQLASLS 870
Query: 204 -------------RE-------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWV 241
RE Y++IL F D S PF +H + + GK G G W
Sbjct: 871 LDSSVEMQSLQEWREKRARHAAYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWF 930
Query: 242 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 301
GP + + L + + G+ + ++ + DE GA R
Sbjct: 931 GPSTAAGAIKQLV-TEFPDAGIAVELAHDGVFYL--DEVRLAAGARSALQSGKGR----- 982
Query: 302 SKGQADWT---PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
+G A T P+++L+ + LGL+ VNP Y +++ TF+FP S+GI GG+P +S Y +G
Sbjct: 983 -QGDAAVTWRRPVVILIGIRLGLDSVNPIYYESVKETFSFPHSVGIAGGRPSSSYYFMGH 1041
Query: 359 QEESAIYLDPHDVQPVINI------------------------GKDD---------LEAD 385
Q S YLDPH+V+P + + KDD EA
Sbjct: 1042 QGNSLFYLDPHNVRPAVALRYPPSTFPTAVPHQLDVAHRFALEDKDDELEWWSHAYTEAQ 1101
Query: 386 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 445
TST+H + +R + + S+DPS+ +GF +D++D D C R L + +F+ ++
Sbjct: 1102 TSTFHCEKVRRMPIKSLDPSMLLGFLVKDEEDLMDLCTRIKGLPKT-----IFSFAESAP 1156
Query: 446 KPVNHSD 452
K V+ D
Sbjct: 1157 KWVDDDD 1163
>gi|443893810|dbj|GAC71266.1| cysteine protease [Pseudozyma antarctica T-34]
Length = 1509
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 115/392 (29%), Positives = 172/392 (43%), Gaps = 88/392 (22%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----QKPFDRE-------------- 205
+T+D GWGCMLR+ Q L+A AL+ LGR W++ Q F E
Sbjct: 776 LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRETAPKSQIEFFEELANASLDASAENQS 835
Query: 206 -------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 250
Y+ IL F D S PF +H + + GK G G W GP +
Sbjct: 836 LASWRERRARHATYIRILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAGAI 895
Query: 251 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----A 306
+ L E G+ + ++ + D R A SR + S + A
Sbjct: 896 KQLV-FDFPEAGIAVELAHDGVFYL----DEVRAAASAST--GKSRASGMLSGNRRAETA 948
Query: 307 DWT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
W P+L+L+ + LGLE VNP Y +++ TF+FPQS+GI GG+P +S Y +G Q S Y
Sbjct: 949 VWRRPVLILIGIRLGLETVNPIYYESVKATFSFPQSVGIAGGRPSSSYYFMGHQGNSLFY 1008
Query: 366 LDPHDVQPVINI------------------------GKDD---------LEADTSTYHSD 392
LDPH+V+P + + +DD EA TST+H +
Sbjct: 1009 LDPHNVRPAVPLRYPPTTFPAAAPSRFDVSHRYALEDRDDEDEWWSHAYTEAQTSTFHCE 1068
Query: 393 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSD 452
+R + + S+DPS+ +GF +D++ D CAR L + +F+ ++ K V+ D
Sbjct: 1069 KVRRMPIKSLDPSMLLGFLVKDEEALVDLCARIKALPKT-----IFSFAESAPKWVDDDD 1123
Query: 453 V--LGETGGVPEDDSLGVMSMNDAVGNAHEDD 482
E+ P D G +D VG + D
Sbjct: 1124 FDPSMESFSEPSADEAG---SDDDVGKGEDQD 1152
>gi|402219068|gb|EJT99143.1| hypothetical protein DACRYDRAFT_70366 [Dacryopinax sp. DJM-731 SS1]
Length = 1093
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 153/319 (47%), Gaps = 42/319 (13%)
Query: 160 GDSKITSDVGWGCMLRSSQMLVAQALL-------------FHRLGRPWRKPLQKPFDRE- 205
G +TSD GWGCMLR+ QML+A +L+ + P P + DR+
Sbjct: 431 GRGDLTSDAGWGCMLRTGQMLLANSLVALHVPPLPPNPVYINNFPAPSLPPSET--DRQR 488
Query: 206 ---YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
YV+IL F D + PFS+H L AG G G W GP S + L A
Sbjct: 489 FEAYVKILVWFLDDPSIWCPFSVHRLALAGADMGREVGQWFGPSIAAGSIKKLVSAFPA- 547
Query: 261 TGLGCQSLP------MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPIL 312
GLG P A++ S + + D ++ + +W +L
Sbjct: 548 CGLGVVVPPDQIIHETAVFTASHTPTLPSSASSLSNTRDREARERA-NRMKEEWGDRAVL 606
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
+L+ L LG+E V P Y +++ FTFPQ++GI GG+P +S Y VG Q + YLDPH +
Sbjct: 607 ILIGLRLGIEGVTPIYYDSVKALFTFPQTVGIAGGRPSSSYYFVGTQGDHLFYLDPHSTR 666
Query: 373 PVINI-----GKDDLE-----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
P + + G D ++ T+HSD +R +H+ +DPS+ GF R+ +++ D
Sbjct: 667 PAVPLRVPTDGPYDATGQFTLSEMKTFHSDKVRKMHISGLDPSMLCGFIVRNVEEWRDLR 726
Query: 423 ARASKLAEESNG-APLFTV 440
AR LA+ G AP+FT+
Sbjct: 727 ARVDALAKSKGGKAPIFTI 745
>gi|388856806|emb|CCF49593.1| related to ATG4-essential for autophagy [Ustilago hordei]
Length = 1572
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 167/387 (43%), Gaps = 114/387 (29%)
Query: 155 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK---PL-QKPFDRE----- 205
GF G +T+D GWGCMLR+ Q L+A AL+ LGR W++ PL Q+ F E
Sbjct: 824 GFSRAG---LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRDAPPLRQQQFLEELAGLS 880
Query: 206 ----------------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWV 241
Y++IL F D S PF +H + + GK G G W
Sbjct: 881 IADAAEKESLQEWRQKRARHATYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWF 940
Query: 242 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD-------A 294
GP + + L P A V DG V +D+ +
Sbjct: 941 GPSTASGAIKQL-----------VSEFPQAGIAVELARDG------VFYLDEVRAAASAS 983
Query: 295 SRHCSVFSKGQAD---------------WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
+ SV S G+A W P+L+L+ + LGLE VNP Y +++ TF+F
Sbjct: 984 ASAASVQSGGKARSSGAASGSRKGEGLIWRRPVLILIGIRLGLESVNPIYYESVKATFSF 1043
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI--------------------- 377
P S+GI GG+P +S Y +G Q S YLDPH+V+P + +
Sbjct: 1044 PHSVGIAGGRPSSSYYFMGHQGNSLFYLDPHNVRPAVPLRYPPSTFPDAVPRHLGIAHRF 1103
Query: 378 ---GKDD---------LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
KDD E TST+H + +R + + S+DPS+ +GF +D++ D CAR
Sbjct: 1104 VLEDKDDEDEWWSHAYSEVQTSTFHCEKVRRMPIKSLDPSMLLGFLVKDEESLQDLCARI 1163
Query: 426 SKLAEESNGAPLFTVTQTHKKPVNHSD 452
L + +F+ ++ K V+ D
Sbjct: 1164 KALPKT-----IFSFAESAPKWVDDDD 1185
>gi|353227348|emb|CCA77858.1| hypothetical protein PIIN_00505 [Piriformospora indica DSM 11827]
Length = 1257
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 98/309 (31%), Positives = 142/309 (45%), Gaps = 61/309 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 164
F D++SR+ ++YR F PI D+ +
Sbjct: 317 FYSDYTSRVWLTYRNTFPPIRDTALSCLEPVASRSTHNNSSSTDISQPLPSPSKPRWPWS 376
Query: 165 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE---ILHLFGD- 215
TSD GWGCMLR+ Q L+A AL+ L R WR+P + +YV+ IL F D
Sbjct: 377 GEKGWTSDAGWGCMLRTGQSLLANALIHLHLSRSWRRPTHPSYSPDYVQYVRILTWFLDN 436
Query: 216 -SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC--------- 265
S +PF IH + AGK G GSW GP + + L + + GL
Sbjct: 437 PSPLAPFGIHRMALAGKELGKEVGSWFGPSTAAGAIKRLV-GEFEDAGLEVALAVDSVVY 495
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEK 323
QS A S +++G G + V + + +G W P+L+LV + LG++
Sbjct: 496 QSDVYAASAASRNQNGVEGDSKTVGTSKSRKKG----QGPPKWGNRPVLILVGIRLGIDG 551
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
VNP Y +++ FTFPQ++GI GG+P +S Y VG Q +S YLDPH +P I +
Sbjct: 552 VNPIYYESVKTLFTFPQTVGIAGGRPSSSYYFVGAQGDSLFYLDPHHTRPAIPLRPPPAF 611
Query: 384 ADTSTYHSD 392
+TS +D
Sbjct: 612 DETSIISTD 620
Score = 42.7 bits (99), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 16/53 (30%), Positives = 34/53 (64%), Gaps = 2/53 (3%)
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
T+H + +R + L ++DPS+ +GF CR+++++ D R +++A +F+V
Sbjct: 794 TFHCERVRKMPLSALDPSMLLGFLCRNEEEWKDLRERLAEMARTKKA--IFSV 844
>gi|348511374|ref|XP_003443219.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
Length = 459
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 98/342 (28%), Positives = 154/342 (45%), Gaps = 59/342 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-------------- 185
F + F+S + +YR+GF P+ S +T+D GWGC+LRSSQML+AQ L
Sbjct: 98 FRRCFASLLWFTYRRGFRPLPGSSLTTDSGWGCVLRSSQMLLAQGLLLHLMSPGWTWSGN 157
Query: 186 ---------LFHRLGR---------------PWRKPLQKPFDREYVEILHLFGDSETSPF 221
L H + W L +P + IL F D+ T+PF
Sbjct: 158 QRVVKDDMDLIHSVNDGFSSSERESKRSRHLSWGSILDRPTEGTPRRILRWFADNPTAPF 217
Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
IH L++ GK+ G AG W GP A R LP + V+ D
Sbjct: 218 GIHRLVELGKSSGKKAGDWYGP-------SIAAHILRKAVEASVVDLPNLVAYVAQD--- 267
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
+ + D + C W +L+LVP+ LG + +NP YI +++
Sbjct: 268 -----CTIYLQDVRKLCE--RPLPQHWKSVLILVPVRLGGQDLNPSYITSVKKLLMLECC 320
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
+GI+GGKP S + VG Q++ +YLDPH QP +++ K+ ++H R +
Sbjct: 321 IGIIGGKPKHSLFFVGFQDDHLLYLDPHYCQPTVDVTKN---FPLESFHCKNPRKMPFSR 377
Query: 402 IDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQ 442
+DPS IGFY + + +F+ C ++ ++ + P+F +
Sbjct: 378 MDPSCTIGFYAKGQMEFESLCTSVNEAVSASAETYPMFIFEE 419
>gi|256071261|ref|XP_002571959.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
gi|353229490|emb|CCD75661.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
Length = 376
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/317 (32%), Positives = 155/317 (48%), Gaps = 42/317 (13%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFS 222
TSD GWGCM R QML+AQAL+ H LGR WR + ++I+ F DS + SP S
Sbjct: 67 TSDCGWGCMFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLS 126
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSG 277
+H L+Q G W GP ++C A+ R + L + + +Y V+
Sbjct: 127 LHRLVQMSDR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYR 180
Query: 278 DE--DGERG------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLG 320
+E D RG P + D H +++ + Q+D T ILLL+PL+ G
Sbjct: 181 EEIIDLARGLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFG 236
Query: 321 L-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
++NPRYI + F+ P +G++GG+ S+Y VG Q S IYLDPH QP N+
Sbjct: 237 KGNRINPRYIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNS 296
Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 439
D ++H + + + +++PS A+GFYCR + + D R L S+
Sbjct: 297 PKFSVD--SWHCPIPKTMSAANLNPSCAVGFYCRTRGELSDLIDRLPILMSVSDNLQ--- 351
Query: 440 VTQTHKKPVNHS-DVLG 455
T +PV + +VLG
Sbjct: 352 -ASTRSRPVAFTVEVLG 367
>gi|451855330|gb|EMD68622.1| hypothetical protein COCSADRAFT_79257 [Cochliobolus sativus ND90Pr]
Length = 473
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 97/263 (36%), Positives = 122/263 (46%), Gaps = 42/263 (15%)
Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVG 169
A N + F DF SRI ++YR GF I S+ TSD G
Sbjct: 87 AQYGNWPSAFLDDFESRIWMTYRSGFTAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
+GCM+RS Q ++A AL RLGR WR KP +E+ EIL LF D +PFSIH ++
Sbjct: 147 FGCMIRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEH 205
Query: 230 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A R + LA R E GL +YV D
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYVSGDGADVYEDKLKE 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
V IDD +W P L+LV LG++K+ P Y L+ + QS+GI GG+
Sbjct: 258 VAIDD-----------DGEWQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGR 306
Query: 349 PGASTYIVGVQEESAIYLDPHDV 371
P AS Y V Q + YLDPH
Sbjct: 307 PSASHYFVATQGNNFFYLDPHST 329
>gi|444525500|gb|ELV14047.1| Cysteine protease ATG4D [Tupaia chinensis]
Length = 431
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 104/384 (27%), Positives = 171/384 (44%), Gaps = 89/384 (23%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 60 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 109
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
+TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 110 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSRSASPSRYHGPAH 169
Query: 195 -RKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
R P L++ +R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 170 WRPPRWAQGTPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-- 225
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
+A R + + +YV +D A VV +
Sbjct: 226 -----SLVAHILRKAVESCSEVTRLVVYV---SQDCTVYKADVVRL-------VARPDPA 270
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++L T P ++ +Y
Sbjct: 271 AEWKSVVILVPVRLGGETLNPVYVPCVKLMPTPP-------------------TDDFLLY 311
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 312 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFEMLCSEL 369
Query: 426 SKLAEESNGA---PLFTVTQTHKK 446
+++ S+ P+FT+ + H +
Sbjct: 370 TRVLSSSSATERYPMFTLAEGHAQ 393
>gi|355750993|gb|EHH55320.1| hypothetical protein EGM_04504, partial [Macaca fascicularis]
Length = 268
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 92/268 (34%), Positives = 133/268 (49%), Gaps = 40/268 (14%)
Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
++ +W+LG + I +DE L D A SR+ +YRK F IG + TS
Sbjct: 22 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 69 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128
Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
Q G G + G W GP + + + LA + +A+++ V +E
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180
Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVG 357
TL+ F PQSLG++GGKP ++ Y +G
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIG 268
>gi|432871194|ref|XP_004071879.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
Length = 452
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 167/371 (45%), Gaps = 65/371 (17%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
S +S + LLG +++ +DEA + F + F+S + ++YR+GF + S +T+D
Sbjct: 70 SKSSPLILLGKSYEL-KDEANKE--------RFRRSFASLLWLTYRRGFPQLAGSSLTTD 120
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----------------------------- 198
GWGC+LR+ QML+A+ LL H + W +
Sbjct: 121 SGWGCVLRTGQMLLARGLLTHLMPPGWMWSVWYRAVKDDLDLPHHADCTDCKSNMRCRYQ 180
Query: 199 ------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
+P + + +++ F D +PF IH L++ G + G AG W GP +
Sbjct: 181 SLGSLYDRPLEAMHRKVVSWFADHPKAPFGIHRLVELGASSGKKAGDWYGPSIVA---HI 237
Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 312
L + A LP + V+ D + + D C W ++
Sbjct: 238 LQKAVAASV-----DLPNLVVYVAQD--------CTIYLQDVRGLCE--RPPPHSWKSVI 282
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
+LVP+ LG + +NP YI ++ +GI+GG+P S + VG Q++ +YLDPH Q
Sbjct: 283 ILVPVRLGGQDLNPSYISCVKKLLELQCCIGIIGGRPKHSLFFVGFQDDQLLYLDPHYCQ 342
Query: 373 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES 432
+N+ K++ + ++H R + +DPS IGFY + + + C +++ S
Sbjct: 343 LTVNVTKENFPLE--SFHCKYPRKMPFSRMDPSCTIGFYASGQQELELLCTNVNEVVSTS 400
Query: 433 -NGAPLFTVTQ 442
G P+F ++
Sbjct: 401 AEGYPMFIFSE 411
>gi|340518098|gb|EGR48340.1| protease required for autophagy [Trichoderma reesei QM6a]
Length = 450
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 148/312 (47%), Gaps = 50/312 (16%)
Query: 140 FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 176
F +D +++ ++YR GF+PI S +SD GWGCM+RS
Sbjct: 115 FTEDMAAKFWMTYRSGFEPIPKSVDPRATSALSFSVRIKSTLTDPTGFSSDSGWGCMIRS 174
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
Q L+A + +LGR WR+ + +E ++ +F D +PFSIHN ++ G A G
Sbjct: 175 GQSLLATTIATLQLGRDWRRGKNQ---QEERRLISMFADDPRAPFSIHNFVRHGATACGK 231
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A A+C +A T L + +Y + +D V D
Sbjct: 232 FPGEWFGP-------SATAQCIQALTS--SSDLDLHVYSPNDGQDVYEDSFMKVAKPD-- 280
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
GQ D+ P L+L+ LG++K+ P Y L T PQS+GI GG+P +S Y
Sbjct: 281 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLIATLQMPQSVGIAGGRPSSSHYF 331
Query: 356 VGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
VG Q YLDPH + + +D + D + H+ +R +H+ +DPS+ IGF
Sbjct: 332 VGSQGSYLFYLDPHHTRKALPYHEDVANYTDEDIDSCHTSRLRRLHVKEMDPSMLIGFLI 391
Query: 413 RDKDDFDDFCAR 424
R + D+ ++ R
Sbjct: 392 RSESDWAEWRQR 403
>gi|358056752|dbj|GAA97415.1| hypothetical protein E5Q_04093 [Mixia osmundae IAM 14324]
Length = 1202
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 114/390 (29%), Positives = 161/390 (41%), Gaps = 110/390 (28%)
Query: 140 FNQDFSSRILISYRKGFDPI---------------------------GDSKITSDVGWGC 172
F +DF+SRI ++YR GF PI + +++D GWGC
Sbjct: 545 FYEDFTSRIQLTYRAGFPPIPTTVSNGPATTAFNAVLSSLTGRSPLQANDGLSTDAGWGC 604
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPL----QKPFDRE----------YVEILHLFGD--S 216
MLR+ Q L+A AL F LGR WR+ + P E Y +L F D S
Sbjct: 605 MLRTGQSLLANALAFVHLGRDWRRTCSSSDESPDIPEESRSLEHFETYARLLTWFLDDPS 664
Query: 217 ETSPFSIHNLLQAGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV- 274
PFS+H GK G G W GP + + LA + +L +A+ V
Sbjct: 665 PLCPFSVHRFAVVGKEQGGKEIGEWFGPSTAAGAIKHLA------SNFAPANLGVAVSVD 718
Query: 275 --VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
V + P A R S + P+L+L+ LGL+KVNP Y ++
Sbjct: 719 GTVYRSDVQAAANPPFSEPATAGRQDPAPSVRTSWQRPVLILINARLGLDKVNPLYYESI 778
Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH----------------------- 369
+ +FPQS+GI GG+P +S Y VGVQ+ S Y+DPH
Sbjct: 779 KAALSFPQSVGISGGRPSSSYYFVGVQQNSVYYIDPHHTKPAIPFRQPPPDIAALAAELP 838
Query: 370 -DVQPVINIGKDDL----------------EADTST-----------------YHSDVIR 395
D+ +N + L E D +T +H D +R
Sbjct: 839 LDIHSPLNAWQRSLGDSLPPTPGAEPPAPDECDDATRLRAWFANEYDETCFGSFHCDRVR 898
Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
+ L +DPS+ IGF CRD+ D+DD +RA
Sbjct: 899 KMPLSGLDPSMLIGFLCRDEADWDDLQSRA 928
>gi|448112117|ref|XP_004202013.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
gi|359465002|emb|CCE88707.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
Length = 480
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 98/330 (29%), Positives = 154/330 (46%), Gaps = 60/330 (18%)
Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 159
LG G+ E ++D SRI +YR GF+PI
Sbjct: 69 LGRRYGSGSKEEMDKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128
Query: 160 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 213
+ T+DVGWGCM+R+SQML+A A+ LGR + ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAIQLLLLGRGFT--YADSSEKKHSDIIDMF 186
Query: 214 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
D +PFS+HN ++A L G W GP A S + L + Q E+ S P
Sbjct: 187 TDDPKAPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQFDES-----SSPRF 241
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
++S D DD + + + + IL+L+P+ LGL KV+P Y +
Sbjct: 242 RVIISESCD---------IYDD--KIGKLLQENEDAEGAILILLPVRLGLNKVSPYYHNS 290
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
L F+ PQ +GI GGKP +S Y G + +YLDPH Q V + T+H+
Sbjct: 291 LSSLFSSPQLVGIAGGKPSSSYYFFGSHNGNLLYLDPHYPQSV------KASSIYDTFHT 344
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
++ + ++ +DPS+ IG + K+D++ F
Sbjct: 345 HNVQSLKIEDMDPSMLIGILIKSKEDYESF 374
>gi|113931596|ref|NP_001039246.1| autophagy related 4D, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|89273389|emb|CAJ82151.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
gi|114108226|gb|AAI22932.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
Length = 470
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 172/385 (44%), Gaps = 79/385 (20%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
S ++ ++LLG + D+ + F +DF SR+ ++YR+ F + + +T+D
Sbjct: 76 SRSAPVYLLGERYYFRLDDEID---------RFQKDFVSRVWLTYRRDFPALEGTALTTD 126
Query: 168 VGWGCMLRSSQMLV---------------AQALLFH------------------------ 188
GWGCM+RS QML+ ++AL H
Sbjct: 127 CGWGCMIRSGQMLLAQGLLLHLLSREWTWSEALYTHFVEMEPIRSSSPSSMPLSLATDHS 186
Query: 189 -RLGRPWRKPLQKPFDRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 246
R +P + P+ E + I+ F D ++PF +H ++ G +G AG W GP
Sbjct: 187 GRHSQPQTHCSRAPYGGEVHQNIVSWFSDHASAPFGLHRMVALGSIFGKRAGDWYGP--- 243
Query: 247 CRSWEALARCQRAETGLGCQSLPMAIYVVSG----DEDGERGGAPVVCIDDASRHCSVFS 302
+A + + +++YV D E+ A V D SR
Sbjct: 244 ----SIVAHIIKKAIESSSEVPDLSVYVSQDCTVYKADIEQLFAGEVPHTDTSR-----G 294
Query: 303 KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES 362
G+A +++LVP LG E NP Y L+ P LGI+GGKP S Y +G Q+
Sbjct: 295 AGKA----VIILVPARLGGETFNPVYKHCLKEFLRMPSCLGIIGGKPKHSLYFIGYQDNY 350
Query: 363 AIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
+YLDPH QP I+ +D+ + ++H + R + + +DPS FY +++DDF C
Sbjct: 351 LLYLDPHYCQPYIDTSRDNFPLE--SFHCNAPRKLSITRMDPSCTFAFYAKNRDDFGKLC 408
Query: 423 ARASKL-----AEESNGAPLFTVTQ 442
SK+ AEE P+F++++
Sbjct: 409 EHLSKVLHSPQAEEK--YPIFSISE 431
>gi|389750681|gb|EIM91754.1| hypothetical protein STEHIDRAFT_88418 [Stereum hirsutum FP-91666
SS1]
Length = 1286
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 102/312 (32%), Positives = 145/312 (46%), Gaps = 57/312 (18%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKIT---------------------------- 165
NN F DF+SR+ ++YR F PI DS +T
Sbjct: 333 NNWPPVFYSDFTSRVWLTYRSHFQPIRDSTLTALESEQANMAHAGPVIMASSPPTKKWGW 392
Query: 166 ---------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLF 213
SD GWGCMLR+ Q L+A AL+ LGR WR+P + +Y V++L F
Sbjct: 393 PGSGEKGWTSDAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQMLTWF 452
Query: 214 GDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
DS T PFS+H + AGK G G W GP + + L E GLG +A
Sbjct: 453 FDSPTPHCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVS---IA 508
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
D P + + + + G+A +L+L+ + LGL+ VNP Y T
Sbjct: 509 SDSQIFQSDVFAASHPPMDSPSSKKKLASTWGGRA----VLVLIGIRLGLDGVNPIYYET 564
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
++ +TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + L ST +
Sbjct: 565 IKALYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAV-----PLRPPPST--N 617
Query: 392 DVIRHIHLDSID 403
D++ I +SI+
Sbjct: 618 DIVLDISRESIE 629
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 51/91 (56%), Gaps = 15/91 (16%)
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
A+ T+H + +R + L +DPS+ +GF CRD+ D++DF AR + L++ T+
Sbjct: 836 AELKTFHCERVRKMPLSGLDPSMLVGFLCRDEGDWEDFKARVADLSKTHK-----TIFSI 890
Query: 444 HKKPVNH-SDVLGETGGVPEDDSLGVMSMND 473
H +P ++ SD +D LG+ SM++
Sbjct: 891 HDEPPSYPSD---------SEDHLGLESMSE 912
>gi|195437827|ref|XP_002066841.1| GK24338 [Drosophila willistoni]
gi|194162926|gb|EDW77827.1| GK24338 [Drosophila willistoni]
Length = 400
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 157/316 (49%), Gaps = 28/316 (8%)
Query: 135 NGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
N + E + +D SR+ +YR F P+G+ ++T+D GWGCMLR QM++AQAL+ LG
Sbjct: 52 NAIQELDLIRRDIQSRLWCTYRHSFVPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLG 111
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
R W + D Y++I++ F D+ S +S+H + G++ G W+GP + + +
Sbjct: 112 REWYWT-SECRDATYLKIVNRFEDARKSYYSLHQIALMGESQNKMVGEWLGPNTVAQILK 170
Query: 252 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 311
L C L I+V V +DD S+ W P+
Sbjct: 171 KLV-CFDDWCSL-------VIHVAMDS---------TVVLDDIYS----LSQDGESWKPL 209
Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
LL++PL LG+ +NP Y+P L+ F S G++GG+P + Y VG ++ +YLDPH
Sbjct: 210 LLIIPLRLGITDINPIYVPALKRCFELESSCGMIGGRPNQALYFVGYVDDEVLYLDPHTT 269
Query: 372 QPVINIGKDDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
Q +G+ A+ TYH ++ ++DPSLA+ F C+ + F+ + +
Sbjct: 270 QRTGAVGQKTTTAEQELDETYHQKYAARLNFSAMDPSLAVCFICKTQSSFELLLKQLREE 329
Query: 429 AEESNGAPLFTVTQTH 444
+ LF ++++
Sbjct: 330 VLTLSSPALFEISKSR 345
>gi|426230580|ref|XP_004009345.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Ovis
aries]
Length = 438
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 165/353 (46%), Gaps = 35/353 (9%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 83 SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
+TSD GWGCMLRS QM++AQ LL H L R W Q P
Sbjct: 133 GTLTSDCGWGCMLRSGQMMLAQGLLLHLLPRDWTWS-QGAGLGPAEPPGLGSPSPGPGPX 191
Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
G+A G AG W GP +A R C + + VS D
Sbjct: 192 XXXXXXSWGRAPGKKAGDWYGP-------SLVAHILRKAVE-SCSEVTRLVVYVSQDC-- 241
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
V D +R + S A+W +++LVP+ LG E +NP Y+P ++
Sbjct: 242 ------TVYKADVARLVAR-SDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELC 294
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
LGI+GG P S Y +G Q++ +YLDPH QP +++ + D + ++H R +
Sbjct: 295 LGIMGGTPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAK 352
Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHS 451
+DPS +GFY D+ +F+ C+ +++ S+ P+FT+ + H + +HS
Sbjct: 353 MDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLVEGHAQ--DHS 403
>gi|322795203|gb|EFZ18025.1| hypothetical protein SINV_08608 [Solenopsis invicta]
Length = 403
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 110/371 (29%), Positives = 166/371 (44%), Gaps = 64/371 (17%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
I + +W+LG + ++ L +D S++ +YRKGF PIG +S
Sbjct: 16 IPQTDEPVWILGRKYNAIKE-----------LDAIRRDIRSKLWFTYRKGFIPIGGCNST 64
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFS 222
TSD GWGCMLR QM++AQAL+ LG+ W+ P K + Y++IL F D + FS
Sbjct: 65 FTSDKGWGCMLRCGQMVLAQALITLHLGKDWQWMPETK--NNTYLKILSRFEDKRAAAFS 122
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIY 273
IH + G + G G W GP + + W +L + L +
Sbjct: 123 IHQIALTGASEGKEVGQWFGPNTIAQVLKKLIVYDEWSSLTIHVALDNTLIVNDILKQCR 182
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
+ G+ G P+ K + W P+LLL+PL LGL ++NP YI L+
Sbjct: 183 IEGGETAEADGEVPL--------------KAPSQWKPLLLLIPLRLGLSEINPVYINGLK 228
Query: 334 L--------------------TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
+ +F QSLG++GGKP + Y +G + IYLDPH Q
Sbjct: 229 VKFKILCMQKKKYICIQFFQTSFKISQSLGVIGGKPNLALYFIGCVGDEVIYLDPHTTQR 288
Query: 374 V----INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
I ++++E D TYH I + +DPS+A+ F+C + +F C +
Sbjct: 289 SGSVEDKISEEEIEMDI-TYHCKSASRIPITGMDPSVALCFFCATEKEFMSLCKSMQEEL 347
Query: 430 EESNGAPLFTV 440
PLF +
Sbjct: 348 ILPEKQPLFEL 358
>gi|148226916|ref|NP_001087417.1| cysteine protease ATG4D [Xenopus laevis]
gi|61211765|sp|Q68FJ9.1|ATG4D_XENLA RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related protein 4 homolog D
gi|51260960|gb|AAH79754.1| MGC84754 protein [Xenopus laevis]
Length = 469
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/356 (28%), Positives = 161/356 (45%), Gaps = 63/356 (17%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
++ + F +DF SR+ ++YR+ F + + +T+D GWGCM+RS QML+AQ LL H L R
Sbjct: 93 DDEIERFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSRE 152
Query: 194 WR--KPLQKPF----------------------------------------DREYVEILH 211
W + L + F D+ + I+
Sbjct: 153 WTWSEALYRHFVEMEPIRSSSPPSMPLSSLATGHSAGDYQPHTQCSGAPHGDQVHRNIMR 212
Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
F D SPF +H L+ G +G AG W GP +A + + ++
Sbjct: 213 WFSDHPGSPFGLHQLVTLGSIFGKKAGDWYGP-------SIVAHIIKKAIETSSEVPELS 265
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
+YV S D + + D + G+A +++LVP+ LG E NP Y
Sbjct: 266 VYV-SQDCTVYKADIEQLFAGDVPHAETSRGAGKA----VIILVPVRLGGETFNPVYKHC 320
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
L+ P LGI+GGKP S Y +G Q+ +YLDPH QP I+ K+D + ++H
Sbjct: 321 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQPYIDTSKNDFPLE--SFHC 378
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL-----AEESNGAPLFTVTQ 442
+ R I + +DPS FY ++ +DF C K+ AEE P+F++++
Sbjct: 379 NSPRKISITRMDPSCTFAFYAKNSEDFGKLCDHLMKVLHSPRAEEK--YPIFSISE 432
>gi|339252578|ref|XP_003371512.1| cysteine protease ATG4B [Trichinella spiralis]
gi|316968242|gb|EFV52545.1| cysteine protease ATG4B [Trichinella spiralis]
Length = 414
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 115/378 (30%), Positives = 177/378 (46%), Gaps = 63/378 (16%)
Query: 109 STSDIWLLGVCHKIAQDE-------ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
S S IWLLG + A++E + + L++F +DF +RI +YR GF I
Sbjct: 45 SHSPIWLLG--KQYAKNEPRPNLRRGFDENSAVGKLSDFLEDFRTRIWFTYRHGFPCIPG 102
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDREYVEILHLFGDSET 218
+K +D GWGC +RS QML+A+ +L H LGR W + L + + +++ LF D+ T
Sbjct: 103 TKFDNDCGWGCTIRSGQMLLAETMLRHYLGRDWLLGQSGLPEDEALMHRKVIGLFCDNLT 162
Query: 219 SPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
SPFS+HNL+Q G+ +G AGSW GP ++ + + +A E GL +A++V+
Sbjct: 163 SPFSLHNLVQVGQQLFGKQAGSWYGPVSVLQILQ-VAMNNAIERGL---VEGLAVHVIGD 218
Query: 278 DE----DGERGG-----APV----------------VCIDDASRHCSV------------ 300
E D ER G APV D R SV
Sbjct: 219 GELIIDDVERLGCGLTLAPVPRRGPENDLADRQPKSSSYLDLRRLTSVSNGDLLPSHDGE 278
Query: 301 ------FSKGQADWTP-ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
F W+ +L+L+PL LG+EK N Y L+ + +G++GG+
Sbjct: 279 SIGSTEFVDETRSWSRGVLVLLPLRLGVEKFNQLYSDHLKRVLSTKFCVGVIGGRHHKCY 338
Query: 354 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 413
Y G + I LDPH QP ++ + + ++H + + IDP +IGFY R
Sbjct: 339 YFCGWHTDYLIRLDPHYSQPAVDATQPGVS--LHSFHCKYPKKTLIADIDPWCSIGFYIR 396
Query: 414 DKDDFDDFCARASKLAEE 431
++ + F A S++ E
Sbjct: 397 NRLELQSFLADISEVGFE 414
>gi|403296347|ref|XP_003939073.1| PREDICTED: cysteine protease ATG4D [Saimiri boliviensis
boliviensis]
Length = 463
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 109/409 (26%), Positives = 173/409 (42%), Gaps = 101/409 (24%)
Query: 76 NGWTAAVKRLV-TAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGN 134
NG AV R++ AG + SRT S +S + +C + + E GD
Sbjct: 80 NGIAVAVMRVLHLAGRCPHVSPGWAVKSRTSFSKISS----IHLCGRRYRFEGEGD---- 131
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
+ F +DF SR+ ++YR+ F P+ +TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 132 --IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDW 189
Query: 195 R---------KPLQKPF-------------------------DREYVEILHLFGDSETSP 220
L P +R + +I+ F D +P
Sbjct: 190 TWAEGTGLGPPELSGPASPSRYHGPARWMPPCWAQGAPELEQERRHRQIVSWFADHPQAP 249
Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
F +H L++ G++ G AG W GP +A R + + +YV
Sbjct: 250 FGLHRLVELGQSSGKKAGDWYGP-------SLVAHILRKAVESSSEVTRLVVYV------ 296
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
S+ C+ G+ TP L + LR
Sbjct: 297 --------------SQDCT----GKGTCTPSLQEL----------------LRCELC--- 319
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
LGI+GGKP S Y +G Q++ +YLDPH QP +++ + + + ++H R +
Sbjct: 320 -LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE--SFHCTSPRKMAFA 376
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 446
+DPS +GFY D+ +F+ C+ +++ S+ P+FT+ + H +
Sbjct: 377 KMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQ 425
>gi|410918329|ref|XP_003972638.1| PREDICTED: cysteine protease ATG4D-like [Takifugu rubripes]
Length = 499
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 115/387 (29%), Positives = 175/387 (45%), Gaps = 76/387 (19%)
Query: 120 HKIAQDEALGDAAGNNG---LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC---- 172
+KI+ LGD+ N + F F SRI ++YRK F + S T+D GWGC
Sbjct: 83 NKISPVTILGDSYLLNSEDEVERFRLAFVSRIWLTYRKEFPQLEGSTWTTDCGWGCMLRS 142
Query: 173 --MLRSSQMLV-----------AQAL------LFH-----RLG----------------- 191
ML + +LV AQ L +F R G
Sbjct: 143 GQMLLAQGLLVHLMPRGWTWPDAQPLTDVDLEVFRPRSPARAGGVPIPSFASPRGPSTPE 202
Query: 192 RPW----------RKPLQKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
RP +K L+ DR+ + +++ FGD T+PF IH L++ GK+ G A
Sbjct: 203 RPLLSEQATKCSRKKRLESVQDRQAEPTHQKLVFWFGDQPTAPFGIHQLVEIGKSAGKKA 262
Query: 238 GSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 296
G W GP + +A+AR + + +YV D + +C S+
Sbjct: 263 GDWYGPAIVAHILRKAVARASAVHS--------LVVYVAQ-DCTVYKEDVMHLCDPTPSQ 313
Query: 297 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
S QA W +++LVP+ LG E +NP YI ++ +GI+GGKP S Y V
Sbjct: 314 TPSDPLSHQA-WKSVIILVPVRLGGECLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFV 372
Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
G Q+E +YLDPH QPV+++ + + + ++H + + + + +DPS IGFY + K
Sbjct: 373 GFQDEQLLYLDPHYCQPVVDVSQ--VNSSLESFHCNAPKKMPFNRMDPSCTIGFYAKSKK 430
Query: 417 DFDDFC-ARASKLAEESNGAPLFTVTQ 442
DF+ C A + L+ PLFT +
Sbjct: 431 DFESLCSAVGTALSSSKERYPLFTFIE 457
>gi|396482697|ref|XP_003841525.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
gi|312218100|emb|CBX98046.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
Length = 462
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 109/353 (30%), Positives = 152/353 (43%), Gaps = 82/353 (23%)
Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVG 169
A N + F DF SRI ++YR GF I S+ TSD G
Sbjct: 87 AQYGNWPSAFLDDFESRIWMTYRSGFPVIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
+GCM+RS Q ++A AL RLGR WR P +E+ IL LF D +PFSIH ++
Sbjct: 147 FGCMIRSGQCILANALQTLRLGRDWRY-QDDPTAQEHCNILSLFADDPQAPFSIHRFVEH 205
Query: 230 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G A G G W GP A R + L + E GL +YV SGD GA V
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLVH-KYKEAGL-------RVYV-SGD------GADV 250
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+D + +V G+ W P L+LV LG++K+ P Y L+ + QS+GI GG+
Sbjct: 251 Y--EDKLKQVAVEEDGE--WIPTLILVGTRLGIDKITPVYWEALKASLQMKQSMGIAGGR 306
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI----NIGKDDLEA-------------------- 384
P AS Y V Q YLDPH +P + D+
Sbjct: 307 PSASHYFVATQANHFFYLDPHSTRPHLPYRPPTSSDETTTQLASSITSTSSSTTIVPSAS 366
Query: 385 ----------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
D S+ H+ IR + + +DPS+ + F ++D++ +
Sbjct: 367 SLPPRSPPEPSTYTLDDISSCHTRRIRRLQIREMDPSMLLAFLVTSQEDYEKW 419
>gi|216963264|gb|ACJ73916.1| autophagy-related 4b variant 4 [Zea mays]
Length = 208
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/168 (45%), Positives = 109/168 (64%), Gaps = 12/168 (7%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
S+ K S+LS +F ++FE + S++ A K S W+ ++R V +GSM R
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
+ LG +R ++ D+W LG C++++ ++E G + ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
RKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 205
>gi|216963276|gb|ACJ73918.1| autophagy-related 4b variant 6 [Zea mays]
Length = 271
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 77/168 (45%), Positives = 109/168 (64%), Gaps = 12/168 (7%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
S+ K S+LS +F ++FE + S++ A K S W+ ++R V +GSM R
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
+ LG +R ++ D+W LG C++++ ++E G + ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
RKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 205
>gi|216963270|gb|ACJ73917.1| autophagy-related 4b variant 5 [Zea mays]
Length = 292
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 77/168 (45%), Positives = 109/168 (64%), Gaps = 12/168 (7%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
S+ K S+LS +F ++FE + S++ A K S W+ ++R V +GSM R
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104
Query: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
+ LG +R ++ D+W LG C++++ ++E G + ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
RKGFD I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 205
>gi|393219109|gb|EJD04597.1| hypothetical protein FOMMEDRAFT_133827 [Fomitiporia mediterranea
MF3/22]
Length = 1147
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 135/298 (45%), Gaps = 62/298 (20%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 164
G N F DFSSR+ ++YR + PI D +
Sbjct: 335 GANWPPGFYSDFSSRVWLTYRSHYPPIRDQTLAQLEAEASGQIPLQPVSASPRKWHILGS 394
Query: 165 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDS 216
TSD GWGCMLR+ Q L+A AL+ LGR WR+P Q + + YV+IL F DS
Sbjct: 395 GEKGWTSDSGWGCMLRTGQSLLANALIHLHLGRDWRRPPQPVYTVDYATYVKILTWFFDS 454
Query: 217 ET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV 274
PFS+H + AGK G G W GP + + + AE GLG S+ V
Sbjct: 455 TDIHCPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTVVHA-FAEAGLGV-SVATDGVV 512
Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---------------W--TPILLLVPL 317
D P + RH + + + W P+L+LV +
Sbjct: 513 YETDVLAASNAGPYMY-----RHSRMATSSPSTRRRRSAQQQQSMMSIWGQRPVLVLVGI 567
Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
LG++ VNP Y ++ FTFPQS+GI GG+P +S Y VGVQ ++ YLDPH +P +
Sbjct: 568 RLGIDCVNPVYYDAVKALFTFPQSVGIAGGRPSSSYYFVGVQTDNLFYLDPHHSRPSV 625
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 29/42 (69%)
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
T+H D +R + L S+DPS+ IGF CRD+ D+ D R ++++
Sbjct: 728 TFHCDRVRKMPLSSLDPSMLIGFLCRDERDWKDLRERVTEMS 769
>gi|355703136|gb|EHH29627.1| Cysteine protease ATG4D [Macaca mulatta]
Length = 511
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 163/382 (42%), Gaps = 80/382 (20%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 129 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 182
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPW-------------------------------RK 196
GWGCMLRS QM++AQ LL H L R W R
Sbjct: 183 CGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRW 242
Query: 197 PLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 253
P +R + +I+ F D +PF +H L++ G++ G AG W GP +
Sbjct: 243 AQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLV 295
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS------KGQAD 307
A R C + + VS D +PV + + + +
Sbjct: 296 AHILRKAVE-SCSEVTRLVVYVSQDCTAAEASSPVSDTPASGPLHLLPLLLGVLFQQRCR 354
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W L V +L E LGI+GGKP S Y +G Q++ +YLD
Sbjct: 355 W----LFVCELLRCELC-----------------LGIMGGKPRHSLYFIGYQDDFLLYLD 393
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+ ++
Sbjct: 394 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 451
Query: 428 LAEESNGA---PLFTVTQTHKK 446
+ S+ P+FT+ + H +
Sbjct: 452 VLGSSSATERYPMFTLAEGHAQ 473
>gi|426329870|ref|XP_004025954.1| PREDICTED: cysteine protease ATG4C [Gorilla gorilla gorilla]
Length = 491
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/452 (25%), Positives = 176/452 (38%), Gaps = 116/452 (25%)
Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
S S + LLG C+ +D+ L +G + EF +DF SRI ++YR+ F
Sbjct: 36 SRNSPVLLLGKCYHFKYEDEDKTLPTESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
I S +T+D GWGC LR+ QML+AQ L+ H LGR W P K
Sbjct: 96 PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155
Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
F +RE+ +I+ FGDS + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
L++ GK G AG W GP + R G + IYV
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
V D + + + AD +++LVP+ LG E+ N Y+ ++ + +GI
Sbjct: 264 -CTVYNYDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322
Query: 345 VGGKPGASTYIVGVQE----------------ESAIYLDPHDVQPVINIGKDDLEADT-- 386
+GGKP S Y G QE ++ + L+ + +P + G +D +
Sbjct: 323 IGGKPKQSYYFAGFQENEVQRSSMNSLKQKSSKNNLKLEGSEKRPQMGFGSEDEFKNILL 382
Query: 387 -------------STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 433
T+H + + +DPS IGFYCR+ DF+ +K+ + S+
Sbjct: 383 DHVQAFGPPSYPRLTFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFERASEEITKMLKFSS 442
Query: 434 GA--PLFTVTQTHKK-------PVNHSDVLGE 456
PLFT H + N D+ E
Sbjct: 443 KEKYPLFTFVNGHSRDYDFTSTTTNEEDLFSE 474
>gi|409077121|gb|EKM77488.1| hypothetical protein AGABI1DRAFT_108018 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1355
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 97/292 (33%), Positives = 133/292 (45%), Gaps = 65/292 (22%)
Query: 140 FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 165
F DF SRI ++YR F PI DS +T
Sbjct: 334 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 393
Query: 166 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSE 217
SD GWGCMLR+ Q L+A AL+ LGR WRKP + +Y V+IL F D+
Sbjct: 394 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 453
Query: 218 T--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
+ +PFS+H + AGK +G G W GP + + L P + V
Sbjct: 454 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 502
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQAD--------W--TPILLLVPLVLGLEKVN 325
S +DG V A + + ++ W P+L+LV L LG++ VN
Sbjct: 503 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 562
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
P Y T++ FT PQS+GI GG+PG+S Y VG Q ++ YLDPH +P I +
Sbjct: 563 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 614
>gi|426191859|gb|EKV41798.1| hypothetical protein AGABI2DRAFT_123279 [Agaricus bisporus var.
bisporus H97]
Length = 1261
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 101/292 (34%), Positives = 134/292 (45%), Gaps = 65/292 (22%)
Query: 140 FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 165
F DF SRI ++YR F PI DS +T
Sbjct: 247 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 306
Query: 166 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSE 217
SD GWGCMLR+ Q L+A AL+ LGR WRKP + +Y V+IL F D+
Sbjct: 307 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 366
Query: 218 T--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
+ +PFS+H + AGK +G G W GP + + L P + V
Sbjct: 367 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 415
Query: 276 SGDEDGERGGAPVVCIDDA-------SRHCSVFSKGQA-DW--TPILLLVPLVLGLEKVN 325
S +DG V A + S S QA W P+L+LV L LG++ VN
Sbjct: 416 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 475
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
P Y T++ FT PQS+GI GG+PG+S Y VG Q ++ YLDPH +P I +
Sbjct: 476 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 527
>gi|302674653|ref|XP_003027011.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
gi|300100696|gb|EFI92108.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
Length = 858
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 94/275 (34%), Positives = 132/275 (48%), Gaps = 58/275 (21%)
Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPI------------------------GDSKITS 166
AA + EF DF+SR+ ++YR GF PI G +TS
Sbjct: 148 AAASGWPQEFFSDFASRLWLTYRSGFAPIRDMALEELEPVRGGALSTLTSALTGRRGLTS 207
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIH 224
D GWGCMLR+ Q L+A AL+ +GR Y+ ++ LF DS + +PFS+H
Sbjct: 208 DAGWGCMLRTGQSLLANALVVAWMGRGALA--------LYIHLISLFLDSPSPSAPFSVH 259
Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
+ AG+A G G W GP + +AL + GLG V+ EDG
Sbjct: 260 RMALAGRALGKDVGQWFGPSTAAGAIKALVNAY-PDAGLG----------VAIAEDG--- 305
Query: 285 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
V R + + +W P+L+L+ + LGL+ VNP Y T++ +TFPQSL
Sbjct: 306 ----VVYQTQRRQ----KEREREWGDQPVLVLLGIRLGLDGVNPIYYDTIKQLYTFPQSL 357
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
GI GG+P +S Y VG Q YLDPH +P + +
Sbjct: 358 GIAGGRPSSSYYFVGAQAGDLFYLDPHHARPTVPL 392
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 33/47 (70%)
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430
A+T T+H + +R + + +DPS+ IGF C+D+ D++D+ R SKL +
Sbjct: 537 AETRTFHCERVRKMPMSGLDPSMLIGFLCKDRADWEDWRTRVSKLPK 583
>gi|392572178|gb|EIW65350.1| hypothetical protein TRAVEDRAFT_33890 [Trametes versicolor
FP-101664 SS1]
Length = 997
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 91/282 (32%), Positives = 133/282 (47%), Gaps = 58/282 (20%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 166
F DF+SRI ++YR F PI D+ + T+
Sbjct: 298 FYADFTSRIWLTYRSQFFPIRDTTLAALDAELMDNPTGVPSSPPTKKWNWPLGGEKGWTT 357
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 221
D GWGCMLR+ Q L+A AL+ LGR WR+P + +Y V+I+ F D+ + PF
Sbjct: 358 DAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQIVTWFLDNPSPLCPF 417
Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
S+H + GK G G W GP + + L + P A V+ DG
Sbjct: 418 SVHRMALVGKDLGKDVGQWFGPSTAAGAIKTL-----------VHAFPEATLGVANAVDG 466
Query: 282 ERGGAPVVCIDDASRHCSVFSK----GQADW--TPILLLVPLVLGLEKVNPRYIPTLRLT 335
+ V ASR ++ + DW +L+L+ + LG+E VNP Y T++
Sbjct: 467 TLYESDVYA---ASRSVMYSTRRHGHARMDWGDRAVLVLIGIRLGIEGVNPLYYNTIKTL 523
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
+TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + +
Sbjct: 524 YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPL 565
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
+ + T+H D +R + L +DPS+ +GF C+D+ ++ D R ++L N +F++
Sbjct: 696 QTELKTFHCDRVRKMPLSGLDPSMLLGFLCKDEAEWLDLKDRIAELFR--NNKSIFSLAN 753
Query: 443 THKKPVNHSDVLG 455
+ + SD +G
Sbjct: 754 EPPQYPSDSDDMG 766
>gi|116179672|ref|XP_001219685.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
gi|88184761|gb|EAQ92229.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
Length = 425
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 142/307 (46%), Gaps = 73/307 (23%)
Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
F DF SRI ++YR GF+PI GD + +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 235
Q L+A ALL +LGR WR+ +R I+ LF D +P+S+ N ++ G A G
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAER---NIVALFADDARAPYSLQNFVKHGAIACGK 229
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
G W GP A R +ALA + + IY G P V D
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
S + + D + PTL L QS+GI GG+P +S Y
Sbjct: 270 ---SFLATARPD-----------------GETFHPTLIL---MEQSIGIAGGRPSSSHYF 306
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
VGVQ + YLDPH +P + ++ L + + H+ +R++H++ +DPS+ IGF
Sbjct: 307 VGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSCHTRRLRYLHVEDMDPSMLIGFLI 366
Query: 413 RDKDDFD 419
+D+DD+D
Sbjct: 367 QDEDDWD 373
>gi|449551395|gb|EMD42359.1| ATG4-like protein [Ceriporiopsis subvermispora B]
Length = 988
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 94/272 (34%), Positives = 126/272 (46%), Gaps = 57/272 (20%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 167
F DF+SRI ++YR F PI D+ + TSD
Sbjct: 305 FYSDFTSRIWVTYRSQFQPIRDTTLSALELELGESTAVATSPQPKKWNWPLGGEKGWTSD 364
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD------REYVEILHLFGDSETS-- 219
GWGCMLR+ Q L+A LL LGR WR+P P+ YV+IL F D+ +
Sbjct: 365 AGWGCMLRTGQSLLANTLLHLHLGRDWRRP---PYPICTADYATYVQILTWFFDNPSPLC 421
Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
PFS+H + GK G G W GP + + L E GLG S+ + D
Sbjct: 422 PFSVHRMALVGKELGKEVGQWFGPSTAAGAIKTLVHA-FPEAGLGV-SVATDSVIYQSD- 478
Query: 280 DGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
V S S G++ W +L+LV + LGL+ VNP Y T++ +T
Sbjct: 479 ---------VYTASRSNLGSPRRNGRSGWGDRAVLVLVGIRLGLDGVNPIYYDTIKALYT 529
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
FPQS+GI GG+P +S Y VG Q ++ YLDPH
Sbjct: 530 FPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 561
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 49/96 (51%), Gaps = 15/96 (15%)
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
T+H + IR + L +DPS+ IGF C+D++D+ D R + L+ T+ +P
Sbjct: 693 TFHCERIRKMPLSGLDPSMLIGFLCKDEEDWLDLRKRITDLSRTHK-----TIFSIQDEP 747
Query: 448 VN-HSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 482
N SD DD++G+ S+++ + ED+
Sbjct: 748 PNWPSD---------SDDNMGLESISEPDIDMPEDE 774
>gi|410989159|ref|XP_004000832.1| PREDICTED: cysteine protease ATG4A isoform 2 [Felis catus]
Length = 336
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 149/334 (44%), Gaps = 71/334 (21%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG-GKPGA 351
D + C V P S VG PG
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTVGESTPG- 202
Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLDSIDPSLAIGF 410
T Q + I+LDPH Q +N +++ D T+H + +++ ++DPS+A+GF
Sbjct: 203 -TLNASNQSDELIFLDPHTTQTFVNT-EENGTVDDQTFHCLQSPQRMNILNLDPSVALGF 260
Query: 411 YCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+C+++ DFD++C+ K + N +F + Q H
Sbjct: 261 FCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|350595874|ref|XP_003484197.1| PREDICTED: cysteine protease ATG4A [Sus scrofa]
Length = 393
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 158/347 (45%), Gaps = 68/347 (19%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PI W
Sbjct: 57 VWILGKQHLLKTEKS-----------KLLADISARLWFTYRRKFSPID---------WN- 95
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
W K ++P +EY IL F D + +SIH + Q G
Sbjct: 96 ---------------------WEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGVG 132
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGERGGAPV 288
G + G W GP + + + LA + +A+YV + ED ++
Sbjct: 133 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCCAS 184
Query: 289 VCIDDA-------SRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
DA S + S SKG + W P+LL+VPL LG+ ++NP Y+ + F
Sbjct: 185 ALSADAAVESRRDSLNASTQSKGPSACRPAWKPLLLIVPLRLGINQINPVYVDAFKECFK 244
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 397
PQSLG +GGKP + Y +G + I+LDPH Q ++ ++ + D + + + +
Sbjct: 245 MPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQPPQRM 304
Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
++ ++DPS+A+GF+C+++ DFD +C+ K + N +F + Q H
Sbjct: 305 NILNLDPSVALGFFCQEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 350
>gi|302684483|ref|XP_003031922.1| hypothetical protein SCHCODRAFT_109321 [Schizophyllum commune H4-8]
gi|300105615|gb|EFI97019.1| hypothetical protein SCHCODRAFT_109321, partial [Schizophyllum
commune H4-8]
Length = 602
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 102/311 (32%), Positives = 144/311 (46%), Gaps = 82/311 (26%)
Query: 112 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------- 164
+IWL+GVCH G +F DF++RI ++YR GF+ I D ++
Sbjct: 114 EIWLMGVCHA-------------PGAPDFYADFATRIWLTYRSGFELIRDRQLIDLPPPV 160
Query: 165 ------------------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
+SD GWGCMLR+ Q L+A ALL GR WR+ +
Sbjct: 161 ASLDGHLQGEWATDEAEPPGAYGFSSDSGWGCMLRTGQSLLANALLTAWFGRDWRRISEV 220
Query: 201 PFDRE--YVEILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
+ YV +L LF D+ T+PFSIH + AGK G G W GP + + L
Sbjct: 221 ETHQHSLYVHLLSLFLDTPHPTAPFSIHRMALAGKQLGKDIGQWFGPSTAAGAIKNL--- 277
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT------- 309
+ P+A G VV +D A VF+ ++W+
Sbjct: 278 --------VSAYPLA------------GIGVVVGMDGALSKSEVFTASHSEWSDEEAALD 317
Query: 310 ----PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
P+L+L+ L LGL++VNP Y T++ FTFPQS+GI GG+P +S + VG Q IY
Sbjct: 318 WGDRPVLILLNLRLGLDRVNPIYHDTIKALFTFPQSVGIAGGRPCSSYHFVGAQGSDLIY 377
Query: 366 LDPHDVQPVIN 376
LDPH + +
Sbjct: 378 LDPHHTRNTVR 388
Score = 41.6 bits (96), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
AD +T+H + + + + DPS+ GF C+D D+DD+ AR S+L +
Sbjct: 524 HADLATFHCTNPKMMPISAQDPSMLAGFLCKDIADWDDWRARMSRLPNQ 572
>gi|431905146|gb|ELK10197.1| Cysteine protease ATG4A [Pteropus alecto]
Length = 342
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 93/333 (27%), Positives = 150/333 (45%), Gaps = 69/333 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS-SRILISYRKGFDPIGDSKITSDVGWG 171
+W+LG H + D L E F+ + L ++ G P +SD GWG
Sbjct: 35 VWILGKQHLLKTD----------SLPEIISHFTETSELTAHDGGTGP------SSDAGWG 78
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
CMLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + +
Sbjct: 79 CMLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKM-- 136
Query: 232 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
C LP++ + + + G
Sbjct: 137 ---------------------------------CCILPLSADIATENPSGS--------- 154
Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
+AS H S W P+LL+VPL LG+ ++NP Y+ + SLG +GGKP
Sbjct: 155 PNASNHSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFK-------SLGALGGKPNN 207
Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
+ Y +G + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+
Sbjct: 208 AYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILNLDPSVALGFF 267
Query: 412 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
C+++ DFD +C+ K + N +F + Q H
Sbjct: 268 CKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 299
>gi|294654609|ref|XP_456671.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
gi|218511938|sp|Q6BYP8.2|ATG4_DEBHA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|199429011|emb|CAG84627.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
Length = 492
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/343 (28%), Positives = 156/343 (45%), Gaps = 79/343 (23%)
Query: 130 DAAGNNGLAEFNQDFSSRILISYRKGFDPIG----------------------------- 160
D + ++G+ E QD S+I ++YR GF+PI
Sbjct: 77 DISVDDGVIE--QDIYSKIWLTYRTGFEPIAKCLDGPQPLSFVQSMVFNRNPISSTFNNF 134
Query: 161 -----DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW-----RKPLQKPFDREYVEIL 210
+ T+DVGWGCM+R+SQ L+A LGR + R P + EI+
Sbjct: 135 HGLLDNDNFTTDVGWGCMIRTSQALLANTYQLLFLGRGFSYGRDRSP-------RHDEII 187
Query: 211 HLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
+F D +PFS+HN ++ L G W GP A S + L C +
Sbjct: 188 DMFMDEPRAPFSLHNFIKVASESPLKVKPGQWFGPNAASLSIKRL-----------CDN- 235
Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP----ILLLVPLVLGLEKV 324
+Y +G G VV + ++ + + ++ P IL+L+P+ LG++KV
Sbjct: 236 ---VYESNG-----TGRVKVVISESSNLYDDIITQMFTTLNPVPDAILVLLPVRLGIDKV 287
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
NP Y ++ QS+GI GGKP +S Y G + +YLDPH Q V N +
Sbjct: 288 NPLYHASVLELLALRQSVGIAGGKPSSSFYFFGYKGNDLLYLDPHYPQFVRN-----KTS 342
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
TYH++ + + +D +DPS+ IG +D +D++DF + +K
Sbjct: 343 VYDTYHTNSYQKLSVDDMDPSMMIGILIKDINDYEDFKSSCTK 385
>gi|444518589|gb|ELV12252.1| Cysteine protease ATG4B, partial [Tupaia chinensis]
Length = 324
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 87/300 (29%), Positives = 130/300 (43%), Gaps = 56/300 (18%)
Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
++ +W+LG + + ++ E D +SR+ +YRK F IG + TSD
Sbjct: 26 TSEPVWILGRKYSVLTEKE-----------EILSDVASRLWFTYRKNFPAIGGTGPTSDT 74
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
GWGCMLR QM+ AQAL+ LGR WR Y +L+ F D + S +SIH + Q
Sbjct: 75 GWGCMLRCGQMIFAQALVCRHLGRDWRWAQWTQQPDSYFNVLNAFIDRKDSYYSIHQIAQ 134
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G + G W GP + + + LA
Sbjct: 135 MGVGEGKSIGQWYGPNTVAQVLKKLA---------------------------------- 160
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
VF + I + +V G +N Y+ TL+ F PQSLG++GGK
Sbjct: 161 -----------VFDTWSSLAVHIAMDNTVVTGEININEAYVETLKHCFMMPQSLGVIGGK 209
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P ++ Y +G + IYLDPH QP + + L D S + + + +DPS+A+
Sbjct: 210 PNSAHYFIGYVGDELIYLDPHTTQPAVELTDSCLVPDESFHCQHPPSRMSIRELDPSIAV 269
>gi|19115683|ref|NP_594771.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe
972h-]
gi|62899818|sp|Q9P373.1|ATG4_SCHPO RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|9588465|emb|CAC00556.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe]
Length = 320
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/341 (29%), Positives = 144/341 (42%), Gaps = 53/341 (15%)
Query: 91 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 150
M R ER L + T + IW LG +KI + +F D S I I
Sbjct: 4 MARFLERYLHFAPTNTEPPGTLIWFLGHSYKIEDSQ---------WPEKFLYDSFSLITI 54
Query: 151 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
+YR G + G +TSD GWGCM+RS+Q L+A L R+ P +++ EIL
Sbjct: 55 TYRSGIE--GLENMTSDTGWGCMIRSTQTLLANCL---RICYP---------EKQLKEIL 100
Query: 211 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
LF D ++PFSIH + GK + G W GP C +AR +P
Sbjct: 101 ALFADEPSAPFSIHQFVTMGKTLCDINPGQWFGPTTSC---SCVARLSDQNP-----DVP 152
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 329
+ +YV R V P+LLL+P LG++ +N Y
Sbjct: 153 LHVYVARNGNAIYRDQLSKVSF------------------PVLLLIPTRLGIDSINESYY 194
Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
L F +GI GG+P ++ Y Q + YLDPH + A T+
Sbjct: 195 DQLLQVFEIRSFVGITGGRPRSAHYFYARQNQYFFYLDPHCTHFAHTTTQ---PASEETF 251
Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430
HS +R + + +DP + GF RD++++ F A A+
Sbjct: 252 HSATLRRVAIQDLDPCMIFGFLIRDEEEWHSFEANQKYFAD 292
>gi|402911089|ref|XP_003918175.1| PREDICTED: cysteine protease ATG4A isoform 2 [Papio anubis]
Length = 336
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRP-LD 202
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
Q + I+LDPH Q ++ ++ + D + + + +++ ++DPS+A+GF+C
Sbjct: 203 YLTASNQSDELIFLDPHTTQTFVDTEENGMVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+++ DFD++C+ K + N +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|395854620|ref|XP_003799780.1| PREDICTED: cysteine protease ATG4A isoform 2 [Otolemur garnettii]
Length = 336
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/333 (27%), Positives = 147/333 (44%), Gaps = 69/333 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G P S
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTAGESPPGS 203
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLDSIDPSLAIGFY 411
+ Q I+LDPH Q ++ +++ D T+H + +++ ++DPS+A+GF+
Sbjct: 204 LTALN-QSNELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSPQRMNILNLDPSVALGFF 261
Query: 412 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
C+++ DFD++C+ K + N +F + Q H
Sbjct: 262 CKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|119623101|gb|EAX02696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_g
[Homo sapiens]
Length = 340
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 184
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 185 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 207
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 208 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 266
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+++ DFD++C+ K + N +F + Q H
Sbjct: 267 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 297
>gi|332226094|ref|XP_003262224.1| PREDICTED: cysteine protease ATG4A isoform 2 [Nomascus leucogenys]
Length = 336
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 181 DIKKMCCV-------------------------------------LPLSADTAGDRPPDS 203
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+++ DFD++C+ K + N +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|30795248|ref|NP_840054.1| cysteine protease ATG4A isoform b [Homo sapiens]
gi|426397038|ref|XP_004064735.1| PREDICTED: cysteine protease ATG4A isoform 2 [Gorilla gorilla
gorilla]
gi|15487242|emb|CAC69077.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
gi|119623095|gb|EAX02690.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 336
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 203
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+++ DFD++C+ K + N +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|448114689|ref|XP_004202639.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
gi|359383507|emb|CCE79423.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
Length = 480
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 96/330 (29%), Positives = 150/330 (45%), Gaps = 60/330 (18%)
Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 159
LG G++ E +D SRI +YR GF+PI
Sbjct: 69 LGRRYGSSSKEEMEKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128
Query: 160 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 213
+ T+DVGWGCM+R+SQML+A A LGR + ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAFQLLLLGRDF--AYVDGSEKKHSDIIDMF 186
Query: 214 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
D +PFS+HN ++A L G W GP A S + L C+ G S
Sbjct: 187 TDEPKTPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRL--CKSQFDGSVSPSF-RV 243
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
I S D ++ G + I+++ IL+L+P+ LGL KV+P Y +
Sbjct: 244 IISESCDIYDDKIGKLLQEIENSE-------------DAILILLPVRLGLNKVSPYYHDS 290
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
L F Q +GI GGKP +S Y G +YLDPH Q + + T+H+
Sbjct: 291 LSSLFCSSQLVGIAGGKPSSSYYFFGSHNGHLLYLDPHYPQSM------KASSIYDTFHT 344
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+ ++ + ++ +DPS+ IG + K+D++ F
Sbjct: 345 NKVQSLKIEDMDPSMLIGILIKSKEDYESF 374
>gi|336381646|gb|EGO22797.1| cysteine protease required for autophagy [Serpula lacrymans var.
lacrymans S7.9]
Length = 992
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 98/284 (34%), Positives = 134/284 (47%), Gaps = 49/284 (17%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 164
G+N F DF+SRI ++YR F PI DS +
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350
Query: 165 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGD 215
TSD GWGCMLR+ Q L+A ALL LGR WR+P +Y V+I+ F D
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPVHTTDYATYVQIITWFFD 410
Query: 216 --SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 273
S SPFS+H + AGK G G W GP + + L E GLG +
Sbjct: 411 TPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGVI 469
Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
S + A I RH V G+A +++L+ + LGL+ VNP Y T++
Sbjct: 470 FQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTIK 520
Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
+TFPQS+GI GG+P +S Y +G Q ++ YLDPH +P + +
Sbjct: 521 ALYTFPQSVGIAGGRPSSSYYFMGSQADNLFYLDPHHARPAVPL 564
Score = 45.1 bits (105), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 41/84 (48%), Gaps = 6/84 (7%)
Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
G E + LDP V D L T+H D +R + + +DPS+ +GF C+D++
Sbjct: 681 GDSEGAGEALDPMAEHYVNAYSPDQLR----TFHCDRVRKMPMSGLDPSMLLGFLCKDEN 736
Query: 417 DFDDFCARASKLAEESNGAPLFTV 440
D+ DF R + L +FTV
Sbjct: 737 DWFDFRRRVNDLMHRHKT--IFTV 758
>gi|409050837|gb|EKM60313.1| hypothetical protein PHACADRAFT_179659 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1009
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 132/290 (45%), Gaps = 61/290 (21%)
Query: 140 FNQDFSSRILISYRKGFDPI-------------------------------GDSKITSDV 168
F DF+SRI ++YR F PI GD +SD
Sbjct: 308 FYADFTSRIWLTYRSQFLPIRDMSLEELNAAPESAALSTGSQAKKWSWSLSGDKCWSSDA 367
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFSI 223
GWGCMLR+ Q L+A AL+ LGR WRKP +Y ++I+ F D + PFS+
Sbjct: 368 GWGCMLRTGQSLLANALIHVHLGRDWRKPPHPVPTSDYATYIQIITWFFDDPSLLCPFSV 427
Query: 224 HNLLQAGKAYGLAAGSWVGP--------YAMCRSWEALARCQRAETGLGCQSLPMA---I 272
H + GK G+ G W GP Y S ++ Q A L + P A I
Sbjct: 428 HRMALVGKQLGVKVGQWFGPSTAAGAIKYVSAHS--SMVPNQPARRTL-VHAFPEAGLGI 484
Query: 273 YVVSGD---EDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPR 327
YV + D E A I RH W P+L+L+ LG++ VNP
Sbjct: 485 YVAADGGTIYDSEVFAASHSGIGSPRRHTRRV------WGDRPVLILIGHRLGIDGVNPI 538
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
Y TL+ +T+PQS+GI GG+P +S Y VG Q ++ YLDPH +P I +
Sbjct: 539 YYDTLKTLYTWPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPTIPL 588
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 36/51 (70%), Gaps = 1/51 (1%)
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 438
T+H D +R + L S+DPS+ IGF C+D+ ++ D +R ++L+ +S +P+F
Sbjct: 728 TFHCDRVRKMPLSSLDPSMLIGFLCKDESEWQDLKSRINELSRKSK-SPVF 777
>gi|397497902|ref|XP_003819742.1| PREDICTED: cysteine protease ATG4A isoform 2 [Pan paniscus]
Length = 336
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 203
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+++ DFD++C+ K + N +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293
>gi|348520913|ref|XP_003447971.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
Length = 500
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 119/239 (49%), Gaps = 13/239 (5%)
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
++ FGD +PF +H L+ GK G AG W GP +A R
Sbjct: 235 LVTWFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGP-------SVVAHILRKAVDKTSVVT 287
Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
+A+YV +D VV + D S + + DW +++LVP+ LG E +NP Y
Sbjct: 288 NLAVYVA---QDCTVYKEDVVRLCDRSLNQTSSDPSSQDWKSVIILVPVRLGGEALNPSY 344
Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
I ++ +GI+GGKP S Y +G Q+E +YLDPH QPV+++ + + + +
Sbjct: 345 IDCVKNFLKLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDVSQINFSLE--S 402
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQTHKK 446
+H + + + +DPS IGFY ++K DF+ C+ S+ L+ P+FT + H +
Sbjct: 403 FHCSSPKKMPFNRMDPSCTIGFYAKNKKDFESLCSAVSEALSSSKEKYPVFTFVEGHSQ 461
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 38/61 (62%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
+ F F SRI ++YR+ F + S T+D GWGCMLRS QML+AQ LL H + R W
Sbjct: 104 VERFRLAFVSRIWLTYRREFPQLEGSTWTTDCGWGCMLRSGQMLLAQGLLVHLMPRDWVW 163
Query: 197 P 197
P
Sbjct: 164 P 164
>gi|260949671|ref|XP_002619132.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
gi|238846704|gb|EEQ36168.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
Length = 340
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 147/318 (46%), Gaps = 63/318 (19%)
Query: 137 LAEFNQDFSSRILISYRKGFDPI------------------------------GDSKITS 166
L E +SR+ +YR GF+PI + ++
Sbjct: 52 LEEIYPVINSRLWFTYRAGFEPIQKAEDGPSPLAFLKSMIFNVRPSMALGGLFDNQNYST 111
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHN 225
DVGWGCM+R+SQ L+A AL LGR + P E VE I+ LFGD T PFS+HN
Sbjct: 112 DVGWGCMIRTSQSLLANALQMLILGRDHQSPQAIQSAPEKVEKIIQLFGDDYTCPFSLHN 171
Query: 226 LLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
++ A L G W GP A S + L C + E+ ++ ++I D E
Sbjct: 172 FIKVASASPLKVKPGEWFGPSAASLSIKRL--CAKFESN-EIPNINVSICESCNLYDEEI 228
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
G +F + + +P+L+L PL LG++K+N Y P+L QS+G
Sbjct: 229 RG--------------IFEESE---SPLLILFPLRLGIDKINSIYYPSLLQLLALKQSVG 271
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
I GGKP +S Y G Q + +YLDPH++Q +D TYH+ + + + ++D
Sbjct: 272 IAGGKPSSSYYFFGFQGSNLLYLDPHNLQAA--------SSDPGTYHTSKFQTLSISNLD 323
Query: 404 PSLAIGFYCRDKDDFDDF 421
P A + ++ +DD+
Sbjct: 324 PLNAC--WSVNQMTYDDY 339
>gi|403289553|ref|XP_003935916.1| PREDICTED: cysteine protease ATG4A isoform 2 [Saimiri boliviensis
boliviensis]
Length = 360
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 205 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 227
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
+ + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 228 -LTASNESDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 286
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+++ DFD++C+ K + N +F + Q H
Sbjct: 287 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 317
>gi|170109871|ref|XP_001886142.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
gi|164639072|gb|EDR03346.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
Length = 1039
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/278 (34%), Positives = 138/278 (49%), Gaps = 51/278 (18%)
Query: 140 FNQDFSSRILISYRKGFD-PIGDSKI-------------------------------TSD 167
F DF+SRI ++YR F PI D+++ +SD
Sbjct: 336 FYIDFTSRIWLTYRSHFPTPIKDTRLADLCGDAAPEIANSPTTVKTRPWNWGGEKTWSSD 395
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDREYVEILHLFGDSET--SPFS 222
GWGCMLR+ Q L+A AL+ LGR WR+P +Q YV+I+ F D+ +PFS
Sbjct: 396 TGWGCMLRTGQSLLANALVHMHLGRDWRRPPYPVQTADYATYVQIVTWFLDTPAPEAPFS 455
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
+H + AGK +G G W GP + + L E+GLG VS DG
Sbjct: 456 VHRMALAGKEFGTDVGQWFGPSVAAGAIKTLVNS-FPESGLG----------VSVATDGT 504
Query: 283 RGGAPVVCI---DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
+ V + + +SR P+LLL+ + LG+E VNP Y T++L +TFP
Sbjct: 505 LFQSDVFAVSHGEMSSRSPRRIKTTTWGHRPVLLLLGIRLGIEGVNPIYYETIKLLYTFP 564
Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
QS+GI GG+P +S Y VG Q ++ YLDPH+ +P I +
Sbjct: 565 QSVGIAGGRPSSSYYFVGSQADNLFYLDPHNTRPAIPL 602
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 29/43 (67%)
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430
T+H + +R + L +DPS+ IGF CRD+ ++ DF R ++L +
Sbjct: 739 TFHCERVRKMPLSGLDPSMLIGFLCRDEAEWWDFKKRVAELPK 781
>gi|14041938|dbj|BAB55042.1| unnamed protein product [Homo sapiens]
Length = 280
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 84/254 (33%), Positives = 123/254 (48%), Gaps = 25/254 (9%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH + Q G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
G + G W GP + + + LA + +A+++ V +E
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112
Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
V C D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
F PQSLG++GGKP ++ Y +G E IYLDPH QP + D S +
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEGLIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232
Query: 396 HIHLDSIDPSLAIG 409
+ + +DPS+A+G
Sbjct: 233 RMSIAELDPSIAVG 246
>gi|296236154|ref|XP_002763201.1| PREDICTED: uncharacterized protein LOC100409486 [Callithrix
jacchus]
Length = 360
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 53 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 205 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 227
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
+E I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 228 LTASNRSDE-LIFLDPHTTQTFVDAEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 286
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+++ DFD++C+ K + N +F + Q H
Sbjct: 287 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 317
>gi|302498547|ref|XP_003011271.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
benhamiae CBS 112371]
gi|291174820|gb|EFE30631.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
benhamiae CBS 112371]
Length = 437
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 140/316 (44%), Gaps = 85/316 (26%)
Query: 139 EFNQDFSSRILISYRKGFDPI--------GDSK-----------------ITSDVGWGCM 173
+F DF S++ I+YR F PI GDS TSD GWGCM
Sbjct: 145 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSISLGVRLRSQLIDTQGFTSDTGWGCM 204
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KA 232
+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G A
Sbjct: 205 IRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGATA 261
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCI 291
G G W GP A + +AL + + GL +Y+ S G + E+ V C
Sbjct: 262 CGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVACD 313
Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
+ P L+L+ + LG+++V P Y +L+ FPQS+GI G +
Sbjct: 314 ESG-----------GGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGPE--- 359
Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
+ STYH+ +R +H+ +DPS+ IGF
Sbjct: 360 ---------------------------------ELSTYHTRRLRRLHVREMDPSMLIGFL 386
Query: 412 CRDKDDFDDFCARASK 427
RD+DD++D R +
Sbjct: 387 VRDEDDWEDLKQRVRE 402
>gi|432845798|ref|XP_004065858.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
Length = 497
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 120/233 (51%), Gaps = 13/233 (5%)
Query: 208 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 267
+++ LFGD +PF +H L+ GK G AG W GP + + R A+T +G QS
Sbjct: 231 KLVTLFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGPSVVAH----ILRKAVAKTSVG-QS 285
Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 327
L A+YV +D V+ + D S V W +++LVP+ LG E +NP
Sbjct: 286 L--AVYVA---QDCTVYKEDVLQLCDPSLSQRVADPSSQAWKSVIILVPVRLGGEALNPS 340
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
YI ++ + +GI+GGKP S Y +G Q+E +YLDPH QPV++ + + +
Sbjct: 341 YIECVKNILSLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDFTQANFSLE-- 398
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFT 439
++H + + +DPS IGFY R K+DF+ C+ L+ P+FT
Sbjct: 399 SFHCSSPKKMPFSRMDPSCTIGFYARTKEDFESMCSVVGMVLSSSKEKYPIFT 451
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 11/73 (15%)
Query: 108 SSTSDIWLLGVCHKI-AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
+ TS I++LG + + ++DE + F DF SRI ++YR+ F + S +T+
Sbjct: 87 NKTSPIFVLGHAYLLNSEDE----------VERFRLDFVSRIWLTYRREFPQLEGSTLTT 136
Query: 167 DVGWGCMLRSSQM 179
D GWGCMLRS QM
Sbjct: 137 DCGWGCMLRSGQM 149
>gi|241729578|ref|XP_002404604.1| cysteine protease, putative [Ixodes scapularis]
gi|215505492|gb|EEC14986.1| cysteine protease, putative [Ixodes scapularis]
Length = 433
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 120/414 (28%), Positives = 171/414 (41%), Gaps = 102/414 (24%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVG 169
IWLLGV + + G +A + A F+ +DFSSR+ +YR+ F I + I +D G
Sbjct: 36 IWLLGVIYHRKMTQFYGASAVVDDGASFDAFLEDFSSRLWFTYRREFPAIPGTDIRTDCG 95
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWR------------------KPLQKPF-----DREY 206
WGCMLRSSQM++AQA + H LGR WR PL++ F D
Sbjct: 96 WGCMLRSSQMILAQAFVMHLLGRQWRWQQVHTEAGEVRLPRHALWPLREGFRCTGGDGTA 155
Query: 207 VEIL----------HLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSW 250
V + FGD ++PFS+HNL+Q G+ G AG W GP Y + +
Sbjct: 156 VLVRCSPKPVNDPPRWFGDKADASTPFSLHNLVQRGRESGKKAGDWYGPSSVAYILKDAL 215
Query: 251 E-ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
E A R QR + IYV + +DD + CS S
Sbjct: 216 EDAAHRDQRLAQ--------LCIYVAQD---------CTIYMDDVTALCSAGSTEGVT-- 256
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS--------------LGIVGGKPGASTYI 355
+ PR + R F+ Q+ + K G S +
Sbjct: 257 ------------HRRLPRTVFARREMFSGGQTQRMCIHSSWLHLFVFFVCFLKYGISFLL 304
Query: 356 -VGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
+ EE IYLDPH Q ++++ D D ++H R + IDPS IGFYC+
Sbjct: 305 QLSAAEEKVIYLDPHYCQEMVDVNSQDFPLD--SFHCSWPRKMSFSRIDPSCTIGFYCKT 362
Query: 415 KDDFDDFCARASKLA---EESNGAPLFTV--------TQTHKKPVNHSDVLGET 457
K D +DF +L + + P+F + T T K+P VL +
Sbjct: 363 KHDLEDFTKNIRELTVPKQMRHEYPVFLISEGSCSDHTDTEKRPEEIVHVLQDV 416
>gi|388581514|gb|EIM21822.1| hypothetical protein WALSEDRAFT_68740 [Wallemia sebi CBS 633.66]
Length = 603
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 137/310 (44%), Gaps = 63/310 (20%)
Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGF------DPIGDS------------------- 162
LG+ NN ++ DF SRI +YR F DP+ D
Sbjct: 55 LGNLYDNN--SDLLDDFQSRIWCTYRSNFCQISLNDPMMDDLGLAKMQTLSSKPSHWLLR 112
Query: 163 --KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF--------DREYV---EI 209
+D GWGCMLR+SQ L+A L LGR WR+ PF +EYV ++
Sbjct: 113 ERTFNTDQGWGCMLRTSQSLLANTLQIMLLGRQWRR---NPFVDLTDYAKRKEYVNLIKL 169
Query: 210 LHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 267
L+LF D S SPFS+H + GK+ G G W GP + + L Q + L S
Sbjct: 170 LNLFMDNPSTLSPFSVHRMAVVGKSLGKEVGEWFGPSTAALAIKHLVNNQ-TDINLSV-S 227
Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVN 325
+ + D GG ++W P+L+LV + LGL+ ++
Sbjct: 228 VASDSVIYKSDVYQASGGTSTT--------------ADSEWGNKPVLILVGVRLGLDGIH 273
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
PRY TL+ +GI GG+P +S Y G Q +S Y+DPH ++P INI E +
Sbjct: 274 PRYYETLKAFLRMQSCVGIAGGRPSSSYYFFGYQSDSLFYVDPHIMKPTINIKTPPTEGE 333
Query: 386 TSTYHSDVIR 395
T +++R
Sbjct: 334 LKTEIENLLR 343
Score = 42.0 bits (97), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 37/62 (59%), Gaps = 5/62 (8%)
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
A STY D R +++ +DPS+ IGF +D+++F +F + +L ++ +F+V +
Sbjct: 470 ASISTYFCDKPRKMNISQMDPSMLIGFLVKDENEFFEFVNQIKELPQQ-----VFSVADS 524
Query: 444 HK 445
H+
Sbjct: 525 HR 526
>gi|395545675|ref|XP_003774724.1| PREDICTED: cysteine protease ATG4A [Sarcophilus harrisii]
Length = 431
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 74/260 (28%), Positives = 126/260 (48%), Gaps = 15/260 (5%)
Query: 194 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------Y 244
W K ++P EY IL F D + +SIH + Q G G + G W GP
Sbjct: 137 WEKHQEQP--EEYQRILKCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 194
Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 304
A+ W +LA + + + + ++ D + + +D + C + G
Sbjct: 195 ALFDEWNSLAVYVSMDNTVVIEDIKKMCHMCPSDLTHDSSSSSYNGLD-WNTDCPGQTSG 253
Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
W P+LL+VPL LG+ ++NP Y + F PQSLG +GGKP ++ Y +G + I
Sbjct: 254 ---WKPLLLIVPLRLGINQINPIYADAFKECFKMPQSLGALGGKPNSAYYFIGFLGDELI 310
Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
YLDPH Q ++ ++ D S + + + ++DPS+A+GF+C++++DFD++C
Sbjct: 311 YLDPHTTQTFVDTEENGTVNDQSFHCQQSPPRMKILNLDPSVALGFFCKEEEDFDNWCGL 370
Query: 425 ASKLAEESNGAPLFTVTQTH 444
K + +F + + H
Sbjct: 371 VQKEILKPQSLQMFELVEKH 390
>gi|37360148|dbj|BAC98052.1| mKIAA0943 protein [Mus musculus]
gi|148707989|gb|EDL39936.1| autophagy-related 4B (yeast), isoform CRA_d [Mus musculus]
Length = 266
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 60/169 (35%), Positives = 95/169 (56%), Gaps = 6/169 (3%)
Query: 293 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 71 DSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 130
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 406
GKP ++ Y +G E IYLDPH QP + + D S + + + +DPS+
Sbjct: 131 GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPSRMGIGELDPSI 190
Query: 407 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
A+GF+C+ ++DF+D+C + KL++ P+F + + + DVL
Sbjct: 191 AVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 239
>gi|322707969|gb|EFY99546.1| ATG4 protein [Metarhizium anisopliae ARSEF 23]
Length = 430
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 153/355 (43%), Gaps = 73/355 (20%)
Query: 138 AEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVGWGCML 174
A F DF+SR ++YR F DP + S TSD GWGCM+
Sbjct: 121 AAFLDDFASRFWMTYRSNFEIIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSGWGCMI 180
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 233
RS Q L+A A+ LGR WR+ + DRE +L LF D +P+SIHN ++ G+ Y
Sbjct: 181 RSGQSLLANAMAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSIHNFVRHGEKYC 237
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G W GP A R + L ++ E + IY G P + D+
Sbjct: 238 SKYPGEWFGPSATARCIQDLVNSRKQE---------LRIYST--------GDGPDIYEDN 280
Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
+ + + P L+LV LG++K+ P Y L + QS+GI GG+P +S
Sbjct: 281 FMK---IAKPDGEVFHPTLVLVGTRLGIDKITPVYWEALIASVQMSQSVGIAGGRPSSSH 337
Query: 354 YIVGVQEESAIYLDPHDVQPVINIGKDDLEA---DTSTYHSDVIRHIHLDSIDPSLAIGF 410
Y VG Q YLDPH + + D D + H+ +R IH+ +DP+
Sbjct: 338 YFVGSQGHFLFYLDPHHTRKALPYYSDVARYTIDDMDSCHTSRLRRIHVREMDPN----- 392
Query: 411 YCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDS 465
C A+++ + + + + V SD GE GG+P D S
Sbjct: 393 -----------CHPANEIRDATGRSVIDEVELL-------SDEDGEDGGIPHDKS 429
>gi|156042330|ref|XP_001587722.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980]
gi|154695349|gb|EDN95087.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 414
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 93/289 (32%), Positives = 134/289 (46%), Gaps = 34/289 (11%)
Query: 140 FNQDFSSRILISYRKGFDPIG---DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
F DF ++I ++YR F I D K S + LRS LV Q G W
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRS--QLVDQGGFTSDTG--WGC 158
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 255
E +IL LF D +P+SIH ++ G A G G W GP A AR
Sbjct: 159 SSSN----EERKILSLFADDPRAPYSIHKFVEHGASACGKHPGEWFGP-------SAAAR 207
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
C +A T +S + +Y+ D +D S+ +TP L+LV
Sbjct: 208 CIQALTNSQVES-ELRVYITGDGSD---------VYEDT--FMSIAKPNSTKFTPTLILV 255
Query: 316 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
LGL+K+ P Y L+ + PQS+GI GG+P +S Y +GVQE YLDPH +P +
Sbjct: 256 GTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYFIGVQESDFFYLDPHQTRPAL 315
Query: 376 NIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+D D + H+ +R +H+ +DPS+ I F RD++D+ D+
Sbjct: 316 PFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDPSMLIAFLIRDENDWKDW 364
>gi|189515077|ref|XP_001333093.2| PREDICTED: cysteine protease ATG4D-like [Danio rerio]
Length = 485
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 79/255 (30%), Positives = 123/255 (48%), Gaps = 27/255 (10%)
Query: 193 PWRKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
P R P P D + +++ FGD ++PF +H L++ GK G AG W GP +
Sbjct: 210 PARCPSASPDPQVDALHRKVVSCFGDHPSAPFGVHQLVELGKESGKRAGDWYGPSVVAHM 269
Query: 250 W-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
+A+AR E +A+YV V +D C G W
Sbjct: 270 LRKAVARAAEFED--------LAVYVAQDC---------TVYKEDVMSLCESSGVG---W 309
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
+++LVP+ LG E +NP YI ++ +GI+GGKP S + VG Q+E +YLDP
Sbjct: 310 KSVVILVPVRLGGESLNPSYIECVKNILKLKCCIGIIGGKPKHSLFFVGFQDEQLLYLDP 369
Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK- 427
H QPV+++ + + + ++H + R ++ +DPS IG Y R K DF+ C S+
Sbjct: 370 HYCQPVVDVTQANFSLE--SFHCNSPRKMNFSRMDPSCTIGLYARSKTDFESLCTAVSEA 427
Query: 428 LAEESNGAPLFTVTQ 442
L+ P+FT +
Sbjct: 428 LSSSKEKYPIFTFVE 442
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 40/63 (63%), Gaps = 1/63 (1%)
Query: 134 NNGLAE-FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
N G E F Q F S + ++YR+ F + S +T+D GWGCMLRS QM++AQ LL H +
Sbjct: 92 NEGEVERFRQTFVSCVWLTYRREFPQLDGSSLTTDCGWGCMLRSGQMMLAQGLLLHLMPT 151
Query: 193 PWR 195
WR
Sbjct: 152 DWR 154
>gi|441628985|ref|XP_004093160.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Nomascus
leucogenys]
Length = 441
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 88/328 (26%), Positives = 150/328 (45%), Gaps = 36/328 (10%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW--GCMLRSSQML-VAQALLFHR 189
G F DF SR+ ++YR + I D W G L ++ A +H
Sbjct: 98 GEGEHTAFPADFVSRLWLTYRXXXHCLTMCSIPPDWTWAEGTGLGPPELSGSASPSRYHG 157
Query: 190 LGRPWRKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 241
R W P L++ +R + +I+ F D +PF +H L++ G++ G AG W
Sbjct: 158 PAR-WMPPRWAQGAPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWY 214
Query: 242 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 301
GP +A R + + +YV + A +V D +
Sbjct: 215 GP-------SLVAHILRKAVESCSEVTRLVVYVSQTCSMYKADVARLVARPDPT------ 261
Query: 302 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++
Sbjct: 262 ----AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCQLCLGIMGGKPRHSLYFIGYQDD 317
Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+YLDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+
Sbjct: 318 FLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 375
Query: 422 CARASKLAEESNGA---PLFTVTQTHKK 446
C+ +++ S+ P+FT+ + H +
Sbjct: 376 CSELTRVLSSSSAMERYPMFTLAEGHAQ 403
>gi|328722655|ref|XP_003247627.1| PREDICTED: cysteine protease ATG4B-like [Acyrthosiphon pisum]
Length = 252
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/233 (30%), Positives = 118/233 (50%), Gaps = 32/233 (13%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I + +W+LG + D L + D SR+ +YRKGF IG++ T
Sbjct: 40 IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
SD GWGCMLR QM++ QAL+F LGR WR K D +Y++IL +F D ++P+SIH
Sbjct: 89 SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147
Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
+ G ++G G W GP + + + LA L ++ V+ D
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLA---------TMDELSSLVFHVALDN------ 192
Query: 286 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLT 335
+ I++ + C+V + + W P++L++PL LG+ +NP Y+ ++++
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKVS 243
>gi|256071263|ref|XP_002571960.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
gi|353229491|emb|CCD75662.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
Length = 302
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 97/309 (31%), Positives = 148/309 (47%), Gaps = 42/309 (13%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAG 230
M R QML+AQAL+ H LGR WR + ++I+ F DS + SP S+H L+Q
Sbjct: 1 MFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLSLHRLVQMS 60
Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGDE--DGER 283
G W GP ++C A+ R + L + + +Y V+ +E D R
Sbjct: 61 DR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYREEIIDLAR 114
Query: 284 G------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLGL-EKVNPR 327
G P + D H +++ + Q+D T ILLL+PL+ G ++NPR
Sbjct: 115 GLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFGKGNRINPR 170
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
YI + F+ P +G++GG+ S+Y VG Q S IYLDPH QP N+ D
Sbjct: 171 YIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNSPKFSVD-- 228
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
++H + + + +++PS A+GFYCR + + D R L S+ T +P
Sbjct: 229 SWHCPIPKTMSAANLNPSCAVGFYCRTRGELSDLIDRLPILMSVSDNLQ----ASTRSRP 284
Query: 448 VNHS-DVLG 455
V + +VLG
Sbjct: 285 VAFTVEVLG 293
>gi|392586633|gb|EIW75969.1| hypothetical protein CONPUDRAFT_111807 [Coniophora puteana
RWD-64-598 SS2]
Length = 1038
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 102/318 (32%), Positives = 139/318 (43%), Gaps = 74/318 (23%)
Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------------------ 164
+Q A G + EF DF+SRI ++YR F PI DS +
Sbjct: 271 SQSPASEKHPGQDWAPEFYADFTSRIWLTYRNQFAPIRDSTLSTLESDQTREPCTEMSSP 330
Query: 165 --------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YV 207
T+D GWGCMLR+ Q L+A ALL LGR WR+P + + YV
Sbjct: 331 SPKSRRWFGGEKGWTTDTGWGCMLRTGQTLLANALLHLHLGRDWRRPPYPLYTEDYATYV 390
Query: 208 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+I+ F DS +PFS+H + AGK G G W GP + + L + + GLG
Sbjct: 391 QIITWFLDSPLPQAPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKRLVQA-FPDAGLGV 449
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------W--TPIL 312
V D A V+S D W +L
Sbjct: 450 ----------------------AVASDGALYQTDVYSASYVDVGSPRNVRKLRWGGRAVL 487
Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
+L + LG+ VNP Y T++ F PQS+GI GG+P +S Y +GVQ ++ IYLDPH +
Sbjct: 488 VLFGIRLGINGVNPIYYDTIKGLFEIPQSVGIAGGRPSSSYYFMGVQGDNLIYLDPHHAR 547
Query: 373 PVINIGKDDLEADTSTYH 390
P I + + EAD H
Sbjct: 548 PAIPL-RPLPEADEGNQH 564
Score = 45.8 bits (107), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 38/69 (55%), Gaps = 5/69 (7%)
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
A+ T+H D +R + L +DPS+ +GF C+D++D+ DF R + L + T+
Sbjct: 716 AELKTFHCDRVRKMPLSGLDPSMLLGFLCQDEEDWIDFRHRITDLMHRNK-----TIFAI 770
Query: 444 HKKPVNHSD 452
+P N S+
Sbjct: 771 QDEPPNWSE 779
>gi|90080692|dbj|BAE89827.1| unnamed protein product [Macaca fascicularis]
Length = 263
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 93/169 (55%), Gaps = 6/169 (3%)
Query: 293 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
D+ RHC+ F G + W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 68 DSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 127
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 406
GKP ++ Y VG E IYLDPH QP + D S + + + +DPS+
Sbjct: 128 GKPNSAHYFVGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFHCQHPPCRMSIAELDPSI 187
Query: 407 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 188 AVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 236
>gi|16551551|dbj|BAB71121.1| unnamed protein product [Homo sapiens]
Length = 330
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 141/318 (44%), Gaps = 62/318 (19%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWR------------------------------------K 196
MLRS QM++AQ LL H L R W
Sbjct: 1 MLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
L++ +R + +I+ F D +PF +H L++ G++ G AG W GP +A
Sbjct: 61 ELER--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHI 111
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
R + +YV + A +V D + A+W +++LVP
Sbjct: 112 LRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVP 161
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP ++
Sbjct: 162 VRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVD 221
Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA- 435
+ + D + ++H R + DPS +GFY D+ +F C+ +++ S+
Sbjct: 222 VSQADFPLE--SFHCTSPRKMAFAKTDPSCTVGFYAGDRKEFGTLCSELTRVLSSSSATE 279
Query: 436 --PLFTVTQTHKKPVNHS 451
P+FT+ + H + +HS
Sbjct: 280 RYPMFTLAEGHAQ--DHS 295
>gi|354544955|emb|CCE41680.1| hypothetical protein CPAR2_802300 [Candida parapsilosis]
Length = 423
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 90/274 (32%), Positives = 136/274 (49%), Gaps = 44/274 (16%)
Query: 161 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 220
+ TSD GWGCM+R+SQ L+A ALL +L + Q ++IL LF D TSP
Sbjct: 138 NDNFTSDAGWGCMIRTSQNLLAIALL--KLSEEHNESAQ-------LDILKLFQDDPTSP 188
Query: 221 FSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSG 277
FS+HN ++ + L G W GP A S + L ++ ET P I V
Sbjct: 189 FSLHNFIRVASSSPLLVKPGQWFGPNAASLSIKKLTIEAKKLET-------PGEIPYVYI 241
Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
E+ + DD +F++ Q P+LLL P+ LG+++VN Y ++ +
Sbjct: 242 SENAD-------LFDDEIE--DLFNEEQK---PLLLLFPVRLGIDQVNKYYYKSILQLLS 289
Query: 338 FPQSLGIVGGKPGASTYIVGVQEES-AIYLDPHDVQPV---INIGKDDLEADTSTYHSDV 393
P S+GI GGKP +S Y +G + E+ +Y DPH Q V INI +TYH+
Sbjct: 290 LPYSVGIAGGKPSSSFYFIGYENENHLLYFDPHLPQVVEAPINI---------TTYHTAN 340
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
+ ++ +DPS+ IG + D++ +F S+
Sbjct: 341 YNKLDIEMVDPSMMIGVLLKSMDEYKEFKQDCSE 374
>gi|406606786|emb|CCH41822.1| putative cysteine protease atg4 [Wickerhamomyces ciferrii]
Length = 592
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 107/347 (30%), Positives = 155/347 (44%), Gaps = 62/347 (17%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG------- 160
S DIW H A+D D N EF D +RI ++YR F PI
Sbjct: 75 SGLKDIWQTLRFH-TAEDNEKDDL--NKWPQEFIDDVYTRIWLTYRTKFSPIDRDPEGPS 131
Query: 161 ----------------DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+ T+D GWGCM+R+SQ L+A ALL +GR WR +
Sbjct: 132 PLSLNFFLRGQNYDLDNEHFTTDCGWGCMIRTSQSLLANALLNLHIGRDWR--YTGELNE 189
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGL 263
+ EI+ F D + PFSIH ++ GK G W GP A RS ++L
Sbjct: 190 MHNEIVSWFIDCPSHPFSIHKIVDKGKLLSNKKPGEWFGPSAAARSIQSL---------- 239
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
C + V G + G+ V + A VF PIL+L+ L LG++
Sbjct: 240 -CNEFDSGVKVYIGSDSGDIYENDVFKV--AKDENGVFK-------PILILLGLRLGIDN 289
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+NP Y +L+ +S+GI GG+P S Y G Q + YLDPH QP + + D L+
Sbjct: 290 INPVYWDSLKAILNSKESIGIAGGRPSTSHYFFGFQGDHLFYLDPHLPQPAL-LHDDQLD 348
Query: 384 A------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
D ++ H+ +R IHL +DPS+ +GF +D++++
Sbjct: 349 TSVSESTEIVSSLDVNSVHTKKLRKIHLSEVDPSMLLGFLIKDENEW 395
>gi|67967551|dbj|BAE00258.1| unnamed protein product [Macaca fascicularis]
Length = 330
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 139/311 (44%), Gaps = 56/311 (18%)
Query: 173 MLRSSQMLVAQALLFHRLGRPW----------------------------------RKPL 198
MLRS QM++AQ LL H L R W +
Sbjct: 1 MLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60
Query: 199 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 258
+ +R + +I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 61 ELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILR 113
Query: 259 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 318
+ + +YV + A +V D + A+W +++LVP+
Sbjct: 114 KAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVPVR 163
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP +++
Sbjct: 164 LGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVS 223
Query: 379 KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--- 435
+ D + ++H R + +DPS +G Y D+ +F+ C+ +++ S+
Sbjct: 224 QADFPLE--SFHCTSPRKMAFAKMDPSCTVGSYAGDRKEFETLCSELTRVLGSSSATERY 281
Query: 436 PLFTVTQTHKK 446
P+FT+ + H +
Sbjct: 282 PMFTLAEGHAQ 292
>gi|395323681|gb|EJF56143.1| hypothetical protein DICSQDRAFT_113447 [Dichomitus squalens
LYAD-421 SS1]
Length = 999
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 91/278 (32%), Positives = 133/278 (47%), Gaps = 50/278 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 166
F DF+SRI ++YR F PI D+ + TS
Sbjct: 303 FYADFTSRIWLTYRSQFFPIRDTTLAALEQEVHDSPTGLPSSPPSKRWNWPIGGEKGWTS 362
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 221
D GWGCMLR+ Q L+A ALL LGR WR+P + +Y V+I+ F D+ + PF
Sbjct: 363 DAGWGCMLRTGQSLLANALLHLHLGRDWRRPPHPVYTADYAMYVQIVTWFLDTPSPLCPF 422
Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
S+H + GK G G W GP + + L + GLG +A+ S +
Sbjct: 423 SVHRMALVGKDLGKEVGQWFGPSTAAGAIKTLVHS-FPDAGLG-----VAVASDSTLYES 476
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
+ A + RH + +W +L+L+ + LG+E VNP Y T++ +TFP
Sbjct: 477 DVYAASRSSVYSTRRH----GHPRMEWGDRAVLILIGIRLGIEGVNPLYYNTIKTLYTFP 532
Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
Q++GI GG+P +S Y VG Q ++ YLDPH +P I +
Sbjct: 533 QTVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAIPL 570
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 15/46 (32%), Positives = 29/46 (63%)
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
+ + T+H D +R + L +DPS+ +GF C+D+ ++ D R ++L
Sbjct: 699 QTELKTFHCDRVRKMPLSGLDPSMLLGFLCKDEAEWLDLKERITEL 744
>gi|444321667|ref|XP_004181489.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
gi|387514534|emb|CCH61970.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
Length = 577
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 145/320 (45%), Gaps = 64/320 (20%)
Query: 138 AEFNQDFSSRILISYRKGFDPI-----GDSKI------------------------TSDV 168
EF +D SR++ +YR F PI G S I T+D+
Sbjct: 127 VEFLEDCKSRLIFTYRTNFSPIERAPDGPSPINVSVLFRDTLFNTVNHVLNNPNSFTTDI 186
Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 222
GWGCM+R+ Q L+ AL LGR +R P K E +I+ F D+ PFS
Sbjct: 187 GWGCMIRTGQSLLGNALQIINLGRNFRINNQSNNPNTKNIKEE--DIIEWFYDNPNKPFS 244
Query: 223 IHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
IH + G + G W GP C + ++L + E G+ + V SGD
Sbjct: 245 IHKFVDKGMRISDKKPGEWFGPSTTCTAIQSLIY-EFPECGID----ECILSVSSGD--- 296
Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
+ D+ + H F K + T IL+L+ + LG++K+N Y ++ S
Sbjct: 297 -------IYEDEINEH---FQKNEN--TIILILLGVKLGIDKINQCYFNDIKDILNSRYS 344
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
GI GG+P +S Y G E Y DPH +P + + +D + ST +S ++ +
Sbjct: 345 CGISGGRPSSSLYFFGHMNEYLYYFDPH--KPQLQLNEDFKNSCHSTDYSKIL----ISE 398
Query: 402 IDPSLAIGFYCRDKDDFDDF 421
IDPS+ IGFY + K D+D+F
Sbjct: 399 IDPSMLIGFYLKGKKDWDNF 418
>gi|71043632|ref|NP_001020882.1| cysteine protease ATG4B [Rattus norvegicus]
gi|68533688|gb|AAH98833.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Rattus
norvegicus]
Length = 224
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 58/169 (34%), Positives = 93/169 (55%), Gaps = 6/169 (3%)
Query: 293 DASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
++ RHC+ G W P++LL+PL LGL +N Y+ TL+ F PQSLG++G
Sbjct: 29 ESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 88
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 406
GKP ++ Y +G E IYLDPH QP + + D S + + + +DPS+
Sbjct: 89 GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPCRMGIGELDPSI 148
Query: 407 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
A+GF+C+ ++DF+D+C + KL++ P+F + + + DVL
Sbjct: 149 AVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 197
>gi|440789707|gb|ELR11008.1| cysteine protease atg4a, putative, partial [Acanthamoeba
castellanii str. Neff]
Length = 180
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 55/118 (46%), Positives = 80/118 (67%), Gaps = 1/118 (0%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W P+++LVP+ LG++ +NP YIPTL+ F+FPQ LG++GGKP +S Y VG Q+ +Y+D
Sbjct: 11 WHPVIILVPVRLGIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMD 70
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
PH VQP + + D L +Y ++ + + D IDPSLA+GF C + +FDDFC A
Sbjct: 71 PHFVQPTVKMDDDPLFP-IESYRMEIPQAMSFDDIDPSLALGFLCSSQAEFDDFCLNA 127
>gi|299738612|ref|XP_001834660.2| cysteine protease [Coprinopsis cinerea okayama7#130]
gi|298403389|gb|EAU87108.2| cysteine protease [Coprinopsis cinerea okayama7#130]
Length = 1034
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/269 (34%), Positives = 128/269 (47%), Gaps = 50/269 (18%)
Query: 140 FNQDFSSRILISYRKGF-DPIGDSKI-------------------------------TSD 167
F DF+SRI ++YR F PI D ++ +SD
Sbjct: 302 FYIDFTSRIWLTYRSHFPQPIKDGRLADLCGGPQPEPVASPVTKKSPWHWVGGEKSWSSD 361
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFS 222
GWGCMLR+ Q L+A AL+ LGR WRKP +Y V IL F D+ +PFS
Sbjct: 362 SGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVMTADYATYVHILTWFLDTPAPEAPFS 421
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
+H + AGK G G W GP + +AL E G+G +A+ V DG
Sbjct: 422 VHRMALAGKELGTDVGQWFGPSVAAGAIKALVNS-FPEAGIG-----VAVAV-----DGV 470
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
V + + W P+LLL+ + LG+E VNP Y T+++ +TFPQ
Sbjct: 471 LYQTDVHAASHGDHFGRTPRRHKRSWGDRPVLLLLGIRLGIEGVNPIYYDTIKMLYTFPQ 530
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPH 369
S+GI GG+P +S Y VG Q ++ YLDPH
Sbjct: 531 SVGIAGGRPSSSYYFVGSQADNLFYLDPH 559
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 28/45 (62%)
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
A+ T+H + +R + L +DPS+ +GF CRD+ ++ D R + L
Sbjct: 711 AELKTFHCERVRKMPLSGLDPSMLLGFLCRDEAEWVDLRKRVAGL 755
>gi|149022064|gb|EDL78958.1| rCG26842 [Rattus norvegicus]
Length = 246
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/246 (30%), Positives = 112/246 (45%), Gaps = 53/246 (21%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180
Query: 293 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 327
D + C V G AD W P+LL+VPL LG+ ++NP
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240
Query: 328 YIPTLR 333
YI +
Sbjct: 241 YIEAFK 246
>gi|149020505|gb|EDL78310.1| rCG31864, isoform CRA_c [Rattus norvegicus]
Length = 337
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 121/247 (48%), Gaps = 22/247 (8%)
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
DR + I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 72 DRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-------SVVAHILRKAVE 124
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
C + + VS D + D +R S + A+W +++LVP+ LG E
Sbjct: 125 -SCSEVTRLVVYVSQDCTVYKA--------DVARLVS-WPDPTAEWKSVVILVPVRLGGE 174
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP +++ + +
Sbjct: 175 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVNQANF 234
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFT 439
+ ++H R + +DPS +GFY ++ +F+ C+ ++ S+ P+FT
Sbjct: 235 PLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMRILSSSSVTERYPMFT 292
Query: 440 VTQTHKK 446
V + H +
Sbjct: 293 VAEGHAQ 299
>gi|344304092|gb|EGW34341.1| hypothetical protein SPAPADRAFT_59751, partial [Spathaspora
passalidarum NRRL Y-27907]
Length = 363
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 84/280 (30%), Positives = 133/280 (47%), Gaps = 43/280 (15%)
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
F+ R+ + R FD SDVGWGCM+R+SQ L+A AL+ LQ +
Sbjct: 104 FNKRLFTTVRSLFD---SENFNSDVGWGCMIRTSQSLLANALM----------KLQPSAE 150
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 261
E +++LF D+ S FS+HN ++ L G W GP A S + L + +T
Sbjct: 151 HE---VINLFQDNIASAFSLHNFIRVASESPLEVKPGQWFGPNAASLSTKKLLDGMKGKT 207
Query: 262 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
G + + I S D E I++ SV L+L P+ LG+
Sbjct: 208 IQGVKYPHVFISENSDLYDEE--------IEELLVESSV-----------LILFPVRLGI 248
Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
+ VN Y ++ P ++GI GGKP +S Y +G Q++ +Y DPH Q N
Sbjct: 249 DNVNSYYYDSIFQLLACPFTVGISGGKPSSSFYFLGYQDQDLLYFDPHSPQLYEN----- 303
Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+ +TYH++ + +H+ +DPS+ +G +DK ++ +F
Sbjct: 304 -PINYTTYHTNNYQRLHIHMLDPSMMVGILVKDKSEYKEF 342
>gi|50307871|ref|XP_453929.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|62899744|sp|Q6CQ60.1|ATG4_KLULA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49643063|emb|CAH01025.1| KLLA0D19536p [Kluyveromyces lactis]
Length = 450
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 95/309 (30%), Positives = 138/309 (44%), Gaps = 53/309 (17%)
Query: 143 DFSSRILISYRKGFDPI-----GDSKIT------------------------SDVGWGCM 173
D SR+ +YR F PI G S I SD+GWGCM
Sbjct: 64 DVHSRVFFTYRTQFTPIRRNENGPSPINFTLFFRDNPINTLENALTDPDSFYSDIGWGCM 123
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KA 232
+R+ Q L+A A+ +L R +R + D E + ++ F D P S+HN ++A K
Sbjct: 124 IRTGQALLANAIQRVKLAREFRINASRIDDNE-LNLIRWFQDDVKYPLSLHNFVKAEEKI 182
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G+ G W GP A RS + L E C I S D + D
Sbjct: 183 SGMKPGQWFGPSATARSIKTLI-----EGFPLCGIKNCIISTQSAD----------IYED 227
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
+ +R +F K + +LLL + LG++K+N Y + + P S+GI GGKP +S
Sbjct: 228 EVTR---IFHKDRD--ANLLLLFAVRLGVDKINSLYWKDIFKILSSPYSVGIAGGKPSSS 282
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
Y G Q E+ YLDPH+ Q ++ DDLE S H +H+ DPS+ +G
Sbjct: 283 LYFFGYQNENLFYLDPHNTQQS-SLMMDDLEFYRSC-HGHKFNKLHISETDPSMLLGMLI 340
Query: 413 RDKDDFDDF 421
K+++D F
Sbjct: 341 SGKNEWDQF 349
>gi|403413274|emb|CCL99974.1| predicted protein [Fibroporia radiculosa]
Length = 994
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/277 (32%), Positives = 129/277 (46%), Gaps = 49/277 (17%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 167
F DF+SRI ++YR F+PI D+ + TSD
Sbjct: 309 FYSDFTSRIWLTYRSQFEPIRDTSLSALNYDMDERAAPTSSPQPKRWNWGLGGEKGWTSD 368
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGD--SETSPFS 222
GWGCMLR+ Q L+A ALL LGR WR+P + + YV+I+ F D S PFS
Sbjct: 369 SGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPIYTADFATYVQIISWFLDDPSPLCPFS 428
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
+H + GK G G W GP + + L E GLG +A+ V D
Sbjct: 429 VHRMALVGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVS---VAVDGVIYQSDVY 484
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
+ + +H G+ W +L+L+ + LG++ VNP Y ++ +T PQ
Sbjct: 485 AVSRSTMGLGSPRKH------GRPSWGDRAVLVLIGIRLGIDGVNPIYYDLIKALYTLPQ 538
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
+LGI GG+P +S Y VG Q + YLDPH +P I +
Sbjct: 539 TLGIAGGRPSSSYYFVGSQANNLFYLDPHHARPTIPL 575
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 52/100 (52%), Gaps = 16/100 (16%)
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
T+H D +R + L +DPS+ IGF C+D++D+ D R ++L THK+
Sbjct: 711 TFHCDRVRKMPLSGLDPSMLIGFLCKDENDWIDLRRRLTELF------------NTHKRH 758
Query: 448 VNHSDVLGETGGVPED--DSLGVMSMNDAVGNAHEDDWQL 485
+ + E P D D++G+ S+++ + E+D +L
Sbjct: 759 IFS--IQDEPPNWPSDSEDNIGLESISEPDIDLPEEDDEL 796
>gi|150864470|ref|XP_001383296.2| hypothetical protein PICST_30446 [Scheffersomyces stipitis CBS
6054]
gi|166990661|sp|A3LQU0.2|ATG4_PICST RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|149385726|gb|ABN65267.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 514
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 95/323 (29%), Positives = 153/323 (47%), Gaps = 43/323 (13%)
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
FS +L + + + I T+DVGWGCM+R+SQ L+A F RL L K D
Sbjct: 138 FSKSLLYNLQNFNNFIEKENFTTDVGWGCMIRTSQSLLANT--FVRL-------LDKQSD 188
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 261
I+ LF D+ +PFS+HN ++ + L G W GP A S + L C
Sbjct: 189 -----IIALFNDTYLAPFSLHNFIRVASSSPLKVKPGEWFGPNAASLSIKRL--CDGYYD 241
Query: 262 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
+++ I V+ + ++ ++ +KG +L+L+P+ LG+
Sbjct: 242 NSTSETILPRINVLISESTDLYDSQIAQLLEPSTE-----TKG------LLVLLPVRLGI 290
Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
+ +N Y +L + QS+GI GGKP +S Y G Q+ S IY+DPH Q I D
Sbjct: 291 DSINSYYFSSLLHLLSLEQSVGIAGGKPSSSFYFFGYQDNSLIYMDPHSAQ----IFSSD 346
Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF---CARASKLAEESNGAPLF 438
+ D STY++ + + + +DPS+ IG + RD +++F C A+ +
Sbjct: 347 I--DMSTYYATRYQRVDIGKLDPSMLIGVFIRDLTSYENFKKSCLDAANKIVHFHATERS 404
Query: 439 TVTQTHKK-----PVNHSDVLGE 456
TV ++ +K +N SD+ E
Sbjct: 405 TVPESRRKNSEFVNINRSDLKDE 427
>gi|219129924|ref|XP_002185127.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403306|gb|EEC43259.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 557
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/346 (28%), Positives = 141/346 (40%), Gaps = 48/346 (13%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL---- 198
D S +YR F I ITSD GWGCMLRS+QM++ QAL H R WR P
Sbjct: 171 DERSLFWFTYRCDFPEIAPYNITSDAGWGCMLRSAQMMLGQALRLHFKSRDWRPPQLLAR 230
Query: 199 --QKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 255
Q F R + + S S +S+HN++ AG Y G W GP C L
Sbjct: 231 RRQDSFIRSVLTWFADYPSSSESVYSLHNMVAAGLSKYDKLPGEWYGPGTACYVMRDLVH 290
Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
+ LG L I+ V G + + K +
Sbjct: 291 IHEKQQALGKTRLDRRIFRVYVAPQGTVYRDTIHAFMTTEARVRIEEKKKVKEQTQPQAH 350
Query: 316 PLVLGLEK---------------------------VNPRYIPTLRLTFTFPQSLGIVGGK 348
PL L E+ +N Y+ +L TF+ PQS+G++GG+
Sbjct: 351 PLDLEWEEELMESANTVEWDTALLLLVPLRLGLTSLNEEYVQSLAHTFSLPQSVGVLGGR 410
Query: 349 PGASTYIVGVQEE-SAIY-LDPHDVQ--PVINIGKDDLEADTSTYHS-DVIRHIHLD--- 400
P + + G Q++ S I+ LDPH VQ P + + +A + S D +R H
Sbjct: 411 PRGARWFYGAQKDGSKIFGLDPHTVQTAPGRQTARVNGQASSVVELSDDYLRSCHTTCPE 470
Query: 401 -----SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP-LFTV 440
+DPS+A+GFYCR + D + +E + P LF+V
Sbjct: 471 MFPFCKMDPSIALGFYCRTRADLNHVLNSMGAWQKEHSSIPELFSV 516
>gi|148693225|gb|EDL25172.1| autophagy-related 4D (yeast), isoform CRA_a [Mus musculus]
Length = 296
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 120/247 (48%), Gaps = 22/247 (8%)
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
DR + I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 31 DRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP-------SVVAHILRKAVE 83
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
C + + VS D + D +R S + A+W +++LVP+ LG E
Sbjct: 84 -SCSEVSRLVVYVSQDCTVYKA--------DVARLLS-WPDPTAEWKSVVILVPVRLGGE 133
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+NP Y+P ++ LGI+GGKP S Y +G Q++ +YLDPH QP +++ +
Sbjct: 134 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQPSF 193
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFT 439
+ ++H R + +DPS +GFY ++ +F+ C+ ++ S+ P+FT
Sbjct: 194 PLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMRILSSSSVTERYPMFT 251
Query: 440 VTQTHKK 446
V + H +
Sbjct: 252 VAEGHAQ 258
>gi|241958330|ref|XP_002421884.1| cysteine protease, putative [Candida dubliniensis CD36]
gi|223645229|emb|CAX39828.1| cysteine protease, putative [Candida dubliniensis CD36]
Length = 443
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 94/330 (28%), Positives = 143/330 (43%), Gaps = 73/330 (22%)
Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------- 162
LG N+ A N S++ +SYR GF+PI S
Sbjct: 69 LGQIFDNSNAA--NNYIESKLWLSYRCGFEPIPKSIDGPQPIHFFPSIIFNRTTIYSNFA 126
Query: 163 ---------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 213
TSD GWGCM+R+SQ L+A LL K + + EI+ LF
Sbjct: 127 NLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEQEIVKLF 173
Query: 214 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
D SPFSIHN ++ + L G W GP A S + L + + G P
Sbjct: 174 QDDTKSPFSIHNFIRVASSSPLHVKPGEWFGPNAASLSIKRLTNELQDQEINGIN--PPR 231
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
+++ + DD R VF+K +++ +++L P+ LG++KVN Y +
Sbjct: 232 VFISENSD----------LFDDEIR--DVFAKEKSN--SVIILFPIRLGIDKVNSYYYNS 277
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
+ + S GI GGKP +S Y +G ++ IY DPH Q V + + +YHS
Sbjct: 278 IFHLLSSKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQIV------ETPFNMDSYHS 331
Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+++ +DPS+ IG + D++ DF
Sbjct: 332 TNYNTLNISLLDPSMMIGILVTNIDEYIDF 361
>gi|392574855|gb|EIW67990.1| hypothetical protein TREMEDRAFT_63874 [Tremella mesenterica DSM
1558]
Length = 1159
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 82/248 (33%), Positives = 112/248 (45%), Gaps = 51/248 (20%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------------REYVEIL 210
+T+D GWGCMLR+ Q L+A AL+ LGR WR P Q YV IL
Sbjct: 580 LTTDAGWGCMLRTGQSLLANALIHLHLGRDWRVPSQPQVPPTSAAHLAELEAYSSYVRIL 639
Query: 211 HLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
F D S PFS+H + GK G G W GP + + L S
Sbjct: 640 SWFLDDPSPLCPFSVHRIALIGKELGKEVGEWFGPSTAAGALKTL-----------VNSF 688
Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------------W--T 309
P + V+ D +V D ++ S G +D W
Sbjct: 689 PPSGMAVATAVDS------IVYKSDVYSASNLQSTGWSDESAPPRRQSSSSRSSTSWGNR 742
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+L+L+ + LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y VG Q S +YLDPH
Sbjct: 743 AVLVLIGIRLGLDGVNPLYYESIKALFTFPQSVGIAGGRPSSSYYFVGTQANSLVYLDPH 802
Query: 370 DVQPVINI 377
+P + +
Sbjct: 803 FTRPAVPL 810
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 40/60 (66%), Gaps = 5/60 (8%)
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
+A T+H D +R I L +DPS+ +GF C+D+ DF+DFC+R ++L ++ +FT+ +
Sbjct: 962 KAQLGTFHCDKVRKIPLSGLDPSMLLGFVCKDEADFEDFCSRVAQLPQK-----IFTIQE 1016
>gi|68485607|ref|XP_713286.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
gi|46434768|gb|EAK94169.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
Length = 446
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 98/353 (27%), Positives = 148/353 (41%), Gaps = 76/353 (21%)
Query: 141 NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 166
N S++ +SYR GF+PI S TS
Sbjct: 80 NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCM+R+SQ L+A LL K + + EI+ LF D +SPFSIHN
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDGTSSPFSIHNF 186
Query: 227 LQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
++ L G W GP A S + L + L +P +S + D
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLTNELLQDQELDGIRIPRVF--ISENSD---- 240
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
DD R VF+K ++ +L+L P+ LG++KVN Y ++ S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKS--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
GGKP +S Y +G ++ IY DPH Q V + + +YH+ +++ +DP
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYHTTNYNRLNISLLDP 345
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGET 457
S+ IG + D++ DF S + +N F H PV ++ ++
Sbjct: 346 SMMIGILVTNIDEYIDF---KSSCIDNNNKIVHF---HPHTLPVQQDSIINQS 392
>gi|156839152|ref|XP_001643270.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
70294]
gi|166990653|sp|A7TQN1.1|ATG4_VANPO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|156113873|gb|EDO15412.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
70294]
Length = 411
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 89/309 (28%), Positives = 142/309 (45%), Gaps = 57/309 (18%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 170
F D SRI +YR F PI S +D+GW
Sbjct: 74 FLSDVISRIHFTYRTKFIPIARSDDGPSPLRINFLIGDNPFNAIENAIYNPNCFNTDIGW 133
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
GCM+R+ Q L+A A+ LGR +R + + +I+ F D+ PFS+HN ++ G
Sbjct: 134 GCMIRTGQSLLANAIQIAILGREFRVN-DGDVNEQERKIISWFMDTPDEPFSLHNFVKKG 192
Query: 231 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
+ G W GP A RS ++L Q + G+ + ++ + DE
Sbjct: 193 CELSSKKPGEWFGPAATSRSIQSLVE-QFPDCGIDRCIVSVSSADIFKDE---------- 241
Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
I+D +F + ++ ILLL+ + LG++KVN Y+ +R S+GI GG+P
Sbjct: 242 -IND------IFKNKR--YSNILLLMGVKLGVDKVNEYYLKDIRKILESRYSVGISGGRP 292
Query: 350 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 409
+S Y G Q+++ +Y DPH QP +E+ T H+D I++ +DPS+ IG
Sbjct: 293 SSSLYFFGYQDDTLLYFDPHKPQPST------IESLLETCHTDNFDKINISDMDPSMLIG 346
Query: 410 FYCRDKDDF 418
+ +DD+
Sbjct: 347 VLLQGEDDW 355
>gi|145481079|ref|XP_001426562.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124393637|emb|CAK59164.1| unnamed protein product [Paramecium tetraurelia]
Length = 391
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 148/326 (45%), Gaps = 42/326 (12%)
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
Q +S I +YRK F I +S+ TSD GWGCMLRS QM+ AQ L H R+ Q
Sbjct: 51 QIYSRTIWFTYRKNFPQILNSQQTSDAGWGCMLRSGQMIWAQILRVH-----IRQKKQHS 105
Query: 202 FDREYVEILHLFGDSE---------------TSPFSIHNLLQAGK-AYGLAAGSWVGPYA 245
D +Y ++L F D + SP+SI + + + + W P
Sbjct: 106 KDYQY-KLLCAFSDDDDDEHKKMFTDNFKLCLSPYSIQKIEAISQIKFSMKPCQWYRPDQ 164
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIY--VVSGDEDGERGGAPVVC-----------ID 292
+ + L + ++ E G + L + I ++ E G + C
Sbjct: 165 ILNALSLLHQQKQLE---GSEDLEITISDSLLYDRLYSEMYGLKMDCEHIVNEIKQDKNK 221
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
+ S+ C++ K I + + GL+++N Y+P L PQ GI+GG+ +
Sbjct: 222 EISKICNICQKKDPKALAIFFITRI--GLDEINKEYLPFLNDLIDLPQFQGIIGGRDDKA 279
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
YI+G + IYLDPH +Q IN G + D T+ +++I+ + + PS+A+GFYC
Sbjct: 280 YYILGRVNKRLIYLDPHYIQEHINRGNVVMLKD--TFFCKDVKYINEEQMSPSIALGFYC 337
Query: 413 RDKDDFDDFCARASKLAEESNGAPLF 438
+++ + D F ++ + + F
Sbjct: 338 QNQSELDKFFNSIEQIKKNYDNEKTF 363
>gi|68485712|ref|XP_713234.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
gi|71152285|sp|Q59UG3.1|ATG4_CANAL RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|46434715|gb|EAK94117.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
Length = 446
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 136/317 (42%), Gaps = 70/317 (22%)
Query: 141 NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 166
N S++ +SYR GF+PI S TS
Sbjct: 80 NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGCM+R+SQ L+A LL K + + EI+ LF D +SPFSIHN
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDDTSSPFSIHNF 186
Query: 227 LQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
++ L G W GP A S + LA + + +P +S + D
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLASELLQDQEIDGIKIPRVF--ISENSD---- 240
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
DD R VF+K + +L+L P+ LG++KVN Y ++ S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
GGKP +S Y +G ++ IY DPH Q V + + +YH+ +++ +DP
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYHTTNYNRLNISLLDP 345
Query: 405 SLAIGFYCRDKDDFDDF 421
S+ IG + D++ DF
Sbjct: 346 SMMIGILVTNIDEYIDF 362
>gi|268536436|ref|XP_002633353.1| Hypothetical protein CBG06097 [Caenorhabditis briggsae]
Length = 411
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 88/285 (30%), Positives = 135/285 (47%), Gaps = 54/285 (18%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP-----------FDREYVE---IL 210
T+D GWGCM+R++QM+VAQA++ +R GR WR +K FD E ++ IL
Sbjct: 88 TTDCGWGCMIRTTQMMVAQAIMINRFGRNWRFVRRKKSHVTVNGEETEFDTEKMKEWMIL 147
Query: 211 HLFGDSETSPFSIHNLLQ-AGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
LF D ++P IH +++ A + G A G W P EA+ ++A T
Sbjct: 148 KLFEDKPSAPLGIHKMIEIAAREKGKRAVGCWYSPS------EAVFIMKKAITESASPLT 201
Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPR 327
+ +S D G + ++ ++H WT L+LV +V LG ++N
Sbjct: 202 GDTVMYLSID-----GRVHIRDLEVETKH----------WTKTLMLVIVVRLGAAELNRI 246
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
Y+P L F+ LGI GG+P S + VG + IYLDPH I I D++ +TS
Sbjct: 247 YVPHLMRLFSMDSCLGITGGRPDHSCWFVGYYGDQVIYLDPHVAHEYIPI---DMDFNTS 303
Query: 388 -------------TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
+YH ++ +H +DPS A+ F ++ FD
Sbjct: 304 QEDPKKPKKCPERSYHCRLLSKMHFLDMDPSCALCFRFESREQFD 348
>gi|344229797|gb|EGV61682.1| hypothetical protein CANTEDRAFT_115142 [Candida tenuis ATCC 10573]
Length = 408
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 128/268 (47%), Gaps = 37/268 (13%)
Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
I + T+DVGWGCM+R+SQ L+A +++ + + +E +++L F DSE
Sbjct: 123 IDNENFTTDVGWGCMIRTSQSLLANT---------YKRMISEDAQQE-IQLLDQFKDSEA 172
Query: 219 SPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS 276
+PFS+HN ++ L G W GP A S + L ++ G LP ++S
Sbjct: 173 APFSLHNFIRVANESPLQVKPGQWFGPNAASLSIQRLCNLVNSKENFG---LPGLSVLIS 229
Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
+ D DD + + K Q+ +L+L+P+ LG++K N Y ++
Sbjct: 230 ENSD---------LYDDKVQEF-LDKKKQS----LLILLPIRLGIDKTNEFYYSSILQLL 275
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
QS+GI GGKP +S Y G + +YLDPH Q A ++YH+ +
Sbjct: 276 NCKQSVGIAGGKPSSSFYFFGYDNDELLYLDPHYPQ--------GTNAGYNSYHTPRYQR 327
Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
+ + +DPS+ IG D D++ F A
Sbjct: 328 LTISQLDPSMMIGILVDDLQDYNTFKAE 355
>gi|238879782|gb|EEQ43420.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 446
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/331 (28%), Positives = 140/331 (42%), Gaps = 72/331 (21%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------ 162
LG N A N S++ +SYR GF+PI S
Sbjct: 68 VLGQTFDNFDTA--NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNF 125
Query: 163 ----------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
TSD GWGCM+R+SQ L+A LL K + + EI+ L
Sbjct: 126 ANLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKL 172
Query: 213 FGDSETSPFSIHNLLQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
F D +SPFSIHN ++ L +G W GP A S + L + + +P
Sbjct: 173 FQDGTSSPFSIHNFIRVASLSPLHVKSGEWFGPNAASLSIKRLTSELLQDQEIDGIKIPR 232
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
+S + D DD R VF+K + +L+L P+ LG++KVN Y
Sbjct: 233 VF--ISENSD---------LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYN 277
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
++ S GI GGKP +S Y +G ++ IY DPH Q V + + +YH
Sbjct: 278 SIFHLLASKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYH 331
Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+ +++ +DPS+ IG + D++ DF
Sbjct: 332 TTNYNRLNISLLDPSMMIGILVTNIDEYIDF 362
>gi|429850312|gb|ELA25600.1| cysteine protease atg4 [Colletotrichum gloeosporioides Nara gc5]
Length = 411
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 88/310 (28%), Positives = 124/310 (40%), Gaps = 83/310 (26%)
Query: 138 AEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVGWGCML 174
A F DF S+ ++YR F DP + S +SD GWGCM+
Sbjct: 109 AAFLDDFESKFWMTYRSEFELIAKSTDPRASSALSLSMRIKSQLVDQSGFSSDSGWGCMI 168
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYG 234
RS QML+A A+ LGR A G
Sbjct: 169 RSGQMLLANAMAITNLGR--------------------------------------VACG 190
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R ++L Q + + +Y G P V D
Sbjct: 191 KYPGEWFGPSATARCIQSLTNAQEQPS--------LRVYST--------GDGPDVYED-- 232
Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+ + + P L+LV LG++K+ P Y L PQS+GI GG+P AS Y
Sbjct: 233 -KFMKIAKPDGTRFHPTLILVGTRLGIDKITPVYWDALIAALQMPQSVGIAGGRPSASHY 291
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
+G Q YLDPH +P + D +AD T H+ +R +H+ +DPS+ IGF
Sbjct: 292 FIGAQGSFLFYLDPHHTRPALPYHSDPSRYTDADIDTAHTRRLRRLHVREMDPSMLIGFL 351
Query: 412 CRDKDDFDDF 421
+D DD+ ++
Sbjct: 352 IKDDDDWSEW 361
>gi|66810578|ref|XP_638996.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
gi|60467622|gb|EAL65643.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
Length = 551
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 59/161 (36%), Positives = 93/161 (57%), Gaps = 6/161 (3%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W P+L+L+P+ LGL+ +N Y +L F FPQ+LG+VGGKP AS Y + Q+++ YLD
Sbjct: 383 WEPLLILIPMRLGLDGLNSIYHSSLLEIFKFPQNLGVVGGKPRASLYFIAAQDDNLFYLD 442
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH VQ I + ++ + +T+ + H+ +DPSL + F+C+ KDDF+DF R+ K
Sbjct: 443 PHTVQNHIEV-ENGSKFPLNTFFCSTTKRTHVSEVDPSLVVAFFCKTKDDFNDFVERSKK 501
Query: 428 LAEESNGAPLFTVTQTHKKPVNHSDV----LGETGGVPEDD 464
+ + P+F++ + D + ETGG DD
Sbjct: 502 MTSQMEN-PIFSIFDNEPDYDSSRDYEYEEIDETGGETSDD 541
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 69/119 (57%), Gaps = 5/119 (4%)
Query: 137 LAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
+ EF DF++R+L +YR+GF I D+ +D GWGCMLRS QML++ LL + LG W+
Sbjct: 140 IKEFLNDFTTRVLWFTYRQGFPCIDDTMYDNDCGWGCMLRSGQMLLSNVLLHNILGDEWK 199
Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 254
+ + +I+ +F D ++PFSIHN+ G+ G G W P + ++ + L
Sbjct: 200 RSSSAT----HPDIISMFLDKPSAPFSIHNIAMEGQNLGKNIGEWFAPSIISQTIKILV 254
>gi|167393590|ref|XP_001740639.1| cysteine protease atg4 [Entamoeba dispar SAW760]
gi|165895180|gb|EDR22930.1| cysteine protease atg4, putative [Entamoeba dispar SAW760]
Length = 332
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 91/308 (29%), Positives = 139/308 (45%), Gaps = 38/308 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 204
I I+YRK I + T+D GWGCM+RS QM++AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96
Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
+++ I++LFGDS S FSIH L+ G+ G W GP + A AE
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP--------SFASDIAAEHIN 148
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
+ YV + G G + SK + + P ++ VPL LG E
Sbjct: 149 EMRVFRTRGYVA---KLGSIVGPKI----------EELSKDEVGFNPCIIFVPLRLGPES 195
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+ P L+ F PQ +G++GGKPG + Y + +LDPH Q I D++
Sbjct: 196 PENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAI-----DMK 250
Query: 384 ADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 441
D S +Y + ++ IDPS+++ F + +D++ F K E + LF
Sbjct: 251 GDWSYQSYFCKDNKSMNYSKIDPSISLVFLVKHVNDYEHF----KKSFENKTFSKLFIFK 306
Query: 442 QTHKKPVN 449
+K +N
Sbjct: 307 NEIEKKLN 314
>gi|390594065|gb|EIN03481.1| hypothetical protein PUNSTDRAFT_56214 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1093
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 93/274 (33%), Positives = 127/274 (46%), Gaps = 55/274 (20%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 164
F DF+SR+ ++YR F PI D+ +
Sbjct: 369 FYADFTSRVWVTYRSHFQPIRDTTLSALESDFGEQAQSANTSGNSVVSGSPSSGRRWWGG 428
Query: 165 ----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-LQKPFDR--EYVEILHLFGDSE 217
TSD GWGCMLR+ Q L+A ALL LGR WR+P +P YV++L F DS
Sbjct: 429 EKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPSYPQPTAAYASYVQLLTWFFDSP 488
Query: 218 TS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
+ PFS+H + AGK G G W GP + + L A G G VV
Sbjct: 489 SPLCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVH---AFPGGGLGVAVAVDGVV 545
Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
+ +P D+ RH + G +L+L+ + LGL+ VNP Y T++
Sbjct: 546 YETDVFSASHSP-----DSRRHHRTSTWGDRG---VLILIGIRLGLDGVNPIYYDTIKEL 597
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+T+PQS+GI GG+P +S Y VG Q +S YLDPH
Sbjct: 598 YTWPQSVGIAGGRPSSSYYFVGSQADSLFYLDPH 631
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 37/57 (64%), Gaps = 2/57 (3%)
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
A+ T+H + +R + L +DPS+ IGF CRD++++ D AR + +A++ P+F V
Sbjct: 779 AELRTFHCERVRKMPLSGLDPSMLIGFLCRDEEEWRDLRARIANMAKKFK--PIFAV 833
>gi|281210274|gb|EFA84441.1| autophagy protein 4 [Polysphondylium pallidum PN500]
Length = 734
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 62/163 (38%), Positives = 91/163 (55%), Gaps = 13/163 (7%)
Query: 281 GERGGA---PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
GE G+ P+ C D S C W I++LVP+ LGL+K+N Y ++
Sbjct: 515 GENSGSFKDPLTCSDFFSSSCI-----PQRWKSIIILVPIKLGLDKLNEVYFREIKSMLE 569
Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 397
PQS+G++GGKP S Y VG Q+E IYLDPH V ++ + + +YH V + +
Sbjct: 570 LPQSIGLIGGKPKQSFYFVGYQDEHIIYLDPHFVHDTVSPNDINF---SDSYHHCVPQKM 626
Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
+ +DPS+AIGFYC + DF+DFC R ++ E G P+ +V
Sbjct: 627 LISQLDPSMAIGFYCHTQSDFEDFCVRIKEI--EKRGFPVVSV 667
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 31/47 (65%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
N + F DF + + SYRK F PI ++ IT+D+GWGCM+R+ QM
Sbjct: 269 ANQEIDRFIADFKNILWFSYRKDFAPIENTNITTDIGWGCMVRTGQM 315
>gi|58260832|ref|XP_567826.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134117209|ref|XP_772831.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338817600|sp|P0CQ11.1|ATG4_CRYNB RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|338817601|sp|P0CQ10.1|ATG4_CRYNJ RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|50255449|gb|EAL18184.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229907|gb|AAW46309.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1193
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 82/240 (34%), Positives = 110/240 (45%), Gaps = 28/240 (11%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 208
+TSD GWGCMLR+ Q L+ AL+ LGR WR P E Y +
Sbjct: 562 LTSDAGWGCMLRTGQSLLVNALIHIHLGRDWRVPSTPASFSEATTTQEIAALKDYAKYAQ 621
Query: 209 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
+L F D S PFS+H + GK G G W GP + + LA A G+
Sbjct: 622 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 680
Query: 267 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPL 317
+ +I Y S D +P R +K + W +L+LV +
Sbjct: 681 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRRGDNEAK-EEKWGKRAVLILVGV 739
Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y VG Q YLDPH +P I +
Sbjct: 740 RLGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 799
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 5/53 (9%)
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
TYH + I+ + L +DPS+ +GF C+D+DDF+DF R ++L ++ +FTV
Sbjct: 952 TYHCEKIKKMPLSGLDPSMLLGFVCKDEDDFEDFVERVAQLPKK-----IFTV 999
>gi|366995231|ref|XP_003677379.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
gi|342303248|emb|CCC71026.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
Length = 495
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 90/321 (28%), Positives = 145/321 (45%), Gaps = 74/321 (23%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 170
F +D +R+ +YR F PI S +D+GW
Sbjct: 75 FLKDVVTRLHFTYRTRFKPIMKSPEGPSPLNFSLVIRENPIDVIENAITNPDCFNTDIGW 134
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGDSETSPFSIHNLLQ 228
GCM+R+ Q L+ L RLGR +R P +++ E I+ F D+ PFS+H +
Sbjct: 135 GCMIRTGQSLLGNTLQIVRLGRDFR---YDPENKDISENRIIEWFIDAPEKPFSLHQFIT 191
Query: 229 AG-KAYGLAAGSWVGPYAMCRSWEALAR----CQRAETGLGCQSLPMAIYVVSGDEDGER 283
G + G G W GP A RS ++L R C AE + V SGD
Sbjct: 192 EGMELSGKNPGEWFGPAATARSIQSLIRKFPDCGIAEC---------LVSVSSGD----- 237
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
+ D+ + VF+ + + +L+L+ + LGL VN Y ++R + S+G
Sbjct: 238 -----IYSDEVKQ---VFADNKKN---LLILLGVKLGLNAVNECYWDSIRHILSSKYSVG 286
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY---HSDVIRHIHLD 400
I GG+P +S Y G + + +Y DPH QP LE + +Y H++ + ++
Sbjct: 287 ISGGRPSSSLYFFGYEGDELLYFDPHSPQP-------SLEENNVSYKSCHTNKYGKLLMN 339
Query: 401 SIDPSLAIGFYCRDKDDFDDF 421
+DPS+ +GF R ++D+++F
Sbjct: 340 DMDPSMLLGFLIRGQEDWENF 360
>gi|45185039|ref|NP_982756.1| ABL191Wp [Ashbya gossypii ATCC 10895]
gi|62899767|sp|Q75E61.1|ATG4_ASHGO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|44980675|gb|AAS50580.1| ABL191Wp [Ashbya gossypii ATCC 10895]
gi|374105958|gb|AEY94868.1| FABL191Wp [Ashbya gossypii FDAG1]
Length = 521
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/311 (30%), Positives = 138/311 (44%), Gaps = 54/311 (17%)
Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 169
EF D +R+ +YR F PI G S ++ +D+G
Sbjct: 115 EFLADVHTRLHFTYRTRFVPIPRHPNGPSPMSISVMLRDNPLNVIENVLNNPDCFQTDIG 174
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+A AL LGR +R + E + I+ F D PFS+H +Q
Sbjct: 175 WGCMIRTGQSLLANALQRACLGRDFRIDDNAANEHE-LRIIKWFEDDPKYPFSLHKFVQE 233
Query: 230 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G + G G W GP A RS +AL A C I SGD
Sbjct: 234 GFSLSGKKPGEWFGPSATSRSIQALVAKFPA-----CGIAHCVISTDSGD---------- 278
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
V +D+ +F + +LLL+ + LG++ VN Y +R + S+GI GG+
Sbjct: 279 VYMDEVE---PLFRADPS--AAVLLLLCVRLGVDVVNEVYWEHIRHILSSEHSVGIAGGR 333
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT-STYHSDVIRHIHLDSIDPSLA 407
P +S Y G Q+E YLDPH +P +N+ + D + H+ +H+ IDPS+
Sbjct: 334 PSSSLYFFGYQDEHLFYLDPH--KPQLNLASYQQDLDLFRSVHTQRFNKVHMSDIDPSML 391
Query: 408 IGFYCRDKDDF 418
IG KDD+
Sbjct: 392 IGILLNGKDDW 402
>gi|328868883|gb|EGG17261.1| autophagy protein 4 [Dictyostelium fasciculatum]
Length = 616
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 84/140 (60%), Gaps = 5/140 (3%)
Query: 304 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
Q++W +++LVP+ LGL+K+N Y ++ P S+G++GGKP S Y VG Q+E
Sbjct: 426 NQSNWKSLIILVPVKLGLDKLNEIYFSGIKAMLQMPSSIGLIGGKPKQSFYFVGFQDEHI 485
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
IYLDPH V I+ + ++YH + + +H IDPS+A GFYC DF+ FC
Sbjct: 486 IYLDPHFVHDTIHPFDSNF---LNSYHDCIPQKMHFSQIDPSMAFGFYCHTYKDFEQFCI 542
Query: 424 RASKLAEESNGAPLFTVTQT 443
R ++ E++G P+ ++ +T
Sbjct: 543 RIKEI--EASGFPILSIGET 560
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 61/100 (61%), Gaps = 7/100 (7%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR---P 193
+ F +DF S + SYRK F I ++ IT+D+GWGCMLR+ QM++A+ALL H P
Sbjct: 194 VERFLEDFKSILWFSYRKDFPSIENTSITTDIGWGCMLRTGQMILARALLKHFYNNENIP 253
Query: 194 WRKPLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGK 231
+ + ++ + +Y +I+ F D S+ + +SIH ++ K
Sbjct: 254 YGEKIKT--NSKYKKIMSWFCDYPSKENFYSIHQIVHKNK 291
>gi|213403524|ref|XP_002172534.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
gi|212000581|gb|EEB06241.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
Length = 314
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/341 (29%), Positives = 140/341 (41%), Gaps = 57/341 (16%)
Query: 91 MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 150
M I ER L T S + IW LG H A + A F QD + +
Sbjct: 4 MSHILERYLRMFPTNHEPSGTFIWSLG--HSYATETGKWPEA-------FVQDTYDLLSL 54
Query: 151 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
+YRK G +SD GWGCM+RS Q ++A L R +P P+ K IL
Sbjct: 55 TYRKCI--AGMECFSSDAGWGCMIRSMQTMLANCL---RRVQP-SLPVHK--------IL 100
Query: 211 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
H F D + S+H + AG + G+W GP + L C + P
Sbjct: 101 HYFADEANAYLSLHQFVDAGHTLCNITPGNWFGPATVSHCAAHL-----------CSTHP 149
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI--LLLVPLVLGLEKVNPR 327
V DG ++ + Q TP LLL L LG++ ++
Sbjct: 150 QVGLNVCVSHDG-----------------AIMYRDQLRNTPYPRLLLFTLRLGIDTIHTS 192
Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
Y L T PQ++GIVGG+P A+ Y Q + YLDPH Q D A S
Sbjct: 193 YYEQLCHVLTIPQAIGIVGGRPRAAHYFYACQSQWFFYLDPHTTQTAHTF---DNPAPNS 249
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
++H +R + ++ +DP + +GF ++ DF R KL
Sbjct: 250 SFHVTTLRRLRINELDPCMVLGFAITSEECQTDFEQRIVKL 290
>gi|330840629|ref|XP_003292315.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
gi|325077457|gb|EGC31168.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
Length = 465
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 86/134 (64%), Gaps = 5/134 (3%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++++PL LG++++N YI L+ + PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 217 WKSLIIMIPLKLGVDRINTSYIRKLKSILSIPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 276
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH VQ ++ ++ + T+ + + + +IDPSL++GFYC+DK FDD C R SK
Sbjct: 277 PHFVQDTVDPSSNNY---SETFCGCIPQKMSFSNIDPSLSVGFYCKDKSSFDDLCDRLSK 333
Query: 428 LAEESNGAPLFTVT 441
L E++ P+ +++
Sbjct: 334 L--ENDEFPIISIS 345
>gi|405119256|gb|AFR94029.1| peptidase family C54 protein [Cryptococcus neoformans var. grubii
H99]
Length = 1185
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 81/239 (33%), Positives = 113/239 (47%), Gaps = 26/239 (10%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------QKPFDRE---------YVE 208
+TSD GWGCMLR+ Q L+ AL+ LGR WR P + ++E Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLVNALIHVHLGRDWRVPSTPASFSEATTNQETAALKDYAKYAQ 619
Query: 209 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
+L F D S PFS+H + GK G G W GP + + LA A G+
Sbjct: 620 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 678
Query: 267 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSK-GQADWTPILLLVPLV 318
+ +I Y S D +P R +K G+ +L+LV +
Sbjct: 679 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRGGDNKAKEGKWGKRAVLILVGIR 738
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
LGL+ VNP Y +++ FTFPQS+GI GG+P +S Y +G Q YLDPH +P I +
Sbjct: 739 LGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFIGSQANHLFYLDPHLTRPAIPL 797
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 5/53 (9%)
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
TYH + I+ + L +DPS+ +GF C+D+DDF+DF R ++L ++ +FTV
Sbjct: 946 TYHCEKIKKMPLSGLDPSMLLGFVCKDEDDFEDFVERVAQLPKK-----IFTV 993
>gi|321263995|ref|XP_003196715.1| hypothetical protein CGB_K2500C [Cryptococcus gattii WM276]
gi|317463192|gb|ADV24928.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1188
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/239 (33%), Positives = 109/239 (45%), Gaps = 26/239 (10%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 208
+TSD GWGCMLR+ Q L+ AL+ LGR WR P E Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLINALIHVHLGRDWRLPSTPATFSEATTSQEIAALKDYAKYAQ 619
Query: 209 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
++ F D S PFS+H + GK G G W GP + + LA A G+
Sbjct: 620 MVSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGTLKTLANS-FAPCGIAVA 678
Query: 267 SLPMAI------YVVSG--DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 318
+ +I Y S +D R RH + +G+ +L+LV +
Sbjct: 679 TATDSIIYRSDVYAASNLPSDDWNRISPTFNPSRKKKRHNAEAKEGKWGERAVLILVGIR 738
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
LGL+ VNP Y +++ FTFPQ+ G GG+P +S Y VG Q YLDPH +P I +
Sbjct: 739 LGLDGVNPIYYDSIKALFTFPQAGGSAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 797
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 34/53 (64%), Gaps = 5/53 (9%)
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
TYH + I+ + L +DPS+ +GF C+ +DDF++F R + L ++ +FTV
Sbjct: 947 TYHCEKIKKMPLSGLDPSMLLGFVCKSEDDFENFVERVALLPKK-----IFTV 994
>gi|159465677|ref|XP_001691049.1| autophagy protein [Chlamydomonas reinhardtii]
gi|158279735|gb|EDP05495.1| autophagy protein [Chlamydomonas reinhardtii]
Length = 484
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 62/167 (37%), Positives = 86/167 (51%), Gaps = 21/167 (12%)
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G++K+NP YIP L+ ++PQS+GIVGG+P AS Y+ GVQ+ S IYLDPH+ Q +
Sbjct: 339 GMDKINPVYIPQLQQVLSWPQSVGIVGGRPSASLYVCGVQDASFIYLDPHEAQLALG--- 395
Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 439
TY DV+R + +DPSLAIGF C + +D AR LA + + APL T
Sbjct: 396 --------TYFCDVVRVLPSAQLDPSLAIGFVCTSSAELEDLFARLQALATQHSSAPLMT 447
Query: 440 VTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
+T V G D + G D+W+L+
Sbjct: 448 LTTGSGAAV----------GCGSDADFTDDVLEGGTGQQQLDEWELV 484
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 65/117 (55%), Gaps = 6/117 (5%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
DF SR+ +YRK F +G S +TSDVGWGC LRS QML+A+ R G R L + +
Sbjct: 49 DFRSRMWCTYRKDFPALGPSLLTSDVGWGCTLRSGQMLLAEVRHGWRAGAMMRVALGRDW 108
Query: 203 DR-----EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 253
R E V ++ D +P SIH + AG G+ G W+GP+ +C+ EAL
Sbjct: 109 QRCSDNLEAVRPVVAALLDCAEAPLSIHRICDAGGPAGIVPGRWLGPWMLCKGLEAL 165
>gi|254584596|ref|XP_002497866.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
gi|238940759|emb|CAR28933.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
Length = 489
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 94/321 (29%), Positives = 145/321 (45%), Gaps = 53/321 (16%)
Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT-------------------- 165
+ +N +F D SR+ +YR F PI G S ++
Sbjct: 69 SKNSNENPDFLSDVRSRLHFTYRTRFMPIPAVPGGPSPLSFHFLIRENPINAIENAINNP 128
Query: 166 ----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
+DVGWGCM+R+ Q L+ AL RLGR +R + E + I+ F D +PF
Sbjct: 129 ACFNTDVGWGCMIRTGQSLLGNALQIARLGRGYR--IGSELKPEEISIIDWFVDIPDAPF 186
Query: 222 SIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
SIHN + G + G W GP A RS ++L R + CQ I V SGD
Sbjct: 187 SIHNFVSKGMELSSKRPGEWFGPAATSRSIQSLIRGFKQCGIDDCQ-----ISVSSGD-- 239
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
V +D + VF++ + + ILLL+ + LG+ VN Y ++
Sbjct: 240 --------VYEEDVMK---VFNESKD--SRILLLLGVKLGINAVNEFYWNDIKRLLGSKF 286
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
S+GI GG+P +S Y +G Q +YLDPH QP ++ + + + HS + +
Sbjct: 287 SVGIAGGRPSSSLYFIGYQGNELLYLDPHTAQPFLSPSHQE-RSFYDSCHSSNYGKLAIQ 345
Query: 401 SIDPSLAIGFYCRDKDDFDDF 421
+DPS+ IG +++F ++
Sbjct: 346 DLDPSMLIGILISGEEEFKEW 366
>gi|365988214|ref|XP_003670938.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
gi|343769709|emb|CCD25695.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
Length = 427
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 84/253 (33%), Positives = 119/253 (47%), Gaps = 30/253 (11%)
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 223
+D+GWGCM+R+ Q L+ AL LGR WR + EI F D+ PFS+
Sbjct: 55 TDIGWGCMIRTGQSLLGNALQLRNLGRDWRFDDNTDLKMTEKSNEIASWFMDTPEKPFSL 114
Query: 224 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD---- 278
H + G + G G W GP A RS ++L + E G+ I V SGD
Sbjct: 115 HRFISKGMQLSGKKPGEWFGPAATARSIQSLVH-EFPECGID----KCLISVSSGDIYKT 169
Query: 279 --EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
ED G H GQ D T IL+L+ + LG+E +N Y ++R
Sbjct: 170 EVEDVFNEG-----------HTGEARNGQKDKT-ILILLGVKLGIETINRCYWDSIRRIL 217
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
+ S+GI GG+P +S Y G Q + +Y DPH QP + K+DL +T H+
Sbjct: 218 SSEYSIGIAGGRPSSSLYFFGYQGDELLYFDPHSPQPSYD--KNDLFYETC--HTTNFGK 273
Query: 397 IHLDSIDPSLAIG 409
+ L +DPS+ +G
Sbjct: 274 LSLADMDPSMLLG 286
>gi|62899792|sp|Q8NJJ3.1|ATG4_PICPA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4; AltName:
Full=Pexophagy zeocin-resistant mutant protein 8
gi|21585563|gb|AAL25849.1| Paz8 [Komagataella pastoris]
Length = 533
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 117/267 (43%), Gaps = 50/267 (18%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 178
F D S+I ++YR GF PI K TSD GWGCM+R+SQ
Sbjct: 65 FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124
Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 237
L+A ALLF LGR W + P + E+ I+ F D PFSIHN +Q G K
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184
Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 294
G W GP A R+ + L C+ P + +Y S C D
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221
Query: 295 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
+ G +D +TPIL+L+ + LG+EKVN LR + QS+GI G K
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNLYIGDLLRECLSLKQSVGISGRKTSFLA 281
Query: 354 YI-VGVQEESAIYLDPHDVQPVINIGK 379
+ +G Q + YL P + + GK
Sbjct: 282 LLSIGFQGDYLFYLIPTFPKKALTFGK 308
>gi|183230788|ref|XP_001913481.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169802747|gb|EDS89733.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449704540|gb|EMD44766.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 330
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 89/309 (28%), Positives = 138/309 (44%), Gaps = 40/309 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 204
I I+YRK I + T+D GWGCM+RS QM +AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCINTERNI 96
Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARCQRAETG 262
+++ I++LFGDS S FSIH L+ G+ G W GP +A + E + + T
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEMRVFRT- 155
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
RG + S+ + G + P ++ VPL LG E
Sbjct: 156 --------------------RGYVAKLGSIIGSKIEELIKDG-GGFNPCIIFVPLRLGPE 194
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
+ P L+ F PQ +G++GGKPG + Y + +LDPH Q I D+
Sbjct: 195 SPENEFKPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAI-----DM 249
Query: 383 EADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
+ D S +Y + + +DPS+++ F + +D++ F K E + LFT
Sbjct: 250 KGDWSYQSYFCKDNKSMLYSKMDPSISLVFLVKHANDYEHF----KKSFENKTFSKLFTF 305
Query: 441 TQTHKKPVN 449
+K +N
Sbjct: 306 KDETEKELN 314
>gi|385305819|gb|EIF49766.1| cysteine protease atg4 [Dekkera bruxellensis AWRI1499]
Length = 476
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 150/323 (46%), Gaps = 56/323 (17%)
Query: 138 AEFNQDFSSRILISYRKGF-----DPIGDSKI-------------------TSDVGWGCM 173
++F D ++R+ +YR GF DP G S + T+D GWGCM
Sbjct: 91 SDFISDVATRLWFTYRSGFPVIKRDPDGPSPLSLGSLFRGTLDVKNASIGFTTDSGWGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRK-PLQKP---------FDREYVEILHLFGDSETSPFSI 223
+R+SQ L+A ALL +GR WR P + P +++++ +I+ F D +PFSI
Sbjct: 151 IRTSQSLLANALLNLHVGRKWRYIPAENPNGETEYAKKYEKQW-QIITWFADFPWAPFSI 209
Query: 224 HNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
+++ G + G W GP A RS L + ++ C+ + Y+ G+ D
Sbjct: 210 QQIVRYGSEHCNKKPGEWFGPSAASRSIVYLCK----QSYKACK---LNTYLTEGNGD-- 260
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
+D S + + P L+L + LG+ VNP Y L+ + QS+
Sbjct: 261 -------IYEDELLXVSCPEGTENGFRPTLILSGVRLGVXXVNPVYWAFLKKLLSIHQSV 313
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEAD-TSTYHSDVIRHIH 398
GI GG+P +S Y G Q ++ Y+DPH Q + ++ D + ++ H+ IR +
Sbjct: 314 GIAGGRPSSSHYFFGYQGDNLFYMDPHTPQTALLADHVDDADYRXEYVASVHTKRIRKLG 373
Query: 399 LDSIDPSLAIGFYCRDKDDFDDF 421
L +DPS+ IG +D+ +
Sbjct: 374 LCEMDPSMLIGLLVTSLEDYKEL 396
>gi|407043540|gb|EKE42005.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 330
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 141/315 (44%), Gaps = 41/315 (13%)
Query: 143 DFSSR-ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---L 198
DF+ I I+YRK I + T+D GWGCM+RS QM +AQ L LG W+ +
Sbjct: 33 DFARHTIWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCI 90
Query: 199 QKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARC 256
+ +++ I++LFGDS S FSIH L+ G+ G W GP +A + E +
Sbjct: 91 NTERNIFHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEM 150
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
+ T RG + S+ + G + P ++ VP
Sbjct: 151 RVFRT---------------------RGYVAKLGSIIGSKIEELIKDG-GGFNPCIIFVP 188
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
L LG E + P L+ F PQ +G++GGKPG + Y + +LDPH Q I
Sbjct: 189 LRLGPESPENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGINLYFLDPHTTQNAI- 247
Query: 377 IGKDDLEADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG 434
D++ D S +Y + + +DPS+++ F + +D++ F K E
Sbjct: 248 ----DMKGDWSYQSYFCKDNKSMLYSKMDPSISLVFLVKHANDYEHF----KKSFENKTF 299
Query: 435 APLFTVTQTHKKPVN 449
+ LFT +K +N
Sbjct: 300 SKLFTFKDETEKELN 314
>gi|340508502|gb|EGR34192.1| hypothetical protein IMG5_021070 [Ichthyophthirius multifiliis]
Length = 285
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 127/291 (43%), Gaps = 44/291 (15%)
Query: 144 FSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
F S I I+YR+ F P+ + SD GWGCM+R QM +A+ L K
Sbjct: 2 FESIIWITYRRKFPPLKAPQYEYISDTGWGCMIRVGQMALAEGL--------------KR 47
Query: 202 FDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAE 260
F + EI+ LF D + S FSI N+ +AGK + L AG W P +C + L +
Sbjct: 48 FQIKEDEIIDLFQDKKDSLFSIQNICEAGKEEFKLEAGDWFNPIRICYILQILNEKK--- 104
Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 320
G + L I +S D ++ +D S G ++L + LG
Sbjct: 105 ---GFKDL--KIRTISSDR--------ILIFEDLEMEFSSEKNG------LILFLVCKLG 145
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
LEK Y+ F + S+G++GGKP + + VG E+ IYLDPH VQ +
Sbjct: 146 LEKTEENYLKIALKIFDYKNSIGMIGGKPKKALFFVGRIEDQLIYLDPHYVQDF-----N 200
Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
D ++Y + ID S+ + +K++ F +L EE
Sbjct: 201 QNNVDQNSYFCKNYAVLDQKKIDSSIGNVLFFENKEELKMFFQFLDQLKEE 251
>gi|149246610|ref|XP_001527730.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|166990616|sp|A5DSB4.1|ATG4_LODEL RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|146447684|gb|EDK42072.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 523
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/289 (29%), Positives = 135/289 (46%), Gaps = 43/289 (14%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALL--FHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
TSD GWGCM+R+SQ L+A ALL FH G +P + +++ LF D+ ++PF
Sbjct: 179 FTSDAGWGCMIRTSQNLLANALLRLFHTTGG---QPQNFAVTKTEADVIELFQDTLSAPF 235
Query: 222 SIHNLLQAGKAYGL--AAGSWVGPYA-------MCRSWEALARCQRAETGLGCQS---LP 269
S+HN ++A + L G W GP A + + + + +R+E G S +P
Sbjct: 236 SLHNFIKAANSLSLNIKPGQWFGPSAASLSIKKLVNDYNLIQQERRSERDSGRDSGHKVP 295
Query: 270 M-----------AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG-----QADWTPILL 313
+ D +R P V + S +C ++ + + PIL
Sbjct: 296 TPNLKLHSKSADSDSDSDSDAISKRNSIPYVYV---SENCDLYDDEINAIFELEQRPILF 352
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPHDVQ 372
L P+ LG+E+VN Y ++ S+GI GGKP +S Y +G + E+ IY DPH Q
Sbjct: 353 LFPIRLGIEQVNKYYYSSILQILASKFSVGIAGGKPSSSFYFIGYEGEDDLIYFDPHLPQ 412
Query: 373 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
V + +YH+ + +D +DPS+ IG D++ +F
Sbjct: 413 IV------QTPVNLESYHTSEYSKLKIDQLDPSMMIGILIETIDEYQEF 455
>gi|145549650|ref|XP_001460504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428334|emb|CAK93107.1| unnamed protein product [Paramecium tetraurelia]
Length = 402
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/327 (26%), Positives = 139/327 (42%), Gaps = 45/327 (13%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPWRKP--LQKP 201
SS I SYRK S +TSD GWGCM+R +QM +AQ + +H +P + ++
Sbjct: 71 SSIIWFSYRKKIPQFQISSLTSDTGWGCMIRVAQMALAQVIRHYHSFTQPEQLIVLIRHF 130
Query: 202 FDREYVEILHLFGDSETS-------PFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEAL 253
D + E+++ + + PFSI ++ K + G W P + + L
Sbjct: 131 LDDDDDELINFIKQDQKNQVQYYHAPFSIQKIVYHAKVEFKKEPGDWYKPNEILETLNYL 190
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP--- 310
+ + SL M IY+ + DA + + KG +W
Sbjct: 191 FKYSQY-------SLNMQIYI---------NYQCAFILQDAIKQMFNYDKGNQEWLKECI 234
Query: 311 -------------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
I + +P +GL++VN Y+ L + T P GI+GG + YIVG
Sbjct: 235 KNNNQFISQHDKGIAIFLPARIGLQRVNQDYLEVLNILMTLPYFQGIIGGVTNRAFYIVG 294
Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
++ IYLDPH VQ N +DL ++Y I+ IH SIDPS+ + R+ +
Sbjct: 295 RIQDYLIYLDPHFVQNAQNF--EDLSKTQASYTCQNIQLIHNKSIDPSIVVCLCVRNGLE 352
Query: 418 FDDFCARASKLAEESNGAPLFTVTQTH 444
D + + +E ++ T+
Sbjct: 353 LLDLWHSLNHMKQEFQEFFFISILDTN 379
>gi|326430141|gb|EGD75711.1| pyruvate water dikinase [Salpingoeca sp. ATCC 50818]
Length = 1055
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 85/308 (27%), Positives = 142/308 (46%), Gaps = 46/308 (14%)
Query: 148 ILISYRKGFDPI-GDSKITSDVGWGCMLRSSQMLVAQALLFHRL--GRPWRKPLQKPFDR 204
+ ++YRKG+DPI GD+++TSD GWGC RS QML+AQAL+ + R R +P
Sbjct: 603 VWLTYRKGYDPIHGDAQLTSDTGWGCTYRSGQMLLAQALMSNAEPSARMQRLEGVRPSTW 662
Query: 205 EYVE----ILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 258
++ E +L +F DS + FSI ++ + G W+ P
Sbjct: 663 QHEETKRAVLSMFQDSHDPAAFFSIQHMAETSFVVRKKPGQWLSP--------------- 707
Query: 259 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 318
+ + I ++ E G R V ++D G+ W P LL++PL
Sbjct: 708 -------SEVALIIRRLNPPETGMR----VRIVNDTLLSTRRILAGEP-WMPTLLMIPLR 755
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE--SAIYLDPHDVQPVIN 376
GL+ + P +P F +P +G +GGKPG++ Y VG+ + +YLDPH + ++
Sbjct: 756 AGLDTLQPESVPAFVAFFDWPWCVGAIGGKPGSAYYYVGIDHDRRRVLYLDPHTTRSRLD 815
Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG-- 434
+ +A T D ++ + + S+ +G + + D + R + E+ +G
Sbjct: 816 LSN---QAAEKTCVPDKLKSMDMSKSCSSICVGLFLPELRDLTELVQRYKR--EQLSGMW 870
Query: 435 -APLFTVT 441
PLF V
Sbjct: 871 STPLFHVV 878
>gi|302833489|ref|XP_002948308.1| autophagy protein [Volvox carteri f. nagariensis]
gi|300266528|gb|EFJ50715.1| autophagy protein [Volvox carteri f. nagariensis]
Length = 391
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 85/155 (54%), Gaps = 18/155 (11%)
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G++K+NP Y+P L+ T+PQS+GIVGG+P AS Y+ GVQ+ S ++LDPH+ QP + G
Sbjct: 216 GMDKINPVYLPQLQRILTWPQSVGIVGGRPSASLYLCGVQDSSFLFLDPHEAQPTVRWGI 275
Query: 380 DDLEADT-----------------STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
T +TY D +R + ++DPS+AIGF C D +D
Sbjct: 276 AGDAGHTKEAGNGGSAVVLPASSLATYFCDTVRLMPATALDPSMAIGFLCMGAADLEDLF 335
Query: 423 ARASKLAEESNGAPLFTVTQ-THKKPVNHSDVLGE 456
R LA+E + APL T+T T + V D GE
Sbjct: 336 TRLDALAKEHSLAPLMTLTSGTAQAGVGLEDDFGE 370
>gi|255082892|ref|XP_002504432.1| predicted protein [Micromonas sp. RCC299]
gi|226519700|gb|ACO65690.1| predicted protein [Micromonas sp. RCC299]
Length = 196
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 75/122 (61%), Gaps = 11/122 (9%)
Query: 308 WTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
W P+++LVPLVLGL++ VNPRY+P + PQS+GI+GGKP AS Y VG Q+E YL
Sbjct: 75 WAPLVILVPLVLGLDRCVNPRYVPGIVRMLGLPQSVGILGGKPCASLYFVGAQDEELFYL 134
Query: 367 DPHDVQPVINIGK----------DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
DPH VQ + + + + T TYH + H++ +DPS+ +GFYCR +
Sbjct: 135 DPHTVQLAVPLEQIWGCAQTGSPESGPFPTETYHCRSVLHMNARELDPSMVLGFYCRTRA 194
Query: 417 DF 418
DF
Sbjct: 195 DF 196
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 26/45 (57%), Positives = 31/45 (68%)
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
F SR+ I+YR+GF IG T+D GWGC LRS QML+A AL H
Sbjct: 1 FHSRVWITYRRGFPQIGGGTYTTDAGWGCTLRSGQMLLANALQSH 45
>gi|443917360|gb|ELU38094.1| peptidase family c54 domain-containing protein [Rhizoctonia solani
AG-1 IA]
Length = 808
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 92/287 (32%), Positives = 126/287 (43%), Gaps = 71/287 (24%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 164
F +DF+S I ++YR + PI D+ +
Sbjct: 142 FYEDFTSLIWLTYRSHYTPIRDTSLESLAPLGPCDMEMAPAHLVPASPRRWNWPGSADKS 201
Query: 165 -TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDSET-- 218
TSD GWGCMLR+ Q L+A AL+ LGR WR+P F E YV+IL F D+ +
Sbjct: 202 WTSDAGWGCMLRTGQSLLANALIHLHLGRNWRRPHYPMFAEEHAVYVKILTWFFDTPSPL 261
Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA----RCQRAETGLGCQSLPMAIYV 274
+PF +H + AGKA G G+W GP S + LA CQ + L A V
Sbjct: 262 APFGVHRMALAGKALGKDVGTWFGPSTAAGSIKTLAHAFPECQLS-VSLAVDGTVFASDV 320
Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTL 332
+ G V + SK G+A +L+LV + LGL+ VNP Y L
Sbjct: 321 YAASHMGM-----VTTSGRSISSRRSASKWGGRA----VLILVNIRLGLDNVNPIYYDAL 371
Query: 333 RLTFTFPQSLGIVGGKP--GASTYIVGVQEESAIYLDPHDVQPVINI 377
++ G+P G+S Y VG Q +S YLDPH +P I +
Sbjct: 372 KV------------GRPRQGSSYYFVGSQADSLFYLDPHHTRPYIPL 406
Score = 45.4 bits (106), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 30/46 (65%)
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
A+ T+H D +R + + ++DPS+ +GF CRD D+ DF R + ++
Sbjct: 519 AELRTFHCDRVRKMPMSALDPSMLLGFLCRDDADWKDFRTRVADVS 564
>gi|167526339|ref|XP_001747503.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773949|gb|EDQ87583.1| predicted protein [Monosiga brevicollis MX1]
Length = 355
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 142/323 (43%), Gaps = 32/323 (9%)
Query: 145 SSRILISYRKGFDPIGDS-KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
S+ + +YR IGDS + +D GWGC LR QM+V +AL R + K L P +
Sbjct: 52 SAFLWFTYRNSEYAIGDSPRHKTDRGWGCTLRVGQMIVGEALQRCHCPRDYDK-LSYPSE 110
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
+ IL F D S+H + K G AG W P + Q A +
Sbjct: 111 AARMSILKEFEDRPDRVLSVHAMAMQSKFVGKRAGQWHTPTDVAHVLRLAVNEQEA---M 167
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
G Q ++V +V +DD + +F +A LL VPL LG++
Sbjct: 168 GLQ-----VHVAMD---------SMVVLDDLRK---LFRADRA----TLLFVPLRLGIDI 206
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
V IP ++ F P +LGI+GG+PGA+ Y +G + + + LDPH Q + G D
Sbjct: 207 VQAEMIPAVKRFFHSPSALGIMGGRPGAAHYFIGYMDHNLLLLDPHTTQDPLRAGSQDAL 266
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV--T 441
+ + LD +DP++ + F D++ F + EE+ G LF++ T
Sbjct: 267 VSCRCSRPML---LDLDKVDPTMCLAFLLTDEESLQRFADDYNASVEET-GVRLFSMLDT 322
Query: 442 QTHKKPVNHSDVLGETGGVPEDD 464
++ V + L E +DD
Sbjct: 323 KSFASSVAVASSLAEEEEFSDDD 345
>gi|440297742|gb|ELP90383.1| cysteine protease atg4, putative [Entamoeba invadens IP1]
Length = 330
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 127/278 (45%), Gaps = 29/278 (10%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDR 204
I ++YRK + + TSD GWGCM+RS QM +AQ+ + +G W + Q ++
Sbjct: 38 IWVTYRKNMKELPGGR-TSDSGWGCMIRSMQMALAQSFVSLVMGNSWKFTKTGFQVERNK 96
Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
++ I++LFGD S FSIHNL+ G+ G W GP S+ + T
Sbjct: 97 FHLRCIINLFGDGPGSLFSIHNLISRSTTRGVGDGKWWGP-----SFASEIAADHLNT-- 149
Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
I+V R G V S+ + P ++ VPL LG
Sbjct: 150 --------IHVFRTRGYVARLGRIV------KPDILDISEDNGNILPTIIFVPLRLGPVN 195
Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
+ P L+ F PQ +G+VGGKP + + YLDPH Q +++ D
Sbjct: 196 AEEDFRPILKKVFDIPQCVGMVGGKPNLAFFFHTFDGNLLYYLDPHTTQNAVSM---DGG 252
Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+Y + ++ + ++DPS+++ F ++KDDF+ F
Sbjct: 253 WSAESYFCNDVKSMKYKNLDPSVSLLFLIKNKDDFNKF 290
>gi|223590151|sp|A5DEF7.2|ATG4_PICGU RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|190345638|gb|EDK37561.2| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
6260]
Length = 402
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 94/328 (28%), Positives = 139/328 (42%), Gaps = 89/328 (27%)
Query: 136 GLAEFNQDFSSRILISYRKGFDPI---------------------------------GDS 162
G +E + R +SYR GF+PI +
Sbjct: 75 GDSEVQKQVKKRYWMSYRSGFEPIKKHEDGPSPLSFVQSMIFNKNVGNTFANIHSLVDND 134
Query: 163 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 221
T+DVGWGCM+R+SQ ++A A+ DR E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177
Query: 222 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 278
S+HN ++ L G W GP A S + L + + T ++P+++ V SGD
Sbjct: 178 SLHNFVKVASDSPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232
Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
DD Q P+LLL+PL LG++ VN Y +L
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
PQS GI GGKP +S Y G Q S +YLDPH Q V A +YHS + +
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV--------SAGVGSYHSSSYQKLD 322
Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARAS 426
+ +DPS+ G ++ +D+ D R +
Sbjct: 323 ISDMDPSMMAGIVLKNNEDYTDLKRRTT 350
>gi|291238482|ref|XP_002739158.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
kowalevskii]
Length = 338
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 83/143 (58%), Gaps = 5/143 (3%)
Query: 302 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
S+ W +++L+P+ LG E++NP YI ++ FT +GI+GGKP S Y +G QE+
Sbjct: 156 SRSSQLWCSVIILIPVRLGGEELNPVYISCIKSLFTLKHCIGIIGGKPKHSLYFIGFQED 215
Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
I+LDPH Q V+++ D ++H R + L +DPS IGFYC+ +DDF +F
Sbjct: 216 KLIHLDPHLCQDVVDMRSRDFPL--QSFHCMSPRKMSLMKMDPSCTIGFYCKTQDDFKEF 273
Query: 422 CARASKLAEESNGA---PLFTVT 441
C+ A ++ + + P+F +
Sbjct: 274 CSYAQEVLDSTKHVGDYPMFIFS 296
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 62/100 (62%), Gaps = 6/100 (6%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDE--ALGDAAGNNGLA---EFNQDFSSRILISYRKGF 156
S+T S T IWLLG C+ D+ +A ++ L F +DF+SR+ ++YR+ F
Sbjct: 42 SQTNFSYHTP-IWLLGECYHHRPDDPNETEQSAEDDCLTPMERFKRDFTSRLWLTYRREF 100
Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
+ + +T+D GWGCMLRS QM++AQ+ L H LGR +++
Sbjct: 101 QQLAGTSLTTDCGWGCMLRSGQMMLAQSFLTHFLGRVYKQ 140
>gi|342186623|emb|CCC96110.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 388
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 127/279 (45%), Gaps = 39/279 (13%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
+ SYR+ F+P+ + TSDVGWGC +R+ QM++A A + +R G D V
Sbjct: 94 LYFSYRRQFEPLRNGA-TSDVGWGCTIRACQMMLAWAFMRYRNGG------SVTMDDNVV 146
Query: 208 EIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
+ L LF D T+PF IH + G +G+ G W GP M + AL R+ G
Sbjct: 147 DSLKEFTQRLFYDVPTAPFGIHAMTNEGVRHGVTCGMWFGPTPMAKVIGALNEAYRSSGG 206
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
G + L + D + G VV S+H ++LL+P+ LG +
Sbjct: 207 EGPEVLVAS--------DRQIGVQDVVVRLQRSQH-------------VVLLIPVKLGPQ 245
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
V+ Y L+ F S+G VGG+ ++ + G Q + I+LDPH VQ +
Sbjct: 246 TVSVTYANALKRFFEMGSSIGAVGGEKNSAYFFFGYQGDKIIHLDPHYVQCALT------ 299
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+++ + R + + + S +GFY D+ D F
Sbjct: 300 SPNSNGTLAGTWRSLPVMQCNTSALLGFYVSSCDELDQF 338
>gi|146420060|ref|XP_001485988.1| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
6260]
Length = 402
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 94/328 (28%), Positives = 139/328 (42%), Gaps = 89/328 (27%)
Query: 136 GLAEFNQDFSSRILISYRKGFDPI---------------------------------GDS 162
G E + R +SYR GF+PI +
Sbjct: 75 GDLEVQKQVKKRYWMSYRLGFEPIKKHEDGPLPLSFVQSMIFNKNVGNTFANIHSLVDND 134
Query: 163 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 221
T+DVGWGCM+R+SQ ++A A+ DR E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177
Query: 222 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 278
S+HN ++ L G W GP A S + L + + T ++P+++ V SGD
Sbjct: 178 SLHNFVKVASDLPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232
Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
DD Q P+LLL+PL LG++ VN Y +L
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
PQS GI GGKP +S Y G Q S +YLDPH Q V A +YHS + + +
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV--------SAGVGSYHSSLYQKLD 322
Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARAS 426
+ +DPS+ G ++ +D+ D R +
Sbjct: 323 ISDMDPSMMAGIVLKNNEDYTDLKRRTT 350
>gi|169622773|ref|XP_001804795.1| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
gi|160704853|gb|EAT78153.2| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
Length = 357
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 110/230 (47%), Gaps = 42/230 (18%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
N + F DF SR+ ++YR GF PI S+ TSD G+GCM
Sbjct: 91 NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150
Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
+RS Q ++A AL RLGR WR + D+++ EIL LF D +PFSIH ++ G A
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209
Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G G W GP A R + LA R E GL +Y VSGD GA V +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVY-VSGD------GADVY--E 252
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
D + +V G W P L+LV LG++K+ P Y L++ P L
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKIREMDPSML 300
>gi|406698456|gb|EKD01693.1| hypothetical protein A1Q2_04064 [Trichosporon asahii var. asahii
CBS 8904]
Length = 1295
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 117/281 (41%), Gaps = 49/281 (17%)
Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
D G A N GL+ SR G+ G+ +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559
Query: 185 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 228
L+ LGR WR P QKP YV +L F D S PFS+H
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
GK G G W GP + + LA S P V DG + V
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLA-----------NSFPPCGLSVVSAADGSVFRSEV 668
Query: 289 VCIDDASRHCSVFSKGQADWTP------------ILLLVPLVLGLEKVNPRYIPTLRLTF 336
AS + ++ G P +L+++P LGL+ VNP Y ++
Sbjct: 669 Y---QASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK--- 722
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
S+GI GG+P +S Y V Q S YLDPH +P + +
Sbjct: 723 ----SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 31/49 (63%)
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
E T+H D ++ + L +DPS+ +GF C ++ +F+DFC R S+L +
Sbjct: 931 ETALKTFHCDRVKKLPLSGLDPSMLLGFLCTNEAEFEDFCERVSRLPHK 979
>gi|216963257|gb|ACJ73915.1| autophagy-related 4b variant 3 [Zea mays]
Length = 178
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/163 (39%), Positives = 88/163 (53%), Gaps = 32/163 (19%)
Query: 38 SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHER 97
S+ K S+LS +F ++FE + S++ A K + A R+ +RR+
Sbjct: 45 SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRI-----LRRVS-- 97
Query: 98 VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
++E G + ++G A F +DFSSRI I+YRKGFD
Sbjct: 98 -------------------------PEEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFD 132
Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
I SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 133 AIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 175
>gi|401886473|gb|EJT50506.1| hypothetical protein A1Q1_00204 [Trichosporon asahii var. asahii
CBS 2479]
Length = 1295
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 117/281 (41%), Gaps = 49/281 (17%)
Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
D G A N GL+ SR G+ G+ +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559
Query: 185 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 228
L+ LGR WR P QKP YV +L F D S PFS+H
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
GK G G W GP + + LA S P V DG + V
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLA-----------NSFPPCGLSVVSAADGSVFRSEV 668
Query: 289 VCIDDASRHCSVFSKGQADWTP------------ILLLVPLVLGLEKVNPRYIPTLRLTF 336
AS + ++ G P +L+++P LGL+ VNP Y ++
Sbjct: 669 Y---QASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK--- 722
Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
S+GI GG+P +S Y V Q S YLDPH +P + +
Sbjct: 723 ----SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 31/49 (63%)
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
E T+H D ++ + L +DPS+ +GF C ++ +F+DFC R S+L +
Sbjct: 931 ETALKTFHCDRVKKLPLSGLDPSMLLGFLCTNEAEFEDFCERVSRLPHK 979
>gi|258566559|ref|XP_002584024.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237907725|gb|EEP82126.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 377
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/327 (27%), Positives = 128/327 (39%), Gaps = 105/327 (32%)
Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 174
A F DF SRI I+YR F I SK T+D GWGCM+
Sbjct: 90 AAFLDDFESRIWITYRSNFPAIPKSKDPNAQQALTFSVRLRSQLLDTRGFTTDTGWGCMI 149
Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 233
RS Q L+A ALL +LGR WR+ + + + +L LF D +PFSIH ++ G A
Sbjct: 150 RSGQSLLANALLIQKLGRDWRRGSET---GKEIALLSLFADRPQAPFSIHRFVEHGAAAC 206
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G G W GP A ARC C+ + +YV S D +D
Sbjct: 207 GKHPGEWFGP-------SATARCIDE-----CEHAGLNVYVTSDGSD---------VHED 245
Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
R + G D P L+L+ + LG++ + P Y L+ +PQS+GI G
Sbjct: 246 KFRQIA----GLDDIKPTLILLGVRLGIDSITPVYWDALKAIIQYPQSVGIAG------- 294
Query: 354 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 413
+H+ +DPS+ IGF +
Sbjct: 295 ------------------------------------------RLHIKEMDPSMLIGFLIK 312
Query: 414 DKDDFDDFCARASKLAEESNGAPLFTV 440
+ DD+ D+ R + G P+ V
Sbjct: 313 NNDDWHDWKHR----VRSAPGKPIIHV 335
>gi|336368847|gb|EGN97189.1| cysteine protease required for autophagy [Serpula lacrymans var.
lacrymans S7.3]
Length = 873
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 108/353 (30%), Positives = 149/353 (42%), Gaps = 76/353 (21%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 164
G+N F DF+SRI ++YR F PI DS +
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350
Query: 165 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRP-WRKPLQKPFDRE---YVEILHLFG 214
TSD GWGCMLR+ Q L+A ALL LGR WR+P + YV+I+ F
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRADWRRPPYPVHTTDYATYVQIITWFF 410
Query: 215 D--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI 272
D S SPFS+H + AGK G G W GP + + L E GLG +
Sbjct: 411 DTPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGV 469
Query: 273 YVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
S + A I RH V G+A +++L+ + LGL+ VNP Y T+
Sbjct: 470 IFQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTI 520
Query: 333 RLT-----------FTFPQSLGIVGGKPGASTYIV----------GVQEESAIYLDPHDV 371
+++ T P + G P AS I G E + LDP
Sbjct: 521 KVSIRTLRPYRWILMTVPYTSGFNASLP-ASPEISSDMDVRELGWGDSEGAGEALDPMAE 579
Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
V D L T+H D +R + + +DPS+ +GF C+D++D+ DF R
Sbjct: 580 HYVNAYSPDQLR----TFHCDRVRKMPMSGLDPSMLLGFLCKDENDWFDFRRR 628
>gi|37362688|ref|NP_014176.2| Atg4p [Saccharomyces cerevisiae S288c]
gi|61252248|sp|P53867.2|ATG4_YEAST RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|166990654|sp|A6ZRL7.1|ATG4_YEAS7 RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|1173491|gb|AAA86498.1| ORF494 [Saccharomyces cerevisiae]
gi|151944321|gb|EDN62599.1| cysteine protease [Saccharomyces cerevisiae YJM789]
gi|190409197|gb|EDV12462.1| anchor protein [Saccharomyces cerevisiae RM11-1a]
gi|285814439|tpg|DAA10333.1| TPA: Atg4p [Saccharomyces cerevisiae S288c]
gi|323352870|gb|EGA85172.1| Atg4p [Saccharomyces cerevisiae VL3]
gi|392297128|gb|EIW08229.1| Atg4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 409 GFYCRDKDDF 418
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|349580723|dbj|GAA25882.1| K7_Atg4p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 409 GFYCRDKDDF 418
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|365763488|gb|EHN05016.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 494
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 409 GFYCRDKDDF 418
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|167521501|ref|XP_001745089.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776703|gb|EDQ90322.1| predicted protein [Monosiga brevicollis MX1]
Length = 392
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 86/310 (27%), Positives = 142/310 (45%), Gaps = 49/310 (15%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 198
+ D ++RI +YRK F P+ S+ T+DVGWGCMLR QM++A L+ +
Sbjct: 119 QLEDDVATRIWFTYRKDFPPLPSSRRTTDVGWGCMLRCGQMILATTLM----------AV 168
Query: 199 QKPFDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
+P + HL +++ N L+AG+ G ++ VG + + ALA+
Sbjct: 169 LQP------RVHHLLK------YTMENHHLKAGRFQGPSS---VGSALLHQVPSALAQLN 213
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 317
+ + + + Y S + I D R +GQA++ PI+L++PL
Sbjct: 214 QFRD----EEVKLRTYFASD----------TLVILDQLRP----EEGQAEFEPIMLVLPL 255
Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
LG+EK+ P+Y L+L P +G +GG + YI G Q LDPH +
Sbjct: 256 RLGIEKIGPQYHARLQLLLRQPWCMGFIGGHDKRAMYIFGYQGHQYFGLDPHRCSAAVAQ 315
Query: 378 GKDDLEAD----TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEES 432
+L ++H+ + I D +DPSLA+ R ++ DD + + +E+
Sbjct: 316 STAELRDRWVEVRDSFHTSKLSGIERDDLDPSLAVFLLARTAEELDDMLSVIGQPTSEDR 375
Query: 433 NGAPLFTVTQ 442
G L +V Q
Sbjct: 376 PGPALVSVVQ 385
>gi|323307493|gb|EGA60764.1| Atg4p [Saccharomyces cerevisiae FostersO]
Length = 494
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 409 GFYCRDKDDF 418
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|323346814|gb|EGA81093.1| Atg4p [Saccharomyces cerevisiae Lalvin QA23]
Length = 494
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 409 GFYCRDKDDF 418
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|256272398|gb|EEU07381.1| Atg4p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 409 GFYCRDKDDF 418
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|323335883|gb|EGA77161.1| Atg4p [Saccharomyces cerevisiae Vin13]
Length = 494
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 409 GFYCRDKDDF 418
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|260823874|ref|XP_002606893.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
gi|229292238|gb|EEN62903.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
Length = 384
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 80/139 (57%), Gaps = 6/139 (4%)
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
+W +++L+P+ LG E +NP Y P ++ FT LG++GG+P S Y VG QE+ I+L
Sbjct: 203 NWCSVIILIPVRLGGESLNPIYEPCIKGLFTMDHCLGVIGGRPKHSLYFVGFQEDKLIHL 262
Query: 367 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
DPH Q V+++ D + ++H R + + +DPS IGFYCR +DDF+ FC +
Sbjct: 263 DPHFCQEVVDMTPRDFPLE--SFHCMNPRKMSIARMDPSCTIGFYCRTRDDFNKFCTTVT 320
Query: 427 KLAEESNGA----PLFTVT 441
+ G P+F V+
Sbjct: 321 EEMLRQPGPKADYPMFIVS 339
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 7/91 (7%)
Query: 113 IWLLGVCHKIAQDE------ALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKIT 165
IWL GVC+ +E L D+ E F +DF+S++ ++YR+ F + S T
Sbjct: 88 IWLQGVCYHRRNEELTKELEPLTDSDRRLYTMELFKRDFASKVWLTYRREFPQLAGSMFT 147
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
+D GWGCMLRS QML+A L+ H LGR +++
Sbjct: 148 TDCGWGCMLRSGQMLLAGGLVMHFLGRVYKQ 178
>gi|323303340|gb|EGA57136.1| Atg4p [Saccharomyces cerevisiae FostersB]
Length = 494
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 85 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGBIYENEVEKV 257
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 409 GFYCRDKDDF 418
G + + D+
Sbjct: 358 GILIKGEKDW 367
>gi|1183991|emb|CAA93375.1| N1274 [Saccharomyces cerevisiae]
gi|1302243|emb|CAA96126.1| unnamed protein product [Saccharomyces cerevisiae]
Length = 506
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 369
Query: 409 GFYCRDKDDF 418
G + + D+
Sbjct: 370 GILIKGEKDW 379
>gi|259149141|emb|CAY82383.1| Atg4p [Saccharomyces cerevisiae EC1118]
Length = 506
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ I
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 369
Query: 409 GFYCRDKDDF 418
G + + D+
Sbjct: 370 GILIKGEKDW 379
>gi|145526665|ref|XP_001449138.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416715|emb|CAK81741.1| unnamed protein product [Paramecium tetraurelia]
Length = 406
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 159/373 (42%), Gaps = 60/373 (16%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
N + + QD I I+YR+ F P+ S SD GWGCMLR QM +AQ L H
Sbjct: 57 NKIKQLVQD---TIWITYRRNFPPLYQSNYISDTGWGCMLRVGQMAMAQMLKKHLKNHGD 113
Query: 195 RKPLQKPFDREYVEILHLFGDSETS----------------------PFSIHNL-LQAGK 231
++ D +Y IL F D+++ PFSI + A K
Sbjct: 114 KR------DEDYDNILLAFADNDSQECKEFIEFQNKKEKQKVHNFICPFSIQKIAYLAKK 167
Query: 232 AYGLAAGSWVGP-YAM-----------CRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
+ L G W P Y + R+ E L ++ L L ++ + +
Sbjct: 168 EFNLDPGEWYKPNYILFLLEELHNTIPIRASENLKLSVFNDSCLFLDQLMNRMFDIKFET 227
Query: 280 DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
D + +++ + SK + + V +GL++ N +Y+ L P
Sbjct: 228 DKD--------LEEQLEKTQLKSKN-----SLAIFVLTRIGLDEPNQKYLKVLDELMELP 274
Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK--DDLEADTSTYHSDVIRHI 397
GIVGG P + YI+G + IYLDPH VQ N G+ ++ + ++Y I +
Sbjct: 275 YFQGIVGGTPKRAFYILGRINDHYIYLDPHYVQEAENKGQIIENKMFNRTSYSCKYIHLL 334
Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGET 457
+ +D S+ + +Y R+K + F K+ ++S+ +F ++ T + V++S+ L E+
Sbjct: 335 NQKHVDTSMGLSYYIRNKSELLQFWRDMKKIKQKSDDFFIF-LSDTTPEYVDYSNQLEES 393
Query: 458 GGVPEDDSLGVMS 470
DD + +
Sbjct: 394 SNKLNDDDVVFLQ 406
>gi|401624007|gb|EJS42084.1| atg4p [Saccharomyces arboricola H-6]
Length = 494
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 86/310 (27%), Positives = 132/310 (42%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 169
EF D SR+ +YR F PI G S ++ +D+G
Sbjct: 85 EFLLDVRSRVNFTYRTRFIPIPRAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 144
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R +K RE +I+ F D+ +PFSIHN +
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVDNEKSLKRES-KIVTWFNDTPEAPFSIHNFVST 203
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L C + V SGD +
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIYGFPE-----CGITDCVVSVSSGDI--YQNEVEK 256
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ +++ + IL L+ + LG+ VN Y ++ +S+GI GG+
Sbjct: 257 IYVENPD-------------SIILFLLGVKLGINAVNESYRESICGILNSARSVGIAGGR 303
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q +Y DPH QP + E+ + H+ + L +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNQFLYFDPHIPQPAVE------ESFVESCHTSKFGKLQLSEMDPSMLI 357
Query: 409 GFYCRDKDDF 418
G + ++D+
Sbjct: 358 GVLIKGEEDW 367
>gi|358339268|dbj|GAA47364.1| autophagy-related protein 4 [Clonorchis sinensis]
Length = 700
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 84/149 (56%), Gaps = 10/149 (6%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAI 364
A W P+LL +PL LGL + NP Y ++ P S+GI+GG+P + +IVG +E +
Sbjct: 259 ATWRPLLLFIPLRLGLHQPNPCYFNAIKAILQIPHSIGIMGGRPSHAVWIVGTAGDEDLL 318
Query: 365 YLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
LDPH QP +DDL A D T+H D + L+ +DPS+ IGF C +D+FD CA
Sbjct: 319 CLDPHTTQPA---SQDDLTAEDDVTHHCDCPVRLPLERLDPSMVIGFVCTTEDEFDQLCA 375
Query: 424 RASK---LAEESNGAPLFTVTQTHKKPVN 449
+ E + G PLF V ++ +P N
Sbjct: 376 HLERDVLSVETTCGHPLFEVHKS--RPSN 402
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 35/66 (53%), Gaps = 3/66 (4%)
Query: 179 MLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
M++A+A+ LG+ WR P + D Y + +F D ++S +SI N+ G A
Sbjct: 1 MMLAEAITRIHLGKDWRWTPGCQ--DEAYCRLRRMFQDHKSSLYSIQNITMLGMALDKPI 58
Query: 238 GSWVGP 243
GSW GP
Sbjct: 59 GSWFGP 64
>gi|402593880|gb|EJW87807.1| hypothetical protein WUBG_01286, partial [Wuchereria bancrofti]
Length = 216
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 84/154 (54%), Gaps = 14/154 (9%)
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
+W P+L+++PL LGL +N Y P ++ F PQ +GI+GG+P + Y G+ + + +YL
Sbjct: 28 EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 87
Query: 367 DPHDVQPVINIG--------KDDL------EADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
DPH Q +++ +DD E STYH I +D +DPSLA+GF+C
Sbjct: 88 DPHFCQNFVDLDETTTTRDERDDYVEIKNDEFKDSTYHCPFILSTKIDKVDPSLALGFFC 147
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
+DD+++ R ++ PLF + +T K
Sbjct: 148 HTEDDYNELAKRLRTHLLPASTPPLFEMLETRPK 181
>gi|390344344|ref|XP_786847.3| PREDICTED: uncharacterized protein LOC581768 [Strongylocentrotus
purpuratus]
Length = 1018
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 81/144 (56%), Gaps = 10/144 (6%)
Query: 113 IWLLGVC-HKIAQDEALGDAAGNNGLAE-----FNQDFSSRILISYRKGFDPIGDSKITS 166
IW LG C H+ +D G + + F QDFSSR+ ++YR+ F + S TS
Sbjct: 346 IWFLGKCYHQRPEDPDPERPPGMDSVRSMVIEMFKQDFSSRLWMTYRREFPTLAGSNFTS 405
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDS--ETSPFS 222
D GWGCMLRS QM++A +L+ H LGR W KP + + + +I+ FGD + SPFS
Sbjct: 406 DCGWGCMLRSGQMMLAHSLILHFLGREWNIYKPQTQEMLQFHRQIVRWFGDQPLDMSPFS 465
Query: 223 IHNLLQAGKAYGLAAGSWVGPYAM 246
+H L+ G+ G G W GP ++
Sbjct: 466 VHRLVGIGQNNGKKVGDWYGPSSV 489
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 83/154 (53%), Gaps = 6/154 (3%)
Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
ID + S ++G W +++++P+ LG ++VNP YI ++ FT LGI+GGKP
Sbjct: 819 IDPSRSRTSTSTEGGKPWCAVVIMIPVRLGGDEVNPVYIRPIQSLFTLESCLGIIGGKPK 878
Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
S + VG QEE I+LDPH Q V+++ D ++H R + + +DPS IGF
Sbjct: 879 HSLFFVGFQEEKLIHLDPHYCQQVVDMKTRDFPL--WSFHCMSPRKMSISKMDPSCTIGF 936
Query: 411 YCRDKDDFDDFCAR----ASKLAEESNGAPLFTV 440
Y R ++ F+ C S L S+ P+F V
Sbjct: 937 YIRTEEQFEQLCKELPTVVSPLGSHSSDYPMFIV 970
>gi|149422017|ref|XP_001518728.1| PREDICTED: cysteine protease ATG4D-like [Ornithorhynchus anatinus]
Length = 286
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 50/141 (35%), Positives = 79/141 (56%), Gaps = 3/141 (2%)
Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
+A+W I++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +
Sbjct: 109 EAEWKSIIILVPVRLGGETLNPAYMPCIKELLRMEPCLGIIGGKPKHSLYFIGYQDDFLL 168
Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
YLDPH QP ++ KD + ++H R + +DPS +GFY + DF+ C++
Sbjct: 169 YLDPHYCQPCVDTMKDSFPLE--SFHCTAPRKLPFAKMDPSCTVGFYAGTRKDFEALCSQ 226
Query: 425 -ASKLAEESNGAPLFTVTQTH 444
L + P+FTV + H
Sbjct: 227 LLQALNSTATRYPMFTVAEGH 247
>gi|170572866|ref|XP_001892265.1| Peptidase family C54 containing protein [Brugia malayi]
gi|158602497|gb|EDP38912.1| Peptidase family C54 containing protein [Brugia malayi]
Length = 440
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/155 (33%), Positives = 83/155 (53%), Gaps = 16/155 (10%)
Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
+W P+L+++PL LGL +N Y P ++ F PQ +GI+GG+P + Y G+ + + +YL
Sbjct: 252 EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 311
Query: 367 DPHDVQPVINIG---------------KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
DPH Q +++ K+D E STYH I +D +DPSLA+GF+
Sbjct: 312 DPHFCQNFVDLDEATTTKDERGDYVEIKND-EFRDSTYHCPFILSTKIDKVDPSLALGFF 370
Query: 412 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
C +DD+ + R ++ PLF + +T K
Sbjct: 371 CHTEDDYSELANRLRTHLLPASTPPLFEMLETRPK 405
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 57/122 (46%), Gaps = 28/122 (22%)
Query: 128 LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
LG+ + G +A + +S + +YRK F PIG + T+D GWGCMLR QML+A+ L+
Sbjct: 59 LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 118
Query: 187 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 241
LGR W +DR EY IL G SE G G W
Sbjct: 119 VRHLGRNWL------WDRDVMLTEYKRILPNMGVSE----------------GKEIGEWF 156
Query: 242 GP 243
GP
Sbjct: 157 GP 158
>gi|145553267|ref|XP_001462308.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430147|emb|CAK94935.1| unnamed protein product [Paramecium tetraurelia]
Length = 389
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 137/340 (40%), Gaps = 45/340 (13%)
Query: 130 DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-LFH 188
D A + + + F I SYR + S +TSD GWGCMLR QM + Q + F+
Sbjct: 47 DLAVDQKMEKLKSLFEGTIWFSYRSKILQLQYSTLTSDTGWGCMLRVGQMAMCQQIKYFY 106
Query: 189 RLGRPWRKPLQKPFDREYVEILHLFGDSE-------------------TSPFSIHNLL-Q 228
L +E E++ F D++ SPFSI ++ Q
Sbjct: 107 NLSSS----------QELTELIQQFADNDEEELSKFMDRNDGDQTIQYKSPFSIQKIVVQ 156
Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED---GERGG 285
+ G W P + + L R + + L +++S + GG
Sbjct: 157 TKLELQKSPGEWYKPNDILFVLKYLFRYSKYQKNLRMHINHENAFILSDVISLMFNKNGG 216
Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
D KGQ D + + + +GL+ N Y+ L T+PQ GI+
Sbjct: 217 -------DEEWLKEQIEKGQNDEFGVSIFILTRIGLDTCNQEYLKVLNDIMTYPQFQGIL 269
Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
GG P + YI+G IYLDPH VQ N ++E D S+Y I+ I + +DPS
Sbjct: 270 GGFPNKALYILGRVGNYYIYLDPHYVQNAQNY--QEMENDRSSYTCQSIQLIDSNQLDPS 327
Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT-VTQTH 444
+AI F C R K + NG F +T+TH
Sbjct: 328 MAISF-CVKNALDLLDLWRRLKQTKSENGESFFMALTETH 366
>gi|407408842|gb|EKF32115.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
Length = 357
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 125/277 (45%), Gaps = 35/277 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 204
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGNERLQELRAR 132
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAITSEGTKHGVKCGEWFGPTPIAKTLNAL----------- 177
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
MA Y+ +G E G V+ + + ++LL+P++LG+ +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEQVKELLRQSMHVVLLIPVMLGIRVI 227
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP + E
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHRVQPAFTSSGNSGEL 287
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+ R + S D S+ +GFY D F F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSFAVF 318
>gi|403216261|emb|CCK70758.1| hypothetical protein KNAG_0F00890 [Kazachstania naganishii CBS
8797]
Length = 448
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 157/363 (43%), Gaps = 68/363 (18%)
Query: 135 NGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------ 165
N +F +D +R+ +YR F PI G S I+
Sbjct: 38 NEKMQFYRDVCTRLNFTYRTKFVPISRSPDGPSPISFQLMIRDGPLSVIENALLHPDCFN 97
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
+D+GWGCM+R+ Q L+ AL R GR +R D +I+ F D+ +PFS+HN
Sbjct: 98 TDIGWGCMIRTGQSLLGNALQRLRHGREFRVTESTHDD----DIIQWFKDTPDAPFSLHN 153
Query: 226 LLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
++ G + + G W GP A RS ++L C + G+ I VS + ++
Sbjct: 154 FVKKGVELADMKPGQWFGPAATSRSIQSLI-CNFPQCGID-----HCIVSVSSADIYKQD 207
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
+ D S +L+L + LG+ VN Y +R S+GI
Sbjct: 208 VEDMFDADPDSN--------------LLILFGVKLGVSAVNASYWEDIRRLLNSKFSVGI 253
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
GG+P +S Y G Q + +Y DPH QP + DD A +T HS + L +DP
Sbjct: 254 AGGRPSSSLYFFGYQNQELLYFDPHTPQPSL---IDD--AAFNTCHSIEFGKLELRDMDP 308
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD 464
S+ IG + D++++ ++ E S +F + + + + DV + G D+
Sbjct: 309 SMLIGIMIEGERDWENW----ARFTETSK---IFNILEERSEDCINVDV--DIDGDENDE 359
Query: 465 SLG 467
++G
Sbjct: 360 NIG 362
>gi|340059839|emb|CCC54236.1| putative peptidase [Trypanosoma vivax Y486]
Length = 354
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 133/299 (44%), Gaps = 32/299 (10%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA-LLFHRLGRPWRKPLQKPFDREY 206
+ SYR GF P+ + T+DV WGC++R++QML+AQA + F G + RE
Sbjct: 69 LYFSYRCGFTPLSNGS-TTDVAWGCVVRAAQMLLAQAHMRFFNSGHAFVDGSALQILREK 127
Query: 207 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
V+ LF D ++PF IH + + YG+A G W G ++ +L + G G
Sbjct: 128 VQ--PLFLDDPSAPFGIHAMTSEAEKYGVACGQWFGMTPAAKTIASLCQQHSLRGGNG-- 183
Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 326
P + V D E V + SR ++LL+P VLGL++++
Sbjct: 184 --PAVLVFV----DREVSALKVRDLLSHSRQ-------------VVLLIPAVLGLDRISV 224
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
+Y L +G++GG+ ++ Y VG Q + IYLDPH Q E T
Sbjct: 225 KYSKMLIRCLEMESCIGVIGGRKSSALYFVGHQSNNIIYLDPHRAQRAFTEVASPGEL-T 283
Query: 387 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 445
+H + + + S+ GFY + F F A + A + PL +V + +
Sbjct: 284 GAWHL-----LPVTACSTSILFGFYIDSLESFKQFEADMLE-ANSALAFPLISVATSER 336
>gi|426387285|ref|XP_004060104.1| PREDICTED: cysteine protease ATG4D [Gorilla gorilla gorilla]
Length = 362
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 86/149 (57%), Gaps = 7/149 (4%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 183 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 242
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 243 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 300
Query: 426 SKLAEESNGA---PLFTVTQTHKKPVNHS 451
+++ S+ P+FT+ + H + +HS
Sbjct: 301 TRVLSSSSATERYPMFTLAEGHAQ--DHS 327
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 88 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 141
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR 195
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 142 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 169
>gi|119604525|gb|EAW84119.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_d
[Homo sapiens]
Length = 360
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 86/149 (57%), Gaps = 7/149 (4%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 181 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 240
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 241 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 298
Query: 426 SKLAEESNGA---PLFTVTQTHKKPVNHS 451
+++ S+ P+FT+ + H + +HS
Sbjct: 299 TRVLSSSSATERYPMFTLAEGHAQ--DHS 325
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 86 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 139
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR 195
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 140 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 167
>gi|123407417|ref|XP_001303004.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121884346|gb|EAX90074.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 298
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 81/304 (26%), Positives = 135/304 (44%), Gaps = 46/304 (15%)
Query: 151 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVE 208
+Y K F P+ T+D WGC +RS+Q L+ Q + L+ LG R P + +Y
Sbjct: 28 TYHKNFAPL-QGGFTTDKNWGCCIRSAQGLIMQFITKLYKHLGDDIRNIF--PTNSKY-- 82
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
LF D SPF + ++ ++YG+ G WV P + + + R
Sbjct: 83 --ELFYDLPHSPFGLPHICAELQSYGVMPGEWVKPSLLAPVIKEIMNFFRI--------- 131
Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
PVV + V ++ + P+LLL L+LG E +Y
Sbjct: 132 ------------------PVVIAEHGCLSREVLNEALSHNIPVLLLFTLMLGYENFELKY 173
Query: 329 IPTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
+P L+LT + QS+G+VGG+ G + +IVG Q+E +Y DPHDV +I K D +
Sbjct: 174 LPFLKLTLSLIYQSVGVVGGQQGKAYFIVGHQKEKLLYFDPHDVNE--SITKID---QIN 228
Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
++ + D++ S+ +GF+ + D ++ L +S P+ V + +
Sbjct: 229 QLFKPPLKVMPADTLSSSMLVGFFITNLQDAEEL----PMLLNQSGECPIHIVDKIEEAK 284
Query: 448 VNHS 451
H+
Sbjct: 285 ETHT 288
>gi|151556001|gb|AAI49850.1| ATG4D protein [Bos taurus]
Length = 359
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 86/149 (57%), Gaps = 7/149 (4%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
A+W +++LVP+ LG E +NP Y+P ++ LGI+GGKP S Y +G Q++ +Y
Sbjct: 180 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 239
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
LDPH QP +++ + D + ++H R + +DPS +GFY D+ +F+ C+
Sbjct: 240 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 297
Query: 426 SKLAEESNGA---PLFTVTQTHKKPVNHS 451
+++ S+ P+FT+ + H + +HS
Sbjct: 298 TRVLSSSSATERYPMFTLVEGHAQ--DHS 324
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
+S S I + +C + + E GD + F +DF SR+ ++YR+ F P+ +TSD
Sbjct: 85 TSFSKISSVHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAGGSLTSD 138
Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR 195
GWGCMLRS QM++AQ LL H L R ++
Sbjct: 139 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 166
>gi|66822477|ref|XP_644593.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|66822607|ref|XP_644658.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|74857708|sp|Q557H7.1|ATG4_DICDI RecName: Full=Cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|60472726|gb|EAL70676.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|60472781|gb|EAL70731.1| autophagy protein 4 [Dictyostelium discoideum AX4]
Length = 745
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 55/135 (40%), Positives = 81/135 (60%), Gaps = 5/135 (3%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++++PL LG +K+N YI L+L PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH VQ +N D ++TY + + + +DPSL+IGFYCRD+ F+D C R S
Sbjct: 563 PHFVQESVNPNSFDY---SNTYSGCIPQKMPFTQLDPSLSIGFYCRDQASFEDLCDRLSV 619
Query: 428 LAEESNGAPLFTVTQ 442
+ + P+ +V Q
Sbjct: 620 I--NNCEFPIISVCQ 632
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 197
F D +S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H P
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289
Query: 198 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 232
+KP Y ++L F D S+ + IH ++ +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326
>gi|28395487|gb|AAO39081.1| autophagy protein 4 [Dictyostelium discoideum]
Length = 745
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 55/135 (40%), Positives = 81/135 (60%), Gaps = 5/135 (3%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
W +++++PL LG +K+N YI L+L PQSLG +GGKP S Y +G Q++ IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
PH VQ +N D ++TY + + + +DPSL+IGFYCRD+ F+D C R S
Sbjct: 563 PHFVQESVNPNSFDY---SNTYSGCIPQKMPFTQLDPSLSIGFYCRDQASFEDLCDRLSV 619
Query: 428 LAEESNGAPLFTVTQ 442
+ + P+ +V Q
Sbjct: 620 I--NNCEFPIISVCQ 632
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 197
F D +S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H P
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289
Query: 198 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 232
+KP Y ++L F D S+ + IH ++ +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326
>gi|363754893|ref|XP_003647662.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891299|gb|AET40845.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
DBVPG#7215]
Length = 469
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 92/311 (29%), Positives = 137/311 (44%), Gaps = 52/311 (16%)
Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKI------------------------TSDVG 169
EF +D +SR+ +YR F PI G S + +D+G
Sbjct: 62 EFLKDVNSRLHFTYRTRFAPIPRHIDGPSPMRISILLRDNPLNVIENVLNNLDCFQTDIG 121
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
WGCM+R+ Q L+A AL LGR +R + ++I+ F D+ PFS+H +Q
Sbjct: 122 WGCMIRTGQSLLANALQLANLGRDFRISGSDSDINEVEMKIIRWFEDNPKHPFSLHKFVQ 181
Query: 229 AG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 287
G K G G W GP A+ RS +L C ++S D +
Sbjct: 182 EGYKLSGKKPGEWFGPSAISRSIRSLVMKFPGSGIDHC--------IISTD-------SA 226
Query: 288 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 347
V +D+ K LLL+ + LG++ N Y ++ + QS+GI GG
Sbjct: 227 DVYLDEIDPLFRANPKANV-----LLLLGVRLGVDFTNEYYWDDIKNILSSSQSVGISGG 281
Query: 348 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 407
+P +S Y G Q + YLDPH VQ + + + D E + H IHL +IDPS+
Sbjct: 282 RPSSSLYFFGYQGDYLFYLDPHKVQLNLALYESD-EERFHSVHPQTFNKIHLSAIDPSML 340
Query: 408 IGFYCRDKDDF 418
+GF +DD+
Sbjct: 341 LGFLLTGEDDW 351
>gi|255722127|ref|XP_002545998.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240136487|gb|EER36040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 444
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 91/312 (29%), Positives = 138/312 (44%), Gaps = 71/312 (22%)
Query: 146 SRILISYRKGFDPIGDSK----------------------------------ITSDVGWG 171
SR+ +SYR GFDPI ++ TSD GWG
Sbjct: 84 SRLWLSYRCGFDPIPKAEDGPQPIQFFPSIIFNKTTIYSNFANLKSLFDKENFTSDAGWG 143
Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AG 230
CM+R+SQ L+A LL P D + +++ LF D+++SPFSIHN ++ AG
Sbjct: 144 CMIRTSQNLLANTLL-----------QLLPPDSKQ-DVIGLFQDNQSSPFSIHNFIKVAG 191
Query: 231 KA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
++ + G W GP A S + L + + G + + I S DGE
Sbjct: 192 ESPLQVKPGQWFGPNAASLSIKRLTDTLQDKEIKGVKYPKVFISENSDLYDGEINEI--- 248
Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
+ +G++ +L+L P+ LG++KVN Y ++ S GI GGKP
Sbjct: 249 ----------LSEEGRS----VLVLFPIRLGIDKVNSYYYDSIFQVLKSKFSCGISGGKP 294
Query: 350 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 409
+S Y +G IY DPH Q V N + +YH+ +++ +DPS+ IG
Sbjct: 295 SSSFYFLGYDNSDLIYFDPHLPQLVEN------PINIESYHTRNYNRLNISLLDPSMMIG 348
Query: 410 FYCRDKDDFDDF 421
R DD+ +F
Sbjct: 349 ILLRSMDDYLEF 360
>gi|365758760|gb|EHN00587.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 485
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 128/310 (41%), Gaps = 57/310 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 76 EFLLDVRSRVNFTYRTRFVPIARAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 135
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R F RE I++ F D+ +PFS+HN +
Sbjct: 136 WGCMIRTGQSLLGNALQILHLGRDFRVDEDDDFRRE-SRIVNWFNDTPEAPFSLHNFVST 194
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS + L E G+ + V SG D
Sbjct: 195 GTELSDKRPGEWFGPAATARSIQYLIY-GFPECGINA----CIVSVSSG--DIYENEVEE 247
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
V +D+ + + IL L+ + LG+ VN Y ++ S+GI GG+
Sbjct: 248 VFVDNPN-------------SSILFLLGVKLGINAVNESYRESICGILNSAWSVGIAGGR 294
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ ++ H+ + L +DPS+ I
Sbjct: 295 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVNSCHTSKFGRLQLSEMDPSMLI 348
Query: 409 GFYCRDKDDF 418
G + + D+
Sbjct: 349 GVLIKGEKDW 358
>gi|410075557|ref|XP_003955361.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
gi|372461943|emb|CCF56226.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
Length = 463
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 129/312 (41%), Gaps = 67/312 (21%)
Query: 135 NGLAEFNQDF----SSRILISYRKGFDPIGDSK--------------------------- 163
N + NQDF +SR+ +YR F PI S
Sbjct: 52 NRNSNLNQDFLSDVNSRLAFTYRTKFQPILRSSEGPSPLNFRMIFRDNPINTLENVINNP 111
Query: 164 --ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
+D+GWGCM+R+ Q L+ AL +LGR +R L + EI+ F D+ PF
Sbjct: 112 DCFNTDIGWGCMIRTGQSLLGNALQLAKLGRHFR--LDNKMGIKDDEIISWFRDTTQEPF 169
Query: 222 SIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
SIH ++ G K G W GP A S ++L + E G+ + V SGD
Sbjct: 170 SIHKFVEKGNKLANKKPGEWFGPAATSISIQSLIE-EFPECGID----KCLVSVSSGD-- 222
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
+D R +F + + IL L+ + LGL+ VN Y +
Sbjct: 223 ---------IFEDDVRE--IFEENMD--SKILFLMGVKLGLDAVNSFYWEDILNILDSKF 269
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY---HSDVIRHI 397
S+GI GG+P +S Y G Q +Y DPH QP + D S Y H+ +
Sbjct: 270 SVGIAGGRPSSSLYFFGHQGNELLYFDPHRPQPSL--------VDPSVYETCHTTNFGKL 321
Query: 398 HLDSIDPSLAIG 409
+ +DPS+ IG
Sbjct: 322 DIKDMDPSMLIG 333
>gi|118390095|ref|XP_001028038.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89309808|gb|EAS07796.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 1216
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 87/362 (24%), Positives = 154/362 (42%), Gaps = 79/362 (21%)
Query: 142 QDFSSRILISYRKGFDPIGDSKI-------TSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
Q + + IL +YRK F P+ KI TSD GWGCM+R+ QM+ AQ + H +
Sbjct: 257 QIYQNTILFTYRKNFYPLLKDKINDPQKNQTSDAGWGCMIRAGQMIFAQTIKRHLKKTDY 316
Query: 195 RKP----------LQKPFDRE----YVEILHLFGDSETSPFSIHNLL-QAGKAYGLAAGS 239
+ L++ +E Y+ + P+SIH + +A Y + G
Sbjct: 317 IEQHQLINIIIGFLEEEEVQEGGKGYIFNQQSYIQDRIRPYSIHQITNRAFCKYKIQPGQ 376
Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED-----------GERGGAPV 288
W P + + L + + + G ++L + ++ S D+ G +G +
Sbjct: 377 WYTPNQIAIILKELHKKNKIK---GTENLKIDVH--SSDKPIIFEKILQTLLGRQGKINL 431
Query: 289 VC--------------IDDA------------SRHCSVFSKGQADWT------------- 309
C DD+ S + + + D T
Sbjct: 432 NCNHENQQSRNSINQDQDDSFEKIMPPNQQEIEEFSSQYEESKEDQTDNLCCKDCFKTDN 491
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+ LL+P LGL++++P +I L+ + QS+G++GGKP + Y +G + +YLDPH
Sbjct: 492 KLFLLLPCRLGLDEISPIHIEILKKLLSLKQSVGMIGGKPNKAHYFLGFVGDDLLYLDPH 551
Query: 370 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
++ + K+DL + S+Y + + + ++ I SL GFY D+ + F +L
Sbjct: 552 YIKECVR--KEDLMENISSYFEEDVFKMPINKISTSLVFGFYFSGVDELNKFYKFLRQLE 609
Query: 430 EE 431
+E
Sbjct: 610 KE 611
>gi|145510316|ref|XP_001441091.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408330|emb|CAK73694.1| unnamed protein product [Paramecium tetraurelia]
Length = 392
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 143/340 (42%), Gaps = 41/340 (12%)
Query: 129 GDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
DA + + Q S I SYRK S +TSD GWGCM+R +QM +AQ +
Sbjct: 46 NDADIEQRIEKVKQTCSKIIWFSYRKNIPKFQVSSLTSDTGWGCMIRVAQMALAQII--- 102
Query: 189 RLGRPWRKPLQ-----KPF----DREYVEILHLFGDSET----SPFSIHNLLQAGKA-YG 234
R ++KP Q + F D E + + F ++ +PFSI ++ K
Sbjct: 103 RYYNYFKKPEQLIVLIRHFIDDDDNELTDFIQQFHKNQNQYYHAPFSIQKIVHYAKVELK 162
Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----------VSGDEDGER 283
G W + ++ + L + + SL M IY+ + + +
Sbjct: 163 KEPGDWYKSDEILQTLDYLFKYSQY-------SLNMEIYINYDCAFILQDAIQQMFNQQE 215
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
G + + + +++ + F D I + +P +GL+ +N Y+ L P G
Sbjct: 216 GNE--IWLKERAKNNNQFDL--QDHKGICIFLPTRIGLQNINKDYLEVLNQIIALPYFQG 271
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
++GG + Y VG ++ IYLDPH VQ N DDL + ++Y I+ IH ID
Sbjct: 272 MIGGVSKRALYFVGRIQDYLIYLDPHFVQNAQNF--DDLSKNQASYTCQNIQLIHNSLID 329
Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
PS+ + R+ + D +E F++ +T
Sbjct: 330 PSIVVCLCIRNALELLDLWQIFQHFKQEYQDLFFFSLLET 369
>gi|407848120|gb|EKG03593.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
Clan CA, family C54, putative [Trypanosoma cruzi]
Length = 357
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG---RPWRKPLQKPFDR 204
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G R + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPRIGSERLQELRAR 132
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
MA Y+ +G E G ++ + + T ++LL+P++LG+ +
Sbjct: 178 -----MASYLATGGE-----GPVILAFPERQIFLEEVKELLRQSTHVVLLIPVMLGICVI 227
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP E
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 287
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+ R + S D S+ +GFY D F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSLSVF 318
>gi|302657364|ref|XP_003020406.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
verrucosum HKI 0517]
gi|291184236|gb|EFE39788.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
verrucosum HKI 0517]
Length = 398
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 104/235 (44%), Gaps = 48/235 (20%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK--------------------------ITSDVGWGC 172
+F DF S++ I+YR F PI + TSD GWGC
Sbjct: 185 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSSISLGVRLRSQLIDTQGFTSDTGWGC 244
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-K 231
M+RS Q L+A LLF RLGR WR+ + +E E++ LF D +PFSIH + G
Sbjct: 245 MIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGAT 301
Query: 232 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
A G G W GP A + +AL + + GL G + E+ V C
Sbjct: 302 ACGKCPGEWFGPSAASQCIQALVK-SNPQVGL------RVCITSDGSDIYEKQFKEVACD 354
Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
+ P L+L+ + LG+++V P Y +L+ FPQS+GI G
Sbjct: 355 ESG-----------GGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAG 398
>gi|111154179|gb|ABH07411.1| autophagin-2 [Trypanosoma cruzi]
Length = 351
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 204
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 68 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 126
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 127 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 171
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
MA Y+ +G E G V+ + + T ++LL+P++LG+ +
Sbjct: 172 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 221
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP E
Sbjct: 222 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 281
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+ R + S D S+ +GFY D F
Sbjct: 282 TCAR------RVLPTTSYDTSMTLGFYISSLDSLALF 312
>gi|71415152|ref|XP_809652.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70874068|gb|EAN87801.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
Length = 357
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 204
+ SYR P+ + T+D+ WGCM+R+ QM++A A + + G P + LQ+ R
Sbjct: 74 LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 132
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
LF D ++PF IH + G +G+ G W GP + ++ AL
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
MA Y+ +G E G V+ + + T ++LL+P++LG+ +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 227
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
+ +Y ++ S+GI+GGK ++ ++ G Q++ +LDPH VQP E
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 287
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+ R + S D S+ +GFY D F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSLALF 318
>gi|330840249|ref|XP_003292131.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
gi|325077656|gb|EGC31355.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
Length = 603
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 83/168 (49%), Gaps = 38/168 (22%)
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
P+L+L+P+ LGL+ +N Y +L F FPQ+LG+VGGKP AS Y + VQ+++ YLDPH
Sbjct: 371 PLLILIPMRLGLDGLNSIYYQSLLEIFKFPQNLGVVGGKPRASLYFIAVQDDNLFYLDPH 430
Query: 370 DVQPVINIGKDDLEAD-------------------------------------TSTYHSD 392
VQ I+I + E +T+
Sbjct: 431 TVQNHIDINNSNGEPSNFSFSSSPSSSNINIINTNNNNNNNNNNDKNNNNSFPVNTFFCS 490
Query: 393 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
+ H+ +DPSL + F+C+ + DFDDF R+ +A + P+F++
Sbjct: 491 QTKRTHVSEVDPSLVVAFFCKSRSDFDDFVDRSKAMASQMEN-PIFSI 537
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 74/126 (58%), Gaps = 1/126 (0%)
Query: 130 DAAGNNGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
D G + + EF +DF++R+L +YR+GF I +++ +D GWGCMLRS QML++ LL H
Sbjct: 129 DIPGQSFIKEFLEDFTTRVLWFTYRQGFPFIDNTQYDNDCGWGCMLRSGQMLLSNLLLHH 188
Query: 189 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
LG W+K Y I+ +F D ++PFSIHN+ G+ G G W P + +
Sbjct: 189 ALGDDWKKSSNSTHPDVYNNIISMFLDKPSAPFSIHNIALEGQTLGKNIGEWFAPSIISQ 248
Query: 249 SWEALA 254
+ ++L
Sbjct: 249 AIKSLV 254
>gi|207341865|gb|EDZ69806.1| YNL223Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 371
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 124/302 (41%), Gaps = 57/302 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
EF D SR+ +YR F PI + +D+G
Sbjct: 97 EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215
Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G G W GP A RS ++L G + I VS + E V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLI------YGFPECGIDDCIVSVSSGDIYENEVEKV 269
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y G Q ++ DPH QP + ++ + H+ + L +DP ++
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPRCSL 369
Query: 409 GF 410
F
Sbjct: 370 VF 371
>gi|154419947|ref|XP_001582989.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121917228|gb|EAY22003.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 284
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 113/261 (43%), Gaps = 39/261 (14%)
Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 211
YR + +S +T+D GWGC RS+Q L+ Q +L +L R +R + F + V L
Sbjct: 25 YRYNLSDLANSLLTTDKGWGCCFRSTQGLLCQYIL--KLHRKFRSLYDQVFGQN-VNPLD 81
Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
LF D ++PF I NL + A GL G W P M A + L C
Sbjct: 82 LFLDIPSAPFGIQNLTKNAFAIGLPVGEWAKPSIM----AATIKLIFDTLNLSC------ 131
Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
++S D + +H P L+L+P + GL K++ Y+
Sbjct: 132 --IISQDLTLDSNDI---------KHTKY---------PALILIPSLFGLSKMDDSYLSF 171
Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
L L SLG V G+ ++ Y VG E Y DPH + + + ++
Sbjct: 172 LLLCLCIESSLGFVSGQNASAYYFVGFDLEDFYYFDPHVTKEAV------VSPPYDSFFD 225
Query: 392 DVIRHIHLDSIDPSLAIGFYC 412
++ + +SI+PS+ +GFYC
Sbjct: 226 LELKSMKKESINPSVLLGFYC 246
>gi|119623099|gb|EAX02694.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_e
[Homo sapiens]
Length = 231
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 102/226 (45%), Gaps = 61/226 (26%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 29 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + +
Sbjct: 78 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEK---- 133
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
MCR + +S D G+R + +
Sbjct: 134 -------------MCR-----------------------VLPLSADTAGDRPPDSLTASN 157
Query: 293 DA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
+ S +CS W P+LL+VPL LG+ ++NP Y+ ++T
Sbjct: 158 QSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKVT 196
>gi|159128081|gb|EDP53196.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
fumigatus A1163]
Length = 226
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 71/118 (60%), Gaps = 3/118 (2%)
Query: 304 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
G+ + P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ
Sbjct: 20 GRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHL 79
Query: 364 IYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
YLDPH +P + NI + + TYH+ +R IH+ +DPS+ IGF +D++D+
Sbjct: 80 FYLDPHQTRPALPQRNIDDPYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137
>gi|71000771|ref|XP_755067.1| autophagy cysteine endopeptidase Atg4 [Aspergillus fumigatus Af293]
gi|66852704|gb|EAL93029.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
fumigatus Af293]
Length = 226
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 71/118 (60%), Gaps = 3/118 (2%)
Query: 304 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
G+ + P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ
Sbjct: 20 GRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHL 79
Query: 364 IYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
YLDPH +P + NI + + TYH+ +R IH+ +DPS+ IGF +D++D+
Sbjct: 80 FYLDPHQTRPALPQRNIDDPYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137
>gi|257205644|emb|CAX82473.1| autophagy-related cysteine endopeptidase 2 [Schistosoma japonicum]
Length = 632
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 68/118 (57%), Gaps = 4/118 (3%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
++W P+LL VPL LGL NP Y ++ F P +GI+GG P + +IVGV + I
Sbjct: 385 SNWRPLLLFVPLRLGLHNPNPCYFNAIKAVFRLPNCIGILGGSPCHAVWIVGVTGDDVIC 444
Query: 366 LDPHDVQPVINIGKDDLEAD-TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
LDPH QP G+ +L+ D TYH + + L +DPS+ +GF C + +FDD C
Sbjct: 445 LDPHTTQPA---GRGNLKPDYDQTYHCENPIRMPLKRLDPSMVLGFLCSTEKEFDDLC 499
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 196
E SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR WR
Sbjct: 43 EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
P Q+ EY +L +F D + +SI + G + G + GSW GP + + + L+
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160
Query: 257 QR 258
R
Sbjct: 161 DR 162
>gi|123479730|ref|XP_001323022.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121905878|gb|EAY10799.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 284
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 71/274 (25%), Positives = 120/274 (43%), Gaps = 39/274 (14%)
Query: 141 NQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
NQ + YR F I +S ++ D GWGC RSSQ LV Q +L RL + +
Sbjct: 14 NQILAEIPRFCYRNNFQAIENSTLSCDSGWGCCFRSSQGLVCQYIL--RLHKNFPDLYNS 71
Query: 201 PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
F + L LF D +PF I N++ + GL G+W P + +++++ +
Sbjct: 72 TFGID-KNPLDLFLDIPEAPFGIQNIVTHANSLGLPIGNWAKPSIIASAYKSIFQ----S 126
Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 320
L C +V D ++ + ++ P+L+L+P + G
Sbjct: 127 LHLNC--------IVPQDSTF------------------IYEELESTNYPVLILIPGLFG 160
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
LEK+ YI + L+ SLG V G ++ Y +G + Y DPH + +
Sbjct: 161 LEKIEKPYISFIFLSLCMNSSLGFVSGHNDSAFYFIGFDSDYFYYFDPHVTKQALTGPPY 220
Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
D + ++ + +++I+PS+ +GFYC D
Sbjct: 221 DSLFELK------LKSMKIENINPSVLLGFYCDD 248
>gi|261335715|emb|CBH18709.1| peptidase, putative [Trypanosoma brucei gambiense DAL972]
Length = 348
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 148/344 (43%), Gaps = 49/344 (14%)
Query: 136 GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
G AE + + ++L SYR F+P+ + T+D+GWGC +R+ QM++A AL+ ++ G
Sbjct: 37 GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93
Query: 195 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
F+ V L HLF D ++PF IH + G +G GSW GP +
Sbjct: 94 ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
AL M Y+ SG + G V+ + D K
Sbjct: 150 MGAL----------------MEDYLSSGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+LLL+P++LG ++ Y L+ ++G VGGK G++ + +G Q + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248
Query: 370 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
Q +DT S + L S S+ +GFY D F F
Sbjct: 249 YAQSAFTC------SDTQGKISGEWYTLPLTSCSTSVLLGFYIHSPDSFSQFTGD----I 298
Query: 430 EESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 473
+++N + +F + + V SD +G + D ++S D
Sbjct: 299 KDANSSLIFPLIE-----VTTSDCVGHIFSEDDPDVCSLVSFGD 337
>gi|444730159|gb|ELW70550.1| Cysteine protease ATG4A [Tupaia chinensis]
Length = 364
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 143/352 (40%), Gaps = 98/352 (27%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
L EF D + I ++ G + +SD GWGCMLR QM++AQAL+ LGR
Sbjct: 24 LEEF-PDTDELVWILGKQHLLKTGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRA--- 79
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
Q G G + G W GP + + + LA
Sbjct: 80 -------------------------------QMGVGEGKSIGEWFGPNTVAQVLKKLALF 108
Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF--------------- 301
+ +A+YV + V I+D + C V
Sbjct: 109 DEWNS--------LAVYVSMDN---------TVVIEDIKKMCCVLPLSADTDTESPPDSP 151
Query: 302 -----SKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
SKG + W P+LL+VPL LG+ ++NP Y+ +L + + IV +
Sbjct: 152 TASNQSKGPSACGSAWKPLLLIVPLRLGINQINPVYVDAFKLQASC-HPILIVTKEGVRR 210
Query: 353 TYIVGVQEESA--------------------IYLDPHDVQPVINIGKDDLEADTSTYHSD 392
T I+ ++ S I+LDPH Q ++ ++ + D + +
Sbjct: 211 TRILPPKDSSGARASESLKVKHVSFKTGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQ 270
Query: 393 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ +++ ++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 271 SPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 321
>gi|367014015|ref|XP_003681507.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
gi|359749168|emb|CCE92296.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
Length = 460
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/301 (30%), Positives = 134/301 (44%), Gaps = 56/301 (18%)
Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 169
+F D SR+ +YR F PI G S ++ +D+G
Sbjct: 60 QFLSDVHSRLHFTYRTKFVPIPRVSDGPSPLSFHFLIRENPLTTIENAIYNPDCFNTDIG 119
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+R+ Q L+ AL LGR +R + + D+E +I+ F D+ + FSIHN +
Sbjct: 120 WGCMIRTGQSLLGNALQIANLGRDFR--VNQGKDQEEYKIIDWFADTPQAHFSIHNFVSQ 177
Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
G K G W GP A RS + L Q + G+ I V SGD
Sbjct: 178 GLKLSNKKPGEWFGPAATSRSIQCLVE-QFPDCGID----KCLISVSSGD---------- 222
Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
+D R +F+ Q + ILLL+ + LG+ VN Y ++ T S+GI GG+
Sbjct: 223 -VFEDEVRE--IFA--QKPQSRILLLLGVKLGVNAVNEYYWDDVKKTLGSKFSVGIAGGR 277
Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
P +S Y +G Q IY DPH QP + + + T H+ + L +DPS+ I
Sbjct: 278 PSSSLYFMGFQGNELIYFDPHTPQPSLQTSANFYD----TCHALNFGKLLLSDLDPSMLI 333
Query: 409 G 409
G
Sbjct: 334 G 334
>gi|444726263|gb|ELW66801.1| Cysteine protease ATG4C [Tupaia chinensis]
Length = 378
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 137/386 (35%), Gaps = 130/386 (33%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
AGN + EF +DF SRI ++YR+ F PI S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45 AGN--VEEFRRDFISRIWLTYREEFPPIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102
Query: 192 RPWRKP----------------------------------LQKPFD--REYVE------- 208
R W P L+ P +E +E
Sbjct: 103 RAWTWPDALNIENSDSESWTSHTVKKFTASVEASLSGERELKTPTISLKETIEKYSDDHE 162
Query: 209 ---------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
I+ FGDS + F +H L++ GK G AG W GP + R
Sbjct: 163 IRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
G + IYV V D + + + AD +++LVP+ L
Sbjct: 223 PDLQG-----ITIYVAQ--------DCTVYSSDVIDKQRTAMTADNADDKAVIILVPVRL 269
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G E+ N Y+ ++ TF P + K +DP
Sbjct: 270 GGERTNTDYLEFVK-TFHCPSPKKMSFRK-----------------MDP----------- 300
Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 437
S IGFYCR+ DF +K+ S+ PL
Sbjct: 301 -------------------------SCTIGFYCRNIQDFKRASEEITKMLTISSKEKYPL 335
Query: 438 FTVTQTHKK-------PVNHSDVLGE 456
FT H + N D+ E
Sbjct: 336 FTFVNGHSRDYDFTSTTTNEEDLFSE 361
>gi|119493442|ref|XP_001263911.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
gi|119412071|gb|EAW22014.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
Length = 179
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 45/114 (39%), Positives = 70/114 (61%), Gaps = 3/114 (2%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
+ P L+L+ LG++++ P Y ++ T PQS+GI GG+P AS Y VGVQ YLD
Sbjct: 24 FRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHLFYLD 83
Query: 368 PHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
PH +P + NI + + + TYH+ +R IH+ +DPS+ IGF +D++D+
Sbjct: 84 PHQTRPALPQRNIDERYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137
>gi|74026240|ref|XP_829686.1| peptidase [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|70835072|gb|EAN80574.1| peptidase, putative [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 348
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 92/344 (26%), Positives = 148/344 (43%), Gaps = 49/344 (14%)
Query: 136 GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
G AE + + ++L SYR F+P+ + T+D+GWGC +R+ QM++A AL+ ++ G
Sbjct: 37 GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93
Query: 195 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
F+ V L HLF D ++PF IH + G +G GSW GP +
Sbjct: 94 ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
AL M Y+ +G + G V+ + D K
Sbjct: 150 MGAL----------------MEDYLRNGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+LLL+P++LG ++ Y L+ ++G VGGK G++ + +G Q + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248
Query: 370 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
Q +DT S + L S S+ +GFY D F F
Sbjct: 249 YAQSAFTC------SDTQGKISGEWYTLPLTSCSTSVLLGFYIHSPDSFSQFTGD----I 298
Query: 430 EESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 473
+++N + +F + + V SD +G + D ++S D
Sbjct: 299 KDANSSLIFPLIE-----VTTSDCVGHIFNEDDPDVCSLVSFGD 337
>gi|240274226|gb|EER37743.1| cysteine protease atg4 [Ajellomyces capsulatus H143]
Length = 454
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 76/146 (52%), Gaps = 4/146 (2%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
D P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q Y
Sbjct: 245 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFY 304
Query: 366 LDPHDVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
LDPH +P + + + +TYH+ +R +H+ +DPS+ IGF RD+DD++ +
Sbjct: 305 LDPHHTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSW 364
Query: 422 CARASKLAEESNGAPLFTVTQTHKKP 447
A G + V K P
Sbjct: 365 KRSVHNRAMIGTGKAIIHVFDKEKSP 390
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 50/119 (42%), Gaps = 24/119 (20%)
Query: 101 PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 156
P+R+ S++ LL H+ + LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144
Query: 157 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
DP + T+D GWGCM+RS Q L+A AL LGR R
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRACR 203
>gi|145500634|ref|XP_001436300.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403439|emb|CAK68903.1| unnamed protein product [Paramecium tetraurelia]
Length = 406
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/397 (23%), Positives = 163/397 (41%), Gaps = 67/397 (16%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
I++LG H+I D+ + + + Q I I+YR+ + P+ S SD GWGC
Sbjct: 38 IYILG--HRIDIDQF----EIEDRINKIKQLVQETIWITYRRNYPPLYQSNYISDTGWGC 91
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS------------- 219
MLR QM +AQ L H ++ D +Y I+ F D+++
Sbjct: 92 MLRVGQMAMAQMLKKHLKNHGDKR------DEDYDNIILAFADNDSQENKEFIEFQNSKD 145
Query: 220 ---------PFSIHNL-LQAGKAYGLAAGSWVGP-YAM-----------CRSWEALARCQ 257
PFSI + A K + L G W P Y + R+ E L
Sbjct: 146 KQKAHNFICPFSIQKIAYLAKKEFNLDPGEWYRPNYILFLLELLHNTIPIRASENLKLSV 205
Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 317
++ L L ++ + D + +++ + K + + V
Sbjct: 206 FNDSCLFLDQLMNRMFEAKFETDKD--------LEEQLEKTQLIGKN-----SLAIFVLT 252
Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
+GL++ N +Y+ L P GIVGG P + YI+G + +YLDPH VQ N
Sbjct: 253 RIGLDEPNQKYLKILDEIMELPYFQGIVGGTPKRAFYILGKINDHYLYLDPHYVQEAEN- 311
Query: 378 GKDDLEADT----STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 433
KD + + ++Y I ++ +D S+ + FY R++ + F ++ + S+
Sbjct: 312 -KDQINENKMFNRTSYSCKNIHLLNQKHVDTSMGLSFYIRNQSELLQFWRNMKQIKQSSD 370
Query: 434 GAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMS 470
+F ++ + + V++S L E+ DD + +
Sbjct: 371 DFFIF-LSDSAPEYVDYSGQLEESSNKLNDDDVVFLQ 406
>gi|194374239|dbj|BAG57015.1| unnamed protein product [Homo sapiens]
Length = 259
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 114/272 (41%), Gaps = 56/272 (20%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + Q G
Sbjct: 1 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 61 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 103
Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D + C V P S G +P S
Sbjct: 104 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 126
Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
Q + I+LDPH Q ++ ++ D + + + +++ ++DPS+A+GF+C
Sbjct: 127 -LTASNQGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 185
Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+++ DFD++C+ K + N +F + Q H
Sbjct: 186 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 216
>gi|401425377|ref|XP_003877173.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322493418|emb|CBZ28705.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 394
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)
Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRAFKAEYWSPSQGC---EAIKRTMQG--AVKT 148
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLNEGPSAA 255
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
+ T +R +H +D SL + F +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289
>gi|123397031|ref|XP_001301012.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121882136|gb|EAX88082.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 297
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 109/223 (48%), Gaps = 33/223 (14%)
Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
+Y KGF P+ T+D WGC +RS Q L+ Q + +L + + ++ F
Sbjct: 27 FTYHKGFSPLAGG-YTTDKNWGCCIRSGQGLLMQFV--SKLYQLYGDKIKNIFPNG--SK 81
Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
LF D +PF IH + + + +G+ AG WV P + ++ L
Sbjct: 82 FELFFDHPQAPFGIHCICRELETFGVKAGEWVKPSMLAPVFKDLLSF------------- 128
Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 329
I+VV E+G C+ S S G P+LLL L+LG + + +Y+
Sbjct: 129 FGIHVVIA-ENG--------CLSRESLR-EALSYGH----PVLLLFTLMLGYKDFDLKYL 174
Query: 330 PTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
P LRLT + QS+G+VGG+ G + Y+VG Q+E+ +Y DPH+V
Sbjct: 175 PFLRLTLSLIYQSVGVVGGQQGKAYYLVGHQKENLLYFDPHEV 217
>gi|154281231|ref|XP_001541428.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150411607|gb|EDN06995.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 463
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 76/146 (52%), Gaps = 4/146 (2%)
Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
D P L+L+ + LG+++V P Y L+ +PQS+GI GG+P +S Y +G Q Y
Sbjct: 253 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQASHFFY 312
Query: 366 LDPHDVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
LDPH +P + + + +TYH+ +R +H+ +DPS+ IGF RD+DD++ +
Sbjct: 313 LDPHHTRPALAYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSW 372
Query: 422 CARASKLAEESNGAPLFTVTQTHKKP 447
A G + V K P
Sbjct: 373 KRSVHNGAMIGTGKAIIHVFDKEKSP 398
>gi|367008068|ref|XP_003688763.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
gi|357527073|emb|CCE66329.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
Length = 356
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 117/262 (44%), Gaps = 37/262 (14%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 224
TSD+GWGCM+R+ Q L+A AL G P EI+ LF D +PFSIH
Sbjct: 85 TSDIGWGCMIRTGQTLLANALQRTNKGTPCS------------EIIELFVDETKNPFSIH 132
Query: 225 NLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
N + GK L G W P + E L C + + SGD +
Sbjct: 133 NFITVGKDLNLVKVGEWFSPSITIQIIEKLIENNNDHGIKKC-----IVSISSGDIYEQ- 186
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN-PRYIPTLRLTFTFPQSL 342
+ +DD+ + +K Q ILLL + LG+ +N +Y ++ +
Sbjct: 187 --DVLDELDDSEPPAN--TKQQH----ILLLFGIKLGINTINIEKYGQDIKDITNNKYTC 238
Query: 343 GIVGGKPGASTYIVGVQE--ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
GI GG+P +S + G + +Y DPH N D+ D STYHS + +
Sbjct: 239 GISGGQPKSSLFFFGYNNTHDRILYFDPHKPN---NFTTDN---DYSTYHSTEFNELEMF 292
Query: 401 SIDPSLAIGFYCR-DKDDFDDF 421
++DPS+ IGF + +K D++ F
Sbjct: 293 NLDPSMIIGFLVKNNKADWNKF 314
>gi|157872135|ref|XP_001684616.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
gi|68127686|emb|CAJ05824.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
Length = 394
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 121/274 (44%), Gaps = 33/274 (12%)
Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+ET+PFSIHN++++ + + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRVFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLCEGLSAA 255
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
+ T +R +H +D SL + F +D++
Sbjct: 256 ASVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289
>gi|146093458|ref|XP_001466840.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
gi|134071204|emb|CAM69889.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
Length = 394
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)
Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLSEGPSAA 255
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
+ T +R +H +D SL + F +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289
>gi|398019156|ref|XP_003862742.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
gi|322500973|emb|CBZ36050.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
Length = 394
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)
Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
++ +YR GF+ P I +D GWGC+LR+SQML+A L H GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+ET+PFSIHN++++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ L + VV+ CI D +H F +G AD +L V + +
Sbjct: 149 EQLQTRVMVVTSANG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
Y+ +L PQ LG+VGG PG S Y + YLDPH + + A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLSEGPSAA 255
Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
+ T +R +H +D SL + F +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289
>gi|148693227|gb|EDL25174.1| autophagy-related 4D (yeast), isoform CRA_c [Mus musculus]
Length = 257
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 80/176 (45%), Gaps = 44/176 (25%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISTVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
+TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP 249
>gi|400593108|gb|EJP61110.1| peptidase family C54 [Beauveria bassiana ARSEF 2860]
Length = 378
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 115/315 (36%), Gaps = 118/315 (37%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGW 170
N +F DF SR ++YR F PI SK +SD GW
Sbjct: 109 NGWPQQFITDFDSRFWMTYRNDFKPIPRSKDPKAASSMSFPMRIKYQLGDQGGFSSDSGW 168
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
GCM+RS Q L+A A RLGR WR+ QK E ++I+ +F D +P+SIHN + G
Sbjct: 169 GCMIRSGQSLLANATGIVRLGRDWRRGQQK---AEEIKIMRMFADDPAAPYSIHNFVDYG 225
Query: 231 KAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
+ G G W GP A +
Sbjct: 226 SSKCGKYPGEWFGPSATSQ----------------------------------------- 244
Query: 290 CIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
CI+ S + ++D + P L+L+ LG++K+ Y L PQS+GI G
Sbjct: 245 CINPDVYEDSFMATAKSDHGFFKPTLILISTRLGIDKITQVYWEALISALQMPQSVGIAG 304
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 406
+R +H+ +DPS+
Sbjct: 305 -----------------------------------------------LRRLHVQQMDPSM 317
Query: 407 AIGFYCRDKDDFDDF 421
IGF R ++++ ++
Sbjct: 318 LIGFIIRSEEEWKEW 332
>gi|47213810|emb|CAF92583.1| unnamed protein product [Tetraodon nigroviridis]
Length = 265
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 304 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
W +++LVP+ LG E +NP YI ++ +GI+GGKP S Y +G Q+E
Sbjct: 151 AHQSWQSVIILVPVRLGGESLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFIGFQDEQL 210
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
+YLDPH QPV+++ + + + ++H + + + +DPS IGFY + K DF+
Sbjct: 211 LYLDPHYCQPVVDVSQVNFSLE--SFHCNSPKKMPFSRMDPSCTIGFYAKSKKDFE 264
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 12/132 (9%)
Query: 65 ASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQ 124
A A +N GWT VK T + + +LG S S T L +C ++
Sbjct: 14 AKLMSAWNNVKYGWT--VKSKTTFNKLSPV--TILGHSYLLNSEGT----LFFICLILSS 65
Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
L + + F F SRI ++YRK F P+ S +T+D GWGCMLRS QML+AQ
Sbjct: 66 FCCLN----LDEVERFRLAFVSRIWLTYRKDFPPLEGSTLTTDCGWGCMLRSGQMLLAQG 121
Query: 185 LLFHRLGRPWRK 196
LL H + R +++
Sbjct: 122 LLVHLMHRVYKE 133
>gi|255711728|ref|XP_002552147.1| KLTH0B08272p [Lachancea thermotolerans]
gi|238933525|emb|CAR21709.1| KLTH0B08272p [Lachancea thermotolerans CBS 6340]
Length = 483
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 85/257 (33%), Positives = 118/257 (45%), Gaps = 34/257 (13%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
SD+GWGCM+R+ Q L+ AL RL P P +K +++ F D ++PFS+
Sbjct: 144 FCSDIGWGCMIRTGQALLGNALA--RLRSP---PEEK-------QLIGWFEDRSSAPFSL 191
Query: 224 HNLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
HN ++ G A G W GP A RS ++L + GL I SGD E
Sbjct: 192 HNFVREGNALSRKPPGEWFGPSATSRSIQSLVHA-FPQCGLNH----CIISTDSGDVYEE 246
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
G P++ + ILLL+ + LGL VN RY P ++ S+
Sbjct: 247 DVG-PIL--------------EREPQATILLLLGVKLGLNNVNSRYWPDVKHILGSSFSV 291
Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 402
GI GG+P +S Y G Q + YLDPH Q + D E S HS +H +
Sbjct: 292 GIAGGRPSSSLYFFGYQGDYLFYLDPHTSQLDLASCATDNEKYESV-HSARFNKVHFSEL 350
Query: 403 DPSLAIGFYCRDKDDFD 419
DPS+ IG + DD+D
Sbjct: 351 DPSMLIGVLIQGLDDWD 367
>gi|431896953|gb|ELK06217.1| Cysteine protease ATG4C [Pteropus alecto]
Length = 378
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 134/386 (34%), Gaps = 130/386 (33%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
AGN + EF +DF SRI ++YR+ F I S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45 AGN--VEEFRKDFISRIWLTYREEFPSIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102
Query: 192 RPWRKP----------------------------------LQKPF------------DRE 205
R W P L+ P D E
Sbjct: 103 RAWTWPDALNIDNSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGRYSDDHE 162
Query: 206 YV-EILH-----LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
EI H FGDS + F +H L++ GK G AG W GP + R
Sbjct: 163 MQNEIYHRKIISWFGDSPLALFGLHQLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
G + IYV V D + C+ + D +++LVP+ L
Sbjct: 223 PELQG-----ITIYVAQ--------DCTVYSSDVIDKQCASMAPDITDDKAVIILVPVRL 269
Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
G E+ N Y+ ++ TF P + K +DP
Sbjct: 270 GGERTNIDYLEFVK-TFHCPSPKKMSFRK-----------------MDP----------- 300
Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE--ESNGAPL 437
S IGFYCR+ DF +K+ + PL
Sbjct: 301 -------------------------SCTIGFYCRNVQDFKRASEEITKMLKVFSKEKYPL 335
Query: 438 FTVTQTHKK-------PVNHSDVLGE 456
FT H + N D+ E
Sbjct: 336 FTFVNGHSRDYDFTSTTTNEEDLFSE 361
>gi|384493397|gb|EIE83888.1| hypothetical protein RO3G_08593 [Rhizopus delemar RA 99-880]
Length = 194
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 78/158 (49%), Gaps = 27/158 (17%)
Query: 113 IWLLGVCHKI--------AQDEALGDAAGNNGLA----------------EFNQDFSSRI 148
IWLLG + I A EA D N G + +F DF+SR+
Sbjct: 29 IWLLGCSYIIKPTDHIQQALLEAQRDLMFNKGSSENEEENNQNMHMLWPPDFYDDFTSRL 88
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-KPFDREYV 207
++YR + PI S +D+GWGC LRS Q L+A L+ H LGR WR+ Q + ++Y
Sbjct: 89 WMTYRHNYPPIRPSSHKTDIGWGCTLRSGQSLLANTLIIHFLGRDWRRQTQNQAAWKQYS 148
Query: 208 EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 243
I+H F D S +PFSIH + GK G G W GP
Sbjct: 149 RIVHWFLDELSPRAPFSIHRIALLGKQLGKNIGEWFGP 186
>gi|312378951|gb|EFR25375.1| hypothetical protein AND_09326 [Anopheles darlingi]
Length = 350
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 70/130 (53%), Gaps = 3/130 (2%)
Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG-- 378
L +VNP YI L+ F P S G++GG+P + Y +G E A+YLDPH VQ V IG
Sbjct: 180 LNEVNPIYIEGLKKCFQLPGSCGMIGGRPNQALYFIGYVGEEALYLDPHTVQRVGCIGEK 239
Query: 379 KDDLEADT-STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 437
++ +E + +T+H I S+DPSLA+ F C + FD A + PL
Sbjct: 240 QESVEQEQDATFHQRHASRIAFASMDPSLAVCFLCCSRAQFDQLVAHFKERLNGGGSQPL 299
Query: 438 FTVTQTHKKP 447
F VT+T + P
Sbjct: 300 FEVTKTRQAP 309
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 67/103 (65%), Gaps = 2/103 (1%)
Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
QD SR+ +YR+GF PIG++++T+D GWGCMLR QM++A+AL LGR W+ ++
Sbjct: 72 QDVQSRLWCTYRRGFVPIGNTQLTTDKGWGCMLRCGQMVLAEALTELHLGRDWQWS-EET 130
Query: 202 FDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGP 243
D Y++I++ F D++ +PFS+H + L + G W GP
Sbjct: 131 RDATYLKIVNRFEDNKQAPFSLHQIALMGDSSEEKRIGEWFGP 173
>gi|123497568|ref|XP_001327207.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121910133|gb|EAY14984.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 296
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/271 (25%), Positives = 117/271 (43%), Gaps = 54/271 (19%)
Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY--- 206
+YR F I ITSD GWGC RS+Q L+A L + P D EY
Sbjct: 30 FTYRCNFQAIQPGNITSDSGWGCCYRSAQGLIASYFLNY-----------APVDAEYFFT 78
Query: 207 ----VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
+ + LF D PFSI NL+ + +G+ G+W P + + E++ +
Sbjct: 79 VFNEIPMFSLFEDRVEMPFSIQNLVYRSELFGVKPGTWAKPSQLAATIESIFK------- 131
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
L +++ ++S D + ++ D + +LG++
Sbjct: 132 ----DLKLSV-LISKDSN-------IIPEDVKTMRAPFLLLIPI-----------LLGMK 168
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE-ESAIYLDPHDVQPVINIGKDD 381
V ++IP ++ TF P+ LG V G S ++VG+ E ++ +Y DPH + +
Sbjct: 169 DVEQKFIPFIKYTFQRPEFLGAVSGSSDFSYFLVGLSEDQNVVYFDPHVTKQAVASS--- 225
Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
D S + R I + S++PS +GF+C
Sbjct: 226 --FDHSEFFEVPPRGIKMKSLNPSFLLGFFC 254
>gi|118349810|ref|XP_001008186.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89289953|gb|EAR87941.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 343
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 72/293 (24%), Positives = 125/293 (42%), Gaps = 37/293 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
I SYR GF + I SD GWGCMLRS QM+ A LL H P +Q + +
Sbjct: 27 IYFSYRSGFSHQFQNHIFSDSGWGCMLRSGQMIFANGLLRHLKENP---QIQNQLKIQNI 83
Query: 208 E-----ILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 261
I+ F +++ PFSI + A + + L G W P + S + L + +
Sbjct: 84 NDILLFIIKFFIENKDQPFSIQQIAAVALEEFKLEMGFWYSPNRIAYSLKKLLNNFQTFS 143
Query: 262 GLGCQS------LPMAIYVVSGDEDGERGGAPV------VCIDDASRHCSVFSKGQADWT 309
+ S P+ G++ + + + I++ + + + +
Sbjct: 144 EMNIVSEVMYSDRPLYFSQCVTAMTGQKIDSTLPKQLLQILINNIEKQIKIMKQNSNKYQ 203
Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
+++GL+ +Y+ L FT S+G ++G+ + YLDPH
Sbjct: 204 INKQNYKILIGLDYPEEKYLDILIKLFTHRLSIG-----------MIGLNNDKLTYLDPH 252
Query: 370 DVQPV-INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
VQ IN E + TY + ++ I+ ++ PS+ +GFY +D +D ++F
Sbjct: 253 IVQHADINTN----EINLKTYFQEEVKQINKHALGPSVGLGFYLKDLNDLNEF 301
>gi|291059129|gb|ADD71908.1| autophagy protein 4 [Acanthamoeba castellanii]
Length = 373
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 68/117 (58%), Gaps = 4/117 (3%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
F DF SR+ ++YR F IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR +
Sbjct: 147 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 206
Query: 200 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEAL 253
+ Y E+L F D S SP+SIH + + G + + G W P + + L
Sbjct: 207 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLL 262
>gi|395750455|ref|XP_002828707.2| PREDICTED: cysteine protease ATG4D [Pongo abelii]
Length = 296
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 110/248 (44%), Gaps = 44/248 (17%)
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
+R + +I+ F D +PF +H L++ G++ G AG W GP +A R
Sbjct: 51 ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILRKAVE 103
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
+ + +YV S+ C+V L + L +
Sbjct: 104 SCSEVTRLVVYV--------------------SQDCTV-----------LHMRSLAIDPS 132
Query: 323 KVNPRYIPT-LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
K +P+ L+ LGI+GGKP S Y +G Q++ +YLDPH QP +++ + +
Sbjct: 133 KDRSTCLPSSLQELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQAN 192
Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLF 438
+ ++H R + +DPS +GFY D+ +F+ C+ +++ S+ P+F
Sbjct: 193 FPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMF 250
Query: 439 TVTQTHKK 446
T+ + H +
Sbjct: 251 TLAEGHAQ 258
>gi|448509127|ref|XP_003866066.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
gi|380350404|emb|CCG20626.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
Length = 419
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 117/260 (45%), Gaps = 39/260 (15%)
Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
R FD + TSD GWGCM+R+SQ L+A AL K + +EIL L
Sbjct: 130 RSLFD---NENFTSDAGWGCMIRTSQNLLANAL---------LKLAGEANGNVQLEILKL 177
Query: 213 FGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
F D + FSIHN ++ A L+ G W GP A S L + Q P
Sbjct: 178 FQDDPNAAFSIHNFIRVASASPLSVKPGQWFGPNAASISIRQLT------IEMTDQESPT 231
Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
+ V E+ + DD + K P+LLL P+ LG++ VN Y
Sbjct: 232 VVPFVYISENAD-------LYDDEIEETFLKEK-----RPLLLLFPVRLGIDHVNKYYYK 279
Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPHDVQPVINIGKDDLEADTSTY 389
++ S+GI GGKP +S Y +G + +E+ IY DPH Q + + ++Y
Sbjct: 280 SILQLLASRFSVGIAGGKPSSSFYFIGYENDENLIYFDPHLPQVF------ESPINLASY 333
Query: 390 HSDVIRHIHLDSIDPSLAIG 409
H+ + ++ +DPS+ IG
Sbjct: 334 HTLNYNKLSIEMLDPSMMIG 353
>gi|195350257|ref|XP_002041657.1| GM16788 [Drosophila sechellia]
gi|194123430|gb|EDW45473.1| GM16788 [Drosophila sechellia]
Length = 269
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 62/227 (27%), Positives = 104/227 (45%), Gaps = 24/227 (10%)
Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
+SIH + Q G++ A G W+GP + + + L R + +AI+V
Sbjct: 4 YSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD--- 52
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
V +DD C + W P+LL++PL LG+ +NP Y+P L+
Sbjct: 53 ------STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDS 102
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHI 397
S G++GG+P + Y +G ++ +YLDPH Q + + A+ TYH +
Sbjct: 103 SCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARL 162
Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
+ ++DPSLA+ F C+ D F+ + + LF ++QT
Sbjct: 163 NFSAMDPSLAVCFLCKTSDSFESLLTQFKEEVLSLCSPALFEISQTR 209
>gi|149020503|gb|EDL78308.1| rCG31864, isoform CRA_a [Rattus norvegicus]
Length = 256
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 81/176 (46%), Gaps = 45/176 (25%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S S + L G C+ E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSK-ISSVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
S +TSD GWGCMLRS QM++AQ LL H L R WR
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192
Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
L+ DR + I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP 248
>gi|146161894|ref|XP_001008187.2| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|146146576|gb|EAR87942.2| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 516
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 98/400 (24%), Positives = 160/400 (40%), Gaps = 78/400 (19%)
Query: 142 QDFSSRILISYRKGFDPIGD------------SKITSDVGWGCMLRSSQMLVAQALLFHR 189
++F + I I+YRK F + + S+ SD GWGCM+R QM A+ L H
Sbjct: 71 ENFYNIIWITYRKNFPALLNMIDKANLKNQKMSEYISDTGWGCMVRVGQMAFAEGLRRHL 130
Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSWVGPY 244
+ +K + K + V I D + +P+SI + + A + L G W P
Sbjct: 131 VEN--KKLVVKKKEDLRVIIEGFLDDDQKCIDFAPYSIQKISKIALSDFNLLPGEWYTPI 188
Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGD-------EDGERGGAPVVCID 292
+C L ++A G + L +A++ +V D D +RG +C +
Sbjct: 189 RICYILGLLHNERKAIKG--TEDLKVAVFSSSRPIVFQDFLERMCKVDPQRGKHAQICPN 246
Query: 293 -------------DASRHCSVFSKGQ---------ADWTPILLLV-PL------------ 317
D H + + Q ++ TP L LV P+
Sbjct: 247 QCRIIKQDQKSKVDHDHHKDIKLEKQNSNSEILVVSEETPKLRLVCPIHHELQYSMIVYI 306
Query: 318 --VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
++GL+ P Y+ + F SLG++GGKP + Y VG E+ IYLDPH VQ
Sbjct: 307 VCLIGLDTPQPEYLELAKKMMDFKYSLGLIGGKPKKALYFVGRIEDEFIYLDPHYVQEFS 366
Query: 376 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 435
N + TY + +ID S ++ +Y +D + ++F L + N
Sbjct: 367 NEKNFQSSSQLETYFCKKFQTYPSKNIDSSFSLMYYLKDLEQLEEFYQFMMGLKRDYNEH 426
Query: 436 PLFTVTQTHKKPVNHSDVLG---ETGGVPEDDSLGVMSMN 472
+ T S LG E+ + D +L +++ N
Sbjct: 427 FFMMMEDTEP-----SFCLGDGKESSNLISDKNLNILADN 461
>gi|323331874|gb|EGA73286.1| Atg4p [Saccharomyces cerevisiae AWRI796]
Length = 347
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 109/247 (44%), Gaps = 28/247 (11%)
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
M+R+ Q L+ AL LGR +R + +RE + ++ F D+ +PFS+HN + AG
Sbjct: 1 MIRTGQSLLGNALQILHLGRDFRVNGNESLERES-KFVNWFNDTPEAPFSLHNFVSAGTE 59
Query: 233 YG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
G W GP A RS ++L G + I VS + E V
Sbjct: 60 LSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKVFAE 113
Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
+ SR IL L+ + LG+ VN Y ++ + QS+GI GG+P +
Sbjct: 114 NPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGRPSS 159
Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
S Y G Q ++ DPH QP + ++ + H+ + L +DPS+ IG
Sbjct: 160 SLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLIGIL 213
Query: 412 CRDKDDF 418
+ + D+
Sbjct: 214 IKGEKDW 220
>gi|298712912|emb|CBJ33424.1| Autophagy-related protein 4 [Ectocarpus siliculosus]
Length = 546
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 55/149 (36%), Positives = 79/149 (53%), Gaps = 12/149 (8%)
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
++LLVPL LGL++++ YIP+L T PQSLG +GG+P + + +G Q + LDPH
Sbjct: 380 VVLLVPLRLGLDELSTGYIPSLLETLRVPQSLGFLGGRPNHAIFFIGAQGNTLTGLDPHT 439
Query: 371 VQPVINIGKD-DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
QP ++G+ E + H + + IDPSLA+ FY D+ F+D R
Sbjct: 440 TQPAADMGEGFPSERYVHSLHCQSAVSMDVHRIDPSLALAFYLPDRATFEDLIKRIG--- 496
Query: 430 EESNGAPLFTVTQTHKKPVNHSDVLGETG 458
E+N P F+V QT D GE G
Sbjct: 497 -ETN-PPPFSVEQTRP------DYEGEMG 517
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 70/138 (50%), Gaps = 23/138 (16%)
Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
W++G+ + ++E E D S + I+YR GF + T D GWGCM
Sbjct: 38 WIMGIPYTELREE------------ERRLDVFSTMWITYRSGFPKMEPYGYTDDSGWGCM 85
Query: 174 LRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGD--SETSPFSIHN 225
LRS+QML+ QAL H LGR WR P L+ P EY ++ LF D E + FSIHN
Sbjct: 86 LRSAQMLMTQALQRHTLGRSWRVPRTLEERLRVP---EYRTLVRLFADHPGEANLFSIHN 142
Query: 226 LLQAGKAYGLAAGSWVGP 243
+ Q G Y G W GP
Sbjct: 143 MCQVGIRYDKLPGEWYGP 160
>gi|389602150|ref|XP_001566661.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322505338|emb|CAM40177.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 398
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 116/273 (42%), Gaps = 31/273 (11%)
Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
++ +YR GF+ P I +D GWGC+LR+SQML+A L + GRP + L FD
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWAY--GRPADRRLALFFDH- 102
Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
+ET+PFSIHNL+++ + P C EA+ R + +
Sbjct: 103 ---------SAETAPFSIHNLIRSVWNQRAFKAEYWSPSQGC---EAIKRTM--QDAIKT 148
Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 325
+ L + VV+ C+ H F +G A+ +L V + +
Sbjct: 149 EQLQTRVTVVTSTNG---------CVYADEVH-HTFKQG-AEVVLVLASVRVSAAAQLTQ 197
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
Y+ +L PQ LGIVGG PG S Y + YLDPH +
Sbjct: 198 ESYLQIEKL-MEQPQCLGIVGGVPGRSYYFFAHNQTQLFYLDPHQRTTAALLSDGPSATV 256
Query: 386 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
+ T +R +H +D SL + F +D++
Sbjct: 257 SVTPSVSDVRCVHWSRVDTSLFLAFAVTTRDEW 289
>gi|72389991|ref|XP_845290.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359288|gb|AAX79730.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei]
gi|70801825|gb|AAZ11731.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
Length = 327
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 136/292 (46%), Gaps = 38/292 (13%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+S L +YR+ FDP+ S +TSD GWGC+ R++QML+A +L R+ +
Sbjct: 41 NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 262
+Y L D + +PFS+H +++ + L G + P +A + EA++ C + T
Sbjct: 92 QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144
Query: 263 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
G S P+++ + V+G E V C SR+ +L+L PL G
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187
Query: 322 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGK 379
+ ++ + +L P+S+G+VGG P YI+G +E +YLDPH +
Sbjct: 188 SRYMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCKTQDALLSS 247
Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
+ E S +R + +D S +GF+ + ++ R L+++
Sbjct: 248 EPGETGVVKPTSSNLRSVPYGQVDTSFFLGFFVDSQSRWESLQKRIEGLSKQ 299
>gi|261328682|emb|CBH11660.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
gambiense DAL972]
Length = 327
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 136/292 (46%), Gaps = 38/292 (13%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+S L +YR+ FDP+ S +TSD GWGC+ R++QML+A +L R+ +
Sbjct: 41 NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 262
+Y L D + +PFS+H +++ + L G + P +A + EA++ C + T
Sbjct: 92 QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144
Query: 263 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
G S P+++ + V+G E V C SR+ +L+L PL G
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187
Query: 322 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGK 379
+ ++ + +L P+S+G+VGG P YI+G +E +YLDPH +
Sbjct: 188 SRCMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCKTQDALLSG 247
Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
+ E S +R + +D S +GF+ + ++ R L+++
Sbjct: 248 EPGETGVVKPTSSNLRSVPYGQVDTSFFLGFFVDSQSRWESLQKRIEGLSKQ 299
>gi|428184439|gb|EKX53294.1| hypothetical protein GUITHDRAFT_133035 [Guillardia theta CCMP2712]
Length = 567
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 80/146 (54%), Gaps = 10/146 (6%)
Query: 297 HCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
+CS ++ + W P++++VP+ LG + L QSLG +GG+P S Y
Sbjct: 406 NCSRMAQAREPCSWRPLIVVVPVRLGARSEDQH----LSRIDKHLQSLGFIGGRPRHSYY 461
Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
VGV+ +A YLDPH QP +I K+ + +++H + L IDPSLA+GFYC D
Sbjct: 462 FVGVRGYNAYYLDPHITQPYQSIRKN---INVASFHCAHPGKMSLAHIDPSLALGFYCDD 518
Query: 415 KDDFDDFCARASKLAEESNGAPLFTV 440
K DF+D R +LA + P+ +V
Sbjct: 519 KSDFEDLIRRVEELA-AGDSHPILSV 543
>gi|340054025|emb|CCC48319.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma vivax Y486]
Length = 326
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 118/280 (42%), Gaps = 35/280 (12%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+S L++YR F+P+ S +TSD GWGC+ R+SQML+A L H
Sbjct: 41 TSFYLLTYRMNFEPLPCSTLTSDRGWGCLARASQMLLAHVLRRHAASEC----------- 89
Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETG 262
+++ D +PFS+H + +A +G A W P C EA+ C +
Sbjct: 90 -HLKFFCDMNDEHLAPFSLHCMTRAVIKHGTEFRADYW-APSQGC---EAIRSCVESAVR 144
Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL- 321
G + +++ V S ER + + D + +L+LVP+ G
Sbjct: 145 QGLLTQKLSVVVSSSGTIPER---------------EIHEHLRGDGS-VLVLVPVRCGTS 188
Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
++ L P +G+VGG P YIVG +YLDPH + + +
Sbjct: 189 RRMTQTMFFALEHLLHIPSCMGVVGGVPNRGYYIVGTSGHRLLYLDPHCMTQNAMVSCEL 248
Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
+ T ++++R + D +D S GF D+++
Sbjct: 249 GKVGIVTPTTNLLRSVRWDHVDTSFFFGFLLDSLDEYEKL 288
>gi|345311182|ref|XP_001519565.2| PREDICTED: cysteine protease ATG4D-like, partial [Ornithorhynchus
anatinus]
Length = 147
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 32/136 (23%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----R 195
F +DF SR+ ++YR+ F P+ S TSD GWGCMLRS QML+AQ L+ H L R W
Sbjct: 5 FQRDFVSRLWLTYRRDFPPLEGSAWTSDCGWGCMLRSGQMLLAQGLVVHLLSRDWIWAEA 64
Query: 196 KPLQKP----------------------------FDREYVEILHLFGDSETSPFSIHNLL 227
P KP +R++ I+ F D +PFS+H L+
Sbjct: 65 GPAPKPGEHRLLKSDPGGPSRSPAPPPPAGVLQEQERQHRRIVSWFADHPQAPFSLHRLV 124
Query: 228 QAGKAYGLAAGSWVGP 243
+ G+ G AG W GP
Sbjct: 125 RLGQGSGKRAGDWYGP 140
>gi|119623097|gb|EAX02692.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_d
[Homo sapiens]
Length = 172
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 69/127 (54%), Gaps = 11/127 (8%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 33 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH + + +
Sbjct: 82 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKMCRV 141
Query: 233 YGLAAGS 239
L+A +
Sbjct: 142 LPLSADT 148
>gi|167381603|ref|XP_001735783.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165902089|gb|EDR28003.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 359
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 92/389 (23%), Positives = 156/389 (40%), Gaps = 72/389 (18%)
Query: 80 AAVKRLVTAGSMRRI--------HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGD 130
A ++LV GS + HE + P G S ++LGV K Q D+ L +
Sbjct: 2 AYFQKLVQHGSYNILSKFYNQIGHEDIQKPIFIGGCS----FYILGVEFKTKQMDKQLAE 57
Query: 131 AAGNNGL----AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL- 185
L A S+ ++YR G++ + +S +T+DVGWGC +R+ QM++A A+
Sbjct: 58 QPPEVYLQYSSAPAFFRISNLFWMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAME 117
Query: 186 ------LFHRLGRPWRKPLQKPFDREYVEILHLFGDS--ETSPFSIHNLLQAGKAY--GL 235
+ P+ P E + +L F DS T+P SIH++ ++
Sbjct: 118 TIVYSGALNNTQTPYI-----PTKEEIMNVLVPFIDSPNSTTPLSIHHVYESRFVVEKNK 172
Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
+ +++ P + +++ L + P+ C+ ++
Sbjct: 173 SGVNYLAPSVVAKAYSGLVNSWKL--------------------------CPIRCVMCSN 206
Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
+ + P L+ +P+VL N L+ + GIVGG + ++
Sbjct: 207 VSIPTHELSKLPFKPTLVFLPIVL-----NHLIHSKLQQIYKSKLFAGIVGGMGDRAIFV 261
Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS-----LAIGF 410
G +YLDPH VQP K E DT +Y + +IDP+ GF
Sbjct: 262 FGFHALQFLYLDPHIVQPSF---KSFTEIDTKSYSPISTNRFSVHTIDPTKLDDFCTFGF 318
Query: 411 YCRDKDDFDDFCARASKLAEESNGAPLFT 439
++ + DDF A ++ E SN L T
Sbjct: 319 LIKNFHEIDDFMKFAKEVFEISNDKELRT 347
>gi|256078123|ref|XP_002575347.1| autophagin-1 (C54 family) [Schistosoma mansoni]
gi|360045353|emb|CCD82901.1| autophagin-1 (C54 family) [Schistosoma mansoni]
Length = 556
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 69/122 (56%), Gaps = 4/122 (3%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 196
E + +SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR W+
Sbjct: 37 EIARHLNSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRFHLGRSWKWS 96
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
P Q+ EY +L +F D ++ +SI + G + G + GSW GP + + + L+
Sbjct: 97 PEQE--SPEYYRLLQMFQDRRSALYSIQTITLTGVSLGKSIGSWFGPNTVAQVLKKLSVY 154
Query: 257 QR 258
R
Sbjct: 155 DR 156
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 63/122 (51%), Gaps = 15/122 (12%)
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TSTYHSDV 393
F P +GI+GG P + +IVGV ++ I LDPH QP G+ +L+ D TYH D
Sbjct: 350 VFRLPHCVGILGGSPCHAVWIVGVTDDDVICLDPHTTQPA---GRGNLKPDYDQTYHCDN 406
Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE------SNGAPLFTVTQTHKKP 447
I L +DPS+ +GF C + +FDD C L EE +N PL + T +P
Sbjct: 407 PIRIPLKRLDPSMVLGFLCSTEKEFDDLC---HNLKEEVLHPSVANSWPLVEIHTT--RP 461
Query: 448 VN 449
N
Sbjct: 462 SN 463
>gi|403345460|gb|EJY72096.1| Cysteine protease family C54 putative [Oxytricha trifallax]
Length = 823
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 67/113 (59%), Gaps = 3/113 (2%)
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
IL+++P LGL KVN Y +++ F ++GI+GG+P + Y VG Q+ I LDPH
Sbjct: 611 ILVIIPTRLGLNKVNKEYYSSIKYVFQCRLNVGIMGGRPNQALYFVGTQKTDLICLDPHL 670
Query: 371 VQPVINIGKDDLE--ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
VQ + + +++L TYH D + + + +D SLA GFY +D +DF+ F
Sbjct: 671 VQDTV-LNQEELSNVELNQTYHCDQAKKLSMTKLDTSLAFGFYLKDYNDFEVF 722
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 10/99 (10%)
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLG-----RPWRKPLQKPFDREYVEILHLFGD---S 216
T+DVGWGC +R QM++ QAL+ H +G + QK + Y +I+ L D S
Sbjct: 394 TTDVGWGCTIRVGQMMICQALMRHLIGLDHSVKNLSSTEQKRLN--YAKIIQLIHDNDCS 451
Query: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
+T FSI N+ + G + G W GP+A+ L R
Sbjct: 452 QTGAFSIQNIAKMGFCHDKLPGEWYGPHALTIMLRDLNR 490
>gi|343472883|emb|CCD15086.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 327
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/290 (28%), Positives = 131/290 (45%), Gaps = 42/290 (14%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L +YRK F+P+ S IT+D GWGC+ R+SQML+A AL R+ + F +Y
Sbjct: 45 LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMTLDFSFQYFC 95
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
+ D +PFS+H ++++ G L W P C EA++ C R+ G
Sbjct: 96 DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRSAIHRGAL 148
Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 325
+ + V G A + + +RH G A L+LVP+ G ++
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGKDDLEA 384
+ +L P +G+VGG PG YIVG +E +YLDPH + + E+
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIVGTGGQELLLYLDPHCMTQEALVS---CES 249
Query: 385 DTSTYHSDVIRH---IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
DT+ RH + D +D S IGF+ + ++D + L+ +
Sbjct: 250 DTAGVVRPTPRHLLCVPYDRVDTSFFIGFFVDSFELWEDLQKKIEGLSRQ 299
>gi|221046296|dbj|BAH14825.1| unnamed protein product [Homo sapiens]
Length = 280
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/199 (30%), Positives = 89/199 (44%), Gaps = 44/199 (22%)
Query: 79 TAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
A ++ L AG + + SRT S +S + +C + + E GD +
Sbjct: 84 VAVMQVLHLAGRCPYVSPGWVVKSRTSFSKISS----IHLCGRRYRFEGEGD------IQ 133
Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---- 194
F +DF SR+ ++YR+ F P+ +TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 134 RFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAE 193
Query: 195 ---------------------------RKPLQKP---FDREYVEILHLFGDSETSPFSIH 224
R P +R + +I+ F D +PF +H
Sbjct: 194 GMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLH 253
Query: 225 NLLQAGKAYGLAAGSWVGP 243
L++ G++ G AG W GP
Sbjct: 254 RLVELGQSSGKKAGDWYGP 272
>gi|183230042|ref|XP_653798.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169803042|gb|EAL48412.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449708555|gb|EMD47997.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 359
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 144/344 (41%), Gaps = 52/344 (15%)
Query: 113 IWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRILISYRKGFDPIGDSKITS 166
++LGV K Q D+ L + L A F + S+ ++YR G++ + +S +T+
Sbjct: 39 FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAATFFR-ISNLFWMTYRSGYEKLPNSSLTT 97
Query: 167 DVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVEILHLFGDS--ETSPFS 222
DVGWGC +R+ QM++A A+ + + + P +E + +L F DS T+P S
Sbjct: 98 DVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIPTKQEVMNVLIPFIDSPNSTTPLS 157
Query: 223 IHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
IH++ ++ + +++ P + +++ L +
Sbjct: 158 IHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLVNSWKL--------------------- 196
Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
P+ C+ ++ + + P L+ +P+VL N L+ +
Sbjct: 197 -----CPIRCVMCSNVSIPTHELSKLPFKPTLVFLPIVL-----NHLIHSKLQQIYKSKL 246
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
GIVGG + ++ G +YLDPH VQP K E DT +Y +
Sbjct: 247 FAGIVGGMGDRAIFVFGFHALQFLYLDPHIVQPSF---KSFTEIDTKSYSPIGTNRFSVH 303
Query: 401 SIDPS-----LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 439
+IDP+ GF ++ + DDF A + E SN L T
Sbjct: 304 TIDPTKLDDFCTFGFLIKNLHEVDDFMKLAKDVFEISNDKELRT 347
>gi|76156435|gb|AAX27646.2| SJCHGC05841 protein [Schistosoma japonicum]
Length = 414
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)
Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 196
E SR+ ++YRKGF PIG SD GWGCM R QM++A+A+L LGR WR
Sbjct: 43 EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
P Q+ EY +L +F D + +SI + G + G + GSW GP + + + L+
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160
Query: 257 QR 258
R
Sbjct: 161 DR 162
>gi|50291183|ref|XP_448024.1| hypothetical protein [Candida glabrata CBS 138]
gi|62899752|sp|Q6FP20.1|ATG4_CANGA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49527335|emb|CAG60975.1| unnamed protein product [Candida glabrata]
Length = 483
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 77/265 (29%), Positives = 122/265 (46%), Gaps = 39/265 (14%)
Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIH 224
+DVGWGCM+R+ Q L+ AL R+ + +P D + EI LF D+ S FS+
Sbjct: 135 TDVGWGCMIRTGQSLLGNAL--QRVKSTVKDQPYIYEMD-DTKEITDLFKDNTKSAFSLQ 191
Query: 225 NLLQAGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
N ++ G+ Y +A G W GP L + C I V SGD E
Sbjct: 192 NFVKCGRIYNKIAPGEWFGPATTATCIRYLIQENPCYGIEAC-----YISVSSGDIFKEN 246
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTP---ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
+G D P IL+L+ + LGL+ V+ RY ++ P
Sbjct: 247 ------------------IQGMIDRYPNGNILILLGIKLGLDSVHERYWGEIKTMLESPF 288
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
S+GI GG+P +S Y G +++ ++ DPH+ Q + DD + + H++ ++
Sbjct: 289 SVGIAGGRPSSSLYFFGYFDDTLLFFDPHNSQTAL---IDDFD---ESCHTENFGKLNFS 342
Query: 401 SIDPSLAIGFY--CRDKDDFDDFCA 423
+DPS+ +GF C D+F +F +
Sbjct: 343 DLDPSMLLGFLLPCSKWDEFQEFTS 367
>gi|351695136|gb|EHA98054.1| Cysteine protease ATG4A [Heterocephalus glaber]
Length = 356
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/257 (26%), Positives = 102/257 (39%), Gaps = 87/257 (33%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 79 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 127
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
MLR QM++AQAL+ LGR Q G
Sbjct: 128 MLRCGQMMLAQALICRHLGRA----------------------------------QMGVG 153
Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
G + G W GP + + + LA + +A+YV + V I+
Sbjct: 154 EGKSVGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 196
Query: 293 DASRHCSV--FSKGQAD----------------------WTPILLLVPLVLGLEKVNPRY 328
D + C + FS AD W P+LL+VPL LG+ ++NP Y
Sbjct: 197 DIKKMCRILPFSADTADESPPDSFITSNQSKGTSAFCPAWKPLLLIVPLRLGINQINPVY 256
Query: 329 IPTLRLTFTFPQSLGIV 345
+ + TF + G V
Sbjct: 257 VDAFK-TFVDTEENGTV 272
>gi|401427503|ref|XP_003878235.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322494482|emb|CBZ29784.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 388
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 73/295 (24%), Positives = 123/295 (41%), Gaps = 44/295 (14%)
Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
+G EF + + ++L SYR F P+ + + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAAAKKLLYFSYRNCFPPLPN-RSTTDTRWGCLVRTTQMLVGSCLLRYHCKGA 112
Query: 194 WRKPLQKPFDREYVE----ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
+ P +R+ E I LF D ++P IH + + S + P
Sbjct: 113 YVLP-----ERDNAELKERISRLFMDVPSAPLGIHKVEDEAHKNSVKYASMLSP------ 161
Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHCSVFSKGQADW 308
E G+ + +A + GD AP C ++ + S ++
Sbjct: 162 ---------TEAGMAIAAALIAFHAQGGD-------APFTFCCENRNIDESAVMAKLSEG 205
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
++L++P+VLG+ ++ +Y L GI GG AS Y+ G Q + ++DP
Sbjct: 206 QHVILIIPVVLGIAPMSGQYERMLLKILDMKACCGIAGGFKQASLYMFGHQGRNVFFMDP 265
Query: 369 HDVQPVINIGKD--DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
H VQ G+ LE + DP + +GFY D+ +F
Sbjct: 266 HYVQRAYTSGRTVGTLEGARG--------DLAARRFDPCMVLGFYLHTPADYCEF 312
>gi|402581511|gb|EJW75459.1| peptidase family C54 containing protein [Wuchereria bancrofti]
Length = 256
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 64/122 (52%), Gaps = 12/122 (9%)
Query: 128 LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
LG+ + G +A + +S + +YRK F PIG + T+D GWGCMLR QML+A+ L+
Sbjct: 30 LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 89
Query: 187 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 241
LG W +DR EY IL +F D + FSIH + G + G G W
Sbjct: 90 VRHLGHNWL------WDRDVKLTEYKRILRMFQDKKNCLFSIHQIANMGVSEGKEIGEWF 143
Query: 242 GP 243
GP
Sbjct: 144 GP 145
>gi|432110194|gb|ELK33968.1| Cysteine protease ATG4A, partial [Myotis davidii]
Length = 256
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 11/114 (9%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG H + +++ + D S+R+ +YR+ F PIG + +SD GWGC
Sbjct: 27 VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
MLR QM++AQAL+ LGR W QK +EY IL F D + +SIH +
Sbjct: 76 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQM 129
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 51/91 (56%), Gaps = 1/91 (1%)
Query: 354 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 413
Y + + I+LDPH Q ++ +D D + + + +++ ++DPS+A+GF+C+
Sbjct: 124 YSIHQMGDELIFLDPHTTQTFVDTEEDGTVDDQTFHCLQSPQRMNILNLDPSVALGFFCK 183
Query: 414 DKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
++ DFD++C+ K + N +F + Q H
Sbjct: 184 EEKDFDNWCSLVQKEILKEN-LRMFELVQKH 213
>gi|118378678|ref|XP_001022513.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89304280|gb|EAS02268.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 649
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 85/358 (23%), Positives = 145/358 (40%), Gaps = 36/358 (10%)
Query: 148 ILISYRKGFDPIGD-----SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
I SYR F I D +++D GWGCM+R SQML+A+AL H L + Q
Sbjct: 145 IWFSYRNNFPLIRDVADDNQSVSNDYGWGCMIRCSQMLLAEALKRHYLNDQNIQIEQLSQ 204
Query: 203 DRE---YVEILHLFGD--SETSPFS------------IHNLLQAGKAYGLAAGSWVGPYA 245
D E Y I+ LF D SE+ + + N Y L + A
Sbjct: 205 DDEKHFYSNIIKLFLDCTSESDVLNQPGSYQDIQSKMLLNEQNLNNIYSLFGIQNICQSA 264
Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---RHCSVFS 302
+ R ++ + T + + I S + + G ++ D + S
Sbjct: 265 ILRQYQQ--NVKNWYTSIQVSVILQEILEESQSKLNSKLGFHILNFTDQIIFLKELEEAS 322
Query: 303 KGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
+ Q D IL++V L G+ K ++ +G + G YI+G QE+
Sbjct: 323 RKQNDRLNNILVMVHLKFGINKFEMQHKDYFIELLKIKNFVGALSGTETKGMYIIGFQED 382
Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR--DKDDFD 419
I LDPH +Q G+ L+ D TY + R I L+ + +++G++ + ++ +
Sbjct: 383 RLIVLDPHFIQKSTE-GEQGLDKDYCTYFNKTPRSISLECLSSDISLGYFIQVNEEQSIN 441
Query: 420 DFCARASKLAEESNGAPLFTV----TQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 473
F + L E+ + PL ++ +T + + + E DS+ +S N+
Sbjct: 442 QFIDQILTLNEK-HKEPLLSILNDRIETDEMEIEEHQINKEVKDQENQDSVNNISQNE 498
>gi|342181415|emb|CCC90894.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma congolense
IL3000]
Length = 327
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 129/290 (44%), Gaps = 42/290 (14%)
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
L +YRK F+P+ S IT+D GWGC+ R+SQML+A AL R+ + F +Y
Sbjct: 45 LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMALDFSFQYFC 95
Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
+ D +PFS+H ++++ G L W P C EA++ C R G
Sbjct: 96 DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRRAIHRGAL 148
Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 325
+ + V G A + + +RH G A L+LVP+ G ++
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGKDDLEA 384
+ +L P +G+VGG PG YI+G +E +YLDPH + + E+
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIIGTGGQELLLYLDPHCMTQEALVS---CES 249
Query: 385 DTSTYHSDVIRH---IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
DT RH + D +D S +GF+ + ++D + L+ +
Sbjct: 250 DTVGVVRPTPRHLLCVPYDRVDTSFFLGFFVDSFELWEDLQKKIEGLSRQ 299
>gi|66359342|ref|XP_626849.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
gi|46228139|gb|EAK89038.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
Length = 348
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 94/347 (27%), Positives = 146/347 (42%), Gaps = 68/347 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSK------------ITSDVGWGCMLRSSQMLVAQALLF 187
F ++F IL +YR F I ++ I SDVGWGCM R +QM +A +
Sbjct: 44 FLKEFHDIILFTYRNEFKNIIITRNTVQLTKNYSKNINSDVGWGCMYRVTQMSIAHGIC- 102
Query: 188 HRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAGSWVGPYAM 246
+ K + E +IL+ F D+E++ FSIHN++ G + +G+ SW+GP
Sbjct: 103 -----QFMKRFLGNLNIE--KILNNFQDNESAKFSIHNMVNIGLSEFGIDPTSWIGPTTS 155
Query: 247 CRSWEALARCQRAETGLGCQSLPMA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
L R+ ++ +A I V G + D A +H FS+
Sbjct: 156 SMIANKLINDNRSIIS----NIQIASITYVEG----------TIYRDQAVKH---FSEVG 198
Query: 306 ADWTPILLLVPLVLGLEKVNPR-YIPTLRLTFTFPQSLGIVGGKPGAS--TYIVGVQEES 362
+D + L + LG K N Y T+ Q + I+GG +S IV
Sbjct: 199 SDSCTFVWLC-MKLGTSKFNINSYKKTVISMSNVSQFICIMGGNNYSSGALLIVAFSNSF 257
Query: 363 AIYLDPH-DVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
LDPH V P N +DD T I+ ++ SL++ + CR+ +DF
Sbjct: 258 LYCLDPHIKVLPSFSDKNFIRDDFIQKVPT-------RIYWGELNSSLSMVYICRNLEDF 310
Query: 419 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDS 465
DD C+ +++ + LF V +N+ D E + E DS
Sbjct: 311 DDLCSNLTRI-----NSDLFEV-------INNCDF--EVKSINELDS 343
>gi|398021304|ref|XP_003863815.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
gi|322502048|emb|CBZ37132.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
Length = 388
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/360 (24%), Positives = 145/360 (40%), Gaps = 52/360 (14%)
Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
+G EF + + ++L SYR F P+ + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGA 112
Query: 194 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
+ P + E E I LF D ++P IH + S + P
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CID----DASRHCSVFSKGQAD 307
E G+ + +A + GD P C + D + S+GQ
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGD-------VPFTFCCESRNIDEPAVMAKLSEGQH- 207
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++D
Sbjct: 208 ---VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMD 264
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIR-HIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
PH +Q + +D + + R + DP + +GFY +D+ F A
Sbjct: 265 PHYIQ-------NAYTSDRTVGTLEGARGELSARRFDPCMVLGFYLHTLEDYRVF-AEEL 316
Query: 427 KLAEESNGAPLFTVTQTHKKPVNHSD------VLGETGGVPEDDSLGVMSMND-AVGNAH 479
+A PL + Q ++ SD E G +P ++ +S N A G H
Sbjct: 317 AVANSLVAFPLISFGQRPREGTTPSDNGVVSVAESEEGIMPHENEKSQLSPNPLAAGGGH 376
>gi|440301471|gb|ELP93857.1| hypothetical protein EIN_176840 [Entamoeba invadens IP1]
Length = 362
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 146/346 (42%), Gaps = 67/346 (19%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ-----DFSSRILISYRKGFDPIGDSKITSD 167
++LLG+ +K + + L +++ S+ + ++YR G++ + +S + +D
Sbjct: 39 LFLLGIEYKTTPLKKQAQELPQSSLLQYSSMAAYVRMSNLLWMTYRSGYEKLPNSSLNTD 98
Query: 168 VGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVEILHLFGD--SETSPFSI 223
VGWGC +R+ QM+++ A+ L ++ P E + ++ F D +T+P SI
Sbjct: 99 VGWGCTIRAVQMMISNAMQTLVYKHDLTSSTTPYIPKQNEILNVVIPFVDFFEQTTPLSI 158
Query: 224 HNLLQ---------AGKAYGLAAGSWVGPYA-MCRSWEALA-RCQRAETGLGCQSLPMAI 272
H++ + +G Y LA Y+ + SW+ A RC A S+P+
Sbjct: 159 HHVYESRFVVEQNKSGVNY-LAPTIVAKAYSDLVNSWKMCALRCVMASNT----SIPL-- 211
Query: 273 YVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
C + + + P L+ +P+++ + V R L
Sbjct: 212 -------------------------CDI---KKEPFKPTLVFLPIIMD-QLVKSR----L 238
Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI-NIGKDDLEAD---TST 388
+ + F GIV G + YI G ++LDPH VQP + K DL++ T
Sbjct: 239 QQIYKFNMFAGIVSGIGDRAVYIFGFHVMRCLFLDPHTVQPAAESFTKIDLKSYAPINPT 298
Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCR---DKDDFDDFCARASKLAEE 431
+ I I LD ID GF + + D F+ FC ++ E
Sbjct: 299 LNRFAIHSIELDKIDQFCTFGFLIKSLEEVDAFEKFCTETFDISHE 344
>gi|407852207|gb|EKG05835.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
Clan CA, family C54, putative [Trypanosoma cruzi]
Length = 328
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 121/285 (42%), Gaps = 43/285 (15%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 248
WR + ++ D+E ++PFS+H +++A KA W
Sbjct: 82 --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 306
GC+++ + + +R P + + S+ C + +
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170
Query: 307 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
++ +L+L P+ G ++ +L +G+VGG P S YI+G + +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMKFFSLEHLLHSSTCIGVVGGVPQRSYYILGTSGQRLLY 230
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
LDPH + + +A T + +++ + D +D S +GF
Sbjct: 231 LDPHCMTQEALVSSHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|71407017|ref|XP_806004.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70869620|gb|EAN84153.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
gi|111154177|gb|ABH07410.1| autophagin-1 [Trypanosoma cruzi]
Length = 328
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 124/291 (42%), Gaps = 48/291 (16%)
Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLF 187
LG A NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 25 LGRVA-NNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL-- 81
Query: 188 HRLGRPWRKPLQKPFDREYVEILHLFGDSET---SPFSIHNLLQA--GKAYGLAAGSWVG 242
WR + + H F D +T +PFS+H +++A KA W
Sbjct: 82 ------WR------YSANDCRLDH-FRDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWT- 127
Query: 243 PYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS 302
GC+++ + + +R P + + S+ C +
Sbjct: 128 ------------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAR 164
Query: 303 K--GQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
+ ++ +L+L P+ G ++ +L +G+VGG P S YI+G
Sbjct: 165 EICSNLEFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTS 224
Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
+ +YLDPH + + +A T + +++ + D +D S +GF
Sbjct: 225 GQRLLYLDPHCMTQEALVSSHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|71425372|ref|XP_813094.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70877946|gb|EAN91243.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
Length = 328
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 121/285 (42%), Gaps = 43/285 (15%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
NN N + L++YR F P+ S +TSD GWGC++RSSQML+A AL
Sbjct: 28 VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81
Query: 192 RPWRKPLQKPFDREYVEILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 248
WR + ++ D+E ++PFS+H +++A KA W
Sbjct: 82 --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 306
GC+++ + + +R P + + S+ C + +
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170
Query: 307 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
++ +L+L P+ G ++ +L +G+VGG P S YI+G + +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLY 230
Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
LDPH + + +A T + +++ + D +D S +GF
Sbjct: 231 LDPHCMTQEALVSGHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|146097214|ref|XP_001468076.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
gi|134072442|emb|CAM71152.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
Length = 388
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 88/360 (24%), Positives = 145/360 (40%), Gaps = 52/360 (14%)
Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
+G EF + + ++L SYR F P+ + T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGT 112
Query: 194 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
+ P + E E I LF D ++P IH + S + P
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CID----DASRHCSVFSKGQAD 307
E G+ + +A + GD P C + D + S+GQ
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGD-------VPFTFCCESRNIDEPAVMAKLSEGQH- 207
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++D
Sbjct: 208 ---VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMD 264
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIR-HIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
PH +Q + +D + + R + DP + +GFY +D+ F A
Sbjct: 265 PHYIQ-------NAYTSDKTVGTLEGARGELSARRFDPCMVLGFYIHTLEDYRVF-AEEL 316
Query: 427 KLAEESNGAPLFTVTQTHKKPVNHSD------VLGETGGVPEDDSLGVMSMND-AVGNAH 479
+A PL + Q ++ SD E G +P ++ +S N A G H
Sbjct: 317 VVANSLVAFPLISFGQRPREGTTPSDNGVVSVAESEEGIMPHENEKSQLSPNPLAAGGGH 376
>gi|148707987|gb|EDL39934.1| autophagy-related 4B (yeast), isoform CRA_c [Mus musculus]
Length = 128
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 62/116 (53%), Gaps = 15/116 (12%)
Query: 113 IWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 170
+W+LG + I +DE L D A SR+ +YR+ F IG + TSD GW
Sbjct: 23 VWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTSDTGW 69
Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
GCMLR QM+ AQAL+ LGR WR +K Y +L+ F D + S +SIH +
Sbjct: 70 GCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 125
>gi|118378680|ref|XP_001022514.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89304281|gb|EAS02269.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 371
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 77/306 (25%), Positives = 131/306 (42%), Gaps = 44/306 (14%)
Query: 142 QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
++ SS + +SY+K + IT+D GWGC LR+SQM++AQ L H + K +Q
Sbjct: 52 EELSSLVFLSYKKNMKEFQYLSTTITTDNGWGCSLRTSQMMLAQGLKRH----LYEKRVQ 107
Query: 200 KPF--DREYVEILHL---FGDSET------SPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
D+ ++ HL F +S + SPF H+LL +A L Y +
Sbjct: 108 SFIYNDKTKLDFQHLIMMFAESNSLENMDQSPFGFHSLL--TQAINLFQVPLKQQYTPVQ 165
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
+AL + Q L ++ +V+ V+ +D + + K
Sbjct: 166 GIKALKQ------QFKQQKLVKSLKIVT-------SSTGVIFQEDIRQKMKNWEKS---- 208
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
+LL++ LG K+N Y+ ++ +G +GG S ++VG + + LDP
Sbjct: 209 --LLLILHFKLGTGKLNQIYVEQIKSLMDLEYFVGAIGGIKNKSLFMVGYMNDQFLSLDP 266
Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS---IDPSLAIGFYCRDKDDFDDFCARA 425
H Q N KD L + S + + DS + +I FY R + ++ F +
Sbjct: 267 HVQQ---NACKDPLNLNDEEMSSFFPKKVRADSCVKYEGDFSISFYIRSEKQYNIFLQKI 323
Query: 426 SKLAEE 431
S L ++
Sbjct: 324 SNLNKQ 329
>gi|407417199|gb|EKF38000.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
Length = 328
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 67/269 (24%), Positives = 116/269 (43%), Gaps = 43/269 (15%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
L++YR F P+ S +TSD GWGC++RSSQML+A AL WR +
Sbjct: 44 FLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSANDCRLDHFR 95
Query: 208 EILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
+I D+E ++PFS+H +++A KA W G
Sbjct: 96 DI-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT-------------------PSQG 131
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGL- 321
C+++ + + +R P + + S+ C + + ++ +L+L P+ G
Sbjct: 132 CEAIRCCV-----NNAVDRRLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCGAS 186
Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
++ +L +G+VGG P S YI+G + +YLDPH + +
Sbjct: 187 RRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLYLDPHCMTQEALVSSHA 246
Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
A T + +++ + D +D S +GF
Sbjct: 247 ERAGVVTVTASLVKSVRWDCVDTSCFLGF 275
>gi|297601024|ref|NP_001050279.2| Os03g0391000 [Oryza sativa Japonica Group]
gi|255674556|dbj|BAF12193.2| Os03g0391000, partial [Oryza sativa Japonica Group]
Length = 81
Score = 82.0 bits (201), Expect = 7e-13, Method: Composition-based stats.
Identities = 35/48 (72%), Positives = 41/48 (85%)
Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
RYIP L+ T TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ V
Sbjct: 10 RYIPLLKETLTFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLV 57
>gi|322701885|gb|EFY93633.1| cysteine protease atg4 [Metarhizium acridum CQMa 102]
Length = 255
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 67/140 (47%), Gaps = 27/140 (19%)
Query: 133 GNNGLAEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVG 169
G A F DF+SR ++YR F DP + S TSD G
Sbjct: 116 GTGWPAAFLDDFASRFWMTYRSNFELIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSG 175
Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
WGCM+RS Q L+A AL LGR WR+ + DRE +L LF D +P+S+HN ++
Sbjct: 176 WGCMIRSGQSLLANALAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSVHNFVRH 232
Query: 230 GKAY-GLAAGSWVGPYAMCR 248
G+ Y G W GP A R
Sbjct: 233 GEKYCSKYPGEWFGPSATAR 252
>gi|440300801|gb|ELP93248.1| hypothetical protein EIN_056230 [Entamoeba invadens IP1]
Length = 321
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 81/323 (25%), Positives = 120/323 (37%), Gaps = 91/323 (28%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
I+ L H + + DAA I I+YR+ + +G + +TSD GWGC
Sbjct: 38 IFGLSYTHDTPSELSFADAA---------HRIHDLITITYRQKYATLGHTYLTSDAGWGC 88
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH---------LFGDSETSPFSI 223
+RS QML+ +++ + L K F EY H L D E+S SI
Sbjct: 89 AIRSVQMLLVNSIVVY---------LDKSFHPEYTSHDHIAIKNNAKQLVFDKESSVLSI 139
Query: 224 HNL-LQAGKAYGLAAGSWVGPYAMCRS--------WEALARCQRAETGLGCQSLPMAIYV 274
HN+ +Q G+ P + C + WE +R L C
Sbjct: 140 HNIYIQDAIIKHNPTGTNFLPPSTCATAVADLYNFWE-----KRTFDVLMCTEY------ 188
Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
I + ++ P LL +P ++ + N ++
Sbjct: 189 ----------------IPEVTQ-------------PTLLFIPRIVTKSERN-----FIQT 214
Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 394
T PQS G V G A+ Y GVQE+ +LDPH VQ +G Y + I
Sbjct: 215 TSFLPQSRGFVAGIGDAAIYCFGVQEKRVFFLDPHFVQDASEVG----------YFNRPI 264
Query: 395 RHIHLDSIDPSLAIGFYCRDKDD 417
+ D +D S G C +K D
Sbjct: 265 FEANFDELDNSFVFGMMCENKSD 287
>gi|425784144|gb|EKV21938.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
digitatum Pd1]
Length = 208
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 67/145 (46%), Gaps = 30/145 (20%)
Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------I 164
L D A N F DF SRI I+YR F PI +K
Sbjct: 59 LNDTAWPNA---FVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGF 115
Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 224
TSD GWGCM+RS Q L+A A LGR WR+ + + E +++ +F D +PFSIH
Sbjct: 116 TSDTGWGCMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIH 172
Query: 225 NLLQAG-KAYGLAAGSWVGPYAMCR 248
+ G ++ G G W GP A +
Sbjct: 173 KFVNRGAESCGKYPGEWFGPSATAK 197
>gi|225554849|gb|EEH03143.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 425
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/172 (31%), Positives = 75/172 (43%), Gaps = 27/172 (15%)
Query: 101 PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF---- 156
P+R+ S++ LL LG + F DF S+I ++YR F
Sbjct: 85 PTRSSDSATKPQRHLLPFAIHRGSTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLIP 144
Query: 157 ---DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
DP + T+D GWGCM+RS Q L+A AL LGR WR+
Sbjct: 145 KSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRRG 204
Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCR 248
+ +E ++L LF D +PFSIH ++ G A G G W GP A R
Sbjct: 205 TKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATAR 253
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 54/108 (50%), Gaps = 6/108 (5%)
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD-----LEADTSTYHSDVIRHIHL 399
+ G+P +S Y +G Q YLDPH +P + + +D + +TYH+ +R +H+
Sbjct: 255 IHGRPSSSHYFIGAQGSHFFYLDPHHTRPAL-VYRDAGDRPYTTEELNTYHTRRLRRLHI 313
Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
+DPS+ IGF RD+DD++ + A G + V K P
Sbjct: 314 KDMDPSMLIGFLIRDEDDWNSWKRSVHNGAMIGTGKAIIHVFDKEKSP 361
>gi|238594668|ref|XP_002393548.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
gi|215461192|gb|EEB94478.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
Length = 142
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 65/144 (45%), Gaps = 40/144 (27%)
Query: 126 EALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI--------------------- 164
+ +G +G N EF DF+S++ ++YR F PI D+ +
Sbjct: 3 DMVGTTSGANWPPEFTADFTSKVWLTYRSHFTPIRDTNLADLPLPSIFWKKWGWGLPGLG 62
Query: 165 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 219
TSD GWGCMLR+ Q L+A AL+F LGR WR+P P E S
Sbjct: 63 GERGWTSDSGWGCMLRTGQSLLANALVFMWLGREWRRP-PAPMPTE-------------S 108
Query: 220 PFSIHNLLQAGKAYGLAAGSWVGP 243
S+H + AGK G G W GP
Sbjct: 109 YASVHRMALAGKELGKDVGQWFGP 132
>gi|157874465|ref|XP_001685715.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
gi|68128787|emb|CAJ08920.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
Length = 388
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 135/323 (41%), Gaps = 52/323 (16%)
Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
+G EF + + ++L SYR F P+ S T+D WGC++R++QMLV LL +
Sbjct: 54 DGTTEFVKVATKKLLYFSYRNCFPPL-PSGSTTDTHWGCLVRTTQMLVGTCLLRYHCKGA 112
Query: 194 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
+ P + + E E I LF D ++P IH + S + P
Sbjct: 113 YVLP--EADNAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161
Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHC---SVFSKGQADW 308
E G+ + +A GD P C + SRH +V +K +
Sbjct: 162 ------TEAGMAIAAALIAFRAQGGD-------VPFTFCCE--SRHIDEPAVMAK-LLEG 205
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
++L++P+VLG+ ++ +Y + GI GG AS Y+ G Q S ++DP
Sbjct: 206 QHVVLIIPVVLGIAPMSDQYELVMLKILDVKACCGIAGGFKQASLYMFGHQGRSVFFMDP 265
Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIR----HIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
H VQ A TS+ + + DP + +GFY +D+ F
Sbjct: 266 HYVQ----------NAYTSSRTVGTLEGSRGELRARRFDPCMVLGFYLHTPEDYRVF--- 312
Query: 425 ASKLAEESNGAPLFTVTQTHKKP 447
A +LA +N +F + ++P
Sbjct: 313 AEELA-VANSLVVFPLISFGRRP 334
>gi|384496645|gb|EIE87136.1| hypothetical protein RO3G_11847 [Rhizopus delemar RA 99-880]
Length = 224
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 5/101 (4%)
Query: 126 EALGDAAGNNGLA-----EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
E + + NN + +F DF+SR+ ++YR + PI S +D+GWGCMLRS Q L
Sbjct: 120 EEISEEEDNNNMYLRWPLDFYDDFTSRLWMTYRHNYPPIRPSNHKTDIGWGCMLRSGQSL 179
Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
+A L+ H LGR WR+ Q R+ + I L + PF
Sbjct: 180 LANTLIIHFLGRDWRRQTQNQTTRKELCIGFLMSYHQEHPF 220
>gi|167391747|ref|XP_001739914.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165896205|gb|EDR23684.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 325
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 68/314 (21%), Positives = 129/314 (41%), Gaps = 67/314 (21%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+++LG C+ +E L N+ N I+ +YR+ + +G++ ++SD GWGC
Sbjct: 36 VYILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSCLGNTYLSSDAGWGC 91
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------REYVEILHLFGDSETSPFSIHN 225
+R++QM++ L+ ++ +Q+ D + ++ L D +S SIHN
Sbjct: 92 AIRATQMMIVNTLVI------FKDQMQQIIDYNSFEHQQNKLQAKELIYDKISSLLSIHN 145
Query: 226 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
+ + K + +++ P C + +L + E ++
Sbjct: 146 IYIQEIIKVHNPTGTNFLPPSICCIAISSLLQ-----------------------EWDKK 182
Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
+ C+D +CS P L L+P ++ + + + T QS G
Sbjct: 183 LFNCITCLDHIP-NCSY---------PTLYLIPQIITFTEHQ-----LILDSLTLSQSRG 227
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
VGG ++ ++ G Q + +LDPH VQ + G Y + I L I
Sbjct: 228 FVGGIGESAIFVFGYQGTTLFFLDPHYVQNAGDFG----------YFNPPTYQIDLSLIS 277
Query: 404 PSLAIGFYCRDKDD 417
PS+ F C +++D
Sbjct: 278 PSIVFAFMCYNEND 291
>gi|119604523|gb|EAW84117.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 228
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 12/96 (12%)
Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
SRT S +S + +C + + E GD + F +DF SR+ ++YR+ F P+
Sbjct: 84 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133
Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRL--GRPWR 195
+TSD GWGCMLRS QM++AQ LL H L G+PWR
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGKPWR 169
>gi|145507452|ref|XP_001439681.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406876|emb|CAK72284.1| unnamed protein product [Paramecium tetraurelia]
Length = 312
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 125/298 (41%), Gaps = 59/298 (19%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
FNQ + I YR G K SD GWGC++R QM++A AL+ R+
Sbjct: 49 FNQKKDTLIWFCYRANIQFEG--KAISDQGWGCLVRVGQMMLANALM--------RECKI 98
Query: 200 KPFDREYVEILHLFGDSE----TSPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 253
++ I+HLF D++ +PFSI +++ A + G W GP M
Sbjct: 99 LAINKTKAMIIHLFDDNQEYSTIAPFSIQQIIKRASINLNMKIGDWYTGPKIM------- 151
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT---P 310
S ED + + I+ + + Q D + P
Sbjct: 152 ----------------------SVIEDLNKNNMNIKQINLVNFLEQCVLESQIDLSFKKP 189
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
LL++ ++G + + I L+ Q G + GK + +++G Q+ +AI++DPH
Sbjct: 190 HLLIIHAIIGDKSLGQLEIQNLQSHMQISQFAGAIIGKNNKAFFLIGFQKNNAIFMDPHY 249
Query: 371 VQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
VQ K ++E + ++ L ++ ++A+ FY + ++ +F + +KL
Sbjct: 250 VQES---NKIEMECN--------LKCQPLKQLNGTIALAFYISNYMEYLEFKKQVNKL 296
>gi|213514936|ref|NP_001135074.1| Cysteine protease ATG4A [Salmo salar]
gi|209738482|gb|ACI70110.1| Cysteine protease ATG4A [Salmo salar]
Length = 102
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 49/83 (59%), Gaps = 11/83 (13%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+W+LG C+ + ++ E D SR+ +YRK F PIG + +SD GWGC
Sbjct: 29 VWVLGECYNVKTEKT-----------ELLSDVHSRLWFTYRKKFSPIGGTGPSSDTGWGC 77
Query: 173 MLRSSQMLVAQALLFHRLGRPWR 195
MLR QM++AQAL+ +LGR WR
Sbjct: 78 MLRCGQMILAQALVCSQLGRAWR 100
>gi|403354729|gb|EJY76927.1| hypothetical protein OXYTRI_01553 [Oxytricha trifallax]
Length = 564
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 83/352 (23%), Positives = 139/352 (39%), Gaps = 73/352 (20%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 219
+T+D WGC +RS+QM++A AL Q F IL LF D+ S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQ------------QSTFMYPVNSILKLFDDNIRECTES 261
Query: 220 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 254
FSI N+ LQ G+ YG+++ + + + +C +E +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGFIVFETIM 321
Query: 255 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---------RHCSV--F 301
+ CQ Q L V++ + E DD + R +
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLTFSQMGLGCDRRINYDKL 380
Query: 302 SKGQADWTP---------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D P +L++V + LGL+K++P Y + PQ +G+VGGKP +
Sbjct: 381 PNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGGKPNKA 440
Query: 353 TYIVG------VQEESAIYLDPHDVQP-VINIGKD-DLEA-DTSTYHSDVIRHIHLDSID 403
Y G + ++LDPH VQ N+ DL+ + + +H+ R + + +D
Sbjct: 441 FYFFGHIIDQDTNKVKLMFLDPHKVQDYTYNVETSYDLDVKEQAKFHTTEARLLKIKELD 500
Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
L GF + DF+ F +E +F++ Q + N+S +
Sbjct: 501 TCLGFGFLIKSLQDFNQFKTLLESNIQEDLDHSIFSLYQHESELDNNSQMFS 552
>gi|403370248|gb|EJY84987.1| hypothetical protein OXYTRI_17161 [Oxytricha trifallax]
Length = 564
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 81/352 (23%), Positives = 136/352 (38%), Gaps = 73/352 (20%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 219
+T+D WGC +RS+QM++A AL Q F IL LF D+ S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQ------------QSTFMYPVNSILKLFDDNIRECTES 261
Query: 220 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 254
FSI N+ LQ G+ YG+++ + + + +C +E +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGYIVFETIM 321
Query: 255 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---------RHCSV--F 301
+ CQ Q L V++ + E DD + R +
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLTFSQMGLGCDRRINYDKL 380
Query: 302 SKGQADWTP---------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
D P +L++V + LGL+K++P Y + PQ +G+VGGKP +
Sbjct: 381 PNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGGKPNKA 440
Query: 353 TYIVG------VQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHSDVIRHIHLDSID 403
Y G + ++LDPH VQ + D + + +H+ R + + +D
Sbjct: 441 FYFFGHIIDLDTNKVKLMFLDPHKVQDYTYDVETSYDLDVKEQAKFHTTEARLLKIKELD 500
Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
L GF + DF+ F +E +F++ Q + N+S +
Sbjct: 501 TCLGFGFLIKSLQDFNQFKTLLESNIQEDLDHSIFSLYQHESELDNNSQMFS 552
>gi|154343631|ref|XP_001567761.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065093|emb|CAM43207.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 398
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 73/303 (24%), Positives = 121/303 (39%), Gaps = 42/303 (13%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
+ SYR F P+ + T+D WGC+LR++QML+ LL + + P + +
Sbjct: 74 LYFSYRSCFPPLPNGS-TTDTRWGCLLRTTQMLIGTCLLRYHCKGAYVLPEADNAELK-A 131
Query: 208 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 267
I LF D ++P IH + S + P E G+
Sbjct: 132 NISRLFMDVPSAPLGIHRAEDEAHKNCVKYASMLSP---------------TEAGMA--- 173
Query: 268 LPMAIYVVSGDEDGERGGAPVV--CIDDASRHCSVFSK---GQADWTPILLLVPLVLGLE 322
MA +++ +G G P C + +V +K GQ ++L++P+VLGL
Sbjct: 174 --MAAALIACHAEG--GDVPFTFSCENRNIDEPAVVAKLLEGQH----VILIIPVVLGLA 225
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
++ +Y + GI GG AS Y+ G Q ++DPH +Q D
Sbjct: 226 PLSDKYESMMLKILDMKACCGIAGGFKQASFYMFGHQGRKVFFMDPHYIQKAYT--SDKT 283
Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
D+ DP + +GFY +D+ F A +LA ++ ++
Sbjct: 284 AGTLYGARGDLTAR----KFDPCMVLGFYLHTLEDYRVF---AEELAVVNSLVTFPLISW 336
Query: 443 THK 445
+HK
Sbjct: 337 SHK 339
>gi|302915349|ref|XP_003051485.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256732424|gb|EEU45772.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 355
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/228 (30%), Positives = 103/228 (45%), Gaps = 36/228 (15%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRSS- 177
+A DE D N +F DF SRI ++YR F+ I S + TS + L+S
Sbjct: 99 LAYDEPTKD---NGWPPQFMADFESRIWMTYRSEFEAIPRSTNPQATSSLSLSMRLKSQL 155
Query: 178 ---QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
+ +++ RLGR WR+ Q P E EI+ LF D +P+S+H+ ++ G A
Sbjct: 156 GDQSPFSSDSMI--RLGRDWRR-GQSP--HEEREIIKLFADHPNAPYSLHSFVRHGASAC 210
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G G W GP A R +ALA + + +Y G P V D+
Sbjct: 211 GKYPGEWFGPSATARCIQALANSHESS---------LRVYST--------GDGPDVYEDE 253
Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
+ +G+A + P L+LV LG++K+ P Y L + PQS
Sbjct: 254 FMKIAK--PEGEA-FHPTLILVGTRLGIDKITPVYWEALIASLQMPQS 298
>gi|14043289|gb|AAH07639.1| ATG4D protein [Homo sapiens]
gi|16877152|gb|AAH16845.1| ATG4D protein [Homo sapiens]
gi|119604522|gb|EAW84116.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|325464017|gb|ADZ15779.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
construct]
Length = 141
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 62/110 (56%), Gaps = 7/110 (6%)
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y +G Q++ +YLDPH QP +++ + D + ++H R + +DP
Sbjct: 1 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDP 58
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHS 451
S +GFY D+ +F+ C+ +++ S+ P+FT+ + H + +HS
Sbjct: 59 SCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQ--DHS 106
>gi|324519641|gb|ADY47439.1| Cysteine protease ATG4C, partial [Ascaris suum]
Length = 282
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/250 (28%), Positives = 114/250 (45%), Gaps = 33/250 (13%)
Query: 20 DTPNRSLASVG-SELGSSESKSSKGSLLSSLFNSAFSVFETYS---ESSASEKKAVHNKS 75
D +R ++G E+ + SK S G+LLSS N+ S S S S
Sbjct: 12 DGSDREQLTIGDCEVCDTTSKYSVGALLSSAANATSSKISRASINLRSLLSGSATKKTND 71
Query: 76 NGWTAAVKRLVTAGSMRRIHERV---LGPSRTGISSST----SDIWLLGVCHKIAQ---- 124
+ + + + + S+R+ + V R IS S + +WLLG + ++
Sbjct: 72 DDVSTSESDIAISSSVRQKFDNVWFSFVYGRWRISRSKYKKKAPLWLLGEFYFTSRPDED 131
Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
DE + A F D+ SRI ++YR P+ S T+D GWGC LR+ QM++AQA
Sbjct: 132 DEVVFRA--------FAIDYYSRIWLTYRTELSPLPGSSKTTDCGWGCTLRTCQMMLAQA 183
Query: 185 LLFHRLGRPWRKPLQKPFDRE-----YVEILHLFGDSETSPFSIHNLLQAGKAYGL--AA 237
L+ LGR WR + +R + +I+ LFGD + ++ L++ K A
Sbjct: 184 LVVLHLGREWRFWGDEEANRYRCGFGHYDIVSLFGDHLDADLGLYRLMKIAKERNEHDAV 243
Query: 238 GSWVGPYAMC 247
G+W Y+ C
Sbjct: 244 GNW---YSAC 250
>gi|395756856|ref|XP_002834509.2| PREDICTED: cysteine protease ATG4D-like [Pongo abelii]
Length = 141
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 59/105 (56%), Gaps = 5/105 (4%)
Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
+GGKP S Y +G Q++ +YLDPH QP +++ + + + ++H R + +DP
Sbjct: 1 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE--SFHCTSPRKMAFAKMDP 58
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 446
S +GFY D+ +F+ C+ +++ S+ P+FT+ + H +
Sbjct: 59 SCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQ 103
>gi|336259147|ref|XP_003344378.1| hypothetical protein SMAC_08321 [Sordaria macrospora k-hell]
Length = 429
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 58/114 (50%), Gaps = 26/114 (22%)
Query: 140 FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 176
F DF SRI ++YR F DP + +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239
Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
Q L+A A+L RLGR WR+ + D E +I+ LF D +PFS+HN ++ G
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYG 290
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 46/76 (60%), Gaps = 3/76 (3%)
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSID 403
G+P +S Y +GVQ + YLDPH +P + +D + T H+ +R +H+D +D
Sbjct: 302 GRPSSSHYFIGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELDTCHTRRLRQLHIDDMD 361
Query: 404 PSLAIGFYCRDKDDFD 419
PS+ IGF +D+DD+D
Sbjct: 362 PSMLIGFLIKDEDDWD 377
>gi|238595999|ref|XP_002393933.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
gi|215462138|gb|EEB94863.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
Length = 158
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 46/68 (67%)
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
LGL+ VNP Y T+++ +TFPQS+GI GG+P +S Y VG Q ++ YLDPH +P + +
Sbjct: 1 LGLDGVNPIYYDTIKILYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPLR 60
Query: 379 KDDLEADT 386
LE ++
Sbjct: 61 PPTLEPES 68
>gi|403364614|gb|EJY82073.1| hypothetical protein OXYTRI_20407 [Oxytricha trifallax]
Length = 806
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 60/110 (54%), Gaps = 2/110 (1%)
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
+++++ + LGLE + Y L+ F+ Q +GI+GGKP + Y VG Q++ I+LDPH
Sbjct: 641 LMIIMTIRLGLENIEQDYHKALKACFSLRQCVGILGGKPNFALYFVGYQQDHMIFLDPHY 700
Query: 371 VQPVINIGKD--DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
VQ + + D E + + I ++S+DP + +GF ++ D
Sbjct: 701 VQQALTSDEQLKDQELKDTYQSQRSAKKIKMESLDPCIGVGFLIQNSKDL 750
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 13/72 (18%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV----EILHLFGDSETS 219
I SD GWGCM+R QM++A + L K LQ+ + + IL + D +
Sbjct: 393 INSDCGWGCMIRCQQMMLANSFL---------KLLQQNHNFHDILTHDSILSMILDQLDA 443
Query: 220 PFSIHNLLQAGK 231
PF IH + + G+
Sbjct: 444 PFGIHQITEEGR 455
>gi|440291586|gb|ELP84849.1| hypothetical protein EIN_284050 [Entamoeba invadens IP1]
Length = 352
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 117/278 (42%), Gaps = 57/278 (20%)
Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--- 208
YR F P+ ++ +TSD GWGC +RS+QMLVA A+ K FD V
Sbjct: 92 YRNNFQPLPNTTLTSDSGWGCTIRSTQMLVANAI---------GKLFTNDFDTGEVTDKM 142
Query: 209 ILHLFGD--SETSPFSIHNLL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
++ F D S PFSIHNL +A + S++ P A+ ++ + + + A G
Sbjct: 143 VIKFFLDFFSVECPFSIHNLFLTKAILQGNINGNSFLPPSAVAAAFVEINK-KLANPKFG 201
Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
+ L + V+++ P ++L+P+ + +
Sbjct: 202 MEIL------------------------TTTFTFRVYTQ------PTIVLIPISIP-DSF 230
Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
N + + + F+F G+VGG + Y G+ + ++LDPH V+ N +
Sbjct: 231 NDK----IAVIFSFYLFSGMVGGSGRKAFYFFGIHHDQLLFLDPHTVR---NTVINSCSF 283
Query: 385 DTSTYHSDV--IRHIHLDSIDPSLAIGFYCRDKDDFDD 420
D YH + ++ + +D S + F + + DD
Sbjct: 284 DPQEYHPIIGDVKALSYSLLDRSAVLAFVVTSQRELDD 321
>gi|323450755|gb|EGB06635.1| hypothetical protein AURANDRAFT_65498 [Aureococcus anophagefferens]
Length = 426
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 69/169 (40%), Gaps = 50/169 (29%)
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGA------------------------------- 351
++ PRY LR PQS G++GG+P A
Sbjct: 234 RLEPRYAEPLRAALRLPQSAGMLGGRPRANRIFNTTSMCASSDQNLQLCFENSTRAIDPS 293
Query: 352 -------STYIVGVQEESA---IY-LDPHDVQPVINIGKDDL---EADTSTYHSDVIRHI 397
+ + G+ +Y LDPH VQP + +G D A S D + +
Sbjct: 294 KSGRPRAALFFPGLAARDGGADVYGLDPHTVQPALAVGDDGALGPGAAASVAPRDA-KKL 352
Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
D++DPSLA+ FYC D+DDF DF RA L GAPLF V +
Sbjct: 353 AADALDPSLALAFYCADRDDFLDFVGRARALP----GAPLFEVVDAAPR 397
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 54/117 (46%), Gaps = 15/117 (12%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
+ +YR GF+ + T D GWGCMLRS+QML+ AL R G R +
Sbjct: 28 LWFTYRCGFEELAPYGFTDDAGWGCMLRSAQMLLGNAL--TRNGAAPR-----------L 74
Query: 208 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
LF D+ +++PF +HN + G Y + G W GP C L +R G
Sbjct: 75 ATAALFADAPGDSAPFGLHNFAKCGLRYDVLPGEWYGPGVACHVLRDLVDWRRNAPG 131
>gi|167394648|ref|XP_001741038.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165894548|gb|EDR22516.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 200
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 6/100 (6%)
Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 204
I I+YRK I + T+D GWGCM+RS QM++AQ L LG W+ + +
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96
Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
+++ I++LFGDS S FSIH L+ G+ G W GP
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP 136
>gi|320588376|gb|EFX00845.1| cysteine protease atg4 [Grosmannia clavigera kw1407]
Length = 348
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 62/124 (50%), Gaps = 28/124 (22%)
Query: 140 FNQDFSSRILISYRKGFDPI---------------------GD-SKITSDVGWGCMLRSS 177
F DF SR ++YR GF+PI GD S +SD GWGCM+RS
Sbjct: 120 FLDDFESRFWMTYRSGFEPIARSVDPKAPATLSFTMKLKALGDQSDFSSDSGWGCMIRSG 179
Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
Q L+A A+ + LGR WR ++ EI+ LF D +P+SIH + G +A
Sbjct: 180 QSLLANAMAMYELGRGWRLSDGGIAEK---EIISLFADDPRAPYSIHRFVGHG---AVAC 233
Query: 238 GSWV 241
GS++
Sbjct: 234 GSFL 237
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 3/61 (4%)
Query: 364 IYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
YLDPH +P + + E + + H+ +R +H+ +DPS+ IGF RD+DD+D+
Sbjct: 238 FYLDPHHTRPGLPFHEHPSEYTQEEVGSCHTRRLRRLHIREMDPSMLIGFLIRDEDDWDN 297
Query: 421 F 421
+
Sbjct: 298 W 298
>gi|67470848|ref|XP_651386.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|56468115|gb|EAL46000.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
Length = 325
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 71/316 (22%), Positives = 127/316 (40%), Gaps = 71/316 (22%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+ +LG C+ +E L N+ N I+ +YR+ + +G++ ++SD GWGC
Sbjct: 36 VHILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSYLGNTYLSSDAGWGC 91
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHN 225
+R++QM++ AL+ ++ +Q+ D E L D +S SIHN
Sbjct: 92 AIRATQMMIVNALVI------FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHN 145
Query: 226 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
+ Q K + +++ P C + +L + E
Sbjct: 146 IYIQQVIKTHNPKGTNFLPPSVCCIAISSLLQ--------------------------EW 179
Query: 284 GGAPVVCIDDASR--HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
P CI + CS P L L+P ++ + + + +L L+ QS
Sbjct: 180 DKKPFNCITCLNHIPSCS---------CPTLYLIPRIITFTE-HQLILDSLALS----QS 225
Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
G VGG ++ ++ G Q + +LDPH VQ + G + TY D+
Sbjct: 226 RGFVGGIGESAIFVFGCQGTTLFFLDPHYVQNAGDFGY----FNPPTYQIDI------SL 275
Query: 402 IDPSLAIGFYCRDKDD 417
I S+ F C ++++
Sbjct: 276 ISSSVVFAFMCYEENE 291
>gi|209880175|ref|XP_002141527.1| peptidase family C54 [Cryptosporidium muris RN66]
gi|209557133|gb|EEA07178.1| peptidase family C54, putative [Cryptosporidium muris RN66]
Length = 353
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 87/325 (26%), Positives = 130/325 (40%), Gaps = 47/325 (14%)
Query: 118 VCHKIAQ-DEALGDAAGNNGLAE----FNQDFSSRILISYRKGFDPIGD---------SK 163
+ + I Q D++L GN A+ F + F IL SYR F I S
Sbjct: 20 IIYNIDQHDDSLIFLFGNKYDADKYDSFLKSFHEIILFSYRYNFPTIRSEWDFSIETGSS 79
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
+T+D+GWGCMLR QM +A LL R K + IL F D E S FSI
Sbjct: 80 VTTDLGWGCMLRVIQMSLALGLL--------RYCKMKKYTYSLDYILQNFQDLEESLFSI 131
Query: 224 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
H ++ G + W GP + + L + P
Sbjct: 132 HQFVKVGCSIFNKKPKDWFGPTSASTIADYLVKNN-----------PFLFNNFRISSILF 180
Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN-PRYIPTLRLTF-TFPQ 340
+ G I ++ S ++ ++ T + + LG +N +Y ++ F PQ
Sbjct: 181 KDGT----IYKSNLFQSFKNEEYSENTLTFVWLCTRLGSSALNIQKYKDSIFSIFKNVPQ 236
Query: 341 SLGIVGGKPGAST--YIVGVQEESAIYLDPH-DVQPVINIGKDDLEADTSTYHSDVIRHI 397
+ I GG +S+ IVG E+ LDPH +Q I + E + V I
Sbjct: 237 LICIAGGHNCSSSALLIVGASEKFLYCLDPHIKLQEAFVIKNFNREE----FIQQVPMRI 292
Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFC 422
++++PSL+ F C D DDF+ C
Sbjct: 293 SWENLNPSLSFVFCCTDIDDFNHLC 317
>gi|124088531|ref|XP_001347134.1| Cysteine protease required for autophagy-like [Paramecium
tetraurelia strain d4-2]
gi|145474259|ref|XP_001423152.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|50057523|emb|CAH03507.1| Cysteine protease required for autophagy-like [Paramecium
tetraurelia]
gi|124390212|emb|CAK55754.1| unnamed protein product [Paramecium tetraurelia]
Length = 277
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 71/291 (24%), Positives = 122/291 (41%), Gaps = 59/291 (20%)
Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
F Q + I SYR G + SD GWGC++R QM+VA +L+
Sbjct: 14 FLQLKETFIWFSYRANIQYEGRA--ISDQGWGCLIRVGQMIVANSLIRESTNS------- 64
Query: 200 KPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 253
KP D + +I+ LF D++ +PFSI +++ A Y + G W GP MC + L
Sbjct: 65 KPNDLK-TKIICLFDDNQCFSTLAPFSIQQIIKRADLVYNIKIGDWYTGPKIMCLLEDLL 123
Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW---TP 310
Q A+T + I + C + + Q D P
Sbjct: 124 ---QSAKT------------------------IKQLKIINFLEQCVI--EKQIDLQFKQP 154
Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
LL++ ++G ++++ ++ L+ PQ G + GK + +++G Q I +DPH
Sbjct: 155 QLLIIHAIIGNKELDQYFVAELQKHMQIPQFAGAIVGKSKKAYFLIGYQNNQGIVMDPHY 214
Query: 371 VQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
VQ E++ +S ++ I L ++A+ +Y + D+
Sbjct: 215 VQ----------ESNLLQLNSQ-LKCIPLKEFSGTIALCYYISNSYDYQQL 254
>gi|326665689|ref|XP_002661113.2| PREDICTED: cysteine protease ATG4D-like, partial [Danio rerio]
Length = 149
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 11/82 (13%)
Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKITS 166
S +S + LLG ++++ + G+ E F + FSS + +SYR+GF P+ S ++S
Sbjct: 74 SKSSPVCLLGQSYQLS----------STGVRESFRRVFSSLLWMSYRRGFRPLDGSTLSS 123
Query: 167 DVGWGCMLRSSQMLVAQALLFH 188
D GWGCMLRS+QML+AQ LL H
Sbjct: 124 DAGWGCMLRSAQMLLAQGLLLH 145
>gi|237837057|ref|XP_002367826.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
gi|211965490|gb|EEB00686.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
Length = 3559
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 87/175 (49%), Gaps = 16/175 (9%)
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 343
GA V C+ D S + +G LLL PL L EK+NP Y+ +L P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHD-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDS 401
+V G+ + Y +G Q+++ +YLDPH +QP L A T ++ + + + +
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQPPAL----QLPAATPSFFAGSCWKVSDVAA 3079
Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVL 454
++PSLA+ F+ R++ A KL EE + + V + + P++ DVL
Sbjct: 3080 LNPSLAVAFFVRNERQLLGLAAALKKL-EEVDSFSMLQVVERRRPFSPLDLDDVL 3133
Score = 45.1 bits (105), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)
Query: 86 VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
+TA SM R+ V G S R IS D W G ++ D A + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139
Query: 139 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
E + + +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIAR---FTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196
Query: 182 AQALLFHRL 190
QAL H L
Sbjct: 1197 MQALRRHFL 1205
>gi|281208441|gb|EFA82617.1| hypothetical protein PPL_04309 [Polysphondylium pallidum PN500]
Length = 646
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 57/120 (47%), Gaps = 22/120 (18%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
+ EF +DFS++I +SYR+GF IGD+ +D GWG W+K
Sbjct: 409 INEFLEDFSNKIWMSYRQGFPYIGDTMFENDCGWGY---------------------WKK 447
Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALAR 255
Q + I+ +F D T+PFSIHN+ G+ + G G W P + + ++L
Sbjct: 448 SGQNEYPELLYNIVRMFLDKPTAPFSIHNIALHGQNHLGKNVGEWFAPSNITHAIKSLVN 507
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 26/99 (26%)
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
IVGGKP AS Y + Q+++ YLDPH VQ I+ + ++
Sbjct: 541 IVGGKPRASLYFIAAQDDNLFYLDPHTVQQAID-----------------------NEVE 577
Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
SL++ K+DF DF R+ KL +S PL+ + +
Sbjct: 578 FSLSVS--VETKEDFLDFLERSKKLVSKSE-FPLYNIAE 613
>gi|221481944|gb|EEE20310.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 3562
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 87/174 (50%), Gaps = 15/174 (8%)
Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 343
GA V C+ D S + +G LLL PL L EK+NP Y+ +L P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023
Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHD-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDS 401
+V G+ + Y +G Q+++ +YLDPH +QP L A T ++ + + + +
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQPPAL----QLPAATPSFFAGSCWKVSDVAA 3079
Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
++PSLA+ F+ R++ A KL EE + + V + ++P + D+ G
Sbjct: 3080 LNPSLAVAFFVRNERQLLGLAAALKKL-EEVDSFSMLQVVE-RRRPFSPLDLDG 3131
Score = 45.1 bits (105), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)
Query: 86 VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
+TA SM R+ V G S R IS D W G ++ D A + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139
Query: 139 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
E + + +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196
Query: 182 AQALLFHRL 190
QAL H L
Sbjct: 1197 MQALRRHFL 1205
>gi|307108757|gb|EFN56996.1| hypothetical protein CHLNCDRAFT_143632 [Chlorella variabilis]
Length = 538
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 41/79 (51%), Gaps = 5/79 (6%)
Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
S IYLDPH VQ D T+ + R + L SIDPSLA+GFYC ++ D
Sbjct: 331 SVIYLDPHQVQEAAACPDD-----WRTFWCETPRSMPLPSIDPSLALGFYCSSLGEYRDL 385
Query: 422 CARASKLAEESNGAPLFTV 440
C+R L S GAPL V
Sbjct: 386 CSRLEALERRSGGAPLVCV 404
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 62/129 (48%), Gaps = 26/129 (20%)
Query: 179 MLVAQALLFHRLGRPWR----------------KPLQKPFDREYVEILHLFGDS--ETSP 220
M++AQ L+ H LGR WR +L LF D+ E +P
Sbjct: 1 MILAQGLVRHVLGREWRWPEAARQQQAAAAPALAAAPAEAPPRLARLLELFWDTPAERNP 60
Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
FS+H+L +AG+A G+ AG W+GP+ MC++ A A R Q + + + V E
Sbjct: 61 FSLHSLCRAGQACGVVAGRWLGPWVMCKTLAAAAGAARR------QGVDLGLTVAVLAES 114
Query: 281 GERGGAPVV 289
G GGAP++
Sbjct: 115 G--GGAPLL 121
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 26/37 (70%)
Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
K+NPRYIP L PQS+GIVGG+P +S Y VG Q
Sbjct: 215 KLNPRYIPQLEAVLAMPQSIGIVGGRPSSSLYFVGFQ 251
>gi|221505025|gb|EEE30679.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 3554
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 77/148 (52%), Gaps = 9/148 (6%)
Query: 311 ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
LLL PL L EK+NP Y+ +L P SLG+V G+ + Y +G Q+++ +YLDPH
Sbjct: 2988 CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLGMVAGRGQQAFYCIGTQQKALLYLDPH 3047
Query: 370 D-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDSIDPSLAIGFYCRDKDDFDDFCARASK 427
+QP L A T ++ + + + +++PSLA+ F+ R++ A K
Sbjct: 3048 SGIQPPAL----QLPAATPSFFAGSCWKVSDVAALNPSLAVAFFVRNERQLLGLAAALKK 3103
Query: 428 LAEESNGAPLFTVTQTHKKPVNHSDVLG 455
L EE + + V + ++P + D+ G
Sbjct: 3104 L-EEVDSFSMLQVVE-RRRPFSPLDLDG 3129
Score = 45.4 bits (106), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)
Query: 86 VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
+TA SM R+ V G S R IS D W G ++ D A + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139
Query: 139 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
E + + +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196
Query: 182 AQALLFHRL 190
QAL H L
Sbjct: 1197 MQALRRHFL 1205
>gi|330846267|ref|XP_003294964.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
gi|325074459|gb|EGC28510.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
Length = 266
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 40/57 (70%), Gaps = 2/57 (3%)
Query: 134 NNGLAE--FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
NN + + F D S I SYRK F PI ++ IT+D+GWGCMLR+ QM++A+ALL H
Sbjct: 205 NNNIIQSNFLDDVRSLIWFSYRKDFPPIENTTITTDIGWGCMLRTGQMILARALLKH 261
>gi|193784751|dbj|BAG53904.1| unnamed protein product [Homo sapiens]
Length = 146
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 56/117 (47%), Gaps = 1/117 (0%)
Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
P SL G T ++ EE IYLDPH QP + D S + +
Sbjct: 4 PLSLSSAGSATHLPTCLILPGEE-LIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPCRMS 62
Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
+ +DPS+A+GF+C+ +DDF+D+C + KL+ P+F + + + DVL
Sbjct: 63 IAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 119
>gi|401403014|ref|XP_003881388.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325115800|emb|CBZ51355.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 3465
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 59/106 (55%), Gaps = 7/106 (6%)
Query: 311 ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
LLL PL L EK+NP Y+P+L P S+G+V G+ + Y +G Q+++ +YLDPH
Sbjct: 2955 CLLLFPLTLCSGEKINPVYVPSLLAYLELPWSVGMVAGRGQQAFYCIGTQQKALLYLDPH 3014
Query: 370 D-VQ-PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 413
+Q P + + A S + + + +++PSL++ F+ R
Sbjct: 3015 SGIQPPALQL----PSATPSFFAGSCWKIADVAALNPSLSVAFFVR 3056
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 17/70 (24%)
Query: 139 EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
+ +Q S +YR GF P+ G+ K I SDVGWGC +R++QML+
Sbjct: 942 QLSQTVGSIARFTYRSGFSPMYKCCGEKKRRAGGGFEREWIAINSDVGWGCTVRAAQMLL 1001
Query: 182 AQALLFHRLG 191
QAL H LG
Sbjct: 1002 MQALRRHFLG 1011
>gi|340508254|gb|EGR34000.1| peptidase family c54 protein, putative [Ichthyophthirius
multifiliis]
Length = 209
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 20/143 (13%)
Query: 142 QDFSSRILISYRKGFDPI----GDSKI---TSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
++F + I ++YR+ F P+ D KI SD GWGCM+R QM +A+ L H +
Sbjct: 24 ENFKNIIWMTYRRNFFPLLHNTKDHKIQNYISDTGWGCMVRVGQMALAEGLRHHLQQKGI 83
Query: 195 ---RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSW 250
++ +Q D + FGD +P+SI + + A K + L G W P +C
Sbjct: 84 YDNKRIIQAFLDND-------FGDDNIAPYSIQKICKIAYKEFQLVPGQWYSPVRICHVL 136
Query: 251 EALARCQRAETGLGCQSLPMAIY 273
L + L C+ L + ++
Sbjct: 137 SLLHN--DKKQILDCEDLKVGVF 157
>gi|46136685|ref|XP_390034.1| hypothetical protein FG09858.1 [Gibberella zeae PH-1]
Length = 360
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 96/228 (42%), Gaps = 35/228 (15%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRSS- 177
+A D+ + D +G F DF S+I ++YR F+PI S + TS + L+S
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158
Query: 178 --QMLVAQALLFHRLGR-PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
Q + + RLGR WR+ E +L F D +P+SIH+ ++ G A
Sbjct: 159 GDQSPFSSDTMV-RLGRGDWRRGESV---EEECRLLKDFADDPRAPYSIHSFVRHGASAC 214
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
G G W GP A R +AL + +I V S G P V D+
Sbjct: 215 GKYPGEWFGPSATARCIQALTNSHES-----------SIRVYST------GDGPDVYEDE 257
Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
+ D+ P L+LV LG++K+ P Y L PQS
Sbjct: 258 ---FMQIAKPPGEDFHPTLVLVGTRLGIDKITPVYWEALIAALQMPQS 302
>gi|307190834|gb|EFN74684.1| Cysteine protease ATG4B [Camponotus floridanus]
Length = 93
Score = 62.4 bits (150), Expect = 4e-07, Method: Composition-based stats.
Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 19/91 (20%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIG-- 160
I + IW+LG + N L E + +D S + +YRKGF PIG
Sbjct: 16 IPQTDEPIWILGKKY--------------NALKELDMIRRDIRSMLWFTYRKGFIPIGGC 61
Query: 161 DSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
+S TSD GWGCMLR QM++AQAL+ LG
Sbjct: 62 NSTFTSDKGWGCMLRCGQMVLAQALITLHLG 92
>gi|67482849|ref|XP_656724.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|56473943|gb|EAL51338.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449705841|gb|EMD45804.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 348
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 114/278 (41%), Gaps = 67/278 (24%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+S I YR F + ++ +TSD GWGC +R+ QML+A A++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANAII-------------KLFGS 131
Query: 205 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQR 258
+ + ++H F D S P+SIH+L G GS P++
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFTTQIIVSGNPNGSSFLPFS------------- 178
Query: 259 AETGLGCQSLPMAIYVVSG--DEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILL 313
IY ++ ++D R C V + ++ P ++
Sbjct: 179 -----------SVIYALTELVNKDFNRAF-----------ECHVITNKFLLKSINKPTIV 216
Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
+P + +K + R I F+F G+VGG + Y G+ ++LDPH V+P
Sbjct: 217 FIPFTIP-DKFDQRLIT----IFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRP 271
Query: 374 VI-NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
+I K D E D SD I+ + ++ ++ S+ F
Sbjct: 272 CASSIMKFD-EKDYIAKLSD-IKSLRINELERSVVFSF 307
>gi|167385012|ref|XP_001737178.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900129|gb|EDR26546.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 348
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 66/277 (23%), Positives = 116/277 (41%), Gaps = 65/277 (23%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+S I YR F + ++ + SD GWGC +R+ QML+A A++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLKSDGGWGCTIRACQMLLANAII-------------KLFGS 131
Query: 205 EYVE---ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
+ + ++H F D + P+SIH+L + +G+
Sbjct: 132 DNINRKTVIHWFLDFYNVECPYSIHSLFTTQI---IVSGN-------------------- 168
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR--HCSVFSKG---QADWTPILLL 314
G LP+++ + E + D +R C V + + P ++
Sbjct: 169 --PNGSSFLPLSVVTYALTELVNK---------DLNRIFECHVITNKFLLNSINKPTIIF 217
Query: 315 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
+P + ++ N R I F+F G+VGG + Y G+ + ++LDPH V+P
Sbjct: 218 IPFTIP-DEFNQRLIS----IFSFNLFAGMVGGCKQKAFYFFGIHHDQLLFLDPHFVRPC 272
Query: 375 I-NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
+I K D E D SD I+ +H++ ++ S+ F
Sbjct: 273 ASSIMKFD-EKDYIAKLSD-IKSLHINELERSVVFSF 307
>gi|145521674|ref|XP_001446691.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414171|emb|CAK79294.1| unnamed protein product [Paramecium tetraurelia]
Length = 473
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 80/340 (23%), Positives = 135/340 (39%), Gaps = 81/340 (23%)
Query: 148 ILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQAL-----LFHRLGRPWRKPLQK 200
I +YR+GF DS +T+D GWGC++R QM++A+ L F+++ PL +
Sbjct: 52 IRFTYRQGFQAYQCQDSALTTDSGWGCVIRVGQMMMAELLKRHLKCFYKVDLFSFPPLLQ 111
Query: 201 PFDREYVEILHLFGDSE--------TSP----FSIHNLLQ-AGKAYGLAAGSWVGPYAMC 247
++L +F D + + P FSI +++ A K +G G W P +
Sbjct: 112 -------DVLQMFKDDDDMESQKGFSKPSKYGFSIQKIMRVAYKEWGKKPGEWYSPNQIV 164
Query: 248 RS-WEALARCQRAET-GLG-------------------------CQ----SLPMAIYVVS 276
++ ++ L GLG CQ S+ + +
Sbjct: 165 QAIYKILQEINIPYCYGLGFVPFYESQIDLRAIFQEMCMMEDCVCQKKVFSIEQFLKSLE 224
Query: 277 GDEDGERGGAPV---------VCIDDASRHC-----SVFSK--GQADWTPILLLVPLVL- 319
E G+ V VC +D S ++ K Q + P+ + +L
Sbjct: 225 KLEIGKEEMVQVMHGNDSISDVCCEDQSEQNKKEIGNLLKKYICQKCFVPVRAVAVCLLS 284
Query: 320 --GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
G ++ NP Y+ +R G++GG+P + +IVG + + LDPH VQ
Sbjct: 285 RIGCDEPNPDYLQAIRQFMKKKYFAGMLGGRPKEANFIVGFVDNKFVVLDPHLVQE---- 340
Query: 378 GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
K + E + + ID SL + FY ++ DD
Sbjct: 341 AKMNPEEYIKSCFPGEALFMSDKEIDCSLGLVFYLKNLDD 380
>gi|407037690|gb|EKE38747.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 348
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/278 (20%), Positives = 109/278 (39%), Gaps = 67/278 (24%)
Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
+S I YR F + ++ +TSD GWGC +R+ QML+A +++ K F
Sbjct: 85 TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANSII-------------KLFGS 131
Query: 205 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
+ + ++H F D S P+SIH+L + +
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFT-------------------------TQIIVS 166
Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILLLVP 316
+ G LP ++ + + E + + + C + + + P ++ +P
Sbjct: 167 KNPNGSSFLPFSVVIYALTELVNKDF-------NRAFECHIITNKFLLNSINKPTIVFIP 219
Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP--- 373
+ E + L F+F G+VGG + Y G+ ++LDPH V+P
Sbjct: 220 FTIPDE-----FEQRLITIFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRPCAS 274
Query: 374 -VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
+I + D A S I+ + ++ ++ S+ F
Sbjct: 275 SIIKFDEKDYIAKLSD-----IKSLRINELERSVVFSF 307
>gi|412989956|emb|CCO20598.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Bathycoccus prasinos]
Length = 532
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 98/267 (36%), Gaps = 74/267 (27%)
Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
L G W+ P +C+ + + R ++ + L + DG GG P +
Sbjct: 234 ALCPGQWMAPSEICKRYGKMM--NRLDSFQNVRCLILG--------DGCGGGVPEFYPER 283
Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
K AD +L+LVPL G + +NP Y+ +L+ + + +GIVGGK AS
Sbjct: 284 VREEM----KTHAD-KDVLILVPLRCGASDAINPEYVKSLQKFLSVRECVGIVGGKKTAS 338
Query: 353 TYIVGVQE--------------------------------------ESAIYLDPHDVQPV 374
YIVG AIYLDPH +
Sbjct: 339 YYIVGFTSGKKSSDSYSGGEKEEEEEEKEEEENEEDEEEEEEEEEETRAIYLDPHVAKAY 398
Query: 375 INIGKDDLEADT-STYHSDV--------IRHIHLDSIDPSLAIGFYCRDKDDFDD----- 420
++ + + T S Y+ I + ++DPSL +GF + ++D+
Sbjct: 399 VSPRERSRDESTESAYYRSFFGSASEHGILYTPFHALDPSLVVGFLVGNDTNYDEMNNAS 458
Query: 421 ------FCARASKLAEESNGAPLFTVT 441
F + + ES PL TV
Sbjct: 459 SSSLDAFVDVLTNIERESGSTPLITVV 485
>gi|307201261|gb|EFN81130.1| Cysteine protease ATG4B [Harpegnathos saltator]
Length = 98
Score = 58.9 bits (141), Expect = 6e-06, Method: Composition-based stats.
Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 13/81 (16%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 170
+W+LG + ++ L +D S + +YRKGF PIG +S TSD GW
Sbjct: 23 VWILGRVYNAIKE-----------LDIIRRDIRSILWFTYRKGFVPIGGCNSTFTSDKGW 71
Query: 171 GCMLRSSQMLVAQALLFHRLG 191
GCMLR QM++A+AL+ LG
Sbjct: 72 GCMLRCGQMVLARALITLHLG 92
>gi|294953189|ref|XP_002787639.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239902663|gb|EER19435.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 341
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 18/101 (17%)
Query: 148 ILISYRKGFDPI----GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
IL +YR F+PI G + + SD GWGC +R++QML+AQA+ G+ D
Sbjct: 67 ILFTYRCAFEPIEGCVGPTSV-SDKGWGCAIRATQMLLAQAV--KMAGK----------D 113
Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGK-AYGLAAGSWVGP 243
+ +L LF DS +P S+H +++ G+ G+W GP
Sbjct: 114 ADDSVVLSLFLDSPQAPLSLHRMVKMGQEVLAKRPGTWFGP 154
>gi|407037201|gb|EKE38550.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 193
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 74/150 (49%), Gaps = 25/150 (16%)
Query: 95 HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRI 148
HE V P G S ++LGV K Q D+ L + L A F + S+
Sbjct: 25 HEDVQKPIFVGGCS----FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAAAFFR-ISNLF 79
Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-------LFHRLGRPWRKPLQKP 201
++YR G++ + +S +T+DVGWGC +R+ QM++A A+ + P+ P
Sbjct: 80 WMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYI-----P 134
Query: 202 FDREYVEILHLFGDS--ETSPFSIHNLLQA 229
+E + +L F DS T+P SIH++ ++
Sbjct: 135 TKQEVMNVLIPFIDSPNSTTPLSIHHVYES 164
>gi|224010768|ref|XP_002294341.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969836|gb|EED88175.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 658
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 69/179 (38%), Gaps = 52/179 (29%)
Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-----------------QEESAIY-LD 367
P Y TL +FPQS+G++GG P + + G QE Y LD
Sbjct: 418 PTYGSTLAKLLSFPQSVGMLGGTPRHALWFYGADEVDPPTFGDDGKALNGQECGGWYGLD 477
Query: 368 PHDVQ------PVINIGKDDLEADT------------------------STYHSDVIRHI 397
PH Q GKD++ +D +T H++ R I
Sbjct: 478 PHTTQVAPRGTRTTKYGKDEVSSDDIELNNCQWQVQLNDAYLRSLHFTPTTTHANHQRSI 537
Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES---NGAPLFTVTQTHKKPVNHSDV 453
L +DPS A+GFY RD DF F L++E N P VT T K P DV
Sbjct: 538 PLSKLDPSCALGFYIRDHSDFVQFTNAIDALSKEHCRPNKLPDI-VTVTEKTPNYEVDV 595
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 164 ITSDVGWGCMLRSSQMLVAQALLFH 188
+ SD GWGCMLRS+QM++AQ + H
Sbjct: 133 LKSDAGWGCMLRSAQMMMAQTVRMH 157
>gi|148682816|gb|EDL14763.1| mCG116861, isoform CRA_a [Mus musculus]
Length = 127
Score = 55.1 bits (131), Expect = 8e-05, Method: Composition-based stats.
Identities = 25/81 (30%), Positives = 48/81 (59%), Gaps = 1/81 (1%)
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
I+LDPH Q ++I + L D + + + + + ++DPS+A+GF+C+++ DFD++C+
Sbjct: 4 IFLDPHTTQTFVDIEESGLVDDQTFHCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCS 63
Query: 424 RASKLAEESNGAPLFTVTQTH 444
K + N +F + Q H
Sbjct: 64 LVQKEILKEN-LRMFELVQKH 83
>gi|407043625|gb|EKE42056.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 183
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/144 (25%), Positives = 66/144 (45%), Gaps = 19/144 (13%)
Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
+ +LG C+ +E L N+ N I+ +YR+ + +G++ ++SD GWGC
Sbjct: 36 VHILGNCYYPETNENLNHLTFNDA----NLKIHDLIVATYRQKYSYLGNTYLSSDAGWGC 91
Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHN 225
+R++QM+V AL+ ++ +Q+ D E L D +S SIHN
Sbjct: 92 AIRATQMMVVNALVI------FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHN 145
Query: 226 LL--QAGKAYGLAAGSWVGPYAMC 247
+ Q K + +++ P C
Sbjct: 146 IYIQQVIKTHNPKGTNFLPPSICC 169
>gi|78070455|gb|AAI07651.1| Atg4d protein [Rattus norvegicus]
Length = 168
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
+YLDPH QP +++ + + ++ +H R + +DPS +GFY ++ +F+ C+
Sbjct: 47 LYLDPHYCQPTVDVNQANFPLES--FHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCS 104
Query: 424 RASKLAEESNGA---PLFTVTQTHKK 446
++ S+ P+FTV + H +
Sbjct: 105 ELMRILSSSSVTERYPMFTVAEGHAQ 130
>gi|340500608|gb|EGR27474.1| peptidase family c54 protein, putative [Ichthyophthirius
multifiliis]
Length = 384
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 42/81 (51%), Gaps = 2/81 (2%)
Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
S+G++GG PG + Y +G+ + IYLDPH +Q K D TY I +
Sbjct: 223 SIGMIGGVPGKAYYFLGIIDNDFIYLDPHYIQEAHQNEKTVQNID--TYFCKFINRVSQK 280
Query: 401 SIDPSLAIGFYCRDKDDFDDF 421
++ SLA GFY ++ + + F
Sbjct: 281 KLESSLAFGFYIKNLQELEQF 301
>gi|156085180|ref|XP_001610073.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797325|gb|EDO06505.1| hypothetical protein BBOV_II005540 [Babesia bovis]
Length = 206
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 30/135 (22%)
Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFD-------------------PI-GDSKITS 166
A+ D L E +DF IL++YR+G P+ + I +
Sbjct: 17 AMCDQNPGPKLRERLKDF---ILLTYRRGLSIHLPRFYAGNIPKRFYGIWPLWQQTDIKT 73
Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
D GWGC LR++QM +A+AL R PL + IL LF D+ +PFS+ NL
Sbjct: 74 DRGWGCALRATQMALAEAL------RDVLSPLDN-VQEQRSRILQLFYDTTEAPFSLENL 126
Query: 227 LQAGKAYGLAAGSWV 241
+ A +G +W+
Sbjct: 127 VMADVEHGANVVAWI 141
>gi|392343434|ref|XP_003754884.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
norvegicus]
gi|392355909|ref|XP_003752169.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
norvegicus]
Length = 126
Score = 52.4 bits (124), Expect = 5e-04, Method: Composition-based stats.
Identities = 24/81 (29%), Positives = 47/81 (58%), Gaps = 1/81 (1%)
Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
I+LDPH Q ++ + L D + + + + + ++DPS+A+GF+C+++ DFD++C+
Sbjct: 4 IFLDPHTTQTFVDTEESGLVDDHTFHCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCS 63
Query: 424 RASKLAEESNGAPLFTVTQTH 444
K + N +F + Q H
Sbjct: 64 LVQKEILKEN-LRMFELVQKH 83
>gi|408392897|gb|EKJ72185.1| hypothetical protein FPSE_07642 [Fusarium pseudograminearum CS3096]
Length = 389
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSID 403
G+P +S Y +G Q YLDPH + + +D +E + ++ H+ +R IH+ +D
Sbjct: 262 GRPSSSHYFIGAQGSFLFYLDPHHTRVALPYREDPIEYTSEEIASCHTPRLRRIHVREMD 321
Query: 404 PSLAIGFYCRDKDDFDDF 421
PS+ IGF +++ D+ +
Sbjct: 322 PSMLIGFLIQNEVDWQEL 339
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 26/94 (27%)
Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 159
+A D+ + D +G F DF S+I ++YR F+PI
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158
Query: 160 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
GD S +SD GWGCM+RS Q ++A + RLGR
Sbjct: 159 GDQSPFSSDSGWGCMIRSGQSMLANTIAMVRLGR 192
>gi|328852767|gb|EGG01910.1| Hypothetical protein MELLADRAFT_123246 [Melampsora larici-populina
98AG31]
Length = 134
Score = 51.2 bits (121), Expect = 0.001, Method: Composition-based stats.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 310 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDRVNINPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|328852471|gb|EGG01617.1| Hypothetical protein MELLADRAFT_92005 [Melampsora larici-populina
98AG31]
Length = 134
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 310 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDRVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|407037202|gb|EKE38551.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 157
Score = 50.4 bits (119), Expect = 0.002, Method: Composition-based stats.
Identities = 40/137 (29%), Positives = 58/137 (42%), Gaps = 13/137 (9%)
Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
+ P L+ +P+VL N L+ + GIVGG + ++ G +YLD
Sbjct: 17 FKPTLVFLPIVL-----NHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGFHALQFLYLD 71
Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS-----LAIGFYCRDKDDFDDFC 422
PH VQP K E DT +Y + +IDP+ GF ++ + DDF
Sbjct: 72 PHIVQPSF---KSFTEIDTKSYSPIGSNRFSVHTIDPTKLDDFCTFGFLIKNLHEVDDFM 128
Query: 423 ARASKLAEESNGAPLFT 439
A + E SN L T
Sbjct: 129 KLAKDVFEISNDKELRT 145
>gi|328859149|gb|EGG08259.1| Hypothetical protein MELLADRAFT_123247 [Melampsora larici-populina
98AG31]
Length = 134
Score = 50.4 bits (119), Expect = 0.002, Method: Composition-based stats.
Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 310 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
P+L+L+ + GL++VN P Y T+ TFTFPQS+GI GG+P S ++
Sbjct: 83 PVLVLMNVQSGLDQVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130
>gi|84994978|ref|XP_952211.1| autophagy-related peptidase [Theileria annulata strain Ankara]
gi|65302372|emb|CAI74479.1| autophagy-related peptidase, putative [Theileria annulata]
Length = 350
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 81/356 (22%), Positives = 126/356 (35%), Gaps = 95/356 (26%)
Query: 128 LGDAAGNNGLAEFNQDFSSR--ILISYRKG-------------------FDPI----GDS 162
+ + N +N+ SR IL +YR G F P+ G
Sbjct: 1 MSNVVRENVNVLYNKRLESRFGILFTYRYGLEYKFPRPINFKRRRLFNIFSPLNLSNGIV 60
Query: 163 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------RKPLQKPFDREYVE----- 208
I SD GWGC+LRS+QM ++QALL LG + R P + D+ +
Sbjct: 61 TIDSDKGWGCVLRSTQMAISQALLNLVLGPEFSVEQLEIRNRTPRNRKIDQSLLNIDTFE 120
Query: 209 -----------------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW-VGPY--AMCR 248
IL F D + FSI+N + A GP A+C
Sbjct: 121 KLLNGLLDLDGVSAVSVILAQFYDDLNAVFSIYNFVIADYVLKTCTKFLHFGPTSAALC- 179
Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
A + +LP+ + D H S + +
Sbjct: 180 ----------ASKIINDLNLPIN----------------SIAFPDGVFHISDVREILEEK 213
Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP-GASTYIVGVQEESAIYLD 367
+L+ V L+++ +R F Q GI+GG S YI G + Y D
Sbjct: 214 RNLLVWVSNKKKLDRIER---ECVRSMFRLSQFNGIIGGNLFNKSYYIFGTTNKRLYYND 270
Query: 368 PHDV--QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
PH + ++ D+ D + S ++ ++ + S + F +D+DDF DF
Sbjct: 271 PHLYCKKAFRSLEYVDIFRD---FTSRRVKSMNWRYFNASFTLLFLFKDRDDFQDF 323
>gi|312381461|gb|EFR27207.1| hypothetical protein AND_06241 [Anopheles darlingi]
Length = 307
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 26/38 (68%)
Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 174
+ F +DF SRI ++YR+ F + DS TSD GWGCM+
Sbjct: 195 IEAFRRDFVSRIWMTYRREFQTMDDSNYTSDCGWGCMI 232
>gi|145500036|ref|XP_001436002.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403139|emb|CAK68605.1| unnamed protein product [Paramecium tetraurelia]
Length = 469
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 27/116 (23%)
Query: 148 ILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQAL-----LFHRLGRPWRKPLQK 200
I +YR+GF +S +T+D GWGC++R QM++A+ L F+ + PL +
Sbjct: 52 IRFTYREGFQAYQCQNSTLTTDSGWGCVIRVGQMMMAELLKRHLKCFYNVNLFQFPPLMQ 111
Query: 201 PFDREYVEILHLFGDSETSP------------FSIHNLLQ-AGKAYGLAAGSWVGP 243
E+L LF D + FSI +++ A + +G G W P
Sbjct: 112 -------EVLQLFKDDDEMESLKVQGKPSKYGFSIQKIMRIAYEEWGKKPGEWYSP 160
Score = 45.8 bits (107), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 51/104 (49%), Gaps = 12/104 (11%)
Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
+G ++ NP YI +R G++GG+P + +IVG ++ + LDPH VQ N+
Sbjct: 286 IGCDEPNPDYIQAIRQFMKKKYFAGLLGGRPREANFIVGFVDDKFVVLDPHLVQQA-NMN 344
Query: 379 KDDLEADT----STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
++ + + SD ID SL + FY ++++D
Sbjct: 345 PEEYVKSCFPGEALFMSD-------KEIDCSLGLVFYLKNEEDL 381
>gi|307108756|gb|EFN56995.1| hypothetical protein CHLNCDRAFT_143631 [Chlorella variabilis]
Length = 137
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 9/113 (7%)
Query: 66 SEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGIS-SSTSDIWLLGVCHKIAQ 124
+E AV S G + + L A + ++H+ + +G S + + +WLLG C+
Sbjct: 5 AELSAVDKLSLGLSRSYYALARALRLNKLHDLLA----SGASITPDAPVWLLGQCYSCPP 60
Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI--GDSKITSDVGWGCMLR 175
+ +A LA + S +SYR GF I G + + SD GWGC LR
Sbjct: 61 GAS--EAQQEEALARMLHHYQSIPWMSYRTGFTSIAAGSAHLQSDAGWGCTLR 111
>gi|294954843|ref|XP_002788322.1| hypothetical protein Pmar_PMAR026708 [Perkinsus marinus ATCC 50983]
gi|239903634|gb|EER20118.1| hypothetical protein Pmar_PMAR026708 [Perkinsus marinus ATCC 50983]
Length = 345
Score = 45.1 bits (105), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 27/113 (23%), Positives = 52/113 (46%), Gaps = 26/113 (23%)
Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESA-------------------IYLDPHDVQPVIN 376
P +G++GG+ + Y+VGV E+ + +DPH VQ +
Sbjct: 207 LKLPWCVGVIGGQSTRAHYVVGVAEKDTYLQSSTWGRSGYRQTRTDLLSIDPHFVQSAV- 265
Query: 377 IGKDDLEADTSTY-HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
+EA + ++ +SD + ++PSL +GFY +D+ D ++ A ++
Sbjct: 266 -----VEAQSISFKNSDEPSRLQPTKLNPSLGVGFYVKDETDLEELSAELDRV 313
>gi|167386236|ref|XP_001737678.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165899448|gb|EDR26037.1| hypothetical protein EDI_014170 [Entamoeba dispar SAW760]
Length = 346
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 20/94 (21%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 190
NN +A + S+ ++YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 57 TSNNNIA---KHLSTMFRVTYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111
Query: 191 GRP-------WRKPLQKPF-------DREYVEIL 210
P + +QK F REYV+++
Sbjct: 112 QEPNPGFGEDAAEKVQKNFIIHSMEERREYVQLI 145
>gi|195350255|ref|XP_002041656.1| GM16787 [Drosophila sechellia]
gi|194123429|gb|EDW45472.1| GM16787 [Drosophila sechellia]
Length = 135
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 11/67 (16%)
Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
I +++W+LG + Q+ L +D SR+ +YR GF P+G+ ++T
Sbjct: 43 IPRRNTNVWVLGKKYNAIQELEL-----------IRRDIQSRLWCTYRHGFSPLGEVQLT 91
Query: 166 SDVGWGC 172
+D GWG
Sbjct: 92 TDKGWGL 98
>gi|389585790|dbj|GAB68520.1| peptidase, partial [Plasmodium cynomolgi strain B]
Length = 894
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
F+ R Y KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 418 FTKRKRTKYTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 466
>gi|294877403|ref|XP_002767983.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
gi|239870083|gb|EER00701.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
Length = 133
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 30/42 (71%), Gaps = 5/42 (11%)
Query: 148 ILISYRKGFDPI----GDSKITSDVGWGCMLRSSQMLVAQAL 185
IL +YR F+PI G + + SD GWGC +R++QML+AQA+
Sbjct: 67 ILFTYRCAFEPIEGCVGPTSV-SDKGWGCAIRATQMLLAQAV 107
>gi|183234005|ref|XP_652043.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169801304|gb|EAL46674.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449707706|gb|EMD47317.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 346
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 6/63 (9%)
Query: 132 AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 190
NN +A + S+ I+YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 57 TSNNNIA---KHLSTLFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111
Query: 191 GRP 193
P
Sbjct: 112 QEP 114
>gi|407038566|gb|EKE39191.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 346
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 34/61 (55%), Gaps = 6/61 (9%)
Query: 134 NNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
NN +A + S+ I+YR GF + +T+D GWGC LRS QML +L+ RL
Sbjct: 59 NNNVA---KHLSTMFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RLQE 113
Query: 193 P 193
P
Sbjct: 114 P 114
>gi|221060360|ref|XP_002260825.1| peptidase [Plasmodium knowlesi strain H]
gi|193810899|emb|CAQ42797.1| peptidase, putative [Plasmodium knowlesi strain H]
Length = 1001
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
+F++R + KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 464 NFTNRRRTKHTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 513
>gi|156102174|ref|XP_001616780.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805654|gb|EDL47053.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1007
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
F+ R Y KG D I S SD GWGCM+R QM++A L+ +++ + +
Sbjct: 468 FAKRKRDRYSKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 516
>gi|149030140|gb|EDL85217.1| rCG23129 [Rattus norvegicus]
Length = 90
Score = 41.6 bits (96), Expect = 1.0, Method: Composition-based stats.
Identities = 16/44 (36%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
++DPS+A+GF+C+++ DFD++C+ K + N +F + Q H
Sbjct: 5 NLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 47
>gi|390457789|ref|XP_003732004.1| PREDICTED: cysteine protease ATG4B-like [Callithrix jacchus]
Length = 102
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 29/51 (56%)
Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
S+++GF+C+ +DDF+D C + KL+ P+F + + + DVL
Sbjct: 25 SISVGFFCKTEDDFNDRCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 75
>gi|426336111|ref|XP_004029547.1| PREDICTED: uncharacterized protein LOC101129491 [Gorilla gorilla
gorilla]
Length = 351
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
+R + +I+ F D +PF +H L++ G++ G AG W GP
Sbjct: 51 ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP 91
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,942,788,756
Number of Sequences: 23463169
Number of extensions: 340944330
Number of successful extensions: 731588
Number of sequences better than 100.0: 787
Number of HSP's better than 100.0 without gapping: 759
Number of HSP's successfully gapped in prelim test: 28
Number of HSP's that attempted gapping in prelim test: 728680
Number of HSP's gapped (non-prelim): 1371
length of query: 486
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 339
effective length of database: 8,910,109,524
effective search space: 3020527128636
effective search space used: 3020527128636
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 79 (35.0 bits)