BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019849
(335 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 211/335 (62%), Positives = 258/335 (77%), Gaps = 12/335 (3%)
Query: 1 MEEKGKR-------VMGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPI 53
ME+K KR + +++ ATF+G FS A+Q + KKSTQ + FGS+ V P+
Sbjct: 1 MEKKRKRRRFSSLLMQSTFFIVLAATFEGSFSAASQRCTLKKSTQHSC---FGSSLVLPV 57
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV 113
GNVYPLGYYSV+L IGNPPKL+ELDIDTGSDLTWVQC+APCTGCT P LY P+NNL+
Sbjct: 58 FGNVYPLGYYSVSLYIGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLHHLYKPRNNLL 117
Query: 114 ACNDPFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRL 172
+C DP CSA +C+ A DQCDYE+ YAD GSSLGVLVTD+FPLRL NGS L P++
Sbjct: 118 SCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEGSSLGVLVTDYFPLRLMNGSFLRPKM 177
Query: 173 IFGCGYNQRNPGP-KPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFL 231
FGCGY+Q++PGP PPPT GVLGLG GK SI+SQLQ+LG+ NV+GHCLS +GGG+LF
Sbjct: 178 TFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSRKGGGFLFF 237
Query: 232 GHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQA 291
G D VPS GI+W PMS+ L+K+Y+SGPAELL+GGK TG K + IFDSGSSYTYFN+Q
Sbjct: 238 GQDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTGTKAEEFIFDSGSSYTYFNAQV 297
Query: 292 YKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWK 326
Y++TL+L+RK+L GKPL D EEKAL +CWKGT +
Sbjct: 298 YQSTLNLIRKELSGKPLRDAPEEKALAICWKGTKR 332
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 207/330 (62%), Positives = 248/330 (75%), Gaps = 12/330 (3%)
Query: 1 MEEKGKRVMGLLVL------LMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPIT 54
ME+K KR++ L+ + +M A F+GCFS A+Q P K KST + A R GS+ F +T
Sbjct: 1 MEKKRKRIVSLVTMTLLFFIVMAANFRGCFSAASQTPIKGKST-TPANDRVGSSVFFRVT 59
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVA 114
GNVYP G+YSV L IGNPPK ++LDIDTGSDLTWVQC+APC GCT P + LY PKNN V
Sbjct: 60 GNVYPTGHYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDKLYKPKNNRVP 119
Query: 115 CNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
C C A +N C+ +QCDYEV YAD GSSLGVL++D+FPLRL NGSLL PR+
Sbjct: 120 CASSLCQAI---QNNNCDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQPRIA 176
Query: 174 FGCGYNQRNPGP-KPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLG 232
FGCGY+Q+ GP PP TAG+LGLG GKASILSQL++LG+T+NV+GHC S GG+LF G
Sbjct: 177 FGCGYDQKYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRVTGGFLFFG 236
Query: 233 HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAY 292
L+P SGI WTPM R + YSSGPAELLFGGK TGIKGLQ+IFDSGSSYTYFN+Q Y
Sbjct: 237 DHLLPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVY 296
Query: 293 KTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
++ L+L+RKDL G PL+D EEKAL VCWK
Sbjct: 297 QSILNLVRKDLSGMPLKDAPEEKALAVCWK 326
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/316 (63%), Positives = 243/316 (76%), Gaps = 4/316 (1%)
Query: 9 MGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLK 68
M L +++ A QGCFS A+Q P K +S+ + A R GS+ F +TGNVYP GYYSV L
Sbjct: 1 MFLFFIVISADLQGCFSAASQTPIKGESS-TPANDRVGSSVFFRVTGNVYPTGYYSVILN 59
Query: 69 IGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPEN 128
IGNPPK ++ DIDTGSDLTWVQC+APC GCT P + LY PKNNLV C++ C A EN
Sbjct: 60 IGNPPKAFDFDIDTGSDLTWVQCDAPCKGCTKPRDKLYKPKNNLVPCSNSLCQAVSTGEN 119
Query: 129 IRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKP 187
C+A +DQCDYE+ YAD GSS+GVL++D FPLRL+NG+LL P++ FGCGY+Q++ GP P
Sbjct: 120 YHCDAPDDQCDYEIEYADLGSSIGVLLSDSFPLRLSNGTLLQPKMAFGCGYDQKHLGPHP 179
Query: 188 PP-TAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPM 246
PP TAG+LGLG GK SILSQL++LG+T+NV+GHC S GG+LF G L PSS I WTPM
Sbjct: 180 PPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFSRARGGFLFFGDHLFPSSRITWTPM 239
Query: 247 SRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
R + YSSGPAELLFGGK TGIKGLQ+IFDSGSSYTYFN+Q Y++ L+L+RKDL GK
Sbjct: 240 LRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSILNLVRKDLAGK 299
Query: 307 PLEDTAEEKALPVCWK 322
PL+D A EK L VCWK
Sbjct: 300 PLKD-APEKELAVCWK 314
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 196/328 (59%), Positives = 239/328 (72%), Gaps = 5/328 (1%)
Query: 1 MEEKGKRVMGLLVLLMFATFQGCF--SEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVY 58
M+ K K + L LL F F F S + QP + KK + S HR S+AVF + GNVY
Sbjct: 1 MDVKMKGITALHTLLQFLLFSAIFPLSFSAQPRNAKKLS-SDNHHRLSSSAVFKVQGNVY 59
Query: 59 PLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDP 118
PLG+Y+V+L IG PPKLY+LDID+GSDLTWVQC+APC GCT P + LY P +NLV C D
Sbjct: 60 PLGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRDQLYKPNHNLVQCVDQ 119
Query: 119 FCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
CS L C + +DQCDYEV YADHGSSLGVLV D+ P + TNGS++ PR+ FGCG
Sbjct: 120 LCSEVQLSMEYTCASPDDQCDYEVEYADHGSSLGVLVRDYIPFQFTNGSVVRPRVAFGCG 179
Query: 178 YNQRNPGPK-PPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLV 236
Y+Q+ G PP T+GVLGLG G+ASILSQL SLGL NV+GHCLS RGGG+LF G D +
Sbjct: 180 YDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHNVVGHCLSARGGGFLFFGDDFI 239
Query: 237 PSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTL 296
PSSGI WT M EKHYSSGPAEL+F GK+T +KGL++IFDSGSSYTYFNSQAY+ +
Sbjct: 240 PSSGIVWTSMLPSSSEKHYSSGPAELVFNGKATVVKGLELIFDSGSSYTYFNSQAYQAVV 299
Query: 297 DLMRKDLKGKPLEDTAEEKALPVCWKGT 324
DL+ +DLKGK L+ ++ +LP+CWKG
Sbjct: 300 DLVTQDLKGKQLKRATDDPSLPICWKGA 327
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 193/326 (59%), Positives = 235/326 (72%), Gaps = 7/326 (2%)
Query: 6 KRVMGLLVLLMFATFQGCF--SEANQPPSKKKST---QSTAAHRFGSTAVFPITGNVYPL 60
K ++ L LL F F S + QP + KK HR S+AVF + GNVYPL
Sbjct: 2 KGIIALHTLLPFLLFSAILPLSFSAQPRNAKKPKTPYSDNNHHRLSSSAVFKLQGNVYPL 61
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
G+Y+V+L IG PPKLY+LDID+GSDLTWVQC+APC GCT P + LY P +NLV C D C
Sbjct: 62 GHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRDQLYKPNHNLVQCVDQLC 121
Query: 121 SAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYN 179
S HL C + +D CDYEV YADHGSSLGVLV D+ P + TNGS++ PR+ FGCGY+
Sbjct: 122 SEVHLSMAYNCPSPDDPCDYEVEYADHGSSLGVLVRDYIPFQFTNGSVVRPRVAFGCGYD 181
Query: 180 QRNPGPK-PPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPS 238
Q+ G PP T+GVLGLG G+ASILSQL SLGL RNV+GHCLS +GGG+LF G D +PS
Sbjct: 182 QKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIRNVVGHCLSAQGGGFLFFGDDFIPS 241
Query: 239 SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDL 298
SGI WT M EKHYSSGPAEL+F GK+T +KGL++IFDSGSSYTYFNSQAY+ +DL
Sbjct: 242 SGIVWTSMLSSSSEKHYSSGPAELVFNGKATAVKGLELIFDSGSSYTYFNSQAYQAVVDL 301
Query: 299 MRKDLKGKPLEDTAEEKALPVCWKGT 324
+ KDLKGK L+ ++ +LP+CWKG
Sbjct: 302 VTKDLKGKQLKRATDDPSLPICWKGA 327
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 175/295 (59%), Positives = 221/295 (74%), Gaps = 4/295 (1%)
Query: 31 PSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQ 90
P K S T S+ VFP++GNV+PLGYYSV ++IG+PPK ++ DIDTGSDLTWVQ
Sbjct: 17 PLSKSSIFKTFIKSSPSSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQ 76
Query: 91 CNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRC-EANDQCDYEVLYADHGSS 149
C+APC+GCTLPP Y PK N++ C++P C+A H P C +QCDYEV YAD GSS
Sbjct: 77 CDAPCSGCTLPPNLQYKPKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGSS 136
Query: 150 LGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPP-TAGVLGLGLGKASILSQLQ 208
+G LVTD FPL+L NGS + P + FGCGY+Q P PPP TAGVLGLG GK +L+QL
Sbjct: 137 MGALVTDQFPLKLVNGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLV 196
Query: 209 SLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS 268
S GLTRNV+GHCLS +GGG+LF G +LVPS G+AWTP+ + HY++GPA+LLF GK
Sbjct: 197 SAGLTRNVVGHCLSSKGGGFLFFGDNLVPSIGVAWTPLLSQ--DNHYTTGPADLLFNGKP 254
Query: 269 TGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
TG+KGL++IFD+GSSYTYFNS+AY+T ++L+ DLK PL+ E+K LP+CWKG
Sbjct: 255 TGLKGLKLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKG 309
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 368 bits (944), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 176/291 (60%), Positives = 219/291 (75%), Gaps = 3/291 (1%)
Query: 36 STQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC 95
S + + R S+ VFP+ GNVYPLGYYSV++ IG + +E DID+GSDLTWVQC+APC
Sbjct: 28 SLRKKNSDRLLSSVVFPLKGNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPC 87
Query: 96 TGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLV 154
T CT P E LY P NN + C +P C++ H N C+ A+DQC YE+ YADHGSSLGVLV
Sbjct: 88 THCTKPREQLYKPNNNALNCFEPLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLV 147
Query: 155 TDHFPLRLTNGSLLGPRLIFGCGYNQRNPGP-KPPPTAGVLGLGLGKASILSQLQSLGLT 213
DH PL+LTNGSL PR+ FGCGY+ + P PPTAGVLGLG G+ S +SQL S+G+
Sbjct: 148 NDHVPLKLTNGSLAAPRIAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVV 207
Query: 214 RNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG 273
RNV+GHCLS GG +LF G + VPSSG+ WT MS + + +YSSGPAE+ FGGK+TGIK
Sbjct: 208 RNVVGHCLSDEGG-FLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKD 266
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
L ++FDSGSSYTYFNSQAY + L L++ +L+GKPLED E+K+LPVCWKGT
Sbjct: 267 LTLVFDSGSSYTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGT 317
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 174/279 (62%), Positives = 212/279 (75%), Gaps = 4/279 (1%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S+ V ++GNV+PLGYYSV L+IGNPPK +E DIDTGSD+TWVQC+APCTGC LPP+ Y
Sbjct: 38 SSVVLLLSGNVFPLGYYSVLLQIGNPPKAFEFDIDTGSDITWVQCDAPCTGCNLPPKLQY 97
Query: 107 HPKNNLVACNDPFCSAFHLPENIRC-EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG 165
PK N V C+DP C A H P N +C +QCDYEV YAD GSS+G LV D FP +L NG
Sbjct: 98 KPKGNTVPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPFKLLNG 157
Query: 166 SLLGPRLIFGCGYNQRNPGPKPPP-TAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR 224
S + PRL FGCGY+Q P PPP TAGVLGLG GK +L+QL S GLTRNV+GHCLS +
Sbjct: 158 SAMQPRLAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSK 217
Query: 225 GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSY 284
GGGYLF G L+PS G+AWTP+ + HY++GPAELLF GK TG+KGL++IFD+GSSY
Sbjct: 218 GGGYLFFGDTLIPSLGVAWTPLLPP--DNHYTTGPAELLFNGKPTGLKGLKLIFDTGSSY 275
Query: 285 TYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
TYFNS+ Y+T ++L+ DLK PL+ E+K LP+CWKG
Sbjct: 276 TYFNSKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICWKG 314
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 175/291 (60%), Positives = 218/291 (74%), Gaps = 3/291 (1%)
Query: 36 STQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC 95
S + + R S+ VFP+ GNVYPLGYYSV++ IG + +E DID+GSDLTWVQC+APC
Sbjct: 28 SLRKKNSDRLLSSVVFPLKGNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPC 87
Query: 96 TGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLV 154
T CT P E LY P NN + C +P C++ H N C+ A+DQC YE+ YADHGSSLGVLV
Sbjct: 88 THCTKPREQLYKPNNNALNCFEPLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLV 147
Query: 155 TDHFPLRLTNGSLLGPRLIFGCGYNQRNPGP-KPPPTAGVLGLGLGKASILSQLQSLGLT 213
DH PL+LTNGSL PR+ FGCGY+ + P PPTAGVLGLG G+ S +SQL S+G+
Sbjct: 148 NDHVPLKLTNGSLAAPRIAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVV 207
Query: 214 RNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG 273
RNV+GHCLS GG +LF G + VPSSG+ WT MS + + +YSSGPAE+ F GK+TGIK
Sbjct: 208 RNVVGHCLSDEGG-FLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKD 266
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
L ++FDSGSSYTYFNSQAY + L L++ +L+GKPLED E+K+LPVCWKGT
Sbjct: 267 LTLVFDSGSSYTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGT 317
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 178/301 (59%), Positives = 226/301 (75%), Gaps = 8/301 (2%)
Query: 31 PSKKKSTQSTAAH------RFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGS 84
PS + S+A R GS+ VFP++GNVYPLGYY V L IGNPPKL++LDIDTGS
Sbjct: 30 PSDATTKDSSAQQVKLQNRRLGSSVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGS 89
Query: 85 DLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLY 143
DLTWVQC+APC GCT P Y P +N + C+ CS L +N C+ DQCDYE+ Y
Sbjct: 90 DLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHLLCSGLDLTQNRPCDDPEDQCDYEIGY 149
Query: 144 ADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPP-TAGVLGLGLGKAS 202
+DH SS+G LVTD FPL+L NGS++ P L FGCGY+Q+NPGP PPP TAG+LGLG GK
Sbjct: 150 SDHASSIGALVTDEFPLKLANGSIMNPHLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVG 209
Query: 203 ILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAEL 262
I +QL+SLG+T+NV+ HCLS G G+L +G +LVPSSG+ WT ++ + K+Y +GPAEL
Sbjct: 210 ISTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSASKNYMTGPAEL 269
Query: 263 LFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
LF K+TG+KG+ ++FDSGSSYTYFN++AY+ LDL+RKDL GKPL DT ++K+LPVCWK
Sbjct: 270 LFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWK 329
Query: 323 G 323
G
Sbjct: 330 G 330
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 173/277 (62%), Positives = 206/277 (74%), Gaps = 2/277 (0%)
Query: 49 AVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP 108
F I GNVYPLGYY+V+L IGNPPK+Y+LDIDTGSDLTWVQC+APC GCT+P LY P
Sbjct: 50 VAFQIKGNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRNRLYKP 109
Query: 109 KNNLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
NLV C DP C A N C N+QCDYEV YAD GSSLGVL+ D+ PL+ TNGSL
Sbjct: 110 NGNLVKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSL 169
Query: 168 LGPRLIFGCGYNQRNPGPKPPP-TAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG 226
P L FGCGY+Q++ G P TAGVLGLG GK SILSQL SLGL RNV+GHCLS RGG
Sbjct: 170 ARPILAFGCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCLSERGG 229
Query: 227 GYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTY 286
G+LF G LVP SG+ WTP+ + +HY +GPA+L F K T +KGLQ+IFDSGSSYTY
Sbjct: 230 GFLFFGDQLVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKGLQLIFDSGSSYTY 289
Query: 287 FNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
FNS+A+K ++L+ DL+GKPL E+ +LP+CW+G
Sbjct: 290 FNSKAHKALVNLVTNDLRGKPLSRATEDSSLPICWRG 326
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 180/311 (57%), Positives = 232/311 (74%), Gaps = 7/311 (2%)
Query: 15 LMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPK 74
++ A FQ SEA + S + Q+ R ST VFP++GNVYPLGYY V L IGNPPK
Sbjct: 24 ILCARFQ--TSEATKDSSAQVKLQN---RRLSSTVVFPVSGNVYPLGYYYVLLNIGNPPK 78
Query: 75 LYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRC-EA 133
L++LDIDTGSDLTWVQC+APC GCT P Y P +N + C+ CS LP++ C +
Sbjct: 79 LFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHILCSGLDLPQDRPCADP 138
Query: 134 NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPP-TAG 192
DQCDYE+ Y+DH SS+G LVTD PL+L NGS++ RL FGCGY+Q+NPGP PPP TAG
Sbjct: 139 EDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAG 198
Query: 193 VLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLE 252
+LGLG GK + +QL+SLG+T+NV+ HCLS G G+L +G +LVPSSG+ WT ++ +
Sbjct: 199 ILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPS 258
Query: 253 KHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTA 312
K+Y +GPAELLF K+TG+KG+ ++FDSGSSYTYFN++AY+ LDL+RKDL GKPL DT
Sbjct: 259 KNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTK 318
Query: 313 EEKALPVCWKG 323
++K+LPVCWKG
Sbjct: 319 DDKSLPVCWKG 329
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 358 bits (920), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 180/311 (57%), Positives = 232/311 (74%), Gaps = 7/311 (2%)
Query: 15 LMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPK 74
++ A FQ SEA + S + Q+ R ST VFP++GNVYPLGYY V L IGNPPK
Sbjct: 24 ILCARFQT--SEATKDSSAQVKLQN---RRLSSTVVFPVSGNVYPLGYYYVLLNIGNPPK 78
Query: 75 LYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRC-EA 133
L++LDIDTGSDLTWVQC+APC GCT P Y P +N + C+ CS LP++ C +
Sbjct: 79 LFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHILCSGLDLPQDRPCADP 138
Query: 134 NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPP-TAG 192
DQCDYE+ Y+DH SS+G LVTD PL+L NGS++ RL FGCGY+Q+NPGP PPP TAG
Sbjct: 139 EDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAG 198
Query: 193 VLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLE 252
+LGLG GK + +QL+SLG+T+NV+ HCLS G G+L +G +LVPSSG+ WT ++ +
Sbjct: 199 ILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPS 258
Query: 253 KHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTA 312
K+Y +GPAELLF K+TG+KG+ ++FDSGSSYTYFN++AY+ LDL+RKDL GKPL DT
Sbjct: 259 KNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTK 318
Query: 313 EEKALPVCWKG 323
++K+LPVCWKG
Sbjct: 319 DDKSLPVCWKG 329
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 355 bits (912), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 189/319 (59%), Positives = 235/319 (73%), Gaps = 14/319 (4%)
Query: 8 VMGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTL 67
+ + +++ F GCFS +NQP S +R G T VFP+ GNVYP G+YSV+L
Sbjct: 23 IEAMRFVVLSEMFLGCFSASNQPIS----------NRMGHTVVFPLQGNVYPQGFYSVSL 72
Query: 68 KIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPE 127
+IGNPPK Y LDID+GSDLTW+QC+APC CT P Y P + CNDP CSA H P
Sbjct: 73 RIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPNKGPITCNDPMCSALHWPS 132
Query: 128 NIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPK 186
C+A ++QCDYEV YADHGSSLGVLV D F L+LTNG+L PRL FGCGY+Q PGP
Sbjct: 133 KPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNGTLAAPRLAFGCGYDQSYPGPN 192
Query: 187 PPP-TAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTP 245
PP GVLGLG GK+SI++QL+SLGL R+++GHCLS RGGG+LFLG L + GI WTP
Sbjct: 193 APPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGLSTTPGIIWTP 252
Query: 246 MSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKG 305
MSR E Y+ GPA+LLF G+++G+KGL+++FDSGSSYTYFN+QAYKTTL L+RK L G
Sbjct: 253 MSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNG 312
Query: 306 KPLEDTAEEKALPVCWKGT 324
K L++TA+E +LPVCW+G
Sbjct: 313 K-LKETADE-SLPVCWRGA 329
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 189/307 (61%), Positives = 230/307 (74%), Gaps = 14/307 (4%)
Query: 20 FQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELD 79
F GCFS +NQP S +R G T VFP+ GNVYP G+YSV+L+IGNPPK Y LD
Sbjct: 2 FLGCFSASNQPIS----------NRMGHTVVFPLQGNVYPQGFYSVSLRIGNPPKPYTLD 51
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEA-NDQCD 138
ID+GSDLTW+QC+APC CT P Y P + CNDP CSA H P C+A ++QCD
Sbjct: 52 IDSGSDLTWLQCDAPCVSCTKAPHPPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQCD 111
Query: 139 YEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPP-TAGVLGLG 197
YEV YADHGSSLGVLV D F L+LTNG+L PRL FGCGY+Q PGP PP GVLGLG
Sbjct: 112 YEVSYADHGSSLGVLVHDIFSLQLTNGTLAAPRLAFGCGYDQSYPGPNAPPFVDGVLGLG 171
Query: 198 LGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSS 257
GK+SI++QL+SLGL R+++GHCLS RGGG+LFLG L + GI WTPMSR E Y+
Sbjct: 172 YGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGLSTTPGIIWTPMSRKSGESAYAL 231
Query: 258 GPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKAL 317
GPA+LLF G+++G+KGL+++FDSGSSYTYFN+QAYKTTL L+RK L GK L++TA+E +L
Sbjct: 232 GPADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNGK-LKETADE-SL 289
Query: 318 PVCWKGT 324
PVCW+G
Sbjct: 290 PVCWRGA 296
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 179/311 (57%), Positives = 231/311 (74%), Gaps = 12/311 (3%)
Query: 15 LMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPK 74
++ A FQ SEA + S + Q+ R ST VFP++GNVYPLGYY V L IGNPPK
Sbjct: 24 ILCARFQT--SEATKDSSAQVKLQN---RRLSSTVVFPVSGNVYPLGYYYVLLNIGNPPK 78
Query: 75 LYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRC-EA 133
L++LDIDTGSDLTWVQC+APC GCT Y P +N + C+ CS LP++ C +
Sbjct: 79 LFDLDIDTGSDLTWVQCDAPCNGCTK-----YKPNHNTLPCSHILCSGLDLPQDRPCADP 133
Query: 134 NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPP-TAG 192
DQCDYE+ Y+DH SS+G LVTD PL+L NGS++ RL FGCGY+Q+NPGP PPP TAG
Sbjct: 134 EDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAG 193
Query: 193 VLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLE 252
+LGLG GK + +QL+SLG+T+NV+ HCLS G G+L +G +LVPSSG+ WT ++ +
Sbjct: 194 ILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPS 253
Query: 253 KHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTA 312
K+Y +GPAELLF K+TG+KG+ ++FDSGSSYTYFN++AY+ LDL+RKDL GKPL DT
Sbjct: 254 KNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTK 313
Query: 313 EEKALPVCWKG 323
++K+LPVCWKG
Sbjct: 314 DDKSLPVCWKG 324
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 185/308 (60%), Positives = 224/308 (72%), Gaps = 4/308 (1%)
Query: 18 ATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYE 77
A F FS NQ + KK S++A GS+ F I GNVYPLGYY+V+L IGNPPK+Y+
Sbjct: 21 AIFPTSFS--NQVLNSKKPIPSSSASSLGSSVAFQIKGNVYPLGYYTVSLAIGNPPKVYD 78
Query: 78 LDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEA-NDQ 136
LDIDTGSDLTWVQC+APC GCTLP LY P +LV C DP C+A N C N+Q
Sbjct: 79 LDIDTGSDLTWVQCDAPCKGCTLPRNRLYKPHGDLVKCVDPLCAAIQSAPNHHCAGPNEQ 138
Query: 137 CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPP-TAGVLG 195
CDYEV YAD GSSLGVL+ D+ PL+ TNGSL P L FGCGY+Q + G PPP TAGVLG
Sbjct: 139 CDYEVEYADQGSSLGVLLRDNIPLKFTNGSLARPMLAFGCGYDQTHHGQNPPPSTAGVLG 198
Query: 196 LGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHY 255
LG G+ SILSQL SLGL RNV+GHCLS RGGG+LF G L+P SG+ WTP+ + +HY
Sbjct: 199 LGNGRTSILSQLHSLGLIRNVVGHCLSGRGGGFLFFGDQLIPPSGVVWTPLLQSSSAQHY 258
Query: 256 SSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEK 315
+GPA+L F K+T +KGL++IFDSGSSYTYFNSQA+K ++L+ DL+GKPL +
Sbjct: 259 KTGPADLFFDRKTTSVKGLELIFDSGSSYTYFNSQAHKALVNLIANDLRGKPLSRATGDP 318
Query: 316 ALPVCWKG 323
+LP+CWKG
Sbjct: 319 SLPICWKG 326
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 177/289 (61%), Positives = 219/289 (75%), Gaps = 3/289 (1%)
Query: 38 QSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG 97
+S+A + F S+ + P+ GNVYPLG+++V++ IGNPPK++ELDIDTGSDLTWVQC+APCTG
Sbjct: 30 KSSAVNPFDSSILLPVKGNVYPLGHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTG 89
Query: 98 CTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTD 156
CTLP + LY P NN+V C +P CSA C+ NDQCDYEV YADHGSS+GVLV D
Sbjct: 90 CTLPHDRLYKPHNNVVRCGEPLCSALFSASKSPCKNPNDQCDYEVEYADHGSSIGVLVKD 149
Query: 157 HFPLRLTNGSLLGPRLIFGCGYNQRNPGPK-PPPTAGVLGLGLGKASILSQLQSLGLTRN 215
PLRLTNG++L P L FGCGY+Q N G + PP TAGVLGLG KA++ +QL +L RN
Sbjct: 150 PVPLRLTNGTILAPNLGFGCGYDQHNGGSQLPPLTAGVLGLGNSKATMATQLSALSHVRN 209
Query: 216 VLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ 275
VLGHC S +GGG+LF G DLVPSSG++W P+ R K YS+GPAE+ FGG GI+GL
Sbjct: 210 VLGHCFSGQGGGFLFFGGDLVPSSGMSWMPILRTPGGK-YSAGPAEVYFGGNPVGIRGLI 268
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
+ FDSGSSYTYFNSQ Y L+L+R LKG+PL D E+K LP+CWKG+
Sbjct: 269 LTFDSGSSYTYFNSQVYGAVLNLLRNGLKGQPLRDAPEDKTLPICWKGS 317
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 177/282 (62%), Positives = 207/282 (73%), Gaps = 4/282 (1%)
Query: 46 GSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL 105
S+ F I GNVYPLGYYSV L IGNPPK YELDIDTGSDLTWVQC+APC GCTLP +
Sbjct: 31 ASSIAFQIKGNVYPLGYYSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCKGCTLPRDRQ 90
Query: 106 YHPKNNLVACNDPFCSAFHLPENIRC-EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN 164
Y P NLV C DP C+A N C N+QCDYEV YAD GSSLGVLV D PL+LTN
Sbjct: 91 YKPHGNLVKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYADQGSSLGVLVRDIIPLKLTN 150
Query: 165 GSLLGPRLIFGCGYNQRNPGPKPPPT-AGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV 223
G+L L FGCGY+Q + G PPP+ AGVLGLG G+ASILSQL S GL RNV+GHCLS
Sbjct: 151 GTLTHSMLAFGCGYDQTHVGHNPPPSAAGVLGLGNGRASILSQLNSKGLIRNVVGHCLSG 210
Query: 224 RGGGYLFLGHDLVPSSGIAWTPM--SRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSG 281
GGG+LF G L+P SG+ WTP+ S L KHY +GPA++ F GK+T +KGL++ FDSG
Sbjct: 211 TGGGFLFFGDQLIPQSGVVWTPILQSSSSLLKHYKTGPADMFFNGKATSVKGLELTFDSG 270
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SSYTYFNS A+K +DL+ D+KGKPL E+ +LP+CWKG
Sbjct: 271 SSYTYFNSLAHKALVDLITNDIKGKPLSRATEDPSLPICWKG 312
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 174/324 (53%), Positives = 215/324 (66%), Gaps = 5/324 (1%)
Query: 1 MEEKGKRVMGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPL 60
ME+ R M ++LM + FS A +K + S R S+ VFP+ GNVYPL
Sbjct: 1 MEKMNVRFM---IVLMVMSLVLGFSSAVDFRWRKTAGFSDRFTRAVSSVVFPVHGNVYPL 57
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+VT+ IG PP+ Y LD+DTGSDLTW+QC+APC C P LY P ++L+ CNDP C
Sbjct: 58 GYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLC 117
Query: 121 SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQ 180
A HL N RCE +QCDYEV YAD GSSLGVLV D F + T G L PRL GCGY+Q
Sbjct: 118 KALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYDQ 177
Query: 181 RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSG 240
P GVLGLG GK SILSQL S G +NV+GHCLS GGG LF G DL SS
Sbjct: 178 IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSSR 237
Query: 241 IAWTPMSRDLLEKHYSSGP-AELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLM 299
++WTPMSR+ KHYS ELLFGG++TG+K L +FDSGSSYTYFNS+AY+ L+
Sbjct: 238 VSWTPMSRE-YSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLL 296
Query: 300 RKDLKGKPLEDTAEEKALPVCWKG 323
+++L GKPL++ ++ LP+CW+G
Sbjct: 297 KRELSGKPLKEARDDHTLPLCWQG 320
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 340 bits (872), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 174/324 (53%), Positives = 215/324 (66%), Gaps = 5/324 (1%)
Query: 1 MEEKGKRVMGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPL 60
ME+ R M LL+++ FS A +K + S R S+ VFP+ GNVYPL
Sbjct: 1 MEKMNVRFMILLIVMSLVL---GFSSAVDFRWRKTAGFSDRFTRAVSSVVFPVHGNVYPL 57
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+VT+ IG PP+ Y LD+DTGSDLTW+QC+APC C P LY P ++L+ CNDP C
Sbjct: 58 GYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLC 117
Query: 121 SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQ 180
A HL N RCE +QCDYEV YAD GSSLGVLV D F + T G L PRL GCGY+Q
Sbjct: 118 KALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTKGLRLTPRLALGCGYDQ 177
Query: 181 RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSG 240
P GVLGLG GK SILSQL S G +NV+GHCLS GGG LF G DL SS
Sbjct: 178 IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSSR 237
Query: 241 IAWTPMSRDLLEKHYSSGP-AELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLM 299
++WTPMSR+ KHYS ELLFGG++TG+K L +FDSGSSYTYFNS+AY+ L+
Sbjct: 238 VSWTPMSRE-YSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLL 296
Query: 300 RKDLKGKPLEDTAEEKALPVCWKG 323
+++L GKPL++ ++ LP+CW+G
Sbjct: 297 KRELSGKPLKEARDDHTLPLCWQG 320
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 170/314 (54%), Positives = 211/314 (67%), Gaps = 2/314 (0%)
Query: 11 LLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIG 70
+++LM + FS A +K + S R S+ VFP+ GNVYPLGYY+VT+ IG
Sbjct: 5 FMIVLMVMSLVLGFSSAVDFRWRKTAGFSDRFTRAVSSVVFPVHGNVYPLGYYNVTINIG 64
Query: 71 NPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIR 130
PP+ Y LD+DTGSDLTW+QC+APC C P LY P ++L+ CNDP C A HL N R
Sbjct: 65 QPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQR 124
Query: 131 CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPT 190
CE +QCDYEV YAD GSSLGVLV D F + T G L PRL GCGY+Q P
Sbjct: 125 CETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYDQIPGASSHHPL 184
Query: 191 AGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDL 250
GVLGLG GK SILSQL S G +NV+GHCLS GGG LF G DL SS ++WTPMSR+
Sbjct: 185 DGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSSRVSWTPMSRE- 243
Query: 251 LEKHYSSGP-AELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLE 309
KHYS ELLFGG++TG+K L +FDSGSSYTYFNS+AY+ L++++L GKPL+
Sbjct: 244 YSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLK 303
Query: 310 DTAEEKALPVCWKG 323
+ ++ LP+CW+G
Sbjct: 304 EARDDHTLPLCWQG 317
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 338 bits (866), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 177/326 (54%), Positives = 219/326 (67%), Gaps = 9/326 (2%)
Query: 1 MEEKGKRVMGLLVLLMFATFQGCFSEANQPPSKK---KSTQSTAAHRFGSTAVFPITGNV 57
M + K++M + ++LM G S+ Q K S+ GS+ V P+ GNV
Sbjct: 5 MAKICKQIMSVFLVLMIV---GVSSDDQQQSWWKWFSSGASSSVVSSVGSSVVLPLYGNV 61
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACND 117
YP GYY V IG PPK Y LD DTGSDLTW+QC+APC CT P LY P N+LV C D
Sbjct: 62 YPSGYYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQPTNDLVVCKD 121
Query: 118 PFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
P C++ H P+N RC+ DQCDYEV YAD GSS+GVLV D FP+ LT+G PRL GCG
Sbjct: 122 PICASLH-PDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMRARPRLTIGCG 180
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVP 237
Y+Q PG P GVLGLG G +SI++QL S GL RNV+GHC S RGGGYLF G D+
Sbjct: 181 YDQL-PGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGDDIYD 239
Query: 238 SSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLD 297
SS + WTPMSRD L KHY+ G AEL+ G+S+G+K L ++FDSGSSYTYFN+Q Y+T L
Sbjct: 240 SSKVIWTPMSRDYL-KHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQTLLS 298
Query: 298 LMRKDLKGKPLEDTAEEKALPVCWKG 323
++KDL GKPL++ E+ LPVCW+G
Sbjct: 299 FIKKDLHGKPLKEAVEDDTLPVCWRG 324
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 168/301 (55%), Positives = 205/301 (68%), Gaps = 2/301 (0%)
Query: 24 FSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTG 83
FS A +K + S R S+ VFP+ GNVYPLGYY+VT+ IG PP+ Y LD+DTG
Sbjct: 9 FSSAVDFRWRKTAGFSDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTG 68
Query: 84 SDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLY 143
SDLTW+QC+APC C P LY P ++L+ CNDP C A HL N RCE +QCDYEV Y
Sbjct: 69 SDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEY 128
Query: 144 ADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASI 203
AD GSSLGVLV D F + T G L PRL GCGY+Q P GVLGLG GK SI
Sbjct: 129 ADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSI 188
Query: 204 LSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGP-AEL 262
LSQL S G +NV+GHCLS GGG LF G DL SS ++WTPMSR+ KHYS EL
Sbjct: 189 LSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSSRVSWTPMSRE-YSKHYSPAMGGEL 247
Query: 263 LFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
LFGG++TG+K L +FDSGSSYTYFNS+AY+ L++++L GKPL++ ++ LP+CW+
Sbjct: 248 LFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 307
Query: 323 G 323
G
Sbjct: 308 G 308
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 334 bits (856), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 173/324 (53%), Positives = 215/324 (66%), Gaps = 8/324 (2%)
Query: 1 MEEKGKRVMGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPL 60
ME+ R L++ M + FS A +K + + T R S+ VFP+ GNVYPL
Sbjct: 1 MEKMNVR---LIIASMVLSLVLGFSSAVDFRWRKAADRFT---RAASSVVFPVHGNVYPL 54
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+VT+ IG PP+ Y LD+DTGSDLTW+QC+APC C P LY P N+L+ CNDP C
Sbjct: 55 GYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVHCLEAPHPLYQPSNDLIPCNDPLC 114
Query: 121 SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQ 180
A H N RCE +QCDYEV YAD GSSLGVLV D F L T G L PRL GCGY+Q
Sbjct: 115 KALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDVFSLNYTKGLRLTPRLALGCGYDQ 174
Query: 181 RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSG 240
P GVLGLG GK SILSQL S G +NV+GHCLS GGG LF G+DL SS
Sbjct: 175 IPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCLSSLGGGILFFGNDLYDSSR 234
Query: 241 IAWTPMSRDLLEKHYSSGP-AELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLM 299
++WTPM+R+ KHYS ELLFGG++TG+K L +FDSGSSYTYFNS+AY+ L+
Sbjct: 235 VSWTPMARE-NSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLL 293
Query: 300 RKDLKGKPLEDTAEEKALPVCWKG 323
+++L GKPL++ ++ LP+CW+G
Sbjct: 294 KRELSGKPLKEARDDHTLPLCWQG 317
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 333 bits (854), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 167/288 (57%), Positives = 202/288 (70%), Gaps = 3/288 (1%)
Query: 36 STQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC 95
+ S+ + S+ VFP+ GNVYPLGYY V+L IG PPK Y LD DTGSDL+W+QC+APC
Sbjct: 40 AASSSLINIIQSSVVFPLYGNVYPLGYYYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPC 99
Query: 96 TGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVT 155
CT P LY P NNLV C DP C++ H P +CE +QCDYEV YAD GSSLGVLV
Sbjct: 100 VRCTKAPHPLYRPNNNLVICKDPMCASLH-PPGYKCEHPEQCDYEVEYADGGSSLGVLVK 158
Query: 156 DHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRN 215
D FPL TNG L PRL GCGY+Q PG P GVLGLG GK+SI+SQL S G+ RN
Sbjct: 159 DVFPLNFTNGLRLAPRLALGCGYDQI-PGQSYHPLDGVLGLGKGKSSIVSQLHSQGVIRN 217
Query: 216 VLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ 275
V+GHC+S RGGG+LF G DL SS + WTPM RD HYSSG AEL+ GGK+T K L
Sbjct: 218 VVGHCVSSRGGGFLFFGDDLYDSSRVVWTPMLRD-QHTHYSSGYAELILGGKTTVFKNLL 276
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
+ FDSGSSYTY NS AY+ + L+RK+L KP+ + +++ LP+CW+G
Sbjct: 277 VTFDSGSSYTYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRG 324
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 333 bits (853), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 162/278 (58%), Positives = 197/278 (70%), Gaps = 2/278 (0%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S+ VFP+ GNVYPLGYY+VT+ IG PP+ Y LD+DTGSDLTW+QC+APC C P LY
Sbjct: 22 SSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 81
Query: 107 HPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS 166
P ++L+ CNDP C A HL N RCE +QCDYEV YAD GSSLGVLV D F + T G
Sbjct: 82 QPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGL 141
Query: 167 LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG 226
L PRL GCGY+Q P GVLGLG GK SILSQL S G +NV+GHCLS GG
Sbjct: 142 RLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 201
Query: 227 GYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGP-AELLFGGKSTGIKGLQIIFDSGSSYT 285
G LF G DL SS ++WTPMSR+ KHYS ELLFGG++TG+K L +FDSGSSYT
Sbjct: 202 GILFFGDDLYDSSRVSWTPMSRE-YSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYT 260
Query: 286 YFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
YFNS+AY+ L++++L GKPL++ ++ LP+CW+G
Sbjct: 261 YFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQG 298
>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 413
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 174/315 (55%), Positives = 220/315 (69%), Gaps = 7/315 (2%)
Query: 11 LLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIG 70
LLV ++FA+F S+ ++ RFGS+ +FP+ GNVYPLG+++V L IG
Sbjct: 5 LLVSILFASFAVSLSDK----FLFADSEQVKTLRFGSSVLFPVRGNVYPLGHFTVLLNIG 60
Query: 71 NPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAF-HLPENI 129
NP K++ELDIDTGSDLTWVQC+ C GCTLP + LY P NN V+ DP C+A L + I
Sbjct: 61 NPSKVFELDIDTGSDLTWVQCDVECIGCTLPRDMLYRPHNNAVSREDPLCAALSSLGKFI 120
Query: 130 RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPG-PKPP 188
NDQC YEV YADHGSS+GVLV D P+RLTNG + P L FGCGY+Q N +PP
Sbjct: 121 FKNPNDQCAYEVEYADHGSSVGVLVKDLVPMRLTNGKRISPNLGFGCGYDQENGDLQQPP 180
Query: 189 PTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSR 248
AGVLGL KA+I+SQL LG NV+GHCL+ RGGG+LF G D+VPSSG++WTP+ R
Sbjct: 181 SIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTGRGGGFLFFGGDVVPSSGMSWTPILR 240
Query: 249 DLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
+ E YSSGPAE+ F G++ GI GL + FDSGSSYTYFNSQ Y+ L++ DLKG PL
Sbjct: 241 N-SEGKYSSGPAEVYFNGRAVGIGGLTLTFDSGSSYTYFNSQVYRAIEKLLKNDLKGNPL 299
Query: 309 EDTAEEKALPVCWKG 323
+ +++K L +CWKG
Sbjct: 300 KLASDDKTLELCWKG 314
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 323 bits (829), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 173/322 (53%), Positives = 211/322 (65%), Gaps = 14/322 (4%)
Query: 12 LVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGN 71
L LL+ + F FS AN K S T+ H S+ V+ I GNVYP G Y+V++ IGN
Sbjct: 15 LFLLLSSIFPHHFSAAN----KNNSIPPTSIHSLISSLVYTIKGNVYPDGLYTVSINIGN 70
Query: 72 PPKLYELDIDTGSDLTWVQC---NAPCTGCTLPPESLYHPK-NNLVACNDPFCSA---FH 124
PPK YELDIDTGSDLTWVQC +APC GCT+P + LY P +V C+DP C A H
Sbjct: 71 PPKPYELDIDTGSDLTWVQCDGPDAPCKGCTMPKDKLYKPNGKQVVKCSDPICVATQSTH 130
Query: 125 LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPG 184
+ I + + C Y V YADH S+LGVLV D+ + + S P + FGCGY Q+ G
Sbjct: 131 VLGQICSKQSPPCVYNVQYADHASTLGVLVRDYMHIGSPSSSTKDPLVAFGCGYEQKFSG 190
Query: 185 PKPPPT--AGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIA 242
P PP + AG+LGLG GK SILSQL S+G NVLGHCLS GGGYLFLG VPSSGI
Sbjct: 191 PTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGHCLSAEGGGYLFLGDKFVPSSGIV 250
Query: 243 WTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKD 302
WTP+ + LEKHY++GP +L F GK T KGLQIIFDSGSSYTYF+S Y +++ D
Sbjct: 251 WTPIIQSSLEKHYNTGPVDLFFNGKPTPAKGLQIIFDSGSSYTYFSSPVYTIVANMVNND 310
Query: 303 LKGKPLEDTAEEKALPVCWKGT 324
LKGKPL ++ +LP+CWKG
Sbjct: 311 LKGKPLS-RVKDPSLPICWKGV 331
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 323 bits (828), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 168/336 (50%), Positives = 218/336 (64%), Gaps = 24/336 (7%)
Query: 9 MGLLVLLMFATFQGC---FSEANQPPSKKKSTQSTAA----------------HRFGSTA 49
+G+LVLL+ + C F ++ S + S + A R GS+
Sbjct: 6 LGILVLLVLFSSSTCSAWFGSKHKSSSGRSSFRPDEASSSSSSSSSSPYILNRFRAGSSV 65
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK 109
VFP+ GNVYP+G+Y+VTL IG PP+ Y LDIDTGSDLTW+QC+APC+ C+ P LY P
Sbjct: 66 VFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLYRPS 125
Query: 110 NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
N+LV C C++ HL +N CE QCDYEV YADH SSLGVL+ D + L TNG L
Sbjct: 126 NDLVPCRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFTNGVQLK 185
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYL 229
R+ GCGY+Q P P P G+LGLG GK S+ SQL S GL RNV+GHCLS +GGGY+
Sbjct: 186 VRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYI 245
Query: 230 FLGHDLVPSSGIAWTPM-SRDLLEKHYS-SGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
F G D+ S + WTPM SRD KHYS +G AELLFGGK +G+ L +FD+GSSYTYF
Sbjct: 246 FFG-DVYDSFRLTWTPMSSRDY--KHYSVAGAAELLFGGKKSGVGNLHAVFDTGSSYTYF 302
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
NS AY+ + ++K+ GKPL++ +++ LP+CW+G
Sbjct: 303 NSYAYQVLISWLKKESGGKPLKEAHDDQTLPLCWRG 338
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 323 bits (827), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 163/288 (56%), Positives = 197/288 (68%), Gaps = 3/288 (1%)
Query: 36 STQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC 95
+ S+ + S+ VFP+ GNVYPLGYY V+L IG PP Y LD TGSDL+W+QC+APC
Sbjct: 40 AASSSLINIIQSSVVFPLYGNVYPLGYYYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPC 99
Query: 96 TGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVT 155
CT LY P NNLV C DP C+ H P +CE +QCDYEV YAD GSSLGVLV
Sbjct: 100 VRCTKAXHXLYRPNNNLVICKDPMCAXLH-PPGYKCEHPEQCDYEVEYADGGSSLGVLVK 158
Query: 156 DHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRN 215
D FPL TNG L PRL GCGY+Q PG P GVLGLG GK+SI+SQL S G+ RN
Sbjct: 159 DVFPLNFTNGLRLAPRLALGCGYDQI-PGXSYHPLDGVLGLGKGKSSIVSQLHSQGVIRN 217
Query: 216 VLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ 275
V+GHC+S GGG+LF G DL SS + WTPM RD HYSSG AEL+ GGK+T K L
Sbjct: 218 VVGHCVSSHGGGFLFFGDDLYDSSRVVWTPMLRD-QHTHYSSGYAELILGGKTTVFKNLL 276
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
+ FDSGSSYTY NS AY+ + L+RK+L KP+ + +++ LP+CW+G
Sbjct: 277 VTFDSGSSYTYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRG 324
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 159/281 (56%), Positives = 198/281 (70%), Gaps = 9/281 (3%)
Query: 44 RFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
R GS+ VFP+ GNVYP+G+Y+VT+ IG PP+ Y LDIDTGSDLTW+QC+APC+ C+ P
Sbjct: 66 RSGSSVVFPVHGNVYPVGFYNVTINIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPH 125
Query: 104 SLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
LY P N+LV C P C++ H +N CE QCDYEV YADH SSLGVLV D + L T
Sbjct: 126 PLYRPSNDLVPCRHPLCASVHQTDNYECEVEHQCDYEVEYADHYSSLGVLVNDVYVLNFT 185
Query: 164 NGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV 223
NG L R+ GCGY+Q P P G+LGLG GK+S++SQL GL RNV+GHCLS
Sbjct: 186 NGVQLKVRMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCLSA 245
Query: 224 RGGGYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGS 282
+GGGY+F G D+ SS +AWTPM SRD KHYS+G AEL+ GGK TG L +FD+GS
Sbjct: 246 QGGGYIFFG-DVYDSSRLAWTPMSSRDY--KHYSAGAAELVLGGKRTGFGNLLAVFDAGS 302
Query: 283 SYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SYTYFNS AY+ T K+L GKP+++ E++ LP+CW G
Sbjct: 303 SYTYFNSNAYQLT-----KELAGKPIKEAPEDQTLPLCWYG 338
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 154/294 (52%), Positives = 203/294 (69%), Gaps = 1/294 (0%)
Query: 30 PPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWV 89
P S S H GS+ VFPI GNVYP+G+Y+VTL IG PP+ Y LD+DTGS+LTW+
Sbjct: 41 PGEAMSSRPSLMNHAAGSSIVFPIYGNVYPVGFYNVTLNIGQPPRPYFLDVDTGSELTWL 100
Query: 90 QCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSS 149
QC+APC+ C+ P LY P N+ + C DP C++ ++ CE +QCDYE+ YAD S+
Sbjct: 101 QCDAPCSQCSETPHPLYKPSNDFIPCKDPLCASLQPTDDYTCEDPNQCDYEIKYADQYST 160
Query: 150 LGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
LGVL+ D + L TNG L R+ GCGY+Q P G+LGLG GKAS++SQL S
Sbjct: 161 LGVLLNDVYLLNFTNGVQLKVRMALGCGYDQIFSPSTYHPLDGILGLGRGKASLISQLNS 220
Query: 210 LGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST 269
GL RNV+GHCLS RGGGY+F G ++ SS ++WTP+S KHYS+GPAEL+FGG+ T
Sbjct: 221 QGLVRNVMGHCLSSRGGGYIFFG-NVYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKT 279
Query: 270 GIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
G+ L IIFD+GSSYTYFNSQAY+ + L+ K+L KP++ +++ LP+CW G
Sbjct: 280 GVGSLNIIFDTGSSYTYFNSQAYQAMISLLNKELHRKPIKAAPDDQTLPMCWHG 333
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 153/274 (55%), Positives = 189/274 (68%), Gaps = 1/274 (0%)
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK 109
V P+ GNVYP G+Y+VTL +G PPK Y LD DTGSDLTW+QC+APC CT LY P
Sbjct: 44 VLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPS 103
Query: 110 NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
N+LV C DP C + H + RCE DQCDYEV YAD GSSLGVLV D FPL LTNG +
Sbjct: 104 NDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIR 163
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYL 229
PRL GCGY+Q P G+LGLG G SI+SQL + G+ RNV+GHC + +GGGYL
Sbjct: 164 PRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYL 223
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNS 289
F G + + WTPMSRD KHYS G EL+F G+STG++ L ++FDSGSSYTYFN+
Sbjct: 224 FFGDGIYDPYRLVWTPMSRD-YPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNA 282
Query: 290 QAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
QAY+ L+ ++L GKPL + ++ LP+CW+G
Sbjct: 283 QAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRG 316
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 317 bits (811), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 157/282 (55%), Positives = 197/282 (69%), Gaps = 5/282 (1%)
Query: 44 RFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
R GS+ VFP+ GNVYP+G+Y+VTL IG PP+ Y LDIDTGSDLTW+QC+APC+ C+ P
Sbjct: 58 RAGSSVVFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPH 117
Query: 104 SLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
LY P N+ V C C++ H +N CE QCDYEV YADH SSLGVL+ D + L T
Sbjct: 118 PLYRPSNDFVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFT 177
Query: 164 NGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV 223
NG L R+ GCGY+Q P P P G+LGLG GK S+ SQL S GL RNV+GHCLS
Sbjct: 178 NGVQLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSA 237
Query: 224 RGGGYLFLGHDLVPSSGIAWTPM-SRDLLEKHYS-SGPAELLFGGKSTGIKGLQIIFDSG 281
+GGGY+F G D+ SS + WTPM SRD KHYS +G AELLFGGK +GI L +FD+G
Sbjct: 238 QGGGYIFFG-DVYDSSRLTWTPMSSRDY--KHYSAAGAAELLFGGKKSGIGSLHAVFDTG 294
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SSYTYFN AY+ + + K+ GKPL++ +++ LP+CW+G
Sbjct: 295 SSYTYFNPYAYQALISWLGKESGGKPLKEAHDDQTLPLCWRG 336
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 167/287 (58%), Positives = 201/287 (70%), Gaps = 2/287 (0%)
Query: 37 TQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT 96
T S +R GS+ VFP+ GNVYP GYY+VTL IG P K Y LD+DTGSDLTW+QC+APC
Sbjct: 45 TSSMMINRAGSSLVFPLHGNVYPAGYYNVTLSIGQPAKPYFLDVDTGSDLTWLQCDAPCR 104
Query: 97 GCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD 156
C P LY P NNLV C DP C++ P C+ DQCDYEV YAD GSSLGVLV D
Sbjct: 105 QCIEAPHPLYRPSNNLVICEDPLCASLQPPGVHNCQDPDQCDYEVEYADGGSSLGVLVKD 164
Query: 157 HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
F L TNG L P L GCGY+Q PG P G+LGLG G +SI SQL S GL NV
Sbjct: 165 VFVLNFTNGKRLNPLLALGCGYDQL-PGRSNHPLDGILGLGRGISSIPSQLSSQGLVSNV 223
Query: 217 LGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI 276
+GHCLS RGGG+LF G D+ SSG+ WTPMSRD L KHYS G AEL+F GKSTGI+ L +
Sbjct: 224 IGHCLSGRGGGFLFFGEDIYDSSGVTWTPMSRDHL-KHYSPGFAELIFDGKSTGIRNLLV 282
Query: 277 IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
+FDSGSSYTY N+QAY+ + ++++L KP+ + +++ LP+CWKG
Sbjct: 283 VFDSGSSYTYLNAQAYQHLVFSLKRELSRKPISEALDDQTLPLCWKG 329
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 170/322 (52%), Positives = 210/322 (65%), Gaps = 14/322 (4%)
Query: 12 LVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGN 71
L LL+ + F FS AN K S T+ H S+ V+ I GNVYP G Y+V++ IGN
Sbjct: 15 LFLLLSSIFPHHFSAAN----KNNSIPPTSIHSLISSLVYTIKGNVYPDGIYTVSINIGN 70
Query: 72 PPKLYELDIDTGSDLTWVQC---NAPCTGCTLPPESLYHPK-NNLVACNDPFCSAFHLPE 127
PP YELDIDTGSDLTWVQC +APC GCTLP + LY P N LV C+DP C+A P
Sbjct: 71 PPNPYELDIDTGSDLTWVQCDGPDAPCKGCTLPKDKLYKPNGNQLVKCSDPICAAVQPPF 130
Query: 128 NI---RC-EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQR-N 182
+ +C + C Y+V YAD+ S G L D+ + +GS + P ++FGCGY Q+ +
Sbjct: 131 STFGQKCAKPIPPCVYKVEYADNAESTGALARDYMHIGSPSGSNV-PLVVFGCGYEQKFS 189
Query: 183 PGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIA 242
PP T GVLGLG GK SILSQL S+G NVLGHCLS GGGYLFLG +PSSGI
Sbjct: 190 GPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVLGHCLSAEGGGYLFLGDKFIPSSGIF 249
Query: 243 WTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKD 302
WTP+ + LEKHYS+GP +L F GK T KGLQIIFDSGSSYTYF+ + Y +++ D
Sbjct: 250 WTPIIQSSLEKHYSTGPVDLFFNGKPTPAKGLQIIFDSGSSYTYFSPRVYTIVANMVNND 309
Query: 303 LKGKPLEDTAEEKALPVCWKGT 324
LKGKPL ++ +LP+CWKG
Sbjct: 310 LKGKPLRRETKDPSLPICWKGV 331
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 303 bits (777), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 168/292 (57%), Positives = 198/292 (67%), Gaps = 3/292 (1%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
S + S +R S+ V P+ GNVYP GYY+VTL IG P K Y LD+DTGSDLTW+QC
Sbjct: 3 SGETMASSMLINRVPSSIVLPLHGNVYPNGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQC 62
Query: 92 NAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLG 151
+APC CT P Y P+NNLV C DP C + H + RCE QCDYEV YAD GSS G
Sbjct: 63 DAPCVQCTEAPHPYYRPRNNLVPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFG 122
Query: 152 VLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLG 211
VLVTD F L T+ P L GCGY+Q PG P GVLGLG GK+SI+SQL SLG
Sbjct: 123 VLVTDTFNLNFTSEKRHSPLLALGCGYDQF-PGGSHHPIDGVLGLGKGKSSIVSQLSSLG 181
Query: 212 LTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI 271
L RNV+GHCLS GGG+LF G DL SS +AWTPMS D KHYS G AEL F GK+TG
Sbjct: 182 LVRNVIGHCLSGHGGGFLFFGDDLYDSSRVAWTPMSPD--AKHYSPGLAELTFDGKTTGF 239
Query: 272 KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
K L FDSG+SYTY NSQAY+ + L++K+L GKPL + +++ LP+CWKG
Sbjct: 240 KNLLTTFDSGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKG 291
>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 429
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 152/293 (51%), Positives = 202/293 (68%), Gaps = 2/293 (0%)
Query: 31 PSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQ 90
PS+ S++S + GS+ V P+ GNVYP+G+Y+VTL IG P + Y LD+DTGSDLTW+Q
Sbjct: 37 PSEATSSRSRLLNPAGSSIVLPLYGNVYPVGFYNVTLNIGQPARPYFLDVDTGSDLTWLQ 96
Query: 91 CNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSL 150
C+APCT C+ P LY P N+ V C DP C++ E+ CE DQCDYE+ YAD S+
Sbjct: 97 CDAPCTHCSETPHPLYRPSNDFVPCRDPLCASLQPTEDYNCEHPDQCDYEINYADQYSTF 156
Query: 151 GVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL 210
GVL+ D + L TNG L R+ GCGY+Q P G+LGLG GKAS++SQL S
Sbjct: 157 GVLLNDVYLLNFTNGVQLKVRMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQ 216
Query: 211 GLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTG 270
GL RNV+GHCLS +GGGY+F G + S+ + WTP+S + KHYS+GPAEL+FGG+ TG
Sbjct: 217 GLVRNVIGHCLSAQGGGYIFFG-NAYDSARVTWTPIS-SVDSKHYSAGPAELVFGGRKTG 274
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
+ L +FD+GSSYTYFNS AY+ L ++K+L GKPL+ +++ LP+CW G
Sbjct: 275 VGSLTAVFDTGSSYTYFNSHAYQALLSWLKKELSGKPLKVAPDDQTLPLCWHG 327
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 152/274 (55%), Positives = 188/274 (68%), Gaps = 1/274 (0%)
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK 109
V P+ GNVYP G+Y+VTL +G PPK Y LD DTGSDLTW+QC+APC CT LY P
Sbjct: 44 VLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPS 103
Query: 110 NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
N+LV C DP C + H + RCE DQCDYEV YAD GSSLGVLV D FPL LTNG +
Sbjct: 104 NDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIR 163
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYL 229
PRL GCGY+Q P G+LGLG G SI+SQL + G+ RNV+GHC + +GGGY
Sbjct: 164 PRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYX 223
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNS 289
F G + + WTPMSRD KHYS G EL+F G+STG++ L ++FDSGSSYTYFN+
Sbjct: 224 FFGDGIYDPYRLVWTPMSRD-YPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNA 282
Query: 290 QAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
QAY+ L+ ++L GKPL + ++ LP+CW+G
Sbjct: 283 QAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRG 316
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 149/282 (52%), Positives = 196/282 (69%), Gaps = 7/282 (2%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S+AVFP+ G+VYP G Y V + IGNPP+ Y LD+DTGSDLTW+QC+APC C+ P LY
Sbjct: 42 SSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY 101
Query: 107 HP-KNNLVACNDPFCSAFH--LPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRL 162
P KN LV C D C+A H L +C++ QCDYE+ YAD GSSLGVLVTD F LRL
Sbjct: 102 RPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRL 161
Query: 163 TNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
N S++ P L FGCGY+Q+ + T GVLGLG G S+LSQL+ G+T+NV+GHCL
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 222 SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSG 281
S RGGG+LF G D+VP S W PM+R +YS G A L FGG+ G++ ++++FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SS+TYF++Q Y+ +D ++ DL K L++ + +LP+CWKG
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLS-KNLKEVPDH-SLPLCWKG 321
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 149/282 (52%), Positives = 196/282 (69%), Gaps = 7/282 (2%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S+AVFP+ G+VYP G Y V + IGNPP+ Y LD+DTGSDLTW+QC+APC C+ P LY
Sbjct: 42 SSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY 101
Query: 107 HP-KNNLVACNDPFCSAFH--LPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRL 162
P KN LV C D C+A H L +C++ QCDYE+ YAD GSSLGVLVTD F LRL
Sbjct: 102 RPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRL 161
Query: 163 TNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
N S++ P L FGCGY+Q+ + T GVLGLG G S+LSQL+ G+T+NV+GHCL
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 222 SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSG 281
S RGGG+LF G D+VP S W PM+R +YS G A L FGG+ G++ ++++FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SS+TYF++Q Y+ +D ++ DL K L++ + +LP+CWKG
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLS-KNLKEVPDH-SLPLCWKG 321
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 149/282 (52%), Positives = 196/282 (69%), Gaps = 7/282 (2%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S+AVFP+ G+VYP G Y V + IGNPP+ Y LD+DTGSDLTW+QC+APC C+ P LY
Sbjct: 42 SSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY 101
Query: 107 HP-KNNLVACNDPFCSAFH--LPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRL 162
P KN LV C D C+A H L +C++ QCDYE+ YAD GSSLGVLVTD F LRL
Sbjct: 102 RPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRL 161
Query: 163 TNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
N S++ P L FGCGY+Q+ + T GVLGLG G S+LSQL+ G+T+NV+GHCL
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 222 SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSG 281
S RGGG+LF G D+VP S W PM+R +YS G A L FGG+ G++ ++++FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SS+TYF++Q Y+ +D ++ DL K L++ + +LP+CWKG
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLS-KNLKEVPDH-SLPLCWKG 321
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 151/290 (52%), Positives = 198/290 (68%), Gaps = 7/290 (2%)
Query: 39 STAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC 98
S A S+AVFP+ G+VYP G Y V + IGNPP+ Y LD+DTGSDLTW+QC+APC C
Sbjct: 34 SVTAGAEESSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSC 93
Query: 99 TLPPESLYHP-KNNLVACNDPFCSAFH--LPENIRCEA-NDQCDYEVLYADHGSSLGVLV 154
+ P LY P KN LV C D C+A H L +C++ QCDYE+ YAD GSSLGVLV
Sbjct: 94 SKVPHPLYRPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLV 153
Query: 155 TDHFPLRLTNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLT 213
TD F LRL N S++ P L FGCGY+Q+ + T GVLGLG G S+LSQL+ G+T
Sbjct: 154 TDSFALRLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGIT 213
Query: 214 RNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG 273
+NV+GHCLS RGGG+LF G D+VP S W PM+R +YS G A L FGG+ G++
Sbjct: 214 KNVVGHCLSTRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRP 273
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
++++FDSGSS+TYF++Q Y+ +D ++ DL K L++ + +LP+CWKG
Sbjct: 274 MEVVFDSGSSFTYFSAQPYQALVDAIKGDLS-KNLKEVPDH-SLPLCWKG 321
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 165/281 (58%), Positives = 193/281 (68%), Gaps = 4/281 (1%)
Query: 44 RFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
R S+ V P+ GNVYP GYY+VTL IG P K Y LD+DTGSDLTW+QC+APC CT P
Sbjct: 1 RVPSSIVLPLHGNVYPNGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPH 60
Query: 104 SLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
Y P+NNLV C DP C + H + RCE QCDYEV YAD GSS GVLV D F L T
Sbjct: 61 PYYRPRNNLVPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVRDTFNLNFT 120
Query: 164 NGSLLGPRLIFG-CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
+ P L G CGY+Q PG P GVLGLG GK+SI+SQL SLGL RNV+GHCLS
Sbjct: 121 SEKRHSPLLALGLCGYDQF-PGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLS 179
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGS 282
GGG+LF G DL SS +AWTPMS D KHYS G AEL F GK+TG K L FDSG+
Sbjct: 180 GHGGGFLFFGDDLYDSSRVAWTPMSPD--AKHYSPGLAELTFDGKTTGFKNLLTTFDSGA 237
Query: 283 SYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SYTY NSQAY+ + L++K+L GKPL + +++ LP+CWKG
Sbjct: 238 SYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKG 278
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 149/302 (49%), Positives = 198/302 (65%), Gaps = 6/302 (1%)
Query: 26 EANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSD 85
++P S+ S+AVFP+ G+VYP G Y V + IGNPPK Y LD+DTGSD
Sbjct: 29 RGDKPVRGGASSSVAGVETEASSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDTGSD 88
Query: 86 LTWVQCNAPCTGCTLPPESLYHP-KNNLVACNDPFCSAFH--LPENIRCEA-NDQCDYEV 141
LTW+QC+APC C P LY P KN LV C D C++ H L +C++ +QCDY +
Sbjct: 89 LTWLQCDAPCRSCNKVPHPLYRPTKNKLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVI 148
Query: 142 LYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKA 201
YAD GSS GVLV D F LRL NGS++ P L FGCGY+Q+ + PT GVLGLG G
Sbjct: 149 KYADQGSSTGVLVNDSFALRLANGSVVRPSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSV 208
Query: 202 SILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAE 261
S+LSQ + G+T+NV+GHCLS+RGGG+LF G DLVP + WTPM R L +YS G A
Sbjct: 209 SLLSQFKQHGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSAS 268
Query: 262 LLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
L FG +S +K +++FDSGSS+TYF +Q Y+ + ++ DL + L++ ++ +LP+CW
Sbjct: 269 LYFGDQSLRVKLTEVVFDSGSSFTYFAAQPYQALVTALKGDLS-RTLKEVSDP-SLPLCW 326
Query: 322 KG 323
KG
Sbjct: 327 KG 328
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 291 bits (744), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 145/282 (51%), Positives = 194/282 (68%), Gaps = 7/282 (2%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S+AVF + G+VYP G Y V + IGNPP+ Y LD+DTGSDLTW+QC+APC C P LY
Sbjct: 42 SSAVFQLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPHPLY 101
Query: 107 HP-KNNLVACNDPFCSAFH--LPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRL 162
P KN +V C D CS+ H L +C++ QCDYE+ YAD GSSLGVL+TD F +RL
Sbjct: 102 RPTKNKIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAVRL 161
Query: 163 TNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
N S++ P L FGCGY+Q+ + PT GVLGLG G S+LSQL+ G+T+NV+GHCL
Sbjct: 162 ANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCL 221
Query: 222 SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSG 281
S+RGGG+LF G +LVP S W PM R + +YS G A L FGG+S G++ ++++ DSG
Sbjct: 222 SIRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSG 281
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SS+TYF +Q Y+ + ++ DL K L++ + +LP+CWKG
Sbjct: 282 SSFTYFGAQPYQALVTALKSDLS-KTLKEVFDP-SLPLCWKG 321
>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 431
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 148/293 (50%), Positives = 199/293 (67%), Gaps = 2/293 (0%)
Query: 31 PSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQ 90
P + S+ + + GS+ VFP+ GNVYP+G+Y+VTL IG P + Y LD+DTGSDLTW+Q
Sbjct: 39 PGEAISSWPSLLNPAGSSIVFPLYGNVYPVGFYNVTLNIGQPARPYFLDVDTGSDLTWLQ 98
Query: 91 CNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSL 150
C+APCT C+ P L+ P N+ V C DP C++ E+ CE DQCDYE+ YAD S+
Sbjct: 99 CDAPCTHCSETPHPLHRPSNDFVPCRDPLCASLQPTEDYNCEHPDQCDYEINYADQYSTY 158
Query: 151 GVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL 210
GVL+ D + L +NG L R+ GCGY+Q P G+LGLG GKAS++SQL S
Sbjct: 159 GVLLNDVYLLNSSNGVQLKVRMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQ 218
Query: 211 GLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTG 270
GL RNV+GHCLS +GGGY+F G + S+ + WTP+S + KHYS+GPAEL+FGG+ TG
Sbjct: 219 GLVRNVIGHCLSSQGGGYIFFG-NAYDSARVTWTPIS-SVDSKHYSAGPAELVFGGRKTG 276
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
+ L +FD+GSSYTYFNS AY+ L + K+L GKPL+ +++ L +CW G
Sbjct: 277 VGSLTAVFDTGSSYTYFNSHAYQALLSWLNKELSGKPLKVAPDDQTLSLCWHG 329
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 160/281 (56%), Positives = 190/281 (67%), Gaps = 4/281 (1%)
Query: 44 RFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
R S+ V P+ GNVYP G+Y+VTL IG P K Y LD+DTGSDLTW+QC+ P CT P
Sbjct: 1 RVPSSIVLPLHGNVYPTGFYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPH 60
Query: 104 SLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
Y P NNLVAC DP C + H + RCE QCDYEV YAD GSSLGVLV D F L T
Sbjct: 61 PYYKPSNNLVACKDPICQSLHTGGDQRCENPGQCDYEVEYADGGSSLGVLVKDAFNLNFT 120
Query: 164 NGSLLGPRLIFG-CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
+ P L G CGY+Q PG P GVLGLG GK SI+SQL LGL RNV+GHCLS
Sbjct: 121 SEKRQSPLLALGLCGYDQL-PGGTYHPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGHCLS 179
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGS 282
RGGG+LF G DL SS +AWTPMS + KHYS G AEL F GK+TG K L + FDSG+
Sbjct: 180 GRGGGFLFFGDDLYDSSRVAWTPMSPN--AKHYSPGFAELTFDGKTTGFKNLIVAFDSGA 237
Query: 283 SYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SYTY NSQ Y+ + L++++L KPL + +++ LP+CWKG
Sbjct: 238 SYTYLNSQVYQGLISLIKRELSTKPLREALDDQTLPICWKG 278
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 146/293 (49%), Positives = 196/293 (66%), Gaps = 7/293 (2%)
Query: 36 STQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC 95
S+ + A S+AVFP+ G+VYP G Y V + IGNPPK Y LD+D+GSDLTW+QC+APC
Sbjct: 39 SSIAAGAETEPSSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC 98
Query: 96 TGCTLPPESLYHP-KNNLVACNDPFCSAFH--LPENIRCEA-NDQCDYEVLYADHGSSLG 151
C P LY P K+ LV C C++ H L RC++ ++QCDY + YAD GSS G
Sbjct: 99 RSCNEVPHPLYRPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTG 158
Query: 152 VLVTDHFPLRLTNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSL 210
VL+ D F LRLTNGS+ P + FGCGY+Q+ G PT GVLGLG G S+LSQL+
Sbjct: 159 VLINDSFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQR 218
Query: 211 GLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTG 270
G+T+NV+GHCLS+RGGG+LF G DLVP WTPM+R +YS G A L FG +S G
Sbjct: 219 GVTKNVVGHCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLG 278
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
++ +++FDSGSS+TYF ++ Y+ + ++ L + LE+ + +LP+CWKG
Sbjct: 279 VRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLS-RTLEEE-PDTSLPLCWKG 329
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/293 (50%), Positives = 196/293 (66%), Gaps = 7/293 (2%)
Query: 36 STQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC 95
S+ + A S+AVFP+ G+VYP G Y V + IGNPPK Y LD+D+GSDLTW+QC+APC
Sbjct: 39 SSIAAGAETEPSSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC 98
Query: 96 TGCTLPPESLYHP-KNNLVACNDPFCSAFH--LPENIRCEA-NDQCDYEVLYADHGSSLG 151
C P LY P K+ LV C C++ H L RC++ ++QCDY + YAD GSS G
Sbjct: 99 RSCNEVPHPLYRPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTG 158
Query: 152 VLVTDHFPLRLTNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSL 210
VL+ D F LRLTNGS+ P + FGCGY+Q+ G PT GVLGLG G S+LSQL+
Sbjct: 159 VLINDSFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQR 218
Query: 211 GLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTG 270
G+T+NV+GHCLS+RGGG+LF G DLVP WTPM+R +YS G A L FG +S G
Sbjct: 219 GVTKNVVGHCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLG 278
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
++ +++FDSGSS+TYF ++ Y+ + + KD + LE+ + +LP+CWKG
Sbjct: 279 VRLAKVVFDSGSSFTYFAAKPYQALVTAL-KDGLSRTLEEE-PDTSLPLCWKG 329
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 147/294 (50%), Positives = 195/294 (66%), Gaps = 8/294 (2%)
Query: 36 STQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC 95
S+ + A S+AVFP+ G+VYP G Y V + IGNPPK Y LD+D+GSDLTW+QC+APC
Sbjct: 37 SSIAAGAETEPSSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC 96
Query: 96 TGCTLPPESLYHP-KNNLVACNDPFCSAFH---LPENIRCEA-NDQCDYEVLYADHGSSL 150
C P LY P K+ LV C C++ H RCE+ ++QCDY + YAD GSS
Sbjct: 97 RSCNEVPHPLYRPTKSKLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSST 156
Query: 151 GVLVTDHFPLRLTNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQS 209
GVLV D F LRLTNGS+ P + FGCGY+Q+ G PT GVLGLG G S+LSQL+
Sbjct: 157 GVLVNDSFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQ 216
Query: 210 LGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST 269
G+T+NV+GHCLS+RGGG+LF G DLVP WTPM+R +YS G A L FG +S
Sbjct: 217 RGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSL 276
Query: 270 GIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
G++ +++FDSGSS+TYF ++ Y+ + ++ L + LE+ + +LP+CWKG
Sbjct: 277 GVRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLS-RTLEEE-PDTSLPLCWKG 328
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 143/281 (50%), Positives = 190/281 (67%), Gaps = 7/281 (2%)
Query: 48 TAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYH 107
AVFP+ G+VYP G Y V + IGNPPK Y LD+D+GSDLTW+QC+APC C P LY
Sbjct: 42 AAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYR 101
Query: 108 P-KNNLVACNDPFCSAFH--LPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
P K+ LV C C++ H L RC++ ++QCDY + YAD GSS GVL+ D F LRLT
Sbjct: 102 PTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLT 161
Query: 164 NGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
NGS+ P + FGCGY+Q+ G PT GVLGLG G S+LSQL+ G+T+NV+GHCLS
Sbjct: 162 NGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLS 221
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGS 282
+RGGG+LF G DLVP WTPM+R +YS G A L FG +S G++ +++FDSGS
Sbjct: 222 LRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGS 281
Query: 283 SYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
S+TYF ++ Y+ + ++ L + LE+ + +LP+CWKG
Sbjct: 282 SFTYFAAKPYQALVTALKDGLS-RTLEEE-PDTSLPLCWKG 320
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 140/279 (50%), Positives = 184/279 (65%), Gaps = 7/279 (2%)
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP- 108
VF ++G+VYP G+Y VT+ IG+P K Y LD+DTGSDLTW+QC+APC C P LY P
Sbjct: 44 VFLLSGDVYPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPT 103
Query: 109 KNNLVACNDPFCSAFHL--PENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS 166
KN LV C + C+A H N +C QCDY++ Y D SSLGVLVTD F L L N S
Sbjct: 104 KNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKS 163
Query: 167 LLGPRLIFGCGYNQR--NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR 224
+ P L FGCGY+Q+ G P T G+LGLG G S+LSQL+ G+T+NVLGHCLS
Sbjct: 164 NVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTS 223
Query: 225 GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSY 284
GGG+LF G D+VP+S + W PM R +YS G A L F +S K ++++FDSGS+Y
Sbjct: 224 GGGFLFFGDDMVPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGSTY 283
Query: 285 TYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
TYF++Q Y+ T+ ++ L K L+ ++ +LP+CWKG
Sbjct: 284 TYFSAQPYQATISAIKGSL-SKSLKQVSDP-SLPLCWKG 320
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 138/279 (49%), Positives = 182/279 (65%), Gaps = 7/279 (2%)
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP- 108
VF ++G+VYP G+Y VT+ IG+P K Y LD+DTGSDLTW+QC+APC C P LY P
Sbjct: 44 VFLLSGDVYPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPT 103
Query: 109 KNNLVACNDPFCSAFHL--PENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS 166
KN LV C + C+A H N +C QCDY++ Y D SSLGVLV D F L L N S
Sbjct: 104 KNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRNKS 163
Query: 167 LLGPRLIFGCGYNQR--NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR 224
+ P L FGCGY+Q+ G P T G+LGLG G S+LSQL+ G+T+NVLGHCLS
Sbjct: 164 NVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTS 223
Query: 225 GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSY 284
GGG+LF G D+VP+S + W M R +YS G A L F +S K ++++FDSGS+Y
Sbjct: 224 GGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGSTY 283
Query: 285 TYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
TYF++Q Y+ T+ ++ L K L+ ++ +LP+CWKG
Sbjct: 284 TYFSAQPYQATISAIKGSL-SKSLKQVSDP-SLPLCWKG 320
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 136/231 (58%), Positives = 161/231 (69%), Gaps = 13/231 (5%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S+ V P++GNV+PLGYYSV L+IG PPK +E DIDTGSDLTWVQC+APCTGCTLPP Y
Sbjct: 38 SSVVLPLSGNVFPLGYYSVLLQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCTLPPIRQY 97
Query: 107 HPKNNLVACNDPFCSAFHLPENIRC-EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG 165
PK N V C DP C A H P +C +QCDYEV YAD GSS+G LV D FPL+L NG
Sbjct: 98 KPKGNTVPCLDPICLALHFPNKPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPLKLLNG 157
Query: 166 SLLGPRLIFGCGYNQRNPGPKPPP-TAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR 224
S + PRL FGCGY+Q P PPP TAGVLGLG GK +L QL + GLTRNV+GHCLS +
Sbjct: 158 SAMQPRLAFGCGYDQILPKAHPPPATAGVLGLGRGKIGVLPQLVAAGLTRNVVGHCLSSK 217
Query: 225 GGGYLFLGHDLVPSSGIAWTPM-----------SRDLLEKHYSSGPAELLF 264
GGGYLF G L+P+ G+AWTP+ RD L++ Y+ + L F
Sbjct: 218 GGGYLFFGDTLIPTLGVAWTPLLSPEYTFFFHICRDRLQRDYTFFKSVLEF 268
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 138/280 (49%), Positives = 182/280 (65%), Gaps = 7/280 (2%)
Query: 49 AVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP 108
AVF + G+VYP G+Y VT+ IG+P K Y LDIDTGSDLTW+QC+APC C P LY P
Sbjct: 38 AVFQLNGDVYPTGHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKP 97
Query: 109 -KNNLVACNDPFCSAFHLPE--NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG 165
KN LV C C+ H + N +C QCDY++ Y D SSLGVLVTD+F L L N
Sbjct: 98 TKNKLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNS 157
Query: 166 SLLGPRLIFGCGYNQR--NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV 223
S + P FGCGY+Q+ G T G+LGLG G S++SQL+ LG+T+NVLGHCLS
Sbjct: 158 SSVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLST 217
Query: 224 RGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSS 283
GGG+LF G ++VP+S W PM R +YS G L F +S G+K ++++FDSGS+
Sbjct: 218 NGGGFLFFGDNVVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGST 277
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
YTYF +Q Y+ T+ ++ L K L+ ++ +LP+CWKG
Sbjct: 278 YTYFAAQPYQATVSALKAGLS-KSLQQVSDP-SLPLCWKG 315
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 193/316 (61%), Gaps = 16/316 (5%)
Query: 11 LLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIG 70
L VLL+ F PS ++ + STAVF + G VYP+G+Y VT+ IG
Sbjct: 30 LAVLLLLPPFA---------PSPARAATPGKSLSSASTAVFQLQGAVYPIGHYYVTMNIG 80
Query: 71 NPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPENI 129
+P K Y LD+DTGSDLTW+QC+APC C P Y P KN +V C C++ L N
Sbjct: 81 DPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWYKPTKNKIVPCAASLCTS--LTPNK 138
Query: 130 RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQR--NPGPKP 187
+C QCDY++ Y D SSLGVL+ D+F L L N S + L FGCGY+Q+ G
Sbjct: 139 KCAVPQQCDYQIKYTDKASSLGVLIADNFTLSLRNSSTVRANLTFGCGYDQQVGKNGAVQ 198
Query: 188 PPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMS 247
T G+LGLG G S+LSQL+ G+T+NVLGHC S GGG+LF G D+VP+S + W PM+
Sbjct: 199 AATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNGGGFLFFGDDIVPTSRVTWVPMA 258
Query: 248 RDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
R +YS G L F +S G+K ++++FDSGS+Y YF ++ Y+ T+ ++ L K
Sbjct: 259 RTTSGNYYSPGSGTLYFDRRSLGMKPMEVVFDSGSTYAYFAAEPYQATVSALKAGLS-KS 317
Query: 308 LEDTAEEKALPVCWKG 323
L++ ++ +LP+CWKG
Sbjct: 318 LKEVSDV-SLPLCWKG 332
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 134/284 (47%), Positives = 183/284 (64%), Gaps = 12/284 (4%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
STAVF + G+VYP G+Y VT+ IGNP K Y LD+DTGSDLTW+QC+APC C P LY
Sbjct: 37 STAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY 96
Query: 107 HP-KNNLVACNDPFCSAFHLPE--NIRCEANDQCDYEVLYADHGSSLGVLVTDHF--PLR 161
P N LV C + C+A H + N +C + QCDY++ Y D SS GVL+ D F P+R
Sbjct: 97 RPTANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMR 156
Query: 162 LTNGSLLGPRLIFGCGYNQR--NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGH 219
+N + P L FGCGY+Q+ G G+LGLG G S++SQL+ G+T+NV+GH
Sbjct: 157 SSN---IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGH 213
Query: 220 CLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFD 279
CLS GGG+LF G D+VPSS + W PM++ +YS G L F +S G+K ++++FD
Sbjct: 214 CLSTNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFD 273
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SGS+YTYF +Q Y+ + ++ L K L+ ++ LP+CWKG
Sbjct: 274 SGSTYTYFTAQPYQAVVSALKGGLS-KSLKQVSDP-TLPLCWKG 315
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 134/284 (47%), Positives = 183/284 (64%), Gaps = 12/284 (4%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
STAVF + G+VYP G+Y VT+ IGNP K Y LD+DTGSDLTW+QC+APC C P LY
Sbjct: 37 STAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY 96
Query: 107 HP-KNNLVACNDPFCSAFHLPE--NIRCEANDQCDYEVLYADHGSSLGVLVTDHF--PLR 161
P N LV C + C+A H + N +C + QCDY++ Y D SS GVL+ D F P+R
Sbjct: 97 RPTANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMR 156
Query: 162 LTNGSLLGPRLIFGCGYNQR--NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGH 219
+N + P L FGCGY+Q+ G G+LGLG G S++SQL+ G+T+NV+GH
Sbjct: 157 SSN---IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGH 213
Query: 220 CLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFD 279
CLS GGG+LF G D+VPSS + W PM++ +YS G L F +S G+K ++++FD
Sbjct: 214 CLSTNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFD 273
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SGS+YTYF +Q Y+ + ++ L K L+ ++ LP+CWKG
Sbjct: 274 SGSTYTYFTAQPYQAVVSALKGGLS-KSLKQVSDP-TLPLCWKG 315
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 132/281 (46%), Positives = 184/281 (65%), Gaps = 13/281 (4%)
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP- 108
+F + GNVYP G+Y VT+ IGNP K Y LD+DTGSDLTW+QC+APC C P LY P
Sbjct: 41 IFQLQGNVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPT 100
Query: 109 KNNLVACNDPFCSAFHLPE--NIRCEANDQCDYEVLYADHGSSLGVLVTDHF--PLRLTN 164
N+LV C + C+A H N +C + QCDY++ Y D SS GVL+ D+F P+R +N
Sbjct: 101 ANSLVPCANALCTALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLPMRSSN 160
Query: 165 GSLLGPRLIFGCGYNQR--NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
+ P L FGCGY+Q+ G T G+LGLG G S++SQL+ G+T+NVLGHCLS
Sbjct: 161 ---IRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCLS 217
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGS 282
GGG+LF G D+VP+S + W PM++ + +YS G L F +S G+K ++++FDSGS
Sbjct: 218 TNGGGFLFFGDDIVPTSRVTWVPMAK-ISGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGS 276
Query: 283 SYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
+YTYF +Q Y+ + ++ L K L+ ++ +LP+CWKG
Sbjct: 277 TYTYFTAQPYQAVVSALKSGLS-KSLKQVSDP-SLPLCWKG 315
>gi|356507650|ref|XP_003522577.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 326
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 135/270 (50%), Positives = 170/270 (62%), Gaps = 25/270 (9%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFH 124
+++ I + +LYELDIDTGSDLTW Q +APC GCTLP + L P LV C D C+A H
Sbjct: 1 MSITITSSSELYELDIDTGSDLTWFQWDAPCQGCTLPRDKLNKPHCKLVKCGDRLCAAIH 60
Query: 125 LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPG 184
+ ++QCDYEV YAD GSSLGVLV D+ L+ T+GSL P L
Sbjct: 61 --SEPCADPDEQCDYEVEYADQGSSLGVLVLDNIALKFTSGSLARPIL------------ 106
Query: 185 PKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWT 244
A +GL GK SILSQL SLGL RNV+GHCLS RGGG+LF G L+P SG+ WT
Sbjct: 107 -----AAPDMGLATGKTSILSQLHSLGLIRNVVGHCLSRRGGGFLFFGDQLIPQSGVVWT 161
Query: 245 PM----SRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMR 300
P+ S HY +GPA++ F GK+T +KGL++ FDSGSSYT FNS A+K + L+
Sbjct: 162 PLLQNSSVTYTRPHYKTGPADMFFNGKATSVKGLELTFDSGSSYTXFNSHAHKALVGLIT 221
Query: 301 KDLKGKPLEDTAEEKALPVCWKG--TWKCL 328
D+KGK E+ +LP+CWK T+K L
Sbjct: 222 NDIKGKSFSRATEDPSLPICWKNPKTFKSL 251
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 135/293 (46%), Positives = 184/293 (62%), Gaps = 16/293 (5%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S V + GNVYP+G++ +T+ IG+P K Y LDIDTGS LTW+QC+APCT C + P LY
Sbjct: 22 SAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLY 81
Query: 107 HPK-NNLVACNDPFCSAFH--LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
P LV C D C+ + L + RC + QCDY + Y D SS+GVLV D F L +
Sbjct: 82 KPTPKKLVTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSAS 140
Query: 164 NGSLLGPRLI-FGCGYNQ-RNPGPKPPPTAGVLGLGLGKASILSQLQSLG-LTRNVLGHC 220
NG+ P I FGCGY+Q + P P +LGL GK ++LSQL+S G +T++VLGHC
Sbjct: 141 NGT--NPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 198
Query: 221 LSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG--LQIIF 278
+S +GGG+LF G VP+SG+ WTPM+R+ K+YS G L F S I + +IF
Sbjct: 199 ISSKGGGFLFFGDAQVPTSGVTWTPMNRE--HKYYSPGHGTLHFDSNSKAISAAPMAVIF 256
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGK---PLEDTAEEKALPVCWKGTWKCL 328
DSG++YTYF +Q Y+ TL +++ L + E T +++AL VCWKG K +
Sbjct: 257 DSGATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIV 309
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 147/350 (42%), Positives = 191/350 (54%), Gaps = 36/350 (10%)
Query: 9 MGLL--VLLMFATFQGCF----------------SEANQPPSKKKSTQSTAAHRFG---- 46
MG+L V L+F F C + N K S S +R G
Sbjct: 1 MGVLTNVFLVFVLFCVCMCVSQQADVYRLQPKYPAADNDEEGSKASFVSRDTNRIGRRLQ 60
Query: 47 --STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES 104
TA+F + GNV P G Y VT+ +GNP K Y LD+D+GS+LTW+QC+APC C P
Sbjct: 61 AHQTAIFSLKGNVVPYGLYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHP 120
Query: 105 LYH-PKNNLVACNDPFCSAFHLPE---NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL 160
LY K +LV DP C+A + EA+ +CDY+V YADHG S G LV D
Sbjct: 121 LYKLKKGSLVPSKDPLCAAVQAGSGHYHNHKEASQRCDYDVAYADHGYSEGFLVRDSVRA 180
Query: 161 RLTNGSLLGPRLIFGCGYNQRNPGP-KPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGH 219
LTN ++L +FGCGYNQR P T G+LGLG G AS+ SQ GL +NV+GH
Sbjct: 181 LLTNKTVLTANSVFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGH 240
Query: 220 CL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK-----STGIK 272
C+ + R GGY+F G DLV +S + W PM KHY G A++ FG K G K
Sbjct: 241 CIFGAGRDGGYMFFGDDLVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGDGKK 300
Query: 273 GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
IIFDSGS+YTYF +QAY L +++++L GK LE + + L +CW+
Sbjct: 301 LGGIIFDSGSTYTYFTNQAYGAFLSVVKENLSGKQLEQDSSDSFLSLCWR 350
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 131/312 (41%), Positives = 179/312 (57%), Gaps = 25/312 (8%)
Query: 31 PSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQ 90
P++ S+ A S++VFP+ GNVYP G Y + +GNPP+ Y LDIDT SDLTW+Q
Sbjct: 176 PNRNYKLASSNAAAVDSSSVFPVRGNVYPDGLYFTYILVGNPPRPYYLDIDTASDLTWIQ 235
Query: 91 CNAPCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPENI-RCEANDQCDYEVLYADHGS 148
C+APCT C +LY P ++N+V D C H + CE QCDYE+ YADH S
Sbjct: 236 CDAPCTSCAKGANALYKPRRDNIVTPKDSLCVELHRNQKAGYCETCQQCDYEIEYADHSS 295
Query: 149 SLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQR----NPGPKPPPTAGVLGLGLGKASIL 204
S+GVL D L + NGS + FGC Y+Q+ N K T G+LGL K S+
Sbjct: 296 SMGVLARDELHLTMANGSSTNLKFNFGCAYDQQGLLLNTLVK---TDGILGLSKAKVSLP 352
Query: 205 SQLQSLGLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPM---------SRDLLEK 253
SQL + G+ NV+GHCL+ V GGGY+FLG D VP G++W PM +++
Sbjct: 353 SQLANRGIINNVVGHCLANDVVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKL 412
Query: 254 HYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAE 313
+Y SGP L GG+ ++ +I+FDSGSSYTYF +AY + + K + G+ L
Sbjct: 413 NYGSGPLSL--GGQERRVR--RIVFDSGSSYTYFTKEAYSELVASL-KQVSGEALIQDTS 467
Query: 314 EKALPVCWKGTW 325
+ LP CW+ +
Sbjct: 468 DPTLPFCWRAKF 479
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 121/262 (46%), Positives = 166/262 (63%), Gaps = 12/262 (4%)
Query: 69 IGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPE 127
IGNP K Y LD+DTGSDLTW+QC+APC C P LY P N LV C + C+A H +
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTALHSGQ 60
Query: 128 --NIRCEANDQCDYEVLYADHGSSLGVLVTDHF--PLRLTNGSLLGPRLIFGCGYNQR-- 181
N +C + QCDY++ Y D SS GVL+ D F P+R +N + P L FGCGY+Q+
Sbjct: 61 GSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN---IRPGLTFGCGYDQQVG 117
Query: 182 NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGI 241
G G+LGLG G S++SQL+ G+T+NV+GHCLS GGG+LF G D+VPSS +
Sbjct: 118 KNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGGFLFFGDDVVPSSRV 177
Query: 242 AWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRK 301
W PM++ +YS G L F +S G+K ++++FDSGS+YTYF +Q Y+ + ++
Sbjct: 178 TWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQAVVSALKG 237
Query: 302 DLKGKPLEDTAEEKALPVCWKG 323
L K L+ ++ LP+CWKG
Sbjct: 238 GLS-KSLKQVSDP-TLPLCWKG 257
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 123/286 (43%), Positives = 165/286 (57%), Gaps = 10/286 (3%)
Query: 48 TAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYH 107
TA +PI GN+YP G Y + ++IGNP KLY LD+DTGSDLTW+QC+APC C + P LY
Sbjct: 16 TAAYPIGGNIYPDGLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYD 75
Query: 108 PKN-NLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNG 165
PK +V C P C+ C + QCDYEV Y D S++G+LV D L LTNG
Sbjct: 76 PKRARVVDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLVLTNG 135
Query: 166 SLLGPRLIFGCGYNQRNPGPKPPP-TAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-- 222
+ R + GCGY+Q+ K P T GV+GL K S+ SQL + G+ NV+GHCL+
Sbjct: 136 TRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGG 195
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ-----II 277
GGGYLF G LVP+ G+ WTPM L + Y + + +GG+ ++G +
Sbjct: 196 SNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELEGTTDDVGGAM 255
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
FDSG+S+TY AY L + + + LE + LP CW+G
Sbjct: 256 FDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCWRG 301
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 132/309 (42%), Positives = 180/309 (58%), Gaps = 20/309 (6%)
Query: 31 PSKKKSTQSTAAHRF-GSTAVFPITGNVYPLGYYSVTLKIGNPP--KLYELDIDTGSDLT 87
P K ST+A ST +FP+ GNVYP G Y + +G P + Y LDIDTGSDLT
Sbjct: 165 PVKVNDVLSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSDLT 224
Query: 88 WVQCNAPCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPE-NIRCEANDQCDYEVLYAD 145
W+QC+APCT C LY P K+NLV ++PFC + CE+ QCDYE+ YAD
Sbjct: 225 WIQCDAPCTSCAKGANQLYKPRKDNLVRSSEPFCVEVQRNQLTEHCESCHQCDYEIEYAD 284
Query: 146 HGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQR----NPGPKPPPTAGVLGLGLGKA 201
H S+GVL D F L+L NGSL ++FGCGY+Q+ N K T G+LGL K
Sbjct: 285 HSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLK---TDGILGLSRAKI 341
Query: 202 SILSQLQSLGLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGP 259
S+ SQL S G+ NV+GHCL+ + G GY+F+G DLVPS G+ W PM + Y
Sbjct: 342 SLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQV 401
Query: 260 AELLFGGKSTGIKGL-----QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEE 314
++ +G + G +++FD+GSSYTYF +QAY + +++ + D ++E
Sbjct: 402 TKMSYGNAMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDE 461
Query: 315 KALPVCWKG 323
ALP+CW+
Sbjct: 462 -ALPICWRA 469
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 132/291 (45%), Positives = 181/291 (62%), Gaps = 16/291 (5%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S V + GNVYP+G++ VT+ IG+P K Y LDIDTGS LTW+QC+ PC C P LY
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81
Query: 107 HPK-NNLVACNDPFCSAFH--LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
P+ V C + C+ + L + ++C +QC Y + Y GSS+GVL+ D F L +
Sbjct: 82 KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVG-GSSIGVLIVDSFSLPAS 140
Query: 164 NGSLLGPRLI-FGCGYNQ-RNPGPKPPPTAGVLGLGLGKASILSQLQSLG-LTRNVLGHC 220
NG+ P I FGCGYNQ +N P P G+LGLG GK ++LSQL+S G +T++VLGHC
Sbjct: 141 NGT--NPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHC 198
Query: 221 LSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG--LQIIF 278
+S +G G+LF G VP+SG+ W+PM+R+ KHYS L F S I +++IF
Sbjct: 199 ISSKGKGFLFFGDAKVPTSGVTWSPMNRE--HKHYSPRQGTLQFNSNSKPISAAPMEVIF 256
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDL--KGKPLEDTAE-EKALPVCWKGTWK 326
DSG++YTYF Q Y TL +++ L + K L + E ++AL VCWKG K
Sbjct: 257 DSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDK 307
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/309 (42%), Positives = 177/309 (57%), Gaps = 20/309 (6%)
Query: 31 PSKKKSTQSTAAHRF-GSTAVFPITGNVYPLGYYSVTLKIGNPP--KLYELDIDTGSDLT 87
P K ST+A ST +FP+ GNVYP G Y + +G P + Y LDIDTGS+LT
Sbjct: 170 PVKVNDVLSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELT 229
Query: 88 WVQCNAPCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPE-NIRCEANDQCDYEVLYAD 145
W+QC+APCT C LY P K+NLV ++ FC + CE QCDYE+ YAD
Sbjct: 230 WIQCDAPCTSCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYAD 289
Query: 146 HGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQR----NPGPKPPPTAGVLGLGLGKA 201
H S+GVL D F L+L NGSL ++FGCGY+Q+ N K T G+LGL K
Sbjct: 290 HSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLK---TDGILGLSRAKI 346
Query: 202 SILSQLQSLGLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGP 259
S+ SQL S G+ NV+GHCL+ + G GY+F+G DLVPS G+ W PM D Y
Sbjct: 347 SLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQV 406
Query: 260 AELLFGGKSTGIKGL-----QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEE 314
++ +G + G +++FD+GSSYTYF +QAY + + +++ G L +
Sbjct: 407 TKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSL-QEVSGLELTRDDSD 465
Query: 315 KALPVCWKG 323
+ LP+CW+
Sbjct: 466 ETLPICWRA 474
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 130/305 (42%), Positives = 177/305 (58%), Gaps = 28/305 (9%)
Query: 39 STAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC 98
+T+ F S+ +FP+ G+VYP G Y + +G+PP+ Y LD+DTGSDLTW+QC+APCT C
Sbjct: 77 ATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSC 136
Query: 99 TLPPESLYHPKN-NLVACNDPFCSAFHLPENIR---CEANDQCDYEVLYADHGSSLGVLV 154
P LY PK NLV D C + N++ CE +QCDYE+ YADH SS+GVL
Sbjct: 137 AKGPNPLYKPKKGNLVPLKDSLC--VEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLA 194
Query: 155 TDHFPLRLTNGSLLGPRLIFGCGYNQR----NPGPKPPPTAGVLGLGLGKASILSQLQSL 210
+D L L NGSL ++FGC Y+Q+ N K T G+LGL K S+ SQL S
Sbjct: 195 SDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAK---TDGILGLSKAKVSLPSQLASQ 251
Query: 211 GLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH---YSSGPAELLFG 265
+ NVLGHCL+ GGGY+FLG D VP G+AW PM L H Y S ++ G
Sbjct: 252 RIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPM----LNSHSPNYHSQIMKISHG 307
Query: 266 GKSTGI-----KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVC 320
+ + + +++FD+GSSYTYF +AY + + KD+ + L + LPVC
Sbjct: 308 SRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDVSDEGLIQDGSDPTLPVC 366
Query: 321 WKGTW 325
W+ +
Sbjct: 367 WRAKF 371
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 130/305 (42%), Positives = 177/305 (58%), Gaps = 28/305 (9%)
Query: 39 STAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC 98
+T+ F S+ +FP+ G+VYP G Y + +G+PP+ Y LD+DTGSDLTW+QC+APCT C
Sbjct: 290 ATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSC 349
Query: 99 TLPPESLYHPKN-NLVACNDPFCSAFHLPENIR---CEANDQCDYEVLYADHGSSLGVLV 154
P LY PK NLV D C + N++ CE +QCDYE+ YADH SS+GVL
Sbjct: 350 AKGPNPLYKPKKGNLVPLKDSLC--VEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLA 407
Query: 155 TDHFPLRLTNGSLLGPRLIFGCGYNQR----NPGPKPPPTAGVLGLGLGKASILSQLQSL 210
+D L L NGSL ++FGC Y+Q+ N K T G+LGL K S+ SQL S
Sbjct: 408 SDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAK---TDGILGLSKAKVSLPSQLASQ 464
Query: 211 GLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH---YSSGPAELLFG 265
+ NVLGHCL+ GGGY+FLG D VP G+AW PM L H Y S ++ G
Sbjct: 465 RIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPM----LNSHSPNYHSQIMKISHG 520
Query: 266 GKSTGI-----KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVC 320
+ + + +++FD+GSSYTYF +AY + + KD+ + L + LPVC
Sbjct: 521 SRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDVSDEGLIQDGSDPTLPVC 579
Query: 321 WKGTW 325
W+ +
Sbjct: 580 WRAKF 584
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 127/278 (45%), Positives = 173/278 (62%), Gaps = 16/278 (5%)
Query: 62 YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK-NNLVACNDPFC 120
++ +T+ IG+P K Y LDIDTGS LTW+QC+APCT C + P LY P LV C D C
Sbjct: 402 HFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKLVTCADSLC 461
Query: 121 SAFH--LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI-FGCG 177
+ + L + RC + QCDY + Y D SS+GVLV D F L +NG+ P I FGCG
Sbjct: 462 TDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSASNGT--NPTTIAFGCG 518
Query: 178 YNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLG-LTRNVLGHCLSVRGGGYLFLGHDL 235
Y+Q P P +LGL GK ++LSQL+S G +T++VLGHC+S +GGG+LF G
Sbjct: 519 YDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGGFLFFGDAQ 578
Query: 236 VPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG--LQIIFDSGSSYTYFNSQAYK 293
VP+SG+ WTPM+R+ K+YS G L F S I + +IFDSG++YTYF +Q Y+
Sbjct: 579 VPTSGVTWTPMNRE--HKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYFAAQPYQ 636
Query: 294 TTLDLMRKDLKGK---PLEDTAEEKALPVCWKGTWKCL 328
TL +++ L + E T +++AL VCWKG K +
Sbjct: 637 ATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIV 674
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 89/224 (39%), Positives = 123/224 (54%), Gaps = 31/224 (13%)
Query: 112 LVACNDPFCSAFHLPENIRCEAND-----QCDYEVLYADHGSSLGVLVTDHFPL-RLTNG 165
+V +DP A H E+ R + QCDYE+ YAD S++G L+ D F L R+
Sbjct: 1 MVRADDPLYVALH--EDGRSGDGNHMSPTQCDYEIKYADGASTIGALIVDQFSLPRIATR 58
Query: 166 SLLGPRLIFGCGYNQR--NPGPKPPPTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLS 222
P L FGCGYNQ + P G+LGL GK S +SQL+ LG+ T++V+GHCLS
Sbjct: 59 ----PNLPFGCGYNQGIGENFQQTSPVNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLS 114
Query: 223 VRGGGYLFLGH---DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFD 279
GGG LF+G +LV L +YS G A L F S G+ + ++FD
Sbjct: 115 SGGGGLLFVGDGDGNLVL------------LHANYYSPGSATLYFDRHSLGMNPMDVVFD 162
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SGS+YTYF +Q Y+ T+ ++ L LE + + +LP+CWKG
Sbjct: 163 SGSTYTYFTAQPYQATVYAIKGGLSSTSLEQVS-DPSLPLCWKG 205
>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
Length = 410
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 131/291 (45%), Positives = 180/291 (61%), Gaps = 16/291 (5%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S V + GNVYP+G++ VT+ I +P K Y LDIDTGS LTW+QC+ PC C P LY
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81
Query: 107 HPK-NNLVACNDPFCSAFH--LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
P+ V C + C+ + L + ++C +QC Y + Y GSS+GVL+ D F L +
Sbjct: 82 KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVG-GSSIGVLIVDSFSLPAS 140
Query: 164 NGSLLGPRLI-FGCGYNQ-RNPGPKPPPTAGVLGLGLGKASILSQLQSLG-LTRNVLGHC 220
NG+ P I FGCGYNQ +N P P G+LGLG GK ++LSQL+S G +T++VLGHC
Sbjct: 141 NGT--NPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHC 198
Query: 221 LSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG--LQIIF 278
+S +G G+LF G VP+SG+ W+PM+R+ KHYS L F S I +++IF
Sbjct: 199 ISSKGKGFLFFGDAKVPTSGVTWSPMNRE--HKHYSPRQGTLHFNSNSKPISAAPMEVIF 256
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDL--KGKPLEDTAE-EKALPVCWKGTWK 326
DSG++YTYF Q Y TL +++ L + K L + E ++AL VCWKG K
Sbjct: 257 DSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDK 307
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 126/299 (42%), Positives = 173/299 (57%), Gaps = 22/299 (7%)
Query: 44 RFGSTAV------FPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG 97
R G ++V F + GN+YP G Y + L +G+PPKLY LD+DTGSDLTW QC+APC
Sbjct: 15 RLGKSSVGNHSVRFHVGGNIYPDGLYYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRN 74
Query: 98 CTLPPESLYHPKN-NLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVT 155
C + P LY+PK +V C+ P C+ + C ++ QCDYEV YAD S++GVLV
Sbjct: 75 CAIGPHGLYNPKKAKVVDCHLPVCAQIQQGGSYECNSDVKQCDYEVEYADGSSTMGVLVE 134
Query: 156 DHFPLRLTNGSLLGPRLIFGCGYNQRNPGPK-PPPTAGVLGLGLGKASILSQLQSLGLTR 214
D +RLTNG+L+ + I GCGY+Q+ K P T GV+GL K ++ +QL G+ +
Sbjct: 135 DTLTVRLTNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIK 194
Query: 215 NVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK 272
NVLGHCL+ GGGYLF G +LVPS G+ WTPM Y + + +GG S +
Sbjct: 195 NVLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSIRYGGDSLVLN 254
Query: 273 GLQ--------IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
+ ++FDSG+S+TY QAY + L + K L + LP CW+G
Sbjct: 255 NDEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQ---SGLLRVKSDTTLPYCWRG 310
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 134/299 (44%), Positives = 177/299 (59%), Gaps = 31/299 (10%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S++VFP++GNVYP G Y L++GNPPK Y LD+DTGSDLTW+QC+APC C LY
Sbjct: 176 SSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHVLY 235
Query: 107 HP-KNNLVACNDPFCSAFHLPENIRCEAND----QCDYEVLYADHGSSLGVLVTDHFPLR 161
P ++N+V+ D C + +N + +D QCDYE+ YADH SSLGVLV D L
Sbjct: 236 KPTRSNVVSSVDALC--LDVQKNQKNGHHDESLLQCDYEIQYADHSSSLGVLVRDELHLV 293
Query: 162 LTNGSLLGPRLIFGCGYNQR----NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVL 217
TNGS ++FGCGY+Q N K T G++GL K S+ QL S GL +NV+
Sbjct: 294 TTNGSKTKLNVVFGCGYDQAGLLLNTLGK---TDGIMGLSRAKVSLPYQLASKGLIKNVV 350
Query: 218 GHCLSVR--GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSS-------GPAELLFGGKS 268
GHCLS GGGY+FLG D VP G+ W PM+ L Y + G +L F G+S
Sbjct: 351 GHCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLRFDGQS 410
Query: 269 TGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMR--KDLKGKPLEDTAEEKALPVCWKGTW 325
K +++FDSGSSYTYF +AY LDL+ ++ G L + LP+CW+ +
Sbjct: 411 ---KVGKMVFDSGSSYTYFPKEAY---LDLVASLNEVSGLGLVQDDSDTTLPICWQANF 463
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/301 (44%), Positives = 176/301 (58%), Gaps = 31/301 (10%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S++VFP++GNVYP G Y L++GNPPK Y LD+DTGSDLTW+QC+APC C Y
Sbjct: 178 SSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHVQY 237
Query: 107 HP-KNNLVACNDPFCSAFHLPENIRCEAND----QCDYEVLYADHGSSLGVLVTDHFPLR 161
P ++N+V+ D C + +N + +D QCDYE+ YADH SSLGVLV D L
Sbjct: 238 KPTRSNVVSSVDSLC--LDVQKNQKNGHHDESLLQCDYEIQYADHSSSLGVLVRDELHLV 295
Query: 162 LTNGSLLGPRLIFGCGYNQR----NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVL 217
TNGS ++FGCGY+Q N K T G++GL K S+ QL S GL +NV+
Sbjct: 296 TTNGSKTKLNVVFGCGYDQEGLILNTLAK---TDGIMGLSRAKVSLPYQLASKGLIKNVV 352
Query: 218 GHCLSVR--GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSS-------GPAELLFGGKS 268
GHCLS GGGY+FLG D VP G+ W PM+ L Y + G +L F G+S
Sbjct: 353 GHCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKFDGQS 412
Query: 269 TGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMR--KDLKGKPLEDTAEEKALPVCWKGTWK 326
K ++ FDSGSSYTYF +AY LDL+ ++ G L + LP+CW+ ++
Sbjct: 413 ---KVGKVFFDSGSSYTYFPKEAY---LDLVASLNEVSGLGLVQDDSDTTLPICWQANFQ 466
Query: 327 C 327
Sbjct: 467 I 467
>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
Length = 411
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 131/292 (44%), Positives = 181/292 (61%), Gaps = 17/292 (5%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
S V + GNVYP+G++ VT+ I +P K Y LDIDTGS LTW+QC+ PC C P LY
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81
Query: 107 HPK-NNLVACNDPFCSAFH--LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
P+ V C + C+ + L + ++C +QC Y + Y GSS+GVL+ D F L +
Sbjct: 82 KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVG-GSSIGVLIVDSFSLPAS 140
Query: 164 NGSLLGPRLI-FGCGYNQ-RNPGPKPPPTAGVLGLGLGKASILSQLQSLG-LTRNVLGHC 220
NG+ P I FGCGYNQ +N P P G+LGLG GK ++LSQL+S G +T++VLGHC
Sbjct: 141 NGT--NPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHC 198
Query: 221 LSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG-GKSTGIKG--LQII 277
+S +G G+LF G VP+SG+ W+PM+R+ KHYS L F K + I +++I
Sbjct: 199 ISSKGKGFLFFGDAKVPTSGVTWSPMNRE--HKHYSPRQGTLHFNSNKQSPISAAPMEVI 256
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDL--KGKPLEDTAE-EKALPVCWKGTWK 326
FDSG++YTYF Q Y TL +++ L + K L + E ++AL VCWKG K
Sbjct: 257 FDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDK 308
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 132/312 (42%), Positives = 172/312 (55%), Gaps = 29/312 (9%)
Query: 28 NQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLT 87
N P K S AA S+A+FP+ GN+YP G PP+ Y LD DTGSDLT
Sbjct: 165 NGPHKISKLASSNAAAAMDSSAIFPVRGNLYPDG----------PPQPYYLDFDTGSDLT 214
Query: 88 WVQCNAPCTGCTLPPESLYHPKN-NLVACNDPFCSAFHLPENI-RCEANDQCDYEVLYAD 145
W+QC+APCT C + Y P+ N+V D C + CE DQCDYE+ YAD
Sbjct: 215 WIQCDAPCTSCAKGANAWYKPRRGNIVPPKDLLCMEVQRNQKAGYCETCDQCDYEIEYAD 274
Query: 146 HGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKP-PPTAGVLGLGLGKASIL 204
H SS+GVL TD L + NGSL IFGC Y+Q+ K T G+LGL K S+
Sbjct: 275 HSSSMGVLATDKLLLMVANGSLTKLNFIFGCAYDQQGLLLKTLVKTDGILGLSRAKVSLP 334
Query: 205 SQLQSLGLTRNVLGHCLSVR--GGGYLFLGHDLVPSSGIAWTPM---------SRDLLEK 253
SQL S G+ NV+GHCL+ GGGY+FLG D VP G+AW PM ++++
Sbjct: 335 SQLASQGIINNVIGHCLTTDLGGGGYMFLGDDFVPRWGMAWVPMLDSPSMEFYHTEVVKL 394
Query: 254 HYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAE 313
+Y S P L GG + +K I+FDSGSSYTYF +AY + L ++ G L +
Sbjct: 395 NYGSSPLSL--GGMESRVK--HILFDSGSSYTYFPKEAY-SELVASLNEVSGAGLVQSTS 449
Query: 314 EKALPVCWKGTW 325
+ LP+CW+ +
Sbjct: 450 DTTLPLCWRANF 461
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 122/299 (40%), Positives = 166/299 (55%), Gaps = 9/299 (3%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
+KK + A+ ST + PI GNV+P G Y ++ +GNPP+ Y LD+DTGSDLTW+QC
Sbjct: 160 TKKLDVKGAASAGTNSTVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQC 219
Query: 92 NAPCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSL 150
+APCT C P LY P K +V D C +N CE QCDYE+ YAD SS+
Sbjct: 220 DAPCTNCAKGPHPLYKPAKEKIVPPRDSLCQELQGDQNY-CETCKQCDYEIEYADRSSSM 278
Query: 151 GVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQS 209
GVL D L TNG +FGC Y+Q+ P T G+LGL S+ SQL S
Sbjct: 279 GVLAKDDMHLIATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLAS 338
Query: 210 LGLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK 267
G+ NV GHC++ GGGY+FLG D VP G+ W P+ R + Y + ++ +G +
Sbjct: 339 KGIISNVFGHCITRETNGGGYMFLGDDYVPRWGMTWAPI-RGGPDNLYHTEAQKVNYGDQ 397
Query: 268 STGI-KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTW 325
+Q+IFDSGSSYTY + YK +D +++D + + LP+CWK +
Sbjct: 398 ELHAGNSVQVIFDSGSSYTYLPEEMYKNLIDAIKED--SPSFVQDSSDTTLPLCWKADF 454
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 131/303 (43%), Positives = 172/303 (56%), Gaps = 21/303 (6%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
K S +A+ + S+AVFP+ G++YP G Y + +G PP+ Y LDIDTGSDLTWVQC+A
Sbjct: 170 KPSKLISASLKSDSSAVFPVRGDIYPDGLYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDA 229
Query: 94 PCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLP-ENIRCEANDQCDYEVLYADHGSSLG 151
PC+ C LY P + N+V+ D C + +C A QC+YEV YAD SSLG
Sbjct: 230 PCSSCGKGRSPLYKPRRENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLG 289
Query: 152 VLVTDHFPLRLTNGSLLGPRLIFGCGYNQR----NPGPKPPPTAGVLGLGLGKASILSQL 207
VLV D F LR +NGSL IFGC Y+Q+ N K T G+LGL K S+ SQL
Sbjct: 290 VLVKDEFTLRFSNGSLTKLNAIFGCAYDQQGLLLNTLSK---TDGILGLSRAKVSLPSQL 346
Query: 208 QSLGLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG 265
S G+ NV+GHCL+ GGGYLFLG D VP G+AW M Y + + +G
Sbjct: 347 ASRGIINNVVGHCLTGDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYG 406
Query: 266 G-----KSTGIKGLQIIFDSGSSYTYFNSQA-YKTTLDLMRKDLKGKPLEDTAEEKALPV 319
+ G Q++FDSGSSYTYF +A Y+ +L G L+D+++ +
Sbjct: 407 SIPLSLDTWGSSREQVVFDSGSSYTYFTKEAYYQLVANLEEVSAFGLILQDSSDT----I 462
Query: 320 CWK 322
CWK
Sbjct: 463 CWK 465
>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
Length = 423
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 132/304 (43%), Positives = 182/304 (59%), Gaps = 29/304 (9%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC-------- 98
S V + GNVYP+G++ VT+ IG+P K Y LDIDTGS LTW+QC+ PC C
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSLFY 81
Query: 99 -----TLPPESLYHPK-NNLVACNDPFCSAFH--LPENIRCEANDQCDYEVLYADHGSSL 150
+ P LY P+ V C + C+ + L + ++C +QC Y + Y GSS+
Sbjct: 82 PRLIGSFVPHGLYKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVG-GSSI 140
Query: 151 GVLVTDHFPLRLTNGSLLGPRLI-FGCGYNQ-RNPGPKPPPTAGVLGLGLGKASILSQLQ 208
GVL+ D F L +NG+ P I FGCGYNQ +N P P G+LGLG GK ++LSQL+
Sbjct: 141 GVLIVDSFSLPASNGT--NPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLK 198
Query: 209 SLG-LTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK 267
S G +T++VLGHC+S +G G+LF G VP+SG+ W+PM+R+ KHYS L F
Sbjct: 199 SQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVTWSPMNRE--HKHYSPRQGTLQFNSN 256
Query: 268 STGIKG--LQIIFDSGSSYTYFNSQAYKTTLDLMRKDL--KGKPLEDTAE-EKALPVCWK 322
S I +++IFDSG++YTYF Q Y TL +++ L + K L + E ++AL VCWK
Sbjct: 257 SKPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWK 316
Query: 323 GTWK 326
G K
Sbjct: 317 GKDK 320
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/301 (40%), Positives = 166/301 (55%), Gaps = 13/301 (4%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
+ A R STA+ PI GNV+P G Y ++ IGNPP+ Y LD+DTGSDLTW+QC+A
Sbjct: 158 RMEVAKAATARTNSTALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDA 217
Query: 94 PCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGV 152
PCT C P LY P K +V D C +N CE QCDYE+ YAD SS+GV
Sbjct: 218 PCTNCAKGPHPLYKPAKEKIVPPRDLLCQELQGNQNY-CETCKQCDYEIEYADQSSSMGV 276
Query: 153 LVTDHFPLRLTNGSLLGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLG 211
L D + TNG +FGC Y+Q+ P T G+LGL S SQL S G
Sbjct: 277 LARDDMHMIATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHG 336
Query: 212 LTRNVLGHCLSVR--GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK-- 267
+ NV GHC++ GGGY+FLG D VP G+ WT + R + Y + + +G +
Sbjct: 337 IIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSI-RSGPDNLYHTQAHHVKYGDQQL 395
Query: 268 ---STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
+Q+IFDSGSSYTY ++ Y+ + ++ G ++DT+ ++ LP+CWK
Sbjct: 396 RRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAIKYASPGF-VQDTS-DRTLPLCWKAD 453
Query: 325 W 325
+
Sbjct: 454 F 454
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 121/301 (40%), Positives = 164/301 (54%), Gaps = 13/301 (4%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
K AA STA+ PI GNV+P G Y ++ +GNPP+ Y LD+DTGSDLTW+QC+A
Sbjct: 158 KMEVAKAAAAGTNSTALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDA 217
Query: 94 PCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGV 152
PCT C P LY P K +V D C +N CE QCDYE+ YAD SS+GV
Sbjct: 218 PCTNCAKGPHPLYKPTKEKIVPPRDLLCQELQGNQNY-CETCKQCDYEIEYADQSSSMGV 276
Query: 153 LVTDHFPLRLTNGSLLGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLG 211
L D L TNG +FGC Y+Q+ P T G+LGL S+ SQL S G
Sbjct: 277 LARDDMHLIATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHG 336
Query: 212 LTRNVLGHCLSVR--GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST 269
+ N+ GHC++ GGGY+FLG D VP GI WT + R + Y + + +G +
Sbjct: 337 IISNIFGHCITREQGGGGYMFLGDDYVPRWGITWTSI-RSGPDNLYHTEAHHVKYGDQQL 395
Query: 270 GIK-----GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
++ +Q+IFDSGSSYTY + Y+ + ++ G + ++ LP+CWK
Sbjct: 396 RMREQAGNTVQVIFDSGSSYTYLPDEIYENLVAAIKYASPG--FVQDSSDRTLPLCWKAD 453
Query: 325 W 325
+
Sbjct: 454 F 454
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 122/303 (40%), Positives = 172/303 (56%), Gaps = 14/303 (4%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
+K ++ ++T+A ST + PI GNV+P G Y ++ +GNPP+ Y LD+DTGSDLTW+QC
Sbjct: 164 NKLEAKRATSAGT-NSTVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQC 222
Query: 92 NAPCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSL 150
+APCT C P LY P K +V D C +N C QCDYE+ YAD SS+
Sbjct: 223 DAPCTNCAKGPHPLYKPAKEKIVPPRDLLCQELQGDQNY-CATCKQCDYEIEYADRSSSM 281
Query: 151 GVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQS 209
GVL D + TNG +FGC Y+Q+ P T G+LGL S+ SQL S
Sbjct: 282 GVLAKDDMHMIATNGGREKLDFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLAS 341
Query: 210 LGLTRNVLGHCLSVR--GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK 267
G+ NV GHC++ GGGY+FLG D VP G+ W P+ R + Y + ++ +G +
Sbjct: 342 QGIISNVFGHCITKEPNGGGYMFLGDDYVPRWGMTWAPI-RGGPDNLYHTEAQKVNYGDQ 400
Query: 268 STGIKG-----LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
+ G +Q+IFDSGSSYTY + YK + ++ D ++DT+ + LP+CWK
Sbjct: 401 QLRMHGQAGSSIQVIFDSGSSYTYLPDEIYKKLVTAIKYDYPSF-VQDTS-DTTLPLCWK 458
Query: 323 GTW 325
+
Sbjct: 459 ADF 461
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 121/301 (40%), Positives = 165/301 (54%), Gaps = 13/301 (4%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
+ A R STA+ PI GNV+P G Y ++ IGNPP+ Y LD+DTGSDLTW+QC+A
Sbjct: 158 RMEVAKAATARTNSTALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDA 217
Query: 94 PCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGV 152
PCT P LY P K +V D C +N CE QCDYE+ YAD SS+GV
Sbjct: 218 PCTNFAKGPHPLYKPAKEKIVPPRDLLCQELQGNQNY-CETCKQCDYEIEYADQSSSMGV 276
Query: 153 LVTDHFPLRLTNGSLLGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLG 211
L D + TNG +FGC Y+Q+ P T G+LGL S SQL S G
Sbjct: 277 LARDDMHMIATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHG 336
Query: 212 LTRNVLGHCLSVR--GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK-- 267
+ NV GHC++ GGGY+FLG D VP G+ WT + R + Y + + +G +
Sbjct: 337 IIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSI-RSGPDNLYHTQAHHVKYGDQQL 395
Query: 268 ---STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
+Q+IFDSGSSYTY ++ Y+ + ++ G ++DT+ ++ LP+CWK
Sbjct: 396 RRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAIKYASPGF-VQDTS-DRTLPLCWKAD 453
Query: 325 W 325
+
Sbjct: 454 F 454
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 120/284 (42%), Positives = 158/284 (55%), Gaps = 13/284 (4%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY 106
+T + GN+YP G Y + + IG P KLY LD+DTGSDLTW+QC+APC C P LY
Sbjct: 7 ATVFSQLRGNIYPDGLYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLY 66
Query: 107 HPKN-NLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTN 164
PK LV C P C+ + C QCDY+V YAD S++GVL+ D L LTN
Sbjct: 67 DPKKARLVDCRVPLCALVQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITLLLTN 126
Query: 165 GSLLGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS- 222
G+ I GCGY+Q+ P T GV+GL K S+ SQL G+ RNV+GHCL+
Sbjct: 127 GTRSKTTAIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLAG 186
Query: 223 -VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHY--SSGPAELLFGGKSTGIKGLQIIFD 279
GGGYLF G LVP+ G+ WTP+ + + SG A+ K+ I G ++FD
Sbjct: 187 GSNGGGYLFFGDSLVPALGMTWTPIMGKSITGNIGGKSGDAD----DKTGDIGG--VMFD 240
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SG+S+TY +AY L M ++ L + LP CW+G
Sbjct: 241 SGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPFCWRG 284
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 117/296 (39%), Positives = 163/296 (55%), Gaps = 13/296 (4%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
K + AA STA+ PI GNV+P G Y ++ +GNPP+ Y LD+DTGSDLTW+QC+A
Sbjct: 174 KLEVKKAAAAGTNSTALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDA 233
Query: 94 PCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGV 152
PCT C P LY P K +V D C +N CE QCDYE+ YAD SS+GV
Sbjct: 234 PCTNCAKGPHPLYKPAKEKIVPPKDLLCQELQGNQNY-CETCKQCDYEIEYADRSSSMGV 292
Query: 153 LVTDHFPLRLTNGSLLGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLG 211
L D + TNG +FGC Y+Q+ P T G+LGL S+ SQL + G
Sbjct: 293 LARDDMHIITTNGGREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQG 352
Query: 212 LTRNVLGHCLSV--RGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST 269
+ NV GHC++ GGGY+FLG D VP G+ TP+ R + + + ++ +G +
Sbjct: 353 IISNVFGHCITRDPNGGGYMFLGDDYVPRWGMTSTPI-RSAPDNLFHTEAQKVYYGDQQL 411
Query: 270 GIKG-----LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVC 320
++G +Q+IFDSGSSYTY + YK + ++ + ++ LP+C
Sbjct: 412 SMRGASGNSVQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPN--FVQDSSDRTLPLC 465
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 117/296 (39%), Positives = 163/296 (55%), Gaps = 13/296 (4%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
K + AA STA+ PI GNV+P G Y ++ +GNPP+ Y LD+DTGSDLTW+QC+A
Sbjct: 175 KLEVKKAAAAGTNSTALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDA 234
Query: 94 PCTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGV 152
PCT C P LY P K +V D C +N CE QCDYE+ YAD SS+GV
Sbjct: 235 PCTNCAKGPHPLYKPAKEKIVPPKDLLCQELQGNQNY-CETCKQCDYEIEYADRSSSMGV 293
Query: 153 LVTDHFPLRLTNGSLLGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLG 211
L D + TNG +FGC Y+Q+ P T G+LGL S+ SQL + G
Sbjct: 294 LARDDMHIITTNGGREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQG 353
Query: 212 LTRNVLGHCLSV--RGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST 269
+ NV GHC++ GGGY+FLG D VP G+ TP+ R + + + ++ +G +
Sbjct: 354 IISNVFGHCITRDPNGGGYMFLGDDYVPRWGMTSTPI-RSAPDNLFHTEAQKVYYGDQQL 412
Query: 270 GIKG-----LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVC 320
++G +Q+IFDSGSSYTY + YK + ++ + ++ LP+C
Sbjct: 413 SMRGASGNSVQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPN--FVQDSSDRTLPLC 466
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 124/297 (41%), Positives = 168/297 (56%), Gaps = 13/297 (4%)
Query: 35 KSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP 94
K + A R S+A+ PI GNV+P G Y ++ IGNPP+ Y LD+DTGSDLTW+QC+AP
Sbjct: 131 KPDSAGAEARENSSALLPIRGNVFPDGQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAP 190
Query: 95 CTGCTLPPESLYHP-KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVL 153
CT C P LY P K N+V D +C +N + + QCDYE+ YAD SS+G+L
Sbjct: 191 CTNCAKGPHPLYKPEKPNVVPPRDSYCQELQGNQNY-GDTSKQCDYEITYADRSSSMGIL 249
Query: 154 VTDHFPLRLTNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGL 212
D+ L +G +FGCGY+Q+ N P T G+LGL S+ +QL S G+
Sbjct: 250 ARDNMQLITADGERENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGI 309
Query: 213 TRNVLGHCLSV--RGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTG 270
NV GHC++ GGY+FLG D VP G+ W P+ R+ E YS+ ++ +G +
Sbjct: 310 ISNVFGHCIAADPSNGGYMFLGDDYVPRWGMTWMPI-RNGPENLYSTEVQKVNYGDQQLN 368
Query: 271 I-----KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
+ K Q+IFDSGSSYTY Y T L K L L+D + ++ LP C K
Sbjct: 369 VRRKAGKLTQVIFDSGSSYTYLPHDDY-TNLIASLKSLSPSLLQDES-DRTLPFCMK 423
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 123/291 (42%), Positives = 166/291 (57%), Gaps = 13/291 (4%)
Query: 41 AAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTL 100
A R S+A+ PI GNV+P G Y ++ IGNPP+ Y LD+DTGSDLTW+QC+APCT C
Sbjct: 137 AEARENSSALLPIRGNVFPDGQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAK 196
Query: 101 PPESLYHP-KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFP 159
P LY P K N+V D +C +N + + QCDYE+ YAD SS+G+L D+
Sbjct: 197 GPHPLYKPEKPNVVPPRDSYCQELQGNQNY-GDTSKQCDYEITYADRSSSMGILARDNMQ 255
Query: 160 LRLTNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
L +G +FGCGY+Q+ N P T G+LGL S+ +QL S G+ NV G
Sbjct: 256 LITADGERENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFG 315
Query: 219 HCLSV--RGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI----- 271
HC++ GGY+FLG D VP G+ W P+ R+ E YS+ ++ +G + +
Sbjct: 316 HCIAADPSNGGYMFLGDDYVPRWGMTWMPI-RNGPENLYSTEVQKVNYGDQQLNVRRKAG 374
Query: 272 KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
K Q+IFDSGSSYTY Y T L K L L+D + ++ LP C K
Sbjct: 375 KLTQVIFDSGSSYTYLPHDDY-TNLIASLKSLSPSLLQDES-DRTLPFCMK 423
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/274 (41%), Positives = 159/274 (58%), Gaps = 14/274 (5%)
Query: 62 YYSVTLKIGNPP--KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP-KNNLVACNDP 118
YY+ L +G P + Y LDIDTGS+LTW+QC+APCT C LY P K+NLV ++
Sbjct: 30 YYTRIL-VGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEA 88
Query: 119 FCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
FC + CE QCDYE+ YADH S+GVL D F L+L NGSL ++FGCG
Sbjct: 89 FCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCG 148
Query: 178 YNQRNPGPKP-PPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS--VRGGGYLFLGHD 234
Y+Q+ T G+LGL K S+ SQL S G+ NV+GHCL+ + G GY+F+G D
Sbjct: 149 YDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSD 208
Query: 235 LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGL-----QIIFDSGSSYTYFNS 289
LVPS G+ W PM D Y ++ +G + G +++FD+GSSYTYF +
Sbjct: 209 LVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPN 268
Query: 290 QAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
QAY + + +++ G L ++ LP+CW+
Sbjct: 269 QAYSQLVTSL-QEVSGLELTRDDSDETLPICWRA 301
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 125/289 (43%), Positives = 167/289 (57%), Gaps = 22/289 (7%)
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA---PCTGCTLPPESLY 106
VF + G+V+P G++ VT+ IG P K Y LDIDTGS+LTW++C+A PC C P LY
Sbjct: 27 VFKLGGDVHPTGHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLY 86
Query: 107 HPKNNLVACNDPFCSAFH--LPENIRC-EANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
PK LV C DP C A H L C E DQC Y++ YAD +SLGVL+ D F L
Sbjct: 87 RPK-KLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKF--SLP 143
Query: 164 NGSLLGPRLIFGCGYNQ----RNPGPKPPPTAGVLGLGLGKASILSQLQSLG-LTRNVLG 218
GS + FGCGY+Q + P+ P G+LGLG G ++SQL+ G +++NV+G
Sbjct: 144 TGS--ARNIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVIG 201
Query: 219 HCLSVRGGGYLFLGHDLVPSSG---IAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ 275
HCLS +GGGYLF+G + VPSS I +SR+ HYS G A L G G K +
Sbjct: 202 HCLSSKGGGYLFIGEENVPSSHLHIIYIYCISRE--PNHYSPGQATLHLGRNPIGTKPFK 259
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAE-EKALPVCWKG 323
IFDSGS+YTY + + ++ L L+ ++ + L +CWKG
Sbjct: 260 AIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKG 308
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 200 bits (509), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 124/289 (42%), Positives = 166/289 (57%), Gaps = 20/289 (6%)
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA---PCTGCTLPPESLY 106
VF + G+VYP+G++ VT+ IG P + Y LDIDTGS TW++C+A PC C P LY
Sbjct: 26 VFKLDGSVYPVGHFYVTMNIGEPAEPYFLDIDTGSSFTWLECHAKDGPCKTCNKVPHPLY 85
Query: 107 H-PKNNLVACNDPFCSAFH--LPENIRCE--ANDQCDYEVLYADHGSSLGVLVTDHFPLR 161
+ LV C DP C A H L +C +QCDY+V Y D SSLGVL+ D F L
Sbjct: 86 RLTRKKLVPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLDKFSLP 145
Query: 162 LTNGSLLGPRLIFGCGYNQ----RNPGPKPPPTAGVLGLGLGKASILSQLQSLG-LTRNV 216
T G+ + FGCGY+Q + P+ P G+LGLG G + SQL+ G +++NV
Sbjct: 146 -TGGAR---NIAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVSKNV 201
Query: 217 LGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDL--LEKHYSSGPAELLFGGKSTGIKGL 274
+GHCLS +GGGYLF+G + VPSS + W PM+ HYS G A L G K L
Sbjct: 202 IGHCLSSKGGGYLFIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQATLHLDSNPIGTKPL 261
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
+ IFDSGS+YTY + + ++ L L+ ++ ALP+CWKG
Sbjct: 262 KAIFDSGSTYTYLPENLHAQLVSALKASLSKSSLKQVSDP-ALPLCWKG 309
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 109/275 (39%), Positives = 152/275 (55%), Gaps = 13/275 (4%)
Query: 57 VYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP-KNNLVAC 115
V P Y ++ IGNPP+ Y LDIDTGSD TW+ C+APCT CT P +Y P + +V
Sbjct: 10 VVPERQYYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVHP 69
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
DP C +N CE QCDYE+ YAD SS GVL D+ L +G + +FG
Sbjct: 70 RDPLCEELQGNQNY-CETCKQCDYEITYADRSSSKGVLARDNMQLTTADGEMKNVDFVFG 128
Query: 176 CGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV--RGGGYLFLG 232
C +NQ+ P T G+LGL G S+ +QL + G+ NV GHC++ GGY+FLG
Sbjct: 129 CAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPSSGGYMFLG 188
Query: 233 HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG-----LQIIFDSGSSYTYF 287
D VP G+ W P+ R+ YS+ ++ +G + ++G Q+IFDSGSSYTYF
Sbjct: 189 DDYVPRWGMTWVPI-RNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQVIFDSGSSYTYF 247
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
+ Y + L+ G +++ ++ LP C K
Sbjct: 248 PHEIYTNLIALLEDASPGFVRDES--DQTLPFCMK 280
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 117/286 (40%), Positives = 160/286 (55%), Gaps = 21/286 (7%)
Query: 51 FPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT----LPPESLY 106
FP+ GNVYP+G++ TL IG P K Y LD+DTGS+LTW++C+ P GC PP Y
Sbjct: 26 FPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYY 85
Query: 107 HPKN-NL-VACNDPFCSAFH--LPENIRCEAND--QCDYEVLYADHGSSLGVLVTDHFPL 160
P + NL V C P C A +P C ND +C YE+ Y G S G L TD +
Sbjct: 86 TPADGNLKVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT-GKSEGDLATDIISV 144
Query: 161 RLTNGSLLGPRLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGLTR-NVLG 218
+ R+ FGCGY Q P PP P G+LGLG+GKA + +QL+ + + NV+G
Sbjct: 145 NGRDKK----RIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIG 200
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQII 277
HCLS +G G L++G P+ G+ W PM L +YS G AE+ + G + +
Sbjct: 201 HCLSSKGKGVLYVGDFNPPTRGVTWAPMRESLF--YYSPGLAEVFIDKQPIRGNPTFEAV 258
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
FDSGS+YT+ +Q Y + +R L LE+ + +ALP+CWKG
Sbjct: 259 FDSGSTYTHVPAQIYNEIVSKVRVTLSESSLEEV-KGRALPLCWKG 303
>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 117/286 (40%), Positives = 159/286 (55%), Gaps = 21/286 (7%)
Query: 51 FPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT----LPPESLY 106
FP+ GNVYP+G++ TL IG P K Y LD+DTGS+LTW++C+ P GC PP Y
Sbjct: 26 FPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYY 85
Query: 107 HPKN-NL-VACNDPFCSAFH--LPENIRCEAND--QCDYEVLYADHGSSLGVLVTDHFPL 160
P + NL V C P C A +P C ND +C YE+ Y G S G L TD +
Sbjct: 86 TPADGNLKVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT-GKSEGDLATDIISV 144
Query: 161 RLTNGSLLGPRLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGLTR-NVLG 218
+ R+ FGCGY Q P PP P G+LGLG+GKA +QL+ + + NV+G
Sbjct: 145 NGRDKK----RIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIG 200
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQII 277
HCLS +G G L++G P+ G+ W PM L +YS G AE+ + G + +
Sbjct: 201 HCLSSKGKGVLYVGDFNPPTRGVTWAPMRESLF--YYSPGLAEVFIDKQPIRGNPTFEAV 258
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
FDSGS+YT+ +Q Y + +R L LE+ + +ALP+CWKG
Sbjct: 259 FDSGSTYTHVPAQIYNEIVSKVRGTLSESSLEEV-KGRALPLCWKG 303
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 113/292 (38%), Positives = 157/292 (53%), Gaps = 19/292 (6%)
Query: 40 TAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT 99
AA GSTA V P Y ++ IGNP + Y LD+DTGS LTW+QC+APCT CT
Sbjct: 112 AAAAEEGSTAA------VLPERQYYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCT 165
Query: 100 LPPESLYHP-KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHF 158
P LY P K N+V D C +N C+ QCDYE+ YAD SS GVL D+
Sbjct: 166 KGPHPLYKPAKENIVPPRDSHCQELQGNQNY-CDTCKQCDYEIAYADRSSSAGVLARDNM 224
Query: 159 PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTRNVL 217
L +G L+FGC ++Q+ P ++ G+LGL G S+ +QL G+ NV
Sbjct: 225 ELITADGERENMDLVFGCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVF 284
Query: 218 GHCLSV--RGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI---- 271
GHC++ G Y+FLG D VP G+ W P+ R+ E YS+ ++ +G + +
Sbjct: 285 GHCIATDPSGSAYMFLGDDYVPRWGMTWVPV-RNGPEDVYSTVVQKVNYGCQELNVREQA 343
Query: 272 -KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
K Q+IFDSGSSYTYF + Y + + + G +++ ++ LP C K
Sbjct: 344 GKLTQVIFDSGSSYTYFPHEIYTSLITSLEAVSPGFVRDES--DQTLPFCMK 393
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 114/286 (39%), Positives = 156/286 (54%), Gaps = 21/286 (7%)
Query: 51 FPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT----LPPESLY 106
FP+ GNVYP+G++ TL IG P K Y LD+DTGS+LTW++C+ P GC PP Y
Sbjct: 26 FPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRPPHPYY 85
Query: 107 HPKNN--LVACNDPFCSAFH--LPENIRCEAND--QCDYEVLYADHGSSLGVLVTDHFPL 160
P + V C P C A +P C ND +C YE+ Y G S G L TD +
Sbjct: 86 TPADGKLKVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT-GKSEGDLATDIISV 144
Query: 161 RLTNGSLLGPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTR-NVLG 218
+ R+ FGCGY Q P PP G+LGLG+GKA +QL+ L + + NV+G
Sbjct: 145 NGRDKK----RIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVIG 200
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQII 277
HCLS +G G L++G P+ G+ W PM L +YS G AE+ + G + +
Sbjct: 201 HCLSSKGKGVLYVGDFNPPTRGVTWAPMRESLF--YYSPGLAEVFIDKQPIRGNPTFEAV 258
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
FDSGS+YT+ +Q Y + +R LE+ + +ALP+CWKG
Sbjct: 259 FDSGSTYTHVPAQIYNEIVSKVRGTFSESSLEEV-KGRALPLCWKG 303
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 107/295 (36%), Positives = 153/295 (51%), Gaps = 28/295 (9%)
Query: 47 STAVFP--ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA-PCTGCTLPPE 103
++ +FP + GN++P G Y + +G+PP+ Y LD+DTGS TWVQC+A PC C
Sbjct: 142 NSTLFPHSLAGNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAH 201
Query: 104 SLYHPKNNLVA--CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLR 161
LY P A +DP C E + E +QCDYE+ YAD SS+GV V D
Sbjct: 202 PLYRPARTADALPASDPLC------EGAQHENPNQCDYEISYADGSSSMGVYVRDSMQFV 255
Query: 162 LTNGSLLGPRLIFGCGYNQRNPGPKP-PPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC 220
+G ++FGCGY+Q+ T GVLGL S+ +QL S G+ N GHC
Sbjct: 256 GEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHC 315
Query: 221 LSVR---GGGYLFLGHDLVPSSGIAWTPMS-------RDLLEKHYSSGPAELLFGGKSTG 270
+S GGYLFLG D +P G+ W P+ R K + G +L GK T
Sbjct: 316 MSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLT- 374
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTW 325
Q++FD+GS+YTYF +A + +++ + ++D + +K LP C K +
Sbjct: 375 ----QVVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDDS-DKTLPFCMKSDF 424
>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
partial [Brachypodium distachyon]
Length = 354
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/288 (38%), Positives = 148/288 (51%), Gaps = 53/288 (18%)
Query: 45 FGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES 104
F S+ VF + G+VYP G+ VT+ IG K Y LDIDTGS LTW+
Sbjct: 18 FCSSMVFELHGDVYPTGHIYVTMSIGEQEKPYFLDIDTGSTLTWL--------------- 62
Query: 105 LYHPKNNLVACNDPFCSAFHLPENIR----CEAN-DQCDYEVLYADHGSSLGVLVTDHFP 159
E++R C+ N +QCDY+V YA SSLGVL+ D F
Sbjct: 63 ----------------------EDVRFKHDCKENPNQCDYDVRYAGGESSLGVLIADKFS 100
Query: 160 LRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLG-LTRNVLG 218
L G P L FGCGY+Q G P GVLG+G G + SQL+ G + NV+G
Sbjct: 101 L---PGRDARPTLTFGCGYDQEG-GKAEMPVDGVLGIGRGTRDLASQLKQQGAIAENVIG 156
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG---GKSTGIKGLQ 275
HCL ++GGGYLF GH+ VPSS + W PM + +YS G A L F G + ++
Sbjct: 157 HCLRIQGGGYLFFGHEKVPSSVVTWVPMVPN--NHYYSPGLAALHFNGNLGNPISVAPME 214
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
++ DSGS+YTY ++ Y+ + ++ L L + ALPVCW G
Sbjct: 215 VVIDSGSTYTYMPTETYRRLVFVVIASLSKSSLT-LVRDPALPVCWAG 261
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 101/261 (38%), Positives = 137/261 (52%), Gaps = 27/261 (10%)
Query: 47 STAVFP--ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA-PCTGCTLPPE 103
++ +FP + GN++P G Y + +G+PP+ Y LD+DTGS TWVQC+A PC C
Sbjct: 142 NSTLFPHSLAGNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAH 201
Query: 104 SLYHPKNNLVA--CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLR 161
LY P A +DP C E + E +QCDYE+ YAD SS+GV V D
Sbjct: 202 PLYRPARTADALPASDPLC------EGAQHENPNQCDYEISYADGSSSMGVYVRDSMQFV 255
Query: 162 LTNGSLLGPRLIFGCGYNQRNPGPKP-PPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC 220
+G ++FGCGY+Q+ T GVLGL S+ +QL S G+ N GHC
Sbjct: 256 GEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHC 315
Query: 221 LSVR---GGGYLFLGHDLVPSSGIAWTPMS-------RDLLEKHYSSGPAELLFGGKSTG 270
+S GGYLFLG D +P G+ W P+ R K + G +L GK T
Sbjct: 316 MSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLT- 374
Query: 271 IKGLQIIFDSGSSYTYFNSQA 291
Q++FD+GS+YTYF +A
Sbjct: 375 ----QVVFDTGSTYTYFPDEA 391
>gi|388518245|gb|AFK47184.1| unknown [Lotus japonicus]
Length = 245
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 80/136 (58%), Positives = 104/136 (76%), Gaps = 4/136 (2%)
Query: 189 PTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMS- 247
P G+LGLG GK+S++SQL S GL RNV+GHCLS +GGGY+F G D+ SS + WTPMS
Sbjct: 11 PLDGMLGLGRGKSSLVSQLNSQGLVRNVVGHCLSAQGGGYIFFG-DVYDSSRLTWTPMSS 69
Query: 248 RDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
RDL KHY +G AEL+FGGK TGI GL +FD+GSSYTYFNS AY+ + ++K+L GKP
Sbjct: 70 RDL--KHYVAGAAELIFGGKKTGIGGLLPVFDTGSSYTYFNSNAYQAVISWLKKELAGKP 127
Query: 308 LEDTAEEKALPVCWKG 323
L++ +++ LP+CW G
Sbjct: 128 LKEAPDDQTLPLCWHG 143
>gi|308080924|ref|NP_001183009.1| uncharacterized protein LOC100501329 [Zea mays]
gi|238008766|gb|ACR35418.1| unknown [Zea mays]
Length = 205
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 67/140 (47%), Positives = 81/140 (57%), Gaps = 2/140 (1%)
Query: 39 STAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC 98
A R STA+ PI GNV+P G Y ++ IGNPP+ Y LD+DTGSDLTW+QC+APCT C
Sbjct: 66 KAATARTNSTALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNC 125
Query: 99 TLPPESLYHP-KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDH 157
P LY P K +V D C +N CE QCDYE+ YAD SS+GVL D
Sbjct: 126 AKGPHPLYKPAKEKIVPPRDLLCQELQGNQNY-CETCKQCDYEIEYADQSSSMGVLARDD 184
Query: 158 FPLRLTNGSLLGPRLIFGCG 177
+ TNG +FGC
Sbjct: 185 MHMIATNGGREKLDFVFGCA 204
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 102/350 (29%), Positives = 162/350 (46%), Gaps = 48/350 (13%)
Query: 1 MEEKGKRVMGLLVLLM---FATFQGCFSEANQPPSKKKSTQSTAAH------RFGSTAVF 51
ME + K + + V ++ FA+ F ++ K+K + +H R ++
Sbjct: 1 MELRRKLCIVVAVFVIVNEFASGNFVFKVQHKFAGKEKKLEHFKSHDTRRHSRMLASIDL 60
Query: 52 PITGN--VYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYH-- 107
P+ G+ V +G Y +K+G+PPK Y + +DTGSD+ WV C PC C +H
Sbjct: 61 PLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWVNCK-PCPECPSKTNLNFHLS 119
Query: 108 -------PKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL 160
+ V C+D FCS + ++ C+ C Y ++YAD +S G + D L
Sbjct: 120 LFDVNASSTSKKVGCDDDFCS--FISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTL 177
Query: 161 RLTNGSL----LGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRN 215
G L LG ++FGCG +Q G GV+G G S+LSQL + G +
Sbjct: 178 EQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKR 237
Query: 216 VLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTG---- 270
V HCL +V+GGG +G +V S + TPM + + HY+ +L G G
Sbjct: 238 VFSHCLDNVKGGGIFAVG--VVDSPKVKTTPMVPN--QMHYNV----MLMGMDVDGTALD 289
Query: 271 -----IKGLQIIFDSGSSYTYFNSQAYKTTLD--LMRKDLKGKPLEDTAE 313
++ I DSG++ YF Y + ++ L R+ +K +EDT +
Sbjct: 290 LPPSIMRNGGTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVEDTFQ 339
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 155/327 (47%), Gaps = 37/327 (11%)
Query: 17 FATFQGCFSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGN--VYPLGYYSVTLK 68
FA+ F ++ KKK+ + +H R ++ P+ G+ V +G Y +K
Sbjct: 20 FASANFVFKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIK 79
Query: 69 IGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYH----PKNNLVACNDPF 119
+G+PPK Y + +DTGSD+ W+ C PC C SL+ + V C+D F
Sbjct: 80 LGSPPKEYHVQVDTGSDILWINCK-PCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDF 138
Query: 120 CSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFG 175
CS + ++ C+ C Y ++YAD +S G + D L G L LG ++FG
Sbjct: 139 CS--FISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFG 196
Query: 176 CGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGH 233
CG +Q G GV+G G S+LSQL + G + V HCL +V+GGG +G
Sbjct: 197 CGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVG- 255
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS-----TGIKGLQIIFDSGSSYTYFN 288
+V S + TPM + + HY+ + G S + ++ I DSG++ YF
Sbjct: 256 -VVDSPKVKTTPMVPN--QMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFP 312
Query: 289 SQAYKTTLD--LMRKDLKGKPLEDTAE 313
Y + ++ L R+ +K +E+T +
Sbjct: 313 KVLYDSLIETILARQPVKLHIVEETFQ 339
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 155/327 (47%), Gaps = 37/327 (11%)
Query: 17 FATFQGCFSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGN--VYPLGYYSVTLK 68
FA+ F ++ KKK+ + +H R ++ P+ G+ V +G Y +K
Sbjct: 20 FASANFVFKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIK 79
Query: 69 IGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYH----PKNNLVACNDPF 119
+G+PPK Y + +DTGSD+ W+ C PC C SL+ + V C+D F
Sbjct: 80 LGSPPKEYHVQVDTGSDILWINCK-PCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDF 138
Query: 120 CSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFG 175
CS + ++ C+ C Y ++YAD +S G + D L G L LG ++FG
Sbjct: 139 CS--FISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFG 196
Query: 176 CGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGH 233
CG +Q G GV+G G S+LSQL + G + V HCL +V+GGG +G
Sbjct: 197 CGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVG- 255
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS-----TGIKGLQIIFDSGSSYTYFN 288
+V S + TPM + + HY+ + G S + ++ I DSG++ YF
Sbjct: 256 -VVDSPKVKTTPMVPN--QMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFP 312
Query: 289 SQAYKTTLD--LMRKDLKGKPLEDTAE 313
Y + ++ L R+ +K +E+T +
Sbjct: 313 KVLYDSLIETILARQPVKLHIVEETFQ 339
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 101/342 (29%), Positives = 153/342 (44%), Gaps = 45/342 (13%)
Query: 12 LVLLMFATFQGCFSEAN-------QPPSKKKSTQSTAAH------RFGSTAVFPITGNVY 58
LV+++ F C S N + K++S + H R S P+ GN +
Sbjct: 16 LVVIVELGFVVCLSNGNYVFNVQHKFAGKERSLSALKQHDARRHRRILSAVDLPLGGNGH 75
Query: 59 PL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPKNN 111
P G Y + +GNPPK Y + +DTGSD+ WV C A C C + +LY P+++
Sbjct: 76 PAEAGLYFAKIGLGNPPKDYYVQVDTGSDILWVNC-ANCDKCPTKSDLGVKLTLYDPQSS 134
Query: 112 LVA----CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
A C+D FC+A + C + C Y V+Y D S+ G V D+ G+L
Sbjct: 135 TSATRIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNL 194
Query: 168 ----LGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL- 221
+IFGCG Q G G+LG G +S++SQL + G + V HCL
Sbjct: 195 QTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLD 254
Query: 222 SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK---------STGIK 272
+V+GGG +G + P + TPM + + HY+ E+ GG TG +
Sbjct: 255 NVKGGGIFAIGEVVSPK--VNTTPMVPN--QPHYNVVMKEIEVGGNVLELPTDIFDTGDR 310
Query: 273 GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEE 314
II DSG++ Y Y++ + + + G L E+
Sbjct: 311 RGTII-DSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQ 351
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 102/332 (30%), Positives = 152/332 (45%), Gaps = 44/332 (13%)
Query: 24 FSEANQPPSKKKSTQSTAAHRFGSTAVFPITGN--VYPLGYYSVTLKIGNPPKLYELDID 81
F+ + S+ KS S R + P+ G+ +G Y +K+G+PPK Y + +D
Sbjct: 37 FAGKEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVD 96
Query: 82 TGSDLTWVQCNAPCTGCTLP-----PESLYHPKNNL----VACNDPFCSAFHLPENIRCE 132
TGSD+ WV C APC C + P SLY K + V C D FCS E C
Sbjct: 97 TGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSET--CG 153
Query: 133 ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFGCGYNQRNP-GPKP 187
A C Y V+Y D +S G + D+ L G+L L ++FGCG NQ G
Sbjct: 154 AKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTD 213
Query: 188 PPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPM 246
G++G G SI+SQL + G T+ + HCL ++ GGG +G V S + TP+
Sbjct: 214 SAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVGE--VESPVVKTTPI 271
Query: 247 SRDLLEKHYS---------SGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLD 297
+ + HY+ P +L ST G II DSG++ Y Y + ++
Sbjct: 272 VPN--QVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTII-DSGTTLAYLPQNLYNSLIE 328
Query: 298 LM--RKDLKGKPLEDT--------AEEKALPV 319
+ ++ +K +++T +KA PV
Sbjct: 329 KITAKQQVKLHMVQETFACFSFTSNTDKAFPV 360
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 102/332 (30%), Positives = 152/332 (45%), Gaps = 44/332 (13%)
Query: 24 FSEANQPPSKKKSTQSTAAHRFGSTAVFPITGN--VYPLGYYSVTLKIGNPPKLYELDID 81
F+ + S+ KS S R + P+ G+ +G Y +K+G+PPK Y + +D
Sbjct: 33 FAGKEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVD 92
Query: 82 TGSDLTWVQCNAPCTGCTLP-----PESLYHPKNNL----VACNDPFCSAFHLPENIRCE 132
TGSD+ WV C APC C + P SLY K + V C D FCS E C
Sbjct: 93 TGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSET--CG 149
Query: 133 ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFGCGYNQRNP-GPKP 187
A C Y V+Y D +S G + D+ L G+L L ++FGCG NQ G
Sbjct: 150 AKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTD 209
Query: 188 PPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPM 246
G++G G SI+SQL + G T+ + HCL ++ GGG +G V S + TP+
Sbjct: 210 SAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVGE--VESPVVKTTPI 267
Query: 247 SRDLLEKHYS---------SGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLD 297
+ + HY+ P +L ST G II DSG++ Y Y + ++
Sbjct: 268 VPN--QVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTII-DSGTTLAYLPQNLYNSLIE 324
Query: 298 LM--RKDLKGKPLEDT--------AEEKALPV 319
+ ++ +K +++T +KA PV
Sbjct: 325 KITAKQQVKLHMVQETFACFSFTSNTDKAFPV 356
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 101/332 (30%), Positives = 151/332 (45%), Gaps = 44/332 (13%)
Query: 24 FSEANQPPSKKKSTQSTAAHRFGSTAVFPITGN--VYPLGYYSVTLKIGNPPKLYELDID 81
F+ + S+ KS S R + P+ G+ +G Y +K+G+PPK Y + +D
Sbjct: 36 FAGKEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVD 95
Query: 82 TGSDLTWVQCNAPCTGCTLP-----PESLYHPK----NNLVACNDPFCSAFHLPENIRCE 132
TGSD+ WV C APC C + P SLY K + V C D FCS E C
Sbjct: 96 TGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKASSTSKNVGCEDAFCSFIMQSET--CG 152
Query: 133 ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFGCGYNQRNP-GPKP 187
A C Y V+Y D +S G V D+ L G+L L ++FGCG NQ G
Sbjct: 153 AKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTE 212
Query: 188 PPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPM 246
G++G G S++SQL + G + + HCL ++ GGG +G V S + TP+
Sbjct: 213 SAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGGGIFAIGE--VESPVVKTTPL 270
Query: 247 SRDLLEKHYS---------SGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLD 297
+ + HY+ P +L ST G II DSG++ Y Y + ++
Sbjct: 271 VPN--QVHYNVILKGMDVDGEPIDLPPSLASTNGDGGTII-DSGTTLAYLPQNLYNSLIE 327
Query: 298 LM--RKDLKGKPLEDT--------AEEKALPV 319
+ ++ +K +++T +KA PV
Sbjct: 328 KITAKQQVKLHMVQETFACFSFTSNTDKAFPV 359
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 89/273 (32%), Positives = 127/273 (46%), Gaps = 32/273 (11%)
Query: 44 RFGSTAV-FPITG--NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC-- 98
+ S+AV P+ G + Y G Y +++G PP+ Y L +DTGSDL WV C+ PC GC
Sbjct: 14 KLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGCPA 72
Query: 99 -------TLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLG 151
+P + ++ V C+DP C+ C +QC Y Y D +LG
Sbjct: 73 FSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLG 132
Query: 152 VLVTD--HFPLRLTNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQ 208
LV D H+ + T +IFGCG+ Q + G++G G S SQL
Sbjct: 133 YLVEDVLHYMVNAT------ATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLA 186
Query: 209 SLGLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPM-----SRDLLEKHYSSGPAE 261
G T NV HCL RGGG L LG+ + P I +TP+ +++ + S A
Sbjct: 187 KQGKTPNVFAHCLDGGERGGGILVLGNVIEPD--IQYTPLVPYMYHYNVVLQSISVNNAN 244
Query: 262 LLFGGKSTGIKGLQ-IIFDSGSSYTYFNSQAYK 293
L K +Q IFDSG++ Y +AY+
Sbjct: 245 LTIDPKLFSNDVMQGTIFDSGTTLAYLPDEAYQ 277
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/317 (31%), Positives = 143/317 (45%), Gaps = 44/317 (13%)
Query: 24 FSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGNVYP--LGYYSVTLKIGNPPKL 75
F ++ + KS + AH R S P+ GN +P G Y + IG P K
Sbjct: 27 FRVQHKFKGRGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTPSKD 86
Query: 76 YELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPK----NNLVACNDPFCSAFHLP 126
Y + +DTGSD+ WV C A C C + +LY K ++ V C+D FCS + P
Sbjct: 87 YYVQVDTGSDILWVNC-AGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGP 145
Query: 127 ENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFGCGYNQRN 182
C+ QC Y VLY D S+ G V D +G+ ++FGCG Q
Sbjct: 146 LP-GCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSG 204
Query: 183 P-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSG 240
G G+LG G +S+LSQL S G + V HCL +V GGG +G + P
Sbjct: 205 ELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIGEVVEPKVN 264
Query: 241 IAWTPMSRDLLEKHYSSGPAELLFGG----------KSTGIKGLQIIFDSGSSYTYFNSQ 290
I TP+ ++ + HY+ E+ GG +S KG I DSG++ YF +
Sbjct: 265 I--TPLVQN--QAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG--TIIDSGTTLAYFPQE 318
Query: 291 AYKTTLDLMRKDLKGKP 307
Y + L+ K L +P
Sbjct: 319 VY---VPLIEKILSQQP 332
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/274 (32%), Positives = 127/274 (46%), Gaps = 32/274 (11%)
Query: 44 RFGSTAV-FPITG--NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC-- 98
+ S+AV P+ G + Y G Y +++G PP+ Y L +DTGSDL WV C+ PC GC
Sbjct: 14 KLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGCPA 72
Query: 99 -------TLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLG 151
+P + ++ V C+DP C+ C +QC Y Y D +LG
Sbjct: 73 FSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLG 132
Query: 152 VLVTD--HFPLRLTNGSLLGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQ 208
LV D H+ + T +IFGCG+ Q + G++G G S SQL
Sbjct: 133 YLVEDVLHYMVNAT------ATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLA 186
Query: 209 SLGLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPM-----SRDLLEKHYSSGPAE 261
G T NV HCL RGGG L LG+ + P I +TP+ +++ + S A
Sbjct: 187 KQGKTPNVFAHCLDGGERGGGILVLGNVIEPD--IQYTPLVPYMSHYNVVLQSISVNNAN 244
Query: 262 LLFGGKSTGIKGLQ-IIFDSGSSYTYFNSQAYKT 294
L K +Q IFDSG++ Y +AY+
Sbjct: 245 LTIDPKLFSNDVMQGTIFDSGTTLAYLPDEAYQA 278
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/320 (31%), Positives = 143/320 (44%), Gaps = 44/320 (13%)
Query: 21 QGCFSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGNVYP--LGYYSVTLKIGNP 72
F ++ + KS + AH R S P+ GN +P G Y + IG P
Sbjct: 105 NAVFRVQHKFKGRGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTP 164
Query: 73 PKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPK----NNLVACNDPFCSAF 123
K Y + +DTGSD+ WV C A C C + +LY K ++ V C+D FCS +
Sbjct: 165 SKDYYVQVDTGSDILWVNC-AGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLY 223
Query: 124 HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFGCGYN 179
P C+ QC Y VLY D S+ G V D +G+ ++FGCG
Sbjct: 224 DGPLP-GCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNK 282
Query: 180 QRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVP 237
Q G G+LG G +S+LSQL S G + V HCL +V GGG +G + P
Sbjct: 283 QSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIGEVVEP 342
Query: 238 SSGIAWTPMSRDLLEKHYSSGPAELLFGG----------KSTGIKGLQIIFDSGSSYTYF 287
I TP+ ++ + HY+ E+ GG +S KG I DSG++ YF
Sbjct: 343 KVNI--TPLVQN--QAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG--TIIDSGTTLAYF 396
Query: 288 NSQAYKTTLDLMRKDLKGKP 307
+ Y + L+ K L +P
Sbjct: 397 PQEVY---VPLIEKILSQQP 413
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/320 (31%), Positives = 143/320 (44%), Gaps = 44/320 (13%)
Query: 21 QGCFSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGNVYP--LGYYSVTLKIGNP 72
F ++ + KS + AH R S P+ GN +P G Y + IG P
Sbjct: 105 NAVFRVQHKFKGRGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTP 164
Query: 73 PKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPK----NNLVACNDPFCSAF 123
K Y + +DTGSD+ WV C A C C + +LY K ++ V C+D FCS +
Sbjct: 165 SKDYYVQVDTGSDILWVNC-AGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLY 223
Query: 124 HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFGCGYN 179
P C+ QC Y VLY D S+ G V D +G+ ++FGCG
Sbjct: 224 DGPLP-GCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNK 282
Query: 180 QRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVP 237
Q G G+LG G +S+LSQL S G + V HCL +V GGG +G + P
Sbjct: 283 QSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIGEVVEP 342
Query: 238 SSGIAWTPMSRDLLEKHYSSGPAELLFGG----------KSTGIKGLQIIFDSGSSYTYF 287
I TP+ ++ + HY+ E+ GG +S KG I DSG++ YF
Sbjct: 343 KVNI--TPLVQN--QAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG--TIIDSGTTLAYF 396
Query: 288 NSQAYKTTLDLMRKDLKGKP 307
+ Y + L+ K L +P
Sbjct: 397 PQEVY---VPLIEKILSQQP 413
>gi|62954897|gb|AAY23266.1| Similar to nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|77548966|gb|ABA91763.1| Aspartic proteinase Asp1 precursor, putative [Oryza sativa Japonica
Group]
Length = 307
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 118/233 (50%), Gaps = 44/233 (18%)
Query: 112 LVACNDPFCSAFHLPENIRCEAND-----QCDYEVLYADHGSSLGVLVTDHFPL-RLTNG 165
+V +DP A H E+ R + QCDYE+ YAD S++G L+ D F L R+
Sbjct: 1 MVRADDPLYVALH--EDGRSGDGNHMSPTQCDYEIKYADGASTIGALIVDQFSLPRIATR 58
Query: 166 SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKA-SILSQLQSLGL-TRNVLGHCLSV 223
P L FGCGYNQ G+G+ S L+ LG+ T++V+GHCLS
Sbjct: 59 ----PNLPFGCGYNQ----------------GIGENFQQTSPLKMLGIITKHVVGHCLSS 98
Query: 224 RGGGYLFLGH-------------DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTG 270
GGG LF+G L P + + + +L +YS G A L F S G
Sbjct: 99 GGGGLLFVGDGDGNLVLLHASLGSLCPIAISTPSSYNEPMLMNYYSPGSATLYFDRHSLG 158
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
+ + ++FDSGS+YTYF +Q Y+ T+ ++ L LE + + +LP+CWKG
Sbjct: 159 MNPMDVVFDSGSTYTYFTAQPYQATVYAIKGGLSSTSLEQVS-DPSLPLCWKG 210
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 155/343 (45%), Gaps = 56/343 (16%)
Query: 6 KRVMGLLVLLMFATFQGC---FSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGN 56
+ V+ L+LL F C F ++ +++S + +H R S + GN
Sbjct: 5 REVLVGLLLLSFCLPGFCNLVFEVQHKFKGRERSLNALKSHDVRRHGRLLSVIDLELGGN 64
Query: 57 VYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------LYH 107
+P G Y + IG+PP + + +DTGSD+ WV C C+ C P +S LY+
Sbjct: 65 GHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNC-VGCSNC--PKKSDIGVDLQLYN 121
Query: 108 PKNN----LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
PK++ L+ C+ PFCSA + C+ + C Y+V+Y D ++ G V D+ L+
Sbjct: 122 PKSSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRA 181
Query: 164 NG----SLLGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
G S ++FGCG Q G G+LG G +S++SQL + G + +
Sbjct: 182 VGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFA 241
Query: 219 HCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI- 276
HCL S+ GGG +G + P + TP+ + + HY+ ++ G G L +
Sbjct: 242 HCLDSISGGGIFAIGEVVEPK--LKTTPVVPN--QAHYN-----VVLNGVKVGDTALDLP 292
Query: 277 ------------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
I DSG++ Y Y L LM K L +P
Sbjct: 293 LGLFETSYKRGAIIDSGTTLAYLPDSIY---LPLMEKILGAQP 332
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 155/343 (45%), Gaps = 56/343 (16%)
Query: 6 KRVMGLLVLLMFATFQGC---FSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGN 56
+ V+ L+LL F C F ++ +++S + +H R S + GN
Sbjct: 5 REVLVGLLLLSFCLPGFCNLVFEVQHKFKGRERSLNALKSHDVRRHGRLLSVIDLELGGN 64
Query: 57 VYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------LYH 107
+P G Y + IG+PP + + +DTGSD+ WV C C+ C P +S LY+
Sbjct: 65 GHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNC-VGCSNC--PKKSDIGVDLQLYN 121
Query: 108 PKNN----LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
PK++ L+ C+ PFCSA + C+ + C Y+V+Y D ++ G V D+ L+
Sbjct: 122 PKSSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRA 181
Query: 164 NG----SLLGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
G S ++FGCG Q G G+LG G +S++SQL + G + +
Sbjct: 182 VGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFA 241
Query: 219 HCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI- 276
HCL S+ GGG +G + P + TP+ + + HY+ ++ G G L +
Sbjct: 242 HCLDSISGGGIFAIGEVVEPK--LXNTPVVPN--QAHYN-----VVLNGVKVGDTALDLP 292
Query: 277 ------------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
I DSG++ Y Y L LM K L +P
Sbjct: 293 LGLFETSYKRGAIIDSGTTLAYLPESIY---LPLMEKILGAQP 332
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 88/279 (31%), Positives = 128/279 (45%), Gaps = 32/279 (11%)
Query: 44 RFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP 101
R + A P+ G P G Y ++IG PPK Y + +DTGSD+ WV C C C
Sbjct: 62 RLLAAADVPLGGLGLPTDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNC-ISCNKCPRK 120
Query: 102 PE-----SLYHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGV 152
+ LY PK + V+C+ FC+A + + C N C+Y V+Y D S+ G
Sbjct: 121 SDLGIDLRLYDPKGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGY 180
Query: 153 LVTDHFPLRLTNGS----LLGPRLIFGCGYNQ-RNPGPKPPPTAGVLGLGLGKASILSQL 207
V+D +G +IFGCG Q + G G++G G S+LSQL
Sbjct: 181 FVSDSLQYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQL 240
Query: 208 QSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGG 266
+ G + + HCL +++GGG +G + P + TP+ D+ HY+ + GG
Sbjct: 241 AAAGEVKKIFSHCLDTIKGGGIFAIGDVVQPK--VKSTPLVPDM--PHYNVNLESINVGG 296
Query: 267 KS---------TGIKGLQIIFDSGSSYTYFNSQAYKTTL 296
+ TG K II DSG++ TY YK L
Sbjct: 297 TTLQLPSHMFETGEKKGTII-DSGTTLTYLPELVYKDVL 334
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 93/309 (30%), Positives = 141/309 (45%), Gaps = 42/309 (13%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWV 89
S ++ T R + A P+ G P G Y +K+G PPK Y + +DTGSD+ WV
Sbjct: 53 SALRAHDGTRHGRLLAAADLPLGGLGLPTDTGLYYTEIKLGTPPKHYYVQVDTGSDILWV 112
Query: 90 QCNAPCTGCTLPPES-------LYHPK----NNLVACNDPFCSAFHLPENIRCEANDQCD 138
C C C P +S LY PK ++V C+ FC+A + +C AN C+
Sbjct: 113 NC-ITCEQC--PHKSGLGLDLTLYDPKASSTGSMVMCDQAFCAATFGGKLPKCGANVPCE 169
Query: 139 YEVLYADHGSSLGVLVTDHFPL-RLTNGSLLGP---RLIFGCGYNQ-RNPGPKPPPTAGV 193
Y V Y D S++G VTD ++T P +IFGCG Q + G G+
Sbjct: 170 YSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGI 229
Query: 194 LGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLE 252
LG G S+LSQL + G + + HCL +++GGG +G + P + TP+ D +
Sbjct: 230 LGFGEANTSMLSQLTTAGKVKKIFAHCLDTIKGGGIFSIGDVVQPK--VKTTPLVAD--K 285
Query: 253 KHYSSGPAELLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYKTTL-------- 296
HY+ + GG + + + I DSG++ TY +K +
Sbjct: 286 PHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGTIIDSGTTLTYLPELVFKEVMLAVFNKHQ 345
Query: 297 DLMRKDLKG 305
D+ D++G
Sbjct: 346 DITFHDVQG 354
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/290 (30%), Positives = 131/290 (45%), Gaps = 41/290 (14%)
Query: 46 GSTAVFPITGNV--YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
G FP+ G+ Y +G Y +K+G+PP + + IDTGSD+ WV C++ C+ C P
Sbjct: 81 GGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNC---PH 136
Query: 104 S--------LYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLG 151
S + +L V C+DP CS+ +C N+QC Y Y D + G
Sbjct: 137 SSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSG 196
Query: 152 VLVTDHFPLRLTNGSLLGPR----LIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQ 206
+TD F G L ++FGC Q K G+ G G GK S++SQ
Sbjct: 197 YYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQ 256
Query: 207 LQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLF 264
L S G+T V HCL GGG LG LVP G+ ++P+ + HY+ +
Sbjct: 257 LSSRGITPPVFSHCLKGDGSGGGVFVLGEILVP--GMVYSPLVPS--QPHYNLNLLSIGV 312
Query: 265 GGK----------STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
G+ ++ +G I D+G++ TY +AY L+ + +
Sbjct: 313 NGQMLPLDAAVFEASNTRG--TIVDTGTTLTYLVKEAYDLFLNAISNSVS 360
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/290 (30%), Positives = 131/290 (45%), Gaps = 41/290 (14%)
Query: 46 GSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
G FP+ G+ P +G Y +K+G+PP + + IDTGSD+ WV C++ C+ C P
Sbjct: 81 GGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNC---PH 136
Query: 104 S--------LYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLG 151
S + +L V C+DP CS+ +C N+QC Y Y D + G
Sbjct: 137 SSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSG 196
Query: 152 VLVTDHFPLRLTNGSLLGPR----LIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQ 206
+TD F G L ++FGC Q K G+ G G GK S++SQ
Sbjct: 197 YYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQ 256
Query: 207 LQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLF 264
L S G+T V HCL GGG LG LVP G+ ++P+ + HY+ +
Sbjct: 257 LSSRGITPPVFSHCLKGDGSGGGVFVLGEILVP--GMVYSPLVPS--QPHYNLNLLSIGV 312
Query: 265 GGK----------STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
G+ ++ +G I D+G++ TY +AY L+ + +
Sbjct: 313 NGQMLPLDAAVFEASNTRG--TIVDTGTTLTYLVKEAYDLFLNAISNSVS 360
>gi|357461293|ref|XP_003600928.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355489976|gb|AES71179.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 295
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 83/239 (34%), Positives = 117/239 (48%), Gaps = 54/239 (22%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF 119
+G Y+V+LKIG P + +++ IDTGSDLTW LY NN V
Sbjct: 15 VGGYTVSLKIGYPGQSFDVFIDTGSDLTW------------DKYKLYKLHNNFVY----- 57
Query: 120 CSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYN 179
+R + +Y D + G LV D+ PL ++ +L P+ C
Sbjct: 58 ---------VRIKL-------AIYVDGLQTKGFLVQDNIPLESSDRTLQRPK----CTNI 97
Query: 180 QRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPS 238
+ KP P + G+LGLG G+ SILSQL+S GL +NV+GHC S + G
Sbjct: 98 LKVTDKKPKPISKGILGLGHGETSILSQLKSKGLIKNVVGHCFSGKEGQ----------- 146
Query: 239 SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLD 297
++ LE Y S PA L+F K T IK LQ+IFDSG++ + FNS+ +K +D
Sbjct: 147 -----GGNTKIDLEGRYFSEPANLIFDEKLTFIKDLQLIFDSGTTLSAFNSKDHKVLVD 200
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 127/288 (44%), Gaps = 37/288 (12%)
Query: 46 GSTAVFPITGNV--YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
G FP+ G+ Y +G Y +K+G+PP + + IDTGSD+ WV C++ C+ C P
Sbjct: 81 GGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNC---PH 136
Query: 104 S--------LYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLG 151
S + + V C+DP CS+ +C N+QC Y Y D + G
Sbjct: 137 SSGLGIDLHFFDAPGSFTAGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSG 196
Query: 152 VLVTDHFPLRLTNGSLLGPR----LIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQ 206
+TD F G L ++FGC Q K G+ G G GK S++SQ
Sbjct: 197 YYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQ 256
Query: 207 LQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLF 264
L S G+T V HCL GGG LG LVP G+ ++P+ + HY+ +
Sbjct: 257 LSSRGITPPVFSHCLKGDGSGGGVFVLGEILVP--GMVYSPLLPS--QPHYNLNLLSIGV 312
Query: 265 GGKSTGIKGLQI--------IFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
G+ I I D+G++ TY +AY L+ + +
Sbjct: 313 NGQILPIDAAVFEASNTRGTIVDTGTTLTYLVKEAYDPFLNAISNSVS 360
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 129/281 (45%), Gaps = 36/281 (12%)
Query: 44 RFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP 101
R + A P+ G P G Y +K+G PPK Y + +DTGSD+ WV C C C P
Sbjct: 63 RLLAAADLPLGGLGLPTDTGLYFTEIKLGTPPKRYYVQVDTGSDILWVNC-ISCEKC--P 119
Query: 102 PES-------LYHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSL 150
+S Y PK + V+C+ FC+A + + C AN C+Y V+Y D S+
Sbjct: 120 RKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTT 179
Query: 151 GVLVTDHFPLRLTNGSLL----GPRLIFGCGYNQ-RNPGPKPPPTAGVLGLGLGKASILS 205
G VTD G + FGCG Q + G G+LG G S+LS
Sbjct: 180 GFFVTDALQFDQVTGDGQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLS 239
Query: 206 QLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLF 264
QL + G + + HCL +++GGG +G+ + P + TP+ D+ HY+ +
Sbjct: 240 QLAAAGKVKKIFAHCLDTIKGGGIFAIGNVVQPK--VKTTPLVADM--PHYNVNLKSIDV 295
Query: 265 GGKS---------TGIKGLQIIFDSGSSYTYFNSQAYKTTL 296
GG + TG + II DSG++ TY +K +
Sbjct: 296 GGTTLQLPAHVFETGERKGTII-DSGTTLTYLPELVFKEVM 335
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 95/341 (27%), Positives = 151/341 (44%), Gaps = 54/341 (15%)
Query: 7 RVMGLLVLLMFATFQGCFSEAN---QPPSKKKSTQSTAAH------RFGSTAVFPITGNV 57
R + +LV ++ A GC + N +K+S + AH R S + GN
Sbjct: 4 RAVLILVAILVAEI-GCIANGNFVFPVERRKRSLNAVKAHDARRRGRILSAVDLNLGGNG 62
Query: 58 YPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPK- 109
P G Y L +G+PPK Y + +DTGSD+ WV C C+ C + +LY PK
Sbjct: 63 LPTETGLYFTKLGLGSPPKDYYVQVDTGSDILWVNC-VKCSRCPRKSDLGIDLTLYDPKG 121
Query: 110 ---NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS 166
+ L++C+ FCSA + C++ C Y + Y D ++ G V D+ N +
Sbjct: 122 SETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDN 181
Query: 167 LL----GPRLIFGCGYNQRNPGPKPPPTA--GVLGLGLGKASILSQLQSLGLTRNVLGHC 220
L +IFGCG Q A G++G G +S+LSQL + G + + HC
Sbjct: 182 LRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHC 241
Query: 221 L-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYS-------------SGPAELLFGG 266
L ++RGGG +G + P ++ TP+ + HY+ P+++ G
Sbjct: 242 LDNIRGGGIFAIGEVVEPK--VSTTPLVPRM--AHYNVVLKSIEVDTDILQLPSDIFDSG 297
Query: 267 KSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
G I DSG++ Y + Y +L+ K + +P
Sbjct: 298 NGKG-----TIIDSGTTLAYLPAIVYD---ELIPKVMARQP 330
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/323 (30%), Positives = 142/323 (43%), Gaps = 48/323 (14%)
Query: 24 FSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGNVYP--LGYYSVTLKIGNPPKL 75
F ++ + KS + AH R S P+ GN +P G Y + IG P K
Sbjct: 31 FRVQHKFKGRGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTPSKD 90
Query: 76 YELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPK----NNLVACNDPFCSAFHLP 126
Y + +DTGSD+ WV C A C C + +LY K ++ V C+D FCS + P
Sbjct: 91 YYVQVDTGSDILWVNC-AGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGP 149
Query: 127 ENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFGCGYNQRN 182
C+ QC Y VLY D S+ G V D +G+ ++FGCG Q
Sbjct: 150 LP-GCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSG 208
Query: 183 P-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSG 240
G G+LG G +S+LSQL S G + V HCL +V GGG +G + P
Sbjct: 209 ELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIGEVVEPK-- 266
Query: 241 IAWTPMSRDLL------EKHYSSGPAELLFGG----------KSTGIKGLQIIFDSGSSY 284
+ + M+ ++ HY+ E+ GG +S KG I DSG++
Sbjct: 267 VRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG--TIIDSGTTL 324
Query: 285 TYFNSQAYKTTLDLMRKDLKGKP 307
YF + Y + L+ K L +P
Sbjct: 325 AYFPQEVY---VPLIEKILSQQP 344
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 134/307 (43%), Gaps = 61/307 (19%)
Query: 49 AVFPITGN--VYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP----- 101
FPI+G+ + G Y + +G PP+ + + +DTGSD+ WV C PCT C
Sbjct: 32 VAFPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNC-VPCTNCKRASNVAL 90
Query: 102 PESLYHPKNNL----VACNDPFCSAFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTD 156
P S++ P+ + ++C D C +L N +C N C Y LY D S+ G L+ D
Sbjct: 91 PISIFDPEKSTSKTSISCTDEEC---YLASNSKCSFNSMSCPYSTLYGDGSSTAGYLIND 147
Query: 157 -----HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLG 211
P + + RL FGCG NQ T G++G G + S+ SQL
Sbjct: 148 VLSFNQVPSGNSTATSGTARLTFGCGSNQTGTW----LTDGLVGFGQAEVSLPSQLSKQN 203
Query: 212 LTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST 269
++ N+ HCL +G G L +GH P G+ +TP+ + HY+ ELL +
Sbjct: 204 VSVNIFAHCLQGDNKGSGTLVIGHIREP--GLVYTPIVPK--QSHYN---VELL----NI 252
Query: 270 GIKGLQ--------------IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEK 315
G+ G +I DSG++ TY AY D + D
Sbjct: 253 GVSGTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAY---------DQFQAKVRDCMRSG 303
Query: 316 ALPVCWK 322
LPV ++
Sbjct: 304 VLPVAFQ 310
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/310 (30%), Positives = 141/310 (45%), Gaps = 47/310 (15%)
Query: 44 RFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP 101
R + A P+ G P G Y +K+G PPK Y + +DTGSD+ WV C C+ C P
Sbjct: 66 RLLAAADLPLGGLGLPTDTGLYFTEIKLGTPPKRYYVQVDTGSDILWVNC-ISCSKC--P 122
Query: 102 PES-------LYHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSL 150
+S Y PK + V+C+ FC+A + + C AN C+Y V+Y D S+
Sbjct: 123 RKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTT 182
Query: 151 GVLVTDHFPLRLTNGSLL----GPRLIFGCGYNQ-RNPGPKPPPTAGVLGLGLGKASILS 205
G +TD G + FGCG Q + G G+LG G S+LS
Sbjct: 183 GFFITDALQFDQVTGDGQTQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLS 242
Query: 206 QLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVP--------SSGIAWTP---MSRDLLEK 253
QL + G + + HCL +++GGG +G+ + P + G+ P + LL +
Sbjct: 243 QLAAAGKAKKIFAHCLDTIKGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSR 302
Query: 254 -HYSSGPAELLFGGKS---------TGIKGLQIIFDSGSSYTYFNSQAYKTTLDLM---R 300
HY+ + GG + TG K II DSG++ TY +K +D++
Sbjct: 303 PHYNVNLKSIDVGGTTLQLPAHVFETGEKKGTII-DSGTTLTYLPELVFKQVMDVVFSKH 361
Query: 301 KDLKGKPLED 310
+D+ L+D
Sbjct: 362 RDIAFHNLQD 371
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 86/292 (29%), Positives = 129/292 (44%), Gaps = 34/292 (11%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWV 89
S ++ T R +TA P+ G P G Y +++G PPK + + +DTGSD+ WV
Sbjct: 55 SALRAHDGTRHGRLLATADLPLGGLGLPTDTGLYYTEVRLGTPPKRFYVQVDTGSDILWV 114
Query: 90 QCNAPCTGCTLPPES-------LYHPK----NNLVACNDPFCSAFHLPENIRCEANDQCD 138
C C C P +S LY PK + V C+ FC+ +C AN C+
Sbjct: 115 NC-ITCDQC--PHKSGLGLDLTLYDPKASSTGSTVMCDQGFCADTFGGRLPKCSANVPCE 171
Query: 139 YEVLYADHGSSLGVLVTDHFPLRLTNGS----LLGPRLIFGCGYNQ-RNPGPKPPPTAGV 193
Y V Y D S++G V D G +IFGCG Q + G G+
Sbjct: 172 YSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGI 231
Query: 194 LGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLE 252
LG G S+LSQL + G + + HCL +++GGG +G + P + TP+ D +
Sbjct: 232 LGFGEANTSMLSQLATAGKVKKIFAHCLDTIKGGGIFAIGDVVQPK--VKTTPLVAD--K 287
Query: 253 KHYSSGPAELLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYKTTL 296
HY+ + GG + + + I DSG++ TY +K +
Sbjct: 288 PHYNVNLKTIDVGGTTLELPADIFKPGEKRGTIIDSGTTLTYLPELVFKKVM 339
>gi|213998842|gb|ACJ60788.1| nucellin [Hordeum cordobense]
Length = 154
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 60/144 (41%), Positives = 83/144 (57%), Gaps = 5/144 (3%)
Query: 171 RLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGY 228
++ FGCGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G
Sbjct: 8 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 67
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G PS G+ W PM L +YS G AELL + G +++FDSGS+YT+
Sbjct: 68 LYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPIRGNPTFEVVFDSGSTYTHV 125
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + +R L LE+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLEEV 149
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 98/323 (30%), Positives = 137/323 (42%), Gaps = 38/323 (11%)
Query: 21 QGCFSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGNVYP--LGYYSVTLKIGNP 72
G FS + +K+S + AH R + P+ G P +G Y + IG P
Sbjct: 48 HGFFSLKYKFAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTP 107
Query: 73 PKLYELDIDTGSDLTWV---QCNAPCTGCTLPPE-SLYHPKNNL----VACNDPFCSAFH 124
+ Y + +DTGSD+ WV QCN +L E +LY K +L V+C+ FC A +
Sbjct: 108 ARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAIN 167
Query: 125 LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFGCGYNQ 180
C AN C Y +YAD SS G V D +G L +IFGC Q
Sbjct: 168 GGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQ 227
Query: 181 RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-VRGGGYLFLGHDLVPSS 239
G+LG G S++SQL S G R + HCL + GGG +GH + P
Sbjct: 228 SGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPK- 286
Query: 240 GIAWTPMSRDLLEKHYSSGPAELLFGGK---------STGIKGLQIIFDSGSSYTYFNSQ 290
+ TP+ + + HY+ + GG G K II DSG++ Y
Sbjct: 287 -VNTTPLVPN--QTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTII-DSGTTLAYLPEV 342
Query: 291 AYKTTLDLM---RKDLKGKPLED 310
Y L + + DLK + D
Sbjct: 343 VYDQLLSKIFSWQSDLKVHTIHD 365
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 98/323 (30%), Positives = 137/323 (42%), Gaps = 38/323 (11%)
Query: 21 QGCFSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGNVYP--LGYYSVTLKIGNP 72
G FS + +K+S + AH R + P+ G P +G Y + IG P
Sbjct: 48 HGFFSLKYKFAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTP 107
Query: 73 PKLYELDIDTGSDLTWV---QCNAPCTGCTLPPE-SLYHPKNNL----VACNDPFCSAFH 124
+ Y + +DTGSD+ WV QCN +L E +LY K +L V+C+ FC A +
Sbjct: 108 ARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAIN 167
Query: 125 LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFGCGYNQ 180
C AN C Y +YAD SS G V D +G L +IFGC Q
Sbjct: 168 GGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQ 227
Query: 181 RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-VRGGGYLFLGHDLVPSS 239
G+LG G S++SQL S G R + HCL + GGG +GH + P
Sbjct: 228 SGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPK- 286
Query: 240 GIAWTPMSRDLLEKHYSSGPAELLFGGK---------STGIKGLQIIFDSGSSYTYFNSQ 290
+ TP+ + + HY+ + GG G K II DSG++ Y
Sbjct: 287 -VNTTPLVPN--QTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTII-DSGTTLAYLPEV 342
Query: 291 AYKTTLDLM---RKDLKGKPLED 310
Y L + + DLK + D
Sbjct: 343 VYDQLLSKIFSWQSDLKVHTIHD 365
>gi|213998812|gb|ACJ60773.1| nucellin [Hordeum euclaston]
Length = 154
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 60/144 (41%), Positives = 83/144 (57%), Gaps = 5/144 (3%)
Query: 171 RLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGY 228
++ FGCGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G
Sbjct: 8 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 67
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G PS G+ W PM L +YS+G AELL + G + +FDSGS+YT+
Sbjct: 68 LYVGDFNPPSRGVTWVPMKESLF--YYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + +R L LE+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLEEV 149
>gi|213998840|gb|ACJ60787.1| nucellin [Hordeum patagonicum subsp. magellanicum]
Length = 154
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/144 (42%), Positives = 82/144 (56%), Gaps = 5/144 (3%)
Query: 171 RLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGY 228
++ FGCGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G
Sbjct: 8 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 67
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G PS G+ W PM L +YS G AELL + G + +FDSGS+YT+
Sbjct: 68 LYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y L +R L LE+
Sbjct: 126 PAQIYNEILSKVRGTLSESSLEEV 149
>gi|213998818|gb|ACJ60776.1| nucellin [Hordeum patagonicum subsp. setifolium]
Length = 149
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/144 (42%), Positives = 82/144 (56%), Gaps = 5/144 (3%)
Query: 171 RLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGY 228
++ FGCGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G
Sbjct: 8 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 67
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G PS G+ W PM L +YS G AELL + G + +FDSGS+YT+
Sbjct: 68 LYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y L +R L LE+
Sbjct: 126 PAQIYNEILSKVRGTLSESSLEEV 149
>gi|213998826|gb|ACJ60780.1| nucellin [Hordeum intercedens]
Length = 148
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 83/143 (58%), Gaps = 5/143 (3%)
Query: 171 RLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGY 228
++ FGCGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G
Sbjct: 8 KVAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 67
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G PS G+ W PM L +YS+G AELL + G + +FDSGS+YT+
Sbjct: 68 LYVGDFNPPSRGVTWVPMKESLF--YYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 288 NSQAYKTTLDLMRKDLKGKPLED 310
+Q Y + +R L LE+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLEE 148
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 87/305 (28%), Positives = 137/305 (44%), Gaps = 36/305 (11%)
Query: 33 KKKSTQSTAAH------RFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGS 84
+K+S + AH R S + GN P G Y L +G+PP+ Y + +DTGS
Sbjct: 32 RKRSLSAVRAHDVRRRGRILSAVDLNLGGNGLPTETGLYFTKLGLGSPPRDYYVQVDTGS 91
Query: 85 DLTWVQCNAPCTGCTLPPE-----SLYHPK----NNLVACNDPFCSAFHLPENIRCEAND 135
D+ WV C C+ C + +LY PK +++V+C+ FCSA C++
Sbjct: 92 DILWVNC-VECSRCPRKSDLGIDLTLYDPKGSETSDVVSCDQDFCSATFDGPIPGCKSEI 150
Query: 136 QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL----GPRLIFGCGYNQRNPGPKPPPTA 191
C Y + Y D ++ G V D+ NG+L +IFGCG Q A
Sbjct: 151 PCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNSSIIFGCGAVQSGTLGSSSEEA 210
Query: 192 --GVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSR 248
G++G G +S+LSQL + G + + HCL +VRGGG +G + P ++ TP+
Sbjct: 211 LDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDNVRGGGIFAIGEVVEPK--VSTTPLVP 268
Query: 249 DLLEKHYSSGPAEL------LFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKD 302
+ + E+ L + G + DSG++ Y Y +L++K
Sbjct: 269 RMAHYNVVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPDIVYD---ELIQKV 325
Query: 303 LKGKP 307
L +P
Sbjct: 326 LARQP 330
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 127/272 (46%), Gaps = 37/272 (13%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---- 104
FP+ G+ P +G Y +K+G+PPK Y + IDTGSD+ WV C +PCTGC P S
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGC--PSSSGLNI 133
Query: 105 ---LYHPK----NNLVACNDPFCSAFHLPENIRCEANDQ--CDYEVLYADHGSSLGVLVT 155
++P ++ + C+D C+A C+ +D C Y Y D + G V+
Sbjct: 134 QLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVS 193
Query: 156 D--HFPLRLTNGSLLG--PRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSL 210
D +F + N ++FGC +Q K G+ G G + S++SQL SL
Sbjct: 194 DTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSL 253
Query: 211 GLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS 268
G++ V HCL S GGG L LG + P G+ +TP+ + HY+ ++ G+
Sbjct: 254 GVSPKVFSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPS--QPHYNLNLESIVVNGQK 309
Query: 269 TGIKG--------LQIIFDSGSSYTYFNSQAY 292
I I DSG++ Y AY
Sbjct: 310 LPIDSSLFTTSNTQGTIVDSGTTLAYLADGAY 341
>gi|213998804|gb|ACJ60769.1| nucellin [Hordeum muticum]
gi|213998808|gb|ACJ60771.1| nucellin [Hordeum erectifolium]
gi|213998820|gb|ACJ60777.1| nucellin [Hordeum patagonicum subsp. mustersii]
gi|213998822|gb|ACJ60778.1| nucellin [Hordeum patagonicum subsp. santacrucense]
gi|333069937|gb|AEF13570.1| nucellin, partial [Hordeum pubiflorum]
Length = 154
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 60/144 (41%), Positives = 82/144 (56%), Gaps = 5/144 (3%)
Query: 171 RLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGY 228
++ FGCGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G
Sbjct: 8 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 67
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G PS G+ W PM L +YS G AELL + G + +FDSGS+YT+
Sbjct: 68 LYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + +R L LE+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLEEV 149
>gi|213998836|gb|ACJ60785.1| nucellin [Hordeum bogdanii]
Length = 154
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 60/144 (41%), Positives = 82/144 (56%), Gaps = 5/144 (3%)
Query: 171 RLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGY 228
++ FGCGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G
Sbjct: 8 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 67
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK-GLQIIFDSGSSYTYF 287
L++G PS G+ W PM L +YS G AELL + G + +FDSGS+YT+
Sbjct: 68 LYVGDFNPPSRGVTWVPMRESLF--YYSPGLAELLIDNQPIGGNPTFEAVFDSGSTYTHV 125
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + +R L LE+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLEEV 149
>gi|213998816|gb|ACJ60775.1| nucellin [Hordeum patagonicum subsp. patagonicum]
Length = 152
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/144 (41%), Positives = 82/144 (56%), Gaps = 5/144 (3%)
Query: 171 RLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLG-LTRNVLGHCLSVRGGGY 228
++ FGCGY Q P PP P G+LGLG+GKA +QL+ +T NV+GHCLS +G G
Sbjct: 6 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKVITGNVIGHCLSSKGKGV 65
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G PS G+ W PM L +YS G AELL + G + +FDSGS+YT+
Sbjct: 66 LYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 123
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + +R L LE+
Sbjct: 124 PAQIYNEIVSKVRGTLSESSLEEV 147
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 127/284 (44%), Gaps = 41/284 (14%)
Query: 46 GSTAVFPITGNVYP-LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES 104
G F + G+ P +G Y +K+GNP + + + IDTGSD+ WV C +PC GC P S
Sbjct: 66 GGIVNFSVKGSSNPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGC--PDSS 122
Query: 105 LYHPKNNL-----------VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVL 153
+ NL + C DP C+A + D C Y Y D + G
Sbjct: 123 GLGIELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFY 182
Query: 154 VTD--HFPLRLTNGSLL--GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQ 208
VTD HF + L ++ ++FGC Q + G+ G G G+ S++SQL
Sbjct: 183 VTDSMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLS 242
Query: 209 SLGLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYS--------SG 258
S G+T V HCL GGG L LG L PS I ++P+ + HY+ SG
Sbjct: 243 SRGITPKVFSHCLKGGENGGGILVLGEILEPS--IVYSPLIPS--QPHYTLKLQSIALSG 298
Query: 259 ---PAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLM 299
P +F + G + I DSG++ Y + Y + ++
Sbjct: 299 QLFPNPTMFPISNAG----ETIIDSGTTLAYLVEEVYDWIVSVI 338
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 127/272 (46%), Gaps = 37/272 (13%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---- 104
FP+ G+ P +G Y +K+G+PPK Y + IDTGSD+ WV C +PCTGC P S
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGC--PSSSGLNI 133
Query: 105 ---LYHPK----NNLVACNDPFCSAFHLPENIRCEANDQ--CDYEVLYADHGSSLGVLVT 155
++P ++ + C+D C+A C+ +D C Y Y D + G V+
Sbjct: 134 QLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVS 193
Query: 156 D--HFPLRLTNGSLL--GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSL 210
D +F + N ++FGC +Q K G+ G G + S++SQL SL
Sbjct: 194 DTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSL 253
Query: 211 GLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS 268
G++ V HCL S GGG L LG + P G+ +TP+ + HY+ ++ G+
Sbjct: 254 GVSPKVFSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPS--QPHYNLNLESIVVNGQK 309
Query: 269 TGIKG--------LQIIFDSGSSYTYFNSQAY 292
I I DSG++ Y AY
Sbjct: 310 LPIDSSLFTTSNTQGTIVDSGTTLAYLADGAY 341
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 85/299 (28%), Positives = 129/299 (43%), Gaps = 34/299 (11%)
Query: 35 KSTQSTAAHRFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
K+ S+ R S F + GN P G Y + +G+P K Y + +DTGSD+ WV C
Sbjct: 39 KAHDSSRRGRILSAVDFNLGGNGLPTVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNC- 97
Query: 93 APCTGCTLPPE-----SLYHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLY 143
CT C + +LY PK + V+C FCS+ + + C+A + C Y + Y
Sbjct: 98 VECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISY 157
Query: 144 ADHGSSLGVLVTDHFPLRLTNG----SLLGPRLIFGCGYNQRNPGPKPPPTA--GVLGLG 197
D ++ G V D+ NG + +IFGCG Q A G++G G
Sbjct: 158 GDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFG 217
Query: 198 LGKASILSQLQSLGLTRNVLGHCLSVR-GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYS 256
+S+LSQL + G + + HCL GGG +G + P + TP+ ++ HY+
Sbjct: 218 QANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPK--VKTTPLVPNM--AHYN 273
Query: 257 SGPAELLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
+ G + G + DSG++ Y Y LM K L +P
Sbjct: 274 VILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYD---QLMSKVLAKQP 329
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 127/284 (44%), Gaps = 41/284 (14%)
Query: 46 GSTAVFPITGNVYP-LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES 104
G F + G+ P +G Y +K+GNP + + + IDTGSD+ WV C +PC GC P S
Sbjct: 66 GGIVNFSVKGSSNPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGC--PDSS 122
Query: 105 LYHPKNNL-----------VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVL 153
+ NL + C DP C+A + D C Y Y D + G
Sbjct: 123 GLGIELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFY 182
Query: 154 VTD--HFPLRLTNGSLL--GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQ 208
VTD HF + L ++ ++FGC Q + G+ G G G+ S++SQL
Sbjct: 183 VTDSMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLS 242
Query: 209 SLGLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYS--------SG 258
S G+T V HCL GGG L LG L PS I ++P+ + HY+ SG
Sbjct: 243 SRGITPKVFSHCLKGGENGGGILVLGEILEPS--IVYSPLIPS--QPHYTLKLQSIALSG 298
Query: 259 ---PAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLM 299
P +F + G + I DSG++ Y + Y + ++
Sbjct: 299 QLFPNPTMFPISNAG----ETIIDSGTTLAYLVEEVYDWIVSVI 338
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 127/272 (46%), Gaps = 37/272 (13%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---- 104
FP+ G+ P +G Y +K+G+PPK Y + IDTGSD+ WV C +PCTGC P S
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGC--PSSSGLNI 133
Query: 105 ---LYHPK----NNLVACNDPFCSAFHLPENIRCEANDQ--CDYEVLYADHGSSLGVLVT 155
++P ++ + C+D C+A C+ +D C Y Y D + G V+
Sbjct: 134 QLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVS 193
Query: 156 D--HFPLRLTNGSLLG--PRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSL 210
D +F + N ++FGC +Q K G+ G G + S++SQL SL
Sbjct: 194 DTMYFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSL 253
Query: 211 GLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS 268
G++ V HCL S GGG L LG + P G+ +TP+ + HY+ ++ G+
Sbjct: 254 GVSPKVFSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPS--QPHYNLNLESIVVNGQK 309
Query: 269 TGIKG--------LQIIFDSGSSYTYFNSQAY 292
I I DSG++ Y AY
Sbjct: 310 LPIDSSLFTTSNTQGTIVDSGTTLAYLADGAY 341
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/295 (29%), Positives = 129/295 (43%), Gaps = 46/295 (15%)
Query: 46 GSTAVFPITGNVYPL-------GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC 98
G FP+ G+ P Y +K+G+PP + + IDTGSD+ WV C++ C+ C
Sbjct: 81 GGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNC 139
Query: 99 TLPPES--------LYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADH 146
P S + +L V C+DP CS+ +C N+QC Y Y D
Sbjct: 140 ---PHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDG 196
Query: 147 GSSLGVLVTDHFPLRLTNGSLLGPR----LIFGCGYNQRNPGPKPPPTA-GVLGLGLGKA 201
+ G +TD F G L ++FGC Q K G+ G G GK
Sbjct: 197 SGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKL 256
Query: 202 SILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGP 259
S++SQL S G+T V HCL GGG LG LVP G+ ++P+ + HY+
Sbjct: 257 SVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVP--GMVYSPLVPS--QPHYNLNL 312
Query: 260 AELLFGGK----------STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
+ G+ ++ +G I D+G++ TY +AY L+ + +
Sbjct: 313 LSIGVNGQMLPLDAAVFEASNTRG--TIVDTGTTLTYLVKEAYDLFLNAISNSVS 365
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 121/277 (43%), Gaps = 33/277 (11%)
Query: 43 HRFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT- 99
R + FP+TG+ P G Y + +G PP Y + +DTGSD+TW+ C APCT C
Sbjct: 15 RRLAAVVDFPLTGDDDPFVTGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNC-APCTSCVT 73
Query: 100 ---LPPESL--YHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSL 150
LP L Y P + ++C D C A + C + C Y Y D S+
Sbjct: 74 ETQLPSIKLTTYDPSRSSTDGALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQ 133
Query: 151 GVLVTDHFPLR-LTNGSLLG--PRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQ 206
G + D + + N + + + FGCG Q N G++G G SI SQ
Sbjct: 134 GYFIQDVMTFQEIHNNTQVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQ 193
Query: 207 LQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLF 264
L S+G N HCL +GGG + +G V I++TP+ + HY+ G +
Sbjct: 194 LASMGKVGNRFAHCLQGDNQGGGTIVIGS--VSEPNISYTPI---VSRNHYAVGMQNIAV 248
Query: 265 GGK---------STGIKGLQIIFDSGSSYTYFNSQAY 292
G+ +T +I DSG++ Y AY
Sbjct: 249 NGRNVTTPASFDTTSTSAGGVIMDSGTTLAYLVDPAY 285
>gi|213998798|gb|ACJ60766.1| nucellin [Hordeum brevisubulatum subsp. violaceum]
Length = 141
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 81/143 (56%), Gaps = 5/143 (3%)
Query: 172 LIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGLTR-NVLGHCLSVRGGGYL 229
+ FGCGY Q P PP P G+LGLG+GKA +QL+ + + NV+GHCLS +G G L
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMIKENVIGHCLSSKGKGVL 60
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYFN 288
++G PS G+ W PM L +YS G AELL + G + +FDSGS+YT+
Sbjct: 61 YVGDFNPPSRGVTWVPMRESLF--YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVP 118
Query: 289 SQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + +R L LE+
Sbjct: 119 AQIYNEIVSKVRGTLSEPSLEEV 141
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/273 (32%), Positives = 126/273 (46%), Gaps = 37/273 (13%)
Query: 51 FPITG--NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC------TLPP 102
FP+ G N Y +G Y +K+GNP K + + IDTGSD+ WV C +PCTGC +
Sbjct: 75 FPVEGSANPYMVGLYFTRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQL 133
Query: 103 ESLYHPKNNLVA----CNDPFCSA-FHLPENIRCEANDQ---CDYEVLYADHGSSLGVLV 154
ES ++P ++ A C+D C+A F E I +N Q C Y Y D + G V
Sbjct: 134 ES-FNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYV 192
Query: 155 TDHFPLRLTNGSLL----GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQS 209
+D G+ ++FGC +Q K G+ G G + S++SQL S
Sbjct: 193 SDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNS 252
Query: 210 LGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK 267
LG++ V HCL S GGG L LG + P G+ +TP+ + HY+ + G+
Sbjct: 253 LGVSPKVFSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPS--QPHYNLNLESIAVNGQ 308
Query: 268 STGIKG--------LQIIFDSGSSYTYFNSQAY 292
I I DSG++ Y AY
Sbjct: 309 KLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAY 341
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/273 (32%), Positives = 126/273 (46%), Gaps = 37/273 (13%)
Query: 51 FPITG--NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC------TLPP 102
FP+ G N Y +G Y +K+GNP K + + IDTGSD+ WV C +PCTGC +
Sbjct: 77 FPVEGSANPYMVGLYFTRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQL 135
Query: 103 ESLYHPKNNLVA----CNDPFCSA-FHLPENIRCEANDQ---CDYEVLYADHGSSLGVLV 154
ES ++P ++ A C+D C+A F E I +N Q C Y Y D + G V
Sbjct: 136 ES-FNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYV 194
Query: 155 TDHFPLRLTNGSLL----GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQS 209
+D G+ ++FGC +Q K G+ G G + S++SQL S
Sbjct: 195 SDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNS 254
Query: 210 LGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK 267
LG++ V HCL S GGG L LG + P G+ +TP+ + HY+ + G+
Sbjct: 255 LGVSPKVFSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPS--QPHYNLNLESIAVNGQ 310
Query: 268 STGIKG--------LQIIFDSGSSYTYFNSQAY 292
I I DSG++ Y AY
Sbjct: 311 KLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAY 343
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 123/278 (44%), Gaps = 36/278 (12%)
Query: 46 GSTAVFPITGNVYP--LGY--YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP 101
G F + G+ P LGY Y+ +K+G PP+ + + IDTGSD+ W+ CN C+ C
Sbjct: 63 GGVVDFRVQGSSDPSTLGYGLYTTKVKMGTPPREFTVQIDTGSDILWINCNT-CSNCPKS 121
Query: 102 P---------ESLYHPKNNLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLG 151
+++ LV C+DP C++ +C +QC Y Y D + G
Sbjct: 122 SGLGIELNFFDTVGSSTAALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSG 181
Query: 152 VLVTDHFPLRLTNGSLL------GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASIL 204
V V+D + G ++FGC Q K G+LG G G+ S++
Sbjct: 182 VYVSDAMYFDMILGQSTPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVV 241
Query: 205 SQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAEL 262
SQL S G+T V HCL GGG L LG L PS I ++P+ + HY+ +
Sbjct: 242 SQLSSRGITPKVFSHCLKGDGNGGGILVLGEILEPS--IVYSPLVPS--QPHYNLNLQSI 297
Query: 263 LFGGKSTGIKGLQI--------IFDSGSSYTYFNSQAY 292
G+ I I DSG++ +Y +AY
Sbjct: 298 AVNGQVLSINPAVFATSDKRGTIIDSGTTLSYLVQEAY 335
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 83/286 (29%), Positives = 128/286 (44%), Gaps = 37/286 (12%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGN--VYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
+K Q FPI+G+ ++ +G Y + +G PP+ + +D+DTGS++ WV+C
Sbjct: 10 RKHDQRRLRRMLPEVVSFPISGDNDIFAMGLYYTRISLGTPPQQFYVDVDTGSNVAWVKC 69
Query: 92 NAPCTGCT----LP-PESLYHPKNNL----VACNDPFCSAFHLPENIRCEAND-QCDYEV 141
APCTGC +P P S + P+ + ++C D C L + ++C C Y +
Sbjct: 70 -APCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGV--LNKKLQCSPERLSCPYSL 126
Query: 142 LYADHGSSLGVLVTDHF-----PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGL 196
LY D S+ G + D F P + RL+FGCG Q G+LG
Sbjct: 127 LYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSW----SVDGLLGF 182
Query: 197 GLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH 254
G S+ +QL ++ N+ HCL V G G L +G P + +TPM E H
Sbjct: 183 GPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIGTIREPD--LVYTPMV--FGEDH 238
Query: 255 YSSGPAELLFGGKSTGIKGL-------QIIFDSGSSYTYFNSQAYK 293
Y+ + G++ +I DSG++ TY AY
Sbjct: 239 YNVQLLNIGISGRNVTTPASFDLEYTGGVIIDSGTTLTYLVQPAYD 284
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 89/327 (27%), Positives = 140/327 (42%), Gaps = 45/327 (13%)
Query: 26 EANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTG 83
E +Q ++ K+ G FP+ G P +G Y +++G+PP+ + + +DTG
Sbjct: 42 ELSQLKARDKARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKIRLGSPPRDFYVQVDTG 101
Query: 84 SDLTWVQCNAPCTGCTLPPES-------LYHPKNNL----VACNDPFCSAFHLPENIRCE 132
SD+ WV C A C GC P S + P +++ V+C+D CS + C
Sbjct: 102 SDVLWVSC-ASCNGC--PQTSGLQIQLNFFDPGSSVTATPVSCSDQRCSWGIQSSDSGCS 158
Query: 133 A-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR----LIFGCGYNQRNPGPKP 187
N+ C Y Y D + G V+D + GS L P ++FGC +Q K
Sbjct: 159 VQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKS 218
Query: 188 PPTA-GVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHDLVPSSGIAWT 244
G+ G G S++SQL S GL V HCL GGG L LG + P+ + +T
Sbjct: 219 DRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGENGGGGILVLGEIVEPN--MVFT 276
Query: 245 PMSRDLLEKHYSSGPAELLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYKTTL 296
P+ + HY+ + G++ I G I D+G++ Y + AY +
Sbjct: 277 PLVPS--QPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV 334
Query: 297 DLMRKDLKGKPLEDTAEEKALPVCWKG 323
+ + + + PV KG
Sbjct: 335 E---------AITNAVSQSVRPVVSKG 352
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 124/278 (44%), Gaps = 35/278 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPKNN---- 111
G Y + IG P K Y + +DTGSD+ WV C C GC ++Y P+ +
Sbjct: 88 GLYFTRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGE 146
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG----SL 167
LV C+ FC A + C + C+Y + Y D S+ G VTD +G +
Sbjct: 147 LVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 168 LGPRLIFGCGYN-QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRG 225
+ FGCG + G G+LG G +S+LSQL + G R + HCL +V G
Sbjct: 207 ANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNG 266
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI----------KGLQ 275
GG +G+ + P + TP+ D+ HY+ + GG + G+ KG
Sbjct: 267 GGIFAIGNVVQPK--VKTTPLVSDM--PHYNVILKGIDVGGTALGLPTNIFDSGNSKG-- 320
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLM---RKDLKGKPLED 310
I DSG++ Y YK ++ +D+ + L+D
Sbjct: 321 TIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQD 358
>gi|213998848|gb|ACJ60790.1| nucellin [Psathyrostachys stoloniformis]
Length = 154
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 79/143 (55%), Gaps = 5/143 (3%)
Query: 172 LIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGYL 229
+ FGCGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G L
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGKGVL 68
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYFN 288
++G P+ G+ W PM L +YS G A L + G + +FDSGS+YTY
Sbjct: 69 YVGDFNPPTRGVTWVPMRESLF--YYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTYMP 126
Query: 289 SQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + +R L LE+
Sbjct: 127 AQIYNELVSKIRGTLSESSLEEV 149
>gi|213998830|gb|ACJ60782.1| nucellin [Hordeum pusillum]
Length = 147
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 81/144 (56%), Gaps = 5/144 (3%)
Query: 171 RLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGY 228
++ FGCGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G
Sbjct: 1 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 60
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G PS G+ W PM L +YS G AELL + G + +FDSGS+YT+
Sbjct: 61 LYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 118
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + + L LE+
Sbjct: 119 PAQIYNEIVSKVIGTLSESSLEEV 142
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 128/286 (44%), Gaps = 46/286 (16%)
Query: 46 GSTAVFPITGNVYP--LGYY--------SVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC 95
G FP+ G P +G+Y L++G+PP+ + + IDTGSD+ WV C++ C
Sbjct: 63 GGVVDFPVQGTFDPFLVGFYFGSFCRLYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSS-C 121
Query: 96 TGCTLPPESLYH-----------PKNNLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLY 143
GC P S H P +L++C+D CS + C A N+QC Y Y
Sbjct: 122 NGC--PVSSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQSSDSVCAAQNNQCGYTFQY 179
Query: 144 ADHGSSLGVLVTD--HFPLRLTNGSLL---GPRLIFGCGYNQRNPGPKPPPTA-GVLGLG 197
D + G V+D HF L GS++ ++FGC Q KP G+ G G
Sbjct: 180 GDGSGTSGYYVSDLLHFDTIL-GGSVMKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFG 238
Query: 198 LGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHY 255
S++SQL S G+T V HCL GGG L LG + P+ I +TP+ + HY
Sbjct: 239 QQDMSVISQLASQGITPRVFSHCLKGDDSGGGILVLGEIVEPN--IVYTPLVPS--QPHY 294
Query: 256 SSGPAELLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYK 293
+ + G++ I I DSG++ Y AY
Sbjct: 295 NLNLQSIYVNGQTLAIDPSVFATSSNQGTIIDSGTTLAYLTEAAYD 340
>gi|213998845|gb|ACJ60789.1| nucellin [Psathyrostachys fragilis subsp. fragilis]
Length = 150
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 79/143 (55%), Gaps = 5/143 (3%)
Query: 172 LIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGYL 229
+ FGCGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G L
Sbjct: 7 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGKGVL 66
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYFN 288
++G P+ G+ W PM L +YS G A L + G + +FDSGS+YTY
Sbjct: 67 YVGDFNPPTRGVTWVPMRESLF--YYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTYVP 124
Query: 289 SQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + +R L LE+
Sbjct: 125 AQIYNELVSKIRGTLSESSLEEV 147
>gi|213998838|gb|ACJ60786.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 154
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 81/144 (56%), Gaps = 5/144 (3%)
Query: 171 RLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGLTR-NVLGHCLSVRGGGY 228
++ FGCGY Q P PP P G+LGLG+GKA +QL+ + + NV+GHCLS +G G
Sbjct: 8 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSSKGKGV 67
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G P+ G+ W PM L +YS G AE+ + G + +FDSGS+YT+
Sbjct: 68 LYVGDFNPPTRGVTWAPMRESLF--YYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHV 125
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + +R L LE+
Sbjct: 126 PAQIYNEIVSKVRVTLSESSLEEV 149
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 118/268 (44%), Gaps = 37/268 (13%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHP 108
T + Y G Y +++G PP+ + + IDTGSD+ WV C PC C L + + P
Sbjct: 32 TADPYVAGLYYTRIELGTPPRPFYVQIDTGSDILWVNCK-PCNACPLTSGLGVALNFFDP 90
Query: 109 KNNLVA----CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLR--- 161
+ + A C D C + + C + C Y Y D +LG V+D F
Sbjct: 91 RGSSTASPLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYV 150
Query: 162 ---LTNGSLLGPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTRNVL 217
+TN + ++ FGC YNQ KP G+ G G S++SQL S GL +
Sbjct: 151 NQYVTNNA--SAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIF 208
Query: 218 GHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK-------- 267
HCL + GGG L LG P G+ +TP+ + HY+ + G+
Sbjct: 209 SHCLEGADPGGGILVLGEITEP--GMVYTPIVPS--QPHYNLNLQGIAVNGQQLSIDPQV 264
Query: 268 --STGIKGLQIIFDSGSSYTYFNSQAYK 293
+T +G I D G++ Y +AY+
Sbjct: 265 FATTNTRG--TIIDCGTTLAYLAEEAYE 290
>gi|213998824|gb|ACJ60779.1| nucellin [Hordeum chilense]
Length = 140
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 79/142 (55%), Gaps = 5/142 (3%)
Query: 172 LIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGYL 229
+ FGCGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G L
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 60
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYFN 288
+ G PS G+ W PM +YS G AELL + G + +FDSGS+YT+
Sbjct: 61 YFGDFNPPSRGVTWVPMKESX--XYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVP 118
Query: 289 SQAYKTTLDLMRKDLKGKPLED 310
+Q Y + +R L LE+
Sbjct: 119 AQIYNEIVSKVRGTLSESSLEE 140
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 124/278 (44%), Gaps = 35/278 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPKNN---- 111
G Y + IG P K Y + +DTGSD+ WV C C GC ++Y P+ +
Sbjct: 88 GLYFTRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGE 146
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG----SL 167
LV C+ FC A + C + C+Y + Y D S+ G VTD +G +
Sbjct: 147 LVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 168 LGPRLIFGCGYN-QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRG 225
+ FGCG + G G+LG G +S+LSQL + G R + HCL +V G
Sbjct: 207 ANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNG 266
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI----------KGLQ 275
GG +G+ + P + TP+ D+ HY+ + GG + G+ KG
Sbjct: 267 GGIFAIGNVVQPK--VKTTPLVPDM--PHYNVILKGIDVGGTALGLPTNIFDSGNSKG-- 320
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLM---RKDLKGKPLED 310
I DSG++ Y YK ++ +D+ + L+D
Sbjct: 321 TIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQD 358
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/276 (28%), Positives = 122/276 (44%), Gaps = 31/276 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPKNN---- 111
G Y + IG P K Y + +DTGSD+ WV C C GC ++Y P+ +
Sbjct: 88 GLYFTRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGE 146
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG----SL 167
LV C+ FC A + C + C+Y + Y D S+ G VTD +G +
Sbjct: 147 LVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 168 LGPRLIFGCGYN-QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRG 225
+ FGCG + G G+LG G +S+LSQL + G R + HCL +V G
Sbjct: 207 ANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNG 266
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI--------KGLQII 277
GG +G+ + P + TP+ D+ HY+ + GG + G+ I
Sbjct: 267 GGIFAIGNVVQPK--VKTTPLVPDM--PHYNVILKGIDVGGTALGLPTNIFDSGNSKGTI 322
Query: 278 FDSGSSYTYFNSQAYKTTLDLM---RKDLKGKPLED 310
DSG++ Y YK ++ +D+ + L+D
Sbjct: 323 IDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQD 358
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 90/289 (31%), Positives = 125/289 (43%), Gaps = 34/289 (11%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE----- 103
P+ G+ P +G Y + IG PPK Y L +DTGSD+ WV C C C
Sbjct: 69 LPLGGSGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNC-IQCKECPTRSSLGMDL 127
Query: 104 SLYHPKNN----LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFP 159
+LY K + LV C+ FC + C AN C Y +Y D S+ G V D
Sbjct: 128 TLYDIKESSSGKLVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVL 187
Query: 160 LRLTNGSL----LGPRLIFGCGYNQRNPGPKPPPTA--GVLGLGLGKASILSQLQSLGLT 213
+G L ++FGCG Q A G+LG G +S++SQL S G
Sbjct: 188 YDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKV 247
Query: 214 RNVLGHCLS-VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG------G 266
+ + HCL+ V GGG +GH + P + TP+ D + HYS + G
Sbjct: 248 KKMFAHCLNGVNGGGIFAIGHVVQPK--VNMTPLLPD--QPHYSVNMTAVQVGHTFLSLS 303
Query: 267 KSTGIKGLQ--IIFDSGSSYTYFNSQAYKTTLDLM---RKDLKGKPLED 310
T +G + I DSG++ Y Y+ + M DLK + L D
Sbjct: 304 TDTSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHD 352
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 139/350 (39%), Gaps = 47/350 (13%)
Query: 1 MEEKGKRVMGLLVLLMFATF---QGCFSEANQPPSKKKSTQSTAAH------RFGSTAVF 51
M E RV+ L +++ F G FS + ++S AH R +
Sbjct: 5 MAEAQSRVLLLTMMISFTIVSANNGVFSVKYKYAGLQRSLSDLKAHDDQRQLRILAGVDL 64
Query: 52 PITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC----NAPCTGCTLPPESL 105
P+ G P LG Y + IG P K Y + +DTGSD+ WV C P T +L
Sbjct: 65 PLGGIGRPDILGLYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTL 124
Query: 106 YHPKNN----LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLR 161
Y+ + LV C+ FC + + C AN C Y +Y D S+ G V D
Sbjct: 125 YNINESDTGKLVPCDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYA 184
Query: 162 LTNGSL----LGPRLIFGCGYNQRNP--GPKPPPTAGVLGLGLGKASILSQLQSLGLTRN 215
+G L +IFGCG Q G+LG G +S++SQL G +
Sbjct: 185 RVSGDLKTTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKK 244
Query: 216 VLGHCLS-VRGGGYLFLGHDLVPSSGIAWTP-----------MSRDLLEKHYSSGPAELL 263
+ HCL GGG +GH + P + TP M+ + + S P ++
Sbjct: 245 IFAHCLDGTNGGGIFVIGHVVQPK--VNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVF 302
Query: 264 FGGKSTGIKGLQIIFDSGSSYTYFNSQAYK---TTLDLMRKDLKGKPLED 310
G G I DSG++ Y YK + + + DLK + D
Sbjct: 303 EAGDRKG-----AIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRD 347
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 127/278 (45%), Gaps = 22/278 (7%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT-GCTLPPESLYHPKNNL----VACND 117
Y VT+ IG P + + + DTGSDLTWVQC PCT C E L+ P + V C
Sbjct: 126 YVVTIGIGTPARNFTVLFDTGSDLTWVQCK-PCTDSCYQQQEPLFDPSKSSTYVDVPCGT 184
Query: 118 PFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
P C +++ C C+Y V Y D + G L + F L + G ++FGC
Sbjct: 185 PQCK-IGGGQDLTC-GGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAG--VVFGCS 240
Query: 178 YNQRN---PGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG--GGYLFLG 232
+ + + AG+LGLG G +SILSQ + G + +V +CL RG GYL +G
Sbjct: 241 HEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRR-GNSGDVFSYCLPPRGSSAGYLTIG 299
Query: 233 HDLVPSSGIAWTPMSRD--LLEKHYSSGPAELLFGGKSTGIKG----LQIIFDSGSSYTY 286
P S +++TP+ D L Y + G + I + + DSG+ T+
Sbjct: 300 AAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTVIDSGTVITH 359
Query: 287 FNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
+ AY D R+ + G + ++L C+ T
Sbjct: 360 MPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVT 397
>gi|213998834|gb|ACJ60784.1| nucellin [Hordeum bulbosum]
Length = 154
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/144 (38%), Positives = 80/144 (55%), Gaps = 5/144 (3%)
Query: 171 RLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGLTR-NVLGHCLSVRGGGY 228
++ FGCGY Q P PP P G+LGLG+GKA +QL+ + + NV+GHCLS +G G
Sbjct: 8 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLRGHKMIKENVIGHCLSSKGKGV 67
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G P+ G+ W PM L +YS G AE+ + G + +FDSGS+YT+
Sbjct: 68 LYVGDFNPPTRGVTWVPMRESLF--YYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHV 125
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + +R L E+
Sbjct: 126 PAQIYSEIVSKVRGTLSESSFEEV 149
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 81/275 (29%), Positives = 116/275 (42%), Gaps = 33/275 (12%)
Query: 44 RFGSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC--- 98
R S P+ G+ P +G Y + +G P + + + +DTGSD+ WV C A C C
Sbjct: 64 RLLSAIDLPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNC-AGCIRCPRK 122
Query: 99 -----TLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVL 153
P ++ V+C+D FCS ++ + C + C Y +LY D S+ G L
Sbjct: 123 SDLVELTPYDADASSTAKSVSCSDNFCS--YVNQRSECHSGSTCQYVILYGDGSSTNGYL 180
Query: 154 VTDHFPLRLTNGSL----LGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQ 208
V D L L G+ +IFGCG Q G G++G G +S +SQL
Sbjct: 181 VRDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLA 240
Query: 209 SLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK- 267
S G + HCL GG +F ++V S + TPM HYS + G
Sbjct: 241 SQGKVKRSFAHCLDNNNGGGIFAIGEVV-SPKVKTTPMLSK--SAHYSVNLNAIEVGNSV 297
Query: 268 ---------STGIKGLQIIFDSGSSYTYFNSQAYK 293
S KG +I DSG++ Y Y
Sbjct: 298 LQLSSDAFDSGDDKG--VIIDSGTTLVYLPDAVYN 330
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 87/327 (26%), Positives = 139/327 (42%), Gaps = 45/327 (13%)
Query: 26 EANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTG 83
E +Q ++ ++ G FP+ G P +G Y L++G PP+ + + +DTG
Sbjct: 42 ELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTG 101
Query: 84 SDLTWVQCNAPCTGCTLPPES-------LYHPKNNL----VACNDPFCSAFHLPENIRCE 132
SD+ WV C A C GC P S + P +++ ++C+D CS + C
Sbjct: 102 SDVLWVSC-ASCNGC--PQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCS 158
Query: 133 A-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR----LIFGCGYNQRNPGPKP 187
N+ C Y Y D + G V+D + GS L P ++FGC +Q K
Sbjct: 159 VQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKS 218
Query: 188 PPTA-GVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHDLVPSSGIAWT 244
G+ G G S++SQL S G+ V HCL GGG L LG + P+ + +T
Sbjct: 219 DRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPN--MVFT 276
Query: 245 PMSRDLLEKHYSSGPAELLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYKTTL 296
P+ + HY+ + G++ I G I D+G++ Y + AY +
Sbjct: 277 PLVPS--QPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV 334
Query: 297 DLMRKDLKGKPLEDTAEEKALPVCWKG 323
+ + + + PV KG
Sbjct: 335 E---------AITNAVSQSVRPVVSKG 352
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 87/327 (26%), Positives = 139/327 (42%), Gaps = 45/327 (13%)
Query: 26 EANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTG 83
E +Q ++ ++ G FP+ G P +G Y L++G PP+ + + +DTG
Sbjct: 42 ELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTG 101
Query: 84 SDLTWVQCNAPCTGCTLPPES-------LYHPKNNL----VACNDPFCSAFHLPENIRCE 132
SD+ WV C A C GC P S + P +++ ++C+D CS + C
Sbjct: 102 SDVLWVSC-ASCNGC--PQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCS 158
Query: 133 A-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR----LIFGCGYNQRNPGPKP 187
N+ C Y Y D + G V+D + GS L P ++FGC +Q K
Sbjct: 159 VQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKS 218
Query: 188 PPTA-GVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHDLVPSSGIAWT 244
G+ G G S++SQL S G+ V HCL GGG L LG + P+ + +T
Sbjct: 219 DRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPN--MVFT 276
Query: 245 PMSRDLLEKHYSSGPAELLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYKTTL 296
P+ + HY+ + G++ I G I D+G++ Y + AY +
Sbjct: 277 PLVPS--QPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV 334
Query: 297 DLMRKDLKGKPLEDTAEEKALPVCWKG 323
+ + + + PV KG
Sbjct: 335 E---------AITNAVSQSVRPVVSKG 352
>gi|213998810|gb|ACJ60772.1| nucellin [Hordeum comosum]
Length = 154
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 58/144 (40%), Positives = 80/144 (55%), Gaps = 5/144 (3%)
Query: 171 RLIFGCGYNQRNPGPKPPP-TAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGY 228
++ FGCGY Q P PP G+LGLG+GKA +QL+ + T NV+GHCLS +G G
Sbjct: 8 KIAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 67
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G PS G+ W PM L +YS G AELL + G + +FDS S+YT+
Sbjct: 68 LYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPIRGNPTFEAVFDSDSTYTHV 125
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDT 311
+Q Y + +R L LE+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLEEV 149
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 119/275 (43%), Gaps = 32/275 (11%)
Query: 46 GSTAVFPITGNV--YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP- 102
G F + G+ Y +G Y +K+G+PP+ + + IDTGSD+ WV CN+ C C
Sbjct: 47 GGVVDFSVQGSSDPYLVGLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTSG 105
Query: 103 --------ESLYHPKNNLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVL 153
+S V C+DP C++ +C + DQC Y Y D + G
Sbjct: 106 LGIQLNFFDSSSSSTAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYY 165
Query: 154 VTDHFPLRLTNGSLL----GPRLIFGC-GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
V+D G L ++FGC Y + G+ G G G+ S++SQL
Sbjct: 166 VSDTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLS 225
Query: 209 SLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGG 266
+ G+T V HCL GGG L LG L P GI ++P+ + HY+ + G
Sbjct: 226 TRGITPRVFSHCLKGDGSGGGILVLGEILEP--GIVYSPLVPS--QPHYNLNLLSIAVNG 281
Query: 267 KSTGIKGLQI--------IFDSGSSYTYFNSQAYK 293
+ I I DSG++ Y ++AY
Sbjct: 282 QLLPIDPAAFATSNSQGTIVDSGTTLAYLVAEAYD 316
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 87/327 (26%), Positives = 139/327 (42%), Gaps = 45/327 (13%)
Query: 26 EANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTG 83
E +Q ++ ++ G FP+ G P +G Y L++G PP+ + + +DTG
Sbjct: 42 ELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTG 101
Query: 84 SDLTWVQCNAPCTGCTLPPES-------LYHPKNNL----VACNDPFCSAFHLPENIRCE 132
SD+ WV C A C GC P S + P +++ ++C+D CS + C
Sbjct: 102 SDVLWVSC-ASCNGC--PQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCS 158
Query: 133 A-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR----LIFGCGYNQRNPGPKP 187
N+ C Y Y D + G V+D + GS L P ++FGC +Q K
Sbjct: 159 VQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKS 218
Query: 188 PPTA-GVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHDLVPSSGIAWT 244
G+ G G S++SQL S G+ V HCL GGG L LG + P+ + +T
Sbjct: 219 DRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPN--MVFT 276
Query: 245 PMSRDLLEKHYSSGPAELLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYKTTL 296
P+ + HY+ + G++ I G I D+G++ Y + AY +
Sbjct: 277 PLVPS--QPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV 334
Query: 297 DLMRKDLKGKPLEDTAEEKALPVCWKG 323
+ + + + PV KG
Sbjct: 335 E---------AITNAVSQSVRPVVSKG 352
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)
Query: 1 MEEKGKRVMGLLVLLMFATFQGC---FSEANQPPSKKKSTQSTAAH------RFGSTAVF 51
ME V+ +++ F + C ++ +++S ++ AH RF S
Sbjct: 1 MEIARFAVVSFFLVISFFSSGDCNLVLKVQHKFKGRERSLEAFKAHDIQRRGRFLSAIDL 60
Query: 52 PITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES----- 104
+ GN +P G Y + +G P + Y + +DTGSD+ WV C A CT C P +S
Sbjct: 61 QLGGNGHPSESGLYFAKIGLGTPVQDYYVQVDTGSDILWVNC-AGCTNC--PKKSDLGIE 117
Query: 105 ------LYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHF 158
+N V CN FC++ + C C+Y V Y D S+ G V DH
Sbjct: 118 LSLYSPSSSSTSNRVTCNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHV 177
Query: 159 PLRLTNGSL----LGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLT 213
L G+ ++FGCG Q G G+LG G +S++SQL S G
Sbjct: 178 VLDRVTGNFQTTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKV 237
Query: 214 RNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPM 246
+ V HCL ++ GGG +G + P + TP+
Sbjct: 238 KRVFAHCLDNINGGGIFAIGEVVQPK--VRTTPL 269
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 125/275 (45%), Gaps = 39/275 (14%)
Query: 51 FPITG--NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---- 104
FP+ G N Y +G Y +K+GNP K Y + IDTGSD+ WV C +PCTGC P S
Sbjct: 75 FPVEGSANPYMVGLYFTRVKLGNPAKEYFVQIDTGSDILWVAC-SPCTGC--PTSSGLNI 131
Query: 105 ---LYHPK----NNLVACNDPFCSAFHLPENIRCEANDQ----CDYEVLYADHGSSLGVL 153
++P ++ + C+D C+A C+++D C Y Y D + G
Sbjct: 132 QLEFFNPDSSSTSSRIPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFY 191
Query: 154 VTD--HFPLRLTNGSLL--GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQ 208
V+D +F + N ++FGC +Q K G+ G G + S++SQL
Sbjct: 192 VSDTMYFDTVMGNEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLY 251
Query: 209 SLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGG 266
SLG++ HCL S GGG L LG + P G+ +TP+ + HY+ + G
Sbjct: 252 SLGVSPKTFSHCLKGSDNGGGILVLGEIVEP--GLVFTPLVPS--QPHYNLNLESIAVSG 307
Query: 267 KSTGIKGLQI--------IFDSGSSYTYFNSQAYK 293
+ I I DSG++ Y AY
Sbjct: 308 QKLPIDSSLFATSNTQGTIVDSGTTLVYLVDGAYD 342
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 117/279 (41%), Gaps = 33/279 (11%)
Query: 44 RFGSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC--- 98
R S P+ G+ P +G Y + +G P + + + +DTGSD+ WV C A C C
Sbjct: 64 RLLSAIDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNC-AGCIRCPRK 122
Query: 99 -----TLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVL 153
P + V+C+D FCS ++ + C + C Y ++Y D S+ G L
Sbjct: 123 SDLVELTPYDVDASSTAKSVSCSDNFCS--YVNQRSECHSGSTCQYVIMYGDGSSTNGYL 180
Query: 154 VTDHFPLRLTNGSL----LGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQ 208
V D L L G+ +IFGCG Q G G++G G +S +SQL
Sbjct: 181 VKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLA 240
Query: 209 SLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK- 267
S G + HCL GG +F ++V S + TPM HYS + G
Sbjct: 241 SQGKVKRSFAHCLDNNNGGGIFAIGEVV-SPKVKTTPMLSK--SAHYSVNLNAIEVGNSV 297
Query: 268 ---------STGIKGLQIIFDSGSSYTYFNSQAYKTTLD 297
S KG +I DSG++ Y Y L+
Sbjct: 298 LELSSNAFDSGDDKG--VIIDSGTTLVYLPDAVYNPLLN 334
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 79/267 (29%), Positives = 127/267 (47%), Gaps = 37/267 (13%)
Query: 49 AVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT-LPPESLYH 107
A P+ G V GY+ TL +G P + + + +DTGS +T+V C + C ++ +
Sbjct: 48 ATLPLHGAVKDYGYFYATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKDAAFD 107
Query: 108 PKNN----LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
P ++ ++ C+ C P C +C Y+ YA+ SS G+LV+D LR
Sbjct: 108 PASSSSSAVIGCDSDKCICGRPP--CGCSEKRECTYQRTYAEQSSSAGLLVSDQLQLR-- 163
Query: 164 NGSLLGPRLIFGCG-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
+G++ ++FGC YNQ G+LGLG + S+++QL G+ +V
Sbjct: 164 DGAV---EVVFGCETKETGEIYNQE--------ADGILGLGNSEVSLVNQLAGSGVIDDV 212
Query: 217 LGHCL-SVRGGGYLFLGHDLVPSSGIA--WTPMSRDLLEKHYSSGPAELLF-GGKSTGIK 272
C SV G G L LG +A +T + L HY S E L+ GG+ +K
Sbjct: 213 FALCFGSVEGDGALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVK 272
Query: 273 ------GLQIIFDSGSSYTYFNSQAYK 293
G + DSG+++TY S+A++
Sbjct: 273 PERYEEGYGTVLDSGTTFTYLPSEAFQ 299
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 120/262 (45%), Gaps = 35/262 (13%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC------TLPPESLYHPKNNLV 113
+G Y +K+GNP K + + IDTGSD+ WV C +PCTGC + ES ++P ++
Sbjct: 2 VGLYFTRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQLES-FNPDSSST 59
Query: 114 A----CNDPFCSA-FHLPENIRCEANDQ---CDYEVLYADHGSSLGVLVTDHFPLRLTNG 165
A C+D C+A F E I +N Q C Y Y D + G V+D G
Sbjct: 60 ASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMG 119
Query: 166 SLL----GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTRNVLGHC 220
+ ++FGC +Q K G+ G G + S++SQL SLG++ V HC
Sbjct: 120 NEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC 179
Query: 221 L--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG----- 273
L S GGG L LG + P G+ +TP+ + HY+ + G+ I
Sbjct: 180 LKGSDNGGGILVLGEIVEP--GLVYTPLVPS--QPHYNLNLESIAVNGQKLPIDSSLFTT 235
Query: 274 ---LQIIFDSGSSYTYFNSQAY 292
I DSG++ Y AY
Sbjct: 236 SNTQGTIVDSGTTLAYLADGAY 257
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/289 (30%), Positives = 122/289 (42%), Gaps = 34/289 (11%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE----- 103
P+ G+ P +G Y + IG PPK Y L +DTGSD+ WV C C C
Sbjct: 71 LPLGGSGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNC-IQCKECPTRSNLGMDL 129
Query: 104 SLYHPKNN----LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFP 159
+LY K + V C+ FC + C AN C Y +Y D S+ G V D
Sbjct: 130 TLYDIKESSSGKFVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVL 189
Query: 160 LRLTNGSL----LGPRLIFGCGYNQRN--PGPKPPPTAGVLGLGLGKASILSQLQSLGLT 213
+G L ++FGCG Q G+LG G +S++SQL S G
Sbjct: 190 YDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKV 249
Query: 214 RNVLGHCLS-VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG------G 266
+ + HCL+ V GGG +GH + P + TP+ D + HYS + G
Sbjct: 250 KKMFAHCLNGVNGGGIFAIGHVVQPK--VNMTPLLPD--QPHYSVNMTAVQVGHAFLSLS 305
Query: 267 KSTGIKGLQ--IIFDSGSSYTYFNSQAYK---TTLDLMRKDLKGKPLED 310
T +G + I DSG++ Y Y+ + DLK + L D
Sbjct: 306 TDTSTQGDRKGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHD 354
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/258 (31%), Positives = 119/258 (46%), Gaps = 35/258 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------LYHPK----NN 111
Y +K+G+PPK Y + IDTGSD+ WV C +PCTGC P S ++P ++
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGC--PSSSGLNIQLEFFNPDTSSTSS 173
Query: 112 LVACNDPFCSAFHLPENIRCEANDQ--CDYEVLYADHGSSLGVLVTD--HFPLRLTNGSL 167
+ C+D C+A C+ +D C Y Y D + G V+D +F + N
Sbjct: 174 KIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQT 233
Query: 168 L--GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTRNVLGHCL--S 222
++FGC +Q K G+ G G + S++SQL SLG++ V HCL S
Sbjct: 234 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGS 293
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG--------L 274
GGG L LG + P G+ +TP+ + HY+ ++ G+ I
Sbjct: 294 DNGGGILVLGEIVEP--GLVYTPLVPS--QPHYNLNLESIVVNGQKLPIDSSLFTTSNTQ 349
Query: 275 QIIFDSGSSYTYFNSQAY 292
I DSG++ Y AY
Sbjct: 350 GTIVDSGTTLAYLADGAY 367
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 116/262 (44%), Gaps = 34/262 (12%)
Query: 7 RVMGLLVLLMFATFQGCFSEANQPPSKK-----KSTQSTAAH------RFGSTAVFPITG 55
RV GL++++ + P +K +S + AH RF + P+ G
Sbjct: 3 RVSGLILIVFLLFVDASNANLVFPVQRKFNGPHRSLDAIKAHDDRRRGRFLAAIDVPLGG 62
Query: 56 NVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------LY 106
N P G Y + +G+P K + + +DTGSD+ WV C A CT C P +S LY
Sbjct: 63 NGLPSSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNC-AGCTAC--PKKSGLGMDLTLY 119
Query: 107 HPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRL 162
P +N V C D FC+ + C+ + C Y + Y D ++ G V D
Sbjct: 120 DPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDE 179
Query: 163 TNGSLL----GPRLIFGCGYNQRNPGPKPPPTA--GVLGLGLGKASILSQLQSLGLTRNV 216
+G+L +IFGCG Q A G++G G +S+LSQL + G + +
Sbjct: 180 VSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRI 239
Query: 217 LGHCL-SVRGGGYLFLGHDLVP 237
HCL S GGG +G + P
Sbjct: 240 FSHCLDSHHGGGIFSIGQVMEP 261
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/335 (27%), Positives = 141/335 (42%), Gaps = 54/335 (16%)
Query: 6 KRVMGLLVLLMFATFQGCFSEANQ--PPSKK-----KSTQSTAAH------RFGSTAVFP 52
+R++ L+V L C + AN P +K ++ + AH RF S
Sbjct: 5 ERLVRLVVSLFVVVQLCCHANANMVFPVVRKFKGPAENLAAIKAHDAGRRGRFLSVVDLA 64
Query: 53 ITGNVYPLG---YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE------ 103
+ GN P YY+ KIG P Y + +DTGSD WV C GCT P+
Sbjct: 65 LGGNGRPTSTGLYYT---KIGLGPNDYYVQVDTGSDTLWVNC----VGCTTCPKKSGLGM 117
Query: 104 --SLYHPKNN----LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDH 157
+LY P ++ +V C+D FC++ + C+ + C Y + Y D ++ G + D
Sbjct: 118 ELTLYDPNSSKTSKVVPCDDEFCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDD 177
Query: 158 FPLRLTNGSLL----GPRLIFGCGYNQRNPGPKPPPTA--GVLGLGLGKASILSQLQSLG 211
G L +IFGCG Q T+ G++G G +S+LSQL + G
Sbjct: 178 LTFDRVVGDLRTVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAG 237
Query: 212 LTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTG 270
+ V HCL +V GGG +G + P + TP+ + HY+ ++ G
Sbjct: 238 KVKRVFSHCLDTVNGGGIFAIGEVVQPK--VKTTPLVPRM--AHYNVVLKDIEVAGDPIQ 293
Query: 271 I--------KGLQIIFDSGSSYTYFNSQAYKTTLD 297
+ G I DSG++ Y Y L+
Sbjct: 294 LPTDIFDSTSGRGTIIDSGTTLAYLPVSIYDQLLE 328
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 130/306 (42%), Gaps = 44/306 (14%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP---------ES 104
T + Y +G Y +K+G+P K + + IDTGSD+ W+ C C+ C ++
Sbjct: 74 TSDPYFVGLYFTKVKLGSPAKEFYVQIDTGSDILWINC-ITCSNCPHSSGLGIELDFFDT 132
Query: 105 LYHPKNNLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTD--HFPLR 161
LV+C DP CS C + +QC Y Y D + G V+D +F
Sbjct: 133 AGSSTAALVSCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTV 192
Query: 162 LTNGSLLG---PRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTRNVL 217
L S++ +IFGC Q K G+ G G G S++SQL S G+T V
Sbjct: 193 LLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVF 252
Query: 218 GHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK-------- 267
HCL GGG L LG L PS I ++P+ + HY+ + G+
Sbjct: 253 SHCLKGGENGGGVLVLGEILEPS--IVYSPLVPS--QPHYNLNLQSIAVNGQLLPIDSNV 308
Query: 268 --STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTW 325
+T +G I DSG++ Y +AY + K + + + P+ KG
Sbjct: 309 FATTNNQG--TIVDSGTTLAYLVQEAYNPFV---------KAITAAVSQFSKPIISKGNQ 357
Query: 326 KCLLGN 331
L+ N
Sbjct: 358 CYLVSN 363
>gi|213998800|gb|ACJ60767.1| nucellin [Hordeum marinum subsp. marinum]
Length = 142
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 77/139 (55%), Gaps = 5/139 (3%)
Query: 176 CGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGYLFLGH 233
CGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G L++G+
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGN 60
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYFNSQAY 292
PS G+ W PM +YS G AELL + G + +FDSGS+YT SQ Y
Sbjct: 61 FNPPSRGVTWVPMRESSF--YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIY 118
Query: 293 KTTLDLMRKDLKGKPLEDT 311
+ +R L LE+
Sbjct: 119 NEIVSKVRGTLSESSLEEV 137
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 121/273 (44%), Gaps = 38/273 (13%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYH- 107
FP+ G P +G Y L++G PP+ + + IDTGSD+ WV C + C GC P S H
Sbjct: 38 FPVQGTFDPFLVGLYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGS-CNGC--PVNSGLHI 94
Query: 108 ----------PKNNLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTD 156
P +L++C+D CS + C A N+ C Y Y D + G V+D
Sbjct: 95 PLNFFDPGSSPTASLISCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSD 154
Query: 157 --HFPLRLTNGSLLGPR---LIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSL 210
HF L GS++ ++FGC Q K G+ G G S++SQL S
Sbjct: 155 LLHFDTVL-GGSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQ 213
Query: 211 GLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS 268
G++ HCL GGG L LG + P+ I +TP+ + HY+ + G++
Sbjct: 214 GISPRAFSHCLKGDDSGGGILVLGEIVEPN--IVYTPLVPS--QPHYNLNMQSISVNGQT 269
Query: 269 TGI--------KGLQIIFDSGSSYTYFNSQAYK 293
I I DSG++ Y AY
Sbjct: 270 LAIDPSVFGTSSSQGTIIDSGTTLAYLAEAAYD 302
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 83/295 (28%), Positives = 123/295 (41%), Gaps = 41/295 (13%)
Query: 44 RFGSTAVFPITGNVYPLG---YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTL 100
RF S + GN P YY+ KIG PK Y + +DTGSD WV C GCT
Sbjct: 55 RFLSVVDVALGGNGRPTSNGLYYT---KIGLGPKDYYVQVDTGSDTLWVNC----VGCTA 107
Query: 101 PPE--------SLYHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGS 148
P+ +LY P + V C+D FC++ + + C C Y + Y D +
Sbjct: 108 CPKKSGLGMDLTLYDPNLSKTSKAVPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGST 167
Query: 149 SLGVLVTDHFPLRLTNGSLL----GPRLIFGCGYNQRNPGPKPPPTA--GVLGLGLGKAS 202
+ G + D G L +IFGCG Q T+ G++G G +S
Sbjct: 168 TSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSS 227
Query: 203 ILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAE 261
+LSQL + G + + HCL S+ GGG +G + P + TP+ + + HY+ +
Sbjct: 228 VLSQLAAAGKVKRIFSHCLDSISGGGIFAIGEVVQPK--VKTTPLLQGM--AHYNVVLKD 283
Query: 262 LLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
+ G + G I DSG++ Y Y L+ + G L
Sbjct: 284 IEVAGDPIQLPSDILDSSSGRGTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKL 338
>gi|213998806|gb|ACJ60770.1| nucellin [Hordeum flexuosum]
Length = 136
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/129 (42%), Positives = 75/129 (58%), Gaps = 5/129 (3%)
Query: 171 RLIFGCGYNQRNPGPKPPP-TAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGY 228
++ FGCGY Q P PP G+LGLG+GKA +QL+ + T NV+GHCLS +G G
Sbjct: 8 KIAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 67
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYF 287
L++G PS G+ W PM L +YS G AELL + G + +FDSGS+YT+
Sbjct: 68 LYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 288 NSQAYKTTL 296
+Q Y +
Sbjct: 126 PAQIYNEIV 134
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 119/274 (43%), Gaps = 32/274 (11%)
Query: 46 GSTAVFPITGNV--YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP- 102
G F + G+ Y +G Y +K+G PP+ + + IDTGSD+ WV C++ C+ C
Sbjct: 62 GGVVDFSVQGSSDPYLVGLYFTRVKLGTPPREFNVQIDTGSDVLWVTCSS-CSNCPQTSG 120
Query: 103 --------ESLYHPKNNLVACNDPFCSAFHLPENIRC-EANDQCDYEVLYADHGSSLGVL 153
++ LV C+ P C++ +C ++QC Y Y D + G
Sbjct: 121 LGIQLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYY 180
Query: 154 VTDHFPLRLTNGSLL----GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQ 208
V+D F G L ++FGC Q K G+ G G G+ S++SQL
Sbjct: 181 VSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLS 240
Query: 209 SLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGG 266
S G+T V HCL GGG L LG L P GI ++P+ + HY+ + G
Sbjct: 241 SHGITPRVFSHCLKGEDSGGGILVLGEILEP--GIVYSPLVPS--QPHYNLDLQSIAVSG 296
Query: 267 KSTGIKGLQI--------IFDSGSSYTYFNSQAY 292
+ I I D+G++ Y +AY
Sbjct: 297 QLLPIDPAAFATSSNRGTIIDTGTTLAYLVEEAY 330
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 125/285 (43%), Gaps = 37/285 (12%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP---------ES 104
T + Y +G Y +K+G+P K + + IDTGSD+ W+ C C+ C ++
Sbjct: 74 TSDPYFVGLYFTKVKLGSPAKDFYVQIDTGSDILWINC-ITCSNCPHSSGLGIELDFFDT 132
Query: 105 LYHPKNNLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTD--HFPLR 161
LV+C DP CS C + +QC Y Y D + G V+D +F
Sbjct: 133 AGSSTAALVSCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTV 192
Query: 162 LTNGSLLG---PRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTRNVL 217
L S++ ++FGC Q K G+ G G G S++SQL S G+T V
Sbjct: 193 LLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVF 252
Query: 218 GHCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK-------- 267
HCL GGG L LG L PS I ++P+ L HY+ + G+
Sbjct: 253 SHCLKGGENGGGVLVLGEILEPS--IVYSPLVPSL--PHYNLNLQSIAVNGQLLPIDSNV 308
Query: 268 --STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK--GKPL 308
+T +G I DSG++ Y +AY +D + + KP+
Sbjct: 309 FATTNNQG--TIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPI 351
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 80/276 (28%), Positives = 127/276 (46%), Gaps = 38/276 (13%)
Query: 44 RFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC----- 98
RF FP+ GN LG Y + +GNP + ++ +DTGSD+ WV+C +PC C
Sbjct: 64 RFLQGISFPLKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQD 122
Query: 99 TLPPESLYH----PKNNLVACNDPFCSAFHLPENIRCEA---NDQCDYEVLYADHGSSLG 151
+PP S+Y+ +++ +C+DP C+ E + C N C Y Y D +S+G
Sbjct: 123 IIPPLSIYNLSASSTSSVSSCSDPLCTG----EEVVCSRSGNNSACAYVSSYQDKSASVG 178
Query: 152 VLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLG 211
V D L G+ R+ FGC N P G++G GL ++ +Q+ +
Sbjct: 179 AYVRDDMHYVLHGGNATTSRIFFGCATNITGSW----PVDGIMGFGLISKTVPNQIATQR 234
Query: 212 LTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPM-------SRDLLEKHYSS----- 257
V HCL GGG L G + ++ + +TP+ + DLL +S
Sbjct: 235 NMSRVFSHCLGGEKHGGGILEFG-EAPNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPI 293
Query: 258 GPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYK 293
P E + ST G +I DSG+++ ++A +
Sbjct: 294 DPKEFSYVRNSTNNTG--VIIDSGTTFVLLTTKANR 327
>gi|213998814|gb|ACJ60774.1| nucellin [Hordeum cf. pusillum GP-2003]
Length = 142
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 77/139 (55%), Gaps = 5/139 (3%)
Query: 176 CGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGYLFLGH 233
CGY Q P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G L++G
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGD 60
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYFNSQAY 292
PS G+ W PM L +YS G AELL + G + +FDSGS+YT+ +Q Y
Sbjct: 61 FNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAQIY 118
Query: 293 KTTLDLMRKDLKGKPLEDT 311
+ + L LE+
Sbjct: 119 NEIVSKVIGTLSESSLEEV 137
>gi|213998802|gb|ACJ60768.1| nucellin [Hordeum murinum subsp. glaucum]
Length = 142
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 55/139 (39%), Positives = 76/139 (54%), Gaps = 5/139 (3%)
Query: 176 CGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGLTR-NVLGHCLSVRGGGYLFLGH 233
CGY Q P PP P G+LGLG+GKA QL+ + + N++GHCLS +G G L++G
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAVQLKGQKMIKENIIGHCLSSKGKGVLYVGD 60
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYFNSQAY 292
PS G+ W PM L +YS G AELL + G + +FDSGS+YT+ + Y
Sbjct: 61 FNPPSRGVTWVPMRESLF--YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAHIY 118
Query: 293 KTTLDLMRKDLKGKPLEDT 311
+ +R L LE+
Sbjct: 119 SEIVSKVRGTLSESSLEEV 137
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 79/272 (29%), Positives = 122/272 (44%), Gaps = 36/272 (13%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---- 104
F + G P +G Y +++G PP + + IDTGSD+ WV CN+ C+GC P S
Sbjct: 61 FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CSGC--PQTSGLQI 117
Query: 105 ---LYHP----KNNLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTD 156
+ P ++++AC+D C+ + C + N+QC Y Y D + G V+D
Sbjct: 118 QLNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSD 177
Query: 157 HFPLR-LTNGSLLGPR---LIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLG 211
L + GS+ ++FGC Q K G+ G G + S++SQL S G
Sbjct: 178 MMHLNTIFEGSVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQG 237
Query: 212 LTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST 269
+ V HCL GGG L LG + P+ I +T + + HY+ + G++
Sbjct: 238 IAPRVFSHCLKGDSSGGGILVLGEIVEPN--IVYTSLVP--AQPHYNLNLQSIAVNGQTL 293
Query: 270 GIKGLQI--------IFDSGSSYTYFNSQAYK 293
I I DSG++ Y +AY
Sbjct: 294 QIDSSVFATSNSRGTIVDSGTTLAYLAEEAYD 325
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 78/264 (29%), Positives = 118/264 (44%), Gaps = 40/264 (15%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------LYHPK----NN 111
Y ++IG PPK + + +DTGSD+ WV C C C P +S LY PK +
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNC-VSCDKC--PTKSGLGIDLALYDPKGSSSGS 143
Query: 112 LVACNDPFCSAFH-----LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS 166
V+C++ FC+A + LP C A C+Y Y D S+ G V+D +G+
Sbjct: 144 AVSCDNKFCAATYGSGEKLP---GCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGN 200
Query: 167 L----LGPRLIFGCGYNQ-RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
+IFGCG Q + G++G G S LSQL S G + + HCL
Sbjct: 201 AQTRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCL 260
Query: 222 -SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI--------K 272
+++GGG +G + P + TP+ ++ HY+ + G + + +
Sbjct: 261 DTIKGGGIFAIGEVVQPK--VKSTPLLPNM--SHYNVNLQSIDVAGNALQLPPHIFETSE 316
Query: 273 GLQIIFDSGSSYTYFNSQAYKTTL 296
I DSG++ TY YK L
Sbjct: 317 KRGTIIDSGTTLTYLPELVYKDIL 340
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 79/272 (29%), Positives = 122/272 (44%), Gaps = 36/272 (13%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---- 104
F + G P +G Y +++G PP + + IDTGSD+ WV CN+ C+GC P S
Sbjct: 11 FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CSGC--PQTSGLQI 67
Query: 105 ---LYHP----KNNLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTD 156
+ P ++++AC+D C+ + C + N+QC Y Y D + G V+D
Sbjct: 68 QLNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSD 127
Query: 157 HFPLR-LTNGSLLGPR---LIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLG 211
L + GS+ ++FGC Q K G+ G G + S++SQL S G
Sbjct: 128 MMHLNTIFEGSVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQG 187
Query: 212 LTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST 269
+ V HCL GGG L LG + P+ I +T + + HY+ + G++
Sbjct: 188 IAPRVFSHCLKGDSSGGGILVLGEIVEPN--IVYTSLVP--AQPHYNLNLQSIAVNGQTL 243
Query: 270 GI--------KGLQIIFDSGSSYTYFNSQAYK 293
I I DSG++ Y +AY
Sbjct: 244 QIDSSVFATSNSRGTIVDSGTTLAYLAEEAYD 275
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 78/271 (28%), Positives = 121/271 (44%), Gaps = 36/271 (13%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---- 104
F + G P +G Y +++G PP + + IDTGSD+ WV CN+ C GC P S
Sbjct: 64 FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CNGC--PQTSGLQI 120
Query: 105 ---LYHP----KNNLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTD 156
+ P ++++AC+D C+ + C + N+QC Y Y D + G V+D
Sbjct: 121 QLNFFDPGSSSTSSMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSD 180
Query: 157 HFPLR-LTNGSLLGPR---LIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLG 211
L + GS+ ++FGC Q K G+ G G + S++SQL S G
Sbjct: 181 MMHLNTIFEGSMTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQG 240
Query: 212 LTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST 269
+ + HCL GGG L LG + P+ I +T + + HY+ + G++
Sbjct: 241 IAPRIFSHCLKGDSSGGGILVLGEIVEPN--IVYTSLVP--AQPHYNLNLQSISVNGQTL 296
Query: 270 GIKGLQI--------IFDSGSSYTYFNSQAY 292
I I DSG++ Y +AY
Sbjct: 297 QIDSSVFATSNSRGTIVDSGTTLAYLAEEAY 327
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 77/243 (31%), Positives = 106/243 (43%), Gaps = 25/243 (10%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE---------SLYHPK----NNLV 113
+ IG P Y + +DTGSDL W+ C+ +GC + ++Y P + +
Sbjct: 117 VSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQTI 176
Query: 114 ACNDPFCSAFHLPENIRC-EANDQCDYEVLYADHG-SSLGVLVTD--HFPLRLTNGSLLG 169
CN+ CS RC A C Y+V Y +G SS GVLV D H L
Sbjct: 177 PCNNTLCS-----RQSRCPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTTDDAQSRALD 231
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYL 229
++IFGCG Q G+ GLG+ S+ S L G T N C G G +
Sbjct: 232 AKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLAREGYTSNSFSMCFGRDGIGRI 291
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNS 289
G SSG TP + L Y+ ++ GG+ ++ IFDSG+S+TY N
Sbjct: 292 SFGD--TGSSGQGETPFNLRQLHPTYNVSITKINVGGRDADLE-FSAIFDSGTSFTYLND 348
Query: 290 QAY 292
AY
Sbjct: 349 PAY 351
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 85/281 (30%), Positives = 126/281 (44%), Gaps = 36/281 (12%)
Query: 44 RFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP 101
R + A P+ G P G Y + IG P K Y + +DTGSD+ WV C C C P
Sbjct: 68 RLLAAADIPLGGLGLPTDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNC-ISCDRC--P 124
Query: 102 PES-------LYHPKN----NLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSL 150
+S LY PK+ + V+C+ FC+A + C + C+Y V Y D S+
Sbjct: 125 RKSGLGLELTLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTT 184
Query: 151 GVLVTDHFPLRLTNGSL----LGPRLIFGCGYNQ-RNPGPKPPPTAGVLGLGLGKASILS 205
G V+D +G + FGCG Q + G G++G G S+LS
Sbjct: 185 GYFVSDLLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLS 244
Query: 206 QLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLF 264
QL + G + + HCL ++ GGG +G+ + P + TP+ ++ HY+ +
Sbjct: 245 QLSAAGKVKKIFAHCLDTINGGGIFAIGNVVQPK--VKTTPLVPNM--PHYNVNLKSIDV 300
Query: 265 GGKS---------TGIKGLQIIFDSGSSYTYFNSQAYKTTL 296
GG + TG K II DSG++ TY YK +
Sbjct: 301 GGTALKLPSHMFDTGEKKGTII-DSGTTLTYLPEIVYKEIM 340
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 118/270 (43%), Gaps = 32/270 (11%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP---------ES 104
T + Y +G Y +K+G+P K + + IDTGSD+ W+ CN C C ++
Sbjct: 62 TSDPYLVGLYFTKVKMGSPAKEFYVQIDTGSDILWLNCNT-CNNCPKSSGLGIDLNYFDT 120
Query: 105 LYHPKNNLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
LV+C+DP CS +C + +QC Y Y D + G V D +
Sbjct: 121 ASSSTAALVSCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYYVYDAMYFDVI 180
Query: 164 NGSLL----GPRLIFGCG-YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
G + ++FGC Y + G+ G G G S++SQ+ S G+ V
Sbjct: 181 MGQSVFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFS 240
Query: 219 HCLSVR--GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK--------- 267
HCL + GGG L LG L P+ I +TP+ L+ HY+ + G+
Sbjct: 241 HCLKGQGSGGGILVLGEILEPN--IVYTPLVP--LQPHYNLNLQSIAVNGQILPIDQDVF 296
Query: 268 STGIKGLQIIFDSGSSYTYFNSQAYKTTLD 297
+TG I DSG++ Y +AY L+
Sbjct: 297 ATG-NNRGTIVDSGTTLAYLVQEAYDPFLN 325
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/311 (29%), Positives = 138/311 (44%), Gaps = 38/311 (12%)
Query: 31 PSKKKSTQSTAAHR--------FGSTAVFP--ITGNVYP-LGYYSVTLKIGNPPKLYELD 79
PSK ++ + T A R F TA+ I + P G Y + L IG PP
Sbjct: 49 PSKTQAERLTDAFRRSVSRVGRFRPTAMTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAI 108
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV----ACNDPFCSAFHLPENIRCEAND 135
+DTGSDLTW QC PCT C L+ PKN+ +C FC A L ++ C
Sbjct: 109 VDTGSDLTWTQCR-PCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLA--LGKDRSCSKEK 165
Query: 136 QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVL 194
+C + YAD + G L ++ + T G + P FGCG++ + G ++G++
Sbjct: 166 KCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHS--SGGIFDKSSSGIV 223
Query: 195 GLGLGKASILSQLQSLGLTRNVLGHCL------SVRGGGYLFLGHDLVPSSGIAWTPMSR 248
GLG G+ S++SQL+S + +CL S F V G TP+ +
Sbjct: 224 GLGGGELSLISQLKS--TINGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQ 281
Query: 249 DLLEKHY-------SSGPAELLFGG--KSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLM 299
+ Y S G L + G K T ++ II DSG++YT+ + Y +
Sbjct: 282 KSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSV 341
Query: 300 RKDLKGKPLED 310
+KGK + D
Sbjct: 342 ANSIKGKRVRD 352
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/292 (29%), Positives = 131/292 (44%), Gaps = 40/292 (13%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
++ QST + G FP+ G P +G Y +++G+PPK + + IDTGSD+ WV C
Sbjct: 56 RRILQSTTS---GGVVDFPVQGTFNPFLVGLYFTRVQLGSPPKDFYVQIDTGSDVLWVSC 112
Query: 92 NAPCTGCTLP-----PESLYHPKNN----LVACNDPFCSAFHLPENIRCEA-NDQCDYEV 141
++ C GC + P + + P ++ LV+C+D C+A + C + +QC Y
Sbjct: 113 SS-CNGCPVTSGLQIPLTFFDPGSSTTAALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTF 171
Query: 142 LYADHGSSLGVLVTDHF---PLRLTNGSL------LGPRLIFGCGYNQRNPGPKPPPTA- 191
Y D + G V D L L++G L + F C Q K
Sbjct: 172 QYGDGSGTSGYYVADLMHLDTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVD 231
Query: 192 GVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRD 249
G+ G G + S++SQL S G+T V HCL GGG L LG + P+ I +TP+
Sbjct: 232 GIFGFGQQEMSVISQLASQGITPRVFSHCLKGDDSGGGVLVLGEIVEPN--IVYTPLVPS 289
Query: 250 LLEKHYSSGPAELLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYK 293
+ HY+ + G++ I I DSG++ Y AY
Sbjct: 290 --QPHYNLYLQSISVAGQTLAIDPSVFGASSNQGTIVDSGTTLAYLAEGAYD 339
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/290 (28%), Positives = 119/290 (41%), Gaps = 34/290 (11%)
Query: 51 FPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---- 104
P+ GN P G Y + IG P K Y + +DTGSD+ WV C C C P +S
Sbjct: 67 LPLGGNGLPTETGLYFTQIGIGTPAKSYYVQVDTGSDILWVNC-VFCDTC--PRKSGLGI 123
Query: 105 ---LYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDH 157
LY P + V C FC A H C C Y + Y D S+ G VTD
Sbjct: 124 ELTLYDPSGSSSGTGVTCGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDF 183
Query: 158 FPLRLTNG----SLLGPRLIFGCGYN-QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGL 212
+G +L + FGCG + G G+LG G +S+LSQL + G
Sbjct: 184 LQYNQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGK 243
Query: 213 TRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI 271
R V HCL ++ GGG +G + P ++ TP+ + HY+ + GG +
Sbjct: 244 VRKVFAHCLDTINGGGIFAIGDVVQPK--VSTTPLVPGM--PHYNVNLEAIDVGGVKLQL 299
Query: 272 --------KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAE 313
+ I DSG++ Y Y + + PL++ +
Sbjct: 300 PTNIFDIGESKGTIIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQD 349
>gi|226530663|ref|NP_001146528.1| uncharacterized protein LOC100280120 [Zea mays]
gi|219887685|gb|ACL54217.1| unknown [Zea mays]
Length = 292
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/187 (33%), Positives = 92/187 (49%), Gaps = 17/187 (9%)
Query: 150 LGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKP-PPTAGVLGLGLGKASILSQLQ 208
+GV V D +G ++FGCGY+Q+ T GVLGL S+ +QL
Sbjct: 1 MGVYVRDSMQFVGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLA 60
Query: 209 SLGLTRNVLGHCLSVR---GGGYLFLGHDLVPSSGIAWTPMS-------RDLLEKHYSSG 258
S G+ N GHC+S GGYLFLG D +P G+ W P+ R K + G
Sbjct: 61 SRGIISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHG 120
Query: 259 PAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALP 318
+L GK T Q++FD+GS+YTYF +A + +++ + ++D + +K LP
Sbjct: 121 DQQLNAQGKLT-----QVVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDDS-DKTLP 174
Query: 319 VCWKGTW 325
C K +
Sbjct: 175 FCMKSDF 181
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 118/260 (45%), Gaps = 34/260 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------LYHPKNN---- 111
Y + IG P K Y + +DTGSD+ WV C C C P +S LY PK++
Sbjct: 33 YYTEIGIGTPTKRYYVQVDTGSDILWVNC-ISCDRC--PRKSGLGLELTLYDPKDSSTGS 89
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL---- 167
V+C+ FC+A + C + C+Y V Y D S+ G V+D +G
Sbjct: 90 KVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRP 149
Query: 168 LGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRG 225
+ FGCG Q + G G++G G S+LSQL + G + + HCL ++ G
Sbjct: 150 ANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTING 209
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS---------TGIKGLQI 276
GG +G+ + P + TP+ ++ HY+ + GG + TG K I
Sbjct: 210 GGIFAIGNVVQPK--VKTTPLVPNM--PHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTI 265
Query: 277 IFDSGSSYTYFNSQAYKTTL 296
I DSG++ TY YK +
Sbjct: 266 I-DSGTTLTYLPEIVYKEIM 284
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/298 (30%), Positives = 137/298 (45%), Gaps = 33/298 (11%)
Query: 31 PSKKKSTQSTAAHR--------FGSTAVFP--ITGNVYP-LGYYSVTLKIGNPPKLYELD 79
PSK ++ + T A R F TA+ I + P G Y + L IG PP
Sbjct: 49 PSKTQAERLTDAFRRSVSRVGRFRPTAMTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAI 108
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV----ACNDPFCSAFHLPENIRCEAND 135
+DTGSDLTW QC PCT C L+ PKN+ +C FC A L ++ C
Sbjct: 109 VDTGSDLTWTQCR-PCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLA--LGKDRSCSKEK 165
Query: 136 QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVL 194
+C + YAD + G L ++ + T G + P FGCG++ + G ++G++
Sbjct: 166 KCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHS--SGGIFDKSSSGIV 223
Query: 195 GLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH 254
GLG G+ S++SQL+S + +CL L + D SS I + R +
Sbjct: 224 GLGGGELSLISQLKS--TINGLFSYCL-------LPVSTDSSISSRINFGASGR-VSGYG 273
Query: 255 YSSGPAELLFGG--KSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
S P L + G K T ++ II DSG++YT+ + Y + +KGK + D
Sbjct: 274 TVSTPLRLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRD 331
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 124/275 (45%), Gaps = 21/275 (7%)
Query: 62 YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACND 117
Y+ TLK+G P + + + IDTGS +T++ C C+ C + P + +AC D
Sbjct: 12 YFYTTLKLGTPERTFSVIIDTGSTITYIPCK-DCSHCGKHTAEWFDPDKSTTAKKLACGD 70
Query: 118 PFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
P C+ + C ND+C Y YA+ SS G ++ D F ++ + RL+FGC
Sbjct: 71 PLCNCG--TPSCTCN-NDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPV---RLVFGCE 124
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVP 237
N G++G+G + SQL + +V C G L LG +P
Sbjct: 125 -NGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGILLLGDVTLP 183
Query: 238 S-SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI------KGLQIIFDSGSSYTYFNSQ 290
+ +TP+ L +Y+ + G++ +G + DSG+++TY +
Sbjct: 184 EGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSGTTFTYLPTD 243
Query: 291 AYKTTLDLMRKDLKGKPLEDT--AEEKALPVCWKG 323
A+K + ++ K L+ T A+ + +CWKG
Sbjct: 244 AFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKG 278
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 118/260 (45%), Gaps = 34/260 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------LYHPKNN---- 111
Y + IG P K Y + +DTGSD+ WV C C C P +S LY PK++
Sbjct: 4 YYTEIGIGTPTKRYYVQVDTGSDILWVNC-ISCDRC--PRKSGLGLELTLYDPKDSSTGS 60
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL---- 167
V+C+ FC+A + C + C+Y V Y D S+ G V+D +G
Sbjct: 61 KVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRP 120
Query: 168 LGPRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRG 225
+ FGCG Q + G G++G G S+LSQL + G + + HCL ++ G
Sbjct: 121 ANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTING 180
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS---------TGIKGLQI 276
GG +G+ + P + TP+ ++ HY+ + GG + TG K I
Sbjct: 181 GGIFAIGNVVQPK--VKTTPLVPNM--PHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTI 236
Query: 277 IFDSGSSYTYFNSQAYKTTL 296
I DSG++ TY YK +
Sbjct: 237 I-DSGTTLTYLPEIVYKEIM 255
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 81/260 (31%), Positives = 113/260 (43%), Gaps = 35/260 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC----TLPPE-SLYHPKNN---- 111
G Y + IG P K Y + +DTGSD+ WV C C C TL E +LY+ +
Sbjct: 78 GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC-IQCKQCPRRSTLGIELTLYNIDESDSGK 136
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL---- 167
LV+C+D FC C+AN C Y +Y D S+ G V D G L
Sbjct: 137 LVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTA--GVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG 225
+IFGCG Q A G+LG G +S++SQL S G + + HCL R
Sbjct: 197 ANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRN 256
Query: 226 GGYLF-LGHDLVPSSGIAWTP-----------MSRDLLEKHYSSGPAELLFGGKSTGIKG 273
GG +F +G + P + TP M+ + + + + PA+L G G
Sbjct: 257 GGGIFAIGRVVQPK--VNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKG--- 311
Query: 274 LQIIFDSGSSYTYFNSQAYK 293
I DSG++ Y Y+
Sbjct: 312 --AIIDSGTTLAYLPEIIYE 329
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/328 (27%), Positives = 137/328 (41%), Gaps = 48/328 (14%)
Query: 22 GCFSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGNVYP--LGYYSVTLKIGNPP 73
G FS + +++S AH R + P+ G+ P +G Y + IG P
Sbjct: 37 GVFSVKYRYAGQQRSLSDLKAHDDRRQLRILAGVDLPLGGSGRPDTVGLYYAKVGIGTPS 96
Query: 74 KLYELDIDTGSDLTWVQC----NAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHL 125
K Y + +DTGSD+ WV C P T +LY+ K++ LV C++ FC +
Sbjct: 97 KDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLVPCDEEFCYEVNG 156
Query: 126 PENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP----RLIFGCGYNQR 181
C AN C Y +Y D S+ G V D +G L +IFGCG Q
Sbjct: 157 GPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSNGSVIFGCGARQS 216
Query: 182 -NPGPKPPPT-AGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-VRGGGYLFLGHDLVPS 238
+ GP G+LG G +S++SQL + + + HCL + GGG +GH + P
Sbjct: 217 GDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGINGGGIFAIGHVVQPK 276
Query: 239 SGIAWTPMSRDLLEKHYSSG-------------PAELLFGGKSTGIKGLQIIFDSGSSYT 285
+ TP+ + + HY+ P E G G I DSG++
Sbjct: 277 --VNMTPLIPN--QPHYNVNMTAVQVGEDFLHLPTEEFEAGDRKG-----AIIDSGTTLA 327
Query: 286 YFNSQAYK---TTLDLMRKDLKGKPLED 310
Y Y+ + + + DLK + D
Sbjct: 328 YLPEIVYEPLVSKIISQQPDLKVHIVRD 355
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 81/260 (31%), Positives = 113/260 (43%), Gaps = 35/260 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC----TLPPE-SLYHPKNN---- 111
G Y + IG P K Y + +DTGSD+ WV C C C TL E +LY+ +
Sbjct: 78 GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC-IQCKQCPRRSTLGIELTLYNIDESDSGK 136
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL---- 167
LV+C+D FC C+AN C Y +Y D S+ G V D G L
Sbjct: 137 LVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTA--GVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG 225
+IFGCG Q A G+LG G +S++SQL S G + + HCL R
Sbjct: 197 ANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRN 256
Query: 226 GGYLF-LGHDLVPSSGIAWTP-----------MSRDLLEKHYSSGPAELLFGGKSTGIKG 273
GG +F +G + P + TP M+ + + + + PA+L G G
Sbjct: 257 GGGIFAIGRVVQPK--VNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKG--- 311
Query: 274 LQIIFDSGSSYTYFNSQAYK 293
I DSG++ Y Y+
Sbjct: 312 --AIIDSGTTLAYLPEIIYE 329
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 81/258 (31%), Positives = 112/258 (43%), Gaps = 31/258 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC----TLPPE-SLYHPKNN---- 111
G Y + IG P K Y + +DTGSD+ WV C C C TL E +LY+ +
Sbjct: 78 GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC-IQCKQCPRRSTLGIELTLYNIDESDSGK 136
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL---- 167
LV+C+D FC C+AN C Y +Y D S+ G V D G L
Sbjct: 137 LVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTA--GVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG 225
+IFGCG Q A G+LG G +S++SQL S G + + HCL R
Sbjct: 197 ANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRN 256
Query: 226 GGYLF-LGH---------DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ 275
GG +F +G LVP+ M+ + + + + PA+L G G
Sbjct: 257 GGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKG----- 311
Query: 276 IIFDSGSSYTYFNSQAYK 293
I DSG++ Y Y+
Sbjct: 312 AIIDSGTTLAYLPEIIYE 329
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 74/248 (29%), Positives = 105/248 (42%), Gaps = 25/248 (10%)
Query: 22 GCFSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGNVYP--LGYYSVTLKIGNPP 73
G FS + +++S + AH RF + P+ G+ P +G Y + IG P
Sbjct: 38 GIFSVKYKYAGRERSLSTLKAHDISRQLRFLAGIDIPLGGSGRPDAVGLYYAKIGIGTPS 97
Query: 74 KLYELDIDTGSDLTWVQC-------NAPCTGCTLPPESLYHPKN-NLVACNDPFCSAFHL 125
K Y + +DTGSD+ WV C G L P L LV+C++ FC +
Sbjct: 98 KDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNG 157
Query: 126 PENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFGCGYNQR 181
C N C Y +Y D S+ G V D+ +G L + FGCG Q
Sbjct: 158 GPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQS 217
Query: 182 NPGPKPPPTA--GVLGLGLGKASILSQLQSLGLTRNVLGHCLS-VRGGGYLFLGHDLVPS 238
A G+LG G +SI+SQL S + + HCL GGG +GH + P
Sbjct: 218 GDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGGGIFAMGHVVQPK 277
Query: 239 SGIAWTPM 246
+ TP+
Sbjct: 278 --VNMTPL 283
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 74/248 (29%), Positives = 105/248 (42%), Gaps = 25/248 (10%)
Query: 22 GCFSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGNVYP--LGYYSVTLKIGNPP 73
G FS + +++S + AH RF + P+ G+ P +G Y + IG P
Sbjct: 38 GVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPS 97
Query: 74 KLYELDIDTGSDLTWVQC-------NAPCTGCTLPPESLYHPKN-NLVACNDPFCSAFHL 125
K Y + +DTGSD+ WV C G L P L LV+C++ FC +
Sbjct: 98 KDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNG 157
Query: 126 PENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL----LGPRLIFGCGYNQR 181
C N C Y +Y D S+ G V D+ +G L + FGCG Q
Sbjct: 158 GPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQS 217
Query: 182 NPGPKPPPTA--GVLGLGLGKASILSQLQSLGLTRNVLGHCLS-VRGGGYLFLGHDLVPS 238
A G+LG G +SI+SQL S + + HCL GGG +GH + P
Sbjct: 218 GDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGGGIFAMGHVVQPK 277
Query: 239 SGIAWTPM 246
+ TP+
Sbjct: 278 --VNMTPL 283
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 81/271 (29%), Positives = 116/271 (42%), Gaps = 36/271 (13%)
Query: 51 FPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP 108
F + G+ PL G Y +K+G PP + + IDTGSD+ WV CN+ C GC P S
Sbjct: 65 FSVEGSSDPLLVGLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNS-CNGC--PRSSGLGI 121
Query: 109 KNNLVAC-----------NDPFC-SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD 156
+ N +DP C SAF ++QC Y Y D + G V++
Sbjct: 122 QLNFFDASSSSSSSLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSE 181
Query: 157 HFPLRLTNGSLL----GPRLIFGCGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLG 211
+ G + ++FGC Q K G+ G G G S++SQL + G
Sbjct: 182 SMYFDMVMGQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARG 241
Query: 212 LTRNVLGHCLSVR--GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST 269
+T V HCL GGG L LG L P GI ++P+ + HY+ + G++
Sbjct: 242 ITPKVFSHCLKGEGNGGGILVLGEVLEP--GIVYSPLVPS--QPHYNLYLQSISVNGQTL 297
Query: 270 GIK--------GLQIIFDSGSSYTYFNSQAY 292
I I DSG++ Y +AY
Sbjct: 298 PIDPSVFATSINRGTIIDSGTTLAYLVEEAY 328
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 95/205 (46%), Gaps = 18/205 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPKNN---- 111
G Y + IG P K Y + +DTGSD+ WV C C GC ++Y P+ +
Sbjct: 88 GLYFTRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGE 146
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG----SL 167
LV C+ FC A + C + C+Y + Y D S+ G VTD +G +
Sbjct: 147 LVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 168 LGPRLIFGCGYN-QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRG 225
+ FGCG + G G+LG G +S+LSQL + G R + HCL +V G
Sbjct: 207 ANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNG 266
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDL 250
GG +G+ + P + TP+ D+
Sbjct: 267 GGIFAIGNVVQPK--VKTTPLVPDM 289
>gi|213998832|gb|ACJ60783.1| nucellin [Hordeum vulgare subsp. spontaneum]
Length = 127
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 51/128 (39%), Positives = 74/128 (57%), Gaps = 5/128 (3%)
Query: 176 CGYNQRNPGPKPP-PTAGVLGLGLGKASILSQLQSLGLTR-NVLGHCLSVRGGGYLFLGH 233
CGY Q P PP P G+LGLG+GKA + +QL+ + + NV+GHCLS +G G L++G
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSKGKGVLYVGD 60
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYFNSQAY 292
P+ G+ W PM L +YS G AE+ + G + +FDSGS+YT+ +Q Y
Sbjct: 61 FNPPTRGVTWVPMRESLF--YYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHVPAQIY 118
Query: 293 KTTLDLMR 300
+ +R
Sbjct: 119 NEIVSKVR 126
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 123/301 (40%), Gaps = 39/301 (12%)
Query: 42 AHRFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT 99
R + P+ GN P G Y + IG P K Y + +DTGSD+ WV C C C
Sbjct: 66 GRRLLTAVDLPLGGNGIPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNC-ISCDSC- 123
Query: 100 LPPES-------LYHP----KNNLVACNDPFC-SAFHLPENIRCEANDQCDYEVLYADHG 147
P +S LY P + V C FC +A + C AN C Y + Y D
Sbjct: 124 -PRKSGLGIDLTLYDPTASASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGS 182
Query: 148 SSLGVLVTDHFPLRLTNG----SLLGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKAS 202
S+ G V D +G +L + FGCG G G+LG G +S
Sbjct: 183 STTGFFVADFLQYDQVSGDGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSS 242
Query: 203 ILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAE 261
+LSQL S G + HCL +V GGG +G+ + P + TP+ + HY+
Sbjct: 243 MLSQLTSAGKVTKIFSHCLDTVNGGGIFAIGNVVQPK--VKTTPLVPGM--PHYNVVLKT 298
Query: 262 LLFGGKS---------TGIKGLQIIFDSGSSYTYFNSQAYKTTLDLM---RKDLKGKPLE 309
+ GG + G I DSG++ Y YK L + D+ K ++
Sbjct: 299 IDVGGSTLQLPTNIFDIGGGSRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQ 358
Query: 310 D 310
D
Sbjct: 359 D 359
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 128/287 (44%), Gaps = 29/287 (10%)
Query: 33 KKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
+ + Q T + P+ G V GY+ TL +G P K + + +DTGS +T+V C+
Sbjct: 48 QARDFQPTFRRSLLRNSTMPLHGAVKDYGYFYATLYLGTPAKKFAVIVDTGSTMTYVPCS 107
Query: 93 APCTGC-------TLPPESLYHPKNNLVACNDPFCSAFHLPENIRCE-ANDQCDYEVLYA 144
+ +GC PE+ + ++C P CS + RC + QC Y YA
Sbjct: 108 SCGSGCGPNHQDAAFDPEA--SSTASRISCTSPKCSC----GSPRCGCSTQQCTYTRSYA 161
Query: 145 DHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASI 203
+ SS G+L+ D L + L G +IFGC R G A G+ GLG AS+
Sbjct: 162 EQSSSSGILLEDVLAL---HDGLPGAPIIFGC--ETRETGEIFRQRADGLFGLGNSDASV 216
Query: 204 LSQLQSLGLTRNVLGHCLS-VRGGGYLFLGHDLVPSS-GIAWTPMSRDLLEKHYS----- 256
++QL G+ +V C V G G L LG VP S + +TP+ Y
Sbjct: 217 VNQLVKAGVIDDVFSLCFGMVEGDGALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKML 276
Query: 257 --SGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRK 301
+ +LL +S +G + DSG+++TY S +K + K
Sbjct: 277 SLAVEGQLLPVSQSLFDQGYGTVLDSGTTFTYMPSPVFKAFAGAVEK 323
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 90/315 (28%), Positives = 139/315 (44%), Gaps = 51/315 (16%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
++ Q + +H +TA P+ ++ P GYY+ + IG PP+ + L +DTGS LT+V C+
Sbjct: 64 RRHLQRSESHS-TATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST 122
Query: 94 PCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVL-------YADH 146
C C + + P S+ + P ++C CD E++ YA+
Sbjct: 123 -CEQCGKHQDPNFQPD----------WSSTYQP--LKCSMECTCDSEMMHCVYDRQYAEM 169
Query: 147 GSSLGVLVTDHFPLRLTNGSLLGP-RLIFGCG-------YNQRNPGPKPPPTAGVLGLGL 198
SS GVL D + S L P R +FGC Y+QR G++GLG
Sbjct: 170 SSSSGVLGED--IVSFGKQSELKPQRTVFGCENVETGDIYSQR--------ADGIMGLGR 219
Query: 199 GKASILSQLQSLGLTRNVLGHC---LSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHY 255
G SI+ QL G+ N C + V GGG + LG + P +G+ +T S +Y
Sbjct: 220 GDLSIVDQLVEKGVIGNSFSLCYGGMDV-GGGAMVLG-GISPPAGMVFT-HSDPARSAYY 276
Query: 256 SSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLE 309
+ E+ GK I + I DSG++Y Y A+K D + K+L L
Sbjct: 277 NIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLI 336
Query: 310 DTAEEKALPVCWKGT 324
+ +C+ G
Sbjct: 337 QGPDRNYNDICFSGV 351
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 77/280 (27%), Positives = 127/280 (45%), Gaps = 32/280 (11%)
Query: 44 RFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC----- 98
RF FP+ GN LG Y + +GNP + ++ +DTGSD+ WV+C +PC C
Sbjct: 64 RFLQGISFPLKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQD 122
Query: 99 TLPPESLYH----PKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLV 154
+PP S+Y+ +++ +C+DP C+ + R +N C Y + Y D +S+G V
Sbjct: 123 IIPPLSIYNLSASSTSSVSSCSDPLCTGEQAVCS-RSGSNSACAYGISYQDKSTSIGAYV 181
Query: 155 TDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTR 214
D L G+ + FGC N P G++G G ++ +Q+ +
Sbjct: 182 KDDMHYVLQGGNATTSHIFFGCAINITGSWPAD----GIMGFGQISKTVPNQIATQRNMS 237
Query: 215 NVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPM-------SRDLLEKHYSS-----GPA 260
V HCL GGG L G + ++ + +TP+ + DLL +S
Sbjct: 238 RVFSHCLGGEKHGGGILEFGEE-PNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDSK 296
Query: 261 ELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMR 300
E + ST G +I DSG+S+ ++A + ++
Sbjct: 297 EFSYVSNSTNETG--VIIDSGTSFALLATKANRILFSEIK 334
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 90/315 (28%), Positives = 139/315 (44%), Gaps = 51/315 (16%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
++ Q + +H +TA P+ ++ P GYY+ + IG PP+ + L +DTGS LT+V C+
Sbjct: 64 RRHLQRSESHS-TATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST 122
Query: 94 PCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVL-------YADH 146
C C + + P S+ + P ++C CD E++ YA+
Sbjct: 123 -CEQCGKHQDPNFQPD----------WSSTYQP--LKCSMECTCDSEMMHCVYDRQYAEM 169
Query: 147 GSSLGVLVTDHFPLRLTNGSLLGP-RLIFGCG-------YNQRNPGPKPPPTAGVLGLGL 198
SS GVL D + S L P R +FGC Y+QR G++GLG
Sbjct: 170 SSSSGVLGED--IVSFGKQSELKPQRTVFGCENVETGDIYSQR--------ADGIMGLGR 219
Query: 199 GKASILSQLQSLGLTRNVLGHC---LSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHY 255
G SI+ QL G+ N C + V GGG + LG + P +G+ +T S +Y
Sbjct: 220 GDLSIVDQLVEKGVIGNSFSLCYGGMDV-GGGAMVLG-GISPPAGMVFT-HSDPARSAYY 276
Query: 256 SSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLE 309
+ E+ GK I + I DSG++Y Y A+K D + K+L L
Sbjct: 277 NIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLI 336
Query: 310 DTAEEKALPVCWKGT 324
+ +C+ G
Sbjct: 337 QGPDRNYNDICFSGV 351
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 148/355 (41%), Gaps = 59/355 (16%)
Query: 4 KGKRVMGLLVLLMFATFQGCFSEANQPPSKKK--------------STQSTAAHRFGST- 48
+ +L++L+FA GC S ++K + + A+R G
Sbjct: 6 RASSFFSVLLVLLFALSVGCASATGVFQVRRKFPRHGGRGVAEHLAALRRHDANRHGRLL 65
Query: 49 -AVFPITGNV---YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES 104
AV G V G Y ++IG+PPK Y + +DTGSD+ WV C C GC P S
Sbjct: 66 GAVDLALGGVGLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGC--PTRS 122
Query: 105 -------LYHPKNN--LVACNDPFC---SAFHLPENIRCEANDQ-CDYEVLYADHGSSLG 151
Y P + V C FC SA +P C + C + + Y D ++ G
Sbjct: 123 GLGIELTQYDPAGSGTTVGCEQEFCVANSAGGVPPT--CPSTSSPCQFRITYGDGSTTTG 180
Query: 152 VLVTDHFPLRLTNG----SLLGPRLIFGCGYN-QRNPGPKPPPTAGVLGLGLGKASILSQ 206
VTD +G + + FGCG + G G+LG G +S+LSQ
Sbjct: 181 FYVTDFVQYNQVSGNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQ 240
Query: 207 LQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG 265
L + R + HCL +VRGGG +G+ + P + TP+ ++ HY+ + G
Sbjct: 241 LAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPK--VKTTPLVPNV--THYNVNLQGISVG 296
Query: 266 GKSTGI----------KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
G + + KG I DSG++ Y + Y+T L + + PL +
Sbjct: 297 GATLQLPTSTFDSGDSKG--TIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHN 349
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 148/355 (41%), Gaps = 59/355 (16%)
Query: 4 KGKRVMGLLVLLMFATFQGCFSEANQPPSKKK--------------STQSTAAHRFGST- 48
+ +L++L+FA GC S ++K + + A+R G
Sbjct: 6 RASSFFSVLLVLLFALSVGCASATGVFQVRRKFPRHGGRGVAEHLAALRRHDANRHGRLL 65
Query: 49 -AVFPITGNV---YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES 104
AV G V G Y ++IG+PPK Y + +DTGSD+ WV C C GC P S
Sbjct: 66 GAVDLALGGVGLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGC--PTRS 122
Query: 105 -------LYHPKNN--LVACNDPFC---SAFHLPENIRCEANDQ-CDYEVLYADHGSSLG 151
Y P + V C FC SA +P C + C + + Y D ++ G
Sbjct: 123 GLGIELTQYDPAGSGTTVGCEQEFCVANSAGGVPPT--CPSTSSPCQFRITYGDGSTTTG 180
Query: 152 VLVTDHFPLRLTNG----SLLGPRLIFGCGYN-QRNPGPKPPPTAGVLGLGLGKASILSQ 206
VTD +G + + FGCG + G G+LG G +S+LSQ
Sbjct: 181 FYVTDFVQYNQVSGNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQ 240
Query: 207 LQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG 265
L + R + HCL +VRGGG +G+ + P + TP+ ++ HY+ + G
Sbjct: 241 LAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPK--VKTTPLVPNV--THYNVNLQGISVG 296
Query: 266 GKSTGI----------KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
G + + KG I DSG++ Y + Y+T L + + PL +
Sbjct: 297 GATLQLPTSTFDSGDSKG--TIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHN 349
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 111/254 (43%), Gaps = 26/254 (10%)
Query: 54 TGNVYPLGYYSVTL-KIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES------LY 106
T V LG+ L +G P + + + +DTGSDL W+ C C GCT P + Y
Sbjct: 106 TLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPASAASGSASFY 163
Query: 107 HPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLY--ADHGSSLGVLVTDHFPL 160
P + V CN FC C QC Y+++Y AD SS G LV D L
Sbjct: 164 IPSMSSTSQAVPCNSQFCEL-----RKECSTTSQCPYKMVYVSADTSSS-GFLVEDVLYL 217
Query: 161 RLTNG--SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
+ +L +++FGCG Q G+ GLG+ SI S L GLT N
Sbjct: 218 STEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFA 277
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIF 278
C S G G + G SS TP+ + Y+ +E+ G T ++ IF
Sbjct: 278 MCFSRDGIGRISFGDQ--GSSDQEETPLDVNPQHPTYTISISEMTVGNSLTDLE-FSTIF 334
Query: 279 DSGSSYTYFNSQAY 292
D+G+S+TY AY
Sbjct: 335 DTGTSFTYLADPAY 348
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/289 (28%), Positives = 124/289 (42%), Gaps = 32/289 (11%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWV 89
S+ + G F ++G P +G Y +++GNPPK + + IDTGSD+ WV
Sbjct: 50 SRDRVRHGRMLQSSGGVIDFSVSGTYDPFLVGLYYTRVQLGNPPKDFYVQIDTGSDVLWV 109
Query: 90 QCNAPCTGCTLP-----PESLYHPKN----NLVACNDPFCSAFHLPENIRC-EANDQCDY 139
CN+ C GC P + + P + +LV+C+D C+ + C ++QC Y
Sbjct: 110 SCNS-CNGCPATSGLQIPLNFFDPGSSTTASLVSCSDQICALGVQSSDSACFGQSNQCAY 168
Query: 140 EVLYADHGSSLGVLVTDHFPLRLT----NGSLLGPRLIFGCGYNQRNPGPKPPPTA-GVL 194
Y D + G V D L + S ++FGC +Q K G+
Sbjct: 169 VFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIF 228
Query: 195 GLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLE 252
G G S++SQL S G+ V HCL GGG L LG + P+ + +TP+ +
Sbjct: 229 GFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGGGILVLGEIVEPN--VVYTPLVPS--Q 284
Query: 253 KHYSSGPAELLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYK 293
HY+ + G+ I I DSG++ Y +AY
Sbjct: 285 PHYNLNLQSISVNGQVLPISPAVFATSSSQGTIIDSGTTLAYLAEEAYN 333
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 111/254 (43%), Gaps = 26/254 (10%)
Query: 54 TGNVYPLGYYSVTL-KIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES------LY 106
T V LG+ L +G P + + + +DTGSDL W+ C C GCT P + Y
Sbjct: 106 TLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPASAASGSASFY 163
Query: 107 HPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLY--ADHGSSLGVLVTDHFPL 160
P + V CN FC C QC Y+++Y AD SS G LV D L
Sbjct: 164 IPSMSSTSQAVPCNSQFCEL-----RKECSTTSQCPYKMVYVSADTSSS-GFLVEDVLYL 217
Query: 161 RLTNG--SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
+ +L +++FGCG Q G+ GLG+ SI S L GLT N
Sbjct: 218 STEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFA 277
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIF 278
C S G G + G SS TP+ + Y+ +E+ G T ++ IF
Sbjct: 278 MCFSRDGIGRISFGDQ--GSSDQEETPLDVNPQHPTYTISISEITVGNSLTDLE-FSTIF 334
Query: 279 DSGSSYTYFNSQAY 292
D+G+S+TY AY
Sbjct: 335 DTGTSFTYLADPAY 348
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 111/254 (43%), Gaps = 26/254 (10%)
Query: 54 TGNVYPLGYYSVTL-KIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES------LY 106
T V LG+ L +G P + + + +DTGSDL W+ C C GCT P + Y
Sbjct: 106 TLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPASAASGSASFY 163
Query: 107 HPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLY--ADHGSSLGVLVTDHFPL 160
P + V CN FC C QC Y+++Y AD SS G LV D L
Sbjct: 164 IPSMSSTSQAVPCNSQFCEL-----RKECSTTSQCPYKMVYVSADTSSS-GFLVEDVLYL 217
Query: 161 RLTNG--SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
+ +L +++FGCG Q G+ GLG+ SI S L GLT N
Sbjct: 218 STEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFA 277
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIF 278
C S G G + G SS TP+ + Y+ +E+ G T ++ IF
Sbjct: 278 MCFSRDGIGRISFGDQ--GSSDQEETPLDVNPQHPTYTISISEITVGNSLTDLE-FSTIF 334
Query: 279 DSGSSYTYFNSQAY 292
D+G+S+TY AY
Sbjct: 335 DTGTSFTYLADPAY 348
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 119/284 (41%), Gaps = 30/284 (10%)
Query: 41 AAHRFGSTAVFPITGNVYPLGYYS----VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT 96
AA S F Y +G + + +G PP + + +DTGSDL W+ CN CT
Sbjct: 76 AAAVHHSPLTFVPANETYQIGAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCN--CT 133
Query: 97 GCTLPPES--------LYHPK----NNLVACNDPFCSAFHLPENIRCEANDQ-CDYEVLY 143
C ES +Y K + V CN C +C ++D C YEV Y
Sbjct: 134 KCVRGVESNGEKIAFNIYDLKGSSTSQTVLCNSNLCEL-----QRQCPSSDSICPYEVNY 188
Query: 144 ADHG-SSLGVLVTDHFPLRLTNGSL--LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGK 200
+G S+ G LV D L + R+ FGCG Q G+ GLG+G
Sbjct: 189 LSNGTSTTGFLVEDVLHLITDDDETKDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMGN 248
Query: 201 ASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPA 260
S+ S L GLT N C G G + G + G TP + L Y+
Sbjct: 249 ESVPSILAKEGLTSNSFSMCFGSDGLGRITFGDNSSLVQG--KTPFNLRALHPTYNITVT 306
Query: 261 ELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
+++ GG + ++ IFDSG+S+T+ N AYK + +K
Sbjct: 307 QIIVGGNAADLE-FHAIFDSGTSFTHLNDPAYKQITNSFNSAIK 349
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 131/306 (42%), Gaps = 43/306 (14%)
Query: 51 FPITGNV--YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES--LY 106
FP+ G Y +G Y + +G+PPK + + IDTGSD+ WV C + C GC P+S L+
Sbjct: 69 FPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGC---PQSSGLH 124
Query: 107 HPKN----------NLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVT 155
P N +L++C+D CS + C + +QC Y Y D + G V+
Sbjct: 125 IPLNFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVS 184
Query: 156 DHFPLRLTNGSLL---GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLG 211
D GS + ++FGC +Q K G+ G G S++SQ+ S G
Sbjct: 185 DLLNFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQG 244
Query: 212 LTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI 271
+T V HCL GGG L + I ++P+ + HY+ + GKS I
Sbjct: 245 ITPKVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPS--QPHYNLNLQSISVNGKSLAI 302
Query: 272 K--------GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
I DSG++ Y +AY D + + + P+ KG
Sbjct: 303 DPEVFATSTNRGTIVDSGTTLAYLAEEAY---------DPFVSAITEAVSQSVRPLLSKG 353
Query: 324 TWKCLL 329
T +C L
Sbjct: 354 T-QCYL 358
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 131/306 (42%), Gaps = 43/306 (14%)
Query: 51 FPITGNV--YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES--LY 106
FP+ G Y +G Y + +G+PPK + + IDTGSD+ WV C + C GC P+S L+
Sbjct: 54 FPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGC---PQSSGLH 109
Query: 107 HPKN----------NLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVT 155
P N +L++C+D CS + C + +QC Y Y D + G V+
Sbjct: 110 IPLNFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVS 169
Query: 156 DHFPLRLTNGSLL---GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLG 211
D GS + ++FGC +Q K G+ G G S++SQ+ S G
Sbjct: 170 DLLNFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQG 229
Query: 212 LTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI 271
+T V HCL GGG L + I ++P+ + HY+ + GKS I
Sbjct: 230 ITPKVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPS--QPHYNLNLQSISVNGKSLAI 287
Query: 272 K--------GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
I DSG++ Y +AY D + + + P+ KG
Sbjct: 288 DPEVFATSTNRGTIVDSGTTLAYLAEEAY---------DPFVSAITEAVSQSVRPLLSKG 338
Query: 324 TWKCLL 329
T +C L
Sbjct: 339 T-QCYL 343
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/289 (26%), Positives = 126/289 (43%), Gaps = 24/289 (8%)
Query: 44 RFGSTAV-FPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTL 100
RF + V F + G PL G Y + +GNP K Y + +DTGSD+ WV C PC+GC
Sbjct: 7 RFLAEGVDFSLGGTADPLSGGLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCR-PCSGCPR 65
Query: 101 P-----PESLYHPKN----NLVACNDPFCSAFHLPENIRC-EANDQCDYEVLYADHGSSL 150
P ++Y P+ +LV+C+DP C +C + + C+Y Y D +S
Sbjct: 66 KSALNIPLTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSE 125
Query: 151 GVLVTDHFPLRLTNGSLLG---PRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQ 206
G V D + + + L +++FGC Q + G++G G + S+ +Q
Sbjct: 126 GYYVRDAMQYNVISSNGLANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQ 185
Query: 207 LQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRD-----LLEKHYSSGPAE 261
L + V HCL G L + G+ +TP+ D ++ + S
Sbjct: 186 LAAQQNIPRVFSHCLEGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNR 245
Query: 262 LLFGGKS-TGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLE 309
L + + +I DSG++ YF S AY + +R+ P+
Sbjct: 246 LPIDAEDFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVR 294
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 111/266 (41%), Gaps = 31/266 (11%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHP 108
TG +G Y + IG P K Y L +DTG+D+ WV C C C +LY+
Sbjct: 64 TGRPDSVGLYYAKIGIGTPSKDYYLQVDTGTDMMWVNC-IQCKECPTRSNLGMDLTLYNI 122
Query: 109 KNN----LVACNDPFCSAFH--LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRL 162
K + LV C+ C + L + ND C Y +Y D S+ G V D
Sbjct: 123 KESSSGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQ 182
Query: 163 TNGSL----LGPRLIFGCGYNQRNPGPKPPPTA--GVLGLGLGKASILSQLQSLGLTRNV 216
+G L +IFGCG Q A G+LG G S++SQL S G + +
Sbjct: 183 VSGDLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKM 242
Query: 217 LGHCLS-VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG--------GK 267
HCL+ V GGG +GH + P+ + TP+ D + HYS + G
Sbjct: 243 FAHCLNGVNGGGIFAIGHVVQPT--VNTTPLLPD--QPHYSVNMTAIQVGHTFLNLSTDA 298
Query: 268 STGIKGLQIIFDSGSSYTYFNSQAYK 293
S I DSG++ Y Y+
Sbjct: 299 SEQRDSKGTIIDSGTTLAYLPDGIYQ 324
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 127/287 (44%), Gaps = 24/287 (8%)
Query: 41 AAHRFGSTAVFPITGNVYP-LGYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGC 98
A H F + V P G Y +T +G PP K+Y + DTGSD+ W+QC PC C
Sbjct: 64 ANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGI-ADTGSDIVWLQCE-PCEQC 121
Query: 99 TLPPESLYHPKNNLVACNDPFCSAF-HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDH 157
+++P + N P S H + C + C Y++ Y D S G L D
Sbjct: 122 YNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDT 181
Query: 158 FPLRLTNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLG---- 211
L T+GS + P+++ GCG + N G ++G++GLG G S+++QL S+G
Sbjct: 182 LSLESTSGSPVSFPKIVIGCGTD--NAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFS 239
Query: 212 ------LTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG 265
L + + G + G +V + I P+ L + +S G + FG
Sbjct: 240 YCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFG 299
Query: 266 GKSTGIKGL-QIIFDSGSSYTYFNSQAY----KTTLDLMRKDLKGKP 307
G S G II DSG++ T S Y +DL++ D P
Sbjct: 300 GSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDP 346
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 129/292 (44%), Gaps = 38/292 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC--TGCTLPPESLYHPKNNL----VACN 116
Y VT+ IG PP+ + + DTGSDLTWVQC PC + C E L+ P + V C+
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQC-LPCPDSSCYPQQEPLFDPSKSSTYVDVPCS 180
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR---LI 173
P C + + RC A C+Y V Y D + G L + F L+ S L P ++
Sbjct: 181 APECHIGGV-QQTRCGAT-SCEYSVKYGDESETHGSLAEETF--TLSPPSPLAPAATGVV 236
Query: 174 FGCG------YNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLSVRGG 226
FGC +N G AG+LGLG G +SILSQ +S+ V +CL RG
Sbjct: 237 FGCSHEYISVFNDTGMG-----VAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGS 291
Query: 227 --GYLFLGHDLVPS----SGIAWTPMSRDL--LEKHYSSGPAELLFGGKSTGIKG----L 274
GYL +G S +++TP+ + L Y A + G + I L
Sbjct: 292 STGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL 351
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWK 326
+ DSG+ T+ + AY D R + + K L C+ T +
Sbjct: 352 GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQ 403
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 81/292 (27%), Positives = 130/292 (44%), Gaps = 35/292 (11%)
Query: 58 YPLGYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---- 112
Y YY ++ IG PP +LY + +DTGSD W QC PC C +++P +
Sbjct: 85 YAGSYYVMSYSIGTPPFQLYGV-VDTGSDGIWFQC-KPCKPCLNQTSPIFNPSKSSTYKN 142
Query: 113 VACNDPFCSAFHLPENIRCEAN--DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG- 169
+ C+ P C E RC +N +C+YE+ Y D S G + D L +GS +
Sbjct: 143 IRCSSPICKR---GEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISF 199
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLS-----V 223
P+++ GCG+ +N +G++G G G SI+SQL S+G +CL+
Sbjct: 200 PKIVIGCGH--KNSLTTEGLASGIIGFGRGNFSIVSQLGSSIG---GKFSYCLASLFSKA 254
Query: 224 RGGGYLFLGH-DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQII----- 277
L+ G +V G+ TP+ + +Y + G +K +I
Sbjct: 255 NISSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEG 314
Query: 278 ---FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWK 326
DSGS+ T + Y + +K K ++D ++ L +C+K T K
Sbjct: 315 NAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQ--LSLCYKTTLK 364
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 74/247 (29%), Positives = 106/247 (42%), Gaps = 23/247 (9%)
Query: 59 PLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP------ESLYHP---- 108
P + + +G P + + + +DTGSDL W+ C C GCT P + Y P
Sbjct: 3 PSSLHYALVTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFYIPGMSS 60
Query: 109 KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHG-SSLGVLVTDHFPLRLTNG-- 165
+ V CN FC C QC Y+++Y G SS G LV D L N
Sbjct: 61 TSKAVPCNSNFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHP 115
Query: 166 SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG 225
+L +++ GCG Q G+ GLG+ + S+ S L GLT N C G
Sbjct: 116 QILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDG 175
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYT 285
G + G SS TP+ + Y+ + + G K T + + IFD+G+S+T
Sbjct: 176 IGRISFGDQ--ESSDQEETPLDINRQHPTYAITISGITVGNKPTDMDFI-TIFDTGTSFT 232
Query: 286 YFNSQAY 292
Y AY
Sbjct: 233 YLADPAY 239
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 78/253 (30%), Positives = 109/253 (43%), Gaps = 24/253 (9%)
Query: 54 TGNVYPLGYYSVTL-KIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP------ESLY 106
T V LG+ L +G P + + + +DTGSDL W+ C C GCT P + Y
Sbjct: 99 TLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFY 156
Query: 107 HP----KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHG-SSLGVLVTDHFPLR 161
P + V CN FC C QC Y+++Y G SS G LV D L
Sbjct: 157 IPGMSSTSKAVPCNSNFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLS 211
Query: 162 LTNG--SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGH 219
N +L +++ GCG Q G+ GLG+ + S+ S L GLT N
Sbjct: 212 TENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSM 271
Query: 220 CLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFD 279
C G G + G SS TP+ + Y+ + + G K T + + IFD
Sbjct: 272 CFGRDGIGRISFGDQ--ESSDQEETPLDINRQHPTYAITISGITVGNKPTDMDFI-TIFD 328
Query: 280 SGSSYTYFNSQAY 292
+G+S+TY AY
Sbjct: 329 TGTSFTYLADPAY 341
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 78/253 (30%), Positives = 110/253 (43%), Gaps = 24/253 (9%)
Query: 54 TGNVYPLGYYSVTL-KIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP------ESLY 106
T V LG+ L +G P + + + +DTGSDL W+ C C GCT P + Y
Sbjct: 98 TLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFY 155
Query: 107 HP----KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHG-SSLGVLVTDHFPLR 161
P + V CN FC C QC Y+++Y G SS G LV D L
Sbjct: 156 IPGMSSTSKAVPCNSNFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLS 210
Query: 162 LTNG--SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGH 219
N +L +++ GCG Q G+ GLG+ + S+ S L GLT N
Sbjct: 211 TENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSM 270
Query: 220 CLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFD 279
C G G + G SS TP++ + Y+ + + G K T + + IFD
Sbjct: 271 CFGRDGIGRISFGDQ--GSSDQEETPLNINQQHPTYAITISGITIGNKPTDLDFI-TIFD 327
Query: 280 SGSSYTYFNSQAY 292
+G+S+TY AY
Sbjct: 328 TGTSFTYLADPAY 340
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 78/253 (30%), Positives = 109/253 (43%), Gaps = 24/253 (9%)
Query: 54 TGNVYPLGYYSVTL-KIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP------ESLY 106
T V LG+ L +G P + + + +DTGSDL W+ C C GCT P + Y
Sbjct: 99 TLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFY 156
Query: 107 HP----KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHG-SSLGVLVTDHFPLR 161
P + V CN FC C QC Y+++Y G SS G LV D L
Sbjct: 157 IPGMSSTSKAVPCNSNFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLS 211
Query: 162 LTNG--SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGH 219
N +L +++ GCG Q G+ GLG+ + S+ S L GLT N
Sbjct: 212 TENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSM 271
Query: 220 CLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFD 279
C G G + G SS TP+ + Y+ + + G K T + + IFD
Sbjct: 272 CFGRDGIGRISFGDQ--ESSDQEETPLDINRQHPTYAITISGITVGNKPTDMDFI-TIFD 328
Query: 280 SGSSYTYFNSQAY 292
+G+S+TY AY
Sbjct: 329 TGTSFTYLADPAY 341
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 91/312 (29%), Positives = 136/312 (43%), Gaps = 39/312 (12%)
Query: 31 PSKKKSTQSTAAH--------RFGSTAVFP--ITGNVYP-LGYYSVTLKIGNPPKLYELD 79
PSK ++ + T A RF +A+ I + P G Y + L IG PP
Sbjct: 49 PSKTRTERLTDAFHRSASRVGRFRQSAMTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIAI 108
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV----ACNDPFCSAFHLPENIRCEAND 135
+DTGSDLTW QC PCT C + PKN+ +C FC A L + C
Sbjct: 109 VDTGSDLTWTQCR-PCTHCYKQVVPFFDPKNSSTYRDSSCGTSFCLA--LGNDRSCRNGK 165
Query: 136 QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVL 194
+C + YAD + G L + + T G + P FGC + R+ G ++G++
Sbjct: 166 KCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFGCVH--RSGGIFDEHSSGIV 223
Query: 195 GLGLGKASILSQLQSLGLTRNVLGHCL------SVRGGGYLFLGHDLVPSSGIAWTPMSR 248
GLG+ + S++SQL+S R +CL S F +V +G TP+
Sbjct: 224 GLGVAELSMISQLKSTINGR--FSYCLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVM 281
Query: 249 DLLEKHY--------SSGPAELLFGG--KSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDL 298
+ +Y S G L + G K ++ II DSG++YTY + Y +
Sbjct: 282 KGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEES 341
Query: 299 MRKDLKGKPLED 310
+ +KGK + D
Sbjct: 342 VAHSIKGKRVRD 353
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 124/291 (42%), Gaps = 34/291 (11%)
Query: 46 GSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
G FP+ G P +G Y +K+G PP+ + + IDTGSD+ WV C + C GC E
Sbjct: 65 GGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSE 123
Query: 104 -----SLYHP----KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLV 154
S + P +LV+C+D C + E+ C N+ C Y Y D + G +
Sbjct: 124 LQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTES-GCSPNNLCSYSFKYGDGSGTSGFYI 182
Query: 155 TDHFPLRLTNGSLLGPR----LIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQS 209
+D S L +FGC Q +P G+ GLG G S++SQL
Sbjct: 183 SDFMSFDTVITSTLAINSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAV 242
Query: 210 LGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK 267
GL V HCL GGG + LG P + +TP+ + HY+ + G+
Sbjct: 243 QGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDT--VYTPLVPS--QPHYNVNLQSIAVNGQ 298
Query: 268 STGIK--------GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK--GKPL 308
I G I D+G++ Y +AY + + + G+P+
Sbjct: 299 ILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPI 349
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 126/287 (43%), Gaps = 24/287 (8%)
Query: 41 AAHRFGSTAVFPITGNVYP-LGYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGC 98
A H F + V P G Y +T +G PP K+Y + DTGSD+ W+QC PC C
Sbjct: 64 ANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGI-ADTGSDIVWLQCE-PCEQC 121
Query: 99 TLPPESLYHPKNNLVACNDPFCSAF-HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDH 157
+++P + N P S H + C + C Y++ Y D S G L D
Sbjct: 122 YNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDT 181
Query: 158 FPLRLTNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLG---- 211
L T+GS + P+ + GCG + N G ++G++GLG G S+++QL S+G
Sbjct: 182 LSLESTSGSPVSFPKTVIGCGTD--NAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFS 239
Query: 212 ------LTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG 265
L + + G + G +V + I P+ L + +S G + FG
Sbjct: 240 YCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFG 299
Query: 266 GKSTGIKGL-QIIFDSGSSYTYFNSQAY----KTTLDLMRKDLKGKP 307
G S G II DSG++ T S Y +DL++ D P
Sbjct: 300 GSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDP 346
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 141/323 (43%), Gaps = 61/323 (18%)
Query: 43 HRFGSTAVFPI------TGNVYP-----LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
+R G+ AV + T N+ G + + L IGNP Y +DTGSDL W QC
Sbjct: 76 NRLGAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQC 135
Query: 92 NAPCTGCTLPPESLYHPKN----NLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHG 147
PCT C P ++ P+ + V C+ C+A LP + E D C+Y Y D+
Sbjct: 136 K-PCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNA--LPRSNCNEDKDACEYLYTYGDYS 192
Query: 148 SSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
S+ G+L T+ F N S+ G + FGCG N G +G++GLG G S++SQL
Sbjct: 193 STRGLLATETFTFEDEN-SISG--IGFGCGV--ENEGDGFSQGSGLVGLGRGPLSLISQL 247
Query: 208 QSLGLTRNVLGHCLS----VRGGGYLFLG---HDLVPSSGIAW---TPMSRDLLEKHYSS 257
+ + +CL+ LF+G +V +G + + LL
Sbjct: 248 KETKFS-----YCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQP 302
Query: 258 GPAELLFGGKSTGIKGLQI---------------IFDSGSSYTYFNSQAYKTTLDLMRKD 302
L G + G K L + I DSG++ TY A+K +++++
Sbjct: 303 SFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFK----VLKEE 358
Query: 303 LKGK---PLEDTAEEKALPVCWK 322
+ P++D+ L +C+K
Sbjct: 359 FTSRMSLPVDDSG-STGLDLCFK 380
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 117/267 (43%), Gaps = 21/267 (7%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP-----PESLYHPK----NNLV 113
Y + +GNP K Y + +DTGSD+ WV C PC+GC P ++Y P+ +LV
Sbjct: 2 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCR-PCSGCPRKSALNIPLTMYDPRESSTTSLV 60
Query: 114 ACNDPFCSAFHLPENIRC-EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG--- 169
+C+DP C +C +A + C+Y Y D +S G V D + + + L
Sbjct: 61 SCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTT 120
Query: 170 PRLIFGCGYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY 228
+++FGC Q + G++G G + S+ +QL + V HCL G
Sbjct: 121 SQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGG 180
Query: 229 LFLGHDLVPSSGIAWTPMSRD-----LLEKHYSSGPAELLFGGKS-TGIKGLQIIFDSGS 282
L + G+ +TP+ D ++ + S L + + +I DSG+
Sbjct: 181 GILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGT 240
Query: 283 SYTYFNSQAYKTTLDLMRKDLKGKPLE 309
+ YF S AY + +R+ P+
Sbjct: 241 TLAYFPSGAYNVFVQAIREATSATPVR 267
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 124/291 (42%), Gaps = 34/291 (11%)
Query: 46 GSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
G FP+ G P +G Y +K+G PP+ + + IDTGSD+ WV C + C GC E
Sbjct: 65 GGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSE 123
Query: 104 -----SLYHP----KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLV 154
S + P +LV+C+D C + E+ C N+ C Y Y D + G +
Sbjct: 124 LQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTES-GCSPNNLCSYSFKYGDGSGTSGYYI 182
Query: 155 TDHFPLRLTNGSLLGPR----LIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQS 209
+D S L +FGC Q +P G+ GLG G S++SQL
Sbjct: 183 SDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAV 242
Query: 210 LGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK 267
GL V HCL GGG + LG P + +TP+ + HY+ + G+
Sbjct: 243 QGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDT--VYTPLVPS--QPHYNVNLQSIAVNGQ 298
Query: 268 STGIK--------GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK--GKPL 308
I G I D+G++ Y +AY + + + G+P+
Sbjct: 299 ILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPI 349
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 120/281 (42%), Gaps = 39/281 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------LYHPKNN-- 111
G Y ++IG+PPK Y + +DTGSD+ WV C GC P S Y P +
Sbjct: 83 GLYYTRIEIGSPPKGYYVQVDTGSDILWVN-GISCDGC--PTRSGLGIELTQYDPAGSGT 139
Query: 112 LVACNDPFCSAFHLPENI--RC-EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG--- 165
V C FC A + C A C + + Y D S+ G VTD +G
Sbjct: 140 TVGCEQEFCVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQ 199
Query: 166 -SLLGPRLIFGCGYN-QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-S 222
+ + FGCG + G G+LG G AS+LSQL + R + HCL +
Sbjct: 200 TTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDT 259
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI----------K 272
VRGGG +G+ + P + TP+ + HY+ + GG + + K
Sbjct: 260 VRGGGIFAIGNVVQPPI-VKTTPLVPN--ATHYNVNLQGISVGGATLQLPTSTFDSGDSK 316
Query: 273 GLQIIFDSGSSYTYFNSQAYKTTLDLM---RKDLKGKPLED 310
G I DSG++ Y + Y+T L + DL + ED
Sbjct: 317 G--TIIDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYED 355
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 127/284 (44%), Gaps = 32/284 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV----ACN 116
G Y ++L +G PP DTGSDL W QC PC C + L+ PK++ +C+
Sbjct: 93 GEYLMSLSLGTPPFKIMGIADTGSDLIWTQCK-PCERCYKQVDPLFDPKSSKTYRDFSCD 151
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFG 175
CS L + C N C Y+ Y D ++G + +D L T GS + P+ + G
Sbjct: 152 ARQCS---LLDQSTCSGN-ICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIG 207
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHC---LSVRGGGYL-- 229
CG+ N G +G++GLG G S++SQ+ S+G +C LS R G
Sbjct: 208 CGH--ENDGTFSDKGSGIVGLGAGPLSLISQMGSSVG---GKFSYCLVPLSSRAGNSSKL 262
Query: 230 -FLGHDLVPSSGIAWTP-MSRDLLEKHY-------SSGPAELLFGGKSTGIKGLQIIFDS 280
F + +V G+ TP +S + + Y S G + FG S G II DS
Sbjct: 263 NFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDS 322
Query: 281 GSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
G++ T + + ++G+ ED + L VC+ T
Sbjct: 323 GTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPS--GFLSVCYSAT 364
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 129/295 (43%), Gaps = 54/295 (18%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + L IG PP Y +DTGSDL W QC APC C P + P + LV C
Sbjct: 90 GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCR 148
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGPRLIFG 175
P C+A P C C Y+ Y D S+ GVL ++ F N S ++ + FG
Sbjct: 149 SPLCAALPYPA---CFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFG 205
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS---------VRGG 226
CG N G + ++G++GLG G S++SQ LG +R +CL+ + G
Sbjct: 206 CG--NINSG-QLANSSGMVGLGRGPLSLVSQ---LGPSR--FSYCLTSFLSPEPSRLNFG 257
Query: 227 GYLFLGHDLVPSSG--IAWTPMSRD---------------LLEKHYSSGPAELLFGGKST 269
+ L SSG + TP+ + L +K P L+F
Sbjct: 258 VFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDP--LVFAINDD 315
Query: 270 GIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKG--KPLEDTAE-EKALPVCW 321
G G + DSG+S T+ AY D +R++L +PL T + E L C+
Sbjct: 316 GTGG--VFIDSGTSLTWLQQDAY----DAVRRELVSVLRPLPPTNDTEIGLETCF 364
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 73/241 (30%), Positives = 104/241 (43%), Gaps = 25/241 (10%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP--------ESLYHP----KNNLVA 114
+ +G P + + + +DTGSDL W+ C C GCT P + Y P + V
Sbjct: 113 VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSFQATFYIPGMSSTSKAVP 170
Query: 115 CNDPFCSAFHLPENIRCEANDQCDYEVLYADHG-SSLGVLVTDHFPLRLTNG--SLLGPR 171
CN FC C QC Y+++Y G SS G LV D L N +L +
Sbjct: 171 CNSNFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQ 225
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFL 231
++ GCG Q G+ GLG+ + S+ S L GLT N C G G +
Sbjct: 226 IMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISF 285
Query: 232 GHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQA 291
G SS TP+ + Y+ + + G K T + + IFD+G+S+TY A
Sbjct: 286 GDQ--ESSDQEETPLDINRQHPTYAITISGITVGNKPTDMDFI-TIFDTGTSFTYLADPA 342
Query: 292 Y 292
Y
Sbjct: 343 Y 343
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 119/271 (43%), Gaps = 29/271 (10%)
Query: 60 LGYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VA 114
LG+Y + L IG PP K+Y + DTGSDLTW C PC C ++ P+ + ++
Sbjct: 69 LGHYLMELSIGTPPFKIYGI-ADTGSDLTWTSC-VPCNNCYKQRNPMFDPQKSTTYRNIS 126
Query: 115 CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LI 173
C+ C H + C +C+Y YA + GVL + L T G + + ++
Sbjct: 127 CDSKLC---HKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIV 183
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL-----SVRGGG 227
FGCG+N N G G++GLG G S++SQ+ S G R CL V
Sbjct: 184 FGCGHN--NTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKR--FSQCLVPFHTDVSVSS 239
Query: 228 YLFLGH-DLVPSSGIAWTPMSRDLLEKHY-------SSGPAELLFGGKSTGIKGLQIIFD 279
+ G V G+ TP+ + Y S L F G S ++ + D
Sbjct: 240 KMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEKGNMFLD 299
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
SG+ T +Q Y + +R ++ KP+ D
Sbjct: 300 SGTPPTILPTQLYDQVVAQVRSEVAMKPVTD 330
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 118/265 (44%), Gaps = 30/265 (11%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT----LPPE-SLYHP 108
T + Y +G Y +K+G+PP+ + + IDTGSD+ WV CN+ C C L E S + P
Sbjct: 77 TSDPYLVGLYFTKVKLGSPPREFNVQIDTGSDILWVTCNS-CNDCPRTSGLGIELSFFDP 135
Query: 109 ----KNNLVACNDPFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
+LV+C+ P C++ C ++QC Y Y D + G V+D
Sbjct: 136 SSSSTTSLVSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTV 195
Query: 164 NGSLL----GPRLIFGCGYNQRNPGPK-PPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
G L ++FGC Q K G+ G G S++SQL SLG+T V
Sbjct: 196 LGDSLIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFS 255
Query: 219 HCLSVR--GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI----- 271
HCL GGG L LG L P+ I ++P+ + HY+ + G+ I
Sbjct: 256 HCLKGEGDGGGKLVLGEILEPN--IIYSPLVPS--QSHYNLNLQSISVNGQLLPIDPAVF 311
Query: 272 ---KGLQIIFDSGSSYTYFNSQAYK 293
I DSG++ TY AY
Sbjct: 312 ATSNNQGTIVDSGTTLTYLVETAYD 336
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 140/323 (43%), Gaps = 61/323 (18%)
Query: 43 HRFGSTAVFPI------TGNVYP-----LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
+R G+ AV + T N+ G + + L IGNP Y +DTGSDL W QC
Sbjct: 77 NRLGAVAVLAVASNPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYAAIVDTGSDLIWTQC 136
Query: 92 NAPCTGCTLPPESLYHPKN----NLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHG 147
PCT C P ++ P+ + V C+ C+A LP + E D C+Y Y D+
Sbjct: 137 K-PCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNA--LPRSNCNEDKDSCEYLYTYGDYS 193
Query: 148 SSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
S+ G+L T+ F N S+ G + FGCG N G +G++GLG G S++SQL
Sbjct: 194 STRGLLATETFTFEDEN-SISG--IGFGCGV--ENEGDGFSQGSGLVGLGRGPLSLISQL 248
Query: 208 QSLGLTRNVLGHCLS----VRGGGYLFLG---HDLVPSSGI---AWTPMSRDLLEKHYSS 257
+ + +CL+ LF+G +V +G + LL
Sbjct: 249 KETKFS-----YCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQP 303
Query: 258 GPAELLFGGKSTGIKGLQI---------------IFDSGSSYTYFNSQAYKTTLDLMRKD 302
L G + G K L + I DSG++ TY A+K +++++
Sbjct: 304 SFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFK----VLKEE 359
Query: 303 LKGK---PLEDTAEEKALPVCWK 322
+ P++D+ L +C+K
Sbjct: 360 FTSRMSLPVDDSG-STGLDLCFK 381
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 78/256 (30%), Positives = 120/256 (46%), Gaps = 28/256 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+ + L +DTGS +T+V C+ C C + + P+ +
Sbjct: 74 GYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST-CKQCGKHQDPKFQPELS--------T 124
Query: 121 SAFHLPENIRCEANDQ---CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFGC 176
S L N C +D+ C YE YA+ SS GVL D + N S L P R +FGC
Sbjct: 125 SYQALKCNPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDL--ISFGNESQLSPQRAVFGC 182
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRGGGYLFLGH 233
N+ G++GLG GK S++ QL G+ +V C + V GGG + LG
Sbjct: 183 E-NEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEV-GGGAMVLGK 240
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYF 287
+ P G+ ++ S +Y+ ++ GKS + + DSG++Y YF
Sbjct: 241 -ISPPPGMVFS-HSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYF 298
Query: 288 NSQAYKTTLDLMRKDL 303
+A+ D + K++
Sbjct: 299 PKEAFIAIKDAVIKEI 314
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 78/256 (30%), Positives = 120/256 (46%), Gaps = 28/256 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+ + L +DTGS +T+V C+ C C + + P+ +
Sbjct: 74 GYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST-CKQCGKHQDPKFQPELS--------T 124
Query: 121 SAFHLPENIRCEANDQ---CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFGC 176
S L N C +D+ C YE YA+ SS GVL D + N S L P R +FGC
Sbjct: 125 SYQALKCNPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDL--ISFGNESQLSPQRAVFGC 182
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRGGGYLFLGH 233
N+ G++GLG GK S++ QL G+ +V C + V GGG + LG
Sbjct: 183 E-NEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEV-GGGAMVLGK 240
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYF 287
+ P G+ ++ S +Y+ ++ GKS + + DSG++Y YF
Sbjct: 241 -ISPPPGMVFS-HSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYF 298
Query: 288 NSQAYKTTLDLMRKDL 303
+A+ D + K++
Sbjct: 299 PKEAFIAIKDAVIKEI 314
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 128/295 (43%), Gaps = 54/295 (18%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + L IG PP Y +DTGSDL W QC APC C P + P + LV C
Sbjct: 90 GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCR 148
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGPRLIFG 175
P C+A P C C Y+ Y D S+ GVL ++ F N S ++ + FG
Sbjct: 149 SPLCAALPYPA---CFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFG 205
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS---------VRGG 226
CG N G + ++G++GLG G S++SQ LG +R +CL+ + G
Sbjct: 206 CG--NINSG-QLANSSGMVGLGRGPLSLVSQ---LGPSR--FSYCLTSFLSPEPSRLNFG 257
Query: 227 GYLFLGHDLVPSSG--IAWTPMSRD---------------LLEKHYSSGPAELLFGGKST 269
+ L SSG + TP+ + L +K P L+F
Sbjct: 258 VFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDP--LVFAINDD 315
Query: 270 GIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKG--KPLEDTAE-EKALPVCW 321
G G + DSG+S T+ AY D +R +L +PL T + E L C+
Sbjct: 316 GTGG--VFIDSGTSLTWLQQDAY----DAVRHELVSVLRPLPPTNDTEIGLETCF 364
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 79/264 (29%), Positives = 124/264 (46%), Gaps = 44/264 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+ + L +DTGS +T+V C+ C C + + P+
Sbjct: 78 GYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST-CKQCGKHQDPKFQPE---------LS 127
Query: 121 SAFH-LPENIRCEANDQ---CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFG 175
S++ L N C +D+ C YE YA+ SS GVL D + N S L P R +FG
Sbjct: 128 SSYKALKCNPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDL--ISFGNESQLTPQRAVFG 185
Query: 176 CG-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRG 225
C ++QR G++GLG GK S++ QL G+ +V C + V G
Sbjct: 186 CENVETGDLFSQR--------ADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEV-G 236
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFD 279
GG + LG + P +G+ ++ S +Y+ ++ GKS + + D
Sbjct: 237 GGAMVLGK-ISPPAGMVFS-HSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLD 294
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDL 303
SG++Y YF +A+ D + K++
Sbjct: 295 SGTTYAYFPKEAFIAIKDAIIKEI 318
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 111/273 (40%), Gaps = 36/273 (13%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT-----------------LPPESLYHPK 109
+ +G PP + + +DTGSDL W+ CN CT C L S P
Sbjct: 105 VSVGTPPLSFLVALDTGSDLFWLPCN--CTKCVHGIGLSNGEKIAFNIYDLKGSSTSQP- 161
Query: 110 NNLVACNDPFCSAFHLPENIRCEANDQ-CDYEVLYADHG-SSLGVLVTD--HFPLRLTNG 165
V CN C +C ++D C YEV Y +G S+ G LV D H
Sbjct: 162 ---VLCNSSLCEL-----QRQCPSSDTICPYEVNYLSNGTSTTGFLVEDVLHLITDDDKT 213
Query: 166 SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG 225
R+ FGCG Q G+ GLG+ S+ S L GLT N C G
Sbjct: 214 KDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMSNESVPSILAKEGLTSNSFSMCFGSDG 273
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYT 285
G + G + G TP + L Y+ +++ G K ++ IFDSG+S+T
Sbjct: 274 LGRITFGDNSSLVQG--KTPFNLRALHPTYNITVTQIIVGEKVDDLE-FHAIFDSGTSFT 330
Query: 286 YFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALP 318
Y N AYK + ++K + T+ LP
Sbjct: 331 YLNDPAYKQITNSFNSEIKLQR-HSTSSSNELP 362
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 120/277 (43%), Gaps = 31/277 (11%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
++S+ TA PI N G Y V + +G PP DTGSD+ W QC
Sbjct: 57 RRSSHRNTVVLESDTAEAPIFNNG---GEYLVEISVGTPPFSIVAVADTGSDVIWTQCK- 112
Query: 94 PCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSS 149
PC+ C ++ P + VAC+ P CS + + C + +C Y + Y D S
Sbjct: 113 PCSNCYQQNAPMFDPSKSTTYKNVACSSPVCS--YSGDGSSCSDDSECLYSIAYGDDSHS 170
Query: 150 LGVLVTDHFPLRLTNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
G L D ++ T+G + PR + GCG++ N G +G++GLG G AS+++QL
Sbjct: 171 QGNLAVDTVTMQSTSGRPVAFPRTVIGCGHD--NAGTFNANVSGIVGLGRGPASLVTQLG 228
Query: 209 SLGLTRNVLGHCLSVRGGGYL-------FLGHDLVPSSGIAWTPMSRDLLEKHY------ 255
T +CL G G F + V SG TP+ K +
Sbjct: 229 P--ATGGKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLE 286
Query: 256 --SSGPAELLFGGKSTGIKG-LQIIFDSGSSYTYFNS 289
S G + F ++ + G II DSG++ TY S
Sbjct: 287 AVSVGDTKFNFPEGASKLGGESNIIIDSGTTLTYLPS 323
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/257 (29%), Positives = 111/257 (43%), Gaps = 28/257 (10%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE---------SLYHPKNNL----V 113
+ +G P Y + +DTGSDL W+ CN CT C + ++Y K + V
Sbjct: 117 VSVGTPASSYLVALDTGSDLFWLPCN--CTKCVHGIQLSTGQKIAFNIYDNKESSTSKNV 174
Query: 114 ACNDPFCSAFHLPENIRCEAND--QCDYEVLY-ADHGSSLGVLVTDHFPLRLTNGSLL-- 168
ACN C + +C ++ C Y+V Y +++ S+ G LV D L N
Sbjct: 175 ACNSSLCE-----QKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITDNDDQTQH 229
Query: 169 -GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
P + FGCG Q G+ GLG+ S+ S L GLT N C + G G
Sbjct: 230 ANPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILAKQGLTSNSFSMCFAADGLG 289
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
+ G D S TP + Y+ +++ GG S ++ IFD+G+S+TY
Sbjct: 290 RITFG-DNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNSADLE-FNAIFDTGTSFTYL 347
Query: 288 NSQAYKTTLDLMRKDLK 304
N+ AYK +K
Sbjct: 348 NNPAYKQITQSFDSKIK 364
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 89/193 (46%), Gaps = 22/193 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y V + IG PP +DTGSDL W QC+APC C P LY P + V+C P
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 119 FCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIFGC 176
C A P + RC D C Y Y D S+ GVL T+ F L GS R + FGC
Sbjct: 152 MCQALQSPWS-RCSPPDTGCAYYFSYGDGTSTDGVLATETFTL----GSDTAVRGVAFGC 206
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS---VRGGGYLFLGH 233
G ++G++G+G G S++SQ LG+TR +C + LFLG
Sbjct: 207 GTENLGSTDN---SSGLVGMGRGPLSLVSQ---LGVTR--FSYCFTPFNATAASPLFLGS 258
Query: 234 DLVPSSGIAWTPM 246
SS TP
Sbjct: 259 SARLSSAAKTTPF 271
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 124/282 (43%), Gaps = 46/282 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + L +G PP DTGSD+ W QC PCT C +++P + V+C+
Sbjct: 83 GEYLMKLSVGTPPFPIIAVADTGSDIIWTQC-VPCTNCYQQDLPMFNPSKSTTYRKVSCS 141
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFG 175
P CS E+ C C Y + Y D+ S G D + T+G ++ PR G
Sbjct: 142 SPVCS--FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIG 199
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---GGY--LF 230
CG++ N G +G++GLGLG AS++ Q+ S +CL+ G GG L
Sbjct: 200 CGHD--NAGSFDANVSGIVGLGLGPASLIKQMGS--AVGGKFSYCLTPIGNDDGGSNKLN 255
Query: 231 LGHDL-VPSSGIAWTPMS-RDLLEKHYS--------------SGPAELLFGGKSTGIKGL 274
G + V SG TP+ D + YS A + GGK+
Sbjct: 256 FGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKA------ 309
Query: 275 QIIFDSGSSYT------YFN-SQAYKTTLDLMRKDLKGKPLE 309
II DSG++ T Y N ++A +++L R D + LE
Sbjct: 310 NIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLE 351
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 105/264 (39%), Gaps = 32/264 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y V + IG+PP L +D+GSD+ WVQC PC C + L+ P + V C
Sbjct: 125 GEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPATSATFSAVPCG 183
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C C + CDYEV Y D + G L + LT G + GC
Sbjct: 184 SAVCRTLRTSG---CGDSGGCDYEVSYGDGSYTKGALALET----LTLGGTAVEGVAIGC 236
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLV 236
G+ R AG+LGLG G S++ QL +CL+ RG G L LG
Sbjct: 237 GHRNRGLFVG---AAGLLGLGWGPMSLVGQLGG--AAGGAFSYCLASRGAGSLVLGRSEA 291
Query: 237 PSSGIAWTPMSRD-LLEKHYSSGPA------------ELLFGGKSTGIKGLQIIFDSGSS 283
G W P+ R+ Y G + E LF G G ++ D+G++
Sbjct: 292 VPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGG--VVMDTGTA 349
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKP 307
T +AY D + P
Sbjct: 350 VTRLPQEAYAALRDAFVAAVGALP 373
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 116/275 (42%), Gaps = 24/275 (8%)
Query: 49 AVFPI-TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYH 107
A P+ +G G Y+VT+ +G P K + L DTGSDLTW QC C E
Sbjct: 118 ATLPVQSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLD 177
Query: 108 PKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
P + ++C+ FC C ++ C Y+V Y D S+G T+ L +
Sbjct: 178 PTKSTSYKNISCSSAFCKLLDTEGGESC-SSPTCLYQVQYGDGSYSIGFFATETLTLSSS 236
Query: 164 NGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-- 221
N + +FGCG Q+N G AG+LGLG K S+ S Q+ + + +CL
Sbjct: 237 N---VFKNFLFGCG--QQNSGLFRGA-AGLLGLGRTKLSLPS--QTAQKYKKLFSYCLPA 288
Query: 222 SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEK-HYSSGPAELLFGGKSTGIKGLQI---- 276
S GYL G + S + +TP+S D Y EL GG I
Sbjct: 289 SSSSKGYLSFGGQV--SKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTSG 346
Query: 277 -IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
+ DSG+ T S AY +K + P D
Sbjct: 347 TVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTD 381
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/262 (28%), Positives = 117/262 (44%), Gaps = 34/262 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPKNN--LV 113
G Y ++IG+P K Y + +DTGSD+ WV C C GC + Y P + V
Sbjct: 83 GLYYTQIEIGSPSKGYYVQVDTGSDILWVNC-IRCDGCPTTSGLGIELTQYDPAGSGTTV 141
Query: 114 ACNDPFCSAFHLPENI--RCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNG----S 166
C+ FC A + P + C + C + + Y D S+ G V+D +G +
Sbjct: 142 GCDQEFCVA-NSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTT 200
Query: 167 LLGPRLIFGCGYN-QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVR 224
+ FGCG + G G+LG G +S+LSQL + R + HCL +V
Sbjct: 201 PSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTVH 260
Query: 225 GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI----------KGL 274
GGG +G+ + P + TP+ +++ HY+ + GG + + KG
Sbjct: 261 GGGIFAIGNVVQPK--VKTTPLVQNV--THYNVNLQGISVGGATLQLPSSTFDSGDSKG- 315
Query: 275 QIIFDSGSSYTYFNSQAYKTTL 296
I DSG++ Y + Y+T L
Sbjct: 316 -TIIDSGTTLAYLPREVYRTLL 336
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 89/193 (46%), Gaps = 22/193 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y V + IG PP +DTGSDL W QC+APC C P LY P + V+C P
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 119 FCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIFGC 176
C A P + RC D C Y Y D S+ GVL T+ F L GS R + FGC
Sbjct: 152 MCQALQSPWS-RCSPPDTGCAYYFSYGDGTSTDGVLATETFTL----GSDTAVRGVAFGC 206
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS---VRGGGYLFLGH 233
G ++G++G+G G S++SQ LG+TR +C + LFLG
Sbjct: 207 GTENLGSTDN---SSGLVGMGRGPLSLVSQ---LGVTR--FSYCFTPFNATAASPLFLGS 258
Query: 234 DLVPSSGIAWTPM 246
SS TP
Sbjct: 259 SARLSSAAKTTPF 271
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 83/290 (28%), Positives = 128/290 (44%), Gaps = 50/290 (17%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACNDPFC 120
+ L IGNP Y +DTGSDL W QC PCT C P ++ P+ + V C+ C
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCK-PCTECFDQPTPIFDPEKSSSYSKVGCSSGLC 59
Query: 121 SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQ 180
+A LP + E D C+Y Y D+ S+ G+L T+ F N S+ G + FGCG
Sbjct: 60 NA--LPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN-SISG--IGFGCGV-- 112
Query: 181 RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS----VRGGGYLFLG---H 233
N G +G++GLG G S++SQL+ +CL+ LF+G
Sbjct: 113 ENEGDGFSQGSGLVGLGRGPLSLISQLK-----ETKFSYCLTSIEDSEASSSLFIGSLAS 167
Query: 234 DLVPSSGIAW---TPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI-------------- 276
+V +G + + LL L G + G K L +
Sbjct: 168 GIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGG 227
Query: 277 -IFDSGSSYTYFNSQAYKTTLDLMRKDLKGK---PLEDTAEEKALPVCWK 322
I DSG++ TY A+K +++++ + P++D+ L +C+K
Sbjct: 228 MIIDSGTTITYLEETAFK----VLKEEFTSRMSLPVDDSG-STGLDLCFK 272
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 124/282 (43%), Gaps = 46/282 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + L +G PP DTGSD+ W QC PCT C +++P + V+C+
Sbjct: 83 GEYLMKLSVGTPPFPIIAVADTGSDIIWTQCE-PCTNCYQQDLPMFNPSKSTTYRKVSCS 141
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFG 175
P CS E+ C C Y + Y D+ S G D + T+G ++ PR G
Sbjct: 142 SPVCS--FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIG 199
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---GGY--LF 230
CG++ N G +G++GLGLG AS++ Q+ S +CL+ G GG L
Sbjct: 200 CGHD--NAGSFDANVSGIVGLGLGPASLIKQMGS--AVGGKFSYCLTPIGNDDGGSNKLN 255
Query: 231 LGHDL-VPSSGIAWTPMS-RDLLEKHYS--------------SGPAELLFGGKSTGIKGL 274
G + V SG TP+ D + YS A + GGK+
Sbjct: 256 FGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKA------ 309
Query: 275 QIIFDSGSSYT------YFN-SQAYKTTLDLMRKDLKGKPLE 309
II DSG++ T Y N ++A +++L R D + LE
Sbjct: 310 NIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLE 351
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 117/264 (44%), Gaps = 44/264 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG+PP+ + L +DTGS +T+V C + C C + + P+ L + P
Sbjct: 87 GYYTTRLWIGSPPQEFALIVDTGSTVTYVPC-SNCVQCGNHQDPRFQPE--LSSTYQPVK 143
Query: 120 CSAFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG- 177
C+A + C+ N QC YE YA+ +S GVL D L+ R +FGC
Sbjct: 144 CNA-----DCNCDENGVQCTYERRYAEMSTSSGVLAEDVMSFG-KESELVPQRAVFGCET 197
Query: 178 ------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRGGGY 228
Y QR G++GLG G S++ QL G+ N C + V GG
Sbjct: 198 MESGDLYTQR--------ADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAM 249
Query: 229 LFLGHDLVPSSGIAWTPMSR------DLLEKHYSSGPAEL---LFGGKSTGIKGLQIIFD 279
+ G P + + SR +L E H + P +L F GK I D
Sbjct: 250 VLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGA------ILD 303
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDL 303
SG++Y YF +AY D + K +
Sbjct: 304 SGTTYAYFPEKAYYAFKDAIMKKI 327
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 80/288 (27%), Positives = 120/288 (41%), Gaps = 39/288 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + +G+PP L +D+GSD+ WVQC PC C + + L+ P + V+C
Sbjct: 169 GEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCK-PCLECYVQADPLFDPATSATFSGVSCG 227
Query: 117 DPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C LP + + C+YEV YAD + G L + LT G ++ G
Sbjct: 228 SAICRI--LPTSACGDGELGGCEYEVSYADGSYTKGALALET----LTLGGTAVEGVVIG 281
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--------- 226
CG+ R AG++GLG G S++ QL G +CL+ RGG
Sbjct: 282 CGHRNRG---LFVGAAGLMGLGWGPMSLVGQLG--GEVGGAFSYCLASRGGYGSGAADDD 336
Query: 227 -GYLFLGHDLVPSSGIAWTPMSRD-LLEKHYSSGPAELLFGGKSTGIK-GL--------- 274
G+L LG G W P+ R+ Y G + + G + ++ GL
Sbjct: 337 AGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAG 396
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK-PLEDTAEEKALPVCW 321
++ D+G++ T +AY D L G P L C+
Sbjct: 397 DVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCY 444
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 117/264 (44%), Gaps = 44/264 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG+PP+ + L +DTGS +T+V C + C C + + P+ L + P
Sbjct: 87 GYYTTRLWIGSPPQEFALIVDTGSTVTYVPC-SNCVQCGNHQDPRFQPE--LSSTYQPVK 143
Query: 120 CSAFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG- 177
C+A + C+ N QC YE YA+ +S GVL D L+ R +FGC
Sbjct: 144 CNA-----DCNCDENGVQCTYERRYAEMSTSSGVLAEDVMSFG-KESELVPQRAVFGCET 197
Query: 178 ------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRGGGY 228
Y QR G++GLG G S++ QL G+ N C + V GG
Sbjct: 198 MESGDLYTQR--------ADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAM 249
Query: 229 LFLGHDLVPSSGIAWTPMSR------DLLEKHYSSGPAEL---LFGGKSTGIKGLQIIFD 279
+ G P + + SR +L E H + P +L F GK I D
Sbjct: 250 VLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGA------ILD 303
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDL 303
SG++Y YF +AY D + K +
Sbjct: 304 SGTTYAYFPEKAYYAFKDAIMKKI 327
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 115/273 (42%), Gaps = 38/273 (13%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP 108
FP+ G P +G Y +K+G PP+ + IDTGSD+ WV C + C GC P S
Sbjct: 63 FPVKGTFDPSQVGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGS-CNGC--PQTSGLQI 119
Query: 109 KNNLV-----------ACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTD 156
+ N +C D C + + C N+QC Y Y D + G V+D
Sbjct: 120 QLNYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSD 179
Query: 157 --HFPLRLTNGSLL---GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSL 210
HF + G+L ++FGC Q K G+ G G S++SQL S
Sbjct: 180 LMHFA-SIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQ 238
Query: 211 GLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS 268
G+ V HCL GGG L LG + P+ I ++P+ + HY+ + G+
Sbjct: 239 GIAPRVFSHCLKGDNSGGGVLVLGEIVEPN--IVYSPLVPS--QPHYNLNLQSISVNGQI 294
Query: 269 TGI--------KGLQIIFDSGSSYTYFNSQAYK 293
I I DSG++ Y +AY
Sbjct: 295 VRIAPSVFATSNNRGTIVDSGTTLAYLAEEAYN 327
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 79/281 (28%), Positives = 115/281 (40%), Gaps = 27/281 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + + IG P + Y +DTGSDL W QC APC C P + P N+ + C+
Sbjct: 90 GEYLMEMGIGTPARFYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPANSSTYRSLGCS 148
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
P C+A + P + C Y+ Y D S+ GVL + F + + PR+ FGC
Sbjct: 149 APACNALYYPLCYQ----KTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGC 204
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYL--FLGHD 234
G N G +G++G G G S++SQL S + + VR Y + +
Sbjct: 205 G--NLNAG-SLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVRSRLYFGAYATLN 261
Query: 235 LVPSSGIAWTP-MSRDLLEKHYSSGPAELLFGGKSTGIKGLQI-----------IFDSGS 282
+S + TP + L Y + GG I + I DSG+
Sbjct: 262 STNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGT 321
Query: 283 SYTYFNSQAYKTTLDLMRKDLKGK-PLEDTAEEKALPVCWK 322
+ TY AY + L PL D E L C++
Sbjct: 322 TITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQ 362
>gi|172034220|gb|ACB69715.1| putative nucellin-like aspartic protease [Hordeum vulgare]
Length = 310
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 53/159 (33%), Positives = 84/159 (52%), Gaps = 9/159 (5%)
Query: 172 LIFGCGYNQRNPG-PKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS--VRGGGY 228
+ G ++Q+ P T+G+LGL S+ SQL S G+ NV GHC++ GGGY
Sbjct: 14 FVLGVTFDQQGQLLSSPAKTSGILGLSSAAISLPSQLASKGIISNVFGHCITRETNGGGY 73
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS--TGIKGLQIIFDSGSSYTY 286
+FLG D VP G+ W P+ R + Y + ++ +G + GI +Q+I G+SYTY
Sbjct: 74 MFLGDDYVPRWGMTWAPI-RGGPDNLYHTEAQKVNYGDQELHAGIP-VQVISRCGTSYTY 131
Query: 287 FNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTW 325
+ YK +D +++D + + LP+CWK +
Sbjct: 132 LPEEMYKNLIDAIKED--SPSFVQDSSDTTLPLCWKADF 168
>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
Length = 566
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 114/268 (42%), Gaps = 43/268 (16%)
Query: 46 GSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
G FP+ G P +G Y +K+G PP+ + + IDTGSD+ WV C + C GC E
Sbjct: 113 GGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSE 171
Query: 104 -----SLYHP----KNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLV 154
S + P +LV+C+D C + E+ C N+ C Y Y D + G +
Sbjct: 172 LQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTES-GCSPNNLCSYSFKYGDGSGTSGYYI 230
Query: 155 TDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTR 214
+D L +G L PR G+ GLG G S++SQL GL
Sbjct: 231 SDFMCSNLQSGDLQRPR----------------RAVDGIFGLGQGSLSVISQLAVQGLAP 274
Query: 215 NVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK 272
V HCL GGG + LG P + +TP+ + HY+ + G+ I
Sbjct: 275 RVFSHCLKGDKSGGGIMVLGQIKRPDT--VYTPLVPS--QPHYNVNLQSIAVNGQILPID 330
Query: 273 --------GLQIIFDSGSSYTYFNSQAY 292
G I D+G++ Y +AY
Sbjct: 331 PSVFTIATGDGTIIDTGTTLAYLPDEAY 358
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 71/258 (27%), Positives = 110/258 (42%), Gaps = 26/258 (10%)
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP---------ESLYHP 108
Y +G Y +K+G+PP+ + + IDTGSD+ WV CN+ C C +S
Sbjct: 61 YLVGLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTSGLGIQLNFFDSSSSS 119
Query: 109 KNNLVACNDPFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
LV C+DP C++ +C +QC Y Y D + G V+D G
Sbjct: 120 TAGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGES 179
Query: 168 L----GPRLIFGCG-YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
L ++FGC + + G+ G G G+ S++SQL + G+T V HCL
Sbjct: 180 LVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLK 239
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI--------KGL 274
G G L + G+ ++P+ + HY+ + GK I
Sbjct: 240 GEGIGGGILVLGEILEPGMVYSPLVPS--QPHYNLNLQSIAVNGKLLPIDPSVFATSNSQ 297
Query: 275 QIIFDSGSSYTYFNSQAY 292
I DSG++ Y ++AY
Sbjct: 298 GTIVDSGTTLAYLVAEAY 315
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 118/264 (44%), Gaps = 35/264 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACN 116
G Y + + +G PP+ + +DTGSDL WVQC APC C P+ L+ P + +C
Sbjct: 6 GEYVLQISLGTPPQQFSAIVDTGSDLCWVQC-APCARCFEQPDPLFIPLASSSYSNASCT 64
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
D C A P C + C Y Y D ++ G + L NGS L R+ FGC
Sbjct: 65 DSLCDALPRPT---CSMRNTCTYSYSYGDGSNTRGDFAFETVTL---NGSTLA-RIGFGC 117
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLFLGH 233
G+NQ G++GLG G S+ SQL S ++ +CL S G
Sbjct: 118 GHNQEGTFAG---ADGLIGLGQGPLSLPSQLNS--SFTHIFSYCLVDQSTTGTFSPITFG 172
Query: 234 DLVPSSGIAWTPMSRDLLE-KHYSSGPAELLFGGK------------STGIKGLQIIFDS 280
+ +S ++TP+ ++ +Y G + G + + G+ G +I DS
Sbjct: 173 NAAENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGG--VILDS 230
Query: 281 GSSYTYFNSQAYKTTLDLMRKDLK 304
G++ TY+ A+ L +R+ +
Sbjct: 231 GTTITYWRLAAFIPILAELRRQIS 254
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 81/270 (30%), Positives = 119/270 (44%), Gaps = 27/270 (10%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC------TLPPE--S 104
T + PLG+ Y + +G P Y + +DTGSDL W+ C+ C C T P +
Sbjct: 97 TLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFN 154
Query: 105 LYHPKNNL----VACNDPFCSAFHLPENIRCEA-NDQCDYEVLY-ADHGSSLGVLVTD-- 156
+Y P N+ V C+ CS HL + C + +D C Y+V Y +D+ SS G LV D
Sbjct: 155 IYSPNNSSTSKEVQCSSSLCS--HLDQ---CSSPSDTCPYQVSYLSDNTSSTGYLVEDIL 209
Query: 157 HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
H + R+ GCG +Q G+ GLG+ S+ S L + GL N
Sbjct: 210 HLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNS 269
Query: 217 LGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI 276
C G + G P G TP + Y+ ++ GG + + + +
Sbjct: 270 FSLCFGPARMGRIEFGDKGSP--GQNETPFNLGRRHPTYNVSITQIGVGGHISDLD-VAV 326
Query: 277 IFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
IFDSG+S+TY N AY D ++ K
Sbjct: 327 IFDSGTSFTYLNDPAYSLFADKFASMVEEK 356
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 79/264 (29%), Positives = 122/264 (46%), Gaps = 42/264 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+ + L +DTGS +T+V C++ C C + + P +L + P
Sbjct: 75 GYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSS-CEQCGKHQDPRFQP--DLSSTYRP-- 129
Query: 121 SAFHLPENIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFGC 176
+ N C +D QC YE YA+ SS GV+ D + N S L P R +FGC
Sbjct: 130 ----VKCNPSCNCDDEGKQCTYERRYAEMSSSSGVIAED--VVSFGNESELKPQRAVFGC 183
Query: 177 G-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRGG 226
Y+QR G++GLG G+ S++ QL G+ + C + V GG
Sbjct: 184 ENVETGDLYSQR--------ADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDV-GG 234
Query: 227 GYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDS 280
G + LG + P + ++ S +Y+ EL GK +K + DS
Sbjct: 235 GAMVLGQ-ISPPPNMVFS-HSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVLDS 292
Query: 281 GSSYTYFNSQAYKTTLDLMRKDLK 304
G++Y YF A+ D + K+++
Sbjct: 293 GTTYAYFPEAAFHALKDAIMKEIR 316
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 78/281 (27%), Positives = 127/281 (45%), Gaps = 27/281 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
G Y +T +G PP +DTGSD+ W+QC PC C +++P + N P C
Sbjct: 85 GEYLMTYSVGTPPFNVYGVVDTGSDIVWLQC-KPCEQCYKQTTPIFNPSKSSSYKNIP-C 142
Query: 121 SAFHLPENIR---CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG-SLLGPRLIFGC 176
S+ +L +++R C + C+Y + ++D S G L + L T G S+ P+ + GC
Sbjct: 143 SS-NLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGC 201
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNVLGHCL------SVRGGGYL 229
G+N R G T+G++GLG+G S+ +QL+ S+G +CL S +
Sbjct: 202 GHNNR--GMFQGETSGIVGLGIGPVSLTTQLKSSIG---GKFSYCLLPLLVDSNKTSKLN 256
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGL------QIIFDSGSS 283
F +V G+ TP + + Y G K + L II DSG++
Sbjct: 257 FGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDDSEEGNIILDSGTT 316
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
T S Y + + +K ++D + L +C+ T
Sbjct: 317 LTLLPSHVYTNLESAVAQLVKLDRVDD--PNQLLNLCYSIT 355
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 81/257 (31%), Positives = 116/257 (45%), Gaps = 30/257 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ + IG PP+ + L +DTGS +T+V C+ C C + + P+ L + P
Sbjct: 88 GYYTTRIWIGTPPQTFALIVDTGSTVTYVPCST-CEQCGRHQDPKFEPE--LSSTYQPVS 144
Query: 121 SAFHLPENIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFGC 176
NI C ++ QC YE YA+ SS GVL D + N S L P R IFGC
Sbjct: 145 C------NIDCTCDNERKQCVYERQYAEMSSSSGVLGED--IISFGNQSELVPQRAIFGC 196
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHD 234
NQ G++GLG G SI+ QL G+ + C GGG + LG
Sbjct: 197 E-NQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMILG-G 254
Query: 235 LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS--------TGIKGLQIIFDSGSSYTY 286
+ P SG+ + S + ++Y+ + GK G G + DSG++Y Y
Sbjct: 255 ISPPSGMVFAE-SDPVRSQYYNIDLKAIHVAGKQLHLDPSIFDGKHG--TVLDSGTTYAY 311
Query: 287 FNSQAYKTTLDLMRKDL 303
A+ D M K+L
Sbjct: 312 LPEAAFTAFKDAMMKEL 328
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 77/153 (50%), Gaps = 12/153 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y +++ IG PP+ Y +DTGSDL W QC APC C P + P + + CN
Sbjct: 87 GEYLMSMGIGTPPRYYSAILDTGSDLIWTQC-APCMLCVDQPTPFFDPAQSPSYAKLPCN 145
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
P C+A + P R + C Y+ Y D ++ GVL + F + + PR+ FGC
Sbjct: 146 SPMCNALYYPLCYR----NVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGC 201
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
G N G +G++G G G S++SQL S
Sbjct: 202 G--NLNAG-SLFNGSGMVGFGRGPLSLVSQLGS 231
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/270 (30%), Positives = 119/270 (44%), Gaps = 27/270 (10%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC------TLPPE--S 104
T + PLG+ Y + +G P Y + +DTGSDL W+ C+ C C T P +
Sbjct: 120 TLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFN 177
Query: 105 LYHPKNNL----VACNDPFCSAFHLPENIRCEA-NDQCDYEVLY-ADHGSSLGVLVTD-- 156
+Y P N+ V C+ CS HL + C + +D C Y+V Y +D+ SS G LV D
Sbjct: 178 IYSPNNSSTSKEVQCSSSLCS--HLDQ---CSSPSDTCPYQVSYLSDNTSSTGYLVEDIL 232
Query: 157 HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
H + R+ GCG +Q G+ GLG+ S+ S L + GL N
Sbjct: 233 HLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNS 292
Query: 217 LGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI 276
C G + G P G TP + Y+ ++ GG + + + +
Sbjct: 293 FSLCFGPARMGRIEFGDKGSP--GQNETPFNLGRRHPTYNVSITQIGVGGHISDLD-VAV 349
Query: 277 IFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
IFDSG+S+TY N AY D ++ K
Sbjct: 350 IFDSGTSFTYLNDPAYSLFADKFASMVEEK 379
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 113/278 (40%), Gaps = 24/278 (8%)
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL-- 112
G G Y VT+ +G P Y + DTGSD TWVQC C E L+ P +
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 230
Query: 113 --VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
V+C P CS + R + C Y V Y D S+G D L + ++ G
Sbjct: 231 ANVSCAAPACSDL----DTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-AVKGF 285
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGY 228
R FGCG +RN G AG+LGLG GK S+ +Q+ V HCL R G GY
Sbjct: 286 R--FGCG--ERNEGLF-GEAAGLLGLGRGKTSL--PVQTYDKYGGVFAHCLPARSTGTGY 338
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK-----STGIKGLQIIFDSGSS 283
L G P++ + TPM D Y G + GG+ + I DSG+
Sbjct: 339 LDFGAG-SPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTV 397
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
T AY + + + + L C+
Sbjct: 398 ITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCY 435
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/246 (28%), Positives = 110/246 (44%), Gaps = 29/246 (11%)
Query: 52 PITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN 111
P ++ Y + L++G PP E +IDTGSDL W QC PCT C ++ P N+
Sbjct: 50 PYADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNCYSQYAPIFDPSNS 108
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGP 170
S F + RC N C Y+++YAD S G L T+ + T+G + P
Sbjct: 109 ---------STF---KEKRCNGN-SCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMP 155
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF 230
GCG+N P +G++GL G +S+++Q+ G ++ +C + +G +
Sbjct: 156 ETTIGCGHNSS---WFKPTFSGMVGLSWGPSSLITQMG--GEYPGLMSYCFASQGTSKIN 210
Query: 231 LGHD-LVPSSGIAWTPMSRDLLE--------KHYSSGPAELLFGGKSTGIKGLQIIFDSG 281
G + +V G+ T M + S G + G + II DSG
Sbjct: 211 FGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSG 270
Query: 282 SSYTYF 287
++ TYF
Sbjct: 271 TTLTYF 276
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/246 (28%), Positives = 110/246 (44%), Gaps = 29/246 (11%)
Query: 52 PITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN 111
P ++ Y + L++G PP E +IDTGSDL W QC PCT C ++ P N+
Sbjct: 50 PYADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNCYSQYAPIFDPSNS 108
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGP 170
S F + RC N C Y+++YAD S G L T+ + T+G + P
Sbjct: 109 ---------STF---KEKRCNGN-SCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMP 155
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF 230
GCG+N P +G++GL G +S+++Q+ G ++ +C + +G +
Sbjct: 156 ETTIGCGHNSS---WFKPTFSGMVGLSWGPSSLITQMG--GEYPGLMSYCFASQGTSKIN 210
Query: 231 LGHD-LVPSSGIAWTPMSRDLLE--------KHYSSGPAELLFGGKSTGIKGLQIIFDSG 281
G + +V G+ T M + S G + G + II DSG
Sbjct: 211 FGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSG 270
Query: 282 SSYTYF 287
++ TYF
Sbjct: 271 TTLTYF 276
>gi|213998828|gb|ACJ60781.1| nucellin [Hordeum brachyantherum subsp. californicum]
Length = 133
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 70/127 (55%), Gaps = 4/127 (3%)
Query: 187 PPPTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTP 245
P P G+LGLG+GKA QL+ + T NV+GHCLS +G G L++G PS G+ W P
Sbjct: 6 PSPVDGILGLGMGKAGFAVQLKGQKMITGNVIGHCLSSQGKGVLYVGDFNPPSRGVTWVP 65
Query: 246 MSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
M L +YS G AE L + G + +FDSGS+YT+ +Q Y + +R L
Sbjct: 66 MKESLF--YYSPGLAEPLIDNQPIRGNPTFEAVFDSGSTYTHVPAQVYNEIVSKVRGTLS 123
Query: 305 GKPLEDT 311
LE+
Sbjct: 124 ESSLEEV 130
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 122/290 (42%), Gaps = 36/290 (12%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLPPESL 105
S A P + + +G Y +T +G PP KLY + +DTGSD+ W+QC PC C +
Sbjct: 71 SLANIPQSTVIPDIGEYLMTYSVGTPPFKLYGI-VDTGSDIVWLQCE-PCQECYNQTTPM 128
Query: 106 YHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLR 161
++P + + C C + E+ C + C+Y Y D+ S G L D L
Sbjct: 129 FNPSKSSSYKNIPCPSKLCQSM---EDTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLE 185
Query: 162 LTNG-SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC 220
TNG ++ P ++ GCG N N ++G++G G G AS ++QL S T +C
Sbjct: 186 STNGLTVSFPNIVIGCGTN--NILSYEGASSGIVGFGSGPASFITQLGS--STGGKFSYC 241
Query: 221 L----------SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHY-------SSGPAELL 263
L S F V G+ TP+ + E Y S G +
Sbjct: 242 LTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVE 301
Query: 264 FGGKSTGIKGLQIIFDSGSSYTYFNSQAY----KTTLDLMRKDLKGKPLE 309
GG G II DSG++ T Y +DL++ + P +
Sbjct: 302 IGGVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQ 351
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 74/273 (27%), Positives = 110/273 (40%), Gaps = 29/273 (10%)
Query: 46 GSTAVFPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
G F + G P +G Y +K+G PPK + + IDTGSD+ WV CN C+ C +
Sbjct: 59 GGVVDFSVQGTSDPNSVGLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNT-CSNCPQSSQ 117
Query: 104 ---------SLYHPKNNLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVL 153
++ L+ C+DP C++ C +QC Y Y D + G
Sbjct: 118 LGIELNFFDTVGSSTAALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYY 177
Query: 154 VTDHFPLRLTNGS----LLGPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQ 208
V+D L G ++FGC +Q K G+ G G G S++SQL
Sbjct: 178 VSDAMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLS 237
Query: 209 SLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS 268
S G+T V HCL G G L + I ++P+ + HY+ + G+
Sbjct: 238 SRGITPKVFSHCLKGDGDGGGVLVLGEILEPSIVYSPLVPS--QPHYNLNLQSIAVNGQL 295
Query: 269 TGIKGLQI---------IFDSGSSYTYFNSQAY 292
I I D G++ Y +AY
Sbjct: 296 LPINPAVFSISNNRGGTIVDCGTTLAYLIQEAY 328
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/272 (29%), Positives = 120/272 (44%), Gaps = 30/272 (11%)
Query: 60 LGYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VA 114
LG+Y + + IG PP K+Y + DTGSDLTW C PC C ++ P+ + ++
Sbjct: 22 LGHYLMEVSIGTPPFKIYGI-ADTGSDLTWTSC-VPCNKCYKQRNPIFDPQKSTSYRNIS 79
Query: 115 CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LI 173
C+ C H + C C+Y YA + GVL + L T G + + ++
Sbjct: 80 CDSKLC---HKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIV 136
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL-----SVRGGG 227
FGCG+N N G G++GLG G S +SQ+ S G R CL V
Sbjct: 137 FGCGHN--NTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKR--FSQCLVPFHTDVSVSS 192
Query: 228 YLFLGH-DLVPSSGIAWTPMSRDLLEKHY-------SSGPAELLFGGKST-GIKGLQIIF 278
+ LG V G+ TP+ + Y S G L F G S+ ++ +
Sbjct: 193 KMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKGNVFL 252
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
DSG+ T +Q Y + +R ++ KP+ +
Sbjct: 253 DSGTPPTILPTQLYDRLVAQVRSEVAMKPVTN 284
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/280 (28%), Positives = 120/280 (42%), Gaps = 24/280 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + +G+P + Y + +DTGS L+W+QC C + + L+ P + ++C
Sbjct: 11 GNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCT 70
Query: 117 DPFCSAF--HLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
CS+ N CE +++ C Y Y D S+G L D L L L P +
Sbjct: 71 SSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDL--LTLAPSQTL-PGFV 127
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS-LGLTRNVLGHCLSVR-GGGYLFL 231
+GCG + + AG+LGLG K S+L Q+ S G +CL R GGG+L +
Sbjct: 128 YGCGQDSEGLFGR---AAGILGLGRNKLSMLGQVSSKFGY---AFSYCLPTRGGGGFLSI 181
Query: 232 GHDLVPSSGIAWTPMSRDLLEKH-YSSGPAELLFGGKSTGIKGLQ----IIFDSGSSYTY 286
G + S +TPM+ D Y + GG++ G+ Q I DSG+ T
Sbjct: 182 GKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITR 241
Query: 287 FNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWK 326
Y K + K L C+KG K
Sbjct: 242 LPMSVYTPFQQAFVKIMSSK-YARAPGFSILDTCFKGNLK 280
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 113/278 (40%), Gaps = 25/278 (8%)
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL-- 112
G G Y VT+ +G P Y + DTGSD TWVQC C E L+ P ++
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 230
Query: 113 --VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
V+C P CS ++ + C Y V Y D S+G D L + ++ G
Sbjct: 231 ANVSCAAPACSDL----DVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-AVKGF 285
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGY 228
R FGCG +RN G AG+LGLG GK S+ +Q+ G V HCL R G GY
Sbjct: 286 R--FGCG--ERNDGLF-GEAAGLLGLGRGKTSL--PVQTYGKYGGVFAHCLPARSTGTGY 338
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSGSS 283
L G P++ TPM Y G + GG+ I I DSG+
Sbjct: 339 LDFGAGSPPAT--TTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTV 396
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
T AY + + + A L C+
Sbjct: 397 ITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCY 434
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 113/278 (40%), Gaps = 25/278 (8%)
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL-- 112
G G Y VT+ +G P Y + DTGSD TWVQC C E L+ P ++
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 234
Query: 113 --VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
V+C P CS ++ + C Y V Y D S+G D L + ++ G
Sbjct: 235 ANVSCAAPACSDL----DVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-AVKGF 289
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGY 228
R FGCG +RN G AG+LGLG GK S+ +Q+ G V HCL R G GY
Sbjct: 290 R--FGCG--ERNDGLF-GEAAGLLGLGRGKTSL--PVQTYGKYGGVFAHCLPARSTGTGY 342
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSGSS 283
L G P++ TPM Y G + GG+ I I DSG+
Sbjct: 343 LDFGAGSPPAT--TTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTV 400
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
T AY + + + A L C+
Sbjct: 401 ITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCY 438
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 115/253 (45%), Gaps = 33/253 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACN 116
G Y + L IG PP Y +DTGSDL W QC PCT C P ++ P + V+C
Sbjct: 106 GEYLIELAIGTPPVSYPAVLDTGSDLIWTQCK-PCTRCYKQPTPIFDPKKSSSFSKVSCG 164
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CSA LP + +D C+Y Y D+ + GVL T+ F + + + FGC
Sbjct: 165 SSLCSA--LPSST---CSDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGC 219
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS---VRGGGYLFLGH 233
G + N G +G++GLG G S++SQL+ +CL+ L LG
Sbjct: 220 G--EDNEGDGFEQASGLVGLGRGPLSLVSQLK-----EQRFSYCLTPIDDTKESVLLLGS 272
Query: 234 --DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGG-------KSTGIKGLQ----IIFDS 280
+ + + TP+ ++ L+ + E + G KST G +I DS
Sbjct: 273 LGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDS 332
Query: 281 GSSYTYFNSQAYK 293
G++ TY +AY+
Sbjct: 333 GTTITYVQQKAYE 345
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 111/255 (43%), Gaps = 13/255 (5%)
Query: 59 PLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVA 114
P+ Y + IG PP DTGSDL WVQC APC C L+ P+ + V
Sbjct: 88 PITEYLMRFYIGTPPVERFAIADTGSDLIWVQC-APCEKCVPQNAPLFDPRKSSTFKTVP 146
Query: 115 CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C+ C+ + + QC Y+ +Y DH G+L + N ++ P+L F
Sbjct: 147 CDSQPCTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTF 206
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLSVRGGGYLFLGH 233
GC ++ + + G++GLG+G S++SQL +G + LS + G+
Sbjct: 207 GCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSNSTSKMRFGN 266
Query: 234 DLVPSS--GIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGIKGLQ----IIFDSGSSYTY 286
D + G+ TP+ + + +Y + G K Q I+ DSG+S+T
Sbjct: 267 DAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTI 326
Query: 287 FNSQAYKTTLDLMRK 301
Y + L+++
Sbjct: 327 LKQSFYNKFVALVKE 341
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 116/283 (40%), Gaps = 24/283 (8%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL- 112
+G+ G Y VT+ +G P Y + DTGSD TWVQC C E L+ P +
Sbjct: 152 SGSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSST 211
Query: 113 ---VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
++C P CS + I+ + C Y V Y D S+G D L + ++ G
Sbjct: 212 YANISCAAPACSDLY----IKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-AIKG 266
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGG 227
R FGCG +RN G AG+LGLG GK S+ +Q+ V HC R G G
Sbjct: 267 FR--FGCG--ERNEGLY-GEAAGLLGLGRGKTSL--PVQAYDKYGGVFAHCFPARSSGTG 319
Query: 228 YLFLGHDLVPS-SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSG 281
YL G +P+ S TPM D Y G + GGK I I DSG
Sbjct: 320 YLDFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSG 379
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
+ T AY + + + + L C+ T
Sbjct: 380 TVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFT 422
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 136/292 (46%), Gaps = 58/292 (19%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACN 116
G + + L IG PP+ Y +DTGSDL W QC PCT C P ++ P + ++C+
Sbjct: 98 GEFLMNLAIGTPPETYSAIMDTGSDLIWTQCK-PCTQCFDQPSPIFDPKKSSSFSKLSCS 156
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A LP++ +D C+Y Y D+ S+ G + T+ F T G + P + FGC
Sbjct: 157 SQLCKA--LPQS---SCSDSCEYLYTYGDYSSTQGTMATETF----TFGKVSIPNVGFGC 207
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ---------SLGLTRN---VLGHCLSVR 224
G + N G +G++GLG G S++SQL+ S+ T+ ++G SV
Sbjct: 208 G--EDNEGDGFTQGSGLVGLGRGPLSLVSQLKEAKFSYCLTSIDDTKTSTLLMGSLASVN 265
Query: 225 GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAE-LLFGGKSTGIK----GLQ---- 275
G S+ I TP+ ++ L+ + E + GG IK LQ
Sbjct: 266 G-----------TSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGT 314
Query: 276 --IIFDSGSSYTYFNSQAYKTTLDLMRKDLK---GKPLEDTAEEKALPVCWK 322
+I DSG++ TY A+ DL++K+ G P+ D + L +C+
Sbjct: 315 GGLIIDSGTTITYLEESAF----DLVKKEFTSQMGLPV-DNSGATGLELCYN 361
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 78/293 (26%), Positives = 118/293 (40%), Gaps = 23/293 (7%)
Query: 46 GSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL 105
GS +F GN + +Y+ + +G P + + +D GSDL WV C+ C C +
Sbjct: 89 GSQVIF--FGNEFNWLHYT-WIDLGTPSVPFLVALDVGSDLLWVPCD--CIQCAPLSANY 143
Query: 106 YHPKNNLVACNDP---------FCSAFHLPENIRCE-ANDQCDYEV-LYADHGSSLGVLV 154
Y + ++ +P FC + C+ AND C Y+ Y+D+ S+ G ++
Sbjct: 144 YSVLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKSANDPCTYKRDYYSDNTSTSGFMI 203
Query: 155 TDHFPL----RLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL 210
D L + SLL ++FGCG Q GV+GLG G S+ + L
Sbjct: 204 EDKLQLTSFSKHGTHSLLQASVVFGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQE 263
Query: 211 GLTRNVLGHCLSVRGGGYLFLGHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKST 269
GL RN C G G + G D + P+ + Y G G
Sbjct: 264 GLVRNTFSLCFDNNGSGRILFGDDGPATQQTTQFLPLFGEF--AAYFIGVESFCVGSSCL 321
Query: 270 GIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
G Q + DSGSS+TY ++ YK + K +K E C+
Sbjct: 322 QRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNATRIVLRELPWNYCYN 374
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 117/266 (43%), Gaps = 33/266 (12%)
Query: 60 LGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL---------YHPK 109
LG+ Y + +G P + + +DTGSDL W+ C C+ C + Y P
Sbjct: 100 LGFLYYANVSVGTPSLDFLVALDTGSDLFWLPCE--CSSCFTYLNTSNGGKFMLNHYSPN 157
Query: 110 NNL----VACNDPFCSAFHLPENIRCEANDQ-CDYEVLY-ADHGSSLGVLVTDHFPLRLT 163
++ V C C+ RC +N C YE+ Y + + SS+G LV D L T
Sbjct: 158 DSTTSSTVPCTSSLCN--------RCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA-T 208
Query: 164 NGSLLGP---RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC 220
+ SLL P ++ FGCG Q G++GLG+ K S+ S L GLT N C
Sbjct: 209 DDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMC 268
Query: 221 LSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDS 280
G G + G D P+ TP + L + Y+ + GG+ + IFDS
Sbjct: 269 FGADGYGRIDFG-DTGPADQ-KQTPFNTMLEYQSYNVTFNVINVGGEPNDVP-FTAIFDS 325
Query: 281 GSSYTYFNSQAYKTTLDLMRKDLKGK 306
G+S+TY AY T M +K K
Sbjct: 326 GTSFTYLTEPAYSTITKQMDAGMKLK 351
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 123/273 (45%), Gaps = 38/273 (13%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---- 104
FP+ G P +G Y +K+G PP+ + + IDTGSD+ WV C + C GC P S
Sbjct: 63 FPVKGTFDPSQVGLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGS-CNGC--PQTSGLQI 119
Query: 105 ---LYHPK----NNLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTD 156
+ P+ ++L++C+D C + + C + N+QC Y Y D + G V+D
Sbjct: 120 QLNYFDPRSSSTSSLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSD 179
Query: 157 --HFPLRLTNGSLL---GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSL 210
HF + G+L ++FGC Q K G+ G G S++SQL
Sbjct: 180 LMHFA-GIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQ 238
Query: 211 GLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS 268
G+ V HCL GGG L LG + P+ I ++P+ + + HY+ + G+
Sbjct: 239 GIAPRVFSHCLKGDNSGGGVLVLGEIVEPN--IVYSPLVQS--QPHYNLNLQSISVNGQI 294
Query: 269 TGI--------KGLQIIFDSGSSYTYFNSQAYK 293
I I DSG++ Y +AY
Sbjct: 295 VPIAPAVFATSNNRGTIVDSGTTLAYLAEEAYN 327
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 113/285 (39%), Gaps = 28/285 (9%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL- 112
+G G Y VT+ +G P Y + DTGSD TWVQC C E L+ P +
Sbjct: 173 SGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSST 232
Query: 113 ---VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
V+C P CS + R + C Y V Y D S+G D L + ++ G
Sbjct: 233 YANVSCAAPACSDLY----TRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSSYD-AVKG 287
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYL 229
R FGCG +RN G AG+LGLG GK S+ +Q+ V HCL R G
Sbjct: 288 FR--FGCG--ERNEGLF-GEAAGLLGLGRGKTSL--PVQTYDKYGGVFAHCLPARSSGTG 340
Query: 230 FLGHDLVPSSGIA-----WTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFD 279
+L D P S A TPM D Y G + GG+ I I D
Sbjct: 341 YL--DFGPGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFSTAGTIVD 398
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
SG+ T AY + + + + L C+ T
Sbjct: 399 SGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFT 443
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 78/158 (49%), Gaps = 17/158 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y V + IG PP +DTGSDL W QC+APC C P LY P + V+C P
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 119 FCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIFGC 176
C A P + RC D C Y Y D S+ GVL T+ F L GS R + FGC
Sbjct: 152 MCQALQSPWS-RCSPPDTGCAYYFSYGDGTSTDGVLATETFTL----GSDTAVRGVAFGC 206
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTR 214
G ++G++G+G G S++SQ LG+TR
Sbjct: 207 GTENLGSTDN---SSGLVGMGRGPLSLVSQ---LGVTR 238
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 127/286 (44%), Gaps = 51/286 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G + + L IG P + Y +DTGSDL W QC PC C P ++ P+ + + C+
Sbjct: 95 GEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK-PCKVCFDQPTPIFDPEKSSSFSKLPCS 153
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A + +D C+Y Y DH S+ GVL T+ F + S +G FGC
Sbjct: 154 SDLCVALPI-----SSCSDGCEYRYSYGDHSSTQGVLATETFTFGDASVSKIG----FGC 204
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS----VRGGGYLFLG 232
G + R G AG++GLG G S++SQ LG+ + +CL+ +G L +G
Sbjct: 205 GEDNR--GRAYSQGAGLVGLGRGPLSLISQ---LGVPK--FSYCLTSIDDSKGISTLLVG 257
Query: 233 HDLVPSSGIAWTPMSRD--------LLEKHYSSGPAEL-----LFGGKSTGIKGLQIIFD 279
+ S I TP+ ++ L + S G L F + G GL I D
Sbjct: 258 SEATVKSAIP-TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGL--IID 314
Query: 280 SGSSYTYFNSQAY----KTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
SG++ TY A+ K + M+ D+ D + L +C+
Sbjct: 315 SGTTITYLKDNAFAALKKEFISQMKLDV------DASGSTELELCF 354
>gi|213998796|gb|ACJ60765.1| nucellin [Hordeum marinum subsp. gussoneanum]
Length = 133
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/132 (40%), Positives = 72/132 (54%), Gaps = 5/132 (3%)
Query: 183 PGPKPP-PTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGYLFLGHDLVPSSG 240
P PP P G+LGLG+GKA +QL+ + T NV+GHCLS +G G L++G+ PS G
Sbjct: 1 PADSPPLPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGNFNPPSRG 60
Query: 241 IAWTPMSRDLLEKHYSSGPAELLFGGKST-GIKGLQIIFDSGSSYTYFNSQAYKTTLDLM 299
+ W PM +YS G AELL + G + +FDSGS+YT SQ Y + +
Sbjct: 61 VTWVPMRESSF--YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIYNEIVPKV 118
Query: 300 RKDLKGKPLEDT 311
R L L +
Sbjct: 119 RGTLSESSLAEV 130
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/300 (27%), Positives = 130/300 (43%), Gaps = 42/300 (14%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN- 111
+G + G Y + +G P + L +DTGSD+TW+QC APCT C ++L++P ++
Sbjct: 6 FSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQC-APCTNCYKQKDALFNPSSSS 64
Query: 112 ---LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL--RLTNGS 166
++ C+ C + + C +N +C Y+ Y D ++G LVTD+ L G
Sbjct: 65 SFKVLDCSSSLCLNLDV---MGCLSN-KCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQ 120
Query: 167 LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG 226
++ + GCG++ AG+LGLG G S + L + TRN+ +CL R
Sbjct: 121 VVLTNIPLGCGHDNEGTFGT---AAGILGLGRGPLSFPNNLDA--STRNIFSYCLPDRES 175
Query: 227 -----GYLFLGHDLVPSSG---IAWTPMSRD-LLEKHYSSGPAELLFGGK---------- 267
L G +P + + + P R+ + +Y + GG
Sbjct: 176 DPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVF 235
Query: 268 ---STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
S G G IFDSG++ T ++AY D R L A+ K C+ T
Sbjct: 236 QLDSHGNGG--TIFDSGTTITRLEARAYTAVRDAFRA--ATMHLTSAADFKIFDTCYDFT 291
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 107/252 (42%), Gaps = 28/252 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + +G+PP L +D+GSD+ WVQC PC C + L+ P + V+C
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCG 186
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C +CDY V Y D + G L + LT G + GC
Sbjct: 187 SAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALE----TLTLGGTAVQGVAIGC 242
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFLGH 233
G+ RN G AG+LGLG G S++ QL G V +CL+ R G G L LG
Sbjct: 243 GH--RNSGLF-VGAAGLLGLGWGAMSLIGQLG--GAAGGVFSYCLASRGAGGAGSLVLGR 297
Query: 234 DLVPSSGIAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIK-GL---------QIIFDSGS 282
G W P+ R + Y G + GG+ ++ GL ++ D+G+
Sbjct: 298 TEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGT 357
Query: 283 SYTYFNSQAYKT 294
+ T +AY
Sbjct: 358 AVTRLPREAYAA 369
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 80/293 (27%), Positives = 130/293 (44%), Gaps = 35/293 (11%)
Query: 11 LLVLLMFATFQGCFSEANQPPS----KKKSTQSTAAHRFGST--AVFPITGNVYPLGYYS 64
++VL + + F+ PP +S A+ R +T P V+ Y
Sbjct: 7 IIVLFLQISLCFLFTTTASPPHGFTMDLIHRRSNASSRVSNTQSGSSPYANTVFDNSVYL 66
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFH 124
+ L++G PP + IDTGS++TW QC PC C ++ P + S F
Sbjct: 67 MKLQVGTPPFEIQAIIDTGSEITWTQC-LPCVHCYEQNAPIFDPSKS---------STF- 115
Query: 125 LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGPRLIFGCGYNQRNP 183
+ RC+ + C YEV Y DH ++G L T+ L T+G + P I GCG+N N
Sbjct: 116 --KEKRCDGH-SCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHN--NS 170
Query: 184 GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHD-LVPSSGIA 242
K P +G++GL G +S+++Q+ G ++ +C S +G + G + +V G+
Sbjct: 171 WFK-PSFSGMVGLNWGPSSLITQMG--GEYPGLMSYCFSGQGTSKINFGANAIVAGDGVV 227
Query: 243 WTPMSRDLLEKHY--------SSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
T M + + S G + G + I+ DSG++ TYF
Sbjct: 228 STTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGTTLTYF 280
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 127/286 (44%), Gaps = 51/286 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G + + L IG P + Y +DTGSDL W QC PC C P ++ P+ + + C+
Sbjct: 95 GEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK-PCKVCFDQPTPIFDPEKSSSFSKLPCS 153
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A + +D C+Y Y DH S+ GVL T+ F + S +G FGC
Sbjct: 154 SDLCVALPI-----SSCSDGCEYRYSYGDHSSTQGVLATETFTFGDASVSKIG----FGC 204
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS----VRGGGYLFLG 232
G + R G AG++GLG G S++SQ LG+ + +CL+ +G L +G
Sbjct: 205 GEDNR--GRAYSQGAGLVGLGRGPLSLISQ---LGVPK--FSYCLTSIDDSKGISTLLVG 257
Query: 233 HDLVPSSGIAWTPMSRD--------LLEKHYSSGPAEL-----LFGGKSTGIKGLQIIFD 279
+ S I TP+ ++ L + S G L F + G GL I D
Sbjct: 258 SEATVKSAIP-TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGL--IID 314
Query: 280 SGSSYTYFNSQAY----KTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
SG++ TY A+ K + M+ D+ D + L +C+
Sbjct: 315 SGTTITYLKDSAFAALKKEFISQMKLDV------DASGSTELELCF 354
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 69/220 (31%), Positives = 106/220 (48%), Gaps = 25/220 (11%)
Query: 51 FPITGNVYPLG-YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK 109
FP+ P+ Y TL+IG PP+ + + IDTGSD+ WV C + C GC L + + P
Sbjct: 69 FPVERGTNPISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCIS-CVGCPLQNVTFFDPG 127
Query: 110 NN----LVACNDPFC-SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN 164
+ +AC+D C S H + +Y+V Y+D + G ++D
Sbjct: 128 ASSSAVKLACSDKRCFSDLH-----KKSGCSPLEYKVEYSDGSFTSGYYISDLISFETVM 182
Query: 165 GSLLGPR----LIFGCGYNQRNPGPKPPPTA--GVLGLGLGKASILSQLQSLGLTRNVLG 218
S L + +FGC N P T+ G++GLG G+ ++SQL S L V
Sbjct: 183 SSNLTVKSSAPFVFGCS-NLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFS 241
Query: 219 HCLS--VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYS 256
CLS GGG + LG + +P++ +TP+ R + HY+
Sbjct: 242 LCLSGGQEGGGVIILGENRLPNT--VYTPLVRS--QTHYN 277
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 121/285 (42%), Gaps = 36/285 (12%)
Query: 43 HRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP 102
H F S V +G+ G Y V +G PP+ + L +D+GSDL WVQC APC C
Sbjct: 48 HDFQSPVV---SGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQC-APCLQCYAQD 103
Query: 103 ESLYHPKN----NLVACNDPFCSAFHLPENIRCEAN--DQCDYEVLYADHGSSLGVLVTD 156
LY P N N V C P C E C+ + C YE YAD S GV +
Sbjct: 104 TPLYAPSNSSTFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYE 163
Query: 157 HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
T + ++ FGCG + + GVLGLG G S SQ+ N
Sbjct: 164 ----SATVDDVRIDKVAFGCGRDNQGSFAA---AGGVLGLGQGPLSFGSQVGY--AYGNK 214
Query: 217 LGHCLS-----VRGGGYLFLGHDLVPS-SGIAWTPM-SRDLLEKHYSSGPAELLFGGKST 269
+CL +L G +L+ + + +TP+ S Y +++ GG+S
Sbjct: 215 FAYCLVNYLDPTSVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESL 274
Query: 270 GIK----GLQI------IFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
I L IFDSG++ TY+ AY+ L K+++
Sbjct: 275 PISHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVR 319
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 72/213 (33%), Positives = 101/213 (47%), Gaps = 25/213 (11%)
Query: 41 AAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTL 100
A FGS V +G G Y V + +G+PP+ + ID+GSD+ WVQC PCT C
Sbjct: 115 AEEAFGSDVV---SGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCE-PCTQCYH 170
Query: 101 PPESLYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD 156
+ +++P ++ V+C CS H+ +N C +C YEV Y D + G L +
Sbjct: 171 QSDPVFNPADSSSYAGVSCASTVCS--HV-DNAGCH-EGRCRYEVSYGDGSYTKGTLALE 226
Query: 157 HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
LT G L + GCG++ + AG+LGLG G S + QL G
Sbjct: 227 ----TLTFGRTLIRNVAIGCGHHNQG---MFVGAAGLLGLGSGPMSFVGQLG--GQAGGT 277
Query: 217 LGHCLSVRG---GGYLFLGHDLVPSSGIAWTPM 246
+CL RG G L G + VP G AW P+
Sbjct: 278 FSYCLVSRGIQSSGLLQFGREAVP-VGAAWVPL 309
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 113/278 (40%), Gaps = 25/278 (8%)
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL-- 112
G G Y VT+ +G P Y + DTGSD TWVQC C E L+ P ++
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 231
Query: 113 --VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
V+C P CS ++ + C Y V Y D S+G D L + ++ G
Sbjct: 232 ANVSCAAPACSDL----DVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-AVKGF 286
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGY 228
R FGCG +RN G AG+LGLG GK S+ +Q+ G V HCL R G GY
Sbjct: 287 R--FGCG--ERNDGLF-GEAAGLLGLGRGKTSL--PVQTYGKYGGVFAHCLPPRSTGTGY 339
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSGSS 283
L G P++ TPM Y G + GG+ I I DSG+
Sbjct: 340 LDFGAGSPPAT--TTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTV 397
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
T AY + + + A L C+
Sbjct: 398 ITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCY 435
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 114/283 (40%), Gaps = 24/283 (8%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL- 112
+G G Y VT+ +G P Y + DTGSD TWVQC C E L+ P +
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230
Query: 113 ---VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
++C P CS + R + C Y V Y D S+G D L + ++ G
Sbjct: 231 YANISCAAPACSDL----DTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYD-AVKG 285
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGG 227
R FGCG +RN G AG+LGLG GK S+ +Q+ V HCL R G G
Sbjct: 286 FR--FGCG--ERNEGLF-GEAAGLLGLGRGKTSL--PVQTYDKYGGVFAHCLPARSSGTG 338
Query: 228 YLFLGHDLVPSSGIAW-TPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSG 281
YL G ++G TPM D Y G + GG+ I I DSG
Sbjct: 339 YLDFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSG 398
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
+ T AY + + + + L C+ T
Sbjct: 399 TVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFT 441
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 81/247 (32%), Positives = 113/247 (45%), Gaps = 28/247 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y ++L +G PP DTGSDL W QC PC C L+ PK++ ++C+
Sbjct: 91 GEYLMSLSLGTPPFEILAIADTGSDLIWTQC-TPCDKCYKQIAPLFDPKSSKTYRDLSCD 149
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGPRLIFG 175
C +L E+ C + C Y Y D + G L D L TNG + P+ + G
Sbjct: 150 TRQCQ--NLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIG 207
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL------SVRGGGY 228
CG +RN G +G++GLG G S++SQ+ S+G +CL S
Sbjct: 208 CG--RRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVG---GKFSYCLVPFSSESAGNSSK 262
Query: 229 LFLGHDLVPS-SGIAWTPMSRDLLEKHY-------SSGPAELLFGGKSTGIKGLQIIFDS 280
L G + V S SG+ TP+ + Y S G ++ FGG S G II DS
Sbjct: 263 LHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEGNIIIDS 322
Query: 281 GSSYTYF 287
G+S T F
Sbjct: 323 GTSLTLF 329
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 75/264 (28%), Positives = 115/264 (43%), Gaps = 19/264 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + IG+PP +DTGS L W+QC +PC C L+ P + C+
Sbjct: 87 GEYLMRFYIGSPPVERLAMVDTGSSLIWLQC-SPCHNCFPQETPLFEPLKSSTYKYATCD 145
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG--SLLGPRLIF 174
C+ P C QC Y ++Y D S+G+L T+ T G ++ P IF
Sbjct: 146 SQPCTLLQ-PSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIF 204
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL----SVRGGGYL 229
GCG + G+ GLG G S++SQL +G + +CL S
Sbjct: 205 GCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIG---HKFSYCLLPYDSTSTSKLK 261
Query: 230 FLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGK--STGIKGLQIIFDSGSSYTY 286
F ++ ++G+ TP+ + L +Y + G K STG I+ DSG+ TY
Sbjct: 262 FGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTPLTY 321
Query: 287 FNSQAYKTTLDLMRKDLKGKPLED 310
+ Y + +++ L K L+D
Sbjct: 322 LENTFYNNFVASLQETLGVKLLQD 345
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 66/205 (32%), Positives = 94/205 (45%), Gaps = 26/205 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------LYHPKNN-- 111
G Y ++IG+PPK Y + +DTGSD+ WV C C GC P S Y P +
Sbjct: 82 GLYYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGC--PTRSGLGIELTQYDPAGSGT 138
Query: 112 LVACNDPFC---SAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNG-- 165
V C FC SA +P C + C + + Y D ++ G VTD +G
Sbjct: 139 TVGCEQEFCVANSAGGVPPT--CPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNG 196
Query: 166 --SLLGPRLIFGCGYN-QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL- 221
+ + FGCG + G G+LG G +S+LSQL + R + HCL
Sbjct: 197 QTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLD 256
Query: 222 SVRGGGYLFLGHDLVPSSGIAWTPM 246
+VRGGG +G+ + P + TP+
Sbjct: 257 TVRGGGIFAIGNVVQPK--VKTTPL 279
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 83/253 (32%), Positives = 106/253 (41%), Gaps = 24/253 (9%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL- 112
+G G Y VT+ +G P Y + DTGSD TWVQC C E L+ P +
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230
Query: 113 ---VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
V+C P CS NI + C Y V Y D S+G D L + ++ G
Sbjct: 231 YANVSCAAPACSDL----NIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-AVKG 285
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGG 227
R FGCG +RN G AG+LGLG GK S+ +Q+ V HCL R G G
Sbjct: 286 FR--FGCG--ERNEGLF-GEAAGLLGLGRGKTSL--PVQTYDKYGGVFAHCLPARSTGTG 338
Query: 228 YLFLGH-DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSG 281
YL G L + TPM + Y G + GG+ I I DSG
Sbjct: 339 YLDFGAGSLAAARARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSG 398
Query: 282 SSYTYFNSQAYKT 294
+ T AY +
Sbjct: 399 TVITRLPPAAYSS 411
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 92/322 (28%), Positives = 136/322 (42%), Gaps = 37/322 (11%)
Query: 33 KKKSTQSTAAHRFGSTAVF-PITGNVYPL--------GYYSVTLKIGNPPKLYELDIDTG 83
+KK Q + R S + P + N+ PL G Y + L +G+PPK Y + +DTG
Sbjct: 82 RKKDVQGASFSRHKSGHLLEPNSANI-PLNPGLSIGSGNYYLKLGLGSPPKYYTMILDTG 140
Query: 84 SDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLP--ENIRCEANDQC 137
S L+W+QC C + L+ P + + C+ CS + C A+ C
Sbjct: 141 SSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSECSLLKAATLNDPLCTASGVC 200
Query: 138 DYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLG 197
Y Y D S+G L D L LT L P +GCG + K AG++GL
Sbjct: 201 VYTASYGDASYSMGYLSRDL--LTLTPSQTL-PSFTYGCGQDNEGLFGK---AAGIVGLA 254
Query: 198 LGKASILSQLQ-SLGLTRNVLGHCL---SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEK 253
K S+L+QL G +CL + GGG+L +G + S +TPM R+
Sbjct: 255 RDKLSMLAQLSPKYGY---AFSYCLPTSTSSGGGFLSIGK--ISPSSYKFTPMIRNSQNP 309
Query: 254 H-YSSGPAELLFGGKSTGI--KGLQI--IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
Y A + G+ G+ G Q+ I DSG+ T Y + K + +
Sbjct: 310 SLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTRLPISIYAALREAFVK-IMSRRY 368
Query: 309 EDTAEEKALPVCWKGTWKCLLG 330
E L C+KG+ K + G
Sbjct: 369 EQAPAYSILDTCFKGSLKSMSG 390
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 108/253 (42%), Gaps = 29/253 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCN---------APCTGCTLPPESLYHPKNN-- 111
Y +++G P + + +DTGSDL WV C+ A TG PP Y P+ +
Sbjct: 110 YYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANATGPDAPPLRPYSPRRSST 169
Query: 112 --LVACNDPFCSAFHLPENIRCEANDQCDYEVLYAD-HGSSLGVLVTDHFPLRLTN---- 164
VAC++P C + N C YEV Y + SS GVLV D L
Sbjct: 170 SEQVACDNPLCGRRN---GCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPGPG 226
Query: 165 --GSLLGPRLIFGCGYNQRNPGPKPPPTA--GVLGLGLGKASILSQLQSLGL-TRNVLGH 219
G L ++FGCG Q A G++GLG+GK S+ S L + GL +
Sbjct: 227 AAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKVSVPSALAASGLVASDSFSM 286
Query: 220 CLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFD 279
C G G + G S G A TP + L Y+ + G +S + + D
Sbjct: 287 CFGDDGVGRVNFGD--AGSRGQAETPFTVRSLNPTYNVSFTSIGIGSESVAAE-FAAVMD 343
Query: 280 SGSSYTYFNSQAY 292
SG+S+TY + Y
Sbjct: 344 SGTSFTYLSDPEY 356
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 129/301 (42%), Gaps = 36/301 (11%)
Query: 46 GSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL 105
GS + P GN + +Y+ + IG P + + +D+GSDL W+ CN C C P S
Sbjct: 83 GSKTISP--GNYFGWLHYT-WIDIGTPSVSFLVALDSGSDLLWIPCN--CVQCA-PLSSA 136
Query: 106 YHPKNNLVACN--DP--------FCSAFHLPENI-RCEA-NDQCDYEVLYA-DHGSSLGV 152
Y+ N DP F + L E+ CE+ +QC Y V YA ++ SS G+
Sbjct: 137 YYSSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACESPKEQCPYTVTYASENTSSSGL 196
Query: 153 LVTD--HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL 210
LV D H S + R++ GCG Q K GV+GLG G+ S+ S L
Sbjct: 197 LVEDVLHLAYSANASSSVKARVVVGCGEKQSGEFLKGIAPDGVMGLGPGEISVPSFLAKA 256
Query: 211 GLTRNVLGHCLSVRGGGYLFLGHDLVPSS--GIAWTPMSRDLLEKHYSSGPAELLFGGKS 268
GL RN C G ++ G D+ PS+ + P + + Y G G
Sbjct: 257 GLMRNSFSMCFDEEDSGRIYFG-DVGPSTQQSTRFLPYKNEFVA--YFVGVEVCCVGNSC 313
Query: 269 TGIKGLQIIFDSGSSYTYFNSQAYKTT-------LDLMRKDLKGKPLE---DTAEEKALP 318
+ DSG S+T+ + Y+ ++ K ++G P E +T+ E +P
Sbjct: 314 LKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKKIEGGPWEYCYETSFEPKVP 373
Query: 319 V 319
Sbjct: 374 A 374
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 105/252 (41%), Gaps = 28/252 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + +G+PP L +D+GSD+ WVQC PC C + L+ P + V+C
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCG 186
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C +CDY V Y D + G L + LT G + GC
Sbjct: 187 SAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALE----TLTLGGTAVQGVAIGC 242
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFLGH 233
G+ RN G AG+LGLG G S++ QL G V +CL+ R G G L LG
Sbjct: 243 GH--RNSGLF-VGAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAGGAGSLVLGR 297
Query: 234 DLVPSSGIAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIK----------GLQIIFDSGS 282
G W P+ R + Y G + GG+ ++ ++ D+G+
Sbjct: 298 TEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGT 357
Query: 283 SYTYFNSQAYKT 294
+ T +AY
Sbjct: 358 AVTRLPREAYAA 369
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 119/273 (43%), Gaps = 24/273 (8%)
Query: 35 KSTQSTAAHRFGSTAVFPIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
KS + RF + P+ G G Y V + G+P + Y + +DTGS L+W+QC
Sbjct: 89 KSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKP 148
Query: 94 PCTGCTLPPESLYHPKNNL----VACNDPFCSAF--HLPENIRCE-ANDQCDYEVLYADH 146
C + + L+ P + ++C CS+ N CE +++ C Y Y D
Sbjct: 149 CVVYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDS 208
Query: 147 GSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQ 206
S+G L D L L L P ++GCG + + AG+LGLG K S+L Q
Sbjct: 209 SYSMGYLSQDL--LTLAPSQTL-PGFVYGCGQDSDGLFGR---AAGILGLGRNKLSMLGQ 262
Query: 207 LQS-LGLTRNVLGHCLSVR-GGGYLFLGHDLVPSSGIAWTPMSRDLLEKH-YSSGPAELL 263
+ S G +CL R GGG+L +G + S +TPM+ D Y +
Sbjct: 263 VSSKFGY---AFSYCLPTRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAIT 319
Query: 264 FGGKSTGIKGLQ----IIFDSGSSYTYFNSQAY 292
GG++ G+ Q I DSG+ T Y
Sbjct: 320 VGGRALGVAAAQYRVPTIIDSGTVITRLPMSVY 352
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 107/248 (43%), Gaps = 29/248 (11%)
Query: 52 PITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN 111
P V+ Y + L++G PP +IDTGSDL W QC PC C ++ P +
Sbjct: 50 PYADTVFDYSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQC-MPCPNCYTQFAPIFDPSKS 108
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGP 170
S F + RC N C YE++YAD S G+L T+ ++ T+G +
Sbjct: 109 ---------STF---KEKRCHGN-SCPYEIIYADESYSTGILATETVTIQSTSGEPFVMA 155
Query: 171 RLIFGCGYNQRN---PGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
GCG N N PG ++G++GL +G +S++SQ+ ++ +C S +G
Sbjct: 156 ETSIGCGLNNSNLMTPG-YAASSSGIVGLNMGPSSLISQMDL--PIPGLISYCFSSQGTS 212
Query: 228 YLFLGHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGL-------QIIFD 279
+ G + +V G M + Y + G K G I D
Sbjct: 213 KINFGTNAVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFID 272
Query: 280 SGSSYTYF 287
SG++YTY
Sbjct: 273 SGTTYTYL 280
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/315 (26%), Positives = 123/315 (39%), Gaps = 34/315 (10%)
Query: 24 FSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTG 83
FS+ N+ T A F VF T ++ + + +G P + + +DTG
Sbjct: 23 FSDGNE-------TVRVDALGFFKVNVFMETCELFMRDLHYANVTVGTPSDWFMVALDTG 75
Query: 84 SDLTWVQCNAPCTGCTLPPES---------LYHPK----NNLVACNDPFCSAFHLPENIR 130
SDL W+ C+ CT C ++ +Y P + V CN C+ R
Sbjct: 76 SDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCT-----RGDR 128
Query: 131 CEA-NDQCDYEVLYADHG-SSLGVLVTD--HFPLRLTNGSLLGPRLIFGCGYNQRNPGPK 186
C + C Y++ Y +G SS GVLV D H + + R+ FGCG Q
Sbjct: 129 CASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHD 188
Query: 187 PPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPM 246
G+ GLGL S+ S L G+ N C G G + G S TP+
Sbjct: 189 GAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDK--GSVDQRETPL 246
Query: 247 SRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
+ Y+ ++ GG +TG +FDSG+S+TY AY + K
Sbjct: 247 NIRQPHPTYNITVTKISVGG-NTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDK 305
Query: 307 PLEDTAEEKALPVCW 321
+ T E C+
Sbjct: 306 RYQTTDSELPFEYCY 320
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 69/218 (31%), Positives = 97/218 (44%), Gaps = 28/218 (12%)
Query: 51 FPITGNVYP--LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP 108
FP+ G P +G Y +K+G PP+ + IDTGSD+ WV C + C GC P S
Sbjct: 63 FPVKGTFDPSQVGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGS-CNGC--PQTSGLQI 119
Query: 109 KNNLV-----------ACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTD 156
+ N +C D C + + C N+QC Y Y D + G V+D
Sbjct: 120 QLNYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSD 179
Query: 157 --HFPLRLTNGSLL---GPRLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSL 210
HF + G+L ++FGC Q K G+ G G S++SQL S
Sbjct: 180 LMHFA-SIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQ 238
Query: 211 GLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPM 246
G+ V HCL GGG L LG + P+ I ++P+
Sbjct: 239 GIAPRVFSHCLKGDNSGGGVLVLGEIVEPN--IVYSPL 274
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/275 (29%), Positives = 125/275 (45%), Gaps = 37/275 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G PPK + L +DTGSDL W+QC PC C E+ Y PK + + CN
Sbjct: 160 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQC-LPCYDCFHQNEAFYDPKTSASFKNITCN 218
Query: 117 DPFCSAFHLPE-NIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLT-----NGSLLG 169
DP CS PE ++C++++Q C Y Y D ++ G + F + LT +
Sbjct: 219 DPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKV 278
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---- 225
++FGCG+ R +G+LGLG G S SQLQS L + +CL R
Sbjct: 279 ENMMFGCGHWNRGLFSG---ASGLLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDTN 333
Query: 226 -GGYLFLGH--DLVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGKSTGIKGLQ---- 275
L G DL+ + + +T + +E Y +L GG++ I
Sbjct: 334 VSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNIS 393
Query: 276 ------IIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
I DSG++ +YF AY+ + + +K
Sbjct: 394 PDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMK 428
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 116/279 (41%), Gaps = 38/279 (13%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
++G G Y V +G P + + L +DTGSDL +VQC APC C LY P N+
Sbjct: 24 VSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQC-APCDLCYEQDGPLYQPSNSS 82
Query: 113 ----VACNDPFCSAFHLPENIRCEAN-------DQCDYEVLYADHGSSLGVLVTDHFPLR 161
V C+ C P C ++ C YE Y D+ S++GV +
Sbjct: 83 TFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYE----T 138
Query: 162 LTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
T G + + FGCG RN G GVLGLG G S SQ N +CL
Sbjct: 139 ATVGGIRVNHVAFGCG--NRNQG-SFVSAGGVLGLGQGALSFTSQAGY--AFENKFAYCL 193
Query: 222 S-----VRGGGYLFLGHDLVPS-SGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGIKGL 274
+ L G D++ + + +TP+ S L Y + FGG++ I
Sbjct: 194 TSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDS 253
Query: 275 Q----------IIFDSGSSYTYFNSQAYKTTLDLMRKDL 303
IFDSG++ TY++ QAY + K +
Sbjct: 254 AWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSV 292
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 81/289 (28%), Positives = 119/289 (41%), Gaps = 44/289 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G + + + IG P Y +DTGSDL W QC PC C ++ P + + + C+
Sbjct: 116 GEFLMDMSIGTPALAYAAIVDTGSDLVWTQCK-PCVECFNQSTPVFDPSSSSTYSTLPCS 174
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CS LP + A C Y Y D S+ GVL + F L T P + FGC
Sbjct: 175 SSLCS--DLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTK----LPGVAFGC 228
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---------GG 227
G N G AG++GLG G S++SQ LGL + +CL+ G
Sbjct: 229 G--DTNEGDGFTQGAGLVGLGRGPLSLVSQ---LGLGK--FSYCLTSLDDTSKSPLLLGS 281
Query: 228 YLFLGHDLVPSSGIAWTPMSRD--------LLEKHYSSGPAEL-----LFGGKSTGIKGL 274
+ D ++ I TP+ ++ + K + G + F + G G
Sbjct: 282 LAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGG- 340
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
+I DSG+S TY Q Y+ +K P+ D L +C+K
Sbjct: 341 -VIVDSGTSITYLELQGYRPLKKAFAAQMK-LPVAD-GSAVGLDLCFKA 386
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 115/283 (40%), Gaps = 24/283 (8%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL- 112
+G G Y VT+ +G P Y + DTGSD TWVQC C E L+ P +
Sbjct: 170 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSST 229
Query: 113 ---VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
V+C P C F L + R + C Y V Y D S+G D L + ++ G
Sbjct: 230 YANVSCAAPAC--FDL--DTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-AVKG 284
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGG 227
R FGCG +RN G AG+LGLG GK S+ +Q+ V HCL R G G
Sbjct: 285 FR--FGCG--ERNEGLF-GEAAGLLGLGRGKTSL--PVQTYDKYGGVFAHCLPARSSGTG 337
Query: 228 YLFLGHDLVPSSGIAW-TPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSG 281
YL G ++G TPM D Y G + GG+ I I DSG
Sbjct: 338 YLDFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSG 397
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
+ T AY + + + + L C+ T
Sbjct: 398 TVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFT 440
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 76/153 (49%), Gaps = 12/153 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG+PP+ + IDTGSDL W QC APC C P + P + + C+
Sbjct: 86 GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASLPCS 144
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C+A + P C N C Y+ Y D SS GVL + F + + PR+ FGC
Sbjct: 145 SAMCNALYSP---LCFQN-ACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGC 200
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
G N G +G++G G G S++SQL S
Sbjct: 201 G--NMNAG-TLFNGSGMVGFGRGALSLVSQLGS 230
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 127/292 (43%), Gaps = 57/292 (19%)
Query: 57 VYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE-----SLYHPKNN 111
V+ L Y + +GNP K Y + +DTGSD+ WV C C C + +LY P ++
Sbjct: 21 VHWLSLYFAKIGLGNPSKDYYVQVDTGSDILWVNC-IGCDKCPTKSDLGIKLTLYDPASS 79
Query: 112 L----VACNDPFCSAFH---LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN 164
+ V+C+D FC++ + LP+ C+ C Y V+Y D S+ G V+D
Sbjct: 80 VSATRVSCDDDFCTSTYNGLLPD---CKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVT 136
Query: 165 GSLL----GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC 220
G+L + FGCG Q GLG + ++L HC
Sbjct: 137 GNLQTGLSNGTVTFGCGAQQSG--------------GLGTSG-----EALDGILGAFAHC 177
Query: 221 L-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK---------STG 270
L +V GGG +G + P + TPM + + HY+ E+ GG +G
Sbjct: 178 LDNVNGGGIFAIGELVSPK--VNTTPMVPN--QAHYNVYMKEIEVGGTVLELPTDVFDSG 233
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
+ II DSG++ Y Y + ++ +R G L T EE+ +C+K
Sbjct: 234 DRRGTII-DSGTTLAYLPEVVYDSMMNEIRSQQPGLSLH-TVEEQF--ICFK 281
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 76/153 (49%), Gaps = 12/153 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG+PP+ + IDTGSDL W QC APC C P + P + + C+
Sbjct: 83 GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASLPCS 141
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C+A + P C N C Y+ Y D SS GVL + F + + PR+ FGC
Sbjct: 142 SAMCNALYSP---LCFQN-ACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGC 197
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
G N G +G++G G G S++SQL S
Sbjct: 198 G--NMNAG-TLFNGSGMVGFGRGALSLVSQLGS 227
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/249 (30%), Positives = 111/249 (44%), Gaps = 46/249 (18%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+++ L +D+GS +T+V C + C C + + P+
Sbjct: 91 GYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQPE----------M 139
Query: 121 SAFHLPE--NIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIF 174
S+ + P N+ C +D QC YE YA+H SS GVL D + N S L P R +F
Sbjct: 140 SSTYQPVKCNMDCNCDDDREQCVYEREYAEHSSSKGVLGEDL--ISFGNESQLTPQRAVF 197
Query: 175 GCG-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVR 224
GC Y+QR G++GLG G S++ QL GL N G C + V
Sbjct: 198 GCETVETGDLYSQR--------ADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVG 249
Query: 225 GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IF 278
GG + G D S + +T D +Y+ + GK + +
Sbjct: 250 GGSMILGGFDY--PSDMVFTDSDPD-RSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAVL 306
Query: 279 DSGSSYTYF 287
DSG++Y Y
Sbjct: 307 DSGTTYAYL 315
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 119/266 (44%), Gaps = 46/266 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC------TLPPE--SLYHPKNNL 112
GYY+ L IG PP+++ L +DTGS +T+V C+ C C PE S Y P
Sbjct: 82 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST-CEQCGRHQDPKFQPESSSTYQPVKCT 140
Query: 113 VACNDPFCSAFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP- 170
+ CN C+++ QC YE YA+ +S GVL D + N S L P
Sbjct: 141 IDCN--------------CDSDRMQCVYERQYAEMSTSSGVLGEDL--ISFGNQSELAPQ 184
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV------R 224
R +FGC N G++GLG G SI+ QL + +NV+ S+
Sbjct: 185 RAVFGCE-NVETGDLYSQHADGIMGLGRGDLSIMDQL----VDKNVISDSFSLCYGGMDV 239
Query: 225 GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IF 278
GGG + LG + P S +A+ S + +Y+ E+ GK + +
Sbjct: 240 GGGAMVLG-GISPPSDMAFA-YSDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVL 297
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLK 304
DSG++Y Y A+ D + K+L+
Sbjct: 298 DSGTTYAYLPEAAFLAFKDAIVKELQ 323
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/248 (32%), Positives = 113/248 (45%), Gaps = 44/248 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG PP+++ L +D+GS +T+V C + C C + + P+ L + P
Sbjct: 92 GYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQPE--LSSTYQPVK 148
Query: 120 CSAFHLPENIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFG 175
C N+ C +D QC YE YA+H SS GVL D + N S L P R +FG
Sbjct: 149 C-------NMDCNCDDDKEQCVYEREYAEHSSSKGVLGEDL--ISFGNESQLTPQRAVFG 199
Query: 176 CG-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRG 225
C Y+QR G++GLG G S++ QL GL N G C + V G
Sbjct: 200 CETVETGDLYSQR--------ADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDV-G 250
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFD 279
GG + LG PS I +T D +Y+ + GK + + D
Sbjct: 251 GGSMILGGFDYPSDMI-FTDSDPD-RSPYYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLD 308
Query: 280 SGSSYTYF 287
SG++Y Y
Sbjct: 309 SGTTYAYL 316
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 76/151 (50%), Gaps = 11/151 (7%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y +L++G P +++DTGSD +W+QC PC C E+L+ P + + C+
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCK-PCPDCYEQHEALFDPSKSSTYSDITCSSR 192
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
C C ++ +C YE+ YAD ++G L D L T+ P +FGCG+
Sbjct: 193 ECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDAV---PGFVFGCGH 249
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
N + G+LGLG GKAS+ SQ+ +
Sbjct: 250 NNAGSFGE---IDGLLGLGRGKASLSSQVAA 277
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 123/283 (43%), Gaps = 33/283 (11%)
Query: 33 KKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
++ S+ ++R I+G G Y V + +G+PP+ + ID+GSD+ WVQC
Sbjct: 110 RRLSSGGGGSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 169
Query: 93 APCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGS 148
PCT C + ++ P ++ V+C+ C EN C A +C YEV Y D
Sbjct: 170 -PCTQCYHQSDPVFDPADSASFTGVSCSSSVCDRL---ENAGCHAG-RCRYEVSYGDGSY 224
Query: 149 SLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
+ G L + LT G + + GCG+ R G+ G + S + QL
Sbjct: 225 TKGTLALE----TLTFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSM---SFVGQLG 277
Query: 209 SLGLTRNVLGHCLSVRG---GGYLFLGHDLVPSSGIAWTPMSRD-LLEKHYSSGPAELLF 264
G T +CL RG G L G + +P +G AW P+ R+ Y G A L
Sbjct: 278 --GQTGGAFSYCLVSRGTDSSGSLVFGREALP-AGAAWVPLVRNPRAPSFYYIGLAGLGV 334
Query: 265 GG----------KSTGIKGLQIIFDSGSSYTYFNSQAYKTTLD 297
GG + T + ++ D+G++ T + AY+ D
Sbjct: 335 GGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRD 377
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/262 (27%), Positives = 114/262 (43%), Gaps = 33/262 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL--VACNDP 118
G Y + + IG P +DTGSDL W +CN PCT C+ + V C
Sbjct: 40 GEYLIQMAIGTPALSLSAIMDTGSDLVWTKCN-PCTDCSTSSIYDPSSSSTYSKVLCQSS 98
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
C P C + C+Y Y D S+ G+L + F +++ SL P + FGCG+
Sbjct: 99 LCQP---PSIFSCNNDGDCEYVYPYGDRSSTSGILSDETF--SISSQSL--PNITFGCGH 151
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLSVRGGGY----LFLGH 233
+ + G++G G G S++SQL S+G N +CL R LF+G+
Sbjct: 152 DNQGFD----KVGGLVGFGRGSLSLVSQLGPSMG---NKFSYCLVSRTDSSKTSPLFIGN 204
Query: 234 DL-VPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS----TGIKGLQ------IIFDSGS 282
+ ++ + TP+ + HY + GG+S TG +Q +I DSG+
Sbjct: 205 TASLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGT 264
Query: 283 SYTYFNSQAYKTTLDLMRKDLK 304
+ T+ AY + M +
Sbjct: 265 TLTFLQQTAYDAVKEAMVSSIN 286
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 119/284 (41%), Gaps = 23/284 (8%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---LYHPKN----NLVAC 115
Y + + +G PP DTGSDL WV C++ G ++ P + ++C
Sbjct: 103 YLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLSC 162
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRL--TNGSLLGPRLI 173
C A C+A+ +C Y+ Y D ++GVL T+ F G + PR+
Sbjct: 163 QSNACQAL---SQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRVN 219
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRGGGYL 229
FGC + G++GLG G S++SQL + L +CL L
Sbjct: 220 FGC----STASAGTFRSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANSSSTL 275
Query: 230 FLGHDLVPSS-GIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFN 288
G V S G A TP+ ++ +Y+ + GG+ +II DSG++ T+ +
Sbjct: 276 NFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVATHDSRIIVDSGTTLTFLD 335
Query: 289 SQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCLLGNF 332
+ + + +K + ++ E+ L +C+ K NF
Sbjct: 336 PALLGPLVTELERRIKLQRVQ--PPEQLLQLCYDVQGKSETDNF 377
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 128/299 (42%), Gaps = 24/299 (8%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
++ S +S + S+ +PI+ Y Y + IG+P D+GS L W+QC
Sbjct: 70 ARGDSIRSIMSGNITSSMKYPISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQC 129
Query: 92 NAP-CTGCTLPPESLYHPKNNLV----ACNDPFCSAFHLPENIRCEANDQ-CDYEVLYAD 145
P C C L++P ++ CN C E RC+ +Q C Y Y D
Sbjct: 130 GTPYCRNCYRQKIPLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHEDYLD 189
Query: 146 HGSSLGVLVTD--HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASI 203
+ GV+ TD FP ++ R+IFGCGYN +P PP G++GL KAS+
Sbjct: 190 DSYTEGVISTDIFTFPEHISGFGNYTLRIIFGCGYNNSDPQHFYPP--GLVGLTNNKASL 247
Query: 204 LSQLQ--------SLGLTRNVLGHCLSVRGGGYLFLGH--DLVPSSGIAWTPMSRD--LL 251
+ Q+ S+ +N+ G G GH LVP+S + + D +
Sbjct: 248 VGQMDVDQFSYCVSIDTEQNLKGSMEIRFGLAASISGHSTQLVPNSDGWYIFKNVDGIYV 307
Query: 252 EKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
+ G +F G GL + D+G++YT ++ + L+ + + P +D
Sbjct: 308 NEFEVEGYPAWVFKYTEGGQGGLTM--DTGTTYTELHNSVMDPLIKLLEEHITIVPEKD 364
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 131/306 (42%), Gaps = 44/306 (14%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y T+ +G P K++ + DTGSDL W+QC PC C + ++ P+ + ++C
Sbjct: 38 GDYVTTISLGTPAKVFSVIADTGSDLIWIQCK-PCQACFNQKDPIFDPEGSSSYTTMSCG 96
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI-FG 175
D C + LP R + CDY Y D + G L ++ L T G L + I FG
Sbjct: 97 DTLCDS--LP---RKSCSPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFG 151
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-----SVRGGGYLF 230
CG+ R +G++GLG G S +SQL L + +CL + +F
Sbjct: 152 CGHLNRGSFND---ASGLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPSKTSPMF 206
Query: 231 LGHDLVPSSG-----IAWTPMSRD-LLEKHYSSGPAELLFGGKSTGIKGLQ--------- 275
G + S A+TPM + +E Y ++ G++ I
Sbjct: 207 FGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSG 266
Query: 276 -IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW-----KGTWKCLL 329
+IFDSG++ T Y+ L +R + ++ ++ L +C+ K ++K +
Sbjct: 267 GMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSA--GLDLCYDVSGSKASYKMKI 324
Query: 330 GNFEWH 335
+H
Sbjct: 325 PAMVFH 330
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 118/282 (41%), Gaps = 34/282 (12%)
Query: 52 PIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN 110
P+T G +Y G Y V L +G P + + +DTGSDL W+QC PC C + ++ P+N
Sbjct: 117 PVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRN 175
Query: 111 N----LVACNDPFCSAFHLPENIRCE----ANDQCDYEVLYADHGSSLGVLVTDHFPLRL 162
+ + C P C A + C A +C Y+V Y D S+G +D F L
Sbjct: 176 SSSFQRIPCLSPLCKALEIHS---CSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGT 232
Query: 163 TNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
+ ++ + FGCG++ G+ L S + + T N +CL
Sbjct: 233 GSKAM---SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLV 289
Query: 223 ------VRGGGYLFLGHDLVPSSGIAWTPMSRD-LLEKHYSSGPAELLFGGKS--TGIKG 273
R L G +PS+ A +P+ ++ L+ Y + + GG +K
Sbjct: 290 DRSNPMTRSSSSLIFGAAAIPSTA-ALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKS 348
Query: 274 LQ--------IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
LQ +I DSG+S T F + Y T D R P
Sbjct: 349 LQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLP 390
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 130/274 (47%), Gaps = 33/274 (12%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN-- 111
+G + G Y V + IG+P KL L +DTGSD+ W+QC +PC C ++++ P+ +
Sbjct: 5 SGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSS 63
Query: 112 --LVACNDPFCSAFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
++C+ P C L + C + D +C Y+V Y D ++G L +D F + S
Sbjct: 64 FRRLSCSTPQC---KLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTS-- 118
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY 228
++FGCG++ AG+LGLG GK S SQL S + ++ VR
Sbjct: 119 --PVVFGCGHDNEGLFVG---AAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSA 173
Query: 229 LFLGHDLVPSSG-IAWTPMSRD-LLEKHYSSGPAELLFGG-------------KSTGIKG 273
L G +P+S A+T + ++ L+ Y +G + + GG STG G
Sbjct: 174 LLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGG 233
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
+I DSG+S T + AY D R + P
Sbjct: 234 --VIIDSGTSVTRLPTYAYTVMRDAFRSATQKLP 265
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 107/261 (40%), Gaps = 37/261 (14%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y V + IG+PP L +D+GSD+ WVQC PC C + L+ P + + V+C
Sbjct: 123 GEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPASSATFSAVSCG 181
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C C + C+YEV Y D + G L + LT G + GC
Sbjct: 182 SAICRTLRTSG---CGDSGGCEYEVSYGDGSYTKGTLALE----TLTLGGTAVEGVAIGC 234
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG---------G 227
G+ R AG+LGLG G S++ QL +CL+ RGG G
Sbjct: 235 GHRNRG---LFVGAAGLLGLGWGPMSLVGQLGG--AAGGAFSYCLASRGGSGSGAADAAG 289
Query: 228 YLFLGHDLVPSSGIAWTPMSRD-LLEKHYSSGPAELLFGGKSTGIK----------GLQI 276
L LG G W P+ R+ Y G + + G + ++ G +
Sbjct: 290 SLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGV 349
Query: 277 IFDSGSSYTYFNSQAYKTTLD 297
+ D+G++ T +AY D
Sbjct: 350 VMDTGTAVTRLPQEAYAALRD 370
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/255 (30%), Positives = 113/255 (44%), Gaps = 34/255 (13%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL--YHPKNNL----VACND 117
+V+L +G PP+ + +DTGS+L+W+ C AP G S + P+ +L V C+
Sbjct: 67 TVSLAVGTPPQNVTMVLDTGSELSWLLC-APGGGGGGGGRSALSFRPRASLTFASVPCDS 125
Query: 118 PFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C + LP C+ A+ QC + YAD SS G L T+ F T G R FGC
Sbjct: 126 AQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVF----TVGQGPPLRAAFGC 181
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR-GGGYLFLGHDL 235
+ P TAG+LG+ G S +SQ TR +C+S R G L LGH
Sbjct: 182 MATAFDTSPDGVATAGLLGMNRGALSFVSQAS----TRR-FSYCISDRDDAGVLLLGHSD 236
Query: 236 VPSSGIAWTPMSRDLL------EKHYSSGPAELLFGGKSTGIKGL----------QIIFD 279
+P + +TP+ + + YS + GGK I Q + D
Sbjct: 237 LPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVD 296
Query: 280 SGSSYTYFNSQAYKT 294
SG+ +T+ AY
Sbjct: 297 SGTQFTFLLGDAYSA 311
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/274 (30%), Positives = 131/274 (47%), Gaps = 33/274 (12%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN-- 111
+G + G Y V + IG+P KL L +DTGSD+ W+QC +PC C ++++ P+ +
Sbjct: 5 SGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSS 63
Query: 112 --LVACNDPFCSAFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
++C+ P C L + C + D +C Y+V Y D ++G L +D F L +
Sbjct: 64 FRRLSCSTPQC---KLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSF---LVSRGRT 117
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY 228
P ++FGCG++ AG+LGLG GK S SQL S + ++ VR
Sbjct: 118 SP-VVFGCGHDNEGLFVG---AAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSA 173
Query: 229 LFLGHDLVPSSG-IAWTPMSRD-LLEKHYSSGPAELLFGG-------------KSTGIKG 273
L G +P+S A+T + ++ L+ Y +G + + GG STG G
Sbjct: 174 LLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGG 233
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
+I DSG+S T + AY D R + P
Sbjct: 234 --VIIDSGTSVTRLPTYAYTVMRDAFRSATQKLP 265
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 124/287 (43%), Gaps = 39/287 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y T+ +G P K++ + DTGSDL W+QC PC C + ++ P+ + ++C
Sbjct: 38 GDYVTTISLGTPAKVFSVIADTGSDLIWIQCK-PCQACFNQKDPIFDPEGSSSYTTMSCG 96
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI-FG 175
D C + LP C N CDY Y D + G L ++ L T G L + I FG
Sbjct: 97 DTLCDS--LPRK-SCSPN--CDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFG 151
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-----SVRGGGYLF 230
CG+ R +G++GLG G S +SQL L + +CL + +F
Sbjct: 152 CGHLNRGSFND---ASGLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPSKTSPMF 206
Query: 231 LGHDLVPSSG-----IAWTPMSRD-LLEKHYSSGPAELLFGGKSTGIKGLQ--------- 275
G + S A+TPM + +E Y ++ G++ I
Sbjct: 207 FGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSG 266
Query: 276 -IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
+IFDSG++ T Y+ L +R + ++ ++ L +C+
Sbjct: 267 GMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSS--AGLDLCY 311
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/256 (30%), Positives = 112/256 (43%), Gaps = 19/256 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC--TGCTLPPESLYHPKNN----LVA 114
G Y + + IG P DTGSDLTWVQC +PC T C LY P N+ L+
Sbjct: 94 GNYLMRIYIGTPSVERLAIADTGSDLTWVQC-SPCDNTKCFAQNTPLYDPLNSSTFTLLP 152
Query: 115 CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C+ C+ + + C C Y Y D+ S G L +D L L ++ F
Sbjct: 153 CDSQPCTQLPYSQYV-CSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHY-NSKICF 210
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL---SVRGGGYLF 230
GCG+ + K T G++GLG G S++SQL +G + +CL S L
Sbjct: 211 GCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIG---HKFSYCLLPFSSNSNSKLK 267
Query: 231 LGH-DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKS--TGIKGLQIIFDSGSSYTYF 287
G +V +G+ TP+ Y + G K+ TG II DSGS+ TY
Sbjct: 268 FGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGSTLTYL 327
Query: 288 NSQAYKTTLDLMRKDL 303
Y + L+++ +
Sbjct: 328 EESFYNEFVSLVKETV 343
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 122/276 (44%), Gaps = 46/276 (16%)
Query: 39 STAAHRFGSTAVFPI-TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG 97
S F S+ P+ GN G + + L IG P + Y +DTGSDL W QC PC
Sbjct: 76 SAKTASFESSVEAPVHAGN----GEFLMKLAIGTPAETYSAIMDTGSDLIWTQCK-PCKD 130
Query: 98 CTLPPESLYHPKN----NLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVL 153
C P ++ PK + + C+ C+A + +D C+Y Y D+ S+ GVL
Sbjct: 131 CFDQPTPIFDPKKSSSFSKLPCSSDLCAALPIS-----SCSDGCEYLYSYGDYSSTQGVL 185
Query: 154 VTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLT 213
T+ F + S +G FGCG + N G AG++GLG G S++SQ L
Sbjct: 186 ATETFAFGDASVSKIG----FGCG--EDNDGSGFSQGAGLVGLGRGPLSLISQ-----LG 234
Query: 214 RNVLGHCLS----VRGGGYLFLGHDLVPSSGIAWTPMSRD--------LLEKHYSSGPAE 261
+CL+ +G L +G + + I TP+ ++ L + S G
Sbjct: 235 EPKFSYCLTSMDDSKGISSLLVGSEATMKNAIT-TPLIQNPSQPSFYYLSLEGISVGDTL 293
Query: 262 L-----LFGGKSTGIKGLQIIFDSGSSYTYFNSQAY 292
L F ++ G GL I DSG++ TY A+
Sbjct: 294 LPIEKSTFSIQNDGSGGL--IIDSGTTITYLEDSAF 327
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/287 (26%), Positives = 121/287 (42%), Gaps = 27/287 (9%)
Query: 55 GNVYPLG-----YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYH-- 107
G+++P G Y + +G P + + +DTGSDL WV C+ C C P S YH
Sbjct: 89 GSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCD--CIQCA--PLSSYHGS 144
Query: 108 ---------PKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDH 157
P + + + P P + C Y + Y +++ +S G+L+ D
Sbjct: 145 LDRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDM 204
Query: 158 FPLRLTNG-SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
L G + + +I GCG Q + G+LGLG+ S+ S L GL RN
Sbjct: 205 LHLDSREGHAPVNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVRNS 264
Query: 217 LGHCLSVRGGGYLFLGHDLVPS-SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ 275
C G +F G VP+ + PM+ L + Y+ + G K T G Q
Sbjct: 265 FSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKL--QTYAVNVDKYCIGHKCTEGAGFQ 322
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
+ D+G+S+T AYK+ M D + ++++ + C+
Sbjct: 323 ALVDTGTSFTSLPLDAYKSI--TMEFDKQINASRASSDDYSFEYCYS 367
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 128/293 (43%), Gaps = 30/293 (10%)
Query: 47 STAVFPIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC-TGCTLPPES 104
S A P+T G +G Y + +G P K Y + +DTGS LTW+QC+ PC C
Sbjct: 100 SLASVPLTPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCS-PCRVSCHRQSGP 158
Query: 105 LYHPKNN----LVACNDPFCSAFHLP--ENIRCEANDQCDYEVLYADHGSSLGVLVTDHF 158
++ PK + V+C+ P C C ++ C Y+ Y D S+G L D
Sbjct: 159 VFDPKTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKD-- 216
Query: 159 PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNVL 217
++ G+ P +GCG + + +AG++GL K S+L QL +LG +
Sbjct: 217 --TVSFGANSVPNFYYGCGQDNEGLFGR---SAGLMGLARNKLSLLYQLAPTLGYS---F 268
Query: 218 GHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH-YSSGPAELLFGGK-----STG 270
+CL S GYL +G G ++TPM + L+ Y + + GK S+
Sbjct: 269 SYCLPSTSSSGYLSIGS--YNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSE 326
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
L I DSG+ T + Y + +KG + A L C++G
Sbjct: 327 YTSLPTIIDSGTVITRLPTSVYTALSKAVAAAMKGS-TKRAAAYSILDTCFEG 378
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 106/254 (41%), Gaps = 18/254 (7%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV----ACNDP 118
Y +T+ IG+P + +DTGSD++WVQC PC+ C +SL+ P + +C+
Sbjct: 131 YVITVGIGSPAVTQTMSMDTGSDVSWVQCK-PCSQCHSEVDSLFDPSASSTYSPFSCSSA 189
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
C + ++ QC Y V Y D S+ G +D LT GS FGC
Sbjct: 190 ACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSD----TLTLGSNAIKGFQFGC-- 243
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPS 238
+Q G T G++GLG S++S Q+ G +CL G FL
Sbjct: 244 SQSESGGFSDQTDGLMGLGGDAQSLVS--QTAGTFGKAFSYCLPPTPGSSGFLTLGAASR 301
Query: 239 SGIAWTPMSRDL-LEKHYSSGPAELLFGGKS----TGIKGLQIIFDSGSSYTYFNSQAYK 293
SG TPM R + +Y + GG+ T + + DSG+ T AY
Sbjct: 302 SGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGSVMDSGTVITRLPPTAYS 361
Query: 294 TTLDLMRKDLKGKP 307
+ +K P
Sbjct: 362 ALSSAFKAGMKKYP 375
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/287 (26%), Positives = 121/287 (42%), Gaps = 27/287 (9%)
Query: 55 GNVYPLG-----YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYH-- 107
G+++P G Y + +G P + + +DTGSDL WV C+ C C P S YH
Sbjct: 89 GSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCD--CIQCA--PLSSYHGS 144
Query: 108 ---------PKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDH 157
P + + + P P + C Y + Y +++ +S G+L+ D
Sbjct: 145 LDRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDM 204
Query: 158 FPLRLTNG-SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
L G + + +I GCG Q + G+LGLG+ S+ S L GL RN
Sbjct: 205 LHLDSREGHAPVNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVRNS 264
Query: 217 LGHCLSVRGGGYLFLGHDLVPS-SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ 275
C G +F G VP+ + PM+ L + Y+ + G K T G Q
Sbjct: 265 FSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKL--QTYAVNVDKYCIGHKCTEGAGFQ 322
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
+ D+G+S+T AYK+ M D + ++++ + C+
Sbjct: 323 ALVDTGTSFTSLPLDAYKSI--TMEFDKQINASRASSDDYSFEYCYS 367
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 80/286 (27%), Positives = 130/286 (45%), Gaps = 43/286 (15%)
Query: 35 KSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP 94
+ + ++++ F + P V+ Y + L+IG PP E +DTGS+ W QC P
Sbjct: 31 RRSNASSSRVFNTQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQC-LP 89
Query: 95 CTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVL 153
C C ++ P + S F + IRC+ +D C YE++Y + G L
Sbjct: 90 CVHCYNQTAPIFDPSKS---------STF---KEIRCDTHDHSCPYELVYGGKSYTKGTL 137
Query: 154 VTDHFPLRLTNGS-LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGL 212
VT+ + T+G + P I GCG N N G K P AGV+GL G S+++Q+ G
Sbjct: 138 VTETVTIHSTSGQPFVMPETIIGCGRN--NSGFK-PGFAGVVGLDRGPKSLITQMG--GE 192
Query: 213 TRNVLGHCLSVRGGGYLFLGHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTG- 270
++ +C + +G + G + +V G+ +S + K G L S G
Sbjct: 193 YPGLMSYCFAGKGTSKINFGANAIVAGDGV----VSTTVFVKTAKPGFYYLNLDAVSVGN 248
Query: 271 ------------IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
+KG I+ DSGS+ TYF ++ +L+RK ++
Sbjct: 249 TRIETVGTPFHALKG-NIVIDSGSTLTYFP----ESYCNLVRKAVE 289
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 80/286 (27%), Positives = 130/286 (45%), Gaps = 43/286 (15%)
Query: 35 KSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP 94
+ + ++++ F + P V+ Y + L+IG PP E +DTGS+ W QC P
Sbjct: 37 RRSNASSSRVFNTQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQC-LP 95
Query: 95 CTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVL 153
C C ++ P + S F + IRC+ +D C YE++Y + G L
Sbjct: 96 CVHCYNQTAPIFDPSKS---------STF---KEIRCDTHDHSCPYELVYGGKSYTKGTL 143
Query: 154 VTDHFPLRLTNGS-LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGL 212
VT+ + T+G + P I GCG N N G K P AGV+GL G S+++Q+ G
Sbjct: 144 VTETVTIHSTSGQPFVMPETIIGCGRN--NSGFK-PGFAGVVGLDRGPKSLITQMG--GE 198
Query: 213 TRNVLGHCLSVRGGGYLFLGHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTG- 270
++ +C + +G + G + +V G+ +S + K G L S G
Sbjct: 199 YPGLMSYCFAGKGTSKINFGANAIVAGDGV----VSTTVFVKTAKPGFYYLNLDAVSVGN 254
Query: 271 ------------IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
+KG I+ DSGS+ TYF ++ +L+RK ++
Sbjct: 255 TRIETVGTPFHALKG-NIVIDSGSTLTYFP----ESYCNLVRKAVE 295
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 136/308 (44%), Gaps = 36/308 (11%)
Query: 43 HRFGSTAVFPITGNVYPLGYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLP 101
H+ A IT N G Y ++ +G PP +LY + IDTGSD+ W+QC PC C
Sbjct: 69 HKAHKAAKATITQND---GEYLISYSVGIPPFQLYGI-IDTGSDMIWLQCK-PCEKCYNQ 123
Query: 102 PESLYHPKNNLVACNDPFCSAF-HLPENIRCEANDQ--CDYEVLYADHGSSLGVLVTDHF 158
++ P + PF S E+ C ++++ C+Y + Y D S G L +
Sbjct: 124 TTRIFDPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETL 183
Query: 159 PLRLTNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ----SLGLT 213
L TNGS + R + GCG N N ++G++GLG G S+++QL+ S+G
Sbjct: 184 TLGSTNGSSVKFRRTVIGCGRN--NTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRK 241
Query: 214 RNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPM-SRD------LLEKHYSSGPAELLFGG 266
+ +S F +V G TP+ + D L + +S G + F
Sbjct: 242 FSYCLASMSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTS 301
Query: 267 KS--TGIKGLQIIFDSGSSYTYFNSQAY----KTTLDLMRKDLKGKPLEDTAEEKALPVC 320
S G KG II DSG++ T + Y DL+ D PL K L +C
Sbjct: 302 SSFRFGEKG-NIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPL------KQLSLC 354
Query: 321 WKGTWKCL 328
++ T+ L
Sbjct: 355 YRSTFDEL 362
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/283 (27%), Positives = 122/283 (43%), Gaps = 50/283 (17%)
Query: 42 AHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP 101
+ RF + G+ Y + +G+P + +DTGSD+ W +C C GC+
Sbjct: 67 SRRFLLEVDLMLNGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCKL-CQGCSSK 125
Query: 102 -------------PESLYHPKNNLVA----CNDPFCSAFHLPENIRCEANDQ-CDYEVLY 143
P +LY P+ ++ A C+DP CS E C N+ C Y++ Y
Sbjct: 126 KNVIVCSSIIMQGPITLYDPELSITASPATCSDPLCS-----EGGSCRGNNNSCAYDISY 180
Query: 144 ADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASI 203
D SS G+ D + L + + L + GC + P G++G G K S+
Sbjct: 181 EDTSSSTGIYFRD--VVHLGHKASLNTTMFLGCATSISGLWP----VDGIMGFGRSKVSV 234
Query: 204 LSQLQSLGLTRNVLGHCLS--VRGGGYLFLG-HDLVPSSGIAWTPM-SRDLLEKHYSSGP 259
+QL + + N+ HCLS GGG L LG +D P + +TPM + D++ Y+
Sbjct: 235 PNQLAAQAGSYNIFYHCLSGEKEGGGILVLGKNDEFPE--MVYTPMLANDIV---YNVKL 289
Query: 260 AELLFGGKSTGIKGLQI-----------IFDSGSSYTYFNSQA 291
L K+ I+ + I DSG+S F S+A
Sbjct: 290 VSLSVNSKALPIEASEFEYNATVGNGGTIIDSGTSSATFPSKA 332
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 120/272 (44%), Gaps = 41/272 (15%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP-CTGCTLPPESLYHPKNN----LVACND 117
Y + IG+PP DTGS++ W+QC +P CT C L++P + + C
Sbjct: 108 YVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGH 167
Query: 118 PFC--SAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTD--HFPLRLTNGSLLGPRL 172
C + + L E + C+++ Q C Y + Y DH S G + TD FP + R+
Sbjct: 168 RECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRM 227
Query: 173 IFGCGYNQ-RNPGPKPPP--TAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV----RG 225
FGCGYN PG P GV+GLG AS++ Q LT +C+S +
Sbjct: 228 FFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQ-----LTLGQFSYCISTPDVQKP 282
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDL-------------LEKHYSSGPAELLFGGKSTGIK 272
G + + L S T ++ +L ++ G E +F GI
Sbjct: 283 NGTIEIRFGLAASISGHSTALANNLEGWYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGIG 342
Query: 273 GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
GL I DSG++YT + Y + LD + +LK
Sbjct: 343 GL--IMDSGTTYT----ELYFSALDALIGELK 368
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 128/281 (45%), Gaps = 41/281 (14%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG PPK Y L +DTGSDL W+QC PC C Y PK + + C+
Sbjct: 88 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQC-VPCHDCFEQNGPYYDPKESSSFRNIGCH 146
Query: 117 DPFCSAFHLPE-NIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTN--GSLLGPR- 171
DP C P+ + C+A +Q C Y Y D ++ G T+ F + LT+ G R
Sbjct: 147 DPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRV 206
Query: 172 --LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---- 225
++FGCG+ R +G+LGLG G S SQLQS L + +CL R
Sbjct: 207 ENVMFGCGHWNRGLFHG---ASGLLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDTN 261
Query: 226 -GGYLFLGH--DLVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGK------------ 267
L G DL+ + +T + + ++ Y ++ GG+
Sbjct: 262 VSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMT 321
Query: 268 STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
S G+ G I DSG++ +YF AY+ D K +KG P+
Sbjct: 322 SDGVGG--TIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPI 360
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 115/286 (40%), Gaps = 28/286 (9%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------- 104
T V LG+ + + +G P + + +DTGSDL W+ C+ CT C ++
Sbjct: 94 TVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDL 151
Query: 105 -LYHPK----NNLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHG-SSLGVLVTD- 156
+Y P + V CN C+ RC + C Y++ Y +G SS GVLV D
Sbjct: 152 NIYSPNASSTSTKVPCNSTLCT-----RGDRCASPESDCPYQIRYLSNGTSSTGVLVEDV 206
Query: 157 -HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRN 215
H + + R+ FGCG Q G+ GLGL S+ S L G+ N
Sbjct: 207 LHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAAN 266
Query: 216 VLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ 275
C G G + G S TP++ Y+ ++ GG +TG
Sbjct: 267 SFSMCFGNDGAGRISFGDK--GSVDQRETPLNIRQPHPTYNITVTKISVGG-NTGDLEFD 323
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
+FDSG+S+TY AY + K + T E C+
Sbjct: 324 AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCY 369
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 113/279 (40%), Gaps = 30/279 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES--LYHPKN---------N 111
Y VT+ +G P L IDTGSDL+WVQC APC T P+ L+ P N
Sbjct: 120 YVVTVGLGTPAVSQVLLIDTGSDLSWVQC-APCNSTTCYPQKDPLFDPSRSSTYAPIPCN 178
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR 171
AC D + QC Y + Y D + GV + L + G +
Sbjct: 179 TDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNET--LTMAPGVTV-KD 235
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG--GGYL 229
FGCG++Q P K G+LGLG S++ Q S + +CL G+L
Sbjct: 236 FHFGCGHDQDGPNDK---YDGLLGLGGAPESLVVQTSS--VYGGAFSYCLPAANDQAGFL 290
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ----IIFDSGSSYT 285
LG + +SG +TPM R+ + Y + GG+ + +I DSG+ T
Sbjct: 291 ALGAPVNDASGFVFTPMVRE-QQTFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVT 349
Query: 286 YFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
AY RK + PL E L C+ T
Sbjct: 350 ELQHTAYAALQAAFRKAMAAYPLLPNGE---LDTCYNFT 385
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 129/285 (45%), Gaps = 43/285 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACN 116
G Y + L IG PP Y +DTGSDL W QC PCT C P ++ P + V+C
Sbjct: 106 GEYLMELAIGTPPVSYPAVLDTGSDLIWTQCK-PCTQCYKQPTPIFDPKKSSSFSKVSCG 164
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CSA +P + +D C+Y Y D+ + GVL T+ F + + + FGC
Sbjct: 165 SSLCSA--VPSST---CSDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGC 219
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG---GYLFLGH 233
G + N G +G++GLG G S++SQL+ +CL+ L LG
Sbjct: 220 G--EDNEGDGFEQASGLVGLGRGPLSLVSQLK-----EPRFSYCLTPMDDTKESILLLGS 272
Query: 234 --DLVPSSGIAWTPMSRDLLEKHY--------SSGPAELLFGGKSTGIKGLQ----IIFD 279
+ + + TP+ ++ L+ + S G L KST G +I D
Sbjct: 273 LGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSI-EKSTFEVGDDGNGGVIID 331
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGK---PLEDTAEEKALPVCW 321
SG++ TY +A++ ++K+ + PL+ T+ L +C+
Sbjct: 332 SGTTITYIEQKAFEA----LKKEFISQTKLPLDKTS-STGLDLCF 371
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 74/273 (27%), Positives = 119/273 (43%), Gaps = 38/273 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y + L IG PP + DTGSDL W QC PCT C ++ P+++ + C
Sbjct: 60 YLMELSIGTPPIKIYAEADTGSDLVWFQC-IPCTKCYKQQNPMFDPRSSSSYTNITCGTE 118
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIFGCG 177
C+ L ++ C+Y YAD+ + GVL + L T G + + +IFGCG
Sbjct: 119 SCN--KLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCG 176
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL-------SVRGGGYL 229
+N + G++GLG G S++SQ+ SLG N+ CL S+
Sbjct: 177 HNNSGFNDRE---MGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNF 233
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG------------GKSTG-IKGLQI 276
G +++ + ++ +S+D +G L G G S G I I
Sbjct: 234 GKGSEVLGNGTVSTPLISKD------GTGYFATLLGISVEDINLPFSNGSSLGTITKGNI 287
Query: 277 IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLE 309
+ DSG++ TY + Y ++ +R + +P
Sbjct: 288 LIDSGTTITYLPEEFYHRLIEQVRNKVALEPFR 320
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 129/281 (45%), Gaps = 44/281 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTL-PPESLYHPKNNL----VAC 115
G Y V++++G+PP+ L DTGSDLTWV+C+A T C++ PP S + +++ C
Sbjct: 81 GQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHC 140
Query: 116 NDPFCSAFHLPENIRC---EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRL 172
C P C + C YE +Y+D + G + L ++G + +
Sbjct: 141 FSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLKS 200
Query: 173 I-FGCGYNQRNP---GPKPPPTAGVLGLGLGKASILSQL-QSLGLTRN--VLGHCLSVRG 225
I FGCG++ P G +GV+GLG G S SQL + G + + +L + LS
Sbjct: 201 IAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSPPP 260
Query: 226 GGYLFLGHDLVPS-----SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI---- 276
YL +G D+V + S +++TP+ L+ P K + G+++
Sbjct: 261 TSYLMIG-DVVSTKKDNKSMMSFTPL---LINPE---APTFYYISIKGVFVDGVKLHIDP 313
Query: 277 -------------IFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
+ DSG++ T+ AY+ L ++++K
Sbjct: 314 SVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVK 354
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 78/255 (30%), Positives = 112/255 (43%), Gaps = 34/255 (13%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL--YHPKNNL----VACND 117
+V+L +G PP+ + +DTGS+L+W+ C AP G S + P+ +L V C
Sbjct: 66 TVSLAVGTPPQNVTMVLDTGSELSWLLC-APGGGGGGGGRSALSFRPRASLTFASVPCGS 124
Query: 118 PFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C + LP C+ A+ QC + YAD SS G L T+ F T G R FGC
Sbjct: 125 AQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVF----TVGQGPPLRAAFGC 180
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR-GGGYLFLGHDL 235
+ P TAG+LG+ G S +SQ TR +C+S R G L LGH
Sbjct: 181 MATAFDTSPDGVATAGLLGMNRGALSFVSQAS----TRR-FSYCISDRDDAGVLLLGHSD 235
Query: 236 VPSSGIAWTPMSRDLL------EKHYSSGPAELLFGGKSTGIKGL----------QIIFD 279
+P + +TP+ + + YS + GGK I Q + D
Sbjct: 236 LPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVD 295
Query: 280 SGSSYTYFNSQAYKT 294
SG+ +T+ AY
Sbjct: 296 SGTQFTFLLGDAYSA 310
>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
Length = 362
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/210 (34%), Positives = 96/210 (45%), Gaps = 47/210 (22%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN---LVACN- 116
GYY+ L IG PP+++ L +D+GS +T+V C + C C L PK+ LV+C
Sbjct: 90 GYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQVMLSSPKDQILCLVSCKV 148
Query: 117 ----------------DPFCSAFHLPE--NIRCEAND---QCDYEVLYADHGSSLGVLVT 155
P S+ + P N+ C +D QC YE YA+H SS GVL
Sbjct: 149 QIFKISYGLFDEDPKFQPELSSTYQPVKCNMDCNCDDDKEQCVYEREYAEHSSSKGVLGE 208
Query: 156 DHFPLRLTNGSLLGP-RLIFGCG-------YNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
D + N S L P R +FGC Y+QR G++GLG G S++ QL
Sbjct: 209 DL--ISFGNESHLTPQRAVFGCKTVETGDLYSQR--------ADGIIGLGQGDLSLVGQL 258
Query: 208 QSLGLTRNVLGHC---LSVRGGGYLFLGHD 234
GL N G C L V GG + G D
Sbjct: 259 VDKGLISNSFGLCYGGLDVGGGSMIVGGFD 288
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 134/297 (45%), Gaps = 44/297 (14%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G PPK L +DTGSDL+W+QC+ PC C S Y+PK++ ++C
Sbjct: 169 GEYFLDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGSHYYPKDSSTYRNISCY 227
Query: 117 DPFCSAFHLPENIR-CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLT--NGSLLGPRL 172
DP C + ++ C+A +Q C Y YAD ++ G ++ F + LT NG ++
Sbjct: 228 DPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQV 287
Query: 173 I---FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-----VR 224
+ FGCG+ + +G+LGLG G S SQ+QS + + +CL+
Sbjct: 288 VDVMFGCGHWNKGFFYG---ASGLLGLGRGPISFPSQIQS--IYGHSFSYCLTDLFSNTS 342
Query: 225 GGGYLFLGHD--LVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGKSTGIK------- 272
L G D L+ + + +T + E Y ++ GG+ I
Sbjct: 343 VSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWS 402
Query: 273 --------GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
G I DSGS+ T+F AY + K +K + + A++ + C+
Sbjct: 403 SEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQI--AADDFVMSPCY 457
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/253 (33%), Positives = 107/253 (42%), Gaps = 24/253 (9%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL- 112
+G G Y VT+ +G P Y + DTGSD TWVQC C E L+ P +
Sbjct: 171 SGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230
Query: 113 ---VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
V+C P CS NI + C Y V Y D S+G D L + ++ G
Sbjct: 231 YANVSCAAPACSDL----NIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-AVKG 285
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGG 227
R FGCG +RN G AG+LGLG GK S+ +Q+ V HCL R G G
Sbjct: 286 FR--FGCG--ERNEGLF-GEAAGLLGLGRGKTSL--PVQTYDKYGGVFAHCLPARSTGTG 338
Query: 228 YL-FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSG 281
YL F L +S TPM D Y G + GG+ I I DSG
Sbjct: 339 YLDFGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSG 398
Query: 282 SSYTYFNSQAYKT 294
+ T AY +
Sbjct: 399 TVITRLPPAAYSS 411
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 88/331 (26%), Positives = 134/331 (40%), Gaps = 56/331 (16%)
Query: 22 GCFSEANQPPSKKKSTQSTAAHRFGST---------AVFP--ITGNVYPLGYYSVTLKIG 70
F Q ++K ST ST +T A+F ++G+ G Y V L++G
Sbjct: 7 AAFGRVLQEAAQKNSTNSTLPRESLATIQDFQGEDPALFSRLVSGSSIGSGQYFVELRVG 66
Query: 71 NPPKLYELDIDTGSDLTWVQCNAPCT--GCTLPPESLYHPKNNL----VACNDPFCSAFH 124
P K + L +DTGSDLTW+QCN P T + PP Y ++ + C D C
Sbjct: 67 TPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCTDDECQFLP 126
Query: 125 LPENIRCEAN--DQCDYEVLYADHGSSLGVLVTDHFPL--RLTNGSLLG---------PR 171
P C CDY Y+D + G+L + + R +G G
Sbjct: 127 APIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHKTRRIRIKN 186
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS--VRG---G 226
+ GC ++ + G +GVLGLG G S+ +Q + L + +CL +RG
Sbjct: 187 VALGC--SRESVGASFLGASGVLGLGQGPISLATQTRHTALG-GIFSYCLVDYLRGSNAS 243
Query: 227 GYLFLGHDLVPSSGIAWTPMSRDLLEKHY--------------SSGPAELLFGGKSTGIK 272
+L +G +A TP+ R+ + + G A +G G K
Sbjct: 244 SFLVMGR--THWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDWGIDGDGNK 301
Query: 273 GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDL 303
G IFDSG++ +Y AY L + +
Sbjct: 302 G--TIFDSGTTLSYLREPAYSKVLGALNASI 330
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 82/290 (28%), Positives = 125/290 (43%), Gaps = 40/290 (13%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL-----YHPKNN----LVA 114
+V+L +G PP+ + +DTGS+L+W+ C G + + P+ + V
Sbjct: 64 TVSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVP 123
Query: 115 CNDPFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
C CS+ LP C+ A+ QC + YAD +S G L TD F + G R
Sbjct: 124 CGSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAV----GEAPPLRSA 179
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR-GGGYLFLG 232
FGC + P TAG+LG+ G S ++Q TR +C+S R G L LG
Sbjct: 180 FGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQAS----TRR-FSYCISDRDDAGVLLLG 234
Query: 233 HDLVPSSGIAWTPMSRDLL------EKHYSSGPAELLFGGKSTGIKGL----------QI 276
H +P + +TP+ + L YS + GGK+ I Q
Sbjct: 235 HSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQT 294
Query: 277 IFDSGSSYTYFNSQAYKTTLDLMRKDLKG--KPLEDT--AEEKALPVCWK 322
+ DSG+ +T+ AY K K + L+D A ++AL C++
Sbjct: 295 MVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFR 344
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 107/247 (43%), Gaps = 32/247 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V++ +G P K Y + DTGSDL+WVQC PC C + L+ P + VAC
Sbjct: 147 GNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSSTYAAVACG 205
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
P C + C ++ +C YEV Y D + G LV D L ++ P +FGC
Sbjct: 206 APECQEL---DASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD---TLPGFVFGC 259
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQ---LQSLGLTRNVLGHCL--SVRGGGYLFL 231
G +N G G+ GLG K S+ SQ G T +CL S G GYL L
Sbjct: 260 G--DQNAGLF-GQVDGLFGLGREKVSLPSQGAPSYGPGFT-----YCLPSSSSGRGYLSL 311
Query: 232 GHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI------KGLQIIFDSGSSYT 285
G P + +T ++ Y + GG++ I + DSG+ T
Sbjct: 312 GG--APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVIT 369
Query: 286 YFNSQAY 292
+AY
Sbjct: 370 RLPPRAY 376
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 124/278 (44%), Gaps = 30/278 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+ + L +DTGS +T+V C+ C C + + DP
Sbjct: 81 GYYTTRLWIGTPPQQFALIVDTGSTVTYVPCST-CEQCGRHQDPKF----------DPES 129
Query: 121 SAFHLPE--NIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIF 174
S+ + P NI C + QC YE YA+ +S GVL D + N S L P R +F
Sbjct: 130 SSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGED--VISFGNQSELIPQRAVF 187
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLG 232
GC N G++GLG G S++ QL G + C GGG + LG
Sbjct: 188 GCE-NMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLG 246
Query: 233 HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK----STGIKGLQ--IIFDSGSSYTY 286
+ P S + +T S + +Y+ E+ GK S+GI + + DSG++Y Y
Sbjct: 247 -GISPPSDMIFT-YSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAY 304
Query: 287 FNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
++A+ D + ++ D + +C+ G
Sbjct: 305 LPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGA 342
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 133/298 (44%), Gaps = 47/298 (15%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT-LPPESLYHPKNN 111
I+G G Y V +++G PP+ L DTGSDL WV+C+A C C+ PP S + P+++
Sbjct: 78 ISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSA-CRNCSHHPPSSAFLPRHS 136
Query: 112 L----VACNDPFCSAF-HLPENI--RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN 164
C DP C H P ++ + C + YAD S G + L+ +
Sbjct: 137 SSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLS 196
Query: 165 GSLLGPR-LIFGCGYNQRNP---GPKPPPTAGVLGLGLGKASILSQL-QSLG--LTRNVL 217
GS + + L FGCG+ P G + GV+GLG G S SQL + G + ++
Sbjct: 197 GSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLM 256
Query: 218 GHCLSVRGGGYLFLG---HD--LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK 272
+ LS +L +G H L ++ I++TP+ + L P S I
Sbjct: 257 DYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLS------PTFYYITIHSITID 310
Query: 273 GLQI-----------------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAE 313
G+++ + DSG++ TY AY+ L +R+ +K L + AE
Sbjct: 311 GVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK---LPNAAE 365
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 125/278 (44%), Gaps = 30/278 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
GYY+ L IG PP+ + L +DTGS +T+V C+ C C + + P+++ + CN
Sbjct: 81 GYYTTRLWIGTPPQQFALIVDTGSTVTYVPCST-CEQCGRHQDPKFDPESSSTYKPIKCN 139
Query: 117 -DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIF 174
D C + + QC YE YA+ +S GVL D + N S L P R +F
Sbjct: 140 IDCICDSDGV----------QCVYERQYAEMSTSSGVLGED--VISFGNQSELIPQRAVF 187
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLG 232
GC N G++GLG G S++ QL G + C GGG + LG
Sbjct: 188 GCE-NMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLG 246
Query: 233 HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK----STGIKGLQ--IIFDSGSSYTY 286
+ P S + +T S + +Y+ E+ GK S+GI + + DSG++Y Y
Sbjct: 247 -GISPPSDMIFT-YSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAY 304
Query: 287 FNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
++A+ D + ++ D + +C+ G
Sbjct: 305 LPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGA 342
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 107/247 (43%), Gaps = 32/247 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V++ +G P K Y + DTGSDL+WVQC PC C + L+ P + VAC
Sbjct: 147 GNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSSTYAAVACG 205
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
P C + C ++ +C YEV Y D + G LV D L ++ P +FGC
Sbjct: 206 APECQEL---DASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD---TLPGFVFGC 259
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQ---LQSLGLTRNVLGHCL--SVRGGGYLFL 231
G +N G G+ GLG K S+ SQ G T +CL S G GYL L
Sbjct: 260 G--DQNAGLF-GQVDGLFGLGREKVSLPSQGAPSYGPGFT-----YCLPSSSSGRGYLSL 311
Query: 232 GHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI------KGLQIIFDSGSSYT 285
G P + +T ++ Y + GG++ I + DSG+ T
Sbjct: 312 GG--APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVIT 369
Query: 286 YFNSQAY 292
+AY
Sbjct: 370 RLPPRAY 376
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 77/151 (50%), Gaps = 10/151 (6%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y +TL +G+PP+ +++ +DTGSDL WVQC PC C P + P + AC
Sbjct: 37 GEYLMTLTLGSPPQSFDVIVDTGSDLNWVQC-LPCRVCYQQPGPKFDPSKSRSFRKAACT 95
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
D C+ LP ++ A + C Y+ Y D ++ G L + L G+ P FGC
Sbjct: 96 DNLCNVSALP--LKACAANVCQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFGC 153
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
G +N G AG++GLG G S+ SQL
Sbjct: 154 G--TQNLGTF-AGAAGLVGLGQGPLSLNSQL 181
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 121/270 (44%), Gaps = 54/270 (20%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
GYY+ L IG PP+ + L +DTGS +T+V C+ C C + + P+++ + CN
Sbjct: 86 GYYTTRLFIGTPPQEFALIVDTGSTVTYVPCST-CEQCGKHQDPRFQPESSSTYKPMQCN 144
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFG 175
P C+ + QC YE YA+ SS G+L D L N S L P R IFG
Sbjct: 145 -PSCNC--------DDEGKQCTYERRYAEMSSSSGLLAED--VLSFGNESELTPQRAIFG 193
Query: 176 CG-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY 228
C ++QR G++GLG G S++ QL + + V+G+ S+ GG
Sbjct: 194 CETVETGELFSQR--------ADGIMGLGRGPLSVVDQL----VIKEVVGNSFSLCYGGM 241
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKH--------YSSGPAELLFGGKSTGIKGLQI---- 276
+G +V + P D++ H Y+ EL GK +
Sbjct: 242 DVVGGAMV----LGNIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKH 297
Query: 277 --IFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
+ DSG++Y Y +A+ D + K++K
Sbjct: 298 GTVLDSGTTYAYLPEEAFVAFKDAIIKEIK 327
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 76/275 (27%), Positives = 117/275 (42%), Gaps = 34/275 (12%)
Query: 52 PIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN 110
P+T G +Y G Y V L +G P + + +DTGSDL W+QC PC C + ++ P+N
Sbjct: 42 PVTSGLLYGSGEYFVRLGLGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRN 100
Query: 111 N----LVACNDPFCSAFHLPENIRCE----ANDQCDYEVLYADHGSSLGVLVTDHFPLRL 162
+ + C P C A + C A +C Y+V Y D S+G +D F L
Sbjct: 101 SSSFQRIPCLSPLCKALEVHS---CSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGT 157
Query: 163 TNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
+ ++ + FGCG++ G+ L S + + T N +CL
Sbjct: 158 GSKAM---SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLV 214
Query: 223 ------VRGGGYLFLGHDLVPSSGIAWTPMSRD-LLEKHYSSGPAELLFGGKS--TGIKG 273
R L G +PS+ A +P+ ++ L+ Y + + GG +K
Sbjct: 215 DRSNPMTRSSSSLIFGVAAIPSTA-ALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKS 273
Query: 274 LQ--------IIFDSGSSYTYFNSQAYKTTLDLMR 300
LQ +I DSG+S T F + Y T D R
Sbjct: 274 LQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFR 308
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 84/276 (30%), Positives = 111/276 (40%), Gaps = 24/276 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y VT+ +G P Y + DTGSD TWVQC C E L+ P + ++C
Sbjct: 184 GNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCA 243
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
P CS + + + C Y V Y D S+G D L + ++ G R FGC
Sbjct: 244 APACSDLY----TKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-AIKGFR--FGC 296
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHD 234
G +RN G AG+LGLG GK S+ +Q+ V HC R G GYL G
Sbjct: 297 G--ERNEGLF-GEAAGLLGLGRGKTSL--PVQAYDKYGGVFAHCFPARSSGTGYLDFGPG 351
Query: 235 LVPS-SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSGSSYTYFN 288
P+ S TPM D Y G + GGK I I DSG+ T
Sbjct: 352 SSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDSGTVITRLP 411
Query: 289 SQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
AY + + + + L C+ T
Sbjct: 412 PAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFT 447
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 83/267 (31%), Positives = 116/267 (43%), Gaps = 48/267 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC------TLPPE--SLYHPKNNL 112
GYY+ L IG PP+++ L +DTGS +T+V C+ C C PE S Y P
Sbjct: 110 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST-CEQCGRHQDPKFQPESSSTYQPVKCT 168
Query: 113 VACNDPFCSAFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP- 170
+ CN C+ + QC YE YA+ +S GVL D + N S L P
Sbjct: 169 IDCN--------------CDGDRMQCVYERQYAEMSTSSGVLGED--VISFGNQSELAPQ 212
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRGGG 227
R +FGC N G++GLG G SI+ QL + + C + V GGG
Sbjct: 213 RAVFGCE-NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDV-GGG 270
Query: 228 YLFLGHDLVPSS-GIAWTPMSR------DLLEKHYSSGPAEL---LFGGKSTGIKGLQII 277
+ LG PS A++ R DL E H + L +F GK +
Sbjct: 271 AMVLGGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHG------TV 324
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLK 304
DSG++Y Y A+ D + K+L+
Sbjct: 325 LDSGTTYAYLPEAAFLAFKDAIVKELQ 351
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 127/287 (44%), Gaps = 50/287 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDP-- 118
G + + L IG PP+ Y +DTGSDL W QC PCT C P ++ PK +
Sbjct: 95 GEFLMKLAIGTPPETYSAIMDTGSDLIWTQCK-PCTQCFDQPTPIFDPKKSSSFSKLSCS 153
Query: 119 --FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A LP++ +D C+Y Y D+ S+ G+L ++ LT G + P + FGC
Sbjct: 154 SKLCEA--LPQST---CSDGCEYLYGYGDYSSTQGMLASE----TLTFGKVSVPEVAFGC 204
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS----VRGGGYLF-- 230
G + N G +G++GLG G S++SQL+ +CL+ + L
Sbjct: 205 G--EDNEGSGFSQGSGLVGLGRGPLSLVSQLK-----EPKFSYCLTSVDDTKASTLLMGS 257
Query: 231 LGHDLVPSSGIAWTPMSRDLLEKHY--------SSGPAEL-----LFGGKSTGIKGLQII 277
L S I TP+ ++ + + S G L F + G GL I
Sbjct: 258 LASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGL--I 315
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGK---PLEDTAEEKALPVCW 321
DSG++ TY A+ DL+ K+ + P+ D + L VC+
Sbjct: 316 IDSGTTITYLEQSAF----DLVAKEFTSQINLPV-DNSGSTGLEVCF 357
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 119/287 (41%), Gaps = 49/287 (17%)
Query: 24 FSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTG 83
F + + P+ +S+ +T +FG Y ++K+G+P + L +DTG
Sbjct: 76 FQQHTKNPAALRSSTTTLGRKFGE---------------YYTSIKLGSPGQEAILIVDTG 120
Query: 84 SDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP-FCSAFHLPENIRCEANDQCD 138
S+LTW+QC PC C +++Y + V CN+ CS C QC
Sbjct: 121 SELTWLQC-LPCKVCAPSVDTIYDAARSASYRPVTCNNSQLCSNSSQGTYAYCARGSQCQ 179
Query: 139 YEVLYADHGSSLGVLVTDHFPLRLTNGS--LLGPRLIFGCGYNQRNPGPKPPPTAGVLGL 196
+ Y D S G L TD + G + FGC Q + P +G+LGL
Sbjct: 180 FAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGCA--QGDLELVPTGASGILGL 237
Query: 197 GLGKASILSQL-QSLGLTRNVLGHCLSVRGG-----GYLFLGHDLVPSSGIAWTPMS--- 247
GK ++ QL Q G HC R G +F G+ +P + +T ++
Sbjct: 238 NAGKMALPMQLGQRFGWK---FSHCFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALTN 294
Query: 248 RDLLEKHY-------SSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
+L K Y S EL+F +G +I DSGSS++ F
Sbjct: 295 SELQRKFYHVALKGVSINSHELVFLP-----RGSVVILDSGSSFSSF 336
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/259 (28%), Positives = 107/259 (41%), Gaps = 24/259 (9%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPE---SL 105
T + LG+ + T+ +G P K + + +DTGSDL WV C+ AP G T + S+
Sbjct: 93 TFRISSLGFLHYTTVSLGTPGKKFLVALDTGSDLFWVPCDCSRCAPTEGTTYASDFELSI 152
Query: 106 YHPK----NNLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYAD-HGSSLGVLVTD--H 157
Y+PK + V CN+ C+ RC C Y V Y S+ G+LV D H
Sbjct: 153 YNPKGSSTSRKVTCNNSLCA-----HRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLH 207
Query: 158 FPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVL 217
+ + FGCG Q G+ GLGL K S+ S L G T +
Sbjct: 208 LTTEDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISVPSILSKEGFTADSF 267
Query: 218 GHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQII 277
C G G + G P TP + + L Y+ ++ G + +
Sbjct: 268 SMCFGPDGIGRISFGDKGGPDQ--EETPFNLNALHPTYNITVTQVRVGTTLIDLD-FTAL 324
Query: 278 FDSGSSYTYFNSQAYKTTL 296
FDSG+S+TY Y L
Sbjct: 325 FDSGTSFTYLVDPIYTNVL 343
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/253 (33%), Positives = 106/253 (41%), Gaps = 24/253 (9%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL- 112
+G G Y VT+ +G P Y + DTGSD TWVQC C E L+ P +
Sbjct: 169 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSST 228
Query: 113 ---VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
V+C P CS NI + C Y V Y D S+G D L + ++ G
Sbjct: 229 YANVSCAAPACSDL----NIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD-AVKG 283
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGG 227
R FGCG +RN G AG+LGLG GK S+ +Q+ V HCL R G G
Sbjct: 284 FR--FGCG--ERNEGLF-GEAAGLLGLGRGKTSL--PVQTYDKYGGVFAHCLPARSTGTG 336
Query: 228 YL-FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSG 281
YL F +S TPM D Y G + GG+ I I DSG
Sbjct: 337 YLDFGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAGTIVDSG 396
Query: 282 SSYTYFNSQAYKT 294
+ T AY +
Sbjct: 397 TVITRLPPPAYSS 409
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 65/190 (34%), Positives = 91/190 (47%), Gaps = 37/190 (19%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+++ L +D+GS +T+V C + C C + + P+
Sbjct: 91 GYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQPE----------M 139
Query: 121 SAFHLPE--NIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIF 174
S+ + P N+ C +D QC YE YA+H SS GVL D + N S L P R +F
Sbjct: 140 SSTYQPVKCNMDCNCDDDREQCVYEREYAEHSSSKGVLGEDL--ISFGNESQLTPQRAVF 197
Query: 175 GCG-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVR 224
GC Y+QR G++GLG G S++ QL GL N G C + V
Sbjct: 198 GCETVETGDLYSQR--------ADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVG 249
Query: 225 GGGYLFLGHD 234
GG + G D
Sbjct: 250 GGSMILGGFD 259
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/261 (28%), Positives = 114/261 (43%), Gaps = 36/261 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDP 118
Y +T+++G+P K + IDTGSD++WVQC PC+ C + L+ P + +C+
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCSSA 191
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC-- 176
C+ L + ++ QC Y V Y D S+ G +D L GS + FGC
Sbjct: 192 ACA--QLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLAL----GSNAVRKFQFGCSN 245
Query: 177 ---GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFL 231
G+N + T G++GLG G S++S Q+ G +CL + G+L L
Sbjct: 246 VESGFNDQ--------TDGLMGLGGGAQSLVS--QTAGTFGAAFSYCLPATSSSSGFLTL 295
Query: 232 GHDLVPSSGIAWTPMSR-DLLEKHYSSGPAELLFGGKS----TGIKGLQIIFDSGSSYTY 286
G +SG TPM R + Y + GG+ T + I DSG+ T
Sbjct: 296 G---AGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAGTIMDSGTVLTR 352
Query: 287 FNSQAYKTTLDLMRKDLKGKP 307
AY + +K P
Sbjct: 353 LPPTAYSALSSAFKAGMKQYP 373
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/275 (29%), Positives = 123/275 (44%), Gaps = 37/275 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G PPK + L +DTGSDL W+QC PC C Y PK + + CN
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQC-LPCYDCFHQNGMFYDPKTSASFKNITCN 216
Query: 117 DPFCSAFHLPE-NIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLT-----NGSLLG 169
DP CS P+ ++CE+++Q C Y Y D ++ G + F + LT +
Sbjct: 217 DPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKV 276
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY- 228
++FGCG+ R +G+LGLG G S SQLQS L + +CL R
Sbjct: 277 GNMMFGCGHWNRGLFSG---ASGLLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSNTN 331
Query: 229 ----LFLGH--DLVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGKSTGIKGLQ---- 275
L G DL+ + + +T + +E Y +L GGK+ I
Sbjct: 332 VSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNIS 391
Query: 276 ------IIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
I DSG++ +YF AY+ + + +K
Sbjct: 392 SDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMK 426
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 74/253 (29%), Positives = 114/253 (45%), Gaps = 22/253 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG PP+ + L +D+GS +T+V C A C C + + P +L + P
Sbjct: 86 GYYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP--DLSSTYSPVK 142
Query: 120 CSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
C+ + C+++ +QC YE YA+ SS GVL D T L R +FGC
Sbjct: 143 CNV-----DCTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFG-TESELKPQRAVFGC-E 195
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHDLV 236
N G++GLG G+ SI+ QL G+ + C GGG + LG
Sbjct: 196 NSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPA 255
Query: 237 PSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYFNSQ 290
P G+ +T S + +Y+ E+ GK+ + + DSG++Y Y Q
Sbjct: 256 P-PGMIYT-HSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQ 313
Query: 291 AYKTTLDLMRKDL 303
A+ D + +
Sbjct: 314 AFVAFKDAVSSQV 326
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/271 (30%), Positives = 114/271 (42%), Gaps = 31/271 (11%)
Query: 54 TGNVYPLGYYSVTL-KIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------L 105
T V LG+ L +G P + + +DTGSDL W+ C C GCT PP S
Sbjct: 88 TLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASF 145
Query: 106 YHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLY--ADHGSSLGVLVTD--H 157
Y P + V CN FC C C Y+++Y AD SS G LV D +
Sbjct: 146 YIPSLSSTSQAVPCNSDFCGL-----RKECSKTSSCPYKMVYVSADTSSS-GFLVEDVLY 199
Query: 158 FPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVL 217
T+ L +++FGCG Q G+ GLG+ S+ S L GLT N
Sbjct: 200 LSTEDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSF 259
Query: 218 GHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH--YSSGPAELLFGGKSTGIKGLQ 275
C G G + G SS TP+ D+ +KH Y+ + G ++ +
Sbjct: 260 SMCFGRDGIGRISFGDQ--GSSDQEETPL--DINQKHPTYAITITGIAVGNNLMDLE-VS 314
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
IFD+G+S+TY AY D ++
Sbjct: 315 TIFDTGTSFTYLADPAYTYITDGFHSQVQAN 345
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 115/270 (42%), Gaps = 25/270 (9%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPE---SL 105
T + LG+ + T+K+G P + + +DTGSDL WV C+ AP G T E S+
Sbjct: 97 TSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSI 156
Query: 106 YHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYAD-HGSSLGVLVTD--HF 158
Y+PK N V CN+ C+ N C Y V Y S+ G+L+ D H
Sbjct: 157 YNPKVSTTNKKVTCNNSLCAQ----RNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHL 212
Query: 159 PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
N + + FGCG Q G+ GLG+ K S+ S L GL +
Sbjct: 213 TTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFS 272
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG-LQII 277
C G G + G SS TP + + +Y+ + G +T I +
Sbjct: 273 MCFGHDGVGRISFGDK--GSSDQEETPFNLNPSHPNYNITVTRVRVG--TTLIDDEFTAL 328
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
FD+G+S+TY Y TT+ +D + P
Sbjct: 329 FDTGTSFTYLVDPMY-TTVSESAQDKRHSP 357
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/271 (30%), Positives = 114/271 (42%), Gaps = 31/271 (11%)
Query: 54 TGNVYPLGYYSVTL-KIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------L 105
T V LG+ L +G P + + +DTGSDL W+ C C GCT PP S
Sbjct: 88 TLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASF 145
Query: 106 YHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLY--ADHGSSLGVLVTD--H 157
Y P + V CN FC C C Y+++Y AD SS G LV D +
Sbjct: 146 YIPSLSSTSQAVPCNSDFCGL-----RKECSKTSSCPYKMVYVSADTSSS-GFLVEDVLY 199
Query: 158 FPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVL 217
T+ L +++FGCG Q G+ GLG+ S+ S L GLT N
Sbjct: 200 LSTEDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSF 259
Query: 218 GHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH--YSSGPAELLFGGKSTGIKGLQ 275
C G G + G SS TP+ D+ +KH Y+ + G ++ +
Sbjct: 260 SMCFGRDGIGRISFGDQ--GSSDQEETPL--DINQKHPTYAITITGIAVGNNLMDLE-VS 314
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
IFD+G+S+TY AY D ++
Sbjct: 315 TIFDTGTSFTYLADPAYTYITDGFHSQVQAN 345
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 74/253 (29%), Positives = 114/253 (45%), Gaps = 22/253 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG PP+ + L +D+GS +T+V C A C C + + P +L + P
Sbjct: 86 GYYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP--DLSSTYSPVK 142
Query: 120 CSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
C+ + C+++ +QC YE YA+ SS GVL D T L R +FGC
Sbjct: 143 CNV-----DCTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFG-TESELKPQRAVFGC-E 195
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHDLV 236
N G++GLG G+ SI+ QL G+ + C GGG + LG
Sbjct: 196 NSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPA 255
Query: 237 PSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYFNSQ 290
P G+ +T S + +Y+ E+ GK+ + + DSG++Y Y Q
Sbjct: 256 P-PGMIYT-HSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQ 313
Query: 291 AYKTTLDLMRKDL 303
A+ D + +
Sbjct: 314 AFVAFKDAVSSQV 326
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 73/250 (29%), Positives = 107/250 (42%), Gaps = 24/250 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+ + L +D+GS +T+V C A C C + + P +L + P
Sbjct: 87 GYYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP--DLSSSYSP-- 141
Query: 121 SAFHLPENIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
+ N+ C + QC YE YA+ SS GVL D L R +FGC
Sbjct: 142 ----VKCNVDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFG-RESELKAQRAVFGC- 195
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHDL 235
N G++GLG G+ SI+ QL G+ + C GGG + LG
Sbjct: 196 ENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGG-- 253
Query: 236 VPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYFNS 289
VP+ S L +Y+ E+ GK+ + + DSG++Y Y
Sbjct: 254 VPTPSDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGTTYAYLPE 313
Query: 290 QAYKTTLDLM 299
QA+ D +
Sbjct: 314 QAFMAFKDAV 323
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/255 (30%), Positives = 109/255 (42%), Gaps = 33/255 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCN---------APCTGCTLPPESLYHPKNN-- 111
Y +++G P + + +DTGSDL WV C+ A TG P Y P+ +
Sbjct: 108 YYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANGTGQDAPSLRPYSPRRSST 167
Query: 112 --LVACNDPFCSAFHLPENIRCEA--NDQCDYEVLYAD-HGSSLGVLVTDHFPLRLTN-- 164
VAC++P C + C A N C YEV Y + SS GVLV D L
Sbjct: 168 SKQVACDNPLCG-----QRNGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPG 222
Query: 165 ----GSLLGPRLIFGCGYNQRNPGPKPPPTA--GVLGLGLGKASILSQLQSLGL-TRNVL 217
G L ++FGCG Q A G++GLG+GK S+ S L + GL +
Sbjct: 223 PGAAGEALQAPVVFGCGQVQTGAFLDGGGGAVDGLMGLGMGKVSVPSALAASGLVASDSF 282
Query: 218 GHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQII 277
C G G + G S G A TP + L Y+ + G +S + +
Sbjct: 283 SMCFGDDGVGRVNFGD--AGSRGQAETPFTVRSLNPTYNVSFTSIGVGSESVAAE-FAAV 339
Query: 278 FDSGSSYTYFNSQAY 292
DSG+S+TY + Y
Sbjct: 340 MDSGTSFTYLSDPEY 354
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 85/181 (46%), Gaps = 24/181 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y V++ +G P + + DTGSDL+WVQC PC+ C + L+ P + V C
Sbjct: 144 GNYVVSMGLGTPARDMTVVFDTGSDLSWVQCT-PCSDCYEQKDPLFDPARSSTYSAVPCA 202
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
P C ++ C + +C YEV+Y D + G L D L LT +L P +FGC
Sbjct: 203 SPECQGL---DSRSCSRDKKCRYEVVYGDQSQTDGALARDT--LTLTQSDVL-PGFVFGC 256
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS---LGLTRNVLGHCL--SVRGGGYLFL 231
G + G++GLG K S+ SQ S G + +CL S GYL L
Sbjct: 257 GEQDTGLFGR---ADGLVGLGREKVSLSSQAASKYGAGFS-----YCLPSSPSAAGYLSL 308
Query: 232 G 232
G
Sbjct: 309 G 309
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 114/260 (43%), Gaps = 29/260 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL---YHPKNN----LV 113
GYY+ + IG P + + L +DTGS +T+V C++ CT C + P N+ V
Sbjct: 97 GYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSS-CTHCGHHQACFDPRFKPDNSSSYQTV 155
Query: 114 ACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR- 171
+CN P C C+A QC YE +YA+ SS GVL D L NGS L P
Sbjct: 156 SCNSPDCIT------KMCDARVHQCKYERVYAEMSSSKGVLGKDL--LGFGNGSRLQPHP 207
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS--VRGGGYL 229
L+FGC + G++GLG G SI+ QL G + C GGG +
Sbjct: 208 LLFGCETAETGD-LYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGSM 266
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG------LQIIFDSGSS 283
LG +P S +Y+ +E+ G S + L + DSG++
Sbjct: 267 VLG--AIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFNGRLGTVLDSGTT 324
Query: 284 YTYFNSQAYKTTLDLMRKDL 303
Y Y +A+ D + + L
Sbjct: 325 YAYLPDKAFDAFKDAITQQL 344
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 126/272 (46%), Gaps = 26/272 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y +++ IG PP Y DTGSDLTW QC PC C +++P + V CN
Sbjct: 90 GEYLMSVSIGTPPVDYLGIADTGSDLTWAQC-LPCLKCYQQLRPIFNPLKSTSFSHVPCN 148
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C H ++ C CDY Y D S G L + ++T GS + + GC
Sbjct: 149 TQTC---HAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFE----KITIGS-SSVKSVIGC 200
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLSV---RGGGYLFLG 232
G+ +GV+GLG G+ S++SQ+ Q+ G++R +CL G + G
Sbjct: 201 GHASSG---GFGFASGVIGLGGGQLSLVSQMSQTSGISRR-FSYCLPTLLSHANGKINFG 256
Query: 233 HDLVPSS-GIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-KGLQIIFDSGSSYTYFNS 289
+ V S G+ TP+ S++ + +Y + A + + K +I DSG++ T
Sbjct: 257 ENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQGNVIIDSGTTLTILPK 316
Query: 290 QAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
+ Y + + K +K K ++D +L +C+
Sbjct: 317 ELYDGVVSSLLKVVKAKRVKD--PHGSLDLCF 346
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 116/269 (43%), Gaps = 21/269 (7%)
Query: 56 NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----N 111
N Y +G + + + IG PP +DTGSDL W+QC APC GC + ++ P N
Sbjct: 62 NAY-IGQHLMEIYIGTPPIKITGLVDTGSDLIWIQC-APCLGCYKQIKPMFDPLKSSTYN 119
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-P 170
++C+ P C H + C +C+Y Y D+ + GVL D G +
Sbjct: 120 NISCDSPLC---HKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLS 176
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL----GLTRNVLGHCLSVRGG 226
R +FGCG+N N G G++GLG G S++SQ+ L ++ ++ ++
Sbjct: 177 RFLFGCGHN--NTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKIS 234
Query: 227 GYLFLGH-DLVPSSGIAWTPMSRDLLEKHYSSG----PAELLFGGKSTGIKGLQIIFDSG 281
+ G V +G+ TP+ + Y E + ++ I ++ DSG
Sbjct: 235 SRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNSTIGKANMLVDSG 294
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
+ Q Y +R + KP+ D
Sbjct: 295 TPPILLPQQLYDKVFAEVRNKVALKPITD 323
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/259 (29%), Positives = 114/259 (44%), Gaps = 43/259 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G + + + IG P Y +DTGSDL W QC PC C ++ P ++ V C+
Sbjct: 103 GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCS 161
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CS LP + +C + +C Y Y D S+ GVL T+ F L + P ++FGC
Sbjct: 162 SASCS--DLPTS-KCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LPGVVFGC 214
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV---RGGGYLFLGH 233
G N G AG++GLG G S++SQ LGL + +CL+ L LG
Sbjct: 215 G--DTNEGDGFSQGAGLVGLGRGPLSLVSQ---LGLDK--FSYCLTSLDDTNNSPLLLGS 267
Query: 234 ------DLVPSSGIAWTPMSRD--------LLEKHYSSGPAEL-----LFGGKSTGIKGL 274
+S + TP+ ++ + K + G + F + G G
Sbjct: 268 LAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG- 326
Query: 275 QIIFDSGSSYTYFNSQAYK 293
+I DSG+S TY Q Y+
Sbjct: 327 -VIVDSGTSITYLEVQGYR 344
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 112/269 (41%), Gaps = 24/269 (8%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPE---SL 105
T + LG+ + T+K+G P + + +DTGSDL WV C+ AP G T E S+
Sbjct: 95 TSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSI 154
Query: 106 YHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYAD-HGSSLGVLVTD--HF 158
Y+PK N V CN+ C+ N C Y V Y S+ G+L+ D H
Sbjct: 155 YNPKISTTNKKVTCNNSLCAQ----RNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHL 210
Query: 159 PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
N + + FGCG Q G+ GLG+ K S+ S L GL +
Sbjct: 211 TTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFS 270
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG-LQII 277
C G G + G SS TP + + +Y+ + G +T I +
Sbjct: 271 MCFGHDGVGRISFGDK--GSSDQEETPFNLNPSHPNYNITVTRVRVG--TTLIDDEFTAL 326
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
FD+G+S+TY Y T + + K
Sbjct: 327 FDTGTSFTYLVDPMYTTVSESFHSQAQDK 355
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/259 (28%), Positives = 113/259 (43%), Gaps = 43/259 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G + + + IG P Y +DTGSDL W QC PC C ++ P ++ V C+
Sbjct: 93 GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCS 151
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CS LP + +C + +C Y Y D S+ GVL T+ F L + P ++FGC
Sbjct: 152 SASCS--DLPTS-KCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LPGVVFGC 204
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---------GG 227
G N G AG++GLG G S++SQ LGL + +CL+ G
Sbjct: 205 G--DTNEGDGFSQGAGLVGLGRGPLSLVSQ---LGLDK--FSYCLTSLDDTNNSPLLLGS 257
Query: 228 YLFLGHDLVPSSGIAWTPMSRD--------LLEKHYSSGPAEL-----LFGGKSTGIKGL 274
+ +S + TP+ ++ + K + G + F + G G
Sbjct: 258 LAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG- 316
Query: 275 QIIFDSGSSYTYFNSQAYK 293
+I DSG+S TY Q Y+
Sbjct: 317 -VIVDSGTSITYLEVQGYR 334
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 63/196 (32%), Positives = 93/196 (47%), Gaps = 22/196 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + +G+PP+ + ID+GSD+ WVQC PCT C + L+ P ++ V+C+
Sbjct: 41 GEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCK-PCTQCYHQTDPLFDPADSASFMGVSCS 99
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C EN C + +C YEV Y D + G L + LT G + + GC
Sbjct: 100 SAVCDRV---ENAGCNSG-RCRYEVSYGDGSYTKGTLALE----TLTFGRTVVRNVAIGC 151
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---GGYLFLGH 233
G++ R G+ G + S + QL G T N +CL RG G+L G
Sbjct: 152 GHSNRGMFVGAAGLLGLGGGSM---SFMGQLS--GQTGNAFSYCLVSRGTNTNGFLEFGS 206
Query: 234 DLVPSSGIAWTPMSRD 249
+ +P G AW P+ R+
Sbjct: 207 EAMP-VGAAWIPLVRN 221
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 109/256 (42%), Gaps = 30/256 (11%)
Query: 54 TGNVYPLGYYSVTL-KIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE------SLY 106
T V LG+ L +G P + + +DTGSDL W+ C C GC P S Y
Sbjct: 92 TLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQ--CDGCPPPASGASGSASFY 149
Query: 107 HPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLY--ADHGSSLGVLVTDHFPL 160
P + V CN FC C C Y+++Y AD SS G LV D L
Sbjct: 150 IPSMSSTSQAVPCNSDFCD-----HRKDCSTTSSCPYKMVYVSADTSSS-GFLVEDVLYL 203
Query: 161 RLTNG--SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
+ +L +++FGCG Q G+ GLG+ S+ S L GLT +
Sbjct: 204 STEDNHPQILKAQIMFGCGQVQTGSFLDAAAPNGLFGLGIDMISVPSILAHKGLTSDSFS 263
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH--YSSGPAELLFGGKSTGIKGLQI 276
C G G + G SS TP+ D+ +KH Y+ + G + ++
Sbjct: 264 MCFGRDGIGRISFGDQ--GSSDQEETPL--DINQKHPTYAITITGITVGTEPMDLE-FST 318
Query: 277 IFDSGSSYTYFNSQAY 292
IFD+G+++TY AY
Sbjct: 319 IFDTGTTFTYLADPAY 334
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 107/259 (41%), Gaps = 24/259 (9%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPE---SL 105
T + LG+ + T+ +G P K + + +DTGSDL WV C+ AP G T + S+
Sbjct: 93 TFRISSLGFLHYTTVSLGTPGKKFLVALDTGSDLFWVPCDCSRCAPTEGTTYASDFELSI 152
Query: 106 YHPK----NNLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYAD-HGSSLGVLVTD--H 157
Y+PK + V C++ C+ RC C Y V Y S+ G+LV D H
Sbjct: 153 YNPKGSSTSRKVTCDNSLCA-----HRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLH 207
Query: 158 FPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVL 217
+ + FGCG Q G+ GLGL K S+ S L G T +
Sbjct: 208 LTTEDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISVPSILSKEGFTADSF 267
Query: 218 GHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQII 277
C G G + G P TP + + L Y+ ++ G + +
Sbjct: 268 SMCFGPDGIGRISFGDKGSPDQ--EETPFNLNALHPTYNITVTQVRVGTTLIDLD-FTAL 324
Query: 278 FDSGSSYTYFNSQAYKTTL 296
FDSG+S+TY Y L
Sbjct: 325 FDSGTSFTYLVDPIYTNVL 343
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 80/287 (27%), Positives = 122/287 (42%), Gaps = 41/287 (14%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT--GCTLPPESLYHPKN 110
++G+ G Y V L++G P K + L IDTGSDLTW+QCN P T + PP Y +
Sbjct: 17 VSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSS 76
Query: 111 NL----VACNDPFCSAFHLPENIRC--EANDQCDYEVLYADHGSSLGVLVTDHFPL--RL 162
+ + C D C P C ++ CDY Y+D + G+L + + R
Sbjct: 77 SSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 136
Query: 163 TNGSLLG---PRLI----FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRN 215
+G G R I G ++ + G +GVLGLG G S+ +Q + L
Sbjct: 137 RSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALG-G 195
Query: 216 VLGHCLS--VRG---GGYLFLGHDLVPSSGIAWTPMSRDLLEKHY--------------S 256
+ +CL +RG +L +G +A TP+ R+ + +
Sbjct: 196 IFSYCLVDYLRGSNASSFLVMGR--TRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPV 253
Query: 257 SGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDL 303
G A +G G KG IFDSG++ +Y AY L + +
Sbjct: 254 DGIASSDWGIDGDGNKG--TIFDSGTTLSYLREPAYSKVLGALNASI 298
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 123/281 (43%), Gaps = 37/281 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y + + IG P + +DTGSDL W QC PCT C P +++P++ + + C
Sbjct: 94 GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE-PCTQCFSQPTPIFNPQDSSSFSTLPCE 152
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
+C LP C N++C Y Y D ++ G + T+ F ++ P + FGC
Sbjct: 153 SQYCQ--DLPSE-TCN-NNECQYTYGYGDGSTTQGYMATETFTFETSS----VPNIAFGC 204
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG---GYLFLGH 233
G + N G AG++G+G G S+ SQ LG+ + +C++ G L LG
Sbjct: 205 G--EDNQGFGQGNGAGLIGMGWGPLSLPSQ---LGVGQ--FSYCMTSYGSSSPSTLALGS 257
Query: 234 DL--VPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG----LQ------IIFDSG 281
VP + T + L +Y + GG + GI LQ +I DSG
Sbjct: 258 AASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSG 317
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
++ TY AY + +++++ L C++
Sbjct: 318 TTLTYLPQDAYNAVAQAFTDQINLPTVDESS--SGLSTCFQ 356
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/271 (27%), Positives = 112/271 (41%), Gaps = 36/271 (13%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYH----------PKNNL---- 112
+ IG P + + +D GSDL WV C+ C C S Y P +L
Sbjct: 104 IDIGTPSTSFLVALDAGSDLLWVPCD--CIHCAPLSASFYSNLDRDLNEYSPSRSLSSKH 161
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDHFPLR-----LTNGS 166
++C+ C + N + QC Y + Y +D+ SS G+LV D F L+ +N S
Sbjct: 162 LSCSHRLCD---MGSNCKTSKQQQCPYTINYLSDNTSSSGLLVEDIFHLQSGDGSTSNSS 218
Query: 167 LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG 226
+ P ++ GCG Q G++GLG G++S+ S L GL R+ C +
Sbjct: 219 VQAP-VVVGCGMKQSGGYLDGTAPDGLIGLGPGESSVPSFLAKSGLIRDSFSLCFNEDDS 277
Query: 227 GYLFLGHDLVPSSGIAWTP-MSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYT 285
G LF G S+ TP + D + Y G G + FDSG+S+T
Sbjct: 278 GRLFFGDQ--GSTVQQSTPFLLVDGMFSTYIVGVETCCIGNSCPKVTSFNAQFDSGTSFT 335
Query: 286 YFNSQAY-------KTTLDLMRKDLKGKPLE 309
+ AY ++ R +G P E
Sbjct: 336 FLPGHAYGAIAEEFDKQVNATRSTFQGSPWE 366
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 121/289 (41%), Gaps = 31/289 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-LYHPKN----NLVACND 117
Y + + +G PP DTGSDL WV C++ G + ++HP +L++C
Sbjct: 100 YLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLSCQS 159
Query: 118 PFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL----RLTNGSLLGPRLI 173
C A C+A+ +C Y+ Y D ++GVL T+ F G + PR+
Sbjct: 160 AACQAL---SQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVPRVS 216
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-----SVRGGGY 228
FGC + G++GLG G S++SQL + +CL +
Sbjct: 217 FGCSTGSAGSF----RSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANSSST 272
Query: 229 LFLGHDLVPSS-GIAWTPMSRDLLEKHYSSGPAELLFGGKSTG-IKGLQIIFDSGSSYTY 286
L G V S G A TP+ ++ +Y+ + G+ +II DSG++ T+
Sbjct: 273 LSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVASANSSRIIVDSGTTLTF 332
Query: 287 FNSQAYKTTLDLMRKDL---KGKPLEDTAEEKALPVCWKGTWKCLLGNF 332
+ + + + + + + +P E+ L +C+ K +F
Sbjct: 333 LDPALLRPLVAELERRIRLPRAQP-----PEQLLQLCYDVQGKSQAEDF 376
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/302 (27%), Positives = 130/302 (43%), Gaps = 27/302 (8%)
Query: 29 QPPSKKKSTQSTAAHRFGSTAV-FPITGNVYP-LGYYSVTLKIGNPPKLYELDIDTGSDL 86
Q PS++ Q AA S+AV P++ Y G Y V + +G P + + L DTGS+L
Sbjct: 55 QLPSRRGGRQRVAAEVASSSAVSLPMSSGAYAGTGQYFVKVLVGTPAQEFTLVADTGSEL 114
Query: 87 TWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENI-RCEANDQ-CDYE 140
TWV+ C G PP ++ P+ + V C+ C +P ++ C ++ C Y+
Sbjct: 115 TWVK----CAGGASPPGLVFRPEASKSWAPVPCSSDTCK-LDVPFSLANCSSSASPCSYD 169
Query: 141 VLYAD-HGSSLGVLVTDHFPLRLTNGSLLGPR-LIFGCGYNQRNPGPKPPPTAGVLGLGL 198
Y + +LGV+ TD + L G + + ++ GC + + G GVL LG
Sbjct: 170 YRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLGC--SSTHDGQSFKSVDGVLSLGN 227
Query: 199 GKASILSQLQSL---GLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHY 255
K S S+ + + ++ H GYL G VP + T + D Y
Sbjct: 228 AKISFASRAAARFGGSFSYCLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPAMPFY 287
Query: 256 SSGPAELLFGGKSTGI-------KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
+ G++ I K +I DSG++ T + AYK + + K L G P
Sbjct: 288 GVKVDAVHVAGQALDIPAEVWDPKSGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPK 347
Query: 309 ED 310
D
Sbjct: 348 VD 349
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/259 (29%), Positives = 114/259 (44%), Gaps = 43/259 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G + + + IG P Y +DTGSDL W QC PC C ++ P ++ V C+
Sbjct: 72 GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCS 130
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CS LP + +C + +C Y Y D S+ GVL T+ F L + P ++FGC
Sbjct: 131 SASCS--DLPTS-KCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LPGVVFGC 183
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV---RGGGYLFLGH 233
G N G AG++GLG G S++SQ LGL + +CL+ L LG
Sbjct: 184 G--DTNEGDGFSQGAGLVGLGRGPLSLVSQ---LGLDK--FSYCLTSLDDTNNSPLLLGS 236
Query: 234 ------DLVPSSGIAWTPMSRD--------LLEKHYSSGPAEL-----LFGGKSTGIKGL 274
+S + TP+ ++ + K + G + F + G G
Sbjct: 237 LAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG- 295
Query: 275 QIIFDSGSSYTYFNSQAYK 293
+I DSG+S TY Q Y+
Sbjct: 296 -VIVDSGTSITYLEVQGYR 313
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 112/269 (41%), Gaps = 24/269 (8%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPE---SL 105
T + LG+ + T+K+G P + + +DTGSDL WV C+ AP G T E S+
Sbjct: 97 TSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSI 156
Query: 106 YHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYAD-HGSSLGVLVTD--HF 158
Y+PK N V CN+ C+ N C Y V Y S+ G+L+ D H
Sbjct: 157 YNPKVSTTNKKVTCNNSLCAQ----RNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHL 212
Query: 159 PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
N + + FGCG Q G+ GLG+ K S+ S L GL +
Sbjct: 213 TTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFS 272
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG-LQII 277
C G G + G SS TP + + +Y+ + G +T I +
Sbjct: 273 MCFGHDGVGRISFGDK--GSSDQEETPFNLNPSHPNYNITVTRVRVG--TTLIDDEFTAL 328
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
FD+G+S+TY Y T + + K
Sbjct: 329 FDTGTSFTYLVDPMYTTVSESFHSQAQDK 357
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 89/285 (31%), Positives = 126/285 (44%), Gaps = 46/285 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ +KIG PP + L +DTGS +T+V PC+ CT H N+ P
Sbjct: 33 GYYTSRVKIGTPPHEFSLIVDTGSTVTYV----PCSSCT-------HCGNHQDPRFSPAL 81
Query: 121 SAFHLPENIRCEANDQ-CD----YEVLYADHGSSLGVLVTDHFPLRLTNGSLL-GPRLIF 174
S+ + P E + CD Y+ YA+ +S GVL D + +N S L G RL+F
Sbjct: 82 SSSYKPLECGSECSTGFCDGSRKYQRQYAEKSTSSGVLGKD--VIGFSNSSDLGGQRLVF 139
Query: 175 GCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTRNVLGHCLS--VRGGGYLFL 231
GC G TA G++GLG G SI+ QL +V C GGG + L
Sbjct: 140 GC--ETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMIL 197
Query: 232 G-----HDLVPSSGIAWTPMSRDLLEKHYSSGPAEL-----LFGGKSTGIKGLQIIFDSG 281
G D+V ++ +L+ K G + L +F GK + DSG
Sbjct: 198 GGFQPPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGK------YGTVLDSG 251
Query: 282 SSYTYFNS---QAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
++Y YF QA+K+ + LK P D EK +C+ G
Sbjct: 252 TTYAYFPGAAFQAFKSAVKEQVGSLKEVPGPD---EKFKDICYAG 293
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 117/282 (41%), Gaps = 39/282 (13%)
Query: 24 FSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTG 83
F + + P+ +S+ +T +FG Y ++K+G+P + L +DTG
Sbjct: 76 FQQHTKNPAALRSSTTTLGRKFGE---------------YYTSIKLGSPGQEAILIVDTG 120
Query: 84 SDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP-FCSAFHLPENIRCEANDQCD 138
S+LTW++C PC C +++Y ++ V CN+ CS C QC
Sbjct: 121 SELTWLKC-LPCKVCAPSVDTIYDAARSVSYKPVTCNNSQLCSNSSQGTYAYCARGSQCQ 179
Query: 139 YEVLYADHGSSLGVLVTDHFPLRLTNGS--LLGPRLIFGCGYNQRNPGPKPPPTAGVLGL 196
+ Y D S G L TD + G + FGC Q + P +G+LGL
Sbjct: 180 FAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGCA--QGDLELVPTGASGILGL 237
Query: 197 GLGKASILSQL-QSLGLTRNVLGHCLSVRGG-----GYLFLGHDLVPSSGIAWTPMS--- 247
GK ++ QL Q G HC R G +F G+ +P + +T ++
Sbjct: 238 NAGKMALPMQLGQRFGWK---FSHCFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALTN 294
Query: 248 RDLLEKHYSSGPAELLFGGKSTGI--KGLQIIFDSGSSYTYF 287
+L K Y + + +G +I DSGSS++ F
Sbjct: 295 SELQRKFYHVALKGVSINSHELVLLPRGSVVILDSGSSFSSF 336
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 73/226 (32%), Positives = 106/226 (46%), Gaps = 32/226 (14%)
Query: 44 RFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP 101
RF + +TG V G Y +T+K+GNP + Y L TGSD+ WV C++ CT C P
Sbjct: 55 RFAAKKQQGVTGFVLEAMPGLYCITVKLGNPSRHYYLAFHTGSDVMWVPCSS-CTDCPTP 113
Query: 102 PE-----SLYHPKNNLVA---------CNDPFCSAFHLPENIRCEANDQCDYEVLYADHG 147
+ LY PKN+ + C D + H + + DQC Y +YAD
Sbjct: 114 DDIGFSLDLYDPKNSSTSSEISCSDDRCADALKTG-HAICHTSHSSGDQCGYNQIYADGV 172
Query: 148 -SSLGVLVTD--HFPLRLTNGSLL--GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKAS 202
++ G V+D HF + + N S +IFGC ++ GV+G G S
Sbjct: 173 LATTGYYVSDDIHFDIFMGNESFASSSASVIFGCSKSRSG----HLQADGVIGFGKDAPS 228
Query: 203 ILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPM 246
++SQL S G++ + CL S GGG L L D V G+ +T +
Sbjct: 229 LISQLNSQGVS-HAFSRCLDDSDDGGGVLIL--DEVGEPGLEFTSL 271
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/250 (30%), Positives = 112/250 (44%), Gaps = 24/250 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG PP+ + L +D+GS +T+V C A C C + + P +L + P
Sbjct: 87 GYYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP--DLSSSYSPVK 143
Query: 120 CSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFGCG 177
C+ + C+++ QC YE YA+ SS GVL D + S L P R +FGC
Sbjct: 144 CNV-----DCTCDSDKKQCTYERQYAEMSSSSGVLGEDI--VSFGRESELKPQRAVFGC- 195
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHDL 235
N G++GLG G+ SI+ QL G+ + C GGG + LG
Sbjct: 196 ENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGG-- 253
Query: 236 VPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYFNS 289
VP+ S L +Y+ E+ GK+ + + DSG++Y Y
Sbjct: 254 VPAPSDMVFSHSDPLRSPYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYLPE 313
Query: 290 QAYKTTLDLM 299
QA+ D +
Sbjct: 314 QAFVAFKDAV 323
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 112/243 (46%), Gaps = 24/243 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG PP+ + L +D+GS +T+V C++ C C + + P +L + P
Sbjct: 86 GYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSS-CEQCGNHQDPRFQP--DLSSSYSPVK 142
Query: 120 CSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIFGCG 177
C+ + C+++ QC YE YA+ SS GVL D + S L P+ IFGC
Sbjct: 143 CNV-----DCTCDSDKKQCTYERQYAEMSSSSGVLGED--IVSFGRESELKPQHAIFGC- 194
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHDL 235
N G++GLG G+ SI+ QL G+ + C GGG + LG L
Sbjct: 195 ENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGML 254
Query: 236 VPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYFNS 289
P I S L +Y+ E+ GK+ ++ + DSG++Y Y
Sbjct: 255 APPDMIFSN--SDPLRSPYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLPE 312
Query: 290 QAY 292
QA+
Sbjct: 313 QAF 315
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 74/254 (29%), Positives = 112/254 (44%), Gaps = 22/254 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG PP+ + L +D+GS +T+V C A C C + + P +L + P
Sbjct: 83 GYYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP--DLSSTYSPVK 139
Query: 120 CSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
CSA + C+++ QC YE YA+ SS GVL D T L R +FGC
Sbjct: 140 CSA-----DCTCDSDKSQCTYERQYAEMSSSSGVLGEDIVSFG-TESELKPQRAVFGC-E 192
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGHDLV 236
N G++GLG G+ SI+ QL G+ + C GGG + LG
Sbjct: 193 NSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPA 252
Query: 237 PSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYFNSQ 290
P + S + +Y+ E+ GK+ + + DSG++Y Y Q
Sbjct: 253 PPDMV--FSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDSGTTYAYLPEQ 310
Query: 291 AYKTTLDLMRKDLK 304
A+ D + ++
Sbjct: 311 AFVAFKDAVTSKVR 324
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/160 (35%), Positives = 80/160 (50%), Gaps = 12/160 (7%)
Query: 56 NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL--- 112
N Y +G Y + L IG PP +DTGSDL WVQC PC GC ++ P +
Sbjct: 58 NAY-IGQYLMELYIGTPPIKISGTVDTGSDLIWVQC-VPCLGCYNQINPMFDPLKSSTYT 115
Query: 113 -VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR 171
++C+ P C ++ E C +CDY YAD + GVL + L G + +
Sbjct: 116 NISCDSPLCYKPYIGE---CSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQ 172
Query: 172 -LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL 210
++FGCG+N N G G++GLG G S++SQ+ L
Sbjct: 173 GILFGCGHN--NTGNFNDHEMGLIGLGGGPTSLVSQIGPL 210
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/276 (28%), Positives = 128/276 (46%), Gaps = 29/276 (10%)
Query: 33 KKKSTQSTAAHRFGSTAVFPITGNVYPL----GYYSVTLKIGNPPKLYELDIDTGSDLTW 88
+ KS ++ + +T VF P G Y+VT+ +G P K + L DTGSDLTW
Sbjct: 98 RVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTGSDLTW 157
Query: 89 VQCNAPCTGCTLPP-ESLYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLY 143
QC PC+G P + + P + ++C+ C + C +++ C Y V Y
Sbjct: 158 TQCE-PCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCLYGVKY 216
Query: 144 ADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASI 203
G ++G L T+ + ++ + + GCG +RN G + TAG+LGLG ++
Sbjct: 217 GT-GYTVGFLATETLTITPSD---VFENFVIGCG--ERN-GGRFSGTAGLLGLGRSPVAL 269
Query: 204 LSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAE 261
SQ S +N+ +CL S G+L G + S +TP++ + E Y +
Sbjct: 270 PSQTSS--TYKNLFSYCLPASSSSTGHLSFGGGV--SQAAKFTPITSKIPE-LYGLDVSG 324
Query: 262 LLFGGKSTGI-----KGLQIIFDSGSSYTYFNSQAY 292
+ GG+ I + I DSG++ TY S A+
Sbjct: 325 ISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPSTAH 360
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 86/287 (29%), Positives = 128/287 (44%), Gaps = 50/287 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACN 116
G + + L IG PP+ Y +DTGSDL W QC PCT C ++ P + ++C+
Sbjct: 95 GEFLMKLAIGTPPETYSAILDTGSDLIWTQCK-PCTQCFHQSTPIFDPKKSSSFSKLSCS 153
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A LP++ N+ C+Y Y D+ S+ G+L ++ LT G P + FGC
Sbjct: 154 SQLCEA--LPQS---SCNNGCEYLYSYGDYSSTQGILASE----TLTFGKASVPNVAFGC 204
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV---RGGGYLFLGH 233
G + N G AG++GLG G S++SQL+ +CL+ L +G
Sbjct: 205 GAD--NEGSGFSQGAGLVGLGRGPLSLVSQLK-----EPKFSYCLTTVDDTKTSTLLMGS 257
Query: 234 DL---VPSSGIAWTPMSRD--------LLEKHYSSGPAEL-----LFGGKSTGIKGLQII 277
SS I TP+ L + S G L F + G GL I
Sbjct: 258 LASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGL--I 315
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGK---PLEDTAEEKALPVCW 321
DSG++ TY A+ +L+ K+ K P+ D++ L VC+
Sbjct: 316 IDSGTTITYLEESAF----NLVAKEFTAKINLPV-DSSGSTGLDVCF 357
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 110/256 (42%), Gaps = 30/256 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT--GCTLPPESLYHPKN----NLVA 114
G Y V++ +G P + + DTGSDL+WVQC PC+ GC + L+ P + + V
Sbjct: 83 GNYVVSVGLGTPARDLTVVFDTGSDLSWVQC-GPCSSGGCYHQQDPLFAPSSSSTFSAVR 141
Query: 115 CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT-------NGSL 167
C +P C + +D+C YEV+Y D ++G L D L T N S
Sbjct: 142 CGEPECPRARQSCS-SSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSN 200
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVR 224
P +FGCG N K G+ GLG GK S+ S Q+ G +CL S
Sbjct: 201 KLPGFVFGCGENNTGLFGK---ADGLFGLGRGKVSLSS--QAAGKYGEGFSYCLPSSSSN 255
Query: 225 GGGYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGIKG------LQII 277
GYL LG + +TPM +R Y + G++ + +I
Sbjct: 256 AHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLI 315
Query: 278 FDSGSSYTYFNSQAYK 293
DSG+ T +AY
Sbjct: 316 VDSGTVITRLAPRAYS 331
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 127/292 (43%), Gaps = 41/292 (14%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT-LPPESLYHPKNN 111
++G G Y V L+IG PP+ L DTGSDL WV+C+A C C+ P +++ P+++
Sbjct: 74 VSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHS 132
Query: 112 L----VACNDPFCSAFHLPENI----RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
C DP C P+ + C YE YAD + G+ + L+ +
Sbjct: 133 STFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTS 192
Query: 164 NGSLLGPRLI-FGCGYN---QRNPGPKPPPTAGVLGLGLGKASILSQL-QSLG--LTRNV 216
+G + + FGCG+ Q G GV+GLG G S SQL + G + +
Sbjct: 193 SGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCL 252
Query: 217 LGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI 276
+ + LS YL +G+ S + +TP+ + L P KS + G ++
Sbjct: 253 MDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLS------PTFYYVKLKSVFVNGAKL 306
Query: 277 -----------------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDT 311
+ DSG++ + AY++ + +R+ +K P+ D
Sbjct: 307 RIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVK-LPIADA 357
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 131/297 (44%), Gaps = 28/297 (9%)
Query: 52 PITGNVYPLGYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN 110
P++ LG Y ++ +G PP K+Y +DTGS++ W+QC PC C +++P
Sbjct: 78 PVSTLTPELGEYLISYSVGTPPFKVYGF-MDTGSNIVWLQC-QPCNTCFNQTSPIFNPSK 135
Query: 111 NLVACNDPFCSAFHLPEN---IRC-EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG- 165
+ N P S+ N I C D C+Y + Y S G L D L T+G
Sbjct: 136 SSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGS 195
Query: 166 SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---- 221
S+L P ++ GCG+ N ++GV+G+G G S++ Q+ S + +CL
Sbjct: 196 SVLFPNIVIGCGH--INVLQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSK-FSYCLIPYN 252
Query: 222 -SVRGGGYLFLGHDLVPSSGIAW-TPMSRDLLEKHY--------SSGPAELLFGGKSTGI 271
L G D+V S I TPM + +++Y S G + +G +S
Sbjct: 253 SDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGERSNA- 311
Query: 272 KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCL 328
I+ DSG+ T + + + +++K +E + L +C+ T K L
Sbjct: 312 STQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIE--PPDHHLSLCYNTTGKQL 366
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/254 (29%), Positives = 112/254 (44%), Gaps = 33/254 (12%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPF 119
+V+L +G PP+ + +DTGS+L+W+ C AP + P+ + V C
Sbjct: 86 TVSLAVGTPPQNVTMVLDTGSELSWLLC-APAGARNKFSAMSFRPRASSTFAAVPCASAQ 144
Query: 120 CSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
C + LP C+ A+ +C + YAD SS G L TD F + GS R FGC
Sbjct: 145 CRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAV----GSGPPLRAAFGCMS 200
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR-GGGYLFLGHDLVP 237
+ + P +AG+LG+ G S +SQ TR +C+S R G L LGH +P
Sbjct: 201 SAFDSSPDGVASAGLLGMNRGALSFVSQAS----TRR-FSYCISDRDDAGVLLLGHSDLP 255
Query: 238 SS-GIAWTPMSRDLL------EKHYSSGPAELLFGGKSTGIKGL----------QIIFDS 280
+ + +TPM + L YS + GGK I Q + DS
Sbjct: 256 TFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDS 315
Query: 281 GSSYTYFNSQAYKT 294
G+ +T+ AY
Sbjct: 316 GTQFTFLLGDAYSA 329
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 136/309 (44%), Gaps = 69/309 (22%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
+ S S G P T +YP Y Y+ T +G PP+ + +DTGS LTWV C
Sbjct: 72 RASHHSQKGSSSGGHKSIPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSQLTWVPC 131
Query: 92 --NAPCTGCTLPPES---LYHPKNN----LVACNDPFCSAFHLPENI-RCE--------- 132
N C C+ P + ++HPKN+ LV C +P C H E++ +C
Sbjct: 132 TSNYDCRNCSSPFAAAVPVFHPKNSSSSRLVGCRNPSCLWVHSAEHVAKCRAPCSRGANC 191
Query: 133 --ANDQC-DYEVLYADHGSSLGVLVTDHF--PLRLTNGSLLGPRLIFGCGYNQRNPGPKP 187
A++ C Y V+Y GS+ G+L+ D P R +G +LG L+ +Q
Sbjct: 192 TPASNVCPPYAVVYGS-GSTAGLLIADTLRAPGRAVSGFVLGCSLV---SVHQ------- 240
Query: 188 PPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG-------GGYLFLGHDLVPSSG 240
P +G+ G G G S+ +Q LGL++ +CL R G L LG D + G
Sbjct: 241 -PPSGLAGFGRGAPSVPAQ---LGLSK--FSYCLLSRRFDDNAAVSGSLVLGGD---NDG 291
Query: 241 IAWTPMSRDLL-EKHYSSGPAELLFGGKSTGIKGLQI---------------IFDSGSSY 284
+ + P+ + +K + L G + G K +++ I DSG+++
Sbjct: 292 MQYVPLVKSAAGDKQPYAVYYYLALSGVTVGGKAVRLPARAFAANAAGSGGAIVDSGTTF 351
Query: 285 TYFNSQAYK 293
TY + ++
Sbjct: 352 TYLDPTVFQ 360
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 118/267 (44%), Gaps = 37/267 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+ + L +DTGS +T+V C++ C C + + P D
Sbjct: 11 GYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSS-CEQCGRHQDPKFQP--------DLSS 61
Query: 121 SAFHLPENIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFGC 176
+ + NI C +D QC YE YA+ +S GVL D + N S L P R +FGC
Sbjct: 62 TYQSVKCNIDCNCDDEKQQCVYERQYAEMSTSSGVLGED--IISFGNLSALAPQRAVFGC 119
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY--LFLGHD 234
N G++G+G G SI+ L G+ + C G G + LG
Sbjct: 120 E-NMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLG-G 177
Query: 235 LVPSSGIAWTP--------MSRDLLEKHYSSGPAEL---LFGGKSTGIKGLQIIFDSGSS 283
+ P S + ++ + DL E H + P L +F GK I DSG++
Sbjct: 178 ISPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVFDGKHG------TILDSGTT 231
Query: 284 YTYFNSQAYKTTLDLMRKDLKG-KPLE 309
Y Y A+ + D + K+L KP+
Sbjct: 232 YAYLPEAAFVSFKDAIMKELHSLKPIR 258
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 84/298 (28%), Positives = 125/298 (41%), Gaps = 29/298 (9%)
Query: 46 GSTAVFPIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC-TGCTLPPE 103
GS A P+T G Y +G Y + +G P K Y + +DTGS LTW+QC+ PC C
Sbjct: 119 GSLASVPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCS-PCRVSCHRQSG 177
Query: 104 SLYHPKNN----LVACNDPFCSAFHLP--ENIRCEANDQCDYEVLYADHGSSLGVLVTDH 157
++ PK + V+C+ P C+ C ++D C Y+ Y D S+G L D
Sbjct: 178 PVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKD- 236
Query: 158 FPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNV 216
++ GS P +GCG + + +AG++GL K S+L QL +LG +
Sbjct: 237 ---TVSFGSNSVPNFYYGCGQDNEGLFGR---SAGLMGLARNKLSLLYQLAPTLGYS--- 287
Query: 217 LGHCLSVRGGGYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGK-----STG 270
+CL P ++TPM S L + Y + + GK S+
Sbjct: 288 FSYCLPSSSSSGYLSIGSYNPGQ-YSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSE 346
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCL 328
L I DSG+ T + Y + +KG D L C+ G L
Sbjct: 347 YSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRAD--AYSILDTCFVGQASSL 402
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 108/241 (44%), Gaps = 24/241 (9%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK----NNLVACNDPFCSA 122
+++G P + + +DTGSDL W+ C C C ++Y P + V C P C
Sbjct: 125 VEVGTPSSKFLVALDTGSDLFWLPCE--CKLCAKNGSTMYSPSLSSTSKTVPCGHPLC-- 180
Query: 123 FHLPENIRC--EANDQCDYEVLY--ADHGSSLGVLVTDHFPL----RLTNGSLLGPRLIF 174
P+ +++ C YEV Y A+ GSS GVLV D L G + ++F
Sbjct: 181 -ERPDACATAGKSSSSCPYEVKYVSANTGSS-GVLVEDVLHLVDGGGGGGGKAVQAPIVF 238
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLT-RNVLGHCLSVRGGGYLFLGH 233
GCG Q + G++GLGL K S+ S L S GL + C S G G + G
Sbjct: 239 GCGQVQTGAFLRGAAAGGLMGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGRINFGD 298
Query: 234 DLVPSSGIAWTPM--SRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQA 291
P A TP+ + L +Y+ + K+ ++ + DSG+S+TY + A
Sbjct: 299 AGSPDQ--AETPLIAAGSLQPSYYNISVGAITVDSKAMAVE-FTAVVDSGTSFTYLDDPA 355
Query: 292 Y 292
Y
Sbjct: 356 Y 356
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 68/225 (30%), Positives = 101/225 (44%), Gaps = 25/225 (11%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
S + +T S + FG+ V +G G Y + + +G+PP+ + ID+GSD+ WVQC
Sbjct: 114 SPRDATSSYSVEEFGAEVV---SGMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQC 170
Query: 92 NAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHG 147
PCT C + ++ P ++ V C+ C EN C A C YEV+Y D
Sbjct: 171 Q-PCTQCYHQTDPVFDPADSASFMGVPCSSSVCERI---ENAGCHAGG-CRYEVMYGDGS 225
Query: 148 SSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
+ G L + LT G + + GCG+ R G+ G + S++ QL
Sbjct: 226 YTKGTLALE----TLTFGRTVVRNVAIGCGHRNRGMFVGAAGLLGLGGGSM---SLVGQL 278
Query: 208 QSLGLTRNVLGHCLSVRG---GGYLFLGHDLVPSSGIAWTPMSRD 249
G T +CL RG G L G +P G AW P+ R+
Sbjct: 279 G--GQTGGAFSYCLVSRGTDSAGSLEFGRGAMP-VGAAWIPLIRN 320
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 106/256 (41%), Gaps = 27/256 (10%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
T + G+ + + +G PP + + +DTGSDL W+ CN CT C L +
Sbjct: 103 THQIAAFGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCN--CTSCV---RGLKTQNGKV 157
Query: 113 VACNDPFCSAFHLPENIRCEAN-----------DQCDYEVLY-ADHGSSLGVLVTDHFPL 160
+ N +N+ C +N C YEV Y ++ SS G LV D L
Sbjct: 158 IDLNIYELDKSSTRKNVPCNSNMCKQTQCHSSGSSCRYEVEYLSNDTSSSGFLVEDVLHL 217
Query: 161 RLTNGSL--LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
N + ++ GCG Q G+ GLG+ S+ S L GL +
Sbjct: 218 ITDNDQTKDIDTQITIGCGQVQTGVFLNGAAPNGLFGLGMENVSVPSILAQKGLISDSFS 277
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH--YSSGPAELLFGGKSTGIKGLQI 276
C G G + G SS TP +L E H Y+ +++ GG + +
Sbjct: 278 MCFGSDGSGRITFGD--TGSSDQGKTPF--NLRESHPTYNVTITQIIVGGYAADHE-FHA 332
Query: 277 IFDSGSSYTYFNSQAY 292
IFDSG+S+TY N AY
Sbjct: 333 IFDSGTSFTYLNDPAY 348
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 126/295 (42%), Gaps = 48/295 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G + + + IG P Y +DTGSDL W QC PC C ++ P ++ V C+
Sbjct: 98 GEFLMDVAIGTPALSYAAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCS 156
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CS LP + C + +C Y Y D S+ GVL ++ F L L P + FGC
Sbjct: 157 SALCS--DLPTST-CTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKKL--PGVAFGC 211
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV----RGGGYLFLG 232
G N G AG++GLG G S++SQ LGL + +CL+ G L LG
Sbjct: 212 G--DTNEGDGFTQGAGLVGLGRGPLSLVSQ---LGLDK--FSYCLTSLDDGDGKSPLLLG 264
Query: 233 HDLVPSSG------IAWTPMSRDLLEKHY--------SSGPAELL-----FGGKSTGIKG 273
S + TP+ ++ + + + G + F + G G
Sbjct: 265 GSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGG 324
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAE--EKALPVCWKGTWK 326
+I DSG+S TY Q Y+ ++K + T + E L +C++G K
Sbjct: 325 --VIVDSGTSITYLELQGYRA----LKKAFVAQMALPTVDGSEIGLDLCFQGPAK 373
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 126/283 (44%), Gaps = 39/283 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG+PPK + L +DTGSDL W+QC PC C Y PK+++ + CN
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQC-VPCFDCFEQNGPYYDPKDSISFRNITCN 252
Query: 117 DPFCSAFHLPENIR-CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSL------L 168
DP C P+ R C+ Q C Y Y D ++ G + F + LT+ +
Sbjct: 253 DPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRR 312
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG--- 225
++FGCG+ R AG+LGLG G S SQLQS L + +CL R
Sbjct: 313 VENVMFGCGHWNRGLFHG---AAGLLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRDSDT 367
Query: 226 --GGYLFLGH--DLVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGKSTGIKGLQ--- 275
L G DL+ + +T + + ++ Y + GG+ I
Sbjct: 368 SVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNL 427
Query: 276 -------IIFDSGSSYTYFNSQAYKTTLDLMRKDLKG-KPLED 310
I DSG++ +YF+ AY+ + + +KG K +ED
Sbjct: 428 SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVED 470
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 126/283 (44%), Gaps = 39/283 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG+PPK + L +DTGSDL W+QC PC C Y PK+++ + CN
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQC-VPCFDCFEQNGPYYDPKDSISFRNITCN 252
Query: 117 DPFCSAFHLPENIR-CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSL------L 168
DP C P+ R C+ Q C Y Y D ++ G + F + LT+ +
Sbjct: 253 DPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRR 312
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG--- 225
++FGCG+ R AG+LGLG G S SQLQS L + +CL R
Sbjct: 313 VENVMFGCGHWNRGLFHG---AAGLLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRDSDT 367
Query: 226 --GGYLFLGH--DLVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGKSTGIKGLQ--- 275
L G DL+ + +T + + ++ Y + GG+ I
Sbjct: 368 SVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNL 427
Query: 276 -------IIFDSGSSYTYFNSQAYKTTLDLMRKDLKG-KPLED 310
I DSG++ +YF+ AY+ + + +KG K +ED
Sbjct: 428 SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVED 470
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/280 (27%), Positives = 122/280 (43%), Gaps = 43/280 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTL-PPESLYHPKNNLV----AC 115
G Y V L++G PP+ L DTGSDL WV+C+A C CT P S + +++ C
Sbjct: 87 GQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSA-CRNCTRHTPGSAFLARHSTTFSPNHC 145
Query: 116 NDPFCSAFHLPENIRC---EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR- 171
D C LP++ RC + C YE Y D + G + L ++G +
Sbjct: 146 YDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKG 205
Query: 172 LIFGCGYNQRNP---GPKPPPTAGVLGLGLGKASILSQL-QSLG--LTRNVLGHCLSVRG 225
+ FGC + P G GV+GLG G S+ SQL G + ++ H +S
Sbjct: 206 IAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSP 265
Query: 226 GGYLFLG---HDLVP-SSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI----- 276
YL +G +D+ P + +TP+ + L P G +S + G+++
Sbjct: 266 TSYLLIGSTQNDVAPGKRRMRFTPLHINPLS------PTFYYIGIESVSVDGIKLPINPS 319
Query: 277 ------------IFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
I DSG++ T+ AY L ++++ ++
Sbjct: 320 VWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVR 359
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 78/256 (30%), Positives = 114/256 (44%), Gaps = 42/256 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y + L IG P + + +DTGSDL W QC PCT C +++P+ + + C+
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCS 151
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A P C +N+ C Y Y D + G + T+ LT GS+ P + FGC
Sbjct: 152 SQLCQALQSP---TC-SNNSCQYTYGYGDGSETQGSMGTE----TLTFGSVSIPNITFGC 203
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---GGYLFLGH 233
G N N G AG++G+G G S+ SQL +T+ +C++ G L LG
Sbjct: 204 GEN--NQGFGQGNGAGLVGMGRGPLSLPSQLD---VTK--FSYCMTPIGSSTSSTLLLGS 256
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGL----------------QII 277
+ +S A +P + L+E + G S G L II
Sbjct: 257 --LANSVTAGSP-NTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGII 313
Query: 278 FDSGSSYTYFNSQAYK 293
DSG++ TYF AY+
Sbjct: 314 IDSGTTLTYFADNAYQ 329
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 81/275 (29%), Positives = 124/275 (45%), Gaps = 34/275 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + + IG PP + DTGSDL WVQC PC C +++PK + V C
Sbjct: 92 GEYFMRISIGTPPIEVLVIADTGSDLIWVQCQ-PCQECYKQKSPIFNPKQSSTYRRVLCE 150
Query: 117 DPFCSAFHLPENIR-CEAN---DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRL 172
+C+A L ++R C A+ C Y Y DH ++G L T+ F + TN S+ L
Sbjct: 151 TRYCNA--LNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNSI--QEL 206
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-------SVRG 225
FGCG N G +G++GLG G S++SQL + N +CL +
Sbjct: 207 AFGCG--NSNGGNFDEVGSGIVGLGGGSLSLISQLGT--KIDNKFSYCLVPILEKSNFSL 262
Query: 226 GGYLFLGHDLVPSSGI-AWTPMSRDLLEKHY-------SSGPAELLFGGKST--GIKGLQ 275
G +F + + S TP+ E Y S G L + ++
Sbjct: 263 GKIVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGN 322
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
II DSG++ T+ +S+ Y ++ K ++G+ + D
Sbjct: 323 IIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVSD 357
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 86/292 (29%), Positives = 129/292 (44%), Gaps = 31/292 (10%)
Query: 25 SEANQPPSKKKSTQSTAAHRFGSTAVFPITGNV-YPLGYYSVTLKIG-----NPPKLYEL 78
S AN + ++ ++ AA +A P+T + + Y T+ +G +P +
Sbjct: 146 SRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSSGSPAANLTV 205
Query: 79 DIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPEN---IRC 131
+DTGSDLTWVQC PC+ C + L+ P + V CN C+A C
Sbjct: 206 IVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACAASLKAATGTPGSC 264
Query: 132 -EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPT 190
N++C Y + Y D S GVL TD + L SL G +FGCG + R T
Sbjct: 265 GGGNERCYYALAYGDGSFSRGVLATD--TVALGGASLDG--FVFGCGLSNRG---LFGGT 317
Query: 191 AGVLGLGLGKASILSQ--LQSLGLTRNVLGHCLSVRGGGYLFLGHDLVP---SSGIAWTP 245
AG++GLG + S++SQ L+ G+ L S G L LG D ++ +A+T
Sbjct: 318 AGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVAYTR 377
Query: 246 MSRDLLE-KHYSSGPAELLFGGKSTGIKGL---QIIFDSGSSYTYFNSQAYK 293
M D + Y GG + +GL ++ DSG+ T Y+
Sbjct: 378 MIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYR 429
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 90/207 (43%), Gaps = 23/207 (11%)
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC-----TLPPESLYHPKNNL 112
Y G Y + IG P Y + +DTGS WV C C L + Y P++++
Sbjct: 78 YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQCPHESDILRKLTFYDPRSSV 136
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD--HFPLRLTNGS 166
V C+D C++ C +C Y YAD G ++G+L TD H+ NG
Sbjct: 137 SSKEVKCDDTICTS-----RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 191
Query: 167 L--LGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-S 222
+ FGCG Q G++G G + LSQL + G T+ + HCL S
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDS 251
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRD 249
GGG +G + P + TP+ ++
Sbjct: 252 TNGGGIFAIGEVVEPK--VKTTPIVKN 276
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 75/251 (29%), Positives = 111/251 (44%), Gaps = 27/251 (10%)
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPE---SLYHPK- 109
Y L Y T+++G P + + +DTGSDL WV C+ AP G + S+Y PK
Sbjct: 1 YSLHY--TTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYASDFELSVYSPKK 58
Query: 110 ---NNLVACNDPFCSAFHLPENIRC-EANDQCDYEVLYAD-HGSSLGVLVTDHFPLRLTN 164
+ V CN+ C+ + +C EA C Y V Y S+ G+L+ D L+ N
Sbjct: 59 SSTSKTVPCNNSLCA-----QRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTEN 113
Query: 165 --GSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
+ + FGCG Q G+ GLG+ + S+ S L GL N C S
Sbjct: 114 KHSEPIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFS 173
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG-LQIIFDSG 281
G G + G S TP + + L +Y+ + G +T I + +FDSG
Sbjct: 174 DDGVGRINFGDK--GSLEQEETPFNLNQLHPNYNITVTSIRVG--TTLIDADITALFDSG 229
Query: 282 SSYTYFNSQAY 292
+S++YF Y
Sbjct: 230 TSFSYFTDPIY 240
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 121/279 (43%), Gaps = 37/279 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG PPK Y L +DTGSDL W+QC PC C Y PK + + C+
Sbjct: 190 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKESSSFENITCH 248
Query: 117 DPFCSAFHLPENIR-CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLT--NGSLLGPR- 171
DP C P+ + C+ +Q C Y Y D ++ G + F + LT NG
Sbjct: 249 DPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHV 308
Query: 172 --LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---- 225
++FGCG+ R AG+LGLG G S SQLQS + + +CL R
Sbjct: 309 ENVMFGCGHWNRGLFHG---AAGLLGLGRGPLSFASQLQS--IYGHSFSYCLVDRNSDTS 363
Query: 226 -GGYLFLGHD--LVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGKSTGIK------- 272
L G D L+ + +T + ++ Y G ++ G+ I
Sbjct: 364 VSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLS 423
Query: 273 ---GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
G I DSG++ TYF AY+ + K +KG L
Sbjct: 424 KEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYEL 462
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 91/201 (45%), Gaps = 26/201 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V L IG PP + IDT SDL W QC PCTGC + +++P+ + + C+
Sbjct: 87 GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145
Query: 117 DPFCSAFHLPENIRC--EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C + RC + ++ C Y Y+ + ++ G L D +L G + F
Sbjct: 146 SDTCDELDVH---RCGHDDDESCQYTYTYSGNATTEGTLAVD----KLVIGEDAFRGVAF 198
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLFL 231
GC + P PP +GV+GLG G S++SQ L+ +CL + R G L L
Sbjct: 199 GCSTSSTGGAP-PPQASGVVGLGRGPLSLVSQ-----LSVRRFAYCLPPPASRIPGKLVL 252
Query: 232 GHDLVPSSGIA---WTPMSRD 249
G D + PM RD
Sbjct: 253 GADADAARNATNRIAVPMRRD 273
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 126/292 (43%), Gaps = 48/292 (16%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
+ ++S + + + G+ P+T + G Y + IG PP L ++DTGSDL WV+C
Sbjct: 57 AAERSRRRLSVYTSGTGTKAPVTKS-QKGGKYIMQFSIGEPPLLIWAEVDTGSDLMWVKC 115
Query: 92 NAPCTGCTLPPESLYHPKNNLVA----CNDPFCSAFHLPENIRCEANDQ---CDYEVLYA 144
+PC GC PP LY P + + C+ C A I + +D C Y Y
Sbjct: 116 -SPCNGCNPPPSPLYDPARSRSSGKLPCSSQLCQALGRGRIISDQCSDDPPLCGYHYAYG 174
Query: 145 DHG--SSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRN---------PGPKPPPTAGV 193
G S+ GVL T+ F FG GY N G + TAG+
Sbjct: 175 HSGDHSTQGVLGTETF--------------TFGDGYVANNVSFGRSDTIDGSQFGGTAGL 220
Query: 194 LGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY--LFLGH--DLVPSSG-IAWTPMSR 248
+GLG G S++SQ LG R +CL+ Y + G L S+G ++ TP+
Sbjct: 221 VGLGRGHLSLVSQ---LGAGR--FAYCLAADPNVYSTILFGSLAALDTSAGDVSSTPLVT 275
Query: 249 D---LLEKHYSSGPAELLFGGKSTGIK-GLQIIFDSGSSYTYFNSQAYKTTL 296
+ + HY + GG IK G I GS +F+S A T+L
Sbjct: 276 NPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSL 327
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 91/201 (45%), Gaps = 26/201 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V L IG PP + IDT SDL W QC PCTGC + +++P+ + + C+
Sbjct: 87 GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145
Query: 117 DPFCSAFHLPENIRC--EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C + RC + ++ C Y Y+ + ++ G L D +L G + F
Sbjct: 146 SDTCDELDVH---RCGHDDDESCQYTYTYSGNATTEGTLAVD----KLVIGEDAFRGVAF 198
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLFL 231
GC + P PP +GV+GLG G S++SQ L+ +CL + R G L L
Sbjct: 199 GCSTSSTGGAP-PPQASGVVGLGRGPLSLVSQ-----LSVRRFAYCLPPPASRIPGKLVL 252
Query: 232 GHDLVPSSGIA---WTPMSRD 249
G D + PM RD
Sbjct: 253 GADADAARNATNRIAVPMRRD 273
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 121/298 (40%), Gaps = 34/298 (11%)
Query: 22 GCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDID 81
G + PP +H +G + ++G+ G Y V +G PP+ + L +D
Sbjct: 24 GVENHTANPPVITAVIAGPPSHDYGFQSPV-VSGSTLGSGQYFVDFFLGTPPQKFSLIVD 82
Query: 82 TGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEAN--D 135
+GSDL WVQC +PC C LY P N+ V C C E C+
Sbjct: 83 SGSDLLWVQC-SPCRQCYAQDSPLYVPSNSSTFSPVPCLSSDCLLIPATEGFPCDFRYPG 141
Query: 136 QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLG 195
C YE LYAD SS GV + T + ++ FGCG + + GVLG
Sbjct: 142 ACAYEYLYADTSSSKGVFAYE----SATVDGVRIDKVAFGCGSDNQGSFAA---AGGVLG 194
Query: 196 LGLGKASILSQLQSLGLTRNVLGHCLS-----VRGGGYLFLGHDLVPS-SGIAWTPM-SR 248
LG G S SQ+ N +CL L G +L+ + + +TP+ S
Sbjct: 195 LGQGPLSFGSQVGY--AYGNKFAYCLVNYLDPTSVSSSLIFGDELISTIHDMQYTPIVSN 252
Query: 249 DLLEKHYSSGPAELLFGGKSTGI--KGLQI--------IFDSGSSYTYFNSQAYKTTL 296
Y ++ GGKS I +I IFDSG++ TY+ AY L
Sbjct: 253 PKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHIL 310
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 90/207 (43%), Gaps = 23/207 (11%)
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC-----TLPPESLYHPKNNL 112
Y G Y + IG P Y + +DTGS WV C C L + Y P++++
Sbjct: 54 YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQCPHESDILRKLTFYDPRSSV 112
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD--HFPLRLTNGS 166
V C+D C++ C +C Y YAD G ++G+L TD H+ NG
Sbjct: 113 SSKEVKCDDTICTS-----RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 167
Query: 167 L--LGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-S 222
+ FGCG Q G++G G + LSQL + G T+ + HCL S
Sbjct: 168 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDS 227
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRD 249
GGG +G + P + TP+ ++
Sbjct: 228 TNGGGIFAIGEVVEPK--VKTTPIVKN 252
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 80/153 (52%), Gaps = 12/153 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G + + L IG+PP+ + +DTGSDL W QC PC C ++ PK + ++C+
Sbjct: 109 GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCK-PCQQCFDQSTPIFDPKQSSSFYKISCS 167
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL-RLTNGSLLGPRLIFG 175
C A LP + C ++D C+Y Y D S+ GVL + F T + P L FG
Sbjct: 168 SELCGA--LPTST-C-SSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFG 223
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
CG + N G AG++GLG G S++SQL+
Sbjct: 224 CGND--NNGDGFSQGAGLVGLGRGPLSLVSQLK 254
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 90/207 (43%), Gaps = 23/207 (11%)
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC-----TLPPESLYHPKNNL 112
Y G Y + IG P Y + +DTGS WV C C L + Y P++++
Sbjct: 54 YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQCPHESDILRKLTFYDPRSSV 112
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD--HFPLRLTNGS 166
V C+D C++ C +C Y YAD G ++G+L TD H+ NG
Sbjct: 113 SSKEVKCDDTICTS-----RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 167
Query: 167 L--LGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-S 222
+ FGCG Q G++G G + LSQL + G T+ + HCL S
Sbjct: 168 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDS 227
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRD 249
GGG +G + P + TP+ ++
Sbjct: 228 TNGGGIFAIGEVVEPK--VKTTPIVKN 252
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 79/269 (29%), Positives = 120/269 (44%), Gaps = 58/269 (21%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+ + L +DTGS +T+V C + C C + + P +
Sbjct: 86 GYYTTRLWIGTPPQEFALIVDTGSTVTYVPC-SDCEHCGKHQDPRFQPDES--------- 135
Query: 121 SAFHLPENIRCEANDQCD-------YEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RL 172
S +H ++C + CD YE YA+ SS GVL D + N S + P R
Sbjct: 136 STYH---PVKCNMDCNCDHDGVNCVYERRYAEMSSSSGVLGED--IISFGNQSEVVPQRA 190
Query: 173 IFGCG-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV-- 223
+FGC Y+QR G++GLG G+ SI+ QL + +NV+ S+
Sbjct: 191 VFGCENVETGDLYSQR--------ADGIMGLGRGQLSIVDQL----VDKNVINDSFSLCY 238
Query: 224 ----RGGGYLFLGH-----DLVPSSGIAWTP--MSRDLLEKHYSSGPAELLFGGKSTGIK 272
GGG + LG D+V S + + +L E H + P +L ST +
Sbjct: 239 GGMHVGGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKL---SPSTFDR 295
Query: 273 GLQIIFDSGSSYTYFNSQAYKTTLDLMRK 301
+ DSG++Y Y +A+ D + K
Sbjct: 296 KHGTVLDSGTTYAYLPEEAFVAFRDAIIK 324
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 90/207 (43%), Gaps = 23/207 (11%)
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC-----TLPPESLYHPKNNL 112
Y G Y + IG P Y + +DTGS WV C C L + Y P++++
Sbjct: 78 YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQCPHESDILRKLTFYDPRSSV 136
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD--HFPLRLTNGS 166
V C+D C++ C +C Y YAD G ++G+L TD H+ NG
Sbjct: 137 SSKEVKCDDTICTS-----RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 191
Query: 167 L--LGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-S 222
+ FGCG Q G++G G + LSQL + G T+ + HCL S
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDS 251
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRD 249
GGG +G + P + TP+ ++
Sbjct: 252 TNGGGIFAIGEVVEPK--VKTTPIVKN 276
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 80/255 (31%), Positives = 111/255 (43%), Gaps = 38/255 (14%)
Query: 69 IGNPPKLYELDIDTGSDLTWVQCNAPCTGC------TLPPE--SLYHPKNNLVACNDPFC 120
IG PP+ + L +DTGS +T+V CN+ C C P+ YHP V CN P C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQPDLSDTYHP----VKCN-PDC 55
Query: 121 SAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFGCGY 178
+ C+ NDQC YE YA+ SS G+L D + N S L P R +FGC
Sbjct: 56 T---------CDTENDQCTYERQYAEMSSSSGILGEDL--VSFGNMSELKPQRAVFGC-E 103
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS--VRGGGYLFLGHDLV 236
N G++GLG G SI+ QL G+ + C GGG + LG +
Sbjct: 104 NAETGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQ-IS 162
Query: 237 PSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYFNSQ 290
P S + ++ D +Y+ L GK I I DSG++Y Y
Sbjct: 163 PPSDMVFSHSDPD-RSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEA 221
Query: 291 AYKTTLDLMRKDLKG 305
A+ + + +L G
Sbjct: 222 AFLPFIQAITSELHG 236
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 83/273 (30%), Positives = 116/273 (42%), Gaps = 38/273 (13%)
Query: 69 IGNPPKLYELDIDTGSDLTWVQCNAPCTGC------TLPPE--SLYHPKNNLVACNDPFC 120
IG PP+ + L +DTGS +T+V CN+ C C P+ YHP V CN P C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQPDLSDTYHP----VKCN-PDC 55
Query: 121 SAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFGCGY 178
+ C+ NDQC YE YA+ SS G+L D + N S L P R +FGC
Sbjct: 56 T---------CDTENDQCTYERQYAEMSSSSGILGEDL--VSFGNMSELKPQRAVFGC-E 103
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS--VRGGGYLFLGHDLV 236
N G++GLG G SI+ QL G+ + C GGG + LG +
Sbjct: 104 NAETGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQ-IS 162
Query: 237 PSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSGSSYTYFNSQ 290
P S + ++ D +Y+ L GK I I DSG++Y Y
Sbjct: 163 PPSDMVFSHSDPD-RSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEA 221
Query: 291 AYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
A+ + + +L G + VC+ G
Sbjct: 222 AFLPFIQAITSELHGLKQIRGPDPNYNDVCFSG 254
>gi|168021169|ref|XP_001763114.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685597|gb|EDQ71991.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 641
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 86/312 (27%), Positives = 120/312 (38%), Gaps = 69/312 (22%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC--TLPPESLYHPKNNL-VACNDPF 119
Y V +++G KL+ IDTGS +W+ C P P +Y P+ + V C P
Sbjct: 126 YYVKMRVGKSKKLFHFLIDTGSQPSWLHCKWPAIEKHPVAGPNGMYVPEKEVQVDCRSPE 185
Query: 120 C-SAFHLP---ENIR----C-EAND-QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
C S +P NIR C E ND +C Y++ Y D G V D L G L
Sbjct: 186 CLSLQRIPSNFNNIRNLFPCNEPNDWRCTYDITYLDRSHLRGFYVQDVVSLATLEGEQLD 245
Query: 170 PRLIFGCG--------------------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
++ G Y + P T G+LGL G S +SQL+
Sbjct: 246 AKITLGYATPNHRAAPFGFCSWHASSDRYGEEELERSPLTTDGLLGLNKGTESFVSQLKR 305
Query: 210 LG-LTRNVLGHCLSV-------RGGGYLFLGHD-LVPSSGIAWTPMSRDL---------- 250
G ++ +V+GHC G++F G L+ S I W+PM+
Sbjct: 306 QGAISSHVVGHCFRSLDTTDFETNSGFMFFGKSKLLDSLPITWSPMASPTSDGFILVVKL 365
Query: 251 -----LEKHYSSGPAELLFGGKSTGIK----------GLQIIFDSGSSYTYFNSQAYKTT 295
L++ S AE L+ IK II DSGS+ T+ Y
Sbjct: 366 KVPLPLKRDGQSSIAEYLYKVYVKKIKLGELSLEMTDKSNIIIDSGSTTTHILDSIYNPI 425
Query: 296 LDLMRKD--LKG 305
D + K LKG
Sbjct: 426 RDEVAKQALLKG 437
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 79/265 (29%), Positives = 119/265 (44%), Gaps = 44/265 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC------TLPPE--SLYHPKNNL 112
GYY+ L IG PP+++ L +DTGS +T+V C+ C C P+ S Y P
Sbjct: 79 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST-CEQCGRHQDPKFQPDLSSTYQPVKCT 137
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-R 171
+ CN C +N R QC YE YA+ +S GVL D + N S L P R
Sbjct: 138 LDCN---C------DNDRM----QCVYERQYAEMSTSSGVLGED--VVSFGNQSELAPQR 182
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV------RG 225
+FGC N G++GLG G SI+ QL + +NV+ S+ G
Sbjct: 183 AVFGCE-NVETGDLYSQHADGIMGLGRGDLSIMDQL----VDKNVVSDSFSLCYGGMDVG 237
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFD 279
GG + LG + P S + + S + +Y+ E+ GK + + D
Sbjct: 238 GGAMVLG-GISPPSDMVFA-QSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFDGKHGSVLD 295
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLK 304
SG++Y Y +A+ + + K+L+
Sbjct: 296 SGTTYAYLPEEAFLAFKEAIVKELQ 320
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 82/290 (28%), Positives = 113/290 (38%), Gaps = 32/290 (11%)
Query: 47 STAVFPIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL 105
ST P T G G Y VT+ +G P Y + DTGSD TWVQC C E L
Sbjct: 146 STPSLPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPL 205
Query: 106 YHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLR 161
+ P + V+C D C+ + C C Y V Y D ++G D L
Sbjct: 206 FDPAKSSTYANVSCTDSACADL---DTNGCTGG-HCLYAVQYGDGSYTVGFFAQD--TLT 259
Query: 162 LTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
+ + ++ G R FGCG K TAG++GLG GK S+ +Q+ +CL
Sbjct: 260 IAHDAIKGFR--FGCGEKNNGLFGK---TAGLMGLGRGKTSL--TVQAYNKYGGAFAYCL 312
Query: 222 SV--RGGGYLFLGHDLVPSSG---IAWTPMSRDLLEKHYSSGPAELLFGGKSTGI----- 271
G GYL D P S TPM D + Y G + GG+ +
Sbjct: 313 PALTTGTGYL----DFGPGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVF 368
Query: 272 KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
+ DSG+ T + AY K + + + L C+
Sbjct: 369 STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCY 418
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 100/246 (40%), Gaps = 35/246 (14%)
Query: 76 YELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN-----------LVACNDPFCSAFH 124
+ + IDTGSD+ WV CN C+ C P S + N L+ C+D C++
Sbjct: 81 FNVQIDTGSDILWVNCNT-CSNC--PQSSQLGIELNFFDTVGSSTAALIPCSDLICTSGV 137
Query: 125 LPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG----PRLIFGCGYN 179
C +QC Y Y D + G V+D L G ++FGC +
Sbjct: 138 QGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIVFGCSIS 197
Query: 180 QRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLV 236
Q K G+ G G G S++SQL S G+T V HCL GGG L LG L
Sbjct: 198 QSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGILVLGEILE 257
Query: 237 PSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI---------IFDSGSSYTYF 287
PS I ++P+ + HY+ + G+ I I D G++ Y
Sbjct: 258 PS--IVYSPLVPS--QPHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIVDCGTTLAYL 313
Query: 288 NSQAYK 293
+AY
Sbjct: 314 IQEAYD 319
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 90/207 (43%), Gaps = 23/207 (11%)
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC-----TLPPESLYHPKNNL 112
Y G Y + IG P Y + +DTGS WV C C L + Y P++++
Sbjct: 78 YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQCPHESDILRKLTFYDPRSSV 136
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD--HFPLRLTNGS 166
V C+D C++ C +C Y YAD G ++G+L TD H+ NG
Sbjct: 137 SSKEVKCDDTICTS-----RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 191
Query: 167 L--LGPRLIFGCGYNQRNP-GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-S 222
+ FGCG Q G++G G + LSQL + G T+ + HCL S
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDS 251
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRD 249
GGG +G + P + TP+ ++
Sbjct: 252 TNGGGIFAIGEVVEPK--VKTTPIVKN 276
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 117/273 (42%), Gaps = 59/273 (21%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCN--APCTGCTLPPESLYHPK----NNLVACNDP 118
V L IG PP++ + +DTGS L+W+QC+ AP PP + + P + + C P
Sbjct: 99 VDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKP---PPTASFDPSLSSTFSTLPCTHP 155
Query: 119 FCSA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C F LP + C+ N C Y YAD + G LV + F + SL P LI
Sbjct: 156 VCKPRIPDFTLPTS--CDQNRLCHYSYFYADGTYAEGNLVREKFTF---SRSLFTPPLIL 210
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR-------GGG 227
GC +P G+LG+ G+ S SQ + +T+ +C+ R G
Sbjct: 211 GCATESTDP-------RGILGMNRGRLSFASQSK---ITK--FSYCVPTRVTRPGYTPTG 258
Query: 228 YLFLGHDLVPSSG---------IAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGL---- 274
+LGH+ P+S A + +L Y+ + GG+ I
Sbjct: 259 SFYLGHN--PNSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRA 316
Query: 275 ------QIIFDSGSSYTYFNSQAY-KTTLDLMR 300
Q + DSGS +TY ++AY K +++R
Sbjct: 317 DAGGSGQTMLDSGSEFTYLVNEAYDKVRAEVVR 349
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 78/273 (28%), Positives = 113/273 (41%), Gaps = 27/273 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNNL----VAC 115
G Y V +++G P + + + DTGSD TWVQC PC C E L+ P + ++C
Sbjct: 159 GNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQ-PCVAYCYRQKEPLFDPTKSATYANISC 217
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
+ +CS ++ C C Y + Y D ++G D L L ++ R FG
Sbjct: 218 SSSYCSDLYVSG---CSGG-HCLYGIQYGDGSYTIGFYAQDT--LTLAYDTIKNFR--FG 269
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGH 233
CG R + AG+LGLG GK S+ +Q+ V +CL + G G+L LG
Sbjct: 270 CGEKNRGLFGR---AAGLLGLGRGKTSL--PVQAYDKYGGVFAYCLPATSAGTGFLDLGP 324
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG-----LQIIFDSGSSYTYFN 288
P++ TPM D Y G + GG I G + DSG+ T
Sbjct: 325 G-APAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLP 383
Query: 289 SQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
AY K ++G L C+
Sbjct: 384 PSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCY 416
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 83/165 (50%), Gaps = 11/165 (6%)
Query: 47 STAVFPITGNVYPL---GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
+T + I +V P+ + + IG+PP L IDTGSDLTW+QC PC C
Sbjct: 69 TTEIADIVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWIQC-LPCK-CYPQTI 126
Query: 104 SLYHPKNNLVACNDPFCSAFH-LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRL 162
+HP + N SA H +P+ R E C Y + Y D ++ G+L + +
Sbjct: 127 PFFHPSRSSTYRNASCESAPHAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQT 186
Query: 163 TNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQ 206
++ L+ P ++FGCG Q N G +GVLGLG G SI+++
Sbjct: 187 SDEGLISKPNIVFGCG--QDNSGFT--QYSGVLGLGPGTFSIVTR 227
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 119/290 (41%), Gaps = 40/290 (13%)
Query: 46 GSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL 105
GS A+F GN + +Y+ + IG P + + +D GSDL WV C+ C C P +
Sbjct: 79 GSDALF--LGNEFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWVPCD--CMQCA-PLSAS 132
Query: 106 YHPK---------------NNLVACNDPFCSAFHLPENIRCEANDQCDY-EVLYADHGSS 149
Y+ + + ++CND C L + + + D C Y Y+++ SS
Sbjct: 133 YYDRLGRDLNEYSPSLSSTSKPLSCNDQLC---ELGSDCK-SSKDPCPYLASYYSENTSS 188
Query: 150 LGVLVTDHFPL----RLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILS 205
G+L+ D L + S + +I GCG Q G++GLG G S+ S
Sbjct: 189 SGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS 248
Query: 206 QLQSLGLTRNVLGHCLSVRGGGYLFLGHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLF 264
L GL RN C G + G LV ++ P+ + Y L
Sbjct: 249 LLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVT--YLIEVEGYLV 306
Query: 265 GGKSTGIKGLQIIFDSGSSYTYFNSQAYKTT-------LDLMRKDLKGKP 307
G S G Q + DSG+S+T+ + Y+ ++ R KG P
Sbjct: 307 GSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSP 356
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 89/201 (44%), Gaps = 26/201 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V L IG PP + IDT SDL W QC PCTGC + +++P+ + + C+
Sbjct: 87 GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145
Query: 117 DPFCSAFHLPENIRCEANDQ--CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C + RC +D C Y Y+ + ++ G L D +L G + F
Sbjct: 146 SDTCDELDVH---RCGHDDDESCQYTYTYSGNATTEGTLAVD----KLVIGEDAFRGVAF 198
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLFL 231
GC + P PP +GV+GLG G S++SQL +CL + R G L L
Sbjct: 199 GCSTSSTGGAP-PPQASGVVGLGRGPLSLVSQLSV-----RRFAYCLPPPASRIPGKLVL 252
Query: 232 GHDLVPSSGIA---WTPMSRD 249
G D + PM RD
Sbjct: 253 GADADAARNATNRIAVPMRRD 273
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 71/254 (27%), Positives = 110/254 (43%), Gaps = 17/254 (6%)
Query: 62 YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT--LPPE------SLYHPKNNLV 113
+Y+V + +G P + + +DTGSDL WV C+ C C P+ +Y P+ +
Sbjct: 99 HYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CIKCAPLASPDYGDLKFDMYSPRKSST 155
Query: 114 ACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTNGS--LLGP 170
+ P S+ P+ A++ C Y + Y +++ SS GVLV D L +G +
Sbjct: 156 SRKVPCSSSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTESGQSKITQA 215
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF 230
+ FGCG Q G+LGLG+ S+ S L S G+ N C G G +
Sbjct: 216 PITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCFGEDGHGRIN 275
Query: 231 LGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQ 290
G SS TP++ +Y+ + GGKS K + DSG+S+T +
Sbjct: 276 FGD--TGSSDQLETPLNIYKQNPYYNISITGAMVGGKSFDTK-FSAVVDSGTSFTALSDP 332
Query: 291 AYKTTLDLMRKDLK 304
Y +K
Sbjct: 333 MYTEITSTFNAQVK 346
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 92/336 (27%), Positives = 146/336 (43%), Gaps = 44/336 (13%)
Query: 7 RVMGLLVLLMFATFQGCFSEANQPPSKKKST---QSTAAHRFGSTAVFPITGNVYPLGYY 63
R+ L ++ Q S+ + +K+ T S+ + G +G G Y
Sbjct: 96 RIQTLHKRVLAKKNQNTVSQKQKKKNKEVVTTPVASSVEEQAGQLVATLESGMTLGSGEY 155
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPF 119
+ + +G+PPK + L +DTGSDL W+QC PC C + Y PK + + CNDP
Sbjct: 156 FMDVLVGSPPKHFSLILDTGSDLNWIQC-LPCHDCFQQNGAFYDPKASASYKNITCNDPR 214
Query: 120 CSAFHLPENIR-CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLT----NGSLLG-PRL 172
C+ P+ + C++++Q C Y Y D ++ G + F + LT + L +
Sbjct: 215 CNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVENM 274
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG-----GG 227
+FGCG+ R AG+LGLG G S SQLQS L + +CL R
Sbjct: 275 MFGCGHWNRGLFHG---AAGLLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDTNVSS 329
Query: 228 YLFLGH--DLVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGK------------STG 270
L G DL+ + +T +L++ Y ++ G+ S G
Sbjct: 330 KLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDG 389
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
G I DSG++ +YF AY+ + + + KGK
Sbjct: 390 AGG--TIIDSGTTLSYFAEPAYEFIKNKIAEKAKGK 423
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 119/290 (41%), Gaps = 40/290 (13%)
Query: 46 GSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL 105
GS A+F GN + +Y+ + IG P + + +D GSDL WV C+ C C P +
Sbjct: 89 GSDALF--LGNEFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWVPCD--CMQCA-PLSAS 142
Query: 106 YHPK---------------NNLVACNDPFCSAFHLPENIRCEANDQCDY-EVLYADHGSS 149
Y+ + + ++CND C L + + + D C Y Y+++ SS
Sbjct: 143 YYDRLGRDLNEYSPSLSSTSKPLSCNDQLC---ELGSDCK-SSKDPCPYLASYYSENTSS 198
Query: 150 LGVLVTDHFPL----RLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILS 205
G+L+ D L + S + +I GCG Q G++GLG G S+ S
Sbjct: 199 SGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS 258
Query: 206 QLQSLGLTRNVLGHCLSVRGGGYLFLGHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLF 264
L GL RN C G + G LV ++ P+ + Y L
Sbjct: 259 LLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVT--YLIEVEGYLV 316
Query: 265 GGKSTGIKGLQIIFDSGSSYTYFNSQAYKTT-------LDLMRKDLKGKP 307
G S G Q + DSG+S+T+ + Y+ ++ R KG P
Sbjct: 317 GSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSP 366
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 78/280 (27%), Positives = 121/280 (43%), Gaps = 36/280 (12%)
Query: 36 STQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC 95
ST S FGS V +G G Y V + +G+PP+ + ID+GSD+ WVQC PC
Sbjct: 19 STASYGVEDFGSEVV---SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCK-PC 74
Query: 96 TGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLG 151
T C + L+ P ++ V+C+ C +N C + +C YEV Y D S+ G
Sbjct: 75 TQCYHQTDPLFDPADSASFMGVSCSSAVCDQV---DNAGCNSG-RCRYEVSYGDGSSTKG 130
Query: 152 VLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLG 211
L + LT G + + GCG+ + G+ G + LS+ +
Sbjct: 131 TLALE----TLTLGRTVVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVGQLSRERG-- 184
Query: 212 LTRNVLGHCLSVR---GGGYLFLGHDLVPSSGIAWTPMSRDLLE-KHYSSGPAELLFGGK 267
N +CL R G+L G + +P G AW P+ R+ +Y G + L G
Sbjct: 185 ---NAFSYCLVSRVTNSNGFLEFGSEAMP-VGAAWIPLIRNPHSPSYYYIGLSGLGVGDM 240
Query: 268 S----------TGIKGLQIIFDSGSSYTYFNSQAYKTTLD 297
T + ++ D+G++ T F + AY+ D
Sbjct: 241 KVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRD 280
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 78/273 (28%), Positives = 113/273 (41%), Gaps = 27/273 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNNL----VAC 115
G Y V +++G P + + + DTGSD TWVQC PC C E L+ P + ++C
Sbjct: 94 GNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQ-PCVAYCYRQKEPLFDPTKSATYANISC 152
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
+ +CS ++ C C Y + Y D ++G D L L ++ R FG
Sbjct: 153 SSSYCSDLYVSG---CSGG-HCLYGIQYGDGSYTIGFYAQDT--LTLAYDTIKNFR--FG 204
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGH 233
CG R + AG+LGLG GK S+ +Q+ V +CL + G G+L LG
Sbjct: 205 CGEKNRGLFGR---AAGLLGLGRGKTSL--PVQAYDKYGGVFAYCLPATSAGTGFLDLGP 259
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG-----LQIIFDSGSSYTYFN 288
P++ TPM D Y G + GG I G + DSG+ T
Sbjct: 260 G-APAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLP 318
Query: 289 SQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
AY K ++G L C+
Sbjct: 319 PSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCY 351
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 124/294 (42%), Gaps = 25/294 (8%)
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVA 114
GN Y +Y+ + IG P + + +D GSDL W+ C+ C C S Y + +
Sbjct: 93 GNDYGWLHYT-WIDIGTPNISFLVALDAGSDLLWIPCD--CIQCAPLSASYYGSLDRDLN 149
Query: 115 CNDPFCSAF--HLP-ENIRCEANDQCD-------YEV-LYADHGSSLGVLVTDHFPLR-- 161
P S+ HL + CE++ CD Y + Y+++ SS G+L+ D L
Sbjct: 150 QYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSG 209
Query: 162 ---LTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
+N S+ P +I GCG Q G++GLGLG+ S+ S L GL +N
Sbjct: 210 IDDASNSSVRAP-VIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFS 268
Query: 219 HCLSVRGGGYLFLGHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQII 277
C + G +F G L + P D + Y G G + +
Sbjct: 269 LCFNDDDSGRIFFGDQGLATQQTTLFLP--SDGKYETYIVGVEACCIGSSCIKQTSFRAL 326
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCLLGN 331
DSG+S+T+ ++Y+ +D K + + E C+K + K LL N
Sbjct: 327 VDSGASFTFLPDESYRNVVDEFDKQVNATRF--SFEGYPWEYCYKSSSKELLKN 378
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 124/294 (42%), Gaps = 25/294 (8%)
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVA 114
GN Y +Y+ + IG P + + +D GSDL W+ C+ C C S Y + +
Sbjct: 74 GNDYGWLHYT-WIDIGTPNISFLVALDAGSDLLWIPCD--CIQCAPLSASYYGSLDRDLN 130
Query: 115 CNDPFCSAF--HLP-ENIRCEANDQCD-------YEV-LYADHGSSLGVLVTDHFPLR-- 161
P S+ HL + CE++ CD Y + Y+++ SS G+L+ D L
Sbjct: 131 QYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSG 190
Query: 162 ---LTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
+N S+ P +I GCG Q G++GLGLG+ S+ S L GL +N
Sbjct: 191 IDDASNSSVRAP-VIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFS 249
Query: 219 HCLSVRGGGYLFLGHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQII 277
C + G +F G L + P D + Y G G + +
Sbjct: 250 LCFNDDDSGRIFFGDQGLATQQTTLFLP--SDGKYETYIVGVEACCIGSSCIKQTSFRAL 307
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCLLGN 331
DSG+S+T+ ++Y+ +D K + + E C+K + K LL N
Sbjct: 308 VDSGASFTFLPDESYRNVVDEFDKQVNATRF--SFEGYPWEYCYKSSSKELLKN 359
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 134/325 (41%), Gaps = 57/325 (17%)
Query: 43 HRFGSTAVFPITGNVYPLGYY---------------SVTLKIGNPPKLYELDIDTGSDLT 87
H T P+ V P GY ++++ +G PP+ + IDTGS+L+
Sbjct: 31 HCEAKTLALPLKSQVIPSGYLPRPPNKLRFHHNVSLTISITVGTPPQNMSMVIDTGSELS 90
Query: 88 WVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSA----FHLPENIRCEANDQCDY 139
W+ CN T P ++P + ++C+ P C+ F +P + C++N+ C
Sbjct: 91 WLHCNTNTTATI--PYPFFNPNISSSYTPISCSSPTCTTRTRDFPIPAS--CDSNNLCHA 146
Query: 140 EVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPP-TAGVLGLGL 198
+ YAD SS G L +D F GS P ++FGC + + + T G++G+ L
Sbjct: 147 TLSYADASSSEGNLASDTFGF----GSSFNPGIVFGCMNSSYSTNSESDSNTTGLMGMNL 202
Query: 199 GKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSG-IAWTPMSRD------LL 251
G S++SQL+ + + G S G L LG G + +TP+ +
Sbjct: 203 GSLSLVSQLKIPKFSYCISGSDFS----GILLLGESNFSWGGSLNYTPLVQISTPLPYFD 258
Query: 252 EKHYSSGPAELLFGGKSTGIKGL----------QIIFDSGSSYTYFNSQAYKTTLDLMRK 301
Y+ + K I G Q +FD G+ ++Y Y D
Sbjct: 259 RSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLN 318
Query: 302 DLKG--KPLEDT--AEEKALPVCWK 322
G + L+D + A+ +C++
Sbjct: 319 QTNGTLRALDDPNFVFQIAMDLCYR 343
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 80/153 (52%), Gaps = 12/153 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G + + L IG+PP+ + +DTGSDL W QC PC C ++ PK + ++C+
Sbjct: 364 GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCK-PCQQCFDQSTPIFDPKQSSSFYKISCS 422
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL-RLTNGSLLGPRLIFG 175
C A LP + C ++D C+Y Y D S+ GVL + F T + P L FG
Sbjct: 423 SELCGA--LPTST-C-SSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFG 478
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
CG + N G AG++GLG G S++SQL+
Sbjct: 479 CGND--NNGDGFSQGAGLVGLGRGPLSLVSQLK 509
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 74/258 (28%), Positives = 110/258 (42%), Gaps = 35/258 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV----ACNDP 118
Y +T+++G+P K + ID+GSD++WVQC PC C + L+ P + +C+
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCK-PCLQCHSQVDPLFDPSLSSTYSPFSCSSA 189
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC-- 176
C+ N C ++ QC Y V YAD S+ G +D L GS FGC
Sbjct: 190 ACAQLGQDGN-GCSSSSQCQYIVRYADGSSTTGTYSSDTLAL----GSNTISNFQFGCSH 244
Query: 177 ---GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFL 231
G+N T G++GLG G S+ S Q+ G +CL + G+L L
Sbjct: 245 VESGFNDL--------TDGLMGLGGGAPSLAS--QTAGTFGTAFSYCLPPTPSSSGFLTL 294
Query: 232 GHDLVPSSGIAWTPMSRDL-LEKHYSSGPAELLFGGKS----TGIKGLQIIFDSGSSYTY 286
G +SG TPM R + Y + GG T + ++ DSG+ T
Sbjct: 295 GAG---TSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAGMVMDSGTIITR 351
Query: 287 FNSQAYKTTLDLMRKDLK 304
AY + +K
Sbjct: 352 LPRTAYSALSSAFKAGMK 369
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 96/216 (44%), Gaps = 34/216 (15%)
Query: 32 SKKKSTQSTAAHRFGSTAVF----------PITG-NVYPLG--YYSVTLKIGNPPKLYEL 78
SK ++ +A ++A F P TG +V P G Y V L IG PP+
Sbjct: 58 SKARAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVDLAIGTPPQPVSA 117
Query: 79 DIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIR---CEAND 135
+DTGSDL W QC APC C P+ L+ P + A +P A L +I CE D
Sbjct: 118 LLDTGSDLIWTQC-APCASCLAQPDPLFAPGES--ASYEPMRCAGQLCSDILHHGCEMPD 174
Query: 136 QCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGPRLIFGCG---YNQRNPGPKPPPTA 191
C Y Y D ++GV T+ F + G L+ L FGCG N G +
Sbjct: 175 TCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNG------S 228
Query: 192 GVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
G++G G S++SQ L+ +CL+ G G
Sbjct: 229 GIVGFGRNPLSLVSQ-----LSIRRFSYCLTSYGSG 259
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 83/184 (45%), Gaps = 22/184 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V L +G P + IDT SDL W QC PC C + +++P + +V CN
Sbjct: 86 GEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQ-PCVKCYKQLDPVFNPVASTSYAVVPCN 144
Query: 117 DPFCSAFHLPENIR---CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
C R + D C Y Y + ++ G+L D RL G + ++
Sbjct: 145 SDTCDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVD----RLAIGDDVFRGVV 200
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLF 230
FGC + + G PP +GV+GLG G S++SQL R + +CL R G L
Sbjct: 201 FGC--SSSSVGGPPPQVSGVVGLGRGALSLVSQLS----VRRFM-YCLPPPVSRSAGRLV 253
Query: 231 LGHD 234
LG D
Sbjct: 254 LGAD 257
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 89/265 (33%), Positives = 113/265 (42%), Gaps = 32/265 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC--TGCTLPPESLYHPKNNL----VACN 116
Y VTL G P L +DTGSD++WVQC PC T C + L+ P + +ACN
Sbjct: 131 YVVTLGFGTPSVPQVLLMDTGSDVSWVQCT-PCNSTKCYPQKDPLFDPSKSSTYAPIACN 189
Query: 117 DPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C + C + QC Y V YAD S GV + L L G + FG
Sbjct: 190 TDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNET--LTLAPGITV-EDFHFG 246
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG--GGYLFLGH 233
CG +QR P K G+LGLG S++ Q S + +CL G+L LG
Sbjct: 247 CGRDQRGPSDK---YDGLLGLGGAPVSLVVQTSS--VYGGAFSYCLPALNSEAGFLVLGS 301
Query: 234 DLVPS---SGIAWTPMSRDL--LEKHYSSGPAELLFGGK-----STGIKGLQIIFDSGSS 283
PS S +TPM R L Y + GGK + +G II DSG+
Sbjct: 302 P--PSGNKSAFVFTPM-RHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMII-DSGTV 357
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKPL 308
T AY +RK LK PL
Sbjct: 358 DTELPETAYNALEAALRKALKAYPL 382
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 78/281 (27%), Positives = 117/281 (41%), Gaps = 50/281 (17%)
Query: 52 PITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN 111
P++ N Y Y + L IG PP DTGSDL W+QC PCT C ++ +++
Sbjct: 51 PVSANHYD---YLMELSIGTPPVKIYAQADTGSDLIWLQC-IPCTNCYKQLNPMFDSQSS 106
Query: 112 L----VACNDPFCSAFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTDHFPLRLTNGS 166
+AC CS + + C + C Y Y D + GVL + L T G
Sbjct: 107 STFSNIACGSESCSKLY---STSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGE 163
Query: 167 LLGPR-LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG 225
+ + +IFGCG+N N G G++GLG G S++SQ+ S L N+ CL
Sbjct: 164 PVAFKGVIFGCGHN--NNGAFNDKEMGIIGLGRGPLSLVSQIGS-SLGGNMFSQCL---- 216
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYT 285
+ + PS +PMS GK + + G ++ S T
Sbjct: 217 -----VPFNTNPS---ISSPMSF-----------------GKGSEVLGNGVVSTPLVSKT 251
Query: 286 YFNSQAYKTTLDLMRKDLK-----GKPLEDTAEEKALPVCW 321
+ S + T L + +D+ G LE A+ +P W
Sbjct: 252 TYQSFYFVTLLGISVEDINLPFNAGSSLEPAAKGNVIPQIW 292
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 79/164 (48%), Gaps = 9/164 (5%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP--KNNLVACNDP 118
G Y++ +++G+PPK + +DTGSDL W+QC PC+ C + +Y P + +
Sbjct: 2 GAYTMEIELGSPPKKFNAIVDTGSDLVWIQCK-PCSQCYSQSDPIYDPSASSTFAKTSCS 60
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFGCG 177
S LP + + C Y Y D S+ G + LR + GS P FGCG
Sbjct: 61 TSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCG 120
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
+ N G AG++GLG GK S+ +QL S N +CL
Sbjct: 121 --RLNSG-SFGGAAGIVGLGQGKISLSTQLGS--AINNKFSYCL 159
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 121/261 (46%), Gaps = 24/261 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y +++ IG PP Y DTGSDL W QC PC C ++ P + V CN
Sbjct: 90 GEYLMSVSIGTPPVDYIGMADTGSDLMWAQC-LPCLKCYKQSRPIFDPLKSTSFSHVPCN 148
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A ++ C A CDY Y D + G L + ++T GS + + GC
Sbjct: 149 SQNCKAI---DDSHCGAQGVCDYSYTYGDQTYTKGDLGFE----KITIGS-SSVKSVIGC 200
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLSV---RGGGYLFLG 232
G+ +GV+GLG G+ S++SQ+ Q+ G++R +CL G + G
Sbjct: 201 GHESGG---GFGFASGVIGLGGGQLSLVSQMSQTSGISRR-FSYCLPTLLSHANGKINFG 256
Query: 233 HDLVPSS-GIAWTPM-SRDLLEKHYSSGPAELLFGGKSTG-IKGLQIIFDSGSSYTYFNS 289
+ V S G+ TP+ S++ + +Y + A + + K +I DSG++ ++
Sbjct: 257 QNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQGNVIIDSGTTLSFLPK 316
Query: 290 QAYKTTLDLMRKDLKGKPLED 310
+ Y + + K +K K ++D
Sbjct: 317 ELYDGVVSSLLKVVKAKRVKD 337
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/255 (27%), Positives = 107/255 (41%), Gaps = 24/255 (9%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPE---SL 105
T + LG+ + T+++G P + + +DTGSDL WV C+ AP G + S+
Sbjct: 91 TFRISSLGFLHYTTVELGTPGMKFMVALDTGSDLFWVPCDCSKCAPTQGVAYASDFELSI 150
Query: 106 YHPKNN----LVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYAD-HGSSLGVLVTD--H 157
Y PK + V CN+ C+ RC C Y V Y S+ G+LV D H
Sbjct: 151 YDPKQSSTSKKVTCNNNLCA-----HRNRCLGTFSSCPYMVSYVSAQTSTSGILVEDVLH 205
Query: 158 FPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVL 217
+N + + FGCG Q G+ GLG+ + S+ S L GLT +
Sbjct: 206 LTSEDSNQESIKAYVTFGCGQVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTADSF 265
Query: 218 GHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQII 277
C G G + G P TP + + Y+ ++ G + +
Sbjct: 266 SMCFGHDGVGRISFGDKGSPDQ--EETPFNSNPSHPSYNISVTQVRVGTTLVDVD-FTAL 322
Query: 278 FDSGSSYTYFNSQAY 292
FDSG+S+TY + Y
Sbjct: 323 FDSGTSFTYLINPIY 337
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 119/271 (43%), Gaps = 55/271 (20%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDPFC 120
V+L IG PP+ ++ +DTGS L+W+QC+ PP S++ P +++ CN P C
Sbjct: 84 VSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPR-KPPPSSVFDPSLSSSFSVLPCNHPLC 142
Query: 121 SA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
F LP + C+ N C Y YAD + G LV + + + P LI GC
Sbjct: 143 KPRIPDFTLPTS--CDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQST---PPLILGC 197
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGG----GYL 229
+ G+LG+ LG+ S SQ + LT+ +C+ VR G G
Sbjct: 198 AEESSD-------AKGILGMNLGRLSFASQAK---LTK--FSYCVPTRQVRPGFTPTGSF 245
Query: 230 FLGHDLVPSSG-------IAWTPMSR--DLLEKHYSSGPAELLFGGKSTGIK-------- 272
+LG + P+SG + ++ R +L Y+ + G + I
Sbjct: 246 YLGEN--PNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDP 303
Query: 273 --GLQIIFDSGSSYTYFNSQAY-KTTLDLMR 300
Q + DSGS +TY +AY K +++R
Sbjct: 304 SGAGQTMIDSGSEFTYLVDEAYNKVREEVVR 334
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/251 (29%), Positives = 109/251 (43%), Gaps = 43/251 (17%)
Query: 69 IGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFH 124
IG P Y +DTGSDL W QC PC C ++ P ++ V C+ CS
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSASCS--D 229
Query: 125 LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPG 184
LP + +C + +C Y Y D S+ GVL T+ F L + P ++FGCG N G
Sbjct: 230 LPTS-KCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LPGVVFGCG--DTNEG 282
Query: 185 PKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---------GGYLFLGHDL 235
AG++GLG G S++SQ LGL + +CL+ G +
Sbjct: 283 DGFSQGAGLVGLGRGPLSLVSQ---LGLDK--FSYCLTSLDDTNNSPLLLGSLAGISEAS 337
Query: 236 VPSSGIAWTPMSRDLLE--------KHYSSGPAEL-----LFGGKSTGIKGLQIIFDSGS 282
+S + TP+ ++ + K + G + F + G G +I DSG+
Sbjct: 338 AAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG--VIVDSGT 395
Query: 283 SYTYFNSQAYK 293
S TY Q Y+
Sbjct: 396 SITYLEVQGYR 406
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 121/286 (42%), Gaps = 39/286 (13%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNN----LVACNDP 118
+V+L +G PP+ + +DTGS+L+W+ C TG + P+ + V C
Sbjct: 62 TVSLAVGTPPQNVTMVLDTGSELSWLLC---ATGRAAAAAADSFRPRASATFAAVPCGSA 118
Query: 119 FCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
CS+ LP C+ A+ +C + YAD +S G L TD F + G R FGC
Sbjct: 119 RCSSRDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAV----GDAPPLRSAFGCM 174
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR-GGGYLFLGHDLV 236
+ P TAG+LG+ G S ++Q TR +C+S R G L LGH +
Sbjct: 175 SAAYDSSPDAVATAGLLGMNRGALSFVTQAS----TRR-FSYCISDRDDAGVLLLGHSDL 229
Query: 237 PSSGIAWTPMSRD------LLEKHYSSGPAELLFGGKSTGIK----------GLQIIFDS 280
P + +TP+ + YS + GGK I Q + DS
Sbjct: 230 PFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDS 289
Query: 281 GSSYTYFNSQAYKTTLDLMRKDLKG--KPLEDT--AEEKALPVCWK 322
G+ +T+ AY K K LED A ++A C++
Sbjct: 290 GTQFTFLLGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFR 335
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/277 (27%), Positives = 120/277 (43%), Gaps = 40/277 (14%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT-LPPESLYHPKNNL----VAC 115
G Y V L+IG PP+ L DTGSDL WV+C+A C C+ P +++ P+++ C
Sbjct: 81 GQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHSSTFSPAHC 139
Query: 116 NDPFCSAFHLPENI-RC---EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR 171
DP C P RC + C YE YAD + G+ + L+ ++G +
Sbjct: 140 YDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLK 199
Query: 172 LI-FGCGYN---QRNPGPKPPPTAGVLGLGLGKASILSQL-QSLG--LTRNVLGHCLSVR 224
+ FGCG+ Q G GV+GLG G S SQL + G + ++ + LS
Sbjct: 200 SVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPP 259
Query: 225 GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI-------- 276
YL +G S + +TP+ + L P KS + G ++
Sbjct: 260 PTSYLIIGDGGDAVSKLFFTPLLTNPLS------PTFYYVKLKSVFVNGAKLRIDPSIWE 313
Query: 277 ---------IFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
+ DSG++ + AY+ + +++ +K
Sbjct: 314 IDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIK 350
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 117/253 (46%), Gaps = 24/253 (9%)
Query: 69 IGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFH 124
IG PP Y DTGSDLTW QC PC C +++P + V CN C H
Sbjct: 86 IGTPPVDYLGIADTGSDLTWAQC-LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTC---H 141
Query: 125 LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPG 184
++ C CDY Y D S G L + ++T GS + + GCG+
Sbjct: 142 AVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFE----KITIGS-SSVKSVIGCGHASSG-- 194
Query: 185 PKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLSV---RGGGYLFLGHDLVPSS- 239
+GV+GLG G+ S++SQ+ Q+ G++R +CL G + G + V S
Sbjct: 195 -GFGFASGVIGLGGGQLSLVSQMSQTSGISRR-FSYCLPTLLSHANGKINFGQNAVVSGP 252
Query: 240 GIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-KGLQIIFDSGSSYTYFNSQAYKTTLD 297
G+ TP+ S++ + +Y + A + + K +I DSG++ ++ + Y +
Sbjct: 253 GVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQGNVIIDSGTTLSFLPKELYDGVVS 312
Query: 298 LMRKDLKGKPLED 310
+ K +K K ++D
Sbjct: 313 SLLKVVKAKRVKD 325
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/244 (29%), Positives = 107/244 (43%), Gaps = 25/244 (10%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPE---SLYHPK----NNLV 113
T+++G P + + +DTGSDL WV C+ AP G + S+Y PK + V
Sbjct: 114 TTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYASDFELSVYSPKKSSTSKTV 173
Query: 114 ACNDPFCSAFHLPENIRC-EANDQCDYEVLYAD-HGSSLGVLVTD--HFPLRLTNGSLLG 169
CN+ C+ + +C EA C Y V Y S+ G+L+ D H + +
Sbjct: 174 PCNNNLCA-----QRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTEHKHSEPIQ 228
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYL 229
+ FGCG Q G+ GLG+ + S+ S L GL N C S G G +
Sbjct: 229 AYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGVGRI 288
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG-LQIIFDSGSSYTYFN 288
G S TP + + L +Y+ + G +T I + +FDSG+S++YF
Sbjct: 289 NFGDK--GSLEQEETPFNLNQLHPNYNITVTSIRVG--TTLIDADITALFDSGTSFSYFT 344
Query: 289 SQAY 292
Y
Sbjct: 345 DPIY 348
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/279 (27%), Positives = 111/279 (39%), Gaps = 29/279 (10%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---------LYHPK- 109
L Y +VT +G P + + +DTGSDL W+ C+ CT C ++ +Y P
Sbjct: 103 LHYANVT--VGTPSDWFLVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNA 158
Query: 110 ---NNLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHG-SSLGVLVTD--HFPLRL 162
+ V CN C+ RC + C Y++ Y +G SS GVLV D H
Sbjct: 159 SSTSTKVPCNSTLCT-----RGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213
Query: 163 TNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
+ + R+ GCG Q G+ GLGL S+ S L G+ N C
Sbjct: 214 KSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG 273
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGS 282
G G + G S TP++ Y+ ++ G +TG +FDSG+
Sbjct: 274 NDGAGRISFGDK--GSVDQRETPLNIRQPHPTYNITVTKISVEG-NTGDLEFDAVFDSGT 330
Query: 283 SYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
S+TY AY + K + T E C+
Sbjct: 331 SFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCY 369
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 75/153 (49%), Gaps = 15/153 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V L IG PP + IDT SDL W QC PCTGC + +++P+ + + C+
Sbjct: 87 GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145
Query: 117 DPFCSAFHLPENIRC--EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C + RC + ++ C Y Y+ + ++ G L D +L G + F
Sbjct: 146 SDTCDELDVH---RCGHDDDESCQYTYTYSGNATTEGTLAVD----KLVIGEDAFRGVAF 198
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
GC + P PP +GV+GLG G S++SQL
Sbjct: 199 GCSTSSTGGAP-PPQASGVVGLGRGPLSLVSQL 230
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/257 (29%), Positives = 111/257 (43%), Gaps = 30/257 (11%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCN----APCTGC---TLPPESL--YHPKNN----LV 113
+ IG P + + +DTGSDL W+ CN AP T +L + L Y+P ++ +
Sbjct: 104 IDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163
Query: 114 ACNDPFCSAFHLPENIRCEA-NDQCDYEVLY-ADHGSSLGVLVTDHFPL------RLTNG 165
C+ C + CE+ +QC Y V Y + + SS G+LV D L RL NG
Sbjct: 164 LCSHKLCDSAS-----DCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218
Query: 166 -SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR 224
S + R++ GCG Q G++GLG + S+ S L GL RN C
Sbjct: 219 SSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEE 278
Query: 225 GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSY 284
G ++ G D+ PS + TP + Y G G DSG S+
Sbjct: 279 DSGRIYFG-DMGPSIQQS-TPFLQLENNSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSF 336
Query: 285 TYFNSQAY-KTTLDLMR 300
TY + Y K L++ R
Sbjct: 337 TYLPEEIYRKVALEIDR 353
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 114/256 (44%), Gaps = 42/256 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y + L IG P + + +DTGSDL W QC PCT C +++P+ + + C+
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCS 151
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A P C +N+ C Y Y D + G + T+ LT GS+ P + FGC
Sbjct: 152 SQLCQALQSP---TC-SNNSCQYTYGYGDGSETQGSMGTE----TLTFGSVSIPNITFGC 203
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---GGYLFLGH 233
G N N G AG++G+G G S+ SQL +T+ +C++ G L LG
Sbjct: 204 GEN--NQGFGQGNGAGLVGMGRGPLSLPSQLD---VTK--FSYCMTPIGSSNSSTLLLGS 256
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGL----------------QII 277
+ +S A +P + L++ + G S G L II
Sbjct: 257 --LANSVTAGSP-NTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGII 313
Query: 278 FDSGSSYTYFNSQAYK 293
DSG++ TYF AY+
Sbjct: 314 IDSGTTLTYFVDNAYQ 329
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 125/294 (42%), Gaps = 32/294 (10%)
Query: 47 STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESL 105
ST + +G++ Y V + +G P + L DTGSDLTW QC PC G C +++
Sbjct: 30 STTLPAESGSLIGSANYVVVVGLGTPKRDLSLVFDTGSDLTWTQCE-PCAGSCYKQQDAI 88
Query: 106 YHPKNNL----VACNDPFCSAFHLPENIRCE----ANDQCDYEVLYADHGSSLGVLVTDH 157
+ P + + C C+ + I+ E + C Y+ Y D+ +S+G L +
Sbjct: 89 FDPSKSSSYTNITCTSSLCTQLT-SDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQER 147
Query: 158 FPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVL 217
+ T+ + +FGCG Q N G +AG++GLG SI+ Q S +
Sbjct: 148 LTITATD---IVDDFLFGCG--QDNEGL-FNGSAGLMGLGRHPISIVQQTSS--NYNKIF 199
Query: 218 GHCLSVRGG--GYLFLGHDLVPSSGIAWTPMSRDLLEKHY--------SSGPAELLFGGK 267
+CL G+L G ++ + +TP+S + + S G +L
Sbjct: 200 SYCLPATSSSLGHLTFGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSS 259
Query: 268 STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
ST G II DSG+ T Y R+ ++ P+ + E L C+
Sbjct: 260 STFSAGGSII-DSGTVITRLAPTVYAALRSAFRRXMEKYPVAN--EAGLLDTCY 310
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 111/271 (40%), Gaps = 44/271 (16%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y + L IG PP+ +DTGSDL W QC APC C P+ L+ P + + C+
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLAQPDPLFAPAASSSYVPMRCSGQ 161
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG- 177
C+ + C+ D C Y Y D ++LGV T+ F ++G L L FGCG
Sbjct: 162 LCNDIL---HHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGCGT 218
Query: 178 --YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRGGGYLF- 230
N G +G++G G S++SQ L+ +CL S R +F
Sbjct: 219 MNVGSLNNG------SGIVGFGRDPLSLVSQ-----LSIRRFSYCLTPYTSTRKSTLMFG 267
Query: 231 -LGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGIKGLQI------------ 276
L + A + + LL+ + + F G + G + L+I
Sbjct: 268 SLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGS 327
Query: 277 ---IFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
I DSG++ T F + L R L+
Sbjct: 328 GGVIVDSGTALTLFPAAVLTEVLRAFRAQLR 358
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 120/281 (42%), Gaps = 38/281 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y + + IG P +DTGSDL W QC PCT C P +++P++ + + C
Sbjct: 94 GEYLMNVAIGTPASSLSAIMDTGSDLIWTQCE-PCTQCFSQPTPIFNPQDSSSFSTLPCE 152
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
+C LP + C Y Y D S+ G + T+ F ++ P + FGC
Sbjct: 153 SQYCQ--DLPSE---SCYNDCQYTYGYGDGSSTQGYMATETFTFETSS----VPNIAFGC 203
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY---LFLGH 233
G + N G AG++G+G G S+ SQ LG+ + +C++ G L LG
Sbjct: 204 G--EDNQGFGQGNGAGLIGMGWGPLSLPSQ---LGVGQ--FSYCMTSSGSSSPSTLALGS 256
Query: 234 DL--VPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG----LQ------IIFDSG 281
VP + T + L +Y + GG + GI LQ +I DSG
Sbjct: 257 AASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSG 316
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
++ TY AY + P+++++ L C++
Sbjct: 317 TTLTYLPQDAYNAVAQAFTDQINLSPVDESS--SGLSTCFQ 355
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 123/282 (43%), Gaps = 37/282 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G PPK + L +DTGSDL W+QC PC C Y PK++ + C+
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYACFEQNGPYYDPKDSSSFKNITCH 251
Query: 117 DPFCSAFHLPENIR-CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGS-----LLG 169
DP C P+ + C+ Q C Y Y D ++ G + F + LT +
Sbjct: 252 DPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIV 311
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG-- 227
++FGCG+ R AG+LGLG G S +QLQSL + +CL R
Sbjct: 312 ENVMFGCGHWNRGLFHG---AAGLLGLGRGPLSFATQLQSL--YGHSFSYCLVDRNSNSS 366
Query: 228 ---YLFLGHD--LVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGKSTGIK------- 272
L G D L+ + +T + ++ Y ++ GG+ I
Sbjct: 367 VSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLS 426
Query: 273 ---GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDT 311
G I DSG++ TYF AY+ + + +KG PL +T
Sbjct: 427 AQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVET 468
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/259 (29%), Positives = 111/259 (42%), Gaps = 44/259 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G + + + IG P Y IDTGSDL W QC PC C ++ P ++ + C+
Sbjct: 100 GEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCK-PCVECFNQSTPVFDPSSSSTYAALPCS 158
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CS LP + +C + +C Y Y D S+ GVL + F L T P + FGC
Sbjct: 159 STLCS--DLPSS-KCTSA-KCGYTYTYGDSSSTQGVLAAETFTLAKTK----LPDVAFGC 210
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---------GG 227
G N G AG++GLG G S++SQ LGL N +CL+ G
Sbjct: 211 G--DTNEGDGFTQGAGLVGLGRGPLSLVSQ---LGL--NKFSYCLTSLDDTSKSPLLLGS 263
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLE--------KHYSSGPAELL-----FGGKSTGIKGL 274
+ +S + TP+ R+ + K + G + F + G G
Sbjct: 264 LATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGG- 322
Query: 275 QIIFDSGSSYTYFNSQAYK 293
+I DSG+S TY Q Y+
Sbjct: 323 -VIVDSGTSITYLELQGYR 340
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/255 (27%), Positives = 109/255 (42%), Gaps = 24/255 (9%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPE---SL 105
T + LG+ + T+++G P + + +DTGSDL WV C+ AP G + + S+
Sbjct: 87 TFRISSLGFLHYTTVELGTPGVKFMVALDTGSDLFWVPCDCSRCAPTHGASYASDFELSI 146
Query: 106 YHPKNN----LVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYAD-HGSSLGVLVTDHFP 159
Y+P+ + V CN+ C+ + RC C Y V Y S+ G+LV D
Sbjct: 147 YNPRESSTSKKVTCNNDMCA-----QRNRCLGTFSSCPYIVSYVSAQTSTSGILVKDVLH 201
Query: 160 LRLTNG--SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVL 217
L +G + + FGCG Q G+ GLG+ K S+ S L GL +
Sbjct: 202 LTTEDGGREFVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLSREGLIADSF 261
Query: 218 GHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQII 277
C G G + G P TP + + Y+ + G ++ +
Sbjct: 262 SMCFGHDGIGRISFGDKGSPDQ--EETPFNVNPAHPTYNVTVTQARVGTMLIDVE-FTAL 318
Query: 278 FDSGSSYTYFNSQAY 292
FDSG+S+TY AY
Sbjct: 319 FDSGTSFTYMVDPAY 333
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 115/277 (41%), Gaps = 24/277 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + IG PP DTGSDL WVQC +PC C L+ P + C
Sbjct: 88 GEYLMRFYIGTPPVERLATADTGSDLIWVQC-SPCASCFPQSTPLFQPLKSSTFMPTTCR 146
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGS-SLGVLVTDHFPLRLTNG--SLLGPRLI 173
C+ LPE C + +C Y Y D S S G+L T+ G ++ P
Sbjct: 147 SQPCTLL-LPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSF 205
Query: 174 FGCG-YNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL----SVRGGG 227
FGCG YN P T G++GLG G S++SQ+ +G + +CL S
Sbjct: 206 FGCGLYNNITVFPSYKLT-GIMGLGAGPLSLVSQIGDQIG---HKFSYCLLPLGSTSTSK 261
Query: 228 YLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKS--TGIKGLQIIFDSGSSY 284
F ++ G+ TPM + L +Y + K+ TG +I DSG+
Sbjct: 262 LKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTLL 321
Query: 285 TYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
TY Y +++ L + ++D LP C+
Sbjct: 322 TYLGESFYYNFAASLQESLAVELVQDVL--SPLPFCF 356
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 77/171 (45%), Gaps = 14/171 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + +G+PP L +D+GSD+ WVQC PC C + L+ P + V+C
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCG 186
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C +CDY V Y D + G L + LT G + GC
Sbjct: 187 SAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALE----TLTLGGTAVQGVAIGC 242
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
G+ RN G AG+LGLG G S++ QL G V +CL+ RG G
Sbjct: 243 GH--RNSGLF-VGAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG 288
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/268 (29%), Positives = 114/268 (42%), Gaps = 23/268 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + +G P DTGSDL W QC PC C L+ PK++ ++C+
Sbjct: 90 GEYLMKFSLGTPAFDILAIADTGSDLIWTQCK-PCDQCYEQDAPLFDPKSSSTYRDISCS 148
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGPRLIFG 175
C + E N C Y Y D + G + D L T+G +L P+ I G
Sbjct: 149 TKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAIIG 208
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL------SVRGGGYL 229
CG+N N G +G++GLG G S++SQL S +CL +
Sbjct: 209 CGHN--NGGSFTEKGSGIVGLGGGPISLISQLGST--IDGKFSYCLVPLSSNATNSSKLN 264
Query: 230 FLGHDLVPSSGIAWTPM-SRD------LLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGS 282
F + +V G+ TP+ S+D L + S G + F G S G II DSG+
Sbjct: 265 FGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSEGNIIIDSGT 324
Query: 283 SYTYFNSQAYKTTLDLMRKDLKGKPLED 310
+ T F + ++ + G P+ED
Sbjct: 325 TLTLFPEDFFSELSSAVQDAVAGTPVED 352
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 115/263 (43%), Gaps = 19/263 (7%)
Query: 56 NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT--LPPE------SLYH 107
N + +Y+V + +G P + + +DTGSDL WV C+ C C P+ +Y
Sbjct: 102 NQFGFLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPLSSPDYGNLKFDVYS 158
Query: 108 PKNNLVACNDPFCSAFHLPENIRCE-ANDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTNG 165
P+ + + P CS+ C A++ C Y++ Y +D+ SS GVLV D L +G
Sbjct: 159 PRKSSTSRKVP-CSSNMCDLQTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESG 217
Query: 166 --SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV 223
+ + FGCG Q G+LGLG+ S+ S L S G+ N C
Sbjct: 218 HSKITQAPITFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCFGE 277
Query: 224 RGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSS 283
G G + G S+ TP++ +Y+ + GGK+ K + DSG+S
Sbjct: 278 DGHGRINFGD--TGSADQLETPLNIYKHNPYYNISIVGAMAGGKTFSTK-FSAVVDSGTS 334
Query: 284 YTYFNSQAYKTTLDLMRKDLKGK 306
+T + Y K +K K
Sbjct: 335 FTALSDPMYTEITSAFDKQVKEK 357
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 111/261 (42%), Gaps = 57/261 (21%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK----NNLVACNDPFC 120
+ L IG PP+ + +DTGS L+W+QC+ PP + + P +++ C P C
Sbjct: 77 INLPIGTPPQTQPMVLDTGSQLSWIQCHK-----KQPPTASFDPSLSSTFSILPCTHPLC 131
Query: 121 SA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
F LP + C+ N C Y YAD + G LV + F + S+ P LI GC
Sbjct: 132 KPRIPDFTLPTS--CDQNRLCHYSYFYADGTYAEGNLVREKFTF---SRSVSTPPLILGC 186
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG-------GYL 229
+P G+LG+ LG+ S Q + +T+ +C+ R G
Sbjct: 187 ATESTDP-------RGILGMNLGRLSFAKQSK---ITK--FSYCVPPRQTRPGFTPTGSF 234
Query: 230 FLGHDLVPSS-GIAWTPM---SRDLLEKH----YSSGPAELLFGGKSTGIKGL------- 274
+LG++ PSS G + M SR + Y+ + GK I
Sbjct: 235 YLGNN--PSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAG 292
Query: 275 ---QIIFDSGSSYTYFNSQAY 292
Q + DSGS +TY S+AY
Sbjct: 293 GSGQTMIDSGSEFTYLVSEAY 313
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/257 (29%), Positives = 116/257 (45%), Gaps = 42/257 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y + L IG P + + +DTGSDL W QC PCT C +++P+ + + C+
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCS 151
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A P C +N+ C Y Y D + G + T+ LT GS+ P + FGC
Sbjct: 152 SQLCQALSSP---TC-SNNFCQYTYGYGDGSETQGSMGTE----TLTFGSVSIPNITFGC 203
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY---LFLGH 233
G N N G AG++G+G G S+ SQL +T+ +C++ G L LG
Sbjct: 204 GEN--NQGFGQGNGAGLVGMGRGPLSLPSQLD---VTK--FSYCMTPIGSSTPSNLLLGS 256
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGL----------------QII 277
+ +S A +P + L++ + G S G L II
Sbjct: 257 --LANSVTAGSP-NTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGII 313
Query: 278 FDSGSSYTYFNSQAYKT 294
DSG++ TYF + AY++
Sbjct: 314 IDSGTTLTYFVNNAYQS 330
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 77/171 (45%), Gaps = 14/171 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + +G+PP L +D+GSD+ WVQC PC C + L+ P + V+C
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCG 186
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C +CDY V Y D + G L + LT G + GC
Sbjct: 187 SAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALE----TLTLGGTAVQGVAIGC 242
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
G+ RN G AG+LGLG G S++ QL G V +CL+ RG G
Sbjct: 243 GH--RNSGLF-VGAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG 288
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 78/310 (25%), Positives = 129/310 (41%), Gaps = 29/310 (9%)
Query: 29 QPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTW 88
Q K + + ST + +G++ Y V + +G P + L DTGSDLTW
Sbjct: 102 QSRLSKNLGRENSVKELDSTTLPAKSGSLIGSANYFVVVGLGTPKRDLSLVFDTGSDLTW 161
Query: 89 VQCNAPCTG-CTLPPESLYHPKNNL----VACNDPFCSAFHLPE-NIRCEAN-DQCDYEV 141
QC PC G C ++++ P + + C C+ RC ++ C Y +
Sbjct: 162 TQCE-PCAGSCYKQQDAIFDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGI 220
Query: 142 LYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKA 201
Y D +S+G L + + T+ + +FGCG Q N G +AG++GLG
Sbjct: 221 QYGDKSTSVGFLSQERLTITATD---IVDDFLFGCG--QDNEGL-FSGSAGLIGLGRHPI 274
Query: 202 SILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHY---- 255
S + Q S + + +CL + G+L G ++ + +TP+S + +
Sbjct: 275 SFVQQTSS--IYNKIFSYCLPSTSSSLGHLTFGASAATNANLKYTPLSTISGDNTFYGLD 332
Query: 256 ----SSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDT 311
S G +L ST G II DSG+ T AY R+ ++ P+ +
Sbjct: 333 IVGISVGGTKLPAVSSSTFSAGGSII-DSGTVITRLAPTAYAALRSAFRQGMEKYPVAN- 390
Query: 312 AEEKALPVCW 321
E+ C+
Sbjct: 391 -EDGLFDTCY 399
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 125/296 (42%), Gaps = 28/296 (9%)
Query: 32 SKKKSTQSTAAHRFGSTAV-FPITGNVYP-LGYYSVTLKIGNPPKLYELDIDTGSDLTWV 89
S++ ++ AA S+AV P++ Y G Y V L++G P + + L DTGSDLTWV
Sbjct: 83 SRQGGSRRVAAEVASSSAVSLPMSSGAYSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWV 142
Query: 90 QCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENI-RCEA-NDQCDYEVLY 143
+C PP ++ PK + + C+ C +P + C + C Y+ Y
Sbjct: 143 KCAGAS-----PPGRVFRPKTSRSWAPIPCSSDTCK-LDVPFTLANCSSPASPCTYDYRY 196
Query: 144 AD-HGSSLGVLVTDHFPLRLTNGSLLGPR-LIFGCGYNQRNPGPKPPPTAGVLGLGLGKA 201
+ + G++ T+ + L G + + ++ GC + + G GVL LG K
Sbjct: 197 KEGSAGARGIVGTESATIALPGGKVAQLKDVVLGC--SSSHDGQSFRSADGVLSLGNAKI 254
Query: 202 SILSQLQSL---GLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSG 258
S +Q + + ++ H GYL G VP + T + D Y
Sbjct: 255 SFATQAAARFGGSFSYCLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVK 314
Query: 259 PAELLFGGKSTGI-------KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
+ GK+ I K +I DSG++ T + AYK + + K L G P
Sbjct: 315 VDAIHVAGKALDIPAEVWDAKSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVP 370
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 83/279 (29%), Positives = 125/279 (44%), Gaps = 37/279 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG PP+ + L +DTGSDL W+QC PC C + Y PK + + C+
Sbjct: 190 GEYFMDVFIGTPPRHFSLILDTGSDLNWIQC-VPCYDCFVQNGPYYDPKESSSFKNIGCH 248
Query: 117 DPFCSAFHLPENIR-CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTN--GSLLGPR- 171
DP C P+ + C+A +Q C Y Y D ++ G + F + LT+ G R
Sbjct: 249 DPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRV 308
Query: 172 --LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---- 225
++FGCG+ R AG+LGLG G S SQLQS L + +CL R
Sbjct: 309 ENVMFGCGHWNRGLFHG---AAGLLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDTN 363
Query: 226 -GGYLFLGH--DLVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGKSTGIKGLQ---- 275
L G DL+ + +T + + ++ Y ++ GG+ I
Sbjct: 364 VSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLS 423
Query: 276 ------IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
I DSG++ +YF +Y+ D K +KG P+
Sbjct: 424 PEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPV 462
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/258 (30%), Positives = 113/258 (43%), Gaps = 36/258 (13%)
Query: 70 GNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP---------KNNLVACNDPFC 120
G+P + +DTGSDLTWVQC PC+ C + L+ P + N AC D
Sbjct: 155 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACADSLR 213
Query: 121 SAFHLPENIRC--EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
+A P + +++C Y + Y D S GVL TD L G+ LG +FGCG
Sbjct: 214 AATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL---GGASLGG-FVFGCGL 269
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRGGGYLFL-GH 233
+ R TAG++GLG + S++SQ S V +CL S G L L G
Sbjct: 270 SNRG---LFGGTAGLMGLGRTELSLVSQTAS--RYGGVFSYCLPAATSGDASGSLSLGGG 324
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELL------FGGKSTGIKGL---QIIFDSGSSY 284
D SS TP++ + + P L GG + +GL ++ DSG+
Sbjct: 325 DDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVI 384
Query: 285 TYFNSQAYKTT-LDLMRK 301
T Y+ + MR+
Sbjct: 385 TRLAPSVYRAVRAEFMRQ 402
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/290 (27%), Positives = 123/290 (42%), Gaps = 29/290 (10%)
Query: 52 PITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN 111
P T + LG Y ++ +G P +DTGSD+ W+QC PC C ++ +
Sbjct: 78 PETTVISALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQ-PCKKCYEQTTPIFDSSKS 136
Query: 112 ----LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS- 166
+ C C + + C + C Y + Y D SLG L + L TNGS
Sbjct: 137 QTYKTLPCPSNTCQSV---QGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNGSP 193
Query: 167 LLGPRLIFGCG-YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC----L 221
+ P + GCG YN K +G++GLG G S+++QL T +C L
Sbjct: 194 VQFPGTVIGCGRYNAIGIEEK---NSGIVGLGRGPMSLITQLSP--STGGKFSYCLVPGL 248
Query: 222 SVRGGGYLFLGHDLVPSSGIAWTPM-SRD------LLEKHYSSGPAELLFGGKSTGIKGL 274
S F +V G TP+ S++ L + +S G + FG +G KG
Sbjct: 249 STASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGSGGKG- 307
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
II DSG++ T + Y + K + + + D + L +C+K T
Sbjct: 308 NIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRD--PNQVLGLCYKVT 355
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 111/277 (40%), Gaps = 32/277 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT--GCTLPPESLYHPKNN----LVACN 116
Y VT+ +G P L++DTGSDL+WVQC PC C + L+ P + V C
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCT-PCAAPACYSQKDPLFDPAQSSSYAAVPCG 198
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
P C + + C A QC Y V Y D + GV +D L N ++ G FGC
Sbjct: 199 GPVCGGLGIYAS-SCSAA-QCGYVVSYGDGSKTTGVYSSDTLTLS-PNDAVRG--FFFGC 253
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--GYLFLGHD 234
G+ Q G+LGLG +AS++ Q+ G V +CL R GYL LG
Sbjct: 254 GHAQSGFTGND----GLLGLGREEASLVE--QTAGTYGGVFSYCLPTRPSTTGYLTLGG- 306
Query: 235 LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI---------IFDSGSSYT 285
PS + LL ++ ++ G S G + L + + D+G+ T
Sbjct: 307 --PSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVDTGTVIT 364
Query: 286 YFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
AY R + L C+
Sbjct: 365 RLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYN 401
>gi|62954896|gb|AAY23265.1| Similar to probable aspartic proteinase (EC 3.4.23.-) - barley
[Oryza sativa Japonica Group]
gi|77548965|gb|ABA91762.1| Aspartic proteinase Asp1 precursor, putative [Oryza sativa
Japonica Group]
gi|125576451|gb|EAZ17673.1| hypothetical protein OsJ_33214 [Oryza sativa Japonica Group]
Length = 96
Score = 75.5 bits (184), Expect = 3e-11, Method: Composition-based stats.
Identities = 33/49 (67%), Positives = 38/49 (77%)
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC 98
VFP+ GNVYP G + VT+ IG P K Y LDIDTGSDLTWV+C+APC C
Sbjct: 31 VFPLHGNVYPSGRFFVTMNIGVPEKPYFLDIDTGSDLTWVECDAPCQSC 79
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 68/233 (29%), Positives = 100/233 (42%), Gaps = 25/233 (10%)
Query: 33 KKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
+++ + +T GS V I+ G Y V + +G+PP L +D+GSD+ W+QC
Sbjct: 106 QRRLSPTTMTTEVGSEVVSGISEGS---GEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCR 162
Query: 93 APCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPENIR-CEANDQCDYEVLYADHG 147
PC C + L+ P + V C+ C LP C + C Y+V Y D
Sbjct: 163 -PCAECYQQADPLFDPAASASFTAVPCDSGVCRT--LPGGSSGCADSGACRYQVSYGDGS 219
Query: 148 SSLGVLVTDHFPLRLTNG-SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQ 206
+ GVL + LT G S + GCG+ R AG+LGLG G S++ Q
Sbjct: 220 YTQGVLAMET----LTFGDSTPVQGVAIGCGHRNRGLFVG---AAGLLGLGWGPMSLVGQ 272
Query: 207 LQSLGLTRNVLGHCLSVR----GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHY 255
L +CL+ R G G L G D G W P+ R+ + +
Sbjct: 273 LGG--AAGGAFSYCLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSF 323
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 89/323 (27%), Positives = 129/323 (39%), Gaps = 45/323 (13%)
Query: 32 SKKKSTQSTAAHRFG----STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLT 87
SKK R G +A + ++ GYY+ + IG PP + L +DTGS +T
Sbjct: 5 SKKNDIVDRRFERRGRKLEESARMTLHDDLLTKGYYTSRVFIGTPPNEFALIVDTGSTVT 64
Query: 88 WVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPEN------IRCEAND------ 135
+V C++ CT C S + + C DP PEN I C ++D
Sbjct: 65 YVPCSS-CTHCGHHQASF---STHRLFCRDPRFK----PENSSSYQKIGCRSSDCITGLC 116
Query: 136 -----QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI-FGCGYNQRNPGPKPPP 189
QC YE +YA+ +S GVL D L S L +L+ FGC G
Sbjct: 117 DSNSHQCKYERMYAEMSTSKGVLGKD--LLDFGPASRLQSQLLSFGC--ETAESGDLYLQ 172
Query: 190 TA-GVLGLGLGKASILSQLQSLGLTRNVLGHCLS--VRGGGYLFLGHDLVPSSGIAWTPM 246
A G++GLG G SI+ QL G + C GGG + LG PS +
Sbjct: 173 VADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGGGSMVLGAIPAPSGMVFAKSD 232
Query: 247 SRDLLEKHYSSGPAELLFGGKSTGIKG------LQIIFDSGSSYTYFNSQAYKTTLDLMR 300
R +Y+ E+ G S + I DSG++Y Y +A++ D +
Sbjct: 233 PR--RSNYYNLELTEIQVQGASLKLDSNVFNGKFGTILDSGTTYAYLPDRAFEAFTDAVV 290
Query: 301 KDLKGKPLEDTAEEKALPVCWKG 323
L D + +C+ G
Sbjct: 291 AQLGSLQAVDGPDPNYPDICYAG 313
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 114/271 (42%), Gaps = 54/271 (19%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDPFC 120
V+L IG PP+ ++ +DTGS L+W+QC+ PP + + P +++ CN P C
Sbjct: 82 VSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLC 141
Query: 121 SA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
F LP C+ N C Y YAD + G LV + + + P LI GC
Sbjct: 142 KPRIPDFTLPTT--CDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQST---PPLILGC 196
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG-------GYL 229
+ G+LG+ LG+ S SQ + + +C+ R G
Sbjct: 197 AEASTD-------EKGILGMNLGRRSFASQAKI-----SKFSYCVPTRQARAGLSSTGSF 244
Query: 230 FLGHDLVPSSG-------IAWTPMSR--DLLEKHYSSGPAELLFGGKSTGIKGL------ 274
+LG++ P+SG + +TP R +L Y+ + G I
Sbjct: 245 YLGNN--PNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDP 302
Query: 275 ----QIIFDSGSSYTYFNSQAY-KTTLDLMR 300
Q I DSGS +TY +AY K +++R
Sbjct: 303 SGAGQTIIDSGSEFTYLVDEAYNKVREEVVR 333
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 115/292 (39%), Gaps = 49/292 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG P + Y +DTGSDL W QC APC C P + P + + C
Sbjct: 88 GEYLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSATYRSLGCA 146
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
P C+A + P + C Y+ Y D S+ GVL + F + P + FGC
Sbjct: 147 SPACNALYYPLCYQ----KVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGC 202
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS---------VRGGG 227
G N G +G++G G G S++SQL S + +CL+ + G
Sbjct: 203 G--NLNAG-SLANGSGMVGFGRGSLSLVSQLGSPRFS-----YCLTSFLSPVPSRLYFGV 254
Query: 228 YLFLGHDLVPSSGIAWTPMSRD-LLEKHYSSGPAELLFGG-------------KSTGIKG 273
Y L S + TP + L Y + GG + G G
Sbjct: 255 YATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGG 314
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK---PLEDTAEEKALPVCWK 322
I DSG++ TY AY D +R + PL + + L C++
Sbjct: 315 --TIIDSGTTITYLAEPAY----DAVRAAFASQITLPLLNVTDASVLDTCFQ 360
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 115/255 (45%), Gaps = 26/255 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + +G PP+ + + +DTGSDL W+QC APC C ++ P ++ V C
Sbjct: 147 GEYLVDVYLGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQSGPIFDPAASISYRNVTCG 205
Query: 117 DPFCS-----AFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT-NGSLLGP 170
D C A P R +D C Y Y D ++ G L + F + LT +G+
Sbjct: 206 DDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVD 265
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---GG 227
+ FGCG+ R AG+LGLG G S SQL+ + + +CL G G
Sbjct: 266 GVAFGCGHRNRGLFHG---AAGLLGLGRGPLSFASQLRGV-YGGHAFSYCLVEHGSAAGS 321
Query: 228 YLFLGHD--LVPSSGIAWTPMSRDL-LEKHYSSGPAELLFGGKSTGIKGLQI-----IFD 279
+ GHD L+ + +T + + Y +L GG++ I + I D
Sbjct: 322 KIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTIID 381
Query: 280 SGSSYTYFNSQAYKT 294
SG++ +YF AY+
Sbjct: 382 SGTTLSYFPEPAYQA 396
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 82/305 (26%), Positives = 125/305 (40%), Gaps = 61/305 (20%)
Query: 53 ITGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP-----CTGCTL----P 101
+T YP Y YSV +G PP+ L +DTGS L W C P C CT P
Sbjct: 62 VTLPAYPRSYGGYSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDP 121
Query: 102 PESLYHPKN-----NLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD 156
+ + +N + C P C+ + ++ C +C Y L GS+ G LV+D
Sbjct: 122 TKIPIYARNKSSTVQSLPCRSPKCN-WVFGSDLNCSTTKRCPYYGLEYGLGSTTGQLVSD 180
Query: 157 HFPLRLTNGSLLGPRLIFGCG-YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRN 215
L N P +FGC + R P G+ G G G ASI +Q LGLT+
Sbjct: 181 VLGLSKLNRI---PDFLFGCSLVSNRQP-------EGIAGFGRGLASIPAQ---LGLTK- 226
Query: 216 VLGHCL------SVRGGGYLFL----GHDLVPSSGIAWTPMSR----DLLEKHYSSGPAE 261
+CL G L L H ++G+A+ P ++ ++Y ++
Sbjct: 227 -FSYCLVSHRFDDTPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSK 285
Query: 262 LLFGGKSTGIKGLQI----------IFDSGSSYTYFNSQAYKTTLDLMRKDL----KGKP 307
+L GGK I + I DSGS++T+ + + K + + K
Sbjct: 286 ILVGGKDVPIPPRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKE 345
Query: 308 LEDTA 312
+ED++
Sbjct: 346 IEDSS 350
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 75/273 (27%), Positives = 109/273 (39%), Gaps = 26/273 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNNL----VAC 115
G Y V +++G P + + DTGSD TWVQC PC C E L+ P + ++C
Sbjct: 163 GNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQ-PCVAYCYQQKEPLFTPTKSATYANISC 221
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
+CS + R + C Y V Y D ++G D LT G FG
Sbjct: 222 TSSYCSDL----DTRGCSGGHCLYAVQYGDGSYTVGFYAQD----TLTLGYDTVKDFRFG 273
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGH 233
CG R K AG++GLG GK S+ +Q+ V +C+ + G G+L G
Sbjct: 274 CGEKNRGLFGK---AAGLMGLGRGKTSV--PVQAYDKYSGVFAYCIPATSSGTGFLDFGP 328
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK-----STGIKGLQIIFDSGSSYTYFN 288
++ TPM D Y G + GG +T + DSG+ T
Sbjct: 329 GAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLP 388
Query: 289 SQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
AY+ K ++G + L C+
Sbjct: 389 PSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCY 421
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 110/256 (42%), Gaps = 31/256 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT--GCTLPPESLYHPKN----NLVA 114
G Y V++ +G P + + DTGSDL+WVQC PC+ GC + L+ P + + V
Sbjct: 152 GNYVVSVGLGTPARDLTVVFDTGSDLSWVQCG-PCSSGGCYKQQDPLFAPSDSSTFSAVR 210
Query: 115 CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL--------RLTNGS 166
C C A +D+C YEV+Y D + G L D L N +
Sbjct: 211 CGARECRARQSCGG--SPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDN 268
Query: 167 LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SV 223
L P +FGCG N + G+ GLG GK S+ S Q+ G +CL S
Sbjct: 269 KL-PGFVFGCGENNTGLFGQ---ADGLFGLGRGKVSLSS--QAAGKFGEGFSYCLPSSSS 322
Query: 224 RGGGYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGIK----GLQIIF 278
GYL LG + + +TPM +R Y + G++ + L +I
Sbjct: 323 SAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIV 382
Query: 279 DSGSSYTYFNSQAYKT 294
DSG+ T +AY+
Sbjct: 383 DSGTVITRLAPRAYRA 398
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 114/285 (40%), Gaps = 33/285 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y T+++G P +++ + +DTGSDLTWVQC +PC C +SL+ P + +AC
Sbjct: 1 GEYLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGTCYSQNDSLFIPNTSTSFTKLACG 59
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFG 175
C+ P C C Y Y D S G V D + NG P FG
Sbjct: 60 TELCNGLPYP---MCNQT-TCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFG 115
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV-----RGGGYLF 230
CG++ G+LGLG G S SQL++ + +CL L
Sbjct: 116 CGHDNEGSFAG---ADGILGLGQGPLSFPSQLKT--VFNGKFSYCLVDWLAPPTQTSPLL 170
Query: 231 LGHDLVPS-SGIAWTP-MSRDLLEKHYSSGPAELLFGGKSTGIKGLQI----------IF 278
G VP+ G+ + ++ + +Y + GGK I IF
Sbjct: 171 FGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIF 230
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
DSG++ T + ++ L M P + + L +C G
Sbjct: 231 DSGTTVTQLAGEVHQEVLAAMNASTMDYP-RKSDDSSGLDLCLGG 274
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 71/270 (26%), Positives = 113/270 (41%), Gaps = 26/270 (9%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------- 104
T + LG+ + T++IG P + + +DTGSDL WV C+ CT C +
Sbjct: 90 TFRISSLGFLHYTTVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAASDSTAFASDFDL 147
Query: 105 -LYHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYAD-HGSSLGVLVTD-- 156
+Y+P + V CN+ C+ H + + +N C Y V Y S+ G+LV D
Sbjct: 148 NVYNPNGSSTSKKVTCNNSLCT--HRSQCLGTFSN--CPYMVSYVSAETSTSGILVEDVL 203
Query: 157 HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
H + L+ +IFGCG Q G+ GLG+ K S+ S L G T +
Sbjct: 204 HLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADS 263
Query: 217 LGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI 276
C G G + G S TP + + Y+ ++ G ++
Sbjct: 264 FSMCFGRDGIGRISFGDK--GSFDQDETPFNLNPSHPTYNITVTQVRVGTTVIDVE-FTA 320
Query: 277 IFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
+FDSG+S+TY Y + ++ +
Sbjct: 321 LFDSGTSFTYLVDPTYTRLTESFHSQVQDR 350
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 125/272 (45%), Gaps = 25/272 (9%)
Query: 47 STAVFPITGNVYPLGYYS--VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES 104
+TA I N P YYS +++G P K + + +DT S L+WV C PC L P
Sbjct: 108 ATASGLIFANGVPWDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGCE-PCINACLIPT- 165
Query: 105 LYHPKNN----LVACNDPFCSAFHLPENIR--CEA-NDQCDYEVLYADHGSSLGVLVTDH 157
++P + +V C C+A R C A + C Y Y D+ S+GV+ +D
Sbjct: 166 -FNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLSVGVVSSD- 223
Query: 158 FPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVL 217
LT G L + IFGC R G + +G+LG+ + K S+ SQ+ ++G +
Sbjct: 224 ---TLTYG-LGSQKFIFGCCNLFRGVGGR---YSGILGMSVNKFSLFSQM-TVGHRYRAM 275
Query: 218 GHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRD--LLEKHYSSGPAELL-FGGKSTGIKG 273
+C R G+L G S + +TP+ D H S+ E + +S+G +
Sbjct: 276 SYCFPHPRNQGFLQFGRYDEHKSLLRFTPLYIDGNNYFVHVSNVMVETMSLDVQSSGNQT 335
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKG 305
++ FD+G+ YT + + D + ++G
Sbjct: 336 MRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEG 367
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 100/253 (39%), Gaps = 39/253 (15%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT--------LPPESLYHPKNNL----VA 114
+ +G P + + +DTGSDL WV C+ C C L P Y P+ + V
Sbjct: 87 VALGTPNATFVVALDTGSDLFWVPCD--CKRCAPIANTSELLKP---YSPRQSSTSKPVT 141
Query: 115 CNDPFCSAFHLPENIRCEANDQCDYEVLYAD-HGSSLGVLVTDHFPLRLTN--------- 164
C+ C N N C Y V Y + SS GVLV D + +
Sbjct: 142 CSHSLCDR----PNACGNGNGSCPYTVKYVSANTSSSGVLVEDVLYMTRQSSSSRSGNGG 197
Query: 165 --GSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLT-RNVLGHCL 221
G +G R++FGCG Q G+LGLG+ + S+ S L + GL + C
Sbjct: 198 NVGEAVGARVVFGCGQEQTGAFLDGAAMEGLLGLGMDRVSVPSLLAAAGLVGSDSFSMCF 257
Query: 222 SVRGGGYLFLGHDLVPSSGIAW--TPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFD 279
S G G + G PS A TP Y+ + GK + D
Sbjct: 258 SPDGNGRINFGE---PSDAGAQNETPFIVSKTRPTYNISVTAVNVKGKGAMAAEFAAVVD 314
Query: 280 SGSSYTYFNSQAY 292
SG+S+TY N AY
Sbjct: 315 SGTSFTYLNDPAY 327
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 108/256 (42%), Gaps = 26/256 (10%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------- 104
T + LG+ + T++IG P + + +DTGSDL WV C+ CT C S
Sbjct: 86 TFRISSLGFLHYTTVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAATDSSAFASDFDL 143
Query: 105 -LYHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYAD-HGSSLGVLVTD-- 156
+Y+P + V CN+ C H + + +N C Y V Y S+ G+LV D
Sbjct: 144 NVYNPNGSSTSKKVTCNNSLC--MHRSQCLGTLSN--CPYMVSYVSAETSTSGILVEDVL 199
Query: 157 HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
H + L+ +IFGCG Q G+ GLG+ K S+ S L G T +
Sbjct: 200 HLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADS 259
Query: 217 LGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI 276
C G G + G S TP + + Y+ ++ G ++
Sbjct: 260 FSMCFGRDGIGRISFGDK--GSFDQDETPFNLNPSHPTYNITVTQVRVGTTLIDVE-FTA 316
Query: 277 IFDSGSSYTYFNSQAY 292
+FDSG+S+TY Y
Sbjct: 317 LFDSGTSFTYLVDPTY 332
>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 441
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 85/315 (26%), Positives = 133/315 (42%), Gaps = 57/315 (18%)
Query: 47 STAVFPITGNVYPLGY---YSVTLKIGNPPKLYELDIDTGSDLTWVQC-NAPCTGCTLPP 102
+T PI ++ P Y VTL IG PP+L ++ +DTGS ++W+ C N PP
Sbjct: 50 TTKTNPIVPSISPYKYSMALVVTLPIGTPPQLQQMVLDTGSQVSWIHCDNKKGPQKKQPP 109
Query: 103 ESLYHPKNNL-----VACNDPFCSA----FHLPENIRCEANDQCDYEVLYADHGSSLGVL 153
+ + + CN P C LP + C+AN C Y Y D G L
Sbjct: 110 TTSSFDPSLSSSFFALPCNHPLCKPQVPDISLPTD--CDANRLCHYSFSYTDGTVVEGNL 167
Query: 154 VTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLT 213
V ++ L + SL P +I GC NQ + G+LG+ LG+ S +Q + +T
Sbjct: 168 VRENIAL---SPSLTTPPIILGCA-NQSDDA------RGILGMNLGRLSFPNQAK---IT 214
Query: 214 RNVLGHCLSVR----GGGYLFLGHDLVPSSG----IAWTPMSRDLLEKHYSSGPAE--LL 263
+ + + V+ G G L+LG++ P+S + S+ ++ + P L
Sbjct: 215 K--FSYFVPVKQTQPGSGSLYLGNN--PNSSCFRYVKLLTFSKSQSQRMPNLDPLAFTLP 270
Query: 264 FGGKSTGIKGL---------------QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
G S G K L Q I DSGS ++Y +AY + + K + K
Sbjct: 271 MQGISIGGKKLNIPPSVFKPDTTGFGQTIIDSGSEFSYMVDKAYNVIRNELVKKVGSKIK 330
Query: 309 EDTAEEKALPVCWKG 323
+D +C+ G
Sbjct: 331 KDYIYGGVADICFDG 345
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 73/264 (27%), Positives = 114/264 (43%), Gaps = 27/264 (10%)
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES--------LYHPK 109
Y L Y +V+ +G P + + +DTGS+L W+ C+ C+ C S +Y P
Sbjct: 59 YILHYANVS--VGTPSVSFLVALDTGSNLLWLPCD--CSSCVHSLRSPSGTVDLNIYSPN 114
Query: 110 NNL----VACNDPFCSAFHLPENIRCEAND-QCDYEVLYADHGSSL-GVLVTD--HFPLR 161
+ V CN CS + RC ++ C Y+V+Y +G+S G +V D H
Sbjct: 115 TSSTSEKVPCNSTLCSQ---TQRDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHLISD 171
Query: 162 LTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
+ + ++ FGCG Q G+ GLG+ S+ S L G T C
Sbjct: 172 DSQSKAVDAKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCF 231
Query: 222 SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH-YSSGPAELLFGGKSTGIKGLQIIFDS 280
S G G + G S+G T ++ Y+ + GG+++ + IFDS
Sbjct: 232 SPNGIGRISFGDK--GSTGQGETSFNQGQPRSSLYNISITQTSIGGQASDLV-YSAIFDS 288
Query: 281 GSSYTYFNSQAYKTTLDLMRKDLK 304
G+S+TY N AY + K +K
Sbjct: 289 GTSFTYLNDPAYTLIAESFNKLVK 312
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 113/262 (43%), Gaps = 36/262 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y + + G+PP+ + +DTGSDL W QC PC C ++ P + V+C
Sbjct: 78 GEYLIDISFGSPPQKASVIVDTGSDLIWTQC-LPCETCNAAASVIFDPVKSSTYDTVSCA 136
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
FCS+ LP C Y+ +Y D S+ G L T +T G+ P + FGC
Sbjct: 137 SNFCSS--LPFQ---SCTTSCKYDYMYGDGSSTSGALST----ETVTVGTGTIPNVAFGC 187
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYL--FLGHD 234
G+ AG++GLG G S++SQ S +T +CL G L D
Sbjct: 188 GHTNLGSFAG---AAGIVGLGQGPLSLISQASS--ITSKKFSYCLVPLGSTKTSPMLIGD 242
Query: 235 LVPSSGIAWTPMSRDLLEKHY---------SSGPAELL----FGGKSTGIKGLQIIFDSG 281
+ G+A+T + + + SG A F ++G G I DSG
Sbjct: 243 SAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGF--ILDSG 300
Query: 282 SSYTYFNSQAYKTTLDLMRKDL 303
++ TY + A+ + ++ ++
Sbjct: 301 TTLTYLETGAFNALVAALKAEV 322
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 112/290 (38%), Gaps = 32/290 (11%)
Query: 47 STAVFPIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL 105
ST P T G G Y VT+ +G P Y + DTGSD TWVQC C L
Sbjct: 146 STPSLPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPL 205
Query: 106 YHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLR 161
+ P + V+C D C+ + C C Y V Y D ++G D L
Sbjct: 206 FDPAKSSTYANVSCTDSACADL---DTNGCTGG-HCLYAVQYGDGSYTVGFFAQD--TLT 259
Query: 162 LTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
+ + ++ G R FGCG K TAG++GLG GK S+ +Q+ +CL
Sbjct: 260 IAHDAIKGFR--FGCGEKNNGLFGK---TAGLMGLGRGKTSL--TVQAYNKYGGAFAYCL 312
Query: 222 SV--RGGGYLFLGHDLVPSSG---IAWTPMSRDLLEKHYSSGPAELLFGGKSTGI----- 271
G GYL D P S TPM D + Y G + GG+ +
Sbjct: 313 PALTTGTGYL----DFGPGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVF 368
Query: 272 KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
+ DSG+ T + AY K + + + L C+
Sbjct: 369 STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCY 418
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 64/130 (49%), Gaps = 10/130 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V L +G PP+ +++ +DTGSDL W+QC APC C ++ P +L V C
Sbjct: 150 GEYLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPATSLSYRNVTCG 208
Query: 117 DPFCSAFH---LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT--NGSLLGPR 171
DP C P R +D C Y Y D ++ G L + F + LT S
Sbjct: 209 DPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDD 268
Query: 172 LIFGCGYNQR 181
++FGCG++ R
Sbjct: 269 VVFGCGHSNR 278
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 126/279 (45%), Gaps = 41/279 (14%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G+PPK + L +DTGSDL W+QC PC C + Y PK + + CN
Sbjct: 168 GEYFMDVLVGSPPKHFSLILDTGSDLNWIQC-LPCYDCFQQNGAFYDPKASASYKNITCN 226
Query: 117 DPFCSAFHLPE-NIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRL-TNGSLLG---- 169
D C+ P+ + C++++Q C Y Y D ++ G + F + L TNG
Sbjct: 227 DQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNV 286
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---- 225
++FGCG+ R AG+LGLG G S SQLQS L + +CL R
Sbjct: 287 ENMMFGCGHWNRGLFHG---AAGLLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDTN 341
Query: 226 -GGYLFLGH--DLVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGK------------ 267
L G DL+ + +T +L++ Y +L G+
Sbjct: 342 VSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNIS 401
Query: 268 STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
S G G I DSG++ +YF AY+ + + + KGK
Sbjct: 402 SDGAGG--TIIDSGTTLSYFAEPAYEFIKNKIAEKAKGK 438
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 115/292 (39%), Gaps = 49/292 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG P + Y +DTGSDL W QC APC C P + P + + C
Sbjct: 88 GEYLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSATYRSLGCA 146
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
P C+A + P + C Y+ Y D S+ GVL + F + P + FGC
Sbjct: 147 SPACNALYYPLCYQ----KVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGC 202
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS---------VRGGG 227
G N G +G++G G G S++SQL S + +CL+ + G
Sbjct: 203 G--NLNAG-LLANGSGMVGFGRGSLSLVSQLGSPRFS-----YCLTSFLSPVPSRLYFGV 254
Query: 228 YLFLGHDLVPSSGIAWTPMSRD-LLEKHYSSGPAELLFGG-------------KSTGIKG 273
Y L S + TP + L Y + GG + G G
Sbjct: 255 YATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGG 314
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK---PLEDTAEEKALPVCWK 322
I DSG++ TY AY D +R + PL + + L C++
Sbjct: 315 --TIIDSGTTITYLAEPAY----DAVRAAFASQITLPLLNVTDASVLDTCFQ 360
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 68/216 (31%), Positives = 102/216 (47%), Gaps = 23/216 (10%)
Query: 7 RVMGLLVLLMFATF--QGCFSEAN-------QPPSKKK----STQSTAAHRFGST-AVFP 52
R+ L++++FA + C S + Q SK K ++ST A R + V
Sbjct: 9 RIRASLLIIIFALTCSKECTSHSRLTLRTKTQESSKIKIGYLHSKSTPASRLDNLWTVSH 68
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
+T P + + + IGNPP L IDTGSDLTW+ C PC C +HP +
Sbjct: 69 VTPIPNPAAFLA-NISIGNPPVPQLLLIDTGSDLTWIHC-LPCK-CYPQTIPFFHPSRSS 125
Query: 113 VACNDPFCSAFH-LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR 171
N SA H +P+ R E C Y + Y D ++ G+L + ++ L+ +
Sbjct: 126 TYRNASCVSAPHAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQ 185
Query: 172 -LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQ 206
++FGCG Q N G +GVLGLG G SI+++
Sbjct: 186 NIVFGCG--QDNSGFT--KYSGVLGLGPGTFSIVTR 217
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 120/269 (44%), Gaps = 29/269 (10%)
Query: 61 GYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VAC 115
G Y + L +G+PP +Y L +DTGSDL W QC PC GC ++ P + + C
Sbjct: 80 GDYLMKLTLGSPPVDIYGL-VDTGSDLVWAQC-TPCGGCYRQKSPMFEPLRSKTYSPIPC 137
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGPRLIF 174
CS F C C Y YAD + GVL + T+G ++ +IF
Sbjct: 138 ESEQCSFF----GYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIF 193
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-----SVRGGGYL 229
GCG++ N G G++G+G G S++SQ+ +L ++ CL G +
Sbjct: 194 GCGHS--NSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKR-FSQCLVPFHTDAHTSGTI 250
Query: 230 FLGHDL-VPSSGIAWTPMSRDLLEKHY-------SSGPAELLFGGKSTGIKGLQIIFDSG 281
G + V G+ TP++ + + Y S G + F T KG I+ DSG
Sbjct: 251 NFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKG-NIMIDSG 309
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
+ TY + Y+ ++ ++ P+ED
Sbjct: 310 TPATYIPQEFYERLVEELKVQSSLLPIED 338
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 78/177 (44%), Gaps = 11/177 (6%)
Query: 80 IDTGSDLTWVQCNAPCTG--CTLPPESLYHPKNN----LVACNDPFCSAFHLPENIRCEA 133
IDT SD+ WVQC APC C + LY P + C+ P C N A
Sbjct: 160 IDTASDVPWVQC-APCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGCTPA 218
Query: 134 NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGV 193
DQC Y V Y D +S G ++D L + FGC + PG T+G+
Sbjct: 219 GDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALLQPGSFSNKTSGI 278
Query: 194 LGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSR 248
+ LG G S+ +Q ++ +V +CL + G+ LG V +S A TPM R
Sbjct: 279 MALGRGAQSLPTQTKA--TYGDVFSYCLPPTPVHSGFFILGVPRVAASRYAVTPMLR 333
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 79/264 (29%), Positives = 110/264 (41%), Gaps = 48/264 (18%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC-TGCTLPPESLYHPKNNL----VAC 115
G Y + L +G PP + IDTGSDLTW QC APC T C P LY P + + C
Sbjct: 94 GAYHMILSVGTPPLAFPAIIDTGSDLTWTQC-APCTTACFAQPTPLYDPARSSTFSKLPC 152
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL------RLTNGSLLG 169
P C A LP R C Y+ YA G + G L D + + S G
Sbjct: 153 ASPLCQA--LPSAFRACNATGCVYDYRYA-VGFTAGYLAADTLAIGDGDGDGDASSSFAG 209
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGG 226
+ FGC G +G++GLG S+LSQ +G+ R +CL + G
Sbjct: 210 --VAFGCSTAN---GGDMDGASGIVGLGRSALSLLSQ---IGVGR--FSYCLRSDADAGA 259
Query: 227 GYLFLGH------DLVPSSGIAWTPMSRDLLEKHY-------SSGPAEL-----LFGGKS 268
+ G D V S+ + P++ +Y + G +L FG +
Sbjct: 260 SPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTA 319
Query: 269 TGIKGLQIIFDSGSSYTYFNSQAY 292
G G +I DSG+++TY Y
Sbjct: 320 AGAGG--VIVDSGTTFTYLAEAGY 341
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 64/130 (49%), Gaps = 10/130 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V L +G PP+ +++ +DTGSDL W+QC APC C ++ P +L V C
Sbjct: 150 GEYLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASLSYRNVTCG 208
Query: 117 DPFCSAFH---LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT--NGSLLGPR 171
DP C P R +D C Y Y D ++ G L + F + LT S
Sbjct: 209 DPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDD 268
Query: 172 LIFGCGYNQR 181
++FGCG++ R
Sbjct: 269 VVFGCGHSNR 278
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 75/290 (25%), Positives = 118/290 (40%), Gaps = 48/290 (16%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQC-NAPCTGCTLPPES-----LYHPKNNLVACNDP 118
VTL IG PP+L ++ +DTGS L+W+QC N PP + ++ CN P
Sbjct: 84 VTLPIGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHP 143
Query: 119 FCSA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C F LP + C+AN C Y YAD + G LV + + + P +I
Sbjct: 144 LCKPRVPDFSLPTD--CDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTT---PPIIL 198
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR----GGGYLF 230
GC + G+LG+ LG+ SQ + +T+ +C+ + G +
Sbjct: 199 GCATQSDD-------ARGILGMNLGRLGFPSQAK---ITK--FSYCVPTKQAQPASGSFY 246
Query: 231 LGHDLVPSSGIAWTPMSRDLLEKHYSSGPAE--LLFGGKSTGIKGL-------------- 274
LG++ SS ++ ++ + P L G S G K L
Sbjct: 247 LGNNPASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGS 306
Query: 275 -QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
Q + DSGS +TY +AY + + K + K + +C+ G
Sbjct: 307 GQTMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFDG 356
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 73/266 (27%), Positives = 108/266 (40%), Gaps = 28/266 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + +G P K L DTGSDLTW QC C + ++ P + ++C
Sbjct: 152 GNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCT 211
Query: 117 DPFCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
CS+ N ++ C Y + Y D ++G D L LT + +FG
Sbjct: 212 SAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDK--LTLTQNDVF-DGFMFG 268
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLSVRGG--GYLFLG 232
CG N + K TAG++GLG SI+ Q Q G +CL G G+L G
Sbjct: 269 CGQNNKGLFGK---TAGLIGLGRDPLSIVQQTAQKFG---KYFSYCLPTSRGSNGHLTFG 322
Query: 233 H------DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSG 281
+ +GI +TP + +Y + GGK+ I + I DSG
Sbjct: 323 NGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQNAGTIIDSG 382
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKP 307
+ T S AY + ++ + P
Sbjct: 383 TVITRLPSTAYGSLKSAFKQFMSKYP 408
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 121/283 (42%), Gaps = 38/283 (13%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCN----APCTG---CTLPPESL--YHPKNN----LV 113
+ IG P + + +DTGS+L W+ CN AP T +L + L Y+P ++ +
Sbjct: 104 IDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163
Query: 114 ACNDPFCSAFHLPENIRCEA-NDQCDYEVLY-ADHGSSLGVLVTDHFPL------RLTNG 165
C+ C + CE+ +QC Y V Y + + SS G+LV D L RL NG
Sbjct: 164 LCSHKLCDSAS-----DCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218
Query: 166 -SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR 224
S + R++ GCG Q G++GLG + S+ S L GL RN C
Sbjct: 219 SSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEE 278
Query: 225 GGGYLFLGHDLVPSSGIAWTPMSRDLLEKH--YSSGPAELLFGGKSTGIKGLQIIFDSGS 282
G ++ G D+ PS + TP + K+ Y G G DSG
Sbjct: 279 DSGRIYFG-DMGPSIQQS-TPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQ 336
Query: 283 SYTYFNSQAY-KTTLDLMR------KDLKGKPLEDTAEEKALP 318
S+TY + Y K L++ R K+ +G E E A P
Sbjct: 337 SFTYLPEEIYRKVALEIDRHINATSKNFEGVSWEYCYESSAEP 379
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 112/278 (40%), Gaps = 50/278 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G PP+ + + +DTGSDL W+QC APC C ++ P + V C
Sbjct: 147 GEYLIDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNVTCG 205
Query: 117 DPFCSAFHLPENIRC---EANDQCDYEVLYADHGSSLGVLVTDHFPLRLT--NGSLLGPR 171
D C PE R A D C Y Y D ++ G L + F + LT S
Sbjct: 206 DQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDG 265
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS--------- 222
++FGCG+ R AG+LGLG G S SQL R V GH S
Sbjct: 266 VVFGCGHRNRGLFHG---AAGLLGLGRGPLSFASQL------RAVYGHTFSYCLVEHGSD 316
Query: 223 -----VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ-- 275
V G YL L H + + A T D Y +L GG I
Sbjct: 317 AGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTF---YYVKLKGVLVGGDLLNISSDTWD 373
Query: 276 --------IIFDSGSSYTYFNSQAYKTT----LDLMRK 301
I DSG++ +YF AY+ +DLM +
Sbjct: 374 VGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSR 411
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 108/261 (41%), Gaps = 33/261 (12%)
Query: 78 LDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEA 133
L IDTGSD+TW+QC+ PC C +SL+ P + + CN C L
Sbjct: 3 LLIDTGSDITWIQCD-PCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQ--QLQSFSHSCL 59
Query: 134 NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFGCGYNQRNPGPKPPPTAG 192
N C+Y V Y D ++ G + LR + L+ P FGCG+ + G
Sbjct: 60 NSSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKG------LFNG 113
Query: 193 VLGL-GLGKASILSQLQSLGLTRNVLGHCL----SVRGGGYLFLGHDLVPSSGIAWTPMS 247
GL GLGK+SI Q+ V +CL S G L G + + +TP+
Sbjct: 114 AAGLMGLGKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLV 173
Query: 248 RDLLEKHYSSGPAELLFG--GKSTGIKGLQI----IFDSGSSYTYFNSQAYKTTLDLMRK 301
SSGP++ G + G + L I + DSG+ + F AY+ D +
Sbjct: 174 DS------SSGPSQYFVSMTGINVGDELLPISATVMVDSGTVISRFEQSAYERLRDAFTQ 227
Query: 302 DLKGKPLEDTAEEKALPVCWK 322
L G L+ C++
Sbjct: 228 ILPG--LQTAVSVAPFDTCFR 246
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 89/325 (27%), Positives = 138/325 (42%), Gaps = 63/325 (19%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
+ S+Q+ AA R TA++P + G Y+ ++ +G PP+ + +DTGS L+WV C +
Sbjct: 70 EPSSQAPAAVR---TALYP-----HSYGGYAFSVSLGTPPQPLPVLLDTGSHLSWVPCTS 121
Query: 94 P--CTGC-----TLPPESLYHPKNN----LVACNDPFCSAFHLPENIRCEAN------DQ 136
C C + +++HPKN+ LV C +P C H C + D
Sbjct: 122 SYQCRNCSSSPSAMSAMAVFHPKNSSSSRLVGCRNPACRWIHSKSPSTCGSTGNNGNGDV 181
Query: 137 C-DYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF-----GCGYNQRNPGPKPPPT 190
C Y V+Y GS+ G+L++D LRL+ S F GC + +PP
Sbjct: 182 CPPYLVVYGS-GSTSGLLISDT--LRLSPSSSSSAPAPFRNFAIGCSIVSVH---QPP-- 233
Query: 191 AGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--GYLFLGHDLVPSSGIAWTPMSR 248
+G+ G G G S+ SQL+ + +L G L LG +VP+ T
Sbjct: 234 SGLAGFGRGAPSVPSQLKVPKFSYCLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYV 293
Query: 249 DLLEKHYSSGPAELLF---------GGKSTGI--------KGLQIIFDSGSSYTYFNSQA 291
LL S P + + GGK + G I DSG+++TY +
Sbjct: 294 PLLNNAASKPPYSVYYYLALTGISVGGKPVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTV 353
Query: 292 YKTTLDLMRKDLKG-----KPLEDT 311
+K M + G +P+ED
Sbjct: 354 FKPVAAAMESAVGGRYNRSRPVEDA 378
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 110/258 (42%), Gaps = 30/258 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y +T+ G P + + DTGSD+ W+QC C E L+ P + V+C
Sbjct: 14 GNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVSCT 73
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
+P C + R ++ C Y V Y D S++G L D F L IFGC
Sbjct: 74 EPACVGL----STRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQKF---KNFIFGC 126
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKA-SILSQLQ-SLGLTRNVLGHCL--SVRGGGYLFLG 232
G Q N G TAG++GLG S+ SQ+ SLG NV +CL + GYL +G
Sbjct: 127 G--QNNTGLF-QGTAGLVGLGRSSTYSLNSQVAPSLG---NVFSYCLPSTSSATGYLNIG 180
Query: 233 HDLVPSSGIAWTPMSRDL-LEKHYSSGPAELLFGG-----KSTGIKGLQIIFDSGSSYTY 286
+ P + +T M D + Y + GG ST + + I DSG+ T
Sbjct: 181 N---PQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGTVITR 237
Query: 287 FNSQAYKTTLDLMRKDLK 304
AY +R +
Sbjct: 238 LPPTAYSALKTAVRAAMT 255
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 73/155 (47%), Gaps = 13/155 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y T+++G P +++ + +DTGSDLTWVQC +PC C ++L+ P + +AC
Sbjct: 11 GEYLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGKCYSQNDALFLPNTSTSFTKLACG 69
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFG 175
C+ P C C Y Y D + G V D + NG P FG
Sbjct: 70 SALCNGLPFP---MCNQT-TCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFG 125
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL 210
CG++ G+LGLG G S SQL+S+
Sbjct: 126 CGHDNEGSFAG---ADGILGLGQGPLSFHSQLKSV 157
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 106/257 (41%), Gaps = 27/257 (10%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-------- 104
T V LG+ + + +G P + + +DTGSDL W+ C+ T C ++
Sbjct: 94 TIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDC-STNCVRELKAPGGSSLDL 152
Query: 105 -LYHPK----NNLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHG-SSLGVLVTD- 156
+Y P ++ V CN C+ RC + C Y++ Y +G SS GVLV D
Sbjct: 153 NIYSPNASSTSSKVPCNSTLCTRVD-----RCASPLSDCPYQIRYLSNGTSSTGVLVEDV 207
Query: 157 -HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRN 215
H N + R+ GCG Q G+ GLGL S+ S L G+ N
Sbjct: 208 LHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAAN 267
Query: 216 VLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ 275
C G G + G S TP++ Y+ ++ GG +TG
Sbjct: 268 SFSMCFGDDGAGRISFGDK--GSVDQRETPLNIRQPHPTYNVTVTQISVGG-NTGDLEFD 324
Query: 276 IIFDSGSSYTYFNSQAY 292
+FD+G+S+TY Y
Sbjct: 325 AVFDTGTSFTYLTDAPY 341
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 106/264 (40%), Gaps = 35/264 (13%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN- 111
+G + G Y + +G P L IDTGSDL W+QC+ PC C ++ P+ +
Sbjct: 76 FSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCS-PCRRCYAQRGQVFDPRRSS 134
Query: 112 ---LVACNDPFCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
V C+ P C A P + A C Y V Y D SS G L TD L N +
Sbjct: 135 TYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDK--LAFANDTY 192
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG-- 225
+ + GCG + AG+LG+G GK SI +Q+ +V +CL R
Sbjct: 193 VN-NVTLGCGRDNEGLFDS---AAGLLGVGRGKISISTQVAP--AYGSVFEYCLGDRTSR 246
Query: 226 ---GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK--------------S 268
YL G P S +S Y A GG+ +
Sbjct: 247 STRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTA 306
Query: 269 TGIKGLQIIFDSGSSYTYFNSQAY 292
TG G ++ DSG++ + F AY
Sbjct: 307 TGRGG--VVVDSGTAISRFARDAY 328
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 114/256 (44%), Gaps = 50/256 (19%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG PP+ + L +DTGS +T+V C+ C C + + P +L P
Sbjct: 87 GYYTTRLWIGTPPQRFALIVDTGSTVTYVPCST-CEHCGRHQDPKFQP--DLSETYQPVK 143
Query: 120 CSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFGCG 177
C+ + C+ + +QC Y+ YA+ SS GVL D + N S L P R +FGC
Sbjct: 144 CTP-----DCNCDGDTNQCMYDRQYAEMSSSSGVLGED--VVSFGNLSELAPQRAVFGCE 196
Query: 178 -------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRGGG 227
Y+QR G++GLG G SI+ QL + + C + V GGG
Sbjct: 197 NDETGDLYSQR--------ADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDV-GGG 247
Query: 228 YLFLGHDLVPSSGIAWTPMSRD--------LLEKHYSSGPAEL---LFGGKSTGIKGLQI 276
+ LG + P + +T D L E H + +L +F GK
Sbjct: 248 AMILG-GISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLNPKVFDGKHG------T 300
Query: 277 IFDSGSSYTYFNSQAY 292
+ DSG++Y Y A+
Sbjct: 301 VLDSGTTYAYLPETAF 316
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/240 (28%), Positives = 103/240 (42%), Gaps = 21/240 (8%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y V++ +G P + + DTGSDL+WVQC PC C + L+ P + P C A
Sbjct: 188 YIVSVGLGTPRRDLLVVFDTGSDLSWVQCK-PCNNCYKQHDPLFDPSQSTTYSAVP-CGA 245
Query: 123 FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRN 182
++ C + +C YEV+Y D + G L D L ++ L G +FGCG +
Sbjct: 246 QECLDSGTCSSG-KCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQG--FVFGCGDDDTG 302
Query: 183 PGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG--HCL--SVRGGGYLFLGHDLVPS 238
+ G+ GLG + S+ SQ + R G +CL S R GYL LG P
Sbjct: 303 LFGRAD---GLFGLGRDRVSLASQAAA----RYGAGFSYCLPSSWRAEGYLSLGSAAAPP 355
Query: 239 SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSGSSYTYFNSQAYK 293
++R Y + G++ + K + DSG+ T S+AY
Sbjct: 356 HAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVITRLPSRAYS 415
>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
Length = 431
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 121/318 (38%), Gaps = 52/318 (16%)
Query: 21 QGCFSEANQPPSKKKSTQSTAAH------RFGSTAVFPITGNVYP--LGYYSVTLKIGNP 72
G FS + +K+S + AH R + P+ G P +G Y + IG P
Sbjct: 48 HGFFSLKYKFAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTP 107
Query: 73 PKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCE 132
+ Y + ++ LT TG LV+C+ FC A + C
Sbjct: 108 ARDYYVQME----LTLYDIKESLTG-------------KLVSCDQDFCYAINGGPPSYCI 150
Query: 133 ANDQCDYEVLYADHGSSLGVLVTDHFPL-------RLTNGSLLGPRLIFGCGYNQRNPGP 185
AN C Y +YAD SS G V + L N LL + C Q
Sbjct: 151 ANMSCSYTEIYADGSSSFGYFVKGYCTASKYNSIPHLNNNPLL--EVPLRCSATQSGDLS 208
Query: 186 KPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-VRGGGYLFLGHDLVPSSGIAWT 244
G+LG G S++SQL S G R + HCL + GGG +GH + P + T
Sbjct: 209 SEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPK--VNTT 266
Query: 245 PMSRDLLEKHYSSGPAELLFGGK---------STGIKGLQIIFDSGSSYTYFNSQAYKTT 295
P+ + + HY+ + GG G K II DSG++ Y Y
Sbjct: 267 PLVPN--QTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTII-DSGTTLAYLPEVVYDQL 323
Query: 296 LDLM---RKDLKGKPLED 310
L + + DLK + D
Sbjct: 324 LSKIFSWQSDLKVHTIHD 341
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/283 (27%), Positives = 120/283 (42%), Gaps = 28/283 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G PP DTGSDL W QC PC C + L+ PK + V+C+
Sbjct: 92 GEYLMNISLGTPPFPIMAIADTGSDLLWTQCK-PCDDCYTQVDPLFDPKASSTYKDVSCS 150
Query: 117 DPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIF 174
C+A L C D C Y Y D + G + D L T+ + + +I
Sbjct: 151 SSQCTA--LENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNIII 208
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL------SVRGGGY 228
GCG+N N G +G++GLG G S+++QL +CL + R
Sbjct: 209 GCGHN--NAGTFNKKGSGIVGLGGGAVSLITQLGD--SIDGKFSYCLVPLTSENDRTSKI 264
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHY-------SSGPAELLFGGKSTGIKGLQIIFDSG 281
F + +V +G+ TP+ E Y S G E+ + G +G II DSG
Sbjct: 265 NFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSG 324
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
++ T ++ Y D + + + +D + L +C+ T
Sbjct: 325 TTLTLLPTEFYSELEDAVASSIDAEKKQD--PQTGLSLCYSAT 365
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 81/274 (29%), Positives = 110/274 (40%), Gaps = 42/274 (15%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA--------PCTGCTLPPESL 105
+G LG Y V++ G PP+ L DTGSDL W+QC+ P C+ P +
Sbjct: 45 SGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFV 104
Query: 106 YHPKNNL--VACNDPFCSAFHLPE----NIRCEANDQCDYEVLYADHGSSLGVLVTDHFP 159
L V C+ C P + A C Y YAD S+ G L D
Sbjct: 105 ASKSATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDT-- 162
Query: 160 LRLTNGSLLGPR---LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
++NG+ G + FGCG RN G T GV+GLG G+ S +Q S L
Sbjct: 163 ATISNGTSGGAAVRGVAFGCG--TRNQGGSFSGTGGVIGLGQGQLSFPAQSGS--LFAQT 218
Query: 217 LGHCL-------SVRGGGYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKS 268
+CL R +LFLG + A+TP+ S L Y G + G +
Sbjct: 219 FSYCLLDLEGGRRGRSSSFLFLGRP-ERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRV 277
Query: 269 TGIKGLQ----------IIFDSGSSYTYFNSQAY 292
+ G + + DSGS+ TY AY
Sbjct: 278 LPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAY 311
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 70/271 (25%), Positives = 108/271 (39%), Gaps = 29/271 (10%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT----------LPPESLYH----PKNNL 112
+ +G PP + + +DTGSDL W+ C+ C C + + Y +N
Sbjct: 109 VSVGTPPLWFLVALDTGSDLFWLPCD--CISCVHGGLRTRTGKILKFNTYDLDKSSTSNE 166
Query: 113 VACNDP-FCSAFHLPENIRC-EANDQCDYEVLY-ADHGSSLGVLVTD--HFPLRLTNGSL 167
V+CN+ FC + +C A C Y+V Y ++ SS G +V D H
Sbjct: 167 VSCNNSTFCR-----QRQQCPSAGSTCRYQVDYLSNDTSSRGFVVEDVLHLITDDDQTKD 221
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
R+ FGCG Q G+ GLG+ S+ S L GL N C G
Sbjct: 222 ADTRIAFGCGQVQTGVFLNGAAPNGLFGLGMDNISVPSILAREGLISNSFSMCFGSDSAG 281
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
+ G P TP + L Y+ +++ ++ IFDSG+S+TY
Sbjct: 282 RITFGDTGSPDQ--RKTPFNVRKLHPTYNITITKIIVEDSVADLE-FHAIFDSGTSFTYI 338
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDTAEEKALP 318
N AY ++ +K K + + +P
Sbjct: 339 NDPAYTRIGEMYNSKVKAKRHSSQSPDSNIP 369
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 82/261 (31%), Positives = 119/261 (45%), Gaps = 35/261 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV----ACN 116
G Y +++ IG PP DTGSDLTWVQC PC C L+ K + +C+
Sbjct: 83 GEYFMSISIGTPPSKVFAIADTGSDLTWVQCK-PCQQCYKQNSPLFDKKKSSTYKTESCD 141
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFG 175
C A E E+ D C Y Y D+ + G + T+ + ++GS + P +FG
Sbjct: 142 SKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFG 201
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLS-----VRGGGYL 229
CGYN N G +G++GLG G S++SQL S+G +CLS G +
Sbjct: 202 CGYN--NGGTFEETGSGIIGLGGGPLSLVSQLGSSIG---KKFSYCLSHTAATTNGTSVI 256
Query: 230 FLGHDLVPS-----SGIAWTPMSRDLLEKHY-------SSGPAELLFGGKSTGIKGL--- 274
LG + +PS S TP+ + E +Y + G +L + G G+ G
Sbjct: 257 NLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSK 316
Query: 275 ---QIIFDSGSSYTYFNSQAY 292
II DSG++ T +S Y
Sbjct: 317 RTGNIIIDSGTTLTLLDSGFY 337
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 92/191 (48%), Gaps = 22/191 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT--GCTLPPESLYHPKN----NLVACN 116
Y VT+ +G P +++DTGSD++WVQC PC+ C + L+ P + V C
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CS + E C + QC Y V Y D ++ GV +D L L G+ +G +FGC
Sbjct: 202 ADACSELRIYE-AGCSGS-QCGYVVSYGDGSNTTGVYGSDT--LALAPGNTVG-TFLFGC 256
Query: 177 GYNQRNPGPKPPPTAGVLG-LGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGH 233
G+ Q AG+ G L LG+ S+ + Q+ G V +CL + GYL LG
Sbjct: 257 GHAQAG------MFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGG 310
Query: 234 DLVPSSGIAWT 244
+SG A T
Sbjct: 311 P-TSASGFATT 320
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/251 (29%), Positives = 108/251 (43%), Gaps = 33/251 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y T+ IG + +DT S+LTWVQC PC C E L+ P ++ V CN
Sbjct: 113 YVATVGIGGGEA--TVIVDTASELTWVQCE-PCDACHDQQEPLFDPSSSPSYAAVPCNSS 169
Query: 119 FCSAFHLPENIRCEANDQ----CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C A + + +A D C Y + Y D S GVL D L L + G +F
Sbjct: 170 SCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDR--LSLAGEDIQG--FVF 225
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFL 231
GCG + + P T+G++GLG + S++S Q++ V +CL + G L L
Sbjct: 226 GCGTSNQGPFGG---TSGLMGLGRSQLSLIS--QTMDQFGGVFSYCLPPKESGSSGSLVL 280
Query: 232 GHDLVP---SSGIAWTPMSRDLLEK-HYSSGPAELLFGGKSTGIKGL------QIIFDSG 281
G D S+ I +T M D L+ Y + + GG+ G + I DSG
Sbjct: 281 GDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSPGFSAGGGGKAIVDSG 340
Query: 282 SSYTYFNSQAY 292
+ T Y
Sbjct: 341 TIITSLVPSVY 351
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 82/261 (31%), Positives = 120/261 (45%), Gaps = 35/261 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV----ACN 116
G Y +++ IG PP + DTGSDLTWVQC PC C L+ K + +C+
Sbjct: 83 GEYFMSISIGTPPSKFLAIADTGSDLTWVQCK-PCQQCYKQNTPLFDKKKSSTYKTESCD 141
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFG 175
C+A E E+ + C Y Y D + G + T+ + ++GS + P FG
Sbjct: 142 SITCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSPVSFPGTAFG 201
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLS-----VRGGGYL 229
CGYN N G +G++GLG G S++SQL S+G +CLS G +
Sbjct: 202 CGYN--NGGTFEETGSGIIGLGGGPLSLVSQLGSSIG---KKFSYCLSHTSATTNGTSVI 256
Query: 230 FLGHDLVPS-----SGIAWTPMSRDLLEKHY-------SSGPAELLF---GGKSTGIKGL 274
LG + + S S I TP+ + E +Y + G +L + GG S K
Sbjct: 257 NLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGGYSLNRKSK 316
Query: 275 Q---IIFDSGSSYTYFNSQAY 292
+ II DSG++ T +S Y
Sbjct: 317 KTGNIIIDSGTTLTLLDSGFY 337
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 65/141 (46%), Gaps = 15/141 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y L +G PPK + +DTGSD+ W+QC APC C + ++ PK + ++C
Sbjct: 172 GEYFTRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISCR 230
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
P C P C + C Y+V Y D + G T+ R T P++ GC
Sbjct: 231 SPLCLRLDSPG---CNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTR----VPKVALGC 283
Query: 177 GYNQRNPGPKPPPTAGVLGLG 197
G++ AG+LGLG
Sbjct: 284 GHDNEGLFVG---AAGLLGLG 301
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 62/193 (32%), Positives = 89/193 (46%), Gaps = 29/193 (15%)
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPEN------I 129
+DT S+LTWVQC APC C + L+ P ++ V CN C A L
Sbjct: 168 VDTASELTWVQC-APCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAA 226
Query: 130 RCEANDQ----CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGP 185
C+ DQ C Y + Y D S GVL D RL+ + +FGCG + N GP
Sbjct: 227 ACQGQDQSAAACSYTLSYRDGSYSRGVLAHD----RLSLAGEVIDGFVFGCGTS--NQGP 280
Query: 186 KPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFLGHD---LVPSS 239
T+G++GLG + S++S Q++ V +CL ++ G L +G D S+
Sbjct: 281 PFGGTSGLMGLGRSQLSLVS--QTMDQFGGVFSYCLPLKESDSSGSLVIGDDSSVYRNST 338
Query: 240 GIAWTPMSRDLLE 252
I + M D L+
Sbjct: 339 PIVYASMVSDPLQ 351
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/266 (28%), Positives = 108/266 (40%), Gaps = 57/266 (21%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN-----------LV 113
V+L IG PP+ +L +DTGS L+W+QC+ LPP L PK L+
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKKRLPP--LPKPKTTSFDPSLSSSFSLL 125
Query: 114 ACNDPFCSA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
CN P C F LP + C+ N C Y YAD + G LV + F + SL
Sbjct: 126 PCNHPICKPRIPDFTLPTS--CDQNRLCHYSYFYADGTLAEGNLVREKFTF---SKSLST 180
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--- 226
P +I GC G+LG+ G+ S +SQ + + +C+ R G
Sbjct: 181 PPVILGCAQASTE-------NRGILGMNRGRLSFISQAKI-----SKFSYCVPSRTGSNP 228
Query: 227 -GYLFLGHDLVPSSGIAWTPM--------SRDLLEKHYSSGPAELLFGGKSTGIKGL--- 274
G +LG D SS + M S +L Y+ + GK +
Sbjct: 229 TGLFYLG-DNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFK 287
Query: 275 -------QIIFDSGSSYTYFNSQAYK 293
Q + DSGS TY +AY+
Sbjct: 288 PDAGGSGQTMIDSGSDLTYLVDEAYE 313
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 120/278 (43%), Gaps = 32/278 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACN----DP 118
Y + IG P + + +DTGSDL W+ C CT C P L N N +
Sbjct: 104 YYANVSIGTPGLYFLVALDTGSDLFWLPCE--CTKC---PTYLTKRDNGKFWLNHYSSNA 158
Query: 119 FCSAFHLP-ENIRCEANDQCD-------YEVLY-ADHGSSLGVLVTDHFPLRLTNGSLLG 169
++ +P + CE +QC Y+ Y +++ SS G LV D + T+ S L
Sbjct: 159 SSTSIRVPCSSSLCELANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMA-TDDSQLK 217
Query: 170 P---RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG 226
P ++ GCG Q G++GLG+GK S+ S L S GLT + C G
Sbjct: 218 PVDVKVTLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTDSFSMCFGYYGY 277
Query: 227 GYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTY 286
G + G D+ P G TP + L Y+ +++ + T + L I DSG+S+TY
Sbjct: 278 GRIDFG-DIGP-VGQRETPFNPASLS--YNVTILQIIVTNRPTNVH-LTAIIDSGASFTY 332
Query: 287 FNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPV--CWK 322
Y + M ++ LE + P C++
Sbjct: 333 LTDPFYSIITENMDAAME---LERIKSDSDFPFEYCYR 367
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 87/179 (48%), Gaps = 21/179 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT--GCTLPPESLYHPKN----NLVACN 116
Y VT+ +G P +++DTGSD++WVQC PC+ C + L+ P + V C
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CS + E C + QC Y V Y D ++ GV +D L L G+ +G +FGC
Sbjct: 202 ADACSELRIYE-AGCSGS-QCGYVVSYGDGSNTTGVYGSDT--LALAPGNTVG-TFLFGC 256
Query: 177 GYNQRNPGPKPPPTAGVLG-LGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLG 232
G+ Q AG+ G L LG+ S+ + Q+ G V +CL + GYL LG
Sbjct: 257 GHAQAG------MFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLG 309
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 90/210 (42%), Gaps = 20/210 (9%)
Query: 46 GSTAVFPI-TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES 104
GS P +G+ G Y VT+ +G P + DTGSDLTW QC C E
Sbjct: 120 GSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEP 179
Query: 105 LYHPKNNL----VACNDPFCSAFH--LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHF 158
+++P + ++C+ P C + C A+ C Y + Y D S+G D
Sbjct: 180 IFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSAS-TCVYGIQYGDQSYSVGFFAQDKL 238
Query: 159 PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVL 217
L T+ + +FGCG N R AG++GLG S++SQ Q G +
Sbjct: 239 ALTSTD---VFNNFLFGCGQNNRGLFVG---VAGLIGLGRNALSLVSQTAQKYG---KLF 289
Query: 218 GHCL--SVRGGGYLFLGHDLVPSSGIAWTP 245
+CL + GYL G S + +TP
Sbjct: 290 SYCLPSTSSSTGYLTFGSGGGTSKAVKFTP 319
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 109/266 (40%), Gaps = 57/266 (21%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN-----------NLV 113
V+L IG PP+ +L +DTGS L+W+QC+ LPP L PK +L+
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPP--LPKPKTASFDPSLSSSFSLL 125
Query: 114 ACNDPFCSA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
CN P C F LP + C+ N C Y YAD + G LV + F + SL
Sbjct: 126 PCNHPICKPRIPDFTLPTS--CDQNRLCHYSYFYADGTLAEGNLVREKFTF---SKSLST 180
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--- 226
P +I GC G+LG+ G+ S +SQ + + +C+ R G
Sbjct: 181 PPVILGCAQASTE-------NRGILGMNHGRLSFISQAKI-----SKFSYCVPSRTGSNP 228
Query: 227 -GYLFLGHDLVPSSGIAWTPM--------SRDLLEKHYSSGPAELLFGGKSTGIKGL--- 274
G +LG D SS + M S +L Y+ + GK I
Sbjct: 229 TGLFYLG-DNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFK 287
Query: 275 -------QIIFDSGSSYTYFNSQAYK 293
Q + DSGS TY +AY+
Sbjct: 288 PDAGGSGQTMIDSGSDLTYLVDEAYE 313
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/288 (27%), Positives = 120/288 (41%), Gaps = 43/288 (14%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT-LPPESLYHPKNN 111
I+G G Y V+L+IG PP+ L DTGSDL WV+C +PC C+ P S + +++
Sbjct: 76 ISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKC-SPCRNCSHRSPGSAFFARHS 134
Query: 112 L----VACNDPFCSAFHLPENIRC---EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN 164
+ C P C P C + C Y+ YAD ++ G + L +
Sbjct: 135 TTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTST 194
Query: 165 GSLLGPR-LIFGCGYNQRNP---GPKPPPTAGVLGLGLGKASILSQL-QSLG--LTRNVL 217
G + L FGCG+ P G GV+GLG S SQL + G + ++
Sbjct: 195 GKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLM 254
Query: 218 GHCLSVRGGGYLFLG---HDLVPSSGI-AWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG 273
+ LS +L +G + V GI ++TP+ + L P K + G
Sbjct: 255 DYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLS------PTFYYIAIKGVYVNG 308
Query: 274 LQI-----------------IFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
+++ I DSG++ T+ AY L +K +K
Sbjct: 309 VKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVK 356
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 123/294 (41%), Gaps = 30/294 (10%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWV 89
+K S ++++A + V T + Y LG Y +T+ +G P + IDTGSD++WV
Sbjct: 97 AKLSSPRNSSAKELQQSGVTIPTSSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWV 156
Query: 90 QCNAPCTG--CTLPPESLYHPKNNLV----ACNDPFCSAFHLPENIRCEANDQCDYEVLY 143
QC APC C+ + L+ P + +C+ C+ E C N C Y V Y
Sbjct: 157 QC-APCAAQSCSSQKDKLFDPAKSATYSAFSCSSAQCAQLG-GEGNGC-LNSHCQYIVKY 213
Query: 144 ADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASI 203
DH ++ G +D L ++ FGC + + G++GLG S+
Sbjct: 214 VDHSNTTGTYGSDTLGLTTSDAV---KNFQFGCSHRANGFVGQ---LDGLMGLGGDTESL 267
Query: 204 LSQLQSLGLTRNVLGHCL---SVRGGGYLFLGHDL--VPSSGIAWTPMSRDLLEKHYSSG 258
+S Q+ +CL S GG+L LG SS + TP+ R + Y
Sbjct: 268 VS--QTAATYGKAFSYCLPPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRFNVPTFYGVF 325
Query: 259 PAELLFGGKSTGI-----KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
+ G + G ++ DSG+ T AY+ +K++K P
Sbjct: 326 LQAITVAGTKLNVPASVFSGASVV-DSGTVITQLPPTAYQALRTAFKKEMKAYP 378
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 97/210 (46%), Gaps = 15/210 (7%)
Query: 6 KRVMGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSV 65
+++ L +LL+ F + + K S+ TA P++ + Y Y +
Sbjct: 5 RKIHLLAILLLVFIFPSIEAHNGRFTVKLIPRNSSQVLFNRITAQTPVSVHHYD---YLM 61
Query: 66 TLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF----CS 121
L IG PP +DTGSDL W+QC PCT C ++ P+++ N + CS
Sbjct: 62 ELSIGTPPVKTYAQVDTGSDLIWLQC-IPCTNCYKQLNPMFDPQSSSTYSNIAYGSESCS 120
Query: 122 AFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIFGCGYN 179
+ + C + + C+Y Y D + GVL + L T G + + +IFGCG+N
Sbjct: 121 KLY---STSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHN 177
Query: 180 QRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
N G G++GLG G S++SQ+ S
Sbjct: 178 --NNGVFNDKEMGIIGLGRGPLSLVSQIGS 205
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 126/293 (43%), Gaps = 49/293 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP---PESLYHPKN----NLV 113
G Y++ + +G PP + + +DTGS+L W QC APCT C P P + P + +
Sbjct: 89 GAYNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRC-FPRPTPAPVLQPARSSTFSRL 146
Query: 114 ACNDPFCSAFHLPENIR---CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
CN FC +LP + R C A C Y Y G + G L T+ LT G P
Sbjct: 147 PCNGSFCQ--YLPTSSRPRTCNATAACAYNYTYGS-GYTAGYLATE----TLTVGDGTFP 199
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF 230
++ FGC ++G++GLG G S++SQL ++G L ++ G +
Sbjct: 200 KVAFGCSTENGVDN-----SSGIVGLGRGPLSLVSQL-AVGRFSYCLRSDMADGGASPIL 253
Query: 231 LGH--DLVPSSGIAWTPMSRD-LLEK--HY-------SSGPAEL-----LFGGKSTGIKG 273
G L S + TP+ ++ L++ HY + EL FG TG+ G
Sbjct: 254 FGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGG 313
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALP----VCWK 322
I+ DSG++ TY Y + + L T P +C+K
Sbjct: 314 GTIV-DSGTTLTYLAKDGYAMVKQAFQSQMAN--LNQTTPASGAPYDLDLCYK 363
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 110/262 (41%), Gaps = 43/262 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + IG+PP L DTGSD+ WVQC+ PC+ C + L+ P N+ V CN
Sbjct: 121 GEYLVRVGIGSPPLEQHLVADTGSDVIWVQCS-PCSDCYAQGDPLFDPANSASFSPVPCN 179
Query: 117 DPFC-SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C +A + +C+Y+V Y D + GVL + L +G + G
Sbjct: 180 SGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTL---DGGTEVQGVAMG 236
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS------VRGGGYL 229
CG+ R + AG+LGLG G S++ QL +CL+ G G L
Sbjct: 237 CGHENRGLFAE---AAGLLGLGWGPMSLVGQLGG--AAGGAFSYCLAGYYSGEGSGSGSL 291
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ-------------- 275
LG + +G W P+ R+ P+ G G+ G +
Sbjct: 292 VLGREDAAPTGAVWVPLVRN------PDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDG 345
Query: 276 ---IIFDSGSSYTYFNSQAYKT 294
++ D+G++ T ++AY
Sbjct: 346 GGGVVMDTGTAVTRLPAEAYAA 367
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 116/263 (44%), Gaps = 33/263 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + IG PP DT SDL WVQC +PC C L+ P + ++C+
Sbjct: 88 GEYLMRFYIGTPPVERLAIADTASDLIWVQC-SPCETCFPQDTPLFEPHKSSTFANLSCD 146
Query: 117 DPFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C++ ++ C + C Y Y D S+ GVL T+ + + ++ P+ IFG
Sbjct: 147 SQPCTSSNI---YYCPLVGNLCLYTNTYGDGSSTKGVLCTES--IHFGSQTVTFPKTIFG 201
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL---SVRGGGYLFL 231
CG N G++GLG G S++SQL +G + +CL + L
Sbjct: 202 CGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIG---HKFSYCLLPFTSTSTIKLKF 258
Query: 232 GHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ----------IIFDS 280
G+D + +G+ TP+ +++ HY S L G + G K LQ II D
Sbjct: 259 GNDTTITGNGVVSTPL---IIDPHYPSY-YFLHLVGITIGQKMLQVRTTDHTNGNIIIDL 314
Query: 281 GSSYTYFNSQAYKTTLDLMRKDL 303
G+ TY Y + L+R+ L
Sbjct: 315 GTVLTYLEVNFYHNFVTLLREAL 337
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/274 (29%), Positives = 109/274 (39%), Gaps = 42/274 (15%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA--------PCTGCTLPPESL 105
+G LG Y V++ G PP+ L DTGSDL W+QC+ P C+ P +
Sbjct: 44 SGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFV 103
Query: 106 YHPKNNL--VACNDPFCSAFHLPEN----IRCEANDQCDYEVLYADHGSSLGVLVTDHFP 159
L V C+ C P A C Y YAD S+ G L D
Sbjct: 104 ASKSATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDT-- 161
Query: 160 LRLTNGSLLGPR---LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
++NG+ G + FGCG RN G T GV+GLG G+ S +Q S L
Sbjct: 162 ATISNGTSGGAAVRGVAFGCG--TRNQGGSFSGTGGVIGLGQGQLSFPAQSGS--LFAQT 217
Query: 217 LGHCL-------SVRGGGYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKS 268
+CL R +LFLG + A+TP+ S L Y G + G +
Sbjct: 218 FSYCLLDLEGGRRGRSSSFLFLGRP-ERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRV 276
Query: 269 TGIKGLQ----------IIFDSGSSYTYFNSQAY 292
+ G + + DSGS+ TY AY
Sbjct: 277 LPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAY 310
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 130/303 (42%), Gaps = 68/303 (22%)
Query: 44 RFGSTAVFPITGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP--CTGCT 99
R G+ + ++YP Y Y+ T+ +G PP+ + +DTGS L+WV C + C C+
Sbjct: 68 RQGTAPPPSVRASLYPHSYGGYAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCS 127
Query: 100 ----LPPESLYHPKNN----LVACNDPFCSAFHLPENIR-CEANDQC------------- 137
P ++HPKN+ L+ C +P C H P+++ C A C
Sbjct: 128 SLSAASPLHVFHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANAN 187
Query: 138 ----DYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGV 193
Y V+Y GS+ G+L++D LR ++ + GC + +PP +G+
Sbjct: 188 NVCPPYLVVYGS-GSTAGLLISDT--LRTPGRAVRN--FVIGCSLASVH---QPP--SGL 237
Query: 194 LGLGLGKASILSQLQSLGLTRNVLGHCL---------SVRGGGYLFLGHDLVPSSGIAWT 244
G G G S+ SQ LGLT+ +CL +V G L G+ +
Sbjct: 238 AGFGRGAPSVPSQ---LGLTK--FSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYA 292
Query: 245 PMSRDLLEK-----HYSSGPAELLFGGKSTGIKGLQI---------IFDSGSSYTYFNSQ 290
P++R + +Y + GGKS + I DSG++++YF+
Sbjct: 293 PLARSASARPPYSVYYYLALTAITVGGKSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRT 352
Query: 291 AYK 293
++
Sbjct: 353 VFE 355
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/293 (27%), Positives = 120/293 (40%), Gaps = 31/293 (10%)
Query: 47 STAVFPIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL 105
S A P+T G +G Y L +G P Y + +DTGS LTW+QC+ C L
Sbjct: 117 SLASVPLTPGTSVGVGNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPL 176
Query: 106 YHPKNNLVACNDPFCSAFHLPE-------NIRCEANDQCDYEVLYADHGSSLGVLVTDHF 158
Y P+ + P CSA E C + C Y+ Y D S+G L D
Sbjct: 177 YDPRASSTYATVP-CSASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRD-- 233
Query: 159 PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNVL 217
++ GS P +GCG + + +AG++GL K S+L QL SLG +
Sbjct: 234 --TVSFGSGSYPNFYYGCGQDNEGLFGR---SAGLIGLARNKLSLLYQLAPSLGYS---F 285
Query: 218 GHCLSVRGG-GYLFLGHDLVPSSGIAWTPMSRDLLEKH-YSSGPAELLFGGKSTGI---- 271
+CL GYL +G S ++TPM+ L+ Y + + GG +
Sbjct: 286 SYCLPTPASTGYLSIGP--YTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAE 343
Query: 272 -KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
L I DSG+ T + Y + + G ++ L C++G
Sbjct: 344 YSSLPTIIDSGTVITRLPTAVYTALSKAVAAAMVG--VQSAPAFSILDTCFQG 394
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 126/293 (43%), Gaps = 49/293 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP---PESLYHPKN----NLV 113
G Y++ + +G PP + + +DTGS+L W QC APCT C P P + P + +
Sbjct: 89 GAYNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRC-FPRPTPAPVLQPARSSTFSRL 146
Query: 114 ACNDPFCSAFHLPENIR---CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
CN FC +LP + R C A C Y Y G + G L T+ LT G P
Sbjct: 147 PCNGSFCQ--YLPTSSRPRTCNATAACAYNYTYGS-GYTAGYLATE----TLTVGDGTFP 199
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF 230
++ FGC ++G++GLG G S++SQL ++G L ++ G +
Sbjct: 200 KVAFGCSTENGVDN-----SSGIVGLGRGPLSLVSQL-AVGRFSYCLRSDMADGGASPIL 253
Query: 231 LGH--DLVPSSGIAWTPMSRD-LLEK--HY-------SSGPAEL-----LFGGKSTGIKG 273
G L S + TP+ ++ L++ HY + EL FG TG+ G
Sbjct: 254 FGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGG 313
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALP----VCWK 322
I+ DSG++ TY Y + + L T P +C+K
Sbjct: 314 GTIV-DSGTTLTYLAKDGYAMVKQAFQSQMAN--LNQTTPASGAPYDLDLCYK 363
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/286 (27%), Positives = 121/286 (42%), Gaps = 31/286 (10%)
Query: 63 YSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACND 117
Y ++ IG PP +LY + +DT +D W QCN PC C ++ P + + C+
Sbjct: 89 YIISFLIGTPPFQLYGV-MDTANDNIWFQCN-PCKPCFNTTSPMFDPSKSSTYKTIPCSS 146
Query: 118 PFCSAFHLPENIRCEANDQ--CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIF 174
P C EN C ++D+ C+Y Y S G L D L N + + + ++
Sbjct: 147 PKCKNV---ENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKNIVI 203
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL--GLTRNVLGHCLSVRG--GGYLF 230
GCG+ RN GP +G +GLG G S +SQL S G L S G G F
Sbjct: 204 GCGH--RNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLFSNEGISGKLHF 261
Query: 231 LGHDLVPSSGIAWTPMSRDLLEKHYSS-------GPAELLFGGKSTGIKGL-QIIFDSGS 282
+V G TP++ E YS+ G + F ++ L I DSG+
Sbjct: 262 GDKSVVSGVGTVSTPITAG--EIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGT 319
Query: 283 SYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCL 328
+ T Y ++ +K + + ++ L C+K T K L
Sbjct: 320 TLTILPENVYSRLESIVTSMVKLERAKSPNQQFKL--CYKATLKNL 363
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/269 (29%), Positives = 108/269 (40%), Gaps = 25/269 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y +T+ G P K + DTGS++ W+QC C E L+ P + N SA
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNISCTSA 75
Query: 123 FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRN 182
+ R + C Y V Y D S++G L T+ F L N + IFGCG N +
Sbjct: 76 ACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGN---VFNNFIFGCGQNNQG 132
Query: 183 PGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSS 239
AG++GLG S+ SQL SLG N+ +CL + GYL +G+ L
Sbjct: 133 LFTGA---AGLIGLGRSPYSLNSQLATSLG---NIFSYCLPSTSSATGYLNIGNPLRTPG 186
Query: 240 GIAWTPMSR-------DLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAY 292
A SR DL+ S G L ST + + I DSG+ T AY
Sbjct: 187 YTAMLTNSRAPTLYFIDLI--GISVGGTRLAL--SSTVFQSVGTIIDSGTVITRLPPTAY 242
Query: 293 KTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
R + A L C+
Sbjct: 243 GALRTAFRAAM--TQYTRAAAASILDTCY 269
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/251 (29%), Positives = 112/251 (44%), Gaps = 30/251 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG P + + L +D+GS +T+V C A C C + N++ +DP
Sbjct: 90 GYYTTRLYIGTPSQEFALIVDSGSTVTYVPC-ATCEQCG----NHQSESPNIIEAHDPRF 144
Query: 120 ---CSAFHLPE--NIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP- 170
S+ + P N+ C ++ QC YE YA+ SS GVL D + S L P
Sbjct: 145 QPDLSSTYSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGED--IMSFGKESELKPQ 202
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRGGG 227
R +FGC N G++GLG G+ SI+ QL G+ + C + V GGG
Sbjct: 203 RAVFGC-ENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV-GGG 260
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI------KGLQIIFDSG 281
+ LG P + S + +Y+ E+ GK+ + + DSG
Sbjct: 261 TMVLGGMPAPPDMV--FSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSG 318
Query: 282 SSYTYFNSQAY 292
++Y Y QA+
Sbjct: 319 TTYAYLPEQAF 329
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 106/276 (38%), Gaps = 27/276 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V++ +G P K L DTGSDLTW QC C + ++ P + ++C+
Sbjct: 129 GNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCS 188
Query: 117 DPFCSAFHLPENIR--CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
P CS + C A C Y + Y D S+G + L T+ + +F
Sbjct: 189 SPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSTD---VIENFLF 245
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL--SVRGGGYLFL 231
GCG N R AG++GLG K SI+ Q Q G V +CL + GYL
Sbjct: 246 GCGQNNRGLFGS---AAGLIGLGQDKISIVKQTAQKYG---QVFSYCLPKTSSSTGYLTF 299
Query: 232 GHDLVPSSGIAWTPMSRDL-LEKHYSSGPAELLFGGKSTGIKGLQI-----IFDSGSSYT 285
+ +TP+++ + Y + GG I I DSG+ T
Sbjct: 300 -GGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSGAIIDSGTVIT 358
Query: 286 YFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
AY K + P E L C+
Sbjct: 359 RLPPDAYSALKSAFEKGMAKYP--KAPELSILDTCY 392
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 108/251 (43%), Gaps = 17/251 (6%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVA--CNDPFCSAFH 124
+ IG P + + +DTGSD+ WV C+ C C + Y+ + + S+ H
Sbjct: 106 IDIGTPNVSFLVALDTGSDMFWVPCD--CIECAPLSAAFYNALDRDLNQYSPSLSSSSRH 163
Query: 125 LP-------ENIRCEA-NDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTNGSL--LGPRLI 173
LP +N C+ D+C Y Y +D+ SS G L+ D L N + + +I
Sbjct: 164 LPCGHQLCNQNSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHLASNNATKNSIQASVI 223
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGH 233
GCG Q + G+LGLG G S+ + L GL RN + CL+ +G G + G
Sbjct: 224 LGCGRKQSGYFLEGAAPNGMLGLGPGSISVPALLAKAGLIRNSISICLNEKGSGRILFG- 282
Query: 234 DLVPSSGIAWTPMSRDLLE-KHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAY 292
D ++ TP D E +Y G G + D+G+S+TY Y
Sbjct: 283 DQGHATQRRSTPFLLDDGELLNYFVGVERFCVGSFCYKETEFKAFIDTGTSFTYLPKGVY 342
Query: 293 KTTLDLMRKDL 303
+T + K +
Sbjct: 343 ETVVAEFEKQV 353
>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 430
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 85/182 (46%), Gaps = 9/182 (4%)
Query: 130 RCEANDQ-CDYEVLY-ADHGSSLGVLVTDHFPLRLTNGSLLGP---RLIFGCGYNQRNPG 184
RC +N C YE+ Y + + SS+G LV D L T+ SLL P ++ FGCG Q
Sbjct: 26 RCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA-TDDSLLKPVEAKITFGCGTVQTGIF 84
Query: 185 PKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWT 244
G++GLG+ K S+ S L GLT N C G G + G D P+ T
Sbjct: 85 ATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGADGYGRIDFG-DTGPADQ-KQT 142
Query: 245 PMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
P + L + Y+ + GG+ + IFDSG+S+TY AY T M +K
Sbjct: 143 PFNTMLEYQSYNVTFNVINVGGEPNDVP-FTAIFDSGTSFTYLTEPAYSTITKQMDAGMK 201
Query: 305 GK 306
K
Sbjct: 202 LK 203
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/251 (29%), Positives = 112/251 (44%), Gaps = 30/251 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG P + + L +D+GS +T+V C A C C + N++ +DP
Sbjct: 89 GYYTTRLYIGTPSQEFALIVDSGSTVTYVPC-ATCEQCG----NHQSESPNIIEAHDPRF 143
Query: 120 ---CSAFHLPE--NIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP- 170
S+ + P N+ C ++ QC YE YA+ SS GVL D + S L P
Sbjct: 144 QPDLSSTYSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDI--MSFGKESELKPQ 201
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRGGG 227
R +FGC N G++GLG G+ SI+ QL G+ + C + V GGG
Sbjct: 202 RAVFGC-ENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV-GGG 259
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI------KGLQIIFDSG 281
+ LG P + S + +Y+ E+ GK+ + + DSG
Sbjct: 260 TMVLGGMPAPPDMV--FSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSG 317
Query: 282 SSYTYFNSQAY 292
++Y Y QA+
Sbjct: 318 TTYAYLPEQAF 328
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 111/260 (42%), Gaps = 29/260 (11%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYH----------PKNNL---- 112
+ IG P + + +D GSDL W+ C+ C C S Y P +L
Sbjct: 100 IDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLSSSYYSNLDRDLNEYSPSRSLSSKH 157
Query: 113 VACNDPFCSAFHLPENIRCEANDQ-CDYEVLY-ADHGSSLGVLVTDHFPLR----LTNGS 166
++C+ C + C+++ Q C Y V Y +++ SS G+LV D L+ L+N S
Sbjct: 158 LSCSHQLCD-----KGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGSLSNSS 212
Query: 167 LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG 226
+ P ++ GCG Q G+LGLG G++S+ S L GL + C +
Sbjct: 213 VQAP-VVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSGLIHDSFSLCFNEDDS 271
Query: 227 GYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTY 286
G +F G D P+ + + + D L Y G G + ++ DSG+S+T+
Sbjct: 272 GRIFFG-DQGPTIQQSTSFLPLDGLYSTYIIGVESCCVGNSCLKMTSFKVQVDSGTSFTF 330
Query: 287 FNSQAYKTTLDLMRKDLKGK 306
Y + + + G
Sbjct: 331 LPGHVYGAIAEEFDQQVNGS 350
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 115/268 (42%), Gaps = 40/268 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y VT+ IG L +DTGSDLTWVQC PC C E L++P N+ + CN P
Sbjct: 66 YIVTVGIGGQNS--TLIVDTGSDLTWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSP 122
Query: 119 FCSAFH-------LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR 171
C A L N + + CDY++ Y D S G L + +LT G
Sbjct: 123 TCVALQPTAGSSGLCSN---KNSTSCDYQIDYGDGSYSRGELGFE----KLTLGKTEIDN 175
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---GGY 228
IFGCG RN +G++GL + S++SQ S L +V +CL G G
Sbjct: 176 FIFGCG---RNNKGLFGGASGLMGLARSELSLVSQTSS--LFGSVFSYCLPTTGVGSSGS 230
Query: 229 LFLG----HDLVPSSGIAWTPMSRD-LLEKHYSSGPAELLFGGKSTGI------KGLQII 277
L LG + S I++T M ++ + Y + GG + + +G+ +
Sbjct: 231 LTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSL 290
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKG 305
DSG+ T + YK K G
Sbjct: 291 LDSGTVITRLSPSIYKAFKAEFEKQFSG 318
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/339 (26%), Positives = 128/339 (37%), Gaps = 53/339 (15%)
Query: 26 EANQPPSK--KKSTQSTAAHRFGSTAVFPITG-------------NVYPLGYYSVTLKIG 70
EA +PP + T+ HR + +F +G N P Y V L IG
Sbjct: 363 EAARPPRDGGRSLTRREVLHRMAARLLFSASGRAASARVDPGPYANGVPDTEYLVHLAIG 422
Query: 71 NPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACNDPFCSAFHLP 126
PP+ +L +DTGSDL W QC PC C P N +++ C+ P C
Sbjct: 423 TPPQPVQLILDTGSDLVWTQCR-PCPVCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWS 481
Query: 127 ENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS--LLGPRLIFGCGYNQRNP 183
+ N C Y YAD + G L + F +G+ P L FGCG N
Sbjct: 482 SCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGCGLF--NN 539
Query: 184 GPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG-----YLFLGHDLVPS 238
G G+ G G G S+ SQL+ + HC + G L L +L
Sbjct: 540 GIFTSNETGIAGFGRGALSLPSQLKV-----DNFSHCFTAITGSEPSSVLLGLPANLYSD 594
Query: 239 S--GIAWTPMSRDL--LEKHYSS------GPAEL-----LFGGKSTGIKGLQIIFDSGSS 283
+ + TP+ ++ L +Y S G L F K G G I DSG+
Sbjct: 595 ADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGG--TIIDSGTG 652
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
T AYK D ++ P+++ +C+
Sbjct: 653 MTTLPQDAYKLVHDAFTAQVR-LPVDNATSSSLSRLCFS 690
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 81/274 (29%), Positives = 116/274 (42%), Gaps = 46/274 (16%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN- 111
++G + G Y + +G+PP + IDTGSDL W+QC PC C LY P+++
Sbjct: 78 MSGVPFDSGEYFAVINVGDPPTRALVVIDTGSDLIWLQC-VPCRHCYRQVTPLYDPRSSS 136
Query: 112 ---LVACNDPFC-SAFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTDH--FP--LRL 162
+ C P C P C+A C Y V+Y D +S G L TD FP +
Sbjct: 137 THRRIPCASPRCRDVLRYPG---CDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHV 193
Query: 163 TNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
N +L GCG++ N G AG+LG+G G+ S +QL +V +CL
Sbjct: 194 HNVTL-------GCGHD--NVG-LLESAAGLLGVGRGQLSFPTQLAP--AYGHVFSYCLG 241
Query: 223 VR------GGGYLFLGHDLVPSSGIAWTPMSRDLLEK--HYSSGPAELLFGGKSTGIKGL 274
R G YL G P S A+TP+ + +Y + G + TG
Sbjct: 242 DRLSRAQNGSSYLVFGRTPEPPS-TAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNA 300
Query: 275 Q-----------IIFDSGSSYTYFNSQAYKTTLD 297
I+ DSG++ + F AY D
Sbjct: 301 SLALNPATGRGGIVVDSGTAISRFARDAYAAVRD 334
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 115/268 (42%), Gaps = 40/268 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y VT+ IG L +DTGSDLTWVQC PC C E L++P N+ + CN P
Sbjct: 145 YIVTVGIGGQNS--TLIVDTGSDLTWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSP 201
Query: 119 FCSAFH-------LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR 171
C A L N + + CDY++ Y D S G L + +LT G
Sbjct: 202 TCVALQPTAGSSGLCSN---KNSTSCDYQIDYGDGSYSRGELGFE----KLTLGKTEIDN 254
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---GGY 228
IFGCG RN +G++GL + S++SQ S L +V +CL G G
Sbjct: 255 FIFGCG---RNNKGLFGGASGLMGLARSELSLVSQTSS--LFGSVFSYCLPTTGVGSSGS 309
Query: 229 LFLG----HDLVPSSGIAWTPMSRD-LLEKHYSSGPAELLFGGKSTGI------KGLQII 277
L LG + S I++T M ++ + Y + GG + + +G+ +
Sbjct: 310 LTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSL 369
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKG 305
DSG+ T + YK K G
Sbjct: 370 LDSGTVITRLSPSIYKAFKAEFEKQFSG 397
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 73/246 (29%), Positives = 109/246 (44%), Gaps = 30/246 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF- 119
GYY+ L IG P + + L +D+GS +T+V C A C C + + P +L + P
Sbjct: 89 GYYTTRLYIGTPSQEFALIVDSGSTVTYVPC-ATCEQCGNHQDPRFQP--DLSSTYSPVK 145
Query: 120 CSAFHLPENIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFG 175
C N+ C ++ QC YE YA+ SS GVL D + S L P R +FG
Sbjct: 146 C-------NVDCTCDNERSQCTYERQYAEMSSSSGVLGEDI--MSFGKESELKPQRAVFG 196
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC---LSVRGGGYLFLG 232
C N G++GLG G+ SI+ QL G+ + C + V GGG + LG
Sbjct: 197 C-ENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV-GGGTMVLG 254
Query: 233 HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI------KGLQIIFDSGSSYTY 286
P + S + +Y+ E+ GK+ + + DSG++Y Y
Sbjct: 255 GMPAPPDMV--FSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAY 312
Query: 287 FNSQAY 292
QA+
Sbjct: 313 LPEQAF 318
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/262 (28%), Positives = 110/262 (41%), Gaps = 31/262 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKNN----LVACN 116
Y VT +G P L++DTGSDL+WVQC PC C + L+ P + V C
Sbjct: 137 YVVTASLGTPGMAQTLEVDTGSDLSWVQCK-PCAAPSCYRQKDPLFDPAQSSSYAAVPCG 195
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C+ + + C A QC Y V Y D ++ GV +D L N ++ G +FGC
Sbjct: 196 RSACAGLGIYASA-CSAA-QCGYVVSYGDGSNTTGVYSSDTLTLA-ANATVQG--FLFGC 250
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--GYLFLGHD 234
G+ Q G G+LG G + S++ Q+ G V +CL + GYL LG
Sbjct: 251 GHAQS--GGLFTGIDGLLGFGREQPSLVQ--QTAGAYGGVFSYCLPTKSSTTGYLTLGGP 306
Query: 235 LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI---------IFDSGSSYT 285
SG+A + LL + ++ G S G + L + + D+G+ T
Sbjct: 307 ----SGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTVIT 362
Query: 286 YFNSQAYKTTLDLMRKDLKGKP 307
AY R + P
Sbjct: 363 RLPPAAYAALRSAFRSGMASYP 384
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/257 (29%), Positives = 109/257 (42%), Gaps = 30/257 (11%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCN----APCTG---CTLPPESL--YHP----KNNLV 113
+ IG P + + +DTGSDL W+ CN AP T +L + L Y+P + +
Sbjct: 104 IDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSSSKVF 163
Query: 114 ACNDPFCSAFHLPENIRCEA-NDQCDYEVLY-ADHGSSLGVLVTDHFPL------RLTNG 165
C+ C + C++ +QC Y V Y + + SS G+LV D L RL NG
Sbjct: 164 LCSHKLCGSAS-----DCDSPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218
Query: 166 -SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR 224
S + R++ GCG Q G++GLG + S+ S L GL RN C
Sbjct: 219 SSSVKARVVVGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEE 278
Query: 225 GGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSY 284
G ++ G D+ PS + P + Y G G DSG S+
Sbjct: 279 DSGRIYFG-DMGPSIQQS-APFLQLENNSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSF 336
Query: 285 TYFNSQAY-KTTLDLMR 300
TY + Y K L++ R
Sbjct: 337 TYLPEEIYRKVALEIDR 353
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 77/154 (50%), Gaps = 15/154 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
+ V +G PP + IDTGSDL WVQC PC C ++ P + ++ + P
Sbjct: 91 FLVNFSVGRPPVPQLVGIDTGSDLLWVQCR-PCADCFRQSTPIFDPSKSSTYVDLSYDSP 149
Query: 119 FCSAFHLPENIRCEAN--DQCDYEVLYADHGSSLGVLVTDHFPLRLTN-GSLLGPRLIFG 175
C P + + + N +QC Y YAD +S G L T+ ++ G++ ++FG
Sbjct: 150 IC-----PNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 204
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
CG++ R G +G+LGL G SI+S+L S
Sbjct: 205 CGHSNR--GRFDGQQSGILGLSAGDQSIVSRLGS 236
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 83/167 (49%), Gaps = 20/167 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNNL----VACND 117
Y VT+ IG P L DTGSDLTW QC PC G C E ++P ++ V+C+
Sbjct: 134 YIVTIGIGTPKHDISLMFDTGSDLTWTQCE-PCLGSCYSQKEPKFNPSSSSSYHNVSCSS 192
Query: 118 PFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
P C PE+ C A++ C Y + Y D ++G L + F LTN +L + FGCG
Sbjct: 193 PMCGN---PES--CSASN-CLYGIGYGDGSVTVGFLAKEKFT--LTNSDVL-DDIYFGCG 243
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR 224
N + +AG+LGLG GK S LQ+ N+ +C R
Sbjct: 244 ENNKGV---FIGSAGILGLGPGKFSF--PLQTTTTYNNIFSYCCGCR 285
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 80/295 (27%), Positives = 124/295 (42%), Gaps = 51/295 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT-GCTLPPESLYHPKNN----LVAC 115
G Y V + +G+P K Y + +DTGS +W+QC PCT C + + +++P + V C
Sbjct: 101 GNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQ-PCTIYCHIQEDPVFNPSASKTYKTVPC 159
Query: 116 -------------NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRL 162
N+P CS + ++ C Y+ Y D SLG L D L L
Sbjct: 160 SSSQCSSLKSATLNEPTCS----------KQSNACVYKASYGDSSFSLGYLSQDV--LTL 207
Query: 163 TNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
T L ++GCG + + + T G++GL + S+LSQL G N +CL
Sbjct: 208 TPSQTLS-SFVYGCGQDNQGLFGR---TDGIIGLANNELSMLSQLS--GKYGNAFSYCLP 261
Query: 223 VRGG-------GYLFLG-HDLVPSSGIAWTPMSRDLLEKH-YSSGPAELLFGGKSTGIKG 273
G+L +G L PSS +TP+ ++ Y + G+ G+
Sbjct: 262 TSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAA 321
Query: 274 ----LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
+ I DSG+ T + Y TTL + K + L C+KG+
Sbjct: 322 SSYKVPTIIDSGTVITRLPTPVY-TTLKNAYVTILSKKYQQAPGISLLDTCFKGS 375
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 92/327 (28%), Positives = 138/327 (42%), Gaps = 55/327 (16%)
Query: 25 SEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPL---------GYYSVTLKIGNPPKL 75
S+ QP K+S + A S P++G + G Y + + +G PPK
Sbjct: 153 SQKEQP---KQSYKPVVAAPAASRTTSPVSGQLVATLESGVSLGSGEYFMDVFVGTPPKH 209
Query: 76 YELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIR- 130
+ L +DTGSDL W+QC PC C Y PK++ ++C+DP C P+ +
Sbjct: 210 FSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLVSAPDPPKP 268
Query: 131 CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLT--NGS---LLGPRLIFGCGYNQRNPG 184
C+A +Q C Y Y D ++ G + F + LT NG+ ++FGCG+ R
Sbjct: 269 CKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVENVMFGCGHWNRGLF 328
Query: 185 PKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY-----LFLGHD--LVP 237
AG+LGLG G S SQ+QS L +CL R L G D L+
Sbjct: 329 HG---AAGLLGLGKGPLSFASQMQS--LYGQSFSYCLVDRNSNASVSSKLIFGEDKELLS 383
Query: 238 SSGIAWTP----------------MSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSG 281
+ +T + +++ P E + S G G I DSG
Sbjct: 384 HPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEE-TWHLSSEGAGG--TIIDSG 440
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPL 308
++ TYF AY+ + + +KG L
Sbjct: 441 TTLTYFAEPAYEIIKEAFVRKIKGYQL 467
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 79/171 (46%), Gaps = 20/171 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + +G P L +DTGSD+TW+QC PC C ++ P+++ + +
Sbjct: 132 GEYMAKIAVGTPAVEALLAMDTGSDITWLQCQ-PCRRCYPQSGPVFDPRHSTSYREMGYD 190
Query: 117 DPFCSAFHLPENIRCEANDQ----CDYEVLYADHGSSLGVLVTDHFPLRLT-NGSLLGPR 171
P C A R D C Y V Y D GS+ V D LT G + P
Sbjct: 191 APDCQALG-----RSGGGDAKRMTCVYAVGYGDDGSTT---VGDFIEETLTFAGGVQVPH 242
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
+ GCG++ N G P AG+LGLG G+ S SQ+ +LG +CL+
Sbjct: 243 MSIGCGHD--NKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLA 291
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 131/303 (43%), Gaps = 68/303 (22%)
Query: 44 RFGSTAVFPITGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP--CTGCT 99
R G+ + ++YP Y Y+ T+ +G PP+ + +DTGS L+WV C + C C+
Sbjct: 68 RQGTAPPPSVRASLYPHSYGGYAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCS 127
Query: 100 ----LPPESLYHPKNN----LVACNDPFCSAFHLPENIR-CEANDQC------------- 137
P ++HPKN+ L+ C +P C H P+++ C A C
Sbjct: 128 SLSAASPLHVFHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANAN 187
Query: 138 ----DYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGV 193
Y V+Y GS+ G+L++D LR ++ + GC + +PP +G+
Sbjct: 188 NVCPPYLVVYGS-GSTAGLLISDT--LRTPGRAVR--NFVIGCSLASVH---QPP--SGL 237
Query: 194 LGLGLGKASILSQLQSLGLTRNVLGHCL---------SVRGGGYLFLGHDLVPSSGIAWT 244
G G G S+ SQ LGLT+ +CL +V G L G+ +
Sbjct: 238 AGFGRGAPSVPSQ---LGLTK--FSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYA 292
Query: 245 PMSRDLLEK-----HYSSGPAELLFGGKSTGI---------KGLQIIFDSGSSYTYFNSQ 290
P++R + +Y + GGKS + G I DSG++++YF+
Sbjct: 293 PLARSASARPPYSVYYYLALTAITVGGKSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRT 352
Query: 291 AYK 293
++
Sbjct: 353 VFE 355
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 106/267 (39%), Gaps = 57/267 (21%)
Query: 35 KSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP 94
+ + ++++ + A P V+ Y + L+IG PP E +DTGS+L W QC P
Sbjct: 37 RRSNASSSRVSNTQAGSPYADTVFDTYEYLMKLQIGTPPFEVEAVLDTGSELIWTQC-LP 95
Query: 95 CTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVL 153
C C ++ P + S F + RC D C Y+++Y D + G L
Sbjct: 96 CLHCYDQKAPIFDPSKS---------STF---KETRCNTPDHSCPYKLVYDDKSYTQGTL 143
Query: 154 VTDHFPLRLTNGS-LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGL 212
T+ + T+G + P I GC N G + P ++G++GL G S++SQ+
Sbjct: 144 ATETVTIHSTSGVPFVMPETIIGCSRNNSGSGFR-PSSSGIVGLSRGSLSLISQM----- 197
Query: 213 TRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK 272
GG Y P G+ T M K G L S G
Sbjct: 198 ------------GGAY--------PGDGVVSTTM----FAKTAKRGQYYLNLDAVSVGDT 233
Query: 273 GLQ------------IIFDSGSSYTYF 287
++ I+ DSG+ TYF
Sbjct: 234 RIETVGTPFHALNGNIVIDSGTPLTYF 260
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 104/247 (42%), Gaps = 31/247 (12%)
Query: 52 PITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN 111
P V+ Y + L++G PP E IDTGS++TW QC PC C ++ P +
Sbjct: 369 PYADTVFDNSVYLMKLQVGTPPFEIEAVIDTGSEITWTQC-LPCVHCYKQNAPIFDPSKS 427
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGP 170
S F + RC + C YEV Y D + G L TD + T+G +
Sbjct: 428 ---------STF---KEKRCH-DHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMA 474
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF 230
I GCG RN P G +GL G S+++Q+ G ++ +C + G +
Sbjct: 475 ETIIGCG---RNNSWFRPSFEGFVGLNWGPLSLITQMG--GEYPGLMSYCFAGNGTSKIN 529
Query: 231 LGHD-LVPSSGIAWTPMSRDLLEKHY--------SSGPAEL-LFGGKSTGIKGLQIIFDS 280
G + +V G+ T M + S G + G ++G I+ DS
Sbjct: 530 FGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEG-NIVIDS 588
Query: 281 GSSYTYF 287
G++ TYF
Sbjct: 589 GTTLTYF 595
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 90/178 (50%), Gaps = 22/178 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKN----NLVACND 117
+ VT+ G P + Y + DTGSD++W+QC PC+G C + ++ P ++V C
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSATYSVVPCGH 193
Query: 118 PFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
P C+A + +C +N C Y+V Y D SS GVL H L LT+ L P FGCG
Sbjct: 194 PQCAA---ADGSKC-SNGTCLYKVEYGDGSSSAGVL--SHETLSLTSTRAL-PGFAFGCG 246
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQ-LQSLGLTRNVLGHCLSVRGG--GYLFLG 232
Q N G G++GLG G+ S+ SQ S G T +CL GYL +G
Sbjct: 247 --QTNLG-DFGDVDGLIGLGRGQLSLSSQAAASFGGT---FSYCLPSDNTTHGYLTIG 298
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 77/154 (50%), Gaps = 15/154 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
+ V +G PP + IDTGSDL WVQC PC C ++ P + ++ + P
Sbjct: 59 FLVNFSVGRPPVPQLVGIDTGSDLLWVQCR-PCADCFRQSTPIFDPSKSSTYVDLSYDSP 117
Query: 119 FCSAFHLPENIRCEAN--DQCDYEVLYADHGSSLGVLVTDHFPLRLTN-GSLLGPRLIFG 175
C P + + + N +QC Y YAD +S G L T+ ++ G++ ++FG
Sbjct: 118 IC-----PNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 172
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
CG++ R G +G+LGL G SI+S+L S
Sbjct: 173 CGHSNR--GRFDGQQSGILGLSAGDQSIVSRLGS 204
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 77/154 (50%), Gaps = 15/154 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
+ V +G PP + IDTGSDL WVQC PC C ++ P + ++ + P
Sbjct: 59 FLVNFSVGRPPVPQLVGIDTGSDLLWVQCR-PCADCFRQSTPIFDPSKSSTYVDLSYDSP 117
Query: 119 FCSAFHLPENIRCEAN--DQCDYEVLYADHGSSLGVLVTDHFPLRLTN-GSLLGPRLIFG 175
C P + + + N +QC Y YAD +S G L T+ ++ G++ ++FG
Sbjct: 118 IC-----PNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 172
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
CG++ R G +G+LGL G SI+S+L S
Sbjct: 173 CGHSNR--GRFDGQQSGILGLSAGDQSIVSRLGS 204
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 80/295 (27%), Positives = 124/295 (42%), Gaps = 51/295 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT-GCTLPPESLYHPKNN----LVAC 115
G Y V + +G+P K Y + +DTGS +W+QC PCT C + + +++P + V C
Sbjct: 101 GNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQ-PCTIYCHIQEDPVFNPSASKTYKTVPC 159
Query: 116 -------------NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRL 162
N+P CS + ++ C Y+ Y D SLG L D L L
Sbjct: 160 SSSQCSSLKSATLNEPTCS----------KQSNACVYKASYGDSSFSLGYLSQD--VLTL 207
Query: 163 TNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
T L ++GCG + + + T G++GL + S+LSQL G N +CL
Sbjct: 208 TPSQTLS-SFVYGCGQDNQGLFGR---TDGIIGLANNELSMLSQLS--GKYGNAFSYCLP 261
Query: 223 VRGG-------GYLFLG-HDLVPSSGIAWTPMSRDLLEKH-YSSGPAELLFGGKSTGIKG 273
G+L +G L PSS +TP+ ++ Y + G+ G+
Sbjct: 262 TSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAA 321
Query: 274 ----LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
+ I DSG+ T + Y TTL + K + L C+KG+
Sbjct: 322 SSYKVPTIIDSGTVITRLPTPVY-TTLKNAYVTILSKKYQQAPGISLLDTCFKGS 375
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/259 (26%), Positives = 106/259 (40%), Gaps = 30/259 (11%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN--------------L 112
+ IG P + + +D GSDL WV CN C C S Y +
Sbjct: 107 IDIGTPSVSFLVALDAGSDLLWVPCN--CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKH 164
Query: 113 VACNDPFCSAFHLPENIRCEANDQ-CDYEVLY-ADHGSSLGVLVTDHFPL-----RLTNG 165
++C+ C + C++ Q C Y + Y ++ SS G+L+ D L +N
Sbjct: 165 ISCSHNLCDSGQ-----SCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNC 219
Query: 166 SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG 225
++ P +I GCG Q G+ GLGLG+ S+LS L L +N C + G
Sbjct: 220 TIQAP-VILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDG 278
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYT 285
G +F G D P+S + + D + Y G + + DSG+S+T
Sbjct: 279 SGRIFFG-DEGPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFT 337
Query: 286 YFNSQAYKTTLDLMRKDLK 304
Y +AY+ + K L
Sbjct: 338 YLPEEAYENIVIEFDKRLN 356
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/260 (26%), Positives = 110/260 (42%), Gaps = 29/260 (11%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL----------YHPKNNL---- 112
+ IG P + + +D GSDL W+ C+ C C S Y P +L
Sbjct: 101 IDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLSSSYYSNLDRDLNEYSPSRSLSSKH 158
Query: 113 VACNDPFCSAFHLPENIRCEAN-DQCDYEVLY-ADHGSSLGVLVTDHFPLR----LTNGS 166
++C+ C + C+++ QC Y V Y +++ SS G+LV D L+ L+N S
Sbjct: 159 LSCSHRLCD-----KGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGTLSNSS 213
Query: 167 LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG 226
+ P ++ GCG Q G+LGLG G++S+ S L GL C +
Sbjct: 214 VQAP-VVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSGLIHYSFSLCFNEDDS 272
Query: 227 GYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTY 286
G +F G D P+S + + + D L Y G G + + DSG+S+T+
Sbjct: 273 GRMFFG-DQGPTSQQSTSFLPLDGLYSTYIIGVESCCIGNSCLKMTSFKAQVDSGTSFTF 331
Query: 287 FNSQAYKTTLDLMRKDLKGK 306
Y + + + G
Sbjct: 332 LPGHVYGAITEEFDQQVNGS 351
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 117/285 (41%), Gaps = 35/285 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V L IG PP Y +DTGSDL W QC APC C P + K + + C
Sbjct: 87 GEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCR 145
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGPRLIFG 175
C++ P + C Y+ Y D S+ GVL + F N + + + FG
Sbjct: 146 SSRCASLSSPSCFK----KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFG 201
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDL 235
CG N G ++G++G G G S++SQL + + + + Y + +L
Sbjct: 202 CG--SLNAG-DLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANL 258
Query: 236 VPSSGIAWTPMSRD-------------LLEKHYSSG----PAE-LLFGGKSTGIKGLQII 277
++ + +P+ L K S G P + L+F G G +I
Sbjct: 259 SSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGG--VI 316
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
DSG+S T+ AY+ + + + DT + L C++
Sbjct: 317 IDSGTSITWLQQDAYEAVRRGLVSAIPLTAMNDT--DIGLDTCFQ 359
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 87/287 (30%), Positives = 116/287 (40%), Gaps = 37/287 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES--LYHPKNN----LVACN 116
Y VTL IG P + IDTGSDL+WVQC PC P+ L+ P + + C
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCNASDCYPQKDPLFDPSKSSTFATIPCA 183
Query: 117 DPFCSAFHLPE---NIRCEAND-----QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
C LP + C N QC Y + Y + + GV T+ L S +
Sbjct: 184 SDACK--QLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLAL---GSSAV 238
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV--RGG 226
FGCG +Q P K G+LGLG S++SQ S + +CL G
Sbjct: 239 VKSFRFGCGSDQHGPYDK---FDGLLGLGGAPESLVSQTAS--VYGGAFSYCLPPLNSGA 293
Query: 227 GYLFLG---HDLVPSSGIAWTPMS--RDLLEKHYSSGPAELLFGGKSTGIKGLQI----I 277
G+L LG +SG +TPM + Y + GGK+ I I
Sbjct: 294 GFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAKGNI 353
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
DSG+ T + AYK R + PL A+ AL C+ T
Sbjct: 354 VDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADS-ALDTCYNFT 399
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 79/267 (29%), Positives = 124/267 (46%), Gaps = 31/267 (11%)
Query: 49 AVFPITGNVYPLGYYSVTLKIGNP-PKLYELDIDTGSDLTWVQCNAPCTGC-TLPPESLY 106
+ FP+ G+V GYY + +G+P P+ +++ +DTGS LT+V C A C C T + +
Sbjct: 98 STFPLHGSVKEHGYYYANIALGDPSPRTFQVIVDTGSTLTYVPC-ATCAKCGTHTGGTRF 156
Query: 107 HPKNNLVACNDPFCSAFHLP---ENIRCEANDQCDYEVLYADHGSSLGVLVTD--HFPLR 161
P + C + C A P R A ++C Y YA+ G LV D HF
Sbjct: 157 DPTGKWLTCQEKQCKAAGGPGICAGGRGAAANRCTYSRTYAEGSGVSGDLVRDKMHFGGD 216
Query: 162 L---TNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGK-ASILSQL-QSLGLTRNV 216
+ TNG+L ++FGC N + G++GLG + ASI +QL + GL R V
Sbjct: 217 IAPATNGTL---DVVFGC-TNAESGTIHDQEADGLIGLGNNQFASIPNQLADTHGLPR-V 271
Query: 217 LGHCL-SVRGGGYLFLGHDLVPSS----GIAWTPMS-RDLLEKHYSSGPAELLFGGKSTG 270
C S GGG L G +P++ + +T M + +Y A + G +
Sbjct: 272 FSLCFGSFEGGGALSFGR--LPATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVA 329
Query: 271 IK-----GLQIIFDSGSSYTYFNSQAY 292
G + DSG+++TY ++ +
Sbjct: 330 TPSDLAVGYGTVMDSGTTFTYVPTKVF 356
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 69/259 (26%), Positives = 106/259 (40%), Gaps = 30/259 (11%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN--------------L 112
+ IG P + + +D GSDL WV CN C C S Y +
Sbjct: 107 IDIGTPSVSFLVALDAGSDLLWVPCN--CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKH 164
Query: 113 VACNDPFCSAFHLPENIRCEANDQ-CDYEVLY-ADHGSSLGVLVTDHFPL-----RLTNG 165
++C+ C + C++ Q C Y + Y ++ SS G+L+ D L +N
Sbjct: 165 ISCSHNLCDSGQ-----SCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNC 219
Query: 166 SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG 225
++ P +I GCG Q G+ GLGLG+ S+LS L L +N C + G
Sbjct: 220 TIQAP-VILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDG 278
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYT 285
G +F G D P+S + + D + Y G + + DSG+S+T
Sbjct: 279 SGRIFFG-DEGPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFT 337
Query: 286 YFNSQAYKTTLDLMRKDLK 304
Y +AY+ + K L
Sbjct: 338 YLPEEAYENIVIEFDKRLN 356
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 90/336 (26%), Positives = 138/336 (41%), Gaps = 71/336 (21%)
Query: 35 KSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA- 93
K+ QS + + ++FP + G YSV+L G PP+ DTGS L W C A
Sbjct: 109 KTPQSKSNTSIQNVSLFPRS-----YGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAG 163
Query: 94 -PCTGCTLP---PESL--YHPK----NNLVACNDPFCSAFHLPE-NIRC--------EAN 134
C+ C+ P P ++ + PK +V C +P C+ P RC + +
Sbjct: 164 YRCSRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCS 223
Query: 135 DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVL 194
D C L G++ G+L+++ L L N + P + GC + +P AG+
Sbjct: 224 DSCPGYGLQYGSGATAGILLSET--LDLENKRV--PDFLVGCSVMSVH---QP---AGIA 273
Query: 195 GLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSG----------IAWT 244
G G G S+ SQ++ L R HCL RG + LV SG +
Sbjct: 274 GFGRGPESLPSQMR---LKR--FSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYA 328
Query: 245 P------MSRDLLEKHYSSGPAELLFGGK------------STGIKGLQIIFDSGSSYTY 286
P +S ++Y +L GGK STG G I DSGS++T+
Sbjct: 329 PFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGG--AIIDSGSTFTF 386
Query: 287 FNSQAYKTTLDLMRKDLKGKP-LEDTAEEKALPVCW 321
+ ++ D + K L P +D + L C+
Sbjct: 387 LDKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCF 422
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 72/260 (27%), Positives = 110/260 (42%), Gaps = 30/260 (11%)
Query: 54 TGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE--------- 103
T + LG+ + T+++G P + + +DTGSDL WV C+ CT C+
Sbjct: 91 TFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCD--CTRCSATRSSAFASALAS 148
Query: 104 ----SLYHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYAD-HGSSLGVLV 154
S+Y+P + V CN+ C+ H + + +N C Y V Y S+ G+LV
Sbjct: 149 DFDLSVYNPNGSSTSKKVTCNNSLCT--HRNQCLGTFSN--CPYMVSYVSAETSTSGILV 204
Query: 155 TD--HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGL 212
D H N L+ +IFGCG Q G+ GLG+ K S+ S L G
Sbjct: 205 EDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGF 264
Query: 213 TRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK 272
T + C G G + G S TP + + Y+ ++ G ++
Sbjct: 265 TADSFSMCFGRDGIGRISFGDK--GSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVE 322
Query: 273 GLQIIFDSGSSYTYFNSQAY 292
+FDSG+S+TY Y
Sbjct: 323 -FTALFDSGTSFTYLVDPTY 341
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 75/264 (28%), Positives = 105/264 (39%), Gaps = 35/264 (13%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN- 111
+G + G Y + +G P L IDTGSDL W+QC+ PC C ++ P+ +
Sbjct: 76 FSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCS-PCRRCYAQRGQVFDPRRSS 134
Query: 112 ---LVACNDPFCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
V C+ P C A P + A C Y V Y D SS G L TD L N +
Sbjct: 135 TYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDK--LAFANDTY 192
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG-- 225
+ + GCG + AG+LG+ GK SI +Q+ +V +CL R
Sbjct: 193 VN-NVTLGCGRDNEGLFDS---AAGLLGVARGKISISTQVAP--AYGSVFEYCLGDRTSR 246
Query: 226 ---GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK--------------S 268
YL G P S +S Y A GG+ +
Sbjct: 247 STRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTA 306
Query: 269 TGIKGLQIIFDSGSSYTYFNSQAY 292
TG G ++ DSG++ + F AY
Sbjct: 307 TGRGG--VVVDSGTAISRFARDAY 328
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 62/124 (50%), Gaps = 10/124 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + IGNP + Y L++DTGSD+TW+QC APC+ C + +Y P N+ V C
Sbjct: 10 GEYFARMGIGNPQRSYYLELDTGSDVTWIQC-APCSSCYSQVDPIYDPSNSSSYRRVYCG 68
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A C+ C Y V+Y D +S G L + F L N S + FGC
Sbjct: 69 SALCQALDYSA---CQGMG-CSYRVVYGDSSASSGDLGIESFYLG-PNSSTAMRNIAFGC 123
Query: 177 GYNQ 180
G++
Sbjct: 124 GHSN 127
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 86/194 (44%), Gaps = 19/194 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNNL----VAC 115
G Y VT+ +G+P + DTGSDLTW QC PC G C E ++ P +L V+C
Sbjct: 87 GNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCE-PCVGYCYQQREHIFDPSTSLSYSNVSC 145
Query: 116 NDPFCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
+ P C N ++ C Y + Y D S+G + L T+ + F
Sbjct: 146 DSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD---VFNNFQF 202
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL--SVRGGGYLFL 231
GCG N R TAG+LGL S++SQ Q G V +CL S GYL
Sbjct: 203 GCGQNNRGLFGG---TAGLLGLARNPLSLVSQTAQKYG---KVFSYCLPSSSSSTGYLSF 256
Query: 232 GHDLVPSSGIAWTP 245
G S + +TP
Sbjct: 257 GSGDGDSKAVKFTP 270
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 115/265 (43%), Gaps = 33/265 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP-------PESLYHPKNNLV 113
G Y VT+ +G P K + L DTGSDLTW QC PC G P P + KN V
Sbjct: 138 GAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCE-PCLGGCFPQNQPKFDPTTSTSYKN--V 194
Query: 114 ACNDPFCSAF---HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
+C+ FC + P C +N C Y + Y G ++G L T+ + ++ +
Sbjct: 195 SCSSEFCKLIAEGNYPAQ-DCISN-TCLYGIQYGS-GYTIGFLATETLAIASSD---VFK 248
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLG-LGLGKASILSQLQSLGLTRNVLGHCL--SVRGGG 227
+FGC R G G LGLG++ I Q+ +N+ +CL S G
Sbjct: 249 NFLFGCSEESRG------TFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPASPSSTG 302
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG--LQIIFDSGSSYT 285
+L G ++ S TP+S L++ Y + G+ I G + I DSG+++T
Sbjct: 303 HLSFGVEV--SQAAKSTPISPK-LKQLYGLNTVGISVRGRELPINGSISRTIIDSGTTFT 359
Query: 286 YFNSQAYKTTLDLMRKDLKGKPLED 310
+ S Y R+ + L +
Sbjct: 360 FLPSPTYSALGSAFREMMANYTLTN 384
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 77/255 (30%), Positives = 114/255 (44%), Gaps = 32/255 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKN----NLVACND 117
+ VT+ G P + Y L DTGSD++W+QC PC+G C + ++ P + V C
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSATYSAVPCGH 178
Query: 118 PFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
P C+A +C +N C Y+V Y D S+ GVL H L LT+ L P FGCG
Sbjct: 179 PQCAA----AGGKCSSNGTCLYKVQYGDGSSTAGVL--SHETLSLTSARAL-PGFAFGCG 231
Query: 178 -YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV--RGGGYLFLGHD 234
N + G G++GLG G+ S+ SQ + +CL GYL +G
Sbjct: 232 ETNLGDFG----DVDGLIGLGRGQLSLSSQAAA--SFGAAFSYCLPSYNTSHGYLTIG-T 284
Query: 235 LVPSS---GIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGIKGLQI-----IFDSGSSYT 285
P+S G+ +T M + Y ++ GG + + + DSG+ T
Sbjct: 285 TTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLLDSGTVLT 344
Query: 286 YFNSQAYKTTLDLMR 300
Y +AY D +
Sbjct: 345 YLPPEAYTALRDRFK 359
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 58/124 (46%), Gaps = 12/124 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y L +G PPK + +DTGSD+ W+QC APC C + ++ PK + ++C
Sbjct: 145 GEYFTRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISCR 203
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
P C P C + C Y+V Y D + G T+ R T P++ GC
Sbjct: 204 SPLCLRLDSPG---CNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTR----VPKVALGC 256
Query: 177 GYNQ 180
G++
Sbjct: 257 GHDN 260
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 109/245 (44%), Gaps = 23/245 (9%)
Query: 62 YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---------LYHPKNNL 112
+Y+V + +G P + + +DTGSDL WV C+ C C P +S +Y P +
Sbjct: 35 HYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCA-PFQSPNYGSLKFDVYSPAQST 90
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTNGS----L 167
+ P S +N ++ C Y + Y +D+ SS GVLV D L LT+ S +
Sbjct: 91 TSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDV--LYLTSDSAQSKI 148
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
+ ++FGCG Q G+LGLG+ S+ S L S GL N C G G
Sbjct: 149 VTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHG 208
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
+ G SS TP++ +Y+ + G KS + I DSG+S+T
Sbjct: 209 RINFGD--TGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 265
Query: 288 NSQAY 292
+ Y
Sbjct: 266 SDPMY 270
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 116/271 (42%), Gaps = 55/271 (20%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDPFC 120
V+L IG PP+ ++ +DTGS L+W+QC+ PP +++ P +++ CN P C
Sbjct: 79 VSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPR-KPPPSTVFDPSLSSSFSVLPCNHPLC 137
Query: 121 SA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
F LP + C+ N C Y YAD + G LV + + + P LI GC
Sbjct: 138 KPRIPDFTLPTS--CDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQST---PPLILGC 192
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGG----GYL 229
+ + G+LG+ LG+ S SQ + +T+ +C+ VR G G
Sbjct: 193 AEDASD-------DKGILGMNLGRLSFASQAK---ITK--FSYCVPTRQVRPGFTPTGSF 240
Query: 230 FLGHDLVPSSG----IAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI--------- 276
+LG + P+S I+ S+ + + G G K L I
Sbjct: 241 YLGEN--PNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADP 298
Query: 277 ------IFDSGSSYTYFNSQAY-KTTLDLMR 300
+ DSGS +TY AY K +++R
Sbjct: 299 SGAGQSMIDSGSEFTYLVDVAYNKVREEVVR 329
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 86/194 (44%), Gaps = 19/194 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNNL----VAC 115
G Y VT+ +G+P + DTGSDLTW QC PC G C E ++ P +L V+C
Sbjct: 145 GNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCE-PCVGYCYQQREHIFDPSTSLSYSNVSC 203
Query: 116 NDPFCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
+ P C N ++ C Y + Y D S+G + L T+ + F
Sbjct: 204 DSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD---VFNNFQF 260
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL--SVRGGGYLFL 231
GCG N R TAG+LGL S++SQ Q G V +CL S GYL
Sbjct: 261 GCGQNNRGLFGG---TAGLLGLARNPLSLVSQTAQKYG---KVFSYCLPSSSSSTGYLSF 314
Query: 232 GHDLVPSSGIAWTP 245
G S + +TP
Sbjct: 315 GSGDGDSKAVKFTP 328
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 64/208 (30%), Positives = 95/208 (45%), Gaps = 19/208 (9%)
Query: 35 KSTQSTAAHRFGSTAVFPIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
KS + RF + P+ G G Y V + G+P + Y + +DTGS L+W+QC
Sbjct: 89 KSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKP 148
Query: 94 PCTGCTLPPESLYHPKNNL----VACNDPFCSAF--HLPENIRCE-ANDQCDYEVLYADH 146
C + + L+ P + ++C CS+ N CE +++ C Y Y D
Sbjct: 149 CVVYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDS 208
Query: 147 GSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQ 206
S+G L D L L L P ++GCG + + AG+LGLG K S+L Q
Sbjct: 209 SYSMGYLSQDL--LTLAPSQTL-PGFVYGCGQDSDGLFGR---AAGILGLGRNKLSMLGQ 262
Query: 207 LQS-LGLTRNVLGHCLSVR-GGGYLFLG 232
+ S G +CL R GGG+L +G
Sbjct: 263 VSSKFGY---AFSYCLPTRGGGGFLSIG 287
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 86/315 (27%), Positives = 126/315 (40%), Gaps = 43/315 (13%)
Query: 20 FQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELD 79
F S AN + ST S + P+ N G Y + + +G PP
Sbjct: 64 FHRSISRANHFRANGVSTNSIQS---------PVISNN---GEYLMNISLGTPPVSMHGI 111
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPENIRCEAND 135
DTGSDL W QC PC C E ++ P + +++C CS +L C ++
Sbjct: 112 ADTGSDLLWRQCK-PCDSCYEQIEPIFDPAKSKTYQILSCEGKSCS--NLGGQGGCSDDN 168
Query: 136 QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVL 194
C Y Y D + G L D + T G + P+++FGCG+N N G +G++
Sbjct: 169 TCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVFGCGHN--NGGTFELHGSGLV 226
Query: 195 GLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG------YLFLGHDLVPSSGIAWTPMSR 248
GLG G S++SQL+ L R +CL G F +V +G TP++
Sbjct: 227 GLGGGPLSMISQLRPLIGGR--FSYCLVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPLAS 284
Query: 249 DLLEKHYSSGPAELLFGGKSTGIKGL-------------QIIFDSGSSYTYFNSQAYKTT 295
+ Y + G K KG II DSG++ T Y T
Sbjct: 285 RQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTL 344
Query: 296 LDLMRKDLKGKPLED 310
+ + GKP+ D
Sbjct: 345 ESNVVSAIGGKPVRD 359
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 88/264 (33%), Positives = 123/264 (46%), Gaps = 42/264 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G + + + IG P + +DTGSDLTW QC PCT C P +Y P + V C+
Sbjct: 113 GEFLMKMAIGTPSLSFSAILDTGSDLTWTQCK-PCTDCYPQPTPIYDPSQSSTYSKVPCS 171
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A LP AN C+Y Y D S+ G+L + F LT+ SL P + FGC
Sbjct: 172 SSMCQA--LPMYSCSGAN--CEYLYSYGDQSSTQGILSYESF--TLTSQSL--PHIAFGC 223
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL-----SVRGGGYLF 230
G Q N G G++G G G S++SQL QSLG N +CL S LF
Sbjct: 224 G--QENEGGGFSQGGGLVGFGRGPLSLISQLGQSLG---NKFSYCLVSITDSPSKTSPLF 278
Query: 231 LGHDL-VPSSGIAWTPM--SRDLLEKHYSSGPAELLFGGK----STGIKGLQ------II 277
+G + + ++ TP+ SR +Y S + GG+ + G LQ +I
Sbjct: 279 IGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEG-ISVGGQLLDIADGTFDLQLDGTGGVI 337
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRK 301
DSG++ TY Y D+++K
Sbjct: 338 IDSGTTVTYLEQSGY----DVVKK 357
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 73/266 (27%), Positives = 105/266 (39%), Gaps = 28/266 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + +G P K L DTGSDLTW QC C + ++ P + ++C
Sbjct: 152 GNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCT 211
Query: 117 DPFCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
CS N ++ C Y + Y D ++G D L LT + +FG
Sbjct: 212 STACSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKD--TLTLTQNDVF-DGFMFG 268
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLSVRGG--GYLFLG 232
CG N R K TAG++GLG SI+ Q Q G +CL G G+L G
Sbjct: 269 CGQNNRGLFGK---TAGLIGLGRDPLSIVQQTAQKFG---KYFSYCLPTSRGSNGHLTFG 322
Query: 233 H------DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSG 281
+ +GI +TP + Y + GGK+ I + I DSG
Sbjct: 323 NGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQNAGTIIDSG 382
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKP 307
+ T S Y + ++ + P
Sbjct: 383 TVITRLPSTVYGSLKSTFKQFMSKYP 408
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 109/245 (44%), Gaps = 23/245 (9%)
Query: 62 YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---------LYHPKNNL 112
+Y+V + +G P + + +DTGSDL WV C+ C C P +S +Y P +
Sbjct: 62 HYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCA-PFQSPNYGSLKFDVYSPAQST 117
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTNGS----L 167
+ P S +N ++ C Y + Y +D+ SS GVLV D L LT+ S +
Sbjct: 118 TSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDV--LYLTSDSAQSKI 175
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
+ ++FGCG Q G+LGLG+ S+ S L S GL N C G G
Sbjct: 176 VTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHG 235
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
+ G SS TP++ +Y+ + G KS + I DSG+S+T
Sbjct: 236 RINFGD--TGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 292
Query: 288 NSQAY 292
+ Y
Sbjct: 293 SDPMY 297
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 109/245 (44%), Gaps = 23/245 (9%)
Query: 62 YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---------LYHPKNNL 112
+Y+V + +G P + + +DTGSDL WV C+ C C P +S +Y P +
Sbjct: 99 HYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCA-PLQSPNYGSLKFDVYSPAQST 154
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTNGS----L 167
+ P S +N ++ C Y + Y +D+ SS GVLV D L LT+ S +
Sbjct: 155 TSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDV--LYLTSDSAQSKI 212
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
+ ++FGCG Q G+LGLG+ S+ S L S GL N C G G
Sbjct: 213 VTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHG 272
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
+ G SS TP++ +Y+ + G KS + I DSG+S+T
Sbjct: 273 RINFGD--TGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 329
Query: 288 NSQAY 292
+ Y
Sbjct: 330 SDPMY 334
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 109/245 (44%), Gaps = 23/245 (9%)
Query: 62 YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---------LYHPKNNL 112
+Y+V + +G P + + +DTGSDL WV C+ C C P +S +Y P +
Sbjct: 99 HYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCA-PFQSPNYGSLKFDVYSPAQST 154
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTNGS----L 167
+ P S +N ++ C Y + Y +D+ SS GVLV D L LT+ S +
Sbjct: 155 TSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDV--LYLTSDSAQSKI 212
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
+ ++FGCG Q G+LGLG+ S+ S L S GL N C G G
Sbjct: 213 VTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHG 272
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
+ G SS TP++ +Y+ + G KS + I DSG+S+T
Sbjct: 273 RINFGD--TGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 329
Query: 288 NSQAY 292
+ Y
Sbjct: 330 SDPMY 334
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 62/187 (33%), Positives = 84/187 (44%), Gaps = 26/187 (13%)
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHL-------PEN 128
+DT S+LTWVQC PC C + L+ P ++ V CN C A + P
Sbjct: 135 VDTASELTWVQCQ-PCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALRVAMAAGTSPCA 193
Query: 129 IRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPP 188
E C Y + Y D S GVL D LRL + G +FGCG + N G
Sbjct: 194 DDNEQQPACSYALSYRDGSYSRGVLARDK--LRLAGQDIEG--FVFGCGTS--NQGAPFG 247
Query: 189 PTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFLGHDLVP---SSGIA 242
T+G++GLG S++S Q++ V +CL +R G L LG D S+ I
Sbjct: 248 GTSGLMGLGRSHVSLVS--QTMDQFGGVFSYCLPMRESGSSGSLVLGDDSSAYRNSTPIV 305
Query: 243 WTPMSRD 249
+T M D
Sbjct: 306 YTAMVSD 312
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 74/151 (49%), Gaps = 19/151 (12%)
Query: 70 GNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP---------KNNLVACNDPFC 120
G+P + +DTGSDLTWVQC PC+ C + L+ P + N AC D
Sbjct: 103 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACADSLR 161
Query: 121 SAFHLPENIRC--EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
+A P + +++C Y + Y D S GVL TD L G+ LG +FGCG
Sbjct: 162 AATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL---GGASLGG-FVFGCGL 217
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
+ R TAG++GLG + S++SQ S
Sbjct: 218 SNRG---LFGGTAGLMGLGRTELSLVSQTAS 245
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 109/245 (44%), Gaps = 23/245 (9%)
Query: 62 YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES---------LYHPKNNL 112
+Y+V + +G P + + +DTGSDL WV C+ C C P +S +Y P +
Sbjct: 76 HYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCA-PFQSPNYGSLKFDVYSPAQST 131
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTNGS----L 167
+ P S +N ++ C Y + Y +D+ SS GVLV D L LT+ S +
Sbjct: 132 TSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDV--LYLTSDSAQSKI 189
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
+ ++FGCG Q G+LGLG+ S+ S L S GL N C G G
Sbjct: 190 VTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHG 249
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
+ G SS TP++ +Y+ + G KS + I DSG+S+T
Sbjct: 250 RINFGD--TGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 306
Query: 288 NSQAY 292
+ Y
Sbjct: 307 SDPMY 311
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 79/261 (30%), Positives = 112/261 (42%), Gaps = 37/261 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKN----NLVACN 116
Y VT+ +G P L++DTGSD++WVQC PC C + L+ P + V C
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCK-PCPSPPCYSQRDPLFDPTRSSSYSAVPCA 200
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CS L N C QC Y V Y D ++ GV +D L +N +L G +FGC
Sbjct: 201 AASCSQLALYSN-GCSGG-QCGYVVSYGDGSTTTGVYSSDTLTLTGSN-ALKG--FLFGC 255
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHD 234
G+ Q+ G+LGLG S++SQ S V +CL + GY+ LG
Sbjct: 256 GHAQQGLFAG---VDGLLGLGRQGQSLVSQASS--TYGGVFSYCLPPTQNSVGYISLGG- 309
Query: 235 LVPSS--GIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI---------IFDSGSS 283
PSS G + TP+ + Y ++ G S G + L I + D+G+
Sbjct: 310 --PSSTAGFSTTPLLTASNDPTYY----IVMLAGISVGGQPLSIDASVFASGAVVDTGTV 363
Query: 284 YTYFNSQAYKTTLDLMRKDLK 304
T AY R +
Sbjct: 364 VTRLPPTAYSALRSAFRAAMA 384
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 82/290 (28%), Positives = 120/290 (41%), Gaps = 45/290 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V L IG PP Y +DTGSDL W QC APC C P + K + + C
Sbjct: 87 GEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCAAQPTPYFDVKRSATYRALPCR 145
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGPRLIFG 175
C+A P + C Y+ Y D S+ GVL + F + + + + FG
Sbjct: 146 SSRCAALSSPSCFK----KMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISFG 201
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFLG 232
CG N G + ++G++G G G S++SQ LG +R +CL+ L+ G
Sbjct: 202 CG--SLNAG-ELANSSGMVGFGRGPLSLVSQ---LGPSR--FSYCLTSYLSPTPSRLYFG 253
Query: 233 -----HDLVPSSG--IAWTPMSRD--------LLEKHYSSGPAE-----LLFGGKSTGIK 272
+ SSG + TP + L K S G L+F G
Sbjct: 254 VFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTG 313
Query: 273 GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
G +I DSG+S T+ AY+ + + + DT + L C++
Sbjct: 314 G--VIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDT--DIGLDTCFQ 359
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 79/261 (30%), Positives = 112/261 (42%), Gaps = 37/261 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKN----NLVACN 116
Y VT+ +G P L++DTGSD++WVQC PC C + L+ P + V C
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCK-PCPSPPCYSQRDPLFDPTRSSSYSAVPCA 189
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
CS L N C QC Y V Y D ++ GV +D L +N +L G +FGC
Sbjct: 190 AASCSQLALYSN-GCSGG-QCGYVVSYGDGSTTTGVYSSDTLTLTGSN-ALKG--FLFGC 244
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHD 234
G+ Q+ G+LGLG S++SQ S V +CL + GY+ LG
Sbjct: 245 GHAQQGLFAG---VDGLLGLGRQGQSLVSQASS--TYGGVFSYCLPPTQNSVGYISLGG- 298
Query: 235 LVPSS--GIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI---------IFDSGSS 283
PSS G + TP+ + Y ++ G S G + L I + D+G+
Sbjct: 299 --PSSTAGFSTTPLLTASNDPTYY----IVMLAGISVGGQPLSIDASVFASGAVVDTGTV 352
Query: 284 YTYFNSQAYKTTLDLMRKDLK 304
T AY R +
Sbjct: 353 VTRLPPTAYSALRSAFRAAMA 373
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 117/285 (41%), Gaps = 35/285 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V L IG PP Y +DTGSDL W QC APC C P + K + + C
Sbjct: 87 GEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCR 145
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGPRLIFG 175
C++ P + C Y+ Y D S+ GVL + F N + + + FG
Sbjct: 146 SSRCASLSSPSCFK----KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFG 201
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDL 235
CG N G ++G++G G G S++SQL + + + + Y + +L
Sbjct: 202 CG--SLNAG-DLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANL 258
Query: 236 VPSSGIAWTPMSRD-------------LLEKHYSSG----PAE-LLFGGKSTGIKGLQII 277
++ + +P+ L K S G P + L+F G G +I
Sbjct: 259 SSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGG--VI 316
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
DSG+S T+ AY+ + + + DT + L C++
Sbjct: 317 IDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDT--DIGLDTCFQ 359
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 86/307 (28%), Positives = 129/307 (42%), Gaps = 31/307 (10%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWV 89
+K S + A +AV T + Y LG Y +T+ IG P + IDTGSD++WV
Sbjct: 96 AKVSSRYNNVAKELQQSAVTIPTSSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWV 155
Query: 90 QCNAPCTG--CTLPPESLYHPKNNLV----ACNDPFCSAFHLPENIRCEANDQCDYEVLY 143
QC APC C+ + L+ P + +C C+ L + QC Y V Y
Sbjct: 156 QC-APCAAQSCSSQKDKLFDPAMSATYSAFSCGSAQCA--QLGDEGNGCLKSQCQYIVKY 212
Query: 144 ADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASI 203
D ++ G +D L LT+ + FGC + R G G++GLG S+
Sbjct: 213 GDGSNTAGTYGSD--TLSLTSSDAV-KSFQFGC--SHRAAG-FVGELDGLMGLGGDTESL 266
Query: 204 LSQLQSLGLTRNVLGHCL---SVRGGGYLFLGHD-LVPSSGIAWTPMSRDLLEKHYSSGP 259
+S Q+ +CL S GGG+L LG SS + TPM R + Y
Sbjct: 267 VS--QTAATYGKAFSYCLPPPSSSGGGFLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFL 324
Query: 260 AELLFGGK-----STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEE 314
+ G ++ G ++ DSG+ T AY+ +K++K P A
Sbjct: 325 QGITVAGTMLNVPASVFSGASVV-DSGTVITQLPPTAYQALRTAFKKEMKAYP--SAAPV 381
Query: 315 KALPVCW 321
+L C+
Sbjct: 382 GSLDTCF 388
>gi|217073142|gb|ACJ84930.1| unknown [Medicago truncatula]
Length = 191
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 73/153 (47%), Gaps = 19/153 (12%)
Query: 33 KKKSTQSTAAH-------RFGSTAVFPITGNVYPL--GYYSVTLKIGNPPKLYELDIDTG 83
++K+T S H RF S+ F + GN P G Y L +G+P K Y + +DTG
Sbjct: 31 RRKTTLSGIKHHDHHRRGRFLSSVDFNLGGNGLPTRTGLYFTKLGLGSPKKDYYVQVDTG 90
Query: 84 SDLTWVQCNAPCTGCTLPPE-----SLYHPK----NNLVACNDPFCSAFHLPENIRCEAN 134
SD+ WV C C+ C + +LY PK + L++C+ FCS+ + C A
Sbjct: 91 SDILWVNC-VECSRCPTKSQIGMDLTLYDPKGSHTSELISCDHEFCSSTYDGPIPGCRAE 149
Query: 135 DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
C Y + Y D ++ G V D+ NG+L
Sbjct: 150 TPCPYSITYGDGSATTGYYVRDYLTFDRINGNL 182
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 135/323 (41%), Gaps = 39/323 (12%)
Query: 27 ANQPPSKKKST---QSTAA------HRFG--STAVFPIT-GNVYPLGYYSVTLKIGNPPK 74
A+ PPS++ ++ Q AA H S A P++ G +G Y L +G P
Sbjct: 86 ASDPPSRRPTSLRKQKKAAGGASGGHHLDDDSLASVPLSPGTSVGVGNYVTQLGLGTPST 145
Query: 75 LYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLP--EN 128
Y + +DTGS LTW+QC+ C L+ P+ + V C+ C
Sbjct: 146 SYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCSASQCDELQAATLNP 205
Query: 129 IRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPP 188
C A++ C Y+ Y D S+G L TD ++ GS P +GCG + +
Sbjct: 206 SACSASNVCIYQASYGDSSFSVGSLSTD----TVSFGSTRYPSFYYGCGQDNEGLFGR-- 259
Query: 189 PTAGVLGLGLGKASILSQLQ-SLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPM 246
+AG++GL K S+L QL SLG + +CL + GYL +G ++TPM
Sbjct: 260 -SAGLIGLARNKLSLLYQLAPSLGYS---FSYCLPTAASTGYLSIG-PYNTGHYYSYTPM 314
Query: 247 SRDLLEKH-YSSGPAELLFGGKSTGI-----KGLQIIFDSGSSYTYFNSQAYKTTLDLMR 300
+ L+ Y + + GG + L I DSG+ T + + +
Sbjct: 315 ASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTAVHTALSKAVA 374
Query: 301 KDLKGKPLEDTAEEKALPVCWKG 323
+ + G + L C++G
Sbjct: 375 QAMAGA--QRAPAFSILDTCFEG 395
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 69/264 (26%), Positives = 104/264 (39%), Gaps = 20/264 (7%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES----------LYHPKNNL 112
Y + +G P + + +DTGSDL WV C+ C C P S +Y P +
Sbjct: 100 YYAWVDVGTPTTSFLVALDTGSDLFWVPCD--CIQCA-PLSSYRGNLDRDLGIYKPAEST 156
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTNG-SLLGP 170
+ + P P + C Y + Y +++ +S G+L+ D L G + +
Sbjct: 157 TSRHLPCSHELCQPGSGCTNPKQPCTYNIDYFSENTTSSGLLIEDSLHLNSREGHAPVNA 216
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF 230
+I GCG Q G+LGLG+ S+ S L GL RN C G +F
Sbjct: 217 SVIIGCGRKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDSSGRIF 276
Query: 231 LGHDLVPS-SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNS 289
G V S + P+ L + Y+ + G K Q + DSG+S+T
Sbjct: 277 FGDQGVSSQQSTPFVPLYGKL--QTYAVNVDKSCIGHKCLEGSSFQALVDSGTSFTSLPP 334
Query: 290 QAYKTTLDLMRKDLKGK--PLEDT 311
YK K + P ED+
Sbjct: 335 DVYKAFTTEFDKQINASRVPYEDS 358
>gi|168025647|ref|XP_001765345.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683398|gb|EDQ69808.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 879
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 74/258 (28%), Positives = 117/258 (45%), Gaps = 26/258 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQC----NAPCTGCTLPPESLYHPKN--NLVAC- 115
+ V +K+G PPK + +DTGS TWV C N L P + P++ + + C
Sbjct: 227 FHVEMKLGVPPKKFHFHMDTGSRDTWVYCQVSRNLDEPPIELGPNGKFEPRDESSYIQCI 286
Query: 116 --NDPFCSAFHLPENIRCEANDQ--CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR 171
CS + ++ C + D+ C ++ YAD + GVLV + + + S +
Sbjct: 287 GHTASLCSEYQYEPHL-CNSVDKYHCVNDLNYADDSTYSGVLVNESLMVSTIDNSDMDAM 345
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLG-LTRNVLGHCLSVRGG--GY 228
+F C +P T G++GLG K ++ Q + +++NVLG CL+ G GY
Sbjct: 346 GLFWCINEASHPFTG---TDGIIGLGNCKKTLGDQWTTNKVISQNVLGVCLAKGPGPVGY 402
Query: 229 LFLGHDL---VPSSGIAW---TPMSRDLLEKHYSSGPAELLFGGKS-TGIKGLQIIFDSG 281
+ LG + S W TPMS E YSS A + F K+ + FD+G
Sbjct: 403 ISLGVNFKKKFEESTSVWSKLTPMS-SAGECAYSSPLASISFHDKTFVFTSETNLGFDTG 461
Query: 282 SSYTYFNSQAYKTTLDLM 299
S Y + Y+ LD++
Sbjct: 462 SDMMYLEAVIYEPLLDML 479
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 83/265 (31%), Positives = 115/265 (43%), Gaps = 43/265 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
G Y V++ +G+P K L DTGSDLTW +C+A T P +S + V+C+ P C
Sbjct: 132 GNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAET--FDPTKSTSYAN---VSCSTPLC 186
Query: 121 SAFHLPEN--IRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL-LGPRLIFGCG 177
S+ RC A+ C Y + Y D S+G L + RLT GS + FGCG
Sbjct: 187 SSVISATGNPSRCAAS-TCVYGIQYGDGSYSIGFLGKE----RLTIGSTDIFNNFYFGCG 241
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLV 236
+ K AG+LGLG K S++S Q+ + +CL S G+L G
Sbjct: 242 QDVDGLFGK---AAGLLGLGRDKLSVVS--QTAPKYNQLFSYCLPSSSSTGFLSFGSS-- 294
Query: 237 PSSGIAWTPMSRDLLEKHYSSGPAE---LLFGGKSTGIKGLQI----------IFDSGSS 283
S +TP+ SSGP+ L G + G + L I I DSG+
Sbjct: 295 QSKSAKFTPL---------SSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTV 345
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKPL 308
T AY RK + P+
Sbjct: 346 VTRLPPAAYSALRSAFRKAMASYPM 370
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 124/284 (43%), Gaps = 29/284 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG PP DTGSDL W QC APC C + L+ PK + V+C+
Sbjct: 88 GEYLMNVSIGTPPFPIMAIADTGSDLLWTQC-APCDDCYTQVDPLFDPKTSSTYKDVSCS 146
Query: 117 DPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIF 174
C+A L C ND C Y + Y D+ + G + D L ++ + + +I
Sbjct: 147 SSQCTA--LENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL------SVRGGGY 228
GCG+N N G +G++GLG G S++ QL +CL +
Sbjct: 205 GCGHN--NAGTFNKKGSGIVGLGGGPVSLIKQLG--DSIDGKFSYCLVPLTSKKDQTSKI 260
Query: 229 LFLGHDLVPSSGIAWTPM----SRD----LLEKHYSSGPAELLFGGKSTGIKGLQIIFDS 280
F + +V SG+ TP+ S++ L K S G ++ + G + II DS
Sbjct: 261 NFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDS 320
Query: 281 GSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
G++ T ++ Y D + + + +D + L +C+ T
Sbjct: 321 GTTLTLLPTEFYSELEDAVASSIDAEKKQD--PQSGLSLCYSAT 362
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 81/256 (31%), Positives = 122/256 (47%), Gaps = 37/256 (14%)
Query: 61 GYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV----AC 115
G + +++ IG PP K++ + DTGSDLTWVQC PC C ++ K + C
Sbjct: 83 GEFFMSITIGTPPIKVFAI-ADTGSDLTWVQCK-PCQQCYKENGPIFDKKKSSTYKSEPC 140
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIF 174
+ C A E E+N+ C Y Y D S G + T+ + +GS + P +F
Sbjct: 141 DSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVF 200
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-----VRGGGYL 229
GCGYN N G +G++GLG G S++SQL S +++ +CLS G +
Sbjct: 201 GCGYN--NGGTFDETGSGIIGLGGGHLSLISQLGS-SISKK-FSYCLSHKSATTNGTSVI 256
Query: 230 FLGHDLVPS-----SGIAWTPM-SRDLLEKHY------SSGPAELLFGGKS-----TGI- 271
LG + +PS SG+ TP+ ++ L +Y S G ++ + G S GI
Sbjct: 257 NLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGIL 316
Query: 272 --KGLQIIFDSGSSYT 285
II DSG++ T
Sbjct: 317 SETSGNIIIDSGTTLT 332
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 125/294 (42%), Gaps = 29/294 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + + IG PP DTGSDL W QCN PC C L+ PK + V+C+
Sbjct: 84 GEYLMNISIGTPPVPILAIADTGSDLIWTQCN-PCEDCYQQTSPLFDPKESSTYRKVSCS 142
Query: 117 DPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIF 174
C A E+ C ++ C Y + Y D+ + G + D + + + R +I
Sbjct: 143 SSQCRAL---EDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMII 199
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG--GGY 228
GCG+ N G P +G++GLG G S++SQL+ +CL S G
Sbjct: 200 GCGH--ENTGTFDPAGSGIIGLGGGSTSLVSQLRK--SINGKFSYCLVPFTSETGLTSKI 255
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHY-------SSGPAELLFGGKSTGIKGLQIIFDSG 281
F + +V G+ T M + +Y S G ++ F G I+ DSG
Sbjct: 256 NFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSG 315
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCLLGNFEWH 335
++ T S Y ++ +K + ++D + L +C++ + + + H
Sbjct: 316 TTLTLLPSNFYYELESVVASTIKAERVQD--PDGILSLCYRDSSSFKVPDITVH 367
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 73/153 (47%), Gaps = 16/153 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V L G P + IDT SDL W+QC PC C + +++PK + +V C
Sbjct: 90 GEYLVKLGTGTPQHFFSAAIDTASDLVWMQCQ-PCVSCYRQLDPVFNPKLSSSYAVVPCT 148
Query: 117 DPFCSAFHLPENIRCEANDQ--CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C+ + RC +D C Y Y+ HG + G L D +L G + ++F
Sbjct: 149 SDTCAQL---DGHRCHEDDDGACQYTYKYSGHGVTKGTLAID----KLAIGGDVFHAVVF 201
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
GC + + G +G++GLG G S++SQL
Sbjct: 202 GC--SDSSVGGPAAQASGLVGLGRGPLSLVSQL 232
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 124/284 (43%), Gaps = 29/284 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG PP DTGSDL W QC APC C + L+ PK + V+C+
Sbjct: 88 GEYLMNVSIGTPPFPIMAIADTGSDLLWTQC-APCDDCYTQVDPLFDPKTSSTYKDVSCS 146
Query: 117 DPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIF 174
C+A L C ND C Y + Y D+ + G + D L ++ + + +I
Sbjct: 147 SSQCTA--LENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL------SVRGGGY 228
GCG+N N G +G++GLG G S++ QL +CL +
Sbjct: 205 GCGHN--NAGTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLTSKKDQTSKI 260
Query: 229 LFLGHDLVPSSGIAWTPM----SRD----LLEKHYSSGPAELLFGGKSTGIKGLQIIFDS 280
F + +V SG+ TP+ S++ L K S G ++ + G + II DS
Sbjct: 261 NFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDS 320
Query: 281 GSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
G++ T ++ Y D + + + +D + L +C+ T
Sbjct: 321 GTTLTLLPTEFYSELEDAVASSIDAEKKQD--PQSGLSLCYSAT 362
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 108/264 (40%), Gaps = 28/264 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC--TGCTLPPESLYHPKNNL----VACN 116
Y VTL G P L +DTGSD++WVQC APC T C + L+ P + +AC
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQC-APCNSTECYPQKDPLFDPSKSSTYAPIACG 183
Query: 117 DPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C+ C + QC Y V Y D S+ GV + + FG
Sbjct: 184 ADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFA---PGITVKDFHFG 240
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDL 235
CG++QR P K G+LGLG S++ +Q+ + +CL FL +
Sbjct: 241 CGHDQRGPSDK---FDGLLGLGGAPESLV--VQTASVYGGAFSYCLPALNSEAGFLALGV 295
Query: 236 VPS-----SGIAWTPMSRDLLEK-HYSSGPAELLFGGK-----STGIKGLQIIFDSGSSY 284
PS S +TPM ++ Y + GGK + +G +I DSG+
Sbjct: 296 RPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFRGGMLI-DSGTIV 354
Query: 285 TYFNSQAYKTTLDLMRKDLKGKPL 308
T AY +RK P+
Sbjct: 355 TELPETAYNALNAALRKAFAAYPM 378
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 135/323 (41%), Gaps = 39/323 (12%)
Query: 27 ANQPPSKKKST---QSTAA------HRFG--STAVFPIT-GNVYPLGYYSVTLKIGNPPK 74
A+ PPS++ ++ Q AA H S A P++ G +G Y L +G P
Sbjct: 86 ASDPPSRRPTSLRKQKKAAGGASGGHHLDDDSLASVPLSPGTSVGVGNYVTQLGLGTPST 145
Query: 75 LYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLP--EN 128
Y + +DTGS LTW+QC+ C L+ P+ + V C+ C
Sbjct: 146 SYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCSASQCDELQAATLNP 205
Query: 129 IRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPP 188
C A++ C Y+ Y D S+G L TD ++ GS P +GCG + +
Sbjct: 206 SACSASNVCIYQASYGDSSFSVGYLSTD----TVSFGSTSYPSFYYGCGQDNEGLFGR-- 259
Query: 189 PTAGVLGLGLGKASILSQLQ-SLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPM 246
+AG++GL K S+L QL SLG + +CL + GYL +G ++TPM
Sbjct: 260 -SAGLIGLARNKLSLLYQLAPSLGYS---FSYCLPTAASTGYLSIG-PYNTGHYYSYTPM 314
Query: 247 SRDLLEKH-YSSGPAELLFGGKSTGI-----KGLQIIFDSGSSYTYFNSQAYKTTLDLMR 300
+ L+ Y + + GG + L I DSG+ T + + +
Sbjct: 315 ASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTAVHTALSKAVA 374
Query: 301 KDLKGKPLEDTAEEKALPVCWKG 323
+ + G + L C++G
Sbjct: 375 QAMAGA--QRAPAFSILDTCFEG 395
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 73/160 (45%), Gaps = 17/160 (10%)
Query: 52 PITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN 111
P +Y Y + L++G PP +IDTGSD+ W QC PC C ++ P +
Sbjct: 410 PYADTLYDYSIYLMKLQVGTPPFEIVAEIDTGSDIIWTQC-MPCPNCYSQFAPIFDPSKS 468
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGP 170
S F RC N C YE++YAD S G+L T+ + T+G +
Sbjct: 469 ---------STFR---EQRCNGN-SCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMA 515
Query: 171 RLIFGCGYNQRNPGPK--PPPTAGVLGLGLGKASILSQLQ 208
GCG + N ++G++GL +G S++SQ+
Sbjct: 516 ETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMD 555
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 105/251 (41%), Gaps = 35/251 (13%)
Query: 52 PITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN 111
P ++ Y + L++G PP +IDTGSDL W QC PC C + ++ P +
Sbjct: 71 PYADTLFDYNIYLMKLQVGTPPFEIAAEIDTGSDLIWTQC-MPCPDCYSQFDPIFDPSKS 129
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS-LLGP 170
S F+ RC C YE++Y D+ S G+L T+ + T+G +
Sbjct: 130 ---------STFN---EQRCHGK-SCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMA 176
Query: 171 RLIFGCGYNQRNPGPK--PPPTAGVLGLGLGKASILSQLQ--SLGLTRNVLGHCLSVRGG 226
GCG + + ++G++GL +G S++SQ+ GL + +C S +G
Sbjct: 177 ETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGL----ISYCFSGQGT 232
Query: 227 GYLFLGHD-LVPSSGIAWTPM---------SRDLLEKHYSSGPAELLFGGKSTGIKGLQI 276
+ G + +V G M +L E L G + I
Sbjct: 233 SKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETL--GTPFHAEDGNI 290
Query: 277 IFDSGSSYTYF 287
+ DSGS+ TYF
Sbjct: 291 VIDSGSTVTYF 301
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 44/123 (35%), Positives = 62/123 (50%), Gaps = 10/123 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + IG+P + Y L++DTGSD+TW+QC APC+ C + +Y P N+ V C
Sbjct: 43 GEYFARMGIGSPQRSYYLELDTGSDVTWIQC-APCSSCYSQVDPIYDPSNSSSYRRVYCG 101
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A C+ C Y V+Y D +S G L + F L N S + FGC
Sbjct: 102 SALCQALDYSA---CQGMG-CSYRVVYGDSSASSGDLGIESFYLG-PNSSTAMRNIAFGC 156
Query: 177 GYN 179
G++
Sbjct: 157 GHS 159
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 75/156 (48%), Gaps = 15/156 (9%)
Query: 63 YSVTLKIGNP-PKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACND 117
Y + L IG P P+ L +DTGSDL W QC CT C P ++ + V C+D
Sbjct: 94 YLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA--CTVCFDQPVPVFRASVSHTFSRVPCSD 151
Query: 118 PFCS-AFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRL---TNGSLLGPRL 172
P C A +LP + C A D+ C Y Y DH + G + D F + + + P +
Sbjct: 152 PLCGHAVYLPLS-GCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNI 210
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
FGCG N G P +G+ G G G S+ SQL+
Sbjct: 211 RFGCG--MMNYGLFTPNQSGIAGFGTGPLSLPSQLK 244
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/273 (27%), Positives = 116/273 (42%), Gaps = 33/273 (12%)
Query: 43 HRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP 102
++ + A I+G G Y V + +G+PP+ + ID+GSD+ WVQC PC+ C
Sbjct: 123 YKVANFATDVISGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCK-PCSRCYQQS 181
Query: 103 ESLYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHF 158
+ ++ P ++ V+C C EN C A +C YEV Y D + G L +
Sbjct: 182 DPVFDPADSSSFAGVSCGSDVCDRL---ENTGCNAG-RCRYEVSYGDGSYTKGTLALE-- 235
Query: 159 PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
LT G ++ + GCG+ + G+ G + S + QL G T
Sbjct: 236 --TLTVGQVMIRDVAIGCGHTNQGMFIGAAGLLGLGGGSM---SFIGQLG--GQTGGAFS 288
Query: 219 HCLSVRG---GGYLFLGHDLVPSSGIAWTPMSRD-LLEKHYSSGPAELLFGGKSTGI--K 272
+CL RG G L G +P G W + R+ Y G A + GG + +
Sbjct: 289 YCLVSRGTGSTGALEFGRGALP-VGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEE 347
Query: 273 GLQ--------IIFDSGSSYTYFNSQAYKTTLD 297
Q ++ D+G++ T F + AY D
Sbjct: 348 TFQLTEYGTNGVVMDTGTAVTRFPTAAYVAFRD 380
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 72/153 (47%), Gaps = 13/153 (8%)
Query: 33 KKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
++ S+ ++R I+G G Y V + +G+PP+ + ID+GSD+ WVQC
Sbjct: 171 RRLSSGGGGSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 230
Query: 93 APCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGS 148
PCT C + ++ P ++ V+C+ C EN C A +C YEV Y D
Sbjct: 231 -PCTQCYHQSDPVFDPADSASFTGVSCSSSVCDRL---ENAGCHAG-RCRYEVSYGDGSY 285
Query: 149 SLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQR 181
+ G L + LT G + + GCG+ R
Sbjct: 286 TKGTLALE----TLTFGRTMVRSVAIGCGHRNR 314
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 110/264 (41%), Gaps = 32/264 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y + L +G PP+ +DTGSDL W QC+ CT C P+ L+ P+ + + +P A
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPR--MSSSYEPMRCA 154
Query: 123 FHLPENI---RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYN 179
L +I C D C Y Y D ++LG T+ F ++G L FGCG
Sbjct: 155 GQLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCG-- 212
Query: 180 QRNPGPKPPPTAGVLGLGLGKASILSQLQ----SLGLTRNVLGHCLSVRGGGYLFLGHDL 235
N G +G++G G S++SQL S LT +++ G +G
Sbjct: 213 TMNVG-SLNNASGIVGFGRDPLSLVSQLSIRRFSYCLTPYASSRKSTLQFGSLADVGLYD 271
Query: 236 VPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI---------------IFDS 280
+ + TP +L+ + + F G + G + L+I I DS
Sbjct: 272 DATGPVQTTP----ILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDS 327
Query: 281 GSSYTYFNSQAYKTTLDLMRKDLK 304
G++ T F + + R L+
Sbjct: 328 GTALTLFPAAVLAEVVRAFRSQLR 351
>gi|449533387|ref|XP_004173657.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 254
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/189 (31%), Positives = 84/189 (44%), Gaps = 38/189 (20%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN-----------NLV 113
V+L IG PP+ +L +DTGS L+W+QC+ LPP L PK +L+
Sbjct: 69 VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPP--LPKPKTATFDPSLSSSFSLL 126
Query: 114 ACNDPFCSA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
CN P C F LP + C+ N C Y YAD + G LV + F + SL
Sbjct: 127 PCNHPICKPRIPDFTLPTS--CDQNRLCHYSYFYADGTLAEGNLVREKFTF---SNSLST 181
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--- 226
P +I GC G+LG+ G+ S +SQ + + +C+ R G
Sbjct: 182 PPVILGCAQGSTE-------NRGILGMNHGRLSFISQAK-----ISKFSYCVPSRTGPNP 229
Query: 227 -GYLFLGHD 234
G +LG +
Sbjct: 230 TGLFYLGDN 238
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 83/189 (43%), Gaps = 39/189 (20%)
Query: 48 TAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC--------T 99
A P+ G++ GYY+ L IG PP+ + L +DTGS++T+V C C
Sbjct: 35 NARMPLYGDILSYGYYATKLYIGTPPQEFTLVVDTGSNMTFVPCCGSEEYCGKHEDPAFQ 94
Query: 100 LPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFP 159
S Y P N C+ P C +L QC Y++ Y D S GVL D
Sbjct: 95 TESSSTYQPVN----CH-PSCDCDYL--------RSQCSYKMHYGDGSYSRGVLAED--I 139
Query: 160 LRLTNGSLLGP-RLIFGCG-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLG 211
+ N S P RL+FGC Y+ R G++GLG G+++I+ QL G
Sbjct: 140 ISFGNESEFAPQRLVFGCELDAIGSLYSLR--------ADGIIGLGRGRSTIVDQLVDKG 191
Query: 212 LTRNVLGHC 220
+ + C
Sbjct: 192 VISDSFSLC 200
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 75/153 (49%), Gaps = 16/153 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V L IG P + IDT SDL W+QC PC C + +++P+ + +V C+
Sbjct: 86 GEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQ-PCVSCYRQLDPIFNPRLSSSYAVVPCS 144
Query: 117 DPFCSAFHLPENIRCEANDQ--CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
CS + RC+ +D C Y Y+ + + G L D +L G + ++
Sbjct: 145 SDTCSQL---DGHRCDEDDDQACRYNYKYSGNAVTNGTLAID----KLAVGGNVFHAVVL 197
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
GC + + G PP +G++GL G S+LSQL
Sbjct: 198 GC--SDSSVGGPPPQASGLVGLARGPLSLLSQL 228
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 68/145 (46%), Gaps = 16/145 (11%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA-- 122
V+L IG PP+ ++ +DTGS L+W+QC P + L +++ CN C
Sbjct: 80 VSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLLSSSFSVLPCNHSLCKPRV 139
Query: 123 --FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQ 180
+ LP + C+ N C Y YAD + G LV + F + S P LI GC +
Sbjct: 140 PDYTLPTS--CDQNRLCHYSYFYADGTYAEGNLVREKFTF---SSSQTTPPLILGCATDS 194
Query: 181 RNPGPKPPPTAGVLGLGLGKASILS 205
+ T G+LG+ LG+ S S
Sbjct: 195 SD-------TQGILGMNLGRLSFSS 212
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/262 (27%), Positives = 108/262 (41%), Gaps = 33/262 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDP 118
Y +T+ +G+P + IDTGSD++WVQC PC+ C + L+ P + +C
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 186
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC-- 176
C+ N C ++ QC Y V Y D S+ G +D L GS FGC
Sbjct: 187 ACAQLGQEGN-GCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVKSFQFGCSN 241
Query: 177 ---GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFL 231
G+N + T G++GLG G S++S Q+ G +CL + G+L L
Sbjct: 242 VESGFNDQ--------TDGLMGLGGGAQSLVS--QTAGTLGRAFSYCLPPTPSSSGFLTL 291
Query: 232 -GHDLVPSSGIAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIKG----LQIIFDSGSSYT 285
+SG TPM R + Y + GG+ I + DSG+ T
Sbjct: 292 GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVIT 351
Query: 286 YFNSQAYKTTLDLMRKDLKGKP 307
AY + +K P
Sbjct: 352 RLPPTAYSALSSAFKAGMKQYP 373
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 109/270 (40%), Gaps = 40/270 (14%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP--------------KNNL 112
+ IG P + + +D GSDL+WV C+ C C SLY P +
Sbjct: 106 IDIGTPNVSFLVALDAGSDLSWVPCD--CIQCAPLSASLYKPLDRDLSEYRPSLSTTSRH 163
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLYAD-HGSSLGVLVTDHFPLRLTNGS----- 166
++CN C +N++ D C Y YAD + SS G LV D L +
Sbjct: 164 LSCNHQLCELGSHCKNLK----DPCPYIADYADPNTSSSGFLVEDILHLASVSDDSNSTQ 219
Query: 167 -LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG 225
+ +I GCG Q GV+GLG G S+ S L GL R C V G
Sbjct: 220 KRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSISVPSLLAKAGLIRKSFSLCFDVNG 279
Query: 226 GGYLFL---GHDLVPSSGIAWTPMSRD--LLE-KHYSSGPAELLFGGKSTGIKGLQIIFD 279
G + GH S+ + T + D L+E + Y G + L K +G K L D
Sbjct: 280 SGTILFGDQGHTSQKSTPLLPTQGNYDAYLIEVESYCVGNSCL----KQSGFKAL---VD 332
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLE 309
SG+S+TY Y + K + + +
Sbjct: 333 SGASFTYLPIDVYNKIVLEFDKQVNAQRIS 362
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 72/244 (29%), Positives = 103/244 (42%), Gaps = 26/244 (10%)
Query: 80 IDTGSDLTWVQCN----APCTGCTLPPE---SLYHPK----NNLVACNDPFCSAFHLPEN 128
+DTGSDL WV C+ AP G T E S+Y+PK N V CN+ C+ +
Sbjct: 4 LDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCA-----QR 58
Query: 129 IRCEAN-DQCDYEVLYAD-HGSSLGVLVTD--HFPLRLTNGSLLGPRLIFGCGYNQRNPG 184
+C C Y V Y S+ G+L+ D H N + + FGCG Q
Sbjct: 59 NQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSF 118
Query: 185 PKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWT 244
G+ GLG+ K S+ S L GL + C G G + G SS T
Sbjct: 119 LDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDK--GSSDQEET 176
Query: 245 PMSRDLLEKHYSSGPAELLFGGKSTGIKG-LQIIFDSGSSYTYFNSQAYKTTLDLMRKDL 303
P + + +Y+ + G +T I +FD+G+S+TY Y TT+ +D
Sbjct: 177 PFNLNPSHPNYNITVTRVRVG--TTLIDDEFTALFDTGTSFTYLVDPMY-TTVSESAQDK 233
Query: 304 KGKP 307
+ P
Sbjct: 234 RHSP 237
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 84/186 (45%), Gaps = 21/186 (11%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC-TGCTLPPESLYHPKNN----LVA 114
+G Y L +G P Y + +D+GS LTW+QC APC C LY P+ + V
Sbjct: 105 VGNYITRLGLGTPTTTYVMVVDSGSSLTWLQC-APCAVSCHPQAGPLYDPRASSTYAAVP 163
Query: 115 CNDPFCSAFHLP--ENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRL 172
C+ P C+ C + C Y+ Y D S G L D L ++GS P
Sbjct: 164 CSAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLS-SSGSF--PGF 220
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNVLGHCL---SVRGGGY 228
+GCG + + AG++GL K S+LSQL S+G N +CL + GY
Sbjct: 221 YYGCGQDNVGLFGR---AAGLIGLARNKLSLLSQLAPSVG---NSFAYCLPTSAAASAGY 274
Query: 229 LFLGHD 234
L G +
Sbjct: 275 LSFGSN 280
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 95/212 (44%), Gaps = 25/212 (11%)
Query: 45 FGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES 104
FGS V +G G Y V + +G+PP+ + ID+GSD+ WVQC PC+ C +
Sbjct: 122 FGSDVV---SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQ-PCSECYQQSDP 177
Query: 105 LYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL 160
++ P + ++C+ C +N C + +C YEV Y D + G L +
Sbjct: 178 VFDPAGSATYAGISCDSSVCDRL---DNAGCN-DGRCRYEVSYGDGSYTRGTLALET--- 230
Query: 161 RLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC 220
LT G +L + GCG+ R G+ G + S + QL G T +C
Sbjct: 231 -LTFGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAM---SFVGQLG--GQTGGAFSYC 284
Query: 221 LSVRG---GGYLFLGHDLVPSSGIAWTPMSRD 249
L RG G L G +P G AW P+ R+
Sbjct: 285 LVSRGTESTGTLEFGRGAMP-VGAAWVPLIRN 315
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/259 (29%), Positives = 106/259 (40%), Gaps = 24/259 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y VT G P K L IDTGSD+TW+QC PC+ C + ++ P+ + ++C
Sbjct: 136 GNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCK-PCSDCYSQVDPIFEPQQSSSYKHLSCL 194
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C+ + R C YE+ Y D S G D LT GS P FGC
Sbjct: 195 SSACTELTTMNHCRLGG---CVYEINYGDGSRSQG----DFSQETLTLGSDSFPSFAFGC 247
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL--GLTRNVLGHCLSVRGGGYLFLGHD 234
G+ N G +AG+LGLG S SQ +S G L +S G +G
Sbjct: 248 GHT--NTGLF-KGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTSTGSFSVGQG 304
Query: 235 LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI------KGLQIIFDSGSSYTYFN 288
+P++ +S Y G + GG+ I +G I+ DSG+ T
Sbjct: 305 SIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIV-DSGTVITRLV 363
Query: 289 SQAYKTTLDLMRKDLKGKP 307
QAY R + P
Sbjct: 364 PQAYDALKTSFRSKTRNLP 382
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 99/213 (46%), Gaps = 25/213 (11%)
Query: 41 AAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTL 100
AA FGS V +G G Y V + +G+PP+ + +D+GSD+ WVQC PCT C
Sbjct: 117 AAEAFGSDVV---SGMEQGSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCE-PCTQCYH 172
Query: 101 PPESLYHPKNNL----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD 156
+ +++P ++ V+C CS H+ +N C +C YEV Y D + G L +
Sbjct: 173 QSDPVFNPADSSSFSGVSCASTVCS--HV-DNAACH-EGRCRYEVSYGDGSYTKGTLALE 228
Query: 157 HFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
+T G L + GCG++ + G+ G + S + QL G T
Sbjct: 229 ----TITFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGGGPM---SFVGQLG--GQTGGA 279
Query: 217 LGHCLSVRG---GGYLFLGHDLVPSSGIAWTPM 246
+CL RG G L G + +P G AW P+
Sbjct: 280 FSYCLVSRGIESSGLLEFGREAMP-VGAAWVPL 311
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/148 (33%), Positives = 71/148 (47%), Gaps = 9/148 (6%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y + L +G PP+ +DTGSDL W QC+ CT C P+ L+ P+ + + +P A
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPR--MSSSYEPMRCA 154
Query: 123 FHLPENI---RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYN 179
L +I C D C Y Y D ++LG T+ F ++G L FGCG
Sbjct: 155 GQLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCG-- 212
Query: 180 QRNPGPKPPPTAGVLGLGLGKASILSQL 207
N G +G++G G S++SQL
Sbjct: 213 TMNVG-SLNNASGIVGFGRDPLSLVSQL 239
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 125/287 (43%), Gaps = 30/287 (10%)
Query: 43 HRFGSTAVFP-ITGNVYPLGYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTL 100
HR GS VF +T N G Y + L +G PP +Y L +DTGSDL W QC PC GC
Sbjct: 32 HRLGSNGVFTRVTSNN---GDYLMKLTLGTPPVDVYGL-VDTGSDLVWAQC-TPCQGCYR 86
Query: 101 PPESLYHP-KNNL---VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD 156
++ P ++N + C+ C++ C C Y YAD + GVL +
Sbjct: 87 QKSPMFEPLRSNTYTPIPCDSEECNSLF---GHSCSPQKLCAYSYAYADSSVTKGVLARE 143
Query: 157 HFPLRLTNGS-LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRN 215
T+G ++ ++FGCG++ N G G++GLG G S++SQ +L ++
Sbjct: 144 TVTFSSTDGEPVVVGDIVFGCGHS--NSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKR 201
Query: 216 -----VLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHY-------SSGPAELL 263
V H G F V G+A TP+ + + Y S G +
Sbjct: 202 FSQCLVPFHADPHTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVS 261
Query: 264 FGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
F KG I+ DSG+ TY + Y + ++ P++D
Sbjct: 262 FNSSEMLSKG-NIMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDD 307
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 122/279 (43%), Gaps = 37/279 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G PPK + L +DTGSDL W+QC PC C Y P + + C+
Sbjct: 179 GEYFIDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYECFEQNGPHYDPGQSSSYRNIGCH 237
Query: 117 DPFCSAFHLPENIR-CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGS-----LLG 169
D C P+ + C+A +Q C Y Y D ++ G + F + LT S
Sbjct: 238 DSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRV 297
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---- 225
++FGCG+ R AG+LGLG G S SQLQS L + +CL R
Sbjct: 298 ENVMFGCGHWNRGLFHG---AAGLLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDAN 352
Query: 226 -GGYLFLGH--DLVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGKSTGI--KGLQI- 276
L G DL+ + +T + + ++ Y ++ GG+ I + QI
Sbjct: 353 VSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIA 412
Query: 277 -------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
I DSG++ +YF AY+ + +KG P+
Sbjct: 413 TDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPV 451
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 126/304 (41%), Gaps = 27/304 (8%)
Query: 33 KKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
++ S+ S A S + P G +G Y + +G P K Y + +DTGS LTW+QC+
Sbjct: 93 RRGSSSSPDAESLASVPLGP--GTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCS 150
Query: 93 APCTGCTLPPESLYHPKNNLVACND----PFCSAFHLP--ENIRCEANDQCDYEVLYADH 146
C +++P+++ + P C A C ++ C Y+ Y D
Sbjct: 151 PCLVSCHRQSGPVFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTSNVCIYQASYGDS 210
Query: 147 GSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQ 206
S+G L D ++ GS P +GCG Q N G +AG++GL K S+L Q
Sbjct: 211 SFSVGYLSKD----TVSFGSTSVPNFYYGCG--QDNEGLF-GQSAGLIGLARNKLSLLYQ 263
Query: 207 LQ-SLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH-YSSGPAELLF 264
L S+G + +CL +L ++TPM++ L+ Y +
Sbjct: 264 LAPSMGYS---FSYCLPTSSSSSGYLSIGSYNPGQYSYTPMAKSSLDDSLYFIKMTGITV 320
Query: 265 GGK-----STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPV 319
GK ++ L I DSG+ T + Y + +KG P + L
Sbjct: 321 AGKPLSVSASAYSSLPTIIDSGTVITRLPTDVYSALSKAVAGAMKGTP--RASAFSILDT 378
Query: 320 CWKG 323
C++G
Sbjct: 379 CFQG 382
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 104/261 (39%), Gaps = 38/261 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC-------TLPPESLYHPKNNLVAC 115
Y V L +G PP+ L +DTGSDL W QC APC C L P + + V C
Sbjct: 94 YLVHLSVGTPPRPVALTLDTGSDLVWTQC-APCLNCFDQGAIPVLDPAA--SSTHAAVRC 150
Query: 116 NDPFCSAFHLPENIRCEAN---DQCDYEVLYADHGSSLGVLVTDHFPL----RLTNGSLL 168
+ P C A R ++ C Y Y D ++G L +D F G +
Sbjct: 151 DAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVS 210
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV---RG 225
RL FGCG+ N G G+ G G G+ S+ SQ LG+T +C +
Sbjct: 211 ERRLTFGCGHF--NKGIFQANETGIAGFGRGRWSLPSQ---LGVTS--FSYCFTSMFEST 263
Query: 226 GGYLFLG---HDLVPSSGIAWTPMSRD--------LLEKHYSSGPAELLFGGKSTGIKGL 274
+ LG +L + + TP+ RD L K + G + + ++
Sbjct: 264 SSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREA 323
Query: 275 QIIFDSGSSYTYFNSQAYKTT 295
I DSG+S T Y+
Sbjct: 324 SAIIDSGASITTLPEDVYEAV 344
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 63/133 (47%), Gaps = 13/133 (9%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
I+G G Y + +G PP+ + +DTGSD+ W+QC APC C + ++ P+ +
Sbjct: 116 ISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQC-APCKRCYAQSDPVFDPRKSR 174
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
+AC P C H ++ C Q C Y+V Y D + G T+ R T +
Sbjct: 175 SFASIACRSPLC---HRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVA- 230
Query: 168 LGPRLIFGCGYNQ 180
R+ GCG++
Sbjct: 231 ---RVALGCGHDN 240
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 109/283 (38%), Gaps = 53/283 (18%)
Query: 56 NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP----ESLYHPKNN 111
N P Y V L IG PP+ +L +DTGSDL W QC PC C P ++ N
Sbjct: 28 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCK-PCVSCFDQPLPYFDTSRSSTNA 86
Query: 112 LVACNDPFCSAFHLPENIRCEANDQ----CDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
L+ C C P C +Q C Y Y D+ ++G+L D F G+
Sbjct: 87 LLPCESTQCKLD--PTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKF--TFVAGTS 142
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG 227
L P + FGCG N N G G+ G G G S+ SQL+ + HC + G
Sbjct: 143 L-PGVTFGCGLN--NTGVFNSNETGIAGFGRGPLSLPSQLKVGNFS-----HCFTTITGA 194
Query: 228 Y-----LFLGHDLVPS--SGIAWTPMSRDLLEKHYSSGPAE-----LLFGGKSTGIKGLQ 275
L L DL + + TP+ + Y+ A L G + G L
Sbjct: 195 IPSTVLLDLPADLFSNGQGAVQTTPLIQ------YAKNEANPTLYYLSLKGITVGSTRLP 248
Query: 276 I--------------IFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
+ I DSG+S T Q Y+ D +K
Sbjct: 249 VPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK 291
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/188 (30%), Positives = 82/188 (43%), Gaps = 21/188 (11%)
Query: 32 SKKKSTQSTAAHRFGS--TAVFPITGNVYPLGY--YSVTLKIGNP-PKLYELDIDTGSDL 86
S+ ++ + R G+ P+ + +GY Y + IG P P+ L++DTGSD+
Sbjct: 57 SRARAAKQLCPSRSGTPVRVTAPVASGSHVVGYTEYLIHFGIGTPRPQQVALEVDTGSDV 116
Query: 87 TWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEAN--DQCDYE 140
W QC PC C P + + V C DP C A +R A C Y+
Sbjct: 117 VWTQCR-PCFDCFTQPLPRFDTSASDTVHGVLCTDPICRA------LRPHACFLGGCTYQ 169
Query: 141 VLYADHGSSLGVLVTDHFPLR-LTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLG 199
V Y D+ ++G L D F G + P L+FGCG Q N G G+ G G G
Sbjct: 170 VNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCG--QYNTGNFHSNETGIAGFGRG 227
Query: 200 KASILSQL 207
S+ QL
Sbjct: 228 PLSLPRQL 235
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/259 (30%), Positives = 116/259 (44%), Gaps = 35/259 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV----ACN 116
G + +++ IG PP DTGSDLTWVQC PC C ++ K + C+
Sbjct: 83 GEFFMSITIGTPPMKVFAIADTGSDLTWVQCK-PCQQCYKENGPIFDKKKSSTYKSEPCD 141
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFG 175
C A E E+ + C Y Y D S G + T+ + +GS + P +FG
Sbjct: 142 SRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVFG 201
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-----VRGGGYLF 230
CGYN N G +G++GLG G S++SQL S +++ +CLS G +
Sbjct: 202 CGYN--NGGTFDETGSGIIGLGGGHLSLISQLGS-SISKK-FSYCLSHKSATTNGTSVIN 257
Query: 231 LGHDLVPS-----SGIAWTPMSRDLLEKHY-------SSGPAELLFGGKST-----GI-- 271
LG + +PS SG+ TP+ +Y S G ++ + G S GI
Sbjct: 258 LGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGIFS 317
Query: 272 -KGLQIIFDSGSSYTYFNS 289
II DSG++ T +S
Sbjct: 318 ETSGNIIIDSGTTLTLLDS 336
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 86/275 (31%), Positives = 115/275 (41%), Gaps = 30/275 (10%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV 113
+G G Y VT G P K L IDTGSDLTW+QC PC C ++++ PK +
Sbjct: 128 SGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCK-PCADCYSQVDAIFEPKQSSS 186
Query: 114 ACNDPFCSAFHLPENIRCEANDQ------CDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
P C + E I E+N C YE+ Y D SS G D LT GS
Sbjct: 187 YKTLP-CLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQG----DFSQETLTLGSD 241
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL--GLTRNVLGHCLSVRG 225
FGCG+ ++G+LGLG S SQ +S G L S
Sbjct: 242 SFQNFAFGCGHTNTGLFKG---SSGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTS 298
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLL-EKHYSSGPAELLFGGKSTGI------KGLQIIF 278
G +G +P+S + +TP+ + + Y G + GG I +G I+
Sbjct: 299 TGSFSVGKGSIPASAV-FTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIV- 356
Query: 279 DSGSSYTYFNSQAY---KTTLDLMRKDL-KGKPLE 309
DSG+ T QAY KT+ +DL KP
Sbjct: 357 DSGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFS 391
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 74/251 (29%), Positives = 110/251 (43%), Gaps = 40/251 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+ + L +DTGS +T+V C+ C C + + P+ + P
Sbjct: 91 GYYTTRLWIGTPPQRFALIVDTGSTVTYVPCST-CKHCGSHQDPKFRPEAS--ETYQPVK 147
Query: 121 SAFHLPENIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIFGC 176
+ +C +D QC YE YA+ +S GVL D + N S L P R IFGC
Sbjct: 148 CTW------QCNCDDDRKQCTYERRYAEMSTSSGVLGED--VVSFGNQSELSPQRAIFGC 199
Query: 177 G-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGG 227
YNQR G++GLG G SI+ QL + + C GGG
Sbjct: 200 ENDETGDIYNQR--------ADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGG 251
Query: 228 YLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSG 281
+ LG + P + + +T S + +Y+ E+ GK + + DSG
Sbjct: 252 AMVLG-GISPPADMVFT-HSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSG 309
Query: 282 SSYTYFNSQAY 292
++Y Y A+
Sbjct: 310 TTYAYLPESAF 320
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/262 (27%), Positives = 108/262 (41%), Gaps = 33/262 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDP 118
Y +T+ +G+P + IDTGSD++WVQC PC+ C + L+ P + +C
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 186
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC-- 176
C+ N C ++ QC Y V Y D S+ G +D L GS FGC
Sbjct: 187 DCAQLGQEGN-GCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGCSN 241
Query: 177 ---GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFL 231
G+N + T G++GLG G S++S Q+ G +CL + G+L L
Sbjct: 242 VESGFNDQ--------TDGLMGLGGGAQSLVS--QTAGTLGRAFSYCLPPTPSSSGFLTL 291
Query: 232 -GHDLVPSSGIAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIKG----LQIIFDSGSSYT 285
+SG TPM R + Y + GG+ I + DSG+ T
Sbjct: 292 GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVIT 351
Query: 286 YFNSQAYKTTLDLMRKDLKGKP 307
AY + +K P
Sbjct: 352 RLPPTAYSALSSAFKAGMKQYP 373
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 89/201 (44%), Gaps = 18/201 (8%)
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL-- 112
G+ G Y VT+ +G P L DTGSDLTW QC C E +++P +
Sbjct: 124 GSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSY 183
Query: 113 --VACNDPFCSAFHLPENI--RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
V+C+ C + C A++ C Y + Y D S+G L + F LTN +
Sbjct: 184 YNVSCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGFLAKEKFT--LTNSDVF 240
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGG 226
+ FGCG N + AG+LGLG K S SQ + + +CL S
Sbjct: 241 -DGVYFGCGENNQG---LFTGVAGLLGLGRDKLSFPSQTAT--AYNKIFSYCLPSSASYT 294
Query: 227 GYLFLGHDLVPSSGIAWTPMS 247
G+L G + S + +TP+S
Sbjct: 295 GHLTFGSAGISRS-VKFTPIS 314
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 113/284 (39%), Gaps = 33/284 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKNNL----VACN 116
Y VTL IG P + IDTGSDL+WVQC PC C + L+ P ++ V C+
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 229
Query: 117 DPFCSAF------HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
C H + A C+Y + Y + ++ GV T+ L+ ++
Sbjct: 230 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKP---GVVVA 286
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF 230
FGCG +Q P K G+LGLG S++SQ S +CL GG F
Sbjct: 287 DFGFGCGDHQHGPYEK---FDGLLGLGGAPESLVSQTSS--QFGGPFSYCLPPTSGGAGF 341
Query: 231 LGHDLVP-------SSGIAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIK----GLQIIF 278
L P +SG+++TPM R + Y + GG I ++
Sbjct: 342 LTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMVI 401
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
DSG+ T + AY R + L + L C+
Sbjct: 402 DSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYD 445
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 71/299 (23%), Positives = 114/299 (38%), Gaps = 27/299 (9%)
Query: 38 QSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG 97
Q ++ S + +G++ G Y V + +G P + L DTGSDLTW QC
Sbjct: 120 QDSSVSELDSVTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS 179
Query: 98 CTLPPESLYHPKNNL----VACNDPFCSAFHLPENIR--CEANDQ-CDYEVLYADHGSSL 150
C ++++ P + + C C+ C A+ + C Y + Y D S+
Sbjct: 180 CYKQQDAIFDPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSV 239
Query: 151 GVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL 210
G + + T+ + +FGCG Q N G +AG++GLG S + Q+
Sbjct: 240 GYFSRERLSVTATD---IVDNFLFGCG--QNNQGLF-GGSAGLIGLGRHPISFVQ--QTA 291
Query: 211 GLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHY--------SSGPAEL 262
+ R + +CL L +S + +TP S + S G A+L
Sbjct: 292 AVYRKIFSYCLPATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKL 351
Query: 263 LFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
+ G I DSG+ T AY R+ + P E L C+
Sbjct: 352 PVSSSTFSTGGA--IIDSGTVITRLPPTAYTALRSAFRQGMSKYP--SAGELSILDTCY 406
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/196 (30%), Positives = 87/196 (44%), Gaps = 22/196 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V + +G+PP+ + ID+GSD+ WVQC PC C + ++ P + V+C
Sbjct: 129 GEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ-PCKLCYKQSDPVFDPAKSGSYTGVSCG 187
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C EN C + C YEV+Y D + G L + LT + + GC
Sbjct: 188 SSVCDRI---ENSGCHSGG-CRYEVMYGDGSYTKGTLALE----TLTFAKTVVRNVAMGC 239
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---GGYLFLGH 233
G+ R G+ G + S + QL G T G+CL RG G L G
Sbjct: 240 GHRNRGMFIGAAGLLGIGGGSM---SFVGQLS--GQTGGAFGYCLVSRGTDSTGSLVFGR 294
Query: 234 DLVPSSGIAWTPMSRD 249
+ +P G +W P+ R+
Sbjct: 295 EALP-VGASWVPLVRN 309
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/196 (30%), Positives = 87/196 (44%), Gaps = 22/196 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V + +G+PP+ + ID+GSD+ WVQC PC C + ++ P + V+C
Sbjct: 130 GEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ-PCKLCYKQSDPVFDPAKSGSYTGVSCG 188
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C EN C + C YEV+Y D + G L + LT + + GC
Sbjct: 189 SSVCDRI---ENSGCHSG-GCRYEVMYGDGSYTKGTLALE----TLTFAKTVVRNVAMGC 240
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG---GGYLFLGH 233
G+ R G+ G + S + QL G T G+CL RG G L G
Sbjct: 241 GHRNRGMFIGAAGLLGIGGGSM---SFVGQLS--GQTGGAFGYCLVSRGTDSTGSLVFGR 295
Query: 234 DLVPSSGIAWTPMSRD 249
+ +P G +W P+ R+
Sbjct: 296 EALP-VGASWVPLVRN 310
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 87/186 (46%), Gaps = 42/186 (22%)
Query: 52 PITGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP--CTGCTLPPES--- 104
P T +YP Y Y+ T +G PP+ + +DTGS LTWV C + C C+ P S
Sbjct: 86 PATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVP 145
Query: 105 LYHPKNN----LVACNDPFCSAFHLPENI--RCE--------------ANDQC-DYEVLY 143
++HPKN+ LV C +P C H N+ +C A++ C Y V+Y
Sbjct: 146 VFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVY 205
Query: 144 ADHGSSLGVLVTDHF--PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKA 201
GS+ G+L+ D P R G +LG L+ +Q P +G+ G G G
Sbjct: 206 GS-GSTAGLLIADTLRAPGRAVPGFVLGCSLV---SVHQ--------PPSGLAGFGRGAP 253
Query: 202 SILSQL 207
S+ +QL
Sbjct: 254 SVPAQL 259
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 87/186 (46%), Gaps = 42/186 (22%)
Query: 52 PITGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP--CTGCTLPPES--- 104
P T +YP Y Y+ T +G PP+ + +DTGS LTWV C + C C+ P S
Sbjct: 54 PATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVP 113
Query: 105 LYHPKNN----LVACNDPFCSAFHLPENI--RCE--------------ANDQC-DYEVLY 143
++HPKN+ LV C +P C H N+ +C A++ C Y V+Y
Sbjct: 114 VFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVY 173
Query: 144 ADHGSSLGVLVTDHF--PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKA 201
GS+ G+L+ D P R G +LG L+ +Q P +G+ G G G
Sbjct: 174 GS-GSTAGLLIADTLRAPGRAVPGFVLGCSLV---SVHQ--------PPSGLAGFGRGAP 221
Query: 202 SILSQL 207
S+ +QL
Sbjct: 222 SVPAQL 227
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 133/320 (41%), Gaps = 71/320 (22%)
Query: 25 SEANQPPSKKK--------STQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLY 76
S NQ PS+ ST AH + P+ + Y G YS++L G PP+
Sbjct: 33 SYTNQNPSQDHLQKLNYLVSTSLARAHHLKNPQTTPVFSHSY--GGYSISLSFGTPPQTL 90
Query: 77 ELDIDTGSDLTWVQCNAP--CTGCTLPPE-SLYHPKNN----LVACNDPFCSAFHLPENI 129
+DTGS W C C C+ S + PK++ ++ C +P CS H ++
Sbjct: 91 SFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSSKIIGCKNPKCSWIH-QTDL 149
Query: 130 RCEANDQ----CD-----YEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG-YN 179
RC D C Y +LY G++ GV +++ L L+ P + GC ++
Sbjct: 150 RCTDCDNNSRNCSQICPPYLILYGS-GTTGGVALSETLHLH----GLIVPNFLVGCSVFS 204
Query: 180 QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLV--- 236
R P AG+ G G G +S+ SQL GLT+ +CL LV
Sbjct: 205 SRQP-------AGIAGFGRGPSSLPSQL---GLTK--FSYCLLSHKFDDTQESSSLVLDS 252
Query: 237 ------PSSGIAWTPMSRD-------LLEKHYSSGPAELLFGGKSTGI--KGLQ------ 275
++ + +TP+ ++ +Y + GG+S I K L
Sbjct: 253 QSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGN 312
Query: 276 --IIFDSGSSYTYFNSQAYK 293
I DSG+++TY +++A++
Sbjct: 313 GGTIIDSGTTFTYMSTEAFE 332
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 88/201 (43%), Gaps = 18/201 (8%)
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL-- 112
G+ G Y VT+ +G P L DTGSDLTW QC C E +++P +
Sbjct: 125 GSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSY 184
Query: 113 --VACNDPFCSAFHLPENI--RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
V+C+ C + C A++ C Y + Y D S+G L D F L S +
Sbjct: 185 YNVSCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGFLAKDKFTL---TSSDV 240
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGG 226
+ FGCG N + AG+LGLG K S SQ + + +CL S
Sbjct: 241 FDGVYFGCGENNQG---LFTGVAGLLGLGRDKLSFPSQTAT--AYNKIFSYCLPSSASYT 295
Query: 227 GYLFLGHDLVPSSGIAWTPMS 247
G+L G + S + +TP+S
Sbjct: 296 GHLTFGSAGISRS-VKFTPIS 315
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 115/284 (40%), Gaps = 33/284 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKNNL----VACN 116
Y VTL IG P + IDTGSDL+WVQC PC C + L+ P ++ V C+
Sbjct: 91 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 149
Query: 117 DPFCSAF------HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
C H + A C+Y + Y + ++ GV T+ L+ ++
Sbjct: 150 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKP---GVVVA 206
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGY 228
FGCG +Q P K G+LGLG S++SQ S +CL + G G+
Sbjct: 207 DFGFGCGDHQHGPYEK---FDGLLGLGGAPESLVSQTSS--QFGGPFSYCLPPTSGGAGF 261
Query: 229 LFLG-----HDLVPSSGIAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIK----GLQIIF 278
L LG +SG+++TPM R + Y + GG I ++
Sbjct: 262 LTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMVI 321
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
DSG+ T + AY R + L + L C+
Sbjct: 322 DSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYD 365
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/262 (27%), Positives = 108/262 (41%), Gaps = 33/262 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDP 118
Y +T+ +G+P + IDTGSD++WVQC PC+ C + L+ P + +C
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 256
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC-- 176
C+ N C ++ QC Y V Y D S+ G +D L GS FGC
Sbjct: 257 DCAQLGQEGN-GCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGCSN 311
Query: 177 ---GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFL 231
G+N + T G++GLG G S++S Q+ G +CL + G+L L
Sbjct: 312 VESGFNDQ--------TDGLMGLGGGAQSLVS--QTAGTLGRAFSYCLPPTPSSSGFLTL 361
Query: 232 -GHDLVPSSGIAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIKG----LQIIFDSGSSYT 285
+SG TPM R + Y + GG+ I + DSG+ T
Sbjct: 362 GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVIT 421
Query: 286 YFNSQAYKTTLDLMRKDLKGKP 307
AY + +K P
Sbjct: 422 RLPPTAYSALSSAFKAGMKQYP 443
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 73/262 (27%), Positives = 108/262 (41%), Gaps = 33/262 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDP 118
Y +T+ +G+P + IDTGSD++WVQC PC+ C + L+ P + +C
Sbjct: 52 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 110
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC-- 176
C+ N C ++ QC Y V Y D S+ G +D L GS FGC
Sbjct: 111 DCAQLGQEGN-GCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGCSN 165
Query: 177 ---GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFL 231
G+N + T G++GLG G S++S Q+ G +CL + G+L L
Sbjct: 166 VESGFNDQ--------TDGLMGLGGGAQSLVS--QTAGTLGRAFSYCLPPTPSSSGFLTL 215
Query: 232 -GHDLVPSSGIAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIKG----LQIIFDSGSSYT 285
+SG TPM R + Y + GG+ I + DSG+ T
Sbjct: 216 GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVIT 275
Query: 286 YFNSQAYKTTLDLMRKDLKGKP 307
AY + +K P
Sbjct: 276 RLPPTAYSALSSAFKAGMKQYP 297
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 130/314 (41%), Gaps = 32/314 (10%)
Query: 27 ANQPPSKKKSTQSTA-AHRFGSTAVFPIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGS 84
A P ++ S + A A GS A P++ G +G Y + +G P Y + +DTGS
Sbjct: 84 AKTPSARATSLDADADAGLAGSLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGS 143
Query: 85 DLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPENI----RCEANDQ 136
LTW+QC+ C +++PK++ V C+ CS LP C +++
Sbjct: 144 SLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCS--DLPSATLNPSACSSSNV 201
Query: 137 CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGL 196
C Y+ Y D S+G L D ++ GS P +GCG + + +AG++GL
Sbjct: 202 CIYQASYGDSSFSVGYLSKD----TVSFGSTSLPNFYYGCGQDNEGLFGR---SAGLIGL 254
Query: 197 GLGKASILSQLQ-SLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPM-SRDLLEKH 254
K S+L QL SLG + +CL P ++TPM S L +
Sbjct: 255 ARNKLSLLYQLAPSLGYS---FTYCLPSSSSSGYLSLGSYNPGQ-YSYTPMVSSSLDDSL 310
Query: 255 YSSGPAELLFGGK-----STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLE 309
Y + + G S+ L I DSG+ T + Y + +KG
Sbjct: 311 YFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVYSALSKAVAAAMKGT--S 368
Query: 310 DTAEEKALPVCWKG 323
+ L C+KG
Sbjct: 369 RASAYSILDTCFKG 382
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 80/180 (44%), Gaps = 12/180 (6%)
Query: 77 ELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKNNLVA----CNDPFCSAFHLPENIR 130
+ +DT SD+ WVQC APC C + LY P ++++ C+ P C + N
Sbjct: 175 SMVVDTASDVPWVQC-APCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGC 233
Query: 131 CEANDQ--CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPP 188
A + C Y VLY D + G V+D L + + FGC + PG
Sbjct: 234 TGAGNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVS-KFQFGCSHALLRPGSFNN 292
Query: 189 PTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--GYLFLGHDLVPSSGIAWTPM 246
TAG + LG G S+ SQ + NV +CL G G+L LG +S A TPM
Sbjct: 293 KTAGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPM 352
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 99/224 (44%), Gaps = 20/224 (8%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
SKK +T + + ST + G+ G Y VT+ +G P L DTGSDLTW QC
Sbjct: 75 SKKLATDHVSESK--STDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQC 132
Query: 92 NAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENI--RCEANDQCDYEVLYAD 145
C E +++P + V+C+ C + C A++ C Y + Y D
Sbjct: 133 QPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASN-CIYGIQYGD 191
Query: 146 HGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILS 205
S+G L + F LTN + + FGCG N + AG+LGLG K S S
Sbjct: 192 QSFSVGFLAKEKF--TLTNSDVF-DGVYFGCGENNQGLFTG---VAGLLGLGRDKLSFPS 245
Query: 206 QLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMS 247
Q + + +CL S G+L G + S + +TP+S
Sbjct: 246 QTAT--AYNKIFSYCLPSSASYTGHLTFGSAGISRS-VKFTPIS 286
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 87/344 (25%), Positives = 142/344 (41%), Gaps = 81/344 (23%)
Query: 36 STQSTAAHRFG---STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
S+ T AH+ S +VF + + G YS L G P + L DTGS L W C
Sbjct: 51 SSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCT 110
Query: 93 AP--CTGCTLP---PESL--YHPK----NNLVACNDPFCSAFHLPE--------NIRCEA 133
+ C+ C+ P P + + PK + LV C +P CS P+ N + E
Sbjct: 111 SRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTEN 170
Query: 134 NDQ-CDYEVLYADHGSSLGVLVTD--HFPLRLTNGSLLGPRLIFGCGY-NQRNPGPKPPP 189
Q C V+ GS+ G+L+++ FP + P + GC + + P
Sbjct: 171 CTQTCPAYVVQYGSGSTAGLLLSETLDFPDKXI------PNFVVGCSFLSIHQP------ 218
Query: 190 TAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG------GGYLFLGHDLVPSSGIAW 243
+G+ G G G S+ SQ +GL + +CL+ R G L L V SSG+ +
Sbjct: 219 -SGIAGFGRGSESLPSQ---MGLKK--FAYCLASRKFDDSPHSGQLILDSTGVKSSGLTY 272
Query: 244 TP------MSRDLLEKHYSSGPAELLFGGKSTGI-----------KGLQIIFDSGSSYTY 286
TP +S + +++Y +++ G ++ + G II DSGS++T+
Sbjct: 273 TPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSII-DSGSTFTF 331
Query: 287 FNSQAYKTTLDLMRKDLKG-------------KPLEDTAEEKAL 317
+ + K L +P D ++EK++
Sbjct: 332 MDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEKSV 375
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 75/156 (48%), Gaps = 13/156 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V L +G PPK Y + +DTGS L+W+QC C + LY P + ++C
Sbjct: 123 GNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCA 182
Query: 117 DPFCSAFHLP--ENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
CS + CE + + C Y Y D S+G L D L LT+ L P+
Sbjct: 183 SVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDL--LTLTSSQTL-PQFT 239
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
+GCG + + + AG++GL K S+L+QL +
Sbjct: 240 YGCGQDNQGLFGR---AAGIIGLARDKLSMLAQLST 272
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 109/281 (38%), Gaps = 63/281 (22%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y V L +G PP+ +DTGSDL W QC APC C P+ ++ P + + +P A
Sbjct: 104 YLVDLAVGTPPQPVSALLDTGSDLIWTQC-APCASCLPQPDPIFSPGAS--SSYEPMRCA 160
Query: 123 FHLPENI---RCEANDQCDYEVLYADHGSSLGVLVTDHFPL----RLTNGSLLGPRLIFG 175
L +I C+ D C Y Y D ++ GV T+ F + L L FG
Sbjct: 161 GELCNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFG 220
Query: 176 CGYNQR---NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----------- 221
CG + N G +G++G G S++SQ L + R +CL
Sbjct: 221 CGTMNKGSLNNG------SGIVGFGRAPLSLVSQ---LAIRR--FSYCLTPYASGRKSTL 269
Query: 222 ---SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI-- 276
S+RGG Y T + LL + + F G + G + L+I
Sbjct: 270 LFGSLRGGVY----------DAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPI 319
Query: 277 -------------IFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
I DSG++ T F + + R L+
Sbjct: 320 SAFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFRSQLR 360
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 65/242 (26%), Positives = 105/242 (43%), Gaps = 23/242 (9%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPESL----YHPKNNLVA---- 114
+ IG P + + +DTGSDL W+ C AP + + P + Y P + A
Sbjct: 115 IDIGTPNVQFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVL 174
Query: 115 CNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGSSL-GVLVTDH-FPLRLTNGSLLGPR 171
C+DP C + C A DQC YE+ Y +S G L D+ + +R + G+ +
Sbjct: 175 CSDPLCEM-----SSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYFMRESGGNPVKLP 229
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFL 231
+ GCG Q K G++GLG S+ ++L S G + C+S G G L
Sbjct: 230 VYLGCGKVQTGSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLTF 289
Query: 232 GHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQ 290
G + P S +L+ + + + G + + +FD+G+S+TY +
Sbjct: 290 GDEGPAAQRTTPIIPKSVSMLDTYIVE--IDSITVGNTNLLMASHALFDTGTSFTYLSKT 347
Query: 291 AY 292
Y
Sbjct: 348 VY 349
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 65/133 (48%), Gaps = 13/133 (9%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN-- 110
I+G G Y + IG PP L +DTGSD+ WVQC APC C + ++ P +
Sbjct: 139 ISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQC-APCADCYQQADPIFEPASSA 197
Query: 111 --NLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
+ ++CN C + + E C ND C YEV Y D ++G VT+ +T GS
Sbjct: 198 SFSTLSCNTRQCRSLDVSE---CR-NDTCLYEVSYGDGSYTVGDFVTE----TITLGSAP 249
Query: 169 GPRLIFGCGYNQR 181
+ GCG+N
Sbjct: 250 VDNVAIGCGHNNE 262
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/294 (28%), Positives = 114/294 (38%), Gaps = 32/294 (10%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
K T ST T + +G Y VT+++G K L +DTGSDLTWVQC
Sbjct: 109 KAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGG--KNMSLIVDTGSDLTWVQCQ- 165
Query: 94 PCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPE-------NIRCEANDQCDYEVL 142
PC C LY P + V CN C C+Y V
Sbjct: 166 PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVS 225
Query: 143 YADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKAS 202
Y D + G L ++ L G L+FGCG RN +G++GLG S
Sbjct: 226 YGDGSYTRGDLASESIVL----GDTKLENLVFGCG---RNNKGLFGGASGLMGLGRSSVS 278
Query: 203 ILSQLQSLGLTRNVLGHCL-SVRGG--GYLFLGHDLVP---SSGIAWTPMSRD-LLEKHY 255
++S Q+L V +CL S+ G G L G+D S+ + +TP+ ++ L Y
Sbjct: 279 LVS--QTLKTFNGVFSYCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFY 336
Query: 256 SSGPAELLFGGKS--TGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
GG T G I+ DSG+ T YK K G P
Sbjct: 337 ILNLTGASIGGVELKTLSFGRGILIDSGTVITRLPPSIYKAVKTEFLKQFSGFP 390
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 76/292 (26%), Positives = 112/292 (38%), Gaps = 40/292 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVAC------- 115
+ V IG PP +DTGS LTW+QC PC C LY+P ++
Sbjct: 110 FLVNFSIGQPPVPQYAVMDTGSSLTWIQCE-PCINCHQQKGPLYNPSSSSTYVSCSDFDR 168
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRL-TNGSLLGPRLIF 174
D +A H C+Y YAD ++ G + +G + +IF
Sbjct: 169 TDTTFTATH---------GSDCNYSQTYADKTTTRGTYAREQLLFETPDDGITIMHDVIF 219
Query: 175 GCGYNQRN-PGPKPPPTAGVLGLGLGKASILSQLQS-------------LGLTRNVLGHC 220
GCG+N PGP +GV GLG +SI+S+L G R LG+
Sbjct: 220 GCGHNNTQLPGPT-GYASGVFGLGDSGSSIISKLGFGFSYCIGNIGDPLYGFHRLTLGNK 278
Query: 221 LSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDS 280
L + G LVP T + + ++ P ++F +I+ DS
Sbjct: 279 LKIEG-----YSTPLVPRGLYYITLVGISIGQERLDIDP--IVFQRVDLNGISSRIVIDS 331
Query: 281 GSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCLLGNF 332
G++ +Y QAY D + L G + L +C+ G L F
Sbjct: 332 GATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLCYIGKLNQDLQGF 383
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 71/154 (46%), Gaps = 18/154 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y V L IG PP+ +DTGSDL W QC APC C P+ L+ P + A +P A
Sbjct: 96 YVVDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLSQPDPLFAPGQS--ASYEPMRCA 152
Query: 123 FHLPENI---RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI---FGC 176
L +I CE D C Y Y D ++GV T+ F + G L + FGC
Sbjct: 153 GTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGC 212
Query: 177 G---YNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
G N G +G++G G S++SQL
Sbjct: 213 GSVNVGSLNNG------SGIVGFGRNPLSLVSQL 240
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 105/259 (40%), Gaps = 26/259 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC-TGCTL----PPESLYHPKNNLVACND 117
Y +T+ IG P + IDTGSD++WV C+A G +L S Y P +C+
Sbjct: 125 YVITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGAGSSLFFDPGKSSTYTP----FSCSS 180
Query: 118 PFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
C+ +N C N C Y V Y D ++ G +D L N + FGC
Sbjct: 181 AACTRLEGRDN-GCSLNSTCQYTVRYGDGSNTTGTYGSDTLAL---NSTEKVENFQFGCS 236
Query: 178 YNQRNPGP--KPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGH 233
+PG T G++GLG G S++S Q+ + +CL + R G+L LG
Sbjct: 237 -ETSDPGEGLDEDQTDGLMGLGGGAPSLVS--QTAATYGSAFSYCLPATTRSSGFLTLGA 293
Query: 234 DLVPSSGIAWTPMSRDLLE-KHYSSGPAELLFGGKSTGIK----GLQIIFDSGSSYTYFN 288
+SG TPM R Y + GG I I DSG+ T
Sbjct: 294 S-TGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAAGSIMDSGTIITRLP 352
Query: 289 SQAYKTTLDLMRKDLKGKP 307
+AY R ++ P
Sbjct: 353 PRAYSALSAAFRAGMRRYP 371
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 78/178 (43%), Gaps = 18/178 (10%)
Query: 80 IDTGSDLTWVQC-NAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEAN 134
+DT SD+ WVQC P + C + LY P + AC+ P C P C ++
Sbjct: 186 LDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLG-PYANGCSSS 244
Query: 135 D----QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPT 190
QC Y V Y D ++ G LV D L T+ P+ FGC + R + T
Sbjct: 245 SNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQV---PKFEFGCSHAARGSFSR-SKT 300
Query: 191 AGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPM 246
AG++ LG G S++SQ + V +C + G+ LG SS A TPM
Sbjct: 301 AGIMALGRGVQSLVSQTST--KYGQVFSYCFPPTASHKGFFVLGVPRRSSSRYAVTPM 356
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 107/262 (40%), Gaps = 16/262 (6%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPES---LYHPKNNLVAC 115
Y + +G P + + +DTGSDL WV C+ AP +G + +Y P + +
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSR 155
Query: 116 NDPFCSAFHLPENIRCEANDQ-CDYEVLY-ADHGSSLGVLVTDHFPLRLTNGSL-LGPRL 172
+ P CS C Q C Y + Y +++ +S G+L+ D L + + +
Sbjct: 156 HLP-CSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 214
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLG 232
I GCG Q G+LGLG+ S+ S L GL +N C G +F G
Sbjct: 215 IIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFG 274
Query: 233 HDLVPS-SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQA 291
VPS + P+ L + Y+ + G K + + DSG+S+T
Sbjct: 275 DQGVPSQQSTPFVPLYGKL--QTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFDV 332
Query: 292 YKTTLDLMRKDLKGK--PLEDT 311
YK K + P EDT
Sbjct: 333 YKAFTMEFDKQMNATRVPYEDT 354
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 106/267 (39%), Gaps = 35/267 (13%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
I+G + G Y ++ +G PP L IDTGSD+ W+QC PC C LY P+ +
Sbjct: 89 ISGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCK-PCVHCYRQLSPLYDPRGSS 147
Query: 113 VACNDPFCSAFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR 171
P CS C+ C Y ++Y D S+ G L TD L +N + +G
Sbjct: 148 TYAQTP-CSPPQCRNPQTCDGTTGGCGYRIVYGDASSTSGNLATDR--LVFSNDTSVG-N 203
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLSVR-----G 225
+ GCG++ AG+LG+ G S +Q+ S G +CL R
Sbjct: 204 VTLGCGHDNEGLFGS---AAGLLGVARGNNSFATQVADSYG---RYFAYCLGDRTRSGSS 257
Query: 226 GGYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGK--------------STG 270
YL G +TP+ S Y GG+ +TG
Sbjct: 258 SSYLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATG 317
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLD 297
G ++ DSG+S T F AY D
Sbjct: 318 RGG--VVVDSGTSITRFARDAYGALRD 342
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 69/156 (44%), Gaps = 15/156 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC-------TLPPESLYHPKNNLVAC 115
Y + + +G PP+ L +DTGSDL W QC APC C L P + + + C
Sbjct: 90 YLMHVSVGTPPRPVALTLDTGSDLVWTQC-APCLDCFEQGAAPVLDPAA--SSTHAALPC 146
Query: 116 NDPFCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN--GSLLGPRL 172
+ P C A R + C Y Y D ++G L TD F + G L R+
Sbjct: 147 DAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRV 206
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
FGCG+ N G G+ G G G+ S+ SQL
Sbjct: 207 TFGCGHI--NKGIFQANETGIAGFGRGRWSLPSQLN 240
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 68/160 (42%), Gaps = 23/160 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y V IG PP +DTGSDL W QC+APC C P LY P ++ V+C
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSCGSR 159
Query: 119 FCSAFHLPE-----------NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
C A LP + C Y Y D S+ GVL T+ F
Sbjct: 160 LCDA--LPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTF---GAGT 214
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
L FGCG + ++G++G+G G S++SQL
Sbjct: 215 TVHDLAFGCGTDNLGGTDN---SSGLVGMGRGPLSLVSQL 251
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 66/232 (28%), Positives = 100/232 (43%), Gaps = 30/232 (12%)
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPENIRCEAND 135
+DT S+LTWVQC APC C L+ P ++ ++ CN C A + A
Sbjct: 141 VDTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 199
Query: 136 -----QCDYEVLYADHGSSLGVLVTDHFPL--RLTNGSLLGPRLIFGCGYNQRNPGPKPP 188
C Y + Y D S GVL D L + +G +FGCG + + P
Sbjct: 200 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDG------FVFGCGTSNQGPFGG-- 251
Query: 189 PTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFLGHDLVP---SSGIA 242
T+G++GLG + S++S Q++ V +CL ++ G L LG D S+ I
Sbjct: 252 -TSGLMGLGRSQLSLIS--QTMDQFGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIV 308
Query: 243 WTPMSRDLLEK-HYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYK 293
+T M D ++ Y + GG+ ++I DSG+ T Y
Sbjct: 309 YTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITSLVPSVYN 360
>gi|91806508|gb|ABE65981.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 203
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 57/107 (53%), Gaps = 6/107 (5%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y T++IG PP+ ++ IDTGSDL WV CN+ C GC L + + P + +AC+D
Sbjct: 78 YYTTVQIGTPPRELDVVIDTGSDLVWVSCNS-CVGCPLHNVTFFDPGASSSAVKLACSDK 136
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG 165
CS+ L + RC + C Y+V Y D + G ++D +G
Sbjct: 137 RCSS-DLQKKSRCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMSG 182
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 87/344 (25%), Positives = 142/344 (41%), Gaps = 81/344 (23%)
Query: 36 STQSTAAHRFG---STAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
S+ T AH+ S +VF + + G YS L G P + L DTGS L W C
Sbjct: 51 SSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCT 110
Query: 93 AP--CTGCTLP---PESL--YHPK----NNLVACNDPFCSAFHLPE--------NIRCEA 133
+ C+ C+ P P + + PK + LV C +P CS P+ N + E
Sbjct: 111 SRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTEN 170
Query: 134 NDQ-CDYEVLYADHGSSLGVLVTD--HFPLRLTNGSLLGPRLIFGCGY-NQRNPGPKPPP 189
Q C V+ GS+ G+L+++ FP + P + GC + + P
Sbjct: 171 CTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKI------PNFVVGCSFLSIHQP------ 218
Query: 190 TAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG------GGYLFLGHDLVPSSGIAW 243
+G+ G G G S+ SQ +GL + +CL+ R G L L V SSG+ +
Sbjct: 219 -SGIAGFGRGSESLPSQ---MGLKK--FAYCLASRKFDDSPHSGQLILDSTGVKSSGLTY 272
Query: 244 TP------MSRDLLEKHYSSGPAELLFGGKSTGI-----------KGLQIIFDSGSSYTY 286
TP +S + +++Y +++ G ++ + G II DSGS++T+
Sbjct: 273 TPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSII-DSGSTFTF 331
Query: 287 FNSQAYKTTLDLMRKDLKG-------------KPLEDTAEEKAL 317
+ + K L +P D ++EK++
Sbjct: 332 MDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEKSV 375
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 113/264 (42%), Gaps = 40/264 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + + GNPP+ +DTGSDL WVQC PC C + + P + + C
Sbjct: 88 GEYLIDISYGNPPQKSTAIVDTGSDLNWVQC-LPCKSCYETLSAKFDPSKSASYKTLGCG 146
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
FC LP C A+ C Y+ +Y D S+ G L TD +T G+ P + FGC
Sbjct: 147 SNFCQ--DLPFQ-SCAAS--CQYDYMYGDGSSTSGALSTDD----VTIGTGKIPNVAFGC 197
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY---LFLGH 233
G + G+ L S++SQL G +CL G L++G
Sbjct: 198 GNSNLGTFAGAGGLVGLGKGPL---SLVSQLG--GTATKKFSYCLVPLGSTKTSPLYIG- 251
Query: 234 DLVPSSGIAWTPMSRD--------------LLEKHYSSGPAELLFGGKSTGIKGLQIIFD 279
D + G+A+TPM + +E + PA F +TG GL I D
Sbjct: 252 DSTLAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPAN-TFDIAATGRGGL--ILD 308
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDL 303
SG++ TY + A+ + ++ L
Sbjct: 309 SGTTLTYLDVDAFNPMVAALKAAL 332
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 69/249 (27%), Positives = 103/249 (41%), Gaps = 27/249 (10%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES--------- 104
T + L Y +VT IG P + + + +DTGSDL W+ CN T C E+
Sbjct: 82 TEEISFLHYANVT--IGTPAQWFLVALDTGSDLFWLPCNCNST-CVRSMETDQGERIKLN 138
Query: 105 LYHP----KNNLVACNDPFCSAFHLPENIRCEA-NDQCDYEVLYADHGS-SLGVLVTDHF 158
+Y+P ++ V CN C+ + RC + C Y + Y GS S GVLV D
Sbjct: 139 IYNPSKSKSSSKVTCNSTLCALRN-----RCISPVSDCPYRIRYLSPGSKSTGVLVEDVI 193
Query: 159 PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
+ G R+ FGC +Q K G++GL + ++ + L G+ +
Sbjct: 194 HMSTEEGEARDARITFGCSESQLGLF-KEVAVNGIMGLAIADIAVPNMLVKAGVASDSFS 252
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIF 278
C G G + G SS TP+S + Y + GK T F
Sbjct: 253 MCFGPNGKGTISFGDK--GSSDQLETPLSGTISPMFYDVSITKFKV-GKVTVDTEFTATF 309
Query: 279 DSGSSYTYF 287
DSG++ T+
Sbjct: 310 DSGTAVTWL 318
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 66/232 (28%), Positives = 100/232 (43%), Gaps = 30/232 (12%)
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPENIRCEAND 135
+DT S+LTWVQC APC C L+ P ++ ++ CN C A + A
Sbjct: 142 VDTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 200
Query: 136 -----QCDYEVLYADHGSSLGVLVTDHFPL--RLTNGSLLGPRLIFGCGYNQRNPGPKPP 188
C Y + Y D S GVL D L + +G +FGCG + + P
Sbjct: 201 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDG------FVFGCGTSNQGPFGG-- 252
Query: 189 PTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFLGHDLVP---SSGIA 242
T+G++GLG + S++S Q++ V +CL ++ G L LG D S+ I
Sbjct: 253 -TSGLMGLGRSQLSLIS--QTMDQFGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIV 309
Query: 243 WTPMSRDLLEK-HYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYK 293
+T M D ++ Y + GG+ ++I DSG+ T Y
Sbjct: 310 YTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITSLVPSVYN 361
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 68/228 (29%), Positives = 94/228 (41%), Gaps = 21/228 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y V + +G PP + + DTGSD TWVQC C + L+ P + V+C DP
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADP 222
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
C+ + C A C Y + Y D ++G D L + ++ G + FGCG
Sbjct: 223 ACADL---DASGCNAG-HCLYGIQYGDGSYTVGFFAKD--TLAVAQDAIKGFK--FGCGE 274
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFL--GHD 234
R + TAG+LGLG G SI +Q+ +CL S GYL
Sbjct: 275 KNRGLFGQ---TAGLLGLGRGPTSI--TVQAYEKYGGSFSYCLPASSAATGYLEFGPLSP 329
Query: 235 LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGS 282
S TPM D Y G + GGK G + +SG+
Sbjct: 330 SSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNSGT 377
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 124/282 (43%), Gaps = 43/282 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G PPK + L +DTGSDL W+QC PC C Y PK++ ++C+
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISCH 251
Query: 117 DPFCSAFHLPENIR-CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLT--NG-SLLG-- 169
DP C P+ C+A +Q C Y Y D ++ G + F + LT NG S L
Sbjct: 252 DPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHV 311
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY- 228
++FGCG+ R AG+LGLG G S SQ+QS L +CL R
Sbjct: 312 ENVMFGCGHWNRGLFHG---AAGLLGLGKGPLSFASQMQS--LYGQSFSYCLVDRNSNAS 366
Query: 229 ----LFLGHD--LVPSSGIAWTP----------------MSRDLLEKHYSSGPAELLFGG 266
L G D L+ + +T ++ +++ P E +
Sbjct: 367 VSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEE-TWHL 425
Query: 267 KSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
S G G I DSG++ TYF AY+ + + +KG L
Sbjct: 426 SSEGAGG--TIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYEL 465
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 67/160 (41%), Gaps = 17/160 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y V L +G PP+ L +DTGSDL W QC APC C L P + + C P
Sbjct: 92 YLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFHQGLPLLDPAASSTYAALPCGAP 150
Query: 119 FCSAFHLP------ENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS----LL 168
C A + N C Y Y D ++G + TD F NG L
Sbjct: 151 RCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSRLP 210
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
RL FGCG+ N G G+ G G G+ S+ SQL
Sbjct: 211 TRRLTFGCGHF--NKGVFQSNETGIAGFGRGRWSLPSQLN 248
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 119/300 (39%), Gaps = 43/300 (14%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
++ K Q++ GS+ + V +G PP +DTGS L W+QC
Sbjct: 65 ARFKYLQNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVPQLTIMDTGSSLLWIQC 124
Query: 92 NAPCTGCTLPPESLYHPKNN--------LVACNDPFCSAFHLPENIRCEANDQCDYEVLY 143
PC C+ + + HP N +C+D FC N C ++++C YE +Y
Sbjct: 125 Q-PCKHCS--SDHMIHPVFNPALSSTFVECSCDDRFC---RYAPNGHCGSSNKCVYEQVY 178
Query: 144 ADHGSSLGVLVTDHFPLRLTNG-SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKAS 202
S GVL + NG +++ + FGCGY N G+LGLG S
Sbjct: 179 ISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGY--ENGEQLESHFTGILGLGAKPTS 236
Query: 203 ILSQLQSLGLTRNVLGHC---LSVRGGGY--LFLGHDLVPSSGIAW--TPMSRDLLEKHY 255
+ QL S +C L+ + GY L LG D + I TP+ + Y
Sbjct: 237 LAVQLGS------KFSYCIGDLANKNYGYNQLVLGED----ADILGDPTPIEFETENSIY 286
Query: 256 SSGPAELLFGGKSTGIKGLQ---------IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
+ G I+ + +I DSG+ YT+ AY+ + ++ L K
Sbjct: 287 YMNLEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWLADIAYRELYNEIKSILDPK 346
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 79/287 (27%), Positives = 126/287 (43%), Gaps = 42/287 (14%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF 119
+G Y++ + +G P + + DTGSDL W QC APCT C P + P ++ P
Sbjct: 83 VGGYNMNISVGTPLLTFSVVADTGSDLIWTQC-APCTKCFQQPAPPFQPASSSTFSKLPC 141
Query: 120 CSAF--HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
S+F LP +IR C Y Y G + G L T+ L++ + S P + FGC
Sbjct: 142 TSSFCQFLPNSIRTCNATGCVYNYKYGS-GYTAGYLATET--LKVGDASF--PSVAFGCS 196
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLFLGHD 234
+ G T+G+ GLG G S++ Q LG+ R +CL S G + G
Sbjct: 197 -TENGVGNS---TSGIAGLGRGALSLIPQ---LGVGR--FSYCLRSGSAAGASPILFGSL 247
Query: 235 LVPSSG-IAWTPMSRD--LLEKHY-------SSGPAEL-----LFGGKSTGIKGLQIIFD 279
+ G + TP + + +Y + G +L FG G+ G I+ D
Sbjct: 248 ANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIV-D 306
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEE--KALPVCWKGT 324
SG++ TY Y +++++ + + T + L +C+K T
Sbjct: 307 SGTTLTYLAKDGY----EMVKQAFLSQTADVTTVNGTRGLDLCFKST 349
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 73/160 (45%), Gaps = 19/160 (11%)
Query: 63 YSVTLKIGNP-PKLYELDIDTGSDLTWVQCNAPCTGCTLPP----ESLYHPKNNLVACND 117
Y + L IG P P+ L +DTGSDL W QC C C P ++L V C+D
Sbjct: 100 YLIHLSIGTPRPQRVALTLDTGSDLVWTQC--ACHVCFAQPFPTFDALASQTTLAVPCSD 157
Query: 118 PFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGS--------LL 168
P C++ P + C ND C Y YAD + G +V D F R G+ +
Sbjct: 158 PICTSGKYPLS-GCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVA 216
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
P + FGCG Q N G +G+ G G S+ SQL+
Sbjct: 217 VPNVRFGCG--QYNKGIFKSNESGIAGFSRGPMSLPSQLK 254
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 72/267 (26%), Positives = 109/267 (40%), Gaps = 29/267 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y V + +G PP+ +++ +DTGSDL W+QC APC C ++ P + V C
Sbjct: 148 GEYLVEVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFDQRGPVFDPMASTSYRNVTCG 206
Query: 117 DPFC---SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT-NGSLLGPRL 172
D C S P R +D C Y Y D ++ G L + F + LT + S +
Sbjct: 207 DTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGV 266
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLG 232
+ GCG+ R G+ L AS L + + ++ H +V G + G
Sbjct: 267 VLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCLVDHGSAV--GSKIVFG 324
Query: 233 HD--LVPSSGIAWTPMSRDLLEK-HYSSGPAELLFGGKSTGIKGLQ-----------IIF 278
D L+ + +T + E Y +L GG+ I I
Sbjct: 325 DDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTII 384
Query: 279 DSGSSYTYFNSQAYKTT----LDLMRK 301
DSG++ +YF AYK +D M K
Sbjct: 385 DSGTTLSYFPEPAYKAIRQAFVDRMDK 411
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 87/180 (48%), Gaps = 23/180 (12%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQC-NAPCTGCTLPP--ESLYHPKNNLVACNDPFC 120
+V+L +G+PP+ + +DTGS+L+W+ C +P P S Y P + C+ P C
Sbjct: 1001 TVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSP----IPCSSPIC 1056
Query: 121 --SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC-- 176
LP + C+ C V YAD S G L +D+F + GS P +FGC
Sbjct: 1057 RTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI----GSSALPGTLFGCMD 1112
Query: 177 -GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDL 235
G++ + + T G++G+ G S ++Q LGL + +C+S R + L DL
Sbjct: 1113 SGFSSNS--EEDAKTTGLMGMNRGSLSFVTQ---LGLPK--FSYCISGRDSSGVLLFGDL 1165
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 96/246 (39%), Gaps = 26/246 (10%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTL----PPESLYHP----KNNLVA 114
+ +G P + + +DTGSDL WV C+ AP + P Y P + V
Sbjct: 111 VAVGTPNATFLVALDTGSDLFWVPCDCKQCAPIANASDLRGGPDLRPYSPGKSSTSKAVT 170
Query: 115 CNDPFCSAFHLPENIRCEAND--QCDYEVLYAD-HGSSLGVLVTD--HFPLRLTNG--SL 167
C C P N C Y V Y + SS GVLV D H G +
Sbjct: 171 CEHALC---ERPNACAAAGNSSTSCPYTVRYVSANTSSSGVLVEDVLHLSREAAGGASTA 227
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLT-RNVLGHCLSVRGG 226
+ ++ GCG Q G+LGLG+ K S+ S L + GL + C S G
Sbjct: 228 VTAPVVLGCGQVQTGAFLDGAAVDGLLGLGMDKVSVPSVLHAAGLVASDSFSMCFSPDGF 287
Query: 227 GYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTY 286
G + G G A TP + Y+ + GK + I DSG+S+TY
Sbjct: 288 GRINFGDS--GRRGQAETPFTVRNTHPTYNISVTAMSVSGKEVAAE-FAAIVDSGTSFTY 344
Query: 287 FNSQAY 292
N AY
Sbjct: 345 LNDPAY 350
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 115/277 (41%), Gaps = 52/277 (18%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN- 111
++G + G Y + +G+PP + IDTGSDL W+QC PC C LY P+N+
Sbjct: 82 MSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQC-LPCRRCYRQVTPLYDPRNSK 140
Query: 112 ---LVACNDPFCS-AFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTDHFPL----RL 162
+ C P C P C+A C Y V+Y D +S G L TD L R+
Sbjct: 141 THRRIPCASPQCRGVLRYPG---CDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTRV 197
Query: 163 TNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
N +L GCG++ AG+LG G G+ S +QL +V +CL
Sbjct: 198 HNVTL-------GCGHDNEG---LLASAAGLLGAGRGQLSFPTQLAP--AYGHVFSYCLG 245
Query: 223 VR------GGGYLFLGHD-LVPSSGIAWTPMSRDLLEKH-YSSGPAELLFGGK------- 267
R YL G +PS+ A+TP+ + Y GG+
Sbjct: 246 DRMSRARNSSSYLVFGRTPELPST--AFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSN 303
Query: 268 -------STGIKGLQIIFDSGSSYTYFNSQAYKTTLD 297
+TG G ++ DSG++ + F AY D
Sbjct: 304 ASLALNPATGRGG--VVVDSGTAISRFTRDAYAAVRD 338
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 69/161 (42%), Gaps = 17/161 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y V L +G PP+ L +DTGSDL W QC APC C L P + + C P
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFDQGIPLLDPAASSTYAALPCGAP 144
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL-----RLTNGSLLGP-RL 172
C A C Y Y D ++G + TD F R +GSL RL
Sbjct: 145 RCRALPFTSC----GGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRL 200
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLT 213
FGCG+ N G G+ G G G+ S+ SQL + +
Sbjct: 201 TFGCGHF--NKGVFQSNETGIAGFGRGRWSLPSQLNATSFS 239
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 66/133 (49%), Gaps = 13/133 (9%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
I+G G Y + IGNP + + +DTGSD+ W+QC PC C E ++ P ++
Sbjct: 141 ISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQC-TPCADCYHQTEPIFEPSSSS 199
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
++C+ P C+A + E C N C YEV Y D ++G T+ LT GS L
Sbjct: 200 SYEPLSCDTPQCNALEVSE---CR-NATCLYEVSYGDGSYTVGDFATE----TLTIGSTL 251
Query: 169 GPRLIFGCGYNQR 181
+ GCG++
Sbjct: 252 VQNVAVGCGHSNE 264
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 57/168 (33%), Positives = 70/168 (41%), Gaps = 14/168 (8%)
Query: 52 PITGNVYPL-GYYSVTLKIGNP-PKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK 109
P+T P G Y + IG P P+ L +DTGSDL W QC PC C P L+ P
Sbjct: 75 PVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCT-PCPVCFDQPFPLFDPS 133
Query: 110 NN----LVACNDPFCSAFHLPENIRCEAND-QCDYEVLYADHGSSLGVLVTDHFPLRLTN 164
+ VAC DP C C +C Y Y D + G + D F N
Sbjct: 134 VSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPN 193
Query: 165 GSLLGPR----LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
G P L FGCG N G +G+ G G G S+ SQL+
Sbjct: 194 GEGAPPVAVSGLAFGCG--DYNTGVFASNESGIAGFGRGPLSLPSQLR 239
>gi|240255485|ref|NP_189841.4| aspartyl protease family protein [Arabidopsis thaliana]
gi|332644216|gb|AEE77737.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 430
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 55/98 (56%), Gaps = 6/98 (6%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y T++IG PP+ ++ IDTGSDL WV CN+ C GC L + + P + +AC+D
Sbjct: 78 YYTTVQIGTPPRELDVVIDTGSDLVWVSCNS-CVGCPLHNVTFFDPGASSSAVKLACSDK 136
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD 156
CS+ L + RC + C Y+V Y D + G ++D
Sbjct: 137 RCSS-DLQKKSRCSLLESCTYKVEYGDGSVTSGYYISD 173
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 120/301 (39%), Gaps = 56/301 (18%)
Query: 56 NVYPLGY---YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP-KNN 111
N+ P Y + V +G P +DTGS++ WV+C APC CT L P K++
Sbjct: 89 NLLPSTYEPLFLVNFSMGQPATPQLAIMDTGSNILWVRC-APCKRCTQQNGPLLDPSKSS 147
Query: 112 LVA---CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN-GSL 167
A C + C H + C +QC Y + YA SS GVL T+ ++ G
Sbjct: 148 TYASLPCTNTMC---HYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVN 204
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS-------------LGLTR 214
P ++FGC + N K GV GLG G S ++++ S G +
Sbjct: 205 AVPSVVFGCSH--ENGDYKDRRFTGVFGLGKGITSFVTRMGSKFSYCLGNIADPHYGYNQ 262
Query: 215 NVLGHCLSVRGGGY---LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI 271
V G + G + GH V GI S G L + +
Sbjct: 263 LVFGEKANFEGYSTPLKVVNGHYYVTLEGI--------------SVGEKRLDIDSTAFSM 308
Query: 272 KGLQ--IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCLL 329
KG + + DSG++ T+ A++ + +R+ L G L W+G++ C
Sbjct: 309 KGNEKSALIDSGTALTWLAESAFRALDNEVRQLLDG----------VLMPFWRGSFACYK 358
Query: 330 G 330
G
Sbjct: 359 G 359
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 77/293 (26%), Positives = 120/293 (40%), Gaps = 24/293 (8%)
Query: 33 KKKSTQSTAAH--RFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQ 90
+K+ ++TAA GSTA + G V G + ++ + +EL +DTGS T++
Sbjct: 4 RKRPFKNTAARGRALGSTAR-EVYGEVLETGVLVASFELAGA-QTFELIVDTGSSRTYLP 61
Query: 91 CNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSL 150
C C C Y+ + + CSA +C + C Y+V Y + S
Sbjct: 62 CKG-CASCGAHEAGRYYDYDASADFSRVECSACAGIGG-KCGTSGVCRYDVHYLEGSGSE 119
Query: 151 GVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGP-KPPPTAGVLGLGLGKASILSQLQS 209
G LV D L GS+ ++FGC +R G K G+ G G ++ +QL S
Sbjct: 120 GYLVRDVVSL---GGSVGNATVVFGC--EERELGSIKQQSADGLFGFGRQAYALRAQLAS 174
Query: 210 LGLTRNVLGHCLS-------VRGGGYLFLGH-DL-VPSSGIAWTPMSRDLLEKHYSSGPA 260
+ ++ C+ GG L LG+ D + + +TPM + +Y
Sbjct: 175 ASVIDDLFSMCVEGYEKLSGEHVGGLLTLGNFDFGADAPALVYTPMVSSAM--YYQVTTT 232
Query: 261 ELLFGGKST-GIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTA 312
G G +G+ I DSG+SYTY + L L + LE A
Sbjct: 233 SWTLGNSVVEGSRGVLTIIDSGTSYTYVPGNMHARFLQLAEDAARESGLEKVA 285
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 119/303 (39%), Gaps = 50/303 (16%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
++ K Q++ GS+ + V +G PP +DTGS L W+QC
Sbjct: 37 ARFKYLQNSIVKELGSSDFQVDVHQAIKTSLFFVNFSVGQPPVPQFTIMDTGSSLLWIQC 96
Query: 92 NAPCTGCTLPPESLYHPKNN--------LVACNDPFCSAFHLPENIRCEANDQCDYEVLY 143
+ PC C+ + HP N +C+D FC N C +N +C YE +Y
Sbjct: 97 H-PCKHCS--SNHMIHPVFNPALSSTFVECSCDDRFC---RYAPNGHCSSN-KCVYEQVY 149
Query: 144 ADHGSSLGVLVTDHFPLRLTNG-SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKAS 202
S GVL + NG +++ + FGCG+ N G+LGLG S
Sbjct: 150 ISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGH--ENGEQLESEFTGILGLGAKPTS 207
Query: 203 ILSQLQSLGLTRNVLGHC---LSVRGGGY--LFLGHDLVPSSGIAWTPMSRDLLEKHYSS 257
+ QL S +C L+ + GY L LG D + I P +E +
Sbjct: 208 LAVQLGS------KFSYCIGDLANKNYGYNQLVLGED----ADILGDPTP---IEFETEN 254
Query: 258 GPAELLFGGKSTGIKGLQI--------------IFDSGSSYTYFNSQAYKTTLDLMRKDL 303
G + G S G K L I I D+G+ YT+ AY+ + ++ L
Sbjct: 255 GIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVILDTGTLYTWLADIAYRELYNEIKSIL 314
Query: 304 KGK 306
K
Sbjct: 315 DPK 317
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 106/266 (39%), Gaps = 24/266 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-----------LYHPKNN 111
Y + +G P + + +DTGSDL WV C+ C C P S +Y P +
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCA--PLSGYRGNLDRDLRIYRPAES 151
Query: 112 LVACNDPFCSAFHLPENIRCEANDQ-CDYEVLY-ADHGSSLGVLVTDHFPLRLTNGSL-L 168
+ + P CS C Q C Y + Y +++ +S G+L+ D L + +
Sbjct: 152 TTSRHLP-CSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPV 210
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY 228
+I GCG Q G+LGLG+ S+ S L GL +N C G
Sbjct: 211 NASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGR 270
Query: 229 LFLGHDLVPS-SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
+F G VPS + P+ L + Y+ + G K + + DSG+S+T
Sbjct: 271 IFFGDQGVPSQQSTPFVPLYGKL--QTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSL 328
Query: 288 NSQAYKTTLDLMRKDLKGK--PLEDT 311
YK K + P EDT
Sbjct: 329 PFDVYKAFTMEFDKQMNATRVPYEDT 354
>gi|7413629|emb|CAB85978.1| putative protein [Arabidopsis thaliana]
Length = 356
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 55/98 (56%), Gaps = 6/98 (6%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y T++IG PP+ ++ IDTGSDL WV CN+ C GC L + + P + +AC+D
Sbjct: 78 YYTTVQIGTPPRELDVVIDTGSDLVWVSCNS-CVGCPLHNVTFFDPGASSSAVKLACSDK 136
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTD 156
CS+ L + RC + C Y+V Y D + G ++D
Sbjct: 137 RCSS-DLQKKSRCSLLESCTYKVEYGDGSVTSGYYISD 173
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 105/270 (38%), Gaps = 38/270 (14%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN- 111
I+G G Y L +G PP+ + +DTGSD+ W+QC PC C + L++P +
Sbjct: 143 ISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQC-LPCAKCYGQTDPLFNPAASS 201
Query: 112 ---LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
V C P C + C C+Y+V Y D ++G T+ R G ++
Sbjct: 202 TYRKVPCATPLCKKLDISG---CRNKRYCEYQVSYGDGSFTVGDFSTETLTFR---GQVI 255
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG--- 225
R+ GCG++ G+ L S Q+ +CL R
Sbjct: 256 -RRVALGCGHDNEGLFIGAAGLLGLGRGSLSFPS-----QTGAQFSKRFSYCLVDRSASG 309
Query: 226 -GGYLFLGHDLVPSSGIAWTPMSRDLLEKHY--------------SSGPAELLFGGKSTG 270
L G +P S I +S L+ Y +S PA +F +TG
Sbjct: 310 TASSLIFGKAAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPAS-VFRMDATG 368
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMR 300
G +I DSG+S T AY T D R
Sbjct: 369 NGG--VIIDSGTSVTRLVDSAYSTMRDAFR 396
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 72/279 (25%), Positives = 119/279 (42%), Gaps = 36/279 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IGNP DTGSDL WVQC PC C ++ P+ + V C
Sbjct: 91 GEYLMRISIGNPQVEILAIADTGSDLIWVQCQ-PCEMCYKQNSPIFDPRRSSSYRNVLCG 149
Query: 117 DPFCSAFHLPENIRCEAN---DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP--- 170
+ FC+ E C+A C Y Y D S G L + F + TN +
Sbjct: 150 NEFCNKLD-GEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAY 208
Query: 171 --RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-----------------QSLG 211
+ FGCG +N G +G++GLG G S++SQL QS
Sbjct: 209 FQEVAFGCG--TKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNY 266
Query: 212 LTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI 271
++ G+ +++ G Y + L+P + ++ + + P L+ G+ +
Sbjct: 267 TSKINFGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAISVENKRLPYTNLWNGE---V 323
Query: 272 KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
+ II DSG++ T+ +S+ + + + +KG+ + D
Sbjct: 324 EKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSD 362
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 123/291 (42%), Gaps = 37/291 (12%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
I+G G Y + IG P + +DTGSD+ W+QC APC C + ++ P ++
Sbjct: 134 ISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQC-APCADCYHQADPIFEPASST 192
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
++C+ C + + E C N+ C YEV Y D ++G VT+ L GS
Sbjct: 193 SYSPLSCDTKQCQSLDVSE---CR-NNTCLYEVSYGDGSYTVGDFVTETITL----GSAS 244
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---G 225
+ GCG+N AG+LGLG GK S SQ+ + + +CL R
Sbjct: 245 VDNVAIGCGHNNEGLFIG---AAGLLGLGGGKLSFPSQINA-----SSFSYCLVDRDSDS 296
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ---------- 275
L L+P + A +R+ L+ Y G L GG+ I
Sbjct: 297 ASTLEFNSALLPHAITAPLLRNRE-LDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGG 355
Query: 276 IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWK 326
II DSG++ T + AY D K K P+ T+E C+ + K
Sbjct: 356 IIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPV--TSEVALFDTCYDLSRK 404
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 113/260 (43%), Gaps = 28/260 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKN----NLVACN 116
Y V + G P + IDTGSD++W+QC PC+ C + LY P + + V C
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 171
Query: 117 DPFCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C C + QC + + YAD S++G D L L G+++ FG
Sbjct: 172 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDK--LTLAPGAIV-QNFYFG 228
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGG-GYLFLGH 233
CG+ + GVLGLG + S+ ++ V +CL SV G+L LG
Sbjct: 229 CGHGKH---AVRGLFDGVLGLGRLRESLGARYG------GVFSYCLPSVSSKPGFLALGA 279
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGP-AELLFGGKSTGIKGLQ----IIFDSGSSYTYFN 288
P SG +TPM + +S+ A + GGK ++ +I DSG+ T
Sbjct: 280 GKNP-SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGLQ 338
Query: 289 SQAYKTTLDLMRKDLKGKPL 308
S AY+ RK ++ L
Sbjct: 339 STAYRALRSAFRKAMEAYRL 358
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 107/262 (40%), Gaps = 16/262 (6%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLPPES---LYHPKNNLVAC 115
Y + +G P + + +DTGSDL WV C+ AP +G + +Y P + +
Sbjct: 66 YYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSR 125
Query: 116 NDPFCSAFHLPENIRCEANDQ-CDYEVLY-ADHGSSLGVLVTDHFPLRLTNGSL-LGPRL 172
+ P CS C Q C Y + Y +++ +S G+L+ D L + + +
Sbjct: 126 HLP-CSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 184
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLG 232
I GCG Q G+LGLG+ S+ S L GL +N C G +F G
Sbjct: 185 IIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFG 244
Query: 233 HDLVPS-SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQA 291
VPS + P+ L + Y+ + G K + + DSG+S+T
Sbjct: 245 DQGVPSQQSTPFVPLYGKL--QTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPLDV 302
Query: 292 YKTTLDLMRKDLKGK--PLEDT 311
YK K + P EDT
Sbjct: 303 YKAFTMEFDKQMNATRVPYEDT 324
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 75/260 (28%), Positives = 114/260 (43%), Gaps = 28/260 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
G Y ++ IG PP +DTGSDL W+QC PC C +P+ + DP
Sbjct: 86 GEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCE-PCKQC--------YPQ--ITPIFDPSL 134
Query: 121 SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG-SLLGPRLIFGCGYN 179
S+ + +NI C +D C + G L + L T G S+ P+ + GCGY
Sbjct: 135 SSSY--QNIPC-LSDTC--HSMRTTSCDVRGYLSVETLTLDSTTGYSVSFPKTMIGCGY- 188
Query: 180 QRNPGPKPPPTAGVLGLGLGKASILSQLQSL--GLTRNVLGHCLSVRGGGYLFLGHDLVP 237
RN G P++G++GLG G S+ SQL + G LG L F +V
Sbjct: 189 -RNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPNSTSKLNFGDAAIVY 247
Query: 238 SSGIAWTPMSRD-------LLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQ 290
G TP+ + L + +S G + FGG + G I+ DSG+++T+
Sbjct: 248 GDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGTTFTFLPYD 307
Query: 291 AYKTTLDLMRKDLKGKPLED 310
Y + + + + +ED
Sbjct: 308 VYYRFESAVAEYINLEHVED 327
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 107/255 (41%), Gaps = 37/255 (14%)
Query: 62 YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT-------LPPESL--YHPKNN- 111
YY+V +++G P + + +DTGSDL WV C+ C C P +L Y P+ +
Sbjct: 111 YYAV-VEVGTPNATFLVALDTGSDLFWVPCD--CKQCASIANVTGQPATALRPYSPRESS 167
Query: 112 ---LVACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTN--- 164
V C++ C P N C YEV Y + + S+ GVLV D L
Sbjct: 168 TSKQVTCDNALC---DRPNGCSAATNGSCPYEVQYLSANTSTSGVLVQDVLHLTRERPGA 224
Query: 165 ----GSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLT-RNVLGH 219
G L ++FGCG Q G++GLG S+ S L S GL +
Sbjct: 225 AAEAGEALQAPVVFGCGQVQTGTFLDGAAFDGLMGLGRENVSVPSVLASSGLVASDSFSM 284
Query: 220 CLSVRGGGYLFLGHDLVPSSGIAWTPMS--RDLLEKHYSSGPAELLFGGKSTGIKGLQII 277
C G G + G SSG TP + R L +++ E KS + +I
Sbjct: 285 CFGDDGVGRINFGDS--GSSGQGETPFTGRRTLYNVSFTAVNVET----KSVAAEFAAVI 338
Query: 278 FDSGSSYTYFNSQAY 292
DSG+S+TY Y
Sbjct: 339 -DSGTSFTYLADPEY 352
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 73/265 (27%), Positives = 110/265 (41%), Gaps = 47/265 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNNL----VAC 115
G Y VT+ IG P L DTGSDLTW QC PC G C E ++P ++ V+C
Sbjct: 130 GNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCE-PCLGSCYSQKEPKFNPSSSSTYQNVSC 188
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
+ P C E+ + C Y ++Y D + G L + F LTN +L + FG
Sbjct: 189 SSPMC------EDAESCSASNCVYSIVYGDKSFTQGFLAKEKF--TLTNSDVL-EDVYFG 239
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLFLG 232
CG N + G+ L + Q+ N+ +CL + G+L G
Sbjct: 240 CGENNQGLFDGVAGLLGLGPGKLSLPA-----QTTTTYNNIFSYCLPSFTSNSTGHLTFG 294
Query: 233 HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG----GKSTGIKGLQI----------IF 278
+ S + +TP+ SS P+ +G G S G K L I I
Sbjct: 295 SAGI-SESVKFTPI---------SSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAII 344
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDL 303
DSG+ +T ++ Y + ++ +
Sbjct: 345 DSGTVFTRLPTKVYAELRSVFKEKM 369
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 113/260 (43%), Gaps = 28/260 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKN----NLVACN 116
Y V + G P + IDTGSD++W+QC PC+ C + LY P + + V C
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 137
Query: 117 DPFCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C C + QC + + YAD S++G D L L G+++ FG
Sbjct: 138 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDK--LTLAPGAIV-QNFYFG 194
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGG-GYLFLGH 233
CG+ + GVLGLG + S+ ++ V +CL SV G+L LG
Sbjct: 195 CGHGKH---AVRGLFDGVLGLGRLRESLGARYG------GVFSYCLPSVSSKPGFLALGA 245
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGP-AELLFGGKSTGIKGLQ----IIFDSGSSYTYFN 288
P SG +TPM + +S+ A + GGK ++ +I DSG+ T
Sbjct: 246 GKNP-SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGLQ 304
Query: 289 SQAYKTTLDLMRKDLKGKPL 308
S AY+ RK ++ L
Sbjct: 305 STAYRALRSAFRKAMEAYRL 324
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 87/198 (43%), Gaps = 18/198 (9%)
Query: 52 PITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK-- 109
P+ GNV LGYY L IG P + +DTGS L C+ CT C ++ P+
Sbjct: 70 PVYGNVPELGYYYTYLTIGTPGQTVSGILDTGSTLPAFPCSG-CTRCGPSKTGMFKPELS 128
Query: 110 --NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
++ C+D C F + C N+QC Y + Y + S+ G L D L + +G
Sbjct: 129 STSSTFGCSDARC--FCGANSCSCN-NEQCGYSIRYLEGSSTSGFLAEDM--LAVGDG-- 181
Query: 168 LGP--RLIFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR 224
GP +FGC Q G A GV G+G AS+ QL G+ + C
Sbjct: 182 -GPAANFVFGCA--QSESGLLYSQIADGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAP 238
Query: 225 GGGYLFLGHDLVPSSGIA 242
G L LG+ +P+ A
Sbjct: 239 REGVLLLGNVALPADAPA 256
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 104/269 (38%), Gaps = 23/269 (8%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN---NLVACNDPF 119
Y + +G PP + + +DTGSDL W+ CN T C E + P++ NL N
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT-CIRDLEDIGVPQSVPLNLYTPNAST 160
Query: 120 CSAFHLPENIRCEANDQCD-------YEVLYADHGSSLGVLVTD--HFPLRLTNGSLLGP 170
S+ + RC + +C Y++ Y++ + G L+ D H N + +
Sbjct: 161 TSSSIRCSDKRCFGSKKCSSPKSICPYQISYSNSTGTTGTLLQDVLHLATEDENLTPVKT 220
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF 230
+ GCG Q + GVLGLG+ S+ S L +T + C G
Sbjct: 221 NVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITADSFSMCFGRVIGNVGR 280
Query: 231 LGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQ 290
+ + TP Y + GG G + L FD+GSS+T+
Sbjct: 281 ISFGDKGYTDQEETPFISVAPSTAYGLNVTGVSVGGDPVGTR-LFAKFDTGSSFTHLMEP 339
Query: 291 AYKTTLDLMRKDLKGKPLEDTAEEKALPV 319
AY K +D E+K PV
Sbjct: 340 AYGVLT---------KSFDDLVEDKRRPV 359
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 67/133 (50%), Gaps = 13/133 (9%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
++G G Y + + IG PP + +DTGSD++W+QC APC+ C + ++ P ++
Sbjct: 139 VSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQC-APCSECYQQSDPIFDPISSN 197
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
+ C++P C + L E C N C YEV Y D ++G T+ +T GS
Sbjct: 198 SYSPIRCDEPQCKSLDLSE---CR-NGTCLYEVSYGDGSYTVGEFATE----TVTLGSAA 249
Query: 169 GPRLIFGCGYNQR 181
+ GCG+N
Sbjct: 250 VENVAIGCGHNNE 262
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 77/178 (43%), Gaps = 20/178 (11%)
Query: 46 GSTAVFPITGNVY----PLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQC-NAPCTGC-- 98
G +A P+ Y P Y V L G PP+ +L +DTGSD+TW QC P + C
Sbjct: 67 GRSASAPVNPGAYDDGFPFTEYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFN 126
Query: 99 -TLPPESLYHPKNN----LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVL 153
TLP L+ P + + C+ P C + C+Y + Y D S G +
Sbjct: 127 QTLP---LFDPSASSSFASLPCSSPACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEI 183
Query: 154 VTDHFPLRLTNG---SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
+ F G S P L+FGCG+ R G G+ G G G S+ SQL+
Sbjct: 184 GREVFTFASGTGEGSSAAVPGLVFGCGHANR--GVFTSNETGIAGFGRGSLSLPSQLK 239
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 59/132 (44%), Gaps = 12/132 (9%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN- 111
I+G G Y + +G PPK + +DTGSD+ W+QC APC C + +++P +
Sbjct: 32 ISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQC-APCKNCYSQTDPVFNPVKSG 90
Query: 112 ---LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
V C P C P C C Y+V Y D + G VT+ R T
Sbjct: 91 SFAKVLCRTPLCRRLESPG---CNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE-- 145
Query: 169 GPRLIFGCGYNQ 180
++ GCG++
Sbjct: 146 --QVALGCGHDN 155
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 71/261 (27%), Positives = 106/261 (40%), Gaps = 26/261 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNNL----VACND 117
Y V + +G P + L DTGS LTW QC PC G C + ++ P + + C
Sbjct: 140 YYVVVGLGTPKRDLSLIFDTGSYLTWTQCE-PCAGSCYKQQDPIFDPSKSSSYTNIKCTS 198
Query: 118 PFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
C+ F + C Y+V Y D+ S G L + + T+ + +FGCG
Sbjct: 199 SLCTQFR-SAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD---IVHDFLFGCG 254
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDL 235
Q N G TAG++GL S + Q S + + +CL + G+L G
Sbjct: 255 --QDNEGLF-RGTAGLMGLSRHPISFVQQTSS--IYNKIFSYCLPSTPSSLGHLTFGASA 309
Query: 236 VPSSGIAWTPMSRDLLEKHY--------SSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
++ + +TP S E + S G +L ST G II DSG+ T
Sbjct: 310 ATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGSII-DSGTVITRL 368
Query: 288 NSQAYKTTLDLMRKDLKGKPL 308
AY R+ + P+
Sbjct: 369 PPTAYAALRSAFRQFMMKYPV 389
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 72/274 (26%), Positives = 115/274 (41%), Gaps = 63/274 (22%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFC 120
VTL IG PP+ ++ +DTGS L+W+QC+ PP + + P + ++ C P C
Sbjct: 90 VTLPIGTPPQPQQMVLDTGSQLSWIQCHN-----KTPPTASFDPSLSSSFYVLPCTHPLC 144
Query: 121 SA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
F LP C+ N C Y YAD + G LV + + S P LI GC
Sbjct: 145 KPRVPDFTLPTT--CDQNRLCHYSYFYADGTYAEGNLVREKLAF---SPSQTTPPLILGC 199
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG--------GGY 228
R+ G+LG+ LG+ S Q + + +C+ R G
Sbjct: 200 SSESRD-------ARGILGMNLGRLSFPFQAKVTKFS-----YCVPTRQPANNNNFPTGS 247
Query: 229 LFLGHDLVPSSG------IAWTPMSRDL--LEKHYSSGPAE-LLFGGKSTGIK------- 272
+LG++ P+S + P S+ + L+ + P + + GG+ I
Sbjct: 248 FYLGNN--PNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPN 305
Query: 273 ---GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDL 303
Q + DSGS +T+ AY D +R+++
Sbjct: 306 AGGSGQTMVDSGSEFTFLVDVAY----DRVREEI 335
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 74/168 (44%), Gaps = 14/168 (8%)
Query: 46 GSTAVFPI-TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES 104
GS P +G+ G Y VT+ +G P + DTGSDLTW QC C E
Sbjct: 120 GSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEP 179
Query: 105 LYHPKNNL----VACNDPFCSAFH--LPENIRCEANDQCDYEVLYADHGSSLGVLVTDHF 158
+++P + ++C+ P C + C A+ C Y + Y D S+G D
Sbjct: 180 IFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSAS-TCVYGIQYGDQSYSVGFFAQDKL 238
Query: 159 PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQ 206
L T+ + +FGCG N R AG++GLG S++S+
Sbjct: 239 ALTSTD---VFNNFLFGCGQNNRGLFVG---VAGLIGLGRNALSLMSK 280
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 59/132 (44%), Gaps = 12/132 (9%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN- 111
I+G G Y + +G PPK + +DTGSD+ W+QC APC C + +++P +
Sbjct: 119 ISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQC-APCKNCYSQTDPVFNPVKSG 177
Query: 112 ---LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
V C P C P C C Y+V Y D + G VT+ R T
Sbjct: 178 SFAKVLCRTPLCRRLESPG---CNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE-- 232
Query: 169 GPRLIFGCGYNQ 180
++ GCG++
Sbjct: 233 --QVALGCGHDN 242
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 120/288 (41%), Gaps = 32/288 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
+ + IG PP +DTGS LTWV C+ PC+ C+ ++ P + N CS
Sbjct: 93 FLMNFSIGEPPIPQLAVMDTGSSLTWVMCH-PCSSCSQQSVPIFDPSKSSTYSNLS-CS- 149
Query: 123 FHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFGCG--Y 178
E +C+ N +C Y V Y GSS G+ + L + S++ P LIFGCG +
Sbjct: 150 ----ECNKCDVVNGECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKF 205
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVP 237
+ + G GV GLG G+ S+L S G +C+ ++R Y F L
Sbjct: 206 SISSNGYPYQGINGVFGLGSGRFSLLP---SFG---KKFSYCIGNLRNTNYKFNRLVLGD 259
Query: 238 SSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI-----------KGLQIIFDSGSSYTY 286
+ + + +++ Y + GG+ I +I DSG+ +T+
Sbjct: 260 KANMQGDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTW 319
Query: 287 FNSQAYKTTLDLMRKDLKGKPLEDTAEEKALP--VCWKGTWKCLLGNF 332
++ L ++L L ++K P +C+ G L F
Sbjct: 320 LTKYGFE-VLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGF 366
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/342 (26%), Positives = 138/342 (40%), Gaps = 42/342 (12%)
Query: 7 RVMGL-LVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSV 65
R GL L L Q C +E + +++ + A+ S V + Y
Sbjct: 20 RAAGLRLELTHVDAKQNCSTEERMRRATERTHRRLASMGEASAPVH------WAESQYIA 73
Query: 66 TLKIGNPPKLYELDIDTGSDLTWVQCNAPC--TGCTLPPESLYHPKNNL----VACNDPF 119
IG+PP+ E IDTGS+L W QC+ C GC S Y P + VACND
Sbjct: 74 EYLIGDPPQQAEAIIDTGSNLIWTQCST-CQPAGCFSQNLSFYDPSRSRTARPVACNDTA 132
Query: 120 CSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYN 179
C+ L RC +++ + G GVL T+ F + + ++ L FGC
Sbjct: 133 CA---LGSETRCARDNKACAVLTAYGAGVIGGVLGTEAFTFQPQSENV---SLAFGCIAA 186
Query: 180 QRNPGPKPPPTAGVLGLGLGKASILSQL--------------QSLGLTRNVLGHCLSVRG 225
R +G++GLG G S++SQL QS +R +G +
Sbjct: 187 TRLTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCLTPYFSQSTNTSRLFVGASAGLSS 246
Query: 226 GG-----YLFLGH-DLVPSSGIAWTPMSRDLLEKHYSSGP-AELLFGGKSTGIKGLQIIF 278
GG FL + D+ P S + P++ + + P A +TG+ +I
Sbjct: 247 GGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGLWAGTLI- 305
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVC 320
DSGS +T AY+ D + + L + A + L +C
Sbjct: 306 DSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLC 347
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 104/261 (39%), Gaps = 32/261 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V+L +G PP+ + DTGSD+ W+QC PC C + L++P + + C
Sbjct: 79 GEYFVSLGVGTPPRTVNMVADTGSDVLWLQC-LPCQSCYGQTDPLFNPSFSSTFQSITCG 137
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C IR +QC Y+V Y D ++G T+ L+ GS + GC
Sbjct: 138 SSLCQQLL----IRGCRRNQCLYQVSYGDGSFTVGEFSTE----TLSFGSNAVNSVAIGC 189
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFLGH 233
G+N + G+ L S + QL +V +CL R G L G+
Sbjct: 190 GHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYG-----SVFSYCLPTRESTGSVPLIFGN 244
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ-----------IIFDSGS 282
V S+ T ++ L+ Y + GG S I +I DSG+
Sbjct: 245 QAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGT 304
Query: 283 SYTYFNSQAYKTTLDLMRKDL 303
+ T + AY D R +
Sbjct: 305 AVTRLVTSAYNPMRDAFRAGM 325
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 104/261 (39%), Gaps = 32/261 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y V+L +G PP+ + DTGSD+ W+QC PC C + L++P + + C
Sbjct: 79 GEYFVSLGVGTPPRTVNMVADTGSDVLWLQC-LPCQSCYGQTDPLFNPSFSSTFQSITCG 137
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C IR +QC Y+V Y D ++G T+ L+ GS + GC
Sbjct: 138 SSLCQQLL----IRGCRRNQCLYQVSYGDGSFTVGEFSTE----TLSFGSNAVNSVAIGC 189
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFLGH 233
G+N + G+ L S + QL +V +CL R G L G+
Sbjct: 190 GHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYG-----SVFSYCLPTRESTGSVPLIFGN 244
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ-----------IIFDSGS 282
V S+ T ++ L+ Y + GG S I +I DSG+
Sbjct: 245 QAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGT 304
Query: 283 SYTYFNSQAYKTTLDLMRKDL 303
+ T + AY D R +
Sbjct: 305 AVTRLVTSAYNPMRDAFRAGM 325
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 124/289 (42%), Gaps = 45/289 (15%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQC-NAPCTGCTLPP--ESLYHPKNNLVACNDPFC 120
+V+L +G+PP+ + +DTGS+L+W+ C +P P S Y P + C+ P C
Sbjct: 41 TVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSP----IPCSSPVC 96
Query: 121 --SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC-- 176
LP + C+ C V YAD S G L +D+F + GS P +FGC
Sbjct: 97 RTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI----GSSALPGTLFGCMD 152
Query: 177 -GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGH 233
G++ + + T G++G+ G S ++Q LGL + +C+S R G LF
Sbjct: 153 SGFSSNS--EEDAKTTGLMGMNRGSLSFVTQ---LGLPK--FSYCISGRDSSGVLLFGDS 205
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPA-ELLFGGKSTGIKGL---------------QII 277
L + +TP+ + Y A + G G K L Q +
Sbjct: 206 HLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTM 265
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKG--KPLEDT--AEEKALPVCWK 322
DSG+ +T+ Y + + KG PL D + A+ +C++
Sbjct: 266 VDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYR 314
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/284 (27%), Positives = 129/284 (45%), Gaps = 45/284 (15%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT---LPPESLYHPK----NNLVAC 115
+ V +G PP +DTGS L W+QC APC C+ + P ++ P + ++C
Sbjct: 102 FLVNFSMGQPPVPQLAIMDTGSSLLWIQC-APCKSCSQQIIGP--MFDPSISSTYDSLSC 158
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN-GSLLGPRLIF 174
+ C + P C+++ QC Y Y + S+GV+ T+ ++ G ++F
Sbjct: 159 KNIICR--YAPSG-ECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLF 215
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGH 233
GC + RN K GV GLG G S+++Q+ S +C+ ++ Y + +
Sbjct: 216 GCSH--RNGNYKDRRFTGVFGLGSGITSVVNQMGS------KFSYCIGNIADPDYSY--N 265
Query: 234 DLVPSSGIAW----TPMSRDLLEKHY-------SSGPAELLF---GGKSTGIKGLQIIFD 279
LV S G+ TP+ D+++ HY S G L+ K T K ++I D
Sbjct: 266 QLVLSEGVNMEGYSTPL--DVVDGHYQVILEGISVGETRLVIDPSAFKRTE-KQRRVIID 322
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SG++ T+ Y+ L+ ++L + L E L C+KG
Sbjct: 323 SGTAPTWLAENEYR-ALEREVRNLLDRFLTPFMRESFL--CYKG 363
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 114/284 (40%), Gaps = 39/284 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES--LYHPKNNL----VACN 116
Y VT+ +G P L IDTGSDL+WVQC PC T P+ L+ P + + CN
Sbjct: 124 YVVTVGLGTPSVSQVLLIDTGSDLSWVQCQ-PCNSTTCYPQKDPLFDPSKSSTYAPIPCN 182
Query: 117 DPFCSAFHLPENIR---CEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
C L ++ C + D QC + + Y D + GV + L L G +
Sbjct: 183 TDACR--DLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNET--LALAPGVAV-K 237
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG---- 226
FGCG++Q K G+LGLG S++ +Q+ + +CL
Sbjct: 238 DFRFGCGHDQDGANDK---YDGLLGLGGAPESLV--VQTASVYGGAFSYCLPALNNQVGF 292
Query: 227 ----GYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ----IIF 278
G +V +SG +TPM R+ E Y + GG+ + +I
Sbjct: 293 LALGGGGAPSGGVVNTSGFVFTPMIRE-EETFYVVNMTGITVGGEPIDVPPSAFSGGMII 351
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
DSG+ T AY RK + PL E L C+
Sbjct: 352 DSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGE---LDTCYD 392
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 76/329 (23%), Positives = 136/329 (41%), Gaps = 62/329 (18%)
Query: 33 KKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
K K+T AH + + P + V + IG+PP L +DT SDL W+QC
Sbjct: 63 KAKTTGDIIAHLSPNVPIIPQA--------FLVNISIGSPPITQLLHMDTASDLLWIQC- 113
Query: 93 APCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLG 151
PC C ++ P + N+ ++ + +++ AN + C+Y + Y D S G
Sbjct: 114 LPCINCYAQSLPIFDPSRSYTHRNETCRTSQYSMPSLKFNANTRSCEYSMRYVDDTGSKG 173
Query: 152 VLVTDHFPLRLT---NGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQ-- 206
+L + + S ++FGCG++ +P G+LGLG G+ S++ +
Sbjct: 174 ILAREMLLFNTIYDESSSAALHDVVFGCGHDNYG---EPLVGTGILGLGYGEFSLVHRFG 230
Query: 207 ------------------LQSLGLT-RNVLGHC--LSVRGGGYLFLGHDLVPSSGIAWTP 245
+ LG N+LG L + G Y ++ + + GI P
Sbjct: 231 KKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIHNGFY-YVTIEAISVDGIIL-P 288
Query: 246 MSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKG 305
+ + +++ +G GG I D+G+S T +AYK + + +G
Sbjct: 289 IDPRVFNRNHQTG-----LGGT---------IIDTGNSLTSLVEEAYKPLKNRIEDIFEG 334
Query: 306 K-PLEDTAEEKALPVCWKGTWKCLLGNFE 333
+ D +++ + + +C GNFE
Sbjct: 335 RFTAADVSQDDMIKM------ECYNGNFE 357
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 114/262 (43%), Gaps = 31/262 (11%)
Query: 61 GYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF 119
G Y + L +G PP +Y L +DT SDL W QC PC GC ++ P L CN F
Sbjct: 29 GDYLMKLTLGTPPVDVYGL-VDTDSDLVWAQC-TPCQGCYKQKNPMFDP---LKECNSFF 83
Query: 120 CSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYN 179
+ C CDY YAD ++ G+L + T+G + +IFGCG+N
Sbjct: 84 --------DHSCSPEKACDYVYAYADDSATKGMLAKEIATFSSTDGKPIVESIIFGCGHN 135
Query: 180 QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-----SVRGGGYLFLGH- 233
N G G++GLG G S++SQ+ +L ++ CL G + LG
Sbjct: 136 --NTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKR-FSQCLVPFHADPHTSGTISLGEA 192
Query: 234 DLVPSSGIAWTPMSRDLLEKHY-------SSGPAELLFGGKSTGIKGLQIIFDSGSSYTY 286
V G+ TP+ + + Y S G + F KG I+ DSG+ TY
Sbjct: 193 SDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSKG-NIMIDSGTPETY 251
Query: 287 FNSQAYKTTLDLMRKDLKGKPL 308
+ Y ++ ++ + P+
Sbjct: 252 LPQEFYDRLVEELKVQINLPPI 273
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 119/289 (41%), Gaps = 48/289 (16%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVA----- 114
+G Y + +G P K Y + +DTGS LTW+QC+ C +++PK +
Sbjct: 126 VGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSC 185
Query: 115 ----CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
C+D +A P + C ++ C Y+ Y D S+G L D ++ GS P
Sbjct: 186 SAQQCSD-LTTATLSPAS--CSTSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSVP 238
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNVLGHCLSVRGGGYL 229
+GCG Q N G +AG++GL K S+L QL S+G + +CL
Sbjct: 239 NFYYGCG--QDNEGLF-GQSAGLIGLARNKLSLLYQLAPSMGYS---FSYCLPTSSSSSS 292
Query: 230 FLGHDLVPSSG-IAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK--------------GL 274
+ G ++TPM+ L+ + L+ K TGIK L
Sbjct: 293 GYLSIGSYNPGQYSYTPMASSSLD--------DSLYFIKMTGIKVAGKPLSVSSSAYSSL 344
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
I DSG+ T + Y + +KG P + L C++G
Sbjct: 345 PTIIDSGTVITRLPTGVYSALSKAVAGAMKGTP--RASAFSILDTCFQG 391
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 70/154 (45%), Gaps = 11/154 (7%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y V + IG PP+ +L +DTGSDLTW QC APC C ++P ++ + C+
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169
Query: 119 FCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG---PRLIF 174
C + N C Y YADH + G L +D F + ++ G P L F
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
GCG N G G+ G G S+ +QL+
Sbjct: 230 GCGL--FNNGIFVSNETGIAGFSRGALSMPAQLK 261
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 70/154 (45%), Gaps = 11/154 (7%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y V + IG PP+ +L +DTGSDLTW QC APC C ++P ++ + C+
Sbjct: 85 YLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLR 143
Query: 119 FCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG---PRLIF 174
C + N C Y YADH + G L +D F + ++ G P L F
Sbjct: 144 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 203
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
GCG N G G+ G G S+ +QL+
Sbjct: 204 GCGL--FNNGIFVSNETGIAGFSRGALSMPAQLK 235
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 70/154 (45%), Gaps = 11/154 (7%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y V + IG PP+ +L +DTGSDLTW QC APC C ++P ++ + C+
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169
Query: 119 FCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG---PRLIF 174
C + N C Y YADH + G L +D F + ++ G P L F
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
GCG N G G+ G G S+ +QL+
Sbjct: 230 GCGL--FNNGIFVSNETGIAGFSRGALSMPAQLK 261
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 62/128 (48%), Gaps = 13/128 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK----NNLVACN 116
G + + L IG P Y +DTGSDLTW QC PC+ C P +Y P V+C
Sbjct: 19 GEFLMQLAIGKPSLAYSAILDTGSDLTWTQC-MPCSDCYKQPTPIYDPSLSSTYGTVSCK 77
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A LP + A C+Y Y D+ S+ G+L + F L++ S+ P + FGC
Sbjct: 78 SSLCLA--LPASACISAT--CEYLYTYGDYSSTQGILSYETF--TLSSQSI--PHIAFGC 129
Query: 177 GYNQRNPG 184
G + G
Sbjct: 130 GQDNEGSG 137
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 71/273 (26%), Positives = 105/273 (38%), Gaps = 22/273 (8%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYH------ 107
TGN + YY+ + +G P + + +DTGSDL W+ C+ C C P S YH
Sbjct: 200 TGNDFGWLYYTW-VDVGTPNTSFMVALDTGSDLFWIPCD--CIECA--PLSGYHGSLDRD 254
Query: 108 -----PKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTD--HFP 159
P + + + P L + C Y Y ++ +S G+LV D H
Sbjct: 255 LGIYKPAESTTSRHLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTTSSGLLVEDILHLD 314
Query: 160 LRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGH 219
R ++ + +I GCG Q G+LGLG+ S+ S L GL RN
Sbjct: 315 SRESHAPVKA-SVIIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSM 373
Query: 220 CLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFD 279
C + G F + + P+ L + Y+ + G K Q I D
Sbjct: 374 CFTKDSGRIFFGDQGVSTQQSTPFVPLYGKL--QTYTVNVDKSCVGHKCFESTSFQAIVD 431
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTA 312
SG+S+T YK K + L A
Sbjct: 432 SGTSFTALPLDIYKAVAIEFDKQVNASRLPQEA 464
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 79/287 (27%), Positives = 125/287 (43%), Gaps = 42/287 (14%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF 119
+G Y++ + +G P + + DTGSDL W QC APCT C P + P ++ P
Sbjct: 83 VGGYNMNISVGTPLLTFPVVADTGSDLIWTQC-APCTKCFQQPAPPFQPASSSTFSKLPC 141
Query: 120 CSAF--HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
S+F LP +IR C Y Y G + G L T+ L++ + S P + FGC
Sbjct: 142 TSSFCQFLPNSIRTCNATGCVYNYKYGS-GYTAGYLATET--LKVGDASF--PSVAFGCS 196
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLFLGHD 234
+ G T+G+ GLG G S++ Q LG+ R +CL S G + G
Sbjct: 197 -TENGVGNS---TSGIAGLGRGALSLIPQ---LGVGR--FSYCLRSGSAAGASPILFGSL 247
Query: 235 LVPSSG-IAWTPMSRD--LLEKHY-------SSGPAEL-----LFGGKSTGIKGLQIIFD 279
+ G + TP + + +Y + G +L FG G+ G I+ D
Sbjct: 248 ANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIV-D 306
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEE--KALPVCWKGT 324
SG++ TY Y +++++ + T + L +C+K T
Sbjct: 307 SGTTLTYLAKDGY----EMVKQAFLSQTANVTTVNGTRGLDLCFKST 349
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 62/128 (48%), Gaps = 13/128 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK----NNLVACN 116
G + + L IG P Y +DTGSDLTW QC PC+ C P +Y P V+C
Sbjct: 19 GEFLMQLAIGKPSLAYSAILDTGSDLTWTQC-IPCSDCYKQPTPIYDPSLSSTYGTVSCK 77
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C A LP + A C+Y Y D+ S+ G+L + F L++ S+ P + FGC
Sbjct: 78 SSLCLA--LPASACISAT--CEYLYTYGDYSSTQGILSYETF--TLSSQSI--PHIAFGC 129
Query: 177 GYNQRNPG 184
G + G
Sbjct: 130 GQDNEGSG 137
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/270 (27%), Positives = 118/270 (43%), Gaps = 41/270 (15%)
Query: 61 GYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VAC 115
G Y ++ +G PP K Y + +DTGSD+ W+QC PC C ++P + ++C
Sbjct: 85 GDYIMSYSVGTPPIKSYGI-VDTGSDIVWLQCE-PCEQCYNQTTPKFNPSKSSSYKNISC 142
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIF 174
+ C + + C C+Y + Y + S G L + L T G + P+ +
Sbjct: 143 SSKLCQSVR---DTSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVI 199
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL--------------QSLGLTRNVLGHC 220
GCG N N G ++GV+GLG G AS+++QL S+ L +G
Sbjct: 200 GCGTN--NIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSS 257
Query: 221 LSVRGGGYLFLGHDLVPSSGIAWTPMSRD-------LLEKHYSSGPAELLFGGKSTGIKG 273
G + GH+++ TP+ + L + +S G + F G S G++
Sbjct: 258 KLNFGDVAIVSGHNVLS------TPIVKKDHSFFYYLTIEAFSVGDKRVEFAGSSKGVEE 311
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDL 303
II DS + T+ S Y T L+ DL
Sbjct: 312 GNIIIDSSTIVTFVPSDVY-TKLNSAIVDL 340
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 103/248 (41%), Gaps = 29/248 (11%)
Query: 72 PPKLYELDIDTGSDLTWVQCNAPCTGCTLPP-----ESLYHPKNN----LVACNDPFCSA 122
P + + +D+ SD+ WVQC C +PP +S Y P + +C+ P C+A
Sbjct: 25 PGVIQTVVLDSASDVPWVQC----VPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTA 80
Query: 123 FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRN 182
P C AN+QC Y V Y D S+ G + D L N ++ G FGC + ++
Sbjct: 81 LG-PYANGC-ANNQCQYLVRYPDGSSTSGAYIADLLTLDAGN-AVSG--FKFGCSHAEQ- 134
Query: 183 PGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSG 240
G AG++ LG G S+LSQ S N +C+ + G+ LG SS
Sbjct: 135 -GSFDARAAGIMALGGGPESLLSQTAS--RYGNAFSYCIPATASDSGFFTLGVPRRASSR 191
Query: 241 IAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIK----GLQIIFDSGSSYTYFNSQAYKTT 295
TPM R Y + GG+ G+ + DS ++ T AY+
Sbjct: 192 YVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQAL 251
Query: 296 LDLMRKDL 303
R +
Sbjct: 252 RAAFRSSM 259
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 119/289 (41%), Gaps = 48/289 (16%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVA----- 114
+G Y + +G P K Y + +DTGS LTW+QC+ C +++PK +
Sbjct: 126 VGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSC 185
Query: 115 ----CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
C+D +A P + C ++ C Y+ Y D S+G L D ++ GS P
Sbjct: 186 SAQQCSD-LTTATLNPAS--CSTSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSVP 238
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNVLGHCLSVRGGGYL 229
+GCG Q N G +AG++GL K S+L QL S+G + +CL
Sbjct: 239 NFYYGCG--QDNEGLF-GQSAGLIGLARNKLSLLYQLAPSMGYS---FSYCLPTSSSSSS 292
Query: 230 FLGHDLVPSSG-IAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK--------------GL 274
+ G ++TPM+ L+ + L+ K TGIK L
Sbjct: 293 GYLSIGSYNPGQYSYTPMASSSLD--------DSLYFIKMTGIKVAGKPLSVSSSAYSSL 344
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
I DSG+ T + Y + +KG P + L C++G
Sbjct: 345 PTIIDSGTVITRLPTGVYSALSKAVAGAMKGTP--RASAFSILDTCFQG 391
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 78/311 (25%), Positives = 120/311 (38%), Gaps = 28/311 (9%)
Query: 33 KKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
+KK + S++V G +G Y L +G P Y + +DTGS LTW+QC+
Sbjct: 101 RKKKAGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCS 160
Query: 93 APCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLP--ENIRCEANDQCDYEVLYADH 146
C ++ P+ + V C+ C C ++ C Y+ Y D
Sbjct: 161 PCSVSCHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDS 220
Query: 147 GSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQ 206
S+G L D ++ GS P +GCG + + +AG++GL K S+L Q
Sbjct: 221 SYSVGYLSKD----TVSFGSGSFPGFYYGCGQDNEGLFGR---SAGLIGLAKNKLSLLYQ 273
Query: 207 LQ-SLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKH-YSSGPAEL 262
L SLG +CL S GYL +G ++TPM+ L+ Y + +
Sbjct: 274 LAPSLGY---AFSYCLPTSSAAAGYLSIGS--YNPGQYSYTPMASSSLDASLYFVTLSGI 328
Query: 263 LFGGKSTGI-----KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKAL 317
G + + L I DSG+ T Y T L L
Sbjct: 329 SVAGAPLAVPPSEYRSLPTIIDSGTVITRLPPNVY-TALSRAVAAAMASAAPRAPTYSIL 387
Query: 318 PVCWKGTWKCL 328
C++G+ L
Sbjct: 388 DTCFRGSAAGL 398
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 67/146 (45%), Gaps = 5/146 (3%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y + L IG PP + DTGSDLTW QC PC C +Y P + P SA
Sbjct: 71 YLMELAIGKPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPLPCSSA 129
Query: 123 FHLPENIR-CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQR 181
LP R C + C Y Y D S G+L T+ L ++ + + FGCG +
Sbjct: 130 TCLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGTDN- 188
Query: 182 NPGPKPPPTAGVLGLGLGKASILSQL 207
G + G +GLG G S+L+QL
Sbjct: 189 --GGDSLNSTGTVGLGRGTLSLLAQL 212
>gi|32488713|emb|CAE03456.1| OSJNBa0088H09.14 [Oryza sativa Japonica Group]
Length = 490
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 130/302 (43%), Gaps = 67/302 (22%)
Query: 44 RFGSTAVFPITGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWVQC----NAPCTG 97
R G+ + ++YP Y Y+ T+ +G PP+ + ++TGS L+WV +A C+
Sbjct: 68 RQGTAPPPSVRASLYPHSYGGYAFTVSLGTPPQPLPVLLETGSHLSWVPSTSSYSANCSS 127
Query: 98 CTLP-PESLYHPKNN----LVACNDPFCSAFHLPENIR-CEANDQC-------------- 137
+ P ++HPKN+ L+ C +P C H P+++ C A C
Sbjct: 128 LSAASPLHVFHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANN 187
Query: 138 ---DYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVL 194
Y V+Y GS+ G+L++D LR ++ + GC + +PP +G+
Sbjct: 188 VCPPYLVVYGS-GSTAGLLISDT--LRTPGRAVRN--FVIGCSLASVH---QPP--SGLA 237
Query: 195 GLGLGKASILSQLQSLGLTRNVLGHCL---------SVRGGGYLFLGHDLVPSSGIAWTP 245
G G G S+ SQ LGLT+ +CL +V G L G+ + P
Sbjct: 238 GFGRGAPSVPSQ---LGLTK--FSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAP 292
Query: 246 MSRDLLEK-----HYSSGPAELLFGGKSTGIKGLQI---------IFDSGSSYTYFNSQA 291
++R + +Y + GGKS + I DSG++++YF+
Sbjct: 293 LARSASARPPYSVYYYLALTAITVGGKSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTV 352
Query: 292 YK 293
++
Sbjct: 353 FE 354
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/239 (26%), Positives = 95/239 (39%), Gaps = 25/239 (10%)
Query: 80 IDTGSDLTWVQC-NAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCE-A 133
+DT SD+ WVQC P C L + LY P + + C P C C
Sbjct: 173 VDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPT 232
Query: 134 NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGV 193
D+C Y V Y D ++ G VTD + T ++ FGC + R G AG+
Sbjct: 233 TDECKYIVNYGDGKATTGTYVTDTLTMSPT---IVVKDFRFGCSHAVR--GSFSNQNAGI 287
Query: 194 LGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLE 252
L LG G+ S+L Q+ N +C+ G+L LG + S ++TP+ ++
Sbjct: 288 LALGGGRGSLLE--QTADAYGNAFSYCIPKPSSAGFLSLGGPVEASLKFSYTPLIKN--- 342
Query: 253 KH----YSSGPAELLFGGKSTGIK----GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDL 303
KH Y ++ GK + + DSG+ T Q Y R +
Sbjct: 343 KHAPTFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAM 401
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 70/161 (43%), Gaps = 13/161 (8%)
Query: 30 PPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWV 89
P S S + + R +T +G G Y + + +G PP+ + + +DTGSDL W+
Sbjct: 121 PASPSSSPRRALSERMVATVE---SGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWL 177
Query: 90 QCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRC---EANDQCDYEVL 142
QC APC C ++ P + V C D C PE R D C Y
Sbjct: 178 QC-APCLDCFDQVGPVFDPAASSSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYW 236
Query: 143 YADHGSSLGVLVTDHFPLRLT--NGSLLGPRLIFGCGYNQR 181
Y D ++ G L + F + LT S ++FGCG+ R
Sbjct: 237 YGDQSNTTGDLALESFTVNLTAPGASRRVDDVVFGCGHWNR 277
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 70/266 (26%), Positives = 105/266 (39%), Gaps = 24/266 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES-----------LYHPKNN 111
Y + +G P + + +DTGSDL WV C+ C C P S +Y P +
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCA--PLSGYRGNLDRDLRIYRPAES 151
Query: 112 LVACNDPFCSAFHLPENIRCEANDQ-CDYEVLY-ADHGSSLGVLVTDHFPLRLTNGSL-L 168
+ + P CS C Q C Y + Y +++ +S G+L+ D L + +
Sbjct: 152 TTSRHLP-CSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPV 210
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY 228
+I GCG Q G+L LG+ S+ S L GL +N C G
Sbjct: 211 NASVIIGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARAGLVQNSFSMCFKEDSSGR 270
Query: 229 LFLGHDLVPS-SGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
+F G VPS + P+ L + Y+ + G K + + DSG+S+T
Sbjct: 271 IFFGDQGVPSQQSTPFVPLYGKL--QTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSL 328
Query: 288 NSQAYKTTLDLMRKDLKGK--PLEDT 311
YK K + P EDT
Sbjct: 329 PFDVYKAFTMEFDKQMNATRVPYEDT 354
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 88/192 (45%), Gaps = 18/192 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDP 118
Y +T+ IG+P + +DTGSD++WVQC PC+ C +SL+ P + +C+
Sbjct: 122 YVITVGIGSPAVTQTMSMDTGSDVSWVQCK-PCSQCHSEVDSLFDPSSSSTYSPFSCSSA 180
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
C+ + + QC Y V Y D S+ G + LT GS FGC
Sbjct: 181 PCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSS----DTLTLGSSAMTDFQFGC-- 234
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--GYLFLGHDLV 236
+Q G T G++GLG G S+ S Q+ G +CL G G+L LG
Sbjct: 235 SQSESGGFNDQTDGLMGLGGGAQSLAS--QTAGTFGTAFSYCLPPTSGSSGFLTLGTG-- 290
Query: 237 PSSGIAWTPMSR 248
SSG TPM R
Sbjct: 291 -SSGFVKTPMLR 301
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 119/289 (41%), Gaps = 48/289 (16%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVA----- 114
+G Y + +G P K Y + +DTGS LTW+QC+ C +++PK +
Sbjct: 124 VGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSC 183
Query: 115 ----CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
C+D +A P + C ++ C Y+ Y D S+G L D ++ GS P
Sbjct: 184 SAQQCSD-LTTATLNPAS--CSTSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSVP 236
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNVLGHCLSVRGGGYL 229
+GCG Q N G +AG++GL K S+L QL S+G + +CL
Sbjct: 237 NFYYGCG--QDNEGLF-GQSAGLIGLARNKLSLLYQLAPSMGYS---FSYCLPTSSSSSS 290
Query: 230 FLGHDLVPSSG-IAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK--------------GL 274
+ G ++TPM+ L+ + L+ K TGIK L
Sbjct: 291 GYLSIGSYNPGQYSYTPMASSSLD--------DSLYFIKMTGIKVAGKPLSVSSSAYSSL 342
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
I DSG+ T + Y + +KG P + L C++G
Sbjct: 343 PTIIDSGTVITRLPTGVYSALSKAVAGAMKGTP--RASAFSILDTCFQG 389
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/253 (28%), Positives = 108/253 (42%), Gaps = 44/253 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
GYY+ L IG PP+ + L +DTGS +T+V C+ C C + + P++
Sbjct: 91 GYYTARLWIGTPPQRFALIVDTGSTVTYVPCST-CRHCGSHQDPKFRPED---------- 139
Query: 121 SAFHLPENIRCEAN-----DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-RLIF 174
S + P + N QC YE YA+ +S G L D + N + L P R IF
Sbjct: 140 SETYQPVKCTWQCNCDNDRKQCTYERRYAEMSTSSGALGED--VVSFGNQTELSPQRAIF 197
Query: 175 GCG-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRG 225
GC YNQR G++GLG G SI+ QL + + C G
Sbjct: 198 GCENDETGDIYNQR--------ADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVG 249
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI------IFD 279
GG + LG + P + + +T S + +Y+ E+ GK + + D
Sbjct: 250 GGAMVLG-GISPPADMVFT-RSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLD 307
Query: 280 SGSSYTYFNSQAY 292
SG++Y Y A+
Sbjct: 308 SGTTYAYLPESAF 320
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/239 (28%), Positives = 105/239 (43%), Gaps = 21/239 (8%)
Query: 74 KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENI 129
+ Y+L +DTGS T+V C C C Y ++ + C + A E +
Sbjct: 49 QTYDLIVDTGSARTYVPCKG-CARCGEHAHGYYDYDRSMEFERLDCGEA-SDATLCEETM 106
Query: 130 R--CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKP 187
+ C+++ +C Y V YA+ SS G +V D +RL G+ L L FGC + N
Sbjct: 107 KGTCQSDGRCSYVVSYAEGSSSRGYVVRDR--VRLGEGT-LSAMLAFGCEEAETN-AIYE 162
Query: 188 PPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG--GGYLFLGH-DL-VPSSGIAW 243
G+ G G G A++ +QL S GL NV C+ G GG L LG D + +A
Sbjct: 163 QKADGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLGRFDFGADAPALAR 222
Query: 244 TPMSRDLLEKHYSSGPAELLFGGKS--TGIKGLQIIFDSGSSYTYFNSQ---AYKTTLD 297
TP+ D + + G S + DSG+++T+ ++KT LD
Sbjct: 223 TPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGTTFTFVPRSVWVSFKTRLD 281
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 111/272 (40%), Gaps = 28/272 (10%)
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES--LYHPKNNL 112
G+ Y Y T+ +G P L +DTGS LTWVQC PC P+ L+ P +
Sbjct: 121 GSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCK-PCNSSQCYPQRLPLFDPNTSS 179
Query: 113 ----VACNDPFCSAFHL---PENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG 165
V C+ C A + + + C YE+ Y + G TD L L G
Sbjct: 180 SYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTD--ALTLGPG 237
Query: 166 SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG 225
+++ R FGCG++Q+ K GVLGLG S+ Q S V HCL G
Sbjct: 238 AIV-KRFHFGCGHHQQR--GKFDMADGVLGLGRLPQSLAWQ-ASARRGGGVFSHCLPPTG 293
Query: 226 --GGYLFLG--HDLVPSSGIAWTP-MSRDLLEKHYSSGPAELLFGGKSTGIKGLQ----I 276
G+L LG HD +S +TP ++ D Y P + G+ I +
Sbjct: 294 VSTGFLALGAPHD---TSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFREGV 350
Query: 277 IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
I DSG+ + AY R + PL
Sbjct: 351 ITDSGTVLSALQETAYTALRTAFRSAMAEYPL 382
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 121/284 (42%), Gaps = 29/284 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG P + DTGSDLTWVQC PC C L+ P + + C
Sbjct: 92 GEYFMKMSIGTPLVEVIVIADTGSDLTWVQC-LPCDPCYRQKSPLFDPSRSSSYRHMLCG 150
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS--LLGPRLIF 174
FC+A + E + C+Y Y D + G L T+ F + T+ L P ++F
Sbjct: 151 SRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSP-IVF 209
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL------SVRGGGY 228
GCG N G +G++GLG G S++SQL S + + +CL S
Sbjct: 210 GCGTG--NGGTFDELGSGIVGLGGGALSLVSQLSS--IIKGKFSYCLVPLSEQSNVTSKI 265
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK---------STGIKGLQIIFD 279
F ++ + TP+ + +Y + G K + ++ +I D
Sbjct: 266 KFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIID 325
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SG++ T+ +S+ + ++ + +K + + D VC++
Sbjct: 326 SGTTLTFLDSEFFTELERVLEETVKAERVSD--PRGLFSVCFRS 367
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/277 (27%), Positives = 116/277 (41%), Gaps = 66/277 (23%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----------VA 114
++L IG PP+ ++ +DTGS L+W+QC+ LPP+ PK + +
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCHRK----KLPPK----PKTSFDPSLSSSFSTLP 125
Query: 115 CNDPFCSA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
C+ P C F LP + C++N C Y YAD + G LV + T + P
Sbjct: 126 CSHPLCKPRIPDFTLPTS--CDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE---ITP 180
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGG- 226
LI GC + G+LG+ G+ S +SQ + + +C+ S R G
Sbjct: 181 PLILGCATESSD-------DRGILGMNRGRLSFVSQAKI-----SKFSYCIPPKSNRPGF 228
Query: 227 ---GYLFLGHDLVPSSGIAWT-----PMSR---DLLEKHYSSGPAELLFGGKSTGIKGL- 274
G +LG D S G + P S+ +L Y+ + FG K I G
Sbjct: 229 TPTGSFYLG-DNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSV 287
Query: 275 ---------QIIFDSGSSYTYFNSQAY-KTTLDLMRK 301
Q + DSGS +T+ AY K ++M +
Sbjct: 288 FRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTR 324
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 90/337 (26%), Positives = 143/337 (42%), Gaps = 44/337 (13%)
Query: 1 MEEKGKRVMGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPL 60
+E+K + M + T Q + AN + KS++ + G+ +G
Sbjct: 111 IEKKDTKSMSWKQEVKVITIQQQNNLANAVVASLKSSKDEFS---GNIMATLESGASLGT 167
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G PPK L +DTGSDL+W+QC+ PC C Y+P + ++C
Sbjct: 168 GEYFIDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGPHYNPNESSSYRNISCY 226
Query: 117 DPFCSAFHLPENIR-CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLT--NGSLLGPRL 172
DP C P+ ++ C+ +Q C Y YAD ++ G + F + LT NG +
Sbjct: 227 DPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHV 286
Query: 173 I---FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-----VR 224
+ FGCG+ + G+LGLG G S SQLQS + + +CL+
Sbjct: 287 VDVMFGCGHWNKGFFHG---AGGLLGLGRGPLSFPSQLQS--IYGHSFSYCLTDLFSNTS 341
Query: 225 GGGYLFLGHD--LVPSSGIAWTPM---SRDLLEKHYSSGPAELLFGGK------------ 267
L G D L+ + +T + + Y ++ GG+
Sbjct: 342 VSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWS 401
Query: 268 STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
S G+ G I DSGS+ T+F AY + K +K
Sbjct: 402 SEGVGG--TIIDSGSTLTFFPDSAYDVIKEAFEKKIK 436
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 71/258 (27%), Positives = 104/258 (40%), Gaps = 40/258 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G + V + +G PP+ + IDTGSDLTW+Q + PC C + ++ P N +AC+
Sbjct: 23 GEFLVPIYLGTPPQKAVVIIDTGSDLTWIQ-SEPCRACFEQADPIFDPSKSSTYNKIACS 81
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C+ L C A C Y Y D + G + +T G + FG
Sbjct: 82 SSACA--DLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKE----TITATDTAGEEVKFGA 135
Query: 177 G-YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS--VRGG---GYLF 230
YN G G+LGLG G S+ SQL S + N +CL + G ++
Sbjct: 136 SVYNTGTFGDT--GGEGILGLGQGPVSMPSQLGS--VLGNKFSYCLVDWLSAGSETSTMY 191
Query: 231 LGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI-------------- 276
G VPS + +TP+ + Y + G S G L I
Sbjct: 192 FGDAAVPSGEVQYTPIVPNADHPTY----YYIAVQGISVGGSLLDIDQSVYEIDSGGSGG 247
Query: 277 -IFDSGSSYTYFNSQAYK 293
I DSG++ TY + +
Sbjct: 248 TIIDSGTTITYLQQEVFN 265
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 119/289 (41%), Gaps = 48/289 (16%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVA----- 114
+G Y + +G P K Y + +DTGS LTW+QC+ C +++PK +
Sbjct: 124 VGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSC 183
Query: 115 ----CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
C+D +A P + C ++ C Y+ Y D S+G L D ++ GS P
Sbjct: 184 SAQQCSD-LTTATLNPAS--CSTSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSVP 236
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNVLGHCLSVRGGGYL 229
+GCG Q N G +AG++GL K S+L QL S+G + +CL
Sbjct: 237 NFYYGCG--QDNEGLF-GQSAGLIGLARNKLSLLYQLAPSMGYS---FSYCLPTSSSSSS 290
Query: 230 FLGHDLVPSSG-IAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK--------------GL 274
+ G ++TPM+ L+ + L+ K TGIK L
Sbjct: 291 GYLSIGSYNPGQYSYTPMASSSLD--------DSLYFIKMTGIKVAGKPLSVSSSAYSSL 342
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
I DSG+ T + Y + +KG P + L C++G
Sbjct: 343 PTIIDSGTVITRLPTGVYSALSKAVAGAMKGTP--RASAFSILDTCFQG 389
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 65/133 (48%), Gaps = 13/133 (9%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
I+G G Y + IG P + + +DTGSD+ W+QC PC C E ++ P ++
Sbjct: 138 ISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQC-TPCADCYHQTEPIFEPSSSS 196
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
++C+ P C+A + E C N C YEV Y D ++G T+ LT GS L
Sbjct: 197 SYEPLSCDTPQCNALEVSE---CR-NATCLYEVSYGDGSYTVGDFATE----TLTIGSTL 248
Query: 169 GPRLIFGCGYNQR 181
+ GCG++
Sbjct: 249 VQNVAVGCGHSNE 261
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 109/267 (40%), Gaps = 36/267 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLY----HPKNNLVACNDP 118
Y VT+ +GN + IDTGSDLTWVQC+ PC C ++ N + CN
Sbjct: 133 YIVTIGLGNQN--MTVIIDTGSDLTWVQCD-PCMSCYSQQGPVFNPSNSSSYNSLLCNSS 189
Query: 119 FCS--AFHLPENIRCEAND--QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C F CE+N+ C++ V Y D + G L +H L+ G + +F
Sbjct: 190 TCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEH----LSFGGISVSNFVF 245
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFL 231
GCG RN +G++GLG S++SQ + V +CL G L +
Sbjct: 246 GCG---RNNKGLFGGVSGIMGLGRSNLSMISQTNT--TFGGVFSYCLPTTDSGASGSLVI 300
Query: 232 GHD------LVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGG---KSTGIKGLQIIFDSG 281
G++ L P IA+T M S L Y + GG + T I+ DSG
Sbjct: 301 GNESSLFKNLTP---IAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSFGNGGILIDSG 357
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPL 308
+ T Y K G P+
Sbjct: 358 TVITRLAPSLYNALKAEFLKQFSGYPI 384
>gi|424513106|emb|CCO66690.1| predicted protein [Bathycoccus prasinos]
Length = 802
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 70/271 (25%), Positives = 111/271 (40%), Gaps = 46/271 (16%)
Query: 33 KKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
KK+ + A + S+A + G GY+ T+ IG P +E+ +DTGS T+V C
Sbjct: 108 KKRRRRRRRALKQSSSAGLELNGKARDTGYFYATVLIGTPGHQFEVIVDTGSTYTFVTC- 166
Query: 93 APCTGC-----TLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHG 147
PC C P ++ V C C A+ C+Y+ +++
Sbjct: 167 YPCASCGQHGSNAPYDAAKSSSYERVPCGSGCIFG-------ACRASGLCEYDEKFSEDS 219
Query: 148 SSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
G +V+D + GSL PR+ FGC + N K G++ LG +A + QL
Sbjct: 220 QVGGHVVSDVIDV---GGSLGTPRIHFGCNSLETNM-LKTQKANGMIALGRAEAGLHRQL 275
Query: 208 QSL----GLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAEL 262
+ G G CL S GGG L LG L E+HY++
Sbjct: 276 KKKAYPPGSYDGTFGLCLGSFEGGGVLSLGK----------------LPEQHYAN----- 314
Query: 263 LFGGKSTGIKGLQIIFDSGSSYTYFNSQAYK 293
F + T ++++ GS Y+N + ++
Sbjct: 315 -FVTRKTHTSTVKLV--KGSKSQYYNVEVHR 342
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 108/259 (41%), Gaps = 17/259 (6%)
Query: 62 YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT--LPPE------SLYHPKNNLV 113
+Y+V + +G P + + +DTGSDL WV C+ C C + P Y P+ +
Sbjct: 104 HYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKSST 160
Query: 114 ACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTNGS--LLGP 170
+ P S ++ A+ C Y + Y +D+ SS GVLV D L G ++
Sbjct: 161 SRKVPCSSNLCDLQSACRSASSSCPYSIEYLSDNTSSTGVLVEDVLYLITEYGQPKIVTA 220
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF 230
+ FGCG Q G+LGLG+ S+ S L S G+ N C G G +
Sbjct: 221 PITFGCGRIQTGSFLGSAAPNGLLGLGMDSISVPSLLASEGVAANSFSMCFGDDGRGRIN 280
Query: 231 LGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQ 290
G SS TP++ +Y+ + G KS I DSG+S+T +
Sbjct: 281 FGD--TGSSDQQETPLNIYKQNPYYNISITGAMVGSKSFNTN-FNAIVDSGTSFTALSDP 337
Query: 291 AYKTTLDLMRKDLKGKPLE 309
Y ++ KP +
Sbjct: 338 MYSEITSSFNSQVQDKPTQ 356
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 76/277 (27%), Positives = 116/277 (41%), Gaps = 66/277 (23%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----------VA 114
++L IG PP+ ++ +DTGS L+W+QC+ LPP+ PK + +
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCHRK----KLPPK----PKTSFDPSLSSSFSTLP 125
Query: 115 CNDPFCSA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
C+ P C F LP + C++N C Y YAD + G LV + T + P
Sbjct: 126 CSHPLCKPRIPDFTLPTS--CDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE---ITP 180
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGG- 226
LI GC + G+LG+ G+ S +SQ + + +C+ S R G
Sbjct: 181 PLILGCATESSD-------DRGILGMNRGRLSFVSQAKI-----SKFSYCIPPKSNRPGF 228
Query: 227 ---GYLFLGHDLVPSSGIAWT-----PMSR---DLLEKHYSSGPAELLFGGKSTGIKGL- 274
G +LG D S G + P S+ +L Y+ + FG K I G
Sbjct: 229 TPTGSFYLG-DNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSV 287
Query: 275 ---------QIIFDSGSSYTYFNSQAY-KTTLDLMRK 301
Q + DSGS +T+ AY K ++M +
Sbjct: 288 FRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTR 324
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 104/269 (38%), Gaps = 23/269 (8%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN---NLVACNDPF 119
Y + +G PP + + +DTGSDL W+ CN T C E + P++ NL N
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT-CIRDLEDIGVPQSVPLNLYTPNAST 160
Query: 120 CSAFHLPENIRCEANDQCD-------YEVLYADHGSSLGVLVTD--HFPLRLTNGSLLGP 170
S+ + RC + +C Y++ Y++ + G L+ D H N + +
Sbjct: 161 TSSSIRCSDKRCFGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTPVKA 220
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF 230
+ GCG Q + GVLGLG+ S+ S L +T N C G
Sbjct: 221 NVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVGR 280
Query: 231 LGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQ 290
+ + TP Y + + G I+ L FD+GSS+T+
Sbjct: 281 ISFGDRGYTDQEETPFISVAPSTAYGVNISGVSVAGDPVDIR-LFAKFDTGSSFTHLREP 339
Query: 291 AYKTTLDLMRKDLKGKPLEDTAEEKALPV 319
AY K ++ E++ PV
Sbjct: 340 AYGVLT---------KSFDELVEDRRRPV 359
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 103/248 (41%), Gaps = 29/248 (11%)
Query: 72 PPKLYELDIDTGSDLTWVQCNAPCTGCTLPP-----ESLYHPKNNL----VACNDPFCSA 122
P + + +D+ SD+ WVQC C +PP +S Y P + +C+ P C+A
Sbjct: 155 PGVIQTVVLDSASDVPWVQC----VPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTA 210
Query: 123 FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRN 182
P C AN+QC Y V Y D S+ G + D L N ++ G FGC + ++
Sbjct: 211 LG-PYANGC-ANNQCQYLVRYPDGSSTSGAYIADLLTLDAGN-AVSG--FKFGCSHAEQ- 264
Query: 183 PGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSG 240
G AG++ LG G S+LSQ S N +C+ + G+ LG SS
Sbjct: 265 -GSFDARAAGIMALGGGPESLLSQTAS--RYGNAFSYCIPATASDSGFFTLGVPRRASSR 321
Query: 241 IAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIK----GLQIIFDSGSSYTYFNSQAYKTT 295
TPM R Y + GG+ G+ + DS ++ T AY+
Sbjct: 322 YVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQAL 381
Query: 296 LDLMRKDL 303
R +
Sbjct: 382 RSAFRSSM 389
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 73/269 (27%), Positives = 121/269 (44%), Gaps = 29/269 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G + +++ IG PP DTGSDLTW QC PC C + +++P+ + V+C
Sbjct: 88 GEFLMSIFIGTPPVNVIAIADTGSDLTWTQC-LPCRECFNQSQPIFNPRRSSSYRKVSCA 146
Query: 117 DPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C + E+ C + Q C Y Y D + G L +D ++T GS P+ + G
Sbjct: 147 SDTCRSL---ESYHCGPDLQSCSYGYSYGDRSFTYGDLASD----QITIGSFKLPKTVIG 199
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-----SVRGGGYLF 230
CG+ +N G T+G++GLG G S++SQ++++ + +CL + G +
Sbjct: 200 CGH--QNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTIS 257
Query: 231 LGHDLVPS-SGIAWTPMSRDLLEKHYSSGPAELLFGGK----STGIKGL----QIIFDSG 281
G V S + TP+ + Y + G K + GI + II DSG
Sbjct: 258 FGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSG 317
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
++ T Y + + +K K ++D
Sbjct: 318 TTLTLLPRSLYYGVFSTLARVIKAKRVDD 346
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/155 (33%), Positives = 68/155 (43%), Gaps = 12/155 (7%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y V L +G P + L +DTGSDL W QC APC C + P + + C
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQC-APCRDCFDQDLPVLDPAASSTYAALPCGAA 142
Query: 119 FCSAFHLPE-NIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNG---SLLGPRLI 173
C A +R N + C Y Y D ++G + TD F + G SL RL
Sbjct: 143 RCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRLT 202
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
FGCG+ N G G+ G G G+ S+ SQL
Sbjct: 203 FGCGH--LNKGVFQSNETGIAGFGRGRWSLPSQLN 235
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 77/168 (45%), Gaps = 19/168 (11%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFC 120
V + +G PP+ + + D +D TW+QC PC C P+S++ P + L++C C
Sbjct: 189 VQIGVGGPPQKFYMIFDLQTDFTWLQCQ-PCIKCYDQPDSIFDPSQSSSYTLLSCETKHC 247
Query: 121 SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQ 180
+L N C + C Y + Y D ++ GVL+ + S R+ GC +
Sbjct: 248 ---NLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFE---SSGWVDRVSLGC--SN 299
Query: 181 RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY 228
+N GP + G GLG G S S++ + ++ +CL GY
Sbjct: 300 KNQGPF-VGSDGTFGLGRGSLSFPSRINASSMS-----YCLVESKDGY 341
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 60/132 (45%), Gaps = 8/132 (6%)
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL-- 112
G++ G Y VT+ +G P K + L DTGSDLTW QC C E++++P +
Sbjct: 145 GSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSY 204
Query: 113 --VACNDPFCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
++C C + NI A+ C Y + Y D S+G + L T+ +
Sbjct: 205 ANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATD---VF 261
Query: 170 PRLIFGCGYNQR 181
FGCG N +
Sbjct: 262 NDFYFGCGQNNK 273
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 70/160 (43%), Gaps = 19/160 (11%)
Query: 97 GCTLPPE--------SLYHPK----NNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYA 144
GCT P+ +LY P +N V C D FC+ + C+ + C Y + Y
Sbjct: 32 GCTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYG 91
Query: 145 DHGSSLGVLVTDHFPLRLTNGSLL----GPRLIFGCGYNQRNPGPKPPPTA--GVLGLGL 198
D ++ G V D +G+L +IFGCG Q A G++G G
Sbjct: 92 DGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQ 151
Query: 199 GKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVP 237
+S+LSQL + G + + HCL S GGG +G + P
Sbjct: 152 ANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEP 191
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 75/273 (27%), Positives = 116/273 (42%), Gaps = 26/273 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTL--PPES-LYHPKNNLVACNDPF 119
Y +T+ IG+P + IDTGSD++WV+CN+ G TL P +S Y P +C+
Sbjct: 129 YVITVGIGSPAVTQTMMIDTGSDVSWVRCNS-TDGLTLFDPSKSTTYAP----FSCSSAA 183
Query: 120 CSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYN 179
C+ L N +N C Y V Y D ++ G +D L ++ FGC ++
Sbjct: 184 CA--QLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLALSASDTV---TDFHFGCSHH 238
Query: 180 QRN-PGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGHDLV 236
+ + G K G++GLG S++S Q+ +CL + R G+L G
Sbjct: 239 EEDFDGEK---IDGLMGLGGDAQSLVS--QTAATYGKSFSYCLPPTNRTSGFLTFGAPNG 293
Query: 237 PSSGIAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIKGLQI----IFDSGSSYTYFNSQA 291
S G TPM R Y ++ GG GI+ + + DSG+ T+ +A
Sbjct: 294 TSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLSNGSVMDSGTVITWLPRRA 353
Query: 292 YKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
Y R + + A L C+ T
Sbjct: 354 YSALSSAFRSSMTRLRHQRAAPLGILDTCYDFT 386
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 85/304 (27%), Positives = 112/304 (36%), Gaps = 51/304 (16%)
Query: 63 YSVTLKIGNP-PKLYELDIDTGSDLTWVQCNAPCTGCTLPP----ESLYHPKNNLVACND 117
Y + L IG P + L +DTGSD+ W QC PC C P ++ VAC+D
Sbjct: 92 YLIHLSIGAPRSQPVVLTLDTGSDVVWTQCE-PCAECFTQPLPRFDTAASNTVRSVACSD 150
Query: 118 PFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLR--LTNGSLLGPRLIFG 175
P C+A H C Y Y D S G + D F G + P + FG
Sbjct: 151 PLCNA-HSEHGCFLHG---CTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFG 206
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR----------G 225
CG N G G+ G G G S+ SQL+ +C + R G
Sbjct: 207 CG--MYNAGRFLQTETGIAGFGRGPLSLPSQLKV-----RQFSYCFTTRFEAKSSPVFLG 259
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDL----LEKHYSSGPAELLFGGKSTGIKGLQI--IFD 279
G H P I TP R L HY L F G + G L + I
Sbjct: 260 GAGDLKAHATGP---ILSTPFVRSLPPGTDNSHYV-----LSFKGVTVGKTRLPVPEIKA 311
Query: 280 SGSSYTYFNSQAYKTTL-DLMRKDLKGK-------PLEDTAEEKALPVCWKGTWKCLLGN 331
GS T+ +S TT D + + LK P+ TA+E + W G +
Sbjct: 312 DGSGATFIDSGTDITTFPDAVFRQLKSAFIAQAALPVNKTADEDDICFSWDGKKTAAMPK 371
Query: 332 FEWH 335
+H
Sbjct: 372 LVFH 375
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 73/285 (25%), Positives = 107/285 (37%), Gaps = 33/285 (11%)
Query: 46 GSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL 105
GS A+F GN +Y+ + IG P + + +D GSDL WV C+ C C S
Sbjct: 93 GSQALF--FGNELDWLHYT-WIDIGTPNVSFLVALDAGSDLLWVPCD--CIQCAPLSASY 147
Query: 106 YH---------------PKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYAD--HGS 148
Y+ + ++C+ C +N + D C Y Y D + +
Sbjct: 148 YNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPK----DPCPYIFNYDDFENTT 203
Query: 149 SLGVLVTDHFPLRL----TNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASIL 204
S G LV D L T +L ++ GCG Q GV+GLG G S+
Sbjct: 204 SAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVP 263
Query: 205 SQLQSLGLTRNVLGHCLSVRGGGYLFLG-HDLVPSSGIAWTPMSRDLLEKHYSSGPAELL 263
S L GL +N C G + G + P+ + Y G
Sbjct: 264 SLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVA--YFVGVESYC 321
Query: 264 FGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
G G + + DSGSS+TY S+ Y + K + K +
Sbjct: 322 VGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRI 366
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 86/192 (44%), Gaps = 19/192 (9%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCT-GCTLPPESLYHP----KNNLVACNDPFCS 121
L +G PP+ + S +WV C++ C CT SL+ P + + C P CS
Sbjct: 3 LSLGTPPQPLNFTLAVDSGFSWVACSSSCAINCT--TASLFQPGLSTSHTKLPCGSPSCS 60
Query: 122 AFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQR 181
AF + C + C Y Y + SS G LV+D + + L GCG R
Sbjct: 61 AFS-AVSTSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLSLGCG---R 116
Query: 182 NPGP--KPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGH----D 234
+ G + T+G +G G S + QL +LG R+ +CL S G L +G+ +
Sbjct: 117 DSGGLLELLDTSGFVGFDKGNVSFMGQLSALGY-RSKFIYCLPSDTFRGKLVIGNYKLRN 175
Query: 235 LVPSSGIAWTPM 246
SS +A+TPM
Sbjct: 176 ASISSSMAYTPM 187
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 73/265 (27%), Positives = 109/265 (41%), Gaps = 47/265 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNNL----VAC 115
G Y VT+ IG P L DTGSDLTW QC PC G C E ++P ++ V+C
Sbjct: 130 GNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCE-PCLGSCYSQKEPKFNPSSSSTYQNVSC 188
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
+ P C E+ + C Y + Y D + G L + F LTN +L + FG
Sbjct: 189 SSPMC------EDAESCSASNCVYSIGYGDKSFTQGFLAKEKF--TLTNSDVL-EDVYFG 239
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLFLG 232
CG N + G+ L + Q+ N+ +CL + G+L G
Sbjct: 240 CGENNQGLFDGVAGLLGLGPGKLSLPA-----QTTTTYNNIFSYCLPSFTSNSTGHLTFG 294
Query: 233 HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFG----GKSTGIKGLQI----------IF 278
+ S + +TP+ SS P+ +G G S G K L I I
Sbjct: 295 SAGI-SESVKFTPI---------SSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAII 344
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDL 303
DSG+ +T ++ Y + ++ +
Sbjct: 345 DSGTVFTRLPTKVYAELRSVFKEKM 369
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 64/243 (26%), Positives = 108/243 (44%), Gaps = 31/243 (12%)
Query: 6 KRVMGLLVLLMFATFQGCFSEANQP---PSKKKSTQSTAAHRFGSTAVFPITGNVYPLGY 62
+ ++ L + +F + CFS P P + ++ + R S + TG + L +
Sbjct: 5 RLLVQLFISFIFLRSKQCFSSNQSPIILPLRIQNNHHISTRRLFSNSSSKTTGKL--LFH 62
Query: 63 YSVTLK----IGNPPKLYELDIDTGSDLTWVQCNAP--CTGCTLPPESLYHPKNNLVACN 116
++VTL IG PP+ + +DTGS+L+W++C T P S + K + C+
Sbjct: 63 HNVTLTASLTIGTPPQNITMVLDTGSELSWLRCKKEPNFTSIFNPLASKTYTK---IPCS 119
Query: 117 DPFC----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRL 172
C S LP + C+ C + + YAD S G L + F GSL P
Sbjct: 120 SQTCKTRTSDLTLP--VTCDPAKLCHFIISYADASSVEGHLAFETFRF----GSLTRPAT 173
Query: 173 IFGC-GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-VRGGGYLF 230
+FGC + + T G++G+ G S ++Q +G + +C+S + G+L
Sbjct: 174 VFGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQ---MGFRK--FSYCISGLDSTGFLL 228
Query: 231 LGH 233
LG
Sbjct: 229 LGE 231
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 62/133 (46%), Gaps = 13/133 (9%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
I+G G Y L +G P + + +DTGSD+ W+QC APC C + ++ P +
Sbjct: 135 ISGLAQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWIQC-APCIKCYSQTDPVFDPTKSR 193
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
+ C P C P C Q C Y+V Y D ++G T+ R G+
Sbjct: 194 SFANIPCGSPLCRRLDYPG---CSTKKQICLYQVSYGDGSFTVGEFSTETLTFR---GTR 247
Query: 168 LGPRLIFGCGYNQ 180
+G R++ GCG++
Sbjct: 248 VG-RVVLGCGHDN 259
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 103/267 (38%), Gaps = 30/267 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACN 116
G Y + +G P + + +DTGSD+TW+QC PC+ C + +Y+P LV C
Sbjct: 143 GEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCE-PCSDCYQQSDPIYNPALSSSYKLVGCQ 201
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C + C N C Y+V Y D + G T+ LT G + GC
Sbjct: 202 ANLCQQLDVSG---CSRNGSCLYQVSYGDGSYTQGNFATET----LTLGGAPLQNVAIGC 254
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFLGH 233
G++ G+ G L S L+ + +CL R L G
Sbjct: 255 GHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENG-----KIFSYCLVDRDSESSSTLQFGR 309
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK----STGIKGLQ------IIFDSGSS 283
VP+ + + L+ Y + + GGK S + G+ +I DSG++
Sbjct: 310 AAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTA 369
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKPLED 310
T + AY + D R K P D
Sbjct: 370 VTRLQTAAYDSLRDAFRAGTKNLPSTD 396
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 77/283 (27%), Positives = 114/283 (40%), Gaps = 30/283 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKNNL----VACN 116
Y VTL IG P + IDTGSDL+WVQC PC C + L+ P ++ V C+
Sbjct: 118 YVVTLGIGTPAVQQIVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 176
Query: 117 DPFC---SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
C +A A C+Y + Y + ++ GV T+ L+ ++
Sbjct: 177 SDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETLTLKP---GVVVADFG 233
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFL 231
FGCG +Q P K G+LGLG S++SQ S +CL + G G+L L
Sbjct: 234 FGCGDHQHGPYEK---FDGLLGLGGAPESLVSQTSS--QFGGPFSYCLPPTSGGAGFLAL 288
Query: 232 GH-----DLVPSSGIAWTPMSR-DLLEKHYSSGPAELLFGGKSTGIK----GLQIIFDSG 281
G ++G +TPM R + Y + GG + ++ DSG
Sbjct: 289 GAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAFSSGMVIDSG 348
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
+ T + AY R + L + L C+ T
Sbjct: 349 TVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCYDFT 391
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 109/264 (41%), Gaps = 31/264 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y + L IG PP + DTGSDLTW QC PC C +Y + P SA
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPIYDTAVSSSFSPVPCASA 151
Query: 123 FHLP--ENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYN 179
LP + C A+ C Y Y D S GVL T+ G +G + FGCG +
Sbjct: 152 TCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVG-GIAFGCGVD 210
Query: 180 QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLF--LGHDLVP 237
G + G +GLG G S+++QL + + + G LF L P
Sbjct: 211 N---GGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGALAELAAP 267
Query: 238 SSGIAWTPMSRDLLEKHY------------SSGPAEL-----LFGGKSTGIKGLQIIFDS 280
S+G A S L++ Y S G A L F + G G+ I DS
Sbjct: 268 STGAAV--QSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGM--IVDS 323
Query: 281 GSSYTYFNSQAYKTTLDLMRKDLK 304
G+++T+ A++ +D + L+
Sbjct: 324 GTTFTFLVESAFRVVVDHVAGVLR 347
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 96/244 (39%), Gaps = 22/244 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y VT+ +G P K + L DTGSD+TW QC C E +P + ++C+
Sbjct: 129 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCS 188
Query: 117 DPFCSAFHLPENI-RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C + + ++ C Y+V Y D S+G T+ L +N + +FG
Sbjct: 189 SALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN---VFKNFLFG 245
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGH 233
CG G+ L S Q+ + + +CL S GYL LG
Sbjct: 246 CGQQNNGLFGGAAGLLGLGRTKLALPS-----QTAKTYKKLFSYCLPASSSSKGYLSLGG 300
Query: 234 DLVPSSGIAWTPMSRDLLEK-HYSSGPAELLFGGKSTGIK----GLQIIFDSGSSYTYFN 288
+ S + +TP+S D Y L GG+ I + DSG+ T +
Sbjct: 301 QV--SKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLS 358
Query: 289 SQAY 292
AY
Sbjct: 359 PTAY 362
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 114/281 (40%), Gaps = 18/281 (6%)
Query: 55 GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN----APCTGC--TLPPE-SLYH 107
GN + YY+ + +G P + + +DTGSDL WV C+ AP G TL + +Y
Sbjct: 136 GNDFGWLYYT-WVDVGTPNTSFMVALDTGSDLFWVPCDCIECAPLAGYRETLDRDLGIYK 194
Query: 108 PKNNLVACNDPFCSAFHLPENIRCEANDQ-CDYEVLY-ADHGSSLGVLVTD--HFPLRLT 163
P + + + P CS P C + Q C Y Y ++ +S G+L+ D H R +
Sbjct: 195 PAESTTSRHLP-CSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRES 253
Query: 164 NGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV 223
+ + ++ GCG Q G+LGLG+ S+ S L GL RN C
Sbjct: 254 HAPVKA-SVVIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKE 312
Query: 224 RGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSS 283
G F + + P+ + Y+ + G K + + DSG+S
Sbjct: 313 DSGRIFFGDQGVSIQQSTPFVPLYGKY--QTYAVNVDKSCVGHKCFEATSFEALVDSGTS 370
Query: 284 YTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
+T YK K + + T E+ + C+ +
Sbjct: 371 FTALPLNVYKAVAVEFDKQVHAPRI--TQEDASFEYCYSAS 409
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 66/131 (50%), Gaps = 13/131 (9%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
++G G Y + + IG PP + +DTGSD++W+QC APC+ C + ++ P ++
Sbjct: 139 VSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQC-APCSECYQQSDPIFDPVSSN 197
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
+ C+ P C + L E C N C YEV Y D ++G T+ +T G+
Sbjct: 198 SYSPIRCDAPQCKSLDLSE---CR-NGTCLYEVSYGDGSYTVGEFATE----TVTLGTAA 249
Query: 169 GPRLIFGCGYN 179
+ GCG+N
Sbjct: 250 VENVAIGCGHN 260
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 96/244 (39%), Gaps = 22/244 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y VT+ +G P K + L DTGSD+TW QC C E +P + ++C+
Sbjct: 69 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCS 128
Query: 117 DPFCSAFHLPENI-RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C + + ++ C Y+V Y D S+G T+ L +N + +FG
Sbjct: 129 SALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN---VFKNFLFG 185
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGH 233
CG G+ L S Q+ + + +CL S GYL LG
Sbjct: 186 CGQQNNGLFGGAAGLLGLGRTKLALPS-----QTAKTYKKLFSYCLPASSSSKGYLSLGG 240
Query: 234 DLVPSSGIAWTPMSRDLLEK-HYSSGPAELLFGGKSTGIK----GLQIIFDSGSSYTYFN 288
+ S + +TP+S D Y L GG+ I + DSG+ T +
Sbjct: 241 QV--SKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSGTVITRLS 298
Query: 289 SQAY 292
AY
Sbjct: 299 PTAY 302
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 96/244 (39%), Gaps = 22/244 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y VT+ +G P K + L DTGSD+TW QC C E +P + ++C+
Sbjct: 117 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCS 176
Query: 117 DPFCSAFHLPENI-RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C + + ++ C Y+V Y D S+G T+ L +N + +FG
Sbjct: 177 SALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN---VFKNFLFG 233
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLGH 233
CG G+ L S Q+ + + +CL S GYL LG
Sbjct: 234 CGQQNNGLFGGAAGLLGLGRTKLALPS-----QTAKTYKKLFSYCLPASSSSKGYLSLGG 288
Query: 234 DLVPSSGIAWTPMSRDLLEK-HYSSGPAELLFGGKSTGIK----GLQIIFDSGSSYTYFN 288
+ S + +TP+S D Y L GG+ I + DSG+ T +
Sbjct: 289 QV--SKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLS 346
Query: 289 SQAY 292
AY
Sbjct: 347 PTAY 350
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 56/125 (44%), Gaps = 13/125 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y L +G PPK + +DTGSD+ W+QC PCT C + ++ P + + C
Sbjct: 128 GEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCK-PCTKCYSQTDQIFDPSKSKSFAGIPCY 186
Query: 117 DPFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
P C P C N+ C Y+V Y D + G T+ R PR+ G
Sbjct: 187 SPLCRRLDSPG---CSLKNNLCQYQVSYGDGSFTFGDFSTETLTFRRA----AVPRVAIG 239
Query: 176 CGYNQ 180
CG++
Sbjct: 240 CGHDN 244
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 66/249 (26%), Positives = 100/249 (40%), Gaps = 33/249 (13%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPES--------------- 104
L Y +VT IG P + + + +DTGSDL W+ CN T C E+
Sbjct: 110 LHYANVT--IGTPAQWFLVALDTGSDLFWLPCNCNST-CVRSMETDQGETHMNAQRIRLN 166
Query: 105 LYHP----KNNLVACNDPFCSAFHLPENIRCEAN-DQCDYEVLYADHGS-SLGVLVTDHF 158
+Y+P ++ V CN C+ + RC + C Y + Y GS S GVLV D
Sbjct: 167 IYNPSISTSSSKVTCNSTLCALRN-----RCISPLSDCPYRIRYLSPGSKSTGVLVEDVI 221
Query: 159 PLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLG 218
+ G R+ FGC Q + G++GL + ++ + L G+ +
Sbjct: 222 HMSTEEGEARDARITFGCSETQLGLF-QEVAVNGIMGLAMADIAVPNMLVKAGVASDSFS 280
Query: 219 HCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIF 278
C G G + G SS TP+ + Y + G + K IF
Sbjct: 281 MCFGPNGKGTISFGDK--GSSDQHETPLGGTISPLFYDVSITKFKVGKVTVETK-FSAIF 337
Query: 279 DSGSSYTYF 287
DSG++ T+
Sbjct: 338 DSGTAVTWL 346
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 120/284 (42%), Gaps = 34/284 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y + + +G PP+ L +DTGSD+ W+QC APC C + ++ P + + CN
Sbjct: 35 GEYFIRVSVGTPPRGMYLVMDTGSDILWLQC-APCVSCYHQCDEVFDPYKSSTYSTLGCN 93
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN--GSLLGPRLIF 174
C L ++ ++C Y+V Y D S G TD L T+ G ++ ++
Sbjct: 94 SRQC----LNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPL 149
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG-----GYL 229
GCG++ AG+LGLG G S +Q+ S R +CL+ R L
Sbjct: 150 GCGHDNEG---YFVGAAGLLGLGKGPLSFPNQINSENGGR--FSYCLTGRDTDSTERSSL 204
Query: 230 FLGHDLVPSSGIAWTPMSRDL-LEKHYSSGPAELLFGGKSTGI--KGLQ--------IIF 278
G VP +G+ +TP + +L + Y + GG I Q +I
Sbjct: 205 IFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVII 264
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
DSG+S T + AY + + R L T E C+
Sbjct: 265 DSGTSVTRLQNAAYASLREAFRAGTSDLVL--TTEFSLFDTCYN 306
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 58/202 (28%), Positives = 88/202 (43%), Gaps = 27/202 (13%)
Query: 37 TQSTAAHRFGSTAVFPITGNVY----PLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
+++ A S+A P++ Y P+ Y + L IG PP+ +L +DTGSDL W QC
Sbjct: 61 SKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSDLVWTQCQ 120
Query: 93 APCTGC---TLPPESLYHPKNNLVACNDPFCSAFHL---PENIRC--EANDQCDYEVLYA 144
PC C +LP Y+ + P C + P C + C + Y
Sbjct: 121 -PCAVCFNQSLP----YYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAFSYSYG 175
Query: 145 DHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASIL 204
D +++G L D + G+ + P ++FGCG N N G G+ G G G S+
Sbjct: 176 DKSATIGFL--DVETVSFVAGASV-PGVVFGCGLN--NTGIFRSNETGIAGFGRGPLSLP 230
Query: 205 SQLQSLGLTRNVLGHCLSVRGG 226
SQL+ + HC + G
Sbjct: 231 SQLKVGNFS-----HCFTAVSG 247
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 71/154 (46%), Gaps = 17/154 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y VT+++G + IDTGSDLTWVQC PC C ++ P + + CN
Sbjct: 145 YIVTMELGGQD--MTVIIDTGSDLTWVQCE-PCMSCYNQQGPVFKPSTSSSYQSIPCNSS 201
Query: 119 FCSAFHLPENI--RCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C + L CE+N C Y V Y D + G L +H L+ G + +FG
Sbjct: 202 TCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEH----LSFGGISVSNFVFG 257
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS 209
CG N + +G++GLG S++SQ S
Sbjct: 258 CGKNNKGLFGG---VSGLMGLGRSNLSLISQTNS 288
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 104/272 (38%), Gaps = 33/272 (12%)
Query: 56 NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNN 111
N P Y V L IG PP+ +L +DTGSDL W QC PC C + P +
Sbjct: 75 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLS 133
Query: 112 LVACNDPFCSAFHLPE--NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
L +C+ C + + + N C Y Y D + G L D F S+
Sbjct: 134 LTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASV-- 191
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--- 226
P + FGCG N G G+ G G G S+ SQL+ + HC + G
Sbjct: 192 PGVAFGCGL--FNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFS-----HCFTAVNGLKP 244
Query: 227 --GYLFLGHDLVPS--SGIAWTPMSRD--------LLEKHYSSGPAELLFGGKSTGIKGL 274
L L DL S + TP+ ++ L K + G L +K
Sbjct: 245 STVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNG 304
Query: 275 Q--IIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
I DSG++ T ++ Y+ D +K
Sbjct: 305 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVK 336
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 74/262 (28%), Positives = 115/262 (43%), Gaps = 32/262 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACN 116
G Y + + +G PP+ L +DTGSD+ W+QC APC C ++++ P + + C+
Sbjct: 56 GEYFIRISVGTPPRRMYLVMDTGSDILWLQC-APCVNCYHQSDAIFDPYKSSTYSTLGCS 114
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN--GSLLGPRLIF 174
C + C+AN +C Y+V Y D + G TD L T+ G ++ ++
Sbjct: 115 TRQCLNLDIGT---CQAN-KCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPL 170
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR-----GGGYL 229
GCG++ AG+LGLG G S +Q+ R +CL+ R G L
Sbjct: 171 GCGHDNEG---YFVGAAGLLGLGKGPLSFPNQVDPQNGGR--FSYCLTDRETDSTEGSSL 225
Query: 230 FLGHDLVPSSGIAWTPMSRDL-LEKHYSSGPAELLFGGKSTGI--KGLQ--------IIF 278
G VP +G +TP ++ + Y + GG I Q +I
Sbjct: 226 VFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVII 285
Query: 279 DSGSSYTYFNSQAYKTTLDLMR 300
DSG+S T + AY + D R
Sbjct: 286 DSGTSVTRLQNAAYASLRDAFR 307
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 59/127 (46%), Gaps = 8/127 (6%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + IG+P + + +DTGSD+TW+QC APC C + L+ P + V C+
Sbjct: 194 GEYFSRIGIGSPARQLYMVLDTGSDVTWLQC-APCADCYAQSDPLFDPALSSSYATVPCD 252
Query: 117 DPFCSAFHLP--ENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
P C A N N C YEV Y D ++G T+ L +GS +
Sbjct: 253 SPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLG-GDGSAAVHDVAI 311
Query: 175 GCGYNQR 181
GCG++
Sbjct: 312 GCGHDNE 318
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 74/302 (24%), Positives = 123/302 (40%), Gaps = 66/302 (21%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP--CTGCT-------------LPPESL 105
G YSV+L G PP+ +DTGSD+ W C + C C+ +P ES
Sbjct: 65 GGYSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKES- 123
Query: 106 YHPKNNLVACNDPFCSAFHLPENIRCEA--------NDQCDYEVLYADHGSSLGVLVTDH 157
+ L+ C +P CS H NI C+ N C +++ G++ GV +++
Sbjct: 124 --SSSKLLGCKNPKCSWIH-HSNINCDQDCSIKSCLNQTCPPYMIFYGSGTTGGVALSET 180
Query: 158 FPLRLTNGSLLGPRLIFGCG-YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNV 216
L SL P + GC ++ P AG+ G G G +S+ SQL + +
Sbjct: 181 LHLH----SLSKPNFLVGCSVFSSHQP-------AGIAGFGRGLSSLPSQLGLGKFSYCL 229
Query: 217 LGHCL--SVRGGGYLFLGHDLVPS----SGIAWTPM-------SRDLLEKHYSSGPAELL 263
L H + L L + + S + + +TP ++ +Y G +
Sbjct: 230 LSHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRIT 289
Query: 264 FGGKSTGI--KGLQ--------IIFDSGSSYTYFNSQAYKTTLDLMRKDLKG----KPLE 309
GG + K L +I DSG+++T+ +A++ D + +K K +E
Sbjct: 290 VGGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIE 349
Query: 310 DT 311
D
Sbjct: 350 DA 351
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 87/348 (25%), Positives = 135/348 (38%), Gaps = 74/348 (21%)
Query: 29 QPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTW 88
+P S+ +TA+ + P++ Y G YSV+L G P + DTGS L W
Sbjct: 61 KPDEDALSSTTTAS---ATVVKSPLSAKSY--GGYSVSLSFGTPSQTIPFVFDTGSSLVW 115
Query: 89 VQCNAP--CTGC-------TLPPESLYHPKNN----LVACNDPFCSAFHLPENIRCEAND 135
+ C + C+GC TL P + PKN+ ++ C P C + P N++C D
Sbjct: 116 LPCTSRYLCSGCDFSGLDPTLIPR--FIPKNSSSSKIIGCQSPKCQFLYGP-NVQCRGCD 172
Query: 136 Q--------CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG-YNQRNPGPK 186
C +L GS+ GVL+T+ +L L P + GC + R P
Sbjct: 173 PNTRNCTVGCPPYILQYGLGSTAGVLITE----KLDFPDLTVPDFVVGCSIISTRQP--- 225
Query: 187 PPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDL----------- 235
AG+ G G G S+ SQ+ L R HCL R + DL
Sbjct: 226 ----AGIAGFGRGPVSLPSQMN---LKR--FSHCLVSRRFDDTNVTTDLDLDTGSGHNSG 276
Query: 236 VPSSGIAWTP------MSRDLLEKHYSSGPAELLFGGKSTGIKGLQI----------IFD 279
+ G+ +TP +S ++Y + G K I + I D
Sbjct: 277 SKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVD 336
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLE-DTAEEKALPVCWKGTWK 326
SGS++T+ ++ + + E D +E L C+ + K
Sbjct: 337 SGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCFNISGK 384
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 80/292 (27%), Positives = 116/292 (39%), Gaps = 45/292 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC-TGCTLPPESLYHPKN----NLVAC 115
G Y +TL IG PP Y DTGSDL W QC APC T C P LY+P + +++ C
Sbjct: 110 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASSTTFSVLPC 168
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIF 174
N C Y Y G + GV ++ F + P + F
Sbjct: 169 NSSLSMCAGALAGAAPPPGCACMYNQTYG-TGWTAGVQGSETFTFGSSAADQARVPGVAF 227
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS----VRGGGYLF 230
GC + +AG++GLG G S++SQ LG R +CL+ L
Sbjct: 228 GCSNASSSDWNG---SAGLVGLGRGSLSLVSQ---LGAGR--FSYCLTPFQDTNSTSTLL 279
Query: 231 LG-HDLVPSSGIAWTPM----SRDLLEKHYSSGPAELLFGGKSTGIKGLQI--------- 276
LG + +G+ TP +R + +Y L G S G K L I
Sbjct: 280 LGPSAALNGTGVRSTPFVASPARAPMSTYY-----YLNLTGISLGAKALPISPGAFSLKP 334
Query: 277 ------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
I DSG++ T + AY+ ++ + P D ++ L +C+
Sbjct: 335 DGTGGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFA 386
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 79/176 (44%), Gaps = 12/176 (6%)
Query: 80 IDTGSDLTWVQCN-APCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPENIRCEAN 134
+DT SD+TWVQC+ P C + LY P + + +CN P C+ P C N
Sbjct: 173 LDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLG-PYANGCTNN 231
Query: 135 DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVL 194
+QC Y V Y D S+ G ++D L +T + + FGC + + AG++
Sbjct: 232 NQCQYRVRYPDGTSTAGTYISDL--LTITPATAVR-SFQFGCSHGVQGSFSFGSSAAGIM 288
Query: 195 GLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRD 249
LG G S++S Q+ V HC G+ LG V + TPM ++
Sbjct: 289 ALGGGPESLVS--QTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKN 342
>gi|297838267|ref|XP_002887015.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297332856|gb|EFH63274.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 324
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 72/158 (45%), Gaps = 34/158 (21%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----------VA 114
++L IG PP+ ++ +DTGS L+W+QC+ LPP+ PK + +
Sbjct: 76 ISLPIGTPPQAQQMVLDTGSQLSWIQCHRK----KLPPK----PKTSFDPSLSSSFSTLP 127
Query: 115 CNDPFCSA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
C+ P C F LP + C++N C Y YAD + G LV + T + P
Sbjct: 128 CSHPLCKPRIPDFTLPTS--CDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE---ITP 182
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQ 208
LI GC + G+LG+ G+ S +SQ +
Sbjct: 183 PLILGCATESSD-------DRGILGMNRGRLSFVSQAK 213
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 63/120 (52%), Gaps = 13/120 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKN----NLVACND 117
+ VT+ G+P + Y L IDTGSD++W+QC PC+G C + ++ P + V C
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQC-LPCSGHCYKQHDPVFDPTKSATYSAVPCGH 219
Query: 118 PFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
P C+A +C + C Y+V Y D S+ GVL H L L++ L P FGCG
Sbjct: 220 PQCAA----AGGKCSNSGTCLYKVTYGDGSSTAGVL--SHETLSLSSTRDL-PGFAFGCG 272
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 63/127 (49%), Gaps = 9/127 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y ++ +G PP +DTGSD+ W+QC PC C ++ P + + C+
Sbjct: 92 GEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQ-PCEDCYNQTTPIFDPSQSKTYKTLPCS 150
Query: 117 DPFCSAFHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIF 174
C + + C +N D+C+Y + Y D+ S G L + L T+GS + P+ +
Sbjct: 151 SNICQSVQSAAS--CSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVI 208
Query: 175 GCGYNQR 181
GCG+N +
Sbjct: 209 GCGHNNK 215
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 84/184 (45%), Gaps = 28/184 (15%)
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPENIR----- 130
+DT S+LTWVQC APC C L+ P ++ V C+ P C A
Sbjct: 158 VDTASELTWVQC-APCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGA 216
Query: 131 --CEAND--QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPK 186
C+A C Y + Y D S GVL D RL+ + +FGCG + N GP
Sbjct: 217 PPCDAGRPAACSYALSYRDGSYSRGVLAHD----RLSLAGEVIDGFVFGCGTS--NQGPP 270
Query: 187 PPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR----GGGYLFLGHDLVPSSGIA 242
T+G++GLG + S++S Q++ V +CL + G L LG D PS+
Sbjct: 271 FGGTSGLMGLGRSQLSLVS--QTVDQFGGVFSYCLPLSRESDASGSLVLGDD--PSAYRN 326
Query: 243 WTPM 246
TP+
Sbjct: 327 STPV 330
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 73/271 (26%), Positives = 106/271 (39%), Gaps = 16/271 (5%)
Query: 52 PITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN 111
PI + G Y + +G P DTGSDL+W+QC PC C L+ P +
Sbjct: 77 PIITPIPDHGEYLMRFSLGTPSVERLAIFDTGSDLSWLQC-TPCKTCYPQEAPLFDPTQS 135
Query: 112 L----VACNDPFCSAFHLPENIR-CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT--- 163
V C C+ F P+N R C ++ QC Y Y ++G L D T
Sbjct: 136 STYVDVPCESQPCTLF--PQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMG 193
Query: 164 NGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLS 222
G P+ +FGC + G +GLG G S+ SQL +G + S
Sbjct: 194 QGGATFPKSVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFS 253
Query: 223 VRGGGYLFLGHDLVPSSGIAWTP-MSRDLLEKHYSSGPAELLFGGKS--TGIKGLQIIFD 279
G L G + P++ + TP M +Y + G K TG G II D
Sbjct: 254 STSTGKLKFG-SMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIID 312
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
S T+ Y + +++ + + ED
Sbjct: 313 SVPILTHLEQGIYTDFISSVKEAINVEVAED 343
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 104/272 (38%), Gaps = 33/272 (12%)
Query: 56 NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNN 111
N P Y V L IG PP+ +L +DTGSDL W QC PC C + P +
Sbjct: 75 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLS 133
Query: 112 LVACNDPFCSAFHLPE--NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
L +C+ C + + + N C Y Y D + G L D F S+
Sbjct: 134 LTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASV-- 191
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--- 226
P + FGCG N G G+ G G G S+ SQL+ + HC + G
Sbjct: 192 PGVAFGCGL--FNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFS-----HCFTAVNGLKP 244
Query: 227 --GYLFLGHDLVPS--SGIAWTPMSRD--------LLEKHYSSGPAELLFGGKSTGIKGL 274
L L DL S + TP+ ++ L K + G L +K
Sbjct: 245 STVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNG 304
Query: 275 Q--IIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
I DSG++ T ++ Y+ D +K
Sbjct: 305 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVK 336
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 53/178 (29%), Positives = 80/178 (44%), Gaps = 12/178 (6%)
Query: 78 LDIDTGSDLTWVQCN-APCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPENIRCE 132
+ +DT SD+TWVQC+ P C + LY P + + +CN P C+ P C
Sbjct: 146 MVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLG-PYANGCT 204
Query: 133 ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAG 192
N+QC Y V Y D S+ G ++D L +T + + FGC + + AG
Sbjct: 205 NNNQCQYRVRYPDGTSTAGTYISDL--LTITPATAVR-SFQFGCSHGVQGSFSFGSSAAG 261
Query: 193 VLGLGLGKASILSQLQSLGLTRNVLGHCL-SVRGGGYLFLGHDLVPSSGIAWTPMSRD 249
++ LG G S++S Q+ V HC G+ LG V + TPM ++
Sbjct: 262 IMALGGGPESLVS--QTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKN 317
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 115/274 (41%), Gaps = 54/274 (19%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + + +GNPP+ + L IDTGSDLTW+QC PC C ++ P + ++ CN
Sbjct: 85 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCK-PCKACFDQSGPVFDPSQSTSFKIIPCN 143
Query: 117 DPFCSAFHLPENIRCEANDQ------CDYEVLYADHGSSLGVLVTDHFPLRLTN--GSLL 168
C L + C N C Y Y D + G L + + L++ SL
Sbjct: 144 AAACD---LVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLE 200
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---- 224
++ GCG++ + G+LGLG G S SQL+S + ++ +CL R
Sbjct: 201 IRDMVIGCGHSNKGLFQG---AGGLLGLGQGALSFPSQLRSSPIGQS-FSYCLVDRTNNL 256
Query: 225 --------GGGYLFLGHDLVPSSGIAWTPMSR--DLLEKHYSSG-------------PAE 261
G G+ H + +TP R + +E Y G PAE
Sbjct: 257 SVSSAISFGAGFALSRH----FDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAE 312
Query: 262 LLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTT 295
F + G G I DSG++ TY N AY+
Sbjct: 313 -RFAIATNGSGG--TIIDSGTTLTYLNRDAYRAV 343
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 75/263 (28%), Positives = 116/263 (44%), Gaps = 31/263 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y VT+++G + + +DTGSDL+WVQC PC C + +++P + V C+ P
Sbjct: 135 YIVTVELGG--RKMTVIVDTGSDLSWVQCQ-PCKRCYNQQDPVFNPSTSPSYRTVLCSSP 191
Query: 119 FCSAFHLPE-NI-RCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C + N+ C +N C+Y V Y D + G L T+H L L N + + IFG
Sbjct: 192 TCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEH--LDLGNSTAVN-NFIFG 248
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV---RGGGYLFLG 232
CG RN +G++GLG S++SQ + + V +CL + G L +G
Sbjct: 249 CG---RNNQGLFGGASGLVGLGRSSLSLISQTSA--MFGGVFSYCLPITETEASGSLVMG 303
Query: 233 HD---LVPSSGIAWTPMSRDLLEKHYSSGPAELLFG-----GKSTGIKGLQIIFDSGSSY 284
+ ++ I++T M + Y + G S G G+ I DSG+
Sbjct: 304 GNSSVYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAPSFGKDGMMI--DSGTVI 361
Query: 285 TYFNSQAYKTTLDLMRKDLKGKP 307
T Y+ D K G P
Sbjct: 362 TRLPPSIYQALKDEFVKQFSGFP 384
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 104/272 (38%), Gaps = 33/272 (12%)
Query: 56 NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNN 111
N P Y V L IG PP+ +L +DTGSDL W QC PC C + P +
Sbjct: 75 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLS 133
Query: 112 LVACNDPFCSAFHLPE--NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
L +C+ C + + + N C Y Y D + G L D F S+
Sbjct: 134 LTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASV-- 191
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--- 226
P + FGCG N G G+ G G G S+ SQL+ + HC + G
Sbjct: 192 PGVAFGCGL--FNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFS-----HCFTAVNGLKP 244
Query: 227 --GYLFLGHDLVPS--SGIAWTPMSRD--------LLEKHYSSGPAELLFGGKSTGIKGL 274
L L DL S + TP+ ++ L K + G L +K
Sbjct: 245 STVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNG 304
Query: 275 Q--IIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
I DSG++ T ++ Y+ D +K
Sbjct: 305 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVK 336
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/273 (24%), Positives = 108/273 (39%), Gaps = 28/273 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQ---CNAPCTGCTLPPESLYHPKN----NLVAC 115
Y + + +G PP + IDTGS L+WVQ C C +++P N + V C
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 65
Query: 116 NDPFCSAFHLPENIR---CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRL 172
+ C+ H+ + E +D C Y + Y S+G L D L +N S+
Sbjct: 66 STEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTL-ASNRSI--DNF 122
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL--SVRGGGYL 229
IFGCG + G AG++G G S +Q+ Q T +C G L
Sbjct: 123 IFGCGEDNLYNGVN----AGIIGFGTKSYSFFNQVCQQTDYT--AFSYCFPRDHENEGSL 176
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK-----GLQIIFDSGSSY 284
+G + WT + + Y+ +++ G I I DSG++
Sbjct: 177 TIGP-YARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMTIVDSGTAD 235
Query: 285 TYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKAL 317
TY S + M K+++ K +E+ +
Sbjct: 236 TYILSPVFDALDKAMTKEMQAKGYTRGWDERRI 268
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 75/272 (27%), Positives = 113/272 (41%), Gaps = 30/272 (11%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSA 122
+ +G P Y + +DTGS LTW+QC+ C +++PK++ V C+ CS
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCS- 59
Query: 123 FHLPENI----RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
LP C +++ C Y+ Y D S+G L D ++ GS P +GCG
Sbjct: 60 -DLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSLPNFYYGCG- 113
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNVLGHCLSVRGGGYLFLGHDLVP 237
Q N G +AG++GL K S+L QL SLG + +CL P
Sbjct: 114 -QDNEGLF-GRSAGLIGLARNKLSLLYQLAPSLGYS---FTYCLPSSSSSGYLSLGSYNP 168
Query: 238 SSGIAWTPM-SRDLLEKHYSSGPAELLFGGK-----STGIKGLQIIFDSGSSYTYFNSQA 291
++TPM S L + Y + + G S+ L I DSG+ T +
Sbjct: 169 GQ-YSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSV 227
Query: 292 YKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
Y + +KG + L C+KG
Sbjct: 228 YSALSKAVAAAMKGT--SRASAYSILDTCFKG 257
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/255 (29%), Positives = 108/255 (42%), Gaps = 30/255 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G P K + DTGSDL WVQ + PCTGC+ +++ P+ + + C+
Sbjct: 53 GGYVMDISVGTPGKRFRAIADTGSDLVWVQ-SEPCTGCS--GGTIFDPRQSSTFREMDCS 109
Query: 117 DPFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLT-NGSLLGPRLIF 174
C+ LP + CE + C Y Y G + G D L T +GS P
Sbjct: 110 SQLCA--ELPGS--CEPGSSTCSYSYEYG-SGETEGEFARDTISLGTTSDGSQKFPSFAV 164
Query: 175 GCG-YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-----SVRGGGY 228
GCG N G G++GLG G S+ SQL + + +CL
Sbjct: 165 GCGMVNSGFDG-----VDGLVGLGQGPVSLTSQLSA--AIDSKFSYCLVDINSQSESSPL 217
Query: 229 LFLGHDLVPSSGIAWTPMS--RDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTY 286
LF + +GI T ++ D +Y + G++ G G II DSG++ TY
Sbjct: 218 LFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTTII-DSGTTLTY 276
Query: 287 FNSQAYKTTLDLMRK 301
S Y L M
Sbjct: 277 VPSGVYGRVLSRMES 291
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 81/184 (44%), Gaps = 24/184 (13%)
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK 109
V P+ G Y + +G P L +DT SDLTW+QC PC C ++ P+
Sbjct: 121 VAPVVSRAPTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPR 179
Query: 110 NNL----VACNDPFCSAFHLPENIRCEANDQ----CDYEVLYAD-HGS---SLGVLVTDH 157
++ + + P C A R D C Y V Y D HGS S+G LV +
Sbjct: 180 HSTSYGEMNYDAPDCQALG-----RSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEET 234
Query: 158 FPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVL 217
G + L GCG++ N G P AG+LGLG G+ SI Q+ LG +
Sbjct: 235 LTF---AGGVRQAYLSIGCGHD--NKGLFGAPAAGILGLGRGQISIPHQIAFLGYNAS-F 288
Query: 218 GHCL 221
+CL
Sbjct: 289 SYCL 292
>gi|238479902|ref|NP_001154646.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332643534|gb|AEE77055.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 350
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 66/137 (48%), Gaps = 11/137 (8%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT-LPPESLYHPKNN 111
++G G Y V L+IG PP+ L DTGSDL WV+C+A C C+ P +++ P+++
Sbjct: 74 VSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHS 132
Query: 112 L----VACNDPFCSAFHLPENI----RCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
C DP C P+ + C YE YAD + G+ + L+ +
Sbjct: 133 STFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTS 192
Query: 164 NGSLLGPRLI-FGCGYN 179
+G + + FGCG+
Sbjct: 193 SGKEARLKSVAFGCGFR 209
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/273 (24%), Positives = 108/273 (39%), Gaps = 28/273 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQ---CNAPCTGCTLPPESLYHPKN----NLVAC 115
Y + + +G PP + IDTGS L+WVQ C C +++P N + V C
Sbjct: 25 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 84
Query: 116 NDPFCSAFHLPENIR---CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRL 172
+ C+ H+ + E +D C Y + Y S+G L D L +N S+
Sbjct: 85 STEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTL-ASNRSI--DNF 141
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL--SVRGGGYL 229
IFGCG + G AG++G G S +Q+ Q T +C G L
Sbjct: 142 IFGCGEDNLYNGVN----AGIIGFGTKSYSFFNQVCQQTDYT--AFSYCFPRDHENEGSL 195
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK-----GLQIIFDSGSSY 284
+G + WT + + Y+ +++ G I I DSG++
Sbjct: 196 TIGP-YARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMTIVDSGTAD 254
Query: 285 TYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKAL 317
TY S + M K+++ K +E+ +
Sbjct: 255 TYILSPVFDALDKAMTKEMQAKGYTRGWDERRI 287
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 110/276 (39%), Gaps = 34/276 (12%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
G + V + G PP+ + L +DTGS +TW QC PC C + P +L
Sbjct: 160 GNFLVDVAFGTPPQKFTLILDTGSSITWTQCK-PCVRCLKASRRHFDPSASLT------- 211
Query: 121 SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQ 180
+ L I + Y + Y D +S+G D L ++ + P+ FGCG N
Sbjct: 212 --YSLGSCIPSTVGNT--YNMTYGDKSTSVGNYGCDTMTLEHSD---VFPKFQFGCGRN- 263
Query: 181 RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG--GGYLFLGHDLVPS 238
N G G+LGLG G+ S +SQ S + V +CL G LF S
Sbjct: 264 -NEGDFGSGADGMLGLGQGQLSTVSQTAS--KFKKVFSYCLPEEDSIGSLLFGEKATSQS 320
Query: 239 SGIAWT-----PMSRDLLEK-HYSSGPAELLFGGKSTGIKGLQI-----IFDSGSSYTYF 287
S + +T P + L E +Y ++ G K I I DSG+ T
Sbjct: 321 SSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTIIDSGTVITRL 380
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDTAEEKA--LPVCW 321
+AY +K + PL + +K L C+
Sbjct: 381 PQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCY 416
>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/307 (26%), Positives = 120/307 (39%), Gaps = 72/307 (23%)
Query: 43 HRFGSTAVFPITGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWVQC--NAPCTGC 98
H G A P+ +YP Y Y+ +L +G PP+ + +DTGS LTWV C N C C
Sbjct: 64 HHQGQAASSPVRAALYPHSYGGYAFSLSLGTPPQPLPVLLDTGSHLTWVPCTSNYQCQNC 123
Query: 99 TLPPES--LYHP---------------------KNNLVACNDPFCSAFHLPENIRCEAND 135
+ S ++HP K++L C N A +
Sbjct: 124 SAAAGSFPVFHPKSSSSSLLVSCSSPSCLWIHSKSHLSDCARDSAPCRPSTANCSATATN 183
Query: 136 QC-DYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVL 194
C Y V+Y GS+ G+LV+D LRL+ GC + +PP +G+
Sbjct: 184 VCPPYLVVYGS-GSTAGLLVSDT--LRLSPRGAASRNFAVGCSLASVH---QPP--SGLA 235
Query: 195 GLGLGKASILSQLQSLGLTRNVLGHCLSVRG-------GGYLFLGHDLVPSSGIAWTPMS 247
G G G S+ +QL N +CL R G L LG S+G A M
Sbjct: 236 GFGRGAPSVPAQLGV-----NKFSYCLLSRRFDDDAAISGELVLGAS---SAGKAKAMMQ 287
Query: 248 RDLLEKHYSSGPAELLF----------GGKSTGIKGLQI-----------IFDSGSSYTY 286
L K+ + P ++ GGKS + + I DSG+++TY
Sbjct: 288 YAPLLKNAGARPPYSVYYYLSLTGIAVGGKSVALPARALAPVSGGGGGGAIIDSGTTFTY 347
Query: 287 FNSQAYK 293
+ +K
Sbjct: 348 LDPTVFK 354
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 14/165 (8%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKNN----LVACN 116
Y +++ +G P + IDTGSD++WVQCN PC C +L+ P + V+C
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCN-PCPNPPCYAQTGALFDPAKSSTYRAVSCA 185
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C+ N N +C Y V Y D ++ G D L + ++ G FGC
Sbjct: 186 AAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKG--FQFGC 243
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
+ + + T G++GLG G S++S Q+ N +CL
Sbjct: 244 SHVESGFSDQ---TDGLMGLGGGAQSLVS--QTAAAYGNSFSYCL 283
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 124/294 (42%), Gaps = 44/294 (14%)
Query: 28 NQPPSKKKSTQSTAAHRFGSTAVFPITGNV---------YPLGY--YSVTLKIGNPPKLY 76
++ + S AA R STA P + + PLG Y V++ +G P +
Sbjct: 92 DRDQDRVDSIHRLAAARPSSTADDPSSASKGVSLPARRGVPLGTANYIVSVGLGTPKRDL 151
Query: 77 ELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACNDPFCSAFHLPENIRCE 132
+ DTGSDL+WVQC PC GC + L+ P + V C C ++ C
Sbjct: 152 LVVFDTGSDLSWVQCK-PCDGCYQQHDPLFDPSQSTTYSAVPCGAQECRRL---DSGSCS 207
Query: 133 ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRL---IFGCGYNQRNPGPKPPP 189
+ +C YEV+Y D + G L D L ++ S +L +FGCG + K
Sbjct: 208 SG-KCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGLFGK--- 263
Query: 190 TAGVLGLGLGKASILSQLQS---LGLTRNVLGHCL--SVRGGGYLFLGHDLVPSSGIAWT 244
G+ GLG + S+ SQ + G + +CL S GYL LG P++ +T
Sbjct: 264 ADGLFGLGRDRVSLASQAAAKYGAGFS-----YCLPSSSTAEGYLSLGSAAPPNA--RFT 316
Query: 245 PM-SRDLLEKHYSSGPAELLFGGKSTGI-----KGLQIIFDSGSSYTYFNSQAY 292
M +R Y + G++ + + + DSG+ T S+AY
Sbjct: 317 AMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVITRLPSRAY 370
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 84/296 (28%), Positives = 117/296 (39%), Gaps = 52/296 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G + + L +G P Y +DTGSDL W QC PC C ++ P + + C+
Sbjct: 114 GEFLMDLSVGTPALPYAAIVDTGSDLVWTQCK-PCVECFNQTTPVFDPAASSTYAALPCS 172
Query: 117 DPFCSAFHLPENIRCEANDQCD----YEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRL 172
C+ ++ Y Y D S+ GVL T+ F L P +
Sbjct: 173 SALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQK----VPGV 228
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS----VRGGGY 228
FGCG N G AG++GLG G S++SQ LG+ R +CL+ G
Sbjct: 229 AFGCG--DTNEGDGFTQGAGLVGLGRGPLSLVSQ---LGIDR--FSYCLTSLDDAAGRSP 281
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSS-------------GPAEL-----LFGGKSTG 270
L LG S+ A P L K+ S G L F + G
Sbjct: 282 LLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDG 341
Query: 271 IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK---PLEDTAEEKALPVCWKG 323
G +I DSG+S TY +AY+ +RK P D A E L +C++G
Sbjct: 342 TGG--VIVDSGTSITYLELRAYRA----LRKAFVAHMSLPTVD-ASEIGLDLCFQG 390
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 60/129 (46%), Gaps = 11/129 (8%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLP-PESLYHPKN----NLVAC 115
G Y VT+ +G P + + L DTGS +TW QC PC G P E + P N V+C
Sbjct: 133 GNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQ-PCLGSCYPQKEQKFDPTKSTSYNNVSC 191
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
+ C+ E +N C Y+++Y D S G T+ + + S + +FG
Sbjct: 192 SSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTI---SSSDVFTNFLFG 248
Query: 176 CGYNQRNPG 184
CG Q N G
Sbjct: 249 CG--QSNNG 255
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 115/272 (42%), Gaps = 34/272 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y V + IG P K L DTGS L W QC PC C P ++ P + P CS+
Sbjct: 132 YIVNVGIGTPKKEMPLIFDTGSGLIWTQCK-PCKAC-YPKVPVFDPTKSASFKGLP-CSS 188
Query: 123 FHLPENIR--CEANDQCDYEVLYADHGSSLGVLVTD-----HFPLRLTNGSLLGPRLIFG 175
L ++IR C ++ +C Y Y D+ SS G L T+ H N ++ G
Sbjct: 189 -KLCQSIRQGC-SSPKCTYLTAYVDNSSSTGTLATETISFSHLKYDFKN-------ILIG 239
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG--GYLFLGH 233
C + +G++GL S+ S Q+ + + +C+ G G+L G
Sbjct: 240 CSDQVSG---ESLGESGIMGLNRSPISLAS--QTANIYDKLFSYCIPSTPGSTGHLTFGG 294
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKG----LQIIFDSGSSYTYFNS 289
VP+ + ++P+S+ Y + GG+ I + DSG+ T
Sbjct: 295 K-VPND-VRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIASTIDSGAVLTRLPP 352
Query: 290 QAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
+AY + R+ +KG PL D ++ L C+
Sbjct: 353 KAYSALRSVFREMMKGYPLLD--QDDFLDTCY 382
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/255 (29%), Positives = 108/255 (42%), Gaps = 30/255 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G P K + DTGSDL WVQ + PCTGC+ +++ P+ + + C+
Sbjct: 53 GGYVMDISVGTPGKRFRAIADTGSDLVWVQ-SEPCTGCS--GGTIFDPRQSSTFREMDCS 109
Query: 117 DPFCSAFHLPENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN-GSLLGPRLIF 174
C+ LP + CE + C Y Y G + G D L T+ GS P
Sbjct: 110 SQLCT--ELPGS--CEPGSSACSYSYEYG-SGETEGEFARDTISLGTTSGGSQKFPSFAV 164
Query: 175 GCG-YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-----SVRGGGY 228
GCG N G G++GLG G S+ SQL + + +CL
Sbjct: 165 GCGMVNSGFDG-----VDGLVGLGQGPVSLTSQLSA--AIDSKFSYCLVDINSQSESSPL 217
Query: 229 LFLGHDLVPSSGIAWTPMS--RDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTY 286
LF + +GI T ++ D +Y + G++ G G II DSG++ TY
Sbjct: 218 LFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTTII-DSGTTLTY 276
Query: 287 FNSQAYKTTLDLMRK 301
S Y L M
Sbjct: 277 VPSGVYGRVLSRMES 291
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 60/130 (46%), Gaps = 12/130 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDP 118
Y + + +G PP+ +++ +DTGSDL W+QC APC C ++ P + + C DP
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNLTCGDP 204
Query: 119 FC-----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT--NGSLLGPR 171
C P R D C Y Y D +S G L + F + LT S
Sbjct: 205 RCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRVDG 264
Query: 172 LIFGCGYNQR 181
++FGCG+ R
Sbjct: 265 VVFGCGHRNR 274
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 63/133 (47%), Gaps = 12/133 (9%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
++G G Y + IG+PPK + +DTGSD+ WVQC APC C + ++ P +
Sbjct: 145 VSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQC-APCADCYQQADPIFEPSFSS 203
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
+ C C + + E C ND C YEV Y D ++G T+ L +GS
Sbjct: 204 SYAPLTCETHQCKSLDVSE---CR-NDSCLYEVSYGDGSYTVGDFATETITL---DGSAS 256
Query: 169 GPRLIFGCGYNQR 181
+ GCG++
Sbjct: 257 LNNVAIGCGHDNE 269
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 87/313 (27%), Positives = 127/313 (40%), Gaps = 49/313 (15%)
Query: 42 AHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTL 100
A G+T P T N G Y + L IG PP Y+ DTGSDL W QC APCT C
Sbjct: 70 AASSGATVSAP-TQNSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFR 127
Query: 101 PPESLYHPKNN----LVACNDPF--CSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLV 154
P LY+P ++ ++ CN C+A C Y V Y +S+
Sbjct: 128 QPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSV-FQG 186
Query: 155 TDHFPLRLT-NGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLT 213
++ F T G P + FGC + + G +G++GLG G+ S++SQ LG+
Sbjct: 187 SETFTFGSTPAGQSRVPGIAFGC--STASSGFNASSASGLVGLGRGRLSLVSQ---LGVP 241
Query: 214 RNVLGHCLS----VRGGGYLFLG--HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLF--- 264
+ +CL+ L LG L ++G++ TP S+ P +
Sbjct: 242 K--FSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTP-----FVASPSTAPMNTFYYLN 294
Query: 265 -GGKSTGIKGLQI---------------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPL 308
G S G L I I DSG++ T + AY+ + L P
Sbjct: 295 LTGISLGTTALSIPPDAFLLNADGTGGLIIDSGTTITLLGNTAYQ-QVRAAVVSLVTLPT 353
Query: 309 EDTAEEKALPVCW 321
D + L +C+
Sbjct: 354 TDGSAATGLDLCF 366
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 121/294 (41%), Gaps = 48/294 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNN----LVAC 115
G Y + L IG PP Y+ DTGSDL W QC APCT C P LY+P ++ ++ C
Sbjct: 90 GEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAVLPC 148
Query: 116 NDPF--CSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT-NGSLLGPRL 172
N C+A C Y V Y +S+ ++ F T G P +
Sbjct: 149 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSV-FQGSETFTFGSTPAGHARVPGI 207
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS----VRGGGY 228
FGC + + G +G++GLG G+ S++SQ LG+ + +CL+
Sbjct: 208 AFGC--STASSGFNASSASGLVGLGRGRLSLVSQ---LGVPK--FSYCLTPYQDTNSTST 260
Query: 229 LFLG--HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLF----GGKSTGIKGLQI------ 276
L LG L ++G++ TP S+ P + G S G L I
Sbjct: 261 LLLGPSASLNGTAGVSSTP-----FVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFS 315
Query: 277 ---------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
I DSG++ T + AY+ + L P D + + L +C+
Sbjct: 316 LNADGTGGLIIDSGTTITLLGNTAYQ-QVRAAVVSLVTLPTTDGSADTGLDLCF 368
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/272 (24%), Positives = 103/272 (37%), Gaps = 36/272 (13%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN--- 110
+G Y + L G PP+ + +DTGS++ W+ CN PC+GC+ + K+
Sbjct: 115 SGQAISSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCN-PCSGCSSKQQPFEPSKSSTY 173
Query: 111 NLVACNDPFCSAFHLPENIRCEAND---QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSL 167
N + C C + C +D C Y D +L ++ L+ GS
Sbjct: 174 NYLTCASQQCQLLRV-----CTKSDNSVNCSLTQRYGDQSEVDEILSSET----LSVGSQ 224
Query: 168 LGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SV 223
+FGC R + P ++G G S +SQ + L + +CL S
Sbjct: 225 QVENFVFGCSNAARGLIQRTP---SLVGFGRNPLSFVSQTAT--LYDSTFSYCLPSLFSS 279
Query: 224 RGGGYLFLGHDLVPSSGIAWTP-MSRDLLEKHYSSGPAELLFGGKSTGI----------K 272
G L LG + + + G+ +TP +S Y G + G + I
Sbjct: 280 AFTGSLLLGKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDEST 339
Query: 273 GLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
G I DSG+ T AY D R L
Sbjct: 340 GRGTIIDSGTVITRLVEPAYNAMRDSFRSQLS 371
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 72/249 (28%), Positives = 100/249 (40%), Gaps = 30/249 (12%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLP-----PE-----SLYHPKNNL 112
+ +G P + + +DTGSDL WV C+ AP T PE +
Sbjct: 109 VAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLGNLTAVDGGGGPELRQYSPSKSSTSKT 168
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLYA-DHGSSLGVLVTDHFPLR-------LTN 164
V C C N A C Y V YA + SS G LV D L
Sbjct: 169 VTCASNLCDQ----PNACATATSSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAA 224
Query: 165 GSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTR-NVLGHCLSV 223
G+ + ++FGCG Q G++GLG+ K S+ S L S G+ + N C S
Sbjct: 225 GAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPSILASTGVVKSNSFSMCFSK 284
Query: 224 RGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSS 283
G G + G S+ + TP +Y+ + G K+ + G I DSG+S
Sbjct: 285 DGLGRINFGD--TGSADQSETPFIVKSTHSYYNISITSMSVGDKNLPL-GFYAIADSGTS 341
Query: 284 YTYFNSQAY 292
+TY N AY
Sbjct: 342 FTYLNDPAY 350
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 72/249 (28%), Positives = 100/249 (40%), Gaps = 30/249 (12%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCN----APCTGCTLP-----PE-----SLYHPKNNL 112
+ +G P + + +DTGSDL WV C+ AP T PE +
Sbjct: 109 VAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLGNLTAVDGGGGPELRQYSPSKSSTSKT 168
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLYA-DHGSSLGVLVTDHFPLR-------LTN 164
V C C N A C Y V YA + SS G LV D L
Sbjct: 169 VTCASNLCDQ----PNACATATSSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAA 224
Query: 165 GSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTR-NVLGHCLSV 223
G+ + ++FGCG Q G++GLG+ K S+ S L S G+ + N C S
Sbjct: 225 GAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPSILASTGVVKSNSFSMCFSK 284
Query: 224 RGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSS 283
G G + G S+ + TP +Y+ + G K+ + G I DSG+S
Sbjct: 285 DGLGRINFGD--TGSADQSETPFIVKSTHSYYNISITSMSVGDKNLPL-GFYAIADSGTS 341
Query: 284 YTYFNSQAY 292
+TY N AY
Sbjct: 342 FTYLNDPAY 350
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/258 (26%), Positives = 109/258 (42%), Gaps = 30/258 (11%)
Query: 70 GNPPKLYE--LDIDTGSDLTWVQCNAPCT--GCTLPPESLYHPKNN----LVACNDPFCS 121
G+P + + + IDT D+ W+QC APC C + L+ P + V C P C
Sbjct: 140 GDPTVVSQQTMAIDTTVDVPWIQC-APCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACR 198
Query: 122 AFHLPENIRCE---ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
+ P C AN +C Y + Y+D ++ G +TD + +G+ FGC +
Sbjct: 199 SLG-PYGNGCSNRSANAECRYLIEYSDDRATAGTYMTDTLTI---SGTTAVRNFRFGCSH 254
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL-SVRGGGYLFLGHDLV 236
R G TAG + LG G S+L+Q +SLG N +C+ G+L +G
Sbjct: 255 AVR--GRFSDLTAGTMSLGGGAQSLLAQTARSLG---NAFSYCVPQASASGFLSIGGPAT 309
Query: 237 PSSG--IAWTPMSRDLLEKH-YSSGPAELLFGGKSTGIKGLQI----IFDSGSSYTYFNS 289
+S A TP+ R + Y ++ G+ GI + + DS + T
Sbjct: 310 TNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPPVAFSAGAVMDSSAVITQLPP 369
Query: 290 QAYKTTLDLMRKDLKGKP 307
AY+ R ++ P
Sbjct: 370 TAYRALRRAFRNAMRAYP 387
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 14/165 (8%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKNN----LVACN 116
Y +++ +G P + IDTGSD++WVQCN PC C +L+ P + V+C
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCN-PCPNPPCHAQTGALFDPAKSSTYRAVSCA 185
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C+ N N +C Y V Y D ++ G D L + ++ G FGC
Sbjct: 186 AAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKG--FQFGC 243
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
+ + + T G++GLG G S++S Q+ N +CL
Sbjct: 244 SHLESGFSDQ---TDGLMGLGGGAQSLVS--QTAAAYGNSFSYCL 283
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/258 (25%), Positives = 100/258 (38%), Gaps = 24/258 (9%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCS--AFH 124
+ IG P + + +D GSD+ WV C+ C C Y+ + + P S + H
Sbjct: 109 IDIGTPNVSFLVALDAGSDMLWVPCD--CIECASLSAGNYNVLDRDLNQYRPSLSNTSRH 166
Query: 125 LPENIR-CE-------ANDQCDYEVLYAD-HGSSLGVLVTDHFPL----RLTNGSLLGPR 171
LP + C+ + D C YEV YA + SS G + D L + + +
Sbjct: 167 LPCGHKLCDVHSFCKGSKDPCPYEVQYASANTSSSGYVFEDKLHLTSDGKHAEQNSVQAS 226
Query: 172 LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFL 231
+I GCG Q GVLGLG G S+ S L GL +N CL G +
Sbjct: 227 IILGCGRKQTGDYLHGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSICLDENESGRIIF 286
Query: 232 GHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQ 290
G V + P+ Y G G Q + DSGSS+T+ ++
Sbjct: 287 GDQGHVTQHSTPFLPI------IAYMVGVESFCVGSLCLKETRFQALIDSGSSFTFLPNE 340
Query: 291 AYKTTLDLMRKDLKGKPL 308
Y+ + K + +
Sbjct: 341 VYQKVVTEFDKQVNASRI 358
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/300 (24%), Positives = 118/300 (39%), Gaps = 56/300 (18%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN--------- 111
G Y V + +G P K + + +DTGS L+W+QC C + + ++ P +
Sbjct: 105 GNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCS 164
Query: 112 --------LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
N P CS A C Y+ Y D S+G L D L LT
Sbjct: 165 SSQCSSLKSSTLNAPGCS----------NATGACVYKASYGDTSFSIGYLSQD--VLTLT 212
Query: 164 NGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-- 221
+ ++GCG + + + +AG++GL K S+L QL + N +CL
Sbjct: 213 PSAAPSSGFVYGCGQDNQGLFGR---SAGIIGLANDKLSMLGQLSN--KYGNAFSYCLPS 267
Query: 222 ------SVRGGGYLFLGHDLVPSSGIAWTPMSRD-LLEKHYSSGPAELLFGGKSTGIKG- 273
+ G+L +G + SS +TP+ ++ + Y G + GK G+
Sbjct: 268 SFSAQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSAS 327
Query: 274 ---LQIIFDSGSSYTYFNSQAY----KTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWK 326
+ I DSG+ T Y K+ + +M K P L C+KG+ K
Sbjct: 328 SYNVPTIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAP-----GFSILDTCFKGSVK 382
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/275 (27%), Positives = 113/275 (41%), Gaps = 35/275 (12%)
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK 109
V P+ G Y + +G P L +DT SDLTW+QC PC C ++ P+
Sbjct: 125 VAPVVSRAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPR 183
Query: 110 NNL----VACNDPFCSAFHLPENIRCEANDQ----CDYEVLYADHGSSLGVLVTDHFPLR 161
++ ++ N C A R D C Y V Y D +++G + +
Sbjct: 184 HSTSYREMSFNAADCQALG-----RSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTF- 237
Query: 162 LTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
G + PR+ GCG++ N G P AG+LGLG G S +Q+ G L L
Sbjct: 238 --AGGVRLPRISIGCGHD--NKGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLVDFL 293
Query: 222 SVRG--GGYLFLGHDLVPSS-GIAWTPMSRDL-LEKHYSSGPAELLFGG-KSTGI--KGL 274
S G L G V +S +++TP +L + Y + GG + G+ + L
Sbjct: 294 SGPGSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDL 353
Query: 275 Q---------IIFDSGSSYTYFNSQAYKTTLDLMR 300
Q +I DSG++ T AY D R
Sbjct: 354 QLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFR 388
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 92/191 (48%), Gaps = 23/191 (12%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFC 120
V+L +G PP+ + IDTGS+L+W+ CN TL + + P + + C+ P C
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLHCNK-----TLSYPTTFDPTRSTSYQTIPCSSPTC 87
Query: 121 S--AFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
+ P C++N+ C + YAD SS G L +D F + GS L+FGC
Sbjct: 88 TNRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHI----GSSDISGLVFGCMD 143
Query: 179 NQ-RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-VRGGGYLFLGH-DL 235
+ + + + G++G+ G S +SQ LG + +C+S G L LG +L
Sbjct: 144 SVFSSNSDEDSKSTGLMGMNRGSLSFVSQ---LGFPK--FSYCISGTDFSGLLLLGESNL 198
Query: 236 VPSSGIAWTPM 246
S + +TP+
Sbjct: 199 TWSVPLNYTPL 209
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 135/334 (40%), Gaps = 59/334 (17%)
Query: 2 EEKGKRVMGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNV-YPL 60
E KG V LVL G + Q +K+++ S A S P+T + +
Sbjct: 68 ERKGDWVEKQLVL------DGLHVRSIQNHIRKRTSSSQIADS--SETQVPLTSGIKFQT 119
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
Y VT+ +G+ + + +DTGSDLTWVQC PC C L+ P + + CN
Sbjct: 120 LNYIVTMGLGS--QNMSVIVDTGSDLTWVQCE-PCRSCYNQNGPLFKPSTSPSYQPILCN 176
Query: 117 DPFCSAFHLPENIRC----EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRL 172
C + L C + CDY V Y D + G L + +L G +
Sbjct: 177 STTCQSLELGA---CGSDPSTSATCDYVVNYGDGSYTSGELGIE----KLGFGGISVSNF 229
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRGGGY 228
+FGCG RN +G++GLG + S++SQ + V +CL G
Sbjct: 230 VFGCG---RNNKGLFGGASGLMGLGRSELSMISQTNA--TFGGVFSYCLPSTDQAGASGS 284
Query: 229 LFLGH------DLVPSSGIAWTPMSRDL-LEKHYSSGPAELLFGGKSTGIKGLQ-----I 276
L +G+ ++ P IA+T M +L L Y + GG S ++ +
Sbjct: 285 LVMGNQSGVFKNVTP---IAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSFGNGGV 341
Query: 277 IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
I DSG+ + YK LK K LE
Sbjct: 342 ILDSGTVISRLAPSVYKA--------LKAKFLEQ 367
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 72/300 (24%), Positives = 114/300 (38%), Gaps = 56/300 (18%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN--------- 111
G Y V + +G P K + + +DTGS L+W+QC C + + ++ P +
Sbjct: 111 GNYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCS 170
Query: 112 --------LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT 163
N P CS A C Y+ Y D S+G L D L LT
Sbjct: 171 SSQCSSLKSSTLNAPGCS----------NATGACVYKASYGDTSFSIGYLSQD--VLTLT 218
Query: 164 NGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV 223
++GCG + + + ++G++GL K S+L QL N +CL
Sbjct: 219 PSEAPSSGFVYGCGQDNQGLFGR---SSGIIGLANDKISMLGQLSK--KYGNAFSYCLPS 273
Query: 224 RG--------GGYLFLGHDLVPSSGIAWTPMSRDL-LEKHYSSGPAELLFGGKSTGIKG- 273
G+L +G + SS +TP+ ++ + Y + GK G+
Sbjct: 274 SFSAPNSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSAS 333
Query: 274 ---LQIIFDSGSSYTYFNSQAY----KTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWK 326
+ I DSG+ T Y K+ + +M K P L C+KG+ K
Sbjct: 334 SYNVPTIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAP-----GFSILDTCFKGSVK 388
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/276 (23%), Positives = 107/276 (38%), Gaps = 32/276 (11%)
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CT-LPPESLYHPKNNLVA 114
Y +G YSV K+G P + + L DTGSDLTW+ C C C+ + H +
Sbjct: 7 YGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 66
Query: 115 CNDPFCSAFHLPENIRCEANDQ------------CDYEVLYADHGSSLGVLVTDHFPLRL 162
+ F + L + + E D C Y+ Y+D ++LG + + L
Sbjct: 67 LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 126
Query: 163 TNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASI-LSQLQSLG--LTRNVLG 218
G + ++ GC ++ G GV+GLG K S + + G + ++
Sbjct: 127 KEGRKMKLHNVLIGC--SESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVD 184
Query: 219 HCLSVRGGGYLFLGHDLVPSS---GIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI---- 271
H YL G + + +T + ++ Y+ + GG I
Sbjct: 185 HLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEV 244
Query: 272 ---KGL-QIIFDSGSSYTYFNSQAYKTTLDLMRKDL 303
KG I DSGSS T+ AY+ + +R L
Sbjct: 245 WDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSL 280
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 106/269 (39%), Gaps = 28/269 (10%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQ---CNAPCTGCTLPPESLYHPKN----NLVACNDPF 119
+ +G PP + IDTGS L+WVQ C C +++P N + V C+
Sbjct: 3 ISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCSTEA 62
Query: 120 CSAFHLPENIR---CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C+ H+ + E +D C Y + Y S+G L D L +N S+ IFGC
Sbjct: 63 CNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTL-ASNRSI--DNFIFGC 119
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL--SVRGGGYLFLGH 233
G + G AG++G G S +Q+ Q T +C G L +G
Sbjct: 120 GEDNLYNGVN----AGIIGFGTKSYSFFNQVCQQTDYT--AFSYCFPRDHENEGSLTIGP 173
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK-----GLQIIFDSGSSYTYFN 288
+ WT + + Y+ +++ G I I DSG++ TY
Sbjct: 174 -YARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMTIVDSGTADTYIL 232
Query: 289 SQAYKTTLDLMRKDLKGKPLEDTAEEKAL 317
S + M K+++ K +E+ +
Sbjct: 233 SPVFDALDKAMTKEMQAKGYTRGWDERRI 261
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 80/179 (44%), Gaps = 16/179 (8%)
Query: 33 KKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
K K+T AH + + P + V + IG+PP L +DT SDL W+QC
Sbjct: 63 KAKATGDIIAHLSPNVPIIPQA--------FLVNISIGSPPVTQLLHMDTASDLLWLQCR 114
Query: 93 APCTGCTLPPESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLG 151
PC C ++ P + N+ ++ + ++R A + C+Y + Y D S G
Sbjct: 115 -PCINCYAQSLPIFDPSRSYTHRNESCRTSQYSMPSLRFNAKTRSCEYSMRYMDGTGSKG 173
Query: 152 VLVTDHFPLRLT---NGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
+L + + S ++FGCG++ +P G+LGLG G+ S++ +
Sbjct: 174 ILAKEMLMFNTIYDESSSAALHDVVFGCGHDNYG---EPLVGTGILGLGYGEFSLVHRF 229
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 75/283 (26%), Positives = 120/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS TWV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 121/294 (41%), Gaps = 48/294 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG-CTLPPESLYHPKNN----LVAC 115
G Y + L IG PP Y+ DTGSDL W QC APCT C P LY+P ++ ++ C
Sbjct: 30 GEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAVLPC 88
Query: 116 NDPF--CSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT-NGSLLGPRL 172
N C+A C Y V Y +S+ ++ F T G P +
Sbjct: 89 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSV-FQGSETFTFGSTPAGHARVPGI 147
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS----VRGGGY 228
FGC + + G +G++GLG G+ S++SQ LG+ + +CL+
Sbjct: 148 AFGC--STASSGFNASSASGLVGLGRGRLSLVSQ---LGVPK--FSYCLTPYQDTNSTST 200
Query: 229 LFLG--HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLF----GGKSTGIKGLQI------ 276
L LG L ++G++ TP S+ P + G S G L I
Sbjct: 201 LLLGPSASLNGTAGVSSTP-----FVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFS 255
Query: 277 ---------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
I DSG++ T + AY+ + L P D + + L +C+
Sbjct: 256 LNADGTGGLIIDSGTTITLLGNTAYQ-QVRAAVVSLVTLPTTDGSADTGLDLCF 308
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/202 (28%), Positives = 87/202 (43%), Gaps = 27/202 (13%)
Query: 37 TQSTAAHRFGSTAVFPITGNVY----PLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
+++ A S+A P++ Y P+ Y + L IG PP+ +L +DTGS L W QC
Sbjct: 61 SKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ 120
Query: 93 APCTGC---TLPPESLYHPKNNLVACNDPFCSAFHL---PENIRC--EANDQCDYEVLYA 144
PC C +LP Y+ + P C + P C + C Y Y
Sbjct: 121 -PCAVCFNQSLP----YYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSYSYG 175
Query: 145 DHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASIL 204
D +++G L D + G+ + P ++FGCG N N G G+ G G G S+
Sbjct: 176 DKSATIGFL--DVETVSFVAGASV-PGVVFGCGLN--NTGIFRSNETGIAGFGRGPLSLP 230
Query: 205 SQLQSLGLTRNVLGHCLSVRGG 226
SQL+ + HC + G
Sbjct: 231 SQLKVGNFS-----HCFTAVSG 247
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/266 (25%), Positives = 109/266 (40%), Gaps = 32/266 (12%)
Query: 63 YSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPC-TGCTLPPESLYHPKNNLV----ACN 116
Y +T+++G+PP K + IDTGSD++WV+C PC C + L+ P + +C+
Sbjct: 140 YVITVRLGSPPGKSQTMLIDTGSDISWVRCK-PCWQQCRPQVDPLFDPSLSSTYSPFSCS 198
Query: 117 DPFCSAFHLPENIR-CEANDQCDYEVLYADHG-SSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C+ N C ++ QC Y +Y D + G +D L + +++ + F
Sbjct: 199 SAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTVVVSKFRF 258
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL------GLTRNVLGHCL--SVRGG 226
GC + + G+ GL G + QSL +CL +
Sbjct: 259 GCSHAE----------TGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCLPPTPSSS 308
Query: 227 GYLFLGHDLVPSSGIAWTPMSR-DLLEKHYSSGPAELLFGGKS----TGIKGLQIIFDSG 281
G+L LG S+G TPM R + Y + GG+ T + +I DSG
Sbjct: 309 GFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVFSAGMIMDSG 368
Query: 282 SSYTYFNSQAYKTTLDLMRKDLKGKP 307
+ T AY + + +K P
Sbjct: 369 TVVTRLPPTAYSSLSSAFKAGMKQYP 394
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 104/281 (37%), Gaps = 48/281 (17%)
Query: 56 NVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNN 111
N P Y V L IG PP+ +L +DTGSDL W QC PC C + P +
Sbjct: 28 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLS 86
Query: 112 LVACNDPFCSAFHLPE--NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG 169
L +C+ C + + + N C Y Y D + G L D F S+
Sbjct: 87 LTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASV-- 144
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY- 228
P + FGCG N G G+ G G G S+ SQL+ + HC + G
Sbjct: 145 PGVAFGCGL--FNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFS-----HCFTTITGAIP 197
Query: 229 ----LFLGHDLVPS--SGIAWTPMSRDLLEKHYSSGPAE-----LLFGGKSTGIKGLQI- 276
L L DL + + TP+ + Y+ A L G + G L +
Sbjct: 198 STVLLDLPADLFSNGQGAVQTTPLIQ------YAKNEANPTLYYLSLKGITVGSTRLPVP 251
Query: 277 -------------IFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
I DSG+S T Q Y+ D +K
Sbjct: 252 ESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK 292
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/276 (23%), Positives = 105/276 (38%), Gaps = 32/276 (11%)
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG---CTLPPESLYHPKNNLVA 114
Y +G YSV K+G P + + L DTGSDLTW+ C C + H +
Sbjct: 78 YGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 137
Query: 115 CNDPFCSAFHLPENIRCEANDQ------------CDYEVLYADHGSSLGVLVTDHFPLRL 162
+ F + L + + E D C Y+ Y+D ++LG + + L
Sbjct: 138 LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 197
Query: 163 TNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASI-LSQLQSLG--LTRNVLG 218
G + ++ GC ++ G GV+GLG K S + + G + ++
Sbjct: 198 KEGRKMKLHNVLIGC--SESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVD 255
Query: 219 HCLSVRGGGYLFLGHDLVPSS---GIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI---- 271
H YL G + + +T + ++ Y+ + GG I
Sbjct: 256 HLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEV 315
Query: 272 ---KGL-QIIFDSGSSYTYFNSQAYKTTLDLMRKDL 303
KG I DSGSS T+ AY+ + +R L
Sbjct: 316 WDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSL 351
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 74/278 (26%), Positives = 112/278 (40%), Gaps = 30/278 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG---CTLPPESLYHPKNN----LVAC 115
Y VT +G P +++DTGSDL+WVQC PC+ C + L+ P + V C
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCSAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
P C+ + + QC Y V Y D ++ GV +D L ++ ++ G FG
Sbjct: 199 GGPVCAGLGI-YAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS-AVQG--FFFG 254
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLGH 233
CG+ Q G+LGLG + S++ Q+ G V +CL + GYL LG
Sbjct: 255 CGHAQSGLFNG---VDGLLGLGREQPSLVE--QTAGTYGGVFSYCLPTKPSTAGYLTLG- 308
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI---------IFDSGSSY 284
L SG A + LL + ++ G S G + L + + D+G+
Sbjct: 309 -LGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVI 367
Query: 285 TYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
T AY R + L C+
Sbjct: 368 TRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYN 405
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 60/136 (44%), Gaps = 15/136 (11%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN- 111
+G + G Y + +G P L IDTGSDL W+QC+ PC C ++ P+ +
Sbjct: 76 FSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCS-PCRRCYAQRGQVFDPRRSS 134
Query: 112 ---LVACNDPFCSAFHLPENIRCE----ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN 164
V C+ P C A P C+ A C Y V Y D SS G L TD L N
Sbjct: 135 TYRRVPCSSPQCRALRFPG---CDSGGAAGGGCRYMVAYGDGSSSTGDLATDK--LAFAN 189
Query: 165 GSLLGPRLIFGCGYNQ 180
+ + + GCG +
Sbjct: 190 DTYVN-NVTLGCGRDN 204
>gi|168051774|ref|XP_001778328.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670305|gb|EDQ56876.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 165
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 67/137 (48%), Gaps = 18/137 (13%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + I PP+ + IDTGSDLTWVQC PC C L +++P ++ VAC
Sbjct: 10 GEYFIDIFIDTPPRHILVIIDTGSDLTWVQC-TPCLHCYLQKGLVFNPHSSESYDPVACG 68
Query: 117 DPFCSAFHLPENIR--CEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLT---------N 164
+P AF N R C + Q C Y Y D ++ T+ F + T +
Sbjct: 69 EPK-RAFVESSNNRSTCVTDSQGCSYFYWYGDSSNTTSDFATETFTVNKTIKNDEGGGED 127
Query: 165 GSLLGPRLIFGCGYNQR 181
+L +++FGCG+N +
Sbjct: 128 DTLQISKIMFGCGHNNQ 144
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/202 (28%), Positives = 87/202 (43%), Gaps = 27/202 (13%)
Query: 37 TQSTAAHRFGSTAVFPITGNVY----PLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN 92
+++ A S+A P++ Y P+ Y + L IG PP+ +L +DTGS L W QC
Sbjct: 5 SKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ 64
Query: 93 APCTGC---TLPPESLYHPKNNLVACNDPFCSAFHL---PENIRC--EANDQCDYEVLYA 144
PC C +LP Y+ + P C + P C + C Y Y
Sbjct: 65 -PCAVCFNQSLP----YYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSYSYG 119
Query: 145 DHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASIL 204
D +++G L D + G+ + P ++FGCG N N G G+ G G G S+
Sbjct: 120 DKSATIGFL--DVETVSFVAGASV-PGVVFGCGLN--NTGIFRSNETGIAGFGRGPLSLP 174
Query: 205 SQLQSLGLTRNVLGHCLSVRGG 226
SQL+ + HC + G
Sbjct: 175 SQLKVGNFS-----HCFTAVSG 191
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 76/292 (26%), Positives = 119/292 (40%), Gaps = 48/292 (16%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCNA-PCTGCTLPP--ESLYHPKNNLVACNDPFC 120
+V+L +G+PP+ + +DTGS+L+W+ C P T P S Y P CN C
Sbjct: 61 TVSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSSSYTP----TPCNSSIC 116
Query: 121 SAFHLPENI--RCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC- 176
+ I C+ N++ C V YAD S+ G L + F L P +FGC
Sbjct: 117 TTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSL----AGAAQPGTLFGCM 172
Query: 177 ---GYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG-GYLFL 231
GY N K T G++G+ G S+++Q ++ +C+S G L L
Sbjct: 173 DSAGYTSDINEDSK---TTGLMGMNRGSLSLVTQ-----MSLPKFSYCISGEDALGVLLL 224
Query: 232 GHDLVPSSGIAWTPMSRDLLEKHYSSGPA-ELLFGGKSTGIKGLQI-------------- 276
G S + +TP+ Y + A + G K LQ+
Sbjct: 225 GDGTDAPSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQ 284
Query: 277 -IFDSGSSYTYFNSQAYKTTLDLMRKDLKG--KPLEDT--AEEKALPVCWKG 323
+ DSG+ +T+ Y + D + KG +ED E A+ +C+
Sbjct: 285 TMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHA 336
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 53/181 (29%), Positives = 85/181 (46%), Gaps = 28/181 (15%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQC-NAPCTGCTLPP--ESLYHPKNNLVACNDPFC 120
+V+L +G+PP+ + +DTGS+L+W+ C AP P S Y P + C P C
Sbjct: 64 TVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSP----IPCTSPTC 119
Query: 121 SA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
F +P + C+ C + YAD S G L +D F + G+ P IFGC
Sbjct: 120 RTRTRDFSIP--VSCDKKKLCHAIISYADASSIEGNLASDTFHI----GNSAIPATIFGC 173
Query: 177 ---GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR-GGGYLFLG 232
G++ + + T G++G+ G S ++Q +GL + +C+S + G L G
Sbjct: 174 MDSGFSSNS--DEDSKTTGLIGMNRGSLSFVTQ---MGLQK--FSYCISGQDSSGILLFG 226
Query: 233 H 233
Sbjct: 227 E 227
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 114/274 (41%), Gaps = 54/274 (19%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + + +GNPP+ + L IDTGSDLTW+QC PC C ++ P + ++ CN
Sbjct: 169 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCK-PCKACFDQSGPVFDPSQSTSFKIIPCN 227
Query: 117 DPFCSAFHLPENIRCEANDQ------CDYEVLYADHGSSLGVLVTDHFPLRLTN--GSLL 168
C L + C N C Y Y D + G L + + L++ SL
Sbjct: 228 AAACD---LVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLE 284
Query: 169 GPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---- 224
++ GCG++ + G+LGLG G S SQL+S + ++ +CL R
Sbjct: 285 IRDMVIGCGHSNKGLFQG---AGGLLGLGQGALSFPSQLRSSPIGQS-FSYCLVDRTNNL 340
Query: 225 --------GGGYLFLGHDLVPSSGIAWTPMSR--DLLEKHYSSG-------------PAE 261
G G+ H + +TP R + +E Y G PAE
Sbjct: 341 SVSSAISFGAGFALSRH----FDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAE 396
Query: 262 LLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTT 295
F G G I DSG++ TY N AY+
Sbjct: 397 -RFAIAPNGSGG--TIIDSGTTLTYLNRDAYRAV 427
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 81/186 (43%), Gaps = 26/186 (13%)
Query: 50 VFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPK 109
V P+ G Y + +G P L +DT SDLTW+QC PC C ++ P+
Sbjct: 128 VAPVVSRAPTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPR 186
Query: 110 NNL----VACNDPFCSAFHLPENIRCEANDQ----CDYEVLYAD---HGS---SLGVLVT 155
++ + + P C A R D C Y VLY D HGS S+G LV
Sbjct: 187 HSTSYGEMNYDAPDCQALG-----RSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVE 241
Query: 156 DHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRN 215
+ G + L GCG++ N G P AG+LGL G+ SI Q+ LG +
Sbjct: 242 ETLTF---AGGVRQAYLSIGCGHD--NKGLFGAPAAGILGLSRGQISIPHQIAFLGYNAS 296
Query: 216 VLGHCL 221
+CL
Sbjct: 297 -FSYCL 301
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 88/191 (46%), Gaps = 33/191 (17%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQC-NAPCTGCTLPP--ESLYHPKNNLVACNDPFC 120
+V+L +G+PP+ + +DTGS+L+W+ C AP P S Y P + C P C
Sbjct: 57 TVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSP----IPCTSPTC 112
Query: 121 SA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
F +P + C+ C + YAD S G L +D F + G+ P IFGC
Sbjct: 113 RTRTRDFSIP--VSCDKKKLCHAIISYADASSIEGNLASDTFHI----GNSAIPATIFGC 166
Query: 177 ---GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR-GGGYLFLG 232
G++ + + T G++G+ G S ++Q +GL + +C+S + G L G
Sbjct: 167 MDSGFSSNS--DEDSKTTGLIGMNRGSLSFVTQ---MGLQK--FSYCISGQDSSGILLFG 219
Query: 233 HDLVPSSGIAW 243
S +W
Sbjct: 220 E-----SSFSW 225
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 118/293 (40%), Gaps = 50/293 (17%)
Query: 44 RFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPE 103
FGS V +G G Y + IG P + + +DTGSD+ W+QC PC C +
Sbjct: 138 EFGSEVV---SGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE-PCRECYSQAD 193
Query: 104 SLYHPKNNL----VACNDPFCSAFHLPENIRCEAND----QCDYEVLYADHGSSLGVLVT 155
+++P +++ V C+ CS + +AND C YEV Y D ++G T
Sbjct: 194 PIFNPSSSVSFSTVGCDSAVCS--------QLDANDCHGGGCLYEVSYGDGSYTVGSYAT 245
Query: 156 DHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRN 215
+ LT G+ + GCG++ G+ L S +QL + T
Sbjct: 246 ET----LTFGTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSL---SFPAQLGT--QTGR 296
Query: 216 VLGHCLSVR---GGGYLFLGHDLVPSSGIAWTPMSRD-LLEKHY--------------SS 257
+CL R G L G + VP I +TP+ + L Y S
Sbjct: 297 AFSYCLVDRDSESSGTLEFGPESVPIGSI-FTPLVANPFLPTFYYLSMVAISVGGVILDS 355
Query: 258 GPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
P+E ++TG G II DSG++ T + AY D + P D
Sbjct: 356 VPSEAFRIDETTGRGG--IIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRAD 406
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 118/278 (42%), Gaps = 36/278 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y + L IG PP + DTGSDLTW QC PC C +Y P + P SA
Sbjct: 66 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 124
Query: 123 FHLP--ENIRCE-ANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI------ 173
LP + C + C Y Y+D S+G+L T+ LT GS + + +
Sbjct: 125 TCLPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTE----TLTIGSSVPGQTVSVGSVA 180
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLG- 232
FGCG + G + G +GLG G S+L+QL +G L + FLG
Sbjct: 181 FGCGTDN---GGDSLNSTGTVGLGRGTLSLLAQL-GVGKFSYCLTDFFNSTMDSPFFLGT 236
Query: 233 -HDLVPSSG-IAWTPMSRDLLE--------KHYSSGPAEL-----LFGGKSTGIKGLQII 277
+L P G + TP+ + L + S G L F ++ G G+ +
Sbjct: 237 LAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMV- 295
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEK 315
DSG+++T ++ +D + + L P+ ++ +
Sbjct: 296 -DSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSLDS 332
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 80/191 (41%), Gaps = 15/191 (7%)
Query: 38 QSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG 97
Q ++ S + +G++ G Y V + +G P + L DTGSDLTW QC
Sbjct: 121 QDSSVEELDSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS 180
Query: 98 CTLPPESLYHPKNNL----VACNDPFCSAFHLP--ENIRCEANDQ-CDYEVLYADHGSSL 150
C + ++ P + + C C+ + C A+ + C Y + Y D S+
Sbjct: 181 CYKQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSV 240
Query: 151 GVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL 210
G + + T+ + +FGCG N + +AG++GLG S + Q+
Sbjct: 241 GYFSRERLTVTATD---VVDNFLFGCGQNNQGLFGG---SAGLIGLGRHPISFVQ--QTA 292
Query: 211 GLTRNVLGHCL 221
R + +CL
Sbjct: 293 AKYRKIFSYCL 303
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 108/271 (39%), Gaps = 38/271 (14%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC--TLP-----PESLYHPKNNLV 113
G Y + + G P + IDTGSD+ W+ C C GC T P S Y P
Sbjct: 113 GEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQ-CQGCHSTAPIFDPAKSSSYKP----F 167
Query: 114 ACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
AC+ C + C N +C +EVLY D G L +D L GS P
Sbjct: 168 ACDSQPCQEI----SGNCGGNSKCQFEVLYGDGTQVDGTLASDAITL----GSQYLPNFS 219
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFL 231
FGC + P G+ G S+L+Q + L +CL S G L L
Sbjct: 220 FGCAESLSEDTYSSPGLMGLG---GGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLVL 276
Query: 232 GHD-LVPSSGIAWTPMSRD--------LLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGS 282
G + V SS + +T + +D + K S G + + G II DSG+
Sbjct: 277 GKEAAVSSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTII-DSGT 335
Query: 283 SYTYFNSQAYKTTLDLMRKDLKG---KPLED 310
+ TY AYK D R+ L P+ED
Sbjct: 336 TITYLVPSAYKDLRDAFRQQLSSLQPTPVED 366
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 76/303 (25%), Positives = 121/303 (39%), Gaps = 54/303 (17%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCN------APCTGCTLPPESLYHPKNNLVACNDP 118
++L IG P + EL +DTGS L+W+QC+ T SL ++L C+ P
Sbjct: 82 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDL-PCSHP 140
Query: 119 FCSA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C F LP + C++N C Y YAD + G LV + F + S P LI
Sbjct: 141 LCKPRIPDFTLPTS--CDSNRLCHYSYFYADGTFAEGNLVKEKFTF---SNSQTTPPLIL 195
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG-------G 227
GC + G+LG+ LG+ S +SQ + + +C+ R G
Sbjct: 196 GCAKESTD-------EKGILGMNLGRLSFISQAKI-----SKFSYCIPTRSNRPGLASTG 243
Query: 228 YLFLGHDLVPSSGIAWT-----PMSRDL--LEKHYSSGPAE-LLFGGKSTGIKGL----- 274
+LG D S G + P S+ + L+ + P + + G K I G
Sbjct: 244 SFYLG-DNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPD 302
Query: 275 -----QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCLL 329
Q + DSGS +T+ AY + + + + + + +C+ G +
Sbjct: 303 AGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEI 362
Query: 330 GNF 332
G
Sbjct: 363 GRL 365
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 79/302 (26%), Positives = 116/302 (38%), Gaps = 40/302 (13%)
Query: 39 STAAHRFGSTAVFPITGNVYP-LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCN----A 93
STA S P+T Y G Y V ++G P + + L DTGSDLTWV+C +
Sbjct: 85 STAPMPEASAFAMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRAS 144
Query: 94 PCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEANDQ----CDYEVLYAD 145
L ++ P N+ + C+ C ++ C A C Y+ Y D
Sbjct: 145 SPDASPLASPRVFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKD 204
Query: 146 HGSSLGVLVTDHFPLRLT-NGSLLGPRL---IFGCGYNQRNPGPKPPPTAGVLGLGLGKA 201
S+ GV+ TD + L+ +GS +L + GC + G + GVL LG
Sbjct: 205 KSSARGVVGTDAATIALSGSGSDRKAKLQEVVLGCTTSYD--GQSFQSSDGVLSLGNSNI 262
Query: 202 SILSQLQSL---GLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLL-----EK 253
S S+ + + ++ H YL G G A +P LL
Sbjct: 263 SFASRAAARFGGRFSYCLVDHLAPRNATSYLTFG-----PVGAAHSPSRTPLLLDAQVAP 317
Query: 254 HYSSGPAELLFGGKSTGI--------KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKG 305
Y+ + GK+ I K I DSG+S T + AYK + + K L
Sbjct: 318 FYAVTVDAVSVAGKALNIPAEVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLAR 377
Query: 306 KP 307
P
Sbjct: 378 VP 379
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 70/152 (46%), Gaps = 23/152 (15%)
Query: 65 VTLKIGNPPKLYELDIDTGSDLTWVQCN------APCTGCTLPPESLYHPKNNLVACNDP 118
++L IG P + EL +DTGS L+W+QC+ T SL ++L C+ P
Sbjct: 83 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDL-PCSHP 141
Query: 119 FCS----AFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C F LP + C++N C Y YAD + G LV + F + S P LI
Sbjct: 142 LCKPRIPDFTLPTS--CDSNRLCHYSYFYADGTFAEGNLVKEKFTF---SNSQTTPPLIL 196
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQ 206
GC + G+LG+ LG+ S +SQ
Sbjct: 197 GCAKESTD-------VKGILGMNLGRLSFISQ 221
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/293 (27%), Positives = 116/293 (39%), Gaps = 46/293 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC-TGCTLPPESLYHPKN----NLVAC 115
G Y +TL IG PP Y DTGSDL W QC APC T C P LY+P + +++ C
Sbjct: 112 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASSTTFSVLPC 170
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIF 174
N C Y Y G + GV ++ F + P + F
Sbjct: 171 NSSLSMCAGALAGAAPPPGCACMYYQTYG-TGWTAGVQGSETFTFGSSAADQARVPGVAF 229
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS----VRGGGYLF 230
GC + +AG++GLG G S++SQ LG R +CL+ L
Sbjct: 230 GCSNASSSDWNG---SAGLVGLGRGSLSLVSQ---LGAGR--FSYCLTPFQDTNSTSTLL 281
Query: 231 LG-HDLVPSSGIAWTPM----SRDLLEKHYSSGPAELLFGGKSTGIKGLQI--------- 276
LG + +G+ TP +R + +Y L G S G K L I
Sbjct: 282 LGPSAALNGTGVRSTPFVASPARAPMSTYY-----YLNLTGISLGAKALPISPGAFSLKP 336
Query: 277 ------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGK-PLEDTAEEKALPVCWK 322
I DSG++ T + AY+ ++ L P D ++ L +C+
Sbjct: 337 DGTGGLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFA 389
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 84/180 (46%), Gaps = 22/180 (12%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y VT++IG + + +DTGSDLTWVQC PC C + L++P + + CN
Sbjct: 67 YIVTVEIGG--RNMTVIVDTGSDLTWVQCQ-PCRLCYNQQDPLFNPSGSPSYQTILCNSS 123
Query: 119 FCSAFHLPE-NI-RCEAND-QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C + N+ C +N C+Y V Y D + G L + L T+ S IFG
Sbjct: 124 TCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVS----NFIFG 179
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLFLG 232
CG RN +G++GLG S++SQ + + V +CL + G L LG
Sbjct: 180 CG---RNNKGLFGGASGLMGLGKSDLSLVSQTSA--IFEGVFSYCLPTTAADASGSLILG 234
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 82/287 (28%), Positives = 115/287 (40%), Gaps = 52/287 (18%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDL-------TWVQCNAPCTGCTLPPESLYHPKNNLV 113
GYY+ +KIG PP + L +D S + ++ P L S Y P
Sbjct: 33 GYYTSRVKIGTPPHEFSLIVDRSSFVSPKTMFCSFFFLQDPRFSPAL--SSSYKPLECGN 90
Query: 114 ACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL-GPRL 172
C+ FC Y+ YA+ +S GVL D + +N S L G RL
Sbjct: 91 ECSTGFCDGSR-------------KYQRQYAEKSTSSGVLGKD--VISFSNSSDLGGQRL 135
Query: 173 IFGCGYNQRNPGPKPPPTA-GVLGLGLGKASILSQLQSLGLTRNVLGHCLS--VRGGGYL 229
+FGC G TA G++GLG G SI+ QL +V C GGG +
Sbjct: 136 VFGC--ETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAM 193
Query: 230 FLG-----HDLVPSSGIAWTPMSRDLLEKHYSSGPAEL-----LFGGKSTGIKGLQIIFD 279
LG D+V +S +L+ K G + L +F GK + D
Sbjct: 194 ILGGFQPPKDMVFTSSDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGK------YGTVLD 247
Query: 280 SGSSYTYFNS---QAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SG++Y YF QA+K+ + LK P D EK +C+ G
Sbjct: 248 SGTTYAYFPGAAFQAFKSAVKEQVGSLKEVPGPD---EKFKDICYAG 291
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 85/294 (28%), Positives = 117/294 (39%), Gaps = 54/294 (18%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL--------VA 114
Y + IG+PP+ E IDTGSDL W QC C + + L P NL V
Sbjct: 86 YIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGL--PYYNLSQSSTFVPVP 143
Query: 115 CNDP--FCSA--FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP 170
C D FC+A HL C + C + Y G +G L T+ F S
Sbjct: 144 CADKAGFCAANGVHL-----CGLDGSCTFIASYG-AGRVIGSLGTESFAFESGTTS---- 193
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL-----SVRG 225
L FGC R +G++GLG G+ S++SQ +G TR +CL S
Sbjct: 194 -LAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQ---IGATR--FSYCLTPYFHSSGA 247
Query: 226 GGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSG---PAELLFGGK-------STGIKGLQ 275
+LF+G G A P + + YS+ P E + GK ST + Q
Sbjct: 248 SSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQLRQ 307
Query: 276 ---------IIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVC 320
+I D+GS T S AY+ + + L L E+ L +C
Sbjct: 308 LFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELC 361
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/244 (28%), Positives = 104/244 (42%), Gaps = 19/244 (7%)
Query: 62 YYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT--LPPE------SLYHPKNNLV 113
+Y+V + +G P + + +DTGSDL WV C+ C C + P Y P+ +
Sbjct: 88 HYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKSST 144
Query: 114 ACNDPFCSAFHLPENIRCEANDQCDYEVLY-ADHGSSLGVLVTDHFPLRLTNG---SLLG 169
+ P S ++ A+ C Y + Y +D+ SS GVLV D L G ++
Sbjct: 145 SRKVPCSSNLCDEQSACRSASSSCPYSIQYLSDNTSSTGVLVEDVLYLVTEYGRQPKIVT 204
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGL-TRNVLGHCLSVRGGGY 228
+ FGCG Q G+LGLG+ S+ S L S G+ N C + G G
Sbjct: 205 APITFGCGRTQTGSFLGTAAPNGLLGLGMDTISVPSLLASQGVAAANSFSMCFAQDGHGR 264
Query: 229 LFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFN 288
+ G SS TP++ +Y+ G KS K I DSG+S+T +
Sbjct: 265 INFGD--TGSSDQQETPLNMYKQNPYYNISITGATVGSKSIHTK-FNAIVDSGTSFTALS 321
Query: 289 SQAY 292
Y
Sbjct: 322 DPMY 325
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 83/177 (46%), Gaps = 21/177 (11%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQC-NAPCTGCTLPP--ESLYHPKNNLVACNDPFC 120
+VTL +G+PP+ + +DTGS+L+W+ C +P G P S Y P V C+ P C
Sbjct: 66 TVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSP----VPCSSPIC 121
Query: 121 --SAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC- 176
LP C+ C + YAD S G L + F + GS+ P +FGC
Sbjct: 122 RTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI----GSVTRPGTLFGCM 177
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-VRGGGYLFLG 232
+ + + G++G+ G S ++Q LG ++ +C+S G+L LG
Sbjct: 178 DSGLSSNSEEDAKSTGLMGMNRGSLSFVNQ---LGFSK--FSYCISGSDSSGFLLLG 229
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 78/283 (27%), Positives = 113/283 (39%), Gaps = 44/283 (15%)
Query: 48 TAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT---LPPES 104
T+V +GN +G Y V K+G PP+L + +DT +D W+ C+ C+GC+ +
Sbjct: 15 TSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG-CSGCSNASTSFNT 73
Query: 105 LYHPKNNLVACNDPFCSAFHLPENIRCEAN----DQCDYEVLYADHGSSLGVLVTDHFPL 160
+ V+C+ C+ + C ++ C + Y S LV D
Sbjct: 74 NSSSTYSTVSCSTAQCTQ---ARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQD---- 126
Query: 161 RLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC 220
LT + P FGC N + PP G++GLG G S++SQ S L V +C
Sbjct: 127 TLTLAPDVIPNFSFGC-INSASGNSLPP--QGLMGLGRGPMSLVSQTTS--LYSGVFSYC 181
Query: 221 L----SVRGGGYLFLGHDLVPSSGIAWTPMSRD---------------LLEKHYSSGPAE 261
L S G L LG P S I +TP+ R+ + P
Sbjct: 182 LPSFRSFYFSGSLKLGLLGQPKS-IRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVY 240
Query: 262 LLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
L F S G I DSG+ T F Y+ D RK +
Sbjct: 241 LTFDANS----GAGTIIDSGTVITRFAQPVYEAIRDEFRKQVN 279
>gi|326515366|dbj|BAK03596.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 118/305 (38%), Gaps = 46/305 (15%)
Query: 58 YPLGYYSVTLKIGN--PPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYH----PKNN 111
Y G YSV + IG+ Y+L +D LTW+QC PC S+++ P +
Sbjct: 74 YSGGIYSVRVGIGSGGTQHFYKLALDLVRPLTWMQCK-PCVPEKRQDGSVFNTAASPHYH 132
Query: 112 LVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHF---------PLRL 162
+A DP C A P + D + Y D + GVL +D F P+
Sbjct: 133 HIASTDPRCMA---PYTRAGQGRCTFDVKFQYGD-SRARGVLGSDDFVFDGSGPGSPISS 188
Query: 163 TNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS 222
NG L+FGC +N + AGV+ L S + QL + GL +CL+
Sbjct: 189 VNG------LVFGCAHNTHD-FYNHDLWAGVMSLNRHPTSFIRQLSARGLAAPRFSYCLA 241
Query: 223 VRGG----GYLFLGHDLVPSSGIAWTPMSRDLLEK----HYSSGPAELLFGGKSTGIKGL 274
R G+L G D+ S TP+ L + +Y L G + T I +
Sbjct: 242 SRQHRDRRGFLRFGADIPDQSHARSTPLLHGDLAQGGGMYYVGVVGVSLGGRRLTAITPV 301
Query: 275 QI-----------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
I D G+S T + Y + + ++ + ++ C++G
Sbjct: 302 MFELNRRSLRGGCIIDVGTSLTLMATAPYHVLVAELIAHMRSRGVQHAIFSPGQKHCFRG 361
Query: 324 TWKCL 328
W+ +
Sbjct: 362 KWESI 366
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 108/265 (40%), Gaps = 42/265 (15%)
Query: 61 GYYSVTLKIGNPP-KLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPF 119
G Y +T +G PP KLY + DTGSD+ W+QC PC C + P + N P
Sbjct: 85 GEYLMTYSVGTPPFKLYGI-ADTGSDIVWLQCE-PCKECYNQTTPKFKPSKSSTYKNIPC 142
Query: 120 CSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFGCGY 178
S C++ Q G L D L + G + P+ + GCG
Sbjct: 143 SSDL-------CKSGQQ--------------GNLSVDTLTLESSTGHPISFPKTVIGCGT 181
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL------SVRGGGYLFLG 232
+ N ++G++GLG G AS+++QL S + +CL S F
Sbjct: 182 D--NTVSFEGASSGIVGLGGGPASLITQLGSSIDAK--FSYCLLPNPVESNTTSKLNFGD 237
Query: 233 HDLVPSSGIAWTPMSRD-------LLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYT 285
+V G+ TP+ + L + +S G + F G S G II DSG++ T
Sbjct: 238 TAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEGSSNGGHEGNIIIDSGTTLT 297
Query: 286 YFNSQAYKTTLDLMRKDLKGKPLED 310
+ Y + + +K K + D
Sbjct: 298 VIPTDVYNNLESAVLELVKLKRVND 322
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 67/154 (43%), Gaps = 13/154 (8%)
Query: 54 TGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLV 113
+G LG Y +K+G+P + + L +DTGS+ TW+ C+ T
Sbjct: 104 SGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNCSKSFEAVTCASRK--------- 154
Query: 114 ACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRL 172
C F L ++ + +D C Y++ YAD S+ G TD + LTNG L
Sbjct: 155 -CKVDLSELFSL--SVCPKPSDPCLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNL 211
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQ 206
GC + N T G+LGLG K S + +
Sbjct: 212 TIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDK 245
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 118/294 (40%), Gaps = 44/294 (14%)
Query: 39 STAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC 98
S A + T+V +GN +G Y V ++G PP+L + +DT +D W+ C+ C+GC
Sbjct: 81 SLVAGKSKPTSVPVASGNQLHIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSG-CSGC 139
Query: 99 T---LPPESLYHPKNNLVACNDPFCSAFHLPENIRCEAN----DQCDYEVLYADHGSSLG 151
+ + + V+C+ C+ + C ++ C + Y S
Sbjct: 140 SNASTSFNTNSSSTYSTVSCSTTQCTQ---ARGLTCPSSTPQPSICSFNQSYGGDSSFSA 196
Query: 152 VLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLG 211
LV D LT + P FGC N + PP G++GLG G S++SQ S
Sbjct: 197 NLVQD----TLTLSPDVIPNFSFGC-INSASGNSLPP--QGLMGLGRGPMSLVSQTTS-- 247
Query: 212 LTRNVLGHCL-SVRG---GGYLFLGHDLVPSSGIAWTPMSRD---------------LLE 252
L V +CL S R G L LG P S I +TP+ R+ +
Sbjct: 248 LYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKS-IRYTPLLRNPRRPSLYYVNLTGVSVGS 306
Query: 253 KHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
P L F S G I DSG+ T F Y+ D RK + G
Sbjct: 307 VQVPVDPVYLTFDSNS----GAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNGS 356
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 82/177 (46%), Gaps = 21/177 (11%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQC-NAPCTGCTLPP--ESLYHPKNNLVACNDPFC 120
+VTL +G+PP+ + +DTGS+L+W+ C +P G P S Y P V C+ P C
Sbjct: 62 TVTLAVGSPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSP----VPCSSPIC 117
Query: 121 --SAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC- 176
LP C+ C + YAD S G L D F + GS+ P +FGC
Sbjct: 118 RTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVI----GSVTRPGTLFGCM 173
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-VRGGGYLFLG 232
+ + + G++G+ G S ++Q LG ++ +C+S G L LG
Sbjct: 174 DSGLSSDSEEDAKSTGLMGMNRGSLSFVNQ---LGFSK--FSYCISGSDSSGILLLG 225
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 63/189 (33%), Positives = 84/189 (44%), Gaps = 34/189 (17%)
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSA------------F 123
+DTGSDLTWVQC PC+ C + L+ P + V CN C A
Sbjct: 180 VDTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 238
Query: 124 HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNP 183
+ +++C Y + Y D S GVL TD + L S+ G +FGCG + R
Sbjct: 239 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATD--TVALGGASVDG--FVFGCGLSNRGL 294
Query: 184 GPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNVLGHCL----SVRGGGYLFLGHDLVPS 238
TAG++GLG + S++SQ G V +CL S G L LG D S
Sbjct: 295 FGG---TAGLMGLGRTELSLVSQTAPRFG---GVFSYCLPAATSGDAAGSLSLGGD--TS 346
Query: 239 SGIAWTPMS 247
S TP+S
Sbjct: 347 SYRNATPVS 355
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 63/189 (33%), Positives = 84/189 (44%), Gaps = 34/189 (17%)
Query: 80 IDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSA------------F 123
+DTGSDLTWVQC PC+ C + L+ P + V CN C A
Sbjct: 181 VDTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 239
Query: 124 HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNP 183
+ +++C Y + Y D S GVL TD + L S+ G +FGCG + R
Sbjct: 240 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATD--TVALGGASVDG--FVFGCGLSNRGL 295
Query: 184 GPKPPPTAGVLGLGLGKASILSQLQ-SLGLTRNVLGHCL----SVRGGGYLFLGHDLVPS 238
TAG++GLG + S++SQ G V +CL S G L LG D S
Sbjct: 296 FGG---TAGLMGLGRTELSLVSQTAPRFG---GVFSYCLPAATSGDAAGSLSLGGD--TS 347
Query: 239 SGIAWTPMS 247
S TP+S
Sbjct: 348 SYRNATPVS 356
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 59/125 (47%), Gaps = 12/125 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + +GNP K Y + +DTGSD+ W+QC PC+ C + ++ P + + C+
Sbjct: 157 GEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQ-PCSDCYQQSDPIFTPAASSSYSPLTCD 215
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C++ + C N QC Y+V Y D + G VT+ GS + GC
Sbjct: 216 SQQCNSLQMSS---CR-NGQCRYQVNYGDGSFTFGDFVTETMSF---GGSGTVNSIALGC 268
Query: 177 GYNQR 181
G++
Sbjct: 269 GHDNE 273
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 114/283 (40%), Gaps = 44/283 (15%)
Query: 48 TAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCT---LPPES 104
T+V +GN +G Y V K+G PP+L + +DT +D W+ C+ C+GC+ +
Sbjct: 89 TSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG-CSGCSNASTSFNT 147
Query: 105 LYHPKNNLVACNDPFCSAFHLPENIRCEAN----DQCDYEVLYADHGSSLGVLVTDHFPL 160
+ V+C+ C+ + C ++ C + Y S LV D
Sbjct: 148 NSSSTYSTVSCSTAQCTQ---ARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQD---- 200
Query: 161 RLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHC 220
LT + P FGC N + PP G++GLG G S++SQ S L V +C
Sbjct: 201 TLTLAPDVIPNFSFGC-INSASGNSLPP--QGLMGLGRGPMSLVSQTTS--LYSGVFSYC 255
Query: 221 L-SVRG---GGYLFLGHDLVPSSGIAWTPMSRD---------------LLEKHYSSGPAE 261
L S R G L LG P S I +TP+ R+ + P
Sbjct: 256 LPSFRSFYFSGSLKLGLLGQPKS-IRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVY 314
Query: 262 LLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLK 304
L F S G I DSG+ T F Y+ D RK +
Sbjct: 315 LTFDANS----GAGTIIDSGTVITRFAQPVYEAIRDEFRKQVN 353
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 120/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 120/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y ++ +G P K ++IDTGS ++WV C C GC P + ++ V+C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 76/292 (26%), Positives = 121/292 (41%), Gaps = 48/292 (16%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCNAP----CTGCTLPP--ESLYHPKNNLVACND 117
+V+L +G PP+ + IDTGS+L+W+ CN + T P S Y P + C+
Sbjct: 74 TVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSP----IPCSS 129
Query: 118 PFCS--AFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C+ P C++N C + YAD SS G L TD F + GS P ++FG
Sbjct: 130 STCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYI----GSSGIPNVVFG 185
Query: 176 CGYNQ-RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG-GGYLFLGH 233
C + + + G++G+ G S +SQ +G + +C+S G L LG
Sbjct: 186 CMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQ---MGFPK--FSYCISEYDFSGLLLLGD 240
Query: 234 D------------LVPSSG-------IAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGL 274
L+ S +A+T + H E +F TG
Sbjct: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG-- 298
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKG--KPLEDT--AEEKALPVCWK 322
Q + DSG+ +T+ AY D G + ED+ + A+ +C++
Sbjct: 299 QTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYR 350
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 87/322 (27%), Positives = 143/322 (44%), Gaps = 47/322 (14%)
Query: 1 MEEKGKRVMGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFP--ITGNVY 58
+E RV G+ + FA +G ++ P + T R+ A+ ++G
Sbjct: 106 LERDSSRVAGIAAKIRFAV-EG-IDRSDLKPVNNEDT------RYQPEALTTPVVSGVSQ 157
Query: 59 PLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVA 114
G Y + +G P K L +DTGSD+ W+QC PC+ C + +++P ++ +
Sbjct: 158 GSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE-PCSDCYQQSDPVFNPTSSSTYKSLT 216
Query: 115 CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
C+ P CS L E C +N +C Y+V Y D ++G L TD + N + +
Sbjct: 217 CSAPQCS---LLETSACRSN-KCLYQVSYGDGSFTVGELATD--TVTFGNSGKIND-VAL 269
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG-YLFLGH 233
GCG++ N G +LGLG G SI +Q+++ + +CL R G L
Sbjct: 270 GCGHD--NEGLFTGAAG-LLGLGGGALSITNQMKATSFS-----YCLVDRDSGKSSSLDF 321
Query: 234 DLVP-SSGIAWTPMSRDL-LEKHYSSGPAELLFGGK------------STGIKGLQIIFD 279
+ V SG A P+ R+ ++ Y G + GG+ ++G G +I D
Sbjct: 322 NSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGG--VILD 379
Query: 280 SGSSYTYFNSQAYKTTLDLMRK 301
G++ T +QAY + D K
Sbjct: 380 CGTAVTRLQTQAYNSLRDAFLK 401
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 120/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 120/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 108/274 (39%), Gaps = 31/274 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + + +G PP DTGSDL W QC PC C E L+ PK + + C+
Sbjct: 92 GAYLMNISLGTPPVPMLGIADTGSDLIWRQC-LPCPNCYEQVEPLFDPKESETYKTLDCD 150
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFG 175
+ FC L + C+ ++ C Y Y D + G L +D + T G P + FG
Sbjct: 151 NEFCQ--DLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAFG 208
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL------SVRGGGYL 229
CG++ N G G++GLG G S++ QL S +CL S
Sbjct: 209 CGHD--NGGTFNEKDGGLIGLGGGPLSLVMQLSS--EVGGQFSYCLVPLSSDSTVSSKIN 264
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGL-------------QI 276
F +V SG TP+ + + Y L G ++ KG I
Sbjct: 265 FGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNI 324
Query: 277 IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
I DSG++ T Y + + G+ D
Sbjct: 325 IIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTD 358
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 60/125 (48%), Gaps = 13/125 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + IG PP + +DTGSD++WVQC APC C + ++ P ++ ++C
Sbjct: 149 GEYFSRVGIGRPPSPVYMVLDTGSDVSWVQC-APCAECYEQTDPIFEPTSSASFTSLSCE 207
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C + + E C N C YEV Y D ++G VT+ L GS + GC
Sbjct: 208 TEQCKSLDVSE---CR-NGTCLYEVSYGDGSYTVGDFVTETVTL----GSTSLGNIAIGC 259
Query: 177 GYNQR 181
G+N
Sbjct: 260 GHNNE 264
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 84/184 (45%), Gaps = 34/184 (18%)
Query: 61 GYYSVTLKIGNPPKLYELDI--------DTGSDLTWVQCNAPCTGCTLPPESLYH----P 108
G Y + +G P YE D D GSD+TW+QC PC C P +Y+
Sbjct: 123 GEYIAKITVGTP---YENDSSFEALLSPDMGSDVTWLQC-MPCFRCYHQPGPVYNRLKSS 178
Query: 109 KNNLVACNDPFCSAFHLPENIRC-EANDQCDYEVLYADHGSSLGV--LVTDHFP--LRLT 163
+ V C P C A L + C + ++C Y+V Y D SS G + T FP +R+
Sbjct: 179 SASDVGCYAPACRA--LGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFPPGVRV- 235
Query: 164 NGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV 223
P + GCG + N G P P AG+LGLG G S SQ+ G +CL+
Sbjct: 236 ------PGVAIGCGSD--NQGLFPAPAAGILGLGRGSLSFPSQIA--GRYGRSFSYCLAG 285
Query: 224 RGGG 227
+G G
Sbjct: 286 QGTG 289
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 60/125 (48%), Gaps = 12/125 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + IG+PPK + +DTGSD+ WVQC APC C + ++ P + + C
Sbjct: 51 GEYFSRVGIGSPPKHVYMVVDTGSDVNWVQC-APCADCYQQADPIFEPSFSSSYAPLTCE 109
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C + + E C ND C YEV Y D ++G T+ L +GS + GC
Sbjct: 110 THQCKSLDVSE---CR-NDSCLYEVSYGDGSYTVGDFATETITL---DGSASLNNVAIGC 162
Query: 177 GYNQR 181
G++
Sbjct: 163 GHDNE 167
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 83/297 (27%), Positives = 120/297 (40%), Gaps = 55/297 (18%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKNN----LVA 114
G Y +TL IG PP Y DTGSDL W QC APC+G C P LY+P ++ ++
Sbjct: 90 GEYLMTLSIGTPPLSYPAIADTGSDLIWTQC-APCSGDQCFAQPAPLYNPASSTTFGVLP 148
Query: 115 CNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG----- 169
CN + C Y Y G + GV ++ F T GS
Sbjct: 149 CNSSLSMCAGVLAGKAPPPGCACMYNQTYG-TGWTAGVQGSETF----TFGSAAADQARV 203
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS----VRG 225
P + FGC + +AG++GLG G S++SQ LG R +CL+
Sbjct: 204 PGIAFGCSNASSSDWNG---SAGLVGLGRGSLSLVSQ---LGAGR--FSYCLTPFQDTNS 255
Query: 226 GGYLFLG-HDLVPSSGIAWTPM----SRDLLEKHYSSGPAELLFGGKSTGIKGLQI---- 276
L LG + +G+ TP ++ + +Y L G S G K L I
Sbjct: 256 TSTLLLGPSAALNGTGVRSTPFVASPAKAPMSTYY-----YLNLTGISLGAKALSISPDA 310
Query: 277 -----------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
I DSG++ T + AY+ ++ L P D ++ L +C+
Sbjct: 311 FSLKADGTGGLIIDSGTTITSLVNAAYQQVRAAVQS-LVTLPAIDGSDSTGLDLCYA 366
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 63/128 (49%), Gaps = 15/128 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC---TGCTLPPESLYHPK----NNLV 113
G Y + +G P + Y DTGSD++W+QC PC GC ++ PK + +
Sbjct: 182 GEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ-PCDGENGCYKQIGPIFDPKSSSSYSPL 240
Query: 114 ACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
+C+ C HL + C+AN C YEV Y D ++G L T+ F R +N P L
Sbjct: 241 SCDSEQC---HLLDEAACDAN-SCIYEVEYGDGSFTVGELATETFSFRHSNSI---PNLP 293
Query: 174 FGCGYNQR 181
GCG++
Sbjct: 294 IGCGHDNE 301
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 63/128 (49%), Gaps = 15/128 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPC---TGCTLPPESLYHPK----NNLV 113
G Y + +G P + Y DTGSD++W+QC PC GC ++ PK + +
Sbjct: 182 GEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ-PCDGENGCYKQIGPIFDPKSSSSYSPL 240
Query: 114 ACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
+C+ C HL + C+AN C YEV Y D ++G L T+ F R +N P L
Sbjct: 241 SCDSEQC---HLLDEAACDAN-SCIYEVEYGDGSFTVGELATETFSFRHSNSI---PNLP 293
Query: 174 FGCGYNQR 181
GCG++
Sbjct: 294 IGCGHDNE 301
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 61/129 (47%), Gaps = 9/129 (6%)
Query: 57 VYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----L 112
V G Y + +G+PP +DTGSD+ W+QC PC C ++ P +
Sbjct: 85 VASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCE-PCEDCYKQTTPIFDPSKSKTYKT 143
Query: 113 VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PR 171
+ C+ C + N C +++ C+Y + Y D S G L + L T+GS + P+
Sbjct: 144 LPCSSNTCESLR---NTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPK 200
Query: 172 LIFGCGYNQ 180
+ GCG+N
Sbjct: 201 TVIGCGHNN 209
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 69/260 (26%), Positives = 107/260 (41%), Gaps = 26/260 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG---CTLPPESLYHPKNN----LVAC 115
Y +++ +G+P + IDTGSD++WVQC PC C +L+ P + C
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCE-PCPAPSPCHAHAGALFDPAASSTYAAFNC 193
Query: 116 NDPFCSAF-HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIF 174
+ C+ E C+A +C Y V Y D ++ G +D L +GS + F
Sbjct: 194 SAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL---SGSDVVRGFQF 250
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV--RGGGYLFLG 232
GC + + G T G++GLG S++SQ + +CL G+L LG
Sbjct: 251 GCSHAELGAG-MDDKTDGLIGLGGDAQSLVSQTAA--RYGKSFSYCLPATPASSGFLTLG 307
Query: 233 HDLVPSSG----IAWTPMSRD-LLEKHYSSGPAELLFGGKSTGIK----GLQIIFDSGSS 283
G A TPM R + +Y + ++ GGK G+ + DSG+
Sbjct: 308 APASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGSLVDSGTV 367
Query: 284 YTYFNSQAYKTTLDLMRKDL 303
T AY R +
Sbjct: 368 ITRLPPAAYAALSSAFRAGM 387
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 111/244 (45%), Gaps = 40/244 (16%)
Query: 12 LVLLMFATF-----QGCFSEANQPP------SKKKSTQSTAAHRFGSTAVFPITGNVYPL 60
L++ +F +F + C S +NQPP ++K T + F +T+ T + L
Sbjct: 6 LLVQLFISFILLQSKHCLS-SNQPPIVLALRTQKHRTPISTPRLFSTTS--KTTDKL--L 60
Query: 61 GYYSVTLKI----GNPPKLYELDIDTGSDLTWVQC-NAPCTGCTLPPESLYHPKNNLVAC 115
+++VTL + G P + + +DTGS+L+W+ C P P L + C
Sbjct: 61 FHHNVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKKEPNFNSIFNP--LASKTYTKIPC 118
Query: 116 NDPFCS--AFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
+ P C LP + C+ C + + YAD S G L + F + GS+ GP +
Sbjct: 119 SSPTCETRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRV----GSVTGPATV 174
Query: 174 FGC---GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR-GGGYL 229
FGC G++ + + T G++G+ G S ++Q +G + +C+S R G L
Sbjct: 175 FGCMDSGFSSNS--EEDAKTTGLMGMNRGSLSFVNQ---MGFRK--FSYCISDRDSSGVL 227
Query: 230 FLGH 233
LG
Sbjct: 228 LLGE 231
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 83/179 (46%), Gaps = 19/179 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG---CTLPPESLYHPKNN----LVAC 115
Y VT +G P +++DTGSDL+WVQC PC+ C + L+ P + V C
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCSAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
P C+ + + QC Y V Y D ++ GV +D L ++ ++ G FG
Sbjct: 199 GGPVCAGLGI-YAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS-AVQG--FFFG 254
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLG 232
CG+ Q G+LGLG + S++ Q+ G V +CL + GYL LG
Sbjct: 255 CGHAQSGLFNG---VDGLLGLGREQPSLVE--QTAGTYGGVFSYCLPTKPSTAGYLTLG 308
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 111/261 (42%), Gaps = 28/261 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y VT+++G + + +DTGSDL+WVQC PC C + +++P + V CN
Sbjct: 66 YIVTVELGG--RKMTVIVDTGSDLSWVQCQ-PCNRCYNQQDPVFNPSKSPSYRTVLCNSL 122
Query: 119 FCSAFHLPENIR--CEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C + L C +N C+Y V Y D + G + +H L G+ IFG
Sbjct: 123 TCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNL----GNTTVNNFIFG 178
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLFLG 232
CG ++N G +G++GLG S++SQ+ + V +CL G L +G
Sbjct: 179 CG--RKNQGLF-GGASGLVGLGRTDLSLISQISP--MFGGVFSYCLPTTEAEASGSLVMG 233
Query: 233 HD---LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGL---QIIFDSGSSYTY 286
+ ++ I++T M + L Y + GG ++I DSG+ +
Sbjct: 234 GNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAPSFGKDRMIIDSGTVISR 293
Query: 287 FNSQAYKTTLDLMRKDLKGKP 307
Y+ K G P
Sbjct: 294 LPPSIYQALKAEFVKQFSGYP 314
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 112/264 (42%), Gaps = 19/264 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y +TL IG PP DTGSDL WVQC +PC C L+ P + C+
Sbjct: 90 GEYLMTLYIGTPPVERLAIADTGSDLIWVQC-SPCQNCFPQDTPLFEPLKSSTFKAATCD 148
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG--SLLGPRLIF 174
C++ P +C QC Y Y D ++GV+ T+ T ++ P IF
Sbjct: 149 SQPCTSVP-PSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIF 207
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCL---SVRGGGYLF 230
GCG G++GLG G S++SQL +G +CL S L
Sbjct: 208 GCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYK---FSYCLLPFSSNSTSKLK 264
Query: 231 LGHD-LVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKS--TGIKGLQIIFDSGSSYTY 286
G + +V ++G+ TP+ + L Y + G K TG II DSG+ TY
Sbjct: 265 FGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPTGRTDGNIIIDSGTVLTY 324
Query: 287 FNSQAYKTTLDLMRKDLKGKPLED 310
Y + +++ L + +D
Sbjct: 325 LEQTFYNNFVASLQEVLSVESAQD 348
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 73/281 (25%), Positives = 118/281 (41%), Gaps = 26/281 (9%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFC 120
G Y ++ +G PP +DT SD+ WVQC C C ++ P + N P C
Sbjct: 86 GDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQL-CETCYNDTSPMFDPSYSKTYKNLP-C 143
Query: 121 SAFHLP--ENIRCEANDQ--CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFG 175
S+ + C ++++ C++ V Y D S G L+ + L N + PR + G
Sbjct: 144 SSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIG 203
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLFLG 232
C N + G++GLG G S++ QL S +++ +CL S R F
Sbjct: 204 CIRNTN----VSFDSIGIVGLGGGPVSLVPQLSS-SISKK-FSYCLAPISDRSSKLKFGD 257
Query: 233 HDLVPSSGIAWTPMSRDLLEKHY-------SSGPAELLFGGKSTGIKGL-QIIFDSGSSY 284
+V G T + +K Y S G + F S+ G II DSG+++
Sbjct: 258 AAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSGTTF 317
Query: 285 TYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTW 325
T Y + +K + ED ++ +L C+K T+
Sbjct: 318 TVLPDDVYSKLESAVADVVKLERAEDPLKQFSL--CYKSTY 356
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 59/135 (43%), Gaps = 15/135 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + +G PP+ + + +DTGSDL W+QC APC C ++ P + V C
Sbjct: 149 GEYLMDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNVTCG 207
Query: 117 DPFC--------SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLT--NGS 166
D C P R D C Y Y D ++ G L + F + LT S
Sbjct: 208 DHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGAS 267
Query: 167 LLGPRLIFGCGYNQR 181
++FGCG+ R
Sbjct: 268 RRVDGVVFGCGHRNR 282
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 82/194 (42%), Gaps = 28/194 (14%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYE---LDIDTGSDLTW 88
+KK+ +TA + P+ G Y V L+IG P + DTGSDL+W
Sbjct: 70 AKKEIQLATAIAAGDKKLLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSW 129
Query: 89 VQCNAPCTGCT----LPPESLYHPKN-NLVACNDPFCSAFHLPENIRCEA-------NDQ 136
QC PCT C+ PP + ++C DP C C A +
Sbjct: 130 TQCE-PCTNCSSFTPYPPHDPSKSRTFRRLSCFDPMCEL--------CTAVVDGGGGSAG 180
Query: 137 CDYEVLYADHGSSLGVLVTDHFPLRLT---NGSLLGPRLIFGCGYNQRNPGPKPPPTAGV 193
C + Y D G+ G LV+D F G L + FGC + + + + T G+
Sbjct: 181 CLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAFGCAHVEDSKAVRGYST-GI 239
Query: 194 LGLGLGKASILSQL 207
L LG+GK S ++QL
Sbjct: 240 LALGIGKPSFVTQL 253
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 82/194 (42%), Gaps = 28/194 (14%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYE---LDIDTGSDLTW 88
+KK+ +TA + P+ G Y V L+IG P + DTGSDL+W
Sbjct: 91 AKKEIQLATAIAAGDKKLLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSW 150
Query: 89 VQCNAPCTGCT----LPPESLYHPKN-NLVACNDPFCSAFHLPENIRCEA-------NDQ 136
QC PCT C+ PP + ++C DP C C A +
Sbjct: 151 TQCE-PCTNCSSFTPYPPHDPSKSRTFRRLSCFDPMCEL--------CTAVVDGGGGSAG 201
Query: 137 CDYEVLYADHGSSLGVLVTDHFPLRLT---NGSLLGPRLIFGCGYNQRNPGPKPPPTAGV 193
C + Y D G+ G LV+D F G L + FGC + + + + T G+
Sbjct: 202 CLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAFGCAHVEDSKAVRGYST-GI 260
Query: 194 LGLGLGKASILSQL 207
L LG+GK S ++QL
Sbjct: 261 LALGIGKPSFVTQL 274
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 66/256 (25%), Positives = 105/256 (41%), Gaps = 22/256 (8%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAF--H 124
+ IG P + + +D GSDL WV C+ C C S Y + + P S+ H
Sbjct: 117 IDIGTPHVSFLVALDAGSDLLWVPCD--CLQCAPLSASYYSSLDRDLNEYSPSHSSTSKH 174
Query: 125 LP-ENIRCEANDQCD-------YEV-LYADHGSSLGVLVTDHFPLRLTNG------SLLG 169
L + CE C+ Y + Y ++ SS G+LV D L +NG S+
Sbjct: 175 LSCSHQLCELGPNCNSPKQPCPYSMDYYTENTSSSGLLVEDILHLA-SNGDNALSYSVRA 233
Query: 170 PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYL 229
P ++ GCG Q G++GLGL + S+ S L GL RN C G +
Sbjct: 234 P-VVIGCGMKQSGGYLDGVAPDGLMGLGLAEISVPSFLAKAGLIRNSFSMCFDEDDSGRI 292
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNS 289
F G D P++ + ++ D Y G G + + D+G+S+T+ +
Sbjct: 293 FFG-DQGPTTQQSTPFLTLDGNYTTYVVGVEGFCVGSSCLKQTSFRALVDTGTSFTFLPN 351
Query: 290 QAYKTTLDLMRKDLKG 305
Y+ + + +
Sbjct: 352 GVYERITEEFDRQVNA 367
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 78/299 (26%), Positives = 114/299 (38%), Gaps = 33/299 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + + +G PP DTGSDL W QC PC C E L+ PK + + CN
Sbjct: 92 GSYLMNISLGTPPVSMLGIADTGSDLIWRQC-LPCDDCYKQVEPLFDPKKSKTYKTLGCN 150
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIFG 175
+ FC L + C ++ C Y D + L ++ F + T G P L FG
Sbjct: 151 NDFCQ--DLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFG 208
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL------SVRGGGYL 229
CG++ N G +G++GLG G S++ QL S +CL S
Sbjct: 209 CGHS--NGGTFNEKDSGLIGLGGGPLSLVMQLSS--KVGGQFSYCLVPLSSDSTASSKIN 264
Query: 230 FLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGL-------------QI 276
F +V SG TP+ + + Y + G + KG I
Sbjct: 265 FGKSAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESNI 324
Query: 277 IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCLLGNFEWH 335
I DSG++ T Y + K + G+ D +C+ G K + H
Sbjct: 325 IIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTD--PRGTFSLCYSGVKKLEIPTITAH 381
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 82/194 (42%), Gaps = 28/194 (14%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYE---LDIDTGSDLTW 88
+KK+ +TA + P+ G Y V L+IG P + DTGSDL+W
Sbjct: 73 AKKEIQLATAIAAGDKKLLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSW 132
Query: 89 VQCNAPCTGCT----LPPESLYHPKN-NLVACNDPFCSAFHLPENIRCEA-------NDQ 136
QC PCT C+ PP + ++C DP C C A +
Sbjct: 133 TQCE-PCTNCSSFTPYPPHDPSKSRTFRRLSCFDPMCEL--------CTAVVDGGGGSAG 183
Query: 137 CDYEVLYADHGSSLGVLVTDHFPLRLT---NGSLLGPRLIFGCGYNQRNPGPKPPPTAGV 193
C + Y D G+ G LV+D F G L + FGC + + + + T G+
Sbjct: 184 CLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAFGCAHVEDSKAVRGYST-GI 242
Query: 194 LGLGLGKASILSQL 207
L LG+GK S ++QL
Sbjct: 243 LALGIGKPSFVTQL 256
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 112/276 (40%), Gaps = 47/276 (17%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + IG P + + +DTGSD+ W+QC PC C + +++P +++ V C+
Sbjct: 6 GEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE-PCRECYSQADPIFNPSSSVSFSTVGCD 64
Query: 117 DPFCSAFHLPENIRCEAND----QCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRL 172
CS + +AND C YEV Y D ++G T+ LT G+ +
Sbjct: 65 SAVCS--------QLDANDCHGGGCLYEVSYGDGSYTVGSYATE----TLTFGTTSIQNV 112
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYL 229
GCG++ G+ L S +QL + T +CL R G L
Sbjct: 113 AIGCGHDNVGLFVGAAGLLGLGAGSL---SFPAQLGT--QTGRAFSYCLVDRDSESSGTL 167
Query: 230 FLGHDLVPSSGIAWTPMSRD-LLEKHY--------------SSGPAELLFGGKSTGIKGL 274
G + VP I +TP+ + L Y S P+E ++TG G
Sbjct: 168 EFGPESVPIGSI-FTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGG- 225
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
II DSG++ T + AY D + P D
Sbjct: 226 -IIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRAD 260
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 121/284 (42%), Gaps = 42/284 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PGFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G+ S+L Q T + +CL S RG
Sbjct: 113 CNMDSFGAN-EFGNVDGLLGMGAGQMSVLKQSSP---TFDGFSYCLPLQMSERGFFSKTT 168
Query: 227 GYLFLGHDLVPS-SGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQII 277
GY LG + + + + +T M +R + + + G+ G+ KG ++
Sbjct: 169 GYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKG--VV 226
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
FDSGS +Y +A +R+ L + AEE++ C+
Sbjct: 227 FDSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 267
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 93/194 (47%), Gaps = 27/194 (13%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN----NLVACNDPF 119
+V+L +G+PP+ + +DTGS+L+W+ C T S+++P + + V C P
Sbjct: 70 TVSLTVGSPPQNVTMVLDTGSELSWLHCKK-----TQFLNSVFNPLSSKTYSKVPCLSPT 124
Query: 120 CS--AFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC- 176
C L + C+A C V YAD S G L + F L GSL P IFGC
Sbjct: 125 CKTRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRL----GSLTKPATIFGCM 180
Query: 177 --GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLS-VRGGGYLFLGH 233
G++ + + T G++G+ G S ++Q +G + +C+S G L LG+
Sbjct: 181 DSGFSSNS--EEDSKTTGLIGMNRGSLSFVNQ---MGYPK--FSYCISGFDSAGVLLLGN 233
Query: 234 DLVP-SSGIAWTPM 246
P +++TP+
Sbjct: 234 ASFPWLKPLSYTPL 247
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 64/154 (41%), Gaps = 17/154 (11%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN-------NLVACNDPF 119
+ +G P + IDTGS ++WVQC C + N V C+
Sbjct: 27 ISLGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYRRVGCSAQV 86
Query: 120 CSAFHLPENIR---CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C H+ +NI E D C Y + YA S G L D L L N + + IFGC
Sbjct: 87 CHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAGYLSQDR--LTLANSYSI-QKFIFGC 143
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSL 210
G + R G +AG++G G S +Q+ L
Sbjct: 144 GSDNRYNG----HSAGIIGFGNKSYSFFNQIAQL 173
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 123/284 (43%), Gaps = 44/284 (15%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PGFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDGFSYCLPLQMSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED-TAEEKALPVCW 321
DSGS +Y +A L ++R+ ++ L+ AEE++ C+
Sbjct: 226 DSGSELSYIPDRA----LSVLRQRIRELLLKRGAAEEESERNCY 265
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 121/284 (42%), Gaps = 42/284 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PGFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G+ S+L Q T + +CL S RG
Sbjct: 113 CNMDSFGAN-EFGNVDGLLGMGAGQMSVLKQSSP---TFDGFSYCLPLQMSERGFFSKTT 168
Query: 227 GYLFLGHDLVPS-SGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQII 277
GY LG + + + + +T M +R + + + G+ G+ KG ++
Sbjct: 169 GYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKG--VV 226
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
FDSGS +Y +A +R+ L + AEE++ C+
Sbjct: 227 FDSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 267
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 102/264 (38%), Gaps = 32/264 (12%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCS--AFH 124
+ IG P + + +D GSD+ WV C+ C C Y+ + + P S + H
Sbjct: 109 IDIGTPNVSFLVALDAGSDMLWVPCD--CIECASLSAGNYNVLDRDLNQYRPSLSNTSRH 166
Query: 125 LP--------ENIRCEANDQCDYEVLYAD-HGSSLGVLVTDHFPLRLTNG-----SLLGP 170
LP ++ + D C Y V Y+ + SS G + D L +NG + +
Sbjct: 167 LPCGHKLCDVHSVCKGSKDPCPYAVQYSSANTSSSGYVFEDKLHL-TSNGKHAEQNSVQA 225
Query: 171 RLIFGCGYNQ-----RNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRG 225
+I GCG Q R GP GVLGLG G S+ S L GL +N C
Sbjct: 226 SIILGCGRKQTGEYLRGAGPD-----GVLGLGPGNISVPSLLAKAGLIQNSFSICFEENE 280
Query: 226 GGYLFLGHD-LVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSY 284
G + G V + P+ Y G G Q + DSGSS+
Sbjct: 281 SGRIIFGDQGHVTQHSTPFLPIDGKF--NAYIVGVESFCVGSLCLKETRFQALIDSGSSF 338
Query: 285 TYFNSQAYKTTLDLMRKDLKGKPL 308
T+ ++ Y+ + K + +
Sbjct: 339 TFLPNEVYQKVVIEFDKQVNATSI 362
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 83/324 (25%), Positives = 138/324 (42%), Gaps = 51/324 (15%)
Query: 1 MEEKGKRVMGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPL 60
+E RV G++ + FA +G +P + + T +T V ++G
Sbjct: 106 LERDSSRVAGIVAKIRFAV-EGVDRSDLKPVYNEDTRYQTEDL---TTPV--VSGASQGS 159
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + +G P K L +DTGSD+ W+QC PC C + +++P ++ + C+
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSSTYKSLTCS 218
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL----RLTNGSLLGPRL 172
P CS L E C +N +C Y+V Y D ++G L TD ++ N +L
Sbjct: 219 APQCS---LLETSACRSN-KCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVAL----- 269
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG-YLFL 231
GCG++ G+ G L SI +Q+++ + +CL R G L
Sbjct: 270 --GCGHDNEGLFTGAAGLLGLGGGVL---SITNQMKATSFS-----YCLVDRDSGKSSSL 319
Query: 232 GHDLVP-SSGIAWTPMSRD-LLEKHYSSGPAELLFGGK------------STGIKGLQII 277
+ V G A P+ R+ ++ Y G + GG+ ++G G +I
Sbjct: 320 DFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGG--VI 377
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRK 301
D G++ T +QAY + D K
Sbjct: 378 LDCGTAVTRLQTQAYNSLRDAFLK 401
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K L+IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFSFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNMDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDGFSYCLPLQMSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + +G P + + +DTGSD+ W+QC APC C + ++ P + + C
Sbjct: 127 GEYFTRIGVGTPARYVYMVLDTGSDVVWLQC-APCRKCYTQADPVFDPTKSRTYAGIPCG 185
Query: 117 DPFCSAFHLPENIRC-EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
P C P C N C Y+V Y D + G T+ R T + R+ G
Sbjct: 186 APLCRRLDSPG---CNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTRVT----RVALG 238
Query: 176 CGYNQ 180
CG++
Sbjct: 239 CGHDN 243
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/306 (27%), Positives = 124/306 (40%), Gaps = 45/306 (14%)
Query: 24 FSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTG 83
+ A++ P + K + A + + + P V + Y V +K+G P + + +DT
Sbjct: 7 ITMASKDPERLKYLSTLADQKTTAVPIAP-GQQVLKIANYVVRVKLGTPGQQMFMVLDTS 65
Query: 84 SDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEA--NDQC 137
+D WV PC+GCT + + P + + C++ CS C A + C
Sbjct: 66 NDAAWV----PCSGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVR---GFSCPATGSSAC 118
Query: 138 DYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLG 197
+ Y S LV D + L N + P FGC N + G PP G+LGLG
Sbjct: 119 LFNQSYGGDSSLAATLVQDA--ITLANDVI--PGFTFGC-INAVSGGSIPP--QGLLGLG 171
Query: 198 LGKASILSQLQSLGLTRNVLGHCL----SVRGGGYLFLGHDLVPSSGIAWTPMSRD---- 249
G S++SQ + + V +CL S G L LG P S I TP+ R+
Sbjct: 172 RGPISLISQAGA--MYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKS-IRTTPLLRNPHRP 228
Query: 250 ----LLEKHYSSG------PAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLM 299
+ S G P+E L +TG I DSG+ T F Y D
Sbjct: 229 SLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAG---TIIDSGTVITRFVQPVYFAIRDEF 285
Query: 300 RKDLKG 305
RK + G
Sbjct: 286 RKQVNG 291
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/306 (27%), Positives = 124/306 (40%), Gaps = 45/306 (14%)
Query: 24 FSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTG 83
+ A++ P + K + A + + + P V + Y V +K+G P + + +DT
Sbjct: 7 ITMASKDPERLKYLSTLADQKTTAVPIAP-GQQVLKIANYVVRVKLGTPGQQMFMVLDTS 65
Query: 84 SDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEA--NDQC 137
+D WV PC+GCT + + P + + C++ CS C A + C
Sbjct: 66 NDAAWV----PCSGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVR---GFSCPATGSSAC 118
Query: 138 DYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLG 197
+ Y S LV D + L N + P FGC N + G PP G+LGLG
Sbjct: 119 LFNQSYGGDSSLAATLVQDA--ITLANDVI--PGFTFGC-INAVSGGSIPP--QGLLGLG 171
Query: 198 LGKASILSQLQSLGLTRNVLGHCL----SVRGGGYLFLGHDLVPSSGIAWTPMSRD---- 249
G S++SQ + + V +CL S G L LG P S I TP+ R+
Sbjct: 172 RGPISLISQAGA--MYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKS-IRTTPLLRNPHRP 228
Query: 250 ----LLEKHYSSG------PAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLM 299
+ S G P+E L +TG I DSG+ T F Y D
Sbjct: 229 SLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAG---TIIDSGTVITRFVQPVYFAIRDEF 285
Query: 300 RKDLKG 305
RK + G
Sbjct: 286 RKQVNG 291
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y ++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 57/126 (45%), Gaps = 9/126 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + L +G PP DTGS+L W QC PC C + L+ PK + V+C+
Sbjct: 92 GEYLMNLSLGTPPSPIMAVADTGSNLIWTQCK-PCDDCYTQVDPLFDPKASSTYKDVSCS 150
Query: 117 DPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLT-NGSLLGPRLIF 174
C+A L C D+ C Y V YAD ++G D L T N + +I
Sbjct: 151 SSQCTA--LENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQLKNIII 208
Query: 175 GCGYNQ 180
GCG N
Sbjct: 209 GCGQNN 214
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 82/179 (45%), Gaps = 19/179 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG---CTLPPESLYHPKNN----LVAC 115
Y VT +G P +++DTGSDL+WVQC PC C + L+ P + V C
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
P C+ + + QC Y V Y D ++ GV +D L ++ ++ G FG
Sbjct: 199 GGPVCAGLGI-YAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS-AVQG--FFFG 254
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLG 232
CG+ Q G+LGLG + S++ Q+ G V +CL + GYL LG
Sbjct: 255 CGHAQSGLFNG---VDGLLGLGREQPSLVE--QTAGTYGGVFSYCLPTKPSTAGYLTLG 308
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 102/243 (41%), Gaps = 27/243 (11%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y +T IG PP+ DTGSDL W +C A CT C Y+P + + C+
Sbjct: 80 GAYDMTFSIGTPPQELSALADTGSDLIWAKCGA-CTRCVPQGSPSYYPNKSSSFSKLPCS 138
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYA----DHGSSLGVLVTDHFPLRLTNGSLLGPRL 172
CS LP + +CDY+ Y H + G L ++ F L GS P +
Sbjct: 139 GSLCS--DLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTL----GSDAVPGI 192
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGY--LF 230
FGC +G++GLG G S++SQL +CL+ L
Sbjct: 193 GFGC---TTMSEGGYGSGSGLVGLGRGPLSLVSQLN-----VGAFSYCLTSDAAKTSPLL 244
Query: 231 LGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ-IIFDSGSSYTYFNS 289
G + +G+ TP+ R +Y+ + G +T G IIFDSG++ +
Sbjct: 245 FGSGALTGAGVQSTPLLR-TSTYYYTVNLESISIGAATTAGTGSSGIIFDSGTTVAFLAE 303
Query: 290 QAY 292
AY
Sbjct: 304 PAY 306
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PGFSFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNMDSFGAN-EFGNVDGLLGMGAGAMSVLKQSSP---TFDCFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 82/303 (27%), Positives = 119/303 (39%), Gaps = 37/303 (12%)
Query: 47 STAVFPIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA----PCTGCTLP 101
S A PI G++ Y +T+ IG+P + IDTGSD++W++C + P T T
Sbjct: 114 SEATVPIALGSLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCKSRLYDPGTSSTYA 173
Query: 102 PESLYHPKNNLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLR 161
P S C+ P C+ C + C Y V Y D ++ G +D L
Sbjct: 174 PFS----------CSAPACAQLGR-RGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLA 222
Query: 162 LTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
T+ L+ FGC + G + T G++GLG S +S Q+ + +CL
Sbjct: 223 GTSEPLIS-GFQFGCSAVEH--GFEEDNTDGLMGLGGDAQSFVS--QTAATYGSAFSYCL 277
Query: 222 --SVRGGGYLFLG-HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQI-- 276
+ G+L LG S+ + TPM R + LL G S G K L+I
Sbjct: 278 PPTWNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYG----LLLRGISVGGKTLEIPS 333
Query: 277 -------IFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGTWKCLL 329
I DSG+ T AY R + + A L C+ T
Sbjct: 334 SVFSAGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCFDFTGHGEG 393
Query: 330 GNF 332
NF
Sbjct: 394 NNF 396
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PGFSFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNMDSFGAN-EFGNVDGLLGMGAGAMSVLKQSSP---TFDCFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 81/324 (25%), Positives = 138/324 (42%), Gaps = 51/324 (15%)
Query: 1 MEEKGKRVMGLLVLLMFATFQGCFSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPL 60
+E RV G++ + FA +G ++ P + T+ ++ ++G
Sbjct: 106 LERDSSRVAGIVAKIRFAV-EGV-DRSDLKPVYNEDTR----YQTEDLTTPVVSGASQGS 159
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACN 116
G Y + +G P K L +DTGSD+ W+QC PC C + +++P ++ + C+
Sbjct: 160 GEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSSTYKSLTCS 218
Query: 117 DPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPL----RLTNGSLLGPRL 172
P CS L E C +N +C Y+V Y D ++G L TD ++ N +L
Sbjct: 219 APQCS---LLETSACRSN-KCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVAL----- 269
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGG-YLFL 231
GCG++ G+ G L SI +Q+++ + +CL R G L
Sbjct: 270 --GCGHDNEGLFTGAAGLLGLGGGVL---SITNQMKATSFS-----YCLVDRDSGKSSSL 319
Query: 232 GHDLVP-SSGIAWTPMSRD-LLEKHYSSGPAELLFGGK------------STGIKGLQII 277
+ V G A P+ R+ ++ Y G + GG+ ++G G +I
Sbjct: 320 DFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGG--VI 377
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRK 301
D G++ T +QAY + D K
Sbjct: 378 LDCGTAVTRLQTQAYNSLRDAFLK 401
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y ++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|357152658|ref|XP_003576193.1| PREDICTED: F-box/FBD/LRR-repeat protein At5g22660-like
[Brachypodium distachyon]
Length = 594
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 86/237 (36%), Gaps = 68/237 (28%)
Query: 96 TGCTLP----PESLYHPKN-NLVACNDPFCSAFHLPENIRCEAN---DQCDYEVLYADHG 147
+ C +P P LY P+ N + C D C H +I + +QCDYE+ Y +
Sbjct: 373 SSCGVPSDHVPHDLYKPRRMNKLLCGDERCVKVHKDLDIEQDCTLDPNQCDYEIEYTNGE 432
Query: 148 SSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
+S+GVL+ D F L T L L FGCGY + G + P GVL +G
Sbjct: 433 NSMGVLLADTFSLPTTTNDRLN--LAFGCGYGHQG-GQEVTPVDGVLRIGFTTHRPTQHT 489
Query: 208 QSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGK 267
Q P ++ PM
Sbjct: 490 Q----------------------------PQRTVSKEPM--------------------- 500
Query: 268 STGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAE-EKALPVCWKG 323
+++FDSGS+Y+ + Y + + L+G L + + LP CW+
Sbjct: 501 -------EVVFDSGSTYSIVLEETYARLVSAVGVTLQGSSLAEVVDPNPELPRCWQD 550
>gi|125595845|gb|EAZ35625.1| hypothetical protein OsJ_19916 [Oryza sativa Japonica Group]
Length = 152
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 64/136 (47%), Gaps = 17/136 (12%)
Query: 80 IDTGSDLTWVQCNAPCTGCTLPP-----ESLYHPKNN----LVACNDPFCSAFHLPENIR 130
+DT SD+TWVQC+ C PP + LY P + + +CN P C+ P
Sbjct: 21 LDTASDVTWVQCSP----CPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLG-PYANG 75
Query: 131 CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPT 190
C N+QC Y V Y D S+ G ++D L +T + + FGC +
Sbjct: 76 CTNNNQCQYRVRYPDGTSTAGTYISDL--LTITPATAVR-SFQFGCSKGVKGSFSFGSSA 132
Query: 191 AGVLGLGLGKASILSQ 206
AG++ LG G S++SQ
Sbjct: 133 AGIMALGGGPESLVSQ 148
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 73/171 (42%), Gaps = 13/171 (7%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN---NLVACNDPF 119
Y + +G PP + + +DTGSDL W+ CN T C E + P++ NL N
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT-CIRDLEDIGVPQSVPLNLYTPNAST 160
Query: 120 CSAFHLPENIRCEANDQCD-------YEVLYADHGSSLGVLVTD--HFPLRLTNGSLLGP 170
S+ + RC + +C Y++ Y++ + G L+ D H N + +
Sbjct: 161 TSSSIRCSDKRCFGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTPVKA 220
Query: 171 RLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
+ GCG Q + GVLGLG+ S+ S L +T N C
Sbjct: 221 NVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCF 271
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y ++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 62/133 (46%), Gaps = 13/133 (9%)
Query: 53 ITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL 112
++G G Y + IG PP + +DTGSD++WVQC APC C + + P ++
Sbjct: 141 VSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQC-APCAECYEQTDPXFEPTSSA 199
Query: 113 ----VACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLL 168
++C C + + E C N C YEV Y D ++G VT+ L GS
Sbjct: 200 SFTSLSCETEQCKSLDVSE---CR-NGTCLYEVSYGDGSYTVGDFVTETVTL----GSTS 251
Query: 169 GPRLIFGCGYNQR 181
+ GCG+N
Sbjct: 252 LGNIAIGCGHNNE 264
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 111/283 (39%), Gaps = 31/283 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG PP DTGSDL W QC PC C ++ P + V+C
Sbjct: 89 GEYLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKSTSFKEVSCE 147
Query: 117 DPFCSAFHLPENIRC-EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNG---SLLGPRL 172
C L + + C + CD+ Y D + GV+ T+ L +G S+L +
Sbjct: 148 SQQC---RLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSIL--NI 202
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS-LGLTRNVLGHCLSVR-----GG 226
+FGCG+N N G G+ G G S+ SQ+ S LG R + R
Sbjct: 203 VFGCGHN--NSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITS 260
Query: 227 GYLFLGHDLVPSSGIAWTPMSRDLLEKHY-------SSGPAELLFGGKSTGIKGLQIIFD 279
+F V S + TP+ +Y S G F S + D
Sbjct: 261 KIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFID 320
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWK 322
+G+ T Y + +++ + +P++D + L C++
Sbjct: 321 AGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQL--CYR 361
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PGFSFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNMDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDCFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLKRG---AAEEESERNCY 265
>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
Length = 453
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 89/204 (43%), Gaps = 27/204 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT-GCTLPPES---LYHPKNNL----VA 114
+ + +K+G P Y + +DTGS L+WVQC PCT C + P ++ P N+ V
Sbjct: 53 FLIPVKLGTPAVQYLVTMDTGSSLSWVQCR-PCTIKCHVQPAKVGPIFDPSNSSTFRHVG 111
Query: 115 CNDPFCSAFHLPENIRCEA----NDQCDYEVLY-ADHGSSLGVLVTDHFPL---RLTNGS 166
C+ CS I+ +A D C Y + Y S+G VTD L T +
Sbjct: 112 CSTSICSYLGRTLRIQSKACMEWEDICLYTMSYGGGWAYSVGKAVTDRLVLGGGETTRTT 171
Query: 167 LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG 226
L +FGC + + K AG+ GLG S Q+ L L+ +CL
Sbjct: 172 LSLANFVFGCSMDTQYSTHK---EAGIFGLGTSNYS-FEQIAPL-LSYKAFSYCLPSDEA 226
Query: 227 --GYLFLGHDL---VPSSGIAWTP 245
GYL +G D VP+S TP
Sbjct: 227 HQGYLSIGPDSSGGVPTSMFPGTP 250
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 82/179 (45%), Gaps = 19/179 (10%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG---CTLPPESLYHPKNN----LVAC 115
Y VT +G P +++DTGSDL+WVQC PC C + L+ P + V C
Sbjct: 48 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCAAAPSCYSQKDPLFDPAQSSSYAAVPC 106
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
P C+ + + QC Y V Y D ++ GV +D L ++ ++ G FG
Sbjct: 107 GGPVCAGLGI-YAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS-AVQG--FFFG 162
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR--GGGYLFLG 232
CG+ Q G+LGLG + S++ Q+ G V +CL + GYL LG
Sbjct: 163 CGHAQSGLFNG---VDGLLGLGREQPSLVE--QTAGTYGGVFSYCLPTKPSTAGYLTLG 216
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 64/276 (23%), Positives = 104/276 (37%), Gaps = 32/276 (11%)
Query: 58 YPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG---CTLPPESLYHPKNNLVA 114
Y +G Y V K+G P + + L DTGSDLTW+ C C + H +
Sbjct: 78 YGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 137
Query: 115 CNDPFCSAFHLPENIRCEANDQ------------CDYEVLYADHGSSLGVLVTDHFPLRL 162
+ F + L + + E D C Y+ Y+D ++LG + + L
Sbjct: 138 LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 197
Query: 163 TNGSLLG-PRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASI-LSQLQSLG--LTRNVLG 218
G + ++ GC ++ G GV+GLG K S + + G + ++
Sbjct: 198 KEGRKMKLHNVLIGC--SESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVD 255
Query: 219 HCLSVRGGGYLFLGHDLVPSS---GIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI---- 271
H YL G + + +T + ++ Y+ + GG I
Sbjct: 256 HLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEV 315
Query: 272 ---KGL-QIIFDSGSSYTYFNSQAYKTTLDLMRKDL 303
KG I DSGSS T+ AY+ + +R L
Sbjct: 316 WDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSL 351
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 111/271 (40%), Gaps = 38/271 (14%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC--TLP-----PESLYHPKNNLV 113
G Y + + G P + IDTGSD+ W+ C C GC T P S Y P
Sbjct: 113 GEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQ-CQGCHSTAPIFDPAKSSSYKP----F 167
Query: 114 ACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
AC+ C + C N +C +EV Y D G L +D L GS P
Sbjct: 168 ACDSQPCQEI----SGNCGGNSKCQFEVSYGDGTQVDGTLASDAITL----GSQYLPNFS 219
Query: 174 FGCGYN-QRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLF 230
FGC + + P P G++GLG G S+L+Q + L +CL S G L
Sbjct: 220 FGCAESLSEDTSPSP----GLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLV 275
Query: 231 LGHD-LVPSSGIAWTPMSRD-LLEKHYSSGPAELLFGGKSTGIKGLQI------IFDSGS 282
LG + V SS + +T + +D + Y + G + G I I DSG+
Sbjct: 276 LGKEAAVSSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGT 335
Query: 283 SYTYFNSQAYKTTLDLMRKDLKG---KPLED 310
+ T+ AY D R+ L P+ED
Sbjct: 336 TITHLVPSAYTALRDAFRQQLSSLQPTPVED 366
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 121/290 (41%), Gaps = 48/290 (16%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
+ L IGNPP + +DTGSDL W+QC PC C + +Y+ + + CN+P
Sbjct: 93 FLANLSIGNPPTNVYVVLDTGSDLFWIQCE-PCDVCYKQKDPIYNRTKSDSYTEMLCNEP 151
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVL------VTDHFPLRLTNGSLLGPRL 172
C + L +C + C Y+ YAD + G+L T H+ ++
Sbjct: 152 PCVS--LGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTA-----QV 204
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRGGGY 228
FGCG N GVLGLG G S++SQL ++G +C + GG+
Sbjct: 205 GFGCGLQNLN-FITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGF 263
Query: 229 LFLGHDLVPSSGIAWTPMS------RDLLEKHYSSGPAEL-----LFGGKSTGIKGLQII 277
L G + + TPM +LL G L F K G G +I
Sbjct: 264 LVFGDATYLNGDM--TPMVIAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGG--VI 319
Query: 278 FDSGSSYTYFNSQAYKT----TLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
DSGS+ + F + Y+ +D ++K PL + P C++G
Sbjct: 320 IDSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSS------PDCFEG 363
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/323 (24%), Positives = 125/323 (38%), Gaps = 76/323 (23%)
Query: 54 TGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP--CTGCTLPPES----- 104
T YP Y YS+ L +G PP+ +DTGS L W C + C+ C P
Sbjct: 81 TTPAYPKSYGGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIP 140
Query: 105 LYHPKNN----LVACNDPFCS-------AFHLPENIRCEANDQ-----CDYEVLYADHGS 148
+ PKN+ L+ C +P C F P+ C+ Q C ++ GS
Sbjct: 141 TFIPKNSSTAKLLGCRNPKCGYIFGSDVQFRCPQ---CKPESQNCSLTCPAYIIQYGLGS 197
Query: 149 SLGVLVTD--HFPLRLTNGSLLGPRLIFGCGY-NQRNPGPKPPPTAGVLGLGLGKASILS 205
+ G L+ D +FP + P+ + GC + R P +G+ G G G+ S+ S
Sbjct: 198 TAGFLLLDNLNFPGKTV------PQFLVGCSILSIRQP-------SGIAGFGRGQESLPS 244
Query: 206 QLQSLGLTRNVLGHCLSVRGGGYLFLGHDLV---------PSSGIAWTPMSRD------L 250
Q+ L R +CL DLV ++G+++TP +
Sbjct: 245 QMN---LKR--FSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPA 299
Query: 251 LEKHYSSGPAELLFGGKSTGIKGLQI----------IFDSGSSYTYFNSQAYKTTLDLMR 300
+++Y +++ GGK I + I DSGS++T+ Y
Sbjct: 300 FKEYYYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFV 359
Query: 301 KDLKGK--PLEDTAEEKALPVCW 321
K L+ ED + L C+
Sbjct: 360 KQLEKNYSRAEDAETQSGLSPCF 382
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 82 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 139
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 140 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PGFSFG 193
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 194 CNMDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDCFSYCLPLQKSERGFFSKTT 249
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + + G+ G+ KG ++F
Sbjct: 250 GYFSLGK-VATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKG--VVF 306
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 307 DSGSELSYIPDRALSVLSQRIRELLLKRG---AAEEESERNCY 346
>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
Length = 475
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 86/199 (43%), Gaps = 13/199 (6%)
Query: 134 NDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGV 193
N++C Y YA+ SS G +V D F + R++FGC N G+
Sbjct: 4 NEKCYYSRTYAERSSSEGWMVEDAFGFPDDQPPV---RMVFGC-ENGETGEIYRQLADGI 59
Query: 194 LGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPS-SGIAWTPMSRDLLE 252
+G+G + SQL + G+ +V C G L LG +P + +TP+ +L
Sbjct: 60 MGMGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYTPLLNNLHL 119
Query: 253 KHYSSGPAELLFGGKSTGI------KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGK 306
+Y+ + G + +G ++ DSG+++TY ++A+ +
Sbjct: 120 HYYNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAAAIGSYALSH 179
Query: 307 PLEDT--AEEKALPVCWKG 323
L+ T A+ + +CWKG
Sbjct: 180 GLQSTPGADPQYNDICWKG 198
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 71/151 (47%), Gaps = 15/151 (9%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQC-NAPCTGCTLPP--ESLYHPKNNLVACNDPFC 120
+VTL +G+PP+ + +DTGS+L+W+ C +P G P S Y P V C+ P C
Sbjct: 66 TVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSP----VPCSSPIC 121
Query: 121 --SAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC- 176
LP C+ C + YAD S G L + F + GS+ P +FGC
Sbjct: 122 RTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI----GSVTRPGTLFGCM 177
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
+ + + G++G+ G S ++QL
Sbjct: 178 DSGLSSNSEEDAKSTGLMGMNRGSLSFVNQL 208
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PGFSFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNMDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDCFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLKRG---AAEEESERNCY 265
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/277 (24%), Positives = 111/277 (40%), Gaps = 39/277 (14%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSAFHLP 126
+ IG PP + +DTGSD+ WV C PCT C L+ P ++ + P C P
Sbjct: 105 ISIGQPPIPQLVVMDTGSDILWVMC-TPCTNCDNHLGLLFDP--SMSSTFSPLCKT---P 158
Query: 127 ENIR-CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTN-GSLLGPRLIFGCGYN---QR 181
+ + C D + V YAD+ ++ G+ D T+ G+ P ++FGCG+N
Sbjct: 159 CDFKGCSRCDPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGHNIGQDT 218
Query: 182 NPGPKPPPTAGVLGLGLGKASILSQLQS-------------LGLTRNVLGHCLSVRGGGY 228
+PG G+LGL G S+ +++ + +LG + G
Sbjct: 219 DPGHN-----GILGLNNGPDSLATKIGQKFSYCIGDLADPYYNYHQLILGEGADLEGYST 273
Query: 229 LFLGHDLVPSSGIAWTPMS-RDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYF 287
F H+ G + M + EK P TG +I D+GS+ T+
Sbjct: 274 PFEVHN-----GFYYVTMEGISVGEKRLDIAPETFEMKKNRTG----GVIIDTGSTITFL 324
Query: 288 NSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCWKGT 324
++ +R L + T E+ C+ G+
Sbjct: 325 VDSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGS 361
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 86/348 (24%), Positives = 134/348 (38%), Gaps = 74/348 (21%)
Query: 29 QPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTW 88
+P S+ +TA+ + P++ Y G YSV+L G P + DTGS L
Sbjct: 61 KPDEDALSSTTTAS---ATVVKSPLSAKSY--GGYSVSLSFGTPSQTIPFVFDTGSSLVC 115
Query: 89 VQCNAP--CTGC-------TLPPESLYHPKNN----LVACNDPFCSAFHLPENIRCEAND 135
+ C + C+GC TL P + PKN+ ++ C P C + P N++C D
Sbjct: 116 LPCTSRYLCSGCDFSGLDPTLIPR--FIPKNSSSSKIIGCQSPKCQFLYGP-NVQCRGCD 172
Query: 136 Q--------CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG-YNQRNPGPK 186
C +L GS+ GVL+T+ +L L P + GC + R P
Sbjct: 173 PNTRNCTVGCPPYILQYGLGSTAGVLITE----KLDFPDLTVPDFVVGCSIISTRQP--- 225
Query: 187 PPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDL----------- 235
AG+ G G G S+ SQ+ L R HCL R + DL
Sbjct: 226 ----AGIAGFGRGPVSLPSQMN---LKR--FSHCLVSRRFDDTNVTTDLDLDTGSGHNSG 276
Query: 236 VPSSGIAWTP------MSRDLLEKHYSSGPAELLFGGKSTGIKGLQI----------IFD 279
+ G+ +TP +S ++Y + G K I + I D
Sbjct: 277 SKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVD 336
Query: 280 SGSSYTYFNSQAYKTTLDLMRKDLKGKPLE-DTAEEKALPVCWKGTWK 326
SGS++T+ ++ + + E D +E L C+ + K
Sbjct: 337 SGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCFNISGK 384
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 64/253 (25%), Positives = 104/253 (41%), Gaps = 20/253 (7%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGC---TLPPESLYHPKNNLVACNDPF 119
+ + G+P K L +DTGS LTW QC PC+ C + P+ Y P + + D
Sbjct: 58 FMAEIHFGSPQKKQFLHMDTGSSLTWTQC-FPCSDCYAQKIYPK--YRPAAS-ITYRDAM 113
Query: 120 CSAFHLPENIRCEAN---DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-LIFG 175
C H N + C Y+ Y D + G L + + +G + FG
Sbjct: 114 CEDSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFG 173
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDL 235
C N + G T G+LGLG+GK SI+ + S LG + L LG
Sbjct: 174 C--NTLSDGSYFTGT-GILGLGVGKYSIIGEFGS--KFSFCLGEISEPKASHNLILG--- 225
Query: 236 VPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTT 295
+ + P ++ E H ++ G + T +Q+ D+GS+ ++ ++ Y
Sbjct: 226 -DGANVQGHPTVINITEGHTIFQLESIIVGEEITLDDPVQVFVDTGSTLSHLSTNLYYKF 284
Query: 296 LDLMRKDLKGKPL 308
+D + +PL
Sbjct: 285 VDAFDDLIGSRPL 297
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 82/194 (42%), Gaps = 28/194 (14%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYE---LDIDTGSDLTW 88
+K++ +TA + P+ G Y V L+IG P + DTGSDL+W
Sbjct: 71 AKEEIQLATAIAAGDKKLLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSW 130
Query: 89 VQCNAPCTGCT----LPPESLYHPKN-NLVACNDPFCSAFHLPENIRCEA-------NDQ 136
QC PCT C+ PP + ++C DP C C A +
Sbjct: 131 TQCE-PCTNCSSFTPYPPHDPSKSRTFRRLSCFDPMCEL--------CTAVVDGGGGSAG 181
Query: 137 CDYEVLYADHGSSLGVLVTDHFPLRLT---NGSLLGPRLIFGCGYNQRNPGPKPPPTAGV 193
C + Y D G+ G LV+D F G L + FGC + + + + T G+
Sbjct: 182 CLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAFGCAHVEDSKAVRGYST-GI 240
Query: 194 LGLGLGKASILSQL 207
L LG+GK S ++QL
Sbjct: 241 LALGIGKPSFVTQL 254
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 96/250 (38%), Gaps = 13/250 (5%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y +T+ +G+P + IDTGSD++WVQC PC+ C +SL+ P ++ SA
Sbjct: 127 YLITVGMGSPAVAQTMLIDTGSDVSWVQCK-PCSQCHSQADSLFDPSSSSTYSAFSCTSA 185
Query: 123 FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRN 182
R ++ QC Y V Y D + G +D L GS FGC +Q
Sbjct: 186 ACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLAL----GSSTVENFQFGC--SQSE 239
Query: 183 PGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGGGYLFLGHDLVPSSGIA 242
G L G A L+ Q+ G +CL G FL S +
Sbjct: 240 SGNLLQDQTAGLMGLGGGAESLAT-QTAGTFGKAFSYCLPPTPGSSGFLTLGASTSGFVV 298
Query: 243 WTPMSRDL-LEKHYSSGPAELLFGGKSTGIKGLQI----IFDSGSSYTYFNSQAYKTTLD 297
TPM R + +Y + GG+ I I DSG+ T AY
Sbjct: 299 KTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSAGSIMDSGTIITRLPRTAYSALSS 358
Query: 298 LMRKDLKGKP 307
+ +K P
Sbjct: 359 AFKAGMKQYP 368
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/245 (26%), Positives = 103/245 (42%), Gaps = 23/245 (9%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCT-GCTLPPES--LYHPKN----NLVACNDPF 119
+K+G PP + +DTG+ L++VQC PCT C ++ ++ P + V C++
Sbjct: 210 IKLGTPPVWNLVAVDTGATLSFVQCE-PCTLRCHKQTDAGEIFDPSKSESFSRVGCSENK 268
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGS-SLGVLVTDHFPLRLTNGSLLGPRLIF 174
C A HL E D C Y + + S S+G LV D + P +F
Sbjct: 269 CRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVRDRLAIGKYAKGYSFPDFLF 328
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SVRGGGYLFLG 232
GC + + AG++G S Q+ L + +C R GYL +G
Sbjct: 329 GCSLDTEYHQYE----AGLVGFADEPFSFFEQVAPL-VNYKAFSYCFPSDRRKTGYLSIG 383
Query: 233 HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAY 292
+S +TP+ + Y+ E+L G + ++I DSGS +T S +
Sbjct: 384 DYTRVNS--TYTPLFLARQQSRYALKLDEVLVNGMALVTTPSEMIVDSGSRWTILLSDTF 441
Query: 293 KTTLD 297
T LD
Sbjct: 442 -TQLD 445
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 70/167 (41%), Gaps = 12/167 (7%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + + IG PP DTGSDL W QC PC C ++ P + V+C
Sbjct: 89 GEYLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKSTSFKEVSCE 147
Query: 117 DPFCSAFHLPENIRC-EANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLG-PRLIF 174
C L + + C + CD+ Y D + GV+ T+ L +G ++F
Sbjct: 148 SQQC---RLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVF 204
Query: 175 GCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL 221
GCG+N N G G+ G G S+ SQ+ S + CL
Sbjct: 205 GCGHN--NSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCL 249
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---RFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 82/194 (42%), Gaps = 28/194 (14%)
Query: 32 SKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYE---LDIDTGSDLTW 88
+K++ +TA + P+ G Y V L+IG P + DTGSDL+W
Sbjct: 92 AKEEIQLATAIAAGDKKLLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSW 151
Query: 89 VQCNAPCTGCT----LPPESLYHPKN-NLVACNDPFCSAFHLPENIRCEA-------NDQ 136
QC PCT C+ PP + ++C DP C C A +
Sbjct: 152 TQCE-PCTNCSSFTPYPPHDPSKSRTFRRLSCFDPMCEL--------CTAVVDGGGGSAG 202
Query: 137 CDYEVLYADHGSSLGVLVTDHFPLRLT---NGSLLGPRLIFGCGYNQRNPGPKPPPTAGV 193
C + Y D G+ G LV+D F G L + FGC + + + + T G+
Sbjct: 203 CLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAFGCAHVEDSKAVRGYST-GI 261
Query: 194 LGLGLGKASILSQL 207
L LG+GK S ++QL
Sbjct: 262 LALGIGKPSFVTQL 275
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 122/301 (40%), Gaps = 47/301 (15%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y + L IG PP DTGSDLTW+Q + PC C ++ P N+ + C
Sbjct: 78 GEYMMNLSIGTPPFPILAIADTGSDLTWLQ-SKPCDQCYPQKGPIFDPSNSTTFHKLPCT 136
Query: 117 DPFCSAFHLPENIR-CEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C+A L E+ R C C Y Y DH + G L +D + + N S+ + FG
Sbjct: 137 TAPCNA--LDESARSCTDPTTCGYTYSYGDHSYTTGYLASDT--VTVGNASVQIRNVAFG 192
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQS----------LGLTRNVLGHCLSVRG 225
CG RN G +G++GLG G S +SQL L L +
Sbjct: 193 CG--TRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPA 250
Query: 226 GGYLFLGHDLVPSSG------IAWTPMSRDLLEKHY-------SSGPAELLF-------- 264
+ G + V SS A TP+ +Y + G +LL+
Sbjct: 251 TSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTA 310
Query: 265 ---GGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
G + ++ II DSG++ T+ + Y + +++K + + D + +C+
Sbjct: 311 SYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVND-VKNSMFSLCF 369
Query: 322 K 322
K
Sbjct: 370 K 370
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 128/314 (40%), Gaps = 69/314 (21%)
Query: 54 TGNVYPLGY--YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAP--CTGCTLP---PESL- 105
T YP Y YS+ L +G PP+ +DTGS L W C + C+ C P P +
Sbjct: 77 TTPAYPKSYGGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIP 136
Query: 106 -YHPKNN----LVACNDPFCSAFHLPE-NIRCEANDQ---------CDYEVLYADHGSSL 150
+ PKN+ L+ C +P C P+ RC + C ++ G++
Sbjct: 137 TFIPKNSSTAKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATA 196
Query: 151 GVLVTD--HFPLRLTNGSLLGPRLIFGCGY-NQRNPGPKPPPTAGVLGLGLGKASILSQL 207
G L+ D +FP + P+ + GC + R P +G+ G G G+ S+ SQ+
Sbjct: 197 GFLLLDNLNFPGKTV------PQFLVGCSILSIRQP-------SGIAGFGRGQESLPSQM 243
Query: 208 QSLGLTRNVLGHCLSVRGGGYLFLGHDLV---------PSSGIAWTPMSRD-----LLEK 253
L R +CL DLV ++G+++TP + + +
Sbjct: 244 N---LKR--FSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFRE 298
Query: 254 HYSSGPAELLFGGKSTGI--KGLQ--------IIFDSGSSYTYFNSQAYK-TTLDLMRKD 302
+Y +L+ GG I K L+ I DSGS++T+ Y + +R+
Sbjct: 299 YYYVTLRKLIVGGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQL 358
Query: 303 LKGKPLEDTAEEKA 316
K E+ E ++
Sbjct: 359 GKKYSREENVEAQS 372
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 82 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 139
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 140 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 193
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q + +CL S RG
Sbjct: 194 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---RFDGFSYCLPLQKSERGFFSKTT 249
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 250 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 306
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 307 DSGSELSYIPDRALSVLSQRIRELLLRR---GAAEEESERNCY 346
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---RFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---RFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 120/284 (42%), Gaps = 42/284 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K L+IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PGFSFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q T + +CL S RG
Sbjct: 113 CNMDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---TFDGFSYCLPLQMSERGFFSKTT 168
Query: 227 GYLFLGHDLVPS-SGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQII 277
GY LG + + + + +T M +R + + + G+ G+ KG ++
Sbjct: 169 GYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKG--VV 226
Query: 278 FDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
FDSGS +Y +A +R+ L + AEE++ C+
Sbjct: 227 FDSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 267
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 120/284 (42%), Gaps = 54/284 (19%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTL-----PPESLYHPKNN----LV 113
+S+T+ IG PP+ +L +DTGSDL W QC + PP +Y P + +
Sbjct: 91 HSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPP--VYDPGESSTFAFL 148
Query: 114 ACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLI 173
C+D C C + ++C YE +Y +++GVL ++ F L RL
Sbjct: 149 PCSDRLCQEGQFSFK-NCTSKNRCVYEDVYGS-AAAVGVLASETFTFGARRAVSL--RLG 204
Query: 174 FGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL---SVRGGGYLF 230
FGCG + G T G+LGL S+++QL+ + R +CL + + L
Sbjct: 205 FGCG--ALSAGSLIGAT-GILGLSPESLSLITQLK---IQR--FSYCLTPFADKKTSPLL 256
Query: 231 LGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLF-----GGKSTGIKGLQI--------- 276
G + ++ +R + S P + ++ G S G K L +
Sbjct: 257 FG----AMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRP 312
Query: 277 ------IFDSGSSYTYFNSQAY----KTTLDLMRKDLKGKPLED 310
I DSGS+ Y A+ + +D++R + + +ED
Sbjct: 313 DGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVED 356
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/276 (28%), Positives = 115/276 (41%), Gaps = 53/276 (19%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDP 118
Y V ++G PP+ L +DT +D W+ C+ C GC P + ++P + V C P
Sbjct: 108 YVVRARLGTPPQQLLLAVDTSNDAAWIPCSG-CAGC--PTTTPFNPAASKSYRAVPCGSP 164
Query: 119 FCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCG 177
CS N C N + C + + YAD SSL ++ L + + FGC
Sbjct: 165 ACS---RAPNPSCSLNTKSCGFSLTYAD--SSLEAALSQD---SLAVANDVVKSYTFGC- 215
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRGGGYLFLGH 233
Q+ G PP +LGLG G S LSQ + + +CL S+ G L LG
Sbjct: 216 -LQKATGTATPPQG-LLGLGRGPLSFLSQTKD--MYEGTFSYCLPSFKSLNFSGTLRLGR 271
Query: 234 DLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIK-------------------GL 274
P I TP+ L+ H SS L+ TGI+ G
Sbjct: 272 KGQPLR-IKTTPL---LVNPHRSS-----LYYVSMTGIRVGKKVVPIPPAALAFDPATGA 322
Query: 275 QIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLED 310
+ DSG+ +T + AY D +R+ ++G PL
Sbjct: 323 GTVLDSGTMFTRLVAPAYVAVRDEVRRRIRGAPLSS 358
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 119/283 (42%), Gaps = 42/283 (14%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL---VACNDPF 119
Y +++ +G P K ++IDTGS +WV C C GC P + ++ V+C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 120 C----SAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C S H ++ E C + V Y D +S G+L D L ++ + P FG
Sbjct: 59 CLLGGSDPHCQDS---ENYPDCPFRVSYQDGSASYGILYQD--TLTFSDVQKI-PSFTFG 112
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRG-----G 226
C + + G+LG+G G S+L Q + +CL S RG
Sbjct: 113 CNLDSFGAN-EFGNVDGLLGMGAGPMSVLKQSSP---RFDGFSYCLPLQKSERGFFSKTT 168
Query: 227 GYLFLGHDLVPSSGIAWTPM-SRDLLEKHYSSGPAELLFGGKSTGI-------KGLQIIF 278
GY LG + + + +T M +R + + A + G+ G+ KG ++F
Sbjct: 169 GYFSLGK-VATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKG--VVF 225
Query: 279 DSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKALPVCW 321
DSGS +Y +A +R+ L + AEE++ C+
Sbjct: 226 DSGSELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCY 265
>gi|399218365|emb|CCF75252.1| unnamed protein product [Babesia microti strain RI]
Length = 535
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 110/289 (38%), Gaps = 52/289 (17%)
Query: 46 GSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESL 105
G PI G ++ YY + + IG PP + + +DTGS L + C C C
Sbjct: 163 GKKFKIPIYGTLHDFAYYFIKIFIGTPPSVQWVVLDTGSSLLGITC-GNCIQCGNHQNPN 221
Query: 106 YHPKNNLVACNDPFCSAFHLPENIRCEANDQCDY----EVLYADHGSSLGVLVTDHFPLR 161
Y P + A I+C +QC E + H S G ++ +
Sbjct: 222 YEPYESATA--------------IKCTDVNQCKLKGCDECRFMQHYSE-GSFISGDYYTD 266
Query: 162 LTNGSLLGPRLIF---GCG-------YNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLG 211
+ + P F GC YNQR G+ G+ SI+SQL
Sbjct: 267 VISFDKSSPGYKFNNLGCVLYENKLIYNQR--------ANGIFGMSPNDDSIISQLFKRP 318
Query: 212 LTRNVLGHCLSVRGGGYLFLG-----HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGG 266
N+ CLS GG + G ++ +S +AWT ++ D +Y + +
Sbjct: 319 EIDNIFSICLSDEGGELIIGGIEPELFNIKNNSEMAWTRLNTD---NNYYIHINSMSYLS 375
Query: 267 KSTGIKGLQIIFDSGSSYTYFNSQAYKTTLD------LMRKDLKGKPLE 309
I + DSG++ T + YK+ ++ M ++++G L+
Sbjct: 376 DHVEITNTKFSIDSGTTNTVLMEKMYKSIVNGVMNICFMDREIEGYDLD 424
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 57/125 (45%), Gaps = 13/125 (10%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACN 116
G Y L +G P + + +DTGSD+ W+QC APC C + +++P + + C
Sbjct: 145 GEYFTRLGVGTPARYVFMVLDTGSDVVWIQC-APCKKCYSQTDPVFNPTKSRSFANIPCG 203
Query: 117 DPFCSAFHLPENIRCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
P C P C C Y+V Y D + G T+ R G+ +G R+ G
Sbjct: 204 SPLCRRLDSPG---CSTKKHICLYQVSYGDGSFTYGEFSTETLTFR---GTRVG-RVALG 256
Query: 176 CGYNQ 180
CG++
Sbjct: 257 CGHDN 261
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 76/292 (26%), Positives = 116/292 (39%), Gaps = 48/292 (16%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCNA-PCTGCTLPP--ESLYHPKNNLVACNDPFC 120
+++L IG+PP+ + +DTGS+L+W+ C P T P S Y P CN C
Sbjct: 60 TISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSSSYTP----TPCNSSVC 115
Query: 121 SAFHLPENI--RCEANDQ-CDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC- 176
I C+ N++ C V YAD S+ G L + F L P +FGC
Sbjct: 116 MTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSL----AGAAQPGTLFGCM 171
Query: 177 ---GYNQR-NPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVRGG-GYLFL 231
GY N K T G++G+ G S+++Q + +C+S G L L
Sbjct: 172 DSAGYTSDINEDAK---TTGLMGMNRGSLSLVTQ-----MVLPKFSYCISGEDAFGVLLL 223
Query: 232 GHDLVPSSGIAWTPMSRDLLEKHYSSGPA-ELLFGGKSTGIKGLQI-------------- 276
G S + +TP+ Y A + G K LQ+
Sbjct: 224 GDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQ 283
Query: 277 -IFDSGSSYTYFNSQAYKTTLDLMRKDLKG--KPLEDT--AEEKALPVCWKG 323
+ DSG+ +T+ Y + D + KG +ED E A+ +C+
Sbjct: 284 TMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHA 335
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 73/150 (48%), Gaps = 14/150 (9%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCT-GCTLPP--ESLYHPKNNLVACNDPFC 120
+V+L +G PP+ + +DTGS+L+W++CN T T P S Y P V C+ C
Sbjct: 86 TVSLTVGTPPQNVSMVLDTGSELSWLRCNKTQTFQTTFDPNRSSSYSP----VPCSSLTC 141
Query: 121 S--AFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
+ P C++N C + YAD SS G L +D F + G+ P IFGC
Sbjct: 142 TDRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYI----GNSDMPGTIFGCMD 197
Query: 179 NQRNPGPKP-PPTAGVLGLGLGKASILSQL 207
+ + + G++G+ G S +SQ+
Sbjct: 198 SSFSTNTEEDSKNTGLMGMNRGSLSFVSQM 227
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/276 (26%), Positives = 112/276 (40%), Gaps = 39/276 (14%)
Query: 52 PIT-GNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKN 110
P+T G G Y + + IG P K + + IDTGSD+ W+QC PC C + ++ P +
Sbjct: 148 PVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCK-PCDDCYQQVDPIFDPAS 206
Query: 111 ----NLVACNDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGS 166
+ + C P C + C ND C Y+V Y D ++G T+ +GS
Sbjct: 207 SSSFSRLGCQTPQCRNLDV---FACR-NDSCLYQVSYGDGSYTVGDFATETVSFG-NSGS 261
Query: 167 LLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----S 222
+ ++ GCG++ G+ G L S+ SQ+++ + +CL S
Sbjct: 262 V--DKVAIGCGHDNEGLFVGAAGLIGLGGGPL---SLTSQIKA-----SSFSYCLVNRDS 311
Query: 223 VRGGGYLFLGHDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGI----------- 271
V F + PS + ++ Y G + GG+ I
Sbjct: 312 VDSSTLEF--NSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSG 369
Query: 272 KGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
KG II D G++ T +QAY D K K P
Sbjct: 370 KG-GIIVDCGTAVTRLQTQAYNALRDTFVKLTKDLP 404
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/293 (24%), Positives = 122/293 (41%), Gaps = 53/293 (18%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQC-NAPCTGCTLPPE--SLYHPKNNLVACNDPFC 120
+V+L +G PP+ + +DTGS+L+W+ C P S Y P + C P C
Sbjct: 71 TVSLTVGTPPQSVTMVLDTGSELSWLHCKKQQNINSVFNPHLSSSYTP----IPCMSPIC 126
Query: 121 SA----FHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG- 175
F +P + C++N+ C V YAD S G L +D F + + P +IFG
Sbjct: 127 KTRTRDFLIP--VSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSG----QPGIIFGS 180
Query: 176 --CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR-GGGYLFLG 232
G++ + T G++G+ G S ++Q +G + +C+S + G L G
Sbjct: 181 MDSGFSSN--ANEDSKTTGLMGMNRGSLSFVTQ---MGFPK--FSYCISGKDASGVLLFG 233
Query: 233 HDLVPSSG-IAWTPMSR---------------DLLEKHYSSGPAEL---LFGGKSTGIKG 273
G + +TP+ + L+ S P ++ +F TG
Sbjct: 234 DATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAG- 292
Query: 274 LQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKG--KPLED--TAEEKALPVCWK 322
Q + DSG+ +T+ Y + +G LED E A+ +C++
Sbjct: 293 -QTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFR 344
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/295 (27%), Positives = 115/295 (38%), Gaps = 34/295 (11%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
K T ST T + +G Y VT+++G K L +DTGSDLTWVQC
Sbjct: 106 KAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG--KNMSLIVDTGSDLTWVQCQ- 162
Query: 94 PCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPENIR--CEANDQ-----CDYEVL 142
PC C LY P + V CN C + C N+ C+Y V
Sbjct: 163 PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVS 222
Query: 143 YADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKAS 202
Y D + G L ++ L G +FGCG N + G+ ++S
Sbjct: 223 YGDGSYTRGDLASESILL----GDTKLENFVFGCGRNNKGLFGGSSGLMGLG-----RSS 273
Query: 203 ILSQLQSLGLTRNVLGHCL-SVRGG--GYLFLGHD---LVPSSGIAWTPMSRD-LLEKHY 255
+ Q+L V +CL S+ G G L G+D S+ +++TP+ ++ L Y
Sbjct: 274 VSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFY 333
Query: 256 SSGPAELLFGG---KSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
GG KS+ G I+ DSG+ T YK K G P
Sbjct: 334 ILNLTGASIGGVELKSSSF-GRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFP 387
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 68/150 (45%), Gaps = 9/150 (6%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y + L IG PP + DTGSDLTW QC PC C +Y P + P SA
Sbjct: 77 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 135
Query: 123 FHLP--ENIRCEA-NDQCDYEVLYADHGSSLGVLVTDHFPL--RLTNGSLLGPRLIFGCG 177
LP + C + C Y Y+D S G+L T+ L + ++ + FGCG
Sbjct: 136 TCLPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGCG 195
Query: 178 YNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
+ G + G +GLG G S+L+QL
Sbjct: 196 TDN---GGDSLNSTGTVGLGRGTLSLLAQL 222
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 107/263 (40%), Gaps = 31/263 (11%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDP 118
Y VT+ +G+ + IDTGSDLTWVQC PC C ++ P V+CN
Sbjct: 65 YIVTMGLGSTN--MTVIIDTGSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSS 121
Query: 119 FCSA--FHLPENIRCEAN-DQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
C + F C +N C+Y V Y D + G L + +L+ G + +FG
Sbjct: 122 TCQSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVE----QLSFGGVSVSDFVFG 177
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSVR---GGGYLFLG 232
CG RN +G++GLG S++SQ + V +CL G L +G
Sbjct: 178 CG---RNNKGLFGGVSGLMGLGRSYLSLVSQTNA--TFGGVFSYCLPTTESGASGSLVMG 232
Query: 233 HDLVPSSGIAWTPMSRDLLEKHYSSGPAELLFGGKSTGIKGLQ--------IIFDSGSSY 284
++ + +R L S+ L G G+ LQ ++ DSG+
Sbjct: 233 NESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGV-ALQVPSFGNGGVLIDSGTVI 291
Query: 285 TYFNSQAYKTTLDLMRKDLKGKP 307
T S YK L K G P
Sbjct: 292 TRLPSSVYKALKALFLKQFTGFP 314
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 121/288 (42%), Gaps = 52/288 (18%)
Query: 67 LKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNN----LVACNDPFCSA 122
L IGNPP + +DTGSDL W+QC PC C + +Y+ + + CN+P C
Sbjct: 110 LSIGNPPTNVYVVLDTGSDLFWIQCE-PCDVCYKQKDPIYNRTKSDSYTEMLCNEPPC-- 166
Query: 123 FHLPENIRCEANDQCDYEVLYADHGSSLGVL------VTDHFPLRLTNGSLLGPRLIFGC 176
L +C + C Y+ YAD + G+L T H+ ++ FGC
Sbjct: 167 LSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTA-----QVGFGC 221
Query: 177 GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRGGGYLFLG 232
G N GVLGLG G S++SQL ++G +C + GG+L G
Sbjct: 222 GLQNLN-FVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVFG 280
Query: 233 HDLVPSSGIAWTPMSRDLLEKHYSS------GPAE-------LLFGGKSTGIKGLQIIFD 279
+ + TPM + E +Y + G E F K G G +I D
Sbjct: 281 DATYLNGDM--TPMV--IAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGG--VIID 334
Query: 280 SGSSYTYFNSQAYKT----TLDLMRKDLKGKPLEDTAEEKALPVCWKG 323
SGS+ + F + Y+ +D ++K PL + P C++G
Sbjct: 335 SGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSS------PDCFEG 376
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 69/209 (33%), Positives = 89/209 (42%), Gaps = 30/209 (14%)
Query: 60 LGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTG--CTLPPESLYHPKNN----LV 113
L Y + G K + +DTGSDLTWVQC PC G C + L+ P + V
Sbjct: 178 LNYVTTIALGGGGAKNLTVIVDTGSDLTWVQCE-PCPGSSCYAQRDPLFDPAASPTFAAV 236
Query: 114 ACNDPFCSAFHLPEN------IRCEANDQ--CDYEVLYADHGSSLGVLVTDHFPLRLTNG 165
C P C+A R N + C Y + Y D S GVL D L T
Sbjct: 237 PCGSPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTT- 295
Query: 166 SLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL--SV 223
L G +FGCG + R TAG++GLG S++S Q+ V +CL +
Sbjct: 296 KLDG--FVFGCGLSNRGLFGG---TAGLMGLGRTDLSLVS--QTAARFGGVFSYCLPATT 348
Query: 224 RGGGYLFLGHDLVPSSG---IAWTPMSRD 249
G L LG PSS +A+T M D
Sbjct: 349 TSTGSLSLGPG--PSSSFPNMAYTRMIAD 375
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 75/263 (28%), Positives = 109/263 (41%), Gaps = 44/263 (16%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNL-------VAC 115
Y V KIG P + L +DT +D W+ PC+GC +++ NN+ V C
Sbjct: 96 YIVRAKIGTPAQTMLLAMDTSNDAAWI----PCSGCVGCSSTVF---NNVKSTTFKTVGC 148
Query: 116 NDPFCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFG 175
P C +P N +C C + + Y SS+ ++ + L S+ P FG
Sbjct: 149 EAPQCK--QVP-NSKC-GGSACAFNMTYGS--SSIAANLSQDV-VTLATDSI--PSYTFG 199
Query: 176 CGYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRGGGYLFL 231
C G PP G+LGLG G S+LSQ Q+ L ++ +CL S+ G L L
Sbjct: 200 CLTEAT--GSSIPP-QGLLGLGRGPMSLLSQTQN--LYQSTFSYCLPSFRSLNFSGSLRL 254
Query: 232 GHDLVPSSGIAWTPMSRDLLEKH-YSSGPAELLFGGKSTGI----------KGLQIIFDS 280
G P I TP+ ++ Y + G + I G IFDS
Sbjct: 255 GPVGQPKR-IKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDS 313
Query: 281 GSSYTYFNSQAYKTTLDLMRKDL 303
G+ +T + AY D RK +
Sbjct: 314 GTVFTRLVAPAYTAVRDAFRKRV 336
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 118/297 (39%), Gaps = 35/297 (11%)
Query: 33 KKKSTQSTAAHRFGSTAVFPITGNVYPLGY-YSVTLKIGNPPKLYELDIDTGSDLTWVQC 91
K K+ S+ + S P+T + Y VT+++G K L +DTGSDLTWVQC
Sbjct: 56 KIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG--KNMSLIVDTGSDLTWVQC 113
Query: 92 NAPCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPENIR--CEANDQ-----CDYE 140
PC C LY P + V CN C + C N+ C+Y
Sbjct: 114 Q-PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYV 172
Query: 141 VLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGK 200
V Y D + G L ++ L G +FGCG N + G+ +
Sbjct: 173 VSYGDGSYTRGDLASESILL----GDTKLENFVFGCGRNNKGLFGGSSGLMGLG-----R 223
Query: 201 ASILSQLQSLGLTRNVLGHCL-SVRGG--GYLFLGHD---LVPSSGIAWTPMSRD-LLEK 253
+S+ Q+L V +CL S+ G G L G+D S+ +++TP+ ++ L
Sbjct: 224 SSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRS 283
Query: 254 HYSSGPAELLFGG---KSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
Y GG KS+ G I+ DSG+ T YK K G P
Sbjct: 284 FYILNLTGASIGGVELKSSSF-GRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFP 339
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/295 (27%), Positives = 115/295 (38%), Gaps = 34/295 (11%)
Query: 34 KKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNA 93
K T ST T + +G Y VT+++G K L +DTGSDLTWVQC
Sbjct: 106 KAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG--KNMSLIVDTGSDLTWVQCQ- 162
Query: 94 PCTGCTLPPESLYHPKNN----LVACNDPFCSAFHLPENIR--CEANDQ-----CDYEVL 142
PC C LY P + V CN C + C N+ C+Y V
Sbjct: 163 PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVS 222
Query: 143 YADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLGLGKAS 202
Y D + G L ++ L G +FGCG N + G+ ++S
Sbjct: 223 YGDGSYTRGDLASESILL----GDTKLENFVFGCGRNNKGLFGGSSGLMGLG-----RSS 273
Query: 203 ILSQLQSLGLTRNVLGHCL-SVRGG--GYLFLGHD---LVPSSGIAWTPMSRD-LLEKHY 255
+ Q+L V +CL S+ G G L G+D S+ +++TP+ ++ L Y
Sbjct: 274 VSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFY 333
Query: 256 SSGPAELLFGG---KSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKP 307
GG KS+ G I+ DSG+ T YK K G P
Sbjct: 334 ILNLTGASIGGVELKSSSF-GRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFP 387
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/306 (26%), Positives = 122/306 (39%), Gaps = 45/306 (14%)
Query: 24 FSEANQPPSKKKSTQSTAAHRFGSTAVFPITGNVYPLGYYSVTLKIGNPPKLYELDIDTG 83
+ A++ P + K + A + + + P V + Y V +K+G P + + +DT
Sbjct: 60 ITMASKDPERLKYLSTLADQKTTAVPIAP-GQQVLKIANYVVRVKLGTPGQQMFMVLDTS 118
Query: 84 SDLTWVQCNAPCTGCTLPPESLYHPKNNL----VACNDPFCSAFHLPENIRCEA--NDQC 137
+D WV PC+GCT + + P + + C+ CS C A + C
Sbjct: 119 NDAAWV----PCSGCTGCSSTTFLPNASTTLGSLDCSGAQCSQVR---GFSCPATGSSAC 171
Query: 138 DYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGYNQRNPGPKPPPTAGVLGLG 197
+ Y S LV D + L N + P FGC N + G PP G+LGLG
Sbjct: 172 LFNQSYGGDSSLTATLVQDA--ITLANDVI--PGFTFGC-INAVSGGSIPP--QGLLGLG 224
Query: 198 LGKASILSQLQSLGLTRNVLGHCL----SVRGGGYLFLGHDLVPSSGIAWTPMSRD---- 249
G S++SQ + + V +CL S G L LG P S I TP+ R+
Sbjct: 225 RGPISLISQAGA--MYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKS-IRTTPLLRNPHRP 281
Query: 250 ----------LLEKHYSSGPAELLFGGKSTGIKGLQIIFDSGSSYTYFNSQAYKTTLDLM 299
+ + P+E L +TG I DSG+ T F Y D
Sbjct: 282 SLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAG---TIIDSGTVITRFVQPVYFAIRDEF 338
Query: 300 RKDLKG 305
RK + G
Sbjct: 339 RKQVNG 344
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 64/155 (41%), Gaps = 14/155 (9%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHPKNNLVACNDPFCSA 122
Y + L IG PP + DTGSDLTW QC PC C +Y + P SA
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCK-PCKLCFPQDTPIYDTAASASFSPVPCASA 153
Query: 123 FHLP-----ENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGP-----RL 172
LP N C Y Y D S GVL T+ ++ GP +
Sbjct: 154 TCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGV 213
Query: 173 IFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL 207
FGCG + G + G +GLG G S+++QL
Sbjct: 214 AFGCGVDN---GGLSYNSTGTVGLGRGSLSLVAQL 245
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 70/261 (26%), Positives = 103/261 (39%), Gaps = 36/261 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDP 118
Y V IG P + + +DT +D W+ C+ C GC+ L+ P + + C P
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSG-CVGCS--SSVLFDPSKSSSSRTLQCEAP 144
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
C P C + C + + Y GS++ +T LT S + P FGC
Sbjct: 145 QCKQAPNPS---CTVSKSCGFNMTYG--GSTIEAYLTQD---TLTLASDVIPNYTFGC-- 194
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRGGGYLFLGHD 234
P G++GLG G S++SQ Q+ L ++ +CL S G L LG
Sbjct: 195 -INKASGTSLPAQGLMGLGRGPLSLISQSQN--LYQSTFSYCLPNSKSSNFSGSLRLGPK 251
Query: 235 LVPSSGIAWTPMSRDLLEKH-YSSGPAELLFGGKSTGI----------KGLQIIFDSGSS 283
P I TP+ ++ Y + G K I G IFDSG+
Sbjct: 252 NQPIR-IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTV 310
Query: 284 YTYFNSQAYKTTLDLMRKDLK 304
YT AY + R+ +K
Sbjct: 311 YTRLVEPAYVAVRNEFRRRVK 331
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 70/261 (26%), Positives = 103/261 (39%), Gaps = 36/261 (13%)
Query: 63 YSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPPESLYHP----KNNLVACNDP 118
Y V IG P + + +DT +D W+ C+ C GC+ L+ P + + C P
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSG-CVGCS--SSVLFDPSKSSSSRTLQCEAP 144
Query: 119 FCSAFHLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGCGY 178
C P C + C + + Y GS++ +T LT S + P FGC
Sbjct: 145 QCKQAPNPS---CTVSKSCGFNMTYG--GSTIEAYLTQD---TLTLASDVIPNYTFGC-- 194
Query: 179 NQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCL----SVRGGGYLFLGHD 234
P G++GLG G S++SQ Q+ L ++ +CL S G L LG
Sbjct: 195 -INKASGTSLPAQGLMGLGRGPLSLISQSQN--LYQSTFSYCLPNSKSSNFSGSLRLGPK 251
Query: 235 LVPSSGIAWTPMSRDLLEKH-YSSGPAELLFGGKSTGI----------KGLQIIFDSGSS 283
P I TP+ ++ Y + G K I G IFDSG+
Sbjct: 252 NQPIR-IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTV 310
Query: 284 YTYFNSQAYKTTLDLMRKDLK 304
YT AY + R+ +K
Sbjct: 311 YTRLVEPAYVAVRNEFRRRVK 331
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 124/300 (41%), Gaps = 50/300 (16%)
Query: 61 GYYSVTLKIGNPPKLYELDIDTGSDLTWVQCNAPCTGCTLPP--ESLYHPKNN----LVA 114
G Y + L IG PP+L IDTGSDL W++C+ C C L E+++ + +
Sbjct: 3 GEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDN-CDHCDLDHHGETIFFSDASSSYKKLP 61
Query: 115 CNDPFCSAFHLPE-NIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPR-- 171
CN CS RCE + C Y+ Y D + G + +D R ++G+ R
Sbjct: 62 CNSTHCSGMSSAGIGPRCE--ETCKYKYEYGDGSRTSGDVGSDRISFR-SHGAGEDHRSF 118
Query: 172 ---LIFGCGYNQRNPGPKPPPTAGVLGLGLGKASILSQL-QSLGLTRNVLGHCLSV---- 223
+FGCG R T G++GLG S++ QL LG +CL
Sbjct: 119 FDGFLFGCG---RKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYK---FSYCLVSYDSP 172
Query: 224 -RGGGYLFL-------GHDLVPSSGIAWTPMSRDLLEKHYSS---GPAELLFGGKSTG-- 270
+LFL GHD+V + + + + L S G ++ K +G
Sbjct: 173 PSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHN 232
Query: 271 -----IKGLQIIFDSGSSYTYFNSQAYKTTLDLMRKDLKGKPLEDTAEEKA-LPVCWKGT 324
+ + DSG++YT Y+ MRK ++ + + T A L +C+ +
Sbjct: 233 TSVGPFLANKTVIDSGTTYTLLTPPVYEA----MRKSIEEQVILPTLGNSAGLDLCFNSS 288
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 86/179 (48%), Gaps = 29/179 (16%)
Query: 64 SVTLKIGNPPKLYELDIDTGSDLTWVQCN-----APCTGCTLPPESLYHPKNNLVACNDP 118
+V + +G PP+ + +DTGS+L+W+ CN AP S Y P V C+ P
Sbjct: 64 TVPVAVGTPPQNVTMVLDTGSELSWLLCNGSRHDAPFDASA---SSSYAP----VPCSSP 116
Query: 119 FCSAF--HLPENIRCEANDQCDYEVLYADHGSSLGVLVTDHFPLRLTNGSLLGPRLIFGC 176
C+ LP C+++ C + YAD S+ G+L D F L GS P L FGC
Sbjct: 117 ACTWLGRDLPVRPFCDSS-ACRVSLSYADASSADGLLAADTFLL----GSSPMPAL-FGC 170
Query: 177 --GYNQRNPGPKPPPTAGVLGLGLGKASILSQLQSLGLTRNVLGHCLSV-RGGGYLFLG 232
Y+ + PPT G+LG+ G S ++Q TR +C++ +G G L LG
Sbjct: 171 ITSYSSSTDPSETPPT-GLLGMNRGGLSFVTQTA----TRR-FAYCIAAGQGPGILLLG 223
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.138 0.441
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,040,721,508
Number of Sequences: 23463169
Number of extensions: 280660059
Number of successful extensions: 556056
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 743
Number of HSP's successfully gapped in prelim test: 994
Number of HSP's that attempted gapping in prelim test: 552431
Number of HSP's gapped (non-prelim): 1913
length of query: 335
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 192
effective length of database: 9,003,962,200
effective search space: 1728760742400
effective search space used: 1728760742400
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)