BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 013680
(438 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 263/416 (63%), Positives = 322/416 (77%), Gaps = 14/416 (3%)
Query: 13 CILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKR 72
C + S ++FSSKL+HRFSDEAK IS+ GN S D WPK+ S EY +LLL ND KR
Sbjct: 17 CCQFEASIGLTFSSKLIHRFSDEAKSISISRKGNAS-GDLWPKRYSFEYFQLLLGNDLKR 75
Query: 73 QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSN 132
Q+ ++ S +NQLLFPS+GSQ FFGN+ WLHYTWIDIGTPNVSFLVALDAGS+
Sbjct: 76 QRMKL------GSQKNQLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGSD 129
Query: 133 LLWVPCQCIQCAPLSASYYT-SLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP 191
LLWVPC CIQCAPLSASYY SLDR+LSEY PS SS+S+++SC H LC+ S+CK+ KDP
Sbjct: 130 LLWVPCDCIQCAPLSASYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDP 189
Query: 192 CPYIADYST-EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
CPYI +Y E+T+S+G+LV+D LHLAS H + +Q+SV++GCGRKQ GS+ DGAAP
Sbjct: 190 CPYIFNYDDFENTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAP 249
Query: 251 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK 310
DGVMGLG GD+SVPSLLAKAGLIQN FS+CFDENDSG + FGD+G A+QQST FLPI
Sbjct: 250 DGVMGLGPGDISVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGT 309
Query: 311 YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
Y AYFVGVESYC+GNSCL +SGF+ALVDSG+SFT+LP+E+Y E+V +FDK V++KRIS Q
Sbjct: 310 YVAYFVGVESYCVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQ 369
Query: 371 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTL 426
W YCYNASS+E+ +P ++L F +NQ+FVV N +S P H F+ F L
Sbjct: 370 DGLWDYCYNASSQELHDIPAIQLKFPRNQNFVVHNPTYSIPH-----HQGFTMFCL 420
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 523 bits (1348), Expect = e-146, Method: Compositional matrix adjust.
Identities = 262/433 (60%), Positives = 325/433 (75%), Gaps = 14/433 (3%)
Query: 4 LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
L IC C L + S ++FSSKL+HRFS+EAK IS + NVS + +WP KNS +YL+
Sbjct: 7 LFVICF---CFLSNHSIGLTFSSKLIHRFSEEAKSLLISGNDNVS-SQTWPNKNSFQYLQ 62
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
LLL ND KRQK ++ Q NQLLFPS GS T F+GN WLHYTWIDIGTPNVSF
Sbjct: 63 LLLDNDLKRQKMKLGAQ-------NQLLFPSLGSHTFFYGNDLDWLHYTWIDIGTPNVSF 115
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
LVALDAGS+L WVPC CIQCAPLSAS Y LDR+LSEY PS S++S+++SC+H LC+ S
Sbjct: 116 LVALDAGSDLSWVPCDCIQCAPLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCELGS 175
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS--KHAPQSSVQSSVIIGCGRKQT 241
CK+LKDPCPYIADY+ +TSSSG+LV+DILHLAS S ++ Q VQ+SVI+GCGRKQT
Sbjct: 176 HCKNLKDPCPYIADYADPNTSSSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQT 235
Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQS 301
G YLDGAAPDGVMGLG G +SVPSLLAKAGLI+ SFS+CFD N SG++ FGDQG +Q+S
Sbjct: 236 GGYLDGAAPDGVMGLGPGSISVPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKS 295
Query: 302 TSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
T LP YDAY + VESYC+GNSCL QSGF+ALVDSGASFT+LP ++Y ++V++FDK
Sbjct: 296 TPLLPTQGNYDAYLIEVESYCVGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQ 355
Query: 362 VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACF 421
V+++RIS QG W YCYN SS+++ VP MRL F NQS ++ N + P+N+ C
Sbjct: 356 VNAQRISSQGGPWNYCYNTSSKQLDNVPAMRLSFLMNQSLLIHNSTYYVPQNQEFAVFCL 415
Query: 422 SYFTLEYNFTGIL 434
+ + N+ GI+
Sbjct: 416 TLQPTDLNY-GII 427
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 257/433 (59%), Positives = 332/433 (76%), Gaps = 11/433 (2%)
Query: 3 NLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYL 62
+L+ + M + +++D + AV+FSSKL+HRFSDEAK ++S++GN+ ADSWPKK S +Y
Sbjct: 5 SLIPLLMAY-LLVVDAAIAVTFSSKLIHRFSDEAKAFFVSRNGNI-FADSWPKKRSFDYY 62
Query: 63 ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
LLLS+D KRQK ++ + QLLFPSEGS F GN+F WLHYTWIDIGTPNVS
Sbjct: 63 RLLLSSDLKRQKLKL-------GAEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVS 115
Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
FLVALDAGS+LLWVPC C+QCAPLSASYY L R+L+EY PS SS+SK +SC+ LC+
Sbjct: 116 FLVALDAGSDLLWVPCDCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELG 175
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
S CKS KDPCPY+A Y +E+TSSSG L++D LHLA FS+HA +SSV +SVIIGCGRKQ+G
Sbjct: 176 SDCKSSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSG 235
Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQST 302
++ DGAAPDG+MGLG GD+SVPSLLAKAGL++N+FSICFD+N SG++ FGDQG TQ+ST
Sbjct: 236 AFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKST 295
Query: 303 SFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
SF+P+ K+ Y + VE Y +G+S L +GFQALVDSG SFTFLP EIY ++VV+FDK V
Sbjct: 296 SFVPLEGKFVTYLIEVEGYLVGSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQV 355
Query: 363 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEVGDHACF 421
++ R S +G+ WKYCYN+SS+E+L +P + L+F+ NQSF+V N + ENE + C
Sbjct: 356 NATRSSFKGSPWKYCYNSSSQELLNIPTVTLVFAMNQSFIVHNPVIKLISENEEFNVFCL 415
Query: 422 SYFTLEYNFTGIL 434
+ F GI+
Sbjct: 416 PIQPIHEEF-GII 427
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 255/422 (60%), Positives = 326/422 (77%), Gaps = 10/422 (2%)
Query: 14 ILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQ 73
+++D + AV+FSSKL+HRFSDEAK ++S++GN+ ADSWPKK S +Y LLLS+D KRQ
Sbjct: 5 LVVDAAIAVTFSSKLIHRFSDEAKAFFVSRNGNI-FADSWPKKRSFDYYRLLLSSDLKRQ 63
Query: 74 KTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNL 133
K ++ + QLLFPSEGS F GN+F WLHYTWIDIGTPNVSFLVALDAGS+L
Sbjct: 64 KLKL-------GAEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDL 116
Query: 134 LWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP 193
LWVPC C+QCAPLSASYY L R+L+EY PS SS+SK +SC+ LC+ S CKS KDPCP
Sbjct: 117 LWVPCDCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCP 176
Query: 194 YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGV 253
Y+A Y +E+TSSSG L++D LHLA FS+HA +SSV +SVIIGCGRKQ+G++ DGAAPDG+
Sbjct: 177 YLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGL 236
Query: 254 MGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA 313
MGLG GD+SVPSLLAKAGL++N+FSICFD+N SG++ FGDQG TQ+STSF+P+ K+
Sbjct: 237 MGLGPGDLSVPSLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVT 296
Query: 314 YFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS 373
Y + VE Y +G+S L +GFQALVDSG SFTFLP EIY ++VV+FDK V++ R S +G+
Sbjct: 297 YLIEVEGYLVGSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSP 356
Query: 374 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEVGDHACFSYFTLEYNFTG 432
WKYCYN+SS+E+L +P + L+F+ NQSF+V N + ENE + C + F G
Sbjct: 357 WKYCYNSSSQELLNIPTVTLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEF-G 415
Query: 433 IL 434
I+
Sbjct: 416 II 417
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 501 bits (1290), Expect = e-139, Method: Compositional matrix adjust.
Identities = 235/400 (58%), Positives = 308/400 (77%), Gaps = 9/400 (2%)
Query: 16 LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVA-DSWPKKNSVEYLELLLSNDWKRQK 74
++G+ V+FSS+L+HRFS+EAK S+ + SV +WP++NS EY LLL +D RQ+
Sbjct: 17 MEGAVGVTFSSRLIHRFSEEAKAHLASRGSDGSVLLQAWPERNSSEYFRLLLRSDVTRQR 76
Query: 75 TRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
R+ S+ ++L+P EG QT FGN YWLHYTWIDIGTPNVSFLVALDAGS++L
Sbjct: 77 MRL-------GSQYEMLYPFEGGQTFLFGNALYWLHYTWIDIGTPNVSFLVALDAGSDML 129
Query: 135 WVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
WVPC CI+CA LSA Y LDR+L++Y PS S++S+++ C H LC S CK KDPCPY
Sbjct: 130 WVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSVCKGSKDPCPY 189
Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
YS+ +TSSSGY+ +D LHL S KHA Q+SVQ+S+I+GCGRKQTG YL GA PDGV+
Sbjct: 190 AVQYSSANTSSSGYVFEDKLHLTSNGKHAEQNSVQASIILGCGRKQTGEYLRGAGPDGVL 249
Query: 255 GLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY 314
GLG G++SVPSLLAKAGLIQNSFSICF+EN+SG + FGDQG TQ ST FLPI K++AY
Sbjct: 250 GLGPGNISVPSLLAKAGLIQNSFSICFEENESGRIIFGDQGHVTQHSTPFLPIDGKFNAY 309
Query: 315 FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 374
VGVES+C+G+ CL ++ FQAL+DSG+SFTFLP E+Y +VV++FDK V++ I LQ NSW
Sbjct: 310 IVGVESFCVGSLCLKETRFQALIDSGSSFTFLPNEVYQKVVIEFDKQVNATSIVLQ-NSW 368
Query: 375 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
+YCYNASS+E++ +P + L FS+NQ+++++N IF P ++
Sbjct: 369 EYCYNASSQELISIPPLNLAFSRNQTYLIQNPIFIDPASQ 408
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 236/407 (57%), Positives = 307/407 (75%), Gaps = 13/407 (3%)
Query: 16 LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVA-DSWPKKNSVEYLELLLSNDWKRQK 74
++G+ +FSS+L+HRFS+EAK S+ SV +WP++NS EY LLL +D RQ+
Sbjct: 17 MEGAVGATFSSRLIHRFSEEAKAHLASRGNKSSVLLQAWPQRNSSEYFRLLLRSDVARQR 76
Query: 75 TRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
R+ S+ + L+PSEG QT FFGN YWLHYTWIDIGTPNVSFLVALDAGS++L
Sbjct: 77 MRL-------GSQYETLYPSEGGQTFFFGNALYWLHYTWIDIGTPNVSFLVALDAGSDML 129
Query: 135 WVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
WVPC CI+CA LSA Y LDR+L++Y PS S++S+++ C H LC S CK KDPCPY
Sbjct: 130 WVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSFCKGSKDPCPY 189
Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
Y++ +TSSSGY+ +D LHL S KHA Q+SVQ+S+I+GCGRKQTG YL GA PDGV+
Sbjct: 190 EVQYASANTSSSGYVFEDKLHLTSDGKHAEQNSVQASIILGCGRKQTGDYLHGAGPDGVL 249
Query: 255 GLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY 314
GLG G++SVPSLLAKAGLIQNSFSIC DEN+SG + FGDQG TQ ST FLPI AY
Sbjct: 250 GLGPGNISVPSLLAKAGLIQNSFSICLDENESGRIIFGDQGHVTQHSTPFLPI----IAY 305
Query: 315 FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 374
VGVES+C+G+ CL ++ FQAL+DSG+SFTFLP E+Y +VV +FDK V++ RI LQ +SW
Sbjct: 306 MVGVESFCVGSLCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQ-SSW 364
Query: 375 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACF 421
+YCYNASS+E++ +P ++L FS+NQ+F+++N IF P ++ ++ F
Sbjct: 365 EYCYNASSQELVNIPPLKLAFSRNQTFLIQNPIFYDPASQEQEYTIF 411
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 232/400 (58%), Positives = 302/400 (75%), Gaps = 9/400 (2%)
Query: 9 MLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSN 68
++ +L+D S V+FSS+L+HRFSDE K +S+ ++S SWP+K S++Y ++L+++
Sbjct: 21 LVMASLLIDKSAEVTFSSRLIHRFSDEVKALRVSRKDSLSY--SWPEKKSMDYYQILVNS 78
Query: 69 DWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALD 128
D++RQK ++ Q Q LFPS+GS+T G+ F WLHYTWIDIGTP+VSFLVALD
Sbjct: 79 DFQRQKMKLGPQY-------QFLFPSQGSKTMSLGDDFGWLHYTWIDIGTPHVSFLVALD 131
Query: 129 AGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSL 188
AGS+LLWVPC C+QCAPLSASYY+SLDR+L+EY PS SS+SK++SCSH LC+ +C S
Sbjct: 132 AGSDLLWVPCDCLQCAPLSASYYSSLDRDLNEYSPSHSSTSKHLSCSHQLCELGPNCNSP 191
Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 248
K PCPY DY TE+TSSSG LV+DILHLAS +A SV++ V+IGCG KQ+G YLDG
Sbjct: 192 KQPCPYSMDYYTENTSSSGLLVEDILHLASNGDNALSYSVRAPVVIGCGMKQSGGYLDGV 251
Query: 249 APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIG 308
APDG+MGLGL ++SVPS LAKAGLI+NSFS+CFDE+DSG +FFGDQGP TQQST FL +
Sbjct: 252 APDGLMGLGLAEISVPSFLAKAGLIRNSFSMCFDEDDSGRIFFGDQGPTTQQSTPFLTLD 311
Query: 309 EKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 368
Y Y VGVE +C+G+SCL Q+ F+ALVD+G SFTFLP +Y + +FD+ V++ S
Sbjct: 312 GNYTTYVVGVEGFCVGSSCLKQTSFRALVDTGTSFTFLPNGVYERITEEFDRQVNATISS 371
Query: 369 LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
G WKYCY +SS + KVP ++LIF N SFV+ N +F
Sbjct: 372 FNGYPWKYCYKSSSNHLTKVPSVKLIFPLNNSFVIHNPVF 411
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 484 bits (1245), Expect = e-134, Method: Compositional matrix adjust.
Identities = 235/408 (57%), Positives = 306/408 (75%), Gaps = 10/408 (2%)
Query: 1 MVNLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVE 60
M + M +L++ A FS++L+HRFSDE K ++SG ++ SWP+ ++E
Sbjct: 1 MAARFLVAMSVVVLLIESCMAAMFSARLIHRFSDEVKAFRAARSG---LSGSWPEWRTME 57
Query: 61 YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPN 120
Y ++L+ +DW+RQK + S+ Q LFPSEGS+T FGN + WLHYTWIDIGTPN
Sbjct: 58 YYKMLVRSDWERQKVML-------GSKYQFLFPSEGSKTMSFGNDYGWLHYTWIDIGTPN 110
Query: 121 VSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
+SFLVALDAGS+LLW+PC CIQCAPLSASYY SLDR+L++Y PS SS+SK++SCSH LC+
Sbjct: 111 ISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLSCSHQLCE 170
Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
S +C S K CPY +Y +E+TSSSG L++DILHL S A SSV++ VIIGCG +Q
Sbjct: 171 SSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSVRAPVIIGCGMRQ 230
Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
TG YLDG APDG+MGLGLG++SVPS L+KAGL++NSFS+CF+++DSG +FFGDQG ATQQ
Sbjct: 231 TGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDSGRIFFGDQGLATQQ 290
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
+T FLP KY+ Y VGVE+ CIG+SC+ Q+ F+ALVDSGASFTFLP E Y VV +FDK
Sbjct: 291 TTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFRALVDSGASFTFLPDESYRNVVDEFDK 350
Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
V++ R S +G W+YCY +SS+E+LK P + L F+ N SFVV N +F
Sbjct: 351 QVNATRFSFEGYPWEYCYKSSSKELLKNPSVILKFALNNSFVVHNPVF 398
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 480 bits (1236), Expect = e-133, Method: Compositional matrix adjust.
Identities = 228/417 (54%), Positives = 309/417 (74%), Gaps = 12/417 (2%)
Query: 1 MVNLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISK-SGNVSVADSWPKKNSV 59
M N + + + ++ S A++ S LVHRFSDEAK W S+ +GNVS A WP NS+
Sbjct: 1 MANCALLLLFIASLFVNCSLALTLSLNLVHRFSDEAKSLWESRRTGNVS-AKFWPPTNSL 59
Query: 60 EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
+Y ++L+ D KR++ + S+ +LFPSEGSQ FFGN+F WLHYTWID+GTP
Sbjct: 60 KYFQMLMDYDLKRRRLNI-------GSKYDVLFPSEGSQVIFFGNEFNWLHYTWIDLGTP 112
Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
+V FLVALD GS+LLWVPC CIQCAPLSA+YY+ LDR+LSEY+P+ SS+SK++ C H LC
Sbjct: 113 SVPFLVALDVGSDLLWVPCDCIQCAPLSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLC 172
Query: 180 KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 239
++CKS DPC Y DY +++TS+SG++++D L L SFSKH S +Q+SV+ GCGRK
Sbjct: 173 AWSTTCKSANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGRK 232
Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
Q+GSYLDGAAPDGVMGLG G++SVP+LLA+ GL++N+FS+CFD N SG + FGD GPATQ
Sbjct: 233 QSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPATQ 292
Query: 300 QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFD 359
Q+T FLP+ ++ AYF+GVES+C+G+SCL +SGFQALVDSG+SFT+LP E+Y ++V +FD
Sbjct: 293 QTTQFLPLFGEFAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFD 352
Query: 360 KL--VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
K V++ RI L+ W YCYN S+ +P M+L+F NQ F + + ++ P N+
Sbjct: 353 KQVKVNATRIVLRELPWNYCYNISTLVSFNIPSMQLVFPLNQIF-IHDPVYVLPANQ 408
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 480 bits (1236), Expect = e-133, Method: Compositional matrix adjust.
Identities = 232/388 (59%), Positives = 299/388 (77%), Gaps = 10/388 (2%)
Query: 21 AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQ 80
A FS++L+HRFSDE K ++SG ++ SWP+ ++EY ++L+ +DW+RQK +
Sbjct: 2 AAMFSARLIHRFSDEVKAFRAARSG---LSGSWPEWRTMEYYKMLVRSDWERQKVML--- 55
Query: 81 SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
S+ Q LFPSEGS+T FGN + WLHYTWIDIGTPN+SFLVALDAGS+LLW+PC C
Sbjct: 56 ----GSKYQFLFPSEGSKTMSFGNDYGWLHYTWIDIGTPNISFLVALDAGSDLLWIPCDC 111
Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
IQCAPLSASYY SLDR+L++Y PS SS+SK++SCSH LC+S +C S K CPY +Y +
Sbjct: 112 IQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYS 171
Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
E+TSSSG L++DILHL S A SSV++ VIIGCG +QTG YLDG APDG+MGLGLG+
Sbjct: 172 ENTSSSGLLIEDILHLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGE 231
Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
+SVPS L+KAGL++NSFS+CF+++DSG +FFGDQG ATQQ+T FLP KY+ Y VGVE+
Sbjct: 232 ISVPSFLSKAGLVKNSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEA 291
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
CIG+SC+ Q+ F+ALVDSGASFTFLP E Y VV +FDK V++ R S +G W+YCY +
Sbjct: 292 CCIGSSCIKQTSFRALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKS 351
Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
SS+E+LK P + L F+ N SFVV N +F
Sbjct: 352 SSKELLKNPSVILKFALNNSFVVHNPVF 379
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 473 bits (1218), Expect = e-131, Method: Compositional matrix adjust.
Identities = 228/389 (58%), Positives = 291/389 (74%), Gaps = 8/389 (2%)
Query: 20 DAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKL 79
+ +FSS+L+HRFS E KE +S+ G+V+ WP+K S EY ++L+S+D KRQK ++
Sbjct: 16 ELATFSSRLIHRFSKEYKEVSVSRGGDVN-GTWWPEKKSKEYYQILVSSDLKRQKLKL-- 72
Query: 80 QSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ 139
QLLFPS+GS+T GN F WLHYTWIDIGTP+VSF+VALD+GS+L WVPC
Sbjct: 73 -----GPHYQLLFPSQGSKTMSLGNDFGWLHYTWIDIGTPHVSFMVALDSGSDLFWVPCD 127
Query: 140 CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYS 199
C+QCAPLSAS+Y+SLDR+LSEY PS SS+SK +SCSH LC +CK+ K CPY +Y
Sbjct: 128 CVQCAPLSASHYSSLDRDLSEYSPSQSSTSKQLSCSHRLCDMGPNCKNPKQSCPYSINYY 187
Query: 200 TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 259
TE TSSSG LV+DI+HLAS +SV++ VIIGCG KQ+G YLDG APDG++GLGL
Sbjct: 188 TESTSSSGLLVEDIIHLASGGDDTLNTSVKAPVIIGCGMKQSGGYLDGVAPDGLLGLGLQ 247
Query: 260 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 319
++SVPS LAKAGLIQNSFS+CF+E+DSG +FFGDQGPATQQS FL + Y Y VGVE
Sbjct: 248 EISVPSFLAKAGLIQNSFSMCFNEDDSGRIFFGDQGPATQQSAPFLKLNGNYTTYIVGVE 307
Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
C+G SCL QS F ALVDSG SFTFLP +++ + +FD V++ R S +G SWKYCY
Sbjct: 308 VCCVGTSCLKQSSFSALVDSGTSFTFLPDDVFEMIAEEFDTQVNASRSSFEGYSWKYCYK 367
Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
SS+++ K+P +RLIF +N SF+V+N +F
Sbjct: 368 TSSQDLPKIPSLRLIFPQNNSFMVQNPVF 396
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 232/415 (55%), Positives = 308/415 (74%), Gaps = 9/415 (2%)
Query: 21 AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQ 80
+++F+S+++HRFS+E K S S N SV SWP+K S+EY + L+S D++RQK ++
Sbjct: 21 SITFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGDFRRQKMKL--- 77
Query: 81 SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
SR QLLFPSEGS+T GN F WLHYTWIDIGTP+VSFLVALDAGS+LLWVPC C
Sbjct: 78 ----GSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNC 133
Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
IQCAPLSASYY SLD++L+EY PSSSS+SK++SCSH LC S SC+S K CPY+ DY T
Sbjct: 134 IQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYIT 193
Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
E+TSSSG L+ D+LHL+S +++ ++Q+ VI+GCG KQ+G YL G APDG+ GLGLG+
Sbjct: 194 ENTSSSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGE 253
Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
+SV S LAK L+QNSFS+CF+E+ SG +FFGD+GPA+QQ+TSF+P+ KY+ Y VGVE+
Sbjct: 254 ISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEA 313
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYN 379
CI NSCL Q+ F+AL+DSG SFT+LP E Y +V++FDK L ++ +S +G WKYCY
Sbjct: 314 CCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYK 373
Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
S++ M KVP + L+F N SFVV + +F ++ CF+ + + GIL
Sbjct: 374 ISADAMPKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDI-GIL 427
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 227/389 (58%), Positives = 297/389 (76%), Gaps = 8/389 (2%)
Query: 21 AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQ 80
+++F+S+++HRFS+E K S S N SV SWP+K S+EY + L+S D++RQK ++
Sbjct: 21 SITFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGDFRRQKMKL--- 77
Query: 81 SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
SR QLLFPSEGS T GN F WLHYTWIDIGTP+VSFLVALDAGS+LLWVPC C
Sbjct: 78 ----GSRFQLLFPSEGSXTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNC 133
Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
IQCAPLSASYY SLD++L+EY PSSSS+SK++SCSH LC S SC+S K CPY+ DY T
Sbjct: 134 IQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYIT 193
Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
E+TSSSG L+ D+LHL+S +++ ++Q+ VI+GCG KQ+G YL G APDG+ GLGLG+
Sbjct: 194 ENTSSSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGE 253
Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
+SV S LAK L+QNSFS+CF+E+ SG +FFGD+GPA+QQ+TSF+P+ KY+ Y VGVE+
Sbjct: 254 ISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEA 313
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYN 379
CI NSCL Q+ F+AL+DSG SFT+LP E Y +V++FDK L ++ +S +G WKYCY
Sbjct: 314 CCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYK 373
Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
S++ M KVP + L+F N SFVV + +F
Sbjct: 374 ISADAMPKVPSVTLLFPLNNSFVVHDPVF 402
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 222/412 (53%), Positives = 288/412 (69%), Gaps = 14/412 (3%)
Query: 22 VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQS 81
++FS++LVHRF+DE K WP + S+ Y ++LL+ D R+K +V
Sbjct: 22 ITFSARLVHRFADEMKPV-------RPPTGYWPDQRSMRYYQMLLTGDILRRKIKV---- 70
Query: 82 NNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCI 141
+R QLLFPS GS+T GN F WLHYTWIDIGTP+ SFLVALDAGS+LLW+PC C+
Sbjct: 71 --GGTRYQLLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCV 128
Query: 142 QCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE 201
QCAPLS+SYY++LDR+L+EY PS S SSK++SCSH LC S+CKS + CPY+ Y +E
Sbjct: 129 QCAPLSSSYYSNLDRDLNEYSPSRSLSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSE 188
Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
+TSSSG LV+DILHL S + SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+
Sbjct: 189 NTSSSGLLVEDILHLQSGGTLS-NSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGES 247
Query: 262 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 321
SVPS LAK+GLI SFS+CF+E+DSG +FFGDQGP +QQSTSFLP+ Y Y +GVES
Sbjct: 248 SVPSFLAKSGLIHYSFSLCFNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYIIGVESC 307
Query: 322 CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
CIGNSCL + F+A VDSG SFTFLP +Y + +FD+ V+ R S +G+ W+YCY S
Sbjct: 308 CIGNSCLKMTSFKAQVDSGTSFTFLPGHVYGAITEEFDQQVNGSRSSFEGSPWEYCYVPS 367
Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGI 433
S+++ KVP L+F +N SFVV + +F F NE C + E + I
Sbjct: 368 SQDLPKVPSFTLMFQRNNSFVVYDPVFVFYGNEGVIGFCLAILPTEGDMGTI 419
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 218/393 (55%), Positives = 281/393 (71%), Gaps = 14/393 (3%)
Query: 22 VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQS 81
++FS++LVHRF+DE K WP + S+ Y +LL+ D R+K +V
Sbjct: 21 ITFSARLVHRFADEMKPV-------RPPTGYWPDRWSMGYYRMLLTGDILRRKIKV---- 69
Query: 82 NNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCI 141
+R QLLFPS GS+T GN F WLHYTWIDIGTP+ SFLVALDAGS+LLW+PC C+
Sbjct: 70 --GGARYQLLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCV 127
Query: 142 QCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE 201
QCAPLS+SYY++LDR+L+EY PS S SSK++SCSH LC S+CKS + CPY+ Y +E
Sbjct: 128 QCAPLSSSYYSNLDRDLNEYSPSRSLSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSE 187
Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
+TSSSG LV+DILHL S + SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+
Sbjct: 188 NTSSSGLLVEDILHLQSGGSLS-NSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGES 246
Query: 262 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 321
SVPS LAK+GLI +SFS+CF+E+DSG +FFGDQGP QQSTSFLP+ Y Y +GVES
Sbjct: 247 SVPSFLAKSGLIHDSFSLCFNEDDSGRIFFGDQGPTIQQSTSFLPLDGLYSTYIIGVESC 306
Query: 322 CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
C+GNSCL + F+ VDSG SFTFLP +Y + +FD+ V+ R S +G+ W+YCY S
Sbjct: 307 CVGNSCLKMTSFKVQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSRSSFEGSPWEYCYVPS 366
Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
S+E+ KVP + L F +N SFVV + +F F NE
Sbjct: 367 SQELPKVPSLTLTFQQNNSFVVYDPVFVFYGNE 399
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 225/436 (51%), Positives = 300/436 (68%), Gaps = 20/436 (4%)
Query: 1 MVNLVAICMLFGCILL-DGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSV 59
M + A +LF L+ + S A FSS+L+HRFSDE + ++ S+P+K S
Sbjct: 1 MASRSAFILLFILSLVSEKSLASLFSSRLIHRFSDEGR-------ASIKSPGSFPEKRSF 53
Query: 60 EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
EY LL S D +RQK N ++ Q L PSEGS+T GN F WLHYTWIDIGTP
Sbjct: 54 EYYRLLTSIDSRRQKM-------NLGAKFQSLVPSEGSKTISPGNYFGWLHYTWIDIGTP 106
Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSL-DRNLSEYDPSSSSSSKNVSCSHPL 178
+VSFLVALD+GS+LLW+PC C+QCAPLS++YY+SL ++L+E+DPS+S++SK CSH L
Sbjct: 107 SVSFLVALDSGSDLLWIPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKL 166
Query: 179 CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 238
C+S +C+S K+ CPY Y++E+TSSSG LV+D+LHLA +S +A SSV++ V++GCG
Sbjct: 167 CESAPACESPKEQCPYTVTYASENTSSSGLLVEDVLHLA-YSANA-SSSVKARVVVGCGE 224
Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPAT 298
KQ+G +L G APDGVMGLG G++SVPS LAKAGL++NSFS+CFDE DSG ++FGD GP+T
Sbjct: 225 KQSGEFLKGIAPDGVMGLGPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPST 284
Query: 299 QQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
QQST FLP ++ AYFVGVE C+GNSCL QS F L+DSG SFTFLP EIY EV ++
Sbjct: 285 QQSTRFLPYKNEFVAYFVGVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEI 344
Query: 359 DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDH 418
D +++ ++G W+YCY S E KVP ++L FS N +FV+ +F +E
Sbjct: 345 DSHINATVKKIEGGPWEYCYETSFEP--KVPAIKLKFSSNNTFVIHKPLFVLQRSEGLVQ 402
Query: 419 ACFSYFTLEYNFTGIL 434
C E G++
Sbjct: 403 FCLPISASEEGTGGVI 418
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 213/388 (54%), Positives = 273/388 (70%), Gaps = 14/388 (3%)
Query: 22 VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQS 81
+FS KL HRFS+E K V D WP + ++ Y E LL ND+ R K
Sbjct: 25 TTFSVKLFHRFSEEMKPV------QVQTGD-WPDRRTLHYHEKLLRNDFLRHKI------ 71
Query: 82 NNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCI 141
N +R++LLFPS+GS+T FGN F WLHYTWIDIGTP+ SFLVALDAGS+LLWVPC CI
Sbjct: 72 NLGGARHKLLFPSQGSKTMSFGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWVPCDCI 131
Query: 142 QCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP-CPYIADYST 200
CAPLSAS+Y++LDR+L+EY PS S SSK++SCSH LC S+CK+ K CPY +Y +
Sbjct: 132 HCAPLSASFYSNLDRDLNEYSPSRSLSSKHLSCSHRLCDMGSNCKTSKQQQCPYTINYLS 191
Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
++TSSSG LV+DI HL S SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+
Sbjct: 192 DNTSSSGLLVEDIFHLQSGDGSTSNSSVQAPVVVGCGMKQSGGYLDGTAPDGLIGLGPGE 251
Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
SVPS LAK+GLI++SFS+CF+E+DSG +FFGDQG QQST FL + + Y VGVE+
Sbjct: 252 SSVPSFLAKSGLIRDSFSLCFNEDDSGRLFFGDQGSTVQQSTPFLLVDGMFSTYIVGVET 311
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
CIGNSC + F A DSG SFTFLP Y + +FDK V++ R + QG+ W+YCY
Sbjct: 312 CCIGNSCPKVTSFNAQFDSGTSFTFLPGHAYGAIAEEFDKQVNATRSTFQGSPWEYCYVP 371
Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
SS+++ K+P + L+F +N SFVV N +F
Sbjct: 372 SSQQLPKIPTLTLMFQQNNSFVVYNPVF 399
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 215/416 (51%), Positives = 293/416 (70%), Gaps = 20/416 (4%)
Query: 4 LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
+ C+LF + + + A FSS+L+HRFSDE + + S +DS P K S+EY
Sbjct: 7 FLLFCVLF--LATEETLASLFSSRLIHRFSDEGRASIKTPSS----SDSLPNKQSLEYYR 60
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
LL +D++RQ+ N ++ Q L PSEGS+T GN F WLHYTWIDIGTP+VSF
Sbjct: 61 LLAESDFRRQRM-------NLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSF 113
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSL-DRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
LVALD GSNLLW+PC C+QCAPL+++YY+SL ++L+EY+PSSSS+SK CSH LC S
Sbjct: 114 LVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSA 173
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRK 239
S C+S K+ CPY +Y + +TSSSG LV+DILHL + + SSV++ V+IGCG+K
Sbjct: 174 SDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKK 233
Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
Q+G YLDG APDG+MGLG ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ Q
Sbjct: 234 QSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQ 293
Query: 300 QSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
QST FL + KY Y VGVE+ CIGNSCL Q+ F +DSG SFT+LP EIY +V ++
Sbjct: 294 QSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEI 353
Query: 359 DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
D+ +++ + +G SW+YCY +S+E KVP ++L FS N +FV+ +F F +++
Sbjct: 354 DRHINATSKNFEGVSWEYCYESSAEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQ 407
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/416 (51%), Positives = 296/416 (71%), Gaps = 13/416 (3%)
Query: 2 VNLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEY 61
V ++ +L +L+ AV+FSS+++HRFSDEAK + G SWPK+ S EY
Sbjct: 3 VGVLLWLLLAKGFVLETVIAVTFSSRIIHRFSDEAKVHLRNNGG--ENVQSWPKRGSSEY 60
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
LLL++D RQK ++ S++Q +PSEGS+T FGN F WLHYTWIDIGTPNV
Sbjct: 61 FRLLLNSDLTRQKMKL-------GSQDQSFYPSEGSKTLSFGNDFVWLHYTWIDIGTPNV 113
Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS 181
SFLVALD GS++ WVPC CI+CAPLSA++Y +LDR+L++Y PS SSSS+++ C H LC
Sbjct: 114 SFLVALDTGSDMFWVPCDCIECAPLSAAFYNALDRDLNQYSPSLSSSSRHLPCGHQLCNQ 173
Query: 182 RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
S+CK KD CPYI +Y++++TSSSG+L++D LHLA S +A ++S+Q+SVI+GCGRKQ+
Sbjct: 174 NSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHLA--SNNATKNSIQASVILGCGRKQS 231
Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ-Q 300
G +L+GAAP+G++GLG G +SVP+LLAKAGLI+NS SIC +E SG + FGDQG ATQ +
Sbjct: 232 GYFLEGAAPNGMLGLGPGSISVPALLAKAGLIRNSISICLNEKGSGRILFGDQGHATQRR 291
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
ST FL + YFVGVE +C+G+ C ++ F+A +D+G SFT+LP +Y VV +F+K
Sbjct: 292 STPFLLDDGELLNYFVGVERFCVGSFCYKETEFKAFIDTGTSFTYLPKGVYETVVAEFEK 351
Query: 361 LVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEV 415
V + RI+ Q S + CYNASS E P M+ FSKNQSF+++N S + +
Sbjct: 352 QVHATRITSQIQSDFNCCYNASSRESNNFPPMKFTFSKNQSFIIQNPFISMDQEDT 407
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 212/415 (51%), Positives = 292/415 (70%), Gaps = 20/415 (4%)
Query: 4 LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
+ C+LF + + + A FSS+++HRFSDE + + S ++S P+K S+EY
Sbjct: 7 FILFCVLF--LATEETLASVFSSRMIHRFSDEGRASIRTPSS----SESLPEKQSLEYYR 60
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
LL +D++RQ+ N ++ Q L PSEGS+T GN F WLHYTWIDIGTP+VSF
Sbjct: 61 LLAKSDFRRQRM-------NLGAKFQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSF 113
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSL-DRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
LVALD GS+LLW+PC C+QCAPL+++YY+SL ++L+EY+PSSSS+SK CSH LC S
Sbjct: 114 LVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSA 173
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRK 239
S C+S K+ CPY +Y + +TSSSG LV+DILHL + + SSV++ V+IGCG+K
Sbjct: 174 SDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKK 233
Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
Q+G YLDG APDG+MGLG ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ Q
Sbjct: 234 QSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQ 293
Query: 300 QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFD 359
QST FL + E Y VGVE+ CIGNSCL Q+ F +DSG SFT+LP EIY +V ++ D
Sbjct: 294 QSTPFLQL-ENNSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEID 352
Query: 360 KLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
+ +++ S +G SW+YCY +S E KVP ++L FS N +FV+ +F F +++
Sbjct: 353 RHINATSKSFEGVSWEYCYESSVEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQ 405
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 211/415 (50%), Positives = 288/415 (69%), Gaps = 20/415 (4%)
Query: 4 LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
+ C+LF + +G+ A FSS+L+HRFSDE + + S ++S P+K S+ Y
Sbjct: 7 FILFCVLF--LATEGTLASVFSSRLIHRFSDEGRASIKTPSS----SESLPEKQSLAYYR 60
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
LL +D++RQ+ N ++ Q L PSEGS+T GN F WLHYTWIDIGTP+VSF
Sbjct: 61 LLAKSDFRRQRM-------NLGAKFQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSF 113
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSL-DRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
LVALD GS+LLW+PC C+QCAPL+++YY+SL ++L+EY+PSSSSSSK CSH LC S
Sbjct: 114 LVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSSSKVFLCSHKLCGSA 173
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRK 239
S C S K+ C Y Y + +TSSSG LV+DILHL + + SSV++ V++GCG+K
Sbjct: 174 SDCDSPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVVGCGKK 233
Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
Q+G YLDG APDG+MGLG ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ Q
Sbjct: 234 QSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQ 293
Query: 300 QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFD 359
QS FL + E Y VGVE+ CIGNSCL Q+ F +DSG SFT+LP EIY +V ++ D
Sbjct: 294 QSAPFLQL-ENNSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEID 352
Query: 360 KLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
+ +++ S +G SW+YCY +S E KVP ++L FS N +FV+ +F F +++
Sbjct: 353 RHINATSKSFEGVSWEYCYESSVEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQ 405
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 203/399 (50%), Positives = 268/399 (67%), Gaps = 18/399 (4%)
Query: 23 SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
+FSS++VHR SDEA+ + G WP++ S Y LL +D +RQK R+
Sbjct: 26 TFSSRMVHRLSDEARLEAGPRMG------LWPQRGSGGYYRALLRSDLQRQKRRL----- 74
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
+ +NQLL S+G T GN WL+Y W+D+GTP SFLVALD GS+L WVPC CIQ
Sbjct: 75 --AGKNQLLSLSKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQ 132
Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
CAPLS SY +LDR+L Y P+ S++S+++ CSH LC+ S C + K PC Y DY +E+
Sbjct: 133 CAPLS-SYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCTYNIDYFSEN 191
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
T+SSG L++D LHL S HAP V +SVIIGCGRKQ+G YLDG APDG++GLG+ D+S
Sbjct: 192 TTSSGLLIEDSLHLNSREGHAP---VNASVIIGCGRKQSGDYLDGIAPDGLLGLGMADIS 248
Query: 263 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC 322
VPS LA+AGL++NSFS+CF E+ SG +FFGDQG ++QQST F+P+ K Y V V+ C
Sbjct: 249 VPSFLARAGLVRNSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAVNVDKSC 308
Query: 323 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
IG+ CL S FQALVDSG SFT LP ++Y +FDK +++ R+ + ++WKYCY+AS
Sbjct: 309 IGHKCLEGSSFQALVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKYCYSASP 368
Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACF 421
EM VP + L F+ N+SF N I F +E G A F
Sbjct: 369 LEMPDVPTIILAFAANKSFQAVNPILPF-NDEQGALARF 406
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 199/397 (50%), Positives = 268/397 (67%), Gaps = 17/397 (4%)
Query: 25 SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
S+++VHR SDEA+ + WP++ S +Y L+ +D +RQK RV
Sbjct: 29 SARMVHRLSDEAR-----LAAGARGGRRWPRRGSGDYFRALVRSDLQRQKRRV------- 76
Query: 85 SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
+ QLL S+G GN WL+YTW+D+GTPN SFLVALD GS+L WVPC CIQCA
Sbjct: 77 GGKYQLLSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCA 136
Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
PLS SY+ SLDR+L Y PS S++S+++ CSH LC S C + K PCPY DY +E+T+
Sbjct: 137 PLS-SYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTT 195
Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
SSG L++D+LHL S HAP V +SVIIGCG+KQ+GSYL+G APDG++GLG+ D+SVP
Sbjct: 196 SSGLLIEDMLHLDSREGHAP---VNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVP 252
Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
S LA+AGL++NSFS+CF ++DSG +FFGDQG TQQST F+P+ K Y V V+ YCIG
Sbjct: 253 SFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIG 312
Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
+ C +GFQALVD+G SFT LP + Y + ++FDK +++ R S S++YCY+ E
Sbjct: 313 HKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLE 372
Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACF 421
M VP + L F++N+SF N I F + + G+ A F
Sbjct: 373 MPDVPTITLTFAENKSFQAVNPILPFNDRQ-GEFAVF 408
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 199/397 (50%), Positives = 268/397 (67%), Gaps = 17/397 (4%)
Query: 25 SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
S+++VHR SDEA+ + WP++ S +Y L+ +D +RQK RV
Sbjct: 29 SARMVHRLSDEAR-----LAAGARGGRRWPRRGSGDYFRALVRSDLQRQKRRV------- 76
Query: 85 SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
+ QLL S+G GN WL+YTW+D+GTPN SFLVALD GS+L WVPC CIQCA
Sbjct: 77 GGKYQLLSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCA 136
Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
PLS SY+ SLDR+L Y PS S++S+++ CSH LC S C + K PCPY DY +E+T+
Sbjct: 137 PLS-SYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTT 195
Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
SSG L++D+LHL S HAP V +SVIIGCG+KQ+GSYL+G APDG++GLG+ D+SVP
Sbjct: 196 SSGLLIEDMLHLDSREGHAP---VNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVP 252
Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
S LA+AGL++NSFS+CF ++DSG +FFGDQG TQQST F+P+ K Y V V+ YCIG
Sbjct: 253 SFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIG 312
Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
+ C +GFQALVD+G SFT LP + Y + ++FDK +++ R S S++YCY+ E
Sbjct: 313 HKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLE 372
Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACF 421
M VP + L F++N+SF N I F + + G+ A F
Sbjct: 373 MPDVPTITLTFAENKSFQAVNPILPFNDRQ-GEFAVF 408
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 390 bits (1001), Expect = e-106, Method: Compositional matrix adjust.
Identities = 194/394 (49%), Positives = 258/394 (65%), Gaps = 22/394 (5%)
Query: 21 AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQ 80
+ + S+++VHR SDEA+ WP+ S Y L+ +D +RQK
Sbjct: 71 SATLSTRMVHRLSDEARLAAGPHGAR------WPRHGSGGYYRALVRSDLQRQK------ 118
Query: 81 SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
++QLL SE GN F WL+YTW+D+GTPN SF+VALD GS+L WVPC C
Sbjct: 119 -----RKHQLLSVSEAGGIFSPGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLFWVPCDC 173
Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
I+CAPL A Y +LDR+L Y P+ S++S+++ CSH LC S C S K PCPY DY
Sbjct: 174 IECAPL-AGYRETLDRDLGIYKPAESTTSRHLPCSHELCPPGSGCSSPKQPCPYSTDYLQ 232
Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
E+T+SSG L++DILHL S HAP V++SV+IGCGRKQ+GSYLDG APDG++GLG+ D
Sbjct: 233 ENTTSSGLLIEDILHLDSRESHAP---VKASVVIGCGRKQSGSYLDGIAPDGLLGLGMAD 289
Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
+SVPS LA+AGL++NSFS+CF E DSG +FFGDQG + QQST F+P+ KY Y V V+
Sbjct: 290 ISVPSFLARAGLVRNSFSMCFKE-DSGRIFFGDQGVSIQQSTPFVPLYGKYQTYAVNVDK 348
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
C+G+ C + F+ALVDSG SFT LP +Y V V+FDK V + RI+ + S++YCY+A
Sbjct: 349 SCVGHKCFEATSFEALVDSGTSFTALPLNVYKAVAVEFDKQVHAPRITQEDASFEYCYSA 408
Query: 381 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
S +M VP + L F+ N+SF N + E
Sbjct: 409 SPLKMPDVPTVTLTFAANKSFQAVNPTIVLKDGE 442
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium distachyon]
Length = 627
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 192/390 (49%), Positives = 262/390 (67%), Gaps = 17/390 (4%)
Query: 25 SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
S+++V+R SDEA+ ++ WP++ S +Y L+ +D +RQK R+
Sbjct: 135 STRMVYRLSDEARMAAGTRGAR------WPRRGSGDYYRSLVRSDLQRQKRRL------G 182
Query: 85 SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
++QLL S+ GN F WL+YTW+D+GTPN SF+VALD GS+L W+PC CI+CA
Sbjct: 183 GGKHQLLSFSKDGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLFWIPCDCIECA 242
Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
PLS Y+ SLDR+L Y P+ S++S+++ CSH LC S C + K PCPY Y E+T+
Sbjct: 243 PLSG-YHGSLDRDLGIYKPAESTTSRHLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTT 301
Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
SSG LV+DILHL S HAP V++SVIIGCGRKQ+GSYLDG APDG++GLG+ D+SVP
Sbjct: 302 SSGLLVEDILHLDSRESHAP---VKASVIIGCGRKQSGSYLDGIAPDGLLGLGMADISVP 358
Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
S LA+AGL++NSFS+CF + DSG +FFGDQG +TQQST F+P+ K Y V V+ C+G
Sbjct: 359 SFLARAGLVRNSFSMCFTK-DSGRIFFGDQGVSTQQSTPFVPLYGKLQTYTVNVDKSCVG 417
Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
+ C + FQA+VDSG SFT LP +IY V ++FDK V++ R+ + S+ YCY+AS
Sbjct: 418 HKCFESTSFQAIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEATSFDYCYSASPLV 477
Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
M VP + L F+ N+SF N F + E
Sbjct: 478 MPDVPTVTLTFAGNKSFQPVNPTFLLHDEE 507
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 189/390 (48%), Positives = 256/390 (65%), Gaps = 21/390 (5%)
Query: 25 SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
SS++VHR SDEA+ + G WP++ S EY L+ +D +RQK R+ + S
Sbjct: 28 SSRMVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL-- 79
Query: 85 SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
S+G T GN WL+Y W+D+GTP SFLVALD GS+L WVPC CIQCA
Sbjct: 80 ---------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCA 130
Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
PLS Y +LDR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+
Sbjct: 131 PLSG-YRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTT 189
Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
SSG L++D LHL H P V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVP
Sbjct: 190 SSGLLIEDTLHLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVP 246
Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
S LA+AGL+QNSFS+CF E+ SG +FFGDQG +QQST F+P+ K Y V V+ CIG
Sbjct: 247 SFLARAGLVQNSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIG 306
Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
+ CL + F+ALVDSG SFT LP ++Y ++FDK +++ R+ + +WKYCY+AS E
Sbjct: 307 HKCLEGTSFKALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLE 366
Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
M VP + L F+ ++S N I F + +
Sbjct: 367 MPDVPTITLTFAADKSLQAVNPILPFNDKQ 396
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 189/390 (48%), Positives = 256/390 (65%), Gaps = 21/390 (5%)
Query: 25 SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
SS++VHR SDEA+ + G WP++ S EY L+ +D +RQK R+ + S
Sbjct: 28 SSRMVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL-- 79
Query: 85 SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
S+G T GN WL+Y W+D+GTP SFLVALD GS+L WVPC CIQCA
Sbjct: 80 ---------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCA 130
Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
PLS Y +LDR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+
Sbjct: 131 PLSG-YRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTT 189
Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
SSG L++D LHL H P V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVP
Sbjct: 190 SSGLLIEDTLHLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVP 246
Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
S LA+AGL+QNSFS+CF E+ SG +FFGDQG +QQST F+P+ K Y V V+ CIG
Sbjct: 247 SFLARAGLVQNSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIG 306
Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
+ CL + F+ALVDSG SFT LP ++Y ++FDK +++ R+ + +WKYCY+AS E
Sbjct: 307 HKCLEGTSFKALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLE 366
Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
M VP + L F+ ++S N I F + +
Sbjct: 367 MPDVPTITLTFAADKSLQAVNPILPFNDKQ 396
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/390 (48%), Positives = 255/390 (65%), Gaps = 21/390 (5%)
Query: 25 SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
SS++VHR SDEA+ + G WP++ S EY L+ +D +RQK R+ + S
Sbjct: 28 SSRMVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL-- 79
Query: 85 SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
S+G T GN WL+Y W+D+GTP SFLVALD GS+L WVPC CIQCA
Sbjct: 80 ---------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCA 130
Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
PLS Y +LDR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+
Sbjct: 131 PLSG-YRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTT 189
Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
SSG L++D LHL H P V +SVIIGCG+KQ+G YLDG APDG++ LG+ D+SVP
Sbjct: 190 SSGLLIEDTLHLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLALGMADISVP 246
Query: 265 SLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
S LA+AGL+QNSFS+CF E+ SG +FFGDQG +QQST F+P+ K Y V V+ CIG
Sbjct: 247 SFLARAGLVQNSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIG 306
Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
+ CL + F+ALVDSG SFT LP ++Y ++FDK +++ R+ + +WKYCY+AS E
Sbjct: 307 HKCLEGTSFKALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLE 366
Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
M VP + L F+ ++S N I F + +
Sbjct: 367 MPDVPTITLTFAADKSLQAVNPILPFNDKQ 396
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 187/387 (48%), Positives = 253/387 (65%), Gaps = 21/387 (5%)
Query: 28 LVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSR 87
+VHR SDEA+ + G WP++ S EY L+ +D +RQK R+ + S
Sbjct: 1 MVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL----- 49
Query: 88 NQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLS 147
S+G T GN WL+Y W+D+GTP SFLVALD GS+L WVPC CIQCAPLS
Sbjct: 50 ------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLS 103
Query: 148 ASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSG 207
Y +LDR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+SSG
Sbjct: 104 G-YRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSG 162
Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
L++D LHL H P V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS L
Sbjct: 163 LLIEDTLHLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFL 219
Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
A+AGL+QNSFS+CF E+ SG +FFGDQG +QQST F+P+ K Y V V+ CIG+ C
Sbjct: 220 ARAGLVQNSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKC 279
Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
L + F+ALVDSG SFT LP ++Y ++FDK +++ R+ + +WKYCY+AS EM
Sbjct: 280 LEGTSFKALVDSGTSFTSLPLDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPD 339
Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENE 414
VP + L F+ ++S N I F + +
Sbjct: 340 VPTITLTFAADKSLQAVNPILPFNDKQ 366
>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
Length = 378
Score = 277 bits (709), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 130/256 (50%), Positives = 178/256 (69%), Gaps = 3/256 (1%)
Query: 155 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
DR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+SSG L++D L
Sbjct: 3 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 62
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
HL H P V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS LA+AGL+Q
Sbjct: 63 HLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 119
Query: 275 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 334
NSFS+CF E+ SG +FFGDQG +QQST F+P+ K Y V V+ CIG+ CL + F+
Sbjct: 120 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 179
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
ALVDSG SFT LP ++Y ++FDK +++ R+ + +WKYCY+AS EM VP + L
Sbjct: 180 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 239
Query: 395 FSKNQSFVVRNHIFSF 410
F+ ++S N I F
Sbjct: 240 FAADKSLQAVNPILPF 255
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 155/401 (38%), Positives = 228/401 (56%), Gaps = 14/401 (3%)
Query: 7 ICMLFGCIL--LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLEL 64
+ M+ C+L L + A + L H+FS +A E S++G + A WP + ++E+ +
Sbjct: 11 LVMVHCCVLWMLATTFANALRMDLFHKFSKQAIEAMRSRNG-MDYAQDWPTEGTIEFQTM 69
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
L +D R TR + SS +Q + + FG LHY++IDIGTPNV FL
Sbjct: 70 LRDHDVARH-TRTARRILAASSMDQYVLIQGNATEQLFGGG---LHYSYIDIGTPNVQFL 125
Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
V LD GS+LLW+PC+C CAPLSA L+ Y PS SS++K V CS PLC+ S+
Sbjct: 126 VVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVLCSDPLCEMSST 185
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
C + D CPY +Y + +TS+SG L +D ++ F + + + V+ V +GCG+ QTGS
Sbjct: 186 CMAPTDQCPYEINYVSANTSTSGALYEDYMY---FMRESGGNPVKLPVYLGCGKVQTGSL 242
Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSF 304
L GAAP+G+MGLG D+SVP+ LA G + +SFS+C SG++ FGD+GPA Q++T
Sbjct: 243 LKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLTFGDEGPAAQRTTPI 302
Query: 305 LPIG-EKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
+P D Y V ++S +GN+ L + AL D+G SFT+L +Y + V +D +S
Sbjct: 303 IPKSVSMLDTYIVEIDSITVGNTNLLMAS-HALFDTGTSFTYLSKTVYPQFVQAYDAQMS 361
Query: 364 -SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
K + + W CY S+ +VP + L S S V
Sbjct: 362 LPKWNDPRFSKWDLCYQTSNTN-FQVPVVSLALSGGNSLDV 401
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 251 bits (640), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 156/419 (37%), Positives = 230/419 (54%), Gaps = 35/419 (8%)
Query: 4 LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
L+A+ ++ L+ +DA SF L HRFS + RW G AD WP + + EY
Sbjct: 14 LLAMAVVVVASLIAAADASSFGFDLHHRFSPVVR-RWAEARGGPLAADQWPARGTPEYYS 72
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
L +D R+ + + LL + G+ T+ G L+Y +++GTPN +F
Sbjct: 73 ALSRHDRARRAL-------AGGADDGLLTFAAGNDTYQSGT----LYYAEVELGTPNATF 121
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDR-NLSEYDPSSSSSSKNVSCSHPLCKSR 182
LVALD GS+L WVPC C QCA + ++ T D +L Y P SS+SK V+C +PLC R
Sbjct: 122 LVALDTGSDLFWVPCDCRQCATIPSANGTGQDAPSLRPYSPRRSSTSKQVACDNPLCGQR 181
Query: 183 SSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS---SVQSSVIIGCGR 238
+ C + + CPY Y + +TSSSG LV D+LHL + + P + ++Q+ V+ GCG+
Sbjct: 182 NGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHL-TRERPGPGAAGEALQAPVVFGCGQ 240
Query: 239 KQTGSYLD--GAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQG 295
QTG++LD G A DG+MGLG+G VSVPS LA +GL+ +SFS+CF ++ G V FGD G
Sbjct: 241 VQTGAFLDGGGGAVDGLMGLGMGKVSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAG 300
Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV 355
Q T F + Y V S +G+ + F A++DSG SFT+L Y ++
Sbjct: 301 SRGQAETPFT-VRSLNPTYNVSFTSIGVGSESVAAE-FAAVMDSGTSFTYLSDPEYTQLA 358
Query: 356 VKFDKLVSSKRISLQGNS-----WKYCYNASSEEM-LKVPDMRL------IFSKNQSFV 402
KF+ VS +R++ S ++YCY S + + +PD+ L +F Q F+
Sbjct: 359 TKFNSQVSERRVNFSSGSADPFPFEYCYRLSPNQTEVAMPDVSLTAKGGALFPVTQPFI 417
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 155/405 (38%), Positives = 218/405 (53%), Gaps = 16/405 (3%)
Query: 5 VAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLEL 64
V I +L + A FS ++ HRFS+ K +W +GN A +WP K S EY
Sbjct: 7 VFIVILLSILGFRSCHARIFSFQMHHRFSEPVK-KWSEGAGNGFPAGNWPAKGSFEYYAE 65
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
L D + R+ S + LL S+G+ T F + +LHYT + +GTP FL
Sbjct: 66 LAHRDRALRGRRL-------SDIDGLLTFSDGNST-FRISSLGFLHYTTVSLGTPGKKFL 117
Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
VALD GS+L WVPC C +CAP + Y S D LS Y+P SS+S+ V+C + LC R+
Sbjct: 118 VALDTGSDLFWVPCDCSRCAPTEGTTYAS-DFELSIYNPKGSSTSRKVTCDNSLCAHRNR 176
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
C CPY+ Y + +TS+SG LV+D+LHL + Q V++ V GCG+ QTGS+
Sbjct: 177 CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNR--QEFVEAYVTFGCGQVQTGSF 234
Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSF 304
LD AAP+G+ GLGL +SVPS+L+K G +SFS+CF + G + FGD+G Q+ T F
Sbjct: 235 LDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGSPDQEETPF 294
Query: 305 LPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVS 363
+ + Y + V +G + L F AL DSG SFT+L IY V+ F +
Sbjct: 295 -NLNALHPTYNITVTQVRVGTT-LIDLDFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQD 352
Query: 364 SKRISLQGNSWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRNHI 407
S+R +++CY+ S E +P M L F V + I
Sbjct: 353 SRRPPDSRIPFEFCYDMSPGENTSLIPSMSLTMKGGSQFPVYDPI 397
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 247 bits (630), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 155/395 (39%), Positives = 220/395 (55%), Gaps = 23/395 (5%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
F+ K+ HRFSD K W + N WP+K S EY L D Q R + S+
Sbjct: 26 FTFKMHHRFSDSFKN-WSGLTRN------WPEKGSFEYYAALAHRD---QMLRGRRLSDA 75
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
++S L F S+G+ T F + +LHYT +++GTP V F+VALD GS+L WVPC C +C
Sbjct: 76 DAS---LAF-SDGNST-FRISSLGFLHYTTVELGTPGVKFMVALDTGSDLFWVPCDCSRC 130
Query: 144 APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 203
AP + Y S D LS Y+P SS+SK V+C++ +C R+ C CPYI Y + T
Sbjct: 131 APTHGASYAS-DFELSIYNPRESSTSKKVTCNNDMCAQRNRCLGTFSSCPYIVSYVSAQT 189
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
S+SG LV D+LHL + + + V++ V GCG+ Q+GS+LD AAP+G+ GLG+ +SV
Sbjct: 190 STSGILVKDVLHLTT--EDGGREFVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISV 247
Query: 264 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 323
PS+L++ GLI +SFS+CF + G + FGD+G Q+ T F + + Y V V +
Sbjct: 248 PSVLSREGLIADSFSMCFGHDGIGRISFGDKGSPDQEETPF-NVNPAHPTYNVTVTQARV 306
Query: 324 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 382
G + L F AL DSG SFT++ Y+ V KF L KR ++YCY+ S
Sbjct: 307 G-TMLIDVEFTALFDSGTSFTYMVDPAYSRVSEKFHSLARDKRRPPDPRIPFEYCYDMSP 365
Query: 383 EEMLK-VPDMRLIFSKNQSFVVRNHIFSF-PENEV 415
+ VP M L + F V + I +NE+
Sbjct: 366 DANASLVPSMSLTMKGGRHFTVYDPIIVISTQNEI 400
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 244 bits (622), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 146/406 (35%), Positives = 226/406 (55%), Gaps = 18/406 (4%)
Query: 4 LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
L+ I ML + F+ ++ HRFSDE K+ W +G + +P K S EY
Sbjct: 12 LIPILMLLS---FGSCNGRIFTFEMHHRFSDEVKQ-WSDSTGRFA---KFPPKGSFEYFN 64
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
L+ DW + R L + + S + L F S+G+ T + +LHYT + +GTP + F
Sbjct: 65 ALVLRDWLIRGRR--LSESESESESSLTF-SDGNSTSRI-SSLGFLHYTTVKLGTPGMRF 120
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
+VALD GS+L WVPC C +CAP + Y S + LS Y+P S+++K V+C++ LC R+
Sbjct: 121 MVALDTGSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRN 179
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
C CPY+ Y + TS+SG L++D++HL + K+ + V++ V GCG+ Q+GS
Sbjct: 180 QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGS 237
Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS 303
+LD AAP+G+ GLG+ +SVPS+LA+ GL+ +SFS+CF + G + FGD+G + Q+ T
Sbjct: 238 FLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETP 297
Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
F + + Y + V +G + L F AL D+G SFT+L +Y V F
Sbjct: 298 F-NLNPSHPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQ 355
Query: 364 SKRISLQGN-SWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNHI 407
KR S ++YCY+ S++ +P + L N F + + I
Sbjct: 356 DKRHSPDSRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDPI 401
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 150/399 (37%), Positives = 217/399 (54%), Gaps = 35/399 (8%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
F L HRFS + RW G AD WP + + EY L +D R+
Sbjct: 36 FGFDLHHRFSPVVR-RWAEARGGPLAADRWPARGTPEYYSALSRHDRARRAL-------A 87
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
+ + LL + G+ T+ G L+Y +++GTPN +FLVALD GS+L WVPC C QC
Sbjct: 88 GGADDGLLTFAAGNDTYQSGT----LYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQC 143
Query: 144 APLSASYYTSLDR-NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYSTE 201
A + ++ T D L Y P SS+S+ V+C +PLC R+ C + + CPY Y +
Sbjct: 144 ATIPSANATGPDAPPLRPYSPRRSSTSEQVACDNPLCGRRNGCSAATNGSCPYEVQYVSA 203
Query: 202 DTSSSGYLVDDILHLASFSKHAPQS---SVQSSVIIGCGRKQTGSYLD--GAAPDGVMGL 256
+TSSSG LV D+LHL + + P + ++Q+ V+ GCG+ QTG++LD G A DG+MGL
Sbjct: 204 NTSSSGVLVQDVLHL-TRERPGPGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGL 262
Query: 257 GLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF 315
G+G VSVPS LA +GL+ +SFS+CF ++ G V FGD G Q T F + Y
Sbjct: 263 GMGKVSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFT-VRSLNPTYN 321
Query: 316 VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-- 373
V S IG+ + F A++DSG SFT+L Y ++ KF+ VS +R++ S
Sbjct: 322 VSFTSIGIGSESVAAE-FAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSAD 380
Query: 374 ---WKYCYNASSEEM-LKVPDMRL------IFSKNQSFV 402
++YCY S + + +PD+ L +F Q F+
Sbjct: 381 PFPFEYCYRLSPNQTEVAMPDVSLTAKGGALFPVTQPFI 419
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/358 (38%), Positives = 205/358 (57%), Gaps = 9/358 (2%)
Query: 54 PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW 113
P + EY L +D R+++ + L + +G+ T+ NQF +LHY
Sbjct: 54 PSPGTAEYYAALAGHDDLRRRS-LSLAAAPAPGAGGPFAFVDGNDTYRL-NQFGFLHYAV 111
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTPNV+FLVALD GS+L WVPC C++CAPLS+ Y +L ++ Y P SS+S+ V
Sbjct: 112 VALGTPNVTFLVALDTGSDLFWVPCDCLKCAPLSSPDYGNLKFDV--YSPRKSSTSRKVP 169
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
CS +C ++ C + + CPY +Y +++TSS G LV+D+++LA+ S H+ Q+ +
Sbjct: 170 CSSNMCDLQTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESGHS--KITQAPIT 227
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
GCG+ QTGS+L AAP+G++GLG+ SVPSLLA G+ NSFS+CF E+ G + FGD
Sbjct: 228 FGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCFGEDGHGRINFGD 287
Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAE 353
G A Q T L I + Y + + G + + F A+VDSG SFT L +Y E
Sbjct: 288 TGSADQLETP-LNIYKHNPYYNISIVGAMAGGKTFS-TKFSAVVDSGTSFTALSDPMYTE 345
Query: 354 VVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
+ FDK V KR + ++YCY SS+ + P++ L F V++ I +
Sbjct: 346 ITSAFDKQVKEKRNPADSSLPFEYCYTISSKGAVSPPNISLTAKGGSVFPVKDPIITI 403
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 139/386 (36%), Positives = 216/386 (55%), Gaps = 17/386 (4%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
F+ ++ HRFSDE K+ W +G +P K S EY L+ DW + R+ +
Sbjct: 29 FTFEMHHRFSDEVKQ-WSDSTGRFV---KFPPKGSFEYFNALVLRDWLIRGRRLSDSESE 84
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
+S S+G+ T + +LHYT + +GTP + F+VALD GS+L WVPC C +C
Sbjct: 85 SSLTF-----SDGNSTSRI-SSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKC 138
Query: 144 APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 203
AP + Y S + LS Y+P S+++K V+C++ LC R+ C CPY+ Y + T
Sbjct: 139 APTEGATYAS-EFELSIYNPKISTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQT 197
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
S+SG L++D++HL + K+ + V++ V GCG+ Q+GS+LD AAP+G+ GLG+ +SV
Sbjct: 198 STSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISV 255
Query: 264 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 323
PS+LA+ GL+ +SFS+CF + G + FGD+G + Q+ T F + + Y + V +
Sbjct: 256 PSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRV 314
Query: 324 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 382
G + L F AL D+G SFT+L +Y V F KR S ++YCY+ S+
Sbjct: 315 GTT-LIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSN 373
Query: 383 EEMLK-VPDMRLIFSKNQSFVVRNHI 407
+ +P + L N F + + I
Sbjct: 374 DANASLIPSLSLTMKGNSHFTINDPI 399
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 142/351 (40%), Positives = 199/351 (56%), Gaps = 14/351 (3%)
Query: 5 VAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLEL 64
V I +L + A FS ++ HRFS+ K +W +GN A +WP K S EY
Sbjct: 7 VFIVILLSILGFRSCHARIFSFQMHHRFSEPVK-KWSEGAGNGFPAGNWPAKGSFEYYAE 65
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
L D + R+ S + LL S+G+ T F + +LHYT + +GTP FL
Sbjct: 66 LAHRDRALRGRRL-------SDIDGLLTFSDGNST-FRISSLGFLHYTTVSLGTPGKKFL 117
Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
VALD GS+L WVPC C +CAP + Y S D LS Y+P SS+S+ V+C++ LC R+
Sbjct: 118 VALDTGSDLFWVPCDCSRCAPTEGTTYAS-DFELSIYNPKGSSTSRKVTCNNSLCAHRNR 176
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
C CPY+ Y + +TS+SG LV+D+LHL + + Q V++ V GCG+ QTGS+
Sbjct: 177 CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTT--EDNRQEFVEAYVTFGCGQVQTGSF 234
Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSF 304
LD AAP+G+ GLGL +SVPS+L+K G +SFS+CF + G + FGD+G Q+ T F
Sbjct: 235 LDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGGPDQEETPF 294
Query: 305 LPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV 355
+ + Y + V +G + L F AL DSG SFT+L IY V+
Sbjct: 295 -NLNALHPTYNITVTQVRVGTT-LIDLDFTALFDSGTSFTYLVDPIYTNVL 343
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 238 bits (606), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 142/399 (35%), Positives = 209/399 (52%), Gaps = 35/399 (8%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
FS K+ HRFSD+ K W SG ++ DSWP K ++EY +L +
Sbjct: 28 FSFKMHHRFSDQLKN-WSGVSGKFTLPDSWPVKGTIEYY--------------AQLAFRD 72
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWL-------------HYTWIDIGTPNVSFLVALDAG 130
R Q L +G GN + + YT + +GTP F+VALD G
Sbjct: 73 RFFRGQRLSEFDGPLAFSDGNSSFRISSLGFALFDVFFFFYTTVQLGTPGTKFMVALDTG 132
Query: 131 SNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD 190
S+L WVPC C +CAP S Y S D LS Y P SS+SK V C++ LC R C
Sbjct: 133 SDLFWVPCDCSRCAPTEGSPYAS-DFELSVYSPKKSSTSKTVPCNNNLCAQRDQCTEAFG 191
Query: 191 PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
CPY+ Y + +TS++G L++D+LHL + KH+ +Q+ + GCG+ Q+GS+LD AAP
Sbjct: 192 NCPYVVSYVSAETSTTGILIEDLLHLKTEHKHS--EPIQAYITFGCGQVQSGSFLDVAAP 249
Query: 251 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK 310
+G+ GLG+ +SVPS+L++ GL+ NSFS+CF ++ G + FGD+G Q+ T F + +
Sbjct: 250 NGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGVGRINFGDKGSLEQEETPF-NLNQL 308
Query: 311 YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
+ Y + V S +G + L + AL DSG SF++ IY+++ F R
Sbjct: 309 HPNYNITVTSIRVGTT-LIDADITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPN 367
Query: 371 GN-SWKYCYNASSEEMLKV-PDMRLIFSKNQSFVVRNHI 407
++YCYN S + + P + L F V + I
Sbjct: 368 PRIPFEYCYNMSPDANASLTPGISLTMKGGGPFPVYDPI 406
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 150/386 (38%), Positives = 214/386 (55%), Gaps = 20/386 (5%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
F+ K+ HRFSD K+ +S S + + ++P K S EY L D Q R + N
Sbjct: 28 FTFKMHHRFSDMLKD--LSDS---TTSRNFPSKGSFEYYAELAHRD---QMLRGRKLYNV 79
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
+ L F S+G+ T F + +LHYT +++GTP + F+VALD GS+L WVPC C +C
Sbjct: 80 EAP---LAF-SDGNST-FRISSLGFLHYTTVELGTPGMKFMVALDTGSDLFWVPCDCSKC 134
Query: 144 APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 203
AP Y S D LS YDP SS+SK V+C++ LC R+ C CPY+ Y + T
Sbjct: 135 APTQGVAYAS-DFELSIYDPKQSSTSKKVTCNNNLCAHRNRCLGTFSSCPYMVSYVSAQT 193
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
S+SG LV+D+LHL S + + Q S+++ V GCG+ Q+GS+L+ AAP+G+ GLG+ +SV
Sbjct: 194 STSGILVEDVLHLTS--EDSNQESIKAYVTFGCGQVQSGSFLNTAAPNGLFGLGMDQISV 251
Query: 264 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 323
PS+L++ GL +SFS+CF + G + FGD+G Q+ T F + +Y + V +
Sbjct: 252 PSILSREGLTADSFSMCFGHDGVGRISFGDKGSPDQEETPFNS-NPSHPSYNISVTQVRV 310
Query: 324 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS- 381
G + L F AL DSG SFT+L IYA V F KR ++YCY+ S
Sbjct: 311 GTT-LVDVDFTALFDSGTSFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFEYCYDMSP 369
Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHI 407
+P M L F V + I
Sbjct: 370 GANSSLIPSMSLTMKGRGHFTVFDPI 395
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 234 bits (598), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 135/361 (37%), Positives = 212/361 (58%), Gaps = 13/361 (3%)
Query: 54 PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW 113
P + EY L +D R+++ L + F ++G+ T+ N F +LHY
Sbjct: 48 PPHGTAEYYAALAGHDGLRRRS---LGVGGGGGGAEFAF-ADGNDTYRL-NDFGFLHYAV 102
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTPNV+FLVALD GS+L WVPC C++CAPL + Y SL ++ Y P+ S++S+ V
Sbjct: 103 VALGTPNVTFLVALDTGSDLFWVPCDCLKCAPLQSPNYGSLKFDV--YSPAQSTTSRKVP 160
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
CS LC +++C+S + CPY Y +++TSSSG LV+D+L+L S S A V + ++
Sbjct: 161 CSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS--AQSKIVTAPIM 218
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
GCG+ QTGS+L AAP+G++GLG+ SVPSLLA GL NSFS+CF ++ G + FGD
Sbjct: 219 FGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGD 278
Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAE 353
G + Q+ T L + ++ Y + + +G+ ++ F A+VDSG SFT L +Y +
Sbjct: 279 TGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSFTALSDPMYTQ 336
Query: 354 VVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
+ FD + S R L + +++CY+ S+ ++ P++ L F V + I + +
Sbjct: 337 ITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFPVNDPIITITD 395
Query: 413 N 413
N
Sbjct: 396 N 396
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 136/359 (37%), Positives = 201/359 (55%), Gaps = 14/359 (3%)
Query: 54 PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW 113
P + EY L +D +R R + L F ++G+ T+ N F +LHY
Sbjct: 48 PPAGTAEYYAALAGHDLRR---RSLAAAAGGGGAGNLAF-ADGNDTYRL-NDFGFLHYAV 102
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTPNV+FLVALD GS+L WVPC CI+CAPL++ Y L ++ Y P SS+S+ V
Sbjct: 103 VALGTPNVTFLVALDTGSDLFWVPCDCIKCAPLASPDYGDLKFDM--YSPRKSSTSRKVP 160
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV-QSSV 232
CS LC ++ C + + CPY Y +E+TSS G LV+D+L+L + S QS + Q+ +
Sbjct: 161 CSSSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTESG---QSKITQAPI 217
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
GCG+ Q+GS+L AAP+G++GLG+ SVPSLLA G+ NSFS+CF E+ G + FG
Sbjct: 218 TFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCFGEDGHGRINFG 277
Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYA 352
D G + Q T L I ++ Y + + +G + F A+VDSG SFT L +Y
Sbjct: 278 DTGSSDQLETP-LNIYKQNPYYNISITGAMVGGKSF-DTKFSAVVDSGTSFTALSDPMYT 335
Query: 353 EVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
E+ F+ V R L + ++YCY+ S++ + P++ L F V I +
Sbjct: 336 EITSTFNAQVKESRKHLDASMPFEYCYSISAQGAVNPPNISLTAKGGSIFPVNGPIITI 394
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 134/361 (37%), Positives = 211/361 (58%), Gaps = 13/361 (3%)
Query: 54 PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW 113
P + EY L +D R+++ L + F ++G+ T+ N F +LHY
Sbjct: 48 PPHGTAEYYAALAGHDGLRRRS---LGVGGGGGGAEFAF-ADGNDTYRL-NDFGFLHYAV 102
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTPNV+FLVALD GS+L WVPC C++CAP + Y SL ++ Y P+ S++S+ V
Sbjct: 103 VALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSPAQSTTSRKVP 160
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
CS LC +++C+S + CPY Y +++TSSSG LV+D+L+L S S A V + ++
Sbjct: 161 CSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS--AQSKIVTAPIM 218
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
GCG+ QTGS+L AAP+G++GLG+ SVPSLLA GL NSFS+CF ++ G + FGD
Sbjct: 219 FGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGD 278
Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAE 353
G + Q+ T L + ++ Y + + +G+ ++ F A+VDSG SFT L +Y +
Sbjct: 279 TGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSFTALSDPMYTQ 336
Query: 354 VVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
+ FD + S R L + +++CY+ S+ ++ P++ L F V + I + +
Sbjct: 337 ITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFPVNDPIITITD 395
Query: 413 N 413
N
Sbjct: 396 N 396
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 140/384 (36%), Positives = 217/384 (56%), Gaps = 21/384 (5%)
Query: 4 LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
L+ I ML + F+ ++ HRFSDE K+ W +G + +P K S EY
Sbjct: 12 LIPILMLLS---FGSCNGRIFTFEMHHRFSDEVKQ-WSDSTGRFA---KFPPKGSFEYFN 64
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
L+ DW + R L + + S + L F S+G+ T + +LHYT + +GTP + F
Sbjct: 65 ALVLRDWLIRGRR--LSESESESESSLTF-SDGNSTSRI-SSLGFLHYTTVKLGTPGMRF 120
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
+VALD GS+L WVPC C +CAP + Y S + LS Y+P S+++K V+C++ LC R+
Sbjct: 121 MVALDTGSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRN 179
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
C CPY+ Y + TS+SG L++D++HL + K+ + V++ V GCG+ Q+GS
Sbjct: 180 QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGS 237
Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS 303
+LD AAP+G+ GLG+ +SVPS+LA+ GL+ +SFS+CF + G + FGD+G + Q+ T
Sbjct: 238 FLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETP 297
Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
F + + Y + V +G + L F AL D+G SFT+L +Y V +
Sbjct: 298 F-NLNPSHPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTV----SESAQ 351
Query: 364 SKRISLQGN-SWKYCYNASSEEML 386
KR S ++YCY+ + +L
Sbjct: 352 DKRHSPDSRIPFEYCYDMREKLVL 375
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 231 bits (590), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 154/426 (36%), Positives = 221/426 (51%), Gaps = 45/426 (10%)
Query: 4 LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
LVA+ ++ L+ DA S L HRFS ++ W G+ A WP + S EY
Sbjct: 14 LVAVAIVAVSFLVAAGDASSVGFDLHHRFSPVVRQ-WAEARGHPFAAQDWPARGSPEYYS 72
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYW----LHYTWIDIGTP 119
L +D R L SR L ++G T GN L+Y +++GTP
Sbjct: 73 ALSRHD------RAVL------SRRALADGADGLVTFAAGNDTLQYIGSLYYAVVEVGTP 120
Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
N +FLVALD GS+L WVPC C QCA + A+ L Y P SS+SK V+C + LC
Sbjct: 121 NATFLVALDTGSDLFWVPCDCKQCASI-ANVTGQPATALRPYSPRESSTSKQVTCDNALC 179
Query: 180 KSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS------VQSSV 232
+ C + + CPY Y + +TS+SG LV D+LHL ++ P ++ +Q+ V
Sbjct: 180 DRPNGCSAATNGSCPYEVQYLSANTSTSGVLVQDVLHL---TRERPGAAAEAGEALQAPV 236
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFF 291
+ GCG+ QTG++LDGAA DG+MGLG +VSVPS+LA +GL+ +SFS+CF ++ G + F
Sbjct: 237 VFGCGQVQTGTFLDGAAFDGLMGLGRENVSVPSVLASSGLVASDSFSMCFGDDGVGRINF 296
Query: 292 GDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTE 349
GD G + Q T F Y+ F V VE+ + + F A++DSG SFT+L
Sbjct: 297 GDSGSSGQGETPFTGRRTLYNVSFTAVNVETKSVA------AEFAAVIDSGTSFTYLADP 350
Query: 350 IYAEVVVKFDKLVSSKRISLQGNS-----WKYCY--NASSEEMLKVPDMRLIFSKNQSFV 402
Y E+ F+ LV +R + S ++YCY + E L +PD+ L F
Sbjct: 351 EYTELATNFNSLVRERRTNFSSGSADPFPFEYCYALGPNQTEAL-IPDVSLTTKGGARFP 409
Query: 403 VRNHIF 408
V +
Sbjct: 410 VTQPVI 415
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 137/389 (35%), Positives = 201/389 (51%), Gaps = 23/389 (5%)
Query: 29 VHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNS--- 85
+H S RW G+ A + + EY L +D R + +
Sbjct: 33 LHHRSSPVVRRWAEARGHPGAAWWAEAEGTPEYYAALHRHDRAHLARRGLAEGDGEGLLT 92
Query: 86 -SRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
+ L F EGS LHY + +GTPN +FLVALD GS+L WVPC C QCA
Sbjct: 93 FASGNLTFRLEGS-----------LHYAEVAVGTPNATFLVALDTGSDLFWVPCDCKQCA 141
Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTE 201
P++ + +L Y P SS+SK V+C H LC+ ++C + + CPY Y +
Sbjct: 142 PIANASDLRGGPDLRPYSPGKSSTSKAVTCEHALCERPNACAAAGNSSTSCPYTVRYVSA 201
Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
+TSSSG LV+D+LHL+ + ++V + V++GCG+ QTG++LDGAA DG++GLG+ V
Sbjct: 202 NTSSSGVLVEDVLHLSREAAGGASTAVTAPVVLGCGQVQTGAFLDGAAVDGLLGLGMDKV 261
Query: 262 SVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
SVPS+L AGL+ +SFS+CF + G + FGD G Q T F + + Y + V +
Sbjct: 262 SVPSVLHAAGLVASDSFSMCFSPDGFGRINFGDSGRRGQAETPFT-VRNTHPTYNISVTA 320
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN 379
+ + F A+VDSG SFT+L Y E+ F+ V +R +L + ++YCY
Sbjct: 321 MSVSGKEVAAE-FAAIVDSGTSFTYLNDPAYTELATGFNSEVRERRANLSASIPFEYCYE 379
Query: 380 -ASSEEMLKVPDMRLIFSKNQSFVVRNHI 407
+ L VP++ L F V I
Sbjct: 380 LGRGQTELFVPEVSLTTRGGAVFPVTRPI 408
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 133/366 (36%), Positives = 207/366 (56%), Gaps = 14/366 (3%)
Query: 30 HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
HRFS +RW G+V + WP+ S +Y+ L +D +R + +
Sbjct: 39 HRFSSPV-QRWAEARGHV-LPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDKPP 96
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
L SEG+ T N +LHY + +GTP +F+VALD GS+L W+PCQC C P +++
Sbjct: 97 PLTFSEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASA 155
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
S S Y PS SS+S+ V C+ C+ R C + CPY Y + DTSSSG+L
Sbjct: 156 ASGSA----SFYIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFL 210
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
V+D+L+L++ + A +++ ++ GCG+ QTGS+LD AAP+G+ GLG+ +S+PS+LA+
Sbjct: 211 VEDVLYLST--EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQ 268
Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
GL NSF++CF + G + FGDQG + Q+ T L + ++ Y + + +GNS LT
Sbjct: 269 KGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEMTVGNS-LT 326
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
F + D+G SFT+L Y + F V + R + ++YCY+ +SSE+ ++
Sbjct: 327 DLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQ 386
Query: 388 VPDMRL 393
P + L
Sbjct: 387 TPSISL 392
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 197/321 (61%), Gaps = 9/321 (2%)
Query: 94 SEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTS 153
++G+ T+ N F +LHY + +GTPNV+FLVALD GS+L WVPC C++CAP + Y S
Sbjct: 20 ADGNDTYRL-NDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS 78
Query: 154 LDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 213
L ++ Y P+ S++S+ V CS LC +++C+S + CPY Y +++TSSSG LV+D+
Sbjct: 79 LKFDV--YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDV 136
Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
L+L S S A V + ++ GCG+ QTGS+L AAP+G++GLG+ SVPSLLA GL
Sbjct: 137 LYLTSDS--AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLA 194
Query: 274 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 333
NSFS+CF ++ G + FGD G + Q+ T L + ++ Y + + +G+ ++ F
Sbjct: 195 ANSFSMCFGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-F 252
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMR 392
A+VDSG SFT L +Y ++ FD + S R L + +++CY+ S+ ++ P++
Sbjct: 253 SAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVS 311
Query: 393 LIFSKNQSFVVRNHIFSFPEN 413
L F V + I + +N
Sbjct: 312 LTAKGGSIFPVNDPIITITDN 332
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 133/366 (36%), Positives = 207/366 (56%), Gaps = 14/366 (3%)
Query: 30 HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
HRFS +RW G+V + WP+ S +Y+ L +D +R + +
Sbjct: 39 HRFSSPV-QRWAEARGHV-LPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDKPP 96
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
L SEG+ T N +LHY + +GTP +F+VALD GS+L W+PCQC C P +++
Sbjct: 97 PLTFSEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASA 155
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
S S Y PS SS+S+ V C+ C+ R C + CPY Y + DTSSSG+L
Sbjct: 156 ASGSA----SFYIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFL 210
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
V+D+L+L++ + A +++ ++ GCG+ QTGS+LD AAP+G+ GLG+ +S+PS+LA+
Sbjct: 211 VEDVLYLST--EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQ 268
Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
GL NSF++CF + G + FGDQG + Q+ T L + ++ Y + + +GNS LT
Sbjct: 269 KGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEITVGNS-LT 326
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
F + D+G SFT+L Y + F V + R + ++YCY+ +SSE+ ++
Sbjct: 327 DLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQ 386
Query: 388 VPDMRL 393
P + L
Sbjct: 387 TPSISL 392
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 133/366 (36%), Positives = 207/366 (56%), Gaps = 14/366 (3%)
Query: 30 HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
HRFS +RW G+V + WP+ S +Y+ L +D +R + +
Sbjct: 39 HRFSSPV-QRWAEARGHV-LPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDKPP 96
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
L SEG+ T N +LHY + +GTP +F+VALD GS+L W+PCQC C P +++
Sbjct: 97 PLTFSEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASA 155
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
S S Y PS SS+S+ V C+ C+ R C + CPY Y + DTSSSG+L
Sbjct: 156 ASGSA----SFYIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFL 210
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
V+D+L+L++ + A +++ ++ GCG+ QTGS+LD AAP+G+ GLG+ +S+PS+LA+
Sbjct: 211 VEDVLYLST--EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQ 268
Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
GL NSF++CF + G + FGDQG + Q+ T L + ++ Y + + +GNS LT
Sbjct: 269 KGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEITVGNS-LT 326
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
F + D+G SFT+L Y + F V + R + ++YCY+ +SSE+ ++
Sbjct: 327 DLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQ 386
Query: 388 VPDMRL 393
P + L
Sbjct: 387 TPSISL 392
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 191/311 (61%), Gaps = 8/311 (2%)
Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
N F +LHY + +GTPNV+FLVALD GS+L WVPC C++CAP + Y SL ++ Y P
Sbjct: 56 NDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSP 113
Query: 164 SSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ S++S+ V CS LC +++C+S + CPY Y +++TSSSG LV+D+L+L S S +
Sbjct: 114 AQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQS 173
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
V + ++ GCG+ QTGS+L AAP+G++GLG+ SVPSLLA GL NSFS+CF +
Sbjct: 174 --KIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGD 231
Query: 284 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASF 343
+ G + FGD G + Q+ T L + ++ Y + + +G+ ++ F A+VDSG SF
Sbjct: 232 DGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSF 289
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
T L +Y ++ FD + S R L + +++CY+ S+ ++ P++ L F
Sbjct: 290 TALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFP 348
Query: 403 VRNHIFSFPEN 413
V + I + +N
Sbjct: 349 VNDPIITITDN 359
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 191/311 (61%), Gaps = 8/311 (2%)
Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
N F +LHY + +GTPNV+FLVALD GS+L WVPC C++CAP + Y SL ++ Y P
Sbjct: 70 NDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSP 127
Query: 164 SSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ S++S+ V CS LC +++C+S + CPY Y +++TSSSG LV+D+L+L S S +
Sbjct: 128 AQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQS 187
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
V + ++ GCG+ QTGS+L AAP+G++GLG+ SVPSLLA GL NSFS+CF +
Sbjct: 188 --KIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGD 245
Query: 284 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASF 343
+ G + FGD G + Q+ T L + ++ Y + + +G+ ++ F A+VDSG SF
Sbjct: 246 DGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSGTSF 303
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
T L +Y ++ FD + S R L + +++CY+ S+ ++ P++ L F
Sbjct: 304 TALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFP 362
Query: 403 VRNHIFSFPEN 413
V + I + +N
Sbjct: 363 VNDPIITITDN 373
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 144/402 (35%), Positives = 218/402 (54%), Gaps = 21/402 (5%)
Query: 18 GSDAVSFSS-KLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTR 76
G DA + S + HRFS + RW+ G ++ WP S Y+ L +D R
Sbjct: 23 GGDASTAPSLEFHHRFSAPLR-RWVEARGR-ALPGGWPAPGSAAYVAALAGHDRHRA--- 77
Query: 77 VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
V ++S L F +EG+ T N +LHY + +GTP +F+VALD GS+L W+
Sbjct: 78 VSAAGGSSSDAPPLTF-AEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWL 135
Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIA 196
PCQC C P + T+ + + Y P SS+SK V C+ C + C + CPY
Sbjct: 136 PCQCDGCTPPA----TAASGSATFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKM 190
Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
Y + TSSSG+LV+D+L+L++ + H PQ +++ +++GCG+ QTGS+LD AAP+G+ GL
Sbjct: 191 VYVSAGTSSSGFLVEDVLYLSTENAH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGL 248
Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
G+ +VSVPS+LA+ GL NSFS+CF + G + FGDQ + Q+ T L I ++ Y +
Sbjct: 249 GIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAI 307
Query: 317 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWK 375
+ +GN T F + D+G SFT+L Y + F V + R + ++
Sbjct: 308 TISGITVGNK-PTDMDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFE 366
Query: 376 YCYN-ASSEEMLKVPDMRLIFSKNQSFVVRN--HIFSFPENE 414
YCY+ +SSE +PD+ L F V + + S E+E
Sbjct: 367 YCYDLSSSEARFPIPDIILRTVTGSMFPVIDPGQVISIQEHE 408
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 144/402 (35%), Positives = 216/402 (53%), Gaps = 19/402 (4%)
Query: 18 GSDAVSFSS-KLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTR 76
G DA + S + HRFS + RW+ G ++ WP S Y+ L +D R
Sbjct: 23 GGDASTAPSLEFHHRFSAPLR-RWVEARGR-ALPGGWPAPGSAAYVAALAGHDRHRA--- 77
Query: 77 VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
V ++S L F +EG+ T N +LHY + +GTP +F+VALD GS+L W+
Sbjct: 78 VSAAGGSSSDAPPLTF-AEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWL 135
Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIA 196
PCQC C P + + S Y P SS+SK V C+ C + C + CPY
Sbjct: 136 PCQCDGCTPPATAASGSFQATF--YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKM 192
Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
Y + TSSSG+LV+D+L+L++ + H PQ +++ +++GCG+ QTGS+LD AAP+G+ GL
Sbjct: 193 VYVSAGTSSSGFLVEDVLYLSTENAH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGL 250
Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
G+ +VSVPS+LA+ GL NSFS+CF + G + FGDQ + Q+ T L I ++ Y +
Sbjct: 251 GIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAI 309
Query: 317 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWK 375
+ +GN T F + D+G SFT+L Y + F V + R + ++
Sbjct: 310 TISGITVGNK-PTDMDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFE 368
Query: 376 YCYN-ASSEEMLKVPDMRLIFSKNQSFVVRN--HIFSFPENE 414
YCY+ +SSE +PD+ L F V + + S E+E
Sbjct: 369 YCYDLSSSEARFPIPDIILRTVTGSMFPVIDPGQVISIQEHE 410
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 139/387 (35%), Positives = 209/387 (54%), Gaps = 18/387 (4%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
++ + HR S+ ++ S + + P+K +VEY L D R+ L+
Sbjct: 21 YTFTMHHRHSEPVRKWSHSTASGIPAP---PEKGTVEYYAELADRD------RL-LRGRK 70
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
S + L S+G+ T F + +LHYT + IGTP V F+VALD GS+L WVPC C +C
Sbjct: 71 LSQIDDGLAFSDGNST-FRISSLGFLHYTTVQIGTPGVKFMVALDTGSDLFWVPCDCTRC 129
Query: 144 APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 203
A +S + S D +L+ Y+P+ SS+SK V+C++ LC RS C CPY+ Y + +T
Sbjct: 130 AATDSSAFAS-DFDLNVYNPNGSSTSKKVTCNNSLCMHRSQCLGTLSNCPYMVSYVSAET 188
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
S+SG LV+D+LHL H V+++VI GCG+ Q+GS+LD AAP+G+ GLG+ +SV
Sbjct: 189 STSGILVEDVLHLTQEDNH--HDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISV 246
Query: 264 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 323
PS+L++ G +SFS+CF + G + FGD+G Q T F + + Y + V +
Sbjct: 247 PSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPF-NLNPSHPTYNITVTQVRV 305
Query: 324 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 382
G + L F AL DSG SFT+L Y + F V +R ++YCY+ S
Sbjct: 306 GTT-LIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSP 364
Query: 383 EEMLK-VPDMRLIFSKNQSFVVRNHIF 408
+ +P + L F V + I
Sbjct: 365 DANTSLIPSVSLTMGGGSHFAVYDPII 391
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 142/389 (36%), Positives = 211/389 (54%), Gaps = 20/389 (5%)
Query: 30 HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
HRFS + RW G ++ WP S Y+ L +D R V S
Sbjct: 35 HRFSAPLR-RWAEARGR-ALPGGWPAPGSAAYVAALAGHDRHRA---VSAAGGGGSGTPP 89
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
L F +EG+ T N +LHY + +GTP +F+VALD GS+L W+PCQC C P +
Sbjct: 90 LTF-AEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA-- 145
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
T+ + + Y P SS+SK V C+ C + C + CPY Y + TSSSG+L
Sbjct: 146 --TAASGSATFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFL 202
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
V+D+L+L++ + H PQ +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+
Sbjct: 203 VEDVLYLSTENAH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQ 260
Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
GL NSFS+CF + G + FGDQG + Q+ T L I +++ Y + + IGN T
Sbjct: 261 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETP-LNINQQHPTYAITISGITIGNKP-T 318
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
F + D+G SFT+L Y + F V + R + ++YCY+ +SSE
Sbjct: 319 DLDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFP 378
Query: 388 VPDMRLIFSKNQSFVVRN--HIFSFPENE 414
+PD+ L F V + + S E+E
Sbjct: 379 IPDIILRTVSGSLFPVIDPGQVISIQEHE 407
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 224 bits (571), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 137/385 (35%), Positives = 214/385 (55%), Gaps = 18/385 (4%)
Query: 30 HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
HR+S +E W P + EY L +D +R ++ + +
Sbjct: 35 HRYSATVRE-WAGH-------HRAPPAGTAEYYAALARHDLRR-RSLAAGPAAGGGGGGE 85
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
+ F ++G+ T+ N+ +LHY + +GTPNV+FLVALD GS+L WVPC CI CAPL +
Sbjct: 86 VAF-ADGNDTYRL-NELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAPLVSP 143
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
Y D Y P SS+S+ V CS LC +S+C+S CPY +Y +++TSS+G L
Sbjct: 144 NYR--DLKFDTYSPQKSSTSRKVPCSSNLCDLQSACRSASSSCPYSIEYLSDNTSSTGVL 201
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
V+D+L+L +++ V + + GCGR QTGS+L AAP+G++GLG+ +SVPSLLA
Sbjct: 202 VEDVLYL--ITEYGQPKIVTAPITFGCGRIQTGSFLGSAAPNGLLGLGMDSISVPSLLAS 259
Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
G+ NSFS+CF ++ G + FGD G + QQ T L I ++ Y + + +G+
Sbjct: 260 EGVAANSFSMCFGDDGRGRINFGDTGSSDQQETP-LNIYKQNPYYNISITGAMVGSKSF- 317
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKV 388
+ F A+VDSG SFT L +Y+E+ F+ V K L + +++CY+ S + +
Sbjct: 318 NTNFNAIVDSGTSFTALSDPMYSEITSSFNSQVQDKPTQLDSSLPFEFCYSISPKGSVNP 377
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPEN 413
P++ L+ F V + I + ++
Sbjct: 378 PNISLMAKGGSIFPVNDPIITITDD 402
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 224 bits (571), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 143/401 (35%), Positives = 216/401 (53%), Gaps = 21/401 (5%)
Query: 18 GSDAVSFSS-KLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTR 76
G DA + S + HRFS + RW+ G ++ WP S Y+ L +D R
Sbjct: 23 GGDASTAPSLEFHHRFSAPLR-RWVEARGR-ALPGGWPAPGSAAYVAALAGHDRHRA--- 77
Query: 77 VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
V ++S L F +EG+ T N +LHY + +GTP +F+VALD GS+L W+
Sbjct: 78 VSAAGGSSSDAPPLTF-AEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWL 135
Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIA 196
PCQC C P + T+ + + Y P SS+SK V C+ C + C + CPY
Sbjct: 136 PCQCDGCTPPA----TAASGSATFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKM 190
Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
Y + TSSSG+LV+D+L+L++ + H PQ +++ +++GCG+ QTGS+LD AAP+G+ GL
Sbjct: 191 VYVSAGTSSSGFLVEDVLYLSTENAH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGL 248
Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
G+ +VSVPS+LA+ GL NSFS+CF + G + FGDQ + Q+ T L I ++ Y +
Sbjct: 249 GIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAI 307
Query: 317 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWK 375
+ +GN T F + D+G SFT+L Y + F V + R + ++
Sbjct: 308 TISGITVGNK-PTDMDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFE 366
Query: 376 YCYNASSEEMLKVPDMRLIFSKNQSFVVRN--HIFSFPENE 414
YCY+ SE +PD+ L F V + + S E+E
Sbjct: 367 YCYDL-SEARFPIPDIILRTVTGSMFPVIDPGQVISIQEHE 406
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 224 bits (571), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 132/392 (33%), Positives = 208/392 (53%), Gaps = 21/392 (5%)
Query: 23 SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
SF + HRFSD K + D+ P K S EY + D + R+
Sbjct: 38 SFGFDIHHRFSDPVK--------GILGIDNIPDKGSREYYVAMAHRDRVFRGRRLA---- 85
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
+ +Q L T + + F +LH+ + +GTP S+LVALD GS+L W+PC C +
Sbjct: 86 DGGDVDQKLLTFSPDNTTYQISLFGYLHFANVSVGTPASSYLVALDTGSDLFWLPCNCTK 145
Query: 143 CAP-LSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYST 200
C + S + N+ YD SS+SKNV+C+ LC+ ++ C S CPY +Y +
Sbjct: 146 CVHGIQLSTGQKIAFNI--YDNKESSTSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLS 203
Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
E+TS++G+LV+D+LHL + + Q + + GCG+ QTG++LDGAAP+G+ GLG+ D
Sbjct: 204 ENTSTTGFLVEDVLHLITDNDDQTQHA-NPLITFGCGQVQTGAFLDGAAPNGLFGLGMSD 262
Query: 261 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
VSVPS+LAK GL NSFS+CF + G + FGD + Q + I + Y + V
Sbjct: 263 VSVPSILAKQGLTSNSFSMCFAADGLGRITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQ 322
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYC 377
+G + F A+ D+G SFT+L Y ++ FD + +R S + ++YC
Sbjct: 323 IIVGGNSADLE-FNAIFDTGTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNSDDLPFEYC 381
Query: 378 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFS 409
Y+ + + ++VP++ L ++ V + I +
Sbjct: 382 YDLRTNQTIEVPNINLTMKGGDNYFVMDPIIT 413
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 142/403 (35%), Positives = 213/403 (52%), Gaps = 24/403 (5%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
F + HRFSD E I GN + P K + +Y ++ D R+
Sbjct: 39 FGLDIHHRFSDPVTE--ILGIGNDEL---LPHKGTPQYYAAMVHRDRVFHGRRLA----- 88
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
R+ + + G++TH F +LH+ + +GTP + FLVALD GS+L W+PC C C
Sbjct: 89 -DDRDTPITFAAGNETHQIA-AFGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCNCTSC 146
Query: 144 AP-LSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
L +D N+ E D SS+ KNV C+ +CK ++ C S C Y +Y + D
Sbjct: 147 VRGLKTQNGKVIDLNIYELD--KSSTRKNVPCNSNMCK-QTQCHSSGSSCRYEVEYLSND 203
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
TSSSG+LV+D+LHL + + + + + IGCG+ QTG +L+GAAP+G+ GLG+ +VS
Sbjct: 204 TSSSGFLVEDVLHL--ITDNDQTKDIDTQITIGCGQVQTGVFLNGAAPNGLFGLGMENVS 261
Query: 263 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC 322
VPS+LA+ GLI +SFS+CF + SG + FGD G + Q T F + E + Y V +
Sbjct: 262 VPSILAQKGLISDSFSMCFGSDGSGRITFGDTGSSDQGKTPF-NLRESHPTYNVTITQII 320
Query: 323 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-LQGNS---WKYCY 378
+G F A+ DSG SFT+L Y + KF+ LV + R S L +S ++YCY
Sbjct: 321 VGGYAADHE-FHAIFDSGTSFTYLNDPAYTLISEKFNSLVKANRHSPLSPDSDLPFEYCY 379
Query: 379 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACF 421
+ S ++ ++VP + L + V + I G+ C
Sbjct: 380 DMSPDQTIEVPFLNLTMKGGDDYYVTDPIVPVSSEVEGNLLCL 422
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 142/392 (36%), Positives = 211/392 (53%), Gaps = 33/392 (8%)
Query: 28 LVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSR 87
L HRFS K RW G + A WP+ S EY L ++D R RV S
Sbjct: 13 LHHRFSPVVK-RWAESRGRPAAAAWWPE-GSPEYYSALSAHDRAR---RVLAGGKGES-- 65
Query: 88 NQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLS 147
L F S T G+ LHY + +GTPN +F+VALD GS+L WVPC C +CAP++
Sbjct: 66 -LLSFADGNSTTRHAGS----LHYAKVALGTPNATFVVALDTGSDLFWVPCDCKRCAPIA 120
Query: 148 ASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSG 207
+ L Y P SS+SK V+CSH LC ++C + CPY Y + +TSSSG
Sbjct: 121 -----NTSELLKPYSPRQSSTSKPVTCSHSLCDRPNACGNGNGSCPYTVKYVSANTSSSG 175
Query: 208 YLVDDILHLA-------SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
LV+D+L++ S + +V + V+ GCG++QTG++LDGAA +G++GLG+
Sbjct: 176 VLVEDVLYMTRQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGAFLDGAAMEGLLGLGMDR 235
Query: 261 VSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 319
VSVPSLLA AGL+ +SFS+CF + +G + FG+ A Q+ + + + Y + V
Sbjct: 236 VSVPSLLAAAGLVGSDSFSMCFSPDGNGRINFGEPSDAGAQNETPFIVSKTRPTYNISVT 295
Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCY 378
+ + + F A+VDSG SFT+L Y+ + F+ V KR +L + ++YCY
Sbjct: 296 AVNVKGKGAMAAEFAAVVDSGTSFTYLNDPAYSLLATSFNSQVREKRANLSASIPFEYCY 355
Query: 379 NAS-SEEMLKVPDMRL------IFSKNQSFVV 403
S + + +P++ L +F + FV+
Sbjct: 356 ALSRGQTEVLMPEVSLTTRGGAVFPVTRPFVI 387
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 135/386 (34%), Positives = 207/386 (53%), Gaps = 18/386 (4%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
++ + HR S+ ++ S + + P++ +VEY L D L+
Sbjct: 25 YTFTMHHRHSEPVRKWSHSAAAGIPAP---PEEGTVEYYAELADRDRL-------LRGRK 74
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
S + L S+G+ T F + +LHYT + IGTP V F+VALD GS+L WVPC C +C
Sbjct: 75 LSQIDAGLAFSDGNST-FRISSLGFLHYTTVQIGTPGVKFMVALDTGSDLFWVPCDCTRC 133
Query: 144 APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 203
A ++ + S D +L+ Y+P+ SS+SK V+C++ LC RS C CPY+ Y + +T
Sbjct: 134 AASDSTAFAS-DFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFSNCPYMVSYVSAET 192
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
S+SG LV+D+LHL H V+++VI GCG+ Q+GS+LD AAP+G+ GLG+ +SV
Sbjct: 193 STSGILVEDVLHLTQEDNH--HDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISV 250
Query: 264 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 323
PS+L++ G +SFS+CF + G + FGD+G Q T F + + Y + V +
Sbjct: 251 PSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPF-NLNPSHPTYNITVTQVRV 309
Query: 324 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 382
G + + F AL DSG SFT+L Y + F V +R ++YCY+ S
Sbjct: 310 GTTVIDVE-FTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSP 368
Query: 383 EEMLK-VPDMRLIFSKNQSFVVRNHI 407
+ +P + L F V + I
Sbjct: 369 DANTSLIPSVSLTMGGGSHFAVYDPI 394
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 119/304 (39%), Positives = 176/304 (57%), Gaps = 7/304 (2%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
Y LHYT + +GTP F+VALD GS+L WVPC C +CAP S Y S D LS Y P S
Sbjct: 1 YSLHYTTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYAS-DFELSVYSPKKS 59
Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
S+SK V C++ LC R C CPY+ Y + +TS++G L++D+LHL + +KH+
Sbjct: 60 STSKTVPCNNSLCAQRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTENKHS--E 117
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+Q+ + GCG+ Q+GS+LD AAP+G+ GLG+ +SVPS+L++ GL+ NSFS+CF ++
Sbjct: 118 PIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGV 177
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFL 346
G + FGD+G Q+ T F + + + Y + V S +G + L + AL DSG SF++
Sbjct: 178 GRINFGDKGSLEQEETPF-NLNQLHPNYNITVTSIRVGTT-LIDADITALFDSGTSFSYF 235
Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKV-PDMRLIFSKNQSFVVR 404
IY+++ F R ++YCYN S + + P + L F V
Sbjct: 236 TDPIYSKLSASFHAQTRDGRHPPNPRIPFEYCYNMSPDANASLTPGISLTMKGGGPFPVY 295
Query: 405 NHIF 408
+ I
Sbjct: 296 DPII 299
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 142/388 (36%), Positives = 214/388 (55%), Gaps = 21/388 (5%)
Query: 23 SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
+F L HR+SD K G +SV D P+K S+ Y + D KL S+
Sbjct: 40 TFGFDLHHRYSDPVK-------GMLSV-DDLPEKGSLHYYASMAHRDILIHGR--KLVSD 89
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
N S+ L F S G++T+ F + +LHY + IGTP++S+LVALD GS+L W+PC C
Sbjct: 90 NTST--PLTFFS-GNETYRF-SSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPCDCTN 145
Query: 143 CAPLSASYYTSLDR-NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE 201
+ + S ++ + + Y P++SS+S+ + C++ LC +S C S + CPY Y +
Sbjct: 146 SGCVQGLQFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQYLSN 205
Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
TSS+G LV+D+LHL + A ++ + +I GCGR QTGS+LDGAAP+G+ GLG+ ++
Sbjct: 206 GTSSTGVLVEDLLHLTT--DDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNI 263
Query: 262 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 321
SVPS LA+ G NSFS+CF + G + FGD G + Q T F + + + Y V +
Sbjct: 264 SVPSTLAREGYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPF-NLRQLHPTYNVSITKI 322
Query: 322 CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNA 380
+G F A+ DSG SFT+L Y + F+ KR S+ ++YCY
Sbjct: 323 NVGGRDADLE-FSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEM 381
Query: 381 SSEEM-LKVPDMRLIFSKNQSFVVRNHI 407
SS + L++P + L+ F V + I
Sbjct: 382 SSNQTNLEIPTVNLVMQGGSQFNVTDPI 409
>gi|110741881|dbj|BAE98882.1| predicted GPI-anchored protein [Arabidopsis thaliana]
Length = 313
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 102/190 (53%), Positives = 142/190 (74%), Gaps = 3/190 (1%)
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
SSV++ V+IGCG+KQ+G YLDG APDG+MGLG ++SVPS L+KAGL++NSFS+CFDE D
Sbjct: 5 SSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEED 64
Query: 286 SGSVFFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFT 344
SG ++FGD GP+ QQST FL + KY Y VGVE+ CIGNSCL Q+ F +DSG SFT
Sbjct: 65 SGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFT 124
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
+LP EIY +V ++ D+ +++ + +G SW+YCY +S+E KVP ++L FS N +FV+
Sbjct: 125 YLPEEIYRKVALEIDRHINATSKNFEGVSWEYCYESSAEP--KVPAIKLKFSHNNTFVIH 182
Query: 405 NHIFSFPENE 414
+F F +++
Sbjct: 183 KPLFVFQQSQ 192
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 132/404 (32%), Positives = 208/404 (51%), Gaps = 23/404 (5%)
Query: 23 SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
+F + HRFSD+ K G + + D P+K + +Y ++ D + R+
Sbjct: 32 TFGFDIHHRFSDQIK-------GMLGI-DDVPQKGTPQYYAVMAHRDRVFRGRRLA---- 79
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
+ + L + G+ TH + + LH+ + +GTP + FLVALD GS+L W+PC CI
Sbjct: 80 -GADHHSPLTFAAGNDTHQIASSGF-LHFANVSVGTPPLWFLVALDTGSDLFWLPCDCIS 137
Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH-PLCKSRSSCKSLKDPCPYIADYSTE 201
C T + YD SS+S VSC++ C+ R C S C Y DY +
Sbjct: 138 CVHGGLRTRTGKILKFNTYDLDKSSTSNEVSCNNSTFCRQRQQCPSAGSTCRYQVDYLSN 197
Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
DTSS G++V+D+LHL + + + + GCG+ QTG +L+GAAP+G+ GLG+ ++
Sbjct: 198 DTSSRGFVVEDVLHLITDDDQTKDADTR--IAFGCGQVQTGVFLNGAAPNGLFGLGMDNI 255
Query: 262 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 321
SVPS+LA+ GLI NSFS+CF + +G + FGD G Q+ T F + + + Y + +
Sbjct: 256 SVPSILAREGLISNSFSMCFGSDSAGRITFGDTGSPDQRKTPF-NVRKLHPTYNITITKI 314
Query: 322 CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS----WKYC 377
+ +S + F A+ DSG SFT++ Y + ++ V +KR S Q + YC
Sbjct: 315 IVEDS-VADLEFHAIFDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYC 373
Query: 378 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACF 421
Y+ S + ++VP + L + V + I E GD C
Sbjct: 374 YDISISQTIEVPFLNLTMKGGDDYYVMDPIIQVSSEEEGDLLCL 417
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 147/400 (36%), Positives = 221/400 (55%), Gaps = 35/400 (8%)
Query: 28 LVHRFSDEAKERWISKSGNVSVADSWPKKNSV----EYLELLLSNDWKRQKTRVKLQSNN 83
L HR+S +RW + G+ V SWP V EY L +D R Q +
Sbjct: 31 LHHRYS-PIVQRWAEERGHAGV--SWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDG 87
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
L+ ++G+ T LHY + +GTPN +FLVALD GS+L WVPC C QC
Sbjct: 88 ------LVTFADGNITLRLDGS---LHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQC 138
Query: 144 APLSASYYTSLDRN----LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYS 199
APL T++D L +Y PS SS+SK V+C+ LC ++C + CPY Y+
Sbjct: 139 APLGN--LTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYA 196
Query: 200 TEDTSSSGYLVDDILHLA---SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
+TSSSG LV+D+L+L + A ++V++ V+ GCG+ QTGS+LDGAA DG+MGL
Sbjct: 197 MANTSSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGL 256
Query: 257 GLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF 315
G+ VSVPS+LA G+++ NSFS+CF ++ G + FGD G A Q T F+ + + Y
Sbjct: 257 GMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTHSYYN 315
Query: 316 VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-- 373
+ + S +G+ L GF A+ DSG SFT+L Y F+ +S +R + G++
Sbjct: 316 ISITSMSVGDKNLPL-GFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRS 374
Query: 374 ----WKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHIF 408
++YCY+ S ++ +++P + L + F V + ++
Sbjct: 375 GPFPFEYCYSLSPDQTTVELPIVSLTTNGGAVFPVTSPVY 414
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 147/400 (36%), Positives = 221/400 (55%), Gaps = 35/400 (8%)
Query: 28 LVHRFSDEAKERWISKSGNVSVADSWPKKNSV----EYLELLLSNDWKRQKTRVKLQSNN 83
L HR+S +RW + G+ V SWP V EY L +D R Q +
Sbjct: 31 LHHRYS-PIVQRWAEERGHAGV--SWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDG 87
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC 143
L+ ++G+ T LHY + +GTPN +FLVALD GS+L WVPC C QC
Sbjct: 88 ------LVTFADGNITLRLDGS---LHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQC 138
Query: 144 APLSASYYTSLDRN----LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYS 199
APL T++D L +Y PS SS+SK V+C+ LC ++C + CPY Y+
Sbjct: 139 APLGN--LTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYA 196
Query: 200 TEDTSSSGYLVDDILHLA---SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
+TSSSG LV+D+L+L + A ++V++ V+ GCG+ QTGS+LDGAA DG+MGL
Sbjct: 197 MANTSSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGL 256
Query: 257 GLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF 315
G+ VSVPS+LA G+++ NSFS+CF ++ G + FGD G A Q T F+ + + Y
Sbjct: 257 GMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTHSYYN 315
Query: 316 VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-- 373
+ + S +G+ L GF A+ DSG SFT+L Y F+ +S +R + G++
Sbjct: 316 ISITSMSVGDKNLPL-GFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRS 374
Query: 374 ----WKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHIF 408
++YCY+ S ++ +++P + L + F V + ++
Sbjct: 375 GPFPFEYCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVY 414
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 138/404 (34%), Positives = 206/404 (50%), Gaps = 28/404 (6%)
Query: 4 LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
L+ + + F L A SF + HRFSD KE + S + P+K++ Y
Sbjct: 12 LLVLSVFFLAGGLRSGHAASFKFTIHHRFSDSIKEIFGS--------EGLPEKHTPGYYA 63
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
++ D R L + N + + +E + GN L+Y + IGTP + F
Sbjct: 64 AMVHRD--RLLHGRNLATTNGDTPLMFSYGNETYELSGLGN----LYYANVSIGTPGLYF 117
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRN---LSEYDPSSSSSSKNVSCSHPLCK 180
LVALD GS+L W+PC+C +C +Y T D L+ Y ++SS+S V CS LC+
Sbjct: 118 LVALDTGSDLFWLPCECTKCP----TYLTKRDNGKFWLNHYSSNASSTSIRVPCSSSLCE 173
Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
+ C S K CPY Y +E++SS+GYLV DILH+A+ + V V +GCG+ Q
Sbjct: 174 LANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMAT--DDSQLKPVDVKVTLGCGKVQ 231
Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
TG + + AP+G++GLG+G VSVPS LA GL +SFS+CF G + FGD GP Q+
Sbjct: 232 TGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTDSFSMCFGYYGYGRIDFGDIGPVGQR 291
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
T F P Y+ + + I + T A++DSGASFT+L Y+ + D
Sbjct: 292 ETPFNPASLSYNVTILQI----IVTNRPTNVHLTAIIDSGASFTYLTDPFYSIITENMDA 347
Query: 361 LVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
+ +RI + ++YCY S + + P++ + F V
Sbjct: 348 AMELERIKSDSDFPFEYCYRLSLATIFQQPNLNFTMEGGRKFDV 391
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 141/419 (33%), Positives = 217/419 (51%), Gaps = 46/419 (10%)
Query: 20 DAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKL 79
+A F+ + HR+S+ K +W + S + WP+K SVEY L D + R+
Sbjct: 22 NAHIFTFTMHHRYSEPVK-KWSHSAP--SPSHRWPEKGSVEYYAELADRDRFLRGRRL-- 76
Query: 80 QSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ 139
S + L S+G+ T F + +LHYT I++GTP V F+VALD GS+L WVPC
Sbjct: 77 -----SQFDAGLAFSDGNST-FRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCD 130
Query: 140 CIQCAPL---SASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIA 196
C +C+ + + + D +LS Y+P+ SS+SK V+C++ LC R+ C CPY+
Sbjct: 131 CTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMV 190
Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
Y + +TS+SG LV+D+LHL + V+++VI GCG+ Q+GS+LD AAP+G+ GL
Sbjct: 191 SYVSAETSTSGILVEDVLHLTQPDDN--HDLVEANVIFGCGQVQSGSFLDVAAPNGLFGL 248
Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
G+ +SVPS+L++ G +SFS+CF + G + FGD+G Q T F + + Y +
Sbjct: 249 GMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPF-NVNPSHPTYNI 307
Query: 317 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEV---------------------- 354
+ +G + L F AL DSG SFT+L Y+ +
Sbjct: 308 TINQVRVGTT-LIDVEFTALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVT 366
Query: 355 ----VVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNHI 407
+++F V +R + YCY+ S + +P M L FVV + I
Sbjct: 367 IEVFMLQFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPI 425
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 143/410 (34%), Positives = 215/410 (52%), Gaps = 26/410 (6%)
Query: 7 ICMLFGCILLDGSDAV-SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELL 65
I ML +LD + + F + HRFSD+ V D P ++S +Y ++
Sbjct: 15 ILMLVSSWVLDRCEGLGEFGFEFHHRFSDQVV--------GVLPGDGLPNRDSSKYYRVM 66
Query: 66 LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
D + R+ S L+ ++G++T N +LHY + +GTP+ FLV
Sbjct: 67 AHRDRLIRGRRLA------SEDQSLVTFADGNET-IRVNALGFLHYANVTVGTPSDWFLV 119
Query: 126 ALDAGSNLLWVPCQC-IQCA-PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
ALD GS+L W+PC C C L A +SLD N+ Y P++SS+S V C+ LC
Sbjct: 120 ALDTGSDLFWLPCDCSTNCVRELKAPGGSSLDLNI--YSPNASSTSSKVPCNSTLCTRVD 177
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
C S CPY Y + TSS+G LV+D+LHL S K++ +++ + +GCG QTG
Sbjct: 178 RCASPLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEKNS--KPIRARITLGCGLVQTGV 235
Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS 303
+ DGAAP+G+ GLGL D+SVPS+LAK G+ NSFS+CF ++ +G + FGD+G Q+ T
Sbjct: 236 FHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETP 295
Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
L I + + Y V V +G + F A+ D+G SFT+L Y + F+ L
Sbjct: 296 -LNIRQPHPTYNVTVTQISVGGNT-GDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLAL 353
Query: 364 SKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRNHIFSFP 411
KR ++YCY S +++ + PD+ L S+ V + + P
Sbjct: 354 DKRYQTDSELPFEYCYAVSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVP 403
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 142/403 (35%), Positives = 216/403 (53%), Gaps = 28/403 (6%)
Query: 30 HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
HRFSD+ V D P ++S +Y ++ D + R + +N + S
Sbjct: 39 HRFSDQVV--------GVLPGDGLPNRDSSKYYRVMAHRD---RLIRGRRLANEDQS--- 84
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA-PLSA 148
L+ S+G++T + +LHY + +GTP+ FLVALD GS+L W+PC C C L A
Sbjct: 85 LVTFSDGNET-IRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKA 143
Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 208
+SLD N+ Y P++SS+S V C+ LC C S + CPY Y + TSS+G
Sbjct: 144 PGGSSLDLNI--YSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGV 201
Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
LV+D+LHL S K + ++ + V +GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LA
Sbjct: 202 LVEDVLHLVSNDKSS--KAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLA 259
Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSC 327
K G+ NSFS+CF + +G + FGD+G Q+ T L I + + Y + V + GN+
Sbjct: 260 KEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVEGNTG 318
Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNAS-SEE 384
+ F A+ DSG SFT+L Y + F+ L KR + ++YCY S +++
Sbjct: 319 DLE--FDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKD 376
Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLE 427
+ P + L S+ V + + P + D C + +E
Sbjct: 377 SFQYPAVNLTMKGGSSYPVYHPLVVIPMKDT-DVYCLAILKIE 418
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 140/402 (34%), Positives = 213/402 (52%), Gaps = 26/402 (6%)
Query: 30 HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
HRFSD+ V D P ++S +Y ++ D + R + +N + S
Sbjct: 39 HRFSDQVV--------GVLPGDGLPNRDSSKYYRVMAHRD---RLIRGRRLANEDQS--- 84
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA-PLSA 148
L+ S+G++T + +LHY + +GTP+ F+VALD GS+L W+PC C C L A
Sbjct: 85 LVTFSDGNETVRV-DALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKA 143
Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 208
+SLD N+ Y P++SS+S V C+ LC C S + CPY Y + TSS+G
Sbjct: 144 PGGSSLDLNI--YSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGV 201
Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
LV+D+LHL S K + ++ + V GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LA
Sbjct: 202 LVEDVLHLVSNDKSS--KAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLA 259
Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
K G+ NSFS+CF + +G + FGD+G Q+ T L I + + Y + V +G +
Sbjct: 260 KEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNT- 317
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNAS-SEEM 385
F A+ DSG SFT+L Y + F+ L KR + ++YCY S +++
Sbjct: 318 GDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDS 377
Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLE 427
+ P + L S+ V + + P + D C + +E
Sbjct: 378 FQYPAVNLTMKGGSSYPVYHPLVVIPMKDT-DVYCLAIMKIE 418
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 135/366 (36%), Positives = 198/366 (54%), Gaps = 21/366 (5%)
Query: 30 HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
HRFS + RW G+ + WP Y+ L +D R + R
Sbjct: 29 HRFSARVR-RWADSRGH-ELPGGWPSPGGFAYVAALAGHDRHRALSAA-------GGRPP 79
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
L F SEG+ T N +LHY + +GTP +F+VALD GS+L W+PCQC C +
Sbjct: 80 LTF-SEGNATLKVSN-LGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGC---TPP 134
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
++ S Y PS SS+S+ V C+ C R C S CPY Y + DTSSSG+L
Sbjct: 135 PSSAASAPASFYIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSGFL 193
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
V+D+L+L++ H PQ +++ ++ GCG QTGS+LD AAP+G+ GLG+ +SVPS+LA+
Sbjct: 194 VEDVLYLSTEDTH-PQF-LKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQ 251
Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
GL NSFS+CF + G + FGDQG + Q+ T L I +K+ Y + + +GN+ L
Sbjct: 252 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LM 309
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
+ D+G SFT+L Y + F V + R + ++YCY+ +SSE ++
Sbjct: 310 DLEVSTIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQ 369
Query: 388 VPDMRL 393
P + L
Sbjct: 370 TPSISL 375
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 135/366 (36%), Positives = 198/366 (54%), Gaps = 21/366 (5%)
Query: 30 HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
HRFS + RW G+ + WP Y+ L +D R + R
Sbjct: 29 HRFSARVR-RWADSRGH-ELPGGWPSPGGFAYVAALAGHDRHRALSAA-------GGRPP 79
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
L F SEG+ T N +LHY + +GTP +F+VALD GS+L W+PCQC C +
Sbjct: 80 LTF-SEGNATLKVSN-LGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGC---TPP 134
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
++ S Y PS SS+S+ V C+ C R C S CPY Y + DTSSSG+L
Sbjct: 135 PSSAASAPASFYIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSGFL 193
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
V+D+L+L++ H PQ +++ ++ GCG QTGS+LD AAP+G+ GLG+ +SVPS+LA+
Sbjct: 194 VEDVLYLSTEDTH-PQF-LKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQ 251
Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
GL NSFS+CF + G + FGDQG + Q+ T L I +K+ Y + + +GN+ L
Sbjct: 252 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LM 309
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLK 387
+ D+G SFT+L Y + F V + R + ++YCY+ +SSE ++
Sbjct: 310 DLEVSTIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQ 369
Query: 388 VPDMRL 393
P + L
Sbjct: 370 TPSISL 375
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 140/405 (34%), Positives = 206/405 (50%), Gaps = 35/405 (8%)
Query: 17 DGSDAVSFSSKLVHRFSDEAKERWI--SKSGNVSV-ADSW------PKKNSVEYLELLLS 67
+ S + F+ L HRFS ++ W+ ++ G V SW P S EY LL
Sbjct: 25 EASGGIGFN--LHHRFSPVVRQ-WMVDARGGGHGVPGSSWLLPEEAPAVGSPEYYSALLR 81
Query: 68 NDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVAL 127
+D R L S + L F ++G+ T + + +LHY +++GTP+ FLVAL
Sbjct: 82 HDRALFTRRRGLASAADGQSTTLTF-ADGNATRL--DTYEYLHYAEVEVGTPSSKFLVAL 138
Query: 128 DAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKS 187
D GS+L W+PC+C CA + Y PS SS+SK V C HPLC+ +C +
Sbjct: 139 DTGSDLFWLPCECKLCA----------KNGSTMYSPSLSSTSKTVPCGHPLCERPDACAT 188
Query: 188 L---KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
CPY Y + +T SSG LV+D+LHL +VQ+ ++ GCG+ QTG++
Sbjct: 189 AGKSSSSCPYEVKYVSANTGSSGVLVEDVLHLVDGGGGGGGKAVQAPIVFGCGQVQTGAF 248
Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTS 303
L GAA G+MGLGL VSVPS LA +GL+ +SFS+CF + G + FGD G Q T
Sbjct: 249 LRGAAAGGLMGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGRINFGDAGSPDQAETP 308
Query: 304 FLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
+ G +Y+ + V + + + + F A+VDSG SFT+L Y + F+ V
Sbjct: 309 LIAAGSLQPSYYNISVGAITVDSKAMAVE-FTAVVDSGTSFTYLDDPAYTFLTTNFNSRV 367
Query: 363 S--SKRISLQGNSWKYCYNASSEE--MLKVPDMRLIFSKNQSFVV 403
S S+ +++CY S + M ++P M L F +
Sbjct: 368 SEASETYGSGYEKFEFCYRLSPGQTSMKRLPAMSLTTKGGAVFPI 412
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 115/302 (38%), Positives = 171/302 (56%), Gaps = 8/302 (2%)
Query: 108 WLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+L+Y + +GTP V +LVALD GS+L W+PC C+ C ++ T N + Y P++SS
Sbjct: 128 FLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNC--ITGLNTTQGPVNFNIYSPNNSS 185
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+SK V CS LC C S D CPY Y +++TSS+GYLV+DILHL +
Sbjct: 186 TSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTT--NDVQSKP 243
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
V + + +GCG+ Q+G++L AAP+G+ GLG+ +VSVPS+LA AGLI NSFS+CF G
Sbjct: 244 VNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMG 303
Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
+ FGD+G Q T F +G ++ Y V + +G ++ + DSG SFT+L
Sbjct: 304 RIEFGDKGSPGQNETPF-NLGRRHPTYNVSITQIGVGGH-ISDLDVAVIFDSGTSFTYLN 361
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRN 405
Y+ KF +V K+ ++ + ++ CY S ++ P M L FV+ +
Sbjct: 362 DPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMKGGGHFVINH 421
Query: 406 HI 407
I
Sbjct: 422 PI 423
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 115/302 (38%), Positives = 171/302 (56%), Gaps = 8/302 (2%)
Query: 108 WLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+L+Y + +GTP V +LVALD GS+L W+PC C+ C ++ T N + Y P++SS
Sbjct: 105 FLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNC--ITGLNTTQGPVNFNIYSPNNSS 162
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+SK V CS LC C S D CPY Y +++TSS+GYLV+DILHL +
Sbjct: 163 TSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTT--NDVQSKP 220
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
V + + +GCG+ Q+G++L AAP+G+ GLG+ +VSVPS+LA AGLI NSFS+CF G
Sbjct: 221 VNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMG 280
Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
+ FGD+G Q T F +G ++ Y V + +G ++ + DSG SFT+L
Sbjct: 281 RIEFGDKGSPGQNETPF-NLGRRHPTYNVSITQIGVGGH-ISDLDVAVIFDSGTSFTYLN 338
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRN 405
Y+ KF +V K+ ++ + ++ CY S ++ P M L FV+ +
Sbjct: 339 DPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMKGGGHFVINH 398
Query: 406 HI 407
I
Sbjct: 399 PI 400
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 133/376 (35%), Positives = 200/376 (53%), Gaps = 23/376 (6%)
Query: 16 LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKT 75
+DG S + HRFS + W G+ + WP Y+ L +D R
Sbjct: 20 VDGRRRAPPSLEFHHRFSARLRG-WADARGH-ELPGGWPPPGGAAYVAALAGHDRHRALA 77
Query: 76 RVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLW 135
++ + L SEG+ T N +LHY + +GTP +F+VALD GS+L W
Sbjct: 78 ---------AADHPPLTFSEGNATLKVSN-LGFLHYALVTVGTPGHTFMVALDTGSDLFW 127
Query: 136 VPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYI 195
+PCQC C P ++ S S Y PS SS+S+ V C+ C R C S CPY
Sbjct: 128 LPCQCDGCPPPASGASGSA----SFYIPSMSSTSQAVPCNSDFCDHRKDC-STTSSCPYK 182
Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMG 255
Y + DTSSSG+LV+D+L+L++ H PQ +++ ++ GCG+ QTGS+LD AAP+G+ G
Sbjct: 183 MVYVSADTSSSGFLVEDVLYLSTEDNH-PQI-LKAQIMFGCGQVQTGSFLDAAAPNGLFG 240
Query: 256 LGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF 315
LG+ +SVPS+LA GL +SFS+CF + G + FGDQG + Q+ T L I +K+ Y
Sbjct: 241 LGIDMISVPSILAHKGLTSDSFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYA 299
Query: 316 VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SW 374
+ + +G + F + D+G +FT+L Y + F V + R + +
Sbjct: 300 ITITGITVGTEPMDLE-FSTIFDTGTTFTYLADPAYTYITQSFHTQVRANRHAADTRIPF 358
Query: 375 KYCYN-ASSEEMLKVP 389
+YCY+ +SSE ++ P
Sbjct: 359 EYCYDLSSSEARIQTP 374
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 132/391 (33%), Positives = 204/391 (52%), Gaps = 24/391 (6%)
Query: 23 SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
SF + HRFSD KE + V D P K + Y ++ D + R L +
Sbjct: 29 SFGFDIHHRFSDPVKEI-------LGVHD-LPDKGTRLYYVVMAHRDRIFRGRR--LAAA 78
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
+ S + +E Q FG +LH+ + +GTP +SFLVALD GS+L W+PC C +
Sbjct: 79 VHHSPLTFVPANETYQIGAFG----FLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTK 134
Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
C S + N+ YD SS+S+ V C+ LC+ + C S CPY +Y +
Sbjct: 135 CVRGVESNGEKIAFNI--YDLKGSSTSQTVLCNSNLCELQRQCPSSDSICPYEVNYLSNG 192
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
TS++G+LV+D+LHL + + + + GCG+ QTG++LDGAAP+G+ GLG+G+ S
Sbjct: 193 TSTTGFLVEDVLHLITDDDETKDADTR--ITFGCGQVQTGAFLDGAAPNGLFGLGMGNES 250
Query: 263 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC 322
VPS+LAK GL NSFS+CF + G + FGD Q T F + + Y + V
Sbjct: 251 VPSILAKEGLTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQII 309
Query: 323 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYN 379
+G + F A+ DSG SFT L Y ++ F+ + +R S + ++YCY+
Sbjct: 310 VGGNAADLE-FHAIFDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDELPFEYCYD 368
Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
SS + +++P + L +++V + I +
Sbjct: 369 LSSNKTVELP-INLTMKGGDNYLVTDPIVTI 398
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 136/386 (35%), Positives = 207/386 (53%), Gaps = 38/386 (9%)
Query: 1 MVNLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVE 60
M+ ++++ +L G L DA SF + HRFSD K + S + P+K++
Sbjct: 11 MLLVLSVFILAGS--LRSGDAASFKFDIHHRFSDSIKGIFHS--------EGLPEKHTPG 60
Query: 61 YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPN 120
Y ++ D + R+ + QL F + G+ T F + +L+Y + +GTP+
Sbjct: 61 YYATMVHRDRLVRGRRLAASDVDT----QLTF-AYGNDTAFIPD-LGFLYYANVSVGTPS 114
Query: 121 VSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRN------LSEYDPSSSSSSKNVSC 174
+ FLVALD GS+L W+PC+C C +T L+ + L+ Y P+ S++S V C
Sbjct: 115 LDFLVALDTGSDLFWLPCECSSC-------FTYLNTSNGGKFMLNHYSPNDSTTSSTVPC 167
Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
+ LC + C S ++ CPY Y + +TSS GYLV+D+LHLA+ + V++ +
Sbjct: 168 TSSLC---NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLAT--DDSLLKPVEAKITF 222
Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ 294
GCG QTG + AAP+G++GLG+ +SVPS LA GL NSFS+CF + G + FGD
Sbjct: 223 GCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGADGYGRIDFGDT 282
Query: 295 GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEV 354
GPA Q+ T F + E Y +Y V +G F A+ DSG SFT+L Y+ +
Sbjct: 283 GPADQKQTPFNTMLE-YQSYNVTFNVINVGGEP-NDVPFTAIFDSGTSFTYLTEPAYSTI 340
Query: 355 VVKFDKLVSSKRISLQGNS--WKYCY 378
+ D + KR SL G + ++YCY
Sbjct: 341 TKQMDAGMKLKRYSLFGPNFPFEYCY 366
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 118/310 (38%), Positives = 179/310 (57%), Gaps = 13/310 (4%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
LHY + +GTP +F+VALD GS+L W+PCQC C P + T+ + + Y P SS+
Sbjct: 6 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA----TAASGSATFYIPGMSST 61
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
SK V C+ C + C + CPY Y + TSSSG+LV+D+L+L++ + H PQ +
Sbjct: 62 SKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTENAH-PQI-L 118
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL NSFS+CF + G
Sbjct: 119 KAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGR 178
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPT 348
+ FGDQ + Q+ T L I ++ Y + + +GN T F + D+G SFT+L
Sbjct: 179 ISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNK-PTDMDFITIFDTGTSFTYLAD 236
Query: 349 EIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKNQSFVVRN- 405
Y + F V + R + ++YCY+ +SSE +PD+ L F V +
Sbjct: 237 PAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVTGSMFPVIDP 296
Query: 406 -HIFSFPENE 414
+ S E+E
Sbjct: 297 GQVISIQEHE 306
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 114/278 (41%), Positives = 157/278 (56%), Gaps = 7/278 (2%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
Y LHY + +GTP+VSFLVALD GSNLLW+PC C C S ++D N+ Y P++S
Sbjct: 59 YILHYANVSVGTPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTVDLNI--YSPNTS 116
Query: 167 SSSKNVSCSHPLC--KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
S+S+ V C+ LC R C S + CPY Y + TS++GY+V D+LHL S +
Sbjct: 117 STSEKVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHL--ISDDSQ 174
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+V + + GCG+ QTGS+L G AP+G+ GLG+ ++SVPS LA G SFS+CF N
Sbjct: 175 SKAVDAKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFSPN 234
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFT 344
G + FGD+G Q TSF + Y + + IG + + A+ DSG SFT
Sbjct: 235 GIGRISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQA-SDLVYSAIFDSGTSFT 293
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
+L Y + F+KLV R S + YCY+ S
Sbjct: 294 YLNDPAYTLIAESFNKLVKETRRSSTQVPFDYCYDIRS 331
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/327 (37%), Positives = 185/327 (56%), Gaps = 26/327 (7%)
Query: 30 HRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ 89
HR+S +E W P + EY L +D +R+ + +
Sbjct: 28 HRYSATVRE-WAGHRA--------PPAGTAEYYAALAGHDLRRRSL---------AGGGE 69
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
+ F ++G+ T+ N+ +LHY + +GTPNV+FLVALD GS+L WVPC CI CAPL +
Sbjct: 70 VAF-ADGNDTYRL-NELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAPLVSP 127
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
Y D Y P SS+S+ V CS LC +S+C+S CPY Y +++TSS+G L
Sbjct: 128 NYR--DLKFDTYSPQKSSTSRKVPCSSNLCDEQSACRSASSSCPYSIQYLSDNTSSTGVL 185
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
V+D+L+L + P+ V + + GCGR QTGS+L AAP+G++GLG+ +SVPSLLA
Sbjct: 186 VEDVLYLVTEYGRQPK-IVTAPITFGCGRTQTGSFLGTAAPNGLLGLGMDTISVPSLLAS 244
Query: 270 AGL-IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
G+ NSFS+CF ++ G + FGD G + QQ T L + ++ Y + + +G+ +
Sbjct: 245 QGVAAANSFSMCFAQDGHGRINFGDTGSSDQQETP-LNMYKQNPYYNISITGATVGSKSI 303
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVV 355
+ F A+VDSG SFT L +Y ++
Sbjct: 304 -HTKFNAIVDSGTSFTALSDPMYTQIT 329
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 121/332 (36%), Positives = 179/332 (53%), Gaps = 20/332 (6%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA-PLSASYYTSLDRNLSEYDPSSSS 167
LHY + +GTP+ F+VALD GS+L W+PC C C L A +SLD N+ Y P++SS
Sbjct: 54 LHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNI--YSPNASS 111
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+S V C+ LC C S + CPY Y + TSS+G LV+D+LHL S K + +
Sbjct: 112 TSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSS--KA 169
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
+ + V GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LAK G+ NSFS+CF + +G
Sbjct: 170 IPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAG 229
Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
+ FGD+G Q+ T L I + + Y + V +G + F A+ DSG SFT+L
Sbjct: 230 RISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNT-GDLEFDAVFDSGTSFTYLT 287
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNS--WKYCY----------NASSEEMLKVPDMRLIF 395
Y + F+ L KR + ++YCY + +++ + P + L
Sbjct: 288 DAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDSFQYPAVNLTM 347
Query: 396 SKNQSFVVRNHIFSFPENEVGDHACFSYFTLE 427
S+ V + + P + D C + +E
Sbjct: 348 KGGSSYPVYHPLVVIPMKDT-DVYCLAIMKIE 378
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 128/391 (32%), Positives = 202/391 (51%), Gaps = 26/391 (6%)
Query: 23 SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
SF + HRFSD KE + V D P K + +Y + D + R+ +
Sbjct: 29 SFGFDIHHRFSDPVKEI-------LGVHD-LPDKGTRQYYVAMAHRDRIFRGRRLAAGYH 80
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
S + +E Q FG +LH+ + +GTP +SFLVALD GS+L W+PC C +
Sbjct: 81 ---SPLTFIPSNETYQIEAFG----FLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTK 133
Query: 143 CAP-LSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE 201
C + S + N+ YD SS+S+ V C+ LC+ + C S CPY +Y +
Sbjct: 134 CVHGIGLSNGEKIAFNI--YDLKGSSTSQPVLCNSSLCELQRQCPSSDTICPYEVNYLSN 191
Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
TS++G+LV+D+LHL + + + + GCG+ QTG++LDGAAP+G+ GLG+ +
Sbjct: 192 GTSTTGFLVEDVLHLITDDDKTKDADTR--ITFGCGQVQTGAFLDGAAPNGLFGLGMSNE 249
Query: 262 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 321
SVPS+LAK GL NSFS+CF + G + FGD Q T F + + Y + V
Sbjct: 250 SVPSILAKEGLTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQI 308
Query: 322 CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCY 378
+G + F A+ DSG SFT+L Y ++ F+ + +R S ++ ++YCY
Sbjct: 309 IVGEK-VDDLEFHAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYCY 367
Query: 379 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFS 409
S + +++ + L +++V + I +
Sbjct: 368 ELSPNQTVEL-SINLTMKGGDNYLVTDPIVT 397
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 102/264 (38%), Positives = 158/264 (59%), Gaps = 10/264 (3%)
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
+VALD GS+L WVPC C +CAP + Y S + LS Y+P S+++K V+C++ LC R+
Sbjct: 1 MVALDTGSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRN 59
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
C CPY+ Y + TS+SG L++D++HL + K+ + V++ V GCG+ Q+GS
Sbjct: 60 QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGS 117
Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS 303
+LD AAP+G+ GLG+ +SVPS+LA+ GL+ +SFS+CF + G + FGD+G + Q+ T
Sbjct: 118 FLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETP 177
Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
F + + Y + V +G + L F AL D+G SFT+L +Y V +
Sbjct: 178 F-NLNPSHPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTV----SESAQ 231
Query: 364 SKRISLQGN-SWKYCYNASSEEML 386
KR S ++YCY+ + +L
Sbjct: 232 DKRHSPDSRIPFEYCYDMREKLVL 255
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 134/392 (34%), Positives = 198/392 (50%), Gaps = 26/392 (6%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
FS ++ H FSD K+ ++ + D P+K S+EY ++L D R L SNN
Sbjct: 29 FSFEVHHMFSDRVKQ-------SLGLDDLVPEKGSLEYFKVLAQRD--RLIRGRGLASNN 79
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
+ + + G +LHY + +GTP FLVALD GS+L W+PC C
Sbjct: 80 EETPITFMRGNRTISIDLLG----FLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGST 135
Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
C S R L+ Y P++SS+S ++ CS C S C S CPY Y ++D
Sbjct: 136 CIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKD 195
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
T ++G L +D+LHL + + V++++ +GCG+ QTG AA +G++GLGL D S
Sbjct: 196 TFTTGTLFEDVLHLVT--EDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYS 253
Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
VPS+LAKA + NSFS+CF + G + FGD+G Q T LP E Y V V
Sbjct: 254 VPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPT-EPSPTYAVSVTE 312
Query: 321 YCIGNSCLTQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYC 377
+G + G Q AL D+G SFT L Y + FD V+ KR + +++C
Sbjct: 313 VSVGGDAV---GVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFC 369
Query: 378 YNAS-SEEMLKVPDMRLIFSKNQSFVVRNHIF 408
Y+ S ++ + P + + F +RN +F
Sbjct: 370 YDLSPNKTTILFPRVAMTFEGGSQMFLRNPLF 401
>gi|413924529|gb|AFW64461.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
Length = 217
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 98/211 (46%), Positives = 130/211 (61%), Gaps = 21/211 (9%)
Query: 25 SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
SS++VHR SDEA+ + G WP++ S EY L+ +D +RQK R+ + S
Sbjct: 28 SSRMVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL-- 79
Query: 85 SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
S+G T GN WL+Y W+D+GTP SFLVALD GS+L WVPC CIQCA
Sbjct: 80 ---------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCA 130
Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS 204
PLS Y +LDR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+
Sbjct: 131 PLSG-YRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTT 189
Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
SSG L++D LHL H P V +SVIIG
Sbjct: 190 SSGLLIEDTLHLNYREDHVP---VNASVIIG 217
>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 530
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 133/390 (34%), Positives = 195/390 (50%), Gaps = 22/390 (5%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
FS ++ H FSD K+ + + D P+K S+EY ++L D R L SNN
Sbjct: 30 FSFEVHHMFSDRVKQ-------TLGLDDLVPEKGSLEYFKVLAQRD--RLIRGRGLASNN 80
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
+ + + F G +LHY + +GTP FLVALD GSNL W+PC C
Sbjct: 81 EETPITFMRGNRTVSIDFLG----FLHYANVSVGTPATWFLVALDTGSNLFWLPCNCGST 136
Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
C S R L+ Y P++SS+S ++ C+ C S C S CPY Y ++D
Sbjct: 137 CIRDLKDIGLSQSRPLNLYSPNTSSTSSSIRCNDDRCFGSSQCSSPASSCPYQIQYLSKD 196
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
T ++G L +D+LHL + + V++++ +GCGR QTG AA +G++GLG+ D S
Sbjct: 197 TFTTGTLFEDVLHLVT--EDVDLKPVKANITLGCGRNQTGFLQSSAAINGLLGLGMKDYS 254
Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
VPS+LAKA + NSFS+CF + G + FGD+G Q T LP E Y V V
Sbjct: 255 VPSILAKAKITANSFSMCFGNIIDVIGRISFGDKGYTDQMETPLLPT-EPSPTYAVNVTE 313
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN 379
+G + AL D+G SFT L Y + FD V+ KR + +++CY+
Sbjct: 314 VSVGGDVVGVQ-LLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPEIPFEFCYD 372
Query: 380 AS-SEEMLKVPDMRLIFSKNQSFVVRNHIF 408
S + + P + + F +RN +F
Sbjct: 373 LSPNSTTILFPRVAMTFEGGSLMFLRNPLF 402
>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
Length = 519
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 131/390 (33%), Positives = 196/390 (50%), Gaps = 28/390 (7%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
FS ++ H FSD K+ ++ + D P+K S+EY ++L D R L SNN
Sbjct: 29 FSFEVHHMFSDRVKQ-------SLGLDDLVPEKGSLEYFKVLAQRD--RLIRGRGLASNN 79
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
+ + + G +LHY + +GTP FLVALD GS+L W+PC C
Sbjct: 80 EETPITFMRGNRTISIDLLG----FLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGST 135
Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
C S R L+ Y P++SS+S ++ CS C S C S CPY Y ++D
Sbjct: 136 CIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKD 195
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
T ++G L +D+LHL + + V++++ +GCG+ QTG AA +G++GLGL D S
Sbjct: 196 TFTTGTLFEDVLHLVT--EDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYS 253
Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
VPS+LAKA + NSFS+CF + G + FGD+G Q T LP VG ++
Sbjct: 254 VPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSVTEVSVGGDA 313
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN 379
+G L AL D+G SFT L Y + FD V+ KR + +++CY+
Sbjct: 314 --VGVQLL------ALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYD 365
Query: 380 AS-SEEMLKVPDMRLIFSKNQSFVVRNHIF 408
S ++ + P + + F +RN +F
Sbjct: 366 LSPNKTTILFPRVAMTFEGGSQMFLRNPLF 395
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 127/421 (30%), Positives = 206/421 (48%), Gaps = 45/421 (10%)
Query: 11 FGCIL---LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLS 67
F CI+ L S + S S ++ HRFS++ K V P+ S++Y + L+
Sbjct: 16 FLCIMSLGLASSVSGSLSFEIHHRFSEQVK--------TVLGGHGLPEMGSLDYYKALVH 67
Query: 68 NDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQ------FYWLHYTWIDIGTPNV 121
D R +L SNNN + + + + F +LHY + IGTP
Sbjct: 68 RDRGR-----RLTSNNNQTTISFAQGNSTEEISLYDQNLAPPLFFNYLHYANVTIGTPAQ 122
Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSA------SYYTSLDRNLSEYDPSSSSSSKNVSCS 175
FLVALD GS+L W+PC C S ++ + L+ Y+PS S+SS V+C+
Sbjct: 123 WFLVALDTGSDLFWLPCNCNSTCVRSMETDQGETHMNAQRIRLNIYNPSISTSSSKVTCN 182
Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
LC R+ C S CPY Y + + S+G LV+D++H+++ A + + G
Sbjct: 183 STLCALRNRCISPLSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEARDARIT----FG 238
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
C Q G + + A +G+MGL + D++VP++L KAG+ +SFS+CF N G++ FGD+G
Sbjct: 239 CSETQLGLFQE-VAVNGIMGLAMADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKG 297
Query: 296 PATQQSTSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAE 353
+ Q T P+G F V + + +G + ++ F A+ DSG + T+L Y
Sbjct: 298 SSDQHET---PLGGTISPLFYDVSITKFKVGKVTV-ETKFSAIFDSGTAVTWLLDPYYTA 353
Query: 354 VVVKFDKLVSSKRISLQGNS-WKYCY---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFS 409
+ F V +R+ +S +++CY + S EE K+P + ++ V + I
Sbjct: 354 LTTNFHLSVPDRRLPANVDSTFEFCYIITSTSDEE--KLPSISFEMKGGAAYDVFSPILV 411
Query: 410 F 410
F
Sbjct: 412 F 412
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 125/391 (31%), Positives = 195/391 (49%), Gaps = 23/391 (5%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
F ++ H FSD K+ ++ + D P++ S+EY ++L D R L SNN
Sbjct: 29 FGFEVHHIFSDAVKQ-------SLGLDDLVPEQGSLEYFKVLAHRD--RLIRGRGLASNN 79
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
+ +G L+Y + +GTP SFLVALD GS+L W+PC C
Sbjct: 80 EDTPVTF----DGGNLTVSIKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT 135
Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
C L+ Y P++S++S ++ CS C C S K CPY YS
Sbjct: 136 CIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPKSICPYQISYSNS- 194
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
T ++G L+ D+LHLA+ ++ + V+++V +GCG+KQTG + + +GV+GLG+ S
Sbjct: 195 TGTTGTLLQDVLHLATEDENL--TPVKTNVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252
Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
VPSLLAKA + +SFS+CF + G + FGD+G Q+ T F+ + AY + V
Sbjct: 253 VPSLLAKANITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFISVAPS-TAYGLNVTG 311
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN 379
+G + F A D+G+SFT L Y + FD LV KR + +++CY+
Sbjct: 312 VSVGGDPVGTRLF-AKFDTGSSFTHLMEPAYGVLTKSFDDLVEDKRRPVDPELPFEFCYD 370
Query: 380 ASSEEM-LKVPDMRLIFSKNQSFVVRNHIFS 409
S ++ P + + F ++ N F+
Sbjct: 371 LSPNATSIEFPFVEMTFVGGSKIILNNPFFT 401
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 124/391 (31%), Positives = 197/391 (50%), Gaps = 23/391 (5%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
F ++ H FSD K+ ++ + D P++ S+EY ++L D R L SNN
Sbjct: 29 FGFEVHHIFSDSVKQ-------SLGLGDLVPEQGSLEYFKVLAHRD--RLIRGRGLASNN 79
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
+ + + F +G L+Y + +GTP SFLVALD GS+L W+PC C
Sbjct: 80 DET--PITF--DGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT 135
Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
C L+ Y P++S++S ++ CS C C S CPY YS
Sbjct: 136 CIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISYS-NS 194
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
T + G L+ D+LHLA+ ++ + V+++V +GCG+KQTG + + +GV+GLG+ S
Sbjct: 195 TGTKGTLLQDVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252
Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVES 320
VPSLLAKA + NSFS+CF + G + FGD+G Q+ T F+ + AY V +
Sbjct: 253 VPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPS-TAYGVNISG 311
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN 379
+ + F A D+G+SFT L Y + FD+LV +R + +++CY+
Sbjct: 312 VSVAGDPVDIRLF-AKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYD 370
Query: 380 AS-SEEMLKVPDMRLIFSKNQSFVVRNHIFS 409
S + ++ P + + F ++ N F+
Sbjct: 371 LSPNATTIQFPLVEMTFIGGSKIILNNPFFT 401
>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 530
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 127/368 (34%), Positives = 187/368 (50%), Gaps = 26/368 (7%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
FS ++ H FSD K+ + D P+ S+EY ++L D R L SNN
Sbjct: 30 FSFEVHHMFSDVVKQ-------TLGFDDLVPENGSLEYFKVLAHRD--RFIRGRGLASNN 80
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
+ S GS N +LHY + +GTP FLVALD GS+L W+PC C
Sbjct: 81 EETP----LTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTT 136
Query: 143 CAP--LSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
C A + S+ NL Y P++S++S ++ CS C C S + CPY S+
Sbjct: 137 CIHDLKDARFSESVPLNL--YTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQIALSS 194
Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
+T ++G L+ D+LHL + + V ++V +GCG+ QTG++ A +GV+GL + +
Sbjct: 195 -NTVTTGTLLQDVLHLVT--EDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKE 251
Query: 261 VSVPSLLAKAGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV 318
SVPSLLAKA + NSFS+CF S G + FGD+G Q+ T + + E AY V V
Sbjct: 252 YSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSL-ETSTAYGVNV 310
Query: 319 ESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYC 377
+G + F AL D+G+SFT L Y FD L+ KR + + +++C
Sbjct: 311 TGVSVGGVPVDVPLF-ALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEFC 369
Query: 378 YNASSEEM 385
Y+ E +
Sbjct: 370 YDLREEHL 377
>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
Length = 518
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 127/368 (34%), Positives = 187/368 (50%), Gaps = 26/368 (7%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
FS ++ H FSD K+ + D P+ S+EY ++L D R L SNN
Sbjct: 18 FSFEVHHMFSDVVKQ-------TLGFDDLVPENGSLEYFKVLAHRD--RFIRGRGLASNN 68
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
+ S GS N +LHY + +GTP FLVALD GS+L W+PC C
Sbjct: 69 EETP----LTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTT 124
Query: 143 CAP--LSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 200
C A + S+ NL Y P++S++S ++ CS C C S + CPY S+
Sbjct: 125 CIHDLKDARFSESVPLNL--YTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQIALSS 182
Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
+T ++G L+ D+LHL + + V ++V +GCG+ QTG++ A +GV+GL + +
Sbjct: 183 -NTVTTGTLLQDVLHLVT--EDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKE 239
Query: 261 VSVPSLLAKAGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV 318
SVPSLLAKA + NSFS+CF S G + FGD+G Q+ T + + E AY V V
Sbjct: 240 YSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSL-ETSTAYGVNV 298
Query: 319 ESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYC 377
+G + F AL D+G+SFT L Y FD L+ KR + + +++C
Sbjct: 299 TGVSVGGVPVDVPLF-ALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEFC 357
Query: 378 YNASSEEM 385
Y+ E +
Sbjct: 358 YDLREEHL 365
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 121/392 (30%), Positives = 197/392 (50%), Gaps = 32/392 (8%)
Query: 23 SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
S S ++ HRFS++ K V P+ S++Y + L+ D RQ T +
Sbjct: 21 SLSFEIHHRFSEQVK--------TVLGGHGLPEMGSLDYYKALVHRDRGRQLT------S 66
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
NN+++ + F ++G+ T + +LHY + IGTP FLVALD GS+L W+PC C
Sbjct: 67 NNNNQTTISF-AQGNSTE----EISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNS 121
Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
S L+ Y+PS S SS V+C+ LC R+ C S CPY Y +
Sbjct: 122 TCVRSMETDQGERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPG 181
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
+ S+G LV+D++H+++ A + + GC Q G + + A +G+MGL + D++
Sbjct: 182 SKSTGVLVEDVIHMSTEEGEARDARIT----FGCSESQLGLFKE-VAVNGIMGLAIADIA 236
Query: 263 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF--VGVES 320
VP++L KAG+ +SFS+CF N G++ FGD+G + Q T P+ F V +
Sbjct: 237 VPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQLET---PLSGTISPMFYDVSITK 293
Query: 321 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCY- 378
+ +G + + F A DSG + T+L Y + F V +R+S +S +++CY
Sbjct: 294 FKVGKVTV-DTEFTATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYI 352
Query: 379 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
S+ + K+P + ++ V + I F
Sbjct: 353 ITSTSDEDKLPSVSFEMKGGAAYDVFSPILVF 384
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 100/290 (34%), Positives = 153/290 (52%), Gaps = 19/290 (6%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
F ++ H FSD K+ ++ + D P++ S+EY ++L D R L SNN
Sbjct: 29 FGFEVHHIFSDSVKQ-------SLGLGDLVPEQGSLEYFKVLAHRD--RLIRGRGLASNN 79
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC-IQ 142
+ + + F +G L+Y + +GTP SFLVALD GS+L W+PC C
Sbjct: 80 DET--PITF--DGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT 135
Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED 202
C L+ Y P++S++S ++ CS C C S CPY YS
Sbjct: 136 CIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISYS-NS 194
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
T + G L+ D+LHLA+ ++ + V+++V +GCG+KQTG + + +GV+GLG+ S
Sbjct: 195 TGTKGTLLQDVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252
Query: 263 VPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEK 310
VPSLLAKA + NSFS+CF + G + FGD+G Q+ T F+ + +
Sbjct: 253 VPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPR 302
>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 430
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 89/223 (39%), Positives = 128/223 (57%), Gaps = 9/223 (4%)
Query: 158 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 217
L+ Y P+ S++S V C+ LC + C S ++ CPY Y + +TSS GYLV+D+LHLA
Sbjct: 3 LNHYSPNDSTTSSTVPCTSSLC---NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA 59
Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
+ + V++ + GCG QTG + AAP+G++GLG+ +SVPS LA GL NSF
Sbjct: 60 T--DDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSF 117
Query: 278 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 337
S+CF + G + FGD GPA Q+ T F + E Y +Y V +G F A+
Sbjct: 118 SMCFGADGYGRIDFGDTGPADQKQTPFNTMLE-YQSYNVTFNVINVGGEP-NDVPFTAIF 175
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCY 378
DSG SFT+L Y+ + + D + KR SL G + ++YCY
Sbjct: 176 DSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCY 218
>gi|297819832|ref|XP_002877799.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
lyrata]
gi|297323637|gb|EFH54058.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
lyrata]
Length = 414
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 122/411 (29%), Positives = 184/411 (44%), Gaps = 72/411 (17%)
Query: 4 LVAICMLFGCILLDGSD-AVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYL 62
V + +L C L + A FS ++ H FSD K+ N+ D P+K S+EY
Sbjct: 8 FVLLSVLVACWGLQRCESAGKFSFEVHHMFSDTVKQ-------NLGFGDLVPEKGSLEYF 60
Query: 63 ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
+LL D + R + S+NN E T GN+ T ++
Sbjct: 61 KLLAQRD---RLIRGRGLSSNNE---------EAPVTFILGNR------------TVSID 96
Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
FL GS+L W+PC C T+ R+L + + S+
Sbjct: 97 FL-----GSDLFWLPCNC----------GTTCIRDLED-----------------IGLSQ 124
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
C S CPY Y TS+ G L +D+LHL ++ V++++ +GCG+ QTG
Sbjct: 125 GGCSSPASVCPYQIPYLFNTTSTRGTLFEDVLHLV--TEDEGLEPVKANITLGCGQNQTG 182
Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPATQQ 300
Y A +G++GLG+ D SVPS+LAK + NSFS+CF + G + FGD+G Q
Sbjct: 183 LYRKSLAVNGLLGLGMKDYSVPSVLAKENITANSFSMCFGNIIDFIGRISFGDRGHTDQL 242
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
T +PI E Y V V +G L + AL D+G SFT L Y + FD
Sbjct: 243 QTPLVPI-EPNPTYAVNVTEVTVGGDIL-EIQMLALFDTGTSFTHLLEPAYGLLTKAFDD 300
Query: 361 LVSSKRISLQGN-SWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIFS 409
V+ KR + +++CY+ S + K P + + F +R+ +F+
Sbjct: 301 HVTDKRRPIDPEIPFEFCYDTSPNIKSFKFPRVNMTFVGGSKLTLRDPLFT 351
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 180/373 (48%), Gaps = 41/373 (10%)
Query: 71 KRQKTRVKLQSNNNSSRNQLL----FPSEG-SQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
KR K L++++ ++LL P G SQ G L++ I +GTP+ F V
Sbjct: 46 KRVKDLGALRAHDVHRHSRLLSAIDIPLGGDSQPESIG-----LYFAKIGLGTPSRDFHV 100
Query: 126 ALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KS 181
+D GS++LWV C CI+C P + L+ YD +SS++K+VSCS C
Sbjct: 101 QVDTGSDILWVNCAGCIRC-PRKSDLV-----ELTPYDVDASSTAKSVSCSDNFCSYVNQ 154
Query: 182 RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
RS C S C Y+ Y + +S++GYLV D++HL + + S ++I GCG KQ+
Sbjct: 155 RSECHS-GSTCQYVIMYG-DGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQS 212
Query: 242 GSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
G + AA DG+MG G + S S LA G ++ SF+ C D N+ G +F G
Sbjct: 213 GQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIF--AIGEVVSP 270
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSC--LTQSGFQA------LVDSGASFTFLPTEIYA 352
P+ K Y V + + +GNS L+ + F + ++DSG + +LP +Y
Sbjct: 271 KVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYN 330
Query: 353 EVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSF 410
++ + L S ++L + ++++ + P + F K+ S V R ++F
Sbjct: 331 PLLNEI--LASHPELTLHTVQESFTCFHYTDKLDRFPTVTFQFDKSVSLAVYPREYLFQV 388
Query: 411 PENEVGDHACFSY 423
E D CF +
Sbjct: 389 RE----DTWCFGW 397
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 112/371 (30%), Positives = 177/371 (47%), Gaps = 37/371 (9%)
Query: 71 KRQKTRVKLQSNNNSSRNQLL----FPSEG-SQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
KR+K L++++ ++LL P G SQ G L++ I +GTP+ F V
Sbjct: 46 KREKDLGALRAHDVHRHSRLLSAIDLPLGGDSQPESIG-----LYFAKIGLGTPSRDFHV 100
Query: 126 ALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KS 181
+D GS++LWV C CI+C P + L+ YD +SS++K+VSCS C
Sbjct: 101 QVDTGSDILWVNCAGCIRC-PRKSDLV-----ELTPYDADASSTAKSVSCSDNFCSYVNQ 154
Query: 182 RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
RS C S C Y+ Y + +S++GYLV D++HL + + S ++I GCG KQ+
Sbjct: 155 RSECHS-GSTCQYVILYG-DGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTIIFGCGSKQS 212
Query: 242 GSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
G + AA DG+MG G + S S LA G ++ SF+ C D N+ G +F G
Sbjct: 213 GQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIF--AIGEVVSP 270
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQS--GFQA------LVDSGASFTFLPTEIYA 352
P+ K Y V + + +GNS L S F + ++DSG + +LP +Y
Sbjct: 271 KVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGTTLVYLPDAVYN 330
Query: 353 EVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
++ + L S + ++L + + + + P + F K+ S V + F
Sbjct: 331 PLMNQI--LASHQELNLHTVQDSFTCFHYIDRLDRFPTVTFQFDKSVSLAVYPQEYLFQV 388
Query: 413 NEVGDHACFSY 423
E D CF +
Sbjct: 389 RE--DTWCFGW 397
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 177/364 (48%), Gaps = 30/364 (8%)
Query: 71 KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF---YWLHYTWIDIGTPNVSFLVAL 127
+RQ + ++++++S R ++L + GN L++T I +G+P+ + V +
Sbjct: 30 RRQASLTGIKAHDSSRRGRIL---SAVDFNLGGNGLPTVTGLYFTKIGLGSPSKDYYVQV 86
Query: 128 DAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCK 186
D GS++LWV C +C +C S + L+ YDP S +S+ VSC H C S +
Sbjct: 87 DTGSDILWVNCVECTRCPRKS-----DIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGR 141
Query: 187 SL----KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
L ++PCPY Y + ++++GY V D L + + ++ SS+I GCG Q+G
Sbjct: 142 ILGCKAENPCPYSISYG-DGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGCGAAQSG 200
Query: 243 SYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF-FGDQGPATQ 299
++ + A DG++G G + SV S LA +G ++ FS C D N G +F G+
Sbjct: 201 TFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPKV 260
Query: 300 QSTSFLPIGEKYDAYFVGVES-----YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEV 354
++T +P Y+ +E ++ +++G ++DSG + +LP +Y ++
Sbjct: 261 KTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQL 320
Query: 355 VVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN 413
+ K L R+ + +Y C+ + P ++L F + S V H + F N
Sbjct: 321 MSKV--LAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLF--N 376
Query: 414 EVGD 417
GD
Sbjct: 377 YKGD 380
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 97/321 (30%), Positives = 150/321 (46%), Gaps = 28/321 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+Y I IGTP S+ V +D GS+++WV C QC QC S +L L+ Y+ S
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNIDESD 133
Query: 168 SSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
S K VSC C S CK+ CPY+ Y + +S++GY V D++ S +
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKDVVQYDSVAGD 191
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ SVI GCG +Q+G LD + A DG++G G + S+ S LA +G ++ F+
Sbjct: 192 LKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAH 250
Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGF 333
C D + G +F G Q + P+ Y V + + +G LT Q G
Sbjct: 251 CLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGD 308
Query: 334 Q--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
+ A++DSG + +LP IY +V K + ++ + +K C+ S P++
Sbjct: 309 RKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFPNV 367
Query: 392 RLIFSKNQSFVVRNHIFSFPE 412
F + V H + FP
Sbjct: 368 TFHFENSVFLRVYPHDYLFPH 388
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 126/259 (48%), Gaps = 20/259 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I IGTP V + V LD GS WV C QC + + + R L+ YDP SS
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 112
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
SSK V C +C SR C ++ CPYI Y+ + + G L D+LH +
Sbjct: 113 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 170
Query: 228 VQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+SV GCG +Q+GS + A A DG++G G + + S LA AG + FS C D +
Sbjct: 171 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 230
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL--------TQSGFQALV 337
G +F G + PI + + Y V ++S + + L T +
Sbjct: 231 GGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 288
Query: 338 DSGASFTFLPTEIYAEVVV 356
DSG++ +LP IY+E+++
Sbjct: 289 DSGSTLVYLPEIIYSELIL 307
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 126/259 (48%), Gaps = 20/259 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I IGTP V + V LD GS WV C QC + + + R L+ YDP SS
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 136
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
SSK V C +C SR C ++ CPYI Y+ + + G L D+LH +
Sbjct: 137 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 194
Query: 228 VQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+SV GCG +Q+GS + A A DG++G G + + S LA AG + FS C D +
Sbjct: 195 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 254
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL--------TQSGFQALV 337
G +F G + PI + + Y V ++S + + L T +
Sbjct: 255 GGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 312
Query: 338 DSGASFTFLPTEIYAEVVV 356
DSG++ +LP IY+E+++
Sbjct: 313 DSGSTLVYLPEIIYSELIL 331
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 126/259 (48%), Gaps = 20/259 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I IGTP V + V LD GS WV C QC + + + R L+ YDP SS
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 136
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
SSK V C +C SR C ++ CPYI Y+ + + G L D+LH +
Sbjct: 137 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 194
Query: 228 VQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+SV GCG +Q+GS + A A DG++G G + + S LA AG + FS C D +
Sbjct: 195 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 254
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL--------TQSGFQALV 337
G +F G + PI + + Y V ++S + + L T +
Sbjct: 255 GGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 312
Query: 338 DSGASFTFLPTEIYAEVVV 356
DSG++ +LP IY+E+++
Sbjct: 313 DSGSTLVYLPEIIYSELIL 331
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 126/259 (48%), Gaps = 20/259 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I IGTP V + V LD GS WV C QC + + + R L+ YDP SS
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 136
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
SSK V C +C SR C ++ CPYI Y+ + + G L D+LH +
Sbjct: 137 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 194
Query: 228 VQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+SV GCG +Q+GS + A A DG++G G + + S LA AG + FS C D +
Sbjct: 195 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 254
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL--------TQSGFQALV 337
G +F G + PI + + Y V ++S + + L T +
Sbjct: 255 GGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 312
Query: 338 DSGASFTFLPTEIYAEVVV 356
DSG++ +LP IY+E+++
Sbjct: 313 DSGSTLVYLPEIIYSELIL 331
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 126/259 (48%), Gaps = 20/259 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I IGTP V + V LD GS WV C QC + + + R L+ YDP SS
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 112
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
SSK V C +C SR C ++ CPYI Y+ + + G L D+LH +
Sbjct: 113 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 170
Query: 228 VQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+SV GCG +Q+GS + A A DG++G G + + S LA AG + FS C D +
Sbjct: 171 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 230
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL--------TQSGFQALV 337
G +F G + PI + + Y V ++S + + L T +
Sbjct: 231 GGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 288
Query: 338 DSGASFTFLPTEIYAEVVV 356
DSG++ +LP IY+E+++
Sbjct: 289 DSGSTLVYLPEIIYSELIL 307
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/320 (30%), Positives = 149/320 (46%), Gaps = 28/320 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+Y I IGTP S+ V +D GS+++WV C QC QC S +L L+ Y+ S
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNIDESD 133
Query: 168 SSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
S K VSC C S CK+ CPY+ Y + +S++GY V D++ S +
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKDVVQYDSVAGD 191
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ SVI GCG +Q+G LD + A DG++G G + S+ S LA +G ++ F+
Sbjct: 192 LKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAH 250
Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGF 333
C D + G +F G Q + P+ Y V + + +G L Q G
Sbjct: 251 CLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGD 308
Query: 334 Q--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
+ A++DSG + +LP IY +V K + ++ + +K C+ S P++
Sbjct: 309 RKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFPNV 367
Query: 392 RLIFSKNQSFVVRNHIFSFP 411
F + V H + FP
Sbjct: 368 TFHFENSVFLRVYPHDYLFP 387
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 98/321 (30%), Positives = 150/321 (46%), Gaps = 32/321 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+Y I IGTP S+ V +D GS+++WV C QC QC S +L L+ Y+ S
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNIDESD 133
Query: 168 SSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
S K VSC C S CK+ CPY+ Y + +S++GY V D++ S +
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKDVVQYDSVAGD 191
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ SVI GCG +Q+G LD + A DG++G G + S+ S LA +G ++ F+
Sbjct: 192 LKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAH 250
Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGF 333
C D + G +F G Q + P+ Y V + + +G LT Q G
Sbjct: 251 CLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGD 308
Query: 334 Q--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
+ A++DSG + +LP IY +V K L ++ + +K C+ S P++
Sbjct: 309 RKGAIIDSGTTLAYLPEIIYEPLVKKEPAL----KVHIVDKDYK-CFQYSGRVDEGFPNV 363
Query: 392 RLIFSKNQSFVVRNHIFSFPE 412
F + V H + FP
Sbjct: 364 TFHFENSVFLRVYPHDYLFPH 384
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 152/321 (47%), Gaps = 30/321 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+Y I IGTP+ + + +D G++++WV C QC +C S +L +L+ Y+ SS
Sbjct: 72 LYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRS-----NLGMDLTLYNIKESS 126
Query: 168 SSKNVSCSHPLCKS-----RSSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
S K V C LCK + C S D CPY+ Y + +S++GY V D++ S
Sbjct: 127 SGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYG-DGSSTAGYFVKDVVLFDQVSG 185
Query: 222 HAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+S SVI GCG +Q+G SY + A DG++G G + S+ S L+ +G ++ F+
Sbjct: 186 DLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAH 245
Query: 280 CFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQ- 334
C + + G +F G T +T LP Y ++ +G++ L T + Q
Sbjct: 246 CLNGVNGGGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQ---VGHTFLNLSTDASEQR 302
Query: 335 ----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVP 389
++DSG + +LP IY +V K L + +Q +Y C+ S P
Sbjct: 303 DSKGTIIDSGTTLAYLPDGIYQPLVYKI--LSQQPNLKVQTLHDEYTCFQYSGSVDDGFP 360
Query: 390 DMRLIFSKNQSFVVRNHIFSF 410
++ F S V H + F
Sbjct: 361 NVTFYFENGLSLKVYPHDYLF 381
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 171/386 (44%), Gaps = 36/386 (9%)
Query: 47 VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
+++ ++P + VE L R + RV+ SS + F G+ F
Sbjct: 31 LTLERAFPTNHGVEIAHL-------RSRDRVRHGRMLQSSGGVIDFSVSGTYDPFL---- 79
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
L+YT + +G P F V +D GS++LWV C P + + L L+ +DP SS
Sbjct: 80 VGLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPAT----SGLQIPLNFFDPGSS 135
Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
+++ VSCS +C S S+C + C Y+ Y + + +SGY V D++HL
Sbjct: 136 TTASLVSCSDQICALGVQSSDSACFGQSNQCAYVFQYG-DGSGTSGYYVMDMIHLDVVID 194
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+ S+ +SV+ GC QTG A DG+ G G D+SV S L+ G+ FS C
Sbjct: 195 SSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHC 254
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSG 332
+DSG G + + + P+ Y + ++S + L T S
Sbjct: 255 LKGDDSGGGIL-VLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPISPAVFATSSS 313
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKVPD 390
++DSG + +L E Y VV +V S++ + L+GN CY SS P
Sbjct: 314 QGTIIDSGTTLAYLAEEAYNAFVVAVTNIVSQSTQSVVLKGNR---CYVTSSSVSDIFPQ 370
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEVG 416
+ L F+ S V+ + +N VG
Sbjct: 371 VSLNFAGGASLVLGAQDYLIQQNSVG 396
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 114/406 (28%), Positives = 184/406 (45%), Gaps = 36/406 (8%)
Query: 47 VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
+++ ++P ++VE L L + D R R LQS+N + F +G+ F
Sbjct: 23 LTLERAFPTNHTVE-LSQLRARDALRH--RRMLQSSNGV----VDFSVQGTFDPFQ---- 71
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
L+YT + +GTP V F V +D GS++LWV C C C S L L+ +DP S
Sbjct: 72 VGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSG-----LQIQLNFFDPGS 126
Query: 166 SSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
SS+S ++CS C S ++C S + C Y Y + + +SGY V D++HL +
Sbjct: 127 SSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSDMMHLNTIF 185
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ + ++ + V+ GC +QTG A DG+ G G ++SV S L+ G+ FS
Sbjct: 186 EGSVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSH 245
Query: 280 CF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYD----AYFVGVESYCIGNSCLTQSGF 333
C D + G + G+ TS +P Y+ + V ++ I +S S
Sbjct: 246 CLKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNS 305
Query: 334 QA-LVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKVPD 390
+ +VDSG + +L E Y V + S + +GN CY +S P
Sbjct: 306 RGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTVVSRGNQ---CYLITSSVTEVFPQ 362
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGILIL 436
+ L F+ S ++R + +N +G A + + GI IL
Sbjct: 363 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITIL 408
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 109/405 (26%), Positives = 182/405 (44%), Gaps = 34/405 (8%)
Query: 47 VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
+++ ++P + VE L L + D R + ++ SS + F +G+ F
Sbjct: 26 LTLERAFPTNHGVE-LSQLRARDELRHRRMLQ------SSSGVVDFSVQGTFDPFQ---- 74
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
L+YT + +GTP V F V +D GS++LWV C P + + L L+ +DP SS
Sbjct: 75 VGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQT----SGLQIQLNFFDPGSS 130
Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
S+S ++CS C S ++C S + C Y Y + + +SGY V D++HL + +
Sbjct: 131 STSSMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSDMMHLNTIFE 189
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+ ++ + V+ GC +QTG A DG+ G G ++SV S L+ G+ FS C
Sbjct: 190 GSMTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHC 249
Query: 281 F--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF----VGVESYCIGNSCLTQSGFQ 334
D + G + G+ TS +P Y+ V ++ I +S S +
Sbjct: 250 LKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSR 309
Query: 335 A-LVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKVPDM 391
+VDSG + +L E Y V + S + + +GN CY +S P +
Sbjct: 310 GTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVVSRGNQ---CYLITSSVTDVFPQV 366
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGILIL 436
L F+ S ++R + +N +G A + + GI IL
Sbjct: 367 SLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITIL 411
>gi|195658449|gb|ACG48692.1| hypothetical protein [Zea mays]
gi|413938915|gb|AFW73466.1| hypothetical protein ZEAMMB73_105703 [Zea mays]
Length = 149
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 61/126 (48%), Positives = 79/126 (62%), Gaps = 13/126 (10%)
Query: 23 SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
+FSS++VHR SDEA+ + G WP++ S Y LL +D +RQK R+
Sbjct: 26 TFSSRMVHRLSDEARLEAGPRMG------LWPQRGSGGYYRALLRSDLQRQKRRL----- 74
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
+ +NQLL S+G T GN WL+Y W+D+GTP SFLVALD GS+L WVPC CIQ
Sbjct: 75 --AGKNQLLSLSKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQ 132
Query: 143 CAPLSA 148
CAPLS+
Sbjct: 133 CAPLSS 138
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 88/262 (33%), Positives = 124/262 (47%), Gaps = 24/262 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I+IGTP + V +D GS++LWV C C +C S L +L YDP SS
Sbjct: 82 LYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKS-----DLGIDLRLYDPKGSS 136
Query: 168 SSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
S VSC C + K + PC Y Y + +S++GY V D L S
Sbjct: 137 SGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYG-DGSSTTGYFVSDSLQYNQVSGDG 195
Query: 224 PQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+SVI GCG +Q G A DG++G G + S+ S LA AG ++ FS C D
Sbjct: 196 QTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLD 255
Query: 283 ENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
G +F GD +ST +P Y+ V +ES +G + L T
Sbjct: 256 TIKGGGIFAIGDVVQPKVKSTPLVPDMPHYN---VNLESINVGGTTLQLPSHMFETGEKK 312
Query: 334 QALVDSGASFTFLPTEIYAEVV 355
++DSG + T+LP +Y +V+
Sbjct: 313 GTIIDSGTTLTYLPELVYKDVL 334
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 172/363 (47%), Gaps = 40/363 (11%)
Query: 71 KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF---YWLHYTWIDIGTPNVSFLVAL 127
+R+++ +++++ R ++L + GN L++T + +G+P + V +
Sbjct: 31 RRKRSLNAVKAHDARRRGRIL---SAVDLNLGGNGLPTETGLYFTKLGLGSPPKDYYVQV 87
Query: 128 DAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR---- 182
D GS++LWV C +C +C S L +L+ YDP S +S+ +SC C +
Sbjct: 88 DTGSDILWVNCVKCSRCPRKS-----DLGIDLTLYDPKGSETSELISCDQEFCSATYDGP 142
Query: 183 -SSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL---HLASFSKHAPQSSVQSSVIIGCGR 238
CKS + PCPY Y + ++++GY V D L H+ + APQ+S S+I GCG
Sbjct: 143 IPGCKS-EIPCPYSITYG-DGSATTGYYVQDYLTYNHVNDNLRTAPQNS---SIIFGCGA 197
Query: 239 KQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGP 296
Q+G+ + A DG++G G + SV S LA +G ++ FS C D G +F G
Sbjct: 198 VQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNIRGGGIF--AIGE 255
Query: 297 ATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPT 348
+ S P+ + Y V ++S + L + +G ++DSG + +LP
Sbjct: 256 VVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSGNGKGTIIDSGTTLAYLPA 315
Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKNQSFVVRNHI 407
+Y E++ K + R+ L ++ C+ + P ++L F + S V H
Sbjct: 316 IVYDELIPKV--MARQPRLKLYLVEQQFSCFQYTGNVDRGFPVVKLHFEDSLSLTVYPHD 373
Query: 408 FSF 410
+ F
Sbjct: 374 YLF 376
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/319 (28%), Positives = 146/319 (45%), Gaps = 26/319 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+Y I IGTP ++ + +D GS+++WV C QC +C S SL +L+ YD SS
Sbjct: 82 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRS-----SLGMDLTLYDIKESS 136
Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
S K V C CK + C + CPY+ Y + +S++GY V DI+ S
Sbjct: 137 SGKLVPCDQEFCKEINGGLLTGCTA-NISCPYLEIYG-DGSSTAGYFVKDIVLYDQVSGD 194
Query: 223 APQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
S S++ GCG +Q+G S + A DG++G G + S+ S LA +G ++ F+ C
Sbjct: 195 LKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHC 254
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQA-- 335
+ + G +F G Q + P+ Y V + + +G++ L T + Q
Sbjct: 255 LNGVNGGGIF--AIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDR 312
Query: 336 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++DSG + +LP IY +V K ++ + + C+ S P +
Sbjct: 313 KGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYT-CFQYSESVDDGFPAVT 371
Query: 393 LIFSKNQSFVVRNHIFSFP 411
F S V H + FP
Sbjct: 372 FFFENGLSLKVYPHDYLFP 390
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 89/321 (27%), Positives = 135/321 (42%), Gaps = 26/321 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I IGTP + V +D GS++LWV C C +C S L L+ YDP SS
Sbjct: 3 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSG-----LGLELTLYDPKDSS 57
Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+ VSC C + C + PC Y Y + +S++GY V D+L S
Sbjct: 58 TGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSDLLQFDQVSGD 115
Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
S+V GCG +Q G A DG++G G + S+ S L+ AG ++ F+ C
Sbjct: 116 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL 175
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
D + G +F G Q P+ Y V ++S +G + L T
Sbjct: 176 DTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKK 233
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG + T+LP +Y E+++ K I+ C+ P +
Sbjct: 234 GTIIDSGTTLTYLPEIVYKEIMLAV--FAKHKDITFHNVQEFLCFQYVGRVDDDFPKITF 291
Query: 394 IFSKNQSFVVRNHIFSFPENE 414
F + V H + F +
Sbjct: 292 HFENDLPLNVYPHDYFFENGD 312
>gi|359496966|ref|XP_002269916.2| PREDICTED: aspartic proteinase-like protein 1-like, partial [Vitis
vinifera]
Length = 294
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/174 (36%), Positives = 102/174 (58%), Gaps = 4/174 (2%)
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
CG+ QTGS+L+GAAP+G+ GLG+G +SVPS+LAK GL+ +SFS+CF + +G + FGD+G
Sbjct: 1 CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 60
Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV 355
+ Q+ T F P + Y + + +G + + F A+ DSG SFT+L Y +
Sbjct: 61 SSGQEETPFNPSKSQL-LYNISITQISVGGTSADLN-FDAIFDSGTSFTYLNDPAYTSIS 118
Query: 356 VKFDKLVSSKRISLQGN-SWKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHI 407
F+ KR S + ++YCY+ S ++ ++ P + L +F V + I
Sbjct: 119 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPI 172
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 154/326 (47%), Gaps = 22/326 (6%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T I +G+P + V +D GS++LWV C C +C P+ T L LS YD +SS
Sbjct: 76 LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVK----TDLGIPLSLYDSKASS 130
Query: 168 SSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+SKNV C C +S K PC Y Y + ++S G V D + L + +
Sbjct: 131 TSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVYG-DGSTSDGDFVKDNITLDQVTGNLRT 189
Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ + V+ GCG+ Q+G +A DG+MG G + SV S LA G ++ FS C D
Sbjct: 190 APLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNM 249
Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGV----ESYCIGNSCLTQSG-FQALVD 338
+ G +F G+ ++T +P Y+ G+ E + S + +G ++D
Sbjct: 250 NGGGIFAIGEVESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLASTNGDGGTIID 309
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSK 397
SG + +LP +Y ++ +K+ + +++ L + C++ +S P + L F
Sbjct: 310 SGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFED 366
Query: 398 NQSFVVRNHIFSFPENEVGDHACFSY 423
+ V H + F E D CF +
Sbjct: 367 SLKLSVYPHDYLFSLRE--DMYCFGW 390
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 151/327 (46%), Gaps = 36/327 (11%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I +G PN + V +D GS+ LWV C C C S L L+ YDP+SS
Sbjct: 76 LYYTKIGLG-PN-DYYVQVDTGSDTLWVNCVGCTTCPKKSG-----LGMELTLYDPNSSK 128
Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKD-PCPYIADYSTEDTSSSGYLVDDIL--HLASF 219
+SK V C C S S CK KD CPY Y T+S Y+ DD+ +
Sbjct: 129 TSKVVPCDDEFCTSTYDGPISGCK--KDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGD 186
Query: 220 SKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
+ P ++ SVI GCG KQ+G S + DG++G G + SV S LA AG ++ F
Sbjct: 187 LRTVPDNT---SVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVF 243
Query: 278 SICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------T 329
S C D + G +F G+ ++T +P Y+ +E G+ +
Sbjct: 244 SHCLDTVNGGGIFAIGEVVQPKVKTTPLVPRMAHYNVVLKDIE--VAGDPIQLPTDIFDS 301
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML--K 387
SG ++DSG + +LP IY +++ K S + L + + C++ S E+ L
Sbjct: 302 TSGRGTIIDSGTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFT-CFHYSDEKSLDDA 360
Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENE 414
P ++ F + + H + FP E
Sbjct: 361 FPTVKFTFEEGLTLTAYPHDYLFPFKE 387
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 89/321 (27%), Positives = 135/321 (42%), Gaps = 26/321 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I IGTP + V +D GS++LWV C C +C S L L+ YDP SS
Sbjct: 88 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSG-----LGLELTLYDPKDSS 142
Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+ VSC C + C + PC Y Y + +S++GY V D+L S
Sbjct: 143 TGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSDLLQFDQVSGD 200
Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
S+V GCG +Q G A DG++G G + S+ S L+ AG ++ F+ C
Sbjct: 201 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL 260
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
D + G +F G Q P+ Y V ++S +G + L T
Sbjct: 261 DTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKK 318
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG + T+LP +Y E+++ K I+ C+ P +
Sbjct: 319 GTIIDSGTTLTYLPEIVYKEIMLAV--FAKHKDITFHNVQEFLCFQYVGRVDDDFPKITF 376
Query: 394 IFSKNQSFVVRNHIFSFPENE 414
F + V H + F +
Sbjct: 377 HFENDLPLNVYPHDYFFENGD 397
>gi|296084698|emb|CBI25840.3| unnamed protein product [Vitis vinifera]
Length = 306
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 64/174 (36%), Positives = 102/174 (58%), Gaps = 4/174 (2%)
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
CG+ QTGS+L+GAAP+G+ GLG+G +SVPS+LAK GL+ +SFS+CF + +G + FGD+G
Sbjct: 13 CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 72
Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV 355
+ Q+ T F P + Y + + +G + + F A+ DSG SFT+L Y +
Sbjct: 73 SSGQEETPFNPSKSQL-LYNISITQISVGGTSADLN-FDAIFDSGTSFTYLNDPAYTSIS 130
Query: 356 VKFDKLVSSKRISLQGN-SWKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHI 407
F+ KR S + ++YCY+ S ++ ++ P + L +F V + I
Sbjct: 131 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPI 184
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 97/324 (29%), Positives = 145/324 (44%), Gaps = 32/324 (9%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
+YT I+IGTP F V +D GS++LWV C C +C S L +L+ YDP SSS
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSG-----LGIDLALYDPKGSSS 141
Query: 169 SKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
VSC + C + C + K PC Y A+Y + +S++G V D L S
Sbjct: 142 GSAVSCDNKFCAATYGSGEKLPGCTAGK-PCEYRAEYG-DGSSTAGSFVSDSLQYNQLSG 199
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+A +++VI GCG +Q G A DG++G G + S S LA AG ++ FS C
Sbjct: 200 NAQTRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHC 259
Query: 281 FDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQS 331
D G +F G+ +ST LP Y V ++S + + L T
Sbjct: 260 LDTIKGGGIFAIGEVVQPKVKSTPLLP---NMSHYNVNLQSIDVAGNALQLPPHIFETSE 316
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
++DSG + T+LP +Y +++ F K ++QG C+ S P
Sbjct: 317 KRGTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQGF---LCFEYSESVDDGFPK 373
Query: 391 MRLIFSKNQSFVVRNHIFSFPENE 414
+ F + V H + F +
Sbjct: 374 ITFHFEDDLGLNVYPHDYFFQNGD 397
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 147/318 (46%), Gaps = 24/318 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+Y I IGTP + V +D GS+++WV C QC +C S SL +L+ Y+ + S
Sbjct: 77 LYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTS-----SLGIDLTLYNINESD 131
Query: 168 SSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ K V C C + + + CPY+ Y + +S++GY V D++ A S
Sbjct: 132 TGKLVPCDQEFCYEINGGQLPGCTANMSCPYLEIYG-DGSSTAGYFVKDVVQYARVSGDL 190
Query: 224 PQSSVQSSVIIGCGRKQTGSY--LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
++ SVI GCG +Q+G + A DG++G G + S+ S LA G ++ F+ C
Sbjct: 191 KTTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL 250
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ- 334
D + G +F G Q + P+ Y V + + +G+ L+ ++G +
Sbjct: 251 DGTNGGGIFV--IGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRK 308
Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
A++DSG + +LP +Y +V K ++ + + C+ S P++
Sbjct: 309 GAIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRDEYT-CFQYSDSLDDGFPNVTF 367
Query: 394 IFSKNQSFVVRNHIFSFP 411
F + V H + FP
Sbjct: 368 HFENSVILKVYPHEYLFP 385
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 149/325 (45%), Gaps = 21/325 (6%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT + +GTP V F V +D GS++LWV C C C S L L+ +DP SSS
Sbjct: 24 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSG-----LQIQLNFFDPGSSS 78
Query: 168 SSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+S ++CS C S ++C S + C Y Y + + +SGY V D++HL + +
Sbjct: 79 TSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSDMMHLNTIFEG 137
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
+ ++ + V+ GC +QTG A DG+ G G ++SV S L+ G+ FS C
Sbjct: 138 SVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL 197
Query: 282 --DENDSGSVFFGDQGPATQQSTSFLPIGEKYD----AYFVGVESYCIGNSCLTQSGFQA 335
D + G + G+ TS +P Y+ + V ++ I +S S +
Sbjct: 198 KGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRG 257
Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
+VDSG + +L E Y V + + + + CY +S P + L
Sbjct: 258 TIVDSGTTLAYLAEEAYDPFVSAITASI-PQSVHTAVSRGNQCYLITSSVTEVFPQVSLN 316
Query: 395 FSKNQSFVVRNHIFSFPENEVGDHA 419
F+ S ++R + +N +G A
Sbjct: 317 FAGGASMILRPQDYLIQQNSIGGAA 341
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 95/359 (26%), Positives = 166/359 (46%), Gaps = 32/359 (8%)
Query: 71 KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF---YWLHYTWIDIGTPNVSFLVAL 127
+R+++ +++++ R ++L + GN L++T + +G+P + V +
Sbjct: 31 RRKRSLSAVRAHDVRRRGRIL---SAVDLNLGGNGLPTETGLYFTKLGLGSPPRDYYVQV 87
Query: 128 DAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR---- 182
D GS++LWV C +C +C S L +L+ YDP S +S VSC C +
Sbjct: 88 DTGSDILWVNCVECSRCPRKS-----DLGIDLTLYDPKGSETSDVVSCDQDFCSATFDGP 142
Query: 183 -SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
CKS + PCPY Y + ++++GY V D L + + S SS+I GCG Q+
Sbjct: 143 IPGCKS-EIPCPYSITYG-DGSATTGYYVQDYLTYNRINGNLRTSPQNSSIIFGCGAVQS 200
Query: 242 GSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ 299
G+ + A DG++G G + SV S LA +G ++ FS C D G +F G +
Sbjct: 201 GTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDNVRGGGIF--AIGEVVE 258
Query: 300 QSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIY 351
S P+ + Y V ++S + L + +G ++DSG + +LP +Y
Sbjct: 259 PKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPDIVY 318
Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
E++ K ++ L ++ C+ + P ++L F + S V H + F
Sbjct: 319 DELIQKVLARQPGLKLYLVEQQFR-CFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLF 376
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 157/371 (42%), Gaps = 43/371 (11%)
Query: 60 EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
E+L L +D +R T V L N P++ L++T I IGTP
Sbjct: 56 EHLAALRKHDGRRLLTAVDLPLGGNG------IPTD-----------TGLYFTQIGIGTP 98
Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
+ + V +D GS++LWV CI C S + L +L+ YDP++S+SSK V+C C
Sbjct: 99 SKGYYVQVDTGSDILWV--NCISCD--SCPRKSGLGIDLTLYDPTASASSKTVTCGQEFC 154
Query: 180 KSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
+ + SC + PC Y Y + +S++G+ V D L S + +SV
Sbjct: 155 ATATNGGVPPSCAA-NSPCQYSITYG-DGSSTTGFFVADFLQYDQVSGDGQTNLANASVT 212
Query: 234 IGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
GCG K G+ A DG++G G + S+ S L AG + FS C D + G +F
Sbjct: 213 FGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVNGGGIF-- 270
Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT---------QSGFQALVDSGASF 343
G Q P+ Y V +++ +G S L ++DSG +
Sbjct: 271 AIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSRGTIIDSGTTL 330
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
+LP +Y V+ + ++L+ C+ S P++ F + VV
Sbjct: 331 AYLPEVVYKAVLSAV--FSNHPDVTLKNVQDFLCFQYSGSVDNGFPEVTFHFDGDLPLVV 388
Query: 404 RNHIFSFPENE 414
H + F E
Sbjct: 389 YPHDYLFQNTE 399
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 81/263 (30%), Positives = 121/263 (46%), Gaps = 24/263 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I IGTP + V +D GS++LWV C C +C S L L+ YDP SS
Sbjct: 32 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSG-----LGLELTLYDPKDSS 86
Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+ VSC C + C + PC Y Y + +S++GY V D+L S
Sbjct: 87 TGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSDLLQFDQVSGD 144
Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
S+V GCG +Q G A DG++G G + S+ S L+ AG ++ F+ C
Sbjct: 145 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL 204
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
D + G +F G Q P+ Y V ++S +G + L T
Sbjct: 205 DTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKK 262
Query: 334 QALVDSGASFTFLPTEIYAEVVV 356
++DSG + T+LP +Y E+++
Sbjct: 263 GTIIDSGTTLTYLPEIVYKEIML 285
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 149/331 (45%), Gaps = 29/331 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+Y I IGTP ++ + +D GS+++WV C QC +C S +L +L+ YD SS
Sbjct: 84 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRS-----NLGMDLTLYDIKESS 138
Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
S K V C CK + C + CPY+ Y + +S++GY V DI+ S
Sbjct: 139 SGKFVPCDQEFCKEINGGLLTGCTA-NISCPYLEIYG-DGSSTAGYFVKDIVLYDQVSGD 196
Query: 223 APQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
S S++ GCG +Q+G S + A G++G G + S+ S LA +G ++ F+ C
Sbjct: 197 LKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHC 256
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQA-- 335
+ + G +F G Q + P+ Y V + + +G++ L T + Q
Sbjct: 257 LNGVNGGGIF--AIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDR 314
Query: 336 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++DSG + +LP IY +V K ++ + + C+ S P +
Sbjct: 315 KGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYT-CFQYSESVDDGFPAVT 373
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F S V H + FP GD C +
Sbjct: 374 FYFENGLSLKVYPHDYLFPS---GDFWCIGW 401
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/326 (27%), Positives = 152/326 (46%), Gaps = 22/326 (6%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T I +G+P + V +D GS++LWV C C +C P+ T L LS YD +SS
Sbjct: 77 LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVK----TDLGIPLSLYDSKTSS 131
Query: 168 SSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+SKNV C C +S K PC Y Y TS ++ D+I L + +
Sbjct: 132 TSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNIT-LEQVTGNLRT 190
Query: 226 SSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ + V+ GCG+ Q+G +A DG+MG G + S+ S LA G + FS C D
Sbjct: 191 APLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNM 250
Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGV----ESYCIGNSCLTQSG-FQALVD 338
+ G +F G+ ++T +P Y+ G+ + + S + +G ++D
Sbjct: 251 NGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIID 310
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSK 397
SG + +LP +Y ++ +K+ + +++ L + C++ +S P + L F
Sbjct: 311 SGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFED 367
Query: 398 NQSFVVRNHIFSFPENEVGDHACFSY 423
+ V H + F E D CF +
Sbjct: 368 SLKLSVYPHDYLFSLRE--DMYCFGW 391
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/326 (27%), Positives = 152/326 (46%), Gaps = 22/326 (6%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T I +G+P + V +D GS++LWV C C +C P+ T L LS YD +SS
Sbjct: 73 LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVK----TDLGIPLSLYDSKTSS 127
Query: 168 SSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+SKNV C C +S K PC Y Y TS ++ D+I L + +
Sbjct: 128 TSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNIT-LEQVTGNLRT 186
Query: 226 SSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ + V+ GCG+ Q+G +A DG+MG G + S+ S LA G + FS C D
Sbjct: 187 APLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNM 246
Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGV----ESYCIGNSCLTQSG-FQALVD 338
+ G +F G+ ++T +P Y+ G+ + + S + +G ++D
Sbjct: 247 NGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIID 306
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSK 397
SG + +LP +Y ++ +K+ + +++ L + C++ +S P + L F
Sbjct: 307 SGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFED 363
Query: 398 NQSFVVRNHIFSFPENEVGDHACFSY 423
+ V H + F E D CF +
Sbjct: 364 SLKLSVYPHDYLFSLRE--DMYCFGW 387
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 145/323 (44%), Gaps = 27/323 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT + +GTP F V +D GS++LWV C C QC + + L +L+ YDP +SS
Sbjct: 87 LYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCP-----HKSGLGLDLTLYDPKASS 141
Query: 168 SSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ V C C + S PC Y Y + +S+ G V+D L +
Sbjct: 142 TGSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTYG-DGSSTVGSFVNDALQFDQVTGDG 200
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+SVI GCG +Q G + A DG++G G + S+ S LA AG ++ F+ C D
Sbjct: 201 QTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLD 260
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ-- 334
G +F G Q P+ Y V +++ +G + L + G +
Sbjct: 261 TIKGGGIFA--IGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLELPADIFKPGEKRG 318
Query: 335 ALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG + T+LP ++ +V++ F+K + I+ C+ S P +
Sbjct: 319 TIIDSGTTLTYLPELVFKKVMLAVFNK---HQDITFHDVQDFLCFEYSGSVDDGFPTLTF 375
Query: 394 IFSKNQSFVVRNHIFSFPE-NEV 415
F + + V H + FP N+V
Sbjct: 376 HFEDDLALHVYPHEYFFPNGNDV 398
>gi|374255989|gb|AEZ00856.1| putative peptidase A1 protein, partial [Elaeis guineensis]
Length = 263
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 101/183 (55%), Gaps = 4/183 (2%)
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
V++ ++ GCG+ QTG++LD AAP+G+ GLG+ VSVPS+LA G NSFS+CF + G
Sbjct: 11 VKAPIVFGCGQVQTGAFLDSAAPNGLFGLGMDKVSVPSVLASKGYASNSFSMCFGSDGMG 70
Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
++FGD G + Q T F + + Y + + +GNS + + A+VDSG SFT L
Sbjct: 71 RIYFGDTGSSDQGETPF-DVNHSHPTYNISLIGMEVGNSSIDVNS-SAIVDSGTSFTCLA 128
Query: 348 TEIYAEVVVKFDKLVSSKR-ISLQGNSWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRN 405
+Y ++ F V R S G ++YCY S ++ + +P + L F + +
Sbjct: 129 DPMYTKLSESFHAQVRENRHESDPGIPFEYCYGLSRNQNSILLPKINLTTKGGSQFPIND 188
Query: 406 HIF 408
I
Sbjct: 189 PII 191
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 123/266 (46%), Gaps = 25/266 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+Y I IGTP+ + V +D GS+++WV C QC +C S SL L+ YD S+
Sbjct: 86 LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTS-----SLGMELTPYDLEEST 140
Query: 168 SSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+ K VSC C S C + CPY+ Y + +S++GY V D + S
Sbjct: 141 TGKLVSCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYG-DGSSTAGYFVKDYVQYNRVSGD 198
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
++ S+ GCG +Q+G A DG++G G + S+ S LA ++ F+ C
Sbjct: 199 LETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHC 258
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA--- 335
D + G +F G Q + P+ Y V + +G+ L S F+A
Sbjct: 259 LDGTNGGGIF--AMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDR 316
Query: 336 ---LVDSGASFTFLPTEIYAEVVVKF 358
++DSG + +LP IY +V K
Sbjct: 317 KGTIIDSGTTLAYLPELIYEPLVAKI 342
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 107/392 (27%), Positives = 172/392 (43%), Gaps = 34/392 (8%)
Query: 63 ELLLSNDWKRQKTR--VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPN 120
+L LS +R + R LQS S + FP +G+ F L+YT + +GTP
Sbjct: 10 KLKLSKLKERDRVRHGRMLQS---SGVGVVDFPVQGTFDPFL----VGLYYTRLQLGTPP 62
Query: 121 VSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC- 179
F V +D GS++LWV C P+++ + L+ +DP SS ++ +SCS C
Sbjct: 63 RDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNF----FDPGSSPTASLISCSDQRCS 118
Query: 180 ----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
S S C + + C Y Y + + +SGY V D+LH + + ++ + ++ G
Sbjct: 119 LGLQSSDSVCSAQNNLCGYNFQYG-DGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFG 177
Query: 236 CGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ 294
C QTG A DG+ G G D+SV S LA G+ +FS C +DSG
Sbjct: 178 CSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGGGIL-VL 236
Query: 295 GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFL 346
G + + + P+ Y + ++S + L T S ++DSG + +L
Sbjct: 237 GEIVEPNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSGTTLAYL 296
Query: 347 PTEIYAEVVVKFDKLVS-SKRISL-QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
Y + +VS S R L +GN +CY SS P + L F+ S ++
Sbjct: 297 AEAAYDPFISAITSIVSPSVRPYLSKGN---HCYLISSSINDIFPQVSLNFAGGASMILI 353
Query: 405 NHIFSFPENEVGDHACFSYFTLEYNFTGILIL 436
+ ++ +G A + + GI IL
Sbjct: 354 PQDYLIQQSSIGGAALWCIGFQKIQGQGITIL 385
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 123/266 (46%), Gaps = 25/266 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+Y I IGTP+ + V +D GS+++WV C QC +C S SL L+ YD S+
Sbjct: 86 LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTS-----SLGMELTPYDLEEST 140
Query: 168 SSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+ K VSC C S C + CPY+ Y + +S++GY V D + S
Sbjct: 141 TGKLVSCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYG-DGSSTAGYFVKDYVQYNRVSGD 198
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
++ S+ GCG +Q+G A DG++G G + S+ S LA ++ F+ C
Sbjct: 199 LETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHC 258
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA--- 335
D + G +F G Q + P+ Y V + +G+ L S F+A
Sbjct: 259 LDGTNGGGIF--AMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDR 316
Query: 336 ---LVDSGASFTFLPTEIYAEVVVKF 358
++DSG + +LP IY +V K
Sbjct: 317 KGTIIDSGTTLAYLPELIYEPLVAKI 342
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 90/330 (27%), Positives = 140/330 (42%), Gaps = 28/330 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T I +GTP + V +D GS++LWV C C +C S L +L+ YDP +SS
Sbjct: 83 LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSG-----LGLDLTFYDPKASS 137
Query: 168 SSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
S VSC C + K + PC Y Y + +S++G+ V D L +
Sbjct: 138 SGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYG-DGSSTTGFFVTDALQFDQVTGDG 196
Query: 224 PQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
++V GCG +Q G A DG++G G + S+ S LA AG ++ F+ C D
Sbjct: 197 QTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLD 256
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQ 334
G +F G Q P+ Y V ++S +G + L T
Sbjct: 257 TIKGGGIF--AIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKG 314
Query: 335 ALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG + T+LP ++ EV+ F+K + I C+ P +
Sbjct: 315 TIIDSGTTLTYLPELVFKEVMAAIFNK---HQDIVFHNVQDFMCFQYPGSVDDGFPTITF 371
Query: 394 IFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F + + V H + FP D C +
Sbjct: 372 HFEDDLALHVYPHEYFFPNGN--DMYCVGF 399
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/327 (28%), Positives = 148/327 (45%), Gaps = 36/327 (11%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I +G + + V +D GS+ LWV C C C S L +L+ YDP+ S
Sbjct: 75 LYYTKIGLGPKD--YYVQVDTGSDTLWVNCVGCTACPKKSG-----LGMDLTLYDPNLSK 127
Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL--HLASFS 220
+SK V C C S S C CPY Y T+S Y+ DD+ +
Sbjct: 128 TSKAVPCDDEFCTSTYDGQISGCTK-GMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDL 186
Query: 221 KHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ P ++ SVI GCG KQ+G S + DG++G G + SV S LA AG ++ FS
Sbjct: 187 RTVPDNT---SVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFS 243
Query: 279 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQ 330
C D G +F G Q P+ + Y V ++ + + +
Sbjct: 244 HCLDSISGGGIFA--IGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSS 301
Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK--V 388
SG ++DSG + +LP IY +++ K S ++ L + + C++ S EE +
Sbjct: 302 SGRGTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQFT-CFHYSDEESVDDLF 360
Query: 389 PDMRLIFSKNQSFVV--RNHIFSFPEN 413
P ++ F + + R+++F F E+
Sbjct: 361 PTVKFTFEEGLTLTTYPRDYLFLFKED 387
>gi|413924528|gb|AFW64460.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
Length = 146
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 57/124 (45%), Positives = 73/124 (58%), Gaps = 17/124 (13%)
Query: 25 SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
SS++VHR SDEA+ + G WP++ S EY L+ +D +RQK R+ + S
Sbjct: 28 SSRMVHRLSDEARLEVGPRVG------WWPQRGSGEYYRALVRSDIQRQKRRLAVLSL-- 79
Query: 85 SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
S+G T GN WL+Y W+D+GTP SFLVALD GS+L WVPC CIQCA
Sbjct: 80 ---------SKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCA 130
Query: 145 PLSA 148
PLS
Sbjct: 131 PLSG 134
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 101/389 (25%), Positives = 175/389 (44%), Gaps = 37/389 (9%)
Query: 46 NVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQ 105
+++ ++P + VE EL + + ++ LQS N + FP +G+ F
Sbjct: 24 TLTLERAFPSNDGVELSELRARDSLRHRRM---LQSTNYV----VDFPVKGT----FDPS 72
Query: 106 FYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
L+YT + +GTP V +D GS++LWV C P + + L L+ +DP S
Sbjct: 73 QVGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQT----SGLQIQLNYFDPGS 128
Query: 166 SSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
SS+S +SC C+ S +SC + C Y Y + + +SGY V D++H AS
Sbjct: 129 SSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYG-DGSGTSGYYVSDLMHFASIF 187
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ ++ +SV+ GC QTG A DG+ G G +SV S L+ G+ FS
Sbjct: 188 EGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSH 247
Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQS 331
C ++SG G + + + P+ Y + ++S + + T +
Sbjct: 248 CLKGDNSGGGVL-VLGEIVEPNIVYSPLVPSQPHYNLNLQSISVNGQIVRIAPSVFATSN 306
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKV- 388
+VDSG + +L E Y V+ ++ S + + +GN CY ++ + +
Sbjct: 307 NRGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQ---CYLITTSSNVDIF 363
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEVGD 417
P + L F+ S V+R + +N +G+
Sbjct: 364 PQVSLNFAGGASLVLRPQDYLMQQNFIGE 392
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 143/318 (44%), Gaps = 28/318 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
L++T I IGTP + V +D GS++LWV C P ++L L+ YDP S S
Sbjct: 89 LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRK----SNLGIELTMYDPRGSQS 144
Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ V+C C + SC S PC Y Y + +S++G+ V D L S
Sbjct: 145 GELVTCDQQFCVANYGGVLPSCTSTS-PCEYSISYG-DGSSTAGFFVTDFLQYNQVSGDG 202
Query: 224 PQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+ +SV GCG K G A DG++G G + S+ S LA AG ++ F+ C D
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262
Query: 283 ENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
+ G +F G+ ++T +P Y+ G++ +G + L + +
Sbjct: 263 TVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGLPTNIFDSGNSK 319
Query: 334 QALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++DSG + ++P +Y A + FDK + IS+Q C+ S P++
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYSGSVDDGFPEVT 376
Query: 393 LIFSKNQSFVVRNHIFSF 410
F + S +V H + F
Sbjct: 377 FHFEGDVSLIVSPHDYLF 394
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 95/350 (27%), Positives = 154/350 (44%), Gaps = 24/350 (6%)
Query: 72 RQKTRVKLQSNNNSSRNQLL-FPSEGS----QTHFFGNQFYWLHYTWIDIGTPNVSFLVA 126
+++ RV+ SS ++ FP +G+ F+ F L+YT + +G+P F V
Sbjct: 47 KERDRVRHSRMLQSSGGGVVDFPVQGTFDPFLVGFYFGSFCRLYYTRLQLGSPPRDFYVQ 106
Query: 127 LDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KS 181
+D GS++LWV C P+S+ + L+ +DP SS ++ +SCS C S
Sbjct: 107 IDTGSDVLWVSCSSCNGCPVSSGLHIPLNF----FDPGSSPTASLISCSDQRCSLGLQSS 162
Query: 182 RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
S C + + C Y Y + + +SGY V D+LH + + + + ++ GC QT
Sbjct: 163 DSVCAAQNNQCGYTFQYG-DGSGTSGYYVSDLLHFDTILGGSVMKNSSAPIVFGCSTLQT 221
Query: 242 GSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS--VFFGDQGPAT 298
G A DG+ G G D+SV S LA G+ FS C +DSG + G+
Sbjct: 222 GDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGGGILVLGEIVEPN 281
Query: 299 QQSTSFLPIGEKYD----AYFVGVESYCIGNSCLTQSGFQA-LVDSGASFTFLPTEIYAE 353
T +P Y+ + +V ++ I S S Q ++DSG + +L Y
Sbjct: 282 IVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGTIIDSGTTLAYLTEAAYDP 341
Query: 354 VVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
+ V S +S + CY SS P + L F+ S ++
Sbjct: 342 FISAITSTV-SPSVSPYLSKGNQCYLTSSSINDVFPQVSLNFAGGTSMIL 390
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 143/318 (44%), Gaps = 28/318 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
L++T I IGTP + V +D GS++LWV C P ++L L+ YDP S S
Sbjct: 89 LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRK----SNLGIELTMYDPRGSQS 144
Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ V+C C + SC S PC Y Y + +S++G+ V D L S
Sbjct: 145 GELVTCDQQFCVANYGGVLPSCTSTS-PCEYSISYG-DGSSTAGFFVTDFLQYNQVSGDG 202
Query: 224 PQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+ +SV GCG K G A DG++G G + S+ S LA AG ++ F+ C D
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262
Query: 283 ENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
+ G +F G+ ++T +P Y+ G++ +G + L + +
Sbjct: 263 TVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGLPTNIFDSGNSK 319
Query: 334 QALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++DSG + ++P +Y A + FDK + IS+Q C+ S P++
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYSGSVDDGFPEVT 376
Query: 393 LIFSKNQSFVVRNHIFSF 410
F + S +V H + F
Sbjct: 377 FHFEGDVSLIVSPHDYLF 394
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 84/265 (31%), Positives = 124/265 (46%), Gaps = 25/265 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT + +G+P F V +D GS++LWV C C C S L +L+ YDP+ S
Sbjct: 71 LYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSG-----LGMDLTLYDPNGSK 125
Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+S V C C S CK CPY Y + +++SG V+D L S +
Sbjct: 126 TSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITYG-DGSTTSGSFVNDSLTFDEVSGN 183
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
SSVI GCG KQ+GS + A DG++G G + SV S LA +G ++ FS C
Sbjct: 184 LHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHC 243
Query: 281 FDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------TQSG 332
D + G +F G +T +P Y+ ++ G L + SG
Sbjct: 244 LDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMD--VDGEPILLPLYLFDSGSG 301
Query: 333 FQALVDSGASFTFLPTEIYAEVVVK 357
++DSG + +LP IY +++ K
Sbjct: 302 RGTIIDSGTTLAYLPLSIYNQLLPK 326
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 107/377 (28%), Positives = 165/377 (43%), Gaps = 32/377 (8%)
Query: 63 ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
+L LS +R R + + +S + FP +G+ F L++T + +G+P
Sbjct: 41 KLELSQLKERDSFRHRRILQSTTSGGVVDFPVQGTFNPFL----VGLYFTRVQLGSPPKD 96
Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-- 180
F V +D GS++LWV C P+++ L L+ +DP SS+++ VSCS C
Sbjct: 97 FYVQIDTGSDVLWVSCSSCNGCPVTS----GLQIPLTFFDPGSSTTAALVSCSDQRCTAG 152
Query: 181 ---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ-----SSVQSSV 232
S S C S + C Y Y + + +SGY V D++HL + + + + SSV
Sbjct: 153 IQSSDSLCSSRTNQCGYTFQYG-DGSGTSGYYVADLMHLDTLLLSSGELSQICQTYDSSV 211
Query: 233 IIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS--V 289
C QTG A DG+ G G ++SV S LA G+ FS C +DSG +
Sbjct: 212 SFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLKGDDSGGGVL 271
Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYF----VGVESYCIGNSCLTQSGFQA-LVDSGASFT 344
G+ T +P Y+ Y V ++ I S S Q +VDSG +
Sbjct: 272 VLGEIVEPNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPSVFGASSNQGTIVDSGTTLA 331
Query: 345 FLPTEIYAEVVVKFDKLVS-SKRISL-QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
+L Y V +VS + R L +GN CY +S P + L F+ S +
Sbjct: 332 YLAEGAYDPFVSAITSVVSLNARTYLSKGNQ---CYLVTSSVNDVFPQVSLNFAGGASLI 388
Query: 403 VRNHIFSFPENEVGDHA 419
+ + +N VG A
Sbjct: 389 LNPQDYLLQQNSVGGAA 405
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 97/307 (31%), Positives = 142/307 (46%), Gaps = 48/307 (15%)
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
++LL ++D R VKL+S+ S P EG + L++T + +GTP
Sbjct: 1 MQLLKAHDRGRM---VKLKSSAVS------LPVEGVADPYIAG----LYFTQVQLGTPPR 47
Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
++ + +D GS+LLWV C CI C S L + YD +S+SS V CS P C
Sbjct: 48 TYNLQVDTGSDLLWVNCHPCIGCPAFS-----DLKIPIVPYDVKASASSSKVPCSDPSCT 102
Query: 181 -----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
S S C ++ C Y Y + + + GYLV+D+LH + ++VI G
Sbjct: 103 LITQISESGCND-QNQCGYSFQYG-DGSGTLGYLVEDVLHY--------MVNATATVIFG 152
Query: 236 CGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 292
CG KQ+G A DG++G G D+S S LAK G N F+ C D E G + G
Sbjct: 153 CGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLG 212
Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-------QSGFQALV-DSGASFT 344
+ Q T +P Y+ V ++S + N+ LT Q + DSG +
Sbjct: 213 NVIEPDIQYTPLVPYMSHYN---VVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLA 269
Query: 345 FLPTEIY 351
+LP E Y
Sbjct: 270 YLPDEAY 276
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 88/326 (26%), Positives = 147/326 (45%), Gaps = 35/326 (10%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++ I +G P + V +D GS++LWV C C +C S L L+ YDP SS+
Sbjct: 81 LYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKS-----DLGVKLTLYDPQSST 135
Query: 168 SSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
S+ + C C + + K L PC Y Y + +S++G+ V D L +
Sbjct: 136 SATRIYCDDDFCAATYNGVLQGCTKDL--PCQYSVVYG-DGSSTAGFFVKDNLQFDRVTG 192
Query: 222 HAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+ SS SVI GCG KQ+G A DG++G G + S+ S LA AG ++ F+ C
Sbjct: 193 NLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHC 252
Query: 281 FDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQS 331
D G +F G+ +T +P Y+ +E +G + L T
Sbjct: 253 LDNVKGGGIFAIGEVVSPKVNTTPMVPNQPHYNVVMKEIE---VGGNVLELPTDIFDTGD 309
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---YCYNASSEEMLKV 388
++DSG + +LP +Y ++ K + S++ L+ ++ + C+ +
Sbjct: 310 RRGTIIDSGTTLAYLPEVVYESMMTK----IVSEQPGLKLHTVEEQFTCFQYTGNVNEGF 365
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENE 414
P ++ F+ + S V H + F +E
Sbjct: 366 PVVKFHFNGSLSLTVNPHDYLFQIHE 391
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 136/317 (42%), Gaps = 26/317 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I +GTP + V +D GS++LWV C C QC + + L +L+ YDP +SS
Sbjct: 85 LYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCP-----HKSGLGLDLTLYDPKASS 139
Query: 168 SSKNVSCSHPLCKSRSSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ V C C + K K PC Y Y + +S+ G V D L ++
Sbjct: 140 TGSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTYG-DGSSTIGSFVTDALQFDQVTRDG 198
Query: 224 PQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+SVI GCG +Q G A DG++G G + S+ S L AG ++ F+ C D
Sbjct: 199 QTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLD 258
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF--------Q 334
G +F G Q P+ Y V +++ +G + L
Sbjct: 259 TIKGGGIF--SIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKG 316
Query: 335 ALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG + T+LP ++ EV++ F+K + I+ C+ P +
Sbjct: 317 TIIDSGTTLTYLPELVFKEVMLAVFNK---HQDITFHDVQGFLCFQYPGSVDDGFPTITF 373
Query: 394 IFSKNQSFVVRNHIFSF 410
F + + V H + F
Sbjct: 374 HFEDDLALHVYPHEYFF 390
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 105/388 (27%), Positives = 177/388 (45%), Gaps = 37/388 (9%)
Query: 47 VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
+++ ++P + VE EL + + ++ LQS N + FP +G+ F
Sbjct: 25 LTLERAFPSNDGVELSELRARDSLRHRRM---LQSTNYV----VDFPVKGT----FDPSQ 73
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
L+YT + +GTP F V +D GS++LWV C P + + L L+ +DP SS
Sbjct: 74 VGLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQT----SGLQIQLNYFDPRSS 129
Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
S+S +SCS C+ S +SC S + C Y Y + + +SGY V D++H A +
Sbjct: 130 STSSLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYG-DGSGTSGYYVSDLMHFAGIFE 188
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
++ +SV+ GC QTG A DG+ G G +SV S L+ G+ FS C
Sbjct: 189 GTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHC 248
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSG 332
++SG G + + + P+ + Y + ++S + + T +
Sbjct: 249 LKGDNSGGGVL-VLGEIVEPNIVYSPLVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNN 307
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKV-P 389
+VDSG + +L E Y V LV S + + +GN CY ++ + + P
Sbjct: 308 RGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSRGNQ---CYLITTSSNVDIFP 364
Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEVGD 417
+ L F+ S V+R + +N +G+
Sbjct: 365 QVSLNFAGGASLVLRPQDYLMQQNYIGE 392
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 142/319 (44%), Gaps = 26/319 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+Y + IGTP+ + V +D GS+++WV C QC +C S SL L+ Y+ S
Sbjct: 85 LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTS-----SLGMELTLYNIKDSV 139
Query: 168 SSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
S K V C C S C + CPY+ Y + +S++GY V D++ S
Sbjct: 140 SGKLVPCDEEFCYEVNGGPLSGCTA-NMSCPYLEIYG-DGSSTAGYFVKDVVQYDRVSGD 197
Query: 223 APQSSVQSSVIIGCGRKQTGSY--LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+S SVI GCG +Q+G A DG++G G + S+ S LA ++ F+ C
Sbjct: 198 LQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHC 257
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ 334
D + G +F G Q + P+ Y V + + +G L ++G +
Sbjct: 258 LDGINGGGIF--AIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDR 315
Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
A++DSG + +LP +Y +V K ++ + + + C+ S P++
Sbjct: 316 KGAIIDSGTTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYT-CFQYSGSVDDGFPNVT 374
Query: 393 LIFSKNQSFVVRNHIFSFP 411
F + V H + FP
Sbjct: 375 FHFENSVFLKVHPHEYLFP 393
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 97/307 (31%), Positives = 142/307 (46%), Gaps = 48/307 (15%)
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
++LL ++D R VKL+S+ S P EG + L++T + +GTP
Sbjct: 1 MQLLKAHDRGRM---VKLKSSAVS------LPVEGVADPYIAG----LYFTQVQLGTPPR 47
Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
++ + +D GS+LLWV C CI C S L + YD +S+SS V CS P C
Sbjct: 48 TYNLQVDTGSDLLWVNCHPCIGCPAFS-----DLKIPIVPYDVKASASSSKVPCSDPSCT 102
Query: 181 -----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
S S C ++ C Y Y + + + GYLV+D+LH + ++VI G
Sbjct: 103 LITQISESGCND-QNQCGYSFQYG-DGSGTLGYLVEDVLHY--------MVNATATVIFG 152
Query: 236 CGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 292
CG KQ+G A DG++G G D+S S LAK G N F+ C D E G + G
Sbjct: 153 CGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLG 212
Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-------QSGFQALV-DSGASFT 344
+ Q T +P Y+ V ++S + N+ LT Q + DSG +
Sbjct: 213 NVIEPDIQYTPLVPYMYHYN---VVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLA 269
Query: 345 FLPTEIY 351
+LP E Y
Sbjct: 270 YLPDEAY 276
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 126/266 (47%), Gaps = 26/266 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+Y I IG+P F V +D GS++LWV C C C S + +L Y+P SSS
Sbjct: 72 LYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKS-----DIGVDLQLYNPKSSS 126
Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+S ++C P C + CK C Y Y + ++++GY V+D + L +
Sbjct: 127 TSTLITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYG-DGSATAGYFVNDYIQLQRAVGN 184
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
S S++ GCG KQ+G + A DG++G G + S+ S LA G ++ F+ C
Sbjct: 185 HKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL 244
Query: 282 DENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSG 332
D G +F G+ ++T +P Y+ GV+ +G++ L T
Sbjct: 245 DSISGGGIFAIGEVVEPKLKTTPVVPNQAHYNVVLNGVK---VGDTALDLPLGLFETSYK 301
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKF 358
A++DSG + +LP IY ++ K
Sbjct: 302 RGAIIDSGTTLAYLPDSIYLPLMEKI 327
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 99/387 (25%), Positives = 160/387 (41%), Gaps = 38/387 (9%)
Query: 44 SGNVSVADSWPKKN-SVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFF 102
+G V +P+ + S ++L L ++D +R + + N L P+E
Sbjct: 27 TGVFEVRRKFPRHDGSGKHLANLRAHDARRHGRSLAAAVDLPLGGNGL--PTETG----- 79
Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
L++T I IGTP S+ V +D GS++LWV C P + L L+ YD
Sbjct: 80 ------LYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRK----SGLGIELTLYD 129
Query: 163 PSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 217
PS SSS V+C C + SC PC Y Y + +S++G+ V D L
Sbjct: 130 PSGSSSGTGVTCGQDFCVATHGGVIPSCVPAA-PCQYSISYG-DGSSTTGFFVTDFLQYN 187
Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNS 276
S ++ + +S+ GCG K G + A DG++G G + S+ S LA AG ++
Sbjct: 188 QVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKV 247
Query: 277 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------- 328
F+ C D + G +F G Q S P+ Y V +E+ +G L
Sbjct: 248 FAHCLDTINGGGIF--AIGDVVQPKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFD 305
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
++DSG + +LP +Y ++ K + L+ + C+ S
Sbjct: 306 IGESKGTIIDSGTTLAYLPGVVYNAIMSKV--FAQYGDMPLKNDQDFQCFRYSGSVDDGF 363
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEV 415
P + F + H + F E+
Sbjct: 364 PIITFHFEGGLPLNIHPHDYLFQNGEL 390
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 144/320 (45%), Gaps = 24/320 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++ I IGTP+ + V +D GS++LWV C C +C S L +L+ YD +S+
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 208
Query: 168 SSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+S V C C CK C Y Y + +S++GY V D + S +
Sbjct: 209 TSDAVGCDDNFCSLYDGPLPGCKPGLQ-CLYSVLYG-DGSSTTGYFVQDFVQYNRISGNF 266
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+ +V+ GCG KQ+G + A DG++G G + S+ S LA +G ++ FS C D
Sbjct: 267 QTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLD 326
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ-- 334
D G +F G + + P+ + Y V ++ +G L +SG +
Sbjct: 327 NVDGGGIF--AIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG 384
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
++DSG + + P E+Y ++ K R+ ++ C++ + P + L
Sbjct: 385 TIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTGNVDDGFPTVTLH 443
Query: 395 FSKNQSFVVRNHIFSFPENE 414
F K+ S V H + F E
Sbjct: 444 FDKSISLTVYPHEYLFQVKE 463
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 144/320 (45%), Gaps = 24/320 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++ I IGTP+ + V +D GS++LWV C C +C S L +L+ YD +S+
Sbjct: 73 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 127
Query: 168 SSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+S V C C CK C Y Y + +S++GY V D + S +
Sbjct: 128 TSDAVGCDDNFCSLYDGPLPGCKPGLQ-CLYSVLYG-DGSSTTGYFVQDFVQYNRISGNF 185
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+ +V+ GCG KQ+G + A DG++G G + S+ S LA +G ++ FS C D
Sbjct: 186 QTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLD 245
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ-- 334
D G +F G + + P+ + Y V ++ +G L +SG +
Sbjct: 246 NVDGGGIFA--IGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG 303
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
++DSG + + P E+Y ++ K R+ ++ C++ + P + L
Sbjct: 304 TIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTGNVDDGFPTVTLH 362
Query: 395 FSKNQSFVVRNHIFSFPENE 414
F K+ S V H + F E
Sbjct: 363 FDKSISLTVYPHEYLFQVKE 382
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/321 (27%), Positives = 140/321 (43%), Gaps = 26/321 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
L++T I IGTP + V +D GS++LWV C P ++L L+ YDP S S
Sbjct: 89 LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRK----SNLGIELTMYDPRGSQS 144
Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ V+C C + SC S PC Y Y + +S++G+ V D L S
Sbjct: 145 GELVTCDQQFCVANYGGVLPSCTSTS-PCEYSISYG-DGSSTAGFFVTDFLQYNQVSGDG 202
Query: 224 PQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+ +SV GCG K G A DG++G G + S+ S LA AG ++ F+ C D
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQ 334
+ G +F G Q P+ Y V ++ +G + L + +
Sbjct: 263 TVNGGGIFA--IGNVVQPKVKTTPLVSDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKG 320
Query: 335 ALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG + ++P +Y A + FDK + IS+Q C+ S P++
Sbjct: 321 TIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYSGSVDDGFPEVTF 377
Query: 394 IFSKNQSFVVRNHIFSFPENE 414
F + S +V H + F +
Sbjct: 378 HFEGDVSLIVSPHDYLFQNGK 398
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 87/316 (27%), Positives = 145/316 (45%), Gaps = 21/316 (6%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T I +G+P + V +D GS++LW+ C+ C +C T+L+ LS +D ++SS
Sbjct: 73 LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPT-----KTNLNFRLSLFDMNASS 127
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+SK V C C S S + C Y Y+ E TS G + D+L L +
Sbjct: 128 TSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSD-GKFIRDMLTLEQVTGDLKT 186
Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ V+ GCG Q+G +G +A DGVMG G + SV S LA G + FS C D
Sbjct: 187 GPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNV 246
Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCLTQSGFQALVDS 339
G +F G ++T +P Y+ +G++ S + S + G +VDS
Sbjct: 247 KGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGG--TIVDS 304
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKN 398
G + + P +Y ++ + +++ + + L + C++ S+ P + F +
Sbjct: 305 GTTLAYFPKVLYDSLI---ETILARQPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDS 361
Query: 399 QSFVVRNHIFSFPENE 414
V H + F E
Sbjct: 362 VKLTVYPHDYLFTLEE 377
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 143/316 (45%), Gaps = 24/316 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++ I IGTP+ + V +D GS++LWV C C +C S L +L+ YD +S+
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 208
Query: 168 SSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+S V C C CK C Y Y + +S++GY V D + S +
Sbjct: 209 TSDAVGCDDNFCSLYDGPLPGCKPGLQ-CLYSVLYG-DGSSTTGYFVQDFVQYNRISGNF 266
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+ +V+ GCG KQ+G + A DG++G G + S+ S LA +G ++ FS C D
Sbjct: 267 QTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLD 326
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ-- 334
D G +F G + + P+ + Y V ++ +G L +SG +
Sbjct: 327 NVDGGGIF--AIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG 384
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
++DSG + + P E+Y ++ K R+ ++ C++ + P + L
Sbjct: 385 TIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTGNVDDGFPTVTLH 443
Query: 395 FSKNQSFVVRNHIFSF 410
F K+ S V H + F
Sbjct: 444 FDKSISLTVYPHEYLF 459
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 98/328 (29%), Positives = 149/328 (45%), Gaps = 45/328 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ--CAPLSASYYTSLDRNLSEYDPS 164
Y Y + +GTP F V +D GS + +VPC C P + + +DP
Sbjct: 75 YGYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGP---------NHQDAAFDPE 125
Query: 165 SSSSSKNVSCSHPLCKSRS-SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+SS++ +SC+ P C S C C Y Y+ E +SSSG L++D+L L A
Sbjct: 126 ASSTASRISCTSPKCSCGSPRCGCSTQQCTYTRSYA-EQSSSSGILLEDVLALHDGLPGA 184
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD- 282
P +I GC ++TG A DG+ GLG D SV + L KAG+I + FS+CF
Sbjct: 185 P-------IIFGCETRETGEIFRQRA-DGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGM 236
Query: 283 -ENDSGSVFFGDQ---GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS------G 332
E D G++ GD G + Q T L Y V + S + L S G
Sbjct: 237 VEGD-GALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQG 295
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSS---KRISLQGNSW-KYCY-NASSEEMLK 387
+ ++DSG +FT++P+ ++ +K S KR+ + C+ A S + L+
Sbjct: 296 YGTVLDSGTTFTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLE 355
Query: 388 V-----PDMRLIFSKNQSFVVR--NHIF 408
P M + F + S V+ N++F
Sbjct: 356 ALSSVFPSMEVQFDQGTSLVLGPLNYLF 383
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 78/265 (29%), Positives = 125/265 (47%), Gaps = 24/265 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
L+Y I IG+P F V +D GS++LWV C+ C+ +D L Y+P SSS+
Sbjct: 72 LYYARIGIGSPPNDFHVQVDTGSDILWV--NCVGCSNCPKKSDIGVDLQL--YNPKSSST 127
Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
S ++C P C + CK C Y Y + ++++GY V+D + L +
Sbjct: 128 STLITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYG-DGSATAGYFVNDYIQLQRAVGNH 185
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
S S++ GCG KQ+G + A DG++G G + S+ S LA G ++ F+ C D
Sbjct: 186 KTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLD 245
Query: 283 ENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
G +F G+ +T +P Y+ GV+ +G++ L T
Sbjct: 246 SISGGGIFAIGEVVEPKLXNTPVVPNQAHYNVVLNGVK---VGDTALDLPLGLFETSYKR 302
Query: 334 QALVDSGASFTFLPTEIYAEVVVKF 358
A++DSG + +LP IY ++ K
Sbjct: 303 GAIIDSGTTLAYLPESIYLPLMEKI 327
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 89/325 (27%), Positives = 149/325 (45%), Gaps = 23/325 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T I +G+P + V +D GS++LW+ C+ C +C T+L+ LS +D ++SS
Sbjct: 73 LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPT-----KTNLNFRLSLFDMNASS 127
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+SK V C C S S + C Y Y+ E TS G + D+L L +
Sbjct: 128 TSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSD-GKFIRDMLTLEQVTGDLKT 186
Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ V+ GCG Q+G +G +A DGVMG G + SV S LA G + FS C D
Sbjct: 187 GPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNV 246
Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCLTQSGFQALVDS 339
G +F G ++T +P Y+ +G++ S + S + G +VDS
Sbjct: 247 KGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGG--TIVDS 304
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKN 398
G + + P +Y ++ + +++ + + L + C++ S+ P + F +
Sbjct: 305 GTTLAYFPKVLYDSLI---ETILARQPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDS 361
Query: 399 QSFVVRNHIFSFPENEVGDHACFSY 423
V H + F E + CF +
Sbjct: 362 VKLTVYPHDYLFTLEE--ELYCFGW 384
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 78/269 (28%), Positives = 128/269 (47%), Gaps = 27/269 (10%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I+IG+P+ + V +D GS++LWV C +C C S L L++YDP+ S
Sbjct: 84 LYYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSG-----LGIELTQYDPAGSG 138
Query: 168 SSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
++ V C C + S +C S PC + Y + +S++G+ V D + S
Sbjct: 139 TT--VGCDQEFCVANSPNGLPPACPSTSSPCQFRIAYG-DGSSTTGFYVSDSVQYNQVSG 195
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+ + +S+ GCG + G + A DG++G G D S+ S LA A ++ F+ C
Sbjct: 196 NGQTTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHC 255
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA--- 335
D G +F G Q P+ + Y V ++ +G + L S F +
Sbjct: 256 LDTVHGGGIF--AIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSGDS 313
Query: 336 ---LVDSGASFTFLPTEIYAEVVVK-FDK 360
++DSG + +LP E+Y ++ FDK
Sbjct: 314 KGTIIDSGTTLAYLPREVYRTLLTAVFDK 342
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 122/265 (46%), Gaps = 29/265 (10%)
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
YT + +GTP +F V +D GS + ++PC+ C C +A ++ DP S+++
Sbjct: 14 YTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWF----------DPDKSTTA 63
Query: 170 KNVSCSHPLCK-SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
K ++C PLC SC D C Y Y+ E +SS G++++D P S
Sbjct: 64 KKLACGDPLCNCGTPSCTCNNDRCYYSRTYA-ERSSSEGWMIEDTFGF-------PDSDS 115
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
++ GC +TG A DG+MG+G + S L + +I++ FS+CF G
Sbjct: 116 PVRLVFGCENGETGEIYRQMA-DGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGI 174
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG--------NSCLTQSGFQALVDSG 340
+ GD +T + P+ ++ V+ I ++ + G+ ++DSG
Sbjct: 175 LLLGDVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSG 234
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSK 365
+FT+LPT+ + + V K
Sbjct: 235 TTFTYLPTDAFKAMAKAVGDYVEKK 259
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 128/270 (47%), Gaps = 29/270 (10%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I+IG+P + V +D GS++LWV C +C C S L L++YDP+ S
Sbjct: 83 LYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSG-----LGIELTQYDPAGSG 137
Query: 168 SSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
++ V C C + S +C S PC + Y + ++++G+ V D + S
Sbjct: 138 TT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYVTDFVQYNQVSG 194
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ ++ +S+ GCG Q G L + A DG++G G D S+ S LA A ++ F+
Sbjct: 195 NGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAH 253
Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA-- 335
C D G +F G Q P+ Y V ++ +G + L S F +
Sbjct: 254 CLDTVRGGGIF--AIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGD 311
Query: 336 ----LVDSGASFTFLPTEIYAEVVVK-FDK 360
++DSG + +LP E+Y ++ FDK
Sbjct: 312 SKGTIIDSGTTLAYLPREVYRTLLAAVFDK 341
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 128/270 (47%), Gaps = 29/270 (10%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I+IG+P + V +D GS++LWV C +C C S L L++YDP+ S
Sbjct: 83 LYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSG-----LGIELTQYDPAGSG 137
Query: 168 SSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
++ V C C + S +C S PC + Y + ++++G+ V D + S
Sbjct: 138 TT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYVTDFVQYNQVSG 194
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ ++ +S+ GCG Q G L + A DG++G G D S+ S LA A ++ F+
Sbjct: 195 NGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAH 253
Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA-- 335
C D G +F G Q P+ Y V ++ +G + L S F +
Sbjct: 254 CLDTVRGGGIF--AIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGD 311
Query: 336 ----LVDSGASFTFLPTEIYAEVVVK-FDK 360
++DSG + +LP E+Y ++ FDK
Sbjct: 312 SKGTIIDSGTTLAYLPREVYRTLLAAVFDK 341
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/384 (26%), Positives = 157/384 (40%), Gaps = 39/384 (10%)
Query: 78 KLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
+L++ + + +LL FP +G+ F L+YT I +G+P F V +D
Sbjct: 45 QLKARDKARHGRLLQSLGGVIDFPVDGTFDPFV----VGLYYTKIRLGSPPRDFYVQVDT 100
Query: 130 GSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KSRSS 184
GS++LWV C P + + L L+ +DP SS ++ VSCS C S S
Sbjct: 101 GSDVLWVSCASCNGCPQT----SGLQIQLNFFDPGSSVTATPVSCSDQRCSWGIQSSDSG 156
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
C + C Y Y + + +SG+ V D+L + + + V+ GC QTG
Sbjct: 157 CSVQNNLCAYTFQYG-DGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDL 215
Query: 245 LDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-ENDSGSVFFGDQGPATQQST 302
+ A DG+ G G +SV S LA GL FS C EN G + G + +
Sbjct: 216 VKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGENGGGGILV--LGEIVEPNM 273
Query: 303 SFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEV 354
F P+ Y V + S + L T +G ++D+G + +L Y
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333
Query: 355 VVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
V VS + + +GN CY ++ P + L F+ S + + +
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ---CYVIATSVADIFPPVSLNFAGGASMFLNPQDYLIQQ 390
Query: 413 NEVGDHACFSYFTLEYNFTGILIL 436
N VG A + GI IL
Sbjct: 391 NNVGGTAVWCIGFQRIQNQGITIL 414
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/306 (28%), Positives = 141/306 (46%), Gaps = 35/306 (11%)
Query: 72 RQKTRVKLQSNNNSSRNQLL----FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVAL 127
++++ L++++NS + ++L P G+ + L+Y I IGTP + V +
Sbjct: 60 QKRSLAALKAHDNSRQLRILAGVDLPLGGTGR----PEAVGLYYAKIGIGTPARDYYVQV 115
Query: 128 DAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS----- 181
D GS+++WV C QC +C S SL L+ YD S + K VSC C +
Sbjct: 116 DTGSDIMWVNCIQCNECPKKS-----SLGMELTLYDIKESLTGKLVSCDQDFCYAINGGP 170
Query: 182 RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
S C + C Y Y+ + +SS GY V DI+ S +S SVI GC Q+
Sbjct: 171 PSYCIA-NMSCSYTEIYA-DGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQS 228
Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQS 301
G A DG++G G + S+ S LA +G ++ F+ C D + G +F G Q
Sbjct: 229 GDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIF--AIGHIVQPK 286
Query: 302 TSFLPIGEKYDAYFVGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIY 351
+ P+ Y V +++ +G L + G ++DSG + +LP +Y
Sbjct: 287 VNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKG--TIIDSGTTLAYLPEVVY 344
Query: 352 AEVVVK 357
+++ K
Sbjct: 345 DQLLSK 350
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/374 (27%), Positives = 165/374 (44%), Gaps = 52/374 (13%)
Query: 58 SVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIG 117
S EY E L ++D +R V FP G F L+YT I +G
Sbjct: 2 SREYFETLKAHDRRRLAAVVD-------------FPLTGDDDPFVTG----LYYTKIYLG 44
Query: 118 TPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
TP V + V +D GS++ W+ C C C ++ + S+ L+ YDPS SS+ +SC
Sbjct: 45 TPPVGYYVQVDTGSDVTWLNCAPCTSC--VTETQLPSI--KLTTYDPSRSSTDGALSCRD 100
Query: 177 PLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C + SC S C Y Y + +S+ GY + D++ + Q + +S
Sbjct: 101 SNCGAALGSNEVSCTS-AGYCAYSTTYG-DGSSTQGYFIQDVMTFQEIHNNT-QVNGTAS 157
Query: 232 VIIGCGRKQTGSYL-DGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGS 288
V GCG Q+G+ L A DG++G G VS+PS LA G + N F+ C D G+
Sbjct: 158 VYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQGGGT 217
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSCLTQSGFQA--------LVDS 339
+ G ++ + S+ PI + + Y VG+++ + G + T + F ++DS
Sbjct: 218 IVIGS---VSEPNISYTPIVSR-NHYAVGMQNIAVNGRNVTTPASFDTTSTSAGGVIMDS 273
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN- 398
G + +L Y + V VS+ S+ + + A P ++L F
Sbjct: 274 GTTLAYLVDPAYTQFV----NAVSTFESSMFSSHSQCLQLAWCSLQADFPTVKLFFDAGA 329
Query: 399 -QSFVVRNHIFSFP 411
+ RN+++S P
Sbjct: 330 VMNLTPRNYLYSQP 343
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 139/305 (45%), Gaps = 31/305 (10%)
Query: 72 RQKTRVKLQSNNNSSRNQLL----FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVAL 127
++++ L++++NS + ++L P G+ + L+Y I IGTP + V +
Sbjct: 60 QKRSLAALKAHDNSRQLRILAGVDLPLGGTGR----PEAVGLYYAKIGIGTPARDYYVQV 115
Query: 128 DAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS----- 181
D GS+++WV C QC +C S SL L+ YD S + K VSC C +
Sbjct: 116 DTGSDIMWVNCIQCNECPKKS-----SLGMELTLYDIKESLTGKLVSCDQDFCYAINGGP 170
Query: 182 RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 241
S C + C Y Y+ + +SS GY V DI+ S +S SVI GC Q+
Sbjct: 171 PSYCIA-NMSCSYTEIYA-DGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQS 228
Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQS 301
G A DG++G G + S+ S LA +G ++ F+ C D + G +F G Q
Sbjct: 229 GDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIF--AIGHIVQPK 286
Query: 302 TSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAE 353
+ P+ Y V +++ +G L ++DSG + +LP +Y +
Sbjct: 287 VNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQ 346
Query: 354 VVVKF 358
++ K
Sbjct: 347 LLSKI 351
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 90/316 (28%), Positives = 144/316 (45%), Gaps = 39/316 (12%)
Query: 55 KKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWI 114
+ S EY L +D +R + + + FP G F L+YT I
Sbjct: 6 RGMSSEYYRTLREHDQRRLRRILP---------EVVAFPISGDDDTFTTG----LYYTRI 52
Query: 115 DIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+GTP F V +D GS++ WV C C C S ++ +S +DP S+S ++S
Sbjct: 53 YLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRAS-----NVALPISIFDPEKSTSKTSIS 107
Query: 174 CSHPLC--KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF-SKHAPQSSVQS 230
C+ C S S C CPY Y + +S++GYL++D+L S ++ +S +
Sbjct: 108 CTDEECYLASNSKCSFNSMSCPYSTLYG-DGSSTAGYLINDVLSFNQVPSGNSTATSGTA 166
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGS 288
+ GCG QTG++L DG++G G +VS+PS L+K + N F+ C D SG+
Sbjct: 167 RLTFGCGSNQTGTWLT----DGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGT 222
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSCLTQSGFQ------ALVDSGA 341
+ G + + PI K Y V + + + G + T + F ++DSG
Sbjct: 223 LVIGH---IREPGLVYTPIVPKQSHYNVELLNIGVSGTNVTTPTAFDLSNSGGVIMDSGT 279
Query: 342 SFTFLPTEIYAEVVVK 357
+ T+L Y + K
Sbjct: 280 TLTYLVQPAYDQFQAK 295
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 157/384 (40%), Gaps = 39/384 (10%)
Query: 78 KLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
+L++ + + +LL FP +G+ F L+YT + +GTP F V +D
Sbjct: 45 QLKARDEARHGRLLQSLGGVIDFPVDGTFDPFV----VGLYYTKLRLGTPPRDFYVQVDT 100
Query: 130 GSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KSRSS 184
GS++LWV C P + + L L+ +DP SS ++ +SCS C S S
Sbjct: 101 GSDVLWVSCASCNGCPQT----SGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSG 156
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
C + C Y Y + + +SG+ V D+L + + + V+ GC QTG
Sbjct: 157 CSVQNNLCAYTFQYG-DGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDL 215
Query: 245 LDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-ENDSGSVFFGDQGPATQQST 302
+ A DG+ G G +SV S LA G+ FS C EN G + G + +
Sbjct: 216 VKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILV--LGEIVEPNM 273
Query: 303 SFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEV 354
F P+ Y V + S + L T +G ++D+G + +L Y
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333
Query: 355 VVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
V VS + + +GN CY ++ P + L F+ S + + +
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ---CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQ 390
Query: 413 NEVGDHACFSYFTLEYNFTGILIL 436
N VG A + GI IL
Sbjct: 391 NNVGGTAVWCIGFQRIQNQGITIL 414
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 157/384 (40%), Gaps = 39/384 (10%)
Query: 78 KLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
+L++ + + +LL FP +G+ F L+YT + +GTP F V +D
Sbjct: 45 QLKARDEARHGRLLQSLGGVIDFPVDGTFDPFV----VGLYYTKLRLGTPPRDFYVQVDT 100
Query: 130 GSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KSRSS 184
GS++LWV C P + + L L+ +DP SS ++ +SCS C S S
Sbjct: 101 GSDVLWVSCASCNGCPQT----SGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSG 156
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
C + C Y Y + + +SG+ V D+L + + + V+ GC QTG
Sbjct: 157 CSVQNNLCAYTFQYG-DGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDL 215
Query: 245 LDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-ENDSGSVFFGDQGPATQQST 302
+ A DG+ G G +SV S LA G+ FS C EN G + G + +
Sbjct: 216 VKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILV--LGEIVEPNM 273
Query: 303 SFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEV 354
F P+ Y V + S + L T +G ++D+G + +L Y
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333
Query: 355 VVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
V VS + + +GN CY ++ P + L F+ S + + +
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ---CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQ 390
Query: 413 NEVGDHACFSYFTLEYNFTGILIL 436
N VG A + GI IL
Sbjct: 391 NNVGGTAVWCIGFQRIQNQGITIL 414
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 152/369 (41%), Gaps = 39/369 (10%)
Query: 78 KLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
+L++ + + +LL FP +G+ F L+YT + +GTP F V +D
Sbjct: 45 QLKARDEARHGRLLQSLGGVIDFPVDGTFDPFV----VGLYYTKLRLGTPPRDFYVQVDT 100
Query: 130 GSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KSRSS 184
GS++LWV C P + + L L+ +DP SS ++ +SCS C S S
Sbjct: 101 GSDVLWVSCASCNGCPQT----SGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSG 156
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
C + C Y Y + + +SG+ V D+L + + + V+ GC QTG
Sbjct: 157 CSVQNNLCAYTFQYG-DGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDL 215
Query: 245 LDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-ENDSGSVFFGDQGPATQQST 302
+ A DG+ G G +SV S LA G+ FS C EN G + G + +
Sbjct: 216 VKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILV--LGEIVEPNM 273
Query: 303 SFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEV 354
F P+ Y V + S + L T +G ++D+G + +L Y
Sbjct: 274 VFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPF 333
Query: 355 VVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
V VS + + +GN CY ++ P + L F+ S + + +
Sbjct: 334 VEAITNAVSQSVRPVVSKGNQ---CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQ 390
Query: 413 NEVGDHACF 421
N V CF
Sbjct: 391 NNVASALCF 399
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 81/311 (26%), Positives = 137/311 (44%), Gaps = 23/311 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T + +GTP F V +D GS++LWV C C C S L L+ +D +SSS
Sbjct: 80 LYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSG-----LGIQLNYFDTTSSS 134
Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+++ V CSHP+C S+ + C + C Y Y + + +SGY V D + +
Sbjct: 135 TARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQYG-DGSGTSGYYVSDTFYFDAVLGE 193
Query: 223 APQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
+ ++ ++++ GC Q+G A DG+ G G G++SV S L+ G+ FS C
Sbjct: 194 SLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCL 253
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
DSG G + + P+ Y + ++S + L T S
Sbjct: 254 KGEDSGGGIL-VLGEILEPGIVYSPLVPSQPHYNLDLQSIAVSGQLLPIDPAAFATSSNR 312
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++D+G + +L E Y V V S+ + N CY S+ P +
Sbjct: 313 GTIIDTGTTLAYLVEEAYDPFVSAITAAV-SQLATPTINKGNQCYLVSNSVSEVFPPVSF 371
Query: 394 IFSKNQSFVVR 404
F+ + +++
Sbjct: 372 NFAGGATMLLK 382
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/406 (25%), Positives = 172/406 (42%), Gaps = 29/406 (7%)
Query: 48 SVADSWPKKNSVEY---LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGN 104
+VA +P ++E L + D + + RV+ SS + FP EG+ +
Sbjct: 7 AVASGFPATLTLERAFPLNQRVELDELKARDRVRHGRFLQSSVGVVDFPVEGTYDPYR-- 64
Query: 105 QFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPS 164
L++T + +G+P F V +D GS++LWV C P S+ + L+ +DP
Sbjct: 65 --VGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNF----FDPG 118
Query: 165 SSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
SSS++ +SCS C S + C S + C Y Y + + +SGY V D+L+ +
Sbjct: 119 SSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYG-DGSGTSGYYVSDLLNFDAI 177
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ +S +S++ GC QTG A DG+ G G D+SV S ++ G+ FS
Sbjct: 178 VGSSVTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFS 236
Query: 279 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQ 330
C + G ++ + P+ Y + ++S + L T
Sbjct: 237 HCLKGDGG-GGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATS 295
Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
+ +VDSG + +L E Y V + VS L + CY +S P
Sbjct: 296 TNRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-CYLITSSVKGIFPT 354
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGILIL 436
+ L F+ S ++ + +N +GD A + + GI IL
Sbjct: 355 VSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITIL 400
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/328 (27%), Positives = 140/328 (42%), Gaps = 27/328 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T + +G+P F V +D GS++LWV C C C S L L+ +D SSSS
Sbjct: 65 LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSG-----LGIQLNFFDSSSSS 119
Query: 168 SSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
++ V CS P+C S + C S D C Y Y + + +SGY V D L+ +
Sbjct: 120 TAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQYG-DGSGTSGYYVSDTLYFDAILGQ 178
Query: 223 APQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
+ + + ++ GC Q+G A DG+ G G G++SV S L+ G+ FS C
Sbjct: 179 SLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCL 238
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
+ SG G + + P+ Y + + S + L T +
Sbjct: 239 KGDGSGGGIL-VLGEILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPIDPAAFATSNSQ 297
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLKVPDM 391
+VDSG + +L E Y V + +VS I+ +GN CY S+ P
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITSKGNQ---CYLVSTSVSQMFPLA 354
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHA 419
F+ S V++ + P G A
Sbjct: 355 SFNFAGGASMVLKPEDYLIPFGSSGGSA 382
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis sativus]
Length = 641
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 149/334 (44%), Gaps = 42/334 (12%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS + +VPC C QC ++DP S
Sbjct: 82 YYTTRLWI--GTPPQQFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFDPES 129
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ K + C+ C S C Y Y+ E ++SSG L +D++ + S+ PQ
Sbjct: 130 SSTYKPIKCNIDCI-----CDSDGVQCVYERQYA-EMSTSSGVLGEDVISFGNQSELIPQ 183
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC +TG A DG+MGLG GD+S+ L + G I +SFS+C+ D
Sbjct: 184 RAV-----FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMD 237
Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGV-ESYCIGNSCLTQSG-----FQAL 336
G V G P+ T P+ Y Y V + E + G SG + A+
Sbjct: 238 IGGGAMVLGGISPPSDMIFTYSDPVRSPY--YNVDLKEIHVAGKKLPLSSGIFDGRYGAV 295
Query: 337 VDSGASFTFLPTEIYAEVV-VKFDKLVSSKRISLQGNSWK-YCYNASSEEML----KVPD 390
+DSG ++ +LP E ++ D++ S K+I ++K C++ + + K P
Sbjct: 296 LDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPT 355
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
+ ++F Q + + F ++V C F
Sbjct: 356 VDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIF 389
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis sativus]
Length = 641
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 149/334 (44%), Gaps = 42/334 (12%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS + +VPC C QC ++DP S
Sbjct: 82 YYTTRLWI--GTPPQQFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFDPES 129
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ K + C+ C S C Y Y+ E ++SSG L +D++ + S+ PQ
Sbjct: 130 SSTYKPIKCN-----IDCICDSDGVQCVYERQYA-EMSTSSGVLGEDVISFGNQSELIPQ 183
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC +TG A DG+MGLG GD+S+ L + G I +SFS+C+ D
Sbjct: 184 RAV-----FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMD 237
Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGV-ESYCIGNSCLTQSG-----FQAL 336
G V G P+ T P+ Y Y V + E + G SG + A+
Sbjct: 238 IGGGAMVLGGISPPSDMIFTYSDPVRSPY--YNVDLKEIHVAGKKLPLSSGIFDGRYGAV 295
Query: 337 VDSGASFTFLPTEIYAEVV-VKFDKLVSSKRISLQGNSWK-YCYNASSEEML----KVPD 390
+DSG ++ +LP E ++ D++ S K+I ++K C++ + + K P
Sbjct: 296 LDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPT 355
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
+ ++F Q + + F ++V C F
Sbjct: 356 VDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIF 389
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 142/312 (45%), Gaps = 21/312 (6%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T I +G+P + V +D GS++LWV C+ C +C T+L+ +LS +D ++SS
Sbjct: 73 LYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPS-----KTNLNFHLSLFDVNASS 127
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+SK V C C S S + C Y Y+ E TS G + D L L +
Sbjct: 128 TSKKVGCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSE-GNFIRDKLTLEQVTGDLQT 186
Query: 226 SSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ V+ GCG Q+G +A DGVMG G + SV S LA G + FS C D
Sbjct: 187 GPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNV 246
Query: 285 DSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCLTQSGFQALVDS 339
G +F G ++T +P Y+ +G++ + + S + G +VDS
Sbjct: 247 KGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTALDLPPSIMRNGG--TIVDS 304
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDMRLIFSKN 398
G + + P +Y ++ + +++ + + L + C++ S + P + F +
Sbjct: 305 GTTLAYFPKVLYDSLI---ETILARQPVKLHIVEDTFQCFSFSENVDVAFPPVSFEFEDS 361
Query: 399 QSFVVRNHIFSF 410
V H + F
Sbjct: 362 VKLTVYPHDYLF 373
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 87/333 (26%), Positives = 147/333 (44%), Gaps = 38/333 (11%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
T + IGTP+ F + +D+GS + +VPC C QC + ++ + + P SS+
Sbjct: 94 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 153
Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
V C+ +C + + C Y Y+ E +SSSG L +DI+ S+ PQ +V
Sbjct: 154 PVKCN-----VDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV-- 205
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
GC +TG A DG+MGLG G +S+ L + G+I +SFS+C+ D G
Sbjct: 206 ---FGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 261
Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
V G P + P+ Y Y + ++ + L S ++DSG
Sbjct: 262 MVLGGMPAPPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGT 319
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDM 391
++ +LP + + V F V++K SL+ N C+ + + ++ PD+
Sbjct: 320 TYAYLPEQAF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDV 375
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
++F Q + + F ++V C F
Sbjct: 376 DMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVF 408
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 148/343 (43%), Gaps = 48/343 (13%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
T + IGTP F + +D+GS + +VPC C QC + + P SSS
Sbjct: 91 TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCG----------NHQDPRFQPDLSSSYS 140
Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
V C+ +C S K C Y Y+ E +SSSG L +DI+ S+ PQ +V
Sbjct: 141 PVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRESELKPQRAV-- 192
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
GC +TG A DG+MGLG G +S+ L + G+I +SFS+C+ D G
Sbjct: 193 ---FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGA 248
Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
V G P+ + P+ Y Y + ++ + L S ++DSG
Sbjct: 249 MVLGGVPAPSDMVFSHSDPLRSPY--YNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGT 306
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDM 391
++ +LP + + V F V+SK SL+ N C+ + + K+ PD+
Sbjct: 307 TYAYLPEQAF----VAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDV 362
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
++F Q + + F ++V C F + T +L
Sbjct: 363 DMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLL 405
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 87/333 (26%), Positives = 147/333 (44%), Gaps = 38/333 (11%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
T + IGTP+ F + +D+GS + +VPC C QC + ++ + + P SS+
Sbjct: 93 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 152
Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
V C+ +C + + C Y Y+ E +SSSG L +DI+ S+ PQ +V
Sbjct: 153 PVKCN-----VDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV-- 204
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
GC +TG A DG+MGLG G +S+ L + G+I +SFS+C+ D G
Sbjct: 205 ---FGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 260
Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
V G P + P+ Y Y + ++ + L S ++DSG
Sbjct: 261 MVLGGMPAPPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGT 318
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDM 391
++ +LP + + V F V++K SL+ N C+ + + ++ PD+
Sbjct: 319 TYAYLPEQAF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDV 374
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
++F Q + + F ++V C F
Sbjct: 375 DMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVF 407
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/261 (32%), Positives = 124/261 (47%), Gaps = 37/261 (14%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCI--QCAPLSASYYTSLDRNLSEYDPS 164
Y Y + +GTP F V +D GS + +VPC C P + +DP+
Sbjct: 59 YGYFYATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGP---------HHKDAAFDPA 109
Query: 165 SSSSSKNVSCSHPLCK-SRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
SSSSS + C C R C S K C Y Y+ E +SS+G LV D L L +
Sbjct: 110 SSSSSAVIGCDSDKCICGRPPCGCSEKRECTYQRTYA-EQSSSAGLLVSDQLQLRDGAVE 168
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
V+ GC K+TG + A DG++GLG +VS+ + LA +G+I + F++CF
Sbjct: 169 ---------VVFGCETKETGEIYNQEA-DGILGLGNSEVSLVNQLAGSGVIDDVFALCFG 218
Query: 283 --ENDSGSVFFGDQGPA----TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 330
E D G++ GD A Q T+ L Y V +E+ +G L +
Sbjct: 219 SVEGD-GALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYE 277
Query: 331 SGFQALVDSGASFTFLPTEIY 351
G+ ++DSG +FT+LP+E +
Sbjct: 278 EGYGTVLDSGTTFTYLPSEAF 298
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis vinifera]
Length = 499
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/406 (25%), Positives = 172/406 (42%), Gaps = 29/406 (7%)
Query: 48 SVADSWPKKNSVEY---LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGN 104
+VA +P ++E L + D + + RV+ SS + FP EG+ +
Sbjct: 22 AVASGFPATLTLERAFPLNQRVELDELKARDRVRHGRFLQSSVGVVDFPVEGTYDPYR-- 79
Query: 105 QFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPS 164
L++T + +G+P F V +D GS++LWV C P S+ + L+ +DP
Sbjct: 80 --VGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNF----FDPG 133
Query: 165 SSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
SSS++ +SCS C S + C S + C Y Y + + +SGY V D+L+ +
Sbjct: 134 SSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYG-DGSGTSGYYVSDLLNFDAI 192
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ +S +S++ GC QTG A DG+ G G D+SV S ++ G+ FS
Sbjct: 193 VGSSVTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFS 251
Query: 279 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQ 330
C + G ++ + P+ Y + ++S + L T
Sbjct: 252 HCLKGDGG-GGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATS 310
Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
+ +VDSG + +L E Y V + VS L + CY +S P
Sbjct: 311 TNRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-CYLITSSVKGIFPT 369
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGILIL 436
+ L F+ S ++ + +N +GD A + + GI IL
Sbjct: 370 VSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITIL 415
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 97/366 (26%), Positives = 161/366 (43%), Gaps = 50/366 (13%)
Query: 47 VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
+ V + N+ +E+LL + + +L S+ Q P + + G+
Sbjct: 75 IQVLNQEKAANAPSNMEILLQDRHRVDSIHARLSSHGVFQEKQATLPVQSGASIGSGD-- 132
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
+ + +GTP F + D GS+L W QC P + + Y + L DP+ S
Sbjct: 133 ---YAVTVGLGTPKKEFTLIFDTGSDLTWT-----QCEPCAKTCYKQKEPRL---DPTKS 181
Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
+S KN+SCS CK SC S C Y Y + + S G+ + L L+S
Sbjct: 182 TSYKNISCSSAFCKLLDTEGGESCSS--PTCLYQVQYG-DGSYSIGFFATETLTLSS--- 235
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
S+V + + GCG++ +G + GAA G++GLG +S+PS A+ + FS C
Sbjct: 236 ----SNVFKNFLFGCGQQNSGLF-RGAA--GLLGLGRTKLSLPSQTAQK--YKKLFSYCL 286
Query: 282 DENDS--GSVFFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIGNSCLT 329
+ S G + FG Q ++ F P+ E + + VG I S +
Sbjct: 287 PASSSSKGYLSFGGQ---VSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFS 343
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
SG ++DSG T LP+ Y+ + F KL++ + + + CY+ S E +K+P
Sbjct: 344 TSG--TVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIKIP 401
Query: 390 DMRLIF 395
+ + F
Sbjct: 402 KVGVSF 407
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/333 (27%), Positives = 146/333 (43%), Gaps = 48/333 (14%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
T + IGTP F + +D+GS + +VPC C QC + + P SSS
Sbjct: 90 TRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCG----------NHQDPRFQPDLSSSYS 139
Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
V C+ +C S K C Y Y+ E +SSSG L +DI+ S+ PQ +
Sbjct: 140 PVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRESELKPQHA--- 190
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
I GC +TG A DG+MGLG G +S+ L + G+I +SFS+C+ D G
Sbjct: 191 --IFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGA 247
Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
V G P ++ P+ Y Y + ++ + L S ++DSG
Sbjct: 248 MVLGGMLAPPDMIFSNSDPLRSPY--YNIELKEIHVAGKALRVESRIFNSKHGTVLDSGT 305
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKY---CYNASSEEMLKV----PDM 391
++ +LP + + V F + V+SK SL +G Y C+ + + K+ PD+
Sbjct: 306 TYAYLPEQAF----VAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDV 361
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
++F Q + + F ++V C F
Sbjct: 362 DMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVF 394
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/335 (26%), Positives = 139/335 (41%), Gaps = 38/335 (11%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T I +GTP + V +D GS++LWV C C +C S L +L+ YDP +SS
Sbjct: 86 LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSG-----LGLDLTFYDPKASS 140
Query: 168 SSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
S VSC C + K + PC Y Y + +S++G+ + D L +
Sbjct: 141 SGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYG-DGSSTTGFFITDALQFDQVTGDG 199
Query: 224 PQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+++ GCG +Q G + A DG++G G + S+ S LA AG + F+ C D
Sbjct: 200 QTQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLD 259
Query: 283 ENDSGS--------------VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
G VFF G + I Y V ++S +G + L
Sbjct: 260 TIKGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTL 319
Query: 329 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-ISLQGNSWKYCYN 379
T ++DSG + T+LP ++ +V+ D + S R I+ C+
Sbjct: 320 QLPAHVFETGEKKGTIIDSGTTLTYLPELVFKQVM---DVVFSKHRDIAFHNLQDFLCFQ 376
Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 414
S P + F + + V H + FP
Sbjct: 377 YSGSVDDGFPTITFHFEDDLALHVYPHEYFFPNGN 411
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 95/355 (26%), Positives = 158/355 (44%), Gaps = 34/355 (9%)
Query: 106 FYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
F L++T + +G+P F V +D GS++LW+ CI C+ + + + L L +D +
Sbjct: 79 FVGLYFTKVKLGSPAKEFYVQIDTGSDILWI--NCITCS--NCPHSSGLGIELDFFDTAG 134
Query: 166 SSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS-F 219
SS++ VSC P+C + S C S + C Y Y + + ++GY V D ++ +
Sbjct: 135 SSTAALVSCGDPICSYAVQTATSECSSQANQCSYTFQYG-DGSGTTGYYVSDTMYFDTVL 193
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ ++ S++I GC Q+G A DG+ G G G +SV S L+ G+ FS
Sbjct: 194 LGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFS 253
Query: 279 ICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------- 328
C EN G + G+ + S + P+ Y + ++S + L
Sbjct: 254 HCLKGGENGGGVLVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFA 310
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNSWKYCYNASSEEML 386
T + +VDSG + +L E Y V VS SK I +GN CY S+
Sbjct: 311 TTNNQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ---CYLVSNSVGD 367
Query: 387 KVPDMRLIFSKNQSFVV--RNHIFSFPENEVGDHACFSYFTLEYNFT--GILILQ 437
P + L F S V+ +++ + + C + +E FT G L+L+
Sbjct: 368 IFPQVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLK 422
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 88/328 (26%), Positives = 143/328 (43%), Gaps = 32/328 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++ I IGTP+ + V +D GS++LWV C C +C S L +L+ YD +S+
Sbjct: 77 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 131
Query: 168 SSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+S V C C CK C Y Y + +S++GY V D + S +
Sbjct: 132 TSDAVGCDDNFCSLYDGPLPGCKPGLQ-CLYSVLYG-DGSSTTGYFVQDFVQYNRISGNF 189
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+ +V+ GCG KQ+G + A DG++G G + S+ S LA +G ++ FS C D
Sbjct: 190 QTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLD 249
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGE--------KYDAYFVGVESYCIGNSCLT----- 329
D G +F G + FL + Y V ++ +G L
Sbjct: 250 NVDGGGIFA--IGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDA 307
Query: 330 -QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
+SG + ++DSG + + P E+Y ++ K R+ ++ C++ +
Sbjct: 308 FESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTGNVDD 366
Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENE 414
P + L F K+ S V H + F E
Sbjct: 367 GFPTVTLHFDKSISLTVYPHEYLFQVKE 394
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 159/355 (44%), Gaps = 34/355 (9%)
Query: 106 FYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
F L++T + +G+P F V +D GS++LW+ CI C+ + + + L L +D +
Sbjct: 79 FVGLYFTKVKLGSPAKDFYVQIDTGSDILWI--NCITCS--NCPHSSGLGIELDFFDTAG 134
Query: 166 SSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS-F 219
SS++ VSC+ P+C + S C S + C Y Y + + ++GY V D ++ +
Sbjct: 135 SSTAALVSCADPICSYAVQTATSGCSSQANQCSYTFQYG-DGSGTTGYYVSDTMYFDTVL 193
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ ++ S+++ GC Q+G A DG+ G G G +SV S L+ G+ FS
Sbjct: 194 LGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFS 253
Query: 279 ICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------- 328
C EN G + G+ + S + P+ Y + ++S + L
Sbjct: 254 HCLKGGENGGGVLVLGE---ILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFA 310
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNSWKYCYNASSEEML 386
T + +VDSG + +L E Y V VS SK I +GN CY S+
Sbjct: 311 TTNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ---CYLVSNSVGD 367
Query: 387 KVPDMRLIFSKNQSFVV--RNHIFSFPENEVGDHACFSYFTLEYNFT--GILILQ 437
P + L F S V+ +++ + + C + +E FT G L+L+
Sbjct: 368 IFPQVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLK 422
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 127/272 (46%), Gaps = 31/272 (11%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I+IG+P + V +D GS++LWV C C S L L++YDP+ S
Sbjct: 84 LYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSG-----LGIELTQYDPAGSG 138
Query: 168 SSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
++ V C C + S +C S PC + Y + +S++G+ V D + S
Sbjct: 139 TT--VGCEQEFCVANSAASGVPPACPSAASPCQFRITYG-DGSSTTGFYVTDFVQYNQVS 195
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ + S+ GCG + G + A DG++G G D S+ S LA A ++ F+
Sbjct: 196 GNGQTTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAH 255
Query: 280 CFDENDSGSVF-FGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 335
C D G +F G+ P ++T +P Y+ G+ +G + L S F +
Sbjct: 256 CLDTVRGGGIFAIGNVVQPPIVKTTPLVPNATHYNVNLQGIS---VGGATLQLPTSTFDS 312
Query: 336 ------LVDSGASFTFLPTEIYAEVVVK-FDK 360
++DSG + +LP E+Y ++ FDK
Sbjct: 313 GDSKGTIIDSGTTLAYLPREVYRTLLTAVFDK 344
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 99/387 (25%), Positives = 164/387 (42%), Gaps = 60/387 (15%)
Query: 74 KTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWL----------HYTWIDIGTPNVSF 123
K + +Q NN + N + E S+ F GN L ++ + +GTP
Sbjct: 126 KESITIQQQNNLA-NAFVASLESSKGEFSGNIMATLESGASLGTGEYFLDMFVGTPPKHV 184
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
+ LD GS+L W IQC P Y ++N S Y P SS+ +N+SC P C+ S
Sbjct: 185 WLILDTGSDLSW-----IQCDPC----YDCFEQNGSHYYPKDSSTYRNISCYDPRCQLVS 235
Query: 184 S------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
S CK+ CPY DY+ ++ + + ++ + V+ GCG
Sbjct: 236 SSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDVMFGCG 295
Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----NDSGSVFFG 292
G + GA+ G++GLG G +S PS + + +SFS C + + S + FG
Sbjct: 296 HWNKG-FFYGAS--GLLGLGRGPISFPSQIQ--SIYGHSFSYCLTDLFSNTSVSSKLIFG 350
Query: 293 DQGPATQQS----TSFLPIGEKYDA--YFVGVESYCIGNSCLTQS--------------- 331
+ T+ L E D Y++ ++S +G L S
Sbjct: 351 EDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAADA 410
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM-LKVPD 390
G ++DSG++ TF P Y + F+K + ++I+ CYN S M +++PD
Sbjct: 411 GGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSGAMMQVELPD 470
Query: 391 MRLIFSKNQ--SFVVRNHIFSFPENEV 415
+ F+ +F N+ + + +EV
Sbjct: 471 FGIHFADGGVWNFPAENYFYQYEPDEV 497
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 153/370 (41%), Gaps = 38/370 (10%)
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
+E L D R R L + FP EGS F L++T + +G+P
Sbjct: 47 VEHLRERDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFM----VGLYFTRVKLGSPPK 102
Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC- 179
+ V +D GS++LWV C C C S L+ L ++P +SS+S + CS C
Sbjct: 103 EYFVQIDTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSSTSSKIPCSDDRCT 157
Query: 180 ----KSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
S + C++ + PC Y Y + + +SGY V D ++ S + ++ +S++
Sbjct: 158 AALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVSDTMYFDSVMGNEQTANSSASIVF 216
Query: 235 GCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
GC Q+G A DG+ G G +SV S L G+ FS C +D+G
Sbjct: 217 GCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGIL-V 275
Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSGASFTF 345
G + + P+ Y + +ES I +S T S Q +VDSG + +
Sbjct: 276 LGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAY 335
Query: 346 LPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
L Y V VS SL +GN C+ SS P + L F + V
Sbjct: 336 LADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVTSSSVDSSFPTVSLYFMGGVAMTV 392
Query: 404 RNHIFSFPEN 413
+ PEN
Sbjct: 393 K------PEN 396
>gi|6562288|emb|CAB62658.1| putative protein [Arabidopsis thaliana]
Length = 426
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 67/234 (28%), Positives = 118/234 (50%), Gaps = 15/234 (6%)
Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
+++ C S CPY Y + + S+G LV+D++H+++ A + I G Q
Sbjct: 124 TKARCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEARDAR------ITFGESQ 177
Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
G + + A +G+MGL + D++VP++L KAG+ +SFS+CF N G++ FGD+G + Q
Sbjct: 178 LGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQL 236
Query: 301 STSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
T P+ F V + + +G + + F A DSG + T+L Y + F
Sbjct: 237 ET---PLSGTISPMFYDVSITKFKVGKVTV-DTEFTATFDSGTAVTWLIEPYYTALTTNF 292
Query: 359 DKLVSSKRISLQGNS-WKYCY-NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
V +R+S +S +++CY S+ + K+P + ++ V + I F
Sbjct: 293 HLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPILVF 346
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 152/366 (41%), Gaps = 42/366 (11%)
Query: 73 QKTRVKLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
++ R + + + SR +LL FP EGS + L++T + +G P F
Sbjct: 48 EELRRRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYM----VGLYFTRVKLGNPAKEFF 103
Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--- 181
V +D GS++LWV C P S + L+ L ++P SSS++ ++CS C +
Sbjct: 104 VQIDTGSDILWVTCSPCTGCPTS----SGLNIQLESFNPDSSSTASRITCSDDRCTAGFQ 159
Query: 182 --RSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
+ C+ S PC Y Y + + +SGY V D + + + ++ +S++ GC
Sbjct: 160 TGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGC 218
Query: 237 GRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
Q+G A DG+ G G +SV S L G+ FS C +D+G G
Sbjct: 219 SNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGIL-VLG 277
Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSGASFTFLP 347
+ + P+ Y + +ES I +S T S Q +VDSG + +L
Sbjct: 278 EIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLA 337
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHI 407
Y V VS SL + C+ SS P + L F + V+
Sbjct: 338 DGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFITSSSVDSSFPTVTLYFMGGVAMSVK--- 393
Query: 408 FSFPEN 413
PEN
Sbjct: 394 ---PEN 396
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 152/366 (41%), Gaps = 42/366 (11%)
Query: 73 QKTRVKLQSNNNSSRNQLL--------FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
++ R + + + SR +LL FP EGS + L++T + +G P F
Sbjct: 50 EELRRRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYM----VGLYFTRVKLGNPAKEFF 105
Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--- 181
V +D GS++LWV C P S + L+ L ++P SSS++ ++CS C +
Sbjct: 106 VQIDTGSDILWVTCSPCTGCPTS----SGLNIQLESFNPDSSSTASRITCSDDRCTAGFQ 161
Query: 182 --RSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
+ C+ S PC Y Y + + +SGY V D + + + ++ +S++ GC
Sbjct: 162 TGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGC 220
Query: 237 GRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
Q+G A DG+ G G +SV S L G+ FS C +D+G G
Sbjct: 221 SNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGIL-VLG 279
Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSGASFTFLP 347
+ + P+ Y + +ES I +S T S Q +VDSG + +L
Sbjct: 280 EIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLA 339
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHI 407
Y V VS SL + C+ SS P + L F + V+
Sbjct: 340 DGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFITSSSVDSSFPTVTLYFMGGVAMSVK--- 395
Query: 408 FSFPEN 413
PEN
Sbjct: 396 ---PEN 398
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 97/345 (28%), Positives = 151/345 (43%), Gaps = 42/345 (12%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y + T I IGTP +F + +D GS L +VPC C QC D N + P
Sbjct: 89 YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCG-------KHQDPN---FQPDW 138
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ + + CS +C S C Y Y+ E +SSSG L +DI+ S+ PQ
Sbjct: 139 SSTYQPLKCS-----MECTCDSEMMHCVYDRQYA-EMSSSSGVLGEDIVSFGKQSELKPQ 192
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC +TG A DG+MGLG GD+S+ L + G+I NSFS+C+ D
Sbjct: 193 RTV-----FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMD 246
Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
G V G PA T P Y Y + ++ I N + + +
Sbjct: 247 VGGGAMVLGGISPPAGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPINPMVFDGKYGTI 304
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV----P 389
+DSG ++ +LP + K ++S ++ +QG Y C++ ++ ++ P
Sbjct: 305 LDSGTTYAYLPEPAFKAFKDAIMKELNSLKL-IQGPDRNYNDICFSGVGSDVSQLSKTFP 363
Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
+ L+FS + + F ++ C F E + T +L
Sbjct: 364 AVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLL 408
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 97/345 (28%), Positives = 151/345 (43%), Gaps = 42/345 (12%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y + T I IGTP +F + +D GS L +VPC C QC D N + P
Sbjct: 89 YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCG-------KHQDPN---FQPDW 138
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ + + CS +C S C Y Y+ E +SSSG L +DI+ S+ PQ
Sbjct: 139 SSTYQPLKCS-----MECTCDSEMMHCVYDRQYA-EMSSSSGVLGEDIVSFGKQSELKPQ 192
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC +TG A DG+MGLG GD+S+ L + G+I NSFS+C+ D
Sbjct: 193 RTV-----FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMD 246
Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
G V G PA T P Y Y + ++ I N + + +
Sbjct: 247 VGGGAMVLGGISPPAGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPINPMVFDGKYGTI 304
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV----P 389
+DSG ++ +LP + K ++S ++ +QG Y C++ ++ ++ P
Sbjct: 305 LDSGTTYAYLPEPAFKAFKDAIMKELNSLKL-IQGPDRNYNDICFSGVGSDVSQLSKTFP 363
Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
+ L+FS + + F ++ C F E + T +L
Sbjct: 364 AVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLL 408
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 153/370 (41%), Gaps = 38/370 (10%)
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
+E L D R R L + FP EGS F L++T + +G+P
Sbjct: 47 VEHLRERDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFM----VGLYFTRVKLGSPPK 102
Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC- 179
+ V +D GS++LWV C C C S L+ L ++P +SS+S + CS C
Sbjct: 103 EYFVQIDTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSSTSSKIPCSDDRCT 157
Query: 180 ----KSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
S + C++ + PC Y Y + + +SGY V D ++ + + ++ +S++
Sbjct: 158 AALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVSDTMYFDTVMGNEQTANSSASIVF 216
Query: 235 GCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
GC Q+G A DG+ G G +SV S L G+ FS C +D+G
Sbjct: 217 GCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGIL-V 275
Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSGASFTF 345
G + + P+ Y + +ES I +S T S Q +VDSG + +
Sbjct: 276 LGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAY 335
Query: 346 LPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
L Y V VS SL +GN C+ SS P + L F + V
Sbjct: 336 LADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVTSSSVDSSFPTVSLYFMGGVAMTV 392
Query: 404 RNHIFSFPEN 413
+ PEN
Sbjct: 393 K------PEN 396
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 40/309 (12%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y + Y+ +GTP +D GSN++W+ CQ C C ++ ++PS
Sbjct: 89 YLISYS---VGTPPFKVYGFMDTGSNIVWLQCQPCNTC----------FNQTSPIFNPSK 135
Query: 166 SSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
SSS KN+ C+ CK + SC + D C Y Y D S G L +D L L S S
Sbjct: 136 SSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGG-DAKSQGDLSNDSLTLDSTSG 194
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
S + +++IGCG D + GV+G+G G +S+ + + + + FS C
Sbjct: 195 ---SSVLFPNIVIGCGHINV--LQDNSQSSGVVGMGRGPMSLIKQVGSSS-VGSKFSYCL 248
Query: 282 -----DENDSGSVFFGDQGPATQQ---STSFLPIGEKYDAYFVGVESYCIGNSCL----- 328
D N S + FG+ + + ST + + + + YF+ +E++ +GN+ +
Sbjct: 249 IPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGER 308
Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
S L+DSG T LP +++V + V RI + CYN + ++ L
Sbjct: 309 SNASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTGKQ-LN 367
Query: 388 VPDMRLIFS 396
VPD+ F+
Sbjct: 368 VPDITAHFN 376
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 91/329 (27%), Positives = 144/329 (43%), Gaps = 23/329 (6%)
Query: 92 FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYY 151
FP +G+ F L+YT + +GTP F V +D GS++LWV C P +
Sbjct: 70 FPVDGASDPFL----VGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKT---- 121
Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGY 208
+ L LS +DP SSS+ VSCS C S +S P C Y Y + + +SG+
Sbjct: 122 SELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYG-DGSGTSGF 180
Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLL 267
+ D + + + + + GC QTG A DG+ GLG G +SV S L
Sbjct: 181 YISDFMSFDTVITSTLAINSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQL 240
Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
A GL FS C + SG G + T + P+ Y V ++S +
Sbjct: 241 AVQGLAPRVFSHCLKGDKSGGGIM-VLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQI 299
Query: 328 L--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
L +G ++D+G + +LP E Y+ + VS + S++ C+
Sbjct: 300 LPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPITYESYQ-CFE 358
Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
++ ++ P++ L F+ S V+R H +
Sbjct: 359 ITAGDVDVFPEVSLSFAGGASMVLRPHAY 387
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 102/393 (25%), Positives = 169/393 (43%), Gaps = 68/393 (17%)
Query: 13 CILLDGSDAVS--FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDW 70
C + S A+S FS +L+HR D K + P +N ++ +
Sbjct: 15 CFIASFSHALSNGFSVELIHR--DSPKSPYYK-----------PTENKYQHF----VDAA 57
Query: 71 KRQKTRVK--LQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALD 128
+R R + ++ S+ + P G Y + Y+ +GTP D
Sbjct: 58 RRSINRANHFFKDSDTSTPESTVIPDRGG---------YLMTYS---VGTPPTKIYGIAD 105
Query: 129 AGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCK 186
GS+++W+ C+ C QC ++ ++PS SSS KN+ CS LC S R +
Sbjct: 106 TGSDIVWLQCEPCEQC----------YNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDTSC 155
Query: 187 SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD 246
S ++ C Y Y + + S G L D L L S S +P S + ++IGCG G++
Sbjct: 156 SDQNSCQYKISYG-DSSHSQGDLSVDTLSLESTSG-SPVSFPK--IVIGCGTDNAGTF-- 209
Query: 247 GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------DENDSGSVFFGDQGPATQQ 300
G A G++GLG G VS+ + L + I FS C + N S + FGD +
Sbjct: 210 GGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGD 267
Query: 301 STSFLPIGEKYDA-YFVGVESYCIGNSCLTQSG--------FQALVDSGASFTFLPTEIY 351
P+ +K YF+ ++++ +GN + G ++DSG + T +P+++Y
Sbjct: 268 GVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVY 327
Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
+ LV R+ + CY+ S E
Sbjct: 328 TNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNE 360
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 82/298 (27%), Positives = 139/298 (46%), Gaps = 38/298 (12%)
Query: 102 FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
+ +Y + Y+ IGTP +D GS+ +W QC C P L++ +
Sbjct: 85 YAGSYYVMSYS---IGTPPFQLYGVVDTGSDGIWF--QCKPCKPC-------LNQTSPIF 132
Query: 162 DPSSSSSSKNVSCSHPLCK--SRSSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
+PS SS+ KN+ CS P+CK ++ C S K C Y Y + + S G + D L L S
Sbjct: 133 NPSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITY-LDRSGSQGDISKDTLTLNS 191
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ +P S + ++IGCG K + + +G A G++G G G+ S+ S L + I FS
Sbjct: 192 -NDGSPISFPK--IVIGCGHKNSLT-TEGLA-SGIIGFGRGNFSIVSQLGSS--IGGKFS 244
Query: 279 ICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGN------ 325
C N S ++FGD + P+ + + YF +E++ +G+
Sbjct: 245 YCLASLFSKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLK 304
Query: 326 --SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
S + + A++DSG++ T LP ++Y+++ +V KR+ CY +
Sbjct: 305 DSSLIPDNEGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTT 362
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/319 (26%), Positives = 138/319 (43%), Gaps = 35/319 (10%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I +GTP F D GS+L+WV + C C+ + +DP SS+ + +
Sbjct: 59 ISVGTPGKRFRAIADTGSDLVWVQSEPCTGCS------------GGTIFDPRQSSTFREM 106
Query: 173 SCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
CS LC SC+ C Y +Y + +T G D + L + S S S
Sbjct: 107 DCSSQLCAELPGSCEPGSSTCSYSYEYGSGETE--GEFARDTISLGTTSDG---SQKFPS 161
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
+GCG +G DG DG++GLG G VS+ S L+ A I + FS C +++S
Sbjct: 162 FAVGCGMVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--IDSKFSYCLVDINSQSESS 215
Query: 288 SVFFGDQGP---ATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQALVDSGASF 343
+ FG QST P + Y Y++ V + + G ++DSG +
Sbjct: 216 PLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPG-TTIIDSGTTL 274
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
T++P+ +Y V+ + + +V+ R+ CY+ SS K P + + +
Sbjct: 275 TYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPP 334
Query: 404 RNHIFSFPENEVGDHACFS 422
++ F ++ GD C +
Sbjct: 335 SSNYF-LVVDDSGDTVCLA 352
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/319 (26%), Positives = 138/319 (43%), Gaps = 35/319 (10%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I +GTP F D GS+L+WV + C C+ + +DP SS+ + +
Sbjct: 59 ISVGTPGKRFRAIADTGSDLVWVQSEPCTGCS------------GGTIFDPRQSSTFREM 106
Query: 173 SCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
CS LC SC+ C Y +Y + +T G D + L + S S S
Sbjct: 107 DCSSQLCTELPGSCEPGSSACSYSYEYGSGETE--GEFARDTISLGTTSGG---SQKFPS 161
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
+GCG +G DG DG++GLG G VS+ S L+ A I + FS C +++S
Sbjct: 162 FAVGCGMVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--IDSKFSYCLVDINSQSESS 215
Query: 288 SVFFGDQGP---ATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQALVDSGASF 343
+ FG QST P + Y Y++ V + + G ++DSG +
Sbjct: 216 PLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPG-TTIIDSGTTL 274
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
T++P+ +Y V+ + + +V+ R+ CY+ SS K P + + +
Sbjct: 275 TYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPP 334
Query: 404 RNHIFSFPENEVGDHACFS 422
++ F ++ GD C +
Sbjct: 335 SSNYF-LVVDDSGDTVCLA 352
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/333 (26%), Positives = 144/333 (43%), Gaps = 48/333 (14%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
T + IGTP+ F + +D+GS + +VPC C QC + + P SS+
Sbjct: 93 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCG----------NHQDPRFQPDLSSTYS 142
Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
V C+ +C + + C Y Y+ E +SSSG L +DI+ S+ PQ +V
Sbjct: 143 PVKCN-----VDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV-- 194
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
GC +TG A DG+MGLG G +S+ L + G+I +SFS+C+ D G
Sbjct: 195 ---FGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 250
Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
V G P + P+ Y Y + ++ + L S ++DSG
Sbjct: 251 MVLGGMPAPPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGT 308
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDM 391
++ +LP + + V F V++K SL+ N C+ + + ++ PD+
Sbjct: 309 TYAYLPEQAF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDV 364
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
++F Q + + F ++V C F
Sbjct: 365 DMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVF 397
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 137/310 (44%), Gaps = 36/310 (11%)
Query: 101 FFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
+G+ + + L+Y ++IG P + + +D+GS+L W+ C C C + Y
Sbjct: 54 LYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY------ 107
Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS--------RSSCKSLKDPCPYIADYSTEDTSSSGYL 209
+ SK V C H LC S + C+S + C Y+ Y+ + SS+G L
Sbjct: 108 -------RPTKSKLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYA-DQGSSTGVL 159
Query: 210 VDDILHLASFSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLL 267
V+D SF+ SV + SV GCG Q D ++P DGV+GLG G VS+ S L
Sbjct: 160 VND-----SFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQL 214
Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGN 325
+ G+ +N C G +FFGD Q++T + P+ + Y G S G+
Sbjct: 215 KQRGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGD 273
Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
L + + DSG+SFT+ + Y +V +S S C+ E
Sbjct: 274 RSLGVRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPF 332
Query: 386 LKVPDMRLIF 395
V D+R F
Sbjct: 333 KSVLDVRKEF 342
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 138/315 (43%), Gaps = 25/315 (7%)
Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
N F L++T + +G P F V +D GS++LWV C P S + L L+ +D
Sbjct: 78 NPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDS----SGLGIELNLFDT 133
Query: 164 SSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
+ SSS++ + C+ P+C + S+ C + D C Y Y + + +SG+ V D +H
Sbjct: 134 TKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYR-DRSGTSGFYVTDSMHFDIL 192
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ ++ ++++ GC Q G A DG+ G G G+ SV S L+ G+ FS
Sbjct: 193 LGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFS 252
Query: 279 ICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC--------L 328
C EN G + G+ + S + P+ Y + ++S + +
Sbjct: 253 HCLKGGENGGGILVLGE---ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPI 309
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
+ +G + ++DSG + +L E+Y +V VS + C+ S
Sbjct: 310 SNAG-ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADIF 367
Query: 389 PDMRLIFSKNQSFVV 403
P +R F S VV
Sbjct: 368 PVLRFNFEGIASMVV 382
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 138/315 (43%), Gaps = 25/315 (7%)
Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
N F L++T + +G P F V +D GS++LWV C P S + L L+ +D
Sbjct: 78 NPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDS----SGLGIELNLFDT 133
Query: 164 SSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
+ SSS++ + C+ P+C + S+ C + D C Y Y + + +SG+ V D +H
Sbjct: 134 TKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYR-DRSGTSGFYVTDSMHFDIL 192
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ ++ ++++ GC Q G A DG+ G G G+ SV S L+ G+ FS
Sbjct: 193 LGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFS 252
Query: 279 ICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC--------L 328
C EN G + G+ + S + P+ Y + ++S + +
Sbjct: 253 HCLKGGENGGGILVLGE---ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPI 309
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
+ +G + ++DSG + +L E+Y +V VS + C+ S
Sbjct: 310 SNAG-ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADIF 367
Query: 389 PDMRLIFSKNQSFVV 403
P +R F S VV
Sbjct: 368 PVLRFNFEGIASMVV 382
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 114/447 (25%), Positives = 182/447 (40%), Gaps = 80/447 (17%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNN 83
FS L+HR E +S N S+ S KN+V + R K R++L N+
Sbjct: 29 FSINLIHR------ESPLSPFYNPSLTPSERIKNTV-------LRSFARSKRRLRLSQND 75
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQ 142
+ S + P E + +FY IGTP V D GS+L+WV C C +
Sbjct: 76 DRSPGTITIPDEPITEYLM--RFY--------IGTPPVERFAIADTGSDLIWVQCAPCEK 125
Query: 143 CAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC-SHP---LCKSRSSCKSLKDPCPYIADY 198
C P +N +DP SS+ K V C S P L S+ +C C Y Y
Sbjct: 126 CVP----------QNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIY 175
Query: 199 STEDTSSSGYLVDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
LV IL S + + ++++ + GC + + G++GLG
Sbjct: 176 GDHT------LVSGILGFESINFGSKNNAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLG 229
Query: 258 LGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQ----QSTSFL--PIG 308
+G +S+ S L I FS CF N + + FG+ Q ST + IG
Sbjct: 230 VGPLSLISQLGYQ--IGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIG 287
Query: 309 EKYDAYFVGVESYCIGNSCLTQSGFQA----LVDSGASFTFLPTEIYAEVVVKFDKLVSS 364
Y Y++ +E IGN + S Q L+DSG SFT L Y + V ++
Sbjct: 288 PSY--YYLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGV 345
Query: 365 KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEV--------- 415
+ + + + +C+ + + PD+ +F+ + V +++F +N +
Sbjct: 346 EAVKIPPLVYNFCFENKGKRK-RFPDVVFLFTGAKVRVDASNLFEAEDNNLLCMVALPTS 404
Query: 416 -------GDHACFSYFTLEYNFTGILI 435
G+HA Y +EY+ G ++
Sbjct: 405 DEDDSIFGNHAQIGY-QVEYDLQGGMV 430
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 136/309 (44%), Gaps = 35/309 (11%)
Query: 101 FFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
+G+ + + L+Y ++IG P + + +D+GS+L W+ C C C + Y
Sbjct: 47 LYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY------ 100
Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLV 210
+ SK V C H LC S + C S + C Y+ Y+ + SS+G L+
Sbjct: 101 -------RPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLI 152
Query: 211 DDILHLASFSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLA 268
+D SF+ SV + SV GCG Q D ++P DGV+GLG G VS+ S L
Sbjct: 153 ND-----SFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLK 207
Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNS 326
+ G+ +N C G +FFGD Q++T + P+ + Y G S G+
Sbjct: 208 QRGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDR 266
Query: 327 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
L + + DSG+SFT+ + Y +V +S S C+ E
Sbjct: 267 SLGVRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFK 325
Query: 387 KVPDMRLIF 395
V D+R F
Sbjct: 326 SVLDVRKEF 334
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 82/302 (27%), Positives = 137/302 (45%), Gaps = 24/302 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T + +G P ++V +D GS++LWV C+ C C SA L+ L+ YDP SS
Sbjct: 1 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSA-----LNIPLTMYDPRESS 55
Query: 168 SSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
++ VSCS PLC + + C + C YI Y + ++S GY V D + S +
Sbjct: 56 TTSLVSCSDPLCVRGRRFAEAQCSQATNNCEYIFSYG-DGSTSEGYYVRDAMQYNVISSN 114
Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
++ S V+ GC +QTG A DG++G G ++SVP+ LA I FS C
Sbjct: 115 G-LANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL 173
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
E + G + ++ P+ Y V + + ++ L + +
Sbjct: 174 -EGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDT 232
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG + + P+ Y V + S+ + +QG + C+ S P++ L
Sbjct: 233 GVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQ-CFLVSGRLSDLFPNVTL 291
Query: 394 IF 395
F
Sbjct: 292 NF 293
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 162/375 (43%), Gaps = 26/375 (6%)
Query: 52 SWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL-FPSEGS-QTHFFGNQFYWL 109
++P VE EL + + + R+ L SS ++ FP +GS + G++ L
Sbjct: 47 AFPLDELVELSELRARD--RVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGSKMTML 104
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T + +G+P F V +D GS++LWV C P S + L +L +D S ++
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHS----SGLGIDLHFFDAPGSLTA 160
Query: 170 KNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
+V+CS P+C S + C S + C Y Y + + +SGY + D + + +
Sbjct: 161 GSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMTDTFYFDAILGESL 218
Query: 225 QSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
++ + ++ GC Q+G A DG+ G G G +SV S L+ G+ FS C
Sbjct: 219 VANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKG 278
Query: 284 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA------ 335
+ SG F G + P+ Y + + S + L F+A
Sbjct: 279 DGSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGT 337
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+VD+G + T+L E Y + VS + N + CY S+ P + L F
Sbjct: 338 IVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-CYLVSTSISDMFPSVSLNF 396
Query: 396 SKNQSFVVRNHIFSF 410
+ S ++R + F
Sbjct: 397 AGGASMMLRPQDYLF 411
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 136/309 (44%), Gaps = 35/309 (11%)
Query: 101 FFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
+G+ + + L+Y ++IG P + + +D+GS+L W+ C C C + Y
Sbjct: 56 LYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY------ 109
Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLV 210
+ SK V C H LC S + C S + C Y+ Y+ + SS+G L+
Sbjct: 110 -------RPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLI 161
Query: 211 DDILHLASFSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLA 268
+D SF+ SV + SV GCG Q D ++P DGV+GLG G VS+ S L
Sbjct: 162 ND-----SFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLK 216
Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNS 326
+ G+ +N C G +FFGD Q++T + P+ + Y G S G+
Sbjct: 217 QRGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDR 275
Query: 327 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
L + + DSG+SFT+ + Y +V +S S C+ E
Sbjct: 276 SLGVRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFK 334
Query: 387 KVPDMRLIF 395
V D+R F
Sbjct: 335 SVLDVRKEF 343
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 136/309 (44%), Gaps = 35/309 (11%)
Query: 101 FFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
+G+ + + L+Y ++IG P + + +D+GS+L W+ C C C + Y
Sbjct: 56 LYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY------ 109
Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLV 210
+ SK V C H LC S + C S + C Y+ Y+ + SS+G L+
Sbjct: 110 -------RPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLI 161
Query: 211 DDILHLASFSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLA 268
+D SF+ SV + SV GCG Q D ++P DGV+GLG G VS+ S L
Sbjct: 162 ND-----SFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLK 216
Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNS 326
+ G+ +N C G +FFGD Q++T + P+ + Y G S G+
Sbjct: 217 QRGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDR 275
Query: 327 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
L + + DSG+SFT+ + Y +V +S S C+ E
Sbjct: 276 SLGVRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFK 334
Query: 387 KVPDMRLIF 395
V D+R F
Sbjct: 335 SVLDVRKEF 343
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 96/364 (26%), Positives = 153/364 (42%), Gaps = 53/364 (14%)
Query: 93 PSEGSQTH--FFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSAS 149
PS + H N +Y T + IGTP F + +D+GS + +VPC C QC
Sbjct: 69 PSARMRLHDDLLTNGYY---TTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCG----- 120
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL 209
+ + P SS+ V CS + +C S K C Y Y+ E +SSSG L
Sbjct: 121 -----NHQDPRFQPDLSSTYSPVKCS-----ADCTCDSDKSQCTYERQYA-EMSSSSGVL 169
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
+DI+ + S+ PQ +V GC +TG A DG+MGLG G +S+ L
Sbjct: 170 GEDIVSFGTESELKPQRAV-----FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVD 223
Query: 270 AGLIQNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 326
G+I +SFS+C+ D G V P + P+ Y Y + ++ +
Sbjct: 224 KGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPDMVFSRSDPVRSPY--YNIELKEIHVAGK 281
Query: 327 CLT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSW 374
L S ++DSG ++ +LP + + V F V+SK L+ N
Sbjct: 282 ALRLDPRIFDSKHGTVLDSGTTYAYLPEQAF----VAFKDAVTSKVRPLKKIRGPDPNYK 337
Query: 375 KYCYNASSEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNF 430
C+ + + ++ PD+ ++F Q + + F ++V C F +
Sbjct: 338 DICFAGAGRNVSQLSQAFPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDP 397
Query: 431 TGIL 434
T +L
Sbjct: 398 TTLL 401
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 87/321 (27%), Positives = 144/321 (44%), Gaps = 33/321 (10%)
Query: 47 VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
+S+ + P + VE L L + D R R+ LQ + L F +G+ +
Sbjct: 17 LSLERTIPLNHQVE-LTTLKARDRARHGGRI-LQ---DGGGGILDFSVQGTSDPYL---- 67
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
L++T + +G+P F V +D GS++LW+ C P S + L +L+ +D +SS
Sbjct: 68 VGLYFTKVKMGSPAKEFYVQIDTGSDILWLNCNTCNNCPKS----SGLGIDLNYFDTASS 123
Query: 167 SSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
S++ VSCS P+C + S C S + C Y Y + + +SGY V D ++
Sbjct: 124 STAALVSCSDPVCSYAVQTATSQCSSQANQCSYTFQYG-DGSGTSGYYVYDAMYFDVIMG 182
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+ S+ S+V+ GC Q+G A DG+ G G G +SV S ++ G+ FS C
Sbjct: 183 QSVFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHC 242
Query: 281 FDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQ 330
SG + G+ T +P+ Y+ + ++S + L T
Sbjct: 243 LKGQGSGGGILVLGEILEPNIVYTPLVPLQPHYN---LNLQSIAVNGQILPIDQDVFATG 299
Query: 331 SGFQALVDSGASFTFLPTEIY 351
+ +VDSG + +L E Y
Sbjct: 300 NNRGTIVDSGTTLAYLVQEAY 320
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 86/323 (26%), Positives = 136/323 (42%), Gaps = 36/323 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ GTP ++ V D GS++ W IQC P S Y D +DP+ S++ V
Sbjct: 139 VGFGTPAQTYTVIFDTGSDVSW-----IQCLPCSGHCYKQHD---PIFDPTKSATYSVVP 190
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C HP C + K C Y +Y + +SS+G L + L L S +
Sbjct: 191 CGHPQCAAADGSKCSNGTCLYKVEYG-DGSSSAGVLSHETLSLTS-------TRALPGFA 242
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFF 291
GCG+ G + D DG++GLG G +S+ S A + +FS C D G +
Sbjct: 243 FGCGQTNLGDFGD---VDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPSDNTTHGYLTI 297
Query: 292 GDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL-------TQSGFQALVDSGA 341
G PA+ + + +K D YFV + S IG L T G +DSG
Sbjct: 298 GPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDG--TFLDSGT 355
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
T+LP E Y + +F ++ + + + + CY+ + + + +P + FS F
Sbjct: 356 ILTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVF 415
Query: 402 VVRNH-IFSFPENEVGDHACFSY 423
+ I FP++ C +
Sbjct: 416 DLSFFGILIFPDDTAPAIGCLGF 438
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 82/302 (27%), Positives = 137/302 (45%), Gaps = 24/302 (7%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T + +G P ++V +D GS++LWV C+ C C SA L+ L+ YDP SS
Sbjct: 28 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSA-----LNIPLTMYDPRESS 82
Query: 168 SSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
++ VSCS PLC + + C + C YI Y + ++S GY V D + S +
Sbjct: 83 TTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYG-DGSTSEGYYVRDAMQYNVISSN 141
Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
++ S V+ GC +QTG A DG++G G ++SVP+ LA I FS C
Sbjct: 142 G-LANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL 200
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
E + G + ++ P+ Y V + + ++ L + +
Sbjct: 201 -EGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDT 259
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG + + P+ Y V + S+ + +QG + C+ S P++ L
Sbjct: 260 GVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQ-CFLVSGRLSDLFPNVTL 318
Query: 394 IF 395
F
Sbjct: 319 NF 320
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 90/350 (25%), Positives = 154/350 (44%), Gaps = 50/350 (14%)
Query: 63 ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
E+L + + + R K N++++ + THF G + + +GTP
Sbjct: 90 EILRRDQLRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGG-----YAVTVGLGTPKKD 144
Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-- 180
F + D GS+L W QC P S + +N ++DP+ S+S KN+SCS CK
Sbjct: 145 FSLLFDTGSDLTWT-----QCEPCSGGCF---PQNDEKFDPTKSTSYKNLSCSSEPCKSI 196
Query: 181 ---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
S C S + C Y Y T T G+L + L + S V + +IGCG
Sbjct: 197 GKESAQGCSS-SNSCLYGVKYGTGYT--VGFLATETLTIT-------PSDVFENFVIGCG 246
Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSVFFGDQG 295
+ G + A G++GLG V++PS + +N FS C + S G + FG
Sbjct: 247 ERNGGRFSGTA---GLLGLGRSPVALPSQTSST--YKNLFSYCLPASSSSTGHLSFGG-- 299
Query: 296 PATQQSTSFLPIGEKYDAYF--------VGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
Q+ F PI K + VG I S +G ++DSG + T+LP
Sbjct: 300 -GVSQAAKFTPITSKIPELYGLDVSGISVGGRKLPIDPSVFRTAG--TIIDSGTTLTYLP 356
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS--SEEMLKVPDMRLIF 395
+ ++ + F +++++ ++ + + CY+ S + + + +P + + F
Sbjct: 357 STAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHANDNITIPQISIFF 406
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 79/297 (26%), Positives = 140/297 (47%), Gaps = 44/297 (14%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y + Y+ +GTP + +D GS+++W+ C+ C QC + ++PS
Sbjct: 87 YLMTYS---VGTPPFNVYGVVDTGSDIVWLQCKPCEQC----------YKQTTPIFNPSK 133
Query: 166 SSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA- 223
SSS KN+ CS LC+S R + + ++ C Y ++S + + S G L + L L S + H+
Sbjct: 134 SSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFS-DQSYSQGELSVETLTLDSTTGHSV 192
Query: 224 --PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
P++ +IGCG G + G++GLG+G VS+ + L + I FS C
Sbjct: 193 SFPKT------VIGCGHNNRGMF--QGETSGIVGLGIGPVSLTTQLKSS--IGGKFSYCL 242
Query: 282 -----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCL------ 328
D N + + FGD + P +K Y++ +E++ +GN +
Sbjct: 243 LPLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLD 302
Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
++ G ++DSG + T LP+ +Y + +LV R+ CY+ +S++
Sbjct: 303 DSEEG-NIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQ 358
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 119/469 (25%), Positives = 203/469 (43%), Gaps = 79/469 (16%)
Query: 2 VNLVAICMLFGCILLDGSDAVS--FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSV 59
V+ + + F C + S AVS FS +L+HR D +K + P +N
Sbjct: 4 VSFLTLSFFFLCFSISFSQAVSNGFSIELIHR--DSSKSPFYK-----------PTQNKY 50
Query: 60 EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
+++ + R RV N+S++N L E + + G+ Y + Y+ +GTP
Sbjct: 51 QHV----VDAVHRSINRV-----NHSNKNSLASTPESTVISYEGD--YIMSYS---VGTP 96
Query: 120 NVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
+ +D GS+++W+ C+ C QC ++ +++PS SSS KN+SCS L
Sbjct: 97 PIKSYGIVDTGSDIVWLQCEPCEQC----------YNQTTPKFNPSKSSSYKNISCSSKL 146
Query: 179 CKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
C+S +SC K+ C Y +Y + + S G L + L L S + P S ++ +IGC
Sbjct: 147 CQSVRDTSCNDKKN-CEYSINYGNQ-SHSQGDLSLETLTLES-TTGRPVSFPKT--VIGC 201
Query: 237 GRKQTGSY--------LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
G GS+ G P ++ LG PS+ K SI GS
Sbjct: 202 GTNNIGSFKRVSSGVVGLGGGPASLI-TQLG----PSIGGKFSYCLVRMSITLKNMSMGS 256
Query: 289 --VFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGF-------QALV 337
+ FGD + + PI +K + Y++ +E++ +G+ + +G ++
Sbjct: 257 SKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGSSKGVEEGNIII 316
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR----- 392
DS TF+P+++Y ++ LV+ +R+ + CYN SS+E P M
Sbjct: 317 DSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLCYNVSSDEEYDFPYMTAHFKG 376
Query: 393 ---LIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGILILQK 438
L+++ N V + F A F F+ + G + QK
Sbjct: 377 ADILLYATNTFVEVARDVLCFAFAPSNGGAIFGSFSQQDFMVGYDLQQK 425
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/393 (25%), Positives = 168/393 (42%), Gaps = 68/393 (17%)
Query: 13 CILLDGSDAVS--FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDW 70
C + S A+S FS +L+HR D K + P +N ++ +
Sbjct: 15 CFIASFSHALSNGFSVELIHR--DSPKSPYYK-----------PTENKYQHF----VDAA 57
Query: 71 KRQKTRVK--LQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALD 128
+R R + ++ S+ + P G Y + Y+ +GTP D
Sbjct: 58 RRSINRANHFFKDSDTSTPESTVIPDRGG---------YLMTYS---VGTPPTKIYGIAD 105
Query: 129 AGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCK 186
GS+++W+ C+ C QC ++ ++PS SSS KN+ C LC S R +
Sbjct: 106 TGSDIVWLQCEPCEQC----------YNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSC 155
Query: 187 SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD 246
S ++ C Y Y + + S G L D L L S S +P S ++ +IGCG G++
Sbjct: 156 SDQNSCQYKISYG-DSSHSQGDLSVDTLSLESTSG-SPVSFPKT--VIGCGTDNAGTF-- 209
Query: 247 GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------DENDSGSVFFGDQGPATQQ 300
G A G++GLG G VS+ + L + I FS C + N S + FGD +
Sbjct: 210 GGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGD 267
Query: 301 STSFLPIGEKYDA-YFVGVESYCIGNSCLTQSG--------FQALVDSGASFTFLPTEIY 351
P+ +K YF+ ++++ +GN + G ++DSG + T +P+++Y
Sbjct: 268 GVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVY 327
Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
+ LV R+ + CY+ S E
Sbjct: 328 TNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNE 360
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 148/323 (45%), Gaps = 46/323 (14%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP +F + +D GS + +VPC C QC +++P
Sbjct: 89 YYTTRIWI--GTPPQTFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFEPEL 136
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ + VSC+ +C + + C Y Y+ E +SSSG L +DI+ + S+ PQ
Sbjct: 137 SSTYQPVSCN-----IDCTCDNERKQCVYERQYA-EMSSSSGVLGEDIISFGNQSELVPQ 190
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+ I GC ++TG A DG+MGLG GD+S+ L + G+I +SFS+C+ D
Sbjct: 191 RA-----IFGCENQETGDLYSQRA-DGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMD 244
Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
G + G P+ P+ +Y Y + +++ + L +
Sbjct: 245 IGGGAMILGGISPPSGMVFAESDPVRSQY--YNIDLKAIHVAGKQLHLDPSIFDGKHGTV 302
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV----P 389
+DSG ++ +LP + K ++S + + G Y C++ + ++ ++ P
Sbjct: 303 LDSGTTYAYLPEAAFTAFKDAMMKELTSLK-QIHGPDPNYNDICFSGAESDVSQLSNTFP 361
Query: 390 DMRLIFSKNQ--SFVVRNHIFSF 410
+ ++FS Q S N++F +
Sbjct: 362 AVEMVFSNGQKLSLSPENYLFQY 384
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 166/376 (44%), Gaps = 33/376 (8%)
Query: 52 SWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL-FPSEGSQTHFFGNQFYWLH 110
++P VE EL + + + R+ L SS ++ FP +GS + L+
Sbjct: 47 AFPLDEPVELSELRARD--RVRHARILLGGGRQSSVGGVVDFPVQGSSDPYL----VGLY 100
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
+T + +G+P F V +D GS++LWV C P S + L +L +D S ++
Sbjct: 101 FTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHS----SGLGIDLHFFDAPGSFTAG 156
Query: 171 NVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+V+CS P+C S + C S + C Y Y + + +SGY + D + + +
Sbjct: 157 SVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMTDTFYFDAILGESLV 214
Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
++ + ++ GC Q+G A DG+ G G G +SV S L+ G+ FS C +
Sbjct: 215 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 285 DSGSVFF--GDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQA----- 335
SG F G+ + LP Y+ +GV + + + F+A
Sbjct: 275 GSGGGVFVLGEILVPGMVYSPLLPSQPHYNLNLLSIGVNGQILP---IDAAVFEASNTRG 331
Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
+VD+G + T+L E Y + V S+ ++L ++ + CY S+ P + L
Sbjct: 332 TIVDTGTTLTYLVKEAYDPFLNAISNSV-SQLVTLIISNGEQCYLVSTSISDMFPPVSLN 390
Query: 395 FSKNQSFVVRNHIFSF 410
F+ S ++R + F
Sbjct: 391 FAGGASMMLRPQDYLF 406
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/313 (26%), Positives = 137/313 (43%), Gaps = 26/313 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
L+YT + +GTP F V +D GS++LWV C P S + L L+ +D SS+
Sbjct: 77 LYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQS----SQLGIELNFFDTVGSST 132
Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ + CS P+C SR + C + C Y Y + + +SGY V D ++ +
Sbjct: 133 AALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYG-DGSGTSGYYVSDAMYFSLIMGQP 191
Query: 224 PQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
P + ++++ GC Q+G A DG+ G G G +SV S L+ G+ FS C
Sbjct: 192 PAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCL- 250
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------TQSGF 333
+ D G + S + P+ Y + ++S + L + +
Sbjct: 251 KGDGDGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPINPAVFSISNNRG 310
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKVPDM 391
+VD G + +L E Y +V + V S+++ + +GN CY S+ P +
Sbjct: 311 GTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ---CYLVSTSIGDIFPSV 367
Query: 392 RLIFSKNQSFVVR 404
L F S V++
Sbjct: 368 SLNFEGGASMVLK 380
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/343 (27%), Positives = 148/343 (43%), Gaps = 48/343 (13%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
T + IGTP F + +D+GS + +VPC C QC + + P SSS
Sbjct: 91 TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCG----------NHQDPRFQPDLSSSYS 140
Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
V C+ +C S K C Y Y+ E +SSSG L +DI+ S+ Q +V
Sbjct: 141 PVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRESELKAQRAV-- 192
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
GC +TG A DG+MGLG G +S+ L + G+I +SFS+C+ D G
Sbjct: 193 ---FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGA 248
Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
V G P+ + P+ Y Y + ++ + L S ++DSG
Sbjct: 249 MVLGGVPTPSDMVFSRSDPLRSPY--YNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGT 306
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKY---CYNASSEEMLKV----PDM 391
++ +LP + + + F V+SK SL +G Y C+ + + K+ PD+
Sbjct: 307 TYAYLPEQAF----MAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDV 362
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
++F Q + + F ++V C F + T +L
Sbjct: 363 DMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLL 405
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 166/386 (43%), Gaps = 79/386 (20%)
Query: 13 CILLDGSDAVS--FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDW 70
C ++ S A++ F+ +L+HR D +K + + N + + S+ + N +
Sbjct: 16 CFIISLSHALNNGFTLELIHR--DSSKSPFYQPTQNKYERIANAVRRSINRV-----NHF 68
Query: 71 KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
+ QS NS + + Y + Y+ IGTP +D G
Sbjct: 69 YKYSLTSTPQSTVNSDKGE-----------------YLMSYS---IGTPPFKVFGFVDTG 108
Query: 131 SNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKS 187
S+L+W+ C+ C QC P + +DPS SSS +N+ C C S +SC
Sbjct: 109 SDLVWLQCEPCKQCYP----------QITPIFDPSLSSSYQNIPCLSDTCHSMRTTSC-- 156
Query: 188 LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG 247
D Y++ + S++GY V SF K +IGCG + TG++
Sbjct: 157 --DVRGYLSVETLTLDSTTGYSV-------SFPK----------TMIGCGYRNTGTFHGP 197
Query: 248 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---NDSGSVFFGDQGPATQQSTSF 304
++ G++GLG G +S+PS L + I FS C N + + FGD
Sbjct: 198 SS--GIVGLGSGPMSLPSQLGTS--IGGKFSYCLGPWLPNSTSKLNFGDAAIVYGDGAMT 253
Query: 305 LPIGEK--YDAYFVGVESYCIGNSCLTQSG-------FQALVDSGASFTFLPTEIYAEVV 355
PI +K Y++ +E++ +GN + G L+DSG +FTFLP ++Y
Sbjct: 254 TPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGTTFTFLPYDVYYRFE 313
Query: 356 VKFDKLVSSKRISLQGNSWKYCYNAS 381
+ ++ + + ++K CYN +
Sbjct: 314 SAVAEYINLEHVEDPNGTFKLCYNVA 339
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 76/275 (27%), Positives = 123/275 (44%), Gaps = 26/275 (9%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
Y L+ T + +GTP F V +D GS++LW+ C P S+ L L+ +D S
Sbjct: 81 YGLYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSS----GLGIELNFFDTVGS 136
Query: 167 SSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
S++ V CS P+C S + C + C Y Y + + +SG V D ++
Sbjct: 137 STAALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYE-DGSGTSGVYVSDAMYFDMILG 195
Query: 222 HAPQSSVQSS--VIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ ++V SS ++ GC Q+G A DG++G G G++SV S L+ G+ FS
Sbjct: 196 QSTPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFS 255
Query: 279 ICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------- 328
C D N G + G+ + S + P+ Y + ++S + L
Sbjct: 256 HCLKGDGNGGGILVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQVLSINPAVFA 312
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
T ++DSG + ++L E Y +V D VS
Sbjct: 313 TSDKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVS 347
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 86/336 (25%), Positives = 155/336 (46%), Gaps = 46/336 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS + +VPC C QC ++ P S
Sbjct: 83 YYTTRLWI--GTPPQMFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFQPES 130
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ + V C+ +C S + C Y Y+ E ++SSG L +D++ + S+ APQ
Sbjct: 131 SSTYQPVKCT-----IDCNCDSDRMQCVYERQYA-EMSTSSGVLGEDLISFGNQSELAPQ 184
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC +TG A DG+MGLG GD+S+ L +I +SFS+C+ D
Sbjct: 185 RAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMD 238
Query: 286 --SGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
G++ G P + + ++ P+ Y Y + ++ + N+ + +
Sbjct: 239 VGGGAMVLGGISPPSDMAFAYSDPVRSPY--YNIDLKEIHVAGKRLPLNANVFDGKHGTV 296
Query: 337 VDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV---- 388
+DSG ++ +LP + + +VK +L S K+IS ++ C++ + ++ ++
Sbjct: 297 LDSGTTYAYLPEAAFLAFKDAIVK--ELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSF 354
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
P + ++F Q + + + F ++V C F
Sbjct: 355 PVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVF 390
>gi|388505672|gb|AFK40902.1| unknown [Lotus japonicus]
Length = 207
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 60/84 (71%)
Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
+ F+A VDSG SFTFLP Y + +FDK V++ R S +G+ W+YCY +SSE++ KVP
Sbjct: 2 TSFKAQVDSGTSFTFLPGHAYGAITEEFDKQVNASRSSFEGSPWEYCYPSSSEQLPKVPS 61
Query: 391 MRLIFSKNQSFVVRNHIFSFPENE 414
+ L+F +N SFVV N +F+F +N+
Sbjct: 62 LTLMFQQNNSFVVYNPVFTFYDNQ 85
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 143/340 (42%), Gaps = 32/340 (9%)
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
+E L D R R L + FP EGS F L++T + +G+P
Sbjct: 47 VEHLRERDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFM----VGLYFTRVKLGSPPK 102
Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC- 179
+ V +D GS++LWV C C C S L+ L ++P +SS+S + CS C
Sbjct: 103 EYFVQIDTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSSTSSKIPCSDDRCT 157
Query: 180 ----KSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
S + C++ + PC Y Y + + +SGY V D ++ + + ++ +S++
Sbjct: 158 AALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVSDTMYFDTVMGNEQTANSSASIVF 216
Query: 235 GCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
GC Q+G A DG+ G G +SV S L G+ FS C +D+G
Sbjct: 217 GCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGIL-V 275
Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSGASFTF 345
G + + P+ Y + +ES I +S T S Q +VDSG + +
Sbjct: 276 LGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAY 335
Query: 346 LPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSE 383
L Y V VS SL +GN C+ SS
Sbjct: 336 LADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVTSSR 372
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 86/322 (26%), Positives = 135/322 (41%), Gaps = 30/322 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
L++T + +G P F V +D GS++LWV C P S+ L+ L ++P SSS+
Sbjct: 4 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSS----GLNIQLESFNPDSSST 59
Query: 169 SKNVSCSHPLCKS-----RSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
+ ++CS C + + C+ S PC Y Y + + +SGY V D + +
Sbjct: 60 ASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYYVSDTMFFETVM 118
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ ++ +S++ GC Q+G A DG+ G G +SV S L G+ FS
Sbjct: 119 GNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSH 178
Query: 280 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSG 332
C +D+G G + + P+ Y + +ES I +S T S
Sbjct: 179 CLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSN 237
Query: 333 FQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
Q +VDSG + +L Y V VS SL + C+ SS P +
Sbjct: 238 TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFITSSSVDSSFPTV 296
Query: 392 RLIFSKNQSFVVRNHIFSFPEN 413
L F + V+ PEN
Sbjct: 297 TLYFMGGVAMSVK------PEN 312
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 119/248 (47%), Gaps = 22/248 (8%)
Query: 46 NVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQ 105
+++ ++P + VE EL + + ++ LQS N + FP +G+ F
Sbjct: 24 TLTLERAFPSNDGVELSELRARDSLRHRRM---LQSTNYV----VDFPVKGT----FDPS 72
Query: 106 FYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
L+YT + +GTP V +D GS++LWV C P ++ L L+ +DP S
Sbjct: 73 QVGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTS----GLQIQLNYFDPGS 128
Query: 166 SSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
SS+S +SC C+ S +SC + C Y Y + + +SGY V D++H AS
Sbjct: 129 SSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYG-DGSGTSGYYVSDLMHFASIF 187
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ ++ +SV+ GC QTG A DG+ G G +SV S L+ G+ FS
Sbjct: 188 EGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSH 247
Query: 280 CFDENDSG 287
C ++SG
Sbjct: 248 CLKGDNSG 255
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 80/280 (28%), Positives = 123/280 (43%), Gaps = 33/280 (11%)
Query: 93 PSEGSQT-HFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSA 148
P E S +G+ + + L+Y + IG P + + +D GS+L W+ C C+ C +
Sbjct: 39 PEESSAVFQLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPH 98
Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTE 201
Y + +K V C LC S + C S K C Y Y+ +
Sbjct: 99 PLY-------------RPTKNKIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYA-D 144
Query: 202 DTSSSGYLVDDILHLASFSKHAPQSS-VQSSVIIGCG-RKQTGSYLDGAAPDGVMGLGLG 259
SS G L+ D SF+ SS V+ S+ GCG +Q GS + A DGV+GLG G
Sbjct: 145 QGSSLGVLLTD-----SFAVRLANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSG 199
Query: 260 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGV 318
+S+ S L + G+ +N C G +FFGD ++T + + Y+ G
Sbjct: 200 SISLLSQLKQHGITKNVVGHCLSIRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGT 259
Query: 319 ESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
S G L + ++DSG+SFT+ + Y +V
Sbjct: 260 ASLYFGGRSLGVRPMEVVLDSGSSFTYFGAQPYQALVTAL 299
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 83/299 (27%), Positives = 124/299 (41%), Gaps = 42/299 (14%)
Query: 103 GNQFYWLH-YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-----CIQCAPLSA-SYYTSLD 155
GN + H Y ++IG P + + +D GSNL W+ C C C P YYT D
Sbjct: 30 GNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYTPAD 89
Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCKSL-----KDP--CPYIADYSTEDTSSSG 207
NL V C PLC + R + DP C Y Y T S G
Sbjct: 90 GNL------------KVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEG 135
Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSL 266
L DI+ + K + GCG KQ +P DG++GLG+G + +
Sbjct: 136 DLATDIISVNGRDK--------KRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQ 187
Query: 267 LAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
L +I +N C G ++ GD P T+ T + P+ E Y G+ I
Sbjct: 188 LKGHKMIKENVIGHCLSSKGKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDK 246
Query: 326 SCLT-QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASS 382
+ F+A+ DSG+++T +P +IY E+V K +S + ++G + C+
Sbjct: 247 QPIRGNPTFEAVFDSGSTYTHVPAQIYNEIVSKVRVTLSESSLEEVKGRALPLCWKGKK 305
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 126/302 (41%), Gaps = 27/302 (8%)
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
+E L D R L + + FP EGS + L++T + +G P
Sbjct: 45 VEHLKERDGAHHARRRGLLGGAPAVAGVVDFPVEGSANPYM----VGLYFTRVKLGNPAK 100
Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS 181
+ V +D GS++LWV C P S + L+ L ++P SSS+S + CS C +
Sbjct: 101 EYFVQIDTGSDILWVACSPCTGCPTS----SGLNIQLEFFNPDSSSTSSRIPCSDDRCTA 156
Query: 182 R--------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
S S PC Y Y + + +SG+ V D ++ + + ++ +SV+
Sbjct: 157 ALQTGEAVCQSSDSPSSPCGYTFTYG-DGSGTSGFYVSDTMYFDTVMGNEQTANSSASVV 215
Query: 234 IGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
GC Q+G + A DG+ G G +SV S L G+ +FS C +D+G
Sbjct: 216 FGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLKGSDNGGGIL- 274
Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFT 344
G + F P+ Y + +ES + L T + +VDSG +
Sbjct: 275 VLGEIVEPGLVFTPLVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNTQGTIVDSGTTLV 334
Query: 345 FL 346
+L
Sbjct: 335 YL 336
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 99/393 (25%), Positives = 179/393 (45%), Gaps = 51/393 (12%)
Query: 19 SDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKK-NSVEYLELLLSNDWKRQKTRV 77
S+ S S+++++R S + ++K G PK N E LL + + + +V
Sbjct: 55 SNVCSQSTRVLNRASSL---KVVNKYGPCIPVTGAPKTINVPSTAEFLLQDQLRVKSFQV 111
Query: 78 KLQSNNNSS---RNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
+L N +S Q P+ T + + +GTP F ++ D GS+L
Sbjct: 112 RLSMNPSSGVFKEMQTTIPASIVPTG-------GAYVVTVGLGTPKKDFTLSFDTGSDLT 164
Query: 135 WVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLK 189
W QC P + +N ++DP++S+S KNVSCS CK + + +
Sbjct: 165 WT-----QCEPCLGGCF---PQNQPKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDCIS 216
Query: 190 DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA 249
+ C Y Y + T G+L + L +AS S V + + GC + G++ +G
Sbjct: 217 NTCLYGIQYGSGYTI--GFLATETLAIAS-------SDVFKNFLFGCSEESRGTF-NGTT 266
Query: 250 PDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLP- 306
G++GLG +++PS +N FS C + S G + FG + +ST P
Sbjct: 267 --GLLGLGRSPIALPSQTTNK--YKNLFSYCLPASPSSTGHLSFGVEVSQAAKSTPISPK 322
Query: 307 IGEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLVSS 364
+ + Y VG+ + L +G + ++DSG +FTFLP+ Y+ + F +++++
Sbjct: 323 LKQLYGLNTVGIS---VRGRELPINGSISRTIIDSGTTFTFLPSPTYSALGSAFREMMAN 379
Query: 365 KRISLQGNSWKYCYNASS--EEMLKVPDMRLIF 395
++ +S++ CY+ S+ L +P + + F
Sbjct: 380 YTLTNGTSSFQPCYDFSNIGNGTLTIPGISIFF 412
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 126/259 (48%), Gaps = 27/259 (10%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTS-LDRNLSEYDPSSSS 167
L+YT I +GTP F V +D GSN+ WV +CAP + ++ + +S +DP S+
Sbjct: 40 LYYTRISLGTPPQQFYVDVDTGSNVAWV-----KCAPCTGCEHSGDVPVPMSTFDPRKST 94
Query: 168 SSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF-SKHA 223
+ ++SC+ C + C + CPY Y + +S++GY ++D+ S ++
Sbjct: 95 TKISISCTDAECGVLNKKLQCSPERLSCPYSLLYG-DGSSTAGYYLNDVFTFNQVPSDNS 153
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-- 281
S + ++ GCG QTGS+ + DG++G G VS+P+ LA+ + N F+ C
Sbjct: 154 TAKSGTARLVFGCGGTQTGSW----SVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQG 209
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSCLTQSGFQ------ 334
D + GS+ G + + P+ D Y V + + I G + T + F
Sbjct: 210 DVSGRGSLVIGT---IREPDLVYTPMVFGEDHYNVQLLNIGISGRNVTTPASFDLEYTGG 266
Query: 335 ALVDSGASFTFLPTEIYAE 353
++DSG + T+L Y E
Sbjct: 267 VIIDSGTTLTYLVQPAYDE 285
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 146/332 (43%), Gaps = 46/332 (13%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
T + IGTP F + +D GS + +VPC C QC + P SSS+ K
Sbjct: 90 TRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCG----------KHQDPRFQPESSSTYK 139
Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
+ C+ P C +C C Y Y+ E +SSSG L +D+L + S+ PQ +
Sbjct: 140 PMQCN-PSC----NCDDEGKQCTYERRYA-EMSSSSGLLAEDVLSFGNESELTPQRA--- 190
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--SGS 288
I GC +TG A DG+MGLG G +SV L ++ NSFS+C+ D G+
Sbjct: 191 --IFGCETVETGELFSQRA-DGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGA 247
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGV---ESYCIG-----NSCLTQSGFQALVDSG 340
+ G+ P + + Y + + + E + G N + ++DSG
Sbjct: 248 MVLGNIPPPPDMVFAH---SDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSG 304
Query: 341 ASFTFLPTEIYA---EVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV----PDMR 392
++ +LP E + + ++K K + K+I S+ C++ + ++ ++ P++
Sbjct: 305 TTYAYLPEEAFVAFKDAIIKEIKFL--KQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVN 362
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
++F Q + + F +V C F
Sbjct: 363 MVFGNGQKLSLSPENYLFRHTKVSGAYCLGIF 394
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 90/332 (27%), Positives = 140/332 (42%), Gaps = 48/332 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP + +D GS++ W +QCAP + Y + + ++PSSSSS
Sbjct: 16 YFAVVGVGTPRRDMYLVVDTGSDITW-----LQCAPCTNCY----KQKDALFNPSSSSSF 66
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
K + CS LC + L + C Y ADY + + D+++ +F P V
Sbjct: 67 KVLDCSSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAF---GPGQVVL 123
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
+++ +GCG G++ A G++GLG G +S P+ L + +N FS C D N
Sbjct: 124 TNIPLGCGHDNEGTFGTAA---GILGLGRGPLSFPNNLDAS--TRNIFSYCLPDRESDPN 178
Query: 285 DSGSVFFGDQG-PATQQ-STSFLPIGEKYDA---YFVGVESYCIGNSCLTQ---SGFQ-- 334
++ FGD P T S F+P Y+V + +G + LT S FQ
Sbjct: 179 HKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLD 238
Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
+ DSG + T L Y V F + + CY+ + + V
Sbjct: 239 SHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTGMNSISV 298
Query: 389 P----------DMRLIFSKNQSFVVRNHIFSF 410
P DMRL S V N+IF F
Sbjct: 299 PTVTFHFQGDVDMRLPPSNYIVPVSNNNIFCF 330
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 135/320 (42%), Gaps = 37/320 (11%)
Query: 73 QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGS 131
+ R L + + +FP +G+ + + L+Y + IG P + + +D GS
Sbjct: 27 RPARGGLSVTAGAEESSAVFP-------LYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGS 79
Query: 132 NLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-------R 182
+L W+ C C+ C+ + Y + +K V C +C + R
Sbjct: 80 DLTWLQCDAPCVSCSKVPHPLY-------------RPTKNKLVPCVDQMCAALHGGLTGR 126
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQT 241
C S K C Y Y+ + SS G LV D L + A S V+ + GCG +Q
Sbjct: 127 HKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL----RLANSSIVRPGLAFGCGYDQQV 181
Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQ 300
GS + +A DGV+GLG G VS+ S L + G+ +N C G +FFGD P ++
Sbjct: 182 GSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVPYSRA 241
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
+ + + + Y G + G L + + DSG+SFT+ + Y +V
Sbjct: 242 TWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALVDAIKG 301
Query: 361 LVSSKRISLQGNSWKYCYNA 380
+S + +S C+
Sbjct: 302 DLSKNLKEVPDHSLPLCWKG 321
>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 83/299 (27%), Positives = 123/299 (41%), Gaps = 42/299 (14%)
Query: 103 GNQFYWLH-YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-----CIQCAPLSA-SYYTSLD 155
GN + H Y ++IG P + + +D GSNL W+ C C C P YYT D
Sbjct: 30 GNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYTPAD 89
Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCKSL-----KDP--CPYIADYSTEDTSSSG 207
NL V C PLC + R + DP C Y Y T S G
Sbjct: 90 GNLK------------VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEG 135
Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSL 266
L DI+ + K + GCG KQ +P DG++GLG+G +
Sbjct: 136 DLATDIISVNGRDK--------KRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQ 187
Query: 267 LAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
L +I +N C G ++ GD P T+ T + P+ E Y G+ I
Sbjct: 188 LKGHKMIKENVIGHCLSSKGKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDK 246
Query: 326 SCLT-QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASS 382
+ F+A+ DSG+++T +P +IY E+V K +S + ++G + C+
Sbjct: 247 QPIRGNPTFEAVFDSGSTYTHVPAQIYNEIVSKVRGTLSESSLEEVKGRALPLCWKGKK 305
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 135/320 (42%), Gaps = 37/320 (11%)
Query: 73 QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGS 131
+ R L + + +FP +G+ + + L+Y + IG P + + +D GS
Sbjct: 27 RPARGGLSVTAGAEESSAVFP-------LYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGS 79
Query: 132 NLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-------R 182
+L W+ C C+ C+ + Y + +K V C +C + R
Sbjct: 80 DLTWLQCDAPCVSCSKVPHPLY-------------RPTKNKLVPCVDQMCAALHGGLTGR 126
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQT 241
C S K C Y Y+ + SS G LV D L + A S V+ + GCG +Q
Sbjct: 127 HKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL----RLANSSIVRPGLAFGCGYDQQV 181
Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQ 300
GS + +A DGV+GLG G VS+ S L + G+ +N C G +FFGD P ++
Sbjct: 182 GSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVPYSRA 241
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
+ + + + Y G + G L + + DSG+SFT+ + Y +V
Sbjct: 242 TWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALVDAIKG 301
Query: 361 LVSSKRISLQGNSWKYCYNA 380
+S + +S C+
Sbjct: 302 DLSKNLKEVPDHSLPLCWKG 321
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 135/320 (42%), Gaps = 37/320 (11%)
Query: 73 QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGS 131
+ R L + + +FP +G+ + + L+Y + IG P + + +D GS
Sbjct: 27 RPARGGLSVTAGAEESSAVFP-------LYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGS 79
Query: 132 NLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-------R 182
+L W+ C C+ C+ + Y + +K V C +C + R
Sbjct: 80 DLTWLQCDAPCVSCSKVPHPLY-------------RPTKNKLVPCVDQMCAALHGGLTGR 126
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQT 241
C S K C Y Y+ + SS G LV D L + A S V+ + GCG +Q
Sbjct: 127 HKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL----RLANSSIVRPGLAFGCGYDQQV 181
Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQ 300
GS + +A DGV+GLG G VS+ S L + G+ +N C G +FFGD P ++
Sbjct: 182 GSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVPYSRA 241
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
+ + + + Y G + G L + + DSG+SFT+ + Y +V
Sbjct: 242 TWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALVDAIKG 301
Query: 361 LVSSKRISLQGNSWKYCYNA 380
+S + +S C+
Sbjct: 302 DLSKNLKEVPDHSLPLCWKG 321
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 135/320 (42%), Gaps = 37/320 (11%)
Query: 73 QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGS 131
+ R L + + +FP +G+ + + L+Y + IG P + + +D GS
Sbjct: 27 RPARGGLSVTAGAEESSAVFP-------LYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGS 79
Query: 132 NLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-------R 182
+L W+ C C+ C+ + Y + +K V C +C + R
Sbjct: 80 DLTWLQCDAPCVSCSKVPHPLY-------------RPTKNKLVPCVDQMCAALHGGLTGR 126
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQT 241
C S K C Y Y+ + SS G LV D L + A S V+ + GCG +Q
Sbjct: 127 HKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL----RLANSSIVRPGLAFGCGYDQQV 181
Query: 242 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQ 300
GS + +A DGV+GLG G VS+ S L + G+ +N C G +FFGD P ++
Sbjct: 182 GSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVPYSRA 241
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
+ + + + Y G + G L + + DSG+SFT+ + Y +V
Sbjct: 242 TWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALVDAIKG 301
Query: 361 LVSSKRISLQGNSWKYCYNA 380
+S + +S C+
Sbjct: 302 DLSKNLKEVPDHSLPLCWKG 321
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 88/336 (26%), Positives = 142/336 (42%), Gaps = 46/336 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS + +VPC C QC ++ P
Sbjct: 79 YYTTRLWI--GTPPQEFALIVDTGSTVTYVPCSTCKQCG----------KHQDPKFQPEL 126
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SSS K + C +P C +C C Y Y+ E +SSSG L +D++ + S+ PQ
Sbjct: 127 SSSYKALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNESQLTPQ 180
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--E 283
+V GC +TG A DG+MGLG G +SV L G+I++ FS+C+ E
Sbjct: 181 RAV-----FGCENVETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 234
Query: 284 NDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
G++ G P S P Y Y + ++ + L +
Sbjct: 235 VGGGAMVLGKISPPAGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFNGKHGTV 292
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY---CYNASSEEMLKV---- 388
+DSG ++ + P E + + K + S KRI G Y C++ + ++ ++
Sbjct: 293 LDSGTTYAYFPKEAFIAIKDAIIKEIPSLKRI--HGPDPNYDDVCFSGAGRDVAEIHNFF 350
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
P++ + F Q ++ + F +V C F
Sbjct: 351 PEIDMEFGNGQKLILSPENYLFRHTKVRGAYCLGIF 386
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 80/274 (29%), Positives = 126/274 (45%), Gaps = 34/274 (12%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP V +L D GS+L W C C++C Y L ++P S+S +V C
Sbjct: 86 IGTPPVDYLGIADTGSDLTWAQCLPCLKC-------YQQLR---PIFNPLKSTSFSHVPC 135
Query: 175 SHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
+ C + ++ C Y Y + T S G L + + + S SSV+S +
Sbjct: 136 NTQTCHAVDDGHCGVQGVCDYSYTYG-DRTYSKGDLGFEKITIGS-------SSVKS--V 185
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD---ENDSGSVF 290
IGCG +G + GV+GLG G +S+ S +++ I FS C + +G +
Sbjct: 186 IGCGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKIN 242
Query: 291 FGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNS---CLTQSGFQALVDSGASFTF 345
FG + P+ K Y++ +E+ IGN + G ++DSG + +F
Sbjct: 243 FGQNAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQG-NVIIDSGTTLSF 301
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
LP E+Y VV K+V +KR+ GN W C++
Sbjct: 302 LPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFD 335
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 157/362 (43%), Gaps = 39/362 (10%)
Query: 66 LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
+ N + R+ +S RN ++ S+ ++ F N +L I +GTP S +
Sbjct: 41 MYNSSETHFDRIVNALRRSSHRNTVVLESDTAEAPIFNNGGEYL--VEISVGTPPFSIVA 98
Query: 126 ALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSC 185
D GS+++W QC P S Y +N +DPS S++ KNV+CS P+C
Sbjct: 99 VADTGSDVIWT-----QCKPCSNCY----QQNAPMFDPSKSTTYKNVACSSPVCSYSGDG 149
Query: 186 KSLKD--PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
S D C Y Y +D+ S G L D + + S S + +IGCG G+
Sbjct: 150 SSCSDDSECLYSIAYG-DDSHSQGNLAVDTVTMQSTSG---RPVAFPRTVIGCGHDNAGT 205
Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------DENDSGSVFFGDQGPA 297
+ A G++GLG G S+ + L A FS C NDS + FG
Sbjct: 206 F--NANVSGIVGLGRGPASLVTQLGPA--TGGKFSYCLIPIGTGSTNDSTKLNFGSNANV 261
Query: 298 TQQSTSFLPI--GEKYDAYF-VGVESYCIGNSCL------TQSGFQA--LVDSGASFTFL 346
+ T PI +Y ++ + +E+ +G++ ++ G ++ ++DSG + T+L
Sbjct: 262 SGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGESNIIIDSGTTLTYL 321
Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH 406
P+ + + +S YC+ A++ + ++P + + F + R +
Sbjct: 322 PSALLNSFGSAISQSMSLPHAQDPSEFLDYCF-ATTTDDYEMPPVTMHFEGADVPLQREN 380
Query: 407 IF 408
+F
Sbjct: 381 LF 382
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 141/332 (42%), Gaps = 51/332 (15%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++ I +G P+ + V +D GS++LWV C C +C S L L+ YDP+SS
Sbjct: 26 LYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKS-----DLGIKLTLYDPASSV 80
Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
S+ VSC C S CK + PC Y Y + +S++GY V D + + +
Sbjct: 81 SATRVSCDDDFCTSTYNGLLPDCKK-ELPCQYNVVYG-DGSSTAGYFVSDAVQFERVTGN 138
Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
+V GCG +Q+G G A DG++G +F+ C
Sbjct: 139 LQTGLSNGTVTFGCGAQQSGGLGTSGEALDGILG--------------------AFAHCL 178
Query: 282 DENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQ 334
D + G +F G+ +T +P Y+ Y +E +G + L SG +
Sbjct: 179 DNVNGGGIFAIGELVSPKVNTTPMVPNQAHYNVYMKEIE---VGGTVLELPTDVFDSGDR 235
Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLKVPDM 391
++DSG + +LP +Y ++ + +SL ++ C+ S PD+
Sbjct: 236 RGTIIDSGTTLAYLPEVVYDSMMNEIRS--QQPGLSLHTVEEQFICFKYSGNVDDGFPDI 293
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
+ F + + V H + F +E D CF +
Sbjct: 294 KFHFKDSLTLTVYPHDYLFQISE--DIWCFGW 323
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 159/374 (42%), Gaps = 29/374 (7%)
Query: 52 SWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL-FPSEGSQTHFFGNQFYWLH 110
++P VE EL + + + R+ L SS ++ FP +GS + L+
Sbjct: 47 AFPLDELVELSELRARD--RVRHARILLGGGRQSSVGGVVDFPVQGSSDPYL----VGLY 100
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
+T + +G+P F V +D GS++LWV C P S + L +L +D S ++
Sbjct: 101 FTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHS----SGLGIDLHFFDAPGSLTAG 156
Query: 171 NVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+V+CS P+C S + C S + C Y Y + + +SGY + D + + +
Sbjct: 157 SVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMTDTFYFDAILGESLV 214
Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
++ + ++ GC Q+G A DG+ G G G +SV S L+ G+ FS C +
Sbjct: 215 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA------L 336
SG F G + P+ Y + + S + L F+A +
Sbjct: 275 GSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTI 333
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
VD+G + T+L E Y + VS + N + CY S+ P + L F+
Sbjct: 334 VDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-CYLVSTSISDMFPSVSLNFA 392
Query: 397 KNQSFVVRNHIFSF 410
S ++R + F
Sbjct: 393 GGASMMLRPQDYLF 406
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 91/329 (27%), Positives = 146/329 (44%), Gaps = 51/329 (15%)
Query: 66 LSNDWKRQKTRVKLQSNNNSSRNQL-----LFPSEGSQTHFFGNQFYWLHYTWIDIGTPN 120
L+N ++R +R N ++ L L P G + + IGTP
Sbjct: 55 LTNAFRRSLSRSATLLNRAATNGALDLQAPLTPGSGE------------YLMSVSIGTPP 102
Query: 121 VSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
V ++ D GS+L+W C C++C S + DP S+S +V C+ C
Sbjct: 103 VDYIGMADTGSDLMWAQCLPCLKCYKQSRPIF----------DPLKSTSFSHVPCNSQNC 152
Query: 180 KS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
K+ S C + + C Y Y + T + G L + + + S SSV+S +IGCG
Sbjct: 153 KAIDDSHCGA-QGVCDYSYTYG-DQTYTKGDLGFEKITIGS-------SSVKS--VIGCG 201
Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD---ENDSGSVFFGDQ 294
+ G + + V+GLG G +S+ S +++ I FS C + +G + FG
Sbjct: 202 HESGGGFGFASG---VIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQN 258
Query: 295 GPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEI 350
+ P+ K Y+V +E+ IGN S Q ++DSG + +FLP E+
Sbjct: 259 AVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQGNVIIDSGTTLSFLPKEL 318
Query: 351 YAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
Y VV K+V +KR+ GN W C++
Sbjct: 319 YDGVVSSLLKVVKAKRVKDPGNFWDLCFD 347
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 159/374 (42%), Gaps = 29/374 (7%)
Query: 52 SWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL-FPSEGSQTHFFGNQFYWLH 110
++P VE EL + + + R+ L SS ++ FP +GS + L+
Sbjct: 47 AFPLDELVELSELRARD--RVRHARILLGGGRQSSVGGVVDFPVQGSSDPYL----VGLY 100
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
+T + +G+P F V +D GS++LWV C P S + L +L +D S ++
Sbjct: 101 FTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHS----SGLGIDLHFFDAPGSLTAG 156
Query: 171 NVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+V+CS P+C S + C S + C Y Y + + +SGY + D + + +
Sbjct: 157 SVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMTDTFYFDAILGESLV 214
Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
++ + ++ GC Q+G A DG+ G G G +SV S L+ G+ FS C +
Sbjct: 215 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA------L 336
SG F G + P+ Y + + S + L F+A +
Sbjct: 275 GSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTI 333
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
VD+G + T+L E Y + VS + N + CY S+ P + L F+
Sbjct: 334 VDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-CYLVSTSISDMFPSVSLNFA 392
Query: 397 KNQSFVVRNHIFSF 410
S ++R + F
Sbjct: 393 GGASMMLRPQDYLF 406
>gi|115469998|ref|NP_001058598.1| Os06g0717900 [Oryza sativa Japonica Group]
gi|54291047|dbj|BAD61724.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|113596638|dbj|BAF20512.1| Os06g0717900 [Oryza sativa Japonica Group]
Length = 307
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 92/164 (56%), Gaps = 10/164 (6%)
Query: 253 VMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY 311
+MGLG+ VSVPS+LA G+++ NSFS+CF ++ G + FGD G A Q T F+ + +
Sbjct: 9 LMGLGMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTH 67
Query: 312 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
Y + + S +G+ L GF A+ DSG SFT+L Y F+ +S +R + G
Sbjct: 68 SYYNISITSMSVGDKNLPL-GFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSG 126
Query: 372 NS------WKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHIF 408
++ ++YCY+ S ++ +++P + L + F V + ++
Sbjct: 127 STRSGPFPFEYCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVY 170
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 84/303 (27%), Positives = 132/303 (43%), Gaps = 39/303 (12%)
Query: 100 HFFGNQFY-WLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDR 156
H GN + L+Y + +G+P + + +D GS+L W C C CA Y
Sbjct: 29 HVGGNIYPDGLYYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGPHGLY----- 83
Query: 157 NLSEYDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVD 211
+ +K V C P+C C S C Y +Y+ + +S+ G LV+
Sbjct: 84 --------NPKKAKVVDCHLPVCAQIQQGGSYECNSDVKQCDYEVEYA-DGSSTMGVLVE 134
Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 270
D L + + + +Q+ IIGCG Q G+ A+ DGV+GL V++P+ LA+
Sbjct: 135 DTLTV----RLTNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEK 190
Query: 271 GLIQNSFSICFDE--NDSGSVFFGDQGPATQQST--------SFLPIGEKYDAYFVGVES 320
G+I+N C + N G +FFGD+ + T L + + G +S
Sbjct: 191 GIIKNVLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSIRYGGDS 250
Query: 321 YCIGN-SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
+ N LT+S + DSG SFT+L + YA V+ K R+ + YC+
Sbjct: 251 LVLNNDEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQSGLLRVK-SDTTLPYCWR 309
Query: 380 ASS 382
S
Sbjct: 310 GPS 312
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/343 (27%), Positives = 154/343 (44%), Gaps = 39/343 (11%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D+GS + +VPC C QC ++ P
Sbjct: 93 YYTTRLWI--GTPPQMFALIVDSGSTVTYVPCSDCEQCG----------KHQDPKFQPEL 140
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ + V C+ +C K+ C Y +Y+ E +SS G L +D++ + S+ PQ
Sbjct: 141 SSTYQPVKCNM-----DCNCDDDKEQCVYEREYA-EHSSSKGVLGEDLISFGNESQLTPQ 194
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC +TG A DG++GLG GD+S+ L GLI NSF +C+ D
Sbjct: 195 RAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMD 248
Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIG-NSCLTQSGFQALVD 338
G + G P+ T P Y D + V + NS + A++D
Sbjct: 249 VGGGSMILGGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLD 308
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK---YCYNASSE--EMLKV-PDM 391
SG ++ +LP +A + VS K+I ++K + AS++ E+ K+ P +
Sbjct: 309 SGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPSV 368
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
+IF QS+++ + F ++V C F + T +L
Sbjct: 369 EMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLL 411
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 147/319 (46%), Gaps = 29/319 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++ I +GTP + V +D GS++LWV C C C S L LS Y PSSSS
Sbjct: 73 LYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKS-----DLGIELSLYSPSSSS 127
Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+S V+C+ C S C + C Y Y + +S++GY V D + L + +
Sbjct: 128 TSNRVTCNQDFCTSTYDGPIPGCTP-ELLCEYRVAYG-DGSSTAGYFVRDHVVLDRVTGN 185
Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
+S S++ GCG +Q+G AA DG++G G + S+ S LA +G ++ F+ C
Sbjct: 186 FQTTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCL 245
Query: 282 DENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSG 332
D + G +F G+ ++T +P Y+ + +E + N L T
Sbjct: 246 DNINGGGIFAIGEVVQPKVRTTPLVPQQAHYNVFMKAIE---VDNEVLNLPTDVFDTDLR 302
Query: 333 FQALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
++DSG + + P IY ++ K F + + K +++ + Y+ + ++ P +
Sbjct: 303 KGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQFTCFEYDGNVDDGF--PTV 360
Query: 392 RLIFSKNQSFVVRNHIFSF 410
F + S V H + F
Sbjct: 361 TFHFEDSLSLTVYPHEYLF 379
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 147/338 (43%), Gaps = 50/338 (14%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS + +VPC C QC ++ P
Sbjct: 75 YYTTRLWI--GTPPQEFALIVDTGSTVTYVPCSTCKQCG----------KHQDPKFQPEL 122
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
S+S + + C +P C +C C Y Y+ E +SSSG L +D++ + S+ +PQ
Sbjct: 123 STSYQALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNESQLSPQ 176
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--E 283
+V GC ++TG A DG+MGLG G +SV L G+I++ FS+C+ E
Sbjct: 177 RAV-----FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 230
Query: 284 NDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
G++ G P S P Y Y + ++ + L +
Sbjct: 231 VGGGAMVLGKISPPPGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFNGKHGTV 288
Query: 337 VDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV-- 388
+DSG ++ + P E + + V+K ++ S KRI G Y C++ + ++ ++
Sbjct: 289 LDSGTTYAYFPKEAFIAIKDAVIK--EIPSLKRI--HGPDPNYDDVCFSGAGRDVAEIHN 344
Query: 389 --PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
P++ + F Q ++ + F +V C F
Sbjct: 345 FFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIF 382
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/336 (24%), Positives = 147/336 (43%), Gaps = 46/336 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI G+P F + +D GS + +VPC C+QC + + P
Sbjct: 88 YYTTRLWI--GSPPQEFALIVDTGSTVTYVPCSNCVQCG----------NHQDPRFQPEL 135
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ + V C+ + +C C Y Y+ E ++SSG L +D++ S+ PQ
Sbjct: 136 SSTYQPVKCN-----ADCNCDENGVQCTYERRYA-EMSTSSGVLAEDVMSFGKESELVPQ 189
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC ++G A DG+MGLG G +SV L G++ NSFS+C+ D
Sbjct: 190 RAV-----FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMD 243
Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
G V G P + P Y Y + ++ + L + A+
Sbjct: 244 VGGGAMVLGGISSPPGMVFSHSDPSRSPY--YNIELKEIHVAGKPLKLNPRTFDGKYGAI 301
Query: 337 VDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV---- 388
+DSG ++ + P + Y + ++K K+ K+IS ++K C++ + ++ ++
Sbjct: 302 LDSGTTYAYFPEKAYYAFKDAIMK--KISFLKQISGPDPNFKDICFSGAGRDVTELPKVF 359
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
P++ ++F+ Q + + F +V C F
Sbjct: 360 PEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIF 395
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/336 (24%), Positives = 147/336 (43%), Gaps = 46/336 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI G+P F + +D GS + +VPC C+QC + + P
Sbjct: 88 YYTTRLWI--GSPPQEFALIVDTGSTVTYVPCSNCVQCG----------NHQDPRFQPEL 135
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ + V C+ + +C C Y Y+ E ++SSG L +D++ S+ PQ
Sbjct: 136 SSTYQPVKCN-----ADCNCDENGVQCTYERRYA-EMSTSSGVLAEDVMSFGKESELVPQ 189
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC ++G A DG+MGLG G +SV L G++ NSFS+C+ D
Sbjct: 190 RAV-----FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMD 243
Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
G V G P + P Y Y + ++ + L + A+
Sbjct: 244 VGGGAMVLGGISSPPGMVFSHSDPSRSPY--YNIELKEIHVAGKPLKLNPRTFDGKYGAI 301
Query: 337 VDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV---- 388
+DSG ++ + P + Y + ++K K+ K+IS ++K C++ + ++ ++
Sbjct: 302 LDSGTTYAYFPEKAYYAFKDAIMK--KISFLKQISGPDPNFKDICFSGAGRDVTELPKVF 359
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
P++ ++F+ Q + + F +V C F
Sbjct: 360 PEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIF 395
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/307 (25%), Positives = 136/307 (44%), Gaps = 40/307 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T + +GTP LV LD GS+ W IQC P Y +++ + +DPS SS+
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSW-----IQCKPCPDCY----EQHEALFDPSKSSTY 184
Query: 170 KNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
+++CS C+ + +C S K CPY Y+ +D+ + G L D L L +P
Sbjct: 185 SDITCSSRECQELGSSHKHNCSSDKK-CPYEITYA-DDSYTVGNLARDTLTL------SP 236
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+V + GCG GS+ + DG++GLG G S+ S + A FS C +
Sbjct: 237 TDAVP-GFVFGCGHNNAGSFGE---IDGLLGLGRGKASLSSQV--AARYGAGFSYCLPSS 290
Query: 285 DSGSVFFGDQG-----PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGF 333
S + + G P Q T + G+ Y++ + + + +
Sbjct: 291 PSATGYLSFSGAAAAAPTNAQFTEMV-AGQHPSFYYLNLTGITVAGRAIKVPPSVFATAA 349
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG +F+ LP YA + + + + + CY+ + E +++P + L
Sbjct: 350 GTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVAL 409
Query: 394 IFSKNQS 400
+F+ +
Sbjct: 410 VFADGAT 416
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 147/338 (43%), Gaps = 50/338 (14%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS + +VPC C QC ++ P
Sbjct: 75 YYTTRLWI--GTPPQEFALIVDTGSTVTYVPCSTCKQCG----------KHQDPKFQPEL 122
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
S+S + + C +P C +C C Y Y+ E +SSSG L +D++ + S+ +PQ
Sbjct: 123 STSYQALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNESQLSPQ 176
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--E 283
+V GC ++TG A DG+MGLG G +SV L G+I++ FS+C+ E
Sbjct: 177 RAV-----FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 230
Query: 284 NDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
G++ G P S P Y Y + ++ + L +
Sbjct: 231 VGGGAMVLGKISPPPGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFNGKHGTV 288
Query: 337 VDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV-- 388
+DSG ++ + P E + + V+K ++ S KRI G Y C++ + ++ ++
Sbjct: 289 LDSGTTYAYFPKEAFIAIKDAVIK--EIPSLKRI--HGPDPNYDDVCFSGAGRDVAEIHN 344
Query: 389 --PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
P++ + F Q ++ + F +V C F
Sbjct: 345 FFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIF 382
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 146/329 (44%), Gaps = 48/329 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP F + +D GS + +VPC C QC + ++ P S + V C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCG----------NHQDPKFQPDLSDTYHPVKC 51
Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
+P C +C + D C Y Y+ E +SSSG L +D++ + S+ PQ +V
Sbjct: 52 -NPDC----TCDTENDQCTYERQYA-EMSSSSGILGEDLVSFGNMSELKPQRAV-----F 100
Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 292
GC +TG A DG+MGLG GD+S+ L + G+I +SFS+C+ E G++ G
Sbjct: 101 GCENAETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159
Query: 293 DQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQSGFQALVDSGASFTF 345
P + S P Y Y + + + N + ++DSG ++ +
Sbjct: 160 QISPPSDMVFSHSDPDRSPY--YNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAY 217
Query: 346 LPTEIYAEVVVKFDKLVSSKRISL---QGNSWKY---CYNASSEEMLKV----PDMRLIF 395
LP + + F + ++S+ L +G Y C++ + E+ ++ P + ++F
Sbjct: 218 LPEAAF----LPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVF 273
Query: 396 SKNQSFVVRNHIFSFPENEVGDHACFSYF 424
+ + + + F ++V C F
Sbjct: 274 DNGEKYSLSPENYLFKHSKVHGAYCLGVF 302
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/324 (27%), Positives = 140/324 (43%), Gaps = 23/324 (7%)
Query: 92 FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYY 151
FP +G+ F L+YT + +GTP F V +D GS++LWV C P +
Sbjct: 70 FPVDGASDPFL----VGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKT---- 121
Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGY 208
+ L LS +DP SSS+ VSCS C S +S P C Y Y + + +SGY
Sbjct: 122 SELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYG-DGSGTSGY 180
Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLL 267
+ D + + + + + GC Q+G A DG+ GLG G +SV S L
Sbjct: 181 YISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQL 240
Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
A GL FS C + SG G + T + P+ Y V ++S +
Sbjct: 241 AVQGLAPRVFSHCLKGDKSGGGIM-VLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQI 299
Query: 328 L--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
L +G ++D+G + +LP E Y+ + VS + S++ C+
Sbjct: 300 LPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQ-CFE 358
Query: 380 ASSEEMLKVPDMRLIFSKNQSFVV 403
++ ++ P + L F+ S V+
Sbjct: 359 ITAGDVDVFPQVSLSFAGGASMVL 382
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 90/345 (26%), Positives = 153/345 (44%), Gaps = 43/345 (12%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D+GS + +VPC C QC ++ P
Sbjct: 92 YYTTRLWI--GTPPQMFALIVDSGSTVTYVPCSDCEQCG----------KHQDPKFQPEM 139
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ + V C+ +C ++ C Y +Y+ E +SS G L +D++ + S+ PQ
Sbjct: 140 SSTYQPVKCNM-----DCNCDDDREQCVYEREYA-EHSSSKGVLGEDLISFGNESQLTPQ 193
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC +TG A DG++GLG GD+S+ L GLI NSF +C+ D
Sbjct: 194 RAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMD 247
Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQAL 336
G + G P+ T P Y Y + + + L+ A+
Sbjct: 248 VGGGSMILGGFDYPSDMVFTDSDPDRSPY--YNIDLTGIRVAGKQLSLHSRVFDGEHGAV 305
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK-YCYNASS----EEMLKV-P 389
+DSG ++ +LP +A + VS+ K+I ++K C+ ++ E+ K+ P
Sbjct: 306 LDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFP 365
Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
+ ++F QS+++ + F ++V C F + T +L
Sbjct: 366 SVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLL 410
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 146/329 (44%), Gaps = 48/329 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP F + +D GS + +VPC C QC + ++ P S + V C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCG----------NHQDPKFQPDLSDTYHPVKC 51
Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
+P C +C + D C Y Y+ E +SSSG L +D++ + S+ PQ +V
Sbjct: 52 -NPDC----TCDTENDQCTYERQYA-EMSSSSGILGEDLVSFGNMSELKPQRAV-----F 100
Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 292
GC +TG A DG+MGLG GD+S+ L + G+I +SFS+C+ E G++ G
Sbjct: 101 GCENAETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159
Query: 293 DQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQSGFQALVDSGASFTF 345
P + S P Y Y + + + N + ++DSG ++ +
Sbjct: 160 QISPPSDMVFSHSDPDRSPY--YNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAY 217
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKY---CYNASSEEMLKV----PDMRLIF 395
LP + + F + ++S+ L+ G Y C++ + E+ ++ P + ++F
Sbjct: 218 LPEAAF----LPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVF 273
Query: 396 SKNQSFVVRNHIFSFPENEVGDHACFSYF 424
+ + + + F ++V C F
Sbjct: 274 DNGEKYSLSPENYLFKHSKVHGAYCLGVF 302
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 51/327 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP V F+ D GS+L W CQ C C P ++ YDPS+SS+ V
Sbjct: 70 LAIGTPPVPFVALADTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPV 119
Query: 173 SCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
CS C +SR +C + PC YI YS + S G L + L + S P +V
Sbjct: 120 PCSSATCLPTWRSR-NCSNPSSPCRYIYSYS-DGAYSVGILGTETLTIGS---SVPGQTV 174
Query: 229 Q-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDE 283
SV GCG G L+ G +GLG G + SLLA+ G+ FS C F+
Sbjct: 175 SVGSVAFGCGTDNGGDSLNST---GTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFNS 226
Query: 284 NDSGSVFFGD-----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------- 328
F G GP T QST L YFV ++ +G+ L
Sbjct: 227 TMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLR 286
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
+VDSG +FT L + EVV + +L+ ++ C+ + E +
Sbjct: 287 ADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSLDSP-CFPSPDGEPF-M 344
Query: 389 PDMRLIFSKNQSFVV-RNHIFSFPENE 414
PD+ L F+ + R++ S+ E++
Sbjct: 345 PDLVLHFAGGADMRLHRDNYMSYNEDD 371
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/260 (26%), Positives = 119/260 (45%), Gaps = 33/260 (12%)
Query: 42 SKSGNVSVADSWPK---KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQ 98
S +G V +P+ + E+L L +D R +L + + + P++
Sbjct: 27 SATGVFQVRRKFPRHGGRGVAEHLAALRRHDANRHG---RLLGAVDLALGGVGLPTD--- 80
Query: 99 THFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL 158
L+YT I+IG+P + V +D GS++LWV CI+C + L L
Sbjct: 81 --------TGLYYTRIEIGSPPKGYYVQVDTGSDILWV--NCIRCD--GCPTRSGLGIEL 128
Query: 159 SEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDD 212
++YDP+ S ++ V C C + S +C S PC + Y + ++++G+ V D
Sbjct: 129 TQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYVTD 185
Query: 213 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKA 270
+ S + ++ +S+ GCG Q G L + A DG++G G D S+ S LA A
Sbjct: 186 FVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAA 244
Query: 271 GLIQNSFSICFDENDSGSVF 290
++ F+ C D G +F
Sbjct: 245 RRVRKIFAHCLDTVRGGGIF 264
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/322 (27%), Positives = 138/322 (42%), Gaps = 34/322 (10%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++T + +G+P + V +D GS++LWV C C C S L+ L ++P +SS+
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSST 171
Query: 169 SKNVSCSHPLC-----KSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
S + CS C S + C++ + PC Y Y + + +SGY V D ++ + +
Sbjct: 172 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVSDTMYFDTVMGN 230
Query: 223 APQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
++ +S++ GC Q+G A DG+ G G +SV S L G+ FS C
Sbjct: 231 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 290
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQ 334
+D+G G + + P+ Y + +ES I +S T S Q
Sbjct: 291 KGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQ 349
Query: 335 A-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSEEMLKVPDM 391
+VDSG + +L Y V VS SL +GN C+ SS P +
Sbjct: 350 GTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVTSSSVDSSFPTV 406
Query: 392 RLIFSKNQSFVVRNHIFSFPEN 413
L F + V+ PEN
Sbjct: 407 SLYFMGGVAMTVK------PEN 422
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/334 (26%), Positives = 149/334 (44%), Gaps = 39/334 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP ++F +D GS+L W QCAP + + + + YDP+ SS+
Sbjct: 96 YHMILSVGTPPLAFPAIIDTGSDLTWT-----QCAPCTTACFA---QPTPLYDPARSSTF 147
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+ C+ PLC++ S + + DY ++GYL D L + SS
Sbjct: 148 SKLPCASPLCQALPSAFRACNATGCVYDYRYAVGFTAGYLAADTLAIGDGDGDGDASSSF 207
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGS 288
+ V GC G +DGA+ G++GLG S SLL++ G+ FS C + D+G+
Sbjct: 208 AGVAFGCSTANGGD-MDGAS--GIVGLGR---SALSLLSQIGV--GRFSYCLRSDADAGA 259
Query: 289 --VFFGDQGPATQ---QSTSFL--PIGEKYDA--YFVGVESYCIGNSCLTQS----GFQA 335
+ FG T QST+ L P+ + A Y+V + +G++ L + GF A
Sbjct: 260 SPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTA 319
Query: 336 ------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY--CYNASSEEMLK 387
+VDSG +FT+L Y + F + + G + + C+ A + +
Sbjct: 320 AGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFEAGAADT-P 378
Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACF 421
VP + F+ + V + +E G AC
Sbjct: 379 VPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACL 412
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 90/188 (47%), Gaps = 12/188 (6%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
L++T I IGTP + V +D GS++LWV C P ++L L+ YDP S S
Sbjct: 89 LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRK----SNLGIELTMYDPRGSQS 144
Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ V+C C + SC S PC Y Y + +S++G+ V D L S
Sbjct: 145 GELVTCDQQFCVANYGGVLPSCTS-TSPCEYSISYG-DGSSTAGFFVTDFLQYNQVSGDG 202
Query: 224 PQSSVQSSVIIGCGRKQTGSYL-DGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+ +SV GCG K G A DG++G G + S+ S LA AG ++ F+ C D
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262
Query: 283 ENDSGSVF 290
+ G +F
Sbjct: 263 TVNGGGIF 270
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 109/421 (25%), Positives = 182/421 (43%), Gaps = 56/421 (13%)
Query: 2 VNLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEY 61
++++A+ + F ILL ++ +VH S + +++ ++P VE
Sbjct: 5 ISILALILAFAAILL--------TAAVVHCGSPASL---------LTLERAFPVNQRVE- 46
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
LE+L + D R ++ + F G+ + L++T + +G+P
Sbjct: 47 LEVLRARDQARHGRLLR-----GVVGGVVDFTVYGTSDPYL----VGLYFTKVKLGSPPR 97
Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
F V +D GS++LWV C C C S L LS +DPSSSS++ VSCSHP+C
Sbjct: 98 EFNVQIDTGSDILWVTCNSCNDCPRTSG-----LGIELSFFDPSSSSTTSLVSCSHPICT 152
Query: 181 S-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
S + C + C Y Y + + ++GY V D+L+ + + ++ +S++ G
Sbjct: 153 SLVQTTAAECSPQSNQCSYSFHYG-DGSGTTGYYVSDMLYFDTVLGDSLIANSSASIVFG 211
Query: 236 CGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-ENDSGSVFFGD 293
C Q+G A DG+ G G D+SV S L+ G+ FS C E D G
Sbjct: 212 CSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDGGGKLV-- 269
Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTF 345
G + + + P+ Y + ++S + L T + +VDSG + T+
Sbjct: 270 LGEILEPNIIYSPLVPSQSHYNLNLQSISVNGQLLPIDPAVFATSNNQGTIVDSGTTLTY 329
Query: 346 LPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
L Y V VSS + +GN CY S+ P + L F+ S V+
Sbjct: 330 LVETAYDPFVSAITATVSSSTTPVLSKGNQ---CYLVSTSVDEIFPPVSLNFAGGASMVL 386
Query: 404 R 404
+
Sbjct: 387 K 387
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 87/333 (26%), Positives = 142/333 (42%), Gaps = 48/333 (14%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
T + IGTP F + +D+GS + +VPC C QC + + P SS+
Sbjct: 90 TRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCG----------NHQDPRFQPDLSSTYS 139
Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
V C+ +C S K+ C Y Y+ E +SSSG L +DI+ + S+ PQ +V
Sbjct: 140 PVKCN-----VDCTCDSDKNQCTYERQYA-EMSSSSGVLGEDIVSFGTESELKPQRAV-- 191
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
GC +TG A DG+MGLG G +S+ L G+I +SFS+C+ D G
Sbjct: 192 ---FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGA 247
Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
V P T + Y Y + ++ + L ++DSG
Sbjct: 248 MVLGAMPAPPGMIYTHSNAVRSPY--YNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGT 305
Query: 342 SFTFLPTEIYAEVVVKFDKLVSS-----KRISLQGNSWK-YCYNASSEEMLKV----PDM 391
++ +LP + + V F VSS K+I +++K C+ + + ++ P +
Sbjct: 306 TYAYLPEQAF----VAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKV 361
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
++F Q + + F ++V C F
Sbjct: 362 DMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVF 394
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 94/385 (24%), Positives = 164/385 (42%), Gaps = 54/385 (14%)
Query: 70 WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWL----------HYTWIDIGTP 119
WK++ + +Q NN + N ++ + S+ F GN L ++ + +GTP
Sbjct: 121 WKQEVKVITIQQQNNLA-NAVVASLKSSKDEFSGNIMATLESGASLGTGEYFIDMFVGTP 179
Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
+ LD GS+L W IQC P Y ++N Y+P+ SSS +N+SC P C
Sbjct: 180 PKHVWLILDTGSDLSW-----IQCDPC----YDCFEQNGPHYNPNESSSYRNISCYDPRC 230
Query: 180 KSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
+ SS CK+ CPY DY+ ++ + ++ ++ + V+
Sbjct: 231 QLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVM 290
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----NDSGS 288
GCG G + ++GLG G +S PS L + +SFS C + + S
Sbjct: 291 FGCGHWNKGFFHGAGG---LLGLGRGPLSFPSQLQ--SIYGHSFSYCLTDLFSNTSVSSK 345
Query: 289 VFFGDQGPATQQS----TSFLPIGEKYDA--YFVGVESYCIGNSCL----------TQSG 332
+ FG+ T L E D Y++ ++S +G L ++
Sbjct: 346 LIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGV 405
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++DSG++ TF P Y + F+K + ++I+ CYN S +++PD
Sbjct: 406 GGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVELPDYG 465
Query: 393 LIFSKNQ--SFVVRNHIFSFPENEV 415
+ F+ +F N+ + + +EV
Sbjct: 466 IHFADGAVWNFPAENYFYQYEPDEV 490
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/335 (24%), Positives = 144/335 (42%), Gaps = 44/335 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS++ +VPC C QC ++ P
Sbjct: 12 YYTTRLWI--GTPPQRFALIVDTGSSVTYVPCSSCEQCG----------RHQDPKFQPDL 59
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ ++V C+ +C K C Y Y+ E ++SSG L +DI+ + S APQ
Sbjct: 60 SSTYQSVKCN-----IDCNCDDEKQQCVYERQYA-EMSTSSGVLGEDIISFGNLSALAPQ 113
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---D 282
+V GC +TG A DG+MG+G GD+S+ L G+I +SFS+C+
Sbjct: 114 RAV-----FGCENMETGDLYSQHA-DGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMG 167
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
V G P+ + P+ Y Y + ++ + N + +
Sbjct: 168 IGGGAMVLGGISPPSNMVFSQSDPVRSPY--YNIDLKEIHVAGKPLPLNPTVFDGKHGTI 225
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV----P 389
+DSG ++ +LP + K + S + ++G Y C++ + ++ ++ P
Sbjct: 226 LDSGTTYAYLPEAAFVSFKDAIMKELHSLK-PIRGPDPNYNDICFSGAGSDISQLSSSFP 284
Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
+ ++F Q ++ + F ++V C F
Sbjct: 285 AVEMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIF 319
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 124/291 (42%), Gaps = 29/291 (9%)
Query: 101 FFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
+G+ + + L+Y ++IG P + + +D GS+L W+ C C C + Y
Sbjct: 56 LYGDVYPHGLYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY------ 109
Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLV 210
+ +K V C LC S + C S + C Y+ Y+ + SS+G LV
Sbjct: 110 -------RPTKNKLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYA-DQGSSTGVLV 161
Query: 211 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 270
+D L + A S V+ S+ GCG Q S + + DGV+GLG G VS+ S +
Sbjct: 162 NDSFAL----RLANGSVVRPSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQH 217
Query: 271 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLT 329
G+ +N C G +FFGD Q+ T + Y+ G S G+ L
Sbjct: 218 GVTKNVVGHCLSLRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLR 277
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
+ + DSG+SFT+ + Y +V +S + S C+
Sbjct: 278 VKLTEVVFDSGSSFTYFAAQPYQALVTALKGDLSRTLKEVSDPSLPLCWKG 328
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/258 (29%), Positives = 115/258 (44%), Gaps = 27/258 (10%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+YT I +G P + + +D GS+L W+ C C CA Y +
Sbjct: 203 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIV-------- 254
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
K++ C L +++ C++ K C Y +Y+ + +SS G L D +H+ + +
Sbjct: 255 PPKDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DRSSSMGVLARDDMHIITTNG----GR 307
Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
+ + GC Q G L A DG++GL +S+PS LA G+I N F C D N
Sbjct: 308 EKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPN 367
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSG-----FQALVD 338
G +F GD TS PI D F + G+ L+ G Q + D
Sbjct: 368 GGGYMFLGDDYVPRWGMTS-TPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVIFD 426
Query: 339 SGASFTFLPTEIYAEVVV 356
SG+S+T+LP EIY ++
Sbjct: 427 SGSSYTYLPDEIYKNLIA 444
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/258 (29%), Positives = 115/258 (44%), Gaps = 27/258 (10%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+YT I +G P + + +D GS+L W+ C C CA Y +
Sbjct: 204 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIV-------- 255
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
K++ C L +++ C++ K C Y +Y+ + +SS G L D +H+ + +
Sbjct: 256 PPKDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DRSSSMGVLARDDMHIITTNG----GR 308
Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
+ + GC Q G L A DG++GL +S+PS LA G+I N F C D N
Sbjct: 309 EKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPN 368
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSG-----FQALVD 338
G +F GD TS PI D F + G+ L+ G Q + D
Sbjct: 369 GGGYMFLGDDYVPRWGMTS-TPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVIFD 427
Query: 339 SGASFTFLPTEIYAEVVV 356
SG+S+T+LP EIY ++
Sbjct: 428 SGSSYTYLPDEIYKNLIA 445
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 137/320 (42%), Gaps = 27/320 (8%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L++T + +G+P F V +D GS++LWV C C C S L L+ +D SSSS
Sbjct: 65 LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSG-----LGIQLNFFDSSSSS 119
Query: 168 SSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
++ V CS P+C S + C + C Y Y + + +SGY V D L+ +
Sbjct: 120 TAGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQYE-DGSGTSGYYVSDTLYFDAILGE 178
Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
+ + + ++ GC Q+G + A DG+ G G G++SV S L+ G+ FS C
Sbjct: 179 SLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCL 238
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGF 333
+ + G + + P+ Y + ++S + L T +
Sbjct: 239 -KGEGIGGGILVLGEILEPGMVYSPLVPSQPHYNLNLQSIAVNGKLLPIDPSVFATSNSQ 297
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNASSEEMLKVPDM 391
+VDSG + +L E Y V + +VS + +GN CY S+ P
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISKGNQ---CYLVSTSVSQMFPLA 354
Query: 392 RLIFSKNQSFVVRNHIFSFP 411
F+ S V++ + P
Sbjct: 355 SFNFAGGASMVLKPEDYLIP 374
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 89/343 (25%), Positives = 145/343 (42%), Gaps = 48/343 (13%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
T + IGTP F + +D+GS + +VPC C QC + + P SS+
Sbjct: 90 TRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCG----------NHQDPRFQPDLSSTYS 139
Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
V C+ +C S K+ C Y Y+ E +SSSG L +DI+ + S+ PQ +V
Sbjct: 140 PVKCN-----VDCTCDSDKNQCTYERQYA-EMSSSSGVLGEDIVSFGTESELKPQRAV-- 191
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS-- 288
GC +TG A DG+MGLG G +S+ L G+I +SFS+C+ D G
Sbjct: 192 ---FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGA 247
Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
V P T + Y Y + ++ + L ++DSG
Sbjct: 248 MVLGAMPAPPGMIYTHSNAVRSPY--YNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGT 305
Query: 342 SFTFLPTEIYAEVVVKFDKLVSS-----KRISLQGNSWK-YCYNASSEEMLKV----PDM 391
++ +LP + + V F VSS K+I ++K C+ + + ++ P +
Sbjct: 306 TYAYLPEQAF----VAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKV 361
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
++F Q + + F ++V C F + T +L
Sbjct: 362 DMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLL 404
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 123/278 (44%), Gaps = 30/278 (10%)
Query: 95 EGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYT 152
EGS + Y YT I+IG P + + +D GS L W+ C C C Y
Sbjct: 117 EGSTAAVLPERQY---YTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYK 173
Query: 153 SLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 212
N+ P S + + + C +CK C Y Y+ + +SS+G L D
Sbjct: 174 PAKENIV---PPRDSHCQELQGNQNYC---DTCKQ----CDYEIAYA-DRSSSAGVLARD 222
Query: 213 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 271
+ L + A ++ GC Q G L A+ DG++GL G +S+P+ LAK G
Sbjct: 223 NMELIT----ADGERENMDLVFGCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQG 278
Query: 272 LIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIGE-KYDAYFVGVESYCIGNSCL 328
+I N F C + SGS +F GD + +++P+ D Y V+ G L
Sbjct: 279 IISNVFGHCIATDPSGSAYMFLGDDY-VPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQEL 337
Query: 329 T---QSG--FQALVDSGASFTFLPTEIYAEVVVKFDKL 361
Q+G Q + DSG+S+T+ P EIY ++ + +
Sbjct: 338 NVREQAGKLTQVIFDSGSSYTYFPHEIYTSLITSLEAV 375
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 156/362 (43%), Gaps = 63/362 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE-YDPSSSSS 168
++ I++G P LV +D GS+L+W +QC P Y R ++ YDP SSS+
Sbjct: 88 YFAVINVGDPPTRALVVIDTGSDLIW-----LQCVPCRHCY-----RQVTPLYDPRSSST 137
Query: 169 SKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+ + C+ P C+ C + C Y+ Y + ++SSG L D L P
Sbjct: 138 HRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYG-DGSASSGDLATDRLVF-------PD 189
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--- 282
+ +V +GCG G L+ AA G++G+G G +S P+ LA A + FS C
Sbjct: 190 DTHVHNVTLGCGHDNVG-LLESAA--GLLGVGRGQLSFPTQLAPA--YGHVFSYCLGDRL 244
Query: 283 ---ENDSGSVFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCLTQSGFQ-- 334
+N S + FG ST+F P+ + Y+V + + +G +T GF
Sbjct: 245 SRAQNGSSYLVFGRT--PEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVT--GFSNA 300
Query: 335 ------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS----KRISLQGNSWKYCY 378
+VDSG + + + YA V FD ++ ++++ + + + CY
Sbjct: 301 SLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACY 360
Query: 379 ----NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
N + ++VP + L F+ + + P + GD + L+ G+
Sbjct: 361 DLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPV-QGGDRRTYFCLGLQAADDGLN 419
Query: 435 IL 436
+L
Sbjct: 420 VL 421
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 156/364 (42%), Gaps = 54/364 (14%)
Query: 31 RFSDEA-KERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTR----VKLQSNNNS 85
R SDE ++ S+ V + K + E+ E +L+ D + + + L+ N
Sbjct: 107 RVSDERNRDDDSSRETTSFVFPVYHKLRAREFHERILAEDLGLENGKFVESMDLELVNPV 166
Query: 86 SRNQLLFPSEGS---QTHFF--GNQFY--WLHYTWIDIGTPNVS--FLVALDAGSNLLWV 136
N +L S GS T F G Y L+YT I +G P + + +D GS+L W+
Sbjct: 167 KVNDVLSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSDLTWI 226
Query: 137 PCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC------KSRSSCKSL 188
C C CA + Y NL V S P C + C+S
Sbjct: 227 QCDAPCTSCAKGANQLYKPRKDNL-------------VRSSEPFCVEVQRNQLTEHCESC 273
Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 248
C Y +Y+ + + S G L D HL K S +S ++ GCG Q G L+
Sbjct: 274 HQ-CDYEIEYA-DHSYSMGVLTKDKFHL----KLHNGSLAESDIVFGCGYDQQGLLLNTL 327
Query: 249 -APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFG-DQGPATQQSTSF 304
DG++GL +S+PS LA G+I N C D N G +F G D P+ ++
Sbjct: 328 LKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPS--HGMTW 385
Query: 305 LPI--GEKYDAYFVGVESYCIGNSCLTQSGF-----QALVDSGASFTFLPTEIYAEVVVK 357
+P+ + Y + V GN+ L+ G + L D+G+S+T+ P + Y+++V
Sbjct: 386 VPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTS 445
Query: 358 FDKL 361
++
Sbjct: 446 LQEV 449
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 79/290 (27%), Positives = 129/290 (44%), Gaps = 31/290 (10%)
Query: 103 GNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
GN F +Y+ + IG+P +F +D GS+L WV C AP S +L NL +Y
Sbjct: 41 GNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCD----APCSGC---TLPPNL-QY 92
Query: 162 DPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
P + + CS+P+C + + C + ++ C Y Y+ + SS G LV D L
Sbjct: 93 KPKGNI----IPCSNPICTALHWPNKPHCPNPQEQCDYEVKYA-DQGSSMGALVTDQFPL 147
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAGLI 273
K S +Q V GCG Q SY P GV+GLG G + + + L AGL
Sbjct: 148 ----KLVNGSFMQPPVAFGCGYDQ--SYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLT 201
Query: 274 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 333
+N C G +FFGD ++ P+ + + Y G G
Sbjct: 202 RNVVGHCLSSKGGGFLFFGDNL-VPSIGVAWTPLLSQDNHYTTGPADLLFNGKPTGLKGL 260
Query: 334 QALVDSGASFTFLPTEIYAEVV--VKFDKLVSSKRISLQGNSWKYCYNAS 381
+ + D+G+S+T+ ++ Y ++ + D VS +++ + + C+ +
Sbjct: 261 KLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGA 310
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 112/234 (47%), Gaps = 19/234 (8%)
Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLG 259
+ +S++GYLV D++HL + + S ++I GCG KQ+G + AA DG+MG G
Sbjct: 4 DGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQS 63
Query: 260 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 319
+ S S LA G ++ SF+ C D N+ G +F G P+ K Y V +
Sbjct: 64 NSSFISQLASQGKVKRSFAHCLDNNNGGGIF--AIGEVVSPKVKTTPMLSKSAHYSVNLN 121
Query: 320 SYCIGNSC--LTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
+ +GNS L+ + F + ++DSG + +LP +Y ++ + L S ++L
Sbjct: 122 AIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEI--LASHPELTLHT 179
Query: 372 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEVGDHACFSY 423
+ ++++ + P + F K+ S V R ++F E D CF +
Sbjct: 180 VQESFTCFHYTDKLDRFPTVTFQFDKSVSLAVYPREYLFQVRE----DTWCFGW 229
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 119/263 (45%), Gaps = 29/263 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+YT I +G P + + +D GS+L W+ C C CA Y +
Sbjct: 194 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIV-------- 245
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+++ C L ++ C + K C Y +Y+ + +SS G L D +H+ + +
Sbjct: 246 PPRDLLCQE-LQGDQNYCATCKQ-CDYEIEYA-DRSSSMGVLAKDDMHMIATNG----GR 298
Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
+ + GC Q G L A DG++GL +S+PS LA G+I N F C + N
Sbjct: 299 EKLDFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEPN 358
Query: 285 DSGSVFFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSG-----FQALVD 338
G +F GD + ++ PI G + Y + G+ L G Q + D
Sbjct: 359 GGGYMFLGDDY-VPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAGSSIQVIFD 417
Query: 339 SGASFTFLPTEIYAEVV--VKFD 359
SG+S+T+LP EIY ++V +K+D
Sbjct: 418 SGSSYTYLPDEIYKKLVTAIKYD 440
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 76/280 (27%), Positives = 124/280 (44%), Gaps = 48/280 (17%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
IGTP +D G++ +W QC C P L++ + PS SS+ K + C+
Sbjct: 96 IGTPPFQLYSLIDTGNDNIWF--QCKPCKP-------CLNQTSPMFHPSKSSTYKTIPCT 146
Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
P+CK+ + YL D L L S + P S +++IG
Sbjct: 147 SPICKN----------------------ADGHYLGVDTLTLNS-NNGTPISF--KNIVIG 181
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVF 290
CG + G L+G G +GL G +S S L + I FS C EN S +
Sbjct: 182 CGHRNQGP-LEGYV-SGNIGLARGPLSFISQLNSS--IGGKFSYCLVPLFSKENVSSKLH 237
Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----TQSGFQALVDSGASFTFL 346
FGD+ + T PI E+ + YFV +E++ +G+ + + + +++DSG + T L
Sbjct: 238 FGDKSTVSGLGTVSTPIKEE-NGYFVSLEAFSVGDHIIKLENSDNRGNSIIDSGTTMTIL 296
Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
P ++Y+ + +V KR+ + CY +S +L
Sbjct: 297 PKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLL 336
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 134/305 (43%), Gaps = 46/305 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + D GS+++W QC P + Y ++L ++PS S++ + VS
Sbjct: 89 LSVGTPPFPIIAVADTGSDIIWT-----QCVPCTNCY----QQDLPMFNPSKSTTYRKVS 139
Query: 174 CSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSS 227
CS P+C +SC S K C Y Y +++ S G D L + S S P+++
Sbjct: 140 CSSPVCSFTGEDNSC-SFKPDCTYSISYG-DNSHSQGDFAVDTLTMGSTSGRVVAFPRTA 197
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----D 282
IGCG GS+ A G++GLGLG S+ + A + FS C D
Sbjct: 198 ------IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGND 247
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYF--------VGVES--YCIGNSCLTQ 330
+ S + FG + PI +K+ +++ VG + Y NS L
Sbjct: 248 DGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGG 307
Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
++DSG + T LP ++Y ++ +R +YC+ ++++ KVP
Sbjct: 308 KA-NIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDD-YKVPF 365
Query: 391 MRLIF 395
+ + F
Sbjct: 366 IAMHF 370
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 107/441 (24%), Positives = 182/441 (41%), Gaps = 67/441 (15%)
Query: 4 LVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
+V +C+ L+ D FS +++HR S + P E
Sbjct: 12 IVLLCLYINISFLNALDGGGFSVEIIHRDSSRS-----------------PYYRPTETQF 54
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVS 122
++N +R R N+ ++ L+ + +++ +Q Y + Y+ +GTP
Sbjct: 55 QRVANALRRSINRA-----NHFNKPNLVASTNTAESTVIASQGEYLMSYS---VGTPPFQ 106
Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS- 181
L +D GS+++W +QC P Y ++ +DPS S + K + CS +C+S
Sbjct: 107 ILGIVDTGSDIIW-----LQCQPCEDCY----NQTTPIFDPSQSKTYKTLPCSSNICQSV 157
Query: 182 --RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ-SSVIIGCGR 238
+SC S D C Y Y +++ S G L + L L S SSVQ +IGCG
Sbjct: 158 QSAASCSSNNDECEYTITYG-DNSHSQGDLSVETLTLGSTDG----SSVQFPKTVIGCGH 212
Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGD 293
G++ +G +GLG V + + I FS C N S + FGD
Sbjct: 213 NNKGTF----QREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGD 268
Query: 294 QGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGF---------QALVDSGAS 342
+ + + T PI K YF+ +E++ +G++ + ++DSG +
Sbjct: 269 EAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGSSSFESSGGEGNIIIDSGTT 328
Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
T LP + Y + + +R+ + CY +S + L VP + F V
Sbjct: 329 LTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRTTSSDELNVPVITAHFKGAD--V 386
Query: 403 VRNHIFSFPENEVGDHACFSY 423
N I +F E + G CF++
Sbjct: 387 ELNPISTFIEVDEG-VVCFAF 406
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 143/331 (43%), Gaps = 36/331 (10%)
Query: 96 GSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLD 155
G ++ F +L Y +++GTP L D GS+L+WV C A ++
Sbjct: 91 GVESKIITRSFEYLMY--VNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNV- 147
Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDD 212
+ P+ SS+ +SC C+ S++SC + + C Y YS D S + G L +
Sbjct: 148 ----VFQPTRSSTYSQLSCQSNACQALSQASCDADSE-CQY--QYSYGDGSRTIGVLSTE 200
Query: 213 ILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
SF + V+ V GC G++ DG++GLG G S+ S L
Sbjct: 201 TF---SFVDGGGKGQVRVPRVNFGCSTASAGTFRS----DGLVGLGAGAFSLVSQLGATT 253
Query: 272 LIQNSFSIC----FDENDSGSVFFGDQGPATQQSTSFLP-IGEKYDAYF-VGVESYCIGN 325
I S C +D N S ++ FG + ++ + P + D+Y+ V +ES +G
Sbjct: 254 HIDRKLSYCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGG 313
Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA---SS 382
+ + +VDSG + TFL + +V + ++ + +R+ + CY+ S
Sbjct: 314 QEVATHDSRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSE 373
Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN 413
+ +PD+ L F + +R PEN
Sbjct: 374 TDNFGIPDVTLRFGGGAAVTLR------PEN 398
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 158/367 (43%), Gaps = 45/367 (12%)
Query: 59 VEYLELLLSNDWKRQKTRVKLQSNNN---SSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
V++ E++ + + + KL N+ S P++ T GN + I
Sbjct: 83 VDHDEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGITLGSGN-----YIVTIG 137
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
IGTP + D GS+L W QC P S Y+ + +++PSSSS+ +NVSCS
Sbjct: 138 IGTPKHDLSLVFDTGSDLTWT-----QCEPCLGSCYS---QKEPKFNPSSSSTYQNVSCS 189
Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
P+C+ SC + C Y Y + + + G+L + L S V V G
Sbjct: 190 SPMCEDAESCSASN--CVYSIGYG-DKSFTQGFLAKEKFTLT-------NSDVLEDVYFG 239
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FSIC---FDENDSGSVFF 291
CG G + DGV GL SL A+ N+ FS C F N +G + F
Sbjct: 240 CGENNQGLF------DGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTF 293
Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGVE--SYCIGNS--CLTQSGFQ---ALVDSGASFT 344
G G +S F PI A+ G++ +G+ +T + F A++DSG FT
Sbjct: 294 GSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFT 351
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
LPT++YAE+ F + +SS + + + CY+ + + + P + F+ +
Sbjct: 352 RLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVELD 411
Query: 405 NHIFSFP 411
S P
Sbjct: 412 GSGISLP 418
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/318 (25%), Positives = 138/318 (43%), Gaps = 46/318 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + D GS+++W QC P + Y ++L ++PS S++ + VS
Sbjct: 89 LSVGTPPFPIIAVADTGSDIIWT-----QCEPCTNCY----QQDLPMFNPSKSTTYRKVS 139
Query: 174 CSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSS 227
CS P+C +SC S K C Y Y +++ S G D L + S S P+++
Sbjct: 140 CSSPVCSFTGEDNSC-SFKPDCTYSISYG-DNSHSQGDFAVDTLTMGSTSGRVVAFPRTA 197
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----D 282
IGCG GS+ A G++GLGLG S+ + A + FS C D
Sbjct: 198 ------IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGND 247
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYF--------VGVES--YCIGNSCLTQ 330
+ S + FG + PI +K+ +++ VG + Y NS L
Sbjct: 248 DGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGG 307
Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
++DSG + T LP ++Y ++ +R +YC+ ++++ KVP
Sbjct: 308 KA-NIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDD-YKVPF 365
Query: 391 MRLIFSKNQSFVVRNHIF 408
+ + F + R ++
Sbjct: 366 IAMHFEGANLRLQRENVL 383
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 152/336 (45%), Gaps = 46/336 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS + +VPC C QC ++ P S
Sbjct: 111 YYTTRLWI--GTPPQMFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFQPES 158
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ + V C+ +C + C Y Y+ E ++SSG L +D++ + S+ APQ
Sbjct: 159 SSTYQPVKCT-----IDCNCDGDRMQCVYERQYA-EMSTSSGVLGEDVISFGNQSELAPQ 212
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC +TG A DG+MGLG GD+S+ L +I +SFS+C+ D
Sbjct: 213 RAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMD 266
Query: 286 --SGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
G++ G P + + ++ P Y Y + ++ + N+ + +
Sbjct: 267 VGGGAMVLGGISPPSDMTFAYSDPDRSPY--YNIDLKEMHVAGKRLPLNANVFDGKHGTV 324
Query: 337 VDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV---- 388
+DSG ++ +LP + + +VK +L S K+IS ++ C++ + ++ ++
Sbjct: 325 LDSGTTYAYLPEAAFLAFKDAIVK--ELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSF 382
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
P + ++F + + + F ++V C F
Sbjct: 383 PVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIF 418
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 121/265 (45%), Gaps = 26/265 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNL-SEYDPSS 165
L++T + +G P S+ + +D GS+L W+ C CI C + Y N+ S D
Sbjct: 191 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHVLYKPTRSNVVSSVDALC 250
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
KN H +SL C Y Y+ + +SS G LV D LHL + +
Sbjct: 251 LDVQKNQKNGH-------HDESLLQ-CDYEIQYA-DHSSSLGVLVRDELHLVTTNG---- 297
Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
S + +V+ GCG Q G L+ DG+MGL VS+P LA GLI+N C +
Sbjct: 298 SKTKLNVVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSND 357
Query: 285 DSGS--VFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLT---QSGFQALV 337
+G +F GD +++P+ D Y + GN L QS +V
Sbjct: 358 GAGGGYMFLGDDF-VPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLRFDGQSKVGKMV 416
Query: 338 -DSGASFTFLPTEIYAEVVVKFDKL 361
DSG+S+T+ P E Y ++V +++
Sbjct: 417 FDSGSSYTYFPKEAYLDLVASLNEV 441
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/253 (28%), Positives = 113/253 (44%), Gaps = 23/253 (9%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+YT I +G P + + +D GS+L W+ C C CA Y + P S
Sbjct: 191 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIV---PPRDS 247
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+ + C++ CK C Y +Y+ + +SS G L D +HL + +
Sbjct: 248 LCQELQGDQNYCET---CKQ----CDYEIEYA-DRSSSMGVLAKDDMHLIATNG----GR 295
Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
+ + GC Q G L A DG++GL +S+PS LA G+I N F C + N
Sbjct: 296 EKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRETN 355
Query: 285 DSGSVFFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCL-TQSGFQALVDSGAS 342
G +F GD + ++ PI G + Y + G+ L + Q + DSG+S
Sbjct: 356 GGGYMFLGDDY-VPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGSS 414
Query: 343 FTFLPTEIYAEVV 355
+T+LP E+Y ++
Sbjct: 415 YTYLPEEMYKNLI 427
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 159/367 (43%), Gaps = 45/367 (12%)
Query: 59 VEYLELLLSNDWKRQKTRVKLQSNNN---SSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
V++ E++ + + + KL N+ S P++ T GN + I
Sbjct: 83 VDHDEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGITLGSGN-----YIVTIG 137
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
IGTP + D GS+L W QC P S Y+ + +++PSSSS+ +NVSCS
Sbjct: 138 IGTPKHDLSLVFDTGSDLTWT-----QCEPCLGSCYS---QKEPKFNPSSSSTYQNVSCS 189
Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
P+C+ SC + C Y Y + + + G+L + L S V V G
Sbjct: 190 SPMCEDAESCSASN--CVYSIVYG-DKSFTQGFLAKEKFTLT-------NSDVLEDVYFG 239
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FSIC---FDENDSGSVFF 291
CG G + DGV GL SL A+ N+ FS C F N +G + F
Sbjct: 240 CGENNQGLF------DGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTF 293
Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGVE--SYCIGNS--CLTQSGFQ---ALVDSGASFT 344
G G +S F PI A+ G++ +G+ +T + F A++DSG FT
Sbjct: 294 GSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFT 351
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
LPT++YAE+ F + +SS + + + CY+ + + + P + F+ + +
Sbjct: 352 RLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELD 411
Query: 405 NHIFSFP 411
S P
Sbjct: 412 GSGISLP 418
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 104/406 (25%), Positives = 174/406 (42%), Gaps = 56/406 (13%)
Query: 53 WPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY- 111
W KK LL ++ + Q ++++++ +S+ Q + ++ T G + L+Y
Sbjct: 86 WGKK----MRRALLLDNIRVQSLQLRIKAMTSSTTEQSVSETQIPLTS--GIKLETLNYI 139
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
+++G N+S +V D GS+L WV QC P + Y ++ YDPS SSS K
Sbjct: 140 VTVELGGKNMSLIV--DTGSDLTWV-----QCQPCRSCY----NQQGPLYDPSVSSSYKT 188
Query: 172 VSCSHPLCKSRSSCKS-----------LKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
V C+ C+ + +K C Y+ Y + + + G L + + L
Sbjct: 189 VFCNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYG-DGSYTRGDLASESIVLG--- 244
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+ +++ GCGR G + G+MGLG VS+ S K FS C
Sbjct: 245 -----DTKLENLVFGCGRNNKGLF---GGASGLMGLGRSSVSLVSQTLKT--FNGVFSYC 294
Query: 281 ---FDENDSGSVFFGDQGPATQQSTS--FLPIGEK---YDAYFVGVESYCIGNSCLTQSG 332
++ SG++ FG+ + STS + P+ + Y + + IG L
Sbjct: 295 LPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLS 354
Query: 333 FQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
F L+DSG T LP IY V +F K S + + C+N +S E + +P
Sbjct: 355 FGRGILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYEDISIPT 414
Query: 391 MRLIFSKNQSFVVR-NHIFSFPENEVGDHACFSYFTLEY-NFTGIL 434
+++IF N V +F F + + C + +L Y N GI+
Sbjct: 415 IKMIFEGNAELEVDVTGVFYFVKPDA-SLVCLALASLSYENEVGII 459
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/324 (24%), Positives = 136/324 (41%), Gaps = 37/324 (11%)
Query: 103 GNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
GN F +Y+ + IG P +F +D GS++ WV C C C +L L
Sbjct: 46 GNVFPLGYYSVLLQIGNPPKAFEFDIDTGSDITWVQCDAPCTGC---------NLPPKL- 95
Query: 160 EYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
+Y P ++ V CS P+C + C + K+ C Y +Y+ + +S ++D
Sbjct: 96 QYKPKGNT----VPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFP 151
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAG 271
K S++Q + GCG Q SY P GV+GLG G + + + L AG
Sbjct: 152 F-----KLLNGSAMQPRLAFGCGYDQ--SYPSAHPPPATAGVLGLGRGKIGLLTQLVSAG 204
Query: 272 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 331
L +N C G +FFGD ++ P+ + Y G
Sbjct: 205 LTRNVVGHCLSSKGGGYLFFGDTL-IPSLGVAWTPLLPPDNHYTTGPAELLFNGKPTGLK 263
Query: 332 GFQALVDSGASFTFLPTEIYAEVV--VKFDKLVSSKRISLQGNSWKYCYNASS--EEMLK 387
G + + D+G+S+T+ ++ Y +V + D VS +++ + + C+ + + +L+
Sbjct: 264 GLKLIFDTGSSYTYFNSKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLE 323
Query: 388 VPDMRLIFSKNQSFVVRNHIFSFP 411
V + + N + RN P
Sbjct: 324 VKNFFKTITINFTNARRNTQLQIP 347
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 142/330 (43%), Gaps = 43/330 (13%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP F + +D GS + +VPC C C A + + P +SSS + VSC
Sbjct: 105 IGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFD-------PRFKPDNSSSYQTVSC 157
Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
+ P C ++ C + C Y Y+ E +SS G L D+L + S+ P ++
Sbjct: 158 NSPDCITKM-CDARVHQCKYERVYA-EMSSSKGVLGKDLLGFGNGSRLQPHP-----LLF 210
Query: 235 GCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--SGSVFF 291
GC +TG YL A DG+MGLG G +S+ L G +++SFS+C+ D GS+
Sbjct: 211 GCETAETGDLYLQHA--DGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGSMVL 268
Query: 292 GDQGPATQQSTSFLPIGEKYDAYF------VGVESYCIGNSCLTQSG-FQALVDSGASFT 344
G P + F Y+ + V+ + +G ++DSG ++
Sbjct: 269 GAIPPPP--AMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFNGRLGTVLDSGTTYA 326
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKY---CYNASSEEMLKV----PDMRLI 394
+LP + + F ++ + SLQ G Y C+ + + + P + +
Sbjct: 327 YLPDKAFD----AFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDFV 382
Query: 395 FSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
FS NQ + + F +V C +F
Sbjct: 383 FSGNQKVFLAPENYLFKHTKVPGAYCLGFF 412
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/314 (29%), Positives = 140/314 (44%), Gaps = 44/314 (14%)
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFY-WLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--C 140
N+ +N +F + GN + L+Y + IG P + + +D GS+L W+ C C
Sbjct: 2 NADKNATVF------SQLRGNIYPDGLYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPC 55
Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYI 195
CA S L YDP + + V C PLC +C C Y
Sbjct: 56 RSCA--------SGPHGL--YDPKKA---RLVDCRVPLCALVQQGGSYACGGPVRQCDYD 102
Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVM 254
+Y+ + +S+ G L++D + L + +S +++ IIGCG Q G+ A+ DGVM
Sbjct: 103 VEYA-DGSSTMGVLMEDTITL--LLTNGTRS--KTTAIIGCGYDQQGTLAQTPASTDGVM 157
Query: 255 GLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQ-GPATQQSTSFLPIGEKY 311
GL +S+PS LAK G+++N C N G +FFGD PA ++ PI K
Sbjct: 158 GLSSAKISLPSQLAKKGIVRNVIGHCLAGGSNGGGYLFFGDSLVPAL--GMTWTPIMGKS 215
Query: 312 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK---RIS 368
+G +S + G + DSG SFT+L E Y V+ + V RI
Sbjct: 216 ITGNIGGKSGDADDKTGDIGGV--MFDSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIK 273
Query: 369 LQGNSWKYCYNASS 382
N+ +C+ S
Sbjct: 274 TD-NTLPFCWRGPS 286
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/310 (25%), Positives = 131/310 (42%), Gaps = 36/310 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y + +G+P + + +D GS+L W +QC P + D +DPS+S +
Sbjct: 13 YYVKVGLGSPARYYSMIVDTGSSLSW-----LQCKPCVVYCHVQAD---PLFDPSASKTY 64
Query: 170 KNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
K++SC+ C S C++ + C Y A Y + + S GYL D+L LA
Sbjct: 65 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYG-DSSYSMGYLSQDLLTLAP---- 119
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
S + GCG+ G + A G++GLG +S+ ++ +FS C
Sbjct: 120 ---SQTLPGFVYGCGQDSEGLFGRAA---GILGLGRNKLSMLGQVSSK--FGYAFSYCLP 171
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCLTQSGFQ----A 335
G + + F P+ YF+ + + +G L + Q
Sbjct: 172 TRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT 231
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLI 394
++DSG T LP +Y F K++SSK G S C+ + ++M VP++RLI
Sbjct: 232 IIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRLI 291
Query: 395 FSKNQSFVVR 404
F +R
Sbjct: 292 FQGGADLNLR 301
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 85/330 (25%), Positives = 138/330 (41%), Gaps = 38/330 (11%)
Query: 60 EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL-FPSEGSQTHFFGNQFYWLHYTWIDIGT 118
E+ E+L ++D R + S N ++ F +G+ + L+YT I++GT
Sbjct: 4 EHFEMLKAHDRAR----------HGRSLNTIVDFTLQGTADPYVAG----LYYTRIELGT 49
Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
P F V +D GS++LWV C+ PL++ L L+ +DP SS++ +SC
Sbjct: 50 PPRPFYVQIDTGSDILWVNCKPCNACPLTS----GLGVALNFFDPRGSSTASPLSCIDSK 105
Query: 179 CK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C S S C + + C Y +Y + + + GY V D + ++ + +
Sbjct: 106 CVSSNQISESVCTTDRY-CGYSFEYG-DGSGTLGYYVSDEFDYNQYVNQYVTNNASAKIT 163
Query: 234 IGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
GC Q+G A DG+ G G D+SV S L GL FS C + D G
Sbjct: 164 FGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADPGGGIL- 222
Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFT 344
G T+ + PI Y + ++ + L T + ++D G +
Sbjct: 223 VLGEITEPGMVYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFATTNTRGTIIDCGTTLA 282
Query: 345 FLPTEIYAEVVVKFDKLV--SSKRISLQGN 372
+L E Y V V S++ L+GN
Sbjct: 283 YLAEEAYEPFVNTIIAAVSQSTQPFMLKGN 312
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/330 (26%), Positives = 142/330 (43%), Gaps = 35/330 (10%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYT-SLDRNLSEYDPSSSSSSKNVS 173
IGTP F + +D GS + +VPC C C AS+ T L + P +SSS + +
Sbjct: 46 IGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENSSSYQKIG 105
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C C + C S C Y Y+ E ++S G L D+L P S +QS ++
Sbjct: 106 CRSSDCIT-GLCDSNSHQCKYERMYA-EMSTSKGVLGKDLLDFG------PASRLQSQLL 157
Query: 234 -IGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGS 288
GC ++G YL A DG+MGLG G +S+ L G I++SFS+C+ DE
Sbjct: 158 SFGCETAESGDLYLQVA--DGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGGGSM 215
Query: 289 VFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIG-NSCLTQSGFQALVDSGASFT 344
V P+ P Y + + V+ + +S + F ++DSG ++
Sbjct: 216 VLGAIPAPSGMVFAKSDPRRSNYYNLELTEIQVQGASLKLDSNVFNGKFGTILDSGTTYA 275
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDMRLI 394
+LP + F V ++ SLQ N CY + + ++ P + +
Sbjct: 276 YLPDRAFE----AFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDTKELGKHFPLVDFV 331
Query: 395 FSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
F++NQ + + F +V C +F
Sbjct: 332 FAENQKVSLAPENYLFKHTKVPGAYCLGFF 361
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 76/268 (28%), Positives = 120/268 (44%), Gaps = 40/268 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
I+IG P + + LD GS+L W+ C C++C L P SS
Sbjct: 42 INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 87
Query: 172 VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
+ C+ PLCK S C++ + C Y +Y+ + SS G LV D+ FS + Q
Sbjct: 88 IPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLVRDV-----FSMNYTQG 140
Query: 227 -SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+ + +GCG Q DGV+GLG G VS+ S L G ++N C
Sbjct: 141 LRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLG 200
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQALV---DS 339
G +FFGD + + S+ P+ +Y ++ S +G L +G + L+ DS
Sbjct: 201 GGILFFGDDLYDSSR-VSWTPMSREYSKHY----SPAMGGELLFGGRTTGLKNLLTVFDS 255
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRI 367
G+S+T+ ++ Y V + +S K +
Sbjct: 256 GSSYTYFNSKAYQAVTYLLKRELSGKPL 283
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 80/304 (26%), Positives = 132/304 (43%), Gaps = 43/304 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ + +G+P + +D+GS+++WV C+ C QC Y D +DP++SSS
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLASFSKHAPQ 225
VSC +C++ S DYS + + + G L + L L
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGG------- 232
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
++VQ V IGCG + +G ++ A G++GLG G +S+ L G FS C
Sbjct: 233 TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVFSYCLASRG 286
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSC---------LTQSGFQA 335
+G G T +P G + + Y+VG+ +G LT+ G
Sbjct: 287 AGGA-----GSLVLGRTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGG 341
Query: 336 LV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
+V D+G + T LP E YA + FD + + S + CY+ S ++VP +
Sbjct: 342 VVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFY 401
Query: 395 FSKN 398
F +
Sbjct: 402 FDQG 405
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 141/309 (45%), Gaps = 19/309 (6%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
L++T + +GTP + F V +D GS++LWV C P S + L L+ +D SSSSS
Sbjct: 78 LYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRS----SGLGIQLNFFDASSSSS 133
Query: 169 SKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
S VSCS P+C S + C + + C Y Y + + +SGY V + ++ +
Sbjct: 134 SSLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQYG-DGSGTSGYYVSESMYFDMVMGQS 192
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
++ +SV+ GC Q+G A DG+ G G GD+SV S L+ G+ FS C
Sbjct: 193 MIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLK 252
Query: 282 -DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF----VGVESYCIGNSCLTQS-GFQA 335
+ N G + G+ + +P Y+ Y V ++ I S S
Sbjct: 253 GEGNGGGILVLGEVLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPIDPSVFATSINRGT 312
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
++DSG + +L E Y V V S+ ++ + CY S+ P + L F
Sbjct: 313 IIDSGTTLAYLVEEAYTPFVSAITAAV-SQSVTPTISKGNQCYLVSTSVGEIFPLVSLNF 371
Query: 396 SKNQSFVVR 404
+ + S V++
Sbjct: 372 AGSASMVLK 380
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 96/409 (23%), Positives = 168/409 (41%), Gaps = 76/409 (18%)
Query: 3 NLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYL 62
N+V + LF + + + FS L+HR D + P K E L
Sbjct: 11 NVVVVGFLFHLLEVGLASGGGFSVDLIHR--DSPHSPFFD-----------PSKTRTERL 57
Query: 63 ELLLSNDWKRQKTRV------KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDI 116
++ + R +RV + S+ SR L PS G Y ++ + I
Sbjct: 58 ----TDAFHRSASRVGRFRQSAMTSDGIQSR---LVPSAGE---------YIMN---LSI 98
Query: 117 GTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
GTP V + +D GS+L W C+ C C ++ DP +SS+ ++ SC
Sbjct: 99 GTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFF----------DPKNSSTYRDSSCG 148
Query: 176 HPLCKSRSSCKSLKD--PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C + + +S ++ C ++ Y+ + + + G L + L +AS A +
Sbjct: 149 TSFCLALGNDRSCRNGKKCTFMYSYA-DGSFTGGNLAVETLTVAS---TAGKPVSFPGFA 204
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGS 288
GC + G + + ++ G++GLG+ ++S+ S L I FS C D + S
Sbjct: 205 FGCVHRSGGIFDEHSS--GIVGLGVAELSMISQLKST--INGRFSYCLLPVFTDSSMSSR 260
Query: 289 VFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCLTQSGF---------QAL 336
+ FG G + T P+ G Y + +E + +G L+ GF +
Sbjct: 261 INFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGNII 320
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
VDSG ++T+LP E Y ++ + KR+ CYN + +++
Sbjct: 321 VDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQI 369
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 85/336 (25%), Positives = 149/336 (44%), Gaps = 46/336 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS + +VPC C QC ++ P
Sbjct: 80 YYTTRLWI--GTPPQMFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFQPDL 127
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ + V C+ +C + + C Y Y+ E ++SSG L +D++ + S+ APQ
Sbjct: 128 SSTYQPVKCT-----LDCNCDNDRMQCVYERQYA-EMSTSSGVLGEDVVSFGNQSELAPQ 181
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC +TG A DG+MGLG GD+S+ L ++ +SFS+C+ D
Sbjct: 182 RAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMD 235
Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
G V G P+ P+ Y Y + ++ + N + ++
Sbjct: 236 VGGGAMVLGGISPPSDMVFAQSDPVRSPY--YNIDLKEIHVAGKRLPLNPSVFDGKHGSV 293
Query: 337 VDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLKV---- 388
+DSG ++ +LP E + E +VK +L S +IS ++ C++ + ++ ++
Sbjct: 294 LDSGTTYAYLPEEAFLAFKEAIVK--ELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTF 351
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
P + +IF + + + F ++V C F
Sbjct: 352 PVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIF 387
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 102/402 (25%), Positives = 164/402 (40%), Gaps = 67/402 (16%)
Query: 3 NLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYL 62
N+V + LF + + + FS L+HR D + P K E L
Sbjct: 11 NVVVVGFLFQLLEVALARGGGFSVDLIHR--DSPHSPFFD-----------PSKTQAERL 57
Query: 63 ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
++ ++R +RV S+G Q+ + +L +I GTP V
Sbjct: 58 ----TDAFRRSVSRV-------GRFRPTAMTSDGIQSRIVPSAGEYLMNLYI--GTPPVP 104
Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC--- 179
+ +D GS+L W QC P + Y + + +DP +SS+ ++ SC C
Sbjct: 105 VIAIVDTGSDLTWT-----QCRPCTHCY----KQVVPLFDPKNSSTYRDSSCGTSFCLAL 155
Query: 180 -KSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
K RS K K C + YS D S + G L + L + S A + GCG
Sbjct: 156 GKDRSCSKEKK--CTF--RYSYADGSFTGGNLASETLTVDS---TAGKPVSFPGFAFGCG 208
Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFG 292
G + ++ G++GLG G++S+ S L I FS C D + S + FG
Sbjct: 209 HSSGGIFDKSSS--GIVGLGGGELSLISQLKST--INGLFSYCLLPVSTDSSISSRINFG 264
Query: 293 DQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQA---------LVDSGA 341
G + T P+ +K Y++ +E +G L G+ +VDSG
Sbjct: 265 ASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVEEGNIIVDSGT 324
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
++TFLP E Y+++ + KR+ + CYN ++E
Sbjct: 325 TYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAE 366
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 76/268 (28%), Positives = 120/268 (44%), Gaps = 40/268 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
I+IG P + + LD GS+L W+ C C++C L P SS
Sbjct: 64 INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 109
Query: 172 VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
+ C+ PLCK S C++ + C Y +Y+ + SS G LV D+ FS + Q
Sbjct: 110 IPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLVRDV-----FSMNYTQG 162
Query: 227 -SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+ + +GCG Q DGV+GLG G VS+ S L G ++N C
Sbjct: 163 LRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLG 222
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQALV---DS 339
G +FFGD + + S+ P+ +Y ++ S +G L +G + L+ DS
Sbjct: 223 GGILFFGDDLYDSSR-VSWTPMSREYSKHY----SPAMGGELLFGGRTTGLKNLLTVFDS 277
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRI 367
G+S+T+ ++ Y V + +S K +
Sbjct: 278 GSSYTYFNSKAYQAVTYLLKRELSGKPL 305
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 122/272 (44%), Gaps = 30/272 (11%)
Query: 105 QFYWLHYTWIDIGTPNVS--FLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSE 160
Q L+YT I +G P + + +D GS L W+ C C CA + Y NL
Sbjct: 25 QMGMLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVR 84
Query: 161 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
SS + V + L + +C C Y +Y+ + + S G L D HL
Sbjct: 85 ---SSEAFCVEVQ-RNQLTEHCENCHQ----CDYEIEYA-DHSYSMGVLTKDKFHL---- 131
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
K S +S ++ GCG Q G L+ DG++GL +S+PS LA G+I N
Sbjct: 132 KLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGH 191
Query: 280 CF--DENDSGSVFFG-DQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSG-- 332
C D N G +F G D P+ +++P+ + DAY + V G L+ G
Sbjct: 192 CLASDLNGEGYIFMGSDLVPS--HGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGEN 249
Query: 333 ---FQALVDSGASFTFLPTEIYAEVVVKFDKL 361
+ L D+G+S+T+ P + Y+++V ++
Sbjct: 250 GRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEV 281
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 76/268 (28%), Positives = 120/268 (44%), Gaps = 40/268 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
I+IG P + + LD GS+L W+ C C++C L P SS
Sbjct: 61 INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 106
Query: 172 VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
+ C+ PLCK S C++ + C Y +Y+ + SS G LV D+ FS + Q
Sbjct: 107 IPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLVRDV-----FSMNYTQG 159
Query: 227 -SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+ + +GCG Q DGV+GLG G VS+ S L G ++N C
Sbjct: 160 LRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLG 219
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQALV---DS 339
G +FFGD + + S+ P+ +Y ++ S +G L +G + L+ DS
Sbjct: 220 GGILFFGDDLYDSSR-VSWTPMSREYSKHY----SPAMGGELLFGGRTTGLKNLLTVFDS 274
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRI 367
G+S+T+ ++ Y V + +S K +
Sbjct: 275 GSSYTYFNSKAYQAVTYLLKRELSGKPL 302
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 113/257 (43%), Gaps = 30/257 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+Y ++IG P + + +D GS+L W+ C C C + Y L
Sbjct: 53 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRL--------- 103
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIA--DYS---TEDTSSSGYLVDDILHLASFSKH 222
V C++ LC + S + + CP DY T+ SS G L++D SFS
Sbjct: 104 ----VPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLIND-----SFSLP 154
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
S+++ + GCG Q AA DG++GLG G VS+ S L + G+ +N C
Sbjct: 155 MRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHC 214
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALVD 338
N G +FFGD + + T ++P+ ++ Y G + L + + D
Sbjct: 215 LSTNGGGFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFD 273
Query: 339 SGASFTFLPTEIYAEVV 355
SG+++T+ + Y VV
Sbjct: 274 SGSTYTYFTAQPYQAVV 290
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 90/332 (27%), Positives = 142/332 (42%), Gaps = 48/332 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+GTP F + LD GS+L W+ QC+ C Y ++N YDP SSS KN++C
Sbjct: 201 VGTPPKHFSLILDTGSDLNWI--QCVPC-------YACFEQNGPYYDPKDSSSFKNITCH 251
Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
P C+ SS CK CPY Y ++ + ++ + + P+ +
Sbjct: 252 DPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIV 311
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGS 288
+V+ GCG G + A ++GLG G +S + L L +SFS C D N + S
Sbjct: 312 ENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFATQL--QSLYGHSFSYCLVDRNSNSS 366
Query: 289 V----FFGDQGPATQQS----TSFLPIGEKYDA----YFVGVESYCIGNSCL-------- 328
V FG+ TSF +G K + Y+V ++S +G L
Sbjct: 367 VSSKLIFGEDKELLSHPNLNFTSF--VGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWH 424
Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
Q G ++DSG + T+ Y + F + + + K CYN S E +
Sbjct: 425 LSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKM 484
Query: 387 KVPDMRLIFSKNQ--SFVVRNHIFSF-PENEV 415
++P+ ++F+ F V N+ PE+ V
Sbjct: 485 ELPEFAILFADGAMWDFPVENYFIQIEPEDVV 516
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 69/257 (26%), Positives = 116/257 (45%), Gaps = 30/257 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+Y ++IG P + + +D GS+L W+ C C C + Y +
Sbjct: 53 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY-------------RPT 99
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIA--DYS---TEDTSSSGYLVDDILHLASFSKH 222
+++ V C++ LC + S + + CP DY T+ SS G L++D SFS
Sbjct: 100 ANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLIND-----SFSLP 154
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
S+++ + GCG Q AA DG++GLG G VS+ S L + G+ +N C
Sbjct: 155 MRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHC 214
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALVD 338
N G +FFGD + + T ++P+ ++ Y G + L + + D
Sbjct: 215 LSTNGGGFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFD 273
Query: 339 SGASFTFLPTEIYAEVV 355
SG+++T+ + Y VV
Sbjct: 274 SGSTYTYFTAQPYQAVV 290
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 93/336 (27%), Positives = 144/336 (42%), Gaps = 38/336 (11%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
IGTP +D ++ +W QC C P + +DPS SS+ K + CS
Sbjct: 95 IGTPPFQLYGVMDTANDNIWF--QCNPCKPC-------FNTTSPMFDPSKSSTYKTIPCS 145
Query: 176 HPLCKS--RSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
P CK+ + C S K C Y Y E S G L D L L S + P S ++
Sbjct: 146 SPKCKNVENTHCSSDDKKVCEYSFTYGGE-AYSQGDLSIDTLTLNS-NNDTPISF--KNI 201
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSG 287
+IGCG + G L+G G +GLG G +S S L + I FS C +E SG
Sbjct: 202 VIGCGHRNKGP-LEGYV-SGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGISG 257
Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDS 339
+ FGD+ + T PI Y + + +G+ + + ++DS
Sbjct: 258 KLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDS 317
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
G + T LP +Y+ + +V +R +K CY A+ + L VP + F+
Sbjct: 318 GTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATLKN-LDVPIITAHFNGAD 376
Query: 400 SFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGILI 435
+ + F ++EV CF++ ++ NF G +I
Sbjct: 377 VHLNSLNTFYPIDHEV---VCFAFVSVG-NFPGTII 408
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 84/296 (28%), Positives = 124/296 (41%), Gaps = 44/296 (14%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSS 166
L+Y + IG P + + +D GS+L W+ C C CA Y DP +
Sbjct: 30 LYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLY----------DPKRA 79
Query: 167 SSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
+ V C P C + +C C Y DY + +S+ G LV+D + L +
Sbjct: 80 ---RVVDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDY-VDGSSTMGILVEDTITLVLTNG 135
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+ Q+ +IGCG Q G+ A DGV+GL +S+PS LA G+ N C
Sbjct: 136 ----TRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHC 191
Query: 281 F--DENDSGSVFFGDQ-GPATQQSTSFL---PIGEKYDAYFVGVESYCIGNSCLTQSGFQ 334
N G +FFGD PA + + + P+ E Y A ++ G L G
Sbjct: 192 LAGGSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIK---YGGEVLELEGTT 248
Query: 335 -----ALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQGNSWKYCYNASS 382
A+ DSG SFT+L Y V VV+ + +RI + +C+ S
Sbjct: 249 DDVGGAMFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTD-TTLPFCWRGPS 303
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 83/295 (28%), Positives = 135/295 (45%), Gaps = 36/295 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP V +L D GS+L W C C++C Y L R + ++P S+S +V
Sbjct: 96 VSIGTPPVDYLGIADTGSDLTWAQCLPCLKC-------YQQL-RPI--FNPLKSTSFSHV 145
Query: 173 SCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C+ C + ++ C Y Y + T S G L + + + S SSV+S
Sbjct: 146 PCNTQTCHAVDDGHCGVQGVCDYSYTYG-DRTYSKGDLGFEKITIGS-------SSVKS- 196
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD---ENDSGS 288
+IGCG +G + GV+GLG G +S+ S +++ I FS C + +G
Sbjct: 197 -VIGCGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGK 252
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNS---CLTQSGFQALVDSGASF 343
+ FG+ + P+ K Y++ +E+ IGN + G ++DSG +
Sbjct: 253 INFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQG-NVIIDSGTTL 311
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN--ASSEEMLKVPDMRLIFS 396
T LP E+Y VV K+V +KR+ S C++ ++ L +P + FS
Sbjct: 312 TILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVITAHFS 366
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 73/267 (27%), Positives = 118/267 (44%), Gaps = 38/267 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
I+IG P + + LD GS+L W+ C C++C L P SS
Sbjct: 64 INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 109
Query: 172 VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
+ C+ PLCK S C++ + C Y +Y+ + SS G LV D+ + +
Sbjct: 110 IPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLVRDVFSM----NYTKGL 163
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+ + +GCG Q DGV+GLG G VS+ S L G ++N C
Sbjct: 164 RLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 223
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQALV---DSG 340
G +FFGD + + S+ P+ +Y ++ S +G L +G + L+ DSG
Sbjct: 224 GILFFGDDLYDSSR-VSWTPMSREYSKHY----SPAMGGELLFGGRTTGLKNLLTVFDSG 278
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRI 367
+S+T+ ++ Y V + +S K +
Sbjct: 279 SSYTYFNSKAYQAVTYLLKRELSGKPL 305
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 135/301 (44%), Gaps = 47/301 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +GTP F +D GS+L WV QCAP + + ++ + P +SSS N S
Sbjct: 12 ISLGTPPQQFSAIVDTGSDLCWV-----QCAPCARCF----EQPDPLFIPLASSSYSNAS 62
Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C+ LC + R +C S+++ C Y Y + + +F S +
Sbjct: 63 CTDSLCDALPRPTC-SMRNTCTYSYSYGDGSNTRGDF---------AFETVTLNGSTLAR 112
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGS-- 288
+ GCG Q G++ A DG++GLG G +S+PS L + + FS C D++ +G+
Sbjct: 113 IGFGCGHNQEGTF---AGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFS 167
Query: 289 -VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQ--SGFQ-------- 334
+ FG+ A SF P+ + D Y+VGVES +GN + S F+
Sbjct: 168 PITFGNA--AENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGG 225
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS--SEEMLKVPDMR 392
++DSG + T+ + ++ + + +S CY+ S S L +P M
Sbjct: 226 VILDSGTTITYWRLAAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMT 285
Query: 393 L 393
+
Sbjct: 286 V 286
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 148/354 (41%), Gaps = 74/354 (20%)
Query: 58 SVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF----------- 106
SVE + S Q T+ K Q N++R + HF+
Sbjct: 29 SVELIHRDSSKSPLYQPTQNKYQHIVNAARRSI-----NRANHFYKTALTNTPQSTVIPD 83
Query: 107 ---YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
Y + Y+ +GTP D GS+++W +QC P Y ++ ++ P
Sbjct: 84 HGEYLMTYS---VGTPPFKLYGIADTGSDIVW-----LQCEPCKECY----NQTTPKFKP 131
Query: 164 SSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
S SS+ KN+ CS LCKS G L D L L S + H
Sbjct: 132 SKSSTYKNIPCSSDLCKS----------------------GQQGNLSVDTLTLESSTGH- 168
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-- 281
P S ++ +IGCG T S+ +GA+ G++GLG G S+ + L + I FS C
Sbjct: 169 PISFPKT--VIGCGTDNTVSF-EGAS-SGIVGLGGGPASLITQLGSS--IDAKFSYCLLP 222
Query: 282 ---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGF--- 333
+ N + + FGD + PI +K Y++ +E++ +GN + G
Sbjct: 223 NPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEGSSNG 282
Query: 334 ----QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
++DSG + T +PT++Y + +LV KR++ + CY+ +S+
Sbjct: 283 GHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCYSVTSD 336
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 93/335 (27%), Positives = 145/335 (43%), Gaps = 41/335 (12%)
Query: 53 WPKKNSVEYLELLLSNDWKRQKTR----VKLQSNNNSSRNQLLFPSEGS---QTHFF--G 103
+ K + E+ E +L D + + L+ N N +L S GS T F G
Sbjct: 135 YHKLRAREFHERILEEDLGLENENFVESMDLELVNPVKVNDVLSTSAGSIDSSTTIFPVG 194
Query: 104 NQFY--WLHYTWIDIGTPNVS--FLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRN 157
Y L+YT I +G P + + +D GS L W+ C C CA + Y N
Sbjct: 195 GNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDN 254
Query: 158 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 217
L SS + V + L + +C C Y +Y+ + + S G L D HL
Sbjct: 255 LVR---SSEAFCVEVQRNQ-LTEHCENCHQ----CDYEIEYA-DHSYSMGVLTKDKFHL- 304
Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNS 276
K S +S ++ GCG Q G L+ DG++GL +S+PS LA G+I N
Sbjct: 305 ---KLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNV 361
Query: 277 FSICF--DENDSGSVFFG-DQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQS 331
C D N G +F G D P+ +++P+ + DAY + V G L+
Sbjct: 362 VGHCLASDLNGEGYIFMGSDLVPS--HGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLD 419
Query: 332 G-----FQALVDSGASFTFLPTEIYAEVVVKFDKL 361
G + L D+G+S+T+ P + Y+++V ++
Sbjct: 420 GENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEV 454
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 86/328 (26%), Positives = 141/328 (42%), Gaps = 64/328 (19%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
I IGTP + LD GS+L+W C C +C P A Y P+ S++ N
Sbjct: 96 IAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYA----------PARSATYAN 145
Query: 172 VSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
VSC P+C++ S C C Y Y + TS+ G L + L S +
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTLGS-------DT 197
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDEN 284
V GCG + GS + + G++G+G G + SL+++ G+ + FS C F+
Sbjct: 198 AVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTR--FSYCFTPFNAT 249
Query: 285 DSGSVFFGDQG--PATQQSTSFLP-----IGEKYDAYFVGVESYCIGNSC---------L 328
+ +F G + ++T F+P + Y++ +E +G++ L
Sbjct: 250 AASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRL 309
Query: 329 TQSG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEE 384
T G ++DSG +FT L + V L S R+ L + C+ A+S E
Sbjct: 310 TPMGDGGVIIDSGTTFTALEERAF---VALARALASRVRLPLASGAHLGLSLCFAAASPE 366
Query: 385 MLKVPDMRLIFS------KNQSFVVRNH 406
++VP + L F + +S+VV +
Sbjct: 367 AVEVPRLVLHFDGADMELRRESYVVEDR 394
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 76/268 (28%), Positives = 120/268 (44%), Gaps = 40/268 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
I+IG P + + LD GS+L W+ C C++C L P SS
Sbjct: 52 INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 97
Query: 172 VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
+ C+ PLCK S C++ + C Y +Y+ + SS G LV D+ FS + Q
Sbjct: 98 IPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLVRDV-----FSMNYTQG 150
Query: 227 -SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+ + +GCG Q DGV+GLG G VS+ S L G ++N C
Sbjct: 151 LRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLG 210
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQALV---DS 339
G +FFGD + + S+ P+ +Y ++ S +G L +G + L+ DS
Sbjct: 211 GGILFFGDDLYDSSR-VSWTPMSREYSKHY----SPAMGGELLFGGRTTGLKNLLTVFDS 265
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRI 367
G+S+T+ ++ Y V + +S K +
Sbjct: 266 GSSYTYFNSKAYQAVTYLLKRELSGKPL 293
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 78/265 (29%), Positives = 119/265 (44%), Gaps = 26/265 (9%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNL-SEYDPSS 165
L++T + +G P S+ + +D GS+L W+ C C C + Y N+ S D
Sbjct: 193 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHVQYKPTRSNVVSSVDSLC 252
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
KN H +SL C Y Y+ + +SS G LV D LHL + +
Sbjct: 253 LDVQKNQKNGH-------HDESLLQ-CDYEIQYA-DHSSSLGVLVRDELHLVTTNG---- 299
Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
S + +V+ GCG Q G L+ A DG+MGL VS+P LA GLI+N C +
Sbjct: 300 SKTKLNVVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSND 359
Query: 285 DSGS--VFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSG----FQAL 336
+G +F GD +++P+ D Y + GN L G +
Sbjct: 360 GAGGGYMFLGDDF-VPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKFDGQSKVGKVF 418
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKL 361
DSG+S+T+ P E Y ++V +++
Sbjct: 419 FDSGSSYTYFPKEAYLDLVASLNEV 443
>gi|294461400|gb|ADE76261.1| unknown [Picea sitchensis]
Length = 165
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 81/150 (54%), Gaps = 11/150 (7%)
Query: 2 VNLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEY 61
V+ + +LF + S+ S+S ++ H+FS+E KE W++ + D WP + S EY
Sbjct: 6 VSFIYSLILFTSLGFQNSNGQSYSLQMYHKFSNEVKE-WMTWRHGLD-TDGWPVEGSNEY 63
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
+ L +D R ++ + L F EG++T Q +L Y+ + +GTPNV
Sbjct: 64 YKALYHHDSARHGRKL-------ADHPSLTF-LEGNETVEI-PQLGFLFYSMVQVGTPNV 114
Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYY 151
+ VALD GS++ WVPC C CAP SA+ Y
Sbjct: 115 TLFVALDTGSDVFWVPCDCQACAPTSAASY 144
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 84/340 (24%), Positives = 140/340 (41%), Gaps = 42/340 (12%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
T + IGTP F + +D GS + +VPC C QC + P SS+ +
Sbjct: 79 TRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCG----------KHQDPRFQPDLSSTYR 128
Query: 171 NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
V C+ P C +C C Y Y+ E +SSSG + +D++ + S+ PQ +V
Sbjct: 129 PVKCN-PSC----NCDDEGKQCTYERRYA-EMSSSSGVIAEDVVSFGNESELKPQRAV-- 180
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--SGS 288
GC +TG A DG+MGLG G +SV L G+I +SFS+C+ D G+
Sbjct: 181 ---FGCENVETGDLYSQRA-DGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGA 236
Query: 289 VFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGA 341
+ G P S P Y Y + ++ + L ++DSG
Sbjct: 237 MVLGQISPPPNMVFSHSNPYRSPY--YNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGT 294
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKV----PDMRLI 394
++ + P + + K + + + G Y C++ + E+ + P++ ++
Sbjct: 295 TYAYFPEAAFHALKDAIMKEIRHLK-QIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMV 353
Query: 395 FSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
F Q + + F +V C F + T +L
Sbjct: 354 FGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLL 393
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 100/364 (27%), Positives = 147/364 (40%), Gaps = 53/364 (14%)
Query: 55 KKNSVEYLELLLSNDWK----RQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
K S +++E+L + + K KL +N+ S P++ T GN +
Sbjct: 79 KATSPDHVEILRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGN-----Y 133
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ +GTP + D GS+L W CQ C++ T D+ ++PS S+S
Sbjct: 134 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR---------TCYDQKEPIFNPSKSTSY 184
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
NVSCS C S SS C Y Y + + S G+L D L S
Sbjct: 185 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFLAKDKFTLTS------ 237
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
S V V GCG G + A G++GLG +S PS A A FS C +
Sbjct: 238 -SDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSS 291
Query: 285 DS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVGVESYCIGNSCLTQSG 332
S G + FG G +S F PI D A VG + I ++ + G
Sbjct: 292 ASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPG 349
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
AL+DSG T LP + YA + F +S + + C++ S + + +P +
Sbjct: 350 --ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVA 407
Query: 393 LIFS 396
FS
Sbjct: 408 FSFS 411
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 76/279 (27%), Positives = 117/279 (41%), Gaps = 41/279 (14%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
L+ I+IG P + + +D GS+L WV C P + ++ ++ Y P+
Sbjct: 61 LYTVSINIGNPPKPYELDIDTGSDLTWVQCD----GPDAPCKGCTMPKD-KLYKPNGK-- 113
Query: 169 SKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
+ V CS P+C + S C PC Y Y+ + S+ G LV D +H+ S
Sbjct: 114 -QVVKCSDPICVATQSTHVLGQICSKQSPPCVYNVQYA-DHASTLGVLVRDYMHIGS--- 168
Query: 222 HAPQSSVQSSVI-IGCGRKQ--TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
P SS + ++ GCG +Q +G + P G++GLG G S+ S L G I N
Sbjct: 169 --PSSSTKDPLVAFGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLG 226
Query: 279 ICFDENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
C G +F GD+ P Q S EK+ Y G
Sbjct: 227 HCLSAEGGGYLFLGDKFVPSSGIVWTPIIQSSL------EKH--YNTGPVDLFFNGKPTP 278
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 368
G Q + DSG+S+T+ + +Y V + + K +S
Sbjct: 279 AKGLQIIFDSGSSYTYFSSPVYTIVANMVNNDLKGKPLS 317
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 86/328 (26%), Positives = 141/328 (42%), Gaps = 64/328 (19%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
I IGTP + LD GS+L+W C C +C P A Y P+ S++ N
Sbjct: 96 IAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYA----------PARSATYAN 145
Query: 172 VSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
VSC P+C++ S C C Y Y + TS+ G L + L S +
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTLGS-------DT 197
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDEN 284
V GCG + GS + + G++G+G G + SL+++ G+ + FS C F+
Sbjct: 198 AVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTR--FSYCFTPFNAT 249
Query: 285 DSGSVFFGDQG--PATQQSTSFLP-----IGEKYDAYFVGVESYCIGNSC---------L 328
+ +F G + ++T F+P + Y++ +E +G++ L
Sbjct: 250 AASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRL 309
Query: 329 TQSG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEE 384
T G ++DSG +FT L + V L S R+ L + C+ A+S E
Sbjct: 310 TPMGDGGVIIDSGTTFTALEESAF---VALARALASRVRLPLASGAHLGLSLCFAAASPE 366
Query: 385 MLKVPDMRLIFS------KNQSFVVRNH 406
++VP + L F + +S+VV +
Sbjct: 367 AVEVPRLVLHFDGADMELRRESYVVEDR 394
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 146/348 (41%), Gaps = 50/348 (14%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS + +VPC C C ++ P
Sbjct: 88 YYTTRLWI--GTPPQRFALIVDTGSTVTYVPCSTCEHCG----------RHQDPKFQPDL 135
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
S + + V C+ P C +C + C Y Y+ E +SSSG L +D++ + S+ APQ
Sbjct: 136 SETYQPVKCT-PDC----NCDGDTNQCMYDRQYA-EMSSSSGVLGEDVVSFGNLSELAPQ 189
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC +TG A DG+MGLG GD+S+ L +I +SFS+C+ D
Sbjct: 190 RAV-----FGCENDETGDLYSQRA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMD 243
Query: 286 SGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
G + G P T P Y Y + ++ + N + +
Sbjct: 244 VGGGAMILGGISPPEDMVFTHSDPDRSPY--YNINLKEMHVAGKKLQLNPKVFDGKHGTV 301
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ------GNSWKYCYNASSEEMLKV-- 388
+DSG ++ +LP + + F + + +R SL+ N C+ + ++ ++
Sbjct: 302 LDSGTTYAYLPETAF----LAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAK 357
Query: 389 --PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
P + ++F + + F ++V C F+ + T +L
Sbjct: 358 SFPVVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLL 405
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 132/316 (41%), Gaps = 36/316 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+Y ++IG P + + +D GS+L W+ C C C + Y L
Sbjct: 57 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKL--------- 107
Query: 168 SSKNVSCSHPLCKSRSSCKS------LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
V C++ +C + S S + C Y Y T+ SS G LV D L +K
Sbjct: 108 ----VPCANSICTALHSGSSPNKKCTTQQQCDYQIKY-TDKASSLGVLVTDSFSLPLRNK 162
Query: 222 HAPQSSVQSSVIIGCG-RKQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNSF 277
S+V+ S+ GCG +Q G +GAAP DG++GLG G VS+ S L + G+ +N
Sbjct: 163 ----SNVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVL 216
Query: 278 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQA 335
C + G +FFGD T + T ++P+ Y G + L+ +
Sbjct: 217 GHCLSTSGGGFLFFGDDMVPTSRVT-WVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEV 275
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ DSG+++T+ + Y + +S + S C+ + V D++ F
Sbjct: 276 VFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKG-QKAFKSVSDVKKDF 334
Query: 396 SKNQSFVVRNHIFSFP 411
Q +N + P
Sbjct: 335 KSLQFIFGKNAVMEIP 350
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/281 (25%), Positives = 119/281 (42%), Gaps = 29/281 (10%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+Y ++IG P + + +D GS+L W+ C C C + Y +L
Sbjct: 54 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANSL--------- 104
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIA--DYS---TEDTSSSGYLVDDILHLASFSKH 222
V C++ LC + S + CP DY T+ SS G L++D +FS
Sbjct: 105 ----VPCANALCTALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLIND-----NFSLP 155
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
S+++ + GCG Q AA DG++GLG G VS+ S L + G+ +N C
Sbjct: 156 MRSSNIRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHC 215
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDS 339
N G +FFGD T + T ++P+ + Y+ G + L + + DS
Sbjct: 216 LSTNGGGFLFFGDDIVPTSRVT-WVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEVVFDS 274
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
G+++T+ + Y VV +S + S C+
Sbjct: 275 GSTYTYFTAQPYQAVVSALKSGLSKSLKQVSDPSLPLCWKG 315
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/295 (24%), Positives = 129/295 (43%), Gaps = 31/295 (10%)
Query: 80 QSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC 138
+ ++ +N LFP GN F L+YT I +G+P + + +D GS+ WV C
Sbjct: 134 RGGDDWPQNSTLFPHS-----LAGNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQC 188
Query: 139 QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADY 198
CA + + Y P+ ++ + + S PLC+ + + C Y Y
Sbjct: 189 DAPPCASCAKGAHPL-------YRPARTADA--LPASDPLCE--GAQHENPNQCDYEISY 237
Query: 199 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLG 257
+ +S Y+ D + + + + ++ GCG Q G L+ DGV+GL
Sbjct: 238 ADGSSSMGVYVRDSMQFVGEDGERE-----NADIVFGCGYDQQGVLLNALETTDGVLGLT 292
Query: 258 LGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPI--GEKYD 312
+S+P+ LA G+I N+F C + SG+ +F GD + +++PI G D
Sbjct: 293 NKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDY-IPRWGMTWVPIRDGPADD 351
Query: 313 AYFVGVESYCIGNSCLTQSG--FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
V+ G+ L G Q + D+G+++T+ P E ++ + S +
Sbjct: 352 VRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPDEALTRLISSLKEAASPR 406
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/328 (28%), Positives = 152/328 (46%), Gaps = 41/328 (12%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
Y + Y+ +G P +D GS+++W +QC P Y ++ +DPS S
Sbjct: 86 YLISYS---VGIPPFQLYGIIDTGSDMIW-----LQCKPCEKCY----NQTTRIFDPSKS 133
Query: 167 SSSKNVSCSHPLCKS--RSSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
++ K + S C+S +SC S + C Y Y + + S G L + L L S +
Sbjct: 134 NTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYG-DGSYSQGDLSVETLTLGSTNG-- 190
Query: 224 PQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLLAKAGLIQNSFSICF 281
SSV+ +IGCGR T S+ +G + G++GLG G VS + L ++ I FS C
Sbjct: 191 --SSVKFRRTVIGCGRNNTVSF-EGKS-SGIVGLGNGPVSLINQLRRRSSSIGRKFSYCL 246
Query: 282 DE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL--TQSGFQ 334
N S + FGD + T PI +D Y++ +E++ +GN+ + T S F+
Sbjct: 247 ASMSNISSKLNFGDAAVVSGDGTVSTPI-VTHDPKVFYYLTLEAFSVGNNRIEFTSSSFR 305
Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
++DSG + T LP +IY+++ LV R+ CY ++ +E L
Sbjct: 306 FGEKGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTFDE-LNA 364
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEVG 416
P + FS V N + +F E E G
Sbjct: 365 PVIMAHFSGAD--VKLNAVNTFIEVEQG 390
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 138/332 (41%), Gaps = 36/332 (10%)
Query: 96 GSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLD 155
G ++ F +L Y +++GTP L D GS+L+WV C S
Sbjct: 88 GVESKIITRSFEYLMY--VNVGTPPAQMLAIADTGSDLVWVNCS-------SNGGGGGAS 138
Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDD 212
+ PS S++ +SC C+ S++SC + + C Y Y+ D S + G L +
Sbjct: 139 DGAVVFHPSRSTTYSLLSCQSAACQALSQASCDADSE-CQY--QYAYGDGSRTIGVLSTE 195
Query: 213 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 272
A+ V GC GS+ DG++GLG G +S+ S L A
Sbjct: 196 TFSFAAAGGGGEGQVRVPRVSFGCSTGSAGSFRS----DGLVGLGAGALSLVSQLGAAAR 251
Query: 273 IQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLP-IGEKYDAYF-VGVESYCI-G 324
I FS C N S ++ FG + + + P + + D+Y+ V +ES + G
Sbjct: 252 IARRFSYCLVPPYAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAG 311
Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA---S 381
+ + + +VDSG + TFL + +V + ++ + R + CY+ S
Sbjct: 312 QDVASANSSRIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKS 371
Query: 382 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN 413
E +PD+ L F S +R PEN
Sbjct: 372 QAEDFGIPDVTLRFGGGASVTLR------PEN 397
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/279 (27%), Positives = 119/279 (42%), Gaps = 36/279 (12%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPS 164
Y L+Y + +G P+ + + +D+GS L W+ C CI CA Y +L
Sbjct: 76 YGLYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHPLYKLKKGSL------ 129
Query: 165 SSSSSKNVSCSHPLCKSRSSC-------KSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 217
V PLC + + K C Y Y+ + S G+LV D +
Sbjct: 130 -------VPSKDPLCAAVQAGSGHYHNHKEASQRCDYDVAYA-DHGYSEGFLVRDSVRAL 181
Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
+K + + ++ + GCG Q S + A DG++GLG G S+PS AK GLI+N
Sbjct: 182 LTNK----TVLTANSVFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNV 237
Query: 277 FSICF--DENDSGSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVESYCIGNSCLTQSG- 332
C D G +FFGD +T T +G Y+VG GN L + G
Sbjct: 238 IGHCIFGAGRDGGYMFFGDDLVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGD 297
Query: 333 ----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 367
+ DSG+++T+ + Y + + +S K++
Sbjct: 298 GKKLGGIIFDSGSTYTYFTNQAYGAFLSVVKENLSGKQL 336
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/322 (25%), Positives = 136/322 (42%), Gaps = 33/322 (10%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ GTP ++ + D GS++ W IQC P S Y D +DP+ S++ V
Sbjct: 124 VGFGTPAQTYTLMFDTGSDVSW-----IQCLPCSGHCYKQHD---PIFDPTKSATYSAVP 175
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C HP C + S C Y Y + +S++G L + L L S +
Sbjct: 176 CGHPQCAAAGGKCSSNGTCLYKVQYG-DGSSTAGVLSHETLSLTS-------ARALPGFA 227
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
GCG G + D DG++GLG G +S+ S A + S+ + G + G
Sbjct: 228 FGCGETNLGDFGDV---DGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGT 284
Query: 294 QGPAT-QQSTSFLPIGEKYDA---YFVGVESYCIGNSCL-------TQSGFQALVDSGAS 342
PA+ + + +K D YFV + S +G L T+ G L+DSG
Sbjct: 285 TTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDG--TLLDSGTV 342
Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
T+LP E Y + +F ++ + + + + CY+ + + + +P + FS SF
Sbjct: 343 LTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFD 402
Query: 403 VRNH-IFSFPENEVGDHACFSY 423
+ + FP++ C ++
Sbjct: 403 LSPFGVLIFPDDTAPATGCLAF 424
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 121/285 (42%), Gaps = 47/285 (16%)
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+IGTP LVALD ++ WVPC C+ CA + +DPS SSSS+N+
Sbjct: 96 NIGTPAQPMLVALDTSNDAAWVPCSGCVGCA------------SSVLFDPSKSSSSRNLQ 143
Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C P CK +C + K C + Y +S L D L LA + V S
Sbjct: 144 CDAPQCKQAPNPTCTAGKS-CGFNMTYGGSTIEAS--LTQDTLTLA--------NDVIKS 192
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
GC K TG+ L G+MGLG G +S+ + L ++FS C N SG
Sbjct: 193 YTFGCISKATGTSLPA---QGLMGLGRGPLSL--ISQTQNLYMSTFSYCLPNSKSSNFSG 247
Query: 288 SVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQAL 336
S+ G + P ++T L + Y+V + +GN + +G +
Sbjct: 248 SLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTI 307
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
DSG FT L Y V +F + + + + G + CY+ S
Sbjct: 308 FDSGTVFTRLVEPAYVAVRNEFRRRIKNANATSLG-GFDTCYSGS 351
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 142/319 (44%), Gaps = 41/319 (12%)
Query: 102 FGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
G F L Y I IGTP +F V D GS+L WV QC+ C P S+ Y +
Sbjct: 113 LGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLTWV--QCLPC-PDSSCY----PQQEPL 165
Query: 161 YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
+DPS SS+ +V CS P C ++ C + C Y Y E + + G L ++ L
Sbjct: 166 FDPSKSSTYVDVPCSAPECHIGGVQQTRCGATS--CEYSVKYGDE-SETHGSLAEETFTL 222
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQN 275
+ S AP + + V+ GC + + D G G++GLG GD S+ L++ N
Sbjct: 223 SPPSPLAPAA---TGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSI---LSQTRRSIN 276
Query: 276 S----FSICFDENDS--GSVFFGDQGPATQQ---STSFLP----IGEKYDAYFVGVESYC 322
S FS C S G + G A QQ + SF P I + AY V +
Sbjct: 277 SGGGVFSYCLPPRGSSTGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVS 336
Query: 323 IGNSC--LTQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKY 376
+ + + S F A++DSG T +P Y + +F + S ++ +G+
Sbjct: 337 VNGAAVDIPASAFSLGAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDT 396
Query: 377 CYNASSEEMLKVPDMRLIF 395
CY+ + ++++ P + L F
Sbjct: 397 CYDVTGQDVVTAPRVALEF 415
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/299 (26%), Positives = 123/299 (41%), Gaps = 33/299 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I G+P + +D GS+L W QC P S Y + +Y P++S + ++
Sbjct: 62 IHFGSPQKKQFLHMDTGSSLTWT-----QCFPCSDCYAQKI---YPKYRPAASITYRDAM 113
Query: 174 C--SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C SHP + L C Y Y ++T+ G L +++ + H
Sbjct: 114 CEDSHPKSNPHFAFDPLTRICTYQQHY-LDETNIKGTLAQEMI---TVDTHDGGFKRVHG 169
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSG 287
V GC GSY G G++GLG+G S+ G + FS C E S
Sbjct: 170 VYFGCNTLSDGSYFTGT---GILGLGVGKYSI------IGEFGSKFSFCLGEISEPKASH 220
Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
++ GD G Q + + I E + + +ES +G Q VD+G++ + L
Sbjct: 221 NLILGD-GANVQGHPTVINITEGHTIF--QLESIIVGEEITLDDPVQVFVDTGSTLSHLS 277
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH 406
T +Y + V FD L+ S+ +S + CY A + E L+ D+ F V H
Sbjct: 278 TNLYYKFVDAFDDLIGSRPLSYEPT---LCYKADTIERLEKMDVGFKFDVGAELSVNIH 333
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/264 (29%), Positives = 121/264 (45%), Gaps = 30/264 (11%)
Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
GN + +Y+ I+IG + +F +D+GS+L WV C C C Y + L+
Sbjct: 47 GNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALN 106
Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD--ILHLA 217
++P +S HP+ + CKS D C Y +Y+ + SS G LV+D L L
Sbjct: 107 CFEPLCTSL-------HPI--TNHHCKSADDQCQYEIEYA-DHGSSLGVLVNDHVPLKLT 156
Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAGLIQNS 276
+ S AP+ + GCG S D + P GV+GLG G+VS S L+ G+++N
Sbjct: 157 NGSLAAPR------IAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNV 210
Query: 277 FSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 332
C + + G +FFGD+ T S S IG Y + G G
Sbjct: 211 VGHCLSD-EGGFLFFGDEFVPSSGVTWTSMSHESIGSYYSS---GPAEVYFGGKATGIKD 266
Query: 333 FQALVDSGASFTFLPTEIYAEVVV 356
+ DSG+S+T+ ++ Y ++
Sbjct: 267 LTLVFDSGSSYTYFNSQAYNSILA 290
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 121/299 (40%), Gaps = 42/299 (14%)
Query: 103 GNQFYWLH-YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-----CIQCAPLSA-SYYTSLD 155
GN + H Y ++IG P + + +D GSNL W+ C C C P YYT D
Sbjct: 30 GNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRPPHPYYTPAD 89
Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCKSL-----KDP--CPYIADYSTEDTSSSG 207
L V C PLC + R + DP C Y Y T S G
Sbjct: 90 GKLK------------VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEG 135
Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSL 266
L DI+ + K + GCG KQ +P +G++GLG+G +
Sbjct: 136 DLATDIISVNGRDK--------KRIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQ 187
Query: 267 LAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
L +I +N C G ++ GD P T+ T + P+ E Y G+ I
Sbjct: 188 LKGLKMIKENVIGHCLSSKGKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDK 246
Query: 326 SCLT-QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASS 382
+ F+A+ DSG+++T +P +IY E+V K S + ++G + C+
Sbjct: 247 QPIRGNPTFEAVFDSGSTYTHVPAQIYNEIVSKVRGTFSESSLEEVKGRALPLCWKGKK 305
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 94/356 (26%), Positives = 154/356 (43%), Gaps = 53/356 (14%)
Query: 59 VEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGT 118
V+Y++ LS + R+ T L S P+E G+ Y + + +GT
Sbjct: 8 VKYIQSRLSKNLGRENTVKDLDSTT--------LPAESGS--LIGSANYVV---VVGLGT 54
Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
P + D GS+L W QC P + S Y D + +DPS SSS N++C+ L
Sbjct: 55 PKRDLSLVFDTGSDLTWT-----QCEPCAGSCYKQQD---AIFDPSKSSSYTNITCTSSL 106
Query: 179 CKS------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C +S C S D C Y A Y ++++S G+L + L + + + +
Sbjct: 107 CTQLTSDGIKSECSSSTDASCIYDAKYG-DNSTSVGFLSQERLTITA-------TDIVDD 158
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSV 289
+ GCG+ G + +G+A G+MGLG +S+ + + FS C S G +
Sbjct: 159 FLFGCGQDNEGLF-NGSA--GLMGLGRHPISI--VQQTSSNYNKIFSYCLPATSSSLGHL 213
Query: 290 FFGDQGPATQQSTSFLPIGE-KYDAYFVGVE--SYCIGNSCL------TQSGFQALVDSG 340
FG AT S + P+ D F G++ S +G + L T S +++DSG
Sbjct: 214 TFG-ASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSG 272
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
T L +YA + F + + ++ + CY+ S + + VP + FS
Sbjct: 273 TVITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFS 328
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 147/360 (40%), Gaps = 47/360 (13%)
Query: 63 ELLLSNDWKRQKT---RVKLQ---SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDI 116
E +L+ D R K+ RV S RN+ P+ GN + I +
Sbjct: 113 EEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPSLPASSGSALGTGN-----YVVTIGL 167
Query: 117 GTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
GTP + V D GS+ WV QC P Y ++ +DP+ SS+ N+SC+
Sbjct: 168 GTPAGRYTVVFDTGSDTTWV-----QCEPCVVVCYKQQEK---LFDPARSSTYANISCAA 219
Query: 177 PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
P C C Y Y + + S G+ D L L+S+ GC
Sbjct: 220 PACSDLYIKGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAIKGFRFGC 271
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
G + G Y + A G++GLG G S+P K G + F+ CF SG+ + D G
Sbjct: 272 GERNEGLYGEAA---GLLGLGRGKTSLPVQAYDKYGGV---FAHCFPARSSGTGYL-DFG 324
Query: 296 PA-----TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ---ALVDSGASFTF 345
P + + T+ + + Y+VG+ +G L+ QS F +VDSG T
Sbjct: 325 PGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVITR 384
Query: 346 LPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
LP Y+ + F ++ + + + + CY+ + + +P + L+F S V
Sbjct: 385 LPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLDV 444
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 134/303 (44%), Gaps = 34/303 (11%)
Query: 66 LSNDWKRQKTRVKLQSNNNSSRNQL-LFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
+ + W++ + ++++ + N L P +G+ F Q+Y T I +G P +
Sbjct: 148 IDDGWRKARNKMEVAKAAAAGTNSTALLPIKGNV--FPDGQYY----TSIFVGNPPRPYF 201
Query: 125 VALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
+ +D GS+L W+ C C CA Y + +++ C L ++
Sbjct: 202 LDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKEKIV--------PPRDLLCQE-LQGNQ 252
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
+ C++ K C Y +Y+ + +SS G L D +HL + + + + GC Q G
Sbjct: 253 NYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHLIATNG----GREKLDFVFGCAYDQQG 306
Query: 243 SYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQ 299
L A DG++GL +S+PS LA G+I N F C ++ G +F GD +
Sbjct: 307 QLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITREQGGGGYMFLGDDY-VPR 365
Query: 300 QSTSFLPIGEKYD-AYFVGVESYCIGNSCLT---QSG--FQALVDSGASFTFLPTEIYAE 353
++ I D Y G+ L Q+G Q + DSG+S+T+LP EIY
Sbjct: 366 WGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQVIFDSGSSYTYLPDEIYEN 425
Query: 354 VVV 356
+V
Sbjct: 426 LVA 428
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 140/315 (44%), Gaps = 44/315 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y + +GTP + + LD GS+L W +QC P + + D YDPS S +
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSW-----LQCQPCAVYCHAQAD---PLYDPSVSKTY 176
Query: 170 KNVSCSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTS-SSGYLVDDILHLASFS 220
K +SC+ C SR +L DP C Y A Y DTS S GYL D+L L S S
Sbjct: 177 KKLSCASVEC-SRLKAATLNDPLCETDSNACLYTASYG--DTSFSIGYLSQDLLTLTS-S 232
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSI 279
+ PQ GCG+ G + A G++GL +S+ + L+ K G ++FS
Sbjct: 233 QTLPQ------FTYGCGQDNQGLFGRAA---GIIGLARDKLSMLAQLSTKYG---HAFSY 280
Query: 280 CFDENDSGSVFFGDQ-----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG-- 332
C +SGS G P + + T L + YF+ + + + L +
Sbjct: 281 CLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAM 340
Query: 333 --FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVP 389
L+DSG T LP +YA + F K++S+K S C+ S + + VP
Sbjct: 341 YRVPTLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVP 400
Query: 390 DMRLIFSKNQSFVVR 404
++++IF +R
Sbjct: 401 EIKMIFQGGADLTLR 415
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 127/292 (43%), Gaps = 32/292 (10%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+ WV QC P Y ++ +DP+ SS+ NVS
Sbjct: 183 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYEQREK---LFDPARSSTYANVS 234
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 235 CAAPACSDLDTRGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 286
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
GCG + G + + A G++GLG G S+P K G + F+ C +G+ +
Sbjct: 287 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLD 340
Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ---ALVDSGASFTF 345
FG PA + +T+ + + Y+VG+ +G L QS F +VDSG T
Sbjct: 341 FGAGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVITR 400
Query: 346 LPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
LP Y+ + F +S++ + + + CY+ + + +P + L+F
Sbjct: 401 LPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLF 452
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 116/256 (45%), Gaps = 31/256 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+Y ++IG P + + +D GS+L W+ C C C + +Y +
Sbjct: 73 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWY-------------KPT 119
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+K V C+ LC S + K P C Y Y T+ SS G L+ D L+ +
Sbjct: 120 KNKIVPCAASLCTSLTPNKKCAVPQQCDYQIKY-TDKASSLGVLIADNFTLSLRN----S 174
Query: 226 SSVQSSVIIGCG-RKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
S+V++++ GCG +Q G +GA A DG++GLG G VS+ S L + G+ +N CF
Sbjct: 175 STVRANLTFGCGYDQQVGK--NGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCF 232
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALVDS 339
N G +FFGD T + T ++P+ Y G + L + + DS
Sbjct: 233 STNGGGFLFFGDDIVPTSRVT-WVPMARTTSGNYYSPGSGTLYFDRRSLGMKPMEVVFDS 291
Query: 340 GASFTFLPTEIYAEVV 355
G+++ + E Y V
Sbjct: 292 GSTYAYFAAEPYQATV 307
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 119/269 (44%), Gaps = 37/269 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
++ +IG P + + D GS+L W+ C CIQC P Y + + DP +S
Sbjct: 67 YHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQPTNDLVVCKDPICAS 126
Query: 168 -SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAP 224
N C P D C Y +Y+ + SS G LV+D+ ++L S + P
Sbjct: 127 LHPDNYRCDDP------------DQCDYEVEYA-DGGSSIGVLVNDLFPVNLTSGMRARP 173
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
+ + IGCG Q L G A DGV+GLG G S+ + L+ GL++N CF
Sbjct: 174 R------LTIGCGYDQ----LPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCF 223
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---D 338
G +FFGD + P+ Y ++ + I N SG + L+ D
Sbjct: 224 SRRGGGYLFFGDD-IYDSSKVIWTPMSRDYLKHYTPGFAELILNG--RSSGLKNLLVVFD 280
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRI 367
SG+S+T+ T+ Y ++ K + K +
Sbjct: 281 SGSSYTYFNTQTYQTLLSFIKKDLHGKPL 309
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 114/265 (43%), Gaps = 35/265 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+YT I+IG P + + +D GS+ W+ C C C Y +
Sbjct: 16 YYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVY-------------KPT 62
Query: 168 SSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
K V PLC+ +++ C++ K C Y Y+ + +SS G L D + L + A
Sbjct: 63 EGKIVHPRDPLCEELQGNQNYCETCKQ-CDYEITYA-DRSSSKGVLARDNMQLTT----A 116
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
+ GC Q G LD + DG++GL G +S+ + LA +G+I N F C
Sbjct: 117 DGEMKNVDFVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMA 176
Query: 282 -DENDSGSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVESYCIGNSCLTQSG-----FQ 334
D + G +F GD + +++PI + Y V G L G Q
Sbjct: 177 TDPSSGGYMFLGDDY-VPRWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQ 235
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFD 359
+ DSG+S+T+ P EIY ++ +
Sbjct: 236 VIFDSGSSYTYFPHEIYTNLIALLE 260
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 114/264 (43%), Gaps = 43/264 (16%)
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-----CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y ++IG P + + +D GSNL W+ C C C + Y
Sbjct: 41 YVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLY-------------- 86
Query: 166 SSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
K V C+ PLC + C+ D C Y +Y+ + T+S G L+ D L +
Sbjct: 87 -RPKKLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINYA-DGTTSLGVLLLDKFSLPT 144
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLI- 273
S ++ GCG Q A DG++GLG G V + S L +G +
Sbjct: 145 GS--------ARNIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVS 196
Query: 274 QNSFSICFDENDSGSVFFGDQG-PATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLTQS 331
+N C G +F G++ P++ ++ I + + Y G + +G + +
Sbjct: 197 KNVIGHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTK 256
Query: 332 GFQALVDSGASFTFLPTEIYAEVV 355
F+A+ DSG+++T+LP ++A++V
Sbjct: 257 PFKAIFDSGSTYTYLPENLHAQLV 280
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 80/315 (25%), Positives = 130/315 (41%), Gaps = 34/315 (10%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+Y ++IG P + + +D GS+L W+ C C C + Y L
Sbjct: 57 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKL--------- 107
Query: 168 SSKNVSCSHPLCKSRSSCKS------LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
V C++ +C + S S + C Y Y T+ SS G LV D L +K
Sbjct: 108 ----VPCANSICTALHSGSSPNKKCTTQQQCDYQIKY-TDKASSLGVLVMDSFSLPLRNK 162
Query: 222 HAPQSSVQSSVIIGCG-RKQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNSF 277
S+V+ S+ GCG +Q G +GAAP DG++GLG G VS+ S L + G+ +N
Sbjct: 163 ----SNVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVL 216
Query: 278 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQAL 336
C + G +FFGD T + T + Y+ G + L+ + +
Sbjct: 217 GHCLSTSGGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVV 276
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
DSG+++T+ + Y + +S + S C+ + V D++ F
Sbjct: 277 FDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKG-QKAFKSVSDVKKDFK 335
Query: 397 KNQSFVVRNHIFSFP 411
Q +N + P
Sbjct: 336 SLQFIFGKNAVMDIP 350
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 162/359 (45%), Gaps = 46/359 (12%)
Query: 54 PKKNSVEYLELLLSNDWKRQKTRVKLQ-SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYT 112
P NS E + N +R R LQ SN+++S N Q+ N+ +L
Sbjct: 39 PFYNSAETSSQRMRNAIRRSA-RSTLQFSNDDASPNS-------PQSFITSNRGEYLMN- 89
Query: 113 WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I IGTP V L D GS+L+W QC P Y + +DP SS+ + V
Sbjct: 90 -ISIGTPPVPILAIADTGSDLIWT-----QCNPCEDCY----QQTSPLFDPKESSTYRKV 139
Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
SCS C++ +SC + ++ C Y Y +++ + G + D + + S S P S
Sbjct: 140 SCSSSQCRALEDASCSTDENTCSYTITYG-DNSYTKGDVAVDTVTMGS-SGRRPVS--LR 195
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEND 285
++IIGCG + TG++ A G++GLG G S+ S L K+ I FS C +
Sbjct: 196 NMIIGCGHENTGTF--DPAGSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSETGL 251
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCL--TQSGF-----QAL 336
+ + FG G + + +K A YF+ +E+ +G+ + T + F +
Sbjct: 252 TSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIV 311
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+DSG + T LP+ Y E+ + ++R+ CY SS KVPD+ + F
Sbjct: 312 IDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSS--FKVPDITVHF 368
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 90/338 (26%), Positives = 140/338 (41%), Gaps = 39/338 (11%)
Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
N + H I IGTP + +D GS+L+W IQCAP Y + +DP
Sbjct: 62 NAYIGQHLMEIYIGTPPIKITGLVDTGSDLIW-----IQCAPCLGCY----KQIKPMFDP 112
Query: 164 SSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
SS+ N+SC PLC K + S + C Y Y +++ + G L D A+F+ +
Sbjct: 113 LKSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYG-DNSLTKGVLAQDT---ATFTSN 168
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI--QNSFSIC 280
+ S + GCG TG + D G++GLG G SL+++ G + FS C
Sbjct: 169 TGKPVSLSRFLFGCGHNNTGGFNDHEM--GLIGLGGGPT---SLISQIGPLFGGKKFSQC 223
Query: 281 F-----DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGV------ESYCIGNSC 327
D S + FG P+ EK +YFV + ++Y NS
Sbjct: 224 LVPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNST 283
Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEML 386
+ ++ LVDSG LP ++Y +V + V+ K I+ + + CY + L
Sbjct: 284 IGKA--NMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTN--L 339
Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
K P + F + F P + C + +
Sbjct: 340 KGPTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIY 377
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 90/338 (26%), Positives = 141/338 (41%), Gaps = 53/338 (15%)
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+IGTP LVALD ++ W+PC C+ C S+S +DPS SSSS+ +
Sbjct: 93 NIGTPAQPMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQ 140
Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C P CK SC K C + Y ++ YL D L LA S V +
Sbjct: 141 CEAPQCKQAPNPSCTVSKS-CGFNMTYG--GSTIEAYLTQDTLTLA--------SDVIPN 189
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
GC K +G+ L G+MGLG G +S+ S L Q++FS C N SG
Sbjct: 190 YTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSG 244
Query: 288 SVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQAL 336
S+ G + P ++T L + Y+V + +GN + +G +
Sbjct: 245 SLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTI 304
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
DSG +T L Y V +F + V + + G + CY+ S + P + +F+
Sbjct: 305 FDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGG-FDTCYSGS----VVFPSVTFMFA 359
Query: 397 KNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
+ +++ + G+ +C + N +L
Sbjct: 360 GMNVTLPPDNLLI--HSSAGNLSCLAMAAAPVNVNSVL 395
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 140/329 (42%), Gaps = 41/329 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IGTP + D GS+L+W QC P + Y + +DP SSSS N++
Sbjct: 64 LSIGTPPIKIYAEADTGSDLVW-----FQCIPCTKCY----KQQNPMFDPRSSSSYTNIT 114
Query: 174 CSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C C S C + + C Y Y+ +++ + G L + L L S + +
Sbjct: 115 CGTESCNKLDSSLCSTDQKTCNYTYSYA-DNSITQGVLAQETLTLTSTTG---EPVAFQG 170
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA-GLIQNSFSICF-----DEND 285
+I GCG +G + D G++GLG G +S+ S + + G N FS C D +
Sbjct: 171 IIFGCGHNNSG-FNDREM--GLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSI 227
Query: 286 SGSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVE------SYCIGNSCLTQSGFQA 335
+ + FG T P+ G Y A +G+ + G+S T +
Sbjct: 228 TSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPFSNGSSLGTITKGNI 287
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
L+DSG + T+LP E Y ++ + V+ + + G ++ CY + L P + + F
Sbjct: 288 LIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDG--YELCYQTPTN--LNGPTLTIHF 343
Query: 396 SKNQSFVVRNHIFSFPENEVGDHACFSYF 424
+ +F P + D+ CF+ F
Sbjct: 344 EGGDVLLTPAQMF-IPVQD--DNFCFAVF 369
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 78/289 (26%), Positives = 127/289 (43%), Gaps = 27/289 (9%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ GTP + V D GSN+ W IQC P S Y + +DP+ SS+ +N+S
Sbjct: 20 VGFGTPKKNQTVIFDTGSNVNW-----IQCKPCVVSCYPQQE---PLFDPTLSSTYRNIS 71
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ C SS C Y Y + +S+ G+L + LA+ +V ++ I
Sbjct: 72 CTSAACTGLSSRGCSGSTCVYGVTYG-DGSSTVGFLATETFTLAA-------GNVFNNFI 123
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
GCG+ G + GAA G++GLG S+ S LA + + N FS C S + +
Sbjct: 124 FGCGQNNQGLF-TGAA--GLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGYLNI 178
Query: 294 QGP-ATQQSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQA---LVDSGASFTFLP 347
P T T+ L YF+ + +G + L+ + FQ+ ++DSG T LP
Sbjct: 179 GNPLRTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTVITRLP 238
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
Y + F ++ + + CY+ S + P ++L ++
Sbjct: 239 PTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYT 287
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 135/309 (43%), Gaps = 44/309 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ + +G+P + +D+GS+++WV C+ C QC Y D +DP++SSS
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLASFSKHAPQ 225
VSC +C++ S DYS + + + G L + L L
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGG------- 232
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---D 282
++VQ V IGCG + +G ++ A G++GLG G +S+ L G FS C
Sbjct: 233 TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVFSYCLASRG 286
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC---------LTQ 330
+GS+ G + A ++P+ A Y+VG+ +G LT+
Sbjct: 287 AGGAGSLVLG-RTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTE 345
Query: 331 SGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
G +V D+G + T LP E YA + FD + + S + CY+ S ++VP
Sbjct: 346 DGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVP 405
Query: 390 DMRLIFSKN 398
+ F +
Sbjct: 406 TVSFYFDQG 414
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 90/338 (26%), Positives = 141/338 (41%), Gaps = 53/338 (15%)
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+IGTP LVALD ++ W+PC C+ C S+S +DPS SSSS+ +
Sbjct: 93 NIGTPAQPMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQ 140
Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C P CK SC K C + Y ++ YL D L LA S V +
Sbjct: 141 CEAPQCKQAPNPSCTVSKS-CGFNMTYG--GSTIEAYLTQDTLTLA--------SDVIPN 189
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
GC K +G+ L G+MGLG G +S+ S L Q++FS C N SG
Sbjct: 190 YTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSG 244
Query: 288 SVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQAL 336
S+ G + P ++T L + Y+V + +GN + +G +
Sbjct: 245 SLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTI 304
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
DSG +T L Y V +F + V + + G + CY+ S + P + +F+
Sbjct: 305 FDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGG-FDTCYSGS----VVFPSVTFMFA 359
Query: 397 KNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
+ +++ + G+ +C + N +L
Sbjct: 360 GMNVTLPPDNLLI--HSSAGNLSCLAMAAAPVNVNSVL 395
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 156/366 (42%), Gaps = 50/366 (13%)
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQ-LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
LL +D R + ++ +N + Q + P+E + GN + + +GTP
Sbjct: 44 LLEHDQARVDSIHRMIANETAVVGQDVSLPAERGISVGTGN-----YVVSVGLGTPARDL 98
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-KSR 182
V D GS+L WV QC C+ S Y D + PSSSS+ V C P C ++R
Sbjct: 99 TVVFDTGSDLSWV--QCGPCS--SGGCYHQQD---PLFAPSSSSTFSAVRCGEPECPRAR 151
Query: 183 SSCKSLK--DPCPYIADYSTEDTSSSGYLVDDILHLASF-SKHAPQ--SSVQSSVIIGCG 237
SC S D CPY Y + + + G+L +D L L + S +A + S+ + GCG
Sbjct: 152 QSCSSSPGDDRCPYEVVYG-DKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGFVFGCG 210
Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQ 294
TG L G A DG+ GLG G VS+ S AG FS C N G + G
Sbjct: 211 ENNTG--LFGKA-DGLFGLGRGKVSLSS--QAAGKYGEGFSYCLPSSSSNAHGYLSLGTP 265
Query: 295 GPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQAL------VDSGASFTF 345
PA + F P+ + + Y+V + + + S AL VDSG T
Sbjct: 266 APAPAHA-RFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVDSGTVITR 324
Query: 346 LPTEIYAEVVVKFDKLVS------SKRISLQGNSWKYCYN--ASSEEMLKVPDMRLIFSK 397
L Y+ + F + + R+S+ CY+ A + + +P + L+F+
Sbjct: 325 LAPRAYSALRTAFLSAMGKYGYKRAPRLSI----LDTCYDFTAHANATVSIPAVALVFAG 380
Query: 398 NQSFVV 403
+ V
Sbjct: 381 GATISV 386
>gi|388513215|gb|AFK44669.1| unknown [Lotus japonicus]
Length = 101
Score = 82.4 bits (202), Expect = 4e-13, Method: Composition-based stats.
Identities = 45/93 (48%), Positives = 62/93 (66%), Gaps = 8/93 (8%)
Query: 16 LDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKT 75
++G AV+FSS+LVHRFS+EAK S+ GN + SWP K++ EY LLL++D RQ
Sbjct: 17 MEGEAAVTFSSRLVHRFSEEAKVHLASR-GNGAALQSWPNKSTSEYFRLLLNSDLTRQ-- 73
Query: 76 RVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYW 108
R+KL S S ++PS+G QT FFGN++ W
Sbjct: 74 RMKLGSQYES-----MYPSKGGQTFFFGNEWNW 101
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 125/277 (45%), Gaps = 28/277 (10%)
Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
++ + +D GS +VPC+ C +C + YY DR++ +S C +
Sbjct: 50 TYDLIVDTGSARTYVPCKGCARCGEHAHGYY-DYDRSMEFERLDCGEASDATLCEETM-- 106
Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
+ +C+S C Y+ Y+ E +SS GY+V D + L + ++ + + GC +
Sbjct: 107 -KGTCQS-DGRCSYVVSYA-EGSSSRGYVVRDRVRLG-------EGTLSAMLAFGCEEAE 156
Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDS----GSVFFGD 293
T + + A DG+ G G G +V + LA AGLI+N FS C F N G FG
Sbjct: 157 TNAIYEQKA-DGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLGRFDFGA 215
Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ-SGFQALVDSGASFTFLPTEIYA 352
PA + T + + V S+ +G+S + + + +DSG +FTF+P ++
Sbjct: 216 DAPALAR-TPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGTTFTFVPRSVWV 274
Query: 353 EVVVKFDKLVSSKRISL-QGNSWKY---CYNASSEEM 385
+ D + + + G +Y CY S+ M
Sbjct: 275 SFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAM 311
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 135/309 (43%), Gaps = 44/309 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ + +G+P + +D+GS+++WV C+ C QC Y D +DP++SSS
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLASFSKHAPQ 225
VSC +C++ S DYS + + + G L + L L
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGG------- 232
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---D 282
++VQ V IGCG + +G ++ A G++GLG G +S+ L G FS C
Sbjct: 233 TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLIGQL--GGAAGGVFSYCLASRG 286
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC---------LTQ 330
+GS+ G + A ++P+ A Y+VG+ +G LT+
Sbjct: 287 AGGAGSLVLG-RTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTE 345
Query: 331 SGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
G +V D+G + T LP E YA + FD + + S + CY+ S ++VP
Sbjct: 346 DGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVP 405
Query: 390 DMRLIFSKN 398
+ F +
Sbjct: 406 TVSFYFDQG 414
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 90/350 (25%), Positives = 144/350 (41%), Gaps = 42/350 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+Y ++IG P + + +D GS+L W+ C C C + Y +
Sbjct: 52 YYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLY-------------KPT 98
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPC--PYIADYS---TEDTSSSGYLVDDILHLASFSKH 222
+K V C+ +C + S +S C P DY T+ SS G LV D L +
Sbjct: 99 KNKLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRN-- 156
Query: 223 APQSSVQSSVIIGCGRKQT--GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
SSV+ S GCG Q + + A DG++GLG G VS+ S L G+ +N C
Sbjct: 157 --SSSVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHC 214
Query: 281 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALVD 338
N G +FFGD T ++T ++P+ Y G + L + + D
Sbjct: 215 LSTNGGGFLFFGDNVVPTSRAT-WVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEVVFD 273
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSK 397
SG+++T+ + Y V +S + S C+ +++ K V D++ F
Sbjct: 274 SGSTYTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPLCWKG--QKVFKSVSDVKNDFKS 331
Query: 398 NQSFVVRNHIFSF-PENEV----GDHACF-----SYFTLEYNFTGILILQ 437
V+N + PEN + +AC S L +N G + +Q
Sbjct: 332 LFLSFVKNSVLEIPPENYLIVTKNGNACLGILDGSAAKLTFNIIGDITMQ 381
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 89/311 (28%), Positives = 135/311 (43%), Gaps = 47/311 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP + + LD GS+L W+ C CI C S YY DP SSS +N++C
Sbjct: 198 IGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYY----------DPKESSSFENITC 247
Query: 175 SHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSS 227
P CK SS CK CPY Y ++ + ++ ++L + + + Q
Sbjct: 248 HDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKH 307
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDS 286
V++ V+ GCG G + A ++GLG G +S S L + +SFS C D N
Sbjct: 308 VEN-VMFGCGHWNRGLFHGAAG---LLGLGRGPLSFASQLQ--SIYGHSFSYCLVDRNSD 361
Query: 287 GSV----FFGDQGPATQQS----TSFLPIGEKYDA---YFVGVESYCIGNSCLT------ 329
SV FG+ TSF+ GE+ Y+VG++S + L
Sbjct: 362 TSVSSKLIFGEDKELLSHPNLNFTSFVG-GEENSVDTFYYVGIKSIMVDGEVLKIPEETW 420
Query: 330 ----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
+ G ++DSG + T+ Y + F K + + K CYN S E
Sbjct: 421 HLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPLKPCYNVSGIEK 480
Query: 386 LKVPDMRLIFS 396
+++PD ++FS
Sbjct: 481 MELPDFGILFS 491
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 83/321 (25%), Positives = 137/321 (42%), Gaps = 41/321 (12%)
Query: 110 HYTWIDIGTPNVSFLVAL-DAGSNLLWVPCQ-CIQCAPLSASYYTS---LDRNLSEYDPS 164
+Y I +G P V FL A+ D GS++LW C+ C C+ S + ++ YDP
Sbjct: 88 YYAQIGVGHP-VQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSSIIMQGPITLYDPE 146
Query: 165 SSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSKHA 223
S ++ +CS PLC SC+ + C Y D S EDTSSS G D++HL
Sbjct: 147 LSITASPATCSDPLCSEGGSCRGNNNSCAY--DISYEDTSSSTGIYFRDVVHLGH----- 199
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD- 282
++S+ +++ +GC +G + DG+MG G VSVP+ LA N F C
Sbjct: 200 -KASLNTTMFLGCATSISGLW----PVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLSG 254
Query: 283 ENDSGSVFF---GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------- 328
E + G + D+ P + P+ Y V + S + + L
Sbjct: 255 EKEGGGILVLGKNDEFP----EMVYTPMLANDIVYNVKLVSLSVNSKALPIEASEFEYNA 310
Query: 329 TQSGFQALVDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
T ++DSG S P++ ++ + V KF + + + G+ + +
Sbjct: 311 TVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAIPTAPLESSGSPCFISISDRNSVE 370
Query: 386 LKVPDMRLIFSKNQSFVVRNH 406
+ P++ L F + + H
Sbjct: 371 VDFPNVTLKFDGGATMELTAH 391
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 67/221 (30%), Positives = 101/221 (45%), Gaps = 19/221 (8%)
Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS 206
+ L +L+ YDP+ S +S V C C S CK CPY Y + +++S
Sbjct: 40 SGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITYG-DGSTTS 97
Query: 207 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVP 264
G V+D L S + SSVI GCG KQ+GS + A DG++G G + SV
Sbjct: 98 GSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVL 157
Query: 265 SLLAKAGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 323
S LA +G ++ FS C D + G +F G +T +P Y+ ++
Sbjct: 158 SQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMD--VD 215
Query: 324 GNSCL-------TQSGFQALVDSGASFTFLPTEIYAEVVVK 357
G L + SG ++DSG + +LP IY +++ K
Sbjct: 216 GEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPK 256
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 126/269 (46%), Gaps = 40/269 (14%)
Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
GN + +Y+ I+IG + +F +D+GS+L WV C C C Y + L+
Sbjct: 47 GNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALN 106
Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD--ILHLA 217
++P +S HP+ + CKS D C Y +Y+ + SS G LV+D L L
Sbjct: 107 CFEPLCTSL-------HPI--TNHHCKSADDQCQYEIEYA-DHGSSLGVLVNDHVPLKLT 156
Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAGLIQNS 276
+ S AP+ + GCG S D + P GV+GLG G+VS S L+ G+++N
Sbjct: 157 NGSLAAPR------IAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNV 210
Query: 277 FSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKY-----DAYFVGVESYCIGNSC 327
C + + G +FFGD+ T S S IG Y + YF G + G
Sbjct: 211 VGHCLSD-EGGFLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKAT---GIKD 266
Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVV 356
LT + DSG+S+T+ ++ Y ++
Sbjct: 267 LT-----LVFDSGSSYTYFNSQAYNSILA 290
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 80/310 (25%), Positives = 137/310 (44%), Gaps = 46/310 (14%)
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
L L +D R++ ++ + S FP GS + +Y I +G P+
Sbjct: 73 LAHLREHDAHRRR---RILESPAESPGASTFPLHGSVKE------HGYYYANIALGDPSP 123
Query: 122 -SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
+F V +D GS L +VPC C +C + + +DP+ K ++C C
Sbjct: 124 RTFQVIVDTGSTLTYVPCATCAKCGTHTGG---------TRFDPTG----KWLTCQEKQC 170
Query: 180 KSRSS---CKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
K+ C + + C Y Y+ E + SG LV D +H AP ++ V
Sbjct: 171 KAAGGPGICAGGRGAAANRCTYSRTYA-EGSGVSGDLVRDKMHFGG--DIAPATNGTLDV 227
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGD-VSVPSLLAKAGLIQNSFSICFDENDSGSVFF 291
+ GC ++G+ D A DG++GLG S+P+ LA + FS+CF + G
Sbjct: 228 VFGCTNAESGTIHDQEA-DGLIGLGNNQFASIPNQLADTHGLPRVFSLCFGSFEGGGALS 286
Query: 292 GDQGPATQQSTSF----LPIGEKYDAYFV-GVESYCIGNSCLTQS-----GFQALVDSGA 341
+ PAT + + + E + AY+V + IG+ + G+ ++DSG
Sbjct: 287 FGRLPATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVATPSDLAVGYGTVMDSGT 346
Query: 342 SFTFLPTEIY 351
+FT++PT+++
Sbjct: 347 TFTYVPTKVF 356
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 157/391 (40%), Gaps = 66/391 (16%)
Query: 3 NLVAICMLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYL 62
N+V + LF + + + FS L+HR D + P K E L
Sbjct: 11 NVVVVGFLFQLLEVALARGGGFSVDLIHR--DSPHSPFFD-----------PSKTQAERL 57
Query: 63 ELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVS 122
++ ++R +RV S+G Q+ + +L +I GTP V
Sbjct: 58 ----TDAFRRSVSRV-------GRFRPTAMTSDGIQSRIVPSAGEYLMNLYI--GTPPVP 104
Query: 123 FLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC--- 179
+ +D GS+L W QC P + Y + + +DP +SS+ ++ SC C
Sbjct: 105 VIAIVDTGSDLTWT-----QCRPCTHCY----KQVVPLFDPKNSSTYRDSSCGTSFCLAL 155
Query: 180 -KSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
K RS K K C + YS D S + G L + L + S A + GCG
Sbjct: 156 GKDRSCSKEKK--CTF--RYSYADGSFTGGNLASETLTVDS---TAGKPVSFPGFAFGCG 208
Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFG 292
G + ++ G++GLG G++S+ S L I FS C D + S + FG
Sbjct: 209 HSSGGIFDKSSS--GIVGLGGGELSLISQLKST--INGLFSYCLLPVSTDSSISSRINFG 264
Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYA 352
G + T P+ Y Y E + G +VDSG ++TFLP E Y+
Sbjct: 265 ASGRVSGYGTVSTPLRLPYKGYSKKTE---------VEEG-NIIVDSGTTYTFLPQEFYS 314
Query: 353 EVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
++ + KR+ + CYN ++E
Sbjct: 315 KLEKSVANSIKGKRVRDPNGIFSLCYNTTAE 345
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 138/316 (43%), Gaps = 54/316 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I++G+P F +D GS+L+W+ C+ C QC Y+ D YDPS+SS+
Sbjct: 8 IELGSPPKKFNAIVDTGSDLVWIQCKPCSQC-------YSQSD---PIYDPSASSTFAKT 57
Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
SCS C+S S C S C Y Y + +S+ G + L L S S
Sbjct: 58 SCSTSSCQSLPASGCSSSAKTCIYGYQYG-DSSSTQGDFALETLTLRS---SGGSSKAFP 113
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSG 287
+ GCGR +GS+ GAA G++GLG G +S+ + L A I N FS C FD++ S
Sbjct: 114 NFQFGCGRLNSGSF-GGAA--GIVGLGQGKISLSTQLGSA--INNKFSYCLVDFDDDSSK 168
Query: 288 S--VFFGDQGPATQQ--STSFLPIGEKYDAYFVGVESYCIGNSCLT-------------- 329
+ + FG ST +P + YFVG+E +G L+
Sbjct: 169 TSPLIFGSSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSK 228
Query: 330 ----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
SG + DSG + T L +Y++V F VS + + + CY+
Sbjct: 229 KKLRVRALEVNSG-GTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYD 287
Query: 380 ASSEEMLKVPDMRLIF 395
S + K P + L F
Sbjct: 288 VSKSKNFKFPALTLAF 303
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 130/297 (43%), Gaps = 40/297 (13%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP V D GS+L WV QC P S Y ++ +DP+ SS+ V
Sbjct: 150 MGLGTPARDMTVVFDTGSDLSWV-----QCTPCSDCY----EQKDPLFDPARSSTYSAVP 200
Query: 174 CSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
C+ P C+ SRS + K C Y Y + + + G L D L L QS V
Sbjct: 201 CASPECQGLDSRSCSRDKK--CRYEVVYG-DQSQTDGALARDTLTLT-------QSDVLP 250
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF 290
+ GCG + TG L G A DG++GLG VS+ S A FS C + S + +
Sbjct: 251 GFVFGCGEQDTG--LFGRA-DGLVGLGREKVSLSSQAASK--YGAGFSYCLPSSPSAAGY 305
Query: 291 FGDQGPATQQSTSFLPIGEKYDA---YF-----VGVESYCIGNSCLTQSGFQALVDSGAS 342
GPA + F + ++D+ Y+ V V + S + S ++DSG
Sbjct: 306 LSLGGPAPANA-RFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSGTV 364
Query: 343 FTFLPTEIYAEVVVKFDKLVSS---KRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
T LP +YA + F + + KR + CY+ + +++P + L+F+
Sbjct: 365 ITRLPPRVYAALRSAFARSMGRYGYKRAPAL-SILDTCYDFTGHTTVRIPSVALVFA 420
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 112/251 (44%), Gaps = 30/251 (11%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
IG P + + +D GS+L W+ C C C + Y ++++ V
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY-------------RPTANRLVP 47
Query: 174 CSHPLCKSRSSCKSLKDPCP--YIADYS---TEDTSSSGYLVDDILHLASFSKHAPQSSV 228
C++ LC + S + + CP DY T+ SS G L++D SFS S++
Sbjct: 48 CANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLIND-----SFSLPMRSSNI 102
Query: 229 QSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+ + GCG Q AA DG++GLG G VS+ S L + G+ +N C N
Sbjct: 103 RPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGG 162
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALVDSGASFT 344
G +FFGD + + T ++P+ ++ Y G + L + + DSG+++T
Sbjct: 163 GFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYT 221
Query: 345 FLPTEIYAEVV 355
+ + Y VV
Sbjct: 222 YFTAQPYQAVV 232
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 126/299 (42%), Gaps = 31/299 (10%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+ WV QC P + Y ++ +DP+SSS+ NVS
Sbjct: 187 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVACYEQREK---LFDPASSSTYANVS 238
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C C Y Y + + S G+ D L L+S+
Sbjct: 239 CAAPACSDLDVSGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 290
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--F 291
GCG + G + + A G++GLG G S+P + G F+ C +G+ + F
Sbjct: 291 FGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPARSTGTGYLDF 345
Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA---LVDSGASFTFL 346
G P +T L G Y+VG+ +G L S F A +VDSG T L
Sbjct: 346 GAGSPPATTTTPML-TGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRL 404
Query: 347 PTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
P Y+ + F ++++ R + + CY+ + + +P + L+F + V
Sbjct: 405 PPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDV 463
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 147/327 (44%), Gaps = 57/327 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ +G+P + +D+GS+++WV C+ C++C Y D +DP++S++ V
Sbjct: 175 VSVGSPPTEQYLVVDSGSDVMWVQCKPCLEC-------YVQAD---PLFDPATSATFSGV 224
Query: 173 SCSHPLCK--SRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
SC +C+ S+C + C Y Y+ + + + G L + L L +
Sbjct: 225 SCGSAICRILPTSACGDGELGGCEYEVSYA-DGSYTKGALALETLTLG--------GTAV 275
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-------- 281
V+IGCG + G ++ A G+MGLG G +S+ L G + +FS C
Sbjct: 276 EGVVIGCGHRNRGLFVGAA---GLMGLGWGPMSLVGQL--GGEVGGAFSYCLASRGGYGS 330
Query: 282 --DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT-QSG-FQ 334
++D+G + G + A + ++P+ A Y+VG+ +G+ L Q+G FQ
Sbjct: 331 GAADDDAGWLVLG-RSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQ 389
Query: 335 --------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSE 383
++D+G + T LP E YA + F ++ QG S CY+ S
Sbjct: 390 LTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGY 449
Query: 384 EMLKVPDMRLIFSKNQSFVV--RNHIF 408
++VP + F + ++ RN +
Sbjct: 450 ASVRVPTVSFCFDGDARLILAARNVLL 476
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 96/389 (24%), Positives = 149/389 (38%), Gaps = 78/389 (20%)
Query: 47 VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
VS W N + L L ++ K KT L LFP
Sbjct: 38 VSSKKPWGSLNHLASLSLSRAHHIKSPKTNFSLIKTP-------LFPRS----------- 79
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYD 162
Y + ++ GTP + +D GS+L+W PC C +C ++ + +
Sbjct: 80 YGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSEC-----NFPNIKKTGIPTFL 134
Query: 163 PSSSSSSKNVSCSHPLC--------KSR-SSCKSLKDPC-----PYIADYSTEDTSSSGY 208
P SSSSK + C +P C +S+ C S C PY+ Y + S++G
Sbjct: 135 PKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSG--STAGL 192
Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
L+ + L P ++GC S P+G+ G G S+PS L
Sbjct: 193 LLSETLDF-------PNKKTIPDFLVGC------SIFSIKQPEGIAGFGRSPESLPSQLG 239
Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQG-------PATQQSTSFL--PIGEKYDAYFVGVE 319
S FD+ + S D G A T FL P D Y+V +
Sbjct: 240 LKKFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLR 299
Query: 320 SYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 369
+ IG++ + T +VDSG +FTF+ +Y V +F+K ++ ++
Sbjct: 300 NIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVAT 359
Query: 370 QGNS---WKYCYNASSEEMLKVPDMRLIF 395
+ + + CYN S E+ L VPD+ F
Sbjct: 360 EIQNLTGLRPCYNISGEKSLSVPDLIFQF 388
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 126/299 (42%), Gaps = 31/299 (10%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+ WV QC P + Y ++ +DP+SSS+ NVS
Sbjct: 184 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVACYEQREK---LFDPASSSTYANVS 235
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C C Y Y + + S G+ D L L+S+
Sbjct: 236 CAAPACSDLDVSGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 287
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--F 291
GCG + G + + A G++GLG G S+P + G F+ C +G+ + F
Sbjct: 288 FGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPPRSTGTGYLDF 342
Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA---LVDSGASFTFL 346
G P +T L G Y+VG+ +G L S F A +VDSG T L
Sbjct: 343 GAGSPPATTTTPML-TGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRL 401
Query: 347 PTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
P Y+ + F ++++ R + + CY+ + + +P + L+F + V
Sbjct: 402 PPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDV 460
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 75/263 (28%), Positives = 114/263 (43%), Gaps = 32/263 (12%)
Query: 103 GNQFYWLHYTWI-DIGTPNVSFLVALDAGSNLLWVPCQ--CIQCA-PLSASYYTSLDRNL 158
GN + HY+ I +IG P +F + +D GS+L WV C C C PL Y +R
Sbjct: 60 GNVYPTGHYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDKLYKPKNNR-- 117
Query: 159 SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
V C+ LC++ ++C + C Y +Y+ + SS G L+ D L
Sbjct: 118 -------------VPCASSLCQAIQNNNCDIPTEQCDYEVEYA-DLGSSLGVLLSDYFPL 163
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAGLI 273
+ S +Q + GCG Q YL +P G++GLG G S+ S L G+
Sbjct: 164 ----RLNNGSLLQPRIAFGCGYDQ--KYLGPHSPPDTAGILGLGRGKASILSQLRTLGIT 217
Query: 274 QNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 332
QN CF G +FFGD P + + + + Y G G G
Sbjct: 218 QNVVGHCFSRVTGGFLFFGDHLLPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKG 277
Query: 333 FQALVDSGASFTFLPTEIYAEVV 355
Q + DSG+S+T+ ++Y ++
Sbjct: 278 LQLIFDSGSSYTYFNAQVYQSIL 300
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 130/303 (42%), Gaps = 54/303 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ + +G+P + +D+GS+++WV C+ C QC Y D +DP++SSS
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLASFSKHAPQ 225
VSC +C++ S DYS + + + G L + L L
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGG------- 232
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
++VQ V IGCG + +G ++ A G++GLG G +S+ L G FS C
Sbjct: 233 TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVFSYCLASRG 286
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC---------LTQSGFQAL 336
+G A ++SF Y+VG+ +G LT+ G +
Sbjct: 287 AGG--------AGSLASSF---------YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGV 329
Query: 337 V-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
V D+G + T LP E YA + FD + + S + CY+ S ++VP + F
Sbjct: 330 VMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYF 389
Query: 396 SKN 398
+
Sbjct: 390 DQG 392
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 146/364 (40%), Gaps = 53/364 (14%)
Query: 55 KKNSVEYLELLLSNDWK----RQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
K S +++E+L + + K KL +++ S P++ T GN +
Sbjct: 50 KATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGN-----Y 104
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ +GTP + D GS+L W CQ C++ T D+ ++PS S+S
Sbjct: 105 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR---------TCYDQKEPIFNPSKSTSY 155
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
NVSCS C S SS C Y Y + + S G+L + L
Sbjct: 156 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFLAKEKFTLT------- 207
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
S V V GCG G + A G++GLG +S PS A A FS C +
Sbjct: 208 NSDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSS 262
Query: 285 DS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVGVESYCIGNSCLTQSG 332
S G + FG G +S F PI D A VG + I ++ + G
Sbjct: 263 ASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPG 320
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
AL+DSG T LP + YA + F +S + + C++ S + + +P +
Sbjct: 321 --ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVA 378
Query: 393 LIFS 396
FS
Sbjct: 379 FSFS 382
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 128/294 (43%), Gaps = 42/294 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IGTP + D GS L+W QC P A Y + +DP+ S+S K +
Sbjct: 136 VGIGTPKKEMPLIFDTGSGLIWT-----QCKPCKACY-----PKVPVFDPTKSASFKGLP 185
Query: 174 CSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
CS LC+S R C S K C Y+ Y +++SS+G L + + + ++
Sbjct: 186 CSSKLCQSIRQGCSSPK--CTYLTAY-VDNSSSTGTLATETISFSHLKYDF------KNI 236
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN--DSGSVF 290
+IGC + +G L G+MGL +S+ S A + FS C +G +
Sbjct: 237 LIGCSDQVSGESL---GESGIMGLNRSPISLAS--QTANIYDKLFSYCIPSTPGSTGHLT 291
Query: 291 FGDQGPATQQSTSFLPIGEK-----YDAYFVGVESYCIGNSCLT--QSGFQ--ALVDSGA 341
FG + P F P+ + YD G+ +G L S F+ + +DSGA
Sbjct: 292 FGGKVP---NDVRFSPVSKTAPSSDYDIKMTGIS---VGGRKLLIDASAFKIASTIDSGA 345
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T LP + Y+ + F +++ + Q + CY+ S+ + +P + + F
Sbjct: 346 VLTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFF 399
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 131/307 (42%), Gaps = 29/307 (9%)
Query: 71 KRQKTRVK-LQSNNNSSRNQLLFPSEGSQTHF--FGNQFYWLHY-TWIDIGTPNVSFLVA 126
KR+ R L SSR L+ + GS F +GN + Y ++IG P + +
Sbjct: 31 KRKSGRNSILPGEAMSSRPSLMNHAAGSSIVFPIYGNVYPVGFYNVTLNIGQPPRPYFLD 90
Query: 127 LDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
+D GS L W+ C C QC+ Y S+ + C PLC S
Sbjct: 91 VDTGSELTWLQCDAPCSQCSETPHPLY--------------KPSNDFIPCKDPLCASLQP 136
Query: 185 CK--SLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
+ +DP C Y Y+ + S+ G L++D+ +L +F+ ++ + +GCG Q
Sbjct: 137 TDDYTCEDPNQCDYEIKYA-DQYSTLGVLLNDV-YLLNFTNGV---QLKVRMALGCGYDQ 191
Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
S DG++GLG G S+ S L GL++N C G +FFG+ +++
Sbjct: 192 IFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRGGGYIFFGNVYDSSRM 251
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
S + + + Y G G + D+G+S+T+ ++ Y ++ +K
Sbjct: 252 SWTPISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTYFNSQAYQAMISLLNK 311
Query: 361 LVSSKRI 367
+ K I
Sbjct: 312 ELHRKPI 318
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 85/340 (25%), Positives = 142/340 (41%), Gaps = 55/340 (16%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSE-YDPSSSS 167
++ + +GTP L+ +D GS+++W+ C+ C+ C R LS YDP SS
Sbjct: 99 YFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCY-----------RQLSPLYDPRGSS 147
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+ CS P C++ +C C Y Y + +S+SG L D L ++ +S
Sbjct: 148 TYAQTPCSPPQCRNPQTCDGTTGGCGYRIVYG-DASSTSGNLATDRLVFSN------DTS 200
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDS 286
V +V +GCG G + A G++G+ G+ S + +A + F+ C D S
Sbjct: 201 V-GNVTLGCGHDNEGLFGSAA---GLLGVARGNNSFATQVADS--YGRYFAYCLGDRTRS 254
Query: 287 GS----VFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCLTQSGFQ----- 334
GS + FG P S F P+ + Y+V + + +G +T GF
Sbjct: 255 GSSSSYLVFGRTAPEPPSSV-FTPLRSNPRRPSLYYVDMVGFSVGGEPVT--GFSNASLS 311
Query: 335 ---------ALVDSGASFTFLPTEIYAEVVVKFDKL---VSSKRISLQGNSWKYCYNASS 382
+VDSG S T + Y + FD V +++ + + CY+
Sbjct: 312 LDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRG 371
Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFS 422
+ P + L F+ + + PE E G + CF+
Sbjct: 372 VAVADAPGVVLHFAGGADVALPPENYLVPE-ESGRYHCFA 410
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 126/299 (42%), Gaps = 31/299 (10%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+ WV QC P + Y ++ +DP+SSS+ NVS
Sbjct: 183 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVACYEQREK---LFDPASSSTYANVS 234
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C C Y Y + + S G+ D L L+S+
Sbjct: 235 CAAPACSDLDVSGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 286
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--F 291
GCG + G + + A G++GLG G S+P + G F+ C +G+ + F
Sbjct: 287 FGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPARSTGTGYLDF 341
Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA---LVDSGASFTFL 346
G P +T L G Y+VG+ +G L S F A +VDSG T L
Sbjct: 342 GAGSPPATTTTPML-TGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRL 400
Query: 347 PTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 403
P Y+ + F ++++ R + + CY+ + + +P + L+F + V
Sbjct: 401 PPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDV 459
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/258 (27%), Positives = 113/258 (43%), Gaps = 27/258 (10%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+YT I IG P + + +D GS+L W+ C C CA Y +
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIV-------- 238
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+++ C L +++ C++ K C Y +Y+ + +SS G L D +H+ + +
Sbjct: 239 PPRDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHMIATNG----GR 291
Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
+ + GC Q G L A DG++GL +S PS LA G+I N F C ++
Sbjct: 292 EKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQG 351
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYD-AYFVGVESYCIGNSCLTQ-----SGFQALVD 338
G +F GD + ++ I D Y G+ L + S Q + D
Sbjct: 352 GGGYMFLGDDY-VPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410
Query: 339 SGASFTFLPTEIYAEVVV 356
SG+S+T+LP EIY +V
Sbjct: 411 SGSSYTYLPNEIYENLVA 428
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 123/278 (44%), Gaps = 35/278 (12%)
Query: 103 GNQFYWLHYTWI-DIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
GN + +Y+ I +IG P +F +D GS+L WV C C C Y NL
Sbjct: 46 GNVYPTGYYSVILNIGNPPKAFDFDIDTGSDLTWVQCDAPCKGCTKPRDKLYKP-KNNL- 103
Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
V CS+ LC++ S+ C + D C Y +Y+ + SS G L+ D
Sbjct: 104 ------------VPCSNSLCQAVSTGENYHCDAPDDQCDYEIEYA-DLGSSIGVLLSDSF 150
Query: 215 HLASFSKHAPQSSVQSSVIIGCG--RKQTGSYLDGAAPD--GVMGLGLGDVSVPSLLAKA 270
L + + + +Q + GCG +K G + PD G++GLG G VS+ S L
Sbjct: 151 PL----RLSNGTLLQPKMAFGCGYDQKHLGPH---PPPDTAGILGLGRGKVSILSQLRTL 203
Query: 271 GLIQNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
G+ QN CF G +FFGD P+++ + + + Y G G
Sbjct: 204 GITQNVVGHCFSRARGGFLFFGDHLFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTG 263
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 367
G Q + DSG+S+T+ ++Y ++ K ++ K +
Sbjct: 264 IKGLQLIFDSGSSYTYFNAQVYQSILNLVRKDLAGKPL 301
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/316 (24%), Positives = 139/316 (43%), Gaps = 42/316 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ + IG+P + +D+GS+++WV C+ C++C Y D +DP++S++
Sbjct: 127 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLEC-------YAQAD---PLFDPATSAT 176
Query: 169 SKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
V C +C++ R+S C Y Y + + + G L + L L +
Sbjct: 177 FSAVPCGSAVCRTLRTSGCGDSGGCDYEVSYG-DGSYTKGALALETLTLG--------GT 227
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
V IGCG + G ++ A G++GLG G +S+ L A +FS C +G
Sbjct: 228 AVEGVAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGAG 282
Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC---------LTQSGFQA 335
S+ G + A + ++P+ A Y+VG+ +G+ LT+ G
Sbjct: 283 SLVLG-RSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGG 341
Query: 336 LV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
+V D+G + T LP E YA + F V + + + CY+ S ++VP +
Sbjct: 342 VVMDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFY 401
Query: 395 FSKNQSFVV--RNHIF 408
F + + RN +
Sbjct: 402 FDGAATLTLPARNLLL 417
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 159/396 (40%), Gaps = 57/396 (14%)
Query: 57 NSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQ---LLFPSEGSQTHFFGNQFYWLHYTW 113
+ E L L D KR N +R ++ P G ++T
Sbjct: 91 TAAELLGHRLQRDGKRAARISAAAGAANGTRRTGSGVVAPVVSGLAQGSGE-----YFTK 145
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +GTP L+ LD GS+++W +QCAP Y D++ +DP S S V
Sbjct: 146 IGVGTPATPALMVLDTGSDVVW-----LQCAPCRRCY----DQSGQVFDPRRSRSYGAVG 196
Query: 174 CSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
CS PLC+ S C + C Y Y + + ++G + L A ++ A
Sbjct: 197 CSAPLCRRLDSGGCDLRRKACLYQVAYG-DGSVTAGDFATETLTFAGGARVA-------R 248
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND------ 285
+ +GCG G ++ A ++GLG G +S P+ +++ SFS C +
Sbjct: 249 IALGCGHDNEGLFVAAAG---LLGLGRGSLSFPAQISR--RYGRSFSYCLVDRTSSANPA 303
Query: 286 --SGSVFFGDQGPATQQSTSFLPIGEK------YDAYFVGVESYCIGNSCLTQSGFQ--- 334
S +V FG + + SF P+ + Y VG+ S + S +
Sbjct: 304 SHSSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDP 363
Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLK 387
+VDSG S T L Y+ + F + R+S G S + CY+ S +++K
Sbjct: 364 SSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVK 423
Query: 388 VPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
VP + + F+ + + P + G CF++
Sbjct: 424 VPTVSMHFAGGAEAALPPENYLIPVDSKGTF-CFAF 458
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/338 (26%), Positives = 142/338 (42%), Gaps = 53/338 (15%)
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+IGTP + LVALD ++ W+PC C+ C S+S +DPS SSSS+ +
Sbjct: 93 NIGTPAQAMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQ 140
Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C P CK SC K C + Y ++ YL D L LA + V +
Sbjct: 141 CEAPQCKQAPNPSCTVSKS-CGFNMTYG--GSAIEAYLTQDTLTLA--------TDVIPN 189
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSG 287
GC K +G+ L G+MGLG G +S+ S L Q++FS C N SG
Sbjct: 190 YTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSG 244
Query: 288 SVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQAL 336
S+ G + P ++T L + Y+V + +GN + +G +
Sbjct: 245 SLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTI 304
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
DSG +T L Y + +F + V + + G + CY+ S + P + +F+
Sbjct: 305 FDSGTVYTRLVEPAYVAMRNEFRRRVKNANATSLGG-FDTCYSGS----VVFPSVTFMFA 359
Query: 397 KNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
+ +++ + G+ +C + N +L
Sbjct: 360 GMNVTLPPDNLLI--HSSAGNLSCLAMAAAPTNVNSVL 395
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 85/300 (28%), Positives = 138/300 (46%), Gaps = 38/300 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I +GTP + D GS+LLW C+ C C YT +D +DP +SS+ K+V
Sbjct: 98 ISLGTPPFPIMAIADTGSDLLWTQCKPCDDC-------YTQVD---PLFDPKASSTYKDV 147
Query: 173 SCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
SCS C ++++SC + + C Y Y + + + G + D L L S Q
Sbjct: 148 SCSSSQCTALENQASCSTEDNTCSYSTSYG-DRSYTKGNIAVDTLTLGSTDTRPVQ---L 203
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DEND 285
++IIGCG G++ G++GLG G VS+ + L + I FS C END
Sbjct: 204 KNIIIGCGHNNAGTF--NKKGSGIVGLGGGAVSLITQLGDS--IDGKFSYCLVPLTSEND 259
Query: 286 SGS-VFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQA------- 335
S + FG + P+ K Y++ ++S +G+ + G +
Sbjct: 260 RTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNI 319
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
++DSG + T LPTE Y+E+ + +++ CY+A+ + LKVP + + F
Sbjct: 320 IIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGD--LKVPAITMHF 377
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/226 (27%), Positives = 105/226 (46%), Gaps = 18/226 (7%)
Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYD 162
N ++YT + IGTP F V +D GS++LWV C C+ C PL +N++ +D
Sbjct: 76 NPISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCISCVGC-PL---------QNVTFFD 125
Query: 163 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
P +SSS+ ++CS C S KS P Y +YS + + +SGY + D++ +
Sbjct: 126 PGASSSAVKLACSDKRCFSDLHKKSGCSPLEYKVEYS-DGSFTSGYYISDLISFETVMSS 184
Query: 223 APQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
+ + GC G L + G++GLG G + V S L+ L FS+C
Sbjct: 185 NLTVKSSAPFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCL 244
Query: 282 D--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
+ G + G+ +T + P+ Y V ++++ + +
Sbjct: 245 SGGQEGGGVIILGEN---RLPNTVYTPLVRSQTHYNVNLKTFAVND 287
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 116/278 (41%), Gaps = 33/278 (11%)
Query: 102 FGNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCA-PLSASYYTSLDRN 157
FGN + +Y+ + IG P F + +D GS+L WV C C C PL Y N
Sbjct: 58 FGNVYPLGYYSVSLYIGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLHHLYKPR--NN 115
Query: 158 LSEYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDD 212
L +SC PLC + + C+S D C Y Y+ E SS G LV D
Sbjct: 116 L-------------LSCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEG-SSLGVLVTD 161
Query: 213 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAG 271
L + S ++ + GCG Q P GV+GLG G S+ S L G
Sbjct: 162 YFPLRLMNG----SFLRPKMTFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALG 217
Query: 272 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK-YDAYFV-GVESYCIGNSCLT 329
++ N C G +FFG Q P S+ P+ +K D Y+ G G
Sbjct: 218 VMGNVIGHCLSRKGGGFLFFG-QDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTG 276
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 367
+ + DSG+S+T+ ++Y + K +S K +
Sbjct: 277 TKAEEFIFDSGSSYTYFNAQVYQSTLNLIRKELSGKPL 314
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 146/364 (40%), Gaps = 53/364 (14%)
Query: 55 KKNSVEYLELLLSNDWK----RQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
K S +++E+L + + K KL +++ S P++ T GN +
Sbjct: 78 KATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGN-----Y 132
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ +GTP + D GS+L W CQ C++ T D+ ++PS S+S
Sbjct: 133 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR---------TCYDQKEPIFNPSKSTSY 183
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
NVSCS C S SS C Y Y + + S G+L + L
Sbjct: 184 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFLAKEKFTLT------- 235
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
S V V GCG G + A G++GLG +S PS A A FS C +
Sbjct: 236 NSDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSS 290
Query: 285 DS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVGVESYCIGNSCLTQSG 332
S G + FG G +S F PI D A VG + I ++ + G
Sbjct: 291 ASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPG 348
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
AL+DSG T LP + YA + F +S + + C++ S + + +P +
Sbjct: 349 --ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVA 406
Query: 393 LIFS 396
FS
Sbjct: 407 FSFS 410
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/338 (24%), Positives = 147/338 (43%), Gaps = 41/338 (12%)
Query: 70 WKRQKTRVKLQSNNNSSRNQ-LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALD 128
W+R++ + + + ++S + ++ P +G + + N FY + + +G P + + D
Sbjct: 22 WERKRPILSVPTASSSFASSSIVLPLQG---NVYPNGFYNVT---LYVGQPPKPYFLDPD 75
Query: 129 AGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCK 186
GS+L W+ C C QC P S+ V C PLC S S
Sbjct: 76 TGSDLTWLQCDAPCQQCT--------------ETLHPLYQPSNDLVPCKDPLCMSLHSSM 121
Query: 187 SLK----DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
+ D C Y +Y+ + SS G LV D+ L + + P ++ + +GCG Q
Sbjct: 122 DHRCENPDQCDYEVEYA-DGGSSLGVLVRDVFPL-NLTNGDP---IRPRLALGCGYDQDP 176
Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQST 302
DG++GLG G VS+ S L G+++N CF+ G +FFGD G
Sbjct: 177 GSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGD-GIYDPYRL 235
Query: 303 SFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL---VDSGASFTFLPTEIYAEVVVKFD 359
+ P+ Y ++ I N +G + L DSG+S+T+ + Y + +
Sbjct: 236 VWTPMSRDYPKHYSPGFGELIFNG--RSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLN 293
Query: 360 KLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ ++ K R ++ ++ C+ + + + D+R F
Sbjct: 294 RELAGKPLREAMDDDTLPLCWRG-RKPIKSLRDVRKYF 330
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 93/341 (27%), Positives = 140/341 (41%), Gaps = 52/341 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I +GTP V L+ALD S+L W+ CQ C +C P S +DP S+S +
Sbjct: 138 IAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPV----------FDPRHSTSYGEM 187
Query: 173 SCSHPLCKS--RSSCKSLK-DPCPYIADYSTEDTSSS---GYLVDDILHLASFSKHAPQS 226
+ P C++ RS K C Y Y S+S G LV++ L A +
Sbjct: 188 NYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVR----- 242
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
Q+ + IGCG G L GA G++GLG G +S+P +A G SFS C + S
Sbjct: 243 --QAYLSIGCGHDNKG--LFGAPAAGILGLGRGQISIPHQIAFLGY-NASFSYCLVDFIS 297
Query: 287 G------SVFFGDQGPATQQSTSFLP------IGEKYDAYFVGVESYCIGNSCLTQSGFQ 334
G ++ FG T SF P + Y +GV + +T+ Q
Sbjct: 298 GPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQ 357
Query: 335 ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY--CYNASS 382
++DSG + T L Y F +S ++S G S + CY
Sbjct: 358 LDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGG 417
Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
+KVP + + F+ ++ + P + G CF++
Sbjct: 418 RAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGT-VCFAF 457
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/279 (25%), Positives = 123/279 (44%), Gaps = 31/279 (11%)
Query: 80 QSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC 138
+ ++ +N LFP GN F L+YT I +G+P + + +D GS+ WV C
Sbjct: 134 RGGDDWPQNSTLFPHS-----LAGNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQC 188
Query: 139 QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADY 198
CA + + Y P+ ++ + + S PLC+ + C Y Y
Sbjct: 189 DAPPCASCAKGAHPL-------YRPARTADA--LPASDPLCEGAQHENPNQ--CDYEISY 237
Query: 199 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLG 257
+ +S Y+ D + + + + ++ GCG Q G L+ DGV+GL
Sbjct: 238 ADGSSSMGVYVRDSMQFVGEDGERE-----NADIVFGCGYDQQGVLLNALETTDGVLGLT 292
Query: 258 LGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPI--GEKYD 312
+S+P+ LA G+I N+F C + SG+ +F GD + +++PI G D
Sbjct: 293 NKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDY-IPRWGMTWVPIRDGPADD 351
Query: 313 AYFVGVESYCIGNSCLTQSG--FQALVDSGASFTFLPTE 349
V+ G+ L G Q + D+G+++T+ P E
Sbjct: 352 VRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPDE 390
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 144/337 (42%), Gaps = 46/337 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS + +VPC C C ++ P
Sbjct: 92 YYTARLWI--GTPPQRFALIVDTGSTVTYVPCSTCRHCG----------SHQDPKFRPED 139
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
S + + V C+ + +C + + C Y Y+ E ++SSG L +D++ + ++ +PQ
Sbjct: 140 SETYQPVKCTW-----QCNCDNDRKQCTYERRYA-EMSTSSGALGEDVVSFGNQTELSPQ 193
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---D 282
+ I GC +TG + A DG+MGLG GD+S+ L + +I +SFS+C+
Sbjct: 194 RA-----IFGCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMG 247
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
V G PA T P+ Y Y + ++ + N + +
Sbjct: 248 VGGGAMVLGGISPPADMVFTRSDPVRSPY--YNIDLKEIHVAGKRLHLNPKVFDGKHGTV 305
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY---CYNASSEEMLKV---- 388
+DSG ++ +LP + K S KRIS G +Y C++ + ++ ++
Sbjct: 306 LDSGTTYAYLPESAFLAFKHAIMKETHSLKRIS--GPDPRYNDICFSGAEIDVSQISKSF 363
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFT 425
P + ++F + + F ++V C F+
Sbjct: 364 PVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFS 400
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/307 (25%), Positives = 129/307 (42%), Gaps = 30/307 (9%)
Query: 117 GTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
G F V +D GS++LWV C P S + L L+ +D SS++ + CS
Sbjct: 75 GXXXXXFNVQIDTGSDILWVNCNTCSNCPQS----SQLGIELNFFDTVGSSTAALIPCSD 130
Query: 177 PLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
+C S + C + C Y Y + + +SGY V D ++ P + ++
Sbjct: 131 LICTSGVQGAAAECSPRVNQCSYTFQYG-DGSGTSGYYVSDAMYFNLIMGQPPAVNSTAT 189
Query: 232 VIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGS 288
++ GC Q+G A DG+ G G G +SV S L+ G+ FS C D N G
Sbjct: 190 IVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGI 249
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------TQSGFQALVDS 339
+ G+ + S + P+ Y + ++S + L + + +VD
Sbjct: 250 LVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIVDC 306
Query: 340 GASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
G + +L E Y +V + V S+++ + +GN CY S+ P + L F
Sbjct: 307 GTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ---CYLVSTSIGDIFPLVSLNFEG 363
Query: 398 NQSFVVR 404
S V++
Sbjct: 364 GASMVLK 370
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 146/340 (42%), Gaps = 50/340 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP + L+ LD GS+++W +QCAP Y S +DP S S
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVW-----LQCAPCRHCYAQS----GRVFDPRRSRSY 172
Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
V C P+C+ S C ++ C Y Y + + ++G + L A ++
Sbjct: 173 AAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFARGAR------ 225
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------ 281
VQ V IGCG G ++ A G++GLG G +S PS +A++ SFS C
Sbjct: 226 VQ-RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSS 279
Query: 282 ---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS---CLTQSG 332
S +V FG A SF P+G Y+V + + +G + ++QS
Sbjct: 280 VRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSD 339
Query: 333 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASS 382
+ ++DSG S T L +Y V F R+S G S + CYN S
Sbjct: 340 LRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSG 399
Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFS 422
++KVP + + + S + + P + G CF+
Sbjct: 400 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTF-CFA 438
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 146/340 (42%), Gaps = 50/340 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP + L+ LD GS+++W +QCAP Y S +DP S S
Sbjct: 128 YFAQVGVGTPATTALMVLDTGSDVVW-----LQCAPCRHCYAQS----GRVFDPRRSRSY 178
Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
V C P+C+ S C ++ C Y Y + + ++G + L A ++
Sbjct: 179 AAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFARGAR------ 231
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------ 281
VQ V IGCG G ++ A G++GLG G +S PS +A++ SFS C
Sbjct: 232 VQ-RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSS 285
Query: 282 ---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS---CLTQSG 332
S +V FG A SF P+G Y+V + + +G + ++QS
Sbjct: 286 VRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSD 345
Query: 333 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASS 382
+ ++DSG S T L +Y V F R+S G S + CYN S
Sbjct: 346 LRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSG 405
Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFS 422
++KVP + + + S + + P + G CF+
Sbjct: 406 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTF-CFA 444
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 91/189 (48%), Gaps = 27/189 (14%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D+GS + +VPC C QC ++ P
Sbjct: 92 YYTTRLWI--GTPPQMFALIVDSGSTVTYVPCSDCEQCG----------KHQDPKFQPEM 139
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
SS+ + V C+ +C ++ C Y +Y+ E +SS G L +D++ + S+ PQ
Sbjct: 140 SSTYQPVKCNM-----DCNCDDDREQCVYEREYA-EHSSSKGVLGEDLISFGNESQLTPQ 193
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V GC +TG A DG++GLG GD+S+ L GLI NSF +C+ D
Sbjct: 194 RAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMD 247
Query: 286 --SGSVFFG 292
GS+ G
Sbjct: 248 VGGGSMILG 256
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 150/382 (39%), Gaps = 86/382 (22%)
Query: 70 WKRQKTRVKLQSNNNSSRNQLLFPS-----EGSQTH------------FFGNQFYWLHYT 112
+++Q+ R KL N+N + N P EG +H G+ Y++ +
Sbjct: 11 FRKQRGRHKLSDNDNGAHNSANPPVITAVIEGPPSHDHDFQSPVVSGSTLGSGQYFVDFF 70
Query: 113 WIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
+GTP F + +D+GS+LLWV C C+QC ++ Y PS+SS+
Sbjct: 71 ---LGTPPQKFSLIVDSGSDLLWVQCAPCLQC----------YAQDTPLYAPSNSSTFNP 117
Query: 172 VSCSHPLCKSRSSCKSLKDPCPY------IADYSTEDTS-SSGYL------VDDILHLAS 218
V C P C + + PC + +Y DTS S G VDD+
Sbjct: 118 VPCLSPECLLIPATEGF--PCDFHYPGACAYEYRYADTSLSKGVFAYESATVDDVR---- 171
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
V GCGR GS+ AA GV+GLG G +S S + A N F+
Sbjct: 172 ----------IDKVAFGCGRDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFA 216
Query: 279 ICF-DENDSGSV----FFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCLTQ 330
C + D SV FGD+ +T F PI Y+V +E +G L
Sbjct: 217 YCLVNYLDPTSVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPI 276
Query: 331 S----------GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-ISLQGNSWKYCYN 379
S ++ DSG + T+ Y ++ FDK V R S+QG C +
Sbjct: 277 SHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAASVQG--LDLCVD 334
Query: 380 ASSEEMLKVPDMRLIFSKNQSF 401
+ + P ++ F
Sbjct: 335 VTGVDQPSFPSFTIVLGGGAVF 356
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 106/242 (43%), Gaps = 22/242 (9%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I+IG P + + +D GS+L W+ C AP S T P S+ V
Sbjct: 89 INIGYPPRPYFLDIDTGSDLTWLQCD----APCSRCSQTP--------HPLYRPSNDLVP 136
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSVQS 230
C HPLC S + + + DY E SS G LV+D+ ++ +F+ ++
Sbjct: 137 CRHPLCASVHQTDNYECEVEHQCDYEVEYADHYSSLGVLVNDV-YVLNFTNGV---QLKV 192
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF 290
+ +GCG Q DG++GLG G S+ S L GL++N C G +F
Sbjct: 193 RMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCLSAQGGGYIF 252
Query: 291 FGDQGPATQQSTSFLPIGEK-YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTE 349
FGD +++ ++ P+ + Y Y G +G A+ D+G+S+T+ +
Sbjct: 253 FGDVYDSSR--LAWTPMSSRDYKHYSAGAAELVLGGKRTGFGNLLAVFDAGSSYTYFNSN 310
Query: 350 IY 351
Y
Sbjct: 311 AY 312
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 104/399 (26%), Positives = 170/399 (42%), Gaps = 62/399 (15%)
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSF 123
L+ ++ + Q ++K+++ +S+ Q + ++ T G + L+Y +++G N+S
Sbjct: 91 LVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTS--GIKLESLNYIVTVELGGKNMSL 148
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
+V D GS+L WV QC P + Y ++ YDPS SSS K V C+ C+
Sbjct: 149 IV--DTGSDLTWV-----QCQPCRSCY----NQQGPLYDPSVSSSYKTVFCNSSTCQDLV 197
Query: 184 SCKS-----------LKDPCPYI-----ADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+ S +K PC Y+ Y+ D +S L+ D L +F
Sbjct: 198 AATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENF-------- 248
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDEN 284
+ GCGR G + + G+ VS+ S K FS C ++
Sbjct: 249 -----VFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKT--FNGVFSYCLPSLEDG 298
Query: 285 DSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFVGVESYCIGNSCLTQSGFQA--LV 337
SGS+ FG+ ST S+ P+ + Y + + IG L S F L+
Sbjct: 299 ASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILI 358
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
DSG T LP IY V ++F K S + + C+N +S E + +P +++IF
Sbjct: 359 DSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQG 418
Query: 398 NQSFVVR-NHIFSFPENEVGDHACFSYFTLEY-NFTGIL 434
N V +F F + + C + +L Y N GI+
Sbjct: 419 NAELEVDVTGVFYFVKPD-ASLVCLALASLSYENEVGII 456
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 125/297 (42%), Gaps = 34/297 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+L WV QC P + Y ++ +DPS SS+ V+
Sbjct: 153 VGLGTPAKQYAVIFDTGSDLSWV-----QCKPCADCY----EQQDPLFDPSLSSTYAAVA 203
Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C P C+ S C S C Y Y + + + G LV D L L++ S
Sbjct: 204 CGAPECQELDASGCSS-DSRCRYEVQYG-DQSQTDGNLVRDTLTLSA-------SDTLPG 254
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF- 290
+ GCG + G + DG+ GLG VS+PS A + F+ C + SG +
Sbjct: 255 FVFGCGDQNAGLF---GQVDGLFGLGREKVSLPSQGAPS--YGPGFTYCLPSSSSGRGYL 309
Query: 291 -FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQSGFQALVDSGASF 343
G PA Q T+ L G Y++ + +G + + ++DSG
Sbjct: 310 SLGGAPPANAQFTA-LADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVI 368
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
T LP YA + F + ++ + + + CY+ + ++P + L F+ +
Sbjct: 369 TRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGAT 425
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 104/399 (26%), Positives = 170/399 (42%), Gaps = 62/399 (15%)
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSF 123
L+ ++ + Q ++K+++ +S+ Q + ++ T G + L+Y +++G N+S
Sbjct: 91 LVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTS--GIKLESLNYIVTVELGGKNMSL 148
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
+V D GS+L WV QC P + Y ++ YDPS SSS K V C+ C+
Sbjct: 149 IV--DTGSDLTWV-----QCQPCRSCY----NQQGPLYDPSVSSSYKTVFCNSSTCQDLV 197
Query: 184 SCKS-----------LKDPCPYI-----ADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+ S +K PC Y+ Y+ D +S L+ D L +F
Sbjct: 198 AATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENF-------- 248
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDEN 284
+ GCGR G + + G+ VS+ S K FS C ++
Sbjct: 249 -----VFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKT--FNGVFSYCLPSLEDG 298
Query: 285 DSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFVGVESYCIGNSCLTQSGFQA--LV 337
SGS+ FG+ ST S+ P+ + Y + + IG L S F L+
Sbjct: 299 ASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILI 358
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
DSG T LP IY V ++F K S + + C+N +S E + +P +++IF
Sbjct: 359 DSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQG 418
Query: 398 NQSFVVR-NHIFSFPENEVGDHACFSYFTLEY-NFTGIL 434
N V +F F + + C + +L Y N GI+
Sbjct: 419 NAELEVDVTGVFYFVKPD-ASLVCLALASLSYENEVGII 456
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 104/399 (26%), Positives = 170/399 (42%), Gaps = 62/399 (15%)
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSF 123
L+ ++ + Q ++K+++ +S+ Q + ++ T G + L+Y +++G N+S
Sbjct: 43 LVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTS--GIKLESLNYIVTVELGGKNMSL 100
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
+V D GS+L WV QC P + Y ++ YDPS SSS K V C+ C+
Sbjct: 101 IV--DTGSDLTWV-----QCQPCRSCY----NQQGPLYDPSVSSSYKTVFCNSSTCQDLV 149
Query: 184 SCKS-----------LKDPCPYI-----ADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+ S +K PC Y+ Y+ D +S L+ D L +F
Sbjct: 150 AATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENF-------- 200
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDEN 284
+ GCGR G + + G+ VS+ S K FS C ++
Sbjct: 201 -----VFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKT--FNGVFSYCLPSLEDG 250
Query: 285 DSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFVGVESYCIGNSCLTQSGFQA--LV 337
SGS+ FG+ ST S+ P+ + Y + + IG L S F L+
Sbjct: 251 ASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILI 310
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
DSG T LP IY V ++F K S + + C+N +S E + +P +++IF
Sbjct: 311 DSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQG 370
Query: 398 NQSFVVR-NHIFSFPENEVGDHACFSYFTLEY-NFTGIL 434
N V +F F + + C + +L Y N GI+
Sbjct: 371 NAELEVDVTGVFYFVKPD-ASLVCLALASLSYENEVGII 408
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 129/305 (42%), Gaps = 40/305 (13%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +GTP + V D GS+ WV QC P Y ++ +DP+ SS+ N+S
Sbjct: 190 IGLGTPAGRYTVVFDTGSDTTWV-----QCEPCVVVCYEQQEK---LFDPARSSTDANIS 241
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 242 CAAPACSDLYTKGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAIKGFR 293
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVFFG 292
GCG + G + + A G++GLG G S+P K G + F+ CF SG+ +
Sbjct: 294 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQAYDKYGGV---FAHCFPARSSGTGYL- 346
Query: 293 DQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGN-------SCLTQSGFQALVDSG 340
D GP + + T+ + + Y+VG+ +G S T +G +VDSG
Sbjct: 347 DFGPGSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTAG--TIVDSG 404
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
T LP Y+ + F ++++ + + + CY+ + + +P + L+F
Sbjct: 405 TVITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGG 464
Query: 399 QSFVV 403
S V
Sbjct: 465 ASLDV 469
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 125/297 (42%), Gaps = 34/297 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+L WV QC P + Y ++ +DPS SS+ V+
Sbjct: 153 VGLGTPAKQYAVIFDTGSDLSWV-----QCKPCADCY----EQQDPLFDPSLSSTYAAVA 203
Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C P C+ S C S C Y Y + + + G LV D L L++ S
Sbjct: 204 CGAPECQELDASGCSS-DSRCRYEVQYG-DQSQTDGNLVRDTLTLSA-------SDTLPG 254
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF- 290
+ GCG + G + DG+ GLG VS+PS A + F+ C + SG +
Sbjct: 255 FVFGCGDQNAGLF---GQVDGLFGLGREKVSLPSQGAPS--YGPGFTYCLPSSSSGRGYL 309
Query: 291 -FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQSGFQALVDSGASF 343
G PA Q T+ L G Y++ + +G + + ++DSG
Sbjct: 310 SLGGAPPANAQFTA-LADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVI 368
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
T LP YA + F + ++ + + + CY+ + ++P + L F+ +
Sbjct: 369 TRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGAT 425
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 85/331 (25%), Positives = 139/331 (41%), Gaps = 52/331 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP F + +D GS + +VPC C C + P SS+
Sbjct: 94 IGTPPQEFALIVDTGSTVTYVPCSDCEHCG----------KHQDPRFQPDESSTY----- 138
Query: 175 SHPL-CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
HP+ C +C C Y Y+ E +SSSG L +DI+ + S+ PQ +V
Sbjct: 139 -HPVKCNMDCNCDHDGVNCVYERRYA-EMSSSSGVLGEDIISFGNQSEVVPQRAV----- 191
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFF 291
GC +TG A DG+MGLG G +S+ L +I +SFS+C+ G++
Sbjct: 192 FGCENVETGDLYSQRA-DGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAMVL 250
Query: 292 GDQGPATQQSTSFLPIGEKYDAYFVGV---ESYCIGNSC-LTQSGFQ----ALVDSGASF 343
G P S + Y + + + E + G L+ S F ++DSG ++
Sbjct: 251 GGIPPPPDMVFSR---SDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTTY 307
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKY---CYNASSEEMLKV----PDMRL 393
+LP E + V F + K +L+ G Y C++ + ++ ++ P++ +
Sbjct: 308 AYLPEEAF----VAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDM 363
Query: 394 IFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
+FS Q + + F +V C F
Sbjct: 364 VFSNGQKLSLTPENYLFQHTKVHGAYCLGIF 394
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 87/339 (25%), Positives = 144/339 (42%), Gaps = 43/339 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ G+P ++ +++D GS++ W IQC P S Y D +DP+ S++ V
Sbjct: 165 VGFGSPAQNYTLSIDTGSDVSW-----IQCLPCSGHCYKQHD---PVFDPTKSATYSAVP 216
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C HP C + S C Y Y + +S++G L + L L+S ++ P
Sbjct: 217 CGHPQCAAAGGKCSNSGTCLYKVTYG-DGSSTAGVLSHETLSLSS-TRDLP------GFA 268
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSVFF 291
GCG+ G + ++GLG G +S+PS A +FS C D+ G +
Sbjct: 269 FGCGQTNLGEFGGVDG---LVGLGRGALSLPS--QAAATFGATFSYCLPSYDTTHGYLTM 323
Query: 292 GDQGPATQ------QSTSFLPIGEKYDAYFVGVESYCIGNSCL-------TQSGFQALVD 338
G PA Q T+ + + YFV V S IG L T+ G L D
Sbjct: 324 GSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDG--TLFD 381
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
SG T+LP E YA + +F ++ + + + + CY+ + + +P + FS
Sbjct: 382 SGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFSDG 441
Query: 399 QSFVVRN-HIFSFPENEVGDHACFSYF----TLEYNFTG 432
F + I +P++ C ++ T+ +N G
Sbjct: 442 AVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIG 480
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 91/340 (26%), Positives = 146/340 (42%), Gaps = 50/340 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP + L+ LD GS+++W +QCAP Y S +DP S S
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVW-----LQCAPCRHCYAQS----GRVFDPRRSRSY 172
Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
V C P+C+ S C ++ C Y Y + + ++G + L A ++
Sbjct: 173 AAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFARGAR------ 225
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------ 281
VQ V IGCG G ++ A G++GLG G +S P+ +A++ SFS C
Sbjct: 226 VQ-RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPTQIARS--FGRSFSYCLVDRTSS 279
Query: 282 ---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS---CLTQSG 332
S +V FG A SF P+G Y+V + + +G + ++QS
Sbjct: 280 VRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSD 339
Query: 333 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASS 382
+ ++DSG S T L +Y V F R+S G S + CYN S
Sbjct: 340 LRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSG 399
Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFS 422
++KVP + + + S + + P + G CF+
Sbjct: 400 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTF-CFA 438
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 96/395 (24%), Positives = 166/395 (42%), Gaps = 70/395 (17%)
Query: 22 VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRV-KLQ 80
+ F++ L+HR S ++ P N +E L N R RV
Sbjct: 29 LGFTADLIHRDSPKS-----------------PFYNPMETSSQRLRNAIHRSVNRVFHFT 71
Query: 81 SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
+N+ + Q+ S + Y ++ + IGTP + D GS+LLW
Sbjct: 72 EKDNTPQPQIDLTSNSGE--------YLMN---VSIGTPPFPIMAIADTGSDLLWT---- 116
Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIAD 197
QCAP YT +D +DP +SS+ K+VSCS C ++++SC + + C Y
Sbjct: 117 -QCAPCD-DCYTQVD---PLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLS 171
Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
Y +++ + G + D L L S Q ++IIGCG G++ +
Sbjct: 172 YG-DNSYTKGNIAVDTLTLGSSDTRPMQ---LKNIIIGCGHNNAGTF------NKKGSGI 221
Query: 258 LGDVSVP-SLLAKAG-LIQNSFSICF-----DENDSGSVFFGDQGPATQQ---STSFLPI 307
+G P SL+ + G I FS C ++ + + FG + ST +
Sbjct: 222 VGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAK 281
Query: 308 GEKYDAYFVGVESYCIGNSCL-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
+ Y++ ++S +G+ + S ++DSG + T LPTE Y+E+
Sbjct: 282 ASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVAS 341
Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ +++ + CY+A+ + LKVP + + F
Sbjct: 342 SIDAEKKQDPQSGLSLCYSATGD--LKVPVITMHF 374
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 96/395 (24%), Positives = 166/395 (42%), Gaps = 70/395 (17%)
Query: 22 VSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRV-KLQ 80
+ F++ L+HR S ++ P N +E L N R RV
Sbjct: 29 LGFTADLIHRDSPKS-----------------PFYNPMETSSQRLRNAIHRSVNRVFHFT 71
Query: 81 SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
+N+ + Q+ S + Y ++ + IGTP + D GS+LLW
Sbjct: 72 EKDNTPQPQIDLTSNSGE--------YLMN---VSIGTPPFPIMAIADTGSDLLWT---- 116
Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIAD 197
QCAP YT +D +DP +SS+ K+VSCS C ++++SC + + C Y
Sbjct: 117 -QCAPCD-DCYTQVD---PLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLS 171
Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
Y +++ + G + D L L S Q ++IIGCG G++ +
Sbjct: 172 YG-DNSYTKGNIAVDTLTLGSSDTRPMQ---LKNIIIGCGHNNAGTF------NKKGSGI 221
Query: 258 LGDVSVP-SLLAKAG-LIQNSFSICF-----DENDSGSVFFGDQGPATQQ---STSFLPI 307
+G P SL+ + G I FS C ++ + + FG + ST +
Sbjct: 222 VGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAK 281
Query: 308 GEKYDAYFVGVESYCIGNSCL-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
+ Y++ ++S +G+ + S ++DSG + T LPTE Y+E+
Sbjct: 282 ASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVAS 341
Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ +++ + CY+A+ + LKVP + + F
Sbjct: 342 SIDAEKKQDPQSGLSLCYSATGD--LKVPVITMHF 374
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 132/303 (43%), Gaps = 43/303 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I IGTP V+ + D GS+L W C C +C +++ ++P SSS + V
Sbjct: 94 IFIGTPPVNVIAIADTGSDLTWTQCLPCREC----------FNQSQPIFNPRRSSSYRKV 143
Query: 173 SCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQ 229
SC+ C+S S C C Y YS D S + G L D + + SF P++
Sbjct: 144 SCASDTCRSLESYHCGPDLQSCSY--GYSYGDRSFTYGDLASDQITIGSFK--LPKT--- 196
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
+IGCG + G++ G + G V + AG ++ FS C + N
Sbjct: 197 ---VIGCGHQNGGTF-GGVTSGIIGLGGGSLSLVSQMRTIAG-VKPRFSYCLPTFFSNAN 251
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGN---------SCLTQSGF 333
+G++ FG + + + P+ + YF+ +E+ +G S +T G
Sbjct: 252 ITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHG- 310
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG + T LP +Y V +++ +KR+ + CY+A + L +P +
Sbjct: 311 NIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITA 370
Query: 394 IFS 396
F+
Sbjct: 371 HFA 373
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 127/296 (42%), Gaps = 38/296 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +GTP + V D GS+ WV QC P Y ++ +DP+ SS+ NVS
Sbjct: 186 IGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYKQQEK---LFDPARSSTYANVS 237
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 238 CAAPACSDLYTRGCSGGHCLYSVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 289
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
GCG + G + + A G++GLG G S+P K G + F+ C SG+ +
Sbjct: 290 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSSGTGYLD 343
Query: 291 FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QSGFQ---ALVDSGA 341
FG PA + P+ G + Y+VG+ +G L+ QS F +VDSG
Sbjct: 344 FGPGSPAAVGARQTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQSVFSTAGTIVDSGT 401
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T LP Y+ + F ++++ + + + CY+ + + +P + L+F
Sbjct: 402 VITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLF 457
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 112/265 (42%), Gaps = 34/265 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
I+IG P + + LD GS+L W+ C C+ C L P S+
Sbjct: 61 INIGQPPRPYYLDLDTGSDLTWLQCDAPCVHC--------------LEAPHPLYQPSNDL 106
Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSV 228
+ C+ PLCK+ + + P DY E SS G LV D+ L + +
Sbjct: 107 IPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDVFSL----NYTKGLRL 162
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
+ +GCG Q DGV+GLG G VS+ S L G ++N C G
Sbjct: 163 TPRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCLSSLGGGI 222
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---TQSGFQALV---DSGAS 342
+FFG+ + + S+ P+ + ++ S +G L +G + L+ DSG+S
Sbjct: 223 LFFGNDLYDSSR-VSWTPMARENSKHY----SPAMGGELLFGGRTTGLKNLLTVFDSGSS 277
Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRI 367
+T+ ++ Y V + +S K +
Sbjct: 278 YTYFNSKAYQAVTYLLKRELSGKPL 302
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 105/416 (25%), Positives = 177/416 (42%), Gaps = 74/416 (17%)
Query: 5 VAICML----FGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVE 60
+AI +L FGCI + V F+ L+HR S + P NS E
Sbjct: 12 LAIALLCVSGFGCIY---ARKVGFTVDLIHRDSPLS-----------------PFYNSEE 51
Query: 61 YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPN 120
++N +R +RV ++ + +++ N+ +L + +GTP
Sbjct: 52 TDLQRINNALRRSISRV----HHFDPIAAASVSPKAAESDVTSNRGEYLM--SLSLGTPP 105
Query: 121 VSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
+ D GS+L+W C+ C +C Y +D +DP SS + ++ SC C
Sbjct: 106 FKIMGIADTGSDLIWTQCKPCERC-------YKQVD---PLFDPKSSKTYRDFSCDARQC 155
Query: 180 K--SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
+S+C + C Y YS D S + G + D + L S + +P S ++ +IGC
Sbjct: 156 SLLDQSTCSG--NICQY--QYSYGDRSYTMGNVASDTITLDS-TTGSPVSFPKT--VIGC 208
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFF 291
G + G++ D + G++GLG G +S+ S + + + FS C +S + F
Sbjct: 209 GHENDGTFSDKGS--GIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNSSKLNF 264
Query: 292 GDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGN-------SCLTQSGFQALVDSG 340
G GP Q ST L YF+ +E+ +GN S L ++DSG
Sbjct: 265 GSNAVVSGPGVQ-STPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSG 323
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
+ T +P + ++ + V +R CY+A+S+ LKVP + F+
Sbjct: 324 TTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSD--LKVPAITAHFT 377
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 87/324 (26%), Positives = 143/324 (44%), Gaps = 36/324 (11%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEY-DPSS 165
L+YT+I +G P + + +D GS+L WV C C C + Y N+ + D
Sbjct: 198 LYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSPLYKPRRENVVSFKDSLC 257
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+N C ++C+ C Y Y+ + +SS G LV D L + +
Sbjct: 258 MEVQRNYDGDQ--C---AACQQ----CNYEVQYADQ-SSSLGVLVKDEFTL----RFSNG 303
Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--D 282
S + + I GC Q G L+ + DG++GL VS+PS LA G+I N C D
Sbjct: 304 SLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLTGD 363
Query: 283 ENDSGSVFFGDQGPATQQSTSFL-----PIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 335
G +F GD Q +++ P + Y V ++ I S T S Q
Sbjct: 364 PAGGGYLFLGDDF-VPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQV 422
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ DSG+S+T+ E Y ++V ++ VS+ + LQ +S C+ + + + V D++ F
Sbjct: 423 VFDSGSSYTYFTKEAYYQLVANLEE-VSAFGLILQDSSDTICWK-TEQSIRSVKDVKHFF 480
Query: 396 SK------NQSFVVRNHIFSFPEN 413
++ ++V + PEN
Sbjct: 481 KPLTLQFGSRFWLVSTKLVILPEN 504
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 87/348 (25%), Positives = 147/348 (42%), Gaps = 48/348 (13%)
Query: 103 GNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSE 160
G + L+Y ++IG N++ +V D GS+L WV CQ C C Y D
Sbjct: 59 GVRLQTLNYIVTVEIGGRNMTVIV--DTGSDLTWVQCQPCRLC-------YNQQD---PL 106
Query: 161 YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 213
++PS S S + + C+ C+S C S C Y+ +Y + + + G L +
Sbjct: 107 FNPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYG-DGSYTRGDLGMEQ 165
Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
L+L + H S+ I GCGR G + G+MGLG D+S+ S + +
Sbjct: 166 LNLG--TTHV------SNFIFGCGRNNKGLF---GGASGLMGLGKSDLSLVS--QTSAIF 212
Query: 274 QNSFSICFD---ENDSGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGN 325
+ FS C + SGS+ G + + T + + YF+ + IG
Sbjct: 213 EGVFSYCLPTTAADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGG 272
Query: 326 SCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
L ++ L+DSG T LP +Y ++ +F K S + + C+N +
Sbjct: 273 VALQAPNYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNG 332
Query: 383 EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEVGDHACFSYFTLEYN 429
+ + +P +R+ F N V IF F + + C + +L ++
Sbjct: 333 YDEVDIPTIRMQFEGNAELTVDVTGIFYFVKTDA-SQVCLALASLSFD 379
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 85/331 (25%), Positives = 141/331 (42%), Gaps = 46/331 (13%)
Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
V +D GS+L WV CQ C +C Y D ++PS+S S + V CS P C+S
Sbjct: 148 VIVDTGSDLSWVQCQPCKRC-------YNQQD---PVFNPSTSPSYRTVLCSSPTCQSLQ 197
Query: 184 S-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
S C S C Y+ +Y + + + G L + L L + S+ ++ I GC
Sbjct: 198 SATGNLGVCGSNPPSCNYVVNYG-DGSYTRGELGTEHLDLGN-------STAVNNFIFGC 249
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGD 293
GR G + G++GLG +S+ S + + FS C + SGS+ G
Sbjct: 250 GRNNQGLF---GGASGLVGLGRSSLSLIS--QTSAMFGGVFSYCLPITETEASGSLVMGG 304
Query: 294 QGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA---LVDSGASFTF 345
+ + T +P + YF+ + +G+ + F ++DSG T
Sbjct: 305 NSSVYKNTTPISYTRMIP-NPQLPFYFLNLTGITVGSVAVQAPSFGKDGMMIDSGTVITR 363
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR- 404
LP IY + +F K S + C+N S + +++P++++ F N V
Sbjct: 364 LPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMHFEGNAELNVDV 423
Query: 405 NHIFSFPENEVGDHACFSYFTLEY-NFTGIL 434
+F F + + C + +L Y N GI+
Sbjct: 424 TGVFYFVKTDA-SQVCLAIASLSYENEVGII 453
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 84/307 (27%), Positives = 134/307 (43%), Gaps = 43/307 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+G P V LV +D GS+LLWV C+ C C S +DPS SS+ ++S
Sbjct: 97 VGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPI----------FDPSKSSTYVDLSY 146
Query: 175 SHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
P+C + K + + C Y A Y+ TSS +DI+ F + SSV+
Sbjct: 147 DSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----FETSDQGTVTVSSVV 202
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDENDSGSV 289
GCG G + DG G++GL GD S+ S L + FS C FD + + +
Sbjct: 203 FGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQ 254
Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------TQSGFQALV-DS 339
G + S++ P Y+V +E +G + L T+SG +V DS
Sbjct: 255 LVLGDGVKMEGSST--PFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDS 312
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLK-VPDMRLIFS 396
G + TFL + + + + +LV +++ + CY E L+ P++ F+
Sbjct: 313 GTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFA 372
Query: 397 KNQSFVV 403
+ V+
Sbjct: 373 EGADLVL 379
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 126/307 (41%), Gaps = 36/307 (11%)
Query: 71 KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
K +VKLQ+ SS ++FP G+ + G +Y ++IG P F + +D G
Sbjct: 36 KDSSAQVKLQNRRLSS--TVVFPVSGN-VYPLG-----YYYVLLNIGNPPKLFDLDIDTG 87
Query: 131 SNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RS 183
S+L WV C C C A +Y P+ ++ + CSH LC
Sbjct: 88 SDLTWVQCDAPCNGCTKPRA----------KQYKPNHNT----LPCSHILCSGLDLPQDR 133
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQTG 242
C +D C Y YS + SS G LV D + L K A S + + GCG +Q
Sbjct: 134 PCADPEDQCDYEIGYS-DHASSIGALVTDEVPL----KLANGSIMNLRLTFGCGYDQQNP 188
Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQS 301
G++GLG G V + + L G+ +N C G + GD+ P++ +
Sbjct: 189 GPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVT 248
Query: 302 TSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
+ L Y G + G + DSG+S+T+ E Y ++ K
Sbjct: 249 WTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKD 308
Query: 362 VSSKRIS 368
++ K ++
Sbjct: 309 LNGKPLT 315
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 127/300 (42%), Gaps = 46/300 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+ WV QC P Y ++ +DP+ SS+ N+S
Sbjct: 184 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYEQREK---LFDPARSSTYANIS 235
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 236 CAAPACSDLDTRGCSGGNCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 287
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
GCG + G + + A G++GLG G S+P K G + F+ C SG+ +
Sbjct: 288 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSSGTGYLD 341
Query: 291 FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QSGFQ---ALVDSGA 341
FG PA + P+ G + Y+VG+ +G L+ QS F +VDSG
Sbjct: 342 FGPGSPAAAGARLTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSGT 399
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK------RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T LP Y+ + F ++++ +SL CY+ + + +P + L+F
Sbjct: 400 VITRLPPAAYSSLRSAFASAMAARGYKKAPAVSL----LDTCYDFTGMSQVAIPTVSLLF 455
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 93/339 (27%), Positives = 142/339 (41%), Gaps = 51/339 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+G P V LV +D GS+LLWV C+ C C S +DPS SS+ ++S
Sbjct: 65 VGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPI----------FDPSKSSTYVDLSY 114
Query: 175 SHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
P+C + K + + C Y A Y+ TSS +DI+ F + SSV+
Sbjct: 115 DSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----FETSDQGTVTVSSVV 170
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDEN-DSGS 288
GCG G + DG G++GL GD S+ S L + FS C FD +
Sbjct: 171 FGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQ 222
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------TQSGFQALV-D 338
+ GD + S P Y+V +E +G + L T+SG +V D
Sbjct: 223 LVLGD---GVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLK-VPDMRLIF 395
SG + TFL + + + + +LV +++ + CY E L+ P++ F
Sbjct: 280 SGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHF 339
Query: 396 SKNQSFVV-RNHIFSFPENEVGDHACFSYFTLEYNFTGI 433
++ V+ N +F +V F LE N I
Sbjct: 340 AEGADLVLDANSLFVQKNQDV-----FCLAVLESNLKNI 373
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 88/379 (23%), Positives = 153/379 (40%), Gaps = 57/379 (15%)
Query: 56 KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFF-GNQFYWLHYTWI 114
KN +Y L+ KR + R++ S N +L S G +T + G+ Y ++ +
Sbjct: 53 KNLTKYE--LIKRAIKRGERRMR-------SINAMLQSSSGIETPVYAGDGEYLMN---V 100
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
IGTP+ SF +D GS+L+W C+ C QC + ++P SSS +
Sbjct: 101 AIGTPDSSFSAIMDTGSDLIWTQCEPCTQC----------FSQPTPIFNPQDSSSFSTLP 150
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C C+ S + C Y Y + +++ GY+ + ++S ++
Sbjct: 151 CESQYCQDLPSETCNNNECQYTYGYG-DGSTTQGYMATETFTF--------ETSSVPNIA 201
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VF 290
GCG G A G++G+G G +S+PS L FS C S S +
Sbjct: 202 FGCGEDNQGFGQGNGA--GLIGMGWGPLSLPSQLGVG-----QFSYCMTSYGSSSPSTLA 254
Query: 291 FGDQG---PATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQSGFQ--------ALV 337
G P ST+ + Y++ ++ +G N + S FQ ++
Sbjct: 255 LGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMII 314
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIFS 396
DSG + T+LP + Y V F ++ + + C+ S+ ++VP++ + F
Sbjct: 315 DSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFD 374
Query: 397 KNQSFVVRNHIFSFPENEV 415
+ +I P V
Sbjct: 375 GGVLNLGEQNILISPAEGV 393
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 123/302 (40%), Gaps = 36/302 (11%)
Query: 76 RVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLW 135
+VKLQ N + ++FP G+ + G +Y ++IG P F + +D GS+L W
Sbjct: 42 QVKLQ--NRRLGSSVVFPVSGN-VYPLG-----YYYVLLNIGNPPKLFDLDIDTGSDLTW 93
Query: 136 VPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSL 188
V C C C A +Y P+ ++ + CSH LC C
Sbjct: 94 VQCDAPCNGCTKPRAK----------QYKPNHNT----LPCSHLLCSGLDLTQNRPCDDP 139
Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQTGSYLDG 247
+D C Y YS + SS G LV D L K A S + + GCG +Q
Sbjct: 140 EDQCDYEIGYS-DHASSIGALVTDEFPL----KLANGSIMNPHLTFGCGYDQQNPGPHPP 194
Query: 248 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLP 306
G++GLG G V + + L G+ +N C G + GD+ P++ + + L
Sbjct: 195 PPTAGILGLGRGKVGISTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLA 254
Query: 307 IGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 366
Y G + G + DSG+S+T+ E Y ++ K ++ K
Sbjct: 255 TNSASKNYMTGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKP 314
Query: 367 IS 368
++
Sbjct: 315 LT 316
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 75/266 (28%), Positives = 118/266 (44%), Gaps = 28/266 (10%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAP-LSASYYTSLDRNLSEYDPSS 165
L++T+I +G P + + +D S+L W+ C C CA +A Y D ++ D
Sbjct: 207 LYFTYILVGNPPRPYYLDIDTASDLTWIQCDAPCTSCAKGANALYKPRRDNIVTPKDSLC 266
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+N + C++ C+ C Y +Y+ + +SS G L D LHL A
Sbjct: 267 VELHRNQKAGY--CET---CQQ----CDYEIEYA-DHSSSMGVLARDELHLT----MANG 312
Query: 226 SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--D 282
SS GC Q G L+ DG++GL VS+PS LA G+I N C D
Sbjct: 313 SSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQLANRGIINNVVGHCLAND 372
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSGFQALV--- 337
G +F GD + S++P+ D+Y + G+ L+ G + V
Sbjct: 373 VVGGGYMFLGDDF-VPRWGMSWVPMLDSPSIDSYQTQIMKLNYGSGPLSLGGQERRVRRI 431
Query: 338 --DSGASFTFLPTEIYAEVVVKFDKL 361
DSG+S+T+ E Y+E+V ++
Sbjct: 432 VFDSGSSYTYFTKEAYSELVASLKQV 457
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 93/339 (27%), Positives = 142/339 (41%), Gaps = 51/339 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+G P V LV +D GS+LLWV C+ C C S +DPS SS+ ++S
Sbjct: 65 VGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPI----------FDPSKSSTYVDLSY 114
Query: 175 SHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
P+C + K + + C Y A Y+ TSS +DI+ F + SSV+
Sbjct: 115 DSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----FETSDQGTVTVSSVV 170
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDEN-DSGS 288
GCG G + DG G++GL GD S+ S L + FS C FD +
Sbjct: 171 FGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQ 222
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------TQSGFQALV-D 338
+ GD + S P Y+V +E +G + L T+SG +V D
Sbjct: 223 LVLGD---GVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLK-VPDMRLIF 395
SG + TFL + + + + +LV +++ + CY E L+ P++ F
Sbjct: 280 SGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHF 339
Query: 396 SKNQSFVV-RNHIFSFPENEVGDHACFSYFTLEYNFTGI 433
++ V+ N +F +V F LE N I
Sbjct: 340 AEGADLVLDANSLFVQKNQDV-----FCLAVLESNLKNI 373
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 115/264 (43%), Gaps = 31/264 (11%)
Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
GN + +YT + IG P + + +D GS+L WV C C C ++ RN
Sbjct: 56 GNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGC---------TIPRN-R 105
Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
Y P+ + V C PLCK+ S C + C Y +Y+ + SS G L+ D +
Sbjct: 106 LYKPNGNL----VKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYA-DQGSSLGVLLRDNI 160
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
L K S + + GCG Q + A+ GV+GLG G S+ S L GLI
Sbjct: 161 PL----KFTNGSLARPILAFGCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLI 216
Query: 274 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQS 331
+N C E G +FFGDQ Q + P+ + Y G +
Sbjct: 217 RNVVGHCLSERGGGFLFFGDQ-LVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVK 275
Query: 332 GFQALVDSGASFTFLPTEIYAEVV 355
G Q + DSG+S+T+ ++ + +V
Sbjct: 276 GLQLIFDSGSSYTYFNSKAHKALV 299
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 126/307 (41%), Gaps = 36/307 (11%)
Query: 71 KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
K +VKLQ+ SS ++FP G+ + G +Y ++IG P F + +D G
Sbjct: 36 KDSSAQVKLQNRRLSS--TVVFPVSGN-VYPLG-----YYYVLLNIGNPPKLFDLDIDTG 87
Query: 131 SNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RS 183
S+L WV C C C A +Y P+ ++ + CSH LC
Sbjct: 88 SDLTWVQCDAPCNGCTKPRA----------KQYKPNHNT----LPCSHILCSGLDLPQDR 133
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQTG 242
C +D C Y YS + SS G LV D + L K A S + + GCG +Q
Sbjct: 134 PCADPEDQCDYEIGYS-DHASSIGALVTDEVPL----KLANGSIMNLRLTFGCGYDQQNP 188
Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQS 301
G++GLG G V + + L G+ +N C G + GD+ P++ +
Sbjct: 189 GPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVT 248
Query: 302 TSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
+ L Y G + G + DSG+S+T+ E Y ++ K
Sbjct: 249 WTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKD 308
Query: 362 VSSKRIS 368
++ K ++
Sbjct: 309 LNGKPLT 315
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 127/299 (42%), Gaps = 53/299 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I IG+P ++ L+ +D S+LLW+ C CI C ++L +DPS S + +N
Sbjct: 89 ISIGSPPITQLLHMDTASDLLWIQCLPCINCYA----------QSLPIFDPSRSYTHRNE 138
Query: 173 SC-----SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+C S P K ++ +S C Y Y +DT S G L ++L + + ++
Sbjct: 139 TCRTSQYSMPSLKFNANTRS----CEYSMRY-VDDTGSKGILAREMLLFNTIYDESSSAA 193
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
+ V+ GCG G L G G++GLG G+ S+ K FS CF D
Sbjct: 194 LH-DVVFGCGHDNYGEPLVGT---GILGLGYGEFSLVHRFGK------KFSYCFGSLDDP 243
Query: 288 S-----VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT----------QSG 332
S + GD G T+ L I + Y+V +E+ + L Q+G
Sbjct: 244 SYPHNVLVLGDDGANILGDTTPLEIHNGF--YYVTIEAISVDGIILPIDPRVFNRNHQTG 301
Query: 333 FQA-LVDSGASFTFLPTEIYAEVVVK----FDKLVSSKRISLQGNSWKYCYNASSEEML 386
++D+G S T L E Y + + F+ ++ +S CYN + E L
Sbjct: 302 LGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDL 360
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 136/311 (43%), Gaps = 39/311 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y + +G+P + + LD GS+L W +QC P ++ +D ++PS+S++
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSW-----LQCKPCVVYCHSQVD---PLFEPSASNTY 171
Query: 170 KNVSCSHPLCKSRSSCKSLKDP-------CPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+ + CS C S +L DP C Y A Y + + S GYL D+L L
Sbjct: 172 RPLYCSSSEC-SLLKAATLNDPLCTASGVCVYTASYG-DASYSMGYLSRDLLTLT----- 224
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICF 281
S S GCG+ G + A G++GL +S+ + L+ K G +FS C
Sbjct: 225 --PSQTLPSFTYGCGQDNEGLFGKAA---GIVGLARDKLSMLAQLSPKYGY---AFSYCL 276
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNS--CLTQSGFQA- 335
+ S F G + S F P+ + YF+ + + + + +G+Q
Sbjct: 277 PTSTSSGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVP 336
Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRL 393
++DSG T LP IYA + F K++S + S C+ S + M P++R+
Sbjct: 337 TIIDSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRM 396
Query: 394 IFSKNQSFVVR 404
IF +R
Sbjct: 397 IFQGGADLSLR 407
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 78/311 (25%), Positives = 137/311 (44%), Gaps = 45/311 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP F + +D GS+L W+ C C+ C +++ +DP++S S +NV+C
Sbjct: 155 LGTPPRRFRMIMDTGSDLNWLQCAPCLDC----------FEQSGPIFDPAASISYRNVTC 204
Query: 175 SHPLCK---------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
C+ R + DPCPY Y + ++ L L +F+ + Q
Sbjct: 205 GDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGD------LALEAFTVNLTQ 258
Query: 226 SSVQS--SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
S + V GCG + G + A ++GLG G +S S L + ++FS C E
Sbjct: 259 SGTRRVDGVAFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL-RGVYGGHAFSYCLVE 314
Query: 284 NDSGS---VFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL-----TQS 331
+ S + + FG T+F P + Y++ ++S +G + T S
Sbjct: 315 HGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLS 374
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
++DSG + ++ P Y + F D++ S + L CYN S E ++VP+
Sbjct: 375 AGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPE 434
Query: 391 MRLIFSKNQSF 401
+ L+F+ ++
Sbjct: 435 LSLVFADGAAW 445
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 81/265 (30%), Positives = 110/265 (41%), Gaps = 33/265 (12%)
Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
GN + HYT ++IG P + + +D+GS+L WV C C C Y + NL
Sbjct: 56 GNVYPLGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRDQLYKP-NHNL- 113
Query: 160 EYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
V C LC +C S D C Y +Y+ + SS G LV D +
Sbjct: 114 ------------VQCVDQLCSEVQLSMEYTCASPDDQCDYEVEYA-DHGSSLGVLVRDYI 160
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLI 273
+ S V+ V GCG Q S + A GV+GLG G S+ S L GLI
Sbjct: 161 PF----QFTNGSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLI 216
Query: 274 QNSFSICFDENDSGSVFFGDQGPATQQS--TSFLP-IGEKYDAYFVGVESYCIGNSCLTQ 330
N C G +FFGD + TS LP EK+ Y G
Sbjct: 217 HNVVGHCLSARGGGFLFFGDDFIPSSGIVWTSMLPSSSEKH--YSSGPAELVFNGKATVV 274
Query: 331 SGFQALVDSGASFTFLPTEIYAEVV 355
G + + DSG+S+T+ ++ Y VV
Sbjct: 275 KGLELIFDSGSSYTYFNSQAYQAVV 299
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 137/297 (46%), Gaps = 38/297 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ +GTP + D GSNL+W C+ C C YT +D +DP +SS+ K+V
Sbjct: 98 LSLGTPPSPIMAVADTGSNLIWTQCKPCDDC-------YTQVD---PLFDPKASSTYKDV 147
Query: 173 SCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
SCS C ++++SC + C Y+ Y+ + + + G D L L S Q
Sbjct: 148 SCSSSQCTALENQASCSTEDKTCSYLVSYA-DGSYTMGKFAVDTLTLGSTDNRPVQ---L 203
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICF-DENDSG 287
++IIGCG+ ++ + ++ G+ SL+ + G I FS C END
Sbjct: 204 KNIIIGCGQNNAVTFRNKSS-----GVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQT 258
Query: 288 S-VFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQSGFQA--LVD 338
S + FG GP T + L + + Y++ ++S +G N S + ++D
Sbjct: 259 SKINFGTNAVVSGPGTVSTP--LVVKSRDTFYYLTLKSISVGSKNMQTPDSNIKGNMVID 316
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
SG + T LP + Y E+ L+++ + + CYNA+++ L +P + + F
Sbjct: 317 SGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATAD--LNIPVITMHF 371
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 89/340 (26%), Positives = 148/340 (43%), Gaps = 49/340 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T I +GTP L+ LD GS+++W +QCAP Y +++ +DP S S
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVW-----LQCAPCRRCY----EQSGQVFDPRRSRSY 190
Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
V C+ PLC+ S C + C Y Y + + ++G + L A ++ A
Sbjct: 191 NAVGCAAPLCRRLDSGGCDLRRSACLYQVAYG-DGSVTAGDFATETLTFAGGARVA---- 245
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND-- 285
V +GCG G ++ A ++GLG G +S P+ +++ SFS C +
Sbjct: 246 ---RVALGCGHDNEGLFVAAAG---LLGLGRGSLSFPTQISR--RYGRSFSYCLVDRTSS 297
Query: 286 ------SGSVFFGDQGPATQQSTSFLPIGEKYDA---YFV--------GVESYCIGNSCL 328
S +V FG + ++SF P+ + Y+V G + NS L
Sbjct: 298 ANTASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDL 357
Query: 329 T---QSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSE 383
SG +VDSG S T L Y+ + F + R+S G S + CY+ S
Sbjct: 358 RLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGR 417
Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
+++KVP + + F+ + + P + G CF++
Sbjct: 418 KVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTF-CFAF 456
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 94/397 (23%), Positives = 168/397 (42%), Gaps = 62/397 (15%)
Query: 46 NVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQ 105
N+ +++ SVE + S + T + Q N+ R + + +Q + N
Sbjct: 16 NICFSEALKSGFSVEIIHRDSSRSPFYRATETQFQRVTNAVRRSMNRANHFNQISVYSNA 75
Query: 106 F-----------YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSL 154
Y + Y+ +GTP +D S+++WV CQ + T
Sbjct: 76 VESPVTLLDDGDYLMSYS---LGTPPFPVYGIVDTASDIIWVQCQLCE---------TCY 123
Query: 155 DRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKS-LKDPCPYIADYSTEDTSSSGYLVD 211
+ +DPS S + KN+ CS CKS +SC S + C + +Y + + S G L+
Sbjct: 124 NDTSPMFDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYK-DGSHSQGDLIV 182
Query: 212 DILHLASFSK---HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
+ + L S++ H P++ +IGC R S+ G++GLG G VS+ L+
Sbjct: 183 ETVTLGSYNDPFVHFPRT------VIGCIRNTNVSF----DSIGIVGLGGGPVSLVPQLS 232
Query: 269 KAGLIQNSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIG 324
+ I FS C + S + FGD + T I K Y++ +E++ +G
Sbjct: 233 SS--ISKKFSYCLAPISDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVG 290
Query: 325 NSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 376
N+ + + ++DSG +FT LP ++Y+++ +V +R +
Sbjct: 291 NNRIEFRSSSSRSSGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSL 350
Query: 377 CYNASSEEMLKVPDMRLIFSKN-------QSFVVRNH 406
CY S+ + + VP + FS +F+V +H
Sbjct: 351 CYK-STYDKVDVPVITAHFSGADVKLNALNTFIVASH 386
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 100/400 (25%), Positives = 167/400 (41%), Gaps = 57/400 (14%)
Query: 57 NSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWL------H 110
+ E L L D KR+ +R+ + ++ N G + F L +
Sbjct: 89 TAAELLAHRLRRD-KRRASRISAAAGGAAAANGTRVGGGGGGSGFVAPVVSGLAQGSGEY 147
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
+T I +GTP L+ LD GS+++W +QCAP Y D++ +DP +S S
Sbjct: 148 FTKIGVGTPVTPALMVLDTGSDVVW-----LQCAPCRRCY----DQSGQMFDPRASHSYG 198
Query: 171 NVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
V C+ PLC+ S C + C Y Y + + ++G + L AS ++ P+
Sbjct: 199 AVDCAAPLCRRLDSGGCDLRRKACLYQVAYG-DGSVTAGDFATETLTFASGAR-VPR--- 253
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----- 283
V +GCG G ++ A ++GLG G +S PS +++ SFS C +
Sbjct: 254 ---VALGCGHDNEGLFVAAAG---LLGLGRGSLSFPSQISR--RFGRSFSYCLVDRTSSS 305
Query: 284 ----NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQAL 336
+ S +V FG + SF P+ + Y+V + +G + + L
Sbjct: 306 ASATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDL 365
Query: 337 ------------VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSE 383
VDSG S T L YA + F + R+S G S + CY+ S
Sbjct: 366 RLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGL 425
Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
+++KVP + + F+ + + P + G CF++
Sbjct: 426 KVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTF-CFAF 464
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/335 (23%), Positives = 141/335 (42%), Gaps = 42/335 (12%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y+ WI GTP F + +D GS + +VPC C C ++ P +
Sbjct: 92 YYTTRLWI--GTPPQRFALIVDTGSTVTYVPCSTCKHCG----------SHQDPKFRPEA 139
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
S + + V C+ + +C + C Y Y+ E ++SSG L +D++ + S+ +PQ
Sbjct: 140 SETYQPVKCTW-----QCNCDDDRKQCTYERRYA-EMSTSSGVLGEDVVSFGNQSELSPQ 193
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---D 282
+ I GC +TG + A DG+MGLG GD+S+ L + +I ++FS+C+
Sbjct: 194 RA-----IFGCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMG 247
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQSGFQAL 336
V G PA T P+ Y Y + ++ + N + +
Sbjct: 248 VGGGAMVLGGISPPADMVFTHSDPVRSPY--YNIDLKEIHVAGKRLHLNPKVFDGKHGTV 305
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK-YCYNASSEEMLKV----PD 390
+DSG ++ +LP + K S KRIS + C++ + + ++ P
Sbjct: 306 LDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFPV 365
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFT 425
+ ++F + + F ++V C F+
Sbjct: 366 VEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFS 400
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/276 (28%), Positives = 116/276 (42%), Gaps = 31/276 (11%)
Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
GN + HYT ++IG P + + +D+GS+L WV C C C Y + NL
Sbjct: 56 GNVYPLGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRDQLYKP-NHNL- 113
Query: 160 EYDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
V C LC +C S DPC Y +Y+ + SS G LV D +
Sbjct: 114 ------------VQCVDQLCSEVHLSMAYNCPSPDDPCDYEVEYA-DHGSSLGVLVRDYI 160
Query: 215 HLASFSKHAPQSSVQSSVIIGCG--RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 272
+ S V+ V GCG +K +GS A GV+GLG G S+ S L GL
Sbjct: 161 PF----QFTNGSVVRPRVAFGCGYDQKYSGSN-SPPATSGVLGLGNGRASILSQLHSLGL 215
Query: 273 IQNSFSICFDENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 331
I+N C G +FFGD P++ + + Y G
Sbjct: 216 IRNVVGHCLSAQGGGFLFFGDDFIPSSGIVWTSMLSSSSEKHYSSGPAELVFNGKATAVK 275
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 367
G + + DSG+S+T+ ++ Y VV K + K++
Sbjct: 276 GLELIFDSGSSYTYFNSQAYQAVVDLVTKDLKGKQL 311
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/305 (25%), Positives = 127/305 (41%), Gaps = 37/305 (12%)
Query: 71 KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
K +VKLQ+ SS ++FP G+ + G +Y ++IG P F + +D G
Sbjct: 36 KDSSAQVKLQNRRLSS--TVVFPVSGN-VYPLG-----YYYVLLNIGNPPKLFDLDIDTG 87
Query: 131 SNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RSSC 185
S+L WV C AP + ++Y P+ ++ + CSH LC C
Sbjct: 88 SDLTWVQCD----APCNGC---------TKYKPNHNT----LPCSHILCSGLDLPQDRPC 130
Query: 186 KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG-RKQTGSY 244
+D C Y YS + SS G LV D + L K A S + + GCG +Q
Sbjct: 131 ADPEDQCDYEIGYS-DHASSIGALVTDEVPL----KLANGSIMNLRLTFGCGYDQQNPGP 185
Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ-GPATQQSTS 303
G++GLG G V + + L G+ +N C G + GD+ P++ + +
Sbjct: 186 HPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWT 245
Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
L Y G + G + DSG+S+T+ E Y ++ K ++
Sbjct: 246 SLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLN 305
Query: 364 SKRIS 368
K ++
Sbjct: 306 GKPLT 310
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 99/410 (24%), Positives = 163/410 (39%), Gaps = 51/410 (12%)
Query: 47 VSVADSWPKKNSVEYLELLLSNDWKRQKT-RVKLQSNNNSSRNQLLFPSEGSQTHFFGNQ 105
V++ D P + YL LL+ D R + +++++++ ++ + +E T G +
Sbjct: 123 VAIPDDDPAAHD-RYLRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTS--GIR 179
Query: 106 FYWLHY-TWIDIG-----TPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLS 159
F L+Y T I +G +P + V +D GS+L WV QC P SA Y +
Sbjct: 180 FQTLNYVTTIALGGGSSGSPAANLTVIVDTGSDLTWV-----QCKPCSACYA----QRDP 230
Query: 160 EYDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
+DP+ S++ V C+ C + SC + C Y Y + + S G L
Sbjct: 231 LFDPAGSATYAAVRCNASACAASLKAATGTPGSCGGGNERCYYALAYG-DGSFSRGVLAT 289
Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS--LLAK 269
D + L S + GCG G + A G+MGLG ++S+ S L
Sbjct: 290 DTVALGGAS--------LDGFVFGCGLSNRGLFGGTA---GLMGLGRTELSLVSQTALRY 338
Query: 270 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIG 324
G+ + SGS+ G + + +T D YF+ V +G
Sbjct: 339 GGVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVG 398
Query: 325 NSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNS-WKYCYN 379
+ L G A L+DSG T L +Y V +F + ++ + G S CY+
Sbjct: 399 GTALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYD 458
Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYN 429
+ + +KVP + L V F + G C + +L Y
Sbjct: 459 LTGHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYE 508
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 134/317 (42%), Gaps = 29/317 (9%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+YT + IG P + + +D GS+L W+ C C CA Y N+ P S
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVV---PPRDS 215
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+ + + + C Y Y+ + +SS G L D + L + A
Sbjct: 216 YCQELQGNQNYGDTSKQCD-------YEITYA-DRSSSMGILARDNMQLIT----ADGER 263
Query: 228 VQSSVIIGCGRKQTGSYLDGAA-PDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
+ GCG Q G+ L A DG++GL +S+P+ LA G+I N F C D +
Sbjct: 264 ENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPS 323
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVESYCIGNSCLT---QSG--FQALVD 338
+ G +F GD + +++PI + Y V+ G+ L ++G Q + D
Sbjct: 324 NGGYMFLGDDY-VPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFD 382
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
SG+S+T+LP + Y ++ L S + +C + + + D++ +F K
Sbjct: 383 SGSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPFCMKPNF-PVRSMDDVKHLF-KP 440
Query: 399 QSFVVRNHIFSFPENEV 415
S V + +F P V
Sbjct: 441 LSLVFKKRLFILPRTFV 457
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 119/276 (43%), Gaps = 27/276 (9%)
Query: 119 PNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
P + + D GS+L W+ C C CA + ++Y N+ K++ C
Sbjct: 199 PPQPYYLDFDTGSDLTWIQCDAPCTSCAKGANAWYKPRRGNIV--------PPKDLLCME 250
Query: 177 PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
++ D C Y +Y+ + +SS G L D L L A S + + I GC
Sbjct: 251 VQRNQKAGYCETCDQCDYEIEYA-DHSSSMGVLATDKLLLMV----ANGSLTKLNFIFGC 305
Query: 237 GRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGD 293
Q G L DG++GL VS+PS LA G+I N C D G +F GD
Sbjct: 306 AYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFLGD 365
Query: 294 QGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSGFQA-----LVDSGASFTFL 346
+ +++P+ + Y V G+S L+ G ++ L DSG+S+T+
Sbjct: 366 DF-VPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHILFDSGSSYTYF 424
Query: 347 PTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNAS 381
P E Y+E+V +++ + + S + C+ A+
Sbjct: 425 PKEAYSELVASLNEVSGAGLVQSTSDTTLPLCWRAN 460
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/325 (24%), Positives = 141/325 (43%), Gaps = 51/325 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ + IG+P + +D+GS+++WV C+ C++C Y D +DP+SS++
Sbjct: 125 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLEC-------YAQAD---PLFDPASSAT 174
Query: 169 SKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
VSC +C++ R+S C Y Y + + + G L + L L +
Sbjct: 175 FSAVSCGSAICRTLRTSGCGDSGGCEYEVSYG-DGSYTKGTLALETLTLG--------GT 225
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------ 281
V IGCG + G ++ A G++GLG G +S+ L A +FS C
Sbjct: 226 AVEGVAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGGS 280
Query: 282 ---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC-------- 327
+ +GS+ G + A + ++P+ A Y+VGV +G+
Sbjct: 281 GSGAADAAGSLVLG-RSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLF 339
Query: 328 -LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
LT+ G +V D+G + T LP E YA + F V + + + CY+ S
Sbjct: 340 QLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTS 399
Query: 386 LKVPDMRLIFSKNQSFVV--RNHIF 408
++VP + F + + RN +
Sbjct: 400 VRVPTVSFYFDGAATLTLPARNLLL 424
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 134/317 (42%), Gaps = 29/317 (9%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+YT + IG P + + +D GS+L W+ C C CA Y N+ P S
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVV---PPRDS 215
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+ + + + C Y Y+ + +SS G L D + L + A
Sbjct: 216 YCQELQGNQNYGDTSKQCD-------YEITYA-DRSSSMGILARDNMQLIT----ADGER 263
Query: 228 VQSSVIIGCGRKQTGSYLDGAA-PDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
+ GCG Q G+ L A DG++GL +S+P+ LA G+I N F C D +
Sbjct: 264 ENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPS 323
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVESYCIGNSCLT---QSG--FQALVD 338
+ G +F GD + +++PI + Y V+ G+ L ++G Q + D
Sbjct: 324 NGGYMFLGDDY-VPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFD 382
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
SG+S+T+LP + Y ++ L S + +C + + + D++ +F K
Sbjct: 383 SGSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPFCMKPNF-PVRSMDDVKHLF-KP 440
Query: 399 QSFVVRNHIFSFPENEV 415
S V + +F P V
Sbjct: 441 LSLVFKKRLFILPRTFV 457
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 137/323 (42%), Gaps = 54/323 (16%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC----QCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ +GTP V LD GS+L WVPC QC C S S ++ + P +SSSS
Sbjct: 95 VSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNC-----SSSPSAMSAMAVFHPKNSSSS 149
Query: 170 KNVSCSHPLC-----KSRSSCKSL-----KDPC-PYIADYSTEDTSSSGYLVDDILHLAS 218
+ V C +P C KS S+C S D C PY+ Y + T SG L+ D L L+
Sbjct: 150 RLVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGST--SGLLISDTLRLSP 207
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
S + + + + IGC P G+ G G G SVPS L S
Sbjct: 208 SSSSSAPAPFR-NFAIGCSIVSVHQ-----PPSGLAGFGRGAPSVPSQLKVPKFSYCLLS 261
Query: 279 ICFDEND--SGSVFFGDQG-PATQQSTS--FLPI------GEKYDA-YFVGVESYCIG-- 324
FD+N SG + GD PA ++ T+ ++P+ Y Y++ + +G
Sbjct: 262 RRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGK 321
Query: 325 ------NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK----RISLQGNSW 374
+ + SG A++DSG +FT+L ++ V + V + R
Sbjct: 322 PVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGL 381
Query: 375 KYCY--NASSEEMLKVPDMRLIF 395
+ C+ +++PD+ L F
Sbjct: 382 RPCFALPPGPGGAMELPDLELKF 404
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 90/338 (26%), Positives = 144/338 (42%), Gaps = 53/338 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I +GTP V L+ALD S+L W+ CQ C +C P S +DP S+S + +
Sbjct: 142 IAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPV----------FDPRHSTSYREM 191
Query: 173 SCSHPLCKS--RSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
S + C++ RS K C Y Y + +++ G +++ L A + P+ S
Sbjct: 192 SFNAADCQALGRSGGGDAKRGTCVYTVGYG-DGSTTVGDFIEETLTFAGGVR-LPRIS-- 247
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG-- 287
IGCG G L GA G++GLG G +S P+ + G +FS C + SG
Sbjct: 248 ----IGCGHDNKG--LFGAPAAGILGLGRGLMSFPNQIDHNG----TFSYCLVDFLSGPG 297
Query: 288 ----SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGN---SCLTQSGFQ--- 334
++ FG T SF P + Y+V + +G +T+ Q
Sbjct: 298 SLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDP 357
Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNS--WKYCYNASSEEM 385
+VDSG + T L Y F + V ++S+ G S + CY M
Sbjct: 358 YTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGM 417
Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
KVP + + F+ + ++ + P + +G CF++
Sbjct: 418 KKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGT-VCFAF 454
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 117/297 (39%), Gaps = 43/297 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T I +GTP F V +D GS L WV C+ Y N + S S
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCR----------YRARGKDNRRVFRADESKSF 155
Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSK 221
K V C CK S ++C + PC Y DY D S++ G + + + +
Sbjct: 156 KTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY--DYRYADGSAAQGVFAKETITVGLTNG 213
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
+ +IGC TG GA DGV+GL D S S L FS C
Sbjct: 214 RMARLPGH---LIGCSSSFTGQSFQGA--DGVLGLAFSDFSFTS--TATSLYGAKFSYCL 266
Query: 282 -----DENDSGSVFFGDQGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSCL------ 328
++N S + FG + T+ L + Y + V +G L
Sbjct: 267 VDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQV 326
Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASS 382
SG ++DSG S T L Y +VV + LV KR+ +G +YC++ +S
Sbjct: 327 WDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTS 383
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 86/327 (26%), Positives = 132/327 (40%), Gaps = 54/327 (16%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ GTP L+ D GS+L+W+ C P R + S S++ V
Sbjct: 58 MAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATLSVVP 115
Query: 174 CSHPLC--------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
CS C S + PC Y DY+ + +S++G+L D A+ S
Sbjct: 116 CSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYA-DGSSTTGFLARDT---ATISNGTSG 171
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDEN 284
+ V GCG + G G GV+GLG G +S P A++G L +FS C +
Sbjct: 172 GAAVRGVAFGCGTRNQGGSFSGTG--GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDL 226
Query: 285 DSGS-------VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQ 334
+ G +F G P + + ++ P+ A Y+VGV + +GN L G +
Sbjct: 227 EGGRRGRSSSFLFLGR--PERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSE 284
Query: 335 ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-----LQGNSWKYCYN 379
++DSG++ T+L Y +V F V RI QG + CYN
Sbjct: 285 WAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQG--LELCYN 342
Query: 380 ASSEEMLK-----VPDMRLIFSKNQSF 401
SS L P + + F++ S
Sbjct: 343 VSSSSSLAPANGGFPRLTIDFAQGLSL 369
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 142/358 (39%), Gaps = 45/358 (12%)
Query: 60 EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
E L + R + Q + R+ + G+ F Y +H + GTP
Sbjct: 41 ELLRRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGAYDDGFPFTEYLVH---LAAGTP 97
Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
+ LD GS++ W QC +C P SA + ++ L +DPS+SSS ++ CS P C
Sbjct: 98 PQEVQLTLDTGSDITWT--QCKRC-PASACF----NQTLPLFDPSASSSFASLPCSSPAC 150
Query: 180 KSRSSCKSLKD----PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
++ C D PC Y Y + + S G + ++ AS + ++V ++ G
Sbjct: 151 ETTPPCGGGNDATSRPCNYSISYG-DGSVSRGEIGREVFTFASGTGEGSSAAV-PGLVFG 208
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---NDSGSVFFG 292
CG G + G+ G G G +S+PS L K G +FS CF + + +V G
Sbjct: 209 CGHANRGVFTSNET--GIAGFGRGSLSLPSQL-KVG----NFSHCFTTITGSKTSAVLLG 261
Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYA 352
G A ++ P+G + +Y C + +SG S T LP Y
Sbjct: 262 LPGVAPPSAS---PLGRRRGSY-----------RCRSTPRSS---NSGTSITSLPPRTYR 304
Query: 353 EVVVKFDKLVSSKRISLQGNSWKYCYNAS-SEEMLKVPDMRLIF-SKNQSFVVRNHIF 408
V +F V + C++A VP M L F N++F
Sbjct: 305 AVREEFAAQVKLPVVPGNATDPFTCFSAPLRGPKPDVPTMALHFEGATMRLPQENYVF 362
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 123/287 (42%), Gaps = 30/287 (10%)
Query: 87 RNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAP 145
R + + T F L+ + +G P+ + +A GS+++WVPC C C P
Sbjct: 53 RKRFAAKKQQGVTGFVLEAMPGLYCITVKLGNPSRHYYLAFHTGSDVMWVPCSSCTDC-P 111
Query: 146 LSASYYTSLDRNLSEYDPSSSSSSK-----NVSCSHPLCKSRSSCK---SLKDPCPYIAD 197
SLD YDP +SS+S + C+ L + C S D C Y
Sbjct: 112 TPDDIGFSLDL----YDPKNSSTSSEISCSDDRCADALKTGHAICHTSHSSGDQCGYNQI 167
Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
Y+ +++GY V D +H F + +S +SVI GC + ++G DGV+G G
Sbjct: 168 YADGVLATTGYYVSDDIHFDIFMGNESFASSSASVIFGCSKSRSGHL----QADGVIGFG 223
Query: 258 LGDVSVPSLLAKAGLIQNSFSICFDE-NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV 316
S+ S L G + ++FS C D+ +D G V D+ + F + Y +
Sbjct: 224 KDAPSLISQLNSQG-VSHAFSRCLDDSDDGGGVLILDE--VGEPGLEFTSLVASRPCYNL 280
Query: 317 GVESYCIGN-------SCLTQSGFQA-LVDSGASFTFLPTEIYAEVV 355
++S + N S T S Q +DSG S + P +Y V+
Sbjct: 281 NMKSIAVNNQNVPIDSSLFTTSSTQGTFLDSGTSLAYFPDGVYDPVI 327
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 90/330 (27%), Positives = 139/330 (42%), Gaps = 45/330 (13%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
IGTP F + LD GS+L W+ QC+ C Y +N YDP SSS KN+ C
Sbjct: 198 IGTPPRHFSLILDTGSDLNWI--QCVPC-------YDCFVQNGPYYDPKESSSFKNIGCH 248
Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSV 228
P C SS CK+ CPY Y ++ + ++ ++L S + + V
Sbjct: 249 DPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRV 308
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DE 283
+ +V+ GCG G + A ++GLG G +S S L L +SFS C D
Sbjct: 309 E-NVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDT 362
Query: 284 NDSGSVFFG-DQGPATQQSTSF--LPIGEKYDA---YFVGVESYCIGNSCLT-------- 329
N S + FG D+ +F L G++ Y+V ++S +G L
Sbjct: 363 NVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHL 422
Query: 330 --QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
+ +VDSG + ++ Y + F K V + CYN S E ++
Sbjct: 423 SPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPCYNVSGVEKME 482
Query: 388 VPDMRLIFSKNQ--SFVVRNHIFSFPENEV 415
+P+ R++F +F V N+ E+
Sbjct: 483 LPEFRILFEDGAVWNFPVENYFIKLEPEEI 512
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 112/258 (43%), Gaps = 27/258 (10%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+YT I IG P + + +D GS+L W+ C C A Y +
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEKIV-------- 238
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+++ C L +++ C++ K C Y +Y+ + +SS G L D +H+ + +
Sbjct: 239 PPRDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHMIATNG----GR 291
Query: 228 VQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
+ + GC Q G L A DG++GL +S PS LA G+I N F C ++
Sbjct: 292 EKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQG 351
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYD-AYFVGVESYCIGNSCLTQ-----SGFQALVD 338
G +F GD + ++ I D Y G+ L + S Q + D
Sbjct: 352 GGGYMFLGDDY-VPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410
Query: 339 SGASFTFLPTEIYAEVVV 356
SG+S+T+LP EIY +V
Sbjct: 411 SGSSYTYLPNEIYENLVA 428
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 122/314 (38%), Gaps = 58/314 (18%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T I +GTP F V +D GS L WV C+ Y N + S S
Sbjct: 84 YFTEIRVGTPAKKFRVVVDTGSELTWVNCR----------YRARGKDNRRVFRADESKSF 133
Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDIL------- 214
K V C CK S ++C + PC Y DY D S++ G + +
Sbjct: 134 KTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY--DYRYADGSAAQGVFAKETITVGLTNG 191
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
+A H +IGC TG GA DGV+GL D S S L
Sbjct: 192 RMARLPGH----------LIGCSSSFTGQSFQGA--DGVLGLAFSDFSFTS--TATSLYG 237
Query: 275 NSFSICF-----DENDSGSVFFGDQGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSC 327
FS C ++N S + FG + T+ L + Y + V +G
Sbjct: 238 AKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDM 297
Query: 328 L--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCY 378
L SG ++DSG S T L Y +VV + LV KR+ +G +YC+
Sbjct: 298 LDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCF 357
Query: 379 NASSE-EMLKVPDM 391
+ +S + K+P +
Sbjct: 358 SFTSGFNVSKLPQL 371
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 127/296 (42%), Gaps = 38/296 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+ WV QC P Y ++ +DP+ SS+ NVS
Sbjct: 183 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYEQQEK---LFDPARSSTYANVS 234
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 235 CAAPACFDLDTRGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 286
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
GCG + G + + A G++GLG G S+P K G + F+ C SG+ +
Sbjct: 287 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSSGTGYLD 340
Query: 291 FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QSGFQ---ALVDSGA 341
FG PA + P+ G + Y+VG+ +G L+ QS F +VDSG
Sbjct: 341 FGPGSPAAAGARLTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGT 398
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T LP Y+ + F ++++ + + + CY+ + + +P + L+F
Sbjct: 399 VITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLF 454
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 133/320 (41%), Gaps = 60/320 (18%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE-YDPSSSSS 168
++ I +G P LV +D GS+L+W +QC P Y R ++ YDP +S +
Sbjct: 92 YFAVIGVGDPPTHALVVIDTGSDLIW-----LQCLPCRRCY-----RQVTPLYDPRNSKT 141
Query: 169 SKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+ + C+ P C+ C + C Y+ Y + ++SSG L D L L P
Sbjct: 142 HRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYG-DGSASSGDLATDTLVL-------PD 193
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-- 283
+ +V +GCG G A G++G G G +S P+ LA A + FS C +
Sbjct: 194 DTRVHNVTLGCGHDNEGLLASAA---GLLGAGRGQLSFPTQLAPA--YGHVFSYCLGDRM 248
Query: 284 ----NDSGSVFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCLTQSGFQ-- 334
N S + FG ST+F P+ + Y+V + + +G + +GF
Sbjct: 249 SRARNSSSYLVFGRT--PELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERV--AGFSNA 304
Query: 335 ------------ALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQGNSWKYCYN 379
+VDSG + + + YA V V +R+ + + + CY+
Sbjct: 305 SLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYD 364
Query: 380 ASSE---EMLKVPDMRLIFS 396
++VP + L F+
Sbjct: 365 VHGNGPGTGVRVPSIVLHFA 384
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 152/368 (41%), Gaps = 67/368 (18%)
Query: 66 LSNDWKRQKTRVKL----QSNNNSSRNQL---------------LFPSEGSQTHFFGNQF 106
LS D R +R ++ +S NS R++L PS+ T GN
Sbjct: 80 LSQDKGRSPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKGSKVTLPSKSGSTIGTGN-- 137
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
+ + +GTP D GS+L W QC P + Y + ++PS S
Sbjct: 138 ---YVVTVGLGTPKRDLTFIFDTGSDLTWT-----QCEPCARYCY---HQQEPIFNPSKS 186
Query: 167 SSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
+S N+SCS P C + SC + C Y Y + + S G+ D L L S
Sbjct: 187 TSYTNISCSSPTCDELKSGTGNSPSCSA--STCVYGIQYG-DQSYSVGFFAQDKLALTS- 242
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFS 278
+ V ++ + GCG+ G ++ A G++GLG +S+ S A K G + FS
Sbjct: 243 ------TDVFNNFLFGCGQNNRGLFVGVA---GLIGLGRNALSLVSQTAQKYGKL---FS 290
Query: 279 ICFDENDS--GSVFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCLTQSG- 332
C S G + FG G T ++ F P + YF+ + + +G L+ S
Sbjct: 291 YCLPSTSSSTGYLTFGSGG-GTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSAS 349
Query: 333 ----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
++DSG + LP Y+++ F + +S + + CY+ S + + V
Sbjct: 350 VFSTAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDV 409
Query: 389 PDMRLIFS 396
P + L FS
Sbjct: 410 PKINLYFS 417
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 93/330 (28%), Positives = 142/330 (43%), Gaps = 45/330 (13%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
IGTP + + LD GS+L W+ QC+ C + ++N YDP SSS +N+ C
Sbjct: 96 IGTPPKHYSLILDTGSDLNWI--QCVPC-------HDCFEQNGPYYDPKESSSFRNIGCH 146
Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSV 228
P C SS CK+ CPY Y ++ + + ++L S + + V
Sbjct: 147 DPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRV 206
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DE 283
+ +V+ GCG G + GA+ G++GLG G +S S L L +SFS C D
Sbjct: 207 E-NVMFGCGHWNRGLF-HGAS--GLLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDT 260
Query: 284 NDSGSVFFG-DQGPATQQSTSFLP-IGEKYDA----YFVGVESYCIGNSCL--------- 328
N S + FG D+ +F +G K + Y+V ++S +G L
Sbjct: 261 NVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNM 320
Query: 329 TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
T G +VDSG + ++ Y + F K V I CYN S E +
Sbjct: 321 TSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDPCYNVSGVEKID 380
Query: 388 VPDMRLIFSKNQ--SFVVRNHIFSFPENEV 415
+PD ++F+ +F V N+ EV
Sbjct: 381 LPDFGILFADGAVWNFPVENYFIRLDPEEV 410
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 135/320 (42%), Gaps = 44/320 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y + +G+P + + +D GS+ W +QC P + + D ++PS+S +
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSW-----LQCQPCTIYCHIQED---PVFNPSASKTY 154
Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
K V CS C + +C + C Y A Y + + S GYL D+L L
Sbjct: 155 KTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYG-DSSFSLGYLSQDVLTLT----- 208
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
S SS + GCG+ G + DG++GL ++S+ S L +G N+FS C
Sbjct: 209 --PSQTLSSFVYGCGQDNQGLF---GRTDGIIGLANNELSMLSQL--SGKYGNAFSYCLP 261
Query: 282 ------DENDSGSVFFGDQGPATQQSTSFLPIGEKYD---AYFVGVESYCIGNSCLTQSG 332
+ G + G S F P+ + + YF+ +ES + L +
Sbjct: 262 TSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAA 321
Query: 333 ----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLK 387
++DSG T LPT +Y + + ++S K G S C+ S + +
Sbjct: 322 SSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISE 381
Query: 388 V-PDMRLIFSKNQSFVVRNH 406
V PD+R+IF ++ H
Sbjct: 382 VAPDIRIIFKGGADLQLKGH 401
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 135/320 (42%), Gaps = 44/320 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y + +G+P + + +D GS+ W +QC P + + D ++PS+S +
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSW-----LQCQPCTIYCHIQED---PVFNPSASKTY 154
Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
K V CS C + +C + C Y A Y + + S GYL D+L L
Sbjct: 155 KTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYG-DSSFSLGYLSQDVLTLT----- 208
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
S SS + GCG+ G + DG++GL ++S+ S L +G N+FS C
Sbjct: 209 --PSQTLSSFVYGCGQDNQGLF---GRTDGIIGLANNELSMLSQL--SGKYGNAFSYCLP 261
Query: 282 ------DENDSGSVFFGDQGPATQQSTSFLPIGEKYD---AYFVGVESYCIGNSCLTQSG 332
+ G + G S F P+ + + YF+ +ES + L +
Sbjct: 262 TSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAA 321
Query: 333 ----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLK 387
++DSG T LPT +Y + + ++S K G S C+ S + +
Sbjct: 322 SSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISE 381
Query: 388 V-PDMRLIFSKNQSFVVRNH 406
V PD+R+IF ++ H
Sbjct: 382 VAPDIRIIFKGGADLQLKGH 401
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 157/379 (41%), Gaps = 53/379 (13%)
Query: 61 YLELLLSNDWKRQKTRVKLQSNN-NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
Y +L D K T+ +L + SR + L + + Q +L + IG P
Sbjct: 23 YRLVLTHVDSKGGYTKTELMRRAVHRSRLRALSGYDATSPRLHSVQVEYLM--ELAIGKP 80
Query: 120 NVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
V F+ D GS+L W CQ C C P ++ YDPS+SS+ + CS
Sbjct: 81 PVPFVALADTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPLPCSSAT 130
Query: 179 CK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
C SR+ S C Y Y + S+G L + L L S AP S V G
Sbjct: 131 CLPIWSRNCTPSSL--CRYRYAYG-DGAYSAGILGTETLTLGPSS--APVSV--GGVAFG 183
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE--NDS------- 286
CG G L+ G +GLG G + SLLA+ G+ FS C + N +
Sbjct: 184 CGTDNGGDSLNST---GTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFNSALDSPFLL 235
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT----------QSGFQAL 336
G++ GP+T QST L + YFV ++ +G+ L +
Sbjct: 236 GTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMI 295
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
VDSG +FT L + EVV + +++ ++ C+ A + E +PD+ L F+
Sbjct: 296 VDSGTTFTILAESGFREVVGRVARVLGQPPVNASSLDAP-CFPAPAGEPPYMPDLVLHFA 354
Query: 397 KNQSF-VVRNHIFSFPENE 414
+ R++ S+ E +
Sbjct: 355 GGADMRLYRDNYMSYNEED 373
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 147/352 (41%), Gaps = 46/352 (13%)
Query: 65 LLSNDWKRQK---TRVKLQSNNNSSRNQL---LFPSEGSQTHFFGNQFYWLHYTWIDIGT 118
+L+ D +R K +R+ +SS ++L P++ GN ++ + +GT
Sbjct: 99 ILNQDKERVKYINSRISKNLGQDSSVSELDSVTLPAKSGSLIGSGN-----YFVVVGLGT 153
Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
P + D GS+L W QC P + S Y D + +DPS S+S N++C+ L
Sbjct: 154 PKRDLSLIFDTGSDLTWT-----QCEPCARSCYKQQD---AIFDPSKSTSYSNITCTSTL 205
Query: 179 CKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C S+ C + C Y Y + + S GY + L + + + + +
Sbjct: 206 CTQLSTATGNEPGCSASTKACIYGIQYG-DSSFSVGYFSRERLSVTA-------TDIVDN 257
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSV 289
+ GCG+ G + A G++GLG +S + A + + FS C S G +
Sbjct: 258 FLFGCGQNNQGLFGGSA---GLIGLGRHPISF--VQQTAAVYRKIFSYCLPATSSSTGRL 312
Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----TQSGFQALVDSGASFT 344
FG + + T F I Y + + +G + L T S A++DSG T
Sbjct: 313 SFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGAIIDSGTVIT 372
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
LP Y + F + +S + + + CY+ S E+ +P + F+
Sbjct: 373 RLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSFA 424
>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
Length = 415
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 83/329 (25%), Positives = 135/329 (41%), Gaps = 63/329 (19%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
HY + P + +D GSN+ W +E + S S +
Sbjct: 56 HYRFELTHRPKDNISAVVDTGSNIFWT----------------------TEKECSRSKTR 93
Query: 170 KNVSCSHPLCKSRSSCKSLKD----------PCPYIADYS-TEDTSSSGYLVDDILHL-A 217
+ C P C+ R+SC + C Y Y + S++G L +D L + A
Sbjct: 94 SMLPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYGGNANDSTAGVLYEDKLTIVA 153
Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
SK P S V IGC T + D + GV GLG S+P L + F
Sbjct: 154 VASKAVPGSQSFEEVAIGCSTSATLKFKDPSI-KGVFGLGRSATSLPRQLNFS-----KF 207
Query: 278 SIC---FDENDSGSVFFGDQGP----------ATQQSTSFLPIGEKYDAYFVGVESYCIG 324
S C + + D S P A +T+ P + YFV ++ IG
Sbjct: 208 SYCLSSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIG 267
Query: 325 NSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKYC 377
+ L T+SG VD+G SFT L ++A++V + D+++ ++ + N+ + C
Sbjct: 268 GTRLPAVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQIC 327
Query: 378 Y---NASSEEMLKVPDMRLIFSKNQSFVV 403
Y + +++E K+PDM L F+ + + V+
Sbjct: 328 YSPPSTAADESSKLPDMVLHFADSANMVL 356
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 89/340 (26%), Positives = 139/340 (40%), Gaps = 51/340 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I +GTP V L+A+D GS++ W+ CQ C +C P S +DP S+S + +
Sbjct: 138 IAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPV----------FDPRHSTSYREM 187
Query: 173 SCSHPLCKS--RSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
P C++ RS K C Y Y + +++ G +++ L A P S
Sbjct: 188 GYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAG-GVQVPHMS-- 244
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE------ 283
IGCG G + AA G++GLG G +S PS +A G SFS C +
Sbjct: 245 ----IGCGHDNKGLFAAPAA--GILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSP 298
Query: 284 --NDSGSVFFGDQGPATQQSTSFLPIGEK------YDAYFVGVESYCIGNSCLTQSGFQ- 334
+ S ++ GD A SF P + Y VGV + +T+ +
Sbjct: 299 GRSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKL 358
Query: 335 --------ALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNS--WKYCYNASSE 383
++DSG + T L Y A V ++S+ G S + CY
Sbjct: 359 DPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGGR 418
Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
M KVP + + F+ + + P + +G CF++
Sbjct: 419 AM-KVPTVSMHFAGGVELTLPPKNYLIPVDSMGT-VCFAF 456
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 106/259 (40%), Gaps = 40/259 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I+IG P + + +D GS+L WV C P + +L ++ Y P+ + + V
Sbjct: 66 INIGNPPNPYELDIDTGSDLTWVQCD----GPDAPCKGCTLPKD-KLYKPNGN---QLVK 117
Query: 174 CSHPLCKS--------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
CS P+C + C PC Y +Y+ ++ S+G L D +H+ S P
Sbjct: 118 CSDPICAAVQPPFSTFGQKCAKPIPPCVYKVEYA-DNAESTGALARDYMHIGS-----PS 171
Query: 226 SSVQSSVIIGCGRKQT-GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
S V+ GCG +Q + GV+GLG G +S+ S L G I N C
Sbjct: 172 GSNVPLVVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVLGHCLSAE 231
Query: 285 DSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 335
G +F GD+ P Q S EK+ Y G G Q
Sbjct: 232 GGGYLFLGDKFIPSSGIFWTPIIQSSL------EKH--YSTGPVDLFFNGKPTPAKGLQI 283
Query: 336 LVDSGASFTFLPTEIYAEV 354
+ DSG+S+T+ +Y V
Sbjct: 284 IFDSGSSYTYFSPRVYTIV 302
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 127/297 (42%), Gaps = 49/297 (16%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP + L+A+D ++ W+PC C+ C S++ + ++ S++ K V C
Sbjct: 102 IGTPAQTMLLAMDTSNDAAWIPCSGCVGC---SSTVFNNVK----------STTFKTVGC 148
Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
P CK + K C + Y + +++ L D++ LA+ S S
Sbjct: 149 EAPQCKQVPNSKCGGSACAFNMTYGSSSIAAN--LSQDVVTLATDSI--------PSYTF 198
Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVF 290
GC + TGS + P G++GLG G +S+ L L Q++FS C N SGS+
Sbjct: 199 GCLTEATGSSIP---PQGLLGLGRGPMSL--LSQTQNLYQSTFSYCLPSFRSLNFSGSLR 253
Query: 291 FGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDS 339
G G P ++T L + Y+V + + +G + +G + DS
Sbjct: 254 LGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDS 313
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
G FT L Y V F K V + ++ G + CY + + P + +FS
Sbjct: 314 GTVFTRLVAPAYTAVRDAFRKRVGNATVTSLGG-FDTCYTSP----IVAPTITFMFS 365
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 86/354 (24%), Positives = 144/354 (40%), Gaps = 53/354 (14%)
Query: 57 NSVEYLELLLSNDWKRQKTRVKLQSNNN-SSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
N+ +E+LL + + KL ++ + P++ + GN + I
Sbjct: 85 NAPNLVEILLEDQSRVDSIHAKLSDHSGVKETDAAKLPTKSGMSLGTGN-----YIVSIG 139
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+G+P ++ D GS+L W C + +DP+ S+S NVSCS
Sbjct: 140 LGSPKKDLMLIFDTGSDLTWARCSAAE-----------------TFDPTKSTSYANVSCS 182
Query: 176 HPLCKSRSSC-----KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
PLC S S + C Y Y + + S G+L + L + S + + +
Sbjct: 183 TPLCSSVISATGNPSRCAASTCVYGIQYG-DGSYSIGFLGKERLTIGS-------TDIFN 234
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF 290
+ GCG+ G + A G++GLG +SV S A FS C S S
Sbjct: 235 NFYFGCGQDVDGLFGKAA---GLLGLGRDKLSVVSQTAPK--YNQLFSYCLPS--SSSTG 287
Query: 291 FGDQGPATQQSTSFLPIGEKYDAYF--------VGVESYCIGNSCLTQSGFQALVDSGAS 342
F G + +S F P+ +++ VG + I S + +G ++DSG
Sbjct: 288 FLSFGSSQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAG--TIIDSGTV 345
Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
T LP Y+ + F K ++S + + CY+ S + +KVP + + FS
Sbjct: 346 VTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVISFS 399
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 151/366 (41%), Gaps = 45/366 (12%)
Query: 77 VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLW 135
++L +N++ R++ L S H + +YT + IGTP F + +D GS + +
Sbjct: 3 LELVANSHRRRDRELLGSARMDLH--DDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTY 60
Query: 136 VPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
VPC C C + + P+ SSS K + C C + S K Y
Sbjct: 61 VPCSSCTHCG----------NHQDPRFSPALSSSYKPLECGSE-CSTGFCDGSRK----Y 105
Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
Y+ E ++SSG L D++ ++ S Q ++ GC +TG D A DG++
Sbjct: 106 QRQYA-EKSTSSGVLGKDVIGFSNSSDLGGQR-----LVFGCETAETGDLYDQTA-DGII 158
Query: 255 GLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKY 311
GLG G +S+ L + +++ FS+C+ DE + G Q P T+ P Y
Sbjct: 159 GLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTASDPHRSPY 218
Query: 312 DAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
Y + ++ +G S L + ++DSG ++ + P + + V S
Sbjct: 219 --YNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSL 276
Query: 366 RISLQGNSWKY---CYNASSEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEVGDH 418
+ + G K+ CY + + + P + +F QS + + F ++
Sbjct: 277 K-EVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISGA 335
Query: 419 ACFSYF 424
C F
Sbjct: 336 YCLGVF 341
>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
Length = 566
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 117/277 (42%), Gaps = 45/277 (16%)
Query: 92 FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYY 151
FP +G+ F L+YT + +GTP F V +D GS++LWV C P +
Sbjct: 118 FPVDGASDPFL----VGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKT---- 169
Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGY 208
+ L LS +DP SSS+ VSCS C S +S P C Y Y + + +SGY
Sbjct: 170 SELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYG-DGSGTSGY 228
Query: 209 LVDDIL--HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 266
+ D + +L S P+ +V DG+ GLG G +SV S
Sbjct: 229 YISDFMCSNLQSGDLQRPRRAV----------------------DGIFGLGQGSLSVISQ 266
Query: 267 LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 326
LA GL FS C + SG G + T + P+ Y V ++S +
Sbjct: 267 LAVQGLAPRVFSHCLKGDKSGGGIM-VLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQ 325
Query: 327 CL--------TQSGFQALVDSGASFTFLPTEIYAEVV 355
L +G ++D+G + +LP E Y+ +
Sbjct: 326 ILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFI 362
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 90/201 (44%), Gaps = 30/201 (14%)
Query: 103 GNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
GN F +Y+ + IGTP +F +D GS+L WV C AP + + +Y
Sbjct: 46 GNVFPLGYYSVLLQIGTPPKAFEFDIDTGSDLTWVQCD----APCTGCTLPPI----RQY 97
Query: 162 DPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
P ++ V C P+C ++ C + K+ C Y +Y+ + SS G LV D L
Sbjct: 98 KPKGNT----VPCLDPICLALHFPNKPQCPNPKEQCDYEVNYA-DQGSSMGALVIDQFPL 152
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAKAGL 272
+ S++Q + GCG Q L A P GV+GLG G + V L AGL
Sbjct: 153 KLLNG----SAMQPRLAFGCGYDQI---LPKAHPPPATAGVLGLGRGKIGVLPQLVAAGL 205
Query: 273 IQNSFSICFDENDSGSVFFGD 293
+N C G +FFGD
Sbjct: 206 TRNVVGHCLSSKGGGYLFFGD 226
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 133/322 (41%), Gaps = 47/322 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP F + LD GS+L W+ C CI C S YY DP SSS +N+SC
Sbjct: 201 VGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYY----------DPKDSSSFRNISC 250
Query: 175 SHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSS 227
P C+ SS CK+ CPY Y ++ + ++ ++L + + +
Sbjct: 251 HDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKH 310
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDS 286
V+ +V+ GCG G + A G+ L S L SFS C D N +
Sbjct: 311 VE-NVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLYGQSFSYCLVDRNSN 364
Query: 287 GSV----FFG-DQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL-------- 328
SV FG D+ + + +F G D Y+V + S + + L
Sbjct: 365 ASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWH 424
Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
++ ++DSG + T+ Y + F + + + K CYN S E +
Sbjct: 425 LSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKM 484
Query: 387 KVPDMRLIFSKNQ--SFVVRNH 406
++PD ++F+ +F V N+
Sbjct: 485 ELPDFGILFADGAVWNFPVENY 506
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 88/341 (25%), Positives = 137/341 (40%), Gaps = 51/341 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ T I +GTP F V D GS+L+W IQC P A + ++ +DP SSS
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIW-----IQCKPCQACF----NQKDPIFDPEGSSSY 90
Query: 170 KNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+SC LC S R SC C Y Y + + + G L + + L S +
Sbjct: 91 TTMSCGDTLCDSLPRKSCSP---DCDYSYGYG-DGSGTRGTLSSETVTLTSTQG---EKL 143
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----D 282
++ GCG GS+ D + G++GLG G++S S L L + FS C
Sbjct: 144 AAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDA 198
Query: 283 ENDSGSVFFGDQGPATQQST----SFLPI---GEKYDAYFVGVESYCIGNSCL------- 328
+ + +FFGD+ + +F P+ Y+V ++ I L
Sbjct: 199 PSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSF 258
Query: 329 --TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
G ++ DSG + T LP Y V+ +S +I CY+ S +
Sbjct: 259 DIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKA 318
Query: 386 ---LKVPDMRLIF-SKNQSFVVRNHIFSFPENEVGDHACFS 422
+K+P M F + V N+ + N+ G C +
Sbjct: 319 SYKMKIPAMVFHFEGADYQLPVENYFIA--ANDAGTIVCLA 357
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 145/362 (40%), Gaps = 45/362 (12%)
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQ-LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
LL D R + + + +N S+ + P+E + GN + + +GTP
Sbjct: 113 LLDQDQARVDSILGMITNETSAVGPGVSLPAERGISVGTGN-----YVVSVGLGTPARDL 167
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
V D GS+L WV QC C+ S Y D + PS SS+ V C C++R
Sbjct: 168 TVVFDTGSDLSWV--QCGPCS--SGGCYKQQD---PLFAPSDSSTFSAVRCGARECRARQ 220
Query: 184 SCKSLK--DPCPYIADYSTEDTSSSGYLVDDILHLASFS---KHAPQSSVQSSVIIGCGR 238
SC D CPY Y + + + G+L +D L L + + A + + GCG
Sbjct: 221 SCGGSPGDDRCPYEVVYG-DKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFVFGCGE 279
Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG--- 295
TG L G A DG+ GLG G VS+ S AG FS C + S + + G
Sbjct: 280 NNTG--LFGQA-DGLFGLGRGKVSLSS--QAAGKFGEGFSYCLPSSSSSAPGYLSLGTPV 334
Query: 296 --PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS----GFQALVDSGASFTFLPTE 349
PA Q T L Y+V + + + S +VDSG T L
Sbjct: 335 PAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSGTVITRLAPR 394
Query: 350 IYAEVVVKFDKLVS------SKRISLQGNSWKYCYN--ASSEEMLKVPDMRLIFSKNQSF 401
Y + F + + R+S+ CY+ A + + +P + L+F+ +
Sbjct: 395 AYRALRAAFLSAMGKYGYKRAPRLSI----LDTCYDFTAHANATVSIPAVALVFAGGATI 450
Query: 402 VV 403
V
Sbjct: 451 SV 452
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 82/322 (25%), Positives = 135/322 (41%), Gaps = 47/322 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP F + LD GS+L W+ C CI C S YY DP SSS +N+SC
Sbjct: 203 VGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYY----------DPKDSSSFRNISC 252
Query: 175 SHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSS 227
P C+ S+ CK+ CPY Y ++ + ++ ++L + + +
Sbjct: 253 HDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKH 312
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDS 286
V+ +V+ GCG G + A G+ L S L SFS C D N +
Sbjct: 313 VE-NVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLYGQSFSYCLVDRNSN 366
Query: 287 GSV----FFG-DQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL-------- 328
SV FG D+ + + +F G D Y+V ++S + + L
Sbjct: 367 ASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWH 426
Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
++ ++DSG + T+ Y + F + + ++ K CYN S E +
Sbjct: 427 LSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKM 486
Query: 387 KVPDMRLIFSKNQ--SFVVRNH 406
++PD ++F+ +F V N+
Sbjct: 487 ELPDFGILFADEAVWNFPVENY 508
>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
Length = 362
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 61/202 (30%), Positives = 91/202 (45%), Gaps = 35/202 (17%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNL------- 158
Y+ WI GTP F + +D+GS + +VPC C QC + D+ L
Sbjct: 91 YYTTRLWI--GTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQVMLSSPKDQILCLVSCKV 148
Query: 159 -------------SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS 205
++ P SS+ + V C+ +C K+ C Y +Y+ E +SS
Sbjct: 149 QIFKISYGLFDEDPKFQPELSSTYQPVKCNM-----DCNCDDDKEQCVYEREYA-EHSSS 202
Query: 206 SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 265
G L +D++ + S PQ +V GC +TG A DG++GLG GD+S+
Sbjct: 203 KGVLGEDLISFGNESHLTPQRAV-----FGCKTVETGDLYSQRA-DGIIGLGQGDLSLVG 256
Query: 266 LLAKAGLIQNSFSICFDENDSG 287
L GLI NSF +C+ D G
Sbjct: 257 QLVDKGLISNSFGLCYGGLDVG 278
>gi|125589909|gb|EAZ30259.1| hypothetical protein OsJ_14308 [Oryza sativa Japonica Group]
Length = 178
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 63/128 (49%), Gaps = 8/128 (6%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWV-PCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT I IGTP V + V LD GS WV C QC + + + R L+ YDP SS
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 112
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
SSK V C +C SR C ++ CPYI Y+ + + G L D+LH +
Sbjct: 113 SSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLHYHQLYGNGQTQP 170
Query: 228 VQSSVIIG 235
+SV G
Sbjct: 171 TSTSVTFG 178
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 91/328 (27%), Positives = 138/328 (42%), Gaps = 50/328 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP V F+ D GS+L W CQ C C P ++ YDPS+SS+ V
Sbjct: 81 LAIGTPPVPFVALADTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPV 130
Query: 173 SCSH----PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
CS P+ +SR +C + C Y YS + S+G L + L L S P +V
Sbjct: 131 PCSSATCLPVLRSR-NCSTPSSLCRYGYSYS-DGAYSAGILGTETLTLGS---SVPGQAV 185
Query: 229 Q-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDE 283
S V GCG G L+ G +GLG G + SLLA+ G+ FS C F+
Sbjct: 186 SVSDVAFGCGTDNGGDSLNST---GTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFNS 237
Query: 284 NDSGSVFFGD-----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------- 328
G GP QST L Y V ++ +G+ L
Sbjct: 238 TLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLH 297
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-NSWKYCYNASSEEMLK 387
S +VDSG +F+ LP + VV +++ ++ +S + A ++
Sbjct: 298 ANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSPCFPAPAGERQLPF 357
Query: 388 VPDMRLIFSKNQSFVV-RNHIFSFPENE 414
+PD+ L F+ + R++ S+ + +
Sbjct: 358 MPDLVLHFAGGADMRLHRDNYMSYNQED 385
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 78/297 (26%), Positives = 131/297 (44%), Gaps = 41/297 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
Y + Y+ +GTP +D GS+++W +QC P Y ++ ++PS S
Sbjct: 87 YLMTYS---VGTPPFKLYGIVDTGSDIVW-----LQCEPCQECY----NQTTPMFNPSKS 134
Query: 167 SSSKNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
SS KN+ C LC+S +SC K+ C Y + Y +++ S G L D L L S +
Sbjct: 135 SSYKNIPCPSKLCQSMEDTSCND-KNYCEY-STYYGDNSHSGGDLSVDTLTLESTNG--- 189
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
+ +++IGCG SY +GA+ G++G G G S + L + FS C
Sbjct: 190 LTVSFPNIVIGCGTNNILSY-EGAS-SGIVGFGSGPASFITQLGSS--TGGKFSYCLTPL 245
Query: 282 ------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGF 333
N + + FGD + PI +K Y++ +E++ +GN + G
Sbjct: 246 FSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGV 305
Query: 334 -------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
++DSG + T L + Y+ + LV +R+ + CY+ +E
Sbjct: 306 PNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTLNLCYSVKAE 362
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 82/299 (27%), Positives = 129/299 (43%), Gaps = 53/299 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I IG+P V+ L+ +D S+LLW+ C+ CI C ++L +DPS S + +N
Sbjct: 89 ISIGSPPVTQLLHMDTASDLLWLQCRPCINCYA----------QSLPIFDPSRSYTHRNE 138
Query: 173 SC-----SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
SC S P + + +S C Y Y + T S G L ++L + + ++
Sbjct: 139 SCRTSQYSMPSLRFNAKTRS----CEYSMRY-MDGTGSKGILAKEMLMFNTIYDESSSAA 193
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
+ V+ GCG G L G G++GLG G+ SL+ + G FS CF D
Sbjct: 194 LH-DVVFGCGHDNYGEPLVGT---GILGLGYGEF---SLVHRFG---TKFSYCFGSLDDP 243
Query: 288 S-----VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT----------QSG 332
S + GD G T+ L I + Y+V +E+ + L Q+G
Sbjct: 244 SYPHNVLVLGDDGANILGDTTPLEIYNGF--YYVTIEAISVDGIILPIDPWVFNRNHQTG 301
Query: 333 FQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKY-CYNASSEEML 386
++D+G S T L E Y + K + + + Q + +K CYN + E L
Sbjct: 302 LGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDL 360
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 87/344 (25%), Positives = 149/344 (43%), Gaps = 56/344 (16%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T I +GTP+ L+ LD GS+++W +QCAP Y D++ +DP SSS
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVW-----LQCAPCRRCY----DQSGPVFDPRRSSSY 190
Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
V C+ PLC+ S C + C Y Y + + ++G + L A ++ A
Sbjct: 191 GAVDCAAPLCRRLDSGGCDLRRRACLYQVAYG-DGSVTAGDFATETLTFAGGARVA---- 245
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND-- 285
V +GCG G ++ A ++GLG G +S P+ +++ SFS C +
Sbjct: 246 ---RVALGCGHDNEGLFVAAAG---LLGLGRGSLSFPTQISR--RYGKSFSYCLVDRTSS 297
Query: 286 ----------SGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS---CLT 329
S +V F GP + + SF P+ Y+V + +G + +
Sbjct: 298 SSSGAASRSRSSTVTF---GPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVA 354
Query: 330 QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYN 379
+S + +VDSG S T L Y+ + F + R+S G S + CY+
Sbjct: 355 ESDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYD 414
Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
+++KVP + + F+ + + P + G CF++
Sbjct: 415 LGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTF-CFAF 457
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 85/355 (23%), Positives = 147/355 (41%), Gaps = 52/355 (14%)
Query: 59 VEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGT 118
V+Y++ LS + R+ + +L S +++ L S ++ + +GT
Sbjct: 98 VKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSAN-------------YFVVVGLGT 144
Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
P + D GS+L W QC P + S Y D + +DPS SSS N++C+ L
Sbjct: 145 PKRDLSLVFDTGSDLTWT-----QCEPCAGSCYKQQD---AIFDPSKSSSYINITCTSSL 196
Query: 179 CKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
C +S C S C Y Y + T S G+L + L + + + +
Sbjct: 197 CTQLTSAGIKSRCSSSTTACIYGIQYGDKST-SVGFLSQERLTITA-------TDIVDDF 248
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSVF 290
+ GCG+ G + A G++GLG +S + + + FS C S G +
Sbjct: 249 LFGCGQDNEGLFSGSA---GLIGLGRHPISF--VQQTSSIYNKIFSYCLPSTSSSLGHLT 303
Query: 291 FGDQGPATQQSTSFLPIGE-KYDAYFVGVE--SYCIGNSCL------TQSGFQALVDSGA 341
FG AT + + P+ D F G++ +G + L T S +++DSG
Sbjct: 304 FGASA-ATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGT 362
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
T L YA + F + + ++ + + CY+ S + + VP + F+
Sbjct: 363 VITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKIDFEFA 417
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 130/322 (40%), Gaps = 44/322 (13%)
Query: 81 SNNNSSRNQLLFPSEGSQTHF------------FGNQF-YWLHYTWIDIGTPNVSFLVAL 127
S++ R + +FP + + +GN + +Y + IG P + +
Sbjct: 25 SDHQHKRKKAVFPEPAASSSLINIIQSSVVFPLYGNVYPLGYYYVSLSIGQPPKPYFLDP 84
Query: 128 DAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS-SSKNVSCSHPLCKSRSS 184
D GS+L W+ C C++C Y + + DP +S C HP
Sbjct: 85 DTGSDLSWLQCDAPCVRCTKAPHPLYRPNNNLVICKDPMCASLHPPGYKCEHP------- 137
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
+ C Y +Y+ + SS G LV D+ L+ + + AP+ + +GCG Q
Sbjct: 138 -----EQCDYEVEYA-DGGSSLGVLVKDVFPLNFTNGLRLAPR------LALGCGYDQIP 185
Query: 243 --SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
SY DGV+GLG G S+ S L G+I+N C G +FFGD + +
Sbjct: 186 GQSY---HPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGGGFLFFGDDLYDSSR 242
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
+ +++ Y G +G DSG+S+T+L + Y +V K
Sbjct: 243 VVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVRK 302
Query: 361 LVSSK--RISLQGNSWKYCYNA 380
+S K R +L + C+
Sbjct: 303 ELSEKPVREALDDQTLPLCWRG 324
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 134/329 (40%), Gaps = 43/329 (13%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+GTP F + LD GS+L W+ QC+ C Y ++N YDP SSS +N+ C
Sbjct: 187 VGTPPKHFSLILDTGSDLNWI--QCVPC-------YECFEQNGPHYDPGQSSSYRNIGCH 237
Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
C SS CK+ CPY Y ++ + ++ + S P+
Sbjct: 238 DSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRV 297
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
+V+ GCG G + A ++GLG G +S S L L +SFS C D N
Sbjct: 298 ENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDAN 352
Query: 285 DSGSVFFG-DQGPATQQSTSF--LPIGEKYDA---YFVGVESYCIGNSCL---------- 328
S + FG D+ + +F L G++ Y+V ++S +G +
Sbjct: 353 VSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIA 412
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
T ++DSG + ++ Y + F V + + CYN + E +
Sbjct: 413 TDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDL 472
Query: 389 PDMRLIFSKNQ--SFVVRNHIFSFPENEV 415
PD ++FS +F V N+ EV
Sbjct: 473 PDFGIVFSDGAVWNFPVENYFIEIEPREV 501
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 125/301 (41%), Gaps = 69/301 (22%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++IGTP + V +D GS+L WVPC CI C L ++ ++ S + P SSS
Sbjct: 15 LNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNL----KSSSIFSPLHSSS 70
Query: 169 SKNVSCSHPLCKSRSSCKSLKD-------------------PCPYIADYSTEDTSSSGYL 209
S SC+ C S + D PCP A E SG L
Sbjct: 71 SFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGIL 130
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
DIL + ++ P+ S GC T +Y + P G+ G G G +S+PS L
Sbjct: 131 TRDILK--ARTRDVPRFS------FGC---VTSTYHE---PIGIAGFGRGLLSLPSQL-- 174
Query: 270 AGLIQNSFSICF-------DENDSGSVFFGDQGPATQ-----QSTSFLPIGEKYDAYFVG 317
G ++ FS CF + N S + G + Q T L ++Y++G
Sbjct: 175 -GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIG 233
Query: 318 VESYCIGNSC------LTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
+ES IG + LT F + LVDSG ++T LP Y++++ ++
Sbjct: 234 LESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYP 293
Query: 366 R 366
R
Sbjct: 294 R 294
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 85/315 (26%), Positives = 134/315 (42%), Gaps = 41/315 (13%)
Query: 102 FGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
G F+ L Y I IGTP +F V D GS+L WV QC P + S Y +
Sbjct: 117 LGLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTWV-----QCKPCTDSCY---QQQEPL 168
Query: 161 YDPSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
+DPS SS+ +V C P CK +C C Y Y + + + G L + L
Sbjct: 169 FDPSKSSTYVDVPCGTPQCKIGGGQDLTCGGTT--CEYSVKYG-DQSVTRGNLAQEAFTL 225
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAKAGL 272
+P + + V+ GC + + S + GA + G++GLG GD S+ S + G
Sbjct: 226 ------SPSAPPAAGVVFGCSHEYS-SGVKGAEEEMSVAGLLGLGRGDSSILS-QTRRGN 277
Query: 273 IQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNS 326
+ FS C S G + G P Q + SF P+ + Y V + + +
Sbjct: 278 SGDVFSYCLPPRGSSAGYLTIGAAAPP-QSNLSFTPLVTDNSQLSSVYVVNLVGISVSGA 336
Query: 327 CLT--QSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--SWKYCYNA 380
L S F ++DSG T +P Y + +F + + + +G+ S CY+
Sbjct: 337 ALPIDASAFYIGTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDV 396
Query: 381 SSEEMLKVPDMRLIF 395
+ +++ P + L F
Sbjct: 397 TGHDVVTAPPVALEF 411
>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
Length = 392
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 139/349 (39%), Gaps = 75/349 (21%)
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSAS 149
+ P T + GNQ P + +D GSN+ W
Sbjct: 25 FMTPRTSCITFYLGNQ------------RPKDNISAVVDTGSNIFWT------------- 59
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD----------PCPYIADYS 199
+E + S S + + C P C+ R+SC + C Y Y
Sbjct: 60 ---------TEKECSRSKTRSMLPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYG 110
Query: 200 -TEDTSSSGYLVDDILHL-ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
+ S++G L +D L + A SK P S V IGC T + D + GV GLG
Sbjct: 111 GNANDSTAGVLYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLKFKDPSI-KGVFGLG 169
Query: 258 LGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGP----------ATQQSTSF 304
S+P L + FS C + + D S P A +T+
Sbjct: 170 RSATSLPRQLNFS-----KFSYCLSSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTAL 224
Query: 305 LPIGEKYDAYFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
P + YFV ++ IG + L T+SG VD+G SFT L ++A++V + D+
Sbjct: 225 QPNSDYKTRYFVDLQGISIGGTRLPAVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDR 284
Query: 361 LVSSKRISLQ---GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVV 403
++ ++ + N+ + CY + +++E K+PDM L F+ + + V+
Sbjct: 285 IMKERKYVKEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVL 333
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 130/305 (42%), Gaps = 50/305 (16%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP V++ +D GS+L+W C+ C++C +++ +DPSSSS+ +
Sbjct: 106 MSIGTPAVAYAAIIDTGSDLVWTQCKPCVEC----------FNQSTPVFDPSSSSTYAAL 155
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
CS LC S K C Y Y + +S+ G L + LA + V
Sbjct: 156 PCSSTLCSDLPSSKCTSAKCGYTYTYG-DSSSTQGVLAAETFTLA--------KTKLPDV 206
Query: 233 IIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGS 288
GCG G + GA G++GLG G + SL+++ GL N FS C D+
Sbjct: 207 AFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--NKFSYCLTSLDDTSKSP 258
Query: 289 VFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ---- 334
+ G ++ Q+T + + Y+V ++ +G++ +T S F
Sbjct: 259 LLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDD 318
Query: 335 ----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
+VDSG S T+L + Y + F + G C+ A + + +V
Sbjct: 319 GTGGVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPASGVDQVEV 378
Query: 391 MRLIF 395
+L+F
Sbjct: 379 PKLVF 383
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 130/312 (41%), Gaps = 37/312 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ +GTP F++ D GS+L WV C+ + + AS S + P++S S
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLAS----PRVFRPANSKSW 165
Query: 170 KNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFS 220
+ CS CKS S+ + PC Y DY +D SS+ G + D +A
Sbjct: 166 APIPCSSDTCKSYVPFSLANCSAGTTPPAPCGY--DYRYKDKSSARGVVGTDAATIALSG 223
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+ + + V++GC G + DGV+ LG ++S S A FS C
Sbjct: 224 SGSDRKAKLQEVVLGCTTSYDGQSFQSS--DGVLSLGNSNISFASR--AAARFGGRFSYC 279
Query: 281 F-----DENDSGSVFFGDQGPATQQS-TSFLPIGEKYDAYFVGVESYCIGNSCL------ 328
N + + FG G A S T L + Y V V++ + L
Sbjct: 280 LVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEV 339
Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYN-ASSEE 384
+ A++DSG S T L T Y VV K L R+++ + ++YCYN ++
Sbjct: 340 WDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTM--DPFEYCYNWTATRR 397
Query: 385 MLKVPDMRLIFS 396
VP + + F+
Sbjct: 398 PPAVPRLEVRFA 409
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 85/309 (27%), Positives = 133/309 (43%), Gaps = 36/309 (11%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+GTP ++ + +D GS+L WV QC CA + S Y D +DP+ SSS V C
Sbjct: 143 LGTPGMAQTLEVDTGSDLSWV--QCKPCA--APSCYRQKD---PLFDPAQSSSYAAVPCG 195
Query: 176 HPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C S+C + + C Y+ Y + ++++G D L LA+ ++VQ
Sbjct: 196 RSACAGLGIYASACSAAQ--CGYVVSYG-DGSNTTGVYSSDTLTLAA------NATVQ-G 245
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGSVF 290
+ GCG Q+G G DG++G G PSL+ + AG FS C S + +
Sbjct: 246 FLFGCGHAQSGGLFTGI--DGLLGFGR---EQPSLVQQTAGAYGGVFSYCLPTKSSTTGY 300
Query: 291 FGDQGPATQQ----STSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA--LVDSGAS 342
GP+ +T LP Y V + +G L+ S F A +VD+G
Sbjct: 301 LTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTV 360
Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
T LP YA + F ++S + CY+ + + + + L FS +
Sbjct: 361 ITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATMT 420
Query: 403 V-RNHIFSF 410
+ + I SF
Sbjct: 421 LGADGIMSF 429
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 84/303 (27%), Positives = 126/303 (41%), Gaps = 69/303 (22%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++IGTP + V LD GS+L WVPC CI+C L + ++ S + P SS+
Sbjct: 87 LNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDL----KSPSVFSPLHSST 142
Query: 169 SKNVSCSHPLCKSRSSCKSLKD-------------------PCPYIADYSTEDTSSSGYL 209
S SC+ C S + D PCP A E SG L
Sbjct: 143 SFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGIL 202
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
DIL + ++ P+ S GC T +Y + P G+ G G G +S+PS L
Sbjct: 203 TRDILK--ARTRDVPRFS------FGC---VTSTYRE---PIGIAGFGRGLLSLPSQL-- 246
Query: 270 AGLIQNSFSICF-------DENDSGSVFFGDQGPATQ-----QSTSFLPIGEKYDAYFVG 317
G ++ FS CF + N S + G + Q T L ++Y++G
Sbjct: 247 -GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIG 305
Query: 318 VESYCIGNSC------LTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
+ES IG + LT F + LVDSG ++T LP Y++++ ++
Sbjct: 306 LESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITYP 365
Query: 366 RIS 368
R +
Sbjct: 366 RAT 368
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 134/313 (42%), Gaps = 39/313 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP V+ + +D GS+L WV QC CA + + Y+ D +DP+ SSS V
Sbjct: 144 VSLGTPGVAQTLEVDTGSDLSWV--QCTPCA--APACYSQKD---PLFDPAQSSSYAAVP 196
Query: 174 CSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
C P+C SSC + + C Y+ Y + + ++G D L L +P +V+
Sbjct: 197 CGGPVCGGLGIYASSCSAAQ--CGYVVSYG-DGSKTTGVYSSDTLTL------SPNDAVR 247
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
GCG Q+G + DG++GLG + S+ + AG FS C S +
Sbjct: 248 -GFFFGCGHAQSGFTGN----DGLLGLGREEASL--VEQTAGTYGGVFSYCLPTRPSTTG 300
Query: 290 FFGDQGPATQ-----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA--LVDSG 340
+ GP+ +T L Y V + +G L+ S F +VD+G
Sbjct: 301 YLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVDTG 360
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
T LP YA + F ++S + CYN S + +P++ L FS
Sbjct: 361 TVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNVALTFSGG 420
Query: 399 QSFVV-RNHIFSF 410
+ + + I SF
Sbjct: 421 ATVTLGADGILSF 433
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 153/368 (41%), Gaps = 63/368 (17%)
Query: 6 AICMLFGCILLDG-----SDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVE 60
A+ ++F + + G ++ +SF+++L+HR S + P N+ E
Sbjct: 14 ALSIIFLTVSMSGFSLVQAEKLSFTTELIHRDSPNS-----------------PLFNASE 56
Query: 61 YLELLLSNDWKRQKTRV-KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
++ L+N +R RV + ++S FPS F I IG P
Sbjct: 57 TTDIRLANAVERSADRVNRFNDLISNSITAAEFPSILDNGDFL---------MKISIGIP 107
Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
LV + GS+L+W+PC + + + +L +DP SS+ KNV C C
Sbjct: 108 PTELLVNVATGSDLVWIPCLSFKPC--------THNCDLRFFDPMESSTYKNVPCDSYRC 159
Query: 180 KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 239
+ ++ C Y D +D+ G L D L L S + +S + + CG +
Sbjct: 160 QITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLNSTTG---KSFMLPNTGFICGNR 216
Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGP 296
G Y G++GLG G +S+ + ++ LI FS C + N + + FGD+
Sbjct: 217 IGGDY----PGVGILGLGHGSLSLLNRISH--LIDGKFSHCIVPYSSNQTSKLSFGDKAV 270
Query: 297 ATQQ---STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA-------LVDSGASFTFL 346
+ ST G Y +Y + +GN ++ G + +DSG FT+
Sbjct: 271 VSGSAMFSTRLDMTGGPY-SYTLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGTMFTYF 329
Query: 347 PTEIYAEV 354
P Y+++
Sbjct: 330 PEYFYSQL 337
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 129/305 (42%), Gaps = 56/305 (18%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ GTP F + LD GS++ W C+ C++C L + +DPS+S
Sbjct: 166 VAFGTPPQKFTLILDTGSSITWTQCKPCVRC----------LKASRRHFDPSAS------ 209
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
L S SC Y Y + TS Y D + S V
Sbjct: 210 -----LTYSLGSCIPSTVGNTYNMTYGDKSTSVGNYGCDTMT--------LEHSDVFPKF 256
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFF 291
GCGR G + GA DG++GLG G +S S A + FS C E DS GS+ F
Sbjct: 257 QFGCGRNNEGDFGSGA--DGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLF 312
Query: 292 GDQGPATQQSTSF----LPIG------EKYDAYFVGVESYCIGNSCLT--QSGFQA---L 336
G++ AT QS+S L G E+ YFV + +GN L S F + +
Sbjct: 313 GEK--ATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTI 370
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMR 392
+DSG T LP Y+ + F K ++ +S +G+ CYN S + + +P++
Sbjct: 371 IDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIV 430
Query: 393 LIFSK 397
L F +
Sbjct: 431 LHFGE 435
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 119/278 (42%), Gaps = 32/278 (11%)
Query: 107 YWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y HY + IGTP D GS+L W C P + Y RN +DP
Sbjct: 68 YLGHYLMELSIGTPPFKIYGIADTGSDLTWT-----SCVPCNNCYK---QRN-PMFDPQK 118
Query: 166 SSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
S++ +N+SC LC K + S + C Y Y++ + G L + + L+S
Sbjct: 119 STTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYASAAITR-GVLAQETITLSSTKG--- 174
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
+S ++ GCG TG + D G++GLG G VS+ S + + FS C
Sbjct: 175 KSVPLKGIVFGCGHNNTGGFNDHEM--GIIGLGGGPVSLISQMGSS-FGGKRFSQCLVPF 231
Query: 282 --DENDSGSVFFGDQGPATQQSTSFLPIGEKYD--AYFVGVESYCIGNSCLTQSG----- 332
D + S + FG + + P+ K D YFV + + N+ L +G
Sbjct: 232 HTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNV 291
Query: 333 --FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 368
+DSG T LPT++Y +VV + V+ K ++
Sbjct: 292 EKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVT 329
>gi|348690234|gb|EGZ30048.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
Length = 654
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/304 (26%), Positives = 134/304 (44%), Gaps = 39/304 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
HYTW+ GTP V D GS L+ PC C C + + + +SS+
Sbjct: 65 HYTWVYAGTPPQRASVIADTGSGLMAFPCSGCDGCGSHTDQPFQA----------DNSST 114
Query: 169 SKNVSC----SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL---ASFSK 221
+V+C SH CK C D C Y E +S +V+D+++L +SF
Sbjct: 115 LIHVTCSQQQSHFQCK---ECTEKSDTCAISQSY-MEGSSWKASVVEDVVYLGGESSFHD 170
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 280
A + + GC +TG ++ A DG+MGL D + + L + I N FS+C
Sbjct: 171 EAMRDRYGTHFQFGCQSSETGLFVTQVA-DGIMGLSNSDTHIVAKLHRENKIPSNLFSLC 229
Query: 281 FDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQ-----S 331
F EN G++ G+ A + S+ + + A Y V ++ IG + +
Sbjct: 230 FTEN-GGTMSVGEPNTKAHRGEISYAKVIKDRSAGHFYNVNMKDIRIGGKSINAKEEAYT 288
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
+VDSG + ++LP + E + F ++ R G S C+ ++E++ +P +
Sbjct: 289 RGHYIVDSGTTDSYLPRAMKNEFLQVFKEVAG--RDYQVGTS---CHGYTNEDLASLPKI 343
Query: 392 RLIF 395
+L+
Sbjct: 344 QLVM 347
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 94/213 (44%), Gaps = 22/213 (10%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
+YT++ IGTP + LD GS L PC C +C P + P SS+
Sbjct: 81 YYTYLTIGTPGQTVSGILDTGSTLPAFPCSGCTRCGP----------SKTGMFKPELSST 130
Query: 169 SKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
S CS C +SC + C Y Y E +S+SG+L +D+L + A
Sbjct: 131 SSTFGCSDARCFCGANSCSCNNEQCGYSIRY-LEGSSTSGFLAEDMLAVGDGGPAA---- 185
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
+ + GC + ++G A DGV G+G S+ L + G+I ++FS+CF G
Sbjct: 186 ---NFVFGCAQSESGLLYSQIA-DGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAPREG 241
Query: 288 SVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVE 319
+ G+ PA + P+ + + + +E
Sbjct: 242 VLLLGNVALPADAPAPVVTPVVGNTNKFNIQIE 274
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 139/330 (42%), Gaps = 71/330 (21%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP V+F V D GS+L+W C C +CA R + P+SSS+ +
Sbjct: 94 LSIGTPPVTFSVLADTGSSLIWTQCAPCTECA----------ARPAPPFQPASSSTFSKL 143
Query: 173 SCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQS 226
C+ LC+ +S C + C Y Y T +GYL + LH+ ASF A
Sbjct: 144 PCASSLCQFLTSPYLTCNATG--CVYYYPYGMGFT--AGYLATETLHVGGASFPGVAFGC 199
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----D 282
S ++ V G + G++GLG + SL+++ G+ FS C D
Sbjct: 200 STENGV--------------GNSSSGIVGLGRSPL---SLVSQVGV--GRFSYCLRSDAD 240
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL----TQSGF 333
DS + FG T + P+ E + Y+V + +G + L T GF
Sbjct: 241 AGDS-PILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGF 299
Query: 334 Q----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY----CYN 379
+VDSG + T+L E YA V F +++ ++ N ++ C++
Sbjct: 300 TRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFD 359
Query: 380 ASSE---EMLKVPDMRLIFSKNQSFVVRNH 406
A++ + VP + L F+ + VR
Sbjct: 360 ATAAGGGSGVPVPTLVLRFAGGAEYAVRRR 389
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/290 (26%), Positives = 124/290 (42%), Gaps = 33/290 (11%)
Query: 78 KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP 137
KL ++ ++ + +FP G + N Y+ H I +G+P + + +D GS+L W+
Sbjct: 75 KLATSVSAFDSSTIFPVRGD---VYPNGLYFTH---IFVGSPPRRYFLDMDTGSDLTWIQ 128
Query: 138 CQ--CIQCAPLSASYYTSLDRNLSEY-DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
C C CA Y NL D +N+ + C++ + C Y
Sbjct: 129 CDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGY--------CETCEQ-CDY 179
Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGV 253
+Y+ + +SS G L D LHL A S + ++ GC Q G L+ A DG+
Sbjct: 180 EIEYA-DHSSSMGVLASDDLHLM----LANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGI 234
Query: 254 MGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPI---- 307
+GL VS+PS LA +I N C D G +F GD +++P+
Sbjct: 235 LGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDF-VPYWGMAWVPMLNSH 293
Query: 308 GEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVV 355
Y + + + S Q G + + D+G+S+T+ P E Y +V
Sbjct: 294 SPNYHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALV 343
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 72/301 (23%), Positives = 126/301 (41%), Gaps = 36/301 (11%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
Y +H + +GTP + LD GS+L+W QCAP + D+ + DP++S
Sbjct: 86 YLVH---LAVGTPPRPVALTLDTGSDLVWT-----QCAPCRDCF----DQGIPLLDPAAS 133
Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
S+ + C P C++ C Y+ Y + + + G + D +
Sbjct: 134 STYAALPCGAPRCRALPFTSCGGRSCVYVYHYG-DKSVTVGKIATDRFTFGDNGRRNGDG 192
Query: 227 SVQSS--VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-- 282
S+ ++ + GCG G + G+ G G G S+PS L SFS CF
Sbjct: 193 SLPATRRLTFGCGHFNKGVFQSNE--TGIAGFGRGRWSLPSQLNA-----TSFSYCFTSM 245
Query: 283 -ENDSGSVFFGDQGPA--------TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QS 331
++ S V G A ++T + YF+ ++ +G + L ++
Sbjct: 246 FDSKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPET 305
Query: 332 GFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
F++ ++DSGAS T LP E+Y V +F V ++G++ C+ + + P
Sbjct: 306 KFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVSALWRRPA 365
Query: 391 M 391
+
Sbjct: 366 V 366
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 126/281 (44%), Gaps = 41/281 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
++IG P+ + + +D GS+L W+ C C+QC YY + NL
Sbjct: 38 LNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRN-NL------------- 83
Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHL--ASFSKHAPQS 226
V C P+C+S S + P DY E SS G LV D +L S +H+P
Sbjct: 84 VPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVTDTFNLNFTSEKRHSPL- 142
Query: 227 SVQSSVIIGCGRKQ--TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ +GCG Q GS+ DGV+GLG G S+ S L+ GL++N C +
Sbjct: 143 -----LALGCGYDQFPGGSH---HPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGH 194
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DSGA 341
G +FFGD + + ++ P+ Y G+ +GF+ L+ DSGA
Sbjct: 195 GGGFLFFGDDLYDSSR-VAWTPMSPDAKHYSPGLAELTFDGK---TTGFKNLLTTFDSGA 250
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNA 380
S+T+L ++ Y ++ K +S K R +L + C+
Sbjct: 251 SYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKG 291
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 85/327 (25%), Positives = 135/327 (41%), Gaps = 36/327 (11%)
Query: 107 YWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y HY + IGTP D GS+L W C P + Y RN +DP
Sbjct: 21 YLGHYLMEVSIGTPPFKIYGIADTGSDLTWT-----SCVPCNKCYK---QRN-PIFDPQK 71
Query: 166 SSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
S+S +N+SC LC K + S + C Y Y++ + G L + + L+S
Sbjct: 72 STSYRNISCDSKLCHKLDTGVCSPQKHCNYTYAYASAAI-TQGVLAQETITLSSTKG--- 127
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
+S ++ GCG TG + D G++GLG G VS S + + FS C
Sbjct: 128 ESVPLKGIVFGCGHNNTGGFNDREM--GIIGLGGGPVSFISQIGSS-FGGKRFSQCLVPF 184
Query: 282 --DENDSGSVFFGDQGPATQQSTSFLPIGEKYD--AYFVGVESYCIGNSCLTQSG----- 332
D + S + G + + P+ K D YFV + +GN+ L +G
Sbjct: 185 HTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQS 244
Query: 333 ---FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKV 388
+DSG T LPT++Y +V + V+ K ++ + + CY ++ L+
Sbjct: 245 VEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYR--TKNNLRG 302
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEV 415
P + F ++ F P++ V
Sbjct: 303 PVLTAHFEGGDVKLLPTQTFVSPKDGV 329
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 89/341 (26%), Positives = 135/341 (39%), Gaps = 51/341 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ T I +GTP F V D GS+L+W IQC P A + ++ +DP SSS
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIW-----IQCKPCQACF----NQKDPIFDPEGSSSY 90
Query: 170 KNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+SC LC S R SC C Y Y + + + G L + + L S +
Sbjct: 91 TTMSCGDTLCDSLPRKSCSP---NCDYSYGYG-DGSGTRGTLSSETVTLTSTQG---EKL 143
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----D 282
++ GCG GS+ D + G++GLG G++S S L L + FS C
Sbjct: 144 AAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDA 198
Query: 283 ENDSGSVFFGDQGPATQQST----SFLPIGEKYDA---YFVGVESYCIGNSCL------- 328
+ + +FFGD+ + +F P+ Y+V ++ I L
Sbjct: 199 PSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSF 258
Query: 329 --TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
G ++ DSG + T LP Y V+ VS I CY+ S +
Sbjct: 259 DIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKA 318
Query: 386 ---LKVPDMRLIF-SKNQSFVVRNHIFSFPENEVGDHACFS 422
K+P M F + V N+ + N+ G C +
Sbjct: 319 SYKKKIPAMVFHFEGADHQLPVENYFIA--ANDAGTIVCLA 357
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 139/311 (44%), Gaps = 43/311 (13%)
Query: 102 FGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLS 159
G L Y + IG+P V+ +++D GS++ WV C+ C QC ++ +D S
Sbjct: 122 LGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQC-------HSEVD---S 171
Query: 160 EYDPSSSSSSKNVSCSHPLC------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 213
+DPS+SS+ SCS C + + C S + C YI Y + +S++G D
Sbjct: 172 LFDPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQ--CQYIVSY-VDGSSTTGTYSSDT 228
Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGL 272
L L S + Q GC + ++G + D DG+MGLG GD SL+++ AG
Sbjct: 229 LTLGSNAIKGFQ--------FGCSQSESGGFSD--QTDGLMGLG-GDAQ--SLVSQTAGT 275
Query: 273 IQNSFSICFDENDSGSVFFGDQGPATQQS---TSFLPIGEKYDAYFVGVESYCIGNSCLT 329
+FS C GS F G A++ T L + Y V +E+ +G L
Sbjct: 276 FGKAFSYCLPPTP-GSSGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLN 334
Query: 330 --QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
S F A ++DSG T LP Y+ + F + + C++ S +
Sbjct: 335 IPTSVFSAGSVMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSS 394
Query: 386 LKVPDMRLIFS 396
+ +P + L+FS
Sbjct: 395 VSIPSVALVFS 405
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 125/290 (43%), Gaps = 33/290 (11%)
Query: 78 KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP 137
KL ++ ++ + +FP G + N Y+ H I +G+P + + +D GS+L W+
Sbjct: 288 KLATSVSAFDSSTIFPVRGD---VYPNGLYFTH---IFVGSPPRRYFLDMDTGSDLTWIQ 341
Query: 138 CQ--CIQCAPLSASYYTSLDRNLSEY-DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
C C CA Y NL D +N+ + C+ +C+ C Y
Sbjct: 342 CDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGY--CE---TCEQ----CDY 392
Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGV 253
+Y+ + +SS G L D LHL A S + ++ GC Q G L+ A DG+
Sbjct: 393 EIEYA-DHSSSMGVLASDDLHLM----LANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGI 447
Query: 254 MGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPI---- 307
+GL VS+PS LA +I N C D G +F GD +++P+
Sbjct: 448 LGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDF-VPYWGMAWVPMLNSH 506
Query: 308 GEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVV 355
Y + + + S Q G + + D+G+S+T+ P E Y +V
Sbjct: 507 SPNYHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALV 556
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/298 (29%), Positives = 131/298 (43%), Gaps = 44/298 (14%)
Query: 127 LDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCK 186
LD GS+L W +QC P + + D YDPS S + K +SC+ C SR
Sbjct: 3 LDTGSSLSW-----LQCQPCAVYCHAQAD---PLYDPSVSKTYKKLSCASVEC-SRLKAA 53
Query: 187 SLKDP--------CPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
+L DP C Y A Y DTS S GYL D+L L S S+ PQ GCG
Sbjct: 54 TLNDPLCETDSNACLYTASYG--DTSFSIGYLSQDLLTLTS-SQTLPQ------FTYGCG 104
Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDENDSGSVFFGDQ-- 294
+ G + A G++GL +S+ + L+ K G ++FS C +SGS G
Sbjct: 105 QDNQGLFGRAA---GIIGLARDKLSMLAQLSTKYG---HAFSYCLPTANSGSSGGGFLSI 158
Query: 295 ---GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG----FQALVDSGASFTFLP 347
P + + T L + YF+ + + + L + L+DSG T LP
Sbjct: 159 GSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLP 218
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
+YA + F K++S+K S C+ S + + VP++++IF +R
Sbjct: 219 MSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLR 276
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 140/332 (42%), Gaps = 70/332 (21%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IGTP+ S + LD GS L W+ C + TS +DPS SSS ++
Sbjct: 84 LPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTS-------FDPSLSSSFSDLP 136
Query: 174 CSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
CSHPLCK R +SC S + C Y Y+ + T + G LV + ++ P
Sbjct: 137 CSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGTFAEGNLVKEKFTFSNSQTTPP-- 192
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN-- 284
+I+GC ++ T G++G+ LG + S +++A + + S+ I N
Sbjct: 193 -----LILGCAKESTDE-------KGILGMNLGRL---SFISQAKISKFSYCIPTRSNRP 237
Query: 285 ---DSGSVFFGDQG-------------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
+GS + GD P +Q+ + P+ AY V ++ IG L
Sbjct: 238 GLASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPL-----AYTVPLQGIRIGQKRL 292
Query: 329 TQSG----------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKY 376
G Q +VDSG+ FT L Y +V + +LV S K+ + G++
Sbjct: 293 NIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADM 352
Query: 377 CY--NASSEEMLKVPDMRLIFSKNQSFVVRNH 406
C+ N S E + D+ F + +V
Sbjct: 353 CFDGNHSMEIGRLIGDLVFEFGRGVEILVEKQ 384
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/336 (24%), Positives = 142/336 (42%), Gaps = 74/336 (22%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYD 162
Y + + GTP + + +D GS+L+W PC C C+ +++ + + + +
Sbjct: 87 YGAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCS------FSTSNPSSNIFI 140
Query: 163 PSSSSSSKNVSCSHPLC------KSRSSCKSLKDPCP--------YIADYSTEDTSSSGY 208
P SSSSSK + C +P C K +S C+ + P Y+ Y + T G
Sbjct: 141 PKSSSSSKVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITG--GI 198
Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
++ + L L K P + I+GC S L + P G+ G G G S+PS L
Sbjct: 199 MLSETLDLPG--KGVP------NFIVGC------SVLSTSQPAGISGFGRGPPSLPSQL- 243
Query: 269 KAGLIQNSFSICF------DENDSGSVFFGDQGPATQQST--SFLPIGEKYDA------- 313
GL FS C D +S S+ + + +++ S+ P +
Sbjct: 244 --GL--KKFSYCLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFS 299
Query: 314 --YFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
Y++G+ +G + ++DSG +FT++ EI+ V +F+K
Sbjct: 300 VYYYLGLRHITVGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQ 359
Query: 362 VSSKRIS-LQG-NSWKYCYNASSEEMLKVPDMRLIF 395
V SKR + ++G + C+N S P++ L F
Sbjct: 360 VQSKRATEVEGITGLRPCFNISGLNTPSFPELTLKF 395
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 78/319 (24%), Positives = 133/319 (41%), Gaps = 37/319 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLD-RNLSEYDPSSSSS 168
++ +GTP+ F++ D GS+L W+ C+ C + S + R+ + + SSS
Sbjct: 83 YFVAFKVGTPSQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRHKRVFHANLSSS 141
Query: 169 SKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFS 220
K + C +CK S ++C + PC Y DY D S++ G+ ++ + +
Sbjct: 142 FKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFFANETVTVE--L 197
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
K + + +V+IGC G A DGVMGLG S + A FS C
Sbjct: 198 KEGRKMKLH-NVLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--IKAAEKFGGKFSYC 252
Query: 281 F-----DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 328
+N S + FG + + + L +G Y V + IG + L
Sbjct: 253 LVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIP 312
Query: 329 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISLQGNSWKYCYNASS 382
+ ++DSG+S TFL Y V+ L+ +++ + +YC+N++
Sbjct: 313 SEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372
Query: 383 EEMLKVPDMRLIFSKNQSF 401
E VP + F+ F
Sbjct: 373 FEESLVPRLVFHFADGAEF 391
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/349 (23%), Positives = 141/349 (40%), Gaps = 59/349 (16%)
Query: 97 SQTHFFGNQFYWLHYTWIDIGTPNVS-FLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSL 154
S +H G Y +H+ IGTP + +D GS+++W C+ C C
Sbjct: 82 SGSHVVGYTEYLIHF---GIGTPRPQQVALEVDTGSDVVWTQCRPCFDC----------F 128
Query: 155 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
+ L +D S+S + V C+ P+C++ C Y +Y +++ + G L D
Sbjct: 129 TQPLPRFDTSASDTVHGVLCTDPICRALRPHACFLGGCTYQVNYG-DNSVTIGQLAKDSF 187
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
+F ++ GCG+ TG++ G+ G G G +S+P L +
Sbjct: 188 ---TFDGKGGGKVTVPDLVFGCGQYNTGNFHSNET--GIAGFGRGPLSLPRQLGVS---- 238
Query: 275 NSFSICFD---ENDSGSVFFGD----------QGPATQQSTSFLPIGEKYDAYFVGVESY 321
SFS CF E+ S VF G GP ST FLP +Y Y++ ++
Sbjct: 239 -SFSYCFTTIFESKSTPVFLGGAPADGLRAHATGPIL--STPFLPNHPEY--YYLSLKGI 293
Query: 322 CIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ- 370
+G + L +S F ++DSG + T P ++ + F V S
Sbjct: 294 TVGKTRLAVPESAFVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYND 353
Query: 371 -GNSWKYCYNASS---EEMLKVPDMRL-IFSKNQSFVVRNHIFSFPENE 414
G C++ S + VP M L + + N++ +P+++
Sbjct: 354 TGEPTLQCFSTESVPDASKVPVPKMTLHLEGADWELPRENYMAEYPDSD 402
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/308 (24%), Positives = 127/308 (41%), Gaps = 46/308 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP F + +D GS+L W+ C C+ C D+ +DP +S+S +NV+C
Sbjct: 156 VGTPPRRFQMIMDTGSDLNWLQCAPCLDC----------FDQRGPVFDPMASTSYRNVTC 205
Query: 175 SHPLC------KSRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
C + +C+S + DPCPY Y + ++ D L + + A S
Sbjct: 206 GDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTG----DLALEAFTVNLTASSSR 261
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
V++GCG + G + A G+ L S L A G ++FS C ++ S
Sbjct: 262 RVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HAFSYCLVDHGSA 316
Query: 288 ---SVFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL-----------T 329
+ FGD T+F P + Y+V ++ +G L
Sbjct: 317 VGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKE 376
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
++DSG + ++ P Y + F D++ + + CYN S E ++V
Sbjct: 377 DGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEV 436
Query: 389 PDMRLIFS 396
P+ L+F+
Sbjct: 437 PEFSLLFA 444
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 148/347 (42%), Gaps = 59/347 (17%)
Query: 102 FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
FG+ Y++ + IG+P + +D GS++ W IQC+P + Y +N + +
Sbjct: 9 FGSGEYFVR---VGIGSPTKLQYLVMDTGSDVPW-----IQCSPCKSCY----KQNDAVF 56
Query: 162 DPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
DP +SSS + +SCS P CK +C S + C Y Y + + + G L D L S
Sbjct: 57 DPRASSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYG-DGSFTVGDLASDSF-LVSR 114
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ +P V+ GCG G ++ A G+ L S PS L+ FS
Sbjct: 115 GRTSP-------VVFGCGHDNEGLFVGAAGLLGLGAGKL---SFPSQLSS-----RKFSY 159
Query: 280 CFDENDSG-----SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLT-- 329
C D+G ++ FGD T S ++ + K D Y+ G+ IG + L+
Sbjct: 160 CLVSRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIP 219
Query: 330 QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKY 376
+ F+ ++DSG S T LPT Y + F KL + SL +
Sbjct: 220 STAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL----FDT 275
Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
CY+ S+ + +P + F S + + P + G CF++
Sbjct: 276 CYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTF-CFAF 321
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 158/380 (41%), Gaps = 79/380 (20%)
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSS-RNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP- 119
+ LLLS R + QS +N+S +N LFP Y + + GTP
Sbjct: 94 INLLLSASLNRAQHLKTPQSKSNTSIQNVSLFPRS-----------YGAYSVSLAFGTPP 142
Query: 120 -NVSFLVALDAGSNLLWVPCQC-IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHP 177
N+SF+ D GS+L+W PC +C+ S Y +S++ P SSS K V C +P
Sbjct: 143 QNLSFI--FDTGSSLVWFPCTAGYRCSRCSFPYVD--PATISKFVPKLSSSVKVVGCRNP 198
Query: 178 LC--------KSR-----SSCKSLKDPCP-YIADYSTEDTSSSGYLVDDILHLASFSKHA 223
C KSR S + D CP Y Y + T +G L+ + L L +K
Sbjct: 199 KCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGAT--AGILLSETLDLE--NKRV 254
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
P ++GC S + P G+ G G G S+PS + S FD+
Sbjct: 255 PD------FLVGC------SVMSVHQPAGIAGFGRGPESLPSQMRLKRFSHCLVSRGFDD 302
Query: 284 NDSGSVFFGDQGPATQQSTS----FLPIGEK--------YDAYFVGVESYCIGNSCL--- 328
+ S D G + +S + + P E + Y++ + IG +
Sbjct: 303 SPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFP 362
Query: 329 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV----SSKRISLQGNSWKY 376
T +G A++DSG++FTFL I+ + + +K + +K + Q + +
Sbjct: 363 YKYLVPDSTGNG-GAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQ-SGLRP 420
Query: 377 CYN-ASSEEMLKVPDMRLIF 395
C+N EE + PD+ L F
Sbjct: 421 CFNIPKEEESAEFPDVVLKF 440
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 134/337 (39%), Gaps = 45/337 (13%)
Query: 81 SNNNSSRNQLLFPSEGSQTHF------------FGNQF-YWLHYTWIDIGTPNVSFLVAL 127
S++ R + +FP + + +GN + +Y + IG P + +
Sbjct: 25 SDHQHKRKKAVFPEPAASSSLINIIQSSVVFPLYGNVYPLGYYYVSLSIGQPPXPYFLDP 84
Query: 128 DAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSS-SSKNVSCSHPLCKSRSS 184
GS+L W+ C C++C Y + + DP + C HP
Sbjct: 85 XTGSDLSWLQCDAPCVRCTKAXHXLYRPNNNLVICKDPMCAXLHPPGYKCEHP------- 137
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQSSVQSSVIIGCGRKQT- 241
+ C Y +Y+ + SS G LV D+ L+ + + AP+ + +GCG Q
Sbjct: 138 -----EQCDYEVEYA-DGGSSLGVLVKDVFPLNFTNGLRLAPR------LALGCGYDQIP 185
Query: 242 -GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
SY DGV+GLG G S+ S L G+I+N C + G +FFGD + +
Sbjct: 186 GXSY---HPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSHGGGFLFFGDDLYDSSR 242
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
+ +++ Y G +G DSG+S+T+L + Y +V K
Sbjct: 243 VVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVRK 302
Query: 361 LVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+S K R +L + C+ V D+R F
Sbjct: 303 ELSEKPVREALDDQTLPLCWRG-KRPFKSVRDVRKFF 338
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 152/372 (40%), Gaps = 65/372 (17%)
Query: 54 PKKNSVE--YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY 111
P ++VE ELL + + + + KL N+ S + + + + G+ L Y
Sbjct: 66 PAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQSAAITLPTTLGSALDTLAY 125
Query: 112 T-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
+ IGTP ++ V +D GS++ WV C A +S + +DP SS+
Sbjct: 126 VITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGA--GSSLF---------FDPGKSSTYT 174
Query: 171 NVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
SCS C + R + SL C Y Y + ++++G D L L S K
Sbjct: 175 PFSCSSAACTRLEGRDNGCSLNSTCQYTVRYG-DGSNTTGTYGSDTLALNSTEK------ 227
Query: 228 VQSSVIIGCGRK-QTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDEND 285
+ GC G LD DG+MGLG G PSL+++ A ++FS C
Sbjct: 228 -VENFQFGCSETSDPGEGLDEDQTDGLMGLGGG---APSLVSQTAATYGSAFSYCL---- 279
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA-----------------YFVGVESYCIGNS-- 326
PAT +S+ FL +G YFV ++ +G
Sbjct: 280 ----------PATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPV 329
Query: 327 CLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
++ + F A ++DSG T LP Y+ + F + + + C++ + ++
Sbjct: 330 AISPTVFAAGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQD 389
Query: 385 MLKVPDMRLIFS 396
+ +P + L+FS
Sbjct: 390 NVSIPAVELVFS 401
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 98/391 (25%), Positives = 155/391 (39%), Gaps = 51/391 (13%)
Query: 27 KLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSS 86
+L HR ++ V AD + VEY++ +S R LQ S
Sbjct: 76 RLAHRCGPSTASASFAE---VQRAD----EQRVEYIQRRVSGGGARGAK-GALQQLATGS 127
Query: 87 RNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPL 146
R+ + + G T + + + +GTP VS V +D GS++ WV QC P
Sbjct: 128 RSATVPTTMGVGT--------FQYVVTVSLGTPGVSQTVEVDTGSDVSWV-----QCKPC 174
Query: 147 SASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTED 202
SA S L +DP+ SS+ V C C + C + C Y+ Y +
Sbjct: 175 SAPACNSQRDQL--FDPAKSSTYSAVPCGADACSELRIYEAGCSGSQ--CGYVVSYG-DG 229
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
++++G D L L AP ++V + + GCG Q G + A DG++ LG +S
Sbjct: 230 SNTTGVYGSDTLAL------APGNTV-GTFLFGCGHAQAGMF---AGIDGLLALGRQSMS 279
Query: 263 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVE 319
+ S AG FS C S + + GP++ +T L Y V +
Sbjct: 280 LKS--QAAGAYGGVFSYCLPSKQSAAGYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLT 337
Query: 320 SYCIGNS--CLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNS- 373
+G + S F +VD+G T LP YA + F ++ S N
Sbjct: 338 GISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAYAALRSAFRGAIAPCGYPSAPANGI 397
Query: 374 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
CY+ S ++ +P + L FS + +
Sbjct: 398 LDTCYDFSRYGVVTLPTVALTFSGGATLALE 428
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/295 (26%), Positives = 129/295 (43%), Gaps = 34/295 (11%)
Query: 121 VSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
VS V +D S++ WV QC+ C P+ + L ++ YDP+ SS+ + C P CK
Sbjct: 167 VSQTVVVDTSSDIPWV--QCLPC-PIPQCH---LQKD-PLYDPAKSSTFAPIPCGSPACK 219
Query: 181 SRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
S C D C YI +Y + +++G V D L ++ + V G
Sbjct: 220 ELGSSYGNGCSPTTDECKYIVNYG-DGKATTGTYVTDTLTMSP-------TIVVKDFRFG 271
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
C GS+ + A G++ LG G S+ L A N+FS C + S F G
Sbjct: 272 CSHAVRGSFSNQNA--GILALGGGRGSL--LEQTADAYGNAFSYCIPKPSSAG-FLSLGG 326
Query: 296 PATQQ-STSFLPIGEKYDA---YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLP 347
P S+ P+ + A Y V +E+ + L T A++DSGA T LP
Sbjct: 327 PVEASLKFSYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLP 386
Query: 348 TEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
++YA + F +++ ++ + CY+ + +KVP + L+F+ +
Sbjct: 387 PQVYAALRAAFRSAMAAYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATL 441
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/246 (27%), Positives = 107/246 (43%), Gaps = 38/246 (15%)
Query: 122 SFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
+F + +D GS+ ++PC+ C C A Y YD +S+ V CS C
Sbjct: 46 TFELIVDTGSSRTYLPCKGCASCGAHEAGRY---------YDYDASADFSRVECS--ACA 94
Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
C Y Y E + S GYLV D++ L S ++V+ GC ++
Sbjct: 95 GIGGKCGTSGVCRYDVHY-LEGSGSEGYLVRDVVSLGG-------SVGNATVVFGCEERE 146
Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS------------GS 288
GS +A DG+ G G ++ + LA A +I + FS+C + + G+
Sbjct: 147 LGSIKQQSA-DGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGGLLTLGN 205
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTFLP 347
FG PA + P+ Y V S+ +GNS + S G ++DSG S+T++P
Sbjct: 206 FDFGADAPA----LVYTPMVSSAMYYQVTTTSWTLGNSVVEGSRGVLTIIDSGTSYTYVP 261
Query: 348 TEIYAE 353
++A
Sbjct: 262 GNMHAR 267
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/322 (27%), Positives = 131/322 (40%), Gaps = 50/322 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVP-CQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP L+ +D S L WV C C+P + ++P SSS + C
Sbjct: 5 IGTPPREVLLLVDTASELTWVQGTSCTNCSPT----------KVPPFNPGLSSSFISEPC 54
Query: 175 SHPLCKSR------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
+ +C R S+C C + Y + + + G + +I L S+ A S
Sbjct: 55 TSSVCLGRSKLGFQSACNRSTGSCSFQVAY-LDGSEAYGVIAREIFSLQSWDGAA---ST 110
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL---AKAGLIQNSFSICFDE-- 283
VI GC K +D ++ G +GL G S P+ + +K+GL + FS CF
Sbjct: 111 LGDVIFGCASKDLQRPVDFSS--GTLGLNRGSFSFPAQIGSRSKSGL-SDRFSYCFPNRA 167
Query: 284 ---NDSGSVFFGDQG-PATQQSTSFL----PIGEKYDAYFVGVESYCIGNSCL--TQSGF 333
N SG + FGD G PA L PI D Y+VG++ +G L +S F
Sbjct: 168 EHLNSSGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAF 227
Query: 334 Q--------ALVDSGASFTFLPTEIYAEVVVKFDKLV-SSKRISLQGNSWKYCYN--ASS 382
+ DSG + +FL + +V F + V R S + + CY+ A
Sbjct: 228 KIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGD 287
Query: 383 EEMLKVPDMRLIFSKNQSFVVR 404
+ P + L F N +R
Sbjct: 288 ARLPTAPLVTLHFKNNVDMELR 309
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 123/305 (40%), Gaps = 45/305 (14%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y +H + +GTP + LD GS+L+W C C+ C A + DP++
Sbjct: 94 YLVH---LSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGA---------IPVLDPAA 141
Query: 166 SSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
SS+ V C P+C++ R + C Y+ Y + + + G L D
Sbjct: 142 SSTHAAVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYG-DKSITVGKLASDRFTFGP 200
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ + GCG G + A G+ G G G S+PS L SFS
Sbjct: 201 GDNADGGGVSERRLTFGCGHFNKGIFQ--ANETGIAGFGRGRWSLPSQLGV-----TSFS 253
Query: 279 ICFD---ENDSGSVFFGDQGPAT------QQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 329
CF E+ S V G PA QST L + YF+ +++ +G + +
Sbjct: 254 YCFTSMFESTSSLVTLG-VAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIP 312
Query: 330 QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
+ A++DSGAS T LP ++Y V +F V +++G++ C+ S
Sbjct: 313 IPERRQRLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPS 372
Query: 383 EEMLK 387
K
Sbjct: 373 AAAPK 377
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 84/347 (24%), Positives = 142/347 (40%), Gaps = 60/347 (17%)
Query: 73 QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSN 132
+ T V + S+ + L P F + + IGTP +++ +D GS+
Sbjct: 67 RATGVPMTSSKAAGGGDLQVPVHAGNGEFLMD---------VSIGTPALAYSAIVDTGSD 117
Query: 133 LLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLK 189
L+W C+ C+ C ++ +DPSSSS+ V CS C S C S
Sbjct: 118 LVWTQCKPCVDC----------FKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTSAS 167
Query: 190 DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA 249
C Y Y + +S+ G L + LA S V+ GCG G A
Sbjct: 168 K-CGYTYTYG-DSSSTQGVLATETFTLA--------KSKLPGVVFGCGDTNEGDGFSQGA 217
Query: 250 PDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQ------- 299
G++GLG G + SL+++ GL + FS C D+ ++ + G ++
Sbjct: 218 --GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASS 270
Query: 300 -QSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQ--------ALVDSGASFTFLPT 348
Q+T + + Y+V +++ +G++ L S F +VDSG S T+L
Sbjct: 271 VQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEV 330
Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ Y + F ++ G C+ A ++ + +V RL+F
Sbjct: 331 QGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVF 377
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 98/391 (25%), Positives = 154/391 (39%), Gaps = 51/391 (13%)
Query: 27 KLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSS 86
+L HR ++ V AD + VEY++ +S R LQ S
Sbjct: 76 RLAHRCGPSTASASFAE---VQRAD----EQRVEYIQRRVSGGGARGAK-GALQQLATGS 127
Query: 87 RNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPL 146
R+ + + G T + + + +GTP VS V +D GS++ WV QC P
Sbjct: 128 RSATVPTTMGVGT--------FQYVVTVSLGTPGVSQTVEVDTGSDVSWV-----QCKPC 174
Query: 147 SASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTED 202
SA S L +DP+ SS+ V C C + C + C Y+ Y +
Sbjct: 175 SAPACNSQRDQL--FDPAKSSTYSAVPCGADACSELRIYEAGCSGSQ--CGYVVSYG-DG 229
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
++++G D L L AP ++V + + GCG Q G + A DG++ LG +S
Sbjct: 230 SNTTGVYGSDTLAL------APGNTV-GTFLFGCGHAQAGMF---AGIDGLLALGRQSMS 279
Query: 263 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVE 319
+ S AG FS C S + + GP + +T L Y V +
Sbjct: 280 LKS--QAAGAYGGVFSYCLPSKQSAAGYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLT 337
Query: 320 SYCIGNS--CLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNS- 373
+G + S F +VD+G T LP YA + F ++ S N
Sbjct: 338 GISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAYAALRSAFRGAIAPYGYPSAPANGI 397
Query: 374 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
CY+ S ++ +P + L FS + +
Sbjct: 398 LDTCYDFSRYGVVTLPTVALTFSGGATLALE 428
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 84/347 (24%), Positives = 142/347 (40%), Gaps = 60/347 (17%)
Query: 73 QKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSN 132
+ T V + S+ + L P F + + IGTP +++ +D GS+
Sbjct: 77 RATGVPMTSSKAAGGGDLQVPVHAGNGEFLMD---------VSIGTPALAYSAIVDTGSD 127
Query: 133 LLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLK 189
L+W C+ C+ C ++ +DPSSSS+ V CS C S C S
Sbjct: 128 LVWTQCKPCVDC----------FKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTSAS 177
Query: 190 DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA 249
C Y Y + +S+ G L + LA S V+ GCG G A
Sbjct: 178 K-CGYTYTYG-DSSSTQGVLATETFTLA--------KSKLPGVVFGCGDTNEGDGFSQGA 227
Query: 250 PDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQ------- 299
G++GLG G + SL+++ GL + FS C D+ ++ + G ++
Sbjct: 228 --GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASS 280
Query: 300 -QSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQ--------ALVDSGASFTFLPT 348
Q+T + + Y+V +++ +G++ L S F +VDSG S T+L
Sbjct: 281 VQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEV 340
Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ Y + F ++ G C+ A ++ + +V RL+F
Sbjct: 341 QGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVF 387
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 84/321 (26%), Positives = 131/321 (40%), Gaps = 46/321 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y I +GTP F + +D GS+L W +QC P + +D + PS+S +
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSW-----LQCQPCVIYCHVQVD---PIFTPSTSKTY 164
Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSK 221
K + CS C + C + C Y A Y DTS S GYL D+L L
Sbjct: 165 KALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYG--DTSFSIGYLSQDVLTL----- 217
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
P + S + GCG+ G + G++GL +S+ L+K N+FS C
Sbjct: 218 -TPSEAPSSGFVYGCGQDNQGLF---GRSSGIIGLANDKISMLGQLSKK--YGNAFSYCL 271
Query: 282 DEND--------SGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQ 330
+ SG + G T F P+ + YF+ + + + L
Sbjct: 272 PSSFSAPNSSSLSGFLSIGASS-LTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGV 330
Query: 331 SG----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEM 385
S ++DSG T LP +Y + F ++S K G S C+ S +EM
Sbjct: 331 SASSYNVPTIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEM 390
Query: 386 LKVPDMRLIFSKNQSFVVRNH 406
VP++++IF ++ H
Sbjct: 391 STVPEIQIIFRGGAGLELKAH 411
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 149/347 (42%), Gaps = 59/347 (17%)
Query: 102 FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
FG+ Y++ + IG+P + +D GS++ W IQC+P + Y +N + +
Sbjct: 9 FGSGEYFVR---VGIGSPTKLQYLVMDTGSDVPW-----IQCSPCKSCY----KQNDAVF 56
Query: 162 DPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
DP +SSS + +SCS P CK +C S + C Y Y + + + G L D SF
Sbjct: 57 DPRASSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYG-DGSFTVGDLASD-----SF 110
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
S ++ S V+ GCG G ++ A G+ L S PS L+ FS
Sbjct: 111 SVSRGRT---SPVVFGCGHDNEGLFVGAAGLLGLGAGKL---SFPSQLSS-----RKFSY 159
Query: 280 CFDENDSG-----SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLT-- 329
C D+G ++ FGD T S ++ + K D Y+ G+ IG + L+
Sbjct: 160 CLVSRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIP 219
Query: 330 QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKY 376
+ F+ ++DSG S T LPT Y + F KL + SL +
Sbjct: 220 STAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL----FDT 275
Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
CY+ S+ + +P + F S + + P + G CF++
Sbjct: 276 CYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTF-CFAF 321
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 88/346 (25%), Positives = 139/346 (40%), Gaps = 58/346 (16%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
Y +H + IGTP + LD GS+L+W QC P A + D+ L +DPS+S
Sbjct: 35 YLVH---LAIGTPPQPVQLTLDTGSDLIWT-----QCQPCPACF----DQALPYFDPSTS 82
Query: 167 SSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
S+ SC LC+ +SC S K C Y Y + + ++G+L D
Sbjct: 83 STLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGFLEVDKFTFVGAG 141
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
P V GCG G + G+ G G G +S+PS L K G +FS C
Sbjct: 142 ASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHC 188
Query: 281 FDE-----------NDSGSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIGNS 326
F + +F QG Q+T + + Y++ ++ +G++
Sbjct: 189 FTTITGAIPSTVLLDLPADLFSNGQG--AVQTTPLIQYAKNEANPTLYYLSLKGITVGST 246
Query: 327 ---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 377
LT ++DSG S T LP ++Y V +F + + C
Sbjct: 247 RLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTC 306
Query: 378 YNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEVGDHACFS 422
++A S+ VP + L F + R N++F P++ C +
Sbjct: 307 FSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLA 352
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 71/276 (25%), Positives = 108/276 (39%), Gaps = 45/276 (16%)
Query: 104 NQFYWLHYTWIDIGTPNVSFLV-ALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEY 161
N Y +H + IG P +V LD GS+++W C+ C +C + L +
Sbjct: 89 NSEYLIH---LSIGAPRSQPVVLTLDTGSDVVWTQCEPCAEC----------FTQPLPRF 135
Query: 162 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
D ++S++ ++V+CS PLC + S C Y++ Y S +L D +F
Sbjct: 136 DTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSF----TFDD 191
Query: 222 HAPQSSVQSSVI-IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
V I GCG G +L G+ G G G +S+PS L FS C
Sbjct: 192 GKGGGKVTVPDIGFGCGMYNAGRFLQ--TETGIAGFGRGPLSLPSQLK-----VRQFSYC 244
Query: 281 FD---ENDSGSVFFGDQGPATQQSTS---------FLPIGEKYDAYFVGVESYCIGNSCL 328
F E S VF G G +T LP G Y + + +G + L
Sbjct: 245 FTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRL 304
Query: 329 TQSGFQA------LVDSGASFTFLPTEIYAEVVVKF 358
+A +DSG T P ++ ++ F
Sbjct: 305 PVPEIKADGSGATFIDSGTDITTFPDAVFRQLKSAF 340
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 140/341 (41%), Gaps = 41/341 (12%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y + Y+ +GTP L +D GS + W+ CQ C C ++ +DPS
Sbjct: 97 YLMSYS---VGTPPFEILGVVDTGSGITWMQCQRCEDC----------YEQTTPIFDPSK 143
Query: 166 SSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
S + K + CS +C+S SC S K C Y Y + + S G L + L L S +
Sbjct: 144 SKTYKTLPCSSNMCQSVISTPSCSSDKIGCKYTIKYG-DGSHSQGDLSVETLTLGSTNG- 201
Query: 223 APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
SSVQ + +IGCG G++ + +G G + + G + F
Sbjct: 202 ---SSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMF 258
Query: 282 DENDSGSVF-FGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL--------- 328
+++S S FGD + P+ K + Y++ +E++ +G+ +
Sbjct: 259 SQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSS 318
Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
+ ++DSG + T LP E Y+ + + + R+S N CY + L
Sbjct: 319 GSSNGEGNIIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTTPSGQL 378
Query: 387 KVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLE 427
VP + F V N I +F + G CF++ + E
Sbjct: 379 DVPVITAHFKGAD--VELNPISTFVQVAEG-VVCFAFHSSE 416
>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 138/340 (40%), Gaps = 75/340 (22%)
Query: 99 THFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL 158
T + GNQ P + +D GS++ W
Sbjct: 113 TFYLGNQ------------RPEDNISAVVDTGSDIFWT---------------------- 138
Query: 159 SEYDPSSSSSSKNVSCSHPLCKSRSSC----------KSLKDPCPYIADYS-TEDTSSSG 207
+E + S S + + C P C+ R+SC + C Y Y + S++G
Sbjct: 139 TEKECSRSKTRSMLPCCSPKCEQRASCGCGRSELKAEAEKETKCTYAIIYGGNANDSTAG 198
Query: 208 YLVDDILHLASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 266
+ +D L + + SK P S V IGC T + D + GV GLG S+P
Sbjct: 199 VMYEDKLTIVAVASKAVPSSQSFKEVAIGCSTSATLKFKDPSI-KGVFGLGRSATSLPRQ 257
Query: 267 LAKAGLIQNSFSIC---FDENDSGSVFFGDQGP----------ATQQSTSFLPIGEKYDA 313
L + FS C + E D S P A +T+ P +
Sbjct: 258 LNFS-----KFSYCLSSYQEPDLPSYLLLTAAPDMATGAVGGGAAVATTALQPNSDYKTL 312
Query: 314 YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 369
YFV +++ IG + T+SG VD+GASFT L ++A++V + D+++ ++
Sbjct: 313 YFVHLQNISIGGTRFPAVSTKSGGNMFVDTGASFTRLEGTVFAKLVTELDRIMKERKYVK 372
Query: 370 Q---GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVV 403
+ N+ + CY + +++E K+PDM L F+ + + V+
Sbjct: 373 EQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVL 412
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 131/313 (41%), Gaps = 37/313 (11%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLD-RNLSEYDPSSSSSSKNVSC 174
+GTP+ F++ D GS+L W+ C+ C + S + R+ + + SSS K + C
Sbjct: 18 VGTPSQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPC 76
Query: 175 SHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSKHAPQS 226
+CK S ++C + PC Y DY D S++ G+ ++ + + K +
Sbjct: 77 LTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFFANETVTVE--LKEGRKM 132
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----- 281
+ +V+IGC G A DGVMGLG S + A FS C
Sbjct: 133 KLH-NVLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--IKAAEKFGGKFSYCLVDHLS 187
Query: 282 DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------T 329
+N S + FG + + + L +G Y V + IG + L
Sbjct: 188 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDV 247
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISLQGNSWKYCYNASSEEMLKV 388
+ ++DSG+S TFL Y V+ L+ +++ + +YC+N++ E V
Sbjct: 248 KGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLV 307
Query: 389 PDMRLIFSKNQSF 401
P + F+ F
Sbjct: 308 PRLVFHFADGAEF 320
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 122/300 (40%), Gaps = 49/300 (16%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ GTP L+ D GS+L+W+ C P R + S S++ V
Sbjct: 57 MAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATLSVVP 114
Query: 174 CSHPLC--------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
CS C + + PC Y DY+ + +S++G+L D A+ S
Sbjct: 115 CSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYA-DGSSTTGFLARDT---ATISNGTSG 170
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDEN 284
+ V GCG + G G GV+GLG G +S P A++G L +FS C +
Sbjct: 171 GAAVRGVAFGCGTRNQGGSFSGTG--GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDL 225
Query: 285 DSGS-------VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQ 334
+ G +F G P + + ++ P+ A Y+VGV + +GN L G +
Sbjct: 226 EGGRRGRSSSFLFLGR--PERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSE 283
Query: 335 ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-----LQGNSWKYCYN 379
++DSG++ T+L Y +V F V RI QG + CYN
Sbjct: 284 WAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQG--LELCYN 341
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 120/270 (44%), Gaps = 49/270 (18%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP+ ++ +D GS+L+W +QC+P Y + +DP SS+
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVW-----LQCSPCRRCYA----QRGQVFDPRRSSTY 136
Query: 170 KNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
+ V CS P C++ S + C Y+ Y + +SS+G L D L A+
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYG-DGSSSTGDLATDKLAFAN------ 189
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ ++V +GCGR G + D AA G++G+G G +S+ + +A A + F C +
Sbjct: 190 -DTYVNNVTLGCGRDNEGLF-DSAA--GLLGVGRGKISISTQVAPA--YGSVFEYCLGDR 243
Query: 285 DSGS------VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ---- 334
S S VF P + T+ L + Y+V + + +G +T GF
Sbjct: 244 TSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVT--GFSNASL 301
Query: 335 ----------ALVDSGASFTFLPTEIYAEV 354
+VDSG + + + YA +
Sbjct: 302 ALDTATGRGGVVVDSGTAISRFARDAYAAL 331
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 131/313 (41%), Gaps = 37/313 (11%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLD-RNLSEYDPSSSSSSKNVSC 174
+GTP+ F++ D GS+L W+ C+ C + S + R+ + + SSS K + C
Sbjct: 89 VGTPSQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPC 147
Query: 175 SHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSKHAPQS 226
+CK S ++C + PC Y DY D S++ G+ ++ + + K +
Sbjct: 148 LTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFFANETVTVE--LKEGRKM 203
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----- 281
+ +V+IGC G A DGVMGLG S + A FS C
Sbjct: 204 KLH-NVLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--IKAAEKFGGKFSYCLVDHLS 258
Query: 282 DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------T 329
+N S + FG + + + L +G Y V + IG + L
Sbjct: 259 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDV 318
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISLQGNSWKYCYNASSEEMLKV 388
+ ++DSG+S TFL Y V+ L+ +++ + +YC+N++ E V
Sbjct: 319 KGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLV 378
Query: 389 PDMRLIFSKNQSF 401
P + F+ F
Sbjct: 379 PRLVFHFADGAEF 391
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 85/175 (48%), Gaps = 20/175 (11%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
Y + T + IGTP F + +D GSN+ +VPC C S Y + + SS
Sbjct: 47 YGYYATKLYIGTPPQEFTLVVDTGSNMTFVPC----CG--SEEYCGKHED--PAFQTESS 98
Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
S+ + V+C HP C C L+ C Y Y + + S G L +DI+ + S+ APQ
Sbjct: 99 STYQPVNC-HPSCD----CDYLRSQCSYKMHYG-DGSYSRGVLAEDIISFGNESEFAPQR 152
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
++ GC GS L DG++GLG G ++ L G+I +SFS+C+
Sbjct: 153 -----LVFGCELDAIGS-LYSLRADGIIGLGRGRSTIVDQLVDKGVISDSFSLCY 201
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 34/294 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP F + D GS++ W QC P + Y + L +PS+S+S KN+S
Sbjct: 123 VGLGTPKKEFTLIFDTGSDITWT-----QCEPCVKTCYKQKEPRL---NPSTSTSYKNIS 174
Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
CS LCK +S K C Y Y + + S G+ + L L+S S+V
Sbjct: 175 CSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTLSS-------SNV 226
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
+ + GCG++ G + A G+ L ++PS AK + FS C + S
Sbjct: 227 FKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKLFSYCLPASSSSK 281
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA--LVDSGA 341
+ G +S F P+ +D+ Y + + +G L+ +S F A ++DSG
Sbjct: 282 GYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGT 340
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T L Y+E+ F L++ + + + CY+ S + +++P + + F
Sbjct: 341 VITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTF 394
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 95/413 (23%), Positives = 170/413 (41%), Gaps = 83/413 (20%)
Query: 5 VAICML-FGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLE 63
+A+C+ FGCI + F+++LVHR S ++ P NS +
Sbjct: 14 IALCVASFGCIY---AHNAGFTTELVHRDSPKS-----------------PLYNSQQTHL 53
Query: 64 LLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
+ +R +RV ++ R + ++ N +L + +GTP
Sbjct: 54 QRWNKAMRRSVSRV-----HHFQRTAATVSPKEVESEIIANGGEYL--MSLSLGTPPFEI 106
Query: 124 LVALDAGSNLLWVPCQ-CIQC----APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
L D GS+L+W C C +C APL +DP SS + +++SC
Sbjct: 107 LAIADTGSDLIWTQCTPCDKCYKQIAPL--------------FDPKSSKTYRDLSCDTRQ 152
Query: 179 CKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK---HAPQSSVQSSV 232
C++ SSC S + C Y + Y + + ++G L D + L S + + P++
Sbjct: 153 CQNLGESSSCSS-EQLCQY-SYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKT------ 204
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------DENDS 286
+IGCGR+ G++ G++GLG G +S+ S + + + FS C +S
Sbjct: 205 VIGCGRRNNGTF--DKKDSGIIGLGGGPMSLISQMGSS--VGGKFSYCLVPFSSESAGNS 260
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQA-------LV 337
+ FG + P+ K Y++ +E+ +G+ + G ++
Sbjct: 261 SKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEGNIII 320
Query: 338 DSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSEEMLKVP 389
DSG S T P + E + +++ +R +CY + + LKVP
Sbjct: 321 DSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTPD--LKVP 371
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 104/403 (25%), Positives = 164/403 (40%), Gaps = 53/403 (13%)
Query: 27 KLVHRFSDEAKERWISKSGNVSVADSW-PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNS 85
+L HR A R S + SVAD+ + EY+ +R R ++ +
Sbjct: 69 RLTHRHGPCAPSRASSLAAP-SVADTLRADQRRAEYI-------LRRVSGRAPQLWDSKA 120
Query: 86 SRNQLLFPSEGSQTHFFGNQFYWLHYTWI-DIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
+ P+ +G L+Y +GTP V+ + +D GS+L WV C+ A
Sbjct: 121 AAAAATVPAS------WGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAA 174
Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS---SCKSLKDPCPYIADYSTE 201
P S Y+ D +DP+ SSS V C P+C + C Y+ Y +
Sbjct: 175 P---SCYSQKD---PLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYG-D 227
Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
++++G D L L++ SS GCG Q+G + +G DG++GLG
Sbjct: 228 GSNTTGVYSSDTLTLSA-------SSAVQGFFFGCGHAQSGLF-NGV--DGLLGLGR--- 274
Query: 262 SVPSLLAK-AGLIQNSFSICFDENDS--GSVFFGDQGPATQ----QSTSFLPIGEKYDAY 314
PSL+ + AG FS C S G + G GP+ +T LP Y
Sbjct: 275 EQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYY 334
Query: 315 FVGVESYCIGNSCLT--QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRIS 368
V + +G L+ S F +VD+G T LP YA + F ++S +
Sbjct: 335 VVMLTGISVGGQQLSVPASAFAGGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTA 394
Query: 369 LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSF 410
CYN + + +P++ L F + ++ + I SF
Sbjct: 395 PSNGILDTCYNFAGYGTVTLPNVALTFGSGATVMLGADGILSF 437
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 73/273 (26%), Positives = 117/273 (42%), Gaps = 26/273 (9%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP+V L D GS+L W+ C C C P A +DP+ SS+ +V C
Sbjct: 94 LGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPL----------FDPTQSSTYVDVPC 143
Query: 175 SHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
C +++ C S K C Y+ Y T D+ + G L D + +S ++
Sbjct: 144 ESQPCTLFPQNQRECGSSKQ-CIYLHQYGT-DSFTIGRLGYDTISFSSTGMGQGGATFPK 201
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSG 287
SV GC ++ +G +GLG G +S+ S L I + FS C F +G
Sbjct: 202 SV-FGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTSTG 258
Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIGN-SCLT-QSGFQALVDSGASFT 344
+ FG P + ++ I Y +Y+V +E +G LT Q G ++DS T
Sbjct: 259 KLKFGSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILT 318
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 377
L IY + + + ++ + ++YC
Sbjct: 319 HLEQGIYTDFISSVKEAINVEVAEDAPTPFEYC 351
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 130/302 (43%), Gaps = 47/302 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +GTP +V +D GS+L W IQ P A + ++ +DPS SS+ ++
Sbjct: 29 IYLGTPPQKAVVIIDTGSDLTW-----IQSEPCRACF----EQADPIFDPSKSSTYNKIA 79
Query: 174 CSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH--APQSSV 228
CS C +C + + C Y Y + + + GY FSK +
Sbjct: 80 CSSSACADLLGTQTCSAAAN-CIYAYGYG-DGSVTRGY----------FSKETITATDTA 127
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----- 283
V G TG++ D +G++GLG G VS+PS L ++ N FS C +
Sbjct: 128 GEEVKFGASVYNTGTFGDTGG-EGILGLGQGPVSMPSQLGS--VLGNKFSYCLVDWLSAG 184
Query: 284 NDSGSVFFGDQG-PATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ----- 334
+++ +++FGD P+ + Q T +P + Y++ V+ +G S L QS ++
Sbjct: 185 SETSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGG 244
Query: 335 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
++DSG + T+L E++ +V + V + C+N P M
Sbjct: 245 SGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTT-SATGLDLCFNTRGTGSPVFPAM 303
Query: 392 RL 393
+
Sbjct: 304 TI 305
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 91/351 (25%), Positives = 148/351 (42%), Gaps = 55/351 (15%)
Query: 61 YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHY-TWIDIGTP 119
Y++ S D K+ Q ++ + P+ G L Y + +G+P
Sbjct: 92 YIKRKFSGDVKKDG-----QGAGGVEQSHVTVPTT------LGTSLNTLEYLITVRLGSP 140
Query: 120 NVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
+ V +D+GS++ WV C+ C+QC ++ +D +DPS SS+ SCS
Sbjct: 141 AKTQTVLIDSGSDVSWVQCKPCLQC-------HSQVD---PLFDPSLSSTYSPFSCSSAA 190
Query: 179 C----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
C + + C S C YI Y+ + +S++G D L L S + S+
Sbjct: 191 CAQLGQDGNGCSSSSQ-CQYIVRYA-DGSSTTGTYSSDTLALGS--------NTISNFQF 240
Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGSVFF-- 291
GC ++G + D DG+MGLG G PSL ++ AG +FS C S S F
Sbjct: 241 GCSHVESG-FND--LTDGLMGLGGG---APSLASQTAGTFGTAFSYCLPPTPSSSGFLTL 294
Query: 292 --GDQG----PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTF 345
G G P + S G + +A VG I S + ++DSG T
Sbjct: 295 GAGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAG---MVMDSGTIITR 351
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
LP Y+ + F + R + + C++ S + +++P + L+FS
Sbjct: 352 LPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVALVFS 402
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 131/312 (41%), Gaps = 43/312 (13%)
Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
GN Y + T +G+P SF V +D GS+L WV QC P Y + ++D
Sbjct: 35 GNGEYLMTLT---LGSPPQSFDVIVDTGSDLNWV-----QCLPCRVCY----QQPGPKFD 82
Query: 163 PSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
PS S S + +C+ LC + +C + + C Y Y + ++ + I S
Sbjct: 83 PSKSRSFRKAACTDNLCNVSALPLKACAA--NVCQYQYTYGDQSNTNGDLAFETI----S 136
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ A SV + GCG + G++ A G++GLG G +S+ S L+ N FS
Sbjct: 137 LNNGAGTQSV-PNFAFGCGTQNLGTFAGAA---GLVGLGQGPLSLNSQLSHT--FANKFS 190
Query: 279 ICFDENDSGS---VFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLT----- 329
C +S S + FG A Q TS + Y+V + S +G L
Sbjct: 191 YCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSV 250
Query: 330 ----QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
QS + ++DSG + T L Y+ V+ ++ V+ R+ C+N +
Sbjct: 251 FAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFNIAGV 310
Query: 384 EMLKVPDMRLIF 395
VPDM F
Sbjct: 311 SNPSVPDMVFKF 322
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 81/304 (26%), Positives = 126/304 (41%), Gaps = 33/304 (10%)
Query: 101 FFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
F G+ Y + + GTP + V D GS++ W +QC P + Y +
Sbjct: 10 FIGSGNYVIT---VGFGTPTRTQTVVFDTGSDVNW-----LQCKPCAVRCYAQQE---PL 58
Query: 161 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
+DPS SS+ +NVSC+ P C S+ C Y Y + +S+ G+L D L
Sbjct: 59 FDPSLSSTYRNVSCTEPACVGLSTRGCSSSTCLYGVFYG-DGSSTIGFLAMDTFMLTPAQ 117
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL-GLGDVSVPSLLAK-AGLIQNSFS 278
K + I GCG+ TG + G GL GLG S SL ++ A + N FS
Sbjct: 118 KF-------KNFIFGCGQNNTGLF------QGTAGLVGLGRSSTYSLNSQVAPSLGNVFS 164
Query: 279 ICFDENDSGSVFFGDQGPA-TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA 335
C S + + P T T+ L YF+ + +G + L+ S FQ+
Sbjct: 165 YCLPSTSSATGYLNIGNPQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQS 224
Query: 336 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++DSG T LP Y+ + ++ ++ CY+ S + P +
Sbjct: 225 VGTIIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIV 284
Query: 393 LIFS 396
L F+
Sbjct: 285 LHFA 288
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 157/358 (43%), Gaps = 40/358 (11%)
Query: 61 YLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW-IDIGTP 119
+ E+L + + R K+ +++N + + + +G +Y + +GTP
Sbjct: 95 HTEILRRDQDRVDAIRRKVTASSNKPKGGVSLLAN------WGKSLSTTNYVASLRLGTP 148
Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
+V LD GS+ WV QC P + Y ++ +DP++SS+ V C C
Sbjct: 149 ATELVVELDTGSDQSWV-----QCKPCADCY----EQRDPVFDPTASSTYSAVPCGAREC 199
Query: 180 KSRSSCKSLKDP-------CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
+ +S S ++ CPY Y +D+ + G L D L L+ +P +V
Sbjct: 200 QELASSSSSRNCSSDNNKNCPYEVSYD-DDSHTVGDLARDTLTLSPSPSPSPADTVPG-F 257
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
+ GCG G++ + DG++GLGLG S+PS + A +FS C + S + +
Sbjct: 258 VFGCGHSNAGTFGE---VDGLLGLGLGKASLPSQV--AARYGAAFSYCLPSSPSAAGYLS 312
Query: 293 DQGPATQQSTSF--LPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGASFT 344
G A + + F + G+ +Y++ + + + + ++DSG +F+
Sbjct: 313 FGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIIDSGTAFS 372
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNASSEEMLKVPDMRLIFSKNQS 400
LP YA + F + R +S + CY+ + E +++P + L+F+ +
Sbjct: 373 RLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVELVFADGAT 430
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 138/347 (39%), Gaps = 61/347 (17%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y +H + IGTP + LD GS+L+W C+ C+ C D+ L +D S
Sbjct: 35 YLVH---LAIGTPPQPVQLTLDTGSDLIWTQCKPCVSC----------FDQPLPYFDTSR 81
Query: 166 SSSSKNVSCSHPLCK---SRSSCKSLK---DPCPYIADYSTEDTSSSGYLVDDILHLASF 219
SS++ + C CK + + C L C Y Y +++ + G L D +
Sbjct: 82 SSTNALLPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSYG-DNSVTIGLLAADKFTFVA- 139
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ V GCG TG + + G+ G G G +S+PS L K G +FS
Sbjct: 140 ------GTSLPGVTFGCGLNNTGVF--NSNETGIAGFGRGPLSLPSQL-KVG----NFSH 186
Query: 280 CFDE-----------NDSGSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIGN 325
CF + +F QG Q+T + + Y++ ++ +G+
Sbjct: 187 CFTTITGAIPSTVLLDLPADLFSNGQGAV--QTTPLIQYAKNEANPTLYYLSLKGITVGS 244
Query: 326 S---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 376
+ LT ++DSG S T LP ++Y V +F + +
Sbjct: 245 TRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYT 304
Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEVGDHACFS 422
C++A S+ VP + L F + R N++F P++ C +
Sbjct: 305 CFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLA 351
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 127/295 (43%), Gaps = 36/295 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + LD GS+L+W QCAP + D++L DP++SS+ +
Sbjct: 88 LAVGTPRRPVALTLDTGSDLVWT-----QCAPCRDCF----DQDLPVLDPAASSTYAALP 138
Query: 174 CSHPLCKSR--SSC--KSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSV 228
C C++ +SC ++L + I Y D S + G + D S + +S
Sbjct: 139 CGAARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGD-SGGSGESLH 197
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD---END 285
+ GCG G + G+ G G G S+PS L SFS CF E+
Sbjct: 198 TRRLTFGCGHLNKGVFQSNET--GIAGFGRGRWSLPSQLNV-----TSFSYCFTSMFESK 250
Query: 286 SGSVFFGDQGPA--------TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 335
S V G A ++T L + YF+ ++ +G + L ++ F++
Sbjct: 251 SSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRS 310
Query: 336 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
++DSGAS T LP E+Y V +F V ++G++ C+ + + P
Sbjct: 311 TIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRP 365
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 102/408 (25%), Positives = 166/408 (40%), Gaps = 72/408 (17%)
Query: 69 DWKRQ-KTRVKLQSNNNSSR----NQLLFPSEGSQTHFF---------GNQFYWLHY-TW 113
DW++ + R+ L + N +S +FP QTH G + L+Y
Sbjct: 92 DWEKIFQNRIILDAINVNSLFSHFKSAIFPG---QTHQLSDSQIPISSGARLQTLNYIVT 148
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IG N + +V D GS+L WV QC P Y ++ ++PS+SSS ++
Sbjct: 149 VGIGGQNSTLIV--DTGSDLTWV-----QCLPCRLCY----NQQEPLFNPSNSSSFLSLP 197
Query: 174 CSHPLC-------KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
C+ P C S C + C Y DY + + S G L F K
Sbjct: 198 CNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYG-DGSYSRGEL--------GFEKLTLG 248
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN- 284
+ + I GCGR G + G+MGL ++S+ S + L + FS C
Sbjct: 249 KTEIDNFIFGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSLFGSVFSYCLPTTG 303
Query: 285 --DSGSVFFGDQGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCL------ 328
SGS+ G + ++ S PI + + YF+ + IG L
Sbjct: 304 VGSSGSLTLGGADFSNFKNIS--PISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLS 361
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
+ G +L+DSG T L IY +F+K S R + + C+N + E + +
Sbjct: 362 SNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNI 421
Query: 389 PDMRLIFSKNQSFVVR-NHIFSFPENEVGDHACFSYFTLEYNFTGILI 435
P ++ IF N +V +F F +++ C ++ +L Y ++I
Sbjct: 422 PTVKFIFEGNAEMIVDVEGVFYFVKSDA-SQICLAFASLGYEDQTMII 468
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 122/294 (41%), Gaps = 28/294 (9%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I IGTP+V L D GS+L WV QC+P + +N YDP +SS+ +
Sbjct: 100 IYIGTPSVERLAIADTGSDLTWV-----QCSPCDNT--KCFAQNTPLYDPLNSSTFTLLP 152
Query: 174 CSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
C C S+ C D C Y Y +++ S G L D + L H
Sbjct: 153 CDSQPCTQLPYSQYVCSDYGD-CIYAYTYG-DNSYSYGGLSSDSIRLMLLQLH-----YN 205
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDS 286
S + GCG + + G++GLG G +S+ S L I + FS C F N +
Sbjct: 206 SKICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFSSNSN 263
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLT--QSGFQALVDSGAS 342
+ FG+ P+ K D Y++ +E +G + Q+ ++DSG++
Sbjct: 264 SKLKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGST 323
Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
T+L Y E V + V+ + + +C+ E M PD+ F+
Sbjct: 324 LTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTY-KEGMSTPPDVVFHFT 376
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 34/294 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP F + D GS++ W QC P + Y + L +PS+S+S KN+S
Sbjct: 75 VGLGTPKKEFTLIFDTGSDITWT-----QCEPCVKTCYKQKEPRL---NPSTSTSYKNIS 126
Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
CS LCK +S K C Y Y + + S G+ + L L+S S+V
Sbjct: 127 CSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTLSS-------SNV 178
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
+ + GCG++ G + A G+ L ++PS AK + FS C + S
Sbjct: 179 FKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKLFSYCLPASSSSK 233
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA--LVDSGA 341
+ G +S F P+ +D+ Y + + +G L+ +S F A ++DSG
Sbjct: 234 GYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSGT 292
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T L Y+E+ F L++ + + + CY+ S + +++P + + F
Sbjct: 293 VITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTF 346
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 151/360 (41%), Gaps = 65/360 (18%)
Query: 71 KRQKTRVKL---QSNNNSSRNQLLFPSEGSQTHFF--------GNQFYWLHYTWIDIGTP 119
+ + RV L ++ N SR+QLL + H GN + + + IGTP
Sbjct: 27 RLKGLRVHLTHVDAHGNYSRHQLLRRAARRSHHRMSRLVPVHAGNGEFLMD---VSIGTP 83
Query: 120 NVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
+++ +D GS+L+W C+ C+ C ++ +DPSSSS+ V CS
Sbjct: 84 ALAYSAIVDTGSDLVWTQCKPCVDC----------FKQSTPVFDPSSSSTYATVPCSSAS 133
Query: 179 CKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
C S C S C Y Y + +S+ G L + LA S V+ GC
Sbjct: 134 CSDLPTSKCTSASK-CGYTYTYG-DSSSTQGVLATETFTLA--------KSKLPGVVFGC 183
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGD 293
G G A G++GLG G + SL+++ GL + FS C D+ ++ + G
Sbjct: 184 GDTNEGDGFSQGA--GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNSPLLLGS 236
Query: 294 QGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQ--------A 335
++ Q+T + + Y+V +++ +G++ L S F
Sbjct: 237 LAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGV 296
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+VDSG S T+L + Y + F ++ G C+ A ++ + +V RL+F
Sbjct: 297 IVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVF 356
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 130/331 (39%), Gaps = 44/331 (13%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP F V +D GS+L WV QC+P Y +N S + P++S+S ++
Sbjct: 7 VRLGTPERVFSVIVDTGSDLTWV-----QCSPCGTCY----SQNDSLFIPNTSTSFTKLA 57
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C LC + C Y Y + + S+G V D + + + Q +
Sbjct: 58 CGTELCNGLPYPMCNQTTCVYWYSYG-DGSLSTGDFVYDTITMDGINGQKQQV---PNFA 113
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----NDSGS 288
GCG GS+ A DG++GLG G +S PS L + FS C + +
Sbjct: 114 FGCGHDNEGSF---AGADGILGLGQGPLSFPSQLKT--VFNGKFSYCLVDWLAPPTQTSP 168
Query: 289 VFFGDQGPATQQSTSFL-----PIGEKYDAYFVGVESYCIGNSCLTQSGFQ--------- 334
+ FGD T ++ P Y Y+V + +G L S
Sbjct: 169 LLFGDAAVPTFPGVKYISLLTNPKVPTY--YYVKLNGISVGGKLLNISSTAFDIDSVGRA 226
Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSEEML-KVPDM 391
+ DSG + T L E++ EV+ + + R S + C +E L VP M
Sbjct: 227 GTIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSM 286
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFS 422
F + ++ F F E+ CFS
Sbjct: 287 TFHFEGGDMELPPSNYFIFLESS--QSYCFS 315
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 132/316 (41%), Gaps = 44/316 (13%)
Query: 80 QSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ 139
Q +N + ++FP +G + + FY + + IG P + + +D+GS+L W+ C
Sbjct: 44 QPISNRMGHTVVFPLQG---NVYPQGFYSVS---LRIGNPPKPYTLDIDSGSDLTWLQCD 97
Query: 140 --CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPC 192
C+ C P + ++C+ P+C S+ CK+ + C
Sbjct: 98 APCVSCT--------------KAPHPPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQC 143
Query: 193 PYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
Y Y+ + SS G LV DI L L + + AP+ + GCG Q SY AP
Sbjct: 144 DYEVSYA-DHGSSLGVLVHDIFSLQLTNGTLAAPR------LAFGCGYDQ--SYPGPNAP 194
Query: 251 ---DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 307
DGV+GLG G S+ + L GLI++ C G +F GD +T + P+
Sbjct: 195 PFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGL-STTPGIIWTPM 253
Query: 308 GEK--YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
K AY +G G + + DSG+S+T+ + Y + K ++ K
Sbjct: 254 SRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNGK 313
Query: 366 RISLQGNSWKYCYNAS 381
S C+ +
Sbjct: 314 LKETADESLPVCWRGA 329
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 102/408 (25%), Positives = 166/408 (40%), Gaps = 72/408 (17%)
Query: 69 DWKRQ-KTRVKLQSNNNSSR----NQLLFPSEGSQTHFF---------GNQFYWLHY-TW 113
DW++ + R+ L + N +S +FP QTH G + L+Y
Sbjct: 13 DWEKIFQNRIILDAINVNSLFSHFKSAIFPG---QTHQLSDSQIPISSGARLQTLNYIVT 69
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IG N + +V D GS+L WV QC P Y ++ ++PS+SSS ++
Sbjct: 70 VGIGGQNSTLIV--DTGSDLTWV-----QCLPCRLCY----NQQEPLFNPSNSSSFLSLP 118
Query: 174 CSHPLC-------KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
C+ P C S C + C Y DY + + S G L F K
Sbjct: 119 CNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYG-DGSYSRGEL--------GFEKLTLG 169
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN- 284
+ + I GCGR G + G+MGL ++S+ S + L + FS C
Sbjct: 170 KTEIDNFIFGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSLFGSVFSYCLPTTG 224
Query: 285 --DSGSVFFGDQGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCL------ 328
SGS+ G + ++ S PI + + YF+ + IG L
Sbjct: 225 VGSSGSLTLGGADFSNFKNIS--PISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLS 282
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
+ G +L+DSG T L IY +F+K S R + + C+N + E + +
Sbjct: 283 SNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNI 342
Query: 389 PDMRLIFSKNQSFVVR-NHIFSFPENEVGDHACFSYFTLEYNFTGILI 435
P ++ IF N +V +F F +++ C ++ +L Y ++I
Sbjct: 343 PTVKFIFEGNAEMIVDVEGVFYFVKSDA-SQICLAFASLGYEDQTMII 389
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 65/247 (26%), Positives = 104/247 (42%), Gaps = 23/247 (9%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
++IG P + + +D GS+L W+ C AP S T P S+ V
Sbjct: 83 LNIGQPPRPYFLDIDTGSDLTWLQCD----APCSRCSQTP--------HPLYRPSNDLVP 130
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSVQS 230
C H LC S + P+ DY + SS G L+ D+ L +F+ ++
Sbjct: 131 CRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTL-NFTNGV---QLKV 186
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF 290
+ +GCG Q DG++GLG G S+ S L GL++N C G +F
Sbjct: 187 RMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYIF 246
Query: 291 FGDQGPATQQSTSFLPIGEK-YDAYFV-GVESYCIGNSCLTQSGFQALVDSGASFTFLPT 348
FGD + + ++ P+ + Y Y V G G A+ D+G+S+T+ +
Sbjct: 247 FGDVYDSFR--LTWTPMSSRDYKHYSVAGAAELLFGGKKSGVGNLHAVFDTGSSYTYFNS 304
Query: 349 EIYAEVV 355
Y ++
Sbjct: 305 YAYQVLI 311
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 148/347 (42%), Gaps = 50/347 (14%)
Query: 83 NNSSRNQLL----FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC 138
+N R + L FP +G+ + L+YT I +G P V +D GS++LWV C
Sbjct: 58 HNDRRGRFLQGISFPLKGNYSDL------GLYYTEIGLGNPVQKLKVIVDTGSDILWVKC 111
Query: 139 Q-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLKDP 191
C C LS + LS Y+ S+SS+S SCS PLC SRS S
Sbjct: 112 SPCRSC--LSKQ---DIIPPLSIYNLSASSTSSVSSCSDPLCTGEEVVCSRSGNNS---A 163
Query: 192 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 251
C Y++ Y + S Y+ DD+ ++ H ++ S + GC TGS+ D
Sbjct: 164 CAYVSSYQDKSASVGAYVRDDMHYVL----HGGNATT-SRIFFGCATNITGSW----PVD 214
Query: 252 GVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIGEK 310
G+MG GL +VP+ +A + FS C E G + + P T + F P+
Sbjct: 215 GIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGEAPNTTEMV-FTPLLNV 273
Query: 311 YDAYFVGVESYCIGNSCL------------TQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
Y V + S + + L + + ++DSG +F L T+ + +
Sbjct: 274 TTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDSGTTFVLLTTKANRMLFQEI 333
Query: 359 DKLVSSKR-ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
L ++K L+G Y + + E P++ L FS + ++
Sbjct: 334 KSLTTAKLGPKLEGLECFYLKSGLTMET-SFPNVTLTFSGGSTMKLK 379
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 34/294 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP F + D GS++ W QC P + Y + L +PS+S+S KN+S
Sbjct: 135 VGLGTPKKEFTLIFDTGSDITWT-----QCEPCVKTCYKQKEPRL---NPSTSTSYKNIS 186
Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
CS LCK +S K C Y Y + + S G+ + L L+S S+V
Sbjct: 187 CSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTLSS-------SNV 238
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
+ + GCG++ G + A G+ L ++PS AK + FS C + S
Sbjct: 239 FKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKLFSYCLPASSSSK 293
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA--LVDSGA 341
+ G +S F P+ +D+ Y + + +G L+ +S F A ++DSG
Sbjct: 294 GYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGT 352
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T L Y+E+ F L++ + + + CY+ S + +++P + + F
Sbjct: 353 VITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTF 406
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 142/351 (40%), Gaps = 58/351 (16%)
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
L+ KR + R++ S N +L S G +T + +L + IGTP S
Sbjct: 60 LIKRAIKRGERRMR-------SINAMLQSSSGIETPVYAGSGEYLMN--VAIGTPASSLS 110
Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
+D GS+L+W C+ C QC + ++P SSS + C C+
Sbjct: 111 AIMDTGSDLIWTQCEPCTQC----------FSQPTPIFNPQDSSSFSTLPCESQYCQDLP 160
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
S +S + C Y Y + +S+ GY+ + ++S ++ GCG G
Sbjct: 161 S-ESCYNDCQYTYGYG-DGSSTQGYMATETFTF--------ETSSVPNIAFGCGEDNQGF 210
Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--------GSVFFGDQG 295
A G++G+G G +S+PS L FS C + S GS G
Sbjct: 211 GQGNGA--GLIGMGWGPLSLPSQLGVG-----QFSYCMTSSGSSSPSTLALGSAASGV-- 261
Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQSGFQ--------ALVDSGASFTF 345
P ST+ + Y++ ++ +G N + S FQ ++DSG + T+
Sbjct: 262 PEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTY 321
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIF 395
LP + Y V F ++ + + C+ S+ ++VP++ + F
Sbjct: 322 LPQDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPEISMQF 372
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 90/391 (23%), Positives = 149/391 (38%), Gaps = 82/391 (20%)
Query: 47 VSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF 106
VS + W N + L L ++ K KT+ L LFP
Sbjct: 47 VSSKNPWGALNHLASLSLSRAHHIKSPKTKFSLLKTP-------LFPRS----------- 88
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLD-RNLSEY 161
Y + ++ GTP + +D GS+L+W PC C +C + +++ + +
Sbjct: 89 YGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCD------FPNIEVTGIPTF 142
Query: 162 DPSSSSSSKNVSCSHPLC------KSRSSCKSLKDPC---------PYIADYSTEDTSSS 206
P SSSS + C + C K +S C+ DP PY+ Y S++
Sbjct: 143 IPKQSSSSNLIGCKNHKCSWLFGPKVQSKCQEC-DPTTQNCTQSCPPYVIQYGLG--STA 199
Query: 207 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 266
G L+ + L P ++GC S P+G+ G G S+PS
Sbjct: 200 GLLLSETLDF-------PHKKTIPGFLVGC------SLFSIRQPEGIAGFGRSPESLPSQ 246
Query: 267 LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQS-------TSFL--PIGEKYDAYFVG 317
L S FD+ + S D G + + T F P D Y+V
Sbjct: 247 LGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVL 306
Query: 318 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 367
+ + IG++ + + +VDSG +FTF+ +Y V +F+K V+ +
Sbjct: 307 LRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTV 366
Query: 368 SLQ---GNSWKYCYNASSEEMLKVPDMRLIF 395
+ + + C+N S E+ + VP+ F
Sbjct: 367 ATEVQNQTGLRPCFNISGEKSVSVPEFIFHF 397
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 64/261 (24%), Positives = 106/261 (40%), Gaps = 27/261 (10%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
++IG P + + +D GS+L W+ C C +C+ Y S+
Sbjct: 81 LNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLY--------------RPSNDF 126
Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSV 228
V C H LC S + P+ DY + SS G L+ D+ L +F+ +
Sbjct: 127 VPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTL-NFTNGV---QL 182
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
+ + +GCG Q DG++GLG G S+ S L GL++N C G
Sbjct: 183 KVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGY 242
Query: 289 VFFGDQGPATQQSTSFLPIGEK-YDAY-FVGVESYCIGNSCLTQSGFQALVDSGASFTFL 346
+FFGD +++ ++ P+ + Y Y G G A+ D+G+S+T+
Sbjct: 243 IFFGDVYDSSR--LTWTPMSSRDYKHYSAAGAAELLFGGKKSGIGSLHAVFDTGSSYTYF 300
Query: 347 PTEIYAEVVVKFDKLVSSKRI 367
Y ++ K K +
Sbjct: 301 NPYAYQALISWLGKESGGKPL 321
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 147/358 (41%), Gaps = 63/358 (17%)
Query: 107 YWLHYTWIDIGTP--NVSFLVALDAGSNLLWVPC----QCIQCAPLSASYYTSLDRNLSE 160
Y H + GTP +SFLV D GS+++W PC C C S+ + + +
Sbjct: 84 YGGHSIPLSFGTPPQKLSFLV--DTGSHVVWAPCTTHYTCTNC-----SFSDAEPKKVPI 136
Query: 161 YDPSSSSSSKNVSCSHPLCKSRSS---------CKSLKDPC-----PYIADYSTEDTSSS 206
++P SSSSK + C +P C + SS C C PY Y T SS
Sbjct: 137 FNPKLSSSSKILGCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSLQYGT-GASSG 195
Query: 207 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 266
+L++++ + P ++ ++GC G A + G G S+P
Sbjct: 196 DFLLENL--------NFPGKTIH-EFLVGCTTSAVGEVTSAA----LAGFGRSMFSLPMQ 242
Query: 267 LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA----YFVGVESYC 322
+ S +D+ + S D + S+ P + Y++GV+
Sbjct: 243 MGVKKFAYCLNSHDYDDTRNSSKLILDYSDGETKGLSYAPFLKNPPDFPIYYYLGVKDIK 302
Query: 323 IGNSCL-TQSGFQA---------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 372
IGN L S + A ++DSG ++ ++ ++ +V + K +S R SL+
Sbjct: 303 IGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAE 362
Query: 373 S---WKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEVGDHACFSYFT 425
+ CYN + ++ +K+PD+ F + VV +N+ PE + ACF T
Sbjct: 363 AEIGVTPCYNFTGQKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEISL---ACFPLTT 417
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 132/316 (41%), Gaps = 44/316 (13%)
Query: 80 QSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ 139
Q +N + ++FP +G + + FY + + IG P + + +D+GS+L W+ C
Sbjct: 11 QPISNRMGHTVVFPLQG---NVYPQGFYSVS---LRIGNPPKPYTLDIDSGSDLTWLQCD 64
Query: 140 --CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPC 192
C+ C P + ++C+ P+C S+ CK+ + C
Sbjct: 65 APCVSCT--------------KAPHPPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQC 110
Query: 193 PYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
Y Y+ + SS G LV DI L L + + AP+ + GCG Q SY AP
Sbjct: 111 DYEVSYA-DHGSSLGVLVHDIFSLQLTNGTLAAPR------LAFGCGYDQ--SYPGPNAP 161
Query: 251 ---DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 307
DGV+GLG G S+ + L GLI++ C G +F GD +T + P+
Sbjct: 162 PFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGL-STTPGIIWTPM 220
Query: 308 GEK--YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
K AY +G G + + DSG+S+T+ + Y + K ++ K
Sbjct: 221 SRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNGK 280
Query: 366 RISLQGNSWKYCYNAS 381
S C+ +
Sbjct: 281 LKETADESLPVCWRGA 296
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 89/366 (24%), Positives = 145/366 (39%), Gaps = 64/366 (17%)
Query: 98 QTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC----QCIQCA-------PL 146
QT F + Y H + GTP +D GS+++W PC C C+ P+
Sbjct: 76 QTSLFPHS-YGAHTIPLSFGTPPQKLSFLMDTGSHVVWAPCTTHYTCTNCSFSNPKKVPI 134
Query: 147 SASYYTSLDRNLSEYDPS-SSSSSKNVSCSHPLCKSRSSCKSLKDPCP-YIADYSTEDTS 204
+S D+ L DP + +SS BV P C S K CP Y Y T +
Sbjct: 135 FNPELSSSDKILGCRDPKCADTSSPBVHLGXPRCNGNS--KKCSHACPQYTLQYGTG--A 190
Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 264
+SG+ + + L + H ++GC T S + D + G G S+P
Sbjct: 191 ASGFFLLENLDFPGKTIH--------KFLVGC----TTSADREPSSDALAGFGRTMFSLP 238
Query: 265 SLLAKAGLIQNSFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYD----AY 314
+ F+ C + +D SG + D Q S+ P + Y
Sbjct: 239 MQMG-----VKKFAYCLNSHDYDDTRNSGKLIL-DYSDGETQGLSYAPFXKNPPDYPIYY 292
Query: 315 FVGVESYCIGNSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS 364
++GV+ IGN L G ++DSG +++++ ++ V + K +S
Sbjct: 293 YLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFAYSYMTLPVFKIVTNELKKQMSK 352
Query: 365 KRISLQGNSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEVGDHA 419
R SL+ + CYN + + +K+PD+ F+ + VV N+ F E +G
Sbjct: 353 YRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLG--- 409
Query: 420 CFSYFT 425
CF T
Sbjct: 410 CFPVTT 415
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 125/295 (42%), Gaps = 54/295 (18%)
Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-- 181
V +D GS+L WV CQ C +C Y D ++PS S S + V C+ C+S
Sbjct: 79 VIVDTGSDLSWVQCQPCNRC-------YNQQD---PVFNPSKSPSYRTVLCNSLTCRSLQ 128
Query: 182 -----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
C S C Y+ +Y + + +SG + + L+L ++ ++ I GC
Sbjct: 129 LATGNSGVCGSNPPTCNYVVNYG-DGSYTSGEVGMEHLNLG--------NTTVNNFIFGC 179
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND---SGSVFFGD 293
GRK G + G++GLG D+S+ S ++ + FS C + SGS+ G
Sbjct: 180 GRKNQGLF---GGASGLVGLGRTDLSLISQISP--MFGGVFSYCLPTTEAEASGSLVMGG 234
Query: 294 QGPATQQST----------SFLPIGEKYDAYFVGVESYCIGNSCLTQSGF---QALVDSG 340
+ +T LP YF+ + +G + F + ++DSG
Sbjct: 235 NSSVYKNTTPISYTRMIHNPLLPF------YFLNLTGITVGGVEVQAPSFGKDRMIIDSG 288
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ LP IY + +F K S + C+N S + +K+PD+++ F
Sbjct: 289 TVISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYF 343
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 78/300 (26%), Positives = 124/300 (41%), Gaps = 34/300 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y + +GTP + D GS L W QC P + S Y D +DPS SSS
Sbjct: 140 YYVVVGLGTPKRDLSLIFDTGSYLTWT-----QCEPCAGSCYKQQD---PIFDPSKSSSY 191
Query: 170 KNVSCSHPLCKS-RSS-CKSLKDP-CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
N+ C+ LC RS+ C S D C Y Y +++ S G+L + L + + +
Sbjct: 192 TNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYG-DNSISRGFLSQERLTITA-------T 243
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+ + GCG+ G + A G+MGL +S + + + FS C S
Sbjct: 244 DIVHDFLFGCGQDNEGLFRGTA---GLMGLSRHPISF--VQQTSSIYNKIFSYCLPSTPS 298
Query: 287 --GSVFFGDQGP--ATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQSGFQAL 336
G + FG A + T F I + Y + + +G + L T S ++
Sbjct: 299 SLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGSI 358
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
+DSG T LP YA + F + + ++ CY+ S + + VP + F+
Sbjct: 359 IDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFA 418
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 151/365 (41%), Gaps = 47/365 (12%)
Query: 56 KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQL-----LFPSEGSQTHFFGNQFYWLH 110
K + E L + + + K+ S N+ +L P+ S + G Y +
Sbjct: 75 KEKPSHEETLRRDQLRAAYIQAKVSSRYNNVAKELQQSAVTIPT--SSGYSLGTTEYVIT 132
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
T IGTP V+ ++++D GS++ WV QCAP +A +S L +DP+ S++
Sbjct: 133 VT---IGTPAVTQVMSIDTGSDVSWV-----QCAPCAAQSCSSQKDKL--FDPAMSATYS 182
Query: 171 NVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
SC C + C LK C YI Y + ++++G D L L S S
Sbjct: 183 AFSCGSAQCAQLGDEGNGC--LKSQCQYIVKYG-DGSNTAGTYGSDTLSLTS-------S 232
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDEND 285
S GC + G + DG+MGLG GD SL+++ A +FS C
Sbjct: 233 DAVKSFQFGCSHRAAGFVGE---LDGLMGLG-GDTE--SLVSQTAATYGKAFSYCLPPPS 286
Query: 286 S---GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV--ESYCIGNSCLT--QSGFQ--AL 336
S G + G G A+ S P+ F GV + + + L S F ++
Sbjct: 287 SSGGGFLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFSGASV 346
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
VDSG T LP Y + F K + + + S C++ S + VP + L FS
Sbjct: 347 VDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTFS 406
Query: 397 KNQSF 401
+ +
Sbjct: 407 RGAAM 411
>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
Length = 423
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 114/272 (41%), Gaps = 21/272 (7%)
Query: 93 PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
PS GN + H+ ++IG P + + +D GS L W+ C CI C +
Sbjct: 20 PSSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSL 79
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCK-SLKDPCPYIADYSTEDT 203
+Y L + + V C+ C R K K+ C Y Y
Sbjct: 80 FYPRLIGSFVPHGLYKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GG 137
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVS 262
SS G L+ D SFS A + +S+ GCG Q + + P +G++GLG G V+
Sbjct: 138 SSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVT 192
Query: 263 VPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY--FVGVE 319
+ S L G+I ++ C G +FFGD T T + P+ ++ Y G
Sbjct: 193 LLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTL 251
Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIY 351
+ + ++ + + + DSGA++T+ + Y
Sbjct: 252 QFNSNSKPISAAPMEVIFDSGATYTYFALQPY 283
>gi|297739018|emb|CBI28370.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 65/121 (53%), Gaps = 13/121 (10%)
Query: 23 SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
+F + HRFSD K + D P+K S++Y + + DW R+ S
Sbjct: 29 TFGFDMHHRFSDPVK--------GILDVDDLPEKLSLQYYKAMAHRDWVIHGRRL---ST 77
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
++ + L F S+G++T+ + Y LHY + +GTP++ FLVALD GS+L W+PC C
Sbjct: 78 SDEVKPPLTF-SDGNETYRLSSLGY-LHYANVSLGTPSLWFLVALDTGSDLFWLPCDCTS 135
Query: 143 C 143
C
Sbjct: 136 C 136
>gi|359496801|ref|XP_003635339.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 151
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 65/121 (53%), Gaps = 13/121 (10%)
Query: 23 SFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSN 82
+F + HRFSD K + D P+K S++Y + + DW R+ S
Sbjct: 29 TFGFDMHHRFSDPVK--------GILDVDDLPEKLSLQYYKAMAHRDWVIHGRRL---ST 77
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ 142
++ + L F S+G++T+ + Y LHY + +GTP++ FLVALD GS+L W+PC C
Sbjct: 78 SDEVKPPLTF-SDGNETYRLSSLGY-LHYANVSLGTPSLWFLVALDTGSDLFWLPCDCTS 135
Query: 143 C 143
C
Sbjct: 136 C 136
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 106/436 (24%), Positives = 176/436 (40%), Gaps = 68/436 (15%)
Query: 31 RFSDEA------KERWISKSGN-VSV------ADSWPKKNSVEYLELLLSNDWKRQKTRV 77
R+S+ A + RW+ + N VSV P S + E LS +R + R
Sbjct: 36 RYSEPAATCSTSRVRWLDEGSNTVSVPLVHRHGPCAPSTRSSD--EPSLSERLRRSRARS 93
Query: 78 KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP 137
K + S N + TH G+ + + +GTP VS ++ +D GS+L WV
Sbjct: 94 KYIMSRASKSNVSI------PTHLGGSVDSLEYVVTVGLGTPAVSQVLLIDTGSDLSWV- 146
Query: 138 CQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS------RSSCKSLKD- 190
QCAP +++ T + +DPS SS+ + C+ C+ S C S
Sbjct: 147 ----QCAPCNST--TCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTRDGYGSDCTSGSGG 200
Query: 191 --PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 248
C Y Y + + ++G ++ L + AP +V+ GCG Q G
Sbjct: 201 GAQCGYAITYG-DGSQTTGVYSNETLTM------APGVTVK-DFHFGCGHDQDGP---ND 249
Query: 249 APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQST-SFLP- 306
DG++GLG S+ ++ + + +FS C + + F P S F P
Sbjct: 250 KYDGLLGLGGAPESL--VVQTSSVYGGAFSYCLPAANDQAGFLALGAPVNDASGFVFTPM 307
Query: 307 IGEKYDAYFVGVESYCIGNSCL--TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLV 362
+ E+ Y V + +G + S F ++DSG T L YA + F K +
Sbjct: 308 VREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTELQHTAYAALQAAFRKAM 367
Query: 363 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-------VVRNHIFSF----P 411
++ + L CYN + + VP + L FS + ++ ++ +F P
Sbjct: 368 AAYPL-LPNGELDTCYNFTGHSNVTVPRVALTFSGGATVDLDVPDGILLDNCLAFQEAGP 426
Query: 412 ENEVGDHACFSYFTLE 427
+N+ G + TLE
Sbjct: 427 DNQPGILGNVNQRTLE 442
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 73/329 (22%), Positives = 133/329 (40%), Gaps = 62/329 (18%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLD-RNLSEY 161
Y + T + GTP + + D GS+L+W PC C +C+ + +D + +
Sbjct: 78 YGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS------FPKIDPTGIPRF 131
Query: 162 DPSSSSSSKNVSCSHPLCK------SRSSCKSLK-------DPCP-YIADYSTEDTSSSG 207
P SSSSK V C +P C +S C+S CP Y+ Y + T +G
Sbjct: 132 VPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGST--AG 189
Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
L+ + L K P + ++GC S+L P G+ G G G S+PS +
Sbjct: 190 LLLSETLDFP--DKKIP------NFVVGC------SFLSIHQPSGIAGFGRGSESLPSQM 235
Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--------YDAYFVGVE 319
S FD++ D ++ P + + Y++ +
Sbjct: 236 GLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIR 295
Query: 320 SYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRI 367
+GN + +++DSG++FTF+ + V +F+K ++ ++
Sbjct: 296 KIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRAT 355
Query: 368 SLQG-NSWKYCYNASSEEMLKVPDMRLIF 395
++ + C++ S E+ +K P++ F
Sbjct: 356 DVETLTGLRPCFDISKEKSVKFPELIFQF 384
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 75/264 (28%), Positives = 117/264 (44%), Gaps = 41/264 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ T + +GTP ++++ +D+GS+L W +QCAP + S + + YDP +SS+
Sbjct: 108 YITRLGLGTPTTTYVMVVDSGSSLTW-----LQCAPCAVSCH---PQAGPLYDPRASSTY 159
Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
V CS P C + SSC S C Y A Y + + S GYL D + L+S
Sbjct: 160 AAVPCSAPQCAELQAATLNPSSC-SGSGVCQYQASYG-DGSFSFGYLSKDTVSLSS---- 213
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
S GCG+ G + A G++GL +S+ S LA + + NSF+ C
Sbjct: 214 ---SGSFPGFYYGCGQDNVGLFGRAA---GLIGLARNKLSLLSQLAPS--VGNSFAYCLP 265
Query: 283 EN---DSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-----Q 330
+ +G + FG ++ P TS + YFV + + S L
Sbjct: 266 TSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEY 325
Query: 331 SGFQALVDSGASFTFLPTEIYAEV 354
++DSG T LPT +Y +
Sbjct: 326 GSLPTIIDSGTVITRLPTPVYTAL 349
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 77/306 (25%), Positives = 123/306 (40%), Gaps = 44/306 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + D GS+L W QC P S Y + +DPS+S + N+S
Sbjct: 158 VGLGTPKKDLSLIFDTGSDLTWT-----QCQPCVKSCYA---QQQPIFDPSASKTYSNIS 209
Query: 174 CSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
C+ C S C S C Y Y + + + G+ D L L Q+
Sbjct: 210 CTSTACSGLKSATGNSPGCSS--SNCVYGIQYG-DSSFTVGFFAKDTLTLT-------QN 259
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
V + GCG+ G + A G++GLG +S+ A+ FS C
Sbjct: 260 DVFDGFMFGCGQNNRGLFGKTA---GLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRG 314
Query: 285 DSGSVFFGD-----QGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSG--FQ- 334
+G + FG+ A + +F P A YF+ V +G L+ S FQ
Sbjct: 315 SNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQN 374
Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++DSG T LP+ +Y + F + +S + + CY+ S+ + +P +
Sbjct: 375 AGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKIS 434
Query: 393 LIFSKN 398
F+ N
Sbjct: 435 FNFNGN 440
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 100/410 (24%), Positives = 163/410 (39%), Gaps = 55/410 (13%)
Query: 17 DGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTR 76
DG+ +V+ S HR+ G S AD + ELL + + R
Sbjct: 30 DGTSSVTLS----HRY------------GPCSPADPNSGEKRPTDEELLRRDQLRADYIR 73
Query: 77 VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLW 135
K +N ++ + S+ S G+ L Y + +G+P V+ V +D GS++ W
Sbjct: 74 RKFSGSNGTAAGEDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSW 133
Query: 136 VPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSL 188
V C+ C +P A + +DP++SS+ +CS C + C +
Sbjct: 134 VQCEPCPAPSPCHA-------HAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDA- 185
Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 248
K C YI Y + ++++G D+L L+ S V GC + G+ +D
Sbjct: 186 KSRCQYIVKYG-DGSNTTGTYSSDVLTLSG-------SDVVRGFQFGCSHAELGAGMDDK 237
Query: 249 APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ--------Q 300
DG++GLG GD P + A SF C + S F PA+
Sbjct: 238 T-DGLIGLG-GDAQSP-VSQTAARYGKSFFYCLPATPASSGFLTLGAPASGGGGGASRFA 294
Query: 301 STSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQA--LVDSGASFTFLPTEIYAEVVV 356
+T L + YF +E +G L+ S F A LVDSG T LP YA +
Sbjct: 295 TTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGSLVDSGTVITRLPPAAYAALSS 354
Query: 357 KFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH 406
F ++ + C+N + + + +P + L+F+ + H
Sbjct: 355 AFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAH 404
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 150/372 (40%), Gaps = 61/372 (16%)
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
L +N + R L +N N++ QL P + ++ + IGTP
Sbjct: 52 LSTNTALKMMLRNSLIANTNNNNTQLKSPPSSPYNYKLSFKYSMALIVDLPIGTPPQVQP 111
Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
+ LD GS L W+ QC + AP S +DPS SS+ + C+HP+CK R
Sbjct: 112 MVLDTGSQLSWI--QCHKKAPAKPPPTAS-------FDPSLSSTFSTLPCTHPVCKPRIP 162
Query: 185 CKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 239
+L C + + + + T + G LV + +FS+ S +I+GC +
Sbjct: 163 DFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKF---TFSR----SLFTPPLILGCATE 215
Query: 240 QTGSYLDGAAPDGVMGLGLGDVS-------------VPSLLAKAGLIQN-SFSICFDEND 285
T P G++G+ G +S VP+ + + G SF + + N
Sbjct: 216 STD-------PRGILGMNRGRLSFASQSKITKFSYCVPTRVTRPGYTPTGSFYLGHNPNS 268
Query: 286 SGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLTQSGF----------Q 334
+ + A Q L P+ AY V ++ IG L S Q
Sbjct: 269 NTFRYIEMLTFARSQRMPNLDPL-----AYTVALQGIRIGGRKLNISPAVFRADAGGSGQ 323
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLK-VPDM 391
++DSG+ FT+L E Y +V + + V K+ + G C++ ++ E+ + + DM
Sbjct: 324 TMLDSGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFDGNAIEIGRLIGDM 383
Query: 392 RLIFSKNQSFVV 403
F K VV
Sbjct: 384 VFEFEKGVQIVV 395
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 123/296 (41%), Gaps = 54/296 (18%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
+ IGTP V +D GS+L W PC CI+C +Y +R ++ + PS SSS
Sbjct: 84 LSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIEC----DNYRN--NRMMASFSPSHSSS 137
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDI 213
S SC+ P C S + DPC +A S T +G +V
Sbjct: 138 SHRDSCTSPFCIDVHSSDNPLDPC-TMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGT 196
Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
L + H V + C SY + P G+ G G G +S+PS L G +
Sbjct: 197 LTRDTLRVHGRNLGVTQEIPRFCFGCVASSYRE---PIGIAGFGRGALSLPSQL---GFL 250
Query: 274 QNSFSICF-------DENDSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGVESYCI 323
+ FS CF + N S + GD ++ F P+ + + Y+VG+E+ +
Sbjct: 251 RKGFSHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITV 310
Query: 324 GNSCLTQ-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 368
GN T+ LVDSG ++T LP Y++V+ +++ R +
Sbjct: 311 GNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRAT 366
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 132/319 (41%), Gaps = 42/319 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-C-----IQCAPLSASYYTSLDRNLSEYDP 163
+Y I +GTP F + +D GS+L W+ CQ C +Q P+ ++S+
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPI-------FTPSVSKTYK 159
Query: 164 SSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKH 222
+ S SS S + C + C Y A Y DTS S GYL D+L L
Sbjct: 160 ALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYG--DTSFSIGYLSQDVLTL------ 211
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
P ++ S + GCG+ G + A G++GL +S+ L+ N+FS C
Sbjct: 212 TPSAAPSSGFVYGCGQDNQGLFGRSA---GIIGLANDKLSMLGQLSNK--YGNAFSYCLP 266
Query: 282 -----DENDSGSVFFGDQGPATQQST-SFLPIGEKYDA---YFVGVESYCIGNSCLTQSG 332
N S S F + S F P+ + YF+G+ + + L S
Sbjct: 267 SSFSAQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSA 326
Query: 333 ----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLK 387
++DSG T LP IY + F ++S K G S C+ S +EM
Sbjct: 327 SSYNVPTIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMST 386
Query: 388 VPDMRLIFSKNQSFVVRNH 406
VP++R+IF ++ H
Sbjct: 387 VPEIRIIFRGGAGLELKVH 405
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/381 (23%), Positives = 164/381 (43%), Gaps = 55/381 (14%)
Query: 70 WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQF-YWLHYTWIDIGTPNVSFLVALD 128
++R V+ N + + ++ +++ +Q Y + Y+ +G+P L +D
Sbjct: 53 FQRVANAVRRSINRGNHFKKAFVSTDSAESTVVASQGEYLMRYS---VGSPPFQVLGIVD 109
Query: 129 AGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-RSSCKS 187
GS++LW +QC P Y + +DPS S + K + CS C+S R++ S
Sbjct: 110 TGSDILW-----LQCEPCEDCY----KQTTPIFDPSKSKTYKTLPCSSNTCESLRNTACS 160
Query: 188 LKDPCPYIADYSTEDTSSSGYLVDDILHLASF---SKHAPQSSVQSSVIIGCGRKQTGSY 244
+ C Y DY + + S G L + L L S S H P++ +IGCG G++
Sbjct: 161 SDNVCEYSIDYG-DGSHSDGDLSVETLTLGSTDGSSVHFPKT------VIGCGHNNGGTF 213
Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQ 299
+ +G +GLG V + + I FS C + N S + FGD +
Sbjct: 214 QE----EGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSG 269
Query: 300 QSTSFLPI----GEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTF 345
+ T P+ G+ + YF+ +E++ +G++ + ++DSG + T
Sbjct: 270 RGTVSTPLDPLNGQVF--YFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTL 327
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR- 404
LP E Y + ++ +R CY +S+E+ D+ +I + + V
Sbjct: 328 LPQEDYLNLESAVSDVIKLERARDPSKLLSLCYKTTSDEL----DLPVITAHFKGADVEL 383
Query: 405 NHIFSFPENEVGDHACFSYFT 425
N I +F E G CF++ +
Sbjct: 384 NPISTFVPVEKG-VVCFAFIS 403
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/270 (25%), Positives = 119/270 (44%), Gaps = 49/270 (18%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP+ ++ +D GS+L+W +QC+P Y + +DP SS+
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVW-----LQCSPCRRCYA----QRGQVFDPRRSSTY 136
Query: 170 KNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
+ V CS P C++ S + C Y+ Y + +SS+G L D L A+
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYG-DGSSSTGELATDKLAFAN------ 189
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ ++V +GCGR G + D AA G++G+ G +S+ + +A A + F C +
Sbjct: 190 -DTYVNNVTLGCGRDNEGLF-DSAA--GLLGVARGKISISTQVAPA--YGSVFEYCLGDR 243
Query: 285 DSGS------VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ---- 334
S S VF P + T+ L + Y+V + + +G +T GF
Sbjct: 244 TSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVT--GFSNASL 301
Query: 335 ----------ALVDSGASFTFLPTEIYAEV 354
+VDSG + + + YA +
Sbjct: 302 ALDTATGRGGVVVDSGTAISRFARDAYAAL 331
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/311 (24%), Positives = 127/311 (40%), Gaps = 54/311 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IG+P F +D GS+L+W C C+ C Y ++P+ S+S ++
Sbjct: 92 VGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPY----------FEPAKSTSYASL 141
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
CS +C + S ++ C Y A Y + SS+G L ++ +F ++ + +V V
Sbjct: 142 PCSSAMCNALYSPLCFQNACVYQAFYG-DSASSAGVLANETF---TFGTNSTRVAVP-RV 196
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSV 289
GCG G+ +G+ G++G G G +S+ S L FS C F + +
Sbjct: 197 SFGCGNMNAGTLFNGS---GMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSRL 248
Query: 290 FFG-----------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------- 328
+FG GP QST F+ YF+ + + L
Sbjct: 249 YFGAYATLNSTNTSSSGPV--QSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAIN 306
Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYN--ASSEE 384
T ++DSG + TFL YA V F V R + +++ C+
Sbjct: 307 ETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRR 366
Query: 385 MLKVPDMRLIF 395
M+ +P+M L F
Sbjct: 367 MVTLPEMVLHF 377
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/329 (22%), Positives = 133/329 (40%), Gaps = 62/329 (18%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLD-RNLSEY 161
Y + T + GTP + + D GS+L+W PC C +C+ + +D + +
Sbjct: 78 YGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS------FPKIDPTGIPRF 131
Query: 162 DPSSSSSSKNVSCSHPLCK------SRSSCKSLK-------DPCP-YIADYSTEDTSSSG 207
P SSSSK V C +P C +S C+S CP Y+ Y + T +G
Sbjct: 132 VPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGST--AG 189
Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
L+ + L K P + ++GC S+L P G+ G G G S+PS +
Sbjct: 190 LLLSETLDFP--DKXIP------NFVVGC------SFLSIHQPSGIAGFGRGSESLPSQM 235
Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--------YDAYFVGVE 319
S FD++ D ++ P + + Y++ +
Sbjct: 236 GLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIR 295
Query: 320 SYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRI 367
+GN + +++DSG++FTF+ + V +F+K ++ ++
Sbjct: 296 KIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRAT 355
Query: 368 SLQG-NSWKYCYNASSEEMLKVPDMRLIF 395
++ + C++ S E+ +K P++ F
Sbjct: 356 DVETLTGLRPCFDISKEKSVKFPELIFQF 384
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/333 (25%), Positives = 147/333 (44%), Gaps = 42/333 (12%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y + Y+ +GTP++ LD GS+++W+ CQ C +C ++ +D S
Sbjct: 89 YLISYS---VGTPSLQVFGILDTGSDIIWLQCQPCKKC----------YEQTTPIFDSSK 135
Query: 166 SSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
S + K + C C+S C S K C Y Y + + S G L + L L S +
Sbjct: 136 SQTYKTLPCPSNTCQSVQGTFCSSRKH-CLYSIHY-VDGSQSLGDLSVETLTLGSTNG-- 191
Query: 224 PQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
S VQ +IGCGR + + G++GLG G +S+ + L+ + FS C
Sbjct: 192 --SPVQFPGTVIGCGRYNAIGIEEKNS--GIVGLGRGPMSLITQLSPS--TGGKFSYCLV 245
Query: 283 ---ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLT----QSGF 333
S + FG+ + + T P+ K YF+ +E++ +G + + SG
Sbjct: 246 PGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGSGG 305
Query: 334 QA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM-LKVPD 390
+ ++DSG + T LP +Y+++ K V +R+ CY + +++ VP
Sbjct: 306 KGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDASVPV 365
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
+ FS V N I +F + D CF++
Sbjct: 366 ITAHFSGAD--VTLNAINTFVQ-VADDVVCFAF 395
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 126/308 (40%), Gaps = 53/308 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
I IGTP + LD GS+L+W C C +C P A Y P+ S++ N
Sbjct: 96 IAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYA----------PARSATYAN 145
Query: 172 VSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
VSC P+C++ S C C Y Y + TS+ G L + L S +
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTLGS-------DT 197
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
V GCG + GS + + G++G+G G + SL+++ G+ + S G
Sbjct: 198 AVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTRPRRSCRARAAARG 251
Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 347
P T + +G+ + ++ + + G ++DSG +FT L
Sbjct: 252 GGA-----PTTTSPLEGITVGDT----LLPIDPAVFRLTPMGDGGV--IIDSGTTFTALE 300
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFS------KN 398
+ V L S R+ L + C+ A+S E ++VP + L F +
Sbjct: 301 ERAF---VALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRR 357
Query: 399 QSFVVRNH 406
+S+VV +
Sbjct: 358 ESYVVEDR 365
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 133/312 (42%), Gaps = 51/312 (16%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE-YDPSSSSS 168
+ T + +GTP S+ + +D GS+L W +QC+P S R + YDP +SS+
Sbjct: 134 YVTELGLGTPATSYAMVVDTGSSLTW-----LQCSPC----VVSCHRQVGPLYDPRASST 184
Query: 169 SKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
V CS C + S+C S+++ C Y A Y + + S GYL D + S S
Sbjct: 185 YATVPCSASQCDELQAATLNPSAC-SVRNVCIYQASYG-DSSFSVGYLSRDTVSFGSGSY 242
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
+ GCG+ G + A G++GL +S+ LA + + SFS C
Sbjct: 243 --------PNFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSYCL 289
Query: 282 DENDSGSVFFGDQGPATQQSTSFLPIG-EKYDA--YFVGVESYCIGNSCLT-----QSGF 333
S + GP T S+ P+ DA YFV + +G S L S
Sbjct: 290 PT--PASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSL 347
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS----WKYCYNASSEEMLKVP 389
++DSG T LPT +Y K V++ + +Q C+ + + L+VP
Sbjct: 348 PTIIDSGTVITRLPTAVY----TALSKAVAAAMVGVQSAPAFSILDTCFQGQASQ-LRVP 402
Query: 390 DMRLIFSKNQSF 401
+ + F+ +
Sbjct: 403 AVAMAFAGGATL 414
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 88/330 (26%), Positives = 136/330 (41%), Gaps = 53/330 (16%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP V F+ D GS+L W C+ C C P ++ YD ++S+S V
Sbjct: 99 LAIGTPPVPFVALADTGSDLTWTQCKPCKLCFP----------QDTPIYDTAASASFSPV 148
Query: 173 SCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQS 226
C+ C SR+ + PC Y Y+ +D + S+G L + L A S AP
Sbjct: 149 PCASATCLPIWRSSRNCTATTTSPCRY--RYAYDDGAYSAGVLGTETLTFAGSSPGAPGP 206
Query: 227 SVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----F 281
V V GCG G + G +GLG G + SL+A+ G+ FS C F
Sbjct: 207 GVSVGGVAFGCGVDNGGLSYNST---GTVGLGRGSL---SLVAQLGV--GKFSYCLTDFF 258
Query: 282 DENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 329
+ + V FG G A QST + Y+V +E +G++ L
Sbjct: 259 NTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPN 318
Query: 330 -------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-NSWKYCYNAS 381
+VDSG FT L + VV +++ ++ +S + A
Sbjct: 319 GTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNASSLDSPCFPATAG 378
Query: 382 SEEMLKVPDMRLIFSKNQSFVV-RNHIFSF 410
+++ +PDM L F+ + R++ SF
Sbjct: 379 EQQLPDMPDMLLHFAGGADMRLHRDNYMSF 408
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/277 (27%), Positives = 124/277 (44%), Gaps = 32/277 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
++IG P+ + + +D GS+L W+ C C+QC YY + NL
Sbjct: 24 LNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRN-NL------------- 69
Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSV 228
V C P+C+S S + P DY E SS G LV D +L +F+ S +
Sbjct: 70 VPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVRDTFNL-NFTSEKRHSPL 128
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
+ + G + GS+ DGV+GLG G S+ S L+ GL++N C + G
Sbjct: 129 LALGLCGYDQFPGGSH---HPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGF 185
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DSGASFTF 345
+FFGD + + ++ P+ Y G+ +GF+ L+ DSGAS+T+
Sbjct: 186 LFFGDDLYDSSR-VAWTPMSPDAKHYSPGLAELTFDGK---TTGFKNLLTTFDSGASYTY 241
Query: 346 LPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNA 380
L ++ Y ++ K +S K R +L + C+
Sbjct: 242 LNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKG 278
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 103/425 (24%), Positives = 168/425 (39%), Gaps = 85/425 (20%)
Query: 17 DGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTR 76
DG+ +V+ S HR+ G S AD + ELL + + R
Sbjct: 57 DGTSSVTLS----HRY------------GPCSPADPNSGEKRPTDEELLRRDQLRADYIR 100
Query: 77 VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLW 135
K +N ++ + S+ S G+ L Y + +G+P ++ V +D GS++ W
Sbjct: 101 RKFSGSNGTAAGEDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSW 160
Query: 136 VPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSL 188
V C+ C +P A + +L +DP++SS+ +CS C + C +
Sbjct: 161 VQCEPCPAPSPCHA-HAGAL------FDPAASSTYAAFNCSAAACAQLGDSGEANGCDA- 212
Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 248
K C YI Y + ++++G D+L L+ S V GC + G+ +D
Sbjct: 213 KSRCQYIVKYG-DGSNTTGTYSSDVLTLSG-------SDVVRGFQFGCSHAELGAGMDDK 264
Query: 249 APDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 307
DG++GLG GD SL+++ A SFS C PAT S+ FL +
Sbjct: 265 T-DGLIGLG-GDAQ--SLVSQTAARYGKSFSYCL--------------PATPASSGFLTL 306
Query: 308 GEKYDA----------------------YFVGVESYCIGNS--CLTQSGFQA--LVDSGA 341
G YF +E +G L+ S F A LVDSG
Sbjct: 307 GAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGSLVDSGT 366
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
T LP YA + F ++ + C+N + + + +P + L+F+
Sbjct: 367 VITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVV 426
Query: 402 VVRNH 406
+ H
Sbjct: 427 DLDAH 431
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/311 (24%), Positives = 127/311 (40%), Gaps = 54/311 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IG+P F +D GS+L+W C C+ C Y ++P+ S+S ++
Sbjct: 89 VGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPY----------FEPAKSTSYASL 138
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
CS +C + S ++ C Y A Y + SS+G L ++ +F ++ + +V V
Sbjct: 139 PCSSAMCNALYSPLCFQNACVYQAFYG-DSASSAGVLANETF---TFGTNSTRVAVP-RV 193
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSV 289
GCG G+ +G+ G++G G G +S+ S L FS C F + +
Sbjct: 194 SFGCGNMNAGTLFNGS---GMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSRL 245
Query: 290 FFG-----------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---------- 328
+FG GP QST F+ YF+ + + L
Sbjct: 246 YFGAYATLNSTNTSSSGPV--QSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAIN 303
Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYN--ASSEE 384
T ++DSG + TFL YA V F V R + +++ C+
Sbjct: 304 ETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRR 363
Query: 385 MLKVPDMRLIF 395
M+ +P+M L F
Sbjct: 364 MVTLPEMVLHF 374
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 125/290 (43%), Gaps = 29/290 (10%)
Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
P V V LD+ S++ WV QC+ C P+ + +D S YDPS S SS SCS P
Sbjct: 155 PGVIQTVVLDSASDVPWV--QCVPC-PIPPCH-PQVD---SFYDPSRSPSSAPFSCSSPT 207
Query: 179 CKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
C + + + C Y+ Y + +S+SG + D+L L + + S GC
Sbjct: 208 CTALGPYANGCANNQCQYLVRYP-DGSSTSGAYIADLLTLDA-------GNAVSGFKFGC 259
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGP 296
+ GS+ AA G+M LG G S+ L A N+FS C S S FF P
Sbjct: 260 SHAEQGSFDARAA--GIMALGGGPESL--LSQTASRYGNAFSYCIPATASDSGFFTLGVP 315
Query: 297 ATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA--LVDSGASFTFLPT 348
S T + + Y V + + +G L + F A ++DS + T LP
Sbjct: 316 RRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPP 375
Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
Y + F ++ R + CY+ + +++P + L+F +N
Sbjct: 376 TAYQALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRN 425
>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
Length = 500
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 134/329 (40%), Gaps = 59/329 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I+ TP V + +D G LWV C+ ++YTS + S +K+ S
Sbjct: 53 INQRTPLVPLNLVVDLGGKFLWVDCE---------NHYTSSTYRPVRCPSAQCSLAKSDS 103
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK-HAPQSSVQSSV 232
C + C + C I D + +++ G L +D+L + S S + Q+ V S
Sbjct: 104 CGDCFSSPKPGCN---NTCGLIPDNTITHSATRGDLAEDVLSIQSTSGFNTGQNVVVSRF 160
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
+ C L G A G+ GLG +++PS LA A + + F+ CF +D G + FG
Sbjct: 161 LFSCAPTSLLRGLAGGA-SGMAGLGRTKIALPSQLASAFIFKRKFAFCFSSSD-GVIIFG 218
Query: 293 DQGPAT--------------QQSTSFLPI-------------GEKYDAYFVGVESYCI-G 324
D GP + +S ++ P+ GE YF+GV++ I G
Sbjct: 219 D-GPYSFLADNPSLPNVVFDSKSLTYTPLLINHVSTASAFLQGESSVEYFIGVKTIKIDG 277
Query: 325 NSCLTQSGFQALVDSGAS---------FTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-- 373
S ++ + G +T L IY V F K ++ I+ + +S
Sbjct: 278 KVVSLNSSLLSIDNKGVGGTKISTVDPYTVLEASIYKAVTDAFVKASVARNITTEDSSPP 337
Query: 374 WKYCYN----ASSEEMLKVPDMRLIFSKN 398
+++CY+ + VP + L+ N
Sbjct: 338 FEFCYSFDNLPGTPLGASVPTIELLLQNN 366
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/349 (25%), Positives = 146/349 (41%), Gaps = 66/349 (18%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++IGTP V +D GS+L WVPC C+ C S + +S + PS SSS
Sbjct: 16 LNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNS------KLMSAFSPSHSSS 69
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDI 213
S SC+ P C S + DPC +A S T +G +V
Sbjct: 70 SYRDSCASPYCTDIHSSDNSFDPC-TVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGT 128
Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
L + H + V + C +Y + P G+ G G +S PS L GL+
Sbjct: 129 LTRDTLRVHEGPARVTKDIPKFCFGCVGSTYHE---PIGIAGFVRGTLSFPSQL---GLL 182
Query: 274 QNSFSICF-------DENDSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGVESYCI 323
+ FS CF + N S + GD +++ + F P+ + + Y++G+E+ +
Sbjct: 183 KKGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAITV 242
Query: 324 GNSCLT-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR---ISL 369
GN T Q L+DSG ++T LP Y++++ F +++ R + +
Sbjct: 243 GNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYPRATEVEM 302
Query: 370 QGNSWKYCY------NASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSF 410
+ + CY N +++ P + F N SFV+ NH ++
Sbjct: 303 RA-GFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAM 350
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 153/380 (40%), Gaps = 58/380 (15%)
Query: 55 KKNSVEYLELL----LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHF-----FGNQ 105
+ + LE+L + ++ R+K + N ++ ++L SQT F FG
Sbjct: 72 RAPAPSLLEMLRWDQVRTEYVRRKASGGAEDVLNPAKPRVLM----SQTDFAVRSPFGVG 127
Query: 106 FYWLHYTWIDI-GTPNV--SFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRNLS 159
WID G P V +A+D ++ W+ PC QC P +
Sbjct: 128 SGSGSSAWIDADGDPTVVSQQTMAIDTTVDVPWIQCAPCPIPQCYP----------QRDP 177
Query: 160 EYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDI 213
+DP++SS++ V C P C+S S +S C Y+ +YS +D +++G + D
Sbjct: 178 LFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRSANAECRYLIEYS-DDRATAGTYMTDT 236
Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
L ++ ++ + GC G + D A G M LG G S+ + A++ +
Sbjct: 237 LTISG-------TTAVRNFRFGCSHAVRGRFSDLTA--GTMSLGGGAQSLLAQTARS--L 285
Query: 274 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA------YFVGVESYCIGNSC 327
N+FS C + S S F GPAT ST+ + Y V ++ +
Sbjct: 286 GNAFSYCVPQA-SASGFLSIGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRR 344
Query: 328 LTQSGFQ----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
L A++DS A T LP Y + F + + S + CY+
Sbjct: 345 LGIPPVAFSAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRSGATGTLDTCYDFLGL 404
Query: 384 EMLKVPDMRLIFSKNQSFVV 403
++VP + L+F V+
Sbjct: 405 TNVRVPAVSLVFGGGAVVVL 424
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 153/366 (41%), Gaps = 56/366 (15%)
Query: 55 KKNSVEYLELLLSNDWKRQKTRVKLQSN----NNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
K NS + ++L ++ + + +L N +N ++ PS+ + T GN +
Sbjct: 93 KANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKASKATLPSKSASTLGSGN-----Y 147
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
+ +G+P D GS+L W QC P Y + +DPS+S S
Sbjct: 148 VVTVGLGSPKRDLTFIFDTGSDLTWT-----QCEPCVGYCYQQREH---IFDPSTSLSYS 199
Query: 171 NVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
NVSC P C+ S C S C Y Y + + S G+ + L L S
Sbjct: 200 NVSCDSPSCEKLESATGNSPGCSS--STCLYGIRYG-DGSYSIGFFAREKLSLTS----- 251
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFD 282
+ V ++ GCG+ G + G A G++GL +S+ S A K G + FS C
Sbjct: 252 --TDVFNNFQFGCGQNNRGLF-GGTA--GLLGLARNPLSLVSQTAQKYGKV---FSYCLP 303
Query: 283 --ENDSGSVFFGDQGPATQQSTSFLP--IGEKYDAYF--------VGVESYCIGNSCLTQ 330
+ +G + FG G ++ F P + Y +++ VG I S +
Sbjct: 304 SSSSSTGYLSFG-SGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFST 362
Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
+G ++DSG + LP +Y+ V F +L+S + CY+ S + +KVP
Sbjct: 363 AG--TIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPK 420
Query: 391 MRLIFS 396
+ L FS
Sbjct: 421 IILYFS 426
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 100/411 (24%), Positives = 167/411 (40%), Gaps = 54/411 (13%)
Query: 34 DEAKERWISKSGNVSVADSWPKKNSVEYLELL---LSNDWKRQKTRVK-LQSNNNSSRNQ 89
+E E+W+ K V D NS ++ L L D KR + ++ L S S
Sbjct: 66 EEGGEKWMMK---VVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYRV 122
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSA 148
F ++ G+ Y++ I +G+P S + +D+GS+++WV CQ C QC
Sbjct: 123 DDFGTDVISGMEQGSGEYFVR---IGVGSPPRSQYMVIDSGSDIVWVQCQPCTQC----- 174
Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 208
Y D +DP+ S+S VSCS +C + C Y Y + + + G
Sbjct: 175 --YHQSD---PVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYG-DGSYTKGT 228
Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
L L +F + ++ SV IGCG + G ++ A G+ G + V
Sbjct: 229 LA---LETLTFGR-----TMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVG-----Q 275
Query: 269 KAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYC 322
G +FS C + SGS+ FG + A +++P+ A Y++G+
Sbjct: 276 LGGQTGGAFSYCLVSRGTDSSGSLVFGRE--ALPAGAAWVPLVRNPRAPSFYYIGLAGLG 333
Query: 323 IGNS---------CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 372
+G LT+ G +V D+G + T LPT Y F ++ +
Sbjct: 334 VGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA 393
Query: 373 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
+ CY+ ++VP + FS + F P ++ G CF++
Sbjct: 394 IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTF-CFAF 443
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 123/277 (44%), Gaps = 32/277 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQC--IQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
++IG P+ + + +D GS+L W+ C QC YY S+
Sbjct: 24 LNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPHPYY--------------KPSNNL 69
Query: 172 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSV 228
V+C P+C+S + + P DY E SS G LV D +L +F+ QS +
Sbjct: 70 VACKDPICQSLHTGGDQRCENPGQCDYEVEYADGGSSLGVLVKDAFNL-NFTSEKRQSPL 128
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
+ + G + G+Y DGV+GLG G S+ S L+ GL++N C G
Sbjct: 129 LALGLCGYDQLPGGTY---HPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGHCLSGRGGGF 185
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DSGASFTF 345
+FFGD + + ++ P+ Y G +GF+ L+ DSGAS+T+
Sbjct: 186 LFFGDDLYDSSR-VAWTPMSPNAKHYSPGFAELTFDGK---TTGFKNLIVAFDSGASYTY 241
Query: 346 LPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNA 380
L +++Y ++ + +S+K R +L + C+
Sbjct: 242 LNSQVYQGLISLIKRELSTKPLREALDDQTLPICWKG 278
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/301 (25%), Positives = 117/301 (38%), Gaps = 49/301 (16%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP F V D GS+ WV QC P A Y + +DP+ S++ N+S
Sbjct: 100 VRLGTPAERFTVVFDTGSDTTWV-----QCQPCVAYCYRQKE---PLFDPTKSATYANIS 151
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
CS C C Y Y + + + G+ D L LA +
Sbjct: 152 CSSSYCSDLYVSGCSGGHCLYGIQYG-DGSYTIGFYAQDTLTLA--------YDTIKNFR 202
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
GCG K G + A G++GLG G S+P K G + F+ C +G+ F
Sbjct: 203 FGCGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDKYGGV---FAYCLPATSAGTGFLD 256
Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG-----FQALVDSGASFTF 345
G PA + + + Y+VG+ +G L G LVDSG T
Sbjct: 257 LGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITR 316
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWK---------YCYNASSEE--MLKVPDMRLI 394
LP YA + F K ++QG + CY+ + + + +P + L+
Sbjct: 317 LPPSAYAPLRSAFSK-------AMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLV 369
Query: 395 F 395
F
Sbjct: 370 F 370
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/306 (25%), Positives = 129/306 (42%), Gaps = 43/306 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + D GS+L W QC P + Y D + PS S++ N+S
Sbjct: 135 VGLGTPKKYLSLIFDTGSDLTWT-----QCQPCARYCYNQKD---PVFVPSQSTTYSNIS 186
Query: 174 CSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
CS P C S C + + C Y Y + + S GY + L L S +
Sbjct: 187 CSSPDCSQLESGTGNQPGCSAAR-ACIYGIQYG-DQSFSVGYFAKETLTLTS-------T 237
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDEND 285
V + + GCG+ G + A G++GLG +S+ A K G + FS C +
Sbjct: 238 DVIENFLFGCGQNNRGLFGSAA---GLIGLGQDKISIVKQTAQKYGQV---FSYCLPKTS 291
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYD-AYFVGVE---------SYCIGNSCLTQSGFQA 335
S + + G + + PI + + A F GV+ I +S + SG A
Sbjct: 292 SSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSG--A 349
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
++DSG T LP + Y+ + F+K ++ + + + CY+ S +++P + +F
Sbjct: 350 IIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVF 409
Query: 396 SKNQSF 401
+
Sbjct: 410 KGGEEL 415
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 156/375 (41%), Gaps = 41/375 (10%)
Query: 77 VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWV 136
VKL N+S Q+LF +QT + + +L + IGTP V +D GS+L+W+
Sbjct: 31 VKLIPRNSS---QVLFNRITAQTPVSVHHYDYL--MELSIGTPPVKTYAQVDTGSDLIWL 85
Query: 137 PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPY 194
QCI C + Y L+ +DP SSS+ N++ C +SC ++ C Y
Sbjct: 86 --QCIPC----TNCYKQLN---PMFDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNY 136
Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
Y +D+ + G L + L L S + + VI GCG G + D G++
Sbjct: 137 TYSYE-DDSITEGVLAQETLTLTSTTG---KPVALKGVIFGCGHNNNGVFNDKEM--GII 190
Query: 255 GLGLGDVSVPSLLAKA--GLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEK 310
GLG G +S+ S + + G + + + F N S + FG P+ K
Sbjct: 191 GLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSK 250
Query: 311 --YDAYF------VGVESYCI----GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
+ A++ + VE + G+S + ++DSG T LP + Y +V +
Sbjct: 251 NTHQAFYFVTLLGISVEDINLPFNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEV 310
Query: 359 DKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGD 417
V+ I + ++ CY + LK + F + IF ++ +
Sbjct: 311 RNKVALDPIPIDPTLGYQLCYRTPTN--LKGTTLTAHFEGADVLLTPTQIFIPVQDGIFC 368
Query: 418 HACFSYFTLEYNFTG 432
A S F+ EY G
Sbjct: 369 FAFTSTFSNEYGIYG 383
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 130/304 (42%), Gaps = 51/304 (16%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP +++ +D GS+L+W C+ C+ C ++ +DPSSSS+ V C
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDC----------FKQSTPVFDPSSSSTYATVPC 222
Query: 175 SHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
S C S C S C Y Y + +S+ G L + LA S V
Sbjct: 223 SSASCSDLPTSKCTSASK-CGYTYTYG-DSSSTQGVLATETFTLA--------KSKLPGV 272
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSV 289
+ GCG G A G++GLG G + SL+++ GL + FS C D+ ++ +
Sbjct: 273 VFGCGDTNEGDGFSQGA--GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNSPL 325
Query: 290 FFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQ----- 334
G ++ Q+T + + Y+V +++ +G++ L S F
Sbjct: 326 LLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDG 385
Query: 335 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
+VDSG S T+L + Y + F ++ G C+ A ++ + +V
Sbjct: 386 TGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVP 445
Query: 392 RLIF 395
RL+F
Sbjct: 446 RLVF 449
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 120/294 (40%), Gaps = 36/294 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP F + D GS + W QC P S Y ++ ++DP+ S+S NVS
Sbjct: 139 VGLGTPKEDFTLVFDTGSGITWT-----QCQPCLGSCYPQKEQ---KFDPTKSTSYNNVS 190
Query: 174 CSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
CS C S C + C Y Y + + S G+ + L ++S S V
Sbjct: 191 CSSASCNLLPTSERGCSASNSTCLYQIIYG-DQSYSQGFFATETLTISS-------SDVF 242
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
++ + GCG+ G + A G+ SV A Q FS C S +
Sbjct: 243 TNFLFGCGQSNNGLFGQAAGLLGLS-----SSSVSLPSQTAEKYQKQFSYCLPSTPSSTG 297
Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYF----VGV----ESYCIGNSCLTQSGFQALVDSGA 341
+ + G Q+ F PI + +++ VG+ I S T SG A++DSG
Sbjct: 298 YL-NFGGKVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTTSG--AIIDSGT 354
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T LP Y + FD+ +S+ + CY+ S+ + P + + F
Sbjct: 355 VITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSF 408
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 120/283 (42%), Gaps = 53/283 (18%)
Query: 107 YWLHYTWIDIGTPNVSFL-VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y +H+ +IGTP + + +D GS+L+W QC P + D+ +DPS
Sbjct: 87 YLIHF---NIGTPRPQRVALTMDTGSDLVWT-----QCTPCPVCF----DQPFPLFDPSV 134
Query: 166 SSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
SS+ + V+C P+C+ S S+C C Y+ Y + + ++GY+ D S +
Sbjct: 135 SSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYG-DKSITAGYIFKDTFTFMSPN 193
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
S + GCG TG + + G+ G G G +S+PS L + G FS C
Sbjct: 194 GEGAPPVAVSGLAFGCGDYNTGVFASNES--GIAGFGRGPLSLPSQL-RVG----RFSYC 246
Query: 281 F------DENDSGSVFFG---------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
+ N + +VF G GP +ST + Y++ +E +G
Sbjct: 247 LTSHDETESNKTSAVFLGTPPNGLRAHSSGPF--RSTPIIHSPSFPTFYYLSLEGITVGK 304
Query: 326 SCL-TQSGFQAL---------VDSGASFTFLPTEIYAEVVVKF 358
+ L S AL +DSG T P ++ ++ +F
Sbjct: 305 TRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEF 347
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/312 (25%), Positives = 139/312 (44%), Gaps = 30/312 (9%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+++G+P S L D GS+L+WV C+ SA+ T +++DPS SS+ VS
Sbjct: 105 VNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPT------TQFDPSRSSTYGRVS 158
Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL-ASFSKHAPQSSVQS 230
C C++ R++C + C Y+ Y + ++++G L + S +P+
Sbjct: 159 CQTDACEALGRATCDDGSN-CAYLYAYG-DGSNTTGVLSTETFTFDDGGSGRSPRQVRVG 216
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSG 287
V GC GS+ +GLG G VS+ + L A + FS C N S
Sbjct: 217 GVKFGCSTATAGSFPADGL----VGLGGGAVSLVTQLGGATSLGRRFSYCLVPHSVNASS 272
Query: 288 SVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSG-FQALVDSGASFT 344
++ FG T+ + P+ G+ Y V ++S +GN + + + +VDSG + T
Sbjct: 273 ALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASSRIIVDSGTTLT 332
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK---VPDMRLIFSKNQSF 401
FL + +V + + ++ + + CYN + E+ +PD+ L F +
Sbjct: 333 FLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAV 392
Query: 402 VVRNHIFSFPEN 413
++ PEN
Sbjct: 393 ALK------PEN 398
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 130/311 (41%), Gaps = 54/311 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP LD GS +W C C+ C ++ +DPS SS+ K +
Sbjct: 63 LQIGTPPFEIEAVLDTGSEHIWTQCLPCVHC----------YNQTAPIFDPSKSSTFKEI 112
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
C + CPY Y + + + G LV + + + S S Q V
Sbjct: 113 -----------RCDTHDHSCPYELVYGGK-SYTKGTLVTETVTIHSTSG---QPFVMPET 157
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
IIGCGR +G + G A GV+GL G S+ + G S CF + + FG
Sbjct: 158 IIGCGRNNSG-FKPGFA--GVVGLDRGPKSL--ITQMGGEYPGLMSYCFAGKGTSKINFG 212
Query: 293 DQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQAL-----VDSGA 341
G +T F+ K Y++ +++ +GN+ + G F AL +DSG+
Sbjct: 213 ANAIVAGDGVVSTTVFVKTA-KPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGS 271
Query: 342 SFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR------LI 394
+ T+ P E Y +V K +++V++ R S CY + + ++ V M L+
Sbjct: 272 TLTYFP-ESYCNLVRKAVEQVVTAVRFP---RSDILCYYSKTIDIFPVITMHFSGGADLV 327
Query: 395 FSKNQSFVVRN 405
K +V N
Sbjct: 328 LDKYNMYVASN 338
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 112/272 (41%), Gaps = 34/272 (12%)
Query: 93 PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
PS GN + H+ ++IG P S+ + +D GS L W+ C C C +
Sbjct: 20 PSSAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHV 79
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTED 202
Y + L V+C+ LC C S K C Y+ Y D
Sbjct: 80 LYKPTPKKL-------------VTCADSLCTDLYTDLGKPKRCGSQKQ-CDYVIQYV--D 123
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDV 261
+SS G LV D FS A + +++ GCG Q + P D ++GL G V
Sbjct: 124 SSSMGVLVID-----RFSLSASNGTNPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKV 178
Query: 262 SVPSLLAKAGLI-QNSFSICFDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVE 319
++ S L G+I ++ C G +FFGD Q P + + + + KY + G
Sbjct: 179 TLLSQLKSQGVITKHVLGHCISSKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTL 238
Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIY 351
+ + ++ + + DSGA++T+ + Y
Sbjct: 239 HFDSNSKAISAAPMAVIFDSGATYTYFAAQPY 270
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 85/338 (25%), Positives = 137/338 (40%), Gaps = 42/338 (12%)
Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDP 163
N + + + IGTP + +D GS+L+WV QC P Y ++ +DP
Sbjct: 58 NAYIGQYLMELYIGTPPIKISGTVDTGSDLIWV-----QCVPCLGCY----NQINPMFDP 108
Query: 164 SSSSSSKNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
SS+ N+SC PLC C K C Y Y+ + + + G L + + L S +
Sbjct: 109 LKSSTYTNISCDSPLCYKPYIGECSPEKR-CDYTYGYA-DSSLTKGVLAQETVTLTSNTG 166
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI--QNSFSI 279
S+Q ++ GCG TG++ D G++GLG G SL+++ G + FS
Sbjct: 167 KP--ISLQ-GILFGCGHNNTGNFNDHEM--GLIGLGGGPT---SLVSQIGPLFGGKKFSQ 218
Query: 280 CF-----DENDSGSVFFGDQGPATQQSTSFLPIGEK------YDAYFVGV---ESYCIGN 325
C D S + FG + P+ ++ Y +G+ ++Y N
Sbjct: 219 CLVPFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMN 278
Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEE 384
S + + LVDSG LP ++Y V V+ V + I+ + + CY +
Sbjct: 279 STIEKGNM--LVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTN- 335
Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFS 422
LK P + F + F P E C +
Sbjct: 336 -LKGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLA 372
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 123/305 (40%), Gaps = 42/305 (13%)
Query: 59 VEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGT 118
V + E ++ R+ + S+ + +L G+ HFF ++IG
Sbjct: 361 VPHSEAIIHETPNRKVGTARQPSSPAPTGAAILCRGVGAPRHFF---------ITMNIGD 411
Query: 119 PNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
P S+ + +D GS L W+ C C C + Y + L V+C+
Sbjct: 412 PAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKL-------------VTCAD 458
Query: 177 PLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
LC C S K C Y+ Y D+SS G LV D FS A +
Sbjct: 459 SLCTDLYTDLGKPKRCGSQKQ-CDYVIQYV--DSSSMGVLVID-----RFSLSASNGTNP 510
Query: 230 SSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSG 287
+++ GCG Q + P D ++GL G V++ S L G+I ++ C G
Sbjct: 511 TTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGG 570
Query: 288 SVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFL 346
+FFGD Q P + + + + KY + G + + ++ + + DSGA++T+
Sbjct: 571 FLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYF 630
Query: 347 PTEIY 351
+ Y
Sbjct: 631 AAQPY 635
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/290 (27%), Positives = 125/290 (43%), Gaps = 29/290 (10%)
Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
P V V LD+ S++ WV QC+ C P+ + +D S YDPS S +S SCS P
Sbjct: 25 PGVIQTVVLDSASDVPWV--QCVPC-PIPPCH-PQVD---SFYDPSRSPTSAAFSCSSPT 77
Query: 179 CKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
C + + + C Y+ Y + +S+SG + D+L L + + S GC
Sbjct: 78 CTALGPYANGCANNQCQYLVRYP-DGSSTSGAYIADLLTLDA-------GNAVSGFKFGC 129
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGP 296
+ GS+ AA G+M LG G S+ L A N+FS C S S FF P
Sbjct: 130 SHAEQGSFDARAA--GIMALGGGPESL--LSQTASRYGNAFSYCIPATASDSGFFTLGVP 185
Query: 297 ATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA--LVDSGASFTFLPT 348
S T + + Y V + + +G L + F A ++DS + T LP
Sbjct: 186 RRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPP 245
Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
Y + F ++ R + CY+ + +++P + L+F +N
Sbjct: 246 TAYQALRAAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRN 295
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 90/331 (27%), Positives = 147/331 (44%), Gaps = 45/331 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+++ I +GTP V LD GS++ W IQC P S Y ++ +DP+SSS+
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNW-----IQCLPCSECY----QQSDPIFDPTSSSTF 214
Query: 170 KNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
K+++CS P C S S+C+S K C Y Y + Y D + +F +S
Sbjct: 215 KSLTCSDPKCASLDVSACRSNK--CLYQVSYGDGSFTVGNYATDTV----TFG----ESG 264
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
+ V +GCG G + A G+ G L + + AK SFS C + DS
Sbjct: 265 KVNDVALGCGHDNEGLFTGAAGLLGLGGGALSMTN--QIKAK------SFSYCLVDRDSA 316
Query: 288 ---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSGFQ------- 334
S+ F +T+ L K D Y+VG+ + +G + S F+
Sbjct: 317 KSSSLDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAG 376
Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYNASSEEMLKVPDMR 392
++D G + T L T+ Y + F KL + K+ + + + CY+ SS +KVP +
Sbjct: 377 GVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVT 436
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F+ +S + + P ++ G CF++
Sbjct: 437 FHFTGGKSLNLPAKNYLIPIDDAGTF-CFAF 466
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 115/294 (39%), Gaps = 35/294 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP F V D GS+ WV QC P A Y + +DP+ S++ N+S
Sbjct: 165 VRLGTPAERFTVVFDTGSDTTWV-----QCQPCVAYCYRQKE---PLFDPTKSATYANIS 216
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
CS C C Y Y + + + G+ D L LA +
Sbjct: 217 CSSSYCSDLYVSGCSGGHCLYGIQYG-DGSYTIGFYAQDTLTLA--------YDTIKNFR 267
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
GCG K G + A G++GLG G S+P K G + F+ C +G+ F
Sbjct: 268 FGCGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDKYGGV---FAYCLPATSAGTGFLD 321
Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG-----FQALVDSGASFTF 345
G PA + + + Y+VG+ +G L G LVDSG T
Sbjct: 322 LGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITR 381
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQG--NSWKYCYNASSEE--MLKVPDMRLIF 395
LP YA + F K + S + CY+ + + + +P + L+F
Sbjct: 382 LPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVF 435
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 130/311 (41%), Gaps = 54/311 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP LD GS +W C C+ C ++ +DPS SS+ K +
Sbjct: 69 LQIGTPPFEIEAVLDTGSEHIWTQCLPCVHC----------YNQTAPIFDPSKSSTFKEI 118
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
C + CPY Y + + + G LV + + + S S Q V
Sbjct: 119 -----------RCDTHDHSCPYELVYGGK-SYTKGTLVTETVTIHSTSG---QPFVMPET 163
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
IIGCGR +G + G A GV+GL G S+ + G S CF + + FG
Sbjct: 164 IIGCGRNNSG-FKPGFA--GVVGLDRGPKSL--ITQMGGEYPGLMSYCFAGKGTSKINFG 218
Query: 293 DQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQAL-----VDSGA 341
G +T F+ K Y++ +++ +GN+ + G F AL +DSG+
Sbjct: 219 ANAIVAGDGVVSTTVFVKTA-KPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGS 277
Query: 342 SFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR------LI 394
+ T+ P E Y +V K +++V++ R S CY + + ++ V M L+
Sbjct: 278 TLTYFP-ESYCNLVRKAVEQVVTAVRFP---RSDILCYYSKTIDIFPVITMHFSGGADLV 333
Query: 395 FSKNQSFVVRN 405
K +V N
Sbjct: 334 LDKYNMYVASN 344
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 81/310 (26%), Positives = 130/310 (41%), Gaps = 54/310 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP +S+ +D GS+L+W C+ C+ C ++ +DPSSSS+ V
Sbjct: 104 VAIGTPALSYAAIVDTGSDLVWTQCKPCVDC----------FKQSTPVFDPSSSSTYATV 153
Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
CS LC S+C S C Y Y + +S+ G L + L K P
Sbjct: 154 PCSSALCSDLPTSTCTSASK-CGYTYTYG-DASSTQGVLASETFTLGKEKKKLP------ 205
Query: 231 SVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS- 288
V GCG G + GA G++GLG G + SL+++ GL + FS C D G
Sbjct: 206 GVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDGDG 257
Query: 289 ---VFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ- 334
+ G A Q+T + + Y+V + +G++ +T S F
Sbjct: 258 KSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAI 317
Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE--EM 385
+VDSG S T+L + Y + F ++ + C+ ++ +
Sbjct: 318 QDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGSEIGLDLCFQGPAKGVDE 377
Query: 386 LKVPDMRLIF 395
++VP + L F
Sbjct: 378 VQVPKLVLHF 387
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/324 (25%), Positives = 135/324 (41%), Gaps = 47/324 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC----QCIQCAPLSASYYTSLDRNLSEYDPSS 165
++ + +GTP L+ D GS+L+WV C C + P SA L R+ + + P+
Sbjct: 89 YFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSA----FLARHSTTFSPNH 144
Query: 166 SSSSKNVSCSH-PLCK-SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKH 222
S +C PL K R + L PC Y +YS D S +SG+ + L + S
Sbjct: 145 CYDS---ACQLVPLPKHHRCNHARLHSPCRY--EYSYGDGSKTSGFFSKETTTLNTSSG- 198
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
+ + + GC + +G + GA+ GVMGLG G +S+ S L N FS
Sbjct: 199 --REAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHR--FGNKFSY 254
Query: 280 CFDEND-----SGSVFFG----DQGPATQQSTSFLPIGEKYDA---YFVGVESYCI-GNS 326
C ++D + + G D P ++ F P+ + Y++G+ES + G
Sbjct: 255 CLMDHDISPSPTSYLLIGSTQNDVAPG-KRRMRFTPLHINPLSPTFYYIGIESVSVDGIK 313
Query: 327 CLTQSGFQAL---------VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 377
AL VDSG + TFLP Y +++ + V + + C
Sbjct: 314 LPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLC 373
Query: 378 YNASSEEMLKVPDMRLIFSKNQSF 401
N S E ++P + + F
Sbjct: 374 VNVSEIEHPRLPKLSFKLGGDSVF 397
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 100/406 (24%), Positives = 154/406 (37%), Gaps = 64/406 (15%)
Query: 60 EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFF-GNQFYWLHY-TWIDIG 117
YL LL+ D R + Q N R S ++ G + L+Y T I +G
Sbjct: 95 RYLRRLLAADESRANS---FQPRRNKDRASASTQSASAEVPLTSGIRLQTLNYVTTISLG 151
Query: 118 ----TPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+P + V +D GS+L WV QC P SA Y + +DP+ S++ V
Sbjct: 152 GSSGSPAANLTVIVDTGSDLTWV-----QCKPCSACYA----QRDPLFDPAGSATYAAVR 202
Query: 174 CSHPLCK--------SRSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
C+ C + SC S + C Y Y + + S G L D + L S
Sbjct: 203 CNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYG-DGSFSRGVLATDTVALGGAS-- 259
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+ GCG G + G+MGLG ++S+ S A FS C
Sbjct: 260 ------LGGFVFGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTAS--RYGGVFSYCLP 308
Query: 283 ENDSG------SVFFGDQGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCL 328
SG S+ GD ++ ++T+ P+ + YF+ V +G + L
Sbjct: 309 AATSGDASGSLSLGGGDDAASSYRNTT--PVAYTRMIADPAQPPFYFLNVTGAAVGGTAL 366
Query: 329 TQSGFQA---LVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNS-WKYCYNASSE 383
G A L+DSG T L +Y V +F + ++ + G S CY+ +
Sbjct: 367 AAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGH 426
Query: 384 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYN 429
+ +KVP + L V F + G C + +L Y
Sbjct: 427 DEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYE 472
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 89/343 (25%), Positives = 142/343 (41%), Gaps = 54/343 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+++GTP V +D GS+L WVPC + + + Y + ++ +S Y PS SSSS
Sbjct: 16 LNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRN-NKLMSTYSPSYSSSSLRDL 74
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDILHLAS 218
C PLC S + DPC +A S T +G +V L +
Sbjct: 75 CVSPLCSDVHSSDNSYDPC-AVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDT 133
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ H S V C +Y + P G+ G G G +S+PS L G +Q FS
Sbjct: 134 LTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQL---GFLQKGFS 187
Query: 279 ICF-------DENDSGSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
CF + N S + GD ++ Q TS L + Y++G+E+ +GN+
Sbjct: 188 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 247
Query: 329 TQ-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--GNSWK 375
Q ++DSG ++T LP Y +++ +++ R Q +
Sbjct: 248 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD 307
Query: 376 YCY------NASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSF 410
CY N ++ +P + FS N S V+ NH ++
Sbjct: 308 LCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAM 350
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 144/371 (38%), Gaps = 49/371 (13%)
Query: 54 PKKNSVEYLELLLSNDWKR----QKTRVKLQSNNNSSRNQLLFPSEGSQTHFF-GNQFYW 108
P + E + LLS D R Q + SS ++ + +Q G +
Sbjct: 81 PANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVTASKAQVPVSSGARLRT 140
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
L+Y +G V +D S L WV QCAP + + D+ +DPSSS S
Sbjct: 141 LNYVAT-VGLGGGEATVIVDTASELTWV-----QCAPCESCH----DQQGPLFDPSSSPS 190
Query: 169 SKNVSCSHPLCKS-----RSSCKSLKDPC----PYIADYS---TEDTSSSGYLVDDILHL 216
V C P C + + + PC P Y+ + + S G L D L L
Sbjct: 191 YAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSL 250
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK--AGLIQ 274
A V + GCG G G + G+MGLG +S+ S G+
Sbjct: 251 A--------GEVIDGFVFGCGTSNQGPPFGGTS--GLMGLGRSQLSLVSQTVDQFGGVFS 300
Query: 275 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--------YFVGVESYCIGNS 326
+ + + SGS+ GD A + ST + ++ Y V + +G
Sbjct: 301 YCLPLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQ 360
Query: 327 CLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
+ +GF +A+VDSG T L +Y V +F ++ + + C+N + +
Sbjct: 361 EVESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNMTGLK 420
Query: 385 MLKVPDMRLIF 395
++VP + L+F
Sbjct: 421 EVQVPSLTLVF 431
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 89/343 (25%), Positives = 142/343 (41%), Gaps = 54/343 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+++GTP V +D GS+L WVPC + + + Y + ++ +S Y PS SSSS
Sbjct: 33 LNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRN-NKLMSTYSPSYSSSSLRDL 91
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDILHLAS 218
C PLC S + DPC +A S T +G +V L +
Sbjct: 92 CVSPLCSDVHSSDNSYDPC-AVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDT 150
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ H S V C +Y + P G+ G G G +S+PS L G +Q FS
Sbjct: 151 LTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQL---GFLQKGFS 204
Query: 279 ICF-------DENDSGSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
CF + N S + GD ++ Q TS L + Y++G+E+ +GN+
Sbjct: 205 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 264
Query: 329 TQ-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--GNSWK 375
Q ++DSG ++T LP Y +++ +++ R Q +
Sbjct: 265 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD 324
Query: 376 YCY------NASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSF 410
CY N ++ +P + FS N S V+ NH ++
Sbjct: 325 LCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAM 367
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 145/359 (40%), Gaps = 55/359 (15%)
Query: 56 KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
K V+Y+ LS + + + +L S +++ L S GN ++ +
Sbjct: 105 KERVKYINSRLSKNLGQDSSVEELDSATLPAKSGSLIGS--------GN-----YFVVVG 151
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+GTP + D GS+L W QC P + S Y D +DPS S+S N++C+
Sbjct: 152 LGTPKRDLSLIFDTGSDLTWT-----QCEPCARSCYKQQD---VIFDPSKSTSYSNITCT 203
Query: 176 HPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
LC S+ C + C Y Y + + S GY + L + + + V
Sbjct: 204 SALCTQLSTATGNDPGCSASTKACIYGIQYG-DSSFSVGYFSRERLTVTA-------TDV 255
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-- 286
+ + GCG+ G + A G++GLG +S + A + FS C S
Sbjct: 256 VDNFLFGCGQNNQGLFGGSA---GLIGLGRHPISF--VQQTAAKYRKIFSYCLPSTSSST 310
Query: 287 GSVFFGDQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCL-----TQSGFQALV 337
G + F GPA + T F I Y + + + +G L T S A++
Sbjct: 311 GHLSF---GPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAII 367
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
DSG T LP Y + F + +S + + + CY+ S ++ +P + F+
Sbjct: 368 DSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTIEFSFA 426
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 124/296 (41%), Gaps = 41/296 (13%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+ WV QC P Y + +DP+ SS+ NVS
Sbjct: 167 VGLGTPASKYTVVFDTGSDTTWV-----QCRPCVVKCY---KQKEPLFDPAKSSTYANVS 218
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ C + C Y Y + + + G+ D L +A
Sbjct: 219 CTDSACADLDTNGCTGGHCLYAVQYG-DGSYTVGFFAQDTLTIA--------HDAIKGFR 269
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
GCG K G + A G+MGLG G S+ + +F+ C +G+ + D
Sbjct: 270 FGCGEKNNGLFGKTA---GLMGLGRGKTSL--TVQAYNKYGGAFAYCLPALTTGTGYL-D 323
Query: 294 QGPATQQSTSFL-PI----GEKYDAYFVGVESYCIGN-------SCLTQSGFQALVDSGA 341
GP + + + L P+ G+ + Y+VG+ +G S + +G LVDSG
Sbjct: 324 FGPGSAGNNARLTPMLTDKGQTF--YYVGMTGIRVGGQQVPVAESVFSTAG--TLVDSGT 379
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T LP Y + FDK++ ++ + + + CY+ + +++P + L+F
Sbjct: 380 VITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVF 435
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 87/347 (25%), Positives = 148/347 (42%), Gaps = 51/347 (14%)
Query: 71 KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
+R +RV + + ++N +F ++ +Q+ NQ +L +GTP L D G
Sbjct: 59 RRSMSRVH---HFSPTKNSDIF-TDTAQSEMISNQGEYLM--KFSLGTPAFDILAIADTG 112
Query: 131 SNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KSRSSCK 186
S+L+W C+ C QC +++ +DP SSS+ +++SCS C K +SC
Sbjct: 113 SDLIWTQCKPCDQC----------YEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGASCS 162
Query: 187 SLKDP-CPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
+ C Y YS D S +SG + D + L S S + + IIGCG GS+
Sbjct: 163 GEGNKTCHY--SYSYGDRSFTSGNVAADTITLGSTSG---RPVLLPKAIIGCGHNNGGSF 217
Query: 245 LDGAAPDGVMGLGLGDVSVP-SLLAKAG-LIQNSFSICF-----DENDSGSVFFGDQGPA 297
+ + + P SL+++ G I FS C + +S + FG G
Sbjct: 218 TEKGSGIVGL------GGGPISLISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGSNGIV 271
Query: 298 TQQSTSFLP-IGEKYDA-YFVGVESYCIGN-------SCLTQSGFQALVDSGASFTFLPT 348
+ P I + D YF+ +E+ +G+ S S ++DSG + T P
Sbjct: 272 SGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSEGNIIIDSGTTLTLFPE 331
Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ ++E+ V+ + CY+ ++ LK P + F
Sbjct: 332 DFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDAD--LKFPSITAHF 376
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 135/309 (43%), Gaps = 51/309 (16%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++T + +GTP + LD GS+++W+ C CI+C Y+ D +DP+ S S
Sbjct: 145 YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKC-------YSQTD---PVFDPTKSRS 194
Query: 169 SKNVSCSHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP-- 224
N+ C PLC+ C + K C Y Y D + FS
Sbjct: 195 FANIPCGSPLCRRLDYPGCSTKKQICLYQVSYG-----------DGSFTVGEFSTETLTF 243
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ + V++GCG G ++ A ++GLG G +S PS + + + FS C +
Sbjct: 244 RGTRVGRVVLGCGHDNEGLFVGAAG---LLGLGRGRLSFPSQIGRR--FNSKFSYCLGDR 298
Query: 285 DS----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SCLTQSGFQ 334
+ S+ FGD A ++T F P+ K D Y+V + +G S ++ S F+
Sbjct: 299 SASSRPSSIVFGDS--AISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFK 356
Query: 335 --------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
++DSG S T L Y + F S+ + + + + + C++ S + +
Sbjct: 357 LDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEV 416
Query: 387 KVPDMRLIF 395
KVP + L F
Sbjct: 417 KVPTVVLHF 425
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 91/370 (24%), Positives = 157/370 (42%), Gaps = 49/370 (13%)
Query: 74 KTRVKLQSNNNS----SRNQLLFPSEGSQTHFFGN-QFYWLHYTWIDIGTPNVSFLVALD 128
K ++ L S N S + +LL P + S G Q +++ + +G P+ F + LD
Sbjct: 116 KLQLALSSLNRSDLYPTETELLRPEDLSTPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLD 175
Query: 129 AGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCK 186
GS++ W +QC P S Y ++ +DP++SSS ++C C+ S+C+
Sbjct: 176 TGSDVNW-----LQCKPCSDCY----QQSDPIFDPTASSSYNPLTCDAQQCQDLEMSACR 226
Query: 187 SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD 246
+ K C Y Y + Y+ + + SF + + V IGCG G +
Sbjct: 227 NGK--CLYQVSYGDGSFTVGEYVTETV----SFGAGS-----VNRVAIGCGHDNEGLF-- 273
Query: 247 GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG---SVFFGDQGPATQQSTS 303
V GL + L + + SFS C + DSG ++ F P
Sbjct: 274 ------VGSAGLLGLGGGPLSLTSQIKATSFSYCLVDRDSGKSSTLEFNSPRPGDSVVAP 327
Query: 304 FLPIGEKYDAYFVGVESYCIGNSCLT---------QSGFQA-LVDSGASFTFLPTEIYAE 353
L + Y+V + +G +T QSG +VDSG + T L T+ Y
Sbjct: 328 LLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRTQAYNS 387
Query: 354 VVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN 413
V F + S+ R + + CY+ SS + ++VP + FS ++++ + + P +
Sbjct: 388 VRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPTVSFHFSGDRAWALPAKNYLIPVD 447
Query: 414 EVGDHACFSY 423
G + CF++
Sbjct: 448 GAGTY-CFAF 456
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 109/444 (24%), Positives = 179/444 (40%), Gaps = 68/444 (15%)
Query: 15 LLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVAD--SWPKKNSVEYLELLLSNDWKR 72
L + + V+F+ ++ + + ++ S + + S K + +Y L LS R
Sbjct: 38 LQNAHNVVAFTHHHPNKHQRQQESSLLTSSFGIQLHSRASIQKSSHSDYKSLTLSR-LAR 96
Query: 73 QKTRVK-LQSNNN----SSRNQLLFPSEGSQTHFFGN-----------QFYWLHYTWIDI 116
RVK LQ+ + N L P+E S+ F N Q ++ + I
Sbjct: 97 DSARVKALQTRLDLFLKRVSNSDLHPAE-SKAEFESNALQGPVVSGTSQGSGEYFLRVGI 155
Query: 117 GTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
G P V LD GS++ W IQCAP S Y S +DP SS+S + C
Sbjct: 156 GKPPSQAYVVLDTGSDVSW-----IQCAPCSECYQQSDPI----FDPISSNSYSPIRCDE 206
Query: 177 PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
P CKS + C Y Y + + + G + + L S+ +V IGC
Sbjct: 207 PQCKSLDLSECRNGTCLYEVSYG-DGSYTVGEFATETVTLG--------SAAVENVAIGC 257
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF---FGD 293
G G ++ A G+ G L S P A + SFS C DS +V F
Sbjct: 258 GHNNEGLFVGAAGLLGLGGGKL---SFP-----AQVNATSFSYCLVNRDSDAVSTLEFNS 309
Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA--------LVDSGASF 343
P + + E Y++G++ +G L +S F+ ++DSG +
Sbjct: 310 PLPRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAV 369
Query: 344 TFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
T L +E+Y + F K + + +SL + CY+ SS E +++P + F + +
Sbjct: 370 TRLRSEVYDALRDAFVKGAKGIPKANGVSL----FDTCYDLSSRESVEIPTVSFRFPEGR 425
Query: 400 SFVVRNHIFSFPENEVGDHACFSY 423
+ + P + VG CF++
Sbjct: 426 ELPLPARNYLIPVDSVGTF-CFAF 448
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 76/326 (23%), Positives = 144/326 (44%), Gaps = 65/326 (19%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQC-IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ +GTP+ + + +D GS+L+W PC CA S ++ + + ++ P SSSSK +
Sbjct: 88 LSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCA--SCNFPNTDITKIPKFMPRLSSSSKLI 145
Query: 173 SCSHPLCK------SRSSC-------KSLKDPC-PYIADYSTEDTSSSGYLVDDILHLAS 218
C +P C +S C ++ C PYI Y S++G L+ + ++
Sbjct: 146 GCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLG--STAGLLLSETINF-- 201
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
P ++ S + GC S L P+G+ G G S+P L GL + S+
Sbjct: 202 -----PNKTI-SDFLAGC------SLLSTRQPEGIAGFGRSQESLPLQL---GLKKFSYC 246
Query: 279 IC---FDENDSGSVFFGDQGPATQQST----SFLPIGEKY---------DAYFVGVESYC 322
+ FD++ S D GP+T S S+ P + + Y+V +
Sbjct: 247 LVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKII 306
Query: 323 IGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--- 369
+G + + + +VDSG++FTF+ ++ + +F+K +++ ++
Sbjct: 307 VGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQ 366
Query: 370 QGNSWKYCYNASSEEMLKVPDMRLIF 395
+ + C++ S E+ + +PD+ F
Sbjct: 367 KLTGLRPCFDISGEKSVVIPDLTFQF 392
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 77/299 (25%), Positives = 122/299 (40%), Gaps = 29/299 (9%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP LV D GS+L WV QC P Y ++ +DPS S++ V
Sbjct: 142 VGLGTPKRDLLVVFDTGSDLSWV-----QCKPCDGCY----QQHDPLFDPSQSTTYSAVP 192
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C C+ S C Y Y + + + G L D L L S + +Q +
Sbjct: 193 CGAQECRRLDSGSCSSGKCRYEVVYG-DMSQTDGNLARDTLTLGPSSSSSSSDQLQ-EFV 250
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKAGLIQNSFSICFDENDS--GSVF 290
GCG TG + DG+ GLG VS+ S AK G FS C + + G +
Sbjct: 251 FGCGDDDTGLF---GKADGLFGLGRDRVSLASQAAAKYGA---GFSYCLPSSSTAEGYLS 304
Query: 291 FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQ---ALVDSGASFTF 345
G P + T+ + + Y++ + + + S F+ ++DSG T
Sbjct: 305 LGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVITR 364
Query: 346 LPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
LP+ YA + F L+ S KR + CY+ + +++P + L+F +
Sbjct: 365 LPSRAYAALRSSFAGLMRRYSYKRAPAL-SILDTCYDFTGRNKVQIPSVALLFDGGATL 422
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 124/296 (41%), Gaps = 41/296 (13%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+ WV QC P Y + +DP+ SS+ NVS
Sbjct: 167 VGLGTPASKYTVVFDTGSDTTWV-----QCRPCVVKCY---KQKGPLFDPAKSSTYANVS 218
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ C + C Y Y + + + G+ D L +A
Sbjct: 219 CTDSACADLDTNGCTGGHCLYAVQYG-DGSYTVGFFAQDTLTIA--------HDAIKGFR 269
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD 293
GCG K G + A G+MGLG G S+ + +F+ C +G+ + D
Sbjct: 270 FGCGEKNNGLFGKTA---GLMGLGRGKTSL--TVQAYNKYGGAFAYCLPALTTGTGYL-D 323
Query: 294 QGPATQQSTSFL-PI----GEKYDAYFVGVESYCIGN-------SCLTQSGFQALVDSGA 341
GP + + + L P+ G+ + Y+VG+ +G S + +G LVDSG
Sbjct: 324 FGPGSAGNNARLTPMLTDKGQTF--YYVGMTGIRVGGQQVPVAESVFSTAG--TLVDSGT 379
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T LP Y + FDK++ ++ + + + CY+ + +++P + L+F
Sbjct: 380 VITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVF 435
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 78/323 (24%), Positives = 126/323 (39%), Gaps = 65/323 (20%)
Query: 127 LDAGSNLLWVPC----QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS- 181
+D GS+L+WVPC CI C SAS L P SSS V+C+ CK+
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPEDSASNGVFL--------PRMSSSLHLVTCADSNCKTL 52
Query: 182 ------------RSSCKSLKDPC-PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
S K+ + C PY Y S++G L+ + L+L + ++
Sbjct: 53 YGNNTELLCQSCAGSLKNCSETCPPYGIQYGR--GSTAGLLLTETLNLPLENGEGARAIT 110
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC-----FDE 283
+V GC S + P G+ G G G +S+PS L + + ++ F+ C FDE
Sbjct: 111 HFAV--GC------SIVSSQQPSGIAGFGRGALSMPSQLGEH-IGKDRFAYCLQSHRFDE 161
Query: 284 NDSGSVF-FGDQGPATQQSTSFLPIGEKYDA---------YFVGVESYCIGNSCL----- 328
+ S+ GD+ ++ P A Y++G+ IG L
Sbjct: 162 ENKKSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPS 221
Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKYCYNA 380
T+ ++DSG +FT EI+ + F + +R CY+
Sbjct: 222 KLLRFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGLCYDV 281
Query: 381 SSEEMLKVPDMRLIFSKNQSFVV 403
+ E + +P+ F V+
Sbjct: 282 TGLENIVLPEFAFHFKGGSDMVL 304
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 79/309 (25%), Positives = 130/309 (42%), Gaps = 54/309 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP +++ +D GS+L+W C+ C++C +++ +DPSSSS+ +
Sbjct: 122 MSIGTPALAYAAIVDTGSDLVWTQCKPCVEC----------FNQSTPVFDPSSSSTYSTL 171
Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
CS LC S+C S C Y Y + +S+ G L + LA +
Sbjct: 172 PCSSSLCSDLPTSTCTSAAKDCGYTYTYG-DASSTQGVLAAETFTLA--------KTKLP 222
Query: 231 SVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDS 286
V GCG G + GA G++GLG G + SL+++ GL FS C D+
Sbjct: 223 GVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--GKFSYCLTSLDDTSK 274
Query: 287 GSVFFGD--------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQ-- 334
+ G A Q+T + + Y+V +++ +G++ L S F
Sbjct: 275 SPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQ 334
Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN--ASSEEML 386
+VDSG S T+L + Y + F + C+ AS + +
Sbjct: 335 DDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAPASGVDDV 394
Query: 387 KVPDMRLIF 395
+VP + L F
Sbjct: 395 EVPKLVLHF 403
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 79/323 (24%), Positives = 132/323 (40%), Gaps = 60/323 (18%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IGTP + + LD GS L W+ C + +DPS SSS +
Sbjct: 84 LPIGTPPQTQQMVLDTGSQLSWIQCH--------KKSVPKKPPPTTSFDPSLSSSFSVLP 135
Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
C+HPLCK R +L C + + + + T + G LV + + +S P
Sbjct: 136 CNHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPP---- 191
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-------------VPSLLAKAGLIQN 275
+I+GC T G++G+ LG S VP+ A+AGL
Sbjct: 192 ---LILGCAEASTDE-------KGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSST 241
Query: 276 SFSICFDENDSGSVFFGDQGPAT--QQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 333
+ +SG + + T Q+S + P+ AY + ++ +GN+ L S
Sbjct: 242 GSFYLGNNPNSGRFQYINLLTFTPSQRSPNLDPL-----AYTIPMQGIRMGNARLNISAT 296
Query: 334 ----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNAS 381
Q ++DSG+ FT+L E Y +V + +LV K+ + G C++ +
Sbjct: 297 LFRPDPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDMCFDGN 356
Query: 382 SEEMLK-VPDMRLIFSKNQSFVV 403
E+ + + +M F K V+
Sbjct: 357 PMEIGRLIGNMVFEFEKGVEIVI 379
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 149/331 (45%), Gaps = 45/331 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+++ I +GTP + LD GS++ W IQC P + Y ++ ++P+SSS+
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNW-----IQCEPCADCY----QQSDPVFNPTSSSTY 212
Query: 170 KNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
K+++CS P C S+C+S K C Y Y + + + G L D + + K
Sbjct: 213 KSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGNSGKI----- 264
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
++V +GCG G + A G+ V S+ + + SFS C + DSG
Sbjct: 265 --NNVALGCGHDNEGLFTGAAGLLGLG------GGVLSITNQ--MKATSFSYCLVDRDSG 314
Query: 288 ---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSGFQ------- 334
S+ F +T+ L +K D Y+VG+ + +G L + F
Sbjct: 315 KSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSG 374
Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++D G + T L T+ Y + F KL V+ K+ S + + CY+ SS +KVP +
Sbjct: 375 GVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVA 434
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F+ +S + + P ++ G CF++
Sbjct: 435 FHFTGGKSLDLPAKNYLIPVDDSGTF-CFAF 464
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 78/306 (25%), Positives = 126/306 (41%), Gaps = 37/306 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ T + +GTP +++ +D GS+L W +QC+P S + ++ +DP +SSS
Sbjct: 117 YVTRMGLGTPAKPYIMVVDTGSSLTW-----LQCSPCRVSCH---RQSGPVFDPKTSSSY 168
Query: 170 KNVSCSHPLCKSRSSCK------SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
VSCS P C S+ S + C Y A Y + + S GYL D + + S
Sbjct: 169 AAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYG-DSSFSVGYLSKDTVSFGANSV-- 225
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-D 282
+ GCG+ G + A G+MGL +S+ L A + SFS C
Sbjct: 226 ------PNFYYGCGQDNEGLFGRSA---GLMGLARNKLSL--LYQLAPTLGYSFSYCLPS 274
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-----GFQALV 337
+ SG + G P T + YF+ + + L S ++
Sbjct: 275 TSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTII 334
Query: 338 DSGASFTFLPTEIYAEV--VVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
DSG T LPT +Y + V S+KR + + C+ + ++ VP + + F
Sbjct: 335 DSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAY-SILDTCFEGQASKLRAVPAVSMAF 393
Query: 396 SKNQSF 401
S +
Sbjct: 394 SGGATL 399
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 133/320 (41%), Gaps = 72/320 (22%)
Query: 81 SNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ- 139
++N N + P+ G F + IG P V + +D GS+L+W C+
Sbjct: 88 ASNPDDTNNIKAPTHGGSGEFL---------MELSIGNPAVKYAAIVDTGSDLIWTQCKP 138
Query: 140 CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIAD 197
C +C D+ +DP SSS V CS LC + RS+C KD C Y+
Sbjct: 139 CTEC----------FDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYT 188
Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGL 256
Y + +S+ G L + ++S+ S + GCG + G DG + G++GL
Sbjct: 189 YG-DYSSTRGLLATETFTFED------ENSI-SGIGFGCGVENEG---DGFSQGSGLVGL 237
Query: 257 GLGDVSVPSLLAKAGLIQNSFSICF----DENDSGSVFFGD-------------QGPATQ 299
G G +S+ S L + FS C D S S+F G G T
Sbjct: 238 GRGPLSLISQLK-----ETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVT- 291
Query: 300 QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTE 349
++ S L ++ Y++ ++ +G L+ +S F+ ++DSG + T+L
Sbjct: 292 KTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYL--- 348
Query: 350 IYAEVVVKFDKLVSSKRISL 369
E K K + R+SL
Sbjct: 349 --EETAFKVLKEEFTSRMSL 366
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 127/296 (42%), Gaps = 38/296 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+ WV QC P Y ++ +DP+ SS+ NVS
Sbjct: 184 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYEQREK---LFDPARSSTYANVS 235
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 236 CAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 287
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVF-- 290
GCG + G + + A G++GLG G S+P K G + F+ C +G+ +
Sbjct: 288 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLD 341
Query: 291 FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QSGFQ---ALVDSGA 341
FG A ++ P+ G + Y+VG+ +G L+ QS F +VDSG
Sbjct: 342 FGAGSLAAARARLTTPMLTENGPTF--YYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGT 399
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T LP Y+ + F ++++ + + + CY+ + + +P + L+F
Sbjct: 400 VITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLF 455
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 149/331 (45%), Gaps = 45/331 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+++ I +GTP + LD GS++ W IQC P + Y ++ ++P+SSS+
Sbjct: 162 YFSRIGVGTPAKDMYLVLDTGSDVNW-----IQCEPCADCY----QQSDPVFNPTSSSTY 212
Query: 170 KNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
K+++CS P C S+C+S K C Y Y + + + G L D + + K
Sbjct: 213 KSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGNSGKI----- 264
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
++V +GCG G + A G+ V S+ + + SFS C + DSG
Sbjct: 265 --NNVALGCGHDNEGLFTGAAGLLGLG------GGVLSITNQ--MKATSFSYCLVDRDSG 314
Query: 288 ---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSGFQ------- 334
S+ F +T+ L +K D Y+VG+ + +G L + F
Sbjct: 315 KSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSG 374
Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++D G + T L T+ Y + F KL V+ K+ S + + CY+ SS +KVP +
Sbjct: 375 GVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVA 434
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F+ +S + + P ++ G CF++
Sbjct: 435 FHFTGGKSLDLPAKNYLIPVDDSGTF-CFAF 464
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 156/382 (40%), Gaps = 62/382 (16%)
Query: 65 LLSNDWKRQKTR-VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
LL R K R +L S +S GS T + Y +H + IGTP
Sbjct: 72 LLRRMAARSKARSARLLSGRAASARM----DPGSYTDGVPDTEYLVH---MAIGTPPQPV 124
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK--S 181
+ LD GS+L W QCAP + + SL R ++PS S + + C +C+ +
Sbjct: 125 QLILDTGSDLTWT-----QCAPCVSCFRQSLPR----FNPSRSMTFSVLPCDLRICRDLT 175
Query: 182 RSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 238
SSC C Y Y+ + + ++G+L D AS + HA + + GCG
Sbjct: 176 WSSCGEQSWGNGICVYAYAYA-DHSITTGHLDSDTFSFAS-ADHAIGGASVPDLTFGCGL 233
Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---NDSGSVFFG--- 292
G ++ G+ G G +S+P A L ++FS CF ++ VF G
Sbjct: 234 FNNGIFVSNET--GIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPSPVFLGVPP 286
Query: 293 -------DQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCL--TQSGFQ-------- 334
G QST+ + + AY++ ++ +G + L +S F
Sbjct: 287 NLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGG 346
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDM 391
+VDSG T LP +Y V D V+ ++++ ++ + C++ VP +
Sbjct: 347 TIVDSGTGMTMLPEAVYNLVC---DAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPAL 403
Query: 392 RLIFSKNQSFVVR-NHIFSFPE 412
L F + R N++F E
Sbjct: 404 VLHFEGATLDLPRENYMFEIEE 425
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 139/360 (38%), Gaps = 81/360 (22%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ GTP + +D GS+ +W PC C C S +S + P SSSS
Sbjct: 81 LSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNC---------SFTSRISPFLPKHSSSS 131
Query: 170 KNVSCSHPLCK-------SRSSCKSLKDPC-----PYIADYSTEDTSSSGYLVDDILHLA 217
K + C +P C + C + C PY+ Y + T G + + LHL
Sbjct: 132 KIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTT--GGVALSETLHLH 189
Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
+ + ++GC S P G+ G G G S+PS L GL + F
Sbjct: 190 GL--------IVPNFLVGC------SVFSSRQPAGIAGFGRGPSSLPSQL---GLTK--F 230
Query: 278 SICF------DENDSGSVFFGDQGPATQQSTSFL-------PIGEKYDA----YFVGVES 320
S C D +S S+ Q + +++ + + P + A Y+V +
Sbjct: 231 SYCLLSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRR 290
Query: 321 YCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
IG + ++DSG +FT++ TE + + +F V + +L
Sbjct: 291 ISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALM 350
Query: 371 GNS---WKYCYNASSEEMLKVPDMRLIFS--KNQSFVVRNHIFSFPENEVGDHACFSYFT 425
+ K C+N S + L++P +RL F + + N+ EV ACF+ T
Sbjct: 351 VEALSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREV---ACFTVVT 407
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 145/340 (42%), Gaps = 67/340 (19%)
Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
G ++ + + IGTP + + LD GS L W+ QC + P S +D
Sbjct: 75 GFKYSMILLVSLPIGTPPQTQQMILDTGSQLSWI--QCHKKVPRKPP-------PSSVFD 125
Query: 163 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLA 217
PS SSS + C+HPLCK R +L C + + + + T + G LV + +
Sbjct: 126 PSLSSSFSVLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKI--- 182
Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
+FS+ S +I+GC + + + G++G+ LG +S S +A L + S+
Sbjct: 183 TFSR----SQSTPPLILGCAEESSDA-------KGILGMNLGRLSFAS---QAKLTKFSY 228
Query: 278 SICFDE-----NDSGSVFFGDQGPA-------------TQQSTSFLPIGEKYDAYFVGVE 319
+ + +GS + G+ + +Q+ + P+ AY V ++
Sbjct: 229 CVPTRQVRPGFTPTGSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPL-----AYTVAMQ 283
Query: 320 SYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRI 367
IGN L Q ++DSG+ FT+L E Y +V + +LV + K+
Sbjct: 284 GIRIGNQKLNIPISAFRPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKG 343
Query: 368 SLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNH 406
+ G C+N ++ E+ + + +M F K VV
Sbjct: 344 YVYGGVSDMCFNGNAIEIGRLIGNMVFEFDKGVEIVVEKE 383
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 137/329 (41%), Gaps = 53/329 (16%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP V F+ D GS+L W CQ C C P ++ YD + SSS V
Sbjct: 97 LAIGTPPVPFVALADTGSDLTWTQCQPCKLCFP----------QDTPIYDTAVSSSFSPV 146
Query: 173 SCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
C+ C S +C + PC Y Y + S+G L + L AP SV
Sbjct: 147 PCASATCLPIWSSRNCTASSSPCRYRYAYG-DGAYSAGVLGTETLTF----PGAPGVSV- 200
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDEND 285
+ GCG G + G +GLG G + SL+A+ G+ FS C F+ +
Sbjct: 201 GGIAFGCGVDNGGLSYNST---GTVGLGRGSL---SLVAQLGV--GKFSYCLTDFFNTSL 252
Query: 286 SGSVFFGD----QGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--------- 329
V FG P+T + P+ + Y+V +E +G++ L
Sbjct: 253 GSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLR 312
Query: 330 -QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--EEML 386
+VDSG +FTFL E VVV V + + + C+ A++ +++
Sbjct: 313 DDGSGGMIVDSGTTFTFL-VESAFRVVVDHVAGVLRQPVVNASSLDSPCFPAATGEQQLP 371
Query: 387 KVPDMRLIFSKNQSFVV-RNHIFSFPENE 414
+PDM L F+ + R++ SF + E
Sbjct: 372 AMPDMVLHFAGGADMRLHRDNYMSFNQEE 400
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 156/382 (40%), Gaps = 62/382 (16%)
Query: 65 LLSNDWKRQKTR-VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
LL R K R +L S +S GS T + Y +H + IGTP
Sbjct: 46 LLRRMAARSKARSARLLSGRAASARM----DPGSYTDGVPDTEYLVH---MAIGTPPQPV 98
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK--S 181
+ LD GS+L W QCAP + + SL R ++PS S + + C +C+ +
Sbjct: 99 QLILDTGSDLTWT-----QCAPCVSCFRQSLPR----FNPSRSMTFSVLPCDLRICRDLT 149
Query: 182 RSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 238
SSC C Y Y+ + + ++G+L D AS + HA + + GCG
Sbjct: 150 WSSCGEQSWGNGICVYAYAYA-DHSITTGHLDSDTFSFAS-ADHAIGGASVPDLTFGCGL 207
Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---NDSGSVFFG--- 292
G ++ G+ G G +S+P A L ++FS CF ++ VF G
Sbjct: 208 FNNGIFVSNET--GIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPSPVFLGVPP 260
Query: 293 -------DQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCL--TQSGFQ-------- 334
G QST+ + + AY++ ++ +G + L +S F
Sbjct: 261 NLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGG 320
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDM 391
+VDSG T LP +Y V D V+ ++++ ++ + C++ VP +
Sbjct: 321 TIVDSGTGMTMLPEAVYNLVC---DAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPAL 377
Query: 392 RLIFSKNQSFVVR-NHIFSFPE 412
L F + R N++F E
Sbjct: 378 VLHFEGATLDLPRENYMFEIEE 399
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 79/308 (25%), Positives = 124/308 (40%), Gaps = 48/308 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + D GS+L W QC P S Y + +DPS+S + N+S
Sbjct: 158 VGLGTPKKDLSLIFDTGSDLTWT-----QCQPCVKSCYA---QQQPIFDPSTSKTYSNIS 209
Query: 174 CSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
C+ C S S C S C Y Y + + + G+ D L L Q+
Sbjct: 210 CTSAACSSLKSATGNSPGCSS--SNCVYGIQYG-DSSFTIGFFAKDKLTLT-------QN 259
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEN 284
V + GCG+ G + A G++GLG +S+ A+ FS C
Sbjct: 260 DVFDGFMFGCGQNNKGLFGKTA---GLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRG 314
Query: 285 DSGSVFFGD-----QGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLTQSG--F 333
+G + FG+ A + +F P G Y YF+ V +G L+ S F
Sbjct: 315 SNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAY--YFIDVLGISVGGKALSISPMLF 372
Query: 334 Q---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
Q ++DSG T LP+ Y + F + +S + + CY+ S+ + +P
Sbjct: 373 QNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPK 432
Query: 391 MRLIFSKN 398
+ F+ N
Sbjct: 433 ISFNFNGN 440
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 119/296 (40%), Gaps = 31/296 (10%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP LV D GS+L WV QC P + Y ++ +DPS S++ V
Sbjct: 192 VGLGTPRRDLLVVFDTGSDLSWV-----QCKPCNNCY----KQHDPLFDPSQSTTYSAVP 242
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C C +C S K C Y Y + + + G L D L L P S +
Sbjct: 243 CGAQECLDSGTCSSGK--CRYEVVYG-DMSQTDGNLARDTLTL------GPSSDQLQGFV 293
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN--DSGSVFF 291
GCG TG + DG+ GLG VS+ S A FS C + G +
Sbjct: 294 FGCGDDDTGLF---GRADGLFGLGRDRVSLAS--QAAARYGAGFSYCLPSSWRAEGYLSL 348
Query: 292 GD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC--LTQSGFQA---LVDSGASFTF 345
G P Q T+ + + Y++ + + + + F+A ++DSG T
Sbjct: 349 GSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVITR 408
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
LP+ Y+ + F + + + + CY+ + +++P + L+F +
Sbjct: 409 LPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATL 464
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 87/324 (26%), Positives = 138/324 (42%), Gaps = 39/324 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ GTP + + LD GS+L W IQC P S Y D ++DP+ SSS V
Sbjct: 141 VGFGTPAQTAAIILDTGSDLSW-----IQCKPCSGHCYRQHD---PDFDPAKSSSYAAVP 192
Query: 174 CSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
C P+C + C C Y Y + +S++G L D L S SK +
Sbjct: 193 CGTPVCAAAGGMCNGTT--CLYGVQYG-DGSSTTGVLSRDTLTFNSSSKF-------TGF 242
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--GSVF 290
GCG K G + + DG++GLG G +S+PS A + FS C ++ G +
Sbjct: 243 TFGCGEKNIGDFGEV---DGLLGLGRGKLSLPSQAAPS--FGGVFSYCLPSYNTTPGYLN 297
Query: 291 FGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGN-------SCLTQSGFQALVDSG 340
G P + Q T+ + + YF+ + S IG S T++G L+DSG
Sbjct: 298 IGATKPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKTG--TLLDSG 355
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
T+LP Y + +F + + + CY+ + + + +P + FS
Sbjct: 356 TILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAV 415
Query: 401 FVVRNH-IFSFPENEVGDHACFSY 423
F + + I FP++ C ++
Sbjct: 416 FDLDFYGIMIFPDDAKPLIGCLAF 439
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 85/345 (24%), Positives = 136/345 (39%), Gaps = 46/345 (13%)
Query: 68 NDWKRQKTRVKLQS---NNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
N R +R++ S + N LL P +G + +FY IG+P V L
Sbjct: 56 NAALRSMSRLQRVSHFLDENKLPESLLIPDKGE----YLMRFY--------IGSPPVERL 103
Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---- 179
+D GS+L+W+ C C C P + ++P SS+ K +C C
Sbjct: 104 AMVDTGSSLIWLQCSPCHNCFP----------QETPLFEPLKSSTYKYATCDSQPCTLLQ 153
Query: 180 KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 239
S+ C L C Y Y + + S G L + L S Q+ + I GCG
Sbjct: 154 PSQRDCGKLGQ-CIYGIMYG-DKSFSVGILGTETLSFG--STGGAQTVSFPNTIFGCGVD 209
Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGP 296
+ G+ GLG G +S+ S L I + FS C +D + + FG +
Sbjct: 210 NNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQ--IGHKFSYCLLPYDSTSTSKLKFGSEAI 267
Query: 297 ATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQALVDSGASFTFLPTEIY 351
T P+ K YF+ +E+ IG ++ Q+ ++DSG T+L Y
Sbjct: 268 ITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTPLTYLENTFY 327
Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
V + + K + + K C+ + L +PD+ F+
Sbjct: 328 NNFVASLQETLGVKLLQDLPSPLKTCF--PNRANLAIPDIAFQFT 370
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 89/356 (25%), Positives = 144/356 (40%), Gaps = 67/356 (18%)
Query: 110 HYTWIDIGTP--NVSFLVALDAGSNLLWVPC----QCIQCA-------PLSASYYTSLDR 156
H + GTP +SFLV D GS+++W PC C C+ P+ +S D+
Sbjct: 87 HTIPLSFGTPPQKLSFLV--DTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDK 144
Query: 157 NLSEYDPS-SSSSSKNVSCSHPLCKSRSSCKSLKDPCP-YIADYSTEDTSSSGYLVDDIL 214
L DP +++SS +V P C S K CP Y Y T ++SG+ + + L
Sbjct: 145 ILGCRDPKCANTSSPDVHLGCPRCNGNS--KKCSHACPQYTLQYGTG--AASGFFLLENL 200
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
+ H ++GC T S + D + G G S+P +
Sbjct: 201 DFPGKTIH--------KFLVGC----TTSADREPSSDALAGFGRTMFSLPMQMG-----V 243
Query: 275 NSFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYD----AYFVGVESYCIG 324
F+ C + +D SG + D Q S+ P + Y++GV+ IG
Sbjct: 244 KKFAYCLNSHDYDDTRNSGKLIL-DYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIG 302
Query: 325 NSCLTQSG----------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS- 373
N L G ++DSG ++ ++ ++ V + K +S R SL+ +
Sbjct: 303 NKLLRIPGKYLTPGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQ 362
Query: 374 --WKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEVGDHACFSYFT 425
CYN + + +K+PD+ F+ + VV N+ F E +G CF T
Sbjct: 363 SGLTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLG---CFPVTT 415
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 142/333 (42%), Gaps = 72/333 (21%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IGTP+ S + LD GS L W+ C + TS +DPS SSS ++
Sbjct: 85 LPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTS-------FDPSLSSSFSDLP 137
Query: 174 CSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
CSHPLCK R +SC S + C Y Y+ + T + G LV + ++ P
Sbjct: 138 CSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGTFAEGNLVKEKFTFSNSQTTPP-- 193
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN-- 284
+I+GC ++ T G++G+ LG + S +++A + + S+ I N
Sbjct: 194 -----LILGCAKESTDV-------KGILGMNLGRL---SFISQAKISKFSYCIPTRSNRP 238
Query: 285 ---DSGSVFFGDQG-------------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
+GS + G+ P +Q+ + P+ AY V + IG L
Sbjct: 239 GLASTGSFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPL-----AYTVPLLGIRIGQKRL 293
Query: 329 T-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWK 375
SG Q +VDSG+ FT L Y +V + +LV S K+ + G++
Sbjct: 294 NIPSSVFRPDAGGSG-QTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTAD 352
Query: 376 YCYNASSEEMLK--VPDMRLIFSKNQSFVVRNH 406
C++ + + ++ + D+ F + +V
Sbjct: 353 MCFDGNHQMVIGRLIGDLVFEFGRGVEILVEKQ 385
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 74/304 (24%), Positives = 120/304 (39%), Gaps = 48/304 (15%)
Query: 102 FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSE 160
F N Y + + +GTP +D GS + W C C+ C ++N
Sbjct: 60 FDNSVYLMK---LQVGTPPFEIQAIIDTGSEITWTQCLPCVHC----------YEQNAPI 106
Query: 161 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
+DPS SS+ K C CPY DY + T + G L + + L S S
Sbjct: 107 FDPSKSSTFKEKRCD-------------GHSCPYEVDYF-DHTYTMGTLATETITLHSTS 152
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+ V IIGCG S+ + G++GL G S+ + G S C
Sbjct: 153 G---EPFVMPETIIGCGHNN--SWFKPSF-SGMVGLNWGPSSL--ITQMGGEYPGLMSYC 204
Query: 281 FDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQ 334
F + + FG G +T F+ K Y++ +++ +GN+ + G F
Sbjct: 205 FSGQGTSKINFGANAIVAGDGVVSTTMFMTTA-KPGFYYLNLDAVSVGNTRIETMGTTFH 263
Query: 335 AL-----VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
AL +DSG + T+ P V + +V++ R + + CYN+ + ++ V
Sbjct: 264 ALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTIDIFPVI 323
Query: 390 DMRL 393
M
Sbjct: 324 TMHF 327
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 85/326 (26%), Positives = 138/326 (42%), Gaps = 66/326 (20%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I +GTP + F V +D GSNL+W C C +C P P+ SS+ +
Sbjct: 95 ISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTP--------APVLQPARSSTFSRL 146
Query: 173 SCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQ 225
C+ C+ SR + C Y +Y+ ++GYL + L + +F K
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAY--NYTYGSGYTAGYLATETLTVGDGTFPK---- 200
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DE 283
V GC T + +D ++ G++GLG G +S+ S LA FS C D
Sbjct: 201 ------VAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAVG-----RFSYCLRSDM 244
Query: 284 NDSGS--VFFGDQGPATQ----QSTSFL--PIGEKYDAYFVGVESYCIGNSCL------- 328
D G+ + FG T+ QST L P ++ Y+V + + ++ L
Sbjct: 245 ADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTF 304
Query: 329 --TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY----CYNA 380
TQ+G +VDSG + T+L + YA V F +++ + + Y CY
Sbjct: 305 GFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKP 364
Query: 381 SS---EEMLKVPDMRLIFSKNQSFVV 403
S+ + ++VP + L F+ + V
Sbjct: 365 SAGGGGKAVRVPRLALRFAGGAKYNV 390
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 83/308 (26%), Positives = 125/308 (40%), Gaps = 52/308 (16%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP LD GS+L+W QC CA + L + + P +SSS + +
Sbjct: 108 LAVGTPPQPVSALLDTGSDLIWT--QCAPCA-------SCLPQPDPIFSPGASSSYEPMR 158
Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C+ LC SC+ D C Y Y + T++ G + +S S + + +
Sbjct: 159 CAGELCNDILHHSCQR-PDTCTYRYSYG-DGTTTRGVYATERFTFSSSSSGGETTKLSAP 216
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG---S 288
+ GCG GS +G+ G++G G +S+ S LA FS C SG +
Sbjct: 217 LGFGCGTMNKGSLNNGS---GIVGFGRAPLSLVSQLAI-----RRFSYCLTPYASGRKST 268
Query: 289 VFFG-------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ----- 334
+ FG D AT Q+T L + Y+V +G L S F
Sbjct: 269 LLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDG 328
Query: 335 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK-----YCYNASSEEML 386
A+VDSG + T P + AEVV F S R+ N C+ A++ +
Sbjct: 329 SGGAIVDSGTALTLFPAPVLAEVVRAFR---SQLRLPFAANGSSGPDDGVCFAAAASRVP 385
Query: 387 K---VPDM 391
+ VP M
Sbjct: 386 RPAVVPRM 393
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 57/279 (20%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IGTP + + LD GS L W+ QC + P +AS+ DPS SS+ +
Sbjct: 79 LPIGTPPQTQPMVLDTGSQLSWI--QCHKKQPPTASF-----------DPSLSSTFSILP 125
Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
C+HPLCK R +L C + + + + T + G LV + +FS+ S
Sbjct: 126 CTHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKF---TFSR----SVS 178
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-------------VPSLLAKAGLIQN 275
+I+GC + T P G++G+ LG +S VP + G
Sbjct: 179 TPPLILGCATESTD-------PRGILGMNLGRLSFAKQSKITKFSYCVPPRQTRPGFTPT 231
Query: 276 -SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE--------SYCIGNS 326
SF + + + G + G + Q+ +F P+ Y VG+ S + +
Sbjct: 232 GSFYLGNNPSSKGFKYVGMMTSSRQRMPNFDPLA--YTIPMVGIRIAGKKLNISPAVFRA 289
Query: 327 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 365
SG Q ++DSG+ FT+L +E Y +V + + V +
Sbjct: 290 DAGGSG-QTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPR 327
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 85/326 (26%), Positives = 138/326 (42%), Gaps = 66/326 (20%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I +GTP + F V +D GSNL+W C C +C P P+ SS+ +
Sbjct: 95 ISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTP--------APVLQPARSSTFSRL 146
Query: 173 SCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQ 225
C+ C+ SR + C Y +Y+ ++GYL + L + +F K
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAY--NYTYGSGYTAGYLATETLTVGDGTFPK---- 200
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DE 283
V GC T + +D ++ G++GLG G +S+ S LA FS C D
Sbjct: 201 ------VAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAVG-----RFSYCLRSDM 244
Query: 284 NDSGS--VFFGDQGPATQ----QSTSFL--PIGEKYDAYFVGVESYCIGNSCL------- 328
D G+ + FG T+ QST L P ++ Y+V + + ++ L
Sbjct: 245 ADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTF 304
Query: 329 --TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY----CYNA 380
TQ+G +VDSG + T+L + YA V F +++ + + Y CY
Sbjct: 305 GFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKP 364
Query: 381 SS---EEMLKVPDMRLIFSKNQSFVV 403
S+ + ++VP + L F+ + V
Sbjct: 365 SAGGGGKAVRVPRLALRFAGGAKYNV 390
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 139/330 (42%), Gaps = 44/330 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
+++ + +G+P + LD GS++ WV CQ C C Y D +DPS S+S
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSTS 216
Query: 169 SKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
+V+C +P C ++C++ C Y Y + + + G + L L S
Sbjct: 217 YASVACDNPRCHDLDAAACRNSTGACLYEVAYG-DGSYTVGDFATETLTLG-------DS 268
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+ SSV IGCG G ++ A + G L S PS ++ +FS C + DS
Sbjct: 269 APVSSVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDS 320
Query: 287 GS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ------- 334
S + FGD A + + + Y+VG+ +G L+ S F
Sbjct: 321 PSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAG 379
Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
+VDSG + T L + YA + F + S + + + CY+ S ++VP + L
Sbjct: 380 GVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSL 439
Query: 394 IFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F+ + + P + G + C ++
Sbjct: 440 RFAGGGELRLPAKNYLIPVDGAGTY-CLAF 468
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 79/308 (25%), Positives = 115/308 (37%), Gaps = 42/308 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP F + D GS L WV C P + P +S S
Sbjct: 91 YFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGLV------------FRPEASKSW 138
Query: 170 KNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
V CS CK S ++C S PC Y Y + G + D +A
Sbjct: 139 APVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVA 198
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
Q V++GC G DGV+ LG +S S A SFS C
Sbjct: 199 Q---LQDVVLGCSSTHDGQSFKSV--DGVLSLGNAKISFASR--AAARFGGSFSYCLVDH 251
Query: 282 --DENDSGSVFFG----DQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLTQS 331
N +G + FG + PATQ P G K DA V ++ I
Sbjct: 252 LAPRNATGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPK 311
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYN--ASSEEMLKV 388
++DSG + T L T Y VV KL++ ++ +++CYN A ++
Sbjct: 312 SGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFP--PFEHCYNWTAPRPGAPEI 369
Query: 389 PDMRLIFS 396
P + + F+
Sbjct: 370 PKLAVQFT 377
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 124/315 (39%), Gaps = 41/315 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ +GTP F++ D GS+L WV C+ A + + + + ++S S
Sbjct: 101 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPA-----RVFRTAASKSW 155
Query: 170 KNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLA------ 217
++CS C S ++C S PC Y DY D S++ G + D +A
Sbjct: 156 APIACSSDTCTSYVPFSLANCSSPASPCAY--DYRYRDGSAARGVVGTDSATIALSSGSG 213
Query: 218 --SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 275
+ + V++GC G + DGV+ LG ++S S A
Sbjct: 214 RGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSS--DGVLSLGNSNISFASR--AAARFGG 269
Query: 276 SFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 328
FS C N + + FG A T L Y V V++ + L
Sbjct: 270 RFSYCLVDHLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDI 329
Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNAS 381
A++DSG S T L T Y VV K L R+++ + ++YCYN +
Sbjct: 330 PADVWDVDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTM--DPFEYCYNWT 387
Query: 382 SEEMLKVPDMRLIFS 396
L++P M + F+
Sbjct: 388 DAGALEIPKMEVHFA 402
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 149/350 (42%), Gaps = 60/350 (17%)
Query: 66 LSNDWKRQKTRVKLQSNN----NSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
L N + R +RV + NS +N L+ P+ G ++ + IGTP V
Sbjct: 59 LRNAFSRSISRVNVFKTKAVDINSFQNDLV-PNGGE------------YFMKMSIGTPLV 105
Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK- 180
+V D GS+L WV QC+ C P + +DPS SSS +++ C C
Sbjct: 106 EVIVIADTGSDLTWV--QCLPCDP-------CYRQKSPLFDPSRSSSYRHMLCGSRFCNA 156
Query: 181 ---SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
S +C + C Y YS D S ++G L + + S S S ++ GC
Sbjct: 157 LDVSEQACTMDTNICEY--HYSYGDKSYTNGNLATEKFTIGSTSSRPVH---LSPIVFGC 211
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICF-----DENDSGSVF 290
G G++ + G+ SL+++ + +I+ FS C N + +
Sbjct: 212 GTGNGGTF-----DELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSNVTSKIK 266
Query: 291 FG-DQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCL----------TQSGFQALVD 338
FG D + Q S + ++ D Y+ V +E+ +GN L + G ++D
Sbjct: 267 FGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKG-NVIID 325
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
SG + TFL +E + E+ ++ V ++R+S + C+ ++ + L V
Sbjct: 326 SGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSVCFRSAGDIDLPV 375
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 75/304 (24%), Positives = 124/304 (40%), Gaps = 48/304 (15%)
Query: 102 FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSE 160
F N Y + + +GTP +D GS + W C C+ C +N
Sbjct: 375 FDNSVYLMK---LQVGTPPFEIEAVIDTGSEITWTQCLPCVHC----------YKQNAPI 421
Query: 161 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
+DPS SS+ K C CPY DY + T + G L D + + S S
Sbjct: 422 FDPSKSSTFKEKRCH-------------DHSCPYEVDYF-DKTYTKGTLATDTVTIHSTS 467
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+ V + IIGCGR S+ + +G +GL G +S+ + G S C
Sbjct: 468 G---EPFVMAETIIGCGRNN--SWFRPSF-EGFVGLNWGPLSL--ITQMGGEYPGLMSYC 519
Query: 281 FDENDSGSVFFGDQ---GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA 335
F N + + FG G ST+ + Y++ +++ +G++ + G F A
Sbjct: 520 FAGNGTSKINFGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHA 579
Query: 336 L-----VDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
L +DSG + T+ P E Y +V + + +V + + + CY +++ E+ V
Sbjct: 580 LEGNIVIDSGTTLTYFP-ESYCNLVRQAVEHVVPAVPAADPTGNDLLCYYSNTTEIFPVI 638
Query: 390 DMRL 393
M
Sbjct: 639 TMHF 642
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 68/283 (24%), Positives = 113/283 (39%), Gaps = 57/283 (20%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP LD GS L+W C C+ C D+ +DPS SS+ K
Sbjct: 69 LQIGTPPFEVEAVLDTGSELIWTQCLPCLHC----------YDQKAPIFDPSKSSTFKET 118
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
C+ P CPY Y + + + G L + + + S S V
Sbjct: 119 RCNTP-----------DHSCPYKLVYD-DKSYTQGTLATETVTIHSTSG---VPFVMPET 163
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
IIGC R +GS ++ G++GL G +S+ S + G + G
Sbjct: 164 IIGCSRNNSGSGFRPSS-SGIVGLSRGSLSLISQM-------------------GGAYPG 203
Query: 293 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQAL-----VDSGASFTF 345
D ST+ K Y++ +++ +G++ + G F AL +DSG T+
Sbjct: 204 DG----VVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPLTY 259
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
P V +++V++ R+ + CY +++ E+ V
Sbjct: 260 FPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSNTIEIFPV 302
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 156/382 (40%), Gaps = 62/382 (16%)
Query: 65 LLSNDWKRQKTR-VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
LL R K R +L S +S GS T + Y +H + IGTP
Sbjct: 72 LLHRMAARSKARSARLLSGRAASARV----DPGSYTDGVPDTEYLVH---MAIGTPPQPV 124
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK--S 181
+ LD GS+L W QCAP + + SL R ++PS S + + C +C+ +
Sbjct: 125 QLILDTGSDLTWT-----QCAPCVSCFRQSLPR----FNPSRSMTFSVLPCDLRICRDLT 175
Query: 182 RSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 238
SSC C Y Y+ + + ++G+L D AS + HA + + GCG
Sbjct: 176 WSSCGEQSWGNGICVYAYAYA-DHSITTGHLDSDTFSFAS-ADHAIGGASVPDLTFGCGL 233
Query: 239 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---NDSGSVFFG--- 292
G ++ G+ G G +S+P A L ++FS CF ++ VF G
Sbjct: 234 FNNGIFVSNET--GIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPSPVFLGVPP 286
Query: 293 -------DQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCL--TQSGFQ-------- 334
G QST+ + + AY++ ++ +G + L +S F
Sbjct: 287 NLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGG 346
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDM 391
+VDSG T LP +Y V D V+ ++++ ++ + C++ VP +
Sbjct: 347 TIVDSGTGMTMLPEAVYNLVC---DAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPAL 403
Query: 392 RLIFSKNQSFVVR-NHIFSFPE 412
L F + R N++F E
Sbjct: 404 VLHFEGATLDLPRENYMFEIEE 425
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 89/323 (27%), Positives = 132/323 (40%), Gaps = 45/323 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y LH + IGTP + LD GS L+W CQ C C +++L YD S
Sbjct: 35 YLLH---LAIGTPPQPVQLTLDTGSVLVWTQCQPCAVC----------FNQSLPYYDASR 81
Query: 166 SSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
SS+ SC CK S + C C Y YS D S++ +D + SF
Sbjct: 82 SSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAY--SYSYGDKSATIGFLD--VETVSFVA 137
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
A SV V+ GCG TG + G+ G G G +S+PS L K G + F+
Sbjct: 138 GA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVS 190
Query: 282 DENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ 334
S +F G T Q+T + Y++ ++ +G++ L +S F
Sbjct: 191 GRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFA 250
Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS-EEML 386
++DSG +FT LP +Y V +F V + C++A +
Sbjct: 251 LKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAP 310
Query: 387 KVPDMRLIFSKNQSFVVR-NHIF 408
VP + L F + R N++F
Sbjct: 311 HVPKLVLHFEGATMHLPRENYVF 333
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 90/351 (25%), Positives = 141/351 (40%), Gaps = 65/351 (18%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y +H + IGTP + LD GS+L+W C+ C C R L DPS+
Sbjct: 415 YLVH---LAIGTPPQPVQLILDTGSDLVWTQCRPCPVC----------FSRALGPLDPSN 461
Query: 166 SSSSKNVSCSHPLCKSR--SSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
SS+ + CS P+C + SSC C Y+ Y + G + L +F+
Sbjct: 462 SSTFDVLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAY------ADGSITTGHLDAETFT 515
Query: 221 KHAPQSSVQSSV---IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
A + Q++V GCG G + G+ G G G +S+PS L ++F
Sbjct: 516 FAAADGTGQATVPDLAFGCGLFNNGIFTSNET--GIAGFGRGALSLPSQLKV-----DNF 568
Query: 278 SICFDE---NDSGSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS-- 326
S CF ++ SV G QST + AY++ ++ +G++
Sbjct: 569 SHCFTAITGSEPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRL 628
Query: 327 -------CLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS----W 374
L Q G ++DSG T LP + Y V D + R+ + +
Sbjct: 629 PIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLV---HDAFTAQVRLPVDNATSSSLS 685
Query: 375 KYCYNASSEEMLK--VPDMRLIFSKNQSFVVR-NHIFSFPENEVGDHACFS 422
+ C++ S K VP + L F + R N++F F E+ G C +
Sbjct: 686 RLCFSFSVPRRAKPDVPKLVLHFEGATLDLPRENYMFEF-EDAGGSVTCLA 735
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/313 (26%), Positives = 130/313 (41%), Gaps = 72/313 (23%)
Query: 88 NQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPL 146
N + P+ G F + IG P V + +D GS+L+W C+ C +C
Sbjct: 94 NNIKAPTHGGSGEFL---------MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTEC--- 141
Query: 147 SASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTS 204
D+ +DP SSS V CS LC + RS+C KD C Y+ Y + +S
Sbjct: 142 -------FDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYG-DYSS 193
Query: 205 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSV 263
+ G L + ++S+ S + GCG + G DG + G++GLG G +S+
Sbjct: 194 TRGLLATETFTFED------ENSI-SGIGFGCGVENEG---DGFSQGSGLVGLGRGPLSL 243
Query: 264 PSLLAKAGLIQNSFSICF----DENDSGSVFFGD-------------QGPATQQSTSFLP 306
S L + FS C D S S+F G G T ++ S L
Sbjct: 244 ISQLK-----ETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVT-KTMSLLR 297
Query: 307 IGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVV 356
++ Y++ ++ +G L+ +S F+ ++DSG + T+L E
Sbjct: 298 NPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYL-----EETAF 352
Query: 357 KFDKLVSSKRISL 369
K K + R+SL
Sbjct: 353 KVLKEEFTSRMSL 365
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 116/451 (25%), Positives = 169/451 (37%), Gaps = 75/451 (16%)
Query: 16 LDGSDAVSFSSKLVHRFS--DEAKERWISKSGNVSVADSWPKKNSVEYLELLLSND---- 69
L D + S +L+HR S EAKE+ + LE L ++
Sbjct: 48 LSPRDGGTLSLELIHRNSLLREAKEKL--------------HTHEQLLLETLQRDEQRVR 93
Query: 70 WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
W K ++ + + +S L P + G F L +GTP S + +D
Sbjct: 94 WIESKAQLAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRL-----GVGTPARSLFMVVDT 148
Query: 130 GSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRS 183
GS+L W+ CQ C C Y D +DP +SSS + + C PLCK S S
Sbjct: 149 GSDLPWLQCQPCKSC-------YKQAD---PIFDPRNSSSFQRIPCLSPLCKALEIHSCS 198
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
+ C Y Y + + S G D+ L + SK SV GCG G
Sbjct: 199 GSRGATSRCSYQVAYG-DGSFSVGDFSSDLFTLGTGSKAM-------SVAFGCGFDNEGL 250
Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND------SGSVFFGDQGPA 297
+ A G+ L S + NSFS C + S S+ FG
Sbjct: 251 FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGAAAIP 310
Query: 298 TQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL---------TQSGFQA-LVDSGASFTFL 346
+ + S L K D Y+ + +G + L +QSG ++DSG S T
Sbjct: 311 STAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRF 370
Query: 347 PTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
PT +YA + F L S+ R SL + CYN S + + VP + L F
Sbjct: 371 PTSVYATIRDAFRNATTNLPSAPRYSL----FDTCYNFSGKASVDVPALVLHFENGADLQ 426
Query: 403 VRNHIFSFPENEVGDHA-CFSYFTLEYNFTG 432
+ + P N G F+ ++E G
Sbjct: 427 LPPTNYLIPINTAGSFCLAFAPTSMELGIIG 457
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 130/305 (42%), Gaps = 41/305 (13%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
+ GTP+V ++ +D GS++ WV PC +C P + +DPS SS+
Sbjct: 135 LGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYP----------QKDPLFDPSKSSTYA 184
Query: 171 NVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
++C+ C+ + C S C Y +Y+ + + S G ++ L L AP
Sbjct: 185 PIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYA-DGSHSRGVYSNETLTL------APG 237
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+V+ GCGR Q G DG++GLG VS+ ++ + + +FS C +
Sbjct: 238 ITVE-DFHFGCGRDQRGP---SDKYDGLLGLGGAPVSL--VVQTSSVYGGAFSYCLPALN 291
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKY-----DAYFVGVESYCIGNSCL--TQSGFQA--L 336
S + F P + ++F+ ++ Y V + +G L QS F+ +
Sbjct: 292 SEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMI 351
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
+DSG T LP Y + K + + + + + + CYN + + VP + FS
Sbjct: 352 IDSGTVDTELPETAYNALEAALRKALKAYPL-VPSDDFDTCYNFTGYSNITVPRVAFTFS 410
Query: 397 KNQSF 401
+
Sbjct: 411 GGATI 415
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 139/330 (42%), Gaps = 44/330 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
+++ + +G+P + LD GS++ WV CQ C C Y D +DPS S+S
Sbjct: 163 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSTS 212
Query: 169 SKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
+V+C +P C ++C++ C Y Y + + + G + L L S
Sbjct: 213 YASVACDNPRCHDLDAAACRNSTGACLYEVAYG-DGSYTVGDFATETLTLG-------DS 264
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+ SSV IGCG G ++ A + G L S PS ++ +FS C + DS
Sbjct: 265 APVSSVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDS 316
Query: 287 GS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ------- 334
S + FGD A + + + Y+VG+ +G L+ S F
Sbjct: 317 PSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAG 375
Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
+VDSG + T L + YA + F + S + + + CY+ S ++VP + L
Sbjct: 376 GVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSL 435
Query: 394 IFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F+ + + P + G + C ++
Sbjct: 436 RFAGGGELRLPAKNYLIPVDGAGTY-CLAF 464
>gi|299471769|emb|CBN76990.1| aspartic protease PM5 [Ectocarpus siliculosus]
Length = 947
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 89/366 (24%), Positives = 146/366 (39%), Gaps = 47/366 (12%)
Query: 57 NSVEYLELLLSNDWKRQKTR--VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWI 114
N E EL + D K+ R +S S + LFP G+ H+ ++
Sbjct: 83 NHTESAELTAAVDAKKLARRDWQGRRSLYMSFEDTPLFPGWGT------------HFAYV 130
Query: 115 DIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
GTP V +D GS+ PC +C C + + +D S S+SS V+
Sbjct: 131 YAGTPPQRVSVIIDTGSHFTAFPCSECENCGSHTDPH----------WDQSKSTSSHIVT 180
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP------QSS 227
C C C+ K C + YS E +S Y V+D+L + + +S+
Sbjct: 181 CED--CHGSFRCQKDKR-CGFSQRYS-EGSSWRAYQVEDVLWVGELTLQQSEKINHDESA 236
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 286
+ GC QTG + A DG+MG+ ++ LAKAG I + +FS+CF +N
Sbjct: 237 YSVEFMFGCIESQTGLFKTQLA-DGIMGMSADSHTLVWQLAKAGKIKERTFSLCFGKNGG 295
Query: 287 GSVFFGDQGPATQQSTSFL-PIGEKYDAYF------VGVESYCIG-NSCLTQSGFQALVD 338
V G + + K + +F + V I + + Q G +VD
Sbjct: 296 TMVIGGYDTRLNKPGHEMMYTPSTKTNGWFTVQVTDITVNRVSIAQDPAIFQRGKGIIVD 355
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
SG + T+LP + +++ S + + N +C +S E+ +P + +
Sbjct: 356 SGTTDTYLPRSVAKGFSAAWERATGSPYANCKDN--HFCMILTSAELEALPTVTIHMDGG 413
Query: 399 QSFVVR 404
VR
Sbjct: 414 LEVNVR 419
>gi|348690233|gb|EGZ30047.1| hypothetical protein PHYSODRAFT_474645 [Phytophthora sojae]
Length = 642
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 134/316 (42%), Gaps = 61/316 (19%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCA----PLSASYYTSLDRNLSEY 161
Y HY I +G P V +D GS+L +PC C C PL +
Sbjct: 92 YGTHYAEIYLGIPAQRASVIVDTGSHLTALPCSTCQGCGQHTDPL--------------F 137
Query: 162 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
D S S+++K ++C H SC+S + YI+ E + +VD+++ + FS
Sbjct: 138 DVSKSTTAKYLAC-HDF----DSCRSCEQDRCYISQSYMEGSMWEAVMVDELVWVGGFSS 192
Query: 222 HAPQ-----SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QN 275
A + + +GC K+TG ++ +G+MGLG +V S + AG + QN
Sbjct: 193 PADEMEGVLKTFGFRFPVGCQTKETGLFIT-QKENGIMGLGRHRSTVMSYMLNAGRVTQN 251
Query: 276 SFSICFDENDSGSVFFG---------DQG--PATQQSTSFLPIGEKYDAYFVGVESYCIG 324
F++CF D G + FG D G P +++ P+ K D GV S I
Sbjct: 252 LFTLCF-AGDGGELVFGGVDYSHHTSDVGYTPLLSDKSAYYPVHVK-DILLNGV-SLGID 308
Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV----SSKRISLQGNSWKYCYNA 380
+ SG +VDSG + TF + + F K S R+ L
Sbjct: 309 TGTI-NSGRGVIVDSGTTDTFFDGKGKRAFMSAFSKAAGRDYSESRMKL----------- 356
Query: 381 SSEEMLKVPDMRLIFS 396
+SEE+ +P + +I S
Sbjct: 357 TSEELAALPVISIILS 372
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 111/444 (25%), Positives = 179/444 (40%), Gaps = 68/444 (15%)
Query: 15 LLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVAD--SWPKKNSVEYLELLLSNDWKR 72
L + +AV+F+ +++ + + +S S + + S K + +Y L LS R
Sbjct: 38 LQNAHNAVAFTPHHLNQHQRQQEALLLSSSFGIHLRSRASIQKPSHRDYKSLTLSR-LAR 96
Query: 73 QKTRVK-LQSNNN----SSRNQLLFPSEGSQTHFFGN-----------QFYWLHYTWIDI 116
RVK LQ+ + N L P+E S F N Q ++ + I
Sbjct: 97 DSARVKSLQTRLDLVLKRVSNSDLHPAE-SNAEFEANALQGPVVSGTSQGSGEYFLRVGI 155
Query: 117 GTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
G P V LD GS++ W IQCAP S Y S +DP SS+S + C
Sbjct: 156 GKPPSQAYVVLDTGSDVSW-----IQCAPCSECYQQSD----PIFDPVSSNSYSPIRCDA 206
Query: 177 PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
P CKS + C Y Y + + + G + + L ++ +V IGC
Sbjct: 207 PQCKSLDLSECRNGTCLYEVSYG-DGSYTVGEFATETVTLG--------TAAVENVAIGC 257
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF---FGD 293
G G ++ A G+ G L S P A + SFS C DS +V F
Sbjct: 258 GHNNEGLFVGAAGLLGLGGGKL---SFP-----AQVNATSFSYCLVNRDSDAVSTLEFNS 309
Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA--------LVDSGASF 343
P + E Y++G++ +G L +S F+ ++DSG +
Sbjct: 310 PLPRNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAV 369
Query: 344 TFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
T L +E+Y + F K + + +SL + CY+ SS E ++VP + F + +
Sbjct: 370 TRLRSEVYDALRDAFVKGAKGIPKANGVSL----FDTCYDLSSRESVQVPTVSFHFPEGR 425
Query: 400 SFVVRNHIFSFPENEVGDHACFSY 423
+ + P + VG CF++
Sbjct: 426 ELPLPARNYLIPVDSVGTF-CFAF 448
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 89/328 (27%), Positives = 131/328 (39%), Gaps = 53/328 (16%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y +H + IGTP + LD GS+L+W CQ C C D+ L +DPS+
Sbjct: 82 YLVH---LAIGTPPQPVQLTLDTGSDLIWTQCQPCPAC----------FDQALPYFDPST 128
Query: 166 SSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASF 219
SS+ SC LC+ +SC S K C Y Y + + ++G+L D
Sbjct: 129 SSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGFLEVDKFTFVGA 187
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
P V GCG G + G+ G G G +S+PS L K G +FS
Sbjct: 188 GASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSH 234
Query: 280 CFDENDS--GSVFFGD-------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 328
CF + S D G QST + Y++ ++ +G++ L
Sbjct: 235 CFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPV 294
Query: 329 TQSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
+S F ++DSG + T LPT +Y V F V +S +C +A
Sbjct: 295 PESEFTLKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAP 354
Query: 382 SEEMLKVPDMRLIFSKNQSFVVR-NHIF 408
VP + L F + R N++F
Sbjct: 355 LRAKPYVPKLVLHFEGATMDLPRENYVF 382
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 89/328 (27%), Positives = 131/328 (39%), Gaps = 53/328 (16%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y +H + IGTP + LD GS+L+W CQ C C D+ L +DPS+
Sbjct: 82 YLVH---LAIGTPPQPVQLTLDTGSDLIWTQCQPCPAC----------FDQALPYFDPST 128
Query: 166 SSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASF 219
SS+ SC LC+ +SC S K C Y Y + + ++G+L D
Sbjct: 129 SSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGFLEVDKFTFVGA 187
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
P V GCG G + G+ G G G +S+PS L K G +FS
Sbjct: 188 GASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSH 234
Query: 280 CFDENDS--GSVFFGD-------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 328
CF + S D G QST + Y++ ++ +G++ L
Sbjct: 235 CFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPV 294
Query: 329 TQSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
+S F ++DSG + T LPT +Y V F V +S +C +A
Sbjct: 295 PESEFALKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAP 354
Query: 382 SEEMLKVPDMRLIFSKNQSFVVR-NHIF 408
VP + L F + R N++F
Sbjct: 355 LRAKPYVPKLVLHFEGATMDLPRENYVF 382
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/301 (25%), Positives = 119/301 (39%), Gaps = 52/301 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ GTP F + LD GS++ W C+ C+ C S ++ SL + + S+ N
Sbjct: 131 VAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTYSFGSCIPSTVGNT 190
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
Y Y + TS Y D + S V
Sbjct: 191 ---------------------YNMTYGDKSTSVGNYGCDTMT--------LEPSDVFQKF 221
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSVFF 291
GCGR G + G+ DG++GLG G +S S A + FS C +EN GS+ F
Sbjct: 222 QFGCGRNNEGDF--GSGADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEENSIGSLLF 277
Query: 292 GDQGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVD 338
G++ + S F + E+ YFV + +GN L S F + ++D
Sbjct: 278 GEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTIID 337
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLI 394
SG T LP Y+ + F K ++ +S + + CYN S + + +P+ L
Sbjct: 338 SGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLH 397
Query: 395 F 395
F
Sbjct: 398 F 398
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 88/323 (27%), Positives = 133/323 (41%), Gaps = 45/323 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y LH + IGTP + LD GS+L+W CQ C C +++L YD S
Sbjct: 91 YLLH---LAIGTPPQPVQLTLDTGSDLVWTQCQPCAVC----------FNQSLPYYDASR 137
Query: 166 SSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
SS+ SC CK S + C C + YS D S++ +D + SF
Sbjct: 138 SSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAF--SYSYGDKSATIGFLD--VETVSFVA 193
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
A SV V+ GCG TG + G+ G G G +S+PS L K G + F+
Sbjct: 194 GA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVS 246
Query: 282 DENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ 334
S +F G T Q+T + Y++ ++ +G++ L +S F
Sbjct: 247 GRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFA 306
Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS-EEML 386
++DSG +FT LP +Y V +F V + C++A +
Sbjct: 307 LKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAP 366
Query: 387 KVPDMRLIFSKNQSFVVR-NHIF 408
VP + L F + R N++F
Sbjct: 367 HVPKLVLHFEGATMHLPRENYVF 389
>gi|325183199|emb|CCA17657.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 873
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/251 (23%), Positives = 109/251 (43%), Gaps = 28/251 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
HY + IG P V LD GS L PC +C+ C + DP +
Sbjct: 46 HYAELYIGIPPQRASVILDTGSGLTAFPCDKCVDCG--------------THTDP-KFDA 90
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
+K+ S + CK C + +D I +E + ++ D++ + + + +
Sbjct: 91 TKSTSINFVQCKYEEGCDTCRDNLCVIHQRYSEGSMWEAVVMQDLIWVGNVDSDRAEMIM 150
Query: 229 QSSVI---IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDEN 284
+ I GC ++TG ++ +G+MGLG+G ++ + + KA + ++ F++CF +
Sbjct: 151 RRYGIRFKFGCQTRETGLFI-TQVENGIMGLGIGRNNIATEMYKAKRVEEHKFALCFGQK 209
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT------QSGFQALV 337
V G ++ P+ + + Y + V+ IG L +SG A+V
Sbjct: 210 GGSFVIGGVDYSHHTTKIAYTPLAKHGTSNYPIEVKDVRIGGISLQVDAEHFKSGRGAIV 269
Query: 338 DSGASFTFLPT 348
DSG + T+ P+
Sbjct: 270 DSGTTDTYFPS 280
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 129/322 (40%), Gaps = 57/322 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I +GTP V L+ALD S+L W+ CQ C +C P S +DP S+S +
Sbjct: 145 IAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPV----------FDPRHSTSYGEM 194
Query: 173 SCSHPLCKS--RSSCKSLK-DPCPYIADYSTED-----TSSSGYLVDDILHLASFSKHAP 224
+ P C++ RS K C Y Y D ++S G LV++ L A +
Sbjct: 195 NYDAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVR--- 251
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
Q+ + IGCG G L GA G++GL G +S+P +A G SFS C +
Sbjct: 252 ----QAYLSIGCGHDNKG--LFGAPAAGILGLSRGQISIPHQIAFLGY-NASFSYCLVDF 304
Query: 285 DSG------SVFFGDQGPATQQSTSFLP------IGEKYDAYFVGVESYCIGNSCLTQSG 332
SG ++ FG T SF P + Y +GV + +T+
Sbjct: 305 ISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERD 364
Query: 333 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY--CYNA 380
Q ++DSG + T L Y F + ++S G S + CY
Sbjct: 365 LQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTV 424
Query: 381 SSEEML----KVPDMRLIFSKN 398
L KVP + + F+
Sbjct: 425 GGRAGLRHCVKVPAVSMHFAGG 446
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 149/348 (42%), Gaps = 52/348 (14%)
Query: 83 NNSSRNQLL----FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC 138
+N R + L FP +G+ + L+YT I +G P V +D GS++LWV C
Sbjct: 58 HNDRRGRFLQGISFPLKGNYSDL------GLYYTEIGLGNPVQKLKVIVDTGSDILWVKC 111
Query: 139 Q-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLKDP 191
C C LS + LS Y+ S+SS+S SCS PLC SRS S
Sbjct: 112 SPCRSC--LSKQ---DIIPPLSIYNLSASSTSSVSSCSDPLCTGEQAVCSRSGSNS---A 163
Query: 192 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 251
C Y Y + TS Y+ DD+ ++ ++ S + GC TGS+ D
Sbjct: 164 CAYGISYQDKSTSIGAYVKDDMHYVLQGG-----NATTSHIFFGCAINITGSW----PAD 214
Query: 252 GVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGE 309
G+MG G +VP+ +A + FS C +++ G + FG++ T+ F P+
Sbjct: 215 GIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGEEPNTTEM--VFTPLLN 272
Query: 310 KYDAYFVGVESYCIGNSCL------------TQSGFQALVDSGASFTFLPTEIYAEVVVK 357
Y V + S + + L + + ++DSG SF L T+ + +
Sbjct: 273 VTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNETGVIIDSGTSFALLATKANRILFSE 332
Query: 358 FDKLVSSK-RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 404
L ++K L+G Y + + E P++ L FS + ++
Sbjct: 333 IKNLTTAKLGPKLEGLQCFYLKSGLTVET-SFPNVTLTFSGGSTMKLK 379
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 123/315 (39%), Gaps = 51/315 (16%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
Y +H + +GTP + LD GS+L+W QCAP ++ + L DP++S
Sbjct: 92 YLVH---LAVGTPPRPVALTLDTGSDLVWT-----QCAPCRDCFH----QGLPLLDPAAS 139
Query: 167 SSSKNVSCSHPLCKS----------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
S+ + C P C++ RSS + C YI Y + + + G + D
Sbjct: 140 STYAALPCGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYG-DKSVTVGEIATDRFTF 198
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
+ + GCG G + G+ G G G S+PS L +
Sbjct: 199 GGDNGDGDSRLPTRRLTFGCGHFNKGVFQSNET--GIAGFGRGRWSLPSQLNV-----TT 251
Query: 277 FSICFD---ENDSGSVFFGDQGPATQ-------------QSTSFLPIGEKYDAYFVGVES 320
FS CF E+ S V G PA ++T L + YF+ ++
Sbjct: 252 FSYCFTSMFESKSSLVTLGG-APAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKG 310
Query: 321 YCIGNSCLTQSGFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-LQGNSWKY 376
+G + L + ++DSGAS T LP +Y V +F V ++G++
Sbjct: 311 ISVGKTRLAVPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDL 370
Query: 377 CYNASSEEMLKVPDM 391
C+ + + P +
Sbjct: 371 CFALPVTALWRRPPV 385
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 89/323 (27%), Positives = 132/323 (40%), Gaps = 45/323 (13%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y LH + IGTP + LD GS L+W CQ C C +++L YD S
Sbjct: 91 YLLH---LAIGTPPQPVQLTLDTGSVLVWTQCQPCAVC----------FNQSLPYYDASR 137
Query: 166 SSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
SS+ SC CK S + C C Y YS D S++ +D + SF
Sbjct: 138 SSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAY--SYSYGDKSATIGFLD--VETVSFVA 193
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
A SV V+ GCG TG + G+ G G G +S+PS L K G + F+
Sbjct: 194 GA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVS 246
Query: 282 DENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ 334
S +F G T Q+T + Y++ ++ +G++ L +S F
Sbjct: 247 GRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFA 306
Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS-EEML 386
++DSG +FT LP +Y V +F V + C++A +
Sbjct: 307 LKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAP 366
Query: 387 KVPDMRLIFSKNQSFVVR-NHIF 408
VP + L F + R N++F
Sbjct: 367 HVPKLVLHFEGATMHLPRENYVF 389
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/321 (23%), Positives = 129/321 (40%), Gaps = 40/321 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP F V +D GS+L WV QC+P Y +N + + P++S+S ++
Sbjct: 17 VRLGTPERVFSVIVDTGSDLTWV-----QCSPCGKCY----SQNDALFLPNTSTSFTKLA 67
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C LC + C Y Y + + ++G V D + + + Q +
Sbjct: 68 CGSALCNGLPFPMCNQTTCVYWYSYG-DGSLTTGDFVYDTITMDGINGQKQQV---PNFA 123
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----NDSGS 288
GCG GS+ A DG++GLG G +S S L + FS C + +
Sbjct: 124 FGCGHDNEGSF---AGADGILGLGQGPLSFHSQLKS--VYNGKFSYCLVDWLAPPTQTSP 178
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQS----------GFQA 335
+ FGD +LPI Y+V + +G++ L S G
Sbjct: 179 LLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGT 238
Query: 336 LVDSGASFTFLPTEIYAEVVVKFD--KLVSSKRISLQGNSWKYCYNASSEEML-KVPDMR 392
+ DSG + T L Y EV+ + + S++I + C + ++ L VP M
Sbjct: 239 IFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKID-DISRLDLCLSGFPKDQLPTVPAMT 297
Query: 393 LIFSKNQSFVVRNHIFSFPEN 413
F + ++ F + E+
Sbjct: 298 FHFEGGDMVLPPSNYFIYLES 318
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 137/360 (38%), Gaps = 60/360 (16%)
Query: 53 WPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYT 112
WP N + +E ++ D KR + + + L GS + Q++ T
Sbjct: 42 WP--NPLSRIEDIIGADQKRHSLISRKRKFKGGVKMDL-----GSGIDYGTAQYF----T 90
Query: 113 WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ +GTP F V +D GS L WV C+ +N + S S K V
Sbjct: 91 EVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKV-------KNRRVFRAEESKSFKTV 143
Query: 173 SCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSKHAP 224
C CK S S+C + PC Y DY D S++ G + + + +
Sbjct: 144 GCFTQTCKVDLMNLFSLSTCPTPSTPCSY--DYRYADGSAAQGVFAKETITVGLTNGRKA 201
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
+ +++GC +G GA DGV+GL D S S L S C
Sbjct: 202 R---LRGLLVGCSSSFSGQSFQGA--DGVLGLAFSDFSFTS--TATSLFGAKLSYCLVDH 254
Query: 282 --DENDSGSVFFG--------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 328
++N S + FG P I Y +G+ IG+ L
Sbjct: 255 LSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGIS---IGDDMLDIP 311
Query: 329 TQ-----SGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASS 382
TQ +G ++DSG S T L Y VV + LV KR+ +G +YC++++S
Sbjct: 312 TQVWDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTS 371
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 143/345 (41%), Gaps = 55/345 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T I +GTP L+ LD GS+++WV QCAP Y +++ +DP SSS
Sbjct: 129 YFTKIGVGTPATQALMVLDTGSDVVWV-----QCAPCRRCY----EQSGPVFDPRRSSSY 179
Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
V C LC+ S C + C Y Y + + ++G V + L A ++ A
Sbjct: 180 GAVGCGAALCRRLDSGGCDLRRGACMYQVAYG-DGSVTAGDFVTETLTFAGGARVA---- 234
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDS 286
V +GCG G ++ A G+ G +S P+ +++ SFS C D S
Sbjct: 235 ---RVALGCGHDNEGLFVAAAGLLGLG---RGGLSFPTQISR--RYGRSFSYCLVDRTSS 286
Query: 287 G-----------SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS---CLT 329
G +V FG G S SF P+ Y+V + +G + +
Sbjct: 287 GAGAAPGSHRSSTVSFG-AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVA 345
Query: 330 QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-RISLQGNS-WKYCY 378
+S + +VDSG S T L Y+ + F + R+S G S + CY
Sbjct: 346 ESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCY 405
Query: 379 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
+ ++KVP + + F+ + + P + G CF++
Sbjct: 406 DLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTF-CFAF 449
>gi|325183198|emb|CCA17656.1| aspartyl protease family A01B putative [Albugo laibachii Nc14]
Length = 656
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/250 (26%), Positives = 112/250 (44%), Gaps = 29/250 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
HY WI +GTP + +D GS + PC C QC +T + ++ + SSS
Sbjct: 95 HYAWIYVGTPPQRVSIIIDTGSGMTAFPCSGCDQCG-----NHTDI-----PFNTNLSSS 144
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL---ASFSKHAPQ 225
+ +SC+H S + C + +PC E +S S +++DI++L AS
Sbjct: 145 IQPISCNHRTYFSCAYCTNPTEPCRTY----MEGSSWSAKVMEDIVYLGDVASAKDTNLH 200
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL-GLGDVSVPSLLAKAGLIQNSFSICFDEN 284
S + + GC K+TG ++ A DG+MG+ G+ V L + + N+F++CF
Sbjct: 201 HSYSTRYMFGCQNKETGLFIPQVA-DGIMGIHNNGNDIVTKLFREKKIPSNTFTLCFSPR 259
Query: 285 DSGSVFFGDQGPATQQ-STSFLPI----GEKYDAYF---VGVESYCIGNSCLTQSGFQAL 336
G G + ++ I GE Y A F + V + I + ++ +
Sbjct: 260 -GGYFALGAMDTSRHAGEVTYARINDAYGENYYAVFMTDIRVGGHSIDIDMKATNSYRYI 318
Query: 337 VDSGASFTFL 346
VDSG + + +
Sbjct: 319 VDSGTTNSII 328
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 147/360 (40%), Gaps = 75/360 (20%)
Query: 107 YWLHYTWIDIGTP--NVSFLVALDAGSNLLWVPC----QCIQCAPLSASYYTSLDRNLSE 160
Y H + GTP +SFLV D GS+++W PC C C S+ + + +
Sbjct: 75 YGGHSISLSFGTPPQKLSFLV--DTGSDVVWAPCTTDYTCTNC-----SFSAADPKKVPI 127
Query: 161 YDPSSSSSSKNVSCSHPLCKSR---------SSCKSLKDPCPYIADYSTE--DTSSSGYL 209
+DP SSSSK + C +P C S C C Y YST+ +SSGY
Sbjct: 128 FDPKLSSSSKILDCRNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGASSGYF 187
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
+ + L P+ +++ + ++GC T S + D + G G S+P +
Sbjct: 188 LLENLKF-------PRKTIR-NFLLGC----TTSAARELSSDALAGFGRSMFSLPIQMG- 234
Query: 270 AGLIQNSFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYDA----YFVGVE 319
F+ C + +D SG + D + S+ P + A Y +GV+
Sbjct: 235 ----VKKFAYCLNSHDYDDTRNSGKLIL-DYRDGKTKGLSYTPFLKSPPASAFYYHLGVK 289
Query: 320 SYCIGNSCLT------------QSGFQALVDSG-ASFTFLPTEIYAEVVVKFDKLVSSKR 366
IGN L +SG ++DSG ++ ++ V + K +S R
Sbjct: 290 DIKIGNKLLRIPSKYLAPGSDGRSG--VIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYR 347
Query: 367 ISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEVGDHACF 421
SL+ + CYN + + +K+P + F + VV +N+ P+ + ACF
Sbjct: 348 RSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQESL---ACF 404
>gi|289740593|gb|ADD19044.1| aspartyl protease [Glossina morsitans morsitans]
Length = 394
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 141/329 (42%), Gaps = 63/329 (19%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL-SEYDPSSSSS 168
+Y I IGTP+ F V D GS+ LWVP + QC Y+T++ + ++YD + SSS
Sbjct: 75 YYGPISIGTPSQDFKVVFDTGSSNLWVPSK--QC------YFTNIACLMHNKYDANKSSS 126
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
K K+ + Y + S SGYL D +++A Q+
Sbjct: 127 YK------------------KNGTEFAIHYGS--GSLSGYLSTDTVNIAGLGIEG-QTFA 165
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSICF 281
++ + G GA DG++GLG ++V + + + GLI Q FS
Sbjct: 166 EA-------LSEPGLVFIGAKFDGILGLGYSSIAVDGVKPPFYQMYEQGLISQPVFSFYL 218
Query: 282 DEN----DSGSVFFGDQGPATQQST-SFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQA 335
+ + + G + FG P + ++LP+ K AY+ + ++S +GN L Q G Q
Sbjct: 219 NRDPKAPEGGEIIFGGSDPNHYKGEFTYLPVTRK--AYWQIKMDSASMGNLNLCQGGCQV 276
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ D+G S LP +K + I + G Y + E + K+P +R +
Sbjct: 277 IADTGTSLIALP----PSEATSINKAIGGTPI-MGGQ-----YMVACENIPKLPVIRFVL 326
Query: 396 SKNQSFVVRNHIFSFPENEVGDHACFSYF 424
++F + + ++G C S F
Sbjct: 327 G-GKTFELEGKDYILRIAQMGKTICLSGF 354
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 130/355 (36%), Gaps = 78/355 (21%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ I +GTP S L+ D GS+L+WV C C C+ S S + P SSS
Sbjct: 88 YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPS---------SAFLPRHSSS 138
Query: 169 SKNVSCSHPLCK-----SRSSCK--SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
C P C+ C L PC ++ Y+ + + SSG+ + L S S
Sbjct: 139 FSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYA-DGSLSSGFFSKETTTLKSLSG 197
Query: 222 HAPQSSVQ-SSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSF 277
S + + GCG + +G + GA GVMGLG G +S S L + N F
Sbjct: 198 ----SEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRR--FGNKF 251
Query: 278 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----------------------Y 314
S C + + TSFL IG + Y
Sbjct: 252 SYCLMDYT-----------LSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFY 300
Query: 315 FVGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 364
++ + S I L Q +VDSG + T+L Y EV+ + V
Sbjct: 301 YITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKL 360
Query: 365 KRISLQGNSWKYCYNASSE-EMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEVG 416
+ + C NAS E +P +R F RN+ F E E G
Sbjct: 361 PNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNY---FLETEEG 412
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 144/354 (40%), Gaps = 48/354 (13%)
Query: 103 GNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
G +F L+Y + +G+ N+S +V D GS+L WV QC P + Y ++N +
Sbjct: 114 GIKFQTLNYIVTMGLGSQNMSVIV--DTGSDLTWV-----QCEPCRSCY----NQNGPLF 162
Query: 162 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDP-----CPYIADYSTEDTSSSGYLVDDILHL 216
PS+S S + + C+ C+S DP C Y+ +Y + + +SG L + L
Sbjct: 163 KPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYG-DGSYTSGELGIEKLGF 221
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
S S+ + GCGR G + G+MGLG ++S+ S
Sbjct: 222 GGISV--------SNFVFGCGRNNKGLF---GGASGLMGLGRSELSMIS--QTNATFGGV 268
Query: 277 FSICFDEND----SGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGNSC 327
FS C D SGS+ G+Q + T LP + + Y + + +G
Sbjct: 269 FSYCLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVS 328
Query: 328 L--TQSGF---QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
L S F ++DSG + L +Y + KF + S + + C+N +
Sbjct: 329 LHVQASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTG 388
Query: 383 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTL--EYNFTGIL 434
+ + +P + + F N V + E C + +L EY GI+
Sbjct: 389 YDQVNIPTISMYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEM-GII 441
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 125/297 (42%), Gaps = 52/297 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ I IGTP + LV D GS+L+WV CQ C +C + ++P SS+
Sbjct: 94 YFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPI----------FNPKQSST 143
Query: 169 SKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSK 221
+ V C C + + S C Y YS D S + GYL + + S
Sbjct: 144 YRRVLCETRYCNALNSDMRACSAHGFFKACGY--SYSYGDHSFTMGYLATERFIIGS--- 198
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL-IQNSFSIC 280
+S+Q + GCG G++ + G+ SL+++ G I N FS C
Sbjct: 199 --TNNSIQ-ELAFGCGNSNGGNF-----DEVGSGIVGLGGGSLSLISQLGTKIDNKFSYC 250
Query: 281 F-----DENDS-GSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 330
N S G + FGD G T ST + E Y++ +E+ +GN L
Sbjct: 251 LVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVS-KEPETFYYLTLEAISVGNERLAY 309
Query: 331 SGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 378
+ ++DSG + TFL +++Y ++ + +K V +R+S + C+
Sbjct: 310 ENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSICF 366
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 80/326 (24%), Positives = 127/326 (38%), Gaps = 48/326 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ +GTP FL+ D GS+L WV C+ A S S S + P S +
Sbjct: 97 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156
Query: 170 KNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
+SC+ C S ++C + PC Y DY +D S++ V + S
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAY--DYRYKDGSAARGTVGTESATIALSGREE 214
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
+ + +++GC TG + A DGV+ LG +S S A FS C
Sbjct: 215 RKAKLKGLVLGCSSSYTGPSFE--ASDGVLSLGYSGISFASHAAS--RFGGRFSYCLVDH 270
Query: 282 --DENDSGSVFFGDQGPATQ----------------QSTSFLPIGEKYDAYFVGVESYCI 323
N + + FG PA + T L Y V +++ +
Sbjct: 271 LSPRNATSYLTFGPN-PAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISV 329
Query: 324 GNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSW 374
L ++G ++DSG S T L Y VV K L R+++ + +
Sbjct: 330 AGEFLKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTM--DPF 387
Query: 375 KYCYNASS----EEMLKVPDMRLIFS 396
+YCYN +S + + VP M + F+
Sbjct: 388 EYCYNWTSPSGKDADVAVPKMAVHFA 413
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 87/347 (25%), Positives = 138/347 (39%), Gaps = 60/347 (17%)
Query: 114 IDIGTP---NVSF--LVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
I +GTP + SF L++ D GS++ W+ C C +C Y L SS
Sbjct: 129 ITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLK----------SS 178
Query: 168 SSKNVSCSHPLCKSRSS---CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
S+ +V C P C++ S C + C Y +Y +S+ + V+ + P
Sbjct: 179 SASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTF--------P 230
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
V IGCG G + AA G++GLG G +S PS + AG SFS C
Sbjct: 231 PGVRVPGVAIGCGSDNQGLFPAPAA--GILGLGRGSLSFPSQI--AGRYGRSFSYCLAGQ 286
Query: 285 DSG----SVFFGDQGPA------TQQSTSFLPIGEKYDAYFVGVESYCIGN---SCLTQS 331
+G ++ FG A T L Y Y+VG+ +G +T+S
Sbjct: 287 GTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTES 346
Query: 332 GFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKY---C 377
+ +VDSG + T L YA F ++ + K + G + + C
Sbjct: 347 DLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAF-RVAAVKELGWPSPGGPFAFFDTC 405
Query: 378 YNA-SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
Y++ M KVP + + F+ + + P + CF++
Sbjct: 406 YSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAF 452
>gi|298707682|emb|CBJ25999.1| aspartyl protease [Ectocarpus siliculosus]
Length = 547
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 130/318 (40%), Gaps = 32/318 (10%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y H+ +I GTP V ++ GS+ PC +C C + Y +DPS
Sbjct: 105 YGTHFAYIYAGTPPQRASVIINTGSHFSAFPCSECRSCGNHTDPY----------WDPSQ 154
Query: 166 SSSSKNVSCSH-PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA----SFS 220
SS++ V+C C C+S K C + ++ TE +S VDD+L + S S
Sbjct: 155 SSTAHIVTCDETERCHGAYKCQSDKK-C-VLREHYTEGSSWRAKQVDDLLWVGERTLSDS 212
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSI 279
+ S+ GC TG + A DG+MGL ++ + LA AG I + FS+
Sbjct: 213 QKHDDSAFSVDFTFGCIESLTGLFKTQLA-DGIMGLNADSRTLITQLATAGKISERKFSL 271
Query: 280 CFDENDSGSVFFGDQGPATQQSTS---FLPIGEKYDAYFVGVESYCIGNSCLT------Q 330
CF E G++ G P + S + P + A V V + +T Q
Sbjct: 272 CFSET-GGTMVIGGYDPLLNKPGSEMQYTPSTGEISAPTVKVTDVTLNGVSITTDASVFQ 330
Query: 331 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
G + SG + T+LP + ++ S + + N ++C ++ E+ +P
Sbjct: 331 KGTGIKIVSGTTNTYLPRAVAEGFSAAWEAATGSPYATCKMN--EFCMTRTTVELEALPV 388
Query: 391 MRLIFSKNQSFVVRNHIF 408
+ + VR +
Sbjct: 389 LMIHMDGGVEVNVRPEAY 406
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 88/331 (26%), Positives = 153/331 (46%), Gaps = 45/331 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+++ I +GTP + LD GS++ W IQC P S Y ++ ++P+SSS+
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNW-----IQCEPCSDCY----QQSDPVFNPTSSSTY 212
Query: 170 KNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
K+++CS P C S+C+S K C Y Y + + + G L D + + K
Sbjct: 213 KSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGNSGKI----- 264
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
+ V +GCG G + A ++GLG G +S+ + + SFS C + DSG
Sbjct: 265 --NDVALGCGHDNEGLFTGAAG---LLGLGGGALSITNQMKA-----TSFSYCLVDRDSG 314
Query: 288 ---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQ---------SGFQ 334
S+ F + +T+ L +K D Y+VG+ + +G + SG
Sbjct: 315 KSSSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSG 374
Query: 335 ALV-DSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYNASSEEMLKVPDMR 392
++ D G + T L T+ Y + F KL ++ K+ + + + CY+ SS +KVP +
Sbjct: 375 GVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTVA 434
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F+ +S + + P ++ G CF++
Sbjct: 435 FHFTGGKSLDLPAKNYLIPVDDNGTF-CFAF 464
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 93/361 (25%), Positives = 151/361 (41%), Gaps = 44/361 (12%)
Query: 56 KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQL-----LFPSEGSQTHFFGNQFYWLH 110
K + E L + + KL S NSS +L P+ S + G Y +
Sbjct: 76 KEKPSHEETLGRDQLRAANIHAKLSSPRNSSAKELQQSGVTIPT--SSGYSLGTPEYVI- 132
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
+ +GTP V+ ++++D GS++ WV QCAP +A +S L +DP+ S++
Sbjct: 133 --TVSLGTPAVTQVMSIDTGSDVSWV-----QCAPCAAQSCSSQKDKL--FDPAKSATYS 183
Query: 171 NVSCSHPLCKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
SCS C + L C YI Y + ++++G D L L + S
Sbjct: 184 AFSCSSAQCAQLGGEGNGCLNSHCQYIVKY-VDHSNTTGTYGSDTLGLTT-------SDA 235
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSG 287
+ GC + G DG+MGLG GD SL+++ A +FS C + S
Sbjct: 236 VKNFQFGCSHRANGFV---GQLDGLMGLG-GDTE--SLVSQTAATYGKAFSYCLPPSSSS 289
Query: 288 SVFFGDQGPATQQST----SFLPIGEKYDAYFVGV--ESYCIGNSCLT--QSGFQ--ALV 337
+ F G A ++ S P+ F GV ++ + + L S F ++V
Sbjct: 290 AGGFLTLGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFSGASVV 349
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
DSG T LP Y + F K + + + C++ S + ++VP + L FS+
Sbjct: 350 DSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSR 409
Query: 398 N 398
Sbjct: 410 G 410
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 88/335 (26%), Positives = 137/335 (40%), Gaps = 50/335 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP + L+A+D ++ W+PC C CA + + P S++ KNVSC
Sbjct: 84 IGTPPQTLLLAMDTSNDAAWIPCTACDGCAS-------------TLFAPEKSTTFKNVSC 130
Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
+ P CK + C + Y + +S + LV D + LA + P S
Sbjct: 131 AAPECKQVPNPGCGVSSCNFNLTYGS--SSIAANLVQDTITLA--TDPVP------SYTF 180
Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVF 290
GC K TG+ A P G++GLG G +S+ S L Q++FS C N SGS+
Sbjct: 181 GCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLNFSGSLR 235
Query: 291 FGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDS 339
G P + T L + Y+V +E+ +G + +G + DS
Sbjct: 236 LGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDS 295
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
G FT L +Y V +F + V K + CYN + VP + IF+
Sbjct: 296 GTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVP----IVVPTITFIFTGMN 351
Query: 400 SFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
+ +++I + G C + N +L
Sbjct: 352 VTLPQDNILI--HSTAGSTTCLAMAGAPDNVNSVL 384
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 86/314 (27%), Positives = 125/314 (39%), Gaps = 52/314 (16%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y +H + IGTP + LD GS+L+W CQ C C D+ L +DPS+
Sbjct: 82 YLVH---LAIGTPPQPVQLTLDTGSDLIWTQCQPCPAC----------FDQALPYFDPST 128
Query: 166 SSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASF 219
SS+ SC LC+ +SC S K C Y Y + + ++G+L D
Sbjct: 129 SSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGFLEVDKFTFVGA 187
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
P V GCG G + G+ G G G +S+PS L K G +FS
Sbjct: 188 GASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSH 234
Query: 280 CFDENDS--GSVFFGD-------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 328
CF + S D G QST + Y++ ++ +G++ L
Sbjct: 235 CFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPV 294
Query: 329 TQSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 381
+S F ++DSG + T LPT +Y V F V +S +C +A
Sbjct: 295 PESEFALKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAP 354
Query: 382 SEEMLKVPDMRLIF 395
VP + L F
Sbjct: 355 LRAKPYVPKLVLHF 368
>gi|301119611|ref|XP_002907533.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
gi|262106045|gb|EEY64097.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
Length = 681
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/302 (24%), Positives = 134/302 (44%), Gaps = 36/302 (11%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
HYTW+ GTP V D GS L+ PC C C ++T + ++SS+
Sbjct: 67 HYTWVYAGTPPQRASVIADTGSALMAFPCSGCDGCG-----HHTD-----QPFQAANSST 116
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL---ASFSKHAPQ 225
+++C+ C D C Y E +S +V+DI++L +SF +
Sbjct: 117 LVHITCAQKSLFQCKECHVQSDTCGISQSY-MEGSSWKASVVEDIVYLGGESSFDDKEMR 175
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDEN 284
+ + GC + G ++ A DG+MGL + + + L + I N FS+CF EN
Sbjct: 176 NRYGTHFQFGCQSSEKGLFVTQVA-DGIMGLSNTENHIIAKLHRENKIASNLFSLCFTEN 234
Query: 285 DSGSVFFGDQGPATQQ-STSFLPI------GEKYDAYF----VGVESYCIGNSCLTQSGF 333
G++ G A + S++ + G Y+ + +G +S T+ +
Sbjct: 235 -GGTMSVGQPHKAAHRGEISYVKVIADRSAGHFYNVHMKDIRIGGKSINAKEEAYTRGHY 293
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
+VDSG + ++LP + E + F ++ + R GNS C +++++ +P ++L
Sbjct: 294 --IVDSGTTDSYLPRALKTEFLQMFKEI--AGRDYQVGNS---CKGFTNKDLASLPTIQL 346
Query: 394 IF 395
+
Sbjct: 347 VM 348
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 92/328 (28%), Positives = 136/328 (41%), Gaps = 54/328 (16%)
Query: 117 GTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
GTP + L+ +D GS++ W IQC P S Y+ +D ++P SSS K++SC
Sbjct: 145 GTPAKNSLLIIDTGSDVTW-----IQCKPCS-DCYSQVD---PIFEPQQSSSYKHLSCLS 195
Query: 177 PLCKSRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
C ++ + C Y +Y + + S G + L L S S S G
Sbjct: 196 SACTELTTMNHCRLGGCVYEINYG-DGSRSQGDFSQETLTLGSDSF--------PSFAFG 246
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL-AKAGLIQNSFSIC---FDENDSGSVFF 291
CG TG + A G++GLG +S PS +K G FS C F + S F
Sbjct: 247 CGHTNTGLFKGSA---GLLGLGRTALSFPSQTKSKYG---GQFSYCLPDFVSSTSTGSFS 300
Query: 292 GDQG--PATQQSTSFLPI--GEKYDA-YFVGVESYCIGN-------SCLTQSGFQALVDS 339
QG PAT +F+P+ Y + YFVG+ +G + L + G +VDS
Sbjct: 301 VGQGSIPAT---ATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGG--TIVDS 355
Query: 340 GASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
G T L + Y + F L S+K S+ CY+ SS +++P + F
Sbjct: 356 GTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSI----LDTCYDLSSYSQVRIPTITFHF 411
Query: 396 SKNQSFVVRNHIFSFPENEVGDHACFSY 423
N V F G C ++
Sbjct: 412 QNNADVAVSAVGILFTIQSDGSQVCLAF 439
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 91/340 (26%), Positives = 135/340 (39%), Gaps = 72/340 (21%)
Query: 106 FYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPS 164
F LH+T + IGTP + LD GS+L+W C+ + T R YDP+
Sbjct: 84 FGRLHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKL---------FDTRQHREKPLYDPA 134
Query: 165 SSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
SSS C LC++ S +C ++ C Y +Y + T G L + +F
Sbjct: 135 KSSSFAAAPCDGRLCETGSFNTKNCS--RNKCIYTYNYGSATT--KGELASETF---TFG 187
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
+H V S+ GCG+ +GS L GA+ G++G+ +S+ S L FS C
Sbjct: 188 EH---RRVSVSLDFGCGKLTSGS-LPGAS--GILGISPDRLSLVSQLQIP-----RFSYC 236
Query: 281 ----FDENDSGSVFFG---------DQGPATQQSTSFLPIGEKYDAYF------------ 315
D N + +FFG GP S P G Y Y
Sbjct: 237 LTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRL 296
Query: 316 -VGVESYCIGNSCLTQSGFQALVDSGASFTFLPT---EIYAEVVVKFDKLVSSKRISLQG 371
V V S+ IG SG VDSG + LP+ E E +V+ KL G
Sbjct: 297 NVPVSSFAIGRD---GSG-GTFVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATD-HG 351
Query: 372 NSWKYCYN------ASSEEMLKVPDMRLIFSKNQSFVVRN 405
++ C+ + E ++VP + F + ++R
Sbjct: 352 YEYELCFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRR 391
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 137/331 (41%), Gaps = 46/331 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++T + +G P F + LD GS++ W+ CQ C C Y D +DP++SS+
Sbjct: 161 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDC-------YQQTD---PIFDPTASST 210
Query: 169 SKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
V+C C S SSC+S + C Y +Y Y D A+ S S
Sbjct: 211 YAPVTCQSQQCSSLEMSSCRSGQ--CLYQVNY-----GDGSYTFGD---FATESVSFGNS 260
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+V +GCG G + V GL + L L SFS C DS
Sbjct: 261 GSVKNVALGCGHDNEGLF--------VGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDS 312
Query: 287 G---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSGFQ------ 334
++ F T+ L K D Y+VG+ +G ++ +S F+
Sbjct: 313 AGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGN 372
Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
+VD G + T L T+ Y + F ++ + +++ + CY+ S + ++VP +
Sbjct: 373 GGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVS 432
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F+ +S+ + + P + G + CF++
Sbjct: 433 FHFADGKSWNLPAANYLIPVDSAGTY-CFAF 462
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 152/351 (43%), Gaps = 74/351 (21%)
Query: 93 PSEGSQTHFFGNQFYWLHYTWID--IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASY 150
PS S + F ++F + I IGTP + + LD GS L W+ C + P
Sbjct: 53 PSPSSPPYNFRSRFKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP----- 107
Query: 151 YTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDT 203
+ + +DPS SSS + CSHPLCK R +SC S + C Y Y+ + T
Sbjct: 108 -----KPKTSFDPSLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGT 160
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
+ G LV + + +FS + + +I+GC + + G++G+ G +
Sbjct: 161 FAEGNLVKEKI---TFS----NTEITPPLILGCATESSDD-------RGILGMNRGRL-- 204
Query: 264 PSLLAKAGLIQNSFSICFDEN-----DSGSVFFGDQG-------------PATQQSTSFL 305
S +++A + + S+ I N +GS + GD P +Q+ +
Sbjct: 205 -SFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLD 263
Query: 306 PIGEKYDAYFVGVESYCIGNSCLTQSGF--------QALVDSGASFTFLPTEIY----AE 353
P+ Y +G+ + + ++ S F Q +VDSG+ FT L Y AE
Sbjct: 264 PLA--YTVPMIGIR-FGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAE 320
Query: 354 VVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVV 403
++ + + + K+ + G + C++ + + + + D+ +F++ +V
Sbjct: 321 IMTRVGRRL--KKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGVEILV 369
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 124/287 (43%), Gaps = 63/287 (21%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IG P V + +D GS+L+W C+ C +C D+ +DP SSS V
Sbjct: 3 LSIGNPAVKYSAIVDTGSDLIWTQCKPCTEC----------FDQPTPIFDPEKSSSYSKV 52
Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
CS LC + RS+C KD C Y+ Y + +S+ G L + ++S+ S
Sbjct: 53 GCSSGLCNALPRSNCNEDKDACEYLYTYG-DYSSTRGLLATETFTFED------ENSI-S 104
Query: 231 SVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DEND 285
+ GCG + G DG + G++GLG G +S+ S L + FS C D
Sbjct: 105 GIGFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLK-----ETKFSYCLTSIEDSEA 156
Query: 286 SGSVFFGD-------------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--Q 330
S S+F G G T ++ S L ++ Y++ ++ +G L+ +
Sbjct: 157 SSSLFIGSLASGIVNKTGASLDGEVT-KTMSLLRNPDQPSFYYLELQGITVGAKRLSVEK 215
Query: 331 SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 369
S F+ ++DSG + T+L E K K + R+SL
Sbjct: 216 STFELAEDGTGGMIIDSGTTITYL-----EETAFKVLKEEFTSRMSL 257
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/321 (24%), Positives = 129/321 (40%), Gaps = 40/321 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ + IG P+ +F + +D GS++ W+ C+ C C Y +D +DP+SSSS
Sbjct: 160 YFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDC-------YQQVD---PIFDPASSSS 209
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
+ C P C++ D C Y Y Y V D A+ + S
Sbjct: 210 FSRLGCQTPQCRNLDVFACRNDSCLYQVSY-----GDGSYTVGD---FATETVSFGNSGS 261
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DEND 285
V IGCG G ++ A G+ G L SL ++ + +SFS C D D
Sbjct: 262 VDKVAIGCGHDNEGLFVGAAGLIGLGGGPL------SLTSQ--IKASSFSYCLVNRDSVD 313
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA-------- 335
S ++ F P+ + + Y+VG+ +G L S F+
Sbjct: 314 SSTLEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGI 373
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+VD G + T L T+ Y + F KL + + CYN SS ++VP + +F
Sbjct: 374 IVDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLF 433
Query: 396 SKNQSFVVRNHIFSFPENEVG 416
+S + + P + G
Sbjct: 434 DGGKSLPLPPSNYLIPVDSAG 454
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 136/332 (40%), Gaps = 45/332 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
+++ + IG+P + LD GS++ WV CQ C C Y D +DPS S+S
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSAS 218
Query: 169 SKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
VSC P C+ ++C++ C Y Y + + + G + L L S
Sbjct: 219 YAAVSCDSPRCRDLDTAACRNATGACLYEVAYG-DGSYTVGDFATETLTLG-------DS 270
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+ ++V IGCG G ++ A + G L S PS ++ ++FS C + DS
Sbjct: 271 TPVTNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFSYCLVDRDS 322
Query: 287 ---GSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL-----------TQS 331
++ FG G T+ L + Y+V + +G L T
Sbjct: 323 PAASTLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSG 382
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
+VDSG + T L + YA + F + S + + + CY+ S ++VP +
Sbjct: 383 SGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAV 442
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
L F + + + P + G + C ++
Sbjct: 443 SLRFEGGGALRLPAKNYLIPVDGAGTY-CLAF 473
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 159/360 (44%), Gaps = 75/360 (20%)
Query: 77 VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW---IDIGTPNVSFLVALDAGSNL 133
+++Q+N++ S +L F + S+T G + + T + IGTP + + LD GS L
Sbjct: 34 LRIQNNHHISTRRL-FSNSSSKTT--GKLLFHHNVTLTASLTIGTPPQNITMVLDTGSEL 90
Query: 134 LWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK---D 190
W+ C+ +TS+ ++P +S + + CS CK+R+S +L D
Sbjct: 91 SWLRCK-------KEPNFTSI------FNPLASKTYTKIPCSSQTCKTRTSDLTLPVTCD 137
Query: 191 P---CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYL-D 246
P C +I Y+ + +S G+L + S ++ A + GC + S +
Sbjct: 138 PAKLCHFIISYA-DASSVEGHLAFETFRFGSLTRPA--------TVFGCMDSGSSSNTEE 188
Query: 247 GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFGDQ----------G 295
A G+MG+ G + S + + G FS C DS G + G+
Sbjct: 189 DAKTTGLMGMNRGSL---SFVNQMGF--RKFSYCISGLDSTGFLLLGEARYSWLKPLNYT 243
Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSGFQALVDSGASFT 344
P Q ST LP ++ AY V +E + N L T +G Q +VDSG FT
Sbjct: 244 PLVQISTP-LPYFDRV-AYSVQLEGIKVNNKVLPLPKSVFVPDHTGAG-QTMVDSGTQFT 300
Query: 345 FLPTEIYAEVVVKF-------DKLVSSKRISLQGNSWKYCY--NASSEEMLKVPDMRLIF 395
FL +Y+ + +F ++++ + QG + CY +++S + +P ++L+F
Sbjct: 301 FLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQG-AMDLCYLIDSTSSTLPNLPVVKLMF 359
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 147/361 (40%), Gaps = 47/361 (13%)
Query: 56 KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
K + L L D R KT L + N +R S +Q ++T +
Sbjct: 76 KTPSQLFHLRLERDAARVKTLTHLAAATNKTRPANPGSGFSSSVVSGLSQGSGEYFTRLG 135
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP + LD GS+++W+ C+ C +C Y+ D+ +DPS S S + C
Sbjct: 136 VGTPPKYLYMVLDTGSDVVWLQCKPCTKC-------YSQTDQ---IFDPSKSKSFAGIPC 185
Query: 175 SHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
PLC+ S C + C Y Y + + + + +F + A V
Sbjct: 186 YSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETL----TFRRAA-----VPRV 236
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS----GS 288
IGCG G ++ A ++GLG G +S P+ N FS C + + S
Sbjct: 237 AIGCGHDNEGLFVGAAG---LLGLGRGGLSFPTQTGTR--FNNKFSYCLTDRTASAKPSS 291
Query: 289 VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ-------- 334
+ FGD A ++ F P+ K D Y+V + +G + ++ S F+
Sbjct: 292 IVFGDS--AVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGG 349
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
++DSG S T L Y + F S + + + + + CY+ S +KVP + L
Sbjct: 350 VIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLH 409
Query: 395 F 395
F
Sbjct: 410 F 410
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 85/306 (27%), Positives = 131/306 (42%), Gaps = 34/306 (11%)
Query: 102 FGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLS 159
G L Y + IG+P V+ +++D GS++ WV C+ C QC ++ +D S
Sbjct: 113 LGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQC-------HSEVD---S 162
Query: 160 EYDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 215
+DPSSSS+ SCS C +S+ + C YI +Y +++ D L
Sbjct: 163 LFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTG-TYSSDTLT 221
Query: 216 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 275
L SS + GC + ++G + D DG+MGLG G S+ S AG
Sbjct: 222 LG--------SSAMTDFQFGCSQSESGGFNDQT--DGLMGLGGGAQSLAS--QTAGTFGT 269
Query: 276 SFSICFDENDSGSVFFG-DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSG 332
+FS C S F G + T L + Y V +ES +G+ L S
Sbjct: 270 AFSYCLPPTSGSSGFLTLGTGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSV 329
Query: 333 FQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 390
F A L+DSG T LP Y+ + F + + C++ S + + +P
Sbjct: 330 FSAGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPT 389
Query: 391 MRLIFS 396
+ L+FS
Sbjct: 390 VTLVFS 395
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 90/353 (25%), Positives = 154/353 (43%), Gaps = 64/353 (18%)
Query: 71 KRQKTRVK-------LQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
KR K+R++ S+ S +QL P GN Y + + IGTP VS+
Sbjct: 71 KRGKSRLQKLNAMVLAASSTPDSEDQLEAPIHA------GNGEYLIE---LAIGTPPVSY 121
Query: 124 LVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
LD GS+L+W QC P + Y + +DP SSS VSC LC +
Sbjct: 122 PAVLDTGSDLIWT-----QCKPCTRCY----KQPTPIFDPKKSSSFSKVSCGSSLCSALP 172
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
S + D C Y+ Y + + + G L + +F K + SV ++ GCG G
Sbjct: 173 S-STCSDGCEYVYSYG-DYSMTQGVLATETF---TFGKSKNKVSVH-NIGFGCGEDNEGD 226
Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQ 300
+ A+ G++GLG G +S+ S L + FS C D+ + G G
Sbjct: 227 GFEQAS--GLVGLGRGPLSLVSQLK-----EQRFSYCLTPIDDTKESVLLLGSLGKVKDA 279
Query: 301 ----STSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFL 346
+T L + Y++ +E+ +G++ L+ +S F+ ++DSG + T++
Sbjct: 280 KEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYV 339
Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYN-ASSEEMLKVPDMRLIF 395
+ Y + +F +S +++L S C++ S +++P +L+F
Sbjct: 340 QQKAYEALKKEF---ISQTKLALDKTSSTGLDLCFSLPSGSTQVEIP--KLVF 387
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 82/357 (22%), Positives = 154/357 (43%), Gaps = 75/357 (21%)
Query: 93 PSEGSQTHFFGNQFYWLHYTWID--IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASY 150
PS S + F ++F + I IGTP + + LD GS L W+ C + P
Sbjct: 53 PSPSSPPYNFRSRFKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP----- 107
Query: 151 YTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDT 203
+ + +DPS SSS + CSHPLCK R +SC S + C Y Y+ + T
Sbjct: 108 -----KPKTSFDPSLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGT 160
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
+ G LV + + ++ + + +I+GC + + G++G+ G +
Sbjct: 161 FAEGNLVKEKITFSN-------TEITPPLILGCATESSDD-------RGILGMNRGRL-- 204
Query: 264 PSLLAKAGLIQNSFSICFDEN-----DSGSVFFGDQG-------------PATQQSTSFL 305
S +++A + + S+ I N +GS + GD P +Q+ +
Sbjct: 205 -SFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLD 263
Query: 306 PIGEKYDAYFVGVESYCIGNSCLTQSGF--------QALVDSGASFTFLPTEIY----AE 353
P+ Y +G+ + + ++ S F Q +VDSG+ FT L Y AE
Sbjct: 264 PLA--YTVPMIGIR-FGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAE 320
Query: 354 VVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKN-QSFVVRNHIF 408
++ + + + K+ + G + C++ + + + + D+ +F++ + FV + +
Sbjct: 321 IMTRVGRRL--KKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGVEIFVPKERVL 375
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 103/401 (25%), Positives = 150/401 (37%), Gaps = 78/401 (19%)
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRN--QLLFP-SEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
LL + R +R + Q RN Q+ P S GS Y L +T +V
Sbjct: 45 LLKSTSSRSASRFQHQHQKRHLRNRHQVSLPLSPGSD--------YTLSFTLNSNPPQHV 96
Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS 181
S LD GS+L+W PC+ +C + + S P SS++++V C C +
Sbjct: 97 SLY--LDTGSDLVWFPCKPFECILCEGK---AENTTASTPPPRLSSTARSVHCKSSACSA 151
Query: 182 RSSCKSLKDPCPYIADYSTEDTSSS----------------GYLVDDILHLASFSKHAPQ 225
S D C IAD E +S G LV + H + A
Sbjct: 152 AHSNLPTSDLCA-IADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATP 210
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDEN 284
S + GC A P GV G G G +S+P+ LA A + N FS C +
Sbjct: 211 SLSLHNFTFGCAHTAL------AEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSH 264
Query: 285 DSGS--------VFFGDQGPATQQS---------TSFLPIGEKYDAYFVGVESYCIGNSC 327
S + G ++ TS L + Y VG+E IG
Sbjct: 265 SFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKK 324
Query: 328 LTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVS-----SKRISLQGN 372
+ F +VDSG +FT LP +Y VV +FD V +K +
Sbjct: 325 IPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVE-DKT 383
Query: 373 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSF 410
CY + ++ +P + L F N+S VV +N+ + F
Sbjct: 384 GLGPCYYY--DTVVNIPSLVLHFVGNESSVVLPKKNYFYDF 422
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 83/354 (23%), Positives = 140/354 (39%), Gaps = 63/354 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ GTP + +D GS+++W PC + +S + + P SSSSK +
Sbjct: 71 LSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSKLLG 130
Query: 174 CSHPLCK-------------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
C +P C S SC + P PY+ Y + T G + + LHL S S
Sbjct: 131 CKNPKCSWIHHSNINCDQDCSIKSCLNQTCP-PYMIFYGSGTTG--GVALSETLHLHSLS 187
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
K + ++GC S P G+ G G G S+PS L S
Sbjct: 188 K--------PNFLVGC------SVFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHR 233
Query: 281 FDENDSGS---VFFGDQGPATQQSTS--FLPI--GEKYDA-------YFVGVESYCIGNS 326
FD++ S V +Q + +++ + + P K D Y++G+ +G
Sbjct: 234 FDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGH 293
Query: 327 CLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNS 373
+ ++DSG +FTF+ E + + +F + + R +
Sbjct: 294 HVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIG 353
Query: 374 WKYCYNASSEEMLKVPDMRLIF--SKNQSFVVRNHIFSFPENEVGDHACFSYFT 425
+ C+N S + + P++RL F + + V N+ F+F EV AC + T
Sbjct: 354 LRPCFNVSDAKTVSFPELRLYFKGGADVALPVENY-FAFVGGEV---ACLTVVT 403
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/293 (26%), Positives = 121/293 (41%), Gaps = 36/293 (12%)
Query: 84 NSSRNQLLFPSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--C 140
N + + L+FP GN + +Y + IG P + + +D GS+L W+ C C
Sbjct: 51 NRAGSSLVFP-------LHGNVYPAGYYNVTLSIGQPAKPYFLDVDTGSDLTWLQCDAPC 103
Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS--SCKSLKDP--CPYIA 196
QC + P S+ V C PLC S + +DP C Y
Sbjct: 104 RQC--------------IEAPHPLYRPSNNLVICEDPLCASLQPPGVHNCQDPDQCDYEV 149
Query: 197 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
+Y+ + SS G LV D+ L +F+ + + +GCG Q + DG++GL
Sbjct: 150 EYA-DGGSSLGVLVKDVFVL-NFTN---GKRLNPLLALGCGYDQLPGRSNHPL-DGILGL 203
Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY-DAYF 315
G G S+PS L+ GL+ N C G +FFG+ ++ P+ + Y
Sbjct: 204 GRGISSIPSQLSSQGLVSNVIGHCLSGRGGGFLFFGED-IYDSSGVTWTPMSRDHLKHYS 262
Query: 316 VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 368
G + DSG+S+T+L + Y +V + +S K IS
Sbjct: 263 PGFAELIFDGKSTGIRNLLVVFDSGSSYTYLNAQAYQHLVFSLKRELSRKPIS 315
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 85/317 (26%), Positives = 139/317 (43%), Gaps = 50/317 (15%)
Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CI-QCAPLSASYYTSLDRNLSEY 161
NQF+ I +GTP V LV +D GS + WV CQ CI C YT R +
Sbjct: 21 NQFFM----GISLGTPAVFNLVTIDTGSTISWVQCQYCIVHC-------YTQDQRAGPTF 69
Query: 162 DPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
+ SSSS+ + V CS +C S C +D C Y Y++ + S+GYL D L
Sbjct: 70 NTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEY-SAGYLSQDRL 128
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
LA+ S+Q I GCG + + +G + G++G G S + +A+
Sbjct: 129 TLAN------SYSIQ-KFIFGCG---SDNRYNGHSA-GIIGFGNKSYSFFNQIAQL-TNY 176
Query: 275 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 334
++FS CF N F GP + S + + + +D Y + Y + + +G +
Sbjct: 177 SAFSYCFPSNQENEGFL-SIGPYVRDSNKLI-LTQLFD-YGAHLPVYALQQFDMMVNGMR 233
Query: 335 ------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY--NA 380
+VDSG TF+ + ++ + K + ++ +S + C+ N
Sbjct: 234 LQVDPPVYTTRMTVVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKEICFHSNG 293
Query: 381 SSEEMLKVPDMRLIFSK 397
S + K+P + + FS+
Sbjct: 294 DSVDWSKLPVVEIKFSR 310
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 141/331 (42%), Gaps = 46/331 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++T + +G P F + LD GS++ W+ CQ C C Y D +DP++SS+
Sbjct: 20 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDC-------YQQTD---PIFDPTASST 69
Query: 169 SKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
V+C C S SSC+S + C Y +Y Y D A+ S S
Sbjct: 70 YAPVTCQSQQCSSLEMSSCRSGQ--CLYQVNY-----GDGSYTFGD---FATESVSFGNS 119
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+V +GCG G ++ A G+ G L SL + L SFS C DS
Sbjct: 120 GSVKNVALGCGHDNEGLFVGAAGLLGLGGGPL------SLTNQ--LKATSFSYCLVNRDS 171
Query: 287 G---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSGFQ------ 334
++ F T+ L K D Y+VG+ +G ++ +S F+
Sbjct: 172 AGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGN 231
Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
+VD G + T L T+ Y + F ++ + +++ + CY+ S + ++VP +
Sbjct: 232 GGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVS 291
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F+ +S+ + + P + G + CF++
Sbjct: 292 FHFADGKSWNLPAANYLIPVDSAGTY-CFAF 321
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 130/314 (41%), Gaps = 40/314 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP F++ D GS+L WV C S+S + + P+ S S
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPS----SSSSSPAASPPQRVFRPAGSKSW 159
Query: 170 KNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSKHA 223
+ C CKS ++C S DPC Y DY +D SS+ G + D ++
Sbjct: 160 SPLPCDSDTCKSYVPFSLANCSSPPDPCSY--DYRYKDNSSARGVVGLDSATVSLSGNDG 217
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-- 281
+ + V++GC G + DGV+ LG ++S S A FS C
Sbjct: 218 TRKAKLQEVVLGCTTSYDGQSFKSS--DGVLSLGNSNISFAS--RAASRFGGRFSYCLVD 273
Query: 282 ---DENDSGSVFFGDQGPATQQSTSF--LPIGEKYDA-----YFVGVESYCIGNSCLT-- 329
N + + FG+ + +S P+ DA YFV V++ + L
Sbjct: 274 HLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEIL 333
Query: 330 ------QSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASS 382
+ A++DSG S T L T Y VV K R+++ + ++YCYN +
Sbjct: 334 PDVWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNM--DPFEYCYNWTG 391
Query: 383 EEMLKVPDMRLIFS 396
++P M L F+
Sbjct: 392 VSA-EIPRMELRFA 404
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 134/342 (39%), Gaps = 66/342 (19%)
Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
P+ S + +D GS+L+W PC +C + + N++ S VSC P
Sbjct: 29 PSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITR--------SHRVSCQSPA 80
Query: 179 CKSRSSCKSLKDPCPY----IADYSTEDTSSSG-----YLVDDILHLASFSKHAPQSSVQ 229
C + S S D C + + T D SS+ Y D SF H + ++
Sbjct: 81 CSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGD----GSFIAHLHRDTLS 136
Query: 230 SSVI------IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSIC-- 280
S + GC A P GV G G G +S+P+ LA + + N FS C
Sbjct: 137 MSQLFLKNFTFGCAHTAL------AEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLV 190
Query: 281 ---FDE---NDSGSVFFGDQGPATQQSTSFL---PIGEKYDAYF--VGVESYCIGNSCL- 328
FD+ + G + + F+ + +YF VG+ +G +
Sbjct: 191 SHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTIL 250
Query: 329 ---------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRIS--LQGNSWK 375
+ +VDSG +FT LP +Y VV +FD+ V KR S +
Sbjct: 251 APEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTGLG 310
Query: 376 YCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSFPENE 414
CY E +++VP + F N S V+ N+ + F + E
Sbjct: 311 PCYFL--EGLVEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGE 350
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 135/320 (42%), Gaps = 52/320 (16%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP L+A+D ++ W+PC C C SA + +DP++S+S + V C
Sbjct: 118 LGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSA----------APFDPAASASYRTVPC 167
Query: 175 SHPLC--KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
PLC ++C C + Y+ D+S L D L +A + A
Sbjct: 168 GSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAVAGNAVKA--------Y 217
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGS 288
GC ++ TG+ A P G++GLG G +S L + + +FS C N SG+
Sbjct: 218 TFGCLQRATGT---AAPPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGT 272
Query: 289 VFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQSGFQALVDSGA 341
+ G G P ++T L + Y+V + +G + +G ++DSG
Sbjct: 273 LRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIPAFDPATGAGTVLDSGT 332
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
FT L Y V + + V + SL G + C+N ++ + P M L+F Q
Sbjct: 333 MFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA---VAWPPMTLLFDGMQ-- 385
Query: 402 VVRNHIFSFPENEVGDHACF 421
+ PE V H+ +
Sbjct: 386 ------VTLPEENVVIHSTY 399
>gi|389747274|gb|EIM88453.1| Asp-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 416
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 81/311 (26%), Positives = 126/311 (40%), Gaps = 64/311 (20%)
Query: 99 THFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL 158
T+F Q+Y T IDIGTP +F V LD GS+ LWVP QC ++ +T
Sbjct: 98 TNFMNAQYY----TEIDIGTPPQTFKVILDTGSSNLWVPSS--QCTSIACFLHT------ 145
Query: 159 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
+YD S+SSS K + S + G++ +D +
Sbjct: 146 -KYDSSASSSYKANGTEFSIQYGSGSME--------------------GFVSNDDIVFGD 184
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGL 272
S SSV K+ G DG++GL ++V + L G+
Sbjct: 185 MS--------LSSVDFAEATKEPGLAFAFGKFDGILGLAYDTIAVNHITPVFYELVNQGI 236
Query: 273 IQN---SFSICFDENDSGSVFFGDQGP-ATQQSTSFLPIGEKYDAYF-VGVESYCIGNSC 327
I SF + E+D G FG P A + P+ K AY+ V +E G+
Sbjct: 237 ISEPVFSFRLGSSEDDGGEAIFGGIDPSAYSGKIDYAPVRRK--AYWEVELEKVSFGDDD 294
Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
L A +D+G S LPT++ AE++ + + +K+ SW Y ++
Sbjct: 295 LELENTGAAIDTGTSLIALPTDV-AEML---NTQIGAKK------SWNGQYTVDCAKVPD 344
Query: 388 VPDMRLIFSKN 398
+PD+ F++
Sbjct: 345 LPDLTFYFNEK 355
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 135/332 (40%), Gaps = 80/332 (24%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ GTP+ +F LD GS L+W+PC C +C S N ++ P +SSSS
Sbjct: 90 LEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFS---------NTPKFIPKNSSSS 140
Query: 170 KNVSCSHPLC--------------KSRSSCKSLKDPCP-YIADYSTEDTSSSGYLVDDIL 214
K V C++P C + +++ + CP Y Y S++G+L+ + L
Sbjct: 141 KFVGCTNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLG--STAGFLLSENL 198
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
+ + S ++GC S + P G+ G G G+ S+PS + L +
Sbjct: 199 NFP--------TKKYSDFLLGC------SVVSVYQPAGIAGFGRGEESLPS---QMNLTR 241
Query: 275 NSFSICFDENDSGSVFFGDQGPATQQS----------TSFL--PIGEKYDA----YFVGV 318
S+ + + D + + T S T FL P +K A Y++ +
Sbjct: 242 FSYCLLSHQFDDSATITSNLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITL 301
Query: 319 ESYCIGNSCLT------------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 366
+ +G + GF +VDSG++FTF+ I+ V +F K VS R
Sbjct: 302 KRIVVGEKRVRVPRRLLEPNVDGDGGF--IVDSGSTFTFMERPIFDLVAQEFAKQVSYTR 359
Query: 367 ISLQGNSWKY--CYN-ASSEEMLKVPDMRLIF 395
+ C+ A E P++R F
Sbjct: 360 AREAEKQFGLSPCFVLAGGAETASFPELRFEF 391
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 125/301 (41%), Gaps = 54/301 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ GTP + LD GS++ W C+ C+ C S Y +D S+SS+
Sbjct: 132 VAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRY----------FDSSASSTYSFG 181
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
SC ++ Y Y + TS Y D + S V
Sbjct: 182 SCIPSTVENN-----------YNMTYGDDSTSVGNYGCDTMT--------LEPSDVFQKF 222
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFF 291
GCGR G + G+ DG++GLG G +S S A FS C E DS GS+ F
Sbjct: 223 QFGCGRNNKGDF--GSGVDGMLGLGQGQLSTVSQTASK--FNKVFSYCLPEEDSIGSLLF 278
Query: 292 GDQGPATQQSTSF----LPIG----EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVD 338
G++ AT QS+S L G ++ YFV + +GN L S F + ++D
Sbjct: 279 GEK--ATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIID 336
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLI 394
S T LP Y+ + F K ++ +S +G+ CYN S + + +P++ L
Sbjct: 337 SRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLH 396
Query: 395 F 395
F
Sbjct: 397 F 397
>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 429
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 76/298 (25%), Positives = 132/298 (44%), Gaps = 26/298 (8%)
Query: 79 LQSNNNSSRNQLLFPSEGSQT-HFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWV 136
L S SSR++LL P+ S +GN + Y ++IG P + + +D GS+L W+
Sbjct: 36 LPSEATSSRSRLLNPAGSSIVLPLYGNVYPVGFYNVTLNIGQPARPYFLDVDTGSDLTWL 95
Query: 137 PCQ--CIQCAPLSASYYTSLDRNLSEYDP--SSSSSSKNVSCSHPLCKSRSSCKSLKDPC 192
C C C+ Y + + DP +S +++ +C HP D C
Sbjct: 96 QCDAPCTHCSETPHPLYRPSNDFVPCRDPLCASLQPTEDYNCEHP------------DQC 143
Query: 193 PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDG 252
Y +Y+ + S+ G L++D+ +L +F+ ++ + +GCG Q S DG
Sbjct: 144 DYEINYA-DQYSTFGVLLNDV-YLLNFTNGV---QLKVRMALGCGYDQVFSPSSYHPLDG 198
Query: 253 VMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE-KY 311
++GLG G S+ S L GL++N C G +FFG+ + + ++ PI
Sbjct: 199 LLGLGRGKASLISQLNSQGLVRNVIGHCLSAQGGGYIFFGNAYDSAR--VTWTPISSVDS 256
Query: 312 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 369
Y G G A+ D+G+S+T+ + Y ++ K +S K + +
Sbjct: 257 KHYSAGPAELVFGGRKTGVGSLTAVFDTGSSYTYFNSHAYQALLSWLKKELSGKPLKV 314
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 144/338 (42%), Gaps = 58/338 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ I +G+P + V +D+GS+++WV C+ C QC Y D ++P+ SSS
Sbjct: 134 YFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQC-------YHQSD---PVFNPADSSS 183
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
VSC+ +C + + C Y Y + + + G L + L +F + ++
Sbjct: 184 YAGVSCASTVCSHVDNAGCHEGRCRYEVSYG-DGSYTKGTLALETL---TFGR-----TL 234
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLLAKAGLIQNSFSICFDEN--- 284
+V IGCG G ++ A G++GLG G +S V L +AG +FS C
Sbjct: 235 IRNVAIGCGHHNQGMFVGAA---GLLGLGSGPMSFVGQLGGQAG---GTFSYCLVSRGIQ 288
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDA---YF------------VGVESYCIGNSCLT 329
SG + FG + A +++P+ A Y+ V + S L
Sbjct: 289 SSGLLQFGRE--AVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELG 346
Query: 330 QSGFQALVDSGASFTFLPTEIYA----EVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
G ++D+G + T LPT Y + + L + +S+ + CY+
Sbjct: 347 DGG--VVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSI----FDTCYDLFGFVS 400
Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
++VP + FS + F P ++VG CF++
Sbjct: 401 VRVPTVSFYFSGGPILTLPARNFLIPVDDVGSF-CFAF 437
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 121/300 (40%), Gaps = 38/300 (12%)
Query: 119 PNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
P V L+ LD S++ WV PC QC Y D YDPS S SS++ +CS
Sbjct: 178 PGVRQLMLLDTASDVAWVQCFPCPASQC-------YAQTD---VLYDPSKSRSSESFACS 227
Query: 176 HPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
P C+ SS + C Y Y + +++SG LV D L L +P S V
Sbjct: 228 SPTCRQLGPYANGCSSSSNSAGQCQYRVRYP-DGSTTSGTLVADQLSL------SPTSQV 280
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDENDSG 287
GC GS+ + G+M LG G S+ S + K G + FS CF S
Sbjct: 281 -PKFEFGCSHAARGSF-SRSKTAGIMALGRGVQSLVSQTSTKYGQV---FSYCFPPTASH 335
Query: 288 SVFFGDQGPATQQST-SFLPIGEKYDAYFVGVESYCIGNSCL----TQSGFQALVDSGAS 342
FF P S + P+ + Y V +E+ + L T A +DS
Sbjct: 336 KGFFVLGVPRRSSSRYAVTPMLKTPMLYQVRLEAIAVAGQRLDVPPTVFAAGAALDSRTV 395
Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
T LP Y + F +S R + CY+ + + +P + L+F + + V
Sbjct: 396 ITRLPPTAYQALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGV 455
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 133/337 (39%), Gaps = 52/337 (15%)
Query: 68 NDWKRQKTRVKLQSNNNSSRNQLLFP-SEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVA 126
N + RV S+ +S P + G Q GN + + +GTP +
Sbjct: 61 NMASKDPARVTYLSSLVASPKATSVPIASGQQVLNIGN-----YVVRVKLGTPGQLMFMV 115
Query: 127 LDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS--S 184
LD + WVPC CA S+ + P++SS+ ++ CS P C S
Sbjct: 116 LDTSRDAAWVPCA--DCAGCSS----------PTFSPNTSSTYASLQCSVPQCTQVRGLS 163
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
C + + D+S S L D L LA S GC +GS
Sbjct: 164 CPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLA--------VDTLPSYSFGCVNAVSGST 215
Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDEND----SGSVFFGDQG-PAT 298
L P G++GLG G + SLL+++G L FS CF SGS+ G G P
Sbjct: 216 LP---PQGLLGLGRGPM---SLLSQSGSLYSGVFSYCFPSFKSYYFSGSLRLGPLGQPKN 269
Query: 299 QQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPT 348
++T L + Y+V + +G + +G ++DSG T
Sbjct: 270 IRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVITRFVE 329
Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
+YA + +F K V ++ ++ C+ A++E++
Sbjct: 330 PVYAAIRDEFRKQVKGPFATI--GAFDTCFAATNEDI 364
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 75/309 (24%), Positives = 130/309 (42%), Gaps = 47/309 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
+ GTP+V ++ +D GS++ WV PC +C P + +DPS SS+
Sbjct: 129 LGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYP----------QKDPLFDPSKSSTYA 178
Query: 171 NVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
++C C R+ C S C Y +Y + +S+ G ++ + AP
Sbjct: 179 PIACGADACNKLGDHYRNGCTSGGTQCGYRVEYG-DGSSTRGVYSNETITF------APG 231
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--E 283
+V+ GCG Q G DG++GLG S+ ++ A + +FS C
Sbjct: 232 ITVK-DFHFGCGHDQRGP---SDKFDGLLGLGGAPESL--VVQTASVYGGAFSYCLPALN 285
Query: 284 NDSGSVFFGDQGPATQQSTSF-------LPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ 334
+++G + G + A +++F LP+ +Y V + +G L +S F+
Sbjct: 286 SEAGFLALGVRPSAATNTSAFVFTPMWHLPMDAT--SYMVNMTGISVGGKPLDIPRSAFR 343
Query: 335 A--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
L+DSG T LP Y + K ++ + + + CYN + + VP +
Sbjct: 344 GGMLIDSGTIVTELPETAYNALNAALRKAFAAYPM-VASEDFDTCYNFTGYSNVTVPRVA 402
Query: 393 LIFSKNQSF 401
L FS +
Sbjct: 403 LTFSGGATI 411
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 128/305 (41%), Gaps = 55/305 (18%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLS-EYDPSSSSSSKN 171
I IG P + LV +D GS++LWV C C C D +L +DPS SS+
Sbjct: 105 ISIGQPPIPQLVVMDTGSDILWVMCTPCTNC-----------DNDLGLLFDPSKSSTFS- 152
Query: 172 VSCSHPLCKSRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
PLCK+ + + DP P+ Y+ T+S + D ++ F +S S
Sbjct: 153 -----PLCKTPCDFEGCRCDPIPFTVTYADNSTASGTFGRDTVV----FETTDEGTSRIS 203
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----ND 285
V+ GCG G D +G++GL G SL+ K G FS C +
Sbjct: 204 DVLFGCGH-NIGHDTD-PGHNGILGLNNGP---DSLVTKLG---QKFSYCIGNLADPYYN 255
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL----------TQSGFQ 334
+ G+ ST F E Y+ Y+V +E +G L
Sbjct: 256 YHQLILGEGADLEGYSTPF----EVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGG 311
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYC-YNASSEEMLKVPDM 391
++D+G++ TFL ++ + + L+ S ++ +++ + W C Y + S +++ P +
Sbjct: 312 VIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVV 371
Query: 392 RLIFS 396
FS
Sbjct: 372 TFHFS 376
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/356 (23%), Positives = 134/356 (37%), Gaps = 48/356 (13%)
Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
G + L+Y +G V +D S L WV QC P A + D+ +D
Sbjct: 105 GARLRTLNYVAT-VGIGGGEATVIVDTASELTWV-----QCEPCDACH----DQQEPLFD 154
Query: 163 PSSSSSSKNVSCSHPLCK--------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
PSSS S V C+ C S +C C Y Y + + S G L D L
Sbjct: 155 PSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYR-DGSYSRGVLAHDRL 213
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKAGLI 273
LA +Q + GCG G + G+MGLG +S+ S + + G +
Sbjct: 214 SLAG-------EDIQG-FVFGCGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQFGGV 262
Query: 274 QNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGN 325
FS C + SGS+ GD + ST + D Y + +G
Sbjct: 263 ---FSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGG 319
Query: 326 SCLTQSGF------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
+ GF +A+VDSG T L +YA V +F ++ + + C++
Sbjct: 320 EDVQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFD 379
Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGILI 435
+ ++VP ++L+F V + + C + +L+ + +I
Sbjct: 380 LTGLREVQVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPII 435
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 75/321 (23%), Positives = 121/321 (37%), Gaps = 52/321 (16%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ +GTP F + +D GS+L +V QCAP Y +++ Y PS+SS+
Sbjct: 34 YFVDFSLGTPEQKFHLIVDTGSDLAFV-----QCAPCDLCY----EQDGPLYQPSNSSTF 84
Query: 170 KNVSCSHPLC-----KSRSSCKS------LKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
V C C + C S + C Y Y +++S+ G + +
Sbjct: 85 TPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYG-DNSSTVGVFAYETATVGG 143
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 278
+ V GCG + GS++ GV+GLG G +S S A +N F+
Sbjct: 144 IRVN--------HVAFGCGNRNQGSFVSAG---GVLGLGQGALSFTSQAGYA--FENKFA 190
Query: 279 ICFDENDS-----GSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIGNSCL-- 328
C S S+ FGD +T F P+ Y+V + C G L
Sbjct: 191 YCLTSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLI 250
Query: 329 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
+ + DSG + T+ + YA ++ F+K V R C N
Sbjct: 251 PDSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNV 310
Query: 381 SSEEMLKVPDMRLIFSKNQSF 401
S + P + F + ++
Sbjct: 311 SGIDHPIYPSFTIEFDQGATY 331
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 81/311 (26%), Positives = 132/311 (42%), Gaps = 51/311 (16%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP F + +D GS+L W+ C C+ C ++ +DP++SSS +NV+C
Sbjct: 155 VGTPPRRFRMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNVTC 204
Query: 175 SHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH--APQ 225
C R+ + +D CPY Y + ++ L L SF+ + AP
Sbjct: 205 GDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGD------LALESFTVNLTAPG 258
Query: 226 SSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+S + V+ GCG + G + A G+ L S L A G ++FS C E+
Sbjct: 259 ASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSYCLVEH 313
Query: 285 --DSGS-VFFGDQ----GPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQSGFQ-- 334
D+GS V FG+ + T+F P D Y+V ++ +G L S
Sbjct: 314 GSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWD 373
Query: 335 --------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-NSWKYCYNASSEEM 385
++DSG + ++ Y + F L+S + CYN S E
Sbjct: 374 VGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLNPCYNVSGVER 433
Query: 386 LKVPDMRLIFS 396
+VP++ L+F+
Sbjct: 434 PEVPELSLLFA 444
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 80/284 (28%), Positives = 119/284 (41%), Gaps = 43/284 (15%)
Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
GN + +Y+ + IG P ++ + +D GS+L WV C C C +L R+
Sbjct: 40 GNVYPLGYYSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCKGC---------TLPRD-R 89
Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
+Y P + V C PLC + S C + + C Y +Y+ + SS G LV DI+
Sbjct: 90 QYKPHGNL----VKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYA-DQGSSLGVLVRDII 144
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
L K + S + GCG QT + + GV+GLG G S+ S L GLI
Sbjct: 145 PL----KLTNGTLTHSMLAFGCGYDQTHVGHNPPPSAAGVLGLGNGRASILSQLNSKGLI 200
Query: 274 QNSFSICFDENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
+N C G +FFGDQ P Q S+S L Y G
Sbjct: 201 RNVVGHCLSGTGGGFLFFGDQLIPQSGVVWTPILQSSSSLL------KHYKTGPADMFFN 254
Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 368
+ G + DSG+S+T+ + + +V + K +S
Sbjct: 255 GKATSVKGLELTFDSGSSYTYFNSLAHKALVDLITNDIKGKPLS 298
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 81/382 (21%), Positives = 151/382 (39%), Gaps = 58/382 (15%)
Query: 51 DSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
DS E LE + +R + R++ N S ++ +G Y ++
Sbjct: 49 DSGKNLTKFELLERAVERGSRRLQ-RLEAMLNGPSGVETPVYAGDGE---------YLMN 98
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ IGTP F +D GS+L+W CQ C QC +++ ++P SSS
Sbjct: 99 ---LSIGTPAQPFSAIMDTGSDLIWTQCQPCTQC----------FNQSTPIFNPQGSSSF 145
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+ CS LC++ S + C Y Y + + + G + + L S S
Sbjct: 146 STLPCSSQLCQALQSPTCSNNSCQYTYGYG-DGSETQGSMGTETLTFGSVSI-------- 196
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDS 286
++ GCG G A G++G+G G +S+PS L FS C ++S
Sbjct: 197 PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSNS 249
Query: 287 GSVFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGF 333
++ G + A +T+ + + Y++ + +G++ L + +G
Sbjct: 250 STLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT 309
Query: 334 QA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM-LKVPDM 391
++DSG + T+ Y V F ++ ++ + + C+ S++ L++P
Sbjct: 310 GGIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTF 369
Query: 392 RLIFSKNQSFVVRNHIFSFPEN 413
+ F + + F P N
Sbjct: 370 VMHFDGGDLVLPSENYFISPSN 391
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 98/398 (24%), Positives = 162/398 (40%), Gaps = 68/398 (17%)
Query: 25 SSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNN 84
S L+HR D R + + + + VEYL+ LS + ++ S
Sbjct: 70 SLALLHR--DAVSGRTYPSTRHAMLGLAARDGARVEYLQRRLSPTTMTTEVGSEVVSGI- 126
Query: 85 SSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQC 143
SEGS +F + +G+P + +D+GS+++W+ C+ C +C
Sbjct: 127 ---------SEGSGEYF----------VRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAEC 167
Query: 144 APLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYS 199
Y D +DP++S+S V C +C++ S C C Y Y
Sbjct: 168 -------YQQAD---PLFDPAASASFTAVPCDSGVCRTLPGGSSGCAD-SGACRYQVSYG 216
Query: 200 TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 259
+ + + G L + L +F P VQ V IGCG + G ++ A G++GLG G
Sbjct: 217 -DGSYTQGVLAMETL---TFGDSTP---VQ-GVAIGCGHRNRGLFVGAA---GLLGLGWG 265
Query: 260 DVSVPSLLAKAGLIQNSFSICF----DENDSGSVFFG--DQGPATQQSTSFLPIGEKYDA 313
+S+ L A +FS C + +GS+ FG D P L ++
Sbjct: 266 PMSLVGQLGGA--AGGAFSYCLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSF 323
Query: 314 YFVGVESYCI---------GNSCLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVS 363
Y+VG+ + G LT+ G +V D+G + T LP + YA + F +
Sbjct: 324 YYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTIG 383
Query: 364 SKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQS 400
G S CY+ S ++VP + L F ++ +
Sbjct: 384 GDLPRAPGVSLLDTCYDLSGYASVRVPTVALYFGRDGA 421
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 152/391 (38%), Gaps = 55/391 (14%)
Query: 70 WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
W K ++ + + +S L P + +G+ Y++ + +GTP S + +D
Sbjct: 19 WIESKAKLAGKKKDEASSTDLNGPV--TSGLLYGSGEYFVR---LGLGTPARSLFMVVDT 73
Query: 130 GSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRS 183
GS+L W+ CQ C C Y D +DP +SSS + + C PLCK S S
Sbjct: 74 GSDLPWLQCQPCKSC-------YKQAD---PIFDPRNSSSFQRIPCLSPLCKALEVHSCS 123
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
+ C Y Y + + S G D+ L + SK SV GCG G
Sbjct: 124 GSRGATSRCSYQVAYG-DGSFSVGDFSSDLFTLGTGSKAM-------SVAFGCGFDNEGL 175
Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE------NDSGSVFFGDQGPA 297
+ A G+ L S + NSFS C + S S+ FG
Sbjct: 176 FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVAAIP 235
Query: 298 TQQSTSFLPIGEKYDA-YFVGVESYCIGNS---------CLTQSGFQA-LVDSGASFTFL 346
+ + S L K D Y+ + +G + L+QSG ++DSG S T
Sbjct: 236 STAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRF 295
Query: 347 PTEIYAEVVVKFD----KLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV 402
PT +YA + F L S+ R SL + CYN S + + VP + L F
Sbjct: 296 PTSVYATIRDAFRNATINLPSAPRYSL----FDTCYNFSGKASVDVPALVLHFENGADLQ 351
Query: 403 VRNHIFSFPENEVGDHA-CFSYFTLEYNFTG 432
+ + P N G F+ ++E G
Sbjct: 352 LPPTNYLIPINTAGSFCLAFAPTSMELGIIG 382
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 82/382 (21%), Positives = 150/382 (39%), Gaps = 58/382 (15%)
Query: 51 DSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLH 110
DS E LE + +R + R++ N S ++ +G Y ++
Sbjct: 49 DSGKNLTKFELLERAVERGSRRLQ-RLEAMLNGPSGVETPVYAGDGE---------YLMN 98
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ IGTP F +D GS+L+W CQ C QC +++ ++P SSS
Sbjct: 99 ---LSIGTPAQPFSAIMDTGSDLIWTQCQPCTQC----------FNQSTPIFNPQGSSSF 145
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+ CS LC++ S + C Y Y + + + G + + L S S
Sbjct: 146 STLPCSSQLCQALQSPTCSNNSCQYTYGYG-DGSETQGSMGTETLTFGSVSI-------- 196
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDS 286
++ GCG G A G++G+G G +S+PS L FS C + S
Sbjct: 197 PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSTS 249
Query: 287 GSVFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ------- 334
++ G + A +T+ + + Y++ + +G++ L S F+
Sbjct: 250 STLLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT 309
Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM-LKVPDM 391
++DSG + T+ Y V F ++ ++ + + C+ S++ L++P
Sbjct: 310 GGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTF 369
Query: 392 RLIFSKNQSFVVRNHIFSFPEN 413
+ F + + F P N
Sbjct: 370 VMHFDGGDLVLPSENYFISPSN 391
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 103/421 (24%), Positives = 155/421 (36%), Gaps = 73/421 (17%)
Query: 14 ILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQ 73
++ SD S LVHR A + G S+A+ L D R
Sbjct: 7 LMTSSSDPNRASVPLVHRHGPCAPS--AASGGKPSLAER-------------LRRDRART 51
Query: 74 KTRVKLQSNNNSSRNQLLFPSEGSQT--HFFGNQFYWLHYT-WIDIGTPNVSFLVALDAG 130
V + ++ L + G + F G+ L Y + IGTP V V +D G
Sbjct: 52 NYIVTKATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTG 111
Query: 131 SNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------ 184
S+L WV QC P A + L +DPSSSSS +V C C+ ++
Sbjct: 112 SDLSWV-----QCKPCGAGECYAQKDPL--FDPSSSSSYASVPCDSDACRKLAAGAYGHG 164
Query: 185 CKSLKDP----CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII-----G 235
C + C Y +Y T++ Y + + +++ V++ G
Sbjct: 165 CTGVSGGAAALCEYGIEYGNRATTTGVYSTETL-------------TLKPGVVVADFGFG 211
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 295
CG Q G Y DG++GLG S+ S + FS C G+ F
Sbjct: 212 CGDHQHGPYEKF---DGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPTSGGAGFLTLGA 266
Query: 296 PATQQST------SFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA--LVDSGAS 342
P S+ SF P+ Y V + +G + L S F + ++DSG
Sbjct: 267 PPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMVIDSGTV 326
Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
T LP YA + F +S R+ G CY+ + + VP + L FS +
Sbjct: 327 ITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTVPTISLTFSGGAT 386
Query: 401 F 401
Sbjct: 387 I 387
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 110/272 (40%), Gaps = 34/272 (12%)
Query: 93 PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
PS GN + H+ ++IG P + + +D GS L W+ C CI C +
Sbjct: 20 PSSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHG 79
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCK-SLKDPCPYIADYSTEDT 203
Y V C+ C R K K+ C Y Y
Sbjct: 80 LY-------------KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GG 124
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVS 262
SS G L+ D SFS A + +S+ GCG Q + + P +G++GLG G V+
Sbjct: 125 SSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVT 179
Query: 263 VPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY--FVGVE 319
+ S L G+I ++ C G +FFGD T T + P+ ++ Y G
Sbjct: 180 LLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTL 238
Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIY 351
+ + ++ + + + DSGA++T+ + Y
Sbjct: 239 QFNSNSKPISAAPMEVIFDSGATYTYFALQPY 270
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 75/312 (24%), Positives = 130/312 (41%), Gaps = 46/312 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+GTP F + LD GS+L W+ QC+ C Y +N + YDP +S+S KN++C+
Sbjct: 168 VGTPPKHFSLILDTGSDLNWL--QCLPC-------YDCFHQNEAFYDPKTSASFKNITCN 218
Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSV 228
P C SS CKS CPY Y ++ + V+ ++L + + + V
Sbjct: 219 DPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKV 278
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DE 283
+ +++ GCG G + + G+ L S L +SFS C D
Sbjct: 279 E-NMMFGCGHWNRGLFSGASGLLGLGRGPLSFSS-----QLQSLYGHSFSYCLVDRNSDT 332
Query: 284 NDSGSVFFGDQGPATQQS----TSFLPIGEK--YDAYFVGVESYCIGNSCL--------- 328
N S + FG+ + TSF+ E Y++ ++S +G L
Sbjct: 333 NVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNI 392
Query: 329 -TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS--SEE 384
++DSG + ++ Y + KF +K+ + + C+N S E
Sbjct: 393 SPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEEN 452
Query: 385 MLKVPDMRLIFS 396
+ +P++ + F+
Sbjct: 453 NIHLPELGIAFA 464
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 88/340 (25%), Positives = 135/340 (39%), Gaps = 52/340 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP +S +D GS+L+W C C C+ S SSSS+ V C
Sbjct: 48 IGTPALSLSAIMDTGSDLVWTKCNPCTDCSTSSIYDP------------SSSSTYSKVLC 95
Query: 175 SHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
LC+ S SC + D C Y+ Y + +S+SG L D+ ++S S ++
Sbjct: 96 QSSLCQPPSIFSCNNDGD-CEYVYPYG-DRSSTSGILSDETFSISSQSL--------PNI 145
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSGS 288
GCG G D G++G G G +S+ S L + + N FS C D + +
Sbjct: 146 TFGCGHDNQG--FDKVG--GLVGFGRGSLSLVSQLGPS--MGNKFSYCLVSRTDSSKTSP 199
Query: 289 VFFGDQG--PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--------QSGFQA--L 336
+F G+ AT ++ L + Y++ +E +G L QS +
Sbjct: 200 LFIGNTASLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLI 259
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
+DSG + TFL Y V + +VSS + C+N P M F
Sbjct: 260 IDSGTTLTFLQQTAYDAVK---EAMVSSINLPQADGQLDLCFNQQGSSNPGFPSMTFHF- 315
Query: 397 KNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGILIL 436
K + V + FP++ D C + N + I
Sbjct: 316 KGADYDVPKENYLFPDS-TSDIVCLAMMPTNSNLGNMAIF 354
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 88/351 (25%), Positives = 148/351 (42%), Gaps = 46/351 (13%)
Query: 66 LSNDWKRQKTRVKLQSNNNSS-RNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
L+ D R K+ L + S+ R + P S Q ++T + +GTP
Sbjct: 102 LARDASRVKSLTSLAAAVGSTNRTRARGPGFSSSVTSGLAQGSGEYFTRLGVGTPARYVF 161
Query: 125 VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
+ LD GS+++W IQCAP Y + ++P+ S S N+ C PLC+ S
Sbjct: 162 MVLDTGSDVVW-----IQCAPCKKCY----SQTDPVFNPTKSRSFANIPCGSPLCRRLDS 212
Query: 185 --CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
C + K C Y Y + + + G + L + + V +GCG G
Sbjct: 213 PGCSTKKHICLYQVSYG-DGSFTYGEFSTETLTF--------RGTRVGRVALGCGHDNEG 263
Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS----VFFGDQGPAT 298
++ A ++GLG G +S PS + + FS C + + S + FGD A
Sbjct: 264 LFIGAAG---LLGLGRGRLSFPSQIGRR--FSRKFSYCLVDRSASSKPSYMVFGDS--AI 316
Query: 299 QQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ--------ALVDSGASFT 344
++ F P+ K D Y+V + +G + +T S F+ ++DSG S T
Sbjct: 317 SRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVT 376
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
L Y + F S+ + + + + + C++ S + +KVP + L F
Sbjct: 377 RLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHF 427
>gi|217073142|gb|ACJ84930.1| unknown [Medicago truncatula]
Length = 191
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 68/127 (53%), Gaps = 14/127 (11%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
L++T + +G+P + V +D GS++LWV C++C+ +D L+ YDP S +
Sbjct: 69 LYFTKLGLGSPKKDYYVQVDTGSDILWV--NCVECSRCPTKSQIGMD--LTLYDPKGSHT 124
Query: 169 SKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH- 222
S+ +SC H C S C++ + PCPY Y + ++++GY V D L + +
Sbjct: 125 SELISCDHEFCSSTYDGPIPGCRA-ETPCPYSITYG-DGSATTGYYVRDYLTFDRINGNL 182
Query: 223 --APQSS 227
APQ+S
Sbjct: 183 HTAPQNS 189
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 134/330 (40%), Gaps = 78/330 (23%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IGTP + LD GS L W+ QC P +AS+ DPS SSS +
Sbjct: 92 LPIGTPPQPQQMVLDTGSQLSWI--QCHNKTPPTASF-----------DPSLSSSFYVLP 138
Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
C+HPLCK R +L C + + + + T + G LV + L +FS S
Sbjct: 139 CTHPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKL---AFSP----SQT 191
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--- 285
+I+GC + + G++G+ LG +S P AK FS C
Sbjct: 192 TPPLILGCSSESRDA-------RGILGMNLGRLSFP-FQAKV----TKFSYCVPTRQPAN 239
Query: 286 -----SGSVFFGDQG-------------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
+GS + G+ P +Q+ + P+ AY V ++ IG
Sbjct: 240 NNNFPTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNLDPL-----AYTVPMQGIRIGGRK 294
Query: 328 LT-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSW 374
L SG Q +VDSG+ FTFL Y V + +++ K+ + G
Sbjct: 295 LNIPPSVFRPNAGGSG-QTMVDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVA 353
Query: 375 KYCYNASSEEMLK-VPDMRLIFSKNQSFVV 403
C++ ++ E+ + + D+ F K VV
Sbjct: 354 DMCFDGNAMEIGRLLGDVAFEFEKGVEIVV 383
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 121/286 (42%), Gaps = 54/286 (18%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLS-EYDPSSSSSSKN 171
+ IG P++ LV +D GS++LW+ C C C D +L +DPS SS+
Sbjct: 105 LSIGQPSIPQLVVMDTGSDILWIMCNPCTNC-----------DNHLGLLFDPSMSSTFS- 152
Query: 172 VSCSHPLCKSRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
PLCK+ K K DP P+ Y +++S+SG DIL F +S S
Sbjct: 153 -----PLCKTPCGFKGCKCDPIPFTISY-VDNSSASGTFGRDIL---VFETTDEGTSQIS 203
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----ND 285
VIIGCG + +G++GL G P+ LA I FS C +
Sbjct: 204 DVIIGCGHNI--GFNSDPGYNGILGLNNG----PNSLATQ--IGRKFSYCIGNLADPYYN 255
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL---------TQSGFQA 335
+ G+ ST F E Y Y+V +E +G L ++G
Sbjct: 256 YNQLRLGEGADLEGYSTPF----EVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGG 311
Query: 336 LV-DSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCY 378
++ DSG + T+L + + + L+ S +++ + WK CY
Sbjct: 312 VILDSGTTITYLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCY 357
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 92/351 (26%), Positives = 134/351 (38%), Gaps = 46/351 (13%)
Query: 70 WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYT-WIDIGTPNVSFLVALD 128
+ + R S R + S + G L Y + IGTP V V +D
Sbjct: 84 LRSDRARADHILRKASGRRMMSEGGGASIPTYLGGFVDSLEYVVTLGIGTPAVQQTVLID 143
Query: 129 AGSNLLWVPCQCIQCAPLSAS-YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS------ 181
GS+L WV QC P +AS Y D +DPS SS+ + C+ CK
Sbjct: 144 TGSDLSWV-----QCKPCNASDCYPQKD---PLFDPSKSSTFATIPCASDACKQLPVDGY 195
Query: 182 ----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
++ + C Y +Y + G + L L S S+V S GCG
Sbjct: 196 DNGCTNNTSGMPPQCGYAIEYG-NGAITEGVYSTETLALGS-------SAVVKSFRFGCG 247
Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGP- 296
Q G Y DG++GLG S+ S A + +FS C +SG+ F P
Sbjct: 248 SDQHGPYDKF---DGLLGLGGAPESLVSQTAS--VYGGAFSYCLPPLNSGAGFLTLGAPN 302
Query: 297 ATQQSTS---FLPI----GEKYDAYFVGVESYCIGNSCL--TQSGFQA--LVDSGASFTF 345
+T S S F P+ + Y V + +G L + F +VDSG T
Sbjct: 303 STNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAKGNIVDSGTVITG 362
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIF 395
+PT Y + F ++ + +S CYN + + VP + L F
Sbjct: 363 IPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTVTVPKVALTF 413
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 118/294 (40%), Gaps = 67/294 (22%)
Query: 103 GNQFYWLHYTWIDIGTPNVSFLVA-LDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
G+ Y +H + IGTP +V LD GS+L+W C C C D+ + +
Sbjct: 90 GSSEYLIH---LGIGTPRPQRVVLHLDTGSDLVWTQCACTVC----------FDQPVPVF 136
Query: 162 DPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
S S + V CS PLC S C + C Y Y + + ++G + +D
Sbjct: 137 RASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGY-MDHSITTGKMAED---- 191
Query: 217 ASFSKHAPQSSVQSSVI----IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 272
+F+ AP + ++ + GCG G + + G+ G G G +S+PS L
Sbjct: 192 -TFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQS--GIAGFGTGPLSLPSQLKV--- 245
Query: 273 IQNSFSICF---DENDSGSVFFGDQ---------GPATQQSTSF------LPIGEKYDAY 314
FS CF +E+ V G + GP QST F P+G + Y
Sbjct: 246 --RRFSYCFTAMEESRVSPVILGGEPENIEAHATGPI--QSTPFAPGPAGAPVGSQ-PFY 300
Query: 315 FVGVESYCIGNSCL--TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKF 358
F+ + +G + L S F +DSG + TF P ++ + F
Sbjct: 301 FLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAF 354
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 72/292 (24%), Positives = 120/292 (41%), Gaps = 33/292 (11%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV--- 172
+GTP +S +ALD GS++ W QC P S Y +++DP SSS KNV
Sbjct: 51 LGTPKLSLSLALDTGSDITWT-----QCEPCVGSCYRQAQ---TKFDPRKSSSYKNVSCS 102
Query: 173 -SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
S + S + + C Y Y + + S G+ + L ++ S V S+
Sbjct: 103 SSSCRIITDSGGARGCVSSTCIYKVQYG-DGSYSVGFFATEKLTISP-------SDVISN 154
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGS 288
+ GCG++ G + A G+ + L + N F+ C F + +G
Sbjct: 155 FLFGCGQQNAGRFGRIAGLLGLG-----RGKLSLALQTSEKYNNLFTYCLPSFSSSSTGH 209
Query: 289 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----TQSGFQALVDSGASF 343
+ G Q P + + T P + Y + ++ +G L S A++DSG
Sbjct: 210 LTLGGQVPKSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVFSNAGAIIDSGTVI 269
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T L +Y+ + KF +L+ + + CY+ S E + VP + F
Sbjct: 270 TRLQPTVYSALSSKFQQLMKDYPKTDGFSILDTCYDFSGNESISVPRISFFF 321
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 103/416 (24%), Positives = 153/416 (36%), Gaps = 73/416 (17%)
Query: 19 SDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVK 78
SD S LVHR A + G S+A+ L D R V
Sbjct: 92 SDPNRASVPLVHRHGPCAPS--AASGGKPSLAER-------------LRRDRARTNYIVT 136
Query: 79 LQSNNNSSRNQLLFPSEGSQT--HFFGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLW 135
+ ++ L + G + F G+ L Y + IGTP V V +D GS+L W
Sbjct: 137 KATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSW 196
Query: 136 VPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLK 189
V QC P A + L +DPSSSSS +V C C+ ++ C +
Sbjct: 197 V-----QCKPCGAGECYAQKDPL--FDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVS 249
Query: 190 DP----CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII-----GCGRKQ 240
C Y +Y T++ Y + + +++ V++ GCG Q
Sbjct: 250 GGAAALCEYGIEYGNRATTTGVYSTETL-------------TLKPGVVVADFGFGCGDHQ 296
Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 300
G Y DG++GLG S+ S + FS C G+ F P
Sbjct: 297 HGPYEKF---DGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPTSGGAGFLTLGAPPNSS 351
Query: 301 ST------SFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA--LVDSGASFTFLP 347
S+ SF P+ Y V + +G + L S F + ++DSG T LP
Sbjct: 352 SSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMVIDSGTVITGLP 411
Query: 348 TEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
YA + F +S R+ G CY+ + + VP + L FS +
Sbjct: 412 ATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTVPTISLTFSGGATI 467
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 141/327 (43%), Gaps = 59/327 (18%)
Query: 71 KRQKTRVK------LQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
KR K+R++ L ++ S +QL P GN Y + + IGTP VS+
Sbjct: 72 KRGKSRLQRLNAMVLAASTLDSEDQLEAPIHA------GNGEYLME---LAIGTPPVSYP 122
Query: 125 VALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
LD GS+L+W C+ C QC + +DP SSS VSC LC +
Sbjct: 123 AVLDTGSDLIWTQCKPCTQC----------YKQPTPIFDPKKSSSFSKVSCGSSLCSAVP 172
Query: 184 SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 243
S + D C Y+ Y + + + G L + +F K + SV ++ GCG G
Sbjct: 173 S-STCSDGCEYVYSYG-DYSMTQGVLATETF---TFGKSKNKVSVH-NIGFGCGEDNEGD 226
Query: 244 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQ 300
+ A+ G++GLG G +S+ S L + FS C D+ + G G
Sbjct: 227 GFEQAS--GLVGLGRGPLSLVSQLK-----EPRFSYCLTPMDDTKESILLLGSLGKVKDA 279
Query: 301 ----STSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFL 346
+T L + Y++ +E +G++ L+ +S F+ ++DSG + T++
Sbjct: 280 KEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYI 339
Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNS 373
+ + + +F +S ++ L S
Sbjct: 340 EQKAFEALKKEF---ISQTKLPLDKTS 363
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 88/335 (26%), Positives = 132/335 (39%), Gaps = 66/335 (19%)
Query: 127 LDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCK 186
+D GS+L+W PC +C Y T+ LS P + +SS +VSC P C + +
Sbjct: 91 MDTGSDLVWFPCAPFECILCEGKYDTAATGGLS---PPNITSSASVSCKSPACSAAHTSL 147
Query: 187 SLKDPCPY----IADYSTEDTSS-----------SGYLVDDILHLASFSKHAPQSSVQSS 231
S D C + T D SS G LV L+ S S A V +
Sbjct: 148 SSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVAR-LYRDSLSMPASSPLVLHN 206
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSIC-----FD--- 282
GC G P GV G G G +S+P+ LA + + N FS C FD
Sbjct: 207 FTFGCAHTALGE------PVGVAGFGRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDADR 260
Query: 283 --------------ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 328
+++ D+G T+ L + Y VG+E +GN +
Sbjct: 261 VRRPSPLILGRYSLDDEKKKRVGHDRGEFVY--TAMLDNPKHPYFYCVGLEGITVGNRKI 318
Query: 329 ----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISL--QGNSW 374
+ +VDSG +FT LP +Y +V +F+ + KR + +
Sbjct: 319 PVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGL 378
Query: 375 KYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIF 408
CY S + KVP + L F N + ++ RN+ +
Sbjct: 379 GPCYY-SDDSAAKVPAVALHFVGNSTVILPRNNYY 412
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 82/308 (26%), Positives = 129/308 (41%), Gaps = 49/308 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T I +GTP + LD GS+++W IQCAP Y S +DP S S
Sbjct: 126 YFTRIGVGTPPRYVYMVLDTGSDIVW-----IQCAPCKRCYAQS----DPVFDPRKSRSF 176
Query: 170 KNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP--Q 225
+++C PLC S C + K C Y Y D FS +
Sbjct: 177 ASIACRSPLCHRLDSPGCNTQKQTCMYQVSYG-----------DGSFTFGDFSTETLTFR 225
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
+ + V +GCG G ++ A ++GLG G +S PS + + FS C +
Sbjct: 226 RTRVARVALGCGHDNEGLFVGAAG---LLGLGRGRLSFPSQTGRR--FNHKFSYCLVDRS 280
Query: 286 S----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ- 334
+ S+ FGD A ++ F P+ K D Y+V + +G + +T S F+
Sbjct: 281 ASSKPSSMVFGDS--AVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKL 338
Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
++DSG S T L Y F S+ + + Q + + C++ S + +K
Sbjct: 339 DQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVK 398
Query: 388 VPDMRLIF 395
VP + L F
Sbjct: 399 VPTVVLHF 406
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 91/331 (27%), Positives = 145/331 (43%), Gaps = 48/331 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+++ + IG P + LD GS++ WV QCAP A Y D ++P+SS+S
Sbjct: 149 YFSRVGIGKPPSQAYLILDTGSDVNWV-----QCAPC-ADCYQQAD---PIFEPASSASF 199
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+SC+ C+S + D C Y Y + + + G V + + L S AP +V
Sbjct: 200 STLSCNTRQCRSLDVSECRNDTCLYEVSYG-DGSYTVGDFVTETITLGS----APVDNVA 254
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS- 288
IGCG G ++ A ++GLG G +S PS + SFS C + DS S
Sbjct: 255 ----IGCGHNNEGLFVGAAG---LLGLGGGSLSFPSQINAT-----SFSYCLVDRDSESA 302
Query: 289 --VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------AL 336
+ F P S L Y+VG+ +G ++ +S FQ +
Sbjct: 303 STLEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVI 362
Query: 337 VDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
VDSG + T L T++Y + F K L S+ I+L + CY+ SS+ ++VP +
Sbjct: 363 VDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIAL----FDTCYDLSSKGNVEVPTVS 418
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F + + + P + G CF++
Sbjct: 419 FHFPDGKELPLPAKNYLVPLDSEGTF-CFAF 448
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 148/331 (44%), Gaps = 48/331 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+++ + IG P + LD GS++ WV QCAP + Y ++ ++P+SS+S
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWV-----QCAPCAECY----EQTDPXFEPTSSASF 201
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
++SC CKS + C Y Y + + + G V + + L S S
Sbjct: 202 TSLSCETEQCKSLDVSECRNGTCLYEVSYG-DGSYTVGDFVTETVTLGSTSL-------- 252
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
++ IGCG G ++ A ++GLG G +S PS L + SFS C + DS S
Sbjct: 253 GNIAIGCGHNNEGLFIGAAG---LLGLGGGSLSFPSQLNAS-----SFSYCLVDRDSDST 304
Query: 290 FFGD-QGPATQQS-TSFLPIGEKYDAYF-VGVESYCIGNSCL--TQSGFQA--------L 336
D P T + T+ L D +F +G+ +G + L ++ FQ +
Sbjct: 305 STLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGII 364
Query: 337 VDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
VDSG + T L T +Y + F K L +++ ++L + CY+ SS+ ++VP +
Sbjct: 365 VDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVAL----FDTCYDLSSKSRVEVPTVS 420
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F+ + + P + G CF++
Sbjct: 421 FHFANGNELPLPAKNYLIPVDSEGTF-CFAF 450
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 82/315 (26%), Positives = 136/315 (43%), Gaps = 54/315 (17%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP F + +D GS+L W+ C C+ C ++ +DP++SSS +N++C
Sbjct: 152 VGTPPRRFQMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNLTC 201
Query: 175 SHPLC--------KSRSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH--A 223
P C + +C+ +DPCPY Y + S+ L L SF+ + A
Sbjct: 202 GDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGD------LALESFTVNLTA 255
Query: 224 PQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
P +S + V+ GCG + G + A ++GLG G +S S L +A ++FS C
Sbjct: 256 PGASSRVDGVVFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL-RAVYGGHTFSYCLV 311
Query: 283 ENDS---GSVFFGDQGPATQQS------TSFLPIGEKYDA-YFVGVESYCIGNSCLTQS- 331
++ S V FG+ + T+F P D Y+V + +G L S
Sbjct: 312 DHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISS 371
Query: 332 ---------GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS 381
++DSG + ++ Y + F D++ S CYN S
Sbjct: 372 DTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVS 431
Query: 382 SEEMLKVPDMRLIFS 396
E +VP++ L+F+
Sbjct: 432 GVERPEVPELSLLFA 446
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 79/333 (23%), Positives = 137/333 (41%), Gaps = 44/333 (13%)
Query: 101 FFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ----------CAPLSASY 150
F+G+ F +L +++GTP V FL D GS+L+W+ C Q ++S
Sbjct: 76 FYGD-FEYL--AAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSP 132
Query: 151 YTSLDRNLSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSG 207
+ ++P SSS V C P C + +SC C + Y + S++G
Sbjct: 133 PPPPPEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYR-DGASATG 191
Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
L D + S+ +S+ GC G DG++GLG G +S+ S L
Sbjct: 192 LLAADTFTFGGNINNDTTST--ASIDFGCATGTAGREFQA---DGMVGLGAGPLSLASQL 246
Query: 268 AKAGLIQNSFSIC---FDENDSGSVF-FGDQGPATQQSTSFLP-IGEKYDA---YFVGVE 319
+ FS C +D +D+ S+ FG + + + P I +A Y + ++
Sbjct: 247 GR------KFSFCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISID 300
Query: 320 SYCIGNSCL--TQSGFQALVDSGASFTFLP-TEIYAEVVVKFDKLVSSK---RISLQGNS 373
S + + T S + +VD+G TFL + A + +++ R +
Sbjct: 301 SLKVAGQPVPGTTSVSKVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPPPDET 360
Query: 374 WKYCYNASSEEMLK--VPDMRLIFSKNQSFVVR 404
+ CY+ S + + +PD+ L+ VR
Sbjct: 361 LELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVR 393
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 123/277 (44%), Gaps = 51/277 (18%)
Query: 87 RNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPL 146
+NQLL S + T F Y ++ + +GTP + +D GS+L+W QC+ C P
Sbjct: 42 KNQLLGASPYADTVFD----YSIYLMRLQLGTPPFEIVAEIDTGSDLIWT--QCMPC-PN 94
Query: 147 SASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSS 206
+ + + +DPS SS+ K C + CPY Y+ E + S+
Sbjct: 95 CYTQFAPI------FDPSKSSTFKEKRCH-------------GNSCPYEIIYADE-SYST 134
Query: 207 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVP 264
G L + + + S S + V + IGCG + G A+ G++GL +G S+
Sbjct: 135 GILATETVTIQSTSG---EPFVMAETSIGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLI 191
Query: 265 SL--LAKAGLIQNSFSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGV 318
S L GLI S CF + + FG G T + F+ + + Y++ +
Sbjct: 192 SQMDLPIPGLI----SYCFSSQGTSKINFGTNAVVAGDGTVAADMFIKKDQPF--YYLNL 245
Query: 319 ESYCIGNSCLTQSG--FQA-----LVDSGASFTFLPT 348
++ +G+ + G F A +DSG ++T+LPT
Sbjct: 246 DAVSVGDKRIETLGTPFHAQDGNIFIDSGTTYTYLPT 282
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 98/389 (25%), Positives = 151/389 (38%), Gaps = 56/389 (14%)
Query: 57 NSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL--FPSEGSQTHFFGNQFYWLHYTWI 114
NS +++L+ S ++R R+ + NS + P + T GN +
Sbjct: 88 NSSSWIDLV-SQSFERDNARLNTIRSKNSGPYTTMSNLPLQSGTTVGTGN-----YIVTA 141
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
GTP + L+ +D GS+L W IQC P A Y+ +D + ++P SSS K + C
Sbjct: 142 GFGTPAKNSLLIIDTGSDLTW-----IQCKPC-ADCYSQVD---AIFEPKQSSSYKTLPC 192
Query: 175 SHPLCKSRSSCKSLKDP-----CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
C + +S P C Y +Y + +SS G + L L S S
Sbjct: 193 LSATCTELITSESNPTPCLLGGCVYEINYG-DGSSSQGDFSQETLTLGSDSFQ------- 244
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL-LAKAGLIQNSFSICF-DENDSG 287
+ GCG TG + G++GLG +S PS +K G F+ C D S
Sbjct: 245 -NFAFGCGHTNTGLF---KGSSGLLGLGQNSLSFPSQSKSKYG---GQFAYCLPDFGSST 297
Query: 288 SVFFGDQGPAT-QQSTSFLPIGEKY---DAYFVGVESYCIGNSCLT-----QSGFQALVD 338
S G + S F P+ + YFVG+ +G L+ +VD
Sbjct: 298 STGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVD 357
Query: 339 SGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
SG T L + Y + F L S+K S+ CY+ S +++P +
Sbjct: 358 SGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSI----LDTCYDLSRHSQVRIPTITFH 413
Query: 395 FSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F N V + P G C ++
Sbjct: 414 FQNNADVAVSDVGILVPVQNGGSQVCLAF 442
>gi|340500865|gb|EGR27703.1| plasmepsin 5, putative [Ichthyophthirius multifiliis]
Length = 602
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/320 (24%), Positives = 121/320 (37%), Gaps = 59/320 (18%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
YW++ I IG+P +D GS LL PCQ C C D YD
Sbjct: 46 YWIN---IYIGSPPQRQTAIIDTGSYLLAFPCQECKTCG----------DHISYPYDLEK 92
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA-------- 217
S ++K C + C + C + Y+ E +S SGY+ D + L
Sbjct: 93 SLTAKKEKCKSTKLSCQGYCNNFSQECNWSVSYA-EGSSISGYMAGDYVVLGDEMQDYIE 151
Query: 218 -----SFSKHAPQSSV----QSSVII--GCGRKQTGSYLDGAAPDGVMGLGLGDVS---- 262
S+ Q + SV + GC +T +L PDG++GL D S
Sbjct: 152 KLTKNQISEKEEQEYLTYIKHESVFLNFGCTTNETNLFL-SQVPDGIIGLAPSDKSGRAN 210
Query: 263 ----VPSLLAKAGLIQNS----FSICFDENDSGSVFFGDQGPATQQS---TSFLPIGEKY 311
V + K QN+ FS+C + G + G + T +P
Sbjct: 211 TGNIVDEIFKKHK--QNNETHVFSLCLNAEKGGYMSVGGYNYELHEKNARTQIIPFDSDS 268
Query: 312 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
Y V ++ I N+ + + ++DSG + P+ I ++ K ++L S++ S G
Sbjct: 269 GYYSVSIKQILIQNNVIVTNIGYTIIDSGTTIVLGPSRIINPIIQKINELCESEQYSCGG 328
Query: 372 NSW-------KYCYNASSEE 384
+ K+ YN S E
Sbjct: 329 SKKNGDKQQSKFLYNPSKYE 348
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 83/324 (25%), Positives = 134/324 (41%), Gaps = 56/324 (17%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP L+A+D ++ W+PC C C SA +DP++S+S ++V C
Sbjct: 116 LGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSA----------PPFDPAASTSYRSVPC 165
Query: 175 SHPLC--KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
PLC ++C C + Y+ D+S L D L +A +
Sbjct: 166 GSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAVA--------GDAVKTY 215
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGS 288
GC +K TG+ A P G++GLG G +S L + Q +FS C N SG+
Sbjct: 216 TFGCLQKATGT---AAPPQGLLGLGRGPLSF--LSQTRDMYQGTFSYCLPSFKSLNFSGT 270
Query: 289 VFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALV 337
+ G G P ++T L + Y+V + +G + +G ++
Sbjct: 271 LRLGRNGQPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVL 330
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
DSG FT L Y V + + V + SL G + C+N ++ + P + L+F
Sbjct: 331 DSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA---VAWPPVTLLFDG 385
Query: 398 NQSFVVRNHIFSFPENEVGDHACF 421
Q + PE V H+ +
Sbjct: 386 MQ--------VTLPEENVVIHSTY 401
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 138/339 (40%), Gaps = 58/339 (17%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP + L+A+D ++ W+PC C CA + + P S++ KNVSC
Sbjct: 99 IGTPPQTLLLAMDTSNDAAWIPCTACDGCA-------------STLFAPEKSTTFKNVSC 145
Query: 175 SHPLCKSRSSCKSLKDPCPYIA----DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
+ P CK + +P ++ + + +S + LV D + LA +
Sbjct: 146 AAP------ECKQVPNPGCGVSSRNFNLTYGSSSIAANLVQDTITLA--------TDPVP 191
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDS 286
S GC K TG+ A P G++GLG G +S+ S L Q++FS C N S
Sbjct: 192 SYTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLNFS 246
Query: 287 GSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQA 335
GS+ G P + T L + Y+V +E+ +G + +G
Sbjct: 247 GSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGT 306
Query: 336 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ DSG FT L +Y V +F + V K + CYN + VP + IF
Sbjct: 307 IFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVP----IVVPTITFIF 362
Query: 396 SKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
+ + +++I + G C + N +L
Sbjct: 363 TGMNVTLPQDNILI--HSTAGSTTCLAMAGAPDNVNSVL 399
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 115/277 (41%), Gaps = 47/277 (16%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y +H + +GTP + LD GS+L+W C C+ C A+ DP++
Sbjct: 90 YLMH---VSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAA---------PVLDPAA 137
Query: 166 SSSSKNVSCSHPLCKSR--SSC--KSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASFS 220
SS+ + C PLC++ +SC +S D C Y+ Y + + + G L D
Sbjct: 138 SSTHAALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHYG-DRSLTVGQLATDSFTFGGDD 196
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
++ + V GCG G + A G+ G G G S+PS L SFS C
Sbjct: 197 NAGGLAARR--VTFGCGHINKGIFQ--ANETGIAGFGRGRWSLPSQLNV-----TSFSYC 247
Query: 281 ----FDENDSGSVFFGDQGP-----------ATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
FD S V G ++T + + YFV + +G
Sbjct: 248 FTSMFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGG 307
Query: 326 S--CLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKF 358
+ + +S ++ ++DSGAS T LP ++Y V +F
Sbjct: 308 ARVAVPESRLRSSTIIDSGASITTLPEDVYEAVKAEF 344
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 133/322 (41%), Gaps = 73/322 (22%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL-SEYDPSSSSSSKNV 172
+ +G+P + + LD GS L W+ C+ AP NL S +DP SSS +
Sbjct: 67 LTVGSPPQTVTMVLDTGSELSWLHCKK---AP-----------NLHSVFDPLRSSSYSPI 112
Query: 173 SCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
C+ P C++R+ SC K C I Y+ + +S G L D H+
Sbjct: 113 PCTSPTCRTRTRDFSIPVSCDK-KKLCHAIISYA-DASSIEGNLASDTFHIG-------- 162
Query: 226 SSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+S + I GC S D + G++G+ G + S + + GL FS C
Sbjct: 163 NSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSL---SFVTQMGL--QKFSYCISGQ 217
Query: 285 D-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 328
D SG + FG+ P Q ST LP ++ AY V +E + NS L
Sbjct: 218 DSSGILLFGESSFSWLKALKYTPLVQISTP-LPYFDRV-AYTVQLEGIKVANSMLQLPKS 275
Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-------KLVSSKRISLQGNSWK 375
T +G Q +VDSG FTFL +Y + +F K++ QG +
Sbjct: 276 VYAPDHTGAG-QTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQG-AMD 333
Query: 376 YCYNA--SSEEMLKVPDMRLIF 395
CY + + +P + L+F
Sbjct: 334 LCYRVPLTRRTLPPLPTVTLMF 355
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 137/332 (41%), Gaps = 50/332 (15%)
Query: 96 GSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQ-----CAPLSASY 150
G QF +L I++GTP V L D GS+L+WV C+ AP S +
Sbjct: 98 GVVAEVVSRQFEYL--MAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYF 155
Query: 151 YTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK--DPCPYIADYSTEDTSSSGY 208
PS+SS+ V C C++ SS S C Y+ Y + + +SG
Sbjct: 156 V-----------PSASSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYG-DGSRASGQ 203
Query: 209 LVDDILHLASFSKHAPQSSVQ--------------SSVIIGCGRKQTGSYLDGAAPDGVM 254
L + ++ + + +S + + GC TG++ DG++
Sbjct: 204 LSTETFTFSTIADSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTF----RADGLV 259
Query: 255 GLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSGSVFFGDQGPATQQSTSFLPI--G 308
GLG G VS+ S L + FS C + N S ++ FG + ++ + P+ G
Sbjct: 260 GLGGGPVSLASQLGATTSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITG 319
Query: 309 EKYDAYFVGVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 366
E Y + ++S + + + QA +VDSG + T+L + + +V + + R
Sbjct: 320 EVETYYTIALDSINVAGTKRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPR 379
Query: 367 ISLQGNSWKYCYNAS---SEEMLKVPDMRLIF 395
CY+ S E+ L +PD+ L+
Sbjct: 380 AESPEKILDLCYDISGVRGEDALGIPDVTLVL 411
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 133/322 (41%), Gaps = 73/322 (22%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL-SEYDPSSSSSSKNV 172
+ +G+P + + LD GS L W+ C+ AP NL S +DP SSS +
Sbjct: 60 LTVGSPPQTVTMVLDTGSELSWLHCKK---AP-----------NLHSVFDPLRSSSYSPI 105
Query: 173 SCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
C+ P C++R+ SC K C I Y+ + +S G L D H+
Sbjct: 106 PCTSPTCRTRTRDFSIPVSCDK-KKLCHAIISYA-DASSIEGNLASDTFHIG-------- 155
Query: 226 SSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+S + I GC S D + G++G+ G + S + + GL FS C
Sbjct: 156 NSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSL---SFVTQMGL--QKFSYCISGQ 210
Query: 285 D-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 328
D SG + FG+ P Q ST LP ++ AY V +E + NS L
Sbjct: 211 DSSGILLFGESSFSWLKALKYTPLVQISTP-LPYFDRV-AYTVQLEGIKVANSMLQLPKS 268
Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-------KLVSSKRISLQGNSWK 375
T +G Q +VDSG FTFL +Y + +F K++ QG +
Sbjct: 269 VYAPDHTGAG-QTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQG-AMD 326
Query: 376 YCYNA--SSEEMLKVPDMRLIF 395
CY + + +P + L+F
Sbjct: 327 LCYRVPLTRRTLPPLPTVTLMF 348
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 135/320 (42%), Gaps = 52/320 (16%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP L+A+D ++ W+PC C C SA + +DP+SS+S + V C
Sbjct: 118 LGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSA----------APFDPASSASYRTVPC 167
Query: 175 SHPLC--KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
PLC ++C C + Y+ D+S L D L +A + A
Sbjct: 168 GSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAVAGNAVKA--------Y 217
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGS 288
GC ++ TG+ A P G++GLG G +S L + + +FS C N SG+
Sbjct: 218 TFGCLQRATGT---AAPPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGT 272
Query: 289 VFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQSGFQALVDSGA 341
+ G G P ++T L + Y+V + +G + +G ++DSG
Sbjct: 273 LRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPAFDPATGAGTVLDSGT 332
Query: 342 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
FT L Y V + + V + SL G + C+N ++ + P + L+F Q
Sbjct: 333 MFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA---VAWPPVTLLFDGMQ-- 385
Query: 402 VVRNHIFSFPENEVGDHACF 421
+ PE V H+ +
Sbjct: 386 ------VTLPEENVVIHSTY 399
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/313 (25%), Positives = 128/313 (40%), Gaps = 59/313 (18%)
Query: 66 LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
L KR K R++ S +S F S GN + + + IGTP ++
Sbjct: 61 LQRAMKRGKLRLQRLSAKTAS-----FESSVEAPVHAGNGEFLMK---LAIGTPAETYSA 112
Query: 126 ALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR-- 182
+D GS+L+W C+ C C D+ +DP SSS + CS LC +
Sbjct: 113 IMDTGSDLIWTQCKPCKDC----------FDQPTPIFDPKKSSSFSKLPCSSDLCAALPI 162
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
SSC D C Y+ Y + +S+ G L + S S + GCG G
Sbjct: 163 SSC---SDGCEYLYSYG-DYSSTQGVLATETFAFGDASV--------SKIGFGCGEDNDG 210
Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DENDSGSVFFGDQGPAT 298
S A G++GLG G +S+ S L + FS C D S+ G + AT
Sbjct: 211 SGFSQGA--GLVGLGRGPLSLISQLG-----EPKFSYCLTSMDDSKGISSLLVGSE--AT 261
Query: 299 QQSTSFLPIGEKYDA---YFVGVESYCIGNSCL--TQSGFQA--------LVDSGASFTF 345
++ P+ + Y++ +E +G++ L +S F ++DSG + T+
Sbjct: 262 MKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITY 321
Query: 346 LPTEIYAEVVVKF 358
L +A + +F
Sbjct: 322 LEDSAFAALKKEF 334
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/304 (25%), Positives = 127/304 (41%), Gaps = 39/304 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP F + +D GS+L W IQC P + + +S YD SSSSS
Sbjct: 27 YFVELRVGTPAKKFPLIIDTGSDLTW-----IQCNPPNTTANSS-SPPAPWYDKSSSSSY 80
Query: 170 KNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSK-- 221
+ + C+ C SSC S+K P P Y D S ++G L + + + S +
Sbjct: 81 REIPCTDDECLFLPAPIGSSC-SIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSG 139
Query: 222 -----HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
H ++ +V +GC R+ G+ GA+ GV+GLG G +S+ + L
Sbjct: 140 KRAGNHKTRTIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGI 196
Query: 277 FSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS----- 326
FS C + S + F G + + PI A Y+V V +
Sbjct: 197 FSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 256
Query: 327 -----CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
+ G + + DSG + ++L Y++V+ + + R ++ CYN
Sbjct: 257 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNV 316
Query: 381 SSEE 384
+ E
Sbjct: 317 TRME 320
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 137/307 (44%), Gaps = 51/307 (16%)
Query: 70 WKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDA 129
+R R +L+ S++ PS + H GN + ++ + IGTP ++ +D
Sbjct: 61 LQRAVKRGRLRLQRLSAKTASFEPSVEAPVHA-GNGEFLMN---LAIGTPAETYSAIMDT 116
Query: 130 GSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKS 187
GS+L+W QC P + D+ +DP SSS + CS LC + SSC
Sbjct: 117 GSDLIWT-----QCKPCKVCF----DQPTPIFDPEKSSSFSKLPCSSDLCVALPISSC-- 165
Query: 188 LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG-SYLD 246
D C Y Y + +S+ G L + S S + GCG G +Y
Sbjct: 166 -SDGCEYRYSYG-DHSSTQGVLATETFTFGDASV--------SKIGFGCGEDNRGRAYSQ 215
Query: 247 GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--SVFFGDQGPATQQSTSF 304
GA G++GLG G + SL+++ G+ + S+ + ++ G ++ G + AT +S
Sbjct: 216 GA---GLVGLGRGPL---SLISQLGVPKFSYCLTSIDDSKGISTLLVGSE--ATVKSAIP 267
Query: 305 LPIGE---KYDAYFVGVESYCIGNSCL--TQSGFQA--------LVDSGASFTFLPTEIY 351
P+ + + Y++ +E +G++ L +S F ++DSG + T+L +
Sbjct: 268 TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAF 327
Query: 352 AEVVVKF 358
A + +F
Sbjct: 328 AALKKEF 334
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 102/432 (23%), Positives = 183/432 (42%), Gaps = 62/432 (14%)
Query: 32 FSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL 91
SD + +S + + DS + + + + ++ ++RVK + +S+ + L
Sbjct: 13 ISDASPVATVSIDAKLVLRDSAARGGGIGFKAIHVAAP----QSRVKANPSPSSAAQKSL 68
Query: 92 FPSEG--------------SQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP 137
FP S T G +F +YT I +G+P ++ +D GS L W+
Sbjct: 69 FPYSAHIFQQHTKNPAALRSSTTTLGRKF-GEYYTSIKLGSPGQEAILIVDTGSELTWLQ 127
Query: 138 C-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH-PLCKSRSSCK----SLKDP 191
C C CAP S+D + YD + S+S + V+C++ LC + S +
Sbjct: 128 CLPCKVCAP-------SVD---TIYDAARSASYRPVTCNNSQLCSNSSQGTYAYCARGSQ 177
Query: 192 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 251
C + A Y + + S G L D L + + P +VQ GC + GA+
Sbjct: 178 CQFAAFYG-DGSFSYGSLSTDTLIMETVVGGKP-VTVQ-DFAFGCAQGDLELVPTGAS-- 232
Query: 252 GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----NDSGSVFFGD-QGPATQ-QSTSF 304
G++GL G +++P L + FS CF + N +G VFFG+ + P Q Q TS
Sbjct: 233 GILGLNAGKMALPMQLGQR--FGWKFSHCFPDRSSHLNSTGVVFFGNAELPHEQVQYTSV 290
Query: 305 LPIGEKYDA--YFVGVESYCIGNSCLT--QSGFQALVDSGASFTFLPTEIYAEVVVKFDK 360
+ Y V ++ I + L G ++DSG+SF+ ++++ F K
Sbjct: 291 ALTNSELQRKFYHVALKGVSINSHELVFLPRGSVVILDSGSSFSSFVRPFHSQLREAFLK 350
Query: 361 LVSSKRISLQGNSW---KYCYNASSEEM----LKVPDMRLIFSKNQSFVVRNHIFSFPEN 413
L+G+S+ C+ S++++ +P + L+F + + + P
Sbjct: 351 HRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVA 410
Query: 414 EVGDHA--CFSY 423
+H CF++
Sbjct: 411 RFQNHVKMCFAF 422
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 105/256 (41%), Gaps = 35/256 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y + G+P + + +D GS+L W +QC P + D +DPS+S +
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSW-----LQCKPCVVYCHVQAD---PLFDPSASKTY 169
Query: 170 KNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
K++SC+ C S C++ + C Y A Y + + S GYL D+L LA
Sbjct: 170 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYG-DSSYSMGYLSQDLLTLA----- 223
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
S + GCG+ G + A G++GLG +S+ L + +FS C
Sbjct: 224 --PSQTLPGFVYGCGQDSDGLFGRAA---GILGLGRNKLSM--LGQVSSKFGYAFSYCLP 276
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCLTQSGFQ----A 335
G + + F P+ YF+ + + +G L + Q
Sbjct: 277 TRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT 336
Query: 336 LVDSGASFTFLPTEIY 351
++DSG T LP +Y
Sbjct: 337 IIDSGTVITRLPMSVY 352
>gi|225451013|ref|XP_002284868.1| PREDICTED: basic 7S globulin-like [Vitis vinifera]
Length = 441
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 84/196 (42%), Gaps = 27/196 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ T I TP V V +D G LWV C S Y S S Y P+ SS
Sbjct: 46 YVTIISQRTPLVPLNVIVDLGGQFLWVGC---------GSNYVS-----SSYRPAQCHSS 91
Query: 170 K------NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ SC H L + R C + C ++ S+G L +D+L L S
Sbjct: 92 QCFLAHGPKSCDHCLSRGRPKCNN--GTCILFSENVFTSKVSAGDLSEDVLSLQSTDGLN 149
Query: 224 PQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
P+S+V + C + L G A +G+ GLG G + +P+LL+ A F++C
Sbjct: 150 PRSAVAIPHFLFSCAPEVLLQGLAGGA-EGIAGLGHGRIGLPTLLSSALNFTRKFAVCLP 208
Query: 282 -DENDSGSVFFGDQGP 296
SG +FFGD GP
Sbjct: 209 PTTTSSGVIFFGD-GP 223
>gi|147821119|emb|CAN68736.1| hypothetical protein VITISV_030193 [Vitis vinifera]
Length = 441
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 84/196 (42%), Gaps = 27/196 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ T I TP V V +D G LWV C S Y S S Y P+ SS
Sbjct: 46 YVTIISQRTPLVPLNVIVDLGGQFLWVGC---------GSNYVS-----SSYRPARCHSS 91
Query: 170 K------NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ SC H L + R C + C ++ S+G L +D+L L S
Sbjct: 92 QCFLAHGPKSCDHCLSRGRPKCNN--GTCILFSENVFTSKVSAGDLSEDVLSLQSTDGLN 149
Query: 224 PQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
P+S+V + C + L G A +G+ GLG G + +P+LL+ A F++C
Sbjct: 150 PRSAVAIPHFLFSCAPEVLLQGLAGGA-EGIAGLGHGRIGLPTLLSSALNFTRKFAVCLP 208
Query: 282 -DENDSGSVFFGDQGP 296
SG +FFGD GP
Sbjct: 209 PTTTSSGVIFFGD-GP 223
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/306 (25%), Positives = 137/306 (44%), Gaps = 51/306 (16%)
Query: 71 KRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAG 130
+R R +L+ S++ PS + H GN + ++ + IGTP ++ +D G
Sbjct: 62 QRAVKRGRLRLQRLSAKTASFEPSVEAPVHA-GNGEFLMN---LAIGTPAETYSAIMDTG 117
Query: 131 SNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKSL 188
S+L+W QC P + D+ +DP SSS + CS LC + SSC
Sbjct: 118 SDLIWT-----QCKPCKVCF----DQPTPIFDPEKSSSFSKLPCSSDLCVALPISSC--- 165
Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG-SYLDG 247
D C Y Y + +S+ G L + S S + GCG G +Y G
Sbjct: 166 SDGCEYRYSYG-DHSSTQGVLATETFTFGDASV--------SKIGFGCGEDNRGRAYSQG 216
Query: 248 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--SVFFGDQGPATQQSTSFL 305
A G++GLG G + SL+++ G+ + S+ + ++ G ++ G + AT +S
Sbjct: 217 A---GLVGLGRGPL---SLISQLGVPKFSYCLTSIDDSKGISTLLVGSE--ATVKSAIPT 268
Query: 306 PIGE---KYDAYFVGVESYCIGNSCL--TQSGFQA--------LVDSGASFTFLPTEIYA 352
P+ + + Y++ +E +G++ L +S F ++DSG + T+L +A
Sbjct: 269 PLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFA 328
Query: 353 EVVVKF 358
+ +F
Sbjct: 329 ALKKEF 334
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/322 (25%), Positives = 133/322 (41%), Gaps = 44/322 (13%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
IG+P F + LD GS+L W+ QC+ C + ++N YDP S S +N++C+
Sbjct: 202 IGSPPKHFSLILDTGSDLNWI--QCVPC-------FDCFEQNGPYYDPKDSISFRNITCN 252
Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSV 228
P C+ SS CK CPY Y ++ + ++ ++L S + +
Sbjct: 253 DPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRR 312
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
+V+ GCG G + A ++GLG G +S S L L +SFS C + DS +
Sbjct: 313 VENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRDSDT 367
Query: 289 ------VFFGDQGPATQQSTSFL--------PIGEKY----DAYFVGVESYCIGNSCLTQ 330
+F D+ T +F P+ Y + FVG E I
Sbjct: 368 SVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNL 427
Query: 331 SGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
S A ++DSG + ++ Y + F + V ++ CYN S + L
Sbjct: 428 SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELN 487
Query: 388 VPDMRLIFSKNQ--SFVVRNHI 407
P+ + F+ +F V N+
Sbjct: 488 FPEFLIQFADGAVWNFPVENYF 509
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 136/330 (41%), Gaps = 44/330 (13%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
IG+P F + LD GS+L W+ QC+ C + ++N YDP S S +N++C+
Sbjct: 202 IGSPPKHFSLILDTGSDLNWI--QCVPC-------FDCFEQNGPYYDPKDSISFRNITCN 252
Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSV 228
P C+ SS CK CPY Y ++ + ++ ++L S + +
Sbjct: 253 DPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRR 312
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
+V+ GCG G + A ++GLG G +S S L L +SFS C + DS +
Sbjct: 313 VENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRDSDT 367
Query: 289 ------VFFGDQGPATQQSTSFL--------PIGEKY----DAYFVGVESYCIGNSCLTQ 330
+F D+ T +F P+ Y + FVG E I
Sbjct: 368 SVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNL 427
Query: 331 SGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
S A ++DSG + ++ Y + F + V ++ CYN S + L
Sbjct: 428 SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELN 487
Query: 388 VPDMRLIFSKNQ--SFVVRNHIFSFPENEV 415
P+ + F+ +F V N+ + ++
Sbjct: 488 FPEFLIQFADGAVWNFPVENYFIRIQQLDI 517
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/315 (26%), Positives = 137/315 (43%), Gaps = 46/315 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE-YDPSSSSS 168
+ T + +GTP+ S+ + +D GS+L W +QC+P S + R + +DP +SS+
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTW-----LQCSPCVVSCH----RQVGPLFDPRASST 184
Query: 169 SKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
+V CS C + S+C S + C Y A Y + + S GYL D + S S
Sbjct: 185 YTSVRCSASQCDELQAATLNPSAC-SASNVCIYQASYG-DSSFSVGYLSTDTVSFGSTSY 242
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
S GCG+ G + A G++GL +S+ LA + + SFS C
Sbjct: 243 --------PSFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSYCL 289
Query: 282 DENDSGSVFFGDQGP-ATQQSTSFLPIG-EKYDA--YFVGVESYCIGNSCLT-----QSG 332
+ S + GP T S+ P+ DA YF+ + +G S L S
Sbjct: 290 PT--AASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSS 347
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++DSG T LPT ++ + + ++ + + + C+ + + L+VP +
Sbjct: 348 LPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ-LRVPTVV 406
Query: 393 LIFSKNQS--FVVRN 405
+ F+ S RN
Sbjct: 407 MAFAGGASMKLTTRN 421
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 145/386 (37%), Gaps = 58/386 (15%)
Query: 66 LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
L + R +R N S N + P + + N I +GTP VS
Sbjct: 60 LQKAFHRSISRANHFRANGVSTNSIQSPVISNNGEYLMN---------ISLGTPPVSMHG 110
Query: 126 ALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSC 185
D GS+LLW QC P + Y ++ +DP+ S + + +SC C +
Sbjct: 111 IADTGSDLLWR-----QCKPCDSCY----EQIEPIFDPAKSKTYQILSCEGKSCSNLGGQ 161
Query: 186 KSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
D I YS D S +SG L D L + S + P S + V+ GCG G++
Sbjct: 162 GGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGSTTGR-PVSVPK--VVFGCGHNNGGTF 218
Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICF-----DENDSGSVFFGDQGPAT 298
+ + S++++ LI FS C D + S + FG +G +
Sbjct: 219 ELHGSGLVGL-----GGGPLSMISQLRPLIGGRFSYCLVPLGNDPSVSSKMHFGSRGIVS 273
Query: 299 QQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGF-------------QALVDSGASF 343
P+ + Y++ +ES +G+ L GF ++DSG +
Sbjct: 274 GAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADEGNIIIDSGTTL 333
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK------ 397
T LP + Y + + K + N + CY S+ L++P + F
Sbjct: 334 TLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCY--SNLSGLRIPTITAHFVGADLELK 391
Query: 398 --NQSFVVRNHIFSFPENEVGDHACF 421
N V+ +F F V D A F
Sbjct: 392 PLNTFVQVQEDLFCFAMIPVSDLAIF 417
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 90/344 (26%), Positives = 140/344 (40%), Gaps = 67/344 (19%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
IGTP ++ LD GS+L+W C C +C P A Y P+ S + NVS
Sbjct: 106 IGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYA----------PARSVTYANVS 155
Query: 174 CSHPLCKSRSSCKSL-------------KDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
C LC + S + + C Y YS D SS+ D +L +F+
Sbjct: 156 CGSRLCDALPSLRPSSRCSASASAPAPERGGCTYY--YSYGDGSST----DGVLATETFT 209
Query: 221 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
A + + GCG G + + G++G+G G + SL+++ G+ + FS C
Sbjct: 210 FGA--GTTVHDLAFGCGTDNLGGTDNSS---GLVGMGRGPL---SLVSQLGVTK--FSYC 259
Query: 281 F----DENDSGSVFFGDQG---PATQQSTSFL--PIGEKYDA-YFVGVESYCIGNSC--- 327
F D S +F G PA +ST F+ P G + + Y++ +E +G++
Sbjct: 260 FTPFNDTTTSSPLFLGSSASLSPAA-KSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPI 318
Query: 328 ------LTQSGFQAL-VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
LT SG L +DSG +FT L + + V+ S C+ A
Sbjct: 319 DPAVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAA 378
Query: 381 ---SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACF 421
E + VP + L F + R+ + E+ V AC
Sbjct: 379 PQGRGPEAVDVPRLVLHFDGADMELPRSS--AVVEDRVAGVACL 420
>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 242
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 105/241 (43%), Gaps = 31/241 (12%)
Query: 203 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 262
+SSSG L +DI+ S+ Q +V GC +TG A DG+MGLG G +S
Sbjct: 2 SSSSGVLGEDIVSFGRESELKAQRAV-----FGCENSETGDLFSQHA-DGIMGLGRGQLS 55
Query: 263 VPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 319
+ L + G+I +SFS+C+ D G V G P+ + P+ Y Y + ++
Sbjct: 56 IMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRSPY--YNIELK 113
Query: 320 SYCIGNSCLT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---Q 370
+ L S ++DSG ++ +LP + + + F V+SK SL +
Sbjct: 114 EIHVAGKALRVDSRIFDSKHGTVLDSGTTYAYLPEQAF----MAFKDAVTSKVHSLKKIR 169
Query: 371 GNSWKY---CYNASSEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
G Y C+ + + K+ PD+ ++F Q + + F ++V C
Sbjct: 170 GPDPSYKDICFAGARRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGV 229
Query: 424 F 424
F
Sbjct: 230 F 230
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 87/346 (25%), Positives = 145/346 (41%), Gaps = 53/346 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I +G P F + D ++ W+ CQ CI+C D+ S +DPS SSS +
Sbjct: 191 IGVGGPPQKFYMIFDLQTDFTWLQCQPCIKC----------YDQPDSIFDPSQSSSYTLL 240
Query: 173 SCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
SC C SSC S C Y Y + T++ G L+++ + S S
Sbjct: 241 SCETKHCNLLPNSSC-SDDGYCRYNITYK-DGTNTEGVLINETVSFES-------SGWVD 291
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--- 287
V +GC K G ++ DG GLG G +S PS + + S S C E+ G
Sbjct: 292 RVSLGCSNKNQGPFV---GSDGTFGLGRGSLSFPSRINAS-----SMSYCLVESKDGYSS 343
Query: 288 -SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG-------NSCLTQSGF---QAL 336
++ F + L + + Y+VG++ +G NS T + +
Sbjct: 344 STLEFNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMI 403
Query: 337 VDSGASFTFLPTEIYAEV----VVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
V S + T L + Y V V K L K LQ ++ CYN SS +++P +
Sbjct: 404 VSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAF-LQFDT---CYNLSSNNTVELPILE 459
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGILILQK 438
+ +S+++ + + ++ G CF++ + +F+ + LQ+
Sbjct: 460 FEVNDGKSWLLPKESYLYAVDKNGTF-CFAFAPSKGSFSILGTLQQ 504
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/289 (28%), Positives = 118/289 (40%), Gaps = 61/289 (21%)
Query: 132 NLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD 190
++ W C+ C++C L + +DPS+S L S SC
Sbjct: 97 SITWTQCKPCVRC----------LKDSHRHFDPSAS-----------LTYSLGSCIPSTV 135
Query: 191 PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 250
Y Y + TS Y D + S V GCGR G + GA
Sbjct: 136 GNTYNMTYGDKSTSVGNYGCDTMT--------LEPSDVFPKFQFGCGRNNEGDFGSGA-- 185
Query: 251 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFGDQGPATQQS-------- 301
DG++GLG G +S S A + FS C E DS GS+ FG++ AT QS
Sbjct: 186 DGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFGEK--ATSQSSLKFTSLV 241
Query: 302 ----TSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDSGASFTFLPTEIYA 352
TS L E+ YFV + +GN L S F + ++DSG T LP Y+
Sbjct: 242 NGPGTSGL---EESGYYFVKLLDISVGNKRLNVPSSVFASPGTIIDSGTVITCLPQRAYS 298
Query: 353 EVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
+ F K ++ +S +G+ CYN S + + +P++ L F +
Sbjct: 299 ALTAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGE 347
>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 413
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 129/294 (43%), Gaps = 39/294 (13%)
Query: 103 GNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPC--QCIQCAPLSASYYTSLDRNLS 159
GN + H+T ++IG P+ F + +D GS+L WV C +CI C +L R++
Sbjct: 45 GNVYPLGHFTVLLNIGNPSKVFELDIDTGSDLTWVQCDVECIGC---------TLPRDML 95
Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSSC-----KSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
Y P +++ VS PLC + SS K+ D C Y +Y+ + SS G LV D++
Sbjct: 96 -YRPHNNA----VSREDPLCAALSSLGKFIFKNPNDQCAYEVEYA-DHGSSVGVLVKDLV 149
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQ-TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
+ + + ++ GCG Q G + GV+GL ++ S L+ G +
Sbjct: 150 PM----RLTNGKRISPNLGFGCGYDQENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHV 205
Query: 274 QNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQS 331
N C F GD P++ S+ PI + Y G +
Sbjct: 206 SNVVGHCLTGRGGGFLFFGGDVVPSS--GMSWTPILRNSEGKYSSGPAEVYFNGRAVGIG 263
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
G DSG+S+T+ +++Y + +KL+ + L+GN K + + E+
Sbjct: 264 GLTLTFDSGSSYTYFNSQVYRAI----EKLLKN---DLKGNPLKLASDDKTLEL 310
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 63/264 (23%), Positives = 111/264 (42%), Gaps = 42/264 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IG+P SF +D GS+L+W C+ C QC D++ +DP SSS +SC
Sbjct: 372 IGSPPRSFSAIMDTGSDLIWTQCKPCQQC----------FDQSTPIFDPKQSSSFYKISC 421
Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
S LC + + D C Y+ Y + +S+ G L + +F +
Sbjct: 422 SSELCGALPTSTCSSDGCEYLYTYG-DSSSTQGVLAFETF---TFGDSTEDQISIPGLGF 477
Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFF 291
GCG G A G++GLG G +S+ S L + F+ C D++ S+
Sbjct: 478 GCGNDNNGDGFSQGA--GLVGLGRGPLSLVSQLK-----EQKFAYCLTAIDDSKPSSLLL 530
Query: 292 GDQGPAT-------QQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ-------- 334
G T ++T + + Y++ ++ +G + L+ +S F+
Sbjct: 531 GSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGG 590
Query: 335 ALVDSGASFTFLPTEIYAEVVVKF 358
++DSG + T++ + + +F
Sbjct: 591 VIIDSGTTITYVENSAFTSLKNEF 614
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 102/422 (24%), Positives = 176/422 (41%), Gaps = 71/422 (16%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVK-LQSN 82
++KL+HR + ++ V + +S+E + L ++++K L+S
Sbjct: 38 LATKLIHR--NSYLHPLYDQNETVEDRSKREQTSSIERFDFL--------ESKIKELKSV 87
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCI 141
N +R+ L+ + GS F N + IG+P V+ LV +D GS+LLWV C CI
Sbjct: 88 GNEARSSLIPFNRGSG--FLVN---------LSIGSPPVTQLVVVDTGSSLLWVQCLPCI 136
Query: 142 QCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK-DPCPYIADYST 200
C S S+ +DP S S K + C P + K + + Y Y
Sbjct: 137 NCFQQSTSW----------FDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLG 186
Query: 201 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 260
D SS G L + L + + + +S++ GCG + D A +GV GLG
Sbjct: 187 GD-SSQGILAKESLLFETLDEGKIK---KSNITFGCGHMNIKTNNDDAY-NGVFGLG--- 238
Query: 261 VSVPSLLAKAGLIQNSFSICFDENDS-----GSVFFGDQGPATQQSTSFLPIGEKYDAYF 315
+ P + A + N FS C + ++ + G QG + ++ P+ + Y+
Sbjct: 239 -AYPH-ITMATQLGNKFSYCIGDINNPLYTHNHLVLG-QGSYIEGDST--PLQIHFGHYY 293
Query: 316 VGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTE----IYAEVVVKFDKL 361
V ++S +G+ L + F+ L+DSG ++T L +Y E+V L
Sbjct: 294 VTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGL 353
Query: 362 VSSKRISLQGNSWKYCYNA-SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHAC 420
+ +RI Q C+ S +++ P + F+ V+ + S GD C
Sbjct: 354 L--ERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESG--SLFRQHGGDRFC 409
Query: 421 FS 422
+
Sbjct: 410 LA 411
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 165/388 (42%), Gaps = 58/388 (14%)
Query: 76 RVKLQSNNNSSRNQLLFPSEG--------------SQTHFFGNQFYWLHYTWIDIGTPNV 121
RVK + +S+ + LFP S T G +F +YT I +G+P
Sbjct: 53 RVKANPSPSSAAQKSLFPYSAHIFQQHTKNPAALRSSTTTLGRKF-GEYYTSIKLGSPGQ 111
Query: 122 SFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH-PLC 179
++ +D GS L W+ C C CAP S+D + YD + S S K V+C++ LC
Sbjct: 112 EAILIVDTGSELTWLKCLPCKVCAP-------SVD---TIYDAARSVSYKPVTCNNSQLC 161
Query: 180 KSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
+ S + C + A Y + + S G L D L + + P +VQ G
Sbjct: 162 SNSSQGTYAYCARGSQCQFAAFYG-DGSFSYGSLSTDTLIMETVVGGKP-VTVQ-DFAFG 218
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----NDSGSVF 290
C + GA+ G++GL G +++P L + FS CF + N +G VF
Sbjct: 219 CAQGDLELVPTGAS--GILGLNAGKMALPMQLGQR--FGWKFSHCFPDRSSHLNSTGVVF 274
Query: 291 FGD-QGPATQ-QSTSFLPIGEKYDA--YFVGVESYCIGNS--CLTQSGFQALVDSGASFT 344
FG+ + P Q Q TS + Y V ++ I + L G ++DSG+SF+
Sbjct: 275 FGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSVVILDSGSSFS 334
Query: 345 FLPTEIYAEVVVKFDKLVSSKRISLQGNSW---KYCYNASSEEM----LKVPDMRLIFSK 397
++++ F K L+G+S+ C+ S++++ +P + L+F
Sbjct: 335 SFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLVFED 394
Query: 398 NQSFVVRNHIFSFPENEVGDHA--CFSY 423
+ + + P +H CF++
Sbjct: 395 GVTIGIPSIGVLLPVARYQNHVKMCFAF 422
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 148/331 (44%), Gaps = 48/331 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+++ + IG P + LD GS++ WV QCAP + Y ++ ++P+SS+S
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWV-----QCAPCAECY----EQTDPIFEPTSSASF 201
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
++SC CKS + C Y Y + + + G V + + L S S
Sbjct: 202 TSLSCETEQCKSLDVSECRNGTCLYEVSYG-DGSYTVGDFVTETVTLGSTS--------L 252
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
++ IGCG G ++ A ++GLG G +S PS L + SFS C + DS S
Sbjct: 253 GNIAIGCGHNNEGLFIGAAG---LLGLGGGSLSFPSQLNAS-----SFSYCLVDRDSDST 304
Query: 290 FFGD-QGPATQQS-TSFLPIGEKYDAYF-VGVESYCIGNSCL--TQSGFQA--------L 336
D P T + T+ L D +F +G+ +G + L ++ FQ +
Sbjct: 305 STLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGII 364
Query: 337 VDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
VDSG + T L T +Y + F K L +++ ++L + CY+ SS+ ++VP +
Sbjct: 365 VDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVAL----FDTCYDLSSKSRVEVPTVS 420
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F+ + + P + G CF++
Sbjct: 421 FHFANGNELPLPAKNYLIPVDSEGTF-CFAF 450
>gi|255552245|ref|XP_002517167.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543802|gb|EEF45330.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 435
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 73/281 (25%), Positives = 119/281 (42%), Gaps = 45/281 (16%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYY-TSLDRNLSEYDPSSSSS 168
+ T I+ TP V+ + +D G +WV C +S+SY D L + S S +
Sbjct: 49 YVTQINQRTPLVAVKLTVDLGGTFMWVDCDNY----VSSSYTPVRCDSALCKLADSHSCT 104
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
++ S P C + + C +I S+SG + D++ L S P +V
Sbjct: 105 TECYSSPKPGCYNNT--------CSHIPYNPVVHVSTSGDIGLDVVSLQSMDGKYPGRNV 156
Query: 229 Q-SSVIIGCGRKQTGSYLDGAAPD--GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-- 283
+V CG TG L+ A GV GLG G++S+P+ + A +Q+ F+IC
Sbjct: 157 SVPNVPFVCG---TGFMLENLADGVLGVAGLGRGNISLPAYFSSALGLQSKFAICLSSLT 213
Query: 284 NDSGSVFFGDQ-GPATQQSTSFLPI-------------GEKYDAYFVGVESYCIGNSCLT 329
N SG ++FGD GP + + P+ G+ YF+ V++ +G +
Sbjct: 214 NSSGVIYFGDSIGPLSSDFLIYTPLVRNPVSTAGAYFEGQSSTDYFIAVKTLRVGGKEIK 273
Query: 330 QSGFQALVDSGAS----------FTFLPTEIYAEVVVKFDK 360
+ +D+ +T L T IY V+ F K
Sbjct: 274 FNKTLLSIDNEGKGGTRISTVHPYTLLHTSIYKAVIKAFAK 314
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 77/330 (23%), Positives = 131/330 (39%), Gaps = 47/330 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ GTP SF LD GSN+ W+PC C C+ ++PS SS+ +
Sbjct: 128 LGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCS-----------SKQQPFEPSKSSTYNYL 176
Query: 173 SCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
+C+ C+ C + C Y + VD+IL + S + Q
Sbjct: 177 TCASQQCQLLRVCTKSDNSVNCSLTQRYGDQSE------VDEILSSETLSVGSQQV---E 227
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDENDS 286
+ + GC G L P ++G G +S S A L ++FS C F +
Sbjct: 228 NFVFGCSNAARG--LIQRTPS-LVGFGRNPLSFVS--QTATLYDSTFSYCLPSLFSSAFT 282
Query: 287 GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLT----------QSGF 333
GS+ G + + Q F P+ +Y + Y+VG+ +G ++ +G
Sbjct: 283 GSLLLGKEA-LSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGR 341
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG T L Y + F +S+ ++ + + CYN S ++ + P + L
Sbjct: 342 GTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNRPSGDV-EFPLITL 400
Query: 394 IFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F N + +P N+ G C ++
Sbjct: 401 HFDDNLDLTLPLDNILYPGNDDGSVLCLAF 430
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 88/340 (25%), Positives = 140/340 (41%), Gaps = 48/340 (14%)
Query: 81 SNNNSSRNQLLFPSEGSQTHF-----FGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLW 135
S +NS Q PS+ Q G+ Y++ + +GTP + +D GS++LW
Sbjct: 6 STSNSHDRQTKVPSQDFQAPVISGLSLGSGEYFIR---VSVGTPPRGMYLVMDTGSDILW 62
Query: 136 VPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYI 195
+QCAP + Y+ + +DP SS+ + C+ C + + + C Y
Sbjct: 63 -----LQCAPCVSCYHQCDE----VFDPYKSSTYSTLGCNSRQCLNLDVGGCVGNKCLYQ 113
Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMG 255
DY + + S+G D + L S S V + + +GCG G ++ A ++G
Sbjct: 114 VDYG-DGSFSTGEFATDAVSLNSTSGGG--QVVLNKIPLGCGHDNEGYFVGAAG---LLG 167
Query: 256 LGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQG--PATQQSTSFLPIG 308
LG G +S P+ + FS C D + S+ FGD PA F P
Sbjct: 168 LGKGPLSFPNQINSEN--GGRFSYCLTGRDTDSTERSSLIFGDAAVPPA---GVRFTPQA 222
Query: 309 EKYDA---YFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVV 355
Y++ + +G S LT S FQ ++DSG S T L YA +
Sbjct: 223 SNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLR 282
Query: 356 VKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
F S ++ + + + CYN S + VP + L F
Sbjct: 283 EAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLHF 322
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 131/329 (39%), Gaps = 56/329 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ + IG P S L+ D GS+L+WV C C C+ S + + + P SS+
Sbjct: 83 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPA---------TVFFPRHSST 133
Query: 169 SKNVSCSHPLCK--------SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASF 219
C P+C+ R + + CPY +Y D S +SG + L +
Sbjct: 134 FSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPY--EYGYADGSLTSGLFARETTSLKTS 191
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLGDVSVPSLLAKAGLIQNS 276
S + + SV GCG + +G + G + +GVMGLG G +S S L + N
Sbjct: 192 SG---KEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNK 246
Query: 277 FSIC-----FDENDSGSVFFGDQGPATQQS--TSFL--PIGEKYDAYFVGVESYCIGNSC 327
FS C + + GD G A + T L P+ + Y+V ++S + +
Sbjct: 247 FSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTF--YYVKLKSVFVNGAK 304
Query: 328 LT---------QSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 377
L SG V DSG + FL Y V+ + + + C
Sbjct: 305 LRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGFDLC 364
Query: 378 YNASS----EEMLKVPDMRLIFSKNQSFV 402
N S E++L P ++ FS FV
Sbjct: 365 VNVSGVTKPEKIL--PRLKFEFSGGAVFV 391
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 127/315 (40%), Gaps = 56/315 (17%)
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNV 121
LEL + + T +++ + +L E S + Y Y IG P
Sbjct: 26 LELTHVDAKQNCSTEERMRRATERTHRRLASMGEASAPVHWAESQYIAEYL---IGDPPQ 82
Query: 122 SFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK- 180
+D GSNL+W QC C P +NLS YDPS S +++ V+C+ C
Sbjct: 83 QAEAIIDTGSNLIWT--QCSTCQPAGC-----FSQNLSFYDPSRSRTARPVACNDTACAL 135
Query: 181 -SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC--G 237
S + C C + Y ++ +L +F+ PQS S+ GC
Sbjct: 136 GSETRCARDNKACAVLTAYGAG-------VIGGVLGTEAFTFQ-PQSE-NVSLAFGCIAA 186
Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFG 292
+ T LDGA+ G++GLG G++S+ S L N FS C ++ +F G
Sbjct: 187 TRLTPGSLDGAS--GIIGLGRGNLSLVSQLGD-----NKFSYCLTPYFSQSTNTSRLFVG 239
Query: 293 -----DQGPATQQSTSFL--PIGEKYDA-YFVGVESYCIGNSCLT--QSGFQ-------- 334
G A S FL P + + Y++ + +G++ L ++ F
Sbjct: 240 ASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGL 299
Query: 335 ---ALVDSGASFTFL 346
L+DSG+ FT L
Sbjct: 300 WAGTLIDSGSPFTSL 314
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 77/320 (24%), Positives = 129/320 (40%), Gaps = 39/320 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T + IG P + LD GS++ W +QC P + Y+ + ++PSSSSS
Sbjct: 148 YFTRVGIGKPAREVYMVLDTGSDVNW-----LQCTPCADCYH----QTEPIFEPSSSSSY 198
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+ +SC P C + + C Y Y + + + G + L + S++
Sbjct: 199 EPLSCDTPQCNALEVSECRNATCLYEVSYG-DGSYTVGDFATETLTIG--------STLV 249
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
+V +GCG G + V GL + L + L SFS C + DS S
Sbjct: 250 QNVAVGCGHSNEGLF--------VGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSA 301
Query: 290 FFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA--------L 336
D G + P+ + Y++G+ +G L QS F+ +
Sbjct: 302 STVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGII 361
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
+DSG + T L TEIY + F K + + CYN S++ ++VP + F
Sbjct: 362 IDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFP 421
Query: 397 KNQSFVVRNHIFSFPENEVG 416
+ + + P + VG
Sbjct: 422 GGKMLALPAKNYMIPVDSVG 441
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 129/320 (40%), Gaps = 39/320 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T + IG P + LD GS++ W +QC P + Y+ + ++PSSSSS
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNW-----LQCTPCADCYH----QTEPIFEPSSSSSY 201
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+ +SC P C + + C Y Y + + + G + L + S ++
Sbjct: 202 EPLSCDTPQCNALEVSECRNATCLYEVSYG-DGSYTVGDFATETLTIGS--------TLV 252
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS- 288
+V +GCG G + V GL + L + L SFS C + DS S
Sbjct: 253 QNVAVGCGHSNEGLF--------VGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSA 304
Query: 289 --VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA--------L 336
V FG P L + Y++G+ +G L QS F+ +
Sbjct: 305 STVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGII 364
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
+DSG + T L T IY + F K S + + CYN S++ ++VP + F
Sbjct: 365 IDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFP 424
Query: 397 KNQSFVVRNHIFSFPENEVG 416
+ + + P + VG
Sbjct: 425 GGKMLALPAKNYMIPVDSVG 444
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 89/364 (24%), Positives = 146/364 (40%), Gaps = 64/364 (17%)
Query: 93 PSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYT 152
P+ G ++ + Y L + IG+ + +D GS + V C
Sbjct: 83 PTSGVRSVVTPLEDYALFSMQLGIGSLQKNLSAIIDTGSEAVLVQCG------------- 129
Query: 153 SLDRNLSEYDPSSSSSSKNVSCSHPLC---------KSRSSCKSLKDPCPYIADYSTEDT 203
R+ +DP++S S + V C LC S C + C Y Y +
Sbjct: 130 --SRSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSATCTYSLSYG-DSR 186
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
+S+G D++ L S + + Q+ V GC G +D + G++G G++S+
Sbjct: 187 NSTGDFSQDVIFLNS-TNSSGQAVQFRDVAFGCAHSPQGFLVDLGSL-GIVGFNRGNLSL 244
Query: 264 PSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGE------KYD 312
PS L K L + FS CF +G +F GD G ++ + P+ + +
Sbjct: 245 PSQL-KDRLGGSKFSYCFPSQPWQPRATGVIFLGDSG-LSKSKVGYTPLLDNPVTPARSQ 302
Query: 313 AYFVGVESYCIGNSCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKL 361
Y+VG+ S + L +S F+ ++DSG +FT + + Y F
Sbjct: 303 LYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAF--- 359
Query: 362 VSSKRISLQ-----GNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVR-NHIF---SFP 411
+S R L+ + CYN S+ L VP++RL N +R H+F S
Sbjct: 360 AASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAA 419
Query: 412 ENEV 415
NEV
Sbjct: 420 GNEV 423
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/340 (22%), Positives = 137/340 (40%), Gaps = 48/340 (14%)
Query: 93 PSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYY 151
PS + + G+ Y ++ + IGTP F +D GS+L+W CQ C QC
Sbjct: 81 PSGVETSVYAGDGEYLMN---LSIGTPAQPFSAIMDTGSDLIWTQCQPCTQC-------- 129
Query: 152 TSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
+++ ++P SSS + CS LC++ SS + C Y Y + + + G +
Sbjct: 130 --FNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYG-DGSETQGSMGT 186
Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
+ L S S ++ GCG G A G++G+G G +S+PS L
Sbjct: 187 ETLTFGSVSI--------PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT- 235
Query: 272 LIQNSFSICFDENDSGS---VFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 325
FS C S + + G + A +T+ + + Y++ + +G+
Sbjct: 236 ----KFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 291
Query: 326 SCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 374
+ L S F ++DSG + T+ Y V +F ++ ++ + +
Sbjct: 292 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGF 351
Query: 375 KYCYNASSEEM-LKVPDMRLIFSKNQSFVVRNHIFSFPEN 413
C+ S+ L++P + F + + F P N
Sbjct: 352 DLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYFISPSN 391
>gi|395328846|gb|EJF61236.1| endopeptidase [Dichomitus squalens LYAD-421 SS1]
Length = 412
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/335 (24%), Positives = 132/335 (39%), Gaps = 60/335 (17%)
Query: 75 TRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
+R +Q Q F EG N ++ I +GTP +F V LD GS+ L
Sbjct: 66 SRPAVQDGEELFWTQEEFSVEGGHNVPLSNFMNAQYFAEISLGTPPQTFKVILDTGSSNL 125
Query: 135 WVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
WVP ++C ++ +T +YD SSSS+ K + S +
Sbjct: 126 WVP--SVKCTSIACFLHT-------KYDSSSSSTYKANGTEFSIQYGSGSMEG------- 169
Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
+ ++DT G L D L A +K + + G+ DG++
Sbjct: 170 ---FVSQDTFRIGDLTVDGLDFAEATK-------EPGLAFAFGKF-----------DGIL 208
Query: 255 GLGLGDVSVPSL------LAKAGLIQN---SFSICFDENDSGSVFFGD-QGPATQQSTSF 304
GL ++V + L GL+ SF + E+D G FG A +
Sbjct: 209 GLAYDTIAVNHITPPFYHLINKGLVDEPVFSFRLGSSEDDGGEAIFGGVDDSAYTGKIQY 268
Query: 305 LPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
+P+ K AY+ V +E +G+ L A +D+G S LPT+I AE++ + +
Sbjct: 269 VPVRRK--AYWEVELEKVSLGDDVLELESTGAAIDTGTSLIALPTDI-AEMI---NTQIG 322
Query: 364 SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 398
+ + SW Y ++ +PD+ F N
Sbjct: 323 ATK------SWNGQYTVDCAKVPSLPDLTFTFGGN 351
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/338 (23%), Positives = 143/338 (42%), Gaps = 46/338 (13%)
Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYD 162
NQ ++ I +G+P V +D+GS+++WV CQ C QC Y D +D
Sbjct: 136 NQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQC-------YHQTD---PVFD 185
Query: 163 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
P+ S+S V CS +C+ + C Y Y + + + G L + L +F +
Sbjct: 186 PADSASFMGVPCSSSVCERIENAGCHAGGCRYEVMYG-DGSYTKGTLALETL---TFGR- 240
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICF 281
+V +V IGCG + G ++ A G+ G + SL+ + G +FS C
Sbjct: 241 ----TVVRNVAIGCGHRNRGMFVGAAGLLGLGGGSM------SLVGQLGGQTGGAFSYCL 290
Query: 282 ---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS--CLTQSGF 333
+ +GS+ FG A +++P+ A Y++ + +G +++ F
Sbjct: 291 VSRGTDSAGSLEFGRG--AMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVF 348
Query: 334 Q--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
Q ++D+G + T +PT Y F + + + + CYN +
Sbjct: 349 QLNEMGNGGVVMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVS 408
Query: 386 LKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
++VP + F+ + F P ++VG CF++
Sbjct: 409 VRVPTVSFYFAGGPILTLPARNFLIPVDDVGTF-CFAF 445
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 146/349 (41%), Gaps = 75/349 (21%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +G+P + LD GS L W+ C+ + TS+ ++P SSSS +
Sbjct: 1004 LTVGSPPQQVTMVLDTGSELSWLHCK-------KSPNLTSV------FNPLSSSSYSPIP 1050
Query: 174 CSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
CS P+C++R+ + L +P C I Y+ + +S G L D +
Sbjct: 1051 CSSPICRTRT--RDLPNPVTCDPKKLCHAIVSYA-DASSLEGNLASDNFRIG-------- 1099
Query: 226 SSVQSSVIIGCGRKQ-TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
SS + GC + + + A G+MG+ G + S + + GL + FS C
Sbjct: 1100 SSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSL---SFVTQLGLPK--FSYCISGR 1154
Query: 285 D-SGSVFFGD----------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 328
D SG + FGD P Q ST LP ++ AY V ++ +GN L
Sbjct: 1155 DSSGVLLFGDLHLSWLGNLTYTPLVQISTP-LPYFDRV-AYTVQLDGIRVGNKILPLPKS 1212
Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL-------QGNSWK 375
T +G Q +VDSG FTFL +Y + +F + L QG +
Sbjct: 1213 IFAPDHTGAG-QTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQG-AMD 1270
Query: 376 YCYN-ASSEEMLKVPDMRLIFSKNQSFVVRNHI--FSFPENEVGDHACF 421
CY+ A+ ++ +P + L+F + VV + + PE G+ +
Sbjct: 1271 LCYSVAAGGKLPTLPSVSLMF-RGAEMVVGGEVLLYRVPEMMKGNEWVY 1318
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 129/301 (42%), Gaps = 34/301 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + D GS+L WV QC P +S + ++ +DPS SS+ V
Sbjct: 148 VGLGTPAQPSALIFDTGSDLSWV-----QCQPCGSSGHCHPQQD-PLFDPSKSSTYAAVH 201
Query: 174 CSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
C P C + C C Y+ Y + +S++G L D L L S S +
Sbjct: 202 CGEPQCAAAGDLCSEDNTTCLYLVRYG-DGSSTTGVLSRDTLALTS-------SRALTGF 253
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
GCG + G + DG++GLG G++S+PS A + FS C ++S + +
Sbjct: 254 PFGCGTRNLGDF---GRVDGLLGLGRGELSLPSQAAAS--FGAVFSYCLPSSNSTTGYLT 308
Query: 293 -DQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCL-------TQSGFQALVDSG 340
PAT Q T+ L + YFV + S IG L T+ G L+DSG
Sbjct: 309 IGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTRGG--TLLDSG 366
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
T+LP + YA + +F + + + CY+ + E + VP + F
Sbjct: 367 TVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRFGDGAV 426
Query: 401 F 401
F
Sbjct: 427 F 427
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 72/319 (22%), Positives = 131/319 (41%), Gaps = 64/319 (20%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPS--------SSS 167
+GTP + LD GS+L+W PC + + YT + S DP+ SS
Sbjct: 80 LGTPPQKVSLVLDTGSSLVWTPCT------IPTATYTCQNCTFSGVDPTKIPIYARNKSS 133
Query: 168 SSKNVSCSHPLCK----SRSSCKSLKDPCPYIA-DYSTEDTSSSGYLVDDILHLASFSKH 222
+ +++ C P C S +C + K CPY +Y S++G LV D+L L+ ++
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNCSTTKR-CPYYGLEYGLG--STTGQLVSDVLGLSKLNRI 190
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC-- 280
+ GC S + P+G+ G G G S+P A+ GL + S+ +
Sbjct: 191 P-------DFLFGC------SLVSNRQPEGIAGFGRGLASIP---AQLGLTKFSYCLVSH 234
Query: 281 -FDENDSGSVFFGDQG----PATQQSTSFLP------IGEKYDAYFVGVESYCIGNSCLT 329
FD+ +G A ++ P + + Y++ + +G +
Sbjct: 235 RFDDTPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVP 294
Query: 330 ----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKY 376
+ +VDSG++FTF+ I+ V + +K ++ + + + +
Sbjct: 295 IPPRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGP 354
Query: 377 CYNASSEEMLKVPDMRLIF 395
CYN + + + VP + F
Sbjct: 355 CYNITGQSEVDVPKLTFSF 373
>gi|356500210|ref|XP_003518926.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 435
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 141/355 (39%), Gaps = 72/355 (20%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ T I TP V + LD G LWV C + Y S S Y P+ S+
Sbjct: 45 YITQIKQRTPLVPENLVLDIGGQFLWVDCD---------NNYVS-----STYRPARCGSA 90
Query: 170 KNVSCSHPLCKSRSSCKSLKDP------CPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
+ CS S +C S P C D + T++SG L D++ L S +
Sbjct: 91 Q---CSLARSDSCGNCFSAPKPGCNNNTCGVTPDNTVTGTATSGELAQDVVSLQSTNGFN 147
Query: 224 P-QSSVQSSVIIGCGRKQTGSYLDGAAP--DGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
P Q++ S + C L G A G+ GLG +++PS LA A + F++C
Sbjct: 148 PIQNATVSRFLFSCAPT---FLLQGLATGVSGMAGLGRTRIALPSQLASAFSFRRKFAVC 204
Query: 281 FDENDSGSVFFGDQGP-------ATQQSTSFLPI-------------GEKYDAYFVGVES 320
++ G FFGD GP Q +F P+ GE YF+GV+S
Sbjct: 205 LSSSN-GVAFFGD-GPYVLLPNVDASQLLTFTPLLINPVSTASAFSQGEPSAEYFIGVKS 262
Query: 321 YCIG------NSCLTQSGFQAL----VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
I N+ L + + + S +T L I+ V F K S++ I+
Sbjct: 263 IKIDEKTVPLNTTLLSINSKGVGGTKISSVNPYTVLEDSIFKAVTEAFVKASSARNITRV 322
Query: 371 GNSWKYCYNASSEEML------KVPDMRLIFSKNQSFVVR----NHIFSFPENEV 415
+ + S E +L VP + L+ +NQ V R N + S +++V
Sbjct: 323 ASVAPFEVCFSRENVLATRLGAAVPTIELVL-QNQKTVWRIFGANSMVSVSDDKV 376
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/304 (24%), Positives = 127/304 (41%), Gaps = 39/304 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP F + +D GS+L W IQC P + + +S YD SSSSS
Sbjct: 59 YFVELRVGTPAKKFPLIVDTGSDLTW-----IQCNPPNTTANSS-SPPAPWYDKSSSSSY 112
Query: 170 KNVSCSHPLCK-----SRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK-- 221
+ + C+ C+ SSC + PC Y YS + + ++G L + + + S +
Sbjct: 113 REIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYS-DQSRTTGILAYETISMKSRKRSG 171
Query: 222 -----HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
H + +V +GC R+ G+ GA+ GV+GLG G +S+ + L
Sbjct: 172 KRAGNHKTRRIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGI 228
Query: 277 FSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS----- 326
FS C + S + F G + + PI A Y+V V +
Sbjct: 229 FSYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 288
Query: 327 -----CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 380
+ G + + DSG + ++L Y++V+ + + R ++ CYN
Sbjct: 289 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNV 348
Query: 381 SSEE 384
+ E
Sbjct: 349 TRME 352
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 113/291 (38%), Gaps = 45/291 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP F + D GS+L WV +C +P + P +S S
Sbjct: 116 YFVKLRVGTPVQEFTLVADTGSDLTWV--KCAGASPPG-----------RVFRPKTSRSW 162
Query: 170 KNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
+ CS CK + ++C S PC Y DY ++ S+ I+ S + P
Sbjct: 163 APIPCSSDTCKLDVPFTLANCSSPASPCTY--DYRYKEGSAG---ARGIVGTESATIALP 217
Query: 225 QSSVQ--SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
V V++GC G A DGV+ LG +S + A SFS C
Sbjct: 218 GGKVAQLKDVVLGCSSSHDGQSFRSA--DGVLSLGNAKISFAT--QAAARFGGSFSYCLV 273
Query: 282 ----DENDSGSVFFG----DQGPATQQSTSFLP----IGEKYDAYFVGVESYCIGNSCLT 329
N +G + FG + PATQ P G K DA V ++ I
Sbjct: 274 DHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWD 333
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYN 379
++DSG + T L Y VV K L ++S +++CYN
Sbjct: 334 AKSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKVSFP--PFEHCYN 382
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 98/220 (44%), Gaps = 32/220 (14%)
Query: 70 WKRQKTRVKLQSNNNSSRNQ-LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALD 128
W+R++ + + + ++S + ++ P +G + + N FY + + +G P + + D
Sbjct: 22 WERKRPILSVPTASSSFASSSIVLPLQG---NVYPNGFYNVT---LYVGQPPKPYFLDPD 75
Query: 129 AGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCK 186
GS+L W+ C C QC P S+ V C PLC S S
Sbjct: 76 TGSDLTWLQCDAPCQQCT--------------ETLHPLYQPSNDLVPCKDPLCMSLHSSM 121
Query: 187 SLK----DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
+ D C Y +Y+ + SS G LV D+ L + + P ++ + +GCG Q
Sbjct: 122 DHRCENPDQCDYEVEYA-DGGSSLGVLVRDVFPL-NLTNGDP---IRPRLALGCGYDQDP 176
Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
DG++GLG G VS+ S L G+++N CF+
Sbjct: 177 GSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFN 216
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 129/320 (40%), Gaps = 55/320 (17%)
Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
G + L+Y +G V +D S L WV QCAP ++ + D+ +D
Sbjct: 119 GARLRTLNYV-ATVGLGGGEATVIVDTASELTWV-----QCAPCASCH----DQQGPLFD 168
Query: 163 PSSSSSSKNVSCSHPLCKS--------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDI 213
P+SS S + C+ C + +C + P C Y Y + + S G L D
Sbjct: 169 PASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYR-DGSYSQGVLAHDK 227
Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKAGL 272
L LA V + GCG G + G+MGLG +S+ S + + G
Sbjct: 228 LSLAG--------EVIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQFGG 276
Query: 273 IQNSFSICF---DENDSGSVFFGDQGPATQQSTSFL-------PIGEKYDAYFVGVESYC 322
+ FS C + SGS+ GD + ST + P+ + YFV +
Sbjct: 277 V---FSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTGIT 331
Query: 323 IGNSCLTQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRISLQGNSWKYCY 378
IG + S + +VDSG T L +Y AE + +F + + S+ C+
Sbjct: 332 IGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSI----LDTCF 387
Query: 379 NASSEEMLKVPDMRLIFSKN 398
N + +++P ++ +F N
Sbjct: 388 NLTGFREVQIPSLKFVFEGN 407
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 123/319 (38%), Gaps = 51/319 (15%)
Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEY 161
G L+Y +G V +D S L WV CQ C C D+ +
Sbjct: 112 GANLRTLNYVAT-VGLGAAEATVVVDTASELTWVQCQPCESCH----------DQQDPLF 160
Query: 162 DPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCP----------YIADYSTEDTSSSGYLV 210
DPSSS S V C+ C + R + + PC Y Y + + S G L
Sbjct: 161 DPSSSPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYR-DGSYSRGVLA 219
Query: 211 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLLAK 269
D L LA + GCG G+ G + G+MGLG VS V + +
Sbjct: 220 RDKLRLAGQDIEG--------FVFGCGTSNQGAPFGGTS--GLMGLGRSHVSLVSQTMDQ 269
Query: 270 AGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-------YFVGVE 319
G + FS C + SGS+ GD A + ST + D+ YF+ +
Sbjct: 270 FGGV---FSYCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLT 326
Query: 320 SYCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 376
+G + F A ++DSG T L +Y V +F ++ + +
Sbjct: 327 GITVGGQEVESPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDT 386
Query: 377 CYNASSEEMLKVPDMRLIF 395
C+N + + ++VP ++ +F
Sbjct: 387 CFNLTGLKEVQVPSLKFVF 405
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/253 (26%), Positives = 103/253 (40%), Gaps = 31/253 (12%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
IGTP V L D S+L+WV QC+P T ++ ++P SS+ N+SC
Sbjct: 96 IGTPPVERLAIADTASDLIWV-----QCSPCE----TCFPQDTPLFEPHKSSTFANLSCD 146
Query: 176 HPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C S + C + + C Y Y + +S+ G L + +H S + P++ I
Sbjct: 147 SQPCTSSNIYYCPLVGNLCLYTNTYG-DGSSTKGVLCTESIHFGSQTVTFPKT------I 199
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVF 290
GCG + G++GLG G +S+ S L I + FS C F + +
Sbjct: 200 FGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLPFTSTSTIKLK 257
Query: 291 FGDQGPATQQSTSFLP--IGEKYDA-YFVGVESYCIGNSCLT-----QSGFQALVDSGAS 342
FG+ T P I Y + YF+ + IG L + ++D G
Sbjct: 258 FGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGTV 317
Query: 343 FTFLPTEIYAEVV 355
T+L Y V
Sbjct: 318 LTYLEVNFYHNFV 330
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/298 (23%), Positives = 121/298 (40%), Gaps = 34/298 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I +GTP L D +L W+PC+ C C +++ PS SS+ +
Sbjct: 101 ISVGTPPAEILALADITGDLTWLPCKTCQDCTKDGFTFF-----------PSESSTYTSA 149
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS---SGYLVDDILHLASFSKHAPQSSVQ 229
+C C+ + C Y+ + SS G + D + SF + Q+
Sbjct: 150 ACESYQCQITNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVAMDTI---SFHSSSGQALSY 206
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDS 286
+ CG + GA G++GLG G S+ S + LI +FS C + S
Sbjct: 207 PNTNFICGTFIDNWHYIGA---GIVGLGRGLFSMTSQMKH--LINGTFSQCLVPYSSKQS 261
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYD--AYFVGVESYCIGNSCLTQSGFQA-----LVDS 339
+ FG +G + + PI + + AYF+ +E+ +G + + + + A +D
Sbjct: 262 SKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRVANNFYSAPKSNIYIDW 321
Query: 340 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-NSWKYCYNASSEEMLKVPDMRLIFS 396
+FT LP + Y V + K ++ I+ CY + S+ P + + F+
Sbjct: 322 RTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKLSLCYKSESDHDFDAPPITMHFT 379
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/304 (22%), Positives = 127/304 (41%), Gaps = 43/304 (14%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IG+P SF +D GS+L+W C+ C QC D++ +DP SSS +
Sbjct: 115 LAIGSPPRSFSAIMDTGSDLIWTQCKPCQQC----------FDQSTPIFDPKQSSSFYKI 164
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
SCS LC + + D C Y+ Y + +S+ G L + ++ Q S+ +
Sbjct: 165 SCSSELCGALPTSTCSSDGCEYLYTYG-DSSSTQGVLAFETFTFGDSTED--QISI-PGL 220
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSV 289
GCG G A G++GLG G +S+ S L + F+ C D++ S+
Sbjct: 221 GFGCGNDNNGDGFSQGA--GLVGLGRGPLSLVSQLK-----EQKFAYCLTAIDDSKPSSL 273
Query: 290 FFGDQGPAT-------QQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ------ 334
G T ++T + + Y++ ++ +G + L+ +S F+
Sbjct: 274 LLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGS 333
Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN-ASSEEMLKVPDM 391
++DSG + T++ + + +F ++ C+N + ++VP +
Sbjct: 334 GGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKL 393
Query: 392 RLIF 395
F
Sbjct: 394 TFHF 397
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 129/320 (40%), Gaps = 55/320 (17%)
Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
G + L+Y +G V +D S L WV QCAP ++ + D+ +D
Sbjct: 118 GARLRTLNYV-ATVGLGGGEATVIVDTASELTWV-----QCAPCASCH----DQQGPLFD 167
Query: 163 PSSSSSSKNVSCSHPLCKS--------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDI 213
P+SS S + C+ C + +C + P C Y Y + + S G L D
Sbjct: 168 PASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYR-DGSYSQGVLAHDK 226
Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKAGL 272
L LA V + GCG G + G+MGLG +S+ S + + G
Sbjct: 227 LSLAG--------EVIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQFGG 275
Query: 273 IQNSFSICF---DENDSGSVFFGDQGPATQQSTSFL-------PIGEKYDAYFVGVESYC 322
+ FS C + SGS+ GD + ST + P+ + YFV +
Sbjct: 276 V---FSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTGIT 330
Query: 323 IGNSCLTQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRISLQGNSWKYCY 378
IG + S + +VDSG T L +Y AE + +F + + S+ C+
Sbjct: 331 IGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSI----LDTCF 386
Query: 379 NASSEEMLKVPDMRLIFSKN 398
N + +++P ++ +F N
Sbjct: 387 NLTGFREVQIPSLKFVFEGN 406
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 73/294 (24%), Positives = 124/294 (42%), Gaps = 34/294 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+ WV QC P Y ++ +DP+ SS+ NVS
Sbjct: 184 VGLGTPVSRYTVVFDTGSDTTWV-----QCQPCVVVCYEQREK---LFDPARSSTYANVS 235
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 236 CAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 287
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVFFG 292
GCG + G + + A G++GLG G S+P K G + F+ C +G+ +
Sbjct: 288 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLD 341
Query: 293 DQGPATQQSTSFLPIGEKYDA----YFVGVESYCIGNSCLT--QSGFQ---ALVDSGASF 343
+ +++ L D Y+VG+ +G L+ QS F +VDSG
Sbjct: 342 FGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVI 401
Query: 344 TFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T LP Y+ + F ++++ + + + CY+ + + +P + L+F
Sbjct: 402 TRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLF 455
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 148/362 (40%), Gaps = 58/362 (16%)
Query: 60 EYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWL------HYTW 113
E L L D R K KL S +SRN G T F + L ++T
Sbjct: 79 ELFHLRLQRDAIRVK---KLSSLGATSRN---LSKPGGTTGFSSSVISGLAQGSGEYFTR 132
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +GTP + LD GS+++W +QCAP Y + ++P S S V
Sbjct: 133 IGVGTPPKYVYMVLDTGSDIVW-----LQCAPCKNCY----SQTDPVFNPVKSGSFAKVL 183
Query: 174 CSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C PLC+ S C + C Y Y + + ++G V + L + +
Sbjct: 184 CRTPLCRRLESPGCNQ-RQTCLYQVSYG-DGSYTTGEFVTETLTF--------RRTKVEQ 233
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS----G 287
V +GCG G ++ A ++GLG G +S PS + FS C + +
Sbjct: 234 VALGCGHDNEGLFVGAAG---LLGLGRGGLSFPSQAGRT--FNQKFSYCLVDRSASSKPS 288
Query: 288 SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SCLTQSGFQ------- 334
SV FG+ A ++ F P+ + D Y+V + +G S +T S F+
Sbjct: 289 SVVFGNS--AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNG 346
Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++D G S T L Y + F SS + + + + + CY+ S + +KVP + L
Sbjct: 347 GVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVL 406
Query: 394 IF 395
F
Sbjct: 407 HF 408
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 125/317 (39%), Gaps = 46/317 (14%)
Query: 102 FGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRN 157
G L Y + GTP V +V +D GS++ W+ PC QC P +
Sbjct: 70 LGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFP----------QK 119
Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
YDPS SS+ V C+ +CK S C S K C + Y+ + TS+ G
Sbjct: 120 DPLYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQ-CGFAISYA-DGTSTVGAYSQ 177
Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
D L L AP + VQ + GCG G + DGV+GLG SL A+ G
Sbjct: 178 DKLTL------APGAIVQ-NFYFGCGH---GKHAVRGLFDGVLGLGR---LRESLGARYG 224
Query: 272 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSC- 327
+ FS C S F F P+G + V + +G
Sbjct: 225 GV---FSYCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKL 281
Query: 328 -LTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
L S F +VDSG T L + Y + F K + + R+ G+ CYN + +
Sbjct: 282 DLRPSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD-LDTCYNLTGYK 340
Query: 385 MLKVPDMRLIFSKNQSF 401
+ VP + L F+ +
Sbjct: 341 NVVVPKIALTFTGGATI 357
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 138/333 (41%), Gaps = 51/333 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
+++ + +G P F + LD GS++ W+ CQ C C Y D +DP SSSS
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDC-------YQQTD---PIFDPRSSSS 204
Query: 169 SKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
++ C C++ S C++ K C Y Y + + + G V + L + S
Sbjct: 205 FASLPCESQQCQALETSGCRASK--CLYQVSYG-DGSFTVGEFVTETLTFGN-------S 254
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN-- 284
+ + V +GCG G + V GL + L + + +SFS C +
Sbjct: 255 GMINDVAVGCGHDNEGLF--------VGSAGLLGLGGGPLSLTSQMKASSFSYCLVDRDS 306
Query: 285 -DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT---------QSGFQ 334
S + F P+ + L G+ Y+VG+ +G L+ SG+
Sbjct: 307 SSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYG 366
Query: 335 A-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKVPD 390
+VDSG + T L T+ Y + D VS + N + CY+ SS+ + +P
Sbjct: 367 GIIVDSGTAITRLQTQAYNTLR---DAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPT 423
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
+ F+ +S + + P + VG CF++
Sbjct: 424 VSFEFAGGKSLQLPPKNYLIPVDSVGTF-CFAF 455
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 92/330 (27%), Positives = 133/330 (40%), Gaps = 72/330 (21%)
Query: 102 FGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWV---PCQCIQCAPLSASYYTSLDRN 157
G L Y + GTP V +V +D GS++ W+ PC QC P +
Sbjct: 104 LGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFP----------QK 153
Query: 158 LSEYDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
YDPS SS+ V C+ +CK S C S K C + Y+ + TS+ G
Sbjct: 154 DPLYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQ-CGFAISYA-DGTSTVGAYSQ 211
Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG---------LGDV- 261
D L L AP + VQ + GCG G + DGV+GLG G V
Sbjct: 212 DKLTL------APGAIVQ-NFYFGCGH---GKHAVRGLFDGVLGLGRLRESLGARYGGVF 261
Query: 262 --SVPSLLAKAGLIQNSFSICFDENDSGSVF--FGD-QGPATQQSTSFLPI---GEKYDA 313
+PS+ +K G + ++ +N SG VF G G T + + I G+K D
Sbjct: 262 SYCLPSVSSKPGFL----ALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLD- 316
Query: 314 YFVGVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
L S F +VDSG T L + Y + F K + + R+ G
Sbjct: 317 --------------LRPSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNG 362
Query: 372 NSWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
+ CYN + + + VP + L F+ +
Sbjct: 363 D-LDTCYNLTGYKNVVVPKIALTFTGGATI 391
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 86/356 (24%), Positives = 147/356 (41%), Gaps = 64/356 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQC-----APLSASYYTSLDRNLSEYDP 163
++IGTP V +D GS+L WVPC C++C L A++ S +
Sbjct: 86 LNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSYRASC 145
Query: 164 SS-------SSSSKNVSCSHPLCKSRSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILH 215
+S SS + +C+ C + K+ PCP A T +G +V IL
Sbjct: 146 ASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFA-----YTYGAGGVVTGILT 200
Query: 216 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 275
+ + V + C +Y + P G+ G G G + S++++ G +Q
Sbjct: 201 RDTLRVNGSSPGVAKEIPKFCFGCVGSAYRE---PIGIAGFGRGTL---SMVSQLGFLQK 254
Query: 276 SFSICF-------DENDSGSVFFGDQGPATQQSTSFLPI--GEKY-DAYFVGVESYCIGN 325
FS CF + N S + GD ++ F P+ Y + Y+VG+E+ +GN
Sbjct: 255 GFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGN 314
Query: 326 SCLTQ-----SGFQAL------VDSGASFTFLPTEIYAEVVVKFDKLVSSKR---ISLQG 371
T+ F +L +DSG ++T LP Y++V+ ++ R + +Q
Sbjct: 315 VSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEMQ- 373
Query: 372 NSWKYCYNA--------SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHA 419
+ CY +S+++L P + F N S V+ +P + G+ A
Sbjct: 374 TGFDLCYKVPRPNNNTLTSDDLL--PSITFHFLNNVSLVLPQGNHFYPVSAPGNPA 427
>gi|150866171|ref|XP_001385673.2| aspartic proteinase precursor [Scheffersomyces stipitis CBS 6054]
gi|149387427|gb|ABN67644.2| aspartic proteinase precursor [Scheffersomyces stipitis CBS 6054]
Length = 417
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 149/347 (42%), Gaps = 76/347 (21%)
Query: 95 EGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSL 154
E T++ Q++ T I +GTP F V LD GS+ LWVP Q +C+ L+ +T
Sbjct: 92 EAPLTNYLNAQYF----TEISLGTPAQQFKVILDTGSSNLWVPSQ--ECSSLACFLHT-- 143
Query: 155 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
+YD SSS+ K+ S S++ + Y ++DT + G LV
Sbjct: 144 -----KYDHDSSST----------YKANGSEFSIQYGSGAMEGYVSQDTLAIGDLVIPKQ 188
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL-------L 267
A +++ + + G+ DG++GL +SV + L
Sbjct: 189 DFA-------EATSEPGLAFAFGKF-----------DGILGLAYNTISVNKIVPPVYNAL 230
Query: 268 AKAGLIQNSFSICF-----DENDSGSVFFG--DQGPATQQSTSFLPIGEKYDAYF-VGVE 319
A+ L + F+ DEND G FG D+ T + T +LP+ K AY+ V E
Sbjct: 231 AQGLLDEPQFAFYLGDTKKDENDGGLATFGGYDESAFTGKIT-WLPVRRK--AYWEVSFE 287
Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 379
+G+ A +D+G S LP+ + AE++ K+ ++K SW Y
Sbjct: 288 GIGLGDEYAELDNTGAAIDTGTSLITLPSSL-AEIINA--KIGATK-------SWSGQYQ 337
Query: 380 ASSEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEVGDHACFSYFT 425
E+ +PD+ L F+ N + ++I EVG +C S FT
Sbjct: 338 IDCEKQDTLPDLTLNFAGYNFTLTAHDYIL-----EVGG-SCISVFT 378
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 67/286 (23%), Positives = 115/286 (40%), Gaps = 33/286 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I IGTP D GS+L+W QC P + Y + +DPS S+S K VS
Sbjct: 95 ISIGTPPFDVYGIYDTGSDLMWT-----QCLPCLSCY----KQKNPMFDPSKSTSFKEVS 145
Query: 174 CSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C C+ SC + C + Y + + + G + + L L S S Q +
Sbjct: 146 CESQQCRLLDTVSCSQPQKLCDFSYGYG-DGSLAQGVIATETLTLNSNSG---QPXSIXN 201
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDS 286
++ GCG +G++ + G+ G G +S+ S + FS C D + +
Sbjct: 202 IVFGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSIT 259
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGN--------SCLTQSGFQAL 336
+ FG + + P+ K D YFV ++ +G+ S + G
Sbjct: 260 SKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKG-NVF 318
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
+D+G T LP + Y +V + + + + + CY +++
Sbjct: 319 IDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSAT 364
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 139/330 (42%), Gaps = 45/330 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++T + +G P S+ + LD GS++ W+ CQ C C S +T P++SSS
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFT----------PAASSS 208
Query: 169 SKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
++C C S SSC++ + C Y +Y + + + G V + + S
Sbjct: 209 YSPLTCDSQQCNSLQMSSCRNGQ--CRYQVNYG-DGSFTFGDFVTETMSFGG-------S 258
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+S+ +GCG G ++ A G+ G L SL ++ L SFS C DS
Sbjct: 259 GTVNSIALGCGHDNEGLFVGAAGLLGLGGGPL------SLTSQ--LKATSFSYCLVNRDS 310
Query: 287 GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLT--QSGFQ------- 334
+ D A + P+ K D Y+VG+ +G L Q F+
Sbjct: 311 AASSTLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDG 370
Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
+VD G + T L +E Y + F + R + + CY+ S + +KVP +
Sbjct: 371 GVIVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSF 430
Query: 394 IFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F +S+ + + P + G + CF++
Sbjct: 431 HFDGGKSWDLPAANYLIPVDSAGTY-CFAF 459
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 67/286 (23%), Positives = 116/286 (40%), Gaps = 33/286 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I IGTP D GS+L+W QC P + Y + +DPS S+S K VS
Sbjct: 95 ISIGTPPFDVYGIYDTGSDLMWT-----QCLPCLSCY----KQKNPMFDPSKSTSFKEVS 145
Query: 174 CSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
C C+ SC + C + Y + + + G + + L L S S Q + +
Sbjct: 146 CESQQCRLLDTVSCSQPQKLCDFSYGYG-DGSLAQGVIATETLTLNSNSG---QPTSILN 201
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDS 286
++ GCG +G++ + G+ G G +S+ S + FS C D + +
Sbjct: 202 IVFGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSIT 259
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGN--------SCLTQSGFQAL 336
+ FG + + P+ K D YFV ++ +G+ S + G
Sbjct: 260 SKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKG-NVF 318
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 382
+D+G T LP + Y +V + + + + + CY +++
Sbjct: 319 IDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSAT 364
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 70/286 (24%), Positives = 114/286 (39%), Gaps = 58/286 (20%)
Query: 107 YWLHYTWIDIGTPNVSFL-VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS 165
Y +H + IGTP + + LD GS+L+W C C C + +D +
Sbjct: 100 YLIH---LSIGTPRPQRVALTLDTGSDLVWTQCACHVC----------FAQPFPTFDALA 146
Query: 166 SSSSKNVSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
S ++ V CS P+C S S C + C Y+ DY+ + + +SG +V+D +F+
Sbjct: 147 SQTTLAVPCSDPICTSGKYPLSGCTFNDNTCFYLYDYA-DKSITSGRIVED-----TFTF 200
Query: 222 HAPQSSVQS---------SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 272
+PQ + S +V GCG+ G + + G+ G G +S+PS L K
Sbjct: 201 RSPQGNNGSKAHAGVAVPNVRFGCGQYNKGIFKSNES--GIAGFSRGPMSLPSQL-KVAR 257
Query: 273 IQNSFSICFDENDSGSVFFGDQGP--------ATQQSTSFLPIGEKYDAYFVGVESYCIG 324
+ F+ D S G GP QST F Y++ ++ +G
Sbjct: 258 FSHCFTAIADARTSPVFLGGAPGPDNLGAHATGPVQSTPF--ANSNGSLYYLTLKGITVG 315
Query: 325 NSCLTQSGFQ------------ALVDSGASFTFLPTEIYAEVVVKF 358
+ L + ++DSG LP +Y + F
Sbjct: 316 KTRLPLNALAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAF 361
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 83/311 (26%), Positives = 127/311 (40%), Gaps = 50/311 (16%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+GTP F + +D+GS+LLWV QC+P Y + D L Y PS+SS+ V C
Sbjct: 70 LGTPPQKFSLIVDSGSDLLWV-----QCSPCRQCY--AQDSPL--YVPSNSSTFSPVPCL 120
Query: 176 HPLCKSRSSCKSLKDPCPY------IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
C + + PC + +Y DTSSS + ++
Sbjct: 121 SSDCLLIPATEGF--PCDFRYPGACAYEYLYADTSSSK-------GVFAYESATVDGVRI 171
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
V GCG GS+ AA GV+GLG G +S S + A N F+ C +
Sbjct: 172 DKVAFGCGSDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYLDPTS 226
Query: 285 DSGSVFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCL--TQSGFQ----- 334
S S+ FGD+ +T + PI + Y+V +E +G L + S ++
Sbjct: 227 VSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLG 286
Query: 335 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASSEEMLKVPD 390
++ DSG + T+ Y+ ++ FD V R S+QG C + + P
Sbjct: 287 NGGSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQG--LDLCVELTGVDQPSFPS 344
Query: 391 MRLIFSKNQSF 401
+ F F
Sbjct: 345 FTIEFDDGAVF 355
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/308 (26%), Positives = 129/308 (41%), Gaps = 33/308 (10%)
Query: 102 FGNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
G+ L Y + +GTP V+ V +D GS++ WV C P A + +
Sbjct: 118 LGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHA-------QTGAL 170
Query: 161 YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
+DP+ SS+ + VSC+ C + + C + C Y Y + ++++G D L L
Sbjct: 171 FDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYG-DGSTTNGTYSRDTLTL 229
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
+ S GC ++G + D DG+MGLG G S+ S A A NS
Sbjct: 230 SGASDAV------KGFQFGCSHLESG-FSD--QTDGLMGLGGGAQSLVSQTAAA--YGNS 278
Query: 277 FSICFDENDSGSVFFGDQGPATQQ----STSFLPIGEKYDAYFVGVESYCIGNS--CLTQ 330
FS C SGS F G +T L + Y ++ +G L+
Sbjct: 279 FSYCLPPT-SGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSP 337
Query: 331 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
S F A +VDSG T LP Y+ + F + R + + C++ + + + +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397
Query: 389 PDMRLIFS 396
P + L+FS
Sbjct: 398 PTVALVFS 405
>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
Length = 475
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 75/168 (44%), Gaps = 17/168 (10%)
Query: 192 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 251
C Y Y+ E +SS G++V+D P ++ GC +TG A D
Sbjct: 7 CYYSRTYA-ERSSSEGWMVEDAFGF-------PDDQPPVRMVFGCENGETGEIYRQLA-D 57
Query: 252 GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY 311
G+MG+G + S L G+I++ FS+CF G + GD +T + P+
Sbjct: 58 GIMGMGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYTPLLNNL 117
Query: 312 DAYFVGVESYCIG--------NSCLTQSGFQALVDSGASFTFLPTEIY 351
++ V I N+ + G+ ++DSG +FT+LPTE +
Sbjct: 118 HLHYYNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAF 165
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 66/287 (22%), Positives = 117/287 (40%), Gaps = 52/287 (18%)
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQ-----CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y ++IG P + + +D GS+ W+ C C C + Y + L
Sbjct: 40 YVTMNIGEPAEPYFLDIDTGSSFTWLECHAKDGPCKTCNKVPHPLYRLTRKKL------- 92
Query: 166 SSSSKNVSCSHPLCK-------SRSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILHLA 217
V C+ PLC + C + K+ C Y Y +S L+D
Sbjct: 93 ------VPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLD------ 140
Query: 218 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLI 273
K + + ++ GCG Q A DG++GLG G V + S L +G +
Sbjct: 141 ---KFSLPTGGARNIAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAV 197
Query: 274 -QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE----KYDAYFVGVESYCIGNSCL 328
+N C G +F G++ + T ++P+ + + Y G + + ++ +
Sbjct: 198 SKNVIGHCLSSKGGGYLFIGEENVPSSHVT-WVPMAPTTPGEPNHYSPGQATLHLDSNPI 256
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 375
+A+ DSG+++T+LP ++A+ LVS+ + SL +S K
Sbjct: 257 GTKPLKAIFDSGSTYTYLPENLHAQ-------LVSALKASLSKSSLK 296
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 134/331 (40%), Gaps = 46/331 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
+++ + +G P + LD GS++ W+ CQ C C Y D YDPS S+S
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADC-------YAQSD---PVYDPSVSTS 212
Query: 169 SKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
V C P C+ ++C++ C Y Y + + + G + L L S
Sbjct: 213 YATVGCDSPRCRDLDAAACRNSTGSCLYEVAYG-DGSYTVGDFATETLTLG-------DS 264
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+ S+V IGCG G ++ A + G L S PS ++ +FS C + DS
Sbjct: 265 APVSNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDS 316
Query: 287 GS---VFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ------ 334
S + FGD + PA P + Y+V + +G L+ S F
Sbjct: 317 PSSSTLQFGDSEQPAVTAPLIRSPRTNTF--YYVALSGISVGGEALSIPSSAFAMDDAGS 374
Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
+VDSG + T L + Y + F + S + + + CY+ + ++VP +
Sbjct: 375 GGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVA 434
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
L F + + P + G + C ++
Sbjct: 435 LWFEGGGELKLPAKNYLIPVDAAGTY-CLAF 464
>gi|91806508|gb|ABE65981.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 203
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 65/120 (54%), Gaps = 17/120 (14%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT + IGTP V +D GS+L+WV C C+ C PL N++ +DP +SS
Sbjct: 77 LYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSCVGC-PL---------HNVTFFDPGASS 126
Query: 168 SSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
S+ ++CS C S +S C SL + C Y +Y + + +SGY + D++ + S A
Sbjct: 127 SAVKLACSDKRCSSDLQKKSRC-SLLESCTYKVEYG-DGSVTSGYYISDLISFDTMSGVA 184
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 140/333 (42%), Gaps = 51/333 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
+++ + +G P F + LD GS++ W+ CQ C C Y D +DP SSSS
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDC-------YQQTD---PIFDPRSSSS 204
Query: 169 SKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
++ C C++ S C++ K C Y Y + + + G V + L + S
Sbjct: 205 FASLPCESQQCQALETSGCRASK--CLYQVSYG-DGSFTVGEFVIETLTFGN-------S 254
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN-- 284
+ ++V +GCG G + V GL + SL + + +SFS C +
Sbjct: 255 GMINNVAVGCGHDNEGLF--------VGSAGLLGLGGGSLSLTSQMKASSFSYCLVDRDS 306
Query: 285 -DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT---------QSGFQ 334
S + F P+ + L G+ Y+VG+ +G L+ SG+
Sbjct: 307 SSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYG 366
Query: 335 A-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLKVPD 390
+VDSG + T L T+ Y + D VS + N + CY+ SS+ + +P
Sbjct: 367 GIIVDSGTAITRLQTQAYNTLR---DAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPT 423
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
+ F+ +S + + P + VG CF++
Sbjct: 424 VSFEFAGGKSLQLPPKNYLIPVDSVGTF-CFAF 455
>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
Length = 410
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 68/272 (25%), Positives = 109/272 (40%), Gaps = 34/272 (12%)
Query: 93 PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
PS GN + H+ ++I P + + +D GS L W+ C CI C +
Sbjct: 20 PSSAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHG 79
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCK-SLKDPCPYIADYSTEDT 203
Y V C+ C R K K+ C Y Y
Sbjct: 80 LY-------------KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GG 124
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVS 262
SS G L+ D SFS A + +S+ GCG Q + + P +G++GLG G V+
Sbjct: 125 SSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVT 179
Query: 263 VPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY--FVGVE 319
+ S L G+I ++ C G +FFGD T T + P+ ++ Y G
Sbjct: 180 LLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTL 238
Query: 320 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIY 351
+ + ++ + + + DSGA++T+ + Y
Sbjct: 239 HFNSNSKPISAAPMEVIFDSGATYTYFALQPY 270
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/306 (26%), Positives = 133/306 (43%), Gaps = 53/306 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP ++ +D GS+L+W C+ C QC D+ +DP SSS +
Sbjct: 104 LAIGTPPETYSAIMDTGSDLIWTQCKPCTQC----------FDQPSPIFDPKKSSSFSKL 153
Query: 173 SCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
SCS LCK+ +SSC D C Y+ Y + +S+ G + + S
Sbjct: 154 SCSSQLCKALPQSSC---SDSCEYLYTYG-DYSSTQGTMATETFTFGKVSI--------P 201
Query: 231 SVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDS 286
+V GCG G DG G++GLG G +S+ S L +A FS C D+ +
Sbjct: 202 NVGFGCGEDNEG---DGFTQGSGLVGLGRGPLSLVSQLKEA-----KFSYCLTSIDDTKT 253
Query: 287 GSVFFG-----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ----- 334
++ G + A ++T + + Y++ +E +G + L +S FQ
Sbjct: 254 STLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDG 313
Query: 335 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPD 390
++DSG + T+L + V +F + + + CYN S+ L+VP
Sbjct: 314 TGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNLPSDTSELEVPK 373
Query: 391 MRLIFS 396
+ L F+
Sbjct: 374 LVLHFT 379
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 79/305 (25%), Positives = 125/305 (40%), Gaps = 46/305 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP + LVA+D ++ WVPC C+ CAP ++S +DP+ SS+ + V C
Sbjct: 106 LGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASS---------PSFDPTQSSTYRPVRC 156
Query: 175 SHPLC----KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
P C + SC + C + Y++ + L D L L+ + A
Sbjct: 157 GAPQCAQVPPATPSCPAGPGASCAFNLSYASSTLHA--VLGQDALSLSDSNGAA---VPD 211
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FSICFD----EN 284
GC R TGS P G++G G G + S L++ S FS C N
Sbjct: 212 DHYTFGCLRVVTGSG-GSVPPQGLVGFGRGPL---SFLSQTKATYGSIFSYCLPSYKSSN 267
Query: 285 DSGSVFFGDQG-PATQQSTSFLP------------IGEKYDAYFVGVESYCIGNSCLTQS 331
SG++ G G P ++T L +G + + V + + + T
Sbjct: 268 FSGTLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGR 327
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 391
G +VD+G FT L YA + F + VS+ G + CY + + VP +
Sbjct: 328 G-GTIVDAGTMFTRLSPPAYAALRNAFRRGVSAPAAPALGG-FDTCYYVNGTK--SVPAV 383
Query: 392 RLIFS 396
+F+
Sbjct: 384 AFVFA 388
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 84/375 (22%), Positives = 158/375 (42%), Gaps = 46/375 (12%)
Query: 66 LSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLV 125
+ D KR + + S+ ++++ ++ GS NQ ++ I +G+P S +
Sbjct: 1 MHRDVKRVASLIHRLSSGSAAKYEV--EDFGSDVVSGMNQGSGEYFVRIGLGSPPRSQYM 58
Query: 126 ALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS 184
+D+GS+++WV C+ C QC Y D +DP+ S+S VSCS +C +
Sbjct: 59 VIDSGSDIVWVQCKPCTQC-------YHQTD---PLFDPADSASFMGVSCSSAVCDRVEN 108
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 244
C Y Y + + + G L L +F + +V +V IGCG G +
Sbjct: 109 AGCNSGRCRYEVSYG-DGSYTKGTLA---LETLTFGR-----TVVRNVAIGCGHSNRGMF 159
Query: 245 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQS 301
+ A G+ S+ + +G N+FS C N +G + FG + A
Sbjct: 160 VGAAGLLGLG-----GGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSE--AMPVG 212
Query: 302 TSFLPIGEKYDA---YFVGVESYCIGNS--CLTQSGFQ--------ALVDSGASFTFLPT 348
+++P+ A Y++ + +G++ +++ FQ ++D+G + T PT
Sbjct: 213 AAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPT 272
Query: 349 EIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
Y F + + + + + CYN ++VP + FS + + F
Sbjct: 273 VAYEAFRNAFIEQTQNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNF 332
Query: 409 SFPENEVGDHACFSY 423
P ++ G CF++
Sbjct: 333 LIPVDDAGTF-CFAF 346
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 139/331 (41%), Gaps = 44/331 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++ I +G+P S + +D+GS+++WV C+ C QC Y D +DP+ S+S
Sbjct: 43 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQC-------YHQTD---PLFDPADSAS 92
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
VSCS +C + C Y Y + +S+ G L + L L +V
Sbjct: 93 FMGVSCSSAVCDQVDNAGCNSGRCRYEVSYG-DGSSTKGTLALETLTLG--------RTV 143
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE---ND 285
+V IGCG G ++ A G+ G + V L + G N+FS C N
Sbjct: 144 VQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVG--QLSRERG---NAFSYCLVSRVTNS 198
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC---------LTQSGF 333
+G + FG + A +++P+ + Y++G+ +G+ LT+ G
Sbjct: 199 NGFLEFGSE--AMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGN 256
Query: 334 QALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
+V D+G + T PT Y F + + + + CYN ++VP +
Sbjct: 257 GGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVS 316
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
FS + + F P ++ G CF++
Sbjct: 317 FYFSGGPILTLPANNFLIPVDDAGTF-CFAF 346
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 86/365 (23%), Positives = 152/365 (41%), Gaps = 45/365 (12%)
Query: 77 VKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLW 135
++L +N++ R++ L S H + +YT + IGTP F + +D S +
Sbjct: 3 LELVANSHRRRDRELLGSARMDLH--DDLLTKGYYTSRVKIGTPPHEFSLIVDRSS---F 57
Query: 136 VPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYI 195
V + + C S++ D S P+ SSS K + C + C + S K Y
Sbjct: 58 VSPKTMFC-----SFFFLQDPRFS---PALSSSYKPLECGNE-CSTGFCDGSRK----YQ 104
Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMG 255
Y+ E ++SSG L D++ ++ S Q ++ GC +TG D A DG++G
Sbjct: 105 RQYA-EKSTSSGVLGKDVISFSNSSDLGGQR-----LVFGCETAETGDLYDQTA-DGIIG 157
Query: 256 LGLGDVSVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYD 312
LG G +S+ L + +++ FS+C+ DE + G Q P TS P Y
Sbjct: 158 LGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTSSDPHRSPY- 216
Query: 313 AYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 366
Y + ++ +G S L + ++DSG ++ + P + + V S +
Sbjct: 217 -YNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSLK 275
Query: 367 ISLQGNSWKY---CYNASSEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEVGDHA 419
+ G K+ CY + + + P + +F QS + + F ++
Sbjct: 276 -EVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISGAY 334
Query: 420 CFSYF 424
C F
Sbjct: 335 CLGVF 339
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 114/302 (37%), Gaps = 51/302 (16%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +GTP F V D GS+ WV QC P A Y + + P+ S++ N+S
Sbjct: 169 IRLGTPAARFTVVFDTGSDTTWV-----QCQPCVAYCY---QQKEPLFTPTKSATYANIS 220
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ C + C Y Y + + + G+ D L L
Sbjct: 221 CTSSYCSDLDTRGCSGGHCLYAVQYG-DGSYTVGFYAQDTLTLG--------YDTVKDFR 271
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG- 292
GCG K G + A G+MGLG G SVP + F+ C SG+ F
Sbjct: 272 FGCGEKNRGLFGKAA---GLMGLGRGKTSVP--VQAYDKYSGVFAYCIPATSSGTGFLDF 326
Query: 293 DQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLTQ-----SGFQALVDSGASF 343
G + P+ G + Y+VG+ +G L+ S ALVDSG
Sbjct: 327 GPGAPAAANARLTPMLVDNGPTF--YYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVI 384
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---------YCYNASS-EEMLKVPDMRL 393
T LP Y + F K ++G +K CY+ + + + +P + L
Sbjct: 385 TRLPPSAYEPLRSAFAK-------GMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSL 437
Query: 394 IF 395
+F
Sbjct: 438 VF 439
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 89/374 (23%), Positives = 146/374 (39%), Gaps = 64/374 (17%)
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
+L++ TR K + N ++ + P G Q N + +GTP + L
Sbjct: 64 MLTSGAGPLTTRAKPKPKNRANPPVPIAP--GRQILSIPN-----YIARAGLGTPAQTLL 116
Query: 125 VALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---K 180
VA+D ++ WVPC C CA S S + P+ SS+ + V C P C
Sbjct: 117 VAIDPSNDAAWVPCSACAGCAASSPS-----------FSPTQSSTYRTVPCGSPQCAQVP 165
Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
S S + C + Y+ ++ L D L L +++V S GC R
Sbjct: 166 SPSCPAGVGSSCGFNLTYAA--STFQAVLGQDSLAL--------ENNVVVSYTFGCLRVV 215
Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVFFGDQG- 295
+G+ + P G++G G G +S L + FS C N SG++ G G
Sbjct: 216 SGNSVP---PQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQ 270
Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTF 345
P ++T L + Y+V + +G+ + +G ++D+G FT
Sbjct: 271 PKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTR 330
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRN 405
L +YA V F V + G + CYN + + VP + +F+ +
Sbjct: 331 LAAPVYAAVRDAFRGRVRTPVAPPLGG-FDTCYNVT----VSVPTVTFMFAGAVA----- 380
Query: 406 HIFSFPENEVGDHA 419
+ PE V H+
Sbjct: 381 --VTLPEENVMIHS 392
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 101/248 (40%), Gaps = 39/248 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +GTP F V D GS+ WV QC P S Y DR +DP+ SS+ NVS
Sbjct: 167 IGLGTPPSRFTVVFDTGSDTTWV-----QCRPCVVSCYKQKDR---LFDPAKSSTYANVS 218
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C + C Y Y + + + G+ D L +A Q +++
Sbjct: 219 CADPACADLDASGCNAGHCLYGIQYG-DGSYTVGFFAKDTLAVA-------QDAIK-GFK 269
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVFF- 291
GCG K G + A G++GLG G S+ K G SFS C + + + +
Sbjct: 270 FGCGEKNRGLFGQTA---GLLGLGRGPTSITVQAYEKYG---GSFSYCLPASSAATGYLE 323
Query: 292 ----GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN--------SCLTQSGFQALVDS 339
+ T+ + + Y+VG+ +G S + SG LVDS
Sbjct: 324 FGPLSPSSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNSG--TLVDS 381
Query: 340 GASFTFLP 347
G T LP
Sbjct: 382 GTVITRLP 389
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 76/167 (45%), Gaps = 23/167 (13%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I IGTP + D GS+L W QC P S Y+ + +++PSSSSS NVS
Sbjct: 138 IGIGTPKHDISLMFDTGSDLTWT-----QCEPCLGSCYSQKE---PKFNPSSSSSYHNVS 189
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
CS P+C + SC + C Y Y + + + G+L + L + S V +
Sbjct: 190 CSSPMCGNPESCSA--SNCLYGIGYG-DGSVTVGFLAKEKFTLTN-------SDVLDDIY 239
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 280
GCG G ++ A G++GLG G S P L N FS C
Sbjct: 240 FGCGENNKGVFIGSA---GILGLGPGKFSFP--LQTTTTYNNIFSYC 281
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 89/374 (23%), Positives = 146/374 (39%), Gaps = 64/374 (17%)
Query: 65 LLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFL 124
+L++ TR K + N ++ + P G Q N + +GTP + L
Sbjct: 45 MLTSGAGPLTTRAKPKPKNRANPPVPIAP--GRQILSIPN-----YIARAGLGTPAQTLL 97
Query: 125 VALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC---K 180
VA+D ++ WVPC C CA S S + P+ SS+ + V C P C
Sbjct: 98 VAIDPSNDAAWVPCSACAGCAASSPS-----------FSPTQSSTYRTVPCGSPQCAQVP 146
Query: 181 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 240
S S + C + Y+ ++ L D L L +++V S GC R
Sbjct: 147 SPSCPAGVGSSCGFNLTYAA--STFQAVLGQDSLAL--------ENNVVVSYTFGCLRVV 196
Query: 241 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVFFGDQG- 295
+G+ + P G++G G G +S L + FS C N SG++ G G
Sbjct: 197 SGNSVP---PQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQ 251
Query: 296 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTF 345
P ++T L + Y+V + +G+ + +G ++D+G FT
Sbjct: 252 PKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTR 311
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRN 405
L +YA V F V + G + CYN + + VP + +F+ +
Sbjct: 312 LAAPVYAAVRDAFRGRVRTPVAPPLGG-FDTCYNVT----VSVPTVTFMFAGAVA----- 361
Query: 406 HIFSFPENEVGDHA 419
+ PE V H+
Sbjct: 362 --VTLPEENVMIHS 373
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 82/308 (26%), Positives = 127/308 (41%), Gaps = 57/308 (18%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IG P + L +D GS+L WV C C C+ +++ +DPS SS+ N+SC
Sbjct: 99 IGEPPIPQLAVMDTGSSLTWVMCHPCSSCS----------QQSVPIFDPSKSSTYSNLSC 148
Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
S C + C + CPY +Y SS G + L L + + + S+I
Sbjct: 149 SE--C---NKCDVVNGECPYSVEY-VGSGSSQGIYAREQLTLETIDESIIKV---PSLIF 199
Query: 235 GCGRK----QTGSYLDGAAPDGVMGLGLGDVS-VPSLLAKAGLIQNSFSICFDENDSGSV 289
GCGRK G G +GV GLG G S +PS K FS C + +
Sbjct: 200 GCGRKFSISSNGYPYQGI--NGVFGLGSGRFSLLPSFGKK-------FSYCIGNLRNTNY 250
Query: 290 FF-----GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSGF 333
F GD+ ST+ I Y+V +E+ IG L T +
Sbjct: 251 KFNRLVLGDKANMQGDSTTLNVIN---GLYYVNLEAISIGGRKLDIDPTLFERSITDNNS 307
Query: 334 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKYCYNA-SSEEMLKVP 389
++DSGA T+L + + + + L+ + Q N + CY+ S+++ P
Sbjct: 308 GVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFP 367
Query: 390 DMRLIFSK 397
+ F++
Sbjct: 368 LVTFHFAE 375
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 83/301 (27%), Positives = 128/301 (42%), Gaps = 34/301 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + D GS+L WV QC P +S + ++ +DPS SS+ V
Sbjct: 153 VGLGTPAQPSALIFDTGSDLSWV-----QCQPCGSSGHCHPQQD-PLFDPSKSSTYAAVH 206
Query: 174 CSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
C P C + C C Y+ Y + +S++G L D L L S S +
Sbjct: 207 CGEPQCAAAGGLCSEDNTTCLYLVHYG-DGSSTTGVLSRDTLALTS-------SRALAGF 258
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFG 292
GCG + G + DG++GLG G++S+PS A + FS C ++S + +
Sbjct: 259 PFGCGTRNLGDF---GRVDGLLGLGRGELSLPSQAAAS--FGAVFSYCLPSSNSTTGYLT 313
Query: 293 -DQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCL-------TQSGFQALVDSG 340
PAT Q T+ L + YFV + S IG L T+ G L+DSG
Sbjct: 314 IGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTRGG--TLLDSG 371
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
T+LP + Y + +F + + + CY+ + E + VP + F
Sbjct: 372 TVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRFGDGAV 431
Query: 401 F 401
F
Sbjct: 432 F 432
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 150/379 (39%), Gaps = 92/379 (24%)
Query: 72 RQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGS 131
R + R + + S L+P H +G + + +GTP V LD GS
Sbjct: 62 RPRPRSRQGTAPPPSVRASLYP------HSYGGYAFT-----VSLGTPPQPLPVLLDTGS 110
Query: 132 NLLWVPC----QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL--------- 178
+L WVPC QC C+ LSA+ L + P +SSSS+ + C +P
Sbjct: 111 HLSWVPCTSSYQCRNCSSLSAA------SPLHVFHPKNSSSSRLIGCRNPSCLWIHSPDH 164
Query: 179 ---CKSRSSCK---------SLKDPC-PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
C++ SSC + + C PY+ Y + T +G L+ D L P
Sbjct: 165 LSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGST--AGLLISDTL-------RTPG 215
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----- 280
+V+ + +IGC P G+ G G G SVPS L GL + FS C
Sbjct: 216 RAVR-NFVIGCSLASVHQ-----PPSGLAGFGRGAPSVPSQL---GLTK--FSYCLLSRR 264
Query: 281 FDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDA-------YFVGVESYCIGNSC--L 328
FD+N + S + G G + P+ A Y++ + + +G L
Sbjct: 265 FDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKSVQL 324
Query: 329 TQSGF-------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSK----RISLQGNSWKYC 377
+ F A+VDSG +F++ ++ V V + ++ +G C
Sbjct: 325 PERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLSPC 384
Query: 378 YN-ASSEEMLKVPDMRLIF 395
+ + +++P+M L F
Sbjct: 385 FAMPPGTKTMELPEMSLHF 403
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 104/432 (24%), Positives = 179/432 (41%), Gaps = 78/432 (18%)
Query: 24 FSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVK-LQSN 82
++KL+HR + ++ V + +S+E + L ++++K L+S
Sbjct: 38 LATKLIHR--NSYLHPLYDQNETVEDRSKREQTSSIERFDFL--------ESKIKELKSV 87
Query: 83 NNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-QCI 141
N +R+ L+ + GS F N + IG+P V+ LV +D GS+LLWV C CI
Sbjct: 88 GNEARSSLIPFNRGSG--FLVN---------LSIGSPPVTQLVVVDTGSSLLWVQCLPCI 136
Query: 142 QCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK-DPCPYIADYST 200
C S S+ +DP S S K + C P + K + + Y Y
Sbjct: 137 NCFQQSTSW----------FDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLG 186
Query: 201 EDTSSSGYLVDDILHLAS------FSKHAPQSSV----QSSVIIGCGRKQTGSYLDGAAP 250
D SS G L + L + F +A + + +S++ GCG + D A
Sbjct: 187 GD-SSQGILAKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAY- 244
Query: 251 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-----GSVFFGDQGPATQQSTSFL 305
+GV GLG + P + A + N FS C + ++ + G QG + ++
Sbjct: 245 NGVFGLG----AYPH-ITMATQLGNKFSYCIGDINNPLYTHNHLVLG-QGSYIEGDST-- 296
Query: 306 PIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTE----IY 351
P+ + Y+V ++S +G+ L + F+ L+DSG ++T L +Y
Sbjct: 297 PLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLY 356
Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNA-SSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
E+V L+ +RI Q C+ S +++ P + F+ V+ + S
Sbjct: 357 DEIVDLMKGLL--ERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESG--SL 412
Query: 411 PENEVGDHACFS 422
GD C +
Sbjct: 413 FRQHGGDRFCLA 424
>gi|240255485|ref|NP_189841.4| aspartyl protease family protein [Arabidopsis thaliana]
gi|332644216|gb|AEE77737.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 430
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 64/117 (54%), Gaps = 17/117 (14%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT + IGTP V +D GS+L+WV C C+ C PL N++ +DP +SS
Sbjct: 77 LYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSCVGC-PL---------HNVTFFDPGASS 126
Query: 168 SSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
S+ ++CS C S +S C SL + C Y +Y + + +SGY + D++ + S
Sbjct: 127 SAVKLACSDKRCSSDLQKKSRC-SLLESCTYKVEYG-DGSVTSGYYISDLISFDTMS 181
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 129/311 (41%), Gaps = 48/311 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
I IG P V L D GS+L+WV CQ C C +N +DP SSS +NV
Sbjct: 97 ISIGNPQVEILAIADTGSDLIWVQCQPCEMC----------YKQNSPIFDPRRSSSYRNV 146
Query: 173 SCSHPLC-----KSRS-SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
C + C ++RS + C Y Y + + S G+L + + S + + +
Sbjct: 147 LCGNEFCNKLDGEARSCDARGFVKTCGYTYSYG-DQSFSDGHLAIERFGIGSTNSNTSAA 205
Query: 227 -SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICF--- 281
+ V GCG K G++ + G+ SL+++ G + FS C
Sbjct: 206 IAYFQEVAFGCGTKNGGTF-----DELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPT 260
Query: 282 --DENDSGSVFFGDQGPATQQ-----STSFLPIG-EKYDAYFVGVESYCIGNSCLTQSGF 333
N + + FG+ + ST LP E Y Y++ +E+ + N L +
Sbjct: 261 SEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKPETY--YYLTLEAISVENKRLPYTNL 318
Query: 334 --------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 385
++DSG + TFL +E + + ++ V +R+S + C+ E+
Sbjct: 319 WNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNICF--KDEKA 376
Query: 386 LKVPDMRLIFS 396
+++P + F+
Sbjct: 377 IELPIITAHFT 387
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 63/227 (27%), Positives = 95/227 (41%), Gaps = 36/227 (15%)
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQT---HFFGNQFYWLHYTWIDIGT 118
+ L L++ ++ V+ + + R S G T H+ G Y Y IG
Sbjct: 23 IRLELTHVDAKEHYTVEERVRRATERTHRRLASMGGVTAPIHWGGQSQYIAEYL---IGD 79
Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
P +D GSNL+W QC +C P T +NL YDPS S +++ V C+
Sbjct: 80 PPQRAEAIIDTGSNLIWT--QCSRCRP------TCFRQNLPYYDPSRSRAARAVGCNDAA 131
Query: 179 CK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
C S + C S C + Y + +G L + L S + S++ GC
Sbjct: 132 CALGSETQCLSDNKTCAVVTGYGAGNI--AGTLATENLTFQSETV---------SLVFGC 180
Query: 237 --GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
K + L+GA+ G++GLG G +S+PS L FS C
Sbjct: 181 IVVTKLSPGSLNGAS--GIIGLGRGKLSLPSQLG-----DTRFSYCL 220
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 123/294 (41%), Gaps = 34/294 (11%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + V D GS+ WV QC P Y ++ +DP SS+ NVS
Sbjct: 182 VGLGTPASRYTVVFDTGSDTTWV-----QCQPCVVVCYEQQEK---LFDPVRSSTYANVS 233
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 234 CAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY-------DAVKGFR 285
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSICFDENDSGSVFFG 292
GCG + G + + A G++GLG G S+P K G + F+ C +G+ +
Sbjct: 286 FGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPARSTGTGYLD 339
Query: 293 DQGPATQQSTSFLPIGEKYDA----YFVGVESYCIGNSCLT--QSGFQ---ALVDSGASF 343
+ +++ L D Y++G+ +G L+ QS F +VDSG
Sbjct: 340 FGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVI 399
Query: 344 TFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
T LP Y+ + F ++++ + + + CY+ + + +P + L+F
Sbjct: 400 TRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLF 453
>gi|68071623|ref|XP_677725.1| aspartyl (acid) protease [Plasmodium berghei strain ANKA]
gi|56497949|emb|CAH98861.1| aspartyl (acid) protease, putative [Plasmodium berghei]
Length = 518
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 74/319 (23%), Positives = 132/319 (41%), Gaps = 62/319 (19%)
Query: 82 NNNSSRNQLLFPSEGSQTHFFGN-QFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC-Q 139
N NS +N+ ++ + +G+ Y ++ I+IGTP + +D GS+ L PC +
Sbjct: 31 NKNSEKNEEIY-----KYKLYGDIDEYAYYFMDINIGTPGQKLSLIVDTGSSSLSFPCSE 85
Query: 140 CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYS 199
C C + ++ ++SS+S + C+ +C C +K C Y+ Y
Sbjct: 86 CKDCGVHME----------NPFNLNNSSTSSILYCNDNICPYNLKC--VKGRCEYLQSY- 132
Query: 200 TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 259
E + +G+ DI+ L S + + ++ +GC + G +L A GV+GL L
Sbjct: 133 CEGSRINGFYFSDIVRLES-NNNTKNGNITFKKHMGCHMHEEGLFLHQHAT-GVLGLSLT 190
Query: 260 D-VSVPS----LLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA- 313
VP+ L + + FS+C E G + G G + + I EK D
Sbjct: 191 KPKGVPTFIDLLFKSSPKLNKIFSLCISEY-GGELILG--GYSKDYIVKEVSIDEKKDNI 247
Query: 314 -----------------------------YFVGVESYCIGNSCLTQSG--FQALVDSGAS 342
Y++ V+ + + + + + + LVDSG++
Sbjct: 248 EHNKNENINSINKSIVDGILWEAITRKYYYYIRVKGFQLFGTTFSHNNKSMEMLVDSGST 307
Query: 343 FTFLPTEIYAEVVVKFDKL 361
FT LP ++Y + FD L
Sbjct: 308 FTHLPDDLYNNLNFFFDIL 326
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 74/298 (24%), Positives = 122/298 (40%), Gaps = 49/298 (16%)
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+GTP + L+ALD + W+PC+ C+ C S++ + ++ S++ K +
Sbjct: 40 KVGTPPQTLLMALDNSYDAAWIPCKGCVGC---SSTVFNTVK----------STTFKTLG 86
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C P CK + C + Y + S+ L D + L+ P +
Sbjct: 87 CGAPQCKQVPNPICGGSTCTWNTTYGSSTILSN--LTRDTIALS--MDPVPYYA------ 136
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSV 289
GC +K TGS + P G++G G G +S L L +++FS C N SGS+
Sbjct: 137 FGCIQKATGSSVP---PQGLLGFGRGPLSF--LSQTQNLYKSTFSYCLPSFRTLNFSGSL 191
Query: 290 FFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVD 338
G G P ++T L + Y+V + +G + +G + D
Sbjct: 192 RLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFD 251
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
SG FT L Y V +F K V + +S G + CY+ + P + +FS
Sbjct: 252 SGTVFTRLVAPAYIAVRNEFRKRVGNATVSSLGG-FDTCYSVP----IVPPTITFMFS 304
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 143/362 (39%), Gaps = 50/362 (13%)
Query: 56 KNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWID 115
K + +L L D KR + V L + N S + S Q ++T I
Sbjct: 76 KTPEQLFQLRLQRDAKRVEGVVALAALNQSHARRSGSSFSSSIISGLA-QGSGEYFTRIG 134
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+GTP + LD GS+++W +QCAP YT D +DP+ S + + C
Sbjct: 135 VGTPARYVYMVLDTGSDVVW-----LQCAPCRKC-YTQAD---PVFDPTKSRTYAGIPCG 185
Query: 176 HPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP--QSSVQSS 231
PLC+ S C + C Y Y D FS + + +
Sbjct: 186 APLCRRLDSPGCNNKNKVCQYQVSYG-----------DGSFTFGDFSTETLTFRRTRVTR 234
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS----G 287
V +GCG G ++ A ++GLG G +S P + FS C + +
Sbjct: 235 VALGCGHDNEGLFIGAAG---LLGLGRGRLSFPVQTGRR--FNQKFSYCLVDRSASAKPS 289
Query: 288 SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CLTQSGFQ------- 334
SV FGD A ++ F P+ K D Y++ + +G S L+ S F+
Sbjct: 290 SVVFGDS--AVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNG 347
Query: 335 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
++DSG S T L Y + F S + + + + + C++ S +KVP + L
Sbjct: 348 GVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVL 407
Query: 394 IF 395
F
Sbjct: 408 HF 409
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 83/339 (24%), Positives = 144/339 (42%), Gaps = 58/339 (17%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
++T I +GTP + LD GS++ W+ C+ C +C Y+ D ++PS S+S
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCREC-------YSQAD---PIFNPSYSAS 206
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
V C +C + C Y A Y + + S+G + L + S
Sbjct: 207 FSTVGCDSAVCSQLDAYDCHSGGCLYEASYG-DGSYSTGSFATETLTFGTTSV------- 258
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DEND 285
++V IGCG K G ++ A ++GLG G +S P+ + ++FS C + +
Sbjct: 259 -ANVAIGCGHKNVGLFIGAAG---LLGLGAGALSFPNQIGTQ--TGHTFSYCLVDRESDS 312
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT------------- 329
SG + FG + + + F P+ + Y++ V + +G + L
Sbjct: 313 SGPLQFGPK--SVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETS 370
Query: 330 -QSGFQALVDSGASFTFLPTEIYAEV----VVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
GF ++DSG T L T Y V V +L + +S+ + CY+ S +
Sbjct: 371 GHGGF--IIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSI----FDTCYDLSGLQ 424
Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
+ VP + FS S ++ + P + VG CF++
Sbjct: 425 FVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTF-CFAF 462
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 138/343 (40%), Gaps = 62/343 (18%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IG+ + +D GS + V C R+ +DP++S S + V
Sbjct: 3 LGIGSLQKNLSAIIDTGSEAVLVQCG---------------SRSRPVFDPAASQSYRQVP 47
Query: 174 CSHPLC---------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
C LC S C + C Y Y + +S+G D++ L S + +
Sbjct: 48 CISQLCLAVQQQTSNGSSQPCVNSSAACTYSLSYG-DSRNSTGDFSQDVIFLNS-TNSSS 105
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--- 281
Q+ V GC G +D G++G G++S+PS L K L + FS CF
Sbjct: 106 QAVQFRDVAFGCAHSPQGFLVD-LGSLGIVGFNRGNLSLPSQL-KDRLGGSKFSYCFPSQ 163
Query: 282 --DENDSGSVFFGDQGPATQQSTSFLPIGE------KYDAYFVGVESYCIGNSCLT--QS 331
+G +F GD G ++ S+ P+ + + Y+VG+ S + L +S
Sbjct: 164 PWQPRATGVIFLGDSG-LSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPES 222
Query: 332 GFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ-----GNSWKYC 377
F+ ++DSG +FT + + Y F +S R L+ + C
Sbjct: 223 AFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAF---AASNRSGLRKKVGAAAGFDDC 279
Query: 378 YNASSEEMLK-VPDMRLIFSKNQSFVVR-NHIFSFPENEVGDH 418
YN S+ L VP++RL N +R H+F P + G+
Sbjct: 280 YNISAGSSLPGVPEVRLSLQNNVRLELRFEHLF-VPVSAAGNE 321
>gi|7413629|emb|CAB85978.1| putative protein [Arabidopsis thaliana]
Length = 356
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 64/117 (54%), Gaps = 17/117 (14%)
Query: 109 LHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSS 167
L+YT + IGTP V +D GS+L+WV C C+ C PL N++ +DP +SS
Sbjct: 77 LYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSCVGC-PL---------HNVTFFDPGASS 126
Query: 168 SSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 220
S+ ++CS C S +S C SL + C Y +Y + + +SGY + D++ + S
Sbjct: 127 SAVKLACSDKRCSSDLQKKSRC-SLLESCTYKVEYG-DGSVTSGYYISDLISFDTMS 181
>gi|388516731|gb|AFK46427.1| unknown [Medicago truncatula]
Length = 435
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 86/333 (25%), Positives = 132/333 (39%), Gaps = 63/333 (18%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I+ TP V V +D G LWV C+ + Y S S Y P+ S++
Sbjct: 51 INQRTPLVPLNVIVDLGGQFLWVDCE---------NKYIS-----STYRPARCRSAQ--- 93
Query: 174 CSHPLCKSRSSCKSLKDP------CPYIADYSTEDTSSSGYLVDDILHLASFSKHAP-QS 226
CS C S P C D S T++SG L +D+L + S + P Q+
Sbjct: 94 CSLANSDGCGDCFSSPKPGCNNNTCGVTPDNSITHTATSGELAEDVLSIQSSNGFNPGQN 153
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
V S + C L A G+ GLG +++PS LA A F+IC +
Sbjct: 154 VVVSRFLFSCAPTFLLKGLATGA-SGMAGLGRTKIALPSQLASAFSFARKFAICLSSSK- 211
Query: 287 GSVFFGDQGPA--------TQQSTSFLPI-------------GEKYDAYFVGVESYCIGN 325
G V FGD GP S ++ P+ G+ YF+GV++ I
Sbjct: 212 GVVLFGD-GPYGFLPNVVFDSDSLTYTPLLINPVSTASAFSQGQPSAEYFIGVKTIKIDE 270
Query: 326 SCLTQSGFQALVDSGA----------SFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--S 373
++ + +D+ +T L IY V F K +++ I G+
Sbjct: 271 KVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFVKAPAARNIKRVGSVAP 330
Query: 374 WKYCYNASSEEML--KVPDMRLIFSKNQSFVVR 404
+++CY + L VP + L F +N++ V R
Sbjct: 331 FEFCYTNLTGTRLGAAVPTIEL-FLQNENVVWR 362
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 87/186 (46%), Gaps = 26/186 (13%)
Query: 110 HYTWI---DIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
HY ++ IGTP V D GS+L+W+ QCI C + Y L+ +D SS
Sbjct: 56 HYDYLMELSIGTPPVKIYAQADTGSDLIWL--QCIPC----TNCYKQLN---PMFDSQSS 106
Query: 167 SSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLASFSKHA 223
S+ N++C C +SC + C Y +YS D S + G L + L L S +
Sbjct: 107 STFSNIACGSESCSKLYSTSCSPDQINCKY--NYSYVDGSETQGVLAQETLTLTSTTG-- 162
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC--- 280
+ VI GCG G++ D G++GLG G +S+ S + + L N FS C
Sbjct: 163 -EPVAFKGVIFGCGHNNNGAFNDKEM--GIIGLGRGPLSLVSQIGSS-LGGNMFSQCLVP 218
Query: 281 FDENDS 286
F+ N S
Sbjct: 219 FNTNPS 224
>gi|357440775|ref|XP_003590665.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
gi|355479713|gb|AES60916.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
Length = 435
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 86/333 (25%), Positives = 132/333 (39%), Gaps = 63/333 (18%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I+ TP V V +D G LWV C+ + Y S S Y P+ S++
Sbjct: 51 INQRTPLVPLNVIVDLGGQFLWVDCE---------NKYIS-----STYRPARCRSAQ--- 93
Query: 174 CSHPLCKSRSSCKSLKDP------CPYIADYSTEDTSSSGYLVDDILHLASFSKHAP-QS 226
CS C S P C D S T++SG L +D+L + S + P Q+
Sbjct: 94 CSLANSDGCGDCFSSPKPGCNNNTCGVTPDNSITHTATSGELAEDVLSIQSSNGFNPGQN 153
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
V S + C L A G+ GLG +++PS LA A F+IC +
Sbjct: 154 VVVSRFLFSCAPTFLLKGLATGA-SGMAGLGRTKIALPSQLASAFSFARKFAICLSSSK- 211
Query: 287 GSVFFGDQGPA--------TQQSTSFLPI-------------GEKYDAYFVGVESYCIGN 325
G V FGD GP S ++ P+ G+ YF+GV++ I
Sbjct: 212 GVVLFGD-GPYGFLPNVVFDSDSLTYTPLLINPVSTASAFSQGQPSAEYFIGVKTIKIDE 270
Query: 326 SCLTQSGFQALVDSGA----------SFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--S 373
++ + +D+ +T L IY V F K +++ I G+
Sbjct: 271 KVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFVKASAARNIKRVGSVAP 330
Query: 374 WKYCYNASSEEML--KVPDMRLIFSKNQSFVVR 404
+++CY + L VP + L F +N++ V R
Sbjct: 331 FEFCYTNLTGTRLGAAVPTIEL-FLQNENVVWR 362
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 151/379 (39%), Gaps = 92/379 (24%)
Query: 72 RQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGS 131
R + R + + S L+P H +G + + +GTP V LD GS
Sbjct: 62 RPRPRSRQGTAPPPSVRASLYP------HSYGGYAFT-----VSLGTPPQPLPVLLDTGS 110
Query: 132 NLLWVPC----QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL--------- 178
+L WVPC QC C+ LSA+ L + P +SSSS+ + C +P
Sbjct: 111 HLSWVPCTSSYQCRNCSSLSAA------SPLHVFHPKNSSSSRLIGCRNPSCLWIHSPDH 164
Query: 179 ---CKSRSSCK---------SLKDPC-PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
C++ SSC + + C PY+ Y + S++G L+ D L P
Sbjct: 165 LSDCRAASSCPGANCTPRNANANNVCPPYLVVYGS--GSTAGLLISDTL-------RTPG 215
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----- 280
+V+ + +IGC P G+ G G G SVPS L GL + FS C
Sbjct: 216 RAVR-NFVIGCSLASVHQ-----PPSGLAGFGRGAPSVPSQL---GLTK--FSYCLLSRR 264
Query: 281 FDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDA-------YFVGVESYCIGNSC--L 328
FD+N + S + G G + P+ A Y++ + + +G L
Sbjct: 265 FDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKSVQL 324
Query: 329 TQSGF-------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSK----RISLQGNSWKYC 377
+ F A+VDSG +F++ ++ V V + ++ +G C
Sbjct: 325 PERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLSPC 384
Query: 378 YN-ASSEEMLKVPDMRLIF 395
+ + +++P+M L F
Sbjct: 385 FAMPPGTKTMELPEMSLHF 403
>gi|291002744|gb|ADD71504.1| xyloglucanase inhibitor 2 [Humulus lupulus]
Length = 445
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 81/338 (23%), Positives = 133/338 (39%), Gaps = 64/338 (18%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ T I+ TP V + ++ G LWV C+ Y S S Y P+ S+
Sbjct: 47 YLTQINQRTPLVPVKLTVNLGGEFLWVDCE---------KGYVS-----STYKPARCRSA 92
Query: 170 K-----NVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
+ + SC + C + + C TS+SG L DI+ + S + P
Sbjct: 93 QCNLAGSKSCGECFDGPKPGCNN--NTCGLFPYNPFIRTSTSGELAQDIISIQSTNGSNP 150
Query: 225 QSSVQ-SSVIIGCGRKQTGSYLDGAAP--DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
V +VI CG L+G A G+ GLG +++PS A A + F++C
Sbjct: 151 SKVVSFPNVIFTCGST---FLLEGLASGVTGIAGLGRKKIALPSQFAAAFSFKRKFALCL 207
Query: 282 DEND--SGSVFFGDQGP-------ATQQSTSFLPI-------------GEKYDAYFVGVE 319
+ +G VFFGD GP Q+ + P+ GE YF+GV+
Sbjct: 208 SSSTRATGVVFFGD-GPYIMLPNKDVSQNLIYTPLILNPVSTAGASFEGEPSADYFIGVK 266
Query: 320 SYCI-GNSCLTQSGFQALVDSGAS---------FTFLPTEIYAEVVVKFDKLVSSKRISL 369
+ G + ++ G +T L T IY V+ F K V+
Sbjct: 267 GIKVNGEDVKLNTSLLSIAKDGTGGTKISTTQPYTSLETSIYKAVIGAFGKAVAKVPRVT 326
Query: 370 QGNSWKYCYNASSEEMLK----VPDMRLIFSKNQSFVV 403
++ C+N++S + VP + L+ N+++ +
Sbjct: 327 AVAPFELCFNSTSFSSTRVGPGVPQIDLVLPNNKAWTI 364
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 81/308 (26%), Positives = 130/308 (42%), Gaps = 33/308 (10%)
Query: 102 FGNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
G+ L Y + +GTP V+ V +D GS++ WV C P A + +
Sbjct: 118 LGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYA-------QTGAL 170
Query: 161 YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
+DP+ SS+ + VSC+ C + + C + C Y Y + ++++G D L L
Sbjct: 171 FDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYG-DGSTTNGTYSRDTLTL 229
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
+ S GC ++G + D DG+MGLG G S+ S A A NS
Sbjct: 230 SGASDAV------KGFQFGCSHVESG-FSD--QTDGLMGLGGGAQSLVSQTAAA--YGNS 278
Query: 277 FSICFDENDSGS----VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQ 330
FS C SGS G G + +T L + Y ++ +G L+
Sbjct: 279 FSYCLPPT-SGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSP 337
Query: 331 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
S F A +VDSG T LP Y+ + F + R + + C++ + + + +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397
Query: 389 PDMRLIFS 396
P + L+FS
Sbjct: 398 PTVALVFS 405
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 93/388 (23%), Positives = 160/388 (41%), Gaps = 72/388 (18%)
Query: 38 ERWISK---SGNVSVADSWPKKNSVEYLELLLSNDWKR---QKTRVKLQSNNNSSRNQLL 91
+R +SK G+V P + + + +EL + + R + R++ +N+ +
Sbjct: 33 QRLVSKLIHPGSVHHPHYKPNETAKDRMELDIQHSAARFAYIQARIEGSLVSNNEYKARV 92
Query: 92 FPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASY 150
PS +T N I IG P + LV +D GS++LWV C C C
Sbjct: 93 SPSLTGRT-IMAN---------ISIGQPPIPQLVVMDTGSDILWVMCTPCTNC------- 135
Query: 151 YTSLDRNLS-EYDPSSSSSSKNVSCSHPLCKSRSSCK--SLKDPCPYIADYSTEDTSSSG 207
D +L +DPS SS+ PLCK+ K S DP P+ Y+ T+S
Sbjct: 136 ----DNHLGLLFDPSMSSTFS------PLCKTPCDFKGCSRCDPIPFTVTYADNSTASGM 185
Query: 208 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
+ D ++ F +S V+ GCG G D +G++GL G P L
Sbjct: 186 FGRDTVV----FETTDEGTSRIPDVLFGCGH-NIGQDTD-PGHNGILGLNNG----PDSL 235
Query: 268 AKAGLIQNSFSICFDE-----NDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESY 321
A I FS C + + + G+ ST F E ++ Y+V +E
Sbjct: 236 ATK--IGQKFSYCIGDLADPYYNYHQLILGEGADLEGYSTPF----EVHNGFYYVTMEGI 289
Query: 322 CIGNSCL--TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISL 369
+G L F+ ++D+G++ TFL ++ + + L+ S ++ ++
Sbjct: 290 SVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTI 349
Query: 370 QGNSWKYC-YNASSEEMLKVPDMRLIFS 396
+ + W C Y + S +++ P + F+
Sbjct: 350 EKSPWMQCFYGSISRDLVGFPVVTFHFA 377
>gi|213998824|gb|ACJ60779.1| nucellin [Hordeum chilense]
Length = 140
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 66/135 (48%), Gaps = 4/135 (2%)
Query: 232 VIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSV 289
+ GCG KQ +P DG++GLG+G + L +I N C G +
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 60
Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPT 348
+FGD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T +P
Sbjct: 61 YFGDFNPPSRGVT-WVPMKESXXYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 119
Query: 349 EIYAEVVVKFDKLVS 363
+IY E+V K +S
Sbjct: 120 QIYNEIVSKVRGTLS 134
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 118/270 (43%), Gaps = 44/270 (16%)
Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
GN + H+T + IG P F + +D GS+L WV C C C DR
Sbjct: 47 GNVYPLGHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGCT-------LPHDR--- 96
Query: 160 EYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI- 213
Y P ++ V C PLC S+S CK+ D C Y +Y+ + SS G LV D
Sbjct: 97 LYKPHNNV----VRCGEPLCSALFSASKSPCKNPNDQCDYEVEYA-DHGSSIGVLVKDPV 151
Query: 214 -LHLASFSKHAPQSSVQSSVIIGCGRKQ--TGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 270
L L + + AP ++ GCG Q GS L GV+GLG ++ + L+
Sbjct: 152 PLRLTNGTILAP------NLGFGCGYDQHNGGSQLPPLT-AGVLGLGNSKATMATQLSAL 204
Query: 271 GLIQNSFSIC-FDENDSGSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGN 325
++N C + F GD P++ S++PI G KY A G G
Sbjct: 205 SHVRNVLGHCFSGQGGGFLFFGGDLVPSS--GMSWMPILRTPGGKYSA---GPAEVYFGG 259
Query: 326 SCLTQSGFQALVDSGASFTFLPTEIYAEVV 355
+ + G DSG+S+T+ +++Y V+
Sbjct: 260 NPVGIRGLILTFDSGSSYTYFNSQVYGAVL 289
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 74/314 (23%), Positives = 133/314 (42%), Gaps = 43/314 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ +++G+P S L D GS+L+WV C+ SA+ T +++DPS SS+
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPT------TQFDPSRSSTY 154
Query: 170 KNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL-ASFSKHAPQS 226
VSC C++ R++C + C Y+ Y + ++++G L + + +P+
Sbjct: 155 GRVSCQTDACEALGRATCDDGSN-CAYLYAYG-DGSNTTGVLSTETFTFDDGGAGRSPRQ 212
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DE 283
V GC GS+ +GLG G VS+ + L A + FS C
Sbjct: 213 VRIGGVKFGCSTATAGSFPADGL----VGLGGGAVSLVTQLGGATSLGRRFSYCLVPHSV 268
Query: 284 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG-FQALVDSGAS 342
N S ++ FG T+ + P+ +GN + + + +VDSG +
Sbjct: 269 NASSALNFGALADVTEPGAASTPL---------------VGNKTVASAASSRIIVDSGTT 313
Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK---VPDMRLIFSKNQ 399
TFL + +V + + ++ + + CYN + E+ +PD+ L F
Sbjct: 314 LTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGA 373
Query: 400 SFVVRNHIFSFPEN 413
+ ++ PEN
Sbjct: 374 AVALK------PEN 381
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 87/367 (23%), Positives = 146/367 (39%), Gaps = 46/367 (12%)
Query: 75 TRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
TR+ L+ N S+ +G G Q +++ + IG+P + LD GS++
Sbjct: 132 TRLDLRPANGSAVFAASAAIQGPVVSGVG-QGSGEYFSRVGIGSPARQLYMVLDTGSDVT 190
Query: 135 WVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDP 191
WV CQ C C Y D +DPS S+S VSC C+ ++C++
Sbjct: 191 WVQCQPCADC-------YQQSD---PVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 240
Query: 192 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 251
C Y Y + + + G + L L S+ +V IGCG G ++ A
Sbjct: 241 CLYEVAYG-DGSYTVGDFATETLTLG-------DSTPVGNVAIGCGHDNEGLFVGAAGLL 292
Query: 252 GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS---GSVFFGDQGPATQQSTSFLPIG 308
+ G L S PS ++ ++FS C + DS ++ FGD T+ L
Sbjct: 293 ALGGGPL---SFPSQISA-----STFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRS 344
Query: 309 EKYDA-YFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVV 356
+ Y+V + +G L T +VDSG + T L + YA +
Sbjct: 345 PRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRD 404
Query: 357 KFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVG 416
F + S + + + CY+ S ++VP + L F + + + P + G
Sbjct: 405 AFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAG 464
Query: 417 DHACFSY 423
+ C ++
Sbjct: 465 TY-CLAF 470
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 89/227 (39%), Gaps = 27/227 (11%)
Query: 62 LELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWID---IGT 118
L L L++ +Q K + + R S +W +I IG
Sbjct: 33 LRLELTHVDAKQNCTTKERMRRATERTHRRLASMAGGGGEASAPIHWNETQYIAEYLIGD 92
Query: 119 PNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPL 178
P +D GSNL+W QC C ++L+ YDPS S ++K V+C+
Sbjct: 93 PPQQAAAIIDTGSNLIWT--QCSTC-----RANGCFGQDLTFYDPSRSRTAKPVACNDTA 145
Query: 179 C--KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
C S + C C + Y + G+L ++ H S S+ GC
Sbjct: 146 CLLGSETRCARDGKACAVLTAYGAG--AIGGFLGTEVFTFG----HGQSSENNVSLAFGC 199
Query: 237 --GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
+ T LDGA+ G++GLG G +S+PS L N FS C
Sbjct: 200 ITASRLTPGSLDGAS--GIIGLGRGKLSLPSQLG-----DNKFSYCL 239
>gi|242035209|ref|XP_002464999.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
gi|241918853|gb|EER91997.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
Length = 107
Score = 61.2 bits (147), Expect = 1e-06, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 45/76 (59%), Gaps = 1/76 (1%)
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSV 289
+V C TGS+LDG A +G+MGLG VSV +L +GL+ +SFS+CF E+ G +
Sbjct: 13 AVAKACRCGPTGSFLDGGAFNGLMGLGKEKVSVAGMLTASGLVASDSFSMCFSEDVVGRI 72
Query: 290 FFGDQGPATQQSTSFL 305
FGD G Q F+
Sbjct: 73 NFGDAGIRGQGEMPFI 88
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 73/313 (23%), Positives = 131/313 (41%), Gaps = 45/313 (14%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSS 166
Y ++ + +GTP + +D GS+++W QC+ C P S + + +DPS S
Sbjct: 418 YSIYLMKLQVGTPPFEIVAEIDTGSDIIWT--QCMPC-PNCYSQFAPI------FDPSKS 468
Query: 167 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
S+ + C+ + C Y Y+ + T S G L + + + S S +
Sbjct: 469 STFREQRCN-------------GNSCHYEIIYA-DKTYSKGILATETVTIPSTSG---EP 511
Query: 227 SVQSSVIIGCGRKQTGSYLDGAA--PDGVMGLGLGDVSVPSL--LAKAGLIQNSFSICFD 282
V + IGCG T G A G++GL +G +S+ S L GLI S CF
Sbjct: 512 FVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLI----SYCFS 567
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSG--FQA--- 335
+ + FG T + K D Y++ +++ + ++ + G F A
Sbjct: 568 GQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAEDG 627
Query: 336 --LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 393
+DSG + T+ P V +++V++ ++ G+ CY + + ++ V M
Sbjct: 628 NIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYYSDTIDIFPVITMH- 686
Query: 394 IFSKNQSFVVRNH 406
FS V+ +
Sbjct: 687 -FSGGADLVLDKY 698
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 88/356 (24%), Positives = 148/356 (41%), Gaps = 52/356 (14%)
Query: 77 VKLQSNNNS---SRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNL 133
++ +SN++S S+NQL S + T F Y ++ + +GTP +D GS+L
Sbjct: 50 IQRRSNSSSFRLSKNQLQGASPYADTLFD----YNIYLMKLQVGTPPFEIAAEIDTGSDL 105
Query: 134 LWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP 193
+W QC+ C Y+ D +DPS SS+ + R KS C
Sbjct: 106 IWT--QCMPC----PDCYSQFD---PIFDPSKSSTFN---------EQRCHGKS----CH 143
Query: 194 YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA--PD 251
Y Y ++T S G L + + + S S + V + IGCG T G A
Sbjct: 144 YEIIYE-DNTYSKGILATETVTIHSTSG---EPFVMAETTIGCGLHNTDLDNSGFASSSS 199
Query: 252 GVMGLGLGDVSVPSL--LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE 309
G++GL +G S+ S L GLI S CF + + FG T +
Sbjct: 200 GIVGLNMGPRSLISQMDLPYPGLI----SYCFSGQGTSKINFGTNAIVAGDGTVAADMFI 255
Query: 310 KYDA--YFVGVESYCIGNSCLTQSG--FQA-----LVDSGASFTFLPTEIYAEVVVKFDK 360
K D Y++ +++ + ++ + G F A ++DSG++ T+ P V ++
Sbjct: 256 KKDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGNIVIDSGSTVTYFPVSYCNLVRKAVEQ 315
Query: 361 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVG 416
+V++ R+ + CY + + ++ V M FS V+ + N G
Sbjct: 316 VVTAVRVPDPSGNDMLCYFSETIDIFPVITMH--FSGGADLVLDKYNMYMESNSGG 369
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 100/403 (24%), Positives = 157/403 (38%), Gaps = 53/403 (13%)
Query: 27 KLVHRFSDEAKERWISKSGNVSVADSW-PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNS 85
+L HR A R S + SVAD+ + EY+ +R R ++ +
Sbjct: 69 RLTHRHGPCAPSRASSLAAP-SVADTLRADQRRAEYI-------LRRVSGRAPQLWDSKA 120
Query: 86 SRNQLLFPSEGSQTHFFGNQFYWLHYTWI-DIGTPNVSFLVALDAGSNLLWVPCQCIQCA 144
+ P+ +G L+Y +GTP V+ + +D GS+L WV C+ A
Sbjct: 121 AAAAATVPAS------WGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAA 174
Query: 145 PLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS---SCKSLKDPCPYIADYSTE 201
P S Y+ D +DP+ SSS V C P+C + C Y+ Y +
Sbjct: 175 P---SCYSQKD---PLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYG-D 227
Query: 202 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 261
++++G D L L++ SS GCG Q+G + +G DG++GLG
Sbjct: 228 GSNTTGVYSSDTLTLSA-------SSAVQGFFFGCGHAQSGLF-NGV--DGLLGLGR--- 274
Query: 262 SVPSLLAK-AGLIQNSFSICFDENDS--GSVFFGDQGPATQ----QSTSFLPIGEKYDAY 314
PSL+ + AG FS C S G + G GP+ +T LP Y
Sbjct: 275 EQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYY 334
Query: 315 FVGVESYCIGNSCLT--QSGFQALVDSGASFTF--LPTEIYAEVVVKFDKLVSS--KRIS 368
V + +G L+ S F LP YA + F ++S +
Sbjct: 335 VVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTA 394
Query: 369 LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSF 410
CYN + + +P++ L F + + + I SF
Sbjct: 395 PSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSF 437
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 87/329 (26%), Positives = 130/329 (39%), Gaps = 53/329 (16%)
Query: 101 FFGNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLS 159
F G+ L Y + IGTP V +V +D GS+L WV QC P A + L
Sbjct: 108 FLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWV-----QCKPCGAGECYAQKDPL- 161
Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDP-CPYIADYSTEDTSSSGYLVDD 212
+DPSSSSS +V C C+ ++ C S C Y +Y T++ Y +
Sbjct: 162 -FDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTET 220
Query: 213 ILHLASFSKHAPQSSVQSSVII-----GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
+ +++ V++ GCG Q G Y DG++GLG S+ S
Sbjct: 221 L-------------TLKPGVVVADFGFGCGDHQHGPYEKF---DGLLGLGGAPESLVSQT 264
Query: 268 AKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS------FLP---IGEKYDAYFVGV 318
+ FS C G+ F P + S++ F P I Y V +
Sbjct: 265 SSQ--FGGPFSYCLPPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTL 322
Query: 319 ESYCIGNSCLT--QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGN 372
+G + L S F + ++DSG T LP YA + F +S R+ G
Sbjct: 323 TGISVGGAPLAVPPSAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGA 382
Query: 373 SWKYCYNASSEEMLKVPDMRLIFSKNQSF 401
CY+ + + VP + L FS +
Sbjct: 383 VLDTCYDFTGHTNVTVPTIALTFSGGATI 411
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/335 (25%), Positives = 140/335 (41%), Gaps = 71/335 (21%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +G+P + LD GS L W+ C+ + TS+ ++P SSSS +
Sbjct: 44 LTVGSPPQQVTMVLDTGSELSWLHCK-------KSPNLTSV------FNPLSSSSYSPIP 90
Query: 174 CSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
CS P+C++R+ + L +P C I Y+ + +S G L D +
Sbjct: 91 CSSPVCRTRT--RDLPNPVTCDPKKLCHAIVSYA-DASSLEGNLASDNFRIG-------- 139
Query: 226 SSVQSSVIIGCGRKQ-TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
SS + GC + + + A G+MG+ G + S + + GL + FS C
Sbjct: 140 SSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSL---SFVTQLGLPK--FSYCISGR 194
Query: 285 D-SGSVFFGDQ----------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 328
D SG + FGD P Q ST LP ++ AY V ++ +GN L
Sbjct: 195 DSSGVLLFGDSHLSWLGNLTYTPLVQISTP-LPYFDRV-AYTVQLDGIRVGNKILPLPKS 252
Query: 329 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY------ 376
T +G Q +VDSG FTFL +Y + +F + L ++ +
Sbjct: 253 IFAPDHTGAG-QTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDL 311
Query: 377 CYNA-SSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 410
CY + ++ ++P + L+F + VV + +
Sbjct: 312 CYRVPAGGKLPELPAVSLMF-RGAEMVVGGEVLLY 345
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 139/357 (38%), Gaps = 58/357 (16%)
Query: 103 GNQFYWLHY-TWIDIGTPNVSFL-VALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE 160
G ++ L+Y T I +G L V +D GS+L WV QC C +S Y D
Sbjct: 172 GIRYQTLNYVTTIALGGGGAKNLTVIVDTGSDLTWV--QCEPCP--GSSCYAQRD---PL 224
Query: 161 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP-----------------CPYIADYSTEDT 203
+DP++S + V C P C + SLKD C Y Y + +
Sbjct: 225 FDPAASPTFAAVPCGSPACAA-----SLKDATGAPGSCARSAGNSEQRCYYALSYG-DGS 278
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
S G L D L L + +K + GCG G + A G+MGLG D+S+
Sbjct: 279 FSRGVLAQDTLGLGTTTK-------LDGFVFGCGLSNRGLFGGTA---GLMGLGRTDLSL 328
Query: 264 PSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPAT----QQSTSFLPIGEKYDAYFVG 317
S A FS C +GS+ G GP++ T + + YF+
Sbjct: 329 VS--QTAARFGGVFSYCLPATTTSTGSLSLG-PGPSSSFPNMAYTRMIADPTQPPFYFIN 385
Query: 318 VE-SYCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS 373
+ + G + LT GF A LVDSG T L +Y V +F + + + G S
Sbjct: 386 ITGAAVGGGAALTAPGFGAGNVLVDSGTVITRLAPSVYKAVRAEFARRF--EYPAAPGFS 443
Query: 374 -WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYN 429
CY+ + + + VP + L V F + G C + +L Y
Sbjct: 444 ILDACYDLTGRDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYE 500
>gi|330794218|ref|XP_003285177.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
gi|325084898|gb|EGC38316.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
Length = 817
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 71/273 (26%), Positives = 116/273 (42%), Gaps = 34/273 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSE-YDPSS 165
++ I +GTP F V +D GS L VP C + + S S D NL Y
Sbjct: 205 YFIPILVGTPPQMFTVQVDTGSTSLAVPGSNCYLYKSQSIKTSCSCS-DGNLDGLYSLEE 263
Query: 166 SSSSKNVSCSHP----LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
S SS ++CS CK+ S K PCP++ Y + + +G LV D + + F+
Sbjct: 264 SISSNQLNCSDTSNCNTCKNNKSNK----PCPFVLKYG-DGSFIAGSLVIDHVTIGDFTV 318
Query: 222 HAPQSSVQSSVI----IGCGRKQTGSYLDGAAPDGVMGLGL-------GDVSVPSLLAKA 270
A ++Q + + C Q A DG++GL GD ++A
Sbjct: 319 PAKFGNIQKESLSFSQLTCPSTQRSQ----AVRDGILGLSFQQLDPDNGDDIFSKIVAHY 374
Query: 271 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 330
I N FS+C ++ G TQ++ + PI + + Y + V + +GN L
Sbjct: 375 N-IPNVFSMCLGKDGGLLTIGGTNDHITQETPKYTPIFDSH-YYSITVTNIYVGNDSLNL 432
Query: 331 SG---FQALVDSGASFTFLPTEIYAEVVVKFDK 360
+ ++VDSG + + EI+ +V ++
Sbjct: 433 APPDLSTSIVDSGTTLLYFSDEIFYSIVRNLEE 465
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 122/302 (40%), Gaps = 39/302 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP F + D GS+L W QC P S Y ++ + ++PS S+S
Sbjct: 153 YFVTVGLGTPKKDFSLIFDTGSDLTWT-----QCEPCVKSCY---NQKEAIFNPSQSTSY 204
Query: 170 KNVSCSHPLCKSRSSCKS-----LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
N+SC LC S +S C Y Y + + S G+ + L L +
Sbjct: 205 ANISCGSTLCDSLASATGNIFNCASSTCVYGIQYG-DSSFSIGFFGKEKLSLTA------ 257
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ V + GCG+ G + A G+ L VS A FS C +
Sbjct: 258 -TDVFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVS-----QTAQRYNKIFSYCLPSS 311
Query: 285 DSGSVFFGDQGPATQQSTSFLPI----------GEKYDAYFVGVESYCIGNSCLTQSGFQ 334
S + F G +T +S SF P+ G VG I S + +G
Sbjct: 312 SSSTGFL-TFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTAG-- 368
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
++DSG T LP Y+ + F KL+S + + C++ S+ + + VP + L
Sbjct: 369 TIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLF 428
Query: 395 FS 396
FS
Sbjct: 429 FS 430
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 135/332 (40%), Gaps = 54/332 (16%)
Query: 72 RQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGS 131
+ R++ S+ + + + G Q GN + + +GTP + + LD +
Sbjct: 62 KDPARIRYLSSLTAQKTVAAPIASGQQVLNVGN-----YVVRVQLGTPGQTMYMVLDTSN 116
Query: 132 NLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC-KSRS-SCKSL 188
+ W PC CI C+ + + + +SS+ + CS P C ++R SC +
Sbjct: 117 DAAWAPCSGCIGCS------------STTTFSAQNSSTFATLDCSKPECTQARGLSCPTT 164
Query: 189 KD-PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG 247
+ C + Y D++ S LV D LHL +V + GC +GS +
Sbjct: 165 GNVDCLFNQTYG-GDSTFSATLVQDSLHLG--------PNVIPNFSFGCISSASGSSI-- 213
Query: 248 AAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDEND----SGSVFFGDQG-PATQQS 301
P G+MGLG G + SL++++G L FS C SGS+ G G P ++
Sbjct: 214 -PPQGLMGLGRGPL---SLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRT 269
Query: 302 TSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIY 351
T L + Y+V + +G + +G ++DSG T IY
Sbjct: 270 TPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIY 329
Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
V +F K V L ++ C+ ++E
Sbjct: 330 TAVRDEFRKQVGGSFSPL--GAFDTCFATNNE 359
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 133/335 (39%), Gaps = 71/335 (21%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLD-RNLSEY 161
Y + + GTP+ + D GS+L+W PC C C ++ LD + +
Sbjct: 87 YGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCN------FSGLDPTQIPRF 140
Query: 162 DPSSSSSSKNVSCSHPLCK----SRSSCKSLKD-------PC-PYIADYSTEDTSSSGYL 209
P +SSSS+ + C +P C+ + C+ PC PYI Y S++G L
Sbjct: 141 IPKNSSSSRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGL--GSTAGIL 198
Query: 210 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 269
+ + L + ++GC S + P G+ G G G S+PS +
Sbjct: 199 ISEKLDFPDLT--------VPDFVVGC------SVISTRTPAGIAGFGRGPESLPSQMKL 244
Query: 270 AGLIQNSFSICFDEN--------DSGSVFFGDQGPATQQSTSFLPIGEKYDA-------- 313
S FD+ D+GS G + + S+ P + +
Sbjct: 245 KSFSHCLVSRRFDDTNVTTDLGLDTGS---GHKSGSKTPGLSYTPFRKNPNVSNTAFLEY 301
Query: 314 YFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
Y++ + +G+ + T ++VDSG++FTF+ ++ V +F +S
Sbjct: 302 YYLNLRRIYVGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMS 361
Query: 364 --SKRISLQGNSW-KYCYNASSEEMLKVPDMRLIF 395
++ L+ S C+N S + + VP++ F
Sbjct: 362 NYTREKDLEKVSGIAPCFNISGKGDVTVPELIFEF 396
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 128/332 (38%), Gaps = 78/332 (23%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLD-RNLSEYDPSSSSS 168
+++GTP + LD GS+L+W PC C C + ++D + + P +SS+
Sbjct: 92 LNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCN------FPNIDPTKIPTFIPKNSST 145
Query: 169 SKNVSCSHPLC------KSRSSCKSLKDP--------CP-YIADYSTEDTSSSGYLVDDI 213
+K + C +P C S C K P CP YI Y T +G+L+ D
Sbjct: 146 AKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGAT--AGFLLLDN 203
Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
L+ K PQ ++GC S L P G+ G G G S+PS +
Sbjct: 204 LNFP--GKTVPQ------FLVGC------SILSIRQPSGIAGFGRGQESLPSQMN----- 244
Query: 274 QNSFSIC-----FDENDSGS---VFFGDQGPATQQSTSFLPIGEK-------YDAYFVGV 318
FS C FD+ S + G S+ P + Y+V +
Sbjct: 245 LKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTL 304
Query: 319 ESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 368
+G + + +VDSG++FTF+ +Y V +F + + K+ S
Sbjct: 305 RKLIVGGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQL-GKKYS 363
Query: 369 LQGN-----SWKYCYNASSEEMLKVPDMRLIF 395
+ N C+N S + + P+ F
Sbjct: 364 REENVEAQSGLSPCFNISGVKTISFPEFTFQF 395
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 137/323 (42%), Gaps = 61/323 (18%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IGTP S + LD GS L W+ QC + P T +DPS SSS +
Sbjct: 81 LPIGTPPQSQQMILDTGSQLSWI--QCHKKVPRKPPPSTV-------FDPSLSSSFSVLP 131
Query: 174 CSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
C+HPLCK R +SC L C Y Y+ + T + G LV + + ++ P
Sbjct: 132 CNHPLCKPRIPDFTLPTSC-DLNRLCHYSYFYA-DGTLAEGNLVREKITFSTSQSTPP-- 187
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE--- 283
+I+GC D + G++G+ LG +S S +A + + S+ + +
Sbjct: 188 -----LILGCAE-------DASDDKGILGMNLGRLSFAS---QAKITKFSYCVPTRQVRP 232
Query: 284 --NDSGSVFFGDQ-GPATQQSTSFLPIGEKYD-------AYFVGVESYCIGNSCLT--QS 331
+GS + G+ A Q S L + A+ V ++ IGN L S
Sbjct: 233 GFTPTGSFYLGENPNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVS 292
Query: 332 GF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNAS 381
F Q+++DSG+ FT+L Y +V + +L K+ + C++ +
Sbjct: 293 AFRADPSGAGQSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVSDMCFDGN 352
Query: 382 SEEMLK-VPDMRLIFSKNQSFVV 403
+ E+ + + +M F K V+
Sbjct: 353 AMEIGRLIGNMVFEFDKGVEIVI 375
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/301 (25%), Positives = 128/301 (42%), Gaps = 49/301 (16%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+G P L +D GSN+LWV C C +C +N DPS SS+ ++ C
Sbjct: 105 MGQPATPQLAIMDTGSNILWVRCAPCKRCT----------QQNGPLLDPSKSSTYASLPC 154
Query: 175 SHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS-- 230
++ +C S C L + C Y Y+T SS+G L + L H+ V +
Sbjct: 155 TNTMCHYAPSAYCNRL-NQCGYNLSYAT-GLSSAGVLATEQLIF-----HSSDEGVNAVP 207
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD-----END 285
SV+ GC + G Y D GV GLG G + S + + G + FS C
Sbjct: 208 SVVFGCSH-ENGDYKDRRF-TGVFGLGKG---ITSFVTRMG---SKFSYCLGNIADPHYG 259
Query: 286 SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG-------NSCLTQSGFQ--AL 336
+ FG++ ST P+ Y+V +E +G ++ + G + AL
Sbjct: 260 YNQLVFGEKANFEGYST---PLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSAL 316
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS-SEEMLKVPDMRLIF 395
+DSG + T+L + + + +L+ + S+ CY + S++++ P + F
Sbjct: 317 IDSGTALTWLAESAFRALDNEVRQLLDGVLMPFWRGSFA-CYKGTVSQDLIGFPVVTFHF 375
Query: 396 S 396
S
Sbjct: 376 S 376
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 89/211 (42%), Gaps = 54/211 (25%)
Query: 34 DEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFP 93
DEA+ RWI + S+D + ++ R LQ+ SS L
Sbjct: 4 DEARLRWIHHR--------------------IQSSDHRHRRGRSLLQTAQVSSGLSL--- 40
Query: 94 SEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTS 153
GS +F + IG+P S+ + LD GS++ W IQCAP S S Y+
Sbjct: 41 --GSGEYF----------ARMGIGSPQRSYYLELDTGSDVTW-----IQCAPCS-SCYSQ 82
Query: 154 LDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 211
+D YDPS+SSS + V C LC++ S+C+ + C Y Y SS
Sbjct: 83 VD---PIYDPSNSSSYRRVYCGSALCQALDYSACQGMG--CSYRVVYGDSSASSGD---- 133
Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
L + SF S+ ++ GCG +G
Sbjct: 134 --LGIESFYLGPNSSTAMRNIAFGCGHSNSG 162
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 129/371 (34%), Gaps = 65/371 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCI-------------QCAPLSASYYTSLDRNLSE 160
+ IGTP + + + LD ++L W+ C+ Q + T+ + S+
Sbjct: 128 VRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEGATAAKKEASK 187
Query: 161 --YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL---VDDILH 215
Y P+ SSS + + CS C PY S S Y D +
Sbjct: 188 NWYRPAKSSSWRRIRCSQKECAV----------LPYNTCQSPSKAESCSYFQKTQDGTVT 237
Query: 216 LASFSKHAPQSSVQS-------SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
+ + K +V +I+GC + G +D A DGV+ LG GD+S A
Sbjct: 238 IGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVHAA 295
Query: 269 KAGLIQNSFSICF-----DENDSGSVFFGDQ----GPATQQSTSFLPI------GEKYDA 313
K FS C + S + FG GP T ++ + G K
Sbjct: 296 KR--FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTG 353
Query: 314 YFVGVESYCIGNSCLTQSGF---QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 370
VG E I + F ++D+ S T L E YA V D+ +S +
Sbjct: 354 VLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYE 413
Query: 371 GNSWKYCYN-------ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
++YCY + +P + + PE E G AC ++
Sbjct: 414 LEGFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGV-ACLAF 472
Query: 424 FTLEYNFTGIL 434
L GIL
Sbjct: 473 RKLLRGGPGIL 483
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/324 (25%), Positives = 130/324 (40%), Gaps = 80/324 (24%)
Query: 78 KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP 137
+ N++S + PS + + + + +T +GTP V LD GS+L WVP
Sbjct: 68 RRDPNHHSQKGSGGHPSVPATAALYPHSYGGYAFT-ASLGTPPQPLPVLLDTGSHLTWVP 126
Query: 138 C----QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSL 188
C +C C+ SAS + + P +SSSS+ V C +P C+ + + K
Sbjct: 127 CTSSYECRNCSSPSAS-------AVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCR 179
Query: 189 KDPC----------------PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
+ PC PY Y + T +G L+ D L AP +V
Sbjct: 180 RAPCSPGAANCPAAASNVCPPYAVVYGSGST--AGLLIADTL-------RAPGRAVP-GF 229
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC-----FDEND-- 285
++GC P G+ G G G SVP+ L GL + FS C FD+N
Sbjct: 230 VLGCSLVSVHQ-----PPSGLAGFGRGAPSVPAQL---GLPK--FSYCLLSRRFDDNAAV 279
Query: 286 SGSVFFGDQGPATQQSTSFLPI-----GEKYD---AYFVGVESYCIGNSC--LTQSGFQA 335
SGS+ G + ++P+ G+K Y++ + +G L F
Sbjct: 280 SGSLVLGGT--GGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAG 337
Query: 336 --------LVDSGASFTFLPTEIY 351
+VDSG +FT+L ++
Sbjct: 338 NAAGSGGTIVDSGTTFTYLDPTVF 361
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 91/336 (27%), Positives = 136/336 (40%), Gaps = 84/336 (25%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ GTP +F LD GS+L+W+PC C +C S + N ++ P S SS
Sbjct: 220 LKFGTPPQTFPFVLDTGSSLVWLPCYSHYLCSKCNSFS-------NNNTPKFIPKDSFSS 272
Query: 170 KNVSCSHPLCK-------SRSSCKSLK----------DPCP-YIADYSTEDTSSSGYLVD 211
K V C +P C + CK K CP Y Y S++G+L+
Sbjct: 273 KFVGCRNPKCAWVFGSDVTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGL--GSTAGFLLS 330
Query: 212 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 271
+ L+ P +V S ++GC S + P G+ G G G+ S+P A+
Sbjct: 331 ENLNF-------PAKNV-SDFLVGC------SVVSVYQPGGIAGFGRGEESLP---AQMN 373
Query: 272 LIQNSFSIC-----FDENDSGSVFFGD-----QGPATQ--QSTSFL--PIGEK--YDAYF 315
L + FS C FDE+ S + +G T T+FL P +K + AY+
Sbjct: 374 LTR--FSYCLLSHQFDESPENSDLVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYY 431
Query: 316 --------VGVESYCIGNSCLT-----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
VG + + L GF +VDSG++ TF+ I+ V +F K V
Sbjct: 432 YITLRKIVVGEKRVRVPRRMLEPDVNGDGGF--IVDSGSTLTFMERPIFDLVAEEFVKQV 489
Query: 363 SSKRISLQGNSWKY--CYN-ASSEEMLKVPDMRLIF 395
+ R + C+ A E P+MR F
Sbjct: 490 NYTRARELEKQFGLSPCFVLAGGAETASFPEMRFEF 525
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 130/333 (39%), Gaps = 68/333 (20%)
Query: 97 SQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDR 156
S + G Y++ + +GTP F++ D GS+L WV C+ P S D
Sbjct: 95 SSGAYTGTGQYFVRFR---VGTPAQPFVLVADTGSDLTWVKCRGAAGPPAS-------DP 144
Query: 157 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLV 210
E+ S S S ++CS C S ++C S PC Y DY +D S++ G +
Sbjct: 145 PAREFRASESRSWAPLACSSDTCTSYVPFSLANCSSPASPCAY--DYRYKDGSAARGVVG 202
Query: 211 DDILHLA-------SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
D +A S + + V++GC G + DGV+ LG ++S
Sbjct: 203 TDAATIALSGSGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSS--DGVLSLGNSNISF 260
Query: 264 PSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPI----------- 307
S A FS C N S + FG + P+
Sbjct: 261 ASR--AAARFGGRFSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYA 318
Query: 308 ---------GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
GE D + + + +G G A++DSG S T L T Y VV
Sbjct: 319 VAVDAVYVAGEALD---IPADVWDVGR------GGGAILDSGTSLTVLATPAYRAVVAAL 369
Query: 359 -DKLVSSKRISLQGNSWKYCYN--ASSEEMLKV 388
+L + R+++ + ++YCYN A + E+ K+
Sbjct: 370 GGRLAALPRVAM--DPFEYCYNWTAGAPEIPKL 400
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/310 (23%), Positives = 124/310 (40%), Gaps = 48/310 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP F + +D GS+L W+ C C+ C ++ +DP++S S +NV+C
Sbjct: 158 VGTPPRRFQMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPATSLSYRNVTC 207
Query: 175 SHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
P C R+ + DPCPY Y + ++ D L + + AP +S
Sbjct: 208 GDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTG----DLALEAFTVNLTAPGAS 263
Query: 228 VQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+ V+ GCG G + A G+ L S L A G ++FS C ++ S
Sbjct: 264 RRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFAS--QLRAVYG---HAFSYCLVDHGS 318
Query: 287 ---GSVFFGDQG-----PATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQS------ 331
+ FGD P + D Y+V ++ +G L S
Sbjct: 319 SVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDV 378
Query: 332 ----GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEML 386
++DSG + ++ Y + F +++ + + CYN S E +
Sbjct: 379 GKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERV 438
Query: 387 KVPDMRLIFS 396
+VP+ L+F+
Sbjct: 439 EVPEFSLLFA 448
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/266 (24%), Positives = 106/266 (39%), Gaps = 44/266 (16%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP + LD GS+L+W C C+ C +D+ +DP++SS+ +++
Sbjct: 96 MGIGTPARFYSAILDTGSDLIWTQCAPCLLC----------VDQPTPYFDPANSSTYRSL 145
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
CS P C + + C Y Y + S++G L ++ + +
Sbjct: 146 GCSAPACNALYYPLCYQKTCVYQYFYG-DSASTAGVLANETFTFGTNDTRVTLPRIS--- 201
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSV 289
GCG GS +G+ G++G G G +S+ S L FS C F +
Sbjct: 202 -FGCGNLNAGSLANGS---GMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVRSRL 252
Query: 290 FFG------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSG 332
+FG +T QST F+ YF+ + +G + L T
Sbjct: 253 YFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGT 312
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKF 358
++DSG + T+L Y V F
Sbjct: 313 GGTIIDSGTTITYLAEPAYYAVREAF 338
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/310 (23%), Positives = 124/310 (40%), Gaps = 48/310 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP F + +D GS+L W+ C C+ C ++ +DP++S S +NV+C
Sbjct: 158 VGTPPRRFQMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASLSYRNVTC 207
Query: 175 SHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
P C R+ + DPCPY Y + ++ D L + + AP +S
Sbjct: 208 GDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTG----DLALEAFTVNLTAPGAS 263
Query: 228 VQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS 286
+ V+ GCG G + A G+ L S L A G ++FS C ++ S
Sbjct: 264 RRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFAS--QLRAVYG---HAFSYCLVDHGS 318
Query: 287 ---GSVFFGDQG-----PATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQS------ 331
+ FGD P + D Y+V ++ +G L S
Sbjct: 319 SVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDV 378
Query: 332 ----GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEML 386
++DSG + ++ Y + F +++ + + CYN S E +
Sbjct: 379 GKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERV 438
Query: 387 KVPDMRLIFS 396
+VP+ L+F+
Sbjct: 439 EVPEFSLLFA 448
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 117/298 (39%), Gaps = 35/298 (11%)
Query: 117 GTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSH 176
G+ V+ + +D S++ WV QCAP A + + L YDPS SSSS CS
Sbjct: 150 GSGGVAQTMVIDTASDVPWV-----QCAPCPAPHCHAQTDVL--YDPSKSSSSAAFPCSS 202
Query: 177 PLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
P C++ + C D C Y Y + ++S+G + D+L L A +S S
Sbjct: 203 PACRNLGPYANGCTPAGDQCQYRVQYP-DGSASAGTYISDVLTL----NPAKPASAISEF 257
Query: 233 IIGCGRK--QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF 290
GC Q GS+ + + G+M LG G S+P+ + FS C S F
Sbjct: 258 RFGCSHALLQPGSFSNKTS--GIMALGRGAQSLPT--QTKATYGDVFSYCLPPTPVHSGF 313
Query: 291 FGDQGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCLTQS----GFQALVDSGAS 342
F P S T L Y V + + + L A++DS
Sbjct: 314 FILGVPRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTI 373
Query: 343 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS-----SEEMLKVPDMRLIF 395
T LP Y + F + + R + CY+ S +K+P + L+F
Sbjct: 374 VTRLPPTAYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVF 431
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 92/415 (22%), Positives = 157/415 (37%), Gaps = 75/415 (18%)
Query: 57 NSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFG------------- 103
N+ E L +R+ RV+ + +L GS + G
Sbjct: 88 NATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYENVAGVTAEFGSEVVSGM 147
Query: 104 NQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYD 162
Q ++T I IGTP + LD GS+++W+ C+ C +C Y+ D ++
Sbjct: 148 EQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCREC-------YSQAD---PIFN 197
Query: 163 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
PSSS S V C +C + C Y Y + Y + + +F
Sbjct: 198 PSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETL----TFGT- 252
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
+S+Q +V IGCG G ++ A G+ L S P+ L +FS C
Sbjct: 253 ---TSIQ-NVAIGCGHDNVGLFVGAAGLLGLGAGSL---SFPAQLGTQ--TGRAFSYCLV 303
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIGNSCLTQSG 332
+ DS S + GP + +PIG + Y++ + + +G L
Sbjct: 304 DRDSESSGTLEFGPES------VPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVP 357
Query: 333 FQA------------LVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKY 376
+A ++DSG + T L T Y + F L + IS+ +
Sbjct: 358 SEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISI----FDT 413
Query: 377 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFT 431
CY+ S+ + + +P + FS F++ P + +G CF++ + N +
Sbjct: 414 CYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTF-CFAFAPADSNLS 467
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 113/296 (38%), Gaps = 47/296 (15%)
Query: 117 GTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
GTP + L+ALD S+ W+PC C+ C+ + P S+S +NVSC
Sbjct: 104 GTPPQTLLLALDTSSDAAWIPCSGCVGCS------------TSKPFAPIKSTSFRNVSCG 151
Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
P CK + C + Y + ++S +V D L LA + G
Sbjct: 152 SPHCKQVPNPTCGGSACAFNFTYGSSSIAAS--VVQDTLTLA--------TDPIPGYTFG 201
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVFF 291
C K TGS +AP + ++ L +++FS C N SGS+
Sbjct: 202 CVNKTTGS----SAPQQGLLGLGRGPLSLLSQSQ-NLYKSTFSYCLPSFKSINFSGSLRL 256
Query: 292 GD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSG 340
G P + T L + Y+V + + +G + +G + DSG
Sbjct: 257 GPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSG 316
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
FT L +Y V +F + V K + CYN + VP + +FS
Sbjct: 317 TVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVP----IVVPTITFLFS 368
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/311 (25%), Positives = 125/311 (40%), Gaps = 47/311 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE--YDPSSSS 167
++ +GTP FL+ D GS+L WV C+ + A S + +S + + P S
Sbjct: 95 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154
Query: 168 SSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD------ILHL 216
+ + C+ C S S+C + PC Y DY +D S++ V +
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAY--DYRYKDGSAARGTVGTESATIALSSS 212
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
+S SK+ + + +++GC TG + A DGV+ LG +VS S A
Sbjct: 213 SSSSKNKVKKAKLQGLVLGCTGSYTGPSFE--ASDGVLSLGYSNVSFAS--HAASRFGGR 268
Query: 277 FSICF-----DENDSGSVFFGDQ-----------GPATQQSTSFLPIGEKYDAYFVGVES 320
FS C N + + FG GP +Q T + Y V +++
Sbjct: 269 FSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQ-TPLVLDSRMRPFYDVSIKA 327
Query: 321 YCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQG 371
+ L G +VDSG S T L Y VV KL R+++
Sbjct: 328 ISVDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRVAM-- 385
Query: 372 NSWKYCYNASS 382
+ ++YCYN +S
Sbjct: 386 DPFEYCYNWTS 396
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/359 (23%), Positives = 142/359 (39%), Gaps = 70/359 (19%)
Query: 78 KLQSNNNSSRNQ--LLFPSEGSQTHFFGNQFYWLHYTWIDI----GTPNVSFLVALDAGS 131
++Q+ +SS+ Q LL P + +QT + + H + I G+P + + LD GS
Sbjct: 22 QIQTCVSSSQTQKPLLLPLK-TQTQTPPRKLAFQHNVTLTISLTIGSPPQNVTMVLDTGS 80
Query: 132 NLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS-------S 184
L W+ C+ L S ++P SSS C+ +C +R+ S
Sbjct: 81 ELSWLHCK-------------KLPNLNSTFNPLLSSSYTPTPCNSSVCMTRTRDLTIPAS 127
Query: 185 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC--GRKQTG 242
C C I Y+ + +S+ G L + LA + Q + GC T
Sbjct: 128 CDPNNKLCHVIVSYA-DASSAEGTLAAETFSLA--------GAAQPGTLFGCMDSAGYTS 178
Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQST 302
+ A G+MG+ G +S+ ++ FS C D+ V GP+
Sbjct: 179 DINEDAKTTGLMGMNRGSLSL-----VTQMVLPKFSYCISGEDAFGVLLLGDGPSAPSPL 233
Query: 303 SFLPI------GEKYD--AYFVGVESYCIGNSCL-----------TQSGFQALVDSGASF 343
+ P+ +D AY V +E + L T +G Q +VDSG F
Sbjct: 234 QYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAG-QTMVDSGTQF 292
Query: 344 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY------CYNASSEEMLKVPDMRLIFS 396
TFL +Y + +F + ++ ++ + CY+A + + VP + L+FS
Sbjct: 293 TFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPA-SLAAVPAVTLVFS 350
>gi|170091822|ref|XP_001877133.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
gi|164648626|gb|EDR12869.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
Length = 408
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 126/297 (42%), Gaps = 60/297 (20%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T I IG P SF V LD GS+ LWVP ++C ++ +T +YD +SSS+
Sbjct: 97 YFTEISIGNPPQSFKVILDTGSSNLWVPS--VKCTSIACFLHT-------KYDSASSSTF 147
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
K + S + G++ +D+L + + Q +
Sbjct: 148 KANGSEFSIHYGSGSME--------------------GFVSNDLLSIGDITIKG-QDFAE 186
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA------KAGLIQN---SFSIC 280
+ K+ G DG++GLG +SV ++ GLI + SF +
Sbjct: 187 AV-------KEPGLAFAFGKFDGILGLGYDTISVNHIIPPFYSMINQGLIDSPVFSFRLG 239
Query: 281 FDENDSG-SVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVD 338
E D G +VF G A + +++P+ K AY+ V +E GN L A +D
Sbjct: 240 SSEEDGGEAVFGGIDESAYKGKITYVPVRRK--AYWEVELEKVSFGNDDLELESTGAAID 297
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+G S LPT+I AE++ + + +K+ SW Y ++ +P++ F
Sbjct: 298 TGTSLIVLPTDI-AEML---NTQIGAKK------SWNGQYQVDCAKVPSLPELSFYF 344
>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
Length = 411
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/273 (24%), Positives = 109/273 (39%), Gaps = 35/273 (12%)
Query: 93 PSEGSQTHFFGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSAS 149
PS GN + H+ ++I P + + +D GS L W+ C CI C +
Sbjct: 20 PSSAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHG 79
Query: 150 YYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCK-SLKDPCPYIADYSTEDT 203
Y V C+ C R K K+ C Y Y
Sbjct: 80 LY-------------KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GG 124
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVS 262
SS G L+ D SFS A + +S+ GCG Q + + P +G++GLG G V+
Sbjct: 125 SSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVT 179
Query: 263 VPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 321
+ S L G+I ++ C G +FFGD T T + P+ ++ Y +
Sbjct: 180 LLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTL 238
Query: 322 CIGN---SCLTQSGFQALVDSGASFTFLPTEIY 351
+ S ++ + + + DSGA++T+ + Y
Sbjct: 239 HFNSNKQSPISAAPMEVIFDSGATYTYFALQPY 271
>gi|213998848|gb|ACJ60790.1| nucellin [Psathyrostachys stoloniformis]
Length = 154
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 69/138 (50%), Gaps = 4/138 (2%)
Query: 229 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 286
+ ++ GCG KQ +P DG++GLG+G + L +I +N C
Sbjct: 6 KKNIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGK 65
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 345
G ++ GD P T+ T ++P+ E Y G+ + I + F+A+ DSG+++T+
Sbjct: 66 GVLYVGDFNPPTRGVT-WVPMRESLFYYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTY 124
Query: 346 LPTEIYAEVVVKFDKLVS 363
+P +IY E+V K +S
Sbjct: 125 MPAQIYNELVSKIRGTLS 142
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 126/312 (40%), Gaps = 37/312 (11%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+GTP V+ + +D GS+L WV C+ AP S Y+ D +DP+ SSS V C
Sbjct: 146 LGTPGVAQTMEVDTGSDLSWVQCKPCAAAP---SCYSQKD---PLFDPAQSSSYAAVPCG 199
Query: 176 HPLCKSRS---SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
P+C + C Y+ Y + ++++G D L L++ SS
Sbjct: 200 GPVCAGLGIYAASACSAAQCGYVVSYG-DGSNTTGVYSSDTLTLSA-------SSAVQGF 251
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDS--GSV 289
GCG Q+G + +G DG++GLG PSL+ + AG FS C S G +
Sbjct: 252 FFGCGHAQSGLF-NGV--DGLLGLGR---EQPSLVEQTAGTYGGVFSYCLPTKPSTAGYL 305
Query: 290 FFGDQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQALVDSGASF 343
G GP+ +T LP Y V + +G L+ S F
Sbjct: 306 TLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGT 365
Query: 344 TF--LPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
LP YA + F ++S + CYN + + +P++ L F
Sbjct: 366 VVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGA 425
Query: 400 SFVV-RNHIFSF 410
+ + + I SF
Sbjct: 426 TVTLGADGILSF 437
>gi|156099262|ref|XP_001615633.1| aspartic protease PM5 [Plasmodium vivax Sal-1]
gi|148804507|gb|EDL45906.1| aspartic protease PM5 [Plasmodium vivax]
Length = 536
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/297 (23%), Positives = 123/297 (41%), Gaps = 61/297 (20%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y ++ IDIGTP + LD GS+ L PC C C + ++ ++
Sbjct: 59 YAYYFLDIDIGTPEQRISLILDTGSSSLSFPCAGCKNCGVHME----------NPFNLNN 108
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
S +S + C + C + +C +K C Y+ Y E + SG+ D++ + S++
Sbjct: 109 SKTSSILYCENEECPFKLNC--VKGKCEYMQSY-CEGSQISGFYFSDVVSVVSYN----N 161
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPS----LLAKAGLIQNSFSIC 280
V ++GC + +L A GV+G+ L +P+ L A ++ F+IC
Sbjct: 162 ERVTFRKLMGCHMHEESLFLYQQA-TGVLGMSLSKPQGIPTFVNLLFDNAPQLKQVFTIC 220
Query: 281 FDEN------------------DSGSVFFGDQGPATQ----------------QSTSFLP 306
EN S SV GP ++ + +
Sbjct: 221 ISENGGELIAGGYDPAYIVRRGGSKSVSGQGSGPVSESLSESGEDPQVALREAEKVVWEN 280
Query: 307 IGEKYDAYFVGVESY-CIGNSCLTQS-GFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
+ KY Y++ V G + ++ S G + LVDSG++FT +P ++Y ++ FD L
Sbjct: 281 VTRKY-YYYIKVRGLDMFGTNMMSSSKGLEMLVDSGSTFTHIPEDLYNKLNYFFDIL 336
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 80/328 (24%), Positives = 123/328 (37%), Gaps = 67/328 (20%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ---------CIQCAPLSASYYTSLDRNLSE 160
++ +GTP FL+ D GS+L WV C + L A S R
Sbjct: 87 YFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRT--- 143
Query: 161 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 215
+ P S + + CS C+ S ++C + +PC Y DY +D S++ V
Sbjct: 144 FRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAY--DYRYKDGSAARGTVGVDSA 201
Query: 216 LASFSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 274
+ S A + + V++GC G S+L A DGV+ LG ++S S A
Sbjct: 202 TIALSGRAARKAKLRGVVLGCTTSYNGQSFL---ASDGVLSLGYSNISFASRAAS--RFG 256
Query: 275 NSFSICF-----DENDSGSVFFG----------DQGPAT----------------QQSTS 303
FS C N + + FG +G A+ + T
Sbjct: 257 GRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTP 316
Query: 304 FLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVV 355
+ Y V V+ + L + G A++DSG S T L Y VV
Sbjct: 317 LVLDHRTRPFYAVTVKGVSVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVV 376
Query: 356 VKFDK-LVSSKRISLQGNSWKYCYNASS 382
K L R+++ + + YCYN +S
Sbjct: 377 AALSKRLAGLPRVTM--DPFDYCYNWTS 402
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 80/315 (25%), Positives = 135/315 (42%), Gaps = 46/315 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE-YDPSSSSS 168
+ T + +GTP+ S+ + +D GS+L W +QC+P S R + +DP +SS+
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTW-----LQCSPC----VVSCHRQVGPLFDPRASST 184
Query: 169 SKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
+V CS C + S+C S + C Y A Y + + S G L D +
Sbjct: 185 YASVRCSASQCDELQAATLNPSAC-SASNVCIYQASYG-DSSFSVGSLSTDTVSFG---- 238
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
S+ S GCG+ G + A G++GL +S+ LA + + SFS C
Sbjct: 239 ----STRYPSFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSYCL 289
Query: 282 DENDSGSVFFGDQGP-ATQQSTSFLPIG-EKYDA--YFVGVESYCIGNSCLT-----QSG 332
+ S + GP T S+ P+ DA YF+ + +G S L S
Sbjct: 290 PT--AASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSS 347
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++DSG T LPT ++ + + ++ + + + C+ + + L+VP +
Sbjct: 348 LPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ-LRVPTVA 406
Query: 393 LIFSKNQS--FVVRN 405
+ F+ S RN
Sbjct: 407 MAFAGGASMKLTTRN 421
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 143/325 (44%), Gaps = 66/325 (20%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP L D GS+L W+ + C QC P + DPS+S++ +
Sbjct: 84 LSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIF----------DPSNSTTFHKL 133
Query: 173 SCSHPLCKS-RSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
C+ C + S +S DP C Y Y + + ++GYL D + + + +SVQ
Sbjct: 134 PCTTAPCNALDESARSCTDPTTCGYTYSYG-DHSYTTGYLASDTVTVGN-------ASVQ 185
Query: 230 -SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------- 281
+V GCG + G++ + + G++GLG G++S S L I FS C
Sbjct: 186 IRNVAFGCGTRNGGNFDEQGS--GIVGLGGGNLSFVSQLGDT--IGKKFSYCLLPLENEI 241
Query: 282 -----DENDSGSVFFGDQGPATQQSTSFL-----PIGEKYDA--YFVGVESYCIGNSCLT 329
D + + FGD + ST+ + P+ K + Y++ +E+ +G L
Sbjct: 242 SSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLL 301
Query: 330 -----------QSGFQA-------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 371
SG ++ ++DSG + TFL E Y + + + +R++
Sbjct: 302 YSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVK 361
Query: 372 NS-WKYCYNASSEEMLKVPDMRLIF 395
NS + C+ + EE +++P M++ F
Sbjct: 362 NSMFSLCFKSGKEE-VELPLMKVHF 385
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 136/351 (38%), Gaps = 75/351 (21%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPC-----QCIQCAPLSASYYTSLD-RNLSEYDPSSSS 167
+ IGTP V +D GS+L WVPC C C Y ++ L+ + P+ SS
Sbjct: 25 LSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCE----EYQNNISGPRLAAFLPTHSS 80
Query: 168 SSKNVSCSHPLCKSRSSCKSLKDPC-------------------PYIADYSTEDTSSSGY 208
+S +C C S + DPC P A +G
Sbjct: 81 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGS 140
Query: 209 LVDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 267
L D+L + ++ Q GC +Y + P G+ G G G +S+P L
Sbjct: 141 LTRDVLFTHGNYNNNNNNNKQIPRFCFGC---VGATYRE---PIGIAGFGRGLLSLPFQL 194
Query: 268 AKAGLIQNSFSICF-------DENDSGSVFFGDQGPATQ----QSTSFLPIGEKYDAYFV 316
G FS CF + N S + G+ +++ Q T L + Y++
Sbjct: 195 ---GFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYI 251
Query: 317 GVESYCIGNS--------------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
G+ES IGN T+ L+DSG ++T LP +Y++++ + ++
Sbjct: 252 GLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVI 311
Query: 363 S---SKRISLQGNSWKYCY-------NASSEEMLKVPDMRLIFSKNQSFVV 403
+K++ L + CY N+S + ++P + F N S V+
Sbjct: 312 GYPRAKQVELN-TGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVL 361
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 113/296 (38%), Gaps = 47/296 (15%)
Query: 117 GTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
GTP + L+ALD S+ W+PC C+ C+ + P S+S +NVSC
Sbjct: 104 GTPPQTLLLALDTSSDAAWIPCSGCVGCS------------TSKPFAPIKSTSFRNVSCG 151
Query: 176 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 235
P CK + C + Y + ++S +V D L LA + G
Sbjct: 152 SPHCKQVPNPTCGGSACAFNFTYGSSSIAAS--VVQDTLTLA--------ADPIPGYTFG 201
Query: 236 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVFF 291
C K TGS +AP + ++ L +++FS C N SGS+
Sbjct: 202 CVNKTTGS----SAPQQGLLGLGRGPLSLLSQSQ-NLYKSTFSYCLPSFKSINFSGSLRL 256
Query: 292 GD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSG 340
G P + T L + Y+V + + +G + +G + DSG
Sbjct: 257 GPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSG 316
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
FT L +Y V +F + V K + CYN + VP + +FS
Sbjct: 317 TVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVP----IVVPTITFLFS 368
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/310 (23%), Positives = 124/310 (40%), Gaps = 44/310 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+GTP F + LD GS+L W+ QC+ C Y +N YDP +S+S KN++C+
Sbjct: 166 VGTPPKHFSLILDTGSDLNWL--QCLPC-------YDCFHQNGMFYDPKTSASFKNITCN 216
Query: 176 HPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
P C SS C+S CPY Y ++ + V+ + ++
Sbjct: 217 DPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKV 276
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
+++ GCG G + + G+ L S L +SFS C + N
Sbjct: 277 GNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSS-----QLQSLYGHSFSYCLVDRNSNTN 331
Query: 285 DSGSVFFGDQGPATQQS----TSFLPIGEK--YDAYFVGVESYCIGNSCL---------- 328
S + FG+ + TSF+ E Y++ ++S +G L
Sbjct: 332 VSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNIS 391
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS--SEEM 385
+ ++DSG + ++ Y + KF +K+ + I C+N S E
Sbjct: 392 SDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENN 451
Query: 386 LKVPDMRLIF 395
+ +P++ + F
Sbjct: 452 IHLPELGIAF 461
>gi|46488451|gb|AAS99547.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488453|gb|AAS99548.1| aspartic protease PM5 [Plasmodium vivax]
Length = 536
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/297 (23%), Positives = 123/297 (41%), Gaps = 61/297 (20%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y ++ IDIGTP + LD GS+ L PC C C + ++ ++
Sbjct: 59 YAYYFLDIDIGTPEQRISLILDTGSSSLSFPCAGCKNCGVHME----------NPFNLNN 108
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
S +S + C + C + +C +K C Y+ Y E + SG+ D++ + S++
Sbjct: 109 SKTSSILYCENEECPFKLNC--VKGKCEYMQSY-CEGSQISGFYFSDVVSVVSYN----N 161
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPS----LLAKAGLIQNSFSIC 280
V ++GC + +L A GV+G+ L +P+ L A ++ F+IC
Sbjct: 162 ERVTFRKLMGCHMHEESLFLYQQA-TGVLGMSLSKPQGIPTFVNLLFDNAPQLKQVFTIC 220
Query: 281 FDEN------------------DSGSVFFGDQGPATQ----------------QSTSFLP 306
EN S SV GP ++ + +
Sbjct: 221 ISENGGELIAGGYDPAYIVRRRGSKSVSGQGSGPVSESLSESGEDPQVALREAEKIVWEN 280
Query: 307 IGEKYDAYFVGVESY-CIGNSCLTQS-GFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
+ KY Y++ V G + ++ S G + LVDSG++FT +P ++Y ++ FD L
Sbjct: 281 VTRKY-YYYIKVRGLDMFGTNMMSSSKGLEMLVDSGSTFTHIPEDLYNKLNYFFDIL 336
>gi|46488413|gb|AAS99528.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488415|gb|AAS99529.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488417|gb|AAS99530.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488419|gb|AAS99531.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488421|gb|AAS99532.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488423|gb|AAS99533.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488425|gb|AAS99534.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488427|gb|AAS99535.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488429|gb|AAS99536.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488431|gb|AAS99537.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488433|gb|AAS99538.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488435|gb|AAS99539.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488437|gb|AAS99540.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488439|gb|AAS99541.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488441|gb|AAS99542.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488443|gb|AAS99543.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488445|gb|AAS99544.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488447|gb|AAS99545.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488449|gb|AAS99546.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488455|gb|AAS99549.1| aspartic protease PM5 [Plasmodium vivax]
Length = 536
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/297 (23%), Positives = 123/297 (41%), Gaps = 61/297 (20%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSS 165
Y ++ IDIGTP + LD GS+ L PC C C + ++ ++
Sbjct: 59 YAYYFLDIDIGTPEQRISLILDTGSSSLSFPCAGCKNCGVHME----------NPFNLNN 108
Query: 166 SSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
S +S + C + C + +C +K C Y+ Y E + SG+ D++ + S++
Sbjct: 109 SKTSSILYCENEECPFKLNC--VKGKCEYMQSY-CEGSQISGFYFSDVVSVVSYN----N 161
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPS----LLAKAGLIQNSFSIC 280
V ++GC + +L A GV+G+ L +P+ L A ++ F+IC
Sbjct: 162 ERVTFRKLMGCHMHEESLFLYQQA-TGVLGMSLSKPQGIPTFVNLLFDNAPQLKQVFTIC 220
Query: 281 FDEN------------------DSGSVFFGDQGPATQ----------------QSTSFLP 306
EN S SV GP ++ + +
Sbjct: 221 ISENGGELIAGGYDPAYIVRRGGSKSVSGQGSGPVSESLSESGEDPQVALREAEKIVWEN 280
Query: 307 IGEKYDAYFVGVESY-CIGNSCLTQS-GFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
+ KY Y++ V G + ++ S G + LVDSG++FT +P ++Y ++ FD L
Sbjct: 281 VTRKY-YYYIKVRGLDMFGTNMMSSSKGLEMLVDSGSTFTHIPEDLYNKLNYFFDIL 336
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 103/428 (24%), Positives = 170/428 (39%), Gaps = 70/428 (16%)
Query: 35 EAKERWISKSGNVSVADSWPKK---NSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLL 91
E K R S V DS K N+ E L +R RV+ R +L
Sbjct: 106 ETKPRQTPWSVQVVHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQRIEKRLRLN 165
Query: 92 FPSEGSQTHF------FGNQFY-------WLHYTWIDIGTPNVSFLVALDAGSNLLWVPC 138
GS + FG + ++T I +GTP + LD GS+++W+ C
Sbjct: 166 KDPAGSHENVAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQC 225
Query: 139 Q-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIAD 197
+ C +C Y+ +D ++PS S+S + C+ +C + C Y
Sbjct: 226 EPCSKC-------YSQVD---PIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVS 275
Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
Y + + + G ++L + S +V IGCG G ++ A ++GLG
Sbjct: 276 YG-DGSYTIGSFATEMLTFGTTSVR--------NVAIGCGHDNAGLFVGAAG---LLGLG 323
Query: 258 LGDVSVPSLLAKAGLIQNSFSICFDEN---DSGSVFFGDQG-PATQQSTSFLPIGEKYDA 313
G +S PS L +FS C + SG++ FG + P T L
Sbjct: 324 AGLLSFPSQLGTQ--TGRAFSYCLVDRFSESSGTLEFGPESVPLGSILTPLLTNPSLPTF 381
Query: 314 YFVGVESYCIGNSCLT--------------QSGFQALVDSGASFTFLPTEIYAEV----V 355
Y+V + S +G + L + GF +VDSG + T L T +Y V V
Sbjct: 382 YYVPLISISVGGALLDSVPPDVFRIDETSGRGGF--IVDSGTAVTRLQTPVYDAVRDAFV 439
Query: 356 VKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEV 415
+L ++ +S+ + CY+ S ++ VP + FS S ++ + P + +
Sbjct: 440 AGTRQLPKAEGVSI----FDTCYDLSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFM 495
Query: 416 GDHACFSY 423
G CF++
Sbjct: 496 GTF-CFAF 502
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 128/327 (39%), Gaps = 62/327 (18%)
Query: 103 GNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYD 162
G + L+Y +G V +D S L WV QCAP + + D+ +D
Sbjct: 145 GAKLRTLNYVAT-VGLGGGEATVIVDTASELTWV-----QCAPCESCH----DQQDPLFD 194
Query: 163 PSSSSSSKNVSCSHPLCKS-----------RSSCKSLKD---PCPYIADYSTEDTSSSGY 208
PSSS S V C+ C + ++C+ C Y Y + + S G
Sbjct: 195 PSSSPSYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYR-DGSYSRGV 253
Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLL 267
L D L LA V + GCG G G + G+MGLG +S V +
Sbjct: 254 LAHDRLSLAG--------EVIDGFVFGCGTSNQGPPFGGTS--GLMGLGRSQLSLVSQTM 303
Query: 268 AKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVE 319
+ G + FS C + + SGS+ GD + ST + D YFV +
Sbjct: 304 DQFGGV---FSYCLPLKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLT 360
Query: 320 SYCIGNSCL-------TQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRIS 368
+G + G +A++DSG T L IY AE + +F + + S
Sbjct: 361 GITVGGQEVESSGFSSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFS 420
Query: 369 LQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ C+N + ++VP ++L+F
Sbjct: 421 I----LDTCFNMTGLREVQVPSLKLVF 443
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/283 (27%), Positives = 113/283 (39%), Gaps = 79/283 (27%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPC----QCIQC-APLSASYYTSLDRNLSEYDPSSSSSSK 170
+GTP V LD GS L WVPC C C +P +A+ + + P +SSSS+
Sbjct: 109 LGTPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFAAA--------VPVFHPKNSSSSR 160
Query: 171 NVSCSHPLC---KSRSSCKSLKDPC--------------PYIADYSTEDTSSSGYLVDDI 213
V C +P C S + PC PY Y + T +G L+ D
Sbjct: 161 LVGCRNPSCLWVHSAEHVAKCRAPCSRGANCTPASNVCPPYAVVYGSGST--AGLLIADT 218
Query: 214 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 273
L AP +V S ++GC P G+ G G G SVP A+ GL
Sbjct: 219 L-------RAPGRAV-SGFVLGCSLVSVHQ-----PPSGLAGFGRGAPSVP---AQLGL- 261
Query: 274 QNSFSIC-----FDEND--SGSVFFGDQGPATQQSTSFLPI-----GEKYD---AYFVGV 318
+ FS C FD+N SGS+ G Q ++P+ G+K Y++ +
Sbjct: 262 -SKFSYCLLSRRFDDNAAVSGSLVLGGDNDGMQ----YVPLVKSAAGDKQPYAVYYYLAL 316
Query: 319 ESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIY 351
+G + A+VDSG +FT+L ++
Sbjct: 317 SGVTVGGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLDPTVF 359
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 140/339 (41%), Gaps = 63/339 (18%)
Query: 97 SQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDR 156
S + G Y++ + +GTP F++ D GS+L WV C S + + D
Sbjct: 102 SSGAYTGTGQYFVRFR---VGTPAQPFVLVADTGSDLTWVKC--------SGAGDGTGDA 150
Query: 157 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLV 210
+ ++S S ++CS C S ++C S PC Y DY D S++ G +
Sbjct: 151 PRRVFRAAASRSWAPIACSSDTCTSYVPFSLANCSSPASPCAY--DYRYNDGSAARGVVG 208
Query: 211 DDILHLA-----SFSKHAPQSSVQSSVIIGCGRKQTGSYLDG---AAPDGVMGLGLGDVS 262
D +A S ++ +Q V++GC T SY DG + DGV+ LG ++S
Sbjct: 209 TDSATIALSGSESRDGGGRRAKLQ-GVVLGC----TASY-DGQSFQSSDGVLSLGNSNIS 262
Query: 263 VPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGP-----------ATQQSTSFLP 306
S A FS C N + + FG GP + T L
Sbjct: 263 FASR--AAARFGGRFSYCLVDHLAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLL 320
Query: 307 IGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
Y V V++ + L G A++DSG S T L T Y VV
Sbjct: 321 DRRMSPFYAVAVDAVHVAGEALDIPADVWDVARGGGAILDSGTSLTVLATPAYRAVVAAL 380
Query: 359 -DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
++L R+S+ + ++YCYN ++ L++P + + F+
Sbjct: 381 SERLAGLPRVSM--DPFEYCYNWTA-AALEIPGLEVRFA 416
>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 79/328 (24%), Positives = 145/328 (44%), Gaps = 53/328 (16%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
T + G ++++ALD +NLLW+ C+ +Q +T L ++P+ S S +
Sbjct: 88 TSVGTGAGRRTYVLALDMTTNLLWMQCKPVQ------EPFTQLP---PPFEPAKSPSFRR 138
Query: 172 VSCSHPLC--KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+ ++ C R ++++DPC + + + G L ++ L +F+ Q +
Sbjct: 139 LPGNNAFCLPAPRGHRRTVQDPCKFHSIRLDGSADARGVLSNETL---AFAASGQQQTEV 195
Query: 230 SSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLL-----AKAGLIQ-NSFSICFD 282
+ V+IGC G ++ GV+GLG PSL+ + G +Q + FS C
Sbjct: 196 TGVVIGCTHNSKGFNFNSHGVLAGVLGLGR---QAPSLIWTLGQHRHGTVQVHRFSYCLP 252
Query: 283 ENDSGS------VFFGDQGPATQQ--STSFLPI----GEKYDAYFVGVESYCIGNSCL-- 328
+ S S + F D P TQ ST + + + AYFV + + L
Sbjct: 253 SHGSSSSDHHTFLRFDDDVPNTQHMVSTKIMYMDSTTSRDFRAYFVSLTGISVAGKPLQD 312
Query: 329 TQSGFQALVD-----SGASF-TFLPTEIYAEVVVKFDKLVSS-----KRISLQGNSWKY- 376
+ F+ V SG +F PT + ++ ++KL + K + LQ S +Y
Sbjct: 313 VKELFKRHVHGQVWTSGCAFDAGTPTMVM--IMPAYNKLKDAVVRHLKPLGLQIVSGQYH 370
Query: 377 -CYNASSEEMLKVPDMRLIFSKNQSFVV 403
C+ A+S+ +P + L F++ ++ +V
Sbjct: 371 LCFRATSQLWQHLPTVMLQFAETEARLV 398
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 86/333 (25%), Positives = 140/333 (42%), Gaps = 48/333 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+G P FL+ +D GS+L W +QC P A + D++ +DPS S+S K + C+
Sbjct: 93 VGNPPRHFLLIIDTGSDLTW-----LQCKPCKACF----DQSGPVFDPSQSTSFKIIPCN 143
Query: 176 --------HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
H C+ SS S K C Y Y + + +SG L + L + S S H P S
Sbjct: 144 AAACDLVVHDECRDNSSKTSPKT-CKYFYWYG-DSSRTSGDLALESLSV-SLSDH-PSSL 199
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND-- 285
++IGCG G + ++GLG G +S PS L ++ I SFS C +
Sbjct: 200 EIRDMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RSSPIGQSFSYCLVDRTNN 255
Query: 286 ---SGSVFFGDQGPATQQ--STSFLPIGEKYDA----YFVGVESYCIGNSCL-------- 328
S ++ FG ++ F P ++ Y++G++ I L
Sbjct: 256 LSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFA 315
Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
T ++DSG + T+L + Y V F +S R + CYNA+ +
Sbjct: 316 IATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-PFDILGICYNATGRAAV 374
Query: 387 KVPDMRLIFSKNQSF-VVRNHIFSFPENEVGDH 418
P + ++F + + + F P+ + H
Sbjct: 375 PFPALSIVFQNGAELDLPQENYFIQPDPQEAKH 407
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 130/333 (39%), Gaps = 68/333 (20%)
Query: 97 SQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDR 156
S + G Y++ + +GTP F++ D GS+L WV C+ P S D
Sbjct: 4 SSGAYTGTGQYFVRFR---VGTPAQPFVLVADTGSDLTWVKCRGAAGPPAS-------DP 53
Query: 157 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLV 210
E+ S S S ++CS C S ++C S PC Y DY +D S++ G +
Sbjct: 54 PAREFRASESRSWAPLACSSDTCTSYVPFSLANCSSPASPCAY--DYRYKDGSAARGVVG 111
Query: 211 DDILHLA-------SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
D +A S + + V++GC G + DGV+ LG ++S
Sbjct: 112 TDAATIALSGSGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSS--DGVLSLGNSNISF 169
Query: 264 PSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPI----------- 307
S A FS C N S + FG + P+
Sbjct: 170 ASR--AAARFGGRFSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYA 227
Query: 308 ---------GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 358
GE D + + + +G G A++DSG S T L T Y VV
Sbjct: 228 VAVDAVYVAGEALD---IPADVWDVGR------GGGAILDSGTSLTVLATPAYRAVVAAL 278
Query: 359 -DKLVSSKRISLQGNSWKYCYN--ASSEEMLKV 388
+L + R+++ + ++YCYN A + E+ K+
Sbjct: 279 GGRLAALPRVAM--DPFEYCYNWTAGAPEIPKL 309
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 101/399 (25%), Positives = 159/399 (39%), Gaps = 67/399 (16%)
Query: 54 PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQT--HFFGNQFYWLHY 111
P K+ ++ LL +D R R + S + +R + S +Q H + ++
Sbjct: 64 PPKSRLDGTRQLLQSDNAR---RQMISSLRHGTRRKAFEVSHTAQIPIHSGADSGQSQYF 120
Query: 112 TWIDIGTPNV-SFLVALDAGSNLLWVPCQ-----CIQCAPLSASYYTSLDRNLSEYDPSS 165
I IGTP F++ D GS+L W+ C+ C + P + + D
Sbjct: 121 VSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRAND---------- 170
Query: 166 SSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
SSS + + CS CK S + C + PC + Y + G ++ + +
Sbjct: 171 SSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRY-LNGPRAIGVFANETVTVG- 228
Query: 219 FSKHAPQSSVQSSVIIGCGR--KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 276
+ H + V+IGC +T + PDGVMGLG S+ LA+ + N
Sbjct: 229 LNDH--KKIRLFDVLIGCTESFNETNGF-----PDGVMGLGYRKHSLALRLAE--IFGNK 279
Query: 277 FSICFDENDSGS-----VFFGD----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
FS C ++ S S + FGD + P Q + L +G Y V V +G S
Sbjct: 280 FSYCLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTE--LLLGYINAFYPVNVSGISVGGSM 337
Query: 328 LTQSG--------FQALVDSGASFTFLPTEIYAEVVVK----FDKLVSSKRISLQGNSWK 375
L+ S +VDSG S T L E Y +VV FDK I L +
Sbjct: 338 LSISSDIWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELN-N 396
Query: 376 YCYNASSEEMLKVPDMRLIFSKNQSF--VVRNHIFSFPE 412
+C+ + VP + + F+ F V+++I E
Sbjct: 397 FCFEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAE 435
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 78/310 (25%), Positives = 121/310 (39%), Gaps = 47/310 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ T + +GTP +++ +D GS+L W +QC+P S + ++ +DP +SSS
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTW-----LQCSPCRVSCH---RQSGPVFDPKTSSSY 188
Query: 170 KNVSCSHPLCKSRSSCK------SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 223
VSCS P C S+ S D C Y A Y + + S GYL D + S S
Sbjct: 189 AAVSCSTPQCNDLSTATLNPAACSSSDVCIYQASYG-DSSFSVGYLSKDTVSFGSNSV-- 245
Query: 224 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 283
+ GCG+ G + A G+MGL +S+ L A + SFS C
Sbjct: 246 ------PNFYYGCGQDNEGLFGRSA---GLMGLARNKLSL--LYQLAPTLGYSFSYCLPS 294
Query: 284 NDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-----QSGFQALV 337
+ S P T + YF+ + + L S ++
Sbjct: 295 SSSSGYLSIGSYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTII 354
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN------ASSEEMLKVPDM 391
DSG T LPT +Y D L + +++G Y+ L+VP +
Sbjct: 355 DSGTVITRLPTTVY-------DALSKAVAGAMKGTKRADAYSILDTCFVGQASSLRVPAV 407
Query: 392 RLIFSKNQSF 401
+ FS +
Sbjct: 408 SMAFSGGAAL 417
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 81/322 (25%), Positives = 134/322 (41%), Gaps = 62/322 (19%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP F + +D GS+L W+ C C+ C ++ +DP++SSS +NV+C
Sbjct: 157 VGTPPRRFRMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNVTC 206
Query: 175 SHPLC------------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
C R+ + +DPCPY Y + ++ L L SF+ +
Sbjct: 207 GDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGD------LALESFTVN 260
Query: 223 --APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
AP +S + V+ GCG + G + A G+ L S L A G ++FS
Sbjct: 261 LTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSY 315
Query: 280 CFDEN--DSGS-VFFGDQGPATQ-------QSTSFLPIGEKYDA----YFVGVESYCIGN 325
C ++ D GS V FG+ A + T+F P Y+V ++ +G
Sbjct: 316 CLVDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGG 375
Query: 326 SCLTQS----------GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSW 374
L S ++DSG + ++ Y + F D++ S + +
Sbjct: 376 ELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVL 435
Query: 375 KYCYNASSEEMLKVPDMRLIFS 396
CYN S E +VP++ L+F+
Sbjct: 436 SPCYNVSGVERPEVPELSLLFA 457
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/328 (25%), Positives = 129/328 (39%), Gaps = 57/328 (17%)
Query: 65 LLSNDWKRQKTRVK----LQSNNNSSR------NQLLFPSEGSQTHFFGNQFYWLHYTWI 114
L+ +R K R +++ S+R +Q P G G+ Y + +
Sbjct: 50 LIRRAMQRSKARAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVD---L 106
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP LD GS+L+W QC CA + L + + P S+S + + C
Sbjct: 107 AIGTPPQPVSALLDTGSDLIWT--QCAPCA-------SCLAQPDPLFAPGESASYEPMRC 157
Query: 175 SHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
+ LC C+ + D C Y +Y + Y + +F+ + +
Sbjct: 158 AGQLCSDILHHGCE-MPDTCTYRYNYGDGTMTMGVYATERF----TFTSSGGDRLMTVPL 212
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG---SV 289
GCG GS +G+ G++G G +S+ S L+ FS C SG ++
Sbjct: 213 GFGCGSMNVGSLNNGS---GIVGFGRNPLSLVSQLSI-----RRFSYCLTSYGSGRKSTL 264
Query: 290 FFGD-----QGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ------ 334
FG G AT Q+T L + Y+V + +G L +S F
Sbjct: 265 LFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGS 324
Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDK 360
+VDSG + T LP + AEVV F +
Sbjct: 325 GGVIVDSGTALTLLPGAVLAEVVRAFRQ 352
>gi|213998826|gb|ACJ60780.1| nucellin [Hordeum intercedens]
Length = 148
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 66/138 (47%), Gaps = 4/138 (2%)
Query: 229 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 286
+ V GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKVAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 345
G ++ GD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMKESLFYYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124
Query: 346 LPTEIYAEVVVKFDKLVS 363
+P +IY E+V K +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 126/312 (40%), Gaps = 37/312 (11%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+GTP V+ + +D GS+L WV C+ AP S Y+ D +DP+ SSS V C
Sbjct: 54 LGTPGVAQTMEVDTGSDLSWVQCKPCAAAP---SCYSQKD---PLFDPAQSSSYAAVPCG 107
Query: 176 HPLCKSRS---SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
P+C + C Y+ Y + ++++G D L L++ SS
Sbjct: 108 GPVCAGLGIYAASACSAAQCGYVVSYG-DGSNTTGVYSSDTLTLSA-------SSAVQGF 159
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDS--GSV 289
GCG Q+G + +G DG++GLG PSL+ + AG FS C S G +
Sbjct: 160 FFGCGHAQSGLF-NGV--DGLLGLGR---EQPSLVEQTAGTYGGVFSYCLPTKPSTAGYL 213
Query: 290 FFGDQGPATQQ----STSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQALVDSGASF 343
G GP+ +T LP Y V + +G L+ S F
Sbjct: 214 TLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGT 273
Query: 344 TF--LPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 399
LP YA + F ++S + CYN + + +P++ L F
Sbjct: 274 VVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGA 333
Query: 400 SFVV-RNHIFSF 410
+ + + I SF
Sbjct: 334 TVTLGADGILSF 345
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 79/331 (23%), Positives = 141/331 (42%), Gaps = 51/331 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP + L+ LD GS+++W P + + L R + + + ++ +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWAPVRALP----------PLLRAVRQGSSTGAAPA 171
Query: 170 KNV--SCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
+C P+C+ S C ++ C Y Y + + ++G + L A ++
Sbjct: 172 PTPRWNCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFARGAR---- 226
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DEN 284
VQ V IGCG G ++ A G++GLG G +S PS +A++ SFS C D
Sbjct: 227 --VQ-RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRT 278
Query: 285 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS---CLTQSGFQ------- 334
S + T + +F Y+V + + +G + ++QS +
Sbjct: 279 SSRRARPSRRWGGTPRMATF---------YYVHLLGFSVGGARVKGVSQSDLRLNPTTGR 329
Query: 335 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDM 391
++DSG S T L +Y V F R+S G S + CYN S ++KVP +
Sbjct: 330 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTV 389
Query: 392 RLIFSKNQSFVVRNHIFSFPENEVGDHACFS 422
+ + S + + P + G CF+
Sbjct: 390 SMHLAGGASVALPPENYLIPVDTSGTF-CFA 419
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 140/326 (42%), Gaps = 48/326 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T + +GTP + LD GS+++W +QC+P Y ++ ++P S S
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVW-----LQCSPCRKCY----SQSDPIFNPYKSKSF 160
Query: 170 KNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
+ CS PLC+ S C + + C Y Y + + ++G + L + +
Sbjct: 161 AGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYG-DGSFTTGDFATETLTF--------RGN 211
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL-IQNSFSICFDENDS 286
+ V +GCG G ++ A ++GLG G +S PS + G+ + FS C + +
Sbjct: 212 KIAKVALGCGHHNEGLFVGAAG---LLGLGRGRLSFPS---QTGIRFNHKFSYCLVDRSA 265
Query: 287 ----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SCLTQSGFQ-- 334
S+ FGD A + F P+ K D Y+VG+ +G ++ S F+
Sbjct: 266 SSKPSSMVFGDA--AISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLD 323
Query: 335 ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
++DSG S T L Y + F + + + + CY+ S + +KV
Sbjct: 324 SAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKV 383
Query: 389 PDMRLIF-SKNQSFVVRNHIFSFPEN 413
P + L F + + N++ EN
Sbjct: 384 PTVVLHFRGADMALPATNYLIPVDEN 409
>gi|316927704|gb|ADU58605.1| xyloglucan-specific endoglucanase inhibitor 4 [Solanum tuberosum]
Length = 440
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 123/340 (36%), Gaps = 77/340 (22%)
Query: 112 TWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKN 171
T I TP V + +D G LWV C+ Y SSS K
Sbjct: 47 TTISQRTPLVPVKLTIDLGQRFLWVDCE--------KGYV--------------SSSYKP 84
Query: 172 VSCSHPLCKSR------SSCKSLKDP------CPYIADYSTEDTSSSGYLVDDILHLASF 219
V C CK SC P C +I TS+ G L D++ L S
Sbjct: 85 VPCGSIPCKRSLSGACVESCVGPPSPGCNNNTCSHIPYNHFIRTSTGGELAQDVVSLQST 144
Query: 220 SKHAPQSSVQSS-VIIGCGRKQTGSYLDGAAP--DGVMGLGLGDVSVPSLLAKAGLIQNS 276
P+ + ++ V+ C S L+G A G++GLG G V P+ LA A +
Sbjct: 145 DGSNPRKYLSTNGVVFDCAPH---SLLEGLAKGVKGILGLGNGYVGFPTQLANAFSVPRK 201
Query: 277 FSICFDENDS--GSVFFGDQ------GPATQQSTSFLPI-------------GEKYDAYF 315
F+IC + + G +FFGD G + + P+ GE YF
Sbjct: 202 FAICLTSSTTSRGVIFFGDSPYVFLPGMDVSKRLVYTPLLKNPVSTSGSYFEGEPSTDYF 261
Query: 316 VGVESYCI-GNSCLTQSGFQALVDSGAS---------FTFLPTEIYAEVVVKFDK-LVSS 364
+GV S I GN + + G +T L T IY + F K L
Sbjct: 262 IGVTSIKINGNVVPINTTLLNITKDGKGGTKISTVDPYTKLETSIYNALTKAFVKSLAKV 321
Query: 365 KRISLQGNSWKYCYNASSEEMLK----VPDMRLIFSKNQS 400
R+ +K CYN +S + VP + L+ +
Sbjct: 322 PRVKPVA-PFKVCYNRTSLGSTRVGRGVPPIELVLGNKNA 360
>gi|213998845|gb|ACJ60789.1| nucellin [Psathyrostachys fragilis subsp. fragilis]
Length = 150
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 69/138 (50%), Gaps = 4/138 (2%)
Query: 229 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 286
+ ++ GCG KQ +P DG++GLG+G + L +I +N C
Sbjct: 4 KKNIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGK 63
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 345
G ++ GD P T+ T ++P+ E Y G+ + I + F+A+ DSG+++T+
Sbjct: 64 GVLYVGDFNPPTRGVT-WVPMRESLFYYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTY 122
Query: 346 LPTEIYAEVVVKFDKLVS 363
+P +IY E+V K +S
Sbjct: 123 VPAQIYNELVSKIRGTLS 140
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 81/339 (23%), Positives = 132/339 (38%), Gaps = 54/339 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP L+A+D S++ W+PC C+ C +A + P+ S+S KNVSC
Sbjct: 121 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNVSC 168
Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
S P CK + C + Y + +++ L D + LA+ A
Sbjct: 169 SAPQCKQVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA--------FTF 218
Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVF 290
GC K G G P LGLG + + + +++FS C SGS+
Sbjct: 219 GCVNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLR 275
Query: 291 FGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDS 339
G P + T L + Y+V + + +G + +G + DS
Sbjct: 276 LGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDS 335
Query: 340 GASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
G +T L +Y V +F K V ++ SL G + CY+ +KVP + +F
Sbjct: 336 GTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGG--FDTCYSG----QVKVPTITFMFK 389
Query: 397 K-NQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
N + N + + G +C + N ++
Sbjct: 390 GVNMTMPADNLML---HSTAGSTSCLAMAAAPENVNSVV 425
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 81/339 (23%), Positives = 132/339 (38%), Gaps = 54/339 (15%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP L+A+D S++ W+PC C+ C +A + P+ S+S KNVSC
Sbjct: 105 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNVSC 152
Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
S P CK + C + Y + +++ L D + LA+ A
Sbjct: 153 SAPQCKQVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA--------FTF 202
Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSVF 290
GC K G G P LGLG + + + +++FS C SGS+
Sbjct: 203 GCVNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLR 259
Query: 291 FGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDS 339
G P + T L + Y+V + + +G + +G + DS
Sbjct: 260 LGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDS 319
Query: 340 GASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
G +T L +Y V +F K V ++ SL G + CY+ +KVP + +F
Sbjct: 320 GTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGG--FDTCYSG----QVKVPTITFMFK 373
Query: 397 K-NQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
N + N + + G +C + N ++
Sbjct: 374 GVNMTMPADNLML---HSTAGSTSCLAMAAAPENVNSVV 409
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 97/335 (28%), Positives = 143/335 (42%), Gaps = 51/335 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+++ I +G P L+ LD GS++ W IQC P S Y S Y+P+ SSS
Sbjct: 145 YFSRIGVGAPRRDQLMVLDTGSDVTW-----IQCEPCSDCYQQS----DPIYNPALSSSY 195
Query: 170 KNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
K V C LC+ S C S C Y Y + + + G + L L AP
Sbjct: 196 KLVGCQANLCQQLDVSGC-SRNGSCLYQVSYG-DGSYTQGNFATETLTLGG----AP--- 246
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDENDS 286
+V IGCG G ++ A ++GLG G +S PS L + G I FS C + DS
Sbjct: 247 -LQNVAIGCGHDNEGLFVGAAG---LLGLGGGSLSFPSQLTDENGKI---FSYCLVDRDS 299
Query: 287 GS---VFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQS----GFQA--- 335
S + FG + + + D Y+V + +G L+ S G A
Sbjct: 300 ESSSTLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGN 359
Query: 336 ---LVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYNASSEEMLKV 388
+VDSG + T L T Y + F L S+ +SL + CY+ SS+E + V
Sbjct: 360 GGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSL----FDTCYDLSSKESVDV 415
Query: 389 PDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
P + FS S + + P + +G CF++
Sbjct: 416 PTVVFHFSGGGSMSLPAKNYLVPVDSMGTF-CFAF 449
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 111/265 (41%), Gaps = 33/265 (12%)
Query: 103 GNQFYWLHYTW-IDIGTPNVSFLVALDAGSNLLWVPCQ--CIQCAPLSASYYTSLDRNLS 159
GN + +YT + IG P + + +D GS+L WV C C C +L RN
Sbjct: 56 GNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGC---------TLPRN-R 105
Query: 160 EYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
Y P V C PLC + S C + C Y +Y+ + SS G L+ D +
Sbjct: 106 LYKPHGDL----VKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYA-DQGSSLGVLLRDNI 160
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD--GVMGLGLGDVSVPSLLAKAGL 272
L K S + + GCG QT + P GV+GLG G S+ S L GL
Sbjct: 161 PL----KFTNGSLARPMLAFGCGYDQT-HHGQNPPPSTAGVLGLGNGRTSILSQLHSLGL 215
Query: 273 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQ 330
I+N C G +FFGDQ + P+ + A Y G +
Sbjct: 216 IRNVVGHCLSGRGGGFLFFGDQL-IPPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTTSV 274
Query: 331 SGFQALVDSGASFTFLPTEIYAEVV 355
G + + DSG+S+T+ ++ + +V
Sbjct: 275 KGLELIFDSGSSYTYFNSQAHKALV 299
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 81/341 (23%), Positives = 134/341 (39%), Gaps = 54/341 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP L+A+D S++ W+PC C+ C +A + P+ S+S KNV
Sbjct: 103 VLIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNV 150
Query: 173 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
SCS P CK + C + Y + +++ L D + LA+ A
Sbjct: 151 SCSAPQCKQVPNPACGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA--------F 200
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGS 288
GC K G G P LGLG + + + +++FS C SGS
Sbjct: 201 TFGCVNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGS 257
Query: 289 VFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALV 337
+ G P + T L + Y+V + + +G + +G +
Sbjct: 258 LRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIF 317
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 394
DSG +T L +Y V +F K V ++ SL G + CY+ +KVP + +
Sbjct: 318 DSGTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGG--FDTCYSG----QVKVPTITFM 371
Query: 395 FSK-NQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
F N + N + + G +C + + N ++
Sbjct: 372 FKGVNMTMPADNLML---HSTAGSTSCLAMASAPENVNSVV 409
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 78/332 (23%), Positives = 129/332 (38%), Gaps = 66/332 (19%)
Query: 63 ELLLSNDWKRQKTRVK-LQS-------NNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWI 114
E LLS +R RV LQS + ++ L+ S+G G
Sbjct: 47 EQLLSRALRRSSARVATLQSLAALAPGDAITAARILVLASDGEYLMEMG----------- 95
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
IGTP + LD GS+L+W C C+ C +D+ +DP+ S++ +++
Sbjct: 96 -IGTPTRYYSAILDTGSDLIWTQCAPCLLC----------VDQPTPYFDPARSATYRSLG 144
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C+ P C + + C Y Y + S++G L ++ +F + + S+ +
Sbjct: 145 CASPACNALYYPLCYQKVCVYQYFYG-DSASTAGVLANETF---TFGTNETRVSL-PGIS 199
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVF 290
GCG GS +G+ G++G G G +S+ S L FS C F ++
Sbjct: 200 FGCGNLNAGSLANGS---GMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVPSRLY 251
Query: 291 FG--------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQS 331
FG + QST F+ YF+ + +G L T
Sbjct: 252 FGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDG 311
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 363
++DSG + T+L Y V F ++
Sbjct: 312 TGGTIIDSGTTITYLAEPAYDAVRAAFASQIT 343
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 135/332 (40%), Gaps = 67/332 (20%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + + +D GS L W+ C + SY T+ +DP+ S+S + +
Sbjct: 35 LTVGTPPQNVSMVIDTGSELSWLHCN------KTLSYPTT-------FDPTRSTSYQTIP 81
Query: 174 CSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
CS P C +R+ SC S + C Y+ + +SS G L D+ H+ S
Sbjct: 82 CSSPTCTNRTQDFPIPASCDS-NNLCHATLSYA-DASSSDGNLASDVFHIG--------S 131
Query: 227 SVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 285
S S ++ GC S D + G+MG+ G +S S L FS C D
Sbjct: 132 SDISGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFP-----KFSYCISGTD 186
Query: 286 -SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSG 332
SG + G+ P Q ST LP ++ AY V +E + + L +S
Sbjct: 187 FSGLLLLGESNLTWSVPLNYTPLIQISTP-LPYFDRV-AYTVQLEGIKVLDKLLPIPKST 244
Query: 333 F--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY------CY 378
F Q +VDSG FTFL +Y + F SS L+ + + CY
Sbjct: 245 FEPDHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCY 304
Query: 379 --NASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
S + +P + L+F + V + +
Sbjct: 305 LVPLSQRVLPLLPTVTLVFRGAEMTVSGDRVL 336
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 82/335 (24%), Positives = 133/335 (39%), Gaps = 70/335 (20%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +GTP + + +D GS L W+ C A + + ++P+ SSS +S
Sbjct: 70 ITVGTPPQNMSMVIDTGSELSWLHCNTNTTATIPYPF----------FNPNISSSYTPIS 119
Query: 174 CSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 226
CS P C +R+ SC S + C Y+ + +SS G L D S
Sbjct: 120 CSSPTCTTRTRDFPIPASCDS-NNLCHATLSYA-DASSSEGNLASDTFGFG--------S 169
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 282
S ++ GC SY + D G+MG+ LG +S+ S L FS C
Sbjct: 170 SFNPGIVFGC---MNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIP-----KFSYCIS 221
Query: 283 END-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 331
+D SG + G+ P Q ST LP ++ AY V +E I + L S
Sbjct: 222 GSDFSGILLLGESNFSWGGSLNYTPLVQISTP-LPYFDR-SAYTVRLEGIKISDKLLNIS 279
Query: 332 G----------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY----- 376
G Q + D G F++L +Y + +F + +L ++ +
Sbjct: 280 GNLFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMD 339
Query: 377 -CYN--ASSEEMLKVPDMRLIFSKNQSFVVRNHIF 408
CY + E+ ++P + L+F + V + +
Sbjct: 340 LCYRVPVNQSELPELPSVSLVFEGAEMRVFGDQLL 374
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 89/344 (25%), Positives = 142/344 (41%), Gaps = 55/344 (15%)
Query: 78 KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWL------HYTWIDIGTPNVSFLVALDAGS 131
KL S +SRN G T F + L ++T I +GTP + LD GS
Sbjct: 7 KLSSLGATSRN---LSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGS 63
Query: 132 NLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS--CKSLK 189
+++W +QCAP Y + ++P S S V C PLC+ S C +
Sbjct: 64 DIVW-----LQCAPCKNCY----SQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQ-R 113
Query: 190 DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA 249
C Y Y + + ++G V + L + + V +GCG G ++ A
Sbjct: 114 QTCLYQVSYG-DGSYTTGEFVTETLTF--------RRTKVEQVALGCGHDNEGLFVGAAG 164
Query: 250 PDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS----GSVFFGDQGPATQQSTSFL 305
++GLG G +S PS + FS C + + SV FG+ A ++ F
Sbjct: 165 ---LLGLGRGGLSFPSQAGRT--FNQKFSYCLVDRSASSKPSSVVFGNS--AVSRTARFT 217
Query: 306 PI--GEKYDA-YFVGVESYCIGN---SCLTQSGFQ--------ALVDSGASFTFLPTEIY 351
P+ + D Y+V + +G S +T S F+ ++D G S T L Y
Sbjct: 218 PLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAY 277
Query: 352 AEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 395
+ F SS + + + + + CY+ S + +KVP + L F
Sbjct: 278 IALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHF 321
>gi|302696543|ref|XP_003037950.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
gi|300111647|gb|EFJ03048.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
Length = 406
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 78/312 (25%), Positives = 126/312 (40%), Gaps = 70/312 (22%)
Query: 99 THFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLD 155
T+F Q++ T I +GTP +F V LD GS+ LWVP C I C L A
Sbjct: 88 TNFMNAQYF----TEITLGTPPQNFKVILDTGSSNLWVPSSKCTSIACF-LHA------- 135
Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 215
+YD S+SS+ K + S + G++ D+L
Sbjct: 136 ----KYDSSASSTYKQNGTEFSIQYGSGSME--------------------GFVSQDVLT 171
Query: 216 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAK 269
+ + P +V K+ G DG++GLG +SV + +
Sbjct: 172 IGDLT--IPGQDFAEAV------KEPGLTFAFGKFDGILGLGYDTISVNHIVPPHYNMIN 223
Query: 270 AGLIQN---SFSICFDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYF-VGVESYCIG 324
GL+ SF + E D G FG A + +++P+ K AY+ V +E G
Sbjct: 224 KGLLDEPVFSFRLGKSEEDGGEAIFGGVDKSAYKGDLTYVPVRRK--AYWEVELEKISFG 281
Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
+ L A +D+G S LPT++ AE++ + + +K+ SW Y +
Sbjct: 282 SEELELESTGAAIDTGTSLIALPTDM-AEMI---NAEIGAKK------SWNGQYQVECSK 331
Query: 385 MLKVPDMRLIFS 396
+ +P++ L F
Sbjct: 332 VPDLPELSLYFG 343
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 72/304 (23%), Positives = 118/304 (38%), Gaps = 39/304 (12%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +GTP V L D GS+L+W QC P Y ++ +DP S + K +
Sbjct: 98 ISLGTPPVPMLGIADTGSDLIWR-----QCLPCPNCY----EQVEPLFDPKESETYKTLD 148
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSV 232
C + C+ S D YS D S + G L D L + S ++ P S +
Sbjct: 149 CDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGS-TEGDPASF--PGI 205
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSG 287
GCG G++ + +GLG + ++ + + FS C D S
Sbjct: 206 AFGCGHDNGGTFNE----KDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSS 261
Query: 288 SVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSGF------------ 333
+ FG G + T P+ G Y++ +E +G+ + GF
Sbjct: 262 KINFGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEE 321
Query: 334 -QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++DSG + T LP + Y +V + + + + CY SS L++P +
Sbjct: 322 GNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCY--SSVNNLEIPTIT 379
Query: 393 LIFS 396
F+
Sbjct: 380 AHFT 383
>gi|213998798|gb|ACJ60766.1| nucellin [Hordeum brevisubulatum subsp. violaceum]
Length = 141
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 67/139 (48%), Gaps = 4/139 (2%)
Query: 232 VIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSV 289
+ GCG KQ +P DG++GLG+G + L +I +N C G +
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMIKENVIGHCLSSKGKGVL 60
Query: 290 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPT 348
+ GD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T +P
Sbjct: 61 YVGDFNPPSRGVT-WVPMRESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 119
Query: 349 EIYAEVVVKFDKLVSSKRI 367
+IY E+V K +S +
Sbjct: 120 QIYNEIVSKVRGTLSEPSL 138
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 78/316 (24%), Positives = 129/316 (40%), Gaps = 56/316 (17%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
+GTP F + +D GS+L W+ C C+ C D+ +DP++SSS +NV+C
Sbjct: 157 VGTPPRRFRMIMDTGSDLNWLQCAPCLDC----------FDQVGPVFDPAASSSYRNVTC 206
Query: 175 SHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH--APQ 225
C R+ + +D CPY Y + ++ L L SF+ + AP
Sbjct: 207 GDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGD------LALESFTVNLTAPG 260
Query: 226 SSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+S + V+ GCG G + A G+ L S L A G ++FS C ++
Sbjct: 261 ASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSYCLVDH 315
Query: 285 DS---GSVFFGDQGPATQQS-------TSFLPIGEKYDA-YFVGVESYCIGNSCLTQSG- 332
S V FG+ + T+F P D Y+V ++ +G L S
Sbjct: 316 GSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSD 375
Query: 333 -----------FQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNA 380
++DSG + ++ Y + F D++ S + CYN
Sbjct: 376 TWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNV 435
Query: 381 SSEEMLKVPDMRLIFS 396
S + +VP++ L+F+
Sbjct: 436 SGVDRPEVPELSLLFA 451
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 79/359 (22%), Positives = 131/359 (36%), Gaps = 86/359 (23%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSS---- 165
++ + IGTP + L+ D GS+L+WV +C+P RN S P S
Sbjct: 86 YFVSLRIGTPPQTLLLVADTGSDLIWV-----KCSPC---------RNCSHRSPGSAFFA 131
Query: 166 --SSSSKNVSCSHPLCK-----SRSSCK--SLKDPCPYIADYSTEDTSSSGYLVDDILHL 216
S++ + C P C+ + C L PC Y Y+ + ++++G+ + L L
Sbjct: 132 RHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYA-DSSTTTGFFSKEALTL 190
Query: 217 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLGDVSVPSLLAKAGLI 273
+ + + + + GCG + +G L GA+ GVMGLG +S S L +
Sbjct: 191 NTSTGKVKK---LNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRR--F 245
Query: 274 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-------------------- 313
+ FS C + + TSFL IG +
Sbjct: 246 GSKFSYCLMDYT-----------LSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSP 294
Query: 314 --YFVGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 361
Y++ ++ + L ++DSG + TF+ Y E++ F K
Sbjct: 295 TFYYIAIKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKR 354
Query: 362 VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF--VVRNHIFSFPENEVGDH 418
V + + C N S +P M + F RN+ E GD
Sbjct: 355 VKLPSPAEPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFI-----ETGDQ 408
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 147/379 (38%), Gaps = 58/379 (15%)
Query: 65 LLSNDWKRQKTRVK-LQSNNNSSR----NQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTP 119
L+ +R K R L + N +R N+ P+ G+ Y + + IGTP
Sbjct: 49 LIRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRPSGDLEYVVD---LAIGTP 105
Query: 120 NVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLC 179
LD GS+L+W QC CA + L + + P S+S + + C+ LC
Sbjct: 106 PQPVSALLDTGSDLIWT--QCAPCA-------SCLSQPDPLFAPGQSASYEPMRCAGTLC 156
Query: 180 KS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 237
SC+ D C Y +Y + T + G + AS S ++ + GCG
Sbjct: 157 SDILHHSCER-PDTCTYRYNYG-DGTMTVGVYATERFTFAS-SGGGGLTTTTVPLGFGCG 213
Query: 238 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS--------GSV 289
GS +G+ G++G G +S+ S L+ FS C S GS+
Sbjct: 214 SVNVGSLNNGS---GIVGFGRNPLSLVSQLSI-----RRFSYCLTSYASRRQSTLLFGSL 265
Query: 290 FFGDQGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ--------ALV 337
G G AT Q+T L + Y+V +G L +S F +V
Sbjct: 266 SDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIV 325
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY-------NASSEEMLKVPD 390
DSG + T LP + AEVV F + + + C+ +SS + VP
Sbjct: 326 DSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPR 385
Query: 391 MRLIF-SKNQSFVVRNHIF 408
M L F + RN++
Sbjct: 386 MVLHFQGADLDLPRRNYVL 404
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 95/408 (23%), Positives = 159/408 (38%), Gaps = 67/408 (16%)
Query: 34 DEAKERWISKSGNVSVADSWPKKNSVEYLELL---LSNDWKRQKTRVK-LQSNNNSSRNQ 89
+E E+W+ K V D NS ++ L L D KR + ++ L S S
Sbjct: 127 EEGGEKWMMK---VVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYRV 183
Query: 90 LLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSA 148
F ++ G+ Y++ I +G+P S + +D+GS+++WV CQ C QC
Sbjct: 184 DDFGTDVISGMEQGSGEYFVR---IGVGSPPRSQYMVIDSGSDIVWVQCQPCTQC----- 235
Query: 149 SYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 208
Y D +DP+ S+S VSCS +C + C Y Y + + + G
Sbjct: 236 --YHQSD---PVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYG-DGSYTKGT 289
Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
L L +F + ++ SV IGCG + G ++ A G+ G + V
Sbjct: 290 LA---LETLTFGR-----TMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVG-----Q 336
Query: 269 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGN 325
G +FS C S +++P+ A Y++G+ +G
Sbjct: 337 LGGQTGGAFSYCL------------------VSAAWVPLVRNPRAPSFYYIGLAGLGVGG 378
Query: 326 S---------CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 375
LT+ G +V D+G + T LPT Y F ++ + +
Sbjct: 379 IRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFD 438
Query: 376 YCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
CY+ ++VP + FS + F P ++ G CF++
Sbjct: 439 TCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTF-CFAF 485
>gi|213998834|gb|ACJ60784.1| nucellin [Hordeum bulbosum]
Length = 154
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 67/138 (48%), Gaps = 4/138 (2%)
Query: 229 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 286
+ + GCG KQ +P DG++GLG+G + L +I +N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLRGHKMIKENVIGHCLSSKGK 65
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 345
G ++ GD P T+ T ++P+ E Y G+ I + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPTRGVT-WVPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTH 124
Query: 346 LPTEIYAEVVVKFDKLVS 363
+P +IY+E+V K +S
Sbjct: 125 VPAQIYSEIVSKVRGTLS 142
>gi|213998812|gb|ACJ60773.1| nucellin [Hordeum euclaston]
Length = 154
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 66/138 (47%), Gaps = 4/138 (2%)
Query: 229 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 286
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 345
G ++ GD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMKESLFYYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124
Query: 346 LPTEIYAEVVVKFDKLVS 363
+P +IY E+V K +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 86/197 (43%), Gaps = 26/197 (13%)
Query: 57 NSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWL------H 110
+ E L L D KR+ +R+ + ++ N G + F L +
Sbjct: 89 TAAELLAHRLRRD-KRRASRISAAAGGAAAANGTRVGGGGGGSGFVAPVVSGLAQGSGEY 147
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
+T I +GTP L+ LD GS+++W +QCAP Y D++ +DP +S S
Sbjct: 148 FTKIGVGTPVTPALMVLDTGSDVVW-----LQCAPCRRCY----DQSGQMFDPRASHSYG 198
Query: 171 NVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
V C+ PLC+ S C + C Y Y + + ++G + L AS ++ P+
Sbjct: 199 AVDCAAPLCRRLDSGGCDLRRKACLYQVAYG-DGSVTAGDFATETLTFASGAR-VPR--- 253
Query: 229 QSSVIIGCGRKQTGSYL 245
V +GCG G ++
Sbjct: 254 ---VALGCGHDNEGLFV 267
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 85/319 (26%), Positives = 123/319 (38%), Gaps = 62/319 (19%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IGTP V V +D GS+L WV QC C S+S Y D YDP++SS+ V
Sbjct: 131 LGIGTPAVQQTVLIDTGSDLSWV--QCKPCN--SSSCYPQKD---PLYDPTASSTYAPVP 183
Query: 174 CSHPLCK-----------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
C CK + SS SL C Y +Y DT+ Y + +
Sbjct: 184 CDSKACKDLVPDAYDHGCTNSSGTSL---CQYGIEYGNRDTTVGVYSTETL-------TL 233
Query: 223 APQSSVQSSVIIGCGRKQTGS-------YLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQ 274
+PQ SV+ GCG Q G+ G AP+ SL+++ A
Sbjct: 234 SPQVSVK-DFGFGCGLVQQGTFDLFDGLLGLGGAPE-------------SLVSQTAETYG 279
Query: 275 NSFSICFDENDSGSVFFGDQGPATQQSTS---FLP---IGEKYDAYFVGVESYCIGNSCL 328
+FS C +S + F P T+ F P + E+ Y V + +G L
Sbjct: 280 GAFSYCLPPGNSTTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPL 339
Query: 329 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNASS 382
T ++DSG T LP Y+ + F +S+ + N CYN +
Sbjct: 340 DIPPTVLSGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTG 399
Query: 383 EEMLKVPDMRLIFSKNQSF 401
+ VP + L F +
Sbjct: 400 IANVTVPTVALTFDGGATI 418
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 81/348 (23%), Positives = 133/348 (38%), Gaps = 63/348 (18%)
Query: 127 LDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCK 186
LD GS+L+W PC C L T N S + S+ + C+ P C + S
Sbjct: 102 LDTGSDLVWFPCAPFTCM-LCEGKPTPPGNNNSSNPLPPPTDSRRIPCASPFCSAAHSSA 160
Query: 187 SLKDPCPY----IADYSTEDTSSSG------YLVDDILHLASFSKHAPQSSVQSSVII-- 234
D C + D T ++S Y D +A + + + +SV +
Sbjct: 161 PPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLVARLRRG--RVGIAASVAVEN 218
Query: 235 ---GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND------ 285
C G P GV G G G +S+P+ LA A L FS C +
Sbjct: 219 FTFACAHTALGE------PVGVAGFGRGPLSLPAQLAPAAL-SGRFSYCLVAHSFRADRP 271
Query: 286 --SGSVFFG---DQGPATQQSTSFLPI--GEKYDAYF-VGVESYCIGNSCLT-------- 329
+ G + PA++ + P+ K+ ++ V +E+ +G + +
Sbjct: 272 IRPSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGRV 331
Query: 330 -QSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---------GNSWKYCY 378
++G +V DSG +FT LP E YA V +F + +++ R + Y +
Sbjct: 332 GRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYDH 391
Query: 379 NASSEE---MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEVGDHACF 421
+AS+ E VP + + F + V+ RN+ F E C
Sbjct: 392 DASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCL 439
>gi|213998804|gb|ACJ60769.1| nucellin [Hordeum muticum]
gi|213998808|gb|ACJ60771.1| nucellin [Hordeum erectifolium]
gi|213998820|gb|ACJ60777.1| nucellin [Hordeum patagonicum subsp. mustersii]
gi|213998822|gb|ACJ60778.1| nucellin [Hordeum patagonicum subsp. santacrucense]
gi|333069937|gb|AEF13570.1| nucellin, partial [Hordeum pubiflorum]
Length = 154
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 66/138 (47%), Gaps = 4/138 (2%)
Query: 229 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 286
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 345
G ++ GD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124
Query: 346 LPTEIYAEVVVKFDKLVS 363
+P +IY E+V K +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 82/340 (24%), Positives = 135/340 (39%), Gaps = 60/340 (17%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP + + LD GS L W + CAP A S + P +SS+ V
Sbjct: 89 LAVGTPPQNVTMVLDTGSELSW-----LLCAPAGARNKFSA----MSFRPRASSTFAAVP 139
Query: 174 CSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
C+ C+SR +C C Y+ + +SS G L D+ + S
Sbjct: 140 CASAQCRSRDLPSPPACDGASSRCSVSLSYA-DGSSSDGALATDVFAVGS--------GP 190
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSG 287
GC S DG A G++G+ G +S +++A FS C D +D+G
Sbjct: 191 PLRAAFGCMSSAFDSSPDGVASAGLLGMNRGALS---FVSQAS--TRRFSYCISDRDDAG 245
Query: 288 SVFFGDQGPATQQSTSFLPIGEK------YD--AYFVGVESYCIGNSCL----------- 328
+ G T ++ P+ + +D AY V + +G L
Sbjct: 246 VLLLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDH 305
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY------CYN--- 379
T +G Q +VDSG FTFL + Y+ + +F + +L S+ + C+
Sbjct: 306 TGAG-QTMVDSGTQFTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQ 364
Query: 380 ASSEEMLKVPDMRLIFSKNQSFVVRNH-IFSFP-ENEVGD 417
S ++P + L+F+ + V + ++ P E GD
Sbjct: 365 GRSPPTARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGD 404
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 81/324 (25%), Positives = 129/324 (39%), Gaps = 80/324 (24%)
Query: 78 KLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP 137
+ N++S + PS + + + + +T +GTP V LD GS+L WVP
Sbjct: 36 RRDPNHHSQKGSGGHPSVPATAALYPHSYGGYAFT-ASLGTPPQPLPVLLDTGSHLTWVP 94
Query: 138 C----QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSL 188
C +C C+ SAS + + P +SSSS+ V C +P C+ + + K
Sbjct: 95 CTSSYECRNCSSPSAS-------AVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCR 147
Query: 189 KDPC----------------PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSV 232
+ PC PY Y + T +G L+ D L AP +V
Sbjct: 148 RAPCSPGAANCPAAASNVCPPYAVVYGSGST--AGLLIADTL-------RAPGRAVP-GF 197
Query: 233 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC-----FDEND-- 285
++GC P G+ G G G SVP+ L GL + FS C FD+N
Sbjct: 198 VLGCSLVSVHQ-----PPSGLAGFGRGAPSVPAQL---GLPK--FSYCLLSRRFDDNAAV 247
Query: 286 SGSVFFGDQGPATQQSTSFLPI-----GEKYD---AYFVGVESYCIGNSCL--------- 328
SGS+ G + ++P+ G+K Y++ + +G +
Sbjct: 248 SGSLVLGGT--GGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAA 305
Query: 329 -TQSGFQALVDSGASFTFLPTEIY 351
+VDSG +FT+L ++
Sbjct: 306 NAAGSGGTIVDSGTTFTYLDPTVF 329
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 81/342 (23%), Positives = 127/342 (37%), Gaps = 53/342 (15%)
Query: 97 SQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC------APLSASY 150
S + G Y++ + +GTP F++ D GS+L WV C+ A+
Sbjct: 100 SSGAYTGTGQYFVRFR---VGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAA 156
Query: 151 YTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSS 205
+ + P S + + CS CKS ++C S C Y DY D S+
Sbjct: 157 PSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSY--DYRYNDNSA 214
Query: 206 SGYLVDDILHLASFSKHAP------QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 259
+ +V + S + + V++GC G + A DGV+ LG
Sbjct: 215 ARGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFE--ASDGVLSLGYS 272
Query: 260 DVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFG-------DQGPATQQSTSFLPI 307
++S S A FS C N + + FG PA T L
Sbjct: 273 NISFASRAAS--RFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLD 330
Query: 308 GEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKF- 358
Y V V+S + L S ++DSG S T L T Y VV
Sbjct: 331 ARVRPFYAVAVDSVSVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALS 390
Query: 359 DKLVSSKRISLQGNSWKYCYNASSE----EMLKVPDMRLIFS 396
++L R+++ + + YCYN ++ L VP + + F+
Sbjct: 391 EQLAGLPRVAM--DPFDYCYNWTARGDGGGDLAVPKLAVQFA 430
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 77/336 (22%), Positives = 132/336 (39%), Gaps = 73/336 (21%)
Query: 107 YWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQ----CIQCAPLSASYYTSLDRNL-SEY 161
Y + + GTP+ + D GS+L+W+PC C C ++ LD L +
Sbjct: 87 YGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCD------FSGLDPTLIPRF 140
Query: 162 DPSSSSSSKNVSCSHPLCK----SRSSCKSLKDP----C-----PYIADYSTEDTSSSGY 208
P +SSSSK + C P C+ C+ DP C PYI Y S++G
Sbjct: 141 IPKNSSSSKIIGCQSPKCQFLYGPNVQCRGC-DPNTRNCTVGCPPYILQYGLG--STAGV 197
Query: 209 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 268
L+ + L + ++GC S + P G+ G G G VS+PS +
Sbjct: 198 LITEKLDFPDLT--------VPDFVVGC------SIISTRQPAGIAGFGRGPVSLPSQMN 243
Query: 269 KAGLIQNSFSICFDEN--------DSGSVFFGDQGPATQQSTSFLPIGEKYDA------- 313
S FD+ D+GS G + ++ P + +
Sbjct: 244 LKRFSHCLVSRRFDDTNVTTDLDLDTGS---GHNSGSKTPGLTYTPFRKNPNVSNKAFLE 300
Query: 314 -YFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 362
Y++ + +G + T ++VDSG++FTF+ ++ V +F +
Sbjct: 301 YYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQM 360
Query: 363 S--SKRISLQGNS-WKYCYNASSEEMLKVPDMRLIF 395
S ++ L+ + C+N S + + VP++ F
Sbjct: 361 SNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEF 396
>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
partial [Brachypodium distachyon]
Length = 354
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 83/181 (45%), Gaps = 16/181 (8%)
Query: 180 KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 239
+ + CK + C Y Y+ + SS G L+ D L P + ++ GCG
Sbjct: 66 RFKHDCKENPNQCDYDVRYAGGE-SSLGVLIADKFSL-------PGRDARPTLTFGCGYD 117
Query: 240 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPAT 298
Q G + DGV+G+G G + S L + G I +N C G +FFG +
Sbjct: 118 QEGGKAEMPV-DGVLGIGRGTRDLASQLKQQGAIAENVIGHCLRIQGGGYLFFGHE-KVP 175
Query: 299 QQSTSFLPIGEKYDAYFVGVESYC----IGNSCLTQSGFQALVDSGASFTFLPTEIYAEV 354
+++P+ Y G+ + +GN ++ + + ++DSG+++T++PTE Y +
Sbjct: 176 SSVVTWVPMVPNNHYYSPGLAALHFNGNLGNP-ISVAPMEVVIDSGSTYTYMPTETYRRL 234
Query: 355 V 355
V
Sbjct: 235 V 235
>gi|291002742|gb|ADD71503.1| xyloglucanase inhibitor 1 [Humulus lupulus]
Length = 443
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 78/336 (23%), Positives = 126/336 (37%), Gaps = 72/336 (21%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ T I TP V V LD G LW+ C+ Y SS+
Sbjct: 48 YITQITQRTPPVQLKVVLDVGGEFLWIDCE--------KGY--------------KSSTK 85
Query: 170 KNVSCSHPLC--KSRSSCKSLKDP-----CPYIADYSTEDTSSSGYLVDDILHLASFSKH 222
+ V C P C +C + +P C + + +SG L +DIL++ S +
Sbjct: 86 RPVPCGSPQCVLSGSGACTTSDNPSDVGVCGVMPNNPFSSVGTSGDLFEDILYIQSTNGF 145
Query: 223 APQSSVQ-SSVIIGCGRKQTGSYLDGAAPD--GVMGLGLGDVSVPSLLAKAGLIQNSFSI 279
P V +++ C S L+G A G+ G G V++PSL + A F +
Sbjct: 146 NPGKQVSVPNLLFSCAPN---SLLEGLASGIIGMAGFGRNKVALPSLFSSAFSFPRKFGV 202
Query: 280 CFDENDSGSVFFGDQG----PATQ----QSTSFLPI------------GEKYDAYFVGVE 319
C ++ G +FFG + P S ++ P+ G YF+GV+
Sbjct: 203 CLSSSN-GVIFFGKEPYVLLPGIDVSDPTSLTYTPLIQNPRSLVSSFEGNPSAEYFIGVK 261
Query: 320 SYCI-GNSCLTQSGFQALVDSGA----------SFTFLPTEIYAEVVVKFDKLVSSKRIS 368
S + G + + G FT L T IY VV F K + K
Sbjct: 262 SIKVDGKPLRLNTTLLTFDNEGGHGGTKISTVDPFTTLETSIYKAVVGAFVKALGPKVPR 321
Query: 369 LQGNS-WKYCYNA----SSEEMLKVPDMRLIFSKNQ 399
++ + + C+NA ++ VP + L+ ++
Sbjct: 322 VKAVAPFGACFNAKYIGNTRVGPAVPQIDLVLRNDK 357
>gi|255552241|ref|XP_002517165.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543800|gb|EEF45328.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 434
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 138/360 (38%), Gaps = 63/360 (17%)
Query: 90 LLFPSEGSQTHFFGNQFYW--------LHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQC 140
++PS QT F L Y T I+ TP V + LD G LWV C
Sbjct: 15 FVYPSIADQTSFRPKALVLPVSRDPSTLQYLTSINQRTPLVPVKLTLDLGGQYLWVDCDQ 74
Query: 141 IQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC-SHPLCKSRSSCKSLKDPCPYIADYS 199
+S+SY R+ + S +K+ SC S R C + D C + D +
Sbjct: 75 ---GYVSSSYKPVRCRS------AQCSLAKSKSCISECFSSPRPGCNN--DTCALLPDNT 123
Query: 200 TEDTSSSGYLVDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAP--DGVMGL 256
+ +SG + D++ + S +P V +I C T L+G A G+ GL
Sbjct: 124 VTHSGTSGEVGQDVVTVQSTDGFSPGRVVSVPKLIFTCA---TTFLLEGLASGVKGMAGL 180
Query: 257 GLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSVFFGDQGP-------ATQQSTSFLPI- 307
G +S+PS + A F+IC N G VFFGD GP +S + P+
Sbjct: 181 GRTKISLPSQFSAAFSFDRKFAICLTSSNAKGIVFFGD-GPYVFLPNIDVSKSLIYTPLI 239
Query: 308 ------------GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGA----------SFTF 345
G+ YF+GV+S I + + +D +T
Sbjct: 240 LNPVSTASAFFKGDPSSEYFIGVKSIKINGKAVPLNTSLLFIDKEGVGGTKISTVDPYTV 299
Query: 346 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK----VPDMRLIFSKNQSF 401
L T IY V F K ++ + + C+N+S+ + VP + L+ + F
Sbjct: 300 LETTIYQAVTKVFIKELAEVPRVAPVSPFGVCFNSSNIGSTRVGPAVPQIDLVLQSSSVF 359
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 78/327 (23%), Positives = 139/327 (42%), Gaps = 40/327 (12%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+++ + IG P+ + LD GS++ W IQCAP + Y+ + ++P+SS+S
Sbjct: 144 YFSRVGIGKPSSPVYMVLDTGSDVNW-----IQCAPCADCYH----QADPIFEPASSTSY 194
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+SC C+S + + C Y Y + + + G V + + L S S
Sbjct: 195 SPLSCDTKQCQSLDVSECRNNTCLYEVSYG-DGSYTVGDFVTETITLGSASV-------- 245
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSG 287
+V IGCG G ++ A G+ G L S PS + + SFS C ++DS
Sbjct: 246 DNVAIGCGHNNEGLFIGAAGLLGLGGGKL---SFPSQINAS-----SFSYCLVDRDSDSA 297
Query: 288 SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSGFQA--------L 336
S + T+ L + D Y+VG+ +G L+ +S F+ +
Sbjct: 298 STLEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGII 357
Query: 337 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
+DSG + T L T Y + F K ++ + + CY+ S + ++VP + +
Sbjct: 358 IDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLA 417
Query: 397 KNQSFVVRNHIFSFPENEVGDHACFSY 423
+ + + P + G CF++
Sbjct: 418 GGKVLPLPATNYLIPVDSDGTF-CFAF 443
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 139/349 (39%), Gaps = 42/349 (12%)
Query: 103 GNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
GNQ + +Y +GTP + LD ++ +W+PC C+ S + + + S Y
Sbjct: 22 GNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCS--GCSGCSNASTSFNTNSSSTY 79
Query: 162 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
S S+++ C S S S+ C + Y D+S S LV D L LA
Sbjct: 80 STVSCSTAQCTQARGLTCPSSSPQPSV---CSFNQSYG-GDSSFSASLVQDTLTLA---- 131
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
V + GC +G+ L P G+MGLG G +S+ S L FS C
Sbjct: 132 ----PDVIPNFSFGCINSASGNSL---PPQGLMGLGRGPMSLVS--QTTSLYSGVFSYCL 182
Query: 282 DEND----SGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------- 328
SGS+ G G P + + T L + Y+V + +G+ +
Sbjct: 183 PSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLT 242
Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
SG ++DSG T +Y + +F K V+ S G ++ C++A +E +
Sbjct: 243 FDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLG-AFDTCFSADNENV- 300
Query: 387 KVPDMRL-IFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
P + L + S + + N + + G C S + N +L
Sbjct: 301 -APKITLHMTSLDLKLPMENTLI---HSSAGTLTCLSMAGIRQNANAVL 345
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 73/327 (22%), Positives = 124/327 (37%), Gaps = 49/327 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I +G+P + + +D+GS+++WV QC P S Y S +DP+ SSS
Sbjct: 143 YFVRIGVGSPPRNQYMVIDSGSDIVWV-----QCKPCSRCYQQS----DPVFDPADSSSF 193
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
VSC +C + C Y Y + + + G L + L + +
Sbjct: 194 AGVSCGSDVCDRLENTGCNAGRCRYEVSYG-DGSYTKGTLALETLTVGQV--------MI 244
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSV 289
V IGCG G ++ A G+ S+ + G +FS C +GS
Sbjct: 245 RDVAIGCGHTNQGMFIGAAGLLGLG-----GGSMSFIGQLGGQTGGAFSYCLVSRGTGST 299
Query: 290 FFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIGNSC---------LTQ 330
A + LP+G + + Y++G+ +G LT+
Sbjct: 300 ------GALEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTE 353
Query: 331 SGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
G +V D+G + T PT Y F S+ + + + CY+ + E ++VP
Sbjct: 354 YGTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVP 413
Query: 390 DMRLIFSKNQSFVVRNHIFSFPENEVG 416
+ FS + F P + G
Sbjct: 414 TVSFYFSDGPVLTLPARNFLIPVDGGG 440
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 139/349 (39%), Gaps = 42/349 (12%)
Query: 103 GNQFYWLHYT-WIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEY 161
GNQ + +Y +GTP + LD ++ +W+PC C+ S + + + S Y
Sbjct: 96 GNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCS--GCSGCSNASTSFNTNSSSTY 153
Query: 162 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 221
S S+++ C S S S+ C + Y D+S S LV D L LA
Sbjct: 154 STVSCSTAQCTQARGLTCPSSSPQPSV---CSFNQSYG-GDSSFSASLVQDTLTLA---- 205
Query: 222 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 281
V + GC +G+ L P G+MGLG G +S+ S L FS C
Sbjct: 206 ----PDVIPNFSFGCINSASGNSL---PPQGLMGLGRGPMSLVS--QTTSLYSGVFSYCL 256
Query: 282 DEND----SGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-------- 328
SGS+ G G P + + T L + Y+V + +G+ +
Sbjct: 257 PSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLT 316
Query: 329 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
SG ++DSG T +Y + +F K V+ S G ++ C++A +E +
Sbjct: 317 FDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLG-AFDTCFSADNENV- 374
Query: 387 KVPDMRL-IFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
P + L + S + + N + + G C S + N +L
Sbjct: 375 -APKITLHMTSLDLKLPMENTLI---HSSAGTLTCLSMAGIRQNANAVL 419
>gi|304361786|gb|ADM26243.1| MIP25078p [Drosophila melanogaster]
Length = 467
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 79/339 (23%), Positives = 133/339 (39%), Gaps = 80/339 (23%)
Query: 109 LHYT-WIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPS 164
+ YT ++IGTP F V D GS+ +WVP C+ C + +Y P+
Sbjct: 150 MEYTCKMNIGTPKQKFTVLPDTGSSNIWVPGPHCKSKAC------------KKHKQYHPA 197
Query: 165 SSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
SS+ +K+ + Y + S +G L D + +A
Sbjct: 198 KSST------------------YVKNGKSFAITYGSG--SVAGVLAKDTVRIAGL----- 232
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN--------- 275
V ++ K+ G+ + DG++GLG ++V ++ L+QN
Sbjct: 233 ---VVTNQTFAMTTKEPGTTFVTSNFDGILGLGYRSIAVDNVKT---LVQNMCSEDVITS 286
Query: 276 -SFSICFD----ENDSGSVFFGDQGPAT---QQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
F+IC + G++ FG + S ++ P+ +K F + Y +G +
Sbjct: 287 CKFAICMKGGGSSSRGGAIIFGSSNTSAYSGSNSYTYTPVTKKGYWQFTLQDIY-VGGTK 345
Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
++ S QA+VDSG S PT IY K +K++ R + G W C K
Sbjct: 346 VSGS-VQAIVDSGTSLITAPTAIYN----KINKVIGC-RATSSGECWMKCAK-------K 392
Query: 388 VPDMRLIFSKNQSFVVRNHIFSFP-ENEVGDHACFSYFT 425
+PD + + + FVV+ + G C S T
Sbjct: 393 IPDFTFVIA-GKKFVVKGNKMKLKVRTNRGRTVCISAVT 430
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 65/261 (24%), Positives = 108/261 (41%), Gaps = 34/261 (13%)
Query: 108 WLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSS 167
+L I +GTP V LVA+D G+ L +V QC P + + D +DPS S
Sbjct: 204 FLFLMPIKLGTPPVWNLVAVDTGATLSFV-----QCEPCTLRCHKQTDAG-EIFDPSKSE 257
Query: 168 SSKNVSCSHPLCKS--------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 219
S V CS C++ +C +D C Y + + S G LV D L + +
Sbjct: 258 SFSRVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVRDRLAIGKY 317
Query: 220 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL-GLGDVSVPSLLAKAGLIQ-NSF 277
+K + GC LD GL G D A L+ +F
Sbjct: 318 AK----GYSFPDFLFGCS-------LDTEYHQYEAGLVGFADEPFSFFEQVAPLVNYKAF 366
Query: 278 SICF--DENDSGSVFFGDQGPATQQSTSFLP--IGEKYDAYFVGVESYCIGNSCLTQSGF 333
S CF D +G + GD T+ ++++ P + + Y + ++ + L +
Sbjct: 367 SYCFPSDRRKTGYLSIGDY---TRVNSTYTPLFLARQQSRYALKLDEVLVNGMALVTTPS 423
Query: 334 QALVDSGASFTFLPTEIYAEV 354
+ +VDSG+ +T L ++ + ++
Sbjct: 424 EMIVDSGSRWTILLSDTFTQL 444
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 134/350 (38%), Gaps = 62/350 (17%)
Query: 19 SDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSNDWKRQKTRVK 78
+D SF+++L+H S + P N+ E L+ +R RV
Sbjct: 33 ADKFSFTAELIHIDSPNS-----------------PFFNASETTTHRLAKALQRSANRV- 74
Query: 79 LQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPC 138
+ N L EG F +L + IGTP A+D GSN++W+PC
Sbjct: 75 ------ARLNPLSNSDEGVHASIFSGDGNYL--MKLLIGTPPTEIHAAIDTGSNVIWIPC 126
Query: 139 -QCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIAD 197
C C +++ S ++P +SS+ ++ C C++ SS + C Y D
Sbjct: 127 INCKDC----------FNQSSSIFNPLASSTYQDAPCDSYQCETTSSSCQSDNVCLYSCD 176
Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 257
+ +G + D + L S S P S + CG ++ A GV+GLG
Sbjct: 177 EKHQLNCPNGRIAVDTMTLTS-SDGRPFPLPYSDFV--CGNSIYKTF----AGVGVIGLG 229
Query: 258 LGDVSVPSLLAKAGLIQNSFSICFDE---NDSGSVFFGDQGPATQQSTSFL--PIGEKYD 312
G +S+ S L L FS C + + FG Q + + +G
Sbjct: 230 RGALSLTSKLYH--LSDGKFSYCLADYYSKQPSKINFGLQSFISDDDLEVVSTTLGHHRH 287
Query: 313 A--YFVGVESYCIG---------NSCLTQSGFQALVDSGASFTFLPTEIY 351
+ Y+V +E +G + L+DSG FT LP + Y
Sbjct: 288 SGNYYVTLEGISVGEKRQDLYYVDDPFAPPVGNMLIDSGTMFTLLPKDFY 337
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 89/396 (22%), Positives = 151/396 (38%), Gaps = 65/396 (16%)
Query: 54 PKKNSVEYLELLLSNDWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTW 113
P+ + E++ L D R + Q +S+ L +Q Y +
Sbjct: 34 PEVTASEFVRGALRRDMHRHARFAREQLAPSSAAAAGLTVGAPTQKDLRNGGEYIMT--- 90
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSE----YDPSSSSSS 169
+ IGTP +S+ D GS+L+W QCAP + + ++ + Y+PSSS++
Sbjct: 91 LSIGTPPLSYRAIADTGSDLIWT-----QCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTF 145
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCP-------YIADYSTEDTSSSGYLVDDILHLASFSKH 222
+ C+ PL S C ++ P P Y Y T T+ V + S
Sbjct: 146 GVLPCNSPL----SMCAAMAGPSPPPGCACMYNQTYGTGWTAG----VQSVETFTFGSSS 197
Query: 223 APQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF- 281
P + ++ GC + + +G+A G++GLG G +S+ S L +FS C
Sbjct: 198 TPPAVRVPNIAFGCSNASSNDW-NGSA--GLVGLGRGSMSLVSQLGA-----GAFSYCLT 249
Query: 282 ---DENDSGSVFFGD------QGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT 329
D N + ++ G +G +ST F+ K Y++ + +G + L
Sbjct: 250 PFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALA 309
Query: 330 ---------QSGFQAL-VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG----NSWK 375
G L +DSG + T L Y +V L+ ++ G
Sbjct: 310 IPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLD 369
Query: 376 YCYN-ASSEEMLKVPDMRLIFSKNQSFV--VRNHIF 408
C+ +S +P M L F V V N++
Sbjct: 370 LCFALKASTPPPAMPSMTLHFEGGADMVLPVENYMI 405
>gi|24647679|ref|NP_650621.1| CG17283 [Drosophila melanogaster]
gi|7300253|gb|AAF55416.1| CG17283 [Drosophila melanogaster]
Length = 465
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 79/339 (23%), Positives = 133/339 (39%), Gaps = 80/339 (23%)
Query: 109 LHYT-WIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLDRNLSEYDPS 164
+ YT ++IGTP F V D GS+ +WVP C+ C + +Y P+
Sbjct: 148 MEYTCKMNIGTPKQKFTVLPDTGSSNIWVPGPHCKSKAC------------KKHKQYHPA 195
Query: 165 SSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
SS+ +K+ + Y + S +G L D + +A
Sbjct: 196 KSST------------------YVKNGKSFAITYGSG--SVAGVLAKDTVRIAGL----- 230
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN--------- 275
V ++ K+ G+ + DG++GLG ++V ++ L+QN
Sbjct: 231 ---VVTNQTFAMTTKEPGTTFVTSNFDGILGLGYRSIAVDNVKT---LVQNMCSEDVITS 284
Query: 276 -SFSICFD----ENDSGSVFFGDQGPAT---QQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
F+IC + G++ FG + S ++ P+ +K F + Y +G +
Sbjct: 285 CKFAICMKGGGSSSRGGAIIFGSSNTSAYSGSNSYTYTPVTKKGYWQFTLQDIY-VGGTK 343
Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
++ S QA+VDSG S PT IY K +K++ R + G W C K
Sbjct: 344 VSGS-VQAIVDSGTSLITAPTAIYN----KINKVIGC-RATSSGECWMKCAK-------K 390
Query: 388 VPDMRLIFSKNQSFVVRNHIFSFP-ENEVGDHACFSYFT 425
+PD + + + FVV+ + G C S T
Sbjct: 391 IPDFTFVIA-GKKFVVKGNKMKLKVRTNRGRTVCISAVT 428
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 67/247 (27%), Positives = 104/247 (42%), Gaps = 48/247 (19%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNV 172
+ IGTP V+F V D GS+L+W C C +CA R + P+SSS+ +
Sbjct: 94 LSIGTPPVTFSVLADTGSSLIWTQCAPCTECA----------ARPAPPFQPASSSTFSKL 143
Query: 173 SCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQS 226
C+ LC+ +S C + C Y Y T +GYL + LH+ ASF
Sbjct: 144 PCASSLCQFLTSPYRTCNATG--CVYYYPYGMGFT--AGYLATETLHVGGASF------- 192
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN-D 285
V GC + G + G++GLG + SL+++ G+ + FS C N D
Sbjct: 193 ---PGVTFGCSTENG----VGNSSSGIVGLGRSPL---SLVSQVGVAR--FSYCLRSNAD 240
Query: 286 SG--SVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCLTQSGFQALVD 338
+G + FG T + P+ E + Y+V + +G + L +
Sbjct: 241 AGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPMAMANLTTV 300
Query: 339 SGASFTF 345
+G F F
Sbjct: 301 NGTRFGF 307
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 74/309 (23%), Positives = 128/309 (41%), Gaps = 42/309 (13%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+G+P F + LD GS+L W+ QC+ C + +N + YDP +S+S KN++C+
Sbjct: 161 VGSPPKHFSLILDTGSDLNWI--QCLPC-------HDCFQQNGAFYDPKASASYKNITCN 211
Query: 176 HPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
P C S CKS CPY Y ++ + V+ + S + +
Sbjct: 212 DPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNV 271
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEN 284
+++ GCG G + A ++GLG G +S S L L +SFS C D N
Sbjct: 272 ENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDTN 326
Query: 285 DSGSVFFGDQGPATQQS----TSFLPIGEKY--DAYFVGVESYCIGNSCL---------- 328
S + FG+ TSF+ E Y+V ++S + L
Sbjct: 327 VSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNIS 386
Query: 329 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-RISLQGNSWKYCYNASSEEMLK 387
+ ++DSG + ++ Y + K + K + C+N S + ++
Sbjct: 387 SDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIDSIQ 446
Query: 388 VPDMRLIFS 396
+P++ + F+
Sbjct: 447 LPELGIAFA 455
>gi|198422402|ref|XP_002130569.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 389
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 138/332 (41%), Gaps = 68/332 (20%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+Y I IGTP F V D GS+ LWVP + CA + L N +Y S SSS
Sbjct: 68 YYGKIYIGTPPQPFTVVFDTGSSNLWVP--SVHCAITDIA---CLIHN--KYKASESSSY 120
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
K+ S + S SGY+ DI+ +A
Sbjct: 121 KSNGTSFAIQYGSGSL--------------------SGYVSSDIVSIAGVK--------S 152
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-------LLAKAGLIQNSFSICF- 281
+ + K+ G A DG++G+G ++SV + + L N FS
Sbjct: 153 KNQLFAEATKEPGLTFVAAKFDGILGMGYPEISVNGITPVFNQMFKQEALAHNQFSFYLN 212
Query: 282 -DENDS--GSVFFGDQGPATQQST---SFLPIGEKYDAYFVGVESYCIGNS---CLTQSG 332
D N S G ++ G G T++ T S+ P+ K + + ++S +G+S C+ SG
Sbjct: 213 RDANASSGGELYLG--GVDTKKFTGSFSYHPVTVK-GYWQISMDSVSVGSSTSACV--SG 267
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
+A+VDSG S PT + + K +KL+ + + L G C +M +PD+
Sbjct: 268 CKAIVDSGTSLLAGPT----DEIEKINKLIGATKF-LNGEYIVQC-----NKMATMPDIT 317
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSYF 424
S ++++ + + E+ G+ C S F
Sbjct: 318 FSLS-GVKYILKPNDYVMKESTAGESICISGF 348
>gi|50878437|gb|AAT85211.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 435
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 135/358 (37%), Gaps = 54/358 (15%)
Query: 80 QSNNNSSRNQLLFP--SEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP 137
++ + SS + +L P +G+ + +W TP+V +D +LWV
Sbjct: 29 RAASGSSPSAVLLPVDKDGATQQYV--TMFWQR-------TPSVPVKAVVDLAGAMLWVD 79
Query: 138 CQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIAD 197
C+ S Y S +K+ +C+ C +S L D C +
Sbjct: 80 CE---------SGYESSSYARVPCGSKPCRLAKSAACATG-CSGAASPGCLNDTCTGFPE 129
Query: 198 YSTEDTSSSGYLVDDILHLASFSKHAP-QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 256
Y+ S+ G ++ D L L + + P + + CG L GAA G+M L
Sbjct: 130 YTITRVSTGGNIITDKLSLYTTCRPMPVPRATAPGFLFTCGATSLTKGL-GAAATGMMSL 188
Query: 257 GLGDVSVPSLLAKAGLIQNSFSICFDEND-SGSVFFGDQG----PATQQSTSFL------ 305
++P+ +A F++C + SG V FGD P S S +
Sbjct: 189 SRARFALPTQVASIFRFSRKFALCLAPAESSGVVVFGDAPYEFQPVMDLSKSLIYTPLLV 248
Query: 306 -PI----GEKYDAYFVGVESYCI-GNSCLTQSGFQALVDSG---------ASFTFLPTEI 350
P+ G+K YF+GV + G + + A+ SG + +T L T I
Sbjct: 249 NPVTTTGGDKSTEYFIGVTGIKVNGRAVPLNATLLAIAKSGVGGTKLSMLSPYTVLETSI 308
Query: 351 YAEVVVKFDKLVSSKRISLQGNSWKYCYN----ASSEEMLKVPDMRLIF-SKNQSFVV 403
Y V F + +K CY+ S+ VP + L+ SK S+VV
Sbjct: 309 YKAVTDAFAAETAMIPRVPAVAPFKLCYDGTMVGSTRAGPAVPTVELVLQSKAVSWVV 366
>gi|448113357|ref|XP_004202330.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
gi|359465319|emb|CCE89024.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
Length = 414
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 108/433 (24%), Positives = 174/433 (40%), Gaps = 84/433 (19%)
Query: 9 MLFGCILLDGSDAVSFSSKLVHRFSDEAKERWISKSGNVSVADSWPKKNSVEYLELLLSN 68
++ IL G+DA S+ + S E +S G ++ K YL L S
Sbjct: 11 VVLATILQSGADAKKVSTSI----SKVPLEETLSGKGFNKYTEALANK----YLNLFNSA 62
Query: 69 DWKRQKTRVKLQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALD 128
K V QS+ + Q+ F ++G N +YT I +G+P F V LD
Sbjct: 63 GGKGAGAPV--QSSQEGA--QIPFVAQGGHDAPLENYLNAQYYTTIGLGSPVQEFKVVLD 118
Query: 129 AGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSL 188
GS+ LWVP C+ L+ +T +YD S SSS K + R SL
Sbjct: 119 TGSSNLWVP--STDCSSLACFLHT-------KYDHSESSSYKQNGSEFAI---RYGSGSL 166
Query: 189 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 248
+ GY+ D L+LA + + +S + G A
Sbjct: 167 E-----------------GYVSQDTLNLAGLTIEKQDFAEATS--------EPGLAFAFA 201
Query: 249 APDGVMGLGLGDVSVPSLLA------KAGLI-QNSFSICF-----DENDSGSVFFGDQGP 296
DG++GL +SV +++ GL+ + F+ DEND G FG G
Sbjct: 202 KFDGILGLAYDTISVNNIVPPIYNAINQGLLDEPKFAFYLGDKDKDENDGGVATFG--GV 259
Query: 297 ATQQ---STSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYA 352
T+ LPI K AY+ V + +G+ + A +D+G S LP+ + A
Sbjct: 260 DTKHYKGDIVELPIRRK--AYWEVSFDGIGLGDEYAELTSTGAAIDTGTSLITLPSSL-A 316
Query: 353 EVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 412
E++ + + +K+ SW Y+ + +P++ + F +F + + ++
Sbjct: 317 EII---NAKIGAKK------SWSGQYSVDCDSRDSLPELTMTF-HGHNFTLSPYEYTL-- 364
Query: 413 NEVGDHACFSYFT 425
EVG +C S FT
Sbjct: 365 -EVGG-SCISAFT 375
>gi|351722911|ref|NP_001237772.1| uncharacterized protein LOC100500675 [Glycine max]
gi|255630909|gb|ACU15817.1| unknown [Glycine max]
Length = 244
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 67/147 (45%), Gaps = 6/147 (4%)
Query: 279 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVD 338
+CF + +G + FGD G Q+ T F + + + Y + + + +S + F A+ D
Sbjct: 1 MCFGPDGAGRITFGDTGSPDQRKTPFN-VRKLHPTYNITITQIVVEDS-VADLEFHAIFD 58
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS----WKYCYNASSEEMLKVPDMRLI 394
SG SFT++ Y + ++ V + R S Q ++YCY+ S + ++VP + L
Sbjct: 59 SGTSFTYINDPAYTRLGEMYNSKVKANRHSSQSPDSNIPFEYCYDISINQTIEVPFLNLT 118
Query: 395 FSKNQSFVVRNHIFSFPENEVGDHACF 421
+ V + I E GD C
Sbjct: 119 MKGGDDYYVMDPIVQVFSEEEGDLLCL 145
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 130/322 (40%), Gaps = 71/322 (22%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNL-SEYDPSSSSSSKNV 172
+ +G+P + + LD GS L W+ C+ NL S ++P SSS+ V
Sbjct: 65 LAVGSPPQNISMVLDTGSELSWLHCK--------------KSPNLGSVFNPVSSSTYSPV 110
Query: 173 SCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 225
CS P+C++R+ SC C ++A + TS G L D + S ++
Sbjct: 111 PCSSPICRTRTRDLPIPASCDPKTHFC-HVAISYADATSIEGNLAHDTFVIGSVTRPG-- 167
Query: 226 SSVQSSVIIGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
+ GC S + A G+MG+ G +S + L + FS C +
Sbjct: 168 ------TLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFS-----KFSYCISGS 216
Query: 285 D-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QS 331
D SG + GD P Q+T LP ++ AY V +E +G+ L+ +S
Sbjct: 217 DSSGILLLGDASYSWLGPIQYTPLVLQTTP-LPYFDRV-AYTVQLEGIRVGSKILSLPKS 274
Query: 332 GF--------QALVDSGASFTFLPTEIYAEVVVKFD-------KLVSSKRISLQGNSWKY 376
F Q +VDSG FTFL +Y + +F ++V QG +
Sbjct: 275 VFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQG-TMDL 333
Query: 377 CYNASSE---EMLKVPDMRLIF 395
CY S +P + L+F
Sbjct: 334 CYRVGSSTRPNFTGLPVISLMF 355
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 140/333 (42%), Gaps = 48/333 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCS 175
+G P FL+ +D GS+L W +QC P A + D++ +DPS S+S K + C+
Sbjct: 177 VGNPPRHFLLIIDTGSDLTW-----LQCKPCKACF----DQSGPVFDPSQSTSFKIIPCN 227
Query: 176 --------HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 227
H C+ SS S K C Y Y + + +SG L + L + S S H P S
Sbjct: 228 AAACDLVVHDECRDNSSKTSPKT-CKYFYWYG-DSSRTSGDLALESLSV-SLSDH-PSSL 283
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND-- 285
++IGCG G + ++GLG G +S PS L ++ I SFS C +
Sbjct: 284 EIRDMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RSSPIGQSFSYCLVDRTNN 339
Query: 286 ---SGSVFFGDQGPATQQ--STSFLPIGEKYDA----YFVGVESYCIGNSCLTQSGFQ-- 334
S ++ FG ++ F P ++ Y++G++ I L +
Sbjct: 340 LSVSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFA 399
Query: 335 --------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 386
++DSG + T+L + Y V F +S R + CYNA+ +
Sbjct: 400 IAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-PFDILGICYNATGRTAV 458
Query: 387 KVPDMRLIFSKNQSF-VVRNHIFSFPENEVGDH 418
P + ++F + + + F P+ + H
Sbjct: 459 PFPTLSIVFQNGAELDLPQENYFIQPDPQEAKH 491
>gi|213998816|gb|ACJ60775.1| nucellin [Hordeum patagonicum subsp. patagonicum]
Length = 152
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 66/138 (47%), Gaps = 4/138 (2%)
Query: 229 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 286
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 4 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKVITGNVIGHCLSSKGK 63
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 345
G ++ GD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T
Sbjct: 64 GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 122
Query: 346 LPTEIYAEVVVKFDKLVS 363
+P +IY E+V K +S
Sbjct: 123 VPAQIYNEIVSKVRGTLS 140
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 85/336 (25%), Positives = 135/336 (40%), Gaps = 52/336 (15%)
Query: 102 FGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLS 159
G L Y + +G+P S + +D GS++ WV C+ C QC ++ D
Sbjct: 119 LGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQC-------HSQAD---P 168
Query: 160 EYDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 215
+DPSSSS+ SC C + + C S C YI Y + +S++G D L
Sbjct: 169 LFDPSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQ-CQYIVTYG-DGSSTTGTYSSDTLA 226
Query: 216 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 275
L SS S GC ++G + D DG+MGLG G S+ S AG +
Sbjct: 227 LG--------SSAVKSFQFGCSNVESG-FNDQT--DGLMGLGGGAQSLVS--QTAGTLGR 273
Query: 276 SFSICFDENDSGSVFFGDQGPATQQSTSFL--PI----------GEKYDAYFVGVESYCI 323
+FS C S S F ++ F+ P+ G + A VG I
Sbjct: 274 AFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSI 333
Query: 324 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
S + ++DSG T LP Y+ + F + + C++ S +
Sbjct: 334 PASVFSAG---TVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQ 390
Query: 384 EMLKVPDMRLIFSK------NQSFVVRNHIFSFPEN 413
+ +P + L+FS + S ++ ++ +F N
Sbjct: 391 SSVSIPSVALVFSGGAVVSLDASGIILSNCLAFAAN 426
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 81/343 (23%), Positives = 135/343 (39%), Gaps = 50/343 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSS 168
+ + +GTP + LD ++ WVPC C C+ S ++ + L D S +
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCS--STTFLPNASTTLGSLDCSGAQC 155
Query: 169 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
S+ S P S + C + Y D+S + LV D + LA + V
Sbjct: 156 SQVRGFSCPATGSSA--------CLFNQSYG-GDSSLTATLVQDAITLA--------NDV 198
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDEND-- 285
GC +G + P G++GLG G + SL+++AG + FS C
Sbjct: 199 IPGFTFGCINAVSGGSI---PPQGLLGLGRGPI---SLISQAGAMYSGVFSYCLPSFKSY 252
Query: 286 --SGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSG 332
SGS+ G G P + ++T L + Y+V + +G + +G
Sbjct: 253 YFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTG 312
Query: 333 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
++DSG T +Y + +F K V+ SL ++ C+ A++E + P +
Sbjct: 313 AGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSL--GAFDTCFAATNEA--EAPAIT 368
Query: 393 LIFSK-NQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
L F N + N + + G AC S N +L
Sbjct: 369 LHFEGLNLVLPMENSLI---HSSSGSLACLSMAAAPNNVNSVL 408
>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 431
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 78/295 (26%), Positives = 125/295 (42%), Gaps = 32/295 (10%)
Query: 85 SSRNQLLFPSEGSQTHF--FGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ-- 139
SS LL P+ GS F +GN + Y ++IG P + + +D GS+L W+ C
Sbjct: 44 SSWPSLLNPA-GSSIVFPLYGNVYPVGFYNVTLNIGQPARPYFLDVDTGSDLTWLQCDAP 102
Query: 140 CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK----DPCPYI 195
C C+ P S+ V C PLC S + D C Y
Sbjct: 103 CTHCSETP--------------HPLHRPSNDFVPCRDPLCASLQPTEDYNCEHPDQCDYE 148
Query: 196 ADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMG 255
+Y+ + S+ G L++D+ L S + Q V+ + +GCG Q S DG++G
Sbjct: 149 INYA-DQYSTYGVLLNDVYLLN--SSNGVQLKVR--MALGCGYDQVFSPSSYHPLDGLLG 203
Query: 256 LGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE-KYDAY 314
LG G S+ S L GL++N C G +FFG+ + + ++ PI Y
Sbjct: 204 LGRGKASLISQLNSQGLVRNVIGHCLSSQGGGYIFFGNAYDSAR--VTWTPISSVDSKHY 261
Query: 315 FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 369
G G A+ D+G+S+T+ + Y ++ +K +S K + +
Sbjct: 262 SAGPAELVFGGRKTGVGSLTAVFDTGSSYTYFNSHAYQALLSWLNKELSGKPLKV 316
>gi|393215979|gb|EJD01470.1| aspartic peptidase A1 [Fomitiporia mediterranea MF3/22]
Length = 412
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 89/349 (25%), Positives = 146/349 (41%), Gaps = 76/349 (21%)
Query: 99 THFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVP---CQCIQCAPLSASYYTSLD 155
T+F Q++ T I +GTP F V LD GS+ LWVP C I C L A
Sbjct: 94 TNFMNAQYF----TTITLGTPPQEFKVILDTGSSNLWVPSTKCTSIACF-LHA------- 141
Query: 156 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 215
+YD S+SS+ K K+ + +Y + S G++ +D+L
Sbjct: 142 ----KYDSSASSTHK------------------KNGTSFKIEYGS--GSMEGFVSNDVLS 177
Query: 216 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAK 269
+ H Q +++ K+ G DG++GLG +SV + +
Sbjct: 178 IGDLKIH-DQDFAEAT-------KEPGLAFAFGKFDGILGLGYDTISVNHITPPFYSMVN 229
Query: 270 AGLIQN---SFSICFDENDSG-SVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIG 324
GL+ SF + E D G +VF G A ++ P+ K AY+ V + G
Sbjct: 230 KGLLDAPVFSFRLGSSEEDGGEAVFGGIDESAYSGKINYAPVRRK--AYWEVELPKVAFG 287
Query: 325 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 384
+ L A +D+G S LP+++ AE++ + + + + SW Y ++
Sbjct: 288 DDVLELENTGAAIDTGTSLIALPSDV-AEML---NAQIGATK------SWNGQYTVDCKK 337
Query: 385 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFT-LEYNFTG 432
+ +PD L F+ Q++ ++ + EV C S FT L+ N G
Sbjct: 338 VPDLPDFTLWFN-GQAYPLKGSDYIL---EV-QGTCISSFTGLDINVPG 381
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 83/310 (26%), Positives = 132/310 (42%), Gaps = 45/310 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I +GTP S + D GS++ W +QC+P Y + ++PS SSS
Sbjct: 81 YFARIGVGTPARSVYMVADTGSDVSW-----LQCSPCRKCYR----QQDPIFNPSLSSSF 131
Query: 170 KNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
K ++C+ +C K + S K+ C Y Y + + + + SF +HA +
Sbjct: 132 KPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETL----SFGEHAVR--- 184
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-- 286
SV +GCGR G + A ++GLG G +S PS + + FS C +S
Sbjct: 185 --SVAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAI 237
Query: 287 -GSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQ 334
S+ FG P + T LP Y+VG+ + S + ++
Sbjct: 238 AASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGG 297
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASSEEMLKVPDM 391
+VDSG + + L T Y + F LV S+ ISL + CY+ SS + +P +
Sbjct: 298 VIVDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL----FDTCYDLSSMKTATLPAV 353
Query: 392 RLIFSKNQSF 401
L F S
Sbjct: 354 VLDFDGGASM 363
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 137/337 (40%), Gaps = 53/337 (15%)
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
++GTP +FL+ALD ++ W+PC C+ C S++ + S+ +S++ K +
Sbjct: 95 NVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSV----------TSTTFKTLG 141
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C P CK + C + Y S+ L D + L+ + +
Sbjct: 142 CDAPQCKQVPNPTCGGSTCTWNTTYGGSTILSN--LTRDTIALS--------TDIVPGYT 191
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSV 289
GC +K TGS + P G++GLG G +S L L +++FS C N SG++
Sbjct: 192 FGCIQKTTGSSV---PPQGLLGLGRGPLSF--LSQTQDLYKSTFSYCLPSFRTLNFSGTL 246
Query: 290 FFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVD 338
G G P ++T L + Y+V + +G + +G + D
Sbjct: 247 RLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFD 306
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS-K 397
SG FT L +Y V +F K V + +S G + CY + P M +FS
Sbjct: 307 SGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG-FDTCYTGP----IVAPTMTFMFSGM 361
Query: 398 NQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGIL 434
N + N + + G +C + N +L
Sbjct: 362 NVTLPTDNLLI---RSTAGSTSCLAMAAAPDNVNSVL 395
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 77/307 (25%), Positives = 121/307 (39%), Gaps = 46/307 (14%)
Query: 116 IGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSC 174
IGTP V L D GS+L+WV C C C P S + L S + P++ S C
Sbjct: 96 IGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKS--STFMPTTCRSQP---C 150
Query: 175 SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 234
+ L + + KS + C Y Y + + S G L + L S+ Q+ +
Sbjct: 151 TLLLPEQKGCGKSGE--CIYTYKYGDQYSFSEGLLSTETLRFD--SQGGVQTVAFPNSFF 206
Query: 235 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFF 291
GCG + G+MGLG G +S+ S + I + FS C + + F
Sbjct: 207 GCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTSKLKF 264
Query: 292 GDQGPATQQSTSFLPIGEK---YDAYFVGVESYCIGNSCLTQSGF--QALVDSGASFTFL 346
G++ T + P+ K YF+ +E+ + + ++DSG T+L
Sbjct: 265 GNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTLLTYL 324
Query: 347 PTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ-SFVVRN 405
G S+ Y + AS +E L V ++ + S F R+
Sbjct: 325 ------------------------GESFYYNFAASLQESLAVELVQDVLSPLPFCFPYRD 360
Query: 406 HIFSFPE 412
+ F FPE
Sbjct: 361 N-FVFPE 366
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/328 (24%), Positives = 127/328 (38%), Gaps = 68/328 (20%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IGTP + + LD GS L W+ C+ P +A +DP SSS +
Sbjct: 82 LPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTA------------FDPLLSSSFSVLP 129
Query: 174 CSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
C+H LCK R +L C + + + + T + G LV + +S P
Sbjct: 130 CNHSLCKPRVPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQTTPP---- 185
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-- 286
+I+GC D + G++G+ LG +S S LAK + FS C S
Sbjct: 186 ---LILGCAT-------DSSDTQGILGMNLGRLSFSS-LAKI----SKFSYCVPPRRSQS 230
Query: 287 -----GSVFFGDQGPA-------------TQQSTSFLPIGEKYDAYFVGV----ESYCIG 324
GS + G + +Q+ + P+ Y +G+ + I
Sbjct: 231 GSSPTGSFYLGPNPSSAGFKYVNLMTYRQSQRMPNLDPLA--YTLPMLGIRINGKKLNIS 288
Query: 325 NSCL---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYN 379
S Q L+DSG FTFL E Y++V + KL K+ + G S C++
Sbjct: 289 TSAFRADPSGAGQTLIDSGTWFTFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGSLDMCFD 348
Query: 380 ASSEEMLK-VPDMRLIFSKNQSFVVRNH 406
+ + + + +M F VV
Sbjct: 349 GDAMVIGRMIGNMAFEFENGVEIVVERE 376
>gi|213998836|gb|ACJ60785.1| nucellin [Hordeum bogdanii]
Length = 154
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 66/138 (47%), Gaps = 4/138 (2%)
Query: 229 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 286
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 345
G ++ GD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMRESLFYYSPGLAELLIDNQPIGGNPTFEAVFDSGSTYTH 124
Query: 346 LPTEIYAEVVVKFDKLVS 363
+P +IY E+V K +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142
>gi|149244964|ref|XP_001527016.1| vacuolar aspartic protease precursor [Lodderomyces elongisporus
NRRL YB-4239]
gi|146449410|gb|EDK43666.1| vacuolar aspartic protease precursor [Lodderomyces elongisporus
NRRL YB-4239]
Length = 429
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 143/337 (42%), Gaps = 82/337 (24%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T I +GTP +F V LD GS+ LWVP + C+ L+ + S+YD +SSS
Sbjct: 114 YFTEIQLGTPPQTFKVILDTGSSNLWVPSK--DCSSLACFLH-------SKYDHDASSSY 164
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF---SKHAPQS 226
K + S + GY+ DIL + + ++
Sbjct: 165 KANGSEFSIQYGSGSME--------------------GYISQDILSIGDLVIPKQDFAEA 204
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA------GLIQNSFSIC 280
+ + + G+ DG++GL +SV ++ GL+ +S +
Sbjct: 205 TSEPGLAFAFGKF-----------DGILGLAYDTISVNHIVPPVYNAINQGLL-DSPQVS 252
Query: 281 F-------DENDSGSVFFGDQGPAT-QQSTSFLPIGEKYDAYF-VGVESYCIGN--SCLT 329
F DEND G FG + Q ++LP+ K AY+ V E +G+ + L
Sbjct: 253 FYLGDTNKDENDGGVATFGGYDESLFQGKITWLPVRRK--AYWEVAFEGLGLGDEYAELI 310
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
Q+G A +D+G S LP+ + AE++ K+ ++K SW Y ++ +P
Sbjct: 311 QTG--AAIDTGTSLITLPSTL-AEIINA--KIGATK-------SWSGQYQVDCDKRDSLP 358
Query: 390 DMRLIFSK-NQSFVVRNHIFSFPENEVGDHACFSYFT 425
D+ L FS N + ++I EVG +C S FT
Sbjct: 359 DLTLTFSGYNFTLSAYDYIL-----EVG-GSCISVFT 389
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 88/348 (25%), Positives = 146/348 (41%), Gaps = 46/348 (13%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ + +GTP + + D GS++LW+ QC+ C S Y D ++PS SS+
Sbjct: 81 YFVSLGVGTPPRTVNMVADTGSDVLWL--QCLPC----QSCYGQTD---PLFNPSFSSTF 131
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ--SS 227
++++C LC+ ++ C Y Y D + FS S+
Sbjct: 132 QSITCGSSLCQQLLIRGCRRNQCLYQVSYG-----------DGSFTVGEFSTETLSFGSN 180
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS- 286
+SV IGCG G + A ++GLG G +S PS + + L + FS C +S
Sbjct: 181 AVNSVAIGCGHNNQGLFTGAAG---LLGLGKGLLSFPSQVGQ--LYGSVFSYCLPTREST 235
Query: 287 GSV--FFGDQGPATQQSTSFLPIGEKYDAYF--------VGVESYCIGNSCL-----TQS 331
GSV FG+Q A+ + L K D ++ VG S I L T +
Sbjct: 236 GSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGN 295
Query: 332 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPD 390
G ++DSG + T L T Y + F + S G S + CY+ S + +P
Sbjct: 296 G-GVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPA 354
Query: 391 MRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLEYNFTGILILQK 438
+ +F+ + + P + G + C ++ NF+ I +Q+
Sbjct: 355 VSFVFNGGATMALPAQNIMVPVDNSGTY-CLAFAPNSENFSIIGNIQQ 401
>gi|213998832|gb|ACJ60783.1| nucellin [Hordeum vulgare subsp. spontaneum]
Length = 127
Score = 57.8 bits (138), Expect = 1e-05, Method: Composition-based stats.
Identities = 38/125 (30%), Positives = 62/125 (49%), Gaps = 4/125 (3%)
Query: 236 CGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGD 293
CG KQ +P DG++GLG+G + + L +I +N C G ++ GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSKGKGVLYVGD 60
Query: 294 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYA 352
P T+ T ++P+ E Y G+ I + F+A+ DSG+++T +P +IY
Sbjct: 61 FNPPTRGVT-WVPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHVPAQIYN 119
Query: 353 EVVVK 357
E+V K
Sbjct: 120 EIVSK 124
>gi|297838267|ref|XP_002887015.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297332856|gb|EFH63274.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 324
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 96/215 (44%), Gaps = 43/215 (20%)
Query: 93 PSEGSQTHFFGNQFYWLHYTWI--DIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASY 150
PS S + F ++F + I IGTP + + LD GS L W+ C + P
Sbjct: 55 PSPSSPPYNFRSRFKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP----- 109
Query: 151 YTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDT 203
+ + +DPS SSS + CSHPLCK R +SC S + C Y Y+ + T
Sbjct: 110 -----KPKTSFDPSLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGT 162
Query: 204 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 263
+ G LV + + ++ + + +I+GC + + G++G+ G +
Sbjct: 163 FAEGNLVKEKITFSN-------TEITPPLILGCATESSDD-------RGILGMNRGRL-- 206
Query: 264 PSLLAKAGLIQNSFSICFDEN-----DSGSVFFGD 293
S +++A + + S+ I N +GS + GD
Sbjct: 207 -SFVSQAKITKFSYCIPPKSNRPGFTPTGSFYLGD 240
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/298 (25%), Positives = 125/298 (41%), Gaps = 49/298 (16%)
Query: 115 DIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
++GTP +FL+ALD ++ W+PC C+ C S++ + S+ +S++ K +
Sbjct: 95 NVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSV----------TSTTFKTLG 141
Query: 174 CSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 233
C P CK + C + Y S+ L D + L+ + +
Sbjct: 142 CDAPQCKQVPNPTCGGSTCTWNTTYGGSTILSN--LTRDTIALS--------TDIVPGYT 191
Query: 234 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE----NDSGSV 289
GC +K TGS + P G++GLG G +S L L +++FS C N SG++
Sbjct: 192 FGCIQKTTGSSV---PPQGLLGLGRGPLSF--LSQTQDLYKSTFSYCLPSFRTLNFSGTL 246
Query: 290 FFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVD 338
G G P ++T L + Y+V + +G + +G + D
Sbjct: 247 RLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFD 306
Query: 339 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 396
SG FT L +Y V +F K V + +S G + CY + P M +FS
Sbjct: 307 SGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG-FDTCYTGP----IVAPTMTFMFS 359
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 117/293 (39%), Gaps = 32/293 (10%)
Query: 121 VSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
V+ + LD S++ WV QC C P Y + YDP+ SSSS SC+ P C
Sbjct: 167 VTQTMVLDTASDVTWV--QCSPC-PTPPCY----PQKDVLYDPTKSSSSGVFSCNSPTCT 219
Query: 181 S----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
+ C + + C Y Y + TS++G + D+L + P ++V+ S GC
Sbjct: 220 QLGPYANGCTN-NNQCQYRVRYP-DGTSTAGTYISDLLTI------TPATAVR-SFQFGC 270
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGP 296
GS+ G++ G+M LG G S+ S A FS CF FF P
Sbjct: 271 SHGVQGSFSFGSSAAGIMALGGGPESLVS--QTAATYGRVFSHCFPPPTRRG-FFTLGVP 327
Query: 297 ATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLP 347
L K A Y V +E+ + + T A +DS + T LP
Sbjct: 328 RVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLP 387
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
Y + F ++ + + CY+ + +P + L+F KN +
Sbjct: 388 PTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAA 440
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 144/331 (43%), Gaps = 47/331 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+++ + IG+P + +D GS++ WV QCAP A Y D ++PS SSS
Sbjct: 155 YFSRVGIGSPPKHVYMVVDTGSDVNWV-----QCAPC-ADCYQQAD---PIFEPSFSSSY 205
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
++C CKS + D C Y Y Y V D A+ + S+
Sbjct: 206 APLTCETHQCKSLDVSECRNDSCLYEVSY-----GDGSYTVGD---FATETITLDGSASL 257
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDS 286
++V IGCG G ++ A ++GLG G +S PS + + SFS C D + +
Sbjct: 258 NNVAIGCGHDNEGLFVGAAG---LLGLGGGSLSFPSQINAS-----SFSYCLVNRDTDSA 309
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA--------L 336
++ F P+ + L + Y++G+ +G L+ +S F+ +
Sbjct: 310 STLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGII 369
Query: 337 VDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 392
VDSG + T L +++Y + F + L S+ ++L + CY+ SS ++VP +
Sbjct: 370 VDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVAL----FDTCYDLSSRSSVEVPTVS 425
Query: 393 LIFSKNQSFVVRNHIFSFPENEVGDHACFSY 423
F + + + P + G CF++
Sbjct: 426 FHFPDGKYLALPAKNYLIPVDSAGTF-CFAF 455
>gi|50419019|ref|XP_458031.1| DEHA2C08074p [Debaryomyces hansenii CBS767]
gi|49653697|emb|CAG86094.1| DEHA2C08074p [Debaryomyces hansenii CBS767]
Length = 416
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 160/368 (43%), Gaps = 76/368 (20%)
Query: 77 VKLQSNNNSSRNQLLF--PSEGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLL 134
V + S Q+ F P+EG N ++T I +GTP SF V LD GS+ L
Sbjct: 67 VAFGGQHPGSEAQVPFISPTEGRHEAPLTNYLNAQYFTEIQLGTPGQSFKVILDTGSSNL 126
Query: 135 WVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 194
WVP + C+ L+ + S+Y SSS+ K+ S S++
Sbjct: 127 WVPSE--DCSSLACFLH-------SKYAHDSSST----------YKANGSSFSIQYGSGA 167
Query: 195 IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVM 254
+ Y ++DT + G D I+ F+ +++ + + G+ DG++
Sbjct: 168 MEGYVSQDTLAIG---DLIIPKQDFA----EATSEPGLAFAFGKF-----------DGIL 209
Query: 255 GLGLGDVSVPSLLA------KAGLIQN-SFSICF-----DENDSGSVFFG--DQGPATQQ 300
GL +SV ++ + GL++ F+ +E D G FG D+ T +
Sbjct: 210 GLAYNTISVNKIVPPVYNAIEQGLLEEPRFAFYLGDTSKNEEDGGVATFGGIDEDLYTGK 269
Query: 301 STSFLPIGEKYDAYF-VGVESYCIGN--SCLTQSGFQALVDSGASFTFLPTEIYAEVVVK 357
LP+ K AY+ V E +G+ + LT++G A +D+G S LP+ + AE++
Sbjct: 270 VVD-LPVRRK--AYWEVAFEGIGLGDEYAELTKTG--AAIDTGTSLITLPSSL-AEII-- 321
Query: 358 FDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGD 417
+ +I + SW Y E+ +PD+ L F+ +F + + ++ EVG
Sbjct: 322 ------NSKIGAE-KSWSGQYQIECEKRDSLPDLTLTFA-GHNFTLSPYDYTL---EVGG 370
Query: 418 HACFSYFT 425
+C S FT
Sbjct: 371 -SCISVFT 377
>gi|424513106|emb|CCO66690.1| predicted protein [Bathycoccus prasinos]
Length = 802
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/280 (23%), Positives = 112/280 (40%), Gaps = 48/280 (17%)
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPC-QCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
Y + IGTP F V +D GS +V C C C ++ + YD + SSS
Sbjct: 139 YATVLIGTPGHQFEVIVDTGSTYTFVTCYPCASCGQHGSN---------APYDAAKSSSY 189
Query: 170 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
+ V C L C Y +S ED+ G++V D++ + S P+
Sbjct: 190 ERVPCGSGCIFGACRASGL---CEYDEKFS-EDSQVGGHVVSDVIDVGG-SLGTPR---- 240
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA----GLIQNSFSICFDEND 285
+ GC +T + L +G++ LG + + L K G +F +C +
Sbjct: 241 --IHFGCNSLET-NMLKTQKANGMIALGRAEAGLHRQLKKKAYPPGSYDGTFGLCLGSFE 297
Query: 286 SGSVFFGDQGP----------ATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------ 329
G V + P T ST L G K Y V V + N+ L
Sbjct: 298 GGGVLSLGKLPEQHYANFVTRKTHTSTVKLVKGSKSQYYNVEVHRMFVRNTELKKPSGAE 357
Query: 330 -----QSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVS 363
++G+ ++DSG ++T+L +++ + + DK+V+
Sbjct: 358 LMEAFRAGYGTVLDSGTTYTYLHEDVFIPFISEIEDKVVN 397
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 134/348 (38%), Gaps = 73/348 (20%)
Query: 115 DIGTPNVSFLVALDAGSNLLWVPC---QCIQC--APLSASYYTSLDRNLS---EYDPSSS 166
++G+ + + +D GS+L+W PC +CI C P S + N S S+
Sbjct: 81 NLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSCSAAACSA 140
Query: 167 SSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 218
+ ++S SH SR S C S P Y Y+ D S L D L L +
Sbjct: 141 AHGGSLSASHLCAISRCPLESIEISECSSFSCPPFY---YAYGDGSLVARLYRDSLSLPT 197
Query: 219 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSF 277
+ P + + GC G P GV G G G +S+PS LA + + N F
Sbjct: 198 PAPSPPINV--RNFTFGCAHTTLGE------PVGVAGFGRGVLSMPSQLATFSPQLGNRF 249
Query: 278 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA------------------YFVGVE 319
S C + F D+ + S L +G Y Y VG+
Sbjct: 250 SYCLVSHS----FAADR----VRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLA 301
Query: 320 SYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFD----KLVSSK 365
+GN + F +VDSG +FT LP +Y VV +F+ K+ +
Sbjct: 302 GISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRA 361
Query: 366 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSF 410
R + CY E + VP + L F +S VV +N+ + F
Sbjct: 362 RRIEENTGLSPCY--YYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEF 407
>gi|225719388|gb|ACO15540.1| Cathepsin D precursor [Caligus clemensi]
Length = 362
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/344 (24%), Positives = 138/344 (40%), Gaps = 68/344 (19%)
Query: 111 YTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSK 170
YT I IGTP SF + G N LWV PSS+ +
Sbjct: 43 YTEITIGTPPQSFKTVIHLGHNELWV--------------------------PSSTCGAP 76
Query: 171 NVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 229
NV C +H S +S +KD + Y +SG+L D K
Sbjct: 77 NVPCKTHNQYDSGNSSTHVKDGSKFNVKYKI--GKASGFLSQD--------KVCVDGVCM 126
Query: 230 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FSICFD------ 282
G ++ DGV+GLG G S + L G I++ FS+ +
Sbjct: 127 EEQTFGEATSESMDPFANVYHDGVLGLGFGKDSFLNSLLDQGRIESPLFSLWVNRQPFRS 186
Query: 283 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-----GNSCLTQSGFQALV 337
+N+S V G + S++P+ D + VG++S I G +T+ G +
Sbjct: 187 KNNSRLVLGGIDTGHYSGNISYIPLNSD-DVWRVGMKSISIKGVHRGCGFITRPGCDVVF 245
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
D+G+ FT+ P + A+ + ++ + + +I+ S+ Y Y E+L +P++ L+F +
Sbjct: 246 DAGSRFTYGPI-LEAKTI---NRWIGATQIA---PSYGY-YKVRCNEILTLPNVELVF-E 296
Query: 398 NQSFVVRNHIFSFPENEVGDHACFSYF---------TLEYNFTG 432
+ + V++ + +G C S F TL NF G
Sbjct: 297 DLTLVLKPKDYIVETKILGMKTCMSGFVGLTKQESWTLGANFFG 340
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 82/308 (26%), Positives = 129/308 (41%), Gaps = 50/308 (16%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++T + +GTP + LD GS+++W+ QC+ CA Y D ++P++SS+
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWI--QCLPCAKC----YGQTD---PLFNPAASSTY 203
Query: 170 KNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP--Q 225
+ V C+ PLCK S C++ K C Y Y D + FS +
Sbjct: 204 RKVPCATPLCKKLDISGCRN-KRYCEYQVSYG-----------DGSFTVGDFSTETLTFR 251
Query: 226 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DEN 284
V V +GCG G ++ A G+ G +S PS FS C D +
Sbjct: 252 GQVIRRVALGCGHDNEGLFIGAAGLLGLG---RGSLSFPS--QTGAQFSKRFSYCLVDRS 306
Query: 285 DSG---SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLTQ---SGFQ- 334
SG S+ FG A +S F P+ K D Y+V + +G LT S F+
Sbjct: 307 ASGTASSLIFGK--AAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRM 364
Query: 335 -------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
++DSG S T L Y+ + F + + + + + CY+ S + +K
Sbjct: 365 DATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSLFDTCYDLSGLKTVK 424
Query: 388 VPDMRLIF 395
VP + F
Sbjct: 425 VPTLVFHF 432
>gi|392586802|gb|EIW76137.1| Asp-domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 409
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/312 (25%), Positives = 129/312 (41%), Gaps = 60/312 (19%)
Query: 95 EGSQTHFFGNQFYWLHYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSL 154
EG T N ++T I++G+P +F V LD GS+ LWVP QC ++ +
Sbjct: 83 EGGHTVPLSNFMNAQYFTEIELGSPAQTFKVILDTGSSNLWVPSA--QCTSIACFLH--- 137
Query: 155 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 214
++YD SSS+S K + + Y T S G++ D L
Sbjct: 138 ----AKYDSSSSASYK------------------ANGTEFSIQYGT--GSMEGFVSQDTL 173
Query: 215 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LA 268
+ S + Q +++ K+ G DG++GLG +SV + +
Sbjct: 174 KIGDVSI-SHQDFAEAT-------KEPGLTFAFGKFDGILGLGYDTISVNHITPPVYNMI 225
Query: 269 KAGLIQN---SFSICFDENDSG-SVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCI 323
GL+ SF + E+D G +VF G A ++P+ K AY+ V +E
Sbjct: 226 NQGLLDEPLFSFRLGSSESDGGEAVFGGIDHSAYTGDIEYVPVRRK--AYWEVELEKVSF 283
Query: 324 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 383
G L A +D+G S LPT++ AE++ + + +KR SW Y
Sbjct: 284 GGDELELESTGAAIDTGTSLIALPTDV-AEML---NTQIGAKR------SWNGQYTIDCS 333
Query: 384 EMLKVPDMRLIF 395
++ +PD F
Sbjct: 334 KVPSLPDFTFYF 345
>gi|213998840|gb|ACJ60787.1| nucellin [Hordeum patagonicum subsp. magellanicum]
Length = 154
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 66/138 (47%), Gaps = 4/138 (2%)
Query: 229 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 286
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 345
G ++ GD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124
Query: 346 LPTEIYAEVVVKFDKLVS 363
+P +IY E++ K +S
Sbjct: 125 VPAQIYNEILSKVRGTLS 142
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 73/319 (22%), Positives = 129/319 (40%), Gaps = 48/319 (15%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
+ T + +GTP + +V +D GS+ WV C+C C N + S S++
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHT-----------NPRTFLQSRSTTC 49
Query: 170 KNVSCSHPLC---KSRSSCKSLKD--PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 224
VSC +C S C+ ++ CP+ Y + ++S G L D L + K
Sbjct: 50 AKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQ-DGSASYGILYQDTLTFSDVQKIP- 107
Query: 225 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 284
S GC G+ G DG++G+G G +SV L ++ + FS C
Sbjct: 108 ------SFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQ 157
Query: 285 DSGSVFF---------GDQGPATQ-QSTSFLPIGEKYDAYFVGV-------ESYCIGNSC 327
S FF G T + T + + + +FV + E + S
Sbjct: 158 KSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSI 217
Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
++ G + DSG+ +++P + + + +L+ +R + + S + CY+ S +
Sbjct: 218 FSRKG--VVFDSGSELSYIPDRALSVLSQRIRELL-LRRGAAEEESERNCYDMRSVDEGD 274
Query: 388 VPDMRLIFSKNQSFVVRNH 406
+P + L F F + H
Sbjct: 275 MPAISLHFDDGARFDLGRH 293
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 83/310 (26%), Positives = 132/310 (42%), Gaps = 45/310 (14%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSS 169
++ I +GTP S + D GS++ W +QC+P Y + ++PS SSS
Sbjct: 14 YFARIGVGTPARSVYMVADTGSDVSW-----LQCSPCRKCY----RQQDPIFNPSLSSSF 64
Query: 170 KNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
K ++C+ +C K + S K+ C Y Y + + + + SF +HA +
Sbjct: 65 KPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETL----SFGEHAVR--- 117
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-- 286
SV +GCGR G + A ++GLG G +S PS + + FS C +S
Sbjct: 118 --SVAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAI 170
Query: 287 -GSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQ 334
S+ FG P + T LP Y+VG+ + S + ++
Sbjct: 171 AASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGG 230
Query: 335 ALVDSGASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASSEEMLKVPDM 391
+VDSG + + L T Y + F LV S+ ISL + CY+ SS + +P +
Sbjct: 231 VIVDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL----FDTCYDLSSMKTATLPAV 286
Query: 392 RLIFSKNQSF 401
L F S
Sbjct: 287 VLDFDGGASM 296
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 89/353 (25%), Positives = 136/353 (38%), Gaps = 76/353 (21%)
Query: 110 HYTWIDIGTPNVSFLVALDAGSNLLWVPCQCIQC----------------APLSASYYTS 153
++ +GTP FL+ D GS+L WV C+ AP S +
Sbjct: 55 YFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSV 114
Query: 154 LDRNLSE---YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSS 205
S + P S + + CS C S ++C + PC Y +Y +D S+
Sbjct: 115 SAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAY--EYRYKDGSA 172
Query: 206 S-GYLVDDILHLASFSKHAPQSSVQS---SVIIGCGRKQTG-SYLDGAAPDGVMGLGLGD 260
+ G + D +A + A + ++ V++GC TG S+L A DGV+ LG +
Sbjct: 173 ARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL---ASDGVLSLGYSN 229
Query: 261 VSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQ------------------GPA 297
VS S A FS C N + + FG P
Sbjct: 230 VSFASR--AAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPG 287
Query: 298 TQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTE 349
+Q T L Y V V + L Q G A++DSG S T L +
Sbjct: 288 ARQ-TPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGAILDSGTSLTVLVSP 346
Query: 350 IYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASS----EEM-LKVPDMRLIFS 396
Y VV KLV R+++ + + YCYN +S E++ + VP + + F+
Sbjct: 347 AYRAVVAALGKKLVGLPRVAM--DPFDYCYNWTSPLTGEDLAVAVPALAVHFA 397
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 143/371 (38%), Gaps = 53/371 (14%)
Query: 65 LLSNDWKRQKTRVK-LQSNNNSSRNQLLFPSEGSQTHFFGNQFYWLHYTWIDIGTPNVSF 123
LLS R K RV LQS + ++ P ++ + +L + IGTP + +
Sbjct: 47 LLSRAIARSKARVAALQSA--AVLPPVVDPITAARVLVTASSGEYL--VDLAIGTPPLYY 102
Query: 124 LVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSR 182
+D GS+L+W C C+ CA D+ +D S++ + + C C S
Sbjct: 103 TAIMDTGSDLIWTQCAPCLLCA----------DQPTPYFDVKKSATYRALPCRSSRCASL 152
Query: 183 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 242
SS K C Y Y + S++G L ++ +F +++ GCG G
Sbjct: 153 SSPSCFKKMCVY-QYYYGDTASTAGVLANETF---TFGAANSTKVRATNIAFGCGSLNAG 208
Query: 243 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFG------- 292
D A G++G G G +S+ S L + FS C S + ++FG
Sbjct: 209 ---DLANSSGMVGFGRGPLSLVSQLGP-----SRFSYCLTSYLSATPSRLYFGVYANLSS 260
Query: 293 --DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF----------QALVDSG 340
+ QST F+ + YF+ +++ +G L ++DSG
Sbjct: 261 TNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSG 320
Query: 341 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN--ASSEEMLKVPDMRLIF-SK 397
S T+L + Y V + ++ C+ + VPD+ F S
Sbjct: 321 TSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSA 380
Query: 398 NQSFVVRNHIF 408
N + + N++
Sbjct: 381 NMTLLPENYML 391
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 146/366 (39%), Gaps = 58/366 (15%)
Query: 69 DWKRQKTR------VKLQSNNNSSRNQLLFPS-EGSQTHF---FGNQFYWLHYTWIDIGT 118
DW R+ + ++++S N R + + E SQT G L+Y + +G
Sbjct: 13 DWNRRLQKQLISDDLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLNYI-VTMGL 71
Query: 119 PNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHP 177
+ + V +D GS+L WV C+ C+ C ++ + PS+SSS ++VSC+
Sbjct: 72 GSTNMTVIIDTGSDLTWVQCEPCMSC----------YNQQGPIFKPSTSSSYQSVSCNSS 121
Query: 178 LCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 230
C+S +C S C Y+ +Y + + ++G L + L S S
Sbjct: 122 TCQSLQFATGNTGACGSNPSTCNYVVNYG-DGSYTNGELGVEQLSFGGVSV--------S 172
Query: 231 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--- 287
+ GCGR G + G+MGLG +S+ S FS C +SG
Sbjct: 173 DFVFGCGRNNKGLF---GGVSGLMGLGRSYLSLVS--QTNATFGGVFSYCLPTTESGASG 227
Query: 288 -------SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF---QALV 337
S F + P T T LP + + Y + + + L F L+
Sbjct: 228 SLVMGNESSVFKNVTPITY--TRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLI 285
Query: 338 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 397
DSG T LP+ +Y + F K + + + C+N + + + +P + + F
Sbjct: 286 DSGTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMHFEG 345
Query: 398 NQSFVV 403
N V
Sbjct: 346 NAELKV 351
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 81/367 (22%), Positives = 129/367 (35%), Gaps = 61/367 (16%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQ-----------CAPLSASYYTSLDRNLSEYD 162
+ IGTP + + + LD ++L W+ C+ + +S + + + + Y
Sbjct: 129 VRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAKEASKNWYR 188
Query: 163 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYL---VDDILHLASF 219
P+ SSS + + CS C PY S S Y D + + +
Sbjct: 189 PAKSSSWRRIRCSQKECAV----------LPYNTCQSPSKAESCSYFQKTQDGTVTIGIY 238
Query: 220 SKHAPQSSVQS-------SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 272
K +V +I+GC + G +D A DGV+ LG GD+S AK
Sbjct: 239 GKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVHAAKR-- 294
Query: 273 IQNSFSICF-----DENDSGSVFFGDQ----GPATQQSTSFLPI------GEKYDAYFVG 317
FS C + S + FG GP T ++ + G + VG
Sbjct: 295 FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVG 354
Query: 318 VESYCIGNSCLTQSGF---QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 374
E I + F ++D+ S T L E YA V D+ +S + +
Sbjct: 355 GERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGF 414
Query: 375 KYCYN-------ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEVGDHACFSYFTLE 427
+YCY + +P + + PE E G AC ++ L
Sbjct: 415 EYCYKWTFTGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGV-ACLAFRKLL 473
Query: 428 YNFTGIL 434
GIL
Sbjct: 474 RGGPGIL 480
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 72/317 (22%), Positives = 128/317 (40%), Gaps = 50/317 (15%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ +GTP+ + +V +D GS+ WV C+C C N + S S++ VS
Sbjct: 5 VGLGTPSKTQIVEIDTGSSTSWVFCECDGCHT-----------NPRTFLQSRSTTCAKVS 53
Query: 174 CSHPLCKSRSSCKSLKDP-----CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSV 228
C +C S +D CP+ Y + ++S G L D L + K
Sbjct: 54 CGTSMCLLGGSDPHCQDSENYPDCPFRVSYQ-DGSASYGILYQDTLTFSDVQKIP----- 107
Query: 229 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 288
GC G+ G DG++G+G G +SV L ++ + FS C S
Sbjct: 108 --GFTFGCNMDSFGANEFGNV-DGLLGMGAGQMSV---LKQSSPTFDGFSYCLPLQMSER 161
Query: 289 VFF---------GDQGPATQ---QSTSFLPIGEKYDAYFVGV-------ESYCIGNSCLT 329
FF G + AT+ + T + + + +FV + E + S +
Sbjct: 162 GFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFS 221
Query: 330 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 389
+ G + DSG+ +++P + + + +L+ +R + + S + CY+ S + +P
Sbjct: 222 RKG--VVFDSGSELSYIPDRALSVLSQRIRELL-LRRGAAEEESERNCYDMRSVDEGDMP 278
Query: 390 DMRLIFSKNQSFVVRNH 406
+ L F F + H
Sbjct: 279 AISLHFDDGARFDLGRH 295
>gi|213998818|gb|ACJ60776.1| nucellin [Hordeum patagonicum subsp. setifolium]
Length = 149
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 66/138 (47%), Gaps = 4/138 (2%)
Query: 229 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 286
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 287 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 345
G ++ GD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124
Query: 346 LPTEIYAEVVVKFDKLVS 363
+P +IY E++ K +S
Sbjct: 125 VPAQIYNEILSKVRGTLS 142
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/321 (24%), Positives = 128/321 (39%), Gaps = 62/321 (19%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
I +GTP ++F V D GS+L+W QCAP + + + + P+SSS+ +
Sbjct: 90 ISVGTPLLTFSVVADTGSDLIWT-----QCAPCTKCF----QQPAPPFQPASSSTFSKLP 140
Query: 174 CSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL--ASFSKHAPQSS 227
C+ C+ S +C + C Y Y + T +GYL + L + ASF
Sbjct: 141 CTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASF-------- 188
Query: 228 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 287
SV GC + G + G+ GLG G + SL+ + G+ FS C +
Sbjct: 189 --PSVAFGCSTENG----VGNSTSGIAGLGRGAL---SLIPQLGV--GRFSYCLRSGSAA 237
Query: 288 S---VFFGDQGPATQ---QSTSFL---PIGEKYDAYFVGVESYCIGNSCL---------T 329
+ FG T QST F+ + Y Y+V + +G + L T
Sbjct: 238 GASPILFGSLANLTDGNVQSTPFVNNPAVHPSY--YYVNLTGITVGETDLPVTTSTFGFT 295
Query: 330 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS--SEEM 385
Q+G +VDSG + T+L + Y V F + C+ ++
Sbjct: 296 QNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGG 355
Query: 386 LKVPDMRLIFSKNQSFVVRNH 406
+ VP + L F + V +
Sbjct: 356 IAVPSLVLRFDGGAEYAVPTY 376
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 125/309 (40%), Gaps = 43/309 (13%)
Query: 102 FGNQFYWLHY-TWIDIGTPNVSFLVALDAGSNLLWVPCQ-CIQCAPLSASYYTSLDRNLS 159
G L Y + +G+P S + +D GS++ WV C+ C QC ++ D
Sbjct: 124 LGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQC-------HSQAD---P 173
Query: 160 EYDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 215
+DPSSSS+ SCS C + + C S + C Y Y + +S++G D L
Sbjct: 174 LFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQ--CQYTVTYG-DGSSTTGTYSSDTLA 230
Query: 216 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 275
L S + Q GC ++G + D DG+MGLG G S+ S AG
Sbjct: 231 LGSNAVRKFQ--------FGCSNVESG-FND--QTDGLMGLGGGAQSLVS--QTAGTFGA 277
Query: 276 SFSICFDENDSGSVFF----GDQG----PATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 327
+FS C S S F G G P + S G + A VG I S
Sbjct: 278 AFSYCLPATSSSSGFLTLGAGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSV 337
Query: 328 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 387
+ ++DSG T LP Y+ + F + + C++ S + +
Sbjct: 338 FSAG---TIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVS 394
Query: 388 VPDMRLIFS 396
+P + L+FS
Sbjct: 395 IPTVALVFS 403
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 128/332 (38%), Gaps = 66/332 (19%)
Query: 114 IDIGTPNVSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVS 173
+ IGTP V F+ D GS+L W QC P + ++ YD ++SSS +
Sbjct: 87 LAIGTPPVPFIALADTGSDLTWT-----QCKPCKLCF----GQDTPIYDTTTSSSFSPLP 137
Query: 174 CSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 231
CS C S C + C Y Y DD ++S SV
Sbjct: 138 CSSATCLPIWSSRCSTPSATCRYR------------YAYDD----GAYSPECAGISV-GG 180
Query: 232 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC----FDENDSG 287
+ GCG G + G +GLG G + SL+A+ G+ FS C F+ + S
Sbjct: 181 IAFGCGVDNGGLSYNST---GTVGLGRGSL---SLVAQLGV--GKFSYCLTDFFNTSLSS 232
Query: 288 SVFFGDQGPATQ----------QSTSFLPIGEKYDAYFVGVESYCIGNSCLT-------- 329
VFFG QST + Y+V +E +G++ L
Sbjct: 233 PVFFGSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDL 292
Query: 330 ---QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS---E 383
+VDSG FT L E VVV V + + + + C+ A + +
Sbjct: 293 NDDDGSGGMIVDSGTIFTIL-VETGFRVVVDHVAGVLGQPVVNASSLDRPCFPAPAAGVQ 351
Query: 384 EMLKVPDMRLIFSKNQSFVV-RNHIFSFPENE 414
E+ +PDM L F+ + R++ SF E E
Sbjct: 352 ELPDMPDMVLHFAGGADMRLHRDNYMSFNEEE 383
>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 108/288 (37%), Gaps = 65/288 (22%)
Query: 127 LDAGSNLLWVPCQ---CIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCKSRS 183
LD GS+L+W PCQ CI C + TSL S P S ++ VSC C +
Sbjct: 97 LDTGSDLVWFPCQPFECILCE--GKAENTSLA---STPPPKLSKTATPVSCKSSACSAAH 151
Query: 184 SCKSLKDPCPYIADYSTEDTSSSG----------YLVDDILHLASFSKHA-------PQS 226
S D C I++ E +S Y D +A + + P +
Sbjct: 152 SNLPSSDLCA-ISNCPLESIETSDCQKHSCPQFYYAYGDGSLIARLYRDSISLPLSNPTN 210
Query: 227 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSIC----- 280
+ ++ GC A P GV G G G +S+P+ LA + + N FS C
Sbjct: 211 LIVNNFTFGCAHTAL------AEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLVSHS 264
Query: 281 ----------------FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 324
+D ++ G P TS L E Y VG+E IG
Sbjct: 265 FDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVY-TSMLDNLEHPYFYCVGLEGISIG 323
Query: 325 NSCLTQSGFQA----------LVDSGASFTFLPTEIYAEVVVKFDKLV 362
+ GF +VDSG +FT LP +Y VV +F+ V
Sbjct: 324 RKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRV 371
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 117/293 (39%), Gaps = 32/293 (10%)
Query: 121 VSFLVALDAGSNLLWVPCQCIQCAPLSASYYTSLDRNLSEYDPSSSSSSKNVSCSHPLCK 180
V+ + LD S++ WV QC C P Y + YDP+ SSSS SC+ P C
Sbjct: 142 VTQTMVLDTASDVTWV--QCSPC-PTPPCY----PQKDVLYDPTKSSSSGVFSCNSPTCT 194
Query: 181 S----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGC 236
+ C + + C Y Y + TS++G + D+L + P ++V+ S GC
Sbjct: 195 QLGPYANGCTN-NNQCQYRVRYP-DGTSTAGTYISDLLTI------TPATAVR-SFQFGC 245
Query: 237 GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGP 296
GS+ G++ G+M LG G S+ + A FS CF FF P
Sbjct: 246 SHGVQGSFSFGSSAAGIMALGGGPESL--VSQTAATYGRVFSHCFPPPTRRG-FFTLGVP 302
Query: 297 ATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLP 347
L K A Y V +E+ + + T A +DS + T LP
Sbjct: 303 RVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLP 362
Query: 348 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 400
Y + F ++ + + CY+ + +P + L+F KN +
Sbjct: 363 PTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAA 415
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.133 0.402
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,908,184,071
Number of Sequences: 23463169
Number of extensions: 285177209
Number of successful extensions: 713110
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 302
Number of HSP's successfully gapped in prelim test: 2582
Number of HSP's that attempted gapping in prelim test: 708296
Number of HSP's gapped (non-prelim): 3577
length of query: 438
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 292
effective length of database: 8,933,572,693
effective search space: 2608603226356
effective search space used: 2608603226356
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)