BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 044367
(450 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 211/455 (46%), Positives = 278/455 (61%), Gaps = 27/455 (5%)
Query: 9 LLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLL---YNPNDTVDAQAQRTLN 65
L+SL L FT+ + T +PK+LVTKL+H S+L +NPN +V +A+R +
Sbjct: 7 LVSLGLLIFTT--LVTGNIVEAYNAQPKQLVTKLIHWGSILSPYFNPNASVAERAERIVK 64
Query: 66 MSMARFIYLSQKSSQKAH--DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLI 123
S R YL + H D +L P + P+F VNFS+GQP PQLA++DTGS+++
Sbjct: 65 TSATRIAYLYAQIKGDIHMNDFELNLLPS-TYEPLFLVNFSMGQPATPQLAIMDTGSNIL 123
Query: 124 WVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTNDCGGYPD---ECWYNIRYTNGP 177
WV+C PC++C DPSKS TYA+LPC ++ C Y + +C YN+ Y G
Sbjct: 124 WVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTMCHYAPSAYCNRLNQCGYNLSYATGL 183
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
S G + +EQ F +SDEG + V FGCSH N + D +FTGVFGLG +S V
Sbjct: 184 SSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHENGDYKDRRFTGVFGLGKGITS---FVT 240
Query: 238 KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLD 297
++GSKFSYC+GN+ Y YN L+ GE A EG STP+ V++G YYVTLEGIS+GEK LD
Sbjct: 241 RMGSKFSYCLGNIADPHYGYNQLVFGEKANFEGYSTPLKVVNGHYYVTLEGISVGEKRLD 300
Query: 298 IDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-- 355
ID F A IDSGT LTWL SA++ L EV L G+L P W
Sbjct: 301 IDSTAFSMKGNEKSA--LIDSGTALTWLAESAFRALDNEVRQLLDGVLM-----PFWRGS 353
Query: 356 -LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKD 414
CY G +++DL GFP + FHF+GGADL LD ES+FYQ + + C+AV + G FK
Sbjct: 354 FACYKGTVSQDLIGFPVVTFHFSGGADLDLDTESMFYQATPDILCIAVRQASAYGNDFKS 413
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLAD 449
S+IG++AQQ YN+AYDL S +L+FQRIDC+LL D
Sbjct: 414 FSVIGLMAQQYYNMAYDLNSNKLFFQRIDCQLLVD 448
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 208/426 (48%), Positives = 267/426 (62%), Gaps = 34/426 (7%)
Query: 34 KPKRLVTKLLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQK--AHDTRAH 88
+P RLVTKL+HRDS++ Y NDTV + +RT+ S+AR YL K + +D +
Sbjct: 33 QPTRLVTKLIHRDSIVSPYYRSNDTVADRTERTMKASLARLSYLYAKIERDFDINDLWLN 92
Query: 89 LHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT----FDPSKSL 144
LHP S P+F VNFS+GQPPVPQLA++DTGSSL+W++C PC+ C FDPS S
Sbjct: 93 LHPSASE-PLFLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISS 151
Query: 145 TYATLPCDSSYCTNDCGGYPD---ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
TY +L C + C G D +C YN Y G S G I +EQ F +SDEG+ +
Sbjct: 152 TYDSLSCKNIICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVN 211
Query: 202 DVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
+V FGCSH N ++ D +FTGVFGLG S S+V ++GSKFSYCIGN+ +Y+YN L+
Sbjct: 212 NVLFGCSHRNGNYKDRRFTGVFGLG---SGITSVVNQMGSKFSYCIGNIADPDYSYNQLV 268
Query: 262 LGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
L EG +EG STP+ V+DG Y V LEGIS+GE L IDP+ FK+ T V IDSGT
Sbjct: 269 LSEGVNMEGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKR--TEKQRRVIIDSGTA 326
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGAD 381
TWL + Y+ L +EV +L L P LCY G + +DL GFPA+ FHFA GAD
Sbjct: 327 PTWLAENEYRALEREVRNLLDRFLT--PFMRESFLCYKGKVGQDLVGFPAVTFHFAEGAD 384
Query: 382 LVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
LV+D E +SV+ G+ FKD S+IG++AQQ YNVAYDL +L+FQR
Sbjct: 385 LVVDTE----MRQASVY----------GKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQR 430
Query: 442 IDCELL 447
IDCELL
Sbjct: 431 IDCELL 436
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 194/427 (45%), Positives = 262/427 (61%), Gaps = 34/427 (7%)
Query: 34 KPKRLVTKLLHRDSLL--YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHP 91
KP RLVT L+H+DS+L Y D + + +RT A FI +++ A D
Sbjct: 37 KPLRLVTGLIHQDSILSSYQSLDRNNVERRRT---RRAAFITDEIQANMVADDRGQ---- 89
Query: 92 GISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYAT 148
F VNFS+G+PPVPQL +DTGS L+WV+C+PC C FDPSKS TY
Sbjct: 90 ------AFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVD 143
Query: 149 LPCDSSYCTNDCG---GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
L DS C N + ++C YN Y +G S G + +E FETSD+G + V F
Sbjct: 144 LSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVF 203
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
GC H+N D Q +G+ GL ++ S+V ++GS+FSYCIG+L Y +N L+LG+G
Sbjct: 204 GCGHSNRGRFDGQQSGILGL---SAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQLVLGDG 260
Query: 266 AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+EG STP +G YYVTLEGIS+GE LDI+P +F++ ++ GV +DSGTT T+L
Sbjct: 261 VKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTES-GQGGVVMDSGTTATFL 319
Query: 326 VPSAYQTLRKEVEDLFQGLLPS--YPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
+ L E++ L +G Y P W LCY G +N DL+GFP +AFHFA GADLV
Sbjct: 320 AKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGADLV 378
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDL-SIIGMIAQQNYNVAYDLVSKQLYFQRI 442
LDA S+F Q++ VFCLAV S++ K++ S+IG++AQQ+YNVAYDL+ K++YFQR
Sbjct: 379 LDANSLFVQKNQDVFCLAVLESNL-----KNIGSVIGIMAQQHYNVAYDLIGKRVYFQRT 433
Query: 443 DCELLAD 449
DCELL D
Sbjct: 434 DCELLED 440
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 348 bits (894), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 194/427 (45%), Positives = 262/427 (61%), Gaps = 34/427 (7%)
Query: 34 KPKRLVTKLLHRDSLL--YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHP 91
KP RLVT L+H+DS+L Y D + + +RT A FI +++ A D
Sbjct: 5 KPLRLVTGLIHQDSILSSYQSLDRNNVERRRT---RRAAFIXDEIQANMVADDRGQ---- 57
Query: 92 GISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYAT 148
F VNFS+G+PPVPQL +DTGS L+WV+C+PC C FDPSKS TY
Sbjct: 58 ------AFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVD 111
Query: 149 LPCDSSYCTNDCG---GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
L DS C N + ++C YN Y +G S G + +E FETSD+G + V F
Sbjct: 112 LSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVF 171
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
GC H+N D Q +G+ GL ++ S+V ++GS+FSYCIG+L Y +N L+LG+G
Sbjct: 172 GCGHSNRGRFDGQQSGILGL---SAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQLVLGDG 228
Query: 266 AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+EG STP +G YYVTLEGIS+GE LDI+P +F++ ++ GV +DSGTT T+L
Sbjct: 229 VKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTES-GQGGVVMDSGTTATFL 287
Query: 326 VPSAYQTLRKEVEDLFQGLLPS--YPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
+ L E++ L +G Y P W LCY G +N DL+GFP +AFHFA GADLV
Sbjct: 288 AKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGADLV 346
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDL-SIIGMIAQQNYNVAYDLVSKQLYFQRI 442
LDA S+F Q++ VFCLAV S++ K++ S+IG++AQQ+YNVAYDL+ K++YFQR
Sbjct: 347 LDANSLFVQKNQDVFCLAVLESNL-----KNIGSVIGIMAQQHYNVAYDLIGKRVYFQRT 401
Query: 443 DCELLAD 449
DCELL D
Sbjct: 402 DCELLED 408
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 348 bits (893), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 194/427 (45%), Positives = 262/427 (61%), Gaps = 34/427 (7%)
Query: 34 KPKRLVTKLLHRDSLL--YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHP 91
KP RLVT L+H+DS+L Y D + + +RT A FI +++ A D
Sbjct: 5 KPLRLVTGLIHQDSILSSYQSLDRNNVERRRT---RRAAFITDEIQANMVADDRGQ---- 57
Query: 92 GISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYAT 148
F VNFS+G+PPVPQL +DTGS L+WV+C+PC C FDPSKS TY
Sbjct: 58 ------AFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVD 111
Query: 149 LPCDSSYCTNDCG---GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
L DS C N + ++C YN Y +G S G + +E FETSD+G + V F
Sbjct: 112 LSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVF 171
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
GC H+N D Q +G+ GL ++ S+V ++GS+FSYCIG+L Y +N L+LG+G
Sbjct: 172 GCGHSNRGRFDGQQSGILGL---SAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQLVLGDG 228
Query: 266 AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+EG STP +G YYVTLEGIS+GE LDI+P +F++ ++ GV +DSGTT T+L
Sbjct: 229 VKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTES-GQGGVVMDSGTTATFL 287
Query: 326 VPSAYQTLRKEVEDLFQGLLPS--YPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
+ L E++ L +G Y P W LCY G +N DL+GFP +AFHFA GADLV
Sbjct: 288 AKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW-LCYKGRVNEDLRGFPELAFHFAEGADLV 346
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDL-SIIGMIAQQNYNVAYDLVSKQLYFQRI 442
LDA S+F Q++ VFCLAV S++ K++ S+IG++AQQ+YNVAYDL+ K++YFQR
Sbjct: 347 LDANSLFVQKNQDVFCLAVLESNL-----KNIGSVIGIMAQQHYNVAYDLIGKRVYFQRT 401
Query: 443 DCELLAD 449
DCELL D
Sbjct: 402 DCELLED 408
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 348 bits (892), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 204/458 (44%), Positives = 278/458 (60%), Gaps = 29/458 (6%)
Query: 9 LLSLITLPF-TSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLL---YNPNDTVDAQAQRTL 64
LL +TL F ST I +ST KP RL TKL+HR+S L Y+ N+TV+ +++R
Sbjct: 11 LLPSLTLAFYLSTAIISSTLITT---KPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQ 67
Query: 65 NMSMARFIYLSQKSSQ---KAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSS 121
S+ RF +L K + ++ R+ L P + F VN SIG PPV QL V+DTGSS
Sbjct: 68 TSSIERFDFLESKIKELKSVGNEARSSLIP-FNRGSGFLVNLSIGSPPVTQLVVVDTGSS 126
Query: 122 LIWVKCQPCEQC---GATTFDPSKSLTYATLPCD-SSYCTNDCGGYP----DECWYNIRY 173
L+WV+C PC C + FDP KS+++ TL C Y N GY ++ Y +RY
Sbjct: 127 LLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGY--NYINGYKCNRFNQAEYKLRY 184
Query: 174 TNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS-DEQFTGVFGLGPATSST 232
G SQG + E FET DEGK ++ FGC H N + D+ + GVFGLG T
Sbjct: 185 LGGDSSQGILAKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHIT 244
Query: 233 HSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLG 292
+ ++G+KFSYCIG++N Y +N L+LG+G+ +EGDSTP+ + G YYVTL+ IS+G
Sbjct: 245 --MATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVG 302
Query: 293 EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP 352
K L IDPN FK + S GV IDSG T T L ++ L E+ DL +GLL P
Sbjct: 303 SKTLKIDPNAFKISSDGS-GGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 361
Query: 353 AWH-LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGER 411
+ LC+ G ++RDL GFPA+ FHFAGGADLVL++ S+F Q FCLA+ PS+
Sbjct: 362 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSN---SE 418
Query: 412 FKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLAD 449
+LS+IG++AQQNYNV +DL +++F+RIDC+LL +
Sbjct: 419 LLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 205/471 (43%), Positives = 280/471 (59%), Gaps = 42/471 (8%)
Query: 9 LLSLITLPF-TSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLL---YNPNDTVDAQAQRTL 64
LL +TL F ST I +ST KP RL TKL+HR+S L Y+ N+TV+ +++R
Sbjct: 11 LLPSLTLAFYLSTAIISSTLITT---KPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQ 67
Query: 65 NMSMARFIYLSQKSSQ---KAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSS 121
S+ RF +L K + ++ R+ L P + F VN SIG PPV QL V+DTGSS
Sbjct: 68 TSSIERFDFLESKIKELKSVGNEARSSLIP-FNRGSGFLVNLSIGSPPVTQLVVVDTGSS 126
Query: 122 LIWVKCQPCEQC---GATTFDPSKSLTYATLPCD-SSYCTNDCGGYP----DECWYNIRY 173
L+WV+C PC C + FDP KS+++ TL C Y N GY ++ Y +RY
Sbjct: 127 LLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGY--NYINGYKCNRFNQAEYKLRY 184
Query: 174 TNGPDSQGTIGSEQFNFETSDEGKTFLYD-------------VGFGCSHNNAHFS-DEQF 219
G SQG + E FET DEG+ F Y+ + FGC H N + D+ +
Sbjct: 185 LGGDSSQGILAKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAY 244
Query: 220 TGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVID 279
GVFGLG T + ++G+KFSYCIG++N Y +N L+LG+G+ +EGDSTP+ +
Sbjct: 245 NGVFGLGAYPHIT--MATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHF 302
Query: 280 GSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVED 339
G YYVTL+ IS+G K L IDPN FK + S GV IDSG T T L ++ L E+ D
Sbjct: 303 GHYYVTLQSISVGSKTLKIDPNAFKISSDGS-GGVLIDSGMTYTKLANGGFELLYDEIVD 361
Query: 340 LFQGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVF 398
L +GLL P + LC+ G ++RDL GFPA+ FHFAGGADLVL++ S+F Q F
Sbjct: 362 LMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRF 421
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLAD 449
CLA+ PS+ +LS+IG++AQQNYNV +DL +++F+RIDC+LL +
Sbjct: 422 CLAILPSN---SELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 469
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 197/444 (44%), Positives = 266/444 (59%), Gaps = 33/444 (7%)
Query: 21 RIFTSTTAAPAAGKPKRLVTKLLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQK 77
R S+T ++GKP+RLV+KL+H S+ Y PN+T + + + S AR + +
Sbjct: 18 RCCFSSTNTISSGKPQRLVSKLIHPGSVHHPHYKPNETAKDRMELDIQHSAARLANIQAR 77
Query: 78 ---SSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG 134
S +D +A + P ++ + N SIGQPP+PQL V+DTGS ++WV C PC C
Sbjct: 78 IEGSLVSNNDYKARVSPSLTGRTIM-ANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCD 136
Query: 135 ---ATTFDPSKSLTYATL---PCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQF 188
FDPSKS T++ L PCD C C P + + Y + + GT G +
Sbjct: 137 NDLGLLFDPSKSSTFSPLCKTPCDFEGCR--CDPIP----FTVTYADNSTASGTFGRDTV 190
Query: 189 NFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIG 248
FET+DEG + + DV FGC HN H +D G+ GL + SLV K+G KFSYCIG
Sbjct: 191 VFETTDEGTSRISDVLFGCGHNIGHDTDPGHNGILGL---NNGPDSLVTKLGQKFSYCIG 247
Query: 249 NLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
NL Y Y+ LILGEGA LEG STP V +G YYVT+EGIS+GEK LDI P F+ +
Sbjct: 248 NLADPYYNYHQLILGEGADLEGYSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKEN 307
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF-----QGLLPSYPMDPAWHLCYSGNIN 363
+ GV ID+G+T+T+LV S ++ L KEV +L Q + P W C+ G+I+
Sbjct: 308 RA-GGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSP----WMQCFYGSIS 362
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
RDL GFP + FHF+ GADL LD+ S F Q + +VFC+ VGP + K S+IG++AQ
Sbjct: 363 RDLVGFPVVTFHFSDGADLALDSGSFFNQLNDNVFCMTVGPVSSLNIKSKP-SLIGLLAQ 421
Query: 424 QNYNVAYDLVSKQLYFQRIDCELL 447
Q+YNV YDLV++ +YFQRIDCELL
Sbjct: 422 QSYNVGYDLVNQFVYFQRIDCELL 445
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 331 bits (848), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 193/439 (43%), Positives = 266/439 (60%), Gaps = 28/439 (6%)
Query: 25 STTAAPAAGKPKRLVTKLLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQK---S 78
S+T+ ++ KP+RLV+KL+H S+ Y PN+T + + + S ARF Y+ + S
Sbjct: 22 SSTSTISSVKPQRLVSKLIHPGSVHHPHYKPNETAKDRMELDIQHSAARFAYIQARIEGS 81
Query: 79 SQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG---A 135
++ +A + P ++ + N SIGQPP+PQL V+DTGS ++WV C PC C
Sbjct: 82 LVSNNEYKARVSPSLTGRTIM-ANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLG 140
Query: 136 TTFDPSKSLTYATL---PCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFET 192
FDPS S T++ L PCD C+ C P + + Y + + G G + FET
Sbjct: 141 LLFDPSMSSTFSPLCKTPCDFKGCSR-CDPIP----FTVTYADNSTASGMFGRDTVVFET 195
Query: 193 SDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNY 252
+DEG + + DV FGC HN +D G+ GL + SL K+G KFSYCIG+L
Sbjct: 196 TDEGTSRIPDVLFGCGHNIGQDTDPGHNGILGL---NNGPDSLATKIGQKFSYCIGDLAD 252
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLF--KKNDTWS 310
Y Y+ LILGEGA LEG STP V +G YYVT+EGIS+GEK LDI P F KKN T
Sbjct: 253 PYYNYHQLILGEGADLEGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRT-- 310
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA-WHLCYSGNINRDLQGF 369
GV ID+G+T+T+LV S ++ L KEV +L ++ + W C+ G+I+RDL GF
Sbjct: 311 -GGVIIDTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGF 369
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P + FHFA GADL LD+ S F Q + +VFC+ VGP + K S+IG++AQQ+Y+V
Sbjct: 370 PVVTFHFADGADLALDSGSFFNQLNDNVFCMTVGPVSSLNLKSKP-SLIGLLAQQSYSVG 428
Query: 430 YDLVSKQLYFQRIDCELLA 448
YDLV++ +YFQRIDCELL+
Sbjct: 429 YDLVNQFVYFQRIDCELLS 447
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 330 bits (845), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 200/453 (44%), Positives = 266/453 (58%), Gaps = 37/453 (8%)
Query: 8 LLLSLITLPFTSTRIFT-STTAAPAAGKPKRLVTKLLHRDSLL--YNPNDTV-DAQAQRT 63
L+ +L++LPF IF S T A LV KL+H +S L YN DT+ D + +
Sbjct: 16 LVYTLVSLPF----IFHFSLTTATITTSTINLVIKLIHHESSLSPYNSKDTIWDHYSHKI 71
Query: 64 LNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLI 123
L + + +D ++L P V VF +NFSIG+PP+PQLAV+DTGSSL
Sbjct: 72 LKQTFS-------------NDYISNLVPSPRYV-VFLMNFSIGEPPIPQLAVMDTGSSLT 117
Query: 124 WVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQ 180
WV C PC C + FDPSKS TY+ L C S C N C EC Y++ Y SQ
Sbjct: 118 WVMCHPCSSCSQQSVPIFDPSKSSTYSNLSC--SEC-NKCDVVNGECPYSVEYVGSGSSQ 174
Query: 181 GTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD----EQFTGVFGLGPATSSTHSLV 236
G EQ ET DE + + FGC + S+ + GVFGLG S SL+
Sbjct: 175 GIYAREQLTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLG---SGRFSLL 231
Query: 237 EKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKML 296
G KFSYCIGNL Y +N L+LG+ A ++GDST ++VI+G YYV LE IS+G + L
Sbjct: 232 PSFGKKFSYCIGNLRNTNYKFNRLVLGDKANMQGDSTTLNVINGLYYVNLEAISIGGRKL 291
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP--AW 354
DIDP LF+++ T +++GV IDSG TWL ++ L EVE+L +G+L D +
Sbjct: 292 DIDPTLFERSITDNNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPY 351
Query: 355 HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKD 414
LCYSG +++DL GFP + FHFA GA L LD S+F Q + + FC+A+ P + G+ ++
Sbjct: 352 TLCYSGVVSQDLSGFPLVTFHFAEGAVLDLDVTSMFIQTTENEFCMAMLPGNYFGDDYES 411
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
S IGM+AQQNYNV YDL ++YFQRIDCELL
Sbjct: 412 FSSIGMLAQQNYNVGYDLNRMRVYFQRIDCELL 444
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 325 bits (832), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 195/445 (43%), Positives = 260/445 (58%), Gaps = 31/445 (6%)
Query: 19 STRIFTSTTAAPAAGKPKRLVTKLLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLS 75
ST F+ST+ +A KP+RLV+KL+H S+ Y PN+T + + + S AR Y+
Sbjct: 17 STCCFSSTSTVSSA-KPRRLVSKLIHPGSVHHPHYKPNETAKDRMELDIEHSAARLAYIQ 75
Query: 76 QK---SSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQ 132
+ S +D A + P ++ + VN SIGQP +PQL V+DTGS ++W+ C PC
Sbjct: 76 ARIEGSLVYNNDYTASVSPSLTGRTIL-VNLSIGQPSIPQLVVMDTGSDILWIMCNPCTN 134
Query: 133 CG---ATTFDPSKSLTYATLPCDSSYCTNDCGGYPDEC---WYNIRYTNGPDSQGTIGSE 186
C FDPS S T++ L C CG +C + I Y + + GT G +
Sbjct: 135 CDNHLGLLFDPSMSSTFSPL------CKTPCGFKGCKCDPIPFTISYVDNSSASGTFGRD 188
Query: 187 QFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYC 246
FET+DEG + + DV GC HN SD + G+ GL + +SL ++G KFSYC
Sbjct: 189 ILVFETTDEGTSQISDVIIGCGHNIGFNSDPGYNGILGL---NNGPNSLATQIGRKFSYC 245
Query: 247 IGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLF--K 304
IGNL Y YN L LGEGA LEG STP V G YYVT+EGIS+GEK LDI F K
Sbjct: 246 IGNLADPYYNYNQLRLGEGADLEGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMK 305
Query: 305 KNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA-WHLCYSGNIN 363
+N T GV +DSGTT+T+LV SA++ L EV +L + + A W LCY G I+
Sbjct: 306 RNGT---GGVILDSGTTITYLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIIS 362
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
RDL GFP + FHF GADL LD S F+ + +FC+ V P+ I S+IG++AQ
Sbjct: 363 RDLVGFPVVTFHFVDGADLALDTGS-FFSQRDDIFCMTVSPASILNTTISP-SVIGLLAQ 420
Query: 424 QNYNVAYDLVSKQLYFQRIDCELLA 448
Q+YNV YDLV++ +YFQRIDCELL+
Sbjct: 421 QSYNVGYDLVNQFVYFQRIDCELLS 445
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 173/436 (39%), Positives = 255/436 (58%), Gaps = 35/436 (8%)
Query: 38 LVTKLLHRDSLL-YNPNDTV----DAQAQRTLNMSMARFIYLSQKSSQK--AHDTRAHLH 90
+ KL+ R+S++ +NP+ V + Q ++S ARF YL ++ + D + +H
Sbjct: 1 MAMKLIRRESVVRHNPDARVPVTPEDHIQHMTDISSARFKYLQNSIVKELGSSDFQVDVH 60
Query: 91 PGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT-----TFDPSKSLT 145
I T +F+VNFS+GQPPVPQ ++DTGSSL+W++C PC+ C + F+P+ S T
Sbjct: 61 QAIKT-SLFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSST 119
Query: 146 YATLPCDSSYCTNDCGGY--PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
+ CD +C G+ ++C Y Y +G S+G + E+ F T + +
Sbjct: 120 FVECSCDDRFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPI 179
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC H N + +FTG+ GLG + SL ++GSKFSYCIG+L Y YN L+LG
Sbjct: 180 AFGCGHENGEQLESEFTGILGLGAKPT---SLAVQLGSKFSYCIGDLANKNYGYNQLVLG 236
Query: 264 EGAILEGDSTPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
E A + GD TP+ +G YY+ LEGIS+G+K L+I+P +FK+ S GV +D+GT
Sbjct: 237 EDADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRG--SRTGVILDTGTL 294
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW---HLCYSGNINRDLQGFPAMAFHFAG 378
TWL AY+ L E++ + L + W LCY G +N +L GFP + FHFAG
Sbjct: 295 YTWLADIAYRELYNEIKSILDPKLERF-----WFRDFLCYHGRVNEELIGFPVVTFHFAG 349
Query: 379 GADLVLDAESVFYQESSS-----VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
GA+L ++A S+FY + S VFC++V P+ +G +KD + IG++AQQ YN+AYDL
Sbjct: 350 GAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLK 409
Query: 434 SKQLYFQRIDCELLAD 449
+ +Y QRIDC LL D
Sbjct: 410 ERNIYLQRIDCVLLDD 425
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 314 bits (805), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 192/456 (42%), Positives = 265/456 (58%), Gaps = 31/456 (6%)
Query: 15 LPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLL---YNPNDTVDAQAQRTLNMSMARF 71
L FT T + + T KP + TKL+HRDS+ YNPND++ +A+R L S ARF
Sbjct: 14 LTFTITLLSLALTTNTKPNKP--VTTKLIHRDSIFSPAYNPNDSIKDRAKRMLKNSNARF 71
Query: 72 IYLSQKSSQKAH-------DTRA----HLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGS 120
Y+ S + + DT A + +S + F VNFSIGQPPVPQ AV+DTGS
Sbjct: 72 DYVQAISKRNSAVVDYDGGDTSAADDAYEASLLSELCTFLVNFSIGQPPVPQYAVMDTGS 131
Query: 121 SLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGP 177
SL W++C+PC C ++PS S TY + T + +C Y+ Y +
Sbjct: 132 SLTWIQCEPCINCHQQKGPLYNPSSSSTYVSCSDFDRTDTTFTATHGSDCNYSQTYADKT 191
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQ--FTGVFGLGPATSSTHSL 235
++GT EQ FET D+G T ++DV FGC HNN +GVFGLG + S S+
Sbjct: 192 TTRGTYAREQLLFETPDDGITIMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGS---SI 248
Query: 236 VEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKM 295
+ K+G FSYCIGN+ Y ++ L LG +EG STP+ V G YY+TL GIS+G++
Sbjct: 249 ISKLGFGFSYCIGNIGDPLYGFHRLTLGNKLKIEGYSTPL-VPRGLYYITLVGISIGQER 307
Query: 296 LDIDPNLFKKND-TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP-MDPA 353
LDIDP +F++ D + + IDSG TL+++ AY +R +V + G L Y +
Sbjct: 308 LDIDPIVFQRVDLNGISSRIVIDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARH 367
Query: 354 WHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
LCY G +N+DLQGFP FH A GADLV E +F+Q + +V CLA+ P++ + E
Sbjct: 368 LSLCYIGKLNQDLQGFPDATFHLADGADLVFQVEGLFFQYTDNVLCLALVPTESDEET-- 425
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLAD 449
+IG++AQQ YNVAYDL ++LYFQRI+CELL D
Sbjct: 426 --CLIGLLAQQYYNVAYDLKQQKLYFQRIECELLDD 459
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 311 bits (796), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 172/440 (39%), Positives = 252/440 (57%), Gaps = 35/440 (7%)
Query: 34 KPKRLVTKLLHRDSLL-YNPNDTV----DAQAQRTLNMSMARFIYLSQKSSQK--AHDTR 86
KP R+ KL+HR+S+ NPN V + + ++S ARF YL ++ + + +
Sbjct: 25 KPNRMAMKLIHRESVARLNPNARVPITPEDHIKHLTDISSARFKYLQNSIDKELGSSNFQ 84
Query: 87 AHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFDPS 141
+ I T +F VNFS+GQPPVPQL ++DTGSSL+W++CQPC+ C + F+P+
Sbjct: 85 VDVEQAIKT-SLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPA 143
Query: 142 KSLTYATLPCDSSYCT---NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
S T+ CD +C N G ++C Y Y +G S+G + E+ F T +
Sbjct: 144 LSSTFVECSCDDRFCRYAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTV 203
Query: 199 FLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYN 258
+ FGC + N + FTG+ GLG + SL ++GSKFSYCIG+L Y YN
Sbjct: 204 VTQPIAFGCGYENGEQLESHFTGILGLGAKPT---SLAVQLGSKFSYCIGDLANKNYGYN 260
Query: 259 MLILGEGAILEGDSTPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
L+LGE A + GD TP+ + YY+ LEGIS+G+ L+I+P +FK+ + GV +
Sbjct: 261 QLVLGEDADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRT--GVIL 318
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW---HLCYSGNINRDLQGFPAMA 373
DSGT TWL AY+ L E++ + L + W LCY G ++ +L GFP +
Sbjct: 319 DSGTLYTWLADIAYRELYNEIKSILDPKLERF-----WFRDFLCYHGRVSEELIGFPVVT 373
Query: 374 FHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
FHFAGGA+L ++A S+FY S +VFC++V P+ +G +K+ + IG++AQQ YN+
Sbjct: 374 FHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIG 433
Query: 430 YDLVSKQLYFQRIDCELLAD 449
YDL K +Y QRIDC L D
Sbjct: 434 YDLKEKNIYLQRIDCVQLDD 453
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 162/373 (43%), Positives = 224/373 (60%), Gaps = 23/373 (6%)
Query: 84 DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDP 140
D +H+ P I F N SIG PPVPQL ++DTGS L W++C PC +C T F P
Sbjct: 74 DIVSHVTP-IPNPAAFLANISIGDPPVPQLLLIDTGSDLTWIQCLPC-KCYPQTIPFFHP 131
Query: 141 SKSLTYATLPCDSSYCTNDCGGYPDE----CWYNIRYTNGPDSQGTIGSEQFNFETSDEG 196
S+S TY C+S+ + DE C Y++RY + +++G + E+ F+TSDEG
Sbjct: 132 SRSSTYRNASCESAPHAMP-QIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEG 190
Query: 197 KTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYA 256
++ FGC +N+ F+ Q++GV GLGP T S + GSKFSYC G+L Y
Sbjct: 191 LISKPNIVFGCGQDNSGFT--QYSGVLGLGPGTFSI--VTRNFGSKFSYCFGSLIDPTYP 246
Query: 257 YNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
+N LILG GA +EGD TP+ + YY+ L+ ISLGEK+LDI+P +F++ S G I
Sbjct: 247 HNFLILGNGARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYR--SKGGTVI 304
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP-MDPAWHLCYSGNINRDLQGFPAMAFH 375
D+G + T L AY+TL +E++ L +L + + CY GN+ DL GFP + FH
Sbjct: 305 DTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFH 364
Query: 376 FAGGADLVLDAESVFY-QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
FAGGA+L LD ES+F ES FCLA + F D+S+IG +AQQNYNV Y+L +
Sbjct: 365 FAGGAELALDVESLFVSSESGDSFCLA-----MTMNTFDDMSVIGAMAQQNYNVGYNLRT 419
Query: 435 KQLYFQRIDCELL 447
++YFQR DCE+L
Sbjct: 420 MKVYFQRTDCEIL 432
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 166/399 (41%), Positives = 230/399 (57%), Gaps = 27/399 (6%)
Query: 62 RTLNMSMARFIYLSQKSSQKAHD----TRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLD 117
+T S + YL KS+ + T +H+ P I F N SIG PPVPQL ++D
Sbjct: 38 KTQESSKIKIGYLHSKSTPASRLDNLWTVSHVTP-IPNPAAFLANISIGNPPVPQLLLID 96
Query: 118 TGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTNDCGGYPDE----CWYN 170
TGS L W+ C PC +C T F PS+S TY C S+ + DE C Y+
Sbjct: 97 TGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSAPHAMP-QIFRDEKTGNCQYH 154
Query: 171 IRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATS 230
+RY + +++G + E+ FETSD+G ++ FGC +N+ F+ +++GV GLGP T
Sbjct: 155 LRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDNSGFT--KYSGVLGLGPGTF 212
Query: 231 STHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGIS 290
S + GSKFSYC G+L Y +N+LILG GA +EGD TP+ + YY+ L+ IS
Sbjct: 213 SI--VTRNFGSKFSYCFGSLTNPTYPHNILILGNGAKIEGDPTPLQIFQDRYYLDLQAIS 270
Query: 291 LGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP- 349
GEK+LDI+P F++ S G ID+G + T L AY+TL +E++ L +L
Sbjct: 271 FGEKLLDIEPGTFQRYR--SQGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKD 328
Query: 350 MDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY-QESSSVFCLAVGPSDIN 408
D CY GN+ DL GFP + FHFAGGA+L LD ES+F ES FCLA +
Sbjct: 329 WDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLFVSSESGDSFCLA-----MT 383
Query: 409 GERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
F D+S+IG +AQQNYNV Y+L + ++YFQR DCE++
Sbjct: 384 MNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDCEII 422
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 170/429 (39%), Positives = 245/429 (57%), Gaps = 38/429 (8%)
Query: 38 LVTKLLHRDSL--LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIST 95
LV L+H + + L +P Q S+ R YL K++ D AHL P +
Sbjct: 30 LVLNLVHSNQIYSLQSP------QVSHIKEASVERLEYLKAKATG---DIIAHLSPNVPI 80
Query: 96 VP-VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC 151
+P F VN SIG PPV QL +DT S L+W++C+PC C A + FDPS+S T+ C
Sbjct: 81 IPQAFLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESC 140
Query: 152 DSSYCTNDC---GGYPDECWYNIRYTNGPDSQGTIGSEQFNFET--SDEGKTFLYDVGFG 206
+S + C Y++RY +G S+G + E F T + L+DV FG
Sbjct: 141 RTSQYSMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFG 200
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE-G 265
C H+N + TG+ GLG SLV + G+KFSYC G+L+ Y +N+L+LG+ G
Sbjct: 201 CGHDN-YGEPLVGTGILGLG---YGEFSLVHRFGTKFSYCFGSLDDPSYPHNVLVLGDDG 256
Query: 266 AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
A + GD+TP+ + +G YYVT+E IS+ +L IDP +F +N G ID+G +LT L
Sbjct: 257 ANILGDTTPLEIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSLTSL 316
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPM--DPAWHL-CYSGNINRDL--QGFPAMAFHFAGGA 380
V AY+ L+ ++ED F+G + + D + + CY+GN+ RDL GFP + FHF+ GA
Sbjct: 317 VEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHFSDGA 376
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+L LD +SVF + S +VFCLAV P ++N IG AQQ+YN+ YDL +K++ F+
Sbjct: 377 ELSLDVKSVFMKLSPNVFCLAVTPGNMNS--------IGATAQQSYNIGYDLEAKKISFE 428
Query: 441 RIDCELLAD 449
RIDC +L D
Sbjct: 429 RIDCGVLFD 437
>gi|356558304|ref|XP_003547447.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 336
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 160/353 (45%), Positives = 208/353 (58%), Gaps = 37/353 (10%)
Query: 103 FSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATL---PCDSSYCTND 159
SIGQPP+PQL ++DT S ++W+ C FDPSKS T++ L PC C
Sbjct: 13 LSIGQPPIPQLVIMDTSSDILWIMCNHV----GLLFDPSKSSTFSPLCKTPCGFKGC--K 66
Query: 160 CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQF 219
C P +NI Y + + GT GS+ FET+DEG + ++DV C HN +D +
Sbjct: 67 CDPIP----FNISYVDKSSTSGTFGSDTVVFETTDEGHSQIFDVLVRCGHNIGFNTDPGY 122
Query: 220 TGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVID 279
G+ GL + +SL K+G KFSYC+GNL Y YN LIL EGA LEG STP V
Sbjct: 123 NGIRGL---NNGPNSLATKIGQKFSYCVGNLADPYYNYNQLILCEGADLEGYSTPFEVHH 179
Query: 280 GSYYVTLEGISLGEKMLDIDPNLF--KKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEV 337
G YYVTL+GI +GEK LDI P F K N+T GV DSGTT+T+LV S ++ L EV
Sbjct: 180 GFYYVTLKGIIVGEKRLDIAPITFEIKGNNT---GGVIRDSGTTITYLVDSVHKLLYNEV 236
Query: 338 EDLFQGLLPSYPMDPAW---HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQES 394
+L +W LC+ G I+RDL GFP + FHFA GADL LD S F+ +
Sbjct: 237 RNLL-----------SWSFRQLCHYGIISRDLVGFPVVTFHFADGADLALDTGS-FFNQL 284
Query: 395 SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+S+ C+ V P+ I S+I ++AQQ+YNV YDL++ +YFQRIDCELL
Sbjct: 285 NSILCMTVSPASILNTTISP-SVIELLAQQSYNVGYDLLTNFVYFQRIDCELL 336
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 153/388 (39%), Positives = 220/388 (56%), Gaps = 30/388 (7%)
Query: 67 SMARFIYLSQKSSQKAHDTRAHLHPGISTVP-VFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ R YL K++ D AHL P + +P F VN SIG PP+ QL +DT S L+W+
Sbjct: 55 SVERLEYLKAKTTG---DIIAHLSPNVPIIPQAFLVNISIGSPPITQLLHMDTASDLLWI 111
Query: 126 KCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTNDCGGY---PDECWYNIRYTNGPDS 179
+C PC C A + FDPS+S T+ C +S + + C Y++RY + S
Sbjct: 112 QCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQYSMPSLKFNANTRSCEYSMRYVDDTGS 171
Query: 180 QGTIGSEQFNFET--SDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
+G + E F T + L+DV FGC H+N + TG+ GLG SLV
Sbjct: 172 KGILAREMLLFNTIYDESSSAALHDVVFGCGHDN-YGEPLVGTGILGLG---YGEFSLVH 227
Query: 238 KVGSKFSYCIGNLNYFEYAYNMLILGE-GAILEGDSTPMSVIDGSYYVTLEGISLGEKML 296
+ G KFSYC G+L+ Y +N+L+LG+ GA + GD+TP+ + +G YYVT+E IS+ +L
Sbjct: 228 RFGKKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIHNGFYYVTIEAISVDGIIL 287
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM---DPA 353
IDP +F +N G ID+G +LT LV AY+ L+ +ED+F+G + + D
Sbjct: 288 PIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMI 347
Query: 354 WHLCYSGNINRDL--QGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGER 411
CY+GN RDL GFP + FHF+ GA+L LD +S+F + S +VFCLAV P ++N
Sbjct: 348 KMECYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPNVFCLAVTPGNLNS-- 405
Query: 412 FKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
IG AQQ+YN+ YDL + ++ F
Sbjct: 406 ------IGATAQQSYNIGYDLEAMEVSF 427
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 131/392 (33%), Positives = 190/392 (48%), Gaps = 45/392 (11%)
Query: 76 QKSSQKAHDTRAHLH-PGISTVPVF------YVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
++ S++ A L+ P PV+ +N SIG P P A++DTGS LIW +CQ
Sbjct: 65 ERGSRRLQRLEAMLNGPSGVETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ 124
Query: 129 PCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGPDSQG 181
PC QC F+P S +++TLPC S C + C + C Y Y +G ++QG
Sbjct: 125 PCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCSN--NSCQYTYGYGDGSETQG 182
Query: 182 TIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS 241
++G+E F G + ++ FGC NN F G+ G+G S S ++ +
Sbjct: 183 SMGTETLTF-----GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV--T 235
Query: 242 KFSYC---IGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS------YYVTLEGISLG 292
KFSYC IG+ N + L+LG A +P + + S YY+TL G+S+G
Sbjct: 236 KFSYCMTPIGSSNS-----STLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVG 290
Query: 293 EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP 352
L IDP++FK N G+ IDSGTTLT+ V +AYQ +R+ L
Sbjct: 291 STPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQMN-LSVVNGSSS 349
Query: 353 AWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERF 412
+ LC+ ++ P HF GG DLVL +E+ F S+ + CLA+G S
Sbjct: 350 GFDLCFQMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPSNGLICLAMGSSS------ 402
Query: 413 KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ +SI G I QQN V YD + + F C
Sbjct: 403 QGMSIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 128/389 (32%), Positives = 189/389 (48%), Gaps = 39/389 (10%)
Query: 76 QKSSQKAHDTRAHLH-PGISTVPVF------YVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
++ S++ A L+ P PV+ +N SIG P P A++DTGS LIW +CQ
Sbjct: 65 ERGSRRLQRLEAMLNGPSGVETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ 124
Query: 129 PCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGPDSQG 181
PC QC F+P S +++TLPC S C + C + C Y Y +G ++QG
Sbjct: 125 PCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCSN--NSCQYTYGYGDGSETQG 182
Query: 182 TIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS 241
++G+E F G + ++ FGC NN F G+ G+G S S ++ +
Sbjct: 183 SMGTETLTF-----GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV--T 235
Query: 242 KFSYCIGNLNYFEYAYNMLILGEGA-ILEGDSTPMSVIDGS-----YYVTLEGISLGEKM 295
KFSYC+ + + L+LG A + S ++I+ S YY+TL G+S+G
Sbjct: 236 KFSYCMTPIG--SSTSSTLLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTP 293
Query: 296 LDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH 355
L IDP++FK N G+ IDSGTTLT+ +AYQ +R+ L +
Sbjct: 294 LPIDPSVFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQMN-LSVVNGSSSGFD 352
Query: 356 LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDL 415
LC+ ++ P HF GG DLVL +E+ F S+ + CLA+G S + +
Sbjct: 353 LCFQMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPSNGLICLAMGSSS------QGM 405
Query: 416 SIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
SI G I QQN V YD + + F C
Sbjct: 406 SIFGNIQQQNLLVVYDTGNSVVSFLFAQC 434
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 140/431 (32%), Positives = 213/431 (49%), Gaps = 54/431 (12%)
Query: 42 LLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP- 97
L+HR+S L YNP+ T + + T+ S AR ++ +D R+ PG T+P
Sbjct: 33 LIHRESPLSPFYNPSLTPSERIKNTVLRSFAR---SKRRLRLSQNDDRS---PGTITIPD 86
Query: 98 ----VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLP 150
+ + F IG PPV + A+ DTGS LIWV+C PCE+C A FDP KS T+ T+P
Sbjct: 87 EPITEYLMRFYIGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVP 146
Query: 151 CDSSYCT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
CDS CT C G +C+Y Y + G +G E NF + + F +
Sbjct: 147 CDSQPCTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFP-KLT 205
Query: 205 FGCSHNNAHFSDE--QFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLI 261
FGC+ +N DE + G+ GLG S S L ++G KFSYC L+ + + +
Sbjct: 206 FGCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLS--SNSTSKMR 263
Query: 262 LGEGAILEGD----STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
G AI++ STP+ S+ YY+ LEG+S+G K + K +++ +D +
Sbjct: 264 FGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKV-------KTSESQTDGNI 316
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP-AWHLCYSGNINRDLQGFPAMA 373
IDSGT+ T L S Y V++++ + + + P ++ C+ R + FP +
Sbjct: 317 LIDSGTSFTILKQSFYNKFVALVKEVYG--VEAVKIPPLVYNFCFENKGKR--KRFPDVV 372
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
F F GA + +DA ++F E +++ C+ P+ +D SI G AQ Y V YDL
Sbjct: 373 FLFT-GAKVRVDASNLFEAEDNNLLCMVALPTSD-----EDDSIFGNHAQIGYQVEYDLQ 426
Query: 434 SKQLYFQRIDC 444
+ F DC
Sbjct: 427 GGMVSFAPADC 437
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 152/469 (32%), Positives = 213/469 (45%), Gaps = 44/469 (9%)
Query: 3 SSHAILLLSL--ITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQA 60
SSH +L+LS+ + L F + AA K T L+H S +P V A++
Sbjct: 6 SSHELLVLSMASVNLLLIICFTFIFSPCISAASDSKGFSTNLIHIHSP-SSPYKNVKAES 64
Query: 61 ---QRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLD 117
L +++R YL + + P I F N SIG PP VLD
Sbjct: 65 LAKDTALESTLSRHAYLRARQQKALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLD 124
Query: 118 TGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYC--------TNDCGGYPDE 166
TGS L W++C+PC+ C ++ +KS +Y + C+ C +D G
Sbjct: 125 TGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPPCLSLGREGQCSDSG----S 180
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFET--SDEGKTFLYDVGFGCSHNNAHF----SDEQFT 220
C Y Y +G + G + E+ F + SDE KT VGFGC N +F D
Sbjct: 181 CLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKT--AQVGFGCGLQNLNFVTSSRDGGVL 238
Query: 221 GVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDG 280
G+ + S S + KV F+YC GNL+ A L+ G+ L GD TPM VI
Sbjct: 239 GLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSN-PNAGGFLVFGDATYLNGDMTPM-VIAE 296
Query: 281 SYYVTLEGISLG--EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE 338
YYV L GI LG E LDI+ + F++ S GV IDSG+TL+ P Y+ +R V
Sbjct: 297 FYYVNLLGIGLGVEEPRLDINSSSFERKPDGS-GGVIIDSGSTLSIFPPEVYEVVRNAVV 355
Query: 339 DLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVF 398
D + P+ + C+ G I RDL FP + + ++ D S+F Q +F
Sbjct: 356 DKLKKGYNISPLTSSPD-CFEGKIGRDLPLFPTLVLYLE-STGILNDRWSIFLQRYDELF 413
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ-RIDCEL 446
CL +GE LSIIG +AQQ+Y Y+L L + DC L
Sbjct: 414 CLGF----TSGE---GLSIIGTLAQQSYKFGYNLELSTLSIESNPDCGL 455
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 122/359 (33%), Positives = 177/359 (49%), Gaps = 32/359 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ +N SIG P P A++DTGS LIW +CQPC QC F+P S +++TLPC S
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C + C Y Y +G ++QG++G+E F G + ++ FGC NN
Sbjct: 155 CQALSSPTCSN--NFCQYTYGYGDGSETQGSMGTETLTF-----GSVSIPNITFGCGENN 207
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
F G+ G+G S S ++ +KFSYC+ + N+L+ +
Sbjct: 208 QGFGQGNGAGLVGMGRGPLSLPSQLDV--TKFSYCMTPIGS-STPSNLLLGSLANSVTAG 264
Query: 272 STPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
S ++I S YY+TL G+S+G L IDP+ F N G+ IDSGTTLT+ V
Sbjct: 265 SPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFV 324
Query: 327 PSAYQTLRKEVEDLFQGLLPSY-PMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
+AYQ++R+E + Q LP + LC+ + P HF GG DL L
Sbjct: 325 NNAYQSVRQEF--ISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELP 381
Query: 386 AESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+E+ F S+ + CLA+G S + +SI G I QQN V YD + + F C
Sbjct: 382 SENYFISPSNGLICLAMGSSS------QGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 137/430 (31%), Positives = 207/430 (48%), Gaps = 43/430 (10%)
Query: 36 KRLVTKLLHRD---SLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPG 92
K L +++HRD S LY+P T +A ++ S+ R Y +++ S + + L P
Sbjct: 26 KGLSIEMIHRDFSKSPLYHPTVTKFQRAYNVVHRSINRVNYFTKEFSLNKNQPVSTLTPE 85
Query: 93 ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATL 149
+ + +++S+G PP +DTGS+++W++CQPC C T F+PSKS +Y +
Sbjct: 86 LGE---YLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNI 142
Query: 150 PCDSSYC--TND----CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
PC SS C TND C D C Y+I Y SQG + ++ +++ ++
Sbjct: 143 PCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNI 202
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLV--EKVGSKFSYCIGNLNYFEYAYNMLI 261
GC H N + Q +GV G+G S V VGSKFSYC+ N + + LI
Sbjct: 203 VIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKLI 262
Query: 262 LGEGAILEGD---STPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
GE ++ G+ STPM ++G Y++TLE S+G ++ + + S +
Sbjct: 263 FGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIE-----YGERSNASTQNIL 317
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGL-LPSY-PMDPAWHLCYSGNINRDLQGFPAMA 373
IDSGT LT L P+ + L K V + Q + LP P D LCY N P +
Sbjct: 318 IDSGTPLTML-PNLF--LSKLVSYVAQEVKLPRIEPPDHHLSLCY--NTTGKQLNVPDIT 372
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
HF GAD+ L++ F+ + C S NG L I G IAQ N + YDL
Sbjct: 373 AHF-NGADVKLNSNGTFFPFEDGIMCFGFISS--NG-----LEIFGNIAQNNLLIDYDLE 424
Query: 434 SKQLYFQRID 443
+ + F+ D
Sbjct: 425 KEIISFKPTD 434
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 123/374 (32%), Positives = 187/374 (50%), Gaps = 42/374 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG---ATTFDPSKSLTYATLPCDSSY 155
+ ++ +IG PP+ A++DTGS LIW +C PC C F P++S TY +PC S
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPL 151
Query: 156 CTNDCGGYP-----DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH- 209
C YP C Y Y + + G + SE F F ++ K + DV FGC +
Sbjct: 152 CAAL--PYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAIL 268
N+ ++ +G+ GLG SLV ++G S+FSYC+ ++ + L G A L
Sbjct: 210 NSGQLANS--SGMVGLG---RGPLSLVSQLGPSRFSYCL--TSFLSPEPSRLNFGVFATL 262
Query: 269 EG----------DSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
G STP+ V + Y+++L+GISLG+K L IDP +F ND + GVF
Sbjct: 263 NGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGT-GGVF 321
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ-GFPAMAF 374
IDSGT+LTWL AY +R+E+ + + L P+ + C+ + P M
Sbjct: 322 IDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDMEL 381
Query: 375 HFAGGADLVLDAESVFYQESSSVF-CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
HF GGA++ + E+ + ++ F CLA+ R D +IIG QQN ++ YD+
Sbjct: 382 HFDGGANMTVPPENYMLIDGATGFLCLAM-------IRSGDATIIGNYQQQNMHILYDIA 434
Query: 434 SKQLYFQRIDCELL 447
+ L F C ++
Sbjct: 435 NSLLSFVPAPCNIV 448
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 146/466 (31%), Positives = 216/466 (46%), Gaps = 50/466 (10%)
Query: 7 ILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNM 66
I+L+SL+ + + + ++ PAAG L L H D+ + +A R +
Sbjct: 28 IVLVSLLLVSMA--IVLAAASSHPAAGLLDGLRVPLTHVDAHGNYTKLQLLRRAARRSHH 85
Query: 67 SMARFIYLSQKSSQKAH---DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLI 123
M+R + + S KA D + +H G F ++ SIG P + A++DTGS L+
Sbjct: 86 RMSRLVARTATGSVKAAAAPDLQVPVHAGNGE---FLMDMSIGTPALAYAAIVDTGSDLV 142
Query: 124 WVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNG 176
W +C+PC +C FDPS S TY+TLPC SS C T+ C +C Y Y +
Sbjct: 143 WTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDA 202
Query: 177 PDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLV 236
+QG + +E F KT L V FGC N Q G+ GLG SLV
Sbjct: 203 SSTQGVLAAETFTL-----AKTKLPGVAFGCGDTNEGDGFTQGAGLVGLG---RGPLSLV 254
Query: 237 EKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS-----------YYV 284
++G KFSYC+ +L+ + + + L+LG A + D+ + I + YYV
Sbjct: 255 SQLGLGKFSYCLTSLD--DTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYV 312
Query: 285 TLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL 344
TL+ +++G + + + F D + GV +DSGT++T+L Y+ L+K Q
Sbjct: 313 TLKALTVGSTRIPLPGSAFAVQDDGT-GGVIVDSGTSITYLELQGYRPLKKAFAA--QMK 369
Query: 345 LPSYPMDP-AWHLCYSGNINR-DLQGFPAMAFHFAGGADLVLDAESVFYQESSS-VFCLA 401
LP LC+ + D P + HF GGADL L AE+ +S+S CL
Sbjct: 370 LPVADGSAVGLDLCFKAPASGVDDVEVPKLVLHFDGGADLDLPAENYMVLDSASGALCLT 429
Query: 402 VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
V S + LSIIG QQN YD+ L F + C L
Sbjct: 430 VMGS-------RGLSIIGNFQQQNIQFVYDVDKDTLSFAPVQCAKL 468
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 123/374 (32%), Positives = 186/374 (49%), Gaps = 42/374 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG---ATTFDPSKSLTYATLPCDSSY 155
+ ++ +IG PP+ A++DTGS LIW +C PC C F P++S TY +PC S
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPL 151
Query: 156 CTNDCGGYP-----DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH- 209
C YP C Y Y + + G + SE F F ++ K + DV FGC +
Sbjct: 152 CAAL--PYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAIL 268
N+ ++ +G+ GLG SLV ++G S+FSYC+ ++ + L G A L
Sbjct: 210 NSGQLANS--SGMVGLG---RGPLSLVSQLGPSRFSYCL--TSFLSPEPSRLNFGVFATL 262
Query: 269 EG----------DSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
G STP+ V + Y+++L+GISLG+K L IDP +F ND + GVF
Sbjct: 263 NGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGT-GGVF 321
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ-GFPAMAF 374
IDSGT+LTWL AY +R E+ + + L P+ + C+ + P M
Sbjct: 322 IDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDMEL 381
Query: 375 HFAGGADLVLDAESVFYQESSSVF-CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
HF GGA++ + E+ + ++ F CLA+ R D +IIG QQN ++ YD+
Sbjct: 382 HFDGGANMTVPPENYMLIDGATGFLCLAM-------IRSGDATIIGNYQQQNMHILYDIA 434
Query: 434 SKQLYFQRIDCELL 447
+ L F C ++
Sbjct: 435 NSLLSFVPAPCNIV 448
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 142/462 (30%), Positives = 222/462 (48%), Gaps = 47/462 (10%)
Query: 3 SSHAILLLSLITLPFTSTRIFT--STTAAPAAGKPKR--LVTKLLHRDSLLYNPNDTVDA 58
+SH I+++ L+ L +ST +F+ ++T+ +P++ L H DS N T
Sbjct: 5 ASHMIIVI-LLALAVSST-LFSPAASTSRSLDRRPEKNGFRVSLRHVDS---GGNYTKFE 59
Query: 59 QAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDT 118
+ QR + R LS K++ A +H G F +N +IG P A++DT
Sbjct: 60 RLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNGE---FLMNLAIGTPAETYSAIMDT 116
Query: 119 GSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT----NDCGGYPDECWYNI 171
GS LIW +C+PC+ C FDP KS +++ LPC S C + C D C Y
Sbjct: 117 GSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCS---DGCEYRY 173
Query: 172 RYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS 231
Y + +QG + +E F F G + +GFGC +N + Q G+ GLG
Sbjct: 174 SYGDHSSTQGVLATETFTF-----GDASVSKIGFGCGEDNRGRAYSQGAGLVGLG---RG 225
Query: 232 THSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS----YYVTL 286
SL+ ++G KFSYC+ +++ + +L+ E + TP+ + + S YY++L
Sbjct: 226 PLSLISQLGVPKFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPL-IQNPSRPSFYYLSL 284
Query: 287 EGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP 346
EGIS+G+ +L I+ + F D S G+ IDSGTT+T+L +A+ L+KE + L
Sbjct: 285 EGISVGDTLLPIEKSTFSIQDDGS-GGLIIDSGTTITYLKDNAFAALKKEFISQMK-LDV 342
Query: 347 SYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPS 405
LC++ + P + FHF G DL L E+ ++S+ V CL +G S
Sbjct: 343 DASGSTELELCFTLPPDGSPVEVPQLVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSS 401
Query: 406 DINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+SI G QQN V +DL + + F C L
Sbjct: 402 -------SGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQL 436
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 137/440 (31%), Positives = 210/440 (47%), Gaps = 49/440 (11%)
Query: 35 PKRLVTKLLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHP 91
PK L +L+HRDS L YNP +TV + S++R L+ SQ D ++ L
Sbjct: 23 PKNLSVELIHRDSPLSPLYNPKNTVTDRLNAAFLRSISRSRRLNNILSQT--DLQSGL-- 78
Query: 92 GISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYAT 148
I F+++ +IG PP+ A+ DTGS L WV+C+PC+QC FD KS TY +
Sbjct: 79 -IGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKS 137
Query: 149 LPCDSSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
PCDS C C + C Y Y + S+G + +E + +++
Sbjct: 138 EPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPG 197
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTH-SLVEKVGS----KFSYCIGNLNYFEYAY 257
FGC +NN DE +G+ H SL+ ++GS KFSYC+ + +
Sbjct: 198 TVFGCGYNNGGTFDETGSGII----GLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGT 253
Query: 258 NMLILGEGAI---LEGDSTPMS--VIDGS----YYVTLEGISLGEKMLDIDPNLFKKND- 307
+++ LG +I L DS +S ++D YY+TLE IS+G+K + + + ND
Sbjct: 254 SVINLGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDG 313
Query: 308 ---TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINR 364
+ + + IDSGTTLT L + VE+L G DP L +
Sbjct: 314 GIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTG--AKRVSDPQGLLSHCFKSGS 371
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
G P + HF GAD+ L + F + S + CL++ P+ +++I G AQ
Sbjct: 372 AEIGLPEITVHFT-GADVRLSPINAFVKVSEDMVCLSMVPT-------TEVAIYGNFAQM 423
Query: 425 NYNVAYDLVSKQLYFQRIDC 444
++ V YDL ++ + FQR+DC
Sbjct: 424 DFLVGYDLETRTVSFQRMDC 443
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 140/464 (30%), Positives = 220/464 (47%), Gaps = 45/464 (9%)
Query: 1 MPSSHAILLLSLITLPFTSTRIFT--STTAAPAAGKPKR--LVTKLLHRDSLLYNPNDTV 56
M SS + +++ ++ + S+ +F+ ++T +P++ L H DS N T
Sbjct: 1 MASSASHMIIVILLVLAVSSALFSPAASTWRSLDRRPEKNGFRVSLRHVDS---GGNYTK 57
Query: 57 DAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVL 116
+ QR + R LS K++ A +H G F +N +IG P A++
Sbjct: 58 FERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNGE---FLMNLAIGTPAETYSAIM 114
Query: 117 DTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT----NDCGGYPDECWY 169
DTGS LIW +C+PC+ C FDP KS +++ LPC S C + C D C Y
Sbjct: 115 DTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCS---DGCEY 171
Query: 170 NIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPAT 229
Y + +QG + +E F F G + +GFGC +N + Q G+ GLG
Sbjct: 172 RYSYGDHSSTQGVLATETFTF-----GDASVSKIGFGCGEDNRGRAYSQGAGLVGLG--- 223
Query: 230 SSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS----YYV 284
SL+ ++G KFSYC+ +++ + +L+ E + TP+ + + S YY+
Sbjct: 224 RGPLSLISQLGVPKFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPL-IQNPSRPSFYYL 282
Query: 285 TLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL 344
+LEGIS+G+ +L I+ + F D S G+ IDSGTT+T+L SA+ L+KE + L
Sbjct: 283 SLEGISVGDTLLPIEKSTFSIQDDGS-GGLIIDSGTTITYLKDSAFAALKKEFISQMK-L 340
Query: 345 LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVG 403
LC++ + P + FHF G DL L E+ ++S+ V CL +G
Sbjct: 341 DVDASGSTELELCFTLPPDGSPVDVPQLVFHFE-GVDLKLPKENYIIEDSALRVICLTMG 399
Query: 404 PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
S +SI G QQN V +DL + + F C L
Sbjct: 400 SS-------SGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQL 436
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 138/426 (32%), Positives = 198/426 (46%), Gaps = 38/426 (8%)
Query: 41 KLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP 97
+++HRDS LY +T + + S+ R + ++KS + +T STV
Sbjct: 38 EMIHRDSSRSPLYRHTETPFQRVANAMRRSINRANHFNKKSFVASTNTAE------STVK 91
Query: 98 V----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLP 150
+ +++S+G PP L V+DTGS + W++CQ CE C T FDPSKS TY TLP
Sbjct: 92 ASQGEYLMSYSVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTPIFDPSKSKTYKTLP 151
Query: 151 CDSSYC-----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
C S+ C T C C Y I+Y +G SQG + E +++ +
Sbjct: 152 CSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNTVI 211
Query: 206 GCSHNN-AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
GC HNN F E V G S L +G KFSYC+ + + + L G+
Sbjct: 212 GCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKLNFGD 271
Query: 265 GAILEG---DSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
A++ G STP+ GS YY+TLE S+G+K ++ + + + IDS
Sbjct: 272 AAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIIIDS 331
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAG 378
GTTLT L Y L V D Q S P + LCY + L P + HF
Sbjct: 332 GTTLTLLPQEDYSNLESAVADAIQANRVSDPSN-FLSLCYQTTPSGQLD-VPVITAHFK- 388
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
GAD+ L+ S F Q + V C A S++ +SI G +AQ N V YDL+ + +
Sbjct: 389 GADVELNPISTFVQVAEGVVCFAFHSSEV-------VSIFGNLAQLNLLVGYDLMEQTVS 441
Query: 439 FQRIDC 444
F+ DC
Sbjct: 442 FKPTDC 447
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 134/424 (31%), Positives = 194/424 (45%), Gaps = 36/424 (8%)
Query: 42 LLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV 98
L+HRDS YNP +T + + ++ S++R + + S + A D + S
Sbjct: 35 LIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDL-TSNSGE 93
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ +N S+G PP P +A+ DTGS L+W +C+PC+ C FDP S TY + C SS
Sbjct: 94 YLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQ 153
Query: 156 CT-----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
CT C + C Y+ Y + ++G I + ++D L ++ GC HN
Sbjct: 154 CTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNIIIGCGHN 213
Query: 211 NA-HFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
NA F+ + V G A S L + + KFSYC+ L + + G A++
Sbjct: 214 NAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFGTNAVVS 273
Query: 270 GD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG-VFIDSGTTLT 323
G STP+ YY+TL+ IS+G K + + +D+ S G + IDSGTTLT
Sbjct: 274 GTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQ-----YPGSDSGSGEGNIIIDSGTTLT 328
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
L Y L V P LCYS DL+ PA+ HF GAD+
Sbjct: 329 LLPTEFYSELEDAVASSIDAEKKQDPQ-TGLSLCYSA--TGDLK-VPAITMHF-DGADVN 383
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
L + F Q S + C A S SI G +AQ N+ V YD VSK + F+ D
Sbjct: 384 LKPSNCFVQISEDLVCFAFRGS-------PSFSIYGNVAQMNFLVGYDTVSKTVSFKPTD 436
Query: 444 CELL 447
C +
Sbjct: 437 CAKM 440
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 143/462 (30%), Positives = 217/462 (46%), Gaps = 49/462 (10%)
Query: 4 SHAILLLSLITLPFTSTRIFTSTTAAPAAGKPKR-----LVTKLLHRDSLLYNPNDTVDA 58
SH I+++ L+ L +S + S A+ + G +R L H DS N T
Sbjct: 6 SHMIIVI-LLALAVSSALV--SPAASTSRGLDRRPEKTWFRVSLRHVDS---GGNYTKFE 59
Query: 59 QAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDT 118
+ QR + R LS K++ A +H G F + +IG P A++DT
Sbjct: 60 RLQRAMKRGKLRLQRLSAKTASFESSVEAPVHAGNGE---FLMKLAIGTPAETYSAIMDT 116
Query: 119 GSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT----NDCGGYPDECWYNI 171
GS LIW +C+PC+ C FDP KS +++ LPC S C + C D C Y
Sbjct: 117 GSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCS---DGCEYLY 173
Query: 172 RYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS 231
Y + +QG + +E F F G + +GFGC +N Q G+ GLG
Sbjct: 174 SYGDYSSTQGVLATETFAF-----GDASVSKIGFGCGEDNDGSGFSQGAGLVGLG---RG 225
Query: 232 THSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS----YYVTL 286
SL+ ++G KFSYC+ +++ + ++L+ E + +TP+ + + S YY++L
Sbjct: 226 PLSLISQLGEPKFSYCLTSMDDSKGISSLLVGSEATMKNAITTPL-IQNPSQPSFYYLSL 284
Query: 287 EGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP 346
EGIS+G+ +L I+ + F + S G+ IDSGTT+T+L SA+ L+KE + L
Sbjct: 285 EGISVGDTLLPIEKSTFSIQNDGS-GGLIIDSGTTITYLEDSAFAALKKEFISQLK-LDV 342
Query: 347 SYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPS 405
LC++ + P + FHF GADL L AE+ +S V CL +G S
Sbjct: 343 DESGSTGLDLCFTLPPDASTVDVPQLVFHFE-GADLKLPAENYIIADSGLGVICLTMGSS 401
Query: 406 DINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+SI G QQN V +DL + + F C L
Sbjct: 402 -------SGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQL 436
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 149/469 (31%), Positives = 210/469 (44%), Gaps = 50/469 (10%)
Query: 1 MPSSHAILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQA 60
M S + +LL+ T F+ AA K T L+H S +P V A++
Sbjct: 1 MASVNNLLLIICFTFIFS--------PCISAASDSKGFSTNLIHIHSP-SSPYKNVKAES 51
Query: 61 ---QRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLD 117
L +++R YL + + P I F N SIG PP VLD
Sbjct: 52 LAKDTALESTLSRHAYLRARQQKALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLD 111
Query: 118 TGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT--------NDCGGYPDE 166
TGS L W++C+PC+ C ++ +KS +Y + C+ C +D G
Sbjct: 112 TGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPPCVSLGREGQCSDSG----S 167
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFET--SDEGKTFLYDVGFGCSHNNAHF----SDEQFT 220
C Y Y +G + G + E+ F + SDE KT VGFGC N +F D
Sbjct: 168 CLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKT--AQVGFGCGLQNLNFITSNRDGGVL 225
Query: 221 GVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDG 280
G+ + S S + KV F+YC GN++ A L+ G+ L GD TPM VI
Sbjct: 226 GLGPGLVSLVSQLSAIGKVSKSFAYCFGNISN-PNAGGFLVFGDATYLNGDMTPM-VIAE 283
Query: 281 SYYVTLEGISL--GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE 338
YYV L GI L GE LDI+ + F++ S GV IDSG+TL+ P Y+ +R V
Sbjct: 284 FYYVNLLGIGLGVGEPRLDINSSSFERKPDGS-GGVIIDSGSTLSVFPPEVYEVVRNAVV 342
Query: 339 DLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVF 398
D + P+ + C+ G I RDL FP + + ++ D S+F Q +F
Sbjct: 343 DKLKKGYNISPLTSSPD-CFEGKIERDLPLFPTLVLYLE-STGILNDRWSIFLQRYDELF 400
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ-RIDCEL 446
CL +GE LSIIG +AQQ+Y Y+L L + DC L
Sbjct: 401 CLGF----TSGE---GLSIIGTLAQQSYKFGYNLELSTLSIESNPDCGL 442
>gi|357449529|ref|XP_003595041.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484089|gb|AES65292.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 210
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 97/221 (43%), Positives = 130/221 (58%), Gaps = 34/221 (15%)
Query: 234 SLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGE 293
SL ++ KFSYC+G+L +Y YN LILGE A L GD+TP V +G +VT+EGIS+G+
Sbjct: 17 SLATQISKKFSYCMGSLTDKDYDYNQLILGEEAYLAGDTTPFQVYNGVNHVTMEGISIGQ 76
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL------LPS 347
K LDI P FK + +D Y+ L KEV +LFQ L L
Sbjct: 77 KSLDIAPGTFKMKNNVND-----------------VYELLCKEVRNLFQRLKFQEVRLQG 119
Query: 348 YPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDI 407
P W LCY G+++RDL+GFP + F+FAGGA + LD + F Q VFC++V PS
Sbjct: 120 SP----WALCYFGSVSRDLKGFPVVTFYFAGGAVIGLDTLNFFVQAKDDVFCMSVHPS-- 173
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLA 448
DLS+IG++AQQ+YNV YD +Y + IDC+LL+
Sbjct: 174 -----HDLSVIGLLAQQSYNVGYDKDKGLIYIESIDCQLLS 209
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 171 bits (433), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 131/421 (31%), Positives = 191/421 (45%), Gaps = 35/421 (8%)
Query: 41 KLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQK--SSQKAHDTRAHLHPGIST 95
+++HRDS LY P +T + + S+ R + + S+ A T +++
Sbjct: 34 EMIHRDSSRSPLYRPTETPFQRVANAVRRSINRGNHFKKAFVSTDSAESTV------VAS 87
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCD 152
+ + +S+G PP L ++DTGS ++W++C+PCE C T FDPSKS TY TLPC
Sbjct: 88 QGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKTLPCS 147
Query: 153 SSYC---TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
S+ C N + C Y+I Y +G S G + E ++D GC H
Sbjct: 148 SNTCESLRNTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVIGCGH 207
Query: 210 NN-AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
NN F +E V G S L +G KFSYC+ + + + L G+ A++
Sbjct: 208 NNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVV 267
Query: 269 EGD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
G STP+ ++G Y++TLE S+G+ ++ D + IDSGTTLT
Sbjct: 268 SGRGTVSTPLDPLNGQVFYFLTLEAFSVGDNRIEFS-GSSSSGSGSGDGNIIIDSGTTLT 326
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
L Y L V D+ + DP+ L D P + HF GAD+
Sbjct: 327 LLPQEDYLNLESAVSDVIK---LERARDPSKLLSLCYKTTSDELDLPVITAHFK-GADVE 382
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
L+ S F V C A S I +I G +AQQN V YDLV K + F+ D
Sbjct: 383 LNPISTFVPVEKGVVCFAFISSKIG-------AIFGNLAQQNLLVGYDLVKKTVSFKPTD 435
Query: 444 C 444
C
Sbjct: 436 C 436
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 208/433 (48%), Gaps = 53/433 (12%)
Query: 42 LLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKS----SQKAHDTRAHLHPGISTVP 97
L H DS N T + QR +N R L + + K DT P
Sbjct: 49 LRHVDS---GKNLTKIQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIKAPTHGGSG 105
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSS 154
F + SIG P V A++DTGS LIW +C+PC +C FDP KS +Y+ + C S
Sbjct: 106 EFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSG 165
Query: 155 YCT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
C ++C D C Y Y + ++G + +E F FE + + +GFGC
Sbjct: 166 LCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS----ISGIGFGCGVE 221
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG------- 263
N Q +G+ GLG S S +++ +KFSYC+ ++ E + ++ I
Sbjct: 222 NEGDGFSQGSGLVGLGRGPLSLISQLKE--TKFSYCLTSIEDSEASSSLFIGSLASGIVN 279
Query: 264 -EGAILEGDSTP-MSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
GA L+G+ T MS++ YY+ L+GI++G K L ++ + F+ + + G+ I
Sbjct: 280 KTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGT-GGMII 338
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA----WHLCYSGNINRDLQGFPAM 372
DSGTT+T+L +A++ L++E S P+D + LC+ P M
Sbjct: 339 DSGTTITYLEETAFKVLKEEFTSRM-----SLPVDDSGSTGLDLCFKLPDAAKNIAVPKM 393
Query: 373 AFHFAGGADLVLDAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
FHF GADL L E+ +SS+ V CLA+G S NG +SI G + QQN+NV +D
Sbjct: 394 IFHFK-GADLELPGENYMVADSSTGVLCLAMGSS--NG-----MSIFGNVQQQNFNVLHD 445
Query: 432 LVSKQLYFQRIDC 444
L + + F +C
Sbjct: 446 LEKETVSFVPTEC 458
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 132/420 (31%), Positives = 191/420 (45%), Gaps = 38/420 (9%)
Query: 41 KLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP 97
+L+HRDS LY P S+ R L + S ++ +++ G
Sbjct: 31 ELIHRDSSKSPLYKPAQNKFQHVVNAARRSINRANRLFKDSLSNTPESTVYVNGG----- 85
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSS 154
+ + +S+G PP V+DTGS ++W++C+PCEQC T F+PSKS +Y +PC S+
Sbjct: 86 EYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSSYKNIPCSSN 145
Query: 155 YCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
C + C + C Y I +++ SQG + E +++ GC HN
Sbjct: 146 LCQSVRYTSCNK-QNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGCGHN 204
Query: 211 NAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
N + +G+ GLG S T L +G KFSYC+ L + L G+ A++
Sbjct: 205 NRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDSNKTSKLNFGDAAVVS 264
Query: 270 GD---STPMSVID--GSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
GD STP D YY+TLE S+G K ++ F+ D + + +DSGTTLT
Sbjct: 265 GDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIE-----FEVLDDSEEGNIILDSGTTLTL 319
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVL 384
L Y L V L + P + +LCYS I D FP + HF GAD+ L
Sbjct: 320 LPSHVYTNLESAVAQLVKLDRVDDP-NQLLNLCYS--ITSDQYDFPIITAHFK-GADIKL 375
Query: 385 DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ S F + V CLA S I G +AQ N V YDL + F+ DC
Sbjct: 376 NPISTFAHVADGVVCLAFTSSQTG-------PIFGNLAQLNLLVGYDLQQNIVSFKPSDC 428
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 136/436 (31%), Positives = 196/436 (44%), Gaps = 40/436 (9%)
Query: 31 AAGKPKR-LVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTR 86
A KPK L+HRDS YNP +T + + ++ S+ R + ++K +
Sbjct: 23 ANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDNTPQPQID 82
Query: 87 AHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKS 143
+ G + +N SIG PP P +A+ DTGS L+W +C PC+ C FDP S
Sbjct: 83 LTSNSG-----EYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTS 137
Query: 144 LTYATLPCDSSYCT-----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
TY + C SS CT C + C Y++ Y + ++G I + +SD
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPM 197
Query: 199 FLYDVGFGCSHNNA-HFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAY 257
L ++ GC HNNA F+ + V G S L + + KFSYC+ L +
Sbjct: 198 QLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQT 257
Query: 258 NMLILGEGAILEGD---STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ + G AI+ G STP+ + + YY+TL+ IS+G K + + S+
Sbjct: 258 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQI----QYSGSDSESSE 313
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPA 371
+ IDSGTTLT L Y L V P LCYS DL+ P
Sbjct: 314 GNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQS-GLSLCYSA--TGDLK-VPV 369
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+ HF GAD+ LD+ + F Q S + C A S SI G +AQ N+ V YD
Sbjct: 370 ITMHF-DGADVKLDSSNAFVQVSEDLVCFAFRGS-------PSFSIYGNVAQMNFLVGYD 421
Query: 432 LVSKQLYFQRIDCELL 447
VSK + F+ DC +
Sbjct: 422 TVSKTVSFKPTDCAKM 437
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 136/437 (31%), Positives = 196/437 (44%), Gaps = 40/437 (9%)
Query: 31 AAGKPKR-LVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTR 86
A KPK L+HRDS YNP +T + + ++ S+ R + ++K +
Sbjct: 23 ANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDNTPQPQID 82
Query: 87 AHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKS 143
+ G + +N SIG PP P +A+ DTGS L+W +C PC+ C FDP S
Sbjct: 83 LTSNSG-----EYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTS 137
Query: 144 LTYATLPCDSSYCT-----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
TY + C SS CT C + C Y++ Y + ++G I + +SD
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPM 197
Query: 199 FLYDVGFGCSHNNA-HFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAY 257
L ++ GC HNNA F+ + V G S L + + KFSYC+ L +
Sbjct: 198 QLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQT 257
Query: 258 NMLILGEGAILEGD---STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ + G AI+ G STP+ + + YY+TL+ IS+G K + + S+
Sbjct: 258 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQI----QYSGSDSESSE 313
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPA 371
+ IDSGTTLT L Y L V P LCYS DL+ P
Sbjct: 314 GNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQS-GLSLCYSA--TGDLK-VPV 369
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+ HF GAD+ LD+ + F Q S + C A S SI G +AQ N+ V YD
Sbjct: 370 ITMHF-DGADVKLDSSNAFVQVSEDLVCFAFRGS-------PSFSIYGNVAQMNFLVGYD 421
Query: 432 LVSKQLYFQRIDCELLA 448
VSK + F+ DC +
Sbjct: 422 TVSKTVSFKPTDCAKMG 438
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 144/456 (31%), Positives = 211/456 (46%), Gaps = 49/456 (10%)
Query: 10 LSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSL---LYNPNDT-----VDAQAQ 61
LS +TL S S + A + G +L+HRDS Y P + VDA A+
Sbjct: 4 LSFLTLSLFSLCFIASFSHALSNG----FSVELIHRDSPKSPYYKPTENKYQHFVDA-AR 58
Query: 62 RTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSS 121
R++N + F K DT I + + +S+G PP + DTGS
Sbjct: 59 RSINRANHFF---------KDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSD 109
Query: 122 LIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYT 174
++W++C+PCEQC T F+PSKS +Y +PC S C + C + C Y I Y
Sbjct: 110 IVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQ-NSCQYKISYG 168
Query: 175 NGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPA-TSSTH 233
+ SQG + + + E++ + GC +NA +G+ GLG S
Sbjct: 169 DSSHSQGDLSVDTLSLESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLIT 228
Query: 234 SLVEKVGSKFSYC-IGNLNYFEYAYNMLILGEGAILEGD---STPMSVIDGS-YYVTLEG 288
L +G KFSYC + LN A ++L G+ A++ GD STP+ D Y++TL+
Sbjct: 229 QLGSSIGGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQA 288
Query: 289 ISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY 348
S+G K ++ + +D + + IDSGTTLT + Y L V DL +
Sbjct: 289 FSVGNKRVEFGGSSEGGDD---EGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDD 345
Query: 349 PMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDIN 408
P + + LCYS + + FP + HF GAD+ L + S F + + C A PS
Sbjct: 346 P-NQQFSLCYS--LKSNEYDFPIITVHFK-GADVELHSISTFVPITDGIVCFAFQPSPQL 401
Query: 409 GERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
G SI G +AQQN V YDL K + F+ DC
Sbjct: 402 G------SIFGNLAQQNLLVGYDLQQKTVSFKPTDC 431
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 132/427 (30%), Positives = 197/427 (46%), Gaps = 50/427 (11%)
Query: 41 KLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP 97
+++HRDS + P +T Q QR N + + ++ H +AH +
Sbjct: 32 EMIHRDSSRSPFFRPTET---QFQRVANA-------VHRSVNRANHFHKAHKAAKATITQ 81
Query: 98 ---VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC 151
+ +++S+G PP ++DTGS +IW++C+PCE+C T FDPSKS TY LP
Sbjct: 82 NDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPF 141
Query: 152 DSSYC--TNDCGGYPDE---CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
S+ C D D C Y I Y +G SQG + E +++ G
Sbjct: 142 SSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIG 201
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEK----VGSKFSYCIGNLNYFEYAYNMLIL 262
C NN + + +G+ GLG S + + + +G KFSYC+ +++ N
Sbjct: 202 CGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLN---F 258
Query: 263 GEGAILEGD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
G+ A++ GD STP+ D YY+TLE S+G ++ + F+ + + ID
Sbjct: 259 GDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGE---KGNIIID 315
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SGTTLT L Y L V DL + P+ LCY D P + HF+
Sbjct: 316 SGTTLTLLPNDIYSKLESAVADLVELDRVKDPLK-QLSLCYRSTF--DELNAPVIMAHFS 372
Query: 378 GGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GAD+ L+A + F + V CLA S I I G +AQQN+ V YDL K +
Sbjct: 373 -GADVKLNAVNTFIEVEQGVTCLAFISSKIG-------PIFGNMAQQNFLVGYDLQKKIV 424
Query: 438 YFQRIDC 444
F+ DC
Sbjct: 425 SFKPTDC 431
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 128/422 (30%), Positives = 200/422 (47%), Gaps = 41/422 (9%)
Query: 41 KLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQK-SSQKAHDTRAHLHPGISTV 96
+++HRDS ++P +T + ++ S+ R +L+Q S + +T IS +
Sbjct: 32 EMIHRDSSRSPFFSPTETQFQRVANAVHRSINRANHLNQSFVSPNSPETTV-----ISAL 86
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDS 153
+ +++S+G P + +LDTGS +IW++CQPC++C T FD SKS TY TLPC S
Sbjct: 87 GEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPS 146
Query: 154 SYCTNDCGGY---PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
+ C + G + C Y+I Y +G S G + E +++ GC
Sbjct: 147 NTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIGCGRY 206
Query: 211 NAHFSDEQFTGVFGLGPATSS-THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
NA +E+ +G+ GLG S L G KFSYC+ + A + L G A++
Sbjct: 207 NAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCL--VPGLSTASSKLNFGNAAVVS 264
Query: 270 GD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
G STP+ +G Y++TLE S+G ++ F + + IDSGTTLT
Sbjct: 265 GRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIE-----FGSPGSGGKGNIIIDSGTTLTA 319
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDP--AWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
L Y L V + ++ DP LCY ++ P + HF+ GAD+
Sbjct: 320 LPNGVYSKLEAAVA---KTVILQRVRDPNQVLGLCYKVTPDKLDASVPVITAHFS-GADV 375
Query: 383 VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L+A + F Q + V C A P++ ++ G +AQQN V YDL + F+
Sbjct: 376 TLNAINTFVQVADDVVCFAFQPTETG-------AVFGNLAQQNLLVGYDLQMNTVSFKHT 428
Query: 443 DC 444
DC
Sbjct: 429 DC 430
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 136/434 (31%), Positives = 212/434 (48%), Gaps = 55/434 (12%)
Query: 42 LLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKS----SQKAHDTRAHLHPGISTVP 97
L H DS N T + QR +N R L + + DT P
Sbjct: 50 LRHVDS---GKNLTKIQKIQRGINRGFHRLNRLGAVAVLAVASNPDDTNNIKAPTHGGSG 106
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSS 154
F + SIG P V A++DTGS LIW +C+PC +C FDP KS +Y+ + C S
Sbjct: 107 EFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSG 166
Query: 155 YCT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
C ++C D C Y Y + ++G + +E F FE + + +GFGC
Sbjct: 167 LCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENS----ISGIGFGCGVE 222
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG------- 263
N Q +G+ GLG S S +++ +KFSYC+ ++ E + ++ I
Sbjct: 223 NEGDGFSQGSGLVGLGRGPLSLISQLKE--TKFSYCLTSIEDSEASSSLFIGSLASGIVN 280
Query: 264 -EGAILEGDSTP-MSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
GA L+G+ T MS++ YY+ L+GI++G K L ++ + F+ ++ + G+ I
Sbjct: 281 KTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGT-GGMII 339
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA----WHLCYS-GNINRDLQGFPA 371
DSGTT+T+L +A++ L++E S P+D + LC+ N +++ P
Sbjct: 340 DSGTTITYLEETAFKVLKEEFTSRM-----SLPVDDSGSTGLDLCFKLPNAAKNI-AVPK 393
Query: 372 MAFHFAGGADLVLDAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+ FHF GADL L E+ +SS+ V CLA+G S NG +SI G + QQN+NV +
Sbjct: 394 LIFHFK-GADLELPGENYMVADSSTGVLCLAMGSS--NG-----MSIFGNVQQQNFNVLH 445
Query: 431 DLVSKQLYFQRIDC 444
DL + + F +C
Sbjct: 446 DLEKETVTFVPTEC 459
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 136/450 (30%), Positives = 198/450 (44%), Gaps = 58/450 (12%)
Query: 22 IFTSTTAAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKS 78
+ ++ + A G +L+HRDS +YNP + + TL S++ L +
Sbjct: 14 LISTAVVSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVTNT 73
Query: 79 SQK-AHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---G 134
+ ++ R + + S+G PP P +AV DTGS +IW +C+PC C
Sbjct: 74 VEAPIYNNRGE----------YLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQD 123
Query: 135 ATTFDPSKSLTYATLPCDSSYCT-----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFN 189
F+PSKS TY + C S C+ N C PD C Y+I Y + SQG +
Sbjct: 124 LPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPD-CTYSISYGDNSHSQGDFAVDTLT 182
Query: 190 FETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGV--FGLGPATSSTHSLVEKVGSKFSYCI 247
++ GC H+NA D +G+ GLGPA S + VG KFSYC+
Sbjct: 183 MGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPA-SLIKQMGSAVGGKFSYCL 241
Query: 248 -------GNLNYFEYAYNMLILGEGAILEGDSTPMSVID---GSYYVTLEGISLGEKMLD 297
G N + N + G GA+ STP+ + D Y + L+ +S+G
Sbjct: 242 TPIGNDDGGSNKLNFGSNANVSGSGAV----STPIYISDKFKSFYSLKLKAVSVGRN--- 294
Query: 298 IDPNLF---KKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
N F + A + IDSGTTLT L Y K + + + DP
Sbjct: 295 ---NTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISN---SINLQRTDDPNQ 348
Query: 355 HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKD 414
L Y D P +A HF GA+L L E+V + S +V CLA G + D
Sbjct: 349 FLEYCFETTTDDYKVPFIAMHFE-GANLRLQRENVLIRVSDNVICLAFA-----GAQDND 402
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+SI G IAQ N+ V YD+ + L F+ ++C
Sbjct: 403 ISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 125/372 (33%), Positives = 186/372 (50%), Gaps = 48/372 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-----GATTFDPSKSLTYATLPCDS 153
+ + SIG PP A++DTGS L+W+KC C+ C G T F S +Y LPC+S
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 154 SYCT--NDCGGYP---DECWYNIRYTNGPDSQGTIGSEQFNFETSDEG---KTFLYDVGF 205
++C+ + G P + C Y Y +G + G +GS++ +F + G ++F F
Sbjct: 65 THCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLF 124
Query: 206 GCSHNNAHFSDEQFT-GVFGLGPATSS-THSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
GC+ D FT G+ GLG + S L +K+G KFSYC+ + + A + L LG
Sbjct: 125 GCARKLK--GDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFLG 182
Query: 264 EGAILEG-DSTPMSVIDGS------YYVTLEGISLG---------EKMLDIDPNLFKKND 307
A L G D ++ G YYV L+ I++G E + F N
Sbjct: 183 SSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPFLANK 242
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ 367
T IDSGTT T L P Y+ +RK +E+ Q +LP+ LC++ + +
Sbjct: 243 T------VIDSGTTYTLLTPPVYEAMRKSIEE--QVILPTLGNSAGLDLCFNSSGDTSY- 293
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
GFP++ F+FA LVL E++F S V CL++ S DLSIIG + QQN++
Sbjct: 294 GFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSG------GDLSIIGNMQQQNFH 347
Query: 428 VAYDLVSKQLYF 439
+ YDLV+ Q+ F
Sbjct: 348 ILYDLVASQISF 359
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 135/445 (30%), Positives = 210/445 (47%), Gaps = 51/445 (11%)
Query: 31 AAGKPKRLVTKLLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRA 87
++G PK +L+HRDS L YNP TV + LN + R + S++ + + T
Sbjct: 19 SSGHPKNFSVELIHRDSPLSPIYNPQITVTDR----LNAAFLRSVSRSRRFNHQLSQT-- 72
Query: 88 HLHPG-ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKS 143
L G I F+++ +IG PP+ A+ DTGS L WV+C+PC+QC FD KS
Sbjct: 73 DLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKS 132
Query: 144 LTYATLPCDSSYCT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK 197
TY + PCDS C C + C Y Y + S+G + +E + +++
Sbjct: 133 STYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSP 192
Query: 198 TFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTH-SLVEKVGS----KFSYCIGNLNY 252
FGC +NN DE +G+ H SL+ ++GS KFSYC+ + +
Sbjct: 193 VSFPGTVFGCGYNNGGTFDETGSGII----GLGGGHLSLISQLGSSISKKFSYCLSHKSA 248
Query: 253 FEYAYNMLILGEGAI---LEGDSTPMS--VIDGS----YYVTLEGISLGEKMLDIDPNLF 303
+++ LG +I L DS +S ++D YY+TLE IS+G+K + + +
Sbjct: 249 TTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSY 308
Query: 304 KKND----TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS 359
ND + + + IDSGTTLT L + VE+ G DP L +
Sbjct: 309 NPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTG--AKRVSDPQGLLSHC 366
Query: 360 GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIG 419
G P + HF GAD+ L + F + S + CL++ P+ +++I G
Sbjct: 367 FKSGSAEIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPT-------TEVAIYG 418
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDC 444
AQ ++ V YDL ++ + FQ +DC
Sbjct: 419 NFAQMDFLVGYDLETRTVSFQHMDC 443
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 135/423 (31%), Positives = 195/423 (46%), Gaps = 35/423 (8%)
Query: 41 KLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP 97
+++HRDS Y P +T + L S+ R + ++ + + +T I++
Sbjct: 35 EIIHRDSSRSPYYRPTETQFQRVANALRRSINRANHFNKPNLVASTNTAESTV--IASQG 92
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSS 154
+ +++S+G PP L ++DTGS +IW++CQPCE C T FDPS+S TY TLPC S+
Sbjct: 93 EYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSSN 152
Query: 155 YCTN-----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C + C DEC Y I Y + SQG + E ++D GC H
Sbjct: 153 ICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVIGCGH 212
Query: 210 NN-AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
NN F E V G S L +G KFSYC+ L + + L G+ A++
Sbjct: 213 NNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGDEAVV 272
Query: 269 EGD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
G STP+ +G Y++TLE S+G+ I+ + + + IDSGTTLT
Sbjct: 273 SGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNR--IEFGSSSFESSGGEGNIIIDSGTTLT 330
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYSGNINRDLQGFPAMAFHFAGGAD 381
L Y L V D + DP+ LCY + +L P + HF GAD
Sbjct: 331 ILPEDDYLNLESAVADAIE---LERVEDPSKFLRLCYRTTSSDELN-VPVITAHFK-GAD 385
Query: 382 LVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
+ L+ S F + V C A S I I G +AQQN V YDLV + + F+
Sbjct: 386 VELNPISTFIEVDEGVVCFAFRSSKIG-------PIFGNLAQQNLLVGYDLVKQTVSFKP 438
Query: 442 IDC 444
DC
Sbjct: 439 TDC 441
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 139/426 (32%), Positives = 195/426 (45%), Gaps = 40/426 (9%)
Query: 40 TKLLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTV 96
T L+ RDS L YNP++T + Q+ + S++R + T + P IS
Sbjct: 37 TDLISRDSPLSPFYNPSETQFDRLQKAFHRSISR----ANHFRANGVSTNSIQSPVISNN 92
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDS 153
+ +N S+G PPV + DTGS L+W +C+PC+ C FDP+KS TY L C+
Sbjct: 93 GEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEG 152
Query: 154 SYCTN--DCGGYPDE--CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C+N GG D+ C Y+ Y +G + G + + ++ + V FGC H
Sbjct: 153 KSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVFGCGH 212
Query: 210 NN-AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
NN F V G S L +G +FSYC+ L + + G I+
Sbjct: 213 NNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGNDPSVSSKMHFGSRGIV 272
Query: 269 EGD---STPMSVI--DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA---GVFIDSGT 320
G STP++ D YY+TLE +S+G K L K +DA + IDSGT
Sbjct: 273 SGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYK-GFSKVGSPLADADEGNIIIDSGT 331
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF--PAMAFHFAG 378
TLT L Y TL V G P + + LCYS +L G P + HF
Sbjct: 332 TLTLLPQDFYGTLESNVVSAIGG-KPVRDPNNVFSLCYS-----NLSGLRIPTITAHFV- 384
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
GADL L + F Q +FC A+ P DL+I G +AQ N+ V YDL S+ +
Sbjct: 385 GADLELKPLNTFVQVQEDLFCFAMIP-------VSDLAIFGNLAQMNFLVGYDLKSRTVS 437
Query: 439 FQRIDC 444
F+ DC
Sbjct: 438 FKPTDC 443
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 125/372 (33%), Positives = 185/372 (49%), Gaps = 48/372 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-----GATTFDPSKSLTYATLPCDS 153
+ + SIG PP A++DTGS L+W+KC C+ C G T F S +Y LPC+S
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 154 SYCT--NDCGGYP---DECWYNIRYTNGPDSQGTIGSEQFNFETSDEG---KTFLYDVGF 205
++C+ + G P + C Y Y +G + G +GS++ +F + G ++F F
Sbjct: 65 THCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLF 124
Query: 206 GCSHNNAHFSDEQFT-GVFGLGPATSS-THSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
GC D FT G+ GLG + S L +K+G KFSYC+ + + A + L LG
Sbjct: 125 GCGRKLK--GDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFLG 182
Query: 264 EGAILEG-DSTPMSVIDGS------YYVTLEGISLG---------EKMLDIDPNLFKKND 307
A L G D ++ G YYV L+ I++G E + F N
Sbjct: 183 SSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGPFLANK 242
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ 367
T IDSGTT T L P Y+ +RK +E+ Q +LP+ LC++ + +
Sbjct: 243 T------VIDSGTTYTLLTPPVYEAMRKSIEE--QVILPTLGNSAGLDLCFNSSGDTSY- 293
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
GFP++ F+FA LVL E++F S V CL++ S DLSIIG + QQN++
Sbjct: 294 GFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSG------GDLSIIGNMQQQNFH 347
Query: 428 VAYDLVSKQLYF 439
+ YDLV+ Q+ F
Sbjct: 348 ILYDLVASQISF 359
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 147/462 (31%), Positives = 217/462 (46%), Gaps = 61/462 (13%)
Query: 11 SLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSL---LYNPNDT-----VDAQAQR 62
S +TL F S S + A G +L+HRDSL LY P VDA A+R
Sbjct: 5 SFLTLLFFSICFIVSFSHAQKNG----FSVELIHRDSLKSPLYKPTQNKYQYFVDA-ARR 59
Query: 63 TLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSL 122
++N + + Y A+ ++ + P I + + +S+G PP ++DTGS +
Sbjct: 60 SINRANHFYKY------SLANIPQSTVIPDIGE---YLMTYSVGTPPFKLYGIVDTGSDI 110
Query: 123 IWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTN 175
+W++C+PC++C T F+PSKS +Y +PC S C + C + C Y+ Y +
Sbjct: 111 VWLQCEPCQECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCND-KNYCEYSTYYGD 169
Query: 176 GPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGV--FGLGPATSSTH 233
S G + + E+++ ++ GC NN + +G+ FG GPA+ T
Sbjct: 170 NSHSGGDLSVDTLTLESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQ 229
Query: 234 SLVEKVGSKFSYCIGNL----NYFEYAYNMLILGEGAILEGD---STPMSVIDGS--YYV 284
L G KFSYC+ L N A + L G+ A + GD +TP+ D YY+
Sbjct: 230 -LGSSTGGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYL 288
Query: 285 TLEGISLGEKMLDID--PNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
TLE S+G + ++I PN ++ + IDSGTTLT L Y L V DL +
Sbjct: 289 TLEAFSVGNRRVEIGGVPN------GDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVK 342
Query: 343 GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV 402
P +LCYS + + FP + HF GAD+ L S F + VFCLA
Sbjct: 343 LERVDDPTQ-TLNLCYS--VKAEGYDFPIITMHFK-GADVDLHPISTFVSVADGVFCLAF 398
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
E +D +I G +AQQN V YDL K + F+ DC
Sbjct: 399 -------ESSQDHAIFGNLAQQNLMVGYDLQQKIVSFKPSDC 433
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 136/450 (30%), Positives = 197/450 (43%), Gaps = 58/450 (12%)
Query: 22 IFTSTTAAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKS 78
+ ++ + A G +L+HRDS +YNP + + TL S++ L +
Sbjct: 14 LISTAVVSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVTNT 73
Query: 79 SQK-AHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---G 134
+ ++ R + + S+G PP P +AV DTGS +IW +C PC C
Sbjct: 74 VEAPIYNNRGE----------YLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQD 123
Query: 135 ATTFDPSKSLTYATLPCDSSYCT-----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFN 189
F+PSKS TY + C S C+ N C PD C Y+I Y + SQG +
Sbjct: 124 LPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPD-CTYSISYGDNSHSQGDFAVDTLT 182
Query: 190 FETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGV--FGLGPATSSTHSLVEKVGSKFSYCI 247
++ GC H+NA D +G+ GLGPA S + VG KFSYC+
Sbjct: 183 MGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPA-SLIKQMGSAVGGKFSYCL 241
Query: 248 -------GNLNYFEYAYNMLILGEGAILEGDSTPMSVID---GSYYVTLEGISLGEKMLD 297
G N + N + G GA+ STP+ + D Y + L+ +S+G
Sbjct: 242 TPIGNDDGGSNKLNFGSNANVSGSGAV----STPIYISDKFKSFYSLKLKAVSVGRN--- 294
Query: 298 IDPNLF---KKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
N F + A + IDSGTTLT L Y K + + + DP
Sbjct: 295 ---NTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISN---SINLQRTDDPNQ 348
Query: 355 HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKD 414
L Y D P +A HF GA+L L E+V + S +V CLA G + D
Sbjct: 349 FLEYCFETTTDDYKVPFIAMHFE-GANLRLQRENVLIRVSDNVICLAFA-----GAQDND 402
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+SI G IAQ N+ V YD+ + L F+ ++C
Sbjct: 403 ISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 144/458 (31%), Positives = 204/458 (44%), Gaps = 48/458 (10%)
Query: 9 LLSLITLPFTSTRIFTSTTAAPAAGKPKR-LVTKLLHRDSL---LYNPNDTVDAQAQRTL 64
++SL T S +F+S + KPK T L+HRDS YNP +T + + +
Sbjct: 1 MVSLFTSVLLSLCLFSSHILSNVNAKPKLGFTTDLIHRDSPKSPFYNPAETPSQRIRNAI 60
Query: 65 NMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV---FYVNFSIGQPPVPQLAVLDTGSS 121
+ S R + + S A + P P + +N S+G PP P +AV DTGS+
Sbjct: 61 HRSFNRVSHFTDLSEMDA----SLNSPQTDITPCGGEYLMNLSLGTPPSPIMAVADTGSN 116
Query: 122 LIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCT-----NDCGGYPDECWYNIRY 173
LIW +C+PC+ C FDP S TY + C SS CT C C Y + Y
Sbjct: 117 LIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSY 176
Query: 174 TNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA-HFSDEQFTGVFGLGPATSST 232
+G + G + ++D L ++ GC NNA F ++ V G A S
Sbjct: 177 ADGSYTMGKFAVDTLTLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLI 236
Query: 233 HSLVEKVGSKFSYCIGNLN----YFEYAYNMLILGEGAILEGDSTPMSVI--DGSYYVTL 286
L + + KFSYC+ N + N ++ G G + STP+ V D YY+TL
Sbjct: 237 KQLGDSIDGKFSYCLVPENDQTSKINFGTNAVVSGPGTV----STPLVVKSRDTFYYLTL 292
Query: 287 EGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP 346
+ IS+G K + + K N + IDSGTTLT L Y + V L
Sbjct: 293 KSISVGSKNMQTPDSNIKGN-------MVIDSGTTLTLLPVKYYIEIENAVASLINA-DK 344
Query: 347 SYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSD 406
S LCY N DL P + HF GAD+ L + F++ + + CLA G S
Sbjct: 345 SKDERIGSSLCY--NATADLN-IPVITMHFE-GADVKLYPYNSFFKVTEDLVCLAFGMS- 399
Query: 407 INGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
F I G +AQ+N+ V YD SK + F+ DC
Sbjct: 400 -----FYRNGIYGNVAQKNFLVGYDTASKTMSFKPTDC 432
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 138/434 (31%), Positives = 205/434 (47%), Gaps = 45/434 (10%)
Query: 38 LVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQ--KSSQKAHDTRAHLHPGIST 95
L +L H D+ + +A R + M+R + + K+ D + +H G
Sbjct: 40 LRVRLTHVDAHGNYSRLQLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQVPVHAGNGE 99
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCD 152
F ++ +IG P + A++DTGS L+W +C+PC C FDPS S TYAT+PC
Sbjct: 100 ---FLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 156
Query: 153 SSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
S+ C T+ C +C Y Y + +QG + SE F T + K L V FGC
Sbjct: 157 SALCSDLPTSTCTS-ASKCGYTYTYGDASSTQGVLASETF---TLGKEKKKLPGVAFGCG 212
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAI 267
N Q G+ GLG SLV ++G KFSYC+ +L+ + +L+ G A
Sbjct: 213 DTNEGDGFTQGAGLVGLG---RGPLSLVSQLGLDKFSYCLTSLDDGDGKSPLLLGGSAAA 269
Query: 268 LEG-------DSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
+ +TP+ V + S YYV+L G+++G + + + F D + GV +
Sbjct: 270 ISESAATAPVQTTPL-VKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGT-GGVIV 327
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP-MDPAWHLCYSGNINR-DLQGFPAMAF 374
DSGT++T+L Y+ L+K + Q LP+ + LC+ G D P +
Sbjct: 328 DSGTSITYLELQGYRALKKAF--VAQMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVL 385
Query: 375 HFAGGADLVLDAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
HF GGADL L AE+ +S+S CL V PS + LSIIG QQN+ YD+
Sbjct: 386 HFDGGADLDLPAENYMVLDSASGALCLTVAPS-------RGLSIIGNFQQQNFQFVYDVA 438
Query: 434 SKQLYFQRIDCELL 447
L F + C L
Sbjct: 439 GDTLSFAPVQCNKL 452
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 136/425 (32%), Positives = 199/425 (46%), Gaps = 45/425 (10%)
Query: 41 KLLHRDSL---LYNPNDT-----VDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPG 92
+L+HRDS Y P + VDA A+R++N + F K DT
Sbjct: 31 ELIHRDSPKSPYYKPTENKYQHFVDA-ARRSINRANHFF---------KDSDTSTPESTV 80
Query: 93 ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATL 149
I + + +S+G PP + DTGS ++W++C+PCEQC T F+PSKS +Y +
Sbjct: 81 IPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNI 140
Query: 150 PCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
PC S C + C + C Y I Y + SQG + + + E++
Sbjct: 141 PCLSKLCHSVRDTSCSDQ-NSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVI 199
Query: 206 GCSHNNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYC-IGNLNYFEYAYNMLILG 263
GC +NA +G+ GLG S L +G KFSYC + LN A ++L G
Sbjct: 200 GCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFG 259
Query: 264 EGAILEGD---STPMSVIDGS-YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
+ A++ GD STP+ D Y++TL+ S+G K ++ + +D + + IDSG
Sbjct: 260 DAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDD---EGNIIIDSG 316
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
TTLT + Y L V DL + P + + LCYS + + FP + HF G
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDRVDDP-NQQFSLCYS--LKSNEYDFPIITAHFK-G 372
Query: 380 ADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
AD+ L + S F + + C A PS G SI G +AQQN V YDL K + F
Sbjct: 373 ADIELHSISTFVPITDGIVCFAFQPSPQLG------SIFGNLAQQNLLVGYDLQQKTVSF 426
Query: 440 QRIDC 444
+ DC
Sbjct: 427 KPTDC 431
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 134/405 (33%), Positives = 194/405 (47%), Gaps = 47/405 (11%)
Query: 66 MSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
M ++R+ +S S A L G + + + +IG PPVP +A+ DTGS L W
Sbjct: 67 MMLSRYFTMSTSSDAGP----ARLRSGQAE---YLMELAIGTPPVPFVALADTGSDLTWT 119
Query: 126 KCQPCEQC---GATTFDPSKSLTYATLPCDSSYC-----TNDCGGYPDECWYNIRYTNGP 177
+CQPC+ C +D + S +++ +PC S+ C + +C C Y Y +G
Sbjct: 120 QCQPCKLCFPQDTPIYDTAVSSSFSPVPCASATCLPIWSSRNCTASSSPCRYRYAYGDGA 179
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
S G +G+E F + + + FGC +N S TG GLG + SLV
Sbjct: 180 YSAGVLGTETLTFPGAP--GVSVGGIAFGCGVDNGGLSYNS-TGTVGLG---RGSLSLVA 233
Query: 238 KVG-SKFSYCIGNLNYFEYAYNMLIL-GEGAILEGDSTPMSV----------IDGSYYVT 285
++G KFSYC+ ++F + +L G A L ST +V + YYV+
Sbjct: 234 QLGVGKFSYCL--TDFFNTSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVS 291
Query: 286 LEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF-QGL 344
LEGISLG+ L I F D S G+ +DSGTT T+LV SA++ + V + Q +
Sbjct: 292 LEGISLGDARLPIPNGTFDLRDDGS-GGMIVDSGTTFTFLVESAFRVVVDHVAGVLRQPV 350
Query: 345 LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVL--DAESVFYQESSSVFCLAV 402
+ + +D +G + L P M HFAGGAD+ L D F QE SS FCL
Sbjct: 351 VNASSLDSPCFPAATG--EQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEESS-FCL-- 405
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+I G D+SI+G QQN + +D+ QL F DC L
Sbjct: 406 ---NIAGSPSADVSILGNFQQQNIQMLFDITVGQLSFMPTDCGKL 447
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 141/458 (30%), Positives = 209/458 (45%), Gaps = 44/458 (9%)
Query: 6 AILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQAQR 62
++ LL+++TL F+ T + P +L++RDS YNP +T +
Sbjct: 4 SVSLLAIVTLIFSGTLV-------PIDAAKDGFTVELINRDSPKSPFYNPRETPTQRIVS 56
Query: 63 TLNMSMARFIYLS-QKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSS 121
+ SM+R + S K+S DT IS + + FS+G P LA+ DTGS
Sbjct: 57 AVRRSMSRVHHFSPTKNSDIFTDTAQSEM--ISNQGEYLMKFSLGTPAFDILAIADTGSD 114
Query: 122 LIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT-----NDCGGYPDE-CWYNIR 172
LIW +C+PC+QC A FDP S TY + C + C C G ++ C Y+
Sbjct: 115 LIWTQCKPCDQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYS 174
Query: 173 YTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN-AHFSDEQFTGVFGLGPATSS 231
Y + + G + ++ ++ L GC HNN F+++ V G S
Sbjct: 175 YGDRSFTSGNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISL 234
Query: 232 THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEG---DSTPMSVIDGS--YYVTL 286
L + KFSYC+ L+ + L G I+ G STP+ D Y++TL
Sbjct: 235 ISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTL 294
Query: 287 EGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP 346
E +S+G + + + F S+ + IDSGTTLT + L V+D G P
Sbjct: 295 EAVSVGSERIKFPGSSFGT----SEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAG-TP 349
Query: 347 SYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSD 406
LCYS I+ DL+ FP++ HF GAD+ L+ + F Q S +V C A P
Sbjct: 350 VEDPSGILSLCYS--IDADLK-FPSITAHF-DGADVKLNPLNTFVQVSDTVLCFAFNP-- 403
Query: 407 INGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
IN +I G +AQ N+ V YDL K + F+ DC
Sbjct: 404 INSG-----AIFGNLAQMNFLVGYDLEGKTVSFKPTDC 436
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 146/472 (30%), Positives = 211/472 (44%), Gaps = 73/472 (15%)
Query: 5 HAILLLSL-ITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQA 60
H LL S+ I L F S ++ A K R L+HRDS LYNP++T A
Sbjct: 6 HLGLLFSIVIALSFVSVAHISA-----AEVKNGRFSIDLIHRDSPKSPLYNPSET---PA 57
Query: 61 QRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV------FYVNFSIGQPPVPQLA 114
+R L+ RF+ S+ A + P PV + + SIG PP
Sbjct: 58 ER-LDRFFRRFMSFSE----------ASISPNTPEPPVSSNNGEYLMKISIGTPPFDVYG 106
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDEC 167
+ DTGS L+W +C PC C FDPSKS ++ + C+S C T C C
Sbjct: 107 IYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLC 166
Query: 168 WYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGL-G 226
++ Y +G +QG I +E ++ T + ++ FGC HNN+ +E G+FG G
Sbjct: 167 DFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGG 226
Query: 227 PATSSTHSLVEKVGS--KFSYCIGNLNYFEYAYNMLILGEGAILEGD---STPMSVIDGS 281
S T ++ +GS KFS C+ + +I G A + G STP+ D
Sbjct: 227 RPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDP 286
Query: 282 --YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG-VFIDSGTTLTWLVPSAYQTLRKEVE 338
Y+VTL+GIS+G+K+ F + + G VFID+GT T L Y
Sbjct: 287 TYYFVTLDGISVGDKLFP-----FSSSSPMATKGNVFIDAGTPPTLLPRDFYNR------ 335
Query: 339 DLFQGLLPSYPMDPAW------HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ 392
L QG+ + PM+P LCY + L P + HF GAD+ L + F
Sbjct: 336 -LVQGVKEAIPMEPVQDPDLQPQLCYR---SATLIDGPILTAHF-DGADVQLKPLNTFIS 390
Query: 393 ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
V+C A+ P D D I G Q N+ + +DL K++ F+ +DC
Sbjct: 391 PKEGVYCFAMQPID------GDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 140/435 (32%), Positives = 208/435 (47%), Gaps = 49/435 (11%)
Query: 38 LVTKLLHRDSLL---YNPNDTVDAQAQRTLNMSMAR--FIYLSQKSSQKAHDTRAHLHPG 92
+L+H DS L YN T A+ + T++ S +R ++Y K S+ A D L P
Sbjct: 8 FTARLIHHDSPLSPFYNHTMTDTARIEATVHRSRSRLNYLYYINKLSENALDNDVSLSPT 67
Query: 93 -ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC------EQCGATT-FDPSKSL 144
++ + ++F+IG P + LDT + LIWV+C C E+ G TT F SKS
Sbjct: 68 LVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSF 127
Query: 145 TYATLPCDSSYCTN-----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
TY PC S++C + C C Y + Y + + G + S+ F F+TSD
Sbjct: 128 TYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSD---GM 184
Query: 200 LYDVG---FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEY 255
L DVG FGCS ++ +TG GL + SL+ ++G KFSYC+ N
Sbjct: 185 LVDVGFLNFGCSEAPLTGDEQSYTGNVGL---NQTPLSLISQLGIKKFSYCLVPFNNLGS 241
Query: 256 AYNMLILGEGAILEGDSTPMSVIDG-SYYVTLEGISLG--EKMLDIDPNLFKKNDTWSDA 312
M G + G TP+ + +YYV + GIS+G E D ++++ D W
Sbjct: 242 TSKMY-FGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFDGVFDVYEVRDGW--- 297
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYSGNINRDLQGFP 370
ID+G T + L A+ +L + L P DP + LC+ DL+ FP
Sbjct: 298 --IIDTGITYSSLETDAFDSLLAKFLTLKD--FPQRKDDPKERFELCFELQNANDLESFP 353
Query: 371 AMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
+ HF GADL+L+ ES F + E +FCLA+ S +SI+G QNY+V
Sbjct: 354 DVTVHF-DGADLILNVESTFVKIEDDGIFCLALLRSG------SPVSILGNFQLQNYHVG 406
Query: 430 YDLVSKQLYFQRIDC 444
YDL ++ + F +DC
Sbjct: 407 YDLEAQVISFAPVDC 421
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 131/435 (30%), Positives = 207/435 (47%), Gaps = 40/435 (9%)
Query: 31 AAGKPKRLVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRA 87
A K T + RDS YNP++T + Q+ S+ R + + +D ++
Sbjct: 27 AEAKVDGFTTDFISRDSPRSPFYNPSETKYQRLQKAFRRSILRGNHF-RAIRASPNDIQS 85
Query: 88 HLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSL 144
++ IS + +N S+G PPV L + DTGS LIW +C PC+ C FDP KS
Sbjct: 86 NV---ISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSK 142
Query: 145 TYATLPCDSSYCTNDCG-----GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
TY TL C++ +C D G G + C + Y + ++ + SE F +++
Sbjct: 143 TYKTLGCNNDFC-QDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPAS 201
Query: 200 LYDVGFGCSHNN-AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYN 258
+ FGC H+N F+++ + G S L KVG +FSYC+ L+ A +
Sbjct: 202 FPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASS 261
Query: 259 MLILGEGAILEGD---STPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT----W 309
+ G+ A++ G STP+ D YY+TLEG+SLG + + F KN +
Sbjct: 262 KINFGKSAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKG--FSKNKSSPAAA 319
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
++ + IDSGTTLT L Y + + + G + P + LCYSG ++
Sbjct: 320 EESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRG-TFSLCYSGVKKLEI--- 375
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P + HF GAD+ L + F Q + C ++ PS +L+I G ++Q N+ V
Sbjct: 376 PTITAHFI-GADVQLPPLNTFVQAQEDLVCFSMIPS-------SNLAIFGNLSQMNFLVG 427
Query: 430 YDLVSKQLYFQRIDC 444
YDL + ++ F+ DC
Sbjct: 428 YDLKNNKVSFKPTDC 442
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 121/370 (32%), Positives = 188/370 (50%), Gaps = 46/370 (12%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT 157
+ SIG P V A++DTGS LIW +C+PC +C FDP KS +Y+ + C S C
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60
Query: 158 ----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH 213
++C D C Y Y + ++G + +E F FE + + +GFGC N
Sbjct: 61 ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS----ISGIGFGCGVENEG 116
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG--------EG 265
Q +G+ GLG S S +++ +KFSYC+ ++ E + ++ I G
Sbjct: 117 DGFSQGSGLVGLGRGPLSLISQLKE--TKFSYCLTSIEDSEASSSLFIGSLASGIVNKTG 174
Query: 266 AILEGDSTP-MSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
A L+G+ T MS++ YY+ L+GI++G K L ++ + F+ + + G+ IDSG
Sbjct: 175 ASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGT-GGMIIDSG 233
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA----WHLCYSGNINRDLQGFPAMAFH 375
TT+T+L +A++ L++E S P+D + LC+ P M FH
Sbjct: 234 TTITYLEETAFKVLKEEFTSRM-----SLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFH 288
Query: 376 FAGGADLVLDAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F GADL L E+ +SS+ V CLA+G S NG +SI G + QQN+NV +DL
Sbjct: 289 FK-GADLELPGENYMVADSSTGVLCLAMGSS--NG-----MSIFGNVQQQNFNVLHDLEK 340
Query: 435 KQLYFQRIDC 444
+ + F +C
Sbjct: 341 ETVSFVPTEC 350
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 145/472 (30%), Positives = 210/472 (44%), Gaps = 73/472 (15%)
Query: 5 HAILLLSL-ITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQA 60
H LL S+ I L F S ++ A K R L+HRDS LYNP++T A
Sbjct: 6 HLGLLFSIVIALSFVSVAHISA-----AEVKNGRFSIDLIHRDSPKSPLYNPSET---PA 57
Query: 61 QRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV------FYVNFSIGQPPVPQLA 114
+R L+ RF+ S+ A + P PV + + SIG PP
Sbjct: 58 ER-LDRFFRRFMSFSE----------ASISPNTPEPPVSSNNGEYLMKISIGTPPFDVYG 106
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDEC 167
+ DTGS L+W +C PC C FDPSKS ++ + C+S C T C C
Sbjct: 107 IYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLC 166
Query: 168 WYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGL-G 226
++ Y +G +QG I +E ++ + ++ FGC HNN+ +E G+FG G
Sbjct: 167 DFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGG 226
Query: 227 PATSSTHSLVEKVGS--KFSYCIGNLNYFEYAYNMLILGEGAILEGD---STPMSVIDGS 281
S T ++ +GS KFS C+ + +I G A + G STP+ D
Sbjct: 227 RPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDP 286
Query: 282 --YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG-VFIDSGTTLTWLVPSAYQTLRKEVE 338
Y+VTL+GIS+G+K+ F + + G VFID+GT T L Y
Sbjct: 287 TYYFVTLDGISVGDKLFP-----FSSSSPMATKGNVFIDAGTPPTLLPRDFYNR------ 335
Query: 339 DLFQGLLPSYPMDPAW------HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ 392
L QG+ + PM+P LCY + L P + HF GAD+ L + F
Sbjct: 336 -LVQGVKEAIPMEPVQDPDLQPQLCYR---SATLIDGPILTAHF-DGADVQLKPLNTFIS 390
Query: 393 ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
V+C A+ P D D I G Q N+ + +DL K++ F+ +DC
Sbjct: 391 PKEGVYCFAMQPID------GDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 126/373 (33%), Positives = 181/373 (48%), Gaps = 36/373 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDS- 153
+ + +IG PP+P A+ DTGS LIW +C PC QC ++PS S T+A LPC+S
Sbjct: 92 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 151
Query: 154 -SYCTNDCGGYPD------ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
S C G C YN+ Y +G S GSE F F ++ G + + FG
Sbjct: 152 LSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ-GSETFTFGSTPAGHARVPGIAFG 210
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEG 265
CS ++ F+ +G+ GLG SLV ++G KFSYC+ + + L+LG
Sbjct: 211 CSTASSGFNASSASGLVGLG---RGRLSLVSQLGVPKFSYCLTPYQDTN-STSTLLLGPS 266
Query: 266 AILEG----DSTPM------SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
A L G STP + ++ YY+ L GISLG L I P+ F N + G+
Sbjct: 267 ASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT-GGLI 325
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNINRDLQGFPAMAF 374
IDSGTT+T L +AYQ +R V L D LC+ + P+M
Sbjct: 326 IDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLCFMLPSSTSAPPAMPSMTL 385
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
HF GAD+VL A+S + S ++CLA + + +++I+G QQN ++ YD+
Sbjct: 386 HF-NGADMVLPADSYMMSDDSGLWCLA-----MQNQTDGEVNILGNYQQQNMHILYDIGQ 439
Query: 435 KQLYFQRIDCELL 447
+ L F C L
Sbjct: 440 ETLSFAPAKCSAL 452
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 125/365 (34%), Positives = 191/365 (52%), Gaps = 35/365 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ + +IG PPV AVLDTGS LIW +C+PC QC FDP KS +++ + C SS
Sbjct: 108 YLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSL 167
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C ++ C D C Y Y + +QG + +E F F S + K ++++GFGC +N
Sbjct: 168 CSAVPSSTCS---DGCEYVYSYGDYSMTQGVLATETFTFGKS-KNKVSVHNIGFGCGEDN 223
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE-GAILEG 270
EQ +G+ GLG S S +++ +FSYC+ ++ + ++L+LG G + +
Sbjct: 224 EGDGFEQASGLVGLGRGPLSLVSQLKE--PRFSYCLTPMD--DTKESILLLGSLGKVKDA 279
Query: 271 D---STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
+TP+ + YY++LEGIS+G+ L I+ + F+ D + GV IDSGTT+T+
Sbjct: 280 KEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDD-GNGGVIIDSGTTITY 338
Query: 325 LVPSAYQTLRKEVEDLFQGLLP-SYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
+ A++ L+KE + Q LP LC+S P + FHF GG DL
Sbjct: 339 IEQKAFEALKKEF--ISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGG-DLE 395
Query: 384 LDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L AE+ +S+ V CLA+G S +SI G + QQN V +DL + + F
Sbjct: 396 LPAENYMIGDSNLGVACLAMGASS-------GMSIFGNVQQQNILVNHDLEKETISFVPT 448
Query: 443 DCELL 447
C+ L
Sbjct: 449 SCDQL 453
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 130/443 (29%), Positives = 212/443 (47%), Gaps = 44/443 (9%)
Query: 25 STTAAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQK 81
S T+ A+ L+HRDS LYNP +T + Q + + S++R + S
Sbjct: 20 SKTSLTASMNNGSFTASLIHRDSPISPLYNPKNTYFDRLQSSFHRSISRANRFTPNSVSA 79
Query: 82 AHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTF 138
A + PG +++ SIG PP+ L + DTGS LIWV+CQPC++C + F
Sbjct: 80 AKTLEYDIIPGGGE---YFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIF 136
Query: 139 DPSKSLTYATLPCDSSYCT------NDCG--GYPDECWYNIRYTNGPDSQGTIGSEQFNF 190
+P +S TY + C++ YC C G+ C Y+ Y + + G + +E+F
Sbjct: 137 NPKQSSTYRRVLCETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFII 196
Query: 191 ETSDEGKTFLYDVGFGCSHNN-AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYC-IG 248
+++ + ++ FGC ++N +F + V G + S L K+ +KFSYC +
Sbjct: 197 GSTNNS---IQELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVP 253
Query: 249 NLNYFEYAYNMLILGEGAILEGD----STPMSVIDGS--YYVTLEGISLGEKMLDIDPNL 302
L ++ ++ G+ + + G STP+ + YY+TLE IS+G + L + +
Sbjct: 254 ILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENS- 312
Query: 303 FKKNDTWSDAG-VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGN 361
+ND + G + IDSGTTLT+L Y L +E +G S P + + +C+
Sbjct: 313 --RNDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDP-NGIFSICFRDK 369
Query: 362 INRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMI 421
I +L P + HF AD+ L + F + + C + PS NG ++I G +
Sbjct: 370 IGIEL---PIITVHFT-DADVELKPINTFAKAEEDLLCFTMIPS--NG-----IAIFGNL 418
Query: 422 AQQNYNVAYDLVSKQLYFQRIDC 444
AQ N+ V YDL + F DC
Sbjct: 419 AQMNFLVGYDLDKNCVSFMPTDC 441
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 126/373 (33%), Positives = 181/373 (48%), Gaps = 36/373 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDS- 153
+ + +IG PP+P A+ DTGS LIW +C PC QC ++PS S T+A LPC+S
Sbjct: 32 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 91
Query: 154 -SYCTNDCGGYPD------ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
S C G C YN+ Y +G S GSE F F ++ G + + FG
Sbjct: 92 LSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ-GSETFTFGSTPAGHARVPGIAFG 150
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEG 265
CS ++ F+ +G+ GLG SLV ++G KFSYC+ + + L+LG
Sbjct: 151 CSTASSGFNASSASGLVGLG---RGRLSLVSQLGVPKFSYCLTPYQDTN-STSTLLLGPS 206
Query: 266 AILEG----DSTPM------SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
A L G STP + ++ YY+ L GISLG L I P+ F N + G+
Sbjct: 207 ASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT-GGLI 265
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNINRDLQGFPAMAF 374
IDSGTT+T L +AYQ +R V L D LC+ + P+M
Sbjct: 266 IDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLCFMLPSSTSAPPAMPSMTL 325
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
HF GAD+VL A+S + S ++CLA + + +++I+G QQN ++ YD+
Sbjct: 326 HF-NGADMVLPADSYMMSDDSGLWCLA-----MQNQTDGEVNILGNYQQQNMHILYDIGQ 379
Query: 435 KQLYFQRIDCELL 447
+ L F C L
Sbjct: 380 ETLSFAPAKCSAL 392
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 129/435 (29%), Positives = 207/435 (47%), Gaps = 40/435 (9%)
Query: 31 AAGKPKRLVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRA 87
A K T + RDS YNP++T + Q+ S+ R + + +D ++
Sbjct: 27 AEAKIDGFTTDFISRDSPHSPFYNPSETKYQRLQKAFRRSILRGNHF-RAMRASPNDIQS 85
Query: 88 HLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSL 144
+ IS + +N S+G PPVP L + DTGS LIW +C PC C FDP +S
Sbjct: 86 DV---ISGGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESE 142
Query: 145 TYATLPCDSSYCTNDCG--GYPDE---CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
TY TL CD+ +C D G G D+ C Y+ Y + ++G + S+ +++
Sbjct: 143 TYKTLDCDNEFC-QDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPAS 201
Query: 200 LYDVGFGCSHNN-AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYN 258
+ FGC H+N F+++ + G S L +VG +FSYC+ L+ +
Sbjct: 202 FPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSS 261
Query: 259 MLILGEGAILEGD---STPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT----W 309
+ G+ ++ G STP+ D YY+TLEG+S+G + + F +N +
Sbjct: 262 KINFGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKG--FSENKSSPAAV 319
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
+ + IDSGTTLT L Y + + + G + P + + LCYS N ++
Sbjct: 320 EEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDP-NGIFSLCYSSVNNLEI--- 375
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P + HF GAD+ L + F Q + C ++ PS +L+I G +AQ N+ V
Sbjct: 376 PTITAHFT-GADVQLPPLNTFVQVQEDLVCFSMIPS-------SNLAIFGNLAQINFLVG 427
Query: 430 YDLVSKQLYFQRIDC 444
YDL + ++ F++ DC
Sbjct: 428 YDLKNNKVSFKQTDC 442
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 134/426 (31%), Positives = 200/426 (46%), Gaps = 49/426 (11%)
Query: 42 LLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV 98
L+HRDS YNP+ T + SM+R Q+ S + + I
Sbjct: 33 LIHRDSPSSPFYNPSLTPSERIINAALRSMSRL----QRVSHFLDENKLPESLLIPDKGE 88
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ + F IG PPV +LA++DTGSSLIW++C PC C F+P KS TY CDS
Sbjct: 89 YLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQP 148
Query: 156 CT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY-DVGFGCS 208
CT DCG +C Y I Y + S G +G+E +F ++ +T + + FGC
Sbjct: 149 CTLLQPSQRDCGKL-GQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCG 207
Query: 209 HNNAH--FSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
+N ++ + G+ GLG S S L ++G KFSYC+ L Y + + L G
Sbjct: 208 VDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIGHKFSYCL--LPYDSTSTSKLKFGSE 265
Query: 266 AILEGD---STPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
AI+ + STP+ + + Y++ LE +++G+K++ +D + IDSG
Sbjct: 266 AIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTG---------QTDGNIVIDSG 316
Query: 320 TTLTWLVPSAYQTLRKEV-EDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAG 378
T LT+L + Y + E L LL P C+ NR P +AF F G
Sbjct: 317 TPLTYLENTFYNNFVASLQETLGVKLLQDLPS--PLKTCFP---NRANLAIPDIAFQFTG 371
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
+ + + S++ CLAV PS G +S+ G IAQ ++ V YDL K++
Sbjct: 372 ASVALRPKNVLIPLTDSNILCLAVVPSSGIG-----ISLFGSIAQYDFQVEYDLEGKKVS 426
Query: 439 FQRIDC 444
F DC
Sbjct: 427 FAPTDC 432
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 128/375 (34%), Positives = 186/375 (49%), Gaps = 40/375 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDS- 153
+ + +IG PP+P A+ DTGS LIW +C PC QC ++PS S T+A LPC+S
Sbjct: 90 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 149
Query: 154 -SYCTNDCGGYPD------ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
S C G C YN+ Y +G S GSE F F ++ G++ + + FG
Sbjct: 150 LSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ-GSETFTFGSTPAGQSRVPGIAFG 208
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEG 265
CS ++ F+ +G+ GLG SLV ++G KFSYC+ + + L+LG
Sbjct: 209 CSTASSGFNASSASGLVGLG---RGRLSLVSQLGVPKFSYCLTPYQDTN-STSTLLLGPS 264
Query: 266 AILEG----DSTPM------SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
A L G STP + ++ YY+ L GISLG L I P+ F N + G+
Sbjct: 265 ASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGT-GGLI 323
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYS-GNINRDLQGFPAM 372
IDSGTT+T L +AYQ +R V L LP+ A LC+ + P+M
Sbjct: 324 IDSGTTITLLGNTAYQQVRAAVVSLVT--LPTTDGSAATGLDLCFMLPSSTSAPPAMPSM 381
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
HF GAD+VL A+S + S ++CLA + + +++I+G QQN ++ YD+
Sbjct: 382 TLHF-NGADMVLPADSYMMSDDSGLWCLA-----MQNQTDGEVNILGNYQQQNMHILYDI 435
Query: 433 VSKQLYFQRIDCELL 447
+ L F C L
Sbjct: 436 GQETLSFAPAKCSAL 450
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 140/466 (30%), Positives = 224/466 (48%), Gaps = 60/466 (12%)
Query: 5 HAILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLL---YNPNDTVDAQAQ 61
H ++ LSL + + ++ ++ + + L+HRDS L Y P+ T +
Sbjct: 2 HPLVFLSL------ALYLLSTVSSREVSEGQRGFSIDLIHRDSPLSPFYKPSLT---PSD 52
Query: 62 RTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSS 121
R +N ++ L++ S ++ + I + + F IG PPV +LA+ DT S
Sbjct: 53 RIINTALRSIYQLNRASHSDLNEKKTLERVRIPNHGEYLMRFYIGTPPVERLAIADTASD 112
Query: 122 LIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTND----CGGYPDECWYNIRYT 174
LIWV+C PCE C F+P KS T+A L CDS CT+ C + C Y Y
Sbjct: 113 LIWVQCSPCETCFPQDTPLFEPHKSSTFANLSCDSQPCTSSNIYYCPLVGNLCLYTNTYG 172
Query: 175 NGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN--AHFSDEQFTGVFGLGPATSST 232
+G ++G + +E +F + TF + FGC NN H + TG+ GLG S
Sbjct: 173 DGSSTKGVLCTESIHF--GSQTVTFPKTI-FGCGSNNDFMHQISNKVTGIVGLGAGPLSL 229
Query: 233 HS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD---STPMSVID----GSYYV 284
S L +++G KFSYC+ L + + L G + G+ STP+ +ID Y++
Sbjct: 230 VSQLGDQIGHKFSYCL--LPFTSTSTIKLKFGNDTTITGNGVVSTPL-IIDPHYPSYYFL 286
Query: 285 TLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQ---TLRKEVEDLF 341
L GI++G+KML + + ++ + ID GT LT+L + Y TL +E +
Sbjct: 287 HLVGITIGQKMLQV------RTTDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGIS 340
Query: 342 QGL--LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVF 398
+ +P YP D C+ N FP + F F GA + L +++F++ + ++
Sbjct: 341 ETKDDIP-YPFD----FCFPNQANIT---FPKIVFQFT-GAKVFLSPKNLFFRFDDLNMI 391
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
CLAV P D + F S+ G +AQ ++ V YD K++ F DC
Sbjct: 392 CLAVLP-DFYAKGF---SVFGNLAQVDFQVEYDRKGKKVSFAPADC 433
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 147/455 (32%), Positives = 212/455 (46%), Gaps = 50/455 (10%)
Query: 22 IFTSTTAAPAAGKPK-RLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQ 80
+F A A+G R+ +H D + P DA +R ++ +R ++ + +
Sbjct: 15 VFLVVCATLASGAASVRVGLTRIHSDPDITAPEFVRDAL-RRDMHRQQSRSLFGRELAES 73
Query: 81 KAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC--EQC---GA 135
A + + + SIG PP+ A+ DTGS LIW +C PC +QC A
Sbjct: 74 DGTTVSARTRKDLPNGGEYLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPA 133
Query: 136 TTFDPSKSLTYATLPCDS--SYCTNDCGGYPD----ECWYNIRYTNGPDSQGTIGSEQFN 189
++P+ S T+ LPC+S S C G C YN Y G + G GSE F
Sbjct: 134 PLYNPASSTTFGVLPCNSSLSMCAGVLAGKAPPPGCACMYNQTYGTG-WTAGVQGSETFT 192
Query: 190 FETSDEGKTFLYDVGFGCSHNNAHFSD-EQFTGVFGLGPATSSTHSLVEKVGS-KFSYCI 247
F ++ + + + FGCS NA SD G+ GLG + SLV ++G+ +FSYC
Sbjct: 193 FGSAAADQARVPGIAFGCS--NASSSDWNGSAGLVGLG---RGSLSLVSQLGAGRFSYC- 246
Query: 248 GNLNYFE--YAYNMLILGEGAILEG---DSTPM------SVIDGSYYVTLEGISLGEKML 296
L F+ + + L+LG A L G STP + + YY+ L GISLG K L
Sbjct: 247 --LTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKAL 304
Query: 297 DIDPNLFK-KNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM--DPA 353
I P+ F K D G+ IDSGTT+T LV +AYQ +R V+ L LP+
Sbjct: 305 SISPDAFSLKAD--GTGGLIIDSGTTITSLVNAAYQQVRAAVQSLVT--LPAIDGSDSTG 360
Query: 354 WHLCYS-GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERF 412
LCY+ P+M HF GAD+VL A+S + S V+CLA + +
Sbjct: 361 LDLCYALPTPTSAPPAMPSMTLHF-DGADMVLPADS-YMISGSGVWCLA-----MRNQTD 413
Query: 413 KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+S G QQN ++ YD+ ++ L F C L
Sbjct: 414 GAMSTFGNYQQQNMHILYDVRNEMLSFAPAKCSTL 448
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 128/448 (28%), Positives = 205/448 (45%), Gaps = 52/448 (11%)
Query: 34 KPKRLV--TKLLHRDSLLY----NPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTR- 86
KPKR +L+HRDSLL+ N + + + + L AR L Q+ +K +
Sbjct: 65 KPKRTAWSVQLVHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKD 124
Query: 87 -AHLHPGISTVPV----------------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP 129
A + ++ V ++ IG P Q VLDTGS ++W++C+P
Sbjct: 125 PAGSYENVAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEP 184
Query: 130 CEQCGATT---FDPSKSLTYATLPCDSSYCT----NDCGGYPDECWYNIRYTNGPDSQGT 182
C +C + F+PS S++++T+ CDS+ C+ NDC G C Y + Y +G + G+
Sbjct: 185 CRECYSQADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHG--GGCLYEVSYGDGSYTVGS 242
Query: 183 IGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK 242
+E F G T + +V GC H+N + + S L + G
Sbjct: 243 YATETLTF-----GTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRA 297
Query: 243 FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV---IDGSYYVTLEGISLGEKMLDID 299
FSYC+ + + E + + E + TP+ + YY+++ IS+G +LD
Sbjct: 298 FSYCLVDRDS-ESSGTLEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSV 356
Query: 300 PN-LFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY 358
P+ F+ ++T G+ IDSGT +T L SAY LR Q LP + CY
Sbjct: 357 PSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQH-LPRADGISIFDTCY 415
Query: 359 SGNINRDLQGFPAMAFHFAGGADLVLDAESVFY-QESSSVFCLAVGPSDINGERFKDLSI 417
+ + + PA+ FHF+ GA +L A++ +S FC A P+D N LSI
Sbjct: 416 DLSALQSVS-IPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSN------LSI 468
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+G I QQ V++D + + F C+
Sbjct: 469 MGNIQQQGIRVSFDSANSLVGFAIDQCQ 496
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 137/457 (29%), Positives = 220/457 (48%), Gaps = 54/457 (11%)
Query: 19 STRIFTSTTAAPAAGKPKRLVT-----KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIY 73
+T F+S+ + A KP +L + +L H D + N T + +R + R
Sbjct: 27 NTLAFSSSLSRRALQKPNKLPSHGFRVRLKHVDHV---KNLTRFERLRRGVARGKNRLHR 83
Query: 74 LSQKSSQKAHDTRAH--LHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE 131
L+ A+ T P ++ F + +IG PP A++DTGS LIW +C+PC+
Sbjct: 84 LNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQ 143
Query: 132 QC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIG 184
QC FDP +S ++ + C S C T+ C D C Y Y + +QG +
Sbjct: 144 QCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSS--DGCEYLYTYGDSSSTQGVLA 201
Query: 185 SEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFS 244
E F F S E + + +GFGC ++N Q G+ GLG S S +++ KF+
Sbjct: 202 FETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKE--QKFA 259
Query: 245 YCIGNLNYFEYAYNMLILGEGAIL-------EGDSTPMSVIDGS----YYVTLEGISLGE 293
YC+ ++ + + L+LG A + E +TP+ + + S YY++L+GIS+G
Sbjct: 260 YCLTAID--DSKPSSLLLGSLANITPKTSKDEMKTTPL-IKNPSQPSFYYLSLQGISVGG 316
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA 353
L I + F+ +D S GV IDSGTT+T++ SA+ +L+ E + P+D +
Sbjct: 317 TQLSIPKSTFELHDDGS-GGVIIDSGTTITYVENSAFTSLKNEFIAQM-----NLPVDDS 370
Query: 354 ----WHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS-VFCLAVGPSDIN 408
LC++ + P + FHF GADL L E+ +S + + CLA+G S
Sbjct: 371 GTGGLDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLAIGSS--- 426
Query: 409 GERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ +SI G + QQN+ V +DL + L F C+
Sbjct: 427 ----RGMSIFGNLQQQNFMVVHDLQEETLSFLPTQCD 459
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 137/449 (30%), Positives = 203/449 (45%), Gaps = 71/449 (15%)
Query: 31 AAGKPKRLVTKLLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQKSS-----QKA 82
A P L+HRDS L YNP+ T +QR +N ++ L++ S+ K
Sbjct: 22 ANESPSGFTVDLIHRDSPLSPFYNPSLT---PSQRIINAALRSISRLNRVSNLLDQNNKL 78
Query: 83 HDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFD 139
+ LH G + + F IG PPV +LA DTGS LIWV+C PC C F
Sbjct: 79 PQSVLILHNG-----EYLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQ 133
Query: 140 PSKSLTYATLPCDSSYCT------NDCGGYPDECWYNIRYTNGPD-SQGTIGSEQFNFET 192
P KS T+ C S CT CG EC Y +Y + S+G + +E F++
Sbjct: 134 PLKSSTFMPTTCRSQPCTLLLPEQKGCGK-SGECIYTYKYGDQYSFSEGLLSTETLRFDS 192
Query: 193 SDEGKTFLY-DVGFGCS--HNNAHFSDEQFTGVFGLGPATSSTHSLV-EKVGSKFSYCIG 248
+T + + FGC +N F + TG+ GLG S S + +++G KFSYC+
Sbjct: 193 QGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLL 252
Query: 249 NL-----NYFEYAYNMLILGEGAILEGDSTPMSV---IDGSYYVTLEGISLGEKMLDIDP 300
L + ++ +I GEG + STPM + + Y++ LE +++ +K +
Sbjct: 253 PLGSTSTSKLKFGNESIITGEGVV----STPMIIKPWLPTYYFLNLEAVTVAQKTVPTGS 308
Query: 301 NLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE-----DLFQGLLPSYPMDPAWH 355
+D V IDSGT LT+L S Y ++ +L Q +L P
Sbjct: 309 ---------TDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLP------ 353
Query: 356 LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDL 415
C+ RD FP +AF F G + A E + CL + PS ++G +
Sbjct: 354 FCFP---YRDNFVFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMIAPSSVSG-----I 405
Query: 416 SIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
SI G +Q ++ V YDL K++ FQ DC
Sbjct: 406 SIFGSFSQIDFQVEYDLEGKKVSFQPTDC 434
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 136/427 (31%), Positives = 210/427 (49%), Gaps = 45/427 (10%)
Query: 42 LLHRDSLLYNPNDTVDAQAQRTLNMSMARF-----IYLSQKSSQKAHDT-RAHLHPGIST 95
L H DS N T + Q + +R + L+ S+ + D A +H G
Sbjct: 51 LRHVDS---GKNLTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQLEAPIHAGNGE 107
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCD 152
+ + +IG PPV AVLDTGS LIW +C+PC +C FDP KS +++ + C
Sbjct: 108 ---YLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCG 164
Query: 153 SSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
SS C ++ C D C Y Y + +QG + +E F F S + K ++++GFGC
Sbjct: 165 SSLCSALPSSTCS---DGCEYVYSYGDYSMTQGVLATETFTFGKS-KNKVSVHNIGFGCG 220
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE-GAI 267
+N EQ +G+ GLG S S +++ +FSYC+ ++ + ++L+LG G +
Sbjct: 221 EDNEGDGFEQASGLVGLGRGPLSLVSQLKE--QRFSYCLTPID--DTKESVLLLGSLGKV 276
Query: 268 LEGD---STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
+ +TP+ + YY++LE IS+G+ L I+ + F+ D + GV IDSGTT
Sbjct: 277 KDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDD-GNGGVIIDSGTT 335
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGAD 381
+T++ AY+ L+KE + L LC+S P + FHF GG D
Sbjct: 336 ITYVQQKAYEALKKEFISQTK-LALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGG-D 393
Query: 382 LVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
L L AE+ +S+ V CLA+G S +SI G + QQN V +DL + + F
Sbjct: 394 LELPAENYMIGDSNLGVACLAMGASS-------GMSIFGNVQQQNILVNHDLEKETISFV 446
Query: 441 RIDCELL 447
C+ L
Sbjct: 447 PTSCDQL 453
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 137/469 (29%), Positives = 213/469 (45%), Gaps = 56/469 (11%)
Query: 3 SSHAILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQ 59
++ +L SL+ + T FTST++A K L +L+HRDS LYNP TV +
Sbjct: 2 ATKTLLYCSLLAI----TIFFTSTSSA----HRKNLSVELIHRDSPHSPLYNPQHTVSDR 53
Query: 60 AQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPG-ISTVPVFYVNFSIGQPPVPQLAVLDT 118
LN + +L S + T+ L G IS ++++ SIG PP LA+ DT
Sbjct: 54 ----LNAA-----FLRSISRSRRFSTKTDLQSGLISNGGEYFMSISIGTPPSKFLAIADT 104
Query: 119 GSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT------NDCGGYPDECWY 169
GS L WV+C+PC+QC FD KS TY T CDS C C + C Y
Sbjct: 105 GSDLTWVQCKPCQQCYKQNTPLFDKKKSSTYKTESCDSITCNALSEHEEGCDESRNACKY 164
Query: 170 NIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN-AHFSDEQFTGVFGLGPA 228
Y + ++G + +E + ++S FGC +NN F + + G
Sbjct: 165 RYSYGDESFTKGEVATETISIDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGP 224
Query: 229 TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVI---------D 279
S L +G KFSYC+ + + +++ LG ++ S +++ +
Sbjct: 225 LSLVSQLGSSIGKKFSYCLSHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPE 284
Query: 280 GSYYVTLEGISLGEKMLDIDP----NLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRK 335
Y++TLE I++G+ L +L +K+ + + IDSGTTLT L Y
Sbjct: 285 TYYFLTLEAITVGKTKLPYTGGGGYSLNRKSKKTGN--IIIDSGTTLTLLDSGFYDDFGA 342
Query: 336 EVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS 395
VE+ G DP L + G P + HF GAD+ L + F + S
Sbjct: 343 VVEESVTG--AKRVSDPQGILTHCFKSGDKEIGLPTITMHFT-GADVKLSPINSFVKLSE 399
Query: 396 SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ CL++ P+ +++I G + Q ++ V YDL +K + FQR+DC
Sbjct: 400 DIVCLSMIPT-------TEVAIYGNMVQMDFLVGYDLETKTVSFQRMDC 441
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 132/429 (30%), Positives = 190/429 (44%), Gaps = 50/429 (11%)
Query: 42 LLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSS-QKAHDTRAHLHPG---ISTVP 97
L+HRDS ++ + +QR M I S +S+ Q ++D + P S
Sbjct: 30 LIHRDSPKSPFYNSAETSSQR-----MRNAIRRSARSTLQFSNDDASPNSPQSFITSNRG 84
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSS 154
+ +N SIG PPVP LA+ DTGS LIW +C PCE C T FDP +S TY + C SS
Sbjct: 85 EYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSS 144
Query: 155 YCT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH- 209
C C + C Y I Y + ++G + + +S L ++ GC H
Sbjct: 145 QCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHE 204
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI-------GNLNYFEYAYNMLIL 262
N F + G +TS L + + KFSYC+ G + + N ++
Sbjct: 205 NTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNGIVS 264
Query: 263 GEGAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
G+G + ST M D + Y++ LE IS+G K + +F + + IDSGT
Sbjct: 265 GDGVV----STSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTG----EGNIVIDSGT 316
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF--PAMAFHFAG 378
TLT L + Y L V + P D LCY RD F P + HF G
Sbjct: 317 TLTLLPSNFYYELESVVASTIKAERVQDP-DGILSLCY-----RDSSSFKVPDITVHFKG 370
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
G D+ L + F S V C A ++ L+I G +AQ N+ V YD VS +
Sbjct: 371 G-DVKLGNLNTFVAVSEDVSCFAFAANE-------QLTIFGNLAQMNFLVGYDTVSGTVS 422
Query: 439 FQRIDCELL 447
F++ DC +
Sbjct: 423 FKKTDCSQM 431
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 181/359 (50%), Gaps = 38/359 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT----TFDPSKSLTYATLPCDSS 154
F G P Q +DTGSSL W +C PC C A + P+ S+TY C+ S
Sbjct: 58 FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYRPAASITYRDAMCEDS 117
Query: 155 YCTNDCGGYPDE----CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS-- 208
+ ++ D C Y Y + + +GT+ E +T D G ++ V FGC+
Sbjct: 118 HPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFGCNTL 177
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
+ ++F+ TG+ GLG +S++ + GSKFS+C+G ++ + ++N LILG+GA +
Sbjct: 178 SDGSYFTG---TGILGLGVG---KYSIIGEFGSKFSFCLGEISEPKASHN-LILGDGANV 230
Query: 269 EGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+G T +++ +G LE I +GE++ DP VF+D+G+TL+ L +
Sbjct: 231 QGHPTVINITEGHTIFQLESIIVGEEITLDDP-----------VQVFVDTGSTLSHLSTN 279
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAES 388
Y + D F L+ S P+ LCY + L+ + F F GA+L ++ +
Sbjct: 280 LYY----KFVDAFDDLIGSRPLSYEPTLCYKADTIERLEKMD-VGFKFDVGAELSVNIHN 334
Query: 389 VFYQES-SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
+F Q+ + CLA+ N E F + IIG+IA Q YNV YDL +K Y + DC++
Sbjct: 335 IFIQQGPPEIRCLAI---QNNKESFSHV-IIGVIAMQGYNVGYDLSAKTAYINKQDCDM 389
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/370 (32%), Positives = 189/370 (51%), Gaps = 44/370 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
F + +IG PP A++DTGS LIW +C+PC+QC FDP +S ++ + C S
Sbjct: 366 FLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSEL 425
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C T+ C D C Y Y + +QG + E F F S E + + +GFGC ++N
Sbjct: 426 CGALPTSTCSS--DGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDN 483
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL--- 268
Q G+ GLG S S +++ KF+YC+ ++ + + L+LG A +
Sbjct: 484 NGDGFSQGAGLVGLGRGPLSLVSQLKE--QKFAYCLTAID--DSKPSSLLLGSLANITPK 539
Query: 269 ----EGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
E +TP+ + + S YY++L+GIS+G L I + F+ +D S GV IDSGT
Sbjct: 540 TSKDEMKTTPL-IKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGS-GGVIIDSGT 597
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA----WHLCYSGNINRDLQGFPAMAFHF 376
T+T++ SA+ +L+ E F + + P+D + LC++ + P + FHF
Sbjct: 598 TITYVENSAFTSLKNE----FIAQM-NLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHF 652
Query: 377 AGGADLVLDAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
GADL L E+ +S + + CLA+G S + +SI G + QQN+ V +DL +
Sbjct: 653 K-GADLELPGENYMIGDSKAGLLCLAIGSS-------RGMSIFGNLQQQNFMVVHDLQEE 704
Query: 436 QLYFQRIDCE 445
L F C+
Sbjct: 705 TLSFLPTQCD 714
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 179/376 (47%), Gaps = 50/376 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ V+ +IG PP+ A++DTGS LIW +C PC C A FD +S TY LPC SS
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDVKRSATYRALPCRSSR 148
Query: 156 CT--NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNA 212
C + + C Y Y + + G + +E F F + K ++ FGC S N
Sbjct: 149 CAALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISFGCGSLNAG 208
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEG- 270
++ FG GP SLV ++G S+FSYC+ +Y + L G A L
Sbjct: 209 ELANSSGMVGFGRGP-----LSLVSQLGPSRFSYCL--TSYLSPTPSRLYFGVFANLNST 261
Query: 271 --------DSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
STP + + Y+++++GISLG K L IDP +F ND + GV IDSG
Sbjct: 262 NTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGT-GGVIIDSG 320
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM------DPAWHLCYSGNINRDLQ-GFPAM 372
T++TWL AY+ +R+ GL + P+ D C+ ++ P
Sbjct: 321 TSITWLQQDAYEAVRR-------GLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDF 373
Query: 373 AFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
FHF GA++ L E+ + ++ CLA+ P+ + +IIG QQN ++ YD
Sbjct: 374 VFHF-DGANMTLPPENYMLIASTTGYLCLAMAPTSVG-------TIIGNYQQQNLHLLYD 425
Query: 432 LVSKQLYFQRIDCELL 447
+ + L F C+++
Sbjct: 426 IANSFLSFVPAPCDII 441
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 138/415 (33%), Positives = 199/415 (47%), Gaps = 45/415 (10%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYV-NFSIGQPPVPQLA 114
VD++ T M R + S+ + +D + P + +V V Y+ +IG PPVP +A
Sbjct: 25 VDSKIGFTKTELMRRAAHRSRLQALSGYDANS---PRLHSVQVEYLMELAIGTPPVPFVA 81
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC-----TNDCGGYPDE 166
+ DTGS L W +CQPC+ C +DPS S T++ +PC S+ C + +C
Sbjct: 82 LADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATCLPTWRSRNCSNPSSP 141
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF-LYDVGFGCSHNNAHFSDEQFTGVFGL 225
C Y Y++G S G +G+E +S G+T + V FGC +N S TG GL
Sbjct: 142 CRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGTDNGGDSLNS-TGTVGL 200
Query: 226 GPATSSTHSLVEKVG-SKFSYCIGNL-------NYFEYAYNMLILGEGAILEGDSTPM-- 275
G T SL+ ++G KFSYC+ + +F L G G + STP+
Sbjct: 201 G---RGTLSLLAQLGVGKFSYCLTDFFNSTMDSPFFLGTLAELAPGPGTV---QSTPLLQ 254
Query: 276 SVIDGS-YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLR 334
S ++ S Y+V L+GISLG+ L I PN + G+ +DSGTT T L S +
Sbjct: 255 SPLNPSRYFVNLQGISLGDVRLPI-PNGTFDLRADGNGGMMVDSGTTFTILAKSGF---- 309
Query: 335 KEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-PAMAFHFAGGADLVLDAESVF-YQ 392
+EV D LL P++ A L + D + F P + HFAGGAD+ L ++ Y
Sbjct: 310 REVVDRVAQLLGQPPVN-ASSLDSPCFPSPDGEPFMPDLVLHFAGGADMRLHRDNYMSYN 368
Query: 393 ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
E S FCL + S R +G QQN + +D+ QL F DC L
Sbjct: 369 EDDSSFCLNIVGSPSTWSR------LGNFQQQNIQMLFDMTVGQLSFLPTDCSKL 417
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 133/412 (32%), Positives = 201/412 (48%), Gaps = 36/412 (8%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYV-NFSIGQPPVPQLA 114
VD++ T M R + S+ + +D + P + +V V Y+ +IG PPVP +A
Sbjct: 36 VDSKIGLTKTELMRRAAHRSRLRALSGYDANS---PRLHSVQVEYLMELAIGTPPVPFVA 92
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC-----TNDCGGYPDE 166
+ DTGS L W +CQPC+ C +DPS S T++ +PC S+ C + +C
Sbjct: 93 LADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATCLPVLRSRNCSTPSSL 152
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF-LYDVGFGCSHNNAHFSDEQFTGVFGL 225
C Y Y++G S G +G+E +S G+ + DV FGC +N S TG GL
Sbjct: 153 CRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGCGTDNGGDSLNS-TGTVGL 211
Query: 226 GPATSSTHSLVEKVG-SKFSYCIGNL--NYFEYAYNMLILGEGAILEG--DSTPM--SVI 278
G T SL+ ++G KFSYC+ + + + + + L E A G STP+ S +
Sbjct: 212 G---RGTLSLLAQLGVGKFSYCLTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPL 268
Query: 279 DGS-YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEV 337
+ S Y V+L+GI+LG+ L I PN S G+ +DSGTT + L S ++ + V
Sbjct: 269 NPSRYVVSLQGITLGDVRLPI-PNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHV 327
Query: 338 EDLF-QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVF-YQESS 395
+ Q + + +D +G R L P + HFAGGAD+ L ++ Y +
Sbjct: 328 AQVLGQPPVNASSLDSPCFPAPAG--ERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQED 385
Query: 396 SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
S FCL +I G S++G QQN + +D+ QL F DC L
Sbjct: 386 SSFCL-----NIVGTT-STWSMLGNFQQQNIQMLFDMTVGQLSFLPTDCSKL 431
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 131/442 (29%), Positives = 199/442 (45%), Gaps = 44/442 (9%)
Query: 28 AAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHD 84
A+ ++ + L +L+HRDS LYNP+ TV + LN + R I S++ +
Sbjct: 19 ASNSSANRENLTVELIHRDSPHSPLYNPHHTVSDR----LNAAFLRSISRSRRFT----- 69
Query: 85 TRAHLHPG-ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDP 140
T+ L G IS ++++ SIG PP A+ DTGS L WV+C+PC+QC + FD
Sbjct: 70 TKTDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDK 129
Query: 141 SKSLTYATLPCDSSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSD 194
KS TY T CDS C C D C Y Y + ++G + +E + ++S
Sbjct: 130 KKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSS 189
Query: 195 EGKTFLYDVGFGCSHNN-AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYF 253
FGC +NN F + + G S L +G KFSYC+ +
Sbjct: 190 GSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAAT 249
Query: 254 EYAYNMLILGEGAILEGDS-------TPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFK 304
+++ LG +I S TP+ D Y++TLE +++G+ L +
Sbjct: 250 TNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYG 309
Query: 305 KNDTWSD--AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNI 362
N S + IDSGTTLT L Y VE+ G DP L +
Sbjct: 310 LNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTG--AKRVSDPQGLLTHCFKS 367
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIA 422
G PA+ HF AD+ L + F + + CL++ P+ +++I G +
Sbjct: 368 GDKEIGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMIPT-------TEVAIYGNMV 419
Query: 423 QQNYNVAYDLVSKQLYFQRIDC 444
Q ++ V YDL +K + FQR+DC
Sbjct: 420 QMDFLVGYDLETKTVSFQRMDC 441
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 131/428 (30%), Positives = 198/428 (46%), Gaps = 47/428 (10%)
Query: 42 LLHRDSLLYNPNDTVD-AQAQRTLNMSMARFIYLS-QKSSQKAHDTRAHLHP--GISTVP 97
LLH P VD Q N++ I + ++ ++ A L GI T P
Sbjct: 30 LLHHGQKRPQPGLRVDLEQVDSGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIET-P 88
Query: 98 VF------YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYAT 148
V+ +N +IG P A++DTGS LIW +C+PC QC + F+P S +++T
Sbjct: 89 VYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFST 148
Query: 149 LPCDSSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
LPC+S YC + C +EC Y Y +G +QG + +E F FETS + ++
Sbjct: 149 LPCESQYCQDLPSETCNN--NECQYTYGYGDGSTTQGYMATETFTFETSS-----VPNIA 201
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILG 263
FGC +N F G+ G+G SL ++G +FSYC+ +Y + + L LG
Sbjct: 202 FGCGEDNQGFGQGNGAGLIGMGWG---PLSLPSQLGVGQFSYCM--TSYGSSSPSTLALG 256
Query: 264 EGAILEGDSTPMSVIDGS------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
A + +P + + S YY+TL+GI++G L I + F+ D + G+ ID
Sbjct: 257 SAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGT-GGMIID 315
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY-PMDPAWHLCYSGNINRDLQGFPAMAFHF 376
SGTTLT+L AY + + D LP+ C+ + P ++ F
Sbjct: 316 SGTTLTYLPQDAYNAVAQAFTDQIN--LPTVDESSSGLSTCFQQPSDGSTVQVPEISMQF 373
Query: 377 AGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQ 436
GG L L +++ + V CLA+G S G +SI G I QQ V YDL +
Sbjct: 374 DGGV-LNLGEQNILISPAEGVICLAMGSSSQLG-----ISIFGNIQQQETQVLYDLQNLA 427
Query: 437 LYFQRIDC 444
+ F C
Sbjct: 428 VSFVPTQC 435
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 151/477 (31%), Positives = 209/477 (43%), Gaps = 71/477 (14%)
Query: 17 FTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQ 76
F+ I T A A R+ +H D P T + L M R ++
Sbjct: 4 FSVLLILACTILASDAAAAVRVGLTRIHAD-----PEVTASEFVRGALRRDMHRHARFAR 58
Query: 77 KSSQKAHDTRAHLHPGISTVP------VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC 130
+ + A L G T + + SIG PP+ A+ DTGS LIW +C PC
Sbjct: 59 EQLAPSSAAAAGLTVGAPTQKDLRNGGEYIMTLSIGTPPLSYRAIADTGSDLIWTQCAPC 118
Query: 131 --------EQC---GATTFDPSKSLTYATLPCDS--SYCTNDCGGYPD---ECWYNIRYT 174
QC ++PS S T+ LPC+S S C G P C YN Y
Sbjct: 119 GDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQTYG 178
Query: 175 NGPDSQGTIGSEQFNFETSDEGKTF-LYDVGFGCSHNNAHFSD-EQFTGVFGLGPATSST 232
G + G E F F +S + ++ FGCS NA +D G+ GLG +
Sbjct: 179 TG-WTAGVQSVETFTFGSSSTPPAVRVPNIAFGCS--NASSNDWNGSAGLVGLG---RGS 232
Query: 233 HSLVEKVGS-KFSYCIGNLNYFEYA--YNMLILG--EGAILEG----DSTPM------SV 277
SLV ++G+ FSYC L F+ A + L+LG A L+G STP +
Sbjct: 233 MSLVSQLGAGAFSYC---LTPFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAP 289
Query: 278 IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEV 337
+ YY+ L GIS+GE L I P+ F + G+ IDSGTT+T LV SAYQ +R V
Sbjct: 290 MSTYYYLNLTGISVGETALAIPPDAFSLRADGT-GGLIIDSGTTITTLVDSAYQQVRAAV 348
Query: 338 EDLFQGLLP-------SYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVF 390
L LP S +D LC++ + P+M HF GGAD+VL E+ +
Sbjct: 349 RSLLVTRLPLAHGPDHSTGLD----LCFALKASTPPPAMPSMTLHFEGGADMVLPVEN-Y 403
Query: 391 YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
S V+CLA + + +S++G QQN +V YD+ + L F C L
Sbjct: 404 MILGSGVWCLA-----MRNQTVGAMSMVGNYQQQNIHVLYDVRKETLSFAPAVCSSL 455
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 131/428 (30%), Positives = 195/428 (45%), Gaps = 48/428 (11%)
Query: 42 LLHRDSLLYNPN-DTVDAQAQRTLNMSMARFIYLS-QKSSQKAHDTRAHLHP--GISTVP 97
LLH P V Q +N++ I + ++ ++ A L GI T P
Sbjct: 30 LLHHGQKRPQPGLRVVLEQVDSGMNLTKYELIKRAIKRGERRMRSINAMLQSSSGIET-P 88
Query: 98 VF------YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYAT 148
V+ +N +IG P A++DTGS LIW +C+PC QC + F+P S +++T
Sbjct: 89 VYAGSGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFST 148
Query: 149 LPCDSSYCTNDCGGYPDE-----CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
LPC+S YC + P E C Y Y +G +QG + +E F FETS + ++
Sbjct: 149 LPCESQYCQD----LPSESCYNDCQYTYGYGDGSSTQGYMATETFTFETSS-----VPNI 199
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLIL 262
FGC +N F G+ G+G SL ++G +FSYC+ + + + L L
Sbjct: 200 AFGCGEDNQGFGQGNGAGLIGMGWG---PLSLPSQLGVGQFSYCM--TSSGSSSPSTLAL 254
Query: 263 GEGAILEGDSTPMSVIDGS------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
G A + +P + + S YY+TL+GI++G L I + F+ D + G+ I
Sbjct: 255 GSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGT-GGMII 313
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHF 376
DSGTTLT+L AY + + D L P C+ + P ++ F
Sbjct: 314 DSGTTLTYLPQDAYNAVAQAFTDQIN-LSPVDESSSGLSTCFQLPSDGSTVQVPEISMQF 372
Query: 377 AGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQ 436
GG L L E+V + V CLA+G S G +SI G I QQ V YDL +
Sbjct: 373 DGGV-LNLGEENVLISPAEGVICLAMGSSSQQG-----ISIFGNIQQQETQVLYDLQNLA 426
Query: 437 LYFQRIDC 444
+ F C
Sbjct: 427 VSFVPTQC 434
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 126/424 (29%), Positives = 190/424 (44%), Gaps = 38/424 (8%)
Query: 42 LLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV 98
L+HRDS ++P+ T + + S +R Q S+ + ++ L P
Sbjct: 36 LIHRDSPHSPFFDPSKTRTERLTDAFHRSASRVGRFRQ-SAMTSDGIQSRLVPSAGE--- 91
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ +N SIG PPVP +A++DTGS L W +C+PC C FDP S TY C +S+
Sbjct: 92 YIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCGTSF 151
Query: 156 CT---ND--CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
C ND C +C + Y +G + G + E ++ FGC H
Sbjct: 152 CLALGNDRSCRN-GKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFGCVHR 210
Query: 211 NAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
+ DE +G+ GLG A S S L + +FSYC+ + + + G I+
Sbjct: 211 SGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRSGIVS 270
Query: 270 GD---STPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
G STP+ V+ G Y +TLEG S+G+K L F K + + +DSGTT
Sbjct: 271 GAGTVSTPL-VMKGPDTYYYLITLEGFSVGKKRLSYKG--FSKKAEVEEGNIIVDSGTTY 327
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
T+L Y L + V +G P + LCY N D P + HF A++
Sbjct: 328 TYLPLEFYVKLEESVAHSIKGKRVRDP-NGISSLCY--NTTVDQIDAPIITAHFK-DANV 383
Query: 383 VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L + F + + C V P+ D+ I+G +AQ N+ V +DL K++ F+
Sbjct: 384 ELQPWNTFLRMQEDLVCFTVLPTS-------DIGILGNLAQVNFLVGFDLRKKRVSFKAA 436
Query: 443 DCEL 446
DC L
Sbjct: 437 DCTL 440
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 121/377 (32%), Positives = 181/377 (48%), Gaps = 51/377 (13%)
Query: 94 STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKC-QPCEQC---GATTFDPSKSLTYATL 149
++ + V+ +IG PP+P AVLDTGS LIW +C PC +C A + P++S TYA +
Sbjct: 87 ASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANV 146
Query: 150 PCDSSYCT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
C S C + C C Y Y +G + G + +E F + T + V
Sbjct: 147 SCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGS----DTAVRGV 202
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLIL 262
FGC N +D +G+ G+G SLV ++G ++FSYC N A + L L
Sbjct: 203 AFGCGTENLGSTDNS-SGLVGMG---RGPLSLVSQLGVTRFSYCFTPFN--ATAASPLFL 256
Query: 263 GEGAILE--GDSTPM--SVIDGS------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
G A L +TP S G+ YY++LEGI++G+ +L IDP +F+ D
Sbjct: 257 GSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTP-MGDG 315
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH----LCYSGNINRDLQG 368
GV IDSGTT T L SA+ L + + + P+ H LC++ ++
Sbjct: 316 GVIIDSGTTFTALEESAFVALARALASRVR-----LPLASGAHLGLSLCFAAASPEAVE- 369
Query: 369 FPAMAFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P + HF GAD+ L ES V S+ V CL + + + +S++G + QQN +
Sbjct: 370 VPRLVLHF-DGADMELRRESYVVEDRSAGVACLGMVSA-------RGMSVLGSMQQQNTH 421
Query: 428 VAYDLVSKQLYFQRIDC 444
+ YDL L F+ C
Sbjct: 422 ILYDLERGILSFEPAKC 438
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 143/422 (33%), Positives = 205/422 (48%), Gaps = 45/422 (10%)
Query: 51 NPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP-VFYVNFSIGQPP 109
+P+ T + L+ M R +S A + P +TVP F + +IG PP
Sbjct: 38 DPSVTASQFVRAALHRDMHRHNARKLAASSSDGTVSAPVSP--TTVPGEFLMTLAIGTPP 95
Query: 110 VPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDSS--YCTNDCGGY 163
+P LA+ DTGS LIW +C PC QC ++PS S T++ LPC+SS C C
Sbjct: 96 LPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSSLGLCAPACA-- 153
Query: 164 PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEG-KTFLYDVGFGCSHNNAHFSDEQFTGV 222
C YN+ Y +G + G+E F F +S + + + FGCS+ ++ F+ +G+
Sbjct: 154 ---CMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSNASSGFNASSASGL 209
Query: 223 FGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAILEG----DSTPMSV 277
GLG + SLV ++G+ KFSYC+ + + L+LG A L STP
Sbjct: 210 VGLG---RGSLSLVSQLGAPKFSYCLTPYQDTN-STSTLLLGPSASLNDTGVVSSTPFVA 265
Query: 278 IDGS--YYVTLEGISLGEKMLDIDPNLFK-KNDTWSDAGVFIDSGTTLTWLVPSAYQTLR 334
S YY+ L GISLG L I PN F K D G+ IDSGTT+T L +AYQ +R
Sbjct: 266 SPSSIYYYLNLTGISLGTTALPIPPNAFSLKAD--GTGGLIIDSGTTITMLGNTAYQQVR 323
Query: 335 KEVEDLFQGLLPSYPMDPA--WHLCYS-GNINRDLQGFPAMAFHFAGGADLVLDAESVFY 391
V L LP+ A LC+ + P+M HF GAD+VL A++
Sbjct: 324 AAVLSLVT--LPTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHF-DGADMVLPADNYMM 380
Query: 392 -----QESSSVFCLAV-GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
SS++CLA+ +D +G +SI+G QQN ++ YD+ + L F C
Sbjct: 381 SLSDPDSDSSLWCLAMQNQTDTDGVV---VSILGNYQQQNMHILYDVGKETLSFAPAKCS 437
Query: 446 LL 447
L
Sbjct: 438 TL 439
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 153/461 (33%), Positives = 209/461 (45%), Gaps = 57/461 (12%)
Query: 22 IFTSTTAAPAAGKPK-RLVTKLLHRDSLLYNPNDTVDA-------QAQRTLNMSMARFIY 73
+F A A+G R+ +H D P DA Q R+ R +
Sbjct: 31 VFLVVCATLASGAASVRVGLTRIHSDPDTTAPQFVRDALRRDMHRQRSRSFGRDRDRELA 90
Query: 74 LSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQ 132
S + + TR L G + + +IG PP+P AV DTGS LIW +C PC Q
Sbjct: 91 ESDGRTTVSARTRKDLPNGGE----YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQ 146
Query: 133 C---GATTFDPSKSLTYATLPCDS--SYCTNDCGGYPD----ECWYNIRYTNGPDSQGTI 183
C A ++P+ S T++ LPC+S S C G C YN Y G + G
Sbjct: 147 CFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYNQTYGTG-WTAGVQ 205
Query: 184 GSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD-EQFTGVFGLGPATSSTHSLVEKVGS- 241
GSE F F +S + + V FGCS NA SD G+ GLG + SLV ++G+
Sbjct: 206 GSETFTFGSSAADQARVPGVAFGCS--NASSSDWNGSAGLVGLG---RGSLSLVSQLGAG 260
Query: 242 KFSYCIGNLNYFE--YAYNMLILGEGAILEG---DSTPM------SVIDGSYYVTLEGIS 290
+FSYC L F+ + + L+LG A L G STP + + YY+ L GIS
Sbjct: 261 RFSYC---LTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGIS 317
Query: 291 LGEKMLDIDPNLFK-KNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP 349
LG K L I P F K D G+ IDSGTT+T L +AYQ +R V+ L L
Sbjct: 318 LGAKALPISPGAFSLKPD--GTGGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDG 375
Query: 350 MDP-AWHLCYS--GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSD 406
D LC++ + P+M HF GAD+VL A+S + S V+CLA
Sbjct: 376 SDSTGLDLCFALPAPTSAPPAVLPSMTLHF-DGADMVLPADS-YMISGSGVWCLA----- 428
Query: 407 INGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+ + +S G QQN ++ YD+ + L F C L
Sbjct: 429 MRNQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCSTL 469
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 125/425 (29%), Positives = 194/425 (45%), Gaps = 42/425 (9%)
Query: 41 KLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP 97
+L+HRDS Y P ++ S+ R + ++ S ++ + G
Sbjct: 31 ELIHRDSSKSPFYKPTQNKYQHVVDAVHRSINRVNHSNKNSLASTPESTVISYEGD---- 86
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSS 154
+ +++S+G PP+ ++DTGS ++W++C+PCEQC T F+PSKS +Y + C S
Sbjct: 87 -YIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTTPKFNPSKSSSYKNISCSSK 145
Query: 155 YCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
C + C + C Y+I Y N SQG + E E++ GC N
Sbjct: 146 LCQSVRDTSCNDKKN-CEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVIGCGTN 204
Query: 211 N-AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGN----LNYFEYAYNMLILGEG 265
N F V G S L +G KFSYC+ L + L G+
Sbjct: 205 NIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSSKLNFGDV 264
Query: 266 AILEGD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
AI+ G STP+ D S YY+T+E S+G+K ++ + + + IDS T
Sbjct: 265 AIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVE----FAGSSKGVEEGNIIIDSST 320
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ-GFPAMAFHFAGG 379
+T++ Y L + DL P + + LCY N++ D + FP M HF G
Sbjct: 321 IVTFVPSDVYTKLNSAIVDLVTLERVDDP-NQQFSLCY--NVSSDEEYDFPYMTAHFK-G 376
Query: 380 ADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
AD++L A + F + + V C A PS NG +I G +QQ++ V YDL K + F
Sbjct: 377 ADILLYATNTFVEVARDVLCFAFAPS--NGG-----AIFGSFSQQDFMVGYDLQQKTVSF 429
Query: 440 QRIDC 444
+ +DC
Sbjct: 430 KSVDC 434
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 115/377 (30%), Positives = 176/377 (46%), Gaps = 52/377 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG---ATTFDPSKSLTYATLPCDSSY 155
+ V+ +IG PP+ A++DTGS LIW +C PC C FD KS TY LPC SS
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSR 148
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHN 210
C + C + C Y Y + + G + +E F F ++ K ++ FGC S N
Sbjct: 149 CASLSSPSC--FKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLN 206
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILE 269
++ FG GP SLV ++G S+FSYC+ +Y + L G A L
Sbjct: 207 AGDLANSSGMVGFGRGPL-----SLVSQLGPSRFSYCL--TSYLSATPSRLYFGVYANLS 259
Query: 270 G---------DSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
STP + + Y+++L+ ISLG K+L IDP +F ND + GV ID
Sbjct: 260 STNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGT-GGVIID 318
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM------DPAWHLCYSGNINRDLQ-GFP 370
SGT++TWL AY+ +R+ GL+ + P+ D C+ ++ P
Sbjct: 319 SGTSITWLQQDAYEAVRR-------GLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVP 371
Query: 371 AMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+ FHF +L + ++ CL + P+ + +IIG QQN ++ Y
Sbjct: 372 DLVFHFDSANMTLLPENYMLIASTTGYLCLVMAPTGVG-------TIIGNYQQQNLHLLY 424
Query: 431 DLVSKQLYFQRIDCELL 447
D+ + L F C+++
Sbjct: 425 DIGNSFLSFVPAPCDII 441
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 122/374 (32%), Positives = 177/374 (47%), Gaps = 51/374 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
F ++ SIG P V A++DTGS L+W +C+PC +C FDPS S TYA LPC S+
Sbjct: 102 FLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPCSSTL 161
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C++ C +C Y Y + +QG + +E F KT L DV FGC N
Sbjct: 162 CSDLPSSKC--TSAKCGYTYTYGDSSSTQGVLAAETFTLA-----KTKLPDVAFGCGDTN 214
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
Q G+ GLG SLV ++G +KFSYC+ +L+ + + + L+LG A +
Sbjct: 215 EGDGFTQGAGLVGLG---RGPLSLVSQLGLNKFSYCLTSLD--DTSKSPLLLGSLATISE 269
Query: 271 DSTPMSVIDGS-----------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
+ S + + YYV L+G+++G + + + F D + GV +DSG
Sbjct: 270 SAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGT-GGVIVDSG 328
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPS-----YPMDPAWHLCYSGNINRDLQGFPAMAF 374
T++T+L Y+ L+K + LP+ +D + SG D P + F
Sbjct: 329 TSITYLELQGYRALKKAFAAQMK--LPAADGSGIGLDTCFEAPASG---VDQVEVPKLVF 383
Query: 375 HFAGGADLVLDAESVFYQES-SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
H GADL L AE+ +S S CL V S + LSIIG QQN YD+
Sbjct: 384 HL-DGADLDLPAENYMVLDSGSGALCLTVMGS-------RGLSIIGNFQQQNIQFVYDVG 435
Query: 434 SKQLYFQRIDCELL 447
L F + C L
Sbjct: 436 ENTLSFAPVQCAKL 449
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 130/443 (29%), Positives = 197/443 (44%), Gaps = 41/443 (9%)
Query: 27 TAAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAH 83
P + + +L+H DS YN +T + + S+ R YL+ S +
Sbjct: 16 VVTPIESQNRGFSVELIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHN 75
Query: 84 DTRAHLHPGISTVPV---FYV-NFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-- 137
D P + +P +YV ++SIG PP V+DTGS IW +C+PC+ C T
Sbjct: 76 DL-----PKPTIIPYAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSP 130
Query: 138 -FDPSKSLTYATLPCDSSYC-----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE 191
F+PSKS TY + C S C T +C Y I Y + SQG I +
Sbjct: 131 IFNPSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLN 190
Query: 192 TSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNL 250
++D + GC H N+ ++ +G+ G G S S L +G KFSYC+ +L
Sbjct: 191 SNDGSPISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASL 250
Query: 251 NYFEYAYNMLILGEGAILEGD---STPM--SVIDGSYYVTLEGISLGEKMLDI-DPNLFK 304
+ L G+ A++ G STP+ S G+Y+ LE S+G+ ++ + D +L
Sbjct: 251 FSKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIP 310
Query: 305 KNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINR 364
N + IDSG+T+T L Y L V + + P LCY + +
Sbjct: 311 DN----EGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQ-QLSLCYKTTLKK 365
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
P + HF GAD+ L+A + F Q + V C A N F + + G IAQQ
Sbjct: 366 --YEVPIITAHFR-GADVKLNAFNTFIQMNHEVMCFA-----FNSSAFPWV-VYGNIAQQ 416
Query: 425 NYNVAYDLVSKQLYFQRIDCELL 447
N+ V YD + + F+ +C L
Sbjct: 417 NFLVGYDTLKNIISFKPTNCTKL 439
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 127/370 (34%), Positives = 187/370 (50%), Gaps = 38/370 (10%)
Query: 91 PGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYA 147
P +S F +N +IG PP A++DTGS LIW +C+PC QC + FDP KS +++
Sbjct: 92 PVLSGNGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFS 151
Query: 148 TLPCDSSYCT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
L C S C + C D C Y Y + +QGT+ +E F F GK + +V
Sbjct: 152 KLSCSSQLCKALPQSSCS---DSCEYLYTYGDYSSTQGTMATETFTF-----GKVSIPNV 203
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
GFGC +N Q +G+ GLG S S +++ +KFSYC+ +++ + + L++G
Sbjct: 204 GFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKE--AKFSYCLTSID--DTKTSTLLMG 259
Query: 264 EGAILEGDS-----TPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
A + G S TP+ + YY++LEGIS+G L I + F+ D + G+
Sbjct: 260 SLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGT-GGLI 318
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
IDSGTT+T+L SA+ ++KE GL LCY+ + P + H
Sbjct: 319 IDSGTTITYLEESAFDLVKKEFTSQM-GLPVDNSGATGLELCYNLPSDTSELEVPKLVLH 377
Query: 376 FAGGADLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F GADL L E+ +SS V CLA+G S +SI G + QQN V++DL
Sbjct: 378 FT-GADLELPGENYMIADSSMGVICLAMGSSG-------GMSIFGNVQQQNMFVSHDLEK 429
Query: 435 KQLYFQRIDC 444
+ L F +C
Sbjct: 430 ETLSFLPTNC 439
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 129/431 (29%), Positives = 196/431 (45%), Gaps = 57/431 (13%)
Query: 42 LLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYL----SQKSSQKAHDTRAHLHPGIS 94
L+HRDS L YN +T + L S++R + + S KA ++ + G
Sbjct: 36 LIHRDSPLSPFYNSEETDLQRINNALRRSISRVHHFDPIAAASVSPKAAESDVTSNRG-- 93
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC 151
+ ++ S+G PP + + DTGS LIW +C+PCE+C FDP S TY C
Sbjct: 94 ---EYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSC 150
Query: 152 DSSYCT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
D+ C+ + C G + C Y Y + + G + S+ +++ GC
Sbjct: 151 DARQCSLLDQSTCSG--NICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGC 208
Query: 208 SH-NNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCI-------GNLNYFEYAYN 258
H N+ FSD+ +G+ GLG S S + VG KFSYC+ GN + + N
Sbjct: 209 GHENDGTFSDKG-SGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSN 267
Query: 259 MLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
++ G G STP+ + Y++TLE +S+G + + + + +
Sbjct: 268 AVVSGPGV----QSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTG----EGNII 319
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL--CYSGNINRDLQGFPAMA 373
IDSGTTLT + + L V + +G DP+ L CYS DL+ PA+
Sbjct: 320 IDSGTTLTIVPDDFFSNLSTAVGNQVEG---RRAEDPSGFLSVCYSA--TSDLK-VPAIT 373
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
HF GAD+ L + F Q S V CLA + +SI G +AQ N+ V Y++
Sbjct: 374 AHFT-GADVKLKPINTFVQVSDDVVCLAFASTT------SGISIYGNVAQMNFLVEYNIQ 426
Query: 434 SKQLYFQRIDC 444
K L F+ DC
Sbjct: 427 GKSLSFKPTDC 437
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 137/461 (29%), Positives = 207/461 (44%), Gaps = 58/461 (12%)
Query: 9 LLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQAQRTLN 65
+ SL+ L ++ +F++ TA + +L+HRDS +YN ++T + L
Sbjct: 4 VFSLLFL-ISTASVFSAVTA-----RDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALR 57
Query: 66 MSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S R + + + +A P + + V S+G PP +AV DTGS +IW
Sbjct: 58 RSSHRNTVVLESDTAEA--------PIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWT 109
Query: 126 KCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTNDCGGYP----DECWYNIRYTNGPD 178
+C+PC C A FDPSKS TY + C S C+ G EC Y+I Y +
Sbjct: 110 QCKPCSNCYQQNAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSH 169
Query: 179 SQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGL--GPATSSTHSLV 236
SQG + + +++ GC H+NA + +G+ GL GPA+ T L
Sbjct: 170 SQGNLAVDTVTMQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQ-LG 228
Query: 237 EKVGSKFSYCI-----GNLN---YFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVT 285
G KFSYC+ G+ N + N + G G + STP+ + Y +
Sbjct: 229 PATGGKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGTV----STPIYSSAQYKTFYSLK 284
Query: 286 LEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL 345
LE +S+G+ + K ++ + IDSGTTLT+L PSA L + Q +
Sbjct: 285 LEAVSVGDTKFNFPEGASK---LGGESNIIIDSGTTLTYL-PSAL--LNSFGSAISQSMS 338
Query: 346 PSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVG-- 403
+ DP+ L Y D P + HF GAD+ L E++F + S CLA G
Sbjct: 339 LPHAQDPSEFLDYCFATTTDDYEMPPVTMHFE-GADVPLQRENLFVRLSDDTICLAFGSF 397
Query: 404 PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
P D ++ I G IAQ N+ V YD+ + + FQ C
Sbjct: 398 PDD-------NIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 125/376 (33%), Positives = 182/376 (48%), Gaps = 48/376 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ ++ IG PP A+LDTGS LIW +C PC C FDP++S +YA LPC+S
Sbjct: 89 YLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPM 148
Query: 156 CTND----CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C Y + C Y Y + ++ G + +E F F T+D + + + FGC + N
Sbjct: 149 CNALYYPLC--YRNVCVYQYFYGDSANTAGVLSNETFTFGTNDT-RVTVPRIAFGCGNLN 205
Query: 212 AH--FSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAIL 268
A F+ G FG GP SLV ++GS +FSYC+ ++ + L G A L
Sbjct: 206 AGSLFNGSGMVG-FGRGPL-----SLVSQLGSPRFSYCL--TSFMSPVPSRLYFGAYATL 257
Query: 269 EG---------DSTPMSVIDG---SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
STP V G YY+ + GIS+G ++L IDP++F ND GV I
Sbjct: 258 NSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVII 317
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLP---SYPMDPAWHLCYS-GNINRDLQGFPAM 372
DSG+T+T+L +AY + + D Q LP + + C+ R + P +
Sbjct: 318 DSGSTITYLARAAYDMVHQAFAD--QVGLPLTNATSLADVLDTCFVWPPPPRKIVTMPEL 375
Query: 373 AFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
AFHF GA++ L E+ + + CLA+ SD D SIIG QN++V YD
Sbjct: 376 AFHFE-GANMELPLENYMLIDGDTGNLCLAIAASD-------DGSIIGSFQHQNFHVLYD 427
Query: 432 LVSKQLYFQRIDCELL 447
+ L F C ++
Sbjct: 428 NENSLLSFTPATCNVM 443
>gi|124359514|gb|ABD28633.2| Peptidase aspartic, catalytic [Medicago truncatula]
Length = 181
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 122/202 (60%), Gaps = 22/202 (10%)
Query: 247 IGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKN 306
+G+L +Y YN LILGE A L GD+TP V +G +VT+EGIS+G+K LDI P FK
Sbjct: 1 MGSLTDKDYDYNQLILGEEAYLAGDTTPFQVYNGVNHVTMEGISIGQKSLDIAPGTFKMK 60
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDL 366
+ + G+ +LT V + +Q L+ + E QG W LCY G+++RDL
Sbjct: 61 NNGTGGGL------SLTQEVRNLFQRLKFQ-EVRLQG--------SPWALCYFGSVSRDL 105
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
+GFP + F+FAGGA + LD + F Q VFC++V PS DLS+IG++AQQ+Y
Sbjct: 106 KGFPVVTFYFAGGAVIGLDTLNFFVQAKDDVFCMSVHPS-------HDLSVIGLLAQQSY 158
Query: 427 NVAYDLVSKQLYFQRIDCELLA 448
NV YD +Y + IDC+LL+
Sbjct: 159 NVGYDKDKGLIYIESIDCQLLS 180
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 127/429 (29%), Positives = 194/429 (45%), Gaps = 44/429 (10%)
Query: 42 LLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV 98
L+HRDS LYNP DT + + + + S++R S ++ + PG
Sbjct: 36 LIHRDSSVSPLYNPRDTYFDRLRNSFHRSISRANRFKPNSISARALVQSDIVPGGGE--- 92
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ + SIG P V LA+ DTGS LIWV+CQPCE C + FDP +S +Y + C + +
Sbjct: 93 YLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCGNEF 152
Query: 156 CTN--------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT----FLYDV 203
C D G+ C Y Y + S G + E+F +++ + + +V
Sbjct: 153 CNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAYFQEV 212
Query: 204 GFGC-SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL 262
FGC + N F + + G + S L K+ KFSYC+ + + +
Sbjct: 213 AFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTSKINF 272
Query: 263 GEGAILEGD-----STPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
G + G STP+ + YY+TLE IS+ K L NL+ N +
Sbjct: 273 GNDINISGSNYNVVSTPLLPKKPETYYYLTLEAISVENKRLPY-TNLW--NGEVEKGNII 329
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
IDSGTTLT+L + L VE+ +G S P +++C+ +L P + H
Sbjct: 330 IDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHG-LFNICFKDEKAIEL---PIITAH 385
Query: 376 FAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
F GAD+ L + F + + C + PS+ D++I G +AQ N+ V YDL K
Sbjct: 386 FT-GADVELQPVNTFAKVEEDLLCFTMIPSN-------DIAIFGNLAQMNFLVGYDLEKK 437
Query: 436 QLYFQRIDC 444
+ F DC
Sbjct: 438 AVSFLPTDC 446
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 130/404 (32%), Positives = 187/404 (46%), Gaps = 48/404 (11%)
Query: 68 MARFIYLSQKSSQKAH--DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
+AR + SS+ A D + +H G F ++ SIG P + A++DTGS L+W
Sbjct: 75 VARATGVPMTSSKAAGGGDLQVPVHAGNGE---FLMDVSIGTPALAYSAIVDTGSDLVWT 131
Query: 126 KCQPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGPD 178
+C+PC C FDPS S TYAT+PC S+ C T+ C +C Y Y +
Sbjct: 132 QCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTS-ASKCGYTYTYGDSSS 190
Query: 179 SQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEK 238
+QG + +E F K+ L V FGC N Q G+ GLG SLV +
Sbjct: 191 TQGVLATETFTLA-----KSKLPGVVFGCGDTNEGDGFSQGAGLVGLG---RGPLSLVSQ 242
Query: 239 VG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS-----------YYVTL 286
+G KFSYC+ +L+ + + L+LG A + S S + + YYV+L
Sbjct: 243 LGLDKFSYCLTSLD--DTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSL 300
Query: 287 EGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP 346
+ I++G + + + F D + GV +DSGT++T+L Y+ L+K Q LP
Sbjct: 301 KAITVGSTRISLPSSAFAVQDDGT-GGVIVDSGTSITYLEVQGYRALKKAFAA--QMALP 357
Query: 347 SYPMDP-AWHLCYSGNINR-DLQGFPAMAFHFAGGADLVLDAES-VFYQESSSVFCLAVG 403
+ LC+ D P + FHF GGADL L AE+ + S CL V
Sbjct: 358 AADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM 417
Query: 404 PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
S + LSIIG QQN+ YD+ L F + C L
Sbjct: 418 GS-------RGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 454
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 139/419 (33%), Positives = 195/419 (46%), Gaps = 46/419 (10%)
Query: 57 DAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVL 116
D QR+ + R L++ + + A + + + +IG PP+P AV
Sbjct: 72 DMHRQRSRSFGRDRDRELAESDGRTSTTVSARTRKDLPNGGEYLMTLAIGTPPLPYAAVA 131
Query: 117 DTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDS--SYCTNDCGGYPD----E 166
DTGS LIW +C PC QC A ++P+ S T++ LPC+S S C G
Sbjct: 132 DTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCA 191
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD-EQFTGVFGL 225
C Y Y G + G GSE F F +S + + V FGCS NA SD G+ GL
Sbjct: 192 CMYYQTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCS--NASSSDWNGSAGLVGL 248
Query: 226 GPATSSTHSLVEKVGS-KFSYCIGNLNYFE--YAYNMLILGEGAILEG---DSTPM---- 275
G + SLV ++G+ +FSYC L F+ + + L+LG A L G STP
Sbjct: 249 G---RGSLSLVSQLGAGRFSYC---LTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASP 302
Query: 276 --SVIDGSYYVTLEGISLGEKMLDIDPNLFK-KNDTWSDAGVFIDSGTTLTWLVPSAYQT 332
+ + YY+ L GISLG K L I P F K D G+ IDSGTT+T L +AYQ
Sbjct: 303 ARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPD--GTGGLIIDSGTTITSLANAAYQQ 360
Query: 333 LRKEVEDLFQGLLPSYPM--DPAWHLCYS--GNINRDLQGFPAMAFHFAGGADLVLDAES 388
+R V+ LP+ LC++ + P+M HF GAD+VL A+S
Sbjct: 361 VRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHF-DGADMVLPADS 419
Query: 389 VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+ S V+CLA + + +S G QQN ++ YD+ + L F C L
Sbjct: 420 -YMISGSGVWCLA-----MRNQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCSTL 472
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 120/377 (31%), Positives = 180/377 (47%), Gaps = 51/377 (13%)
Query: 94 STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKC-QPCEQC---GATTFDPSKSLTYATL 149
++ + V+ +IG PP+P AVLDTGS LIW +C PC +C A + P++S TYA +
Sbjct: 87 ASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANV 146
Query: 150 PCDSSYCT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
C S C + C C Y Y +G + G + +E F + T + V
Sbjct: 147 SCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGS----DTAVRGV 202
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLIL 262
FGC N +D +G+ G+G SLV ++G ++FSYC N A + L L
Sbjct: 203 AFGCGTENLGSTDNS-SGLVGMG---RGPLSLVSQLGVTRFSYCFTPFN--ATAASPLFL 256
Query: 263 GEGAILE--GDSTPM--SVIDGS------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
G A L +TP S G+ YY++LEGI++G+ +L IDP +F+ D
Sbjct: 257 GSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTP-MGDG 315
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH----LCYSGNINRDLQG 368
GV IDSGTT T L A+ L + + + P+ H LC++ ++
Sbjct: 316 GVIIDSGTTFTALEERAFVALARALASRVR-----LPLASGAHLGLSLCFAAASPEAVE- 369
Query: 369 FPAMAFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P + HF GAD+ L ES V S+ V CL + + + +S++G + QQN +
Sbjct: 370 VPRLVLHF-DGADMELRRESYVVEDRSAGVACLGMVSA-------RGMSVLGSMQQQNTH 421
Query: 428 VAYDLVSKQLYFQRIDC 444
+ YDL L F+ C
Sbjct: 422 ILYDLERGILSFEPAKC 438
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/419 (27%), Positives = 191/419 (45%), Gaps = 34/419 (8%)
Query: 42 LLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV 98
L+HRDS ++P+ T + S++R + + ++ + ++ + P
Sbjct: 36 LIHRDSPHSPFFDPSKTQAERLTDAFRRSVSR-VGRFRPTAMTSDGIQSRIVPSAGE--- 91
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ +N IG PPVP +A++DTGS L W +C+PC C FDP S TY C +S+
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSF 151
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + +C + Y +G + G + SE +++ FGC H++
Sbjct: 152 CLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSS 211
Query: 212 AHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
D+ +G+ GLG S S L + FSYC+ ++ + + G + G
Sbjct: 212 GGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGRVSG 271
Query: 271 ---DSTPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
STP+ D YY+TLEGIS+G+K L + K + + +DSGTT T+L
Sbjct: 272 YGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKG--YSKKTEVEEGNIIVDSGTTYTFL 329
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
Y L K V + +G P + + LCY N ++ P + HF A++ L
Sbjct: 330 PQEFYSKLEKSVANSIKGKRVRDP-NGIFSLCY--NTTAEINA-PIITAHFK-DANVELQ 384
Query: 386 AESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ F + + C V P+ D+ ++G +AQ N+ V +DL K++ F+ DC
Sbjct: 385 PLNTFMRMQEDLVCFTVAPTS-------DIGVLGNLAQVNFLVGFDLRKKRVSFKAADC 436
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 128/380 (33%), Positives = 180/380 (47%), Gaps = 46/380 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ + +IG PPVP +A+ DTGS L W +C+PC+ C +D + S +++ +PC S+
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASAT 154
Query: 156 C------TNDC-GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK----TFLYDVG 204
C + +C C Y Y +G S G +G+E F S G + V
Sbjct: 155 CLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGVA 214
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLIL- 262
FGC +N S TG GLG + SLV ++G KFSYC+ ++F + +L
Sbjct: 215 FGCGVDNGGLSYNS-TGTVGLG---RGSLSLVAQLGVGKFSYCL--TDFFNTSLGSPVLF 268
Query: 263 GEGAILEGDST-------PMSVIDG-----SYYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
G A L ST ++ G YYV+LEGISLG+ L I F D S
Sbjct: 269 GSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGS 328
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF-QGLLPSYPMDPAWHLCYSGNINRDLQGF 369
G+ +DSGT T LV SA++ + V + Q ++ + +D +G + L
Sbjct: 329 -GGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNASSLDSPCFPATAG--EQQLPDM 385
Query: 370 PAMAFHFAGGADLVL--DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P M HFAGGAD+ L D F QESSS FCL +I G SI+G QQN
Sbjct: 386 PDMLLHFAGGADMRLHRDNYMSFNQESSS-FCL-----NIAGAPSAYGSILGNFQQQNIQ 439
Query: 428 VAYDLVSKQLYFQRIDCELL 447
+ +D+ QL F DC L
Sbjct: 440 MLFDITVGQLSFVPTDCSKL 459
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 130/404 (32%), Positives = 187/404 (46%), Gaps = 48/404 (11%)
Query: 68 MARFIYLSQKSSQKAH--DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
+AR + SS+ A D + +H G F ++ SIG P + A++DTGS L+W
Sbjct: 65 VARATGVPMTSSKAAGGGDLQVPVHAGNGE---FLMDVSIGTPALAYSAIVDTGSDLVWT 121
Query: 126 KCQPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGPD 178
+C+PC C FDPS S TYAT+PC S+ C T+ C +C Y Y +
Sbjct: 122 QCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTS-ASKCGYTYTYGDSSS 180
Query: 179 SQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEK 238
+QG + +E F K+ L V FGC N Q G+ GLG SLV +
Sbjct: 181 TQGVLATETFTLA-----KSKLPGVVFGCGDTNEGDGFSQGAGLVGLG---RGPLSLVSQ 232
Query: 239 VG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS-----------YYVTL 286
+G KFSYC+ +L+ + + L+LG A + S S + + YYV+L
Sbjct: 233 LGLDKFSYCLTSLD--DTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSL 290
Query: 287 EGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP 346
+ I++G + + + F D + GV +DSGT++T+L Y+ L+K Q LP
Sbjct: 291 KAITVGSTRISLPSSAFAVQDDGT-GGVIVDSGTSITYLEVQGYRALKKAFAA--QMALP 347
Query: 347 SYPMDP-AWHLCYSGNINR-DLQGFPAMAFHFAGGADLVLDAES-VFYQESSSVFCLAVG 403
+ LC+ D P + FHF GGADL L AE+ + S CL V
Sbjct: 348 AADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM 407
Query: 404 PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
S + LSIIG QQN+ YD+ L F + C L
Sbjct: 408 GS-------RGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 444
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 142/430 (33%), Positives = 197/430 (45%), Gaps = 44/430 (10%)
Query: 38 LVTKLLHRDSLL--YNP-NDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS 94
L L+ DS L ++P N + + +R + S R L Q S + A ++ G
Sbjct: 55 LRIDLVRTDSPLSPFSPGNISSTERFKRAIKRSQDRLEKL-QMSVDEVKAVEAPVYAGNG 113
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPC 151
F + +IG P + A+LDTGS L W +C+PC C +DPS+S TY+ +PC
Sbjct: 114 E---FLMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPC 170
Query: 152 DSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
SS C C G C Y Y + +QG + E F + L + FGC
Sbjct: 171 SSSMCQALPMYSCSG--ANCEYLYSYGDQSSTQGILSYESFTLTSQS-----LPHIAFGC 223
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA 266
N Q G+ G G S S L + +G+KFSYC+ ++ + L +G+ A
Sbjct: 224 GQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTA 283
Query: 267 ILEGD---STPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSD--AGVFIDS 318
L STP+ YY++LEGIS+G ++LDI F D D GV IDS
Sbjct: 284 SLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTF---DLQLDGTGGVIIDS 340
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP-MDPAWHLCYSGNINRDLQGFPAMAFHFA 377
GTT+T+L S Y ++K V LP + LC+ FP + FHF
Sbjct: 341 GTTVTYLEQSGYDVVKKAVISSIN--LPQVDGSNIGLDLCFEPQSGSSTSHFPTITFHFE 398
Query: 378 GGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GAD L E+ Y +SS + CLA+ PS NG +SI G I QQNY + YD L
Sbjct: 399 -GADFNLPKENYIYTDSSGIACLAMLPS--NG-----MSIFGNIQQQNYQILYDNERNVL 450
Query: 438 YFQRIDCELL 447
F C+ L
Sbjct: 451 SFAPTVCDTL 460
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 121/379 (31%), Positives = 181/379 (47%), Gaps = 49/379 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
F ++ S+G P +P A++DTGS L+W +C+PC +C T FDP+ S TYA LPC S+
Sbjct: 116 FLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPCSSAL 175
Query: 156 CTN----------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
C + C Y Y + +QG + +E F + + V F
Sbjct: 176 CADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTL-----ARQKVPGVAF 230
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGE 264
GC N Q G+ GLG SLV ++G +FSYC+ +L+ +L+
Sbjct: 231 GCGDTNEGDGFTQGAGLVGLG---RGPLSLVSQLGIDRFSYCLTSLDDAAGRSPLLLGSA 287
Query: 265 GAILE------GDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
I +TP+ V + S YYV+L G+++G L + + F D + GV
Sbjct: 288 AGISASAATAPAQTTPL-VKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGT-GGV 345
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY-PMDPAWHLCY---SGNINRDLQ-GF 369
+DSGT++T+L AY+ LRK + LP+ + LC+ +G +++D+Q
Sbjct: 346 IVDSGTSITYLELRAYRALRKAF--VAHMSLPTVDASEIGLDLCFQGPAGAVDQDVQVQV 403
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
P + HF GGADL L AE+ +S+S CL V S + LSIIG QQN+
Sbjct: 404 PKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMAS-------RGLSIIGNFQQQNFQF 456
Query: 429 AYDLVSKQLYFQRIDCELL 447
YD+ L F +C L
Sbjct: 457 VYDVAGDTLSFAPAECNKL 475
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 139/427 (32%), Positives = 195/427 (45%), Gaps = 51/427 (11%)
Query: 52 PNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP--VFYVNFSIGQPP 109
P D + +R + MA + S +S +A R P + VP + V+ +IG PP
Sbjct: 368 PRDGGRSLTRREVLHRMAARLLFS--ASGRAASARVDPGPYANGVPDTEYLVHLAIGTPP 425
Query: 110 VPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTN----DCGG 162
P +LDTGS L+W +C+PC C + DPS S T+ LPC S C N CG
Sbjct: 426 QPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGK 485
Query: 163 Y---PDECWYNIRYTNGPDSQGTIGSEQFNFETSD-EGKTFLYDVGFGCS-HNNAHFSDE 217
+ C Y Y +G + G + +E F F +D G+ + D+ FGC NN F+
Sbjct: 486 HNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGCGLFNNGIFTSN 545
Query: 218 QFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD----ST 273
+ TG+ G G S S ++ FS+C + E + +L L + D ST
Sbjct: 546 E-TGIAGFGRGALSLPSQLKV--DNFSHCFTAITGSEPSSVLLGLPANLYSDADGAVQST 602
Query: 274 PMSVIDGS---YYVTLEGISLGEKMLDIDPNLFK-KNDTWSDAGVFIDSGTTLTWLVPSA 329
P+ S YY++L+GI++G L I + F K D G IDSGT +T L A
Sbjct: 603 PLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQD--GTGGTIIDSGTGMTTLPQDA 660
Query: 330 YQTLRKEVEDLFQGLLPSYPMDPAW-----HLCYSGNINRDLQ-GFPAMAFHFAGGADLV 383
Y K V D F + P+D A LC+S ++ R + P + HF GA L
Sbjct: 661 Y----KLVHDAFTAQV-RLPVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFE-GATLD 714
Query: 384 LDAESVFYQ---ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
L E+ ++ SV CLA+ D DL+IIG QQN +V YDLV L F
Sbjct: 715 LPRENYMFEFEDAGGSVTCLAINAGD-------DLTIIGNYQQQNLHVLYDLVRNMLSFV 767
Query: 441 RIDCELL 447
C L
Sbjct: 768 PAQCNRL 774
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 133/413 (32%), Positives = 194/413 (46%), Gaps = 43/413 (10%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYV-NFSIGQPPVPQLA 114
VD++ T M R ++ S+ + +D + P + +V V Y+ +IG+PPVP +A
Sbjct: 30 VDSKGGYTKTELMRRAVHRSRLRALSGYDATS---PRLHSVQVEYLMELAIGKPPVPFVA 86
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDE- 166
+ DTGS L W +CQPC+ C +DPS S T++ LPC S+ C + +C P
Sbjct: 87 LADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSATCLPIWSRNC--TPSSL 144
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLG 226
C Y Y +G S G +G+E S + V FGC +N S TG GLG
Sbjct: 145 CRYRYAYGDGAYSAGILGTETLTLGPS-SAPVSVGGVAFGCGTDNGGDSLNS-TGTVGLG 202
Query: 227 PATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNM-LILGEGAILE-GDSTPMSVI----- 278
T SL+ ++G KFSYC+ ++F A + +LG A L G ST S
Sbjct: 203 ---RGTLSLLAQLGVGKFSYCL--TDFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSP 257
Query: 279 --DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKE 336
Y+V+L+GISLG+ L I F + G+ +DSGTT T L S ++ +
Sbjct: 258 QNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGT-GGMIVDSGTTFTILAESGFREVVGR 316
Query: 337 VEDLF-QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVF-YQES 394
V + Q + + +D +G P + HFAGGAD+ L ++ Y E
Sbjct: 317 VARVLGQPPVNASSLDAPCFPAPAGEPPY----MPDLVLHFAGGADMRLYRDNYMSYNEE 372
Query: 395 SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
S FCL +I G + S++G QQN + +D QL F DC L
Sbjct: 373 DSSFCL-----NIAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPTDCSKL 420
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 122/371 (32%), Positives = 174/371 (46%), Gaps = 43/371 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
F ++ SIG P + A++DTGS L+W +C+PC C FDPS S TYAT+PC S+
Sbjct: 74 FLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSAS 133
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C T+ C +C Y Y + +QG + +E F K+ L V FGC N
Sbjct: 134 CSDLPTSKCTS-ASKCGYTYTYGDSSSTQGVLATETFTLA-----KSKLPGVVFGCGDTN 187
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
Q G+ GLG SLV ++G KFSYC+ +L+ + + L+LG A +
Sbjct: 188 EGDGFSQGAGLVGLG---RGPLSLVSQLGLDKFSYCLTSLD--DTNNSPLLLGSLAGISE 242
Query: 271 DSTPMSVIDGS-----------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
S S + + YYV+L+ I++G + + + F D + GV +DSG
Sbjct: 243 ASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGT-GGVIVDSG 301
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP-AWHLCYSGNINR-DLQGFPAMAFHFA 377
T++T+L Y+ L+K Q LP+ LC+ D P + FHF
Sbjct: 302 TSITYLEVQGYRALKKAFAA--QMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFD 359
Query: 378 GGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQ 436
GGADL L AE+ + S CL V S + LSIIG QQN+ YD+
Sbjct: 360 GGADLDLPAENYMVLDGGSGALCLTVMGS-------RGLSIIGNFQQQNFQFVYDVGHDT 412
Query: 437 LYFQRIDCELL 447
L F + C L
Sbjct: 413 LSFAPVQCNKL 423
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 124/374 (33%), Positives = 173/374 (46%), Gaps = 44/374 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDS- 153
+ + +IG PP+ A+ DTGS LIW +C PC QC ++PS S T+ LPC+S
Sbjct: 88 YIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSS 147
Query: 154 -SYCTNDCGGYPD---ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
S C G P C YN Y G + G E F F ++ +T + + FGCS+
Sbjct: 148 VSMCAALAGPSPPPGCSCMYNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGCSN 206
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYA--YNMLILGEGA 266
S + + G GL + SLV ++G+ FSYC L F+ A + L+LG A
Sbjct: 207 A----SSDDWNGSAGLVGLGRGSMSLVSQLGAGMFSYC---LTPFQDANSTSTLLLGPSA 259
Query: 267 ILEG------------DSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
L G PMS YY+ L GIS+G L I PN F T G+
Sbjct: 260 ALNGTGVLTTPFVASPSKAPMSTY---YYLNLTGISIGTTALSIPPNAFALR-TDGTGGL 315
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDL-QGFPAMA 373
IDSGTT+T LV +AYQ +R +E L + LC++ P+M
Sbjct: 316 IIDSGTTITSLVDAAYQQVRAAIESLVTLPVADGSDSTGLDLCFALTSETSTPPSMPSMT 375
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
FHF GAD+VL ++ + S V+CLA + + +S G QQN ++ YD+
Sbjct: 376 FHF-DGADMVLPVDN-YMILGSGVWCLA-----MRNQTVGAMSTFGNYQQQNVHLLYDIH 428
Query: 434 SKQLYFQRIDCELL 447
+ L F C L
Sbjct: 429 EETLSFAPAKCSTL 442
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 130/425 (30%), Positives = 192/425 (45%), Gaps = 40/425 (9%)
Query: 41 KLLH---RDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP 97
+L+H S YN ++ + + S R YL+ S + P I P
Sbjct: 29 ELIHPISSKSPFYNTAESHFQRMSNNMKHSTNRVHYLNHVFSFPPNKV-----PNIVVSP 83
Query: 98 V----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLP 150
+ ++F IG PP V+DT + IW +C PC+ C TT FDPSKS TY T+P
Sbjct: 84 FMGDGYIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMFDPSKSSTYKTIP 143
Query: 151 CDSSYCTN----DCGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
C S C N C + C Y+ Y SQG + + ++++ ++
Sbjct: 144 CSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKNIVI 203
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
GC H N + +G GLG S S L +G KFSYC+ L E L G+
Sbjct: 204 GCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLFSNEGISGKLHFGD 263
Query: 265 GAILEG---DSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
+++ G STP++ + Y TL +S+G+ ++ + N KND + IDSGTT
Sbjct: 264 KSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFE-NSTSKNDNLGNT--IIDSGTT 320
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNI-NRDLQGFPAMAFHFAGGA 380
LT L + Y L V + + P + + LCY + N D+ P + HF GA
Sbjct: 321 LTILPENVYSRLESIVTSMVKLERAKSP-NQQFKLCYKATLKNLDV---PIITAHF-NGA 375
Query: 381 DLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
D+ L++ + FY V C A V + G +IIG IAQQN+ V +DL + F
Sbjct: 376 DVHLNSLNTFYPIDHEVVCFAFVSVGNFPG------TIIGNIAQQNFLVGFDLQKNIISF 429
Query: 440 QRIDC 444
+ DC
Sbjct: 430 KPTDC 434
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 131/447 (29%), Positives = 200/447 (44%), Gaps = 56/447 (12%)
Query: 11 SLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMAR 70
S + L F R+ S T G L+ + R S YNP +T + LN S+ R
Sbjct: 6 SFVLLLFCFCRL--SLTKTQNHGFNVELIHPISSR-SPFYNPKETQIQRISSILNYSINR 62
Query: 71 FIYLSQK---SSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKC 127
YL+ S K D G V +++SIG PP +++DTG+ IW +C
Sbjct: 63 VRYLNHVFSFSPNKIQDVPLSSFMGAGYV----MSYSIGTPPFQLYSLIDTGNDNIWFQC 118
Query: 128 QPCEQCGATT---FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIG 184
+PC+ C T F PSKS TY T+PC S C N G Y +G
Sbjct: 119 KPCKPCLNQTSPMFHPSKSSTYKTIPCTSPICKNADGHY-------------------LG 159
Query: 185 SEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGL--GPATSSTHSLVEKVGSK 242
+ +++ ++ GC H N + +G GL GP S L +G K
Sbjct: 160 VDTLTLNSNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPL-SFISQLNSSIGGK 218
Query: 243 FSYCIGNLNYFEYAYNMLILGEGAILEG---DSTPMSVIDGSYYVTLEGISLGEKMLDID 299
FSYC+ L E + L G+ + + G STP+ +G Y+V+LE S+G+ ++ ++
Sbjct: 219 FSYCLVPLFSKENVSSKLHFGDKSTVSGLGTVSTPIKEENG-YFVSLEAFSVGDHIIKLE 277
Query: 300 PNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLC 357
++ + IDSGTT+T L Y L V D+ + DP+ ++LC
Sbjct: 278 -------NSDNRGNSIIDSGTTMTILPKDVYSRLESVVLDMVK---LKRVKDPSQQFNLC 327
Query: 358 YSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSI 417
Y L + HF+ G+++ L+A + FY + V C A ++G F L+I
Sbjct: 328 YQTTSTTLLTKVLIITAHFS-GSEVHLNALNTFYPITDEVICFAF----VSGGNFSSLAI 382
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDC 444
G + QQN+ V +DL K + F+ DC
Sbjct: 383 FGNVVQQNFLVGFDLNKKTISFKPTDC 409
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 141/455 (30%), Positives = 190/455 (41%), Gaps = 78/455 (17%)
Query: 35 PKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSS----QKAHDTRAHLH 90
P R L HR + P + A S A + + + +KA R
Sbjct: 51 PTRASVPLAHR----HGPCAPKGSSATDKKKPSFAERLRSDRARADHILRKASGRRMMSE 106
Query: 91 PGISTVPVF----------YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA----- 135
G +++P + V IG P V Q ++DTGS L WV+C+PC
Sbjct: 107 GGGASIPTYLGGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKD 166
Query: 136 TTFDPSKSLTYATLPCDS------------SYCTNDCGGYPDECWYNIRYTNGPDSQGTI 183
FDPSKS T+AT+PC S + CTN+ G P +C Y I Y NG ++G
Sbjct: 167 PLFDPSKSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVY 226
Query: 184 GSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSK 242
+E +S K+F FGC + H ++F G+ GLG A S S V G
Sbjct: 227 STETLALGSSAVVKSFR----FGCGSDQ-HGPYDKFDGLLGLGGAPESLVSQTASVYGGA 281
Query: 243 FSYCIGNLNYFEYAYNMLILGEGAILEGDS-----TPMSV----IDGSYYVTLEGISLGE 293
FSYC+ LN L LG + TPM I Y VTL GIS+G
Sbjct: 282 FSYCLPPLN---SGAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGG 338
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM--- 350
K LDI P +F K G +DSGT +T + +AY+ LR F+ + YP+
Sbjct: 339 KALDIPPAVFAK-------GNIVDSGTVITGIPTTAYKALRTA----FRSAMAEYPLLPP 387
Query: 351 -DPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDING 409
D A CY+ + + P +A F GGA + LD S E CLA +D
Sbjct: 388 ADSALDTCYNFTGHGTVT-VPKVALTFVGGATVDLDVPSGVLVED----CLAF--ADAGD 440
Query: 410 ERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
F IIG + + V YD L F+ C
Sbjct: 441 GSF---GIIGNVNTRTIEVLYDSGKGHLGFRAGAC 472
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 116/365 (31%), Positives = 176/365 (48%), Gaps = 34/365 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+++ S+G PP V+DTGS ++W++C PC C FDP KS TY+TL C+S
Sbjct: 37 YFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYKSSTYSTLGCNSRQ 96
Query: 156 CTN-DCGG-YPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLYDVGFGCSHNNA 212
C N D GG ++C Y + Y +G S G ++ + TS G+ L + GC H+N
Sbjct: 97 CLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGCGHDNE 156
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS 272
+ + S + + + G +FSYC+ + + LI G+ A+
Sbjct: 157 GYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGRDTDSTERSSLIFGDAAVPPAGV 216
Query: 273 --TPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
TP + + YY+ + GIS+G +L I + F+ D+ + GV IDSGT++T L
Sbjct: 217 RFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQL-DSLGNGGVIIDSGTSVTRLQN 275
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG--FPAMAFHFAGGADLVLD 385
+AY +LR+ L+ + + CY N++ DL P + HF GGADL L
Sbjct: 276 AAYASLREAFRAGTSDLVLTTEFS-LFDTCY--NLS-DLSSVDVPTVTLHFQGGADLKLP 331
Query: 386 AESVFYQ-ESSSVFCLA----VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
A + ++SS FCLA GP SIIG I QQ + V YD + Q+ F
Sbjct: 332 ASNYLVPVDNSSTFCLAFAGTTGP-----------SIIGNIQQQGFRVIYDNLHNQVGFV 380
Query: 441 RIDCE 445
C+
Sbjct: 381 PSQCD 385
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 114/406 (28%), Positives = 182/406 (44%), Gaps = 39/406 (9%)
Query: 52 PNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVP 111
PN T ++ + R +L + S D A++ P S + + G P
Sbjct: 69 PNRTWESLMSEKIRGDANRLRFLKRTSRSSKEDANANV-PVRSGSGEYIIQVDFGTPKQS 127
Query: 112 QLAVLDTGSSLIWVKCQPCEQCGATT--FDPSKSLTYATLPCDSSYC---TNDCGGYPDE 166
++DTGS + W+ C+ C+ C +T FDP+KS +Y CDS C + +CGG +
Sbjct: 128 MYTLIDTGSDVAWIPCKQCQGCHSTAPIFDPAKSSSYKPFACDSQPCQEISGNCGGN-SK 186
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH--FSDEQFTGVFG 224
C + + Y +G GT+ S+ G +L + FGC+ + + +S G+ G
Sbjct: 187 CQFEVLYGDGTQVDGTLASDAITL-----GSQYLPNFSFGCAESLSEDTYSSPGLMGLGG 241
Query: 225 LGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVI--DGS- 281
+ + E G FSYC L + L+LG+ A + S + + D S
Sbjct: 242 GSLSLLTQAPTAELFGGTFSYC---LPSSSTSSGSLVLGKEAAVSSSSLKFTTLIKDPSF 298
Query: 282 ---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE 338
Y+VTL+ IS+G + + + S G IDSGTT+T+LVPSAY+ LR
Sbjct: 299 PTFYFVTLKAISVGNTRISV-----PATNIASGGGTIIDSGTTITYLVPSAYKDLRDAFR 353
Query: 339 DLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVF 398
L P+ D CY +++ P + H DLVL E++ + S +
Sbjct: 354 QQLSSLQPTPVED--MDTCY--DLSSSSVDVPTITLHLDRNVDLVLPKENILITQESGLS 409
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
CLA +D SIIG + QQN+ + +D+ + Q+ F + C
Sbjct: 410 CLAFSSTD-------SRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 123/425 (28%), Positives = 195/425 (45%), Gaps = 45/425 (10%)
Query: 43 LHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIST-VPVFYV 101
+ RD+L ++ + +T+N + R +++ + D +A + G+S +++
Sbjct: 5 ISRDNLRVA---SIHGRINQTVN-GLTRSRSRDRQTKVPSQDFQAPVVSGLSLGSGEYFI 60
Query: 102 NFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTN 158
S+G PP V+DTGS ++W++C PC C FDP KS TY+TL C + C N
Sbjct: 61 RISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCLN 120
Query: 159 -DCGG-YPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLYDVGFGCSHNNAHFS 215
D G ++C Y + Y +G + G G++ + TS G+ L + GC H+N +
Sbjct: 121 LDIGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYF 180
Query: 216 DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS--T 273
+ S + + + G +FSYC+ + + L+ GE A+ + T
Sbjct: 181 VGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGARFT 240
Query: 274 PMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAY 330
P + YY+ + GIS+G +L I + F+ D+ + GV IDSGT++T L +AY
Sbjct: 241 PQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQL-DSLGNGGVIIDSGTSVTRLQNAAY 299
Query: 331 QTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFHFAGGADLVLD 385
+LR L P+ + CY DL G P + HF GG DL L
Sbjct: 300 ASLRDAFRAGTSDLAPTAGFS-LFDTCY------DLSGLASVDVPTVTLHFQGGTDLKLP 352
Query: 386 AESVFYQ-ESSSVFCLA----VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
A + ++S+ FCLA GP SIIG I QQ + V YD + Q+ F
Sbjct: 353 ASNYLIPVDNSNTFCLAFAGTTGP-----------SIIGNIQQQGFRVIYDNLHNQVGFV 401
Query: 441 RIDCE 445
C
Sbjct: 402 PSQCN 406
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 170/365 (46%), Gaps = 43/365 (11%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC----T 157
IG P + A++DTGS L+W +C+PC C FDPS S TYAT+PC S+ C T
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232
Query: 158 NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDE 217
+ C +C Y Y + +QG + +E F K+ L V FGC N
Sbjct: 233 SKCTSA-SKCGYTYTYGDSSSTQGVLATETFTLA-----KSKLPGVVFGCGDTNEGDGFS 286
Query: 218 QFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMS 276
Q G+ GLG SLV ++G KFSYC+ +L+ + + L+LG A + S S
Sbjct: 287 QGAGLVGLG---RGPLSLVSQLGLDKFSYCLTSLD--DTNNSPLLLGSLAGISEASAAAS 341
Query: 277 VIDGS-----------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+ + YYV+L+ I++G + + + F D + GV +DSGT++T+L
Sbjct: 342 SVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGT-GGVIVDSGTSITYL 400
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDP-AWHLCYSGNINR-DLQGFPAMAFHFAGGADLV 383
Y+ L+K Q LP+ LC+ D P + FHF GGADL
Sbjct: 401 EVQGYRALKKAFA--AQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLD 458
Query: 384 LDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L AE+ + S CL V S + LSIIG QQN+ YD+ L F +
Sbjct: 459 LPAENYMVLDGGSGALCLTVMGS-------RGLSIIGNFQQQNFQFVYDVGHDTLSFAPV 511
Query: 443 DCELL 447
C L
Sbjct: 512 QCNKL 516
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 132/481 (27%), Positives = 209/481 (43%), Gaps = 73/481 (15%)
Query: 14 TLPFTSTRIFTSTTAAPAAGKPKRLVTK---------LLHRDSLLY----NPNDTVDAQA 60
TL + I T T AP + ++ TK ++HRDSLL N + + +
Sbjct: 81 TLDIAAWLIETKTAPAPGRDEYEKRETKPRQTPWSVQVVHRDSLLVKDAANATASYERRL 140
Query: 61 QRTLNMSMARFIYLSQKSSQKAHDTR--AHLHPGISTVPV----------------FYVN 102
+ TL R L Q+ ++ + A H ++ V ++
Sbjct: 141 EETLRRDARRVRGLEQRIEKRLRLNKDPAGSHENVAEVAAEFGGEVVSGMAQGSGEYFTR 200
Query: 103 FSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCT-- 157
+G P Q VLDTGS ++W++C+PC +C + F+PS S +++TL C+S+ C+
Sbjct: 201 IGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAVCSYL 260
Query: 158 --NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS 215
+C G C Y + Y +G + G+ +E F G T + +V GC H+NA
Sbjct: 261 DAYNCHG--GGCLYKVSYGDGSYTIGSFATEMLTF-----GTTSVRNVAIGCGHDNAGLF 313
Query: 216 DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS-TP 274
+ S L + G FSYC+ ++ F + L G ++ G TP
Sbjct: 314 VGAAGLLGLGAGLLSFPSQLGTQTGRAFSYCL--VDRFSESSGTLEFGPESVPLGSILTP 371
Query: 275 MSV---IDGSYYVTLEGISLGEKMLD-IDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAY 330
+ + YYV L IS+G +LD + P++F+ ++T G +DSGT +T L Y
Sbjct: 372 LLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVY 431
Query: 331 QTLRKE-VEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFHFAGGADLVL 384
+R V Q LP + CY DL G P + FHF+ GA L+L
Sbjct: 432 DAVRDAFVAGTRQ--LPKAEGVSIFDTCY------DLSGLPLVNVPTVVFHFSNGASLIL 483
Query: 385 DAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
A++ + + FC A P+ DLSI+G I QQ V++D + + F
Sbjct: 484 PAKNYMIPMDFMGTFCFAFAPAT------SDLSIMGNIQQQGIRVSFDTANSLVGFALRQ 537
Query: 444 C 444
C
Sbjct: 538 C 538
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 131/408 (32%), Positives = 199/408 (48%), Gaps = 47/408 (11%)
Query: 61 QRTLNMSMARFIYLSQKSSQKAH---DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLD 117
+R + S R L S+ H D + P I + + + +IG P + A++D
Sbjct: 2 KRAIQRSQERLEKLQITSAVNTHQMKDIETPVTPDIGSGE-YLIQMAIGTPALSLSAIMD 60
Query: 118 TGSSLIWVKCQPCEQCGA-TTFDPSKSLTYATLPCDSSY--------CTNDCGGYPDECW 168
TGS L+W KC PC C + +DPS S TY+ + C SS C ND +C
Sbjct: 61 TGSDLVWTKCNPCTDCSTSSIYDPSSSSTYSKVLCQSSLCQPPSIFSCNND-----GDCE 115
Query: 169 YNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPA 228
Y Y + + G + E F+ + L ++ FGC H+N F ++ G+ G G
Sbjct: 116 YVYPYGDRSSTSGILSDETFSISSQS-----LPNITFGCGHDNQGF--DKVGGLVGFGRG 168
Query: 229 TSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEG---DSTPM--SVIDGSY 282
+ S S L +G+KFSYC+ + + L +G A LE STP+ S Y
Sbjct: 169 SLSLVSQLGPSMGNKFSYCLVSRTD-SSKTSPLFIGNTASLEATTVGSTPLVQSSSTNHY 227
Query: 283 YVTLEGISLGEKMLDIDPNLFKKNDTWSD--AGVFIDSGTTLTWLVPSAYQTLRKEVEDL 340
Y++LEGIS+G + L I F D SD G+ IDSGTTLT+L +AY +++ +
Sbjct: 228 YLSLEGISVGGQSLAIPTGTF---DIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSS 284
Query: 341 FQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS-VFC 399
LP D LC++ + + GFP+M FHF GAD + E+ + +S+S + C
Sbjct: 285 IN--LPQ--ADGQLDLCFNQQGSSN-PGFPSMTFHFK-GADYDVPKENYLFPDSTSDIVC 338
Query: 400 LAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
LA+ P++ N +++I G + QQNY + YD + L F C+ L
Sbjct: 339 LAMMPTNSN---LGNMAIFGNVQQQNYQILYDNENNVLSFAPTACDTL 383
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 126/428 (29%), Positives = 191/428 (44%), Gaps = 53/428 (12%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQK-----SSQKAHDTRAHLHPGIS- 94
L+HRD++ + Q + AR +L ++ S D + + PG+
Sbjct: 66 SLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVDD 125
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC 151
++V +G PP Q V+D+GS +IWV+C+PCEQC A T FDP+ S +++ + C
Sbjct: 126 GSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSC 185
Query: 152 DSSYCTN------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
S+ C GG +C Y++ Y +G ++G + ET G T + V
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKG-----ELALETLTLGGTAVQGVAI 240
Query: 206 GCSHNNAHFSDEQFTGVFGLG-PATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
GC H N+ G+ GLG A S L G FSYC+ + L+LG
Sbjct: 241 GCGHRNSGLFVGA-AGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRG--AGGAGSLVLGR 297
Query: 265 GAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
+ S YYV L GI +G + L + +LF+ + + GV +D+GT +T
Sbjct: 298 TEAVPRGRRASSF----YYVGLTGIGVGGERLPLQDSLFQLTEDGA-GGVVMDTGTAVTR 352
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAFHF 376
L AY LR F G + + P PA L CY DL G+ P ++F+F
Sbjct: 353 LPREAYAALRGA----FDGAMGALPRSPAVSLLDTCY------DLSGYASVRVPTVSFYF 402
Query: 377 AGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQ 436
GA L L A ++ + +VFCLA PS +SI+G I Q+ + D +
Sbjct: 403 DQGAVLTLPARNLLVEVGGAVFCLAFAPSS------SGISILGNIQQEGIQITVDSANGY 456
Query: 437 LYFQRIDC 444
+ F C
Sbjct: 457 VGFGPNTC 464
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 131/424 (30%), Positives = 187/424 (44%), Gaps = 45/424 (10%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLN---MSMARFIYLSQKS--SQKAHDTRAHLHPGIST 95
+++HRDS + Q QR N SM R + +Q S S L G
Sbjct: 30 EIIHRDSSRSPFYRATETQFQRVTNAVRRSMNRANHFNQISVYSNAVESPVTLLDDGD-- 87
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCD 152
+ +++S+G PP P ++DT S +IWV+CQ CE C T FDPS S TY LPC
Sbjct: 88 ---YLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCS 144
Query: 153 SSYCTNDCGG--YPDE---CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
S+ C + G DE C + + Y +G SQG + E + ++ GC
Sbjct: 145 STTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGC 204
Query: 208 SHN-NAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
N N F G+ GLG S L + KFSYC+ ++ + L G+
Sbjct: 205 IRNTNVSFDS---IGIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDRS---SKLKFGDA 258
Query: 266 AILEGDSTPMSVI-----DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
A++ GD T + I YY+TLE S+G ++ + + + IDSGT
Sbjct: 259 AMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEF---RSSSSRSSGKGNIIIDSGT 315
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
T T L Y L V D+ + P+ + LCY D P + HF+ GA
Sbjct: 316 TFTVLPDDVYSKLESAVADVVKLERAEDPLK-QFSLCYKSTY--DKVDVPVITAHFS-GA 371
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
D+ L+A + F S V CLA S + +I G +AQQN+ V YDL K + F+
Sbjct: 372 DVKLNALNTFIVASHRVVCLAFLSS-------QSGAIFGNLAQQNFLVGYDLQRKIVSFK 424
Query: 441 RIDC 444
DC
Sbjct: 425 PTDC 428
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 131/468 (27%), Positives = 205/468 (43%), Gaps = 53/468 (11%)
Query: 5 HAILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTL 64
HA+ LL L+ +F+S+ A P L + HRD++ P D V + +
Sbjct: 6 HALALLGLL--------VFSSSHAT--LQPPTTLHVPVFHRDTVFPPPPDDVKCVSLLSR 55
Query: 65 NMSMARFIYLSQKSSQ-----KAHDTRAHLH-PGISTVPV----FYVNFSIGQPPVPQLA 114
++ Y + +S AHD HLH P IS +P ++ + +G PP P L
Sbjct: 56 RLAADAARYAALVASLIIGSLTAHDDD-HLHSPVISGLPFASGEYFASVGVGTPPTPALL 114
Query: 115 VLDTGSSLIWVKCQPCEQCG---ATTFDPSKSLTYATLPCDSSYCTN--DCGGYPDECWY 169
V+DTGS ++W++C+PC C + +DP S TYA PC C N C G C Y
Sbjct: 115 VIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGSSTYAQTPCSPPQCRNPQTCDGTTGGCGY 174
Query: 170 NIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPAT 229
I Y + + G + +++ F T + +V GC H+N G+ G+
Sbjct: 175 RIVYGDASSTSGNLATDRLVFSN----DTSVGNVTLGCGHDNEGLFGSA-AGLLGVARGN 229
Query: 230 SSTHSLV-EKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS--TPMSVIDGS---YY 283
+S + V + G F+YC+G+ + + L+ G A S TP+ YY
Sbjct: 230 NSFATQVADSYGRYFAYCLGDRTRSGSSSSYLVFGRTAPEPPSSVFTPLRSNPRRPSLYY 289
Query: 284 VTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
V + G S+ GE + + GV +DSGT++T AY LR D F
Sbjct: 290 VDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDSGTSITRFARDAYGALR----DAFD 345
Query: 343 GLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFHFAGGADLVLDAESVFYQESSSV 397
M + DL+G P + HFAGGAD+ L E+ E S
Sbjct: 346 ARAAKVGMRKVGRGISVFDACYDLRGVAVADAPGVVLHFAGGADVALPPENYLVPEESGR 405
Query: 398 F-CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ C A+ + +G LS+IG + QQ + V +D+ ++++ F+ C
Sbjct: 406 YHCFALEAAGHDG-----LSVIGNVLQQRFRVVFDVENERVGFEPNGC 448
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 123/374 (32%), Positives = 178/374 (47%), Gaps = 47/374 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ ++ IG PP A++DTGS LIW +C PC C F+P+KS +YA+LPC S+
Sbjct: 88 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAM 147
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C + + C Y Y + S G + +E F F T + + + V FGC + N
Sbjct: 148 CNALYSPLC--FQNACVYQAFYGDSASSAGVLANETFTFGT-NSTRVAVPRVSFGCGNMN 204
Query: 212 AH--FSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAIL 268
A F+ G FG G SLV ++GS +FSYC+ ++ A + L G A L
Sbjct: 205 AGTLFNGSGMVG-FGRG-----ALSLVSQLGSPRFSYCL--TSFMSPATSRLYFGAYATL 256
Query: 269 EG---------DSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
STP V + Y++ + GIS+ +L IDP++F N+T GV I
Sbjct: 257 NSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVII 316
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYS-GNINRDLQGFPAMA 373
DSGTT+T+L AY ++ LP P+ + C+ R + P M
Sbjct: 317 DSGTTVTFLAQPAYAMVQGAFVAWVG--LPRANATPSDTFDTCFKWPPPPRRMVTLPEMV 374
Query: 374 FHFAGGADLVLDAESVFYQESSSV-FCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
HF GAD+ L E+ + + CLA+ PSD D SIIG QN+++ YDL
Sbjct: 375 LHF-DGADMELPLENYMVMDGGTGNLCLAMLPSD-------DGSIIGSFQHQNFHMLYDL 426
Query: 433 VSKQLYFQRIDCEL 446
+ L F C L
Sbjct: 427 ENSLLSFVPAPCNL 440
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 122/421 (28%), Positives = 189/421 (44%), Gaps = 36/421 (8%)
Query: 42 LLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV 98
L+HRDS L YNPN T + + + S++R K A D + + +
Sbjct: 38 LIHRDSPLSPLYNPNHTDFDRLRNAFSRSISRVNVFKTK----AVDINSFQNDLVPNGGE 93
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+++ SIG P V + + DTGS L WV+C PC+ C + FDPS+S +Y + C S +
Sbjct: 94 YFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRF 153
Query: 156 CT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-S 208
C C + C Y+ Y + + G + +E+F ++ L + FGC +
Sbjct: 154 CNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFGCGT 213
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
N F + V G A S L + KFSYC+ L+ + + G +++
Sbjct: 214 GNGGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSNVTSKIKFGTDSVI 273
Query: 269 EGD---STPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
G STP+ D YYVTLE IS+G K L L N V IDSGTTLT
Sbjct: 274 SGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGN--VEKGNVIIDSGTTLT 331
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
+L + L + +E+ + S P + +C+ + DL P +A HF AD+
Sbjct: 332 FLDSEFFTELERVLEETVKAERVSDPRG-LFSVCFRSAGDIDL---PVIAVHF-NDADVK 386
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
L + F + + C + S+ + I G +AQ ++ V YDL + + F+ D
Sbjct: 387 LQPLNTFVKADEDLLCFTMISSN-------QIGIFGNLAQMDFLVGYDLEKRTVSFKPTD 439
Query: 444 C 444
C
Sbjct: 440 C 440
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 135/465 (29%), Positives = 208/465 (44%), Gaps = 70/465 (15%)
Query: 5 HAILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLL-------YNPNDTVD 57
H ILLL + F+ T I T L HRDSLL + D +
Sbjct: 10 HLILLL----ISFSQTTIINGDNG---------FTTSLFHRDSLLSPLEFSSLSHYDRLT 56
Query: 58 AQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLD 117
+R+L+ S L++ ++ A D +A L PG + ++ SIG PPV + + D
Sbjct: 57 NAFRRSLSRSAT---LLNRAATNGALDLQAPLTPGSGE---YLMSVSIGTPPVDYIGMAD 110
Query: 118 TGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT----NDCGGYPDECWYN 170
TGS L+W +C PC +C FDP KS +++ +PC+S C + CG C Y+
Sbjct: 111 TGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQ-GVCDYS 169
Query: 171 IRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATS 230
Y + ++G +G E+ +S GC H + V GLG
Sbjct: 170 YTYGDQTYTKGDLGFEKITIGSSSVKSV------IGCGHESGGGFGFASG-VIGLGGGQL 222
Query: 231 STHSLVEK---VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD---STPMSVIDGS--Y 282
S S + + + +FSYC+ L +A + G+ A++ G STP+ + Y
Sbjct: 223 SLVSQMSQTSGISRRFSYCLPTL--LSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYY 280
Query: 283 YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
YVTLE IS+G + + + V IDSGTTL++L Y + V L +
Sbjct: 281 YVTLEAISIGNE---------RHMASAKQGNVIIDSGTTLSFLPKELYDGV---VSSLLK 328
Query: 343 GLLPSYPMDPA--WHLCYSGNINRDL-QGFPAMAFHFAGGADLVLDAESVFYQESSSVFC 399
+ DP W LC+ IN G P + F+GGA++ L + F + +++V C
Sbjct: 329 VVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNC 388
Query: 400 LAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
L + P+ E IIG +A N+ + YDL +K+L F+ C
Sbjct: 389 LTLTPASPTDE----FGIIGNLALANFLIGYDLEAKRLSFKPTVC 429
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 125/433 (28%), Positives = 195/433 (45%), Gaps = 54/433 (12%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQK-----SSQKAHDTRAHLHPGIS- 94
L+HRD++ + Q + AR +L ++ S D + + PG+
Sbjct: 66 SLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVDD 125
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC 151
++V +G PP Q V+D+GS +IWV+C+PCEQC A T FDP+ S +++ + C
Sbjct: 126 GSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSC 185
Query: 152 DSSYCTN------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
S+ C GG +C Y++ Y +G ++G + ET G T + V
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKG-----ELALETLTLGGTAVQGVAI 240
Query: 206 GCSHNNAHFSDEQFTGVFGLG-PATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
GC H N+ G+ GLG A S L G FSYC+ + L+LG
Sbjct: 241 GCGHRNSGLFVGA-AGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRG--AGGAGSLVLGR 297
Query: 265 GAILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
+ + + ++ + YYV L GI +G + L + +LF+ + + GV +D+G
Sbjct: 298 TEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGA-GGVVMDTG 356
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PA 371
T +T L AY LR F G + + P PA L CY DL G+ P
Sbjct: 357 TAVTRLPREAYAALRGA----FDGAMGALPRSPAVSLLDTCY------DLSGYASVRVPT 406
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
++F+F GA L L A ++ + +VFCLA PS +SI+G I Q+ + D
Sbjct: 407 VSFYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSS------SGISILGNIQQEGIQITVD 460
Query: 432 LVSKQLYFQRIDC 444
+ + F C
Sbjct: 461 SANGYVGFGPNTC 473
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 123/374 (32%), Positives = 178/374 (47%), Gaps = 47/374 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ ++ IG PP A++DTGS LIW +C PC C F+P+KS +YA+LPC S+
Sbjct: 85 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAM 144
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C + + C Y Y + S G + +E F F T + + + V FGC + N
Sbjct: 145 CNALYSPLC--FQNACVYQAFYGDSASSAGVLANETFTFGT-NSTRVAVPRVSFGCGNMN 201
Query: 212 AH--FSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAIL 268
A F+ G FG G SLV ++GS +FSYC+ ++ A + L G A L
Sbjct: 202 AGTLFNGSGMVG-FGRG-----ALSLVSQLGSPRFSYCL--TSFMSPATSRLYFGAYATL 253
Query: 269 EG---------DSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
STP V + Y++ + GIS+ +L IDP++F N+T GV I
Sbjct: 254 NSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVII 313
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYS-GNINRDLQGFPAMA 373
DSGTT+T+L AY ++ LP P+ + C+ R + P M
Sbjct: 314 DSGTTVTFLAQPAYAMVQGAFVAWVG--LPRANATPSDTFDTCFKWPPPPRRMVTLPEMV 371
Query: 374 FHFAGGADLVLDAESVFYQESSSV-FCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
HF GAD+ L E+ + + CLA+ PSD D SIIG QN+++ YDL
Sbjct: 372 LHF-DGADMELPLENYMVMDGGTGNLCLAMLPSD-------DGSIIGSFQHQNFHMLYDL 423
Query: 433 VSKQLYFQRIDCEL 446
+ L F C L
Sbjct: 424 ENSLLSFVPAPCNL 437
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 172/359 (47%), Gaps = 28/359 (7%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ IG P Q VLDTGS ++W++C+PC +C + F+PS S++++T+ CDS+
Sbjct: 8 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSAV 67
Query: 156 CT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C+ NDC G C Y + Y +G + G+ +E F G T + +V GC H+N
Sbjct: 68 CSQLDANDCHG--GGCLYEVSYGDGSYTVGSYATETLTF-----GTTSIQNVAIGCGHDN 120
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
+ + S L + G FSYC+ + + E + + E +
Sbjct: 121 VGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDS-ESSGTLEFGPESVPIGSI 179
Query: 272 STPMSV---IDGSYYVTLEGISLGEKMLDIDPN-LFKKNDTWSDAGVFIDSGTTLTWLVP 327
TP+ + YY+++ IS+G +LD P+ F+ ++T G+ IDSGT +T L
Sbjct: 180 FTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQT 239
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAE 387
SAY LR Q LP + CY + + + PA+ FHF+ GA +L A+
Sbjct: 240 SAYDALRDAFIAGTQ-HLPRADGISIFDTCYDLSALQSVS-IPAVGFHFSNGAGFILPAK 297
Query: 388 SVFY-QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ +S FC A P+D N LSI+G I QQ V++D + + F C+
Sbjct: 298 NCLIPMDSMGTFCFAFAPADSN------LSIMGNIQQQGIRVSFDSANSLVGFAIDQCQ 350
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 125/433 (28%), Positives = 194/433 (44%), Gaps = 54/433 (12%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQK-----SSQKAHDTRAHLHPGIS- 94
L+HRD++ + Q + AR +L ++ S D + + PG+
Sbjct: 66 SLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVDD 125
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC 151
++V +G PP Q V+D+GS +IWV+C+PCEQC A T FDP+ S +++ + C
Sbjct: 126 GSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSC 185
Query: 152 DSSYCTN------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
S+ C GG +C Y++ Y +G ++G + ET G T + V
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKG-----ELALETLTLGGTAVQGVAI 240
Query: 206 GCSHNNAHFSDEQFTGVFGLG-PATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
GC H N+ G+ GLG A S L G FSYC+ + L+LG
Sbjct: 241 GCGHRNSGLFVGA-AGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRG--AGGAGSLVLGR 297
Query: 265 GAILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
+ + + ++ + YYV L GI +G + L + LF+ + + GV +D+G
Sbjct: 298 TEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGA-GGVVMDTG 356
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PA 371
T +T L AY LR F G + + P PA L CY DL G+ P
Sbjct: 357 TAVTRLPREAYAALRGA----FDGAMGALPRSPAVSLLDTCY------DLSGYASVRVPT 406
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
++F+F GA L L A ++ + +VFCLA PS +SI+G I Q+ + D
Sbjct: 407 VSFYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSS------SGISILGNIQQEGIQITVD 460
Query: 432 LVSKQLYFQRIDC 444
+ + F C
Sbjct: 461 SANGYVGFGPNTC 473
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 124/428 (28%), Positives = 188/428 (43%), Gaps = 66/428 (15%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQK-----SSQKAHDTRAHLHPGIS- 94
L+HRD++ + Q + AR +L ++ S D + + PG+
Sbjct: 66 SLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVDD 125
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC 151
++V +G PP Q V+D+GS +IWV+C+PCEQC A T FDP+ S +++ + C
Sbjct: 126 GSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSC 185
Query: 152 DSSYCTN------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
S+ C GG +C Y++ Y +G ++G + ET G T + V
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKG-----ELALETLTLGGTAVQGVAI 240
Query: 206 GCSHNNAHFSDEQFTGVFGLG-PATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
GC H N+ G+ GLG A S L G FSYC+ +
Sbjct: 241 GCGHRNSGLFVGA-AGLLGLGWGAMSLVGQLGGAAGGVFSYCLAS--------------- 284
Query: 265 GAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
G S+ YYV L GI +G + L + +LF+ + + GV +D+GT +T
Sbjct: 285 ----RGAGGAGSLASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGA-GGVVMDTGTAVTR 339
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAFHF 376
L AY LR F G + + P PA L CY DL G+ P ++F+F
Sbjct: 340 LPREAYAALRGA----FDGAMGALPRSPAVSLLDTCY------DLSGYASVRVPTVSFYF 389
Query: 377 AGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQ 436
GA L L A ++ + +VFCLA PS +SI+G I Q+ + D +
Sbjct: 390 DQGAVLTLPARNLLVEVGGAVFCLAFAPSS------SGISILGNIQQEGIQITVDSANGY 443
Query: 437 LYFQRIDC 444
+ F C
Sbjct: 444 VGFGPNTC 451
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 121/425 (28%), Positives = 201/425 (47%), Gaps = 48/425 (11%)
Query: 42 LLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV--- 98
L+HRDS L D ++R N + S + ++ +H + P +P
Sbjct: 36 LIHRDSPLSPFYDPSLTPSERITNAAFRS----SSRLNRVSHFLDENNLPESLLIPENGE 91
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ + IG PPV +LA+ DTGS LIWV+C PC+ C F+P KS T+ CDS
Sbjct: 92 YLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATCDSQP 151
Query: 156 CTN------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG-FGCS 208
CT+ CG +C Y+ Y + + G +G+E +F ++ + +T + FGC
Sbjct: 152 CTSVPPSQRQCGKV-GQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIFGCG 210
Query: 209 -HNN--AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
+NN H SD+ V G S L ++G KFSYC+ L + + + L G
Sbjct: 211 VYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCL--LPFSSNSTSKLKFGSE 268
Query: 266 AILEGD---STPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
AI+ + STP+ + Y++ LE +++G+K++ +D + IDSG
Sbjct: 269 AIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPTG---------RTDGNIIIDSG 319
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
T LT+L + Y ++++ + + + + C+ RD+ P +AF F G
Sbjct: 320 TVLTYLEQTFYNNFVASLQEVL-SVESAQDLPFPFKFCFP---YRDMT-IPVIAFQFTGA 374
Query: 380 ADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
+ + + + ++ CLAV PS ++G +SI G +AQ ++ V YDL K++ F
Sbjct: 375 SVALQPKNLLIKLQDRNMLCLAVVPSSLSG-----ISIFGNVAQFDFQVVYDLEGKKVSF 429
Query: 440 QRIDC 444
DC
Sbjct: 430 APTDC 434
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 145/466 (31%), Positives = 215/466 (46%), Gaps = 61/466 (13%)
Query: 7 ILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNM 66
+L L++ T+ F S TS A K +L H DS N T + + +
Sbjct: 10 VLALAMFTI-FFSPAFSTSRRALEHPKMQKGFRVRLKHVDS---GKNLTKLERIRHGVKR 65
Query: 67 SMARFIYLSQKS--SQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIW 124
R L + + + + A + PG F + +IG PP A+LDTGS LIW
Sbjct: 66 GRNRLQRLQAMALVASSSSEIEAPVLPGNGE---FLMKLAIGTPPETYSAILDTGSDLIW 122
Query: 125 VKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGP 177
+C+PC QC FDP KS +++ L C S C + C + C Y Y +
Sbjct: 123 TQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCN---NGCEYLYSYGDYS 179
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
+QG + SE F GK + +V FGC +N Q G+ GLG S S ++
Sbjct: 180 STQGILASETLTF-----GKASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLK 234
Query: 238 KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVI--------DGSYYVTLEGI 289
+ KFSYC+ ++ + + L++G A + S+ + YY++LEGI
Sbjct: 235 E--PKFSYCLTTVD--DTKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGI 290
Query: 290 SLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP 349
S+G+ L I + F D S G+ IDSGTT+T+L SA+ + KE + P
Sbjct: 291 SVGDTRLPIKKSTFSLQDDGS-GGLIIDSGTTITYLEESAFNLVAKEFTAKI-----NLP 344
Query: 350 MDPA----WHLCY---SGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLA 401
+D + +C+ SG+ N ++ P + FHF GADL L AE+ +SS V CLA
Sbjct: 345 VDSSGSTGLDVCFTLPSGSTNIEV---PKLVFHF-DGADLELPAENYMIGDSSMGVACLA 400
Query: 402 VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+G S +SI G + QQN V +DL + L F C+LL
Sbjct: 401 MGSSS-------GMSIFGNVQQQNMLVLHDLEKETLSFLPTQCDLL 439
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 123/420 (29%), Positives = 186/420 (44%), Gaps = 56/420 (13%)
Query: 41 KLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP 97
+L+HRDS Y P + + S+ R + + S + + G
Sbjct: 32 ELIHRDSSKSPFYQPTQNKYERIANAVRRSINRVNHFYKYSLTSTPQSTVNSDKG----- 86
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG---ATTFDPSKSLTYATLPCDSS 154
+ +++SIG PP +DTGS L+W++C+PC+QC FDPS S +Y +PC S
Sbjct: 87 EYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSSSYQNIPCLSD 146
Query: 155 YCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
C +++R T D +G + E +++ GC + N
Sbjct: 147 TC------------HSMR-TTSCDVRGYLSVETLTLDSTTGYSVSFPKTMIGCGYRNTGT 193
Query: 215 SDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD-- 271
+G+ GLG S S L +G KFSYC+G + + + L G+ AI+ GD
Sbjct: 194 FHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLG--PWLPNSTSKLNFGDAAIVYGDGA 251
Query: 272 -STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+TP+ D YY+TLE S+G K+++ + N + + IDSGTT T+L
Sbjct: 252 MTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGN----EGNILIDSGTTFTFLPYD 307
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDP--AWHLCYSGNINRDLQGF--PAMAFHFAGGADLVL 384
Y V + + DP + LCY N GF P + HF GAD+ L
Sbjct: 308 VYYRFESAVAEYIN---LEHVEDPNGTFKLCY----NVAYHGFEAPLITAHFK-GADIKL 359
Query: 385 DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S F + S + CLA PS +I G +AQQN V Y+LV + F+ +DC
Sbjct: 360 YYISTFIKVSDGIACLAFIPSQT--------AIFGNVAQQNLLVGYNLVQNTVTFKPVDC 411
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 122/433 (28%), Positives = 196/433 (45%), Gaps = 46/433 (10%)
Query: 38 LVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKA-HDTRAHLHPGI 93
T+L+HRDS LYN T + + + S++R + + ++ + + + + I
Sbjct: 31 FTTELVHRDSPKSPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVESEI---I 87
Query: 94 STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLP 150
+ + ++ S+G PP LA+ DTGS LIW +C PC++C A FDP S TY L
Sbjct: 88 ANGGEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRDLS 147
Query: 151 CDSSYCTN-----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
CD+ C N C C Y+ Y + + G + + +++ G +
Sbjct: 148 CDTRQCQNLGESSSCSS-EQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVI 206
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCI--------GNLNYFEYA 256
GC N D++ +G+ GLG S S + VG KFSYC+ GN + +
Sbjct: 207 GCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHFG 266
Query: 257 YNMLILGEGAILEGDSTPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
N ++ G G STP+ D YY+TLE +S+G+K ++ + S+ +
Sbjct: 267 RNAVVSGSGV----QSTPLISKNPDTFYYLTLEAMSVGDKKIEFG----GSSFGGSEGNI 318
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAF 374
IDSGT+LT + + VE+ + D + L + DL+ P +
Sbjct: 319 IIDSGTSLTLFPVNFFTEFATAVENAV--INGERTQDASGLLSHCYRPTPDLK-VPVITA 375
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
HF GAD+VL + F S V CLA + + +I G +AQ N+ + YD+
Sbjct: 376 HF-NGADVVLQTLNTFILISDDVLCLAFNST-------QSGAIFGNVAQMNFLIGYDIQG 427
Query: 435 KQLYFQRIDCELL 447
K + F+ DC L
Sbjct: 428 KSVSFKPTDCTQL 440
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 115/375 (30%), Positives = 170/375 (45%), Gaps = 57/375 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++V IG PP Q V+D+GS +IWV+C+PC +C A FDP+ S T++ + C S+
Sbjct: 125 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFSAVSCGSAI 184
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C T+ CG C Y + Y +G ++GT+ ET G T + V GC H N
Sbjct: 185 CRTLRTSGCGD-SGGCEYEVSYGDGSYTKGTLA-----LETLTLGGTAVEGVAIGCGHRN 238
Query: 212 AHFSDEQFTGVFGL-----GPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLIL 262
F G GL GP S L G FSYC+ G+ + A L+L
Sbjct: 239 RGL----FVGAAGLLGLGWGP-MSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVL 293
Query: 263 GEGAILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
G + + + ++ YYV + GI +G++ L + LF+ + GV +D
Sbjct: 294 GRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTED-GGGGVVMD 352
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF----- 369
+GT +T L AY LR D F G + + P P L CY DL G+
Sbjct: 353 TGTAVTRLPQEAYAALR----DAFVGAVGALPRAPGVSLLDTCY------DLSGYTSVRV 402
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P ++F+F G A L L A ++ + ++CLA PS LSI+G I Q+ +
Sbjct: 403 PTVSFYFDGAATLTLPARNLLLEVDGGIYCLAFAPSS------SGLSILGNIQQEGIQIT 456
Query: 430 YDLVSKQLYFQRIDC 444
D + + F C
Sbjct: 457 VDSANGYIGFGPATC 471
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 115/406 (28%), Positives = 179/406 (44%), Gaps = 39/406 (9%)
Query: 52 PNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVP 111
PN T ++ + R +L + S D A++ P S + + G P
Sbjct: 69 PNRTWESLMSEKIRGDANRLRFLKRTSRSSKQDANANV-PVRSGSGEYIIQVDFGTPKQS 127
Query: 112 QLAVLDTGSSLIWVKCQPCEQCGATT--FDPSKSLTYATLPCDSSYC---TNDCGGYPDE 166
++DTGS + W+ C+ C+ C +T FDP+KS +Y CDS C + +CGG +
Sbjct: 128 MYTLIDTGSDVAWIPCKQCQGCHSTAPIFDPAKSSSYKPFACDSQPCQEISGNCGGN-SK 186
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN-NAHFSDEQFTGVFGL 225
C + + Y +G GT+ S+ G +L + FGC+ + + S G
Sbjct: 187 CQFEVSYGDGTQVDGTLASDAITL-----GSQYLPNFSFGCAESLSEDTSPSPGLMGLGG 241
Query: 226 GPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVI--DGS- 281
G + T + E G FSYC L + L+LG+ A + S + + D S
Sbjct: 242 GSLSLLTQAPTAELFGGTFSYC---LPSSSTSSGSLVLGKEAAVSSSSLKFTTLIKDPSI 298
Query: 282 ---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE 338
Y+VTL+ IS+G + + + S G IDSGTT+T LVPSAY LR
Sbjct: 299 PTFYFVTLKAISVGNTRISV-----PGTNIASGGGTIIDSGTTITHLVPSAYTALRDAFR 353
Query: 339 DLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVF 398
L P+ D CY +++ P + H DLVL E++ + S +
Sbjct: 354 QQLSSLQPTPVED--MDTCY--DLSSSSVDVPTITLHLDRNVDLVLPKENILITQESGLA 409
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
CLA +D SIIG + QQN+ + +D+ + Q+ F + C
Sbjct: 410 CLAFSSTD-------SRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 125/426 (29%), Positives = 189/426 (44%), Gaps = 65/426 (15%)
Query: 41 KLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP 97
+L+HRDS LY P + Q +N + S + + T P + +P
Sbjct: 31 ELIHRDSSKSPLYQPTQN---KYQHIVNAARR-----SINRANHFYKTALTNTPQSTVIP 82
Query: 98 ---VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC 151
+ + +S+G PP + DTGS ++W++C+PC++C T F PSKS TY +PC
Sbjct: 83 DHGEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPC 142
Query: 152 DSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
S C + QG + + E+S GC +N
Sbjct: 143 SSDLCKS-------------------GQQGNLSVDTLTLESSTGHPISFPKTVIGCGTDN 183
Query: 212 AHFSDEQFTGVFGL--GPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
+ +G+ GL GPA+ T L + +KFSYC+ + L G+ A++
Sbjct: 184 TVSFEGASSGIVGLGGGPASLITQ-LGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVVS 242
Query: 270 GD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
GD STP+ D YY+TLE S+G K ++ + ++ + + IDSGTTLT
Sbjct: 243 GDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFE----GSSNGGHEGNIIIDSGTTLTV 298
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVL 384
+ Y L V +L + + P ++LCYS + D FP + HF GAD+ L
Sbjct: 299 IPTDVYNNLESAVLELVKLKRVNDPTR-LFNLCYS--VTSDGYDFPIITTHFK-GADVKL 354
Query: 385 DAESVFYQESSSVFCLAVG------PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
S F + + CLA PSD+ +SI G +AQQN V YDL K +
Sbjct: 355 HPISTFVDVADGIVCLAFATTSAFIPSDV-------VSIFGNLAQQNLLVGYDLQQKIVS 407
Query: 439 FQRIDC 444
F+ DC
Sbjct: 408 FKPTDC 413
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 128/433 (29%), Positives = 194/433 (44%), Gaps = 49/433 (11%)
Query: 36 KRLVTKLLHRDSLLYNPNDTVDAQAQ-RTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS 94
K ++ + L RD+ D+++A+ Q + +S A L+ S D + IS
Sbjct: 88 KEILQERLKRDAARV---DSINARVQLAAMGVSKAEMKPLNGSSIDARFDAKDFSSSIIS 144
Query: 95 TVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYA 147
+ ++ +G PP VLDTGS ++W++C PC +C T F+P+ S TY
Sbjct: 145 GLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYR 204
Query: 148 TLPCDSSYCTN-DCGGYPDE--CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
+PC + C D G ++ C Y + Y +G + G +E F + V
Sbjct: 205 KVPCATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTFRGQ-----VIRRVA 259
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
GC H+N + + S + +FSYC+ + + A + LI G+
Sbjct: 260 LGCGHDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVDRSASGTA-SSLIFGK 318
Query: 265 GAILEGDS-TPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
AI + TP+ +D YYV L GIS+G + L P + D + GV IDSGT
Sbjct: 319 AAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGT 378
Query: 321 TLTWLVPSAYQTLRKEVEDLFQ---GLLPSYPMDPAWHLCYSGNINRDLQGF-----PAM 372
++T LV SAY T+R D F+ G L S + CY DL G P +
Sbjct: 379 SVTRLVDSAYSTMR----DAFRVGTGNLKSAGGFSLFDTCY------DLSGLKTVKVPTL 428
Query: 373 AFHFAGGADLVLDAESVFY-QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
FHF GGA + L A + +SS+ FC A + LSIIG I QQ Y V +D
Sbjct: 429 VFHFQGGAHISLPATNYLIPVDSSATFCFAFAGNT------GGLSIIGNIQQQGYRVVFD 482
Query: 432 LVSKQLYFQRIDC 444
++ ++ F+ C
Sbjct: 483 SLANRVGFKAGSC 495
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 117/372 (31%), Positives = 177/372 (47%), Gaps = 48/372 (12%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-----GATTFDPSKSLTYATLPCD 152
+ YVN +G PP LA+ DTGS L+WV C GA F PS+S TY+ L C
Sbjct: 101 LMYVN--VGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLSCQ 158
Query: 153 SSYCTNDCGGYPD---ECWYNIRYTNGPDSQGTIGSEQFNF---ETSDEGKTFLYDVGFG 206
S+ C D EC Y Y +G + G + +E F+F EG+ + V FG
Sbjct: 159 SAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVPRVSFG 218
Query: 207 CSHNNA-HFSDEQFTGVFGLGPATSSTHSLVEKVGS------KFSYCIGNLNYFEYAYNM 259
CS +A F + G+ GLG + SLV ++G+ +FSYC+ + +
Sbjct: 219 CSTGSAGSFRSD---GLVGLG---AGALSLVSQLGAAARIARRFSYCLVPPYAAANSSST 272
Query: 260 LILGEGAILE---GDSTPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
L G A++ STP+ S +D Y V LE +++ + ++ N + +
Sbjct: 273 LSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQ------DVASANSSR----I 322
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQGFPAM 372
+DSGTTLT+L P+ + L E+E + L + P + LCY G + G P +
Sbjct: 323 IVDSGTTLTFLDPALLRPLVAELERRIR-LPRAQPPEQLLQLCYDVQGKSQAEDFGIPDV 381
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
F GGA + L E+ F CL + P + + +SI+G IAQQN++V YDL
Sbjct: 382 TLRFGGGASVTLRPENTFSLLEEGTLCLVLVPVSES----QPVSILGNIAQQNFHVGYDL 437
Query: 433 VSKQLYFQRIDC 444
++ + F +DC
Sbjct: 438 DARTVTFAAVDC 449
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 159/364 (43%), Gaps = 43/364 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSS 154
+ V +G P V DTGS WV+CQPC Q F P+KS TYA + C SS
Sbjct: 165 YVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTSS 224
Query: 155 YC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
YC T C G C Y ++Y +G + G + G + D FGC
Sbjct: 225 YCSDLDTRGCSG--GHCLYAVQYGDGSYTVGFYAQDTLTL-----GYDTVKDFRFGCGEK 277
Query: 211 NAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGEG 265
N + G+ GLG TS +K F+YCI + ++
Sbjct: 278 NRGLFGKA-AGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSGTGFLDFGPGAPAAANA 336
Query: 266 AILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ TPM V +G YYV + GI +G +L I +F SDAG +DSGT +T
Sbjct: 337 RL-----TPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVF------SDAGALVDSGTVIT 385
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGA 380
L PSAY+ LR +GL Y PA+ + CY + PA++ F GGA
Sbjct: 386 RLPPSAYEPLRSAFAKGMEGL--GYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGA 443
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
L +DA + Y S CLA +D + D++I+G Q+ Y+V YDL K + F
Sbjct: 444 CLDVDASGILYVADVSQACLAFAANDDD----TDMTIVGNTQQKTYSVLYDLGKKVVGFA 499
Query: 441 RIDC 444
C
Sbjct: 500 PGAC 503
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 129/376 (34%), Positives = 180/376 (47%), Gaps = 43/376 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDS- 153
+ + +IG PPV A+ DTGS LIW +C PC QC ++PS S T+A LPC+S
Sbjct: 86 YLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSS 145
Query: 154 -SYCTNDCGGYPD----ECWYNIRYTNGPDSQGTIGSEQFNFETSDEG-KTFLYDVGFGC 207
S C G C YN+ Y +G S GSE F F +S +T + + FGC
Sbjct: 146 LSMCAAALAGTTPPPGCTCMYNMTYGSGWTSV-YQGSETFTFGSSTPANQTGVPGIAFGC 204
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGA 266
S+ + F+ +G+ GLG + SLV ++G KFSYC+ + + L+LG A
Sbjct: 205 SNASGGFNTSSASGLVGLG---RGSLSLVSQLGVPKFSYCLTPYQDTN-STSTLLLGPSA 260
Query: 267 ILEG----DSTPM------SVIDGSYYVTLEGISLGEKMLDIDPN-LFKKNDTWSDAGVF 315
L STP + + YY+ L GISLG L I L K D G
Sbjct: 261 SLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKAD--GTGGFI 318
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA---WHLCYS-GNINRDLQGFPA 371
IDSGTT+T L +AYQ +R V L LP+ A LC+ + P+
Sbjct: 319 IDSGTTITLLGNTAYQQVRAAVVSLVT--LPTTDGGSAATGLDLCFELPSSTSAPPTMPS 376
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
M HF GAD+VL A+S + S+++CLA + + +SI+G QQN ++ YD
Sbjct: 377 MTLHF-DGADMVLPADS-YMMLDSNLWCLA-----MQNQTDGGVSILGNYQQQNMHILYD 429
Query: 432 LVSKQLYFQRIDCELL 447
+ + L F C L
Sbjct: 430 VGQETLTFAPAKCSTL 445
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 113/397 (28%), Positives = 185/397 (46%), Gaps = 56/397 (14%)
Query: 71 FIYLSQKSSQKAHDTRAHLHPGISTV---PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKC 127
I+ +S + +T++ P +TV V+ + +G PP A++DTGS + W +C
Sbjct: 34 LIHRRSNASSRVSNTQSGSSPYANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQC 93
Query: 128 QPCEQC---GATTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIG 184
PC C A FDPSKS T+ CD C Y + Y + + GT+
Sbjct: 94 LPCVHCYEQNAPIFDPSKSSTFKEKRCDG-----------HSCPYEVDYFDHTYTMGTLA 142
Query: 185 SEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGL--GPATSSTHSLVEKVGSK 242
+E ++ + + GC HNN+ F F+G+ GL GP+ SL+ ++G +
Sbjct: 143 TETITLHSTSGEPFVMPETIIGCGHNNSWF-KPSFSGMVGLNWGPS-----SLITQMGGE 196
Query: 243 F----SYCIG--NLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGE 293
+ SYC + + N ++ G+G + ST M + G YY+ L+ +S+G
Sbjct: 197 YPGLMSYCFSGQGTSKINFGANAIVAGDGVV----STTMFMTTAKPGFYYLNLDAVSVGN 252
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA 353
++ F + + IDSGTTLT+ S +R+ VE + + + DP
Sbjct: 253 TRIETMGTTFHA----LEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAA---DPT 305
Query: 354 WH--LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPSDINGE 410
+ LCY+ + + FP + HF+GG DLVLD +++ + ++ VFCLA+ + E
Sbjct: 306 GNDMLCYNSD---TIDIFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQE 362
Query: 411 RFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+I G AQ N+ V YD S + F +C L
Sbjct: 363 -----AIFGNRAQNNFLVGYDSSSLLVSFSPTNCSAL 394
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 125/420 (29%), Positives = 189/420 (45%), Gaps = 72/420 (17%)
Query: 61 QRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV----------FYVNFSIGQPPV 110
+ TL R Y+ K S + ++ L T+P + + +IG P V
Sbjct: 81 EETLRRDQLRAAYIQAKVSSRYNNVAKELQQSAVTIPTSSGYSLGTTEYVITVTIGTPAV 140
Query: 111 PQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYCT------ND 159
Q+ +DTGS + WV+C PC + C + FDP+ S TY+ C S+ C N
Sbjct: 141 TQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGSAQCAQLGDEGNG 200
Query: 160 CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQF 219
C +C Y ++Y +G ++ GT GS+ + +SD K+F FGCSH A F E
Sbjct: 201 C--LKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSDAVKSFQ----FGCSHRAAGFVGE-L 253
Query: 220 TGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS-TP 274
G+ GLG T SLV + G FSYC+ + + L GA S TP
Sbjct: 254 DGLMGLG---GDTESLVSQTAATYGKAFSYCLPPPSSSGGGFLTLGAAGGASSSRYSHTP 310
Query: 275 MS--VIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQT 332
M + Y V L+GI++ ML++ ++F S A V +DSGT +T L P+AYQ
Sbjct: 311 MVRFSVPTFYGVFLQGITVAGTMLNVPASVF------SGASV-VDSGTVITQLPPTAYQA 363
Query: 333 LRKEVEDLFQGLLPSYP-MDPAWHL--CYSGNINRDLQGF-----PAMAFHFAGGADLVL 384
LR F+ + +YP P L C+ D GF P + F+ GA + L
Sbjct: 364 LRTA----FKKEMKAYPSAAPVGSLDTCF------DFSGFNTITVPTVTLTFSRGAAMDL 413
Query: 385 DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
D + Y CLA + +G D I+G + Q+ + + +D+ + + F+ C
Sbjct: 414 DISGILYAG-----CLAFTATAHDG----DTGILGNVQQRTFEMLFDVGGRTIGFRSGAC 464
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 112/358 (31%), Positives = 162/358 (45%), Gaps = 33/358 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +GQP P VLDTGS + W++CQPC C T FDP S ++A+LPC+S
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQ 214
Query: 156 CT--NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH 213
C G +C Y + Y +G + G +E F S + DV GC H+N
Sbjct: 215 CQALETSGCRASKCLYQVSYGDGSFTVGEFVTETLTFGNSG----MINDVAVGCGHDNEG 270
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDST 273
+ G S T + S FSYC+ ++ + + L A + +
Sbjct: 271 LFVGSAGLLGLGGGPLSLTSQM---KASSFSYCL--VDRDSSSSSDLEFNSAAPSDSVNA 325
Query: 274 PM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAY 330
P+ +D YYV L G+S+G ++L I PNLF+ +D+ G+ +DSGT +T L AY
Sbjct: 326 PLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGY-GGIIVDSGTAITRLQTQAY 384
Query: 331 QTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAE 387
TLR D F P + L CY + + P ++F FAGG L L +
Sbjct: 385 NTLR----DAFVSRTPYLKKTNGFALFDTCYDLSSQSRVT-IPTVSFEFAGGKSLQLPPK 439
Query: 388 SVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ +S FC A P+ LSIIG + QQ V YDL + + F C
Sbjct: 440 NYLIPVDSVGTFCFAFAPTT------SSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 115/421 (27%), Positives = 186/421 (44%), Gaps = 49/421 (11%)
Query: 36 KRLVTKLLHRDSLLYNPND----TVDAQAQRTLNM--SMARFIYLSQKSSQKAHDTRAHL 89
++ + K++HRD L + +D +D + +R S+ R + S + D +
Sbjct: 70 EKWMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYRVDDFGTDV 129
Query: 90 HPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLT 145
G+ ++V +G PP Q V+D+GS ++WV+CQPC QC FDP+ S +
Sbjct: 130 ISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSAS 189
Query: 146 YATLPCDSSYCTN--DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
+ + C SS C + G + C Y + Y +G ++GT+ E F G+T + V
Sbjct: 190 FTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF-----GRTMVRSV 244
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
GC H N + G + S L + G FSYC+ ++ + L+ G
Sbjct: 245 AIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCL--VSRGTDSSGSLVFG 302
Query: 264 EGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
A+ G + V + YY+ L G+ +G + I +F+ + D GV +D+G
Sbjct: 303 REALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTE-LGDGGVVMDTG 361
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA---WHLCYSGNINRDLQGF-----PA 371
T +T L AYQ R D F + P + CY DL GF P
Sbjct: 362 TAVTRLPTLAYQAFR----DAFLAQTANLPRATGVAIFDTCY------DLLGFVSVRVPT 411
Query: 372 MAFHFAGGADLVLDAESVFY-QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
++F+F+GG L L A + + + FC A PS LSI+G I Q+ +++
Sbjct: 412 VSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPST------SGLSILGNIQQEGIQISF 465
Query: 431 D 431
D
Sbjct: 466 D 466
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 112/371 (30%), Positives = 167/371 (45%), Gaps = 58/371 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++V IG PP Q V+D+GS +IWV+C+PC +C A FDP+ S T++ +PC S+
Sbjct: 127 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAV 186
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C T+ CG C Y + Y +G ++G + ET G T + V GC H N
Sbjct: 187 CRTLRTSGCGD-SGGCDYEVSYGDGSYTKGALA-----LETLTLGGTAVEGVAIGCGHRN 240
Query: 212 AHFSDEQFTGVFGL-----GPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA 266
F G GL GP S L G FSYC+ + L+LG
Sbjct: 241 RGL----FVGAAGLLGLGWGP-MSLVGQLGGAAGGAFSYCLAS-----RGAGSLVLGRSE 290
Query: 267 ILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
+ + + ++ YYV L GI +G++ L + +LF+ + + GV +D+GT
Sbjct: 291 AVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGA-GGVVMDTGTA 349
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMA 373
+T L AY LR D F + + P P L CY DL G+ P ++
Sbjct: 350 VTRLPQEAYAALR----DAFVAAVGALPRAPGVSLLDTCY------DLSGYTSVRVPTVS 399
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
F+F G A L L A ++ + ++CLA PS SI+G I Q+ + D
Sbjct: 400 FYFDGAATLTLPARNLLLEVDGGIYCLAFAPSS------SGPSILGNIQQEGIQITVDSA 453
Query: 434 SKQLYFQRIDC 444
+ + F C
Sbjct: 454 NGYIGFGPTTC 464
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 133/459 (28%), Positives = 206/459 (44%), Gaps = 62/459 (13%)
Query: 20 TRIFT-STTAAPAAGKPKRLVTKLLHRDSL--------LYNPNDTVDAQAQRTLNMSMAR 70
T+ FT TT+ P++ L +L H D+L L+N DA ++L +S+A
Sbjct: 57 TQTFTDQTTSEPSSSATTFLSVQLHHIDALSSDKSSQDLFNSRLVRDAARVKSL-ISLAA 115
Query: 71 FIYLSQKSSQKAHDTRAHLHPGISTVPV---------FYVNFSIGQPPVPQLAVLDTGSS 121
+ + TRA PG S+ + ++ +G P VLDTGS
Sbjct: 116 TV-------GGTNLTRAR-GPGFSSSVISGLAQGSGEYFTRLGVGTPARYVYMVLDTGSD 167
Query: 122 LIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYT 174
++W++C PC +C + T FDP+KS ++A +PC S C C C Y + Y
Sbjct: 168 IVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSPLCRRLDYPGCSTKKQICLYQVSYG 227
Query: 175 NGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS 234
+G + G +E F + G+ L GC H+N + S
Sbjct: 228 DGSFTVGEFSTETLTFRGTRVGRVVL-----GCGHDNEGLFVGAAGLLGLGRGRLSFPSQ 282
Query: 235 LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS-TPM---SVIDGSYYVTLEGIS 290
+ + SKFSYC+G+ + + ++ G+ AI TP+ +D YYV L GIS
Sbjct: 283 IGRRFNSKFSYCLGDRSASSRP-SSIVFGDSAISRTTRFTPLLSNPKLDTFYYVELLGIS 341
Query: 291 L-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP 349
+ G ++ I +LFK D+ + GV IDSGT++T L +AY LR D F +
Sbjct: 342 VGGTRVSGISASLFKL-DSTGNGGVIIDSGTSVTRLTRAAYVALR----DAFLVGASNLK 396
Query: 350 MDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPS 405
P + L C+ + +++ P + HF GAD+ L A + ++S FC A +
Sbjct: 397 RAPEFSLFDTCFDLSGKTEVK-VPTVVLHFR-GADVPLPASNYLIPVDNSGSFCFAFAGT 454
Query: 406 DINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
LSIIG I QQ + V YDL + ++ F C
Sbjct: 455 A------SGLSIIGNIQQQGFRVVYDLATSRVGFAPRGC 487
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 117/372 (31%), Positives = 175/372 (47%), Gaps = 50/372 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP-------CEQCGATTFDPSKSLTYATLP 150
+ YVN +G PP LA+ DTGS L+WV C + G F P++S TY+ L
Sbjct: 104 LMYVN--VGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLS 161
Query: 151 CDSSYCTNDCGGYPD---ECWYNIRYTNGPDSQGTIGSEQFNF-ETSDEGKTFLYDVGFG 206
C S+ C D EC Y Y +G + G + +E F+F + +G+ + V FG
Sbjct: 162 CQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRVNFG 221
Query: 207 CSHNNAH-FSDEQFTGVFGLGPATSSTHSLVEKVGS------KFSYCIGNLNYFEYAYNM 259
CS +A F + G+ GLG + SLV ++G+ K SYC+ +Y + +
Sbjct: 222 CSTASAGTFRSD---GLVGLG---AGAFSLVSQLGATTHIDRKLSYCL-IPSYDANSSST 274
Query: 260 LILGEGAILE---GDSTPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
L G A++ STP+ S +D Y V LE +++G + + D+ +
Sbjct: 275 LNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEV-----------ATHDSRI 323
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQGFPAM 372
+DSGTTLT+L P+ L E+E + L P + LCY G D G P +
Sbjct: 324 IVDSGTTLTFLDPALLGPLVTELERRIK-LQRVQPPEQLLQLCYDVQGKSETDNFGIPDV 382
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
F GGA + L E+ F CL + P + +SI+G IAQQN++V YDL
Sbjct: 383 TLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPV----SESQPVSILGNIAQQNFHVGYDL 438
Query: 433 VSKQLYFQRIDC 444
++ + F DC
Sbjct: 439 DARTVTFAAADC 450
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 124/441 (28%), Positives = 190/441 (43%), Gaps = 63/441 (14%)
Query: 42 LLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAH-DTRAHLHPGISTVP 97
L+HRDS L + PN T + Q + +L S Q H D + L P
Sbjct: 31 LIHRDSPLSPLHTPNLTFSDRLQAS---------FLRAISRQSRHVDFQTDLLPSGGE-- 79
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSS 154
+ +N SIG PP P LA+ DTGS L W++ +PC+QC FDPS S T+ LPC ++
Sbjct: 80 -YMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTTA 138
Query: 155 YC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC- 207
C C P C Y Y + + G + S+ T + +V FGC
Sbjct: 139 PCNALDESARSCTD-PTTCGYTYSYGDHSYTTGYLASDTV---TVGNASVQIRNVAFGCG 194
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYF-------EYAYNML 260
+ N +F ++ V G S L + +G KFSYC+ L A + +
Sbjct: 195 TRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRI 254
Query: 261 ILGEGAILEGDS--------TPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFK------ 304
+ G+ + S TP+ + S YY+T+E I++G K L + K
Sbjct: 255 VFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDS 314
Query: 305 -KNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNIN 363
+ + + IDSGTTLT+L Y L + + + + + + LC+
Sbjct: 315 GSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFKS--G 372
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
++ P M HF GGAD+ L + F + + C + P++ D+ I G +AQ
Sbjct: 373 KEEVELPLMKVHFRGGADVELKPVNTFVRAEEGLVCFTMLPTN-------DVGIYGNLAQ 425
Query: 424 QNYNVAYDLVSKQLYFQRIDC 444
N+ V YDL + + F DC
Sbjct: 426 MNFVVGYDLGKRTVSFLPADC 446
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 117/381 (30%), Positives = 170/381 (44%), Gaps = 46/381 (12%)
Query: 91 PGISTVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQCGAT---TFDPSK 142
P S +P+ + VN +G P + DTGS L W +CQPC + C A FDPS
Sbjct: 142 PAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPST 201
Query: 143 SLTYATLPCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDE 195
S TY+ + C S+ C+ N G C Y I+Y + + G ++ +D
Sbjct: 202 SKTYSNISCTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQNDV 261
Query: 196 GKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCI----GNL 250
F+ FGC NN + G+ GLG S +K G FSYC+ G+
Sbjct: 262 FDGFM----FGCGQNNKGLFGKT-AGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSN 316
Query: 251 NYFEYAYNMLILGEGAILEGDS-TPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKND 307
+ + + A+ G + TP + G+ Y++ + GIS+G K L I P LF+
Sbjct: 317 GHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQ--- 373
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINR 364
+AG IDSGT +T L +AY +L+ F+ + YP PA L CY + N
Sbjct: 374 ---NAGTIIDSGTVITRLPSTAYGSLKSA----FKQFMSKYPTAPALSLLDTCYDLS-NY 425
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
P ++F+F G A++ LD + +S CLA NG+ + I G I QQ
Sbjct: 426 TSISIPKISFNFNGNANVELDPNGILITNGASQVCLAFAG---NGDD-DSIGIFGNIQQQ 481
Query: 425 NYNVAYDLVSKQLYFQRIDCE 445
V YD+ QL F C
Sbjct: 482 TLEVVYDVAGGQLGFGYKGCS 502
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 110/360 (30%), Positives = 167/360 (46%), Gaps = 33/360 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G P QL VLDTGS + W++C+PC C + ++P+ S +Y + C ++
Sbjct: 145 YFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSSSYKLVGCQANL 204
Query: 156 CTN-DCGG--YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
C D G C Y + Y +G +QG +E G L +V GC H+N
Sbjct: 205 CQQLDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTL-----GGAPLQNVAIGCGHDNE 259
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS 272
+ G + S L ++ G FSYC+ ++ + + L G A+ G
Sbjct: 260 GLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCL--VDRDSESSSTLQFGRAAVPNGAV 317
Query: 273 -TPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
PM S +D YYV+L GIS+G KML I ++F D + GV +DSGT +T L +
Sbjct: 318 LAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGI-DASGNGGVIVDSGTAVTRLQTA 376
Query: 329 AYQTLRKEVEDLFQG---LLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
AY +LR D F+ LPS + CY + + P + FHF+GG + L
Sbjct: 377 AYDSLR----DAFRAGTKNLPSTDGVSLFDTCYDLSSKESVD-VPTVVFHFSGGGSMSLP 431
Query: 386 AESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
A++ +S FC A P+ LSI+G I QQ V++D + Q+ F C
Sbjct: 432 AKNYLVPVDSMGTFCFAFAPTS------SSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 140/466 (30%), Positives = 209/466 (44%), Gaps = 72/466 (15%)
Query: 12 LITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSM 68
L TLPFT + P L+H DS YN + T ++Q N +M
Sbjct: 15 LATLPFTE-----------PSKTPSSFTIDLIHHDSPPSPFYNSSMT---RSQLIRNAAM 60
Query: 69 ARFIYLSQKSSQKAHDTRAHLH---PGISTVPV---FYVNFSIGQPPVPQLAVLDTGSSL 122
R I + + S + L P +P + + IG P V +LA+ DTGS L
Sbjct: 61 -RSISRANQLSLSLSHSLNQLKESSPEPIIIPNNGNYLMRIYIGTPSVERLAIADTGSDL 119
Query: 123 IWVKCQPCE--QCGATT---FDPSKSLTYATLPCDSSYCTN------DCGGYPDECWYNI 171
WV+C PC+ +C A +DP S T+ LPCDS CT C Y D C Y
Sbjct: 120 TWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCDSQPCTQLPYSQYVCSDYGD-CIYAY 178
Query: 172 RYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDE--QFTGVFGLGPAT 229
Y + S G + S+ + + FGC N +D+ + TG+ GLG
Sbjct: 179 TYGDNSYSYGGLSSDSIRLMLLQ--LHYNSKICFGCGFQNKFTADKSGKTTGIVGLGAGP 236
Query: 230 SSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD---STPMSVIDGS--YY 283
S S L +++G KFSYC+ L + + + L GE AI++G+ STP+ + YY
Sbjct: 237 LSLVSQLGDEIGHKFSYCL--LPFSSNSNSKLKFGEAAIVQGNGVVSTPLIIKPDLPFYY 294
Query: 284 VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQ---TLRKEVEDL 340
+ LEGI++G K + +D + IDSG+TLT+L S Y +L KE +
Sbjct: 295 LNLEGITVGAKTVKTG---------QTDGNIIIDSGSTLTYLEESFYNEFVSLVKETVAV 345
Query: 341 FQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCL 400
+ YP D C++ + P + FHF GG D+VL + ++ C
Sbjct: 346 EEDQYIPYPFD----FCFT--YKEGMSTPPDVVFHFTGG-DVVLKPMNTLVLIEDNLICS 398
Query: 401 AVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
V PS +G ++I G + Q +++V YD+ ++ F DC L
Sbjct: 399 TVVPSHFDG-----IAIFGNLGQIDFHVGYDIQGGKVSFAPTDCSL 439
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 131/433 (30%), Positives = 202/433 (46%), Gaps = 63/433 (14%)
Query: 40 TKLLHRDSLL-------YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPG 92
T L HRDSLL + D + +R+L+ S A L++ ++ A ++ + PG
Sbjct: 32 TSLFHRDSLLSPLEFSSLSHYDRLANAFRRSLSRSAA---LLNRAATSGAVGLQSSIGPG 88
Query: 93 ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATL 149
+ ++ SIG PPV L + DTGS L W +C PC +C F+P KS +++ +
Sbjct: 89 SGE---YLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHV 145
Query: 150 PCDSSYC-TNDCG--GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
PC++ C D G G C Y+ Y + S+G +G E+ +S G
Sbjct: 146 PCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSSVKSV------IG 199
Query: 207 CSHNNAHFSDEQF---TGVFGLGPATSSTHSLVEK---VGSKFSYCIGNLNYFEYAYNML 260
C H S F +GV GLG S S + + + +FSYC+ L +A +
Sbjct: 200 C----GHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTL--LSHANGKI 253
Query: 261 ILGEGAILEGD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG-V 314
GE A++ G STP+ + YY+TLE IS+G +++ ++ G V
Sbjct: 254 NFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGN----------ERHMAFAKQGNV 303
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP--AWHLCYSGNINRDLQ-GFPA 371
IDSGTTLT L Y + V L + + DP + LC+ IN G P
Sbjct: 304 IIDSGTTLTILPKELYDGV---VSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPV 360
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+ HF+GGA++ L + F + + +V CL + + E IIG +AQ N+ + YD
Sbjct: 361 ITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTE----FGIIGNLAQANFLIGYD 416
Query: 432 LVSKQLYFQRIDC 444
L +K+L F+ C
Sbjct: 417 LEAKRLSFKPTVC 429
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 120/441 (27%), Positives = 202/441 (45%), Gaps = 55/441 (12%)
Query: 28 AAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMS---MARFIYLSQKSSQKAHD 84
AA A +R + + L R+++ ++ Q +RTL ++ + R+ +++ + +
Sbjct: 89 AANATASYERRLKEKLRREAVRVR---GLERQIERTLTLNKDPVNRYENVAEVDADFGGE 145
Query: 85 TRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPS 141
+ + G ++ +G P Q VLDTGS + W++C+PC +C + F+PS
Sbjct: 146 VVSGMEQGSGE---YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPS 202
Query: 142 KSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK 197
S +++T+ CDS+ C+ DC + C Y Y +G S G+ +E F G
Sbjct: 203 YSASFSTVGCDSAVCSQLDAYDC--HSGGCLYEASYGDGSYSTGSFATETLTF-----GT 255
Query: 198 TFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAY 257
T + +V GC H N + A S + + + G FSYC+ ++ +
Sbjct: 256 TSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYCL--VDRESDSS 313
Query: 258 NMLILGEGAILEGDS-TPMSV---IDGSYYVTLEGISLGEKMLD-IDPNLFKKNDTWSDA 312
L G ++ G TP+ + YY+++ IS+G +LD I P +F+ ++T
Sbjct: 314 GPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHG 373
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLF---QGLLPSYPMDPAWHLCYSGNINRDLQGF 369
G IDSGT +T LV SAY +R D F G LP + CY DL G
Sbjct: 374 GFIIDSGTVVTRLVTSAYDAVR----DAFVAGTGQLPRTDAVSIFDTCY------DLSGL 423
Query: 370 -----PAMAFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
P + FHF+ GA L+L A++ + ++ FC A P+ +SI+G Q
Sbjct: 424 QFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAA------SSVSIMGNTQQ 477
Query: 424 QNYNVAYDLVSKQLYFQRIDC 444
Q+ V++D + + F C
Sbjct: 478 QHIRVSFDSANSLVGFAFDQC 498
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 122/363 (33%), Positives = 162/363 (44%), Gaps = 41/363 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSS 154
+ V +G P V DTGS WV+CQPC + FDP+KS TYA + C SS
Sbjct: 96 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSS 155
Query: 155 YCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
YC++ C G C Y I+Y +G + G + D K F FGC
Sbjct: 156 YCSDLYVSGCSG--GHCLYGIQYGDGSYTIGFYAQDTLTLAY-DTIKNFR----FGCGEK 208
Query: 211 NAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA-IL 268
N G+ GLG TS +K G F+YC L L LG GA
Sbjct: 209 NRGLFGRA-AGLLGLGRGKTSLPVQAYDKYGGVFAYC---LPATSAGTGFLDLGPGAPAA 264
Query: 269 EGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
TPM V G YYV + GI +G +L I ++F S AG +DSGT +T L
Sbjct: 265 NARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVF------STAGTLVDSGTVITRLP 318
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CY--SGNINRDLQGFPAMAFHFAGGAD 381
PSAY LR QGL Y PA+ + CY +G+ + PA++ F GGA
Sbjct: 319 PSAYAPLRSAFSKAMQGL--GYSAAPAFSILDTCYDLTGHKGGSIA-LPAVSLVFQGGAC 375
Query: 382 LVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
L +DA + Y S CLA P+ + D++I+G Q+ + V YD+ K + F
Sbjct: 376 LDVDASGILYVADVSQACLAFAPNADD----TDVAIVGNTQQKTHGVLYDIGKKIVGFAP 431
Query: 442 IDC 444
C
Sbjct: 432 GAC 434
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 122/363 (33%), Positives = 162/363 (44%), Gaps = 41/363 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSS 154
+ V +G P V DTGS WV+CQPC + FDP+KS TYA + C SS
Sbjct: 161 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSS 220
Query: 155 YCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
YC++ C G C Y I+Y +G + G + D K F FGC
Sbjct: 221 YCSDLYVSGCSG--GHCLYGIQYGDGSYTIGFYAQDTLTLAY-DTIKNFR----FGCGEK 273
Query: 211 NAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA-IL 268
N G+ GLG TS +K G F+YC L L LG GA
Sbjct: 274 NRGLFGRA-AGLLGLGRGKTSLPVQAYDKYGGVFAYC---LPATSAGTGFLDLGPGAPAA 329
Query: 269 EGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
TPM V G YYV + GI +G +L I ++F S AG +DSGT +T L
Sbjct: 330 NARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVF------STAGTLVDSGTVITRLP 383
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CY--SGNINRDLQGFPAMAFHFAGGAD 381
PSAY LR QGL Y PA+ + CY +G+ + PA++ F GGA
Sbjct: 384 PSAYAPLRSAFSKAMQGL--GYSAAPAFSILDTCYDLTGHKGGSI-ALPAVSLVFQGGAC 440
Query: 382 LVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
L +DA + Y S CLA P+ + D++I+G Q+ + V YD+ K + F
Sbjct: 441 LDVDASGILYVADVSQACLAFAPNADD----TDVAIVGNTQQKTHGVLYDIGKKIVGFAP 496
Query: 442 IDC 444
C
Sbjct: 497 GAC 499
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 132/465 (28%), Positives = 201/465 (43%), Gaps = 61/465 (13%)
Query: 12 LITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARF 71
L+ L F ++ + +S+T A L KL H D T + + +R + +S R
Sbjct: 9 LVLLCFRASLVTSSSTGAG-------LRMKLTHVDD---KAGYTTEERVRRAVAVSRERL 58
Query: 72 IYLSQKSSQKAH-DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKC--- 127
Y Q+ +A D A +H + + IG PP A++DTGS+LIW +C
Sbjct: 59 AYTQQQQQLRASGDVSAPVHLATRQ---YIAEYLIGDPPQRAAALIDTGSNLIWTQCGTT 115
Query: 128 ---QPCEQCGATTFDPSKSLTYATLPC-DSSYCTNDCG----GYPDECWYNIRYTNGPDS 179
+ C + ++ S+S T+A +PC DS+ G G C + Y G
Sbjct: 116 CGLKACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHLCGLDGSCTFAASYGAG-SV 174
Query: 180 QGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV 239
G++G+E F F++ +GFGC + + G GL SLV +
Sbjct: 175 FGSLGTEAFTFQSGAA------KLGFGCV-SLTRITKGALNGASGLIGLGRGRLSLVSQT 227
Query: 240 GS-KFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVI-----------DGSYYVTLE 287
G+ KFSYC+ A + L +G A L G ++ I YY+ L
Sbjct: 228 GATKFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLV 287
Query: 288 GISLGEKMLDIDPNLFKKNDT----WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG 343
GIS+GE L I F+ WS GV ID+G+ +T L +AY L EV
Sbjct: 288 GISVGETKLPIPSAAFELRRVAAGYWS-GGVIIDTGSPVTSLAEAAYSALSDEVARQLNR 346
Query: 344 LLPSYPMDPAWHLCYSGNINRDL-QGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV 402
L P D LC + +D+ + P + FHF GGAD+ + A S + S C+ +
Sbjct: 347 SLVQPPADTGLDLCVA---RQDVDKVVPVLVFHFGGGADMAVSAGSYWGPVDKSTACMLI 403
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
E ++IG QQ+ ++ YD+ +L FQ DC +L
Sbjct: 404 -------EEGGYETVIGNFQQQDVHLLYDIGKGELSFQTADCSVL 441
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 119/419 (28%), Positives = 186/419 (44%), Gaps = 52/419 (12%)
Query: 41 KLLHRDSLL------YNPNDTVDAQAQRTLNMSMARFIYLSQK---SSQKAHDTRAHLHP 91
KL+HRD + Y+ + A+ QR LS + SS + A +
Sbjct: 74 KLVHRDKITAFNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATSSYSVEEFGAEVVS 133
Query: 92 GIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYA 147
G++ +++ +G PP Q V+D+GS ++WV+CQPC QC T FDP+ S ++
Sbjct: 134 GMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFM 193
Query: 148 TLPCDSSYCTN--DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
+PC SS C + G + C Y + Y +G ++GT+ E F G+T + +V
Sbjct: 194 GVPCSSSVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTF-----GRTVVRNVAI 248
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
GC H N + G + S L + G FSYC+ ++ + L G G
Sbjct: 249 GCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTDSAGSLEFGRG 306
Query: 266 AILEGDS-TPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
A+ G + P+ YY+ L G+ +G + I ++F+ N+ + GV +D+GT
Sbjct: 307 AMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNE-MGNGGVVMDTGTA 365
Query: 322 LTWLVPSAYQTLRKEVEDLF---QGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMA 373
+T + AY R D F G LP + CY +L GF P ++
Sbjct: 366 VTRIPTVAYVAFR----DAFIGQTGNLPRASGVSIFDTCY------NLNGFVSVRVPTVS 415
Query: 374 FHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
F+FAGG L L A + + FC A S LSIIG I Q+ +++D
Sbjct: 416 FYFAGGPILTLPARNFLIPVDDVGTFCFAFAASP------SGLSIIGNIQQEGIQISFD 468
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 125/372 (33%), Positives = 185/372 (49%), Gaps = 52/372 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
F + +IG PP A++DTGS LIW +C+PC QC FDP KS +++ L C S
Sbjct: 97 FLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKL 156
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C D C Y Y + +QG + SE F GK + +V FGC +N
Sbjct: 157 CEALPQSTCS---DGCEYLYGYGDYSSTQGMLASETLTF-----GKVSVPEVAFGCGEDN 208
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL--- 268
Q +G+ GLG S S +++ KFSYC+ +++ + + L++G A +
Sbjct: 209 EGSGFSQGSGLVGLGRGPLSLVSQLKE--PKFSYCLTSVD--DTKASTLLMGSLASVKAS 264
Query: 269 --EGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
E +TP+ S YY++LEGIS+G+ L I + F + S G+ IDSGTT+T
Sbjct: 265 DSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGS-GGLIIDSGTTIT 323
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMD----PAWHLCY---SGNINRDLQGFPAMAFHF 376
+L SA+ + KE + P+D +C+ SG+ + ++ P + FHF
Sbjct: 324 YLEQSAFDLVAKEFTSQI-----NLPVDNSGSTGLEVCFTLPSGSTDIEV---PKLVFHF 375
Query: 377 AGGADLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
GADL L AE+ ++S V CLA+G S +SI G I QQN V +DL +
Sbjct: 376 -DGADLELPAENYMIADASMGVACLAMGSS-------SGMSIFGNIQQQNMLVLHDLEKE 427
Query: 436 QLYFQRIDCELL 447
L F C+ L
Sbjct: 428 TLSFLPTQCDEL 439
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/390 (29%), Positives = 177/390 (45%), Gaps = 37/390 (9%)
Query: 76 QKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-- 133
Q++ D ++ P + F V +G PP + ++DTGS L W++ +PC C
Sbjct: 2 QETLPGQTDNESYEFPESAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFE 61
Query: 134 -GATTFDPSKSLTYATLPCDSSYC-----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQ 187
FDPSKS TY + C SS C T C + C Y Y +G ++G
Sbjct: 62 QADPIFDPSKSSTYNKIACSSSACADLLGTQTCSAAAN-CIYAYGYGDGSVTRG-----Y 115
Query: 188 FNFETSDEGKTFLYDVGFGCS-HNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSY 245
F+ ET T +V FG S +N F D G+ GLG S S + V G+KFSY
Sbjct: 116 FSKETITATDTAGEEVKFGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSY 175
Query: 246 CIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDP 300
C+ + + + G+ A+ G+ ++ + YY+ ++GIS+G +LDID
Sbjct: 176 CLVDWLSAGSETSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQ 235
Query: 301 NLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSG 360
++++ D+ G IDSGTT+T+L + L Q P+ LC+
Sbjct: 236 SVYEI-DSGGSGGTIIDSGTTITYLQQEVFNALVAAYTS--QVRYPTTTSATGLDLCF-- 290
Query: 361 NINRDLQG---FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSI 417
N G FPAM H G L L + F +++ CLA + F ++I
Sbjct: 291 --NTRGTGSPVFPAMTIHL-DGVHLELPTANTFISLETNIICLAFA----SALDFP-IAI 342
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
G I QQN+++ YDL + ++ F DC L
Sbjct: 343 FGNIQQQNFDIVYDLDNMRIGFAPADCASL 372
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 111/358 (31%), Positives = 162/358 (45%), Gaps = 33/358 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +GQP P VLDTGS + W++CQPC C T FDP S ++A+LPC+S
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQ 214
Query: 156 CT--NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH 213
C G +C Y + Y +G + G E F S + +V GC H+N
Sbjct: 215 CQALETSGCRASKCLYQVSYGDGSFTVGEFVIETLTFGNSG----MINNVAVGCGHDNEG 270
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDST 273
+ G + S T + S FSYC+ ++ + + L A + +
Sbjct: 271 LFVGSAGLLGLGGGSLSLTSQM---KASSFSYCL--VDRDSSSSSDLEFNSAAPSDSVNA 325
Query: 274 PM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAY 330
P+ +D YYV L G+S+G ++L I PNLF+ +D+ G+ +DSGT +T L AY
Sbjct: 326 PLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGY-GGIIVDSGTAITRLQTQAY 384
Query: 331 QTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAE 387
TLR D F P + L CY + + P ++F FAGG L L +
Sbjct: 385 NTLR----DAFVSRTPYLKKTNGFALFDTCYDLSSQSRVT-IPTVSFEFAGGKSLQLPPK 439
Query: 388 SVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ +S FC A P+ LSIIG + QQ V YDL + + F C
Sbjct: 440 NYLIPVDSVGTFCFAFAPTT------SSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 130/439 (29%), Positives = 205/439 (46%), Gaps = 54/439 (12%)
Query: 27 TAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSS--QKAHD 84
+ P K KL+H++S PN + + R Y K S QK+
Sbjct: 19 SQTPTEAYNKGFSFKLIHKNS----PNSPF--YKSNNFHKNKLRSFYQVPKKSFVQKSPY 72
Query: 85 TRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPS 141
TR + G + + ++G PPV ++DTGS L+W +C PC C + F+P
Sbjct: 73 TRVTSNNGD-----YLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPL 127
Query: 142 KSLTYATLPCDSSYCTNDCGGY---PDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGK 197
+S TY+ +PC+S C+ GY P + C Y+ Y + ++G + E F ++D
Sbjct: 128 RSKTYSPIPCESEQCS--FFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDP 185
Query: 198 TFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS-----KFSYCIGNLNY 252
+ D+ FGC H+N+ +E G+ G SLV ++G+ +FS C+ +
Sbjct: 186 VVVGDIIFGCGHSNSGTFNENDMGIIG---MGGGPLSLVSQIGTLYGSKRFSQCLVPFHT 242
Query: 253 FEYAYNMLILGEGAILEGD---STPMSVIDG--SYYVTLEGISLGEKMLDIDPNLFKKND 307
+ + GE + + G+ +TP++ +G SY VTLEGIS+G+ + F ++
Sbjct: 243 DAHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVR-----FNSSE 297
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP--AWHLCYSGNINRD 365
T S + IDSGT T++ Y+ L +E++ + LLP DP LCY N
Sbjct: 298 TLSKGNIMIDSGTPATYIPQEFYERLVEELK-VQSSLLP-IEDDPDLGTQLCYRSETN-- 353
Query: 366 LQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
L+G P + HF GAD+ L F VFC A+ S +G+ I G AQ N
Sbjct: 354 LEG-PILTAHFE-GADVQLLPIQTFIPPKDGVFCFAMAGS-TDGDY-----IFGNFAQSN 405
Query: 426 YNVAYDLVSKQLYFQRIDC 444
+ +DL K + F+ DC
Sbjct: 406 ILMGFDLDRKTISFKPTDC 424
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 142/468 (30%), Positives = 192/468 (41%), Gaps = 73/468 (15%)
Query: 22 IFTSTTAAP--AAGKPKRLVTKLLHRDS--LLYNPNDTVDAQAQRTLNMSMARFIYLSQK 77
+ ++T++P AA P VT R S L+Y A A T S A + +
Sbjct: 30 VVQTSTSSPSNAACSPAAQVTSDPSRASMPLMYRHGPCAPASAAATNRPSPAEMLRRDRA 89
Query: 78 SS----QKAHDTRAHLHPGISTVPV----------FYVNFSIGQPPVPQLAVLDTGSSLI 123
+KA R L G+S +P + V G P VPQ+ ++DTGS L
Sbjct: 90 RRNHILRKASGRRITL--GVS-IPTSLGAFVDSLQYVVTLGFGTPAVPQVLLIDTGSDLS 146
Query: 124 WVKCQPCEQCGA-----TTFDPSKSLTYATLPCDS---------SY---CTNDCGGYPDE 166
WV+CQPC FDPS S TYA +PC S SY CTN G
Sbjct: 147 WVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGSEACRDLDPDSYANGCTNSSSGA-SL 205
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLG 226
C Y I+Y NG + G +E S E T + + FGC D + G
Sbjct: 206 CQYGIQYGNGDTTVGVYSTETLTL--SPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGG 263
Query: 227 PATSSTHSLVEKVGSKFSYCI--GNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYY- 283
S G FSYC+ GN A G TP+ V++ ++Y
Sbjct: 264 APESLVSQTTGTYGGAFSYCLPAGNSTAGFLALGAPATGGNNTAGFQFTPLQVVETTFYL 323
Query: 284 VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG 343
V L GIS+G K LDI+P +F G+ IDSGT +T L +AY LR F+
Sbjct: 324 VKLTGISVGGKQLDIEPTVFA-------GGMIIDSGTIVTGLPETAYSALRTA----FRS 372
Query: 344 LLPSYPMDPA-----WHLCY--SGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS 396
+ +YP+ P CY +GN N + P +A F GG + LD S +
Sbjct: 373 AMSAYPLLPPNDDEDLDTCYDFTGNTNVTV---PTVALTFEGGVTIDLDVPSGVLLDG-- 427
Query: 397 VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
CLA + G D IIG + Q+ + V YD + F+ C
Sbjct: 428 --CLAF----VAGASDGDTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 165/360 (45%), Gaps = 52/360 (14%)
Query: 116 LDTGSSLIWVKCQPCEQCG---ATTFDPSKSLTYATLPCDSSYCTN----DCGGYPDECW 168
+DTGS LIW +C PC C FD KS TY LPC SS C + C + C
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSC--FKKMCV 58
Query: 169 YNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGVFGLGP 227
Y Y + + G + +E F F ++ K ++ FGC S N ++ FG GP
Sbjct: 59 YQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGP 118
Query: 228 ATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEG---------DSTPMSV 277
SLV ++G S+FSYC+ +Y + L G A L STP +
Sbjct: 119 L-----SLVSQLGPSRFSYCL--TSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVI 171
Query: 278 ---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLR 334
+ Y+++L+ ISLG K+L IDP +F ND + GV IDSGT++TWL AY+ +R
Sbjct: 172 NPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGT-GGVIIDSGTSITWLQQDAYEAVR 230
Query: 335 KEVEDLFQGLLPSYPM------DPAWHLCYSGNINRDLQ-GFPAMAFHFAGGADLVLDAE 387
+ GL+ + P+ D C+ ++ P + FHF +L
Sbjct: 231 R-------GLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPEN 283
Query: 388 SVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+ ++ CL + P+ + +IIG QQN ++ YD+ + L F C+++
Sbjct: 284 YMLIASTTGYLCLVMAPTGVG-------TIIGNYQQQNLHLLYDIGNSFLSFVPAPCDII 336
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 113/375 (30%), Positives = 170/375 (45%), Gaps = 34/375 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG----ATTFDPSKSLTYATLPCDSS 154
++V+ IGQPP L + DTGS L+WVKC C C AT F P S T++ C
Sbjct: 83 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDP 142
Query: 155 YC--TNDCGGYP--------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
C G P C Y Y +G + G E + +TS + L V
Sbjct: 143 VCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVA 202
Query: 205 FGCSH--NNAHFSDEQF---TGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYN 258
FGC + S F GV GLG S S L + G+KFSYC+ + +
Sbjct: 203 FGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTS 262
Query: 259 MLILGEG--AILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
LI+G+G A+ + TP+ S YYV L+ + + L IDP++++ +D+ + G
Sbjct: 263 YLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDS-GNGG 321
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY-PMDPAWHLCYS-GNINRDLQGFPA 371
+DSGTTL +L AY+ + V+ + LP+ + P + LC + + + + P
Sbjct: 322 TVMDSGTTLAFLADPAYRLVIAAVKQRIK--LPNADELTPGFDLCVNVSGVTKPEKILPR 379
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+ F F+GGA V + F + + CLA+ D S+IG + QQ + +D
Sbjct: 380 LKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPK----VGFSVIGNLMQQGFLFEFD 435
Query: 432 LVSKQLYFQRIDCEL 446
+L F R C L
Sbjct: 436 RDRSRLGFSRRGCAL 450
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 112/417 (26%), Positives = 179/417 (42%), Gaps = 60/417 (14%)
Query: 36 KRLVTKLLHRDSLLYNPND----TVDAQAQRTLNM--SMARFIYLSQKSSQKAHDTRAHL 89
++ + K++HRD L + +D +D + +R S+ R + S + D +
Sbjct: 131 EKWMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYRVDDFGTDV 190
Query: 90 HPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLT 145
G+ ++V +G PP Q V+D+GS ++WV+CQPC QC FDP+ S +
Sbjct: 191 ISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSAS 250
Query: 146 YATLPCDSSYCTN--DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
+ + C SS C + G + C Y + Y +G ++GT+ E F G+T + V
Sbjct: 251 FTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF-----GRTMVRSV 305
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
GC H N + G + S L + G FSYC+ + + N
Sbjct: 306 AIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSAAWVPLVRN----- 360
Query: 264 EGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
P + YY+ L G+ +G + I +F+ + D GV +D+GT +T
Sbjct: 361 ----------PRA--PSFYYIGLAGLGVGGIRVPISEEVFRLTEL-GDGGVVMDTGTAVT 407
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPA---WHLCYSGNINRDLQGF-----PAMAFH 375
L AYQ R D F + P + CY DL GF P ++F+
Sbjct: 408 RLPTLAYQAFR----DAFLAQTANLPRATGVAIFDTCY------DLLGFVSVRVPTVSFY 457
Query: 376 FAGGADLVLDAESVFY-QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
F+GG L L A + + + FC A PS LSI+G I Q+ +++D
Sbjct: 458 FSGGPILTLPARNFLIPMDDAGTFCFAFAPST------SGLSILGNIQQEGIQISFD 508
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 119/393 (30%), Positives = 182/393 (46%), Gaps = 66/393 (16%)
Query: 94 STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ-PCEQC---GATTFDPSKSLTYATL 149
++ + V+F+IG PP+ AVLDTGS LIW +C PC +C A + P++S+TYA +
Sbjct: 95 ASTATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANV 154
Query: 150 PCDSSYC-------------------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF 190
C S C + GG C Y Y +G + G + +E F F
Sbjct: 155 SCGSRLCDALPSLRPSSRCSASASAPAPERGG----CTYYYSYGDGSSTDGVLATETFTF 210
Query: 191 ETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGN 249
T ++D+ FGC +N +D +G+ G+G SLV ++G +KFSYC
Sbjct: 211 GAG----TTVHDLAFGCGTDNLGGTDNS-SGLVGMG---RGPLSLVSQLGVTKFSYCFTP 262
Query: 250 LNYFEYAYNMLILGEGAILE--GDSTPMSVIDGS------YYVTLEGISLGEKMLDIDPN 301
N + + L LG A L STP YY++LEGI++G+ +L IDP
Sbjct: 263 FNDTTTS-SPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPA 321
Query: 302 LFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL----C 357
+F+ + G+ IDSGTT T L A+ L + V P+ HL C
Sbjct: 322 VFRLTASGR-GGLIIDSGTTFTALEERAFVVLARAVAARVA-----LPLASGAHLGLSVC 375
Query: 358 YSGNINRDLQG--FPAMAFHFAGGADLVLDAESVFYQES-SSVFCLAVGPSDINGERFKD 414
++ R + P + HF GAD+ L S ++ + V CL + + +
Sbjct: 376 FAAPQGRGPEAVDVPRLVLHF-DGADMELPRSSAVVEDRVAGVACLGIVSA-------RG 427
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+S++G + QQN +V YD+ L F+ +C L
Sbjct: 428 MSVLGSMQQQNMHVRYDVGRDVLSFEPANCGEL 460
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 112/379 (29%), Positives = 165/379 (43%), Gaps = 41/379 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG----ATTFDPSKSLTYATLPCDSS 154
++V+ IG PP L V DTGS LIWVKC PC C + F S TY+ + C S
Sbjct: 86 YFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARHSTTYSAIHCYSP 145
Query: 155 YCTNDCGGYPD---------ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
C +P+ C Y Y + + G E TS L + F
Sbjct: 146 QCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLNGLSF 205
Query: 206 GC-------SHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAY 257
GC S A F Q GV GLG A S S L + GSKFSYC+ +
Sbjct: 206 GCGFRISGPSLTGASFEGAQ--GVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLSPPPT 263
Query: 258 NMLILGEGAILEGDS------TPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDT 308
+ L +G + TP+ + S YY+ ++G+ + L I+P+++ +D
Sbjct: 264 SFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSIDD- 322
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNINRDLQ 367
+ G IDSGTTLT++ AY + K + + P+ P P + LC + + R
Sbjct: 323 LGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEP-TPGFDLCMNVSGVTRP-- 379
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P M+F+ AGG+ + F + + CLAV P +G S++G + QQ +
Sbjct: 380 ALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDG----GFSVLGNLMQQGFL 435
Query: 428 VAYDLVSKQLYFQRIDCEL 446
+ +D +L F R C L
Sbjct: 436 LEFDRDKSRLGFTRRGCAL 454
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 111/361 (30%), Positives = 163/361 (45%), Gaps = 32/361 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++V IG P Q V+DTGS + W++C PC+ C FDP S ++ L C +
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C + C Y + Y +G + G + S+ F+ G+T V FGC H+N
Sbjct: 74 CKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVS---RGRT--SPVVFGCGHDN 128
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAILEG 270
+ F G GL + S ++ S KFSYC+ + + A + L+ G+ A+
Sbjct: 129 ----EGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSALPTS 184
Query: 271 DSTPMS------VIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
S + +D YY L GIS+G +L I FK + + GV IDSGT++T
Sbjct: 185 ASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTR 244
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVL 384
L AY +R Q LP + CY + + P ++FHF GGA + L
Sbjct: 245 LPTYAYTVMRDAFRSATQK-LPRAADFSLFDTCYDFSALTSVT-IPTVSFHFEGGASVQL 302
Query: 385 DAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
+ ++S FC A + + DLSIIG I QQ VA DL S ++ F
Sbjct: 303 PPSNYLVPVDTSGTFCFAFSKTSL------DLSIIGNIQQQTMRVAIDLDSSRVGFAPRQ 356
Query: 444 C 444
C
Sbjct: 357 C 357
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 113/377 (29%), Positives = 168/377 (44%), Gaps = 38/377 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG----ATTFDPSKSLTYATLPCDSS 154
++V+ IGQPP L + DTGS L+WVKC C C AT F P S T++ C
Sbjct: 84 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDP 143
Query: 155 YCTNDCGGYPDE------------CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
C PD C Y Y +G + G E + +TS + L
Sbjct: 144 VC--RLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKS 201
Query: 203 VGFGCSH--NNAHFSDEQF---TGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYA 256
V FGC + S F GV GLG S S L + G+KFSYC+ +
Sbjct: 202 VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPP 261
Query: 257 YNMLILGEG--AILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ LI+G G I + TP+ S YYV L+ + + L IDP++++ +D+ +
Sbjct: 262 TSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDS-GN 320
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP-SYPMDPAWHLCYS-GNINRDLQGF 369
G +DSGTTL +L AY+++ V + LP + + P + LC + + + +
Sbjct: 321 GGTVVDSGTTLAFLAEPAYRSVIAAVRRRVK--LPIADALTPGFDLCVNVSGVTKPEKIL 378
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P + F F+GGA V + F + + CLA+ D S+IG + QQ +
Sbjct: 379 PRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPK----VGFSVIGNLMQQGFLFE 434
Query: 430 YDLVSKQLYFQRIDCEL 446
+D +L F R C L
Sbjct: 435 FDRDRSRLGFSRRGCAL 451
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 123/480 (25%), Positives = 197/480 (41%), Gaps = 57/480 (11%)
Query: 4 SHAILLLSLITLPFTSTR---------IFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPND 54
+ ++L+ L PF+++ +F AA P + ++HRD + N
Sbjct: 33 TQSLLVTPLSPTPFSASSELARGDDKDVFAGNLAAAEDATPSTVQFSVVHRDDFVVN--- 89
Query: 55 TVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTV-PV----------FYVNF 103
A A L + R + + S A G V PV ++
Sbjct: 90 ---ATAAELLGHRLQRDGKRAARISAAAGAANGTRRTGSGVVAPVVSGLAQGSGEYFTKI 146
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTN-D 159
+G P P L VLDTGS ++W++C PC +C FDP +S +Y + C + C D
Sbjct: 147 GVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGAVGCSAPLCRRLD 206
Query: 160 CGG---YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD 216
GG C Y + Y +G + G +E F G + + GC H+N
Sbjct: 207 SGGCDLRRKACLYQVAYGDGSVTAGDFATETLTF----AGGARVARIALGCGHDNEGLFV 262
Query: 217 EQFTGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGEGAI---LE 269
+ + S + + G FSYC+ + N ++ + + G GA+ +
Sbjct: 263 AAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHS-STVTFGSGAVGSTVA 321
Query: 270 GDSTPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
TPM ++ YYV L GIS+ G ++ + + + + + GV +DSGT++T L
Sbjct: 322 ASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIVDSGTSVTRL 381
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
AY LR GL S + CY + R + P ++ HFAGGA+ L
Sbjct: 382 ARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLS-GRKVVKVPTVSMHFAGGAEAALP 440
Query: 386 AESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
E+ +S FC A +D +SIIG I QQ + V +D +++ F C
Sbjct: 441 PENYLIPVDSKGTFCFAFAGTD------GGVSIIGNIQQQGFRVVFDGDGQRVGFVPKGC 494
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 119/370 (32%), Positives = 166/370 (44%), Gaps = 38/370 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ + IG P A+LDTGS LIW +C PC C FDP+ S TY +L C +
Sbjct: 92 YLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSAPA 151
Query: 156 CTND----CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C Y C Y Y + + G + +E F F T+D + L + FGC + N
Sbjct: 152 CNALYYPLC--YQKTCVYQYFYGDSASTAGVLANETFTFGTNDT-RVTLPRISFGCGNLN 208
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAILEG 270
A S +G+ G G + SLV ++GS +FSYC+ ++ + L G A L
Sbjct: 209 AG-SLANGSGMVGFG---RGSLSLVSQLGSPRFSYCL--TSFLSPVRSRLYFGAYATLNS 262
Query: 271 ------DSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
STP + + Y++ + GIS+G L IDP + NDT G IDSGTT
Sbjct: 263 TNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTT 322
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL--CYS-GNINRDLQGFPAMAFHFAG 378
+T+L AY +R+ LP + L C+ R P + HF
Sbjct: 323 ITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHF-D 381
Query: 379 GADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GAD L ++ + S+ CLA+ S D SIIG QN+NV YDL + L
Sbjct: 382 GADWELPLQNYMLVDPSTGGLCLAMATS-------SDGSIIGSYQHQNFNVLYDLENSLL 434
Query: 438 YFQRIDCELL 447
F C L+
Sbjct: 435 SFVPAPCNLM 444
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 132/435 (30%), Positives = 187/435 (42%), Gaps = 77/435 (17%)
Query: 35 PKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS 94
P L LHRD+L V A R S + LSQ S +
Sbjct: 70 PTDLFNLRLHRDTL------RVHALNSRAAGFSSSVVSGLSQGSGE-------------- 109
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC 151
++ +G PP VLDTGS ++W++C PC +C + + F+P KS ++A +PC
Sbjct: 110 ----YFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPC 165
Query: 152 DSSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
S C ++ C C Y + Y +G + G +E F + K V GC
Sbjct: 166 SSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAK-----VALGC 220
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG----SKFSYCIGNLNYFEYAYNMLILG 263
H+N + F G GL S + G KFSYC+ + + +M + G
Sbjct: 221 GHHN----EGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSM-VFG 275
Query: 264 EGAILE-GDSTPM---SVIDGSYYVTLEGISLGE-KMLDIDPNLFKKNDTWSDAGVFIDS 318
+ AI TP+ +D YYV L GIS+G ++ + P+LFK D+ + GV IDS
Sbjct: 276 DAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKL-DSAGNGGVIIDS 334
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQG-----FP 370
GT++T L AY LR D F+ P + L CY DL G P
Sbjct: 335 GTSVTRLTRPAYTALR----DAFRVGARHLKRGPEFSLFDTCY------DLSGQSSVKVP 384
Query: 371 AMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
+ HF GAD+ L A + + + FC A I+G LSIIG I QQ + V
Sbjct: 385 TVVLHFR-GADMALPATNYLIPVDENGSFCFAFA-GTISG-----LSIIGNIQQQGFRVV 437
Query: 430 YDLVSKQLYFQRIDC 444
YDL ++ F C
Sbjct: 438 YDLAGSRIGFAPRGC 452
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 125/442 (28%), Positives = 191/442 (43%), Gaps = 76/442 (17%)
Query: 21 RIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQ 80
+I T A P+ L+HR S +A + R N L +
Sbjct: 13 QIITYFLITTTASSPQGFTIDLIHRRS---------NASSSRVFNTQ------LGSPYAD 57
Query: 81 KAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATT 137
DT +L + IG PP AVLDTGS IW +C PC C A
Sbjct: 58 TVFDTYEYL-----------MKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPI 106
Query: 138 FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK 197
FDPSKS T+ + CD+ + C Y + Y ++GT+ +E ++
Sbjct: 107 FDPSKSSTFKEIRCDT---------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQP 157
Query: 198 TFLYDVGFGCSHNNAHFSDEQFTGVFGL--GPATSSTHSLVEKVGSKF----SYCIG--N 249
+ + GC NN+ F F GV GL GP SL+ ++G ++ SYC
Sbjct: 158 FVMPETIIGCGRNNSGF-KPGFAGVVGLDRGP-----KSLITQMGGEYPGLMSYCFAGKG 211
Query: 250 LNYFEYAYNMLILGEGAILEGDSTPMSVID---GSYYVTLEGISLGEKMLDIDPNLFKKN 306
+ + N ++ G+G + ST + V G YY+ L+ +S+G ++ F
Sbjct: 212 TSKINFGANAIVAGDGVV----STTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHA- 266
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDL 366
+ IDSG+TLT+ S +RK VE + + +P LCY ++ +
Sbjct: 267 ---LKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTAV--RFPRSDI--LCY---YSKTI 316
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
FP + HF+GGADLVLD +++ ++ VFCLA+ I ++ +I G AQ N
Sbjct: 317 DIFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAI----ICNSPIEE-AIFGNRAQNN 371
Query: 426 YNVAYDLVSKQLYFQRIDCELL 447
+ V YD S + F+ +C L
Sbjct: 372 FLVGYDSSSLLVSFKPTNCSAL 393
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 158/362 (43%), Gaps = 37/362 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ IG P VLDTGS + WV+CQPC C FDPS S +YA + CDS
Sbjct: 166 YFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQR 225
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C T C C Y + Y +G + G +E S T + +V GC H+N
Sbjct: 226 CRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDS----TPVGNVAIGCGHDN 281
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
F G GL S ++ S FSYC+ ++ A + L G+GA G
Sbjct: 282 EGL----FVGAAGLLALGGGPLSFPSQISASTFSYCL--VDRDSPAASTLQFGDGAAEAG 335
Query: 271 DSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
T V YYV L GIS+G + L I + F + T GV +DSGT +T L
Sbjct: 336 TVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQ 395
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLV 383
+AY LR D F PS P L CY + +R PA++ F GG L
Sbjct: 396 SAAYAALR----DAFVQGAPSLPRTSGVSLFDTCYDLS-DRTSVEVPAVSLRFEGGGALR 450
Query: 384 LDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L A++ + + +CLA P++ +SIIG + QQ V++D + F
Sbjct: 451 LPAKNYLIPVDGAGTYCLAFAPTNAA------VSIIGNVQQQGTRVSFDTARGAVGFTPN 504
Query: 443 DC 444
C
Sbjct: 505 KC 506
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 103/369 (27%), Positives = 162/369 (43%), Gaps = 39/369 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P P L VLDTGS ++W++C PC +C FDP +S +Y + C +
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCAAPL 199
Query: 156 CTN-DCGG---YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C D GG C Y + Y +G + G +E F G + V GC H+N
Sbjct: 200 CRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTF----AGGARVARVALGCGHDN 255
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGEGAI 267
+ + S + + G FSYC+ + N + + + G GA+
Sbjct: 256 EGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRS-STVTFGSGAV 314
Query: 268 ---LEGDSTPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
+ TPM ++ YYV L GIS+ G ++ + + + + + GV +DSGT
Sbjct: 315 GSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGGVIVDSGT 374
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA----WHLCYSGNINRDLQGFPAMAFHF 376
++T L AY LR D F+G + P + CY + R + P ++ HF
Sbjct: 375 SVTRLARPAYSALR----DAFRGAAAGLRLSPGGFSLFDTCYDLS-GRKVVKVPTVSMHF 429
Query: 377 AGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
AGGA+ L E+ +S FC A +D +SIIG I QQ + V +D +
Sbjct: 430 AGGAEAALPPENYLIPVDSKGTFCFAFAGTD------GGVSIIGNIQQQGFRVVFDGDGQ 483
Query: 436 QLYFQRIDC 444
++ F C
Sbjct: 484 RVAFTPKGC 492
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 125/442 (28%), Positives = 193/442 (43%), Gaps = 77/442 (17%)
Query: 29 APAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAH 88
A A P + KL+HRD + P + N M Q+ +++ R H
Sbjct: 57 ATEASSPAKYKLKLVHRDKV---PTFNTSHDHRTRFNARM-------QRDTKRVAALRRH 106
Query: 89 LHPGISTVPV-----------------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE 131
L G T ++V +G PP Q V+D+GS +IWV+C+PC
Sbjct: 107 LAAGKPTYAEEAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCT 166
Query: 132 QC---GATTFDPSKSLTYATLPCDSSYCT--NDCGGYPDECWYNIRYTNGPDSQGTIGSE 186
QC F+P+ S +YA + C S+ C+ ++ G + C Y + Y +G ++GT+ E
Sbjct: 167 QCYHQSDPVFNPADSSSYAGVSCASTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALE 226
Query: 187 QFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---- 242
F G+T + +V GC H+N F G GL S S V ++G +
Sbjct: 227 TLTF-----GRTLIRNVAIGCGHHN----QGMFVGAAGLLGLGSGPMSFVGQLGGQAGGT 277
Query: 243 FSYCIGNLNYFEYAYNMLILGEGAILEGDS-TPM---SVIDGSYYVTLEGISLGEKMLDI 298
FSYC+ ++ + +L G A+ G + P+ YYV L G+ +G + I
Sbjct: 278 FSYCL--VSRGIQSSGLLQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPI 335
Query: 299 DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL-- 356
++FK ++ D GV +D+GT +T L +AY+ R D F + P +
Sbjct: 336 SEDVFKLSE-LGDGGVVMDTGTAVTRLPTAAYEAFR----DAFIAQTTNLPRASGVSIFD 390
Query: 357 -CYSGNINRDLQGF-----PAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDING 409
CY DL GF P ++F+F+GG L L A + + FC A PS
Sbjct: 391 TCY------DLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGSFCFAFAPSS--- 441
Query: 410 ERFKDLSIIGMIAQQNYNVAYD 431
LSIIG I Q+ ++ D
Sbjct: 442 ---SGLSIIGNIQQEGIEISVD 460
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 125/442 (28%), Positives = 191/442 (43%), Gaps = 76/442 (17%)
Query: 21 RIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQ 80
+I T A P+ L+HR S +A + R N L +
Sbjct: 7 QIITYFLITTTASSPQGFTIDLIHRRS---------NASSSRVFNTQ------LGSPYAD 51
Query: 81 KAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATT 137
DT +L + IG PP AVLDTGS IW +C PC C A
Sbjct: 52 TVFDTYEYL-----------MKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPI 100
Query: 138 FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK 197
FDPSKS T+ + CD+ + C Y + Y ++GT+ +E ++
Sbjct: 101 FDPSKSSTFKEIRCDT---------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQP 151
Query: 198 TFLYDVGFGCSHNNAHFSDEQFTGVFGL--GPATSSTHSLVEKVGSKF----SYCIG--N 249
+ + GC NN+ F F GV GL GP SL+ ++G ++ SYC
Sbjct: 152 FVMPETIIGCGRNNSGF-KPGFAGVVGLDRGP-----KSLITQMGGEYPGLMSYCFAGKG 205
Query: 250 LNYFEYAYNMLILGEGAILEGDSTPMSVID---GSYYVTLEGISLGEKMLDIDPNLFKKN 306
+ + N ++ G+G + ST + V G YY+ L+ +S+G ++ F
Sbjct: 206 TSKINFGANAIVAGDGVV----STTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHA- 260
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDL 366
+ IDSG+TLT+ S +RK VE + + +P LCY ++ +
Sbjct: 261 ---LKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTAV--RFPRSDI--LCY---YSKTI 310
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
FP + HF+GGADLVLD +++ ++ VFCLA+ I ++ +I G AQ N
Sbjct: 311 DIFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAI----ICNSPIEE-AIFGNRAQNN 365
Query: 426 YNVAYDLVSKQLYFQRIDCELL 447
+ V YD S + F+ +C L
Sbjct: 366 FLVGYDSSSLLVSFKPTNCSAL 387
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 109/367 (29%), Positives = 175/367 (47%), Gaps = 53/367 (14%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSS 154
++ + +G PP A +DTGS LIW +C PC C A FDPS S T+ C+
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNG- 118
Query: 155 YCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
+ C Y I Y + S+GT+ +E ++ + + GC HN++ F
Sbjct: 119 ----------NSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWF 168
Query: 215 SDEQFTGVFGL--GPATSSTHSLVEKVGSKF----SYCIGN--LNYFEYAYNMLILGEGA 266
F+G+ GL GP+ SL+ ++G ++ SYC + + + N ++ G+G
Sbjct: 169 K-PTFSGMVGLSWGPS-----SLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDGV 222
Query: 267 ILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ ST M + G YY+ L+ +S+G+ ++ F + + IDSGTTLT
Sbjct: 223 V----STTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHA----LEGNIIIDSGTTLT 274
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH--LCYSGNINRDLQGFPAMAFHFAGGAD 381
+ P +Y L +E D + + + DP + LCY + FP + HF+GGAD
Sbjct: 275 YF-PVSYCNLVREAVDHYVTAVRT--ADPTGNDMLCY---YTDTIDIFPVITMHFSGGAD 328
Query: 382 LVLDAESVFYQE-SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
LVLD +++ + + FCLA+ I +D +I G AQ N+ V YD S ++F
Sbjct: 329 LVLDKYNMYIETITRGTFCLAI----ICNNPPQD-AIFGNRAQNNFLVGYDSSSLLVFFS 383
Query: 441 RIDCELL 447
+C L
Sbjct: 384 PTNCSAL 390
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 174/368 (47%), Gaps = 55/368 (14%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSS 154
V+ + +G PP AV+DTGS + W +C PC C A FDPSKS T+ C
Sbjct: 379 VYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKEKRC--- 435
Query: 155 YCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
+ C Y + Y + ++GT+ ++ ++ + + GC NN+ F
Sbjct: 436 --------HDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGRNNSWF 487
Query: 215 SDEQFTGVFGL--GPATSSTHSLVEKVGSKF----SYCI-GN-LNYFEYAYNMLILGEGA 266
F G GL GP SL+ ++G ++ SYC GN + + N ++ G G
Sbjct: 488 R-PSFEGFVGLNWGPL-----SLITQMGGEYPGLMSYCFAGNGTSKINFGTNAIVGGGGV 541
Query: 267 ILEGDSTPMSVID---GSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ ST M V G YY+ L+ +S+G+ ++ F + + IDSGTTLT
Sbjct: 542 V----STTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHA----LEGNIVIDSGTTLT 593
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYP-MDPAWH--LCYSGNINRDLQGFPAMAFHFAGGA 380
+ S +R+ VE ++P+ P DP + LCY N + FP + HF+GGA
Sbjct: 594 YFPESYCNLVRQAVEH----VVPAVPAADPTGNDLLCYYSNTT---EIFPVITMHFSGGA 646
Query: 381 DLVLDAESVFYQE-SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
DLVLD ++F + S +FCLA+ ++ E +I G AQ N+ V YD S + F
Sbjct: 647 DLVLDKYNMFMESYSGGLFCLAIICNNPTQE-----AIFGNRAQNNFLVGYDSSSLLVSF 701
Query: 440 QRIDCELL 447
+ +C L
Sbjct: 702 KPTNCSAL 709
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 170/374 (45%), Gaps = 71/374 (18%)
Query: 75 SQKSSQKAHDTRAHLHPGISTVPVFY---VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE 131
S SS + +T+A P TV Y + IG PP AVLDTGS LIW +C PC
Sbjct: 39 SNASSSRVSNTQAG-SPYADTVFDTYEYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCL 97
Query: 132 QC---GATTFDPSKSLTYATLPCDSSYCTNDCGGYPDE-CWYNIRYTNGPDSQGTIGSEQ 187
C A FDPSKS T+ C++ PD C Y + Y + +QGT+ +E
Sbjct: 98 HCYDQKAPIFDPSKSSTFKETRCNT----------PDHSCPYKLVYDDKSYTQGTLATET 147
Query: 188 FNFETSDEGKTFLYDVGFGCSHNNAHFS-DEQFTGVFGLGPATSSTHSLVEKVGSKFSYC 246
++ + + GCS NN+ +G+ GL + + SL+ ++G +
Sbjct: 148 VTIHSTSGVPFVMPETIIGCSRNNSGSGFRPSSSGIVGL---SRGSLSLISQMGGAYP-- 202
Query: 247 IGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLF 303
G+G + ST M + G YY+ L+ +S+G+ ++ F
Sbjct: 203 ----------------GDGVV----STTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPF 242
Query: 304 KKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH--LCYSGN 361
+ + IDSGT LT+ S +RK VE + + +DP+ + LCY N
Sbjct: 243 HA----LNGNIVIDSGTPLTYFPVSYCNLVRKAVERV---VTADRVVDPSRNDMLCYYSN 295
Query: 362 INRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAV---GPSDINGERFKDLSI 417
++ FP + HF+GGADLVLD +++ + VFCLA+ P+ + +I
Sbjct: 296 T---IEIFPVITVHFSGGADLVLDKYNMYMELNRGGVFCLAIICNNPTQV--------AI 344
Query: 418 IGMIAQQNYNVAYD 431
G AQ N+ V YD
Sbjct: 345 FGNRAQNNFLVGYD 358
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 127/450 (28%), Positives = 186/450 (41%), Gaps = 85/450 (18%)
Query: 65 NMSMARFIYLSQKSSQKAHDTRAHLHPGIST-----VPVFYVNFSIGQPPVPQLAVLDTG 119
M R ++S + ++A +T + +S+ ++V F +G P P L V DTG
Sbjct: 48 RMDRERMAFISSRGRRRAAETASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPFLLVADTG 107
Query: 120 SSLIWVKCQ------------------PCEQCGATTFDPSKSLTYATLPCDSSYCTND-- 159
S L WVKC P TF P KS T+A +PC S+ C
Sbjct: 108 SDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFRPDKSRTWAPIPCSSATCRESLP 167
Query: 160 -----CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEG--KTFLYDVGFGC--SHN 210
C + C Y+ RY +G ++GT+G + S K L V GC S+N
Sbjct: 168 FSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLRGVVLGCTTSYN 227
Query: 211 NAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL- 268
F GV LG + S S + G +FSYC+ + A + L G
Sbjct: 228 GQSFLASD--GVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGPNPAFS 285
Query: 269 -----EG--------------------DSTPMSVIDGS----YYVTLEGISLGEKMLDID 299
EG TP+ V+D Y VT++G+S+ ++L I
Sbjct: 286 SRRPSEGIASCKPAPAPTPAPAGAPGARQTPL-VLDHRTRPFYAVTVKGVSVAGELLKIP 344
Query: 300 PNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY- 358
++ D G +DSGT+LT L AY+ + + G LP MDP + CY
Sbjct: 345 RAVW---DVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAG-LPRVTMDP-FDYCYN 399
Query: 359 -SGNINRDLQG-FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV--GPSDINGERFKD 414
+ D+ P +A HFAG A L A+S + V C+ + GP +
Sbjct: 400 WTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQEGP-------WPG 452
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
LS+IG I QQ + YDL +++L F+R C
Sbjct: 453 LSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 122/448 (27%), Positives = 184/448 (41%), Gaps = 65/448 (14%)
Query: 35 PKRLVTKLLHRDSLLYNPNDTVDAQAQR-TLNMSMARFIYLSQKSSQKAHDTRAHLH-PG 92
P+ L + HRD+L P R L AR+ L D LH P
Sbjct: 24 PRTLHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLV--------DATGRLHSPV 75
Query: 93 ISTVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLT 145
S +P ++ +G P + V+DTGS L+W++C PC +C A FDP +S T
Sbjct: 76 FSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSST 135
Query: 146 YATLPCDSSYCTN------DCGGYP-DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
Y +PC S C D GG C Y + Y +G S G + +++ F T
Sbjct: 136 YRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFAN----DT 191
Query: 199 FLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAY 257
++ +V GC +N D G+ G+G S + V GS F YC+G+
Sbjct: 192 YVNNVTLGCGRDNEGLFDSA-AGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRS 250
Query: 258 NMLILGEGAILEGDSTPMSVIDGS------YYVTLEGISL-GEKMLDIDPNLFKKNDTWS 310
+ L+ G E ST + + + YYV + G S+ GE++ +
Sbjct: 251 SYLVFGR--TPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATG 308
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD--PAWHLCYSGNINRDLQG 368
GV +DSGT ++ AY LR + + + CY DL+G
Sbjct: 309 RGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACY------DLRG 362
Query: 369 FPA-----MAFHFAGGADLVLDAESVF-------YQESSSVFCLAVGPSDINGERFKDLS 416
PA + HFAGGAD+ L E+ F + +S CL +D LS
Sbjct: 363 RPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAAD------DGLS 416
Query: 417 IIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+IG + QQ + V +D+ +++ F C
Sbjct: 417 VIGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 112/357 (31%), Positives = 164/357 (45%), Gaps = 31/357 (8%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTNDCG 161
+G P ++DTGS L WV+C PC +C + F P+ S ++ L C S+ C
Sbjct: 19 LGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFTKLACGSALCN---- 74
Query: 162 GYP------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS 215
G P C Y Y +G + G + + + K + + FGC H+N S
Sbjct: 75 GLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFGCGHDNEG-S 133
Query: 216 DEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAI-LEGDST 273
G+ GLG S HS ++ V KFSYC+ + + L+ G+ A+ + D
Sbjct: 134 FAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLLFGDAAVPILPDVK 193
Query: 274 PMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+ ++ YYV L GIS+G+ +L+I +F D+ AG DSGTT+T L +
Sbjct: 194 YLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDI-DSVGGAGTIFDSGTTVTQLAEA 252
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAES 388
AY+ + + LC SG L PAM FHF GG D+VL +
Sbjct: 253 AYKEVLAAMNASTMAYSRKIDDISRLDLCLSGFPKDQLPTVPAMTFHFEGG-DMVLPPSN 311
Query: 389 VF-YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
F Y ESS +C A+ S D++IIG + QQN+ V YD ++L F DC
Sbjct: 312 YFIYLESSQSYCFAMTSS-------PDVNIIGSVQQQNFQVYYDTAGRKLGFVPKDC 361
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/362 (29%), Positives = 169/362 (46%), Gaps = 34/362 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ + ++G PPV ++DTGS L+W +C PC+ C + F+P +S TY +PCDS
Sbjct: 50 YLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSNTYTPIPCDSEE 109
Query: 156 CTNDCGG--YPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
C + G P + C Y+ Y + ++G + E F ++D + D+ FGC H+N+
Sbjct: 110 CNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGDIVFGCGHSNS 169
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVGS-----KFSYCIGNLNYFEYAYNMLILGEGAI 267
+E G+ SLV + G+ +FS C+ + + + G+ +
Sbjct: 170 GTFNEN---DMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGTISFGDASD 226
Query: 268 LEGD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+ G+ +TP+ +G Y VTLEGIS+G+ + F ++ S + IDSGT
Sbjct: 227 VSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVS-----FNSSEMLSKGNIMIDSGTPA 281
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
T+L Y L KE++ L D LCY N L+G P + HF GAD+
Sbjct: 282 TYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYRSETN--LEG-PILIAHFE-GADV 337
Query: 383 VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L F VFC A+ + +GE I G AQ N + +DL K + F+
Sbjct: 338 QLMPIQTFIPPKDGVFCFAMAGT-TDGEY-----IFGNFAQSNVLIGFDLDRKTVSFKAT 391
Query: 443 DC 444
DC
Sbjct: 392 DC 393
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 111/361 (30%), Positives = 162/361 (44%), Gaps = 32/361 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++V IG P Q V+DTGS + W++C PC+ C FDP S ++ L C +
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C + C Y + Y +G + G + S+ F G+T V FGC H+N
Sbjct: 74 CKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVS---RGRT--SPVVFGCGHDN 128
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAILEG 270
+ F G GL + S ++ S KFSYC+ + + A + L+ G+ A+
Sbjct: 129 ----EGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSALPTS 184
Query: 271 DSTPMS------VIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
S + +D YY L GIS+G +L I FK + + GV IDSGT++T
Sbjct: 185 ASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTR 244
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVL 384
L AY +R Q LP + CY + + P ++FHF GGA + L
Sbjct: 245 LPTYAYTVMRDAFRSATQK-LPRAADFSLFDTCYDFSALTSVT-IPTVSFHFEGGASVQL 302
Query: 385 DAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
+ ++S FC A + + DLSIIG I QQ VA DL S ++ F
Sbjct: 303 PPSNYLVPVDTSGTFCFAFSKTSL------DLSIIGNIQQQTMRVAIDLDSSRVGFAPRQ 356
Query: 444 C 444
C
Sbjct: 357 C 357
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 115/381 (30%), Positives = 168/381 (44%), Gaps = 46/381 (12%)
Query: 91 PGISTVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQCGAT---TFDPSK 142
P S +P+ + VN +G P + DTGS L W +CQPC + C A FDPS
Sbjct: 142 PAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSA 201
Query: 143 SLTYATLPCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDE 195
S TY+ + C S+ C+ N G C Y I+Y + + G + +D
Sbjct: 202 SKTYSNISCTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQNDV 261
Query: 196 GKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCI----GNL 250
F+ FGC NN + G+ GLG S +K G FSYC+ G+
Sbjct: 262 FDGFM----FGCGQNNRGLFGKT-AGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSN 316
Query: 251 NYFEYAYNMLILGEGAILEGDS-TPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKND 307
+ + + A+ G + TP + G+ Y++ + GIS+G K L I P LF+
Sbjct: 317 GHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQ--- 373
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINR 364
+AG IDSGT +T L + Y +L+ F+ + YP PA L CY + N
Sbjct: 374 ---NAGTIIDSGTVITRLPSTVYGSLKST----FKQFMSKYPTAPALSLLDTCYDLS-NY 425
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
P ++F+F G A++ L+ + +S CLA NG+ + I G I QQ
Sbjct: 426 TSISIPKISFNFNGNANVDLEPNGILITNGASQVCLAFAG---NGDD-DTIGIFGNIQQQ 481
Query: 425 NYNVAYDLVSKQLYFQRIDCE 445
V YD+ QL F C
Sbjct: 482 TLEVVYDVAGGQLGFGYKGCS 502
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 134/427 (31%), Positives = 198/427 (46%), Gaps = 60/427 (14%)
Query: 47 SLLYNPNDT----VDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP---VF 99
S LYN T V + A R++ S R ++ Q S L P I+ +P +
Sbjct: 38 SPLYNSQMTQTELVKSAALRSITRS-KRVNFIGQISPP--------LSPIITPIPDHGEY 88
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC 156
+ FS+G P V +LA+ DTGS L W++C PC+ C A FDP++S TY +PC+S C
Sbjct: 89 LMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQPC 148
Query: 157 T------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK---TFLYDVGFGC 207
T +CG +C Y +Y + G +G + +F ++ G+ TF V FGC
Sbjct: 149 TLFPQNQRECGSS-KQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSV-FGC 206
Query: 208 S-HNNAHFS-DEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
+ ++N F + G GLGP S S L +++G KFSYC+ + + + L G
Sbjct: 207 AFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCM--VPFSSTSTGKLKFGS 264
Query: 265 GAIL-EGDSTPMSVIDG--SYYV-TLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
A E STP + SYYV LEGI++G+K K + IDS
Sbjct: 265 MAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQK---------KVLTGQIGGNIIIDSVP 315
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
LT L Y V++ + P + C N + FP FHF GA
Sbjct: 316 ILTHLEQGIYTDFISSVKEAINVEVAEDAPTP-FEYCVRNPTNLN---FPEFVFHFT-GA 370
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
D+VL +++F +++ C+ V PS K +SI G AQ N+ V YDL K++ F
Sbjct: 371 DVVLGPKNMFIALDNNLVCMTVVPS-------KGISIFGNWAQVNFQVEYDLGEKKVSFA 423
Query: 441 RIDCELL 447
+C +
Sbjct: 424 PTNCSTI 430
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 124/438 (28%), Positives = 190/438 (43%), Gaps = 48/438 (10%)
Query: 30 PAAGKPKRLVTKLLHRDSL-LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDT--R 86
PA LV LHRD L L + + + S+ + + Q+ +T R
Sbjct: 12 PANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPLR 71
Query: 87 AHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKS 143
+ L G ++V+ +G PP V DTGS ++W++C PC+ C T F+PS S
Sbjct: 72 SGLSDGSGE---YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFS 128
Query: 144 LTYATLPCDSSYCTNDC--GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
T+ ++ C SS C G ++C Y + Y +G + G +E +F G +
Sbjct: 129 STFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSF-----GSNAVN 183
Query: 202 DVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
V GC HNN + S + + GS FSYC+ LI
Sbjct: 184 SVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRE--STGSVPLI 241
Query: 262 LGEGAILEGDSTPMSV----IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
G A+ + +D YYV + GI +G ++I + + + GV +D
Sbjct: 242 FGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILD 301
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-YPMDPAWHL---CYSGNINRDLQG----- 368
SGT +T LV SAY +R D F+ +PS M + L CY DL G
Sbjct: 302 SGTAVTRLVTSAYNPMR----DAFRAGMPSDAKMTSGFSLFDTCY------DLSGRSSIM 351
Query: 369 FPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
PA++F F GGA + L A+++ ++S +CLA P N E F SIIG I QQ++
Sbjct: 352 LPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAP---NSENF---SIIGNIQQQSFR 405
Query: 428 VAYDLVSKQLYFQRIDCE 445
+++D ++ C
Sbjct: 406 MSFDSTGNRVGIGANQCN 423
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 173/373 (46%), Gaps = 49/373 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-GATT--FDPSKSLTYATLPCDSSY 155
+ + +IG PPVP +A+ DTGS L W +C+PC+ C G T +D + S +++ LPC S+
Sbjct: 83 YLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPCSSAT 142
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C ++ C C Y Y +G S G + + FGC +N
Sbjct: 143 CLPIWSSRCSTPSATCRYRYAYDDGAYSPECAGIS-------------VGGIAFGCGVDN 189
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLIL-------- 262
S TG GLG + SLV ++G KFSYC+ ++F + + +
Sbjct: 190 GGLSYNS-TGTVGLG---RGSLSLVAQLGVGKFSYCL--TDFFNTSLSSPVFFGSLAELA 243
Query: 263 -----GEGAILEGDSTPMSVIDGS-YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
+ A+++ S + S YYV+LEGISLG+ L I F ND G+ +
Sbjct: 244 ASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIV 303
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLF-QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
DSGT T LV + ++ + V + Q ++ + +D + + ++L P M H
Sbjct: 304 DSGTIFTILVETGFRVVVDHVAGVLGQPVVNASSLDRPCFPAPAAGV-QELPDMPDMVLH 362
Query: 376 FAGGADLVLDAESVF-YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
FAGGAD+ L ++ + E S FCL +I G S++G QQN + +D+
Sbjct: 363 FAGGADMRLHRDNYMSFNEEESSFCL-----NIVGTESASGSVLGNFQQQNIQMLFDITV 417
Query: 435 KQLYFQRIDCELL 447
QL F DC L
Sbjct: 418 GQLSFMPTDCSKL 430
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 117/367 (31%), Positives = 159/367 (43%), Gaps = 50/367 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS WV+CQPC EQ FDP++S TYA + C +
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQ-REKLFDPARSSTYANVSCAA 237
Query: 154 SYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C T C G C Y ++Y +G S G + + D K F FGC
Sbjct: 238 PACSDLDTRGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFR----FGCGE 291
Query: 210 NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA-I 267
N E G+ GLG TS +K G F++C L L G G+
Sbjct: 292 RNEGLFGEA-AGLLGLGRGKTSLPVQTYDKYGGVFAHC---LPARSTGTGYLDFGAGSPA 347
Query: 268 LEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+TPM V +G YYV L GI +G ++L I ++F + AG +DSGT +T L
Sbjct: 348 ARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVF------ATAGTIVDSGTVITRL 401
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAFHFA 377
P+AY +LR Y PA L CY D G P ++ F
Sbjct: 402 PPAAYSSLRSAFAAAMSAR--GYKKAPAVSLLDTCY------DFAGMSQVAIPTVSLLFQ 453
Query: 378 GGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GGA L +DA + Y S+S CLA ++ G D+ I+G + + VAYD+ K +
Sbjct: 454 GGARLDVDASGIMYAASASQVCLAFAANEDGG----DVGIVGNTQLKTFGVAYDIGKKVV 509
Query: 438 YFQRIDC 444
F C
Sbjct: 510 SFSPGAC 516
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 116/372 (31%), Positives = 169/372 (45%), Gaps = 53/372 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPC---- 151
+ + SIG PP+ A DTGS L+W +C PC +C FDP S +Y + C
Sbjct: 60 YLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTES 119
Query: 152 ----DSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
DSS C+ D C Y Y + +QG + E ++ + FGC
Sbjct: 120 CNKLDSSLCSTD----QKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGC 175
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK-------FSYCIGNLNYFEYAYNML 260
HNN+ F+D + G+ GLG SL+ ++GS FS C+ N + +
Sbjct: 176 GHNNSGFNDREM-GLIGLG---RGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQM 231
Query: 261 ILGEGAILEGD---STPMSVIDGS-YYVTLEGISLGEKMLDIDPNLFKKN----DTWSDA 312
G+G+ + G+ STP+ DG+ Y+ TL GIS+ D NL N T +
Sbjct: 232 NFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVE------DINLPFSNGSSLGTITKG 285
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAM 372
+ IDSGTT+T+L Y L ++V + + L + +D + LCY N L G P +
Sbjct: 286 NILIDSGTTITYLPEEFYHRLIEQVRN--KVALEPFRID-GYELCYQTPTN--LNG-PTL 339
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
HF GG D++L +F FC AV D N E G AQ NY + +DL
Sbjct: 340 TIHFEGG-DVLLTPAQMFIPVQDDNFCFAV--FDTNEEYVT----YGNYAQSNYLIGFDL 392
Query: 433 VSKQLYFQRIDC 444
+ + F+ DC
Sbjct: 393 ERQVVSFKATDC 404
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 126/436 (28%), Positives = 201/436 (46%), Gaps = 60/436 (13%)
Query: 38 LVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHL--HPGIST 95
L T+L+HR+ +P+ + + +T + F+ ++ +++ H+ + +
Sbjct: 18 LRTELIHRE----HPSSPLRSNTSKT---TTEIFLAAVKRGAERRAQLSKHILAEGRLFS 70
Query: 96 VPV------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTY 146
PV + ++ S G PP ++DTGS LIW +C PCE C A FDP KS TY
Sbjct: 71 TPVASGNGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTY 130
Query: 147 ATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
T+ C S++C++ C C Y+ Y +G + G + + ET G + +
Sbjct: 131 DTVSCASNFCSSLPFQSC---TTSCKYDYMYGDGSSTSGAL-----STETVTVGTGTIPN 182
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLI 261
V FGC H N S G+ GLG S S + S KFSYC+ L + ++
Sbjct: 183 VAFGCGHTNLG-SFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLG--STKTSPML 239
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
+G+ A G + + + + YY L GIS+ K + F D G +D
Sbjct: 240 IGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSI-DASGQGGFILD 298
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH---LCYS--GNINRDLQGFPAM 372
SGTTLT+L A+ L + + +P D + + C+S G N +P M
Sbjct: 299 SGTTLTYLETGAFNALVAAL----KAEVPFPEADGSLYGLDYCFSTAGVANPT---YPTM 351
Query: 373 AFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
FHF GAD L E+VF ++ CLA+ S SI+G I QQN+ + +D
Sbjct: 352 TFHFK-GADYELPPENVFVALDTGGSICLAMAAS-------TGFSIMGNIQQQNHLIVHD 403
Query: 432 LVSKQLYFQRIDCELL 447
LV++++ F+ +CE +
Sbjct: 404 LVNQRVGFKEANCETI 419
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 122/443 (27%), Positives = 189/443 (42%), Gaps = 50/443 (11%)
Query: 38 LVTKLLHRDSLLYNPNDTVDAQAQ---RTLNMSMARFIYLSQKS----SQKAHDTRAHLH 90
L +L+HR+SLL + + Q TL R ++ K+ +K + L+
Sbjct: 56 LSLELIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKDEASSTDLN 115
Query: 91 PGISTVPVF-----YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSK 142
+++ ++ +V +G P V+DTGS L W++CQPC+ C FDP
Sbjct: 116 GPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRN 175
Query: 143 SLTYATLPCDSSYCT----NDCG---GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDE 195
S ++ +PC S C + C G C Y + Y +G S G S+ F T +
Sbjct: 176 SSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSK 235
Query: 196 GKTFLYDVGFGCSHNNAHFSDEQFTGVFGLG-----PATSSTHSLVEKVGSKFSYC-IGN 249
+ V FGC +N + P+ S + FSYC +
Sbjct: 236 AMS----VAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDR 291
Query: 250 LNYFEYAYNMLILGEGAILEGDS-TPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKK 305
N + + LI G AI + +P+ +D YY + G+S+G L I +
Sbjct: 292 SNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQL 351
Query: 306 NDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNIN 363
+ + S GV IDSGT++T S Y T+R + LPS P + CY SG +
Sbjct: 352 SQSGS-GGVIIDSGTSVTRFPTSVYATIRDAFRNATTN-LPSAPRYSLFDTCYNFSGKAS 409
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIA 422
D+ PA+ HF GADL L + ++ FCLA P+ + +L IIG I
Sbjct: 410 VDV---PALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSM------ELGIIGNIQ 460
Query: 423 QQNYNVAYDLVSKQLYFQRIDCE 445
QQ++ + +DL L F C+
Sbjct: 461 QQSFRIGFDLQKSHLAFAPQQCK 483
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 116/378 (30%), Positives = 182/378 (48%), Gaps = 60/378 (15%)
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYC---- 156
++G V+DT S L WV+CQPCE C FDPS S +YA +PC+SS C
Sbjct: 123 TVGLGAAEATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALR 182
Query: 157 ------TNDCGGYPDE---CWYNIRYTNGPDSQGTIGSEQFNFETSD-EGKTFLYDVGFG 206
T+ C ++ C Y + Y +G S+G + ++ D EG F G G
Sbjct: 183 VAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQDIEGFVF----GCG 238
Query: 207 CSHNNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
S+ A F +G+ GLG + S +++ G FSYC+ + L+LG+
Sbjct: 239 TSNQGAPFGGT--SGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMRE--SGSSGSLVLGDD 294
Query: 266 AILEGDSTPM---SVIDGS-------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG-V 314
+ +STP+ +++ S Y++ L GI++G + ++ W AG V
Sbjct: 295 SSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVE---------SPWFSAGRV 345
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPA 371
IDSGT +T LVPS Y +R E F L YP PA+ + C++ +++Q P+
Sbjct: 346 IIDSGTIITTLVPSVYNAVRAE----FLSQLAEYPQAPAFSILDTCFNLTGLKEVQ-VPS 400
Query: 372 MAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
+ F F G ++ +D++ V Y SS S CLA+ + + E D SIIG Q+N V
Sbjct: 401 LKFVFEGSVEVEVDSKGVLYFVSSDASQVCLAL--ASLKSE--YDTSIIGNYQQKNLRVI 456
Query: 430 YDLVSKQLYFQRIDCELL 447
+D + Q+ F + C+ +
Sbjct: 457 FDTLGSQIGFAQETCDYI 474
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 124/438 (28%), Positives = 189/438 (43%), Gaps = 48/438 (10%)
Query: 30 PAAGKPKRLVTKLLHRDSL-LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDT--R 86
PA LV LHRD L L + + + S+ + + Q+ +T R
Sbjct: 12 PANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPLR 71
Query: 87 AHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKS 143
+ L G ++V+ +G PP V DTGS ++W++C PC+ C T F+PS S
Sbjct: 72 SGLSDGSGE---YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFS 128
Query: 144 LTYATLPCDSSYCTNDC--GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
T+ ++ C SS C G ++C Y + Y +G + G +E +F G +
Sbjct: 129 STFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSF-----GSNAVN 183
Query: 202 DVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
V GC HNN + S + + GS FSYC+ LI
Sbjct: 184 SVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRE--STGSVPLI 241
Query: 262 LGEGAILEGDSTPMSV----IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
G A+ + +D YYV + GI +G + I + + + GV +D
Sbjct: 242 FGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILD 301
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-YPMDPAWHL---CYSGNINRDLQG----- 368
SGT +T LV SAY +R D F+ +PS M + L CY DL G
Sbjct: 302 SGTAVTRLVTSAYNPMR----DAFRAGMPSDAKMTSGFSLFDTCY------DLSGRSSIM 351
Query: 369 FPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
PA++F F GGA + L A+++ ++S +CLA P N E F SIIG I QQ++
Sbjct: 352 LPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAP---NSENF---SIIGNIQQQSFR 405
Query: 428 VAYDLVSKQLYFQRIDCE 445
+++D ++ C
Sbjct: 406 MSFDSTGNRVGIGANQCN 423
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 123/434 (28%), Positives = 185/434 (42%), Gaps = 61/434 (14%)
Query: 35 PKRLVTKLLHRDSLLYNPNDTVDAQ--------AQRTLNMSMARFIYLSQKSSQKAHDTR 86
P+ L + L RDS T+ AQ A RT S + LSQ S +
Sbjct: 88 PQELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRTGGFSSSVVSGLSQGSGE------ 141
Query: 87 AHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKS 143
++ +G P VLDTGS ++W++C PC +C + + FDP KS
Sbjct: 142 ------------YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 189
Query: 144 LTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
TYAT+PC S +C + C C Y + Y +G + G +E F +
Sbjct: 190 KTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-----RNR 244
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNM 259
+ V GC H+N + S + KFSYC+ + + +
Sbjct: 245 VKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKP-SS 303
Query: 260 LILGEGAILE-GDSTPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGV 314
++ G A+ TP+ +D YYV L GIS+ G ++ + +LFK D + GV
Sbjct: 304 VVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKL-DQIGNGGV 362
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPA 371
IDSGT++T L+ AY +R D F+ + P + L C+ + N + P
Sbjct: 363 IIDSGTSVTRLIRPAYIAMR----DAFRVGAKALKRAPDFSLFDTCFDLS-NMNEVKVPT 417
Query: 372 MAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+ HF GAD+ L A + +++ FC A + LSIIG I QQ + V Y
Sbjct: 418 VVLHFR-GADVSLPATNYLIPVDTNGKFCFAFAGT------MGGLSIIGNIQQQGFRVVY 470
Query: 431 DLVSKQLYFQRIDC 444
DL S ++ F C
Sbjct: 471 DLASSRVGFAPGGC 484
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 94/273 (34%), Positives = 136/273 (49%), Gaps = 38/273 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG---ATTFDPSKSLTYATLPCDSSY 155
+ V+ +IG PP+ A++DTGS LIW +C PC C FD KS TY LPC SS
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSR 148
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHN 210
C + C + C Y Y + + G + +E F F ++ K ++ FGC S N
Sbjct: 149 CASLSSPSC--FKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLN 206
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILE 269
++ FG GP SLV ++G S+FSYC+ +Y + L G A L
Sbjct: 207 AGDLANSSGMVGFGRGPL-----SLVSQLGPSRFSYCL--TSYLSATPSRLYFGVYANLS 259
Query: 270 G---------DSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
STP + + Y+++L+ ISLG K+L IDP +F ND + GV ID
Sbjct: 260 STNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGT-GGVIID 318
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM 350
SGT++TWL AY+ +R+ GL+ + P+
Sbjct: 319 SGTSITWLQQDAYEAVRR-------GLVSAIPL 344
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 112/377 (29%), Positives = 166/377 (44%), Gaps = 55/377 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ V S+G PP Q V+D+GS ++WV+C+PC +C FDP+ S T++ + C S+
Sbjct: 171 YLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAI 230
Query: 156 C----TNDCG-GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
C T+ CG G C Y + Y +G ++G + ET G T + V GC H
Sbjct: 231 CRILPTSACGDGELGGCEYEVSYADGSYTKGALA-----LETLTLGGTAVEGVVIGCGHR 285
Query: 211 NAHFSDEQFTGVFGL-----GPATSSTHSLVEKVGSKFSYCIGNLNYFEYA-----YNML 260
N F G GL GP S L +VG FSYC+ + + L
Sbjct: 286 NRGL----FVGAAGLMGLGWGP-MSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDAGWL 340
Query: 261 ILGEGAILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFK-KNDTWSDAGV 314
+LG + + + ++ YYV L GI +G++ L + LF+ D D V
Sbjct: 341 VLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGD--V 398
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL--CYSGNINRDLQGF--- 369
+D+GTT+T L AY LR G +P + L CY DL G+
Sbjct: 399 VMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCY------DLSGYASV 452
Query: 370 --PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P ++F F G A L+L A +V + ++CLA PS LSI+G Q
Sbjct: 453 RVPTVSFCFDGDARLILAARNVLLEVDMGIYCLAFAPSS------SGLSIMGNTQQAGIQ 506
Query: 428 VAYDLVSKQLYFQRIDC 444
+ D + + F +C
Sbjct: 507 ITVDSANGYIGFGPANC 523
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 109/367 (29%), Positives = 174/367 (47%), Gaps = 53/367 (14%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSS 154
++ + +G PP A +DTGS LIW +C PC C A FDPS S T+ C+
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNG- 118
Query: 155 YCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
+ C Y I Y + S+GT+ +E ++ + + GC HN++ F
Sbjct: 119 ----------NSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWF 168
Query: 215 SDEQFTGVFGL--GPATSSTHSLVEKVGSKF----SYCIGN--LNYFEYAYNMLILGEGA 266
F+G+ GL GP+ SL+ ++G ++ SYC + + + N ++ G+G
Sbjct: 169 -KPTFSGMVGLSWGPS-----SLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDGV 222
Query: 267 ILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ ST M + G YY+ L+ +S+G+ ++ F + + IDSGTTLT
Sbjct: 223 V----STTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHA----LEGNIIIDSGTTLT 274
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH--LCYSGNINRDLQGFPAMAFHFAGGAD 381
+ P +Y L +E D + + + DP + LCY + FP + HF+GGAD
Sbjct: 275 YF-PVSYCNLVREAVDHYVTAVRT--ADPTGNDMLCY---YTDTIDIFPVITMHFSGGAD 328
Query: 382 LVLDAESVFYQE-SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
LVLD +++ + + FCLA+ I +D +I G AQ N+ V YD S + F
Sbjct: 329 LVLDKYNMYIETITRGTFCLAI----ICNNPPQD-AIFGNRAQNNFLVGYDSSSLLVSFS 383
Query: 441 RIDCELL 447
+C L
Sbjct: 384 PTNCSAL 390
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 124/433 (28%), Positives = 190/433 (43%), Gaps = 45/433 (10%)
Query: 35 PKRLVTKLLHRD-SLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGI 93
P+ L+ H+D L D+ + +N + + + KS DT LHP
Sbjct: 86 PRELLHGGSHKDYRALMLSRLARDSARVKAINTKLQLAVSGTDKSDLVPMDTEI-LHPQD 144
Query: 94 STVPV----------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDP 140
+ PV +++ IG+P V+DTGS + W++C+PC+ C FDP
Sbjct: 145 FSTPVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDP 204
Query: 141 SKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEG 196
+ S +++ L C + C N C D C Y + Y +G + G +E +F S
Sbjct: 205 ASSSSFSRLGCQTPQCRNLDVFACRN--DSCLYQVSYGDGSYTVGDFATETVSFGNSGS- 261
Query: 197 KTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEY 255
+ V GC H+N F G GL SL ++ S FSYC+ N + +
Sbjct: 262 ---VDKVAIGCGHDNEGL----FVGAAGLIGLGGGPLSLTSQIKASSFSYCLVNRDSVDS 314
Query: 256 AYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
+ L + + P+ S +D YYV + G+S+G + L I P++F+ D
Sbjct: 315 S--TLEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEV-DGSGKG 371
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAM 372
G+ +D GT +T L AY LR L + LPS + CY+ + +R P +
Sbjct: 372 GIIVDCGTAVTRLQTQAYNALRDTFVKLTKD-LPSTSGFALFDTCYNLS-SRTSVRVPTV 429
Query: 373 AFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
AF F GG L L + +S+ FCLA P+ + LSIIG + QQ V YD
Sbjct: 430 AFLFDGGKSLPLPPSNYLIPVDSAGTFCLAFAPTTAS------LSIIGNVQQQGTRVTYD 483
Query: 432 LVSKQLYFQRIDC 444
L + Q+ F C
Sbjct: 484 LANSQVSFSSRKC 496
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 116/362 (32%), Positives = 162/362 (44%), Gaps = 37/362 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ IG P VLDTGS + WV+CQPC C FDPS S +YA + CDS
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSPR 228
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C T C C Y + Y +G + G +E S T + +V GC H+N
Sbjct: 229 CRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDS----TPVTNVAIGCGHDN 284
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILG-EGAILE 269
F G GL S ++ S FSYC+ ++ A + L G +GA +
Sbjct: 285 EGL----FVGAAGLLALGGGPLSFPSQISASTFSYCL--VDRDSPAASTLQFGADGAEAD 338
Query: 270 GDSTPM--SVIDGS-YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
+ P+ S G+ YYV L GIS+G + L I + F + T GV +DSGT +T L
Sbjct: 339 TVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAVTRLQ 398
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLV 383
SAY LR D F PS P L CY + +R PA++ F GG L
Sbjct: 399 SSAYAALR----DAFVRGTPSLPRTSGVSLFDTCYDLS-DRTSVEVPAVSLRFEGGGALR 453
Query: 384 LDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L A++ + + +CLA P++ +SIIG + QQ V++D + F
Sbjct: 454 LPAKNYLIPVDGAGTYCLAFAPTNAA------VSIIGNVQQQGTRVSFDTAKGVVGFTPN 507
Query: 443 DC 444
C
Sbjct: 508 KC 509
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 155/360 (43%), Gaps = 28/360 (7%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ + SIG PP + DTGS L W C PC +C FDP KS +Y + CDS
Sbjct: 25 YLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSKL 84
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C T C C Y Y + +QG + E ++ L + FGC HNN
Sbjct: 85 CHKLDTGVCSPQ-KHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHNN 143
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV--GSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
+++ G+ GLG S S + G +FS C+ + + + LG+G+ +
Sbjct: 144 TGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKGSEVS 203
Query: 270 GD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
G STP+ Y+VTL GIS+G L + + + + VF+DSGT T
Sbjct: 204 GKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGS---SSQSVEKGNVFLDSGTPPTI 260
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVL 384
L Y L +V + +D LCY +L+G P + HF GG D+ L
Sbjct: 261 LPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCY--RTKNNLRG-PVLTAHFEGG-DVKL 316
Query: 385 DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
F VFCL + +G + G AQ NY + +DL + + F+ +DC
Sbjct: 317 LPTQTFVSPKDGVFCLGFTNTSSDG------GVYGNFAQSNYLIGFDLDRQVVSFKPMDC 370
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 121/448 (27%), Positives = 183/448 (40%), Gaps = 65/448 (14%)
Query: 35 PKRLVTKLLHRDSLLYNPNDTVDAQAQR-TLNMSMARFIYLSQKSSQKAHDTRAHLH-PG 92
P+ L + HRD+L P R L AR+ L D LH P
Sbjct: 24 PRTLHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLV--------DATGRLHSPV 75
Query: 93 ISTVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLT 145
S +P ++ +G P + V+DTGS L+W++C PC +C A FDP +S T
Sbjct: 76 FSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSST 135
Query: 146 YATLPCDSSYCTN------DCGGYP-DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
Y +PC S C D GG C Y + Y +G S G + +++ F T
Sbjct: 136 YRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFAN----DT 191
Query: 199 FLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAY 257
++ +V GC +N D G+ G+ S + V GS F YC+G+
Sbjct: 192 YVNNVTLGCGRDNEGLFDSA-AGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRS 250
Query: 258 NMLILGEGAILEGDSTPMSVIDGS------YYVTLEGISL-GEKMLDIDPNLFKKNDTWS 310
+ L+ G E ST + + + YYV + G S+ GE++ +
Sbjct: 251 SYLVFGR--TPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATG 308
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD--PAWHLCYSGNINRDLQG 368
GV +DSGT ++ AY LR + + + CY DL+G
Sbjct: 309 RGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACY------DLRG 362
Query: 369 FPA-----MAFHFAGGADLVLDAESVF-------YQESSSVFCLAVGPSDINGERFKDLS 416
PA + HFAGGAD+ L E+ F + +S CL +D LS
Sbjct: 363 RPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAAD------DGLS 416
Query: 417 IIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+IG + QQ + V +D+ +++ F C
Sbjct: 417 VIGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 111/377 (29%), Positives = 168/377 (44%), Gaps = 42/377 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++++ +G PP L ++DTGS L W++C+PC+ C FDPS+S ++ +PC+++
Sbjct: 87 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 146
Query: 156 C---------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF-LYDVGF 205
C N P C Y Y + + G + E + SD + + D+
Sbjct: 147 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 206
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEK-VGSKFSYCIGNLNYFEYAYNMLILGE 264
GC H+N + A S L +G FSYC+ + + + G
Sbjct: 207 GCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISFGA 266
Query: 265 GAIL-----EGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
G L + TP + S YY+ ++GI + +++L I F T G
Sbjct: 267 GFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIA-TNGSGGTI 325
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM-DP--AWHLCYSGNINRDLQGFPAM 372
IDSGTTLT+L AY + VE F + SYP DP +CY+ R FPA+
Sbjct: 326 IDSGTTLTYLNRDAY----RAVESAFLARI-SYPRADPFDILGICYNAT-GRAAVPFPAL 379
Query: 373 AFHFAGGADLVLDAESVFYQ--ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+ F GA+L L E+ F Q + CLA+ P+D +SIIG QQN + Y
Sbjct: 380 SIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTD-------GMSIIGNFQQQNIHFLY 432
Query: 431 DLVSKQLYFQRIDCELL 447
D+ +L F DC L
Sbjct: 433 DVQHARLGFANTDCSAL 449
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 116/429 (27%), Positives = 193/429 (44%), Gaps = 45/429 (10%)
Query: 41 KLLHRDSL--------LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPG 92
+L H D+L L+N DA ++L S+A + + ++ + + + G
Sbjct: 81 QLHHLDALSSDETPQDLFNSRLARDASRVKSLT-SLAAAVGSTNRTRARGPGFSSSVTSG 139
Query: 93 IST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYAT 148
++ ++ +G P VLDTGS ++W++C PC++C + T F+P+KS ++A
Sbjct: 140 LAQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFAN 199
Query: 149 LPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
+PC S C C C Y + Y +G + G +E F + G+ V
Sbjct: 200 IPCGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGR-----VA 254
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
GC H+N + S + + KFSYC+ + + + ++ G+
Sbjct: 255 LGCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSA-SSKPSYMVFGD 313
Query: 265 GAILE-GDSTPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
AI TP+ +D YYV L G+S+ G ++ I +LFK D+ + GV IDSG
Sbjct: 314 SAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKL-DSTGNGGVIIDSG 372
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHF 376
T++T L AY LR D F+ + P + L C+ + +++ P + HF
Sbjct: 373 TSVTRLTRPAYVALR----DAFRVGASNLKRAPEFSLFDTCFDLSGKTEVK-VPTVVLHF 427
Query: 377 AGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
GAD+ L A + ++S FC A + LSI+G I QQ + V YDL +
Sbjct: 428 R-GADVSLPASNYLIPVDNSGSFCFAFAGT------MSGLSIVGNIQQQGFRVVYDLAAS 480
Query: 436 QLYFQRIDC 444
++ F C
Sbjct: 481 RVGFAPRGC 489
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 115/378 (30%), Positives = 171/378 (45%), Gaps = 51/378 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V+ +IG PP P LDTGS LIW +C+PC C FD S+S T A LPC+S+
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCESTQ 94
Query: 156 CTND--------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C D C Y Y + + G + +++F F T L V FGC
Sbjct: 95 CKLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFVAG----TSLPGVTFGC 150
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLN-------YFEYAYNML 260
NN + TG+ G G S S + KVG+ FS+C + + ++
Sbjct: 151 GLNNTGVFNSNETGIAGFGRGPLSLPSQL-KVGN-FSHCFTTITGAIPSTVLLDLPADLF 208
Query: 261 ILGEGAILEGDSTPMSVIDGS------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
G+GA+ +TP+ + YY++L+GI++G L + + F T G
Sbjct: 209 SNGQGAV---QTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL--TNGTGGT 263
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMA 373
IDSGT++T L P YQ +R E + LP P + H C+S ++ P +
Sbjct: 264 IIDSGTSITSLPPQVYQVVRDEFAAQIK--LPVVPGNATGHYTCFSAP-SQAKPDVPKLV 320
Query: 374 FHFAGGADLVLDAESVFYQ----ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
HF GA + L E+ ++ +S+ CLA+ D + +IIG QQN +V
Sbjct: 321 LHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGD-------ETTIIGNFQQQNMHVL 372
Query: 430 YDLVSKQLYFQRIDCELL 447
YDL + L F C+ L
Sbjct: 373 YDLQNNMLSFVAAQCDKL 390
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 114/374 (30%), Positives = 166/374 (44%), Gaps = 50/374 (13%)
Query: 90 HPGISTVPVFYV-NFSIGQPPVPQLAVLDTGSSLIWVKCQPCE--QCGATT---FDPSKS 143
H G S + + YV S G P VPQ+ V+DTGS + W++C+PC QC +DPS S
Sbjct: 69 HLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHS 128
Query: 144 LTYATLPCDSSYCTN---DCGG----YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEG 196
TY+ +PC S C D G +C + I Y +G + G ++
Sbjct: 129 STYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAPGAIV 188
Query: 197 KTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYA 256
+ F FGC H H F GV GLG SL + G FSYC+ +++
Sbjct: 189 QNFY----FGCGHGK-HAVRGLFDGVLGLG---RLRESLGARYGGVFSYCLPSVSSKP-- 238
Query: 257 YNMLILGEGAILEGDS-TPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
L LG G G TPM + G VTL GI++G K LD+ P+ F
Sbjct: 239 -GFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF-------SG 290
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG--LLPSYPMDPAWHLCYSGNINRDLQGFP 370
G+ +DSGT +T L +AY+ LR + LLP+ +D ++L N+ P
Sbjct: 291 GMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDLDTCYNLTGYKNVV-----VP 345
Query: 371 AMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+A F GGA + LD + CLA S +G ++G + Q+ + V +
Sbjct: 346 KIALTFTGGATINLDVPNGILVNG----CLAFAESGPDGS----AGVLGNVNQRAFEVLF 397
Query: 431 DLVSKQLYFQRIDC 444
D + + F+ C
Sbjct: 398 DTSTSKFGFRAKAC 411
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 107/362 (29%), Positives = 165/362 (45%), Gaps = 35/362 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G PP VLDTGS ++W++C PC++C A + FDP KS ++A++ C S
Sbjct: 126 YFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSPL 185
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C C Y + Y +G + G +E F +T + V GC H+N
Sbjct: 186 CHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFR-----RTRVARVALGCGHDN 240
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE-G 270
+ S + KFSYC+ + + +M + G+ A+
Sbjct: 241 EGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSM-VFGDSAVSRTA 299
Query: 271 DSTPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
TP+ +D YYV L GIS+ G ++ I +LFK + T + GV IDSGT++T L
Sbjct: 300 RFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQT-GNGGVIIDSGTSVTRLT 358
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLV 383
AY R D F+ + P + L C+ + +++ P + HF GAD+
Sbjct: 359 RPAYIAFR----DAFRAGASNLKRAPQFSLFDTCFDLSGKTEVK-VPTVVLHFR-GADVS 412
Query: 384 LDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L A + ++S FCLA + LSIIG I QQ + V YDL ++ F
Sbjct: 413 LPASNYLIPVDTSGNFCLAFAGT------MGGLSIIGNIQQQGFRVVYDLAGSRVGFAPH 466
Query: 443 DC 444
C
Sbjct: 467 GC 468
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 124/432 (28%), Positives = 193/432 (44%), Gaps = 57/432 (13%)
Query: 26 TTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDT 85
T+ APA+ ++L RD L D++ QA+R++N++ S H
Sbjct: 77 TSTAPASS-----FNEILRRDKLRV---DSI-IQARRSMNLT-----------SSVEHMK 116
Query: 86 RAHLHPGISTVPV--FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPS 141
+ G+S + + VN IG P + DTGS LIW +C+PC+ C FDP+
Sbjct: 117 SSVPFYGLSKITASDYIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKVPVFDPT 176
Query: 142 KSLTYATLPCDSSYCTN-DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
KS ++ LPC S C + G +C Y Y + S GT+ +E +F K
Sbjct: 177 KSASFKGLPCSSKLCQSIRQGCSSPKCTYLTAYVDNSSSTGTLATETISFS---HLKYDF 233
Query: 201 YDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK-FSYCIGNLNYFEYAYNM 259
++ GCS + S + +G+ GL + S S + K FSYCI + +
Sbjct: 234 KNILIGCSDQVSGESLGE-SGIMGLNRSPISLASQTANIYDKLFSYCIPST---PGSTGH 289
Query: 260 LILGEGAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
L G + +P+S S Y + + GIS+G + L ID + FK T ID
Sbjct: 290 LTFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIAST-------ID 342
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM---DPAWHLCYSGNINRDLQGFPAMAF 374
SG LT L P AY LR +F+ ++ YP+ D CY + N P+++
Sbjct: 343 SGAVLTRLPPKAYSALR----SVFREMMKGYPLLDQDDFLDTCYDFS-NYSTVAIPSISV 397
Query: 375 HFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
F GG ++ +D + +Q S V+CLA D ++SI G Q+ Y V +D
Sbjct: 398 FFEGGVEMDIDVSGIMWQVPGSKVYCLAFAELD------DEVSIFGNFQQKTYTVVFDGA 451
Query: 434 SKQLYFQRIDCE 445
+++ F C+
Sbjct: 452 KERIGFAPGGCD 463
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 120/392 (30%), Positives = 175/392 (44%), Gaps = 54/392 (13%)
Query: 82 AHDTRAHLHPGISTVPVFY-VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--- 137
A D++ L G+ + Y V IG + ++DTGS L WV+CQPC C
Sbjct: 49 ALDSQIPLSSGVRLQTLNYIVTVEIGGRNMT--VIVDTGSDLTWVQCQPCRLCYNQQDPL 106
Query: 138 FDPSKSLTYATLPCDSSYCTN---------DCGGYPDECWYNIRYTNGPDSQGTIGSEQF 188
F+PS S +Y T+ C+SS C + CG C Y + Y +G ++G +G EQ
Sbjct: 107 FNPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQL 166
Query: 189 NFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPA----TSSTHSLVEKVGSKFS 244
N G T + + FGC NN +G+ GLG + S T ++ E V FS
Sbjct: 167 NL-----GTTHVSNFIFGCGRNNKGLFGGA-SGLMGLGKSDLSLVSQTSAIFEGV---FS 217
Query: 245 YCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV--------IDGSYYVTLEGISLGEKML 296
YC+ A LILG + + ++TP+S + Y++ L GIS+G L
Sbjct: 218 YCLPTTA--ADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVAL 275
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL 356
PN + +G+ IDSGT +T L P Y+ L+ E F G PS P
Sbjct: 276 QA-PN-------YRQSGILIDSGTVITRLPPPVYRDLKAEFLKQFSG-FPSAPPFSILDT 326
Query: 357 CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDINGERFKD 414
C++ N D P + F G A+L +D +FY + +S CLA+ + E
Sbjct: 327 CFNLN-GYDEVDIPTIRMQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDE---- 381
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
+ IIG Q+N V Y+ +L F C
Sbjct: 382 IPIIGNYQQRNQRVIYNTKESKLGFAAEACSF 413
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 164/356 (46%), Gaps = 49/356 (13%)
Query: 114 AVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTN---------DCG 161
++DTGS L WV+CQPC++C F+PS S +Y T+ C S C + CG
Sbjct: 148 VIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCG 207
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTG 221
P C Y + Y +G ++G +G+E + S F+ FGC NN F G
Sbjct: 208 SNPPSCNYVVNYGDGSYTRGELGTEHLDLGNSTAVNNFI----FGCGRNNQGL----FGG 259
Query: 222 VFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV 277
GL S+ SL+ + G FSYC+ + E A L++G + + ++TP+S
Sbjct: 260 ASGLVGLGRSSLSLISQTSAMFGGVFSYCL-PITETE-ASGSLVMGGNSSVYKNTTPISY 317
Query: 278 IDGS-------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAY 330
Y++ L GI++G + + F K+ G+ IDSGT +T L PS Y
Sbjct: 318 TRMIPNPQLPFYFLNLTGITVGS--VAVQAPSFGKD------GMMIDSGTVITRLPPSIY 369
Query: 331 QTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVF 390
Q L+ E F G PS P C++ + ++++ P + HF G A+L +D VF
Sbjct: 370 QALKDEFVKQFSG-FPSAPAFMILDTCFNLSGYQEVE-IPNIKMHFEGNAELNVDVTGVF 427
Query: 391 Y--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
Y + +S CLA+ E + IIG Q+N V YD L F C
Sbjct: 428 YFVKTDASQVCLAIASLSYENE----VGIIGNYQQKNQRVIYDTKGSMLGFAAEAC 479
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 114/374 (30%), Positives = 169/374 (45%), Gaps = 45/374 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ S+G P + DTGS LIW++C+PC+ C FDP S +Y T+ C +
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTL 99
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-N 210
C C PD C Y+ Y +G ++GT+ SE ++ K ++ FGC H N
Sbjct: 100 CDSLPRKSCS--PD-CDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLN 156
Query: 211 NAHFSDEQFTGVFGLGPATSS-THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE----- 264
F+D +G+ GLG S L + G KFSYC+ + + G+
Sbjct: 157 RGSFNDA--SGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSH 214
Query: 265 --GAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
G L TPM ++ YYV L+ IS+ + L I F S G+ DSG
Sbjct: 215 SSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS-GGMIFDSG 273
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM----DPAWHLCY--SGNINRDLQGFPAMA 373
TTLT L + YQ + + + S+P LCY SG+ PAM
Sbjct: 274 TTLTLLPDAPYQIVLRALRSKI-----SFPKIDGSSAGLDLCYDVSGSKASYKMKIPAMV 328
Query: 374 FHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
FHF GAD L E+ F ++ ++ CLA+ S++ D+ I G + QQN+ V YD
Sbjct: 329 FHFE-GADYQLPVENYFIAANDAGTIVCLAMVSSNM------DIGIYGNMMQQNFRVMYD 381
Query: 432 LVSKQLYFQRIDCE 445
+ S ++ + C+
Sbjct: 382 IGSSKIGWAPSQCD 395
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 118/399 (29%), Positives = 174/399 (43%), Gaps = 53/399 (13%)
Query: 64 LNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLI 123
S AR Y+ + K AHL + ++ + V S G P VPQ+ V+DTGS +
Sbjct: 82 FRRSRARPSYIVRG---KKVSVPAHLGTSVMSLE-YVVRVSFGTPAVPQVVVIDTGSDVS 137
Query: 124 WVKCQPCE--QCGATT---FDPSKSLTYATLPCDSSYCTN---DCGG----YPDECWYNI 171
W++C+PC QC +DPS S TY+ +PC S C D G +C + I
Sbjct: 138 WLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAI 197
Query: 172 RYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS 231
Y +G + G ++ + F FGC H H F GV GLG
Sbjct: 198 SYADGTSTVGAYSQDKLTLAPGAIVQNFY----FGCGHGK-HAVRGLFDGVLGLG---RL 249
Query: 232 THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS-TPMSVIDGS---YYVTLE 287
SL + G FSYC+ +++ L LG G G TPM + G VTL
Sbjct: 250 RESLGARYGGVFSYCLPSVSSKP---GFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLA 306
Query: 288 GISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG--LL 345
GI++G K LD+ P+ F G+ +DSGT +T L +AY+ LR + LL
Sbjct: 307 GINVGGKKLDLRPSAF-------SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLL 359
Query: 346 PSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPS 405
P+ +D ++L N+ P +A F GGA + LD + CLA S
Sbjct: 360 PNGDLDTCYNLTGYKNVV-----VPKIALTFTGGATINLDVPNGILVNG----CLAFAES 410
Query: 406 DINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+G ++G + Q+ + V +D + + F+ C
Sbjct: 411 GPDGS----AGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 134/444 (30%), Positives = 187/444 (42%), Gaps = 70/444 (15%)
Query: 38 LVTKLLHRDSLLYNP------NDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHP 91
L L+HR Y P +D TL S AR Y+ ++S T
Sbjct: 55 LSVPLVHR----YGPCAASQYSDMPTPSFSETLRHSRARTNYIKSRASTGMASTPDD--- 107
Query: 92 GISTVPV----------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE--QCGATT-- 137
TVP + V G P VPQ+ ++DTGS + WV+C PC +C
Sbjct: 108 AAVTVPTRLGGFVDSLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDP 167
Query: 138 -FDPSKSLTYATLPCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFN 189
FDPSKS TYA + C + C N C +C Y + Y +G ++G +E
Sbjct: 168 LFDPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETIT 227
Query: 190 FETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIG 248
F K F FGC H+ SD +F G+ GLG A S V G FSYC+
Sbjct: 228 FAPGITVKDFH----FGCGHDQRGPSD-KFDGLLGLGGAPESLVVQTASVYGGAFSYCLP 282
Query: 249 NLNYFEYAYNMLILGEGAILEGDS---TPM---SVIDGSYYVTLEGISLGEKMLDIDPNL 302
LN E + L + A + TPM + SY V + GIS+G K LDI +
Sbjct: 283 ALNS-EAGFLALGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSA 341
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYSG 360
F+ G+ IDSGT +T L +AY L + F +YPM + + CY+
Sbjct: 342 FR-------GGMLIDSGTIVTELPETAYNALNAALRKAFA----AYPMVASEDFDTCYNF 390
Query: 361 NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGM 420
++ P +A F+GGA + LD + + CLA S + L IIG
Sbjct: 391 TGYSNVT-VPRVALTFSGGATIDLDVPNGILVKD----CLAFRESGPD----VGLGIIGN 441
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDC 444
+ Q+ V YD ++ F+ C
Sbjct: 442 VNQRTLEVLYDAGHGKVGFRAGAC 465
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 116/373 (31%), Positives = 169/373 (45%), Gaps = 57/373 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSS 154
+ V +G P + DTGS L W +C+PC Q FDPS SL+Y+ + CDS
Sbjct: 147 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSP 206
Query: 155 YCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C N G C Y IRY +G S G E+ + ++D F FGC
Sbjct: 207 SCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTDVFNNFQ----FGC 262
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLV----EKVGSKFSYCIGNLNYFEYAYNMLILG 263
NN F G GL + SLV +K G FSYC L + L G
Sbjct: 263 GQNNRGL----FGGTAGLLGLARNPLSLVSQTAQKYGKVFSYC---LPSSSSSTGYLSFG 315
Query: 264 EGAILEGDS-----TPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
G +GDS TP V Y++ + GIS+GE+ L I ++F S AG
Sbjct: 316 SG---DGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVF------STAGTI 366
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAM 372
IDSGT ++ L P+ Y +++K +F+ L+ YP + CY + + ++ P +
Sbjct: 367 IDSGTVISRLPPTVYSSVQK----VFRELMSDYPRVKGVSILDTCYDLSKYKTVK-VPKI 421
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+F+GGA++ L E + Y S CLA G SD + +++IIG + Q+ +V YD
Sbjct: 422 ILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSDDD-----EVAIIGNVQQKTIHVVYD 476
Query: 432 LVSKQLYFQRIDC 444
++ F C
Sbjct: 477 DAEGRVGFAPSGC 489
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 112/377 (29%), Positives = 178/377 (47%), Gaps = 50/377 (13%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYC---T 157
++ +IG PP VLDTGS L W+ C+ + TF+P S +Y PC+SS C T
Sbjct: 61 ISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNS-TFNPLLSSSYTPTPCNSSVCMTRT 119
Query: 158 ND------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
D C C + Y + ++GT+ +E F+ + + T FGC +
Sbjct: 120 RDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTL-----FGCMDSA 174
Query: 212 AHFS----DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG-- 265
+ S D + TG+ G+ + S + + V KFSYCI E A+ +L+LG+G
Sbjct: 175 GYTSDINEDAKTTGLMGMNRGSLSL--VTQMVLPKFSYCISG----EDAFGVLLLGDGPS 228
Query: 266 AILEGDSTPMSVIDGS--------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
A TP+ S Y V LEGI + EK+L + ++F + T + +D
Sbjct: 229 APSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGA-GQTMVD 287
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINRDLQGFPAM 372
SGT T+L+ Y +L+ E + +G+L P++ + A LCY + L PA+
Sbjct: 288 SGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPAS--LAAVPAV 345
Query: 373 AFHFAGGADLVLDAESVFYQESSS---VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
F+ GA++ + E + Y+ S V+C G SD+ G + +IG QQN +
Sbjct: 346 TLVFS-GAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLG---IEAYVIGHHHQQNVWME 401
Query: 430 YDLVSKQLYFQRIDCEL 446
+DLV ++ F C+L
Sbjct: 402 FDLVKSRVGFTETTCDL 418
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 112/370 (30%), Positives = 160/370 (43%), Gaps = 54/370 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS WV+CQPC EQ FDP++S TYA + C +
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQ-REKLFDPARSSTYANVSCAA 238
Query: 154 SYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C++ C G C Y ++Y +G S G + + D K F FGC
Sbjct: 239 PACSDLNIHGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFR----FGCGE 292
Query: 210 NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGE 264
N E G+ GLG TS +K G F++C+ Y ++ L
Sbjct: 293 RNEGLFGEA-AGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLAAAR 351
Query: 265 GAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+ +TPM +G YYV + GI +G ++L I ++F + AG +DSGT +
Sbjct: 352 ARL----TTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVF------ATAGTIVDSGTVI 401
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAF 374
T L P+AY +LR Y PA L CY D G P ++
Sbjct: 402 TRLPPAAYSSLRYAFAAAMA--ARGYKKAPAVSLLDTCY------DFTGMSQVAIPTVSL 453
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F GGA L +DA + Y S+S CLA ++ G D+ I+G + + VAYD+
Sbjct: 454 LFQGGARLDVDASGIMYAASASQVCLAFAANEDGG----DVGIVGNTQLKTFGVAYDIGK 509
Query: 435 KQLYFQRIDC 444
K + F C
Sbjct: 510 KVVGFYPGAC 519
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 123/431 (28%), Positives = 194/431 (45%), Gaps = 53/431 (12%)
Query: 36 KRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIST 95
K LV LHRD++ +N ++ A+ Q L +++ ++ K D + G S
Sbjct: 101 KSLVLSRLHRDTVRFN---SLTARLQLALE-DISKSDLKPLETEIKPEDLSTPVTSGTSQ 156
Query: 96 -VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC 151
++ +G P VLDTGS + W++CQPC C T FDP+ S TYA + C
Sbjct: 157 GSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTC 216
Query: 152 DSSYCT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
S C+ + C +C Y + Y +G + G +E +F S K +V GC
Sbjct: 217 QSQQCSSLEMSSC--RSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVK----NVALGC 270
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYA---YNMLILG 263
H+N F G GL SL ++ + FSYC+ N + + +N LG
Sbjct: 271 GHDNEGL----FVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFNSAQLG 326
Query: 264 EGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
++ + P+ ID YYV L G+S+G +M+ I + F+ +++ + G+ +D GT
Sbjct: 327 VDSV----TAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDES-GNGGIIVDCGT 381
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG-----FPAMAFH 375
+T L AY LR + Q L + + + CY DL G P ++FH
Sbjct: 382 AITRLQTQAYNPLRDAFVRMTQNLKLTSAV-ALFDTCY------DLSGQASVRVPTVSFH 434
Query: 376 FAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
FA G L A + +S+ +C A P+ LSIIG + QQ V +DL +
Sbjct: 435 FADGKSWNLPAANYLIPVDSAGTYCFAFAPTT------SSLSIIGNVQQQGTRVTFDLAN 488
Query: 435 KQLYFQRIDCE 445
++ F C+
Sbjct: 489 NRMGFSPNKCQ 499
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 103/362 (28%), Positives = 160/362 (44%), Gaps = 32/362 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ IG P LDTGS + W++C PC C + +DPS S +Y + C S+
Sbjct: 45 YFARMGIGSPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSAL 104
Query: 156 CT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C G C Y + Y + S G +G E F T + ++ FGC H+N
Sbjct: 105 CQALDYSACQGM--GCSYRVVYGDSSASSGDLGIESFYL--GPNSSTAMRNIAFGCGHSN 160
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYC-IGNLNYFEYAYNMLILGEGAI-LE 269
+ + + G S + +G FSYC + + + + LI G AI
Sbjct: 161 SGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFA 220
Query: 270 GDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
TP+ ID YY L GIS+G L I P F + G +DSGT++T +V
Sbjct: 221 ARFTPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGT-GGAILDSGTSVTRVV 279
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLV 383
P+AY LR D ++ + P P +L C++ +Q P++ HF D+V
Sbjct: 280 PAAYAVLR----DAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQ-IPSLVLHFDNDVDMV 334
Query: 384 LDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L ++ + S FCLA PS + +S+IG + QQ + + +DL +
Sbjct: 335 LPGGNILIPVDRSGTFCLAFAPSSM------PISVIGNVQQQTFRIGFDLQRSLIAIAPR 388
Query: 443 DC 444
+C
Sbjct: 389 EC 390
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/371 (29%), Positives = 169/371 (45%), Gaps = 36/371 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSY 155
+ + SIG PPV A +DTGS LIW++C PC C FDP S TY+ + S
Sbjct: 59 YLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSES 118
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C+ C + C Y Y + ++G + E ++ L V FGC HNN
Sbjct: 119 CSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNN 178
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSK-----FSYCIGNLNYFEYAYNMLILGEGA 266
+++ G+ GLG SLV ++GS FS C+ + + + G+G+
Sbjct: 179 NGVFNDKEMGIIGLG---RGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGS 235
Query: 267 ILEGD---STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
+ G+ STP+ + Y+VTL GIS+ + +++ N + + + IDSGT
Sbjct: 236 EVLGNGVVSTPLVSKNTHQAFYFVTLLGISVED--INLPFNDGSSLEPITKGNMVIDSGT 293
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP--AWHLCYSGNINRDLQGFPAMAFHFAG 378
T L Y L +EV + + L P+DP + LCY N L+G + HF
Sbjct: 294 PTTLLPEDFYHRLVEEVRN--KVALDPIPIDPTLGYQLCYRTPTN--LKG-TTLTAHFE- 347
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
GAD++L +F +FC A + N + I G AQ NY + +DL + +
Sbjct: 348 GADVLLTPTQIFIPVQDGIFCFAFTSTFSN-----EYGIYGNHAQSNYLIGFDLEKQLVS 402
Query: 439 FQRIDCELLAD 449
F+ DC L D
Sbjct: 403 FKATDCTNLQD 413
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 117/394 (29%), Positives = 171/394 (43%), Gaps = 63/394 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------------TFDPSKSLTY 146
++V F +G P P L V DTGS L WVKC+ ++ F P S T+
Sbjct: 97 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156
Query: 147 ATLPCDSSYCTND-------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETS--DEGK 197
A + C S CT C C Y+ RY +G ++GT+G+E S +E K
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREERK 216
Query: 198 TFLYDVGFGCSHNNAHFSDEQFTGVFGLG-PATSSTHSLVEKVGSKFSYCIGNLNYFEYA 256
L + GCS + S E GV LG S + G +FSYC+ + A
Sbjct: 217 AKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRNA 276
Query: 257 YNMLILGEGAIL---------------EGDSTPMSVIDGS----YYVTLEGISLGEKMLD 297
+ L G + TP+ ++D Y V+L+ IS+ + L
Sbjct: 277 TSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPL-LLDRRMRPFYDVSLKAISVAGEFLK 335
Query: 298 IDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL--LPSYPMDPAWH 355
I ++ D + GV +DSGT+LT L AY+ + V L +GL LP MDP +
Sbjct: 336 IPRAVW---DVEAGGGVILDSGTSLTVLAKPAYRAV---VAALSKGLAGLPRVTMDP-FE 388
Query: 356 LCY--SGNINRDLQ-GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV--GPSDINGE 410
CY + +D P MA HFAG A L +S + V C+ + GP
Sbjct: 389 YCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGP------ 442
Query: 411 RFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ +S+IG I QQ + +D+ +++L FQR C
Sbjct: 443 -WPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 475
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 110/377 (29%), Positives = 167/377 (44%), Gaps = 42/377 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++++ +G PP L ++DTGS L W++C+PC+ C FDPS+S ++ +PC+++
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 230
Query: 156 C---------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF-LYDVGF 205
C N P C Y Y + + G + E + SD + + D+
Sbjct: 231 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 290
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEK-VGSKFSYCIGNLNYFEYAYNMLILGE 264
GC H+N + A S L +G FSYC+ + + + G
Sbjct: 291 GCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISFGA 350
Query: 265 GAIL-----EGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
G L + TP + S YY+ ++GI + +++L I F S G
Sbjct: 351 GFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGS-GGTI 409
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM-DP--AWHLCYSGNINRDLQGFPAM 372
IDSGTTLT+L AY + VE F + SYP DP +CY+ R FP +
Sbjct: 410 IDSGTTLTYLNRDAY----RAVESAFLARI-SYPRADPFDILGICYNAT-GRTAVPFPTL 463
Query: 373 AFHFAGGADLVLDAESVFYQ--ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+ F GA+L L E+ F Q + CLA+ P+D +SIIG QQN + Y
Sbjct: 464 SIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTD-------GMSIIGNFQQQNIHFLY 516
Query: 431 DLVSKQLYFQRIDCELL 447
D+ +L F DC L
Sbjct: 517 DVQHARLGFANTDCSAL 533
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 123/437 (28%), Positives = 181/437 (41%), Gaps = 61/437 (13%)
Query: 38 LVTKLLHRDSLL---YNPN----DTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLH 90
T L RDS L +NP+ D++ +R+ + S +L+ S T
Sbjct: 28 FTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVS------TACIRS 81
Query: 91 PGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYA 147
P I F ++ IG PPV +A+ DTGS L W +C PC +C F+P +S +Y
Sbjct: 82 PIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYR 141
Query: 148 TLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
+ C S C + CG C Y Y + + G + S+Q + KT +
Sbjct: 142 KVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVI--- 198
Query: 204 GFGCSHNNAHFSDEQFTGVFG-----------LGPATSSTHSLVEKVGSKFSYCIGNLNY 252
GC H N G FG + S + V +FSYC+
Sbjct: 199 --GCGHQNG--------GTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFS 248
Query: 253 FEYAYNMLILGEGAILEGD---STPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKND 307
+ G A++ G STP+ D Y++TLE IS+G+K + +
Sbjct: 249 NANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTN 308
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ 367
+ IDSGTTLT L S Y + + + + P LCYS DL
Sbjct: 309 ---HGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSG-ILELCYSAGQVDDLN 364
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P + HFAGGAD+ L + F + +V CL P+ ++I G +AQ N+
Sbjct: 365 -IPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPA-------TQVAIFGNLAQINFE 416
Query: 428 VAYDLVSKQLYFQRIDC 444
V YDL +K+L F+ C
Sbjct: 417 VGYDLGNKRLSFEPKLC 433
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 118/397 (29%), Positives = 181/397 (45%), Gaps = 66/397 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT--------------TFDPSKSL 144
++V F +G P P L V DTGS L WVKC+P + A+ F P KS
Sbjct: 95 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154
Query: 145 TYATLPCDSSYCTND-------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE------ 191
T+A +PC S C+ C C Y+ RY +G ++GT+G+E
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSSS 214
Query: 192 --TSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIG 248
+ K L + GC+ + S E GV LG + S S + G +FSYC+
Sbjct: 215 SSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFSYCLV 274
Query: 249 NLNYFEYAYNMLILGEGAILEG----------DSTPMSVIDGS----YYVTLEGISLGEK 294
+ A + L G + L G TP+ V+D Y V+++ IS+ +
Sbjct: 275 DHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPL-VLDSRMRPFYDVSIKAISVDGE 333
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL--LPSYPMDP 352
+L I ++++ + GV +DSGT+LT L AY+ + V L + L P MDP
Sbjct: 334 LLKIPRDVWEVD---GGGGVIVDSGTSLTVLAKPAYRAV---VAALGKKLARFPRVAMDP 387
Query: 353 AWHLCYS-GNINRDLQG--FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV--GPSDI 407
+ CY+ + +R +G P +A HFAG A L ++S + V C+ V GP
Sbjct: 388 -FEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGP--- 443
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ +S+IG I QQ + +DL +++L F+R C
Sbjct: 444 ----WPGISVIGNILQQEHLWEFDLKNRRLRFKRSRC 476
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 125/426 (29%), Positives = 196/426 (46%), Gaps = 61/426 (14%)
Query: 40 TKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVF 99
T L HRDSLL +P + S++ + L+ + + A L+ ++ V
Sbjct: 32 TSLFHRDSLL-SPLEF----------SSLSHYDRLANAFRRSLSRSAALLNRAATSGAVG 80
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSYC 156
+ IG PPV L + DTGS L W +C PC +C F+P KS +++ +PC++ C
Sbjct: 81 LQSSIIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTC 140
Query: 157 -TNDCG--GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH 213
D G G C Y+ Y + S+G +G E+ +S GC H
Sbjct: 141 HAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSSVKSV------IGC----GH 190
Query: 214 FSDEQF---TGVFGLGPATSSTHSLVEK---VGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
S F +GV GLG S S + + + +FSYC+ L +A + G+ A+
Sbjct: 191 ASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTL--LSHANGKINFGQNAV 248
Query: 268 LEGD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG-VFIDSGTT 321
+ G STP+ + YY+TLE IS+G +++ ++ G V IDSGTT
Sbjct: 249 VSGPGVVSTPLISKNTVTYYYITLEAISIGN----------ERHMAFAKQGNVIIDSGTT 298
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYSGNINRDL-QGFPAMAFHFAG 378
L++L Y + V L + + DP W LC+ IN G P + F+G
Sbjct: 299 LSFLPKELYDGV---VSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSG 355
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
GA++ L + F + +++V CL + P+ E IIG +A N+ + YDL +K+L
Sbjct: 356 GANVNLLPVNTFQKVANNVNCLTLTPASPTDE----FGIIGNLALANFLIGYDLEAKRLS 411
Query: 439 FQRIDC 444
F+ C
Sbjct: 412 FKPTVC 417
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 125/428 (29%), Positives = 179/428 (41%), Gaps = 64/428 (14%)
Query: 61 QRTLNMSMARFIYLSQKSS-------QKAHDTRAHLHPGISTVPV----FYVNFSIGQPP 109
+R + S AR LS S + A H PG+ P + ++ +IG PP
Sbjct: 54 RRAMQRSKARAAALSVARSGSGRVPGKSAQQGEQHQQPGVPVRPSGDLEYLIDLAIGTPP 113
Query: 110 VPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTN----DCGG 162
P A+LDTGS LIW +C PC C A F P+ S +Y + C C + C
Sbjct: 114 QPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQLCNDILHHSC-Q 172
Query: 163 YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGV 222
PD C Y Y +G + G +E+F F +S G+ +GFGC N S +G+
Sbjct: 173 RPDTCTYRYNYGDGTTTLGVYATERFTFASS-SGEKLSVPLGFGCGTMNVG-SLNNGSGI 230
Query: 223 FGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILG--EGAILEGDSTPMSVID 279
G G SLV ++ +FSYC+ Y + L+ G + EGD +
Sbjct: 231 VGFG---RDPLSLVSQLSIRRFSYCL--TPYTSTRKSTLMFGSLSDGVFEGDDAATGQVQ 285
Query: 280 GS-----------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+ YYV G+++G + L I + F S GV +DSGT LT L P+
Sbjct: 286 TTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGS-GGVIVDSGTALT-LFPA 343
Query: 329 AYQTLRKEVEDLFQGLLP---SYPMDPAWHLCYSGNI--------NRDLQGFPAMAFHFA 377
A T EV F+ L + P +C++ + + P MAFHF
Sbjct: 344 AVLT---EVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRASAATVVSVPRMAFHFQ 400
Query: 378 GGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQ 436
GADL L + V C+ + S +G + IG QQ+ V YDL ++
Sbjct: 401 -GADLELPRRNYVLDDPRRGSLCILLADSGDSG------ATIGNFVQQDMRVLYDLEAET 453
Query: 437 LYFQRIDC 444
L F C
Sbjct: 454 LSFAPAQC 461
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 111/435 (25%), Positives = 181/435 (41%), Gaps = 48/435 (11%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV-- 98
+++HRD+ V+A A L + R + + S+ A + G++ V
Sbjct: 68 RVVHRDTF------AVNATAGELLKHRLQRDKRRAARISEAAGAGGGNGRKGVAAPVVSG 121
Query: 99 -------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYAT 148
++ +G P L VLDTGS ++WV+C PC +C FDP +S +Y
Sbjct: 122 LAQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGA 181
Query: 149 LPCDSSYCTN-DCGG---YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
+ C ++ C D GG C Y + Y +G + G +E F G + V
Sbjct: 182 VGCGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTF----AGGARVARVA 237
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI-------GNLNYFEYAY 257
GC H+N + S + + G FSYC+ +
Sbjct: 238 LGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRS 297
Query: 258 NMLILGEGAILEGDS--TPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSD 311
+ + G G++ + TPM ++ YYV L GIS+ G ++ + + + + +
Sbjct: 298 STVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGR 357
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD-PAWHLCYSGNINRDLQGFP 370
GV +DSGT++T L ++Y LR G L P + CY R ++ P
Sbjct: 358 GGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVK-VP 416
Query: 371 AMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
++ HFAGGA+ L E+ +S FC A +D +SIIG I QQ + V
Sbjct: 417 TVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTD------GGVSIIGNIQQQGFRVV 470
Query: 430 YDLVSKQLYFQRIDC 444
+D +++ F C
Sbjct: 471 FDGDGQRVGFAPKGC 485
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 168/374 (44%), Gaps = 45/374 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ S+G P + DTGS LIW++C+PC+ C FDP S +Y T+ C +
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTL 99
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-N 210
C C C Y+ Y +G ++GT+ SE ++ K ++ FGC H N
Sbjct: 100 CDSLPRKSCS---PNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLN 156
Query: 211 NAHFSDEQFTGVFGLGPATSS-THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE----- 264
F+D +G+ GLG S L + G KFSYC+ + + G+
Sbjct: 157 RGSFNDA--SGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSH 214
Query: 265 --GAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
G L TPM ++ YYV L+ IS+ + L I F S G+ DSG
Sbjct: 215 SSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS-GGMIFDSG 273
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYP----MDPAWHLCY--SGNINRDLQGFPAMA 373
TTLT L + YQ + + + S+P LCY SG+ + PAM
Sbjct: 274 TTLTLLPDAPYQIVLRALRSKV-----SFPEIDGSSAGLDLCYDVSGSKASYKKKIPAMV 328
Query: 374 FHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
FHF GAD L E+ F ++ ++ CLA+ S++ D+ I G + QQN+ V YD
Sbjct: 329 FHFE-GADHQLPVENYFIAANDAGTIVCLAMVSSNM------DIGIYGNMMQQNFRVMYD 381
Query: 432 LVSKQLYFQRIDCE 445
+ S ++ + C+
Sbjct: 382 IGSSKIGWAPSQCD 395
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 127/442 (28%), Positives = 185/442 (41%), Gaps = 59/442 (13%)
Query: 33 GKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSM---ARFIYLSQKSSQKAHDTR--A 87
G+P LLHRD++ T + L ++ AR YL ++ S T +
Sbjct: 67 GRPS---LALLHRDAV---SGRTYPSTRHAMLGLAARDGARVEYLQRRLSPTTMTTEVGS 120
Query: 88 HLHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKS 143
+ GIS ++V +G PP Q V+D+GS +IW++C+PC +C FDP+ S
Sbjct: 121 EVVSGISEGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAAS 180
Query: 144 LTYATLPCDSSYCTNDCGGY-----PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
++ +PCDS C GG C Y + Y +G +QG + E F S T
Sbjct: 181 ASFTAVPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDS----T 236
Query: 199 FLYDVGFGCSHNNAHFSDEQFTGVFGL-----GPATSSTHSLVEKVGSKFSYCIGNLNYF 253
+ V GC H N F G GL GP S L G FSYC+ +
Sbjct: 237 PVQGVAIGCGHRNRGL----FVGAAGLLGLGWGP-MSLVGQLGGAAGGAFSYCLASRGA- 290
Query: 254 EYAYNMLILGEGAILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDT 308
+ L+ G + + + ++ + YYV L G+ +G + L + LF +
Sbjct: 291 DAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTED 350
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
GV +D+GT +T L P AY LR G LP P CY DL G
Sbjct: 351 -GGGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCY------DLSG 403
Query: 369 F-----PAMAFHFA-GGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIA 422
+ P +A +F GA L L A ++ + V+CLA S LSI+G I
Sbjct: 404 YASVRVPTVALYFGRDGAALTLPARNLLVEMGGGVYCLAFAASA------SGLSILGNIQ 457
Query: 423 QQNYNVAYDLVSKQLYFQRIDC 444
QQ + D + + F C
Sbjct: 458 QQGIQITVDSANGYVGFGPSTC 479
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 103/362 (28%), Positives = 161/362 (44%), Gaps = 32/362 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ IG P LDTGS + W++C PC C + +DPS S +Y + C S+
Sbjct: 12 YFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSAL 71
Query: 156 CT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C G C Y + Y + S G +G E F + T + ++ FGC H+N
Sbjct: 72 CQALDYSACQGM--GCSYRVVYGDSSASSGDLGIESFYLGPNSS--TAMRNIAFGCGHSN 127
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYC-IGNLNYFEYAYNMLILGEGAI-LE 269
+ + + G S + +G FSYC + + + + LI G AI
Sbjct: 128 SGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFA 187
Query: 270 GDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
TP+ I+ YY L GIS+G L I P F + G +DSGT++T +V
Sbjct: 188 ARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGT-GGAILDSGTSVTRVV 246
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLV 383
P AY LR D ++ + P P +L C++ +Q P++ HF G D+V
Sbjct: 247 PPAYAVLR----DAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQ-IPSLVLHFDNGVDMV 301
Query: 384 LDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L ++ + S FCLA PS + +S+IG + QQ + + +DL +
Sbjct: 302 LPGGNILIPVDRSGTFCLAFAPSSM------PISVIGNVQQQTFRIGFDLQRSLIAIAPR 355
Query: 443 DC 444
+C
Sbjct: 356 EC 357
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 108/377 (28%), Positives = 174/377 (46%), Gaps = 45/377 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT--------TFDPSKSLTYATLP 150
++V +G P P + V DTGS L WVKC ++ F P+ S +++ LP
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPLP 163
Query: 151 CDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETS-DEG--KTFL 200
CDS C + +C PD C Y+ RY + ++G +G + S ++G K L
Sbjct: 164 CDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRKAKL 223
Query: 201 YDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNM 259
+V GC+ + S + GV LG + S S + G +FSYC+ + A +
Sbjct: 224 QEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRNATSF 283
Query: 260 LILGE-----GAILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNL--FKKND 307
L G G TP+ +++ + Y+V+++ +++ + L+I P++ F+KN
Sbjct: 284 LTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPDVWDFRKN- 342
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ 367
G +DSGT+LT L AY + K + F G +P MDP + CY N
Sbjct: 343 ----GGAILDSGTSLTILATPAYDAVVKAISKQFAG-VPRVNMDP-FEYCY--NWTGVSA 394
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P M FAG A L +S + V C+ V + G + +S+IG I QQ +
Sbjct: 395 EIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIGV----VEGA-WPGVSVIGNILQQEHL 449
Query: 428 VAYDLVSKQLYFQRIDC 444
+DL ++ L F++ C
Sbjct: 450 WEFDLANRWLRFKQSRC 466
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 118/419 (28%), Positives = 178/419 (42%), Gaps = 51/419 (12%)
Query: 61 QRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIST-VPVFYVNFSIGQPPVPQLAVLDTG 119
Q+ N++ A L + + + A L G S ++++ +G PP +LDTG
Sbjct: 131 QQQNNLANAVVASLKSSKDEFSGNIMATLESGASLGTGEYFIDMFVGTPPKHVWLILDTG 190
Query: 120 SSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT--------NDCGGYPDECW 168
S L W++C PC C ++P++S +Y + C C C C
Sbjct: 191 SDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPRCQLVSSPDPLQHCKTENQTCP 250
Query: 169 YNIRYTNGPDSQGTIGSEQF----NFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFG 224
Y Y +G ++ G E F + E + DV FGC H N F +
Sbjct: 251 YFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGGLLGL 310
Query: 225 LGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG------------AILEGDS 272
S L G FSYC+ +L + LI GE +L G+
Sbjct: 311 GRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELLNHHNLNFTKLLAGEE 370
Query: 273 TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV---FIDSGTTLTWLVPSA 329
TP D YY+ ++ I +G ++LDI +K WS GV IDSG+TLT+ SA
Sbjct: 371 TPD---DTFYYLQIKSIVVGGEVLDIP----EKTWHWSSEGVGGTIIDSGSTLTFFPDSA 423
Query: 330 YQTLRKEVE---DLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDA 386
Y +++ E L Q + M P +++ SG + +L P HFA GA A
Sbjct: 424 YDVIKEAFEKKIKLQQIAADDFIMSPCYNV--SGAMQVEL---PDYGIHFADGAVWNFPA 478
Query: 387 ESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
E+ FYQ E V CLA+ + L+IIG + QQN+++ YD+ +L + C
Sbjct: 479 ENYFYQYEPDEVICLAI----LKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 533
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 121/461 (26%), Positives = 189/461 (40%), Gaps = 63/461 (13%)
Query: 20 TRIFTSTTAAPAAGKP---KRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQ 76
T + + + P+ P + K++H+ + A+AQ L +R +
Sbjct: 62 TSLLPAASCKPSTQVPSIENKAFLKVVHKHGPCSDLRQGHKAEAQYILLQDQSRVDSIHS 121
Query: 77 KSSQKA--HDTRAHLHPGISTVPV----------FYVNFSIGQPPVPQLAVLDTGSSLIW 124
K S+ + D +A +T+P ++V +G P + DTGS L W
Sbjct: 122 KLSKDSGLSDVKA---TAATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTW 178
Query: 125 VKCQPC-EQC---GATTFDPSKSLTYATLPCDSSYCTN---------DCGGYPDECWYNI 171
+C+PC + C F+PS+S +YA + C S+ C + +C C Y I
Sbjct: 179 TQCEPCVKSCYNQKEAIFNPSQSTSYANISCGSTLCDSLASATGNIFNCA--SSTCVYGI 236
Query: 172 RYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS 231
+Y + S G G E+ + +D D FGC NN + S
Sbjct: 237 QYGDSSFSIGFFGKEKLSLTATD----VFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSL 292
Query: 232 THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS---YYVTLEG 288
++ FSYC L + L G TP++ I G Y + L G
Sbjct: 293 VSQTAQRYNKIFSYC---LPSSSSSTGFLTFGGSTSKSASFTPLATISGGSSFYGLDLTG 349
Query: 289 ISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY 348
IS+G + L I P++F S AG IDSGT +T L P+AY L F+ L+ Y
Sbjct: 350 ISVGGRKLAISPSVF------STAGTIIDSGTVITRLPPAAYSAL----SSTFRKLMSQY 399
Query: 349 PMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLA-VGP 404
P PA + C+ + N D P + F+GG + +D +FY + CLA G
Sbjct: 400 PAAPALSILDTCFDFS-NHDTISVPKIGLFFSGGVVVDIDKTGIFYVNDLTQVCLAFAGN 458
Query: 405 SDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
SD + D++I G + Q+ V YD + ++ F C
Sbjct: 459 SDAS-----DVAIFGNVQQKTLEVVYDGAAGRVGFAPAGCS 494
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 114/370 (30%), Positives = 158/370 (42%), Gaps = 54/370 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS WV+CQPC EQ FDP++S TYA + C +
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQ-REKLFDPARSSTYANISCAA 238
Query: 154 SYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C T C G C Y ++Y +G S G + + D K F FGC
Sbjct: 239 PACSDLDTRGCSG--GNCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFR----FGCGE 292
Query: 210 NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGE 264
N E G+ GLG TS +K G F++C+ Y ++
Sbjct: 293 RNEGLFGEA-AGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAA-A 350
Query: 265 GAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
GA L +TPM +G YYV + GI +G ++L I ++F + AG +DSGT +
Sbjct: 351 GARL---TTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVF------TTAGTIVDSGTVI 401
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAF 374
T L P+AY +LR Y PA L CY D G P ++
Sbjct: 402 TRLPPAAYSSLRSAFASAMAAR--GYKKAPAVSLLDTCY------DFTGMSQVAIPTVSL 453
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F GGA L +DA + Y S S CL ++ G D+ I+G + + VAYD+
Sbjct: 454 LFQGGARLDVDASGIMYAASVSQVCLGFAANEDGG----DVGIVGNTQLKTFGVAYDIGK 509
Query: 435 KQLYFQRIDC 444
K + F C
Sbjct: 510 KVVGFSPGAC 519
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 178/378 (47%), Gaps = 52/378 (13%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTN-- 158
V+ ++G PP VLDTGS L W+ C+ + TF+P S +Y PC+SS CT
Sbjct: 62 VSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNLNS-TFNPLLSSSYTPTPCNSSICTTRT 120
Query: 159 -------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C + Y + ++GT+ +E F+ + + T FGC +
Sbjct: 121 RDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTL-----FGCMDSA 175
Query: 212 AHFS----DEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEG- 265
+ S D + TG+ G+ + SLV ++ KFSYCI E A +L+LG+G
Sbjct: 176 GYTSDINEDSKTTGLMGMN---RGSLSLVTQMSLPKFSYCISG----EDALGVLLLGDGT 228
Query: 266 -AILEGDSTPMSVIDGS--------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
A TP+ S Y V LEGI + EK+L + ++F + T + +
Sbjct: 229 DAPSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGA-GQTMV 287
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINRDLQGFPA 371
DSGT T+L+ S Y +L+ E + +G+L P++ + A LCY + PA
Sbjct: 288 DSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCY--HAPASFAAVPA 345
Query: 372 MAFHFAGGADLVLDAESVFYQESSS---VFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
+ F+ GA++ + E + Y+ S V+C G SD+ G + +IG QQN +
Sbjct: 346 VTLVFS-GAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLG---IEAYVIGHHHQQNVWM 401
Query: 429 AYDLVSKQLYFQRIDCEL 446
+DL+ ++ F + C+L
Sbjct: 402 EFDLLKSRVGFTQTTCDL 419
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 122/434 (28%), Positives = 182/434 (41%), Gaps = 61/434 (14%)
Query: 35 PKRLVTKLLHRDSLLYNPNDTVDAQ--------AQRTLNMSMARFIYLSQKSSQKAHDTR 86
P L + L RDS T+ AQ A R S + LSQ S +
Sbjct: 88 PDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGE------ 141
Query: 87 AHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKS 143
++ +G P VLDTGS ++W++C PC +C + + FDP KS
Sbjct: 142 ------------YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 189
Query: 144 LTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
TYAT+PC S +C C C Y + Y +G + G +E F +
Sbjct: 190 KTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-----RNR 244
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNM 259
+ V GC H+N + S + KFSYC+ + + +
Sbjct: 245 VKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKP-SS 303
Query: 260 LILGEGAILE-GDSTPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGV 314
++ G A+ TP+ +D YYV L GIS+ G ++ + +LFK D + GV
Sbjct: 304 VVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKL-DQIGNGGV 362
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPA 371
IDSGT++T L+ AY +R D F+ + P + L C+ + N + P
Sbjct: 363 IIDSGTSVTRLIRPAYIAMR----DAFRVGAKTLKRAPDFSLFDTCFDLS-NMNEVKVPT 417
Query: 372 MAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+ HF GAD+ L A + +++ FC A + LSIIG I QQ + V Y
Sbjct: 418 VVLHFR-GADVSLPATNYLIPVDTNGKFCFAFAGT------MGGLSIIGNIQQQGFRVVY 470
Query: 431 DLVSKQLYFQRIDC 444
DL S ++ F C
Sbjct: 471 DLASSRVGFAPGGC 484
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 114/376 (30%), Positives = 169/376 (44%), Gaps = 41/376 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDSS 154
+ + +IG PP A+ DTGS L+W +C PC E+C + ++PS S T+ LPC S+
Sbjct: 97 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 156
Query: 155 --YCTND---CGGYPD---ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
C + G P C YN Y G S G GSE F F +S + + + FG
Sbjct: 157 LNLCAAEARLAGATPPPGCACRYNQTYGTGWTS-GLQGSETFTFGSSPADQVRVPGIAFG 215
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL---- 262
CS+ + SD+ +G + + FSYC+ + +L+
Sbjct: 216 CSNAS---SDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAA 272
Query: 263 -----GEGA-----ILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
G G + PMS YY+ L GIS+G L I P F +
Sbjct: 273 AAALNGTGVRSTPFVPSPSKPPMSTY---YYLNLTGISVGPAALPIPPGAFALRADGT-G 328
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNINRDLQGFPA 371
G+ IDSGTT+T LV +AY+ +R V L + + LC++ + + P+
Sbjct: 329 GLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPS 388
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
M HF GGAD+VL E+ + ++CLA+ S +GE LS +G QQN ++ YD
Sbjct: 389 MTLHFGGGADMVLPVENYMILD-GGMWCLAMR-SQTDGE----LSTLGNYQQQNLHILYD 442
Query: 432 LVSKQLYFQRIDCELL 447
+ + L F C L
Sbjct: 443 VQKETLSFAPAKCSTL 458
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 164/365 (44%), Gaps = 48/365 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSYC 156
+ + SIG P + Q ++DTGS + WV C G++ FDP KS TY C S+ C
Sbjct: 124 AYVITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGAGSSLFFDPGKSSTYTPFSCSSAAC 183
Query: 157 T------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
T N C C Y +RY +G ++ GT GS+ ++++ + F FGCS
Sbjct: 184 TRLEGRDNGC-SLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTEKVENFQ----FGCSET 238
Query: 211 N---AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGA 266
+ ++Q G+ GLG S S GS FSYC L + L LG
Sbjct: 239 SDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYC---LPATTRSSGFLTLGAST 295
Query: 267 ILEG-DSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
G +TPM + Y+V L+GI++G + I P +F AG +DSGT +
Sbjct: 296 GTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFA-------AGSIMDSGTII 348
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGG 379
T L P AY L F+ + YP A+ + C+ +D PA+ F+GG
Sbjct: 349 TRLPPRAYSALSAA----FRAGMRRYPRARAFSILDTCFD-FTGQDNVSIPAVELVFSGG 403
Query: 380 ADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
A + LDA+ + Y CLA P+ SIIG + Q+ + V +D+ L F
Sbjct: 404 AVVDLDADGIMYGS-----CLAFAPATGG-----IGSIIGNVQQRTFEVLHDVGQSVLGF 453
Query: 440 QRIDC 444
+ C
Sbjct: 454 RPGAC 458
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 114/376 (30%), Positives = 169/376 (44%), Gaps = 41/376 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDSS 154
+ + +IG PP A+ DTGS L+W +C PC E+C + ++PS S T+ LPC S+
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 151
Query: 155 --YCTND---CGGYPD---ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
C + G P C YN Y G S G GSE F F +S + + + FG
Sbjct: 152 LNLCAAEARLAGATPPPGCACRYNQTYGTGWTS-GLQGSETFTFGSSPADQVRVPGIAFG 210
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL---- 262
CS+ + SD+ +G + + FSYC+ + +L+
Sbjct: 211 CSNAS---SDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAA 267
Query: 263 -----GEGA-----ILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
G G + PMS YY+ L GIS+G L I P F +
Sbjct: 268 AAALNGTGVRSTPFVPSPSKPPMSTY---YYLNLTGISVGAAALPIPPGAFALRADGT-G 323
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNINRDLQGFPA 371
G+ IDSGTT+T LV +AY+ +R V L + + LC++ + + P+
Sbjct: 324 GLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPS 383
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
M HF GGAD+VL E+ + ++CLA+ S +GE LS +G QQN ++ YD
Sbjct: 384 MTLHFGGGADMVLPVENYMILD-GGMWCLAMR-SQTDGE----LSTLGNYQQQNLHILYD 437
Query: 432 LVSKQLYFQRIDCELL 447
+ + L F C L
Sbjct: 438 VQKETLSFAPAKCSTL 453
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 131/438 (29%), Positives = 187/438 (42%), Gaps = 71/438 (16%)
Query: 35 PKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS 94
P +++ L R N + +QA +++ M MA S A T G
Sbjct: 75 PTPSISETLRRSRARTN---YIMSQASKSMGMGMA-----STPDDDDAAVTIPTRLGGFV 126
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFDPSKSLTYATL 149
+ V G P VPQ+ ++DTGS + WV+C PC FDPSKS TYA +
Sbjct: 127 DSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPI 186
Query: 150 PCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
C++ C N C +C Y++ Y +G S+G +E G T + D
Sbjct: 187 ACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLTLA---PGIT-VED 242
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLI 261
FGC + SD ++ G+ GLG A S V G FSYC+ LN L+
Sbjct: 243 FHFGCGRDQRGPSD-KYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALN---SEAGFLV 298
Query: 262 LGEGAILEGDS---TPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
LG + TPM + G Y VT+ GIS+G K L I + F+ G+
Sbjct: 299 LGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFR-------GGMI 351
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYSGNINRDLQGFPAMA 373
IDSGT T L +AY L E + L +YP+ P+ + CY+ ++ P +A
Sbjct: 352 IDSGTVDTELPETAYNAL----EAALRKALKAYPLVPSDDFDTCYNFTGYSNIT-VPRVA 406
Query: 374 FHFAGGADLVLDA-------ESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
F F+GGA + LD + + +QES GP D L IIG + Q+
Sbjct: 407 FTFSGGATIDLDVPNGILVNDCLAFQES--------GPDD-------GLGIIGNVNQRTL 451
Query: 427 NVAYDLVSKQLYFQRIDC 444
V YD + F+ C
Sbjct: 452 EVLYDAGRGNVGFRAGAC 469
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 114/376 (30%), Positives = 169/376 (44%), Gaps = 41/376 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDSS 154
+ + +IG PP A+ DTGS L+W +C PC E+C + ++PS S T+ LPC S+
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA 151
Query: 155 --YCTND---CGGYPD---ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
C + G P C YN Y G S G GSE F F +S + + + FG
Sbjct: 152 LNLCAAEARLAGATPPPGCACRYNQTYGTGWTS-GLQGSETFTFGSSPADQVRVPGIAFG 210
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL---- 262
CS+ + SD+ +G + + FSYC+ + +L+
Sbjct: 211 CSNAS---SDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAA 267
Query: 263 -----GEGA-----ILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
G G + PMS YY+ L GIS+G L I P F +
Sbjct: 268 AAALNGTGVRSTPFVPSPSKPPMSTY---YYLNLTGISVGPAALPIPPGAFALRADGT-G 323
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNINRDLQGFPA 371
G+ IDSGTT+T LV +AY+ +R V L + + LC++ + + P+
Sbjct: 324 GLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPS 383
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
M HF GGAD+VL E+ + ++CLA+ S +GE LS +G QQN ++ YD
Sbjct: 384 MTLHFGGGADMVLPVENYMILD-GGMWCLAMR-SQTDGE----LSTLGNYQQQNLHILYD 437
Query: 432 LVSKQLYFQRIDCELL 447
+ + L F C L
Sbjct: 438 VQKETLSFAPAKCSTL 453
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 121/375 (32%), Positives = 173/375 (46%), Gaps = 57/375 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ + IG PP+ +DTGS LIWV+C PC C FDP KS TY + CDS
Sbjct: 64 YLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSSTYTNISCDSPL 123
Query: 156 C----TNDCGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF-LYDVGFGCSH 209
C +C P++ C Y Y + ++G + E TS+ GK L + FGC H
Sbjct: 124 CYKPYIGECS--PEKRCDYTYGYADSSLTKGVLAQETVTL-TSNTGKPISLQGILFGCGH 180
Query: 210 NN-AHFSDEQFTGVFGLGPATSSTHSLVEKV--GSKFSYCIGNL-------NYFEYAYNM 259
NN +F+D + G+ GLG +S S + + G KFS C+ + +
Sbjct: 181 NNTGNFNDHEM-GLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGS 239
Query: 260 LILGEGAILEGDSTPMSVID---GSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
+LGEG + +TP+ + SYYVTL GIS+ + L + N T + +
Sbjct: 240 EVLGEGVV----TTPLVQREQDMTSYYVTLLGISVEDTYLPM-------NSTIEKGNMLV 288
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM--DPAW--HLCYSGNINRDLQGFPAM 372
DSGT L Y + EV++ +P P+ DP+ LCY N L+G P +
Sbjct: 289 DSGTPPNILPQQLYDRVYVEVKN----KVPLEPITDDPSLGPQLCYRTQTN--LKG-PTL 341
Query: 373 AFHFAGGADLVLDAESVFY---QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
+HF GA+L+L F E+ VFCLA I D I G AQ NY +
Sbjct: 342 TYHFE-GANLLLTPIQTFIPPTPETKGVFCLA-----ITNCANSDPGIYGNFAQTNYLIG 395
Query: 430 YDLVSKQLYFQRIDC 444
+DL + + F+ DC
Sbjct: 396 FDLDRQIVSFKPTDC 410
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 113/377 (29%), Positives = 170/377 (45%), Gaps = 58/377 (15%)
Query: 80 QKAHDTRAHLHPGISTVPVF--YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT 137
Q A +R+ P V F V S+G P V Q +DTGS + WV+C+PC +
Sbjct: 122 QLATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNS 181
Query: 138 -----FDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQF 188
FDP+KS TY+ +PC + C+ + G +C Y + Y +G ++ G GS+
Sbjct: 182 QRDQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTL 241
Query: 189 NFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFS 244
+ TFL FGC H A F G+ GL + SL + G FS
Sbjct: 242 ALAPGNTVGTFL----FGCGHAQAGM----FAGIDGLLALGRQSMSLKSQAAGAYGGVFS 293
Query: 245 YCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDP 300
YC L + A L LG + G +T + + Y V L GIS+G + + +
Sbjct: 294 YC---LPSKQSAAGYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPA 350
Query: 301 NLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWH 355
+ F G +D+GT +T L P+AY LR F+G + PS P +
Sbjct: 351 SAFA-------GGTVVDTGTVITRLPPTAYAALRSA----FRGAIAPCGYPSAPANGILD 399
Query: 356 LCYSGNINR-DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKD 414
CY + +R + P +A F+GGA L L+A + S CLA P+ +G D
Sbjct: 400 TCY--DFSRYGVVTLPTVALTFSGGATLALEAPGIL-----SSGCLAFAPNGGDG----D 448
Query: 415 LSIIGMIAQQNYNVAYD 431
+I+G + Q+++ V +D
Sbjct: 449 AAILGNVQQRSFAVRFD 465
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 115/418 (27%), Positives = 173/418 (41%), Gaps = 44/418 (10%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFY-VNFSIGQPPVPQLA 114
++ +R +AR S + A + G++ Y ++ +G PP
Sbjct: 105 IETMHRRAARSGVARMPASSSPRRALSERMVATVESGVAVGSGEYLIDVYVGTPPRRFRM 164
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC--------TNDCGG- 162
++DTGS L W++C PC C FDP+ S +Y + C C C
Sbjct: 165 IMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQRCGLVAPPEAPRACRRP 224
Query: 163 YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD-VGFGCSHNNAHFSDEQFTG 221
D C Y Y + ++ G + E F + G + D V FGC H N
Sbjct: 225 AEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGL 284
Query: 222 VFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD--------ST 273
+ S L G FSYC+ + + A + ++ GE ++ +
Sbjct: 285 LGLGRGPLSFASQLRAVYGHTFSYCL--VEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAP 342
Query: 274 PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW-----SDAGVFIDSGTTLTWLVPS 328
S D YYV L+G+ +G +L+I +DTW G IDSGTTL++ V
Sbjct: 343 TSSPADTFYYVKLKGVLVGGDLLNI------SSDTWDVGKDGSGGTIIDSGTTLSYFVEP 396
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNINRDLQGFPAMAFHFAGGADLVLDAE 387
AYQ +R+ DL L P P P + CY+ + R P ++ FA GA AE
Sbjct: 397 AYQVIRQAFVDLMSRLYPLIPDFPVLNPCYNVSGVER--PEVPELSLLFADGAVWDFPAE 454
Query: 388 SVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ F + + + CLAV G +SIIG QQN++V YDL + +L F C
Sbjct: 455 NYFVRLDPDGIMCLAV-----RGTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRC 507
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 116/421 (27%), Positives = 178/421 (42%), Gaps = 56/421 (13%)
Query: 41 KLLHRDSLLY------NPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS 94
LLHRD L + ND + A R + LS + D+R + +
Sbjct: 75 NLLHRDKLSHVHGHRRGFNDRMKRDAIRVATLVRR----LSHGAPAAVKDSRYKVANFAT 130
Query: 95 TV--------PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKS 143
V ++V +G PP Q V+D+GS ++WV+C+PC +C FDP+ S
Sbjct: 131 DVISGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADS 190
Query: 144 LTYATLPCDSSYCTN--DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
++A + C S C + G C Y + Y +G ++GT+ ET G+ +
Sbjct: 191 SSFAGVSCGSDVCDRLENTGCNAGRCRYEVSYGDGSYTKGTLA-----LETLTVGQVMIR 245
Query: 202 DVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
DV GC H N + G + S L + G FSYC+ ++ + L
Sbjct: 246 DVAIGCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCL--VSRGTGSTGALE 303
Query: 262 LGEGAILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
G GA+ G +T +S+I YY+ L GI +G + + F+ + + GV +
Sbjct: 304 FGRGALPVG-ATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTE-YGTNGVVM 361
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PA 371
D+GT +T +AY R LP P + CY DL GF P
Sbjct: 362 DTGTAVTRFPTAAYVAFRDSFTAQTSN-LPRAPGVSIFDTCY------DLNGFESVRVPT 414
Query: 372 MAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
++F+F+ G L L A + + FCLA PS LSIIG I Q+ +++
Sbjct: 415 VSFYFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSP------SGLSIIGNIQQEGIQISF 468
Query: 431 D 431
D
Sbjct: 469 D 469
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 123/433 (28%), Positives = 178/433 (41%), Gaps = 61/433 (14%)
Query: 34 KPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGI 93
K + +LL RD L QR M+ A + S+ + L +
Sbjct: 70 KKRPTEEELLKRDQLRAE-------HIQRKFAMNAAVDGAGDLQQSKVSSSVPTKLGSSL 122
Query: 94 STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQ--CGATT---FDPSKSLTYAT 148
T+ + ++ +G P V Q +DTGS + WV+C PC C A T FDP+KS TY
Sbjct: 123 DTLE-YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRA 181
Query: 149 LPCDSSYCT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLY 201
+ C ++ C N CG EC Y ++Y +G + GT + SD K F
Sbjct: 182 VSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQ- 240
Query: 202 DVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCI------GNLNYFE 254
FGCSH + FSD Q G+ GLG S S G+ FSYC+
Sbjct: 241 ---FGCSHVESGFSD-QTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLG 296
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ +L P Y L+ I++G K L + P++F AG
Sbjct: 297 GGGGVSGFVTTRMLRSRQIPT-----FYGARLQDIAVGGKQLGLSPSVFA-------AGS 344
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPA 371
+DSGT +T L P+AY L F+ + Y PA + C+ + P
Sbjct: 345 VVDSGTIITRLPPTAYSAL----SSAFKAGMKQYRSAPARSILDTCFDFAGQTQIS-IPT 399
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+A F+GGA + LD + Y CLA + +G IIG + Q+ + V YD
Sbjct: 400 VALVFSGGAAIDLDPNGIMYGN-----CLAFAATGDDGT----TGIIGNVQQRTFEVLYD 450
Query: 432 LVSKQLYFQRIDC 444
+ S L F+ C
Sbjct: 451 VGSSTLGFRSGAC 463
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 127/459 (27%), Positives = 200/459 (43%), Gaps = 61/459 (13%)
Query: 22 IFTSTTAAPAAGKPKRLVT-KLLHRDSLLYNPNDTVDAQA----QRTLNMSMARFIYLSQ 76
+F S++ + +A PKR + +++H+ N + A+A +N+ R Y+
Sbjct: 48 LFPSSSCSSSAKGPKRKASLEVVHKHGPCSQLNHSGKAEATISHNDIMNLDNERVKYIQS 107
Query: 77 KSSQKAHDTRAHLHPGISTVPV----------FYVNFSIGQPPVPQLAVLDTGSSLIWVK 126
+ S+ +T+P +YV +G P + DTGS L W +
Sbjct: 108 RLSKNLGGENRVKELDSTTLPAKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQ 167
Query: 127 CQPCE-QCGAT---TFDPSKSLTYATLPCDSSYCTN----DCGGYPD-ECWYNIRYTNGP 177
C+PC C FDPSKS +Y + C SS CT C D C Y+++Y +
Sbjct: 168 CEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNS 227
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
S+G + E+ +D ++D FGC +N + F G GL + S V+
Sbjct: 228 ISRGFLSQERLTITATD----IVHDFLFGCGQDN----EGLFRGTAGLMGLSRHPISFVQ 279
Query: 238 KVGS----KFSYCIGNLNYFEYAYNMLILGEGAILEGD--STPMSVIDGS---YYVTLEG 288
+ S FSYC+ + + L G A + TP S I G Y + + G
Sbjct: 280 QTSSIYNKIFSYCLPST---PSSLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVG 336
Query: 289 ISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY 348
IS+G L P + + T+S G IDSGT +T L P+AY LR F+ + Y
Sbjct: 337 ISVGGTKL---PAV--SSSTFSAGGSIIDSGTVITRLPPTAYAALRSA----FRQFMMKY 387
Query: 349 PMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPS 405
P+ L CY + +++ P + F FAGG + L + Y ES+ CLA +
Sbjct: 388 PVAYGTRLLDTCYDFSGYKEIS-VPRIDFEFAGGVKVELPLVGILYGESAQQLCLAFA-A 445
Query: 406 DINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ NG D++I G + Q+ V YD+ ++ F C
Sbjct: 446 NGNGN---DITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 481
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 121/436 (27%), Positives = 185/436 (42%), Gaps = 66/436 (15%)
Query: 36 KRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIST 95
K LV LHRDS R ++ + L+ S + + P +
Sbjct: 99 KALVLSRLHRDS-------------SRVQAITTRLQLILNGVSKSDLKPLQTEIQPQDLS 145
Query: 96 VPV----------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSK 142
PV ++ +G P VLDTGS + W++CQPC C + F P+
Sbjct: 146 TPVSSGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAA 205
Query: 143 SLTYATLPCDSSYCT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
S +Y+ L CDS C + C +C Y + Y +G + G +E +F G
Sbjct: 206 SSSYSPLTCDSQQCNSLQMSSCRN--GQCRYQVNYGDGSFTFGDFVTETMSF----GGSG 259
Query: 199 FLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAY 257
+ + GC H+N F G GL SL ++ + FSYC+ +N A
Sbjct: 260 TVNSIALGCGHDNEGL----FVGAAGLLGLGGGPLSLTSQLKATSFSYCL--VNRDSAAS 313
Query: 258 NMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ L + + P+ S ID YYV L G+S+G ++L I +FK +D+ D GV
Sbjct: 314 STLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDS-GDGGV 372
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG-----F 369
+D GT +T L AY +LR + + L + + + CY DL G
Sbjct: 373 IVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGV-ALFDTCY------DLSGQSSVKV 425
Query: 370 PAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
P ++FHF GG L A + +S+ +C A P+ LSIIG + QQ V
Sbjct: 426 PTVSFHFDGGKSWDLPAANYLIPVDSAGTYCFAFAPTT------SSLSIIGNVQQQGTRV 479
Query: 429 AYDLVSKQLYFQRIDC 444
++DL + ++ F C
Sbjct: 480 SFDLANNRVGFSTNKC 495
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 118/405 (29%), Positives = 179/405 (44%), Gaps = 43/405 (10%)
Query: 58 AQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLD 117
A++ + AR S S D + LHP + ++ S+G P A+ D
Sbjct: 17 AKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGG---YVMDISVGTPGKRFRAIAD 73
Query: 118 TGSSLIWVKCQPCEQC-GATTFDPSKSLTYATLPCDSSYCT---NDCGGYPDECWYNIRY 173
TGS L+WV+ +PC C G T FDP +S T+ + C S CT C C Y+ Y
Sbjct: 74 TGSDLVWVQSEPCTGCSGGTIFDPRQSSTFREMDCSSQLCTELPGSCEPGSSACSYSYEY 133
Query: 174 TNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPA-TSST 232
+G +++G + + T+ G GC N+ F + G+ GLG S T
Sbjct: 134 GSG-ETEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGF--DGVDGLVGLGQGPVSLT 190
Query: 233 HSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS------TPMSVIDGSYY-VT 285
L + SKFSYC+ ++N + + L+ G A L G TP S +YY +T
Sbjct: 191 SQLSAAIDSKFSYCLVDINS-QSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLT 249
Query: 286 LEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL 345
+ GI++ + + S IDSGTTLT++ Y + +E + L
Sbjct: 250 VNGIAVAGQTMG------------SPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVT--L 295
Query: 346 PSYPMDP-AWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVF--YQESSSVFCLAV 402
P LCY + NR+ + FPA+ A GA + + + F +S CLA+
Sbjct: 296 PRVDGSSMGLDLCYDRSSNRNYK-FPALTIRLA-GATMTPPSSNYFLVVDDSGDTVCLAM 353
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
G + +SIIG + QQ Y++ YD S +L F + CE L
Sbjct: 354 GSAG-----GLPVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 114/366 (31%), Positives = 156/366 (42%), Gaps = 49/366 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS WV+CQPC EQ FDP+ S TYA + C +
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQ-REKLFDPASSSTYANVSCAA 237
Query: 154 SYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C++ C G C Y ++Y +G S G + + D K F FGC
Sbjct: 238 PACSDLDVSGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFR----FGCGE 291
Query: 210 NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
N E G+ GLG TS K G F++C L L G G+
Sbjct: 292 RNDGLFGEA-AGLLGLGRGKTSLPVQTYGKYGGVFAHC---LPARSTGTGYLDFGAGSPP 347
Query: 269 EGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
+TPM +G YYV + GI +G ++L I P++F AG +DSGT +T L
Sbjct: 348 ATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAA------AGTIVDSGTVITRLP 401
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAFHFAG 378
P+AY +LR Y A L CY D G P ++ F G
Sbjct: 402 PAAYSSLRSAFAAAMA--ARGYRKAAAVSLLDTCY------DFTGMSQVAIPTVSLLFQG 453
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
GA L +DA + Y S+S CLA ++ G D+ I+G + + VAYD+ K +
Sbjct: 454 GAALDVDASGIMYTVSASQVCLAFAGNEDGG----DVGIVGNTQLKTFGVAYDIGKKVVG 509
Query: 439 FQRIDC 444
F C
Sbjct: 510 FSPGAC 515
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 155/370 (41%), Gaps = 40/370 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P P L VLDTGS ++W++C PC +C FDP S +Y + C +
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPL 206
Query: 156 CTN-DCGG---YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C D GG C Y + Y +G + G +E F + + V GC H+N
Sbjct: 207 CRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGAR----VPRVALGCGHDN 262
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGN----LNYFEYAYNMLILGEGAI 267
+ + S + + G FSYC+ + + + G GA+
Sbjct: 263 EGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGSGAV 322
Query: 268 ---LEGDSTPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
TPM ++ YYV L GIS+ G ++ + + + + + GV +DSGT
Sbjct: 323 GPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRGGVIVDSGT 382
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFH 375
++T L AY LR GL S + CY DL G P ++ H
Sbjct: 383 SVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCY------DLSGLKVVKVPTVSMH 436
Query: 376 FAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
FAGGA+ L E+ +S FC A +D +SIIG I QQ + V +D
Sbjct: 437 FAGGAEAALPPENYLIPVDSRGTFCFAFAGTD------GGVSIIGNIQQQGFRVVFDGDG 490
Query: 435 KQLYFQRIDC 444
++L F C
Sbjct: 491 QRLGFVPKGC 500
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 113/377 (29%), Positives = 169/377 (44%), Gaps = 58/377 (15%)
Query: 80 QKAHDTRAHLHPGISTVPVF--YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT 137
Q A +R+ P V F V S+G P V Q +DTGS + WV+C+PC +
Sbjct: 122 QLATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNS 181
Query: 138 -----FDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQF 188
FDP+KS TY+ +PC + C+ + G +C Y + Y +G ++ G GS+
Sbjct: 182 QRDQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTL 241
Query: 189 NFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFS 244
+ TFL FGC H A F G+ GL + SL + G FS
Sbjct: 242 ALAPGNTVGTFL----FGCGHAQAGM----FAGIDGLLALGRQSMSLKSQAAGAYGGVFS 293
Query: 245 YCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDP 300
YC L + A L LG G +T + + Y V L GIS+G + + +
Sbjct: 294 YC---LPSKQSAAGYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPA 350
Query: 301 NLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWH 355
+ F G +D+GT +T L P+AY LR F+G + PS P +
Sbjct: 351 SAFA-------GGTVVDTGTVITRLPPTAYAALRSA----FRGAIAPYGYPSAPANGILD 399
Query: 356 LCYSGNINR-DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKD 414
CY + +R + P +A F+GGA L L+A + S CLA P+ +G D
Sbjct: 400 TCY--DFSRYGVVTLPTVALTFSGGATLALEAPGIL-----SSGCLAFAPNGGDG----D 448
Query: 415 LSIIGMIAQQNYNVAYD 431
+I+G + Q+++ V +D
Sbjct: 449 AAILGNVQQRSFAVRFD 465
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 151/365 (41%), Gaps = 44/365 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC---- 151
+ V+ +G P V DTGS L WV+C PC C FDP++S TY+ +PC
Sbjct: 146 YVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCASPE 205
Query: 152 ----DSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
DS C+ D +C Y + Y + + G + + SD F+ FGC
Sbjct: 206 CQGLDSRSCSRD-----KKCRYEVVYGDQSQTDGALARDTLTLTQSDVLPGFV----FGC 256
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA 266
+ G+ GLG S S K G+ FSYC L A L LG A
Sbjct: 257 GEQDTGLFGRA-DGLVGLGREKVSLSSQAASKYGAGFSYC---LPSSPSAAGYLSLGGPA 312
Query: 267 ILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
T M S YYV L G+ + + + + P +F S AG IDSGT +T
Sbjct: 313 PANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVF------SAAGTVIDSGTVIT 366
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGA 380
L P Y LR Y PA + CY + ++ P++A FAGGA
Sbjct: 367 RLPPRVYAALRSAFARSMGRY--GYKRAPALSILDTCYDFTGHTTVR-IPSVALVFAGGA 423
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ LD V Y S CLA P NG+ D IIG Q+ V YD+ +++ F
Sbjct: 424 AVGLDFSGVLYVAKVSQACLAFAP---NGD-GADAGIIGNTQQKTLAVVYDVARQKIGFG 479
Query: 441 RIDCE 445
C
Sbjct: 480 ANGCS 484
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 117/405 (28%), Positives = 179/405 (44%), Gaps = 43/405 (10%)
Query: 58 AQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLD 117
A++ + AR S S D + LHP + ++ S+G P A+ D
Sbjct: 17 AKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGG---YVMDISVGTPGKRFRAIAD 73
Query: 118 TGSSLIWVKCQPCEQC-GATTFDPSKSLTYATLPCDSSYCT---NDCGGYPDECWYNIRY 173
TGS L+WV+ +PC C G T FDP +S T+ + C S C C C Y+ Y
Sbjct: 74 TGSDLVWVQSEPCTGCSGGTIFDPRQSSTFREMDCSSQLCAELPGSCEPGSSTCSYSYEY 133
Query: 174 TNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPA-TSST 232
+G +++G + + T+ +G GC N+ F + G+ GLG S T
Sbjct: 134 GSG-ETEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGF--DGVDGLVGLGQGPVSLT 190
Query: 233 HSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS------TPMSVIDGSYY-VT 285
L + SKFSYC+ ++N + + L+ G A L G TP S +YY +T
Sbjct: 191 SQLSAAIDSKFSYCLVDINS-QSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLT 249
Query: 286 LEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL 345
+ GI++ + + S IDSGTTLT++ Y + +E + L
Sbjct: 250 VNGIAVAGQTMG------------SPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVT--L 295
Query: 346 PSYPMDP-AWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVF--YQESSSVFCLAV 402
P LCY + NR+ + FPA+ A GA + + + F +S CLA+
Sbjct: 296 PRVDGSSMGLDLCYDRSSNRNYK-FPALTIRLA-GATMTPPSSNYFLVVDDSGDTVCLAM 353
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
G + +SIIG + QQ Y++ YD S +L F + CE L
Sbjct: 354 GSAS-----GLPVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 125/433 (28%), Positives = 181/433 (41%), Gaps = 61/433 (14%)
Query: 34 KPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGI 93
K + +LL RD L QR M+ A + S+ + L +
Sbjct: 70 KKRPTEEELLKRDQLRAE-------HIQRKFAMNAAVDGAGDLQQSKVSSSVPTKLGSSL 122
Query: 94 STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQ--CGATT---FDPSKSLTYAT 148
T+ + ++ +G P V Q +DTGS + WV+C PC C A T FDP+KS TY
Sbjct: 123 DTLE-YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRA 181
Query: 149 LPCDSSYCT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLY 201
+ C ++ C N CG EC Y ++Y +G + GT + SD K F
Sbjct: 182 VSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQ- 240
Query: 202 DVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCI----GNLNYFEYA 256
FGCSH + FSD Q G+ GLG S S G+ FSYC+ G+ +
Sbjct: 241 ---FGCSHLESGFSD-QTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLG 296
Query: 257 YNMLILGE--GAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
G +L P Y L+ I++G K L + P++F AG
Sbjct: 297 GGGGASGFVTTRMLRSKQIPT-----FYGARLQDIAVGGKQLGLSPSVFA-------AGS 344
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPA 371
+DSGT +T L P+AY L F+ + Y PA + C+ + P
Sbjct: 345 VVDSGTIITRLPPTAYSAL----SSAFKAGMKQYRSAPARSILDTCFDFAGQTQIS-IPT 399
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+A F+GGA + LD + Y CLA + +G IIG + Q+ + V YD
Sbjct: 400 VALVFSGGAAIDLDPNGIMYGN-----CLAFAATGDDGT----TGIIGNVQQRTFEVLYD 450
Query: 432 LVSKQLYFQRIDC 444
+ S L F+ C
Sbjct: 451 VGSSTLGFRSGAC 463
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 114/366 (31%), Positives = 157/366 (42%), Gaps = 49/366 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS WV+CQPC EQ FDP+ S TYA + C +
Sbjct: 183 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQ-REKLFDPASSSTYANVSCAA 241
Query: 154 SYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C++ C G C Y ++Y +G S G + + D K F FGC
Sbjct: 242 PACSDLDVSGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFR----FGCGE 295
Query: 210 NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
N E G+ GLG TS K G F++C L L G G+
Sbjct: 296 RNDGLFGEA-AGLLGLGRGKTSLPVQTYGKYGGVFAHC---LPARSTGTGYLDFGAGSPP 351
Query: 269 EGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
+TPM +G YYV + GI +G ++L I P++F + AG +DSGT +T L
Sbjct: 352 ATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVF------AAAGTIVDSGTVITRLP 405
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAFHFAG 378
P+AY +LR Y A L CY D G P ++ F G
Sbjct: 406 PAAYSSLRSAFAAAMA--ARGYRKAAAVSLLDTCY------DFTGMSQVAIPTVSLLFQG 457
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
GA L +DA + Y S+S CLA ++ G D+ I+G + + VAYD+ K +
Sbjct: 458 GAALDVDASGIMYTVSASQVCLAFAGNEDGG----DVGIVGNTQLKTFGVAYDIGKKVVG 513
Query: 439 FQRIDC 444
F C
Sbjct: 514 FSPGAC 519
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/373 (27%), Positives = 163/373 (43%), Gaps = 38/373 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQ--------CGATTFDPSKSLTYATLP 150
++V F +G P P + V DTGS L WVKC+ F P+ S ++A +P
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSWAPIP 169
Query: 151 CDSSYCTN-------DCGG---YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEG---K 197
C S C + +C P C Y+ RY + ++G +G++ S G K
Sbjct: 170 CSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSGSDRK 229
Query: 198 TFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYA 256
L +V GC+ + S + GV LG + S S + G +FSYC+ + A
Sbjct: 230 AKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 289
Query: 257 YNMLILGE-GAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ L G GA TP+ ++D Y VT++ +S+ K L+I ++ D +
Sbjct: 290 TSYLTFGPVGAAHSPSRTPL-LLDAQVAPFYAVTVDAVSVAGKALNIPAEVW---DVKKN 345
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPA 371
G +DSGT+LT L AY+ + + +P MDP + CY+ R P
Sbjct: 346 GGAILDSGTSLTILATPAYKAVVAALSKQL-ARVPRVTMDP-FEYCYNWTATRRPPAVPR 403
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+ FAG A L +S + V C+ + + +S+IG I QQ + +D
Sbjct: 404 LEVRFAGSARLRPPTKSYVIDAAPGVKCIG-----LQEGVWPGVSVIGNILQQEHLWEFD 458
Query: 432 LVSKQLYFQRIDC 444
L ++ L FQ C
Sbjct: 459 LANRWLRFQESRC 471
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 167/365 (45%), Gaps = 59/365 (16%)
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC--------------T 157
++DT S L WV+C PCE C FDPS S +YA +PCDS C
Sbjct: 157 IVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGA 216
Query: 158 NDC-GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD 216
C G P C Y + Y +G S+G + ++ + + FGC +N
Sbjct: 217 PPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSL-----AGEVIDGFVFGCGTSNQGPPF 271
Query: 217 EQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM 275
+G+ GLG + S S V++ G FSYC+ L+ A L+LG+ +STP+
Sbjct: 272 GGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCL-PLSRESDASGSLVLGDDPSAYRNSTPV 330
Query: 276 ---SVIDGS--------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
S++ S Y V L GI++G + ++ T A +DSGT +T
Sbjct: 331 VYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVE---------STGFSARAIVDSGTVITS 381
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGAD 381
LVPS Y +R E F L YP P + + C++ +++Q P++ F GGA+
Sbjct: 382 LVPSVYNAVRAE----FMSQLAEYPQAPGFSILDTCFNMTGLKEVQ-VPSLTLVFDGGAE 436
Query: 382 LVLDAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
+ +D+ V Y SS CLAV + + E + SIIG Q+N V +D + Q+ F
Sbjct: 437 VEVDSGGVLYFVSSDSSQVCLAV--ASLKSE--DETSIIGNYQQKNLRVVFDTSASQVGF 492
Query: 440 QRIDC 444
+ C
Sbjct: 493 AQETC 497
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 166/382 (43%), Gaps = 36/382 (9%)
Query: 76 QKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA 135
Q S+ K AH + T + V+ +G P L V DTGS L WV+C+PC C
Sbjct: 166 QSSASKGVSLPAHRGLRLGTAN-YIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYK 224
Query: 136 T---TFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF-E 191
FDPS+S TY+ +PC + C + +C Y + Y + + G + +
Sbjct: 225 QHDPLFDPSQSTTYSAVPCGAQECLDSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTLGP 284
Query: 192 TSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNL 250
+SD+ + F+ FGC ++ G+FGLG S S + G+ FSYC L
Sbjct: 285 SSDQLQGFV----FGCGDDDTGLFGRA-DGLFGLGRDRVSLASQAAARYGAGFSYC---L 336
Query: 251 NYFEYAYNMLILGEGAI-LEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKN 306
A L LG A T M + YY+ L GI + + + + P +FK
Sbjct: 337 PSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKA- 395
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNIN 363
G IDSGT +T L AY LR F G + Y PA + CY
Sbjct: 396 -----PGTVIDSGTVITRLPSRAYSALRSS----FAGFMRRYKRAPALSILDTCYDFTGR 446
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
+Q P++A F GGA L L V Y + S CLA NG+ + I+G + Q
Sbjct: 447 TKVQ-IPSVALLFDGGATLNLGFGGVLYVANRSQACLAFAS---NGDD-TSVGILGNMQQ 501
Query: 424 QNYNVAYDLVSKQLYFQRIDCE 445
+ + V YDL ++++ F C
Sbjct: 502 KTFAVVYDLANQKIGFGAKGCS 523
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 165/359 (45%), Gaps = 38/359 (10%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTN-DC 160
IG PP+ ++DTGS LIW++C PC C FDP KS TY + CDS C D
Sbjct: 74 IGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDSPLCHKLDT 133
Query: 161 GGYPDE--CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF-LYDVGFGCSHNNAHFSDE 217
G E C Y Y + ++G + + F TS+ GK L FGC HNN ++
Sbjct: 134 GVCSPEKRCNYTYGYGDNSLTKGVLAQDTATF-TSNTGKPVSLSRFLFGCGHNNTGGFND 192
Query: 218 QFTGVFGLGPATSSTHSLVEKV--GSKFSYCIGNLNYFEYAYNMLILGEGAILEGD---S 272
G+ GLG +S S + + G KFS C+ + + G+G+ + G+ +
Sbjct: 193 HEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQVLGNGVVT 252
Query: 273 TPM--SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAY 330
TP+ D SY+VTL GIS ++ F N T A + +DSGT L Y
Sbjct: 253 TPLVPREKDTSYFVTLLGIS-------VEDTYFPMNSTIGKANMLVDSGTPPILLPQQLY 305
Query: 331 QTLRKEVEDLFQGLLPSYPMDPAW--HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAES 388
+ EV + + L DP+ LCY N L+G P + FHF GA+++L
Sbjct: 306 DKVFAEVRN--KVALKPITDDPSLGTQLCYRTQTN--LKG-PTLTFHFV-GANVLLTPIQ 359
Query: 389 VFY---QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
F ++ +FCLA I D + G AQ NY + +DL + + F+ DC
Sbjct: 360 TFIPPTPQTKGIFCLA-----IYNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPTDC 413
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/346 (32%), Positives = 153/346 (44%), Gaps = 37/346 (10%)
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDEC 167
VLDTGS + WV+CQPC C FDPS S +YA + CDS C T C C
Sbjct: 2 VLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGAC 61
Query: 168 WYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP 227
Y + Y +G + G +E S T + +V GC H+N F G GL
Sbjct: 62 LYEVAYGDGSYTVGDFATETLTLGDS----TPVGNVAIGCGHDNEGL----FVGAAGLLA 113
Query: 228 ATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS----Y 282
S ++ S FSYC+ ++ A + L G+GA G T V Y
Sbjct: 114 LGGGPLSFPSQISASTFSYCL--VDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFY 171
Query: 283 YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
YV L GIS+G + L I + F + T GV +DSGT +T L +AY LR D F
Sbjct: 172 YVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALR----DAFV 227
Query: 343 GLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVF 398
PS P L CY + +R PA++ F GG L L A++ + + +
Sbjct: 228 QGAPSLPRTSGVSLFDTCYDLS-DRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTY 286
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
CLA P++ +SIIG + QQ V++D + F C
Sbjct: 287 CLAFAPTN------AAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 114/366 (31%), Positives = 157/366 (42%), Gaps = 49/366 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS WV+CQPC EQ FDP+ S TYA + C +
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQ-REKLFDPASSSTYANVSCAA 238
Query: 154 SYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C++ C G C Y ++Y +G S G + + D K F FGC
Sbjct: 239 PACSDLDVSGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFR----FGCGE 292
Query: 210 NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
N E G+ GLG TS K G F++C L L G G+
Sbjct: 293 RNDGLFGEA-AGLLGLGRGKTSLPVQTYGKYGGVFAHC---LPPRSTGTGYLDFGAGSPP 348
Query: 269 EGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
+TPM +G YYV + GI +G ++L I P++F + AG +DSGT +T L
Sbjct: 349 ATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVF------AAAGTIVDSGTVITRLP 402
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAFHFAG 378
P+AY +LR Y A L CY D G P ++ F G
Sbjct: 403 PAAYSSLRSAFAAAMA--ARGYRKAAAVSLLDTCY------DFTGMSQVAIPTVSLLFQG 454
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
GA L +DA + Y S+S CLA ++ G D+ I+G + + VAYD+ K +
Sbjct: 455 GAALDVDASGIMYTVSASQVCLAFAGNEDGG----DVGIVGNTQLKTFGVAYDIGKKVVG 510
Query: 439 FQRIDC 444
F C
Sbjct: 511 FSPGAC 516
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 121/434 (27%), Positives = 182/434 (41%), Gaps = 61/434 (14%)
Query: 35 PKRLVTKLLHRDSLLYNPNDTVDAQ--------AQRTLNMSMARFIYLSQKSSQKAHDTR 86
P+ L + L RDS T+ AQ A R S + LSQ S +
Sbjct: 88 PQELFSSRLQRDSRRVRSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGE------ 141
Query: 87 AHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKS 143
++ +G P VLDTGS ++W++C PC +C + + FDP KS
Sbjct: 142 ------------YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 189
Query: 144 LTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
TYAT+PC S +C C C Y + Y +G + G +E F +
Sbjct: 190 KTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-----RNR 244
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNM 259
+ V GC H+N + S + KFSYC+ + + +
Sbjct: 245 VKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKP-SS 303
Query: 260 LILGEGAILE-GDSTPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGV 314
++ G A+ TP+ +D YYV L GIS+ G ++ + +LFK D + GV
Sbjct: 304 VVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKL-DQIGNGGV 362
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPA 371
IDSGT++T L+ AY +R D F+ + P + L C+ + N + P
Sbjct: 363 IIDSGTSVTRLIRPAYIAMR----DAFRVGAKTLKRAPNFSLFDTCFDLS-NMNEVKVPT 417
Query: 372 MAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+ HF AD+ L A + +++ FC A + LSIIG I QQ + V Y
Sbjct: 418 VVLHFR-RADVSLPATNYLIPVDTNGKFCFAFAGT------MGGLSIIGNIQQQGFRVVY 470
Query: 431 DLVSKQLYFQRIDC 444
DL S ++ F C
Sbjct: 471 DLASSRVGFAPGGC 484
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 160/364 (43%), Gaps = 42/364 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ V G P L ++DTGS + W++C+PC C + F+P +S +Y L C SS
Sbjct: 138 YIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSYKHLSCLSSA 197
Query: 156 CT-----NDC--GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
CT N C GG C Y I Y +G SQG F+ ET G FGC
Sbjct: 198 CTELTTMNHCRLGG----CVYEINYGDGSRSQG-----DFSQETLTLGSDSFPSFAFGCG 248
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVE-KVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
H N G+ GLG S S + K G +FSYC+ + + +G+G+I
Sbjct: 249 HTNTGLFKGS-AGLLGLGRTALSFPSQTKSKYGGQFSYCLPDF-VSSTSTGSFSVGQGSI 306
Query: 268 LEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+T + ++ S Y+V L GIS+G + L I P + + G +DSGT +
Sbjct: 307 -PATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGR------GGTIVDSGTVI 359
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
T LVP AY L+ + L + P CY + ++ P + FHF AD+
Sbjct: 360 TRLVPQAYDALKTSFRSKTRNLPSAKPFS-ILDTCYDLSSYSQVR-IPTITFHFQNNADV 417
Query: 383 VLDAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ A + + Q S CLA + + +IIG QQ VA+D + ++ F
Sbjct: 418 AVSAVGILFTIQSDGSQVCLAFA----SASQSISTNIIGNFQQQRMRVAFDTGAGRIGFA 473
Query: 441 RIDC 444
C
Sbjct: 474 PGSC 477
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 123/425 (28%), Positives = 189/425 (44%), Gaps = 55/425 (12%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV------FYVNFSIGQPP 109
VDA+ T + R I LS++ + + TRA G + PV + + +G PP
Sbjct: 41 VDAKGNYTAPERVRRAIALSRQINLAS--TRAE--GGGVSAPVHWATRQYIAEYMVGDPP 96
Query: 110 VPQLAVLDTGSSLIWVKCQPCEQCGATT-----FDPSKSLTYATLPCDSSYCTND----C 160
A++DTGSSLIW +C C + F+ S S ++A +PC C + C
Sbjct: 97 QRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQDKACAGNYLHFC 156
Query: 161 GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFT 220
C + + Y G G +G++ F F++ G T + FGC + +
Sbjct: 157 -ALDGTCTFRVTYGAG-GIIGFLGTDAFTFQSG--GAT----LAFGCVSFTRFAAPDVLH 208
Query: 221 GVFGLGPATSSTHSLVEKVGSK-FSYCIGNLNYFEYAYNMLILGEGAILEGDS---TPMS 276
G GL SL + G+K FSYC+ + A + L +G A L G M+
Sbjct: 209 GASGLIGLGRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLFVGAAASLSGGGGAVMSMA 268
Query: 277 VIDGS--------YYVTLEGISLGEKMLDIDPNLF---KKNDTWSDAGVFIDSGTTLTWL 325
++ YY+ L GI++GE L I F + + + + GV IDSG+ T L
Sbjct: 269 FVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEGGVIIDSGSPFTSL 328
Query: 326 VPSAYQTLRKEVEDLFQGLL--PSYPMDPAWHLCYS-GNINRDLQGFPAMAFHFAGGADL 382
V AY+ L E+ G L P D LC + G+++R + P + HF+GGAD+
Sbjct: 329 VEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVV---PTLVLHFSGGADM 385
Query: 383 VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L E+ + S C+A+ R SIIG QQN ++ +D+ +L FQ
Sbjct: 386 ALPPENYWAPLEKSTACMAI-------VRGYLQSIIGNFQQQNMHILFDVGGGRLSFQNA 438
Query: 443 DCELL 447
DC +
Sbjct: 439 DCSTI 443
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 153/365 (41%), Gaps = 30/365 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P P L VLDTGS ++W++C PC +C FDP +S +Y + C +
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCAAPL 199
Query: 156 CTN-DCGG---YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C D GG C Y + Y +G + G +E F G + V GC H+N
Sbjct: 200 CRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTF----AGGARVARVALGCGHDN 255
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLN-------YFEYAYNMLILGE 264
+ + S + + G FSYC+ + + + G
Sbjct: 256 EGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSRSSTVTFGP 315
Query: 265 GAILEGDSTPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
+ TPM ++ YYV L GIS+ G ++ + + + + + GV +DSGT
Sbjct: 316 PSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGT 375
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
++T L +Y LR GL S + CY R + P ++ HFAGGA
Sbjct: 376 SVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLG-GRKVVKVPTVSMHFAGGA 434
Query: 381 DLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
+ L E+ +S FC A +D +SIIG I QQ + V +D +++ F
Sbjct: 435 EAALPPENYLIPVDSRGTFCFAFAGTD------GGVSIIGNIQQQGFRVVFDGDGQRVGF 488
Query: 440 QRIDC 444
C
Sbjct: 489 APKGC 493
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 165/359 (45%), Gaps = 42/359 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++V +G PP Q V+D+GS ++WV+C+PC QC T FDP+ S ++ + C S+
Sbjct: 43 YFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAV 102
Query: 156 C--TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH 213
C + G C Y + Y +G ++GT+ E F G+T + +V GC H+N
Sbjct: 103 CDRVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTF-----GRTVVRNVAIGCGHSNRG 157
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLN-YFEYAYNMLILGEGAI- 267
+ G + S L + G+ FSYC+ N N + E+ + +G I
Sbjct: 158 MFVGAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSEAMPVGAAWIP 217
Query: 268 -LEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
+ P YY+ L G+ +G+ + + ++F+ N+ S GV +D+GT +T
Sbjct: 218 LVRNPRAP-----SFYYIRLLGLGVGDTRVPVSEDVFQLNELGS-GGVVMDTGTAVTRFP 271
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFHFAGGAD 381
AY+ R + Q LP + CY +L GF P ++F+F+GG
Sbjct: 272 TVAYEAFRNAFIEQTQN-LPRASGVSIFDTCY------NLFGFLSVRVPTVSFYFSGGPI 324
Query: 382 LVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
L + A + + + FC A PS LSI+G I Q+ ++ D ++ + F
Sbjct: 325 LTIPANNFLIPVDDAGTFCFAFAPSP------SGLSILGNIQQEGIQISVDEANEFVGF 377
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 161/351 (45%), Gaps = 42/351 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++V +G PP Q V+D+GS ++WV+C+PC QC T FDP+ S ++ + C S+
Sbjct: 43 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAV 102
Query: 156 C--TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH 213
C ++ G C Y + Y +G ++GT+ ET G+T + +V GC H N
Sbjct: 103 CDQVDNAGCNSGRCRYEVSYGDGSSTKGTLA-----LETLTLGRTVVQNVAIGCGHMNQG 157
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYC----IGNLN-YFEYAYNMLILGEGAI- 267
+ G + S L + G+ FSYC + N N + E+ + +G I
Sbjct: 158 MFVGAAGLLGLGGGSMSFVGQLSRERGNAFSYCLVSRVTNSNGFLEFGSEAMPVGAAWIP 217
Query: 268 -LEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
+ +P YY+ L G+ +G+ + I ++F+ + + GV +D+GT +T
Sbjct: 218 LIRNPHSP-----SYYYIGLSGLGVGDMKVPISEDIFELTE-LGNGGVVMDTGTAVTRFP 271
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFHFAGGAD 381
AY+ R D G LP + CY +L GF P ++F+F+GG
Sbjct: 272 TVAYEAFRDAFIDQ-TGNLPRASGVSIFDTCY------NLFGFLSVRVPTVSFYFSGGPI 324
Query: 382 LVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
L L A + + + FC A PS LSI+G I Q+ ++ D
Sbjct: 325 LTLPANNFLIPVDDAGTFCFAFAPSP------SGLSILGNIQQEGIQISVD 369
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 104/408 (25%), Positives = 175/408 (42%), Gaps = 57/408 (13%)
Query: 78 SSQKAHDTRAHL--------------HPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLI 123
S+ K HD R H HP + +++ +G PP +DTGS ++
Sbjct: 49 SALKQHDARRHRRILSAVDLPLGGNGHP--AEAGLYFAKIGLGNPPKDYYVQVDTGSDIL 106
Query: 124 WVKCQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGYPD------ECWY 169
WV C C++C T +DP S + + CD +C G C Y
Sbjct: 107 WVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRIYCDDDFCAATYNGVLQGCTKDLPCQY 166
Query: 170 NIRYTNGPDSQGTIGSE--QF-----NFETSDEGKTFLYDVGFGCSHNNAH---FSDEQF 219
++ Y +G + G + QF N +TS + + FGC + S E
Sbjct: 167 SVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSSANGSVI----FGCGAKQSGELGTSSEAL 222
Query: 220 TGVFGLGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMS 276
G+ G G A SS S + KV F++C+ N+ + +GE + ++TPM
Sbjct: 223 DGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVK----GGGIFAIGEVVSPKVNTTPMV 278
Query: 277 VIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKE 336
Y V ++ I +G +L++ ++F DT G IDSGTTL +L Y+++ +
Sbjct: 279 PNQPHYNVVMKEIEVGGNVLELPTDIF---DTGDRRGTIIDSGTTLAYLPEVVYESMMTK 335
Query: 337 VEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS 396
+ GL + Y+GN+N +GFP + FHF G L ++ +Q
Sbjct: 336 IVSEQPGLKLHTVEEQFTCFQYTGNVN---EGFPVVKFHFNGSLSLTVNPHDYLFQIHEE 392
Query: 397 VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
V+C S + + +D++++G + N V YDL ++ + + +C
Sbjct: 393 VWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDLENQAIGWTDYNC 440
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 173/374 (46%), Gaps = 54/374 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ NF+IG PP P AV+D L+W +C+ C +C G FDP+ S TY PC +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTPL 110
Query: 156 C------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C +C G + C Y TN D+ G +G++ F T+ + FGC
Sbjct: 111 CESIPSDVRNCSG--NVCAYEAS-TNAGDTGGKVGTDTFAVGTAKA------SLAFGCVV 161
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAIL 268
+ + +G+ GLG + SLV + G + FSYC+ + + + L LG A L
Sbjct: 162 ASDIDTMGGPSGIVGLG---RTPWSLVTQTGVAAFSYCLAPHDAGK--NSALFLGSSAKL 216
Query: 269 EGD----STPMSVIDGS-------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
G STP I G+ Y V LEG+ G+ M+ + P S + V +D
Sbjct: 217 AGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPP---------SGSTVLLD 267
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
+ + +++LV AYQ ++K V + P++P + LC+ + P + F F
Sbjct: 268 TFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEP-FDLCFPKSGASGAA--PDLVFTFR 324
Query: 378 GGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK---DLSIIGMIAQQNYNVAYDLVS 434
GGA + + A + + CLA+ ++ R +LS++G + Q+N + +DL
Sbjct: 325 GGAAMTVPATNYLLDYKNGTVCLAM----LSSARLNSTTELSLLGSLQQENIHFLFDLDK 380
Query: 435 KQLYFQRIDCELLA 448
+ L F+ DC L+
Sbjct: 381 ETLSFEPADCTKLS 394
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 106/334 (31%), Positives = 153/334 (45%), Gaps = 46/334 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG---ATTFDPSKSLTYATLPCDSSY 155
+ + FSIG+PP+ A +DTGS L+WVKC PC C + +DP++S + LPC S
Sbjct: 87 YIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQL 146
Query: 156 C---------TNDCGGYPDECWYNIRYTNGPD--SQGTIGSEQFNFETSDEGKTFLY-DV 203
C ++ C P C Y+ Y + D +QG +G+E F F G ++ +V
Sbjct: 147 CQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTF-----GDGYVANNV 201
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLIL 262
FG S QF G GL SLV ++G+ +F+YC L Y+ ++
Sbjct: 202 SFGRSDT---IDGSQFGGTAGLVGLGRGHLSLVSQLGAGRFAYC---LAADPNVYSTILF 255
Query: 263 GEGAILE---GD--STPMSV-----IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
G A L+ GD STP+ D YYV L+GIS+G L I F N S
Sbjct: 256 GSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGS-G 314
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAM 372
GVF DSG T L +AYQ +R+ + Q L D C+ + + P +
Sbjct: 315 GVFFDSGAIDTSLKDAAYQVVRQAITSEIQRL----GYDAGDDTCFVAANQQAVAQMPPL 370
Query: 373 AFHFAGGADLVLDAESVFYQE----SSSVFCLAV 402
HF GAD+ L+ + S + C+A+
Sbjct: 371 VLHFDDGADMSLNGRNYLKTSTKGPSEVLVCMAI 404
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 117/406 (28%), Positives = 182/406 (44%), Gaps = 46/406 (11%)
Query: 62 RTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSS 121
+ S R + + K S A ++ P + + + ++G PP ++DTGS
Sbjct: 2 EAVQRSHERVAFYTLKLSPDAFGSQEFQSPVKAGNGEYLMTLTLGSPPQSFDVIVDTGSD 61
Query: 122 LIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT------NDCGGYPDECWYNIR 172
L WV+C PC C FDPSKS ++ C + C C + C Y
Sbjct: 62 LNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDNLCNVSALPLKACAA--NVCQYQYT 119
Query: 173 YTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGVFGLGPATSS 231
Y + ++ G + E + + G + + FGC + N F+ G+ GLG S
Sbjct: 120 YGDQSNTNGDLAFETISLN-NGAGTQSVPNFAFGCGTQNLGTFAGA--AGLVGLGQGPLS 176
Query: 232 THS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS----YYVTL 286
+S L +KFSYC+ +LN + + L G A V++ YYV L
Sbjct: 177 LNSQLSHTFANKFSYCLVSLN--SLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQL 234
Query: 287 EGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP 346
I +G + L++ P++F + + G IDSGTT+T L AY + + E
Sbjct: 235 NSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFV----- 289
Query: 347 SYP-MDPAWH---LCY--SGNINRDLQGFPAMAFHFAGGADLVLDAESVF--YQESSSVF 398
+YP +D + + LC+ +G N + P M F F GAD + E++F S++
Sbjct: 290 NYPRLDGSAYGLDLCFNIAGVSNPSV---PDMVFKFQ-GADFQMRGENLFVLVDTSATTL 345
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
CLA+G S + SIIG I QQN+ V YDL +K++ F DC
Sbjct: 346 CLAMGGS-------QGFSIIGNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 166/361 (45%), Gaps = 34/361 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G PP VLDTGS ++W++C PC +C + T FDP KS +++++ C S
Sbjct: 147 YFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPL 206
Query: 156 CTN-DCGG--YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
C D G C Y + Y +G + G +E F T + V GC H+N
Sbjct: 207 CLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFR-----GTRVPKVALGCGHDNE 261
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS 272
+ S + G KFSYC+ + + + ++ G+ A+
Sbjct: 262 GLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSA-SSKPSSVVFGQSAVSRTAV 320
Query: 273 -TPMSV---IDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
TP+ +D YY+ L GIS+ G ++ I +LFK DT + GV IDSGT++T L
Sbjct: 321 FTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKL-DTAGNGGVIIDSGTSVTRLTR 379
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVL 384
AY +LR D F+ P + L C+ + +++ P + HF GAD+ L
Sbjct: 380 RAYVSLR----DAFRAGAADLKRAPDYSLFDTCFDLSGKTEVK-VPTVVMHFR-GADVSL 433
Query: 385 DAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
A + +++ VFC A + LSIIG I QQ + V +D+ + ++ F
Sbjct: 434 PATNYLIPVDTNGVFCFAFAGT------MSGLSIIGNIQQQGFRVVFDVAASRIGFAARG 487
Query: 444 C 444
C
Sbjct: 488 C 488
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/378 (27%), Positives = 167/378 (44%), Gaps = 37/378 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG----ATTFDPSKSLTYATLPCDSS 154
++V+ +G PP L V DTGS L+WVKC C C ++ F P S +++ C
Sbjct: 88 YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDP 147
Query: 155 YCT-------NDCG--GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
+C + C C + Y +G S G E ++ + L + F
Sbjct: 148 HCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSF 207
Query: 206 GCSH--NNAHFSDEQFT---GVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNM 259
GC + S QF GV GLG + S S L + G+KFSYC+ + +
Sbjct: 208 GCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSF 267
Query: 260 LILGEGA-------ILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTW 309
L++G G + TP+ + S YY+T+ I++ L I+P +++ D
Sbjct: 268 LMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEI-DEQ 326
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP-SYPMDPAWHLCYSGNINRDLQG 368
+ G +DSGTTLT+L +AY+ + K V + LP + + P + LC + +
Sbjct: 327 GNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK--LPNAAELTPGFDLCVNASGESRRPS 384
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
P + F GGA + F + V CLA+ + +G F S+IG + QQ + +
Sbjct: 385 LPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVE-SGNGF---SVIGNLMQQGFLL 440
Query: 429 AYDLVSKQLYFQRIDCEL 446
+D +L F R C L
Sbjct: 441 EFDKEESRLGFTRRGCGL 458
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 118/393 (30%), Positives = 171/393 (43%), Gaps = 54/393 (13%)
Query: 81 KAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATT 137
+A T+ L GI+ + Y+ ++G ++DTGS L WV+C+PC C
Sbjct: 46 EASQTQIPLSSGINLQTLNYI-VTMGLGSTNMTVIIDTGSDLTWVQCEPCMSCYNQQGPI 104
Query: 138 FDPSKSLTYATLPCDSSYC---------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQF 188
F PS S +Y ++ C+SS C T CG P C Y + Y +G + G +G EQ
Sbjct: 105 FKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQL 164
Query: 189 NFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFS 244
+F G + D FGC NN F GV GL S SLV + G FS
Sbjct: 165 SF-----GGVSVSDFVFGCGRNNKGL----FGGVSGLMGLGRSYLSLVSQTNATFGGVFS 215
Query: 245 YCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV--------IDGSYYVTLEGISLGEKML 296
YC+ A L++G + + + TP++ + Y + L GI + L
Sbjct: 216 YCLPTTE--SGASGSLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVAL 273
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL 356
+ ++ + GV IDSGT +T L S Y+ L+ F G PS P
Sbjct: 274 QV--------PSFGNGGVLIDSGTVITRLPSSVYKALKALFLKQFTG-FPSAPGFSILDT 324
Query: 357 CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY--QESSSVFCLAVGP-SDINGERFK 413
C++ D P ++ HF G A+L +DA FY +E +S CLA+ SD
Sbjct: 325 CFN-LTGYDEVSIPTISMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDA-----Y 378
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
D +IIG Q+N V YD ++ F C
Sbjct: 379 DTAIIGNYQQRNQRVIYDTKQSKVGFAEESCSF 411
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 126/439 (28%), Positives = 190/439 (43%), Gaps = 65/439 (14%)
Query: 29 APAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTR-A 87
+P K + + LHRD L A QR + Q++H T
Sbjct: 70 SPLPTKKMPTLEERLHRDQLRA-------AYIQRKFSGGGVNGSRGGAGDVQQSHATVPT 122
Query: 88 HLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSL 144
L + T+ + + +G P Q ++DTGS + WV+C+PC QC + FDPS S
Sbjct: 123 TLGTSLDTLE-YLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSS 181
Query: 145 TYATLPCDSSYCT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
TY+ C S+ C N C +C Y + Y +G + GT S+ G
Sbjct: 182 TYSPFSCSSAACAQLGQEGNGCS--SSQCQYTVTYGDGSSTTGTYSSDTLAL-----GSN 234
Query: 199 FLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFE 254
+ FGCS+ + F+D Q G+ GLG SLV + G+ FSYC L
Sbjct: 235 AVRKFQFGCSNVESGFND-QTDGLMGLG---GGAQSLVSQTAGTFGAAFSYC---LPATS 287
Query: 255 YAYNMLILGEGAILEG-DSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
+ L LG G G TPM S + Y V ++ I +G + L I ++F
Sbjct: 288 SSSGFLTLGAGT--SGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVF------- 338
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLC-----YSGNINRD 365
AG +DSGT LT L P+AY L F+ + YP P + +SG +
Sbjct: 339 SAGTIMDSGTVLTRLPPTAYSAL----SSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVS 394
Query: 366 LQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
+ P +A F+GGA + + ++ + Q S+S+ CLA + + L IIG + Q+
Sbjct: 395 I---PTVALVFSGGAVVDIASDGIMLQTSNSILCLAFAANSDD----SSLGIIGNVQQRT 447
Query: 426 YNVAYDLVSKQLYFQRIDC 444
+ V YD+ + F+ C
Sbjct: 448 FEVLYDVGGGAVGFKAGAC 466
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 163/371 (43%), Gaps = 47/371 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+++ +G P VLDTGS ++W++C PC+ C F+P+KS T+AT+PC S
Sbjct: 136 YFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRL 195
Query: 156 C-----TNDCGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C +++C + C Y + Y +G + G +E F + + V GC H
Sbjct: 196 CRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGAR-----VDHVALGCGH 250
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNM---LILGEGA 266
+N + S + KFSYC+ + + ++ G GA
Sbjct: 251 DNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGA 310
Query: 267 ILEGDS-TPMSV---IDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
+ + TP+ +D YY+ L GIS+ G ++ + + FK D + GV IDSGT+
Sbjct: 311 VPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKL-DATGNGGVIIDSGTS 369
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMA 373
+T L SAY LR D F+ P++ L C+ DL G P +
Sbjct: 370 VTRLTQSAYVALR----DAFRLGATRLKRAPSYSLFDTCF------DLSGMTTVKVPTVV 419
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
FHF GG + + + + FC A + LSIIG I QQ + VAYDLV
Sbjct: 420 FHFTGGEVSLPASNYLIPVNNQGRFCFAFAGT------MGSLSIIGNIQQQGFRVAYDLV 473
Query: 434 SKQLYFQRIDC 444
++ F C
Sbjct: 474 GSRVGFLSRAC 484
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 118/471 (25%), Positives = 194/471 (41%), Gaps = 63/471 (13%)
Query: 15 LPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYL 74
LP + T A A +PK L ++HR ++ + +R + +
Sbjct: 1 LPLRFLLVVLVTFTADATHRPKTLHIPVVHRGAVFPSRRGAPPGSLRRCRHAAPFTAQVA 60
Query: 75 SQKSSQKAHDTRAHLHPGISTVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC 130
S S D R P +S VP ++ ++G PP L V+DTGS LIW++C PC
Sbjct: 61 SFHSIAADDDDRLR-SPVMSGVPFDSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPC 119
Query: 131 EQCGATT---FDPSKSLTYATLPCDSSYCTN-----DCGGYPDECWYNIRYTNGPDSQGT 182
C +DP S T+ +PC S C + C C Y + Y +G S G
Sbjct: 120 RHCYRQVTPLYDPRSSSTHRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGD 179
Query: 183 IGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS-THSLVEKVGS 241
+ +++ F T +++V GC H+N E G+ G+G S L G
Sbjct: 180 LATDRLVFPD----DTHVHNVTLGCGHDNVGLL-ESAAGLLGVGRGQLSFPTQLAPAYGH 234
Query: 242 KFSYCIGN-LNYFEYAYNMLILGEGAILEGDSTPMSVIDGS------YYVTLEGISL-GE 293
FSYC+G+ L+ + + L+ G E ST + + + YYV + G S+ GE
Sbjct: 235 VFSYCLGDRLSRAQNGSSYLVFGR--TPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGE 292
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVED------LFQGLLPS 347
++ N G+ +DSGT ++ AY +R + + L
Sbjct: 293 RVTGFSNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATK 352
Query: 348 YPMDPAWHLCYSGNINRDLQG---------FPAMAFHFAGGADLVLDAES----VFYQES 394
+ + + CY DL+G P++ HFAGGAD+ L + V +
Sbjct: 353 FSV---FDACY------DLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPVQGGDR 403
Query: 395 SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ FCL + +D L+++G + QQ + + +D+ ++ F C
Sbjct: 404 RTYFCLGLQAAD------DGLNVLGNVQQQGFGLVFDVERGRIGFTPNGCS 448
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 116/381 (30%), Positives = 168/381 (44%), Gaps = 60/381 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSS 154
+ V IG P + DTGS L WV+C+PC Q FDPSKS TY +PC +
Sbjct: 126 YVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGTP 185
Query: 155 YCTND------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
C CGG C Y+++Y + ++G + E F S V FGCS
Sbjct: 186 QCKIGGGQDLTCGG--TTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAA---GVVFGCS 240
Query: 209 H---NNAHFSDEQFT--GVFGLGPATSSTHSLVEK--VGSKFSYCIGNLNYFEYAYNMLI 261
H + ++E+ + G+ GLG SS S + G FSYC L + L
Sbjct: 241 HEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYC---LPPRGSSAGYLT 297
Query: 262 LGEGAILEGDS--TPM----SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
+G A + + TP+ S + Y V L GIS+ L ID + F G
Sbjct: 298 IGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFY-------IGTV 350
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL-----CYSGNINRDLQGFP 370
IDSGT +T + +AY LR E F+ + Y M P H+ CY D+ P
Sbjct: 351 IDSGTVITHMPAAAYYVLRDE----FRRHMGGYTMLPEGHVESLDTCYD-VTGHDVVTAP 405
Query: 371 AMAFHFAGGADLVLDAESVFY-------QESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
+A F GGA + +DA + +S ++ CLA P+++ G IIG + Q
Sbjct: 406 PVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPG-----FVIIGNMQQ 460
Query: 424 QNYNVAYDLVSKQLYFQRIDC 444
+ YNV +D+ +++ F C
Sbjct: 461 RAYNVVFDVEGRRIGFGANGC 481
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 115/369 (31%), Positives = 155/369 (42%), Gaps = 52/369 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS WV+C+PC EQ FDP++S T A + C +
Sbjct: 186 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQ-QEKLFDPARSSTDANISCAA 244
Query: 154 SYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C T C G C Y ++Y +G S G + + D K F FGC
Sbjct: 245 PACSDLYTKGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFR----FGCGE 298
Query: 210 NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG--- 265
N E G+ GLG TS +K G F++C L G G
Sbjct: 299 RNEGLFGEA-AGLLGLGRGKTSLPVQAYDKYGGVFAHC---FPARSSGTGYLDFGPGSSP 354
Query: 266 AILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
A+ +TPM V +G YYV L GI +G K+L I P++F + AG +DSGT +T
Sbjct: 355 AVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVF------TTAGTIVDSGTVIT 408
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAFH 375
L P+AY +LR Y PA L CY D G P ++
Sbjct: 409 RLPPAAYSSLRSAFASAIAAR--GYKKAPALSLLDTCY------DFTGMSQVAIPTVSLL 460
Query: 376 FAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
F GGA L +DA + Y S S CL + E D+ I+G + + V YD+ K
Sbjct: 461 FQGGASLDVDASGIIYAASVSQACLGFAAN----EEDDDVGIVGNTQLKTFGVVYDIGKK 516
Query: 436 QLYFQRIDC 444
+ F C
Sbjct: 517 VVGFSPGAC 525
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 122/408 (29%), Positives = 186/408 (45%), Gaps = 61/408 (14%)
Query: 64 LNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLI 123
L S AR Y+ ++S+ HL + ++ + V +G P V Q+ ++DTGS L
Sbjct: 86 LRRSRARSKYIMSRASKSNVSIPTHLGGSVDSLE-YVVTVGLGTPAVSQVLLIDTGSDLS 144
Query: 124 WVKCQPCEQCGATT--------FDPSKSLTYATLPCDSSYCTN--------DC---GGYP 164
WV+C PC +TT FDPS+S TYA +PC++ C + DC G
Sbjct: 145 WVQCAPCN---STTCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTRDGYGSDCTSGSGGG 201
Query: 165 DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFG 224
+C Y I Y +G + G +E T G T + D FGC H+ +D ++ G+ G
Sbjct: 202 AQCGYAITYGDGSQTTGVYSNETL---TMAPGVT-VKDFHFGCGHDQDGPND-KYDGLLG 256
Query: 225 LGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS----TPMSVID 279
LG A S V G FSYC+ N L GA + S TPM
Sbjct: 257 LGGAPESLVVQTSSVYGGAFSYCLPAAN-----DQAGFLALGAPVNDASGFVFTPMVREQ 311
Query: 280 GSYYVT-LEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE 338
++YV + GI++G + +D+ P+ F G+ IDSGT +T L +AY L+
Sbjct: 312 QTFYVVNMTGITVGGEPIDVPPSAFS-------GGMIIDSGTVVTELQHTAYAALQAA-- 362
Query: 339 DLFQGLLPSYPMDPAWHL--CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS 396
F+ + +YP+ P L CY+ + ++ P +A F+GGA + LD ++
Sbjct: 363 --FRKAMAAYPLLPNGELDTCYNFTGHSNVT-VPRVALTFSGGATVDLDVPDGILLDNCL 419
Query: 397 VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
F A GP + G I+G + Q+ V YD+ ++ F C
Sbjct: 420 AFQEA-GPDNQPG-------ILGNVNQRTLEVLYDVGHGRVGFGADAC 459
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 171/372 (45%), Gaps = 57/372 (15%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPC 151
P + + S+G P V Q+ +DTGS + WV+C PC + C + FDP+KS TY+ C
Sbjct: 128 PEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSC 187
Query: 152 DSSYCT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
S+ C N C C Y ++Y + ++ GT GS+ TSD K F F
Sbjct: 188 SSAQCAQLGGEGNGC--LNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSDAVKNFQ----F 241
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLI 261
GCSH F Q G+ GLG T SLV + G FSYC+ + A L
Sbjct: 242 GCSHRANGFVG-QLDGLMGLG---GDTESLVSQTAATYGKAFSYCLPPSS--SSAGGFLT 295
Query: 262 LGEGAILEGDS----TPMSV--IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
LG A S TP+ + Y V L+ I++ L++ ++F S A V
Sbjct: 296 LGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVF------SGASV- 348
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP-MDPAWHL--CYSGNINRDLQGFPAM 372
+DSGT +T L P+AYQ LR F+ + +YP P L C+ + + ++ P +
Sbjct: 349 VDSGTVITQLPPTAYQALRTA----FKKEMKAYPSAAPVGILDTCFDFSGIKTVR-VPVV 403
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
F+ GA + LD +FY CLA + +G D I+G + Q+ + + +D+
Sbjct: 404 TLTFSRGAVMDLDVSGIFYAG-----CLAFTATAQDG----DTGILGNVQQRTFEMLFDV 454
Query: 433 VSKQLYFQRIDC 444
L F+ C
Sbjct: 455 GGSTLGFRPGAC 466
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 128/441 (29%), Positives = 184/441 (41%), Gaps = 64/441 (14%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV-- 98
K L R L+ A+A +S R S + S K D R G+S P
Sbjct: 43 KQLSRSELIRRAMQRSKARAA---ALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGD 99
Query: 99 --FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDS 153
+ V+ +IG PP P A+LDTGS LIW +C PC C A F P +S +Y + C
Sbjct: 100 LEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAG 159
Query: 154 SYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C++ C PD C Y Y +G + G +E+F F +S + +GFGC
Sbjct: 160 QLCSDILHHGC-EMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFGCGS 218
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILG--EGA 266
N S +G+ G G + SLV ++ +FSYC+ +Y + L+ G G
Sbjct: 219 MNVG-SLNNGSGIVGFG---RNPLSLVSQLSIRRFSYCL--TSYGSGRKSTLLFGSLSGG 272
Query: 267 ILEGDSTPMSVI--------DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
+ + P+ YYV L G+++G + L I + F S GV +DS
Sbjct: 273 VYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGS-GGVIVDS 331
Query: 319 GTTLTWL----VPSAYQTLRKEV----------EDLFQGLLPSYPMDPAWHLCYSGNINR 364
GT LT L + + R+++ ED L+P+ AW S +
Sbjct: 332 GTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPA-----AWRRSSSTS--- 383
Query: 365 DLQGFPAMAFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
P M FHF ADL L + V CL + S +G S IG + Q
Sbjct: 384 -QVPVPRMVFHFQ-DADLDLPRRNYVLDDHRKGRLCLLLADSGDDG------STIGNLVQ 435
Query: 424 QNYNVAYDLVSKQLYFQRIDC 444
Q+ V YDL ++ L F C
Sbjct: 436 QDMRVLYDLEAETLSFAPAQC 456
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 131/436 (30%), Positives = 185/436 (42%), Gaps = 64/436 (14%)
Query: 28 AAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRA 87
A P A +R + R N QR L+M AR + S+Q T
Sbjct: 19 APPPAFSARRSFRATMTRTEPAINLTRAAHKSHQR-LSMLAARLDDAASGSAQ----TPL 73
Query: 88 HLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSL 144
L G + + FSIG PP A+ DTGS LIW KC C +C G+ ++ P+KS
Sbjct: 74 QLDSGGG---AYDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSS 130
Query: 145 TYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPD----SQGTIGSEQFNFETSDEG 196
+++ LPC S C++ C EC Y Y D +QG +GSE F G
Sbjct: 131 SFSKLPCSGSLCSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTL-----G 185
Query: 197 KTFLYDVGFGC-SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFE 254
+ +GFGC + + + G GP SLV ++ FSYC L
Sbjct: 186 SDAVPGIGFGCTTMSEGGYGSGSGLVGLGRGPL-----SLVSQLNVGAFSYC---LTSDA 237
Query: 255 YAYNMLILGEGAILEG--DSTPMSVIDGSYY-VTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ L+ G GA+ STP+ YY V LE IS+G
Sbjct: 238 AKTSPLLFGSGALTGAGVQSTPLLRTSTYYYTVNLESISIGA----------ATTAGTGS 287
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQGF 369
+G+ DSGTT+ +L AY ++ V L + D + +C+ SG + F
Sbjct: 288 SGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLTMASGRD-GYEVCFQTSGAV------F 340
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P+M HF GG D+ L E+ F SV C V ++ LSI+G I Q NY++
Sbjct: 341 PSMVLHFDGG-DMDLPTENYFGAVDDSVSCWIV-------QKSPSLSIVGNIMQMNYHIR 392
Query: 430 YDLVSKQLYFQRIDCE 445
YD+ L FQ +C+
Sbjct: 393 YDVEKSMLSFQPANCD 408
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 114/370 (30%), Positives = 157/370 (42%), Gaps = 54/370 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS WV+CQPC EQ FDP++S TYA + C +
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQ-QEKLFDPARSSTYANVSCAA 237
Query: 154 SYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C T C G C Y ++Y +G S G + + D K F FGC
Sbjct: 238 PACFDLDTRGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFR----FGCGE 291
Query: 210 NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGE 264
N E G+ GLG TS +K G F++C+ Y ++
Sbjct: 292 RNEGLFGEA-AGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPA-AA 349
Query: 265 GAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
GA L +TPM +G YYV + GI +G ++L I ++F + AG +DSGT +
Sbjct: 350 GARL---TTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVF------ATAGTIVDSGTVI 400
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAF 374
T L P AY +LR Y PA L CY D G P ++
Sbjct: 401 TRLPPPAYSSLRSAFVSAMAAR--GYKKAPAVSLLDTCY------DFTGMSQVAIPTVSL 452
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F GGA L +DA + Y S S CL ++ G D+ I+G + + VAYD+
Sbjct: 453 LFQGGAILDVDASGIMYAASVSQVCLGFAANEDGG----DVGIVGNTQLKTFGVAYDIGK 508
Query: 435 KQLYFQRIDC 444
K + F C
Sbjct: 509 KVVGFSPGAC 518
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 169/378 (44%), Gaps = 45/378 (11%)
Query: 99 FYVNFSIGQPP---VPQLAVLDTGSSLIWVKCQPCEQCGATT----FDPSKSLTYATLPC 151
+ V IG P P+ + DTGS L W +C+PC C + T DPSKS T+ L C
Sbjct: 123 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 182
Query: 152 DSSYC-----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY--DVG 204
C D GG C + RY +G G + S+ F+F + +G + DV
Sbjct: 183 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 242
Query: 205 FGCSHNNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG-SKFSYCI--------GNLNYFE 254
FGC+H + + TG+ LG S V ++G +FSYCI + + E
Sbjct: 243 FGCAHVEDSKAVRGYSTGILALGIGKP---SFVTQLGVDRFSYCIPASEITDDDDDDDEE 299
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISL--GEKMLDID--PNLFKKNDTWS 310
+ + L G A + G P Y V L+ + G ++ P + +
Sbjct: 300 RSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEEAAA 359
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFP 370
+ +DSGTTL WL S + L++ +E+ L Y + CY GN+ D++
Sbjct: 360 AMPMLVDSGTTLLWLPGSVFYPLQRRIEEDIS-LTRRYDLTHPSLYCYLGNMT-DVEAV- 416
Query: 371 AMAFHFAGGADLVLDAESVFYQESSSV---FCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
++ F GGADL L S+F+ + + CLAV G R +I+G+ Q+N N
Sbjct: 417 SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVA----AGNR----AILGVYPQRNIN 468
Query: 428 VAYDLVSKQLYFQRIDCE 445
V YDL + ++ F R C+
Sbjct: 469 VGYDLSTMEIAFDRDQCD 486
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 169/378 (44%), Gaps = 45/378 (11%)
Query: 99 FYVNFSIGQPP---VPQLAVLDTGSSLIWVKCQPCEQCGATT----FDPSKSLTYATLPC 151
+ V IG P P+ + DTGS L W +C+PC C + T DPSKS T+ L C
Sbjct: 102 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 161
Query: 152 DSSYC-----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY--DVG 204
C D GG C + RY +G G + S+ F+F + +G + DV
Sbjct: 162 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 221
Query: 205 FGCSHNNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG-SKFSYCI--------GNLNYFE 254
FGC+H + + TG+ LG S V ++G +FSYCI + + E
Sbjct: 222 FGCAHVEDSKAVRGYSTGILALGIGKP---SFVTQLGVDRFSYCIPASEITDDDDDDDEE 278
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISL--GEKMLDID--PNLFKKNDTWS 310
+ + L G A + G P Y V L+ + G ++ P + +
Sbjct: 279 RSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEEAAA 338
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFP 370
+ +DSGTTL WL S + L++ +E+ L Y + CY GN+ D++
Sbjct: 339 AMPMLVDSGTTLLWLPGSVFYPLQRRIEEDIS-LTRRYDLTHPSLYCYLGNMT-DVEAV- 395
Query: 371 AMAFHFAGGADLVLDAESVFYQESSSV---FCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
++ F GGADL L S+F+ + + CLAV G R +I+G+ Q+N N
Sbjct: 396 SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVA----AGNR----AILGVYPQRNIN 447
Query: 428 VAYDLVSKQLYFQRIDCE 445
V YDL + ++ F R C+
Sbjct: 448 VGYDLSTMEIAFDRDQCD 465
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 162/371 (43%), Gaps = 47/371 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+++ +G P VLDTGS ++W++C PC+ C FDP KS T+AT+PC S
Sbjct: 138 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRL 197
Query: 156 C-----TNDCGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C +++C + C Y + Y +G ++G +E F + + V GC H
Sbjct: 198 CRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR-----VDHVPLGCGH 252
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLN---YFEYAYNMLILGEGA 266
+N + S + KFSYC+ + + ++ G A
Sbjct: 253 DNEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNDA 312
Query: 267 ILEGDS-TPMSV---IDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
+ + TP+ +D YY+ L GIS+ G ++ + + FK D + GV IDSGT+
Sbjct: 313 VPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKL-DATGNGGVIIDSGTS 371
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMA 373
+T L SAY LR D F+ P++ L C+ DL G P +
Sbjct: 372 VTRLTQSAYVALR----DAFRLGATKLKRAPSYSLFDTCF------DLSGMTTVKVPTVV 421
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
FHF GG + + + + FC A + LSIIG I QQ + VAYDLV
Sbjct: 422 FHFGGGEVSLPASNYLIPVNTEGRFCFAFAGT------MGSLSIIGNIQQQGFRVAYDLV 475
Query: 434 SKQLYFQRIDC 444
++ F C
Sbjct: 476 GSRVGFLSRAC 486
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 128/439 (29%), Positives = 202/439 (46%), Gaps = 60/439 (13%)
Query: 38 LVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPG--IST 95
V + R L+Y + + +++ TL FI ++ ++ H+ G +
Sbjct: 22 FVFNQVFRAELIYREHQSSPLRSE-TLKTPSEIFIAAVKRGHERRARLAKHVLAGDQLFE 80
Query: 96 VPV------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTY 146
PV + ++ S G PP A++DTGS L WV+C PC+ C T FDPSKS +Y
Sbjct: 81 TPVASGNGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASY 140
Query: 147 ATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
TL C S++C + C C Y+ Y +G + G + ++ T GK + +
Sbjct: 141 KTLGCGSNFCQDLPFQSCAA---SCQYDYMYGDGSSTSGALSTDDVTIGT---GK--IPN 192
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG----SKFSYCIGNLNYFEYAYN 258
V FGC ++N F G GL SLV ++G KFSYC+ L +
Sbjct: 193 VAFGCGNSNLG----TFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLG--STKTS 246
Query: 259 MLILGEGAILEGDS-TPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
L +G+ + G + TPM + YY L+GIS+ K ++ N F T G+
Sbjct: 247 PLYIGDSTLAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAAT-GRGGL 305
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH---LCYS--GNINRDLQGF 369
+DSGTTLT+L A+ + + + LP D +++ C+S G N +
Sbjct: 306 ILDSGTTLTYLDVDAFNPMVAAL----KAALPYPEADGSFYGLEYCFSTAGVANPT---Y 358
Query: 370 PAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
P + FHF GAD+ L ++ F + CLA+ S SI G I Q N+ +
Sbjct: 359 PTVVFHF-NGADVALAPDNTFIALDFEGTTCLAMASS-------TGFSIFGNIQQLNHVI 410
Query: 429 AYDLVSKQLYFQRIDCELL 447
+DLV+K++ F+ +CE +
Sbjct: 411 VHDLVNKRIGFKSANCETI 429
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 183/404 (45%), Gaps = 49/404 (12%)
Query: 67 SMARFIYLSQKSSQKA--HDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIW 124
S+ I SSQ A +T+ L GI + Y+ ++G ++DTGS L W
Sbjct: 87 SIQNHIRKRTSSSQIADSSETQVPLTSGIKFQTLNYI-VTMGLGSQNMSVIVDTGSDLTW 145
Query: 125 VKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTN----DCGGYPD---ECWYNIRYT 174
V+C+PC C F PS S +Y + C+S+ C + CG P C Y + Y
Sbjct: 146 VQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYG 205
Query: 175 NGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS 234
+G + G +G E+ F G + + FGC NN F G GL S S
Sbjct: 206 DGSYTSGELGIEKLGF-----GGISVSNFVFGCGRNNKGL----FGGASGLMGLGRSELS 256
Query: 235 LVEKV----GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV--------IDGSY 282
++ + G FSYC+ + + A L++G + + + TP++ + Y
Sbjct: 257 MISQTNATFGGVFSYCLPSTDQ-AGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFY 315
Query: 283 YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
+ L GI +G L + + F + GV +DSGT ++ L PS Y+ L+ + + F
Sbjct: 316 ILNLTGIDVGGVSLHVQASSF------GNGGVILDSGTVISRLAPSVYKALKAKFLEQFS 369
Query: 343 GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY--QESSSVFCL 400
G PS P C++ D P ++ +F G A+L +DA +FY +E +S CL
Sbjct: 370 G-FPSAPGFSILDTCFN-LTGYDQVNIPTISMYFEGNAELNVDATGIFYLVKEDASRVCL 427
Query: 401 AVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
A+ + ++ E ++ IIG Q+N V YD Q+ F + C
Sbjct: 428 AL--ASLSDEY--EMGIIGNYQQRNQRVLYDAKLSQVGFAKEPC 467
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 113/383 (29%), Positives = 180/383 (46%), Gaps = 57/383 (14%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDC 160
V+ ++G PP V+DTGS L W+ C TTFDP++S +Y T+PC S CTN
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSY-PTTFDPTRSTSYQTIPCSSPTCTNRT 91
Query: 161 GGYP--------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
+P + C + Y + S G + S+ F+ +SD + + FGC ++
Sbjct: 92 QDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSD-----ISGLVFGCM--DS 144
Query: 213 HFS-----DEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGA 266
FS D + TG+ G+ + S V ++G KFSYCI ++ +L+LGE
Sbjct: 145 VFSSNSDEDSKSTGLMGM---NRGSLSFVSQLGFPKFSYCISGTDF----SGLLLLGESN 197
Query: 267 I----------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
+ L STP+ D +Y V LEGI + +K+L I + F+ + T +
Sbjct: 198 LTWSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGA-GQTM 256
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNIN-RDLQGF 369
+DSGT T+L+ Y LR + +L P + A LCY ++ R L
Sbjct: 257 VDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLL 316
Query: 370 PAMAFHFAGGADLVLDAESVFYQE------SSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
P + F GA++ + + V Y+ + SV CL+ G SD+ G + +IG Q
Sbjct: 317 PTVTLVFR-GAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLG---VEAYVIGHHHQ 372
Query: 424 QNYNVAYDLVSKQLYFQRIDCEL 446
QN + +DL ++ ++ C+L
Sbjct: 373 QNVWMEFDLEKSRIGLAQVRCDL 395
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 172/374 (45%), Gaps = 54/374 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ NF+IG PP P AV+D L+W +C+ C +C FDP+ S TY PC +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPL 110
Query: 156 C------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C + +C G + C Y TN D+ G +G++ F T+ + FGC
Sbjct: 111 CESIPSDSRNCSG--NVCAYQAS-TNAGDTGGKVGTDTFAVGTAKA------SLAFGCVV 161
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAIL 268
+ + +G+ GLG + SLV + G + FSYC+ + + L LG A L
Sbjct: 162 ASDIDTMGGPSGIVGLG---RTPWSLVTQTGVAAFSYCLAPHDAGR--NSALFLGSSAKL 216
Query: 269 EGD----STPMSVIDGS-------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
G STP I G+ Y V LEG+ G+ M+ + P S + V +D
Sbjct: 217 AGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPP---------SGSTVLLD 267
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
+ + +++LV AYQ ++K V + P++P + LC+ + P + F F
Sbjct: 268 TFSPISFLVDGAYQAVKKAVTAAVGAPPMATPVEP-FDLCFPKSGASGAA--PDLVFTFR 324
Query: 378 GGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK---DLSIIGMIAQQNYNVAYDLVS 434
GGA + + A + + CLA+ ++ R +LS++G + Q+N + +DL
Sbjct: 325 GGAAMTVPATNYLLDYKNGTVCLAM----LSSARLNSTTELSLLGSLQQENIHFLFDLDK 380
Query: 435 KQLYFQRIDCELLA 448
+ L F+ DC L+
Sbjct: 381 ETLSFEPADCTKLS 394
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 165/378 (43%), Gaps = 42/378 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG-------ATTFDPSKSLTYATLPC 151
++V F +G P P + V DTGS L WVKC+ A F + S ++A + C
Sbjct: 101 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIAC 160
Query: 152 DSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-----------TS 193
S CT+ +C C Y+ RY +G ++G +G++ +S
Sbjct: 161 SSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDSS 220
Query: 194 DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNY 252
+ L V GC+ S + GV LG + S S + G +FSYC+ +
Sbjct: 221 GGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLA 280
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDT 308
A + L G GA TP+ ++D Y VT++ + + + LDI +++ D
Sbjct: 281 PRNATSYLTFGPGATAPAAQTPL-LLDRRMTPFYAVTVDAVYVAGEALDIPADVW---DV 336
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
+ G +DSGT+LT L AY+ + + G LP MDP + CY+ L+
Sbjct: 337 DRNGGAILDSGTSLTILATPAYRAVVTALSKHLAG-LPRVTMDP-FEYCYNWTDAGALE- 393
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
P M HFAG A L A+S + V C+ V G +S+IG I QQ +
Sbjct: 394 IPKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQEGSWPG-----VSVIGNILQQEHLW 448
Query: 429 AYDLVSKQLYFQRIDCEL 446
+DL + L F+ C L
Sbjct: 449 EFDLRDRWLRFKHTRCAL 466
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 123/415 (29%), Positives = 181/415 (43%), Gaps = 68/415 (16%)
Query: 61 QRTLNMSMARFIYL--SQKSSQKAHDTRAHLHPGI--STVPV--FYVNFSIGQPPVPQLA 114
+R S AR +L +Q S + A ++PG P + V+ + G PP
Sbjct: 44 RRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGAYDDGFPFTEYLVHLAAGTPPQEVQL 103
Query: 115 VLDTGSSLIWVKCQ--PCEQCGATT---FDPSKSLTYATLPCDSSYC--TNDCGGYPDE- 166
LDTGS + W +C+ P C T FDPS S ++A+LPC S C T CGG D
Sbjct: 104 TLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSPACETTPPCGGGNDAT 163
Query: 167 ---CWYNIRYTNGPDSQGTIGSEQFNFE--TSDEGKTFLYDVGFGCSHNNAHFSDEQFTG 221
C Y+I Y +G S+G IG E F F T + + + FGC H N TG
Sbjct: 164 SRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANRGVFTSNETG 223
Query: 222 VFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
+ G G + S S + KVG+ FS+C + + + ++LG + ++P+ GS
Sbjct: 224 IAGFGRGSLSLPSQL-KVGN-FSHCFTTITGSKTS--AVLLGLPGVAPPSASPLGRRRGS 279
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y ++ S +SGT++T L P Y+ +R+E
Sbjct: 280 YRC--------------------RSTPRSS-----NSGTSITSLPPRTYRAVREEFAAQV 314
Query: 342 Q-GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-------- 392
+ ++P DP C+S + P MA HF GA + L E+ ++
Sbjct: 315 KLPVVPGNATDP--FTCFSAPLRGPKPDVPTMALHFE-GATMRLPQENYVFEVVDDDDAG 371
Query: 393 ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
SS + CLAV I G I+G I QQN +V YDL + +L F C+ L
Sbjct: 372 NSSRIICLAV----IEGGEI----ILGNIQQQNMHVLYDLQNSKLSFVPAQCDQL 418
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 115/385 (29%), Positives = 182/385 (47%), Gaps = 58/385 (15%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--FDPSKSLTYATLPCDSSYCTN 158
V+ ++G PP V+DTGS L W+ C + F+ ++S++Y +PC SS CTN
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPTTFNQTRSISYRPIPCSSSTCTN 92
Query: 159 DCGGY--PDECWYN------IRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
+ P C N + Y + S+G + S+ F+ SD + + FGC
Sbjct: 93 QTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASD-----IPGMVFGCM-- 145
Query: 211 NAHFS-----DEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGE 264
++ FS D + TG+ G+ + S V ++G KFSYCI ++ ML+LGE
Sbjct: 146 DSVFSSNSDEDSKNTGLMGM---NRGSLSFVSQMGFPKFSYCISGTDF----SGMLLLGE 198
Query: 265 GAI----------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
L STP+ D +Y V LEGI + +++L I ++F+ + T +
Sbjct: 199 SNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGA-GQ 257
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNIN-RDLQ 367
+DSGT T+L+ AY LR E + G L P + A LCY I+ R L
Sbjct: 258 TMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQRVLP 317
Query: 368 GFPAMAFHFAGGADLVLDAESVFY------QESSSVFCLAVGPSDINGERFKDLSIIGMI 421
P ++ F GA++ + E V Y + + SV CL+ G SD+ G + +IG
Sbjct: 318 RLPTVSLVF-NGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLG---VEAYVIGHH 373
Query: 422 AQQNYNVAYDLVSKQLYFQRIDCEL 446
QQN + +DL ++ ++ C+L
Sbjct: 374 HQQNVWMEFDLERSRIGLAQVRCDL 398
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 161/360 (44%), Gaps = 36/360 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P VLDTGS + WV+CQPC C FDPS S +YA++ CD+
Sbjct: 163 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPR 222
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C C Y + Y +G + G +E S + V GC H+N
Sbjct: 223 CHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAP----VSSVAIGCGHDN 278
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
F G GL S ++ + FSYC+ ++ + + L G+ A E
Sbjct: 279 EGL----FVGAAGLLALGGGPLSFPSQISATTFSYCL--VDRDSPSSSTLQFGDAADAEV 332
Query: 271 DSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+ + S YYV L GIS+G ++L I P+ F + T + GV +DSGT +T L S
Sbjct: 333 TAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGA-GGVIVDSGTAVTRLQSS 391
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLD 385
AY LR D F S P L CY + +R PA++ FAGG +L L
Sbjct: 392 AYAALR----DAFVRGTQSLPRTSGVSLFDTCYDLS-DRTSVEVPAVSLRFAGGGELRLP 446
Query: 386 AESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
A++ + + +CLA P++ +SIIG + QQ V++D + F C
Sbjct: 447 AKNYLIPVDGAGTYCLAFAPTN------AAVSIIGNVQQQGTRVSFDTAKSTVGFTSNKC 500
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 173/374 (46%), Gaps = 54/374 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ NF+IG PP P AV+D L+W +C+ C +C FDP+ S TY PC +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPL 110
Query: 156 C------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C + +C G + C Y TN D+ G +G++ F T+ + FGC
Sbjct: 111 CESIPSDSRNCSG--NVCAYQAS-TNAGDTGGKVGTDTFAVGTAKA------SLAFGCVV 161
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAIL 268
+ + +G+ GLG + SLV + G + FSYC+ + + + L LG A L
Sbjct: 162 ASDIDTMGGPSGIVGLG---RTPWSLVTQTGVAAFSYCLAPHDAGK--NSALFLGSSAKL 216
Query: 269 EGD----STPMSVIDGS-------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
G STP I G+ Y V LEG+ G+ M+ + P S + V +D
Sbjct: 217 AGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPP---------SGSTVLLD 267
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
+ + +++LV AYQ ++K V + P++P + LC+ + P + F F
Sbjct: 268 TFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEP-FDLCFPKSGASGAA--PDLVFTFR 324
Query: 378 GGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK---DLSIIGMIAQQNYNVAYDLVS 434
GGA + + A + + CLA+ ++ R +LS++G + Q+N + +DL
Sbjct: 325 GGAAMTVAASNYLLDYKNGTVCLAM----LSSARLNSTTELSLLGSLQQENIHFLFDLDK 380
Query: 435 KQLYFQRIDCELLA 448
+ L F+ DC L+
Sbjct: 381 ETLSFEPADCTKLS 394
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 161/360 (44%), Gaps = 43/360 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTN 158
+ V+ +G P + + DTGS L W +C E TFDP+KS +YA + C + C++
Sbjct: 134 YIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAE-----TFDPTKSTSYANVSCSTPLCSS 188
Query: 159 --DCGGYPDECW-----YNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN- 210
G P C Y I+Y +G S G +G E+ ++D F FGC +
Sbjct: 189 VISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTDIFNNFY----FGCGQDV 244
Query: 211 NAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
+ F + G+ GLG S S K FSYC+ + + L G
Sbjct: 245 DGLFG--KAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPS----SSSTGFLSFGSSQSKS 298
Query: 270 GDSTPMSVIDGSYY-VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
TP+S S+Y + L GI++G + L I ++F S AG IDSGT +T L P+
Sbjct: 299 AKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVF------STAGTIIDSGTVVTRLPPA 352
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLD 385
AY LR F+ + SYPM + CY + + ++ P + F+GG D+ +D
Sbjct: 353 AYSALRSA----FRKAMASYPMGKPLSILDTCYDFSKYKTIK-VPKIVISFSGGVDVDVD 407
Query: 386 AESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+F CLA + G R D +I G Q+N+ V YD+ ++ F C
Sbjct: 408 QAGIFVANGLKQVCLAFAGN--TGAR--DTAIFGNTQQRNFEVVYDVSGGKVGFAPASCS 463
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 119/425 (28%), Positives = 177/425 (41%), Gaps = 57/425 (13%)
Query: 61 QRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIST-VPVFYVNFSIGQPPVPQLAVLDTG 119
Q+ N++ A L + + + A L G S ++++ +G PP +LDTG
Sbjct: 132 QQQNNLANAFVASLESSKGEFSGNIMATLESGASLGTGEYFLDMFVGTPPKHVWLILDTG 191
Query: 120 SSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT--------NDCGGYPDECW 168
S L W++C PC C + + P S TY + C C C C
Sbjct: 192 SDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPRCQLVSSSDPLQHCKAENQTCP 251
Query: 169 YNIRYTNGPDSQGTIGSEQF----NFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFG 224
Y Y +G ++ G SE F + E + DV FGC H N F +G+ G
Sbjct: 252 YFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDVMFGCGHWNKGFF-YGASGLLG 310
Query: 225 LGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEG------------AILEGD 271
LG S S ++ + G FSYC+ +L + LI GE +L G+
Sbjct: 311 LGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGE 370
Query: 272 STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA--------GVFIDSGTTLT 323
TP YY+ ++ I +G ++LDI + WS G IDSG+TLT
Sbjct: 371 ETPDETF---YYLQIKSIMVGGEVLDISEQTWH----WSSEGAAADAGGGTIIDSGSTLT 423
Query: 324 WLVPSAYQTLRKEVE---DLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
+ SAY +++ E L Q + M P CY+ + P HFA G
Sbjct: 424 FFPDSAYDIIKEAFEKKIKLQQIAADDFVMSP----CYNVSGAMMQVELPDFGIHFADGG 479
Query: 381 DLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
AE+ FYQ E V CLA+ + L+IIG + QQN+++ YD+ +L +
Sbjct: 480 VWNFPAENYFYQYEPDEVICLAI----MKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGY 535
Query: 440 QRIDC 444
C
Sbjct: 536 SPRRC 540
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 119/425 (28%), Positives = 182/425 (42%), Gaps = 66/425 (15%)
Query: 52 PNDTVDAQAQRTLNMSMARFIYLSQK-SSQKAHDTRAHLHPGISTVPV----------FY 100
P++ + A + L R Y+ +K S K D +TVP +
Sbjct: 76 PSNKMPASLEERLQRDQLRAAYIKRKFSGAKGGDVE---QSDAATVPTTLGTSLSTLEYV 132
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCT 157
+ IG P V Q +DTGS + WV+C+PC QC + + FDPS S TY+ C S+ C
Sbjct: 133 ITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSCSSAACV 192
Query: 158 --------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
N C +C Y + Y +G + GT S+ G + FGCS
Sbjct: 193 QLSQSQQGNGCSS--SQCQYIVSYVDGSSTTGTYSSDTLTL-----GSNAIKGFQFGCSQ 245
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGEG 265
+ + +Q G+ GLG SLV + G FSYC L + L LG
Sbjct: 246 SESGGFSDQTDGLMGLG---GDAQSLVSQTAGTFGKAFSYC---LPPTPGSSGFLTLGAA 299
Query: 266 AILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+ TPM + I Y V LE I +G + L+I ++F AG +DSGT +
Sbjct: 300 SRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVF-------SAGSVMDSGTVI 352
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYP---MDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
T L P+AY L + + P+ P +D + +SG + + P++A F+GG
Sbjct: 353 TRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFD--FSGQSSVSI---PSVALVFSGG 407
Query: 380 ADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
A + LD + + + +CLA + + L IG + Q+ + V YD+ + F
Sbjct: 408 AVVNLDFNGIMLELDN--WCLAFAANSDD----SSLGFIGNVQQRTFEVLYDVGGGAVGF 461
Query: 440 QRIDC 444
+ C
Sbjct: 462 RAGAC 466
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 111/380 (29%), Positives = 169/380 (44%), Gaps = 47/380 (12%)
Query: 99 FYVNFSIGQPP---VPQLAVLDTGSSLIWVKCQPCEQCGATT----FDPSKSLTYATLPC 151
+ V IG P P+ + DTGS L W +C+PC C + T DPSKS T+ L C
Sbjct: 101 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 160
Query: 152 DSSYC-----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY--DVG 204
C D GG C + RY +G G + S+ F+F + +G + DV
Sbjct: 161 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 220
Query: 205 FGCSHNNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG-SKFSYCI----------GNLNY 252
FGC+H + + TG+ LG S V ++G +FSYCI + +
Sbjct: 221 FGCAHVEDSKAVRGYSTGILALGIGKP---SFVTQLGVDRFSYCIPASEITDDDDDDDDD 277
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISL--GEKMLDID--PNLFKKNDT 308
E + + L G A + G P Y V L+ + G ++ P +
Sbjct: 278 EERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEEA 337
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
+ + +DSGTTL WL S + L++ +E+ L Y + CY GN+ D++
Sbjct: 338 AAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDIS-LTRRYDLTHPSLYCYLGNMT-DVEA 395
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESSSV---FCLAVGPSDINGERFKDLSIIGMIAQQN 425
++ F GGADL L S+F+ + + CLAV G R +I+G+ Q+N
Sbjct: 396 V-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVA----AGNR----AILGVYPQRN 446
Query: 426 YNVAYDLVSKQLYFQRIDCE 445
NV YDL + ++ F R C+
Sbjct: 447 INVGYDLSTMEIAFDRDQCD 466
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 111/380 (29%), Positives = 169/380 (44%), Gaps = 47/380 (12%)
Query: 99 FYVNFSIGQPP---VPQLAVLDTGSSLIWVKCQPCEQCGATT----FDPSKSLTYATLPC 151
+ V IG P P+ + DTGS L W +C+PC C + T DPSKS T+ L C
Sbjct: 122 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 181
Query: 152 DSSYC-----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY--DVG 204
C D GG C + RY +G G + S+ F+F + +G + DV
Sbjct: 182 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 241
Query: 205 FGCSHNNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG-SKFSYCI----------GNLNY 252
FGC+H + + TG+ LG S V ++G +FSYCI + +
Sbjct: 242 FGCAHVEDSKAVRGYSTGILALGIGKP---SFVTQLGVDRFSYCIPASEITDDDDDDDDD 298
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISL--GEKMLDID--PNLFKKNDT 308
E + + L G A + G P Y V L+ + G ++ P +
Sbjct: 299 EERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEEA 358
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
+ + +DSGTTL WL S + L++ +E+ L Y + CY GN+ D++
Sbjct: 359 AAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDIS-LTRRYDLTHPSLYCYLGNMT-DVEA 416
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESSSV---FCLAVGPSDINGERFKDLSIIGMIAQQN 425
++ F GGADL L S+F+ + + CLAV G R +I+G+ Q+N
Sbjct: 417 V-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVA----AGNR----AILGVYPQRN 467
Query: 426 YNVAYDLVSKQLYFQRIDCE 445
NV YDL + ++ F R C+
Sbjct: 468 INVGYDLSTMEIAFDRDQCD 487
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 105/366 (28%), Positives = 162/366 (44%), Gaps = 58/366 (15%)
Query: 94 STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKC-QPCEQC---GATTFDPSKSLTYATL 149
++ + V+ +IG PP+P AVLDTGS LIW +C PC +C A + P++S TYA +
Sbjct: 87 ASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANV 146
Query: 150 PCDSSYCT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
C S C + C C Y Y +G + G + +E F + T + V
Sbjct: 147 SCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGS----DTAVRGV 202
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC N +D +G+ G+G SLV ++G + +
Sbjct: 203 AFGCGTENLGSTDNS-SGLVGMG---RGPLSLVSQLG---------VTRPRRSCRARAAA 249
Query: 264 EGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
G ++P LEGI++G+ +L IDP +F+ D GV IDSGTT T
Sbjct: 250 RGGGAPTTTSP-----------LEGITVGDTLLPIDPAVFRLTP-MGDGGVIIDSGTTFT 297
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH----LCYSGNINRDLQGFPAMAFHFAGG 379
L A+ L + + + P+ H LC++ ++ P + HF G
Sbjct: 298 ALEERAFVALARALASRVR-----LPLASGAHLGLSLCFAAASPEAVE-VPRLVLHF-DG 350
Query: 380 ADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
AD+ L ES V S+ V CL + + + +S++G + QQN ++ YDL L
Sbjct: 351 ADMELRRESYVVEDRSAGVACLGMVSA-------RGMSVLGSMQQQNTHILYDLERGILS 403
Query: 439 FQRIDC 444
F+ C
Sbjct: 404 FEPAKC 409
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 107/364 (29%), Positives = 159/364 (43%), Gaps = 38/364 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSY 155
+ V G P L ++DTGS L W++C+PC C + F+P +S +Y TLPC S+
Sbjct: 137 YIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTLPCLSAT 196
Query: 156 CT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
CT N C Y I Y +G SQG F+ ET G + FGC
Sbjct: 197 CTELITSESNPTPCLLGGCVYEINYGDGSSSQG-----DFSQETLTLGSDSFQNFAFGCG 251
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVE-KVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
H N +G+ GLG + S S + K G +F+YC+ + + + G+G+I
Sbjct: 252 HTNTGLFKGS-SGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGSFSV-GKGSI 309
Query: 268 -LEGDSTPMS---VIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
TP+ + Y+V L GIS+G L I P + + T +DSGT +T
Sbjct: 310 PASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGST------IVDSGTVIT 363
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ-GFPAMAFHFAGGADL 382
L+P AY L+ + L + P CY +++R Q P + FHF AD+
Sbjct: 364 RLLPQAYNALKTSFRSKTRDLPSAKPFS-ILDTCY--DLSRHSQVRIPTITFHFQNNADV 420
Query: 383 VLDAESVF--YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ + Q S CLA + + +IIG QQ VA+D + ++ F
Sbjct: 421 AVSDVGILVPVQNGGSQVCLAFA----SASQMDGFNIIGNFQQQRMRVAFDTGAGRIGFA 476
Query: 441 RIDC 444
C
Sbjct: 477 SGSC 480
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 109/363 (30%), Positives = 156/363 (42%), Gaps = 40/363 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSS 154
+ V +G P V DTGS WV+CQPC + FDP++S TYA + C +
Sbjct: 182 YVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAP 241
Query: 155 YC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
C T C G C Y+++Y +G S G + + D K F FGC
Sbjct: 242 ACSDLYTRGCSG--GHCLYSVQYGDGSYSIGFFAMDTLTLSSYDAVKGFR----FGCGER 295
Query: 211 NAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG---A 266
N E G+ GLG TS +K G F++C L L G G A
Sbjct: 296 NEGLFGEA-AGLLGLGRGKTSLPVQTYDKYGGVFAHC---LPARSSGTGYLDFGPGSPAA 351
Query: 267 ILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
+ +TPM +G YYV + GI +G ++L I ++F S AG +DSGT +T
Sbjct: 352 VGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVF------STAGTIVDSGTVITR 405
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGAD 381
L P+AY +LR Y PA L CY ++ P ++ F GGA
Sbjct: 406 LPPAAYSSLRSAFASAMAAR--GYKKAPALSLLDTCYDFTGMSEV-AIPKVSLLFQGGAY 462
Query: 382 LVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
L ++A + Y S S CL ++ + D+ I+G + + V YD+ K + F
Sbjct: 463 LDVNASGIMYAASLSQVCLGFAANEDD----DDVGIVGNTQLKTFGVVYDIGKKTVGFSP 518
Query: 442 IDC 444
C
Sbjct: 519 GAC 521
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 159/365 (43%), Gaps = 54/365 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS WV+CQPC EQ FDP++S TYA + C +
Sbjct: 180 YVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQ-REKLFDPARSSTYANVSCAA 238
Query: 154 SYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C++ C G C Y ++Y +G S G + + D K F FGC
Sbjct: 239 PACSDLNIHGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFR----FGCGE 292
Query: 210 NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGE 264
N E G+ GLG TS +K G F++C+ Y ++ L
Sbjct: 293 RNEGLFGEA-AGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLAAAS 351
Query: 265 GAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+ +TPM +G YYV + GI +G ++L I ++F + AG +DSGT +
Sbjct: 352 ARL----TTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVF------ATAGTIVDSGTVI 401
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAF 374
T L P+AY +LR Y PA L CY D G P ++
Sbjct: 402 TRLPPAAYSSLRYAFAAAMA--ARGYKKAPAVSLLDTCY------DFTGMSQVAIPTVSL 453
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F GGA L +DA + Y S+S CLA ++ G D+ I+G + + VAYD+
Sbjct: 454 LFQGGARLDVDASGIMYAASASQVCLAFAANEDGG----DVGIVGNTQLKTFGVAYDIGK 509
Query: 435 KQLYF 439
K + F
Sbjct: 510 KVVGF 514
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 114/379 (30%), Positives = 163/379 (43%), Gaps = 43/379 (11%)
Query: 89 LHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QCGATT---FDPSKS 143
L+PG+S +YV +G PP +LDTGSSL W++CQPC C A +DPS S
Sbjct: 114 LNPGLSIGSGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVS 173
Query: 144 LTYATLPCDSSYCT-------ND--CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSD 194
TY L C S C+ ND C + C Y Y + S G + + +S
Sbjct: 174 KTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ 233
Query: 195 EGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCIGNLNYF 253
F Y GC +N G+ GL S L K G FSYC+ N
Sbjct: 234 TLPQFTY----GCGQDNQGLFGRA-AGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSG 288
Query: 254 EYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
L +G + TPM S Y++ L I++ + LD+ +++
Sbjct: 289 SSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYR------ 342
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQ 367
IDSGT +T L S Y LR+ + Y PA+ + C+ G++ + +
Sbjct: 343 -VPTLIDSGTVITRLPMSMYAALRQAFVKIMS---TKYAKAPAYSILDTCFKGSL-KSIS 397
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNY 426
P + F GGADL L A S+ + + CLA G S N ++IIG QQ Y
Sbjct: 398 AVPEIKMIFQGGADLTLRAPSILIEADKGITCLAFAGSSGTN-----QIAIIGNRQQQTY 452
Query: 427 NVAYDLVSKQLYFQRIDCE 445
N+AYD+ + ++ F C
Sbjct: 453 NIAYDVSTSRIGFAPGSCH 471
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 162/371 (43%), Gaps = 47/371 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+++ +G P VLDTGS ++W++C PC+ C T FDP KS T+AT+PC S
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRL 194
Query: 156 C-----TNDCGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C +++C + C Y + Y +G ++G +E F + + V GC H
Sbjct: 195 CRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR-----VDHVPLGCGH 249
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLN---YFEYAYNMLILGEGA 266
+N + S + KFSYC+ + + ++ G A
Sbjct: 250 DNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAA 309
Query: 267 ILEGDS-TPMSV---IDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
+ + TP+ +D YY+ L GIS+ G ++ + + FK D + GV IDSGT+
Sbjct: 310 VPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKL-DATGNGGVIIDSGTS 368
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMA 373
+T L AY LR D F+ P++ L C+ DL G P +
Sbjct: 369 VTRLTQPAYVALR----DAFRLGATKLKRAPSYSLFDTCF------DLSGMTTVKVPTVV 418
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
FHF GG + + + + FC A + LSIIG I QQ + VAYDLV
Sbjct: 419 FHFGGGEVSLPASNYLIPVNTEGRFCFAFAGT------MGSLSIIGNIQQQGFRVAYDLV 472
Query: 434 SKQLYFQRIDC 444
++ F C
Sbjct: 473 GSRVGFLSRAC 483
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 160/366 (43%), Gaps = 43/366 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G PP VLDTGS ++W++C+PC +C + T FDPSKS ++A +PC S
Sbjct: 130 YFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPL 189
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C + C Y + Y +G + G +E F + + V GC H+N
Sbjct: 190 CRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTFR-----RAAVPRVAIGCGHDN 244
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE-G 270
+ S + +KFSYC+ + + ++ G+ A+
Sbjct: 245 EGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTDRTA-SAKPSSIVFGDSAVSRTA 303
Query: 271 DSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
TP+ +D YYV L GIS+G + F + D+ + GV IDSGT++T L
Sbjct: 304 RFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLTR 363
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAFHFAGG 379
AY +LR D F+ P + L CY DL G P + HF G
Sbjct: 364 PAYVSLR----DAFRVGASHLKRAPEFSLFDTCY------DLSGLSEVKVPTVVLHFR-G 412
Query: 380 ADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
AD+ L A + ++S FC A + LSIIG I QQ + V +DL ++
Sbjct: 413 ADVSLPAANYLVPVDNSGSFCFAFAGT------MSGLSIIGNIQQQGFRVVFDLAGSRVG 466
Query: 439 FQRIDC 444
F C
Sbjct: 467 FAPRGC 472
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 162/384 (42%), Gaps = 81/384 (21%)
Query: 85 TRAHLHPGISTVPV------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GA 135
+ A + P PV + + SIG PP + DTGS L+W +C PC C
Sbjct: 4 SEASISPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKN 63
Query: 136 TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDE 195
FDPSKS ++ + C+S C R + P S
Sbjct: 64 PMFDPSKSTSFKEVSCESQQC---------------RLLDTPTS---------------- 92
Query: 196 GKTFLYDVGFGCSHNNAHFSDEQFTGVFGL-GPATSSTHSLVEKVGS--KFSYCIGNLNY 252
+ ++ FGC HNN+ +E G+FG G S T ++ +GS KFS C+
Sbjct: 93 ----ILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRT 148
Query: 253 FEYAYNMLILGEGAILEGD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKND 307
+ +I G A + G STP+ D Y+VTL+GIS+G+K+ F +
Sbjct: 149 DPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFP-----FSSSS 203
Query: 308 TWSDAG-VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW------HLCYSG 360
+ G VFID+GT T L Y L QG+ + PM+P LCY
Sbjct: 204 PMATKGNVFIDAGTPPTLLPRDFYNR-------LVQGVKEAIPMEPVQDPDLQPQLCYR- 255
Query: 361 NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGM 420
+ L P + HF GAD+ L + F V+C A+ P D D I G
Sbjct: 256 --SATLIDGPILTAHF-DGADVQLKPLNTFISPKEGVYCFAMQPID------GDTGIFGN 306
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDC 444
Q N+ + +DL K++ F+ +DC
Sbjct: 307 FVQMNFLIGFDLDGKKVSFKAVDC 330
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 111/380 (29%), Positives = 169/380 (44%), Gaps = 47/380 (12%)
Query: 99 FYVNFSIGQPP---VPQLAVLDTGSSLIWVKCQPCEQCGATT----FDPSKSLTYATLPC 151
+ V IG P P+ + DTGS L W +C+PC C + T DPSKS T+ L C
Sbjct: 104 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 163
Query: 152 DSSYC-----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY--DVG 204
C D GG C + RY +G G + S+ F+F + +G + DV
Sbjct: 164 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 223
Query: 205 FGCSHNNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG-SKFSYCI----------GNLNY 252
FGC+H + + TG+ LG S V ++G +FSYCI + +
Sbjct: 224 FGCAHVEDSKAVRGYSTGILALGIGKP---SFVTQLGVDRFSYCIPASEITDDDDDDDDD 280
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISL--GEKMLDID--PNLFKKNDT 308
E + + L G A + G P Y V L+ + G ++ P +
Sbjct: 281 EERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEEA 340
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
+ + +DSGTTL WL S + L++ +E+ L Y + CY GN+ D++
Sbjct: 341 AAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDIS-LTRRYDLTHPSLYCYLGNMT-DVEA 398
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESSSV---FCLAVGPSDINGERFKDLSIIGMIAQQN 425
++ F GGADL L S+F+ + + CLAV G R +I+G+ Q+N
Sbjct: 399 V-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVA----AGNR----AILGVYPQRN 449
Query: 426 YNVAYDLVSKQLYFQRIDCE 445
NV YDL + ++ F R C+
Sbjct: 450 INVGYDLSTMEIAFDRDQCD 469
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 115/399 (28%), Positives = 180/399 (45%), Gaps = 54/399 (13%)
Query: 76 QKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-- 133
SS+++ + + L GI+ + Y+ +IG ++DTGS L WV+C PC C
Sbjct: 109 HNSSEQSSEIQIPLASGINLETLNYI-VTIGLGNQNMTVIIDTGSDLTWVQCDPCMSCYS 167
Query: 134 -GATTFDPSKSLTYATLPCDSSYCTN---------DC-GGYPDECWYNIRYTNGPDSQGT 182
F+PS S +Y +L C+SS C N C P C + + Y +G + G
Sbjct: 168 QQGPVFNPSNSSSYNSLLCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGE 227
Query: 183 IGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GS 241
+G E +F G + + FGC NN +G+ GLG + S S G
Sbjct: 228 LGVEHLSF-----GGISVSNFVFGCGRNNKGLFGG-VSGIMGLGRSNLSMISQTNTTFGG 281
Query: 242 KFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV--------IDGSYYVTLEGISLGE 293
FSYC+ + A L++G + L + TP++ + Y + L GI +G
Sbjct: 282 VFSYCLPTTD--SGASGSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGG 339
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA 353
+ ++ ++ + G+ IDSGT +T L PS Y L+ E F G YP+ PA
Sbjct: 340 VAI--------QDTSFGNGGILIDSGTVITRLAPSLYNALKAEFLKQFSG----YPIAPA 387
Query: 354 WHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGP-SDIN 408
+ C++ ++ P ++ HF DL +DA + Y + S CLA+ SD N
Sbjct: 388 LSILDTCFNLTGIEEVS-IPTLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDEN 446
Query: 409 GERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
D++IIG Q+N V YD ++ F R DC +
Sbjct: 447 -----DMAIIGNYQQRNQRVIYDAKQSKIGFAREDCSFI 480
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 110/364 (30%), Positives = 166/364 (45%), Gaps = 50/364 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCT 157
V+Y ++G PP V+DTGS L WV+C PC ++TFD S TY L C
Sbjct: 2 VYYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASNTYKALTCADD--- 58
Query: 158 NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF--ETSDEGKTFLYDVGFGC-SHNNAHF 214
Y+ Y +G +QG + + SDE + F V FGC S
Sbjct: 59 -----------YSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFV-FGCGSLLKGLI 106
Query: 215 SDEQFTGVFGLGPATSSTHSLV-EKVGSKFSYCI-----------GNLNYFEYAYNMLIL 262
S E G+ L P + S S + EK G+KFSYC+ + + E A +
Sbjct: 107 SGE--VGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKEP 164
Query: 263 GEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
G G + E TP+ Y V L+GIS+G + LD+ P+ F D DSGTTL
Sbjct: 165 GSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNG---QDKPTIFDSGTTL 221
Query: 323 TWLVPSAYQTLRKEVEDLFQG--LLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
T L P ++++ + + G + +D + + S QG P + FHF GGA
Sbjct: 222 TMLPPGVCDSIKQSLASMVSGAEFVAIKGLDACFRVPPSSG-----QGLPDITFHFNGGA 276
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
D V S + + S+ CL P++ ++SI G + QQ++ V +D+ ++++ F+
Sbjct: 277 DFVT-RPSNYVIDLGSLQCLIFVPTN-------EVSIFGNLQQQDFFVLHDMDNRRIGFK 328
Query: 441 RIDC 444
DC
Sbjct: 329 ETDC 332
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 124/457 (27%), Positives = 196/457 (42%), Gaps = 55/457 (12%)
Query: 16 PFTSTRIFTSTTAAPAAGKPKRLVTKLLHRD--SLLYNPNDTVDAQAQRTLNMSMARFIY 73
PF I TS++ + V K H D SL + + D+ +++N + I+
Sbjct: 50 PFNQQEIVTSSSQLTMELHSRTSVQKTKHPDYRSLTLSRLER-DSARVKSINTRLDLAIH 108
Query: 74 LSQKSSQKAHDTRAHLHPGISTVPV----------FYVNFSIGQPPVPQLAVLDTGSSLI 123
S K DT + P+ ++ IG+P P VLDTGS +
Sbjct: 109 GLSTSDLKPLDTDSQFRAEDLQGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVN 168
Query: 124 WVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNG 176
W++C PC C F+P+ S +Y+ L CD+ C ++C + C Y + Y +G
Sbjct: 169 WIQCAPCADCYHQADPIFEPASSTSYSPLSCDTKQCQSLDVSECRN--NTCLYEVSYGDG 226
Query: 177 PDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLV 236
+ G F ET G + +V GC HNN F G GL S
Sbjct: 227 SYTVG-----DFVTETITLGSASVDNVAIGCGHNNEGL----FIGAAGLLGLGGGKLSFP 277
Query: 237 EKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLG 292
++ S FSYC+ ++ + + L + + P+ +D YYV + G+S+G
Sbjct: 278 SQINASSFSYCL--VDRDSDSASTLEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVG 335
Query: 293 EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLR----KEVEDLFQGLLPSY 348
++L I ++F+ +++ + G+ IDSGT +T L +AY LR K +D LP
Sbjct: 336 GELLSIPESMFEMDES-GNGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKD-----LPVT 389
Query: 349 PMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDI 407
+ CY + ++ P + FH AGG L L A + +S FC A P+
Sbjct: 390 SEVALFDTCYDLSRKTSVE-VPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAFAPTS- 447
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
LSIIG + QQ V +DL + + F+ C
Sbjct: 448 -----SALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 110/372 (29%), Positives = 161/372 (43%), Gaps = 44/372 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++V +G P V+DTGS L W++CQPC+ C FDP S ++ +PC S
Sbjct: 54 YFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPL 113
Query: 156 C----TNDCG---GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
C + C G C Y + Y +G S G S+ F T + + V FGC
Sbjct: 114 CKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMS----VAFGCG 169
Query: 209 HNNAHFSDEQFTGVFGLG-----PATSSTHSLVEKVGSKFSYC-IGNLNYFEYAYNMLIL 262
+N + P+ S + FSYC + N + + LI
Sbjct: 170 FDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIF 229
Query: 263 GEGAILEGDS-TPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
G AI + +P+ +D YY + G+S+G L I + + + S GV IDS
Sbjct: 230 GVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGS-GGVIIDS 288
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGL---LPSYPMDPAWHLCY--SGNINRDLQGFPAMA 373
GT++T S Y T+R D F+ LPS P + CY SG + D+ PA+
Sbjct: 289 GTSVTRFPTSVYATIR----DAFRNATINLPSAPRYSLFDTCYNFSGKASVDV---PALV 341
Query: 374 FHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
HF GADL L + ++ FCLA P+ + +L IIG I QQ++ + +DL
Sbjct: 342 LHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSM------ELGIIGNIQQQSFRIGFDL 395
Query: 433 VSKQLYFQRIDC 444
L F C
Sbjct: 396 QKSHLAFAPQQC 407
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 164/382 (42%), Gaps = 56/382 (14%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQ---PCEQCGATTFDPSKSLTYATLPCDSSYCT 157
V+ IG PP Q VLDTGS L W++C P + +FDPS S T++TLPC C
Sbjct: 99 VDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCK 158
Query: 158 NDCGGYP-----DE---CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
+ D+ C Y+ Y +G ++G + E+F F S F + GC+
Sbjct: 159 PRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRS----LFTPPLILGCAT 214
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI--------------------GN 249
+ +D + G+ G+ S S + +KFSYC+ N
Sbjct: 215 ES---TDPR--GILGMNRGRLSFAS--QSKITKFSYCVPTRVTRPGYTPTGSFYLGHNPN 267
Query: 250 LNYFEYAYNMLILGEGAILEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDT 308
N F Y ML S M +D +Y V L+GI +G + L+I P +F+ D
Sbjct: 268 SNTFRYI-EMLTFAR-------SQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRA-DA 318
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKE-VEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ 367
+DSG+ T+LV AY +R E V + + Y +C+ GN +
Sbjct: 319 GGSGQTMLDSGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFDGNAIEIGR 378
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
M F F G +V+ E V V C+ + SD G +IIG QQN
Sbjct: 379 LIGDMVFEFEKGVQIVVPKERVLATVEGGVHCIGIANSDKLGA---ASNIIGNFHQQNLW 435
Query: 428 VAYDLVSKQLYFQRIDCELLAD 449
V +DLV++++ F DC LA
Sbjct: 436 VEFDLVNRRMGFGTADCSRLAK 457
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 114/379 (30%), Positives = 170/379 (44%), Gaps = 47/379 (12%)
Query: 91 PGISTVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKS 143
P VP+ + V+ +G P L V DTGS L WV+C+PC+ C FDPS+S
Sbjct: 126 PARRGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQS 185
Query: 144 LTYATLPCDSSYCTN-DCGGYPD-ECWYNIRYTNGPDSQGTIGSEQFNF------ETSDE 195
TY+ +PC + C D G +C Y + Y + + G + + +SD+
Sbjct: 186 TTYSAVPCGAQECRRLDSGSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQ 245
Query: 196 GKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFE 254
+ F+ FGC ++ + G+FGLG S S K G+ FSYC+ + + E
Sbjct: 246 LQEFV----FGCGDDDTGLFGKA-DGLFGLGRDRVSLASQAAAKYGAGFSYCLPSSSTAE 300
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
L LG A T M + YY+ L GI + + + + P +F+
Sbjct: 301 ---GYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRT------ 351
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP--SYPMDPAWHL---CYSGNINRDL 366
G IDSGT +T L AY LR F GL+ SY PA + CY +
Sbjct: 352 PGTVIDSGTVITRLPSRAYAALRSS----FAGLMRRYSYKRAPALSILDTCYDFTGRNKV 407
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
Q P++A F GGA L L V Y + S CLA NG+ ++I+G + Q+ +
Sbjct: 408 Q-IPSVALLFDGGATLNLGFGEVLYVANKSQACLAFAS---NGDD-TSIAILGNMQQKTF 462
Query: 427 NVAYDLVSKQLYFQRIDCE 445
V YD+ ++++ F C
Sbjct: 463 AVVYDVANQKIGFGAKGCS 481
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 106/364 (29%), Positives = 164/364 (45%), Gaps = 43/364 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G P VLDTGS + W++C PC +C + FDP+ S T+ +L C
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPK 223
Query: 156 CTN-DCGG-YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH 213
C + D ++C Y + Y +G + G ++ F + GK + DV GC H+N
Sbjct: 224 CASLDVSACRSNKCLYQVSYGDGSFTVGNYATDTVTF--GESGK--VNDVALGCGHDNEG 279
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGSK-FSYCIGNLNYFEYA---YNMLILGEGAILE 269
FTG GL S+ ++ +K FSYC+ + + + + +N + +G
Sbjct: 280 L----FTGAAGLLGLGGGALSMTNQIKAKSFSYCLVDRDSAKSSSLDFNSVQIG-----A 330
Query: 270 GDST-PM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
GD+T P+ S +D YYV L G S+G + + I +LF+ D GV +D GT +T L
Sbjct: 331 GDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEV-DASGAGGVILDCGTAVTRL 389
Query: 326 VPSAYQTLR----KEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGAD 381
AY +LR K D +G P D + + P + FHF GG
Sbjct: 390 QTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVK-----VPTVTFHFTGGKS 444
Query: 382 LVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
L L A++ + + FC A P+ LSIIG + QQ + YDL + +
Sbjct: 445 LNLPAKNYLIPIDDAGTFCFAFAPTS------SSLSIIGNVQQQGTRITYDLANNLIGLS 498
Query: 441 RIDC 444
C
Sbjct: 499 ANKC 502
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 108/360 (30%), Positives = 161/360 (44%), Gaps = 36/360 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P VLDTGS + WV+CQPC C FDPS S +YA++ CD+
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPR 226
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C C Y + Y +G + G +E S + V GC H+N
Sbjct: 227 CHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAP----VSSVAIGCGHDN 282
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
F G GL S ++ + FSYC+ ++ + + L G+ A E
Sbjct: 283 EGL----FVGAAGLLALGGGPLSFPSQISATTFSYCL--VDRDSPSSSTLQFGDAADAEV 336
Query: 271 DSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+ + S YYV L G+S+G ++L I P+ F + T + GV +DSGT +T L S
Sbjct: 337 TAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGA-GGVIVDSGTAVTRLQSS 395
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLD 385
AY LR D F S P L CY + +R PA++ FAGG +L L
Sbjct: 396 AYAALR----DAFVRGTQSLPRTSGVSLFDTCYDLS-DRTSVEVPAVSLRFAGGGELRLP 450
Query: 386 AESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
A++ + + +CLA P++ +SIIG + QQ V++D + F C
Sbjct: 451 AKNYLIPVDGAGTYCLAFAPTNAA------VSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 169/379 (44%), Gaps = 51/379 (13%)
Query: 83 HDTRAHLHPGISTVPV----------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQ 132
DTR P T PV ++ +G P VLDTGS + W++C+PC
Sbjct: 138 EDTR--YQPEALTTPVVSGVSQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSD 195
Query: 133 C---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGS 185
C F+P+ S TY +L C + C T+ C ++C Y + Y +G + G + +
Sbjct: 196 CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSAC--RSNKCLYQVSYGDGSFTVGELAT 253
Query: 186 EQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFS 244
+ F S GK + DV GC H+N FTG GL S+ ++ + FS
Sbjct: 254 DTVTFGNS--GK--INDVALGCGHDNEGL----FTGAAGLLGLGGGALSITNQMKATSFS 305
Query: 245 YCIGNLNYFEYA---YNMLILGEGAILEGDST-PM---SVIDGSYYVTLEGISLGEKMLD 297
YC+ + + + + +N + LG GD+T P+ ID YYV L G S+G + +
Sbjct: 306 YCLVDRDSGKSSSLDFNSVQLG-----SGDATAPLLRNQKIDTFYYVGLSGFSVGGQKV- 359
Query: 298 IDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLC 357
+ P+ D GV +D GT +T L AY +LR L L + C
Sbjct: 360 MMPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTC 419
Query: 358 YSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLS 416
Y + ++ P +AFHF GG L L A++ + + FC A P+ LS
Sbjct: 420 YDFSSLSSVK-VPTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAPTS------SSLS 472
Query: 417 IIGMIAQQNYNVAYDLVSK 435
IIG + QQ + YDL +K
Sbjct: 473 IIGNVQQQGTRITYDLANK 491
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 153/351 (43%), Gaps = 49/351 (13%)
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC------TNDCGGYPD 165
++DTGS + W++C PC QC + F P+ S TY LPC+S+ C ++ C
Sbjct: 4 LIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSC--LNS 61
Query: 166 ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGL 225
C Y + Y + ++G E + D + + FGC H N + G+ GL
Sbjct: 62 SCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGA-AGLMGL 120
Query: 226 G------PATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVID 279
G PA +S G FSYC+ +++ +L GE A+L+ D ++D
Sbjct: 121 GKSSIGFPAQTSV-----AFGKVFSYCLPSVSS-TIPSGILHFGEAAMLDYDVRFTPLVD 174
Query: 280 GS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLR 334
S Y+V++ GI++G+++L I A V +DSGT ++ SAY+ LR
Sbjct: 175 SSSGPSQYFVSMTGINVGDELLPI------------SATVMVDSGTVISRFEQSAYERLR 222
Query: 335 KEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQES 394
+ GL + + P + C+ + D+ P + HF A+L L + Y
Sbjct: 223 DAFTQILPGLQTAVSVAP-FDTCFRVSTVDDIN-IPLITLHFRDDAELRLSPVHILYPVD 280
Query: 395 SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
V C A PS S++G QQN YD+ +L +C
Sbjct: 281 DGVMCFAFAPSS------SGRSVLGNFQQQNLRFVYDIPKSRLGISAFECN 325
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 111/391 (28%), Positives = 175/391 (44%), Gaps = 57/391 (14%)
Query: 84 DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDP 140
DT+ L GI + Y+ ++ ++DTGS L WV+CQPC +C F+P
Sbjct: 50 DTQIPLTSGIRLQSLNYI-VTVELGGRKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNP 108
Query: 141 SKSLTYATLPCDSSYC---------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE 191
SKS +Y T+ C+S C + CG P C Y + Y +G + G +G E N
Sbjct: 109 SKSPSYRTVLCNSLTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNL- 167
Query: 192 TSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCI 247
G T + + FGC N F G GL + SL+ ++ G FSYC+
Sbjct: 168 ----GNTTVNNFIFGCGRKNQGL----FGGASGLVGLGRTDLSLISQISPMFGGVFSYCL 219
Query: 248 GNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS-------YYVTLEGISLGEKMLDIDP 300
A L++G + + ++TP+S Y++ L GI++G +++
Sbjct: 220 PTTE--AEASGSLVMGGNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGG--VEVQA 275
Query: 301 NLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---C 357
F K+ + IDSGT ++ L PS YQ L+ E F G YP P++ + C
Sbjct: 276 PSFGKDR------MIIDSGTVISRLPPSIYQALKAEFVKQFSG----YPSAPSFMILDSC 325
Query: 358 YSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDINGERFKDL 415
++ + ++++ P + +F G A+L +D VFY + +S CLA+ E +
Sbjct: 326 FNLSGYQEVK-IPDIKMYFEGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDE----V 380
Query: 416 SIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
IIG Q+N + YD L F C
Sbjct: 381 GIIGNYQQKNQRIIYDTKGSMLGFAEEACSF 411
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 110/393 (27%), Positives = 180/393 (45%), Gaps = 51/393 (12%)
Query: 86 RAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLT 145
+ H H +S V+ ++G PP VLDTGS L W++C Q TTFDP++S +
Sbjct: 76 KLHFHHNVSLT----VSLTVGTPPQNVSMVLDTGSELSWLRCNK-TQTFQTTFDPNRSSS 130
Query: 146 YATLPCDSSYCTNDCGGYP--------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK 197
Y+ +PC S CT+ +P C + Y + S+G + S+ F SD
Sbjct: 131 YSPVPCSSLTCTDRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNSDMPG 190
Query: 198 TFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYA 256
T FGC ++ + E+ + GL + S V ++ KFSYCI + ++
Sbjct: 191 TI-----FGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFPKFSYCISDSDF---- 241
Query: 257 YNMLILGEGAI----------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKK 305
+L+LG+ L STP+ D +Y V LEGI + K+L + ++F
Sbjct: 242 SGVLLLGDANFSWLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVP 301
Query: 306 NDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSG 360
+ T + +DSGT T+L+ Y LR E + +L P+Y LCY
Sbjct: 302 DHTGA-GQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRV 360
Query: 361 NINR-DLQGFPAMAFHFAGGADLVLDAESVFYQE------SSSVFCLAVGPSDINGERFK 413
+++ L P ++ F GA++ + + + Y+ S SV+C G SD+
Sbjct: 361 PLSQTSLPWLPTVSLMFR-GAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLA---V 416
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
+ +IG QQN + +DL ++ F ++ C+L
Sbjct: 417 EAYVIGHHHQQNVWMEFDLEKSRIGFAQVQCDL 449
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 157/363 (43%), Gaps = 46/363 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDSS 154
+ + G P Q + DTGS++ W++C+PC C FDP+ S TY + C S+
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNISCTSA 75
Query: 155 YCT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
CT C G C Y + Y +G + G + +E F + F+ FGC N
Sbjct: 76 ACTGLSSRGCSG--STCVYGVTYGDGSSTVGFLATETFTLAAGNVFNNFI----FGCGQN 129
Query: 211 NAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLN----YFEYAYNMLILGEG 265
N G+ GLG + S +S L +G+ FSYC+ + + Y + G
Sbjct: 130 NQGLFTGA-AGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYLNIGNPLRTPGYT 188
Query: 266 AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
A+L P Y++ L GIS+G L + +F+ G IDSGT +T L
Sbjct: 189 AMLTNSRAPT-----LYFIDLIGISVGGTRLALSSTVFQ------SVGTIIDSGTVITRL 237
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADL 382
P+AY LR F+ + Y A + CY + + FP + H+ G D+
Sbjct: 238 PPTAYGALRTA----FRAAMTQYTRAAAASILDTCYDFSRTTTVT-FPTIKLHYT-GLDV 291
Query: 383 VLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
+ VFY SSS CLA G SD + IIG + Q+ V YD K++ F
Sbjct: 292 TIPGAGVFYVISSSQVCLAFAGNSD-----STQIGIIGNVQQRTMEVTYDNALKRIGFAA 346
Query: 442 IDC 444
C
Sbjct: 347 GAC 349
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 157/360 (43%), Gaps = 29/360 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDS-- 153
++ IG P VLDTGS + W++C PC C A + FDP+ S +YAT+PCDS
Sbjct: 196 YFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSSSYATVPCDSPH 255
Query: 154 ------SYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
S C N+ C Y + Y +G + G +E +G ++DV GC
Sbjct: 256 CRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTL--GGDGSAAVHDVAIGC 313
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGA 266
H+N F G GL S ++ ++FSYC+ + + + + +
Sbjct: 314 GHDNEGL----FVGAAGLLALGGGPLSFPSQISATEFSYCLVDRDSPSASTLQFGASDSS 369
Query: 267 ILEGDSTPMSVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+ + YYV L GIS+ GE + DI P F ++ S GV +DSGT +T L
Sbjct: 370 TVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGS-GGVIVDSGTAVTRL 428
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
SAY LR Q LP + CY +Q PA++ F GG +L L
Sbjct: 429 QSSAYSALRDAFVRGTQA-LPRASGVSLFDTCYDLAGRSSVQ-VPAVSLRFEGGGELKLP 486
Query: 386 AESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
A++ + + +CLA + +SI+G + QQ V++D + F C
Sbjct: 487 AKNYLIPVDGAGTYCLAFAATG------GAVSIVGNVQQQGIRVSFDTAKNTVGFSPNKC 540
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 111/364 (30%), Positives = 159/364 (43%), Gaps = 54/364 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCT- 157
+ + IG P V Q ++DTGS + WV+C + G T FDPSKS TYA C S+ C
Sbjct: 129 YVITVGIGSPAVTQTMMIDTGSDVSWVRCNSTD--GLTLFDPSKSTTYAPFSCSSAACAQ 186
Query: 158 ---NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
N G C Y ++Y +G ++ GT S+ SD + D FGCSH+ F
Sbjct: 187 LGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLALSASDT----VTDFHFGCSHHEEDF 242
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
E+ G+ GLG SLV + G FSYC+ N L G G
Sbjct: 243 DGEKIDGLMGLG---GDAQSLVSQTAATYGKSFSYCLPPTNRTS---GFLTFGAPNGTSG 296
Query: 271 D--STPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+TPM + Y V L+ IS+G L I P++ G +DSGT +TWL
Sbjct: 297 GFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLSN-------GSVMDSGTVITWL 349
Query: 326 VPSAYQTL----RKEVEDL-FQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
AY L R + L Q P +D + ++G +N + PA++ GGA
Sbjct: 350 PRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYD--FTGLVNVSI---PAVSLVLDGGA 404
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ LD + Q+ CLA + + SIIG + Q+ + V +D+ F+
Sbjct: 405 VVDLDGNGIMIQD-----CLAFAATSGD-------SIIGNVQQRTFEVLHDVGQGVFGFR 452
Query: 441 RIDC 444
C
Sbjct: 453 SGAC 456
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 105/366 (28%), Positives = 167/366 (45%), Gaps = 46/366 (12%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSS 154
++ + +G PP +A +DTGS LIW +C PC C A FDPSKS T+ C
Sbjct: 60 IYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKEKRC--- 116
Query: 155 YCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
+ + C Y I Y + S G + +E +++ + + GC NN++
Sbjct: 117 --------HGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNNSNL 168
Query: 215 SDEQF----TGVFGLGPATSSTHSLVE-KVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
+ +G+ GL SS S ++ + SYC + + + G A++
Sbjct: 169 MTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYC-----FSSQGTSKINFGTNAVVA 223
Query: 270 GDSTPMSVI-----DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
GD T + + YY+ L+ +S+G+K ++ F D +FIDSGTT T+
Sbjct: 224 GDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQ----DGNIFIDSGTTYTY 279
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH--LCYSGNINRDLQGFPAMAFHFAGGADL 382
L P++Y L +E P DP+ LCY+ + ++ FP + HFAGGADL
Sbjct: 280 L-PTSYCNLVREAVAASVVAANQVP-DPSSENLLCYNWD---TMEIFPVITLHFAGGADL 334
Query: 383 VLDAESVFYQE-SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
VLD +++ + + FCLA+G D + +I G A N V YD + + F
Sbjct: 335 VLDKYNMYVETITGGTFCLAIGCVDPSMP-----AIFGNRAHNNLLVGYDSSTLVISFSP 389
Query: 442 IDCELL 447
+C L
Sbjct: 390 TNCSAL 395
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 109/373 (29%), Positives = 171/373 (45%), Gaps = 49/373 (13%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDC 160
V+ IG PP Q VLDTGS L W++C+ + T FDP S +++ LPC+ S C
Sbjct: 80 VSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLLSSSFSVLPCNHSLCKPRV 139
Query: 161 GGYP-----DE---CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
Y D+ C Y+ Y +G ++G + E+F F +S + GC+ ++
Sbjct: 140 PDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQTTPPLI----LGCATDS- 194
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI------------------GNLNYFE 254
SD Q G+ G+ S SL + SKFSYC+ N +
Sbjct: 195 --SDTQ--GILGMNLGRLSFSSLAKI--SKFSYCVPPRRSQSGSSPTGSFYLGPNPSSAG 248
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG- 313
+ Y L+ + + P+ +Y + + GI + K L+I + F+ + S AG
Sbjct: 249 FKYVNLMTYRQSQRMPNLDPL-----AYTLPMLGIRINGKKLNISTSAFRADP--SGAGQ 301
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLF-QGLLPSYPMDPAWHLCYSGNINRDLQGFPAM 372
IDSGT T+LV AY +++E+ L L Y + +C+ G+ + M
Sbjct: 302 TLIDSGTWFTFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGSLDMCFDGDAMVIGRMIGNM 361
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
AF F G ++V++ E + V CL +G SD+ G +IIG QQ+ V +DL
Sbjct: 362 AFEFENGVEIVVEREKMLADVGGGVQCLGIGRSDLLGVAS---NIIGNFHQQDLWVEFDL 418
Query: 433 VSKQLYFQRIDCE 445
V +++ F R DC
Sbjct: 419 VGRRVGFGRTDCS 431
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 136/460 (29%), Positives = 192/460 (41%), Gaps = 82/460 (17%)
Query: 35 PKRLVTKLLHRDSLLYNPNDTVDAQ---AQRTLNMSMARFIYLSQKSS--QKAHDTRAHL 89
P R L+HR P+ + A+R L AR Y+ K++ + A +
Sbjct: 14 PNRASVPLVHRHGPC-APSAASGGKPSLAER-LRRDRARTNYIVTKATGGRTAATALSDA 71
Query: 90 HPGISTVPVF----------YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC--EQCGATT 137
G +++P F V IG P V Q ++DTGS L WV+C+PC +C A
Sbjct: 72 AGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQK 131
Query: 138 ---FDPSKSLTYATLPCDSSY------------CTNDCGGYPDECWYNIRYTNGPDSQGT 182
FDPS S +YA++PCDS CT GG C Y I Y N + G
Sbjct: 132 DPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGV 191
Query: 183 IGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLV-EKVGS 241
+E + + D GFGC ++ H E+F G+ GLG A S S + G
Sbjct: 192 YSTETLTLKPG----VVVADFGFGCG-DHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGG 246
Query: 242 KFSYCIGNLNYFEYAYNMLILG------EGAILEGDS-TPMSVIDGS---YYVTLEGISL 291
FSYC L L LG G S TPM + Y VTL GIS+
Sbjct: 247 PFSYC---LPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISV 303
Query: 292 GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD 351
G L I P+ F +G+ IDSGT +T L +AY LR F+ + Y +
Sbjct: 304 GGAPLAIPPSAFS-------SGMVIDSGTVITGLPATAYAALRSA----FRSAMSEYRLL 352
Query: 352 P-----AWHLCY--SGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGP 404
P CY +G+ N + P ++ F+GGA + L A + + CLA
Sbjct: 353 PPSNGGVLDTCYDFTGHANVTV---PTISLTFSGGATIDLAAPAGVLVDG----CLAFAG 405
Query: 405 SDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + + IIG + Q+ + V YD + F+ C
Sbjct: 406 AGTD----NAIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 441
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 114/363 (31%), Positives = 159/363 (43%), Gaps = 43/363 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS WV+C+PC +Q G FDP+KS TYA + C
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGP-LFDPAKSSTYANVSCTD 221
Query: 154 SYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
S C TN C G C Y ++Y +G + G + D K F FGC
Sbjct: 222 SACADLDTNGCTG--GHCLYAVQYGDGSYTVGFFAQDTLTIA-HDAIKGFR----FGCGE 274
Query: 210 -NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
NN F + G+ GLG TS T K G F+YC+ L L G G+
Sbjct: 275 KNNGLFG--KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALT---TGTGYLDFGPGSA 329
Query: 268 LEGDS-TPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
TPM G YYV + GI +G + + + ++F S AG +DSGT +T
Sbjct: 330 GNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVF------STAGTLVDSGTVITR 383
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGAD 381
L +AY L + + L Y P + + CY D++ P ++ F GGA
Sbjct: 384 LPATAYTALSSAFDKVM--LARGYKKAPGYSILDTCYDFTGLSDVE-LPTVSLVFQGGAC 440
Query: 382 LVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
L +D + Y S + CLA NG+ + ++I+G Q+ Y V YDL K + F
Sbjct: 441 LDVDVSGIVYAISEAQVCLAFAS---NGDD-ESVAIVGNTQQKTYGVLYDLGKKTVGFAP 496
Query: 442 IDC 444
C
Sbjct: 497 GSC 499
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 134/428 (31%), Positives = 189/428 (44%), Gaps = 58/428 (13%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQKAH--DTRAHLHPGISTVPV----------FYVNF 103
VDA A T ++R + ++SS + + A L PG + + +
Sbjct: 38 VDADAGYTEEQLLSRAL---RRSSARVATLQSLAALAPGDAITAARILVLASDGEYLMEM 94
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTND- 159
IG P A+LDTGS LIW +C PC C FDP++S TY +L C S C
Sbjct: 95 GIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALY 154
Query: 160 ---CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD 216
C Y C Y Y + + G + +E F F T +E + L + FGC + NA S
Sbjct: 155 YPLC--YQKVCVYQYFYGDSASTAGVLANETFTFGT-NETRVSLPGISFGCGNLNAG-SL 210
Query: 217 EQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAILEG----- 270
+G+ G G + SLV ++GS +FSYC+ ++ + L G A L
Sbjct: 211 ANGSGMVGFG---RGSLSLVSQLGSPRFSYCL--TSFLSPVPSRLYFGVYATLNSTNASS 265
Query: 271 ---DSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
STP V + Y++ + GIS+G +L IDP +F NDT G IDSGTT+T+
Sbjct: 266 EPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITY 325
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL--CYS-GNINRDLQGFPAMAFHFAGGAD 381
L AY +R Q LP + A L C+ R P + HF GAD
Sbjct: 326 LAEPAYDAVRAAFAS--QITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHF-DGAD 382
Query: 382 LVLDAESVFYQESSS--VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
L ++ + S+ CLA+ D SIIG QN+NV YDL + + F
Sbjct: 383 WELPLQNYMLVDPSTGGGLCLAMA-------SSSDGSIIGSYQHQNFNVLYDLENSLMSF 435
Query: 440 QRIDCELL 447
C L+
Sbjct: 436 VPAPCHLM 443
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 113/387 (29%), Positives = 163/387 (42%), Gaps = 63/387 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFDPSKSLTYATLPCDS 153
+ V IG PP + DTGS L WV+C PC FDPSKS TY +PC +
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSA 181
Query: 154 SYC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C CG C Y+++Y + ++ G++ E F V FGC
Sbjct: 182 PECHIGGVQQTRCGA--TSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVFGC 239
Query: 208 SHNN-AHFSDEQF--TGVFGLGPATSS----THSLVEKVGSKFSYCIGNLNYFEYAYNML 260
SH + F+D G+ GLG SS T + G FSYC L + L
Sbjct: 240 SHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYC---LPPRGSSTGYL 296
Query: 261 ILGEGAILEGDS----------TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
+G GA T +S + +Y V L G+S+ +DI + F
Sbjct: 297 TIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFS------ 350
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL-----CYSGNINRD 365
G IDSGT +T + +AY LR E F+ + SY M P + CY +D
Sbjct: 351 -LGAVIDSGTVVTHMPAAAYYPLRDE----FRLHMGSYKMLPEGSMKLLDTCYD-VTGQD 404
Query: 366 LQGFPAMAFHFAGGADLVLDAESVFY--------QESSSVFCLAVGPSDINGERFKDLSI 417
+ P +A F GGA + +DA + +S ++ CLA P++ G L I
Sbjct: 405 VVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAG-----LVI 459
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+G + Q+ YNV +D+ ++ F C
Sbjct: 460 VGNMQQRAYNVVFDVDGGRIGFGPNGC 486
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 107/378 (28%), Positives = 169/378 (44%), Gaps = 48/378 (12%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDC 160
V+ ++G PP VLDTGS L W+ C+ + F+P S +Y+ +PC S C
Sbjct: 42 VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNL-TSVFNPLSSSSYSPIPCSSPVCRTRT 100
Query: 161 GGYPDE--------CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
P+ C + Y + +G + S+ F G + L FGC +
Sbjct: 101 RDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI-----GSSALPGTLFGCMDSGF 155
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAI---- 267
+ E+ GL + S V ++G KFSYCI + + +L+ G+ +
Sbjct: 156 SSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRD----SSGVLLFGDSHLSWLG 211
Query: 268 ------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
L STP+ D +Y V L+GI +G K+L + ++F + T + +DSGT
Sbjct: 212 NLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQ-TMVDSGT 270
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
T+L+ Y LR E + +G+L P++ A LCY L PA++
Sbjct: 271 QFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLM 330
Query: 376 FAGGADLVLDAESVFYQESSS------VFCLAVGPSDING-ERFKDLSIIGMIAQQNYNV 428
F GA++V+ E + Y+ V+CL G SD+ G E F +IG QQN +
Sbjct: 331 FR-GAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAF----VIGHHHQQNVWM 385
Query: 429 AYDLVSKQLYFQRIDCEL 446
+DLV ++ F C+L
Sbjct: 386 EFDLVKSRVGFVETRCDL 403
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 156/374 (41%), Gaps = 45/374 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFDPSKSLTYATLPCDS 153
+ V+ +G P V DTGS L WV+C PC G F PS S T++ + C +
Sbjct: 154 YVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGA 213
Query: 154 SYC--TNDCGGYP--DECWYNIRYTNGPDSQGTIGSEQFNFET------SDEGKTFLYDV 203
C CGG P D C Y + Y + +QG +G++ T S E L
Sbjct: 214 RECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGF 273
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE-KVGSKFSYCIGNLNYFEYAYNMLIL 262
FGC NN Q G+FGLG S S K G FSYC+ + + Y L
Sbjct: 274 VFGCGENNTGLFG-QADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAPGYLSLGT 332
Query: 263 GEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDI-DPNLFKKNDTWSDAGVFIDS 318
A TPM + YYV L GI + + + + P + + +DS
Sbjct: 333 PVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALP--------LIVDS 384
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL-----CY--SGNINRDLQGFPA 371
GT +T L P AY+ LR F + Y A L CY + + N + PA
Sbjct: 385 GTVITRLAPRAYRALRAA----FLSAMGKYGYKRAPRLSILDTCYDFTAHANATVS-IPA 439
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+A FAGGA + +D V Y + CLA P NG+ + I+G Q+ V YD
Sbjct: 440 VALVFAGGATISVDFSGVLYVAKVAQACLAFAP---NGD-GRSAGILGNTQQRTLAVVYD 495
Query: 432 LVSKQLYFQRIDCE 445
+ +++ F C
Sbjct: 496 VARQKIGFAAKGCS 509
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 109/377 (28%), Positives = 163/377 (43%), Gaps = 58/377 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP------CEQCGATTFDPSKSLTYATLPCD 152
+ + ++G PP LA+ DTGS L+WVKC+ T FDPS+S TY + C
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQ 160
Query: 153 SSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT----FLYD 202
+ C T D G C Y Y +G ++ G + +E F F+ G++ +
Sbjct: 161 TDACEALGRATCDDG---SNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGRSPRQVRVGG 217
Query: 203 VGFGCSHNNA---------HFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI--GNLN 251
V FGCS A + V LG ATS +G +FSYC+ ++N
Sbjct: 218 VKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATS--------LGRRFSYCLVPHSVN 269
Query: 252 YFEYAYNMLILGEGAILEGDSTPMSV--IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
A N L + STP+ +D Y V L+ + +G K + +
Sbjct: 270 A-SSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTV----------ASA 318
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQ 367
+ + + +DSGTTLT+L PS + E+ L P D LCY +G +
Sbjct: 319 ASSRIIVDSGTTLTFLDPSLLGPIVDELSRRIT-LPPVQSPDGLLQLCYNVAGREVEAGE 377
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P + F GGA + L E+ F CLA+ + + +SI+G +AQQN +
Sbjct: 378 SIPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAI----VATTEQQPVSILGNLAQQNIH 433
Query: 428 VAYDLVSKQLYFQRIDC 444
V YDL + + F DC
Sbjct: 434 VGYDLDAGTVTFAGADC 450
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 105/376 (27%), Positives = 174/376 (46%), Gaps = 50/376 (13%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQ-------PCEQCGATTFDPSKSLTYATLPCDS 153
+ IG PP P+ ++DTGS LIW +C+ +DP +S T+A LPC
Sbjct: 93 LTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPCSD 152
Query: 154 SYCT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C +C + C Y Y + + G + SE F F + +GFGC
Sbjct: 153 RLCQEGQFSFKNCTSK-NRCVYEDVYGSAA-AVGVLASETFTFGAR---RAVSLRLGFGC 207
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGA 266
+A S TG+ GL P + SL+ ++ +FSYC+ + + + L+ G A
Sbjct: 208 GALSAG-SLIGATGILGLSP---ESLSLITQLKIQRFSYCL--TPFADKKTSPLLFGAMA 261
Query: 267 ILEGDSTPMSVIDGS----------YYVTLEGISLGEKMLDI-DPNLFKKNDTWSDAGVF 315
L T + + YYV L GISLG K L + +L + D G
Sbjct: 262 DLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPD--GGGGTI 319
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY-----SGNINRDLQGFP 370
+DSG+T+ +LV +A++ +++ V D+ + + + ++ + LC+ + + P
Sbjct: 320 VDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVED-YELCFVLPRRTAAAAMEAVQVP 378
Query: 371 AMAFHFAGGADLVLDAESVFYQESSSVFCLAVGP-SDINGERFKDLSIIGMIAQQNYNVA 429
+ HF GGA +VL ++ F + + + CLAVG +D +G +SIIG + QQN +V
Sbjct: 379 PLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSG-----VSIIGNVQQQNMHVL 433
Query: 430 YDLVSKQLYFQRIDCE 445
+D+ + F C+
Sbjct: 434 FDVQHHKFSFAPTQCD 449
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 109/384 (28%), Positives = 164/384 (42%), Gaps = 52/384 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSY 155
+YV +G P V + ++DTGS + W++C PC+ C F+P S ++ LPC SS
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 197
Query: 156 CTND-------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK---TFLYDVGF 205
CTN C C ++I+Y +G S G + E T + G L ++
Sbjct: 198 CTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITL 257
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
GC+ + +G+ G+ S S L + KFS+C + + ++ GE
Sbjct: 258 GCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGLVFFGE 317
Query: 265 GAIL----------EGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
I+ + + P + +D YYV L GIS+ E L + F + G
Sbjct: 318 SDIISPYLRYTPLVQNPAVPSASLD-YYYVGLVGISVDESRLPLSHKNFDIDKVTGSGGT 376
Query: 315 FIDSGTTLTWLVPSAYQTLRKE----------VEDLFQGLLPSYPMDPAWHLCYSGNINR 364
IDSGT T+L A+Q +R+E V+D G P Y + SG
Sbjct: 377 IIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDD-NSGFTPCYNIT-------SGTAAL 428
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESSS----VFCLAVGPSDINGERFKDLSIIGM 420
+ P++ HF GG D+VL S+ SSS CLA ++G+ +IIG
Sbjct: 429 ESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAF---QMSGD--IPFNIIGN 483
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDC 444
QQN V YDL +L C
Sbjct: 484 YQQQNLWVEYDLEKLRLGIAPAQC 507
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 102/415 (24%), Positives = 176/415 (42%), Gaps = 45/415 (10%)
Query: 42 LLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV 98
L+HRDS ++P+ T + S++R + + ++ + ++ + P
Sbjct: 36 LIHRDSPHSPFFDPSKTQAERLTDAFRRSVSR-VGRFRPTAMTSDGIQSRIVPSAGE--- 91
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ +N IG PPVP +A++DTGS L W +C+PC C FDP S TY C +S+
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSF 151
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + +C + Y +G + G + SE +++ FGC H++
Sbjct: 152 CLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSS 211
Query: 212 AHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
D+ +G+ GLG S S L + FSYC+ +
Sbjct: 212 GGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCL------------------LPVST 253
Query: 271 DSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAY 330
DS+ S I+ + G L + + K + + +DSGTT T+L Y
Sbjct: 254 DSSISSRINFGASGRVSGYGTVSTPLRLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQEFY 313
Query: 331 QTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVF 390
L K V + +G P + + LCY N ++ P + HF A++ L + F
Sbjct: 314 SKLEKSVANSIKGKRVRDP-NGIFSLCY--NTTAEINA-PIITAHFK-DANVELQPLNTF 368
Query: 391 YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ + C V P+ D+ ++G +AQ N+ V +DL K+ + ++ + E
Sbjct: 369 MRMQEDLVCFTVAPTS-------DIGVLGNLAQVNFLVGFDLRKKRGFSKKAEVE 416
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 64/144 (44%), Gaps = 11/144 (7%)
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNI 362
F K + + +DSGTT T+L Y L + V +G P + LCY N
Sbjct: 409 FSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDP-NGISSLCY--NT 465
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIA 422
D P + HF A++ L + F + + C V P+ D+ I+G +A
Sbjct: 466 TVDQIDAPIITAHFKD-ANVELQPWNTFLRMQEDLVCFTVLPTS-------DIGILGNLA 517
Query: 423 QQNYNVAYDLVSKQLYFQRIDCEL 446
Q N+ V +DL K++ F+ DC L
Sbjct: 518 QVNFLVGFDLRKKRVSFKAADCTL 541
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 133/428 (31%), Positives = 188/428 (43%), Gaps = 58/428 (13%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQKAH--DTRAHLHPGISTVPV----------FYVNF 103
VDA A T ++R + ++SS + + A L PG + + +
Sbjct: 38 VDADAGYTEEQLLSRAL---RRSSARVATLQSLAALAPGDAITAARILVLASDGEYLMEM 94
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTND- 159
IG P A+LDTGS LIW +C PC C FDP++S TY +L C S C
Sbjct: 95 GIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALY 154
Query: 160 ---CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD 216
C Y C Y Y + + G + +E F F T +E + L + FGC + NA
Sbjct: 155 YPLC--YQKVCVYQYFYGDSASTAGVLANETFTFGT-NETRVSLPGISFGCGNLNAGLLA 211
Query: 217 EQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAILEG----- 270
+G+ G G + SLV ++GS +FSYC+ ++ + L G A L
Sbjct: 212 NG-SGMVGFG---RGSLSLVSQLGSPRFSYCL--TSFLSPVPSRLYFGVYATLNSTNASS 265
Query: 271 ---DSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
STP V + Y++ + GIS+G +L IDP +F NDT G IDSGTT+T+
Sbjct: 266 EPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITY 325
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL--CYS-GNINRDLQGFPAMAFHFAGGAD 381
L AY +R Q LP + A L C+ R P + HF GAD
Sbjct: 326 LAEPAYDAVRAAFAS--QITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHF-DGAD 382
Query: 382 LVLDAESVFYQESSS--VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
L ++ + S+ CLA+ D SIIG QN+NV YDL + + F
Sbjct: 383 WELPLQNYMLVDPSTGGGLCLAMA-------SSSDGSIIGSYQHQNFNVLYDLENSLMSF 435
Query: 440 QRIDCELL 447
C L+
Sbjct: 436 VPAPCHLM 443
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 108/366 (29%), Positives = 166/366 (45%), Gaps = 48/366 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G P VLDTGS + W++CQPC C T FDP+ S TYA + C S
Sbjct: 20 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQ 79
Query: 156 CT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C+ + C +C Y + Y +G + G +E +F S K +V GC H+N
Sbjct: 80 CSSLEMSSC--RSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVK----NVALGCGHDN 133
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYA---YNMLILGEGAI 267
F G GL SL ++ + FSYC+ N + + +N LG ++
Sbjct: 134 EGL----FVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFNSAQLGVDSV 189
Query: 268 LEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
+ P+ ID YYV L G+S+G +M+ I + F+ +++ + G+ +D GT +T
Sbjct: 190 ----TAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDES-GNGGIIVDCGTAITR 244
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG-----FPAMAFHFAGG 379
L AY LR + Q L + + + CY DL G P ++FHFA G
Sbjct: 245 LQTQAYNPLRDAFVRMTQNLKLTSAV-ALFDTCY------DLSGQASVRVPTVSFHFADG 297
Query: 380 ADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
L A + +S+ +C A P+ LSIIG + QQ V +DL + ++
Sbjct: 298 KSWNLPAANYLIPVDSAGTYCFAFAPTT------SSLSIIGNVQQQGTRVTFDLANNRMG 351
Query: 439 FQRIDC 444
F C
Sbjct: 352 FSPNKC 357
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 132/457 (28%), Positives = 191/457 (41%), Gaps = 76/457 (16%)
Query: 35 PKRLVTKLLHRDSLLYNPNDTVDAQ---AQRTLNMSMARFIYLSQKSS--QKAHDTRAHL 89
P R L+HR P+ + A+R L AR Y+ K++ + A +
Sbjct: 94 PNRASVPLVHRHGPC-APSAASGGKPSLAER-LRRDRARTNYIVTKATGGRTAATALSDA 151
Query: 90 HPGISTVPVF----------YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC--EQCGATT 137
G +++P F V IG P V Q ++DTGS L WV+C+PC +C A
Sbjct: 152 AGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQK 211
Query: 138 ---FDPSKSLTYATLPCDSSY------------CTNDCGGYPDECWYNIRYTNGPDSQGT 182
FDPS S +YA++PCDS CT GG C Y I Y N + G
Sbjct: 212 DPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGV 271
Query: 183 IGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLV-EKVGS 241
+E + + D GFGC ++ H E+F G+ GLG A S S + G
Sbjct: 272 YSTETLTLKPG----VVVADFGFGCG-DHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGG 326
Query: 242 KFSYCI----GNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS---YYVTLEGISLGEK 294
FSYC+ G + A TPM + Y VTL GIS+G
Sbjct: 327 PFSYCLPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGA 386
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
L I P+ F +G+ IDSGT +T L +AY LR F+ + Y + P
Sbjct: 387 PLAIPPSAFS-------SGMVIDSGTVITGLPATAYAALRSA----FRSAMSEYRLLPPS 435
Query: 355 H-----LCY--SGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDI 407
+ CY +G+ N + P ++ F+GGA + L A + + CLA +
Sbjct: 436 NGGVLDTCYDFTGHANVTV---PTISLTFSGGATIDLAAPAGVLVDG----CLAFAGAGT 488
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + IIG + Q+ + V YD + F+ C
Sbjct: 489 D----NAIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 521
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 159/371 (42%), Gaps = 58/371 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC---- 151
+ V+ +G P + DTGS L WV+C+PC C FDPS S TYA + C
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 152 ----DSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
D+S C++D C Y ++Y + + G + + SD F+ FGC
Sbjct: 209 CQELDASGCSSD-----SRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFV----FGC 259
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSL-VEKVGSKFSYCIGNLNYFEYAYNMLILGEGA 266
NA Q G+FGLG S S G F+YC+ + + G G
Sbjct: 260 GDQNAGLFG-QVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSS----------GRGY 308
Query: 267 ILEGDSTPM-----SVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
+ G + P ++ DG+ YY+ L GI +G + + I + G ID
Sbjct: 309 LSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRI-----PATAFAAAGGTVID 363
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAF 374
SGT +T L P AY LR F + Y PA + CY +R Q P +
Sbjct: 364 SGTVITRLPPRAYAPLRAA----FARSMAQYKKAPALSILDTCYDFTGHRTAQ-IPTVEL 418
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
FAGGA + LD V Y S CLA P+ + ++I+G Q+ + VAYD+ +
Sbjct: 419 AFAGGATVSLDFTGVLYVSKVSQACLAFAPNADD----SSIAILGNTQQKTFAVAYDVAN 474
Query: 435 KQLYFQRIDCE 445
+++ F C
Sbjct: 475 QRIGFGAKGCS 485
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 160/374 (42%), Gaps = 38/374 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ ++ +G PP ++DTGS L W++C PC C FDP+ S +Y L C
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPR 205
Query: 156 CTNDCGGY-----------PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD-V 203
C + D C Y Y + +S G + E F + G + D V
Sbjct: 206 CGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRVDGV 265
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK-FSYCIGNLNYFEYAYNMLIL 262
FGC H N + S L G FSYC+ +++ + ++
Sbjct: 266 VFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCL--VDHGSDVASKVVF 323
Query: 263 GEGAILEGDSTPM----------SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
GE L + P S D YYV L G+ +G ++L+I + + ++ S
Sbjct: 324 GEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTWDASEGGS-G 382
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNINRDLQGFPA 371
G IDSGTTL++ V AYQ +R+ D G P P P CY+ + R P
Sbjct: 383 GTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSGVER--PEVPE 440
Query: 372 MAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
++ FA GA AE+ F + + + CLAV + G +SIIG QQN++VAY
Sbjct: 441 LSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTG-----MSIIGNFQQQNFHVAY 495
Query: 431 DLVSKQLYFQRIDC 444
DL + +L F C
Sbjct: 496 DLHNNRLGFAPRRC 509
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 163/385 (42%), Gaps = 54/385 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSY 155
+YV +G P V + ++DTGS + W++C PC+ C F+P S ++ LPC SS
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 198
Query: 156 CTND-------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK---TFLYDVGF 205
CTN C C ++I+Y +G S G + E T + G L ++
Sbjct: 199 CTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITL 258
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
GC+ + +G+ G+ S S L + KFS+C + + ++ GE
Sbjct: 259 GCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGLVFFGE 318
Query: 265 GAIL----------EGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
I+ + + P + +D YYV L GIS+ E L + F + G
Sbjct: 319 SDIISPYLRYTPLVQNPAVPSASLD-YYYVGLVGISVDESRLPLSHKNFDIDKVTGSGGT 377
Query: 315 FIDSGTTLTWLVPSAYQTLRKE----------VEDLFQGLLPSYPMDPAWHLCYSGNINR 364
IDSGT T+L A+Q +R+E V+D G P Y + SG
Sbjct: 378 IIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDD-NSGFTPCYNIT-------SGTAAL 429
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESSS----VFCLA-VGPSDINGERFKDLSIIG 419
+ P++ HF GG D+VL S+ SSS CLA + DI +IIG
Sbjct: 430 ESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDI------PFNIIG 483
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDC 444
QQN V YDL +L C
Sbjct: 484 NYQQQNLWVEYDLEKLRLGIAPAQC 508
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 116/379 (30%), Positives = 174/379 (45%), Gaps = 52/379 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V+ +IG PP P LDTGS LIW +CQPC C FDPS S T + CDS+
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 94
Query: 156 CTN----DCGG---YPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C CG +P++ C Y Y + + G + ++F F + + V FGC
Sbjct: 95 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---VPGVAFGC 151
Query: 208 S-HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLN-------YFEYAYNM 259
NN F + TG+ G G S S + KVG+ FS+C + + ++
Sbjct: 152 GLFNNGVFKSNE-TGIAGFGRGPLSLPSQL-KVGN-FSHCFTTITGAIPSTVLLDLPADL 208
Query: 260 LILGEGAILEGDSTPMSVIDGS------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
G+GA+ +TP+ + YY++L+GI++G L + + F T G
Sbjct: 209 FSNGQGAV---QTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL--TNGTGG 263
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSGNINRDLQGFPAM 372
IDSGT++T L P YQ +R E + LP P + H C+S ++ P +
Sbjct: 264 TIIDSGTSITSLPPQVYQVVRDEFAAQIK--LPVVPGNATGHYTCFSAP-SQAKPDVPKL 320
Query: 373 AFHFAGGADLVLDAESVFYQ----ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
HF GA + L E+ ++ +S+ CLA+ D + +IIG QQN +V
Sbjct: 321 VLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGD-------ETTIIGNFQQQNMHV 372
Query: 429 AYDLVSKQLYFQRIDCELL 447
YDL + L F C+ L
Sbjct: 373 LYDLQNNMLSFVAAQCDKL 391
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 109/372 (29%), Positives = 164/372 (44%), Gaps = 57/372 (15%)
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCT--- 157
S G P ++DTGS L WV+C+PC C A FDP+ S TYA + C++S C
Sbjct: 195 SSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACAASL 254
Query: 158 -------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
CGG + C+Y + Y +G S+G + ++ G L FGC +
Sbjct: 255 KAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVAL-----GGASLDGFVFGCGLS 309
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGEGA 266
N F G GL + SLV + G FSYC+ + A L LG A
Sbjct: 310 NRGL----FGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGD-ASGSLSLGGDA 364
Query: 267 ILEGDSTPMS----VIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
++TP++ + D + Y++ + G ++G L L N V IDS
Sbjct: 365 SSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ-GLGASN-------VLIDS 416
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFH 375
GT +T L PS Y+ +R E F YP P + + CY + +++ P +
Sbjct: 417 GTVITRLAPSVYRGVRAEFTRQFAA--AGYPTAPGFSILDTCYDLTGHDEVK-VPLLTLR 473
Query: 376 FAGGADLVLDAESVFY--QESSSVFCLAVGPSDINGERFKDLS-IIGMIAQQNYNVAYDL 432
GGA++ +DA + + ++ S CLA+ ++D + IIG Q+N V YD
Sbjct: 474 LEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLS-----YEDQTPIIGNYQQKNKRVVYDT 528
Query: 433 VSKQLYFQRIDC 444
V +L F DC
Sbjct: 529 VGSRLGFADEDC 540
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 122/428 (28%), Positives = 193/428 (45%), Gaps = 61/428 (14%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQK------AHDTRAHLHPGISTVPVFYVNFSIGQPP 109
VD + T + R + +S++ Q+ D A +H + ++ IG PP
Sbjct: 40 VDDKGGYTTEERVLRAVAVSRQQQQQRLMAGAEDDVSAQVHRATRQ---YIASYLIGSPP 96
Query: 110 VPQLAVLDTGSSLIWVKC------QPCEQCGATTFDPSKSLTYATLPC--DSSYCTND-- 159
A++DTGS LIW +C + C + G ++ S+S T+ +PC + +C +
Sbjct: 97 QRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVPVPCADKAGFCAANGV 156
Query: 160 --CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH----NNAH 213
C G C + Y G G++G+E F FE+ G T L FGC +
Sbjct: 157 HLC-GLDGSCTFIASYGAG-RVIGSLGTESFAFES---GTTSL---AFGCVSLTRITSGA 208
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAILEGDS 272
+D +G+ GLG SLV ++G+ +FSYC+ + A + L +G A L G
Sbjct: 209 LNDA--SGLIGLG---RGRLSLVSQIGATRFSYCLTPYFHSSGASSHLFVGASASLGGGG 263
Query: 273 TPMSVIDGS--------YYVTLEGISLGEKML-DIDPNLFKKNDTWSD---AGVFIDSGT 320
M + YY+ LEGI++G+ L ++ F+ + GV ID+G+
Sbjct: 264 ASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQLRQLFKGYWAGGVIIDTGS 323
Query: 321 TLTWLVPSAYQTLRKEV-EDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
LT L AY+ L++EV L G L P D LC + + + PA+ FHF GG
Sbjct: 324 PLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELCVAREGFQKV--VPALVFHFGGG 381
Query: 380 ADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
AD+ + A S + + C+ + + SIIG QQ+ ++ YDL + F
Sbjct: 382 ADMAVPAASYWAPVDKAAACMMILEGGYD-------SIIGNFQQQDMHLLYDLRRGRFSF 434
Query: 440 QRIDCELL 447
Q DC +L
Sbjct: 435 QTADCTML 442
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 168/370 (45%), Gaps = 55/370 (14%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSS 154
++ + +G PP +A +DTGS +IW +C PC C A FDPSKS T+ C+
Sbjct: 420 IYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCNG- 478
Query: 155 YCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
+ C Y I Y + S+G + +E ++ + + GC +N +
Sbjct: 479 ----------NSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDNTNL 528
Query: 215 SDEQF----TGVFGL--GPATSSTHSLVEKVGSKF----SYCIGNLNYFEYAYNMLILGE 264
F +G+ GL GP SL+ ++ + SYC + + G
Sbjct: 529 QYSGFASSSSGIVGLNMGPL-----SLISQMDLPYPGLISYCFSG-----QGTSKINFGT 578
Query: 265 GAILEGDSTPMSVI-----DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
AI+ GD T + + + YY+ L+ +S+ + ++ F D +FIDSG
Sbjct: 579 NAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHA----EDGNIFIDSG 634
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGL-LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAG 378
TTLT+ S +R+ VE + + +P D LCY + + FP + HF+G
Sbjct: 635 TTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNL--LCYYSDT---IDIFPVITMHFSG 689
Query: 379 GADLVLDAESVFYQE-SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GADLVLD +++ + + +FCLA+G +D + ++ G AQ N+ V YD S +
Sbjct: 690 GADLVLDKYNMYLETITGGIFCLAIGCNDPSMP-----AVFGNRAQNNFLVGYDPSSNVI 744
Query: 438 YFQRIDCELL 447
F +C L
Sbjct: 745 SFSPTNCSAL 754
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 161/356 (45%), Gaps = 59/356 (16%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSS 154
++ + +G PP A +DTGS LIW +C PC C + FDPSKS T+ C
Sbjct: 81 IYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQRC--- 137
Query: 155 YCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
+ C Y I Y + S+G + +E ++ + + GC +N
Sbjct: 138 --------HGKSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNTDL 189
Query: 215 SDEQF----TGVFGL--GPATSSTHSLVEKVGSKF----SYCIGNLNYFEYAYNMLILGE 264
+ F +G+ GL GP SL+ ++ + SYC + + G
Sbjct: 190 DNSGFASSSSGIVGLNMGP-----RSLISQMDLPYPGLISYCFSG-----QGTSKINFGT 239
Query: 265 GAILEGDSTPMSVI-----DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
AI+ GD T + + + YY+ L+ +S+ + ++ F D + IDSG
Sbjct: 240 NAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHA----EDGNIVIDSG 295
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGL-LPSYPMDPAWH--LCYSGNINRDLQGFPAMAFHF 376
+T+T+ S +RK VE + + +P DP+ + LCY + + FP + HF
Sbjct: 296 STVTYFPVSYCNLVRKAVEQVVTAVRVP----DPSGNDMLCY---FSETIDIFPVITMHF 348
Query: 377 AGGADLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+GGADLVLD +++ + +S +FCLA+ + E +I G AQ N+ V YD
Sbjct: 349 SGGADLVLDKYNMYMESNSGGLFCLAIICNSPTQE-----AIFGNRAQNNFLVGYD 399
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 109/365 (29%), Positives = 156/365 (42%), Gaps = 54/365 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS WV+CQPC EQ FDP +S TYA + C +
Sbjct: 178 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQ-QEKLFDPVRSSTYANVSCAA 236
Query: 154 SYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C++ C G C Y ++Y +G S G + + D K F FGC
Sbjct: 237 PACSDLNIHGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFR----FGCGE 290
Query: 210 NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGE 264
N E G+ GLG TS +K G F++C+ Y ++
Sbjct: 291 RNEGLFGEA-AGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPAAAS 349
Query: 265 GAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+ +TPM +G YY+ + GI +G ++L I ++F + AG +DSGT +
Sbjct: 350 ARL----TTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVF------ATAGTIVDSGTVI 399
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAF 374
T L P AY +LR Y PA L CY D G P ++
Sbjct: 400 TRLPPPAYSSLRYAFAAAMA--ARGYKKAPAVSLLDTCY------DFTGMSQVAIPTVSL 451
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F GGA L +DA + Y S+S CLA ++ G D+ I+G + + VAYD+
Sbjct: 452 LFQGGARLDVDASGIMYAASASQVCLAFAANEDGG----DVGIVGNTQLKTFGVAYDIGK 507
Query: 435 KQLYF 439
K + F
Sbjct: 508 KVVGF 512
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 150/363 (41%), Gaps = 35/363 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ + SIG PP + DTGS L W C PC C FDP KS TY + CDS
Sbjct: 72 YLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKL 131
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C T C C Y Y + ++G + E ++ L + FGC HNN
Sbjct: 132 CHKLDTGVCSPQ-KRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGHNN 190
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGS-----KFSYCIGNLNYFEYAYNMLILGEGA 266
++ G+ GLG SL+ ++GS +FS C+ + + + G+G+
Sbjct: 191 TGGFNDHEMGIIGLG---GGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGS 247
Query: 267 ILEGD---STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
+ G STP+ Y+VTL GIS+ L + + +F+DSGT
Sbjct: 248 KVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFN----GSSQNVEKGNMFLDSGTP 303
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGAD 381
T L Y + +V + D LCY +L+G P + HF GAD
Sbjct: 304 PTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCY--RTKNNLRG-PVLTAHFE-GAD 359
Query: 382 LVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
+ L F VFCL + +G + G AQ NY + +DL + + F+
Sbjct: 360 VKLSPTQTFISPKDGVFCLGFTNTSSDG------GVYGNFAQSNYLIGFDLDRQVVSFKP 413
Query: 442 IDC 444
DC
Sbjct: 414 KDC 416
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 112/396 (28%), Positives = 164/396 (41%), Gaps = 77/396 (19%)
Query: 94 STVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTY 146
S VPV + V+ +G PP ++DTGS L W++C PC C FDP+ S++Y
Sbjct: 140 SGVPVGSGEYLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISY 199
Query: 147 ATLPCDSSYCT----------NDCGG-YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDE 195
+ C C +C D C Y Y + ++ G + E F +
Sbjct: 200 RNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQS 259
Query: 196 GKTFLYDVGFGCSHNNAHFSD----------------EQFTGVFGLGPATSSTHSLVEKV 239
G + V FGC H N Q GV+G
Sbjct: 260 GTRRVDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYG--------------- 304
Query: 240 GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS-------TPMSVIDGSYYVTLEGISLG 292
G FSYC+ + + A + +I G L P + D YY+ L+ I +G
Sbjct: 305 GHAFSYCL--VEHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVG 362
Query: 293 EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD- 351
+ ++I +DT S G IDSGTTL++ AYQ +R+ D PSYP+
Sbjct: 363 GEAVNI------SSDTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMS---PSYPLIL 413
Query: 352 --PAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDIN 408
P CY+ + ++ P ++ FA GA AE+ F + E + CLAV + +
Sbjct: 414 GFPVLSPCYNVSGAEKVE-VPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRS 472
Query: 409 GERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
G +SIIG QQN++V YDL +L F C
Sbjct: 473 G-----MSIIGNYQQQNFHVLYDLEHNRLGFAPRRC 503
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 126/463 (27%), Positives = 196/463 (42%), Gaps = 66/463 (14%)
Query: 22 IFTSTTAAPAAGKPKRLVT-KLLHR----DSLLYNPNDTVDAQAQRTLNMSMARFIYLSQ 76
+F S++ + +A PKR + +++H+ L +N +N+ R Y+
Sbjct: 44 LFPSSSCSSSAKGPKRKASLEVVHKHGPCSQLNHNGKAKTTISHTDIMNLDNERVKYIQS 103
Query: 77 KSSQKAHDTRAHLHPGISTVPV----------FYVNFSIGQPPVPQLAVLDTGSSLIWVK 126
+ S+ + +T+P ++V +G P V DTGS L W +
Sbjct: 104 RLSKNLGRENSVKELDSTTLPAKSGSLIGSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQ 163
Query: 127 CQPCE----QCGATTFDPSKSLTYATLPCDSSYCT--------NDCGGYPDECWYNIRYT 174
C+PC + FDPSKS +Y + C SS CT + C C Y I+Y
Sbjct: 164 CEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYG 223
Query: 175 NGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN-AHFSDEQFTGVFGLGPA----T 229
+ S G + E+ +D FL FGC +N FS G+ GLG
Sbjct: 224 DKSTSVGFLSQERLTITATDIVDDFL----FGCGQDNEGLFSGS--AGLIGLGRHPISFV 277
Query: 230 SSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD--STPMSVIDGS---YYV 284
T S+ K+ FSYC+ + + + L G A + TP+S I G Y +
Sbjct: 278 QQTSSIYNKI---FSYCLPSTS---SSLGHLTFGASAATNANLKYTPLSTISGDNTFYGL 331
Query: 285 TLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL 344
+ GIS+G L P + + T+S G IDSGT +T L P+AY LR F+
Sbjct: 332 DIVGISVGGTKL---PAV--SSSTFSAGGSIIDSGTVITRLAPTAYAALRSA----FRQG 382
Query: 345 LPSYPM---DPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLA 401
+ YP+ D + CY + +++ P + F FAGG + L + S+ CLA
Sbjct: 383 MEKYPVANEDGLFDTCYDFSGYKEIS-VPKIDFEFAGGVTVELPLVGILIGRSAQQVCLA 441
Query: 402 VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
NG D++I G + Q+ V YD+ ++ F C
Sbjct: 442 FAA---NGND-NDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 480
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 114/370 (30%), Positives = 166/370 (44%), Gaps = 49/370 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQ-C---GATTFDPSKSLTYATLPCDSS 154
+ V +G P + DTGS L W +C+PC + C F+PSKS +Y + C S
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSP 197
Query: 155 YC---TNDCGGYP----DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C + G P C Y I+Y + S G ++ ++D FL FGC
Sbjct: 198 TCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTDVFNNFL----FGC 253
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLV----EKVGSKFSYCIGNLNYFEYAYNMLILG 263
NN F GV GL + SLV +K G FSYC+ + + Y G
Sbjct: 254 GQNNRGL----FVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCLPSTSS-STGYLTFGSG 308
Query: 264 EGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
G TP S+++ Y++ L IS+G + L ++F S AG IDSG
Sbjct: 309 GGTSKAVKFTP-SLVNSQGPSFYFLNLIAISVGGRKLSTSASVF------STAGTIIDSG 361
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYP-MDPAWHL--CYSGNINRDLQGFPAMAFHF 376
T ++ L P+AY LR FQ + YP PA L CY + D P + +F
Sbjct: 362 TVISRLPPTAYSDLRAS----FQQQMSKYPKAAPASILDTCYDFS-QYDTVDVPKINLYF 416
Query: 377 AGGADLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
+ GA++ LD +FY + S CLA G SD D++I+G + Q+ ++V YD+
Sbjct: 417 SDGAEMDLDPSGIFYILNISQVCLAFAGNSDAT-----DIAILGNVQQKTFDVVYDVAGG 471
Query: 436 QLYFQRIDCE 445
++ F CE
Sbjct: 472 RIGFAPGGCE 481
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 158/362 (43%), Gaps = 41/362 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDSS 154
+ V +G P V DTGS WV+C+PC +C FDP+KS TYA + C S
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDS 222
Query: 155 YC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH- 209
C TN C G C Y ++Y +G + G + D K F FGC
Sbjct: 223 ACADLDTNGCTG--GHCLYAVQYGDGSYTVGFFAQDTLTIA-HDAIKGFR----FGCGEK 275
Query: 210 NNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
NN F + G+ GLG TS T K G F+YC+ L L G G+
Sbjct: 276 NNGLFG--KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALT---TGTGYLDFGPGSAG 330
Query: 269 EGDS-TPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
TPM G YYV + GI +G + + + ++F S AG +DSGT +T L
Sbjct: 331 NNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVF------STAGTLVDSGTVITRL 384
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADL 382
+AY L + + L Y P + + CY D++ P ++ F GGA L
Sbjct: 385 PATAYTALSSAFDKVM--LARGYKKAPGYSILDTCYDFTGLSDVE-LPTVSLVFQGGACL 441
Query: 383 VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
+D + Y S + CLA NG+ + ++I+G Q+ Y V YDL K + F
Sbjct: 442 DVDVSGIVYAISEAQVCLAFAS---NGDD-ESVAIVGNTQQKTYGVLYDLGKKTVGFAPG 497
Query: 443 DC 444
C
Sbjct: 498 SC 499
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 105/366 (28%), Positives = 164/366 (44%), Gaps = 43/366 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ + +G P L LDTGS W++C+PC C FDPSKS TY+ + C S
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSRE 193
Query: 156 C-------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
C ++C +C Y I Y + + G + + +D F+ FGC
Sbjct: 194 CQELGSSHKHNCSS-DKKCPYEITYADDSYTVGNLARDTLTLSPTDAVPGFV----FGCG 248
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLV-EKVGSKFSYCIGN----LNYFEYAYNMLILG 263
HNNA S + G+ GLG +S S V + G+ FSYC+ + Y ++ G
Sbjct: 249 HNNAG-SFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPSATGYLSFS------G 301
Query: 264 EGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
A ++ ++ G YY+ L GI++ + + + P++F + AG IDSG
Sbjct: 302 AAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFA-----TAAGTIIDSG 356
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
T + L PSAY LR V G P + CY + ++ P++A FA G
Sbjct: 357 TAFSCLPPSAYAALRSSVRSAM-GRYKRAPSSTIFDTCYDLTGHETVR-IPSVALVFADG 414
Query: 380 ADLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
A + L V Y S+ S CLA P+ + L ++G Q+ V YD+ ++++
Sbjct: 415 ATVHLHPSGVLYTWSNVSQTCLAFLPNPDD----TSLGVLGNTQQRTLAVIYDVDNQKVG 470
Query: 439 FQRIDC 444
F C
Sbjct: 471 FGANGC 476
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 165/393 (41%), Gaps = 57/393 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQ------------------CGATTFDP 140
++V F +G P P + + DTGS L WVKC+ F P
Sbjct: 110 YFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFRP 169
Query: 141 SKSLTYATLPCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETS 193
S T++ +PC S C + +C C Y+ RY + ++G +G++ S
Sbjct: 170 GDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDSATVALS 229
Query: 194 --------DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFS 244
+ K L V GC+ +A E GV LG + S S + G +FS
Sbjct: 230 GGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYSNISFASRAASRFGGRFS 289
Query: 245 YCIGNLNYFEYAYNMLILGEG-------AILEGDSTPM---SVIDGSYYVTLEGISLGEK 294
YC+ + A + L G G A G TP+ + + Y V ++ +S+
Sbjct: 290 YCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSVSVDGV 349
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
LDI ++ D S+ G IDSGT+LT L AY+ + + + G LP MDP +
Sbjct: 350 ALDIPAEVW---DVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAG-LPRVAMDP-F 404
Query: 355 HLCYSGNINRDLQG---FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGER 411
CY+ D G P +A FAG A L A+S + V C+ V G
Sbjct: 405 DYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPG-- 462
Query: 412 FKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+S+IG I QQ + +DL ++ L F++ C
Sbjct: 463 ---VSVIGNILQQEHLWEFDLNNRWLRFRQTSC 492
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 166/373 (44%), Gaps = 36/373 (9%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYC----- 156
IG PP L ++DT S L WV+ C C T F+P S ++ + PC SS C
Sbjct: 5 IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRSK 64
Query: 157 ---TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH 213
+ C C + + Y +G ++ G I E F+ ++ D + L DV FGC+ +
Sbjct: 65 LGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASKDLQ 124
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGS--------KFSYCIGNLNYFEYAYNMLILGEG 265
+ +G GL + S ++GS +FSYC N + ++I G+
Sbjct: 125 RPVDFSSGTLGL---NRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGDS 181
Query: 266 AI---------LEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
I LE + S++D YYV L+GIS+G ++L I + FK D + G +
Sbjct: 182 GIPAHHFQYLSLEQEPPIASIVD-FYYVGLQGISVGGELLHIPRSAFKI-DRLGNGGTYF 239
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNINRDLQGFPAMAFH 375
DSGTT+++LV A+ L + L + D LCY + L P + H
Sbjct: 240 DSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTLH 299
Query: 376 FAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKD--LSIIGMIAQQNYNVAYDLV 433
F D+ L SV+ + + + + + +N +++IG QQ+Y + +DL
Sbjct: 300 FKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLE 359
Query: 434 SKQLYFQRIDCEL 446
++ F +C +
Sbjct: 360 RSRIGFAPANCVM 372
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 132/466 (28%), Positives = 199/466 (42%), Gaps = 52/466 (11%)
Query: 10 LSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMA 69
L ++ L ST + T A + L KL H D+ N T + +R +
Sbjct: 5 LFVLVLIMCSTTALITCTNGGAGDGGEGLHMKLTHVDA---KGNYTAEELVRRAVAAGKQ 61
Query: 70 RFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP 129
R +L + +T+ + + IG PP A++DTGS L+W +C
Sbjct: 62 RLAFLDAAMAGGGDGGGVGAPVRWATLQ-YVAEYLIGDPPQRAEALIDTGSDLVWTQCST 120
Query: 130 C--EQCGATT---FDPSKSLTYATLPCDSSYCT--NDCGGYPD---ECWYNIRYTNGPDS 179
C + C ++ S S T+A +PC + C +D + D C Y G +
Sbjct: 121 CLRKVCARQALPYYNSSASSTFAPVPCAARICAANDDIIHFCDLAAGCSVIAGYGAGVVA 180
Query: 180 QGTIGSEQFNFETSDEGKTFLYDVGFGCS--HNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
GT+G+E F F++ ++ FGC + +G+ GLG SLV
Sbjct: 181 -GTLGTEAFAFQSGTA------ELAFGCVTFTRIVQGALHGASGLIGLG---RGRLSLVS 230
Query: 238 KVGS-KFSYCIGNLNYFEYAYNMLILGEGAIL--EGDSTPMSVIDGS-----YYVTLEGI 289
+ G+ KFSYC+ + A L +G A L GD + G YY+ L G+
Sbjct: 231 QTGATKFSYCLTPYFHNNGATGHLFVGASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGL 290
Query: 290 SLGEKMLDIDPNLFKKNDTWS---DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP 346
++GE L I +F + GV IDSG+ T LV AY L E+ G L
Sbjct: 291 TVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLV 350
Query: 347 SYPMDP-AWHLCYSGNINRDL-QGFPAMAFHFAGGADLVLDAESVFY---QESSSVFCLA 401
+ P D LC + RD+ + PA+ FHF GGAD+ + AES + + ++ + +
Sbjct: 351 APPPDADDGALCVA---RRDVGRVVPAVVFHFRGGADMAVPAESYWAPVDKAAACMAIAS 407
Query: 402 VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
GP ++ S+IG QQN V YDL + FQ DC L
Sbjct: 408 AGP-------YRRQSVIGNYQQQNMRVLYDLANGDFSFQPADCSAL 446
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 125/444 (28%), Positives = 187/444 (42%), Gaps = 63/444 (14%)
Query: 41 KLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP 97
+ +HRDS ++P+ T A+ S R LS+ + + +++ P
Sbjct: 38 EFIHRDSARSPFHDPSLTAPARVLEAARRSTVRAAALSRSYVRVDAPSADGFVSELTSTP 97
Query: 98 VFYV-NFSIGQPPVPQLAVLDTGSSLIWVKCQ-------------PCEQCGATTFDPSKS 143
Y+ +IG PP +A+ DTGS LIW+ C Q FDPSKS
Sbjct: 98 FEYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKS 157
Query: 144 LTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETS-----D 194
T+ + CDS C+ CG +C Y+ Y +G + G + +E F F + D
Sbjct: 158 TTFRLVDCDSVACSELPEASCGA-DSKCRYSYSYGDGSHTSGVLSTETFTFADAPGARGD 216
Query: 195 EGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS------KFSYCIG 248
T + +V FGCS S SLV ++G+ +FSYC+
Sbjct: 217 GTTTRVANVNFGCSTTFVGSSVGDGLVG-----LGGGDLSLVSQLGADTSLGRRFSYCL- 270
Query: 249 NLNYFEYAYNMLILGEGAILE---GDSTPM--SVIDGSYYVTLEGISLGEKMLDIDPNLF 303
+ Y A + L G A + +TP+ S + Y V L + +G K F
Sbjct: 271 -VPYSVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKT-------F 322
Query: 304 KKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNIN 363
+ D + + +DSGTTLT+L + L KE+ + L P+ + LC+ +
Sbjct: 323 EAPD---RSPLIVDSGTTLTFLPEALVDPLVKELTGRIK-LPPAQSPERLLPLCFDVSGV 378
Query: 364 RDLQ---GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGM 420
R+ Q P + GGA + L AE+ F + CLAV E+F SIIG
Sbjct: 379 REGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAV---SAMSEQFP-ASIIGN 434
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDC 444
IAQQN +V YDL + F C
Sbjct: 435 IAQQNMHVGYDLDKGTVTFAPAAC 458
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 117/411 (28%), Positives = 185/411 (45%), Gaps = 59/411 (14%)
Query: 76 QKSSQKAHDTRAHLHPGISTVPV------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP 129
++++++ H A + G ++ PV + + IG PP A++DTGS+LIW +C
Sbjct: 44 RRATERTHRRLASM--GEASAPVHWAESQYIAEYLIGDPPQQAEAIIDTGSNLIWTQCST 101
Query: 130 CEQCGA-----TTFDPSKSLTYATLPCDSSYCT----NDCGGYPDECWYNIRYTNGPDSQ 180
C+ G + +DPS+S T + C+ + C C C Y G
Sbjct: 102 CQPAGCFSQNLSFYDPSRSRTARPVACNDTACALGSETRCARDNKACAVLTAYGAGVIG- 160
Query: 181 GTIGSEQFNFETSDEGKTFLYDVGFGC--SHNNAHFSDEQFTGVFGLGPATSSTHSLVEK 238
G +G+E F F+ E + + FGC + S + +G+ GLG SLV +
Sbjct: 161 GVLGTEAFTFQPQSENVS----LAFGCIAATRLTPGSLDGASGIIGLG---RGNLSLVSQ 213
Query: 239 VG-SKFSYCIGNLNYFEYAYNM--LILGEGAILEGDSTPMSVI-----------DGSYYV 284
+G +KFSYC+ YF + N L +G A L P + + YY+
Sbjct: 214 LGDNKFSYCL--TPYFSQSTNTSRLFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYL 271
Query: 285 TLEGISLGEKMLDIDPNLF--KKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKE-VEDLF 341
L GI++G+ L + F ++ T AG IDSG+ T LV AYQ LR E V+ L
Sbjct: 272 PLTGITVGDAKLAVPEAAFDLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLG 331
Query: 342 QGLLPSYPMDPAWHLCYS---GNINRDLQGFPAMAFHF-AGGADLVLDAESVFYQESSSV 397
++P LC + G++ + + P + HF +GG D+ + E+ + S
Sbjct: 332 ASIVPPPAGAEGLDLCAAVAHGDVGKLV---PPLVLHFGSGGGDVAVPPENYWGPVDDST 388
Query: 398 FCLAV----GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
C+ V GP+ + +IIG QQ+ ++ YDL L FQ DC
Sbjct: 389 ACMVVFSSGGPNST--LPMNETTIIGNYMQQDMHLLYDLEKGMLSFQPADC 437
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 104/364 (28%), Positives = 156/364 (42%), Gaps = 39/364 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G P VLDTGS ++W++C PC +C T FDP+KS TYA +PC +
Sbjct: 118 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAPL 177
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C C Y + Y +G + G +E F + + V GC H+N
Sbjct: 178 CRRLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFR-----RNRVTRVALGCGHDN 232
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE-G 270
+ S + KFSYC+ + + + +I G+ A+
Sbjct: 233 EGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSA-SAKPSSVIFGDSAVSRTA 291
Query: 271 DSTPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
TP+ +D YY+ L GIS+ G + + +LF+ D + GV IDSGT++T L
Sbjct: 292 HFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRL-DAAGNGGVIIDSGTSVTRLT 350
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFHFAGGAD 381
AY LR + L P + C+ DL G P + HF GAD
Sbjct: 351 RPAYIALRDAFR-IGASHLKRAPEFSLFDTCF------DLSGLTEVKVPTVVLHFR-GAD 402
Query: 382 LVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ L A + ++S FC A + LSIIG I QQ + ++YDL ++ F
Sbjct: 403 VSLPATNYLIPVDNSGSFCFAFAGT------MSGLSIIGNIQQQGFRISYDLTGSRVGFA 456
Query: 441 RIDC 444
C
Sbjct: 457 PRGC 460
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 119/435 (27%), Positives = 180/435 (41%), Gaps = 75/435 (17%)
Query: 70 RFIYLSQKSSQKAHDTRAHLHPGISTVPV----------FYVNFSIGQPPVPQLAVLDTG 119
R +++ ++A +T A +P+ ++V F +G P P L V DTG
Sbjct: 55 RMAFIASHGRRRARETAAGSSAAAFEMPLTSGAYTGIGQYFVRFRVGTPAQPFLLVADTG 114
Query: 120 SSLIWVKC-QPCEQCGAT------TFDPSKSLTYATLPCDSSYCTND-------CGGYPD 165
S L WVKC +P + F P S T+A + C S CT C
Sbjct: 115 SDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGS 174
Query: 166 ECWYNIRYTNGPDSQGTIGSEQFNFETSDEG----KTFLYDVGFGCSHNNAHFSDEQFTG 221
C Y+ RY +G ++GT+G+E S G K L + GC+ + S E G
Sbjct: 175 PCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKLKGLVLGCTSSYTGPSFEVSDG 234
Query: 222 VFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEG--------------- 265
V LG + S S + +FSYC+ + A + L G
Sbjct: 235 VLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATSYLTFGPNPAVASSSSPSSPAPA 294
Query: 266 --------AILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
TP+ ++D Y V ++ +S+ + L I ++ D + G
Sbjct: 295 SCTAAAPRPRPRARQTPL-LLDRRMRPFYDVAVKAVSVAGQFLKIPRAVW---DVDAGGG 350
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL--LPSYPMDPAWHLCYSGNINRDLQGFPA 371
V +DSGT+LT L AY+ + V L +GL LP MDP + CY+ P
Sbjct: 351 VILDSGTSLTVLAKPAYRAV---VAALSEGLAGLPRVTMDP-FEYCYNWTSPSGDVTLPK 406
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAV--GPSDINGERFKDLSIIGMIAQQNYNVA 429
MA HFAG A L +S + V C+ + GP + +S+IG I QQ +
Sbjct: 407 MAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGP-------WPGISVIGNILQQEHLWE 459
Query: 430 YDLVSKQLYFQRIDC 444
+D+ +++L FQR C
Sbjct: 460 FDIKNRRLKFQRSRC 474
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 152/365 (41%), Gaps = 30/365 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G P L VLDTGS ++W++C PC C A + FDP +S +YA + C +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPI 181
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C + C Y + Y +G + G SE F + V GC H+N
Sbjct: 182 CRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR----VQRVAIGCGHDN 237
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNL-------NYFEYAYNMLILGE 264
+ S + G FSYC+ + +
Sbjct: 238 EGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAV 297
Query: 265 GAILEGDSTPMS---VIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
A TPM + YYV L G S+ G ++ + + + N T GV +DSGT
Sbjct: 298 AAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGT 357
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
++T L Y+ +R GL S + CY+ + R ++ P ++ H AGGA
Sbjct: 358 SVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVK-VPTVSMHLAGGA 416
Query: 381 DLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
+ L E+ ++S FC A+ +D +SIIG I QQ + V +D ++++ F
Sbjct: 417 SVALPPENYLIPVDTSGTFCFAMAGTD------GGVSIIGNIQQQGFRVVFDGDAQRVGF 470
Query: 440 QRIDC 444
C
Sbjct: 471 VPKSC 475
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 127/432 (29%), Positives = 185/432 (42%), Gaps = 87/432 (20%)
Query: 45 RDSLLYNPN--DTVDAQAQRTLNMSMARFIY--LSQKSSQKAHDTRAHLHPGISTVPV-- 98
R S L P+ DT+ A +R A +I +S + + + D++A +TVP
Sbjct: 80 RASSLATPSVADTLRADQRR------AEYILRRVSGRGTPQLWDSKAEAA--TATVPANW 131
Query: 99 --------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-----FDPSKSLT 145
+ V S+G P V Q +DTGS L WV+C PC + FDP++S +
Sbjct: 132 GFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSS 191
Query: 146 YATLPCDSSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
YA +PC C + C +C Y + Y +G + G S+ +D + F
Sbjct: 192 YAAVPCGGPVCGGLGIYASSCSA--AQCGYVVSYGDGSKTTGVYSSDTLTLSPNDAVRGF 249
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEY 255
FGC H + F+ G+ GLG SLVE+ G FSYC L
Sbjct: 250 F----FGCGHAQSGFTGND--GLLGLG---REEASLVEQTAGTYGGVFSYC---LPTRPS 297
Query: 256 AYNMLILG--EGAILEGDSTPM---SVIDGSYYVT-LEGISLGEKMLDIDPNLFKKNDTW 309
L LG GA G ST S +YYV L GIS+G + L + ++F
Sbjct: 298 TTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFA----- 352
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYS----GNI 362
G +D+GT +T L P+AY LR YP PA + CY+ G +
Sbjct: 353 --GGTVVDTGTVITRLPPTAYAALRSAFRSGMASY--GYPSAPATGILDTCYNFSGYGTV 408
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIA 422
P +A F+GGA + L A+ + S CLA PS +G ++I+G +
Sbjct: 409 T-----LPNVALTFSGGATVTLGADGIL-----SFGCLAFAPSGSDG----GMAILGNVQ 454
Query: 423 QQNYNVAYDLVS 434
Q+++ V D S
Sbjct: 455 QRSFEVRIDGTS 466
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 152/365 (41%), Gaps = 30/365 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G P L VLDTGS ++W++C PC C A + FDP +S +YA + C +
Sbjct: 128 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPI 187
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C + C Y + Y +G + G SE F + V GC H+N
Sbjct: 188 CRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR----VQRVAIGCGHDN 243
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNL-------NYFEYAYNMLILGE 264
+ S + G FSYC+ + +
Sbjct: 244 EGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAV 303
Query: 265 GAILEGDSTPMS---VIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
A TPM + YYV L G S+ G ++ + + + N T GV +DSGT
Sbjct: 304 AAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGT 363
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
++T L Y+ +R GL S + CY+ + R ++ P ++ H AGGA
Sbjct: 364 SVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVK-VPTVSMHLAGGA 422
Query: 381 DLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
+ L E+ ++S FC A+ +D +SIIG I QQ + V +D ++++ F
Sbjct: 423 SVALPPENYLIPVDTSGTFCFAMAGTD------GGVSIIGNIQQQGFRVVFDGDAQRVGF 476
Query: 440 QRIDC 444
C
Sbjct: 477 VPKSC 481
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 171/391 (43%), Gaps = 27/391 (6%)
Query: 63 TLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSL 122
TL ARF+YLS + + I P + V +IG P P L LDT +
Sbjct: 52 TLLQDKARFLYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDA 111
Query: 123 IWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSYCT---NDCGGYPDECWYNIRYTNGPD 178
W+ C C C ++ FDPSKS + TL C++ C N C +N+ Y
Sbjct: 112 AWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTY----- 166
Query: 179 SQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEK 238
G+ +T + + FGC N A + G+ GLG S S +
Sbjct: 167 -GGSTIEAYLTQDTLTLASDVIPNYTFGC-INKASGTSLPAQGLMGLGRGPLSLISQSQN 224
Query: 239 V-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEK 294
+ S FSYC+ N ++ ++ + + + +TP+ YYV L GI +G K
Sbjct: 225 LYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNK 284
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
++DI P D + AG DSGT T LV AY +R E + + +
Sbjct: 285 IVDI-PTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNA--NATSLGGF 341
Query: 355 HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPSDINGERFK 413
CYSG++ FP++ F FA G ++ L +++ S+ ++ CLA+ + +N
Sbjct: 342 DTCYSGSV-----VFPSVTFMFA-GMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSV- 394
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
L++I + QQN+ V D+ + +L R C
Sbjct: 395 -LNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 171/391 (43%), Gaps = 27/391 (6%)
Query: 63 TLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSL 122
TL ARF+YLS + + I P + V +IG P P L LDT +
Sbjct: 52 TLLQDKARFLYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDA 111
Query: 123 IWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSYCT---NDCGGYPDECWYNIRYTNGPD 178
W+ C C C ++ FDPSKS + TL C++ C N C +N+ Y
Sbjct: 112 AWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTY----- 166
Query: 179 SQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEK 238
G+ +T + + FGC N A + G+ GLG S S +
Sbjct: 167 -GGSTIEAYLTQDTLTLASDVIPNYTFGC-INKASGTSLPAQGLMGLGRGPLSLISQSQN 224
Query: 239 V-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEK 294
+ S FSYC+ N ++ ++ + + + +TP+ YYV L GI +G K
Sbjct: 225 LYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNK 284
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
++DI P D + AG DSGT T LV AY +R E + + +
Sbjct: 285 IVDI-PTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNA--NATSLGGF 341
Query: 355 HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPSDINGERFK 413
CYSG++ FP++ F FA G ++ L +++ S+ ++ CLA+ + +N
Sbjct: 342 DTCYSGSV-----VFPSVTFMFA-GMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSV- 394
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
L++I + QQN+ V D+ + +L R C
Sbjct: 395 -LNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 152/365 (41%), Gaps = 30/365 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G P L VLDTGS ++W++C PC C A + FDP +S +YA + C +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPI 181
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C + C Y + Y +G + G SE F + V GC H+N
Sbjct: 182 CRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR----VQRVAIGCGHDN 237
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNL-------NYFEYAYNMLILGE 264
+ S + G FSYC+ + +
Sbjct: 238 EGLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAV 297
Query: 265 GAILEGDSTPMS---VIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
A TPM + YYV L G S+ G ++ + + + N T GV +DSGT
Sbjct: 298 AAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGT 357
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
++T L Y+ +R GL S + CY+ + R ++ P ++ H AGGA
Sbjct: 358 SVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVK-VPTVSMHLAGGA 416
Query: 381 DLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
+ L E+ ++S FC A+ +D +SIIG I QQ + V +D ++++ F
Sbjct: 417 SVALPPENYLIPVDTSGTFCFAMAGTD------GGVSIIGNIQQQGFRVVFDGDAQRVGF 470
Query: 440 QRIDC 444
C
Sbjct: 471 VPKSC 475
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 125/445 (28%), Positives = 187/445 (42%), Gaps = 60/445 (13%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLS-------QKSSQKAHDTRAHLHPGI 93
KL + L P +D Q L AR +S +K+ + +H + +H G
Sbjct: 54 KLKSQSKFLGPPKSRLDGTRQ-LLQSDNARRQMISSLRHGTRRKAFEVSHTAQIPIHSGA 112
Query: 94 -STVPVFYVNFSIGQP-PVPQLAVLDTGSSLIWVKCQP-CEQCG------ATTFDPSKSL 144
S ++V+ IG P P + V DTGS L W+ C+ C+ C F + S
Sbjct: 113 DSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSS 172
Query: 145 TYATLPCDSSYCT---------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDE 195
++ T+PC S C +C C ++ RY NGP + G +E +D
Sbjct: 173 SFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDH 232
Query: 196 GKTFLYDVGFGCS---HNNAHFSDEQFTGVFGLGPATSS-THSLVEKVGSKFSYCIGNLN 251
K L+DV GC+ + F D GV GLG S L E G+KFSYC+ ++
Sbjct: 233 KKIRLFDVLIGCTESFNETNGFPD----GVMGLGYRKHSLALRLAEIFGNKFSYCL--VD 286
Query: 252 YFEYAYNMLILGEGAILEGDSTPMS-------VIDGSYYVTLEGISLGEKMLDIDPNLFK 304
+ + + L G I E M I+ Y V + GIS+G ML I
Sbjct: 287 HLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSI------ 340
Query: 305 KNDTWSDAGV---FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD-PAW-HLCYS 359
+D W+ GV +DSGT+LT L AY + ++ +F P++ P + C+
Sbjct: 341 SSDIWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCFE 400
Query: 360 GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIG 419
+ D P + HFA GA +S + + CL + +D G SI+G
Sbjct: 401 -DKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGS-----SILG 454
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDC 444
+ QQN+ YDL +L F C
Sbjct: 455 NVMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 109/400 (27%), Positives = 179/400 (44%), Gaps = 57/400 (14%)
Query: 77 KSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ----PCEQ 132
++ Q +++ R+ ++ + V+ IG PP Q VLDTGS L W++C P +
Sbjct: 62 QTKQPSYNYRSSFKYSMALI----VSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKP 117
Query: 133 CGATTFDPSKSLTYATLPCDSSYCTNDCGGYP-----DE---CWYNIRYTNGPDSQGTIG 184
T+FDPS S +++ LPC+ C + D+ C Y+ Y +G ++G++
Sbjct: 118 PPTTSFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLV 177
Query: 185 SEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFS 244
E+ F +S + GC+ + +DE+ G+ G+ S S + SKFS
Sbjct: 178 REKITFSSSQSTPPLI----LGCAEAS---TDEK--GILGMNLGRRSFASQAKI--SKFS 226
Query: 245 YCI------------------GNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTL 286
YC+ N N + Y L+ + + P+ +Y + +
Sbjct: 227 YCVPTRQARAGLSSTGSFYLGNNPNSGRFQYINLLTFTPSQRSPNLDPL-----AYTIPM 281
Query: 287 EGISLGEKMLDIDPNLFKKNDTWSDAG-VFIDSGTTLTWLVPSAYQTLRKEVEDLF-QGL 344
+GI +G L+I LF+ + S AG IDSG+ T+LV AY +R+EV L L
Sbjct: 282 QGIRMGNARLNISATLFRPDP--SGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKL 339
Query: 345 LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGP 404
Y +C+ GN + M F F G ++V+D V V C+ +G
Sbjct: 340 KKGYVYGGVSDMCFDGNPMEIGRLIGNMVFEFEKGVEIVIDKWRVLADVGGGVHCIGIGR 399
Query: 405 SDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S++ G +IIG QQN V YDL ++++ + DC
Sbjct: 400 SEMLG---AASNIIGNFHQQNLWVEYDLANRRIGLGKADC 436
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 158/371 (42%), Gaps = 58/371 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC---- 151
+ V+ +G P + DTGS L WV+C+PC C FDPS S TYA + C
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 152 ----DSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
D+S C++D C Y ++Y + + G + + SD F+ FGC
Sbjct: 209 CQELDASGCSSD-----SRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFV----FGC 259
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSL-VEKVGSKFSYCIGNLNYFEYAYNMLILGEGA 266
NA Q G+FGLG S S G F+YC+ + + G G
Sbjct: 260 GDQNAGLFG-QVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSS----------GRGY 308
Query: 267 ILEGDSTPM-----SVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
+ G + P ++ DG+ YY+ L GI +G + + I + G ID
Sbjct: 309 LSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRI-----PATAFAAAGGTVID 363
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAF 374
SGT +T L P AY LR F + Y PA + CY +R Q P +
Sbjct: 364 SGTVITRLPPRAYAPLRAA----FARSMAQYKKAPALSILDTCYDFTGHRTAQ-IPTVEL 418
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
FAGGA + LD V Y S CLA P+ + ++I+G Q+ + V YD+ +
Sbjct: 419 AFAGGATVSLDFTGVLYVSKVSQACLAFAPNADD----SSIAILGNTQQKTFAVTYDVAN 474
Query: 435 KQLYFQRIDCE 445
+++ F C
Sbjct: 475 QRIGFGAKGCS 485
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 115/383 (30%), Positives = 172/383 (44%), Gaps = 65/383 (16%)
Query: 76 QKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQ--C 133
Q + KA A+L I T+ + V S+G P V Q +DTGS + WV+C+PC C
Sbjct: 120 QLAGSKAATVPANLGFSIGTL-QYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPC 178
Query: 134 GATT---FDPSKSLTYATLPCDSSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIG 184
+ FDP++S +Y+ +PC ++ C +N C G +C Y + Y +G + G
Sbjct: 179 YSQRDPLFDPTRSSSYSAVPCAAASCSQLALYSNGCSG--GQCGYVVSYGDGSTTTGVYS 236
Query: 185 SEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----G 240
S+ S+ K FL FGC H F GV GL SLV + G
Sbjct: 237 SDTLTLTGSNALKGFL----FGCGHAQQGL----FAGVDGLLGLGRQGQSLVSQASSTYG 288
Query: 241 SKFSYCIGNLNYFEYAYNMLILGEGAILEG-DSTPMSVI--DGSYY-VTLEGISLGEKML 296
FSYC L + + + LG + G +TP+ D +YY V L GIS+G + L
Sbjct: 289 GVFSYC---LPPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPL 345
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP-SYPMDPAWH 355
ID ++F +G +D+GT +T L P+AY LR + P YP PA
Sbjct: 346 SIDASVFA-------SGAVVDTGTVVTRLPPTAYSALRSAFR---AAMAPYGYPSAPATG 395
Query: 356 L---CYS----GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDIN 408
+ CY G + P ++ F GGA + L + CLA P+ +
Sbjct: 396 ILDTCYDFTRYGTVT-----LPTISIAFGGGAAMDLGTSGILTSG-----CLAFAPTGGD 445
Query: 409 GERFKDLSIIGMIAQQNYNVAYD 431
+ SI+G + Q+++ V +D
Sbjct: 446 SQ----ASILGNVQQRSFEVRFD 464
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 164/379 (43%), Gaps = 46/379 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC----EQCGATTFDPSKSLTYATLPCDSS 154
+ ++ S+G PP P LDTGS L+W +C PC EQ A DP+ S T+A LPCD+
Sbjct: 90 YLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALPCDAP 149
Query: 155 YCT----NDCGGYP---DECWYNIRYTNGPDSQGTIGSEQFNFETSDE-GKTFLYDVGFG 206
C CGG C Y Y + + G + ++ F F D G V FG
Sbjct: 150 LCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRVTFG 209
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA 266
C H N TG+ G G S S + + FSYC ++ + + +++ LG A
Sbjct: 210 CGHINKGIFQANETGIAGFGRGRWSLPSQLNV--TSFSYCFTSM-FDTKSSSVVTLGAAA 266
Query: 267 --ILE-------GDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
+L GD +I Y+V L GIS+G + + + + +
Sbjct: 267 AELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLRSS------ 320
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ--GFP 370
IDSG ++T L Y+ ++ E GL + A LC++ + + P
Sbjct: 321 -TIIDSGASITTLPEDVYEAVKAEFVSQV-GLPAAAAGSAALDLCFALPVAALWRRPAVP 378
Query: 371 AMAFHFAGGADLVL-DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
A+ H GGAD L VF ++ V C+ + + GE+ +IG QQN +V
Sbjct: 379 ALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAA--AGEQV----VIGNYQQQNTHVV 432
Query: 430 YDLVSKQLYFQRIDCELLA 448
YDL + L F C+ LA
Sbjct: 433 YDLENDVLSFAPARCDKLA 451
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 119/438 (27%), Positives = 195/438 (44%), Gaps = 37/438 (8%)
Query: 31 AAGKPKRLVTKLLHR---DSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRA 87
A KP +++HR +S Y N T + R + +S R L+ +S
Sbjct: 21 AISKPDGFSLEIVHRYSRESPFYPGNITDYERITRLVELSKIRAHNLAITTSSGFSPEAF 80
Query: 88 HLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSL 144
L + V IG P VP V DTGS L W +C+PC + F+ + S
Sbjct: 81 RLRISQDDT-CYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASR 139
Query: 145 TYATLPCDSSYCTNDCGGYP---DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
TY LPC +CTN+ + D+C Y I Y G + G + Q ++++ + Y
Sbjct: 140 TYRDLPCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGV--AAQDILQSAENDRIPFY 197
Query: 202 DVGFGCSHNNAHFSD-EQFTGVFGLGPATSSTHSLVEKVG----SKFSYCIG--NLNYFE 254
FGCS +N +FS E G+ S SL++++ ++FSYC+ +L+
Sbjct: 198 ---FGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPS 254
Query: 255 YAYNMLILG---EGAILEGDSTPMSVIDG--SYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
+A ++L G + + STP G +Y++ L +S+ + I P F
Sbjct: 255 HATSLLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFALKPDG 314
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF-QGLLPSYPMDPAWHLCYSGNINRDLQG 368
+ G IDSGT +T++ +AY + ++ F Q + + ++CY
Sbjct: 315 T-GGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLSGYICYKQQ-GHTFHN 372
Query: 369 FPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
+P+MAFHF GAD ++ E V+ + FC+A+ P + +IIG + Q N
Sbjct: 373 YPSMAFHFQ-GADFFVEPEYVYLTVQDRGAFCVALQPISP-----QQRTIIGALNQANTQ 426
Query: 428 VAYDLVSKQLYFQRIDCE 445
YD ++QL F +C+
Sbjct: 427 FIYDAANRQLLFTPENCQ 444
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 169/370 (45%), Gaps = 58/370 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ IG+PP +LDTGS + WV+C PC C F+P+ S +++TL C++
Sbjct: 149 YFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTRQ 208
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C ++C D C Y + Y +G + G F ET G + +V GC HNN
Sbjct: 209 CRSLDVSECRN--DTCLYEVSYGDGSYTVG-----DFVTETITLGSAPVDNVAIGCGHNN 261
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
F G GL + S ++ + FSYC+ + + + LE
Sbjct: 262 EGL----FVGAAGLLGLGGGSLSFPSQINATSFSYCLVDRDS----------ESASTLEF 307
Query: 271 DST--PMSV---------IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
+ST P +V +D YYV L G+S+G +++ I + F+ +++ + GV +DSG
Sbjct: 308 NSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDES-GNGGVIVDSG 366
Query: 320 TTLTWLVPSAYQTLR----KEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
T +T L Y +LR K D LPS + CY + +++ P ++FH
Sbjct: 367 TAITRLQTDVYNSLRDAFVKRTRD-----LPSTNGIALFDTCYDLSSKGNVE-VPTVSFH 420
Query: 376 FAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F G +L L A++ +S FC A P+ LSIIG + QQ V YDLV+
Sbjct: 421 FPDGKELPLPAKNYLVPLDSEGTFCFAFAPTA------SSLSIIGNVQQQGTRVVYDLVN 474
Query: 435 KQLYFQRIDC 444
+ F C
Sbjct: 475 HLVGFVPNKC 484
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 113/401 (28%), Positives = 184/401 (45%), Gaps = 53/401 (13%)
Query: 77 KSSQKAHDTRAHLHPGISTVPVFYVNF--SIGQPPVPQLAVLDTGSSLIWVKCQPCEQC- 133
+SS A ++ P S + +N+ ++G ++DT S L WV+C+PC+ C
Sbjct: 87 RSSDAASASKLAQVPVTSGARLRTLNYVATVGIGGGEATVIVDTASELTWVQCEPCDACH 146
Query: 134 --GATTFDPSKSLTYATLPCDSSYC----------TNDCGGYPDECWYNIRYTNGPDSQG 181
FDPS S +YA +PC+SS C C P C Y + Y +G S+G
Sbjct: 147 DQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRG 206
Query: 182 TIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKV 239
+ ++ + D + FGC + N F +G+ GLG + S S +++
Sbjct: 207 VLAHDRLSLAGED-----IQGFVFGCGTSNQGPFGGT--SGLMGLGRSQLSLISQTMDQF 259
Query: 240 GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM-------SVIDGSYYVT-LEGISL 291
G FSYC+ + L+LG+ A + +STP+ + G +Y+ L GI++
Sbjct: 260 GGVFSYCLPPKE--SGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITV 317
Query: 292 GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD 351
G + D+ F +DSGT +T LVPS Y +R E F L YP
Sbjct: 318 GGE--DVQSPGFSAGGGGK---AIVDSGTIITSLVPSVYAAVRAE----FVSQLAEYPQA 368
Query: 352 PAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQES--SSVFCLAVGPSD 406
+ + C+ R++Q P++ F GGA++ +D++ V Y + +S CLA+ +
Sbjct: 369 APFSILDTCFDLTGLREVQ-VPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLAL--AS 425
Query: 407 INGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+ E D IIG Q+N V +D V Q+ F + C+ +
Sbjct: 426 LKSE--YDTPIIGNYQQKNLRVIFDTVGSQIGFAQETCDYI 464
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 114/386 (29%), Positives = 175/386 (45%), Gaps = 56/386 (14%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT----FDPSKSLTYATLPCDSSYC 156
++ ++G PP V+DTGS L W+ C AT F+P+ S +Y + C S C
Sbjct: 68 ISITVGTPPQNMSMVIDTGSELSWLHCN--TNTTATIPYPFFNPNISSSYTPISCSSPTC 125
Query: 157 TNDCGGYP--------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC- 207
T +P + C + Y + S+G + S+ F F G +F + FGC
Sbjct: 126 TTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGF-----GSSFNPGIVFGCM 180
Query: 208 --SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
S++ SD TG+ G+ + S S ++ KFSYCI ++ +L+LGE
Sbjct: 181 NSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKI--PKFSYCISGSDF----SGILLLGES 234
Query: 266 AILEGD----------STPMSVIDGSYY-VTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
G STP+ D S Y V LEGI + +K+L+I NLF + T + +
Sbjct: 235 NFSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQTM 294
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINR-DLQG 368
F D GT ++L+ Y LR E + G L P++ A LCY +N+ +L
Sbjct: 295 F-DLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPVNQSELPE 353
Query: 369 FPAMAFHFAGG-----ADLVLDAESVFYQESSSVFCLAVGPSDING-ERFKDLSIIGMIA 422
P+++ F G D +L F + SV+C G SD+ G E F IIG
Sbjct: 354 LPSVSLVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAF----IIGHHH 409
Query: 423 QQNYNVAYDLVSKQLYFQRIDCELLA 448
QQ+ + +DLV ++ C+L+
Sbjct: 410 QQSMWMEFDLVEHRVGLAHARCDLVG 435
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 166/373 (44%), Gaps = 47/373 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ ++ +IG PP P LDTGS L+W +CQPC C + +D S+S T+A CDS+
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 150
Query: 156 CTND------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C D C Y+ Y + + G + E +F + V FGC
Sbjct: 151 CKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGAS----VPGVVFGCGL 206
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLN-------YFEYAYNMLIL 262
NN TG+ G G S S + KVG+ FS+C ++ F+ ++
Sbjct: 207 NNTGIFRSNETGIAGFGRGPLSLPSQL-KVGN-FSHCFTAVSGRKPSTVLFDLPADLYKN 264
Query: 263 GEGAILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFK-KNDTWSDAGVFIDS 318
G G + +TP+ YY++L+GI++G L + + F KN T G IDS
Sbjct: 265 GRGTV---QTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGT---GGTIIDS 318
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFA 377
GT T L P Y+ + E + LP P + LC+S P + HF
Sbjct: 319 GTAFTSLPPRVYRLVHDEFAAHVK--LPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE 376
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
GA + L E+ ++ + CLA+ I GE ++IIG QQN +V YDL +
Sbjct: 377 -GATMHLPRENYVFEAKDGGNCSICLAI----IEGE----MTIIGNFQQQNMHVLYDLKN 427
Query: 435 KQLYFQRIDCELL 447
+L F R C+ L
Sbjct: 428 SKLSFVRAKCDKL 440
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 104/378 (27%), Positives = 168/378 (44%), Gaps = 49/378 (12%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT----------FDPSKSLTYATLP 150
+ IG PP P+ ++DTGS LIW +C + T ++P +S ++A LP
Sbjct: 86 LTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLP 145
Query: 151 CDSSYCTNDCGGYP-----DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
C C Y + C Y+ Y + ++ G + SE F F + + L GF
Sbjct: 146 CSDRLCQEGQFSYKNCARNNRCMYDELYGSA-EAGGVLASETFTFGVNAKVSLPL---GF 201
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGE 264
GC S G GL + SLV ++ +FSYC+ + E + L+ G
Sbjct: 202 GC----GALSAGDLVGASGLMGLSPGIMSLVSQLSVPRFSYCL--TPFAERKTSPLLFGA 255
Query: 265 GAILEGDSTPMSVIDGS-----------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
A L T +V S YYV L G+SLG K LD+ G
Sbjct: 256 MADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKPDGSGG 315
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA----WHLCYSGNINRDLQGF 369
+DSG+T+++L +A++ ++K V + + LP + LC++ ++
Sbjct: 316 TIVDSGSTMSYLEETAFRAVKKAVVEAVR--LPVANGTDEDYDDYELCFALPTGVAMEAV 373
Query: 370 --PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P + HF GGA + L ++ F + + + CLAVG S + F +SIIG + QQN +
Sbjct: 374 KTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCLAVGTSP---DGF-GVSIIGNVQQQNMH 429
Query: 428 VAYDLVSKQLYFQRIDCE 445
V +D+ +++ F C+
Sbjct: 430 VLFDVRNQKFSFAPTKCD 447
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 134/455 (29%), Positives = 199/455 (43%), Gaps = 64/455 (14%)
Query: 22 IFTSTTAA--PAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMAR----FIYLS 75
+ S+TA P P VT LH Y+P V ++ TL + R Y+
Sbjct: 36 LMKSSTACSEPKVTPPSTGVTVPLHHR---YDPCSPVPSKKVPTLEERLRRDQLRAAYIK 92
Query: 76 QKSS-------QKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
+K S A L +ST+ + + IG P V Q +DTGS + WV+C+
Sbjct: 93 RKFSGAGDIEQSDAATVPTTLGTSLSTLE-YVITVGIGSPAVTQTMSMDTGSDVSWVQCK 151
Query: 129 PCEQCGA---TTFDPSKSLTYATLPCDSSYCT--------NDCGGYPDECWYNIRYTNGP 177
PC QC + + FDPS S TY+ C S+ C N C +C Y + Y +
Sbjct: 152 PCSQCHSEVDSLFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGC--MSSQCQYIVNYGDSS 209
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
+ GT S+ G + + D FGCS + + ++Q G+ GLG S S
Sbjct: 210 STTGTYSSDTLTL-----GSSAMTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTA 264
Query: 238 KV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEG-DSTPM---SVIDGSYYVTLEGISLG 292
G+ FSYC L + L LG G+ G TPM + I Y V LE I +G
Sbjct: 265 GTFGTAFSYC---LPPTSGSSGFLTLGTGS--SGFVKTPMLRSTQIPTYYVVLLESIKVG 319
Query: 293 EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP--- 349
+ L++ ++F AG +DSGT +T L P+AY L + Q P+ P
Sbjct: 320 SQQLNLPTSVFS-------AGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGI 372
Query: 350 MDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDING 409
+D + +I+ P + F+GGA + L + + + SSS+ CLA P NG
Sbjct: 373 LDTCFDFSGQSSIS-----IPTVTLVFSGGAAVDLAFDGIMLEISSSIRCLAFTP---NG 424
Query: 410 ERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ L IIG + Q+ + V YD+ + F+ C
Sbjct: 425 DD-SSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/359 (29%), Positives = 153/359 (42%), Gaps = 42/359 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSS 154
+ V +G P V DTGS WV+C+PC + FDP++S TYA + C +
Sbjct: 161 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTYANISCAAP 220
Query: 155 YCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
C++ C G C Y ++Y +G S G + + D K F FGC
Sbjct: 221 ACSDLYIKGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFR----FGCGER 274
Query: 211 NAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGEG 265
N E G+ GLG TS +K G F++C Y ++ L
Sbjct: 275 NEGLYGEA-AGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFGPGSLP---- 329
Query: 266 AILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
A+ +TPM V +G YYV L GI +G K+L I ++F + +G +DSGT +T
Sbjct: 330 AVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVF------TTSGTIVDSGTVIT 383
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGA 380
L P+AY +LR Y PA L CY ++ P ++ F GGA
Sbjct: 384 RLPPAAYSSLRSAFASAMAER--GYKKAPALSLLDTCYDFTGMSEVA-IPTVSLLFQGGA 440
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
L + A + Y S S CL + D+ I+G + + V YD+ K + F
Sbjct: 441 SLDVHASGIIYAASVSQACLGFA----GNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGF 495
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 115/383 (30%), Positives = 172/383 (44%), Gaps = 65/383 (16%)
Query: 76 QKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQ--C 133
Q + KA A+L I T+ + V S+G P V Q +DTGS + WV+C+PC C
Sbjct: 109 QLAGSKAATVPANLGFSIGTL-QYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPC 167
Query: 134 GATT---FDPSKSLTYATLPCDSSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIG 184
+ FDP++S +Y+ +PC ++ C +N C G +C Y + Y +G + G
Sbjct: 168 YSQRDPLFDPTRSSSYSAVPCAAASCSQLALYSNGCSG--GQCGYVVSYGDGSTTTGVYS 225
Query: 185 SEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----G 240
S+ S+ K FL FGC H F GV GL SLV + G
Sbjct: 226 SDTLTLTGSNALKGFL----FGCGHAQQGL----FAGVDGLLGLGRQGQSLVSQASSTYG 277
Query: 241 SKFSYCIGNLNYFEYAYNMLILGEGAILEG-DSTPMSVI--DGSYY-VTLEGISLGEKML 296
FSYC L + + + LG + G +TP+ D +YY V L GIS+G + L
Sbjct: 278 GVFSYC---LPPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPL 334
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP-SYPMDPAWH 355
ID ++F +G +D+GT +T L P+AY LR + P YP PA
Sbjct: 335 SIDASVFA-------SGAVVDTGTVVTRLPPTAYSALRSAFR---AAMAPYGYPSAPATG 384
Query: 356 L---CYS----GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDIN 408
+ CY G + P ++ F GGA + L + CLA P+ +
Sbjct: 385 ILDTCYDFTRYGTVT-----LPTISIAFGGGAAMDLGTSGILTSG-----CLAFAPTGGD 434
Query: 409 GERFKDLSIIGMIAQQNYNVAYD 431
+ SI+G + Q+++ V +D
Sbjct: 435 SQ----ASILGNVQQRSFEVRFD 453
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 161/380 (42%), Gaps = 55/380 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V ++G P P LDTGS L+W +C PC C DP+ S TYA LPC ++
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGAAR 143
Query: 156 CT----NDCG----GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY--DVGF 205
C CG G C Y Y + + G I +++F F S L+ + F
Sbjct: 144 CRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRLTF 203
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
GC H N TG+ G G S S + + FSYC ++ FE +++ LG
Sbjct: 204 GCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNV--TSFSYCFTSM--FESKSSLVTLGGS 259
Query: 266 ----------------AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
IL+ S P Y+++L+GIS+G+ L + F+
Sbjct: 260 PAALYSHAHSGEVRTTPILKNPSQP-----SLYFLSLKGISVGKTRLPVPETKFRST--- 311
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ-- 367
IDSG ++T L Y+ ++ E GL PS A LC++ + +
Sbjct: 312 -----IIDSGASITTLPEEVYEAVKAEFAAQV-GLPPSGVEGSALDLCFALPVTALWRRP 365
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P++ H G + + VF + V C+ + + GE+ ++IG QQN +
Sbjct: 366 AVPSLTLHLEGADWELPRSNYVFEDLGARVMCIVLDAAP--GEQ----TVIGNFQQQNTH 419
Query: 428 VAYDLVSKQLYFQRIDCELL 447
V YDL + +L F C+ L
Sbjct: 420 VVYDLENDRLSFAPARCDRL 439
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 117/417 (28%), Positives = 179/417 (42%), Gaps = 53/417 (12%)
Query: 41 KLLHRDSLLYN--PN------DTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPG 92
KL HRD L N P+ + + ++R ++ ++ + D + G
Sbjct: 74 KLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVTDFGSDVVSGTEQG 133
Query: 93 ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATL 149
++V +G PP Q V+D+GS ++WV+CQPC +C FDP+ S TYA +
Sbjct: 134 SGE---YFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGI 190
Query: 150 PCDSSYC--TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
CDSS C ++ G C Y + Y +G ++GT+ E F G+ + ++ GC
Sbjct: 191 SCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTF-----GRVLIRNIAIGC 245
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
H N + G A S L + G FSYC+ ++ + L G GA+
Sbjct: 246 GHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCL--VSRGTESTGTLEFGRGAM 303
Query: 268 LEGDS-TPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
G + P+ YYV L G+ +G + I +F+ D GV +D+GT +T
Sbjct: 304 PVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTD-LGYGGVVMDTGTAVT 362
Query: 324 WLVPSAYQTLRKEVEDLFQGL---LPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFH 375
L AY+ R D F G LP + CY +L GF P ++F+
Sbjct: 363 RLPAPAYEAFR----DTFIGQTANLPRSDRVSIFDTCY------NLNGFVSVRVPTVSFY 412
Query: 376 FAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
F+GG L L A + + FC A S LSIIG I Q+ ++ D
Sbjct: 413 FSGGPILTLPARNFLIPVDGEGTFCFAFAASA------SGLSIIGNIQQEGIQISID 463
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 126/422 (29%), Positives = 185/422 (43%), Gaps = 78/422 (18%)
Query: 64 LNMSMARFIYLSQKSSQKAH----DTRAHLHPGISTVPVFYV-NFSIGQPPVPQLAVLDT 118
L + AR Y+ + S+ D H G S + YV +G P V Q+ ++DT
Sbjct: 84 LRRNRARSKYIMSRVSKGMMGDDADVSIPTHLGGSVDSLEYVVTVGLGTPSVSQVLLIDT 143
Query: 119 GSSLIWVKCQPCEQCGATT--------FDPSKSLTYATLPCDSSYC---TND-------C 160
GS L WV+CQPC +TT FDPSKS TYA +PC++ C T+D
Sbjct: 144 GSDLSWVQCQPCN---STTCYPQKDPLFDPSKSSTYAPIPCNTDACRDLTDDGYGGGCAS 200
Query: 161 GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFT 220
G +C + I Y +G ++G +E K F FGC H+ +D ++
Sbjct: 201 GDGAAQCGFAITYGDGSQTRGVYSNETLALAPGVAVKDFR----FGCGHDQDGAND-KYD 255
Query: 221 GVFGLGPATSSTHSLVEKV-GSKFSYCIGNLN---------YFEYAYNMLILGEGAILEG 270
G+ GLG A S V G FSYC+ LN ++ G +
Sbjct: 256 GLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLALGGGGAPSGGVVNTSGFVF-- 313
Query: 271 DSTPMSVIDGSYYVT-LEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSA 329
TPM + ++YV + GI++G + +D+ P+ F G+ IDSGT +T L +A
Sbjct: 314 --TPMIREEETFYVVNMTGITVGGEPIDVPPSAFS-------GGMIIDSGTVVTELQHTA 364
Query: 330 YQTLRKEVEDLFQGLLPSYPMDPAWHL--CY--SGNINRDLQGFPAMAFHFAGGADLVLD 385
Y L+ F+ + +YP+ L CY SG N L P +A F+GGA + LD
Sbjct: 365 YNALQAA----FRKAMAAYPLVRNGELDTCYDFSGYSNVTL---PKVALTFSGGATIDLD 417
Query: 386 AESVFYQESSSVFCLAV---GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
+ + CLA GP D G I+G + Q+ V YD ++ F+
Sbjct: 418 VPNGILLDD----CLAFQESGPDDQPG-------ILGNVNQRTLEVLYDAGRGRVGFRAA 466
Query: 443 DC 444
C
Sbjct: 467 VC 468
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 119/418 (28%), Positives = 184/418 (44%), Gaps = 48/418 (11%)
Query: 41 KLLHRD---SLLY-NPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTV 96
+LLHRD S+ Y N + + A+ +R + A +S K + D+R ++ S V
Sbjct: 62 RLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSDSRYEVNDFGSDV 121
Query: 97 PV--------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLT 145
++V +G PP Q V+D+GS ++WV+CQPC+ C + FDP+KS +
Sbjct: 122 VSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGS 181
Query: 146 YATLPCDSSYC--TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
Y + C SS C + G + C Y + Y +G ++GT+ E F KT + +V
Sbjct: 182 YTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFA-----KTVVRNV 236
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
GC H N + G + S L + G F YC+ ++ + L+ G
Sbjct: 237 AMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCL--VSRGTDSTGSLVFG 294
Query: 264 EGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
A+ G S V + YYV L+G+ +G + + +F +T D GV +D+G
Sbjct: 295 REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTET-GDGGVVMDTG 353
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAF 374
T +T L AY R + LP + CY DL GF P ++F
Sbjct: 354 TAVTRLPTGAYAAFRDGFKSQTAN-LPRASGVSIFDTCY------DLSGFVSVRVPTVSF 406
Query: 375 HFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+F G L L A + + S +C A S LSIIG I Q+ V++D
Sbjct: 407 YFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTG------LSIIGNIQQEGIQVSFD 458
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 163/371 (43%), Gaps = 55/371 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC--EQC---GATTFDPSKSLTYATLPCDS 153
+ +G P VPQ +LDTGSSL WV+C+PC QC FDP+ S +Y+ +PCDS
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDS 188
Query: 154 SYCTNDCGGYPDE---------CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
C G + C Y I Y +G G ++ K F
Sbjct: 189 QECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLGPGAIVKRFH---- 244
Query: 205 FGCSHNNAHFSDEQFTGVFGLG--PATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL 262
FGC H+ + GV GLG P + + + + G FS+C+ + L
Sbjct: 245 FGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLP-----PTGVSTGFL 299
Query: 263 GEGAILEGDS---TPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
GA + + TP+ +D Y + IS+ ++LDI P +F++ GV
Sbjct: 300 ALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFRE-------GVIT 352
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM-DPAWHL--CYSGNINRDLQGFPAMA 373
DSGT L+ L +AY LR F+ + YP+ P HL C++ D P ++
Sbjct: 353 DSGTVLSALQETAYTALRTA----FRSAMAEYPLAPPVGHLDTCFN-FTGYDNVTVPTVS 407
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
F GGA + LDA S + CLA S G+ + L IG ++Q+ V YD+
Sbjct: 408 LTFRGGATVHLDASSGVLMDG----CLAFWSS---GDEYTGL--IGSVSQRTIEVLYDMP 458
Query: 434 SKQLYFQRIDC 444
+++ F+ C
Sbjct: 459 GRKVGFRTGAC 469
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/376 (28%), Positives = 172/376 (45%), Gaps = 44/376 (11%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQCGAT---TFDPSKSLTYATLPCDS 153
+++ S+G PP+ A++DTGS L W +C PC C A +DP++S T++ LPC S
Sbjct: 95 AYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCAS 154
Query: 154 SYCTNDCGGY----PDECWYNIRYTNGPDSQGTIGSEQFNF---ETSDEGKTFLYDVGFG 206
C + C Y+ RY G + G + ++ + + + V FG
Sbjct: 155 PLCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFAGVAFG 213
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEG 265
CS N D +G+ GLG S SL+ ++G +FSYC+ + + + ++ G
Sbjct: 214 CSTANGGDMDGA-SGIVGLG---RSALSLLSQIGVGRFSYCL--RSDADAGASPILFGAL 267
Query: 266 AILEGDST--------PMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
A + GD P++ + YYV L GI++G L + + F + GV
Sbjct: 268 ANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGA-GGVI 326
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP-MDPAWHLCY-SGNINRDLQGFPAMA 373
+DSGTT T+L + Y LR+ GLL + LC+ +G + + P +
Sbjct: 327 VDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFEAGAADTPV---PRLV 383
Query: 374 FHFAGGADLVLDAESVF--YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
F FAGGA+ + +S F E V CL V P+ + +S+IG + Q + +V YD
Sbjct: 384 FRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPT-------RGVSVIGNVMQMDLHVLYD 436
Query: 432 LVSKQLYFQRIDCELL 447
L F DC L
Sbjct: 437 LDGATFSFAPADCASL 452
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 150/361 (41%), Gaps = 27/361 (7%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTN 158
++V +G P V DTGS L WVKC G F P S ++A +PC S C
Sbjct: 116 YFVKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPG-RVFRPKTSRSWAPIPCSSDTCKL 174
Query: 159 D-------CGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
D C C Y+ RY G ++G +G+E L DV GCS +
Sbjct: 175 DVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVVLGCSSS 234
Query: 211 NAHFSDEQFTGVFGLGPATSS-THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
+ S GV LG A S + G FSYC+ + A L G G +
Sbjct: 235 HDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYLAFGPGQVPR 294
Query: 270 GDSTPMSV-IDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+T + +D Y V ++ I + K LDI ++ GV +DSG TLT L
Sbjct: 295 TPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAK----SGGVILDSGNTLTVL 350
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRD--LQGFPAMAFHFAGGADLV 383
AY+ + + G +P P H CY+ R + P +A FAG A L
Sbjct: 351 AAPAYKAVVAALSKHLDG-VPKVSFPPFEH-CYNWTARRPGAPEIIPKLAVQFAGSARLE 408
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
A+S V C+ V + G LS+IG I QQ + +DL + Q+ F++ +
Sbjct: 409 PPAKSYVIDVKPGVKCIGVQEGEWPG-----LSVIGNIMQQEHLWEFDLKNMQVRFKQSN 463
Query: 444 C 444
C
Sbjct: 464 C 464
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 136/484 (28%), Positives = 201/484 (41%), Gaps = 74/484 (15%)
Query: 6 AILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQ---AQR 62
A+ L + +P +S + + + A P R L+HR P+ + A+R
Sbjct: 11 AVNLNNFAVVPASSFEPEAACSTSSANSDPNRASVPLVHRHGPC-APSAASGGKPSLAER 69
Query: 63 TLNMSMARFIYLSQKSSQKAHDTRA---HLHPGISTVPVF----------YVNFSIGQPP 109
L AR Y+ K++ A + G +++P F V IG P
Sbjct: 70 -LRRDRARANYIVTKAAGGRTAATAVSDAVGGGGTSIPTFLGDSVDSLEYVVTLGIGTPA 128
Query: 110 VPQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYCT------- 157
V Q+ ++DTGS L WV+C+PC +C A FDPS S +YA++PCDS C
Sbjct: 129 VQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAY 188
Query: 158 -NDC-GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS 215
+ C G C Y I Y N + G +E + + D GFGC ++ H
Sbjct: 189 GHGCTSGAAALCEYGIEYGNRATTTGVYSTETLTLKPG----VVVADFGFGCG-DHQHGP 243
Query: 216 DEQFTGVFGLGPATSSTHSLV-EKVGSKFSYCI----GNLNYFEYAYNMLILGEGAILEG 270
E+F G+ GLG A S S + G FSYC+ G + A
Sbjct: 244 YEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAGFLALGAPNSSSSSTAAAGF 303
Query: 271 DSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
TPM I Y VTL GIS+G L + P+ F +G+ IDSGT +T L
Sbjct: 304 LFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAFS-------SGMVIDSGTVITGLPA 356
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDP-----AWHLCY--SGNINRDLQGFPAMAFHFAGGA 380
+AY LR F+ + Y + P CY +G+ N + P +A F+GGA
Sbjct: 357 TAYAALRSA----FRSAMSEYRLLPPSNGAVLDTCYDFTGHTNVTV---PTIALTFSGGA 409
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ L + + CLA + + + IIG + Q+ + V YD + F+
Sbjct: 410 TIDLATPAGVLVDG----CLAFAGAGTD----DTIGIIGNVNQRTFEVLYDSGKGTVGFR 461
Query: 441 RIDC 444
C
Sbjct: 462 AGAC 465
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 111/397 (27%), Positives = 169/397 (42%), Gaps = 39/397 (9%)
Query: 63 TLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSL 122
TL ARF+YLS + GI P + V +IG P L LDT +
Sbjct: 52 TLLQDKARFLYLSSLAGVTKSSVPIASGRGIVQSPTYIVRANIGTPAQAMLVALDTSNDA 111
Query: 123 IWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSYCT---NDCGGYPDECWYNIRYTNGPD 178
W+ C C C ++ FDPSKS + TL C++ C N C +N+ Y
Sbjct: 112 AWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTY----- 166
Query: 179 SQGTIGSEQFNFETSDEGKTFLYDV----GFGCSHNNAHFSDEQFTGVFGLGPATSSTHS 234
G + + T DV FGC N A + G+ GLG S S
Sbjct: 167 -----GGSAIEAYLTQDTLTLATDVIPNYTFGC-INKASGTSLPAQGLMGLGRGPLSLIS 220
Query: 235 LVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGIS 290
+ + S FSYC+ N ++ ++ + + + +TP+ YYV L GI
Sbjct: 221 QSQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIR 280
Query: 291 LGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM 350
+G K++DI P D + AG DSGT T LV AY +R E + +
Sbjct: 281 VGNKIVDI-PTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKNA--NATS 337
Query: 351 DPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS---SVFCLAVGPSDI 407
+ CYSG++ FP++ F FA G ++ L +++ S+ S +A P+++
Sbjct: 338 LGGFDTCYSGSV-----VFPSVTFMFA-GMNVTLPPDNLLIHSSAGNLSCLAMAAAPTNV 391
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
N L++I + QQN+ V D+ + +L R C
Sbjct: 392 NSV----LNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 114/386 (29%), Positives = 161/386 (41%), Gaps = 39/386 (10%)
Query: 74 LSQKSSQKAHDTRAHLHPGISTVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP 129
LS+ S+ + + ++ P S + + + V IG P V DTGS L W +C+P
Sbjct: 103 LSKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEP 162
Query: 130 C-EQCGATT---FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGS 185
C C + F+PS S TY + C S C + C Y+I Y + +QG +
Sbjct: 163 CLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAESCSASNCVYSIGYGDKSFTQGFLAK 222
Query: 186 EQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSY 245
E+F SD L DV FGC NN D + S + FSY
Sbjct: 223 EKFTLTNSD----VLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSY 278
Query: 246 CIGNLNYFEYAYNMLILGEGAILEGDS-TPMSVIDGS--YYVTLEGISLGEKMLDIDPNL 302
C+ ++ + L G I E TP+S + Y + + GIS+G+K L I PN
Sbjct: 279 CLP--SFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNS 336
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYS 359
F S G IDSGT T L Y LR +F+ + SY + L CY
Sbjct: 337 F------STEGAIIDSGTVFTRLPTKVYAELRS----VFKEKMSSYKSTSGYGLFDTCYD 386
Query: 360 GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDL-SII 418
D +P +AF FAGG + LD + S CLA +D DL +I
Sbjct: 387 FT-GLDTVTYPTIAFSFAGGTVVELDGSGISLPIKISQVCLAFAGND-------DLPAIF 438
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDC 444
G + Q +V YD+ ++ F C
Sbjct: 439 GNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 169/379 (44%), Gaps = 54/379 (14%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ---PCEQCGATTFDPSKSLTYATLPCDSS 154
+ V+ IG PP Q +LDTGS L W++C P + ++ FDPS S +++ LPC+
Sbjct: 81 ILLVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHP 140
Query: 155 YCTNDCGGYP-----DE---CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
C + D+ C Y+ Y +G ++G + E+ F S + G
Sbjct: 141 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLI----LG 196
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI------------------G 248
C+ + SD + G+ G+ S S + +KFSYC+
Sbjct: 197 CAEES---SDAK--GILGMNLGRLSFASQAKL--TKFSYCVPTRQVRPGFTPTGSFYLGE 249
Query: 249 NLNYFEYAY-NMLILGEGAILEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKN 306
N N + Y N+L + S M +D +Y V ++GI +G + L+I + F+
Sbjct: 250 NPNSGGFRYINLLTFSQ-------SQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRP- 301
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-YPMDPAWHLCYSGNINRD 365
D IDSG+ T+LV AY +R+EV L L Y +C++GN
Sbjct: 302 DPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDMCFNGNAIEI 361
Query: 366 LQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
+ M F F G ++V++ E V V C+ +G S++ G +IIG QQN
Sbjct: 362 GRLIGNMVFEFDKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGA---ASNIIGNFHQQN 418
Query: 426 YNVAYDLVSKQLYFQRIDC 444
V +DL ++++ F + DC
Sbjct: 419 IWVEFDLANRRVGFGKADC 437
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 166/373 (44%), Gaps = 47/373 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ ++ +IG PP P LDTGS L+W +CQPC C + +D S+S T+A CDS+
Sbjct: 35 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 94
Query: 156 CTND------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C D C Y+ Y + + G + E +F + V FGC
Sbjct: 95 CKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGAS----VPGVVFGCGL 150
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLN-------YFEYAYNMLIL 262
NN TG+ G G S S + KVG+ FS+C ++ F+ ++
Sbjct: 151 NNTGIFRSNETGIAGFGRGPLSLPSQL-KVGN-FSHCFTAVSGRKPSTVLFDLPADLYKN 208
Query: 263 GEGAILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFK-KNDTWSDAGVFIDS 318
G G + +TP+ YY++L+GI++G L + + F KN T G IDS
Sbjct: 209 GRGTV---QTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGT---GGTIIDS 262
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFA 377
GT T L P Y+ + E + LP P + LC+S P + HF
Sbjct: 263 GTAFTSLPPRVYRLVHDEFAAHVK--LPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE 320
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
GA + L E+ ++ + CLA+ I GE ++IIG QQN +V YDL +
Sbjct: 321 -GATMHLPRENYVFEAKDGGNCSICLAI----IEGE----MTIIGNFQQQNMHVLYDLKN 371
Query: 435 KQLYFQRIDCELL 447
+L F R C+ L
Sbjct: 372 SKLSFVRAKCDKL 384
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 114/389 (29%), Positives = 170/389 (43%), Gaps = 55/389 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GAT-TFDPSKSLTYATLPCDSS 154
+ V+ S+G PP P LDTGS L+W +C PC C GA DP+ S T+A + CD+
Sbjct: 94 YLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRCDAP 153
Query: 155 YCT----NDCGG-----YPDECWYNIRYTNGPDSQGTIGSEQFNF---ETSDEGKTFLYD 202
C CG C Y Y + + G + S++F F + +D G
Sbjct: 154 VCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVSERR 213
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLI 261
+ FGC H N TG+ G G SL ++G + FSYC ++ FE +++
Sbjct: 214 LTFGCGHFNKGIFQANETGIAGFG---RGRWSLPSQLGVTSFSYCFTSM--FESTSSLVT 268
Query: 262 LGEGAI---LEG--DSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
LG L G STP+ + D S Y+++L+ I++G + I ++ +A
Sbjct: 269 LGVAPAELHLTGQVQSTPL-LRDPSQPSLYFLSLKAITVGATRIPIP----ERRQRLREA 323
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKE-----------VE----DLFQGLLPSYPMDPAWHLC 357
IDSG ++T L Y+ ++ E VE DL L + A+
Sbjct: 324 SAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWR 383
Query: 358 YSGNINRDLQGFPAMAFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLS 416
+ G P + FH GGAD L E+ VF + V CL + + G++
Sbjct: 384 WRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQ---TV 440
Query: 417 IIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+IG QQN +V YDL + L F CE
Sbjct: 441 VIGNYQQQNTHVVYDLENDVLSFAPARCE 469
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 116/435 (26%), Positives = 182/435 (41%), Gaps = 63/435 (14%)
Query: 38 LVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKA--------HDTRAHL 89
L +LLHRD + N T R L + R ++ K++ R +
Sbjct: 68 LHIRLLHRDR--FAANATPAQLLARRLQRDVLRAAWIISKAAANGTPPPVAGLSSARGFV 125
Query: 90 HPGISTVPV---FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKS 143
P +S P + ++G P V L LDT S L W++CQPC +C FDP S
Sbjct: 126 APVVSRAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHS 185
Query: 144 LTYATLPCDSSYCT---NDCGGYPDE--CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
+Y + +++ C GG C Y + Y +G + G E F G
Sbjct: 186 TSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTF----AGGV 241
Query: 199 FLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGN-LNYFEYAY 257
L + GC H+N G+ GLG S + ++ G+ FSYC+ + L+
Sbjct: 242 RLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDHNGT-FSYCLVDFLSGPGSLS 300
Query: 258 NMLILGEGAILEGDSTP-----MSVIDGS----YYVTLEGISLG--------EKMLDIDP 300
+ L G GA+ D++P +V++ + YYV L GIS+G E+ L +DP
Sbjct: 301 STLTFGAGAV---DTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDP 357
Query: 301 NLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCY 358
+ GV +DSGT +T L AY R + L P+ + CY
Sbjct: 358 YTGR-------GGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCY 410
Query: 359 SGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSI 417
+ R ++ P ++ HFAG ++ L ++ +S C A + + +SI
Sbjct: 411 TVG-GRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDH-----SVSI 464
Query: 418 IGMIAQQNYNVAYDL 432
IG I QQ + + YD+
Sbjct: 465 IGNIQQQGFRIVYDI 479
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 167/372 (44%), Gaps = 40/372 (10%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ---PCEQCGATTFDPSKSLTYATLPCDSS 154
+ V+ IG PP Q +LDTGS L W++C P + +T FDPS S +++ LPC+
Sbjct: 76 ILLVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHP 135
Query: 155 YCTNDCGGY--PDE------CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
C + P C Y+ Y +G ++G + E+ F TS + G
Sbjct: 136 LCKPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTPPLI----LG 191
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFE--YAYNMLILGE 264
C+ + SD++ G+ G+ S S + +KFSYC+ LGE
Sbjct: 192 CAEDA---SDDK--GILGMNLGRLSFASQAKI--TKFSYCVPTRQVRPGFTPTGSFYLGE 244
Query: 265 GAILEG----------DSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
G S M +D ++ V L+GI +G K L+I + F+ + + +
Sbjct: 245 NPNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQS 304
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLF-QGLLPSYPMDPAWHLCYSGNINRDLQGFPAM 372
+ IDSG+ T+LV AY +R+EV L L Y +C+ GN + M
Sbjct: 305 M-IDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVSDMCFDGNAMEIGRLIGNM 363
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
F F G ++V++ V V C+ +G S++ G +IIG QQN V +D+
Sbjct: 364 VFEFDKGVEIVIEKGRVLADVGGGVHCVGIGRSEMLGA---ASNIIGNFHQQNLWVEFDI 420
Query: 433 VSKQLYFQRIDC 444
++++ F + DC
Sbjct: 421 ANRRVGFGKADC 432
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 170/385 (44%), Gaps = 54/385 (14%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-----GATTFDPSKSLTYATLPCD 152
+ +N S+G PP+ ++DTGS+LIW +C PC +C A P++S T++ LPC+
Sbjct: 90 AYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCN 149
Query: 153 SSYCT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
S+C C C YN Y +G + G + +E G V
Sbjct: 150 GSFCQYLPTSSRPRTCNAT-AACAYNYTYGSG-YTAGYLATETLTV-----GDGTFPKVA 202
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCI------GNLNYFEYAY 257
FGCS N + +G+ GLG SLV ++ +FSYC+ G + +
Sbjct: 203 FGCSTENGV---DNSSGIVGLG---RGPLSLVSQLAVGRFSYCLRSDMADGGASPILFG- 255
Query: 258 NMLILGEGAILEGD---STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
++ L EG++++ P YYV L GI++ L + + F T G
Sbjct: 256 SLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGT 315
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH---LCY---SGNINRDLQG 368
+DSGTTLT+L Y +++ + L + P A + LCY +G + ++
Sbjct: 316 IVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVR- 374
Query: 369 FPAMAFHFAGGADLVLDAESVFY------QESSSVFCLAVGPSDINGERFKDLSIIGMIA 422
P +A FAGGA + ++ F Q +V CL V P+ + +SIIG +
Sbjct: 375 VPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDD----LPISIIGNLM 430
Query: 423 QQNYNVAYDLVSKQLYFQRIDCELL 447
Q + ++ YD+ F DC L
Sbjct: 431 QMDMHLLYDIDGGMFSFAPADCAKL 455
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 117/420 (27%), Positives = 168/420 (40%), Gaps = 89/420 (21%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ------------------------------ 128
++V F +G P P L V DTGS L WVKC+
Sbjct: 55 YFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSV 114
Query: 129 -PCEQCGATTFDPSKSLTYATLPCDSSYCTND-------CGGYPDECWYNIRYTNGPDSQ 180
A F P +S T+A +PC S CT C C Y RY +G ++
Sbjct: 115 SAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRYKDGSAAR 174
Query: 181 GTIGSEQFNFETS------DEGKTFLYDVGFGCSHNNAHFSDEQFT---GVFGLGPATSS 231
GT+G++ S + + L V GC+ + ++ E F GV LG + S
Sbjct: 175 GTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTS---YTGESFLASDGVLSLGYSNVS 231
Query: 232 THS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS--------- 281
S + G +FSYC+ + A + L G + S + GS
Sbjct: 232 FASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPGARQT 291
Query: 282 -----------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAY 330
Y V + G+S+ ++L I P L D G +DSGT+LT LV AY
Sbjct: 292 PLLLDHRMRPFYAVAVNGVSVDGELLRI-PRLVW--DVQKGGGAILDSGTSLTVLVSPAY 348
Query: 331 QTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG------FPAMAFHFAGGADLVL 384
+ + + G LP MDP + CY N L G PA+A HFAG A L
Sbjct: 349 RAVVAALGKKLVG-LPRVAMDP-FDYCY--NWTSPLTGEDLAVAVPALAVHFAGSARLQP 404
Query: 385 DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+S + V C+ + D G +S+IG I QQ + +DL +++L F+R C
Sbjct: 405 PPKSYVIDAAPGVKCIGLQEGDWPG-----VSVIGNILQQEHLWEFDLKNRRLRFKRSRC 459
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 105/350 (30%), Positives = 160/350 (45%), Gaps = 39/350 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P VLDTGS + W++C+PC C F+P+ S TY +L C +
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQ 221
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C T+ C ++C Y + Y +G + G + ++ F S GK + +V GC H+N
Sbjct: 222 CSLLETSAC--RSNKCLYQVSYGDGSFTVGELATDTVTFGNS--GK--INNVALGCGHDN 275
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYA---YNMLILGEGAI 267
FTG GL S+ ++ + FSYC+ + + + + +N + LG
Sbjct: 276 EGL----FTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLG---- 327
Query: 268 LEGDST-PM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
GD+T P+ ID YYV L G S+G + + + P+ D GV +D GT +T
Sbjct: 328 -GGDATAPLLRNKKIDTFYYVGLSGFSVGGEKV-VLPDAIFDVDASGSGGVILDCGTAVT 385
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
L AY +LR L L + CY + ++ P +AFHF GG L
Sbjct: 386 RLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVK-VPTVAFHFTGGKSLD 444
Query: 384 LDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
L A++ + S FC A P+ LSIIG + QQ + YDL
Sbjct: 445 LPAKNYLIPVDDSGTFCFAFAPTS------SSLSIIGNVQQQGTRITYDL 488
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 111/373 (29%), Positives = 166/373 (44%), Gaps = 47/373 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ ++ +IG PP P LDTGS L+W +CQPC C + +D S+S T+A CDS+
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 150
Query: 156 CTND------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C D C ++ Y + + G + E +F + V FGC
Sbjct: 151 CKLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSFVAGAS----VPGVVFGCGL 206
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLN-------YFEYAYNMLIL 262
NN TG+ G G S S + KVG+ FS+C ++ F+ ++
Sbjct: 207 NNTGIFRSNETGIAGFGRGPLSLPSQL-KVGN-FSHCFTAVSGRKPSTVLFDLPADLYKN 264
Query: 263 GEGAILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFK-KNDTWSDAGVFIDS 318
G G + +TP+ YY++L+GI++G L + + F KN T G IDS
Sbjct: 265 GRGTV---QTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGT---GGTIIDS 318
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFA 377
GT T L P Y+ + E + LP P + LC+S P + HF
Sbjct: 319 GTAFTSLPPRVYRLVHDEFAAHVK--LPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE 376
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
GA + L E+ ++ + CLA+ I GE ++IIG QQN +V YDL +
Sbjct: 377 -GATMHLPRENYVFEAKDGGNCSICLAI----IEGE----MTIIGNFQQQNMHVLYDLKN 427
Query: 435 KQLYFQRIDCELL 447
+L F R C+ L
Sbjct: 428 SKLSFVRAKCDKL 440
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 105/350 (30%), Positives = 160/350 (45%), Gaps = 39/350 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P VLDTGS + W++C+PC C F+P+ S TY +L C +
Sbjct: 162 YFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQ 221
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C T+ C ++C Y + Y +G + G + ++ F S GK + +V GC H+N
Sbjct: 222 CSLLETSAC--RSNKCLYQVSYGDGSFTVGELATDTVTFGNS--GK--INNVALGCGHDN 275
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYA---YNMLILGEGAI 267
FTG GL S+ ++ + FSYC+ + + + + +N + LG
Sbjct: 276 EGL----FTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLG---- 327
Query: 268 LEGDST-PM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
GD+T P+ ID YYV L G S+G + + + P+ D GV +D GT +T
Sbjct: 328 -GGDATAPLLRNKKIDTFYYVGLSGFSVGGEKV-VLPDAIFDVDASGSGGVILDCGTAVT 385
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
L AY +LR L L + CY + ++ P +AFHF GG L
Sbjct: 386 RLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVK-VPTVAFHFTGGKSLD 444
Query: 384 LDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
L A++ + S FC A P+ LSIIG + QQ + YDL
Sbjct: 445 LPAKNYLIPVDDSGTFCFAFAPTS------SSLSIIGNVQQQGTRITYDL 488
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 124/372 (33%), Positives = 167/372 (44%), Gaps = 65/372 (17%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCT 157
+ FS+G PP A+ DTGS LIW KC C++C G+ ++ P+KS +++ LPC S+ C
Sbjct: 83 MTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSALCR 142
Query: 158 N-------DCGGYPDE---CWYNIRYTNGPDS------QGTIGSEQFNFETSDEGKTFLY 201
CGG C Y RY+ G S QG +GSE F G +
Sbjct: 143 TLESQSLATCGGTRARGAVCSY--RYSYGLSSNPHHYTQGYMGSETFTL-----GSDAVQ 195
Query: 202 DVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
+GFGC+ + V S L KVG+ FSYC L + L+
Sbjct: 196 GIGFGCTTMSEGGYGSGSGLVGLGRGKLSLVRQL--KVGA-FSYC---LTSDPSTSSPLL 249
Query: 262 LGEGAILEG---DSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
G GA L G STP+ + S Y V L+ IS+G K G+
Sbjct: 250 FGAGA-LTGPGVQSTPLVNLKTSTFYTVNLDSISIGA----------AKTPGTGRHGIIF 298
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGL-LPSYPMDPAWHLCY--SGNINRDLQGFPAMA 373
DSGTTLT+L AY TL E L Q L P + +C+ SG FP+M
Sbjct: 299 DSGTTLTFLAEPAY-TL-AEAGLLSQTTNLTRVPGTDGYEVCFQTSGGAV-----FPSMV 351
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
HF GG D+ L E+ F + SV C V S ++SI+G I Q +Y++ YDL
Sbjct: 352 LHFDGG-DMALKTENYFGAVNDSVSCWLVQKSP------SEMSIVGNIMQMDYHIRYDLD 404
Query: 434 SKQLYFQRIDCE 445
L FQ +C+
Sbjct: 405 KSVLSFQPTNCD 416
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 118/445 (26%), Positives = 182/445 (40%), Gaps = 60/445 (13%)
Query: 42 LLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKA--------HDTRAHLHPGI 93
LLHRDS + N T R L R ++ K++ R + P +
Sbjct: 68 LLHRDS--FAVNATAAELLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGRGLVAPVV 125
Query: 94 STVPV---FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYA 147
S P + ++G P V L LDT S L W++CQPC +C FDP S +Y
Sbjct: 126 SRAPTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYG 185
Query: 148 TLPCDSSYCT---NDCGGYPDE--CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
+ D+ C GG C Y ++Y +G S T + + G
Sbjct: 186 EMNYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAY 245
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--SKFSYCIGN-LNYFEYAYNM 259
+ GC H+N G+ GLG S + +G + FSYC+ + ++ +
Sbjct: 246 LSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSST 305
Query: 260 LILGEGAILEGDSTPMSVIDGS---------YYVTLEGISLG--------EKMLDIDPNL 302
L G GA+ D++P + + YYV L G+S+G E+ L +DP
Sbjct: 306 LTFGAGAV---DTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYT 362
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYSG 360
+ GV +DSGTT+T L AY R L P+ + CY+
Sbjct: 363 GR-------GGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTV 415
Query: 361 NINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIG 419
++ PA++ HFAGG ++ L ++ +S C A G + +S+IG
Sbjct: 416 GGRAGVK-VPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFA-----FAGTGDRSVSVIG 469
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDC 444
I QQ + V YDL +++ F +C
Sbjct: 470 NILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 117/439 (26%), Positives = 192/439 (43%), Gaps = 58/439 (13%)
Query: 21 RIFTSTTAAPAAGKPKRLVTKLLHRDSL-----LYNPNDTVDAQAQRTLNMSMARFIYLS 75
+ S T A ++ K K KL+HRD + ++ +A+ QR + + L+
Sbjct: 54 KKLNSATEASSSAKYK---LKLVHRDKVPTFNTYHDHRTRFNARMQRDTKRAASLLRRLA 110
Query: 76 QKSSQKA-----HDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC 130
A D + + G ++V +G PP Q V+D+GS +IWV+C+PC
Sbjct: 111 AGKPTYAAEAFGSDVVSGMEQGSGE---YFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPC 167
Query: 131 EQC---GATTFDPSKSLTYATLPCDSSYCT--NDCGGYPDECWYNIRYTNGPDSQGTIGS 185
QC F+P+ S +++ + C S+ C+ ++ + C Y + Y +G ++GT+
Sbjct: 168 TQCYHQSDPVFNPADSSSFSGVSCASTVCSHVDNAACHEGRCRYEVSYGDGSYTKGTLAL 227
Query: 186 EQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSY 245
E F G+T + +V GC H+N + G S L + G FSY
Sbjct: 228 ETITF-----GRTLIRNVAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGGAFSY 282
Query: 246 CIGNLNYFEYAYNMLILGEGAILEGDS-TPM---SVIDGSYYVTLEGISLGEKMLDIDPN 301
C+ ++ + +L G A+ G + P+ YY+ L G+ +G + I +
Sbjct: 283 CL--VSRGIESSGLLEFGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISED 340
Query: 302 LFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CY 358
+FK ++ D GV +D+GT +T L AY+ R D F + P + CY
Sbjct: 341 VFKLSE-LGDGGVVMDTGTAVTRLPTVAYEAFR----DGFIAQTTNLPRASGVSIFDTCY 395
Query: 359 SGNINRDLQGF-----PAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERF 412
DL GF P ++F+F+GG L L A + + FC A PS
Sbjct: 396 ------DLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSS------ 443
Query: 413 KDLSIIGMIAQQNYNVAYD 431
LSIIG I Q+ ++ D
Sbjct: 444 SGLSIIGNIQQEGIQISVD 462
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 106/365 (29%), Positives = 151/365 (41%), Gaps = 41/365 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC----GATTFDPSKSLTYATLPCD-- 152
+ V +G P + DTGS + W +CQPC + FDPS+S +Y + C
Sbjct: 149 YIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSS 208
Query: 153 -----SSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
+S N G C Y I+Y + S G G+E+ ++D ++ FGC
Sbjct: 209 ICNSLTSATGNTPGCASSACVYGIQYGDSSFSVGFFGTEKLTLTSTDA----FNNIYFGC 264
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
NN + S +K FSYC L + L G A
Sbjct: 265 GQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYC---LPSSSSSTGFLTFGGSAS 321
Query: 268 LEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
TP+S I Y + GIS+G K L I ++F S AG IDSGT +T
Sbjct: 322 KNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVF------STAGAIIDSGTVITR 375
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGAD 381
L P+AY LR F+ L+ YPM A + CY + + P + F F+ G +
Sbjct: 376 LPPAAYSALRAS----FRNLMSKYPMTKALSILDTCYDFSSYTTIS-VPKIGFSFSSGIE 430
Query: 382 LVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ +DA + Y S S CLA G SD D+ I G + Q+ V YD + ++ F
Sbjct: 431 VDIDATGILYASSLSQVCLAFAGNSDAT-----DVFIFGNVQQKTLEVFYDGSAGKVGFA 485
Query: 441 RIDCE 445
C
Sbjct: 486 PGGCS 490
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 112/375 (29%), Positives = 156/375 (41%), Gaps = 45/375 (12%)
Query: 89 LHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSK 142
L PG S V + +G P V+DTGSSL W++C PC Q G +DP
Sbjct: 123 LTPGTSVGVGNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGP-LYDPRA 181
Query: 143 SLTYATLPCDSSYCTN--------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSD 194
S TYAT+PC +S C + C Y Y + S G + + +F +
Sbjct: 182 SSTYATVPCSASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFGSGS 241
Query: 195 EGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCIGNLNYF 253
+ +GC +N G+ GL S + L +G FSYC+
Sbjct: 242 YPNFY-----YGCGQDNEGLFGRS-AGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPAST 295
Query: 254 EYAYNMLILGEGAILEGDSTPM--SVIDGS-YYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
Y L +G TPM S +D S Y+VTL G+S+G L + P + T
Sbjct: 296 GY----LSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPT-- 349
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFP 370
IDSGT +T L + Y L K V G+ S P C+ G ++ P
Sbjct: 350 ----IIDSGTVITRLPTAVYTALSKAVAAAMVGVQ-SAPAFSILDTCFQGQASQ--LRVP 402
Query: 371 AMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
A+A FAGGA L L ++V S CLA P+D +IIG QQ ++V Y
Sbjct: 403 AVAMAFAGGATLKLATQNVLIDVDDSTTCLAFAPTD-------STTIIGNTQQQTFSVVY 455
Query: 431 DLVSKQLYFQRIDCE 445
D+ ++ F C
Sbjct: 456 DVAQSRIGFAAGGCS 470
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 109/382 (28%), Positives = 176/382 (46%), Gaps = 52/382 (13%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQ--CGATTFDPSKSLTYATLPCDSSYCTN 158
V+ ++G PP V+DTGS L W+ C + ++TF+P S +Y+ +PC SS CT+
Sbjct: 75 VSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCSSSTCTD 134
Query: 159 DCGGYPDE--------CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
+P C + Y + S+G + ++ F G + + +V FGC +
Sbjct: 135 QTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYI-----GSSGIPNVVFGCMDS 189
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYN-MLILGEGAI- 267
+ E+ + GL + S V ++G KFSYCI EY ++ +L+LG+
Sbjct: 190 IFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCIS-----EYDFSGLLLLGDANFS 244
Query: 268 ---------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
L STP+ D +Y V LEGI + K+L I ++F+ + T + +D
Sbjct: 245 WLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGA-GQTMVD 303
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP-----MDPAWHLCYSGNINR-DLQGFPA 371
SGT T+L+ AY LR + G L Y A LCY N+ L P+
Sbjct: 304 SGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPS 363
Query: 372 MAFHFAGGADLVLDAESVFY------QESSSVFCLAVGPSDING-ERFKDLSIIGMIAQQ 424
+ F GA++ + + + Y + + S+ C G SD+ G E F +IG + QQ
Sbjct: 364 VTLVFR-GAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAF----VIGHLHQQ 418
Query: 425 NYNVAYDLVSKQLYFQRIDCEL 446
N + +DL ++ I C+L
Sbjct: 419 NVWMEFDLKKSRIGLAEIRCDL 440
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 167/384 (43%), Gaps = 56/384 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT------FDPSKSLTYATLPCD 152
+ + +G PPV LA+ DTGS L+WVKC+ + +T F PS S TY + CD
Sbjct: 110 YLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGRVGCD 169
Query: 153 SSYC---TNDCGGYPD-ECWYNIRYTNGPDSQGTIGSEQFNFET-SDEGKTF-------- 199
+ C ++ PD C Y Y +G + G + +E F F T +D KT
Sbjct: 170 TKACRALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSHGNNNNN 229
Query: 200 --------LYDVGFGCS-HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI--- 247
+ + FGCS F + G+ G + +S +G KFSYC+
Sbjct: 230 SSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGGGPVSLASQLGATTSLGRKFSYCLAPY 289
Query: 248 GNLNYFEYAYNMLILGEGAILE---GDSTPMSV--IDGSYYVTLEGISLGEKMLDIDPNL 302
N N A + L G A++ STP+ ++ Y + L+ I++
Sbjct: 290 ANTN----ASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVAGT-------- 337
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SG 360
K+ T + A + +DSGTTLT+L + L K++ + P + LCY SG
Sbjct: 338 -KRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESP-EKILDLCYDISG 395
Query: 361 NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGM 420
D G P + GG ++ L ++ F V CLA+ + + +SI+G
Sbjct: 396 VRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLAL----VATSERQSVSILGN 451
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDC 444
IAQQN +V YDL + F DC
Sbjct: 452 IAQQNLHVGYDLEKGTVTFAAADC 475
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 111/440 (25%), Positives = 186/440 (42%), Gaps = 45/440 (10%)
Query: 37 RLVTKL--LHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS 94
R +T++ LH+ L N +TV +Q Q+ + + ++ ++A A L G++
Sbjct: 106 RDLTRIQTLHKRVLEKNNQNTV-SQKQKKNDKEVVTTTPVASSVEEQAGQLVATLESGMT 164
Query: 95 T-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLP 150
++++ +G PP +LDTGS L W++C PC C +DP S +Y +
Sbjct: 165 LGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNIT 224
Query: 151 CDSSYCT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLY 201
C+ C C C Y Y + ++ G E F T++ G + LY
Sbjct: 225 CNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELY 284
Query: 202 DVG---FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYN 258
+V FGC H N + S + L G FSYC+ + N +
Sbjct: 285 NVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSS 344
Query: 259 MLILGEGAILEGD---------STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
LI GE L + +++D YYV ++ I + ++L+I +TW
Sbjct: 345 KLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNI------PEETW 398
Query: 310 S-----DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINR 364
+ G IDSGTTL++ AY+ ++ ++ + +G P Y P C++ +
Sbjct: 399 NISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIH 458
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
++Q P + FA GA E+ F + + CLA + G SIIG QQ
Sbjct: 459 NVQ-LPELGIAFADGAVWNFPTENSFIWLNEDLVCLA-----MLGTPKSAFSIIGNYQQQ 512
Query: 425 NYNVAYDLVSKQLYFQRIDC 444
N+++ YD +L + C
Sbjct: 513 NFHILYDTKRSRLGYAPTKC 532
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 109/363 (30%), Positives = 161/363 (44%), Gaps = 50/363 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ + +G P Q ++D+GS + WV+C+PC QC + FDPS S TY+ C S+
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDPLFDPSLSSTYSPFSCSSAA 190
Query: 156 CT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C N C +C Y +RY +G + GT S+ G + + FGCSH
Sbjct: 191 CAQLGQDGNGCSSS-SQCQYIVRYADGSSTTGTYSSDTLAL-----GSNTISNFQFGCSH 244
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
+ F+D G+ GLG S S G+ FSYC L + L LG G
Sbjct: 245 VESGFNDLT-DGLMGLGGGAPSLASQTAGTFGTAFSYC---LPPTPSSSGFLTLGAGT-- 298
Query: 269 EG-DSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
G TPM S + Y V LE I +G L I ++F AG+ +DSGT +T
Sbjct: 299 SGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVF-------SAGMVMDSGTIITR 351
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYP---MDPAWHLCYSGNINRDLQGFPAMAFHFAGGAD 381
L +AY L + + P+ P MD + +SG + L P++A F+GGA
Sbjct: 352 LPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFD--FSGQSSVRL---PSVALVFSGGAV 406
Query: 382 LVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
+ LDA + CLA + + I+G + Q+ + V YD+ + F+
Sbjct: 407 VNLDANGIILGN-----CLAFAANSDD----SSPGIVGNVQQRTFEVLYDVGGGAVGFKA 457
Query: 442 IDC 444
C
Sbjct: 458 GAC 460
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 110/374 (29%), Positives = 154/374 (41%), Gaps = 42/374 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFDPSKSLTYATLPCDS 153
+ V+ +G P V DTGS L WV+C PC G F PS S T++ + C
Sbjct: 85 YVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGE 144
Query: 154 SYC---TNDCGGYP--DECWYNIRYTNGPDSQGTIGSEQFNFET------SDEGKTFLYD 202
C C P D C Y + Y + + G +G++ T S+ L
Sbjct: 145 PECPRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPG 204
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE-KVGSKFSYCIGNLNYFEYAYNMLI 261
FGC NN + G+FGLG S S K G FSYC+ + + + Y L
Sbjct: 205 FVFGCGENNTGLFGKA-DGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAHGYLSLG 263
Query: 262 LGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
A TPM S YYV L GI + + + + + W AG+ +DS
Sbjct: 264 TPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVS----SRPALW-PAGLIVDS 318
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL-----CY--SGNINRDLQGFPA 371
GT +T L P AY LR F + Y A L CY + + N + PA
Sbjct: 319 GTVITRLAPRAYSALRTA----FLSAMGKYGYKRAPRLSILDTCYDFTAHANATVS-IPA 373
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+A FAGGA + +D V Y + CLA P+ NG + I+G Q+ V YD
Sbjct: 374 VALVFAGGATISVDFSGVLYVAKVAQACLAFAPNG-NG---RSAGILGNTQQRTVAVVYD 429
Query: 432 LVSKQLYFQRIDCE 445
+ +++ F C
Sbjct: 430 VGRQKIGFAAKGCS 443
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 170/421 (40%), Gaps = 85/421 (20%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ------PCEQCG------------------ 134
++V F +G P P L V DTGS L WVKC P G
Sbjct: 107 YFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTSSLSAAA 166
Query: 135 ------ATTFDPSKSLTYATLPCDSSYCTND-------CGGYPDECWYNIRYTNGPDSQG 181
A F P +S T+A +PC S CT C C Y+ RY +G ++G
Sbjct: 167 ASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRYKDGSAARG 226
Query: 182 TIGSEQFNFETSDEG------KTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS- 234
T+G++ S G + L V GC+ + S GV LG + S S
Sbjct: 227 TVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLASDGVLSLGYSNISFASR 286
Query: 235 LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD----------------------- 271
+ G +FSYC+ + A + L G +
Sbjct: 287 AAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTACAGGGSPAAAPPGPGGA 346
Query: 272 -STPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
TP+ + + Y VT+ GIS+ ++L I P L D G +DSGT+LT LV
Sbjct: 347 RQTPLLLDHRMRPFYAVTVNGISVDGELLRI-PRLVW--DVAKGGGAILDSGTSLTVLVS 403
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY---SGNINRDLQ-GFPAMAFHFAGGADLV 383
AY+ + + G LP MDP + CY S + DL P +A HFAG A L
Sbjct: 404 PAYRAVVAALNKKLAG-LPRVTMDP-FDYCYNWTSPSTGEDLTVAMPELAVHFAGSARLQ 461
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
A+S + V C+ + + G +S+IG I QQ + +DL +++L F+R
Sbjct: 462 PPAKSYVIDAAPGVKCIGLQEGEWPG-----VSVIGNILQQEHLWEFDLKNRRLRFKRSR 516
Query: 444 C 444
C
Sbjct: 517 C 517
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 104/403 (25%), Positives = 177/403 (43%), Gaps = 48/403 (11%)
Query: 78 SSQKAHDTRAHLH-----------PGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ + HD R H G++T +++ IG P +DTGS ++WV
Sbjct: 57 SALREHDGRRHGRLLAAIDLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWV 116
Query: 126 KCQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGY------PDECWYNI 171
C C+ C T +DP S + + CD +C + GG C Y+I
Sbjct: 117 NCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSI 176
Query: 172 RYTNGPDSQGTIGSE--QFNFETSDEGKTFLYD--VGFGCSHN---NAHFSDEQFTGVFG 224
Y +G + G ++ Q+N + S +G+T + V FGC + S+ G+ G
Sbjct: 177 SYGDGSSTAGFFVTDFLQYN-QVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILG 235
Query: 225 LGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G + SS S + KV F++C+ +N + +G + +TP+
Sbjct: 236 FGQSNSSMLSQLAAAGKVRKMFAHCLDTVN----GGGIFAIGNVVQPKVKTTPLVSDMPH 291
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V L+GI +G L + N+F D+ + G IDSGTTL ++ Y+ L V D
Sbjct: 292 YNVILKGIDVGGTALGLPTNIF---DSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKH 348
Query: 342 QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLA 401
Q + D + YSG+++ GFP + FHF G L++ +Q +++C+
Sbjct: 349 QDISVQTLQDFSC-FQYSGSVD---DGFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCMG 404
Query: 402 VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + KD+ ++G + N V YDL ++ + + +C
Sbjct: 405 FQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNC 447
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 157/367 (42%), Gaps = 45/367 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P VLDTGS ++W++C PC +C FDP+KS TYA +PC +
Sbjct: 129 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAPL 188
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C C Y + Y +G + G +E F +T + V GC H+N
Sbjct: 189 CRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFR-----RTRVTRVALGCGHDN 243
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE-G 270
+ S + KFSYC+ + + + ++ G+ A+
Sbjct: 244 EGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSA-SAKPSSVVFGDSAVSRTA 302
Query: 271 DSTPM---SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
TP+ +D YY+ L GIS+ G + + +LF+ D + GV IDSGT++T L
Sbjct: 303 RFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRL-DAAGNGGVIIDSGTSVTRLT 361
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAFHFAG 378
AY LR D F+ + L C+ DL G P + HF
Sbjct: 362 RPAYIALR----DAFRVGASHLKRAAEFSLFDTCF------DLSGLTEVKVPTVVLHFR- 410
Query: 379 GADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GAD+ L A + ++S FC A + LSIIG I QQ + V++DL ++
Sbjct: 411 GADVSLPATNYLIPVDNSGSFCFAFAGT------MSGLSIIGNIQQQGFRVSFDLAGSRV 464
Query: 438 YFQRIDC 444
F C
Sbjct: 465 GFAPRGC 471
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 154/357 (43%), Gaps = 30/357 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G+P VLDTGS + W++CQPC C A + +DPS S +YAT+ CDS
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVSTSYATVGCDSPR 222
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C C Y + Y +G + G +E S + +V GC H+N
Sbjct: 223 CRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDSAP----VSNVAIGCGHDN 278
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
F G GL S ++ + FSYC+ + + + E +
Sbjct: 279 EGL----FVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSSTLQFGDSEQPAVTA 334
Query: 271 DSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAY 330
+ YYV L GIS+G + L I + F +D S GV +DSGT +T L AY
Sbjct: 335 PLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGS-GGVIVDSGTAVTRLQSGAY 393
Query: 331 QTLRKEVEDLFQGL--LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAES 388
LR E QG LP + CY +Q PA+A F GG +L L A++
Sbjct: 394 GALR---EAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQ-VPAVALWFEGGGELKLPAKN 449
Query: 389 VFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+++ +CLA + +SIIG + QQ V++D + F C
Sbjct: 450 YLIPVDAAGTYCLAFAGTS------GPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 159/369 (43%), Gaps = 32/369 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V+ +G PP ++DTGS L W++C PC C FDP+ SL+Y + C
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVTCGDPR 211
Query: 156 C--------TNDCGG-YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT-FLYDVGF 205
C C + D C Y Y + ++ G + E F + G + + DV F
Sbjct: 212 CGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVF 271
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
GC H+N + A S L G FSYC+ +++ + ++ G+
Sbjct: 272 GCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCL--VDHGSSVGSKIVFGDD 329
Query: 266 AILEGD---------STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
L G + + D YYV L+G+ +G + L+I P+ + S G I
Sbjct: 330 DALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGS-GGTII 388
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHF 376
DSGTTL++ AY+ +R+ + P P CY+ + ++ P + F
Sbjct: 389 DSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVE-VPEFSLLF 447
Query: 377 AGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
A GA AE+ F + + + CLAV G +SIIG QQN++V YDL +
Sbjct: 448 ADGAVWDFPAENYFVRLDPDGIMCLAV-----LGTPRSAMSIIGNFQQQNFHVLYDLQNN 502
Query: 436 QLYFQRIDC 444
+L F C
Sbjct: 503 RLGFAPRRC 511
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 159/369 (43%), Gaps = 32/369 (8%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V+ +G PP ++DTGS L W++C PC C FDP+ SL+Y + C
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVTCGDPR 211
Query: 156 C--------TNDCGG-YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT-FLYDVGF 205
C C + D C Y Y + ++ G + E F + G + + DV F
Sbjct: 212 CGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVF 271
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
GC H+N + A S L G FSYC+ +++ + ++ G+
Sbjct: 272 GCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCL--VDHGSSVGSKIVFGDD 329
Query: 266 AILEGD---------STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
L G + + D YYV L+G+ +G + L+I P+ + S G I
Sbjct: 330 DALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGS-GGTII 388
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHF 376
DSGTTL++ AY+ +R+ + P P CY+ + ++ P + F
Sbjct: 389 DSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVE-VPEFSLLF 447
Query: 377 AGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
A GA AE+ F + + + CLAV G +SIIG QQN++V YDL +
Sbjct: 448 ADGAVWDFPAENYFVRLDPDGIMCLAV-----LGTPRSAMSIIGNFQQQNFHVLYDLQNN 502
Query: 436 QLYFQRIDC 444
+L F C
Sbjct: 503 RLGFAPRRC 511
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 150/366 (40%), Gaps = 28/366 (7%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V +G PP ++DTGS L W++C PC C FDP S +Y + C +
Sbjct: 150 YLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDTR 209
Query: 156 C--------TNDC-GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
C C D C Y Y + ++ G + E F + + V G
Sbjct: 210 CGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGVVLG 269
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA 266
C H N + S L G FSYC+ +++ + ++ G+
Sbjct: 270 CGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCL--VDHGSAVGSKIVFGDDN 327
Query: 267 ILEGDS-------TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
+L P + + YYV L+GI +G +MLDI N + + G IDSG
Sbjct: 328 VLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTIIDSG 387
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
TTL++ AY+ +R+ D P P CY+ + ++ P + FA G
Sbjct: 388 TTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVE-VPEFSLLFADG 446
Query: 380 ADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
A AE+ F + ++ + CLAV G +SIIG QQN++V YDL +L
Sbjct: 447 AVWDFPAENYFIRLDTEGIMCLAV-----LGTPRSAMSIIGNYQQQNFHVLYDLHHNRLG 501
Query: 439 FQRIDC 444
F C
Sbjct: 502 FAPRRC 507
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 104/403 (25%), Positives = 177/403 (43%), Gaps = 48/403 (11%)
Query: 78 SSQKAHDTRAHLH-----------PGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ + HD R H G++T +++ IG P +DTGS ++WV
Sbjct: 57 SALREHDGRRHGRLLAAIDLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWV 116
Query: 126 KCQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGY------PDECWYNI 171
C C+ C T +DP S + + CD +C + GG C Y+I
Sbjct: 117 NCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSI 176
Query: 172 RYTNGPDSQGTIGSE--QFNFETSDEGKTFLYD--VGFGCSHN---NAHFSDEQFTGVFG 224
Y +G + G ++ Q+N + S +G+T + V FGC + S+ G+ G
Sbjct: 177 SYGDGSSTAGFFVTDFLQYN-QVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILG 235
Query: 225 LGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G + SS S + KV F++C+ +N + +G + +TP+
Sbjct: 236 FGQSNSSMLSQLAAAGKVRKMFAHCLDTVN----GGGIFAIGNVVQPKVKTTPLVPDMPH 291
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V L+GI +G L + N+F D+ + G IDSGTTL ++ Y+ L V D
Sbjct: 292 YNVILKGIDVGGTALGLPTNIF---DSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKH 348
Query: 342 QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLA 401
Q + D + YSG+++ GFP + FHF G L++ +Q +++C+
Sbjct: 349 QDISVQTLQDFSC-FQYSGSVD---DGFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCMG 404
Query: 402 VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + KD+ ++G + N V YDL ++ + + +C
Sbjct: 405 FQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNC 447
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 119/376 (31%), Positives = 176/376 (46%), Gaps = 50/376 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V+ +IG PP P LDTGS LIW +CQPC C FDPS S T + CDS+
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 156 CTN----DCGG---YPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C CG +P++ C Y Y + + G + ++F F + + V FGC
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---VPGVAFGC 198
Query: 208 S-HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL---- 262
NN F + TG+ G G S S + KVG+ FS+C +N + + +L L
Sbjct: 199 GLFNNGVFKSNE-TGIAGFGRGPLSLPSQL-KVGN-FSHCFTAVNGLKPSTVLLDLPADL 255
Query: 263 ---GEGAILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFK-KNDTWSDAGVF 315
G GA+ STP+ + YY++L+GI++G L + + F KN T G
Sbjct: 256 YKSGRGAV---QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGT---GGTI 309
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQ-GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAF 374
IDSGT +T L Y+ +R + ++ DP + C S + R P +
Sbjct: 310 IDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDP--YFCLSAPL-RAKPYVPKLVL 366
Query: 375 HFAGGADLVLDAESVFYQ---ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
HF GA + L E+ ++ SS+ CLA+ I G +++ IG QQN +V YD
Sbjct: 367 HFE-GATMDLPRENYVFEVEDAGSSILCLAI----IEG---GEVTTIGNFQQQNMHVLYD 418
Query: 432 LVSKQLYFQRIDCELL 447
L + +L F C+ L
Sbjct: 419 LQNSKLSFVPAQCDKL 434
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 110/371 (29%), Positives = 159/371 (42%), Gaps = 51/371 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC---EQCGA---TTFDPSKSLTYATLPCD 152
+ ++ +G P + Q V+DTGS + WV+C+PC C A FDP+ S TYA C
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 194
Query: 153 SSYC--------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
++ C N C C Y ++Y +G ++ GT S+ SD + F
Sbjct: 195 AAACAQLGDSGEANGCDA-KSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDVVRGFQ---- 249
Query: 205 FGCSHNN-AHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL 262
FGCSH D++ G+ GLG A S + G FSYC L + L L
Sbjct: 250 FGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYC---LPATPASSGFLTL 306
Query: 263 GEGAILEGD------STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
G A G +TPM + Y+ LE I++G K L + P++F AG
Sbjct: 307 GAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFA-------AG 359
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMA 373
+DSGT +T L P+AY L + P+ C++ D P +A
Sbjct: 360 SLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLG-ILDTCFN-FTGLDKVSIPTVA 417
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
FAGGA + LDA + S CLA P+ + K IG + Q+ + V YD+
Sbjct: 418 LVFAGGAVVDLDAHGIV-----SGGCLAFAPTRDD----KAFGTIGNVQQRTFEVLYDVG 468
Query: 434 SKQLYFQRIDC 444
F+ C
Sbjct: 469 GGVFGFRAGAC 479
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 119/376 (31%), Positives = 176/376 (46%), Gaps = 50/376 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V+ +IG PP P LDTGS LIW +CQPC C FDPS S T + CDS+
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 156 CTN----DCGG---YPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C CG +P++ C Y Y + + G + ++F F + + V FGC
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---VPGVAFGC 198
Query: 208 S-HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL---- 262
NN F + TG+ G G S S + KVG+ FS+C +N + + +L L
Sbjct: 199 GLFNNGVFKSNE-TGIAGFGRGPLSLPSQL-KVGN-FSHCFTAVNGLKPSTVLLDLPADL 255
Query: 263 ---GEGAILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFK-KNDTWSDAGVF 315
G GA+ STP+ + YY++L+GI++G L + + F KN T G
Sbjct: 256 YKSGRGAV---QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGT---GGTI 309
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQ-GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAF 374
IDSGT +T L Y+ +R + ++ DP + C S + R P +
Sbjct: 310 IDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDP--YFCLSAPL-RAKPYVPKLVL 366
Query: 375 HFAGGADLVLDAESVFYQ---ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
HF GA + L E+ ++ SS+ CLA+ I G +++ IG QQN +V YD
Sbjct: 367 HFE-GATMDLPRENYVFEVEDAGSSILCLAI----IEG---GEVTTIGNFQQQNMHVLYD 418
Query: 432 LVSKQLYFQRIDCELL 447
L + +L F C+ L
Sbjct: 419 LQNSKLSFVPAQCDKL 434
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 111/367 (30%), Positives = 160/367 (43%), Gaps = 53/367 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QCGATT---FDPSKSLTYATLPCDSS 154
+ + G P Q V DTGS + W++C+PC +C A FDPS S TY + C
Sbjct: 16 YVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVSCTEP 75
Query: 155 YC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
C T C C Y + Y +G + G + + F + + K F+ FGC N
Sbjct: 76 ACVGLSTRGCSS--STCLYGVFYGDGSSTIGFLAMDTFMLTPAQKFKNFI----FGCGQN 129
Query: 211 NAHFSDEQFTGVFGL-GPATSSTHSLVEKV----GSKFSYCIGNLN----YFEYAYNMLI 261
N F G GL G SST+SL +V G+ FSYC+ + + Y
Sbjct: 130 NTGL----FQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSATGYLNIGNPQNT 185
Query: 262 LGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
G A+L P Y++ L GIS+G L + +F+ G IDSGT
Sbjct: 186 PGYTAMLTDTRVPT-----LYFIDLIGISVGGTRLSLSSTVFQS------VGTIIDSGTV 234
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAG 378
+T L P+AY L+ V + + Y + PA + CY + + +P + HFA
Sbjct: 235 ITRLPPTAYSALKTAV----RAAMTQYTLAPAVTILDTCYDFSRTTSVV-YPVIVLHFA- 288
Query: 379 GADLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
G D+ + A VF+ +SS CLA G +D + IIG + Q V YD K++
Sbjct: 289 GLDVRIPATGVFFVFNSSQVCLAFAGNTDST-----MIGIIGNVQQLTMEVTYDNELKRI 343
Query: 438 YFQRIDC 444
F C
Sbjct: 344 GFSAGAC 350
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 112/390 (28%), Positives = 164/390 (42%), Gaps = 71/390 (18%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCD 152
V + S G P ++DTGS L WV+C+PC C A FDP+ S TYA + C+
Sbjct: 145 VTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCN 204
Query: 153 SSYCTN-------------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
+S C + G ++C+Y + Y +G S+G + ++ G
Sbjct: 205 ASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL-----GGAS 259
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK----FSYCIGNLNYFEY 255
L FGC +N F G GL + SLV + S+ FSYC+ +
Sbjct: 260 LGGFVFGCGLSNRGL----FGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGD- 314
Query: 256 AYNMLILGEG---AILEGDSTPMS----VIDGS----YYVTLEGISLGEKMLDIDPNLFK 304
A L LG G A ++TP++ + D + Y++ + G ++G L L
Sbjct: 315 ASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ-GLGA 373
Query: 305 KNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGN 361
N V IDSGT +T L PS Y+ +R E F YP P + + CY
Sbjct: 374 SN-------VLIDSGTVITRLAPSVYRAVRAEFMRQFGA--AGYPAAPGFSILDTCY--- 421
Query: 362 INRDLQG-----FPAMAFHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDINGERFKD 414
DL G P + GGAD+ +DA + + ++ S CLA+ E
Sbjct: 422 ---DLTGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDE---- 474
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
IIG Q+N V YD + +L F DC
Sbjct: 475 TPIIGNYQQKNKRVVYDTLGSRLGFADEDC 504
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 161/362 (44%), Gaps = 42/362 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ IG+PP P VLDTGS + WV+C PC +C T F+P+ S ++ +L C++
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFTSLSCETEQ 210
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C ++C C Y + Y +G + G F ET G T L ++ GC HNN
Sbjct: 211 CKSLDVSECRN--GTCLYEVSYGDGSYTVG-----DFVTETVTLGSTSLGNIAIGCGHNN 263
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYA---YNMLILGEGAI 267
F G GL + S ++ S FSYC+ + + + +N I +
Sbjct: 264 EGL----FIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSDSTSTLDFNSPITPDAVT 319
Query: 268 LEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
P +D +Y+ L G+S+G +L I F+ ++ + G+ +DSGT +T L
Sbjct: 320 APLHRNPN--LDTFFYLGLTGMSVGGAVLPIPETSFQMSEDG-NGGIIVDSGTAVTRLQT 376
Query: 328 SAYQTLR----KEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
+ Y LR K DL Q D + L + P ++FHFA G +L
Sbjct: 377 TVYNVLRDAFVKSTHDL-QTARGVALFDTCYDLSSKSRVE-----VPTVSFHFANGNELP 430
Query: 384 LDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L A++ +S FC A P+D LSI+G QQ V +DL + + F
Sbjct: 431 LPAKNYLIPVDSEGTFCFAFAPTD------STLSILGNAQQQGTRVGFDLANSLVGFSPN 484
Query: 443 DC 444
C
Sbjct: 485 KC 486
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 121/425 (28%), Positives = 188/425 (44%), Gaps = 44/425 (10%)
Query: 36 KRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIST 95
K LV L RDS D V + A R +++++A K +K + A P +S
Sbjct: 95 KSLVLARLERDS------DRVRSLATR-MDLAIAGITKSDLKPVEKELEAEALETPLVSG 147
Query: 96 VPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYAT 148
++ IG PP V+DTGS + WV+C PC C F+PS S +YA
Sbjct: 148 ASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAP 207
Query: 149 LPCDSSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
L C++ C ++C D C Y + Y +G + G +E +G L +V
Sbjct: 208 LTCETHQCKSLDVSECRN--DSCLYEVSYGDGSYTVGDFATETITL----DGSASLNNVA 261
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILG 263
GC H+N F G GL + S ++ S FSYC+ N + + + L
Sbjct: 262 IGCGHDNEGL----FVGAAGLLGLGGGSLSFPSQINASSFSYCLVNRD--TDSASTLEFN 315
Query: 264 EGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
+ P+ + +D YY+ + GI +G +ML I + F+ +++ + G+ +DSGT
Sbjct: 316 SPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDES-GNGGIIVDSGT 374
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
+T L Y +LR Q LPS + CY + ++ P ++FHF G
Sbjct: 375 AVTRLQSDVYNSLRDSFVRGTQH-LPSTSGVALFDTCYDLSSRSSVE-VPTVSFHFPDGK 432
Query: 381 DLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
L L A++ +S+ FC A P+ LSIIG + QQ V+YDL + + F
Sbjct: 433 YLALPAKNYLIPVDSAGTFCFAFAPTT------SALSIIGNVQQQGTRVSYDLSNSLVGF 486
Query: 440 QRIDC 444
C
Sbjct: 487 SPNGC 491
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 161/362 (44%), Gaps = 42/362 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ IG+PP P VLDTGS + WV+C PC +C T F+P+ S ++ +L C++
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFTSLSCETEQ 210
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C ++C C Y + Y +G + G F ET G T L ++ GC HNN
Sbjct: 211 CKSLDVSECRN--GTCLYEVSYGDGSYTVG-----DFVTETVTLGSTSLGNIAIGCGHNN 263
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYA---YNMLILGEGAI 267
F G GL + S ++ S FSYC+ + + + +N I +
Sbjct: 264 EGL----FIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSDSTSTLDFNSPITPDAVT 319
Query: 268 LEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
P +D +Y+ L G+S+G +L I F+ ++ + G+ +DSGT +T L
Sbjct: 320 APLHRNPN--LDTFFYLGLTGMSVGGAVLPIPETSFQMSEDG-NGGIIVDSGTAVTRLQT 376
Query: 328 SAYQTLR----KEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
+ Y LR K DL Q D + L + P ++FHFA G +L
Sbjct: 377 TVYNVLRDAFVKSTHDL-QTARGVALFDTCYDLSSKSRVE-----VPTVSFHFANGNELP 430
Query: 384 LDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
L A++ +S FC A P+D LSI+G QQ V +DL + + F
Sbjct: 431 LPAKNYLIPVDSEGTFCFAFAPTD------STLSILGNAQQQGTRVGFDLANSLVGFSPN 484
Query: 443 DC 444
C
Sbjct: 485 KC 486
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 109/377 (28%), Positives = 162/377 (42%), Gaps = 47/377 (12%)
Query: 102 NFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDCG 161
+ +IG PP VLDTGS L W++C+ E + F+P S TY +PC S C
Sbjct: 70 SLTIGTPPQNITMVLDTGSELSWLRCKK-EPNFTSIFNPLASKTYTKIPCSSQTCKTRTS 128
Query: 162 GY--------PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH 213
C + I Y + +G + E F F + T FGC + +
Sbjct: 129 DLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTRPATV-----FGCMDSGSS 183
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAI----- 267
+ E+ GL + S V ++G KFSYCI L+ + L+LGE
Sbjct: 184 SNTEEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCISGLD----STGFLLLGEARYSWLKP 239
Query: 268 -----LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
L STP+ D +Y V LEGI + K+L + ++F + T + +DSGT
Sbjct: 240 LNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGA-GQTMVDSGTQ 298
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCY-SGNINRDLQGFPAMAFH 375
T+L+ Y LRKE G+L P Y A LCY + + L P +
Sbjct: 299 FTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPVVKLM 358
Query: 376 FAGGADLVLDAESVFYQE------SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
F GA++ + + + Y+ SV+C G SD E +IG QQN +
Sbjct: 359 FR-GAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSD---ELGISSFLIGHHQQQNVWME 414
Query: 430 YDLVSKQLYFQRIDCEL 446
YDL + ++ F + C+L
Sbjct: 415 YDLENSRIGFAELRCDL 431
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 160/386 (41%), Gaps = 39/386 (10%)
Query: 74 LSQKSSQKAHDTRAHLHPGISTVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP 129
LS+ S+ + + ++ P S + + + V IG P V DTGS L W +C+P
Sbjct: 103 LSKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEP 162
Query: 130 C-EQCGATT---FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGS 185
C C + F+PS S TY + C S C + C Y+I Y + +QG +
Sbjct: 163 CLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAESCSASNCVYSIVYGDKSFTQGFLAK 222
Query: 186 EQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSY 245
E+F SD L DV FGC NN D + S + FSY
Sbjct: 223 EKFTLTNSD----VLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSY 278
Query: 246 CIGNLNYFEYAYNMLILGEGAILEGDS-TPMSVIDGS--YYVTLEGISLGEKMLDIDPNL 302
C+ ++ + L G I E TP+S + Y + + GIS+G+K L I PN
Sbjct: 279 CLP--SFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNS 336
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYS 359
F S G IDSGT T L Y LR +F+ + SY + L CY
Sbjct: 337 F------STEGAIIDSGTVFTRLPTKVYAELRS----VFKEKMSSYKSTSGYGLFDTCYD 386
Query: 360 GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDL-SII 418
D +P +AF FAG + LD + S CLA +D DL +I
Sbjct: 387 FT-GLDTVTYPTIAFSFAGSTVVELDGSGISLPIKISQVCLAFAGND-------DLPAIF 438
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDC 444
G + Q +V YD+ ++ F C
Sbjct: 439 GNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 117/405 (28%), Positives = 174/405 (42%), Gaps = 54/405 (13%)
Query: 70 RFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP 129
R ++ + +A T+ L GI+ + Y+ ++G ++DTGS L WV+C+P
Sbjct: 35 RIRRVASTHNVEASQTQIPLSSGINLQTLNYI-VTMGLGSKNMTVIIDTGSDLTWVQCEP 93
Query: 130 CEQC---GATTFDPSKSLTYATLPCDSSYC---------TNDCGGY-PDECWYNIRYTNG 176
C C F PS S +Y ++ C+SS C T CG P C Y + Y +G
Sbjct: 94 CMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDG 153
Query: 177 PDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLV 236
+ G +G E +F G + D FGC NN F GV GL S SLV
Sbjct: 154 SYTNGELGVEALSF-----GGVSVSDFVFGCGRNNKGL----FGGVSGLMGLGRSYLSLV 204
Query: 237 EKV----GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV--------IDGSYYV 284
+ G FSYC+ + L++G + + ++ P++ + Y +
Sbjct: 205 SQTNATFGGVFSYCLPTTE--AGSSGSLVMGNESSVFKNANPITYTRMLSNPQLSNFYIL 262
Query: 285 TLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL 344
L GI +G L K ++ + G+ IDSGT +T L S Y+ L+ E F G
Sbjct: 263 NLTGIDVGGVAL-------KAPLSFGNGGILIDSGTVITRLPSSVYKALKAEFLKKFTG- 314
Query: 345 LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY--QESSSVFCLAV 402
PS P C++ D P ++ F G A L +DA FY +E +S CLA+
Sbjct: 315 FPSAPGFSILDTCFN-LTGYDEVSIPTISLRFEGNAQLNVDATGTFYVVKEDASQVCLAL 373
Query: 403 GP-SDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
SD D +IIG Q+N V YD ++ F C
Sbjct: 374 ASLSDA-----YDTAIIGNYQQRNQRVIYDTKQSKVGFAEEPCSF 413
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 112/388 (28%), Positives = 173/388 (44%), Gaps = 67/388 (17%)
Query: 93 ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATL 149
+S+ ++ NF+IG PP P AV+D L+W +C PC+ C FDP+KS T+ L
Sbjct: 51 LSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGL 110
Query: 150 PCDSSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
PC S C + +C D C Y G D+ G G++ F + E +
Sbjct: 111 PCGSHLCESIPESSRNC--TSDVCIYEAPTKAG-DTGGMAGTDTFAIGAAKE------TL 161
Query: 204 GFGCSHNNAHFSDEQF------TGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYA 256
GFGC +D++ +G+ GLG + SLV ++ + FSYC+ +
Sbjct: 162 GFGC----VVMTDKRLKTIGGPSGIVGLG---RTPWSLVTQMNVTAFSYCLAG-----KS 209
Query: 257 YNMLILGEGAI-LEG---DSTPMSVI-------DGS---YYVTLEGISLGEKMLDIDPNL 302
L LG A L G STP + +GS Y V L GI G L
Sbjct: 210 SGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPL------ 263
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNI 362
+ + S + V +D+ + ++L AY+ L+K + G+ P + LC+S +
Sbjct: 264 --QAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAV-GVQPVASPPKPYDLCFSKAV 320
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPS---DINGERFKDLSIIG 419
D P + F F GGA L + + + CL +G S ++ GE + SI+G
Sbjct: 321 AGDA---PELVFTFDGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGE-LEGASILG 376
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+ Q+N +V +DL + L F+ DC L
Sbjct: 377 SLQQENVHVLFDLKEETLSFKPADCSSL 404
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 108/389 (27%), Positives = 171/389 (43%), Gaps = 62/389 (15%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-----GATTFDPSKSLTYATLPCD 152
+ +N S+G PP+ ++DTGS+LIW +C PC +C A P++S T++ LPC+
Sbjct: 90 AYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCN 149
Query: 153 SSYCT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
S+C C C YN Y +G + G + +E G V
Sbjct: 150 GSFCQYLPTSSRPRTCNAT-AACAYNYTYGSG-YTAGYLATETLTV-----GDGTFPKVA 202
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILG 263
FGCS N + +G+ GLG SLV ++ +FSYC+ + + + + ++ G
Sbjct: 203 FGCSTENGV---DNSSGIVGLG---RGPLSLVSQLAVGRFSYCLRS-DMADGGASPILFG 255
Query: 264 EGAILEGDSTPMSVIDGS-------------YYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
A L T SV+ + YYV L GI++ L + + F T
Sbjct: 256 SLAKL----TERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGL 311
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH---LCY---SGNINR 364
G +DSGTTLT+L Y +++ + L + P A + LCY +G +
Sbjct: 312 GGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGK 371
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFY------QESSSVFCLAVGPSDINGERFKDLSII 418
++ P +A FAGGA + ++ F Q +V CL V P+ + +SII
Sbjct: 372 AVR-VPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDD----LPISII 426
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
G + Q + ++ YD+ F DC L
Sbjct: 427 GNLMQMDMHLLYDIDGGMFSFAPADCAKL 455
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 112/440 (25%), Positives = 183/440 (41%), Gaps = 46/440 (10%)
Query: 37 RLVTKL--LHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS 94
R +T++ LH+ L +TV +Q Q+ N + ++ ++A A L G++
Sbjct: 92 RDLTRIQTLHKRVLAKKNQNTV-SQKQKKKNKEVVT-TPVASSVEEQAGQLVATLESGMT 149
Query: 95 T-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLP 150
++++ +G PP +LDTGS L W++C PC C +DP S +Y +
Sbjct: 150 LGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNIT 209
Query: 151 CDSSYCT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLY 201
C+ C C C Y Y + ++ G E F T+ G + LY
Sbjct: 210 CNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELY 269
Query: 202 DVG---FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYN 258
+V FGC H N + S + L G FSYC+ + N +
Sbjct: 270 NVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSS 329
Query: 259 MLILGEGAILEGD---------STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
LI GE L + +++D YYV ++ I + ++L+I +TW
Sbjct: 330 KLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNI------PEETW 383
Query: 310 S-----DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINR 364
+ G IDSGTTL++ AY+ ++ ++ + +G P Y P C++ +
Sbjct: 384 NISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGID 443
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
+Q P + FA GA E+ F + + CLA I G SIIG QQ
Sbjct: 444 SIQ-LPELGIAFADGAVWNFPTENSFIWLNEDLVCLA-----ILGTPKSAFSIIGNYQQQ 497
Query: 425 NYNVAYDLVSKQLYFQRIDC 444
N+++ YD +L + C
Sbjct: 498 NFHILYDTKRSRLGYAPTKC 517
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 128/471 (27%), Positives = 195/471 (41%), Gaps = 87/471 (18%)
Query: 11 SLITLPFTSTRIFTSTTAAPAAGKPKR----LVTKLLHRDSLLYNPNDTVDA--QAQRTL 64
S +T+P S+ T + A KP++ + LLHR P+ + D
Sbjct: 25 SFVTVP--SSSFVPDTVCSGALVKPEQNGSAVYVPLLHRHGPC-APSLSTDTPPSMSEMF 81
Query: 65 NMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIW 124
S AR Y+ S K AHL + ++ + S G P VPQ+ V+DTGS L W
Sbjct: 82 RRSHARLSYIV---SGKKVSVPAHLGTSVKSLE-YVATVSFGTPAVPQVVVIDTGSDLTW 137
Query: 125 VKCQPCE--QCGATT---FDPSKSLTYATLPCDSSYCT--------NDCG-GYPDECWYN 170
++C+PC QC FDPS S TY+ +PC S C + C G P C +
Sbjct: 138 LQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCASGECKKLAADAYGSGCSNGQP--CGFA 195
Query: 171 IRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATS 230
I Y +G + G G ++ K F FGC H+ +
Sbjct: 196 ISYVDGTSTVGVYGKDKLTLAPGAIVKDFY----FGCGHSKSSLPGLFD--------GLL 243
Query: 231 STHSLVEKVGSK------FSYCIGNLNYFEYAYNMLILGEGAILEGDS-TPMSVIDGS-- 281
L E +G++ FSYC+ +N L G G G TPM + G
Sbjct: 244 GLGRLSESLGAQYGGGGGFSYCLPAVNSKP---GFLAFGAGRNPSGFVFTPMGRVPGQPT 300
Query: 282 -YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDL 340
VTL GI++G K LD+ P+ F G+ +DSGT +T L + Y+ LR +
Sbjct: 301 FSTVTLAGITVGGKKLDLRPSAF-------SGGMIVDSGTVVTVLQSTVYRALRAAFREA 353
Query: 341 FQGLLPSYPMDPAWHLCYSGNINR--DLQGF-----PAMAFHFAGGADLVLDAESVFYQE 393
+ A+ L + G+++ DL G+ P +A F+GGA + LD +
Sbjct: 354 MK----------AYRLVH-GDLDTCYDLTGYKNVVVPKIALTFSGGATINLDVPNGILVN 402
Query: 394 SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
CLA + +G ++G + Q+ + V +D + + F+ C
Sbjct: 403 G----CLAFAETGKDGTA----GVLGNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 115/398 (28%), Positives = 174/398 (43%), Gaps = 53/398 (13%)
Query: 75 SQKSSQKAHDTRAHLHPGISTVPVFY-VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC 133
S + Q +T+ L GI + Y V +G + ++DTGS L WV+CQPC C
Sbjct: 113 SSTTEQSVSETQIPLTSGIKLETLNYIVTVELGGKNMS--LIVDTGSDLTWVQCQPCRSC 170
Query: 134 ---GATTFDPSKSLTYATLPCDSSYC---------TNDCGGY----PDECWYNIRYTNGP 177
+DPS S +Y T+ C+SS C + CGG+ C Y + Y +G
Sbjct: 171 YNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGS 230
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
++G + SE G T L ++ FGC NN +G+ GLG ++ S S
Sbjct: 231 YTRGDLASESIVL-----GDTKLENLVFGCGRNNKGLFGGA-SGLMGLGRSSVSLVSQTL 284
Query: 238 KV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM--------SVIDGSYYVTLEG 288
K FSYC+ +L + A L G + +ST + + Y + L G
Sbjct: 285 KTFNGVFSYCLPSLE--DGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTG 342
Query: 289 ISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY 348
S+G +++ F + G+ IDSGT +T L PS Y+ ++ E F G PS
Sbjct: 343 ASIGG--VELKTLSFGR-------GILIDSGTVITRLPPSIYKAVKTEFLKQFSG-FPSA 392
Query: 349 PMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY--QESSSVFCLAVGPSD 406
P C++ D+ P + F G A+L +D VFY + +S+ CLA+
Sbjct: 393 PGYSILDTCFNLTSYEDIS-IPTIKMIFEGNAELEVDVTGVFYFVKPDASLVCLALASLS 451
Query: 407 INGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
E + IIG Q+N V YD ++L +C
Sbjct: 452 YENE----VGIIGNYQQKNQRVIYDTTQERLGIAGENC 485
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 154/379 (40%), Gaps = 49/379 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++++ +G PP +LDTGS L W++C PC C +DP S ++ + C
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSFKNITCHDPR 254
Query: 156 CT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGK---TFLYDV 203
C C G C Y Y + ++ G E F T+ EGK + +V
Sbjct: 255 CQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIVENV 314
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC H N + S L G FSYC+ + N + LI G
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYCLVDRNSNSSVSSKLIFG 374
Query: 264 EG------------AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW-- 309
E + + G P +D YYV ++ I +G ++L I +TW
Sbjct: 375 EDKELLSHPNLNFTSFVGGKENP---VDTFYYVLIKSIMVGGEVLKI------PEETWHL 425
Query: 310 ---SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDL 366
G IDSGTTLT+ AY+ +++ +G P P CY+ + +
Sbjct: 426 SAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKG-FPLVETFPPLKPCYNVSGVEKM 484
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
+ P A FA GA E+ F Q E V CLA I G LSIIG QQN
Sbjct: 485 E-LPEFAILFADGAMWDFPVENYFIQIEPEDVVCLA-----ILGTPRSALSIIGNYQQQN 538
Query: 426 YNVAYDLVSKQLYFQRIDC 444
+++ YDL +L + + C
Sbjct: 539 FHILYDLKKSRLGYAPMKC 557
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 168/374 (44%), Gaps = 39/374 (10%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDC 160
V+ ++G PP VLDTGS L W+ C+ + FDP +S +Y+ +PC S C
Sbjct: 65 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNL-HSVFDPLRSSSYSPIPCTSPTCRTRT 123
Query: 161 GGY--------PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
+ C I Y + +G + S+ F+ S T + G S N+
Sbjct: 124 RDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSD 183
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCI------GNLNYFEYAYNMLILGEG 265
D + TG+ G+ + S V ++G KFSYCI G L + E +++ L +
Sbjct: 184 E--DSKTTGLIGMN---RGSLSFVTQMGLQKFSYCISGQDSSGILLFGESSFSWLKALKY 238
Query: 266 AILEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
L STP+ D +Y V LEGI + ML + +++ + T + +DSGT T+
Sbjct: 239 TPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA-GQTMVDSGTQFTF 297
Query: 325 LVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNI-NRDLQGFPAMAFHFAG 378
L+ Y L+ E + L P++ A LCY + R L P + F
Sbjct: 298 LLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFR- 356
Query: 379 GADLVLDAESVFYQE------SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
GA++ + AE + Y+ S SV+C G S++ G + IIG QQN + +DL
Sbjct: 357 GAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGV---ESYIIGHHHQQNVWMEFDL 413
Query: 433 VSKQLYFQRIDCEL 446
++ F + C+L
Sbjct: 414 AKSRVGFAEVRCDL 427
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 163/376 (43%), Gaps = 62/376 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC----GATTFDPSKSLTYATLPCDSS 154
+ V +G P + DTGS L W +CQPC + F+PSKS +Y + C S+
Sbjct: 104 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 163
Query: 155 YC------TNDCGG-YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD-VGFG 206
C T + G C Y I+Y + S G + E+F SD ++D V FG
Sbjct: 164 ACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSD-----VFDGVYFG 218
Query: 207 CSHNNAHFSDEQFTGV---FGLGPATSSTHSLVEKVGSK-FSYCIGNLNYFEYAYNMLIL 262
C NN FTGV GLG S S +K FSYC+ + + L
Sbjct: 219 CGENNQGL----FTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYT---GHLTF 271
Query: 263 GEGAILEGDS-TPMSVI-DGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
G I TP+S I DG+ Y + + I++G + L I +F S G IDS
Sbjct: 272 GSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF------STPGALIDS 325
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----P 370
GT +T L P AY LR F+ + YP + C+ DL GF P
Sbjct: 326 GTVITRLPPKAYAALRSS----FKAKMSKYPTTSGVSILDTCF------DLSGFKTVTIP 375
Query: 371 AMAFHFAGGADLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVA 429
+AF F+GGA + L ++ +FY S CLA G SD + + +I G + QQ V
Sbjct: 376 KVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDS-----NAAIFGNVQQQTLEVV 430
Query: 430 YDLVSKQLYFQRIDCE 445
YD ++ F C
Sbjct: 431 YDGAGGRVGFAPNGCS 446
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 118/417 (28%), Positives = 185/417 (44%), Gaps = 47/417 (11%)
Query: 41 KLLHRD---SLLY-NPNDTVDAQAQRTLNMSMARFIYLSQK------SSQKAHDTRAHLH 90
+LLHRD S+ Y N + + A+ +R + A +S K S + +D + +
Sbjct: 62 RLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIV 121
Query: 91 PGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTY 146
G+ ++V +G PP Q V+D+GS ++WV+CQPC+ C + FDP+KS +Y
Sbjct: 122 SGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSY 181
Query: 147 ATLPCDSSYC--TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
+ C SS C + G + C Y + Y +G ++GT+ E F KT + +V
Sbjct: 182 TGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFA-----KTVVRNVA 236
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
GC H N + G + S L + G F YC+ ++ + L+ G
Sbjct: 237 MGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCL--VSRGTDSTGSLVFGR 294
Query: 265 GAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
A+ G S V + YYV L+G+ +G + + +F +T D GV +D+GT
Sbjct: 295 EALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTET-GDGGVVMDTGT 353
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFH 375
+T L +AY R + LP + CY DL GF P ++F+
Sbjct: 354 AVTRLPTAAYVAFRDGFKSQTAN-LPRASGVSIFDTCY------DLSGFVSVRVPTVSFY 406
Query: 376 FAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
F G L L A + + S +C A S LSIIG I Q+ V++D
Sbjct: 407 FTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTG------LSIIGNIQQEGIQVSFD 457
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 156/367 (42%), Gaps = 51/367 (13%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTNDCG 161
+G P ++DTGS L WV+C PC C + F P+ S ++ L C T C
Sbjct: 9 LGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACG----TELCN 64
Query: 162 GYP------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN-AHF 214
G P C Y Y +G S G + + + K + + FGC H+N F
Sbjct: 65 GLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDNEGSF 124
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAI------ 267
+ G+ GLG S S ++ V KFSYC+ + + L+ G+ A+
Sbjct: 125 AGAD--GILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVPTFPGV 182
Query: 268 -----LEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
L P YYV L GIS+G K+L+I F D+ AG DSGTT+
Sbjct: 183 KYISLLTNPKVPT-----YYYVKLNGISVGGKLLNISSTAFDI-DSVGRAGTIFDSGTTV 236
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMD----PAWHLCYSGNINRDLQGFPAMAFHFAG 378
T L +Q EV YP LC G L P+M FHF G
Sbjct: 237 TQLAGEVHQ----EVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEG 292
Query: 379 GADLVLDAESVF-YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
G D+ L + F + ESS +C ++ S D++IIG I QQN+ V YD V +++
Sbjct: 293 G-DMELPPSNYFIFLESSQSYCFSMVSS-------PDVTIIGSIQQQNFQVYYDTVGRKI 344
Query: 438 YFQRIDC 444
F C
Sbjct: 345 GFVPKSC 351
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/356 (30%), Positives = 151/356 (42%), Gaps = 57/356 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
F V+ + G PP +LDTGSS+ W +C+PC +C + FDPS SLTY+ C S
Sbjct: 162 FLVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSCIPST 221
Query: 156 CTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS 215
N YN+ Y + S G G + E SD F FGC NN
Sbjct: 222 VGNT---------YNMTYGDKSTSVGNYGCDTMTLEHSDVFPKFQ----FGCGRNNEGDF 268
Query: 216 DEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTP 274
G+ GLG ST S K FSYC+ E + L+ GE A + S
Sbjct: 269 GSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPE----EDSIGSLLFGEKATSQSSSLK 324
Query: 275 MSVI-----------DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ + G Y+V L IS+G K L+I ++F + G IDSGT +T
Sbjct: 325 FTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF------ASPGTIIDSGTVIT 378
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-------LCYSGNINRDLQGFPAMAFHF 376
L AY L+ + YP+ CY+ + +D+ P + HF
Sbjct: 379 RLPQRAYSALKAAFKKAMA----KYPLSNGRRKKGDILDTCYNLSGRKDVL-LPEIVLHF 433
Query: 377 AGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
GAD+ L+ + V + +S CLA + +L+IIG Q + V YD+
Sbjct: 434 GEGADVRLNGKRVIWGNDASRLCLAFAGNS-------ELTIIGNRQQVSLTVLYDI 482
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 163/376 (43%), Gaps = 62/376 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC----GATTFDPSKSLTYATLPCDSS 154
+ V +G P + DTGS L W +CQPC + F+PSKS +Y + C S+
Sbjct: 132 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 191
Query: 155 YC------TNDCGG-YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD-VGFG 206
C T + G C Y I+Y + S G + E+F SD ++D V FG
Sbjct: 192 ACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSD-----VFDGVYFG 246
Query: 207 CSHNNAHFSDEQFTGV---FGLGPATSSTHSLVEKVGSK-FSYCIGNLNYFEYAYNMLIL 262
C NN FTGV GLG S S +K FSYC+ + + L
Sbjct: 247 CGENNQGL----FTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYT---GHLTF 299
Query: 263 GEGAILEGDS-TPMSVI-DGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
G I TP+S I DG+ Y + + I++G + L I +F S G IDS
Sbjct: 300 GSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF------STPGALIDS 353
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----P 370
GT +T L P AY LR F+ + YP + C+ DL GF P
Sbjct: 354 GTVITRLPPKAYAALRSS----FKAKMSKYPTTSGVSILDTCF------DLSGFKTVTIP 403
Query: 371 AMAFHFAGGADLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVA 429
+AF F+GGA + L ++ +FY S CLA G SD + + +I G + QQ V
Sbjct: 404 KVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDS-----NAAIFGNVQQQTLEVV 458
Query: 430 YDLVSKQLYFQRIDCE 445
YD ++ F C
Sbjct: 459 YDGAGGRVGFAPNGCS 474
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 151/352 (42%), Gaps = 33/352 (9%)
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTN-DCGG---YPDEC 167
VLDTGS ++WV+C PC +C FDP +S +Y + C ++ C D GG C
Sbjct: 2 VLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGAC 61
Query: 168 WYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP 227
Y + Y +G + G +E F G + V GC H+N +
Sbjct: 62 MYQVAYGDGSVTAGDFVTETLTF----AGGARVARVALGCGHDNEGLFVAAAGLLGLGRG 117
Query: 228 ATSSTHSLVEKVGSKFSYCI-------GNLNYFEYAYNMLILGEGAILEGDS--TPM--- 275
S + + G FSYC+ + + + G G++ + TPM
Sbjct: 118 GLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRN 177
Query: 276 SVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLR 334
++ YYV L GIS+ G ++ + + + + + GV +DSGT++T L ++Y LR
Sbjct: 178 PRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALR 237
Query: 335 KEVEDLFQGLLPSYPMD-PAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ- 392
G L P + CY R ++ P ++ HFAGGA+ L E+
Sbjct: 238 DAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVK-VPTVSMHFAGGAEAALPPENYLIPV 296
Query: 393 ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+S FC A +D +SIIG I QQ + V +D +++ F C
Sbjct: 297 DSRGTFCFAFAGTD------GGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 114/458 (24%), Positives = 183/458 (39%), Gaps = 41/458 (8%)
Query: 15 LPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYL 74
LP + A A +P L ++HRD++ + P + R + +
Sbjct: 7 LPLRFLLVVLVACTADATQRPTTLHIPVVHRDAV-FPPRRGAPPGSFRCRHAAPHTAQLE 65
Query: 75 SQKSSQKAHDTRAHLHPGISTVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC 130
S S+ A D P +S VP ++ +G PP L V+DTGS LIW++C PC
Sbjct: 66 SLHSATAAADLLRS--PVMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPC 123
Query: 131 EQCGATT---FDPSKSLTYATLPCDSSYCTN-----DCGGYPDECWYNIRYTNGPDSQGT 182
+C +DP S T+ +PC S C C C Y + Y +G S G
Sbjct: 124 RRCYRQVTPLYDPRNSKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGD 183
Query: 183 IGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS-THSLVEKVGS 241
+ ++ T +++V GC H+N G+ G G S L G
Sbjct: 184 LATDTLVLPD----DTRVHNVTLGCGHDNEGLLASA-AGLLGAGRGQLSFPTQLAPAYGH 238
Query: 242 KFSYCIGN-LNYFEYAYNMLILGEGAILEGDS-TPMSVIDGS---YYVTLEGISL-GEKM 295
FSYC+G+ ++ + + L+ G L + TP+ YYV + G S+ GE++
Sbjct: 239 VFSYCLGDRMSRARNSSSYLVFGRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERV 298
Query: 296 LDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEV--EDLFQGLLPSYPMDPA 353
N GV +DSGT ++ AY +R G+
Sbjct: 299 AGFSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSV 358
Query: 354 WHLCYSGNINRDLQG--FPAMAFHFAGGADLVLDAES----VFYQESSSVFCLAVGPSDI 407
+ CY + N G P++ HFA AD+ L + V + + FCL + +D
Sbjct: 359 FDTCYDVHGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAAD- 417
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
L+++G + QQ + V +D+ ++ F C
Sbjct: 418 -----DGLNVLGNVQQQGFGVVFDVERGRIGFTPNGCS 450
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 173/386 (44%), Gaps = 56/386 (14%)
Query: 84 DTRAHLHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFD 139
D A L+PGI+T F V +G PP + D + W++CQPC +C + FD
Sbjct: 171 DLNASLNPGITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFD 230
Query: 140 PSKSLTYATLPCDSSYC----TNDCG--GYPDECWYNIRYTNGPDSQGTIGSEQFNFETS 193
PS+S +Y L C++ +C + C GY C YNI Y +G +++G + +E +FE+S
Sbjct: 231 PSQSSSYTLLSCETKHCNLLPNSSCSDDGY---CRYNITYKDGTNTEGVLINETVSFESS 287
Query: 194 DEGKTFLYDVGFGCSHNN-AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNY 252
++ V GCS+ N F G FGLG + S S + S SYC+
Sbjct: 288 G----WVDRVSLGCSNKNQGPFVGSD--GTFGLGRGSLSFPSRIN--ASSMSYCLVES-- 337
Query: 253 FEYAYNMLILGEGAILEGDSTPMS-----------VIDGSYYVTLEGISLGEKMLDIDPN 301
+ Y+ + LE +S P S + YYV L+GI +G + +D+ PN
Sbjct: 338 -KDGYS------SSTLEFNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDV-PN 389
Query: 302 LFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL--LPSYPMDPAWHLCYS 359
D + + G+ + S + +T L Y +R Q L L ++ + CY+
Sbjct: 390 STFTIDPYGNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQ---FDTCYN 446
Query: 360 GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSII 418
+ N ++ P + F G +L ES Y + + FC A PS SI+
Sbjct: 447 LSSNNTVE-LPILEFEVNDGKSWLLPKESYLYAVDKNGTFCFAFAPSK------GSFSIL 499
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDC 444
G + Q V +DLV+ +Y + C
Sbjct: 500 GTLQQYGTRVTFDLVNSFVYLHTLCC 525
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/344 (26%), Positives = 146/344 (42%), Gaps = 47/344 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ + +G PP P L VLDTGS ++W++C PC QC A + FDP +S +YA + C +
Sbjct: 142 YFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAPP 201
Query: 156 CTNDCGGYPDE-------CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
C G C Y + Y +G + G + +E F + V GC
Sbjct: 202 CRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWFARGAR----VPRVAVGCG 257
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
H+N + S + G +FSYC +G+ L
Sbjct: 258 HDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCF----------------QGSDL 301
Query: 269 EGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+ + +V + G+ GE+ L +DP+ + GV +DSGT++T L
Sbjct: 302 DHRTIIRTVHQHVGGARVRGV--GERSLRLDPSTGR-------GGVILDSGTSVTRLARP 352
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAES 388
Y +R+ GL + + CY R ++ P ++ H AGGA++ L E+
Sbjct: 353 VYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVK-VPTVSVHLAGGAEVALPPEN 411
Query: 389 VFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
++ FCLA+ +D +SI+G I QQ + V +D
Sbjct: 412 YLIPVDTRGTFCLALAGTD------GGVSIVGNIQQQGFRVVFD 449
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 114/376 (30%), Positives = 163/376 (43%), Gaps = 47/376 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSY 155
+ +N SIG PPV + DTGSSLIW +C PC +C A F P+ S T++ LPC SS
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSL 149
Query: 156 CTNDCGGY----PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C Y C Y Y G + G + +E + G V FGCS N
Sbjct: 150 CQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLATETLHV-----GGASFPGVAFGCSTEN 203
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
+G+ GLG S SLV +VG +FSYC+ + + + ++ G A + G
Sbjct: 204 G--VGNSSSGIVGLG---RSPLSLVSQVGVGRFSYCL--RSDADAGDSPILFGSLAKVTG 256
Query: 271 ---DSTPM-----SVIDGSYYVTLEGISLGEKMLDIDPNLF---KKNDTWSDAGVFIDSG 319
STP+ YYV L GI++G L + F + G +DSG
Sbjct: 257 GNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSG 316
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP---AWHLCYSGNINRDLQGFPA--MAF 374
TTLT+LV Y +++ + ++ + LC+ G P +
Sbjct: 317 TTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPTLVL 376
Query: 375 HFAGGADLVLDAES------VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
FAGGA+ + S V Q ++V CL V P+ E+ +SIIG + Q + +V
Sbjct: 377 RFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPAS---EKLS-ISIIGNVMQMDLHV 432
Query: 429 AYDLVSKQLYFQRIDC 444
YDL F DC
Sbjct: 433 LYDLDGGMFSFAPADC 448
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 159/369 (43%), Gaps = 51/369 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC---GATTFDPSKSLTYATLPCDSS 154
F V G P +LDTGS L W++C+PC C FDP+KS +YA +PC +
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGTP 196
Query: 155 YCTND---CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C G C Y ++Y +G + G + + F +S + F FGC N
Sbjct: 197 VCAAAGGMCNG--TTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFTGFT----FGCGEKN 250
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
E + S G FSYC+ + N N+ GA
Sbjct: 251 IGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTTPGYLNI-----GATKPTS 305
Query: 272 STPM---SVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ P+ ++I Y++ L I++G +L + P++F K G +DSGT LT
Sbjct: 306 TVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKT------GTLLDSGTILT 359
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG-----FPAMAFHFAG 378
+L P AY +LR + QG P+ P +P CY D G PA++F+F+
Sbjct: 360 YLPPPAYTSLRDRFKFTMQGNKPAPPYEP-LDTCY------DFTGQGAIVIPAVSFNFSD 412
Query: 379 GADLVLD--AESVFYQESSSVF-CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
GA LD +F ++ + CLA ++ SI+G Q+ V YD+ S+
Sbjct: 413 GAVFDLDFYGIMIFPDDAKPLIGCLAF----VSRPAAMPFSIVGNTQQRAAEVIYDVPSQ 468
Query: 436 QLYFQRIDC 444
++ F I C
Sbjct: 469 KIGFIPISC 477
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 104/366 (28%), Positives = 165/366 (45%), Gaps = 48/366 (13%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDC 160
V+ ++G PP VLDTGS L W+ C+ + F+P S +Y+ +PC S C
Sbjct: 1002 VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNL-TSVFNPLSSSSYSPIPCSSPICRTRT 1060
Query: 161 GGYPDE--------CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
P+ C + Y + +G + S+ F G + L FGC +
Sbjct: 1061 RDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI-----GSSALPGTLFGCMDSGF 1115
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAI---- 267
+ E+ GL + S V ++G KFSYCI + + +L+ G+ +
Sbjct: 1116 SSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRD----SSGVLLFGDLHLSWLG 1171
Query: 268 ------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
L STP+ D +Y V L+GI +G K+L + ++F + T +DSGT
Sbjct: 1172 NLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHT-GAGQTMVDSGT 1230
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
T+L+ Y LR E + +G+L P++ A LCYS L P+++
Sbjct: 1231 QFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVSLM 1290
Query: 376 FAGGADLVLDAESVFY------QESSSVFCLAVGPSDING-ERFKDLSIIGMIAQQNYNV 428
F GA++V+ E + Y + + V+CL G SD+ G E F +IG QQN +
Sbjct: 1291 FR-GAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAF----VIGHHHQQNVWM 1345
Query: 429 AYDLVS 434
+DLV+
Sbjct: 1346 EFDLVA 1351
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 96/372 (25%), Positives = 168/372 (45%), Gaps = 38/372 (10%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFDPSKSLTYATL 149
++Y +G PP +DTGS ++WV C C+QC T +DP S T +T+
Sbjct: 87 LYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTV 146
Query: 150 PCDSSYCTNDCGGYPDEC------WYNIRYTNGPDSQGTIGSEQFNF-ETSDEGKTFLYD 202
CD +C + GG +C Y++ Y +G + G+ ++ F + + +G+T +
Sbjct: 147 MCDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPAN 206
Query: 203 --VGFGCSHNNAH---FSDEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIGNLNYFE 254
V FGC S + G+ G G A +S S + KV F++C+ +
Sbjct: 207 ASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDTIK--- 263
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ +G+ + +TP+ Y V L+ I +G L++ ++FK + G
Sbjct: 264 -GGGIFAIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLELPADIFKPGEK---RGT 319
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLC--YSGNINRDLQGFPAM 372
IDSGTTLT+L ++ + V + Q + D LC YSG+++ GFP +
Sbjct: 320 IIDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQD---FLCFEYSGSVD---DGFPTL 373
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
FHF L + F+ + V+C+ + + KD+ ++G + N V YDL
Sbjct: 374 TFHFEDDLALHVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDL 433
Query: 433 VSKQLYFQRIDC 444
++ + + +C
Sbjct: 434 ENRVIGWTDYNC 445
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 167/374 (44%), Gaps = 39/374 (10%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDC 160
V+ ++G PP VLDTGS L W+ C+ + FDP +S +Y+ +PC S C
Sbjct: 58 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNL-HSVFDPLRSSSYSPIPCTSPTCRTRT 116
Query: 161 GGY--------PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
+ C I Y + +G + S+ F+ S T + G S N+
Sbjct: 117 RDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSD 176
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCI------GNLNYFEYAYNMLILGEG 265
D + TG+ G+ + S V ++G KFSYCI G L + E +++ L +
Sbjct: 177 E--DSKTTGLIGMN---RGSLSFVTQMGLQKFSYCISGQDSSGILLFGESSFSWLKALKY 231
Query: 266 AILEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
L STP+ D +Y V LEGI + ML + +++ + T + +DSGT T+
Sbjct: 232 TPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA-GQTMVDSGTQFTF 290
Query: 325 LVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNI-NRDLQGFPAMAFHFAG 378
L+ Y L+ E + L P++ A LCY + R L P + F
Sbjct: 291 LLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFR- 349
Query: 379 GADLVLDAESVFYQE------SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
GA++ + AE + Y+ S SV+C G S++ G + IIG QQN + +DL
Sbjct: 350 GAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGV---ESYIIGHHHQQNVWMEFDL 406
Query: 433 VSKQLYFQRIDCEL 446
++ F + C L
Sbjct: 407 AKSRVGFAEVRCXL 420
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 105/404 (25%), Positives = 173/404 (42%), Gaps = 48/404 (11%)
Query: 78 SSQKAHDTRAH--------LHPGISTVP----VFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ +AHD R L+ G + +P +++ +G PP +DTGS ++WV
Sbjct: 37 SAVRAHDVRRRGRILSAVDLNLGGNGLPTETGLYFTKLGLGSPPRDYYVQVDTGSDILWV 96
Query: 126 KCQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGG-YPD-----ECWYNI 171
C C +C T +DP S T + CD +C+ G P C Y+I
Sbjct: 97 NCVECSRCPRKSDLGIDLTLYDPKGSETSDVVSCDQDFCSATFDGPIPGCKSEIPCPYSI 156
Query: 172 RYTNGPDSQG-------TIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFG 224
Y +G + G T N TS + + ++ G S S+E G+ G
Sbjct: 157 TYGDGSATTGYYVQDYLTYNRINGNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIG 216
Query: 225 LGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G A SS S + KV FS+C+ N+ + +GE + +TP+
Sbjct: 217 FGQANSSVLSQLAASGKVKKIFSHCLDNVR----GGGIFAIGEVVEPKVSTTPLVPRMAH 272
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V L+ I + +L + ++F D+ + G IDSGTTL +L Y L ++V
Sbjct: 273 YNVVLKSIEVDTDILQLPSDIF---DSVNGKGTVIDSGTTLAYLPDIVYDELIQKVLARQ 329
Query: 342 QGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCL 400
G L Y ++ + Y+GN++R GFP + HF L + +Q ++C+
Sbjct: 330 PG-LKLYLVEQQFRCFLYTGNVDR---GFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCI 385
Query: 401 AVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S + KD++++G + N V YDL + + + +C
Sbjct: 386 GWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMVIGWTDYNC 429
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 98/348 (28%), Positives = 150/348 (43%), Gaps = 48/348 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++V IG P + Q V+D+GS ++W++C+PC+QC T F+P+ S ++ + C S+
Sbjct: 129 YFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVACSSNV 188
Query: 156 CT---NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
C +D C Y + Y +G ++GT+ ET G+T + D GC H N
Sbjct: 189 CNQLDDDVACRKGRCGYQVAYGDGSYTKGTLA-----LETITIGRTVIQDTAIGCGHWNE 243
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS 272
+ G S L + G F YC+ + M + L +
Sbjct: 244 GMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCL-------VSRAMPVGAMWVPLIHNP 296
Query: 273 TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQT 332
S YYV+L G+++G + I +F+ D + GV +D+GT +T L AY
Sbjct: 297 FYPSF----YYVSLSGLAVGGIRVPISEQIFQLTDIGT-GGVVMDTGTAITRLPTVAYNA 351
Query: 333 LRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PAMAFHFAGGADLVL 384
R D F + P P + CY DL GF P ++F+F+GG L
Sbjct: 352 FR----DAFIAQTTNLPRAPGVSIFDTCY------DLNGFVTVRVPTVSFYFSGGQILTF 401
Query: 385 DAESVFY-QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
A + + FC A PS LSIIG I Q+ V+ D
Sbjct: 402 PARNFLIPADDVGTFCFAFAPSP------SGLSIIGNIQQEGIQVSID 443
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 170/398 (42%), Gaps = 39/398 (9%)
Query: 61 QRTLNMSMARFIYLS---QKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLD 117
+ TL AR YLS +K S RA I P + V +IG P P L LD
Sbjct: 55 ESTLLKDKARLQYLSSLAKKPSVPIASGRA-----IVQSPTYIVRANIGTPAQPMLVALD 109
Query: 118 TGSSLIWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSYCT---NDCGGYPDECWYNIRY 173
T + WV C C C ++ FDPSKS + L CD+ C N C +N+ Y
Sbjct: 110 TSNDAAWVPCSGCVGCASSVLFDPSKSSSSRNLQCDAPQCKQAPNPTCTAGKSCGFNMTY 169
Query: 174 TNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTH 233
G+ +T + FGC + A + G+ GLG S
Sbjct: 170 ------GGSTIEASLTQDTLTLANDVIKSYTFGC-ISKATGTSLPAQGLMGLGRGPLSLI 222
Query: 234 SLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGI 289
S + + S FSYC+ N ++ ++ + + + +TP+ YYV L GI
Sbjct: 223 SQTQNLYMSTFSYCLPNSKSSNFSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGI 282
Query: 290 SLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP 349
+G K++DI P D + AG DSGT T LV AY +R E + +
Sbjct: 283 RVGNKIVDI-PTSALAFDASTGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNA--NAT 339
Query: 350 MDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS---SVFCLAVGPSD 406
+ CYSG++ +P++ F FA G ++ L +++ SS S +A P++
Sbjct: 340 SLGGFDTCYSGSV-----VYPSVTFMFA-GMNVTLPPDNLLIHSSSGSTSCLAMAAAPNN 393
Query: 407 INGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+N L++I + QQN+ V DL + +L R C
Sbjct: 394 VNSV----LNVIASMQQQNHRVLIDLPNSRLGISRETC 427
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 109/362 (30%), Positives = 153/362 (42%), Gaps = 59/362 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC----GATTFDPSKSLTYATLPCDSS 154
++V +G P + DTGS L W +C+PC + FDPSKS +Y+ + C S+
Sbjct: 145 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNITCTST 204
Query: 155 YCTN---------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
CT C C Y I+Y + S G E+ + +D FL F
Sbjct: 205 LCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSVTATDIVDNFL----F 260
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSS----THSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
GC NN G+ GLG S T ++ K+ FSYC L + L
Sbjct: 261 GCGQNNQGLFGGS-AGLIGLGRHPISFVQQTAAVYRKI---FSYC---LPATSSSTGRLS 313
Query: 262 LGEGAILEGDSTPMSVID-GS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
G TP S I GS Y + + GIS+G L + + T+S G IDS
Sbjct: 314 FGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVS------SSTFSTGGAIIDS 367
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----P 370
GT +T L P+AY LR F+ + YP + CY DL G+ P
Sbjct: 368 GTVITRLPPTAYTALRSA----FRQGMSKYPSAGELSILDTCY------DLSGYEVFSIP 417
Query: 371 AMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+ F FAGG + L + + Y S+ CLA NG+ D++I G + Q+ V Y
Sbjct: 418 KIDFSFAGGVTVQLPPQGILYVASAKQVCLAFAA---NGDD-SDVTIYGNVQQKTIEVVY 473
Query: 431 DL 432
D+
Sbjct: 474 DV 475
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 116/375 (30%), Positives = 164/375 (43%), Gaps = 60/375 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC----GATTFDPSKSLTYATLPCDSS 154
+ V +G P + DTGS L W +CQPC + F+PSKS +Y + C S+
Sbjct: 133 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 192
Query: 155 YC------TNDCGG-YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD-VGFG 206
C T + G C Y I+Y + S G + ++F +SD ++D V FG
Sbjct: 193 ACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKDKFTLTSSD-----VFDGVYFG 247
Query: 207 CSHNNAHFSDEQFTGV---FGLGPATSSTHSLVEKVGSK-FSYCIGNLNYFEYAYNMLIL 262
C NN FTGV GLG S S +K FSYC+ + Y ++
Sbjct: 248 CGENNQGL----FTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL--PSSASYTGHLTFG 301
Query: 263 GEGAILEGDSTPMSVI-DGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
G TP+S I DG+ Y + + I++G + L I +F S G IDSG
Sbjct: 302 SAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF------STPGALIDSG 355
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF-----PA 371
T +T L P AY LR F+ + YP + C+ DL GF P
Sbjct: 356 TVITRLPPKAYAALRSS----FKAKMSKYPTTSGVSILDTCF------DLSGFKTVTIPK 405
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+AF F+GGA + L ++ +FY S CLA G SD + + +I G + QQ V Y
Sbjct: 406 VAFSFSGGAVVELGSKGIFYAFKISQVCLAFAGNSDDS-----NAAIFGNVQQQTLEVVY 460
Query: 431 DLVSKQLYFQRIDCE 445
D ++ F C
Sbjct: 461 DGAGGRVGFAPNGCS 475
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 107/365 (29%), Positives = 160/365 (43%), Gaps = 35/365 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--FDPSKSLTYATLPCDSSYC 156
+ + G PP VLDTGS++ W+ C PC C + F+PSKS TY L C S C
Sbjct: 124 YIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQPFEPSKSSTYNYLTCASQQC 183
Query: 157 T--NDCGGYPD--ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
C + C RY + + + SE + G + + FGCS+
Sbjct: 184 QLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSV-----GSQQVENFVFGCSNAAR 238
Query: 213 HFSDEQFTGV-FGLGPAT--SSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
+ V FG P + S T +L + S FSYC+ +L + ++L+ E +
Sbjct: 239 GLIQRTPSLVGFGRNPLSFVSQTATLYD---STFSYCLPSLFSSAFTGSLLLGKEALSAQ 295
Query: 270 G-DSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
G TP+ S YYV L GIS+GE+++ I +++ + G IDSGT +T L
Sbjct: 296 GLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDES-TGRGTIIDSGTVITRL 354
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY---SGNINRDLQGFPAMAFHFAGGADL 382
V AY +R L + P D + CY SG++ FP + HF DL
Sbjct: 355 VEPAYNAMRDSFRSQLSNLTMASPTD-LFDTCYNRPSGDVE-----FPLITLHFDDNLDL 408
Query: 383 VLDAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
L +++ Y + SV CLA G G+ LS G QQ + +D+ +L
Sbjct: 409 TLPLDNILYPGNDDGSVLCLAFGLPPGGGDDV--LSTFGNYQQQKLRIVHDVAESRLGIA 466
Query: 441 RIDCE 445
+C+
Sbjct: 467 SENCD 471
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 157/385 (40%), Gaps = 52/385 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ ++ +G PP ++DTGS L W++C PC C FDP+ S +Y + C
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDHR 210
Query: 156 CTNDCGGY--------------PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
C + D C Y Y + ++ G + E F + G +
Sbjct: 211 CGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRV 270
Query: 202 D-VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYC-------IGNLNYF 253
D V FGC H N + S L G FSYC +G+ F
Sbjct: 271 DGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVGSKVVF 330
Query: 254 EYAYNMLILGEGAILE-----GDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
+ L L L+ S+ S D YYV L+G+ +G ++L+I +DT
Sbjct: 331 GEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNI------SSDT 384
Query: 309 W-----SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNI 362
W G IDSGTTL++ V AYQ +R D P P P CY+ +
Sbjct: 385 WDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVLSPCYNVSGV 444
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQ---ESSSVFCLAVGPSDINGERFKDLSIIG 419
R P ++ FA GA AE+ F + + S+ CLAV + G +SIIG
Sbjct: 445 ER--PEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTG-----MSIIG 497
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDC 444
QQN++V YDL + +L F C
Sbjct: 498 NFQQQNFHVVYDLQNNRLGFAPRRC 522
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 152/355 (42%), Gaps = 45/355 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC---EQCGA---TTFDPSKSLTYATLPCD 152
+ ++ +G P V Q V+DTGS + WV+C+PC C A FDP+ S TYA C
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 167
Query: 153 SSYC--------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
++ C N C C Y ++Y +G ++ GT S+ SD + F
Sbjct: 168 AAACAQLGDSGEANGCDAK-SRCQYIVKYGDGSNTTGTYSSDVLTLSGSDVVRGFQ---- 222
Query: 205 FGCSHNN-AHFSDEQFTGVFGL-GPATSSTHSLVEKVGSKFSYCIGNL---NYFEYAYNM 259
FGCSH D++ G+ GL G A S + G F YC+ + F
Sbjct: 223 FGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPATPASSGFLTLGAP 282
Query: 260 LILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
G G +TPM + Y+ LE I++G K L + P++F AG +
Sbjct: 283 ASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFA-------AGSLV 335
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHF 376
DSGT +T L P+AY L + P+ C++ D P +A F
Sbjct: 336 DSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLG-ILDTCFN-FTGLDKVSIPTVALVF 393
Query: 377 AGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
AGGA + LDA + S CLA P+ + K IG + Q+ + V YD
Sbjct: 394 AGGAVVDLDAHGIV-----SGGCLAFAPTRDD----KAFGTIGNVQQRTFEVLYD 439
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 172/388 (44%), Gaps = 67/388 (17%)
Query: 93 ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATL 149
+S+ ++ NF+IG PP P AV+D L+W +C PC+ C FDP+KS T+ L
Sbjct: 51 LSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGL 110
Query: 150 PCDSSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
PC S C + +C D C Y G D+ G G++ F + E +
Sbjct: 111 PCGSHLCESIPESSRNC--TSDVCIYEAPTKAG-DTGGKAGTDTFAIGAAKE------TL 161
Query: 204 GFGCSHNNAHFSDEQF------TGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYA 256
GFGC +D++ +G+ GLG + SLV ++ + FSYC+ +
Sbjct: 162 GFGC----VVMTDKRLKTIGGPSGIVGLG---RTPWSLVTQMNVTAFSYCLAG-----KS 209
Query: 257 YNMLILGEGAI-LEG---DSTPMSVI-------DGS---YYVTLEGISLGEKMLDIDPNL 302
L LG A L G STP + +GS Y V L GI G L
Sbjct: 210 SGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPL------ 263
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNI 362
+ + S + V +D+ + ++L AY+ L+K + G+ P + LC+ +
Sbjct: 264 --QAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAV-GVQPVASPPKPYDLCFPKAV 320
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPS---DINGERFKDLSIIG 419
D P + F F GGA L + + + CL +G S ++ GE + SI+G
Sbjct: 321 AGDA---PELVFTFDGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGE-LEGASILG 376
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+ Q+N +V +DL + L F+ DC L
Sbjct: 377 SLQQENVHVLFDLKEETLSFKPADCSSL 404
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 104/382 (27%), Positives = 167/382 (43%), Gaps = 48/382 (12%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V ++Y +G PPV +DTGS ++WV C C C T+ FDP S T +
Sbjct: 75 VGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSS 134
Query: 148 TLPCDSSYCTND-------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
+ C C N C ++C Y +Y +G + G S+ + T EG
Sbjct: 135 MIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTT 194
Query: 201 YD---VGFGCSHNNAH---FSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLN 251
V FGCS+ SD G+FG G S S + G FS+C L
Sbjct: 195 NSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHC---LK 251
Query: 252 YFEYAYNMLILGEGAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTW 309
+L+LGE I+E + S++ Y + L+ IS+ + L ID ++F +++
Sbjct: 252 GDSSGGGILVLGE--IVEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNS- 308
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF-QGLLPSYPMDPAWHLCYSGNINRDLQG 368
G +DSGTTL +L AY + Q + +L S +
Sbjct: 309 --RGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVVSRGNQCYLITSSVTDV---- 362
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
FP ++ +FAGGA ++L + Q++S +V+C +G I G+ ++I+G + +
Sbjct: 363 FPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWC--IGFQKIQGQ---GITILGDLVLK 417
Query: 425 NYNVAYDLVSKQLYFQRIDCEL 446
+ V YDL +++ + DC L
Sbjct: 418 DKIVVYDLAGQRIGWANYDCSL 439
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 107/347 (30%), Positives = 150/347 (43%), Gaps = 49/347 (14%)
Query: 107 QPPVPQLAVLDTG-SSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTNDCGG 162
QPP PQ + + S+ W +C+PC +C + FDPS SLTY+ C S N
Sbjct: 82 QPPSPQEILAEMNPDSITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSCIPSTVGNT--- 138
Query: 163 YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGV 222
YN+ Y + S G G + E SD F FGC NN G+
Sbjct: 139 ------YNMTYGDKSTSVGNYGCDTMTLEPSDVFPKFQ----FGCGRNNEGDFGSGADGM 188
Query: 223 FGLGPATSSTHS-LVEKVGSKFSYC------IGNLNYFEYAYNMLILGEGAILEGDSTPM 275
GLG ST S K FSYC IG+L + E A + L +++ G T
Sbjct: 189 LGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLFGEKATSQSSLKFTSLVNGPGTSG 248
Query: 276 SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRK 335
G Y+V L IS+G K L++ ++F + G IDSGT +T L AY L
Sbjct: 249 LEESGYYFVKLLDISVGNKRLNVPSSVF------ASPGTIIDSGTVITCLPQRAYSALTA 302
Query: 336 EVEDLFQGLLPSYPMDPAWH-------LCYSGNINRDLQGFPAMAFHFAGGADLVLDAES 388
F+ + YP+ CY+ + +D+ P + HF GAD+ L+ +
Sbjct: 303 A----FKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVL-LPEIVLHFGEGADVRLNGKR 357
Query: 389 VFYQESSSVFCLAVG---PSDINGERFKDLSIIGMIAQQNYNVAYDL 432
V + +S CLA S +N E L+IIG Q + V YD+
Sbjct: 358 VIWGNDASRLCLAFAGNSKSTMNSE----LTIIGNRQQVSLTVLYDI 400
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 108/380 (28%), Positives = 163/380 (42%), Gaps = 55/380 (14%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDC 160
V IG PP PQ VLDTGS L W++C + +FDPS S ++ LPC C
Sbjct: 90 VTLPIGTPPQPQQMVLDTGSQLSWIQCH-NKTPPTASFDPSLSSSFYVLPCTHPLCKPRV 148
Query: 161 GGYP-----DE---CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
+ D+ C Y+ Y +G ++G + E+ F S + GCS +
Sbjct: 149 PDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPLI----LGCSSES- 203
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI-------------------GNLNYF 253
G+ G+ S KV +KFSYC+ N N
Sbjct: 204 ----RDARGILGMNLGRLS-FPFQAKV-TKFSYCVPTRQPANNNNFPTGSFYLGNNPNSA 257
Query: 254 EYAY-NMLILGEGAILEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ Y +ML + S M +D +Y V ++GI +G + L+I P++F+ N S
Sbjct: 258 RFRYVSMLTFPQ-------SQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGS- 309
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKE-VEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFP 370
+DSG+ T+LV AY +R+E + L + Y +C+ GN +
Sbjct: 310 GQTMVDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADMCFDGNAMEIGRLLG 369
Query: 371 AMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLS-IIGMIAQQNYNVA 429
+AF F G ++V+ E V V C+ +G S ER S IIG QQN V
Sbjct: 370 DVAFEFEKGVEIVVPKERVLADVGGGVHCVGIGRS----ERLGAASNIIGNFHQQNLWVE 425
Query: 430 YDLVSKQLYFQRIDCELLAD 449
+DL ++++ F DC L+
Sbjct: 426 FDLANRRIGFGVADCSRLSK 445
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 160/362 (44%), Gaps = 50/362 (13%)
Query: 106 GQPPVPQLAVLDTGSSLIWVKCQPCE--QCGATT---FDPSKSLTYATLPCDSSYCTNDC 160
G V Q ++D+GS + WV+CQPC C FDP+ S TYA +PC S+ C
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACAR-L 133
Query: 161 GGYP------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH--NNA 212
G Y +C + I Y NG + GT S+ D + FL FGC+H +
Sbjct: 134 GPYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYDVVRGFL----FGCAHADQGS 189
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVGSK----FSYCI-GNLNYFEYAYNMLILGEGAI 267
FS + G LG + S V++ S+ FSYC+ + + F + + A+
Sbjct: 190 TFSYD-VAGTLALG---GGSQSFVQQTASQYSRVFSYCVPPSTSSFGFIMFGVPPQRAAL 245
Query: 268 LEG-DSTPM---SVIDGSYY-VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+ STP+ S + ++Y V L I + + L + P +F + IDS T +
Sbjct: 246 VPTFVSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSASSV-------IDSATVI 298
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
+ + P+AYQ LR P+ P+ CY + R + P++A F GGA +
Sbjct: 299 SRIPPTAYQALRAAFRSAMTMYRPAPPVS-ILDTCYDFSGVRSIT-LPSIALVFDGGATV 356
Query: 383 VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
LDA + Q CLA P+ +R IG + Q+ V YD+ K + F+
Sbjct: 357 NLDAAGILLQG-----CLAFAPT--ASDRMPGF--IGNVQQRTLEVVYDVPGKAIRFRSA 407
Query: 443 DC 444
C
Sbjct: 408 AC 409
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 102/385 (26%), Positives = 165/385 (42%), Gaps = 55/385 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V+ ++G PP P LDTGS L+W +C PC C G DP+ S TYA LPC +
Sbjct: 92 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCGAPR 151
Query: 156 CT----NDCGGYPDECW--------YNIRYTNGPDSQGTIGSEQFNFETSD-EGKTFL-- 200
C CGG W Y Y + + G I +++F F + +G + L
Sbjct: 152 CRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSRLPT 211
Query: 201 YDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNML 260
+ FGC H N TG+ G G S S + + FSYC ++ FE +++
Sbjct: 212 RRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNV--TTFSYCFTSM--FESKSSLV 267
Query: 261 ILG----------EGAILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKK 305
LG A + G+ ++ Y+++L+GIS+G+ L + +
Sbjct: 268 TLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAKLRS 327
Query: 306 NDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD-PAWHLCYSGNINR 364
IDSG ++T L + Y+ ++ E GL P+ ++ A LC++ +
Sbjct: 328 T--------IIDSGASITTLPEAVYEAVKAEFAAQV-GLPPTGVVEGSALDLCFALPVTA 378
Query: 365 DLQG--FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIA 422
+ P++ H G + VF ++ V C+ + + D ++IG
Sbjct: 379 LWRRPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAP------GDQTVIGNFQ 432
Query: 423 QQNYNVAYDLVSKQLYFQRIDCELL 447
QQN +V YDL + L F C+ L
Sbjct: 433 QQNTHVVYDLENDWLSFAPARCDSL 457
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 108/384 (28%), Positives = 176/384 (45%), Gaps = 52/384 (13%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V ++Y +G PP +DTGS ++WV C C C + FDP S T +
Sbjct: 49 VGLYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTAS 108
Query: 148 TLPCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
+ C C+ + C + C YN +Y +G + G S+ +F+T G
Sbjct: 109 LISCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMN 168
Query: 201 YD---VGFGCSH---NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLN 251
+ FGCS + SD G+FG G S S + G FS+C L
Sbjct: 169 NSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHC---LK 225
Query: 252 YFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
+ +L+LGE I+E + TP+ Y + ++ IS+ + L IDP++F T
Sbjct: 226 GDDSGGGILVLGE--IVEPNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFG---TS 280
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY-PMDPAWHLCY--SGNINRDL 366
S G IDSGTTL +L +AY + + PS P + CY S +IN D+
Sbjct: 281 SSQGTIIDSGTTLAYLAEAAYDPFISAITSIVS---PSVRPYLSKGNHCYLISSSIN-DI 336
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGMIA 422
FP ++ +FAGGA ++L + Q+SS +++C +G I G+ ++I+G +
Sbjct: 337 --FPQVSLNFAGGASMILIPQDYLIQQSSIGGAALWC--IGFQKIQGQ---GITILGDLV 389
Query: 423 QQNYNVAYDLVSKQLYFQRIDCEL 446
++ YD+ ++++ + DC +
Sbjct: 390 LKDKIFVYDIANQRIGWANYDCSM 413
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 166/364 (45%), Gaps = 46/364 (12%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCT 157
V+Y + ++G PP V+DTGS L WV+C PC ++TFD S TY L C
Sbjct: 123 VYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASNTYKALTC------ 176
Query: 158 NDCGGYPD--ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHF 214
D P W + + +G + T+ + SDE + F V FGC S
Sbjct: 177 ADDLRLPVLLRLWRRL-FHSGRSLRDTL---KMAGAASDELEEFPGFV-FGCGSLLKGLI 231
Query: 215 SDEQFTGVFGLGPATSSTHSLV-EKVGSKFSYCI-----------GNLNYFEYAYNMLIL 262
S E G+ L P + S S + EK G+KFSYC+ + + E A +
Sbjct: 232 SGE--VGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKEP 289
Query: 263 GEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
G G E TP+ Y V L+GIS+G + LD+ P+ F D DSGTTL
Sbjct: 290 GSGKPQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSTFLNG---QDKPTIFDSGTTL 346
Query: 323 TWLVPSAYQTLRKEVEDLFQG--LLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
T L ++++ + + G + +D + + S QG P + FHF GGA
Sbjct: 347 TMLPSGVCDSIKQSLASMVSGAEFVAIKGLDACFRVPPSSG-----QGLPDITFHFNGGA 401
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
D V S + + S+ CL P++ ++SI G + QQ++ V +D+ ++++ F+
Sbjct: 402 DFV-TRPSNYVIDLGSLQCLIFVPTN-------EVSIFGNLQQQDFFVLHDMDNRRIGFK 453
Query: 441 RIDC 444
DC
Sbjct: 454 ETDC 457
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 114/430 (26%), Positives = 189/430 (43%), Gaps = 54/430 (12%)
Query: 36 KRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIST 95
K LV L RD+ N +T A +LN S +Y ++ + D + G +
Sbjct: 96 KTLVLSRLARDTARVNSLNTKLQLALSSLNRSD---LYPTETELLRPEDLSTPVSSGTAQ 152
Query: 96 -VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPC 151
++ +GQP P VLDTGS + W++C+PC C + FDP+ S +Y L C
Sbjct: 153 GSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTC 212
Query: 152 DSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
D+ C + C +C Y + Y +G + G +E +F G + V GC
Sbjct: 213 DAQQCQDLEMSACRN--GKCLYQVSYGDGSFTVGEYVTETVSF-----GAGSVNRVAIGC 265
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGA 266
H+N F G GL SL ++ + FSYC+ + + G+ +
Sbjct: 266 GHDNEGL----FVGSAGLLGLGGGPLSLTSQIKATSFSYCLVDRDS----------GKSS 311
Query: 267 ILE------GDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
LE GDS ++ YYV L G+S+G +++ + P F + + + GV
Sbjct: 312 TLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGA-GGVI 370
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
+DSGT +T L AY ++R + L P+ + + CY + + ++ P ++FH
Sbjct: 371 VDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGV-ALFDTCYDLSSLQSVR-VPTVSFH 428
Query: 376 FAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F+G L A++ + + +C A P+ +SIIG + QQ V++DL +
Sbjct: 429 FSGDRAWALPAKNYLIPVDGAGTYCFAFAPTT------SSMSIIGNVQQQGTRVSFDLAN 482
Query: 435 KQLYFQRIDC 444
+ F C
Sbjct: 483 SLVGFSPNKC 492
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 105/378 (27%), Positives = 157/378 (41%), Gaps = 46/378 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ ++ +G PP ++DTGS L W++C PC C FDP+ S +Y + C
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVTCGDQR 210
Query: 156 CTNDCGGYP---------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF-LYDVGF 205
C P D C Y Y + ++ G + E F + G + + DV F
Sbjct: 211 CGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDDVVF 270
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
GC H N + S L G FSYC+ +++ + ++ GE
Sbjct: 271 GCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCL--VDHGSDVASKVVFGED 328
Query: 266 AILEGD-----------STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG- 313
L + S D YYV L+G+ +G ++L+I +DTW
Sbjct: 329 DALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNI------SSDTWGVGEG 382
Query: 314 ------VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ 367
IDSGTTL++ V AYQ +R+ D P P P CY+ + D
Sbjct: 383 EGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNVS-GVDRP 441
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
P ++ FA GA AE+ F + + + CLAV + G +SIIG QQN+
Sbjct: 442 EVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTG-----MSIIGNFQQQNF 496
Query: 427 NVAYDLVSKQLYFQRIDC 444
+V YDL + +L F C
Sbjct: 497 HVVYDLKNNRLGFAPRRC 514
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 111/428 (25%), Positives = 173/428 (40%), Gaps = 37/428 (8%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS-TVPVF 99
KL HRD+L NP + + + R +S+K K + L GI +
Sbjct: 34 KLAHRDTLWPNPLSRI----EDIIGADQKRHSLISRKRKFKG-GVKMDLGSGIDYGTAQY 88
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQ-----PCEQCGATTFDPSKSLTYATLPCDSS 154
+ +G P V+DTGS L WV C+ + F +S ++ T+ C +
Sbjct: 89 FTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGCFTQ 148
Query: 155 YCTND---------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
C D C C Y+ RY +G +QG E ++ K L +
Sbjct: 149 TCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRGLLV 208
Query: 206 GCSHNNAHFSDEQFTGVFGLGPAT-SSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG- 263
GCS + + S + GV GL + S T + G+K SYC+ + + N LI G
Sbjct: 209 GCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSNKNISNYLIFGY 268
Query: 264 -----EGAILEGDSTP--MSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
G +TP +++I Y + + GIS+G+ MLDI ++ D + G +
Sbjct: 269 SSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVW---DATTGGGTIL 325
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHF 376
DSGT+LT L +AY+ + + L P C+S + P + FH
Sbjct: 326 DSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNESKLPQLTFHL 385
Query: 377 AGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQ 436
GGA +S + V CL + +++G I QQNY +DL++
Sbjct: 386 KGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPAT-----NVVGNIMQQNYLWEFDLMAST 440
Query: 437 LYFQRIDC 444
L F C
Sbjct: 441 LSFAPSTC 448
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 161/366 (43%), Gaps = 35/366 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSY 155
++V+F +G PP ++D+GS L+WV+C PC QC A + PS S T++ +PC SS
Sbjct: 64 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSNSSTFSPVPCLSSD 123
Query: 156 CT-------NDCG-GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C C YP C Y Y + S+G F +E++ + V FGC
Sbjct: 124 CLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGV-----FAYESATVDGVRIDKVAFGC 178
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVE-KVGSKFSYCIGNLNYFEYAYNMLILGE-- 264
+N S GV GLG S S V G+KF+YC+ N + LI G+
Sbjct: 179 GSDN-QGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSSLIFGDEL 237
Query: 265 -GAILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
I + TP+ S YYV +E +++G K L I + ++ D + G DSGT
Sbjct: 238 ISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEI-DLLGNGGSIFDSGT 296
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
TLT+ PSAY + + P LC D FP+ F GA
Sbjct: 297 TLTYWFPSAYSHILAAFDSGVH--YPRAESVQGLDLCVE-LTGVDQPSFPSFTIEFDDGA 353
Query: 381 DLVLDAESVFYQESSSVFCLAVG--PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
+AE+ F + +V CLA+ S + G + IG + QQN+ V YD +
Sbjct: 354 VFQPEAENYFVDVAPNVRCLAMAGLASPLGG-----FNTIGNLLQQNFFVQYDREENLIG 408
Query: 439 FQRIDC 444
F C
Sbjct: 409 FAPAKC 414
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 170/366 (46%), Gaps = 50/366 (13%)
Query: 111 PQLAVLDTGSSLIWVKCQPCEQCGAT-------TFDPSKSLTYATLPCDSSYCT------ 157
P+ ++DTGS LIW +C+ A +DP +S T+A LPC C
Sbjct: 25 PRKLIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSF 84
Query: 158 NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDE 217
+C + C Y Y + + G + SE F F + +GFGC +A S
Sbjct: 85 KNCTS-KNRCVYEDVYGSAA-AVGVLASETFTFGAR---RAVSLRLGFGCGALSAG-SLI 138
Query: 218 QFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMS 276
TG+ GL P + SL+ ++ +FSYC+ + + + L+ G A L T
Sbjct: 139 GATGILGLSP---ESLSLITQLKIQRFSYCL--TPFADKKTSPLLFGAMADLSRHKTTRP 193
Query: 277 VIDGS----------YYVTLEGISLGEKMLDI-DPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+ + YYV L GISLG K L + +L + D G +DSG+T+ +L
Sbjct: 194 IQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPD--GGGGTIVDSGSTVAYL 251
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY-----SGNINRDLQGFPAMAFHFAGGA 380
V +A++ +++ V D+ + + + ++ + LC+ + + P + HF GGA
Sbjct: 252 VEAAFEAVKEAVMDVVRLPVANRTVED-YELCFVLPRRTAAAAMEAVQVPPLVLHFDGGA 310
Query: 381 DLVLDAESVFYQESSSVFCLAVGP-SDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
+VL ++ F + + + CLAVG +D +G +SIIG + QQN +V +D+ + F
Sbjct: 311 AMVLPRDNYFQEPRAGLMCLAVGKTTDGSG-----VSIIGNVQQQNMHVLFDVQHHKFSF 365
Query: 440 QRIDCE 445
C+
Sbjct: 366 APTQCD 371
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 112/378 (29%), Positives = 158/378 (41%), Gaps = 62/378 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDS 153
+ V +G P V DTGS L W +C+PC +Q A FDPSKS +Y + C S
Sbjct: 46 YVVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDA-IFDPSKSSSYTNITCTS 104
Query: 154 SYCT--------NDCGGYPD-ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
S CT ++C D C Y+ +Y + S G + E+ +D FL
Sbjct: 105 SLCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITATDIVDDFL---- 160
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK----FSYCIGNLNYFEYAYNML 260
FGC +N F G GL S+V++ S FSYC L + L
Sbjct: 161 FGCGQDNEGL----FNGSAGLMGLGRHPISIVQQTSSNYNKIFSYC---LPATSSSLGHL 213
Query: 261 ILGEGAILEGD--STPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
G A TP+S I G Y + + IS+G L P + + T+S G
Sbjct: 214 TFGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKL---PAV--SSSTFSAGGSI 268
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGF--- 369
IDSGT +T L P+ Y LR F+ + YP+ L CY DL G+
Sbjct: 269 IDSGTVITRLAPTVYAALRSA----FRRXMEKYPVANEAGLLDTCY------DLSGYKEI 318
Query: 370 --PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P + F F+GG + L + ES CLA NG D+++ G + Q+
Sbjct: 319 SVPRIDFEFSGGVTVELXHRGILXVESEQQVCLAFAA---NGSD-NDITVFGNVQQKTLE 374
Query: 428 VAYDLVSKQLYFQRIDCE 445
V YD+ ++ F C+
Sbjct: 375 VVYDVKGGRIGFGAAGCK 392
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 106/377 (28%), Positives = 160/377 (42%), Gaps = 47/377 (12%)
Query: 89 LHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSK 142
L PG S V + +G P P + V+DTGSSL W++C PC Q G FDP
Sbjct: 106 LTPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGP-VFDPKT 164
Query: 143 SLTYATLPCDSSYC-------TNDCGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSD 194
S +YA + C S C N P C Y Y + S G + + +F
Sbjct: 165 SSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSF---- 220
Query: 195 EGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCIGNLNYF 253
G + + +GC +N G+ GL S + L +G FSYC+ + +
Sbjct: 221 -GANSVPNFYYGCGQDNEGLFGRS-AGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSS 278
Query: 254 EYAYNMLILGEGAILEG--DSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
Y L G+ G TPM ++ D Y+++L G+++ K L + + + T
Sbjct: 279 GY------LSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPT 332
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
IDSGT +T L S Y L K V +G C+ G ++ L+
Sbjct: 333 ------IIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASK-LRA 385
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
PA++ F+GGA L L A ++ + CLA P+ + +IIG QQ ++V
Sbjct: 386 VPAVSMAFSGGATLKLSAGNLLVDVDGATTCLAFAPA-------RSAAIIGNTQQQTFSV 438
Query: 429 AYDLVSKQLYFQRIDCE 445
YD+ S ++ F C
Sbjct: 439 VYDVKSNRIGFAAAGCS 455
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 153/366 (41%), Gaps = 44/366 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G PP VLDTGS ++W++C PC+ C + T F+P KS ++A + C +
Sbjct: 42 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPL 101
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C C Y + Y +G + G +E F +T + V GC H+N
Sbjct: 102 CRRLESPGC-NQRQTCLYQVSYGDGSYTTGEFVTETLTFR-----RTKVEQVALGCGHDN 155
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
+ S KFSYC+ + + +++
Sbjct: 156 EGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTAR 215
Query: 272 STPMSV---IDGSYYVTLEGISLGEKMLD-IDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
TP+ +D YYV L GIS+G + I + FK + T + GV ID GT++T L
Sbjct: 216 FTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRT-GNGGVIIDCGTSVTRLNK 274
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQG-----FPAMAFHFAGG 379
AY LR D F+ S P + L CY DL G P + HF G
Sbjct: 275 PAYIALR----DAFRAGASSLKSAPEFSLFDTCY------DLSGKTTVKVPTVVLHFR-G 323
Query: 380 ADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
AD+ L A + + S FC A + LSIIG I QQ + V YDL S ++
Sbjct: 324 ADVSLPASNYLIPVDGSGRFCFAFAGTT------SGLSIIGNIQQQGFRVVYDLASSRVG 377
Query: 439 FQRIDC 444
F C
Sbjct: 378 FSPRGC 383
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 111/370 (30%), Positives = 159/370 (42%), Gaps = 52/370 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ +G PP VLDTGS ++W++C PC+ C + T F+P KS ++A + C +
Sbjct: 129 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPL 188
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C C Y + Y +G + G +E F +T + V GC H+N
Sbjct: 189 CRRLESPGCNQR-QTCLYQVSYGDGSYTTGEFVTETLTFR-----RTKVEQVALGCGHDN 242
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVG----SKFSYCIGNLNYFEYAYNMLILGEGAI 267
+ F G GL S + G KFSYC+ + + +++
Sbjct: 243 ----EGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVS 298
Query: 268 LEGDSTPMSV---IDGSYYVTLEGISLGEKMLD-IDPNLFKKNDTWSDAGVFIDSGTTLT 323
TP+ +D YYV L GIS+G + I + FK + T + GV ID GT++T
Sbjct: 299 RTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRT-GNGGVIIDCGTSVT 357
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQG-----FPAMAFH 375
L AY LR D F+ S P + L CY DL G P + H
Sbjct: 358 RLNKPAYIALR----DAFRAGASSLKSAPEFSLFDTCY------DLSGKTTVKVPTVVLH 407
Query: 376 FAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F GAD+ L A + + S FC A + LSIIG I QQ + V YDL S
Sbjct: 408 FR-GADVSLPASNYLIPVDGSGRFCFAFAGTT------SGLSIIGNIQQQGFRVVYDLAS 460
Query: 435 KQLYFQRIDC 444
++ F C
Sbjct: 461 SRVGFSPRGC 470
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 156/374 (41%), Gaps = 60/374 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QCGAT---TFDPSKSLTYATLPC--- 151
F V G P + DTGS + W++C PC C FDP+KS TY+ +PC
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCGHP 194
Query: 152 -----DSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
D S C+N C Y + Y +G S G + E + ++ F FG
Sbjct: 195 QCAAADGSKCSNG------TCLYKVEYGDGSSSAGVLSHETLSLTSTRALPGF----AFG 244
Query: 207 CSHNN-AHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGE 264
C N F D G+ GLG S S G FSYC+ + N + L +G
Sbjct: 245 CGQTNLGDFGDVD--GLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNT---THGYLTIGP 299
Query: 265 GAILEGDSTPMSVI------DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
D + + Y+V L I +G +L + P LF +D G F+DS
Sbjct: 300 TTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLF------TDDGTFLDS 353
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG-----FPAMA 373
GT LT+L P AY LR + P+ DP + CY D G PA++
Sbjct: 354 GTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDP-FDTCY------DFTGQSAIFIPAVS 406
Query: 374 FHFAGGA--DLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
F F+ G+ DL +F +++ ++ CL + +I+G + Q+N V Y
Sbjct: 407 FKFSDGSVFDLSFFGILIFPDDTAPAIGCLGF----VARPSAMPFTIVGNMQQRNTEVIY 462
Query: 431 DLVSKQLYFQRIDC 444
D+ ++++ F C
Sbjct: 463 DVAAEKIGFASASC 476
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/386 (26%), Positives = 169/386 (43%), Gaps = 56/386 (14%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V ++Y +G PPV +DTGS ++WV C C C T+ FDP S T +
Sbjct: 72 VGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSS 131
Query: 148 TLPCDSSYCTND-------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
+ C C N C ++C Y +Y +G + G S+ + T EG
Sbjct: 132 MIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTT 191
Query: 201 YD---VGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLN 251
V FGCS+ SD G+FG G S S + G FS+C L
Sbjct: 192 NSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHC---LK 248
Query: 252 YFEYAYNMLILGEGAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTW 309
+L+LGE I+E + S++ Y + L+ I++ + L ID ++F +++
Sbjct: 249 GDSSGGGILVLGE--IVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNS- 305
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGN-----INR 364
G +DSGTTL +L AY D F + + + GN +
Sbjct: 306 --RGTIVDSGTTLAYLAEEAY--------DPFVSAITASIPQSVHTVVSRGNQCYLITSS 355
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGM 420
+ FP ++ +FAGGA ++L + Q++S +V+C +G I G+ ++I+G
Sbjct: 356 VTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWC--IGFQKIQGQ---GITILGD 410
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDCEL 446
+ ++ V YDL +++ + DC L
Sbjct: 411 LVLKDKIVVYDLAGQRIGWANYDCSL 436
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 121/444 (27%), Positives = 183/444 (41%), Gaps = 75/444 (16%)
Query: 33 GKPKRLVTKLLHRDSLLYNPNDTVDAQAQRT------LNMSMARFIYLSQKSSQKAHDTR 86
G + +++H+ ND D +A+ T LN R Y++ + S+
Sbjct: 65 GPKTKASLEVVHKHGPCSQLNDH-DGKAKSTTPHSDILNQDKERVKYINSRLSKNLGQDS 123
Query: 87 AHLHPGISTVPV----------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--- 133
+ +T+P ++V +G P + DTGS L W +C+PC +
Sbjct: 124 SVEELDSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYK 183
Query: 134 -GATTFDPSKSLTYATLPCDSSYCT-------ND--CGGYPDECWYNIRYTNGPDSQGTI 183
FDPSKS +Y+ + C S+ CT ND C C Y I+Y + S G
Sbjct: 184 QQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYF 243
Query: 184 GSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK- 242
E+ +D FL FGC NN F G GL S V++ +K
Sbjct: 244 SRERLTVTATDVVDNFL----FGCGQNNQGL----FGGSAGLIGLGRHPISFVQQTAAKY 295
Query: 243 ---FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVID-GS--YYVTLEGISLGEKML 296
FSYC+ + + + G L+ TP S I GS Y + + I++G L
Sbjct: 296 RKIFSYCLPSTSSSTGHLSFGPAATGRYLK--YTPFSTISRGSSFYGLDITAIAVGGVKL 353
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL 356
+ + T+S G IDSGT +T L P+AY LR F+ + YP +
Sbjct: 354 PVS------SSTFSTGGAIIDSGTVITRLPPTAYGALRSA----FRQGMSKYPSAGELSI 403
Query: 357 ---CYSGNINRDLQGF-----PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDIN 408
CY DL G+ P + F FAGG + L + + + S+ CLA N
Sbjct: 404 LDTCY------DLSGYKVFSIPTIEFSFAGGVTVKLPPQGILFVASTKQVCLAFAA---N 454
Query: 409 GERFKDLSIIGMIAQQNYNVAYDL 432
G+ D++I G + Q+ V YD+
Sbjct: 455 GDD-SDVTIYGNVQQRTIEVVYDV 477
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 123/426 (28%), Positives = 188/426 (44%), Gaps = 42/426 (9%)
Query: 38 LVTKLLHRDSL---LYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS 94
T+L+HRDS L+N ++T D + + S R +++ + ++ A P I
Sbjct: 37 FTTELIHRDSPNSPLFNASETTDIRLANAVERSADR---VNRFNDLISNSITAAEFPSIL 93
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKC---QPC-EQCGATTFDPSKSLTYATLP 150
F + SIG PP L + TGS L+W+ C +PC C FDP +S TY +P
Sbjct: 94 DNGDFLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLRFFDPMESSTYKNVP 153
Query: 151 CDSSYC--TNDCGGYPDECWYNI--RYTNG-PDSQGTIGSEQFNFETSDEGKTFLY-DVG 204
CDS C TN +C+Y+ R+ + PD + + N S GK+F+ + G
Sbjct: 154 CDSYRCQITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLN---STTGKSFMLPNTG 210
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEK-VGSKFSYCIGNLNYFEYAYNMLILG 263
F C N D G+ GLG + S + + + KFS+CI + Y + L G
Sbjct: 211 FIC--GNRIGGDYPGVGILGLGHGSLSLLNRISHLIDGKFSHCI--VPYSSNQTSKLSFG 266
Query: 264 EGAILEGD---STPMSVIDGSYYVTLE--GISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
+ A++ G ST + + G Y TL GIS+G K + + G+ +DS
Sbjct: 267 DKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAG----GIGSDYYMNGLGMDS 322
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAG 378
GT T+ Y L +V Q P YP DP L + D P + HF G
Sbjct: 323 GTMFTYFPEYFYSQLEYDVRYAIQQ-EPLYP-DPTRRLRLCYRYSPDFSP-PTITMHFEG 379
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
G+ + L + + F + + + CLA S + ++ G Q N + YDL + L
Sbjct: 380 GS-VELSSSNSFIRMTEDIVCLAFATSSSEQD-----AVFGYWQQTNLLIGYDLDAGFLS 433
Query: 439 FQRIDC 444
F + DC
Sbjct: 434 FLKTDC 439
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 122/427 (28%), Positives = 180/427 (42%), Gaps = 68/427 (15%)
Query: 61 QRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV-------FYVNFSIGQPPVPQL 113
+R + S AR LS ++ + +PV + V+ +IG PP P
Sbjct: 51 RRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRPSGDLEYVVDLAIGTPPQPVS 110
Query: 114 AVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSYCTN----DCGGYPDE 166
A+LDTGS LIW +C PC C + F P +S +Y + C + C++ C PD
Sbjct: 111 ALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSC-ERPDT 169
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV--GFGCSHNNAHFSDEQFTGVFG 224
C Y Y +G + G +E+F F +S G V GFGC N S +G+ G
Sbjct: 170 CTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVG-SLNNGSGIVG 228
Query: 225 LGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGE-------GAILEGDSTPMS 276
G + SLV ++ +FSYC+ +Y + L+ G A +TP+
Sbjct: 229 FG---RNPLSLVSQLSIRRFSYCL--TSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLL 283
Query: 277 VIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSA---- 329
+ YYV G+++G + L I + F S GV +DSGT LT L+P+A
Sbjct: 284 QSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGS-GGVIVDSGTALT-LLPAAVLAE 341
Query: 330 -YQTLRKEV----------EDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAG 378
+ R+++ ED L+P+ AW S + P M HF
Sbjct: 342 VVRAFRQQLRLPFANGGNPEDGVCFLVPA-----AWRRSSSTS----QMPVPRMVLHFQ- 391
Query: 379 GADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GADL L + V CL + S +G S IG + QQ+ V YDL ++ L
Sbjct: 392 GADLDLPRRNYVLDDHRRGRLCLLLADSGDDG------STIGNLVQQDMRVLYDLEAETL 445
Query: 438 YFQRIDC 444
C
Sbjct: 446 SIAPARC 452
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 117/409 (28%), Positives = 181/409 (44%), Gaps = 50/409 (12%)
Query: 71 FIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC 130
F +QK Q + D + H TV ++G PP VLDTGS L W+ C+
Sbjct: 42 FSLKTQKLPQSSSDKLSFRHNVTLTV-----TLAVGDPPQNISMVLDTGSELSWLHCKKS 96
Query: 131 EQCGATTFDPSKSLTYATLPCDSSYC---TND------CGGYPDECWYNIRYTNGPDSQG 181
G+ F+P S TY+ +PC S C T D C C I Y + +G
Sbjct: 97 PNLGS-VFNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEG 155
Query: 182 TIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG- 240
+ E F + T + G S N+ D + TG+ G+ + S V ++G
Sbjct: 156 NLAHETFVIGSVTRPGTLFGCMDSGLSSNSEE--DAKSTGLMGM---NRGSLSFVNQLGF 210
Query: 241 SKFSYCI------GNLNYFEYAYNMLILGEGAILEGDSTPMSVIDG-SYYVTLEGISLGE 293
SKFSYCI G L + +Y+ L + L STP+ D +Y V LEGI +G
Sbjct: 211 SKFSYCISGSDSSGFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGS 270
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PSY 348
K+L + ++F + T + +DSGT T+L+ Y L+ E + +L P +
Sbjct: 271 KILSLPKSVFVPDHTGA-GQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDF 329
Query: 349 PMDPAWHLCYS-GNINR-DLQGFPAMAFHFAGGADLVLDAESVFYQESSS-------VFC 399
LCY G+ R + G P ++ F GA++ + + + Y+ + + V+C
Sbjct: 330 VFQGTMDLCYKVGSTTRPNFSGLPMVSLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVYC 388
Query: 400 LAVGPSDING-ERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ-RIDCEL 446
G SD+ G E F +IG QQN + +DL ++ F + C+L
Sbjct: 389 FTFGNSDLLGIEAF----VIGHHHQQNVWMEFDLAKSRVGFAGNVRCDL 433
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 159/358 (44%), Gaps = 34/358 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+++ IG+PP VLDTGS + W++C PC +C + FDP S +Y+ + CD+
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYSPIRCDAPQ 208
Query: 156 CT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C ++C C Y + Y +G + G +F ET G + +V GC HNN
Sbjct: 209 CKSLDLSECRN--GTCLYEVSYGDGSYTVG-----EFATETVTLGTAAVENVAIGCGHNN 261
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
F G GL S +V + FSYC+ +N A + L
Sbjct: 262 EGL----FVGAAGLLGLGGGKLSFPAQVNATSFSYCL--VNRDSDAVSTLEFNSPLPRNV 315
Query: 271 DSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
+ P+ +D YY+ L+GIS+G + L I ++F+ D G+ IDSGT +T L
Sbjct: 316 VTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEV-DAIGGGGIIIDSGTAVTRLRS 374
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAE 387
Y LR +G +P + CY + +Q P ++FHF G +L L A
Sbjct: 375 EVYDALRDAFVKGAKG-IPKANGVSLFDTCYDLSSRESVQ-VPTVSFHFPEGRELPLPAR 432
Query: 388 SVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ +S FC A P+ LSI+G + QQ V +D+ + + F C
Sbjct: 433 NYLIPVDSVGTFCFAFAPTT------SSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 165/359 (45%), Gaps = 43/359 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ + ++G PPV ++DT S L+W +C PC+ C FDP K C+S +
Sbjct: 31 YLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLKE-------CNSFF 83
Query: 156 CTNDCGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH- 213
D P++ C Y Y + ++G + E F ++D GK + + FGC HNN
Sbjct: 84 ---DHSCSPEKACDYVYAYADDSATKGMLAKEIATFSSTD-GKPIVESIIFGCGHNNTGV 139
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGSK-FSYCIGNLNYFEYAYNMLILGEGAILEGD- 271
F++ + G S + GSK FS C+ + + + LGE + + G+
Sbjct: 140 FNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLGEASDVSGEG 199
Query: 272 --STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
+TP+ +G Y VTLEGIS+G+ + F ++ S + IDSGT T+L
Sbjct: 200 VVTTPLVSEEGQTPYLVTLEGISVGDTFVP-----FNSSEMLSKGNIMIDSGTPETYLPQ 254
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDP--AWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
Y L +E++ Q LP +DP LCY N L+G P + HF GAD+ L
Sbjct: 255 EFYDRLVEELK--VQINLPPIHVDPDLGTQLCYKSETN--LEG-PILTAHFE-GADVKLL 308
Query: 386 AESVFYQESSSVFCLAV-GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
F VFC A+ G +D L I G AQ N + +DL + ++F+ D
Sbjct: 309 PLQTFIPPKDGVFCFAMTGTTD-------GLYIFGNFAQSNVLIGFDLDKRIVFFKPTD 360
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/412 (26%), Positives = 176/412 (42%), Gaps = 63/412 (15%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPV--------FYVNFSIGQPPVPQLAVLDTGS 120
AR ++LS K++ G+S+ PV + V +G P P L LDT +
Sbjct: 49 ARLLFLSSKAAST----------GVSSAPVASGQSPPSYVVRAGLGSPAQPILLALDTSA 98
Query: 121 SLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYCTN------------DCGGYPDE 166
W C PC C + F P+ S +YA LPC S+ CT D
Sbjct: 99 DATWAHCSPCGTCPSSGSLFAPANSTSYAPLPCSSTMCTVLQGQPCPAQDPYDSSAPLPM 158
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGVFGL 225
C + + + Q ++ S+ + GK + + FGC S + ++ G+ GL
Sbjct: 159 CAFTKPFADA-SFQASLASDWLHL-----GKDAIPNYAFGCVSAVSGPTANLPKQGLLGL 212
Query: 226 GPATSSTHSLVEKVGSK----FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVI 278
G +L+ +VG+ FSYC+ + + ++ ++ + G TPM
Sbjct: 213 G---RGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGSLRLGAAGQPRGVRYTPMLKNPNR 269
Query: 279 DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE 338
YYV + G+S+G + + F D + AG +DSGT +T P Y LR+E
Sbjct: 270 SSLYYVNVTGLSVGRAPVKVPAGSFAF-DPATGAGTVVDSGTVITRWTPPVYAALREEFR 328
Query: 339 DLFQGLLPS-YPMDPAWHLCYSGNINRDLQGF-PAMAFHFAGGADLVLDAESVFYQESSS 396
PS Y A+ C+ N + G PA+ H GG DL L E+ S++
Sbjct: 329 RHVAA--PSGYTSLGAFDTCF--NTDEVAAGVAPAVTVHMDGGLDLALPMENTLIHSSAT 384
Query: 397 -VFCLAV--GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ CLA+ P ++N ++++ + QQN V +D+ + ++ F R C
Sbjct: 385 PLACLAMAEAPQNVNAV----VNVLANLQQQNLRVVFDVANSRVGFARESCN 432
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 174/367 (47%), Gaps = 40/367 (10%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQ---PCEQCGATTFDPSKSLTYATLPCDSSYCT 157
+ +G PP P +LD GS L+W +C P + FD ++S +++ LPCDS C
Sbjct: 109 LTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLC- 167
Query: 158 NDCGGYPD------ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
+ G + + +C Y Y + G + +E F F + G + ++ FGC
Sbjct: 168 -EAGTFTNKTCTDRKCAYENDY-GIMTATGVLATETFTF-GAHHGVS--ANLTFGCG-KL 221
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL--- 268
A+ + + +G+ GL P S L + +KFSYC+ + + + ++ G A L
Sbjct: 222 ANGTIAEASGILGLSPGPLSM--LKQLAITKFSYCL--TPFADRKTSPVMFGAMADLGKY 277
Query: 269 ----EGDSTPM---SVIDGSYYVTLEGISLGEKMLDI-DPNLFKKNDTWSDAGVFIDSGT 320
+ + P+ V D YYV + G+S+G K LD+ L K D G +DS T
Sbjct: 278 KTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPD--GTGGTVLDSAT 335
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG--FPAMAFHFAG 378
TL +LV A+ L+K V + + + + +D + +C+ ++G P + HF G
Sbjct: 336 TLAYLVEPAFTELKKAVMEGIKLPVANRSVD-DYPVCFELPRGMSMEGVQVPPLVLHFDG 394
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
A++ L ++ F + S + CLAV + G ++IG + QQN +V YD+ +++
Sbjct: 395 DAEMSLPRDNYFQEPSPGMMCLAVMQAPFEGAP----NVIGNVQQQNMHVLYDVGNRKFS 450
Query: 439 FQRIDCE 445
+ C+
Sbjct: 451 YAPTKCD 457
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/352 (29%), Positives = 150/352 (42%), Gaps = 42/352 (11%)
Query: 115 VLDTGSSLIWVKCQPCE-QCGATT---FDPSKSLTYATLPCDSSYCT-------ND--CG 161
+LDTGSSL W++CQPC C A +DPS S TY L C S C+ ND C
Sbjct: 2 ILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCE 61
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTG 221
+ C Y Y + S G + + +S F Y GC +N G
Sbjct: 62 TDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTY----GCGQDNQGLFGRA-AG 116
Query: 222 VFGLGP-ATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SV 277
+ GL S L K G FSYC+ N L +G + TPM S
Sbjct: 117 IIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSK 176
Query: 278 IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEV 337
Y++ L I++ + LD+ +++ IDSGT +T L S Y LR+
Sbjct: 177 NPSLYFLRLTAITVSGRPLDLAAAMYR-------VPTLIDSGTVITRLPMSMYAALRQAF 229
Query: 338 EDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQES 394
+ Y PA+ + C+ G++ + + P + F GGADL L A S+ +
Sbjct: 230 VKIMS---TKYAKAPAYSILDTCFKGSL-KSISAVPEIKMIFQGGADLTLRAPSILIEAD 285
Query: 395 SSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ CLA G S N ++IIG QQ YN+AYD+ + ++ F C
Sbjct: 286 KGITCLAFAGSSGTN-----QIAIIGNRQQQTYNIAYDVSTSRIGFAPGSCH 332
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 168/382 (43%), Gaps = 53/382 (13%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDC 160
V+ ++G PP VLDTGS L W+ C+ +Q + F+P S +Y +PC S C
Sbjct: 72 VSLTVGTPPQSVTMVLDTGSELSWLHCKK-QQNINSVFNPHLSSSYTPIPCMSPICKTRT 130
Query: 161 GGY--------PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL--YDVGFGCSHN 210
+ + C + Y + +G + S+ F S + D GF + N
Sbjct: 131 RDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQPGIIFGSMDSGFSSNAN 190
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAI-- 267
D + TG+ G+ + S V ++G KFSYCI + A +L+ G+
Sbjct: 191 E----DSKTTGLMGM---NRGSLSFVTQMGFPKFSYCISGKD----ASGVLLFGDATFKW 239
Query: 268 --------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
L +TP+ D +Y V L GI +G K L + +F + T + +DS
Sbjct: 240 LGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQ-TMVDS 298
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINRDLQGFPAMA 373
GT T+L+ S Y LR E +G+L P++ + A LC+ + PA+
Sbjct: 299 GTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPAVT 358
Query: 374 FHFAGGADLVLDAESVFY---------QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
F GA++ + E + Y + + V+CL G SD+ G + +IG QQ
Sbjct: 359 MVFE-GAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLG---IEAYVIGHHHQQ 414
Query: 425 NYNVAYDLVSKQLYFQRIDCEL 446
N + +DLV+ ++ F CEL
Sbjct: 415 NVWMEFDLVNSRVGFADTKCEL 436
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 124/464 (26%), Positives = 189/464 (40%), Gaps = 69/464 (14%)
Query: 31 AAGKPKRLVTKLLHRDSLLYNPN--DTVDAQAQRTLNMSMARFIYLSQKSSQKAHDT--- 85
AA + +LLHRDS N + + + QR + A I + + D
Sbjct: 63 AASSSSAMHVRLLHRDSFAVNATGAELLARRLQRD-ELRAAWIISTAAANGTPPPDVVGL 121
Query: 86 ---RAHLHPGISTVPV---FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GAT 136
R + P +S P + ++G P V L LDT S L W++CQPC +C
Sbjct: 122 STGRGLVAPVVSRAPTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGP 181
Query: 137 TFDPSKSLTYATLPCDSSYCT---NDCGGYPDE--CWYNIRYTNGPDSQGTIGSEQFNFE 191
FDP S +Y + D+ C GG C Y + Y +G D G+ + +
Sbjct: 182 VFDPRHSTSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVLYGDG-DGHGSTSTSVGDL- 239
Query: 192 TSDEGKTFLYDV-----GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--SKFS 244
+E TF V GC H+N G+ GL S + +G + FS
Sbjct: 240 -VEETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFS 298
Query: 245 YCIGN-LNYFEYAYNMLILGEGAILEGDSTPMSVIDGS---------YYVTLEGISLG-- 292
YC+ + ++ + L G GA+ D++P + + YYV L G+S+G
Sbjct: 299 YCLVDFISGPGSPSSTLTFGAGAV---DTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGV 355
Query: 293 ------EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLP 346
E+ L +DP GV +DSGTT+T L AY R GL
Sbjct: 356 RVPGVTERDLQLDPYT-------GHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQ 408
Query: 347 SYPMDPA--WHLCYSGNINRDLQ---GFPAMAFHFAGGADLVLDAES-VFYQESSSVFCL 400
P+ + CY+ L+ PA++ HFAGG +L L ++ + +S C
Sbjct: 409 VSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCF 468
Query: 401 AVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
A G + +S+IG I QQ + V YD+ +++ F C
Sbjct: 469 A-----FAGTGDRSVSVIGNILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 110/365 (30%), Positives = 170/365 (46%), Gaps = 44/365 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-------QCGATTFDPSKSLTYATLPC 151
++ +GQP V DTGS + W++CQPC+ Q G FDP S +Y+ L C
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGP-IFDPKSSSSYSPLSC 242
Query: 152 DSSYC--TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
DS C ++ + C Y + Y +G + G + +E F+F S+ + ++ GC H
Sbjct: 243 DSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS----IPNLPIGCGH 298
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
+N F G GL SL ++ + FSYC+ +L+ + + L A
Sbjct: 299 DNEGL----FVGAAGLIGLGGGAISLSSQLEATSFSYCLVDLD----SESSSTLDFNADQ 350
Query: 269 EGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
DS ++ YV + G+S+G K L I + F+ +++ S G+ +DSGTT+T
Sbjct: 351 PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGS-GGIIVDSGTTIT 409
Query: 324 WLVPSAYQTLRKEVEDLFQGL---LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
+ Y LR D F GL LP P + CY + +++ P +AF G
Sbjct: 410 EIPSDVYDVLR----DAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVE-VPTIAFILPGEN 464
Query: 381 DLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
L L A++ +Q +S+ FCLA PS LSIIG + QQ V+YDL + + F
Sbjct: 465 SLQLPAKNCLFQVDSAGTFCLAFLPSTF------PLSIIGNVQQQGIRVSYDLANSLVGF 518
Query: 440 QRIDC 444
C
Sbjct: 519 STDKC 523
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/372 (27%), Positives = 172/372 (46%), Gaps = 43/372 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
FY +G P ++DTGS++ ++ C+ C CG T FDP KS T L C
Sbjct: 13 FYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACGDPL 72
Query: 156 C---TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-NN 211
C T C D C+Y+ Y S+G + + F F SD + FGC +
Sbjct: 73 CNCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPVRLV----FGCENGET 128
Query: 212 AHFSDEQFTGVFGLGPATSSTHS-LVEK--VGSKFSYCIGNLNYFEYAYN-MLILGEGAI 267
+ G+ G+G ++ S LV++ + FS C G Y + +L+LG+ +
Sbjct: 129 GEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFG------YPKDGILLLGDVTL 182
Query: 268 LEGDSTP----MSVIDGSYY-VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
EG +T ++ + YY V ++GI++ + L D ++F + G +DSGTT
Sbjct: 183 PEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRG-----YGTVLDSGTTF 237
Query: 323 TWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWH-LCYSGNIN--RDLQG-FPAMAFHF 376
T+L A++ + K V D + GL + DP ++ +C+ G + +DL FP F F
Sbjct: 238 TYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLDKYFPPAEFVF 297
Query: 377 AGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQ 436
GGA L L + + +CL + + +G +++G ++ ++ V YD + +
Sbjct: 298 GGGAKLTLPPLRYLFLSKPAEYCLGIFDNGNSG------ALVGGVSVRDVVVTYDRRNSK 351
Query: 437 LYFQRIDCELLA 448
+ F + C +A
Sbjct: 352 VGFTTMACADVA 363
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 119/429 (27%), Positives = 183/429 (42%), Gaps = 60/429 (13%)
Query: 58 AQAQRTLNMS--MARFIYLSQKSSQKAHDTRA---HLHPGISTVPV----FYVNFSIGQP 108
A A R L+ + R S+ S + RA + PG T V + V+ +IG P
Sbjct: 61 ADAGRGLSTRELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTP 120
Query: 109 PVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTN----DCG 161
P P +LDTGS L W +C PC C F+PS+S+T++ LPCD C + CG
Sbjct: 121 PQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCG 180
Query: 162 GYP---DECWYNIRYTNGPDSQGTIGSEQFNFETSDE--GKTFLYDVGFGCS-HNNAHFS 215
C Y Y + + G + S+ F+F ++D G + D+ FGC NN F
Sbjct: 181 EQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFV 240
Query: 216 DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNML--------------- 260
+ TG+ G S + ++ FSYC + E + L
Sbjct: 241 SNE-TGIAGFSRGALSMPAQLKV--DNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGH 297
Query: 261 -ILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
++ A++ S+ + +YY++L+G+++G L I ++F + + G +DSG
Sbjct: 298 GVVQSTALIRYHSSQLK----AYYISLKGVTVGTTRLPIPESVFALKEDGT-GGTIVDSG 352
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
T +T L P A L + L LC+S PA+ HF G
Sbjct: 353 TGMTML-PEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAK-PDVPALVLHFE-G 409
Query: 380 ADLVLDAESVFYQESSS----VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
A L L E+ ++ + + CLA+ + DLS+IG QQN +V YDL +
Sbjct: 410 ATLDLPRENYMFEIEEAGGIRLTCLAINAGE-------DLSVIGNFQQQNMHVLYDLAND 462
Query: 436 QLYFQRIDC 444
L F C
Sbjct: 463 MLSFVPARC 471
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 92/376 (24%), Positives = 168/376 (44%), Gaps = 41/376 (10%)
Query: 87 AHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTF 138
A+L + + +++ +G P +DTGS ++WV C C++C T +
Sbjct: 15 AYLVYFVHWLSLYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLY 74
Query: 139 DPSKSLTYATLPCDSSYCTNDCGGY-PD-----ECWYNIRYTNGPDSQGTIGSEQFNFE- 191
DP+ S++ + CD +CT+ G PD C YN+ Y +G + G S+ FE
Sbjct: 75 DPASSVSATRVSCDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFER 134
Query: 192 TSDEGKTFLYD--VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGN 249
+ +T L + V FGC + GLG + + ++ + F++C+ N
Sbjct: 135 VTGNLQTGLSNGTVTFGCGAQQSG----------GLGTSGEA----LDGILGAFAHCLDN 180
Query: 250 LNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
+N + +GE + ++TPM Y V ++ I +G +L++ ++F D
Sbjct: 181 VN----GGGIFAIGELVSPKVNTTPMVPNQAHYNVYMKEIEVGGTVLELPTDVFDSGDR- 235
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
G IDSGTTL +L Y ++ E+ GL + YSGN++ GF
Sbjct: 236 --RGTIIDSGTTLAYLPEVVYDSMMNEIRSQQPGLSLHTVEEQFICFKYSGNVD---DGF 290
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P + FHF L + +Q S ++C + + +D++++G + N V
Sbjct: 291 PDIKFHFKDSLTLTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVL 350
Query: 430 YDLVSKQLYFQRIDCE 445
YD+ ++ + + +C+
Sbjct: 351 YDIENQAIGWTEYNCK 366
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 169/363 (46%), Gaps = 60/363 (16%)
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC-----------TNDC 160
++DT S L WV+C PC C FDP+ S +YA LPC+SS C
Sbjct: 141 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 200
Query: 161 GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQF 219
GG C Y + Y +G SQG + ++ + + FGC + N F
Sbjct: 201 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSL-----AGEVIDGFVFGCGTSNQGPFGGT-- 253
Query: 220 TGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV- 277
+G+ GLG + S S +++ G FSYC+ L E + L+LG+ + +STP+
Sbjct: 254 SGLMGLGRSQLSLISQTMDQFGGVFSYCL-PLKESE-SSGSLVLGDDTSVYRNSTPIVYT 311
Query: 278 ------IDGS-YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG-VFIDSGTTLTWLVPSA 329
+ G Y+V L GI++G + ++ S AG V +DSGT +T LVPS
Sbjct: 312 TMVSDPVQGPFYFVNLTGITIGGQEVE------------SSAGKVIVDSGTIITSLVPSV 359
Query: 330 YQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDA 386
Y ++ E F YP P + + C++ R++Q P++ F F G ++ +D+
Sbjct: 360 YNAVKAE----FLSQFAEYPQAPGFSILDTCFNLTGFREVQ-IPSLKFVFEGNVEVEVDS 414
Query: 387 ESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
V Y SS CLA+ + + E + SIIG Q+N V +D + Q+ F + C
Sbjct: 415 SGVLYFVSSDSSQVCLAL--ASLKSE--YETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 470
Query: 445 ELL 447
+ +
Sbjct: 471 DYI 473
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 119/429 (27%), Positives = 183/429 (42%), Gaps = 60/429 (13%)
Query: 58 AQAQRTLNMS--MARFIYLSQKSSQKAHDTRA---HLHPGISTVPV----FYVNFSIGQP 108
A A R L+ + R S+ S + RA + PG T V + V+ +IG P
Sbjct: 35 ADAGRGLSTRELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTP 94
Query: 109 PVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTN----DCG 161
P P +LDTGS L W +C PC C F+PS+S+T++ LPCD C + CG
Sbjct: 95 PQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCG 154
Query: 162 GYP---DECWYNIRYTNGPDSQGTIGSEQFNFETSDE--GKTFLYDVGFGCS-HNNAHFS 215
C Y Y + + G + S+ F+F ++D G + D+ FGC NN F
Sbjct: 155 EQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFV 214
Query: 216 DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNML--------------- 260
+ TG+ G S + ++ FSYC + E + L
Sbjct: 215 SNE-TGIAGFSRGALSMPAQLKV--DNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGH 271
Query: 261 -ILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
++ A++ S+ + +YY++L+G+++G L I ++F + + G +DSG
Sbjct: 272 GVVQSTALIRYHSSQLK----AYYISLKGVTVGTTRLPIPESVFALKEDGT-GGTIVDSG 326
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
T +T L P A L + L LC+S PA+ HF G
Sbjct: 327 TGMTML-PEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAK-PDVPALVLHFE-G 383
Query: 380 ADLVLDAESVFYQESSS----VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
A L L E+ ++ + + CLA+ + DLS+IG QQN +V YDL +
Sbjct: 384 ATLDLPRENYMFEIEEAGGIRLTCLAINAGE-------DLSVIGNFQQQNMHVLYDLAND 436
Query: 436 QLYFQRIDC 444
L F C
Sbjct: 437 MLSFVPARC 445
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 112/422 (26%), Positives = 170/422 (40%), Gaps = 31/422 (7%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS-TVPVF 99
KL HRD+LL P + + + R +S+K + + L GI +
Sbjct: 30 KLAHRDTLLPKPLSRI----EDVIGADQKRHSLISRKRNSTV-GVKMDLGSGIDYGTAQY 84
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYCT 157
+ +G P V+DTGS L WV C+ + F +S ++ T+ C + C
Sbjct: 85 FTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQTCK 144
Query: 158 ND---------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D C C Y+ RY +G +QG E ++ L GCS
Sbjct: 145 VDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCS 204
Query: 209 HNNAHFSDEQFTGVFGLGPAT-SSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
+ S + GV GL + S T + G+KFSYC+ + + N LI G
Sbjct: 205 SSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRS 264
Query: 268 LE---GDSTPMSV--IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+ +TP+ + I Y + + GISLG MLDI ++ D S G +DSGT+L
Sbjct: 265 TKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVW---DATSGGGTILDSGTSL 321
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
T L +AY+ + + L P C+S ++ P + FH GGA
Sbjct: 322 TLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARF 381
Query: 383 VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
+S + V CL + ++IG I QQNY +DL++ L F
Sbjct: 382 EPHRKSYLVDAAPGVKCLGFVSAGTPAT-----NVIGNIMQQNYLWEFDLMASTLSFAPS 436
Query: 443 DC 444
C
Sbjct: 437 AC 438
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 167/374 (44%), Gaps = 40/374 (10%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA--------TTFDPSKSLTYATL 149
++Y IG PP +DTGS ++WV C C+ C T +DP+ S T T+
Sbjct: 83 LYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TV 140
Query: 150 PCDSSYCT-NDCGGYP-------DECWYNIRYTNGPDSQGTIGSE--QFNFETSDEGKTF 199
C+ +C N GG P C + I Y +G + G ++ Q+N + S G+T
Sbjct: 141 GCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYN-QVSGNGQTT 199
Query: 200 LYD--VGFGCSHN---NAHFSDEQFTGVFGLGPATSSTHSLV---EKVGSKFSYCIGNLN 251
+ + FGC + S++ G+ G G + SS S + +V F++C+ +
Sbjct: 200 TSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVR 259
Query: 252 YFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ +G + +TP+ Y V L+GIS+G L + + F D+
Sbjct: 260 ----GGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGDS--- 312
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPA 371
G IDSGTTL +L Y+TL V D +Q LP + +SG+I+ GFP
Sbjct: 313 KGTIIDSGTTLAYLPREVYRTLLAAVFDKYQD-LPLHNYQDFVCFQFSGSID---DGFPV 368
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+ F F G L + + +Q + ++C+ + + KD+ ++G + N V YD
Sbjct: 369 ITFSFKGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVVYD 428
Query: 432 LVSKQLYFQRIDCE 445
L + + + +C
Sbjct: 429 LEKEVIGWTDYNCS 442
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 112/422 (26%), Positives = 170/422 (40%), Gaps = 31/422 (7%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS-TVPVF 99
KL HRD+LL P + + + R +S+K + + L GI +
Sbjct: 52 KLAHRDTLLPKPLSRI----EDVIGADQKRHSLISRKRNSTV-GVKMDLGSGIDYGTAQY 106
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKC--QPCEQCGATTFDPSKSLTYATLPCDSSYCT 157
+ +G P V+DTGS L WV C + + F +S ++ T+ C + C
Sbjct: 107 FTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQTCK 166
Query: 158 ND---------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D C C Y+ RY +G +QG E ++ L GCS
Sbjct: 167 VDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCS 226
Query: 209 HNNAHFSDEQFTGVFGLGPAT-SSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
+ S + GV GL + S T + G+KFSYC+ + + N LI G
Sbjct: 227 SSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRS 286
Query: 268 LE---GDSTPMSV--IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+ +TP+ + I Y + + GISLG MLDI ++ D S G +DSGT+L
Sbjct: 287 TKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVW---DATSGGGTILDSGTSL 343
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
T L +AY+ + + L P C+S ++ P + FH GGA
Sbjct: 344 TLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARF 403
Query: 383 VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
+S + V CL + ++IG I QQNY +DL++ L F
Sbjct: 404 EPHRKSYLVDAAPGVKCLGFVSAGTPAT-----NVIGNIMQQNYLWEFDLMASTLSFAPS 458
Query: 443 DC 444
C
Sbjct: 459 AC 460
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/401 (25%), Positives = 170/401 (42%), Gaps = 48/401 (11%)
Query: 81 KAHDTRAH--------LHPGISTVP----VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
KAHD R L+ G + +P +++ +G PP +DTGS ++WV C
Sbjct: 40 KAHDARRRGRILSAVDLNLGGNGLPTETGLYFTKLGLGSPPKDYYVQVDTGSDILWVNCV 99
Query: 129 PCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGG------YPDECWYNIRYT 174
C +C T +DP S T + CD +C+ G C Y+I Y
Sbjct: 100 KCSRCPRKSDLGIDLTLYDPKGSETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYG 159
Query: 175 NGPDSQG-------TIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP 227
+G + G T N T+ + + ++ G S + S+E G+ G G
Sbjct: 160 DGSATTGYYVQDYLTYNHVNDNLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQ 219
Query: 228 ATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYV 284
+ SS S + KV FS+C+ N+ + +GE + +TP+ Y V
Sbjct: 220 SNSSVLSQLAASGKVKKIFSHCLDNIR----GGGIFAIGEVVEPKVSTTPLVPRMAHYNV 275
Query: 285 TLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL 344
L+ I + +L + ++F D+ + G IDSGTTL +L Y L +V Q
Sbjct: 276 VLKSIEVDTDILQLPSDIF---DSGNGKGTIIDSGTTLAYLPAIVYDELIPKVMAR-QPR 331
Query: 345 LPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVG 403
L Y ++ + Y+GN++R GFP + HF L + +Q ++C+
Sbjct: 332 LKLYLVEQQFSCFQYTGNVDR---GFPVVKLHFEDSLSLTVYPHDYLFQFKDGIWCIGWQ 388
Query: 404 PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S + KD++++G + N V YDL + + + +C
Sbjct: 389 KSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMAIGWTDYNC 429
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 173/388 (44%), Gaps = 61/388 (15%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C C C T+ FD S S T
Sbjct: 63 VGLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAG 122
Query: 148 TLPCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
+ C CT+ C ++C Y +Y +G + G S+ F+ + G++ +
Sbjct: 123 LVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFD-AILGESLV 181
Query: 201 YD----VGFGCS---HNNAHFSDEQFTGVFGLGP------ATSSTHSLVEKVGSKFSYCI 247
+ + FGCS + +D+ G+FG G + STH + +V FS+C+
Sbjct: 182 VNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRV---FSHCL 238
Query: 248 -GNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKN 306
G IL G + +P+ Y + L+ I++ K+L IDP++F +
Sbjct: 239 KGEGIGGGILVLGEILEPGMVY----SPLVPSQPHYNLNLQSIAVNGKLLPIDPSVFATS 294
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY-PMDPAWHLCYSGNINRD 365
++ G +DSGTTL +LV AY V + PS P+ + CY + +
Sbjct: 295 NS---QGTIVDSGTTLAYLVAEAYDPFVSAVNVIVS---PSVTPIISKGNQCYLVSTSVS 348
Query: 366 LQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSD-------INGERFKDLSII 418
Q FP +F+FAGGA +VL E + + GPS I ++ + ++I+
Sbjct: 349 -QMFPLASFNFAGGASMVLKPED---------YLIPFGPSQGGSVMWCIGFQKVQGVTIL 398
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
G + ++ YDLV +++ + DC L
Sbjct: 399 GDLVLKDKIFVYDLVRQRIGWANYDCSL 426
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 111/406 (27%), Positives = 164/406 (40%), Gaps = 46/406 (11%)
Query: 59 QAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIST-VPVFYVNFSIGQPPVPQLAVLD 117
Q Q ++ AR +S + T+ GI+ + V +G P V D
Sbjct: 94 QDQLRVDSIQARLSKISGHGIFEEMVTKLPAQSGIAIGTGNYVVTVGLGTPKEDFTLVFD 153
Query: 118 TGSSLIWVKCQPC--------EQCGATTFDPSKSLTYATLPCDSSYCT------NDCGGY 163
TGS + W +CQPC EQ FDP+KS +Y + C S+ C C
Sbjct: 154 TGSGITWTQCQPCLGSCYPQKEQ----KFDPTKSTSYNNVSCSSASCNLLPTSERGCSAS 209
Query: 164 PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVF 223
C Y I Y + SQG +E +SD FL FGC +N + +
Sbjct: 210 NSTCLYQIIYGDQSYSQGFFATETLTISSSDVFTNFL----FGCGQSNNGLFGQAAGLLG 265
Query: 224 GLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYY 283
+ S EK +FSYC+ + + L G TP+S S+Y
Sbjct: 266 LSSSSVSLPSQTAEKYQKQFSYCLPST---PSSTGYLNFGGKVSQTAGFTPISPAFSSFY 322
Query: 284 -VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
+ + GIS+ L IDP++F + +G IDSGT +T L P+AY+ L+ + F
Sbjct: 323 GIDIVGISVAGSQLPIDPSIF------TTSGAIIDSGTVITRLPPTAYKALK----EAFD 372
Query: 343 GLLPSYPM---DPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY-QESSSVF 398
+ +YP D CY + N FP ++ F GG ++ +DA + Y +
Sbjct: 373 EKMSNYPKTNGDELLDTCYDFS-NYTTVSFPKVSVSFKGGVEVDIDASGILYLVNGVKMV 431
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
CLA + + E I G Q+ Y V YD + F C
Sbjct: 432 CLAFAANKDDSE----FGIFGNHQQKTYEVVYDGAKGMIGFAAGAC 473
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 158/358 (44%), Gaps = 60/358 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFDPSKSLTYATLPCDS 153
+ V S+G P + Q +DTGS L WV+C+PC FDP++S +YA +PC
Sbjct: 137 YVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGR 196
Query: 154 SYCTNDCGGYPD-----ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
S C G Y +C Y + Y +G ++ G S+ + + FL FGC
Sbjct: 197 SACAG-LGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAANATVQGFL----FGCG 251
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGE 264
H S FTG+ GL SLV++ G FSYC+ + + + G
Sbjct: 252 HAQ---SGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTKS--STTGYLTLGGP 306
Query: 265 GAILEGDST----PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
+ G ST P Y V L GIS+G + L + + F AG +D+GT
Sbjct: 307 SGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFA-------AGTVVDTGT 359
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYS----GNINRDLQGFPAMA 373
+T L P+AY LR F+ + SYP P + CYS G +N ++A
Sbjct: 360 VITRLPPAAYAALRSA----FRSGMASYPSAPPIGILDTCYSFAGYGTVN-----LTSVA 410
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
F+ GA + L A+ + S CLA S +G ++I+G + Q+++ V D
Sbjct: 411 LTFSSGATMTLGADGIM-----SFGCLAFASSGSDGS----MAILGNVQQRSFEVRID 459
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 167/374 (44%), Gaps = 40/374 (10%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA--------TTFDPSKSLTYATL 149
++Y IG PP +DTGS ++WV C C+ C T +DP+ S T T+
Sbjct: 83 LYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TV 140
Query: 150 PCDSSYCT-NDCGGYP-------DECWYNIRYTNGPDSQGTIGSE--QFNFETSDEGKTF 199
C+ +C N GG P C + I Y +G + G ++ Q+N + S G+T
Sbjct: 141 GCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYN-QVSGNGQTT 199
Query: 200 LYD--VGFGCSHN---NAHFSDEQFTGVFGLGPATSSTHSLV---EKVGSKFSYCIGNLN 251
+ + FGC + S++ G+ G G + SS S + +V F++C+ +
Sbjct: 200 TSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVR 259
Query: 252 YFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ +G + +TP+ Y V L+GIS+G L + + F D+
Sbjct: 260 ----GGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGDS--- 312
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPA 371
G IDSGTTL +L Y+TL V D +Q LP + +SG+I+ GFP
Sbjct: 313 KGTIIDSGTTLAYLPREVYRTLLAAVFDKYQD-LPLHNYQDFVCFQFSGSID---DGFPV 368
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+ F F G L + + +Q + ++C+ + + KD+ ++G + N V YD
Sbjct: 369 ITFSFEGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVVYD 428
Query: 432 LVSKQLYFQRIDCE 445
L + + + +C
Sbjct: 429 LEKEVIGWTDYNCS 442
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 116/413 (28%), Positives = 181/413 (43%), Gaps = 58/413 (14%)
Query: 71 FIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC 130
F +QK Q + D + H TV ++G PP VLDTGS L W+ C+
Sbjct: 42 FSLKTQKLPQSSSDKLSFRHNVTLTV-----TLAVGDPPQNISMVLDTGSELSWLHCKKS 96
Query: 131 EQCGATTFDPSKSLTYATLPCDSSYC---TND------CGGYPDECWYNIRYTNGPDSQG 181
G+ F+P S TY+ +PC S C T D C C I Y + +G
Sbjct: 97 PNLGS-VFNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEG 155
Query: 182 TIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG- 240
+ E F + T + G S N+ D + TG+ G+ + S V ++G
Sbjct: 156 NLAHETFVIGSVTRPGTLFGCMDSGLSSNSEE--DAKSTGLMGM---NRGSLSFVNQLGF 210
Query: 241 SKFSYCIGNLNYFEYAYNMLILGEGAI----------LEGDSTPMSVIDG-SYYVTLEGI 289
SKFSYCI + + L+LG+ + L STP+ D +Y V LEGI
Sbjct: 211 SKFSYCISGSDSSVF----LLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGI 266
Query: 290 SLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL---- 345
+G K+L + ++F + T + +DSGT T+L+ Y L+ E + +L
Sbjct: 267 RVGSKILSLPKSVFVPDHTGA-GQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVD 325
Query: 346 -PSYPMDPAWHLCYS-GNINR-DLQGFPAMAFHFAGGADLVLDAESVFYQESSS------ 396
P + LCY G+ R + G P ++ F GA++ + + + Y+ + +
Sbjct: 326 DPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFR-GAEMSVSGQKLLYRVNGAGSEGKE 384
Query: 397 -VFCLAVGPSDING-ERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ-RIDCEL 446
V+C G SD+ G E F +IG QQN + +DL ++ F + C+L
Sbjct: 385 EVYCFTFGNSDLLGIEAF----VIGHHHQQNVWMEFDLAKSRVGFAGNVRCDL 433
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/402 (27%), Positives = 177/402 (44%), Gaps = 53/402 (13%)
Query: 65 NMSMARFIYLSQKSSQKAHDTRAHLHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLI 123
N+S A +S + + D A L G + ++ IG+P VLDTGS +
Sbjct: 113 NISKADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVN 172
Query: 124 WVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNG 176
W++C PC C T F+PS S +Y L CD+ C ++C C Y + Y +G
Sbjct: 173 WLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNA--TCLYEVSYGDG 230
Query: 177 PDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLV 236
+ G +E G T + +V GC H+N G+F +
Sbjct: 231 SYTVGDFATETLTI-----GSTLVQNVAVGCGHSNE--------GLFVGAAGLLGLGGGL 277
Query: 237 EKVGSK-----FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEG 288
+ S+ FSYC+ ++ + + + G + P+ +D YY+ L G
Sbjct: 278 LALPSQLNTTSFSYCL--VDRDSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTG 335
Query: 289 ISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY 348
IS+G ++L I + F+ +++ S G+ IDSGT +T L Y +LR + +G L
Sbjct: 336 ISVGGELLQIPQSSFEMDESGS-GGIIIDSGTAVTRLQTEIYNSLR---DSFVKGTL--- 388
Query: 349 PMDPA-----WHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAV 402
++ A + CY+ + ++ P +AFHF GG L L A++ +S FCLA
Sbjct: 389 DLEKAAGVAMFDTCYNLSAKTTVE-VPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAF 447
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
P+ L+IIG + QQ V +DL + + F C
Sbjct: 448 APTA------SSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 169/363 (46%), Gaps = 60/363 (16%)
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC-----------TNDC 160
++DT S L WV+C PC C FDP+ S +YA LPC+SS C
Sbjct: 140 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 199
Query: 161 GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQF 219
GG C Y + Y +G SQG + ++ + + FGC + N F
Sbjct: 200 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSL-----AGEVIDGFVFGCGTSNQGPFGGT-- 252
Query: 220 TGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV- 277
+G+ GLG + S S +++ G FSYC+ L E + L+LG+ + +STP+
Sbjct: 253 SGLMGLGRSQLSLISQTMDQFGGVFSYCL-PLKESE-SSGSLVLGDDTSVYRNSTPIVYT 310
Query: 278 ------IDGS-YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG-VFIDSGTTLTWLVPSA 329
+ G Y+V L GI++G + ++ S AG V +DSGT +T LVPS
Sbjct: 311 TMVSDPVQGPFYFVNLTGITIGGQEVE------------SSAGKVIVDSGTIITSLVPSV 358
Query: 330 YQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDA 386
Y ++ E F YP P + + C++ R++Q P++ F F G ++ +D+
Sbjct: 359 YNAVKAE----FLSQFAEYPQAPGFSILDTCFNLTGFREVQ-IPSLKFVFEGNVEVEVDS 413
Query: 387 ESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
V Y SS CLA+ + + E + SIIG Q+N V +D + Q+ F + C
Sbjct: 414 SGVLYFVSSDSSQVCLAL--ASLKSE--YETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 469
Query: 445 ELL 447
+ +
Sbjct: 470 DYI 472
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 165/379 (43%), Gaps = 51/379 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V+ +IG PP P +LDTGS L W +C PC C F+PS+S+T++ LPCD
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRI 170
Query: 156 CTN----DCGGYP---DECWYNIRYTNGPDSQGTIGSEQFNFETSDE--GKTFLYDVGFG 206
C + CG C Y Y + + G + S+ F+F ++D G + D+ FG
Sbjct: 171 CRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFG 230
Query: 207 CS-HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNML----- 260
C NN F + TG+ G S + ++ FSYC + E + L
Sbjct: 231 CGLFNNGIFVSNE-TGIAGFSRGALSMPAQLKV--DNFSYCFTAITGSEPSPVFLGVPPN 287
Query: 261 -----------ILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
++ A++ S+ + +YY++L+G+++G L I ++F +
Sbjct: 288 LYSDAAGGGHGVVQSTALIRYHSSQLK----AYYISLKGVTVGTTRLPIPESVFALKEDG 343
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
+ G +DSGT +T L P A L + L LC+S
Sbjct: 344 T-GGTIVDSGTGMTML-PEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAK-PDV 400
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSS----VFCLAVGPSDINGERFKDLSIIGMIAQQN 425
PA+ HF GA L L E+ ++ + + CLA+ + DLS+IG QQN
Sbjct: 401 PALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAINAGE-------DLSVIGNFQQQN 452
Query: 426 YNVAYDLVSKQLYFQRIDC 444
+V YDL + L F C
Sbjct: 453 MHVLYDLANDMLSFVPARC 471
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 119/458 (25%), Positives = 187/458 (40%), Gaps = 59/458 (12%)
Query: 24 TSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQK------ 77
T +A GK + T + L + + +R L + R L K
Sbjct: 3 TDDSALKNLGKGRESTTLEMKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTS 62
Query: 78 --SSQKAHDTRAHLHPGISTVPVFY-VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC- 133
+ Q +T+ L GI + Y V +G + ++DTGS L WV+CQPC C
Sbjct: 63 STTEQSVSETQIPLTSGIKLESLNYIVTVELGGKNMS--LIVDTGSDLTWVQCQPCRSCY 120
Query: 134 --GATTFDPSKSLTYATLPCDSSYC---------TNDCGG----YPDECWYNIRYTNGPD 178
+DPS S +Y T+ C+SS C + CGG C Y + Y +G
Sbjct: 121 NQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSY 180
Query: 179 SQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEK 238
++G + SE G T L + FGC NN + + S ++
Sbjct: 181 TRGDLASESILL-----GDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKT 235
Query: 239 VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV--------IDGSYYVTLEGIS 290
FSYC+ +L + A L G + + +ST +S + Y + L G S
Sbjct: 236 FNGVFSYCLPSLE--DGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGAS 293
Query: 291 LGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM 350
+G +++ + F + G+ IDSGT +T L PS Y+ ++ E F G P+ P
Sbjct: 294 IGG--VELKSSSFGR-------GILIDSGTVITRLPPSIYKAVKIEFLKQFSG-FPTAPG 343
Query: 351 DPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDIN 408
C++ D+ P + F G A+L +D VFY + +S+ CLA+
Sbjct: 344 YSILDTCFNLTSYEDIS-IPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYE 402
Query: 409 GERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
E + IIG Q+N V YD ++L +C +
Sbjct: 403 NE----VGIIGNYQQKNQRVIYDTTQERLGIVGENCRV 436
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 108/364 (29%), Positives = 151/364 (41%), Gaps = 57/364 (15%)
Query: 107 QPPVPQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYC----- 156
+P V QL +LDT S + WV+C PC QC A T +DPSKS + + C S C
Sbjct: 177 RPGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGP 236
Query: 157 -TNDCGGYPD---ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-NN 211
N C + +C Y +RY +G + GT+ ++Q + + + F FGCSH
Sbjct: 237 YANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFE----FGCSHAAR 292
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVE-KVGSKFSYCI----GNLNYFEYAYNMLILGEGA 266
FS + G+ LG S S K G FSYC + +F A
Sbjct: 293 GSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHKGFFVLGVPRRSSSRYA 352
Query: 267 ILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
+ TPM Y V LE I++ + LD+ P +F AG +DS T +T L
Sbjct: 353 VTPMLKTPM-----LYQVRLEAIAVAGQRLDVPPTVFA-------AGAALDSRTVITRLP 400
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG-----FPAMAFHF-AGGA 380
P+AYQ LR D P+ + CY D G P ++ F GA
Sbjct: 401 PTAYQALRSAFRDKMSMYRPA-AANGQLDTCY------DFTGVSSIMLPTISLVFDRTGA 453
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ LD V + CLA S +R IIG + Q V Y++ + F+
Sbjct: 454 GVQLDPSGVLFGS-----CLAFA-STAGDDRAT--GIIGFLQLQTIEVLYNVAGGSVGFR 505
Query: 441 RIDC 444
R C
Sbjct: 506 RGAC 509
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 160/379 (42%), Gaps = 40/379 (10%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSS 154
+F + IG A++DTGS + V QCG+ + FDP+ S +Y +PC S
Sbjct: 99 LFSMQLGIGSLQKNLSAIIDTGSEAVLV------QCGSRSRPVFDPAASQSYRQVPCISQ 152
Query: 155 YC-----------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQ-FNFETSDEGKTFLY- 201
C + C C Y++ Y + +S G + F T+ G+ +
Sbjct: 153 LCLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFR 212
Query: 202 DVGFGCSHNNAHF-SDEQFTGVFGLGPATSSTHSLVEKV--GSKFSYCIGNLNYFEYAYN 258
DV FGC+H+ F D G+ G S S ++ GSKFSYC + + A
Sbjct: 213 DVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATG 272
Query: 259 MLILGEGAILEGDSTPMSVIDGS--------YYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
++ LG+ + + ++D YYV L IS+ K L I + FK + +
Sbjct: 273 VIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTG 332
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKE-VEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
D G +DSGTT T +V AY R GL + CY+ + L G
Sbjct: 333 DGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGV 392
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSS----VFCLAVGPSDINGERFKDLSIIGMIAQQN 425
P + L L E +F S++ CLA+ S +G F ++++G Q N
Sbjct: 393 PEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSG--FGKINVLGNYQQSN 450
Query: 426 YNVAYDLVSKQLYFQRIDC 444
Y V YD ++ F+R DC
Sbjct: 451 YLVEYDNERSRVGFERADC 469
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 153/376 (40%), Gaps = 41/376 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++++ +G PP +LDTGS L W++C PC C +DP S ++ + C+
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPR 219
Query: 156 CT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLYDVG-- 204
C+ C C Y Y + ++ G E F T+ EG + Y VG
Sbjct: 220 CSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNM 279
Query: 205 -FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC H N + S + L G FSYC+ + N + LI G
Sbjct: 280 MFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSKLIFG 339
Query: 264 EGAILEGDSTP--MSVIDGS-------YYVTLEGISLGEKMLDIDPNLFKKNDTWS---- 310
E L + S ++G YY+ ++ I +G K LDI +TW+
Sbjct: 340 EDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDI------PEETWNISSD 393
Query: 311 -DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNINRDLQG 368
D G IDSGTTL++ AY+ ++ + + + P + P C++ I +
Sbjct: 394 GDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIH 453
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
P + F G AE+ F S + CLA I G SIIG QQN+++
Sbjct: 454 LPELGIAFVDGTVWNFPAENSFIWLSEDLVCLA-----ILGTPKSTFSIIGNYQQQNFHI 508
Query: 429 AYDLVSKQLYFQRIDC 444
YD +L F C
Sbjct: 509 LYDTKRSRLGFTPTKC 524
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 168/380 (44%), Gaps = 53/380 (13%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQ-----PCEQCGATTFDPSKSLTYATLPCDSSY 155
++ IG P Q VLDTGS L W++C T+FDPS S +++ LPC
Sbjct: 82 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 141
Query: 156 CTNDCGGY--PDE------CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C + P C Y+ Y +G ++G + E+F F S + GC
Sbjct: 142 CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLI----LGC 197
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI------------------GN 249
+ + +DE+ G+ G+ S S + SKFSYCI N
Sbjct: 198 AKES---TDEK--GILGMNLGRLSFISQAKI--SKFSYCIPTRSNRPGLASTGSFYLGDN 250
Query: 250 LNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
N + Y L+ + + P+ +Y V L+GI +G+K L+I ++F+ D
Sbjct: 251 PNSRGFKYVSLLTFPQSQRMPNLDPL-----AYTVPLQGIRIGQKRLNIPGSVFRP-DAG 304
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-YPMDPAWHLCYSGNINRDLQG 368
+DSG+ T LV AY +++E+ L L Y +C+ GN + ++
Sbjct: 305 GSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGR 364
Query: 369 FPA-MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
+ F F G +++++ +S+ + C+ +G S + G +IIG + QQN
Sbjct: 365 LIGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGA---ASNIIGNVHQQNLW 421
Query: 428 VAYDLVSKQLYFQRIDCELL 447
V +D+ ++++ F + +C LL
Sbjct: 422 VEFDVTNRRVGFSKAECRLL 441
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 161/366 (43%), Gaps = 52/366 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ + +G P Q ++DTGS + WV+C+PC QC + FDPS S TY+ C S+
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAD 257
Query: 156 CT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C N C +C Y + Y +G + GT S+ G + + FGCS+
Sbjct: 258 CAQLGQEGNGCSSS-SQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVRSFQFGCSN 311
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEK----VGSKFSYCIGNLNYFEYAYNMLILGEG 265
+ F+D Q G+ GLG SLV + +G FSYC+ + G
Sbjct: 312 VESGFND-QTDGLMGLG---GGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 367
Query: 266 AILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
TPM S + Y V L+ I +G + L I ++F AG +DSGT +
Sbjct: 368 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF-------SAGTVMDSGTVI 420
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYP---MDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
T L P+AY L + + P+ P +D + +++ P++A F+GG
Sbjct: 421 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVS-----IPSVALVFSGG 475
Query: 380 ADLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
A + LDA + CLA G SD + L IIG + Q+ + V YD+ +
Sbjct: 476 AVVSLDASGIILSN-----CLAFAGNSDDS-----SLGIIGNVQQRTFEVLYDVGRGVVG 525
Query: 439 FQRIDC 444
F+ C
Sbjct: 526 FRAGAC 531
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 163/369 (44%), Gaps = 52/369 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTN 158
++ +G P V+DTGS W+ C S ++ + C S C
Sbjct: 113 YFAEVKVGSPGQRFWLVVDTGSEFTWLNC---------------SKSFEAVTCASRKCKV 157
Query: 159 D---------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
D C D C Y+I Y +G ++G G++ ++ + L ++ GC+
Sbjct: 158 DLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTIGCTK 217
Query: 210 ---NNAHFSDEQFTGVFGLGPATSS-THSLVEKVGSKFSYC-IGNLNYFEYAYNMLILGE 264
N +F++E G+ GLG A S K G+KFSYC + +L++ + N+ I G
Sbjct: 218 SMLNGVNFNEET-GGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSSNLTIGGH 276
Query: 265 -GAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
A L G+ T + + Y V + GIS+G +ML I P ++ N ++ G IDSGTT
Sbjct: 277 HNAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFN---AEGGTLIDSGTT 333
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMD-PAWHLCYSGNINRDLQGF-----PAMAFH 375
LT L+ AY+ + + + + D A C+ D +GF P + FH
Sbjct: 334 LTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCF------DAEGFDDSVVPRLVFH 387
Query: 376 FAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
FAGGA +S + V C+ + P D G S+IG I QQN+ +DL +
Sbjct: 388 FAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIG----GASVIGNIMQQNHLWEFDLSTN 443
Query: 436 QLYFQRIDC 444
+ F C
Sbjct: 444 TVGFAPSTC 452
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 155/365 (42%), Gaps = 41/365 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQ-C---GATTFDPSKSLTYATLPCDSS 154
+ V+ +G P + DTGS L W +CQPC + C F PS+S TY+ + C S
Sbjct: 131 YIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSP 190
Query: 155 YCTNDCGGYPDE--------CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
C+ G ++ C Y I+Y + S G E ++D + FL FG
Sbjct: 191 DCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSTDVIENFL----FG 246
Query: 207 CSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
C NN G+ GLG S +K G FSYC+ + G G
Sbjct: 247 CGQNNRGLFGSA-AGLIGLGQDKISIVKQTAQKYGQVFSYCLPKTSSSTGYLTFGGGGGG 305
Query: 266 AILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
L+ TP++ G Y V + G+ +G + I ++F S +G IDSGT +
Sbjct: 306 GALK--YTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVF------STSGAIIDSGTVI 357
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGG 379
T L P AY L+ F+ + YP P + CY + +Q P + F F GG
Sbjct: 358 TRLPPDAYSALKSA----FEKGMAKYPKAPELSILDTCYDLSKYSTIQ-IPKVGFVFKGG 412
Query: 380 ADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
+L LD + Y S+S CLA + ++IIG + Q+ V YD+ ++ F
Sbjct: 413 EELDLDGIGIMYGASTSQVCLAFA----GNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGF 468
Query: 440 QRIDC 444
C
Sbjct: 469 GYNGC 473
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 100/366 (27%), Positives = 154/366 (42%), Gaps = 41/366 (11%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QCGAT---TFDPSKSLTYATLPCD 152
P F V G P + DTGS L W++CQPC C FDP+KS +YA +PC
Sbjct: 110 PEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYAVVPCG 169
Query: 153 SSYCT---NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
++ C +C G C Y + Y +G + G + E F +S E F+ FGC
Sbjct: 170 TTECAAAGGECNG--TTCVYGVEYGDGSSTTGVLARETLTFSSSSEFTGFI----FGCGE 223
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
N E + + S + G FSYC+ + N L +G +
Sbjct: 224 TNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTTP---GYLSIGATPVTG 280
Query: 270 GDSTPMSVIDGS------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ + Y++ L I++G +L + P+ F K G +DSGT LT
Sbjct: 281 QIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKT------GTLLDSGTILT 334
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
+L P AY LR + QG P+ P D CY + P ++F+F+ GA
Sbjct: 335 YLPPPAYTALRDRFKFTMQGSKPAPPYD-ELDTCYDFTGQSGIL-IPGVSFNFSDGAVFN 392
Query: 384 LDAESVFY---QESSSVFCLAV--GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
L+ + +V CLA P+D+ S++G Q++ V YD+ ++++
Sbjct: 393 LNFFGIMTFPDDTKPAVGCLAFVSRPADM------PFSVVGSTTQRSAEVIYDVPAQKIG 446
Query: 439 FQRIDC 444
F C
Sbjct: 447 FIPASC 452
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 111/382 (29%), Positives = 159/382 (41%), Gaps = 58/382 (15%)
Query: 89 LHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSK 142
L PG S V + +G P V+DTGSSL W++C PC Q G FDP
Sbjct: 123 LSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGP-LFDPRA 181
Query: 143 SLTYATLPCDSSYC---------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETS 193
S TYA++ C +S C + C + C Y Y + S G++ ++ +F
Sbjct: 182 SSTYASVRCSASQCDELQAATLNPSACSAS-NVCIYQASYGDSSFSVGSLSTDTVSF--- 237
Query: 194 DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCIGNLNY 252
G T +GC +N G+ GL S + L +G FSYC+
Sbjct: 238 --GSTRYPSFYYGCGQDNEGLFGRS-AGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAAS 294
Query: 253 FEYAYNMLILGEGAILEG---DSTPM--SVIDGS-YYVTLEGISLGEKMLDIDPNLFKKN 306
Y L G G TPM S +D S Y++TL G+S+G L + P+ +
Sbjct: 295 TGY------LSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSL 348
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNIN 363
T IDSGT +T L + + L K V G PA+ + C+ G +
Sbjct: 349 PT------IIDSGTVITRLPTAVHTALSKAVAQAMAGAQ----RAPAFSILDTCFEGQAS 398
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
+ P +A FAGGA + L +V S CLA P+D +IIG Q
Sbjct: 399 Q--LRVPTVAMAFAGGASMKLTTRNVLIDVDDSTTCLAFAPTD-------STAIIGNTQQ 449
Query: 424 QNYNVAYDLVSKQLYFQRIDCE 445
Q ++V YD+ ++ F C
Sbjct: 450 QTFSVIYDVAQSRIGFSAGGCS 471
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 154/360 (42%), Gaps = 36/360 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P V+DTGS + W++C PC C F+PS S ++ L C SS
Sbjct: 16 YFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPSSSSSFKVLDCSSSL 75
Query: 156 CTN--DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETS-DEGKTFLYDVGFGCSHNNA 212
C N G ++C Y Y +G + G + ++ + + G+ L ++ GC H+N
Sbjct: 76 CLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLTNIPLGCGHDNE 135
Query: 213 HFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
+ G+ GLG S ++L + FSYC+ + + L+ G+ AI
Sbjct: 136 G-TFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHKSTLVFGDAAIPHTA 194
Query: 272 STPMSVI--------DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ + I YYV + GIS+G +L P + D+ + G DSGTT+T
Sbjct: 195 TGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGNGGTIFDSGTTIT 254
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFHFAG 378
L AY +R L + + CY D G P + FHF G
Sbjct: 255 RLEARAYTAVRDAFRAATMHLTSAADFK-IFDTCY------DFTGMNSISVPTVTFHFQG 307
Query: 379 GADLVLDAESVFYQES-SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
D+ L + S +++FC A S S+IG + QQ++ V YD V KQ+
Sbjct: 308 DVDMRLPPSNYIVPVSNNNIFCFAFAAS-------MGPSVIGNVQQQSFRVIYDNVHKQI 360
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 143/361 (39%), Gaps = 33/361 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------TFDPSKSLTYATLPCD 152
F V +G P P + DTGS L WV+CQPC G FDPSKS TYA + C
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 203
Query: 153 SSYCT---NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C + C C Y +RY +G + G + + +S L FGC
Sbjct: 204 EPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSRA----LTGFPFGCGT 259
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
N + S G+ FSYC+ + N L +G +
Sbjct: 260 RNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSN---STTGYLTIGATPATD 316
Query: 270 GDSTPMSVI------DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ + + Y+V L I +G +L + P +F + G +DSGT LT
Sbjct: 317 TGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTRG------GTLLDSGTVLT 370
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
+L AY LR + P+ P D CY ++ PA++F F GA
Sbjct: 371 YLPAQAYALLRDRFRLTMERYTPAPPND-VLDACYDFAGESEVV-VPAVSFRFGDGAVFE 428
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
LD V +V CLA D G LSIIG Q++ V YD+ ++++ F
Sbjct: 429 LDFFGVMIFLDENVGCLAFAAMDTGG---LPLSIIGNTQQRSAEVIYDVAAEKIGFVPAS 485
Query: 444 C 444
C
Sbjct: 486 C 486
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 169/385 (43%), Gaps = 47/385 (12%)
Query: 85 TRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPS 141
R LH + T + IG PP ++D+GS++ +V C CEQCG F P
Sbjct: 71 ARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPD 130
Query: 142 KSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
S TY+ + C S+ CT C +C Y +Y S G +G + +F T E K
Sbjct: 131 LSSTYSPVKC-SADCT--CDSDKSQCTYERQYAEMSSSSGVLGEDIVSFGTESELKP--Q 185
Query: 202 DVGFGCSHNN-AHFSDEQFTGVFGLGPAT-SSTHSLVEK--VGSKFSYCIGNLNYFEYAY 257
FGC ++ + G+ GLG S LV+K +G FS C G ++
Sbjct: 186 RAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMD------ 239
Query: 258 NMLILGEGAILEG--DSTPMSVIDGS-------YYVTLEGISLGEKMLDIDPNLFKKNDT 308
+G GA++ G + P V S Y + L+ I + K L +DP +F
Sbjct: 240 ----IGGGAMVLGAMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFD---- 291
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSG---NINR 364
S G +DSGTT +L A+ + V + L DP + +C++G N+++
Sbjct: 292 -SKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQ 350
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIA 422
Q FP + F G L L E+ ++ S +CL V NG+ +++G I
Sbjct: 351 LSQAFPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQ---NGK--DPTTLLGGIV 405
Query: 423 QQNYNVAYDLVSKQLYFQRIDCELL 447
+N V YD ++++ F + +C L
Sbjct: 406 VRNTLVTYDRHNEKIGFWKTNCSEL 430
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 165/377 (43%), Gaps = 49/377 (12%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKC--QPCEQCGATTFDPSKSLTYATLPCDSSYCTN 158
++ IG PP Q VLDTGS L W++C + T+FDPS S +++TLPC C
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDPSLSSSFSTLPCSHPLCKP 133
Query: 159 DCGGY--PDE------CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
+ P C Y+ Y +G ++G + E+ F ++ + GC+
Sbjct: 134 RIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLI----LGCATE 189
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI------------------GNLNY 252
+ SD++ G+ G+ S S + SKFSYCI N N
Sbjct: 190 S---SDDR--GILGMNRGRLSFVSQAKI--SKFSYCIPPKSNRPGFTPTGSFYLGDNPNS 242
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
+ Y L+ + + P+ +Y V + GI G K L+I ++F+ D
Sbjct: 243 HGFKYVSLLTFPESQRMPNLDPL-----AYTVPMIGIRFGLKKLNISGSVFRP-DAGGSG 296
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEV-EDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPA 371
+DSG+ T LV +AY +R E+ + + L Y +C+ GN+ +
Sbjct: 297 QTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGD 356
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+ F F G ++++ E V + C+ +G S + G +IIG + QQN V +D
Sbjct: 357 LVFVFTRGVEILVPKERVLVNVGGGIHCVGIGRSSMLGA---ASNIIGNVHQQNLWVEFD 413
Query: 432 LVSKQLYFQRIDCELLA 448
+ ++++ F + DC +
Sbjct: 414 VTNRRVGFAKADCSRVV 430
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 113/430 (26%), Positives = 180/430 (41%), Gaps = 43/430 (10%)
Query: 38 LVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVP 97
L +L+HRDS N + D A+R L M R ++ K++ A + G T
Sbjct: 66 LQVRLVHRDSFAVNAS-AADLLARR-LQRDMRRAAWIITKAATPADPENGTVVTGAPTSG 123
Query: 98 VFYVNFSIGQP-----PVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATL 149
+ ++G P L D GS + W++C PC +C ++ KS + + +
Sbjct: 124 EYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDV 183
Query: 150 PCDSSYC-----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
C + C + C + +EC Y + Y +G S G G E F + V
Sbjct: 184 GCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFPPGVR----VPGVA 239
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
GC +N G+ GLG + S S + + G FSYC+ + + L G
Sbjct: 240 IGCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGRS-STLTFG 298
Query: 264 EGAILEGDSTPM---------SVIDGSYYVTLEGISLGE-KMLDIDPNLFKKNDTWSDAG 313
GA +T S + YYV L GIS+G ++ + + + + + G
Sbjct: 299 SGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGG 358
Query: 314 VFIDSGTTLTWLVPSAYQTLRK--EVEDLFQGLLPSYPMDPA--WHLCYSGNINRDLQGF 369
V +DSGT +T L AY R V + + PS P P + CYS R ++
Sbjct: 359 VIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPS-PGGPFAFFDTCYSSVRGRVMKKV 417
Query: 370 PAMAFHFAGGADLVLDAES--VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
PA++ HFAGG ++ L ++ + + C A S G +SIIG I Q +
Sbjct: 418 PAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRG-----VSIIGNIQLQGFR 472
Query: 428 VAYDLVSKQL 437
V YD+ +++
Sbjct: 473 VVYDVDGQRV 482
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 161/366 (43%), Gaps = 52/366 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ + +G P Q ++DTGS + WV+C+PC QC + FDPS S TY+ C S+
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAD 187
Query: 156 CT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C N C +C Y + Y +G + GT S+ G + + FGCS+
Sbjct: 188 CAQLGQEGNGCSSS-SQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVRSFQFGCSN 241
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEK----VGSKFSYCIGNLNYFEYAYNMLILGEG 265
+ F+D Q G+ GLG SLV + +G FSYC+ + G
Sbjct: 242 VESGFND-QTDGLMGLG---GGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 297
Query: 266 AILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
TPM S + Y V L+ I +G + L I ++F AG +DSGT +
Sbjct: 298 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS-------AGTVMDSGTVI 350
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYP---MDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
T L P+AY L + + P+ P +D + +++ P++A F+GG
Sbjct: 351 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVS-----IPSVALVFSGG 405
Query: 380 ADLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
A + LDA + CLA G SD + L IIG + Q+ + V YD+ +
Sbjct: 406 AVVSLDASGIILSN-----CLAFAGNSDDS-----SLGIIGNVQQRTFEVLYDVGRGVVG 455
Query: 439 FQRIDC 444
F+ C
Sbjct: 456 FRAGAC 461
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 110/365 (30%), Positives = 169/365 (46%), Gaps = 44/365 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-------QCGATTFDPSKSLTYATLPC 151
++ +GQP V DTGS + W++CQPC+ Q G FDP S +Y+ L C
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGP-IFDPKSSSSYSPLSC 242
Query: 152 DSSYC--TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
DS C ++ + C Y + Y +G + G + +E F+F S+ + ++ GC H
Sbjct: 243 DSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS----IPNLPIGCGH 298
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
+N F G GL SL ++ + FSYC+ +L+ + + L A
Sbjct: 299 DNEGL----FVGADGLIGLGGGAISLSSQLEATSFSYCLVDLD----SESSSTLDFNADQ 350
Query: 269 EGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
DS ++ YV + G+S+G K L I + F+ +++ S G+ +DSGTT+T
Sbjct: 351 PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGS-GGIIVDSGTTIT 409
Query: 324 WLVPSAYQTLRKEVEDLFQGL---LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
+ Y LR D F GL LP P + CY + +++ P +AF G
Sbjct: 410 EIPSDVYDVLR----DAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVE-VPTIAFILPGEN 464
Query: 381 DLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
L L A++ Q +S+ FCLA PS LSIIG + QQ V+YDL + + F
Sbjct: 465 SLQLPAKNCLIQVDSAGTFCLAFLPSTF------PLSIIGNVQQQGIRVSYDLANSLVGF 518
Query: 440 QRIDC 444
C
Sbjct: 519 STDKC 523
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 103/365 (28%), Positives = 159/365 (43%), Gaps = 50/365 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ + +G P Q ++DTGS + WV+C+PC QC + FDPS S TY+ C S+
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAA 187
Query: 156 CT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C N C +C Y + Y +G + GT S+ G + + FGCS+
Sbjct: 188 CAQLGQEGNGCSSS-SQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVKSFQFGCSN 241
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEK----VGSKFSYCIGNLNYFEYAYNMLILGEG 265
+ F+D Q G+ GLG SLV + +G FSYC+ + G
Sbjct: 242 VESGFND-QTDGLMGLG---GGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 297
Query: 266 AILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
TPM S + Y V L+ I +G + L I ++F AG +DSGT +
Sbjct: 298 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS-------AGTVMDSGTVI 350
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYP---MDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
T L P+AY L + + P+ P +D + +++ P++A F+GG
Sbjct: 351 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVS-----IPSVALVFSGG 405
Query: 380 ADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
A + LDA + CLA + + L IIG + Q+ + V YD+ + F
Sbjct: 406 AVVSLDASGIILSN-----CLAFAANSDD----SSLGIIGNVQQRTFEVLYDVGRGVVGF 456
Query: 440 QRIDC 444
+ C
Sbjct: 457 RAGAC 461
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 175/383 (45%), Gaps = 50/383 (13%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G P +DTGS ++W+ C C C ++ FD + S T A
Sbjct: 80 VGLYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAA 139
Query: 148 TLPCDSSYC-------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
+ C C T++C ++C Y +Y +G + G S+ F+T G++ +
Sbjct: 140 LVSCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVV 199
Query: 201 YD----VGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNL 250
+ + FGCS + +D+ G+FG GP S S + G FS+C L
Sbjct: 200 ANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHC---L 256
Query: 251 NYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
E +L+LGE ILE +P+ Y + L+ I++ ++L ID N+F T
Sbjct: 257 KGGENGGGVLVLGE--ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFA---T 311
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY-SGNINRDLQ 367
++ G +DSGTTL +LV AY K + S P+ + CY N D+
Sbjct: 312 TNNQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQF--SKPIISKGNQCYLVSNSVGDI- 368
Query: 368 GFPAMAFHFAGGADLVLDAESVF----YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
FP ++ +F GGA +VL+ E + + ++++C+ + + +I+G +
Sbjct: 369 -FPQVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVE------QGFTILGDLVL 421
Query: 424 QNYNVAYDLVSKQLYFQRIDCEL 446
++ YDL ++++ + DC L
Sbjct: 422 KDKIFVYDLANQRIGWADYDCSL 444
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 112/372 (30%), Positives = 168/372 (45%), Gaps = 41/372 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++V+FS+G P ++DTGS L +V+C PC+ C + PS S T+ +PCDS+
Sbjct: 34 YFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSSTFTPVPCDSAE 93
Query: 156 CT-------NDC-GGYPDE-----CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
C C YP+ C Y RY D+ T+G F +ET+ G +
Sbjct: 94 CLLIPAPVGAPCSSSYPESPPQGACSYEYRYG---DNSSTVGV--FAYETATVGGIRVNH 148
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
V FGC + N S GV GLG A S T +KF+YC+ + ++ LI
Sbjct: 149 VAFGCGNRN-QGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSPTSVFSSLI 207
Query: 262 LGE---GAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
G+ I + TP+ + YYV + I G + L I P+ K D+ + G
Sbjct: 208 FGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLI-PDSAWKIDSVGNGGTI 266
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP-AWHLCYS-GNINRDLQGFPAMA 373
DSGTT+T+ P AY + E P P P LC + I+ + +P+
Sbjct: 267 FDSGTTVTYWSPQAYARIIAAFEKSVP--YPRAPPSPQGLPLCVNVSGIDHPI--YPSFT 322
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
F GA + + F + S ++ CLA+ S +G ++IG I QQNY V YD
Sbjct: 323 IEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDG-----FNVIGNIIQQNYLVQYDRE 377
Query: 434 SKQLYFQRIDCE 445
++ F +C+
Sbjct: 378 EHRIGFAHANCD 389
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 162/374 (43%), Gaps = 38/374 (10%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFDPSKSLTYATL 149
+++ IG P +DTGS ++WV C C+ C T +DP+ S + T+
Sbjct: 88 LYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTV 147
Query: 150 PCDSSYC-TNDCGGYP------DECWYNIRYTNGPDSQGTIGSEQFNF-ETSDEGKTFLY 201
C +C T GG P C Y+I Y +G + G ++ + + S +G+T L
Sbjct: 148 TCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNLA 207
Query: 202 D--VGFGCSHNNAHF---SDEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIGNLNYF 253
+ V FGC S+ G+ G G A SS S + KV FS+C+ +N
Sbjct: 208 NASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVN-- 265
Query: 254 EYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
+ +G + +TP+ Y V L+ I +G L + N+F G
Sbjct: 266 --GGGIFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGG--GSRG 321
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLC--YSGNINRDLQGFPA 371
IDSGTTL +L Y+ + V + D LC YSG+++ GFP
Sbjct: 322 TIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQD---FLCFQYSGSVD---NGFPE 375
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+ FHF G LV+ +Q + V+C+ + + KD+ ++G +A N V YD
Sbjct: 376 VTFHFDGDLPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYD 435
Query: 432 LVSKQLYFQRIDCE 445
L ++ + + +C
Sbjct: 436 LENQVIGWTNYNCS 449
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 146/353 (41%), Gaps = 34/353 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P V DTGS + W++C PC +C F+PS S ++ L C SS
Sbjct: 81 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSI 140
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C +EC Y + Y +G + G +E +F G+ + V GC NN
Sbjct: 141 CGKLKIKGC-SRKNECMYQVSYGDGSFTVGDFSTETLSF-----GEHAVRSVAMGCGRNN 194
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
+ S S FSYC+ A L+ G A+ E
Sbjct: 195 QGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAA--SLVFGPSAVPEKA 252
Query: 272 S----TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
P +D YYV L I + ++I P+ F + GV +DSGT ++ L
Sbjct: 253 RFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGT-GGVIVDSGTAISRLTT 311
Query: 328 SAYQTLRKEVEDLFQGLL--PSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
AY LR D F+ L+ PS P + CY + + PA+ F GGA + L
Sbjct: 312 PAYTALR----DAFRSLVTFPSAPGISLFDTCYDLSSMKTAT-LPAVVLDFDGGASMPLP 366
Query: 386 AESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
A+ + + +CLA P + + SIIG + QQ + ++ D +Q+
Sbjct: 367 ADGILVNVDDEGTYCLAFAPEE------EAFSIIGNVQQQTFRISIDNQKEQM 413
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 107/381 (28%), Positives = 171/381 (44%), Gaps = 46/381 (12%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V ++Y +G PP +DTGS ++WV C C C AT+ FDP S T +
Sbjct: 80 VGLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTAS 139
Query: 148 TLPCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFET---SDEGK 197
+ C C + C G ++C Y +Y +G + G + + + S
Sbjct: 140 LVSCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTS 199
Query: 198 TFLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLN 251
V FGCS + SD G+FG G S S + G FS+C L
Sbjct: 200 NSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHC---LK 256
Query: 252 YFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
+ +L+LGE I+E + TP+ Y + L+ IS+ ++L I P +F T
Sbjct: 257 GDDSGGGILVLGE--IVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPISPAVFA---TS 311
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
S G IDSGTTL +L AY V ++ S + S +++ D+ F
Sbjct: 312 SSQGTIIDSGTTLAYLAEEAYNAFVVAVTNIVSQSTQSVVLKGNRCYVTSSSVS-DI--F 368
Query: 370 PAMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
P ++ +FAGGA LVL A+ Q++S +V+C +G I G+ ++I+G + ++
Sbjct: 369 PQVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWC--IGFQKIPGQ---GITILGDLVLKD 423
Query: 426 YNVAYDLVSKQLYFQRIDCEL 446
YDL ++++ + DC +
Sbjct: 424 KIFIYDLANQRIGWTNYDCSM 444
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 143/350 (40%), Gaps = 77/350 (22%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ +N S+G PPV L + DTGS LIW +C PC+ C FDP KS TY TL
Sbjct: 29 YLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTL------ 82
Query: 156 CTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA-HF 214
G + SE F +++ + FGC H+N F
Sbjct: 83 -------------------------GYLSSETFTIGSTEGDPASFPGLAFGCGHSNGGTF 117
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTP 274
+++ + G S L KVG +FSYC+ L+ A + + G+ A++ G T
Sbjct: 118 NEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKSAVVSGSGTS 177
Query: 275 MSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLR 334
++ + IDSGTTLT L Y +
Sbjct: 178 SPA------------------------------AAEESNIIIDSGTTLTLLPRDFYTDME 207
Query: 335 KEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQES 394
+ + G + P + LCYSG ++ P + HF G AD+ L + F Q
Sbjct: 208 SALTKVIGGQTTTDPRG-TFSLCYSGVKKLEI---PTITAHFIG-ADVQLPPLNTFVQAQ 262
Query: 395 SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ C ++ PS +L+I G ++Q N+ V YDL + ++ F+ DC
Sbjct: 263 EDLVCFSMIPS-------SNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDC 305
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 106/356 (29%), Positives = 146/356 (41%), Gaps = 43/356 (12%)
Query: 107 QPPVPQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYCT---- 157
Q V Q V+DT S + WV+C PC QC +DP+KS T+A +PC S C
Sbjct: 164 QDAVSQTVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGS 223
Query: 158 ---NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
N C DEC Y + Y +G + GT ++ + + D FGCSH
Sbjct: 224 SYGNGCSPTTDECKYIVNYGDGKATTGTYVTDTLTMSPT----IVVKDFRFGCSHAVRGS 279
Query: 215 SDEQFTGVFGLGPATSS-THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE--GAILEGD 271
Q G+ LG S + G+ FSYCI + + L LG A L+
Sbjct: 280 FSNQNAGILALGGGRGSLLEQTADAYGNAFSYCIPKPS----SAGFLSLGGPVEASLKFS 335
Query: 272 STPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
TP+ + Y V LE I + K L + P F G +DSG +T L P
Sbjct: 336 YTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFA-------TGAVMDSGAVVTQLPPQ 388
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAES 388
Y LR P CY D++ P ++ FAGGA L L+ S
Sbjct: 389 VYAALRAAFRSAMAAYGPLAAPVRNLDTCYDFTRFPDVK-VPKVSLVFAGGATLDLEPAS 447
Query: 389 VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ CLA + GE + + IG + QQ Y V YD+ ++ F+R C
Sbjct: 448 IILDG-----CLAFAATP--GE--ESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 109/399 (27%), Positives = 170/399 (42%), Gaps = 51/399 (12%)
Query: 75 SQKSSQKAHDTRAHLHPGISTVPVFY-VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC 133
S + Q +T+ L GI + Y V +G + ++DTGS L WV+CQPC C
Sbjct: 110 SSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGKNMS--LIVDTGSDLTWVQCQPCRSC 167
Query: 134 ---GATTFDPSKSLTYATLPCDSSYC---------TNDCGG----YPDECWYNIRYTNGP 177
+DPS S +Y T+ C+SS C + CGG C Y + Y +G
Sbjct: 168 YNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGS 227
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
++G + SE G T L + FGC NN + + S ++
Sbjct: 228 YTRGDLASESILL-----GDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLK 282
Query: 238 KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV--------IDGSYYVTLEGI 289
FSYC+ +L + A L G + + +ST +S + Y + L G
Sbjct: 283 TFNGVFSYCLPSLE--DGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGA 340
Query: 290 SLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP 349
S+G +++ + F + G+ IDSGT +T L PS Y+ ++ E F G P+ P
Sbjct: 341 SIGG--VELKSSSFGR-------GILIDSGTVITRLPPSIYKAVKIEFLKQFSG-FPTAP 390
Query: 350 MDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDI 407
C++ D+ P + F G A+L +D VFY + +S+ CLA+
Sbjct: 391 GYSILDTCFNLTSYEDIS-IPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSY 449
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
E + IIG Q+N V YD ++L +C +
Sbjct: 450 ENE----VGIIGNYQQKNQRVIYDTTQERLGIVGENCRV 484
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 162/364 (44%), Gaps = 46/364 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++ IG P VLDTGS + W++C PC C T F+PS S +Y L CD+
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQ 210
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C ++C C Y + Y +G + G F ET G T + +V GC H+N
Sbjct: 211 CNALEVSECRNA--TCLYEVSYGDGSYTVG-----DFATETLTIGSTLVQNVAVGCGHSN 263
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSK-----FSYCIGNLNYFEYAYNMLILGEGA 266
G+F + + S+ FSYC+ ++ + + + G
Sbjct: 264 E--------GLFVGAAGLLGLGGGLLALPSQLNTTSFSYCL--VDRDSDSASTVEFGTSL 313
Query: 267 ILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ P+ +D YY+ L GIS+G ++L I + F+ +++ S G+ IDSGT +T
Sbjct: 314 PPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGS-GGIIIDSGTAVT 372
Query: 324 WLVPSAYQTLRKEVEDLFQGL--LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGAD 381
L Y +LR + +G L + CY+ + ++ P +AFHF GG
Sbjct: 373 RLQTGIYNSLR---DSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIE-VPTVAFHFPGGKM 428
Query: 382 LVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
L L A++ + +S FCLA P+ L+IIG + QQ V +DL + + F
Sbjct: 429 LALPAKNYMIPVDSVGTFCLAFAPTA------SSLAIIGNVQQQGTRVTFDLANSLIGFS 482
Query: 441 RIDC 444
C
Sbjct: 483 SNKC 486
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 152/363 (41%), Gaps = 40/363 (11%)
Query: 114 AVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYC-----------TND 159
A++DTGS + V QCG+ + FDP+ S +Y +PC S C +
Sbjct: 14 AIIDTGSEAVLV------QCGSRSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQP 67
Query: 160 CGGYPDECWYNIRYTNGPDSQGTIGSEQ--FNFETSDEGKTFLYDVGFGCSHNNAHF-SD 216
C C Y++ Y + +S G + N S DV FGC+H+ F D
Sbjct: 68 CVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFGCAHSPQGFLVD 127
Query: 217 EQFTGVFGLGPATSSTHSLVEKV--GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTP 274
G+ G S S ++ GSKFSYC + + A ++ LG+ + + +
Sbjct: 128 LGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKVSY 187
Query: 275 MSVIDGS--------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
++D YYV L IS+ K L I + FK + + D G +DSGTT T +V
Sbjct: 188 TPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVV 247
Query: 327 PSAYQTLRKE-VEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
AY R GL + CY+ + L G P + L L
Sbjct: 248 DDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELR 307
Query: 386 AESVFYQESSS----VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
E +F S++ CLA+ S +G F ++++G Q NY V YD ++ F+R
Sbjct: 308 FEHLFVPVSAAGNEVTVCLAILSSQKSG--FGKINVLGNYQQSNYLVEYDNERSRVGFER 365
Query: 442 IDC 444
DC
Sbjct: 366 ADC 368
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 144/362 (39%), Gaps = 35/362 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------TFDPSKSLTYATLPCD 152
F V +G P P + DTGS L WV+CQPC G FDPSKS TYA + C
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 208
Query: 153 SSYCTNDCGGYPDE----CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
C GG E C Y + Y +G + G + + +S L FGC
Sbjct: 209 EPQCAA-AGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSRA----LAGFPFGCG 263
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
N + S G+ FSYC+ + N L +G
Sbjct: 264 TRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSN---STTGYLTIGATPAT 320
Query: 269 EGDSTPMSVI------DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+ + + + Y+V L I +G +L + P +F + G +DSGT L
Sbjct: 321 DTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTRG------GTLLDSGTVL 374
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
T+L AY+ LR + P+ P D CY ++ PA++F F GA
Sbjct: 375 TYLPAQAYELLRDRFRLTMERYTPAPPND-VLDACYDFAGESEVI-VPAVSFRFGDGAVF 432
Query: 383 VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
LD V +V CLA D G LSIIG Q++ V YD+ ++++ F
Sbjct: 433 ELDFFGVMIFLDENVGCLAFAAMDAGG---LPLSIIGNTQQRSAEVIYDVAAEKIGFVPA 489
Query: 443 DC 444
C
Sbjct: 490 SC 491
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 165/373 (44%), Gaps = 41/373 (10%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKC--QPCEQCGATTFDPSKSLTYATLPCDSSYCTN 158
++ IG PP Q VLDTGS L W++C + T+FDPS S +++TLPC C
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDPSLSSSFSTLPCSHPLCKP 133
Query: 159 DCGGY--PDE------CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
+ P C Y+ Y +G ++G + E+ F ++ + GC+
Sbjct: 134 RIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLI----LGCATE 189
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI---GNLNYFEYAYNMLILGEGAI 267
+ SD++ G+ G+ S S + SKFSYCI N F LG+
Sbjct: 190 S---SDDR--GILGMNRGRLSFVSQAKI--SKFSYCIPPKSNRPGFT-PTGSFYLGDNPN 241
Query: 268 LEG----------DSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
G +S M +D +Y V + GI G K L+I ++F+ D +
Sbjct: 242 SHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRP-DAGGSGQTMV 300
Query: 317 DSGTTLTWLVPSAYQTLRKEV-EDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
DSG+ T LV +AY +R E+ + + L Y +C+ GN+ + + F
Sbjct: 301 DSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFV 360
Query: 376 FAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
F G ++ + E V + C+ +G S + G +IIG + QQN V +D+ ++
Sbjct: 361 FTRGVEIFVPKERVLVNVGGGIHCVGIGRSSMLGA---ASNIIGNVHQQNLWVEFDVTNR 417
Query: 436 QLYFQRIDCELLA 448
++ F + DC +
Sbjct: 418 RVGFAKADCSRVV 430
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 176/383 (45%), Gaps = 45/383 (11%)
Query: 94 STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLT 145
S V ++Y +G PP +DTGS ++WV C C C T+ FDP S T
Sbjct: 72 SQVGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSST 131
Query: 146 YATLPCDSSYC-----TND--CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
+ + C C T+D C G ++C Y +Y +G + G S+ +F + EG
Sbjct: 132 SSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTL 191
Query: 199 FL---YDVGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGN 249
V FGCS + S+ G+FG G S S + G FS+C+
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251
Query: 250 LNYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKND 307
N +L+LGE I+E + +P+ Y + L+ IS+ +++ I P++F
Sbjct: 252 DN---SGGGVLVLGE--IVEPNIVYSPLVPSQPHYNLNLQSISVNGQIVRIAPSVFA--- 303
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ 367
T ++ G +DSGTTL +L AY + + + S + + CY + ++
Sbjct: 304 TSNNRGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRS--VLSRGNQCYLITTSSNVD 361
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQES----SSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
FP ++ +FAGGA LVL + Q++ SV+C +G I+G+ ++I+G +
Sbjct: 362 IFPQVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWC--IGFQKISGQ---SITILGDLVL 416
Query: 424 QNYNVAYDLVSKQLYFQRIDCEL 446
++ YDL +++ + DC L
Sbjct: 417 KDKIFVYDLAGQRIGWANYDCSL 439
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 109/399 (27%), Positives = 170/399 (42%), Gaps = 51/399 (12%)
Query: 75 SQKSSQKAHDTRAHLHPGISTVPVFY-VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC 133
S + Q +T+ L GI + Y V +G + ++DTGS L WV+CQPC C
Sbjct: 110 SSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGKNMS--LIVDTGSDLTWVQCQPCRSC 167
Query: 134 ---GATTFDPSKSLTYATLPCDSSYC---------TNDCGG----YPDECWYNIRYTNGP 177
+DPS S +Y T+ C+SS C + CGG C Y + Y +G
Sbjct: 168 YNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGS 227
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
++G + SE G T L + FGC NN + + S ++
Sbjct: 228 YTRGDLASESILL-----GDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLK 282
Query: 238 KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV--------IDGSYYVTLEGI 289
FSYC+ +L + A L G + + +ST +S + Y + L G
Sbjct: 283 TFNGVFSYCLPSLE--DGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGA 340
Query: 290 SLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP 349
S+G +++ + F + G+ IDSGT +T L PS Y+ ++ E F G P+ P
Sbjct: 341 SIGG--VELKSSSFGR-------GILIDSGTVITRLPPSIYKAVKIEFLKQFSG-FPTAP 390
Query: 350 MDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDI 407
C++ D+ P + F G A+L +D VFY + +S+ CLA+
Sbjct: 391 GYSILDTCFNLTSYEDIS-IPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSY 449
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
E + IIG Q+N V YD ++L +C +
Sbjct: 450 ENE----VGIIGNYQQKNQRVIYDSTQERLGIVGENCRV 484
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 163/379 (43%), Gaps = 49/379 (12%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDC 160
V+ + G P VLDTGS L W+ C+ E + F+P S TY +PC S C
Sbjct: 69 VSLTAGTPLQNITMVLDTGSELSWLHCKK-EPNFNSIFNPLASKTYTKIPCSSPTCETRT 127
Query: 161 GGYP--------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
P C + I Y + +G + E F + T FGC +
Sbjct: 128 RDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTGPATV-----FGCMDSGF 182
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAI---- 267
+ E+ GL + S V ++G KFSYCI + + + +L+LGE +
Sbjct: 183 SSNSEEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCISDRD----SSGVLLLGEASFSWLK 238
Query: 268 ------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
L STP+ D +Y V LEGI + +K+L + ++F + T + +DSGT
Sbjct: 239 PLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGA-GQTMVDSGT 297
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINR-DLQGFPAMAF 374
T+L+ Y L++E +G+L P Y A LCY R L P +
Sbjct: 298 QFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNL 357
Query: 375 HFAGGADLVLDAESVFYQE------SSSVFCLAVGPSDING-ERFKDLSIIGMIAQQNYN 427
F GA++ + + + Y+ SV+C G SD G E F +IG QQN
Sbjct: 358 MFR-GAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESF----VIGHHQQQNVW 412
Query: 428 VAYDLVSKQLYFQRIDCEL 446
+ YDL ++ F + C+L
Sbjct: 413 MEYDLEKSRIGFAEVRCDL 431
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 159/366 (43%), Gaps = 39/366 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ +N S+G P + V DTGS LIW +C PC +C A F P+ S T++ LPC SS+
Sbjct: 86 YNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 156 C------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C C C YN +Y +G + G + +E G V FGCS
Sbjct: 146 CQFLPNSIRTCNA--TGCVYNYKYGSG-YTAGYLATETLKV-----GDASFPSVAFGCST 197
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAIL 268
N +G+ GLG SL+ ++G +FSYC+ + A +L +
Sbjct: 198 ENG--VGNSTSGIAGLG---RGALSLIPQLGVGRFSYCLRS-GSAAGASPILFGSLANLT 251
Query: 269 EGD--STPM----SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+G+ STP +V YYV L GI++GE L + + F G +DSGTTL
Sbjct: 252 DGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTL 311
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
T+L Y+ +++ + + LC+ P++ F GGA+
Sbjct: 312 TYLAKDGYEMVKQAFLSQTANVT-TVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAEY 370
Query: 383 VL----DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
+ Q S +V CL + P+ + + +S+IG + Q + ++ YDL
Sbjct: 371 AVPTYFAGVETDSQGSVTVACLMMLPAKGD----QPMSVIGNVMQMDMHLLYDLDGGIFS 426
Query: 439 FQRIDC 444
F DC
Sbjct: 427 FSPADC 432
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 119/439 (27%), Positives = 173/439 (39%), Gaps = 62/439 (14%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV-- 98
K L R L+ V R +S+AR L + + PG+ P
Sbjct: 48 KQLSRRELVRR---AVQRSKARAAALSVAR---LGGSNKGARQQDQNQQQPGLPVRPSGD 101
Query: 99 --FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDS 153
+ V+ ++G PP P A+LDTGS LIW +C PC C F P S +Y + C
Sbjct: 102 LEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAG 161
Query: 154 SYCTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF---ETSDEGKTFLYDVGFG 206
C + C PD C Y Y +G ++G +E+F F + E +GFG
Sbjct: 162 ELCNDILHHSC-QRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFG 220
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCI-------------GNLNY 252
C N S +G+ G G A SLV ++ +FSYC+ G+L
Sbjct: 221 CGTMNKG-SLNNGSGIVGFGRA---PLSLVSQLAIRRFSYCLTPYASGRKSTLLFGSLRG 276
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
Y + +L P YYV G+++G + L I + F S
Sbjct: 277 GVYDAATATVQTTRLLRSRQNPT-----FYYVPFTGVTVGARRLRIPISAFALRPDGS-G 330
Query: 313 GVFIDSGTTLTW----LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
G +DSGT LT ++ + R ++ F S P D +C++ +R +
Sbjct: 331 GAIVDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDD---GVCFAAAASRVPRP 387
Query: 369 --FPAMAFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
P M FH GADL L + V + CL + S +G + IG QQ+
Sbjct: 388 AVVPRMVFHLQ-GADLDLPRRNYVLDDQRKGNLCLLLADSGDSG------TTIGNFVQQD 440
Query: 426 YNVAYDLVSKQLYFQRIDC 444
V YDL + L F C
Sbjct: 441 MRVLYDLEADTLSFAPAQC 459
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 164/354 (46%), Gaps = 34/354 (9%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPC-------EQCGATTFDPSKSLTYATLPCDSSYCT 157
+GQP P VLDTGS + W++C PC EQ FDP S +Y + CDS C
Sbjct: 3 VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQI-TPIFDPELSSSYNPVSCDSEQCQ 61
Query: 158 --NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS 215
++ G + C Y + Y +G + G + +E F S+ + ++ GC H+N
Sbjct: 62 LLDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNS----IPNISIGCGHDNEGL- 116
Query: 216 DEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTP 274
F G GL S+ ++ S FSYC+ +++ +++ L + +P
Sbjct: 117 ---FVGADGLIGLGGGAISISSQLKASSFSYCLVDID--SPSFSTLDFNTDPPSDSLISP 171
Query: 275 MSVID---GSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQ 331
+ D YV + G+S+G K L I + F+ +++ G+ +DSGTT+T L Y+
Sbjct: 172 LVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGL-GGIIVDSGTTITQLPSDVYE 230
Query: 332 TLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY 391
LR+ L L P+ + P + CY + +++ P +AF G L L A++
Sbjct: 231 VLREAFLGLTTNLPPAPEISP-FDTCYDLSSQSNVE-VPTIAFILPGENSLQLPAKNCLI 288
Query: 392 Q-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
Q +S+ FCLA + LSIIG QQ V+YDL + + F C
Sbjct: 289 QVDSAGTFCLAFVSATF------PLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 155/379 (40%), Gaps = 49/379 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++++ +G PP +LDTGS L W++C PC +C +DP +S +Y + C S
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSYRNIGCHDSR 240
Query: 156 C--------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLY---DV 203
C C C Y Y + ++ G E F T GK L +V
Sbjct: 241 CHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENV 300
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC H N + S + L G FSYC+ + N + LI G
Sbjct: 301 MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFG 360
Query: 264 EG------------AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
E ++ G P +D YYV ++ I +G ++++I P + T
Sbjct: 361 EDKDLLSHPELNFTTLVAGKENP---VDTFYYVQIKSIVVGGEVVNI-PEEKWQIATDGS 416
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD---PAWHLCY--SGNINRDL 366
G IDSGTTL++ AYQ ++ + F + YP+ P CY +G DL
Sbjct: 417 GGTIIDSGTTLSYFAEPAYQVIK----EAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDL 472
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
P F+ GA E+ F + E V CLA I G LSIIG QQN
Sbjct: 473 ---PDFGIVFSDGAVWNFPVENYFIEIEPREVVCLA-----ILGTPPSALSIIGNYQQQN 524
Query: 426 YNVAYDLVSKQLYFQRIDC 444
+++ YD +L F C
Sbjct: 525 FHILYDTKKSRLGFAPTKC 543
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 159/374 (42%), Gaps = 34/374 (9%)
Query: 93 ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATT--FDPSKSLTYAT 148
I P + +G PP L +D + WV C C C GA++ FDP++S TY
Sbjct: 94 ILRTPSYVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRP 153
Query: 149 LPCDSSYC------TNDCGGYPD-ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
+ C + C T C P C +N+ Y + +G + + S+
Sbjct: 154 VRCGAPQCAQVPPATPSCPAGPGASCAFNLSYASS-TLHAVLGQDALSLSDSNGAAVPDD 212
Query: 202 DVGFGC----SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAY 257
FGC + + + G FG GP + + + GS FSYC+ + ++
Sbjct: 213 HYTFGCLRVVTGSGGSVPPQGLVG-FGRGPLSFLSQTKAT-YGSIFSYCLPSYKSSNFSG 270
Query: 258 NMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ + G +TP+ YYV + G+ + K + I + + G
Sbjct: 271 TLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGT 330
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAF 374
+D+GT T L P AY LR P+ P + CY N + + PA+AF
Sbjct: 331 IVDAGTMFTRLSPPAYAALRNAFRRGVSA--PAAPALGGFDTCYYVNGTKSV---PAVAF 385
Query: 375 HFAGGADLVLDAESVFYQESS-SVFCLAV--GPSD-INGERFKDLSIIGMIAQQNYNVAY 430
FAGGA + L E+V +S V CLA+ GPSD +N L+++ + QQN+ V +
Sbjct: 386 VFAGGARVTLPEENVVISSTSGGVACLAMAAGPSDGVN----AGLNVLASMQQQNHRVVF 441
Query: 431 DLVSKQLYFQRIDC 444
D+ + ++ F R C
Sbjct: 442 DVGNGRVGFSRELC 455
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 159/387 (41%), Gaps = 41/387 (10%)
Query: 89 LHPGIS-TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ------PCEQCGAT----- 136
+HP + ++V F +G P + V DTGS L W+ C+ C A
Sbjct: 72 MHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 131
Query: 137 -TFDPSKSLTYATLPCDSSYCT---------NDCGGYPDECWYNIRYTNGPDSQGTIGSE 186
F + S ++ T+PC + C +C C Y+ RY++G + G +E
Sbjct: 132 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 191
Query: 187 QFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS-THSLVEKVGSKFSY 245
E + K L++V GCS + S + GV GLG + S EK G KFSY
Sbjct: 192 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 251
Query: 246 CIGNLNYFEYAYNMLILGEGAILEGDSTPMS-------VIDGSYYVTLEGISLGEKMLDI 298
C+ + + N L G E M+ +++ Y V + GIS+G ML I
Sbjct: 252 CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 311
Query: 299 DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE-DLFQGLLPSYPMDPAWHLC 357
++ D G +DSG++LT+L AYQ + + L + + P +
Sbjct: 312 PSEVW---DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF 368
Query: 358 YSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSI 417
S L P + FHFA GA+ +S + V CL G S+
Sbjct: 369 NSTGFEESL--VPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT-----SV 421
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+G I QQN+ +DL K+L F C
Sbjct: 422 VGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 162/385 (42%), Gaps = 66/385 (17%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDC 160
+N IG PP Q VLDTGS L W++C +Q +FDPS S T++ LPC C
Sbjct: 77 INLPIGTPPQTQPMVLDTGSQLSWIQCHK-KQPPTASFDPSLSSTFSILPCTHPLCKPRI 135
Query: 161 GGYP-----DE---CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
+ D+ C Y+ Y +G ++G + E+F F S + GC+ +
Sbjct: 136 PDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTPPLI----LGCATES- 190
Query: 213 HFSDEQFTGVFG--LGPATSSTHSLVEKVGSKFSYCI-----------------GN---L 250
+D + G+ G LG + + S + +KFSYC+ GN
Sbjct: 191 --TDPR--GILGMNLGRLSFAKQSKI----TKFSYCVPPRQTRPGFTPTGSFYLGNNPSS 242
Query: 251 NYFEYAYNMLILGEGAILEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
F+Y ++ M D +Y + + GI + K L+I P +F+ D
Sbjct: 243 KGFKYV---------GMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRA-DAG 292
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEV-EDLFQGLLPSYPMDPAWHLCYSG----NINR 364
IDSG+ T+LV AY +R +V + L Y +C+ I R
Sbjct: 293 GSGQTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADMCFDSVKAVEIGR 352
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
+ M F F G ++V+ E V V C+ +G SD G +IIG QQ
Sbjct: 353 LIG---EMVFEFERGVEVVIPKERVLADVGGGVHCVGIGSSDKLGA---ASNIIGNFHQQ 406
Query: 425 NYNVAYDLVSKQLYFQRIDCELLAD 449
N V +DLV +++ F + DC L
Sbjct: 407 NLWVEFDLVRRRVGFGKADCSRLVK 431
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 108/387 (27%), Positives = 171/387 (44%), Gaps = 51/387 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QCG----ATTFDPSKSLTYATLPCDS 153
++V+ +G PP L V DTGS L WV+C C+ C +TF S T++ C S
Sbjct: 83 YFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCFS 142
Query: 154 SYCT-------NDCG--GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
S C N C C Y Y++G + G E TS + L +
Sbjct: 143 SLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLKSIA 202
Query: 205 FGCSHNNAHFS--DEQF---TGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYN 258
FGC + + S F +GV GLG S S L + G FSYC+ + +
Sbjct: 203 FGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSPPPTS 262
Query: 259 MLILGEGAILEGDS------TPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTW 309
L++G+ + D+ TP+ + + YY++++G+ + L IDP+++ D
Sbjct: 263 YLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWSL-DEL 321
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTL----RKEVEDLFQGLLPSYPMDPAWHLCYSG-NINR 364
+ G IDSGTTLT+L AY+ + ++EV+ LPS P SG ++
Sbjct: 322 GNGGTVIDSGTTLTFLTEPAYREILSAFKREVK------LPS--PTPGGASTRSGFDLCV 373
Query: 365 DLQG-----FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIG 419
++ G FP ++ G + + F S + CLA+ P + RF S+IG
Sbjct: 374 NVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRF---SVIG 430
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDCEL 446
+ QQ + + +D +L F R C +
Sbjct: 431 NLMQQGFLLEFDRGKSRLGFSRRGCAV 457
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 108/366 (29%), Positives = 163/366 (44%), Gaps = 52/366 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ + +G P Q ++DTGS + WV+C+PC QC + FDPS S TY+ C S+
Sbjct: 52 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAD 111
Query: 156 CT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C N C +C Y + Y +G + GT S+ G + + FGCS+
Sbjct: 112 CAQLGQEGNGCSS-SSQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVRSFQFGCSN 165
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEK----VGSKFSYCIGNLNYFEYAYNMLILGEG 265
+ F+D Q G+ GLG SLV + +G FSYC+ + G
Sbjct: 166 VESGFND-QTDGLMGLG---GGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 221
Query: 266 AILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
TPM S + Y V L+ I +G + L I ++F AG +DSGT +
Sbjct: 222 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS-------AGTVMDSGTVI 274
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYP---MDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
T L P+AY L + + P+ P +D + +SG + + P++A F+GG
Sbjct: 275 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFD--FSGQSSVSI---PSVALVFSGG 329
Query: 380 ADLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
A + LDA + CLA G SD + L IIG + Q+ + V YD+ +
Sbjct: 330 AVVSLDASGIILSN-----CLAFAGNSDDS-----SLGIIGNVQQRTFEVLYDVGRGVVG 379
Query: 439 FQRIDC 444
F+ C
Sbjct: 380 FRAGAC 385
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 173/405 (42%), Gaps = 58/405 (14%)
Query: 81 KAHDTRAHL--------------HPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVK 126
+AHDTR H HP S +++ IG P +DTGS ++WV
Sbjct: 125 RAHDTRRHGRILSAVDLPLGGNGHP--SEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVN 182
Query: 127 CQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGYPD-----ECWYNIRY 173
C C++C T +D S T + CD ++C+ G P +C Y++ Y
Sbjct: 183 CAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY 242
Query: 174 TNGPDSQGTIGSE--QF-----NFETSDEGKTFLYDVGFGCSHNNA---HFSDEQFTGVF 223
+G + G + Q+ NF+T+ T V FGC + + S E G+
Sbjct: 243 GDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT----VVFGCGNKQSGELGSSSEALDGIL 298
Query: 224 GLGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDG 280
G G A SS S + KV FS+C+ N++ + +GE + + TP+
Sbjct: 299 GFGQANSSMLSQLASSGKVKKVFSHCLDNVD----GGGIFAIGEVVEPKVNITPLVQNQA 354
Query: 281 SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDL 340
Y V ++ I +G LD+ + F+ D G IDSGTTL + Y L +++
Sbjct: 355 HYNVVMKEIEVGGDPLDVPSDAFESGDR---KGTIIDSGTTLAYFPQEVYVPLIEKILSQ 411
Query: 341 FQGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFC 399
Q L + ++ A+ Y+GN++ GFP + HF L + +Q +C
Sbjct: 412 -QPDLRLHTVEQAFTCFDYTGNVD---DGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWC 467
Query: 400 LAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ S + KDL+++G + N V YDL + + + +C
Sbjct: 468 IGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 512
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 160/379 (42%), Gaps = 58/379 (15%)
Query: 99 FYVNFSIGQPP-VPQLAVLDTGSSLIWVKCQPC-EQCGATT---FDPSKSLTYATLPCDS 153
+ + +G PP Q ++DTGS + WV+C+PC +QC FDPS S TY+ C S
Sbjct: 140 YVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSCSS 199
Query: 154 SYC--------TNDCGGYPDECWYNIRYTNGP-DSQGTIGSEQFNFETSDEGKTFLYDVG 204
+ C N C +C Y Y +G + GT S+ S+ +
Sbjct: 200 AACAQLFQEGNANGCSSS-GQCQYIAMYGDGSVGTTGTYSSDTLALG-SNSNTVVVSKFR 257
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-----SKFSYCIGNLNYFEYAYNM 259
FGCSH + + SLV + + FSYC L +
Sbjct: 258 FGCSHAETGITGLTAGLMG----LGGGAQSLVSQTAGTFGTTAFSYC---LPPTPSSSGF 310
Query: 260 LILGEGAILEGD--STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
L LG TPM S + Y V LE I +G + L I +F AG+
Sbjct: 311 LTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVF-------SAGM 363
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA------WHLCY--SGNINRDL 366
+DSGT +T L P+AY +L F+ + YP P+ C+ SG + +
Sbjct: 364 IMDSGTVVTRLPPTAYSSL----SSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSM 419
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
A+ F AGGA + LDA + Q E+SS+FCLA + +G IIG + Q+
Sbjct: 420 PTV-ALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGS----TGIIGNVQQRT 474
Query: 426 YNVAYDLVSKQLYFQRIDC 444
+ V YD+ + F+ C
Sbjct: 475 FQVLYDVAGGAVGFKAGAC 493
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 168/400 (42%), Gaps = 56/400 (14%)
Query: 76 QKSSQKAHDTRAHLHPGISTVPV----------FYVNFSIGQPPVPQLAVLDTGSSLIWV 125
Q +S H + L P + +P+ +Y+ +G PP +LDTGSSL W+
Sbjct: 87 QGASFSRHKSGHLLEPNSANIPLNPGLSIGSGNYYLKLGLGSPPKYYTMILDTGSSLSWL 146
Query: 126 KCQPC-EQCGATT---FDPSKSLTYATLPCDSSYCT-------ND--CGGYPDECWYNIR 172
+C+PC C + F+PS S TY L C SS C+ ND C C Y
Sbjct: 147 QCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSECSLLKAATLNDPLCTA-SGVCVYTAS 205
Query: 173 YTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSS 231
Y + S G + + S +F Y GC +N + G+ GL S
Sbjct: 206 YGDASYSMGYLSRDLLTLTPSQTLPSFTY----GCGQDNEGLFGKA-AGIVGLARDKLSM 260
Query: 232 THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEG 288
L K G FSYC+ L +G+ + TPM S Y++ L
Sbjct: 261 LAQLSPKYGYAFSYCLPTST--SSGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAA 318
Query: 289 ISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY 348
I++ + + + ++ IDSGT +T L S Y LR E + + Y
Sbjct: 319 ITVAGRPVGVAAAGYQ-------VPTIIDSGTVVTRLPISIYAALR---EAFVKIMSRRY 368
Query: 349 PMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPS 405
PA+ + C+ G++ + + G P + F GGADL L A ++ + + CLA S
Sbjct: 369 EQAPAYSILDTCFKGSL-KSMSGAPEIRMIFQGGADLSLRAPNILIEADKGIACLAFASS 427
Query: 406 DINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ ++IIG QQ YN+AYD+ + ++ F C
Sbjct: 428 N-------QIAIIGNHQQQTYNIAYDVSASKIGFAPGGCR 460
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 104/383 (27%), Positives = 171/383 (44%), Gaps = 52/383 (13%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C C C T+ FD S S T
Sbjct: 63 VGLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAG 122
Query: 148 TLPCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
+ C CT+ C D+C Y +Y +G + G S+ F+ + G++ +
Sbjct: 123 QVRCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFD-AILGQSLI 181
Query: 201 YD----VGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNL 250
+ + FGCS + +D+ G+FG G S S + G FS+C L
Sbjct: 182 DNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHC---L 238
Query: 251 NYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
+L+LGE ILE +P+ Y + L I++ ++L IDP F +++
Sbjct: 239 KGDGSGGGILVLGE--ILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPIDPAAFATSNS 296
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY-PMDPAWHLCYSGNINRDLQ 367
G +DSGTTL +LV AY V + PS P+ + CY + + Q
Sbjct: 297 ---QGTIVDSGTTLAYLVAEAYDPFVSAVNAIVS---PSVTPITSKGNQCYLVSTSVS-Q 349
Query: 368 GFPAMAFHFAGGADLVLDAESVFY----QESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
FP +F+FAGGA +VL E S+++C+ ++ + ++I+G +
Sbjct: 350 MFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGF-------QKVQGVTILGDLVL 402
Query: 424 QNYNVAYDLVSKQLYFQRIDCEL 446
++ YDLV +++ + DC L
Sbjct: 403 KDKIFVYDLVRQRIGWANYDCSL 425
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 164/375 (43%), Gaps = 42/375 (11%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYATL 149
++Y IG P +DTGS ++WV C C+ C T+ +DP+ S T T+
Sbjct: 84 LYYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TV 141
Query: 150 PCDSSYCT-NDCGGYP-------DECWYNIRYTNGPDSQGTIGSEQFNF-ETSDEGKTFL 200
CD +C N G P C + I Y +G + G S+ + + S G+T
Sbjct: 142 GCDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTP 201
Query: 201 YD--VGFGCSHN---NAHFSDEQFTGVFGLGPATSSTHSLV---EKVGSKFSYCIGNLNY 252
+ + FGC + S + G+ G G A SS S + KV F++C+ +
Sbjct: 202 SNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTV-- 259
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
+ + +G + +TP+ Y V L+GIS+G L + + F D+
Sbjct: 260 --HGGGIFAIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSGDS---K 314
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLC--YSGNINRDLQGFP 370
G IDSGTTL +L Y+TL V D +Q L D +C +SG+I+ GFP
Sbjct: 315 GTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLALHNYQD---FVCFQFSGSID---DGFP 368
Query: 371 AMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+ F F G L + +Q + ++C+ + + KD+ ++G + N V Y
Sbjct: 369 VVTFSFEGEITLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVY 428
Query: 431 DLVSKQLYFQRIDCE 445
DL + + + +C
Sbjct: 429 DLEKQVIGWADYNCS 443
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 157/373 (42%), Gaps = 40/373 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYATLP 150
+Y IG PP P +DTGS ++WV C C++C + +DP S + + +
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146
Query: 151 CDSSYCTNDCG----------GYPDECWYNIRYTNGPDSQGTIGSEQFNFET---SDEGK 197
CD+ +C G G P C Y Y +G + G+ S+ + + + +
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKP--CEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTR 204
Query: 198 TFLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNLN 251
+V FGC +++ G+ G G + +ST S + G FS+C+ +
Sbjct: 205 HAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIK 264
Query: 252 YFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ +GE + STP+ Y V L+ I + L + P++F +T
Sbjct: 265 ----GGGIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIF---ETSEK 317
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPA 371
G IDSGTTLT+L Y+ + V FQ LC+ + + D GFP
Sbjct: 318 RGTIIDSGTTLTYLPELVYKDILAAV---FQKHQDITFRTIQGFLCFEYSESVD-DGFPK 373
Query: 372 MAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+ FHF L + F+Q +++CL + KD+ ++G + N V YD
Sbjct: 374 ITFHFEDDLGLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVVYD 433
Query: 432 LVSKQLYFQRIDC 444
L + + + +C
Sbjct: 434 LEKQVIGWTDYNC 446
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 157/358 (43%), Gaps = 34/358 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+++ IG+PP VLDTGS + W++C PC +C + FDP S +Y+ + CD
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISSNSYSPIRCDEPQ 208
Query: 156 CT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C ++C C Y + Y +G + G +F ET G + +V GC HNN
Sbjct: 209 CKSLDLSECRN--GTCLYEVSYGDGSYTVG-----EFATETVTLGSAAVENVAIGCGHNN 261
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
F G GL S +V + FSYC+ +N A + L
Sbjct: 262 EGL----FVGAAGLLGLGGGKLSFPAQVNATSFSYCL--VNRDSDAVSTLEFNSPLPRNA 315
Query: 271 DSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
+ P+ +D YY+ L+GIS+G + L I + F+ D G+ IDSGT +T L
Sbjct: 316 ATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEV-DAIGGGGIIIDSGTAVTRLRS 374
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAE 387
Y LR +G +P + CY + +R+ P ++F F G +L L A
Sbjct: 375 EVYDALRDAFVKGAKG-IPKANGVSLFDTCYDLS-SRESVEIPTVSFRFPEGRELPLPAR 432
Query: 388 SVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ +S FC A P+ LSIIG + QQ V +D+ + + F C
Sbjct: 433 NYLIPVDSVGTFCFAFAPTT------SSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 165/372 (44%), Gaps = 38/372 (10%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFDPSKSLTYATL 149
++Y +G PP +DTGS ++WV C CEQC T +DP S T + +
Sbjct: 85 LYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMV 144
Query: 150 PCDSSYCTNDCGGYPDECW------YNIRYTNGPDSQGTIGSEQFNF-ETSDEGKTFLYD 202
CD ++C GG +C Y++ Y +G + G+ ++ F + + +G+T +
Sbjct: 145 MCDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPAN 204
Query: 203 --VGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIGNLNYFE 254
V FGC S++ G+ G G A +S S + KV F++C+ +
Sbjct: 205 ASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTIK--- 261
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ +G+ + +TP+ Y V L+ I +G L + ++F+ + G
Sbjct: 262 -GGGIFSIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGE---KKGT 317
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLC--YSGNINRDLQGFPAM 372
IDSGTTLT+L ++ + V + Q + D LC Y G+++ GFP +
Sbjct: 318 IIDSGTTLTYLPELVFKEVMLAVFNKHQDITFH---DVQGFLCFQYPGSVD---DGFPTI 371
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
FHF L + F+ + V+C+ + KD+ ++G + N V YDL
Sbjct: 372 TFHFEDDLALHVYPHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDLVLSNKLVIYDL 431
Query: 433 VSKQLYFQRIDC 444
++ + + +C
Sbjct: 432 ENRVIGWTDYNC 443
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 105/405 (25%), Positives = 173/405 (42%), Gaps = 58/405 (14%)
Query: 81 KAHDTRAHL--------------HPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVK 126
+AHDTR H HP S +++ IG P +DTGS ++WV
Sbjct: 44 RAHDTRRHGRILSAVDLPLGGNGHP--SEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVN 101
Query: 127 CQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGYPD-----ECWYNIRY 173
C C++C T +D S T + CD ++C+ G P +C Y++ Y
Sbjct: 102 CAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY 161
Query: 174 TNGPDSQGTIGSE--QF-----NFETSDEGKTFLYDVGFGCSHNNA---HFSDEQFTGVF 223
+G + G + Q+ NF+T+ T + FGC + + S E G+
Sbjct: 162 GDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVV----FGCGNKQSGELGSSSEALDGIL 217
Query: 224 GLGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDG 280
G G A SS S + KV FS+C+ N++ + +GE + + TP+
Sbjct: 218 GFGQANSSMLSQLASSGKVKKVFSHCLDNVD----GGGIFAIGEVVEPKVNITPLVQNQA 273
Query: 281 SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDL 340
Y V ++ I +G LD+ + F+ D G IDSGTTL + Y L +++
Sbjct: 274 HYNVVMKEIEVGGDPLDVPSDAFESGDR---KGTIIDSGTTLAYFPQEVYVPLIEKILSQ 330
Query: 341 FQGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFC 399
Q L + ++ A+ Y+GN++ GFP + HF L + +Q +C
Sbjct: 331 -QPDLRLHTVEQAFTCFDYTGNVD---DGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWC 386
Query: 400 LAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ S + KDL+++G + N V YDL + + + +C
Sbjct: 387 IGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 431
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 108/366 (29%), Positives = 159/366 (43%), Gaps = 35/366 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSY 155
++V+F +G PP ++D+GS L+WV+C PC QC A + PS S T+ +PC S
Sbjct: 65 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSPE 124
Query: 156 CT-------NDCG-GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C C YP C Y RY + S+G F +E++ + V FGC
Sbjct: 125 CLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGV-----FAYESATVDDVRIDKVAFGC 179
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVE-KVGSKFSYCIGNLNYFEYAYNMLILGE-- 264
+N S GV GLG S S V G+KF+YC+ N + LI G+
Sbjct: 180 GRDN-QGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSWLIFGDEL 238
Query: 265 -GAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
I + TP+ S YYV +E + +G + L I + + D + G DSGT
Sbjct: 239 ISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSL-DFLGNGGSIFDSGT 297
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
T+T+ +P AY+ + + + P LC D FP+ GGA
Sbjct: 298 TVTYWLPPAYRNILAAFDKNVR--YPRAASVQGLDLCVD-VTGVDQPSFPSFTIVLGGGA 354
Query: 381 DLVLDAESVFYQESSSVFCLAVG--PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
+ F + +V CLA+ PS + G + IG + QQN+ V YD ++
Sbjct: 355 VFQPQQGNYFVDVAPNVQCLAMAGLPSSVGG-----FNTIGNLLQQNFLVQYDREENRIG 409
Query: 439 FQRIDC 444
F C
Sbjct: 410 FAPAKC 415
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 104/377 (27%), Positives = 161/377 (42%), Gaps = 44/377 (11%)
Query: 89 LHPGISTVPV-FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQCGATT---FDPSKS 143
L+PG S +YV +G P ++DTGSSL W++C+PC C FDPS S
Sbjct: 2 LNPGASIGSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSAS 61
Query: 144 LTYATLPCDSSYCTN---------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSD 194
TY +L C SS C++ C + C Y Y + S G + + S
Sbjct: 62 KTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ 121
Query: 195 EGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFE 254
F+Y GC ++ G+ GLG + S++ +V SKF Y
Sbjct: 122 TLPGFVY----GCGQDSEGLFGRA-AGILGLG---RNKLSMLGQVSSKFGYAFSYCLPTR 173
Query: 255 YAYNMLILGEGAILEGDS---TPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDT 308
L +G+ A L G + TPM+ G+ Y++ L I++G + L + ++
Sbjct: 174 GGGGFLSIGK-ASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYR---- 228
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
IDSGT +T L S Y ++ + P C+ GN+ +D+Q
Sbjct: 229 ---VPTIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNL-KDMQS 284
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
P + F GGADL L +V Q + CLA ++ ++IIG QQ + V
Sbjct: 285 VPEVRLIFQGGADLNLRPVNVLLQVDEGLTCLAFAGNN-------GVAIIGNHQQQTFKV 337
Query: 429 AYDLVSKQLYFQRIDCE 445
A+D+ + ++ F C
Sbjct: 338 AHDISTARIGFATGGCN 354
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 169/383 (44%), Gaps = 53/383 (13%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYC---T 157
V ++G PP VLDTGS L W+ C+ G+ F+P S TY+ +PC S C T
Sbjct: 63 VTLAVGSPPQNISMVLDTGSELSWLHCKKSPNLGS-VFNPVSSSTYSPVPCSSPICRTRT 121
Query: 158 ND------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
D C C I Y + +G + + F + T FGC +
Sbjct: 122 RDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVTRPGTL-----FGCMDSG 176
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAI--- 267
E+ GL + S V ++G SKFSYCI + + +L+LG+ +
Sbjct: 177 LSSDSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSD----SSGILLLGDASYSWL 232
Query: 268 -------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
L +TP+ D +Y V LEGI +G K+L + ++F + T + +DSG
Sbjct: 233 GPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGA-GQTMVDSG 291
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYS-GNINR-DLQGFPAM 372
T T+L+ Y L+ E + +L P++ LCY G+ R + G P +
Sbjct: 292 TQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTRPNFTGLPVI 351
Query: 373 AFHFAGGADLVLDAESVFYQESSS-------VFCLAVGPSDING-ERFKDLSIIGMIAQQ 424
+ F GA++ + + + Y+ + + V+C G SD+ G E F +IG QQ
Sbjct: 352 SLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAF----VIGHHHQQ 406
Query: 425 NYNVAYDLVSKQLYFQ-RIDCEL 446
N + +DL ++ F + C+L
Sbjct: 407 NVWMEFDLAKSRVGFAGNVRCDL 429
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 101/364 (27%), Positives = 157/364 (43%), Gaps = 38/364 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATL--PCD 152
+ V ++G P + LDTGS + W +C+PC + T FDP KS +Y +
Sbjct: 45 YLVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSS 104
Query: 153 SSYCTNDCGG----YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
S D GG C Y ++Y +G S G +E+ SD FL FGC
Sbjct: 105 SCRIITDSGGARGCVSSTCIYKVQYGDGSYSVGFFATEKLTISPSDVISNFL----FGCG 160
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
NA + S EK + F+YC+ + + + L LG
Sbjct: 161 QQNAGRFGRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLPSFS--SSSTGHLTLGGQVPK 218
Query: 269 EGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
TP+S + Y + ++G+S+G +L ID ++F S+AG IDSGT +T L
Sbjct: 219 SVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVF------SNAGAIIDSGTVITRL 272
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADL 382
P+ Y L + FQ L+ YP + + CY + N + P ++F F GG ++
Sbjct: 273 QPTVYSALSSK----FQQLMKDYPKTDGFSILDTCYDFSGNESIS-VPRISFFFKGGVEV 327
Query: 383 VLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
+ + ++ CLA P+D +G D + G QQ Y+V +DL ++ F
Sbjct: 328 DIKFFGILTVINAWDKVCLAFAPNDDDG----DFVVFGNSQQQTYDVVHDLAKGRIGFAP 383
Query: 442 IDCE 445
C
Sbjct: 384 SGCN 387
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 174/382 (45%), Gaps = 52/382 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYATL 149
++Y +G PP +DTGS ++WV C C C ++ FDP S T + +
Sbjct: 89 LYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLI 148
Query: 150 PCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEG---KTF 199
C C+ + C ++C Y +Y +G + G S+ +F+T G K
Sbjct: 149 SCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNS 208
Query: 200 LYDVGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYF 253
+ FGCS + D G+FG G S S + G FS+C L
Sbjct: 209 SAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHC---LKGD 265
Query: 254 EYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ +L+LGE I+E + TP+ Y + L+ I + + L IDP++F T S+
Sbjct: 266 DSGGGILVLGE--IVEPNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFA---TSSN 320
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY-PMDPAWHLCY--SGNINRDLQG 368
G IDSGTTL +L +AY + PS P + CY S +IN D+
Sbjct: 321 QGTIIDSGTTLAYLTEAAYDPFISAITSTVS---PSVSPYLSKGNQCYLTSSSIN-DV-- 374
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
FP ++ +FAGG ++L + Q+SS +++C VG I G+ +++I+G + +
Sbjct: 375 FPQVSLNFAGGTSMILIPQDYLIQQSSINGAALWC--VGFQKIQGQ---EITILGDLVLK 429
Query: 425 NYNVAYDLVSKQLYFQRIDCEL 446
+ YD+ +++ + DC+
Sbjct: 430 DKIFVYDIAGQRIGWANYDCKF 451
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 104/388 (26%), Positives = 177/388 (45%), Gaps = 55/388 (14%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C C C T+ FDP S T A
Sbjct: 81 VGLYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAA 140
Query: 148 TLPCDSSYCTND-------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFET-------- 192
+ C CT C ++C Y +Y +G + G ++ + +T
Sbjct: 141 LVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGEL 200
Query: 193 SDEGKTFLYDVGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYC 246
S +T+ V F CS + SD G+FG G S S + G FS+C
Sbjct: 201 SQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHC 260
Query: 247 IGNLNYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFK 304
L + +L+LGE I+E + TP+ Y + L+ IS+ + L IDP++F
Sbjct: 261 ---LKGDDSGGGVLVLGE--IVEPNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPSVFG 315
Query: 305 KNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNI 362
+ S+ G +DSGTTL +L AY + + L + + CY + ++
Sbjct: 316 AS---SNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVS--LNARTYLSKGNQCYLVTSSV 370
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSII 418
N D+ FP ++ +FAGGA L+L+ + Q++S +V+C VG G++ ++I+
Sbjct: 371 N-DV--FPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWC--VGFQKTPGQQ---ITIL 422
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
G + ++ YD+ ++++ + DC +
Sbjct: 423 GDLVLKDKIFVYDIANQRVGWTNYDCSM 450
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 156/381 (40%), Gaps = 54/381 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++++ +G PP +LDTGS L W++C PC C + +DP S ++ + C
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 254
Query: 156 CT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLY---DV 203
C N C C Y Y +G ++ G E F T+ GK+ L +V
Sbjct: 255 CQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENV 314
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC H N + S + G FSYC+ + N + LI G
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFG 374
Query: 264 EGAILEGDST---------PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW----- 309
E L +D YYV + + + +++L I +TW
Sbjct: 375 EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKI------PEETWHLSSE 428
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLR----KEVE--DLFQGLLPSYPMDPAWHLCYSGNIN 363
G IDSGTTLT+ AY+ ++ ++++ +L +GL P P CY+ +
Sbjct: 429 GAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKP-------CYNVSGI 481
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
++ P FA GA E+ F Q V CLA I G LSIIG Q
Sbjct: 482 EKME-LPDFGILFADGAVWNFPVENYFIQIDPDVVCLA-----ILGNPRSALSIIGNYQQ 535
Query: 424 QNYNVAYDLVSKQLYFQRIDC 444
QN+++ YD+ +L + + C
Sbjct: 536 QNFHILYDMKKSRLGYAPMKC 556
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 147/360 (40%), Gaps = 25/360 (6%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTN 158
++V +G P V DTGS L WVKC F P S ++A +PC S C
Sbjct: 91 YFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGLVFRPEASKSWAPVPCSSDTCKL 150
Query: 159 D-------CGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
D C C Y+ RY G + G +G++ L DV GCS
Sbjct: 151 DVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLGCSST 210
Query: 211 NAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
+ S + GV LG A S S + G FSYC+ + A L G G +
Sbjct: 211 HDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATGYLAFGPGQVPR 270
Query: 270 GDSTPMSV-IDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+T + +D + Y V ++ + + + LDI ++ GV +DSGTTLT L
Sbjct: 271 TPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPK----SGGVILDSGTTLTVL 326
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINR-DLQGFPAMAFHFAGGADLVL 384
AY+ + + L G +P P H CY+ R P +A F G A L
Sbjct: 327 ATPAYKAVVAALTKLLAG-VPKVDFPPFEH-CYNWTAPRPGAPEIPKLAVQFTGCARLEP 384
Query: 385 DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
A+S V C+ + + G +S+IG I QQ + +DL + ++ F C
Sbjct: 385 PAKSYVIDVKPGVKCIGLQEGEWPG-----VSVIGNIMQQEHLWEFDLKNMEVRFMPSTC 439
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 157/366 (42%), Gaps = 62/366 (16%)
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC-----TNDCGGY--- 163
++DTGS L WV+C PC C F+PS S ++ +LPC+S C T G
Sbjct: 159 IVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSN 218
Query: 164 --PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTG 221
C Y I Y +G S+G +G FE GKT + + FGC NN F G
Sbjct: 219 KNSTSCDYQIDYGDGSYSRGELG-----FEKLTLGKTEIDNFIFGCGRNNKGL----FGG 269
Query: 222 VFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGEGAILEG--DSTPM 275
GL S SLV + GS FSYC+ + L LG GA + +P+
Sbjct: 270 ASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGV--GSSGSLTLG-GADFSNFKNISPI 326
Query: 276 SV--------IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
S + Y++ L GIS+G L++ P L S +DSGT +T L P
Sbjct: 327 SYTRMIQNPQMSNFYFLNLTGISIGGVNLNV-PRLSSNEGVLS----LLDSGTVITRLSP 381
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFHFAGGADL 382
S Y+ + E E F G Y P + + N +L G+ P + F F G A++
Sbjct: 382 SIYKAFKAEFEKQFSG----YRTTPGFSIL---NTCFNLTGYEEVNIPTVKFIFEGNAEM 434
Query: 383 VLDAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
++D E VFY + +S CLA + IIG Q+N V Y+ ++ F
Sbjct: 435 IVDVEGVFYFVKSDASQICLAFASLGYEDQTM----IIGNYQQKNQRVIYNSKESKVGFA 490
Query: 441 RIDCEL 446
C
Sbjct: 491 GEPCSF 496
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 168/373 (45%), Gaps = 48/373 (12%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQ---PCEQCGATTFDPSKSLTYATLPCDSSYC- 156
+ SIG PP P+ +LDTGS LIW +C+ + +DP+KS ++A PCD C
Sbjct: 91 LTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGRLCE 150
Query: 157 -----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
T +C ++C Y Y + ++G + SE F F E + + FGC
Sbjct: 151 TGSFNTKNCSR--NKCIYTYNYGSA-TTKGELASETFTF---GEHRRVSVSLDFGCGKLT 204
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
+ S +G+ G+ P S S ++ +FSYC+ + + G A L
Sbjct: 205 SG-SLPGASGILGISPDRLSLVSQLQI--PRFSYCLTPF-LDRNTTSHIFFGAMADLSKY 260
Query: 272 ST--PMSVI------DGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
T P+ DGS YYV L GIS+G K L++ + F S G F+DSG
Sbjct: 261 RTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGS-GGTFVDSGD 319
Query: 321 TLTWLVPSAYQTLRKE--VEDLFQGLLPSYPMDPAWHLCYS------GNINRDLQGFPAM 372
T T ++PS KE VE + ++ + + LC+ G + +Q P +
Sbjct: 320 T-TGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQ-VPPL 377
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
+HF GGA ++L +S + S+ CL + +G R +IIG QQN +V +D+
Sbjct: 378 VYHFDGGAAMLLRRDSYMVEVSAGRMCLVIS----SGARG---AIIGNYQQQNMHVLFDV 430
Query: 433 VSKQLYFQRIDCE 445
+ + F C
Sbjct: 431 ENHEFSFAPTQCN 443
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 116/426 (27%), Positives = 177/426 (41%), Gaps = 55/426 (12%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAV 115
VDA+ T+ + R + + A +H G + + + IG PP A+
Sbjct: 30 VDAKEHYTVEERVRRATERTHRRLASMGGVTAPIHWGGQSQ--YIAEYLIGDPPQRAEAI 87
Query: 116 LDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSSYCT----NDCGGYPDEC 167
+DTGS+LIW +C C + +DPS+S + C+ + C C C
Sbjct: 88 IDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDAACALGSETQCLSDNKTC 147
Query: 168 WYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC--SHNNAHFSDEQFTGVFGL 225
Y G + GT+ +E F++ + FGC + S +G+ GL
Sbjct: 148 AVVTGYGAG-NIAGTLATENLTFQSET------VSLVFGCIVVTKLSPGSLNGASGIIGL 200
Query: 226 GPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAY---NMLILGEGAILEG--DSTPMSVI- 278
G SL ++G ++FSYC+ YFE +M++ ++ G STP++ +
Sbjct: 201 G---RGKLSLPSQLGDTRFSYCL--TPYFEDTIEPSHMVVGASAGLINGSASSTPVTTVP 255
Query: 279 ----------DGSYYVTLEGISLGEKMLDIDPNLFKKNDT----WSDAGVFIDSGTTLTW 324
YY+ L GI+ G+ L + F W+ G FIDSG LT
Sbjct: 256 FVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVAPGMWT--GTFIDSGAPLTS 313
Query: 325 LVPSAYQTLRKEV-EDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA--- 380
LV AYQ LR E+ L L+ + LC + L P + HF GG+
Sbjct: 314 LVDVAYQALRAELARQLGAALVQPLAGTTGFDLCVALKDAERL--VPPLVLHFGGGSGTG 371
Query: 381 -DLVLDAESVFYQESSSVFCLAVGPS-DINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
DLV+ + + S+ C+ V S D + ++IG QQN +V YDL L
Sbjct: 372 TDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVLS 431
Query: 439 FQRIDC 444
FQ DC
Sbjct: 432 FQPADC 437
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 146/353 (41%), Gaps = 34/353 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P V DTGS + W++C PC +C F+PS S ++ L C SS
Sbjct: 14 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSI 73
Query: 156 C----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C ++C Y + Y +G + G +E +F G+ + V GC NN
Sbjct: 74 CGKLKIKGC-SRKNKCMYQVSYGDGSFTVGDFSTETLSF-----GEHAVRSVAMGCGRNN 127
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
+ S S FSYC+ A L+ G A+ E
Sbjct: 128 QGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAA--SLVFGPSAVPEKA 185
Query: 272 S----TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
P +D YYV L I + ++I P+ F + GV +DSGT ++ L
Sbjct: 186 RFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGT-GGVIVDSGTAISRLTT 244
Query: 328 SAYQTLRKEVEDLFQGLL--PSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
AY LR D F+ L+ PS P + CY + + PA+ F GGA + L
Sbjct: 245 PAYTALR----DAFRSLVTFPSAPGISLFDTCYDLSSMKTAT-LPAVVLDFDGGASMPLP 299
Query: 386 AESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
A+ + + +CLA P + + SIIG + QQ + ++ D +Q+
Sbjct: 300 ADGILVNVDDEGTYCLAFAPEE------EAFSIIGNVQQQTFRISIDNQKEQM 346
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 157/366 (42%), Gaps = 62/366 (16%)
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC-----TNDCGGY--- 163
++DTGS L WV+C PC C F+PS S ++ +LPC+S C T G
Sbjct: 80 IVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSN 139
Query: 164 --PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTG 221
C Y I Y +G S+G +G FE GKT + + FGC NN F G
Sbjct: 140 KNSTSCDYQIDYGDGSYSRGELG-----FEKLTLGKTEIDNFIFGCGRNNKGL----FGG 190
Query: 222 VFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGEGAILEG--DSTPM 275
GL S SLV + GS FSYC+ + L LG GA + +P+
Sbjct: 191 ASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGV--GSSGSLTLG-GADFSNFKNISPI 247
Query: 276 SV--------IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
S + Y++ L GIS+G L++ P L S +DSGT +T L P
Sbjct: 248 SYTRMIQNPQMSNFYFLNLTGISIGGVNLNV-PRLSSNEGVLS----LLDSGTVITRLSP 302
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-----PAMAFHFAGGADL 382
S Y+ + E E F G Y P + + N +L G+ P + F F G A++
Sbjct: 303 SIYKAFKAEFEKQFSG----YRTTPGFSIL---NTCFNLTGYEEVNIPTVKFIFEGNAEM 355
Query: 383 VLDAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
++D E VFY + +S CLA + IIG Q+N V Y+ ++ F
Sbjct: 356 IVDVEGVFYFVKSDASQICLAFASLGYEDQTM----IIGNYQQKNQRVIYNSKESKVGFA 411
Query: 441 RIDCEL 446
C
Sbjct: 412 GEPCSF 417
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 114/379 (30%), Positives = 152/379 (40%), Gaps = 66/379 (17%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFDPSKSLTYATLPCDS 153
+ V IG P V Q ++DTGS L WV+C+PC +DP+ S TYA +PCDS
Sbjct: 127 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYAPVPCDS 186
Query: 154 SY------------CTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
CTN G C Y I Y N + G +E + +
Sbjct: 187 KACKDLVPDAYDHGCTNSSGT--SLCQYGIEYGNRDTTVGVYSTETLTLSP----QVSVK 240
Query: 202 DVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
D GFGC D + G S E G FSYC+ N L
Sbjct: 241 DFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPGN---STTGFLA 297
Query: 262 LGEGAILEGDS-----TPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
LG D+ TP+ + Y V L G+S+G K LDI P + G
Sbjct: 298 LGA-PTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLS-------GG 349
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP-----AWHLCY--SGNINRDL 366
+ IDSGT +T L +AY LR F+ + +YP+ P CY +G N +
Sbjct: 350 MIIDSGTIITGLPDTAYSALRTA----FRTAMSAYPLLPPNNDDVLDTCYNFTGIANVTV 405
Query: 367 QGFPAMAFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
P +A F GGA + LD S V Q+ CLA G D+ IIG + Q+
Sbjct: 406 ---PTVALTFDGGATIDLDVPSGVLIQD-----CLAFA----GGASDGDVGIIGNVNQRT 453
Query: 426 YNVAYDLVSKQLYFQRIDC 444
+ V YD + F+ C
Sbjct: 454 FEVLYDSGRGHVGFRPGAC 472
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 116/383 (30%), Positives = 162/383 (42%), Gaps = 64/383 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSY 155
+ ++ ++G PP P A+LDTGS LIW +C C C F P S +Y + C
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQL 157
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C PD C Y Y +G + G +E+F F +S G+T +GFGC N
Sbjct: 158 CGDILHHSC-VRPDTCTYRYSYGDGTTTLGYYATERFTFASS-SGETQSVPLGFGCGTMN 215
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGA---- 266
S +G+ G G SLV ++ +FSYC+ Y + L G A
Sbjct: 216 VG-SLNNASGIVGFG---RDPLSLVSQLSIRRFSYCL--TPYASSRKSTLQFGSLADVGL 269
Query: 267 ------------ILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
IL+ P YYV G+++G + L I + F S GV
Sbjct: 270 YDDATGPVQTTPILQSAQNPT-----FYYVAFTGVTVGARRLRIPASAFALRPDGS-GGV 323
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM----DPAWHLCY--------SGNI 362
IDSGT LT L P+A + EV F+ L P P +C+ G +
Sbjct: 324 IIDSGTALT-LFPAA---VLAEVVRAFRSQL-RLPFANGSSPDDGVCFAAPAVAAGGGRM 378
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMI 421
R + P M FHF GADL L E+ V C+ +G S +G + IG
Sbjct: 379 ARQV-AVPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLLGDSGDDG------ATIGNF 430
Query: 422 AQQNYNVAYDLVSKQLYFQRIDC 444
QQ+ V YDL + L F ++C
Sbjct: 431 VQQDMRVVYDLERETLSFAPVEC 453
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 113/422 (26%), Positives = 187/422 (44%), Gaps = 57/422 (13%)
Query: 59 QAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDT 118
+ QR ++S AR + + +S + H + ++ V+ ++G PP VLDT
Sbjct: 35 KTQRHSHISTARKYFTTATASSTTNKLLFHHNVSLT------VSLTVGSPPQNVTMVLDT 88
Query: 119 GSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYC---TND------CGGYPDECWY 169
GS L W+ C+ Q + F+P S TY+ +PC S C T D C C
Sbjct: 89 GSELSWLHCKK-TQFLNSVFNPLSSKTYSKVPCLSPTCKTRTRDLTIPVSCDA-TKLCHV 146
Query: 170 NIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPAT 229
+ Y + +G + E F + + T FGC + + E+ + GL
Sbjct: 147 IVSYADATSIEGNLAFETFRLGSLTKPATI-----FGCMDSGFSSNSEEDSKTTGLIGMN 201
Query: 230 SSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAI----------LEGDSTPMSVI 278
+ S V ++G KFSYCI + + +L+LG + L STP+
Sbjct: 202 RGSLSFVNQMGYPKFSYCISGFD----SAGVLLLGNASFPWLKPLSYTPLVQISTPLPYF 257
Query: 279 DG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEV 337
D +Y V LEGI + K+L + ++F + T + +DSGT T+L+ Y L+ E
Sbjct: 258 DRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGA-GQTMVDSGTQFTFLLGPVYTALKNEF 316
Query: 338 EDLFQGLLP-----SYPMDPAWHLCYSGNINR-DLQGFPAMAFHFAGGADLVLDAESVFY 391
+G+L ++ A LCY + +R +LQ P ++ F GA++ + E + Y
Sbjct: 317 LSQTRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMFQ-GAEMSVSGERLLY 375
Query: 392 QE------SSSVFCLAVGPSDING-ERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ SV+C G SD+ G E F +IG QQN + +DL ++ + C
Sbjct: 376 RVPGEVRGRDSVWCFTFGNSDLLGVEAF----VIGHHHQQNVWMEFDLEKSRIGLADVRC 431
Query: 445 EL 446
++
Sbjct: 432 DV 433
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 163/363 (44%), Gaps = 53/363 (14%)
Query: 115 VLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYC-------------TN 158
++DT S L WV+C PCE C FDPS S +YA +PC+SS C
Sbjct: 167 IVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAA 226
Query: 159 DCGGYPDE---CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS 215
C G C Y + Y +G S+G + ++ + + FGC +N
Sbjct: 227 ACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSL-----AGEVIDGFVFGCGTSNQGPP 281
Query: 216 DEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTP 274
+G+ GLG + S S +++ G FSYC+ L + + L++G+ + + +STP
Sbjct: 282 FGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCL-PLKESD-SSGSLVIGDDSSVYRNSTP 339
Query: 275 M---SVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
+ S++ Y+V L GI++G + ++ + IDSGT +T LV
Sbjct: 340 IVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGKA----IIDSGTVITSLV 395
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLV 383
PS Y ++ E F YP P + + C++ R++Q P++ F GG ++
Sbjct: 396 PSIYNAVKAE----FLSQFAEYPQAPGFSILDTCFNMTGLREVQ-VPSLKLVFDGGVEVE 450
Query: 384 LDAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
+D+ V Y SS CLA+ P E +IIG Q+N V +D Q+ F +
Sbjct: 451 VDSGGVLYFVSSDSSQVCLAMAPLKSEYE----TNIIGNYQQKNLRVIFDTSGSQVGFAQ 506
Query: 442 IDC 444
C
Sbjct: 507 ETC 509
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 162/376 (43%), Gaps = 42/376 (11%)
Query: 89 LHPG--ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT--TFDPSKSL 144
+ PG I ++P + +G P L +D + WV C C C A+ +F P++S
Sbjct: 71 IAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQSS 130
Query: 145 TYATLPCDSSYCTN----DC-GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
TY T+PC S C C G C +N+ Y Q +G + E
Sbjct: 131 TYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAAS-TFQAVLGQDSLALE-----NNV 184
Query: 200 LYDVGFGCSH--NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAY 257
+ FGC + + G FG GP + + + + GS FSYC+ N ++
Sbjct: 185 VVSYTFGCLRVVSGNSVPPQGLIG-FGRGPLSFLSQTK-DTYGSVFSYCLPNYRSSNFSG 242
Query: 258 NMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ + G +TP+ YYV + GI +G K++ + + N + +G
Sbjct: 243 TLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPV-TGSGT 301
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL--PSYPMDPAWHLCYSGNINRDLQGFPAM 372
ID+GT T L Y +R D F+G + P P + CY+ ++ P +
Sbjct: 302 IIDAGTMFTRLAAPVYAAVR----DAFRGRVRTPVAPPLGGFDTCYNVTVS-----VPTV 352
Query: 373 AFHFAGGADLVLDAESVFYQESS-SVFCLAV--GPSD-INGERFKDLSIIGMIAQQNYNV 428
F FAG + L E+V SS V CLA+ GPSD +N L+++ + QQN V
Sbjct: 353 TFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAA----LNVLASMQQQNQRV 408
Query: 429 AYDLVSKQLYFQRIDC 444
+D+ + ++ F R C
Sbjct: 409 LFDVANGRVGFSRELC 424
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 122/456 (26%), Positives = 192/456 (42%), Gaps = 49/456 (10%)
Query: 20 TRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSS 79
T +FT+ A P AG R L H D T + R S AR L Q+
Sbjct: 17 TLLFTAA-ATPTAGLTMR--ADLTHVDK---GRGFTRWERLSRMAVRSRARAASLYQRGG 70
Query: 80 QKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAV-LDTGSSLIWVKCQPCEQC---GA 135
A P + ++F+IG P ++A+ +DTGS L+W +C PC C
Sbjct: 71 HYGQPVTATAVPSSGE---YLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPF 127
Query: 136 TTFDPSKSLTYATLPCDSSYC-------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQF 188
FDPS S T+ + C C + C C+Y Y + + G I + F
Sbjct: 128 PLFDPSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTF 187
Query: 189 NFETSD-EGK--TFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSY 245
F + + EG + + FGC N +G+ G G S S + +VG +FSY
Sbjct: 188 TFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQL-RVG-RFSY 245
Query: 246 CIGNLNYFEYAYNMLIL------GEGAILEGDSTPMSVIDGS-----YYVTLEGISLGEK 294
C+ + + E + G A G +I YY++LEGI++G+
Sbjct: 246 CLTSHDETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKT 305
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY--PMDP 352
L +D ++F S G IDSGT +T + ++ L+ E + Q LP Y +
Sbjct: 306 RLPVDSSVFALKKDGS-GGTVIDSGTGVTTFPAAVFEQLKNEF--VAQLPLPRYDNTSEV 362
Query: 353 AWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQES-SSVFCLAVGPSDINGER 411
LC+ P + FH A AD+ L E+ +++ S V CL + +++
Sbjct: 363 GNLLCFQRPKGGKQVPVPKLIFHLA-SADMDLPRENYIPEDTDSGVMCLMINGAEV---- 417
Query: 412 FKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
D+ +IG QQN ++ YD+ + +L F C+ +
Sbjct: 418 --DMVLIGNFQQQNMHIVYDVENSKLLFASAQCDKM 451
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 162/376 (43%), Gaps = 42/376 (11%)
Query: 89 LHPG--ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT--TFDPSKSL 144
+ PG I ++P + +G P L +D + WV C C C A+ +F P++S
Sbjct: 90 IAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQSS 149
Query: 145 TYATLPCDSSYCTN----DC-GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
TY T+PC S C C G C +N+ Y Q +G + E
Sbjct: 150 TYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAAS-TFQAVLGQDSLALE-----NNV 203
Query: 200 LYDVGFGCSH--NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAY 257
+ FGC + + G FG GP + + + + GS FSYC+ N ++
Sbjct: 204 VVSYTFGCLRVVSGNSVPPQGLIG-FGRGPLSFLSQTK-DTYGSVFSYCLPNYRSSNFSG 261
Query: 258 NMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ + G +TP+ YYV + GI +G K++ + + N + +G
Sbjct: 262 TLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPV-TGSGT 320
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL--PSYPMDPAWHLCYSGNINRDLQGFPAM 372
ID+GT T L Y +R D F+G + P P + CY+ ++ P +
Sbjct: 321 IIDAGTMFTRLAAPVYAAVR----DAFRGRVRTPVAPPLGGFDTCYNVTVS-----VPTV 371
Query: 373 AFHFAGGADLVLDAESVFYQESS-SVFCLAV--GPSD-INGERFKDLSIIGMIAQQNYNV 428
F FAG + L E+V SS V CLA+ GPSD +N L+++ + QQN V
Sbjct: 372 TFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAA----LNVLASMQQQNQRV 427
Query: 429 AYDLVSKQLYFQRIDC 444
+D+ + ++ F R C
Sbjct: 428 LFDVANGRVGFSRELC 443
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 163/375 (43%), Gaps = 44/375 (11%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYATL 149
++Y IG PP +DTGS ++WV C C +C + +DP S + +T+
Sbjct: 82 LYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKGSSSGSTV 141
Query: 150 PCDSSYCTNDCGG-YPD-----ECWYNIRYTNGPDSQGTIGSEQFNF-ETSDEGKTFLYD 202
CD +C GG P C Y++ Y +G + G S+ + + S +G+T +
Sbjct: 142 SCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVSGDGQTRHAN 201
Query: 203 --VGFGCSHNNAH---FSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNLNYFE 254
V FGC +++ G+ G G + +S S + G FS+C+ +
Sbjct: 202 ASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLDTIK--- 258
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ +G+ + STP+ Y V LE I++G L + ++F +T G
Sbjct: 259 -GGGIFAIGDVVQPKVKSTPLVPDMPHYNVNLESINVGGTTLQLPSHMF---ETGEKKGT 314
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-----LCYSGNINRDLQGF 369
IDSGTTLT+L Y+ D+ + +P D +H LC + D GF
Sbjct: 315 IIDSGTTLTYLPELVYK-------DVLAAVFAKHP-DTTFHSVQDFLCIQYFQSVD-DGF 365
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P + FHF L + F+Q +++C + + KD+ ++G + N V
Sbjct: 366 PKITFHFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDLVLSNKVVV 425
Query: 430 YDLVSKQLYFQRIDC 444
YDL ++ + + +C
Sbjct: 426 YDLENQVVGWTDYNC 440
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 107/388 (27%), Positives = 162/388 (41%), Gaps = 59/388 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT----FDPSKSLTYATLPCDSS 154
++V+ +G PP L V DTGS L+WVKC C C T F S T++ C S
Sbjct: 89 YFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDS 148
Query: 155 YCT-------NDC--GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
C + C C Y Y +G + G E TS + L + F
Sbjct: 149 ACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAF 208
Query: 206 GC------------SHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNY 252
GC S N AH GV GLG S S L + G+KFSYC+ + +
Sbjct: 209 GCAFRISGPSVSGASFNGAH-------GVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDI 261
Query: 253 FEYAYNMLILG--EGAILEGDS----TPMSVIDGS---YYVTLEGISLGEKMLDIDPNLF 303
+ L++G + + G TP+ + S YY+ +E +S+ L I+P+++
Sbjct: 262 SPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVW 321
Query: 304 KKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNIN 363
D + G +DSGTTLT+L AY + ++ + P+ P P + LC N++
Sbjct: 322 AL-DELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPT-PGFDLCV--NVS 377
Query: 364 R-DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV----GPSDINGERFKDLSII 418
+ P ++F G + + F V CLA+ PS S+I
Sbjct: 378 EIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPS--------GFSVI 429
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
G + QQ + + +D +L F R C L
Sbjct: 430 GNLMQQGFLLEFDKDRTRLGFSRHGCAL 457
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 112/397 (28%), Positives = 166/397 (41%), Gaps = 45/397 (11%)
Query: 69 ARFIYLSQ-KSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKC 127
AR YLS +S KA + + + V +G P VLDT WV C
Sbjct: 68 ARVTYLSSLVASPKATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAAWVPC 127
Query: 128 QPCEQCGATTFDPSKSLTYATLPCDSSYCTNDCG-GYPD----ECWYNIRYTNGPDSQGT 182
C C + TF P+ S TYA+L C CT G P C++N Y
Sbjct: 128 ADCAGCSSPTFSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYG-------- 179
Query: 183 IGSEQFNFETSDEGKTFLYDV----GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEK 238
G F+ S + D FGC N S G+ GLG SL+ +
Sbjct: 180 -GDSSFSAMLSQDSLGLAVDTLPSYSFGCV-NAVSGSTLPPQGLLGLG---RGPMSLLSQ 234
Query: 239 VGS----KFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS---YYVTLEGISL 291
GS FSYC + + ++ ++ + G +TP+ YYV L G+S+
Sbjct: 235 SGSLYSGVFSYCFPSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSV 294
Query: 292 GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD 351
G ++ + P L D + AG IDSGT +T V Y +R E +G +
Sbjct: 295 GRVLVPVAPELLAF-DPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKG---PFATI 350
Query: 352 PAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAV--GPSDIN 408
A+ C++ N D+ P + FHF G DL L E+ S+ S+ CLA+ P+++N
Sbjct: 351 GAFDTCFAAT-NEDIA--PPVTFHFT-GMDLKLPLENTLIHSSAGSLACLAMAAAPNNVN 406
Query: 409 GERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
L++I + QQN + +D+ + +L R C
Sbjct: 407 SV----LNVIANLQQQNLRIMFDVTNSRLGIARELCN 439
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 128/465 (27%), Positives = 204/465 (43%), Gaps = 46/465 (9%)
Query: 1 MPSSHAILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHR---DSLLYNPNDTVD 57
+PS+ L L ++ +PF + F+ A GK L+H +S Y PN T
Sbjct: 6 IPSAIFRLFLLILHIPFPLSSSFSLPLKELAKGKAYGFKAPLIHWSSPESPFYEPNLTPG 65
Query: 58 AQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYV-NFSIGQPPVPQLAVL 116
+ ++ S AR + + S ++R + IS + YV F+IG PPV A+
Sbjct: 66 ELMRASVRTSRARGDRIRKIRSSGISNSRKYPVSRISIIDKVYVMKFNIGSPPVETYAIP 125
Query: 117 DTGSSLIWVKCQP--CEQC---GATTFDPSKSLTYATLPCDSSYCTN---------DCGG 162
DTGS+++W++C C C F+P+KS TYA C C C
Sbjct: 126 DTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGHRECKQALWGLGEYLGCKS 185
Query: 163 YPDECWYNIRYTNGPDSQGTIGSEQFNF-ETSDEGKTFLYDVGFGCSHNNAHFSDE---Q 218
C Y+I Y + S+GTI ++ F E E + + FGC +NN+ +
Sbjct: 186 SVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRMFFGCGYNNSETPGQDPNS 245
Query: 219 FT--GVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLI-LGEGAILEGDSTP 274
FT GV GLG + SLV ++ +FSYCI + + + I G A + G ST
Sbjct: 246 FTAPGVVGLG---NEMASLVGQLTLGQFSYCISTPDVQKPNGTIEIRFGLAASISGHSTA 302
Query: 275 MS-VIDGSY-YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQT 332
++ ++G Y + ++GI + + + P + G+ +DSGTT T L SA
Sbjct: 303 LANNLEGWYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGIGGLIMDSGTTYTELYFSALDA 362
Query: 333 LRKEVEDLFQGLLPSYP--MDPAWHLCYSGNINRDLQGFPAMAFHFAGGAD--LVLDAES 388
L E+++ + L P + + LCY+ N L PA+ F + +
Sbjct: 363 LIGELKEQIE-LAPDTQDHSNSNYSLCYNA-ANFLLTYVPAIELKFTDNKEAYFPFTLRN 420
Query: 389 VFYQESSSVFCLAV-GPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
+ + +CLA+ G S I SIIG+ ++ + YDL
Sbjct: 421 AWIDNGNDQYCLAMFGTSGI--------SIIGIYQHRDIKIGYDL 457
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 106/359 (29%), Positives = 159/359 (44%), Gaps = 37/359 (10%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSYC 156
++ ++ IG PP LD S L+W C GAT F+P +S T A +PC C
Sbjct: 99 MYVFSYGIGTPPQQVSGALDISSDLVWTAC------GATAPFNPVRSTTVADVPCTDDAC 152
Query: 157 TN----DCGGYPDECWYNIRYTNGP-DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
CG EC Y Y G ++ G +G+E F F G T + V FGC N
Sbjct: 153 QQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTF-----GDTRIDGVVFGCGLKN 207
Query: 212 -AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
FS +GV GLG S S ++ +FSY + + + ++ G+ A +
Sbjct: 208 VGDFSG--VSGVIGLGRGNLSLVSQLQV--DRFSYHFAPDDSVD-TQSFILFGDDATPQT 262
Query: 271 DSTPMSVIDGS------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
T + + S YYV L GI + K L I F + GVF+ +T
Sbjct: 263 SHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTV 322
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVL 384
L +AY+ LR+ V GL LCY+G + P+MA FAGGA + L
Sbjct: 323 LEEAAYKPLRQAVASKI-GLPAVNGSALGLDLCYTGESLAKAK-VPSMALVFAGGAVMEL 380
Query: 385 DAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
+ + FY +S++ + CL + PS D S++G + Q ++ YD+ +L F+ +
Sbjct: 381 ELGNYFYMDSTTGLACLTILPSSAG-----DGSVLGSLIQVGTHMMYDINGSKLVFESL 434
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 158/387 (40%), Gaps = 41/387 (10%)
Query: 89 LHPGIS-TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ------PCEQCGAT----- 136
+HP + + V F +G P + V DTGS L W+ C+ C A
Sbjct: 72 MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 131
Query: 137 -TFDPSKSLTYATLPCDSSYCT---------NDCGGYPDECWYNIRYTNGPDSQGTIGSE 186
F + S ++ T+PC + C +C C Y+ RY++G + G +E
Sbjct: 132 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 191
Query: 187 QFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS-THSLVEKVGSKFSY 245
E + K L++V GCS + S + GV GLG + S EK G KFSY
Sbjct: 192 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 251
Query: 246 CIGNLNYFEYAYNMLILGEGAILEGDSTPMS-------VIDGSYYVTLEGISLGEKMLDI 298
C+ + + N L G E M+ +++ Y V + GIS+G ML I
Sbjct: 252 CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 311
Query: 299 DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE-DLFQGLLPSYPMDPAWHLC 357
++ D G +DSG++LT+L AYQ + + L + + P +
Sbjct: 312 PSEVW---DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF 368
Query: 358 YSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSI 417
S L P + FHFA GA+ +S + V CL G S+
Sbjct: 369 NSTGFEESL--VPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT-----SV 421
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+G I QQN+ +DL K+L F C
Sbjct: 422 VGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 114/410 (27%), Positives = 177/410 (43%), Gaps = 43/410 (10%)
Query: 61 QRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFY-VNFSIGQP-PVPQLAVLDT 118
+R + S AR L S A A + + V Y ++ SIG P P + LDT
Sbjct: 53 RRMVVRSRARAANLCPYSGATARPATAPVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDT 112
Query: 119 GSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSYCT--NDCGGYPDECWYNIRY 173
GS ++W +C+PC +C FD + S T ++ C C ++ G + C Y Y
Sbjct: 113 GSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGY 172
Query: 174 TNGPDSQGTIGSEQFNFETSD-EGKTFLYDVGFGCSHNNAHFSDEQFTGV--FGLGPATS 230
+G S G + F F+ GK + D+GFGC NA + TG+ FG GP +
Sbjct: 173 GDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSL 232
Query: 231 STHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDST------------PMSVI 278
+ V +FSYC FE + + LG L+ +T P
Sbjct: 233 PSQLKVR----QFSYCF--TTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTD 286
Query: 279 DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE 338
+ Y ++ +G+++G+ L + P + K D FIDSGT +T + ++ L+
Sbjct: 287 NSHYVLSFKGVTVGKTRLPV-PEI--KAD--GSGATFIDSGTDITTFPDAVFRQLKSAF- 340
Query: 339 DLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAES-VFYQESSSV 397
+ Q LP +C+S + + P + FH GAD L E+ V S
Sbjct: 341 -IAQAALPVNKTADEDDICFSWD-GKKTAAMPKLVFHLE-GADWDLPRENYVTEDRESGQ 397
Query: 398 FCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
C+AV S G+ D ++IG QQN ++ YDL + +L C+ L
Sbjct: 398 VCVAVSTS---GQ--MDRTLIGNFQQQNTHIVYDLAAGKLLLVPAQCDKL 442
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 109/376 (28%), Positives = 154/376 (40%), Gaps = 57/376 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
F V+ + G PP +LDTGSS+ W +C+ C C FD S TY+ C S
Sbjct: 127 FLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTYSFGSCIPST 186
Query: 156 CTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS 215
N YN+ Y + S G G + E SD + F FGC NN
Sbjct: 187 VGNT---------YNMTYGDKSTSVGNYGCDTMTLEPSDVFQKF----QFGCGRNNEGDF 233
Query: 216 DEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTP 274
G+ GLG ST S K FSYC+ E + L+ GE A + S
Sbjct: 234 GSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPE----ENSIGSLLFGEKATSQSSSLK 289
Query: 275 MSVI-----------DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ + G Y+V L IS+G K L+I ++F + G IDSGT +T
Sbjct: 290 FTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF------ASPGTIIDSGTVIT 343
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-------LCYSGNINRDLQGFPAMAFHF 376
L AY L+ + YP+ CY+ + +D+ P HF
Sbjct: 344 RLPQRAYSALKAAFKKAMA----KYPLSNGRRKENDMLDTCYNLSGRKDVL-LPEXVLHF 398
Query: 377 AGGADLVLDAESVFYQESSSVFCLAVG---PSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
GAD+ L+ + V + +S CLA S +N E L+IIG Q + V YD+
Sbjct: 399 GDGADVRLNGKRVVWGNDASRLCLAFAGNSKSTMNPE----LTIIGNRQQVSLTVLYDIR 454
Query: 434 SKQLYFQRIDCELLAD 449
+++ F C L +
Sbjct: 455 GRRIGFGGNGCSNLKN 470
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 162/371 (43%), Gaps = 48/371 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ +N S+G P + V DTGS LIW +C PC +C A F P+ S T++ LPC SS+
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 156 C------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C C C YN +Y +G + G + +E G V FGCS
Sbjct: 146 CQFLPNSIRTCNA--TGCVYNYKYGSG-YTAGYLATETLKV-----GDASFPSVAFGCST 197
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAIL 268
N +G+ GLG SL+ ++G +FSYC+ + A +L +
Sbjct: 198 ENG--VGNSTSGIAGLG---RGALSLIPQLGVGRFSYCLRS-GSAAGASPILFGSLANLT 251
Query: 269 EGD--STPM----SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+G+ STP +V YYV L GI++GE L + + F G +DSGTTL
Sbjct: 252 DGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTL 311
Query: 323 TWLVPSAYQTLRK----EVEDLFQGLLPSYPMDPAWHLCY-SGNINRDLQGFPAMAFHFA 377
T+L Y+ +++ + D + + LC+ S P++ F
Sbjct: 312 TYLAKDGYEMVKQAFLSQTAD-----VTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFD 366
Query: 378 GGADLVL----DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
GGA+ + Q S +V CL + P+ + + +S+IG + Q + ++ YDL
Sbjct: 367 GGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGD----QPMSVIGNVMQMDMHLLYDLD 422
Query: 434 SKQLYFQRIDC 444
F DC
Sbjct: 423 GGIFSFAPADC 433
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 112/412 (27%), Positives = 168/412 (40%), Gaps = 68/412 (16%)
Query: 61 QRTLNMSMARFIYLSQKSSQ-KAHDTRAHLHPG--ISTVPVFYVNFSIGQPPVPQLAVLD 117
Q +R +++ K +Q + + + H H F V+ + G P +LD
Sbjct: 87 QEIFGRDESRVSFINSKCNQYTSGNLKNHAHNNNLFDEDGNFLVDVAFGTPXTEIXLILD 146
Query: 118 TGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYT 174
TGSS+ W +C+ C C FD S S TY+ C S N+ YN+ Y
Sbjct: 147 TGSSITWTQCKACVNCLQDSNRYFDSSASSTYSFGSCIPSTVENN---------YNMTYG 197
Query: 175 NGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS 234
+ S G G + E SD + F FGC NN G+ GLG ST S
Sbjct: 198 DDSTSVGNYGCDTMTLEPSDVFQKFQ----FGCGRNNKGDFGSGVDGMLGLGQGQLSTVS 253
Query: 235 -LVEKVGSKFSYC------IGNLNYFEYA--------YNMLILGEGAILEGDSTPMSVID 279
K FSYC IG+L + E A + L+ G G + E
Sbjct: 254 QTASKFNKVFSYCLPEEDSIGSLLFGEKATSQSSSLKFTSLVNGPGTLQE---------S 304
Query: 280 GSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVED 339
G Y+V L IS+G + L+I ++F + G IDS T +T L AY L+ +
Sbjct: 305 GYYFVNLSDISVGNERLNIPSSVF------ASPGTIIDSRTVITRLPQRAYSALKAAFKK 358
Query: 340 LFQGLLPSYPMDPAWH-------LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ 392
YP+ CY+ + +D+ P + HF GGAD+ L+ ++ +
Sbjct: 359 AMA----KYPLSNGRRKKGDILDTCYNLSGRKDVL-LPEIVLHFGGGADVRLNGTNIVWG 413
Query: 393 ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+S CLA + +L+IIG Q + V YD+ +++ F C
Sbjct: 414 SDASRLCLAFAGT-------SELTIIGNRQQLSLTVLYDIQGRRIGFGGNGC 458
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 127/479 (26%), Positives = 192/479 (40%), Gaps = 77/479 (16%)
Query: 1 MPSSHAILLLSLITLPFTSTRIFTSTTAAPAAGKPKR-----LVTKLLHRDSLLY--NPN 53
+P SHA + ++ + +ST ++P P+R V +L HR +
Sbjct: 24 LPCSHAAAVAPGYVAVSAASFVPSSTCSSPDPVPPQRRNGTSAVLRLTHRHGPCAPSRAS 83
Query: 54 DTVDAQAQRTLNMSMARFIYLSQKSSQKAH---DTRAHLHPGISTVPV----------FY 100
TL R Y+ ++ S +A D++A +TVP +
Sbjct: 84 SLAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAA--AATVPASWGYDIGTLNYV 141
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------TFDPSKSLTYATLPCDSS 154
V S+G P V Q +DTGS L WV+C+PC + FDP++S +YA +PC
Sbjct: 142 VTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGP 201
Query: 155 YCTNDCGGY------PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
C G Y +C Y + Y +G ++ G S+ S + F FGC
Sbjct: 202 VCAG-LGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFF----FGCG 256
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGE 264
H + F GV GL SLVE+ G FSYC+ + + G
Sbjct: 257 HAQSGL----FNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGLGGP 312
Query: 265 GAILEGDST----PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
G ST P Y V L GIS+G + L + + F G +D+GT
Sbjct: 313 SGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA-------GGTVVDTGT 365
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGL-LPSYPMDPAWHLCYS----GNINRDLQGFPAMAFH 375
+T L P+AY LR P+ P + CY+ G + P +A
Sbjct: 366 VITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVT-----LPNVALT 420
Query: 376 FAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F GA ++L A+ + S CLA PS +G ++I+G + Q+++ V D S
Sbjct: 421 FGSGATVMLGADGIL-----SFGCLAFAPSGSDG----GMAILGNVQQRSFEVRIDGTS 470
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 116/383 (30%), Positives = 161/383 (42%), Gaps = 64/383 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSY 155
+ ++ ++G PP P A+LDTGS LIW +C C C F P S +Y + C
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQL 157
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C + C PD C Y Y +G + G +E+F F +S G+T +GFGC N
Sbjct: 158 CGDILHHSC-VRPDTCTYRYSYGDGTTTLGYYATERFTFASS-SGETQSVPLGFGCGTMN 215
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGA---- 266
S +G+ G G SLV ++ +FSYC+ Y + L G A
Sbjct: 216 VG-SLNNASGIVGFG---RDPLSLVSQLSIRRFSYCL--TPYASSRKSTLQFGSLADVGL 269
Query: 267 ------------ILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
IL+ P YYV G+++G + L I + F S GV
Sbjct: 270 YDDATGPVQTTPILQSAQNPT-----FYYVAFTGVTVGARRLRIPASAFALRPDGS-GGV 323
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM----DPAWHLCY--------SGNI 362
IDSGT LT L P A + EV F+ L P P +C+ G +
Sbjct: 324 IIDSGTALT-LFPVA---VLAEVVRAFRSQL-RLPFANGSSPDDGVCFAAPAVAAGGGRM 378
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMI 421
R + P M FHF GADL L E+ V C+ +G S +G + IG
Sbjct: 379 ARQV-AVPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLLGDSGDDG------ATIGNF 430
Query: 422 AQQNYNVAYDLVSKQLYFQRIDC 444
QQ+ V YDL + L F ++C
Sbjct: 431 VQQDMRVVYDLERETLSFAPVEC 453
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 171/398 (42%), Gaps = 37/398 (9%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR ++LS K++ T A + G T P + V +G P L LDT + W C
Sbjct: 50 ARLLFLSSKAASSGGITSAPVASG-QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCA 108
Query: 129 PCEQCGA-TTFDPSKSLTYATLPCDSSYC----------TNDCGGYPDECWYNIRYTNGP 177
PC+ C A + F P+ S +YA+LPC S +C D C ++ + +
Sbjct: 109 PCDTCPAGSRFIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADT- 167
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA-HFSDEQFTGVFGLGPATSSTHSLV 236
Q ++GS+ GK + FGC A ++ G+ GLG SL+
Sbjct: 168 SFQASLGSDTLRL-----GKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLG---RGPMSLL 219
Query: 237 EKVGSK----FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV---IDGSYYVTLEGI 289
+ GS+ FSYC+ + + ++ ++ + G TP+ YYV + G+
Sbjct: 220 SQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGL 279
Query: 290 SLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-Y 348
S+G + + F D + AG IDSGT +T Y LR+E Q PS Y
Sbjct: 280 SVGRTWVKVPAGSFAF-DPATGAGTVIDSGTVITRWTAPVYAALREEFRR--QVAAPSGY 336
Query: 349 PMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS-VFCLAVGPSDI 407
A+ C++ + G P + H GG DL L E+ S++ + CLA+ ++
Sbjct: 337 TSLGAFDTCFNTD-EVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM--AEA 393
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
++++ + QQN V D+ ++ F R C
Sbjct: 394 PQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 431
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 102/351 (29%), Positives = 155/351 (44%), Gaps = 42/351 (11%)
Query: 116 LDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIR 172
DT + ++C+PC GA F+PS+S ++A +PC S C +C G C + I+
Sbjct: 105 FDTNFGVSVLRCKPCVG-GAPCDPAFEPSRSSSFAAIPCGSPECAVECTGA--SCPFTIQ 161
Query: 173 YTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSST 232
+ N + GT+ + S F FGC A + F G GL + S+
Sbjct: 162 FGNVTVANGTLVRDTLTLPPSATFAGFT----FGCIEVGAD--ADTFDGAVGLIDLSRSS 215
Query: 233 HSLVEKV--------GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS---TPMSVI--- 278
HSL +V + FSYC+ + + + I G PMS
Sbjct: 216 HSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNH 275
Query: 279 DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE 338
SY+V L GIS+G + L + P +F + G +++ T T+L P+AY LR
Sbjct: 276 PNSYFVDLVGISVGGEDLPVPPAVFAAH------GTLLEAATEFTFLAPAAYAALR---- 325
Query: 339 DLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY-QES 394
D F+ + YP P + + CY+ L PA+A FAGG +L LD + Y +
Sbjct: 326 DAFRKDMAPYPAAPPFRVLDTCYNLTGLASLA-VPAVALRFAGGTELELDVRQMMYFADP 384
Query: 395 SSVF-CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
SSVF +A +S+IG +AQ++ V YDL ++ F C
Sbjct: 385 SSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 171/398 (42%), Gaps = 37/398 (9%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR ++LS K++ T A + G T P + V +G P L LDT + W C
Sbjct: 50 ARLLFLSSKAASSGGVTSAPVASG-QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCA 108
Query: 129 PCEQCGA-TTFDPSKSLTYATLPCDSSYC----------TNDCGGYPDECWYNIRYTNGP 177
PC+ C A + F P+ S +YA+LPC S +C D C ++ + +
Sbjct: 109 PCDTCPAGSRFIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADT- 167
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA-HFSDEQFTGVFGLGPATSSTHSLV 236
Q ++GS+ GK + FGC A ++ G+ GLG SL+
Sbjct: 168 SFQASLGSDTLRL-----GKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLG---RGPMSLL 219
Query: 237 EKVGSK----FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV---IDGSYYVTLEGI 289
+ GS+ FSYC+ + + ++ ++ + G TP+ YYV + G+
Sbjct: 220 SQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGL 279
Query: 290 SLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-Y 348
S+G + + F D + AG IDSGT +T Y LR+E Q PS Y
Sbjct: 280 SVGRTWVKVPAGSFAF-DPATGAGTVIDSGTVITRWTAPVYAALREEFRR--QVAAPSGY 336
Query: 349 PMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS-VFCLAVGPSDI 407
A+ C++ + G P + H GG DL L E+ S++ + CLA+ ++
Sbjct: 337 TSLGAFDTCFNTD-EVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM--AEA 393
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
++++ + QQN V D+ ++ F R C
Sbjct: 394 PQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 431
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 154/347 (44%), Gaps = 42/347 (12%)
Query: 115 VLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNI 171
DT + ++C+PC GA F+PS+S ++A +PC S C +C G C + I
Sbjct: 192 AFDTNFGVSVLRCKPCVG-GAPCDPAFEPSRSSSFAAIPCGSPECAVECTGA--SCPFTI 248
Query: 172 RYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS 231
++ N + GT+ + S F FGC A + F G GL + S
Sbjct: 249 QFGNVTVANGTLVRDTLTLPPSATFAGFT----FGCIEVGAD--ADTFDGAVGLIDLSRS 302
Query: 232 THSLVEKV--------GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS---TPMSVI-- 278
+HSL +V + FSYC+ + + + I G PMS
Sbjct: 303 SHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPN 362
Query: 279 -DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEV 337
SY+V L GIS+G + L + P +F + G +++ T T+L P+AY LR
Sbjct: 363 HPNSYFVDLVGISVGGEDLPVPPAVFAAH------GTLLEAATEFTFLAPAAYAALR--- 413
Query: 338 EDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY-QE 393
D F+ + YP P + + CY+ L PA+A FAGG +L LD + Y +
Sbjct: 414 -DAFRKDMAPYPAAPPFRVLDTCYNLTGLASLA-VPAVALRFAGGTELELDVRQMMYFAD 471
Query: 394 SSSVF-CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
SSVF +A +S+IG +AQ++ V YDL ++ F
Sbjct: 472 PSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGF 518
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 105/402 (26%), Positives = 173/402 (43%), Gaps = 50/402 (12%)
Query: 81 KAHDTRAH---------LHPGISTVP----VFYVNFSIGQPPVPQLAVLDTGSSLIWVKC 127
+AHD R H L G + +P +++ IG P +DTGS ++WV C
Sbjct: 50 RAHDARRHGRSLAAAVDLPLGGNGLPTETGLYFTQIGIGTPAKSYYVQVDTGSDILWVNC 109
Query: 128 QPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGY------PDECWYNIRY 173
C+ C T +DPS S + + C +C GG C Y+I Y
Sbjct: 110 VFCDTCPRKSGLGIELTLYDPSGSSSGTGVTCGQDFCVATHGGVIPSCVPAAPCQYSISY 169
Query: 174 TNGPDSQGTIGSE--QFNFETSDEGKTFLYD--VGFGCSHN---NAHFSDEQFTGVFGLG 226
+G + G ++ Q+N + S +T L + + FGC + S + G+ G G
Sbjct: 170 GDGSSTTGFFVTDFLQYN-QVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFG 228
Query: 227 PATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYY 283
+ SS S + KV F++C+ +N + +G+ + +TP+ Y
Sbjct: 229 QSNSSMLSQLAAAGKVRKVFAHCLDTIN----GGGIFAIGDVVQPKVSTTPLVPGMPHYN 284
Query: 284 VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG 343
V LE I +G L + N+F D G IDSGTTL +L Y + +V + G
Sbjct: 285 VNLEAIDVGGVKLQLPTNIF---DIGESKGTIIDSGTTLAYLPGVVYNAIMSKVFAQY-G 340
Query: 344 LLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVG 403
+P YSG+++ GFP + FHF GG L + +Q + ++C+
Sbjct: 341 DMPLKNDQDFQCFRYSGSVD---DGFPIITFHFEGGLPLNIHPHDYLFQ-NGELYCMGFQ 396
Query: 404 PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ + KD+ ++G +A N V YDL ++ + + +C
Sbjct: 397 TGGLQTKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTDYNCS 438
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 111/416 (26%), Positives = 179/416 (43%), Gaps = 67/416 (16%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPV--------FYVNFSIGQPPVPQLAVLDTGS 120
AR ++LS K++ G+S+ PV + V +G P L LDT +
Sbjct: 53 ARLLFLSSKAATA----------GVSSAPVASGQAPPSYVVRAGLGSPSQQLLLALDTSA 102
Query: 121 SLIWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSYCT-------------NDCGGYP-- 164
W C PC C +++ F P+ S +YA+LPC SS+C D P
Sbjct: 103 DATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPAT 162
Query: 165 -DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGV 222
C ++ + + Q + S+ GK + + FGC S ++ G+
Sbjct: 163 LPTCAFSKPFADA-SFQAALASDTLRL-----GKDAIPNYTFGCVSSVTGPTTNMPRQGL 216
Query: 223 FGLGPATSSTHSLVEKVGS----KFSYCIGNLNYFEYAYNM-LILGEGAILEGDSTPM-- 275
GLG +L+ + GS FSYC+ + + ++ ++ L G G TPM
Sbjct: 217 LGLG---RGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGGQPRSVRYTPMLR 273
Query: 276 -SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT-WLVPSAYQTL 333
YYV + G+S+G + + F D + AG +DSGT +T W P Y L
Sbjct: 274 NPHRSSLYYVNVTGLSVGRAWVKVPAGSFAF-DAATGAGTVVDSGTVITRWTAP-VYAAL 331
Query: 334 RKEVEDLFQGLLPS-YPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ 392
R+E Q PS Y A+ C++ + G PA+ H GG DL L E+
Sbjct: 332 REEFRR--QVAAPSGYTSLGAFDTCFNTD-EVAAGGAPAVTVHMDGGVDLALPMENTLIH 388
Query: 393 ESSS-VFCLAV--GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
S++ + CLA+ P ++N +++I + QQN V +D+ + ++ F + C
Sbjct: 389 SSATPLACLAMAEAPQNVN----SVVNVIANLQQQNIRVVFDVANSRIGFAKESCN 440
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/392 (26%), Positives = 167/392 (42%), Gaps = 44/392 (11%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR +LS ++K+ A I P + V IG P L +DT + W+ C
Sbjct: 67 ARLQFLSSLVARKSVVPIASGR-QIVQSPTYIVRAKIGTPAQTMLLAMDTSNDAAWIPCS 125
Query: 129 PCEQCGATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIG 184
C C +T F+ KS T+ T+ C++ C + CGG C +N+ Y G
Sbjct: 126 GCVGCSSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGG--SACAFNMTY----------G 173
Query: 185 SEQFNFETSDEGKTFLYD----VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV- 239
S S + T D FGC A S G+ GLG S S + +
Sbjct: 174 SSSIAANLSQDVVTLATDSIPSYTFGC-LTEATGSSIPPQGLLGLGRGPMSLLSQTQNLY 232
Query: 240 GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKML 296
S FSYC+ + ++ ++ + G +TP+ YYV L I +G +++
Sbjct: 233 QSTFSYCLPSFRSLNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVV 292
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP--AW 354
DI P+ N T + AG DSGT T LV AY +R D F+ + + + +
Sbjct: 293 DIPPSALAFNPT-TGAGTIFDSGTVFTRLVAPAYTAVR----DAFRKRVGNATVTSLGGF 347
Query: 355 HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV--GPSDINGERF 412
CY+ I P + F F+G + + + +SS+ CLA+ P ++N
Sbjct: 348 DTCYTSPIVA-----PTITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSV-- 400
Query: 413 KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
L++I + QQN+ + +D+ + +L R C
Sbjct: 401 --LNVIANMQQQNHRILFDVPNSRLGVAREPC 430
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 163/386 (42%), Gaps = 63/386 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++++ IG PP +LDTGS L W++C PC C +DP +S ++ + C
Sbjct: 90 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDPR 149
Query: 156 C----TND----CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKT---FLYDV 203
C + D C C Y Y + ++ G +E F TS GK+ + +V
Sbjct: 150 CHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVENV 209
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC H N + S + L G FSYC+ + N + LI G
Sbjct: 210 MFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFG 269
Query: 264 EG------------AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW-- 309
E ++ G P +D YYV ++ I +G ++L+I TW
Sbjct: 270 EDKDLLNHPELNFTTLVGGKENP---VDTFYYVQIKSIMVGGEVLNI------PESTWNM 320
Query: 310 -SD--AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP-------MDPAWHLCYS 359
SD G +DSGTTL++ AYQ ++ D F + YP +DP +++ S
Sbjct: 321 TSDGVGGTIVDSGTTLSYFTEPAYQIIK----DAFVKKVKGYPIVQDFPILDPCYNV--S 374
Query: 360 GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSII 418
G DL P FA GA E+ F + + V CLA I G LSII
Sbjct: 375 GVEKIDL---PDFGILFADGAVWNFPVENYFIRLDPEEVVCLA-----ILGTPRSALSII 426
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDC 444
G QQN++V YD +L + ++C
Sbjct: 427 GNYQQQNFHVLYDTKKSRLGYAPMNC 452
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 173/405 (42%), Gaps = 59/405 (14%)
Query: 81 KAHDTRAHL--------------HPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVK 126
+AHDTR H HP S +++ IG P +DTGS ++WV
Sbjct: 125 RAHDTRRHGRILSAVDLPLGGNGHP--SEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVN 182
Query: 127 CQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGYPD-----ECWYNIRY 173
C C++C T +D S T + CD ++C+ G P +C Y++ Y
Sbjct: 183 CAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY 242
Query: 174 TNGPDSQGTIGSE--QF-----NFETSDEGKTFLYDVGFGCSHNNA---HFSDEQFTGVF 223
+G + G + Q+ NF+T+ T V FGC + + S E G+
Sbjct: 243 GDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT----VVFGCGNKQSGELGSSSEALDGIL 298
Query: 224 GLGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDG 280
G G A SS S + KV FS+C+ N++ + +GE + + TP+
Sbjct: 299 GFGQANSSMLSQLASSGKVKKVFSHCLDNVD----GGGIFAIGEVVEPKVNITPLVQNQA 354
Query: 281 SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDL 340
Y V ++ I +G LD+ + F+ D G IDSGTTL + Y L +++
Sbjct: 355 HYNVVMKEIEVGGDPLDVPSDAFESGDR---KGTIIDSGTTLAYFPQEVYVPLIEKILSQ 411
Query: 341 FQGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFC 399
Q L + ++ A+ Y+GN++ GFP + HF L + +Q +C
Sbjct: 412 -QPDLRLHTVEQAFTCFDYTGNVD---DGFPTVTLHFDKSISLTVYPHEYLFQHEFE-WC 466
Query: 400 LAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ S + KDL+++G + N V YDL + + + +C
Sbjct: 467 IGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 511
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 158/387 (40%), Gaps = 41/387 (10%)
Query: 89 LHPGIS-TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ------PCEQCGAT----- 136
+HP + + V F +G P + V DTGS L W+ C+ C A
Sbjct: 1 MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 60
Query: 137 -TFDPSKSLTYATLPCDSSYCT---------NDCGGYPDECWYNIRYTNGPDSQGTIGSE 186
F + S ++ T+PC + C +C C Y+ RY++G + G +E
Sbjct: 61 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 120
Query: 187 QFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS-THSLVEKVGSKFSY 245
E + K L++V GCS + S + GV GLG + S EK G KFSY
Sbjct: 121 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 180
Query: 246 CIGNLNYFEYAYNMLILGEGAILEGDSTPMS-------VIDGSYYVTLEGISLGEKMLDI 298
C+ + + N L G E M+ +++ Y V + GIS+G ML I
Sbjct: 181 CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 240
Query: 299 DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE-DLFQGLLPSYPMDPAWHLC 357
++ D G +DSG++LT+L AYQ + + L + + P +
Sbjct: 241 PSEVW---DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF 297
Query: 358 YSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSI 417
S L P + FHFA GA+ +S + V CL G S+
Sbjct: 298 NSTGFEESL--VPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT-----SV 350
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+G I QQN+ +DL K+L F C
Sbjct: 351 VGNIMQQNHLWEFDLGLKKLGFAPSSC 377
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 167/385 (43%), Gaps = 47/385 (12%)
Query: 85 TRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPS 141
R LH + T + IG PP ++D+GS++ +V C CEQCG F P
Sbjct: 74 ARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPD 133
Query: 142 KSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
S TY+ + C+ CT C ++C Y +Y S G +G + +F T E K
Sbjct: 134 LSSTYSPVKCNVD-CT--CDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKP--Q 188
Query: 202 DVGFGCSHNN-AHFSDEQFTGVFGLGPAT-SSTHSLVEK--VGSKFSYCIGNLNYFEYAY 257
FGC ++ + G+ GLG S LV+K +G FS C G ++
Sbjct: 189 RAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMD------ 242
Query: 258 NMLILGEGAILEGDS--------TPMSVIDGSYY-VTLEGISLGEKMLDIDPNLFKKNDT 308
+G GA++ G T + + YY + L+ + + K L +DP +F
Sbjct: 243 ----IGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKH- 297
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSG---NINR 364
G +DSGTT +L A+ + V L DP + +C++G N+++
Sbjct: 298 ----GTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQ 353
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIA 422
+ FP + F G L L E+ ++ S +CL V NG+ +++G I
Sbjct: 354 LSEVFPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGV---FQNGK--DPTTLLGGIV 408
Query: 423 QQNYNVAYDLVSKQLYFQRIDCELL 447
+N V YD ++++ F + +C L
Sbjct: 409 VRNTLVTYDRHNEKIGFWKTNCSEL 433
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/386 (25%), Positives = 175/386 (45%), Gaps = 50/386 (12%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G P +DTGS ++W+ C C C ++ FD + S T A
Sbjct: 80 VGLYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAA 139
Query: 148 TLPCDSSYC-------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
+ C C T+ C ++C Y +Y +G + G S+ F+T G++ +
Sbjct: 140 LVSCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMV 199
Query: 201 YD----VGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNL 250
+ + FGCS + +D+ G+FG GP S S + G FS+C L
Sbjct: 200 ANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHC---L 256
Query: 251 NYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
E +L+LGE ILE +P+ Y + L+ I++ ++L ID N+F T
Sbjct: 257 KGGENGGGVLVLGE--ILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFA---T 311
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY-SGNINRDLQ 367
++ G +DSGTTL +LV AY + S P+ + CY N D+
Sbjct: 312 TNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQF--SKPIISKGNQCYLVSNSVGDI- 368
Query: 368 GFPAMAFHFAGGADLVLDAESVF----YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
FP ++ +F GGA +VL+ E + +S++++C+ + + +I+G +
Sbjct: 369 -FPQVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVE------RGFTILGDLVL 421
Query: 424 QNYNVAYDLVSKQLYFQRIDCELLAD 449
++ YDL ++++ + +C L +
Sbjct: 422 KDKIFVYDLANQRIGWADYNCSLAVN 447
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/382 (28%), Positives = 156/382 (40%), Gaps = 58/382 (15%)
Query: 89 LHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSK 142
L PG S V + +G P V+DTGSSL W++C PC Q G FDP
Sbjct: 123 LSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGP-LFDPRA 181
Query: 143 SLTYATLPCDSSYC---------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETS 193
S TY ++ C +S C + C + C Y Y + S G + ++ +F
Sbjct: 182 SSTYTSVRCSASQCDELQAATLNPSACSAS-NVCIYQASYGDSSFSVGYLSTDTVSF--- 237
Query: 194 DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCIGNLNY 252
G T +GC +N G+ GL S + L +G FSYC+
Sbjct: 238 --GSTSYPSFYYGCGQDNEGLFGRS-AGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAAS 294
Query: 253 FEYAYNMLILGEGAILEG---DSTPM--SVIDGS-YYVTLEGISLGEKMLDIDPNLFKKN 306
Y L G G TPM S +D S Y++TL G+S+G L + P+ +
Sbjct: 295 TGY------LSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSL 348
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNIN 363
T IDSGT +T L + + L K V G PA+ + C+ G +
Sbjct: 349 PT------IIDSGTVITRLPTAVHTALSKAVAQAMAGAQ----RAPAFSILDTCFEGQAS 398
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
+ P + FAGGA + L +V S CLA P+D +IIG Q
Sbjct: 399 Q--LRVPTVVMAFAGGASMKLTTRNVLIDVDDSTTCLAFAPTD-------STAIIGNTQQ 449
Query: 424 QNYNVAYDLVSKQLYFQRIDCE 445
Q ++V YD+ ++ F C
Sbjct: 450 QTFSVIYDVAQSRIGFSAGGCS 471
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 148/367 (40%), Gaps = 47/367 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSS 154
+ V +G PP V DTGS WV+C+PC + FDP+KS TYA + C
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADP 222
Query: 155 YCT--NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
C + G C Y I+Y +G + G + D K F FGC N
Sbjct: 223 ACADLDASGCNAGHCLYGIQYGDGSYTVGFFAKDTLAVA-QDAIKGFK----FGCGEKNR 277
Query: 213 HFSDEQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGEGAI 267
Q G+ GLG TS T EK G FSYC+ Y E+ +
Sbjct: 278 GLFG-QTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPASSAATGYLEFGPLSPSS---SG 333
Query: 268 LEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+TPM G YYV L GI +G K L P +S++G +DSGT +T L
Sbjct: 334 SNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIP-----ESVFSNSGTLVDSGTVITRL 388
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQG-----FPAMAFHFA 377
+AY L Y A+ + CY D G P ++ F
Sbjct: 389 PDTAYAALSSAFAAAMA--ASGYKKAAAYSILDTCY------DFTGLSQVSLPTVSLVFQ 440
Query: 378 GGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GGA L LDA + Y S S CL NG+ + + I+G Q+ Y V YD+ K +
Sbjct: 441 GGACLDLDASGIVYAISQSQVCLGFAS---NGDD-ESVGIVGNTQQRTYGVLYDVSKKVV 496
Query: 438 YFQRIDC 444
F C
Sbjct: 497 GFAPGAC 503
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 121/459 (26%), Positives = 191/459 (41%), Gaps = 94/459 (20%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPG-----------ISTVPV------ 98
VDA +LN++ + +++ Q++ D A + P ++ PV
Sbjct: 31 VDASDTESLNLTDHELL---RRAIQRSRDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGE 87
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSY 155
+ V +G P A +DT S LIW +CQPC +C F+P S +YA +PC+S
Sbjct: 88 YLVKLGLGTPQHCFTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDT 147
Query: 156 C----TNDC---GGYPDE--CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
C T+ C G DE C Y Y ++G + ++ G V FG
Sbjct: 148 CDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAI-----GDDVFRGVVFG 202
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEG 265
CS ++ Q +GV GLG SLV ++ +F YC+ + L+LG
Sbjct: 203 CSSSSVGGPPPQVSGVVGLG---RGALSLVSQLSVRRFMYCLP--PPVSRSAGRLVLGAD 257
Query: 266 AIL------EGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDA-- 312
A E PMS GS YY+ L+GIS+G++ + T A
Sbjct: 258 AAATVRNASERVVVPMST--GSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAG 315
Query: 313 ----------------------GVFIDSGTTLTWLVPSAYQTLRKEVED---LFQGLLPS 347
G+ ID +T+T+L S Y+ + ++E+ L +G
Sbjct: 316 APASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDLEEEIRLPRGSGSD 375
Query: 348 YPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS-VFCLAVGPSD 406
+D + L ++R ++AF G L LD E +F ++ +S + CL VG +D
Sbjct: 376 LGLDLCFILPEGVPMSRVYAPPVSLAFE---GVWLRLDKEQMFVEDRASGMMCLMVGKTD 432
Query: 407 INGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+SI+G QQN V Y+L ++ F + CE
Sbjct: 433 -------GVSILGNYQQQNMQVMYNLRRGRITFIKTACE 464
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 111/416 (26%), Positives = 179/416 (43%), Gaps = 67/416 (16%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPV--------FYVNFSIGQPPVPQLAVLDTGS 120
AR ++LS K++ G+S+ PV + V +G P L LDT +
Sbjct: 51 ARLLFLSSKAATA----------GVSSAPVASGQAPPSYVVRAGLGSPSQQLLLALDTSA 100
Query: 121 SLIWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSYCT-------------NDCGGYP-- 164
W C PC C +++ F P+ S +YA+LPC SS+C D P
Sbjct: 101 DATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPAT 160
Query: 165 -DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGV 222
C ++ + + Q + S+ GK + + FGC S ++ G+
Sbjct: 161 LPTCAFSKPFADA-SFQAALASDTLRL-----GKDAIPNYTFGCVSSVTGPTTNMPRQGL 214
Query: 223 FGLGPATSSTHSLVEKVGS----KFSYCIGNLNYFEYAYNM-LILGEGAILEGDSTPM-- 275
GLG +L+ + GS FSYC+ + + ++ ++ L G G TPM
Sbjct: 215 LGLG---RGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGGQPRSVRYTPMLR 271
Query: 276 -SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT-WLVPSAYQTL 333
YYV + G+S+G + + F D + AG +DSGT +T W P Y L
Sbjct: 272 NPHRSSLYYVNVTGLSVGHAWVKVPAGSFAF-DAATGAGTVVDSGTVITRWTAP-VYAAL 329
Query: 334 RKEVEDLFQGLLPS-YPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ 392
R+E Q PS Y A+ C++ + G PA+ H GG DL L E+
Sbjct: 330 REEFRR--QVAAPSGYTSLGAFDTCFNTD-EVAAGGAPAVTVHMDGGVDLALPMENTLIH 386
Query: 393 ESSS-VFCLAV--GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
S++ + CLA+ P ++N +++I + QQN V +D+ + ++ F + C
Sbjct: 387 SSATPLACLAMAEAPQNVN----SVVNVIANLQQQNIRVVFDVANSRVGFAKESCN 438
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 154/366 (42%), Gaps = 46/366 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC----GATTFDPSKSLTYATLPCDSS 154
+ V +G P + DTGS L W +C+PC + DP+KS +Y + C S+
Sbjct: 133 YAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSA 192
Query: 155 YCT--NDCGG---YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
+C + GG C Y ++Y +G S G +E +S+ K FL FGC
Sbjct: 193 FCKLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFL----FGCGQ 248
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCI----GNLNYFEYAYNMLILGE 264
N+ G+ GLG S S +K FSYC+ + Y + + +
Sbjct: 249 QNSGLF-RGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSSKGYLSFGGQVSKTVK 307
Query: 265 GAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
L D STP Y + + +S+G L ID ++F S +G IDSGT +
Sbjct: 308 FTPLSEDFKSTPF------YGLDITELSVGGNKLSIDASIF------STSGTVIDSGTVI 355
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGG 379
T L +AY L FQ L+ YP + + CY + N ++ P + F GG
Sbjct: 356 TRLPSTAYSAL----SSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIK-IPKVGVSFKGG 410
Query: 380 ADLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
++ +D + Y + CLA NG+ K +I G Q+ Y V YD ++
Sbjct: 411 VEMDIDVSGILYPVNGLKKVCLAFAG---NGDDVK-AAIFGNTQQKTYQVVYDDAKGRVG 466
Query: 439 FQRIDC 444
F C
Sbjct: 467 FAPSGC 472
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 170/398 (42%), Gaps = 37/398 (9%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR ++LS K++ T A + G T P + V +G P L LDT + W C
Sbjct: 50 ARLLFLSSKAASSGGVTSAPVASG-QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCA 108
Query: 129 PCEQCGA-TTFDPSKSLTYATLPCDSSYC----------TNDCGGYPDECWYNIRYTNGP 177
PC+ C A + F P+ S +YA+LPC S +C D C ++ + +
Sbjct: 109 PCDTCPAGSRFIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADT- 167
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA-HFSDEQFTGVFGLGPATSSTHSLV 236
Q ++GS+ GK + FGC A ++ G+ GLG SL+
Sbjct: 168 SFQASLGSDTLRL-----GKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLG---RGPMSLL 219
Query: 237 EKVGSK----FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV---IDGSYYVTLEGI 289
+ GS FSYC+ + + ++ ++ + G TP+ YYV + G+
Sbjct: 220 SQTGSTYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGL 279
Query: 290 SLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-Y 348
S+G + + F D + AG IDSGT +T Y LR+E Q PS Y
Sbjct: 280 SVGRTWVKVPAGSFAF-DPATGAGTVIDSGTVITRWTAPVYAALREEFRR--QVAAPSGY 336
Query: 349 PMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS-VFCLAVGPSDI 407
A+ C++ + G P + H GG DL L E+ S++ + CLA+ ++
Sbjct: 337 TSLGAFDTCFNTD-EVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM--AEA 393
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
++++ + QQN V D+ ++ F R C
Sbjct: 394 PQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 431
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 154/351 (43%), Gaps = 42/351 (11%)
Query: 116 LDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIR 172
DT + ++C+PC GA F+PS+S ++A +PC S C +C G C + I+
Sbjct: 105 FDTNFGVSVLRCKPCVG-GAPCDPAFEPSRSSSFAAIPCGSPECAVECTG--ASCPFTIQ 161
Query: 173 YTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSST 232
+ N + GT+ + S F FGC A + F G GL + S+
Sbjct: 162 FGNVTVANGTLVRDTLTLPPSATFAGFT----FGCIEVGAD--ADTFDGAVGLIDLSRSS 215
Query: 233 HSLVEKV--------GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS---TPMSVI--- 278
HSL +V + FSYC+ + + + I G PMS
Sbjct: 216 HSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNH 275
Query: 279 DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE 338
SY+V L GIS+G + L + P +F + G +++ T T+L P+AY LR
Sbjct: 276 PNSYFVELVGISVGGEDLPVPPAVFAAH------GTLLEAATEFTFLAPAAYAALR---- 325
Query: 339 DLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY-QES 394
D F+ + YP P + + CY+ L P +A FAGG +L LD + Y +
Sbjct: 326 DAFRRDMAPYPAAPPFRVLDTCYNLTGLASL-AVPTVALRFAGGTELELDVRQMMYFADP 384
Query: 395 SSVF-CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
SSVF +A +S+IG +AQ++ V YDL ++ F C
Sbjct: 385 SSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 103/369 (27%), Positives = 160/369 (43%), Gaps = 55/369 (14%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCG 161
IG PP ++DTGS++ +V C CEQCG F P S TY + C+ S +D G
Sbjct: 83 IGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCNPSCNCDDEG 142
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-NNAHFSDEQFT 220
+C Y RY S G I + +F E K FGC + ++
Sbjct: 143 ---KQCTYERRYAEMSSSSGVIAEDVVSFGNESELKP--QRAVFGCENVETGDLYSQRAD 197
Query: 221 GVFGLGPAT-SSTHSLVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV 277
G+ GLG S LV+K +G FS C G ++ +G GA++ G +P
Sbjct: 198 GIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMD----------VGGGAMVLGQISPPPN 247
Query: 278 IDGS---------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+ S Y + L+ + + K L + P +F + G +DSGTT + +
Sbjct: 248 MVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKH-----GTVLDSGTTYAYFPEA 302
Query: 329 AYQTLR----KEVEDLFQGLLPSYPMDPAWH-LCYSG---NINRDLQGFPAMAFHFAGGA 380
A+ L+ KE+ L Q P DP +H +C+SG ++ + FP + F G
Sbjct: 303 AFHALKDAIMKEIRHLKQIPGP----DPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQ 358
Query: 381 DLVLDAESVFYQES--SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
L L E+ ++ + S +CL + NG L +G I +N V YD + ++
Sbjct: 359 KLSLSPENYLFRHTKVSGAYCLGIFQ---NGNDLTTL--LGGIVVRNTLVTYDRENDKIG 413
Query: 439 FQRIDCELL 447
F + +C L
Sbjct: 414 FWKTNCSEL 422
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 160/375 (42%), Gaps = 35/375 (9%)
Query: 94 STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFDPSKSLT 145
S+ ++Y +G P +DTGS ++WV C C C T +DP+ S T
Sbjct: 67 SSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKT 126
Query: 146 YATLPCDSSYCTNDCGG------YPDECWYNIRYTNGPDSQGTIGSEQFNFE-------T 192
+PC +CT+ G C Y+I Y +G + G+ ++ F+ T
Sbjct: 127 SNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHT 186
Query: 193 SDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIGN 249
+ + ++ G S + + SDE G+ G G A SS S + KV FS+C+ +
Sbjct: 187 KPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDS 246
Query: 250 LNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
+ + +G+ + ++TP+ Y V L+ + + + + + LF D+
Sbjct: 247 ----HHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDVDGEPILLPLYLF---DSG 299
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
S G IDSGTTL +L S Y L +V GL D YS ++ +GF
Sbjct: 300 SGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFTCFHYSDKLD---EGF 356
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P + FHF G L + + ++C+ S + +DL +IG + N V
Sbjct: 357 PVVKFHFE-GLSLTVHPHDYLFLYKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVV 415
Query: 430 YDLVSKQLYFQRIDC 444
YDL + + + +C
Sbjct: 416 YDLENMVIGWTNFNC 430
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 157/366 (42%), Gaps = 59/366 (16%)
Query: 106 GQPPVPQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYCTNDC 160
G V Q ++D+GS + WV+C+PC C FDP+ S TYA +PC S+ C
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQ-L 220
Query: 161 GGYPD------ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
G Y +C + I Y +G + GT + D + F FGC+H +
Sbjct: 221 GPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFR----FGCAHADRGS 276
Query: 215 S-DEQFTGVFGLGPATSSTHSLVEKVGSK----FSYCIGNLNYFEYAYNMLILG---EGA 266
+ D G LG + SLV++ ++ FSYC L + L+LG E A
Sbjct: 277 AFDYDVAGSLALG---GGSQSLVQQTATRYGRVFSYC---LPPTASSLGFLVLGVPPERA 330
Query: 267 ILEGD--STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
L STP+ S+ Y V L I + + L + P +F A IDS T
Sbjct: 331 QLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFS-------ASSVIDSSTI 383
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAG 378
++ L P+AYQ LR F+ + Y P + CY R + P++A F G
Sbjct: 384 ISRLPPTAYQALRAA----FRSAMTMYRAAPPVSILDTCYDFTGVRSIT-LPSIALVFDG 438
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
GA + LDA + CLA P+ +R IG + Q+ V YD+ +K +
Sbjct: 439 GATVNLDAAGILLGS-----CLAFAPT--ASDRMPGF--IGNVQQKTLEVVYDVPAKAMR 489
Query: 439 FQRIDC 444
F+ C
Sbjct: 490 FRTAAC 495
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 128/474 (27%), Positives = 199/474 (41%), Gaps = 63/474 (13%)
Query: 9 LLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSL---LYNPNDTVDAQAQRTLN 65
+S +L + +F S+ A A K +L+H DS +N ++T + + L
Sbjct: 10 FISFTSLIIILSTVFLSSFAIIQADK-FSFTAELIHIDSPNSPFFNASETTTHRLAKALQ 68
Query: 66 MSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S R L+ S+ A + G + + IG PP A +DTGS++IW+
Sbjct: 69 RSANRVARLNPLSNSD-EGVHASIFSGDGN---YLMKLLIGTPPTEIHAAIDTGSNVIWI 124
Query: 126 KCQPCEQC---GATTFDPSKSLTYATLPCDSSYC--TNDCGGYPDECWYNI---RYTNGP 177
C C+ C ++ F+P S TY PCDS C T+ + C Y+ N P
Sbjct: 125 PCINCKDCFNQSSSIFNPLASSTYQDAPCDSYQCETTSSSCQSDNVCLYSCDEKHQLNCP 184
Query: 178 DSQGTIGSEQFNFETSDEGKTF-LYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSL 235
+ G I + +SD G+ F L F C N+ + GV GLG A S T L
Sbjct: 185 N--GRIAVDTMTLTSSD-GRPFPLPYSDFVC--GNSIYKTFAGVGVIGLGRGALSLTSKL 239
Query: 236 VEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVI---------DGSYYVTL 286
KFSYC+ +Y+ + + G + + D + V+ G+YYVTL
Sbjct: 240 YHLSDGKFSYCLA--DYYSKQPSKINFGLQSFISDDD--LEVVSTTLGHHRHSGNYYVTL 295
Query: 287 EGISLGEKMLDIDPNLFKKNDTWSD--AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL 344
EGIS+GEK D L+ +D ++ + IDSGT T L Y L V
Sbjct: 296 EGISVGEKRQD----LYYVDDPFAPPVGNMLIDSGTMFTLLPKDFYDYLWSTVS----YA 347
Query: 345 LPSYPMDPAWHLCYSGNINRDLQ-----------GFPAMAFHFAGGADLVLDAESVFYQE 393
+P P + + + +++ L+ FP + HF AD+ L ++ F +
Sbjct: 348 IPENPQNHPHNSRFPFSMDNTLKLSPCFWYYPELKFPKITIHFT-DADVELSDDNSFIRV 406
Query: 394 SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+ V C A + ++ G Q N+ + YDL + F+R DC L
Sbjct: 407 AEDVVCFAFAATQPGQS-----TVYGSWQQMNFILGYDLKRGTVSFKRTDCSKL 455
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 116/382 (30%), Positives = 168/382 (43%), Gaps = 58/382 (15%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQ----PCEQCGATT--FDPSKSLTYATLPCDSS 154
V IG PP Q VLDTGS L W++C P ++ TT FDPS S ++ LPC+
Sbjct: 84 VTLPIGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHP 143
Query: 155 YCT---------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
C DC C Y+ Y +G ++G + E+ F S +
Sbjct: 144 LCKPRVPDFSLPTDCDAN-SLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPPII----L 198
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI--------------GN-- 249
GC A SD+ G+ G+ S + +KFSYC+ GN
Sbjct: 199 GC----ATQSDDA-RGILGMNLGRLGFPSQAKI--TKFSYCVPTKQAQPASGSFYLGNNP 251
Query: 250 -LNYFEYAYNMLILGEGAILEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKND 307
+ F Y N+L G+ S M +D +Y + L+GIS+G K L+I P++FK N
Sbjct: 252 ASSSFRYV-NLLTFGQ-------SQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVFKPNA 303
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKE-VEDLFQGLLPSYPMDPAWHLCYSGNINRDL 366
S IDSG+ T+LV AY +R+E V+ + + Y +C+ G+
Sbjct: 304 GGSGQ-TMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFDGDAIEIG 362
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
+ M F F G +V+ E V V CL +G S+ G +IIG QQN
Sbjct: 363 RLVGDMVFEFEKGVQIVIPKERVLATVDGGVHCLGMGRSERLGA---GGNIIGNFHQQNL 419
Query: 427 NVAYDLVSKQLYFQRIDCELLA 448
V +DL ++++ F DC LA
Sbjct: 420 WVEFDLANRRVGFGEADCSKLA 441
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 157/379 (41%), Gaps = 47/379 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++++ +G PP +LDTGS L W++C PC C +DP S ++ + C+
Sbjct: 162 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPR 221
Query: 156 CT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLYDVG-- 204
C+ C C Y Y + ++ G E F T+ EG++ Y V
Sbjct: 222 CSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENM 281
Query: 205 -FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC H N + S + L G FSYC+ + N + LI G
Sbjct: 282 MFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFG 341
Query: 264 EGAILEGDSTP--MSVIDGS-------YYVTLEGISLGEKMLDIDPNLFKKNDTWS---- 310
E L + S ++G YY+ ++ I +G + LDI +TW+
Sbjct: 342 EDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDI------PEETWNISPD 395
Query: 311 -DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG---LLPSYP-MDPAWHLCYSGNINRD 365
G IDSGTTL++ AY+ ++ + + + + +P +DP +++ I +
Sbjct: 396 GAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNV---SGIEEN 452
Query: 366 LQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
P + FA GA AE+ F S + CLA I G SIIG QQN
Sbjct: 453 NIHLPELGIAFADGAVWNFPAENSFIWLSEDLVCLA-----ILGTPKSTFSIIGNYQQQN 507
Query: 426 YNVAYDLVSKQLYFQRIDC 444
+++ YD +L F C
Sbjct: 508 FHILYDTKMSRLGFTPTKC 526
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 161/382 (42%), Gaps = 54/382 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++++ IG PP +LDTGS L W++C PC C +DP S+++ + C+
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255
Query: 156 CT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQF--NFETSDEGKT---FLYD 202
C C C Y Y + ++ G E F N +S GK+ + +
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL 262
V FGC H N + S + L G FSYC+ + + + LI
Sbjct: 316 VMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSSKLIF 375
Query: 263 GEG------------AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
GE +++ G P +D YY+ ++ I +G + L I + W+
Sbjct: 376 GEDKDLLTHPELNFTSLIAGKENP---VDTFYYLQIKSIFVGGEKLQIP------EENWN 426
Query: 311 -----DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG--LLPSYPMDPAWHLCYSGNIN 363
G IDSGTTL++ AY+ +++ +G L+ +P+ H CY+ +
Sbjct: 427 LSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPI---LHPCYNVSGT 483
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIA 422
+L FP FA GA E+ F + + + CLA + G LSIIG
Sbjct: 484 DEL-NFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLA-----MLGTPKSALSIIGNYQ 537
Query: 423 QQNYNVAYDLVSKQLYFQRIDC 444
QQN+++ YD + +L + + C
Sbjct: 538 QQNFHILYDTKNSRLGYAPMRC 559
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 161/382 (42%), Gaps = 54/382 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++++ IG PP +LDTGS L W++C PC C +DP S+++ + C+
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255
Query: 156 CT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQF--NFETSDEGKT---FLYD 202
C C C Y Y + ++ G E F N +S GK+ + +
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL 262
V FGC H N + S + L G FSYC+ + + + LI
Sbjct: 316 VMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSSKLIF 375
Query: 263 GEG------------AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
GE +++ G P +D YY+ ++ I +G + L I + W+
Sbjct: 376 GEDKDLLTHPELNFTSLIAGKENP---VDTFYYLQIKSIFVGGEKLQIP------EENWN 426
Query: 311 -----DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG--LLPSYPMDPAWHLCYSGNIN 363
G IDSGTTL++ AY+ +++ +G L+ +P+ H CY+ +
Sbjct: 427 LSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPI---LHPCYNVSGT 483
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIA 422
+L FP FA GA E+ F + + + CLA + G LSIIG
Sbjct: 484 DEL-NFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLA-----MLGTPKSALSIIGNYQ 537
Query: 423 QQNYNVAYDLVSKQLYFQRIDC 444
QQN+++ YD + +L + + C
Sbjct: 538 QQNFHILYDTKNSRLGYAPMRC 559
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 153/361 (42%), Gaps = 36/361 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ V IG PP + DT S L W +C FDP+KS ++A + C S
Sbjct: 91 YTVTIGIGTPPQLHTLIADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKL 150
Query: 156 CTNDCGGYP----DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
CT D G C Y Y + ++ G + E F SD + GFGC
Sbjct: 151 CTEDNPGTKRCSNKTCRYVYPYVS-VEAAGVLAYESFTL--SDNNQHICMSFGFGC---- 203
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
+D G G+ + + S+V ++ KFSYC+ Y + + L G A L G
Sbjct: 204 GALTDGNLLGASGILGMSPAILSMVSQLAIPKFSYCL--TPYTDRKSSPLFFGAWADL-G 260
Query: 271 DSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
I S YYV L G+SLG + LD+ F G +D G T+ L
Sbjct: 261 RYKTTGPIQKSLTFYYYVPLVGLSLGTRRLDVPAATF----ALKQGGTVVDLGCTVGQLA 316
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS--GNINRDLQGFPAMAFHFAGGADLVL 384
A+ L++ V L + + + +C++ + P + +F GGAD+VL
Sbjct: 317 EPAFTALKEAVLHTLNLPLTNRTVKD-YKVCFALPSGVAMGAVQTPPLVLYFDGGADMVL 375
Query: 385 DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
++ F + ++ + CLA+ P +SIIG + QQN+++ +D+ + F C
Sbjct: 376 PRDNYFQEPTAGLMCLALVPGG-------GMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428
Query: 445 E 445
+
Sbjct: 429 D 429
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 173/381 (45%), Gaps = 51/381 (13%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C C C T+ FD + S T
Sbjct: 78 VGLYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTAR 137
Query: 148 TLPCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
+PC CT+ C ++C Y +Y +G + G S+ F F+ + G++ +
Sbjct: 138 LVPCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFD-AVLGESLI 196
Query: 201 YD----VGFGCS---HNNAHFSDEQFTGVFGLGP------ATSSTHSLVEKVGSKFSYCI 247
+ + FGCS + +D+ G+FG G + S+H + +V FS+C
Sbjct: 197 ANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRV---FSHC- 252
Query: 248 GNLNYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKK 305
L + +L+LGE ILE +P+ Y + L+ I++ ++L IDP F
Sbjct: 253 --LKGEDSGGGILVLGE--ILEPGIVYSPLVPSQPHYNLDLQSIAVSGQLLPIDPAAFA- 307
Query: 306 NDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRD 365
T S+ G ID+GTTL +LV AY + L + P + CY + N
Sbjct: 308 --TSSNRGTIIDTGTTLAYLVEEAYDPFVSAITAAVSQL--ATPTINKGNQCYLVS-NSV 362
Query: 366 LQGFPAMAFHFAGGADLVLDAES--VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
+ FP ++F+FAGGA ++L E ++ + +G I G ++I+G +
Sbjct: 363 SEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQG----GITILGDLVL 418
Query: 424 QNYNVAYDLVSKQLYFQRIDC 444
++ YDL +++ + DC
Sbjct: 419 KDKIFVYDLAHQRIGWANYDC 439
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 99/404 (24%), Positives = 176/404 (43%), Gaps = 50/404 (12%)
Query: 78 SSQKAHDTRAH--------LHPGISTVP----VFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ + HD R H L G +P +++ +G PP +DTGS ++WV
Sbjct: 51 SALRVHDGRRHGRLLAAADLPLGGLGLPTDTGLYFTEIKLGTPPKRYYVQVDTGSDILWV 110
Query: 126 KCQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGG-YPD-----ECWYNI 171
C CE+C T +DP S + +T+ CD +C GG P C Y++
Sbjct: 111 NCISCEKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSV 170
Query: 172 RYTNGPDSQGTIGSEQFNF-ETSDEGKTFLYD--VGFGCSHNNA---HFSDEQFTGVFGL 225
Y +G + G ++ F + + +G+T + V FGC S++ G+ G
Sbjct: 171 MYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGF 230
Query: 226 GPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSY 282
G A +S S + KV F++C+ + + +G + +TP+ Y
Sbjct: 231 GQANTSMLSQLAAAGKVKKIFAHCLDTIK----GGGIFAIGNVVQPKVKTTPLVADMPHY 286
Query: 283 YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
V L+ I +G L + ++F +T G IDSGTTLT+L ++ + + + Q
Sbjct: 287 NVNLKSIDVGGTTLQLPAHVF---ETGERKGTIIDSGTTLTYLPELVFKEVMAAIFNKHQ 343
Query: 343 GLLPSYPMDPAWHLC--YSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCL 400
++ D +C Y G+++ GFP + FHF L + F+ + ++C+
Sbjct: 344 DIVFHNVQD---FMCFQYPGSVD---DGFPTITFHFEDDLALHVYPHEYFFPNGNDMYCV 397
Query: 401 AVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + KD+ ++G + N V YDL ++ + + +C
Sbjct: 398 GFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGWTDYNC 441
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 122/454 (26%), Positives = 185/454 (40%), Gaps = 74/454 (16%)
Query: 18 TSTRIFTSTTAAPAAGKPKRL--VTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLS 75
+S ++ + G PK ++L RD L +A+ ++N S
Sbjct: 65 SSLKVVSKYGPCTVTGDPKTFPSAAEILRRDQLRVK-----SIRAKHSMNSSTTGVF--- 116
Query: 76 QKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC- 133
+ K H G + V +G P + DTGS L W +C+PC C
Sbjct: 117 --NEMKTRVPTTHFGGG------YAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCF 168
Query: 134 --GATTFDPSKSLTYATLPCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIG 184
FDP+KS +Y L C S C + C + C Y ++Y G + G +
Sbjct: 169 PQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSS-SNSCLYGVKYGTG-YTVGFLA 226
Query: 185 SEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPA-------TSSTHSLVE 237
+E SD + F+ G N FS G+ GLG + TSST+ +
Sbjct: 227 TETLTITPSDVFENFVIGCG---ERNGGRFSGT--AGLLGLGRSPVALPSQTSSTYKNL- 280
Query: 238 KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM-SVIDGSYYVTLEGISLGEKML 296
FSYC L + L G G TP+ S I Y + + GIS+G + L
Sbjct: 281 -----FSYC---LPASSSSTGHLSFGGGVSQAAKFTPITSKIPELYGLDVSGISVGGRKL 332
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP---A 353
IDP++F+ AG IDSGTTLT+L +A+ L FQ ++ +Y +
Sbjct: 333 PIDPSVFRT------AGTIIDSGTTLTYLPSTAHSAL----SSAFQEMMTNYTLTKGTSG 382
Query: 354 WHLCY--SGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPSDINGE 410
CY S + N ++ P ++ F GG ++ +D +F + CLA NG
Sbjct: 383 LQPCYDFSKHANDNIT-IPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAF---KDNGN 438
Query: 411 RFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
D++I G + Q+ Y V YD+ + F C
Sbjct: 439 D-TDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/403 (26%), Positives = 176/403 (43%), Gaps = 48/403 (11%)
Query: 78 SSQKAHDTRAHLH-----------PGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ + HD R H G++T +++ IG P +DTGS ++WV
Sbjct: 57 SALREHDGRRHGRLLAAIDLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWV 116
Query: 126 KCQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGY------PDECWYNI 171
C C+ C T +DP S + + CD +C + GG C Y+I
Sbjct: 117 NCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSI 176
Query: 172 RYTNGPDSQGTIGSE--QFNFETSDEGKTFLYD--VGFGCSHN---NAHFSDEQFTGVFG 224
Y +G + G ++ Q+N + S +G+T + V FGC + S+ G+ G
Sbjct: 177 SYGDGSSTAGFFVTDFLQYN-QVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILG 235
Query: 225 LGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G + SS S + KV F++C+ +N + +G + +TP+
Sbjct: 236 FGQSNSSMLSQLAAAGKVRKMFAHCLDTVN----GGGIFAIGNVVQPKVKTTPLVPDMPH 291
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V L+GI +G L + N+F D+ + G IDSGTTL ++ Y+ L V D
Sbjct: 292 YNVILKGIDVGGTALGLPTNIF---DSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKH 348
Query: 342 QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLA 401
Q + D + YSG+++ GFP + FHF G L++ +Q +++C+
Sbjct: 349 QDISVQTLQDFSC-FQYSGSVD---DGFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCMG 404
Query: 402 VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ KDL ++G + N V YDL ++ + + +C
Sbjct: 405 FQNGGGKTKDGKDLGLLGDLVLSNKLVLYDLENQAIGWADYNC 447
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 163/382 (42%), Gaps = 47/382 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC----EQCGATTFDPSKSLTYATLPCDSS 154
++V F +G P P + V DTGS L WVKC F + S ++A + C S
Sbjct: 112 YFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRSWAPIACSSD 171
Query: 155 YCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF-----ETSDEG--KTFL 200
CT+ +C C Y+ RY +G ++G +G++ E+ D G + L
Sbjct: 172 TCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGRRAKL 231
Query: 201 YDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNM 259
V GC+ + S + GV LG + S S + G +FSYC+ + A +
Sbjct: 232 QGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNATSY 291
Query: 260 LILG----EGA-------ILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFK 304
L G EG TP+ ++D Y V ++ + + + LDI +++
Sbjct: 292 LTFGPPGPEGGAAASSSSSSAAARTPL-LLDRRMSPFYAVAVDAVHVAGEALDIPADVW- 349
Query: 305 KNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINR 364
D G +DSGT+LT L AY+ + + + G LP MDP + CY N
Sbjct: 350 --DVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAG-LPRVSMDP-FEYCY--NWTA 403
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
P + FAG A L A+S + V C+ V G +S+IG I QQ
Sbjct: 404 AALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQEGAWPG-----VSVIGNILQQ 458
Query: 425 NYNVAYDLVSKQLYFQRIDCEL 446
++ +DL + L F+ C L
Sbjct: 459 DHLWEFDLRDRWLRFKHTRCAL 480
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 116/444 (26%), Positives = 184/444 (41%), Gaps = 54/444 (12%)
Query: 22 IFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQK 81
I+ +TAA +A + L PN + RTL+ S +L + S
Sbjct: 24 IYIQSTAADSADSATPFGPSAMVLPLTLSAPNSS------RTLSHSRR---HLQRSESHS 74
Query: 82 AHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTF 138
R L+ + + IG PP ++DTGS+L +V C CEQCG F
Sbjct: 75 TATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNF 134
Query: 139 DPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
P S TY L C S CT C C Y+ +Y S G +G + +F E K
Sbjct: 135 QPDWSSTYQPLKC-SMECT--CDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKP 191
Query: 199 FLYDVGFGCSH-NNAHFSDEQFTGVFGLGPA-TSSTHSLVEK--VGSKFSYCIGNLNYFE 254
FGC + ++ G+ GLG S LVEK +G+ FS C G ++
Sbjct: 192 --QRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMD--- 246
Query: 255 YAYNMLILGEGAILEGDSTPMSVI--------DGSYYVTLEGISLGEKMLDIDPNLFKKN 306
+ G +L G S P ++ Y + L+ I + K L I+P +F
Sbjct: 247 ------VGGGAMVLGGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFD-- 298
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL-LPSYPMDPAWHLCYSG---NI 362
G +DSGTT +L A++ + + L L P +C+SG ++
Sbjct: 299 ---GKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDV 355
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS--VFCLAVGPSDINGERFKDLSIIGM 420
++ + FPA+ F+ G L L E+ +Q S + +CL + ++ + +++G
Sbjct: 356 SQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNEND-----QTTLLGG 410
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDC 444
I +N V YD ++ F + +C
Sbjct: 411 IIVRNTLVMYDREHLKIGFWKTNC 434
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 116/444 (26%), Positives = 184/444 (41%), Gaps = 54/444 (12%)
Query: 22 IFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQK 81
I+ +TAA +A + L PN + RTL+ S +L + S
Sbjct: 24 IYIQSTAADSADSATPFGPSAMVLPLTLSAPNSS------RTLSHSRR---HLQRSESHS 74
Query: 82 AHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTF 138
R L+ + + IG PP ++DTGS+L +V C CEQCG F
Sbjct: 75 TATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNF 134
Query: 139 DPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
P S TY L C S CT C C Y+ +Y S G +G + +F E K
Sbjct: 135 QPDWSSTYQPLKC-SMECT--CDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKP 191
Query: 199 FLYDVGFGCSH-NNAHFSDEQFTGVFGLGPA-TSSTHSLVEK--VGSKFSYCIGNLNYFE 254
FGC + ++ G+ GLG S LVEK +G+ FS C G ++
Sbjct: 192 --QRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMD--- 246
Query: 255 YAYNMLILGEGAILEGDSTPMSVI--------DGSYYVTLEGISLGEKMLDIDPNLFKKN 306
+ G +L G S P ++ Y + L+ I + K L I+P +F
Sbjct: 247 ------VGGGAMVLGGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFD-- 298
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL-LPSYPMDPAWHLCYSG---NI 362
G +DSGTT +L A++ + + L L P +C+SG ++
Sbjct: 299 ---GKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDV 355
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS--VFCLAVGPSDINGERFKDLSIIGM 420
++ + FPA+ F+ G L L E+ +Q S + +CL + ++ + +++G
Sbjct: 356 SQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNEND-----QTTLLGG 410
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDC 444
I +N V YD ++ F + +C
Sbjct: 411 IIVRNTLVMYDREHLKIGFWKTNC 434
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 100/372 (26%), Positives = 164/372 (44%), Gaps = 44/372 (11%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDS 153
P++ N +IG PP P A++ +W +C PC +C F+ S S TY PC +
Sbjct: 26 PLYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGT 85
Query: 154 SYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
+ C + C G C Y + G D+ G G++ F T+ + FGC+
Sbjct: 86 ALCESVPASTCSG-DGVCSYEVETMFG-DTSGIGGTDTFAIGTATA------SLAFGCAM 137
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
++ +GV GLG + SLV ++ + FSYC+ + + L+LG A L
Sbjct: 138 DSNIKQLLGASGVVGLG---RTPWSLVGQMNATAFSYCLAP-HGAAGKKSALLLGASAKL 193
Query: 269 EGD----STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
G +TP+ S Y + LEGI G+ ++ PN + V +D+
Sbjct: 194 AGGKSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPPN---------GSVVLVDTIFG 244
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY-----SGNINRDLQGFPAMAFHF 376
+++LV +A+Q ++K V + P P + LC+ + N L P + F
Sbjct: 245 VSFLVDAAFQAIKKAVTVAVGAAPMATPTKP-FDLCFPKAAAAAGANSSLP-LPDVVLTF 302
Query: 377 AGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQ 436
G A L + Y + CLA+ S + +LSI+G + Q+N + +DL +
Sbjct: 303 QGAAALTVPPSKYMYDAGNGTVCLAMMSSAML-NLTTELSILGRLHQENIHFLFDLDKET 361
Query: 437 LYFQRIDCELLA 448
L F+ DC L+
Sbjct: 362 LSFEPADCSSLS 373
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 113/433 (26%), Positives = 170/433 (39%), Gaps = 69/433 (15%)
Query: 40 TKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVF 99
T++L RD D VDA + R + S + A+ +ST +
Sbjct: 96 TEILRRD------QDRVDA---------IRRKVTASSNKPKGGVSLLANWGKSLSTTN-Y 139
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC 156
+ +G P + LDTGS WV+C+PC C FDP+ S TY+ +PC + C
Sbjct: 140 VASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSAVPCGAREC 199
Query: 157 TN---------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF------ETSDEGKTFLY 201
C Y + Y + + G + + +D F+
Sbjct: 200 QELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFV- 258
Query: 202 DVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
FGC H+NA E + S + + G+ FSYC L A L
Sbjct: 259 ---FGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYC---LPSSPSAAGYLS 312
Query: 262 LGEGAILEGDSTPMSVIDG----SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
G GA ++ ++ G SYY+ L GI + + + + + F + AG ID
Sbjct: 313 FG-GAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFA-----TAAGTIID 366
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY-----PMDPAWHLCYSGNINRDLQGFPAM 372
SGT + L PSAY LR F+ + Y P P + CY + ++ PA+
Sbjct: 367 SGTAFSRLPPSAYAALRSS----FRSAMGRYRYKRAPSSPIFDTCYDFTGHETVR-IPAV 421
Query: 373 AFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
FA GA + L V Y + CLA P+ DL I+G Q+ V YD
Sbjct: 422 ELVFADGATVHLHPSGVLYTWNDVAQTCLAFVPN-------HDLGILGNTQQRTLAVIYD 474
Query: 432 LVSKQLYFQRIDC 444
+ S+++ F R C
Sbjct: 475 VGSQRIGFGRKGC 487
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 151/363 (41%), Gaps = 71/363 (19%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP------CEQCGATTFDPSKSLTYATLPCD 152
+ + ++G PP LA+ DTGS L+WVKC+ T FDPS+S TY + C
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQ 160
Query: 153 SSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT----FLYD 202
+ C T D G C Y Y +G ++ G + +E F F+ G++ +
Sbjct: 161 TDACEALGRATCDDG---SNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGRSPRQVRIGG 217
Query: 203 VGFGCSHNNA---------HFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI--GNLN 251
V FGCS A + V LG ATS +G +FSYC+ ++N
Sbjct: 218 VKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATS--------LGRRFSYCLVPHSVN 269
Query: 252 YFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
A N L + STP+ +G K + + +
Sbjct: 270 A-SSALNFGALADVTEPGAASTPL---------------VGNKTV----------ASAAS 303
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQGF 369
+ + +DSGTTLT+L PS + E+ L P D LCY +G +
Sbjct: 304 SRIIVDSGTTLTFLDPSLLGPIVDELSRRIT-LPPVQSPDGLLQLCYNVAGREVEAGESI 362
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P + F GGA + L E+ F CLA+ + + +SI+G +AQQN +V
Sbjct: 363 PDLTLEFGGGAAVALKPENAFVAVQEGTLCLAI----VATTEQQPVSILGNLAQQNIHVG 418
Query: 430 YDL 432
YDL
Sbjct: 419 YDL 421
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 67/152 (44%), Gaps = 9/152 (5%)
Query: 297 DIDPNLFKKNDTWSDAG--VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
D+D S A + +DSGTTLT+L PS + E+ L P D
Sbjct: 420 DLDAGTVGNKTVASAASSRIIVDSGTTLTFLDPSLLGPIVDELSRRIT-LPPVQSPDGLL 478
Query: 355 HLCY--SGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERF 412
LCY +G + P + F GGA + L E+ F CLA+ +
Sbjct: 479 QLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAI----VATTEQ 534
Query: 413 KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ +SI+G +AQQN +V YDL + + F DC
Sbjct: 535 QPVSILGNLAQQNIHVGYDLDAGTVTFAVADC 566
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 96/364 (26%), Positives = 149/364 (40%), Gaps = 48/364 (13%)
Query: 116 LDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLP------CDSSYCTNDCGGYPDE 166
+D + W++C PC C FDP+KS T+ + C Y G
Sbjct: 120 MDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHNAVLCRPPYHPLQDG----R 175
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF-SDEQFTGVFGL 225
C + I Y NG + G + + F+F T D L + FGC++ A F + GV G+
Sbjct: 176 CGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIVFGCANRIARFDTHGALAGVLGM 235
Query: 226 G------PATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS----TPM 275
G P T L G +FSYC + AY+ L G + + M
Sbjct: 236 GMGAEGKPLTGFMRQLYHNGGGRFSYC--PIVPGTTAYSFLRFGNDIPSQPPAGVHRQSM 293
Query: 276 SVI-----DGSYYVTLEGISLGE-KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSA 329
+V+ +YYV L GIS+G ++ + P +F++ D G ID GT +T +V +A
Sbjct: 294 AVLAPTTTSEAYYVKLAGISVGALRVPGVTPEMFER-DQHGRGGCAIDIGTKMTAIVQTA 352
Query: 330 YQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESV 389
Y + V Q + P HLC + + P+M HF GG L + + +
Sbjct: 353 YAHVEAAVRGHLQRNRARFVQSPGHHLCVHRTPAIE-ERLPSMTLHFVGGPWLRVKPQHL 411
Query: 390 FYQESSSV-----FCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK--QLYFQRI 442
F S CL + P ++++IG + Q + +DL + + F
Sbjct: 412 FLVVGSPTGGGEYLCLGLVPD-------AEMTVIGAMQQIDTRFIFDLHNNIPIVSFNPE 464
Query: 443 DCEL 446
DC L
Sbjct: 465 DCHL 468
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 152/361 (42%), Gaps = 34/361 (9%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSY 155
P + V +G PP L +DT + W+ C C C TT F+P+ S +Y +PC S
Sbjct: 106 PTYVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTPFNPAASKSYRAVPCGSPA 165
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C+ C C +++ Y DS Q + +++ + FGC
Sbjct: 166 CSRAPNPSCSLNTKSCGFSLTYA---DSSLEAALSQDSLAVAND---VVKSYTFGCLQKA 219
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
+ + S + FSYC+ + ++ + + +G L
Sbjct: 220 TGTATPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLGRKGQPLRIK 279
Query: 272 STPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+TP+ V YYV++ GI +G+K++ I P D + AG +DSGT T LV
Sbjct: 280 TTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAAL-AFDPATGAGTVLDSGTMFTRLVAP 338
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDP--AWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDA 386
AY +R EV +G P+ + CY+ + +P + F F G + L A
Sbjct: 339 AYVAVRDEVRRRIRGA----PLSSLGGFDTCYNTTVK-----WPPVTFMFT-GMQVTLPA 388
Query: 387 ESVFYQE---SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
+++ ++S +A P +N L++I + QQN+ + +D+ + ++ F R
Sbjct: 389 DNLVIHSTYGTTSCLAMAAAPDGVN----TVLNVIASMQQQNHRILFDVPNGRVGFAREQ 444
Query: 444 C 444
C
Sbjct: 445 C 445
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 109/388 (28%), Positives = 163/388 (42%), Gaps = 73/388 (18%)
Query: 89 LHPGIS-TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSK 142
L PG S V + +G P P + V+DTGSSL W++C PC Q G FDP
Sbjct: 126 LTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGP-VFDPKT 184
Query: 143 SLTYATLPCDSSYCTNDCG---------GYPDECWYNIRYTNGPDSQGTIGSEQFNFETS 193
S +YA + C + C ND D C Y Y + S G + + +F
Sbjct: 185 SSSYAAVSCSTPQC-NDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSF--- 240
Query: 194 DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCI----- 247
G + + +GC +N G+ GL S + L +G FSYC+
Sbjct: 241 --GSNSVPNFYYGCGQDNEGLFGRS-AGLMGLARNKLSLLYQLAPTLGYSFSYCLPSSSS 297
Query: 248 ------GNLNYFEYAYNMLILGEGAILEGDSTPM--SVIDGS-YYVTLEGISLGEKMLDI 298
G+ N +Y+Y TPM S +D S Y++ L G+++ K L +
Sbjct: 298 SGYLSIGSYNPGQYSY---------------TPMVSSTLDDSLYFIKLSGMTVAGKPLAV 342
Query: 299 DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL--LPSYPMDPAWHL 356
+ +S IDSGT +T L + Y L K V +G +Y +
Sbjct: 343 ------SSSEYSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSI---LDT 393
Query: 357 CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLS 416
C+ G + PA++ F+GGA L L A+++ SS CLA P+ + +
Sbjct: 394 CFVGQASS--LRVPAVSMAFSGGAALKLSAQNLLVDVDSSTTCLAFAPA-------RSAA 444
Query: 417 IIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
IIG QQ ++V YD+ S ++ F C
Sbjct: 445 IIGNTQQQTFSVVYDVKSNRIGFAAGGC 472
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 155/375 (41%), Gaps = 77/375 (20%)
Query: 115 VLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCT---NDCGGYP---- 164
++DTGS L WV+C+PC C A FDPS S +YA +PC++S C G P
Sbjct: 179 IVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 238
Query: 165 -----------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH 213
+ C+Y++ Y +G S+G + ++ G + FGC +N
Sbjct: 239 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL-----GGASVDGFVFGCGLSNRG 293
Query: 214 FSDEQFTGVFGLGPATSSTHSLVE----KVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
F G GL + SLV + G FSYC+ + A ++ + G
Sbjct: 294 L----FGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGG------ 343
Query: 270 GDSTPMSVIDGSYYVTLEGISLGEKMLD-IDPNLFKKNDT-------------WSDAGVF 315
D S Y +S + D P + N T A V
Sbjct: 344 ---------DTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVL 394
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAM 372
+DSGT +T L PS Y+ +R E F YP P + L CY+ + +++ P +
Sbjct: 395 LDSGTVITRLAPSVYRAVRAEFARQFGA--ERYPAAPPFSLLDACYNLTGHDEVK-VPLL 451
Query: 373 AFHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDINGERFKDLS-IIGMIAQQNYNVA 429
GGAD+ +DA + + ++ S CLA+ F+D + IIG Q+N V
Sbjct: 452 TLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLS-----FEDQTPIIGNYQQKNKRVV 506
Query: 430 YDLVSKQLYFQRIDC 444
YD V +L F DC
Sbjct: 507 YDTVGSRLGFADEDC 521
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 159/374 (42%), Gaps = 64/374 (17%)
Query: 112 QLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSS-YCTNDCGGYPD-E 166
QLA LD G L W++C PC C + FDP+KS T++ +P ++ +C +
Sbjct: 112 QLA-LDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLANGA 170
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQ-FTGVFGL 225
C ++I Y + + G + + F+F ++ L + FGC+H HF +++ G+ GL
Sbjct: 171 CGFDIAYRDNTHASGYLARDTFSFPAGNDDFVPLSAIVFGCAHQTEHFKNQRAVAGILGL 230
Query: 226 G------PATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI------LEGDST 273
G P T+ T ++ G +FSYC Y+Y L G + ST
Sbjct: 231 GMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMYSY--LRFGSDIPSHPPPNVHRQST 288
Query: 274 PM---SVIDGSYYVTLEGISLGEKMLD-IDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSA 329
P+ + +Y+V L G+S+G L + P +F++N G +D GT +T + SA
Sbjct: 289 PVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRN-AHGAGGCVVDIGTRMTAFIHSA 347
Query: 330 YQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNI-------NRDLQGFPAMAFHFAGGADL 382
Y + V Q A + GN + D+ P+M HF GA L
Sbjct: 348 YVHIDHAVRQHLQ-------RRGAHIVVVRGNTCVQQPAPHHDV--LPSMTLHFENGAWL 398
Query: 383 VLDAESVFYQESSSVFCLAVGPSDINGERFK--------DLSIIGMIAQQNYNVAYDL-- 432
+ E VF P + G ++ DL++IG Q N+ +DL
Sbjct: 399 RVMPEHVFM------------PFVVGGHHYQCFGFVSSTDLTVIGARQQVNHRFIFDLHD 446
Query: 433 VSKQLYFQRIDCEL 446
+ F DC L
Sbjct: 447 TIPIMSFNPEDCHL 460
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 96/402 (23%), Positives = 163/402 (40%), Gaps = 44/402 (10%)
Query: 76 QKSSQKAHDTRAHLH------------PGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLI 123
Q S K+HD+ H ++ +++ +G PP +DTGS ++
Sbjct: 43 QLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDIL 102
Query: 124 WVKCQPCEQCGATT--------FDPSKSLTYATLPCDSSYCT----NDCGGYPDECWYNI 171
WV C PC +C T +D S T + C+ +C+ ++ G C Y++
Sbjct: 103 WVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHV 162
Query: 172 RYTNGPDSQGTIGSEQFNFETSD---EGKTFLYDVGFGCSHNNA---HFSDEQFTGVFGL 225
Y +G S G + E +V FGC N + +D G+ G
Sbjct: 163 VYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGF 222
Query: 226 GPATSSTHSLVEKVGSK---FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSY 282
G + +S S + GS FS+C+ N+N + +GE +TP+ Y
Sbjct: 223 GQSNTSIISQLAAGGSTKRIFSHCLDNMN----GGGIFAVGEVESPVVKTTPIVPNQVHY 278
Query: 283 YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
V L+G+ + +D+ P+L N D G IDSGTTL +L + Y +L +++ Q
Sbjct: 279 NVILKGMDVDGDPIDLPPSLASTN---GDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQ 335
Query: 343 GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV 402
L M C+S N D + FP + HF L + + ++C
Sbjct: 336 VKL---HMVQETFACFSFTSNTD-KAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGW 391
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + D+ ++G + N V YDL ++ + + +C
Sbjct: 392 QSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNC 433
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 155/375 (41%), Gaps = 77/375 (20%)
Query: 115 VLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCT---NDCGGYP---- 164
++DTGS L WV+C+PC C A FDPS S +YA +PC++S C G P
Sbjct: 180 IVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 239
Query: 165 -----------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH 213
+ C+Y++ Y +G S+G + ++ G + FGC +N
Sbjct: 240 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL-----GGASVDGFVFGCGLSNRG 294
Query: 214 FSDEQFTGVFGLGPATSSTHSLVE----KVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
F G GL + SLV + G FSYC+ + A ++ + G
Sbjct: 295 L----FGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGG------ 344
Query: 270 GDSTPMSVIDGSYYVTLEGISLGEKMLD-IDPNLFKKNDT-------------WSDAGVF 315
D S Y +S + D P + N T A V
Sbjct: 345 ---------DTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVL 395
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAM 372
+DSGT +T L PS Y+ +R E F YP P + L CY+ + +++ P +
Sbjct: 396 LDSGTVITRLAPSVYRAVRAEFARQFGA--ERYPAAPPFSLLDACYNLTGHDEVK-VPLL 452
Query: 373 AFHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDINGERFKDLS-IIGMIAQQNYNVA 429
GGAD+ +DA + + ++ S CLA+ F+D + IIG Q+N V
Sbjct: 453 TLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLS-----FEDQTPIIGNYQQKNKRVV 507
Query: 430 YDLVSKQLYFQRIDC 444
YD V +L F DC
Sbjct: 508 YDTVGSRLGFADEDC 522
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 96/402 (23%), Positives = 163/402 (40%), Gaps = 44/402 (10%)
Query: 76 QKSSQKAHDTRAHLH------------PGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLI 123
Q S K+HD+ H ++ +++ +G PP +DTGS ++
Sbjct: 39 QLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDIL 98
Query: 124 WVKCQPCEQCGATT--------FDPSKSLTYATLPCDSSYCT----NDCGGYPDECWYNI 171
WV C PC +C T +D S T + C+ +C+ ++ G C Y++
Sbjct: 99 WVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHV 158
Query: 172 RYTNGPDSQGTIGSEQFNFETSD---EGKTFLYDVGFGCSHNNA---HFSDEQFTGVFGL 225
Y +G S G + E +V FGC N + +D G+ G
Sbjct: 159 VYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGF 218
Query: 226 GPATSSTHSLVEKVGSK---FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSY 282
G + +S S + GS FS+C+ N+N + +GE +TP+ Y
Sbjct: 219 GQSNTSIISQLAAGGSTKRIFSHCLDNMN----GGGIFAVGEVESPVVKTTPIVPNQVHY 274
Query: 283 YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
V L+G+ + +D+ P+L N D G IDSGTTL +L + Y +L +++ Q
Sbjct: 275 NVILKGMDVDGDPIDLPPSLASTN---GDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQ 331
Query: 343 GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV 402
L M C+S N D + FP + HF L + + ++C
Sbjct: 332 VKL---HMVQETFACFSFTSNTD-KAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGW 387
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + D+ ++G + N V YDL ++ + + +C
Sbjct: 388 QSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNC 429
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 166/382 (43%), Gaps = 51/382 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYATL 149
++ +G PP +DTGS ++W+ C C C ++ FD S T A +
Sbjct: 83 LYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALV 142
Query: 150 PCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF-----ETSDEGK 197
PC C + C ++C Y +Y +G + G S+ F +++
Sbjct: 143 PCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANV 202
Query: 198 TFLYDVGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCI-GNL 250
+ FGCS + +D+ G+ G GP S S + G FS+C+ G+
Sbjct: 203 ASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDG 262
Query: 251 NYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
N +L+LGE ILE +P+ Y + L+ I++ ++L I+P +F +D
Sbjct: 263 N----GGGILVLGE--ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQVLSINPAVFATSD- 315
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
G IDSGTTL++LV AY L V+ S+ + CY + D
Sbjct: 316 --KRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQ--CYLVLTSID-DS 370
Query: 369 FPAMAFHFAGGADLVLDAESVF----YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
FP ++F+F GGA + L +Q+ + ++C+ + ++I+G + +
Sbjct: 371 FPTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQ------EGVTILGDLVLK 424
Query: 425 NYNVAYDLVSKQLYFQRIDCEL 446
+ V YDL +Q+ + DC +
Sbjct: 425 DKIVVYDLARQQIGWTNYDCSM 446
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 163/368 (44%), Gaps = 50/368 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------TFDPSKSLTYATLPCD 152
+ +GQP V DTGS + W++CQPC FDP S +Y+ L C+
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207
Query: 153 SSYCT--NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
S C + D C Y + Y +G + G + +E +F S+ + ++ GC H+
Sbjct: 208 SQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNS----IPNLPIGCGHD 263
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLN-----YFEYAYNMLILGE 264
N F G GL SL ++ S FSYC+ NL+ E+ NM
Sbjct: 264 NEGL----FAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDSDSSSTLEFNSNM----- 314
Query: 265 GAILEGDSTPMSVIDG----SY-YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
DS ++ SY YV + GIS+G K L I P F+ +++ G+ +DSG
Sbjct: 315 ----PSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGL-GGIIVDSG 369
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQGFPAMAFHFA 377
T ++ L Y++LR+ L L P+ P + CY SG N ++ P +AF +
Sbjct: 370 TIISRLPSDVYESLREAFVKLTSSLSPA-PGISVFDTCYNFSGQSNVEV---PTIAFVLS 425
Query: 378 GGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQ 436
G L L A + + +++ +CLA + LSIIG QQ V+YDL +
Sbjct: 426 EGTSLRLPARNYLIMLDTAGTYCLAFIKTK------SSLSIIGSFQQQGIRVSYDLTNSL 479
Query: 437 LYFQRIDC 444
+ F C
Sbjct: 480 VGFSTNKC 487
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 161/377 (42%), Gaps = 48/377 (12%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFDPSKSLTYATL 149
++Y IG P +DTGS ++WV C C++C T +DP S T + +
Sbjct: 88 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 147
Query: 150 PCDSSYCTNDCGGY------PDECWYNIRYTNGPDSQGTIGSEQFNF-ETSDEGKTFLYD 202
CD +C GG C Y++ Y +G + G S+ F + S +G+T +
Sbjct: 148 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 207
Query: 203 --VGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIGNLNYFE 254
V FGC S++ G+ G G + +S S + KV F++C+ +N
Sbjct: 208 STVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTIN--- 264
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ +G + +TP+ Y V L+ I +G L + ++F DT G
Sbjct: 265 -GGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMF---DTGEKKGT 320
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-----LC--YSGNINRDLQ 367
IDSGTTLT+L Y+ + V + D +H LC Y G ++ D
Sbjct: 321 IIDSGTTLTYLPEIVYKEIMLAVF--------AKHKDITFHNVQEFLCFQYVGRVDDD-- 370
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
FP + FHF L + F++ +++C+ + + K + ++G + N
Sbjct: 371 -FPKITFHFENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKL 429
Query: 428 VAYDLVSKQLYFQRIDC 444
V YDL ++ + + +C
Sbjct: 430 VVYDLENQVIGWTEYNC 446
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 116/444 (26%), Positives = 186/444 (41%), Gaps = 73/444 (16%)
Query: 30 PAAGKPKRLVTK----LLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDT 85
P G P R +K + HRD L+ + +R N + + S + D
Sbjct: 50 PGDGLPNRDSSKYYRVMAHRDRLI---------RGRRLANEDQS-LVTFSDGNETVRVDA 99
Query: 86 RAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-------GATTF 138
LH Y N ++G P + LDTGS L W+ C C C G ++
Sbjct: 100 LGFLH---------YANVTVGTPSDWFMVALDTGSDLFWLPCD-CTNCVRELKAPGGSSL 149
Query: 139 D-----PSKSLTYATLPCDSSYCT--NDCGGYPDECWYNIRY-TNGPDSQGTIGSEQFNF 190
D P+ S T +PC+S+ CT + C +C Y IRY +NG S G + + +
Sbjct: 150 DLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHL 209
Query: 191 ETSDE-GKTFLYDVGFGCSH-NNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG---SKFS 244
++D+ K V FGC F D G+FGLG S S++ K G + FS
Sbjct: 210 VSNDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFS 269
Query: 245 YCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVID--GSYYVTLEGISLGEKMLDIDPNL 302
C GN ++ G+ ++ TP+++ +Y +T+ IS+G D++ +
Sbjct: 270 MCFGNDGAGRISF-----GDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFD- 323
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDL-FQGLLPSYPMDPAWHLCYSGN 361
VF DSGT+ T+L +AY + + L + + + CY+ +
Sbjct: 324 ----------AVF-DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALS 372
Query: 362 INRDLQGFPAMAFHFAGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGM 420
N+D +PA+ GG+ V V + + V+CLA+ + +D+SIIG
Sbjct: 373 PNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAI-------MKIEDISIIGQ 425
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDC 444
Y V +D L ++ DC
Sbjct: 426 NFMTGYRVVFDREKLILGWKESDC 449
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 125/436 (28%), Positives = 182/436 (41%), Gaps = 65/436 (14%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPG--ISTVPV------FYVNFSIGQ 107
VDA+ T M R ++++ H A + G ++ P+ + + IG
Sbjct: 40 VDAKQNCTTKERMRR-------ATERTHRRLASMAGGGGEASAPIHWNETQYIAEYLIGD 92
Query: 108 PPVPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFDPSKSLTYATLPCDSSYC----TN 158
PP A++DTGS+LIW +C C G T +DPS+S T + C+ + C
Sbjct: 93 PPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACNDTACLLGSET 152
Query: 159 DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC--SHNNAHFSD 216
C C Y G G +G+E F F + + + FGC + S
Sbjct: 153 RCARDGKACAVLTAYGAGAIG-GFLGTEVFTFGHGQSSENNV-SLAFGCITASRLTPGSL 210
Query: 217 EQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYN--MLILGEGAILEGDST 273
+ +G+ GLG SL ++G +KFSYC+ YF A N L +G A L G
Sbjct: 211 DGASGIIGLG---RGKLSLPSQLGDNKFSYCL--TPYFSDAANTSTLFVGASAGLSGGGA 265
Query: 274 PMSVI-----------DGSYYVTLEGISLGEKMLDIDPNLFKKNDT----WSDAGVFIDS 318
P + + D YY+ L GI++G LD+ F + W G IDS
Sbjct: 266 PATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAKW--GGTLIDS 323
Query: 319 GTTLTWLVPSAYQTLRKE-VEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-PAMAFHF 376
G+ T L+ AYQ LR E V L ++P LC G D P + HF
Sbjct: 324 GSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDAGKLVPPLVLHF 383
Query: 377 ----AGGADLVLDAESVFYQESSSVFCLAV----GPSDINGERFKDLSIIGMIAQQNYNV 428
GG D+V+ E+ + S C+ V GP+ + +IIG QQ+ ++
Sbjct: 384 GSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNST--LPLNETTIIGNYMQQDMHL 441
Query: 429 AYDLVSKQLYFQRIDC 444
YDL L FQ DC
Sbjct: 442 LYDLGQGVLSFQPADC 457
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 117/443 (26%), Positives = 175/443 (39%), Gaps = 79/443 (17%)
Query: 30 PAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHL 89
P+ K + T+LL RD L N Y+ ++ S + + L
Sbjct: 73 PSGKKKQPTFTELLRRDQLRAN---------------------YIQRQFSDEHYPRTGGL 111
Query: 90 HPGISTVPV----------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFD 139
+TVP+ + + SIG P V +DTGS + W++C+ + +D
Sbjct: 112 QQSEATVPIALGSLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCK------SRLYD 165
Query: 140 PSKSLTYATLPCDSSYCTN------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETS 193
P S TYA C + C C C Y+++Y +G ++ GT GS+ +
Sbjct: 166 PGTSSTYAPFSCSAPACAQLGRRGTGCSS-GSTCVYSVKYGDGSNTTGTYGSDTLTLAGT 224
Query: 194 DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCIGNLNY 252
E + FGCS F ++ G+ GLG A S GS FSYC L
Sbjct: 225 SE--PLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYC---LPP 279
Query: 253 FEYAYNMLILG---EGAILEGDSTPM--SVIDGSYY-VTLEGISLGEKMLDIDPNLFKKN 306
+ L LG +TPM S ++Y + L GIS+G K L+I ++F
Sbjct: 280 TWNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFS-- 337
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVED-----LFQGLLPSYPMDPAWHLCYSGN 361
AG +DSGT +T L P+AY L D +Q P +D + G
Sbjct: 338 -----AGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCFDFTGHGE 392
Query: 362 INRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMI 421
N P++A GGA + L + CLA +D +G IIG +
Sbjct: 393 GNNFT--VPSVALVLDGGAVVDLHPNGIVQDG-----CLAFAATDDDGR----TGIIGNV 441
Query: 422 AQQNYNVAYDLVSKQLYFQRIDC 444
Q+ + V YD+ F+ C
Sbjct: 442 QQRTFEVLYDVGQSVFGFRPGAC 464
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 155/382 (40%), Gaps = 55/382 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++++ IG PP +LDTGS L W++C PC C +DP +S ++ + C
Sbjct: 192 YFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDPR 251
Query: 156 C--------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKT---FLYDV 203
C C C Y Y + ++ G E F TS GK+ + +V
Sbjct: 252 CHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVENV 311
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC H N + S + L G FSYC+ + N + LI G
Sbjct: 312 MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFG 371
Query: 264 EG------------AILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW-- 309
E +++ G P +D YYV ++ I +G ++L I +TW
Sbjct: 372 EDKDLLNHPEVNFTSLVAGKENP---VDTFYYVQIKSIMVGGEVLKI------PEETWHL 422
Query: 310 ---SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD---PAWHLCYSGNIN 363
G +DSGTTL++ +Y+ ++ D F + YP+ P CY+ +
Sbjct: 423 SPEGAGGTIVDSGTTLSYFAEPSYEIIK----DAFVKKVKGYPVIKDFPILDPCYNVSGV 478
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIA 422
++ P F GA E+ F + E + CLA I G LSIIG
Sbjct: 479 EKME-LPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLA-----ILGTPRSALSIIGNYQ 532
Query: 423 QQNYNVAYDLVSKQLYFQRIDC 444
QQN+++ YD +L + + C
Sbjct: 533 QQNFHILYDTKKSRLGYAPMKC 554
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 162/379 (42%), Gaps = 48/379 (12%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFDPSKSLTYA 147
+ ++Y IG P +DTGS ++WV C C++C T +DP S T +
Sbjct: 1 MKLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGS 60
Query: 148 TLPCDSSYCTNDCGGY------PDECWYNIRYTNGPDSQGTIGSEQFNF-ETSDEGKTFL 200
+ CD +C GG C Y++ Y +G + G S+ F + S +G+T
Sbjct: 61 KVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRP 120
Query: 201 YD--VGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIGNLNY 252
+ V FGC S++ G+ G G + +S S + KV F++C+ +N
Sbjct: 121 ANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTIN- 179
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
+ +G + +TP+ Y V L+ I +G L + ++F DT
Sbjct: 180 ---GGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMF---DTGEKK 233
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-----LC--YSGNINRD 365
G IDSGTTLT+L Y+ + V + D +H LC Y G ++ D
Sbjct: 234 GTIIDSGTTLTYLPEIVYKEIMLAVF--------AKHKDITFHNVQEFLCFQYVGRVDDD 285
Query: 366 LQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
FP + FHF L + F++ +++C+ + + K + ++G + N
Sbjct: 286 ---FPKITFHFENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSN 342
Query: 426 YNVAYDLVSKQLYFQRIDC 444
V YDL ++ + + +C
Sbjct: 343 KLVVYDLENQVIGWTEYNC 361
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 159/376 (42%), Gaps = 43/376 (11%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQ-----PCEQCGATTFDPSKSLTYATLPCDSSY 155
++ IG P Q VLDTGS L W++C T+FDPS S +++ LPC
Sbjct: 83 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 142
Query: 156 CTNDCGGYP--------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C + C Y+ Y +G ++G + E+F F S + GC
Sbjct: 143 CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLI----LGC 198
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFE--YAYNMLILGEG 265
+ + G+ G+ S S + SKFSYCI + + LGE
Sbjct: 199 AKEST-----DVKGILGMNLGRLSFISQAKI--SKFSYCIPTRSNRPGLASTGSFYLGEN 251
Query: 266 AILEG----------DSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
G S M +D +Y V L GI +G+K L+I ++F+ D
Sbjct: 252 PNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRP-DAGGSGQT 310
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-YPMDPAWHLCYSGNINRDLQGFPA-M 372
+DSG+ T LV AY +++E+ L L Y +C+ GN + +
Sbjct: 311 MVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDL 370
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
F F G +++++ + + + C+ +G S + G +IIG + QQN V +D+
Sbjct: 371 VFEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGA---ASNIIGNVHQQNLWVEFDV 427
Query: 433 VSKQLYFQRIDCELLA 448
++++ F + +C L+
Sbjct: 428 ANRRVGFSKAECSRLS 443
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 95/402 (23%), Positives = 168/402 (41%), Gaps = 44/402 (10%)
Query: 76 QKSSQKAHDTRAHLH------------PGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLI 123
Q S K+HD+ H ++ +++ +G PP +DTGS ++
Sbjct: 42 QLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDIL 101
Query: 124 WVKCQPCEQCGATT--------FDPSKSLTYATLPCDSSYCT----NDCGGYPDECWYNI 171
WV C PC +C T +D S T + C+ ++C+ ++ G C Y++
Sbjct: 102 WVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHV 161
Query: 172 RYTNGPDSQGTIGSEQFNF-ETSDEGKT--FLYDVGFGCSHNNA---HFSDEQFTGVFGL 225
Y +G S G + + + +T +V FGC N + ++ G+ G
Sbjct: 162 VYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGF 221
Query: 226 GPATSSTHSLVEKVGSK---FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSY 282
G + +S S + GS FS+C+ N+N + +GE +TP+ Y
Sbjct: 222 GQSNTSVISQLAAGGSVKRIFSHCLDNMN----GGGIFAIGEVESPVVKTTPLVPNQVHY 277
Query: 283 YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
V L+G+ + + +D+ P+L N D G IDSGTTL +L + Y +L +++ Q
Sbjct: 278 NVILKGMDVDGEPIDLPPSLASTN---GDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQ 334
Query: 343 GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV 402
L M C+S N D + FP + HF L + + ++C
Sbjct: 335 VKL---HMVQETFACFSFTSNTD-KAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGW 390
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + D+ ++G + N V YDL ++ + + +C
Sbjct: 391 QSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNC 432
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 160/353 (45%), Gaps = 49/353 (13%)
Query: 111 PQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYN 170
P+ ++DTGS LIW +C+ A S L+ T P + T C
Sbjct: 52 PRKLIVDTGSDLIWTQCKLSSSTAAAARHGSPPLSR-TAPARTGAFTRTC---------- 100
Query: 171 IRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATS 230
T + G + SE F F + +GFGC +A S TG+ GL P
Sbjct: 101 ---TASAAAVGVLASETFTFGAR---RAVSLRLGFGCGALSAG-SLIGATGILGLSP--- 150
Query: 231 STHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS-------- 281
+ SL+ ++ +FSYC+ + + + L+ G A L T + +
Sbjct: 151 ESLSLITQLKIQRFSYCL--TPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVET 208
Query: 282 --YYVTLEGISLGEKMLDI-DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE 338
YYV L GISLG K L + +L + D G +DSG+T+ +LV +A++ +++ V
Sbjct: 209 VYYYVPLVGISLGHKRLAVPAASLAMRPD--GGGGTIVDSGSTVAYLVEAAFEAVKEAVM 266
Query: 339 DLFQGLLPSYPMDPAWHLCY-----SGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQE 393
D+ + + + ++ + LC+ + + P + HF GGA +VL ++ F +
Sbjct: 267 DVVRLPVANRTVED-YELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEP 325
Query: 394 SSSVFCLAVGP-SDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ + CLAVG +D +G +SIIG + QQN +V +D+ + F C+
Sbjct: 326 RAGLMCLAVGKTTDGSG-----VSIIGNVQQQNMHVLFDVQHHKFSFAPTQCD 373
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/385 (26%), Positives = 166/385 (43%), Gaps = 47/385 (12%)
Query: 85 TRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPS 141
R LH + T + IG PP ++D+GS++ +V C CEQCG F P
Sbjct: 74 ARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPD 133
Query: 142 KSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
S TY+ + C+ CT C ++C Y +Y S G +G + +F T E K
Sbjct: 134 LSSTYSPVKCNVD-CT--CDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKP--Q 188
Query: 202 DVGFGCSHNN-AHFSDEQFTGVFGLGPAT-SSTHSLVEK--VGSKFSYCIGNLNYFEYAY 257
FGC ++ + G+ GLG S LV+K +G FS C G ++
Sbjct: 189 RAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMD------ 242
Query: 258 NMLILGEGAILEGDS--------TPMSVIDGSYY-VTLEGISLGEKMLDIDPNLFKKNDT 308
+G GA++ G T + + YY + L+ + + K L +DP +F
Sbjct: 243 ----IGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKH- 297
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSG---NINR 364
G +DSGTT +L A+ + V L D + +C++G N+++
Sbjct: 298 ----GTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQ 353
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIA 422
+ FP + F G L L E+ ++ S +CL V NG+ +++G I
Sbjct: 354 LSEVFPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGV---FQNGK--DPTTLLGGIV 408
Query: 423 QQNYNVAYDLVSKQLYFQRIDCELL 447
+N V YD ++++ F + +C L
Sbjct: 409 VRNTLVTYDRHNEKIGFWKTNCSEL 433
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 105/368 (28%), Positives = 159/368 (43%), Gaps = 53/368 (14%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCG 161
IG PP ++DTGS++ +V C C+QCG F P S TY + C+ CT C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNPD-CT--CD 58
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD---EQ 218
D+C Y +Y S G +G + +F E K FGC NA D +
Sbjct: 59 TENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKP--QRAVFGC--ENAETGDLFSQH 114
Query: 219 FTGVFGLGPA-TSSTHSLVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM 275
G+ GLG S LVEK + FS C G + +G GA++ G +P
Sbjct: 115 ADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGME----------VGGGAMVLGQISPP 164
Query: 276 SVIDGS---------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
S + S Y + L G+ + K LDI+P +F G +DSGTT +L
Sbjct: 165 SDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKH-----GTILDSGTTYAYLP 219
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSG---NINRDLQGFPAMAFHFAGGADL 382
+A+ + + GL DP ++ +C+SG I + FP++ F G
Sbjct: 220 EAAFLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKY 279
Query: 383 VLDAESVFYQESS--SVFCLAVGPSDINGERFKD-LSIIGMIAQQNYNVAYDLVSKQLYF 439
L E+ ++ S +CL V + KD +++G I +N V YD ++ F
Sbjct: 280 SLSPENYLFKHSKVHGAYCLGV------FQNGKDPTTLLGGIVVRNTLVTYDREHSKVGF 333
Query: 440 QRIDCELL 447
+ +C +L
Sbjct: 334 WKTNCSVL 341
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 105/368 (28%), Positives = 159/368 (43%), Gaps = 53/368 (14%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCG 161
IG PP ++DTGS++ +V C C+QCG F P S TY + C+ CT C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNPD-CT--CD 58
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD---EQ 218
D+C Y +Y S G +G + +F E K FGC NA D +
Sbjct: 59 TENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKP--QRAVFGC--ENAETGDLFSQH 114
Query: 219 FTGVFGLGPA-TSSTHSLVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM 275
G+ GLG S LVEK + FS C G + +G GA++ G +P
Sbjct: 115 ADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGME----------VGGGAMVLGQISPP 164
Query: 276 SVIDGS---------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
S + S Y + L G+ + K LDI+P +F G +DSGTT +L
Sbjct: 165 SDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKH-----GTILDSGTTYAYLP 219
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSG---NINRDLQGFPAMAFHFAGGADL 382
+A+ + + GL DP ++ +C+SG I + FP++ F G
Sbjct: 220 EAAFLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKY 279
Query: 383 VLDAESVFYQESS--SVFCLAVGPSDINGERFKD-LSIIGMIAQQNYNVAYDLVSKQLYF 439
L E+ ++ S +CL V + KD +++G I +N V YD ++ F
Sbjct: 280 SLSPENYLFKHSKVHGAYCLGV------FQNGKDPTTLLGGIVVRNTLVTYDREHSKVGF 333
Query: 440 QRIDCELL 447
+ +C +L
Sbjct: 334 WKTNCSVL 341
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 158/386 (40%), Gaps = 52/386 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ---------PCEQCGAT-TFDPSKSLTYAT 148
+ V+ + G PP L + DTGS LIW++C P + C F SKS T +
Sbjct: 54 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 113
Query: 149 LPCDSSYCT---------NDCG-GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
+PC ++ C C P C Y Y +G + G + + G
Sbjct: 114 VPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGGA 173
Query: 199 FLYDVGFGCSHNNAHFSDEQFTGVFGLG------PATSSTHSLVEKVGSKFSYCIGNLNY 252
+ V FGC N S GV GLG PA S SL + FSYC+ +L
Sbjct: 174 AVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSG--SLFAQT---FSYCLLDLEG 228
Query: 253 FEYAY--NMLILGEGAILEGDS-TPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKN 306
+ L LG + TP+ + YYV + I +G ++L + P
Sbjct: 229 GRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPV-PGSEWAI 287
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW----HLCYSGNI 362
D + G IDSG+TLT+L AY L LP P + LCY+ +
Sbjct: 288 DVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVH--LPRIPSSATFFQGLELCYNVSS 345
Query: 363 NRDLQ----GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSII 418
+ L GFP + FA G L L + + V CLA+ P+ ++ F ++
Sbjct: 346 SSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPT-LSPFAFN---VL 401
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDC 444
G + QQ Y+V +D S ++ F R +C
Sbjct: 402 GNLMQQGYHVEFDRASARIGFARTEC 427
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 109/438 (24%), Positives = 171/438 (39%), Gaps = 71/438 (16%)
Query: 62 RTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV----------------------- 98
R + I QKS++K +++ P +S V
Sbjct: 132 RVIEKKNQNTISRLQKSTKKQTNSKQSYKPAVSPVAAASPEYSSQLVATLESGVSLGSGE 191
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++++ IG PP +LDTGS L W++C PC C + +DP +S ++ + C
Sbjct: 192 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENITCHDPR 251
Query: 156 CT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKT---FLYDV 203
C C C Y Y + ++ G E F T+ GK+ + +V
Sbjct: 252 CKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVENV 311
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC H N + S L G FSYC+ + N + LI G
Sbjct: 312 MFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQSIYGHSFSYCLVDRNSDTSVSSKLIFG 371
Query: 264 EGAILEGDST---------PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW----- 309
E L + +D YYV ++ I + ++L I +TW
Sbjct: 372 EDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIP------EETWHLSKE 425
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG--LLPSY-PMDPAWHLCYSGNINRDL 366
G IDSGTTLT+ AY+ +++ +G L+ + P+ P +++ SG +L
Sbjct: 426 GGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPLKPCYNV--SGIEKMEL 483
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
P F+ GA E+ F Q + CLA I G LSIIG QQN+
Sbjct: 484 ---PDFGILFSDGAMWDFPVENYFIQIEPDLVCLA-----ILGTPKSALSIIGNYQQQNF 535
Query: 427 NVAYDLVSKQLYFQRIDC 444
++ YD+ +L + + C
Sbjct: 536 HILYDMKKSRLGYAPMKC 553
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 159/375 (42%), Gaps = 48/375 (12%)
Query: 108 PPVPQLAVLDTGSSLIWVKCQPCEQCG-ATTFDPSKSLTYATLPCDSSYCTNDCGGY--- 163
PP V+DTGS L W++C FDP++S +Y+ +PC S C +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141
Query: 164 -----PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQ 218
C + Y + S+G + +E F+F S + FGC + + E+
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLI----FGCMGSVSGSDPEE 197
Query: 219 FTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAI---------- 267
T GL + S + ++G KFSYCI + F L+LG+
Sbjct: 198 DTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFP---GFLLLGDSNFTWLTPLNYTP 254
Query: 268 LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
L STP+ D +Y V L GI + K+L I ++ + T + +DSGT T+L+
Sbjct: 255 LIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGA-GQTMVDSGTQFTFLL 313
Query: 327 PSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINRDLQG----FPAMAFHFA 377
Y LR + G+L P + LCY + R G P ++ F
Sbjct: 314 GPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFE 373
Query: 378 GGADLVLDAESVFYQ------ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
GA++ + + + Y+ + SV+C G SD+ G + +IG QQN + +D
Sbjct: 374 -GAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMG---MEAYVIGHHHQQNMWIEFD 429
Query: 432 LVSKQLYFQRIDCEL 446
L ++ ++C++
Sbjct: 430 LQRSRIGLAPVECDV 444
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 125/477 (26%), Positives = 208/477 (43%), Gaps = 61/477 (12%)
Query: 1 MPSSHAILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQA 60
MPSS +IL L L F + + T A G P L+T L R + N V+ +
Sbjct: 1 MPSSISILAL---ILAFAAILL---TAAVVHCGSPASLLT--LERA---FPVNQRVELEV 49
Query: 61 QRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGS 120
R + AR L + D + V +++ +G PP +DTGS
Sbjct: 50 LRARDQ--ARHGRLLRGVVGGVVDFTVYGTSDPYLVGLYFTKVKLGSPPREFNVQIDTGS 107
Query: 121 SLIWVKCQPCEQCGATT--------FDPSKSLTYATLPCDSSYCTN-------DCGGYPD 165
++WV C C C T+ FDPS S T + + C CT+ +C +
Sbjct: 108 DILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAECSPQSN 167
Query: 166 ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD----VGFGCS---HNNAHFSDEQ 218
+C Y+ Y +G + G S+ F+T G + + + + FGCS + D+
Sbjct: 168 QCSYSFHYGDGSGTTGYYVSDMLYFDTV-LGDSLIANSSASIVFGCSTYQSGDLTKVDKA 226
Query: 219 FTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEYAYNMLILGEGAILEGDS--T 273
G+FG G S S + +G FS+C+ L+LGE ILE + +
Sbjct: 227 IDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGEG---DGGGKLVLGE--ILEPNIIYS 281
Query: 274 PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTL 333
P+ Y + L+ IS+ ++L IDP +F T ++ G +DSGTTLT+LV +AY
Sbjct: 282 PLVPSQSHYNLNLQSISVNGQLLPIDPAVFA---TSNNQGTIVDSGTTLTYLVETAYDPF 338
Query: 334 RKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVF--- 390
+ + P+ + CY + + D + FP ++ +FAGGA +VL
Sbjct: 339 VSAITATVSS--STTPVLSKGNQCYLVSTSVD-EIFPPVSLNFAGGASMVLKPGEYLMHL 395
Query: 391 -YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
+ + ++++C+ G ++I+G + ++ YDL +++ + DC L
Sbjct: 396 GFSDGAAMWCIGFQKVAEPG-----ITILGDLVLKDKIFVYDLAHQRIGWANYDCSL 447
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 164/363 (45%), Gaps = 40/363 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------TFDPSKSLTYATLPCD 152
+ +GQP V DTGS + W++CQPC FDP S +Y+ L C+
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207
Query: 153 SSYCT--NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
S C + D C Y + Y +G + G + +E +F S+ + ++ GC H+
Sbjct: 208 SQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNS----IPNLPIGCGHD 263
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
N F G GL SL ++ S FSYC+ NL+ + + L + +
Sbjct: 264 NEGL----FAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLD----SDSSSTLEFNSYMP 315
Query: 270 GDSTPMSVIDG----SY-YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
DS ++ SY YV + GIS+G K L I P F+ +++ G+ +DSGT ++
Sbjct: 316 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGL-GGIIVDSGTIISR 374
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQGFPAMAFHFAGGADL 382
L Y++LR+ L L P+ P + CY SG N ++ P +AF + G L
Sbjct: 375 LPSDVYESLREAFVKLTSSLSPA-PGISVFDTCYNFSGQSNVEV---PTIAFVLSEGTSL 430
Query: 383 VLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
L A + + +++ +CLA + LSIIG QQ V+YDL + + F
Sbjct: 431 RLPARNYLIMLDTAGTYCLAFIKTK------SSLSIIGSFQQQGIRVSYDLTNSIVGFST 484
Query: 442 IDC 444
C
Sbjct: 485 NKC 487
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 172/383 (44%), Gaps = 45/383 (11%)
Query: 94 STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLT 145
S V ++Y +G PP +DTGS ++WV C C C T+ FDP S T
Sbjct: 72 SQVGLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSST 131
Query: 146 YATL-----PCDSSYCTND--CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
+ + C S T+D C ++C Y +Y +G + G S+ +F EG
Sbjct: 132 SSLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTL 191
Query: 199 FL---YDVGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGN 249
V FGCS + S+ G+FG G S S + G FS+C+
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKG 251
Query: 250 LNYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKND 307
N +L+LGE I+E + +P+ Y + L+ IS+ +++ I P +F
Sbjct: 252 DN---SGGGVLVLGE--IVEPNIVYSPLVQSQPHYNLNLQSISVNGQIVPIAPAVFA--- 303
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ 367
T ++ G +DSGTTL +L AY + L + S + + CY + ++
Sbjct: 304 TSNNRGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRS--VLSRGNQCYLITTSSNVD 361
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQES----SSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
FP ++ +FAGGA LVL + Q++ SV+C +G I G+ ++I+G +
Sbjct: 362 IFPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWC--IGFQRIPGQ---SITILGDLVL 416
Query: 424 QNYNVAYDLVSKQLYFQRIDCEL 446
++ YDL +++ + DC L
Sbjct: 417 KDKIFVYDLAGQRIGWANYDCSL 439
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 108/391 (27%), Positives = 168/391 (42%), Gaps = 33/391 (8%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR YLS ++QK + V + V +G P VLDT + W C
Sbjct: 65 ARIRYLSSLTAQKTVAAPIASGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCS 124
Query: 129 PCEQCGA-TTFDPSKSLTYATLPCDSSYCTNDCG-GYPD----ECWYNIRYTNGPDSQGT 182
C C + TTF S T+ATL C CT G P +C +N Y T
Sbjct: 125 GCIGCSSTTTFSAQNSSTFATLDCSKPECTQARGLSCPTTGNVDCLFNQTYGGDSTFSAT 184
Query: 183 IGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS- 241
+ + + G + + FGC ++A S G+ GLG SL+ + GS
Sbjct: 185 LVQDSLHL-----GPNVIPNFSFGC-ISSASGSSIPPQGLMGLG---RGPLSLISQSGSL 235
Query: 242 ---KFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKM 295
FSYC+ + + ++ ++ + G +TP+ YYV L GIS+G +
Sbjct: 236 YSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVL 295
Query: 296 LDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH 355
+ I P L D + AG IDSGT +T VP+ Y +R E G S+ A+
Sbjct: 296 VPISPELLAF-DPNTGAGTIIDSGTVITRFVPAIYTAVRDEFRKQVGG---SFSPLGAFD 351
Query: 356 LCYSGNINRDLQGFPAMAFHFAGGADLVLDAE-SVFYQESSSVFCLAVGPSDINGERFKD 414
C++ N + PA+ H + G DL L E S+ + + S+ CLA+ +
Sbjct: 352 TCFATN---NEVSAPAITLHLS-GLDLKLPMENSLIHSSAGSLACLAMAAAPN--NVNSV 405
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+++I + QQN+ + +D+ + +L R C
Sbjct: 406 VNVIANLQQQNHRILFDINNSKLGIARELCN 436
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 110/411 (26%), Positives = 170/411 (41%), Gaps = 55/411 (13%)
Query: 69 ARFIYLSQKSS----QKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIW 124
+R +YLS +S R LH P + V S+G PP L +DT + W
Sbjct: 65 SRVLYLSSLASGFGGAPLASGRQLLH-----TPTYLVRASLGTPPQRLLLAVDTSNDAAW 119
Query: 125 VKCQPCEQCGAT--TFDPSKSLTYATLPCDSSYCTN-------DCGGYPDECWYNIRYTN 175
V C C C T +F+P+ S T+ +PC + C+ + C +++ Y
Sbjct: 120 VPCAGCHGCPTTAPSFNPASSATFRPVPCGAPPCSQAPNPSCTSLAKSKNSCGFSLSYG- 178
Query: 176 GPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC---SHNNAHFSDEQFTGVFGLGPATSST 232
DS Q N + G + FGC S+ +A + G + T
Sbjct: 179 --DSSLDATLSQDNLAVTANGG-VIKGYTFGCLTKSNGSAAPAQGLLGLGRGPLGFVAQT 235
Query: 233 HSLVEKVGSKFSYCIGNLNYFEYAYNM---LIL---GEGAILEGDSTPMSVIDGS---YY 283
+ E FSYC+ +Y+ A N L L G+ A + +TP+ YY
Sbjct: 236 KGIYEGT---FSYCL--PSYYRSAANFSGSLTLGRKGQPAPEKMKTTPLLASPHRPSLYY 290
Query: 284 VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG 343
V + G+ +G+K + I P+ D + AG +DSGT L AY +R EV G
Sbjct: 291 VAMTGVRIGKKSVPIPPSALAF-DAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAG 349
Query: 344 LLPSYPMDPA---------WHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQES 394
L A + CY N +PA+ F GG ++ L E+V + +
Sbjct: 350 SLRRRGGGGASVSVSSLGGFDTCY----NVSTVAWPAVTLVFGGGMEVRLPEENVVIRST 405
Query: 395 -SSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S CLA+ S +G L++IG + QQN+ V +D+ + ++ F R C
Sbjct: 406 YGSTSCLAMAASPADGVN-AALNVIGSLQQQNHRVLFDVPNARVGFARERC 455
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 174/389 (44%), Gaps = 62/389 (15%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTY 146
V ++Y +G PP +DTGS ++WV C C C T+ FDP S+T
Sbjct: 77 VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 147 ATLPCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
+ + C C+ + C + C Y +Y +G + G S+ F+ G +
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMI-VGSSL 195
Query: 200 LYD----VGFGCSHNNAH---FSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGN 249
+ + V FGCS + SD G+FG G S S + G FS+C+
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255
Query: 250 LNYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKND 307
N +L+LGE I+E + TP+ Y V L IS+ + L I+P++F
Sbjct: 256 EN---GGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFS--- 307
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY-----SGNI 362
T + G ID+GTTL +L +AY + + + + P+ + CY G+I
Sbjct: 308 TSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSV--RPVVSKGNQCYVITTSVGDI 365
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQES----SSVFCLAVGPSDINGERFKD--LS 416
FP ++ +FAGGA + L+ + Q++ ++V+C+ +R ++ ++
Sbjct: 366 ------FPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGF-------QRIQNQGIT 412
Query: 417 IIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
I+G + ++ YDLV +++ + DC
Sbjct: 413 ILGDLVLKDKIFVYDLVGQRIGWANYDCS 441
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 115/421 (27%), Positives = 181/421 (42%), Gaps = 75/421 (17%)
Query: 75 SQKSSQKAHDTRAHLH-------------PGISTVPVFY-------VNFSIGQPPVPQLA 114
++ ++ +AHD R L G S VP+ + NF+IG PP P A
Sbjct: 23 TRTAAFRAHDLRRGLEQAMRGRLLADATPAGGSAVPIHWSRHLYNVANFTIGTPPQPASA 82
Query: 115 VLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDEC 167
++D L+W +C C +C F P+ S T+ PC + C T++C + C
Sbjct: 83 IIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIPTSNCSS--NMC 140
Query: 168 WY--NIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGL 225
Y I G + G + ++ F T+ +GFGC + + +G+ GL
Sbjct: 141 TYEGTINSKLGGHTLGIVATDTFAIGTATA------SLGFGCVVASGIDTMGGPSGLIGL 194
Query: 226 GPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYN-MLILGEGAILE--GDSTPMSVIDGS 281
G A S SLV ++ +KFSYC L + N L+LG A L G+ST + S
Sbjct: 195 GRAPS---SLVSQMNITKFSYC---LTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKTS 248
Query: 282 --------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTL 333
Y + L+GI G+ + + P S V + + +++LV SAYQ L
Sbjct: 249 PGDDMSQYYPIQLDGIKAGDAAIALPP---------SGNTVLVQTLAPMSFLVDSAYQAL 299
Query: 334 RKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQGFPAMAFHFAGGADLVLDAESVF- 390
+KEV + P+ P + LC+ +G N P + F F GA + +
Sbjct: 300 KKEVTKAVGAAPTATPLQP-FDLCFPKAGLSNASA---PDLVFTFQQGAAALTVPPPKYL 355
Query: 391 --YQESSSVFCLAV-GPSDINGERF-KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
E C+A+ S +N ++L+I+G + Q+N + DL K L F+ DC
Sbjct: 356 IDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADCSS 415
Query: 447 L 447
L
Sbjct: 416 L 416
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 158/363 (43%), Gaps = 41/363 (11%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSYC 156
++ ++ IG PP LD S L+W C GAT F+P +S T A +PC C
Sbjct: 99 MYVFSYGIGTPPQQVSGALDISSDLVWTAC------GATAPFNPVRSTTVADVPCTDDAC 152
Query: 157 TN--------DCGGYPDECWYNIRYTNGP-DSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
G EC Y Y G ++ G +G+E F F G T + V FGC
Sbjct: 153 QQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTF-----GDTRIDGVVFGC 207
Query: 208 S-HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA 266
N FS +GV GLG S S ++ +FSY + + + ++ G+ A
Sbjct: 208 GLQNVGDFSG--VSGVIGLGRGNLSLVSQLQV--DRFSYHFAPDDSVD-TQSFILFGDDA 262
Query: 267 ILEGDSTPMSVIDGS------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
+ T + + S YYV L GI + K L I F + GVF+
Sbjct: 263 TPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITD 322
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
+T L +AY+ LR+ V GL LCY+G + P+MA FAGGA
Sbjct: 323 LVTVLEEAAYKPLRQAVASKI-GLPAVNGSALGLDLCYTGESLAKAK-VPSMALVFAGGA 380
Query: 381 DLVLDAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
+ L+ + FY +S++ + CL + PS D S++G + Q ++ YD+ +L F
Sbjct: 381 VMELELGNYFYMDSTTGLACLTILPSSAG-----DGSVLGSLIQVGTHMMYDINGSKLVF 435
Query: 440 QRI 442
+ +
Sbjct: 436 ESL 438
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 116/395 (29%), Positives = 158/395 (40%), Gaps = 67/395 (16%)
Query: 81 KAHDTRAHLHPGISTVPVFYV-NFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCG 134
+A + L PG S YV +G P + V+DTGSSL W++C PC Q G
Sbjct: 112 QASSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAG 171
Query: 135 ATTFDPSKSLTYATLPCDSSYCTN--------DCGGYPDECWYNIRYTNGPDSQGTIGSE 186
FDP S TYA + C SS C + C Y Y + S G + +
Sbjct: 172 P-VFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKD 230
Query: 187 QFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSY 245
+F S F Y GC +N G+ GL S + L +G FSY
Sbjct: 231 TVSFG-SGSFPGFYY----GCGQDNEGLFGRS-AGLIGLAKNKLSLLYQLAPSLGYAFSY 284
Query: 246 C------------IGNLNYFEYAYNMLILGEGAILEGDSTPM--SVIDGS-YYVTLEGIS 290
C IG+ N +Y+Y TPM S +D S Y+VTL GIS
Sbjct: 285 CLPTSSAAAGYLSIGSYNPGQYSY---------------TPMASSSLDASLYFVTLSGIS 329
Query: 291 LGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM 350
+ L + P+ ++ T IDSGT +T L P+ Y L + V P P
Sbjct: 330 VAGAPLAVPPSEYRSLPT------IIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPT 383
Query: 351 DPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGE 410
C+ G+ L+ P + FAGGA L L +V S CLA P+
Sbjct: 384 YSILDTCFRGSA-AGLR-VPRVDMAFAGGATLALSPGNVLIDVDDSTTCLAFAPTG---- 437
Query: 411 RFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+IIG QQ ++V YD+ ++ F C
Sbjct: 438 ---GTAIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 469
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 165/385 (42%), Gaps = 45/385 (11%)
Query: 84 DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDP 140
+ R LH + T + IG PP ++D+GS++ +V C CEQCG F P
Sbjct: 73 NARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQP 132
Query: 141 SKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
S +Y+ + C+ CT C +C Y +Y S G +G + +F E K
Sbjct: 133 DLSSSYSPVKCNVD-CT--CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKP-- 187
Query: 201 YDVGFGCSHNN-AHFSDEQFTGVFGLGPAT-SSTHSLVEK--VGSKFSYCIGNLNYFEYA 256
FGC ++ + G+ GLG S LVEK + FS C G ++
Sbjct: 188 QHAIFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD----- 242
Query: 257 YNMLILGEGAILEGDSTPMSVIDGS--------YYVTLEGISLGEKMLDIDPNLFKKNDT 308
I G +L G P +I + Y + L+ I + K L ++ +F
Sbjct: 243 ----IGGGAMVLGGMLAPPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRIFN---- 294
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSG---NINR 364
S G +DSGTT +L A+ ++ V L DP++ +C++G N+++
Sbjct: 295 -SKHGTVLDSGTTYAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSK 353
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIA 422
+ FP + F G L L E+ ++ S +CL V NG+ +++G I
Sbjct: 354 LHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQ---NGK--DPTTLLGGII 408
Query: 423 QQNYNVAYDLVSKQLYFQRIDCELL 447
+N V YD ++++ F + +C L
Sbjct: 409 VRNTLVTYDRHNEKIGFWKTNCSEL 433
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/380 (25%), Positives = 162/380 (42%), Gaps = 45/380 (11%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C C C ++ FDP S T +
Sbjct: 80 VGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTAS 139
Query: 148 TLPCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE--TSDEGKT 198
+ C C+ C ++C Y +Y +G + G S+ NF+
Sbjct: 140 LISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTN 199
Query: 199 FLYDVGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNY 252
+ FGCS + SD G+FG G S S + G FS+C+
Sbjct: 200 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 259
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
+ + E I+ +P+ Y + L+ IS+ K L IDP +F T ++
Sbjct: 260 GGGILVLGEIVEEDIVY---SPLVPSQPHYNLNLQSISVNGKSLAIDPEVFA---TSTNR 313
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEV-EDLFQGLLPSYPMDPAWHLCYSGNINRDLQG-FP 370
G +DSGTTL +L AY + E + Q + P +L I ++G FP
Sbjct: 314 GTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQCYL-----ITSSVKGIFP 368
Query: 371 AMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
++ +FAGG + L E Q++S +V+C +G I G+ ++I+G + ++
Sbjct: 369 TVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWC--IGFQKIQGQ---GITILGDLVLKDK 423
Query: 427 NVAYDLVSKQLYFQRIDCEL 446
YDL +++ + DC +
Sbjct: 424 IFVYDLAGQRIGWANYDCSM 443
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 103/365 (28%), Positives = 154/365 (42%), Gaps = 64/365 (17%)
Query: 115 VLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYCT---NDCGGYP-- 164
++DTGS L WV+C+PC C A FDP+ S T+A +PC S C D G P
Sbjct: 197 IVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCGSPACAASLKDATGAPGS 256
Query: 165 ---------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS 215
C+Y + Y +G S+G + + T+ + F+ FGC +N
Sbjct: 257 CARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTTKLDGFV----FGCGLSNRGL- 311
Query: 216 DEQFTGVFGLGPATSSTHSLVEKVGSK----FSYCIGNLNYFEYAYNMLILGEG--AILE 269
F G GL + SLV + ++ FSYC L + L LG G +
Sbjct: 312 ---FGGTAGLMGLGRTDLSLVSQTAARFGGVFSYC---LPATTTSTGSLSLGPGPSSSFP 365
Query: 270 GDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+ + D + Y++ + G ++G P N V +DSGT +T L
Sbjct: 366 NMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGFGAGN-------VLVDSGTVITRL 418
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADL 382
PS Y+ +R E F+ YP P + + CY RD P + GGA +
Sbjct: 419 APSVYKAVRAEFARRFE-----YPAAPGFSILDACYD-LTGRDEVNVPLLTLTLEGGAQV 472
Query: 383 VLDAESVFY--QESSSVFCLAVGPSDINGERFKDLS-IIGMIAQQNYNVAYDLVSKQLYF 439
+DA + + ++ S CLA+ ++D + IIG Q+N V YD V +L F
Sbjct: 473 TVDAAGMLFVVRKDGSQVCLAMASLP-----YEDQTPIIGNYQQRNKRVVYDTVGSRLGF 527
Query: 440 QRIDC 444
DC
Sbjct: 528 ADEDC 532
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/395 (26%), Positives = 162/395 (41%), Gaps = 48/395 (12%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR +LS ++K+ A G+ P + V +G PP L LD W+ C+
Sbjct: 6 ARLQFLSSLVAKKSVVPIASGR-GVIQSPSYIVKAKVGTPPQTLLMALDNSYDAAWIPCK 64
Query: 129 PCEQCGATTFDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIG 184
C C +T F+ KS T+ TL C + C CGG C +N Y G
Sbjct: 65 GCVGCSSTVFNTVKSTTFKTLGCGAPQCKQVPNPICGG--STCTWNTTY----------G 112
Query: 185 SEQFNFETSDEGKTFLYD----VGFGC--SHNNAHFSDEQFTGVFGLGPAT--SSTHSLV 236
S + + D FGC + + G FG GP + S T +L
Sbjct: 113 SSTILSNLTRDTIALSMDPVPYYAFGCIQKATGSSVPPQGLLG-FGRGPLSFLSQTQNLY 171
Query: 237 EKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGE 293
+ S FSYC+ + ++ ++ + G +TP+ YYV L GI +G
Sbjct: 172 K---STFSYCLPSFRTLNFSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGR 228
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA 353
K++DI + N T + AG DSGT T LV AY +R E S
Sbjct: 229 KIVDIPRSALAFNPT-TGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRVGNATVSS--LGG 285
Query: 354 WHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSV---FCLAVGPSDINGE 410
+ CYS I P + F F+ G ++ + E++ ++ V +A P ++N
Sbjct: 286 FDTCYSVPIVP-----PTITFMFS-GMNVTMPPENLLIHSTAGVTSCLAMAAAPDNVNSV 339
Query: 411 RFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
L++I + QQN+ + +D+ + +L R C
Sbjct: 340 ----LNVIASMQQQNHRILFDVPNSRLGVAREQCS 370
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 149/345 (43%), Gaps = 43/345 (12%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V ++Y +G PPV +DTGS ++WV C C C T+ FDP S T +
Sbjct: 22 VGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSS 81
Query: 148 TLPCDSSYCTND-------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
+ C C N C ++C Y +Y +G + G S+ + T EG
Sbjct: 82 MIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTT 141
Query: 201 YD---VGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLN 251
V FGCS+ SD G+FG G S S + G FS+C L
Sbjct: 142 NSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHC---LK 198
Query: 252 YFEYAYNMLILGEGAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTW 309
+L+LGE I+E + S++ Y + L+ I++ + L ID ++F +++
Sbjct: 199 GDSSGGGILVLGE--IVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNS- 255
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEV-EDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
G +DSGTTL +L AY + + Q + + +L S +
Sbjct: 256 --RGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSRGNQCYLITS----SVTEV 309
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDING 409
FP ++ +FAGGA ++L + Q++S +V+C+ S + G
Sbjct: 310 FPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKSRVKG 354
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/385 (26%), Positives = 163/385 (42%), Gaps = 45/385 (11%)
Query: 84 DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDP 140
+ R LH + T + IG P ++D+GS++ +V C CEQCG F P
Sbjct: 76 NARMRLHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQP 135
Query: 141 SKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
S TY+ + C+ CT C +C Y +Y S G +G + +F E K
Sbjct: 136 DLSSTYSPVKCNVD-CT--CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP-- 190
Query: 201 YDVGFGCSH-NNAHFSDEQFTGVFGLGPAT-SSTHSLVEK--VGSKFSYCIGNLNYFEYA 256
FGC + + G+ GLG S LVEK + FS C G ++
Sbjct: 191 QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD----- 245
Query: 257 YNMLILGEGAILEGDSTPMSV-------IDGSYY-VTLEGISLGEKMLDIDPNLFKKNDT 308
+ G +L G P + + YY + L+ I + K L +DP +F
Sbjct: 246 ----VGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN---- 297
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSG---NINR 364
S G +DSGTT +L A+ + V + L DP + +C++G N+++
Sbjct: 298 -SKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQ 356
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIA 422
+ FP + F G L L E+ ++ S +CL V NG+ +++G I
Sbjct: 357 LSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ---NGK--DPTTLLGGIV 411
Query: 423 QQNYNVAYDLVSKQLYFQRIDCELL 447
+N V YD ++++ F + +C L
Sbjct: 412 VRNTLVTYDRHNEKIGFWKTNCSEL 436
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 160/375 (42%), Gaps = 59/375 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
+ + S+G PP A++DTGS L WV+C PC +C F P S +Y+ C S
Sbjct: 8 YVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDSL 67
Query: 156 CTNDCGGYP-----DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
C D P + C Y+ Y +G +++G F FET + L +GFGC HN
Sbjct: 68 C--DALPRPTCSMRNTCTYSYSYGDGSNTRG-----DFAFETVTLNGSTLARIGFGCGHN 120
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK----FSYCIGNLNYFEYAYNMLILGEGA 266
+ F G GL SL ++ S FSYC+ + + ++ + G A
Sbjct: 121 Q----EGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQST-TGTFSPITFGNAA 175
Query: 267 ---------ILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
+L+ + P YYV +E IS+G + + P+ F+ D GV +D
Sbjct: 176 ENSRASFTPLLQNEDNP-----SYYYVGVESISVGNRRVPTPPSAFRI-DANGVGGVILD 229
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP-MDP---AWHLCYS-GNINRDLQGFPAM 372
SGTT+T+ +A+ + E+ SYP DP +LCY +++ P+M
Sbjct: 230 SGTTITYWRLAAFIPILAELRRQI-----SYPEADPTPYGLNLCYDISSVSASSLTLPSM 284
Query: 373 AFHFAGGADLVLDAES--VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
H D + + V C A+ SD SIIG + QQN +
Sbjct: 285 TVHLT-NVDFEIPVSNLWVLVDNFGETVCTAMSTSD-------QFSIIGNVQQQNNLIVT 336
Query: 431 DLVSKQLYFQRIDCE 445
D+ + ++ F DC
Sbjct: 337 DVANSRVGFLATDCS 351
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 174/389 (44%), Gaps = 62/389 (15%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTY 146
V ++Y +G PP +DTGS ++WV C C C T+ FDP S+T
Sbjct: 77 VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 147 ATLPCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
+ + C C+ + C + C Y +Y +G + G S+ F+ G +
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMI-VGSSL 195
Query: 200 LYD----VGFGCSHNNAH---FSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGN 249
+ + V FGCS + SD G+FG G S S + G FS+C+
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255
Query: 250 LNYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKND 307
N +L+LGE I+E + TP+ Y V L IS+ + L I+P++F
Sbjct: 256 EN---GGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFS--- 307
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY-----SGNI 362
T + G ID+GTTL +L +AY + + + + P+ + CY G+I
Sbjct: 308 TSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSV--RPVVSKGNQCYVITTSVGDI 365
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQES----SSVFCLAVGPSDINGERFKD--LS 416
FP ++ +FAGGA + L+ + Q++ ++V+C+ +R ++ ++
Sbjct: 366 ------FPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGF-------QRIQNQGIT 412
Query: 417 IIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
I+G + ++ YDLV +++ + DC
Sbjct: 413 ILGDLVLKDKIFVYDLVGQRIGWANYDCS 441
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/386 (26%), Positives = 173/386 (44%), Gaps = 54/386 (13%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTY 146
V ++Y +G PP +DTGS ++WV C C C T+ FDP S+T
Sbjct: 77 VVGLYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 147 ATLPCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
+ C C+ + C + C Y +Y +G + G S+ F+ G +
Sbjct: 137 TPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMI-VGSSL 195
Query: 200 LYD----VGFGCSHNNAH---FSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGN 249
+ + V FGCS + SD G+FG G S S + G FS+C+
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKG 255
Query: 250 LNYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKND 307
N +L+LGE I+E + TP+ Y V L IS+ + L I+P++F
Sbjct: 256 EN---GGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFS--- 307
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS-GNINRDL 366
T + G ID+GTTL +L +AY + + + + P+ + CY D+
Sbjct: 308 TSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSV--RPVVSKGNQCYVIATSVADI 365
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQES----SSVFCLAVGPSDINGERFKD--LSIIGM 420
FP ++ +FAGGA + L+ + Q++ ++V+C+ +R ++ ++I+G
Sbjct: 366 --FPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGF-------QRIQNQGITILGD 416
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDCEL 446
+ ++ YDLV +++ + DC +
Sbjct: 417 LVLKDKIFVYDLVGQRIGWANYDCSM 442
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/380 (25%), Positives = 162/380 (42%), Gaps = 45/380 (11%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C C C ++ FDP S T +
Sbjct: 65 VGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTAS 124
Query: 148 TLPCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE--TSDEGKT 198
+ C C+ C ++C Y +Y +G + G S+ NF+
Sbjct: 125 LISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTN 184
Query: 199 FLYDVGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNY 252
+ FGCS + SD G+FG G S S + G FS+C+
Sbjct: 185 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 244
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
+ + E I+ +P+ Y + L+ IS+ K L IDP +F T ++
Sbjct: 245 GGGILVLGEIVEEDIVY---SPLVPSQPHYNLNLQSISVNGKSLAIDPEVFA---TSTNR 298
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEV-EDLFQGLLPSYPMDPAWHLCYSGNINRDLQG-FP 370
G +DSGTTL +L AY + E + Q + P +L I ++G FP
Sbjct: 299 GTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQCYL-----ITSSVKGIFP 353
Query: 371 AMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
++ +FAGG + L E Q++S +V+C +G I G+ ++I+G + ++
Sbjct: 354 TVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWC--IGFQKIQGQ---GITILGDLVLKDK 408
Query: 427 NVAYDLVSKQLYFQRIDCEL 446
YDL +++ + DC +
Sbjct: 409 IFVYDLAGQRIGWANYDCSM 428
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 159/375 (42%), Gaps = 48/375 (12%)
Query: 108 PPVPQLAVLDTGSSLIWVKCQPCEQCG-ATTFDPSKSLTYATLPCDSSYCTNDCGGY--- 163
PP V+DTGS L W++C FDP++S +Y+ +PC S C +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141
Query: 164 -----PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQ 218
C + Y + S+G + +E F+F S + FGC + + E+
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLI----FGCMGSVSGSDPEE 197
Query: 219 FTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAI---------- 267
T GL + S + ++G KFSYCI + F L+LG+
Sbjct: 198 DTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFP---GFLLLGDSNFTWLTPLNYTP 254
Query: 268 LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
L STP+ D +Y V L GI + K+L I ++ + T + +DSGT T+L+
Sbjct: 255 LIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGA-GQTMVDSGTQFTFLL 313
Query: 327 PSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINRDLQG----FPAMAFHFA 377
Y LR + + G+L P + LCY + R G P ++ F
Sbjct: 314 GPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVSLVFE 373
Query: 378 GGADLVLDAESVFYQ------ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
GA++ + + + Y+ + SV+C G SD+ G + +IG QQN + +D
Sbjct: 374 -GAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMG---MEAYVIGHHHQQNMWIEFD 429
Query: 432 LVSKQLYFQRIDCEL 446
L ++ + C++
Sbjct: 430 LQRSRIGLAPVQCDV 444
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 105/405 (25%), Positives = 170/405 (41%), Gaps = 57/405 (14%)
Query: 81 KAHDTRAH--------LHPGISTVP----VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
K+HD R H L G + P ++Y IG PP +DTGS ++WV C
Sbjct: 43 KSHDVRRHGRLLSVIDLELGGNGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNCV 102
Query: 129 PCEQCGATT--------FDPSKSLTYATLPCDSSYC--TNDC---GGYPD-ECWYNIRYT 174
C C + ++P S T + CD +C T D G PD C Y + Y
Sbjct: 103 GCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYG 162
Query: 175 NGPDSQGTIGSEQF-------NFETSDEGKTFLYDVGFGCSHNNA---HFSDEQFTGVFG 224
+G + G ++ N +TS+ + + FGC + S E G+ G
Sbjct: 163 DGSATAGYFVNDYIQLQRAVGNHKTSETNGSIV----FGCGAKQSGELGSSSEALDGILG 218
Query: 225 LGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G A SS S + KV F++C+ +++ + +GE + +TP+
Sbjct: 219 FGQANSSMISQLAATGKVKKIFAHCLDSIS----GGGIFAIGEVVEPKLKTTPVVPNQAH 274
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V L G+ +G+ LD+ LF +T G IDSGTTL +L S Y L +++
Sbjct: 275 YNVVLNGVKVGDTALDLPLGLF---ETSYKRGAIIDSGTTLAYLPDSIYLPLMEKI---- 327
Query: 342 QGLLPSYPMDPA--WHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFC 399
G P + C+ + N D GFP + F F L + +Q V+C
Sbjct: 328 LGAQPDLKLRTVDDQFTCFVFDKNVD-DGFPTVTFKFEESLILTIYPHEYLFQIRDDVWC 386
Query: 400 LAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ S + +++++G + QN V Y+L ++ + + +C
Sbjct: 387 VGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLENQTIGWTEYNC 431
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 106/388 (27%), Positives = 161/388 (41%), Gaps = 71/388 (18%)
Query: 89 LHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSK 142
L PG S V + +G P + V+DTGSSL W++C PC Q G F+P
Sbjct: 111 LSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGP-VFNPKS 169
Query: 143 SLTYATLPCDSSYCTN--------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSD 194
S TYA++ C + C++ + C Y Y + S G + + +F
Sbjct: 170 SSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSF---- 225
Query: 195 EGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCI------ 247
G T L + +GC +N G+ GL S + L +G F+YC+
Sbjct: 226 -GSTSLPNFYYGCGQDNEGLFGRS-AGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSS 283
Query: 248 -----GNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDID 299
G+ N +Y+Y TPM S+ D Y++ L G+++ L +
Sbjct: 284 GYLSLGSYNPGQYSY---------------TPMVSSSLDDSLYFIKLSGMTVAGNPLSVS 328
Query: 300 PNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL--LPSYPMDPAWHLC 357
+ + T IDSGT +T L S Y L K V +G +Y + C
Sbjct: 329 SSAYSSLPT------IIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSI---LDTC 379
Query: 358 YSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSI 417
+ G +R PA+ FAGGA L L A+++ S CLA P+ + +I
Sbjct: 380 FKGQASR--VSAPAVTMSFAGGAALKLSAQNLLVDVDDSTTCLAFAPA-------RSAAI 430
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
IG QQ ++V YD+ S ++ F C
Sbjct: 431 IGNTQQQTFSVVYDVKSSRIGFAAGGCS 458
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 145/364 (39%), Gaps = 41/364 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDSS 154
+ V +G P + DTGS + W +C+PC + C +PS S +Y + C S+
Sbjct: 119 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 178
Query: 155 YCTNDCGGYP-------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C G C Y ++Y +G S G +E +S+ K FL FGC
Sbjct: 179 LCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFL----FGC 234
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
N + + + FSYC L + L LG
Sbjct: 235 GQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYC---LPASSSSKGYLSLGGQVS 291
Query: 268 LEGDSTPMSV-IDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
TP+S D + Y + + G+S+G + L ID + F AG IDSGT +T
Sbjct: 292 KSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF-------SAGTVIDSGTVITR 344
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGAD 381
L P+AY E+ FQ L+ YP + + CY + D P + F GG +
Sbjct: 345 LSPTAYS----ELSSAFQNLMTDYPSTSGYSIFDTCYDFS-KYDTVRIPKVGVTFKGGVE 399
Query: 382 LVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ +D + Y + CLA +D + D SI G + Q+ Y V YD ++ F
Sbjct: 400 MDIDVSGILYPVNGLKKVCLAFAGNDDD----SDTSIFGNVQQRTYQVVYDGAKGRVGFA 455
Query: 441 RIDC 444
C
Sbjct: 456 PGGC 459
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 145/364 (39%), Gaps = 41/364 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDSS 154
+ V +G P + DTGS + W +C+PC + C +PS S +Y + C S+
Sbjct: 131 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 190
Query: 155 YCTNDCGGYP-------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C G C Y ++Y +G S G +E +S+ K FL FGC
Sbjct: 191 LCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFL----FGC 246
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
N + + + FSYC L + L LG
Sbjct: 247 GQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYC---LPASSSSKGYLSLGGQVS 303
Query: 268 LEGDSTPMSV-IDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
TP+S D + Y + + G+S+G + L ID + F AG IDSGT +T
Sbjct: 304 KSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF-------SAGTVIDSGTVITR 356
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGAD 381
L P+AY E+ FQ L+ YP + + CY + D P + F GG +
Sbjct: 357 LSPTAYS----ELSSAFQNLMTDYPSTSGYSIFDTCYDFS-KYDTVRIPKVGVTFKGGVE 411
Query: 382 LVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ +D + Y + CLA +D + D SI G + Q+ Y V YD ++ F
Sbjct: 412 MDIDVSGILYPVNGLKKVCLAFAGNDDD----SDTSIFGNVQQRTYQVVYDGAKGRVGFA 467
Query: 441 RIDC 444
C
Sbjct: 468 PGGC 471
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 120/437 (27%), Positives = 182/437 (41%), Gaps = 72/437 (16%)
Query: 61 QRTLNMSMARFIYLSQKSSQKAHDTRAHLH--PGISTVPVFYVNFSIGQPPVPQLAVLDT 118
+R + S R + + A +A + P + + V IG PP A +DT
Sbjct: 49 RRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAIDT 108
Query: 119 GSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYC----TNDCGGYPDE-CWYN 170
S LIW +CQPC C F+P S TYA LPC S C + CG DE C Y
Sbjct: 109 ASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYT 168
Query: 171 IRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS-DEQFTGVFGLGPAT 229
Y+ ++GT+ ++ G+ V FGCS ++ + Q +GV GLG
Sbjct: 169 YTYSGNATTEGTLAVDKLVI-----GEDAFRGVAFGCSTSSTGGAPPPQASGVVGLG--- 220
Query: 230 SSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDST-----PMS---VIDG 280
SLV ++ +F+YC+ L+LG A ++T PM
Sbjct: 221 RGPLSLVSQLSVRRFAYCLPPPA--SRIPGKLVLGADADAARNATNRIAVPMRRDPRYPS 278
Query: 281 SYYVTLEGISLGEKMLDI---------------------DPNLFKKNDTWSDA---GVFI 316
YY+ L+G+ +G++ + + PN DA G+ I
Sbjct: 279 YYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPN--ATAVAVGDANRYGMII 336
Query: 317 DSGTTLTWLVPSAYQTLRKEVE---DLFQGLLPSYPMDPAWHLCY--SGNINRDLQGFPA 371
D +T+T+L S Y L ++E L +G S +D LC+ + D PA
Sbjct: 337 DIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLD----LCFILPDGVAFDRVYVPA 392
Query: 372 MAFHFAGGADLVLDAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+A F G L LD +F ++ S + CL VG ++ +SI+G QQN V Y
Sbjct: 393 VALAF-DGRWLRLDKARLFAEDRESGMMCLMVGRAEAG-----SVSILGNFQQQNMQVLY 446
Query: 431 DLVSKQLYFQRIDCELL 447
+L ++ F + C L
Sbjct: 447 NLRRGRVTFVQSPCGAL 463
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 104/388 (26%), Positives = 165/388 (42%), Gaps = 46/388 (11%)
Query: 82 AH-DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TT 137
AH + R LH + T + IG PP ++D+GS++ +V C CEQCG
Sbjct: 71 AHPNARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPR 130
Query: 138 FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK 197
F P S +Y+ + C+ CT C +C Y +Y S G +G + +F E K
Sbjct: 131 FQPDLSSSYSPVKCNVD-CT--CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELK 187
Query: 198 TFLYDVGFGCSHNN-AHFSDEQFTGVFGLGPAT-SSTHSLVEK--VGSKFSYCIGNLNYF 253
FGC ++ + G+ GLG S LVEK + FS C G ++
Sbjct: 188 P--QRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD-- 243
Query: 254 EYAYNMLILGEGAILEGDSTPMSVI-------DGSYY-VTLEGISLGEKMLDIDPNLFKK 305
I G +L G P ++ YY + L+ I + K L +D +F
Sbjct: 244 -------IGGGAMVLGGVPAPSDMVFSHSDPLRSPYYNIELKEIHVAGKALRVDSRVFN- 295
Query: 306 NDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSG---N 361
S G +DSGTT +L A+ + V L DP + +C++G N
Sbjct: 296 ----SKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRN 351
Query: 362 INRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIG 419
+++ + FP + F G L L E+ ++ S +CL V NG+ +++G
Sbjct: 352 VSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQ---NGK--DPTTLLG 406
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDCELL 447
I +N V YD ++++ F + +C L
Sbjct: 407 GIIVRNTLVTYDRHNEKIGFWKTNCSEL 434
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 162/388 (41%), Gaps = 56/388 (14%)
Query: 99 FYVNFSIGQPPVPQLAVL--DTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDS 153
+ ++ IG P PQ VL DTGS L+W +C C C F S S T++ +PC
Sbjct: 94 YLIHLGIGTP-RPQRVVLHLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSD 151
Query: 154 SYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF--LYDVG 204
C + C C+Y Y + + G + + F F+ D T + ++
Sbjct: 152 PLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIR 211
Query: 205 FGCSHNNAHFSDEQFTGV--FGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL 262
FGC N +G+ FG GP + + V + FSYC + E + +IL
Sbjct: 212 FGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQLKVRR----FSYCFTAME--ESRVSPVIL 265
Query: 263 G-EGAILEGDST-----------PMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKND 307
G E +E +T P GS Y+++L G+++GE L + + F
Sbjct: 266 GGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKG 325
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP--MDPAWHLCYSGNINRD 365
S G FIDSGT +T+ + +++LR+ + Q LP DP LC+S +
Sbjct: 326 DGS-GGTFIDSGTAITFFPQAVFRSLREAF--VAQVPLPVAKGYTDPDNLLCFSVPAKKK 382
Query: 366 LQGFPAMAFHFAGG------ADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIG 419
P + H G + VLD + + + + + NG +IIG
Sbjct: 383 APAVPKLILHLEGADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNG------TIIG 436
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDCELL 447
QQN ++ YDL S ++ F C+ L
Sbjct: 437 NFQQQNMHIVYDLESNKMVFAPARCDKL 464
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 120/437 (27%), Positives = 182/437 (41%), Gaps = 72/437 (16%)
Query: 61 QRTLNMSMARFIYLSQKSSQKAHDTRAHLH--PGISTVPVFYVNFSIGQPPVPQLAVLDT 118
+R + S R + + A +A + P + + V IG PP A +DT
Sbjct: 49 RRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAIDT 108
Query: 119 GSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYC----TNDCGGYPDE-CWYN 170
S LIW +CQPC C F+P S TYA LPC S C + CG DE C Y
Sbjct: 109 ASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYT 168
Query: 171 IRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS-DEQFTGVFGLGPAT 229
Y+ ++GT+ ++ G+ V FGCS ++ + Q +GV GLG
Sbjct: 169 YTYSGNATTEGTLAVDKLVI-----GEDAFRGVAFGCSTSSTGGAPPPQASGVVGLG--- 220
Query: 230 SSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDST-----PMS---VIDG 280
SLV ++ +F+YC+ L+LG A ++T PM
Sbjct: 221 RGPLSLVSQLSVRRFAYCLPPPA--SRIPGKLVLGADADAARNATNRIAVPMRRDPRYPS 278
Query: 281 SYYVTLEGISLGEKMLDI---------------------DPNLFKKNDTWSDA---GVFI 316
YY+ L+G+ +G++ + + PN DA G+ I
Sbjct: 279 YYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPN--ATAVAVGDANRYGMII 336
Query: 317 DSGTTLTWLVPSAYQTLRKEVE---DLFQGLLPSYPMDPAWHLCY--SGNINRDLQGFPA 371
D +T+T+L S Y L ++E L +G S +D LC+ + D PA
Sbjct: 337 DIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLD----LCFILPDGVAFDRVYVPA 392
Query: 372 MAFHFAGGADLVLDAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+A F G L LD +F ++ S + CL VG ++ +SI+G QQN V Y
Sbjct: 393 VALAF-DGRWLRLDKARLFAEDRESGMMCLMVGRAEAG-----SVSILGNFQQQNMQVLY 446
Query: 431 DLVSKQLYFQRIDCELL 447
+L ++ F + C L
Sbjct: 447 NLRRGRVTFVQSPCGAL 463
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 145/364 (39%), Gaps = 41/364 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---GATTFDPSKSLTYATLPCDSS 154
+ V +G P + DTGS + W +C+PC + C +PS S +Y + C S+
Sbjct: 71 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 130
Query: 155 YCTNDCGGYP-------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C G C Y ++Y +G S G +E +S+ K FL FGC
Sbjct: 131 LCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFL----FGC 186
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
N + + + FSYC L + L LG
Sbjct: 187 GQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYC---LPASSSSKGYLSLGGQVS 243
Query: 268 LEGDSTPMSV-IDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
TP+S D + Y + + G+S+G + L ID + F AG IDSGT +T
Sbjct: 244 KSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAF-------SAGTVIDSGTVITR 296
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGAD 381
L P+AY E+ FQ L+ YP + + CY + D P + F GG +
Sbjct: 297 LSPTAYS----ELSSAFQNLMTDYPSTSGYSIFDTCYDFS-KYDTVRIPKVGVTFKGGVE 351
Query: 382 LVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ +D + Y + CLA +D + D SI G + Q+ Y V YD ++ F
Sbjct: 352 MDIDVSGILYPVNGLKKVCLAFAGNDDD----SDTSIFGNVQQRTYQVVYDGAKGRVGFA 407
Query: 441 RIDC 444
C
Sbjct: 408 PGGC 411
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 111/368 (30%), Positives = 164/368 (44%), Gaps = 60/368 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V+ +IG PP P LDTGS LIW +CQPC C FDPS S T + CDS+
Sbjct: 89 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 148
Query: 156 CTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS-HNNAHF 214
C G P S++F F + + V FGC NN F
Sbjct: 149 CQ----GLPVASLPR--------------SDKFTFVGAGAS---VPGVAFGCGLFNNGVF 187
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLN-------YFEYAYNMLILGEGAI 267
+ TG+ G G S S + KVG+ FS+C + + ++ G+GA+
Sbjct: 188 KSNE-TGIAGFGRGPLSLPSQL-KVGN-FSHCFTTITGAIPSTVLLDLPADLFSNGQGAV 244
Query: 268 LEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFK-KNDTWSDAGVFIDSGTTLT 323
+TP+ + YY++L+GI++G L + + F KN T G IDSGT +T
Sbjct: 245 ---QTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGT---GGTIIDSGTAMT 298
Query: 324 WLVPSAYQTLRKEVEDLFQ-GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
L Y+ +R + ++ DP + C S + R P + HF GA +
Sbjct: 299 SLPTRVYRLVRDAFAAQVKLPVVSGNTTDP--YFCLSAPL-RAKPYVPKLVLHFE-GATM 354
Query: 383 VLDAESVFYQ---ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
L E+ ++ SS+ CLA+ I G +++ IG QQN +V YDL + +L F
Sbjct: 355 DLPRENYVFEVEDAGSSILCLAI----IEG---GEVTTIGNFQQQNMHVLYDLQNSKLSF 407
Query: 440 QRIDCELL 447
C+ L
Sbjct: 408 VPAQCDKL 415
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 168/405 (41%), Gaps = 63/405 (15%)
Query: 81 KAHDTRAHL---------HPGISTVPV------FYVNFSIGQPPVPQLAVLDTGSSLIWV 125
+AH+ R L PG TVPV + VN +IG PP P A++D G L+W
Sbjct: 18 RAHELRRGLELADDATTARPGGVTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELVWT 77
Query: 126 KC-QPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGP 177
+C Q C +C FD + S T+ PC ++ C T C G T+
Sbjct: 78 QCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAAVCESIPTRSCAGDGGGACGYEASTSFG 137
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
+ G IG++ T+ + + FGC+ + + +G GLG + SL
Sbjct: 138 RTVGRIGTDAVAIGTAATAR-----LAFGCAVASEMDTMWGSSGSVGLG---RTNLSLAA 189
Query: 238 KV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEG-------------DSTPMSVIDGSYY 283
++ + FSYC+ + + + L LG A L G + P S + SY
Sbjct: 190 QMNATAFSYCLAPPDTGK--SSALFLGASAKLAGAGKGAGTTPFVKTSTPPHSGLSRSYL 247
Query: 284 VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG 343
+ LE I G + + S + + + T +T LV S Y+ LRK V D G
Sbjct: 248 LRLEAIRAGNATIAMP---------QSGNTIMVSTATPVTALVDSVYRDLRKAVADAV-G 297
Query: 344 LLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVG 403
P P + LC+ G P + F GGA++ + S + + C+A+
Sbjct: 298 AAPVPPPVQNYDLCFPKASASG--GAPDLVLAFQGGAEMTVPVSSYLFDAGNDTACVAI- 354
Query: 404 PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLA 448
+ +SI+G + Q N ++ +DL + L F+ DC L+
Sbjct: 355 ---LGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCSALS 396
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 161/378 (42%), Gaps = 40/378 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT------FDPSKSLTYATLPCD 152
++V +G P ++DTGS L W++C P ++ +D S S +Y +PC
Sbjct: 59 YFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCT 118
Query: 153 SSYCT-------NDCG-GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEG-------- 196
C + C P C Y Y++ + G + E + ++
Sbjct: 119 DDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHK 178
Query: 197 --KTFLYDVGFGCSHNNAHFSDEQFTGVFGL--GPATSSTHSLVEKVGSKFSYCIGNLNY 252
+ + +V GCS + S +GV GL GP + +T + +G FSYC+ +
Sbjct: 179 TRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCLVDYLR 238
Query: 253 FEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
A + L++G + TP+ YYV + G+++ K +D + D
Sbjct: 239 GSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDWGIDGD 298
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP-AWHLCYSGNINRDLQG 368
+ G DSGTTL++L AY + + LP P + LCY N+ R +G
Sbjct: 299 GNKGTIFDSGTTLSYLREPAYSKVLGALNASI--YLPRAQEIPEGFELCY--NVTRMEKG 354
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGP-SDINGERFKDLSIIGMIAQQNYN 427
P + F GGA + L + + +V C+A+ + NG +I+G + QQ+++
Sbjct: 355 MPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGS-----NILGNLLQQDHH 409
Query: 428 VAYDLVSKQLYFQRIDCE 445
+ YDL ++ F+ C
Sbjct: 410 IEYDLAKARIGFKWSPCH 427
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 105/405 (25%), Positives = 170/405 (41%), Gaps = 57/405 (14%)
Query: 81 KAHDTRAH--------LHPGISTVP----VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
K+HD R H L G + P ++Y IG PP +DTGS ++WV C
Sbjct: 43 KSHDVRRHGRLLSVIDLELGGNGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNCV 102
Query: 129 PCEQCGATT--------FDPSKSLTYATLPCDSSYC--TNDC---GGYPD-ECWYNIRYT 174
C C + ++P S T + CD +C T D G PD C Y + Y
Sbjct: 103 GCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYG 162
Query: 175 NGPDSQGTIGSEQF-------NFETSDEGKTFLYDVGFGCSHNNA---HFSDEQFTGVFG 224
+G + G ++ N +TS+ + + FGC + S E G+ G
Sbjct: 163 DGSATAGYFVNDYIQLQRAVGNHKTSETNGSIV----FGCGAKQSGELGSSSEALDGILG 218
Query: 225 LGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G A SS S + KV F++C+ +++ + +GE + +TP+
Sbjct: 219 FGQANSSMISQLAATGKVKKIFAHCLDSIS----GGGIFAIGEVVEPKLXNTPVVPNQAH 274
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V L G+ +G+ LD+ LF +T G IDSGTTL +L S Y L +++
Sbjct: 275 YNVVLNGVKVGDTALDLPLGLF---ETSYKRGAIIDSGTTLAYLPESIYLPLMEKI---- 327
Query: 342 QGLLPSYPMDPA--WHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFC 399
G P + C+ + N D GFP + F F L + +Q V+C
Sbjct: 328 LGAQPDLKLRTVDDQFTCFVFDKNVD-DGFPTVTFKFEESLILTIYPHEYLFQIRDDVWC 386
Query: 400 LAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ S + +++++G + QN V Y+L ++ + + +C
Sbjct: 387 VGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLENQTIGWTEYNC 431
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 153/379 (40%), Gaps = 45/379 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ-----PCEQCGATTFDPSKSLTYATLPCDS 153
++V F +G P P + V DTGS L WVKC+ P A F S+S ++A L C S
Sbjct: 105 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSS 164
Query: 154 SYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE----------TSDEG 196
CT+ +C C Y+ RY +G ++G +G++
Sbjct: 165 DTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGR 224
Query: 197 KTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEY 255
+ L V GC+ S + GV LG + S S + G +FSYC+ +
Sbjct: 225 RAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 284
Query: 256 AYNMLIL---GEGAILEGDSTPMSVIDGSY-----YVTLEGISLGEKMLDIDPNLFKKND 307
A + L EG TP+ V+D GE LDI +++ D
Sbjct: 285 ASSYLTFGPGPEGGGAPAARTPL-VLDRRVSPFYAVAVDAVYVAGE-ALDIPADVW---D 339
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ 367
G +DSGT+LT L AY+ + + LP MDP + CY N
Sbjct: 340 VGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAA-LPRVAMDP-FEYCY--NWTAGAP 395
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P + FAG A L A+S + V C+ V G +S+IG I QQ +
Sbjct: 396 EIPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPG-----VSVIGNILQQEHL 450
Query: 428 VAYDLVSKQLYFQRIDCEL 446
+DL + L F+ C L
Sbjct: 451 WEFDLRDRWLRFKHTRCAL 469
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 124/410 (30%), Positives = 165/410 (40%), Gaps = 62/410 (15%)
Query: 67 SMARFIYLSQKSSQ--KAHDTRAHL--HPGISTVPV--------FYVNFSIGQPPVPQLA 114
S R +L+ +SSQ K + A + TVP+ + + FSIG PP A
Sbjct: 56 SHRRLSFLASRSSQVDKPQSSSASQLSNNDTDTVPLRMDGGGGAYDMEFSIGTPPQKLTA 115
Query: 115 VLDTGSSLIWVKCQ---PCEQCGATTFDPSKSLTYATLPCD-------SSYCTNDCGGYP 164
+ DTGS LIW KC G++++ P+ S T+ LPC SY C
Sbjct: 116 LADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGG 175
Query: 165 DECWYNIRYTNGPD---SQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD-EQFT 220
EC Y Y G D +QG +GSE F G + VGFGC+ A D +
Sbjct: 176 AECDYKYAYGLGDDPDFTQGFLGSETFTL-----GGDAVPGVGFGCT--TALEGDYGEGA 228
Query: 221 GVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEG-----DSTPM 275
G+ GLG S S ++ F YC L + L+ G A + G ST +
Sbjct: 229 GLVGLGRGPLSLVSQLDA--GTFMYC---LTADASKASPLLFGALATMTGAGAGVQSTGL 283
Query: 276 SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRK 335
Y V L I++G GV DSGTTLT+L AY +
Sbjct: 284 LASTTFYAVNLRSITIGSA---------TTAGVGGPGGVVFDSGTTLTYLAEPAYTEAKA 334
Query: 336 EVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS 395
L P + CY + L PAM HF GGAD+ L + +
Sbjct: 335 AFLSQTTSLTP-VEGRYGFEACYEKPDSARL--IPAMVLHFDGGADMALPVANYVVEVDD 391
Query: 396 SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
V C V +R LSIIG I Q NY V +D+ L FQ +C+
Sbjct: 392 GVVCWVV-------QRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANCD 434
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 122/284 (42%), Gaps = 38/284 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V+ ++G PP P LDTGS L+W +C PC C G DP+ S TYA LPC +
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASSTYAALPCGAPR 145
Query: 156 CT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF-----ETSDEGKTFLYDVGFG 206
C CGG C Y Y + + G I +++F F D + FG
Sbjct: 146 CRALPFTSCGG--RSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLTFG 203
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA 266
C H N TG+ G G S S + + FSYC ++ F+ +++ LG
Sbjct: 204 CGHFNKGVFQSNETGIAGFGRGRWSLPSQLN--ATSFSYCFTSM--FDSKSSIVTLGGAP 259
Query: 267 IL--------EGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
E +TP+ Y+++L+GIS+G+ L + F+
Sbjct: 260 AALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRST--------I 311
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYS 359
IDSG ++T L Y+ ++ E GL PS A +C++
Sbjct: 312 IDSGASITTLPEEVYEAVKAEFAAQV-GLPPSGVEGSALDVCFA 354
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 106/429 (24%), Positives = 178/429 (41%), Gaps = 59/429 (13%)
Query: 37 RLVTKL--LHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGIS 94
R +T++ LH+ L N +TV +Q Q+ + + ++ ++A A L G++
Sbjct: 106 RDLTRIQTLHKRVLEKNNQNTV-SQKQKKNDKEVVTTTPVASSVEEQAGQLVATLESGMT 164
Query: 95 T-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDS 153
++++ +G PP +LDTGS L W++C PC C
Sbjct: 165 LGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDC-------------------- 204
Query: 154 SYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLYDVG---FGCSH 209
+ ND P WY + ++ G E F T++ G + LY+V FGC H
Sbjct: 205 -FQQNDNQSCPYYYWYG----DSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGH 259
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
N + S + L G FSYC+ + N + LI GE L
Sbjct: 260 WNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLL 319
Query: 270 GD---------STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWS-----DAGVF 315
+ +++D YYV ++ I + ++L+I +TW+ G
Sbjct: 320 SHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNI------PEETWNISSDGAGGTI 373
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
IDSGTTL++ AY+ ++ ++ + +G P Y P C++ + ++Q P +
Sbjct: 374 IDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQ-LPELGIA 432
Query: 376 FAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
FA GA E+ F + + CLA + G SIIG QQN+++ YD
Sbjct: 433 FADGAVWNFPTENSFIWLNEDLVCLA-----MLGTPKSAFSIIGNYQQQNFHILYDTKRS 487
Query: 436 QLYFQRIDC 444
+L + C
Sbjct: 488 RLGYAPTKC 496
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 110/393 (27%), Positives = 168/393 (42%), Gaps = 40/393 (10%)
Query: 70 RFIYLSQKSSQKAHDTRAHLHPGISTVPV--FYVNFSIGQPPVPQLAVLDTGSSLIWVKC 127
R YLS + QK T + PG + + + V +G P VLDT + WV C
Sbjct: 69 RLKYLSTLADQKT--TAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPC 126
Query: 128 QPCEQCGATTFDPSKSLTYATLPCDSSYCTNDCG-GYP----DECWYNIRYTNGPDSQGT 182
C C +TTF P+ S T +L C + C+ G P C +N Y G DS T
Sbjct: 127 SGCTGCSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSY--GGDSSLT 184
Query: 183 IGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK 242
Q +++ + FGC N G+ GLG SL+ + G+
Sbjct: 185 ATLVQDAITLAND---VIPGFTFGC-INAVSGGSIPPQGLLGLG---RGPISLISQAGAM 237
Query: 243 ----FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKM 295
FSYC+ + + ++ ++ + G +TP+ YYV L G+S+G
Sbjct: 238 YSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIK 297
Query: 296 LDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH 355
+ I P+ D + AG IDSGT +T V Y +R E G + S A+
Sbjct: 298 VPI-PSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSL---GAFD 353
Query: 356 LCYSGNINRDLQGFPAMAFHFAGGADLVLDAE-SVFYQESSSVFCL--AVGPSDINGERF 412
C++ + PA+ HF G +LVL E S+ + S S+ CL A P+++N
Sbjct: 354 TCFAATNEAEA---PAITLHFE-GLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSV-- 407
Query: 413 KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
L++I + QQN + +D + +L R C
Sbjct: 408 --LNVIANLQQQNLRIMFDTTNSRLGIARELCN 438
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 165/387 (42%), Gaps = 63/387 (16%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYATL 149
++Y +G PP P +DTGS ++WV C+PC C T+ FDP S T + L
Sbjct: 40 LYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPL 99
Query: 150 PC-----------DSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE------- 191
C S CT D C Y+ Y +G + G S++F++
Sbjct: 100 SCIDSKCVSSNQISESVCTTD-----RYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYV 154
Query: 192 TSDEGKTFLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSY 245
T++ + FGCS+N + D G+FG G S S + G FS+
Sbjct: 155 TNNASA----KITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSH 210
Query: 246 CIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKK 305
C L + +L+LGE TP+ Y + L+GI++ + L IDP +F
Sbjct: 211 C---LEGADPGGGILVLGEITEPGMVYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFAT 267
Query: 306 NDTWSDAGVFIDSGTTLTWLVPSAYQ----TLRKEVEDLFQGLLPSYPMDPAWHLCYSGN 361
+T G ID GTTL +L AY+ T+ V Q + +P + +S +
Sbjct: 268 TNT---RGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFM--LKGNPCFLTVHSID 322
Query: 362 INRDLQGFPAMAFHFAGGADLVLDAESVFYQE----SSSVFCLAVGPSDINGERFKDLSI 417
+ FP++ +F GA + L + Q+ SS V+C+ S ++I
Sbjct: 323 -----EIFPSVTLYFE-GAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTI 376
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+G + ++ YDL ++++ + DC
Sbjct: 377 LGDLVLKDKVFVYDLENQRIGWTSFDC 403
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 164/380 (43%), Gaps = 38/380 (10%)
Query: 92 GISTVP-VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFDPSK 142
G+ TV +++ +G P +DTGS ++WV C C +C G T +DP +
Sbjct: 61 GLPTVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKR 120
Query: 143 SLTYATLPCDSSYCTNDCGGY------PDECWYNIRYTNGPDSQG-------TIGSEQFN 189
S T + C+ ++C++ G + C Y+I Y +G + G T N
Sbjct: 121 SKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGN 180
Query: 190 FETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE---KVGSKFSYC 246
T+ + + ++ G S A S+E G+ G G A SS S + KV FS+C
Sbjct: 181 PHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHC 240
Query: 247 IGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKN 306
+ + +GE + +TP+ Y V L+ I + +L + + F
Sbjct: 241 LDT----NVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTF--- 293
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSGNINRD 365
D+ + G IDSGTTL +L Y L +V Q L Y ++ + Y+GN++
Sbjct: 294 DSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAK-QPRLKVYLVEEQYSCFQYTGNVD-- 350
Query: 366 LQGFPAMAFHFAGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
GFP + HF L V + +F + S +C+ S + KD++++G
Sbjct: 351 -SGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLS 409
Query: 425 NYNVAYDLVSKQLYFQRIDC 444
N V YDL + + + +C
Sbjct: 410 NKLVVYDLENMTIGWTDYNC 429
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 162/378 (42%), Gaps = 40/378 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT------FDPSKSLTYATLPCD 152
++V +G P ++DTGS L W++C P ++ +D S S +Y +PC
Sbjct: 27 YFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCT 86
Query: 153 SSYCT-------NDCG-GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDE-GKTF---- 199
C + C P C Y Y++ + G + E + ++ GK
Sbjct: 87 DDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHK 146
Query: 200 -----LYDVGFGCSHNNAHFSDEQFTGVFGL--GPATSSTHSLVEKVGSKFSYCIGNLNY 252
+ +V GCS + S +GV GL GP + +T + +G FSYC+ +
Sbjct: 147 TRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCLVDYLR 206
Query: 253 FEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
A + L++G + TP+ YYV + G+++ K +D + D
Sbjct: 207 GSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDWGIDGD 266
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP-AWHLCYSGNINRDLQG 368
+ G DSGTTL++L AY + + LP P + LCY N+ R +G
Sbjct: 267 GNKGTIFDSGTTLSYLREPAYSKVLGALNASI--YLPRAQEIPEGFELCY--NVTRMEKG 322
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGP-SDINGERFKDLSIIGMIAQQNYN 427
P + F GGA + L + + +V C+A+ + NG +I+G + QQ+++
Sbjct: 323 MPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGS-----NILGNLLQQDHH 377
Query: 428 VAYDLVSKQLYFQRIDCE 445
+ YDL ++ F+ C
Sbjct: 378 IEYDLAKARIGFKWSPCH 395
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 154/387 (39%), Gaps = 32/387 (8%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR +LS +K+ A I P + V ++G P L LDT + W+ C
Sbjct: 61 ARLQFLSSLVGRKSWVPIASGR-QIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCN 119
Query: 129 PCEQCGATTFDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIG 184
C C +T F+ S T+ TL CD+ C CGG C +N Y G+
Sbjct: 120 GCVGCSSTVFNSVTSTTFKTLGCDAPQCKQVPNPTCGG--STCTWNTTY------GGSTI 171
Query: 185 SEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFS 244
+T + FGC S + S + S FS
Sbjct: 172 LSNLTRDTIALSTDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFS 231
Query: 245 YCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPN 301
YC+ + ++ + + G L +TP+ YYV L GI +G K++DI +
Sbjct: 232 YCLPSFRTLNFSGTLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPAS 291
Query: 302 LFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGN 361
N T + AG DSGT T LV Y +R E + S + CY+G
Sbjct: 292 ALAFNPT-TGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSS--LGGFDTCYTGP 348
Query: 362 INRDLQGFPAMAFHFAGGADLVLDAESVFYQE---SSSVFCLAVGPSDINGERFKDLSII 418
I P M F F+ G ++ L +++ + S+S +A P ++N L++I
Sbjct: 349 IVA-----PTMTFMFS-GMNVTLPTDNLLIRSTAGSTSCLAMAAAPDNVNSV----LNVI 398
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ QQN+ + +D+ + ++ R C
Sbjct: 399 ANMQQQNHRILFDVPNSRIGVAREPCS 425
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 148/359 (41%), Gaps = 43/359 (11%)
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPC--EQC---GATTFDPSKSLTYATLPCDSSYCTN 158
+I P + Q +DT L W++C PC +C FDP +S T A +PC S+ C
Sbjct: 154 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 213
Query: 159 ----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
G ++C Y + Y +G + GT + S T + + FGCSH
Sbjct: 214 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPS----TVVMNFRFGCSHAVRGN 269
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDST 273
+G LG S S G+ FSYC+ + + + G T
Sbjct: 270 FSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFART 329
Query: 274 PM----SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSA 329
P+ S+I Y V L GI +G + L++ P +F G +DS +T L P+A
Sbjct: 330 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA-------GGAVMDSSVIITQLPPTA 382
Query: 330 YQTLRKEVEDLFQGLLPSYPM----DPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
Y+ LR F+ + +YP CY + PA++ F GGA + LD
Sbjct: 383 YRALRLA----FRSAMAAYPRVAGGRAGLDTCYD-FVRFTSVTVPAVSLVFDGGAVVRLD 437
Query: 386 AESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
A V + CLA P+ + L IG + QQ + V YD+ + F+R C
Sbjct: 438 AMGVMVEG-----CLAFVPTPGDFA----LGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 154/387 (39%), Gaps = 32/387 (8%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR +LS +K+ A I P + V ++G P L LDT + W+ C
Sbjct: 61 ARLQFLSSLVGRKSWVPIASGR-QIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCN 119
Query: 129 PCEQCGATTFDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTIG 184
C C +T F+ S T+ TL CD+ C CGG C +N Y G+
Sbjct: 120 GCVGCSSTVFNSVTSTTFKTLGCDAPQCKQVPNPTCGG--STCTWNTTY------GGSTI 171
Query: 185 SEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFS 244
+T + FGC S + S + S FS
Sbjct: 172 LSNLTRDTIALSTDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFS 231
Query: 245 YCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPN 301
YC+ + ++ + + G L +TP+ YYV L GI +G K++DI +
Sbjct: 232 YCLPSFRTLNFSGTLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPAS 291
Query: 302 LFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGN 361
N T + AG DSGT T LV Y +R E + S + CY+G
Sbjct: 292 ALAFNPT-TGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSS--LGGFDTCYTGP 348
Query: 362 INRDLQGFPAMAFHFAGGADLVLDAESVFYQE---SSSVFCLAVGPSDINGERFKDLSII 418
I P M F F+ G ++ L +++ + S+S +A P ++N L++I
Sbjct: 349 IVA-----PTMTFMFS-GMNVTLPPDNLLIRSTAGSTSCLAMAAAPDNVNSV----LNVI 398
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ QQN+ + +D+ + ++ R C
Sbjct: 399 ANMQQQNHRILFDVPNSRIGVAREPCS 425
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 123/434 (28%), Positives = 176/434 (40%), Gaps = 93/434 (21%)
Query: 64 LNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSI------------------ 105
L R Y+ +K+S A D L+P V + +F++
Sbjct: 82 LRWDQVRTEYVRRKASGGAEDV---LNPAKPRVLMSQTDFAVRSPFGVGSGSGSSAWIDA 138
Query: 106 -GQPPV--PQLAVLDTGSSLIWVKCQPCE--QCGATT---FDPSKSLTYATLPCDSSYCT 157
G P V Q +DT + W++C PC QC FDP+ S T A + C S C
Sbjct: 139 DGDPTVVSQQTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACR 198
Query: 158 ------NDCGGYPD--ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
N C EC Y I Y++ + GT ++ G T + + FGCSH
Sbjct: 199 SLGPYGNGCSNRSANAECRYLIEYSDDRATAGTYMTDTLTIS----GTTAVRNFRFGCSH 254
Query: 210 N-NAHFSDEQFTGVFGLGPATSSTHSLVEK-VGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
FSD G LG S + + +G+ FSYC+ + A L +G A
Sbjct: 255 AVRGRFSDLT-AGTMSLGGGAQSLLAQTARSLGNAFSYCVPQAS----ASGFLSIGGPAT 309
Query: 268 LEGDS----TPM--SVIDGSYY-VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
+ TP+ S I+ S Y V L+GI + + L I P F AG +DS
Sbjct: 310 TNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPPVAFS-------AGAVMDSSA 362
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA---WHLCYSGNINRDLQGF-----PAM 372
+T L P+AY+ LR+ F+ + +YP A CY D G PA+
Sbjct: 363 VITQLPPTAYRALRRA----FRNAMRAYPRSGATGTLDTCY------DFLGLTNVRVPAV 412
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLA--VGPSDINGERFKDLSIIGMIAQQNYNVAY 430
+ F GGA +VLD +V CLA SD+ L IG + QQ + V Y
Sbjct: 413 SLVFGGGAVVVLDPPAVMIGG-----CLAFTATSSDLA------LGFIGNVQQQTHEVLY 461
Query: 431 DLVSKQLYFQRIDC 444
D+ + + F+R C
Sbjct: 462 DVAAGGVGFRRGAC 475
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 148/361 (40%), Gaps = 59/361 (16%)
Query: 110 VPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFDPSKSLTYATLPCDSSYCT------N 158
V Q VLDT S + WV+C PC +DP+KS + C+S CT N
Sbjct: 142 VTQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYAN 201
Query: 159 DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN-NAHFS-D 216
C ++C Y +RY +G + GT S+ + ++F FGCSH FS
Sbjct: 202 GCTNN-NQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQ----FGCSHGVQGSFSFG 256
Query: 217 EQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS 272
G+ LG SLV + G FS+C + L + A
Sbjct: 257 SSAAGIMALG---GGPESLVSQTAATYGRVFSHCFPPPT--RRGFFTLGVPRVAAWRYVL 311
Query: 273 TPM----SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
TPM ++ Y V LE I++ + + + P +F AG +DS T +T L P+
Sbjct: 312 TPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFA-------AGAALDSRTAITRLPPT 364
Query: 329 AYQTLRKEVED---LFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
AYQ LR+ D ++Q P P+D CY R P + F A + LD
Sbjct: 365 AYQALRQAFRDRMAMYQPAPPKGPLD----TCYDMAGVRSF-ALPRITLVFDKNAAVELD 419
Query: 386 AESVFYQESSSVFCLA--VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
V +Q CLA GP+D + IIG I Q V Y++ + + F+
Sbjct: 420 PSGVLFQG-----CLAFTAGPND------QVPGIIGNIQLQTLEVLYNIPAALVGFRHAA 468
Query: 444 C 444
C
Sbjct: 469 C 469
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 159/383 (41%), Gaps = 48/383 (12%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-----GATTFDPSKSLTYATLPCDSSY 155
V+ ++G PP VLDTGS L W+ C P A +F P SLT+A++PCDS+
Sbjct: 68 VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQ 127
Query: 156 CTND-------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
C + C G +C ++ Y +G S G + +E F G+ FGC
Sbjct: 128 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV-----GQGPPLRAAFGCM 182
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI- 267
S + LG + + + +FSYCI + + +L+LG +
Sbjct: 183 ATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCISDRD----DAGVLLLGHSDLP 238
Query: 268 --------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
L + P+ D +Y V L GI +G K L I ++ + T + +DS
Sbjct: 239 FLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGA-GQTMVDS 297
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPS-----YPMDPAWHLCYSGNINRDLQG-FPAM 372
GT T+L+ AY L+ E + LP+ + A+ C+ R PA+
Sbjct: 298 GTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAV 357
Query: 373 AFHFAGGADLVLDAESVFY------QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
F GA + + + + Y + V+CL G +D+ +IG Q N
Sbjct: 358 TLLF-NGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVP---ITAYVIGHHHQMNV 413
Query: 427 NVAYDLVSKQLYFQRIDCELLAD 449
V YDL ++ I C++ ++
Sbjct: 414 WVEYDLERGRVGLAPIRCDVASE 436
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 148/359 (41%), Gaps = 43/359 (11%)
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPC--EQC---GATTFDPSKSLTYATLPCDSSYCTN 158
+I P + Q +DT L W++C PC +C FDP +S T A +PC S+ C
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197
Query: 159 ----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
G ++C Y + Y +G + GT + S T + + FGCSH
Sbjct: 198 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPS----TVVMNFRFGCSHAVRGN 253
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDST 273
+G LG S S G+ FSYC+ + + + G T
Sbjct: 254 FSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFART 313
Query: 274 PM----SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSA 329
P+ S+I Y V L GI +G + L++ P +F G +DS +T L P+A
Sbjct: 314 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA-------GGAVMDSSVIITQLPPTA 366
Query: 330 YQTLRKEVEDLFQGLLPSYPM----DPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
Y+ LR F+ + +YP CY + PA++ F GGA + LD
Sbjct: 367 YRALRLA----FRSAMAAYPRVAGGRAGLDTCYD-FVRFTSVTVPAVSLVFDGGAVVRLD 421
Query: 386 AESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
A V + CLA P+ + L IG + QQ + V YD+ + F+R C
Sbjct: 422 AMGVMVEG-----CLAFVPTPGDFA----LGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 122/460 (26%), Positives = 188/460 (40%), Gaps = 54/460 (11%)
Query: 9 LLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTL---- 64
L S LP S + + + P +VT LH + P TV + TL
Sbjct: 27 LRSYKVLPVGSLKSAAVSCSLPKVAPSSGVVTVPLHHR---HGPCSTVPSTNAPTLEDML 83
Query: 65 NMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV----------FYVNFSIGQPPVPQLA 114
R Y+++K S + + + TVP + + +G P V Q
Sbjct: 84 RRDQLRAAYITRKYS-GVNGSAGDVEGSDVTVPTTLGTSLDTLEYLITVGMGSPAVAQTM 142
Query: 115 VLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCT--NDCGGYPDECWY 169
++DTGS + WV+C+PC QC + + FDPS S TY+ C S+ C G +C Y
Sbjct: 143 LIDTGSDVSWVQCKPCSQCHSQADSLFDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQY 202
Query: 170 NIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH--FSDEQFTGVFGLGP 227
++Y +G GT S+ G + + + FGCS + + D+ + G
Sbjct: 203 TVKYGDGSTGSGTYSSDTLAL-----GSSTVENFQFGCSQSESGNLLQDQTAGLMGLGGG 257
Query: 228 ATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM--SVIDGSYY-V 284
A S G FSYC L + L LG TPM S SYY V
Sbjct: 258 AESLATQTAGTFGKAFSYC---LPPTPGSSGFLTLGASTSGFVVKTPMLRSTQVPSYYGV 314
Query: 285 TLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL 344
L+ I +G + L+I + F AG +DSGT +T L +AY L + +
Sbjct: 315 LLQAIRVGGRQLNIPASAFS-------AGSIMDSGTIITRLPRTAYSALSSAFKAGMKQY 367
Query: 345 LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGP 404
P+ PM + C+ + + P +A F+GGA + L ++ + CLA
Sbjct: 368 PPAQPMG-IFDTCFDFSGQSSVS-IPTVALVFSGGAVVDLASDGIILGS-----CLAFAA 420
Query: 405 SDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + L IIG + Q+ + V YD+ + F+ C
Sbjct: 421 NSDD----TSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 148/361 (40%), Gaps = 59/361 (16%)
Query: 110 VPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFDPSKSLTYATLPCDSSYCT------N 158
V Q VLDT S + WV+C PC +DP+KS + C+S CT N
Sbjct: 167 VTQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYAN 226
Query: 159 DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN-NAHFS-D 216
C ++C Y +RY +G + GT S+ + ++F FGCSH FS
Sbjct: 227 GCTNN-NQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQ----FGCSHGVQGSFSFG 281
Query: 217 EQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS 272
G+ LG SLV + G FS+C + L + A
Sbjct: 282 SSAAGIMALG---GGPESLVSQTAATYGRVFSHCFPPPT--RRGFFTLGVPRVAAWRYVL 336
Query: 273 TPM----SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
TPM ++ Y V LE I++ + + + P +F AG +DS T +T L P+
Sbjct: 337 TPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFA-------AGAALDSRTAITRLPPT 389
Query: 329 AYQTLRKEVED---LFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
AYQ LR+ D ++Q P P+D CY R P + F A + LD
Sbjct: 390 AYQALRQAFRDRMAMYQPAPPKGPLD----TCYDMAGVRSF-ALPRITLVFDKNAAVELD 444
Query: 386 AESVFYQESSSVFCLA--VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
V +Q CLA GP+D + IIG I Q V Y++ + + F+
Sbjct: 445 PSGVLFQG-----CLAFTAGPND------QVPGIIGNIQLQTLEVLYNIPAALVGFRHAA 493
Query: 444 C 444
C
Sbjct: 494 C 494
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 102/411 (24%), Positives = 179/411 (43%), Gaps = 50/411 (12%)
Query: 63 TLNMSMARFIYLSQKSSQKAHDT---RAH--LHPGISTVPVFYVNFSIGQPPVPQLAVLD 117
T N+S R + S ++ H++ AH L+ + + + IG PP ++D
Sbjct: 47 TSNISSHRKPFTSNYHRRQLHNSDLPNAHMRLYDDLLSNGYYTTRLFIGTPPQEFALIVD 106
Query: 118 TGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYT 174
TGS++ +V C CEQCG F P S TY + C+ S +D G +C Y RY
Sbjct: 107 TGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQCNPSCNCDDEG---KQCTYERRYA 163
Query: 175 NGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGVFGLGPAT-SST 232
S G + + +F +E + FGC + ++ G+ GLG S
Sbjct: 164 EMSSSSGLLAEDVLSF--GNESELTPQRAIFGCETVETGELFSQRADGIMGLGRGPLSVV 221
Query: 233 HSLV--EKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVI--------DGSY 282
LV E VG+ FS C G ++ ++G +L P ++ Y
Sbjct: 222 DQLVIKEVVGNSFSLCYGGMD---------VVGGAMVLGNIPPPPDMVFAHSDPYRSAYY 272
Query: 283 YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
+ L+ + + K L ++P +F G +DSGTT +L A+ + + +
Sbjct: 273 NIELKELHVAGKRLKLNPRVFD-----GKHGTVLDSGTTYAYLPEEAFVAFKDAIIKEIK 327
Query: 343 GLLPSYPMDPAWH-LCYSG---NINRDLQGFPAMAFHFAGGADLVLDAESVFYQES--SS 396
L + DP+++ +C+SG ++++ + FP + F G L L E+ ++ + S
Sbjct: 328 FLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVFGNGQKLSLSPENYLFRHTKVSG 387
Query: 397 VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+CL + NG+ +++G I +N V YD + ++ F + +C L
Sbjct: 388 AYCLGIFQ---NGK--DPTTLLGGIVVRNTLVTYDRDNDKIGFWKTNCSEL 433
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 103/413 (24%), Positives = 173/413 (41%), Gaps = 48/413 (11%)
Query: 70 RFIYLSQKSSQ---KAHDTRAHLH--PGI----------STVPVFYVNFSIGQPPVPQLA 114
++ + QK S KAHD L G+ V ++Y IG P
Sbjct: 54 KYKFAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTPARDYYV 113
Query: 115 VLDTGSSLIWVKCQPCEQCGA--------TTFDPSKSLTYATLPCDSSYCTNDCGGYPDE 166
+DTGS ++WV C C +C T +D +SLT + CD +C GG P
Sbjct: 114 QVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAINGGPPSY 173
Query: 167 CWYNIR------YTNGPDSQGTIGSE--QFNFETSD-EGKTFLYDVGFGCSHNNAH--FS 215
C N+ Y +G S G + Q++ + D E + V FGCS + S
Sbjct: 174 CIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGDLSS 233
Query: 216 DEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS 272
+E G+ G G + +S S + KV F++C+ LN + +G + ++
Sbjct: 234 EEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLN----GGGIFAIGHIVQPKVNT 289
Query: 273 TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQT 332
TP+ Y V ++ + +G L++ ++F D G IDSGTTL +L Y
Sbjct: 290 TPLVPNQTHYNVNMKAVEVGGYFLNLPTDVF---DVGDKKGTIIDSGTTLAYLPEVVYDQ 346
Query: 333 LRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ 392
L ++ L D YS +++ GFPA+ FHF L + +
Sbjct: 347 LLSKIFSWQSDLKVHTIHDQFTCFQYSESLD---DGFPAVTFHFENSLYLKVHPHEYLF- 402
Query: 393 ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
++C+ S + ++++++G +A N V YDL ++ + + +C+
Sbjct: 403 SYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNCK 455
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 154/365 (42%), Gaps = 47/365 (12%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCG 161
IG PP ++DTGS++ +V C CEQCG FDP S TY + C+ C D
Sbjct: 89 IGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCNID-CICDSD 147
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-NNAHFSDEQFT 220
G +C Y +Y S G +G + +F ++ + FGC + ++
Sbjct: 148 GV--QCVYERQYAEMSTSSGVLGEDVISF--GNQSELIPQRAVFGCENMETGDLFSQRAD 203
Query: 221 GVFGLGPA-TSSTHSLVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMS- 276
G+ GLG S LVEK + FS C G ++ +G GA++ G +P S
Sbjct: 204 GIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMD----------IGGGAMVLGGISPPSD 253
Query: 277 --------VIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
V Y V L+ I + K L + +F G +DSGTT +L
Sbjct: 254 MIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGR-----YGAVLDSGTTYAYLPAE 308
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSGNINRDLQ---GFPAMAFHFAGGADLVL 384
A+ + + D L DP + +C+SG + + FP + F G L L
Sbjct: 309 AFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKLSL 368
Query: 385 DAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
E+ F++ S +CL + NG +++G I +N V YD + ++ F +
Sbjct: 369 TPENYFFRHSKVHGAYCLGIFE---NGN--DQTTLLGGIVVRNTLVMYDRANSKIGFWKT 423
Query: 443 DCELL 447
+C L
Sbjct: 424 NCSEL 428
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 101/344 (29%), Positives = 153/344 (44%), Gaps = 55/344 (15%)
Query: 126 KCQPCEQCGATTFDPSKSLTYATLPCDSSYCTND---------CGGYPDECWYNIRYTNG 176
K PC+ F P +S ++ + C S C D C D C Y+I Y +G
Sbjct: 183 KSNPCK----GVFCPHRSKSFQAVTCASQKCKIDLSQLFSLSLCPKPSDPCLYDISYADG 238
Query: 177 PDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS---HNNAHFSDEQFTGVFGLGPATSS-T 232
++G G++ + + + L ++ GC+ N +F +E G+ GLG A S
Sbjct: 239 SSAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNF-NEDTGGILGLGFAKDSFI 297
Query: 233 HSLVEKVGSKFSYC-IGNLNYFEYAYNMLILG-EGAILEGD--STPMSVIDGSYYVTLEG 288
+ G+KFSYC + +L++ + + I G A L G+ T + + Y V + G
Sbjct: 298 DKAAYEYGAKFSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIKRTELILFPPFYGVNVVG 357
Query: 289 ISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY 348
IS+G +ML I P ++ N S G IDSGTTLT L+ AY E +F+ L+ S
Sbjct: 358 ISIGGQMLKIPPQVWDFN---SQGGTLIDSGTTLTALLVPAY-------EPVFEALIKSL 407
Query: 349 PMDP--------AWHLCYSGNINRDLQGF-----PAMAFHFAGGADLVLDAESVFYQESS 395
A C+ D +GF P + FHFAGGA +S +
Sbjct: 408 TKVKRVTGEDFGALDFCF------DAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAP 461
Query: 396 SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
V C+ + P D G S+IG I QQN+ +DL + + F
Sbjct: 462 LVKCIGIVPIDGIG----GASVIGNIMQQNHLWEFDLSTNTIGF 501
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 154/365 (42%), Gaps = 47/365 (12%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCG 161
IG PP ++DTGS++ +V C CEQCG FDP S TY + C+ C D
Sbjct: 89 IGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCNID-CICDSD 147
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-NNAHFSDEQFT 220
G +C Y +Y S G +G + +F ++ + FGC + ++
Sbjct: 148 GV--QCVYERQYAEMSTSSGVLGEDVISF--GNQSELIPQRAVFGCENMETGDLFSQRAD 203
Query: 221 GVFGLGPA-TSSTHSLVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMS- 276
G+ GLG S LVEK + FS C G ++ +G GA++ G +P S
Sbjct: 204 GIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMD----------IGGGAMVLGGISPPSD 253
Query: 277 --------VIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
V Y V L+ I + K L + +F G +DSGTT +L
Sbjct: 254 MIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGR-----YGAVLDSGTTYAYLPAE 308
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSGNINRDLQ---GFPAMAFHFAGGADLVL 384
A+ + + D L DP + +C+SG + + FP + F G L L
Sbjct: 309 AFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKLSL 368
Query: 385 DAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
E+ F++ S +CL + NG +++G I +N V YD + ++ F +
Sbjct: 369 TPENYFFRHSKVHGAYCLGIFE---NGN--DQTTLLGGIVVRNTLVMYDRANSKIGFWKT 423
Query: 443 DCELL 447
+C L
Sbjct: 424 NCSEL 428
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 168/405 (41%), Gaps = 63/405 (15%)
Query: 81 KAHDTRAHL---------HPGISTVPV------FYVNFSIGQPPVPQLAVLDTGSSLIWV 125
+AH+ R L PG TVPV + VN +IG PP P A++D G L+W
Sbjct: 18 RAHELRRGLELADDATTARPGGVTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELVWT 77
Query: 126 KC-QPCEQC---GATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGP 177
+C Q C +C FD + S T+ PC ++ C T C G T+
Sbjct: 78 QCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAAVCESIPTRSCAGDGGGACGYEASTSFG 137
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
+ G IG++ T+ + + FGC+ + + +G GLG + SL
Sbjct: 138 RTVGRIGTDAVAIGTAATAR-----LAFGCAVASEMDTMWGSSGSVGLG---RTNLSLAA 189
Query: 238 KV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEG-------------DSTPMSVIDGSYY 283
++ + FSYC+ + + + L LG A L G + P S + SY
Sbjct: 190 QMNATAFSYCLAPPDTGK--SSALFLGASAKLAGAGKGAGTTPFVKTSTPPNSGLSRSYL 247
Query: 284 VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG 343
+ LE I G + + S + + + T +T LV S Y+ LRK V D G
Sbjct: 248 LRLEAIRAGNATIAMP---------QSGNTITVSTATPVTALVDSVYRDLRKAVADAV-G 297
Query: 344 LLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVG 403
P P + LC+ G P + F GGA++ + S + + C+A+
Sbjct: 298 AAPVPPPVQNYDLCFPKASASG--GAPDLVLAFQGGAEMTVPVSSYLFDAGNDTACVAI- 354
Query: 404 PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLA 448
+ +SI+G + Q N ++ +DL + L F+ DC L+
Sbjct: 355 ---LGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCSALS 396
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 105/392 (26%), Positives = 168/392 (42%), Gaps = 51/392 (13%)
Query: 71 FIYLSQKSSQ-KAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP 129
F Y+S K+S+ + G+ T ++ ++ +G P Q+ +DTGSS WV C+
Sbjct: 54 FRYISNKTSRLSTQAVQVGWDRGLQT-SLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE- 111
Query: 130 CEQC--GATTFDPSKSLTYATLPCDSSYC--------TNDCGGYPDECWYNIRYTNGPDS 179
C+ C TF S+S T A + C +S C D YPD C + + Y +G S
Sbjct: 112 CDGCHTNPRTFLQSRSTTCAKVSCGTSMCLLGGSDPHCQDSENYPD-CPFRVSYQDGSAS 170
Query: 180 QGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV 239
G + + F + +F FGC N F +F V GL + S++++
Sbjct: 171 YGILYQDTLTFSDVQKIPSFT----FGC--NLDSFGANEFGNVDGLLGMGAGPMSVLKQS 224
Query: 240 GSK---FSYCI----GNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS----YYVTLEG 288
+ FSYC+ +F LG+ A V ++V L
Sbjct: 225 SPRFDGFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAA 284
Query: 289 ISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY 348
IS+ + L + P++F + GV DSG+ L+++ A L + + +L L
Sbjct: 285 ISVDGERLGLSPSIFSRK------GVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGA 336
Query: 349 PMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQES---SSVFCLAVGPS 405
+ + CY + D PA++ HF GA L + VF + S V+CLA P+
Sbjct: 337 AEEESERNCYDMR-SVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPT 395
Query: 406 DINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
+ +SIIG + Q + V YDL +QL
Sbjct: 396 E-------SVSIIGSLMQTSKEVVYDL-KRQL 419
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 150/370 (40%), Gaps = 49/370 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QCGAT---TFDPSKSLTYATLPCDSS 154
F V G P +DTGS + W++C PC C FDP+KS TY+ +PC
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCGHP 220
Query: 155 YCTNDCGGYPDE--CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN- 211
C G + C Y + Y +G + G + E + ++ + F FGC N
Sbjct: 221 QCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRDLPGF----AFGCGQTNL 276
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGE------- 264
F G G A S G+ FSYC L ++ + L +G
Sbjct: 277 GEFGGVDGLVGLGRG-ALSLPSQAAATFGATFSYC---LPSYDTTHGYLTMGSTTPAASN 332
Query: 265 -------GAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
A+++ + P Y+V + I +G +L + P +F ++ G D
Sbjct: 333 DDDDVQYTAMIQKEDYP-----SLYFVEVVSIDIGGYILPVPPTVFTRD------GTLFD 381
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SGT LT+L P AY +LR + P+ DP + CY + + PA+AF F+
Sbjct: 382 SGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDP-FDTCYDFTGHNAIF-MPAVAFKFS 439
Query: 378 GGADLVLDAESVFY---QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
GA L ++ + + CLA P +IIG Q+ V YD+ +
Sbjct: 440 DGAVFDLSPVAILIYPDDTAPATGCLAFVPRPST----MPFNIIGNTQQRGTEVIYDVAA 495
Query: 435 KQLYFQRIDC 444
+++ F + C
Sbjct: 496 EKIGFGQFTC 505
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 102/398 (25%), Positives = 167/398 (41%), Gaps = 51/398 (12%)
Query: 74 LSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC 133
L+ S++ + R LH + + IG PP ++DTGS++ +V C CEQC
Sbjct: 59 LTGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC 118
Query: 134 GA---TTFDPSKSLTYATLPCDSSYCTNDCGGYPD--ECWYNIRYTNGPDSQGTIGSEQF 188
G F P S TY + CT DC D +C Y +Y S G +G +
Sbjct: 119 GRHQDPKFQPESSSTYQPVK-----CTIDCNCDSDRMQCVYERQYAEMSTSSGVLGEDLI 173
Query: 189 NFETSDEGKTFLYDVGFGCSH-NNAHFSDEQFTGVFGLGPA-TSSTHSLVEK--VGSKFS 244
+F ++ + FGC + + G+ GLG S LV+K + FS
Sbjct: 174 SF--GNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFS 231
Query: 245 YCIGNLNYFEYAYNMLILGEGAILEGDSTPMS---------VIDGSYYVTLEGISLGEKM 295
C G ++ +G GA++ G +P S V Y + L+ I + K
Sbjct: 232 LCYGGMD----------VGGGAMVLGGISPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKR 281
Query: 296 LDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH 355
L ++ N+F G +DSGTT +L +A+ + + Q L DP ++
Sbjct: 282 LPLNANVFD-----GKHGTVLDSGTTYAYLPEAAFLAFKDAIVKELQSLKKISGPDPNYN 336
Query: 356 -LCYSG---NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDING 409
+C+SG ++++ + FP + F G L E+ ++ S +CL V NG
Sbjct: 337 DICFSGAGIDVSQLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQ---NG 393
Query: 410 ERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+++G I +N V YD ++ F + +C L
Sbjct: 394 N--DQTTLLGGIIVRNTLVVYDREQTKIGFWKTNCAEL 429
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/388 (24%), Positives = 172/388 (44%), Gaps = 56/388 (14%)
Query: 94 STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLT 145
++V ++Y +G PP +DTGS ++WV C C C ++ FD S T
Sbjct: 73 NSVGLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSST 132
Query: 146 YATLPCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE------- 191
A +PC CT+ +C ++C Y +Y +G + G S+ F
Sbjct: 133 AALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPP 192
Query: 192 TSDEGKTFLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSY 245
+ T + FGCS + + +D+ G+FG GP S S + G FS+
Sbjct: 193 AVNSSATIV----FGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSH 248
Query: 246 CIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKK 305
C+ + + E +I+ +P+ Y + L+ I++ ++L I+P +F
Sbjct: 249 CLKGDGDGGGVLVLGEILEPSIVY---SPLVPSQPHYNLNLQSIAVNGQLLPINPAVFSI 305
Query: 306 NDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNIN 363
++ + G +D GTTL +L+ AY L + + + + CY S +I
Sbjct: 306 SN--NRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQ--SARQTNSKGNQCYLVSTSIG 361
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVF----YQESSSVFCLAVGPSDINGERFKD-LSII 418
D+ FP+++ +F GGA +VL E Y + + ++C+ ++F++ SI+
Sbjct: 362 -DI--FPSVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGF-------QKFQEGASIL 411
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
G + ++ V YD+ +++ + DC L
Sbjct: 412 GDLVLKDKIVVYDIAQQRIGWANYDCSL 439
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 151/355 (42%), Gaps = 54/355 (15%)
Query: 72 IYLSQKSSQKAHDTRAHLHPGISTVP--------------VFYVNFSIGQPPVPQLAVLD 117
++LS ++S K T+ H S P + IG PP ++D
Sbjct: 49 LFLSHRNSSKTTSTQQHRRLQGSARPNARMRLYDDLLLNGYYTTRIWIGTPPQTFALIVD 108
Query: 118 TGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYT 174
TGS++ +V C CEQCG F+P S TY + C+ CT C +C Y +Y
Sbjct: 109 TGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSCNID-CT--CDNERKQCVYERQYA 165
Query: 175 NGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGVFGLGPA-TSST 232
S G +G + +F ++ + FGC + ++ G+ GLG S
Sbjct: 166 EMSSSSGVLGEDIISF--GNQSELVPQRAIFGCENQETGDLYSQRADGIMGLGRGDLSIV 223
Query: 233 HSLVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMS---------VIDGS 281
LVEK + FS C G ++ +G GA++ G +P S V
Sbjct: 224 DQLVEKGVISDSFSLCYGGMD----------IGGGAMILGGISPPSGMVFAESDPVRSQY 273
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y + L+ I + K L +DP++F G +DSGTT +L +A+ + +
Sbjct: 274 YNIDLKAIHVAGKQLHLDPSIFDGKH-----GTVLDSGTTYAYLPEAAFTAFKDAMMKEL 328
Query: 342 QGLLPSYPMDPAWH-LCYSG---NINRDLQGFPAMAFHFAGGADLVLDAESVFYQ 392
L + DP ++ +C+SG ++++ FPA+ F+ G L L E+ +Q
Sbjct: 329 TSLKQIHGPDPNYNDICFSGAESDVSQLSNTFPAVEMVFSNGQKLSLSPENYLFQ 383
>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
Length = 449
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 169/387 (43%), Gaps = 59/387 (15%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDP---SKSLTYATLPCD-- 152
V+ IG+ Q ++DTGSSL+W +C C C P S+S T+ + C
Sbjct: 81 VYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPHCHIGDVPPYGRSQSRTFQEVSCGDD 140
Query: 153 ---------SSYCTNDCGGY-----PDECWYNIRYT---NGPDSQGTIGSEQFNFETSDE 195
+SYC GY C + Y G QG + + F+F +
Sbjct: 141 DDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQGYMSMDTFHF---ID 197
Query: 196 GKTFLYDVG----FGCSH--NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIG 248
+ F Y FGC+H N + ++ TG+ GLG + S + + G +KFSYC+
Sbjct: 198 DRRFDYQAKFRMVFGCAHQENIVLTAVKECTGILGLGMGDA---SFLRQTGITKFSYCVP 254
Query: 249 NL--NYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGI--SLGEKMLDIDPNLFK 304
Y ++ L G A + G P+ + G YY+ L I + E M + +K
Sbjct: 255 PRMPGYSYRRHSWLRFGSHAQISGKKVPLVMRWGKYYLPLTAITYTYNELMSPVPIIAYK 314
Query: 305 KNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF--QGLLPSYPMDPAWHLCYSGNI 362
+ + + +D+GT+L L S + L KE+E + + ++ P CY +
Sbjct: 315 SQEDY--LHMMVDTGTSLLSLPTSLHDDLIKEMEAIIKSENIMEGATRWPKH--CYKRTM 370
Query: 363 N--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSS---VFCLAVGPSDINGERFKDLSI 417
+ +D+ + F GG D+ L ++F + ++ CLAV D + + +I
Sbjct: 371 DEVKDI----TVTLSFDGGLDIELFTSALFIKTETTKGPAVCLAVNRVDDSSK-----AI 421
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+GM AQ N NV YDL+S+++ I C
Sbjct: 422 LGMFAQTNINVGYDLLSREIAMDPIRC 448
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 109/426 (25%), Positives = 185/426 (43%), Gaps = 76/426 (17%)
Query: 78 SSQKAHDTRAH--------LHPGISTVP----VFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ +AHD R H L G +P +++ +G PP +DTGS ++WV
Sbjct: 54 SALRAHDGRRHGRLLAAADLPLGGLGLPTDTGLYFTEIKLGTPPKRYYVQVDTGSDILWV 113
Query: 126 KCQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGG-YPD-----ECWYNI 171
C C +C T +DP S + +T+ CD +C GG P C Y++
Sbjct: 114 NCISCSKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSV 173
Query: 172 RYTNGPDSQGTIGSEQFNF-ETSDEGKTFLYD--VGFGCSHNNAH---FSDEQFTGVFGL 225
Y +G + G ++ F + + +G+T + + FGC S++ G+ G
Sbjct: 174 MYGDGSSTTGFFITDALQFDQVTGDGQTQPGNATITFGCGAQQGGDLGNSNQALDGILGF 233
Query: 226 GPATSSTHSLVEKVGSK---FSYC-----------IGNLN----YFEYAYNMLILGEGAI 267
G A +S S + G F++C IGN+ YF + + +L
Sbjct: 234 GQANTSMLSQLAAAGKAKKIFAHCLDTIKGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLF 293
Query: 268 LEGDSTPMSVIDGSYY-VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
L M ++ +Y V L+ I +G L + ++F +T G IDSGTTLT+L
Sbjct: 294 L----LVMILLSRPHYNVNLKSIDVGGTTLQLPAHVF---ETGEKKGTIIDSGTTLTYLP 346
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWH-----LC--YSGNINRDLQGFPAMAFHFAGG 379
+ + K+V D+ + S D A+H LC YSG+++ GFP + FHF
Sbjct: 347 ----ELVFKQVMDV----VFSKHRDIAFHNLQDFLCFQYSGSVD---DGFPTITFHFEDD 395
Query: 380 ADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
L + F+ + ++C+ + + KD+ ++G + N V YDL ++ + +
Sbjct: 396 LALHVYPHEYFFPNGNDIYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENQVIGW 455
Query: 440 QRIDCE 445
+C
Sbjct: 456 TDYNCS 461
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 103/412 (25%), Positives = 172/412 (41%), Gaps = 48/412 (11%)
Query: 70 RFIYLSQKSSQ---KAHDTRAHLH--PGI----------STVPVFYVNFSIGQPPVPQLA 114
++ + QK S KAHD L G+ V ++Y IG P
Sbjct: 54 KYKFAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTPARDYYV 113
Query: 115 VLDTGSSLIWVKCQPCEQCGA--------TTFDPSKSLTYATLPCDSSYCTNDCGGYPDE 166
+DTGS ++WV C C +C T +D +SLT + CD +C GG P
Sbjct: 114 QVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAINGGPPSY 173
Query: 167 CWYNIR------YTNGPDSQGTIGSE--QFNFETSD-EGKTFLYDVGFGCSHNNAH--FS 215
C N+ Y +G S G + Q++ + D E + V FGCS + S
Sbjct: 174 CIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGDLSS 233
Query: 216 DEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS 272
+E G+ G G + +S S + KV F++C+ LN + +G + ++
Sbjct: 234 EEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLN----GGGIFAIGHIVQPKVNT 289
Query: 273 TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQT 332
TP+ Y V ++ + +G L++ ++F D G IDSGTTL +L Y
Sbjct: 290 TPLVPNQTHYNVNMKAVEVGGYFLNLPTDVF---DVGDKKGTIIDSGTTLAYLPEVVYDQ 346
Query: 333 LRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ 392
L ++ L D YS +++ GFPA+ FHF L + +
Sbjct: 347 LLSKIFSWQSDLKVHTIHDQFTCFQYSESLD---DGFPAVTFHFENSLYLKVHPHEYLF- 402
Query: 393 ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
++C+ S + ++++++G +A N V YDL ++ + + +C
Sbjct: 403 SYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNC 454
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 168/376 (44%), Gaps = 39/376 (10%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V ++Y +G PP +DTGS ++WV C C C T+ FDP S + +
Sbjct: 81 VGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSAS 140
Query: 148 TLPCDSSYC----TNDCGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
+ C C + G P+ C Y+ +Y +G + G S+ +F+T +
Sbjct: 141 LVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINS 200
Query: 203 VG---FGCSH---NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYF 253
FGCS+ + G+FGLG + S S + G FS+C L
Sbjct: 201 SAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHC---LKGD 257
Query: 254 EYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
+ +++LG+ + TP+ Y V L+ I++ ++L IDP++F + G
Sbjct: 258 KSGGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFT---IATGDG 314
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMA 373
ID+GTTL +L AY + V + P+ + C+ D+ FP ++
Sbjct: 315 TIIDTGTTLAYLPDEAYSPFIQAVANAVSQY--GRPITYESYQCFEITAG-DVDVFPQVS 371
Query: 374 FHFAGGADLVLDAES---VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
FAGGA +VL + +F SS++C +G ++ R ++I+G + ++ V Y
Sbjct: 372 LSFAGGASMVLGPRAYLQIFSSSGSSIWC--IGFQRMSHRR---ITILGDLVLKDKVVVY 426
Query: 431 DLVSKQLYFQRIDCEL 446
DLV +++ + DC L
Sbjct: 427 DLVRQRIGWAEYDCSL 442
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 101/394 (25%), Positives = 169/394 (42%), Gaps = 46/394 (11%)
Query: 76 QKSSQKAH-DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG 134
Q+S K H + R L+ + + IG PP ++DTGS++ +V C CE CG
Sbjct: 65 QRSESKRHPNARMRLYDDLLINGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCG 124
Query: 135 A---TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE 191
F P S TY + C C +C G ++C Y+ +Y S G +G + +F
Sbjct: 125 RHQDPKFQPDLSETYQPVKCTPD-C--NCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFG 181
Query: 192 TSDEGKTFLYDVGFGCSHNN-AHFSDEQFTGVFGLGPA-TSSTHSLVEK--VGSKFSYCI 247
E FGC ++ ++ G+ GLG S LV+K + FS C
Sbjct: 182 NLSELAP--QRAVFGCENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY 239
Query: 248 GNLNYFEYAYNMLILGEGAILEGDSTPMSVI------DGS--YYVTLEGISLGEKMLDID 299
G ++ + G IL G S P ++ D S Y + L+ + + K L ++
Sbjct: 240 GGMD---------VGGGAMILGGISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLN 290
Query: 300 PNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCY 358
P +F G +DSGTT +L +A+ ++ + L DP + +C+
Sbjct: 291 PKVFD-----GKHGTVLDSGTTYAYLPETAFLAFKRAIMKERNSLKQINGPDPNYKDICF 345
Query: 359 SG---NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGERFK 413
+G ++++ + FP + F G L L E+ ++ S +CL V NG
Sbjct: 346 TGAGIDVSQLAKSFPVVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFS---NGR--D 400
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+++G I +N V YD + ++ F + +C L
Sbjct: 401 PTTLLGGIFVRNTLVMYDRENSKIGFWKTNCSEL 434
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 116/438 (26%), Positives = 182/438 (41%), Gaps = 74/438 (16%)
Query: 56 VDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV-----------FYVNFS 104
+D+ T N + R + S+ + K P T PV + ++F
Sbjct: 38 IDSGRGFTRNELLRRMVLRSRARAAKQLCPSRSGTPVRVTAPVASGSHVVGYTEYLIHFG 97
Query: 105 IGQPPVPQLAV-LDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSYCT--- 157
IG P Q+A+ +DTGS ++W +C+PC C FD S S T + C C
Sbjct: 98 IGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTDPICRALR 157
Query: 158 -NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC-SHNNAHF- 214
+ C + C Y + Y + + G + + F F+ GK + D+ FGC +N +F
Sbjct: 158 PHAC--FLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQYNTGNFH 215
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEG--- 270
S+E FG GP SL ++G S FSYC + FE + LG GA +G
Sbjct: 216 SNETGIAGFGRGPL-----SLPRQLGVSSFSYCFTTI--FESKSTPVFLG-GAPADGLRA 267
Query: 271 ------DSTP-MSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
STP + YY++L+GI++G+ L + + F S G IDSGT +T
Sbjct: 268 HATGPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGS-GGTIIDSGTAIT 326
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF-------------P 370
+ +++ L++ + P+ H Y+ LQ F P
Sbjct: 327 AFPRAVFRS-------LWEAFVAQVPLP---HTSYNDTGEPTLQCFSTESVPDASKVPVP 376
Query: 371 AMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
M H GAD L E+ + S C+ V D D ++IG QQN ++
Sbjct: 377 KMTLHLE-GADWELPRENYMAEYPDSDQLCVVVLAGD------DDRTMIGNFQQQNMHIV 429
Query: 430 YDLVSKQLYFQRIDCELL 447
+DL +L + C+ +
Sbjct: 430 HDLAGNKLVIEPAQCDKM 447
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 158/386 (40%), Gaps = 52/386 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ---------PCEQCGAT-TFDPSKSLTYAT 148
+ V+ + G PP L + DTGS LIW++C P + C F SKS T +
Sbjct: 53 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 112
Query: 149 LPCDSSYCT---------NDCG-GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
+PC ++ C C P C Y Y +G + G + + G
Sbjct: 113 VPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGGA 172
Query: 199 FLYDVGFGCSHNNAHFSDEQFTGVFGLG------PATSSTHSLVEKVGSKFSYCIGNLNY 252
+ V FGC N S GV GLG PA S SL + FSYC+ +L
Sbjct: 173 AVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSG--SLFAQT---FSYCLLDLEG 227
Query: 253 FEYAY--NMLILGEGAILEGDS-TPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKN 306
+ L LG + TP+ + YYV + I +G ++L + P
Sbjct: 228 GRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPV-PGSEWAI 286
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW----HLCYSGNI 362
D + G IDSG+TLT+L AY L LP P + LCY+ +
Sbjct: 287 DVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVH--LPRIPSSATFFQGLELCYNVSS 344
Query: 363 NRDLQ----GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSII 418
+ GFP + FA G L L + + V CLA+ P+ ++ F +++
Sbjct: 345 SSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPT-LSPFAF---NVL 400
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDC 444
G + QQ Y+V +D S ++ F R +C
Sbjct: 401 GNLMQQGYHVEFDRASARIGFARTEC 426
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 153/379 (40%), Gaps = 45/379 (11%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ-----PCEQCGATTFDPSKSLTYATLPCDS 153
++V F +G P P + V DTGS L WVKC+ P A F S+S ++A L C S
Sbjct: 14 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSS 73
Query: 154 SYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE----------TSDEG 196
CT+ +C C Y+ RY +G ++G +G++
Sbjct: 74 DTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGR 133
Query: 197 KTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEY 255
+ L V GC+ S + GV LG + S S + G +FSYC+ +
Sbjct: 134 RAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 193
Query: 256 AYNMLIL---GEGAILEGDSTPMSVIDGSY-----YVTLEGISLGEKMLDIDPNLFKKND 307
A + L EG TP+ V+D GE LDI +++ D
Sbjct: 194 ASSYLTFGPGPEGGGAPAARTPL-VLDRRVSPFYAVAVDAVYVAGE-ALDIPADVW---D 248
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ 367
G +DSGT+LT L AY+ + + LP MDP + CY N
Sbjct: 249 VGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAA-LPRVAMDP-FEYCY--NWTAGAP 304
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P + FAG A L A+S + V C+ V G +S+IG I QQ +
Sbjct: 305 EIPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPG-----VSVIGNILQQEHL 359
Query: 428 VAYDLVSKQLYFQRIDCEL 446
+DL + L F+ C L
Sbjct: 360 WEFDLRDRWLRFKHTRCAL 378
>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 104/413 (25%), Positives = 184/413 (44%), Gaps = 53/413 (12%)
Query: 62 RTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSS 121
RT+++ F++L++ ++ R +H + + + G + LD ++
Sbjct: 52 RTIHVDDDGFVHLNEHATSA---LRPPMHTQVGGMYSVVTSVGTGAGRRTYVLALDMTTN 108
Query: 122 LIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTNDCGGY----PDEC-WYNIRY 173
L+W++C+P ++ F+P+KS ++ LP ++++C G+ D C +++IR
Sbjct: 109 LLWMQCKPVQEPFTQLPPPFEPAKSPSFRRLPGNNAFCLPAPRGHRRTVQDPCKFHSIRL 168
Query: 174 TNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF---SDEQFTGVFGLGPATS 230
D++G + +E F S + +T + V GC+HN+ F S GV GLG
Sbjct: 169 DGSADARGVLSNETLAFAASGQQQTEVTGVVIGCTHNSKGFNFNSHGVLAGVLGLG---R 225
Query: 231 STHSLVEKVGS---------KFSYCIGN-----------LNYFEYAYNMLILGEGAILEG 270
SL+ +G +FSYC+ + L + + N + I+
Sbjct: 226 QAPSLIWTLGQHRHGTVQVHRFSYCLPSHGSSSSDHHTFLRFDDDVPNTQHMVSTKIMYM 285
Query: 271 DSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKN---DTWSDAGVFIDSGTTLTWLVP 327
DST S +Y+V+L GIS+ K L LFK++ W+ F D+GT ++
Sbjct: 286 DST-TSRDFRAYFVSLTGISVAGKPLQDVKELFKRHVHGQVWTSGCAF-DAGTPTMVMIM 343
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA-GGADLVLDA 386
AY L+ V + L + +HLC+ ++ Q P + FA A LVL
Sbjct: 344 PAYNKLKDAVVRHLKPLGLQI-VSGQYHLCFRAT-SQLWQHLPTVMLQFAETEARLVLPP 401
Query: 387 ESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
+ +F + CLAV R D++IIG + Q + YD+ ++YF
Sbjct: 402 QRLFVAVGYDI-CLAV-------VRSYDITIIGAMQQVDKRFVYDVRHGRIYF 446
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 175/382 (45%), Gaps = 50/382 (13%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C C C ++ FD SLT
Sbjct: 97 VGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAG 156
Query: 148 TLPCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
++ C C++ C ++C Y+ RY +G + G ++ F F+ + G++ +
Sbjct: 157 SVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYGDGSGTSGYYMTDTFYFD-AILGESLV 214
Query: 201 YD----VGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNL 250
+ + FGCS + SD+ G+FG G S S + G FS+C L
Sbjct: 215 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC---L 271
Query: 251 NYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
+ +LGE + +P+ Y + L I + +ML +D +F+ ++T
Sbjct: 272 KGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNT-- 329
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQG 368
G +D+GTTLT+LV AY + + L+ P+ CY S +I+ D+
Sbjct: 330 -RGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVT--PIISNGEQCYLVSTSIS-DM-- 383
Query: 369 FPAMAFHFAGGADLVLDAESVFYQ----ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
FP+++ +FAGGA ++L + + + +S++C+ + ++ +I+G + +
Sbjct: 384 FPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAP------EEQTILGDLVLK 437
Query: 425 NYNVAYDLVSKQLYFQRIDCEL 446
+ YDL +++ + DC +
Sbjct: 438 DKVFVYDLARQRIGWASYDCSM 459
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 160/362 (44%), Gaps = 49/362 (13%)
Query: 115 VLDTGSSLIWVKCQ---PCEQCGATTFDPSKSLTYATLPCDSSYCTNDCGGYP--DECWY 169
VLDT SSL W++C P ++ + FDPS S +Y L S C P D+C +
Sbjct: 92 VLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLCRAPNPVLPAGDKCSF 151
Query: 170 NIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDE-QFTGVFGLGP- 227
++ ++ G +G++ ++ V FGC+ + F + F G G+G
Sbjct: 152 HLPG----EAHGYVGTDTIILGNP---TLPIHSVAFGCAQSTEGFDTKGTFAGTLGMGKL 204
Query: 228 ATSSTHSLVEKVGSKFSYCI-------GNLNYFEYAYN-----MLILGEGAILEGDS-TP 274
TS + ++VGS+FSYC+ G + + + +L+ IL P
Sbjct: 205 PTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTLLVHHRIKILPTPPHLP 264
Query: 275 MSVIDGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTL 333
V D +YYV L GISL G + I +F++ S G F+D+GT +T LVP+AY +
Sbjct: 265 HGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGS-GGCFVDAGTQVTHLVPAAYAVV 323
Query: 334 RKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG----FPAMAFHFAGGAD-----LVL 384
+ V + Q DP + LC+ R+ G P + F G A L +
Sbjct: 324 EEAVAHMVQQWGYKRVRDPNFSLCF-----REHPGIWSHIPKLTLDFEGPASRTVAHLEI 378
Query: 385 DAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
+ ++F + ++ + C V + +++G + Q + +DL + + F R
Sbjct: 379 VSRNLFLKVDNQPLVCFGVYRTSRGSP-----TVVGAMQQVDTRFIFDLHANTITFHRES 433
Query: 444 CE 445
CE
Sbjct: 434 CE 435
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 175/381 (45%), Gaps = 50/381 (13%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C C C ++ FD SLT
Sbjct: 97 VGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAG 156
Query: 148 TLPCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
++ C C++ C ++C Y+ RY +G + G ++ F F+ + G++ +
Sbjct: 157 SVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYGDGSGTSGYYMTDTFYFD-AILGESLV 214
Query: 201 YD----VGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNL 250
+ + FGCS + SD+ G+FG G S S + G FS+C L
Sbjct: 215 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC---L 271
Query: 251 NYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
+ +LGE + +P+ Y + L I + +ML +D +F+ ++T
Sbjct: 272 KGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNT-- 329
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQG 368
G +D+GTTLT+LV AY + + L+ P+ CY S +I+ D+
Sbjct: 330 -RGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVT--PIISNGEQCYLVSTSIS-DM-- 383
Query: 369 FPAMAFHFAGGADLVLDAESVFYQ----ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
FP+++ +FAGGA ++L + + + +S++C+ + ++ +I+G + +
Sbjct: 384 FPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAP------EEQTILGDLVLK 437
Query: 425 NYNVAYDLVSKQLYFQRIDCE 445
+ YDL +++ + DC+
Sbjct: 438 DKVFVYDLARQRIGWASYDCK 458
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 96/415 (23%), Positives = 168/415 (40%), Gaps = 50/415 (12%)
Query: 69 ARFIYLSQKSSQ---KAHDTRAHLH--PGIS----------TVPVFYVNFSIGQPPVPQL 113
++ Y Q+ S KAHD R L G+ TV ++Y IG P
Sbjct: 41 VKYRYAGQQRSLSDLKAHDDRRQLRILAGVDLPLGGSGRPDTVGLYYAKVGIGTPSKDYY 100
Query: 114 AVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYATLPCDSSYCTNDCGGYPD 165
+DTGS ++WV C C +C T+ ++ S++ +PCD +C GG
Sbjct: 101 VQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLVPCDEEFCYEVNGGPLS 160
Query: 166 ECWYNIR------YTNGPDSQGTIGSEQFNF-------ETSDEGKTFLYDVGFGCSHNNA 212
C N+ Y +G + G + + +T+ + ++ G S +
Sbjct: 161 GCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSNGSVIFGCGARQSGDLG 220
Query: 213 HFSDEQFTGVFGLGPATSSTHSLV---EKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
S+E G+ G G + SS S + KV F++C+ +N + +G +
Sbjct: 221 PTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGIN----GGGIFAIGHVVQPK 276
Query: 270 GDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSA 329
+ TP+ Y V + + +GE L + F+ D G IDSGTTL +L
Sbjct: 277 VNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDR---KGAIIDSGTTLAYLPEIV 333
Query: 330 YQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESV 389
Y+ L ++ L D YSG+++ GFP + FHF L +
Sbjct: 334 YEPLVSKIISQQPDLKVHIVRDEYTCFQYSGSVD---DGFPNVTFHFENSVFLKVHPHEY 390
Query: 390 FYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ ++C+ S + ++++++G + N V YDL ++ + + +C
Sbjct: 391 LF-PFEGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNC 444
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 168/376 (44%), Gaps = 39/376 (10%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V ++Y +G PP +DTGS ++WV C C C T+ FDP S + +
Sbjct: 81 VGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSAS 140
Query: 148 TLPCDSSYC----TNDCGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
+ C C + G P+ C Y+ +Y +G + G S+ +F+T +
Sbjct: 141 LVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINS 200
Query: 203 VG---FGCSH---NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYF 253
FGCS+ + G+FGLG + S S + G FS+C L
Sbjct: 201 SAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHC---LKGD 257
Query: 254 EYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
+ +++LG+ + TP+ Y V L+ I++ ++L IDP++F + G
Sbjct: 258 KSGGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFT---IATGDG 314
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMA 373
ID+GTTL +L AY + + + P+ + C+ D+ FP ++
Sbjct: 315 TIIDTGTTLAYLPDEAYSPFIQAIANAVSQY--GRPITYESYQCFEITAG-DVDVFPEVS 371
Query: 374 FHFAGGADLVLDAES---VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAY 430
FAGGA +VL + +F SS++C +G ++ R ++I+G + ++ V Y
Sbjct: 372 LSFAGGASMVLRPHAYLQIFSSSGSSIWC--IGFQRMSHRR---ITILGDLVLKDKVVVY 426
Query: 431 DLVSKQLYFQRIDCEL 446
DLV +++ + DC L
Sbjct: 427 DLVRQRIGWAEYDCSL 442
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 97/408 (23%), Positives = 165/408 (40%), Gaps = 69/408 (16%)
Query: 81 KAHDTRAHLH------------PGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
K+HDTR H + +V +++ +G PP +DTGS ++WV C+
Sbjct: 44 KSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWVNCK 103
Query: 129 PCEQCGATT--------FDPSKSLTYATLPCDSSYCT----NDCGGYPDECWYNIRYTNG 176
PC +C + T FD + S T + CD +C+ +D C Y+I Y +
Sbjct: 104 PCPECPSKTNLNFHLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQPAVGCSYHIVYADE 163
Query: 177 PDSQGTIGSEQFNFE--TSD-EGKTFLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATS 230
S+G ++ E T D + +V FGC + + SD GV G G + +
Sbjct: 164 STSEGNFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNT 223
Query: 231 STHSLVEKVGSK---FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLE 287
S S + G FS+C+ N+ + +G + +TPM Y V L
Sbjct: 224 SVLSQLAATGDAKRVFSHCLDNVK----GGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLM 279
Query: 288 GISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKE----------- 336
G+ + LD+ P++ + + G +DSGTTL + Y +L +
Sbjct: 280 GMDVDGTALDLPPSIMR------NGGTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHI 333
Query: 337 VEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSS 396
VED FQ C+S + N D+ FP ++F F L + +
Sbjct: 334 VEDTFQ--------------CFSFSENVDV-AFPPVSFEFEDSVKLTVYPHDYLFTLEKE 378
Query: 397 VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
++C + ++ ++G + N V YDL ++ + + +C
Sbjct: 379 LYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLENEVIGWADHNC 426
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 97/386 (25%), Positives = 164/386 (42%), Gaps = 55/386 (14%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G P +DTGS ++WV C PC C ++ F+P S T +
Sbjct: 86 VGLYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSS 145
Query: 148 TLPCDSSYCT------------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFET--- 192
+PC CT +D P C Y Y +G + G S+ F+T
Sbjct: 146 RIPCSDDRCTAALQTGEAVCQSSDSPSSP--CGYTFTYGDGSGTSGFYVSDTMYFDTVMG 203
Query: 193 SDEGKTFLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYC 246
+++ V FGCS++ + +D G+FG G S S + +G FS+C
Sbjct: 204 NEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHC 263
Query: 247 IGNLNYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFK 304
L + +L+LGE I+E TP+ Y + LE I++ + L ID +LF
Sbjct: 264 ---LKGSDNGGGILVLGE--IVEPGLVFTPLVPSQPHYNLNLESIAVSGQKLPIDSSLFA 318
Query: 305 KNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINR 364
++T G +DSGTTL +LV AY + + S + +++
Sbjct: 319 TSNT---QGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQCFVTTSSVDS 375
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGM 420
FP +F GG + + E+ Q+ S ++C I +R + ++I+G
Sbjct: 376 S---FPTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWC-------IGWQRSQGITILGD 425
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDCEL 446
+ ++ YDL + ++ + DC L
Sbjct: 426 LVLKDKIFVYDLANMRMGWADYDCSL 451
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/395 (25%), Positives = 163/395 (41%), Gaps = 55/395 (13%)
Query: 84 DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-------- 135
+ R LH + T + IG P ++D+GS++ +V C CEQCG
Sbjct: 77 NARMRLHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNI 136
Query: 136 -----TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF 190
F P S TY+ + C+ CT C +C Y +Y S G +G + +F
Sbjct: 137 IEAHDPRFQPDLSSTYSPVKCNVD-CT--CDNERSQCTYERQYAEMSSSSGVLGEDIMSF 193
Query: 191 ETSDEGKTFLYDVGFGCSH-NNAHFSDEQFTGVFGLGPAT-SSTHSLVEK--VGSKFSYC 246
E K FGC + + G+ GLG S LVEK + FS C
Sbjct: 194 GKESELKP--QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLC 251
Query: 247 IGNLNYFEYAYNMLILGEGAILEGDSTPMSV-------IDGSYY-VTLEGISLGEKMLDI 298
G ++ + G +L G P + + YY + L+ I + K L +
Sbjct: 252 YGGMD---------VGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRL 302
Query: 299 DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LC 357
DP +F S G +DSGTT +L A+ + V + L DP + +C
Sbjct: 303 DPKIFN-----SKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDIC 357
Query: 358 YSG---NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGERF 412
++G N+++ + FP + F G L L E+ ++ S +CL V NG+
Sbjct: 358 FAGAGRNVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ---NGK-- 412
Query: 413 KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+++G I +N V YD ++++ F + +C L
Sbjct: 413 DPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSEL 447
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/395 (25%), Positives = 163/395 (41%), Gaps = 55/395 (13%)
Query: 84 DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-------- 135
+ R LH + T + IG P ++D+GS++ +V C CEQCG
Sbjct: 76 NARMRLHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNI 135
Query: 136 -----TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF 190
F P S TY+ + C+ CT C +C Y +Y S G +G + +F
Sbjct: 136 IEAHDPRFQPDLSSTYSPVKCNVD-CT--CDNERSQCTYERQYAEMSSSSGVLGEDIMSF 192
Query: 191 ETSDEGKTFLYDVGFGCSH-NNAHFSDEQFTGVFGLGPAT-SSTHSLVEK--VGSKFSYC 246
E K FGC + + G+ GLG S LVEK + FS C
Sbjct: 193 GKESELKP--QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLC 250
Query: 247 IGNLNYFEYAYNMLILGEGAILEGDSTPMSV-------IDGSYY-VTLEGISLGEKMLDI 298
G ++ + G +L G P + + YY + L+ I + K L +
Sbjct: 251 YGGMD---------VGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRL 301
Query: 299 DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LC 357
DP +F S G +DSGTT +L A+ + V + L DP + +C
Sbjct: 302 DPKIFN-----SKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDIC 356
Query: 358 YSG---NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGERF 412
++G N+++ + FP + F G L L E+ ++ S +CL V NG+
Sbjct: 357 FAGAGRNVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ---NGK-- 411
Query: 413 KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+++G I +N V YD ++++ F + +C L
Sbjct: 412 DPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSEL 446
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 148/368 (40%), Gaps = 46/368 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-----QCGATTFDPSKSLTYATLPCDS 153
+ +G P + V+D+GSSL W++C PC Q G +DP S TYA +PC +
Sbjct: 108 YITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGP-LYDPRASSTYAAVPCSA 166
Query: 154 SYCT---------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
C + C G C Y Y +G S G + + + +S F Y
Sbjct: 167 PQCAELQAATLNPSSCSGS-GVCQYQASYGDGSFSFGYLSKDTVSLSSSGSFPGFYY--- 222
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCI-----GNLNYFEYAYN 258
GC +N G+ GL S S L VG+ F+YC+ + Y + N
Sbjct: 223 -GCGQDNVGLFGRA-AGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAGYLSFGSN 280
Query: 259 MLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
G S+ Y+V+L G+S+ L + + + T IDS
Sbjct: 281 SDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEYGSLPT------IIDS 334
Query: 319 GTTLTWLVPSAYQTLRKEV-EDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
GT +T L Y L K V L P+Y + C+ G + + PA+ FA
Sbjct: 335 GTVITRLPTPVYTALSKAVGAALAAPSAPAYSI---LQTCFKGQVAK--LPVPAVNMAFA 389
Query: 378 GGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GGA L L +V + + CLA P+D +IIG QQ ++V YD+ ++
Sbjct: 390 GGATLRLTPGNVLVDVNETTTCLAFAPTD-------STAIIGNTQQQTFSVVYDVKGSRI 442
Query: 438 YFQRIDCE 445
F C
Sbjct: 443 GFAAGGCS 450
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 96/381 (25%), Positives = 169/381 (44%), Gaps = 49/381 (12%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C PC C +++ F+P S T +
Sbjct: 88 VGLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSS 147
Query: 148 TLPCDSSYCTND-------CGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFET---SDEG 196
+PC CT C + C Y Y +G + G S+ F+T +++
Sbjct: 148 KIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQT 207
Query: 197 KTFLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNL 250
+ FGCS++ + +D G+FG G S S + +G FS+C L
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---L 264
Query: 251 NYFEYAYNMLILGEGAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
+ +L+LGE I+E TP+ Y + LE I + + L ID +LF ++T
Sbjct: 265 KGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY-PMDPAWHLCYSGNINRDLQ 367
G +DSGTTL +L AY V + + PS + + C+ + + D
Sbjct: 323 ---QGTIVDSGTTLAYLADGAYDPF---VNAITAAVSPSVRSLVSKGNQCFVTSSSVD-S 375
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGMIAQ 423
FP ++ +F GG + + E+ Q++S ++C+ + + ++I+G +
Sbjct: 376 SFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIG-----WQRNQGQQITILGDLVL 430
Query: 424 QNYNVAYDLVSKQLYFQRIDC 444
++ YDL + ++ + DC
Sbjct: 431 KDKIFVYDLANMRMGWTDYDC 451
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 115/444 (25%), Positives = 183/444 (41%), Gaps = 73/444 (16%)
Query: 30 PAAGKPKRLVTK----LLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDT 85
P G P R +K + HRD L+ + +R N + + S + D
Sbjct: 50 PGDGLPNRDSSKYYRVMAHRDRLI---------RGRRLANEDQS-LVTFSDGNETIRVDA 99
Query: 86 RAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-------GATTF 138
LH Y N ++G P L LDTGS L W+ C C C G ++
Sbjct: 100 LGFLH---------YANVTVGTPSDWFLVALDTGSDLFWLPCD-CTNCVRELKAPGGSSL 149
Query: 139 D-----PSKSLTYATLPCDSSYCT--NDCGGYPDECWYNIRY-TNGPDSQGTIGSEQFNF 190
D P+ S T +PC+S+ CT + C C Y IRY +NG S G + + +
Sbjct: 150 DLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHL 209
Query: 191 ETSDE-GKTFLYDVGFGCSH-NNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG---SKFS 244
++D+ K V GC F D G+FGLG S S++ K G + FS
Sbjct: 210 VSNDKSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFS 269
Query: 245 YCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVID--GSYYVTLEGISLGEKMLDIDPNL 302
C GN ++ G+ ++ TP+++ +Y +T+ IS+ D++ +
Sbjct: 270 MCFGNDGAGRISF-----GDKGSVDQRETPLNIRQPHPTYNITVTKISVEGNTGDLEFD- 323
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDL-FQGLLPSYPMDPAWHLCYSGN 361
VF DSGT+ T+L +AY + + L + + + CY+ +
Sbjct: 324 ----------AVF-DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALS 372
Query: 362 INRDLQGFPAMAFHFAGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGM 420
N+D +PA+ GG+ V V + + V+CLA+ + +D+SIIG
Sbjct: 373 PNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAI-------LKIEDISIIGQ 425
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDC 444
Y V +D L ++ DC
Sbjct: 426 NFMTGYRVVFDREKLILGWKESDC 449
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 152/381 (39%), Gaps = 54/381 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSY 155
++++ +G PP +LDTGS L W++C PC C + +DP S ++ + C
Sbjct: 197 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 256
Query: 156 CT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFE-TSDEGKTFLY---DV 203
C C C Y Y +G ++ G E F T+ G + L +V
Sbjct: 257 CQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVENV 316
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC H N + S + G FSYC+ + N + LI G
Sbjct: 317 MFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFG 376
Query: 264 EGAILEGDST---------PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW----- 309
E L +D YYV ++ + + +++L I +TW
Sbjct: 377 EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKI------PEETWHLSSE 430
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLR----KEVE--DLFQGLLPSYPMDPAWHLCYSGNIN 363
G IDSGTTLT+ AY+ ++ ++++ L +GL P P CY+ +
Sbjct: 431 GAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKP-------CYNVSGI 483
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
++ P FA A E+ F V CLA I G LSIIG Q
Sbjct: 484 EKME-LPDFGILFADEAVWNFPVENYFIWIDPEVVCLA-----ILGNPRSALSIIGNYQQ 537
Query: 424 QNYNVAYDLVSKQLYFQRIDC 444
QN+++ YD+ +L + + C
Sbjct: 538 QNFHILYDMKKSRLGYAPMKC 558
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 156/370 (42%), Gaps = 62/370 (16%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ +N S+G P + V DTGS LIW +C PC +C A F P+ S T++ LPC SS+
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 156 C------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
C C C YN +Y +G + G + +E G V FGCS
Sbjct: 146 CQFLPNSIRTCNA--TGCVYNYKYGSG-YTAGYLATETLKV-----GDASFPSVAFGCST 197
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
N GLG L VG +FSYC+ + A +L + +
Sbjct: 198 EN------------GLG-------QLDLGVG-RFSYCLRS-GSAAGASPILFGSLANLTD 236
Query: 270 GD--STPM----SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
G+ STP +V YYV L GI++GE L + + F G +DSGTTLT
Sbjct: 237 GNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLT 296
Query: 324 WLVPSAYQTLRK----EVEDLFQGLLPSYPMDPAWHLCY-SGNINRDLQGFPAMAFHFAG 378
+L Y+ +++ + D + + LC+ S P++ F G
Sbjct: 297 YLAKDGYEMVKQAFLSQTAD-----VTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDG 351
Query: 379 GADLVL----DAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
GA+ + Q S +V CL + P+ + + +S+IG + Q + ++ YDL
Sbjct: 352 GAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGD----QPMSVIGNVMQMDMHLLYDLDG 407
Query: 435 KQLYFQRIDC 444
F DC
Sbjct: 408 GIFSFAPADC 417
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 174/380 (45%), Gaps = 50/380 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYATL 149
+++ +G PP +DTGS ++WV C C C ++ FD SLT ++
Sbjct: 104 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 163
Query: 150 PCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
C C++ C ++C Y+ RY +G + G ++ F F+ + G++ + +
Sbjct: 164 TCSDPICSSVFQTTAAQC-SENNQCGYSFRYGDGSGTSGYYMTDTFYFD-AILGESLVAN 221
Query: 203 ----VGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNLNY 252
+ FGCS + SD+ G+FG G S S + G FS+C L
Sbjct: 222 SSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC---LKG 278
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
+ +LGE + +P+ Y + L I + +ML +D +F+ ++T
Sbjct: 279 DGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNT---R 335
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQGFP 370
G +D+GTTLT+LV AY + + L+ P+ CY S +I+ D+ FP
Sbjct: 336 GTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVT--PIISNGEQCYLVSTSIS-DM--FP 390
Query: 371 AMAFHFAGGADLVLDAESVFYQ----ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
+++ +FAGGA ++L + + + +S++C+ + ++ +I+G + ++
Sbjct: 391 SVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAP------EEQTILGDLVLKDK 444
Query: 427 NVAYDLVSKQLYFQRIDCEL 446
YDL +++ + DC +
Sbjct: 445 VFVYDLARQRIGWASYDCSM 464
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 93/343 (27%), Positives = 144/343 (41%), Gaps = 41/343 (11%)
Query: 82 AH-DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TT 137
AH + R LH + T + IG PP ++D+GS++ +V C CEQCG
Sbjct: 71 AHPNARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPR 130
Query: 138 FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK 197
F P S +Y+ + C+ CT C +C Y +Y S G +G + +F E K
Sbjct: 131 FQPDLSSSYSPVKCNVD-CT--CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELK 187
Query: 198 TFLYDVGFGCSHNN-AHFSDEQFTGVFGLGPAT-SSTHSLVEK--VGSKFSYCIGNLNYF 253
FGC ++ + G+ GLG S LVEK + FS C G ++
Sbjct: 188 A--QRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMD-- 243
Query: 254 EYAYNMLILGEGAILEGDSTPMSVIDGS--------YYVTLEGISLGEKMLDIDPNLFKK 305
I G +L G TP ++ Y + L+ I + K L +D +F
Sbjct: 244 -------IGGGAMVLGGVPTPSDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFD- 295
Query: 306 NDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSG---N 361
S G +DSGTT +L A+ + V L DP++ +C++G N
Sbjct: 296 ----SKHGTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRN 351
Query: 362 INRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAV 402
+++ + FP + F G L L E+ ++ S +CL V
Sbjct: 352 VSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGV 394
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 149/368 (40%), Gaps = 55/368 (14%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKC-----QPCEQCGATTFDPSKSLTYATLPCDSSY 155
+ FS+G PP A+ DTGS LIW KC CE G+ ++ P+ S T+A LPC
Sbjct: 93 MEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSDRL 152
Query: 156 CT---ND----CGGYPDECWYNIRYTNGPD----SQGTIGSEQFNFETSDEGKTFLYDVG 204
C+ +D C EC Y Y G D +QG + E F G + V
Sbjct: 153 CSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTL-----GADAVPSVR 207
Query: 205 FGC-SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLIL 262
FGC + + + G GP SLV ++ S F YC L + L+
Sbjct: 208 FGCTTASEGGYGSGSGLVGLGRGPL-----SLVSQLNASTFMYC---LTSDASKASPLLF 259
Query: 263 GEGAILEG---DSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
G A L G ST + Y V L IS+G P + + GV DSG
Sbjct: 260 GSLASLTGAQVQSTGLLASTTFYAVNLRSISIGSA---TTPGVGEPE------GVVFDSG 310
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ--GFPAMAFHFA 377
TTLT+L AY + L Q L + C+ N L P M HF
Sbjct: 311 TTLTYLAEPAYSEAKAAF--LSQTSLDQVEDTDGFEACFQKPANGRLSNAAVPTMVLHF- 367
Query: 378 GGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GAD+ L + + V C V +R LSIIG I Q NY V +D+ L
Sbjct: 368 DGADMALPVANYVVEVEDGVVCWIV-------QRSPSLSIIGNIMQVNYLVLHDVHRSVL 420
Query: 438 YFQRIDCE 445
FQ +C+
Sbjct: 421 SFQPANCD 428
>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
Length = 414
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 117/438 (26%), Positives = 179/438 (40%), Gaps = 75/438 (17%)
Query: 31 AAGKPKRLVTKLLHRDSLL--YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHD---- 84
A KP +L+HRDS + P +++ R L + S +AH+
Sbjct: 25 ATSKPNGFRLQLIHRDSPESPFYPGKLTNSE----------RISRLVEFSKIRAHNFDSG 74
Query: 85 --TRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIW-VKCQPCEQCGATTFDPS 141
+ A P + V IG P +P V DTGS+LIW V Q QC
Sbjct: 75 FSSEAFRPPVFQDFTCYLVKVRIGNPGIPLYLVPDTGSALIWTVNNQNIFQCRN------ 128
Query: 142 KSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
++C Y RY +G + G + E S+ +
Sbjct: 129 -----------------------NKCSYTRRYDDGSITTGVAAQDILQSEGSERIPFY-- 163
Query: 202 DVGFGCSHNNAHFSDEQFTGVFG--LGPATSSTHSLVEKVG----SKFSYCIGNLNY-FE 254
FGCS +N +FS + TG G +G TS SL++++ +FSYC+ + E
Sbjct: 164 ---FGCSRDNQNFSVFEHTGKSGGVMGLNTSPV-SLLQQLSHITQRRFSYCLNPYQHGSE 219
Query: 255 YAYNMLILGEGAILEG----DSTP-MSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDT 308
+ L+ I +G STP MS D +Y++ L +++ + L + P F
Sbjct: 220 PPPSSLLRFGNDIRKGRRRFQSTPLMSSPDRPNYFLNLLDMTVAGQRLHLPPGTFALRQD 279
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
+ G IDSGT LT++ +AY L ++ F P + LCYS N
Sbjct: 280 GT-GGTIIDSGTGLTFITQTAYPRLISAFQNYFDHRGFQRVHIPEFDLCYSFRGNHTFHD 338
Query: 369 FPAMAFHFAGGADLVLDAESVFY-QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
+M FHF AD + A+ V+ E + FC+A+ P+ + ++IG I Q N
Sbjct: 339 HASMTFHFE-RADFTVQADYVYLPMEDDNAFCVALQPTPP-----QQRTVIGAINQGNTR 392
Query: 428 VAYDLVSKQLYFQRIDCE 445
YD + QL F +C
Sbjct: 393 FIYDAAAHQLLFIAENCR 410
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 147/361 (40%), Gaps = 49/361 (13%)
Query: 110 VPQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYC------TN 158
V Q V+DT S + WV+C PC C A T +DPSKS + A PC S C N
Sbjct: 154 VAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYAN 213
Query: 159 DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH---NNAHFS 215
C D+C Y ++Y +G S GT S+ + + + FGCSH FS
Sbjct: 214 GCTPAGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASA-ISEFRFGCSHALLQPGSFS 272
Query: 216 DEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIG----NLNYFEYAYNMLILGEGAI--- 267
++ +G+ LG S + + G FSYC+ + +F + A+
Sbjct: 273 NKT-SGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFILGVPRVAASRYAVTPM 331
Query: 268 LEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
L + PM Y V L I + K L + P +F AG +DS T +T L P
Sbjct: 332 LRSKAAPM-----LYLVRLIAIEVAGKRLPVPPAVFA-------AGAVMDSRTIVTRLPP 379
Query: 328 SAYQTLRKE-VEDL--FQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV- 383
+AY LR V ++ ++ P +D + + P + F G V
Sbjct: 380 TAYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVE 439
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
LD V CLA P+ + + IIG + QQ V Y++ + F+R
Sbjct: 440 LDPSGVLLDG-----CLAFAPNTDD----QMTGIIGNVQQQALEVLYNVDGATVGFRRGA 490
Query: 444 C 444
C
Sbjct: 491 C 491
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 109/440 (24%), Positives = 175/440 (39%), Gaps = 56/440 (12%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQR-TLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV- 98
KL H SL PN T A + R+ + + A+ + + P ++ +P+
Sbjct: 34 KLYHMTSLKSPPNSTSLLFAYMFAKDEERIRYFHSRLAKNSDANASSKKVGPKLAGIPLK 93
Query: 99 ---------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QCGAT---TFDPSKSLT 145
+YV +G P ++DTGSS W++CQPC C F+PS S T
Sbjct: 94 SGLSMGSGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKT 153
Query: 146 YATLPC---------DSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEG 196
Y T+PC ++ C + C Y Y + S G + + S
Sbjct: 154 YKTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTL 213
Query: 197 KTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCI----GNLN 251
+F+Y GC +N G+ GL S S L K G+ FSYC+ N
Sbjct: 214 SSFVY----GCGQDNQGLFGRT-DGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPN 268
Query: 252 YFEYAYNMLILGEGAILEGDS---TPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKK 305
+ + L +G ++ S TP+ + Y++ LE I++ + L + + +K
Sbjct: 269 SPKEGF--LSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYK- 325
Query: 306 NDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRD 365
IDSGT +T L Y TL+ + P C+ G++
Sbjct: 326 ------VPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGI 379
Query: 366 LQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
+ P + F GGADL L + + + + CLA+ S ++IIG QQ
Sbjct: 380 SEVAPDIRIIFKGGADLQLKGHNSLVELETGITCLAMAGSS-------SIAIIGNYQQQT 432
Query: 426 YNVAYDLVSKQLYFQRIDCE 445
VAYD+ + ++ F C+
Sbjct: 433 VKVAYDVGNSRVGFAPGGCQ 452
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 151/355 (42%), Gaps = 46/355 (12%)
Query: 108 PPVPQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYCT----- 157
P V Q VLD+ S + WV+C PC C +DPS+S T A C S CT
Sbjct: 25 PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPY 84
Query: 158 -NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD 216
N C ++C Y +RY PD T G+ + T D G + FGCSH D
Sbjct: 85 ANGCAN--NQCQYLVRY---PDGSSTSGAYIADLLTLDAGNA-VSGFKFGCSHAEQGSFD 138
Query: 217 EQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM 275
+ G+ LG S S + G+ FSYCI + + L + A TPM
Sbjct: 139 ARAAGIMALGGGPESLLSQTASRYGNAFSYCI-PATASDSGFFTLGVPRRASSRYVVTPM 197
Query: 276 SVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQT 332
+ Y V L I++G + L + P +F AG +DS T +T L P+AYQ
Sbjct: 198 VRFRQAATFYGVLLRTITVGGQRLGVAPAVFA-------AGSVLDSRTAITRLPPTAYQA 250
Query: 333 LRKEVED---LFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESV 389
LR +++ P +D + ++G +N L P ++ F A L LD +
Sbjct: 251 LRAAFRSSMTMYRSAPPKGYLDTCYD--FTGVVNIRL---PKISLVFDRNAVLPLDPSGI 305
Query: 390 FYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + CLA + +R ++G + QQ V YD+ + F++ C
Sbjct: 306 LFND-----CLAF--TSNADDRMP--GVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 98/383 (25%), Positives = 158/383 (41%), Gaps = 48/383 (12%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-----GATTFDPSKSLTYATLPCDSSY 155
V+ ++G PP VLDTGS L W+ C P A +F P SLT+A++PC S+
Sbjct: 67 VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQ 126
Query: 156 CTND-------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
C + C G +C ++ Y +G S G + +E F G+ FGC
Sbjct: 127 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV-----GQGPPLRAAFGCM 181
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI- 267
S + LG + + + +FSYCI + + +L+LG +
Sbjct: 182 ATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCISDRD----DAGVLLLGHSDLP 237
Query: 268 --------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
L + P+ D +Y V L GI +G K L I ++ + T + +DS
Sbjct: 238 FLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQ-TMVDS 296
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPS-----YPMDPAWHLCYSGNINRDLQG-FPAM 372
GT T+L+ AY L+ E + LP+ + A+ C+ R PA+
Sbjct: 297 GTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAV 356
Query: 373 AFHFAGGADLVLDAESVFY------QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
F GA + + + + Y + V+CL G +D+ +IG Q N
Sbjct: 357 TLLF-NGAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVP---ITAYVIGHHHQMNV 412
Query: 427 NVAYDLVSKQLYFQRIDCELLAD 449
V YDL ++ I C++ ++
Sbjct: 413 WVEYDLERGRVGLAPIRCDVASE 435
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 166/379 (43%), Gaps = 59/379 (15%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-------GATTFD-----PSKSLTYA 147
Y N ++G P + LDTGS L W+ C C C G ++ D P+ S T
Sbjct: 56 YANVTVGTPSDWFMVALDTGSDLFWLPCD-CTNCVRELKAPGGSSLDLNIYSPNASSTST 114
Query: 148 TLPCDSSYCT--NDCGGYPDECWYNIRY-TNGPDSQGTIGSEQFNFETSDE-GKTFLYDV 203
+PC+S+ CT + C +C Y IRY +NG S G + + + ++D+ K V
Sbjct: 115 KVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARV 174
Query: 204 GFGCSH-NNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEYAYN 258
FGC F D G+FGLG S S++ K G + FS C GN ++
Sbjct: 175 TFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISF- 233
Query: 259 MLILGEGAILEGDSTPMSVID--GSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
G+ ++ TP+++ +Y +T+ IS+G D++ + VF
Sbjct: 234 ----GDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFD-----------AVF- 277
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDL-----FQGLLPSYPMDPAWHL---CYSG--NINRDL 366
DSGT+ T+L +AY + + L +Q P + + L YSG + N+D
Sbjct: 278 DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDS 337
Query: 367 QGFPAMAFHFAGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
+PA+ GG+ V V + + V+CLA+ + +D+SIIG
Sbjct: 338 FQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAI-------MKIEDISIIGQNFMTG 390
Query: 426 YNVAYDLVSKQLYFQRIDC 444
Y V +D L ++ DC
Sbjct: 391 YRVVFDREKLILGWKESDC 409
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 108/419 (25%), Positives = 172/419 (41%), Gaps = 65/419 (15%)
Query: 69 ARFIY--LSQKSSQKAHDTRAHLHPG---ISTVPV----------FYVNFSIGQPPVPQL 113
RF++ L+ K S + T L G +ST P+ +YV +G P
Sbjct: 68 VRFLHSRLTNKESVRNSATTDKLRGGPSLVSTTPLKSGLSIGSGNYYVKIGLGTPAKYFS 127
Query: 114 AVLDTGSSLIWVKCQPCE-QCGATT---FDPSKSLTYATLPC-------------DSSYC 156
++DTGSSL W++CQPC C F PS S TY LPC ++ C
Sbjct: 128 MIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSSQCSSLKSSTLNAPGC 187
Query: 157 TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT-FLYDVGFGCSHNNAHFS 215
+N G C Y Y + S G + + S+ + F+Y GC +N
Sbjct: 188 SNATGA----CVYKASYGDTSFSIGYLSQDVLTLTPSEAPSSGFVY----GCGQDNQGLF 239
Query: 216 DEQFTGVFGLG-PATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNM---LILGEGAILEG- 270
+G+ GL S L +K G+ FSYC+ + + ++ L +G ++
Sbjct: 240 GRS-SGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLSIGASSLTSSP 298
Query: 271 -DSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
TP+ I Y++ L I++ K L + + + + IDSGT +T L
Sbjct: 299 YKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSY-------NVPTIIDSGTVITRLP 351
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDA 386
+ Y L+K + P C+ G++ +++ P + F GGA L L A
Sbjct: 352 VAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSV-KEMSTVPEIQIIFRGGAGLELKA 410
Query: 387 ESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ + CLA+ S +SIIG QQ + VAYD+ + ++ F C+
Sbjct: 411 HNSLVEIEKGTTCLAIAASS------NPISIIGNYQQQTFKVAYDVANFKIGFAPGGCQ 463
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 151/371 (40%), Gaps = 69/371 (18%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-----------------FDP 140
++Y N S+G PP L LDTGS L W+ C CG T + P
Sbjct: 101 LYYANVSVGTPPSSFLVALDTGSDLFWLPC----NCGTTCIRDLEDIGVPQSVPLNLYTP 156
Query: 141 SKSLTYATLPCDSSYC--TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
+ S T +++ C C + C C Y I Y+N ++GT+ + + T DE T
Sbjct: 157 NASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLT 216
Query: 199 FLY-DVGFGCSHNNAHF--SDEQFTGVFGLGPATSSTHSLVEKV---GSKFSYC----IG 248
+ +V GC + GV GLG S SL+ K + FS C IG
Sbjct: 217 PVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIG 276
Query: 249 NLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKN 306
N+ + G+ + + TP + S Y V + G+S+ +DI LF K
Sbjct: 277 NVGRISF-------GDRGYTDQEETPFISVAPSTAYGVNISGVSVAGDPVDI--RLFAK- 326
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYSGNINR 364
D+G++ T L AY L K ++L + P+DP + CY + N
Sbjct: 327 ---------FDTGSSFTHLREPAYGVLTKSFDELVEDR--RRPVDPELPFEFCYDLSPNA 375
Query: 365 DLQGFPAMAFHFAGGADLVLDAE--SVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIA 422
FP + F GG+ ++L+ + QE + ++CL V K + + +
Sbjct: 376 TTIQFPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGV---------LKSVGLKINVI 426
Query: 423 QQNYNVAYDLV 433
QN+ Y +V
Sbjct: 427 GQNFVAGYRIV 437
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 155/371 (41%), Gaps = 70/371 (18%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSKSLTYATLPCDSSYCTN- 158
+G P + V+DTGSSL W++C PC Q G F+P S TYA++ C + C++
Sbjct: 3 LGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGP-VFNPKSSSTYASVGCSAQQCSDL 61
Query: 159 -------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
+ C Y Y + S G + + +F G T L + +GC +N
Sbjct: 62 PSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSLPNFYYGCGQDN 116
Query: 212 AHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYCI-----------GNLNYFEYAYNM 259
G+ GL S + L +G F+YC+ G+ N +Y+Y
Sbjct: 117 EGLFGRS-AGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYNPGQYSY-- 173
Query: 260 LILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
TPM S+ D Y++ L G+++ L + + + T I
Sbjct: 174 -------------TPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPT------II 214
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGL--LPSYPMDPAWHLCYSGNINRDLQGFPAMAF 374
DSGT +T L S Y L K V +G +Y + C+ G +R PA+
Sbjct: 215 DSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSI---LDTCFKGQASR--VSAPAVTM 269
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
FAGGA L L A+++ S CLA P+ + +IIG QQ ++V YD+ S
Sbjct: 270 SFAGGAALKLSAQNLLVDVDDSTTCLAFAPA-------RSAAIIGNTQQQTFSVVYDVKS 322
Query: 435 KQLYFQRIDCE 445
++ F C
Sbjct: 323 SRIGFAAGGCS 333
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 104/386 (26%), Positives = 158/386 (40%), Gaps = 58/386 (15%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---------GATTFDPSKSLTYAT 148
++Y +G PPV +DTGS + W+ C PC C TT+DPS+S T
Sbjct: 36 LYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGA 95
Query: 149 LPCDSSYCTNDCG---------GYPDECWYNIRYTNGPDSQGTIGSEQFNFE-----TSD 194
L C S C G GY C Y+ Y +G +QG + F+ T
Sbjct: 96 LSCRDSNCGAALGSNEVSCTSAGY---CAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQV 152
Query: 195 EGKTFLYDVGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIG 248
G +Y FGC N S G+ G G A S S + KVG++F++C+
Sbjct: 153 NGTASVY---FGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQ 209
Query: 249 NLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
N +++G + TP+ V Y V ++ I++ + + P F T
Sbjct: 210 GDN---QGGGTIVIGSVSEPNISYTPI-VSRNHYAVGMQNIAVNGRNV-TTPASFDTTST 264
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP--MDPAWHLCYSGNINRDL 366
S GV +DSGTTL +LV AY V + S+ + AW C L
Sbjct: 265 -SAGGVIMDSGTTLAYLVDPAYTQFVNAVSTFESSMFSSHSQCLQLAW--C-------SL 314
Query: 367 QG-FPAMAFHFAGGADLVLDAESVFY----QESSSVFCLAVGPSDINGERFKDLSIIGMI 421
Q FP + F GA + L + Y Q + +C+ S + SI+G I
Sbjct: 315 QADFPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAG-YLSYSILGDI 373
Query: 422 AQQNYNVAYDLVSKQLYFQRIDCELL 447
+++ V YD ++ + ++ DC+
Sbjct: 374 VLKDHLVVYDNDNRVVGWKSFDCKFF 399
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 106/386 (27%), Positives = 160/386 (41%), Gaps = 52/386 (13%)
Query: 86 RAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSK 142
R LH + T + IG PP ++DTGS++ +V C C CG F P+
Sbjct: 22 RMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPAL 81
Query: 143 SLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF-ETSDEGKTFLY 201
S +Y L C S T C G Y +Y S G +G + F +SD G L
Sbjct: 82 SSSYKPLECGSECSTGFCDGSRK---YQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRLV 138
Query: 202 DVGFGC-SHNNAHFSDEQFTGVFGLGPA-TSSTHSLVEKVGSK--FSYCIGNLNYFEYAY 257
FGC + D+ G+ GLG S LVEK + FS C G ++
Sbjct: 139 ---FGCETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDE----- 190
Query: 258 NMLILGEGAILEGDSTPMS--VIDGS-------YYVTLEGISLGEKMLDIDPNLFKKNDT 308
G GA++ G P V S Y + L+GI +G L + P +F
Sbjct: 191 -----GGGAMILGGFQPPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFD---- 241
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL--LPSYPMDPAWHLCYSG---NIN 363
G +DSGTT + +A+Q + V++ L +P P + +CY+G N++
Sbjct: 242 -GKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSLKEVPG-PDEKFKDICYAGAGTNVS 299
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFYQES--SSVFCLAVGPSDINGERFKDLSIIGMI 421
Q FP++ F F G + L E+ ++ + S +CL V E +++G I
Sbjct: 300 NLSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISGAYCLGV------FENGDPTTLLGGI 353
Query: 422 AQQNYNVAYDLVSKQLYFQRIDCELL 447
+N V Y+ + F + C L
Sbjct: 354 IVRNMLVTYNRGKASIGFLKTKCNDL 379
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 113/470 (24%), Positives = 199/470 (42%), Gaps = 61/470 (12%)
Query: 1 MPSSHAILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQA 60
M + ++L+ +++ L F + I++S+ A+ P R V + + +P + QA
Sbjct: 1 MTQTWSLLISAIVILSFVT--IYSSS----ASQIPNRGVRRPMIFPLYFASPKSSGHRQA 54
Query: 61 QRTLNMSMARFIYLSQKSSQKAH-DTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTG 119
+ KS H + R L+ + + + IG PP ++DTG
Sbjct: 55 IE------GSYWRRHLKSDPYHHPNARMRLYDDLLSNGYYTTRLWIGTPPQEFALIVDTG 108
Query: 120 SSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNG 176
S++ +V C CE CG F P +S TY + C+ C D G C Y RY
Sbjct: 109 STVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCNMD-CNCDHDGV--NCVYERRYAEM 165
Query: 177 PDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-NNAHFSDEQFTGVFGLGPAT-SSTHS 234
S G +G + +F ++ + FGC + ++ G+ GLG S
Sbjct: 166 SSSSGVLGEDIISF--GNQSEVVPQRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQ 223
Query: 235 LVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS---------YY 283
LV+K + FS C G ++ +G GA++ G P + S Y
Sbjct: 224 LVDKNVINDSFSLCYGGMH----------VGGGAMVLGGIPPPPDMVFSRSDPYRSPYYN 273
Query: 284 VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG 343
+ L+ I + K L + P+ F + G +DSGTT +L A+ R +
Sbjct: 274 IELKEIHVAGKPLKLSPSTFDRKH-----GTVLDSGTTYAYLPEEAFVAFRDAIIKKSHN 328
Query: 344 LLPSYPMDPAWH-LCYSG---NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SV 397
L + DP ++ +C+SG ++++ + FP + F+ G L L E+ +Q +
Sbjct: 329 LKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFSNGQKLSLTPENYLFQHTKVHGA 388
Query: 398 FCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+CL + NG+ +++G I +N V YD ++++ F + +C L
Sbjct: 389 YCLGIFR---NGD---STTLLGGIIVRNTLVTYDRENEKIGFWKTNCSEL 432
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 106/451 (23%), Positives = 181/451 (40%), Gaps = 56/451 (12%)
Query: 34 KPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSM-ARFIYLSQK--SSQKAHDTRAH-- 88
+ + ++ +L +LL +P + A A L + ++F K + +AHD H
Sbjct: 5 RRQWFLSAILLSAALLIDPQFSTAATASENLVFEVRSKFAGKRVKDLGALRAHDVHRHSR 64
Query: 89 LHPGI----------STVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC----- 133
L I ++ +++ +G P +DTGS ++WV C C +C
Sbjct: 65 LLSAIDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSD 124
Query: 134 --GATTFDPSKSLTYATLPCDSSYCT-----NDCGGYPDECWYNIRYTNGPDSQGTIGSE 186
T +D S T ++ C ++C+ ++C C Y I Y +G + G + +
Sbjct: 125 LVELTPYDVDASSTAKSVSCSDNFCSYVNQRSECHS-GSTCQYVIMYGDGSSTNGYLVKD 183
Query: 187 QF-------NFETSDEGKTFLYDVGFGC-SHNNAHFSDEQ--FTGVFGLGPATSSTHSLV 236
N +T T + FGC S + + Q G+ G G + SS S +
Sbjct: 184 VVHLDLVTGNRQTGSTNGTII----FGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQL 239
Query: 237 E---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGE 293
KV F++C+ N N + +GE + +TPM Y V L I +G
Sbjct: 240 ASQGKVKRSFAHCLDNNN----GGGIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGN 295
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA 353
+L++ N F D D GV IDSGTTL +L + Y L E+ L +
Sbjct: 296 SVLELSSNAFDSGD---DKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESF 352
Query: 354 WHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
Y+ ++R FP + F F L + +Q +C + +
Sbjct: 353 TCFHYTDKLDR----FPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGA 408
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
L+I+G +A N V YD+ ++ + + +C
Sbjct: 409 SLTILGDMALSNKLVVYDIENQVIGWTNHNC 439
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 104/392 (26%), Positives = 163/392 (41%), Gaps = 51/392 (13%)
Query: 71 FIYLSQKSSQ-KAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP 129
F Y++ K+S+ + G+ T ++ ++ +G P Q+ +DTGSS WV C+
Sbjct: 54 FRYITNKTSRLSTKAVQVGWDRGLQT-SLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE- 111
Query: 130 CEQC--GATTFDPSKSLTYATLPCDSSYC--------TNDCGGYPDECWYNIRYTNGPDS 179
C+ C TF S+S T A + C +S C D YPD C + + Y +G S
Sbjct: 112 CDGCHTNPRTFLQSRSTTCAKVSCGTSMCLLGGSDPHCQDSENYPD-CPFRVSYQDGSAS 170
Query: 180 QGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQF---TGVFGLGPATSSTHSLV 236
G + + F + F FGC N F +F G+ G+G S
Sbjct: 171 YGILYQDTLTFSDVQKIPGF----SFGC--NMDSFGANEFGNVDGLLGMGAGPMSVLKQS 224
Query: 237 EKVGSKFSYCI----GNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS----YYVTLEG 288
FSYC+ +F LG+ A V ++V L
Sbjct: 225 SPTFDCFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTA 284
Query: 289 ISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY 348
IS+ + L + P++F + GV DSG+ L+++ A L + + +L L
Sbjct: 285 ISVDGERLGLSPSVFSRK------GVVFDSGSELSYIPDRALSVLSQRIRELL--LKRGA 336
Query: 349 PMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQES---SSVFCLAVGPS 405
+ + CY + D PA++ HF GA L + VF + S V+CLA P+
Sbjct: 337 AEEESERNCYDMR-SVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPT 395
Query: 406 DINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
+ +SIIG + Q + V YDL +QL
Sbjct: 396 E-------SVSIIGSLMQTSKEVVYDL-KRQL 419
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 138/297 (46%), Gaps = 28/297 (9%)
Query: 166 ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGL 225
+C+Y Y + + G I + F F + + + ++ FGC N +G+ G
Sbjct: 32 QCFYLCSYGDRSITAGHIFKDTFTFMSPNGVPVAVSELAFGCGDYNTGLFVSNESGIAGF 91
Query: 226 GPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEG---------DSTPM- 275
G S S + KVG +FSYC+ + E +++ILG +G STP+
Sbjct: 92 GRGPQSLPSQL-KVG-RFSYCLTLVT--ESKSSVVILGTPPDPDGLRAHTTGPFQSTPII 147
Query: 276 --SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTL 333
+I YY++LEGI++G+ L D ++F S G IDSGT+LT L + ++ L
Sbjct: 148 YNPLIPTFYYLSLEGITVGKTRLPFDKSVFALKKDGS-GGTVIDSGTSLTTLPEAVFELL 206
Query: 334 RKEVEDLFQGLLPSYPMDPAW--HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY 391
++E+ F LP Y P LC+ P + H AG AD+ L ++ F
Sbjct: 207 QEELVAQFP--LPRYDNTPEVGDRLCFRRPKGGKQVPVPKLILHLAG-ADMDLPRDNYFV 263
Query: 392 QE-SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+E S V CL ING + +IG QQN +V YD+ + +L F C+ L
Sbjct: 264 EEPDSGVMCL-----QINGAEDTTMVLIGNFQQQNMHVVYDVENNKLLFAPAQCDKL 315
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 169/381 (44%), Gaps = 49/381 (12%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C PC C +++ F+P S T +
Sbjct: 88 VGLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSS 147
Query: 148 TLPCDSSYCTND-------CGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFET---SDEG 196
+PC CT C + C Y Y +G + G S+ F++ +++
Sbjct: 148 KIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQT 207
Query: 197 KTFLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNL 250
+ FGCS++ + +D G+FG G S S + +G FS+C L
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---L 264
Query: 251 NYFEYAYNMLILGEGAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
+ +L+LGE I+E TP+ Y + LE I + + L ID +LF ++T
Sbjct: 265 KGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY-PMDPAWHLCYSGNINRDLQ 367
G +DSGTTL +L AY V + + PS + + C+ + + D
Sbjct: 323 ---QGTIVDSGTTLAYLADGAYDPF---VNAITAAVSPSVRSLVSKGNQCFVTSSSVD-S 375
Query: 368 GFPAMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGMIAQ 423
FP ++ +F GG + + E+ Q++S ++C+ + + ++I+G +
Sbjct: 376 SFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIG-----WQRNQGQQITILGDLVL 430
Query: 424 QNYNVAYDLVSKQLYFQRIDC 444
++ YDL + ++ + DC
Sbjct: 431 KDKIFVYDLANMRMGWTDYDC 451
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 104/385 (27%), Positives = 160/385 (41%), Gaps = 64/385 (16%)
Query: 89 LHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-----EQCGATTFDPSK 142
L PG S V + +G P + V+DTGSSL W++C PC Q G F+P
Sbjct: 110 LGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGP-VFNPRS 168
Query: 143 ---------------SLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQ 187
+LT ATL + S C+ + C Y Y + S G + +
Sbjct: 169 SSSYASVSCSAPQCDALTTATL--NPSTCSTS-----NVCIYQASYGDSSFSVGYLSKDT 221
Query: 188 FNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGP-ATSSTHSLVEKVGSKFSYC 246
+F G T + + +GC +N Q G+ GL S + L +G FSYC
Sbjct: 222 VSF-----GSTSVPNFYYGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSMGYSFSYC 275
Query: 247 IGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLF 303
+ + ++ G + TPM S+ D Y++ + GI++ K L + +
Sbjct: 276 LPTSSSSSGYLSIGSYNPG---QYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSAS-- 330
Query: 304 KKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSG 360
+S IDSGT +T L Y L K V +G P A+ + C+ G
Sbjct: 331 ----AYSSLPTIIDSGTVITRLPTDVYSALSKAVAGAMKGT----PRASAFSILDTCFQG 382
Query: 361 NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGM 420
+R P ++ FAGGA L L A ++ S+ CLA P+ + +IIG
Sbjct: 383 QASR--LRVPQVSMAFAGGAALKLKATNLLVDVDSATTCLAFAPA-------RSAAIIGN 433
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDCE 445
QQ ++V YD+ + ++ F C
Sbjct: 434 TQQQTFSVVYDVKNSKIGFAAGGCS 458
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 152/363 (41%), Gaps = 50/363 (13%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCG 161
IG PP ++DTGS++ +V C C+QCG F P S +Y L C+ +D G
Sbjct: 82 IGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCNPDCNCDDEG 141
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN-AHFSDEQFT 220
C Y RY S G + + +F +E + FGC + ++
Sbjct: 142 KL---CVYERRYAEMSSSSGVLSEDLISF--GNESQLSPQRAVFGCENEETGDLFSQRAD 196
Query: 221 GVFGLGPAT-SSTHSLVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV 277
G+ GLG S LV+K + FS C G + +G GA++ G +P
Sbjct: 197 GIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME----------VGGGAMVLGKISPPPG 246
Query: 278 IDGS---------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+ S Y + L+ + + K L ++P +F G +DSGTT +
Sbjct: 247 MVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN-----GKHGTVLDSGTTYAYFPKE 301
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAW-HLCYSGNINRDLQG----FPAMAFHFAGGADLV 383
A+ ++ V L + DP + +C+SG RD+ FP +A F G L+
Sbjct: 302 AFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSG-AGRDVAEIHNFFPEIAMEFGNGQKLI 360
Query: 384 LDAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
L E+ ++ + +CL + P +++G I +N V YD + +L F +
Sbjct: 361 LSPENYLFRHTKVRGAYCLGIFPDR------DSTTLLGGIVVRNTLVTYDRENDKLGFLK 414
Query: 442 IDC 444
+C
Sbjct: 415 TNC 417
>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
Length = 392
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 110/396 (27%), Positives = 169/396 (42%), Gaps = 65/396 (16%)
Query: 85 TRAHLHPGISTVPVFYVNFSIGQPPVPQL-AVLDTGSSLIWVKCQPCEQCGATTFDPSKS 143
T A + P S + FY+ Q P + AV+DTGS++ W + C S+S
Sbjct: 22 TLAFMTPRTSCI-TFYLG---NQRPKDNISAVVDTGSNIFWTTEKEC----------SRS 67
Query: 144 LTYATLPCDSSYCTN--DCGGYPDE----------CWYNIRYT-NGPDS-QGTIGSEQFN 189
T + LPC S C CG E C Y I+Y N DS G + ++
Sbjct: 68 KTRSMLPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYGGNANDSTAGVLYEDKLT 127
Query: 190 F----ETSDEGKTFLYDVGFGCSHNNA-HFSDEQFTGVFGLGPATSSTHSLVEKVG-SKF 243
+ G +V GCS + F D GVFGLG S SL ++ SKF
Sbjct: 128 IVAVASKAVPGSQSFEEVAIGCSTSATLKFKDPSIKGVFGLG---RSATSLPRQLNFSKF 184
Query: 244 SYCIGNLNYFEYAYNMLILGEGAILEGDST-----------PMSVIDGSYYVTLEGISLG 292
SYC+ + + +L+ + G P S Y+V L+GIS+G
Sbjct: 185 SYCLSSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIG 244
Query: 293 EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPM 350
L P + T S +F+D+GT+ T L + + L E++ + + + P
Sbjct: 245 GTRL---PAV----STKSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPG 297
Query: 351 DPAWHLCYS--GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDIN 408
+CYS + P M HFA A++VL +S ++ ++S CLA+ S+I
Sbjct: 298 RNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWK-TTSKLCLAIDKSNIK 356
Query: 409 GERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
G +S++G QN ++ D +++L F R DC
Sbjct: 357 G----GISVLGNFQMQNTHMLLDTGNEKLSFVRADC 388
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 100/377 (26%), Positives = 164/377 (43%), Gaps = 51/377 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA--------TTFDPSKSLTYATL 149
+++ +G PP +DTGS L+WV C PC C A +D S + + +
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKV 94
Query: 150 PCDSSYCT----------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
PC CT ND ++C Y+ +Y +G + G + + ++ + T
Sbjct: 95 PCSDPSCTLITQISESGCND----QNQCGYSFQYGDGSGTLGYLVEDVLHYMV-NATATV 149
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNLNYFEYA 256
++ GF S + S+ G+ G G + S +S + K G F++C L+ E
Sbjct: 150 IFGCGFKQS-GDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHC---LDGGERG 205
Query: 257 YNMLILGEGAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+L+LG ++E D TP+ Y V L+ IS+ L IDP LF ND G
Sbjct: 206 GGILVLGN--VIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLF-SNDVMQ--GT 260
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAF 374
DSGTTL +L AYQ + V + L LC + + FP +
Sbjct: 261 IFDSGTTLAYLPDEAYQAFTQAVSLVVAPFL----------LCDTRLSRFIYKLFPNVVL 310
Query: 375 HFAGGADLVLDAESVFYQESSS---VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
+F G + + AE + Q S++ ++C+ S + E +I G + +N V YD
Sbjct: 311 YFEGASMTLTPAEYLIRQASAANAPIWCMG-WQSMGSAESELQYTIFGDLVLKNKLVVYD 369
Query: 432 LVSKQLYFQRIDCELLA 448
L ++ ++ DC+ L+
Sbjct: 370 LERGRIGWRPFDCKFLS 386
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 169/396 (42%), Gaps = 47/396 (11%)
Query: 74 LSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC 133
L+ S++ + R LH + + IG PP ++DTGS++ +V C CEQC
Sbjct: 87 LTGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC 146
Query: 134 GA---TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF 190
G F P S TY + C C +C G +C Y +Y S G +G + +F
Sbjct: 147 GRHQDPKFQPESSSTYQPVKCTID-C--NCDGDRMQCVYERQYAEMSTSSGVLGEDVISF 203
Query: 191 ETSDEGKTFLYDVGFGCSH-NNAHFSDEQFTGVFGLGPA-TSSTHSLVEK--VGSKFSYC 246
++ + FGC + + G+ GLG S LV+K + FS C
Sbjct: 204 --GNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLC 261
Query: 247 IGNLNYFEYAYNMLILGEGAILEGDSTPMSVI-------DGS--YYVTLEGISLGEKMLD 297
G ++ +G GA++ G +P S + D S Y + L+ + + K L
Sbjct: 262 YGGMD----------VGGGAMVLGGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLP 311
Query: 298 IDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-L 356
++ N+F G +DSGTT +L +A+ + + Q L DP ++ +
Sbjct: 312 LNANVFDGKH-----GTVLDSGTTYAYLPEAAFLAFKDAIVKELQSLKQISGPDPNYNDI 366
Query: 357 CYSG---NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGER 411
C+SG ++++ + FP + F G L E+ ++ S +CL + NG
Sbjct: 367 CFSGAGNDVSQLSKSFPVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQ---NGN- 422
Query: 412 FKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+++G I +N V YD ++ F + +C L
Sbjct: 423 -DQTTLLGGIIVRNTLVMYDREQTKIGFWKTNCAEL 457
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 167/378 (44%), Gaps = 49/378 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYATLP 150
++ +G PP +DTGS ++WV C PC C +++ F+P S T + +P
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 151 CDSSYCTND-------CGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFET---SDEGKTF 199
C CT C + C Y Y +G + G S+ F+T +++
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236
Query: 200 LYDVGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNLNYF 253
+ FGCS++ + +D G+FG G S S + +G FS+C L
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---LKGS 293
Query: 254 EYAYNMLILGEGAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ +L+LGE I+E TP+ Y + LE I + + L ID +LF ++T
Sbjct: 294 DNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT--- 348
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSY-PMDPAWHLCYSGNINRDLQGFP 370
G +DSGTTL +L AY V + + PS + + C+ + + D FP
Sbjct: 349 QGTIVDSGTTLAYLADGAYDPF---VNAITAAVSPSVRSLVSKGNQCFVTSSSVD-SSFP 404
Query: 371 AMAFHFAGGADLVLDAESVFYQESS----SVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
++ +F GG + + E+ Q++S ++C+ + + ++I+G + ++
Sbjct: 405 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIG-----WQRNQGQQITILGDLVLKDK 459
Query: 427 NVAYDLVSKQLYFQRIDC 444
YDL + ++ + DC
Sbjct: 460 IFVYDLANMRMGWTDYDC 477
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 152/363 (41%), Gaps = 50/363 (13%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCG 161
IG PP ++DTGS++ +V C C+QCG F P S +Y L C+ +D G
Sbjct: 82 IGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCNPDCNCDDEG 141
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN-AHFSDEQFT 220
C Y RY S G + + +F +E + FGC + ++
Sbjct: 142 KL---CVYERRYAEMSSSSGVLSEDLISF--GNESQLSPQRAVFGCENEETGDLFSQRAD 196
Query: 221 GVFGLGPAT-SSTHSLVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV 277
G+ GLG S LV+K + FS C G + +G GA++ G +P
Sbjct: 197 GIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME----------VGGGAMVLGKISPPPG 246
Query: 278 IDGS---------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+ S Y + L+ + + K L ++P +F G +DSGTT +
Sbjct: 247 MVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN-----GKHGTVLDSGTTYAYFPKE 301
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAW-HLCYSGNINRDLQG----FPAMAFHFAGGADLV 383
A+ ++ V L + DP + +C+SG RD+ FP +A F G L+
Sbjct: 302 AFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSG-AGRDVAEIHNFFPEIAMEFGNGQKLI 360
Query: 384 LDAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
L E+ ++ + +CL + P +++G I +N V YD + +L F +
Sbjct: 361 LSPENYLFRHTKVRGAYCLGIFPDR------DSTTLLGGIVVRNTLVTYDRENDKLGFLK 414
Query: 442 IDC 444
+C
Sbjct: 415 TNC 417
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 103/375 (27%), Positives = 158/375 (42%), Gaps = 38/375 (10%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSS 154
+ + +G PP A++DTGS L+W++C+PC QC + + +DPS S T+A C +S
Sbjct: 3 AYTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCSTS 62
Query: 155 YC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH- 209
C + C C Y +Y + +QG E +S + FGC
Sbjct: 63 SCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCGRL 122
Query: 210 NNAHFSDEQFTGVFGLGPATSS-THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
N+ F G+ GLG S + L + +KFSYC+ + + + LI G A
Sbjct: 123 NSGSFGGA--AGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSAST 180
Query: 269 EGDSTPMSVIDGS-----YYVTLEGISLGEKMLDI------------DPNLFKKNDTWSD 311
+ +I S Y+V LEGIS+G K L + L + +
Sbjct: 181 GSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVNS 240
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA-WHLCYSGNINRDLQGFP 370
G DSGTTLT L + Y ++ LP+ + + LCY + +++ + FP
Sbjct: 241 GGTIFDSGTTLTLLDDAVYSKVKSAFASSVS--LPTVDASSSGFDLCYDVSKSKNFK-FP 297
Query: 371 AMAFHFAGGADLVLDAES-VFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
A+ F G V + +V CLA + G L IIG + QQNY+V
Sbjct: 298 ALTLAFKGTKFSPPQKNYFVIVDTAETVACLA-----MGGSGSLGLGIIGNLMQQNYHVV 352
Query: 430 YDLVSKQLYFQRIDC 444
YD + + C
Sbjct: 353 YDRGTSTISMSPAQC 367
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 112/440 (25%), Positives = 183/440 (41%), Gaps = 60/440 (13%)
Query: 38 LVTKLLHRDSLLYNPN----DTVDAQAQRTLNMS------MARFIYLSQKSSQKAHDTRA 87
+L+ RDS PN + ++A A R+ N S + RF +S S A +
Sbjct: 37 FTAELIRRDS----PNSPFYNALEAAATRSTNASQHYDAQIGRFNLMSD--SYYASQSEL 90
Query: 88 HLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT--TFDPSKSLT 145
+ G + + S+G PP LA+ D L W+ C+ C+ C TF PS+S T
Sbjct: 91 NFSKG-----NYLIKISVGTPPAEILALADITGDLTWLPCKTCQDCTKDGFTFFPSESST 145
Query: 146 YATLPCDSSYC--TNDCGGYPDECWYNI----RYTNGPDSQGTIGSEQFNFETSDEGKTF 199
Y + C+S C TN C Y + + ++G + + +F +S G+
Sbjct: 146 YTSAACESYQCQITNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVAMDTISFHSS-SGQAL 204
Query: 200 LY-DVGFGCSH--NNAHFSDEQFTGVFGLGPAT-SSTHSLVEKVGSKFSYCI-----GNL 250
Y + F C +N H+ G+ GLG S T + + FS C+
Sbjct: 205 SYPNTNFICGTFIDNWHYIG---AGIVGLGRGLFSMTSQMKHLINGTFSQCLVPYSSKQS 261
Query: 251 NYFEYAYNMLILGEGAILEGDSTPMS--VIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
+ + ++ GEG + STP++ G+Y++ LE +S+G + N+
Sbjct: 262 SKINFGLKGVVSGEGVV----STPIADDGESGAYFLFLEAMSVGGNRV--------ANNF 309
Query: 309 WS--DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDL 366
+S + ++ID TT T L Y+ + EV +Y + LCY + D
Sbjct: 310 YSAPKSNIYIDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKLSLCYKSESDHDF 369
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
P + HF AD+ L + F + +V C A N + ++ G Q N+
Sbjct: 370 DA-PPITMHFT-NADVQLSPLNTFVRMDWNVVCFAFLDGTFNATKRITHAVYGSWQQMNF 427
Query: 427 NVAYDLVSKQLYFQRIDCEL 446
V YDL S + F++ DC L
Sbjct: 428 IVGYDLKSSTVSFKQADCTL 447
>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
Length = 415
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 157/366 (42%), Gaps = 60/366 (16%)
Query: 114 AVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTN--DCGGYPDE----- 166
AV+DTGS++ W + C S+S T + LPC S C CG E
Sbjct: 71 AVVDTGSNIFWTTEKEC----------SRSKTRSMLPCCSPKCEQRASCGCRRSELKAEA 120
Query: 167 -----CWYNIRYT-NGPDS-QGTIGSEQFNF----ETSDEGKTFLYDVGFGCSHNNA-HF 214
C Y I+Y N DS G + ++ + G +V GCS + F
Sbjct: 121 EKETKCTYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLKF 180
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDST 273
D GVFGLG S SL ++ SKFSYC+ + + +L+ + G
Sbjct: 181 KDPSIKGVFGLG---RSATSLPRQLNFSKFSYCLSSYQKPDLPSYLLLTAAPDMATGAVG 237
Query: 274 -----------PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
P S Y+V L+GIS+G L P + T S +F+D+GT+
Sbjct: 238 GAAAVATTALQPNSDYKTRYFVDLQGISIGGTRL---PAV----STKSGGNMFVDTGTSF 290
Query: 323 TWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCYS--GNINRDLQGFPAMAFHFAG 378
T L + + L E++ + + + P +CYS + P M HFA
Sbjct: 291 TRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICYSPPSTAADESSKLPDMVLHFAD 350
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLY 438
A++VL +S ++ ++S CLA+ S+I G +S++G QN ++ D +++L
Sbjct: 351 SANMVLPWDSYLWK-TTSKLCLAIDKSNIKG----GISVLGNFQMQNTHMLLDTGNEKLS 405
Query: 439 FQRIDC 444
F R DC
Sbjct: 406 FVRADC 411
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 153/376 (40%), Gaps = 48/376 (12%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFD---PSKSLTY 146
++Y ++G P VP L LDTGS L W+ C C C G F+ P+ S T
Sbjct: 129 LYYAEVTVGTPGVPYLVALDTGSDLFWLPCD-CVNCITGLNTTQGPVNFNIYSPNNSSTS 187
Query: 147 ATLPCDSSYCT--NDCGGYPDECWYNIRY-TNGPDSQGTIGSEQFNFETSD-EGKTFLYD 202
+ C SS C+ + C D C Y + Y ++ S G + + + T+D + K
Sbjct: 188 KEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNAR 247
Query: 203 VGFGCSHNN--AHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEYAY 257
+ GC + A S G+FGLG S S++ G + FS C G +
Sbjct: 248 ITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEF 307
Query: 258 NMLILGEGAILEGDSTPMSV--IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
G+ + TP ++ +Y V++ I +G + D+ D V
Sbjct: 308 -----GDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDL------------DVAVI 350
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
DSGT+ T+L AY + + + + D + CY + N+ +P M
Sbjct: 351 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLT 410
Query: 376 FAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
GG V++ V ES +FCLA+ SD ++IIG Y++ +D
Sbjct: 411 MKGGGHFVINHPIVLISTESKRLFCLAIARSD-------SINIIGQNFMTGYHIVFDREK 463
Query: 435 KQLYFQRIDCELLADD 450
L ++ +C D+
Sbjct: 464 MVLGWKESNCTGYEDE 479
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 151/355 (42%), Gaps = 46/355 (12%)
Query: 108 PPVPQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYCT----- 157
P V Q VLD+ S + WV+C PC C +DPS+S + A C S CT
Sbjct: 155 PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPY 214
Query: 158 -NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD 216
N C ++C Y +RY PD T G+ + T D G + FGCSH D
Sbjct: 215 ANGCAN--NQCQYLVRY---PDGSSTSGAYIADLLTLDAGNA-VSGFKFGCSHAEQGSFD 268
Query: 217 EQFTGVFGLGPATSSTHS-LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM 275
+ G+ LG S S + G+ FSYCI + + L + A TPM
Sbjct: 269 ARAAGIMALGGGPESLLSQTASRYGNAFSYCI-PATASDSGFFTLGVPRRASSRYVVTPM 327
Query: 276 SVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQT 332
+ Y V L I++G + L + P +F AG +DS T +T L P+AYQ
Sbjct: 328 VRFRQAATFYGVLLRTITVGGQRLGVAPAVFA-------AGSVLDSRTAITRLPPTAYQA 380
Query: 333 LRKEVED---LFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESV 389
LR +++ P +D + ++G +N L P ++ F A L LD +
Sbjct: 381 LRSAFRSSMTMYRSAPPKGYLDTCYD--FTGVVNIRL---PKISLVFDRNAVLPLDPSGI 435
Query: 390 FYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + CLA + +R ++G + QQ V YD+ + F++ C
Sbjct: 436 LFND-----CLAF--TSNADDRMP--GVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 104/400 (26%), Positives = 166/400 (41%), Gaps = 47/400 (11%)
Query: 70 RFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP 129
R +L + + R LH + T + IG PP ++DTGS++ +V C
Sbjct: 60 RLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSN 119
Query: 130 CEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSE 186
C QCG F P S TY + C++ C D G +C Y RY S G + +
Sbjct: 120 CVQCGNHQDPRFQPELSSTYQPVKCNAD-CNCDENGV--QCTYERRYAEMSTSSGVLAED 176
Query: 187 QFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGVFGLGPAT-SSTHSLVEK--VGSK 242
+F E + FGC + + ++ G+ GLG T S LV K V +
Sbjct: 177 VMSF--GKESELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNS 234
Query: 243 FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVI----DGS----YYVTLEGISLGEK 294
FS C G ++ + G +L G S+P ++ D S Y + L+ I + K
Sbjct: 235 FSLCYGGMD---------VGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGK 285
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
L ++P F G +DSGTT + AY + + L DP +
Sbjct: 286 PLKLNPRTFD-----GKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNF 340
Query: 355 H-LCYSGNINRDL----QGFPAMAFHFAGGADLVLDAESVFYQES--SSVFCLAVGPSDI 407
+C+SG RD+ + FP + FA G + L E+ ++ + S +CL +
Sbjct: 341 KDICFSG-AGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFK--- 396
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
NG +++G I +N V Y+ + + F + +C L
Sbjct: 397 NGN--DQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCSEL 434
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 153/376 (40%), Gaps = 48/376 (12%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFD---PSKSLTY 146
++Y ++G P VP L LDTGS L W+ C C C G F+ P+ S T
Sbjct: 106 LYYAEVTVGTPGVPYLVALDTGSDLFWLPCD-CVNCITGLNTTQGPVNFNIYSPNNSSTS 164
Query: 147 ATLPCDSSYCT--NDCGGYPDECWYNIRY-TNGPDSQGTIGSEQFNFETSD-EGKTFLYD 202
+ C SS C+ + C D C Y + Y ++ S G + + + T+D + K
Sbjct: 165 KEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNAR 224
Query: 203 VGFGCSHNN--AHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEYAY 257
+ GC + A S G+FGLG S S++ G + FS C G +
Sbjct: 225 ITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEF 284
Query: 258 NMLILGEGAILEGDSTPMSV--IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
G+ + TP ++ +Y V++ I +G + D+ D V
Sbjct: 285 -----GDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDL------------DVAVI 327
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
DSGT+ T+L AY + + + + D + CY + N+ +P M
Sbjct: 328 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLT 387
Query: 376 FAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
GG V++ V ES +FCLA+ SD ++IIG Y++ +D
Sbjct: 388 MKGGGHFVINHPIVLISTESKRLFCLAIARSD-------SINIIGQNFMTGYHIVFDREK 440
Query: 435 KQLYFQRIDCELLADD 450
L ++ +C D+
Sbjct: 441 MVLGWKESNCTGYEDE 456
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 167/393 (42%), Gaps = 40/393 (10%)
Query: 70 RFIYLSQKSSQKAHDTRAHLHPGISTVPV--FYVNFSIGQPPVPQLAVLDTGSSLIWVKC 127
R YLS + QK T + PG + + + V +G P VLDT + WV C
Sbjct: 69 RLKYLSTLADQKT--TAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPC 126
Query: 128 QPCEQCGATTFDPSKSLTYATLPCDSSYCTNDCG-GYP----DECWYNIRYTNGPDSQGT 182
C +TTF P+ S T +L C + C+ G P C +N Y G DS T
Sbjct: 127 SGCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSY--GGDSSLT 184
Query: 183 IGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK 242
Q +++ + FGC N G+ GLG SL+ + G+
Sbjct: 185 ATLVQDAITLAND---VIPGFTFGC-INAVSGGSIPPQGLLGLG---RGPISLISQAGAM 237
Query: 243 ----FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKM 295
FSYC+ + + ++ ++ + G +TP+ YYV L G+S+G
Sbjct: 238 YSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIK 297
Query: 296 LDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH 355
+ I P+ D + AG IDSGT +T V Y +R E G + S A+
Sbjct: 298 VPI-PSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSL---GAFD 353
Query: 356 LCYSGNINRDLQGFPAMAFHFAGGADLVLDAE-SVFYQESSSVFCL--AVGPSDINGERF 412
C++ + PA+ HF G +LVL E S+ + S S+ CL A P+++N
Sbjct: 354 TCFAATNEAEA---PAITLHFE-GLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSV-- 407
Query: 413 KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
L++I + QQN + +D + +L R C
Sbjct: 408 --LNVIANLQQQNLRIMFDTTNSRLGIARELCN 438
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 123/469 (26%), Positives = 185/469 (39%), Gaps = 65/469 (13%)
Query: 17 FTSTRIFTSTTAAPA--AGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYL 74
+ + R FT+ AAP+ + +R +LLHRD++ + + + AR YL
Sbjct: 35 YINPRNFTAA-AAPSVPSSTTRRPSLQLLHRDTVSGTKHPSRRHAVLALASRDTARVAYL 93
Query: 75 SQK-SSQKAHDTRAHLHPGISTVP----VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP 129
++ S + + + + G + V + V IG PP+ Q V DTGS +IWV+C P
Sbjct: 94 QRRLSPSPSPSSTSSVESGGTIVSHGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSP 153
Query: 130 CEQC---GATTFDPSKSLTYATLPCDSSYCTNDC-------GGYPDECWYNIRYTNGPDS 179
C C G FDP+ S +++ +PC+S C GG EC Y + Y + +
Sbjct: 154 CSDCYAQGDPLFDPANSASFSPVPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYT 213
Query: 180 QGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLG-PATSSTHSLVEK 238
G + E +G T + V GC H N E G+ GLG S L
Sbjct: 214 NGVLALETLTL----DGGTEVQGVAMGCGHENRGLFAEA-AGLLGLGWGPMSLVGQLGGA 268
Query: 239 VGSKFSYCIG-NLNYFEYAYNMLILGEGAILEGDSTPMSVI----------DGSYYVTLE 287
G FSYC+ + L+LG D+ P + YYV +
Sbjct: 269 AGGAFSYCLAGYYSGEGSGSGSLVLG-----REDAAPTGAVWVPLVRNPDAPSFYYVGVN 323
Query: 288 GISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS 347
G+ + + L + + GV +D+GT +T L AY LR F+ P
Sbjct: 324 GLGVAGERLQLQ-DGLFDLGDDGGGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPR 382
Query: 348 YPMDPAWHLCYSGNINRDLQGF-----PAMAFHFAG------GADLVLDAESVFYQ-ESS 395
P + CY DL G+ P +A +F G A L L A ++ +
Sbjct: 383 APGVSLFDTCY------DLSGYASVRVPTVALYFGGGGQGQEAASLTLPARNLLVPVDDG 436
Query: 396 SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+CLA SI+G I QQ + D S + F C
Sbjct: 437 GTYCLAFAAVA------SGPSILGNIQQQGIEITVDSASGYVGFGPATC 479
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/369 (27%), Positives = 148/369 (40%), Gaps = 64/369 (17%)
Query: 108 PPVPQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYC------ 156
P V Q V+DT S + WV+C PC QC A + +DP+KS+ A PC S C
Sbjct: 170 PGVAQSMVVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRY 229
Query: 157 TNDCGGYPD--ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH---NN 211
N C G + C Y + Y +G + GT S+ +G + FGCSH
Sbjct: 230 ANGCTGAGNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVSKFQ--FGCSHALLRP 287
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNLNYFEYAYNMLILGEGA-- 266
F+++ G LG S S + SK FSYC+ + ++ + A
Sbjct: 288 GSFNNKT-AGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASR 346
Query: 267 -----ILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
+L+ PM Y V L GI + + L + P +F N +DS T
Sbjct: 347 YAVTPMLKSKMAPM-----IYMVRLIGIDVAGQRLPVPPAVFAAN-------AAMDSRTI 394
Query: 322 LTWLVPSAYQTLR---KEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAG 378
+T L P+AY LR + ++ + P +D CY D G P +
Sbjct: 395 ITRLPPTAYMALRAAFRAQMRAYRAVAPKGQLD----TCY------DFTGVPMVRLP--- 441
Query: 379 GADLVLDAESVFYQESSSVF---CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
LV D + + S V CLA P N F IIG + QQ V Y++
Sbjct: 442 KVTLVFDRNAAVELDPSGVMLDSCLAFAP---NANDFMP-GIIGNVQQQTLEVLYNVDGA 497
Query: 436 QLYFQRIDC 444
+ F+R C
Sbjct: 498 SVGFRRAAC 506
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 104/400 (26%), Positives = 166/400 (41%), Gaps = 47/400 (11%)
Query: 70 RFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP 129
R +L + + R LH + T + IG PP ++DTGS++ +V C
Sbjct: 60 RLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSN 119
Query: 130 CEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSE 186
C QCG F P S TY + C++ C D G +C Y RY S G + +
Sbjct: 120 CVQCGNHQDPRFQPELSSTYQPVKCNAD-CNCDENGV--QCTYERRYAEMSTSSGVLAED 176
Query: 187 QFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGVFGLGPAT-SSTHSLVEK--VGSK 242
+F E + FGC + + ++ G+ GLG T S LV K V +
Sbjct: 177 VMSF--GKESELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNS 234
Query: 243 FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVI----DGS----YYVTLEGISLGEK 294
FS C G ++ + G +L G S+P ++ D S Y + L+ I + K
Sbjct: 235 FSLCYGGMD---------VGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGK 285
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
L ++P F G +DSGTT + AY + + L DP +
Sbjct: 286 PLKLNPRTFD-----GKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNF 340
Query: 355 H-LCYSGNINRDL----QGFPAMAFHFAGGADLVLDAESVFYQES--SSVFCLAVGPSDI 407
+C+SG RD+ + FP + FA G + L E+ ++ + S +CL +
Sbjct: 341 KDICFSG-AGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFK--- 396
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
NG +++G I +N V Y+ + + F + +C L
Sbjct: 397 NGN--DQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCSEL 434
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 163/386 (42%), Gaps = 71/386 (18%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-------FDPSKSLTYATLPC 151
F+++ S+G PPV L +DTGS+L WV CQ C+ TT FDP KS TY + C
Sbjct: 75 FFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFDPDKSTTYELVGC 134
Query: 152 DSSYCTN---------DCGGYPDECWYNIRYTNGPDSQ---GTIGSEQFNFETSDEGKTF 199
S C + C D C Y++RY +GP Q G +G+++ +S +
Sbjct: 135 SSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTDKLTLASS---SSI 191
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS--KFSYC----------- 246
+ FGCS +++ E +GV G G A S + V + + FSYC
Sbjct: 192 IDGFIFGCSGDDSFKGYE--SGVIGFGGANFSFFNQVARQTNYRAFSYCFPGDHTAEGFL 249
Query: 247 -IGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKK 305
IG E Y LI G D S Y SL + + +D N +
Sbjct: 250 SIGAYPKDELVYTNLIPHFG-------------DRSVY------SLQQIDMMVDGNRLQV 290
Query: 306 NDT-WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY--SG 360
+ + ++ + +DSGT T+L+ + K + Q G L C+ +G
Sbjct: 291 DQSEYTKRMMVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFLSD---TVGTETCFRPNG 347
Query: 361 NINRDLQGFPAMAFHFAGGADLVLDAESVFYQ--ESSSVFCLAVGPSDINGERFKDLSII 418
+ D P + F G L L E+VF+ S CLA P D+ G R ++ I+
Sbjct: 348 GDSVDSGDLPTVEMRFI-GTTLKLPPENVFHDLLPSHDKICLAFKP-DVAGVR--NVQIL 403
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDC 444
G A ++ V YDL + FQ C
Sbjct: 404 GNKATXSFRVVYDLQAMYFGFQAGAC 429
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 157/400 (39%), Gaps = 67/400 (16%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSS 154
TVPV ++G PP VLDTGS L W+ C FD S S +YA +PC S
Sbjct: 64 TVPV-----AVGTPPQNVTMVLDTGSELSWLLCNGSRH--DAPFDASASSSYAPVPCSSP 116
Query: 155 YCT---NDCGGYP----DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
CT D P C ++ Y + + G + ++ F +S FGC
Sbjct: 117 ACTWLGRDLPVRPFCDSSACRVSLSYADASSADGLLAADTFLLGSSP------MPALFGC 170
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK-FSYCIGNLNYFEYAYNMLILGEG- 265
+ + +D T GL S V + ++ F+YCI +L+LG
Sbjct: 171 ITSYSSSTDPSETPPTGLLGMNRGGLSFVTQTATRRFAYCIAA----GQGPGILLLGGND 226
Query: 266 ---------------AILEGDSTPMSVID-GSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
L S P+ D +Y V LEGI +G +L I +L + T
Sbjct: 227 TETPLTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPDHTG 286
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKE-----VEDLFQGLL----PSYPMDPAWHLCYSG 360
+ +DSGT T+L+P AY L+ E L GL P + A+ C+ G
Sbjct: 287 AGQ-TMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACFRG 345
Query: 361 NINRDLQG-----FPAMAFHFAGGADLVLDAESVFYQ-------ESSSVFCLAVGPSDIN 408
R P + G +V AE + Y+ E V+CL G SD+
Sbjct: 346 TEARVSAAAAGGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCLTFGSSDMA 405
Query: 409 GERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLA 448
G +IG QQ+ V YDL + +L F C LA
Sbjct: 406 G---VSAYVIGHHHQQDVWVEYDLRNARLGFAAARCADLA 442
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/398 (25%), Positives = 167/398 (41%), Gaps = 51/398 (12%)
Query: 74 LSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC 133
L S++ + R LH + + IG PP ++DTGS++ +V C CEQC
Sbjct: 56 LHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC 115
Query: 134 GA---TTFDPSKSLTYATLPCDSSYCTNDCGGYPD--ECWYNIRYTNGPDSQGTIGSEQF 188
G F P S TY + CT DC D +C Y +Y S G +G +
Sbjct: 116 GRHQDPKFQPDLSSTYQPVK-----CTLDCNCDNDRMQCVYERQYAEMSTSSGVLGEDVV 170
Query: 189 NFETSDEGKTFLYDVGFGCSH-NNAHFSDEQFTGVFGLGPA-TSSTHSLVEK--VGSKFS 244
+F ++ + FGC + + G+ GLG S LV+K V FS
Sbjct: 171 SF--GNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFS 228
Query: 245 YCIGNLNYFEYAYNMLILGEGAILEGDSTPMS---------VIDGSYYVTLEGISLGEKM 295
C G ++ +G GA++ G +P S V Y + L+ I + K
Sbjct: 229 LCYGGMD----------VGGGAMVLGGISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKR 278
Query: 296 LDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH 355
L ++P++F G +DSGTT +L A+ ++ + Q DP ++
Sbjct: 279 LPLNPSVFD-----GKHGSVLDSGTTYAYLPEEAFLAFKEAIVKELQSFSQISGPDPNYN 333
Query: 356 -LCYSG---NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDING 409
LC+SG ++++ + FP + F G L E+ ++ S +CL + NG
Sbjct: 334 DLCFSGAGIDVSQLSKTFPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQ---NG 390
Query: 410 ERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+ +++G I +N V YD ++ F + +C L
Sbjct: 391 K--DPTTLLGGIVVRNTLVLYDREQTKIGFWKTNCAEL 426
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 153/375 (40%), Gaps = 41/375 (10%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-------GATTFDPSKSLTYA 147
++ +++ +G P +DTGS ++WV C C +C T +D S T
Sbjct: 81 SIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAK 140
Query: 148 TLPCDSSYCT-----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQF-------NFETSDE 195
++ C ++C+ ++C C Y I Y +G + G + + N +T
Sbjct: 141 SVSCSDNFCSYVNQRSECHS-GSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGST 199
Query: 196 GKTFLYDVGFGC-SHNNAHFSDEQ--FTGVFGLGPATSSTHSLVE---KVGSKFSYCIGN 249
T + FGC S + + Q G+ G G + SS S + KV F++C+ N
Sbjct: 200 NGTII----FGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDN 255
Query: 250 LNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
N + +GE + +TPM Y V L I +G +L + + F D
Sbjct: 256 NN----GGGIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGD-- 309
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
D GV IDSGTTL +L + Y L ++ Q L D Y ++R F
Sbjct: 310 -DKGVIIDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSFTCFHYIDRLDR----F 364
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P + F F L + + +Q +C + + L+I+G +A N V
Sbjct: 365 PTVTFQFDKSVSLAVYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVV 424
Query: 430 YDLVSKQLYFQRIDC 444
YD+ ++ + + +C
Sbjct: 425 YDIENQVIGWTNHNC 439
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 162/371 (43%), Gaps = 45/371 (12%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA--------TTFDPSKSLTYATL 149
+++ +G PP +DTGS L+WV C PC C A +D S + + +
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKV 94
Query: 150 PCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
PC CT + C ++C Y+ +Y +G + G + + ++ + T ++
Sbjct: 95 PCSDPSCTLITQISESGCND-QNQCGYSFQYGDGSGTLGYLVEDVLHYMV-NATATVIFG 152
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNLNYFEYAYNM 259
GF S + S+ G+ G G + S +S + K G F++C L+ E +
Sbjct: 153 CGFKQS-GDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHC---LDGGERGGGI 208
Query: 260 LILGEGAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
L+LG ++E D TP+ Y V L+ IS+ L IDP LF ND G D
Sbjct: 209 LVLGN--VIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLF-SNDVMQ--GTIFD 263
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SGTTL +L AYQ + V + L LC + + FP + +F
Sbjct: 264 SGTTLAYLPDEAYQAFTQAVSLVVAPFL----------LCDTRLSRFIYKLFPNVVLYFE 313
Query: 378 GGADLVLDAESVFYQESSS---VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
G + + AE + Q S++ ++C+ S + E +I G + +N V YDL
Sbjct: 314 GASMTLTPAEYLIRQASAANAPIWCMG-WQSMGSAESELQYTIFGDLVLKNKLVVYDLER 372
Query: 435 KQLYFQRIDCE 445
++ ++ DC+
Sbjct: 373 GRIGWRPFDCK 383
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 155/361 (42%), Gaps = 50/361 (13%)
Query: 116 LDTGSSLIWVKCQPCEQCGATTF---DP----SKSLTYATLPCDS-SYCT-NDCGGYPDE 166
+DTG+ L W++C+ C+ G F DP S+S +Y + C+ S+C N C
Sbjct: 105 IDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQSKSYKPVSCNQHSFCEPNQCK--EGL 162
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAH------FSDEQFT 220
C YN+ Y G + G + +E F F ++ T L + FGCS ++ + +
Sbjct: 163 CAYNVTYGPGSYTSGNLANETFTFYSNHGKHTALKSISFGCSTDSRNMIYAFLLDKNPVS 222
Query: 221 GVFGLGPATSSTHSLVEKVGS----KFSYCI-GNLNYFEYAYNMLILGEGAILEGDSTPM 275
GV G+G S + ++GS KFSYCI N + Y L G+ + +
Sbjct: 223 GVLGMG---WGPRSFLAQLGSISHGKFSYCITANNTHNTY----LRFGKHVVKSKNLQTT 275
Query: 276 SVID----GSYYVTLEGISLGEKMLDI-DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAY 330
++ +Y+V L GIS+ L+I +L + D G ID+GT T LV +
Sbjct: 276 KIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVRKD--GSRGCIIDAGTLATLLVKPIF 333
Query: 331 QTLRKEVEDLF---QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAE 387
TL + + Q L LCY + + P + FH ADL + E
Sbjct: 334 DTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKNLPVVTFHLE-NADLEVKPE 392
Query: 388 SVFY---QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
++F E +VFCL++ D +IIG Q YD ++ L F DC
Sbjct: 393 AIFLFREFEGKNVFCLSMLSDD-------SKTIIGAYQQMKQKFVYDTKARVLSFGPEDC 445
Query: 445 E 445
E
Sbjct: 446 E 446
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 138/348 (39%), Gaps = 88/348 (25%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC---EQCGA---TTFDPSKSLTYATLPCD 152
+ ++ +G P V Q V+DTGS + WV+C+PC C A FDP+ S TYA C
Sbjct: 106 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 165
Query: 153 SSYC--------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
++ C N C C Y ++Y +G ++ GT F+
Sbjct: 166 AAACAQLGDSGEANGCDAK-SRCQYIVKYGDGSNTTGT------GFQ------------- 205
Query: 205 FGCSHNN-AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGCSH D++ G+ GLG SLV + ++
Sbjct: 206 FGCSHAELGAGMDDKTDGLIGLG---GDAQSLVSQTAARSKK------------------ 244
Query: 264 EGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
+ Y+ LE I++G K L + P++F AG +DSGT +T
Sbjct: 245 --------------VPTYYFAALEDIAVGGKKLGLSPSVFA-------AGSLVDSGTVIT 283
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLV 383
L P+AY L + P+ C++ D P +A FAGGA +
Sbjct: 284 RLPPAAYAALSSAFRAGMTRYARAEPLG-ILDTCFN-FTGLDKVSIPTVALVFAGGAVVD 341
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
LDA + S CLA P+ + K IG + Q+ + V YD
Sbjct: 342 LDAHGIV-----SGGCLAFAPTRDD----KAFGTIGNVQQRTFEVLYD 380
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/302 (28%), Positives = 137/302 (45%), Gaps = 45/302 (14%)
Query: 160 CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQF 219
CG C Y I Y +G ++G +G E+ F G + D FGC NN F
Sbjct: 126 CGSAAPICNYAINYGDGSFTRGELGHEKLKF-----GTILVKDFIFGCGRNNKGL----F 176
Query: 220 TGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM 275
GV GL S SL+ + G FSYC+ + LILG + + +S+P+
Sbjct: 177 GGVSGLMGLGRSDLSLISQTSGIFGGVFSYCLPSTE--RKGSGSLILGGNSSVYRNSSPI 234
Query: 276 S---VIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
S +I+ Y++ L GIS+G L + + + + +DSGT +T L P
Sbjct: 235 SYAKMIENPQLYNFYFINLTGISIGGVAL--------QAPSVGPSRILVDSGTVITRLPP 286
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVL 384
+ Y+ L+ E F G +P PA+ + C++ + +++ P + HF G A+L +
Sbjct: 287 TIYKALKAEFLKQFTG----FPPAPAFSILDTCFNLSAYQEVD-IPTIKMHFEGNAELTV 341
Query: 385 DAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
D VFY + +S CLA+ + E ++I+G Q+N V YD ++ F
Sbjct: 342 DVTGVFYFVKSDASQVCLALASLEYQDE----VAILGNYQQKNLRVIYDTKETKVGFALE 397
Query: 443 DC 444
C
Sbjct: 398 TC 399
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/403 (23%), Positives = 163/403 (40%), Gaps = 47/403 (11%)
Query: 78 SSQKAHDTRAHLH--PGIS----------TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ KAHD R L G+ V ++Y IG PP +DTGS ++WV
Sbjct: 52 SALKAHDYRRQLSLLAGVDLPLGGSGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWV 111
Query: 126 KCQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIR----- 172
C C++C T +D +S + +PCD +C GG C NI
Sbjct: 112 NCIQCKECPTRSNLGMDLTLYDIKESSSGKFVPCDQEFCKEINGGLLTGCTANISCPYLE 171
Query: 173 -YTNGPDSQGTIGSEQF-------NFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFG 224
Y +G + G + + +T + ++ G S + + ++E G+ G
Sbjct: 172 IYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILG 231
Query: 225 LGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G A SS S + KV F++C+ +N + +G + + TP+
Sbjct: 232 FGKANSSMISQLASSGKVKKMFAHCLNGVN----GGGIFAIGHVVQPKVNMTPLLPDQPH 287
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V + + +G L + + + D G IDSGTTL +L Y+ L ++
Sbjct: 288 YSVNMTAVQVGHAFLSLSTDTSTQGDR---KGTIIDSGTTLAYLPEGIYEPLVYKIISQH 344
Query: 342 QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLA 401
L D YS +++ GFPA+ F+F G L + + S +C+
Sbjct: 345 PDLKVRTLHDEYTCFQYSESVD---DGFPAVTFYFENGLSLKVYPHDYLF-PSGDFWCIG 400
Query: 402 VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S K+++++G + N V YDL ++ + + +C
Sbjct: 401 WQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNC 443
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 163/367 (44%), Gaps = 52/367 (14%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYAT-LPCDSSYCTNDCGG- 162
+G PP P L+ G+ LIW P +C F + LT++ LP S CG
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFAS------CGSP 54
Query: 163 --YPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS-HNNAHFSDEQ 218
+P++ C Y Y + + G + ++F F + + V FGC NN F +
Sbjct: 55 KFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---VPGVAFGCGLFNNGVFKSNE 111
Query: 219 FTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLN-------YFEYAYNMLILGEGAILEGD 271
TG+ G G S S + KVG+ FS+C + + ++ G+GA+
Sbjct: 112 -TGIAGFGRGPLSLPSQL-KVGN-FSHCFTTITGAIPSTVLLDLPADLFSNGQGAV---Q 165
Query: 272 STPMSVIDGS------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
+TP+ + YY++L+GI++G L + + F T G IDSGT++T L
Sbjct: 166 TTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL--TNGTGGTIIDSGTSITSL 223
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFAGGADLVL 384
P YQ +R E + LP P + H C+S ++ P + HF GA + L
Sbjct: 224 PPQVYQVVRDEFAAQIK--LPVVPGNATGHYTCFSAP-SQAKPDVPKLVLHFE-GATMDL 279
Query: 385 DAESVFYQ----ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
E+ ++ +S+ CLA+ D + +IIG QQN +V YDL + L F
Sbjct: 280 PRENYVFEVPDDAGNSIICLAINKGD-------ETTIIGNFQQQNMHVLYDLQNNMLSFV 332
Query: 441 RIDCELL 447
C+ L
Sbjct: 333 AAQCDKL 339
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/409 (24%), Positives = 162/409 (39%), Gaps = 71/409 (17%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-------------FDPSKSLT 145
+ ++ IG PP P AV+DTGS L+W +C C A ++ S S T
Sbjct: 78 YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137
Query: 146 YATLPCD------------SSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETS 193
+PCD ++ C G D C Y G + G +G++ F F +S
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSS 196
Query: 194 DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNY 252
+ FGC + S G G+ SLV ++ ++FSYC+
Sbjct: 197 SS-----VTLAFGCV-SQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTPYFR 250
Query: 253 FEYAYNMLILGEG---------AILEGDSTPMSVIDGS-----------YYVTLEGISLG 292
+ + L +G+G G P++ + + YY+ L G++ G
Sbjct: 251 DTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAG 310
Query: 293 EKMLDIDPNLFKKND----TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG----L 344
+ + F + W+ G IDSG+ T LV A++ L KE+ +G +
Sbjct: 311 NATVALPAGAFDLREAAPKVWA-GGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLV 369
Query: 345 LPSYPMDPAWHLCYSGNINRD---LQGFPAMAFHF----AGGADLVLDAESVFYQESSSV 397
P + A LC + D P + F GG +LV+ AE + + +S
Sbjct: 370 PPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEAST 429
Query: 398 FCLAVGPSDINGERF--KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+C+AV S + +IIG QQ+ V YDL + L FQ +C
Sbjct: 430 WCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 99/403 (24%), Positives = 163/403 (40%), Gaps = 47/403 (11%)
Query: 78 SSQKAHDTRAHLH--PGIS----------TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ KAHD L GI V ++Y IG P +DTGS ++WV
Sbjct: 54 STLKAHDISRQLRFLAGIDIPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWV 113
Query: 126 KCQPCEQCGATT--------FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIR----- 172
C C +C T+ +D +S T + CD +C GG C N+
Sbjct: 114 NCIQCRECPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQ 173
Query: 173 -YTNGPDSQGTIGSE--QFNFETSD-EGKTFLYDVGFGC----SHNNAHFSDEQFTGVFG 224
Y +G + G + Q+N + D E + FGC S + +E G+ G
Sbjct: 174 IYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILG 233
Query: 225 LGPATSSTHSLV---EKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G + SS S + KV F++C+ N + +G + + TP+
Sbjct: 234 FGKSNSSIISQLASTRKVKKMFAHCLDGTN----GGGIFAMGHVVQPKVNMTPLVPNQPH 289
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V + G+ +G +L+I ++F+ D G IDSGTTL +L Y+ L ++
Sbjct: 290 YNVNMTGVQVGHIILNISADVFEAGDR---KGTIIDSGTTLAYLPELIYEPLVAKILSQQ 346
Query: 342 QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLA 401
L YS ++ GFP + FHF L + +Q +++C+
Sbjct: 347 HNLEVQTIHGEYKCFQYSERVD---DGFPPVIFHFENSLLLKVYPHEYLFQ-YENLWCIG 402
Query: 402 VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S + K++++ G + N V YDL ++ + + +C
Sbjct: 403 WQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC 445
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/364 (25%), Positives = 146/364 (40%), Gaps = 38/364 (10%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYC 156
P + V IG PP L +DT + W+ C C+ C +T F P KS T+ + C S C
Sbjct: 96 PTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTSTLFAPEKSTTFKNVSCGSPQC 155
Query: 157 TN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA 212
CG C +N+ Y + + + +T + D FGC
Sbjct: 156 NQVPNPSCGT--SACTFNLTYGSSSIAANVVQ------DTVTLATDPIPDYTFGCVAKTT 207
Query: 213 HFS---DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILE 269
S G S T +L + S FSYC+ + ++ ++ + +
Sbjct: 208 GASAPPQGLLGLGRGPLSLLSQTQNLYQ---STFSYCLPSFKSLNFSGSLRLGPVAQPIR 264
Query: 270 GDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
TP+ YYV L I +G K++DI P N + AG DSGT T LV
Sbjct: 265 IKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFN-AATGAGTVFDSGTVFTRLV 323
Query: 327 PSAYQTLRKEVEDLF----QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
AY +R E + + L + + CY+ I P + F F+G
Sbjct: 324 APAYTAVRDEFQRRVAIAAKANLTVTSLG-GFDTCYTVPIVA-----PTITFMFSGMNVT 377
Query: 383 VLDAESVFYQESSSVFCLAVG--PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ + + + + S CLA+ P ++N L++I + QQN+ V YD+ + +L
Sbjct: 378 LPEDNILIHSTAGSTTCLAMASAPDNVNSV----LNVIANMQQQNHRVLYDVPNSRLGVA 433
Query: 441 RIDC 444
R C
Sbjct: 434 RELC 437
>gi|297740344|emb|CBI30526.3| unnamed protein product [Vitis vinifera]
Length = 379
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 161/378 (42%), Gaps = 83/378 (21%)
Query: 86 RAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLT 145
+ H H +S V+ ++G PP VLDTGS L W++C Q TTFDP++S +
Sbjct: 59 KLHFHHNVS----LTVSLTVGTPPQNVSMVLDTGSELSWLRCNK-TQTFQTTFDPNRSSS 113
Query: 146 YATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGT----IGSEQFNFETSDEGKTFLY 201
Y+ +PC S CT+ DS+ T + +F + + F Y
Sbjct: 114 YSPVPCSSLTCTDQ------------------DSKNTGLMGMNRGSLSFVSQMDFPKFSY 155
Query: 202 DVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
+ SD F+GV LG A FS+ + LNY
Sbjct: 156 CI-----------SDSDFSGVLLLGDA-------------NFSWLM-PLNY--------- 181
Query: 262 LGEGAILEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
L STP+ D +Y V LEGI + K+L + ++F + T + +DSGT
Sbjct: 182 ----TPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGA-GQTMVDSGT 236
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINR-DLQGFPAMAF 374
T+L+ Y LR E + +L P+Y LCY +++ L P ++
Sbjct: 237 QFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSL 296
Query: 375 HFAGGADLVLDAESVFYQE------SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
F GA++ + + + Y+ S SV+C G SD+ + +IG QQN +
Sbjct: 297 MFR-GAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLA---VEAYVIGHHHQQNVWM 352
Query: 429 AYDLVSKQLYFQRIDCEL 446
+DL ++ F ++ C+L
Sbjct: 353 EFDLEKSRIGFAQVQCDL 370
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 150/371 (40%), Gaps = 53/371 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QCGA---TTFDPSKSLTYATLPCDSS 154
F V G P + DTGS + W++C PC C FDP+KS TY+ +PC
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPCGHP 179
Query: 155 YCTNDCGGYPDE--CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN- 211
C G C Y ++Y +G + G + E + ++ F FGC N
Sbjct: 180 QCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLTSARALPGF----AFGCGETNL 235
Query: 212 AHFSDEQFTGVFGLGPATSS-THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEG 270
F D G+ GLG S + G+ FSYC+ + N G + G
Sbjct: 236 GDFGDVD--GLIGLGRGQLSLSSQAAASFGAAFSYCLPSYN----------TSHGYLTIG 283
Query: 271 DSTPMSVIDGS--------------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
+TP S DG Y+V L I +G +L + P LF ++ G +
Sbjct: 284 TTTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRD------GTLL 337
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHF 376
DSGT LT+L P AY LR + P+ DP + CY ++ P ++F F
Sbjct: 338 DSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDP-FDTCYD-FAGQNAIFMPLVSFKF 395
Query: 377 AGGADLVLDAESVFY---QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
+ G+ L V + + CLA P +I+G Q+N + YD+
Sbjct: 396 SDGSSFDLSPFGVLIFPDDTAPATGCLAFVPRPST----MPFTIVGNTQQRNTEMIYDVA 451
Query: 434 SKQLYFQRIDC 444
++++ F C
Sbjct: 452 AEKIGFVSGSC 462
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 123/443 (27%), Positives = 190/443 (42%), Gaps = 52/443 (11%)
Query: 23 FTSTTAAPAAGKPKRLVTK--LLH---RDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQK 77
F+S+ K RL K LLH +S Y PN T+ Q ++ S AR S +
Sbjct: 19 FSSSLPDHVLLKDNRLGFKVPLLHWLSTESPFYEPNLTLAELTQASIRTSGARGD--SIR 76
Query: 78 SSQKAHDTRAHLHPGIS----TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVK-----CQ 128
S + T + +P IS T + + FSIG P V A+ D+GSSL+W++ C+
Sbjct: 77 SIMSGNITSSMKYP-ISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCR 135
Query: 129 PCEQCGATTFDPSKSLTYATLPCDSSYCTNDCGGYPDECW----------YNIRYTNGPD 178
C + F+PSKS+TY C+++ C G DE W Y+ Y +
Sbjct: 136 NCYRQKIPLFNPSKSVTYMKRLCNTAECRVALG---DEYWRCKKPNQICKYHEDYLDDSY 192
Query: 179 SQGTIGSEQFNFETSDEG-KTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
++G I ++ F F G + + FGC +NN SD Q GL T++ SLV
Sbjct: 193 TEGVISTDIFTFPEHISGFGNYTLRIIFGCGYNN---SDPQHFYPPGLVGLTNNKASLVG 249
Query: 238 KVG-SKFSYCIGNLNYFEYAYNMLI-LGEGAILEGDSTPMSVIDGSYYV--TLEGISLGE 293
++ +FSYC+ +M I G A + G ST + +Y+ ++GI + E
Sbjct: 250 QMDVDQFSYCVSIDTEQNLKGSMEIRFGLAASISGHSTQLVPNSDGWYIFKNVDGIYVNE 309
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA 353
++ P K G+ +D+GTT T L S L K +E+ + +
Sbjct: 310 FEVEGYPAWVFKYTEGGQGGLTMDTGTTYTELHNSVMDPLIKLLEEHITIVPEKDYSNSG 369
Query: 354 WHLCYSGNINRDLQG--FPAMAFHFAGGAD--LVLDAESVFYQESSSVFCLAVGPSDING 409
+ LCY + D G P + F D + + + S CLA+
Sbjct: 370 FELCY---FSDDFLGATLPDIELRFTDNKDTYFSFNTRNAWTPNGRSQMCLAM------- 419
Query: 410 ERFKDLSIIGMIAQQNYNVAYDL 432
R +SIIGM ++ + YDL
Sbjct: 420 FRTNGMSIIGMHQLRDIKIGYDL 442
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 119/466 (25%), Positives = 182/466 (39%), Gaps = 84/466 (18%)
Query: 34 KPKRLVTKLLHRDSLLYN--PNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDT------ 85
P L +LLHRDS N P + + QR + A +I + + A+DT
Sbjct: 57 SPSALHVRLLHRDSFAVNATPAQLLARRLQR--DELRAAWIIKAAAPAAAANDTPVVGLS 114
Query: 86 --RAHLHPGISTVPV----FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GAT 136
A + P +S P + ++G P V L +DTGS + W++CQPC +C
Sbjct: 115 SGGAFVAPVVSRAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGP 174
Query: 137 TFDPSKSLTYATLPCDSSYCT---NDCGGYPDE--CWYNIRYTNGPDSQGTIG---SEQF 188
FDP S +Y + D+ C GG C Y + Y G D T+G E
Sbjct: 175 VFDPRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGY--GDDGSTTVGDFIEETL 232
Query: 189 NFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSY 245
F G + + GC H+N G+ GLG S S + +G + FSY
Sbjct: 233 TF----AGGVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSY 288
Query: 246 CIGNL---NYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLG---------- 292
C+ + + + L +G+GA G P S+ T++ +++
Sbjct: 289 CLADFFLSSPGRSVSSTLTIGDGAA-AGSPPP------SFTPTVQNLNMATFYYVRLVGV 341
Query: 293 -----------EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
E L +DP + GV +DSGT +T L AY R
Sbjct: 342 SVGGVRVPGVTEDDLKLDPYTGR-------GGVILDSGTAVTRLARRAYIAFRDAFRAAA 394
Query: 342 QGLLPSYPMDPA--WHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ-ESSSVF 398
L P+ + CY+ + P ++ HFAGG +L L ++ +S
Sbjct: 395 VDLGQVSIGGPSGFFDTCYT--MGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTV 452
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
C A G + +SIIG I QQ + V Y++ ++ F C
Sbjct: 453 CFA-----FAGTGDRSVSIIGNIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 91/363 (25%), Positives = 152/363 (41%), Gaps = 50/363 (13%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCG 161
IG PP ++DTGS++ +V C C+QCG F P S +Y L C+ +D G
Sbjct: 86 IGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCNPDCNCDDEG 145
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-NNAHFSDEQFT 220
C Y RY S G + + +F +E + FGC + ++
Sbjct: 146 KL---CVYERRYAEMSSSSGVLSEDLISF--GNESQLTPQRAVFGCENVETGDLFSQRAD 200
Query: 221 GVFGLGPAT-SSTHSLVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV 277
G+ GLG S LV+K + FS C G + +G GA++ G +P +
Sbjct: 201 GIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME----------VGGGAMVLGKISPPAG 250
Query: 278 IDGS---------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+ S Y + L+ + + K L ++P +F G +DSGTT +
Sbjct: 251 MVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN-----GKHGTVLDSGTTYAYFPKE 305
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAW-HLCYSGNINRDLQG----FPAMAFHFAGGADLV 383
A+ ++ + L + DP + +C+SG RD+ FP + F G L+
Sbjct: 306 AFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSG-AGRDVAEIHNFFPEIDMEFGNGQKLI 364
Query: 384 LDAESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQR 441
L E+ ++ + +CL + P +++G I +N V YD + +L F +
Sbjct: 365 LSPENYLFRHTKVRGAYCLGIFPDR------DSTTLLGGIVVRNTLVTYDRENDKLGFLK 418
Query: 442 IDC 444
+C
Sbjct: 419 TNC 421
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 100/410 (24%), Positives = 165/410 (40%), Gaps = 55/410 (13%)
Query: 70 RFIYLSQKSSQKAHDTRAHLHPGISTVPV----------FYVNFSIGQPPVPQLAVLDTG 119
R+ + + A+ + + P ++ +P+ +YV +G P ++DTG
Sbjct: 64 RYFHSRLAKNSDANASFKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYYTMIVDTG 123
Query: 120 SSLIWVKCQPCE-QCGAT---TFDPSKSLTYATLPC---------DSSYCTNDCGGYPDE 166
SS W++CQPC C F+PS S TY T+PC ++ C +
Sbjct: 124 SSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNA 183
Query: 167 CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLG 226
C Y Y + S G + + S +F+Y GC +N G+ GL
Sbjct: 184 CVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVY----GCGQDNQGLFGRT-DGIIGLA 238
Query: 227 PATSSTHS-LVEKVGSKFSYCI----GNLNYFEYAYNMLILGEGAILEGDS---TPMSVI 278
S S L K G+ FSYC+ N + + L +G ++ S TP+
Sbjct: 239 NNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGF--LSIGTSSLTPSSSYKFTPLLKN 296
Query: 279 DGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRK 335
+ Y++ LE I++ + L + + +K IDSGT +T L Y TL+
Sbjct: 297 PNNPSLYFIDLESITVAGRPLGVAASSYK-------VPTIIDSGTVITRLPTPVYTTLKN 349
Query: 336 EVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS 395
+ P C+ G++ + P + F GGADL L + + +
Sbjct: 350 AYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGGADLQLKGHNSLVELET 409
Query: 396 SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ CLA+ S ++IIG QQ VAYD+ + ++ F C+
Sbjct: 410 GITCLAMAGSS-------SIAIIGNYQQQTVKVAYDVGNSRVGFAPGGCQ 452
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 91/390 (23%), Positives = 153/390 (39%), Gaps = 32/390 (8%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPG--ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVK 126
+R +YL +++ A + G + P + V +G PP L +DT + W+
Sbjct: 78 SRLLYLDSLAARGKARAYAPIASGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIP 137
Query: 127 CQPCEQC---GATTFDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDS 179
C C C A FDP+ S +Y ++PC S C C C +++ Y +
Sbjct: 138 CAGCAGCPTSSAPPFDPAASTSYRSVPCGSPLCAQAPNAACPPGGKACGFSLTYADS-SL 196
Query: 180 QGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV 239
Q + + D KT+ FGC + + S +
Sbjct: 197 QAALSQDSLAVA-GDAVKTYT----FGCLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMY 251
Query: 240 GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV---IDGSYYVTLEGISLGEKML 296
FSYC+ + ++ + + G +TP+ YYV + GI +G K++
Sbjct: 252 QGTFSYCLPSFKSLNFSGTLRLGRNGQPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVV 311
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL 356
I P D + AG +DSGT T LV AY +R EV + S +
Sbjct: 312 PIPPPAL-AFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSL---GGFDT 367
Query: 357 CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV--GPSDINGERFKD 414
C+ N +P + F G + + V + ++ CLA+ P +N
Sbjct: 368 CF----NTTAVAWPPVTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVN----TV 419
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
L++I + QQN+ V +D+ + ++ F R C
Sbjct: 420 LNVIASMQQQNHRVLFDVPNGRVGFARERC 449
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 96/403 (23%), Positives = 163/403 (40%), Gaps = 47/403 (11%)
Query: 78 SSQKAHDTRAHLH--PGIS----------TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ KAHD R L G+ V ++Y IG PP +DTGS ++WV
Sbjct: 50 SALKAHDYRRQLSLLAGVDLPLGGSGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWV 109
Query: 126 KCQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIR----- 172
C C++C T +D +S + +PCD +C GG C NI
Sbjct: 110 NCIQCKECPTRSSLGMDLTLYDIKESSSGKLVPCDQEFCKEINGGLLTGCTANISCPYLE 169
Query: 173 -YTNGPDSQGTIGSEQF-------NFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFG 224
Y +G + G + + +T + ++ G S + + ++E G+ G
Sbjct: 170 IYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALDGILG 229
Query: 225 LGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G A SS S + KV F++C+ +N + +G + + TP+
Sbjct: 230 FGKANSSMISQLASSGKVKKMFAHCLNGVN----GGGIFAIGHVVQPKVNMTPLLPDQPH 285
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V + + +G L + + + D G IDSGTTL +L Y+ L ++
Sbjct: 286 YSVNMTAVQVGHTFLSLSTDTSAQGDR---KGTIIDSGTTLAYLPEGIYEPLVYKMISQH 342
Query: 342 QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLA 401
L D YS +++ GFPA+ F F G L + + S + +C+
Sbjct: 343 PDLKVQTLHDEYTCFQYSESVD---DGFPAVTFFFENGLSLKVYPHDYLF-PSVNFWCIG 398
Query: 402 VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S K+++++G + N V YDL ++ + + +C
Sbjct: 399 WQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQAIGWAEYNC 441
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 162/384 (42%), Gaps = 49/384 (12%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT--------TFDPSKSLTYA 147
V +++ +G P +DTGS ++WV C PC C + +F+P S T +
Sbjct: 86 VGLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTAS 145
Query: 148 TLPCDSSYCT----------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFET---SD 194
+ C CT C Y Y +G + G S+ FET ++
Sbjct: 146 RITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNE 205
Query: 195 EGKTFLYDVGFGCSHNNAH---FSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIG 248
+ + FGCS++ + +D G+FG G S S + +G FS+C
Sbjct: 206 QTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC-- 263
Query: 249 NLNYFEYAYNMLILGEGAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKN 306
L + +L+LGE I+E TP+ Y + LE I++ + L ID +LF +
Sbjct: 264 -LKGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 320
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDL 366
+T G +DSGTTL +L AY + + S + S +++
Sbjct: 321 NT---QGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS- 376
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQES----SSVFCLAVGPSDINGERFKDLSIIGMIA 422
FP + +F GG + + E+ Q++ S ++C+ + ++++I+G +
Sbjct: 377 --FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIG-----WQRNQGQEITILGDLV 429
Query: 423 QQNYNVAYDLVSKQLYFQRIDCEL 446
++ YDL + ++ + DC +
Sbjct: 430 LKDKIFVYDLANMRMGWADYDCSM 453
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 162/384 (42%), Gaps = 49/384 (12%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT--------TFDPSKSLTYA 147
V +++ +G P +DTGS ++WV C PC C + +F+P S T +
Sbjct: 88 VGLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTAS 147
Query: 148 TLPCDSSYCT----------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFET---SD 194
+ C CT C Y Y +G + G S+ FET ++
Sbjct: 148 RITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNE 207
Query: 195 EGKTFLYDVGFGCSHNNAH---FSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIG 248
+ + FGCS++ + +D G+FG G S S + +G FS+C
Sbjct: 208 QTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC-- 265
Query: 249 NLNYFEYAYNMLILGEGAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKN 306
L + +L+LGE I+E TP+ Y + LE I++ + L ID +LF +
Sbjct: 266 -LKGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 322
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDL 366
+T G +DSGTTL +L AY + + S + S +++
Sbjct: 323 NT---QGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS- 378
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQES----SSVFCLAVGPSDINGERFKDLSIIGMIA 422
FP + +F GG + + E+ Q++ S ++C+ + ++++I+G +
Sbjct: 379 --FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIG-----WQRNQGQEITILGDLV 431
Query: 423 QQNYNVAYDLVSKQLYFQRIDCEL 446
++ YDL + ++ + DC +
Sbjct: 432 LKDKIFVYDLANMRMGWADYDCSM 455
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 167/389 (42%), Gaps = 60/389 (15%)
Query: 81 KAHDTRAHL--------------HPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVK 126
+AHDTR H HP S +++ IG P +DTGS ++WV
Sbjct: 48 RAHDTRRHGRILSAVDLPLGGNGHP--SEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVN 105
Query: 127 CQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGYPD-----ECWYNIRY 173
C C++C T +D S T + CD ++C+ G P +C Y++ Y
Sbjct: 106 CAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY 165
Query: 174 TNGPDSQGTIGSE--QF-----NFETSDEGKTFLYDVGFGCSHNNA---HFSDEQFTGVF 223
+G + G + Q+ NF+T+ T + FGC + + S E G+
Sbjct: 166 GDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVV----FGCGNKQSGELGSSSEALDGIL 221
Query: 224 GLGPATSSTHSLVE---KVGSKFSYCIGNLNYFE-YAYNMLILGEGAILEGDSTPMSVI- 278
G G A SS S + KV FS+C+ N++ +A ++ + L +S + V+
Sbjct: 222 GFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIGEVVEPKVRFLLMNSVMIVVLF 281
Query: 279 --DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKE 336
Y V ++ I +G LD+ + F+ D G IDSGTTL + Y L ++
Sbjct: 282 LSRAHYNVVMKEIEVGGDPLDVPSDAFESGDR---KGTIIDSGTTLAYFPQEVYVPLIEK 338
Query: 337 VEDLFQGLLPSYPMDPAWHLC--YSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQES 394
+ Q L + ++ A+ C Y+GN++ GFP + HF L + +Q
Sbjct: 339 ILSQ-QPDLRLHTVEQAF-TCFDYTGNVD---DGFPTVTLHFDKSISLTVYPHEYLFQVK 393
Query: 395 SSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
+C+ S + KDL+++G AQ
Sbjct: 394 EFEWCIGWQNSGAQTKDGKDLTLLGEDAQ 422
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 156/381 (40%), Gaps = 50/381 (13%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPC---EQCGATTFDPSKSLTYATLPCDSSYCT 157
V+ ++G PP VLDTGS L W+ C P + A +F P S T+A +PC S+ C
Sbjct: 87 VSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCR 146
Query: 158 ND-------CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN 210
+ C G C ++ Y +G S G + ++ F G FGC +
Sbjct: 147 SRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAV-----GSGPPLRAAFGCMSS 201
Query: 211 NAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI--- 267
S + LG + + + +FSYCI + + +L+LG +
Sbjct: 202 AFDSSPDGVASAGLLGMNRGALSFVSQASTRRFSYCISDRD----DAGVLLLGHSDLPTF 257
Query: 268 -------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
+ + P+ D +Y V L GI +G K L I ++ + T + +DSG
Sbjct: 258 LPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQ-TMVDSG 316
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINRD--LQGFPAM 372
T T+L+ AY L+ E + LL PS+ A+ C+ R P +
Sbjct: 317 TQFTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGV 376
Query: 373 AFHFAGGADLVLDAESVFY------QESSSVFCLAVGPSDINGERFKDLS-IIGMIAQQN 425
F GA++ + + + Y + V+CL G N + ++ +IG Q N
Sbjct: 377 TLLF-NGAEMAVAGDRLLYKVPGERRGGDGVWCLTFG----NADMVPIMAYVIGHHHQMN 431
Query: 426 YNVAYDLVSKQLYFQRIDCEL 446
V YDL ++ + C++
Sbjct: 432 VWVEYDLERGRVGLAPVRCDV 452
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 162/379 (42%), Gaps = 50/379 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA--------TTFDPSKSLTYATL 149
++Y IG PP +DTGS ++WV C+ C T +DP+ S T T+
Sbjct: 84 LYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT--TV 141
Query: 150 PCDSSYCTND---------CGGYPDECWYNIRYTNGPDSQGTIGSE--QFNFETSDEGKT 198
C+ +C + C C + I Y +G + G ++ Q+N + S G+T
Sbjct: 142 GCEQEFCVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYN-QVSGNGQT 200
Query: 199 FLYDVG--FGCSHN---NAHFSDEQFTGVFGLGPATSSTHSLV---EKVGSKFSYCIGNL 250
+V FGC + S + G+ G G + +S S + KV F++C+
Sbjct: 201 TPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCL--- 257
Query: 251 NYFEYAYNMLILGEGAILEG---DSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKND 307
+ I G +++ +TP+ Y V L+GIS+G L + + F D
Sbjct: 258 ---DTVRGGGIFAIGNVVQPPIVKTTPLVPNATHYNVNLQGISVGGATLQLPTSTFDSGD 314
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRD 365
+ G IDSGTTL +L Y+TL V D L D +C+ SG+++ +
Sbjct: 315 S---KGTIIDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYED---FICFQFSGSLDEE 368
Query: 366 LQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
FP + F F G L + +Q + ++C+ + + KD+ ++G + N
Sbjct: 369 ---FPVITFSFEGDLTLNVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSN 425
Query: 426 YNVAYDLVSKQLYFQRIDC 444
V YDL + + + +C
Sbjct: 426 KLVVYDLEKQVIGWTDYNC 444
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 96/305 (31%), Positives = 140/305 (45%), Gaps = 39/305 (12%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V+ +IG PP P LDTGS LIW +CQPC C FDPS S T + CDS+
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTL 141
Query: 156 CTN----DCGG---YPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C CG +P++ C Y Y + + G + ++F F + + V FGC
Sbjct: 142 CQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---VPGVAFGC 198
Query: 208 S-HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL---- 262
NN F + TG+ G G S S + KVG+ FS+C +N + + +L L
Sbjct: 199 GLFNNGVFKSNE-TGIAGFGRGPLSLPSQL-KVGN-FSHCFTAVNGLKPSTVLLDLPADL 255
Query: 263 ---GEGAILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFK-KNDTWSDAGVF 315
G GA+ STP+ + YY++L+GI++G L + + F KN T G
Sbjct: 256 YKSGRGAV---QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGT---GGTI 309
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQ-GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAF 374
IDSGT +T L Y+ +R + ++ DP + C S + R P +
Sbjct: 310 IDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDP--YFCLSAPL-RAKPYVPKLVL 366
Query: 375 HFAGG 379
HF G
Sbjct: 367 HFEGA 371
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 98/367 (26%), Positives = 158/367 (43%), Gaps = 53/367 (14%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-------------FDPSK 142
V + Y IG P V L LDTGS + WV C C +C + + PS
Sbjct: 99 VWLHYTWIDIGTPNVSFLVALDTGSDMFWVPCD-CIECAPLSAAFYNALDRDLNQYSPSL 157
Query: 143 SLTYATLPCDSSYC--TNDCGGYPDECWYNIRYT-NGPDSQGTIGSEQFNFETSDEGKTF 199
S + LPC C ++C G+ D C Y YT + S G + ++ + +++ K
Sbjct: 158 SSSSRHLPCGHQLCNQNSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHLASNNATKNS 217
Query: 200 LY-DVGFGCSHNNAHFSDEQF--TGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYF 253
+ V GC + + E G+ GLGP + S +L+ K G + S C+
Sbjct: 218 IQASVILGCGRKQSGYFLEGAAPNGMLGLGPGSISVPALLAKAGLIRNSISICLN----- 272
Query: 254 EYAYNMLILG-EGAILEGDSTPMSVIDG---SYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
E ++ G +G + STP + DG +Y+V +E +G F +T
Sbjct: 273 EKGSGRILFGDQGHATQRRSTPFLLDDGELLNYFVGVERFCVGS---------FCYKET- 322
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
+ FID+GT+ T+L Y+T+ E E + + ++ CY+ + +R+ F
Sbjct: 323 -EFKAFIDTGTSFTYLPKGVYETVVAEFEKQVHATRITSQIQSDFNCCYNAS-SRESNNF 380
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGM---IAQQNY 426
P M F F+ ++ + + + CLAV SD +L IG IA QN+
Sbjct: 381 PPMKFTFSKNQSFIIQNPFISMDQEDTTICLAVVQSD------DELITIGRKYTIACQNF 434
Query: 427 NVAYDLV 433
+ YD+V
Sbjct: 435 LMGYDMV 441
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 98/405 (24%), Positives = 173/405 (42%), Gaps = 51/405 (12%)
Query: 78 SSQKAHDTRAHLH-----------PGISTVP-VFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ K HD R L G +P ++Y IG P +DTGS ++WV
Sbjct: 47 SALKEHDDRRQLTILAGIDLPLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWV 106
Query: 126 KCQPCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIR----- 172
C C+QC T ++ +S + + CD +C GG C N+
Sbjct: 107 NCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKANMSCPYLE 166
Query: 173 -YTNGPDSQGTIGSEQFNFETSD---EGKTFLYDVGFGCSHNNA----HFSDEQFTGVFG 224
Y +G + G + +++ + +T V FGC + ++E G+ G
Sbjct: 167 IYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILG 226
Query: 225 LGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G A SS S + +V F++C+ N + +G + + TP+
Sbjct: 227 FGKANSSMISQLASSGRVKKIFAHCLDGRN----GGGIFAIGRVVQPKVNMTPLVPNQPH 282
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V + + +G++ L+I +LF+ D G IDSGTTL +L Y+ L K++
Sbjct: 283 YNVNMTAVQVGQEFLNIPADLFQPGDR---KGAIIDSGTTLAYLPEIIYEPLVKKITSQ- 338
Query: 342 QGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFAGGADL-VLDAESVFYQESSSVFC 399
+ L + +D + YSG ++ +GFP + FHF L V + +F E ++C
Sbjct: 339 EPALKVHIVDKDYKCFQYSGRVD---EGFPNVTFHFENSVFLRVYPHDYLFPYE--GMWC 393
Query: 400 LAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ S + ++++++G + N V YDL ++ + + +C
Sbjct: 394 IGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNC 438
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 153/371 (41%), Gaps = 53/371 (14%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------------TFDPSKSLTYA 147
Y +G P V + LDTGS L WV C C +C T ++P +S T
Sbjct: 98 YTTVELGTPGVKFMVALDTGSDLFWVPCD-CSRCAPTHGASYASDFELSIYNPRESSTSK 156
Query: 148 TLPCDSSYCT--NDCGGYPDECWYNIRYTNGPDS-QGTIGSEQFNFETSDEGKTFLYD-V 203
+ C++ C N C G C Y + Y + S G + + + T D G+ F+ V
Sbjct: 157 KVTCNNDMCAQRNRCLGTFSSCPYIVSYVSAQTSTSGILVKDVLHLTTEDGGREFVEAYV 216
Query: 204 GFGCSH-NNAHFSD-EQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEYAYN 258
FGC + F D G+FGLG S S++ + G FS C G+
Sbjct: 217 TFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLSREGLIADSFSMCFGH-----DGIG 271
Query: 259 MLILGEGAILEGDSTPMSV--IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
+ G+ + + TP +V +Y VT+ +G ++D++
Sbjct: 272 RISFGDKGSPDQEETPFNVNPAHPTYNVTVTQARVGTMLIDVEFT------------ALF 319
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYSGNINRDLQGFPAMAF 374
DSGT+ T++V AY + ++ L + P DP + CY + + + P+M+
Sbjct: 320 DSGTSFTYMVDPAYSRVSEKFHSLARD--KRRPPDPRIPFEYCYDMSPDANASLVPSMSL 377
Query: 375 HFAGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
GG V D V ++ V+CLAV S +L+IIG Y V +D
Sbjct: 378 TMKGGRHFTVYDPIIVISTQNEIVYCLAVVKS-------TELNIIGQNFMTGYRVVFDRE 430
Query: 434 SKQLYFQRIDC 444
L +++ DC
Sbjct: 431 KLVLGWKKFDC 441
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 102/392 (26%), Positives = 168/392 (42%), Gaps = 58/392 (14%)
Query: 98 VFYVNF------SIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-------------- 137
+FY +F ++G PPV LAV DTGS L+W+KC +
Sbjct: 75 LFYGDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPP 134
Query: 138 --------FDPSKSLTYATLPCDSSYC----TN-DCGGYPDECWYNIRYTNGPDSQGTIG 184
F+P S +Y+ + CD C TN C G C + Y +G + G +
Sbjct: 135 PPPEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASATGLLA 194
Query: 185 SEQFNFETS-DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKF 243
++ F F + + T + FGC+ A + Q G+ GLG + SL ++G KF
Sbjct: 195 ADTFTFGGNINNDTTSTASIDFGCATGTAG-REFQADGMVGLG---AGPLSLASQLGRKF 250
Query: 244 SYCIGNLNYFEYAYNMLILGEGAILE--GDSTPMSVIDGSYYVTLEGISLGEKMLDIDPN 301
S+C+ + + A ++L G A++ G +T + S IS+ + P
Sbjct: 251 SFCLTAYD-IDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQP- 308
Query: 302 LFKKNDTWSDAGVFIDSGTTLTWLVPSAY-----QTLRKEVEDLFQGLLPSYPMDPAWHL 356
T S + V +D+GT LT+L +A ++L + ++ GL + P D L
Sbjct: 309 ---VPGTTSVSKVIVDTGTVLTFLDRAALLAPLTESLARVMDG--AGLPRAPPPDETLEL 363
Query: 357 CYSGNINRDLQGF---PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
CY + +D+ G + GG ++ L E F V CLAV + +
Sbjct: 364 CYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLCLAVVTT---SPELQ 420
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
LS++G +A Q+ +V DL ++ F +C+
Sbjct: 421 PLSVLGNVALQDLHVGIDLDARTATFATANCD 452
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 156/360 (43%), Gaps = 67/360 (18%)
Query: 93 ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATL 149
+S+ ++ NF+IG PP P AV+D L+W +C PC+ C FDP+KS T+ L
Sbjct: 51 LSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGL 110
Query: 150 PCDSSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
PC S C + +C D C Y G D+ G G++ F + E +
Sbjct: 111 PCGSHLCESIPESSRNC--TSDVCIYEAPTKAG-DTGGKAGTDTFAIGAAKE------TL 161
Query: 204 GFGCSHNNAHFSDEQF------TGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYA 256
GFGC +D++ +G+ GLG + SLV ++ + FSYC+ +
Sbjct: 162 GFGC----VVMTDKRLKTIGGPSGIVGLG---RTPWSLVTQMNVTAFSYCLAG-----KS 209
Query: 257 YNMLILGEGAI-LEG---DSTPMSVI-------DGS---YYVTLEGISLGEKMLDIDPNL 302
L LG A L G STP + +GS Y V L GI G L
Sbjct: 210 SGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPL------ 263
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNI 362
+ + S + V +D+ + ++L AY+ L+K + G+ P + LC+ +
Sbjct: 264 --QAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAV-GVQPVASPPKPYDLCFPKAV 320
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPS---DINGERFKDLSIIG 419
D P + F F GGA L + + + CL +G S ++ GE + SI+G
Sbjct: 321 AGDA---PELVFTFDGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGE-LEGASILG 376
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 111/466 (23%), Positives = 195/466 (41%), Gaps = 62/466 (13%)
Query: 8 LLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQAQRTLNMS 67
+L+ +LP++ T + +P+A + LV L L PN +
Sbjct: 15 ILIYFFSLPYSITAGENNLHHSPSARSRRPLVFPLF-----LSQPNSSSSRSISIPHRK- 68
Query: 68 MARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKC 127
L + S+ +R L+ + + IG PP ++D+GS++ +V C
Sbjct: 69 ------LHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC 122
Query: 128 QPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIG 184
CEQCG F P S TY + C+ C +C ++C Y Y S+G +G
Sbjct: 123 SDCEQCGKHQDPKFQPELSSTYQPVKCNMD-C--NCDDDKEQCVYEREYAEHSSSKGVLG 179
Query: 185 SEQFNFETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGVFGLGPA-TSSTHSLVEK--VG 240
+ +F +E + FGC + ++ G+ GLG S LV+K +
Sbjct: 180 EDLISF--GNESQLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLIS 237
Query: 241 SKFSYCIGNLNY---------FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISL 291
+ F C G ++ F+Y +M+ + D +P Y + L GI +
Sbjct: 238 NSFGLCYGGMDVGGGSMILGGFDYPSDMIFTDS----DPDRSPY------YNIDLTGIRV 287
Query: 292 GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL----VPSAYQTLRKEVEDLFQ--GLL 345
K L ++ +F + G +DSGTT +L + + + +EV L Q G
Sbjct: 288 AGKKLSLNSRVFD-----GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPD 342
Query: 346 PSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVG 403
P++ D + + S +++ + FP++ F G +L E+ ++ S +CL V
Sbjct: 343 PNF-KDTCFLVAASNDVSELSKIFPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVF 401
Query: 404 PSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLAD 449
P NG+ +++G I +N V YD + ++ F R +C L+D
Sbjct: 402 P---NGK--DHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSELSD 442
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 162/384 (42%), Gaps = 49/384 (12%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT--------TFDPSKSLTYA 147
V +++ +G P +DTGS ++WV C PC C + +F+P S T +
Sbjct: 2 VGLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTAS 61
Query: 148 TLPCDSSYCT----------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFET---SD 194
+ C CT C Y Y +G + G S+ FET ++
Sbjct: 62 RITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNE 121
Query: 195 EGKTFLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIG 248
+ + FGCS++ + +D G+FG G S S + +G FS+C
Sbjct: 122 QTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC-- 179
Query: 249 NLNYFEYAYNMLILGEGAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKN 306
L + +L+LGE I+E TP+ Y + LE I++ + L ID +LF +
Sbjct: 180 -LKGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 236
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDL 366
+T G +DSGTTL +L AY + + S + S +++
Sbjct: 237 NT---QGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS- 292
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQES----SSVFCLAVGPSDINGERFKDLSIIGMIA 422
FP + +F GG + + E+ Q++ S ++C+ + ++++I+G +
Sbjct: 293 --FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIG-----WQRNQGQEITILGDLV 345
Query: 423 QQNYNVAYDLVSKQLYFQRIDCEL 446
++ YDL + ++ + DC +
Sbjct: 346 LKDKIFVYDLANMRMGWADYDCSM 369
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/386 (26%), Positives = 175/386 (45%), Gaps = 61/386 (15%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP+ +DTGS ++WV C C C ++ FD S S + +
Sbjct: 76 VGLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSS 135
Query: 148 TLP-----CDSSYCT--NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
+ C+S++ T C ++C Y +Y +G + G SE F+ G++ +
Sbjct: 136 LVSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMV-MGQSMI 194
Query: 201 YD----VGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCI--- 247
+ V FGCS + SD G+FG GP S S + G FS+C+
Sbjct: 195 ANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGE 254
Query: 248 GNLNYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKK 305
GN +L+LGE +LE +P+ Y + L+ IS+ + L IDP++F
Sbjct: 255 GN------GGGILVLGE--VLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPIDPSVFA- 305
Query: 306 NDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEV-EDLFQGLLPSYPMDPAWHLCYS--GNI 362
T + G IDSGTTL +LV AY + + Q + P+ +L + G I
Sbjct: 306 --TSINRGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISKGNQCYLVSTSVGEI 363
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVF----YQESSSVFCLAVGPSDINGERFKDLSII 418
FP ++ +FAG A +VL E + + ++++C +G + + ++I+
Sbjct: 364 ------FPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWC--IGFQKVQ----EGVTIL 411
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDC 444
G + ++ YDL +++ + DC
Sbjct: 412 GDLVMKDKIFVYDLARQRIGWASYDC 437
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/364 (25%), Positives = 140/364 (38%), Gaps = 42/364 (11%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYC 156
P + V IG PP L +DT + W+ C C+ C +T F P KS T+ + C + C
Sbjct: 76 PTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAPEKSTTFKNVSCAAPEC 135
Query: 157 TN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD----VGFGCS 208
CG C +N+ Y GS + T D FGC
Sbjct: 136 KQVPNPGCGV--SSCNFNLTY----------GSSSIAANLVQDTITLATDPVPSYTFGCV 183
Query: 209 HNNAHFS---DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
S G S T +L + S FSYC+ + ++ ++ +
Sbjct: 184 SKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQ---STFSYCLPSFKSLNFSGSLRLGPVA 240
Query: 266 AILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
TP+ YYV LE I +G K++DI P N T + AG DSGT
Sbjct: 241 QPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPT-TGAGTIFDSGTVF 299
Query: 323 TWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL 382
T LV Y +R E L + + CY+ I P + F F G
Sbjct: 300 TRLVAPVYVAVRDEFRRRVGPKLTVTSLG-GFDTCYNVPI-----VVPTITFIFTGMNVT 353
Query: 383 VLDAESVFYQESSSVFCLAV--GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ + + + S CLA+ P ++N L++I + QQN+ V YD+ + ++
Sbjct: 354 LPQDNILIHSTAGSTTCLAMAGAPDNVNSV----LNVIANMQQQNHRVLYDVPNSRVGVA 409
Query: 441 RIDC 444
R C
Sbjct: 410 RELC 413
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 144/359 (40%), Gaps = 37/359 (10%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIW--VKCQP-----CEQCGATTFDPSKSLTYATLPC 151
++ +G P L VLDTGS ++W V+ P Q +T P+ + + C
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWN---C 178
Query: 152 DSSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
+ C + C + C Y + Y +G + G SE F + V GC
Sbjct: 179 VAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR----VQRVAIGC 234
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
H+N + S + G FSYC+ + A G
Sbjct: 235 GHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSRRARPSRRWG---- 290
Query: 268 LEGDSTPMSVIDGSYYVTLEGISLG-EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLV 326
TP + YYV L G S+G ++ + + + N T GV +DSGT++T L
Sbjct: 291 ----GTPR--MATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLA 344
Query: 327 PSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDA 386
Y+ +R GL S + CY+ + R ++ P ++ H AGGA + L
Sbjct: 345 RPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVK-VPTVSMHLAGGASVALPP 403
Query: 387 ESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
E+ ++S FC A+ +D +SIIG I QQ + V +D ++++ F C
Sbjct: 404 ENYLIPVDTSGTFCFAMAGTD------GGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 120/426 (28%), Positives = 179/426 (42%), Gaps = 64/426 (15%)
Query: 53 NDTVDAQAQRT-LNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVP 111
N T Q R+ L+M AR + S + + L G + ++F IG P
Sbjct: 50 NYTRAVQRSRSRLSMLAARAV--SNAGAAPGESAQTPLKKGSGD---YAMSFGIGTPATG 104
Query: 112 QLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTNDCGGYPD--- 165
DTGS LIW KC C +C G+ ++ P+ S + A + C CG P
Sbjct: 105 LSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGD----RTCGELPRPLC 160
Query: 166 -----------ECWYNIRYTNGPDS----QGTIGSEQFNFETSDEGKTFLYDVGFGCS-H 209
C Y+ Y N D+ +G + +E F F D+ F + FGC+
Sbjct: 161 SNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTF--GDDAAAF-PGIAFGCTLR 217
Query: 210 NNAHFSDEQFTGVFGLGPATSS--THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
+ F +G+ GLG S T VE G + S + + + + G
Sbjct: 218 SEGGFGTG--SGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGG--- 272
Query: 268 LEGDS---TPM---SVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
GDS TP+ V+ YYV L GIS+G K++ I F + + GV DSG
Sbjct: 273 -NGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSG 331
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
TTLT L AY +R E+ P + +C++G + FP+M HF GG
Sbjct: 332 TTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTT--FPSMVLHFDGG 389
Query: 380 ADLVLDAESVF----YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV-S 434
AD+ L E+ Q + C +V S + L+IIG I Q +++V +DL +
Sbjct: 390 ADMDLSTENYLPQMQGQNGETARCWSVVKSS------QALTIIGNIMQMDFHVVFDLSGN 443
Query: 435 KQLYFQ 440
++ FQ
Sbjct: 444 ARMLFQ 449
>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 119/440 (27%), Positives = 180/440 (40%), Gaps = 69/440 (15%)
Query: 41 KLLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFY 100
+L H D+ N N T D +R + S R L+ S + H+ R + P S + FY
Sbjct: 61 ELTHVDA---NLNLTSDELMRRAYDRSRLRAASLAAYSDGR-HEGRVSI-PDASYIITFY 115
Query: 101 VNFSIGQPPVPQL-AVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTN- 158
+ Q P + AV+DTGS + W + C S+S T + LPC S C
Sbjct: 116 LG---NQRPEDNISAVVDTGSDIFWTTEKEC----------SRSKTRSMLPCCSPKCEQR 162
Query: 159 -DCGGYPDE----------CWYNIRYT-NGPDSQGTIGSEQFNFETSDEGKTF-----LY 201
CG E C Y I Y N DS + E + K
Sbjct: 163 ASCGCGRSELKAEAEKETKCTYAIIYGGNANDSTAGVMYEDKLTIVAVASKAVPSSQSFK 222
Query: 202 DVGFGCSHNNA-HFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNM 259
+V GCS + F D GVFGLG S SL ++ SKFSYC+ + + +
Sbjct: 223 EVAIGCSTSATLKFKDPSIKGVFGLG---RSATSLPRQLNFSKFSYCLSSYQEPDLPSYL 279
Query: 260 LILGEGAILEGDST-----------PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
L+ + G P S Y+V L+ IS+G F T
Sbjct: 280 LLTAAPDMATGAVGGGAAVATTALQPNSDYKTLYFVHLQNISIGG-------TRFPAVST 332
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCYS--GNINR 364
S +F+D+G + T L + + L E++ + + + P +CYS
Sbjct: 333 KSGGNMFVDTGASFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICYSPPSTAAD 392
Query: 365 DLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
+ P M HFA A++VL +S ++ ++S CLA+ S+I G +S++G Q
Sbjct: 393 ESSKLPDMVLHFADSANMVLPWDSYLWK-TTSKLCLAIYKSNIKG----GISVLGNFQMQ 447
Query: 425 NYNVAYDLVSKQLYFQRIDC 444
N ++ D +++L F R DC
Sbjct: 448 NTHMLLDTGNEKLSFVRADC 467
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 139/327 (42%), Gaps = 55/327 (16%)
Query: 106 GQPPVPQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYCTNDC 160
G V Q ++D+GS + WV+C+PC C FDP+ S TYA +PC S+ C
Sbjct: 71 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQ-L 129
Query: 161 GGYPD------ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-NNAH 213
G Y +C + I Y +G + GT + D + F FGC+H +
Sbjct: 130 GPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFR----FGCAHADRGS 185
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGSK----FSYCIGNLNYFEYAYNMLILG---EGA 266
D G LG + SLV++ ++ FSYC L + L+LG E A
Sbjct: 186 AFDYDVAGSLALG---GGSQSLVQQTATRYGRVFSYC---LPPTASSLGFLVLGVPPERA 239
Query: 267 ILEGD--STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
L STP+ S+ Y V L I + + L + P +F A IDS T
Sbjct: 240 QLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFS-------ASSVIDSSTI 292
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAG 378
++ L P+AYQ LR F+ + Y P + CY R + P++A F G
Sbjct: 293 ISRLPPTAYQALRAA----FRSAMTMYRAAPPVSILDTCYDFTGVRSIT-LPSIALVFDG 347
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPS 405
GA + LDA + CLA P+
Sbjct: 348 GATVNLDAAGILLGS-----CLAFAPT 369
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 70/166 (42%), Gaps = 24/166 (14%)
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V L I + + L + P +F + I S T ++ L P+AYQ LR F
Sbjct: 485 YRVLLRAIIVAGRPLPVPPTVFSTSSV-------IASTTVISRLPPTAYQALRAA----F 533
Query: 342 QGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVF 398
+ + Y P + CY R + P++A F GGA + LDA + Q
Sbjct: 534 RRAMTMYRTAPPVSILDTCYDFTGVRSIT-LPSIALVFDGGATVNLDAAGILLQG----- 587
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
CLA P+ + + IG + Q+ V YD+ K + F+ C
Sbjct: 588 CLAFAPTATD----RMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 110/413 (26%), Positives = 172/413 (41%), Gaps = 62/413 (15%)
Query: 79 SQKAHDTRAHLHPGISTVPVF-------YVNFSIGQPPVPQLAVLDTGSSLIWVKCQ--- 128
S+ H R G T+P + V FS+G PP VLDTGSSL+W C
Sbjct: 47 SRARHLKRPPTLTGKVTLPAYPRSYGGYSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPT 106
Query: 129 ---PCEQCGATTFDPSKSLTYA--------TLPCDSSYCT------NDCGGYPDECWYNI 171
C+ C + DP+K YA +LPC S C +C +Y +
Sbjct: 107 ATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGL 166
Query: 172 RYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS 231
Y G + G + S+ + FL FGCS S+ Q G+ G G +
Sbjct: 167 EYGLG-STTGQLVSDVLGLSKLNRIPDFL----FGCS----LVSNRQPEGIAGFGRGLA- 216
Query: 232 THSLVEKVG-SKFSYCIGNLNYFEYAYNM-LILGEG-----AILEG-------DSTPMSV 277
S+ ++G +KFSYC+ + + + + L+L G A G S +S
Sbjct: 217 --SIPAQLGLTKFSYCLVSHRFDDTPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSP 274
Query: 278 IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEV 337
YY++L I +G K + I P + D G+ +DSG+T T++ + + +E+
Sbjct: 275 YSEYYYISLSKILVGGKDVPIPPRYLVPSKE-GDGGMIVDSGSTFTFMERIIFDPVAREL 333
Query: 338 EDLFQGLLPSYPMDPAWHL--CY--SGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQE 393
E + ++ + L CY +G D+ P + F F GGA++ L F
Sbjct: 334 EKHMTKYKRAKEIEDSSGLGPCYNITGQSEVDV---PKLTFSFKGGANMDLPLTDYFSLV 390
Query: 394 SSSVFCLAV-GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ V C+ V D G I+G QQN+ + YDL ++ F+ C+
Sbjct: 391 TDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQNFYIEYDLKKQRFGFKPQQCD 443
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 133/289 (46%), Gaps = 45/289 (15%)
Query: 160 CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQF 219
CG C Y I Y +G ++G +G E+ F G + D FGC NN F
Sbjct: 69 CGSAAPICNYAINYGDGSFTRGELGHEKLKF-----GTILVKDFIFGCGRNNKGL----F 119
Query: 220 TGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM 275
GV GL S SL+ + G FSYC+ + LILG + + +S+P+
Sbjct: 120 GGVSGLMGLGRSDLSLISQTSGIFGGVFSYCLPSTE--RKGSGSLILGGNSSVYRNSSPI 177
Query: 276 S---VIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
S +I+ Y++ L GIS+G L + + + + +DSGT +T L P
Sbjct: 178 SYAKMIENPQLYNFYFINLTGISIGGVAL--------QAPSVGPSRILVDSGTVITRLPP 229
Query: 328 SAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVL 384
+ Y+ L+ E F G +P PA+ + C++ + +++ P + HF G A+L +
Sbjct: 230 TIYKALKAEFLKQFTG----FPPAPAFSILDTCFNLSAYQEVD-IPTIKMHFEGNAELTV 284
Query: 385 DAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYD 431
D VFY + +S CLA+ + E ++I+G Q+N V YD
Sbjct: 285 DVTGVFYFVKSDASQVCLALASLEYQDE----VAILGNYQQKNLRVIYD 329
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 120/426 (28%), Positives = 179/426 (42%), Gaps = 64/426 (15%)
Query: 53 NDTVDAQAQRT-LNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVP 111
N T Q R+ L+M AR + S + + L G + ++F IG P
Sbjct: 50 NYTRAVQRSRSRLSMLAARAV--SNAGAAPGESAQTPLKKGSGD---YAMSFGIGTPATG 104
Query: 112 QLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYCTNDCGGYPD--- 165
DTGS LIW KC C +C G+ ++ P+ S + A + C CG P
Sbjct: 105 LSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGD----RTCGELPRPLC 160
Query: 166 -----------ECWYNIRYTNGPDS----QGTIGSEQFNFETSDEGKTFLYDVGFGCS-H 209
C Y+ Y N D+ +G + +E F F D+ F + FGC+
Sbjct: 161 SNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTF--GDDAAAF-PGIAFGCTLR 217
Query: 210 NNAHFSDEQFTGVFGLGPATSS--THSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
+ F +G+ GLG S T VE G + S + + + + G
Sbjct: 218 SEGGFGTG--SGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGG--- 272
Query: 268 LEGDS---TPM---SVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
GDS TP+ V+ YYV L GIS+G K++ I F + + GV DSG
Sbjct: 273 -NGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSG 331
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGG 379
TTLT L AY +R E+ P + +C++G + FP+M HF GG
Sbjct: 332 TTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTT--FPSMVLHFDGG 389
Query: 380 ADLVLDAESVF----YQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV-S 434
AD+ L E+ Q + C +V S + L+IIG I Q +++V +DL +
Sbjct: 390 ADMDLSTENYLPQMQGQNGETARCWSVVKSS------QALTIIGNIMQMDFHVVFDLSGN 443
Query: 435 KQLYFQ 440
++ FQ
Sbjct: 444 ARMLFQ 449
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 99/400 (24%), Positives = 171/400 (42%), Gaps = 50/400 (12%)
Query: 74 LSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC 133
L + S+ +R L+ + + IG PP ++D+GS++ +V C CEQC
Sbjct: 68 LHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQC 127
Query: 134 GA---TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF 190
G F P S TY + C+ C +C ++C Y Y S+G +G + +F
Sbjct: 128 GKHQDPKFQPEMSSTYQPVKCNMD-C--NCDDDREQCVYEREYAEHSSSKGVLGEDLISF 184
Query: 191 ETSDEGKTFLYDVGFGC-SHNNAHFSDEQFTGVFGLGPA-TSSTHSLVEK--VGSKFSYC 246
+E + FGC + ++ G+ GLG S LV+K + + F C
Sbjct: 185 --GNESQLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLC 242
Query: 247 IGNLNY---------FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLD 297
G ++ F+Y +M+ + D +P Y + L GI + K L
Sbjct: 243 YGGMDVGGGSMILGGFDYPSDMVFTDS----DPDRSPY------YNIDLTGIRVAGKQLS 292
Query: 298 IDPNLFKKNDTWSDAGVFIDSGTTLTWL----VPSAYQTLRKEVEDLFQ--GLLPSYPMD 351
+ +F + G +DSGTT +L + + + +EV L Q G P++ D
Sbjct: 293 LHSRVFD-----GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNF-KD 346
Query: 352 PAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDING 409
+ + S ++ + FP++ F G +L E+ ++ S +CL V P NG
Sbjct: 347 TCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFP---NG 403
Query: 410 ERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLAD 449
+ +++G I +N V YD + ++ F R +C L+D
Sbjct: 404 K--DHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSELSD 441
>gi|222640101|gb|EEE68233.1| hypothetical protein OsJ_26421 [Oryza sativa Japonica Group]
Length = 439
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 113/430 (26%), Positives = 192/430 (44%), Gaps = 77/430 (17%)
Query: 73 YLSQKSSQ-KAHDTRAHLHPGISTVPVFYVNFS------IGQPPVPQLAVLDTGSSLIWV 125
YL K+S +A D + I P++ V+ S +G +L DT +++W+
Sbjct: 29 YLFNKTSALEAADVNGNNSAEILAAPLYPVSHSYLLEIAVGSLGKTRLVSFDTAVNMVWL 88
Query: 126 KCQP-CEQCG-------ATTFDPSKSLTYATLPCDSSYCTNDCGGYPDE----------C 167
+C C C T ++ S S++Y L CD C G D+ C
Sbjct: 89 QCSDYCRDCNPSQVGTSTTYYNASMSISYNPLSCDHPLC--GAGDNHDQQVLAECMDGTC 146
Query: 168 WYNIRY--TNGPDSQGTIGSEQFNFETSDEGKTFLYD--VGFGCSH-NNAHFSDEQF--T 220
+ + NG QG +GS++ + + FL+D + FGC+ +++ ++ +Q+ +
Sbjct: 147 TFKVDSLDNNGGWVQGILGSDRISIS---DHFFFLFDTNIIFGCATVDHSKYTLDQYGSS 203
Query: 221 GVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFE-YAYNMLILGEGAILEGDSTPMSVI 278
GV GLG +SL +++ ++FSYC+ + E ++ ++ G A+L+GD TP
Sbjct: 204 GVVGLGLGK---YSLPQQISVTRFSYCLPSWVKNELFSPPYVLFGSNAVLQGDMTPFLPG 260
Query: 279 DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF---------------IDSGTTLT 323
YY+ LEGIS G LDI + D + F ++S T
Sbjct: 261 FPKYYLKLEGISYGIVRLDIFGSNAAAADQYHQQAQFCRGPYLPDAQFYAMSVESATFPL 320
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSY--PMDPAWHLCYSGNINRDLQGFPAMAFHFAGGAD 381
L AY+ L KE E L+ S PM+ CY G+++ D+ + HF GG D
Sbjct: 321 MLPSRAYELLEKEFEQDNPLLIKSRLQPMNT----CYKGSVD-DIADNATITLHFHGGID 375
Query: 382 LVLDAESVFYQESS-------SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
L L + F + +S CL V + ++G +++G+ Q ++N+ +DL +
Sbjct: 376 LQLSRNATFMEITSMNGDQEERYVCLIVDKT-VDGT-----AVLGLSPQLDHNIGFDLEN 429
Query: 435 KQLYFQRIDC 444
KQ+ R C
Sbjct: 430 KQISIYRKIC 439
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 55/327 (16%)
Query: 106 GQPPVPQLAVLDTGSSLIWVKCQPC--EQCGATT---FDPSKSLTYATLPCDSSYCTNDC 160
G V Q ++D+GS + WV+C+PC C FDP+ S TYA +PC S+ C
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQ-L 220
Query: 161 GGYPD------ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
G Y +C + I Y +G + GT + D + F FGC+H +
Sbjct: 221 GPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFR----FGCAHADRGS 276
Query: 215 S-DEQFTGVFGLGPATSSTHSLVEKVGSK----FSYCIGNLNYFEYAYNMLILG---EGA 266
+ D G LG + SLV++ ++ FSYC L + L+LG E A
Sbjct: 277 AFDYDVAGSLALG---GGSQSLVQQTATRYGRVFSYC---LPPTASSLGFLVLGVPPERA 330
Query: 267 ILEGD--STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTT 321
L STP+ S+ Y V L I + + L + P +F A IDS T
Sbjct: 331 QLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFS-------ASSVIDSSTI 383
Query: 322 LTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAG 378
++ L P+AYQ LR F+ + Y P + CY R + P++A F G
Sbjct: 384 ISRLPPTAYQALRAA----FRSAMTMYRAAPPVSILDTCYDFTGVRSIT-LPSIALVFDG 438
Query: 379 GADLVLDAESVFYQESSSVFCLAVGPS 405
GA + LDA + CLA P+
Sbjct: 439 GATVNLDAAGILLGS-----CLAFAPT 460
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 87/218 (39%), Gaps = 38/218 (17%)
Query: 240 GSKFSYCI----GNLNYF------EYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGI 289
G FSYCI +L + + A + +L S P + Y V L I
Sbjct: 528 GRVFSYCIPPSPSSLGFITLGVPPQRAALVPTFVSTPLLSSSSMPPTF----YRVLLRAI 583
Query: 290 SLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP 349
+ + L + P +F + I S T ++ L P+AYQ LR F+ + Y
Sbjct: 584 IVAGRPLPVPPTVFSTSSV-------IASTTVISRLPPTAYQALRAA----FRRAMTMYR 632
Query: 350 MDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSD 406
P + CY R + P++A F GGA + LDA + Q CLA P+
Sbjct: 633 TAPPVSILDTCYDFTGVRSIT-LPSIALVFDGGATVNLDAAGILLQG-----CLAFAPTA 686
Query: 407 INGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + IG + Q+ V YD+ K + F+ C
Sbjct: 687 TD----RMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 95/364 (26%), Positives = 158/364 (43%), Gaps = 45/364 (12%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCG 161
IG PP ++DTGSS+ +V C CEQCG F P S TY ++ C+ +C
Sbjct: 19 IGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCN---IDCNCD 75
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-NNAHFSDEQFT 220
+C Y +Y S G +G + +F + FGC + +
Sbjct: 76 DEKQQCVYERQYAEMSTSSGVLGEDIISF--GNLSALAPQRAVFGCENMETGDLYSQHAD 133
Query: 221 GVFGLGPA-TSSTHSLVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV 277
G+ G+G S LV+K + FS C G + A ++LG G S P ++
Sbjct: 134 GIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGA---MVLG------GISPPSNM 184
Query: 278 IDGS--------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSA 329
+ Y + L+ I + K L ++P +F G +DSGTT +L +A
Sbjct: 185 VFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVFD-----GKHGTILDSGTTYAYLPEAA 239
Query: 330 YQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSG---NINRDLQGFPAMAFHFAGGADLVLD 385
+ + + + L P DP ++ +C+SG +I++ FPA+ F G L+L
Sbjct: 240 FVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVFGNGQKLLLS 299
Query: 386 AESVFYQESS--SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
E+ ++ S +CL + NG+ +++G I +N V YD + ++ F + +
Sbjct: 300 PENYLFRHSKVHGAYCLGIFQ---NGK--DPTTLLGGIVVRNTLVLYDRENSKIGFWKTN 354
Query: 444 CELL 447
C L
Sbjct: 355 CSEL 358
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 148/371 (39%), Gaps = 54/371 (14%)
Query: 89 LHPG--ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--FDPSKSL 144
+ PG I ++P + +G P L +D + WV C C C A++ F P++S
Sbjct: 90 IAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQSS 149
Query: 145 TYATLPCDSSYCTN----DC-GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
TY T+PC S C C G C +N+ Y Q +G + E
Sbjct: 150 TYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAAS-TFQAVLGQDSLALE-----NNV 203
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNM 259
+ FGC V G A + H L + G+L
Sbjct: 204 VVSYTFGC-----------LRVVNGNSRAAAGAHRLRPRAALLLVADQGHLGPIGQPKR- 251
Query: 260 LILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
+ +L P YYV + GI +G K++ + + N + +G ID+G
Sbjct: 252 --IKTTPLLYNPHRP-----SLYYVNMIGIRVGSKVVQVPQSALAFNPV-TGSGTIIDAG 303
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLL--PSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
T T L Y +R D F+G + P P + CY+ ++ P + F FA
Sbjct: 304 TMFTRLAAPVYAAVR----DAFRGRVRTPVAPPLGGFDTCYNVTVS-----VPTVTFMFA 354
Query: 378 GGADLVLDAESVFYQESSS-VFCLAV--GPSD-INGERFKDLSIIGMIAQQNYNVAYDLV 433
G + L E+V SS V CLA+ GPSD +N L+++ + QQN V +D+
Sbjct: 355 GAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAA----LNVLASMQQQNQRVLFDVA 410
Query: 434 SKQLYFQRIDC 444
+ ++ F R C
Sbjct: 411 NGRVGFSRELC 421
>gi|326533786|dbj|BAK05424.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 412
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 149/365 (40%), Gaps = 46/365 (12%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSSY 155
+V+ GQ ++ LDT +S WV C+PC Q G F P++S T+ + D
Sbjct: 69 FVSIGTGQGGRRKILALDTAASTSWVMCEPCRPPLHQLG-RLFSPAESPTFRGVRRDDPV 127
Query: 156 CTNDCGGYPDECWYNIRYTNG-----PDSQGTIGSEQFNFETSDEGKT-FLYDVGFGCSH 209
C ++ + TNG P + G + + F+ S+ + V FGC+H
Sbjct: 128 CVPP--------YHRLHSTNGCSFAFPSAIGYLARDTFHLRHSERSVVKSISGVAFGCAH 179
Query: 210 NNAHFSDEQ-FTGVFGLGPATSS-THSLVEKVGSKFSYCIGN-------LNYFEYAYNML 260
F +E GV L P+ S + G +FSYC+ + + ++ +
Sbjct: 180 TTTGFYNEDILGGVLSLSPSPLSFLTQFGSRAGGRFSYCLPDPTTSHNPSGFIQFGIEVP 239
Query: 261 ILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
L A +T ++V Y+++L GISLG K LDID ++ + G I+
Sbjct: 240 SLPRHA----HTTTLTVSASGYHLSLIGISLGNKRLDIDRHILTSH------GCSINPAE 289
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG-FPAMAFHFAGG 379
T+T + AY + +E+ L P I+R ++ P M FHFA G
Sbjct: 290 TITKIAEPAYIIVARELMAQMNELGSKQVKGPPSSPLVFNKISRRVRARLPNMVFHFADG 349
Query: 380 ADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
D+ A +F ++ L G ++IG Q N +++ + +L F
Sbjct: 350 GDMWFTAGKLFQVIGTTARFLVEGHGS-------HRTVIGAAQQVNARFIFNVAAGRLTF 402
Query: 440 QRIDC 444
C
Sbjct: 403 AEELC 407
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 149/340 (43%), Gaps = 51/340 (15%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTY 146
V ++Y +G PP +DTGS ++WV C C C T+ FDP S+T
Sbjct: 77 VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 147 ATLPCDSSYCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF 199
+ + C C+ + C + C Y +Y +G + G S+ F+ G +
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMI-VGSSL 195
Query: 200 LYD----VGFGCSHNNAH---FSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGN 249
+ + V FGCS + SD G+FG G S S + G FS+C+
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255
Query: 250 LNYFEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKND 307
N +L+LGE I+E + TP+ Y V L IS+ + L I+P++F
Sbjct: 256 EN---GGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFS--- 307
Query: 308 TWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY-----SGNI 362
T + G ID+GTTL +L +AY + + + + P+ + CY G+I
Sbjct: 308 TSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSV--RPVVSKGNQCYVITTSVGDI 365
Query: 363 NRDLQGFPAMAFHFAGGADLVLDAESVFYQES--SSVFCL 400
FP ++ +FAGGA + L+ + Q++ +S C
Sbjct: 366 ------FPPVSLNFAGGASMFLNPQDYLIQQNNVASALCF 399
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/405 (23%), Positives = 172/405 (42%), Gaps = 51/405 (12%)
Query: 78 SSQKAHDTRAHL-----------HPGISTVP-VFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
++ K HD R L G +P ++Y IG P +DTGS ++WV
Sbjct: 47 TALKEHDDRRQLTILAGIDLPLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWV 106
Query: 126 KCQPCEQCGA--------TTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIR----- 172
C C+QC T ++ +S + + CD +C GG C N+
Sbjct: 107 NCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKANMSCPYLE 166
Query: 173 -YTNGPDSQGTIGSEQFNFETSD---EGKTFLYDVGFGCSHNNA----HFSDEQFTGVFG 224
Y +G + G + +++ + +T V FGC + ++E G+ G
Sbjct: 167 IYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILG 226
Query: 225 LGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G A SS S + +V F++C+ N + +G + + TP+
Sbjct: 227 FGKANSSMISQLASSGRVKKIFAHCLDGRN----GGGIFAIGRVVQPKVNMTPLVPNQPH 282
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V + + +G++ L I +LF+ D G IDSGTTL +L Y+ L K++
Sbjct: 283 YNVNMTAVQVGQEFLTIPADLFQPGDR---KGAIIDSGTTLAYLPEIIYEPLVKKITSQ- 338
Query: 342 QGLLPSYPMDPAWH-LCYSGNINRDLQGFPAMAFHFAGGADL-VLDAESVFYQESSSVFC 399
+ L + +D + YSG ++ +GFP + FHF L V + +F E ++C
Sbjct: 339 EPALKVHIVDKDYKCFQYSGRVD---EGFPNVTFHFENSVFLRVYPHDYLFPHE--GMWC 393
Query: 400 LAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ S + ++++++G + N V YDL ++ + + +C
Sbjct: 394 IGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNC 438
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/380 (23%), Positives = 170/380 (44%), Gaps = 46/380 (12%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C C C ++ FD S T
Sbjct: 97 VGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAG 156
Query: 148 TLPCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
++ C C++ C ++C Y+ RY +G + G ++ F F+ + G++ +
Sbjct: 157 SVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYGDGSGTSGYYMTDTFYFD-AILGESLV 214
Query: 201 YD----VGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNL 250
+ + FGCS + SD+ G+FG G S S + G FS+C+
Sbjct: 215 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 251 NYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
+ +LGE + +P+ Y + L I + ++L ID +F+ ++T
Sbjct: 275 G---SGGGVFVLGEILVPGMVYSPLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNT-- 329
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFP 370
G +D+GTTLT+LV AY + + L+ + S +I+ D+ FP
Sbjct: 330 -RGTIVDTGTTLTYLVKEAYDPFLNAISNSVSQLVTLIISNGEQCYLVSTSIS-DM--FP 385
Query: 371 AMAFHFAGGADLVLDAESVFYQ----ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
++ +FAGGA ++L + + + +S++C+ + ++ +I+G + ++
Sbjct: 386 PVSLNFAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAP------EEQTILGDLVLKDK 439
Query: 427 NVAYDLVSKQLYFQRIDCEL 446
YDL +++ + DC +
Sbjct: 440 VFVYDLARQRIGWANYDCSM 459
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 101/395 (25%), Positives = 161/395 (40%), Gaps = 67/395 (16%)
Query: 92 GISTVPVFY-------VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPS 141
G S VP+ + NF+IG PP P A++D L+W +C C +C F P+
Sbjct: 29 GGSAVPIHWSRHLYNVANFTIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPN 88
Query: 142 KSLTYATLPCDSSYC----TNDCGGYPDECWY----NIRYTNGPDSQGTIGSEQFNFETS 193
S T+ PC + C T++C G D C Y NIR + + G +G+E F T+
Sbjct: 89 ASSTFRPEPCGTDACKSTPTSNCSG--DVCTYESTTNIRL-DRHTTLGIVGTETFAIGTA 145
Query: 194 DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNY 252
+ FGC + + + +G GLG + SLV ++ +KFSYC+
Sbjct: 146 TA------SLAFGCVVASDIDTMDGTSGFIGLG---RTPRSLVAQMKLTKFSYCLSPRGT 196
Query: 253 FEYAYNMLILGEGAILEG----------DSTPMSVIDGSYYVTLEGISLGEKMLDIDPNL 302
+ + L LG A L G ++P Y ++L+ I G +
Sbjct: 197 GK--SSRLFLGSSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTI------ 248
Query: 303 FKKNDTWSDAGVFI-DSGTTLTWLVPSAYQTLRKEVEDLFQG------LLPSYPMDPAWH 355
T G+ + + + + LV SAY+ +K V + G P P D
Sbjct: 249 ----ATAQSGGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFD---- 300
Query: 356 LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ--ESSSVFCLAV-GPSDINGERF 412
LC+ P + F F G A L + E C A+ + +N
Sbjct: 301 LCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGL 360
Query: 413 KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+ +S++G + Q++ + YDL + L F+ DC L
Sbjct: 361 EGVSVLGSLQQEDVHFLYDLKKETLSFEPADCSSL 395
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 90/397 (22%), Positives = 165/397 (41%), Gaps = 47/397 (11%)
Query: 81 KAHDTRAHLH------------PGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
K+HDTR H + +V +++ +G PP +DTGS ++W+ C+
Sbjct: 44 KSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCK 103
Query: 129 PCEQCGATT--------FDPSKSLTYATLPCDSSYCT----NDCGGYPDECWYNIRYTNG 176
PC +C T FD + S T + CD +C+ +D C Y+I Y +
Sbjct: 104 PCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADE 163
Query: 177 PDSQGTIGSEQFNFE-TSDEGKT--FLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATS 230
S G + E + + KT +V FGC + + D GV G G + +
Sbjct: 164 STSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNT 223
Query: 231 STHSLVEKVGSK---FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLE 287
S S + G FS+C+ N+ + +G + +TPM Y V L
Sbjct: 224 SVLSQLAATGDAKRVFSHCLDNVK----GGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLM 279
Query: 288 GISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS 347
G+ + LD+ ++ + + G +DSGTTL + Y +L + + L + +
Sbjct: 280 GMDVDGTSLDLPRSIVR------NGGTIVDSGTTLAYFPKVLYDSLIETI--LARQPVKL 331
Query: 348 YPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDI 407
+ ++ + C+S + N D + FP ++F F L + + ++C +
Sbjct: 332 HIVEETFQ-CFSFSTNVD-EAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGL 389
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ ++ ++G + N V YDL ++ + + +C
Sbjct: 390 TTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNC 426
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 170/391 (43%), Gaps = 47/391 (12%)
Query: 70 RFIYLSQKSSQKAHDTRAHLHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
RF YLS + K+ T + G + + V +G PP VLDT + +W+ C
Sbjct: 75 RFTYLSSLVAGKSKPTSVPVASGNQLHIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCS 134
Query: 129 PCEQC--GATTFDPSKSLTYATLPCDSSYCTNDCG-------GYPDECWYNIRYTNGPDS 179
C C +T+F+ + S TY+T+ C ++ CT G P C +N Y G DS
Sbjct: 135 GCSGCSNASTSFNTNSSSTYSTVSCSTTQCTQARGLTCPSSTPQPSICSFNQSY--GGDS 192
Query: 180 QGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPA----TSSTHSL 235
+ Q S + + + FGC N+A + G+ GLG S T SL
Sbjct: 193 SFSANLVQDTLTLSPD---VIPNFSFGC-INSASGNSLPPQGLMGLGRGPMSLVSQTTSL 248
Query: 236 VEKVGSKFSYCIGNLN--YFEYAYNMLILGE------GAILEGDSTPMSVIDGSYYVTLE 287
V FSYC+ + YF + + +LG+ +L P YYV L
Sbjct: 249 YSGV---FSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRP-----SLYYVNLT 300
Query: 288 GISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS 347
G+S+G + +DP ++ D+ S AG IDSGT +T Y+ +R E G S
Sbjct: 301 GVSVGSVQVPVDP-VYLTFDSNSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNG---S 356
Query: 348 YPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVF-CLAVGPSD 406
+ A+ C+S + N ++ P + H DL L E+ S+ CL++ +
Sbjct: 357 FSTLGAFDTCFSAD-NENVT--PKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSM--AG 410
Query: 407 INGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
I L++I + QQN + +D+ + ++
Sbjct: 411 IRQNANAVLNVIANLQQQNLRILFDVPNSRI 441
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 169/382 (44%), Gaps = 55/382 (14%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQ---------PCEQCGATTFDPSKSLTYATLPC 151
V+ IG PP P VLDTGS L W++C P + T+FDPS S +++ LPC
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKKRLPPLPKPKTTSFDPSLSSSFSLLPC 127
Query: 152 DSSYCTNDCGGYP-----DE---CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
+ C + D+ C Y+ Y +G ++G + E+F F S +
Sbjct: 128 NHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVI--- 184
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI---------------G 248
GC+ + + G+ G+ S S + SKFSYC+
Sbjct: 185 -LGCAQ-----ASTENRGILGMNRGRLSFISQAKI--SKFSYCVPSRTGSNPTGLFYLGD 236
Query: 249 NLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
N N ++ Y ++ E S+P ++ +Y + ++ I + K L++ P FK D
Sbjct: 237 NPNSSKFKYVTML----TFPESQSSP-NLDPLAYTLPMKAIKIAGKRLNVPPAAFKP-DA 290
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-YPMDPAWHLCYSGNINRDL- 366
IDSG+ LT+LV AY+ +++EV L ++ Y +C+ + ++
Sbjct: 291 GGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVG 350
Query: 367 QGFPAMAFHFAGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
+ ++F F G ++ V E V + V C+ +G S+ G +IIG + QQN
Sbjct: 351 RRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLG---IGSNIIGTVHQQN 407
Query: 426 YNVAYDLVSKQLYFQRIDCELL 447
V YDL +K++ F +C L
Sbjct: 408 MWVEYDLANKRVGFGGAECSRL 429
>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
Length = 424
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/389 (25%), Positives = 150/389 (38%), Gaps = 88/389 (22%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-------------FDPSKSLT 145
+ ++ IG PP P AV+DTGS L+W +C C A ++ S S T
Sbjct: 78 YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137
Query: 146 YATLPCD------------SSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETS 193
+PCD ++ C G D C Y G + G +G++ F F +S
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSS 196
Query: 194 DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYF 253
+ FGC + S TG G+ IG
Sbjct: 197 SS-----VTLAFGCV-SQTRISPGALTGASGI---------------------IG----- 224
Query: 254 EYAYNMLILGEGAI-LEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT---- 308
LG GA+ L +P S YY+ L G++ G + + F +
Sbjct: 225 --------LGRGALSLNPKDSPFSTF---YYLPLVGLAAGNATVALPAGAFDLREAAPKV 273
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG----LLPSYPMDPAWHLCYSGNINR 364
W+ G IDSG+ T LV A++ L KE+ +G + P + A LC +
Sbjct: 274 WA-GGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAGDDG 332
Query: 365 D---LQGFPAMAFHF----AGGADLVLDAESVFYQESSSVFCLAVGPSDINGERF--KDL 415
D P++ F GG +LV+ AE + + +S +C+AV S +
Sbjct: 333 DSLAAAAVPSLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSASGNATLPTNET 392
Query: 416 SIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+IIG QQ+ V YDL + L FQ +C
Sbjct: 393 TIIGNFMQQDMRVLYDLANGLLSFQPANC 421
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/385 (23%), Positives = 159/385 (41%), Gaps = 47/385 (12%)
Query: 81 KAHDTRAHLH------------PGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
K+HDTR H + +V +++ +G PP +DTGS ++W+ C+
Sbjct: 44 KSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCK 103
Query: 129 PCEQCGATT--------FDPSKSLTYATLPCDSSYCT----NDCGGYPDECWYNIRYTNG 176
PC +C T FD + S T + CD +C+ +D C Y+I Y +
Sbjct: 104 PCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADE 163
Query: 177 PDSQGTIGSEQFNFE-TSDEGKT--FLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATS 230
S G + E + + KT +V FGC + + D GV G G + +
Sbjct: 164 STSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNT 223
Query: 231 STHSLVEKVGSK---FSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLE 287
S S + G FS+C+ N+ + +G + +TPM Y V L
Sbjct: 224 SVLSQLAATGDAKRVFSHCLDNVK----GGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLM 279
Query: 288 GISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS 347
G+ + LD+ ++ + + G +DSGTTL + Y +L + + L + +
Sbjct: 280 GMDVDGTSLDLPRSIVR------NGGTIVDSGTTLAYFPKVLYDSLIETI--LARQPVKL 331
Query: 348 YPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDI 407
+ ++ + C+S + N D + FP ++F F L + + ++C +
Sbjct: 332 HIVEETFQ-CFSFSTNVD-EAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGL 389
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDL 432
+ ++ ++G + N V YDL
Sbjct: 390 TTDERSEVILLGDLVLSNKLVVYDL 414
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/366 (26%), Positives = 150/366 (40%), Gaps = 84/366 (22%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDC 160
V+ ++G PP VLDTGS L W+ C+ + FDP +S +Y+ +PC S C
Sbjct: 377 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNL-HSVFDPLRSSSYSPIPCTSPTC---- 431
Query: 161 GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFT 220
T KT T
Sbjct: 432 ------------------------------RTRTHSKT---------------------T 440
Query: 221 GVFGLGPATSSTHSLVEKVG-SKFSYCI------GNLNYFEYAYNMLILGEGAILEGDST 273
G+ G+ + S V ++G KFSYCI G L + E +++ L + L ST
Sbjct: 441 GLIGM---NRGSLSFVTQMGLQKFSYCISGQDSSGILLFGESSFSWLKALKYTPLVQIST 497
Query: 274 PMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQT 332
P+ D +Y V LEGI + ML + +++ + T + +DSGT T+L+ Y
Sbjct: 498 PLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA-GQTMVDSGTQFTFLLGPVYTA 556
Query: 333 LRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNI-NRDLQGFPAMAFHFAGGADLVLDA 386
L+ E + L P++ A LCY + R L P + F G A++ + A
Sbjct: 557 LKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRG-AEMSVSA 615
Query: 387 ESVFYQ------ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
E + Y+ S SV+C G S++ G + IIG QQN + +DL ++ F
Sbjct: 616 ERLMYRVPGVIRGSDSVYCFTFGNSELLG---VESYIIGHHHQQNVWMEFDLAKSRVGFA 672
Query: 441 RIDCEL 446
+ C+L
Sbjct: 673 EVRCDL 678
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 124/479 (25%), Positives = 186/479 (38%), Gaps = 77/479 (16%)
Query: 1 MPSSHAILLLSLITLPFTSTRIFTSTTAAPAAGKPKR-----LVTKLLHRDSLLY--NPN 53
+P SHA + ++ + +ST ++P P+R V +L HR +
Sbjct: 24 LPCSHAAAVAPGYVAVSAASFVPSSTCSSPDPVPPQRRNGTSAVLRLTHRHGPCAPSRAS 83
Query: 54 DTVDAQAQRTLNMSMARFIYLSQKSSQKAH---DTRAHLHPGISTVPV----------FY 100
TL R Y+ ++ S +A D++A +TVP +
Sbjct: 84 SLAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAA--AATVPASWGYDIGTLNYV 141
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------TFDPSKSLTYATLPCDSS 154
V S+G P V Q +DTGS L WV+C+PC + FDP++S +YA +PC
Sbjct: 142 VTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGP 201
Query: 155 YCTNDCGGY------PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
C G Y +C Y + Y +G ++ G S+ S + F FGC
Sbjct: 202 VCAG-LGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFF----FGCG 256
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLILGE 264
H + F GV GL SLVE+ G FSYC+ + + G
Sbjct: 257 HAQSGL----FNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGP 312
Query: 265 GAILEGDST----PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
G ST P Y V L GIS+G + L + + F T
Sbjct: 313 SGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG-------T 365
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGL-LPSYPMDPAWHLCYS----GNINRDLQGFPAMAFH 375
+T L P+AY LR P+ P + CY+ G + P +A
Sbjct: 366 VVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVT-----LPNVALT 420
Query: 376 FAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
F GA + L A+ + S CLA PS +G ++I+G + Q+++ V D S
Sbjct: 421 FGSGATVTLGADGIL-----SFGCLAFAPSGSDG----GMAILGNVQQRSFEVRIDGTS 470
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 150/374 (40%), Gaps = 49/374 (13%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------------TFDPSKSLTYA 147
Y IG P V + LDTGS L WV C C +C A+ ++P+ S T
Sbjct: 101 YTTVQIGTPGVKFMVALDTGSDLFWVPCD-CTRCAASDSTAFASDFDLNVYNPNGSSTSK 159
Query: 148 TLPCDSSYCTN--DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD--V 203
+ C++S CT+ C G C Y + Y + S I E T ++ L + V
Sbjct: 160 KVTCNNSLCTHRSQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANV 219
Query: 204 GFGCSH-NNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEYAYN 258
FGC + F D G+FGLG S S++ + G FS C G
Sbjct: 220 IFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFG-----RDGIG 274
Query: 259 MLILGEGAILEGDSTPMSVIDG--SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
+ G+ + D TP ++ +Y +T+ + +G ++D++
Sbjct: 275 RISFGDKGSFDQDETPFNLNPSHPTYNITVTQVRVGTTVIDVEFT------------ALF 322
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHF 376
DSGT+ T+LV Y L + Q + CY + + + P+++
Sbjct: 323 DSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSVSLTM 382
Query: 377 AGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
GG+ V D + +S V+CLAV S +L+IIG Y V +D
Sbjct: 383 GGGSHFAVYDPIIIISTQSELVYCLAVVKS-------AELNIIGQNFMTGYRVVFDREKL 435
Query: 436 QLYFQRIDCELLAD 449
L +++ DC + D
Sbjct: 436 VLGWKKFDCYDIED 449
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 149/374 (39%), Gaps = 49/374 (13%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------------TFDPSKSLTYA 147
Y IG P V + LDTGS L WV C C +C AT ++P+ S T
Sbjct: 97 YTTVQIGTPGVKFMVALDTGSDLFWVPCD-CTRCAATDSSAFASDFDLNVYNPNGSSTSK 155
Query: 148 TLPCDSSYCTN--DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD--V 203
+ C++S C + C G C Y + Y + S I E T ++ L + V
Sbjct: 156 KVTCNNSLCMHRSQCLGTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANV 215
Query: 204 GFGCSH-NNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEYAYN 258
FGC + F D G+FGLG S S++ + G FS C G
Sbjct: 216 IFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFG-----RDGIG 270
Query: 259 MLILGEGAILEGDSTPMSVIDG--SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
+ G+ + D TP ++ +Y +T+ + +G ++D+ +
Sbjct: 271 RISFGDKGSFDQDETPFNLNPSHPTYNITVTQVRVGTTLIDV------------EFTALF 318
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHF 376
DSGT+ T+LV Y L + Q + CY + + + P+++
Sbjct: 319 DSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSVSLTM 378
Query: 377 AGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
GG+ V D + +S V+CLAV + +L+IIG Y V +D
Sbjct: 379 GGGSHFAVYDPIIIISTQSELVYCLAV-------VKTAELNIIGQNFMTGYRVVFDREKL 431
Query: 436 QLYFQRIDCELLAD 449
L +++ DC + D
Sbjct: 432 VLGWKKFDCYDIED 445
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 161/389 (41%), Gaps = 55/389 (14%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG---------ATTFDPSKSLTYATLPC 151
V+ ++G PP VLDTGS L W+ C Q +F P S T+A +PC
Sbjct: 65 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPC 124
Query: 152 DSSYCTN-------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
S+ C++ C G +C ++ Y +G S G + ++ F G+
Sbjct: 125 GSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAV-----GEAPPLRSA 179
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILG 263
FGC + A+ S GL T S V + + +FSYCI + + +L+LG
Sbjct: 180 FGC-MSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSYCISDRD----DAGVLLLG 234
Query: 264 EGAI---------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
+ L + P+ D +Y V L GI +G K L I ++ + T +
Sbjct: 235 HSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQ- 293
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINRDLQG 368
+DSGT T+L+ AY L+ E + LL PS+ A C+ R
Sbjct: 294 TMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPS 353
Query: 369 --FPAMAFHFAGGADLVLDAESVFY------QESSSVFCLAVGPSDINGERFKDLSIIGM 420
P + F GA++ + + + Y + + V+CL G +D+ +IG
Sbjct: 354 ARLPPVTLLF-NGAEMSVAGDRLLYKVPGEHRGADGVWCLTFGNADMVP---LTAYVIGH 409
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDCELLAD 449
Q N V YDL ++ + C++ ++
Sbjct: 410 HHQMNLWVEYDLERGRVGLAPVKCDVASE 438
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 148/375 (39%), Gaps = 73/375 (19%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-----------------FDP 140
++Y N S+G PP L LDTGS L W+ C CG T + P
Sbjct: 101 LYYANVSVGTPPSSFLVALDTGSDLFWLPC----NCGTTCIRDLEDIGVPQSVPLNLYTP 156
Query: 141 SKSLTYATLPCDSSYC--TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
+ S T +++ C C + C C Y I Y+N + GT+ + + T DE T
Sbjct: 157 NASTTSSSIRCSDKRCFGSKKCSSPKSICPYQISYSNSTGTTGTLLQDVLHLATEDENLT 216
Query: 199 FLY-DVGFGCSHNNAHF--SDEQFTGVFGLGPATSSTHSLVEKV---GSKFSYC----IG 248
+ +V GC + GV GLG S SL+ K FS C IG
Sbjct: 217 PVKTNVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITADSFSMCFGRVIG 276
Query: 249 NLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKN 306
N+ + G+ + + TP + S Y + + G+S+G + LF K
Sbjct: 277 NVGRISF-------GDKGYTDQEETPFISVAPSTAYGLNVTGVSVGGD--PVGTRLFAK- 326
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYSGNINR 364
D+G++ T L+ AY L K +DL + P+DP + CY + N
Sbjct: 327 ---------FDTGSSFTHLMEPAYGVLTKSFDDLVED--KRRPVDPELPFEFCYDLSPNA 375
Query: 365 DLQGFPAMAFHFAGGADLVLD------AESVFYQESSSVFCLAVGPSDINGERFKDLSII 418
FP + F GG+ ++L+ + E + ++CL V K + +
Sbjct: 376 TSIEFPFVEMTFVGGSKIILNNPFFTARTQARHGEGNVMYCLGV---------LKSVGLK 426
Query: 419 GMIAQQNYNVAYDLV 433
+ QN+ Y +V
Sbjct: 427 INVIGQNFVAGYRIV 441
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 146/370 (39%), Gaps = 49/370 (13%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------------TFDPSKSLTYA 147
Y S+G P L LDTGS L WV C C +C T ++P S T
Sbjct: 104 YTTVSLGTPGKKFLVALDTGSDLFWVPCD-CSRCAPTEGTTYASDFELSIYNPKGSSTSR 162
Query: 148 TLPCDSSYCT--NDCGGYPDECWYNIRYTNGPDS-QGTIGSEQFNFETSDEGKTFLYD-V 203
+ CD+S C N C G C Y + Y + S G + + + T D + F+ V
Sbjct: 163 KVTCDNSLCAHRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFVEAYV 222
Query: 204 GFGCSH-NNAHFSD-EQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEYAYN 258
FGC F D G+FGLG S S++ K G FS C G
Sbjct: 223 TFGCGQVQTGSFLDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFG-----PDGIG 277
Query: 259 MLILGEGAILEGDSTP--MSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI 316
+ G+ + + TP ++ + +Y +T+ + +G ++D+ D
Sbjct: 278 RISFGDKGSPDQEETPFNLNALHPTYNITVTQVRVGTTLIDL------------DFTALF 325
Query: 317 DSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHF 376
DSGT+ T+LV Y + K Q + CY + + P+M+
Sbjct: 326 DSGTSFTYLVDPIYTNVLKSFHSQAQDSRRPPDSRIPFEFCYDMSPGENTSLIPSMSLTM 385
Query: 377 AGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
GG+ V D + +S ++C+AV R +L+IIG Y + +D
Sbjct: 386 KGGSQFPVYDPIIIISSQSELIYCMAV-------VRSAELNIIGQNFMTGYRIIFDREKL 438
Query: 436 QLYFQRIDCE 445
L ++ +C+
Sbjct: 439 VLGWKEFECD 448
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 101/403 (25%), Positives = 171/403 (42%), Gaps = 51/403 (12%)
Query: 81 KAHDTRAH--LHPGIS----------TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
KAHD R L G+ +V ++Y IG P +DTG+ ++WV C
Sbjct: 43 KAHDYRRQISLLTGVDLPLGGTGRPDSVGLYYAKIGIGTPSKDYYLQVDTGTDMMWVNCI 102
Query: 129 PCEQC--------GATTFDPSKSLTYATLPCDSSYCTNDCGGY--------PDECWYNIR 172
C++C T ++ +S + +PCD C GG D C Y
Sbjct: 103 QCKECPTRSNLGMDLTLYNIKESSSGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEI 162
Query: 173 YTNGPDSQGTIGSEQFNF-ETSDEGKTFLYD--VGFGC----SHNNAHFSDEQFTGVFGL 225
Y +G + G + F + S + KT + V FGC S + ++ ++E G+ G
Sbjct: 163 YGDGSSTAGYFVKDVVLFDQVSGDLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGF 222
Query: 226 GPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSY 282
G A S S + KV F++C+ +N + +G ++TP+ Y
Sbjct: 223 GKANYSMISQLSSSGKVKKMFAHCLNGVN----GGGIFAIGHVVQPTVNTTPLLPDQPHY 278
Query: 283 YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
V + I +G L++ + ++ D+ G IDSGTTL +L YQ L ++
Sbjct: 279 SVNMTAIQVGHTFLNLSTDASEQRDS---KGTIIDSGTTLAYLPDGIYQPLVYKILSQQP 335
Query: 343 GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL-VLDAESVFYQESSSVFCLA 401
L D YSG+++ GFP + F+F G L V + +F E +++C+
Sbjct: 336 NLKVQTLHDEYTCFQYSGSVD---DGFPNVTFYFENGLSLKVYPHDYLFLSE--NLWCIG 390
Query: 402 VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S K+++++G + N V YDL ++ + + +C
Sbjct: 391 WQNSGAQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNC 433
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 163/389 (41%), Gaps = 56/389 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP---CEQC--------GATTFDPSKSLTYA 147
+ ++ + G PP V+DTGSSL+W C C +C G TF P +S +
Sbjct: 92 YSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSN 151
Query: 148 TLPCDSSYCT-----------NDCGGYPDECW-----YNIRYTNGPDSQGTIGSEQFNFE 191
+ C + C+ +C C Y I+Y G + G + SE +F
Sbjct: 152 LIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLG-STAGLLLSETLDFP 210
Query: 192 TSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNL 250
FL GCS FS Q G+ G G S SL ++G KFSYC+ +
Sbjct: 211 HKKTIPGFL----VGCS----LFSIRQPEGIAGFG---RSPESLPSQLGLKKFSYCLVSH 259
Query: 251 NYFEY-AYNMLILGEGAILEGDST-----------PMSVIDGSYYVTLEGISLGEKMLDI 298
+ + A + L+L G+ + T P + YYV L I +G+ + +
Sbjct: 260 AFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVKV 319
Query: 299 DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY 358
P F + + G +DSGTT T++ Y+ + KE E + + L
Sbjct: 320 -PYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRP 378
Query: 359 SGNINRDLQ-GFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKD--L 415
NI+ + P FHF GGA + L + F S V CL + +++G
Sbjct: 379 CFNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVDSGVICLTIVSDNMSGSGIGGGPA 438
Query: 416 SIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
I+G Q+N++V +DL +++ F++ +C
Sbjct: 439 IILGNYQQRNFHVEFDLKNERFGFKQQNC 467
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 119/444 (26%), Positives = 185/444 (41%), Gaps = 67/444 (15%)
Query: 40 TKLLHRDSLL-----YNPNDTVDAQAQRTLNM-SMARFIYLSQKSSQKAHDTRAHLHP-- 91
T++L+R S L Y P V A +T+N+ S A F+ L + K+ R ++P
Sbjct: 62 TRVLNRASSLKVVNKYGPCIPVTG-APKTINVPSTAEFL-LQDQLRVKSFQVRLSMNPSS 119
Query: 92 GI-----STVPV--------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQC---G 134
G+ +T+P + V +G P DTGS L W +C+PC C
Sbjct: 120 GVFKEMQTTIPASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQN 179
Query: 135 ATTFDPSKSLTYATLPCDSSYCTNDC-GGYP------DECWYNIRYTNGPDSQGTIGSEQ 187
FDP+ S +Y + C S +C G YP + C Y I+Y +G + G + +E
Sbjct: 180 QPKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSG-YTIGFLATET 238
Query: 188 FNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYC 246
+SD K FL FGCS + + TG+ GLG + + S K + FSYC
Sbjct: 239 LAIASSDVFKNFL----FGCSE-ESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYC 293
Query: 247 IGNLNYFEYAYNMLILGEGAILEGDSTPMS-VIDGSYYVTLEGISLGEKMLDIDPNLFKK 305
L + L G STP+S + Y + GIS+ + L I+ ++ +
Sbjct: 294 ---LPASPSSTGHLSFGVEVSQAAKSTPISPKLKQLYGLNTVGISVRGRELPINGSISR- 349
Query: 306 NDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM---DPAWHLCYS-GN 361
IDSGTT T+L Y L F+ ++ +Y + ++ CY N
Sbjct: 350 --------TIIDSGTTFTFLPSPTYSALGSA----FREMMANYTLTNGTSSFQPCYDFSN 397
Query: 362 INRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGM 420
I P ++ F GG ++ +D + + CLA + + D +I G
Sbjct: 398 IGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLAFADTGSD----SDFAIFGN 453
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDC 444
Q+ Y V YD+ + F C
Sbjct: 454 YQQKTYEVIYDVAKGMVGFAPKGC 477
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 99/404 (24%), Positives = 169/404 (41%), Gaps = 49/404 (12%)
Query: 78 SSQKAHDTRAHLH---------PGIS---TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S KAHD + L GI + ++Y IG P +DTGS ++WV
Sbjct: 45 SDLKAHDDQRQLRILAGVDLPLGGIGRPDILGLYYAKIGIGTPTKDYYVQVDTGSDIMWV 104
Query: 126 KCQPCEQCGATT--------FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIR----- 172
C C +C T+ ++ ++S T +PCD +C GG C N+
Sbjct: 105 NCIQCRECPKTSSLGIDLTLYNINESDTGKLVPCDQEFCYEINGGQLPGCTANMSCPYLE 164
Query: 173 -YTNGPDSQGTIGSEQFNF-ETSDEGKTFLYD--VGFGC----SHNNAHFSDEQFTGVFG 224
Y +G + G + + S + KT + V FGC S + ++E G+ G
Sbjct: 165 IYGDGSSTAGYFVKDVVQYARVSGDLKTTAANGSVIFGCGARQSGDLGSSNEEALDGILG 224
Query: 225 LGPATSSTHS---LVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G + SS S + KV F++C+ N + ++G + + TP+
Sbjct: 225 FGKSNSSMISQLAVTGKVKKIFAHCLDGTN----GGGIFVIGHVVQPKVNMTPLIPNQPH 280
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V + + +G + L + ++F+ D G IDSGTTL +L Y+ L ++
Sbjct: 281 YNVNMTAVQVGHEFLSLPTDVFEAGDR---KGAIIDSGTTLAYLPEMVYKPLVSKIISQQ 337
Query: 342 QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL-VLDAESVFYQESSSVFCL 400
L D YS +++ GFP + FHF L V E +F E ++C+
Sbjct: 338 PDLKVHTVRDEYTCFQYSDSLD---DGFPNVTFHFENSVILKVYPHEYLFPFE--GLWCI 392
Query: 401 AVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S + ++++++G + N V YDL ++ + + +C
Sbjct: 393 GWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNC 436
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 99/391 (25%), Positives = 153/391 (39%), Gaps = 38/391 (9%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR +LS ++++ A I + P F V IG P L LDT + W+ C
Sbjct: 74 ARLQFLSSLVARRSFVPIASARQLIQS-PTFVVRAKIGTPAQTLLLALDTSNDAAWIPCS 132
Query: 129 PCEQCGATT-FDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTI 183
C C +TT F KS ++ LPC S C C G C +N+ Y
Sbjct: 133 GCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSG--SACGFNLTY---------- 180
Query: 184 GSEQFNFETSDEGKTFLYDV----GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV 239
GS + + T D FGC S + S
Sbjct: 181 GSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLY 240
Query: 240 GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKML 296
S FSYC+ + ++ ++ + + TP+ YYV L I +G K++
Sbjct: 241 QSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIV 300
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL 356
DI P+ N + + AG IDSGTT T LV AY +R E + + +
Sbjct: 301 DIPPSALAFN-SATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLG-GFDT 358
Query: 357 CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV--GPSDINGERFKD 414
CY+ I P + F FAG + + + + S CLA+ P ++N
Sbjct: 359 CYTVPIIS-----PTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSV---- 409
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
L++I + QQN+ + +D+ + ++ R C
Sbjct: 410 LNVIASMQQQNHRILFDIPNSRVGVARESCS 440
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 168/382 (43%), Gaps = 55/382 (14%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQ---------PCEQCGATTFDPSKSLTYATLPC 151
V+ IG PP P VLDTGS L W++C P + +FDPS S +++ LPC
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLLPC 127
Query: 152 DSSYCTNDCGGYP-----DE---CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
+ C + D+ C Y+ Y +G ++G + E+F F S +
Sbjct: 128 NHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVI--- 184
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI---------------G 248
GC+ + + G+ G+ S S + SKFSYC+
Sbjct: 185 -LGCAQ-----ASTENRGILGMNHGRLSFISQAKI--SKFSYCVPSRTGSNPTGLFYLGD 236
Query: 249 NLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
N N ++ Y ++ E S+P ++ +Y + ++ I + K L+I P FK D
Sbjct: 237 NPNSSKFKYVTML----TFPESQSSP-NLDPLAYTLPMKAIKIAGKRLNIPPAAFKP-DA 290
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-YPMDPAWHLCYSGNINRDL- 366
IDSG+ LT+LV AY+ +++EV L ++ Y +C+ + ++
Sbjct: 291 GGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVG 350
Query: 367 QGFPAMAFHFAGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
+ ++F F G ++ V E V + V C+ +G S+ G +IIG + QQN
Sbjct: 351 RRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLG---IGSNIIGTVHQQN 407
Query: 426 YNVAYDLVSKQLYFQRIDCELL 447
V YDL +K++ F +C L
Sbjct: 408 MWVEYDLANKRVGFGGAECSRL 429
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 141/363 (38%), Gaps = 37/363 (10%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSY 155
P F V IG P L LDT + W+ C C C +TT F KS ++ LPC S
Sbjct: 24 PTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQ 83
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV----GFGC 207
C C G C +N+ Y GS + + T D FGC
Sbjct: 84 CNQVPNPSCSG--SACGFNLTY----------GSSTVAADLVQDNLTLATDSVPSYTFGC 131
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAI 267
S + S S FSYC+ + ++ ++ +
Sbjct: 132 IRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQP 191
Query: 268 LEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
+ TP+ YYV L I +G K++DI P+ N + + AG IDSGTT T
Sbjct: 192 IRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFN-SATGAGTVIDSGTTFTR 250
Query: 325 LVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVL 384
LV AY +R E + + + CY+ I P + F FAG +
Sbjct: 251 LVAPAYTAVRDEFRRRVGRNVTVSSLG-GFDTCYTVPIIS-----PTITFMFAGMNVTLP 304
Query: 385 DAESVFYQESSSVFCLAV--GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
+ + S S CLA+ P ++N L++I + QQN+ + +D+ + ++ R
Sbjct: 305 PDNFLIHSTSGSTTCLAMAAAPDNVNSV----LNVIASMQQQNHRILFDIPNSRVGVARE 360
Query: 443 DCE 445
C
Sbjct: 361 SCS 363
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 104/423 (24%), Positives = 161/423 (38%), Gaps = 75/423 (17%)
Query: 57 DAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV-----------FYVNFSI 105
+A+ +TL AR YLS L G S VP+ + V I
Sbjct: 58 EARVLQTLAQDQARLQYLS------------SLVAGRSVVPIASGRQMLQSTTYIVKALI 105
Query: 106 GQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-TTFDPSKSLTYATLPCDSSYCTN----DC 160
G P P L +DT S + W+ C C C + T F P+KS ++ + C + C C
Sbjct: 106 GTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPTC 165
Query: 161 GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS----- 215
G C +N+ Y G S S+ +D K F FGC + A
Sbjct: 166 GA--RACSFNLTY--GSSSIAANLSQDTIRLAADPIKAFT----FGCVNKVAGGGTIPPP 217
Query: 216 DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM 275
G S S+ + S FSYC+ + ++ ++ L S P
Sbjct: 218 QGLLGLGRGPLSLMSQAQSIYK---STFSYCLPSFRSLTFSGSLR-------LGPTSQPQ 267
Query: 276 SVI----------DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
V YYV L I +G K++D+ P N + + AG DSGT T L
Sbjct: 268 RVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPS-TGAGTIFDSGTVYTRL 326
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
Y+ +R E + + CYSG + P + F F G ++ +
Sbjct: 327 AKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVK-----VPTITFMFK-GVNMTMP 380
Query: 386 AESVFYQE---SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
A+++ S+S +A P ++N +++I + QQN+ V D+ + +L R
Sbjct: 381 ADNLMLHSTAGSTSCLAMAAAPENVNSV----VNVIASMQQQNHRVLIDVPNGRLGLARE 436
Query: 443 DCE 445
C
Sbjct: 437 RCS 439
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 100/393 (25%), Positives = 150/393 (38%), Gaps = 44/393 (11%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR Y S ++K+ A I + P + V G PP L LDT S W+ C
Sbjct: 68 ARMQYFSSLVARKSVVPIASARQIIQS-PTYIVKAKFGTPPQTLLLALDTSSDAAWIPCS 126
Query: 129 PCEQCGATT-FDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTI 183
C C + F P KS ++ + C S +C CGG C +N Y
Sbjct: 127 GCVGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGG--SACAFNFTY---------- 174
Query: 184 GSEQFNFETSDEGKTFLYD----VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV 239
GS + T D FGC + S Q + S
Sbjct: 175 GSSSIAASVVQDTLTLATDPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLY 234
Query: 240 GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS---TPM---SVIDGSYYVTLEGISLGE 293
S FSYC+ + ++ ++ + G + + TP+ YYV L I +G
Sbjct: 235 KSTFSYCLPSFKSINFSGSLRL---GPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGR 291
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA 353
K++DI P N T + AG DSGT T L Y +R E LP +
Sbjct: 292 KIVDIPPAALAFNPT-TGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLG-G 349
Query: 354 WHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV--GPSDINGER 411
+ CY+ I P + F F+G + V + + S CLA+ P ++N
Sbjct: 350 FDTCYNVPI-----VVPTITFLFSGMNVTLPPDNIVIHSTAGSTTCLAMAGAPDNVNSV- 403
Query: 412 FKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
L++I + QQN+ V +D+ + ++ R C
Sbjct: 404 ---LNVIANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 146/364 (40%), Gaps = 32/364 (8%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA--------TTFDPSKSLTYATLPCDSSYC 156
IG P +DTGS +WV C C C T +DP+ S T +PCD +C
Sbjct: 81 IGLGPNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCDDEFC 140
Query: 157 TNDCGG------YPDECWYNIRYTNGPDSQGTIGSEQFNFE-------TSDEGKTFLYDV 203
T+ G C Y+I Y +G + G+ + F+ T + + ++
Sbjct: 141 TSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGC 200
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNML 260
G S + +D G+ G G A SS S + KV FS+C+ +N +
Sbjct: 201 GSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVN----GGGIF 256
Query: 261 ILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
+GE + +TP+ Y V L+ I + + + ++F D+ S G IDSGT
Sbjct: 257 AIGEVVQPKVKTTPLVPRMAHYNVVLKDIEVAGDPIQLPTDIF---DSTSGRGTIIDSGT 313
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
TL +L S Y L ++ G+ D YS + D FP + F F G
Sbjct: 314 TLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFTCFHYSDEKSLD-DAFPTVKFTFEEGL 372
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
L + ++C+ S + KDL ++G + N YDL + + +
Sbjct: 373 TLTAYPHDYLFPFKEDMWCIGWQKSTAQTKDGKDLILLGDLVLTNKLFIYDLDNMSIGWT 432
Query: 441 RIDC 444
+C
Sbjct: 433 DYNC 436
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 110/404 (27%), Positives = 173/404 (42%), Gaps = 75/404 (18%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ-------PCEQCGATTFDPSKSLTYA 147
TVPV ++G PP VLDTGS L W++C P Q A F+ S S TYA
Sbjct: 63 TVPV-----AVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPA-AFNGSASSTYA 116
Query: 148 TLPCDSSYCT---ND------CGGYP-DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK 197
C S C D C G P + C ++ Y + + G + ++ F + +
Sbjct: 117 AAHCSSPECQWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLLGGAPPVR 176
Query: 198 TFLYDVGFGC--SHNNAHFSD----EQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNL 250
FGC S+++A ++ E TG+ G+ + S V + + +F+YCI
Sbjct: 177 AL-----FGCVTSYSSATATNSSDSEAATGLLGM---NRGSLSFVTQTATLRFAYCIAPG 228
Query: 251 NYFEYAYNMLIL-GEGAILEGD---------STPMSVIDG-SYYVTLEGISLGEKMLDID 299
+ +L+L G+GA L S P+ D +Y V LEGI +G +L I
Sbjct: 229 D----GPGLLVLGGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIP 284
Query: 300 PNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-----YPMDPAW 354
++ + T + +DSGT T+L+ AY L+ E + LL + A+
Sbjct: 285 KSVLAPDHTGAGQ-TMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAF 343
Query: 355 HLCYSGNINRDL---QGFPAMAFHFAGGADLVLDAESVFYQ---------ESSSVFCLAV 402
C+ + R Q P + GA++ + E + Y+ + +V+CL
Sbjct: 344 DACFRASEARVAAASQMLPEVGLVLR-GAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTF 402
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
G SD+ G +IG QQN V YDL + ++ F C+L
Sbjct: 403 GNSDMAG---MSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCDL 443
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 100/393 (25%), Positives = 150/393 (38%), Gaps = 44/393 (11%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR Y S ++K+ A I + P + V G PP L LDT S W+ C
Sbjct: 68 ARMQYFSSLVARKSVVPIASARQIIQS-PTYIVKAKFGTPPQTLLLALDTSSDAAWIPCS 126
Query: 129 PCEQCGATT-FDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDSQGTI 183
C C + F P KS ++ + C S +C CGG C +N Y
Sbjct: 127 GCVGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGG--SACAFNFTY---------- 174
Query: 184 GSEQFNFETSDEGKTFLYD----VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV 239
GS + T D FGC + S Q + S
Sbjct: 175 GSSSIAASVVQDTLTLAADPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLY 234
Query: 240 GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS---TPM---SVIDGSYYVTLEGISLGE 293
S FSYC+ + ++ ++ + G + + TP+ YYV L I +G
Sbjct: 235 KSTFSYCLPSFKSINFSGSLRL---GPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGR 291
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA 353
K++DI P N T + AG DSGT T L Y +R E LP +
Sbjct: 292 KIVDIPPAALAFNPT-TGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLG-G 349
Query: 354 WHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV--GPSDINGER 411
+ CY+ I P + F F+G + V + + S CLA+ P ++N
Sbjct: 350 FDTCYNVPI-----VVPTITFLFSGMNVALPPDNIVIHSTAGSTTCLAMAGAPDNVNSV- 403
Query: 412 FKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
L++I + QQN+ V +D+ + ++ R C
Sbjct: 404 ---LNVIANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 112/453 (24%), Positives = 178/453 (39%), Gaps = 83/453 (18%)
Query: 43 LHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYV- 101
LHR +P++ L AR Y+ +K++ + D P + + + ++
Sbjct: 71 LHRPYGPCSPSEGTPPSLVEMLRWDQARTDYVRRKATGEVDDVLEPDRPHVDMMQMDFML 130
Query: 102 --NFSIGQPP----------------VPQLAVLDTGSSLIWVKCQPC--EQCGATT---F 138
F IG + Q +DT + W++C PC QC F
Sbjct: 131 RGTFGIGSGSGYGAVIDGDDDDDPMILSQTMAIDTTEDVPWIQCLPCLIPQCYPQRNAFF 190
Query: 139 DPSKSLTYATLPCDSSYCTNDCGGYPD---------ECWYNIRYTNGPDSQGTIGSEQFN 189
DP +S T A + C S C GGY + +C Y I Y+ D + T+G+ +
Sbjct: 191 DPRRSSTGAPVRCGSRAC-RTLGGYANGCSKPNSTGDCLYRIEYS---DHRLTLGTYMTD 246
Query: 190 FETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIG 248
T TFL + FGCSH Q +G LG S S + G+ FSYC+
Sbjct: 247 TLTISPSTTFL-NFRFGCSHAVRGKFSAQASGTMSLGGGPQSLLSQTARAYGNAFSYCVP 305
Query: 249 NLNYFEYAYNMLILGEGAILEGD---------STPM----SVIDGSYYVT-LEGISLGEK 294
+ + L G + GD +TP+ +VI+ + YV L+GI + +
Sbjct: 306 GPSAAGF------LSIGGPVNGDDGGGSGAFATTPLVRSANVINPTIYVVRLQGIEVAGR 359
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLR---KEVEDLFQGLLPSYPMD 351
L++ P +F G +DS +T L P+AY+ LR + ++ P+ +D
Sbjct: 360 RLNVPPVVFS-------GGTVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTGNLD 412
Query: 352 PAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGER 411
+ + P ++ F GGA + L SV CLA P +
Sbjct: 413 TCFDFVGVSKVT-----VPTVSLVFDGGAVIELGLLSVLLDS-----CLAFAPMAAD--- 459
Query: 412 FKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
L IG + QQ + V YD+ + F+ C
Sbjct: 460 -FALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 109/407 (26%), Positives = 164/407 (40%), Gaps = 72/407 (17%)
Query: 76 QKSSQKAHDTRAHLHPGISTVPV---FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ---P 129
Q +S + T G ST + Y N +IG P L LDTGS L W+ C
Sbjct: 63 QLTSNNNNQTTISFAQGNSTEEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNST 122
Query: 130 C---------EQCGATTFDPSKSLTYATLPCDSSYCT--NDCGGYPDECWYNIRYTN-GP 177
C E+ ++PSKS + + + C+S+ C N C +C Y IRY + G
Sbjct: 123 CVRSMETDQGERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGS 182
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN-AHFSDEQFTGVFGLGPATSSTHSLV 236
S G + + + T +EG+ + FGCS + F + G+ GL A + +++
Sbjct: 183 KSTGVLVEDVIHMST-EEGEARDARITFGCSESQLGLFKEVAVNGIMGLAIADIAVPNML 241
Query: 237 EKVG---SKFSYCIGNLNYFEYAYNMLILGEGAILEGDS-------TPMS-VIDGSYY-V 284
K G FS C G G+G I GD TP+S I +Y V
Sbjct: 242 VKAGVASDSFSMCFGP------------NGKGTISFGDKGSSDQLETPLSGTISPMFYDV 289
Query: 285 TLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL 344
++ +G+ +D ++ DSGT +TWL+ Y L
Sbjct: 290 SITKFKVGKVTVD------------TEFTATFDSGTAVTWLIEPYYTALTTNFHLSVPDR 337
Query: 345 LPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGAD-------LVLDAESVFYQESSSV 397
S +D + CY D P+++F GGA LV D +Q V
Sbjct: 338 RLSKSVDSPFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGSFQ----V 393
Query: 398 FCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+CLAV +N D SIIG NY + +D + L +++ +C
Sbjct: 394 YCLAV-LKQVNA----DFSIIGQNFMTNYRIVHDRERRILGWKKSNC 435
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 104/423 (24%), Positives = 161/423 (38%), Gaps = 75/423 (17%)
Query: 57 DAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV-----------FYVNFSI 105
+A+ +TL AR YLS L G S VP+ + V I
Sbjct: 74 EARVLQTLAQDQARLQYLS------------SLVAGRSVVPIASGRQMLQSTTYIVKALI 121
Query: 106 GQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-TTFDPSKSLTYATLPCDSSYCTN----DC 160
G P P L +DT S + W+ C C C + T F P+KS ++ + C + C C
Sbjct: 122 GTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPTC 181
Query: 161 GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS----- 215
G C +N+ Y G S S+ +D K F FGC + A
Sbjct: 182 GA--RACSFNLTY--GSSSIAANLSQDTIRLAADPIKAFT----FGCVNKVAGGGTIPPP 233
Query: 216 DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM 275
G S S+ + S FSYC+ + ++ ++ L S P
Sbjct: 234 QGLLGLGRGPLSLMSQAQSIYK---STFSYCLPSFRSLTFSGSLR-------LGPTSQPQ 283
Query: 276 SVI----------DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
V YYV L I +G K++D+ P N + + AG DSGT T L
Sbjct: 284 RVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPS-TGAGTIFDSGTVYTRL 342
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
Y+ +R E + + CYSG + P + F F G ++ +
Sbjct: 343 AKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVK-----VPTITFMFK-GVNMTMP 396
Query: 386 AESVFYQE---SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
A+++ S+S +A P ++N +++I + QQN+ V D+ + +L R
Sbjct: 397 ADNLMLHSTAGSTSCLAMAAAPENVNSV----VNVIASMQQQNHRVLIDVPNGRLGLARE 452
Query: 443 DCE 445
C
Sbjct: 453 RCS 455
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 104/446 (23%), Positives = 182/446 (40%), Gaps = 81/446 (18%)
Query: 61 QRTLNMSMARFIYLSQKSSQKAHD------TRAHLHPGIST------------VPV---F 99
+R+L++ +AR + ++ H+ R+ PG++ VP +
Sbjct: 29 RRSLHLELARVDDAAAAANLTDHELIRRAVQRSLDRPGVAARNRKAVVGEAPLVPRGGEY 88
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSYC 156
V IG P A +DT S L+W++CQPC C F+P S +YA +PC S C
Sbjct: 89 LVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSDTC 148
Query: 157 TNDCGGYPDE-----CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
+ G DE C YN +Y+ + GT+ ++ G + V GCS ++
Sbjct: 149 SQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAV-----GGNVFHAVVLGCSDSS 203
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA---IL 268
Q +G+ GL A L + +F YC+ L+LG GA +
Sbjct: 204 VGGPPPQASGLVGL--ARGPLSLLSQLSVRRFMYCLP--PPMSRTPGKLVLGAGAGADAV 259
Query: 269 EGDSTPMSVIDGS-------YYVTLEGISLGEKMLDIDPNLFKKNDT------------- 308
S ++V S YY+ +G+++G++ P ++ +
Sbjct: 260 RNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQT----PGTIRRPTSPPATGGGVGGGGG 315
Query: 309 -----WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGN 361
+ G+ +D +T+++L S Y L ++E+ + + LC+
Sbjct: 316 DGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEIRLPRATPSTRLGLDLCFILPEG 375
Query: 362 INRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMI 421
+ D P ++ F G L L+ + +F E + CL +G R +SI+G
Sbjct: 376 VGIDRVYVPTVSMSFDGRW-LELERDRLFL-EDGRMMCLMIG-------RTSGVSILGNY 426
Query: 422 AQQNYNVAYDLVSKQLYFQRIDCELL 447
QQN +V Y+L ++ F + C+ L
Sbjct: 427 QQQNMHVLYNLRRGKITFAKASCDSL 452
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 102/392 (26%), Positives = 162/392 (41%), Gaps = 60/392 (15%)
Query: 92 GISTVPVFY-------VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPS 141
G S VP+ + NF+IG PP P A++D L+W +C C +C F P+
Sbjct: 29 GGSAVPIHWSRHLYNVANFTIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPN 88
Query: 142 KSLTYATLPCDSSYC----TNDCGGYPDECWY----NIRYTNGPDSQGTIGSEQFNFETS 193
S T+ PC + C T++C G D C Y NIR + + G +G+E F T+
Sbjct: 89 ASSTFRPEPCGTDACKSTPTSNCSG--DVCTYESTTNIRL-DRHTTLGIVGTETFAIGTA 145
Query: 194 DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNY 252
+ FGC + + + +G GLG + SLV ++ +KFSYC+
Sbjct: 146 TA------SLAFGCVVASDIDTMDGTSGFIGLG---RTPRSLVAQMKLTKFSYCLSPRGT 196
Query: 253 FEYAYNMLILGEGAILEGDSTPMSV--IDGS--------YYVTLEGISLGEKMLDIDPNL 302
+ + L LG A L G + + I S Y ++L+ I G +
Sbjct: 197 GK--SSRLFLGSSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTI------ 248
Query: 303 FKKNDTWSDAGVFI-DSGTTLTWLVPSAYQTLRKEVEDLFQG--LLPSYPMDPAWHLCYS 359
T G+ + + + + LV SAY+ +K V + G P + LC+
Sbjct: 249 ----ATAQSGGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFK 304
Query: 360 GNINRDLQGFPAMAFHF-AGGADLVLDAESVFYQ--ESSSVFCLAV-GPSDINGERFKDL 415
P + F F GGA L + E C A+ + +N + +
Sbjct: 305 KAAGFSRATAPDLVFTFQGGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGV 364
Query: 416 SIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
S++G + Q+N + YDL + L F+ DC L
Sbjct: 365 SVLGSLQQENVHFLYDLKKETLSFEPADCSSL 396
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 113/431 (26%), Positives = 174/431 (40%), Gaps = 90/431 (20%)
Query: 81 KAHDTRAHLHPG-ISTV--PVFYVNFSI----GQPPVPQLAVLDTGSSLIWVKCQP---C 130
+AH + H +P + T+ P Y +SI G PP VLDTGSSL+W+ C C
Sbjct: 191 RAHHLKNHNNPSSLKTLVHPKTYGGYSIDLKFGTPPQTFPFVLDTGSSLVWLPCYSHYLC 250
Query: 131 EQCGATT------FDPSKSLTYATLPCDSSYC--------TNDCGGYPDECW-------- 168
+C + + F P S + + C + C T+ C +
Sbjct: 251 SKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFGSDVTSHCCKLAKAAFSNNNNCSQ 310
Query: 169 ----YNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFG 224
Y ++Y G + G + SE NF + + D GCS + + Q G+ G
Sbjct: 311 TCPAYTVQYGLG-STAGFLLSENLNFPAKN-----VSDFLVGCSVVSVY----QPGGIAG 360
Query: 225 LGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLIL------GEG---------AILE 269
G S + + ++FSYC+ + + E N ++ GEG A L+
Sbjct: 361 FGRGEESLPAQMNL--TRFSYCLLSHQFDESPENSDLVMEATNSGEGKKTNGVSYTAFLK 418
Query: 270 GDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL---- 325
ST YY+TL I +GEK + + P + D D G +DSG+TLT++
Sbjct: 419 NPSTKKPAFGAYYYITLRKIVVGEKRVRV-PRRMLEPDVNGDGGFIVDSGSTLTFMERPI 477
Query: 326 --------VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
V T +E+E F GL P C+ + FP M F F
Sbjct: 478 FDLVAEEFVKQVNYTRARELEKQF-GLSP----------CFVLAGGAETASFPEMRFEFR 526
Query: 378 GGADLVLDAESVFYQESSS-VFCLAVGPSDINGE--RFKDLSIIGMIAQQNYNVAYDLVS 434
GGA + L + F + V CL + D+ G+ I+G QQN+ V DL +
Sbjct: 527 GGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPAVILGNYQQQNFYVECDLEN 586
Query: 435 KQLYFQRIDCE 445
++ F+ C+
Sbjct: 587 ERFGFRSQSCQ 597
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 163/365 (44%), Gaps = 54/365 (14%)
Query: 116 LDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYATLPCDSSYCTN-------DC 160
+DTGS ++WV C C C ++ FD S T A +PC CT+ +C
Sbjct: 85 IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDLICTSGVQGAAAEC 144
Query: 161 GGYPDECWYNIRYTNGPDSQGTIGSEQFNFET---SDEGKTFLYDVGFGCSHNNA---HF 214
++C Y +Y +G + G S+ F + FGCS + +
Sbjct: 145 SPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIVFGCSISQSGDLTK 204
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCI-GNLNYFEYAYNMLILGEGAILEG 270
+D+ G+FG GP S S + G FS+C+ G+ N +L+LGE ILE
Sbjct: 205 TDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGN----GGGILVLGE--ILEP 258
Query: 271 DS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+P+ Y + L+ I++ + L I+P +F ++ + G +D GTTL +L+
Sbjct: 259 SIVYSPLVPSQPHYNLNLQSIAVNGQPLPINPAVFSISN--NRGGTIVDCGTTLAYLIQE 316
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQGFPAMAFHFAGGADLVLDA 386
AY L + + + + CY S +I D+ FP ++ +F GGA +VL
Sbjct: 317 AYDPLVTAINTAVSQ--SARQTNSKGNQCYLVSTSIG-DI--FPLVSLNFEGGASMVLKP 371
Query: 387 ESVF----YQESSSVFCLAVGPSDINGERFKD-LSIIGMIAQQNYNVAYDLVSKQLYFQR 441
E Y + + ++C+ ++ ++ SI+G + ++ V YD+ +++ +
Sbjct: 372 EQYLMHNGYLDGAEMWCVGF-------QKLQEGASILGDLVLKDKIVVYDIAQQRIGWAN 424
Query: 442 IDCEL 446
DC L
Sbjct: 425 YDCSL 429
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 144/369 (39%), Gaps = 48/369 (13%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYC 156
P + V IG PP L +DT + W+ C C+ C +T F P KS T+ + C S C
Sbjct: 95 PTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCTSTLFAPEKSTTFKNVSCGSPEC 154
Query: 157 TN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD----VGFGCS 208
CG C +N+ Y GS + T D FGC
Sbjct: 155 NKVPSPSCGT--SACTFNLTY----------GSSSIAANVVQDTVTLATDPIPGYTFGCV 202
Query: 209 HNNAHFS---DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEG 265
S G S T +L + S FSYC+ + ++ ++ +
Sbjct: 203 AKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQ---STFSYCLPSFKSLNFSGSLRLGPVA 259
Query: 266 AILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+ TP+ YYV L I +G K++DI P N + AG DSGT
Sbjct: 260 QPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAA-TGAGTVFDSGTVF 318
Query: 323 TWLVPSAYQTLRKE----VEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAG 378
T LV Y +R E V + L + + CY+ I P + F F+
Sbjct: 319 TRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLG-GFDTCYTVPIVA-----PTITFMFS- 371
Query: 379 GADLVLDAESVFYQE---SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSK 435
G ++ L +++ S+S +A P ++N L++I + QQN+ V YD+ +
Sbjct: 372 GMNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSV----LNVIANMQQQNHRVLYDVPNS 427
Query: 436 QLYFQRIDC 444
+L R C
Sbjct: 428 RLGVARELC 436
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 153/375 (40%), Gaps = 46/375 (12%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDS 153
P + NF+IG PP P A++D L+W +C C +C F P+ S T+ PC +
Sbjct: 43 PYYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGT 102
Query: 154 SYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
+ C T C G D C Y GP +Q + F + T + FGC
Sbjct: 103 AVCESIPTRSCSG--DVCSY-----KGPPTQLRGNTSGFAATDTFAIGTATVRLAFGCVV 155
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAIL 268
+ + + +G GLG + SLV ++ ++FSYC+ N + + L LG A L
Sbjct: 156 ASDIDTMDGPSGFIGLG---RTPWSLVAQMKLTRFSYCLSPRNTGK--SSRLFLGSSAKL 210
Query: 269 EGDSTPMSVI--------DGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI-D 317
G + + DGS Y ++L+ I G + T G+ +
Sbjct: 211 AGSESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTI----------ATAQSGGILVMH 260
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQG--LLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
+ + + LV SAY+ +K V + G P + LC+ P + F
Sbjct: 261 TVSPFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFT 320
Query: 376 FAGGADLVLDAESVFYQ--ESSSVFCLAV-GPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
F G A L + E C A+ + +N + +S++G + Q++ + YDL
Sbjct: 321 FQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDL 380
Query: 433 VSKQLYFQRIDCELL 447
+ L F+ DC L
Sbjct: 381 KKETLSFEPADCSSL 395
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 118/470 (25%), Positives = 176/470 (37%), Gaps = 95/470 (20%)
Query: 40 TKLLHR-----DSLLYNPNDTVDAQAQRTLNMSMARFIYLS-------QKSSQKAHDTRA 87
+KL+HR SLL + ND V +Q N F YL ++ K
Sbjct: 26 SKLIHRFSEEAKSLLISGNDNVSSQTWPNKN----SFQYLQLLLDNDLKRQKMKLGAQNQ 81
Query: 88 HLHPGISTVPVFYVN---------FSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA--- 135
L P + + FY N IG P V L LD GS L WV C C QC
Sbjct: 82 LLFPSLGSHTFFYGNDLDWLHYTWIDIGTPNVSFLVALDAGSDLSWVPCD-CIQCAPLSA 140
Query: 136 ----------TTFDPSKSLTYATLPCDSSYCT--NDCGGYPDECWYNIRYTN-GPDSQGT 182
+ + PS S T L C+ C + C D C Y Y + S G
Sbjct: 141 SLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCELGSHCKNLKDPCPYIADYADPNTSSSGF 200
Query: 183 IGSEQFNF-----ETSDEGKTFLYDVGFGCSHNN--AHFSDEQFTGVFGLGPATSSTHSL 235
+ + + +++ K V GC + GV GLGP + S SL
Sbjct: 201 LVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSISVPSL 260
Query: 236 VEKVG---SKFSYCIGNLNYFEYAYNMLILGEGAILEGD-------STPMSVIDG---SY 282
+ K G FS C + G G IL GD STP+ G +Y
Sbjct: 261 LAKAGLIRKSFSLCFD------------VNGSGTILFGDQGHTSQKSTPLLPTQGNYDAY 308
Query: 283 YVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ 342
+ +E +G L S +DSG + T+L Y + E +
Sbjct: 309 LIEVESYCVGNSCLK-----------QSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVN 357
Query: 343 GLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCL 400
S P W+ CY+ + ++ L PAM F L++ + + ++ +VFCL
Sbjct: 358 AQRISSQGGP-WNYCYNTS-SKQLDNVPAMRLSFLMNQSLLIHNSTYYVPQNQEFAVFCL 415
Query: 401 AVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLADD 450
+ P+D+N IIG Y V +D+ + +L + +C+ ++D+
Sbjct: 416 TLQPTDLN------YGIIGQNYMTGYRVVFDMENLKLGWSSSNCKDISDE 459
>gi|357449519|ref|XP_003595036.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|87162831|gb|ABD28626.1| Peptidase M, neutral zinc metallopeptidases, zinc-binding site;
Peptidase aspartic, catalytic [Medicago truncatula]
gi|355484084|gb|AES65287.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 217
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/173 (36%), Positives = 87/173 (50%), Gaps = 19/173 (10%)
Query: 20 TRIFTSTTAAPAAGKPKRLVTKLLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQ 76
T F+ + AP+ KP+R V+KL+H S+ YNPN+TV+ + + S R +
Sbjct: 32 TSTFSGNSLAPSTSKPRRFVSKLIHPHSIHHPHYNPNETVEDWIKLDIEYSHTRLSFFKA 91
Query: 77 K---SSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC 133
+ S +D R HL P + VN SIGQPP+PQL ++DT SS+ W C PC C
Sbjct: 92 RIEGSLDSNNDYRTHLSPSPKGASIL-VNLSIGQPPIPQLLIMDTASSIFWTMCTPCPNC 150
Query: 134 ---GATTFDPSKSLTYATL---PCDSSYCTNDCGGYPDECWYNIRYTNGPDSQ 180
FDPSKS TY PC S C +C D+ Y + Y + S+
Sbjct: 151 IQHPGQIFDPSKSSTYVPTCKEPCYSKDC--EC----DQLTYTVTYADESSSK 197
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 111/390 (28%), Positives = 162/390 (41%), Gaps = 52/390 (13%)
Query: 99 FYVNFSIGQPPVPQLAV-LDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSS 154
+ ++ SIG P ++A+ LDTGS L+W +C C C A TFD S T +PC
Sbjct: 100 YLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-CHVCFAQPFPTFDALASQTTLAVPCSDP 158
Query: 155 YCTNDCGGYP--------DECWYNIRYTNGPDSQGTIGSEQFNFETSD-------EGKTF 199
CT+ G YP + C+Y Y + + G I + F F +
Sbjct: 159 ICTS--GKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVA 216
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLN-------Y 252
+ +V FGC N +G+ G S S ++ ++FS+C + +
Sbjct: 217 VPNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQLKV--ARFSHCFTAIADARTSPVF 274
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGS-YYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
A LG A STP + +GS YY+TL+GI++G+ L ++ F T S
Sbjct: 275 LGGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAFAGKGTGSG 334
Query: 312 AGV-FIDSGTTLTWLVPSAYQTLRKE-VEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
+G IDSGT + L Y++LR V + + D LC+ + L
Sbjct: 335 SGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAESTLCFEAARSASLPPE 394
Query: 370 PA------MAFHFAGGADLVLDAESVFYQ------ESSSVFCLAVGPSDINGERFKDLSI 417
+ H AG AD L ES S S CL +N DL+I
Sbjct: 395 APAPALPKVVLHVAG-ADWDLPRESYVLDLLEDEDGSGSGLCLV-----MNSAGDSDLTI 448
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
IG QQN +VAYDL +L F C+ +
Sbjct: 449 IGNFQQQNMHVAYDLEKNKLVFVPARCDKM 478
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/390 (23%), Positives = 151/390 (38%), Gaps = 36/390 (9%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPG--ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVK 126
+R +YL + + A + G + P + V S+G PP L +DT + W+
Sbjct: 80 SRLLYLDSLAVRGRARAYAPIASGRQLLQTPTYVVRASLGTPPQQLLLAVDTSNDASWIP 139
Query: 127 CQPCEQC---GATTFDPSKSLTYATLPCDSSYCTN----DCGGYPDECWYNIRYTNGPDS 179
C C C A FDP+ S +Y T+PC S C C C +++ Y DS
Sbjct: 140 CAGCAGCPTSSAAPFDPASSASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYA---DS 196
Query: 180 QGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV 239
Q + + + FGC + + S +
Sbjct: 197 SLQAALSQDSLAVAGNA---VKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMY 253
Query: 240 GSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSV---IDGSYYVTLEGISLGEKML 296
+ FSYC+ + ++ + + G +TP+ YYV + GI +G K++
Sbjct: 254 EATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVV 313
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL 356
I D + AG +DSGT T LV AY +R EV + S +
Sbjct: 314 PI-----PAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSL---GGFDT 365
Query: 357 CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAV--GPSDINGERFKD 414
C+ N +P + F G + + V + ++ CLA+ P +N
Sbjct: 366 CF----NTTAVAWPPVTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVN----TV 417
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
L++I + QQN+ V +D+ + ++ F R C
Sbjct: 418 LNVIASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 100/404 (24%), Positives = 157/404 (38%), Gaps = 79/404 (19%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT----------------TFDPSKS 143
Y +G P V + LDTGS L WV C C +C AT ++P+ S
Sbjct: 102 YTTIELGTPGVKFMVALDTGSDLFWVPCD-CTRCSATRSSAFASALASDFDLSVYNPNGS 160
Query: 144 LTYATLPCDSSYCT--NDCGGYPDECWYNIRYTNGPDS-QGTIGSEQFNFETSDEGKTFL 200
T + C++S CT N C G C Y + Y + S G + + + D+ +
Sbjct: 161 STSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLV 220
Query: 201 -YDVGFGCSH-NNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFE 254
+V FGC + F D G+FGLG S S++ + G FS C G
Sbjct: 221 EANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFG-----R 275
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDG--SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
+ G+ L+ D TP +V +Y +T+ + +G ++D++
Sbjct: 276 DGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVE------------F 323
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDL----------------------FQGLLPSYPM 350
DSGT+ T+LV Y L + V D F +
Sbjct: 324 TALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQFHSQVEDRRR 383
Query: 351 DPAWHL----CYSGNINRDLQGFPAMAFHFAGGADLVL-DAESVFYQESSSVFCLAVGPS 405
P + CY + + + P+M+ GG+ V+ D + +S V+CLAV S
Sbjct: 384 PPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKS 443
Query: 406 DINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLAD 449
+L+IIG Y V +D L +++ DC + D
Sbjct: 444 -------AELNIIGQNFMTGYRVVFDREKLILGWKKSDCYDIED 480
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 104/423 (24%), Positives = 161/423 (38%), Gaps = 75/423 (17%)
Query: 57 DAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV-----------FYVNFSI 105
+A+ +TL AR YLS L G S VP+ + V I
Sbjct: 58 EARVLQTLAQDQARLQYLS------------SLVAGRSVVPIASGRQMLQSTTYIVKVLI 105
Query: 106 GQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-TTFDPSKSLTYATLPCDSSYCTN----DC 160
G P P L +DT S + W+ C C C + T F P+KS ++ + C + C C
Sbjct: 106 GTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPAC 165
Query: 161 GGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFS----- 215
G C +N+ Y G S S+ +D K F FGC + A
Sbjct: 166 GA--RACSFNLTY--GSSSIAANLSQDTIRLAADPIKAFT----FGCVNKVAGGGTIPPP 217
Query: 216 DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM 275
G S S+ + S FSYC+ + ++ ++ L S P
Sbjct: 218 QGLLGLGRGPLSLMSQAQSVYK---STFSYCLPSFRSLTFSGSLR-------LGPTSQPQ 267
Query: 276 SVI----------DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
V YYV L I +G K++D+ P N + + AG DSGT T L
Sbjct: 268 RVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPS-TGAGTIFDSGTVYTRL 326
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
Y+ +R E + + CYSG + P + F F G ++ +
Sbjct: 327 AKPVYEAVRNEFRKRVKPPTAVVTSLGGFDTCYSGQVK-----VPTITFMFK-GVNMTMP 380
Query: 386 AESVFYQE---SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
A+++ S+S +A P ++N +++I + QQN+ V D+ + +L R
Sbjct: 381 ADNLMLHSTAGSTSCLAMASAPENVNSV----VNVIASMQQQNHRVLIDVPNGRLGLARE 436
Query: 443 DCE 445
C
Sbjct: 437 RCS 439
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 101/398 (25%), Positives = 154/398 (38%), Gaps = 68/398 (17%)
Query: 72 IYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE 131
I + +SQ + + + +HP +T G P VLDT + W++C PC
Sbjct: 132 ISVEVGTSQTSSEPSSGIHPAAAT---------DGSSSPPVTVVLDTAGDVPWMRCVPCT 182
Query: 132 QCGATTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYN-----IRYTNGPD--SQGTIG 184
+DP++S TY+ PC+SS C G Y + C N + T G + GT
Sbjct: 183 FAQCADYDPTRSSTYSAFPCNSSAC-KQLGRYANGCDANGQCQYMVVTAGDSFTTSGTYS 241
Query: 185 SEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-GSKF 243
S+ + D + F FGCS N + Q G+ LG S + G F
Sbjct: 242 SDVLTINSGDRVEGFR----FGCSQNEQGSFENQADGIMALGRGVQSLMAQTSSTYGDAF 297
Query: 244 SYCI----GNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS--------YYVTLEGISL 291
SYC+ +F+ + GA +TPM G Y L I++
Sbjct: 298 SYCLPPTETTKGFFQIGVPI-----GASYRFVTTPMLKERGGASAAAATLYRALLLAITV 352
Query: 292 GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMD 351
K L++ +F AG +DS T +T L +AY LR + + + P
Sbjct: 353 DGKELNVPAEVFA-------AGTVMDSRTIITRLPVTAYGALRAAFRNRMRYRV--APPQ 403
Query: 352 PAWHLCYSGNINRDLQG-----FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSD 406
CY DL G P +A F G A + +D + CLA +D
Sbjct: 404 EELDTCY------DLTGVRYPRLPRIALVFDGNAVVEMDRSGILLNG-----CLAFASND 452
Query: 407 INGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ SI+G + QQ V +D+ ++ F+ C
Sbjct: 453 DD----SSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/368 (26%), Positives = 150/368 (40%), Gaps = 51/368 (13%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-TTFDPSKSLTYATLPCDS 153
T P + V +G P L LDT + W C PC+ C A + F P+ S +YA+LPC S
Sbjct: 75 TPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASLPCAS 134
Query: 154 SYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSE------QFNFETSDEGKTFLYDVGFGC 207
+C R P G +G+ Q T G G+
Sbjct: 135 DWCPL------------FRRPAVPGEPGRVGAAADVRLLQAASRTPRSGVLAATRCGWAR 182
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK----FSYCIGNLNYFEYAYNMLILG 263
+ + A S GP SL+ + GS+ FSYC+ + + ++ ++ +
Sbjct: 183 TPSPATRS----------GP-----MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGA 227
Query: 264 EGAILEGDSTPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
G TP+ YYV + G+S+G ++ F D + AG IDSGT
Sbjct: 228 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRALVKAPAGSFA-FDPSTGAGTVIDSGT 286
Query: 321 TLT-WLVPSAYQTLRKEVEDLFQGLLPS-YPMDPAWHLCYSGNINRDLQGFPAMAFHFAG 378
+T W P Y LR E Q PS Y A+ C++ + G P + H G
Sbjct: 287 VITRWTAP-VYAALRDEFRR--QVAAPSGYTSLGAFDTCFNTD-EVAAGGAPPVTLHMGG 342
Query: 379 GADLVLDAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
G DL L E+ S++ + CLA+ ++ ++++ + QQN V D+ ++
Sbjct: 343 GVDLTLPMENTLIHSSATPLACLAM--AEAPQNVNSVVNVVANLQQQNVRVVVDVAGSRV 400
Query: 438 YFQRIDCE 445
F R C
Sbjct: 401 GFAREPCN 408
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 155/376 (41%), Gaps = 57/376 (15%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-------------FDPSKSLTY 146
Y IG P V L LD GS ++WV C C +C + + + PS S T
Sbjct: 106 YTWIDIGTPNVSFLVALDAGSDMLWVPCD-CIECASLSAGNYNVLDRDLNQYRPSLSNTS 164
Query: 147 ATLPCDSSYCT--NDCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSD---EGKTFL 200
LPC C + C G D C Y ++Y++ S G + ++ + ++ E +
Sbjct: 165 RHLPCGHKLCDVHSVCKGSKDPCPYAVQYSSANTSSSGYVFEDKLHLTSNGKHAEQNSVQ 224
Query: 201 YDVGFGCSHNNA--HFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEY 255
+ GC + GV GLGP S SL+ K G + FS C + E
Sbjct: 225 ASIILGCGRKQTGEYLRGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSIC-----FEEN 279
Query: 256 AYNMLILGEGAILEGDSTPMSVIDG---SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
+I G+ + STP IDG +Y V +E +G L + F+
Sbjct: 280 ESGRIIFGDQGHVTQHSTPFLPIDGKFNAYIVGVESFCVGS--LCLKETRFQ-------- 329
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAM 372
IDSG++ T+L YQ + E + Q S + +W CY+ + +++L P +
Sbjct: 330 -ALIDSGSSFTFLPNEVYQKVVIEFDK--QVNATSIVLQNSWEYCYNAS-SQELISIPPL 385
Query: 373 AFHFAGGADLVLDAESVFYQESS---SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
F+ ++ +F +S ++FCL V PSD D + IG Y +
Sbjct: 386 NLAFSRNQTYLIQ-NPIFIDPASQEYTIFCLPVSPSD------DDYAAIGQNFLMGYRMV 438
Query: 430 YDLVSKQLYFQRIDCE 445
+D + + + R +C+
Sbjct: 439 FDRENLRFSWSRWNCQ 454
>gi|18420846|ref|NP_568459.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|67633818|gb|AAY78833.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|111074346|gb|ABH04546.1| At5g24820 [Arabidopsis thaliana]
gi|332005983|gb|AED93366.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 407
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/389 (25%), Positives = 167/389 (42%), Gaps = 70/389 (17%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCD-SSYCT 157
YV +IG P LD+ + L + QC + S T++T+ C+ SS C
Sbjct: 46 LYVEITIGTPTRTFNLKLDSSTHLTCLDNDDDHQC---SLSDKSSNTFSTISCNNSSLCP 102
Query: 158 NDCGGY---------------------PDECWYNIRYTNGPDSQGTIGSEQFNFETS--- 193
+ Y D C RY P S G + S+ +S
Sbjct: 103 HVSTNYTNYFNATTTNTTTSVSLLCTPSDFC----RYEASPSSSGYLVSDTLQLTSSITD 158
Query: 194 -DEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCI---- 247
+ + + FGC N +E GV G T+ SL+ ++ ++FS+C+
Sbjct: 159 QENSLSIVRGFVFGCGARNRATPEEDGGGVDGRLSLTTHRFSLLSQLRLTRFSHCLWPSA 218
Query: 248 -GNLNYFEYAYNMLILGEGAILEGDST--PMSVIDG----SYYVTLEGISLGEKMLDIDP 300
G+ NY LG A GD PM + G SY+V L GISLG++ +
Sbjct: 219 AGSRNYIR-------LGSAASYGGDMVLVPMLNMTGTEAYSYHVALFGISLGQQRM---- 267
Query: 301 NLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSG 360
+ N++ +G+ ID GT T L PS Y+ ++ E+L + P+ + +C++
Sbjct: 268 ---RSNES---SGIAIDVGTYYTSLEPSLYEEVK---EELTAQIGPAVAYEVNELMCFTT 318
Query: 361 NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGM 420
+ ++ P + HF G D + + ++ Q+S S C A+ S + E + ++++G
Sbjct: 319 EVGLEIDSLPKLTLHFQG-LDYTISNKGLYLQDSPSSLCTALVRSSMKDE--ERINVLGA 375
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDCELLAD 449
A ++ V YD + L FQ+ DC LAD
Sbjct: 376 SAFVDHAVGYDTSQRMLAFQQRDC--LAD 402
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 110/420 (26%), Positives = 169/420 (40%), Gaps = 89/420 (21%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ-------PCEQCGATTFDPSKSLTYA 147
TVPV ++G PP VLDTGS L W+ C P + F+ S S TYA
Sbjct: 60 TVPV-----AVGAPPQNVTMVLDTGSELSWLLCNGSRVPSTPPQPQAPAAFNGSASSTYA 114
Query: 148 TLPCDSS----YCTND------CGGYP-DECWYNIRYTNGPDSQGTIGSEQFNFETSDEG 196
C SS + D C G P + C ++ Y + + G + ++ F +
Sbjct: 115 AAHCSSSPECQWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGVLAADTFLLGGAPPV 174
Query: 197 KTFLYDVGFGC----------------SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG 240
+ FGC + +A S E TG+ G+ + S V + G
Sbjct: 175 RAL-----FGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGM---NRGSLSFVTQTG 226
Query: 241 S-KFSYCIGNLNYFEYAYNMLIL---GEGAILEGD-----------STPMSVIDG-SYYV 284
+ +F+YCI + +L+L G+GA L S P+ D +Y V
Sbjct: 227 TLRFAYCIAPGD----GPGLLVLGGDGDGAALSAAPQLNYTPLIEMSQPLPYFDRVAYSV 282
Query: 285 TLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGL 344
LEGI +G +L I ++ + T + +DSGT T+L+ AY L+ E + L
Sbjct: 283 QLEGIRVGAALLPIPKSVLAPDHTGAGQ-TMVDSGTQFTFLLADAYAPLKGEFLNQTSAL 341
Query: 345 L-----PSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAG----GADLVLDAESVFYQ--- 392
L P + A+ C+ + R + G GA++ + E + Y
Sbjct: 342 LAPLGEPDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLRGAEVAVGGEKLLYMVPG 401
Query: 393 ------ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
S +V+CL G SD+ G +IG QQN V YDL + ++ F C+L
Sbjct: 402 ERRGEGGSEAVWCLTFGNSDMAG---MSAYVIGHHHQQNVWVEYDLQNSRVGFAPARCDL 458
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 144/361 (39%), Gaps = 55/361 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------TFDPSKSLTYATLPCD 152
+ V S+G P V Q +DTGS L WV+C+PC + FDP++S +YA +PC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCG 199
Query: 153 SSYCTNDCGGY------PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
C G Y +C Y + Y +G ++ G S+ S + F FG
Sbjct: 200 GPVCAG-LGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFF----FG 254
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLIL 262
C H + F GV GL SLVE+ G FSYC+ + +
Sbjct: 255 CGHAQSGL----FNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVG 310
Query: 263 GEGAILEGDST----PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
G G ST P Y V L GIS+G + L + + F
Sbjct: 311 GPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG------ 364
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGL-LPSYPMDPAWHLCYS----GNINRDLQGFPAMA 373
T +T L P+AY LR P+ P + CY+ G + P +A
Sbjct: 365 -TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVT-----LPNVA 418
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
F GA + L A+ + S CLA PS +G ++I+G + Q+++ V D
Sbjct: 419 LTFGSGATVTLGADGIL-----SFGCLAFAPSGSDG----GMAILGNVQQRSFEVRIDGT 469
Query: 434 S 434
S
Sbjct: 470 S 470
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/397 (24%), Positives = 155/397 (39%), Gaps = 50/397 (12%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
AR YLS ++++ A I+ P + V IG P L +DT + WV C
Sbjct: 69 ARMQYLSSLVARRSIVPIASGR-QITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWVPCT 127
Query: 129 PCEQCGATT-FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQ 187
C C TT F P+KS T+ + C +S C + P G+ +
Sbjct: 128 ACVGCSTTTPFAPAKSTTFKKVGCGASQC---------------KQVRNPTCDGSACAFN 172
Query: 188 FNFETSDEGKTFLYDV-----------GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLV 236
F + TS + + D FGC S + S
Sbjct: 173 FTYGTSSVAASLVQDTVTLATDPVPAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQ 232
Query: 237 EKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGE 293
+ S FSYC+ + ++ ++ + TP+ YYV L I +G
Sbjct: 233 KLYQSTFSYCLPSFKTLNFSGSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGR 292
Query: 294 KMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA 353
+++DI P N + AG DSGT T LV AY +R E F+ + +
Sbjct: 293 RIVDIPPEALAFNAN-TGAGTVFDSGTVFTRLVEPAYNAVRNE----FRRRIAVHKKLTV 347
Query: 354 WHL-----CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPSDI 407
L CY+ I P + F F+ G ++ L +++ ++ SV CLA+ P+
Sbjct: 348 TSLGGFDTCYTAPIVA-----PTITFMFS-GMNVTLPPDNILIHSTAGSVTCLAMAPAPD 401
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
N L++I + QQN+ V +D+ + +L R C
Sbjct: 402 NVNSV--LNVIANMQQQNHRVLFDVPNSRLGVARELC 436
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 100/211 (47%), Gaps = 21/211 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSY 155
+ + SIG PPV A DTGS LIW++C PC C FD S T++ + C S
Sbjct: 59 YLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGSES 118
Query: 156 CTN--DCGGYPDE--CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C+ PD+ C YN Y +G ++QG + E ++ V FGC HNN
Sbjct: 119 CSKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHNN 178
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSK-----FSYCIGNLNYFEYAYNMLILGEGA 266
+++ G+ GLG SLV ++GS FS C+ N + + G+G+
Sbjct: 179 NGAFNDKEMGIIGLG---RGPLSLVSQIGSSLGGNMFSQCLVPFNTNPSISSPMSFGKGS 235
Query: 267 ILEGD---STPM---SVIDGSYYVTLEGISL 291
+ G+ STP+ + Y+VTL GIS+
Sbjct: 236 EVLGNGVVSTPLVSKTTYQSFYFVTLLGISV 266
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/407 (24%), Positives = 166/407 (40%), Gaps = 83/407 (20%)
Query: 89 LHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQCGA----TTFDPSKS 143
LH + FY +G P ++DTGS++ +V C C CG FDP+ S
Sbjct: 52 LHGAVKDYGYFYATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKDAAFDPASS 111
Query: 144 LTYATLPCDSSYCTNDCGGYP------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK 197
+ A + CDS C CG P EC Y Y S G + S+Q +
Sbjct: 112 SSSAVIGCDSDKCI--CGRPPCGCSEKRECTYQRTYAEQSSSAGLLVSDQLQL------R 163
Query: 198 TFLYDVGFGC-SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK------FSYCIGNL 250
+V FGC + +++ G+ GLG +S SLV ++ F+ C G++
Sbjct: 164 DGAVEVVFGCETKETGEIYNQEADGILGLG---NSEVSLVNQLAGSGVIDDVFALCFGSV 220
Query: 251 NYFEYAYNMLILGEGAILEGDSTPM-------------SVIDGSYY-VTLEGISLGEKML 296
G+GA++ GD S+ YY V LE + +G + L
Sbjct: 221 E-----------GDGALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQL 269
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVE--DLFQGLLPSYPMDPA- 353
+ P +++ G +DSGTT T+L A+Q ++ V L GL DP
Sbjct: 270 PVKPERYEEG-----YGTVLDSGTTFTYLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKE 324
Query: 354 -----WH-LCYSG-------NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSV--F 398
+H +C+ G + ++ + FP FA G L + + + + +
Sbjct: 325 KSFAQFHDICFGGAPHAGHADQSKLEKVFPVFELQFADGVRLRTGPLNYLFMHTGEMGAY 384
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
CL V + +G +++G I+ +N V YD ++++ F C+
Sbjct: 385 CLGVFDNGASG------TLLGGISFRNILVQYDRRNRRVGFGAASCQ 425
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 160/382 (41%), Gaps = 48/382 (12%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT--TFDPSKSLTYATLPCDSSYCTN 158
V+ ++G PP VLDTGS L W+ C A +F P S T+A +PC S+ C++
Sbjct: 63 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAADSFRPRASATFAAVPCGSARCSS 122
Query: 159 -------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C ++ Y +G S G + ++ F + ++ FGC +
Sbjct: 123 RDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAVGDAPPLRS-----AFGC-MSA 176
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGAI--- 267
A+ S GL S V + + +FSYCI + + +L+LG +
Sbjct: 177 AYDSSPDAVATAGLLGMNRGALSFVTQASTRRFSYCISDRD----DAGVLLLGHSDLPFL 232
Query: 268 ------LEGDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
L + P+ D +Y V L GI +G K L I P++ + T + +DSGT
Sbjct: 233 PLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGA-GQTMVDSGT 291
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSGNINRDLQG--FPAMA 373
T+L+ AY ++ E + LL PS+ A+ C+ R P +
Sbjct: 292 QFTFLLGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLPPVT 351
Query: 374 FHFAGGADLVLDAESVFY------QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
F GA + + + + Y + + V+CL G +D+ +IG Q N
Sbjct: 352 LLF-NGAQMSVAGDRLLYKVPGERRGADGVWCLTFGNADMVP---LTAYVIGHHHQMNLW 407
Query: 428 VAYDLVSKQLYFQRIDCELLAD 449
V YDL ++ + C++ ++
Sbjct: 408 VEYDLERGRVGLAPVKCDVASE 429
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/389 (25%), Positives = 162/389 (41%), Gaps = 33/389 (8%)
Query: 74 LSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC 133
L + S+ + R L+ + + IG PP ++DTGS++ +V C C C
Sbjct: 68 LKESDSEHHPNARMRLYDDLLRNGYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHC 127
Query: 134 GA---TTFDPSKSLTYATLPCD-SSYCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFN 189
G+ F P S TY + C C ND +C Y RY S G +G + +
Sbjct: 128 GSHQDPKFRPEDSETYQPVKCTWQCNCDND----RKQCTYERRYAEMSTSSGALGEDVVS 183
Query: 190 FETSDEGKTFLYDVGFGCSHNN-AHFSDEQFTGVFGLGPA-TSSTHSLVEK--VGSKFSY 245
F E FGC ++ +++ G+ GLG S LVEK + FS
Sbjct: 184 FGNQTELSP--QRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSL 241
Query: 246 CIGNLNYFEYAYNMLILGEGA-ILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFK 304
C G + A + + A ++ S P V Y + L+ I + K L ++P +F
Sbjct: 242 CYGGMGVGGGAMVLGGISPPADMVFTRSDP--VRSPYYNIDLKEIHVAGKRLHLNPKVFD 299
Query: 305 KNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-LCYSG--- 360
G +DSGTT +L SA+ + + L DP ++ +C+SG
Sbjct: 300 -----GKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEI 354
Query: 361 NINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS--SVFCLAVGPSDINGERFKDLSII 418
++++ + FP + F G L L E+ ++ S +CL V NG +++
Sbjct: 355 DVSQISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFS---NGN--DPTTLL 409
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
G I +N V YD ++ F + +C L
Sbjct: 410 GGIVVRNTLVMYDREHTKIGFWKTNCSEL 438
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/390 (25%), Positives = 162/390 (41%), Gaps = 58/390 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQP---CEQC--------GATTFDPSKSLTYA 147
+ ++ + G PP V+DTGSSL+W C C +C G TF P S +
Sbjct: 83 YSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSK 142
Query: 148 TLPCDSSYCT-----------NDCGGYPDECW-----YNIRYTNGPDSQGTIGSEQFNFE 191
+ C + C+ +C C Y I+Y +G + G + SE +F
Sbjct: 143 LIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSG-STAGLLLSETLDFP 201
Query: 192 TSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNL 250
FL GCS FS +Q G+ G G S SL ++G KFSYC+ +
Sbjct: 202 NKKTIPDFL----VGCS----IFSIKQPEGIAGFG---RSPESLPSQLGLKKFSYCLVSH 250
Query: 251 NYFEYAYN---MLILGEGAILEGDS---------TPMSVIDGSYYVTLEGISLGEKMLDI 298
+ + + +L G G+ + + P + YYV L I +G+ + +
Sbjct: 251 AFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKV 310
Query: 299 DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL-- 356
P F T + G +DSGTT T++ Y+ + KE E + + L
Sbjct: 311 -PYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLRP 369
Query: 357 CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLS 416
CY+ + + L P + F F GGA + L + F S V CL + ++ G
Sbjct: 370 CYNISGEKSLS-VPDLIFQFKGGAKMALPLSNYFSIVDSGVICLTIVSDNVAGPGLGGGP 428
Query: 417 --IIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
I+G Q+N+ V +DL +++ F++ C
Sbjct: 429 AIILGNYQQRNFYVEFDLENEKFGFKQQSC 458
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 154/357 (43%), Gaps = 31/357 (8%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSSYCTNDCG 161
IG PP ++DTGS++ +V C C+ CG+ F P S TY + C + C +C
Sbjct: 99 IGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKC-TWQC--NCD 155
Query: 162 GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN-AHFSDEQFT 220
+C Y RY S G +G + +F E FGC ++ +++
Sbjct: 156 DDRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSP--QRAIFGCENDETGDIYNQRAD 213
Query: 221 GVFGLGPA-TSSTHSLVEK--VGSKFSYCIGNLNYFEYAYNMLILGEGA-ILEGDSTPMS 276
G+ GLG S LVEK + FS C G + A + + A ++ S P
Sbjct: 214 GIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGISPPADMVFTHSDP-- 271
Query: 277 VIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKE 336
V Y + L+ I + K L ++P +F G +DSGTT +L SA+ +
Sbjct: 272 VRSPYYNIDLKEIHVAGKRLHLNPKVFD-----GKHGTVLDSGTTYAYLPESAFLAFKHA 326
Query: 337 VEDLFQGLLPSYPMDPAWH-LCYSG---NINRDLQGFPAMAFHFAGGADLVLDAESVFYQ 392
+ L DP ++ +C+SG N+++ + FP + F G L L E+ ++
Sbjct: 327 IMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFPVVEMVFGNGHKLSLSPENYLFR 386
Query: 393 ESS--SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
S +CL V NG +++G I +N V YD ++ F + +C L
Sbjct: 387 HSKVRGAYCLGVFS---NGN--DPTTLLGGIVVRNTLVMYDREHSKIGFWKTNCSEL 438
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 83/359 (23%), Positives = 141/359 (39%), Gaps = 28/359 (7%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSY 155
P + V +G P L +DT + W+ C C C ++ F+P+ S +Y +PC S
Sbjct: 105 PTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSPFNPAASASYRPVPCGSPQ 164
Query: 156 CT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C C +++ Y + Q + + D K + FGC
Sbjct: 165 CVLAPNPSCSPNAKSCGFSLSYADS-SLQAALSQDTLAVA-GDVVKAYT----FGCLQRA 218
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
+ + S + G+ FSYC+ + ++ + + G
Sbjct: 219 TGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIK 278
Query: 272 STPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+TP+ YYV + GI +G+K++ I P D + AG +DSGT T LV
Sbjct: 279 TTPLLANPHRSSLYYVNMTGIRVGKKVVSI-PASALAFDPATGAGTVLDSGTMFTRLVAP 337
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAES 388
Y LR EV + + CY+ + +P + F G + L E+
Sbjct: 338 VYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTTV-----AWPPVTLLF-DGMQVTLPEEN 391
Query: 389 VFYQE---SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
V ++S +A P +N L++I + QQN+ V +D+ + ++ F R C
Sbjct: 392 VVIHTTYGTTSCLAMAAAPDGVN----TVLNVIASMQQQNHRVLFDVPNGRVGFARESC 446
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 94/391 (24%), Positives = 161/391 (41%), Gaps = 61/391 (15%)
Query: 90 HPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKS------ 143
HP S +++ +G P +DTGS ++WV C C C P KS
Sbjct: 67 HP--SESGLYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNC------PKKSDLGIEL 118
Query: 144 --------LTYATLPCDSSYCTNDCGG-----YPDE-CWYNIRYTNGPDSQGTIGSEQF- 188
T + C+ +CT+ G P+ C Y + Y +G + G +
Sbjct: 119 SLYSPSSSSTSNRVTCNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVV 178
Query: 189 ------NFETSDEGKTFLYDVGFGCSHNNAH---FSDEQFTGVFGLGPATSSTHSLVE-- 237
NF+T+ + + FGC + + G+ G G A SS S +
Sbjct: 179 LDRVTGNFQTTSTNGSIV----FGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASS 234
Query: 238 -KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKML 296
KV F++C+ N+N + +GE + +TP+ Y V ++ I + ++L
Sbjct: 235 GKVKRVFAHCLDNIN----GGGIFAIGEVVQPKVRTTPLVPQQAHYNVFMKAIEVDNEVL 290
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF--QGLLPSYPMDPAW 354
++ ++F DT G IDSGTTL + Y+ L + +F Q L + ++ +
Sbjct: 291 NLPTDVF---DTDLRKGTIIDSGTTLAYFPDVIYEPL---ISKIFARQSTLKLHTVEEQF 344
Query: 355 H-LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
Y GN++ GFP + FHF L + + S+ +C+ S K
Sbjct: 345 TCFEYDGNVD---DGFPTVTFHFEDSLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGK 401
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
D+ ++G + QN V YDL ++ + + +C
Sbjct: 402 DMILLGDLVLQNRLVMYDLENQTIGWTEYNC 432
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 83/359 (23%), Positives = 141/359 (39%), Gaps = 28/359 (7%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-FDPSKSLTYATLPCDSSY 155
P + V +G P L +DT + W+ C C C ++ F+P+ S +Y +PC S
Sbjct: 52 PTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSPFNPAASASYRPVPCGSPQ 111
Query: 156 CT----NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C C +++ Y + Q + + D K + FGC
Sbjct: 112 CVLAPNPSCSPNAKSCGFSLSYADS-SLQAALSQDTLAVA-GDVVKAYT----FGCLQRA 165
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
+ + S + G+ FSYC+ + ++ + + G
Sbjct: 166 TGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIK 225
Query: 272 STPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+TP+ YYV + GI +G+K++ I P D + AG +DSGT T LV
Sbjct: 226 TTPLLANPHRSSLYYVNMTGIRVGKKVVSI-PASALAFDPATGAGTVLDSGTMFTRLVAP 284
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAES 388
Y LR EV + + CY+ + +P + F G + L E+
Sbjct: 285 VYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTTV-----AWPPVTLLF-DGMQVTLPEEN 338
Query: 389 VFYQE---SSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
V ++S +A P +N L++I + QQN+ V +D+ + ++ F R C
Sbjct: 339 VVIHTTYGTTSCLAMAAAPDGVN----TVLNVIASMQQQNHRVLFDVPNGRVGFARESC 393
>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 342
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 105/413 (25%), Positives = 174/413 (42%), Gaps = 116/413 (28%)
Query: 42 LLHRDSLL---YNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPV 98
L+HRDS L YNP+ T ++R + ++ SS + + L P
Sbjct: 33 LIHRDSPLSPFYNPSLT---PSERITDAAL---------SSNENKLPESILIPNNGE--- 77
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTN 158
+ + IG PPV +L + DTGS IWV+C PC+ C
Sbjct: 78 YLMRLYIGTPPVERLVIADTGSDFIWVQCSPCQNC------------------------- 112
Query: 159 DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY-DVGFGC-SHNNAHF-S 215
+C Y Y N + +G+E +F+++ +T + + FGC ++NN F S
Sbjct: 113 -------QCVYLNIYANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGCGANNNLTFRS 165
Query: 216 DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM 275
++ TG+ GL + SLV ++G++ Y +Y ++ +I G + STP+
Sbjct: 166 SDKATGLVGL---VAGQLSLVSQLGAQIGY---KFSYLKFGSEAIITTNGVV----STPL 215
Query: 276 SVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQT 332
+I S Y++ LE +++G+K+ VP+ +T
Sbjct: 216 -IIKPSLPLYFLNLEVVTIGQKV------------------------------VPT--ET 242
Query: 333 LRKE-VEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFY 391
L E V+DL +P + C+ RD PA+AF F G + + +
Sbjct: 243 LGVESVQDL------PFP----FKFCFP---YRDNMTVPAIAFQFTGASVALRPKNLLIK 289
Query: 392 QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ ++ LAV PS +SI G+IAQ ++ V YDL K++ DC
Sbjct: 290 LQDRNMLXLAVVPS---ASSLSVISIFGIIAQFDFQVLYDLDGKKVSVAPTDC 339
>gi|125555046|gb|EAZ00652.1| hypothetical protein OsI_22673 [Oryza sativa Indica Group]
Length = 340
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 85/287 (29%), Positives = 123/287 (42%), Gaps = 41/287 (14%)
Query: 125 VKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIRYTNGPDSQGTIG 184
V PC+ FDPS+S ++A +PC S C +C G C + I++ N + GT+
Sbjct: 24 VGGAPCD----VAFDPSRSSSFAAIPCGSPECAVECTGA--SCPFTIQFGNVTVANGTLV 77
Query: 185 SEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK-- 242
+ S F FGC A + F G GL + S+HSL +V S
Sbjct: 78 RDTLTLSPSATFAGFT----FGCIEVGAD--ADTFDGAVGLIDLSRSSHSLASRVISNGA 131
Query: 243 -------FSYCIGNLNYFEYAYNMLILGEGAILEGDS---TPMSVI---DGSYYVTLEGI 289
FSYC+ +L+ + I G PMS SY+V L GI
Sbjct: 132 TTTTTAAFSYCLPSLSSTRSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGI 191
Query: 290 SLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYP 349
S+G + L + P + + G +++ T T+L P+AY LR D F+ + YP
Sbjct: 192 SVGGEDLPVPPAVLAAH------GTLLEAATEFTFLAPAAYAALR----DAFRNDMAQYP 241
Query: 350 MDPAWHL---CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQE 393
P + + CY+ L PA+A FAGG +L LD Y E
Sbjct: 242 AAPPFRVLDTCYNLTGLASLA-VPAVALRFAGGTELELDVRQTMYFE 287
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 141/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFT----FGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A LR+ + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLRQRIRELL--LKRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P+ K +SIIG
Sbjct: 284 DGARFDLGSHGVFVERSVQEQDVWCLAFAPT-------KSVSIIG 321
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 160/372 (43%), Gaps = 50/372 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLPC 151
+++ S+G PPV L +DTGS+L WV+C+ C+ +C F+P S TY+ + C
Sbjct: 25 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 84
Query: 152 DSSYCTN---------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
+ C C D C Y++RY +G S G +G ++ ++ F+
Sbjct: 85 STEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNFI-- 142
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLV--EKVGSKFSYCI-------GNLNYF 253
FGC +N + + G+ G G + S + V + + FSYC G+L
Sbjct: 143 --FGCGEDNLY--NGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTIG 198
Query: 254 EYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
YA ++ ++ I D P +Y + + + L+IDP ++ T
Sbjct: 199 PYARDINLMWTKLIYY-DHKP------AYAIQQLDMMVNGIRLEIDPYIYISKMT----- 246
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNI-NRDLQGFPAM 372
+DSGT T+++ + L K + Q + D +C+ N + + FP +
Sbjct: 247 -IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDER-RICFISNSGSANWNDFPTV 304
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
+ L L E+ FY+ S++V C P D + + ++G A +++ + +D+
Sbjct: 305 EMKLI-RSTLKLPVENAFYESSNNVICSTFLPDDAG---VRGVQMLGNRAVRSFKLVFDI 360
Query: 433 VSKQLYFQRIDC 444
+ F+ C
Sbjct: 361 QAMNFGFKARAC 372
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 160/372 (43%), Gaps = 50/372 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLPC 151
+++ S+G PPV L +DTGS+L WV+C+ C+ +C F+P S TY+ + C
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 65
Query: 152 DSSYCTN---------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
+ C C D C Y++RY +G S G +G ++ ++ F+
Sbjct: 66 STEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNFI-- 123
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLV--EKVGSKFSYCI-------GNLNYF 253
FGC +N + + G+ G G + S + V + + FSYC G+L
Sbjct: 124 --FGCGEDNLY--NGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTIG 179
Query: 254 EYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
YA ++ ++ I D P +Y + + + L+IDP ++ T
Sbjct: 180 PYARDINLMWTKLIYY-DHKP------AYAIQQLDMMVNGIRLEIDPYIYISKMT----- 227
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNI-NRDLQGFPAM 372
+DSGT T+++ + L K + Q + D +C+ N + + FP +
Sbjct: 228 -IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDER-RICFISNSGSANWNDFPTV 285
Query: 373 AFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
+ L L E+ FY+ S++V C P D + + ++G A +++ + +D+
Sbjct: 286 EMKLI-RSTLKLPVENAFYESSNNVICSTFLPDDAG---VRGVQMLGNRAVRSFKLVFDI 341
Query: 433 VSKQLYFQRIDC 444
+ F+ C
Sbjct: 342 QAMNFGFKARAC 353
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 92/349 (26%), Positives = 136/349 (38%), Gaps = 36/349 (10%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYC 156
P + V IG PP L +DT + W+ C C+ C +T F P KS T+ + C +
Sbjct: 91 PTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAPEKSTTFKNVSCAAP-- 148
Query: 157 TNDCGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD----VGFGCSHNN 211
+C P+ C + R N T GS + T D FGC
Sbjct: 149 --ECKQVPNPGCGVSSRNFN-----LTYGSSSIAANLVQDTITLATDPVPSYTFGCVSKT 201
Query: 212 AHFS---DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAIL 268
S G S T +L + S FSYC+ + ++ ++ +
Sbjct: 202 TGTSAPPQGLLGLGRGPLSLLSQTQNLYQ---STFSYCLPSFKSLNFSGSLRLGPVAQPK 258
Query: 269 EGDSTPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
TP+ YYV LE I +G K++DI P N T + AG DSGT T L
Sbjct: 259 RIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPT-TGAGTIFDSGTVFTRL 317
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
V Y +R E L + + CY+ I P + F F G +
Sbjct: 318 VAPVYVAVRDEFRRRVGPKLTVTSLG-GFDTCYNVPI-----VVPTITFIFTGMNVTLPQ 371
Query: 386 AESVFYQESSSVFCLAV--GPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
+ + + S CLA+ P ++N L++I + QQN+ V YD+
Sbjct: 372 DNILIHSTAGSTTCLAMAGAPDNVNSV----LNVIANMQQQNHRVLYDV 416
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 85/358 (23%), Positives = 139/358 (38%), Gaps = 34/358 (9%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
+ V S+G PP L +DT + W+ C C C A FDP+ S +Y T+PC S
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPAASASYRTVPCGSPL 171
Query: 156 CTN----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C C C +++ Y DS Q + + + FGC
Sbjct: 172 CAQAPNAACPPGGKACGFSLTYA---DSSLQAALSQDSLAVAGNA---VKAYTFGCLQRA 225
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
+ + S + + FSYC+ + ++ + + G
Sbjct: 226 TGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIK 285
Query: 272 STPMSV---IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+TP+ YYV + G+ +G K++ I D + AG +DSGT T LV
Sbjct: 286 TTPLLANPHRSSLYYVNMTGVRVGRKVVPI-----PAFDPATGAGTVLDSGTMFTRLVAP 340
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAES 388
AY +R EV + S + C+ N +P M F G + +
Sbjct: 341 AYVAVRDEVRRRVGAPVSSL---GGFDTCF----NTTAVAWPPMTLLFDGMQVTLPEENV 393
Query: 389 VFYQESSSVFCLAV--GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
V + ++ CLA+ P +N L++I + QQN+ V +D+ + ++ F R C
Sbjct: 394 VIHSTYGTISCLAMAAAPDGVN----TVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
Length = 472
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 163/391 (41%), Gaps = 75/391 (19%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLP 150
+F + S+G+PPV L +DTGS+L WV+CQPC C FDP +S T +
Sbjct: 113 LFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172
Query: 151 CDSSYCTN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTF 199
C S C +C D C Y++ Y NG S G + ++ G +F
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSF 227
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLN 251
+ D+ FGCS + + E G+ SS+ S E++ FSYC L
Sbjct: 228 M-DLMFGCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKAFSYC---LP 278
Query: 252 YFEYAYNMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKN 306
E +ILG + A ++G TP+ S+ +Y +T+E I+ G++++
Sbjct: 279 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT--------- 329
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY------ 358
S + + +DSG T L PS + L K + G + ++CY
Sbjct: 330 ---SSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDY 386
Query: 359 ---SGNIN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
+G I + P + FAGGA L L +VFY + C+ + +
Sbjct: 387 SGWNGTITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQ-- 444
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
I+G +++ +D+ KQ F+ C
Sbjct: 445 ---ILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 143/365 (39%), Gaps = 87/365 (23%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSS 154
+ V +G P + DTGS L W +C+PC Q FDPS SL+Y+ + CDS
Sbjct: 89 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSP 148
Query: 155 YCT-------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGC 207
C N G C Y IRY +G S G E+ + ++D F FGC
Sbjct: 149 SCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTDVFNNFQ----FGC 204
Query: 208 SHNNAHFSDEQFTGVFGLGPATSSTHSLV----EKVGSKFSYCIGNLNYFEYAYNMLILG 263
NN F G GL + SLV +K G FSYC L + L G
Sbjct: 205 GQNNRGL----FGGTAGLLGLARNPLSLVSQTAQKYGKVFSYC---LPSSSSSTGYLSFG 257
Query: 264 EGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
G +GDS K + P
Sbjct: 258 SG---DGDS---------------------KAVKFTPR---------------------- 271
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQGFPAMAFHFAGGA 380
L P+ Y +++K +F+ L+ YP + CY + + ++ P + +F+GGA
Sbjct: 272 -LPPTVYSSVQK----VFRELMSDYPRVKGVSILDTCYDLSKYKTVK-VPKIILYFSGGA 325
Query: 381 DLVLDAESVFYQESSSVFCLA-VGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
++ L E + Y S CLA G SD + +++IIG + Q+ +V YD ++ F
Sbjct: 326 EMDLAPEGIIYVLKVSQVCLAFAGNSDDD-----EVAIIGNVQQKTIHVVYDDAEGRVGF 380
Query: 440 QRIDC 444
C
Sbjct: 381 APSGC 385
>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
Length = 474
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 163/391 (41%), Gaps = 75/391 (19%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLP 150
+F + S+G+PPV L +DTGS+L WV+CQPC C FDP +S T +
Sbjct: 115 LFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 174
Query: 151 CDSSYCTN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTF 199
C S C +C D C Y++ Y NG S G + ++ G +F
Sbjct: 175 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSF 229
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLN 251
+ D+ FGCS + + E G+ SS+ S E++ FSYC L
Sbjct: 230 M-DLMFGCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKAFSYC---LP 280
Query: 252 YFEYAYNMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKN 306
E +ILG + A ++G TP+ S+ +Y +T+E I+ G++++
Sbjct: 281 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT--------- 331
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY------ 358
S + + +DSG T L PS + L K + G + ++CY
Sbjct: 332 ---SSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDY 388
Query: 359 ---SGNIN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
+G I + P + FAGGA L L +VFY + C+ + +
Sbjct: 389 SGWNGTITPFSNWSALPPLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQ-- 446
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
I+G +++ +D+ KQ F+ C
Sbjct: 447 ---ILGNRVTRSFGTTFDIQGKQFGFKYAAC 474
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 104/434 (23%), Positives = 177/434 (40%), Gaps = 61/434 (14%)
Query: 53 NDTVDAQAQRTLNMS--MARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPV 110
+ + QR+L+ +AR + + KA + A L PG + V G P
Sbjct: 47 QELIRRAVQRSLDRPGIVARSGGGAADEAGKAVASEAPLVPGGGE---YLVKLGTGTPQH 103
Query: 111 PQLAVLDTGSSLIWVKCQPCEQCGAT---TFDPSKSLTYATLPCDSSYCTNDCGGYPDE- 166
A +DT S L+W++CQPC C F+P S +YA +PC S C G E
Sbjct: 104 FFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHED 163
Query: 167 ----CWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGV 222
C Y +Y+ ++GT+ ++ G + V FGCS ++ Q +G+
Sbjct: 164 DDGACQYTYKYSGHGVTKGTLAIDKLAI-----GGDVFHAVVFGCSDSSVGGPAAQASGL 218
Query: 223 FGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGA-ILEGDSTPMSVIDG 280
GLG SLV ++ +F YC+ L+LG GA + S ++V
Sbjct: 219 VGLG---RGPLSLVSQLSVHRFMYCLP--PPMSRTSGKLVLGAGADAVRNMSDRVTVTMS 273
Query: 281 S-------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDA------------------GVF 315
S YY+ L+G+++G++ N + G+
Sbjct: 274 SSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMI 333
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQGFPAMA 373
+D +T+++L S Y L ++E+ + + + LC+ + D P ++
Sbjct: 334 VDVASTISFLETSLYDELADDLEEEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVS 393
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
F G L LD + +F + + CL +G R +SI+G QN V ++L
Sbjct: 394 LSFDGRW-LELDRDRLFVTD-GRMMCLMIG-------RTSGVSILGNFQLQNMRVLFNLR 444
Query: 434 SKQLYFQRIDCELL 447
++ F + C+ L
Sbjct: 445 RGKITFAKASCDSL 458
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 94/375 (25%), Positives = 152/375 (40%), Gaps = 46/375 (12%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDS 153
P + NF+IG PP P A++D L+W +C C +C F P+ S T+ PC +
Sbjct: 60 PYYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGT 119
Query: 154 SYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
+ C T C G D C Y GP +Q + F + T + FGC
Sbjct: 120 AVCESIPTRSCSG--DVCSY-----KGPPTQLRGNTSGFAATDTFAIGTATVRLAFGCVV 172
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAIL 268
+ + + +G GLG + SLV ++ ++FSYC+ N + + L LG A L
Sbjct: 173 ASDIDTMDGPSGFIGLG---RTPWSLVAQMKLTRFSYCLSPRNTGK--SSRLFLGSSAKL 227
Query: 269 EGDSTPMSV--IDGS--------YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFI-D 317
G + + I S Y ++L+ I G + T G+ +
Sbjct: 228 AGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTI----------ATAQSGGILVMH 277
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQG--LLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
+ + + LV SAY+ +K V + G P + LC+ P + F
Sbjct: 278 TVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFT 337
Query: 376 FAGGADLVLDAESVFYQ--ESSSVFCLAV-GPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
F G A L + E C A+ + +N + +S++G + Q++ + YDL
Sbjct: 338 FQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDL 397
Query: 433 VSKQLYFQRIDCELL 447
+ L F+ DC L
Sbjct: 398 KKETLSFEPADCSSL 412
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 103/210 (49%), Gaps = 35/210 (16%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTN--- 158
+G P + DTGS LIW++C PC C T FDP++S TY T+ DS C
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122
Query: 159 -DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG---FGCSHNNAHF 214
C C Y Y +G ++GT+ ++ F FE D +T + +VG FGCSH+
Sbjct: 123 ISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFE--DPTRTIV-EVGYLTFGCSHDTKAR 179
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCI--------GNLNYFEYAYNMLILGEG 265
GV GL +SLV ++ KFSYC+ G+ YF G
Sbjct: 180 LKGHQAGVVGL---NRHPNSLVSQLKVKKFSYCMVIPDDHGSGSRMYF---------GSR 227
Query: 266 AILEGDSTPMSVIDGS-YYVTLEGISLGEK 294
A++ G TP+ D S Y+VTL+GIS+GE+
Sbjct: 228 AVILGGKTPLLKGDYSHYFVTLKGISVGEE 257
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 53/110 (48%), Gaps = 10/110 (9%)
Query: 125 VKCQPCEQCGATT---FDPSKSLTYATLPCDSSYCTNDCGGYP-----DECWYNIRYTNG 176
++ Q QC T FDPSKS TY+T+P D+ C GGY ++C Y I Y +G
Sbjct: 326 LEAQEVAQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQ-AGGYACHIDEEDCCYRISYGSG 384
Query: 177 PDS-QGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGL 225
S +GTI + F FE + + + + FGCS G+ GL
Sbjct: 385 STSTEGTISIDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGL 434
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 41/68 (60%), Gaps = 5/68 (7%)
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
P + FHF G AD +L + + + ++CLA+ ++ + LSI+G I QQNY+V
Sbjct: 269 PDITFHFYG-ADFILTKXTTYVEVEKGLWCLAM----LSSNSTRKLSILGNIQQQNYHVG 323
Query: 430 YDLVSKQL 437
YDL ++++
Sbjct: 324 YDLEAQEV 331
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 107/403 (26%), Positives = 167/403 (41%), Gaps = 54/403 (13%)
Query: 70 RFIYLSQKSSQKAHDTRAHLHPGIST-VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ 128
R YLS + K T + G + + V +G PP VLDT + +W+ C
Sbjct: 74 RLTYLSSLVAGKPKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCS 133
Query: 129 PCEQC--GATTFDPSKSLTYATLPCDSSYCTNDCG-------GYPDECWYNIRYTNGPDS 179
C C +T+F+ + S TY+T+ C ++ CT G P C +N Y
Sbjct: 134 GCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYG----- 188
Query: 180 QGTIGSEQFNFETSDEGKTFLYDV----GFGCSHNNAHFSDEQFTGVFGLGPA----TSS 231
G F+ + T DV FGC N+A + G+ GLG S
Sbjct: 189 ----GDSSFSASLVQDTLTLAPDVIPNFSFGC-INSASGNSLPPQGLMGLGRGPMSLVSQ 243
Query: 232 THSLVEKVGSKFSYCIGNLN--YFEYAYNMLILGE------GAILEGDSTPMSVIDGSYY 283
T SL V FSYC+ + YF + + +LG+ +L P YY
Sbjct: 244 TTSLYSGV---FSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRP-----SLYY 295
Query: 284 VTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQG 343
V L G+S+G + +DP ++ D S AG IDSGT +T Y+ +R E Q
Sbjct: 296 VNLTGVSVGSVQVPVDP-VYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRK--QV 352
Query: 344 LLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVF-CLAV 402
+ S+ A+ C+S + N ++ P + H DL L E+ S+ CL++
Sbjct: 353 NVSSFSTLGAFDTCFSAD-NENVA--PKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSM 408
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
+ I L++I + QQN + +D+ + ++ C
Sbjct: 409 --AGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 112/415 (26%), Positives = 170/415 (40%), Gaps = 79/415 (19%)
Query: 86 RAHLHPGISTVPVFY-------VNFSIGQPPVPQLAVLDTGSSLIWVKCQP---CEQCGA 135
RAH T PVF ++ S G PP V+DTGSS +W C C C
Sbjct: 57 RAHHLKNPQTTPVFSHSYGGYSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSF 116
Query: 136 TT----FDPSKSLTYATLPCDSSYCT---------NDCGGYPDECW-----YNIRYTNGP 177
T+ F P S + + C + C+ DC C Y I Y +G
Sbjct: 117 TSRISPFLPKHSSSSKIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSG- 175
Query: 178 DSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
+ G SE + + + GCS FS Q G+ G G SS L
Sbjct: 176 TTGGVALSETLHLH-----GLIVPNFLVGCSV----FSSRQPAGIAGFGRGPSS---LPS 223
Query: 238 KVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEGDS-----------TPM----SVIDGS 281
++G +KFSYC+ + ++ +L+ S TP+ V D
Sbjct: 224 QLGLTKFSYCL-----LSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKP 278
Query: 282 -----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKE 336
YYV+L IS+G + + I P + D + G IDSGTT T++ A++ L E
Sbjct: 279 AFSVYYYVSLRRISIGGRSVKI-PYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNE 337
Query: 337 VEDLFQGLLPSYPMDPAWHL--CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVF-YQE 393
+ + ++ L C++ + ++L+ P + HF GGAD+ L E+ F +
Sbjct: 338 FISQVKNYERALMVEALSGLKPCFNVSGAKELE-LPQLRLHFKGGADVELPLENYFAFLG 396
Query: 394 SSSVFCLAV---GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
S V C V G +G I+G QN+ V YDL +++L F++ C+
Sbjct: 397 SREVACFTVVTDGAEKASGPGM----ILGNFQMQNFYVEYDLQNERLGFKKESCK 447
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 110/404 (27%), Positives = 170/404 (42%), Gaps = 75/404 (18%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ-------PCEQCGATTFDPSKSLTYA 147
TVPV ++G PP VLDTGS L W++C P Q A F+ S S TYA
Sbjct: 61 TVPV-----AVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPA-AFNGSASSTYA 114
Query: 148 TLPCDSSYCT---ND------CGGYPD-ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGK 197
C S C D C G P C ++ Y + + G + ++ F G
Sbjct: 115 AAHCSSPECQWRGRDLPVPPFCAGPPSXSCRVSLSYADASSADGILAADTFLL-----GG 169
Query: 198 TFLYDVGFGC--SHNNAHFSD----EQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNL 250
FGC S+++A ++ E TG+ G+ + S V + + +F+YCI
Sbjct: 170 APPVXALFGCVTSYSSATATNSSDSEAATGLLGM---NRGSLSFVTQTATLRFAYCIAPG 226
Query: 251 NYFEYAYNMLIL-GEGAILEGD---------STPMSVIDG-SYYVTLEGISLGEKMLDID 299
+ +L+L G+GA L S P+ D +Y V LEGI +G +L I
Sbjct: 227 D----GPGLLVLGGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIP 282
Query: 300 PNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS-----YPMDPAW 354
++ + T + +DSGT T+L+ AY L+ E + LL + A+
Sbjct: 283 KSVLAPDHTGAGQ-TMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAF 341
Query: 355 HLCYSGNINRDLQG---FPAMAFHFAGGADLVLDAESVFYQ---------ESSSVFCLAV 402
C+ + R P + GA++ + E + Y+ + +V+CL
Sbjct: 342 DACFRASEARVAAASXMLPEVGLVLR-GAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTF 400
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
G SD+ G +IG QQN V YDL + ++ F C+L
Sbjct: 401 GNSDMAG---MSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCDL 441
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 149/380 (39%), Gaps = 63/380 (16%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GAT----------TFDPS 141
Y N +IG P L LDTGS L W+ C C G T ++PS
Sbjct: 112 YANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGETHMNAQRIRLNIYNPS 171
Query: 142 KSLTYATLPCDSSYCT--NDCGGYPDECWYNIRYTN-GPDSQGTIGSEQFNFETSDEGKT 198
S + + + C+S+ C N C +C Y IRY + G S G + + + T +EG+
Sbjct: 172 ISTSSSKVTCNSTLCALRNRCISPLSDCPYRIRYLSPGSKSTGVLVEDVIHMST-EEGEA 230
Query: 199 FLYDVGFGCSHNN-AHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFE 254
+ FGCS F + G+ GL A + +++ K G FS C G
Sbjct: 231 RDARITFGCSETQLGLFQEVAVNGIMGLAMADIAVPNMLVKAGVASDSFSMCFGP----- 285
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFK--KNDTWSDA 312
G+G I GD + T G ++ D+ FK K +
Sbjct: 286 -------NGKGTISFGDKG-----SSDQHETPLGGTISPLFYDVSITKFKVGKVTVETKF 333
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVE-DLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPA 371
DSGT +TWL+ Y L + LP+ +D + CY D + P+
Sbjct: 334 SAIFDSGTAVTWLLDPYYTALTTNFHLSVPDRRLPA-NVDSTFEFCYIITSTSDEEKLPS 392
Query: 372 MAFHFAGGAD-------LVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
++F GGA LV D +Q V+CLAV D D +IIG
Sbjct: 393 ISFEMKGGAAYDVFSPILVFDTSDGSFQ----VYCLAVLKQDK-----ADFNIIGQNFMT 443
Query: 425 NYNVAYDLVSKQLYFQRIDC 444
NY + +D L +++ +C
Sbjct: 444 NYRIVHDRERMILGWKKSNC 463
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 144/371 (38%), Gaps = 86/371 (23%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCTN 158
F V+ + G PP +LDTGSS+ W +C+ C
Sbjct: 128 FLVDVAFGTPPQNFTLILDTGSSITWTQCKACTV-------------------------- 161
Query: 159 DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQ 218
E YN+ Y + S G G + E SD + F FG NN
Sbjct: 162 -------ENNYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQ----FGRGRNNKGDFGSG 210
Query: 219 FTGVFGLGPATSSTHS-LVEKVGSKFSYC------IGNLNYFEYA--------YNMLILG 263
G+ GLG ST S K FSYC IG+L + E A + L+ G
Sbjct: 211 VDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLFGEKATSQSSSLKFTSLVNG 270
Query: 264 EGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLT 323
G + E G Y+V L IS+G + L+I ++F + G IDS T +T
Sbjct: 271 PGTLQE---------SGYYFVNLSDISVGNERLNIPSSVF------ASPGTIIDSRTVIT 315
Query: 324 WLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH-------LCYSGNINRDLQGFPAMAFHF 376
L AY L+ + YP+ CY+ + +D+ P + HF
Sbjct: 316 RLPQRAYSALKAAFKKAMA----KYPLSNGRRKKGDILDTCYNLSGRKDVL-LPEIVLHF 370
Query: 377 AGGADLVLDAESVFYQESSSVFCLAVG---PSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
GGAD+ L+ ++ + S CLA S +N E L+IIG Q + V YD+
Sbjct: 371 GGGADVRLNGTNIVWGSDESRLCLAFAGNSKSTMNPE----LTIIGNRQQLSLTVLYDIQ 426
Query: 434 SKQLYFQRIDC 444
++ F+ C
Sbjct: 427 GGRIGFRSNGC 437
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 109/410 (26%), Positives = 165/410 (40%), Gaps = 85/410 (20%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT-TFDPSKSLTYATLPCDS 153
TVPV ++G PP VLDTGS L W+ C T F+ S S +Y +PC S
Sbjct: 56 TVPV-----AVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTPAFNASGSSSYGAVPCPS 110
Query: 154 SYCTNDCGGYP----------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
+ C P + C ++ Y + + G + ++ F G V
Sbjct: 111 TACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLL----TGGAPPVAV 166
Query: 204 G--FGC---------SHNNAHFSD--EQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGN 249
G FGC +++N +D E TG+ G+ T S V + G+ +F+YCI
Sbjct: 167 GAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGM---NRGTLSFVTQTGTRRFAYCIAP 223
Query: 250 LNYFEYAYNMLILGEGAILEGD----------------STPMSVIDG-SYYVTLEGISLG 292
G G +L GD S P+ D +Y V LEGI +G
Sbjct: 224 GE-----------GPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVG 272
Query: 293 EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PS 347
+L I ++ + T + +DSGT T+L+ AY L+ E + LL P
Sbjct: 273 CALLPIPKSVLTPDHTGAGQ-TMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPG 331
Query: 348 YPMDPAWHLCYSGNINR--DLQGFPAMAFHFAGGADLVLDAESVFYQ---------ESSS 396
+ A+ C+ G R G + GA++ + E + Y + +
Sbjct: 332 FVFQGAFDACFRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEA 391
Query: 397 VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
V+CL G SD+ G +IG QQN V YDL + ++ F C+L
Sbjct: 392 VWCLTFGNSDMAG---MSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCDL 438
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 163/391 (41%), Gaps = 75/391 (19%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLP 150
+F + S+G+PPV L +DTGS+L WV+CQPC C FDP +S T +
Sbjct: 113 LFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172
Query: 151 CDSSYCTN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTF 199
C S C +C D C Y++ Y NG S G + ++ G +F
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSF 227
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLN 251
+ D+ FGCS + + E G+ SS+ S E++ FSYC L
Sbjct: 228 M-DLMFGCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKAFSYC---LP 278
Query: 252 YFEYAYNMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKN 306
E +ILG + A ++G TP+ S+ +Y +T+E I+ G++++
Sbjct: 279 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT--------- 329
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY------ 358
S + + +DSG T L PS + L K + G + ++CY
Sbjct: 330 ---SSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDY 386
Query: 359 ---SGNIN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
+G I + P + FAGGA L L +VFY + C+ + +
Sbjct: 387 SGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQ-- 444
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
I+G +++ +D+ KQ F+ C
Sbjct: 445 ---ILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 144/361 (39%), Gaps = 55/361 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------TFDPSKSLTYATLPCD 152
+ V S+G P V Q +DTGS L WV+C+PC + FDP++S +YA +PC
Sbjct: 48 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCG 107
Query: 153 SSYCTNDCGGY------PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFG 206
C G Y +C Y + Y +G ++ G S+ S + F FG
Sbjct: 108 GPVCAG-LGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFF----FG 162
Query: 207 CSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV----GSKFSYCIGNLNYFEYAYNMLIL 262
C H + F GV GL SLVE+ G FSYC+ + +
Sbjct: 163 CGHAQSGL----FNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVG 218
Query: 263 GEGAILEGDST----PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
G G ST P Y V L GIS+G + L + + F
Sbjct: 219 GPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG------ 272
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGL-LPSYPMDPAWHLCYS----GNINRDLQGFPAMA 373
T +T L P+AY LR P+ P + CY+ G + P +A
Sbjct: 273 -TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVT-----LPNVA 326
Query: 374 FHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLV 433
F GA + L A+ + S CLA PS +G ++I+G + Q+++ V D
Sbjct: 327 LTFGSGATVTLGADGIL-----SFGCLAFAPSGSDG----GMAILGNVQQRSFEVRIDGT 377
Query: 434 S 434
S
Sbjct: 378 S 378
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 136/378 (35%), Gaps = 55/378 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSY 155
++ +G P P L VLDTGS ++W++C PC +C FDP S +Y + C +
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPL 206
Query: 156 CTN-DCGG---YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNN 211
C D GG C Y + Y +G + G +E F + + V GC H+N
Sbjct: 207 CRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGAR----VPRVALGCGHDN 262
Query: 212 AHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGEGA- 266
+ + S + + G FSYC+ + + + G GA
Sbjct: 263 EGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGSGAR 322
Query: 267 ------ILEGDSTP-------MSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
+L D + G G DP+ + G
Sbjct: 323 GALGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPPDPSTGR-------GG 375
Query: 314 VFIDSGT-TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF--- 369
V +DSG + W GL S + CY DL G
Sbjct: 376 VIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSPGGFSLFDTCY------DLSGLKVV 429
Query: 370 --PAMAFHFAGGADLVLDAESVFYQ-ESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
P ++ HFAGGA+ L E+ +S FC A +D +SIIG I QQ +
Sbjct: 430 KVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTD------GGVSIIGNIQQQGF 483
Query: 427 NVAYDLVSKQLYFQRIDC 444
V +D ++L F C
Sbjct: 484 RVVFDGDGQRLGFVPKGC 501
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 105/442 (23%), Positives = 172/442 (38%), Gaps = 56/442 (12%)
Query: 48 LLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQ 107
++ N + T A ++R ++ + +S R+ L+ I+ V ++ V+ G
Sbjct: 78 MMGNGSGTGSASSRRRQAKESSKLPEVMSATSMFELPMRSALN--IAHVGMYLVSVRFGT 135
Query: 108 PPVPQLAVLDTGSSLIWVKCQPCEQCGA-----------------------TTFDPSKSL 144
P +P VLDT + L W+ C+ + G + P+KS
Sbjct: 136 PALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARRKNWYRPAKSS 195
Query: 145 TYATLPCDSSYCT----NDCGG--YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT 198
++ + C C N C + C Y + +G + G G E+ SD
Sbjct: 196 SWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGKEKATVTVSDGRMA 255
Query: 199 FLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS--THSLVEKVGSKFSYCIGNLN----- 251
L + GCS A S + GV LG S H+ ++ G +FS+C+ + N
Sbjct: 256 KLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHA-AKRFGQRFSFCLLSANSSRDA 314
Query: 252 --YFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
Y + N ++G G +E D + +Y + GI +G + LDI P +
Sbjct: 315 SSYLTFGPNPAVMGPGT-METDIVYNVDVKPAYGPLVTGIFVGGERLDI-PQEIWDAEKV 372
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
GV +D+ T++T LVP AY + ++ L Y +D + CY D
Sbjct: 373 VGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELD-GFEYCYRWTFAGDGVDL 431
Query: 370 ------PAMAFHFAGGADLVLDAESVFYQE-SSSVFCLAVGPSDINGERFKDLSIIGMIA 422
P + AGGA L +A+SV E V CLA G I+G +
Sbjct: 432 THNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLPRGGP-----GILGNVL 486
Query: 423 QQNYNVAYDLVSKQLYFQRIDC 444
Q Y D ++ F++ C
Sbjct: 487 MQEYIWEIDHGKGKMRFRKDKC 508
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 110/445 (24%), Positives = 177/445 (39%), Gaps = 75/445 (16%)
Query: 30 PAAGKPKRLVTK----LLHRDSLLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDT 85
P G P R +K + HRD L+ R L + + + +
Sbjct: 50 PGDGLPNRDSSKYYRVMAHRDRLIRG----------RRLASEDQSLVTFADGNETIRVNA 99
Query: 86 RAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC-------GATTF 138
LH Y N ++G P L LDTGS L W+ C C G ++
Sbjct: 100 LGFLH---------YANVTVGTPSDWFLVALDTGSDLFWLPCDCSTNCVRELKAPGGSSL 150
Query: 139 D-----PSKSLTYATLPCDSSYCT--NDCGGYPDECWYNIRY-TNGPDSQGTIGSEQFNF 190
D P+ S T + +PC+S+ CT + C +C Y IRY +NG S G + + +
Sbjct: 151 DLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQIRYLSNGTSSTGVLVEDVLHL 210
Query: 191 ETSDEG-KTFLYDVGFGCS--HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFS 244
+ ++ K + GC G+FGLG S S++ K G + FS
Sbjct: 211 VSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFS 270
Query: 245 YCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVID--GSYYVTLEGISLGEKMLDIDPNL 302
C G+ ++ G+ ++ TP+++ +Y VT+ IS+G D++ +
Sbjct: 271 MCFGDDGAGRISF-----GDKGSVDQRETPLNIRQPHPTYNVTVTQISVGGNTGDLEFD- 324
Query: 303 FKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA--WHLCYSG 360
VF D+GT+ T+L + Y + + L L Y D + CY+
Sbjct: 325 ----------AVF-DTGTSFTYLTDAPYTLISESFNSL--ALDKRYQTDSELPFEYCYAV 371
Query: 361 NINRDLQGFPAMAFHFAGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIG 419
+ N+ +P + GG+ V V E + V+CLA+ S+ D+SIIG
Sbjct: 372 SPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIEDTVVYCLAIMKSE-------DISIIG 424
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDC 444
Y V +D L ++ DC
Sbjct: 425 QNFMTGYRVVFDREKLILGWKESDC 449
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 160/379 (42%), Gaps = 54/379 (14%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTY 146
T ++Y +G PP +DTGS + WV C PC C + FDP KS +
Sbjct: 44 TTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIFDPEKSTSK 103
Query: 147 ATLPCDSSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
++ C C + C C Y+ Y +G + G + ++ +F G +
Sbjct: 104 TSISCTDEECYLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATS 163
Query: 203 ----VGFGCSHNNAH--FSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNLNYF 253
+ FGC N +D G+ G G A S S + K F++C+ N
Sbjct: 164 GTARLTFGCGSNQTGTWLTD----GLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDN-- 217
Query: 254 EYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDI-DPNLFKKNDTWSDA 312
L++G I E ++ + +E +++G ++ P F D +
Sbjct: 218 -KGSGTLVIGH--IREPGLVYTPIVPKQSHYNVELLNIGVSGTNVTTPTAF---DLSNSG 271
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ-GLLPSYPMDPAWHLCYSGNINRDLQG-FP 370
GV +DSGTTLT+LV AY + +V D + G+LP C ++G FP
Sbjct: 272 GVIMDSGTTLTYLVQPAYDQFQAKVRDCMRSGVLPV----AFQFFC-------TIEGYFP 320
Query: 371 AMAFHFAGGADLVLDAESVFYQE----SSSVFCLA-VGPSDINGERFKDLSIIGMIAQQN 425
+ +FAGGA ++L S Y+E S +C + + + + G + +I G ++
Sbjct: 321 NVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYG--YLSYTIFGDNVLKD 378
Query: 426 YNVAYDLVSKQLYFQRIDC 444
V YD V+ ++ ++ DC
Sbjct: 379 QLVVYDNVNNRIGWKNFDC 397
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 98/397 (24%), Positives = 154/397 (38%), Gaps = 54/397 (13%)
Query: 93 ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA----------------- 135
I+ V ++ V+ G P +P VLDT + L W+ C+ + G
Sbjct: 121 IAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAA 180
Query: 136 ------TTFDPSKSLTYATLPCDSSYCT----NDCGG--YPDECWYNIRYTNGPDSQGTI 183
+ P+KS ++ + C C N C + C Y + +G + G
Sbjct: 181 KEARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIY 240
Query: 184 GSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS--THSLVEKVGS 241
G E+ SD L + GCS A S + GV LG S H+ ++ G
Sbjct: 241 GKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHA-AKRFGQ 299
Query: 242 KFSYCIGNLN-------YFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEK 294
+FS+C+ + N Y + N ++G G +E D + +Y + GI +G +
Sbjct: 300 RFSFCLLSANSSRDASSYLTFGPNPAVMGPGT-METDIVYNVDVKPAYGPLVTGIFVGGE 358
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
LDI P + GV +D+ T++T LVP AY + ++ L Y +D +
Sbjct: 359 RLDI-PQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELD-GF 416
Query: 355 HLCYSGNINRDLQGF------PAMAFHFAGGADLVLDAESVFYQE-SSSVFCLAVGPSDI 407
CY D P + AGGA L +A+SV E V CLA
Sbjct: 417 EYCYRWTFAGDGVDLAHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLPR 476
Query: 408 NGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
G I+G + Q Y D ++ F++ C
Sbjct: 477 GGP-----GILGNVLMQEYIWEIDHGKGKMRFRKDKC 508
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 109/411 (26%), Positives = 165/411 (40%), Gaps = 87/411 (21%)
Query: 95 TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT-TFDPSKSLTYATLPCDS 153
TVPV ++G PP VLDTGS L W+ C T F+ S S +Y +PC S
Sbjct: 56 TVPV-----AVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTPAFNASGSSSYGAVPCPS 110
Query: 154 SYCTNDCGGYP----------DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
+ C P + C ++ Y + + G + ++ F G V
Sbjct: 111 TACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLL----TGGAPPVAV 166
Query: 204 G--FGC---------SHNNAHFSD--EQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGN 249
G FGC +++N +D E TG+ G+ T S V + G+ +F+YCI
Sbjct: 167 GAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGM---NRGTLSFVTQTGTRRFAYCIAP 223
Query: 250 LNYFEYAYNMLILGEGAILEGD----------------STPMSVIDG-SYYVTLEGISLG 292
G G +L GD S P+ D +Y V LEGI +G
Sbjct: 224 GE-----------GPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVG 272
Query: 293 EKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLL-----PS 347
+L I ++ + T + +DSGT T+L+ AY L+ E + LL P
Sbjct: 273 CALLPIPKSVLTPDHTGAGQ-TMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPG 331
Query: 348 YPMDPAWHLCYSGNINRDLQG---FPAMAFHFAGGADLVLDAESVFYQ---------ESS 395
+ A+ C+ G R P + GA++ + E + Y +
Sbjct: 332 FVFQGAFDACFRGPEARVAAASGLLPEVGLVLR-GAEVAVSGEKLLYMVPGERRGEGGAE 390
Query: 396 SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCEL 446
+V+CL G SD+ G +IG QQN V YDL + ++ F C+L
Sbjct: 391 AVWCLTFGNSDMAG---MSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCDL 438
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 141/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSRGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 151/379 (39%), Gaps = 53/379 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-------------FDPSKSL 144
+ Y IG P L LD GS L+W+ C C QC + + PS+SL
Sbjct: 96 LHYTWIDIGTPSTSFLVALDAGSDLLWIPCD-CVQCAPLSSSYYSNLDRDLNEYSPSRSL 154
Query: 145 TYATLPCDSSYCT--NDCGGYPDECWYNIRY-TNGPDSQGTIGSEQFNFET----SDEGK 197
+ L C C ++C +C Y + Y + S G + + + ++ S+
Sbjct: 155 SSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGTLSNSSV 214
Query: 198 TFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFE 254
+G G + + G+ GLGP SS S + K G FS C + E
Sbjct: 215 QAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSGLIHYSFSLC-----FNE 269
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDG---SYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ G+ ST +DG +Y + +E +G L + FK
Sbjct: 270 DDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYIIGVESCCIGNSCLKM--TSFKAQ----- 322
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPA 371
+DSGT+ T+L Y + +E + G S+ P W CY + ++DL P+
Sbjct: 323 ----VDSGTSFTFLPGHVYGAITEEFDQQVNGSRSSFEGSP-WEYCYVPS-SQDLPKVPS 376
Query: 372 MAFHFAGGADLVL-DAESVFYQESSSV-FCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
F V+ D VFY + FCLA+ P++ D+ IG Y +
Sbjct: 377 FTLMFQRNNSFVVYDPVFVFYGNEGVIGFCLAILPTE------GDMGTIGQNFMTGYRLV 430
Query: 430 YDLVSKQLYFQRIDCELLA 448
+D +K+L + R +C+ L+
Sbjct: 431 FDRGNKKLAWSRSNCQDLS 449
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 108/434 (24%), Positives = 183/434 (42%), Gaps = 70/434 (16%)
Query: 41 KLLHRDS---LLYNPNDTVDAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTV- 96
+ +HRDS L ++P T +A+ ++ SMAR + ++ ++ A + + V
Sbjct: 7 EFIHRDSVKSLFHDPTLTPEARLRQAARRSMARHAHAARINNSAAAAGASGSDDSDADVV 66
Query: 97 -PVFYVNFS------IGQPPVPQLAVLDTGSSLIWVKCQ-PCEQCGATTFDPSKSLTYAT 148
P+ NF + PPV LA+ DTGSSL+W+KC+ P A++ +YA
Sbjct: 67 SPMVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKCKLPAAHTPASS-------SYAR 119
Query: 149 LPCDSSYCT--------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
LPCD+ C G + C Y + +G + G + + F F T
Sbjct: 120 LPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFSTR------- 172
Query: 201 YDVGFGCSHNNAHFS--DEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYN 258
+ FGC+ S D+ G+ + S S KFSYC+ + E +
Sbjct: 173 --LDFGCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSS 230
Query: 259 MLILGEGAILEGD----STPMSV-IDGSYY-VTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
L G AI+ +TP+ + S+Y + L+ I + K + + K
Sbjct: 231 SLNFGSHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVPLQTTTTK-------- 282
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM-DPAWHLCYSGNINRDL----- 366
+ +DSGT LT+L + L + + LP + + +CY ++ R
Sbjct: 283 -LIVDSGTMLTYLPKAVLDPLVAALTAAIK--LPRVKSPETLYAVCY--DVRRRAPEDVG 337
Query: 367 QGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQN 425
+ P + GG ++ L + F E+ + CLA+ S + F I+G +AQQN
Sbjct: 338 KSIPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHL--PEF----ILGNVAQQN 391
Query: 426 YNVAYDLVSKQLYF 439
+V +DL + + F
Sbjct: 392 LHVGFDLERRTVSF 405
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 97/402 (24%), Positives = 159/402 (39%), Gaps = 64/402 (15%)
Query: 93 ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKC-----------------------QP 129
I+ V ++ V+ IG P +P VLDT + L W+ C +
Sbjct: 119 IAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEG 178
Query: 130 CEQCGATTFDPSKSLTYATLPCDSSYCT----NDCG--GYPDECWYNIRYTNGPDSQGTI 183
++ + P+KS ++ + C C N C + C Y + +G + G
Sbjct: 179 AKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIY 238
Query: 184 GSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS--THSLVEKVGS 241
G E+ SD L + GCS A S + GV LG S H+ ++ G
Sbjct: 239 GKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHA-AKRFGQ 297
Query: 242 KFSYCIGNLN-------YFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEK 294
+FS+C+ + N Y + N ++G G +E D + +Y + G+ +G +
Sbjct: 298 RFSFCLLSANSSRDASSYLTFGPNPAVMGPGT-METDILYNVDVKPAYGAQVTGVLVGGE 356
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
LDI P+ + + GV +D+ T++T LVP AY + ++ L Y ++ +
Sbjct: 357 RLDI-PDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELE-GF 414
Query: 355 HLCYSGNINRDLQG------FPAMAFHFAGGADLVLDAESVFYQE-SSSVFCLAVGPSDI 407
CY D P+ AGGA L +A+SV E V CLA
Sbjct: 415 EYCYKWTFTGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLA------ 468
Query: 408 NGERFKDL-----SIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
F+ L I+G + Q Y D ++ F++ C
Sbjct: 469 ----FRKLLRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC 506
>gi|357114697|ref|XP_003559132.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 416
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 98/367 (26%), Positives = 158/367 (43%), Gaps = 43/367 (11%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSSY 155
+V+ GQ Q+ LDT +S+ WV C+PC+ Q G F P+ S T+ + +
Sbjct: 70 FVSIGTGQGFKLQVLGLDTSTSMSWVMCEPCQPSLPQAG-HLFSPAASPTFHGVHSNDPV 128
Query: 156 CTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF- 214
CT + C + + +G S+ T ++ + + FGC+H+ A F
Sbjct: 129 CTAPYRPTANGCSFRFPFASGYLSRDTFHLRNGGLSGGAPIES-VPGIMFGCAHSVAGFH 187
Query: 215 SDEQFTGVFGLGPATSSTHS-LVEKVGSKFSYCI-----GNLNYFEYAYNMLILGEGA-- 266
+D GV L S + L + G +FSYC+ GN + F L LG
Sbjct: 188 NDGTLGGVLSLSHLRLSLLTQLSARAGGRFSYCLPKPTQGNPHGF------LRLGADVLP 241
Query: 267 -ILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTL 322
+ T ++V GS YY++L GI+L EK L IDP +F G I+ T+
Sbjct: 242 PLPHSHMTALTVRSGSAPDYYLSLVGITLAEKRLRIDPRVFAAG----RGGCSINPAATI 297
Query: 323 TWLVPSAYQTLRKEV----EDLFQGLLPSYPMDPAWHLCYSGNINRDLQG-FPAMAFHFA 377
T ++ AY + + + ++L + P P + + + +Q P+MAFHF
Sbjct: 298 TAIMEPAYLVVERALVAYMKELGSDRVKKGP--PGGGALFFDRMYKSVQARLPSMAFHFK 355
Query: 378 GGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GA+L E +F + + VG G R ++IG Q N +D+ + +L
Sbjct: 356 DGAELWFTPEQLFEVHGMVAWFMMVG----KGYR---RTVIGAPQQVNTRFTFDVAAGRL 408
Query: 438 YFQRIDC 444
F C
Sbjct: 409 SFASELC 415
>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 289
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/286 (27%), Positives = 121/286 (42%), Gaps = 37/286 (12%)
Query: 165 DECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFG 224
+C + I Y +G + G ++ + F FGC H H F GV G
Sbjct: 35 KQCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFY----FGCGHGK-HAVRGLFDGVLG 89
Query: 225 LGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDS-TPMSVIDGS-- 281
LG SL + G FSYC+ +++ L LG G G TPM + G
Sbjct: 90 LG---RLRESLGARYGGVFSYCLPSVSSKP---GFLALGAGKNPSGFVFTPMGTVPGQPT 143
Query: 282 -YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDL 340
VTL GI++G K LD+ P+ F G+ +DSGT +T L +AY+ LR
Sbjct: 144 FSTVTLAGINVGGKKLDLRPSAF-------SGGMIVDSGTVITGLQSTAYRALRSAFRKA 196
Query: 341 FQG--LLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVF 398
+ LLP+ +D ++L N+ P +A F GGA + LD +
Sbjct: 197 MEAYRLLPNGDLDTCYNLTGYKNVV-----VPKIALTFTGGATINLDVPNGILVNG---- 247
Query: 399 CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
CLA S +G ++G + Q+ + V +D + + F+ C
Sbjct: 248 CLAFAESGPDGS----AGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 289
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 149/364 (40%), Gaps = 40/364 (10%)
Query: 102 NFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC-- 156
NF+IG PP A +D L+W +C C C F P+ S T+ PC + C
Sbjct: 57 NFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKS 116
Query: 157 --TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
T C D C Y+ G + G + ++ F T+ +GFGC +
Sbjct: 117 IPTPKCAS--DVCAYDGVTGLGGHTVGIVATDTFAIGTAAPAS-----LGFGCVVASDID 169
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEG--- 270
+ +G GLG + SLV ++ ++FSYC+ + + + L LG A L G
Sbjct: 170 TMGGPSGFIGLG---RTPWSLVAQMKLTRFSYCLAPHDTGK--NSRLFLGASAKLAGGGA 224
Query: 271 -----DSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
++P + Y + LE I G+ + + +N V ++ L
Sbjct: 225 WTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRG---RNTVLVQTAV-----VRVSLL 276
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
V S YQ +K V + P+ + +C+ + G P + F F GA L +
Sbjct: 277 VDSVYQEFKKAVMASVGAAPTATPVGAPFEVCFP---KAGVSGAPDLVFTFQAGAALTVP 333
Query: 386 AESVFYQESSSVFCLAV-GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + + CL+V + +N L+I+G Q+N ++ +DL L F+ DC
Sbjct: 334 PANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADC 393
Query: 445 ELLA 448
L+
Sbjct: 394 SSLS 397
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/380 (23%), Positives = 159/380 (41%), Gaps = 50/380 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA---TTFDPSKSLTYATLPCDSS- 154
+Y + +G P + ++DTGS L W+KC PC+ C T +D ++S++Y + C++S
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNSQ 159
Query: 155 YCTNDCGG------YPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF-LYDVGFGC 207
C+N G +C + Y +G S G++ ++ ET GK + D FGC
Sbjct: 160 LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGC 219
Query: 208 SHNNAHFSDEQFTGVFGLGPATSST-HSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGA 266
+ + +G+ GL + L ++ G KFS+C + + + ++ G
Sbjct: 220 AQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGNAE 279
Query: 267 ILEGDSTPMSVI-------DGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSG 319
+ SV Y+V L+G+S+ L + P + V +DSG
Sbjct: 280 LPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPR---------GSVVILDSG 330
Query: 320 TTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL--CY---SGNINRDLQGFPAMAF 374
++ + V + LR+ L D L C+ + +I+ + P+++
Sbjct: 331 SSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLSL 390
Query: 375 HFAGGADLVLDAESVF-----YQESSSVFCLAV---GPSDINGERFKDLSIIGMIAQQNY 426
F G + + + V YQ + C A GP+ +N +IG QQN
Sbjct: 391 VFEDGVTIGIPSIGVLLPVARYQNHVKM-CFAFEDGGPNPVN--------VIGNYQQQNL 441
Query: 427 NVAYDLVSKQLYFQRIDCEL 446
V YD+ ++ F R C +
Sbjct: 442 WVEYDIQRSRVGFARASCVI 461
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 153/379 (40%), Gaps = 53/379 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-------------FDPSKSL 144
+ Y IG P L LD GS L+W+ C C QC + + PS+SL
Sbjct: 95 LHYTWIDIGTPSTSFLVALDAGSDLLWIPCD-CVQCAPLSSSYYSNLDRDLNEYSPSRSL 153
Query: 145 TYATLPCDSSYCT--NDCGGYPDECWYNIRY-TNGPDSQGTIGSEQFNFET----SDEGK 197
+ L C C ++C +C Y + Y + S G + + + ++ S+
Sbjct: 154 SSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGSLSNSSV 213
Query: 198 TFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFE 254
+G G + + G+ GLGP SS S + K G FS C + E
Sbjct: 214 QAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSGLIHDSFSLC-----FNE 268
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDG---SYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
+ G+ ST +DG +Y + +E +G L + FK
Sbjct: 269 DDSGRIFFGDQGPTIQQSTSFLPLDGLYSTYIIGVESCCVGNSCLKM--TSFK------- 319
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPA 371
V +DSGT+ T+L Y + +E + G S+ P W CY + +++L P+
Sbjct: 320 --VQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSRSSFEGSP-WEYCYVPS-SQELPKVPS 375
Query: 372 MAFHFAGGADLVL-DAESVFYQESSSV-FCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
+ F V+ D VFY + FCLA+ P++ D+ IG Y +
Sbjct: 376 LTLTFQQNNSFVVYDPVFVFYGNEGVIGFCLAIQPTE------GDMGTIGQNFMTGYRLV 429
Query: 430 YDLVSKQLYFQRIDCELLA 448
+D +K+L + R +C+ L+
Sbjct: 430 FDRGNKKLAWSRSNCQDLS 448
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 120/298 (40%), Gaps = 43/298 (14%)
Query: 157 TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD 216
T C G C Y ++Y +G + G + + D K F FGC N
Sbjct: 13 TRGCSG--GHCLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFR----FGCGERNEGLFG 66
Query: 217 EQFTGVFGLGPA-TSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLILGEGAILEGD 271
E G+ GLG TS +K G F++C Y E+ +
Sbjct: 67 EA-AGLLGLGRGKTSLPVQTYDKYGGVFAHCFPARSSGTGYLEFGPGSSPAVSAKL---S 122
Query: 272 STPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSA 329
+TPM + G YYV + GI +G K+L I ++F AG +DSGT +T L P+A
Sbjct: 123 TTPMLIDTGPTFYYVGMTGIRVGGKLLPIPQSVFAA------AGTIVDSGTVITRLPPAA 176
Query: 330 YQTLRKEVEDLFQGLLPSYPMDPAWHL---CYSGNINRDLQG-----FPAMAFHFAGGAD 381
Y +LR Y PA L CY DL G P ++ F GG
Sbjct: 177 YSSLRSAFAASMAAR--GYKRAPALSLLDTCY------DLTGASEVAIPTVSLLFQGGVS 228
Query: 382 LVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
L +DA + Y S S CL E D++I+G + + V YD+ SK + F
Sbjct: 229 LDVDASGIIYAASVSQACLGF----AGNEAADDVAIVGNTQLKTFGVVYDIASKVVGF 282
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 146/345 (42%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCI----GNLNYFEYAYNMLI 261
N F +F V GL + S++++ + FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L ++ VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSKGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 88/370 (23%), Positives = 158/370 (42%), Gaps = 50/370 (13%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLPCDS 153
+ S+G PPV L +DTGS+L WV+C+ C+ +C F+P S TY+ + C +
Sbjct: 1 MGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCST 60
Query: 154 SYCTN---------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
C C D C Y++RY +G S G +G ++ ++ F+
Sbjct: 61 EACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNFI---- 116
Query: 205 FGCSHNNAHFSDEQFTGVFGLGPATSSTHSLV--EKVGSKFSYCI-------GNLNYFEY 255
FGC +N + + G+ G G + S + V + + FSYC G+L Y
Sbjct: 117 FGCGEDNLY--NGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTIGPY 174
Query: 256 AYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
A ++ ++ I D P +Y + + + L+IDP ++ T
Sbjct: 175 ARDINLMWTKLIYY-DHKP------AYAIQQLDMMVNGIRLEIDPYIYISKMT------I 221
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNI-NRDLQGFPAMAF 374
+DSGT T+++ + L K + Q + D +C+ N + + FP +
Sbjct: 222 VDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDER-RICFISNSGSANWNDFPTVEM 280
Query: 375 HFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVS 434
+ L L E+ FY+ S++V C P D + + ++G A +++ + +D+ +
Sbjct: 281 KLI-RSTLKLPVENAFYESSNNVICSTFLPDDAG---VRGVQMLGNRAVRSFKLVFDIQA 336
Query: 435 KQLYFQRIDC 444
F+ C
Sbjct: 337 MNFGFKARAC 346
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 76/213 (35%), Positives = 108/213 (50%), Gaps = 18/213 (8%)
Query: 241 SKFSYCIGNLNYFEYAYNMLILGEGAILEGD--STPMSVIDGS---YYVTLEGISLGEKM 295
+KFSYC+ +++ + ++L+LG A D STP+ YY++LEGI +G
Sbjct: 4 AKFSYCLTSMD--DSKASVLLLGSLAKATKDAISTPLLTNPSQPSFYYLSLEGIPVGGTQ 61
Query: 296 LDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWH 355
L I+ ++F +D S GV IDSGTT+T+L S + TL+KE L
Sbjct: 62 LSIEQSIFDVSDDGS-GGVIIDSGTTITYLEKSVFDTLKKEFISQ-SNLQLDKSSSTGLD 119
Query: 356 LCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPSDINGERFKD 414
+C+S P + FHF GG DL L AES +S V CLA+G S NG
Sbjct: 120 VCFSLPSETTQVEVPKLVFHFKGG-DLELPAESYMIADSKLGVACLAMGAS--NG----- 171
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+SI G + QQN V +DL + + F C+ L
Sbjct: 172 MSIFGNVQQQNILVNHDLEKETISFVPTQCDQL 204
>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
Length = 382
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/271 (31%), Positives = 119/271 (43%), Gaps = 31/271 (11%)
Query: 193 SDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLN 251
S G L VG A +G+ GLG SLV + G+ KFSYC+
Sbjct: 127 SQAGPAVLKLVGLRAPSRRAR--SMAPSGLMGLG---RGRLSLVSQTGATKFSYCLTPYF 181
Query: 252 YFEYAYNMLILGEGAIL--EGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFK 304
+ A L +G A L GD + G YY+ L G+++GE L I +F
Sbjct: 182 HNNGATGHLFVGASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFD 241
Query: 305 KNDTWS---DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPA-WHLCYSG 360
+ GV IDSG+ T LV AY L E+ G L + P D LC +
Sbjct: 242 LREVAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVA- 300
Query: 361 NINRDL-QGFPAMAFHFAGGADLVLDAESVFY---QESSSVFCLAVGPSDINGERFKDLS 416
RD+ + PA+ FHF GGAD+ + AES + + ++ + + GP ++ S
Sbjct: 301 --RRDVGRVVPAVVFHFRGGADMAVPAESYWAPVDKAAACMAIASAGP-------YRRQS 351
Query: 417 IIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+IG QQN V YDL + FQ DC L
Sbjct: 352 VIGNYQQQNMRVLYDLANGDFSFQPADCSAL 382
>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
Length = 442
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 136/355 (38%), Gaps = 82/355 (23%)
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCE--QC---GATTFDPSKSLTYATLPCDSSYCTN 158
+I P + Q +DT L W++C PC +C FDP +S T A +PC S+ C
Sbjct: 156 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 215
Query: 159 ----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN-NAH 213
G ++C Y + Y +G + GT + S T + + FGCSH +
Sbjct: 216 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPS----TVVMNFRFGCSHAVRGN 271
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDST 273
FS +F P +
Sbjct: 272 FSASTSGTMFARTPLVRNP----------------------------------------- 290
Query: 274 PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTL 333
S+I Y V L GI +G + L++ P +F G +DS +T L P+AY+ L
Sbjct: 291 --SIIPTLYLVRLRGIEVGGRRLNVPPVVFA-------GGAVMDSSVIITQLPPTAYRAL 341
Query: 334 RKEVEDLFQGLLPSYPM----DPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESV 389
R F+ + +YP CY + PA++ F GGA + LDA V
Sbjct: 342 RLA----FRSAMAAYPRVAGGRAGLDTCYD-FVRFTSVTVPAVSLVFDGGAVVRLDAMGV 396
Query: 390 FYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ CLA P+ + L IG + QQ + V YD+V + F+R C
Sbjct: 397 MVEG-----CLAFVPTPGDFA----LGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/375 (24%), Positives = 152/375 (40%), Gaps = 61/375 (16%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------------TFDPSKSLTYA 147
Y +G P + LDTGS L WV C C +C T + P KS T
Sbjct: 5 YTTVQLGTPGTKFMVALDTGSDLFWVPCD-CSRCAPTEGSPYASDFELSVYSPKKSSTSK 63
Query: 148 TLPCDSSYCT--NDCGGYPDECWYNIRYTNGPDS-QGTIGSEQFNFETSDE-GKTFLYDV 203
T+PC++S C + C C Y + Y + S G + + + +T ++ + +
Sbjct: 64 TVPCNNSLCAQRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTENKHSEPIQAYI 123
Query: 204 GFGCSH-NNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG---SKFSYC-----IGNLNYF 253
FGC + F D G+FGLG S S++ + G + FS C +G +N+
Sbjct: 124 TFGCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGVGRINF- 182
Query: 254 EYAYNMLILGEGAILEGDSTPMSV--IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
G+ LE + TP ++ + +Y +T+ I +G ++D +D
Sbjct: 183 ---------GDKGSLEQEETPFNLNQLHPNYNITVTSIRVGTTLID------------AD 221
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVE-DLFQGLLPSYPMDPAWHLCYSGNINRDLQGFP 370
DSGT+ ++ Y L G P P P + CY+ + + + P
Sbjct: 222 ITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIP-FEYCYNMSPDANASLTP 280
Query: 371 AMAFHFAGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
++ GG V D V ++ ++CLAV S +L+IIG Y +
Sbjct: 281 GISLTMKGGGPFPVYDPIIVISTQNELIYCLAVVKS-------AELNIIGQNFMTGYRIV 333
Query: 430 YDLVSKQLYFQRIDC 444
+D L +++ DC
Sbjct: 334 FDREKLVLGWKKFDC 348
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 110/449 (24%), Positives = 166/449 (36%), Gaps = 92/449 (20%)
Query: 67 SMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVN--------FSIGQPPVPQLAVLDT 118
S+AR ++L ++ + HP + Y + S+G PP P +LDT
Sbjct: 27 SLARALHLKRRDPNHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDT 86
Query: 119 GSSLIWVKCQP---CEQCGATT------FDPSKSLTYATLPCDSSYC--TNDCGGYPDEC 167
GS L WV C C C + + F P S + + C + C + +C
Sbjct: 87 GSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKC 146
Query: 168 WY---NIRYTNGPDSQGTIGSE-QFNFETSDEGKTFLYDV---------GF--GCSHNNA 212
+ N P + + + + + D GF GCS +
Sbjct: 147 RRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVSV 206
Query: 213 HFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAY--NMLILGEGAIL- 268
H G FG G S+ ++G KFSYC+ + + + A L+LG
Sbjct: 207 HQPPSGLAG-FGRG-----APSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGE 260
Query: 269 -----------EGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
GD P V YY+ L G+++G K + + P + G +D
Sbjct: 261 GMQYVPLVKSAAGDKLPYGVY---YYLALRGVTVGGKAVRL-PARAFAANAAGSGGTIVD 316
Query: 318 SGTTLTWLVPSAYQ--------------TLRKEVEDLFQGLLPSYPMDPAWHLCYSGNIN 363
SGTT T+L P+ +Q K+ ED H C++
Sbjct: 317 SGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDEL-----------GLHPCFALPQG 365
Query: 364 RDLQGFPAMAFHFAGGADLVLDAESVFY---QESSSVFCLAV-----GPSDINGERFKDL 415
P ++FHF GGA + L E+ F + + CLAV G S E
Sbjct: 366 ARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPA 425
Query: 416 SIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
I+G QQNY V YDL ++L F+R C
Sbjct: 426 IILGSFQQQNYLVEYDLEKERLGFRRQSC 454
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/406 (24%), Positives = 161/406 (39%), Gaps = 68/406 (16%)
Query: 93 ISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG------------------ 134
I+ V ++ V+ IG P +P VLDT + L W+ C+ + G
Sbjct: 118 IAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEG 177
Query: 135 ATT---------FDPSKSLTYATLPCDSSYCT----NDCGG--YPDECWYNIRYTNGPDS 179
AT + P+KS ++ + C C N C + C Y + +G +
Sbjct: 178 ATAAKKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVT 237
Query: 180 QGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSS--THSLVE 237
G G E+ SD L + GCS A S + GV LG S H+ +
Sbjct: 238 IGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHA-AK 296
Query: 238 KVGSKFSYCIGNLN-------YFEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGIS 290
+ G +FS+C+ + N Y + N ++G G +E D + +Y + G+
Sbjct: 297 RFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGT-METDILYNVDVKPAYGAKVTGVL 355
Query: 291 LGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPM 350
+G + LDI P+ + + GV +D+ T++T LVP AY + ++ L Y +
Sbjct: 356 VGGERLDI-PDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYEL 414
Query: 351 DPAWHLCYSGNINRDLQG------FPAMAFHFAGGADLVLDAESVFYQE-SSSVFCLAVG 403
+ + CY D P+ AGGA L +A+SV E V CLA
Sbjct: 415 E-GFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLA-- 471
Query: 404 PSDINGERFKDL-----SIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
F+ L I+G + Q Y D ++ F++ C
Sbjct: 472 --------FRKLLRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC 509
>gi|357119741|ref|XP_003561592.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 410
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/362 (24%), Positives = 155/362 (42%), Gaps = 38/362 (10%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE----QCGATTFDPSKSLTYATLPCDSSY 155
+V+ G+ ++ LDTG+S W+ C+PC+ Q G F P+ S T+ + D
Sbjct: 71 FVSIGTGEGTRRKVLALDTGASTSWLMCEPCQPPLPQVG-HLFSPAASPTFQGVRGDGPV 129
Query: 156 CTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTF--LYDVGFGCSHNNAH 213
CT C + P + G + + F+ + G + + FGC+H+
Sbjct: 130 CTVPYRHTDKGCSFRF-----PFAAGYLSRDTFHLRSGRSGTVMESVPGIMFGCAHSVTG 184
Query: 214 F-SDEQFTGVFGLGPATSSTHSLVE-KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGD 271
F +D +GV L + S +L+ + +FSYC+ + + L
Sbjct: 185 FHNDGTLSGVLSLSHSPLSFLTLLGGRSSGRFSYCLPKPTTHNPDSFLRFGADVPSLPPH 244
Query: 272 STPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+ +++ Y++ + GISLG K L ID ++F + G I+ T+T ++
Sbjct: 245 AHTTTLVHAGVPGYHLNIVGISLGNKRLHIDRHVFA-----AGGGCSINPAVTITRIMEL 299
Query: 329 AY----QTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQ-GFPAMAFHFAGGADLV 383
AY L +++L G + P LC+ +++R ++ P M+FHF GA+L
Sbjct: 300 AYLAVEHALVAHMKELGSGRVKGMP---GRSLCFD-HMDRSVRVQLPGMSFHFEDGAELR 355
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
AE +F + L VG R ++IG Q + +D+ + +L F
Sbjct: 356 FAAEQLFDVRVMAACFLVVG-------RGHHQTVIGAAQQVDTRFTFDIAAGRLAFVPET 408
Query: 444 CE 445
C+
Sbjct: 409 CD 410
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/334 (29%), Positives = 152/334 (45%), Gaps = 46/334 (13%)
Query: 138 FDPSKSLTYATLPCDSSYCTN----DCGG---YPDE-CWYNIRYTNGPDSQGTIGSEQFN 189
FD S S T CDS+ C CG +P++ C Y Y + + G + ++F
Sbjct: 177 FDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDKFT 236
Query: 190 FETSDEGKTFLYDVGFGCS-HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIG 248
F + V FGC NN F + TG+ G G S S + KVG+ FS+C
Sbjct: 237 FGAGAS----VPGVAFGCGLFNNGVFKSNE-TGIAGFGRGPLSLPSQL-KVGN-FSHCFT 289
Query: 249 NLNYFEYAYNMLIL-------GEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDI 298
+N + + +L L G GA+ STP+ S YY++L+GI++G L +
Sbjct: 290 AVNGLKQSTVLLDLLADLYKNGRGAV---QSTPLIQNSANPTLYYLSLKGITVGSTRLPV 346
Query: 299 DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW-HLC 357
+ F T G IDSGT++T L P YQ +R E + LP P + + C
Sbjct: 347 PESAFAL--TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK--LPVVPGNATGPYTC 402
Query: 358 YSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ----ESSSVFCLAVGPSDINGERFK 413
+S ++ P + HF GA + L E+ ++ +S+ CLA+ +++ ER
Sbjct: 403 FSAP-SQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSMICLAI--NELGDER-- 456
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+ IG QQN +V YDL + L F C+ L
Sbjct: 457 --ATIGNFQQQNMHVLYDLQNNMLSFVAAQCDKL 488
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 65/143 (45%), Gaps = 18/143 (12%)
Query: 288 GISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS 347
GI++G L + + F T G IDSGT++T L P YQ +R E + LP
Sbjct: 41 GITVGSTRLPVPESAFAL--TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK--LPV 96
Query: 348 YPMDPAW-HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ----ESSSVFCLAV 402
P + + C+S ++ P + HF GA + L E+ ++ +S+ CLA+
Sbjct: 97 VPGNATGPYTCFSAP-SQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAI 154
Query: 403 GPSDINGERFKDLSIIGMIAQQN 425
D + +IIG QQN
Sbjct: 155 NKGD-------ETTIIGNFQQQN 170
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 141/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
Length = 472
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 163/391 (41%), Gaps = 75/391 (19%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLP 150
+F + S+G+PPV L +DTGS+L WV+CQPC C FDP +S T +
Sbjct: 113 LFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172
Query: 151 CDSSYCTN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTF 199
C S C +C + C Y++ Y NG S G + ++ G +F
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSF 227
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLN 251
+ D+ FGCS + + E G+ SS+ S E++ FSYC L
Sbjct: 228 M-DLMFGCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKAFSYC---LP 278
Query: 252 YFEYAYNMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKN 306
E +ILG + A ++G TP+ S+ +Y +T+E I+ G++++
Sbjct: 279 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT--------- 329
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY------ 358
S + + +DSG T L PS + L K + G + ++CY
Sbjct: 330 ---SSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDY 386
Query: 359 ---SGNIN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
+G I + P + FAGGA L L +VFY + C+ + +
Sbjct: 387 SGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQ-- 444
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
I+G +++ +D+ KQ F+ C
Sbjct: 445 ---ILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/390 (25%), Positives = 157/390 (40%), Gaps = 59/390 (15%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA--------TTFDPSKSLTYATL 149
+++ +G P + +DTGS ++WV C+PC C T +DP +S T + +
Sbjct: 1 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 60
Query: 150 PCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQG--TIGSEQFNFETSDEGKTFL 200
C C C + C Y Y +G S+G + Q+N +S+
Sbjct: 61 SCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTT 120
Query: 201 YDVGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLV---EKVGSKFSYCIGNLNYFE 254
V FGCS + S + G+ G G S + + + + FS+C+
Sbjct: 121 SQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGG 180
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ + E + P SV Y V L GIS+ L ID F + D GV
Sbjct: 181 GILVIGGIAEPGMTYTPLVPDSV---HYNVVLRGISVNSNRLPIDAEDFSSTN---DTGV 234
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGL-LPSYPMDPAWHLCYSGNINRDLQGFPAMA 373
+DSGTTL + AY + + + + MD L SG ++ DL FP +
Sbjct: 235 IMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLV-SGRLS-DL--FPNVT 290
Query: 374 FHFAGGA-----DLVL-----------DAESVFYQESSSVFCLAVGPSDINGERFKDLSI 417
+F GGA D L D + +Q SSS + GP D L+I
Sbjct: 291 LNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSS----SAGPKD-----GSQLTI 341
Query: 418 IGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+G I ++ V YDL + ++ + +C+ L
Sbjct: 342 LGDIVLKDKLVVYDLDNSRIGWMSYNCKFL 371
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 141/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ + +G P Q+ +DTGSS+ WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSSGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 141/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 156/365 (42%), Gaps = 53/365 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ V +G PP VLDT + +W+ C C C +T+F+ + S TY+T+ C ++ C
Sbjct: 30 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQC 89
Query: 157 TNDCG-------GYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV----GF 205
T G P C +N Y G F+ + T DV F
Sbjct: 90 TQARGLTCPSSSPQPSVCSFNQSYG---------GDSSFSASLVQDTLTLAPDVIPNFSF 140
Query: 206 GCSHNNAHFSDEQFTGVFGLGPA----TSSTHSLVEKVGSKFSYCIGNLN--YFEYAYNM 259
GC N+A + G+ GLG S T SL V FSYC+ + YF + +
Sbjct: 141 GC-INSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGV---FSYCLPSFRSFYFSGSLKL 196
Query: 260 LILGE------GAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
+LG+ +L P YYV L G+S+G + +DP ++ D S AG
Sbjct: 197 GLLGQPKSIRYTPLLRNPRRP-----SLYYVNLTGVSVGSVQVPVDP-VYLTFDANSGAG 250
Query: 314 VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMA 373
IDSGT +T Y+ +R E Q + S+ A+ C+S + N ++ P +
Sbjct: 251 TIIDSGTVITRFAQPVYEAIRDEFRK--QVNVSSFSTLGAFDTCFSAD-NENVA--PKIT 305
Query: 374 FHFAGGADLVLDAESVFYQESSSVF-CLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDL 432
H DL L E+ S+ CL++ + I L++I + QQN + +D+
Sbjct: 306 LHMT-SLDLKLPMENTLIHSSAGTLTCLSM--AGIRQNANAVLNVIANLQQQNLRILFDV 362
Query: 433 VSKQL 437
+ ++
Sbjct: 363 PNSRI 367
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 148/364 (40%), Gaps = 32/364 (8%)
Query: 105 IGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFDPSKSLTYATLPCDSSYC 156
IG P +DTGS +WV C C C T +DP+ S T +PCD +C
Sbjct: 80 IGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCDDEFC 139
Query: 157 TNDCGGYPDECW------YNIRYTNGPDSQGTIGSEQFNFE-------TSDEGKTFLYDV 203
T+ G C Y+I Y +G + G+ + F+ T + + ++
Sbjct: 140 TSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGC 199
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNML 260
G S + +D G+ G G A SS S + KV FS+C+ +++ +
Sbjct: 200 GSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSIS----GGGIF 255
Query: 261 ILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
+GE + +TP+ Y V L+ I + + + ++ D+ S G IDSGT
Sbjct: 256 AIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDIL---DSSSGRGTIIDSGT 312
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGA 380
TL +L S Y L +++ G+ D YS + D FP + F F G
Sbjct: 313 TLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQFTCFHYSDEESVD-DLFPTVKFTFEEGL 371
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
L + ++C+ S + K+L ++G + N V YDL + + +
Sbjct: 372 TLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANKLVVYDLDNMAIGWA 431
Query: 441 RIDC 444
+C
Sbjct: 432 DYNC 435
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 141/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 141/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSF----SFGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 145/345 (42%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCI----GNLNYFEYAYNMLI 261
N F +F V GL + S++++ + FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
Length = 424
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 136/355 (38%), Gaps = 82/355 (23%)
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCE--QC---GATTFDPSKSLTYATLPCDSSYCTN 158
+I P + Q +DT L W++C PC +C FDP +S T A +PC S+ C
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197
Query: 159 ----DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN-NAH 213
G ++C Y + Y +G + GT + S T + + FGCSH +
Sbjct: 198 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPS----TVVMNFRFGCSHAVRGN 253
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDST 273
FS +F P +
Sbjct: 254 FSASTSGTMFARTPLVRNP----------------------------------------- 272
Query: 274 PMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTL 333
S+I Y V L GI +G + L++ P +F G +DS +T L P+AY+ L
Sbjct: 273 --SIIPTLYLVRLRGIEVGGRRLNVPPVVFA-------GGAVMDSSVIITQLPPTAYRAL 323
Query: 334 RKEVEDLFQGLLPSYPM----DPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESV 389
R F+ + +YP CY + PA++ F GGA + LDA V
Sbjct: 324 RLA----FRSAMAAYPRVAGGRAGLDTCYD-FVRFTSVTVPAVSLVFDGGAVVRLDAMGV 378
Query: 390 FYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ CLA P+ + L IG + QQ + V YD+V + F+R C
Sbjct: 379 MVEG-----CLAFVPTPGDFA----LGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 144/345 (41%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCI----GNLNYFEYAYNMLI 261
N F +F V GL + S++++ + FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGRRGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
Length = 357
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 160/385 (41%), Gaps = 75/385 (19%)
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLPCDSSYC 156
S+G+PPV L +DTGS+L WV+CQPC C FDP +S T + C S C
Sbjct: 4 SLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSSVKC 63
Query: 157 TN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
+C D C Y++ Y NG S G + ++ G +F+ D+ F
Sbjct: 64 GEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSFM-DLMF 117
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLNYFEYAY 257
GCS + + E G+ SS+ S E++ FSYC L E
Sbjct: 118 GCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKAFSYC---LPTDETKP 169
Query: 258 NMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKNDTWSDA 312
+ILG + A ++G TP+ S+ +Y +T+E I+ G++++ S +
Sbjct: 170 GYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT------------SSS 217
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY---------SGN 361
+ +DSG T L PS + L K + G + ++CY +G
Sbjct: 218 EMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGT 277
Query: 362 IN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIG 419
I + P + FAGGA L L +VFY + C+ + + I+G
Sbjct: 278 ITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQ-----ILG 332
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDC 444
+++ +D+ KQ F+ C
Sbjct: 333 NRVTRSFGTTFDIQGKQFGFKYAAC 357
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 145/345 (42%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCI----GNLNYFEYAYNMLI 261
N F +F V GL + S++++ + FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 140/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPGF----SFGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NMDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 77/280 (27%), Positives = 131/280 (46%), Gaps = 28/280 (10%)
Query: 179 SQGTIGSEQFNFETSDEGKTFLYDVGFGCSH-NNAHFSDEQFTGVFGLGPATSSTHSLVE 237
S G + +E F F + F ++ FGC N + +G+ G+ P S L +
Sbjct: 3 STGVLATETFTFGAH---QNFSANLTFGCGKLTNGTIAGA--SGIMGVSPGPLSV--LKQ 55
Query: 238 KVGSKFSYCIGNLNYFEYAYNMLILGEGAIL-------EGDSTPM---SVIDGSYYVTLE 287
+KFSYC+ + ++ + ++ G A L + + P+ V D YYV +
Sbjct: 56 LSITKFSYCL--TPFTDHKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMV 113
Query: 288 GISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPS 347
GIS+G K LD+ P G +DS TTL +LV A++ L+K V + + +
Sbjct: 114 GISIGSKRLDV-PEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAAN 172
Query: 348 YPMDPAWHLCYSGNINRDLQGF--PAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPS 405
+D + +C+ ++G P + HFAG A++ L +S F + S + CLAV +
Sbjct: 173 RSID-DYPVCFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYFQEPSPGMMCLAVMQA 231
Query: 406 DINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
G ++IG + QQN +V YDL +++ + C+
Sbjct: 232 PFEGAP----NVIGNVQQQNMHVLYDLGNRKFSYAPTKCD 267
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 165/378 (43%), Gaps = 44/378 (11%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G P +DTGS ++WV C PC+ C ++ FD +KS +
Sbjct: 81 VGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSAR 140
Query: 148 TLPCDSSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
LPC C T+ C D C Y+ Y + + G ++ +F+ T
Sbjct: 141 VLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIAN 200
Query: 202 D---VGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNY 252
+ FGCS + + + + G+FG G S S + G FS+C L
Sbjct: 201 SSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHC---LKG 257
Query: 253 FEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
E +L+LGE ILE +P+ Y + L+ I+L ++ +P +F S
Sbjct: 258 GENGGGILVLGE--ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFP-NPTMFP----IS 310
Query: 311 DAG-VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNIN-RDLQG 368
+AG IDSGTTL +LV Y + + + P C+ +++ D+
Sbjct: 311 NAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQ--SATPTISRGSQCFRVSMSVADI-- 366
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPS-DINGERFKD-LSIIGMIAQQNY 426
FP + F+F G A +V+ E + Q S V C I ++ +D L+I+G + ++
Sbjct: 367 FPVLRFNFEGIASMVVTPEE-YLQFDSIVSCYKFASLWCIGFQKAEDGLNILGDLVLKDK 425
Query: 427 NVAYDLVSKQLYFQRIDC 444
+ YDL +++ + DC
Sbjct: 426 IIVYDLAQQRIGWANYDC 443
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 151/383 (39%), Gaps = 56/383 (14%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA--------------TTFDPSKS 143
+ Y IG P V L LD GS L+WV C C QC + + PS S
Sbjct: 106 LHYTWIDIGTPNVSFLVALDAGSDLLWVPCD-CIQCAPLSASYYNISLDRDLSEYSPSLS 164
Query: 144 LTYATLPCDSSYCT--NDCGGYPDECWYNIRYTNGPD--SQGTIGSEQFNFETSDE---G 196
T L CD C ++C D C Y Y + + S G + ++ + + +
Sbjct: 165 STSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDDFENTTSAGFLVEDKLHLASVGDHTAR 224
Query: 197 KTFLYDVGFGCSHNN--AHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLN 251
K V GC + F GV GLGP S SL+ K G + FS C
Sbjct: 225 KMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSLLAKAGLIQNCFSLC----- 279
Query: 252 YFEYAYNMLILGEGAILEGDSTPMSVIDG---SYYVTLEGISLGEKMLDIDPNLFKKNDT 308
+ E ++ G+ STP I G +Y+V +E +G L
Sbjct: 280 FDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVESYCVGNSCL-----------K 328
Query: 309 WSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQG 368
S +DSG++ T+L Y L E + S+ D W CY+ + +++L
Sbjct: 329 RSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISF-QDGLWDYCYNAS-SQELHD 386
Query: 369 FPAMAFHFAGGADLVLD--AESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNY 426
PA+ F + V+ S+ + + ++FCL++ P+D IIG Y
Sbjct: 387 IPAIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTD------GSYGIIGQNFMIGY 440
Query: 427 NVAYDLVSKQLYFQRIDCELLAD 449
+ +D+ + +L + C+ +D
Sbjct: 441 RMVFDIENLKLGWSNSSCQDTSD 463
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 164/394 (41%), Gaps = 40/394 (10%)
Query: 69 ARFIYLSQKSSQKAHDTRAHLHPGISTVPV--FYVNFSIGQPPVPQLAVLDTGSSLIWVK 126
AR YLS ++Q T + PG + + + V +G P VLDT + WV
Sbjct: 67 ARLKYLSSLAAQMT--TAVPIAPGQQVLNIGNYVVRVKLGTPGQFMFMVLDTSNDAAWVP 124
Query: 127 CQPCEQCGATTFDPSKSLTYATLPCDSSYCTNDCG-GYP----DECWYNIRYTNGPDSQG 181
C C C +TTF + S TY +L C + CT G P C +N Y
Sbjct: 125 CSGCTGCSSTTFSTNTSSTYGSLDCSMAQCTQVRGFSCPATGSSSCVFNQSYG------- 177
Query: 182 TIGSEQFNFETSDEGKTFLYDV----GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVE 237
G F+ ++ + DV FGC ++ + S + S
Sbjct: 178 --GDSSFSATLVEDSLRLVNDVIPNFAFGCINSISGGSVPPQGLLGLGRGPLSLIAQSGS 235
Query: 238 KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPM---SVIDGSYYVTLEGISLGEK 294
FSYC+ + + ++ ++ + G TP+ YYV L G+S+G
Sbjct: 236 LYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKSIRYTPLLRNPHRPSLYYVNLTGVSVGRT 295
Query: 295 MLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW 354
++ I P L N + AG IDSGT +T V Y +R E G S A+
Sbjct: 296 LVPIAPELLAFNPN-TGAGTIIDSGTVITRFVQPIYTAIRDEFRKQVAGPFSSL---GAF 351
Query: 355 HLCYSGNINRDLQGFPAMAFHFAGGADLVLDAE-SVFYQESSSVFCLAV--GPSDINGER 411
C++ N + PA+ HF G +LVL E S+ + + S+ CLA+ P+++N
Sbjct: 352 DTCFAAT-NEAVA--PAVTLHFT-GLNLVLPMENSLIHSSAGSLACLAMAAAPNNVNSV- 406
Query: 412 FKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
L++I + QQN + +D+ + +L R C
Sbjct: 407 ---LNVIANLQQQNLRLLFDVPNSRLGIARELCN 437
>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 524
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 104/364 (28%), Positives = 154/364 (42%), Gaps = 92/364 (25%)
Query: 138 FDPSKSLTYATLPCDSSYCT------NDCGGY---------------PDECWYNIRYTNG 176
FDP+KS + A +PC S C N C +C Y + Y++G
Sbjct: 196 FDPTKSFSAAAVPCGSRACRALGNYGNGCSNNSRRNKKKNKSKSNNSTGDCNYRVAYSDG 255
Query: 177 PDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN-NAHFSDEQFTGVFGLGPATSSTHSL 235
S GT ++ T G +FL + FGCSH FS E +G LG S S
Sbjct: 256 RVSSGTYMTDIL---TISPGTSFL-NFRFGCSHGVRGSFSGET-SGTMSLGGGRQSLLSQ 310
Query: 236 VEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGD----------STPM----SVIDG 280
+ G+ FSYC+ + A L LG GAI +GD +TP+ +++
Sbjct: 311 TARAYGNAFSYCVPKPS----ASGFLSLG-GAINDGDSDSDSPSSFVTTPLMRNARIVNP 365
Query: 281 SYYVT-LEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVED 339
+YYV L+GI + + L++ P +F G +DS +T L P+AY+ LR +
Sbjct: 366 TYYVVRLQGIDVAGRRLNVPPVVFS-------GGTLMDSSAVVTQLPPTAYRALRLAFRN 418
Query: 340 LFQGLLPSYPMD---------PA-----WHLCYSGNINRDLQGF-----PAMAFHFAGGA 380
+G Y M+ PA CY D +G P ++ F GGA
Sbjct: 419 AMRG----YRMNTRNGSTSSTPAGGEMILDTCY------DFEGLDNVTVPTVSLVFFGGA 468
Query: 381 DLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQ 440
+ LD + E CLA P+ + DL IG + QQ + V YD+ ++ + F+
Sbjct: 469 VVDLDPTTAVMMEG----CLAFVPTPAD----FDLGFIGNVQQQTHEVLYDVGARNVGFR 520
Query: 441 RIDC 444
R C
Sbjct: 521 RGAC 524
>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
Length = 357
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 160/385 (41%), Gaps = 75/385 (19%)
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLPCDSSYC 156
S+G+PPV L +DTGS+L WV+CQPC C FDP +S T + C S C
Sbjct: 4 SLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSSVKC 63
Query: 157 TN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
+C D C Y++ Y NG S G + ++ G +F+ D+ F
Sbjct: 64 GELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSFM-DLMF 117
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLNYFEYAY 257
GCS + + E G+ SS+ S E++ FSYC L E
Sbjct: 118 GCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKAFSYC---LPTDETKP 169
Query: 258 NMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKNDTWSDA 312
+ILG + A ++G TP+ S+ +Y +T+E I+ G++++ S +
Sbjct: 170 GYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT------------SSS 217
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY---------SGN 361
+ +DSG T L PS + L K + G + ++CY +G
Sbjct: 218 EMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGT 277
Query: 362 IN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIG 419
I + P + FAGGA L L +VFY + C+ + + I+G
Sbjct: 278 ITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQ-----ILG 332
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDC 444
+++ +D+ KQ F+ C
Sbjct: 333 NRVTRSFGTTFDIQGKQFGFKYAAC 357
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 140/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ + +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSRGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 159/384 (41%), Gaps = 65/384 (16%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ-PCEQCGATT---FDPSKSLTYATLPCDS 153
++YV +IG PP P +D+GS L W++C PC C + P+KS +PC
Sbjct: 56 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKS---KLVPCVH 112
Query: 154 SYCT---------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
C + C ++C Y I+Y + S G + ++ F ++ G V
Sbjct: 113 RLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN-GSVARPSVA 171
Query: 205 FGCSHNNAHFSDEQFT---GVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
FGC ++ S + + GV GLG + S S +++ G + + + +
Sbjct: 172 FGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRG---------VTKNVVGHCLSL 222
Query: 262 LGEGAILEGDS---------TPMSVIDGSYYVTLEGISL--GEKMLDIDPNLFKKNDTWS 310
G G + GD TPM+ Y + SL G++ L +
Sbjct: 223 RGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGV-----------R 271
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSG-----NINRD 365
A V DSG++ T+ YQ L ++D L P D + LC+ G ++
Sbjct: 272 LAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEP-DTSLPLCWKGQEPFKSVLDV 330
Query: 366 LQGFPAMAFHFAGGADLVLD--AESVFYQESSSVFCLAVGPSDINGER--FKDLSIIGMI 421
+ F ++ +FA G +++ E+ + CL + +NG KDLSIIG I
Sbjct: 331 RKEFKSLVLNFASGKKTLMEIPPENYLIVTENGNACLGI----LNGSEIGLKDLSIIGDI 386
Query: 422 AQQNYNVAYDLVSKQLYFQRIDCE 445
Q++ V YD ++ + R C+
Sbjct: 387 TMQDHMVIYDNEKGKIGWIRAPCD 410
>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
Length = 472
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 162/391 (41%), Gaps = 75/391 (19%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLP 150
+F + S+G+PPV L +DTGS+L WV+CQPC C FDP +S T +
Sbjct: 113 LFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172
Query: 151 CDSSYCTN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTF 199
C S C +C D C Y++ Y NG S G + ++ G +F
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSF 227
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLN 251
+ D+ FGCS + + E G+ SS+ S E++ SYC L
Sbjct: 228 M-DLMFGCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKALSYC---LP 278
Query: 252 YFEYAYNMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKN 306
E +ILG + A ++G TP+ S+ +Y +T+E I+ G++++
Sbjct: 279 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT--------- 329
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY------ 358
S + + +DSG T L PS + L K + G + ++CY
Sbjct: 330 ---SSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDY 386
Query: 359 ---SGNIN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
+G I + P + FAGGA L L +VFY + C+ + +
Sbjct: 387 SGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQ-- 444
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
I+G +++ +D+ KQ F+ C
Sbjct: 445 ---ILGNRVTRSFGTTFDIQGKQFGFKYAVC 472
>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
Length = 474
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 162/391 (41%), Gaps = 75/391 (19%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLP 150
+F + S+G+PPV L +DTGS+L WV+CQPC C FDP +S T +
Sbjct: 115 LFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 174
Query: 151 CDSSYCTN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTF 199
C S C +C D C Y++ Y NG S G + ++ G +F
Sbjct: 175 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSF 229
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLN 251
+ D+ FGCS + + E G+ SS+ S E++ SYC L
Sbjct: 230 M-DLMFGCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKALSYC---LP 280
Query: 252 YFEYAYNMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKN 306
E +ILG + A ++G TP+ S+ +Y +T+E I+ G++++
Sbjct: 281 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT--------- 331
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY------ 358
S + + +DSG T L PS + L K + G + ++CY
Sbjct: 332 ---SSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDY 388
Query: 359 ---SGNIN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
+G I + P + FAGGA L L +VFY + C+ + +
Sbjct: 389 SGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQ-- 446
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
I+G +++ +D+ KQ F+ C
Sbjct: 447 ---ILGNRVTRSFGTTFDIQGKQFGFKYAVC 474
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 159/384 (41%), Gaps = 65/384 (16%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ-PCEQCGATT---FDPSKSLTYATLPCDS 153
++YV +IG PP P +D+GS L W++C PC C + P+KS +PC
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKS---KLVPCVH 121
Query: 154 SYCT---------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVG 204
C + C ++C Y I+Y + S G + ++ F ++ G V
Sbjct: 122 RLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN-GSVARPSVA 180
Query: 205 FGCSHNNAHFSDEQFT---GVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLI 261
FGC ++ S + + GV GLG + S S +++ G + + + +
Sbjct: 181 FGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRG---------VTKNVVGHCLSL 231
Query: 262 LGEGAILEGDS---------TPMSVIDGSYYVTLEGISL--GEKMLDIDPNLFKKNDTWS 310
G G + GD TPM+ Y + SL G++ L +
Sbjct: 232 RGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGV-----------R 280
Query: 311 DAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSG-----NINRD 365
A V DSG++ T+ YQ L ++D L P D + LC+ G ++
Sbjct: 281 LAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEP-DTSLPLCWKGQEPFKSVLDV 339
Query: 366 LQGFPAMAFHFAGGADLVLD--AESVFYQESSSVFCLAVGPSDINGER--FKDLSIIGMI 421
+ F ++ +FA G +++ E+ + CL + +NG KDLSIIG I
Sbjct: 340 RKEFKSLVLNFASGKKTLMEIPPENYLIVTENGNACLGI----LNGSEIGLKDLSIIGDI 395
Query: 422 AQQNYNVAYDLVSKQLYFQRIDCE 445
Q++ V YD ++ + R C+
Sbjct: 396 TMQDHMVIYDNEKGKIGWIRAPCD 419
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 151/378 (39%), Gaps = 47/378 (12%)
Query: 78 SSQKAHDTRAHLH--PGIS----------TVPVFYVNFSIGQPPVPQLAVLDTGSSLIWV 125
S+ KAHD L G+ V ++Y IG P +DTGS ++WV
Sbjct: 54 STLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWV 113
Query: 126 KCQPCEQCGATT--------FDPSKSLTYATLPCDSSYCTNDCGGYPDECWYNIR----- 172
C C +C T+ +D +S T + CD +C GG C N+
Sbjct: 114 NCIQCRECPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQ 173
Query: 173 -YTNGPDSQGTIGSE--QFNFETSD-EGKTFLYDVGFGC----SHNNAHFSDEQFTGVFG 224
Y +G + G + Q+N + D E + FGC S + +E G+ G
Sbjct: 174 IYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILG 233
Query: 225 LGPATSSTHSLV---EKVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDGS 281
G + SS S + KV F++C+ N + +G + + TP+
Sbjct: 234 FGKSNSSIISQLASTRKVKKMFAHCLDGTN----GGGIFAMGHVVQPKVNMTPLVPNQPH 289
Query: 282 YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLF 341
Y V + G+ +G +L+I ++F+ D G IDSGTTL +L Y+ L ++
Sbjct: 290 YNVNMTGVQVGHIILNISADVFEAGDR---KGTIIDSGTTLAYLPELIYEPLVAKILSQQ 346
Query: 342 QGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLA 401
L YS ++ GFP + FHF L + +Q +++C+
Sbjct: 347 HNLEVQTIHGEYKCFQYSERVD---DGFPPVIFHFENSLLLKVYPHEYLFQ-YENLWCIG 402
Query: 402 VGPSDINGERFKDLSIIG 419
S + K++++ G
Sbjct: 403 WQNSGMQSRDRKNVTLFG 420
>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
Length = 357
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 160/385 (41%), Gaps = 75/385 (19%)
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLPCDSSYC 156
S+G+PPV L +DTGS+L WV+CQPC C FDP +S T + C S C
Sbjct: 4 SLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSSVKC 63
Query: 157 TN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
+C D C Y++ Y NG S G + ++ G +F+ D+ F
Sbjct: 64 GEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSFM-DLMF 117
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLNYFEYAY 257
GCS + + E G+ SS+ S E++ FSYC L E
Sbjct: 118 GCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKAFSYC---LPTDETKP 169
Query: 258 NMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKNDTWSDA 312
+ILG + A ++G TP+ S+ +Y +T+E I+ G++++ S +
Sbjct: 170 GYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT------------SSS 217
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY---------SGN 361
+ +DSG T L PS + L K + G + ++CY +G
Sbjct: 218 EMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGT 277
Query: 362 IN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIG 419
I + P + FAGGA L L +VFY + C+ + + I+G
Sbjct: 278 ITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQ-----ILG 332
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDC 444
+++ +D+ KQ F+ C
Sbjct: 333 NRVTRSFGTTFDIQGKQFGFKYAAC 357
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 102/415 (24%), Positives = 156/415 (37%), Gaps = 59/415 (14%)
Query: 57 DAQAQRTLNMSMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVL 116
D Q Q+ + + LS+ S PG ++Y +G P L L
Sbjct: 66 DLQRQKRRLAGKNQLLSLSKGGST--------FSPGNDLGWLYYAWVDVGTPTTSFLVAL 117
Query: 117 DTGSSLIWVKCQPCEQCGATT------------FDPSKSLTYATLPCDSSYCT--NDCGG 162
DTGS L WV C C QC + + P++S T LPC C + C
Sbjct: 118 DTGSDLFWVPCD-CIQCAPLSSYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTN 176
Query: 163 YPDECWYNIRY-TNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNA--HFSDEQF 219
C YNI Y + S G + + + + + V GC + +
Sbjct: 177 PKQPCTYNIDYFSENTTSSGLLIEDSLHLNSREGHAPVNASVIIGCGRKQSGDYLDGIAP 236
Query: 220 TGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMS 276
G+ GLG A S S + + G + FS C + E + + G+ + STP
Sbjct: 237 DGLLGLGMADISVPSFLARAGLVRNSFSMC-----FKEDSSGRIFFGDQGVSSQQSTPFV 291
Query: 277 VIDG---SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTL 333
+ G +Y V ++ +G K L+ S +DSGT+ T L P Y+
Sbjct: 292 PLYGKLQTYAVNVDKSCIGHKCLE-----------GSSFQALVDSGTSFTSLPPDVYKAF 340
Query: 334 RKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADL-VLDAESVFYQ 392
E + Y D W CYS + ++ P + FA ++ F
Sbjct: 341 TTEFDKQINASRVPY-EDSTWKYCYSAS-PLEMPDVPTIILAFAANKSFQAVNPILPFND 398
Query: 393 ESSSV--FCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCE 445
E ++ FCLAV PS + + IIG Y+V +D S +L + R +C
Sbjct: 399 EQGALARFCLAVLPST------EPIGIIGQNFLVGYHVVFDRESMKLGWYRSECR 447
>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
Length = 342
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 151/365 (41%), Gaps = 61/365 (16%)
Query: 97 PVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYC 156
P++ N +IG PP P A++ +W +C PC +C LP + Y
Sbjct: 26 PLYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRC-----------FKQDLPLFNRYE 74
Query: 157 TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSD 216
G D+ G G++ F T+ + FGC+ ++
Sbjct: 75 VETMFG---------------DTSGIGGTDTFAIGTATA------SLAFGCAMDSNIKQL 113
Query: 217 EQFTGVFGLGPATSSTHSLVEKV-GSKFSYCIGNLNYFEYAYNMLILGEGAILEGD---- 271
+GV GLG + SLV ++ + FSYC+ + + L+LG A L G
Sbjct: 114 LGASGVVGLG---RTPWSLVGQMNATAFSYCLAP-HGAAGKKSALLLGASAKLAGGKSAA 169
Query: 272 STPM---SVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPS 328
+TP+ S Y + LEGI G+ +++ PN + V +D+ +++LV +
Sbjct: 170 TTPLVNTSDDSSDYMIHLEGIKFGDVIIEPPPN---------GSVVLVDTIFGVSFLVDA 220
Query: 329 AYQTLRKEVEDLFQGLLPSYPMDPAWHLCY-----SGNINRDLQGFPAMAFHFAGGADLV 383
A+ ++K V + P P + LC+ + N L P + F G A L
Sbjct: 221 AFHAIKKAVTVAVGAAPMATPTKP-FDLCFPKAAAAAGANSSLP-LPDVVLTFQGAAALT 278
Query: 384 LDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRID 443
+ Y + CLA+ S + +LSI+G + Q+N + +DL + L F+ D
Sbjct: 279 VPPSKYMYDAGNGTVCLAMMSSAML-NLTTELSILGRLHQENIHFLFDLDKETLSFEPAD 337
Query: 444 CELLA 448
C L+
Sbjct: 338 CSSLS 342
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 119/438 (27%), Positives = 178/438 (40%), Gaps = 81/438 (18%)
Query: 39 VTKLLHRDSLLYNPNDTVDAQA-QRTLNMSMARFIYLSQK----------------SSQK 81
V +L HR P+ + A + L R Y+ ++ SS K
Sbjct: 424 VLRLTHRHGPCAGPSRSASAPSFAEVLRADERRAEYIQRRMSGAKGPGGLQQFTAASSSK 483
Query: 82 AHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT---- 137
+ A++ I T+ + V S+G P V Q +DTGS + WV+C PC
Sbjct: 484 SVTIPANIGHSIGTLQ-YVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQ 542
Query: 138 -FDPSKSLTYATLPCDSSYCT------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNF 190
FDP+KS +Y+ +PC + C+ + C +C Y + Y +G ++ G GS+
Sbjct: 543 LFDPAKSSSYSAVPCAADACSELSTYGHGCAAG-SQCGYVVSYGDGSNTTGVYGSDTLTL 601
Query: 191 ETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKV-----GSKFSY 245
+D FL FGC H A F G+ GL SL + G FSY
Sbjct: 602 TDADAVTGFL----FGCGHAQAGL----FAGIDGLLALGRKGMSLTSQTSGAYGGGVFSY 653
Query: 246 CIGNLNYFEYAYNMLILGEGAILEGDSTPMSV----IDGSYYVTLEGISLGEKMLDIDP- 300
C L + L LG + G +T + + Y V L GI +G + L P
Sbjct: 654 C---LPPSPSSTGFLTLGGPSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPA 710
Query: 301 NLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL---C 357
+ F G +D+GT +T L P+AY LR YP PA + C
Sbjct: 711 SAFA-------GGTVVDTGTVITRLPPTAYAALRAAFRAAMAPY--GYPAAPATGILDTC 761
Query: 358 YS----GNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
Y+ G + P ++ F+GGA L LDA S CLA + +G
Sbjct: 762 YNFTDYGTVT-----LPTVSLTFSGGATLKLDAPGFL-----SSGCLAFATNSGDG---- 807
Query: 414 DLSIIGMIAQQNYNVAYD 431
D +I+G + Q+++ V +D
Sbjct: 808 DPAILGNVQQRSFAVRFD 825
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 159/385 (41%), Gaps = 66/385 (17%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQ-PCEQCGATT---FDPSKSLTYATLPCDS 153
++YV +IG PP P +D+GS L W++C PC C + P+KS +PC
Sbjct: 63 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKS---KLVPCVH 119
Query: 154 SYCT----------NDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDV 203
C + C ++C Y I+Y + S G + ++ F ++ G V
Sbjct: 120 RLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRLTN-GSVARPSV 178
Query: 204 GFGCSHNNAHFSDEQFT---GVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNML 260
FGC ++ S + + GV GLG + S S +++ G + + +
Sbjct: 179 AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRG---------VTKNVVGHCLS 229
Query: 261 ILGEGAILEGDS---------TPMSVIDGSYYVTLEGISL--GEKMLDIDPNLFKKNDTW 309
+ G G + GD TPM+ Y + SL G++ L +
Sbjct: 230 LRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGV----------- 278
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSG-----NINR 364
A V DSG++ T+ YQ L ++D L P D + LC+ G ++
Sbjct: 279 RLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEP-DTSLPLCWKGQEPFKSVLD 337
Query: 365 DLQGFPAMAFHFAGGADLVLD--AESVFYQESSSVFCLAVGPSDINGER--FKDLSIIGM 420
+ F ++ +FA G +++ E+ + CL + +NG KDLSIIG
Sbjct: 338 VRKEFKSLVLNFASGKKTLMEIPPENYLIVTENGNACLGI----LNGSEIGLKDLSIIGD 393
Query: 421 IAQQNYNVAYDLVSKQLYFQRIDCE 445
I Q++ V YD ++ + R C+
Sbjct: 394 ITMQDHMVIYDNEKGKIGWIRAPCD 418
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 89/364 (24%), Positives = 149/364 (40%), Gaps = 40/364 (10%)
Query: 102 NFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC---GATTFDPSKSLTYATLPCDSSYC-- 156
NF+IG PP A +D L+W +C C C F P+ S T+ PC + C
Sbjct: 27 NFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKS 86
Query: 157 --TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHF 214
T C D C ++ G + G + ++ F T+ +GFGC +
Sbjct: 87 IPTPKCAS--DVCAFDGVTGLGGHTVGIVATDTFAIGTAAPAS-----LGFGCVVASDID 139
Query: 215 SDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLNYFEYAYNMLILGEGAILEG--- 270
+ +G GLG + SLV ++ ++FSYC+ + + + L LG A L G
Sbjct: 140 TMGGPSGFIGLG---RTPWSLVAQMKLTRFSYCLAPHDTGK--NSRLFLGASAKLAGGGA 194
Query: 271 -----DSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWL 325
++P + Y + LE I G+ + + +N V ++ L
Sbjct: 195 WTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRG---RNTVLVQTAV-----VRVSLL 246
Query: 326 VPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLD 385
V S YQ +K V + P+ + +C+ + G P + F F GA L +
Sbjct: 247 VDSVYQEFKKAVMASVGAAPTATPVGEPFEVCFP---KAGVSGAPDLVFTFQAGAALTVP 303
Query: 386 AESVFYQESSSVFCLAV-GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
+ + + CL+V + +N L+I+G Q+N ++ +DL L F+ DC
Sbjct: 304 PANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADC 363
Query: 445 ELLA 448
L+
Sbjct: 364 SSLS 367
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 163/396 (41%), Gaps = 61/396 (15%)
Query: 89 LHPGISTVPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-EQCGA----TTFDPSKS 143
LH + FY +G P ++DTGS++ +V C C CG FDP S
Sbjct: 68 LHGAVKDYGYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAAFDPEAS 127
Query: 144 LTYATLPCDSSYC---TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
T + + C S C + CG +C Y Y S G + + G +
Sbjct: 128 STASRISCTSPKCSCGSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLALHDGLPGAPII 187
Query: 201 YDVGFGC-SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEYA 256
FGC + ++ G+FGLG + +S + + K G FS C G
Sbjct: 188 ----FGCETRETGEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFG-------- 235
Query: 257 YNMLILGEGAILEGDS----------TPM--SVIDGSYY-VTLEGISLGEKMLDIDPNLF 303
++ G+GA+L GD+ TP+ S YY V + +++ ++L + +LF
Sbjct: 236 ---MVEGDGALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLF 292
Query: 304 KKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVED--LFQGLLPSYPMDPAW-HLCY-S 359
+ G +DSGTT T++ ++ VE L GL DP + +C+
Sbjct: 293 DQG-----YGTVLDSGTTFTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQ 347
Query: 360 GNINRDLQG----FPAMAFHFAGGADLVLDAESVFYQES--SSVFCLAVGPSDINGERFK 413
+ DL+ FP+M F G LVL + + + S +CL V + G
Sbjct: 348 APSHDDLEALSSVFPSMEVQFDQGTSLVLGPLNYLFVHTFNSGKYCLGVFDNGRAG---- 403
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLAD 449
+++G I +N V YD ++++ F C+ L +
Sbjct: 404 --TLLGGITFRNVLVRYDRANQRVGFGPALCKELGE 437
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 86/344 (25%), Positives = 146/344 (42%), Gaps = 32/344 (9%)
Query: 122 LIWVKCQPCEQCGA-----TTFDPSKSLTYATLPCDSSYCTNDCGG------YPDECWYN 170
L+ + C C + T +DP+ S T +PC +CT+ G C Y+
Sbjct: 28 LLQLGCTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYS 87
Query: 171 IRYTNGPDSQGTIGSEQFNFE-------TSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVF 223
I Y +G + G+ ++ F+ T + + ++ G S + + SDE G+
Sbjct: 88 ITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGII 147
Query: 224 GLGPATSSTHSLVE---KVGSKFSYCIGNLNYFEYAYNMLILGEGAILEGDSTPMSVIDG 280
G G A SS S + KV FS+C+ + + + +G+ + ++TP+
Sbjct: 148 GFGQANSSVLSQLAASGKVKRIFSHCLDS----HHGGGIFSIGQVMEPKFNTTPLVPRMA 203
Query: 281 SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDL 340
Y V L+ + + + + + LF D+ S G IDSGTTL +L S Y L +V
Sbjct: 204 HYNVILKDMDVDGEPILLPLYLF---DSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGR 260
Query: 341 FQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCL 400
GL D YS ++ +GFP + FHF G L + + ++C+
Sbjct: 261 QPGLKLMIVEDQFTCFHYSDKLD---EGFPVVKFHFE-GLSLTVHPHDYLFLYKEDIYCI 316
Query: 401 AVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S + +DL +IG + N V YDL + + + +C
Sbjct: 317 GWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNC 360
>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
Length = 472
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 162/391 (41%), Gaps = 75/391 (19%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLP 150
+F + S+G+PPV L +DTGS+L WV+CQPC C FDP +S T +
Sbjct: 113 LFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172
Query: 151 CDSSYCTN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTF 199
C S C +C D C Y++ Y NG S G + ++ G +F
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSF 227
Query: 200 LYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLN 251
+ D+ FGCS + + E G+ SS+ S E++ FSYC L
Sbjct: 228 M-DLMFGCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKAFSYC---LP 278
Query: 252 YFEYAYNMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKN 306
E +ILG + A ++G T + S+ +Y +T+E I+ G++++
Sbjct: 279 TDETKPGYMILGRYDRAAMDGGYTSLFRSINRPTYSLTMEMLIANGQRLVT--------- 329
Query: 307 DTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY------ 358
S + + +DSG T L PS + L K + G + ++CY
Sbjct: 330 ---SSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDY 386
Query: 359 ---SGNIN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFK 413
+G I + P + FAGGA L L +VFY + C+ + +
Sbjct: 387 SGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQ-- 444
Query: 414 DLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
I+G +++ +D+ KQ F+ C
Sbjct: 445 ---ILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 75/262 (28%), Positives = 118/262 (45%), Gaps = 35/262 (13%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G PP +DTGS ++WV C PC C +++ F+P S T +
Sbjct: 88 VGLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSS 147
Query: 148 TLPCDSSYCTND-------CGGYPDE-CWYNIRYTNGPDSQGTIGSEQFNFET---SDEG 196
+PC CT C + C Y Y +G + G S+ F+T +++
Sbjct: 148 KIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQT 207
Query: 197 KTFLYDVGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNL 250
+ FGCS++ + +D G+FG G S S + +G FS+C L
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---L 264
Query: 251 NYFEYAYNMLILGEGAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDT 308
+ +L+LGE I+E TP+ Y + LE I + + L ID +LF ++T
Sbjct: 265 KGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322
Query: 309 WSDAGVFIDSGTTLTWLVPSAY 330
G +DSGTTL +L AY
Sbjct: 323 Q---GTIVDSGTTLAYLADGAY 341
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 139/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ + +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGRHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 153/378 (40%), Gaps = 59/378 (15%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT-------------FDPSKSL 144
+ Y IG P V L LD GS ++WV C C +C + + + PS S
Sbjct: 104 LHYTWIDIGTPNVSFLVALDAGSDMLWVPCD-CIECASLSAGNYNVLDRDLNQYRPSLSN 162
Query: 145 TYATLPCDSSYCT--NDCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSD----EGK 197
T LPC C + C G D C Y ++Y + S G + ++ + TSD E
Sbjct: 163 TSRHLPCGHKLCDVHSFCKGSKDPCPYEVQYASANTSSSGYVFEDKLHL-TSDGKHAEQN 221
Query: 198 TFLYDVGFGCSHNNA--HFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNY 252
+ + GC + GV GLGP S SL+ K G + FS C+
Sbjct: 222 SVQASIILGCGRKQTGDYLHGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSICLD---- 277
Query: 253 FEYAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDA 312
E +I G+ + STP I +Y V +E +G L + F+
Sbjct: 278 -ENESGRIIFGDQGHVTQHSTPFLPII-AYMVGVESFCVGS--LCLKETRFQ-------- 325
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAM 372
IDSG++ T+L YQ + E + Q + +W CY+ + +++L P +
Sbjct: 326 -ALIDSGSSFTFLPNEVYQKVVTEFDK--QVNASRIVLQSSWEYCYNAS-SQELVNIPPL 381
Query: 373 AFHFAGGADLVLDAESVFYQESS-----SVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
F+ ++ +FY +S ++FCL V PS D + IG Y
Sbjct: 382 KLAFSRNQTFLIQ-NPIFYDPASQEQEYTIFCLPVSPSA------DDYAAIGQNFLMGYR 434
Query: 428 VAYDLVSKQLYFQRIDCE 445
+ +D + + + R +C+
Sbjct: 435 LVFDRENLRFGWSRWNCQ 452
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 126/507 (24%), Positives = 200/507 (39%), Gaps = 106/507 (20%)
Query: 1 MPSSHAILLLSLITLPFTSTRIFTSTTAAPAAGKPKRLVTKLLHRDSLLYNPNDTVDAQA 60
MPSSH +L SL++ F S I T +++ P T LH L N + + +
Sbjct: 1 MPSSH--ILFSLLS--FLSIIITTFSSSTPN--------TITLHLSPLFTN-HPSSSSHP 47
Query: 61 QRTLNM----SMARFIYLSQKSSQKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVL 116
TL + S+ R +L K+ +T H T + ++ G P VL
Sbjct: 48 FHTLKLAVSTSITRAHHLKNHKPNKSLETPVH----PKTYGGYSIDLEFGTPSQTFPFVL 103
Query: 117 DTGSSLIWVKCQP---CEQCGATT----FDPSKSLTYATLPCDSSYCT------------ 157
DTGS+L+W+ C C +C + + F P S + + C + C
Sbjct: 104 DTGSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGCTNPKCAWVFGPDVKSHCC 163
Query: 158 -------NDCGGYPDEC-WYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSH 209
N+C C Y ++Y G + G + SE NF T D GCS
Sbjct: 164 RQDKAAFNNCS---QTCPAYTVQYGLG-STAGFLLSENLNFPTKKYS-----DFLLGCSV 214
Query: 210 NNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYA--YNMLILGEGAI 267
+ + Q G+ G G S S + ++FSYC+ + + + A + L+L +
Sbjct: 215 VSVY----QPAGIAGFGRGEESLPSQMNL--TRFSYCLLSHQFDDSATITSNLVLETASS 268
Query: 268 LEGDSTPMS--------------VIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAG 313
+G + +S YY+TL+ I +GEK + + L + N D G
Sbjct: 269 RDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPRRLLEPN-VDGDGG 327
Query: 314 VFIDSGTTLTWLVPSAYQ------------TLRKEVEDLFQGLLPSYPMDPAWHLCYSGN 361
+DSG+T T++ + T +E E F GL P C+
Sbjct: 328 FIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQF-GLSP----------CFVLA 376
Query: 362 INRDLQGFPAMAFHFAGGADLVLDAESVF-YQESSSVFCLAVGPSDI--NGERFKDLSII 418
+ FP + F F GGA + L + F V CL + D+ +G I+
Sbjct: 377 GGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAVIL 436
Query: 419 GMIAQQNYNVAYDLVSKQLYFQRIDCE 445
G QQN+ V YDL +++ F+ C+
Sbjct: 437 GNYQQQNFYVEYDLENERFGFRSQSCQ 463
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 151/375 (40%), Gaps = 61/375 (16%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGAT------------TFDPSKSLTYA 147
Y +G P + LDTGS L WV C C +C T + P KS T
Sbjct: 113 YTTVQLGTPGTKFMVALDTGSDLFWVPCD-CSRCAPTEGSPYASDFELSVYSPKKSSTSK 171
Query: 148 TLPCDSSYCT--NDCGGYPDECWYNIRYTNGPDS-QGTIGSEQFNFETSDE-GKTFLYDV 203
T+PC+++ C + C C Y + Y + S G + + + +T + + +
Sbjct: 172 TVPCNNNLCAQRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTEHKHSEPIQAYI 231
Query: 204 GFGCSH-NNAHFSDEQF-TGVFGLGPATSSTHSLVEKVG---SKFSYC-----IGNLNYF 253
FGC + F D G+FGLG S S++ + G + FS C +G +N+
Sbjct: 232 TFGCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGVGRINF- 290
Query: 254 EYAYNMLILGEGAILEGDSTPMSV--IDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
G+ LE + TP ++ + +Y +T+ I +G ++D +D
Sbjct: 291 ---------GDKGSLEQEETPFNLNQLHPNYNITVTSIRVGTTLID------------AD 329
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVE-DLFQGLLPSYPMDPAWHLCYSGNINRDLQGFP 370
DSGT+ ++ Y L G P P P + CY+ + + + P
Sbjct: 330 ITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIP-FEYCYNMSPDANASLTP 388
Query: 371 AMAFHFAGGADL-VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVA 429
++ GG V D V ++ ++CLAV S +L+IIG Y +
Sbjct: 389 GISLTMKGGGPFPVYDPIIVISTQNELIYCLAVVKS-------AELNIIGQNFMTGYRIV 441
Query: 430 YDLVSKQLYFQRIDC 444
+D L +++ DC
Sbjct: 442 FDREKLVLGWKKFDC 456
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 123/472 (26%), Positives = 185/472 (39%), Gaps = 117/472 (24%)
Query: 61 QRTLNMSMARFIYLSQKSS-QKAHDTRAHLHPGISTVPVFYVNFSIGQPPVPQLAVLDTG 119
+ T + S +RF + QK + H L PG F +N PP LDTG
Sbjct: 47 KSTSSRSASRFQHQHQKRHLRNRHQVSLPLSPGSDYTLSFTLN---SNPPQHVSLYLDTG 103
Query: 120 SSLIWVKCQP---------CEQCGATTFDPSKSLTYATLPCDSSYC-------------- 156
S L+W C+P E A+T P S T ++ C SS C
Sbjct: 104 SDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACSAAHSNLPTSDLCA 163
Query: 157 ----------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKT---FLYDV 203
T+DC + +Y Y G G++ + ++ T L++
Sbjct: 164 IADCPLESIETSDCHSFSCPSFY---YAYG---DGSLVARLYHDSIKLPLATPSLSLHNF 217
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSST----HSLVEKVGSKFSYCIGNLNYFEYAYNM 259
FGC AH + + GV G G S S ++G++FSYC+ + ++ +
Sbjct: 218 TFGC----AHTALAEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRL 273
Query: 260 ---LILG-----EGAILEGDSTPM--SVIDGS-----YYVTLEGISLGEKMLDIDPNLFK 304
LILG E + + D + S++D Y V LEGIS+G+K + P K
Sbjct: 274 PSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKIPA-PEFLK 332
Query: 305 KNDTWSDAGVFIDSGTTLTWLVPSAYQTL--------------RKEVEDLFQGLLPSYPM 350
+ D GV +DSGTT T L S Y ++ KEVED GL P Y
Sbjct: 333 RVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDK-TGLGPCYYY 391
Query: 351 DPAWHLCYSGNINRDLQGFPAMAFHFAGG-ADLVLDAESVFY---------QESSSVFCL 400
D ++ P++ HF G + +VL ++ FY + V CL
Sbjct: 392 DTVVNI-------------PSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCL 438
Query: 401 AVGPSDINGERFKDLS-----IIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+ +NG +L+ +G Q + V YDL +++ F R C L
Sbjct: 439 ML----MNGGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKCASL 486
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 163/376 (43%), Gaps = 52/376 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFDPSKSLTYATL 149
++Y + IG P V LDTGS WV C+QC T +DP S++ +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141
Query: 150 PCDSSYCTND--CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFET---SDEGKTFLYDVG 204
CD + CT+ C C Y Y +G + G + ++ ++ + + + V
Sbjct: 142 KCDDTICTSRPPC-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 200
Query: 205 FGC------SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNLNYFEY 255
FGC S NN+ + + G+ G G + + S + G FS+C+ + N
Sbjct: 201 FGCGLQQSGSLNNSAVAID---GIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN---- 253
Query: 256 AYNMLILGEGAILEGDSTPMSVIDGSYY-VTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ +GE + +TP+ + Y+ V L+ I++ L + N+F T G
Sbjct: 254 GGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT---KGT 310
Query: 315 FIDSGTTLTWLVPSAYQTLRKEV----EDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFP 370
FIDSG+TL +L Y L V D+ G + ++ +H + G+++ FP
Sbjct: 311 FIDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNF---QCFH--FLGSVDDK---FP 362
Query: 371 AMAFHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
+ FHF DL LD Y + + +C + I+G +KD+ I+G + N V
Sbjct: 363 KITFHFEN--DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG--YKDMIILGDMVISNKVV 418
Query: 429 AYDLVSKQLYFQRIDC 444
YD+ + + + +C
Sbjct: 419 VYDMEKQAIGWTEHNC 434
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 140/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPGF----SFGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LKRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGSHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|297808489|ref|XP_002872128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317965|gb|EFH48387.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 405
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 80/289 (27%), Positives = 134/289 (46%), Gaps = 39/289 (13%)
Query: 172 RYTNGPDSQGTIGSEQFNFETS--DEGKTFLYDVGF--GCSHNNAHFSDEQFTGVFGLGP 227
RY P S G + S+ +S D+ + GF GC +N +E GV G
Sbjct: 132 RYEASPSSSGYLVSDTLQLTSSITDQENSLSIARGFVFGCGASNRATPEEDGGGVDGRLS 191
Query: 228 ATSSTHSLVEKVG-SKFSYCI-----GNLNYFEYAYNMLILGEGAILEGDST--PMSVID 279
T+ S + ++ ++FS+C+ G+ NY LG A GD PM
Sbjct: 192 LTTHRFSFLSQLRLTRFSHCLWPSSAGSRNYIR-------LGSAASYGGDMVLVPMLNTT 244
Query: 280 G----SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRK 335
G SY+V L GISL ++ + + ++T +G+ ID GT T L PS Y+ ++
Sbjct: 245 GTEAYSYHVALFGISLAQQRM-------RSSET---SGLAIDIGTYYTSLEPSLYEEVK- 293
Query: 336 EVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQESS 395
E+L + P+ + +C++ + D+ P + FHF G D + + ++ ++S
Sbjct: 294 --EELMAQIGPTVAYEVNELMCFTTEVGLDIDSLPKLTFHFQ-GYDYTISNKGLYLRDSP 350
Query: 396 SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
S C A+ S + E + +++IG A ++ V YD + L FQ+ DC
Sbjct: 351 SSLCTALVRSSMKDE--ERINVIGASALVDHAVGYDTSQRMLAFQQRDC 397
>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
Length = 357
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 159/385 (41%), Gaps = 75/385 (19%)
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLPCDSSYC 156
S+G+PPV L +DTGS+L WV+CQPC C FDP +S T + C S C
Sbjct: 4 SLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSSVKC 63
Query: 157 TN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
+C D C Y++ Y NG S G + ++ G +F+ D+ F
Sbjct: 64 GEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSFM-DLMF 117
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLNYFEYAY 257
GCS + + E G+ SS+ S E++ FSYC L E
Sbjct: 118 GCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKAFSYC---LPTDETKP 169
Query: 258 NMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKNDTWSDA 312
+ILG + A ++G TP+ S+ +Y +T E I+ G++++ S +
Sbjct: 170 GYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVT------------SSS 217
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY---------SGN 361
+ +DSG T L PS + L K + G + ++CY +G
Sbjct: 218 EMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGT 277
Query: 362 IN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIG 419
I + P + FAGGA L L +VFY + C+ + + I+G
Sbjct: 278 ITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQ-----ILG 332
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDC 444
+++ +D+ KQ F+ C
Sbjct: 333 NRVTRSFGTTFDIQGKQFGFKYAAC 357
>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
Length = 357
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 159/385 (41%), Gaps = 75/385 (19%)
Query: 104 SIGQPPVPQLAVLDTGSSLIWVKCQPCE-QC------GATTFDPSKSLTYATLPCDSSYC 156
S+G+PPV L +DTGS+L WV+CQPC C FDP +S T + C S C
Sbjct: 4 SLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSSVKC 63
Query: 157 TN----------DCGGYPDECWYNIRYTNG-PDSQGTIGSEQFNFETSDEGKTFLYDVGF 205
+C D C Y++ Y NG S G + ++ G +F+ D+ F
Sbjct: 64 GELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI-----GDSFM-DLMF 117
Query: 206 GCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--------SKFSYCIGNLNYFEYAY 257
GCS + + E G+ SS+ S E++ FSYC L E
Sbjct: 118 GCSMDVKYSEFEA-----GIFGFGSSSFSFFEQLAGYPDILSYKAFSYC---LPTDETKP 169
Query: 258 NMLILG--EGAILEGDSTPM--SVIDGSYYVTLEG-ISLGEKMLDIDPNLFKKNDTWSDA 312
+ILG + A ++G TP+ S+ +Y +T E I+ G++++ S +
Sbjct: 170 GYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVT------------SSS 217
Query: 313 GVFIDSGTTLTWLVPSAYQTLRKEVEDLFQ--GLLPSYPMDPAWHLCY---------SGN 361
+ +DSG T L PS + L K + G + ++CY +G
Sbjct: 218 EMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGT 277
Query: 362 IN--RDLQGFPAMAFHFAGGADLVLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIG 419
I + P + FAGGA L L +VFY + C+ + + I+G
Sbjct: 278 ITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQ-----ILG 332
Query: 420 MIAQQNYNVAYDLVSKQLYFQRIDC 444
+++ +D+ KQ F+ C
Sbjct: 333 NRVTRSFGTTFDIQGKQFGFKYAAC 357
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 94/380 (24%), Positives = 164/380 (43%), Gaps = 51/380 (13%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G P +DTGS ++WV C PC+ C ++ FD +KS +
Sbjct: 81 VGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSAR 140
Query: 148 TLPCDSSYC------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLY 201
LPC C T+ C D C Y+ Y + + G ++ +F+ T
Sbjct: 141 VLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIAN 200
Query: 202 D---VGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNY 252
+ FGCS + + + + G+FG G S S + G FS+C L
Sbjct: 201 SSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHC---LKG 257
Query: 253 FEYAYNMLILGEGAILEGDS--TPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWS 310
E +L+LGE ILE +P+ Y + L+ I+L ++ +P +F S
Sbjct: 258 GENGGGILVLGE--ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFP-NPTMFP----IS 310
Query: 311 DAG-VFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNIN-RDLQG 368
+AG IDSGTTL +LV Y + + + P C+ +++ D+
Sbjct: 311 NAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQ--SATPTISRGSQCFRVSMSVADI-- 366
Query: 369 FPAMAFHFAGGADLVLDAESVFYQES----SSVFCLAVGPSDINGERFKDLSIIGMIAQQ 424
FP + F+F G A +V+ E +S +++C+ ++ L+I+G + +
Sbjct: 367 FPVLRFNFEGIASMVVTPEEYLQFDSIVREPALWCIGFQKAE------DGLNILGDLVLK 420
Query: 425 NYNVAYDLVSKQLYFQRIDC 444
+ + YDL +++ + DC
Sbjct: 421 DKIIVYDLARQRIGWANYDC 440
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 91/347 (26%), Positives = 142/347 (40%), Gaps = 50/347 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFT----FGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172
Query: 262 L-GEGAILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
L G+ A D ++ ++V L IS+ + L + P++F + GV
Sbjct: 173 LGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRK------GVV 226
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
DSG+ L+++ A L + + +L L + + CY + D PA++ H
Sbjct: 227 FDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLH 283
Query: 376 FAGGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
F GA L + VF + S V+CLA P++ +SIIG
Sbjct: 284 FDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 323
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 97/319 (30%), Positives = 144/319 (45%), Gaps = 47/319 (14%)
Query: 138 FDPSKSLTYATLPCDSSYCTN----DCGG---YPDE-CWYNIRYTNGPDSQGTIGSEQFN 189
FD S S T CDS+ C CG +P++ C Y Y + + G I ++F
Sbjct: 25 FDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDKFT 84
Query: 190 FETSDEGKTFLYDVGFGCS-HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIG 248
F + V FGC NN F + TG+ G G S S + KVG+ FS+C
Sbjct: 85 FGAGAS----VPGVAFGCGLFNNGVFKSNE-TGIAGFGRGPLSLPSQL-KVGN-FSHCFT 137
Query: 249 NLNYFEYAYNMLIL-------GEGAILEGDSTPM---SVIDGSYYVTLEGISLGEKMLDI 298
+N + + +L L G GA+ STP+ S YY++L+GI++G L +
Sbjct: 138 AVNGLKQSTVLLDLPADLYKNGRGAV---QSTPLIQNSANPTFYYLSLKGITVGSTRLPV 194
Query: 299 DPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAW-HLC 357
+ F T G IDSGT++T L P YQ +R E + LP P + + C
Sbjct: 195 PESAFAL--TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK--LPVVPGNATGPYTC 250
Query: 358 YSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQ----ESSSVFCLAVGPSDINGERFK 413
+S ++ P + HF GA + L E+ ++ +S+ CLA+ D
Sbjct: 251 FSAP-SQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGD------- 301
Query: 414 DLSIIGMIAQQNYNVAYDL 432
+ +IIG QQN +V YDL
Sbjct: 302 ETTIIGNFQQQNMHVLYDL 320
>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 98/383 (25%), Positives = 156/383 (40%), Gaps = 45/383 (11%)
Query: 101 VNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYCT--- 157
V+ +G PP VLDTGS L + C F+ S SLTY+ + C S C
Sbjct: 67 VSVVVGTPPQNVTMVLDTGSELSGLLCNGSSLSPPAPFNASASLTYSAVDCSSPACVWRG 126
Query: 158 ND------CGGYPD-ECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS-- 208
D C P C +I Y + + G + ++ F T F + S
Sbjct: 127 RDLPVRPFCDAPPSTSCRVSISYADASSADGHLVADTFILGTQAVPALFGCITSYSSSTA 186
Query: 209 -HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGS-KFSYCIGNLNYFEYAYNMLILGEGA 266
+++A E TG+ G+ + S V + + +F+YCI G
Sbjct: 187 INSSATDPSEAATGLLGM---NRGSLSFVTQTATLRFAYCIAPGQGPGILLLGGDGGAAP 243
Query: 267 ILE-----GDSTPMSVIDG-SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGT 320
L S P+ D +Y V LEGI +G +L I ++ + T + +DSGT
Sbjct: 244 PLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGSALLQIPKSVLTPDHTGAGQ-TMVDSGT 302
Query: 321 TLTWLVPSAYQTLRKEVEDLFQGLL-----PSYPMDPAWHLCYSG---NINRDLQGFPAM 372
T+L+ AY L+ E + + LL P + A+ C+ G ++ + P +
Sbjct: 303 QFTFLLADAYAALKAEFLNQARSLLAPLGEPGFVFQGAFDACFRGPEERVSAASRLLPEV 362
Query: 373 AFHFAGGADLVLDAESVFY---------QESSSVFCLAVGPSDINGERFKDLSIIGMIAQ 423
GA++ + E + Y + + +V+CL G SD+ G +IG Q
Sbjct: 363 GLVLR-GAEVAVAGEKLLYSVPGERRGEEGAEAVWCLTFGNSDMAG---MSAYVIGHHHQ 418
Query: 424 QNYNVAYDLVSKQLYFQRIDCEL 446
           Q+  V YDL + ++ F    CEL
Sbjct: 419 QDVWVEYDLQNGRVGFAPARCEL 441
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 108/397 (27%), Positives = 161/397 (40%), Gaps = 66/397 (16%)
Query: 91 PGISTVPV------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFD 139
PG + VP+ + NF+IG PP ++D L+W +C C G FD
Sbjct: 48 PGGAVVPLHWSGACYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFD 107
Query: 140 PSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDE 195
PS S TY C S C T +C G EC Y G D+ G ++ + E
Sbjct: 108 PSASNTYRAEQCGSPLCKSIPTRNCSGD-GECGYEAPSMFG-DTFGIASTDAIAIGNA-E 164
Query: 196 GKTFLYDVGFGC---SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLN 251
G+ + FGC S + + + +G GLG + SLV + + FSYC+
Sbjct: 165 GR-----LAFGCVVASDGSIDGAMDGPSGFVGLG---RTPWSLVGQSNVTAFSYCLA--P 214
Query: 252 YFEYAYNMLILGEGAIL--EGDSTPMSVI----------DGS---YYVTLEGISLGEKML 296
+ + L LG A L G S P + + DGS Y V LEGI G+ +
Sbjct: 215 HGPGKKSALFLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAV 274
Query: 297 DIDPNLFKKNDTWSDAGVF----IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP 352
S G +++ L++L +AYQ L K V + P +P
Sbjct: 275 AA---------ASSGGGAITILQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEP 325
Query: 353 AWHLCYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQE--SSSVFCLAVGPSDINGE 410
+ LC+ N + G P + F F GGA L + + CL++ S
Sbjct: 326 -FDLCFQ---NAAVSGVPDLVFTFQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDS 381
Query: 411 RFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+SI+G + Q+N + +DL + L F+ DC L
Sbjct: 382 ADDGVSILGSLLQENVHFLFDLEKETLSFEPADCSSL 418
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 106/393 (26%), Positives = 162/393 (41%), Gaps = 58/393 (14%)
Query: 91 PGISTVPV------FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA-----TTFD 139
PG + VP+ + NF+IG PP ++D L+W +C C G FD
Sbjct: 48 PGGAVVPLHWSGAHYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFD 107
Query: 140 PSKSLTYATLPCDSSYC----TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDE 195
PS S TY C S C T +C G EC Y G D+ G ++ + E
Sbjct: 108 PSASNTYRAEQCGSPLCKSIPTRNCSG-DGECGYEAPSMFG-DTFGIASTDAIAIGNA-E 164
Query: 196 GKTFLYDVGFGC---SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFSYCIGNLN 251
G+ + FGC S + + + +G GLG + SLV + + FSYC+
Sbjct: 165 GR-----LAFGCVVASDGSIDGAMDGPSGFVGLG---RTPWSLVGQSNVTAFSYCLA--L 214
Query: 252 YFEYAYNMLILGEGAIL--EGDSTPMSVI----------DGS---YYVTLEGISLGEKML 296
+ + L LG A L G S P + + DGS Y V LEGI G+ +
Sbjct: 215 HGPGKKSALFLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAV 274
Query: 297 DIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHL 356
+ + + +++ L++L +AYQ L K V + P +P + L
Sbjct: 275 AA-----ASSGGGAITVLQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEP-FDL 328
Query: 357 CYSGNINRDLQGFPAMAFHFAGGADLVLDAESVFYQE--SSSVFCLAVGPSDINGERFKD 414
C+ N + G P + F F GGA L + + CL++ S
Sbjct: 329 CFQ---NAAVSGVPDLVFTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDG 385
Query: 415 LSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELL 447
+SI+G + Q+N + +DL + L F+ DC L
Sbjct: 386 VSILGSLLQENVHFLFDLEKETLSFEPADCSSL 418
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 139/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ + +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + +F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----FGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGIHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 139/345 (40%), Gaps = 48/345 (13%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPGF----SFGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NMDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFS 172
Query: 262 LGEGAILEGDSTPMSVIDGS----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFID 317
LG+ A V ++V L IS+ + L + P++F + GV D
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK------GVVFD 226
Query: 318 SGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFA 377
SG+ L+++ A L + + +L L + + CY + D PA++ HF
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLHFD 283
Query: 378 GGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
GA L VF + S V+CLA P++ +SIIG
Sbjct: 284 DGARFDLGRGGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 321
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 99/342 (28%), Positives = 140/342 (40%), Gaps = 45/342 (13%)
Query: 130 CEQCGATTFDPSKSLTYATLPCDSSYCTNDCGGY----PDECWYNIRYTNGPDSQGTIGS 185
C A F P+ S T++ LPC SS C Y C Y Y G + G + +
Sbjct: 88 CAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLAT 146
Query: 186 EQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG-SKFS 244
E + G V FGCS N +G+ GLG S SLV +VG +FS
Sbjct: 147 ETLHV-----GGASFPGVAFGCSTENG--VGNSSSGIVGLG---RSPLSLVSQVGVGRFS 196
Query: 245 YCI------GNLNYFEYAYNMLILGEG--AILEGDSTPMSVIDGSYYVTLEGISLGEKML 296
YC+ G+ + + G+ AILE P S YYV L GI++G L
Sbjct: 197 YCLRSDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSY---YYVNLTGITVGATDL 253
Query: 297 DIDPNLF---KKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDP- 352
+ F + G +DSGTTLT+LV Y +++ + ++
Sbjct: 254 PVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGT 313
Query: 353 --AWHLCYSGNINRDLQGFPA--MAFHFAGGADLVLDAES------VFYQESSSVFCLAV 402
+ LC+ N G P + FAGGA+ + S V Q ++V CL V
Sbjct: 314 RFGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLV 373
Query: 403 GPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDC 444
P+ E+ +SIIG + Q + +V YDL F DC
Sbjct: 374 LPAS---EKL-SISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 116/261 (44%), Gaps = 34/261 (13%)
Query: 96 VPVFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------FDPSKSLTYA 147
V +++ +G P +DTGS ++W+ C C C ++ FD + S T A
Sbjct: 68 VGLYFTKVKMGSPAKEFYVQIDTGSDILWLNCNTCNNCPKSSGLGIDLNYFDTASSSTAA 127
Query: 148 TLPCDSSYC-------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFL 200
+ C C T+ C ++C Y +Y +G + G + F+ F
Sbjct: 128 LVSCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYYVYDAMYFDVIMGQSVFS 187
Query: 201 YD---VGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLN 251
V FGCS + +++ G+FG GP S S V G FS+C+
Sbjct: 188 NSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQG 247
Query: 252 YFEYAYNMLILGEGAILEGD--STPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTW 309
+L+LGE ILE + TP+ + Y + L+ I++ ++L ID ++F T
Sbjct: 248 ---SGGGILVLGE--ILEPNIVYTPLVPLQPHYNLNLQSIAVNGQILPIDQDVFA---TG 299
Query: 310 SDAGVFIDSGTTLTWLVPSAY 330
           ++ G  +DSGTTL +LV  AY
Sbjct: 300 NNRGTIVDSGTTLAYLVQEAY 320
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/263 (27%), Positives = 116/263 (44%), Gaps = 30/263 (11%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFDPSKSLTYATL 149
++Y IG P +DTGS ++WV C C++C T +DP S T + +
Sbjct: 32 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 91
Query: 150 PCDSSYCTNDCGG-YPD-----ECWYNIRYTNGPDSQGTIGSEQFNF-ETSDEGKTFLYD 202
CD +C GG P C Y++ Y +G + G S+ F + S +G+T +
Sbjct: 92 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 151
Query: 203 --VGFGCSHNNA---HFSDEQFTGVFGLGPATSSTHSLVE---KVGSKFSYCIGNLNYFE 254
V FGC S++ G+ G G + +S S + KV F++C+ +N
Sbjct: 152 STVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTIN--- 208
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ +G + +TP+ Y V L+ I +G L + ++F DT G
Sbjct: 209 -GGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMF---DTGEKKGT 264
Query: 315 FIDSGTTLTWLVPSAYQTLRKEV 337
            IDSGTTLT+L    Y+ +   V
Sbjct: 265 IIDSGTTLTYLPEIVYKEIMLAV 287
>gi|326532334|dbj|BAK05096.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 437
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 150/362 (41%), Gaps = 54/362 (14%)
Query: 116 LDTGSSLIWVKCQPC----EQCGATTFDPSKSLTYATLPCDSSYCT--------NDCGGY 163
LD +L W++CQPC Q GA FD ++S Y + CT N C Y
Sbjct: 87 LDLVGNLTWMQCQPCVPEVRQEGAV-FDSAESPRYKHMKATDPMCTPPYTPSVGNRCSFY 145
Query: 164 PDECWYNIRYTNGPDSQGTIGSEQFNFETSDEG--KTFLYDVGFGCSHNN---AHFSDEQ 218
+N+ + G +GS+ F F + G T + + FGC+H S
Sbjct: 146 TTT--WNV------AAHGYLGSDMFAFAGTGAGGHSTDVDQLIFGCAHTTDGLERLSHGV 197
Query: 219 FTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLNYFEYA-YNMLILGEGAILEGDSTP 274
G L S S + G S+FSYC+ A + L G +
Sbjct: 198 LAGALSLSRHPMSFLSQLTARGLADSRFSYCLFPEQSHPIAKHGFLRFGRDIPRHDHAHS 257
Query: 275 MSVI------DGSYYVTLEGISL-GEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVP 327
S++ G Y++ + GISL G +++ + P +F +N G +D GT LT LV
Sbjct: 258 TSLLFTGPGSGGMYHIRVVGISLNGRRIMRLQPAMFTRNLQTRRGGSVVDPGTPLTRLVR 317
Query: 328 SAYQTLRKEVEDLF--QGLLPSYPMDPAWHLCYS--GNINRDLQGFPAMAFH-FAGGADL 382
AY + EV QG + LC+ G+++ P++ + + A L
Sbjct: 318 QAYDIVEAEVVANMQKQGARRAKAQVQGHRLCFVSWGHVH-----LPSLTINMYEDTAKL 372
Query: 383 VLDAESVFYQESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYFQRI 442
+ E +F + ++ + C V P + +++++G Q + +DL + +LYF +
Sbjct: 373 FIKPELLFRKVTARLLCFTVMPDE-------EMTVLGAAQQMDTRFTFDLHANRLYFAQE 425
Query: 443 DC 444
+C
Sbjct: 426 NC 427
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 93/377 (24%), Positives = 161/377 (42%), Gaps = 57/377 (15%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPC-------EQCGATTFDPSKSLTYATLPC 151
F++ S+G P V L +DTGS++ WV+CQ C +Q TF+ S S TY + C
Sbjct: 23 FFMGISLGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYRRVGC 82
Query: 152 DSSYCTN---------DCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD 202
+ C + C D C Y++RY +G S G + ++ S + F+
Sbjct: 83 SAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAGYLSQDRLTLANSYSIQKFI-- 140
Query: 203 VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG--SKFSYCI-------GNLNYF 253
FGC +N + + G+ G G + S + + ++ S FSYC G L+
Sbjct: 141 --FGCGSDNRY--NGHSAGIIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQENEGFLSIG 196
Query: 254 EYAY--NMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSD 311
Y N LIL + G P+ + + + + G+ L +DP ++ T
Sbjct: 197 PYVRDSNKLILTQ-LFDYGAHLPVYALQ-QFDMMVNGMR-----LQVDPPVYTTRMT--- 246
Query: 312 AGVFIDSGTTLTWLVPSAYQTLRKEVED--LFQGLLPSYPMDPAWHLCYSGNINR-DLQG 368
+DSGT T+++ ++ L + + + +G + + +C+ N + D
Sbjct: 247 ---VVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRG---SDSKEICFHSNGDSVDWSK 300
Query: 369 FPAMAFHFAGGADLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
P + F+ L L AE+VFY E+S C P D + I+G A +++
Sbjct: 301 LPVVEIKFSRSI-LKLPAENVFYYETSDGSICSTFQPDDAG---VPGVQILGNRATRSFR 356
Query: 428 VAYDLVSKQLYFQRIDC 444
V +D+ + F+ C
Sbjct: 357 VVFDIQQRNFGFEAGAC 373
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 154/383 (40%), Gaps = 51/383 (13%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGA--------TTFDPSKSLTYATL 149
+++ +G P + +DTGS ++WV C+PC C T +DP +S T + +
Sbjct: 28 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 87
Query: 150 PCDSSYCTN-------DCGGYPDECWYNIRYTNGPDSQG--TIGSEQFNFETSDEGKTFL 200
C C C + C Y Y +G S+G + Q+N +S+
Sbjct: 88 SCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTT 147
Query: 201 YDVGFGCS---HNNAHFSDEQFTGVFGLGPATSSTHSLV---EKVGSKFSYCIGNLNYFE 254
V FGCS + S + G+ G G S + + + + FS+C+
Sbjct: 148 SQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGG 207
Query: 255 YAYNMLILGEGAILEGDSTPMSVIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ + E + P SV Y V L GIS+ L ID F + D GV
Sbjct: 208 GILVIGGIAEPGMTYTPLVPDSV---HYNVVLRGISVNSNRLPIDAEDFSSTN---DTGV 261
Query: 315 FIDSGTTLTWLVPSAYQTLRKEVEDLFQGL-LPSYPMDPAWHLCYSGNINRDLQGFPAMA 373
+DSGTTL + AY + + + + MD L SG ++ DL FP +
Sbjct: 262 IMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFL-VSGRLS-DL--FPNVT 317
Query: 374 FHFAGGA-----DLVLDAESVFYQESSSVFCL-------AVGPSDINGERFKDLSIIGMI 421
+F GGA D L ++ V+C+ + GP D L+I+G I
Sbjct: 318 LNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKD-----GSQLTILGDI 372
Query: 422 AQQNYNVAYDLVSKQLYFQRIDC 444
++ V YDL + ++ + +C
Sbjct: 373 VLKDKLVVYDLDNSRIGWMSYNC 395
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 91/347 (26%), Positives = 141/347 (40%), Gaps = 50/347 (14%)
Query: 99 FYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--GATTFDPSKSLTYATLPCDSSYC 156
+ ++ +G P Q+ +DTGSS WV C+ C+ C TF S+S T A + C +S C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 59
Query: 157 --------TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCS 208
D YPD C + + Y +G S G + + F + F FGC
Sbjct: 60 LLGGSDPHCQDSENYPD-CPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFT----FGC- 113
Query: 209 HNNAHFSDEQF---TGVFGLGPATSSTHSLVEKVGSKFSYCI----GNLNYFEYAYNMLI 261
N F +F G+ G+G S FSYC+ +F
Sbjct: 114 -NMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172
Query: 262 L-GEGAILEGDSTPMSVIDGS-----YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVF 315
L G+ A D ++ ++V L IS+ + L + P++F + GV
Sbjct: 173 LGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRK------GVV 226
Query: 316 IDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFH 375
DSG+ L+++ A L + + +L L + + CY + D PA++ H
Sbjct: 227 FDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMR-SVDEGDMPAISLH 283
Query: 376 FAGGADLVLDAESVFYQES---SSVFCLAVGPSDINGERFKDLSIIG 419
F GA L VF + S V+CLA P++ +SIIG
Sbjct: 284 FDDGARFDLGRHGVFVERSVQEQDVWCLAFAPTE-------SVSIIG 323
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 155/366 (42%), Gaps = 39/366 (10%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCG-----ATTFDPSKSLTYATL--- 149
++ ++FS+G PP VLD S +W++C C CG AT+ P + +T+
Sbjct: 96 MYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREV 155
Query: 150 PCDSSYCTN----DCGGYPDECWYNIRYTNGP--DSQGTIGSEQFNFETSDEGKTFLYDV 203
C + C C C Y+ Y G + G + + F F T
Sbjct: 156 RCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVI---- 211
Query: 204 GFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSKFSYCIGNLNYFEYAYNMLILG 263
FGC A ++ GV GLG S S ++ +FSY + + + +L L
Sbjct: 212 -FGC----AVATEGDIGGVIGLGRGELSPVSQLQI--GRFSYYLAPDDAVDVGSFILFLD 264
Query: 264 EGA--ILEGDSTPMSVIDGS---YYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDS 318
+ STP+ S YYV L GI + + L I F S GV +
Sbjct: 265 DAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGS-GGVVLSI 323
Query: 319 GTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFPAMAFHFAG 378
+T+L AY+ +R+ + + L + + LCY+ + P+MA FAG
Sbjct: 324 TIPVTFLDAGAYKVVRQAMASKIE-LRAADGSELGLDLCYTSE-SLATAKVPSMALVFAG 381
Query: 379 GADLVLDAESVFYQESSS-VFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQL 437
GA + L+ + FY +S++ + CL + PS D S++G + Q ++ YD+ +L
Sbjct: 382 GAVMELEMGNYFYMDSTTGLECLTILPSPAG-----DGSLLGSLIQVGTHMIYDISGSRL 436
Query: 438 YFQRID 443
F+ ++
Sbjct: 437 VFESLE 442
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 160/369 (43%), Gaps = 52/369 (14%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQC--------GATTFDPSKSLTYATL 149
++Y + IG P V LDTGS WV C+QC T +DP S++ +
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 117
Query: 150 PCDSSYCTND--CGGYPDECWYNIRYTNGPDSQGTIGSEQFNFET---SDEGKTFLYDVG 204
CD + CT+ C C Y Y +G + G + ++ ++ + + + V
Sbjct: 118 KCDDTICTSRPPC-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 176
Query: 205 FGC------SHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVGSK---FSYCIGNLNYFEY 255
FGC S NN+ + + G+ G G + + S + G FS+C+ + N
Sbjct: 177 FGCGLQQSGSLNNSAVAID---GIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN---- 229
Query: 256 AYNMLILGEGAILEGDSTPMSVIDGSYY-VTLEGISLGEKMLDIDPNLFKKNDTWSDAGV 314
+ +GE + +TP+ + Y+ V L+ I++ L + N+F T G
Sbjct: 230 GGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT---KGT 286
Query: 315 FIDSGTTLTWLVPSAYQTLRKEV----EDLFQGLLPSYPMDPAWHLCYSGNINRDLQGFP 370
FIDSG+TL +L Y L V D+ G + ++ +H + G+++ FP
Sbjct: 287 FIDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNF---QCFH--FLGSVDDK---FP 338
Query: 371 AMAFHFAGGADLVLDAESVFY--QESSSVFCLAVGPSDINGERFKDLSIIGMIAQQNYNV 428
+ FHF DL LD Y + + +C + I+G +KD+ I+G + N V
Sbjct: 339 KITFHFEN--DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG--YKDMIILGDMVISNKVV 394
Query: 429 AYDLVSKQL 437
YD+ + +
Sbjct: 395 VYDMEKQAI 403
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/366 (27%), Positives = 152/366 (41%), Gaps = 52/366 (14%)
Query: 98 VFYVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCE--QC-GATTFDPSKSLTYATLPCDSS 154
+F VN G P ++DTGS W++C C C TF+PS S +Y+ C S
Sbjct: 128 LFLVNVGFGTPQQKFNLIIDTGSDTTWIQCNSCSLGNCHNKKTFNPSLSSSYSNRSCIPS 187
Query: 155 YCTNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHN-NAH 213
TN Y ++Y + S+G ++ + FGC +
Sbjct: 188 TDTN----------YTMKYEDNSYSKGVFVCDEVTLKPD-----VFPKFQFGCGDSGGGE 232
Query: 214 FSDEQFTGVFGLGPATSSTHSLVEKVGS----KFSYCIGNLNYFEYAYNMLILGEGAILE 269
F +GV GL A +SL+ + S KFSYC E+ L+ GE AI
Sbjct: 233 FGTA--SGVLGL--AKGEQYSLISQTASKFKKKFSYCFPPK---EHTLGSLLFGEKAISA 285
Query: 270 GDSTPMSVIDG-----SYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTW 324
S + + Y+V L GIS+ +K L++ +LF + G IDSGT +T
Sbjct: 286 SPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSLF------ASPGTIIDSGTVITR 339
Query: 325 LVPSAYQTLRK--EVEDLFQGLLPSYPMDPAWHLCYS--GNINRDLQGFPAMAFHFAGGA 380
L +AY+ LR + E L + P + CY+ G R+++ P + HF G
Sbjct: 340 LPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCYNLKGCGGRNIK-LPEIVLHFVGEV 398
Query: 381 DLVLDAESVFYQESS-SVFCLAVGPSDINGERFKDLSIIGMIAQQNYNVAYDLVSKQLYF 439
D+ L + + + CLA ++IIG Q + V YD+ +L F
Sbjct: 399 DVSLHPSGILWANGDLTQACLAFA----RKSNPSHVTIIGNRQQVSLKVVYDIEGGRLGF 454
Query: 440 QRIDCE 445
DC+
Sbjct: 455 GN-DCK 459
>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
Length = 416
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 109/419 (26%), Positives = 174/419 (41%), Gaps = 85/419 (20%)
Query: 75 SQKSSQKAHDTRAHLH-------------PGISTVPVFY-------VNFSIGQPPVPQLA 114
++ ++ +AHD R L G S VP+ + NF+IG PP P A
Sbjct: 23 TRTAAFRAHDLRRGLEQAMRGRLLADATPAGGSAVPIHWSRHLYNVANFTIGTPPQPASA 82
Query: 115 VLDTGSSLIWVKCQPCEQCGATTFDPSKSLTYATLPCDSSYC----TNDCGGYPDECWY- 169
++D PC P+ S T+ PC + C T++C + C Y
Sbjct: 83 IIDVAGP------APCSF-------PNASSTFRPEPCGTDACKSIPTSNCSS--NMCTYE 127
Query: 170 -NIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYDVGFGCSHNNAHFSDEQFTGVFGLGPA 228
I G + G + ++ F T+ +GFGC + + +G+ GLG A
Sbjct: 128 GTINSKLGGHTLGIVATDTFAIGTATA------SLGFGCVVASGIDTMGGPSGLIGLGRA 181
Query: 229 TSSTHSLVEKVG-SKFSYCIGNLNYFEYAYN-MLILGEGAILEG----------DSTPMS 276
SS LV ++ +KFSYC L + N L+LG A L G ++P
Sbjct: 182 PSS---LVSQMNITKFSYC---LTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKTSPGD 235
Query: 277 VIDGSYYVTLEGISLGEKMLDIDPNLFKKNDTWSDAGVFIDSGTTLTWLVPSAYQTLRKE 336
+ Y + L+GI G+ + + P S V + + +++LV SAYQ L+KE
Sbjct: 236 DMSQYYPIQLDGIKAGDAAIALPP---------SGNTVLVQTLAPMSFLVDSAYQALKKE 286
Query: 337 VEDLFQGLLPSYPMDPAWHLCY--SGNINRDLQGFPAMAFHFAGGADLVLDAESVF---Y 391
V + P+ P + LC+ +G N P + F F GA + +
Sbjct: 287 VTKAVGAAPTATPLQP-FDLCFPKAGLSNASA---PDLVFTFQQGAAALTVPPPKYLIDV 342
Query: 392 QESSSVFCLAV-GPSDINGERF-KDLSIIGMIAQQNYNVAYDLVSKQLYFQRIDCELLA 448
E C+A+ S +N ++L+I+G + Q+N + DL K L F+ DC L+
Sbjct: 343 GEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADCAHLS 401
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 145/378 (38%), Gaps = 58/378 (15%)
Query: 100 YVNFSIGQPPVPQLAVLDTGSSLIWVKCQPCEQCGATT--------------FDPSKSLT 145
Y IG P V L LDTGS L+W+ C C QC T ++PS S T
Sbjct: 101 YTWIDIGTPSVSFLVALDTGSDLLWIPCN-CVQCAPLTSTYYSSLATKDLNEYNPSSSST 159
Query: 146 YATLPCDSSYC--TNDCGGYPDECWYNIRYTNGPDSQGTIGSEQFNFETSDEGKTFLYD- 202
C C +DC ++C Y + Y +G S + E T + +
Sbjct: 160 SKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGS 219
Query: 203 --------VGFGCSHNNAHFSDEQFTGVFGLGPATSSTHSLVEKVG---SKFSYCIGNLN 251
+G G + + G+ GLGPA S S + K G + FS C
Sbjct: 220 SSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLC----- 274
Query: 252 YFEYAYNMLILGEGAILEGDSTPMSVIDGS--YYVTLEGISLGEKMLDIDPNLFKKNDTW 309
+ E + G+ STP ++ + Y V +E +G L K ++
Sbjct: 275 FDEEDSGRIYFGDMGPSIQQSTPFLQLENNSGYIVGVEACCIGNSCL--------KQTSF 326
Query: 310 SDAGVFIDSGTTLTWLVPSAYQTLRKEVEDLFQGLLPSYPMDPAWHLCYSGNINRDLQGF 369
+ FIDSG + T+L Y+ + E++ S+ +W CY ++ +
Sbjct: 327 T---TFIDSGQSFTYLPEEIYRKVALEIDRHINATSKSFE-GVSWEYCYESSVEPKV--- 379
Query: 370 PAMAFHFAGGADLVLDAESVFYQESSSV--FCLAVGPSDINGERFKDLSIIGMIAQQNYN 427
PA+ F+ V+ +Q+S + FCL + PS G + IG + Y
Sbjct: 380 PAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSGQEG-----IGSIGQNYMRGYR 434
Query: 428 VAYDLVSKQLYFQRIDCE 445
+ +D + +L + C+
Sbjct: 435 MVFDRENMKLRWSASKCQ 452
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.416
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,437,665,945
Number of Sequences: 23463169
Number of extensions: 330668933
Number of successful extensions: 664524
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 760
Number of HSP's successfully gapped in prelim test: 1819
Number of HSP's that attempted gapping in prelim test: 657776
Number of HSP's gapped (non-prelim): 2984
length of query: 450
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 304
effective length of database: 8,933,572,693
effective search space: 2715806098672
effective search space used: 2715806098672
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)