BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 014679
(420 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255552241|ref|XP_002517165.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543800|gb|EEF45328.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 434
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 262/434 (60%), Positives = 328/434 (75%), Gaps = 16/434 (3%)
Query: 3 VAQSFLLLCSLLIFSLSP-ITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLT 61
+A+ ++ CSL++F + P I QTSFRPKALVL V++D +TLQY+T I QRTPLVPVKLT
Sbjct: 1 MAKLLIIFCSLMLFFVYPSIADQTSFRPKALVLPVSRDPSTLQYLTSINQRTPLVPVKLT 60
Query: 62 VHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKAC------------GGGICGAGP 109
+ LGG LWVDC++GYVSS+ K RC SAQC+LA +K+C C P
Sbjct: 61 LDLGGQYLWVDCDQGYVSSSYKPVRCRSAQCSLAKSKSCISECFSSPRPGCNNDTCALLP 120
Query: 110 DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGL 169
DN ++++GT G++ DV+++QSTDG +PGR V+VP IF C + F+L+GLASGV G+AGL
Sbjct: 121 DNTVTHSGTSGEVGQDVVTVQSTDGFSPGRVVSVPKLIFTCATTFLLEGLASGVKGMAGL 180
Query: 170 GRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDVSKNLKYTPLFI 228
GR+K++LPSQ +AAFS RKFA+CL+ + G + FGDGPY L N DVSK+L YTPL +
Sbjct: 181 GRTKISLPSQFSAAFSFDRKFAICLTSSNAKGIVFFGDGPYVFLPNIDVSKSLIYTPLIL 240
Query: 229 NKVNTASGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVL 287
N V+TAS F G+PS EYFIGV S+ +NGKAVPLN +LL ID EGVGGTKISTV+PYTVL
Sbjct: 241 NPVSTASAFFKGDPSSEYFIGVKSIKINGKAVPLNTSLLFIDKEGVGGTKISTVDPYTVL 300
Query: 288 ETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVW 347
ET+IY+A+ + F + +V RVAPV+PFG CF +IG TR+GP VPQIDLVLQ+ +V W
Sbjct: 301 ETTIYQAVTKVFIKELAEVPRVAPVSPFGVCFNSSNIGSTRVGPAVPQIDLVLQSSSVFW 360
Query: 348 SIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLF 407
I G NSMVQ+ D LCLGFVDGG+NPRTSIVIG Q+E+NLLQFDLA S+LGFS SLLF
Sbjct: 361 RIFGANSMVQVKSDVLCLGFVDGGLNPRTSIVIGGHQIEDNLLQFDLAASKLGFSSSLLF 420
Query: 408 ERATCT-FNFTSIA 420
+ TC FNFTS A
Sbjct: 421 RQTTCANFNFTSKA 434
>gi|222822564|gb|ACM68431.1| xyloglucan-specific endoglucanase inhibitor protein [Capsicum
annuum]
Length = 437
Score = 517 bits (1331), Expect = e-144, Method: Compositional matrix adjust.
Identities = 259/428 (60%), Positives = 320/428 (74%), Gaps = 15/428 (3%)
Query: 8 LLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGN 67
+L CS L F+ + QTSFRPK L++ V KD +TLQY+TQI+QRTPLVPV LT+ LGG
Sbjct: 10 ILFCSFLFFTSTIAQNQTSFRPKGLIIPVMKDGSTLQYLTQIQQRTPLVPVSLTLDLGGQ 69
Query: 68 ILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNT 116
LWVDC++GYVSS+ K ARC SAQC+LA A CG CG PDN ++ T
Sbjct: 70 FLWVDCDQGYVSSSYKPARCRSAQCSLAGATGCGECFSPPRPGCNNNTCGLFPDNTVTRT 129
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T G++ DV+S+QS++G+NPGR V+ NF+F+CG+ F+LQGLASGV G+AGLGR++++L
Sbjct: 130 ATSGELASDVVSVQSSNGKNPGRNVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTRISL 189
Query: 177 PSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNTA 234
PSQ +A FS RKFA+CLS G ++FGDGPY+ L N + S N +YTPL IN V+TA
Sbjct: 190 PSQFSAEFSFPRKFAVCLSSSKSKGVVLFGDGPYFFLPNTEFSNNDFQYTPLLINPVSTA 249
Query: 235 SGF-LGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYK 293
S F G+PS EYFIGV SV +N K VP+N TLLSIDN+GVGGTKISTVNPYTVLETS+Y
Sbjct: 250 SAFSAGQPSSEYFIGVKSVKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTVLETSLYN 309
Query: 294 ALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQN 353
A+ F + V RVA VAPFGACF ++IG TR+GP VPQIDLVLQN+NV+W+I G N
Sbjct: 310 AITNFFVKELANVTRVASVAPFGACFDSRNIGSTRVGPAVPQIDLVLQNENVIWTIFGAN 369
Query: 354 SMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC- 412
SMVQ+ + LCLGFVDGGVN RTSIVIG +E+NLLQ D+A SRLGF+ S+LF + TC
Sbjct: 370 SMVQVSENVLCLGFVDGGVNSRTSIVIGGHTIEDNLLQLDIARSRLGFTSSILFRQTTCA 429
Query: 413 TFNFTSIA 420
FNFTSIA
Sbjct: 430 NFNFTSIA 437
>gi|225432542|ref|XP_002277699.1| PREDICTED: basic 7S globulin-like [Vitis vinifera]
Length = 435
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 261/418 (62%), Positives = 310/418 (74%), Gaps = 15/418 (3%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY 77
SP AQTSFRPKALVL V+KD A+LQY+T I QRT LV + LT+ LGG LWVDC++GY
Sbjct: 18 FSPSLAQTSFRPKALVLPVSKDAASLQYITHINQRTHLVSIPLTLDLGGQFLWVDCDQGY 77
Query: 78 VSSTNKTARCGSAQCNLANAKACGG-----------GICGAGPDNPISNTGTHGDIRIDV 126
VSS+ + RCGSAQC+L +KACG C PDN ++ T T G++ D
Sbjct: 78 VSSSYRPVRCGSAQCSLTRSKACGECFSGPVKGCNYSTCVLSPDNTVTGTATSGEVGEDA 137
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
+SIQSTDG NPGR V+V +F CGS F+L+GLAS V G+AGLGRS+VALPSQ ++AFS
Sbjct: 138 VSIQSTDGSNPGRVVSVRRLLFTCGSTFLLEGLASRVKGMAGLGRSRVALPSQFSSAFSF 197
Query: 187 KRKFALCLSPF--DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASG-FLGEPSV 243
RKF++CLS G + FGDGPY L D S++L YTPL N V+TAS F GE SV
Sbjct: 198 NRKFSICLSSSTKSTGVVFFGDGPYVLLPKVDASQSLTYTPLITNPVSTASAYFQGEASV 257
Query: 244 EYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
EYFIGV S+ +NGKAVPLN TLLSID++G GGTKISTV+PYTVLETSIYKA+ QAF +
Sbjct: 258 EYFIGVKSIKINGKAVPLNATLLSIDSQGYGGTKISTVHPYTVLETSIYKAVTQAFLKEL 317
Query: 304 PKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL 363
+ RVA V+PFGACF +DIG TR+GP VP IDLVLQ ++V W + G NSMVQ+ + L
Sbjct: 318 STITRVASVSPFGACFSSKDIGSTRVGPAVPPIDLVLQRQSVYWRVFGANSMVQVSDNVL 377
Query: 364 CLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTSIA 420
CLGFVDGGVNPRTSIVIG RQLE+NLLQFDLATSRLGFS SLL + TC+ FNFTS A
Sbjct: 378 CLGFVDGGVNPRTSIVIGGRQLEDNLLQFDLATSRLGFSSSLLSRQTTCSNFNFTSNA 435
>gi|295646769|gb|ADG23123.1| xyloglucan specific endoglucanase inhibitor [Solanum melongena]
Length = 437
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 255/428 (59%), Positives = 320/428 (74%), Gaps = 15/428 (3%)
Query: 8 LLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGN 67
+LLC LL F+ + QTSFRPK L++ V KD +TLQY+TQI+QRTPLVP+ LT+ LGG
Sbjct: 10 ILLCCLLFFTSTIAQNQTSFRPKGLIIPVTKDASTLQYLTQIQQRTPLVPISLTLDLGGQ 69
Query: 68 ILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNT 116
LWVDC++GYVSS+ K ARC SAQC+LA A ACG C PDN ++ T
Sbjct: 70 FLWVDCDQGYVSSSYKPARCRSAQCSLAGASACGECFSPPRPGCNNNTCSLFPDNTVTGT 129
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T G++ D++S+QS++G+NPGR V+ NF+F+CG+ F+LQGLASGV G+AGLGR++++L
Sbjct: 130 ATGGELASDIVSVQSSNGKNPGRNVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTRISL 189
Query: 177 PSQLAAAFSLKRKFALCL-SPFDDGAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNTA 234
PSQ +A FS RKFALCL S G ++FGDGPY+ L N + S N +YTPLFIN V+TA
Sbjct: 190 PSQFSAEFSFPRKFALCLTSSNSKGVVLFGDGPYFFLPNKEFSNNDFQYTPLFINPVSTA 249
Query: 235 SGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYK 293
+ F G+PS EYFIGV S+ +N K VP+N TLLSIDN+GVGGTK+STVNPYTV+ETS+Y
Sbjct: 250 AAFSSGQPSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKLSTVNPYTVMETSLYN 309
Query: 294 ALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQN 353
A+ F + V RVAPV PFGACF ++IG TR+GP VP IDLVLQN+NVVW+I G N
Sbjct: 310 AITNFFVKELANVTRVAPVTPFGACFDSRNIGSTRVGPAVPWIDLVLQNQNVVWTIFGAN 369
Query: 354 SMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC- 412
SMVQ+ + LCLG VDGGVN RTSIVIG +E+NLLQFD A SRLGF+ S+LF + TC
Sbjct: 370 SMVQVSENVLCLGIVDGGVNARTSIVIGGHTIEDNLLQFDHAASRLGFTSSILFRQTTCA 429
Query: 413 TFNFTSIA 420
FNFTS+A
Sbjct: 430 NFNFTSVA 437
>gi|32482806|gb|AAP84703.1| putative xyloglucanase inhibitor [Solanum tuberosum]
Length = 437
Score = 510 bits (1313), Expect = e-142, Method: Compositional matrix adjust.
Identities = 255/428 (59%), Positives = 318/428 (74%), Gaps = 15/428 (3%)
Query: 8 LLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGN 67
+LLCSLL + + QTSFRPK L++ V KD +TLQY+TQI+QRTPLVP+ LT+ LGG
Sbjct: 10 ILLCSLLFITSTIAQNQTSFRPKGLIIPVTKDASTLQYLTQIQQRTPLVPISLTLDLGGQ 69
Query: 68 ILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNT 116
LWVDC++GYVSS+ K ARC SAQC+L A CG CG PDN ++ T
Sbjct: 70 FLWVDCDQGYVSSSYKPARCRSAQCSLGGASGCGECFSPPRPGCNNNTCGLLPDNTVTRT 129
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T G++ D++S+QST+G+NPGR+V+ NF+F+CG+ F+LQGLASGV G+AGLGR++++L
Sbjct: 130 ATSGELASDIVSVQSTNGKNPGRSVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTRISL 189
Query: 177 PSQLAAAFSLKRKFALCL-SPFDDGAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNTA 234
PSQ +A FS RKFALCL S G ++FGDGPY+ L N + S N +YTPLFIN V+TA
Sbjct: 190 PSQFSAEFSFPRKFALCLTSSNSKGVVLFGDGPYFFLPNREFSNNDFQYTPLFINPVSTA 249
Query: 235 SGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYK 293
S F G+PS EYFIGV S+ +N K VP+N TLLSIDN+GVGGTKISTVNPYT+LETS+Y
Sbjct: 250 SAFSSGQPSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSLYN 309
Query: 294 ALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQN 353
A+ F + V RVA VAPF CF ++IG TR+GP VP IDLVLQN+NVVW+I G N
Sbjct: 310 AITNFFVKELANVTRVAAVAPFKVCFDSRNIGSTRVGPAVPSIDLVLQNENVVWTIFGAN 369
Query: 354 SMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC- 412
SMVQ+ + LCLG +DGGVN RTSIVIG +E+NLLQFD A SRLGF+ S+LF + TC
Sbjct: 370 SMVQVSENVLCLGVLDGGVNSRTSIVIGGHTIEDNLLQFDHAASRLGFTSSILFRQTTCA 429
Query: 413 TFNFTSIA 420
FNFTSIA
Sbjct: 430 NFNFTSIA 437
>gi|62362434|gb|AAX81588.1| nectarin IV [Nicotiana langsdorffii x Nicotiana sanderae]
Length = 437
Score = 509 bits (1312), Expect = e-142, Method: Compositional matrix adjust.
Identities = 257/429 (59%), Positives = 317/429 (73%), Gaps = 16/429 (3%)
Query: 8 LLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGN 67
+LLCSLL + + QTSFRPK L+L + KD +TLQY+TQI QRT LVPV LT+ LGG
Sbjct: 9 ILLCSLLFITSTTAQNQTSFRPKGLILPITKDASTLQYLTQIHQRTHLVPVSLTLDLGGQ 68
Query: 68 ILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNT 116
LWVDC++GYVSS+ K ARC SAQC+LA A CG C PDN I+ T
Sbjct: 69 FLWVDCDQGYVSSSYKPARCRSAQCSLAGAGGCGQCFSPPKPGCNNNTCSLLPDNTITRT 128
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T G++ D++ +QS++G+NPGR VT +F+F+CGS F+L+GLASGV G+AGLGR++++L
Sbjct: 129 ATSGELASDIVQVQSSNGKNPGRNVTDKDFLFVCGSTFLLEGLASGVKGMAGLGRTRISL 188
Query: 177 PSQLAAAFSLKRKFALCLSPFDD--GAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNT 233
PSQ +A FS RKFA+CLS + G ++FGDGPY L N + S N YTPLFIN V+T
Sbjct: 189 PSQFSAEFSFPRKFAVCLSSSTNSKGVVLFGDGPYSFLPNREFSNNDFSYTPLFINPVST 248
Query: 234 ASGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
AS F GEPS EYFIGV S+ +N K VP+N TLLSIDN+GVGGTKISTVNPYT+LETS+Y
Sbjct: 249 ASAFSSGEPSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSMY 308
Query: 293 KALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQ 352
A+ F + + RVA VAPFGACF + I TR+GP VPQIDLVLQN+NV W+I G
Sbjct: 309 NAVTNFFVKELVNITRVASVAPFGACFDSRTIVSTRVGPAVPQIDLVLQNENVFWTIFGA 368
Query: 353 NSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
NSMVQ+ + LCLGFVDGG+NPRTSIVIG +E+NLLQFDLA+SRLGF+ S+LF + TC
Sbjct: 369 NSMVQVSENVLCLGFVDGGINPRTSIVIGGYTIEDNLLQFDLASSRLGFTSSILFRQTTC 428
Query: 413 -TFNFTSIA 420
FNFTSIA
Sbjct: 429 ANFNFTSIA 437
>gi|350536487|ref|NP_001234249.1| xyloglucan-specific fungal endoglucanase inhibitor protein
precursor [Solanum lycopersicum]
gi|27372527|gb|AAN87262.1| xyloglucan-specific fungal endoglucanase inhibitor protein
precursor [Solanum lycopersicum]
Length = 438
Score = 509 bits (1310), Expect = e-141, Method: Compositional matrix adjust.
Identities = 256/428 (59%), Positives = 316/428 (73%), Gaps = 16/428 (3%)
Query: 8 LLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGN 67
+LLCSLL + + QTSFRPK L++ V KD +TLQY+TQI+QRTPLVP+ LT+ LGG
Sbjct: 10 ILLCSLLFITSTIAQNQTSFRPKGLIIPVTKDASTLQYLTQIQQRTPLVPISLTLDLGGQ 69
Query: 68 ILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNT 116
LWVDC++GYVSS+ K ARCGSAQC+L A CG CG PDN ++ T
Sbjct: 70 FLWVDCDQGYVSSSYKPARCGSAQCSLGGASGCGECFSPPRPGCNNNTCGLLPDNTVTGT 129
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T G++ DV+S++S++G+NPGR+V+ NF+F+CG+ F+LQGLASGV G+AGLGR+K++L
Sbjct: 130 ATSGELASDVVSVESSNGKNPGRSVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTKISL 189
Query: 177 PSQLAAAFSLKRKFALCL--SPFDDGAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNT 233
PSQ +A FS RKFALCL S G ++FGDGPY+ L N S N +YTPLFIN V+T
Sbjct: 190 PSQFSAEFSFPRKFALCLTSSSNSKGVVLFGDGPYFFLPNRQFSNNDFQYTPLFINPVST 249
Query: 234 ASGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
AS F G+PS EYFIGV S+ +N K VP+N TLLSIDN+GVGGTKISTVNPYT+LETS+Y
Sbjct: 250 ASAFSSGQPSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSLY 309
Query: 293 KALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQ 352
A+ F + V RVA VAPF CF +DIG TR+GP VP IDLVLQN NVVW+I G
Sbjct: 310 NAITNFFVKELANVTRVAVVAPFRVCFDSRDIGSTRVGPAVPSIDLVLQNANVVWTIFGA 369
Query: 353 NSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
NSMVQ+ + LCLG +DGGVN RTSIVIG +E+NLLQFD A SRLGF+ S+LF + TC
Sbjct: 370 NSMVQVSENVLCLGVLDGGVNARTSIVIGGHTIEDNLLQFDHAASRLGFTSSILFRQTTC 429
Query: 413 -TFNFTSI 419
FNFTSI
Sbjct: 430 DNFNFTSI 437
>gi|350536203|ref|NP_001234746.1| xyloglucan-specific fungal endoglucanase inhibitor protein
precursor [Solanum lycopersicum]
gi|68449754|gb|AAY97864.1| xyloglucan-specific fungal endoglucanase inhibitor protein
precursor [Solanum lycopersicum]
Length = 438
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 255/429 (59%), Positives = 315/429 (73%), Gaps = 16/429 (3%)
Query: 8 LLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGN 67
+LLCSLL + + QTSFRPK L++ V KD +TLQY+TQI+QRTPLVP+ LT+ LGG
Sbjct: 10 ILLCSLLFITSTIAQNQTSFRPKGLIIPVTKDASTLQYLTQIQQRTPLVPISLTLDLGGQ 69
Query: 68 ILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNT 116
LWVDC++GYVSS+ K ARCGSAQC+L A CG CG PDN ++ T
Sbjct: 70 FLWVDCDQGYVSSSYKPARCGSAQCSLGGASGCGECFSPPRPGCDNNTCGLLPDNTVTGT 129
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T G++ DV+S++S++G+NPGR+V+ NF+F+CG+ F+LQGLASGV G+AGLGR+K++L
Sbjct: 130 ATSGELASDVVSVESSNGKNPGRSVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTKISL 189
Query: 177 PSQLAAAFSLKRKFALCL--SPFDDGAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNT 233
PSQ +A FS RK ALCL S G ++FGDGPY+ L N S N +YTPLFIN V+T
Sbjct: 190 PSQFSAEFSFPRKSALCLTSSSNSKGVVLFGDGPYFFLPNRQFSNNDFQYTPLFINPVST 249
Query: 234 ASGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
AS F G+PS EYFIGV S+ +N K VP+N TLLSIDN+GVGGTKISTVNPYT+LETS+Y
Sbjct: 250 ASAFSSGQPSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSLY 309
Query: 293 KALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQ 352
A+ F + V RVA VAPF CF +DIG TR+GP VP IDLVLQN NVVW+I G
Sbjct: 310 NAITNFFVKELANVTRVAVVAPFRVCFDSRDIGSTRVGPAVPSIDLVLQNANVVWTIFGA 369
Query: 353 NSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
NSMVQ+ + LCLG +DGGVN TSIVIG +E+NLLQFD A SRLGF+ S+LF + TC
Sbjct: 370 NSMVQVSENVLCLGVLDGGVNAGTSIVIGGHTIEDNLLQFDHAASRLGFTSSILFRQTTC 429
Query: 413 -TFNFTSIA 420
FNFTSIA
Sbjct: 430 ANFNFTSIA 438
>gi|343161843|dbj|BAK57511.1| extracellular dermal glycoprotein [Nicotiana benthamiana]
Length = 440
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 253/429 (58%), Positives = 314/429 (73%), Gaps = 16/429 (3%)
Query: 8 LLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGN 67
+LLCSLL + + QTSFRPK L+L + KD +T QY+TQI+QRTPLVPV LT+ LGG
Sbjct: 12 ILLCSLLFITSTTAQNQTSFRPKGLILPITKDASTFQYLTQIQQRTPLVPVSLTLDLGGQ 71
Query: 68 ILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNT 116
LWVDC++GYVSS+ K ARC SAQC+LA A CG CG PDN ++ T
Sbjct: 72 FLWVDCDQGYVSSSYKPARCRSAQCSLARAGGCGQCFSPPKPGCNNDTCGLIPDNTVTQT 131
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T G++ D + +QS++G+NPGR V +F+F+CGS F+L+ LASGV G+AGLGR++++L
Sbjct: 132 ATSGELASDTVQVQSSNGKNPGRNVVDKDFLFVCGSTFLLKRLASGVKGMAGLGRTRISL 191
Query: 177 PSQLAAAFSLKRKFALCLSPF--DDGAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNT 233
PSQ +A FS RKFA+CLS G ++FGDGPY L N + + + YTPLFIN V+T
Sbjct: 192 PSQFSAEFSFPRKFAVCLSSSTKSKGVVLFGDGPYSFLPNREFANDDFSYTPLFINPVST 251
Query: 234 ASGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
AS F GEPS EYFIGV S+ +N K V +N TLLSIDN+GVGGTKISTVNPYT+LETSIY
Sbjct: 252 ASAFSSGEPSSEYFIGVKSIKINQKVVSINTTLLSIDNQGVGGTKISTVNPYTILETSIY 311
Query: 293 KALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQ 352
A+ F + + RVA VAPFGACF ++I TR+GP VP IDLVLQN+NV W+I G
Sbjct: 312 NAVTNFFVKELVNITRVASVAPFGACFDSRNIVSTRVGPTVPPIDLVLQNENVFWTIFGA 371
Query: 353 NSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
NSMVQ+ + LCLGFVDGGVNPRTSIVIG +E+NLLQFDLA+SRLGF+ S+LF + TC
Sbjct: 372 NSMVQVSENVLCLGFVDGGVNPRTSIVIGGYTIEDNLLQFDLASSRLGFTSSILFRQTTC 431
Query: 413 -TFNFTSIA 420
FNFTSIA
Sbjct: 432 ANFNFTSIA 440
>gi|225436984|ref|XP_002272235.1| PREDICTED: basic 7S globulin [Vitis vinifera]
Length = 436
Score = 497 bits (1279), Expect = e-138, Method: Compositional matrix adjust.
Identities = 251/412 (60%), Positives = 300/412 (72%), Gaps = 17/412 (4%)
Query: 24 QTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNK 83
+TSFRP ALV+ V+KD +TLQY+T I QRTPLVPVKL V LG LWVDCE+ YVSS+ +
Sbjct: 23 KTSFRPDALVIPVSKDASTLQYLTTINQRTPLVPVKLVVDLGAQFLWVDCEQNYVSSSYR 82
Query: 84 TARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHGDIRIDVLSIQST 132
ARC SAQC+LA A CG CG PDN ++ T T G++ D +S+QST
Sbjct: 83 PARCRSAQCSLARANGCGDCFSAPRPGCNNNTCGVLPDNTVTRTATSGELAEDFVSVQST 142
Query: 133 DGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
DG NPGR V+V F+F C F+L+GLAS +G+AGLGR+++A PSQ A+AFS RKFA
Sbjct: 143 DGSNPGRVVSVSKFLFSCAPTFLLEGLASSAMGMAGLGRTRIAFPSQFASAFSFHRKFAT 202
Query: 193 CLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFL-GEPSVEYFIGV 249
CLS +G + FGDGPY L N D S++L YTPL+IN V+TAS + GEPS EYFI V
Sbjct: 203 CLSSSTTANGVVFFGDGPYRLLPNIDASQSLIYTPLYINPVSTASAYTQGEPSAEYFIRV 262
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKVA 307
S+ +N KA+ LN +LLSID+EGVGGTKISTVNPYTV+ETSIYKA +AF A+A +
Sbjct: 263 KSIRINEKAISLNTSLLSIDSEGVGGTKISTVNPYTVMETSIYKAFTKAFISAAAAINIT 322
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF 367
RVA VAPF CF +++ TR+GP VP IDLVLQN++V W I G NSMV + D LCLGF
Sbjct: 323 RVAAVAPFNVCFSSKNVYSTRVGPSVPSIDLVLQNESVFWRIFGANSMVYVSDDVLCLGF 382
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTS 418
VDGG NPRTSIVIG QLE+NLLQFDLATSRLGFS SLLF R TC FNFTS
Sbjct: 383 VDGGANPRTSIVIGGYQLEDNLLQFDLATSRLGFSSSLLFRRTTCANFNFTS 434
>gi|147857949|emb|CAN80378.1| hypothetical protein VITISV_038701 [Vitis vinifera]
Length = 436
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 250/412 (60%), Positives = 299/412 (72%), Gaps = 17/412 (4%)
Query: 24 QTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNK 83
+TSFRP ALV+ V+KD +TLQY+T I QRTPLVPVKL V LG LWVDCE+ YVSS+ +
Sbjct: 23 KTSFRPDALVIPVSKDASTLQYLTTINQRTPLVPVKLVVDLGAQFLWVDCEQNYVSSSYR 82
Query: 84 TARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHGDIRIDVLSIQST 132
ARC SAQC+LA A CG CG PDN ++ T T G++ D +S+QST
Sbjct: 83 PARCRSAQCSLARANGCGDCFSAPRPGCNNNTCGVLPDNTVTRTATSGELAEDFVSVQST 142
Query: 133 DGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
DG NPGR V+V F+F C F+L+GLAS +G+AGLGR+++A PSQ A+AFS RKFA
Sbjct: 143 DGSNPGRVVSVSKFLFSCAPTFLLEGLASSAMGMAGLGRTRIAFPSQFASAFSFHRKFAT 202
Query: 193 CLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFL-GEPSVEYFIGV 249
CLS +G + FGDGPY L N D S++L YTPL+IN V+TAS + GEPS EYFI V
Sbjct: 203 CLSSSTTANGVVFFGDGPYRLLPNIDASQSLIYTPLYINPVSTASAYTQGEPSAEYFIRV 262
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKVA 307
S+ +N KA+ LN +LLSID+EGVGGTKISTVNPYTV+ETSIYK +AF A+A +
Sbjct: 263 KSIRINEKAISLNTSLLSIDSEGVGGTKISTVNPYTVMETSIYKXFTKAFISAAAAINIT 322
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF 367
RVA VAPF CF +++ TR+GP VP IDLVLQN++V W I G NSMV + D LCLGF
Sbjct: 323 RVAAVAPFNVCFSSKNVYSTRVGPSVPSIDLVLQNESVFWRIFGANSMVYVSDDVLCLGF 382
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTS 418
VDGG NPRTSIVIG QLE+NLLQFDLATSRLGFS SLLF R TC FNFTS
Sbjct: 383 VDGGANPRTSIVIGGYQLEDNLLQFDLATSRLGFSSSLLFRRTTCANFNFTS 434
>gi|224090425|ref|XP_002308984.1| predicted protein [Populus trichocarpa]
gi|222854960|gb|EEE92507.1| predicted protein [Populus trichocarpa]
Length = 416
Score = 494 bits (1271), Expect = e-137, Method: Compositional matrix adjust.
Identities = 254/416 (61%), Positives = 309/416 (74%), Gaps = 10/416 (2%)
Query: 13 LLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVD 72
L + S AQT+FRPKALVL V+KD ++LQY+ QI QRTPLVPV++T+ LGG LWVD
Sbjct: 1 LFLISSPHSIAQTTFRPKALVLPVSKDPSSLQYLAQINQRTPLVPVEVTLDLGGQYLWVD 60
Query: 73 CEKGYVSSTNKTARCGSAQCNLA--NAKAC--GGGICGAGPDNPISNTGTHGDIRIDVLS 128
C++GYVSS+ K C +AQC+LA K C C PDN + TGT + DV+S
Sbjct: 61 CQQGYVSSSKKNPSCNTAQCSLAVYRLKTCTVDKKFCVLSPDNTATRTGTSDYLTQDVVS 120
Query: 129 IQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKR 188
IQSTDG NPGR V+VPNF+F C F+LQGLA GV G+AGLGR+K++LPSQ +AAFS +
Sbjct: 121 IQSTDGSNPGRVVSVPNFLFSCAPTFILQGLAKGVKGMAGLGRTKISLPSQFSAAFSFPK 180
Query: 189 KFALCLSPFD-DGAIVFGDGPYYDLNNFD-VSKNLKYTPLFINKVNTASGFL-GEPSVEY 245
KFA+CL+ + G ++FGDGPY L + D +S++L YTPL +N V+TASG+ GEPS +Y
Sbjct: 181 KFAICLTSSNAKGVVIFGDGPYVLLPHADDLSQSLIYTPLILNPVSTASGYFEGEPSTDY 240
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
FIGV S+ +N VPLN +LLSI+ EG GGTKISTVN YTV+ET+IY A+ +F + K
Sbjct: 241 FIGVKSIKINENVVPLNASLLSINREGYGGTKISTVNAYTVMETTIYNAVTDSFVRELAK 300
Query: 306 --VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL 363
V RVA VAPFGACF ++IG TR+GP VPQIDLVLQ+KNV W I G NSMVQ+ D L
Sbjct: 301 ANVPRVASVAPFGACFNSKNIGSTRVGPAVPQIDLVLQSKNVYWRIFGANSMVQVKDDVL 360
Query: 364 CLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTS 418
CLGFVDGGVNPRTSIVIG QLE+NLLQFDLA SRLGFS SLLF + TC FNFTS
Sbjct: 361 CLGFVDGGVNPRTSIVIGGHQLEDNLLQFDLAASRLGFSSSLLFRQTTCANFNFTS 416
>gi|222822566|gb|ACM68432.1| xyloglucanase-specific endoglucanase inhibitor protein [Petunia x
hybrida]
Length = 436
Score = 489 bits (1260), Expect = e-136, Method: Compositional matrix adjust.
Identities = 252/429 (58%), Positives = 313/429 (72%), Gaps = 17/429 (3%)
Query: 8 LLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGN 67
+LL SLL S + + AQTSFRPK L+L V KD +TLQY+TQI QRTPLVPV LT+ LGG
Sbjct: 9 ILLFSLLFISSTIVHAQTSFRPKGLILPVTKDASTLQYLTQISQRTPLVPVSLTLDLGGQ 68
Query: 68 ILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNT 116
LWVDC++GYVSS+ ARC SA+C+LA + CG CGA PDN I+ T
Sbjct: 69 FLWVDCDQGYVSSSYIPARCRSAKCSLAGSSGCGDCFSPPSPGCNNNTCGAFPDNSITRT 128
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T G++ D++S+QS++G+NPGR V+ +F+F+CG+ F+L GLASGV G+AGLGR++++L
Sbjct: 129 ATSGELASDIVSVQSSNGKNPGRNVSDKDFLFVCGATFLLNGLASGVKGMAGLGRTRISL 188
Query: 177 PSQLAAAFSLKRKFALCLSPFDD--GAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNT 233
PSQ +A FS RKFA+CLS + G ++FGDGPY L N + S + YTPLFIN V+T
Sbjct: 189 PSQFSAEFSFPRKFAVCLSSTSNSKGVVLFGDGPYSFLPNREYSSDDFSYTPLFINPVST 248
Query: 234 ASGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
AS F G PS EYFIGV S+ +N K VP+N TLLSID++GVGGTKISTVNPYT+LETSIY
Sbjct: 249 ASAFSSGTPSSEYFIGVKSIKINEKVVPINTTLLSIDSQGVGGTKISTVNPYTILETSIY 308
Query: 293 KALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQ 352
A+ F + + V VAPFG CF ++I TR+GP VP IDLVLQN+NV W I G
Sbjct: 309 NAVTNFFVKEL-AIPTVPSVAPFGVCFDSRNITSTRVGPGVPSIDLVLQNENVFWRIFGA 367
Query: 353 NSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
NSMV + + LCLGFVDGGVNPRTSIVIG +E+NLLQFDLA SRLGF+ S+LF + TC
Sbjct: 368 NSMVLVSENVLCLGFVDGGVNPRTSIVIGGHTIEDNLLQFDLAASRLGFTSSILFRQTTC 427
Query: 413 -TFNFTSIA 420
FNFTSIA
Sbjct: 428 ANFNFTSIA 436
>gi|291002744|gb|ADD71504.1| xyloglucanase inhibitor 2 [Humulus lupulus]
Length = 445
Score = 489 bits (1258), Expect = e-135, Method: Compositional matrix adjust.
Identities = 268/447 (59%), Positives = 311/447 (69%), Gaps = 29/447 (6%)
Query: 1 MSVAQSFLLLCSLLI-FSLSPITAQT-SFRPKALVLRVAKDTATLQYVTQIKQRTPLVPV 58
M+ F+L CSLL ++P A+T SFRPKAL+L V KD +T QY+TQI QRTPLVPV
Sbjct: 1 MASFTHFVLFCSLLFPILITPTIAETPSFRPKALLLPVTKDASTKQYLTQINQRTPLVPV 60
Query: 59 KLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGA 107
KLTV+LGG LWVDCEKGYVSST K ARC SAQCNLA +K+CG CG
Sbjct: 61 KLTVNLGGEFLWVDCEKGYVSSTYKPARCRSAQCNLAGSKSCGECFDGPKPGCNNNTCGL 120
Query: 108 GPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIA 167
P NP T T G++ D++SIQST+G NP + V+ PN IF CGS F+L+GLASGV GIA
Sbjct: 121 FPYNPFIRTSTSGELAQDIISIQSTNGSNPSKVVSFPNVIFTCGSTFLLEGLASGVTGIA 180
Query: 168 GLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTP 225
GLGR K+ALPSQ AAAFS KRKFALCLS G + FGDGPY L N DVS+NL YTP
Sbjct: 181 GLGRKKIALPSQFAAAFSFKRKFALCLSSSTRATGVVFFGDGPYIMLPNKDVSQNLIYTP 240
Query: 226 LFINKVNTA-SGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPY 284
L +N V+TA + F GEPS +YFIGV + VNG+ V LN +LLSI +G GGTKIST PY
Sbjct: 241 LILNPVSTAGASFEGEPSADYFIGVKGIKVNGEDVKLNTSLLSIAKDGTGGTKISTTQPY 300
Query: 285 TVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKN 344
T LETSIYKA++ AF A+ KV RV VAPF CF TR+GP VPQIDLVL N N
Sbjct: 301 TSLETSIYKAVIGAFGKAVAKVPRVTAVAPFELCFNSTSFSSTRVGPGVPQIDLVLPN-N 359
Query: 345 VVWSIHGQNSMVQIGGDALCLGFVDGG----------VNPRTSIVIGARQLENNLLQFDL 394
W+I G NSMVQ+ D LCLGFVDGG P T+IVIG Q+E+NLLQFDL
Sbjct: 360 KAWTIFGANSMVQVSDDVLCLGFVDGGPLHFVDWGIPFTP-TAIVIGGHQIEDNLLQFDL 418
Query: 395 ATSRLGFSDSLLFERATCT-FNFTSIA 420
+S LGFS SLLF + TC+ FNFTSIA
Sbjct: 419 GSSTLGFSSSLLFRQTTCSNFNFTSIA 445
>gi|255552239|ref|XP_002517164.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543799|gb|EEF45327.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 433
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 258/435 (59%), Positives = 308/435 (70%), Gaps = 17/435 (3%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKL 60
M+++++ ++LCSLL F +SP +QTSFRPKAL+L V KD +TL Y TQ QRTPLVPV
Sbjct: 1 MALSRNLIILCSLLFF-ISPCISQTSFRPKALLLPVTKDPSTLLYFTQFNQRTPLVPVHT 59
Query: 61 TVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKAC------------GGGICGAG 108
+ LGG LWVDC++GYVSST + ARC SAQCNLANA C C
Sbjct: 60 ILDLGGLYLWVDCDRGYVSSTYRPARCNSAQCNLANANGCITACFDAPRPGCNNNTCALL 119
Query: 109 PDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAG 168
DN ++N GT G++ DV+S+QSTDG NPGR V+V NF+F+C F+L GL SG G+AG
Sbjct: 120 VDNTVTNIGTDGELGQDVVSLQSTDGSNPGRVVSVSNFLFVCAPSFILNGLPSGTEGMAG 179
Query: 169 LGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFI 228
LGR+KV+LPSQ AAAFS RKFA+CLS G + FG PY N DVSK L YTPL I
Sbjct: 180 LGRTKVSLPSQFAAAFSFNRKFAICLSS-SKGVVFFGKEPYIIQPNIDVSKILTYTPLII 238
Query: 229 NKVNTASGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTV 286
N V+TA+ F+ G+PS +YFIGV S+++NGK VPLN TLLSI+++ G GGT ISTV PYTV
Sbjct: 239 NPVSTAAAFVQGDPSSDYFIGVKSININGKPVPLNTTLLSINSQTGFGGTMISTVVPYTV 298
Query: 287 LETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVV 346
+ET+IY A V AF + V RVA VAPFGACF I TR+G VP IDLVLQ+ NV
Sbjct: 299 METTIYNAFVNAFVKELVDVPRVASVAPFGACFDASKIVGTRLGAAVPSIDLVLQSSNVF 358
Query: 347 WSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLL 406
W I G NSMVQ+ D LCLGFVDGG NPRTSIVIG QLE+NLLQFDLATSRLGFS SL
Sbjct: 359 WRIVGANSMVQVNEDVLCLGFVDGGENPRTSIVIGGHQLEDNLLQFDLATSRLGFSSSLF 418
Query: 407 FERATC-TFNFTSIA 420
+ TC F+FT A
Sbjct: 419 SRQTTCANFDFTPKA 433
>gi|224066523|ref|XP_002302122.1| predicted protein [Populus trichocarpa]
gi|222843848|gb|EEE81395.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 483 bits (1244), Expect = e-134, Method: Compositional matrix adjust.
Identities = 255/420 (60%), Positives = 302/420 (71%), Gaps = 18/420 (4%)
Query: 19 SPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV 78
SP AQ SFRPKALV+ V KD+ATLQYVTQIKQRTP VP+ L V LGG LWVDC+K YV
Sbjct: 19 SPSIAQQSFRPKALVVPVTKDSATLQYVTQIKQRTPQVPINLVVDLGGQFLWVDCDKNYV 78
Query: 79 SSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHGDIRIDVL 127
SST + ARCGSA C+LA A CG CG PDN ++ T T G++ DV+
Sbjct: 79 SSTYRPARCGSALCSLARAGGCGDCFSGPRPGCNNNTCGVIPDNTVTRTATGGELATDVV 138
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
S+ ST+G NPGR +VP F+F C F+LQGLASGVVG+AGLGR+++A PSQ A+AFS
Sbjct: 139 SVNSTNGSNPGREASVPRFLFSCAPTFLLQGLASGVVGMAGLGRTRIAFPSQFASAFSFN 198
Query: 188 RKFALCL-SPFD-DGAIVFGDGPYYDLNNFDV-SKNLKYTPLFINKVNTASGF-LGEPSV 243
RKFA+CL SP G I+FGDGPY L N + S++L +TPLFIN V+TAS F GEPS
Sbjct: 199 RKFAICLTSPAPAKGVIIFGDGPYNFLPNIQLTSQSLSFTPLFINPVSTASAFSQGEPSA 258
Query: 244 EYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--AS 301
EYFIGV S+ ++ K VPLN TLLSID++G GGTKISTVNPYTVLE+SI+ A+ +AF S
Sbjct: 259 EYFIGVKSIRISDKTVPLNATLLSIDSQGKGGTKISTVNPYTVLESSIFNAVTRAFINES 318
Query: 302 AMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD 361
A + RVA VAPF CF +I TR+G VP I LVLQN+NV+W I G NSMVQ+ +
Sbjct: 319 AARNITRVASVAPFDVCFSSDNIFSTRLGAAVPTISLVLQNENVIWRIFGANSMVQVSDN 378
Query: 362 ALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTSIA 420
LCLGFV+GG NP TSIVIG QLE+NL QFDLA SRLGFS L + TC FNFTSIA
Sbjct: 379 VLCLGFVNGGSNPTTSIVIGGYQLEDNLFQFDLAASRLGFSSLLFGRQTTCANFNFTSIA 438
>gi|356500210|ref|XP_003518926.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 435
Score = 480 bits (1236), Expect = e-133, Method: Compositional matrix adjust.
Identities = 252/428 (58%), Positives = 309/428 (72%), Gaps = 18/428 (4%)
Query: 10 LCSLLIFSL-SPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNI 68
L +L +FSL +P AQ SFRP+ALV+ V KD +TLQY+TQIKQRTPLVP L + +GG
Sbjct: 9 LFTLFLFSLIAPSLAQQSFRPRALVVPVKKDASTLQYITQIKQRTPLVPENLVLDIGGQF 68
Query: 69 LWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTG 117
LWVDC+ YVSST + ARCGSAQC+LA + +CG CG PDN ++ T
Sbjct: 69 LWVDCDNNYVSSTYRPARCGSAQCSLARSDSCGNCFSAPKPGCNNNTCGVTPDNTVTGTA 128
Query: 118 THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALP 177
T G++ DV+S+QST+G NP + TV F+F C F+LQGLA+GV G+AGLGR+++ALP
Sbjct: 129 TSGELAQDVVSLQSTNGFNPIQNATVSRFLFSCAPTFLLQGLATGVSGMAGLGRTRIALP 188
Query: 178 SQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF 237
SQLA+AFS +RKFA+CLS +G FGDGPY L N D S+ L +TPL IN V+TAS F
Sbjct: 189 SQLASAFSFRRKFAVCLSS-SNGVAFFGDGPYVLLPNVDASQLLTFTPLLINPVSTASAF 247
Query: 238 -LGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALV 296
GEPS EYFIGV S+ ++ K VPLN TLLSI+++GVGGTKIS+VNPYTVLE SI+KA+
Sbjct: 248 SQGEPSAEYFIGVKSIKIDEKTVPLNTTLLSINSKGVGGTKISSVNPYTVLEDSIFKAVT 307
Query: 297 QAF--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNS 354
+AF AS+ + RVA VAPF CF +++ TR+G VP I+LVLQN+ VW I G NS
Sbjct: 308 EAFVKASSARNITRVASVAPFEVCFSRENVLATRLGAAVPTIELVLQNQKTVWRIFGANS 367
Query: 355 MVQIGGD-ALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC- 412
MV + D LCLGFV+GG NPRTSIVIG QLE+NLLQFDLATSRLGFS L R TC
Sbjct: 368 MVSVSDDKVLCLGFVNGGENPRTSIVIGGYQLEDNLLQFDLATSRLGFSSLLYGSRTTCA 427
Query: 413 TFNFTSIA 420
FNFTS A
Sbjct: 428 NFNFTSAA 435
>gi|449527083|ref|XP_004170542.1| PREDICTED: LOW QUALITY PROTEIN: basic 7S globulin-like [Cucumis
sativus]
Length = 432
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 246/435 (56%), Positives = 306/435 (70%), Gaps = 18/435 (4%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKL 60
M+ + SF L+FS+S A TSFRPK+L+L V K + QY+TQI+QRTPLVPVKL
Sbjct: 1 MAASTSFSFSILFLLFSIS--FAATSFRPKSLLLPVTKHPSG-QYITQIRQRTPLVPVKL 57
Query: 61 TVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGP 109
TV LGG +WVDC++GYVSS+ K RC SAQC+L+ + +CG CG P
Sbjct: 58 TVDLGGQFMWVDCDRGYVSSSYKPVRCRSAQCSLSKSTSCGDCFSPPXPGCNNNTCGHFP 117
Query: 110 DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGL 169
N I T G++ DVLS+ ST+G NP RAV++PNF+F+CG F+L+GLA GV G+AG
Sbjct: 118 GNTIIQLSTSGEVTSDVLSVSSTNGFNPTRAVSIPNFLFVCGPTFLLEGLAGGVSGMAGF 177
Query: 170 GRSKVALPSQLAAAFSLKRKFALCLS--PFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLF 227
GR+ ++LPSQ +AAFS RKFA+CLS G I G+GPY+ L N DV+K+L YTPLF
Sbjct: 178 GRTGISLPSQFSAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYHFLQNVDVTKSLTYTPLF 237
Query: 228 INKVNTAS-GFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTV 286
IN V+TA GE S EYFIGV S+ N K VP+N TLL ID+ G GGTKISTV+PYTV
Sbjct: 238 INPVSTAGVSTSGEKSSEYFIGVKSIVFNSKTVPINTTLLKIDSNGNGGTKISTVHPYTV 297
Query: 287 LETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVV 346
LE+SIY ALV+ + + RVA VAPFG C++ + G TR+GP +P IDL+LQNK V+
Sbjct: 298 LESSIYNALVKTITRELRNIPRVAAVAPFGVCYKSKSFGSTRLGPGMPSIDLILQNKKVI 357
Query: 347 WSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLL 406
W I G NSMVQ+ + LCLGFVDGGV RT+IVIGA Q+E+NLL+FDLATSRLGFS +LL
Sbjct: 358 WRIFGANSMVQVNEEVLCLGFVDGGVEARTAIVIGAYQMEDNLLEFDLATSRLGFSSTLL 417
Query: 407 FERATC-TFNFTSIA 420
TC FNFTS A
Sbjct: 418 GRMTTCANFNFTSTA 432
>gi|449432733|ref|XP_004134153.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 432
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 246/435 (56%), Positives = 306/435 (70%), Gaps = 18/435 (4%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKL 60
M+ + SF L+FS+S A TSFRPK+L+L V K + QY+TQI+QRTPLVPVKL
Sbjct: 1 MAASTSFSFSILFLLFSIS--FAATSFRPKSLLLPVTKHPSG-QYITQIRQRTPLVPVKL 57
Query: 61 TVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGP 109
TV LGG +WVDC++GYVSS+ K RC SAQC+L+ + +CG CG P
Sbjct: 58 TVDLGGQFMWVDCDRGYVSSSYKPVRCRSAQCSLSKSTSCGDCFSPPRPGCNNNTCGHFP 117
Query: 110 DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGL 169
N I T G++ DVLS+ ST+G NP RAV++PNF+F+CG F+L+GLA GV G+AG
Sbjct: 118 GNTIIQLSTSGEVTSDVLSVSSTNGFNPTRAVSIPNFLFVCGPTFLLEGLAGGVSGMAGF 177
Query: 170 GRSKVALPSQLAAAFSLKRKFALCLS--PFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLF 227
GR+ ++LPSQ +AAFS RKFA+CLS G I G+GPY+ L N DV+K+L YTPLF
Sbjct: 178 GRTGISLPSQFSAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYHFLQNVDVTKSLTYTPLF 237
Query: 228 INKVNTAS-GFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTV 286
IN V+TA GE S EYFIGV S+ N K VP+N TLL ID+ G GGTKISTV+PYTV
Sbjct: 238 INPVSTAGVSTSGEKSSEYFIGVKSIVFNSKTVPINTTLLKIDSNGNGGTKISTVHPYTV 297
Query: 287 LETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVV 346
LE+SIY ALV+ + + RVA VAPFG C++ + G TR+GP +P IDL+LQNK V+
Sbjct: 298 LESSIYNALVKTITRELRNIPRVAAVAPFGVCYKSKSFGSTRLGPGMPSIDLILQNKKVI 357
Query: 347 WSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLL 406
W I G NSMVQ+ + LCLGFVDGGV RT+IVIGA Q+E+NLL+FDLATSRLGFS +LL
Sbjct: 358 WRIFGANSMVQVNEEVLCLGFVDGGVEARTAIVIGAYQMEDNLLEFDLATSRLGFSSTLL 417
Query: 407 FERATC-TFNFTSIA 420
TC FNFTS A
Sbjct: 418 GRMTTCANFNFTSTA 432
>gi|356576537|ref|XP_003556387.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 438
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 257/421 (61%), Positives = 302/421 (71%), Gaps = 27/421 (6%)
Query: 23 AQTSFRPKALVLRVAKDTATL--QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSS 80
AQTSFRPKALVL V KD + QYVTQIKQRTPLV VKLTV LGG LWV+CEKGYVSS
Sbjct: 22 AQTSFRPKALVLPVTKDVSASVPQYVTQIKQRTPLVAVKLTVDLGGGYLWVNCEKGYVSS 81
Query: 81 TNKTARCGSAQCNLANAKACG--GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPG 138
T++ ARCGSAQC+L C ICG P N ++ T+GDI DV+++ STDG NP
Sbjct: 82 TSRPARCGSAQCSLFGLYGCSTEDKICGRSPSNTVTGVSTYGDIHADVVAVNSTDGNNPT 141
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP-- 196
+ V+VP F+F+CGS V +GLASGV G+AGLGR+KV+LPSQ A+AFS RKFA+CLS
Sbjct: 142 KVVSVPKFLFICGSNVVQKGLASGVTGMAGLGRTKVSLPSQFASAFSFHRKFAICLSSST 201
Query: 197 FDDGAIVFGDGPY-YDLNNFDVSKNLKYTPLFINKVNTA-SGFLGEPSVEYFIGVTSVHV 254
+G + FGDGPY + N D+SK L +TPL N V+TA S F GEPSVEYFIGV S+ V
Sbjct: 202 MTNGVMFFGDGPYNFGYLNSDLSKVLTFTPLISNPVSTAPSYFQGEPSVEYFIGVKSIKV 261
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
+ K V LN TLLSID G+GGTKISTVNPYTV+ET+IYKA+ + F + VAPVAP
Sbjct: 262 SDKNVALNTTLLSIDRNGIGGTKISTVNPYTVMETTIYKAVSEVFVKEV-GAPTVAPVAP 320
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD----- 369
FG CF +DIG TR+GP VP IDLVLQN +VVW+I G NSMV + D +CLGFVD
Sbjct: 321 FGTCFATKDIGSTRMGPAVPGIDLVLQN-DVVWTIIGANSMVYV-NDVICLGFVDAGSSP 378
Query: 370 ---------GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTSI 419
GG +PRTSI IGA QLENNLLQFDLATSRLGF S+ F+ + C FNFTS
Sbjct: 379 SVAQVGFVAGGSHPRTSITIGAHQLENNLLQFDLATSRLGFR-SIFFDHSNCANFNFTSS 437
Query: 420 A 420
A
Sbjct: 438 A 438
>gi|285741|dbj|BAA03413.1| EDGP precursor [Daucus carota]
Length = 433
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 247/423 (58%), Positives = 300/423 (70%), Gaps = 18/423 (4%)
Query: 14 LIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC 73
L+F + AQ SFRP ALV+ V KD +TLQYVT I QRTPLV L V LGG LWVDC
Sbjct: 11 LLFIFTITQAQPSFRPSALVVPVKKDASTLQYVTTINQRTPLVSENLVVDLGGRFLWVDC 70
Query: 74 EKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHGDI 122
++ YVSST + RC ++QC+L+ + ACG CG P+NP+ NT T G++
Sbjct: 71 DQNYVSSTYRPVRCRTSQCSLSGSIACGDCFNGPRPGCNNNTCGVFPENPVINTATGGEV 130
Query: 123 RIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA 182
DV+S++STDG + GR VTVP FIF C +LQ LASGVVG+AGLGR+++ALPSQ A+
Sbjct: 131 AEDVVSVESTDGSSSGRVVTVPRFIFSCAPTSLLQNLASGVVGMAGLGRTRIALPSQFAS 190
Query: 183 AFSLKRKFALCLS--PFDDGAIVFGDGPYYDLNNFDVS-KNLKYTPLFINKVNT-ASGFL 238
AFS KRKFA+CLS + I+FG+ PY L N VS K L YTPL N V+T A+
Sbjct: 191 AFSFKRKFAMCLSGSTSSNSVIIFGNDPYTFLPNIIVSDKTLTYTPLLTNPVSTSATSTQ 250
Query: 239 GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQA 298
GEPSVEYFIGV S+ +N K V LN +LLSI + G+GGTKIST+NPYTVLETSIYKA+ +A
Sbjct: 251 GEPSVEYFIGVKSIKINSKIVALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEA 310
Query: 299 F--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV 356
F SA + RVA VAPFGACF +I TR+GP VP IDLVLQ+++VVW+I G NSMV
Sbjct: 311 FIKESAARNITRVASVAPFGACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMV 370
Query: 357 QIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFN 415
I + +CLG VDGG N RTSIVIG QLE+NL+QFDLATSR+GFS +LL R TC FN
Sbjct: 371 YINDNVVCLGVVDGGSNLRTSIVIGGHQLEDNLVQFDLATSRVGFSGTLLGSRTTCANFN 430
Query: 416 FTS 418
FTS
Sbjct: 431 FTS 433
>gi|357443039|ref|XP_003591797.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
gi|355480845|gb|AES62048.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
Length = 436
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 255/433 (58%), Positives = 302/433 (69%), Gaps = 27/433 (6%)
Query: 7 FLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTA--TLQYVTQIKQRTPLVPVKLTVHL 64
F+LL L +F +S AQTSFRPKALVL + +DT+ T QY TQIKQRTPLVP+ LT+ L
Sbjct: 8 FILLPFLSLF-ISTSLAQTSFRPKALVLPITRDTSASTPQYTTQIKQRTPLVPINLTIDL 66
Query: 65 GGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG-ICGAGPDNPISNTGTHGDIR 123
GG WV+C+K YVSST K C S+QC+L + C ICG P N ++ T GDI+
Sbjct: 67 GGGYFWVNCDKSYVSSTLKPILCSSSQCSLFGSHGCSDKKICGRSPYNIVTGVSTSGDIQ 126
Query: 124 IDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAA 183
D++S+QST+G GR V+VPNF+F+CGS V GLA GV G+AGLGR+KV+LPSQ ++A
Sbjct: 127 SDIVSVQSTNGNYSGRFVSVPNFLFICGSNVVQNGLAKGVKGMAGLGRTKVSLPSQFSSA 186
Query: 184 FSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTA-SGFLGEPS 242
FS K KFA+CL +G + FGDGPY L NFD SKNL YTPL N V+T+ S FLGE S
Sbjct: 187 FSFKNKFAICLGT-QNGVLFFGDGPY--LFNFDESKNLIYTPLITNPVSTSPSSFLGEKS 243
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
VEYFIGV S+ V+ K V LN TLLSID G GGTKISTVNPYT++ETSIYKA+ AF A
Sbjct: 244 VEYFIGVKSIRVSSKNVKLNTTLLSIDQNGFGGTKISTVNPYTIMETSIYKAVADAFVKA 303
Query: 303 MPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-D 361
+ V+ V PVAPFG CF Q I +R+GP VP IDLVLQN+NVVW+I G N+MV+I D
Sbjct: 304 L-NVSTVEPVAPFGTCFASQSISSSRMGPDVPSIDLVLQNENVVWNIIGANAMVRINDKD 362
Query: 362 ALCLGFVD---------------GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLL 406
+CLGFVD GG P TSI IGA QLENNLLQFDLATSRLGF SL
Sbjct: 363 VICLGFVDAGSDFAKTSQVGFVVGGSKPMTSITIGAHQLENNLLQFDLATSRLGFR-SLF 421
Query: 407 FERATC-TFNFTS 418
E C FNFTS
Sbjct: 422 LEHDNCGNFNFTS 434
>gi|388508700|gb|AFK42416.1| unknown [Lotus japonicus]
Length = 440
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 257/440 (58%), Positives = 306/440 (69%), Gaps = 28/440 (6%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKD-TATL-QYVTQIKQRTPLVPV 58
M+ + + S L ++ AQTSFRPKAL L + KD T++L QY+TQIKQRTPLVPV
Sbjct: 5 MATSLKLFIFSSFLSLMITSSIAQTSFRPKALALPITKDVTSSLPQYITQIKQRTPLVPV 64
Query: 59 KLTVHLGGNILWVDCE-KGYVSSTNKTARCGSAQCNLANAKACGGG-ICGAGPDNPISNT 116
KLT+ LGG LWV+CE + YVSST K ARCGS+QC+L C G ICG P N ++
Sbjct: 65 KLTLDLGGGYLWVNCENRQYVSSTFKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTGV 124
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
++GDI DV+S+ STDG P + V+VPNF+F+CGS+ V GLA GV G+AGLGR++V+L
Sbjct: 125 SSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSL 184
Query: 177 PSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTA 234
PSQ ++AFS RKFA+CL+ DG + FGDGP Y+LN DVSK L YTPL N V+TA
Sbjct: 185 PSQFSSAFSFHRKFAICLTANSGADGVMFFGDGP-YNLNQ-DVSKVLTYTPLITNPVSTA 242
Query: 235 -SGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYK 293
S FLGEPSVEYFIGV SV V+ K VPLN TLLSI+ GVGGTKISTVNPYTV+ET+IYK
Sbjct: 243 PSAFLGEPSVEYFIGVKSVKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYK 302
Query: 294 ALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQN 353
A+ AF ++ V+PVAPFG CF +DI F+RIGP VP IDLVLQN V W I G N
Sbjct: 303 AVADAFVKSL-GAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQN-GVEWPIIGAN 360
Query: 354 SMVQIGGDALCLGFVDGGVNPR--------------TSIVIGARQLENNLLQFDLATSRL 399
SMVQ D +CLGFVD G NP+ TSI IGA QLENNLL+FDLA SRL
Sbjct: 361 SMVQF-DDVICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRL 419
Query: 400 GFSDSLLFERATC-TFNFTS 418
GF SL E C F FTS
Sbjct: 420 GFR-SLFLEHDNCQNFRFTS 438
>gi|384482418|pdb|3VLB|A Chain A, Crystal Structure Of Xeg-Edgp
gi|384482420|pdb|3VLB|C Chain C, Crystal Structure Of Xeg-Edgp
Length = 413
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 244/413 (59%), Positives = 295/413 (71%), Gaps = 18/413 (4%)
Query: 24 QTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNK 83
Q SFRP ALV+ V KD +TLQYVT I QRTPLV L V LGG LWVDC++ YVSST +
Sbjct: 1 QPSFRPSALVVPVKKDASTLQYVTTINQRTPLVSENLVVDLGGRFLWVDCDQNYVSSTYR 60
Query: 84 TARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHGDIRIDVLSIQST 132
RC ++QC+L+ + ACG CG P+NP+ NT T G++ DV+S++ST
Sbjct: 61 PVRCRTSQCSLSGSIACGDCFNGPRPGCNNNTCGVFPENPVINTATGGEVAEDVVSVEST 120
Query: 133 DGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
DG + GR VTVP FIF C +LQ LASGVVG+AGLGR+++ALPSQ A+AFS KRKFA+
Sbjct: 121 DGSSSGRVVTVPRFIFSCAPTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAM 180
Query: 193 CLS--PFDDGAIVFGDGPYYDLNNFDVS-KNLKYTPLFINKVNT-ASGFLGEPSVEYFIG 248
CLS + I+FG+ PY L N VS K L YTPL N V+T A+ GEPSVEYFIG
Sbjct: 181 CLSGSTSSNSVIIFGNDPYTFLPNIIVSDKTLTYTPLLTNPVSTSATSTQGEPSVEYFIG 240
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKV 306
V S+ +N K V LN +LLSI + G+GGTKIST+NPYTVLETSIYKA+ +AF SA +
Sbjct: 241 VKSIKINSKIVALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKESAARNI 300
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
RVA VAPFGACF +I TR+GP VP IDLVLQ+++VVW+I G NSMV I + +CLG
Sbjct: 301 TRVASVAPFGACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYINDNVVCLG 360
Query: 367 FVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTS 418
VDGG N RTSIVIG QLE+NL+QFDLATSR+GFS +LL R TC FNFTS
Sbjct: 361 VVDGGSNLRTSIVIGGHQLEDNLVQFDLATSRVGFSGTLLGSRTTCANFNFTS 413
>gi|357440775|ref|XP_003590665.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
gi|355479713|gb|AES60916.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
Length = 435
Score = 459 bits (1182), Expect = e-127, Method: Compositional matrix adjust.
Identities = 250/426 (58%), Positives = 299/426 (70%), Gaps = 22/426 (5%)
Query: 13 LLIFSLSPITAQTSFRPKALVLRVAKDTATL-QYVTQIKQRTPLVPVKLTVHLGGNILWV 71
LL F +SP +Q SFRPKALVL + KD AT QY QI QRTPLVP+ + V LGG LWV
Sbjct: 14 LLFFFISPTFSQQSFRPKALVLPITKDGATTNQYKAQINQRTPLVPLNVIVDLGGQFLWV 73
Query: 72 DCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHG 120
DCE Y+SST + ARC SAQC+LAN+ CG CG PDN I++T T G
Sbjct: 74 DCENKYISSTYRPARCRSAQCSLANSDGCGDCFSSPKPGCNNNTCGVTPDNSITHTATSG 133
Query: 121 DIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQL 180
++ DVLSIQS++G NPG+ V V F+F C F+L+GLA+G G+AGLGR+K+ALPSQL
Sbjct: 134 ELAEDVLSIQSSNGFNPGQNVVVSRFLFSCAPTFLLKGLATGASGMAGLGRTKIALPSQL 193
Query: 181 AAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNN--FDVSKNLKYTPLFINKVNTASGF- 237
A+AFS RKFA+CLS G ++FGDGPY L N FD S +L YTPL IN V+TAS F
Sbjct: 194 ASAFSFARKFAICLSS-SKGVVLFGDGPYGFLPNVVFD-SDSLTYTPLLINPVSTASAFS 251
Query: 238 LGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQ 297
G+PS EYFIGV ++ ++ K V LN +LLSIDN GVGGTKISTV+PYTVLE SIYKA+
Sbjct: 252 QGQPSAEYFIGVKTIKIDEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTD 311
Query: 298 AF--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSM 355
AF ASA + RV VAPF C+ ++ TR+G VP I+L LQN+NVVW I G NSM
Sbjct: 312 AFVKASAARNIKRVGSVAPFEFCY--TNLTGTRLGAAVPTIELFLQNENVVWRIFGANSM 369
Query: 356 VQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-F 414
V I + LCLGFV+GG N RTSIVIG QLENNLLQFDLA S+LGFS L + TC+ F
Sbjct: 370 VSINDEVLCLGFVNGGKNTRTSIVIGGYQLENNLLQFDLAASKLGFSSLLFGRQTTCSNF 429
Query: 415 NFTSIA 420
NFTS A
Sbjct: 430 NFTSTA 435
>gi|356535355|ref|XP_003536212.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 444
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 255/421 (60%), Positives = 295/421 (70%), Gaps = 27/421 (6%)
Query: 23 AQTSFRPKALVLRVAKDTATL--QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSS 80
AQ SFRPKALVL V KD + QYVTQIKQRTPLVPVKLTV LGG WV+CEKGYVSS
Sbjct: 28 AQPSFRPKALVLPVTKDVSASVPQYVTQIKQRTPLVPVKLTVDLGGGYFWVNCEKGYVSS 87
Query: 81 TNKTARCGSAQCNLANAKACG--GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPG 138
T+K ARCGSAQC+L C IC N ++ T G+I DV++I +TDG NP
Sbjct: 88 TSKPARCGSAQCSLFGLYGCNVEDKICSRSLSNTVTGVSTFGEIHADVVAINATDGNNPV 147
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP-- 196
R V+VP F+F+CG+ V GLASGV G+AGLGR+KV+LPSQ ++AFS RKFA+CLS
Sbjct: 148 RVVSVPKFLFICGANVVQNGLASGVTGMAGLGRTKVSLPSQFSSAFSFLRKFAICLSSST 207
Query: 197 FDDGAIVFGDGPY-YDLNNFDVSKNLKYTPLFINKVNTA-SGFLGEPSVEYFIGVTSVHV 254
+G + FGDGPY + N D+SK L +TPL N V+TA S F GEPSVEYFIGV S+ V
Sbjct: 208 MTNGVMFFGDGPYNFGYLNSDLSKVLTFTPLITNPVSTAPSYFQGEPSVEYFIGVKSIRV 267
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
+ K VPLN TLLSID G+GGTKISTVNPYTVLET+IYKA+ +AF A+ VAPVAP
Sbjct: 268 SDKNVPLNTTLLSIDRNGIGGTKISTVNPYTVLETTIYKAVSEAFVKAV-GAPTVAPVAP 326
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
FG CF +DI TR+GP VP I+LVLQN+ VVWSI G NSMV D +CLGFVD G +P
Sbjct: 327 FGTCFATKDIQSTRMGPAVPDINLVLQNE-VVWSIIGANSMVYT-NDVICLGFVDAGSDP 384
Query: 375 R--------------TSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTSI 419
TSI IGA QLENN+LQFDLATSRLGF SL E A C FNFTS
Sbjct: 385 STAQVGFVVGYSQPITSITIGAHQLENNMLQFDLATSRLGFR-SLFLEHANCANFNFTSS 443
Query: 420 A 420
A
Sbjct: 444 A 444
>gi|384482417|pdb|3VLA|A Chain A, Crystal Structure Of Edgp
Length = 413
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 243/411 (59%), Positives = 294/411 (71%), Gaps = 18/411 (4%)
Query: 26 SFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTA 85
SFRP ALV+ V KD +TLQYVT I QRTPLV L V LGG LWVDC++ YVSST +
Sbjct: 3 SFRPSALVVPVKKDASTLQYVTTINQRTPLVSENLVVDLGGRFLWVDCDQNYVSSTYRPV 62
Query: 86 RCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
RC ++QC+L+ + ACG CG P+NP+ NT T G++ DV+S++STDG
Sbjct: 63 RCRTSQCSLSGSIACGDCFNGPRPGCNNNTCGVFPENPVINTATGGEVAEDVVSVESTDG 122
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
+ GR VTVP FIF C +LQ LASGVVG+AGLGR+++ALPSQ A+AFS KRKFA+CL
Sbjct: 123 SSSGRVVTVPRFIFSCAPTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL 182
Query: 195 S--PFDDGAIVFGDGPYYDLNNFDVS-KNLKYTPLFINKVNT-ASGFLGEPSVEYFIGVT 250
S + I+FG+ PY L N VS K L YTPL N V+T A+ GEPSVEYFIGV
Sbjct: 183 SGSTSSNSVIIFGNDPYTFLPNIIVSDKTLTYTPLLTNPVSTSATSTQGEPSVEYFIGVK 242
Query: 251 SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKVAR 308
S+ +N K V LN +LLSI + G+GGTKIST+NPYTVLETSIYKA+ +AF SA + R
Sbjct: 243 SIKINSKIVALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKESAARNITR 302
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV 368
VA VAPFGACF +I TR+GP VP IDLVLQ+++VVW+I G NSMV I + +CLG V
Sbjct: 303 VASVAPFGACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYINDNVVCLGVV 362
Query: 369 DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTS 418
DGG N RTSIVIG QLE+NL+QFDLATSR+GFS +LL R TC FNFTS
Sbjct: 363 DGGSNLRTSIVIGGHQLEDNLVQFDLATSRVGFSGTLLGSRTTCANFNFTS 413
>gi|147801500|emb|CAN61502.1| hypothetical protein VITISV_011733 [Vitis vinifera]
Length = 415
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 239/402 (59%), Positives = 289/402 (71%), Gaps = 15/402 (3%)
Query: 23 AQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTN 82
AQ+SFRP ALV+ V+KD++TLQYVT I QRTPLVP++L V LGG LWVDCE+ YVSS+
Sbjct: 22 AQSSFRPHALVIPVSKDSSTLQYVTSINQRTPLVPLQLVVDLGGQFLWVDCEQNYVSSS- 80
Query: 83 KTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
R G+ Q C C PDN ++ T + ++ D +S+QSTDG NPGR+V+
Sbjct: 81 --YRPGAVQ------PGCNNNTCSVLPDNTVTRTASSDELAEDAVSVQSTDGSNPGRSVS 132
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DG 200
V F+F C +L+GLASG G+AGLGR+++ALPSQ A+AFS RKFA+CLS DG
Sbjct: 133 VSKFLFSCAPTSLLEGLASGAKGMAGLGRTRIALPSQFASAFSFHRKFAICLSSSTTADG 192
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF-LGEPSVEYFIGVTSVHVNGKAV 259
I+ GDG Y L N D S+ L YTPL +N V+TAS GEPS EYFIGV S+ +N KAV
Sbjct: 193 VILLGDGSYGLLPNVDASQLLIYTPLILNPVSTASAHSQGEPSAEYFIGVKSIQINEKAV 252
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKVARVAPVAPFGA 317
PLN +LLSI+++GVGGTKISTVNPYTV+ETSIY A +AF A+A + RVA VAPF
Sbjct: 253 PLNTSLLSINSKGVGGTKISTVNPYTVMETSIYSAFTKAFISAAASMNITRVAAVAPFSV 312
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS 377
CF +++ TR G VP I LVLQN +VVW I G NSMV + GD LCLGFVDGG NPRTS
Sbjct: 313 CFSSKNVYSTRGGAAVPTIGLVLQNNSVVWRIFGANSMVFVNGDVLCLGFVDGGANPRTS 372
Query: 378 IVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTS 418
IVIG QLE+NLLQFDLA SRLGFS SLLF + TC+ FNFTS
Sbjct: 373 IVIGGYQLEDNLLQFDLAASRLGFSSSLLFSQTTCSNFNFTS 414
>gi|388516731|gb|AFK46427.1| unknown [Medicago truncatula]
Length = 435
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 249/426 (58%), Positives = 298/426 (69%), Gaps = 22/426 (5%)
Query: 13 LLIFSLSPITAQTSFRPKALVLRVAKDTATL-QYVTQIKQRTPLVPVKLTVHLGGNILWV 71
LL F +SP +Q SFRPKALVL + KD AT QY QI QRTPLVP+ + V LGG LWV
Sbjct: 14 LLFFFISPTFSQQSFRPKALVLPITKDGATTNQYKAQINQRTPLVPLNVIVDLGGQFLWV 73
Query: 72 DCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHG 120
DCE Y+SST + ARC SAQC+LAN+ CG CG PDN I++T T G
Sbjct: 74 DCENKYISSTYRPARCRSAQCSLANSDGCGDCFSSPKPGCNNNTCGVTPDNSITHTATSG 133
Query: 121 DIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQL 180
++ DVLSIQS++G NPG+ V V F+F C F+L+GLA+G G+AGLGR+K+ALPSQL
Sbjct: 134 ELAEDVLSIQSSNGFNPGQNVVVSRFLFSCAPTFLLKGLATGASGMAGLGRTKIALPSQL 193
Query: 181 AAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNN--FDVSKNLKYTPLFINKVNTASGF- 237
A+AFS RKFA+CLS G ++FGDGPY L N FD S +L YTPL IN V+TAS F
Sbjct: 194 ASAFSFARKFAICLSS-SKGVVLFGDGPYGFLPNVVFD-SDSLTYTPLLINPVSTASAFS 251
Query: 238 LGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQ 297
G+PS EYFIGV ++ ++ K V LN +LLSIDN GVGGTKISTV+PYTVLE SIYKA+
Sbjct: 252 QGQPSAEYFIGVKTIKIDEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTD 311
Query: 298 AF--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSM 355
AF A A + RV VAPF C+ ++ TR+G VP I+L LQN+NVVW I G NSM
Sbjct: 312 AFVKAPAARNIKRVGSVAPFEFCY--TNLTGTRLGAAVPTIELFLQNENVVWRIFGANSM 369
Query: 356 VQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-F 414
V I + LCLGFV+GG N RTSIVIG QLENNLLQFDLA S+LGFS L + TC+ F
Sbjct: 370 VSINDEVLCLGFVNGGKNTRTSIVIGGYQLENNLLQFDLAASKLGFSSLLFGRQTTCSNF 429
Query: 415 NFTSIA 420
NFTS A
Sbjct: 430 NFTSTA 435
>gi|296086729|emb|CBI32364.3| unnamed protein product [Vitis vinifera]
Length = 400
Score = 449 bits (1156), Expect = e-124, Method: Compositional matrix adjust.
Identities = 236/400 (59%), Positives = 281/400 (70%), Gaps = 29/400 (7%)
Query: 24 QTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNK 83
+TSFRP ALV+ V+KD +TLQY+T I QRTPLVPVKL V LG LWVDCE+ YVSS+ +
Sbjct: 23 KTSFRPDALVIPVSKDASTLQYLTTINQRTPLVPVKLVVDLGAQFLWVDCEQNYVSSSYR 82
Query: 84 TARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
ARC SAQC+LA A CG C + P P N T G + D +S+QSTDG NPGR V+V
Sbjct: 83 PARCRSAQCSLARANGCGD--CFSAP-RPGCNNNTCG-LAEDFVSVQSTDGSNPGRVVSV 138
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGA 201
F+F C F+L+GLAS +G+AGLGR+++A PSQ A+AFS RKFA CLS +G
Sbjct: 139 SKFLFSCAPTFLLEGLASSAMGMAGLGRTRIAFPSQFASAFSFHRKFATCLSSSTTANGV 198
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
+ FGDGPY L N D S++L YTPL+IN S+ +N KA+ L
Sbjct: 199 VFFGDGPYRLLPNIDASQSLIYTPLYINP--------------------SIRINEKAISL 238
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKVARVAPVAPFGACF 319
N +LLSID+EGVGGTKISTVNPYTV+ETSIYKA +AF A+A + RVA VAPF CF
Sbjct: 239 NTSLLSIDSEGVGGTKISTVNPYTVMETSIYKAFTKAFISAAAAINITRVAAVAPFNVCF 298
Query: 320 RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIV 379
+++ TR+GP VP IDLVLQN++V W I G NSMV + D LCLGFVDGG NPRTSIV
Sbjct: 299 SSKNVYSTRVGPSVPSIDLVLQNESVFWRIFGANSMVYVSDDVLCLGFVDGGANPRTSIV 358
Query: 380 IGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTS 418
IG QLE+NLLQFDLATSRLGFS SLLF R TC FNFTS
Sbjct: 359 IGGYQLEDNLLQFDLATSRLGFSSSLLFRRTTCANFNFTS 398
>gi|449432735|ref|XP_004134154.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
gi|449527085|ref|XP_004170543.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 435
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 241/414 (58%), Positives = 296/414 (71%), Gaps = 17/414 (4%)
Query: 23 AQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTN 82
A TSFRPK+L+L V K +LQY+T+I QRTPLVPVKLTV LGG +WVDC++GYVSS+
Sbjct: 23 AATSFRPKSLLLPVTKH-PSLQYITEIHQRTPLVPVKLTVDLGGQFMWVDCDRGYVSSSY 81
Query: 83 KTARCGSAQCNLANAKA------------CGGGICGAGPDNPISNTGTHGDIRIDVLSIQ 130
K ARC SAQC+LA+ + C C P N I T G++ DV+S+
Sbjct: 82 KPARCRSAQCSLASKSSACGQCFSPPRPGCNNNTCSLFPGNTIIRLSTSGEVASDVVSVS 141
Query: 131 STDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
ST+G NP RAV++PNF+F+CGS F+L+GLA GV G+AG GR+ ++LPSQ AAAFS RKF
Sbjct: 142 STNGFNPTRAVSIPNFLFVCGSTFLLEGLAPGVTGMAGFGRNGISLPSQFAAAFSFNRKF 201
Query: 191 ALCLS--PFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTAS-GFLGEPSVEYFI 247
A+CLS G I G+GPY+ L N D++ + YTPLFIN V+TA GE S EYFI
Sbjct: 202 AVCLSGSTSSPGVIFSGNGPYHFLPNIDLTNSFTYTPLFINPVSTAGVSSAGEKSTEYFI 261
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
GVTS+ VN K VPLN TLL ID+ G GGTKISTVNP+TVLE+SIYKALV+AF + + KV
Sbjct: 262 GVTSIVVNSKPVPLNTTLLKIDSNGNGGTKISTVNPFTVLESSIYKALVKAFTTEVSKVP 321
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF 367
RV VAPF C+ + TR+G VP IDLVLQNK V+WS+ G NSMVQ+ + LCLGF
Sbjct: 322 RVGAVAPFEVCYSSKSFPSTRLGAGVPTIDLVLQNKKVIWSMFGANSMVQVNDEVLCLGF 381
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTSIA 420
VDGGV+ RT+IVIGA Q+E+ LL+FDLATSRLGF+ +LL TC FNFTS A
Sbjct: 382 VDGGVDVRTAIVIGAHQIEDKLLEFDLATSRLGFTPTLLGRMTTCANFNFTSNA 435
>gi|15218740|ref|NP_171821.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|13272443|gb|AAK17160.1|AF325092_1 unknown protein [Arabidopsis thaliana]
gi|3850579|gb|AAC72119.1| Strong similarity to gb|D14550 extracellular dermal glycoprotein
(EDGP) precursor from Daucus carota. ESTs gb|H37281,
gb|T44167, gb|T21813, gb|N38437, gb|Z26470, gb|R65072,
gb|N76373, gb|F15470, gb|Z35182, gb|H76373, gb|Z34678
and gb|Z35387 come from this gene [Arabidopsis thaliana]
gi|14334706|gb|AAK59531.1| unknown protein [Arabidopsis thaliana]
gi|16323420|gb|AAL15204.1| unknown protein [Arabidopsis thaliana]
gi|332189425|gb|AEE27546.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 433
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 235/424 (55%), Positives = 295/424 (69%), Gaps = 19/424 (4%)
Query: 13 LLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVD 72
L IFSLS +AQT FRPKAL+L V KD +TLQY T I QRTPLVP + LGG LWVD
Sbjct: 13 LFIFSLSS-SAQTPFRPKALLLPVTKDQSTLQYTTVINQRTPLVPASVVFDLGGRELWVD 71
Query: 73 CEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHGD 121
C+KGYVSST ++ RC SA C+ A + +CG CG PDN ++ T T G+
Sbjct: 72 CDKGYVSSTYQSPRCNSAVCSRAGSTSCGTCFSPPRPGCSNNTCGGIPDNTVTGTATSGE 131
Query: 122 IRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLA 181
+DV+SIQST+G NPGR V +PN IF CG+ F+L+GLA G VG+AG+GR + LPSQ A
Sbjct: 132 FALDVVSIQSTNGSNPGRVVKIPNLIFDCGATFLLKGLAKGTVGMAGMGRHNIGLPSQFA 191
Query: 182 AAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF-LGE 240
AAFS RKFA+CL+ G FG+GPY L +S +L+ TPL IN V+TAS F GE
Sbjct: 192 AAFSFHRKFAVCLTS-GKGVAFFGNGPYVFLPGIQIS-SLQTTPLLINPVSTASAFSQGE 249
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAF 299
S EYFIGVT++ + K VP+N TLL I+ + G+GGTKIS+VNPYTVLE+SIY A F
Sbjct: 250 KSSEYFIGVTAIQIVEKTVPINPTLLKINASTGIGGTKISSVNPYTVLESSIYNAFTSEF 309
Query: 300 A--SAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ 357
+A + RVA V PFGACF +++G TR+G VP+I+LVL +K+VVW I G NSMV
Sbjct: 310 VKQAAARSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIELVLHSKDVVWRIFGANSMVS 369
Query: 358 IGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNF 416
+ D +CLGFVDGGVN RTS+VIG QLE+NL++FDLA+++ GFS +LL + C FNF
Sbjct: 370 VSDDVICLGFVDGGVNARTSVVIGGFQLEDNLIEFDLASNKFGFSSTLLGRQTNCANFNF 429
Query: 417 TSIA 420
TS A
Sbjct: 430 TSTA 433
>gi|449432731|ref|XP_004134152.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
gi|449527081|ref|XP_004170541.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 429
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 233/434 (53%), Positives = 293/434 (67%), Gaps = 19/434 (4%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKL 60
M+ + SF L S+L S A TSF P++LVL V K +LQY+ QI QRTPLVPV L
Sbjct: 1 MAASTSFSLFSSILFLLFSISIASTSFTPRSLVLPVTKH-PSLQYIIQIHQRTPLVPVNL 59
Query: 61 TVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGP 109
TV LGG ++WVDC++G+VSS+ K ARC SAQC+LA + +CG C
Sbjct: 60 TVDLGGWLMWVDCDRGFVSSSYKPARCRSAQCSLAKSISCGKCYLPPHPGCNNYTCSLSA 119
Query: 110 DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGL 169
N I + G++ D++S+ ST+G N RA++VPNF+F+C S F+L+GLA GV G+AG
Sbjct: 120 RNTIIQLSSGGEVTSDLVSVSSTNGFNSTRALSVPNFLFICSSTFLLEGLAGGVTGMAGF 179
Query: 170 GRSKVALPSQLAAAFSLKRKFALCLSPFDD--GAIVFGDGPYYDLNNFDVSKNLKYTPLF 227
GR++++LPSQ AAAFS RKF +CLS G I G GPY+ L N D++ +L YTPL
Sbjct: 180 GRTRISLPSQFAAAFSFSRKFTMCLSGSTGFPGVIFSGYGPYHFLPNIDLTNSLTYTPLL 239
Query: 228 INKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVL 287
IN V GF GE S EYFIGV S+ N K VPLN TLL ID+ G GGTKISTVNPYTVL
Sbjct: 240 INPV----GFAGEKSSEYFIGVKSIEFNSKTVPLNTTLLKIDSNGNGGTKISTVNPYTVL 295
Query: 288 ETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVW 347
ETSIY+ALV+ F S + + RVA VAPF C+ + G T +GP VP IDL+LQNK V+W
Sbjct: 296 ETSIYRALVKTFTSELGNIPRVAAVAPFEVCYSSKSFGSTELGPSVPSIDLILQNKKVIW 355
Query: 348 SIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLF 407
+ G NSMV + + LCLGFV+GGV T++VIG Q+E+NLL+FDLATSRLGFS +LL
Sbjct: 356 RMFGANSMVVVTEEVLCLGFVEGGVEAETAMVIGGHQIEDNLLEFDLATSRLGFSSTLLG 415
Query: 408 ERATCT-FNFTSIA 420
C FNFTS A
Sbjct: 416 RNTNCANFNFTSTA 429
>gi|21537233|gb|AAM61574.1| EDGP precursor [Arabidopsis thaliana]
Length = 433
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 236/424 (55%), Positives = 295/424 (69%), Gaps = 19/424 (4%)
Query: 13 LLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVD 72
L IFSLS +AQT FRPKAL+L V KD +TLQY T I QRTPLVP + LGG LWVD
Sbjct: 13 LFIFSLSS-SAQTPFRPKALLLPVTKDQSTLQYTTVINQRTPLVPASVVFDLGGRELWVD 71
Query: 73 CEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHGD 121
C+KGYVSST ++ RC SA C+ A + +CG CG PDN ++ T T G+
Sbjct: 72 CDKGYVSSTYQSPRCNSAVCSRAGSTSCGTCFSPPRPGCSNNTCGGIPDNTVTGTATSGE 131
Query: 122 IRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLA 181
+DV+SIQST+G NPGR V +PN IF CG+ F+L+GLA G VG+AG+GR + LPSQ A
Sbjct: 132 FALDVVSIQSTNGSNPGRVVKIPNLIFDCGATFLLKGLAKGTVGMAGMGRHNIGLPSQFA 191
Query: 182 AAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF-LGE 240
AAFS RKFA+CL+ G FG+GPY L +S +L+ TPL IN V+TAS F GE
Sbjct: 192 AAFSFHRKFAVCLTS-GKGVAFFGNGPYVFLPGIQIS-SLQTTPLLINPVSTASAFSQGE 249
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAF 299
S EYFIGVT++ + K VP+N TLL I+ + G GGTKIS+VNPYTVLE+SIY A F
Sbjct: 250 KSSEYFIGVTAIQIVEKTVPINPTLLKINASTGFGGTKISSVNPYTVLESSIYNAFTSEF 309
Query: 300 AS-AMPK-VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ 357
A+ + + RVA V PFGACF +++G TR+G VP+I+LVL +K+VVW I G NSMV
Sbjct: 310 VKQALARSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIELVLHSKDVVWRIFGANSMVS 369
Query: 358 IGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNF 416
+ D +CLGFVDGGVN RTS+VIG QLE+NL++FDLA++R GFS +LL + C FNF
Sbjct: 370 VSDDVICLGFVDGGVNARTSVVIGGFQLEDNLIEFDLASNRFGFSSTLLGRQTNCANFNF 429
Query: 417 TSIA 420
TS A
Sbjct: 430 TSTA 433
>gi|297843130|ref|XP_002889446.1| EDGP precursor [Arabidopsis lyrata subsp. lyrata]
gi|297335288|gb|EFH65705.1| EDGP precursor [Arabidopsis lyrata subsp. lyrata]
Length = 433
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 235/424 (55%), Positives = 291/424 (68%), Gaps = 19/424 (4%)
Query: 13 LLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVD 72
L IFSLS +AQTSFRPKAL+L V KD +TLQY T I QRTPLVP + LGG LWVD
Sbjct: 13 LFIFSLSS-SAQTSFRPKALLLPVTKDQSTLQYTTVINQRTPLVPASVVFDLGGRELWVD 71
Query: 73 CEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHGD 121
C+K YVSST ++ RC SA C+ A + +CG CG PDN ++ T T G+
Sbjct: 72 CDKDYVSSTYQSPRCKSAVCSRAGSNSCGTCFSPPRPGCSNNTCGGIPDNTVTGTATSGE 131
Query: 122 IRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLA 181
+DV+SIQST+G NPGR V +PN IF CG+ F+L+GLA+G VG+AG+GR + LPSQ A
Sbjct: 132 FALDVVSIQSTNGSNPGRVVKIPNLIFDCGATFLLKGLATGTVGMAGMGRHNIGLPSQFA 191
Query: 182 AAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF-LGE 240
AAFS RKFA+CL+ G FG+GPY L +S L+ TPL IN V+TAS F GE
Sbjct: 192 AAFSFNRKFAVCLTS-GRGVAFFGNGPYVFLPGIQIS-GLQTTPLLINPVSTASAFSQGE 249
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAF 299
S EYFIGVT++ + K VP+N TLL I+ + G GGTKIS+VNPYTVLE+SIY A F
Sbjct: 250 KSSEYFIGVTAIKIVEKTVPINPTLLKINASTGFGGTKISSVNPYTVLESSIYNAFTSEF 309
Query: 300 A--SAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ 357
+A + RVA V PF ACF +++G TR+G VP+I LVL + +VVW I G NSMV
Sbjct: 310 VKQAAARNITRVASVKPFSACFSTKNVGVTRLGYAVPEIQLVLHSNDVVWRIFGANSMVS 369
Query: 358 IGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNF 416
+ D +CLGFVDGGVN RTS+VIG QLE+NL++FDLA++R GFS +LL R C FNF
Sbjct: 370 VSDDVICLGFVDGGVNARTSVVIGGFQLEDNLIEFDLASNRFGFSSTLLGRRTNCANFNF 429
Query: 417 TSIA 420
TS A
Sbjct: 430 TSTA 433
>gi|291002746|gb|ADD71505.1| xyloglucanase inhibitor 3 [Humulus lupulus]
Length = 441
Score = 439 bits (1130), Expect = e-121, Method: Compositional matrix adjust.
Identities = 229/403 (56%), Positives = 282/403 (69%), Gaps = 21/403 (5%)
Query: 18 LSPITAQT-SFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG 76
+SP +QT SFRPKALVL+V KD+AT QY T I QRTP V VK+ + LGG LWVDCEKG
Sbjct: 19 ISPSISQTISFRPKALVLQVTKDSATHQYYTHITQRTPPVQVKVAIDLGGEFLWVDCEKG 78
Query: 77 YVSSTNKTARCGSAQCNLANAKACGGG------ICGAGPDNPISNTGTHGDIRIDVLSIQ 130
+ SST K C SAQCNLA +KAC +CG P NP +T T GD+ D++ IQ
Sbjct: 79 FNSSTKKPVPCRSAQCNLAKSKACSTNGNPSEDVCGEFPHNPFISTSTSGDLSQDIIYIQ 138
Query: 131 STDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
ST+G PG+ V+VP FIF C F+L+GL SG VG+AGLGR+K+ALPS +AAFS +K
Sbjct: 139 STNGSRPGKVVSVPKFIFTCAPTFLLKGLTSGAVGVAGLGRNKIALPSLFSAAFSFPKKM 198
Query: 191 ALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVT 250
A+CLS +G + FG+GPY + DVSK+L YTPL +N VN GF GE S EYFIGV
Sbjct: 199 AVCLSS-TNGVVFFGNGPYELSSGIDVSKSLTYTPLILNPVNLIGGFQGESSSEYFIGVK 257
Query: 251 SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP--KVAR 308
S+ V+GK V +N +LLS D +G GGTKISTV+PYT LETSIY +V AF +A+ V +
Sbjct: 258 SIKVDGKPVSVNSSLLSFDVDGNGGTKISTVDPYTTLETSIYNTVVNAFVNALAVRNVHK 317
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV 368
VA VAPF ACF +DIG +R GP+VP I+ VLQ++ VVW + G NSMV++ + LCLGFV
Sbjct: 318 VAAVAPFSACFNAKDIGLSRAGPIVPPIEFVLQSEKVVWRVTGANSMVRVSNEVLCLGFV 377
Query: 369 DGG----------VNPRTSIVIGARQLENNLLQFDLATSRLGF 401
DGG P T+IVIG RQ+E+NLLQFDLATSRLGF
Sbjct: 378 DGGPLHFVDWGIKFTP-TAIVIGGRQIEDNLLQFDLATSRLGF 419
>gi|255552237|ref|XP_002517163.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543798|gb|EEF45326.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 469
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 236/416 (56%), Positives = 289/416 (69%), Gaps = 25/416 (6%)
Query: 18 LSPITAQT-SFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG 76
+SP TA+T FRPKALVL V KD T QY+TQIKQRTPLVPVKL V LG +WVDCE+G
Sbjct: 52 ISPSTAETVPFRPKALVLPVFKDKCTNQYITQIKQRTPLVPVKLIVDLGARFMWVDCEEG 111
Query: 77 YVSSTNKTARCGSAQCNLANAKACG------------GGICGAGPDNPISNTGTHGDIRI 124
YVSS+ C S C LAN+ AC C P+NP+ GT G I
Sbjct: 112 YVSSSYTPVSCDSLLCKLANSLACATECNSTPKPGCHNNTCAHSPENPVIRLGTSGQIGQ 171
Query: 125 DVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAF 184
DV+S+QS +G+ P R V+VPNF F+CG F+L+ LA GV G+AGLG S ++LP+Q ++AF
Sbjct: 172 DVVSLQSFNGKTPDRIVSVPNFPFVCGPTFLLENLADGVTGLAGLGNSNISLPAQFSSAF 231
Query: 185 SLKRKFALCL--SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASG-FLGEP 241
+KFA+CL S +G I FGDGPY +L N +L YTPL N V+TA G +LGE
Sbjct: 232 GFPKKFAVCLSNSTKSNGLIFFGDGPYSNLPN-----DLTYTPLIHNPVSTAGGSYLGEA 286
Query: 242 SVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS 301
SVEYFIGV S+ + GK V NKTLLSID+EG GGTKISTV+PYTVL TSIYKA+V+AF
Sbjct: 287 SVEYFIGVKSIRIGGKDVKFNKTLLSIDSEGKGGTKISTVDPYTVLHTSIYKAVVKAFVK 346
Query: 302 AMPK--VARV-APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNK-NVVWSIHGQNSMVQ 357
M K + +V P+APFGACF+ I GPV+P IDLVL+ + +V W I G NSMV+
Sbjct: 347 EMDKKFIPQVQPPIAPFGACFQSIVIDSNEFGPVLPFIDLVLEGQGSVTWRIWGANSMVK 406
Query: 358 IGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
I +CLGFVDGG+ PRTSIVIG RQ+E+NLLQFDLA+S+LGFS SLL + ATC+
Sbjct: 407 ISSLVMCLGFVDGGIEPRTSIVIGGRQIEDNLLQFDLASSKLGFSSSLLVKNATCS 462
>gi|291002742|gb|ADD71503.1| xyloglucanase inhibitor 1 [Humulus lupulus]
Length = 443
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 237/419 (56%), Positives = 287/419 (68%), Gaps = 25/419 (5%)
Query: 22 TAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST 81
TA+T+ PKALVL V KDT T QY+TQI QRTP V +K+ + +GG LW+DCEKGY SST
Sbjct: 25 TAKTAAFPKALVLPVTKDTTTRQYITQITQRTPPVQLKVVLDVGGEFLWIDCEKGYKSST 84
Query: 82 NKTARCGSAQCNLANAKACGG-------GICGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
+ CGS QC L+ + AC G+CG P+NP S+ GT GD+ D+L IQST+G
Sbjct: 85 KRPVPCGSPQCVLSGSGACTTSDNPSDVGVCGVMPNNPFSSVGTSGDLFEDILYIQSTNG 144
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
NPG+ V+VPN +F C +L+GLASG++G+AG GR+KVALPS ++AFS RKF +CL
Sbjct: 145 FNPGKQVSVPNLLFSCAPNSLLEGLASGIIGMAGFGRNKVALPSLFSSAFSFPRKFGVCL 204
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSK--NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
S +G I FG PY L DVS +L YTPL N + S F G PS EYFIGV S+
Sbjct: 205 SS-SNGVIFFGKEPYVLLPGIDVSDPTSLTYTPLIQNPRSLVSSFEGNPSAEYFIGVKSI 263
Query: 253 HVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAM-PKVARVA 310
V+GK + LN TLL+ DNE G GGTKISTV+P+T LETSIYKA+V AF A+ PKV RV
Sbjct: 264 KVDGKPLRLNTTLLTFDNEGGHGGTKISTVDPFTTLETSIYKAVVGAFVKALGPKVPRVK 323
Query: 311 PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG 370
VAPFGACF + IG TR+GP VPQIDLVL+N +WSI G NSMV +G D LCLGFVDG
Sbjct: 324 AVAPFGACFNAKYIGNTRVGPAVPQIDLVLRNDK-LWSIFGANSMVSVGDDVLCLGFVDG 382
Query: 371 G----------VNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTS 418
G P T++VIG Q+ENN L FDL SRLGFS SLLF + TC+ FNF S
Sbjct: 383 GPLNFVDWGVKFTP-TAVVIGGHQIENNFLLFDLGASRLGFSSSLLFRQTTCSNFNFNS 440
>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
Length = 500
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 239/435 (54%), Positives = 287/435 (65%), Gaps = 30/435 (6%)
Query: 7 FLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTAT-LQYVTQIKQRTPLVPVKLTVHLG 65
F CS SL ++ PK L V KD+AT LQY+ QI QRTPLVP+ L V LG
Sbjct: 14 FSFYCS----SLHQLSLNNHSDPKHLFSPVTKDSATTLQYIAQINQRTPLVPLNLVVDLG 69
Query: 66 GNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG----------GGICGAGPDNPISN 115
G LWVDCE Y SST + RC SAQC+LA + +CG CG PDN I++
Sbjct: 70 GKFLWVDCENHYTSSTYRPVRCPSAQCSLAKSDSCGDCFSSPKPGCNNTCGLIPDNTITH 129
Query: 116 TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVA 175
+ T GD+ DVLSIQST G N G+ V V F+F C +L+GLA G G+AGLGR+K+A
Sbjct: 130 SATRGDLAEDVLSIQSTSGFNTGQNVVVSRFLFSCAPTSLLRGLAGGASGMAGLGRTKIA 189
Query: 176 LPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNN--------FDVSKNLKYTPLF 227
LPSQLA+AF KRKFA C S DG I+FGDGPY L + FD SK+L YTPL
Sbjct: 190 LPSQLASAFIFKRKFAFCFSS-SDGVIIFGDGPYSFLADNPSLPNVVFD-SKSLTYTPLL 247
Query: 228 INKVNTASGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTV 286
IN V+TAS FL GE SVEYFIGV ++ ++GK V LN +LLSIDN+GVGGTKISTV+PYTV
Sbjct: 248 INHVSTASAFLQGESSVEYFIGVKTIKIDGKVVSLNSSLLSIDNKGVGGTKISTVDPYTV 307
Query: 287 LETSIYKALVQAF--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKN 344
LE SIYKA+ AF AS + PF C+ ++ T +G VP I+L+LQN N
Sbjct: 308 LEASIYKAVTDAFVKASVARNITTEDSSPPFEFCYSFDNLPGTPLGASVPTIELLLQN-N 366
Query: 345 VVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
V+WS+ G NSMV I + LCLGFV+GGVN RTSIVIG QLENNLLQFDLA SRLGFS++
Sbjct: 367 VIWSMFGANSMVNINDEVLCLGFVNGGVNLRTSIVIGGYQLENNLLQFDLAASRLGFSNT 426
Query: 405 LLFERATC-TFNFTS 418
+ + C FNFTS
Sbjct: 427 IFAHQTDCFRFNFTS 441
>gi|357440781|ref|XP_003590668.1| Basic 7S globulin [Medicago truncatula]
gi|355479716|gb|AES60919.1| Basic 7S globulin [Medicago truncatula]
Length = 434
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 236/426 (55%), Positives = 287/426 (67%), Gaps = 23/426 (5%)
Query: 13 LLIFSLSPITAQTSFRPKALVLRVAKDTATL-QYVTQIKQRTPLVPVKLTVHLGGNILWV 71
LL F +SP ++ SFRPKALVL V KD AT QY QI QRTPLVP+ + V LGG LWV
Sbjct: 14 LLFFFISPTFSKQSFRPKALVLPVTKDVATTNQYKAQINQRTPLVPLNIIVDLGGLFLWV 73
Query: 72 DCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHG 120
DCE Y+SST + ARC SAQC+LA CG C P N ++ + G
Sbjct: 74 DCENQYISSTYRPARCRSAQCSLAKFDDCGVCFSSPKPGCNNNTCSVAPGNSVTQSAMSG 133
Query: 121 DIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQL 180
++ D+LSIQS++G NPG+ V V F+F C F+L+GLASG G+AGLGR+K+ALPSQL
Sbjct: 134 ELAEDILSIQSSNGFNPGQNVMVSRFLFSCARTFLLEGLASGASGMAGLGRNKLALPSQL 193
Query: 181 AAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNN--FDVSKNLKYTPLFINKVNTASGFL 238
A+AFS +KFA+CLS G ++FGDGPY L N FD SK+L YTPL IN +TA+
Sbjct: 194 ASAFSFAKKFAICLSS-SKGVVLFGDGPYGFLPNVVFD-SKSLTYTPLLINPFSTAAFAK 251
Query: 239 GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQ 297
EPS EYFIGV ++ ++GK V L+ +LLSID+ G GGTKISTV+PYTVLE SIYKA+
Sbjct: 252 SEPSAEYFIGVKTIKIDGKVVSLDTSLLSIDSSNGAGGTKISTVDPYTVLEASIYKAVTD 311
Query: 298 AF--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSM 355
AF ASA + RV VAPF C+ ++ TR+G VP I+L LQN NV+W I G NSM
Sbjct: 312 AFVKASAARNIKRVDSVAPFEFCY--TNVTGTRLGADVPTIELYLQN-NVIWRIFGANSM 368
Query: 356 VQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-F 414
V I + LCLGFV GG N SIVIG QLENNLLQFDLA S+LGFS L + TC+ F
Sbjct: 369 VNINDEVLCLGFVIGGENTWASIVIGGYQLENNLLQFDLAASKLGFSSLLFGRQTTCSNF 428
Query: 415 NFTSIA 420
NFTS A
Sbjct: 429 NFTSTA 434
>gi|18379072|ref|NP_563679.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12083230|gb|AAG48774.1|AF332411_1 unknown protein [Arabidopsis thaliana]
gi|3850580|gb|AAC72120.1| Strong similarity to gb|D14550 extracellular dermal glycoprotein
(EDGP) precursor from Daucus carota. ESTs gb|84105 and
gb|AI100071 come from this gene [Arabidopsis thaliana]
gi|332189426|gb|AEE27547.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 434
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 222/414 (53%), Positives = 280/414 (67%), Gaps = 18/414 (4%)
Query: 23 AQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTN 82
AQ SFRPKAL+L V KD +TLQY T I QRTPLVP + LGG WVDC++GYVS+T
Sbjct: 23 AQPSFRPKALLLPVTKDPSTLQYTTVINQRTPLVPASVVFDLGGREFWVDCDQGYVSTTY 82
Query: 83 KTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHGDIRIDVLSIQS 131
++ RC SA C+ A + ACG CGA PDN I+ T G+ +DV+SIQS
Sbjct: 83 RSPRCNSAVCSRAGSIACGTCFSPPRPGCSNNTCGAFPDNSITGWATSGEFALDVVSIQS 142
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFA 191
T+G NPGR V +PN IF CGS +L+GLA G VG+AG+GR + LP Q AAAFS RKFA
Sbjct: 143 TNGSNPGRFVKIPNLIFSCGSTSLLKGLAKGAVGMAGMGRHNIGLPLQFAAAFSFNRKFA 202
Query: 192 LCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF-LGEPSVEYFIGVT 250
+CL+ G FG+GPY L +S+ L+ TPL IN T F GE S EYFIGVT
Sbjct: 203 VCLTS-GRGVAFFGNGPYVFLPGIQISR-LQKTPLLINPGTTVFEFSKGEKSPEYFIGVT 260
Query: 251 SVHVNGKAVPLNKTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKVA 307
++ + K +P++ TLL I+ + G+GGTKIS+VNPYTVLE+SIYKA F +A +
Sbjct: 261 AIKIVEKTLPIDPTLLKINASTGIGGTKISSVNPYTVLESSIYKAFTSEFIRQAAARSIK 320
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF 367
RVA V PFGACF +++G TR+G VP+I LVL +K+VVW I G NSMV + D +CLGF
Sbjct: 321 RVASVKPFGACFSTKNVGVTRLGYAVPEIQLVLHSKDVVWRIFGANSMVSVSDDVICLGF 380
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTSIA 420
VDGGVNP S+VIG QLE+NL++FDLA+++ GFS +LL + C FNFTS A
Sbjct: 381 VDGGVNPGASVVIGGFQLEDNLIEFDLASNKFGFSSTLLGRQTNCANFNFTSTA 434
>gi|316927704|gb|ADU58605.1| xyloglucan-specific endoglucanase inhibitor 4 [Solanum tuberosum]
Length = 440
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 226/418 (54%), Positives = 273/418 (65%), Gaps = 20/418 (4%)
Query: 23 AQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTN 82
A+T +P+A +L V KD +T Q+VT I QRTPLVPVKLT+ LG LWVDCEKGYVSS+
Sbjct: 23 AKTPSKPRAFLLPVTKDASTKQFVTTISQRTPLVPVKLTIDLGQRFLWVDCEKGYVSSSY 82
Query: 83 KTARCGSAQCNLANAKAC------------GGGICGAGPDNPISNTGTHGDIRIDVLSIQ 130
K CGS C + + AC C P N T T G++ DV+S+Q
Sbjct: 83 KPVPCGSIPCKRSLSGACVESCVGPPSPGCNNNTCSHIPYNHFIRTSTGGELAQDVVSLQ 142
Query: 131 STDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
STDG NP + ++ +F C +L+GLA GV GI GLG V P+QLA AFS+ RKF
Sbjct: 143 STDGSNPRKYLSTNGVVFDCAPHSLLEGLAKGVKGILGLGNGYVGFPTQLANAFSVPRKF 202
Query: 191 ALCL--SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTA-SGFLGEPSVEYFI 247
A+CL S G I FGD PY L DVSK L YTPL N V+T+ S F GEPS +YFI
Sbjct: 203 AICLTSSTTSRGVIFFGDSPYVFLPGMDVSKRLVYTPLLKNPVSTSGSYFEGEPSTDYFI 262
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
GVTS+ +NG VP+N TLL+I +G GGTKISTV+PYT LETSIY AL +AF ++ KV
Sbjct: 263 GVTSIKINGNVVPINTTLLNITKDGKGGTKISTVDPYTKLETSIYNALTKAFVKSLAKVP 322
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVV--WSIHGQNSMVQIGGDALCL 365
RV PVAPF C+ +G TR+G VP I+LVL NKN W+I G NSMV + D LCL
Sbjct: 323 RVKPVAPFKVCYNRTSLGSTRVGRGVPPIELVLGNKNATTSWTIWGVNSMVAMNNDVLCL 382
Query: 366 GFVDGGV--NPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTSIA 420
GF+DGGV P TSIVIGA Q+E+NLLQFD+A RLGF+ SLLF + TC FNFTS A
Sbjct: 383 GFLDGGVEFEPTTSIVIGAHQIEDNLLQFDIANKRLGFTSSLLFGQTTCANFNFTSKA 440
>gi|255552245|ref|XP_002517167.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543802|gb|EEF45330.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 435
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 222/432 (51%), Positives = 287/432 (66%), Gaps = 30/432 (6%)
Query: 7 FLLLCSLLIFSLSPITAQT-SFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLG 65
FLL S+L+F +SP A++ RPK LVL V KD + QYVTQI QRTPLV VKLTV LG
Sbjct: 11 FLLFSSILLF-ISPSAARSVPARPKPLVLPVLKDKCSHQYVTQINQRTPLVAVKLTVDLG 69
Query: 66 GNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG------------GGICGAGPDNPI 113
G +WVDC+ YVSS+ RC SA C LA++ +C C P NP+
Sbjct: 70 GTFMWVDCDN-YVSSSYTPVRCDSALCKLADSHSCTTECYSSPKPGCYNNTCSHIPYNPV 128
Query: 114 SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSK 173
+ T GDI +DV+S+QS DG+ PGR V+VPN F+CG+ F+L+ LA GV+G+AGLGR
Sbjct: 129 VHVSTSGDIGLDVVSLQSMDGKYPGRNVSVPNVPFVCGTGFMLENLADGVLGVAGLGRGN 188
Query: 174 VALPSQLAAAFSLKRKFALCLSPFDD--GAIVFGD--GPYYDLNNFDVSKNLKYTPLFIN 229
++LP+ ++A L+ KFA+CLS + G I FGD GP S L YTPL N
Sbjct: 189 ISLPAYFSSALGLQSKFAICLSSLTNSSGVIYFGDSIGPL-------SSDFLIYTPLVRN 241
Query: 230 KVNTASGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLE 288
V+TA + G+ S +YFI V ++ V GK + NKTLLSIDNEG GGT+ISTV+PYT+L
Sbjct: 242 PVSTAGAYFEGQSSTDYFIAVKTLRVGGKEIKFNKTLLSIDNEGKGGTRISTVHPYTLLH 301
Query: 289 TSIYKALVQAFASAMPKVARV-APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNK-NVV 346
TSIYKA+++AFA M + V P+APFG C++ + GPVVP IDLVL+++ +V
Sbjct: 302 TSIYKAVIKAFAKQMKFLIEVNPPIAPFGLCYQSAAMDINEYGPVVPFIDLVLESQGSVY 361
Query: 347 WSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLL 406
W I G NSMV+I +CLGFVDGG+ P +SI+IG RQLE+NLLQFDLA++RLGF+ SLL
Sbjct: 362 WRIWGANSMVKISSYVMCLGFVDGGLKPDSSIIIGGRQLEDNLLQFDLASARLGFTSSLL 421
Query: 407 FERATCTFNFTS 418
TC+ NF S
Sbjct: 422 VRNTTCS-NFNS 432
>gi|297818546|ref|XP_002877156.1| hypothetical protein ARALYDRAFT_484681 [Arabidopsis lyrata subsp.
lyrata]
gi|297322994|gb|EFH53415.1| hypothetical protein ARALYDRAFT_484681 [Arabidopsis lyrata subsp.
lyrata]
Length = 420
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 212/406 (52%), Positives = 267/406 (65%), Gaps = 29/406 (7%)
Query: 28 RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARC 87
RPKAL+L V KD +TLQY T I QRTPLVP + L G LWVDC+KGYVS+T + RC
Sbjct: 27 RPKALLLPVTKDQSTLQYTTIINQRTPLVPASVVFDLSGRELWVDCDKGYVSTTYHSPRC 86
Query: 88 GSAQCNLANAKACG-----------GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRN 136
SA C+ A + +CG CGA P N ++ T G+ +DV+SIQST+G N
Sbjct: 87 NSAVCSRAGSISCGTCFSPPKPGCSNNTCGAFPSNSVTGWSTSGEFALDVVSIQSTNGSN 146
Query: 137 PGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP 196
PGR V +PN IF CGS +L+GLA G VG+AG+GR K++LPSQ AAAFS RKFA+CL+
Sbjct: 147 PGRFVKIPNIIFSCGSTSLLKGLAKGTVGMAGMGRHKISLPSQFAAAFSFNRKFAVCLTS 206
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
G FG+GPY L +S+ L+ TPL IN P EYFIGV + +
Sbjct: 207 -GRGVTFFGNGPYVFLPGIQISR-LQKTPLLIN-----------PG-EYFIGVREIKIVE 252
Query: 257 KAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFA--SAMPKVARVAPVA 313
K VP+N+ LL I+ E G GGTKIS+VNPYTVLE+SI+K+ F + + RVA V
Sbjct: 253 KTVPINQMLLKINKETGFGGTKISSVNPYTVLESSIFKSFTSMFVRQATARNMTRVASVK 312
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN 373
PF ACF Q++G TR+G VP+I LVL + +VVW I G NSMV + D +CLGFVDGGVN
Sbjct: 313 PFSACFSTQNVGVTRLGYAVPEIQLVLHSNDVVWRIFGGNSMVSVSDDVICLGFVDGGVN 372
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTS 418
RTS+VIG QLE+NL++FDLA++R GFS +LL + C FNFTS
Sbjct: 373 ARTSVVIGGFQLEDNLIEFDLASNRFGFSSTLLGRQTNCANFNFTS 418
>gi|147821119|emb|CAN68736.1| hypothetical protein VITISV_030193 [Vitis vinifera]
Length = 441
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 223/432 (51%), Positives = 276/432 (63%), Gaps = 25/432 (5%)
Query: 8 LLLCSLLIFSLSPITAQTSF-RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGG 66
L+L SL FS+S QT P ALVL + K +TLQYVT I QRTPLVP+ + V LGG
Sbjct: 12 LILLSLTSFSIS----QTPLIHPNALVLPLTKHASTLQYVTIISQRTPLVPLNVIVDLGG 67
Query: 67 NILWVDCEKGYVSSTNKTARCGSAQCNLANA------------KACGGGICGAGPDNPIS 114
LWV C YVSS+ + ARC S+QC LA+ C G C +N +
Sbjct: 68 QFLWVGCGSNYVSSSYRPARCHSSQCFLAHGPKSCDHCLSRGRPKCNNGTCILFSENVFT 127
Query: 115 NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKV 174
+ + GD+ DVLS+QSTDG NP AV +P+F+F C E +LQGLA G GIAGLG ++
Sbjct: 128 SKVSAGDLSEDVLSLQSTDGLNPRSAVAIPHFLFSCAPEVLLQGLAGGAEGIAGLGHGRI 187
Query: 175 ALPSQLAAAFSLKRKFALCLSP--FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVN 232
LP+ L++A + RKFA+CL P G I FGDGPY L DVSK L YTPL N +
Sbjct: 188 GLPTLLSSALNFTRKFAVCLPPTTTSSGVIFFGDGPYALLPGIDVSKLLIYTPLIKNPRS 247
Query: 233 TASG-FLGE--PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLET 289
A+ ++ E PS EYFI V S+ +NGK VPL+ +LL+I+ G+GGTKISTVNPYT+L+T
Sbjct: 248 VATRVYVTEPLPSYEYFIRVKSIQINGKQVPLDSSLLAINKNGIGGTKISTVNPYTLLQT 307
Query: 290 SIYKALVQAF-ASAMPK-VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVW 347
SIY + + F AM V RV+PVAPF CF ++ P +P IDLVLQNK V W
Sbjct: 308 SIYNSFTKLFLQEAMAHNVTRVSPVAPFDVCFSTKNTNGAFSTPAIPVIDLVLQNKKVFW 367
Query: 348 SIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLF 407
I NSMV +G D CLGF+DGG+N RTSIVIG QLE+NLLQFDL +SRLGF+ SLL
Sbjct: 368 RIFETNSMVLVGDDVACLGFLDGGLNQRTSIVIGGHQLEDNLLQFDLESSRLGFTSSLLL 427
Query: 408 ERATC-TFNFTS 418
+C FNFTS
Sbjct: 428 RETSCANFNFTS 439
>gi|225451013|ref|XP_002284868.1| PREDICTED: basic 7S globulin-like [Vitis vinifera]
Length = 441
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 222/432 (51%), Positives = 276/432 (63%), Gaps = 25/432 (5%)
Query: 8 LLLCSLLIFSLSPITAQTSF-RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGG 66
L+L SL FS+S QT P ALVL + K +TLQYVT I QRTPLVP+ + V LGG
Sbjct: 12 LILLSLTSFSIS----QTPLIHPNALVLPLTKHASTLQYVTIISQRTPLVPLNVIVDLGG 67
Query: 67 NILWVDCEKGYVSSTNKTARCGSAQCNLANA------------KACGGGICGAGPDNPIS 114
LWV C YVSS+ + A+C S+QC LA+ C G C +N +
Sbjct: 68 QFLWVGCGSNYVSSSYRPAQCHSSQCFLAHGPKSCDHCLSRGRPKCNNGTCILFSENVFT 127
Query: 115 NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKV 174
+ + GD+ DVLS+QSTDG NP AV +P+F+F C E +LQGLA G GIAGLG ++
Sbjct: 128 SKVSAGDLSEDVLSLQSTDGLNPRSAVAIPHFLFSCAPEVLLQGLAGGAEGIAGLGHGRI 187
Query: 175 ALPSQLAAAFSLKRKFALCLSP--FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVN 232
LP+ L++A + RKFA+CL P G I FGDGPY L DVSK L YTPL N +
Sbjct: 188 GLPTLLSSALNFTRKFAVCLPPTTTSSGVIFFGDGPYALLPGIDVSKLLIYTPLIKNPRS 247
Query: 233 TASG-FLGE--PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLET 289
A+ ++ E PS EYFI V S+ +NGK VPL+ +LL+I+ G+GGTKISTVNPYT+L+T
Sbjct: 248 VATRVYVTEPLPSYEYFIRVKSIQINGKQVPLDSSLLAINKNGIGGTKISTVNPYTLLQT 307
Query: 290 SIYKALVQAF-ASAMPK-VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVW 347
SIY + + F AM V RV+PVAPF CF ++ P +P IDLVLQNK V W
Sbjct: 308 SIYNSFTKLFLQEAMAHNVTRVSPVAPFDVCFSTKNTNGAFSTPAIPVIDLVLQNKKVFW 367
Query: 348 SIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLF 407
I NSMV +G D CLGF+DGG+N RTSIVIG QLE+NLLQFDL +SRLGF+ SLL
Sbjct: 368 RIFETNSMVLVGDDVACLGFLDGGLNQRTSIVIGGHQLEDNLLQFDLESSRLGFTSSLLL 427
Query: 408 ERATC-TFNFTS 418
+C FNFTS
Sbjct: 428 RETSCANFNFTS 439
>gi|326496543|dbj|BAJ94733.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326511583|dbj|BAJ91936.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 205/399 (51%), Positives = 258/399 (64%), Gaps = 16/399 (4%)
Query: 29 PKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCG 88
P A+VL V KD+AT QY+T +QRTP VPV + LGG LWVDC+ GYVSS+ C
Sbjct: 27 PSAVVLPVRKDSATGQYLTGFRQRTPQVPVTAVLDLGGASLWVDCDAGYVSSSYAGVPCA 86
Query: 89 SAQCNLANAKACG------------GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRN 136
S C LA + AC C P+N ++ T G++ DVLS+ +T
Sbjct: 87 SKLCRLAKSVACATSCVGKPSPGCLNDTCSGFPENTVTRVSTGGNLITDVLSVPTTFRPA 146
Query: 137 PGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-S 195
PG T P F+F CG+ F+ GLA+G G+A L R++ ALP+QLAA F RKFALCL S
Sbjct: 147 PGPLATAPAFLFTCGATFLTDGLAAGATGMASLSRARFALPTQLAATFRFSRKFALCLTS 206
Query: 196 PFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGE--PSVEYFIGVTSVH 253
G +VFGD PY D+SK+L YTPL +N V+TA G G+ S EYFIGVT++
Sbjct: 207 TSAAGVVVFGDAPYAFQPGVDLSKSLTYTPLLVNNVSTA-GVSGQKDKSNEYFIGVTAIK 265
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
VNG+AVPLN +LL+ID +G GGTK+STV PYTVLETSI+KA+ AFA+ + RV VA
Sbjct: 266 VNGRAVPLNASLLAIDKQGGGGTKLSTVAPYTVLETSIHKAVTDAFAAETAMIPRVRAVA 325
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN 373
PF C+ +G TR+GP VP ++LVLQN+ W + G NSMV G ALCLG VDGG
Sbjct: 326 PFKLCYDGSKVGSTRVGPAVPTVELVLQNEAASWVVFGANSMVAAKGGALCLGVVDGGAA 385
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
PRTS+VIG +E+NLL+FDL +RLGFS SLLF + TC
Sbjct: 386 PRTSVVIGGHTMEDNLLEFDLQRARLGFSSSLLFRQTTC 424
>gi|125552283|gb|EAY97992.1| hypothetical protein OsI_19909 [Oryza sativa Indica Group]
Length = 437
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 206/410 (50%), Positives = 261/410 (63%), Gaps = 14/410 (3%)
Query: 17 SLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG 76
S+SP A + P A++L V+KD AT QYVT +QRTP PVK + L G LWVDCE G
Sbjct: 22 SVSPCRAASGGGPSAVLLPVSKDDATQQYVTMFRQRTPQAPVKAVLDLAGATLWVDCEAG 81
Query: 77 YVSSTNKTARCGSAQCNLANAKACG------------GGICGAGPDNPISNTGTHGDIRI 124
YVSS+ CGS QC LA AC CG P+N +++ T G+I
Sbjct: 82 YVSSSYARVPCGSKQCRLAKTNACATSCDGAPSPACLNDTCGGFPENTVTHVSTSGNIIT 141
Query: 125 DVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAF 184
DVLS+ +T PG T P F+F CG+ F+ +GLA+G G+ L R++ A P+QLAA F
Sbjct: 142 DVLSLPTTFRPAPGPLATAPAFLFTCGATFLTEGLAAGATGMVSLSRARFAFPTQLAATF 201
Query: 185 SLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTAS-GFLGEPS 242
RKFALCL P G ++FGD PY D+SK+L YTPL +N V+TA G+ S
Sbjct: 202 RFSRKFALCLPPAAAAGVVIFGDAPYVFQPGVDLSKSLIYTPLLVNPVSTAGVSTKGDKS 261
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
EYF+GVT + VNG+AVPLN TLL+I+ +GVGGTK+STV PYTVLETSI+KA+ AFA+
Sbjct: 262 TEYFVGVTRIKVNGRAVPLNTTLLAINKKGVGGTKLSTVTPYTVLETSIHKAVTDAFAAE 321
Query: 303 MPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA 362
+ RV VAPF C+ + TR+GP VP ++LV Q++ W + G NSMV G A
Sbjct: 322 TSMIPRVPAVAPFKLCYDGSKVASTRVGPAVPTVELVFQSEATSWVVFGANSMVATKGGA 381
Query: 363 LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
LCLG VDGG P TS+VIG +E+NLL+FDL SRLGFS SLLF + TC
Sbjct: 382 LCLGVVDGGAAPETSVVIGGHMMEDNLLEFDLVGSRLGFSSSLLFRQTTC 431
>gi|115463793|ref|NP_001055496.1| Os05g0402900 [Oryza sativa Japonica Group]
gi|113579047|dbj|BAF17410.1| Os05g0402900 [Oryza sativa Japonica Group]
Length = 437
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 202/410 (49%), Positives = 260/410 (63%), Gaps = 14/410 (3%)
Query: 17 SLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG 76
S+SP A + P A++L V+KD AT QYVT +QRTP P+K + L G LWVDCE G
Sbjct: 22 SVSPCRAASGGDPSAVLLPVSKDDATQQYVTMFRQRTPQAPLKAVLDLAGATLWVDCEAG 81
Query: 77 YVSSTNKTARCGSAQCNLANAKACG------------GGICGAGPDNPISNTGTHGDIRI 124
YVSS+ CGS QC LA AC CG P+N +++ T G++
Sbjct: 82 YVSSSYARVPCGSKQCRLAKTNACATSCDGAPSPACLNDTCGGFPENTVTHVSTSGNVIT 141
Query: 125 DVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAF 184
DVLS+ +T PG T P F+F CG+ F+ +GLA+G G+ L R++ A P+QLAA F
Sbjct: 142 DVLSLPTTFRPAPGPLATAPAFLFTCGATFLTEGLAAGATGMVSLSRARFAFPTQLAATF 201
Query: 185 SLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTAS-GFLGEPS 242
RKFALCL P G ++FGD PY D+SK+L YTPL +N V+T G+ S
Sbjct: 202 RFSRKFALCLPPAAAAGVVIFGDAPYVFQPGVDLSKSLIYTPLLVNPVSTGGVSTKGDKS 261
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
EYF+G+T + VNG+AVPLN TLL+I+ +GVGGTK+STV PYTVLETSI+KA+ AFA+
Sbjct: 262 TEYFVGLTRIKVNGRAVPLNTTLLAINKKGVGGTKLSTVTPYTVLETSIHKAVTDAFAAE 321
Query: 303 MPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA 362
+ RV VAPF C+ + TR+GP VP ++LV Q++ W + G NSMV G A
Sbjct: 322 TSMIPRVPAVAPFKLCYDGSKVAGTRVGPAVPTVELVFQSEATSWVVFGANSMVATKGGA 381
Query: 363 LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
LCLG VDGGV TS+VIG +E+NLL+FDL SRLGFS SLLF + TC
Sbjct: 382 LCLGVVDGGVASETSVVIGGHMMEDNLLEFDLVGSRLGFSSSLLFRQTTC 431
>gi|297724111|ref|NP_001174419.1| Os05g0403000 [Oryza sativa Japonica Group]
gi|50878436|gb|AAT85210.1| hypothetical protein [Oryza sativa Japonica Group]
gi|222631539|gb|EEE63671.1| hypothetical protein OsJ_18489 [Oryza sativa Japonica Group]
gi|255676353|dbj|BAH93147.1| Os05g0403000 [Oryza sativa Japonica Group]
Length = 437
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 203/411 (49%), Positives = 256/411 (62%), Gaps = 18/411 (4%)
Query: 19 SPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV 78
SP + RP+A+VL V+KD AT QY T +QRTP VPVK + L G LWVDC+ GYV
Sbjct: 22 SPCRSAAGGRPRAVVLPVSKDDATQQYATVFRQRTPQVPVKAVLDLAGATLWVDCDTGYV 81
Query: 79 SSTNKTARCGSAQCNL------------ANAKACGGGICGAGPDNPISNTGTHGDIRIDV 126
SS+ CGS C L A + AC G C PDN ++ G+I DV
Sbjct: 82 SSSYARVPCGSKPCRLTKTGGCFNSCFGAPSPACLNGTCSGFPDNTVTRVTAGGNIITDV 141
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
LS+ +T PG TVP F+F CG F+ +GLA+G G+ L R++ A P+QLA F
Sbjct: 142 LSLPTTFRTAPGPFATVPEFLFTCGHTFLTEGLANGATGMVSLSRARFAFPTQLARTFGF 201
Query: 187 KRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNTASGFL-GEPSV 243
R+FALCL P G +VFGD PY D+SK+ L YTPL +N V TA + GE S+
Sbjct: 202 SRRFALCLPPASAAGVVVFGDAPYVFQPGVDLSKSSLIYTPLLVNAVRTAGKYTTGETSI 261
Query: 244 EYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
EY IG+T + VNG+ VPLN TLL+ID GVGGT +ST +PYTVLETSIYKA++ AFA+
Sbjct: 262 EYLIGLTGIKVNGRDVPLNATLLAIDKNGVGGTTLSTASPYTVLETSIYKAVIDAFAAET 321
Query: 304 PKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL 363
+ RV VAPF C+ + +G TR GP VP I+LVLQ + V W ++G NSMV G AL
Sbjct: 322 ATIPRVPAVAPFELCYDGRKVGSTRAGPAVPTIELVLQREAVSWIMYGANSMVPAKGGAL 381
Query: 364 CLGFVDGG--VNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
CLG VDGG + P +S+VIG +E+NLL+FDL SRLGFS L + TC
Sbjct: 382 CLGVVDGGPALYP-SSVVIGGHMMEDNLLEFDLEGSRLGFSSYLPLRQTTC 431
>gi|115463795|ref|NP_001055497.1| Os05g0403300 [Oryza sativa Japonica Group]
gi|50878438|gb|AAT85212.1| unknown protein [Oryza sativa Japonica Group]
gi|113579048|dbj|BAF17411.1| Os05g0403300 [Oryza sativa Japonica Group]
Length = 455
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 207/446 (46%), Positives = 262/446 (58%), Gaps = 37/446 (8%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKL 60
++VA LLL L S+SP A P ++VL V+KD AT QYVT +QRTP VPVK
Sbjct: 7 LAVASGALLL---LFVSVSPCRAAAGGGPSSVVLPVSKDDATQQYVTMFRQRTPQVPVKA 63
Query: 61 TVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNL------------ANAKACGGGICGAG 108
+ L G +LWVDC+ GYVSS+ RCG+ C L A + C C
Sbjct: 64 VLDLAGTMLWVDCDAGYVSSSYAGVRCGAKPCRLLKNAGCAITCLDAVSAGCLNDTCSEF 123
Query: 109 PDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAG 168
P N ++ T G+I DVLS+ +T PG T P F+F CG F+ QGLA G G+
Sbjct: 124 PKNTATSVSTAGNIITDVLSLPTTFRPAPGPLATAPAFLFTCGHTFLTQGLADGATGMVS 183
Query: 169 LGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNLKYTPLF 227
L R++ ALP+QLA F RKFALCL P G +VFGD PY D+SK+L YTPL
Sbjct: 184 LSRARFALPTQLADTFGFSRKFALCLPPASAAGVVVFGDAPYTFQPGVDLSKSLIYTPLL 243
Query: 228 INKVNTA---------SGFLGE------------PSVEYFIGVTSVHVNGKAVPLNKTLL 266
+N V+TA F+GE S +YFIG+T + VNG VP+N TLL
Sbjct: 244 VNPVSTAPYGRKDKTTKYFIGETTIQLKGRVWREKSTDYFIGLTGIKVNGHTVPVNATLL 303
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGF 326
+ID +GVGGTK+STV+PYTVLE SI++A+ AFA M + R V PF C+ + +G
Sbjct: 304 AIDKKGVGGTKLSTVSPYTVLERSIHQAVTDAFAKEMAAIPRAPAVEPFKLCYDGRKVGS 363
Query: 327 TRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLE 386
TR+GP VP I+LVLQ+ W + G NSMV G ALCLG VD G P+TS+VIG +E
Sbjct: 364 TRVGPAVPTIELVLQSTGASWVVFGANSMVATKGGALCLGVVDAGTEPQTSVVIGGHMME 423
Query: 387 NNLLQFDLATSRLGFSDSLLFERATC 412
+NLL+FDL SRLGFS L + TC
Sbjct: 424 DNLLEFDLEASRLGFSSYLPSRQTTC 449
>gi|255552253|ref|XP_002517171.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543806|gb|EEF45334.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 437
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 213/436 (48%), Positives = 280/436 (64%), Gaps = 38/436 (8%)
Query: 12 SLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWV 71
S L+F +SP TA+ FRPKAL+L V KD T QY+TQI QRTPLVPVKLTV LGG+++W+
Sbjct: 12 SFLLFLISPSTARAPFRPKALLLPVFKDKCTRQYITQIDQRTPLVPVKLTVDLGGSLMWI 71
Query: 72 DCEKGYVSSTNKTARCGSAQCNLANAKACG------------GGICGAGPDNPISNTGTH 119
+CE+GYVSS+ + C SA C+L+N+++C CG +N + GT
Sbjct: 72 NCEEGYVSSSYRPLSCDSALCSLSNSQSCNKECYSSPKPGCYNNTCGQSSNNRVVYIGTG 131
Query: 120 GDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQ 179
GD+ DV+++QS DG+N GR V+VPNF F+CG ++L LA GV G+AGLGRS ++LP+
Sbjct: 132 GDLGQDVVALQSFDGKNLGRIVSVPNFPFVCGITWLLDDLADGVTGMAGLGRSNISLPAY 191
Query: 180 LAAAFSLKRKFALCLSPF--DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF 237
++A + F++CLS +G IVFGDGP + VS +L Y L +N V T G+
Sbjct: 192 FSSAIGFSKTFSICLSSSTKSNGVIVFGDGP-----SSIVSNDLIYIRLILNPVGTP-GY 245
Query: 238 --LGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKAL 295
LGE S +Y+IGV S+ V+GK V +KTLLSID +G GGT +STVNPYTVL TSIYKAL
Sbjct: 246 SSLGESSADYYIGVKSIRVDGKEVKFDKTLLSIDKDGNGGTMLSTVNPYTVLHTSIYKAL 305
Query: 296 VQAFASAMP-KVARVAPV--APFGAC-----FRLQDIGFTRIGPVVPQIDLVL---QNKN 344
++AF + + + V P PFGAC FR + VP I+L L Q +
Sbjct: 306 LKAFIKKLVFRFSLVVPSVPVPFGACVFSNGFRTTE----EFLSYVPIINLELESEQGNS 361
Query: 345 VVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
V W I G NSMV + +CL F+DGG PRT I+IG QLE+NLL FDLA+SRLGFS S
Sbjct: 362 VYWRILGANSMVAVNSYTMCLAFIDGGSQPRTPIIIGGHQLEDNLLHFDLASSRLGFSSS 421
Query: 405 LLFERATCT-FNFTSI 419
LL TC+ NF ++
Sbjct: 422 LLPRNTTCSNLNFNAL 437
>gi|50878437|gb|AAT85211.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 435
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 199/407 (48%), Positives = 249/407 (61%), Gaps = 16/407 (3%)
Query: 19 SPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV 78
SP A + P A++L V KD AT QYVT QRTP VPVK V L G +LWVDCE GY
Sbjct: 26 SPCRAASGSSPSAVLLPVDKDGATQQYVTMFWQRTPSVPVKAVVDLAGAMLWVDCESGYE 85
Query: 79 SSTNKTARCGSAQCNLANAKACGGGICGAG------------PDNPISNTGTHGDIRIDV 126
SS+ CGS C LA + AC G GA P+ I+ T G+I D
Sbjct: 86 SSSYARVPCGSKPCRLAKSAACATGCSGAASPGCLNDTCTGFPEYTITRVSTGGNIITDK 145
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
LS+ +T P T P F+F CG+ + +GL + G+ L R++ ALP+Q+A+ F
Sbjct: 146 LSLYTTCRPMPVPRATAPGFLFTCGATSLTKGLGAAATGMMSLSRARFALPTQVASIFRF 205
Query: 187 KRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
RKFALCL+P + G +VFGD PY D+SK+L YTPL +N V T G + S EY
Sbjct: 206 SRKFALCLAPAESSGVVVFGDAPYEFQPVMDLSKSLIYTPLLVNPVTTTGG---DKSTEY 262
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
FIGVT + VNG+AVPLN TLL+I GVGGTK+S ++PYTVLETSIYKA+ AFA+
Sbjct: 263 FIGVTGIKVNGRAVPLNATLLAIAKSGVGGTKLSMLSPYTVLETSIYKAVTDAFAAETAM 322
Query: 306 VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
+ RV VAPF C+ +G TR GP VP ++LVLQ+K V W + G NSMV ALC
Sbjct: 323 IPRVPAVAPFKLCYDGTMVGSTRAGPAVPTVELVLQSKAVSWVVFGANSMVATKDGALCF 382
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
G VDGGV P TS+VIG +E+NLL+FDL SRLGF+ L + TC
Sbjct: 383 GVVDGGVAPETSVVIGGHMMEDNLLEFDLEGSRLGFTSYLPLLQTTC 429
>gi|388509650|gb|AFK42891.1| unknown [Lotus japonicus]
Length = 347
Score = 367 bits (942), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 204/350 (58%), Positives = 243/350 (69%), Gaps = 25/350 (7%)
Query: 88 GSAQCNLANAKACGGG-ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
GS+QC+L C G ICG P N +++ ++GDI DV+S+ STDG P + V+VPNF
Sbjct: 2 GSSQCSLFGLTGCSGDKICGRSPSNTVTSVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNF 61
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVF 204
+F+CGS+ V GLA GV G+AGLGR++V+LPSQ ++AFS RKFA+CL+ DG + F
Sbjct: 62 LFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGADGVMFF 121
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTA-SGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
GDGP Y+LN DVSK L YTPL N V+TA S FLGEPSVEYFIGV S+ V+ K VPLN
Sbjct: 122 GDGP-YNLNQ-DVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNT 179
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
TLLSI+ GVGGTKISTVNPYTV+ET+IYKA+ AF ++ V+PVAPFG CF +D
Sbjct: 180 TLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSL-GAPTVSPVAPFGTCFATKD 238
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR-------- 375
I F+RIGP VP IDLVLQN V W I G NSMVQ D +CLGFVD G NP+
Sbjct: 239 ISFSRIGPGVPAIDLVLQN-GVEWPIIGANSMVQF-DDVICLGFVDAGSNPKASQVGFVN 296
Query: 376 ------TSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTS 418
TSI IGA QLENNLL+FDLA SRLGF SL E C F FTS
Sbjct: 297 GGSHPVTSITIGAHQLENNLLKFDLAASRLGFR-SLFLEHDNCQNFRFTS 345
>gi|359480063|ref|XP_003632393.1| PREDICTED: LOW QUALITY PROTEIN: basic 7S globulin-like [Vitis
vinifera]
Length = 433
Score = 366 bits (940), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 206/417 (49%), Positives = 264/417 (63%), Gaps = 27/417 (6%)
Query: 23 AQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTN 82
AQ+SFRP A V+ ++KD +TLQYVT I Q TPLVP +L V LGG L VDCE+ YVSS+
Sbjct: 22 AQSSFRPHAFVVPISKDGSTLQYVTSINQMTPLVPFQLVVDLGGQFLCVDCEQNYVSSSY 81
Query: 83 KTARCGSAQCNLANAK-----------ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQS 131
+ + AQC +A A C CG DN ++ T + ++ +DV+S+ +
Sbjct: 82 RPTQYKLAQCLVAKASGYGNFFSASKLGCNNNTCGVLSDNTVTRTASSDELVVDVVSVXA 141
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFA 191
T+G NPGR+V+V F+F +L+GLASG G+ +ALPSQ A+AF+ RKF+
Sbjct: 142 TNGSNPGRSVSVSKFLFSYAPTSLLEGLASGAKGMM-----HIALPSQFASAFNFHRKFS 196
Query: 192 LCLSP--FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF-LGEPSVEYFIG 248
+CLS DG I GDGPY L N D S+ L YTPL +N V+ S + GE S+EY G
Sbjct: 197 ICLSSSTIVDGIIFLGDGPYELLLNVDASQLLIYTPLILNPVSIVSTYSQGESSIEYLFG 256
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKV 306
V S+ +N K VP+ L+ GV TKI+TVNPY V+ETSIY A +AF S A +
Sbjct: 257 VNSIXINEK-VPIEHILVVHXXXGVRETKINTVNPYIVMETSIYSAFTKAFISTTASMNI 315
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
RVA VAPF F +++ T+ G +P I LVLQN ++VW I NSMV + GD LCLG
Sbjct: 316 TRVATVAPFNIYFNSKNVYSTQGGATIPTIGLVLQNNSMVWRIFRANSMVFVNGDVLCLG 375
Query: 367 FVDGGVN----PRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTS 418
FVDGG N PRTSIVIG QLE+NL+QFDLATSRLGF+ LLF + TC+ FNFTS
Sbjct: 376 FVDGGENPIPTPRTSIVIGGYQLEDNLIQFDLATSRLGFNSFLLFSQTTCSNFNFTS 432
>gi|413945301|gb|AFW77950.1| hypothetical protein ZEAMMB73_390094 [Zea mays]
Length = 438
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 209/427 (48%), Positives = 268/427 (62%), Gaps = 15/427 (3%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKL 60
++ A FL L S+ + + A +P A++L V+KD AT QYVT +QRTPLVPVK
Sbjct: 6 LAAAVLFLCLLSVCRAAGAGAGAAGKGKPSAVLLPVSKDGATQQYVTGFRQRTPLVPVKA 65
Query: 61 TVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG------------GGICGAG 108
+ L G LWVDCE GY S+T CGS+ C L+ + AC CG
Sbjct: 66 VLDLAGATLWVDCEAGYASATYSRVPCGSSLCRLSRSAACATSCSGAPSPSCLNDTCGGF 125
Query: 109 PDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAG 168
P+N ++ T G++ DVL++ +T PG T P F+F CG+ F+ QGLA+G G+A
Sbjct: 126 PENTVTQVSTSGNVITDVLALPTTFRPAPGPLATAPAFLFTCGATFLTQGLAAGAAGMAS 185
Query: 169 LGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSK-NLKYTPL 226
L RS+ ALP+QLA+ F RKFALCL P G +VFGD PY +S +L YTPL
Sbjct: 186 LSRSRFALPTQLASTFRFSRKFALCLPPAAAAGVVVFGDAPYAFQPGVVLSDTSLSYTPL 245
Query: 227 FINKVNTAS-GFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYT 285
+N V+TA + S EYF+GVT + VNG+AVPLN TLL+ID +GVGGTK+STV PYT
Sbjct: 246 LVNPVSTAGVSTRHDKSDEYFVGVTGIKVNGRAVPLNATLLAIDRKGVGGTKLSTVAPYT 305
Query: 286 VLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNV 345
VL++SIYKA+ AFA+ + R P+APF C+ +G TR+GP VP I+LVL N+
Sbjct: 306 VLQSSIYKAVTDAFAAETAMIPRAPPLAPFKLCYDGSKVGSTRVGPAVPTIELVLGNEAT 365
Query: 346 VWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSL 405
W + G NSMV G ALCLG VDGG PRTS+VIG +E+NLLQFDL SRLGFS SL
Sbjct: 366 SWVVFGANSMVATEGGALCLGVVDGGKAPRTSVVIGGHMMEDNLLQFDLEASRLGFSSSL 425
Query: 406 LFERATC 412
LF + C
Sbjct: 426 LFRQTNC 432
>gi|357133735|ref|XP_003568479.1| PREDICTED: basic 7S globulin-like [Brachypodium distachyon]
Length = 441
Score = 358 bits (919), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 205/423 (48%), Positives = 261/423 (61%), Gaps = 23/423 (5%)
Query: 12 SLLIF--SLSPITAQTSFRPKALVLRVAKDTATLQYVTQ-IKQRTPLVPVKLTVHLGGNI 68
+LL F S S A TS P A+VL V KD AT QYV +QRTP PV + LGG
Sbjct: 14 ALLFFLVSSSCCLAATSSNPSAVVLAVQKDDATGQYVAGGFRQRTPQAPVTAVLDLGGAT 73
Query: 69 LWVDCEKG-YVSSTNKTARCGSAQCNLANAKACG------------GGICGAGPDNPISN 115
LWVDC+ G Y SS+ C S C LA AC CG P+N ++
Sbjct: 74 LWVDCDPGQYASSSYARVPCASKPCRLARTSACATSCVGAPSPGCLNDTCGGFPENTVTR 133
Query: 116 TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVA 175
T G++ DVLS+ +T PG T P F+F CG+ F+ +GLASG G+A L R++ A
Sbjct: 134 LRTSGNLITDVLSLPTTFRPAPGPLATAPAFLFACGATFLTKGLASGAAGMASLSRARFA 193
Query: 176 LPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNT 233
LP+QLA F RKFA CL P G ++FGD PY ++SK+ L YTPL ++ V+T
Sbjct: 194 LPTQLADTFRFPRKFAHCLPPASGAGFVLFGDAPYAFQPGVEISKSSLIYTPLLVDNVST 253
Query: 234 A--SGFLGEPSVEYFIGVTSVHVNGKAVP-LNKTLLSIDNE-GVGGTKISTVNPYTVLET 289
A SG G+ S EYFIGVT++ VNG+AVP LN TLL+ID + GVGGTK+STV PYTVLET
Sbjct: 254 AGVSG-KGDKSTEYFIGVTAIKVNGRAVPRLNATLLAIDGKTGVGGTKLSTVAPYTVLET 312
Query: 290 SIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSI 349
SI++A+ AFA+ + RV V PF C+ +G TR+GP VP ++LV+Q++ W +
Sbjct: 313 SIHQAVTDAFAAETAMIPRVPSVPPFRLCYDGSKVGSTRVGPAVPTVELVMQSEAASWVV 372
Query: 350 HGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFER 409
G NSMV G ALCL VDGG PRTS+V+G +E+NLL+FDL RLGFS SLLF +
Sbjct: 373 FGANSMVATKGGALCLAVVDGGKAPRTSVVVGGHMMEDNLLEFDLQGLRLGFSSSLLFRQ 432
Query: 410 ATC 412
TC
Sbjct: 433 TTC 435
>gi|222631541|gb|EEE63673.1| hypothetical protein OsJ_18491 [Oryza sativa Japonica Group]
Length = 456
Score = 351 bits (900), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 203/451 (45%), Positives = 258/451 (57%), Gaps = 39/451 (8%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKL 60
++VA LLL L S+SP A P ++VL V+KD AT QYVT +QRTP VPVK
Sbjct: 7 LAVASGALLL---LFVSVSPCRAAAGGGPSSVVLPVSKDDATQQYVTMFRQRTPQVPVKA 63
Query: 61 TVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNL------------ANAKACGGGICGAG 108
+ L G +LWVDC+ GYVSS+ RCG+ C L A + C C
Sbjct: 64 VLDLAGTMLWVDCDAGYVSSSYAGVRCGAKPCRLLKNAGCAITCLDAVSAGCLNDTCSEF 123
Query: 109 PDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA-VTVPNFIFLCGSEFVLQGLASGVVGIA 167
P N ++ T G+I DVLS+ +T PG A + + QGLA G G+
Sbjct: 124 PKNTATSVSTAGNIITDVLSLPTTFRPAPGAAGHRAGRSCSPAATRSLTQGLADGATGMV 183
Query: 168 GLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNLKYTPL 226
L R++ ALP+QLA F RKFALCL P G +VFGD PY D+SK+L YTPL
Sbjct: 184 SLSRARFALPTQLADTFGFSRKFALCLPPASAAGVVVFGDAPYTFQPGVDLSKSLIYTPL 243
Query: 227 FINKVNTA---------SGFLGE------------PSVEYFIGVTSVHVNGKAVPLNKTL 265
+N V+TA F+GE S +YFIG+T + VNG VP+N TL
Sbjct: 244 LVNPVSTAPYGRKDKTTKYFIGETTIQLKGRVWREKSTDYFIGLTGIKVNGHTVPVNATL 303
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
L+ID +GVGGTK+STV+PYTVLE SI++A+ AFA M + R V PF C+ + +G
Sbjct: 304 LAIDKKGVGGTKLSTVSPYTVLERSIHQAVTDAFAKEMAAIPRAPAVEPFKLCYDGRKVG 363
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQL 385
TR+GP VP I+LVLQ+ W + G NSMV G ALCLG VD G P+TS+VIG +
Sbjct: 364 STRVGPAVPTIELVLQSTGASWVVFGANSMVATKGGALCLGVVDAGTEPQTSVVIGGHMM 423
Query: 386 ENNLLQFDLATSRLGFSDSLLFERATCTFNF 416
E+NLL+FDL SRLGFS L + TC NF
Sbjct: 424 EDNLLEFDLEASRLGFSSYLPSRQTTCN-NF 453
>gi|225436982|ref|XP_002272199.1| PREDICTED: basic 7S globulin 2-like, partial [Vitis vinifera]
Length = 415
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 198/404 (49%), Positives = 239/404 (59%), Gaps = 21/404 (5%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNL- 94
+ KD T QY + +TPL P KL + LGG+ WVDC K YVSST C S+ C L
Sbjct: 7 ITKDHQTNQYSLSLCLKTPLKPSKLLLDLGGSFSWVDCYKHYVSSTYHHIPCNSSLCTLL 66
Query: 95 ----------ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
A + C C N ++ +D ++ +TDGRNPGR +
Sbjct: 67 SLNSCAHCYRAPSPTCANDTCATTLHNSVTGKSIFHSALVDAAALPTTDGRNPGRLALLA 126
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDGAI 202
NF F C + +L+GLA GV G AGLG S ++LP Q A SL R FALCL SP G
Sbjct: 127 NFAFACSTTDLLKGLAKGVTGSAGLGWSDLSLPVQFIAGLSLPRVFALCLSGSPSAPGVG 186
Query: 203 VFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTA-SGFLGEPSVEYFIGVTSVHVNGKAVP 260
+G GPY+ L D+SK L YTPL +N TA G PS EYFIGVT++ VNG AV
Sbjct: 187 FYGSAGPYHFLPEIDLSKKLIYTPLLVNPYGTALDSNHGRPSDEYFIGVTALKVNGHAVD 246
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKVARVAPVAPFGAC 318
LN LL++D G GGTKISTV PYTVLE+SIY+AL AF SA + PV PF C
Sbjct: 247 LNPALLTVDLNGNGGTKISTVAPYTVLESSIYEALTHAFIAESAGLNLTVHYPVKPFRVC 306
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---GGDALCLGFVDGGVNPR 375
F D+ T +GP VP +DLV+Q+ +V W I G+NSMV+I G D CLGFVDGGV PR
Sbjct: 307 FPADDVMETTVGPAVPTVDLVMQSDDVFWRIFGRNSMVRILEEGVDVWCLGFVDGGVRPR 366
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTS 418
TSIVIG Q+E+NLLQFDL RLGFS S+L C FNFTS
Sbjct: 367 TSIVIGGHQMEDNLLQFDLGLKRLGFSSSVLVHHTMCANFNFTS 410
>gi|147821120|emb|CAN68737.1| hypothetical protein VITISV_030194 [Vitis vinifera]
Length = 439
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 189/400 (47%), Positives = 235/400 (58%), Gaps = 27/400 (6%)
Query: 26 SFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTA 85
S RP ALVL V+K+ AT +V I++RTPL PV L + + G LWVDCE Y+SST
Sbjct: 35 SSRPNALVLLVSKNEATNLHVVDIQKRTPLKPVPLVLDVNGRSLWVDCESNYLSSTYNAP 94
Query: 86 RCGSAQCNLANAK-----------ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
+C S QC+ AN C CG NPIS G++ DVLSI STDG
Sbjct: 95 QCHSTQCSRANLHDCRTCSAQTRPGCHNNTCGLNAANPISGETAFGELAQDVLSIPSTDG 154
Query: 135 RNPGRAVTVPNFIFLCG-SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+ G+ VT+P F+F C S +G V G+ GLG + +ALP+QLA+ F ++KFALC
Sbjct: 155 SSLGQLVTIPQFLFACAPSSLAQKGFPPAVQGVVGLGHTSIALPTQLASHFGFQQKFALC 214
Query: 194 L-SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
L SP + G + G+ PY DVS L TPL I++ EYFI VTS+
Sbjct: 215 LTSPLNHGVLFLGEAPYRLHPGIDVSHPLGSTPLSISREG-----------EYFIQVTSI 263
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
+N + VP+N LL N G T IST PYTVLE SIY+ Q +A+ M RV P+
Sbjct: 264 RINERVVPVNPALL---NRRPGSTLISTTTPYTVLEHSIYQTFTQFYANQMSWAPRVQPI 320
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV 372
APFG CF + T+IGP V IDLVL N+N VW I G NSMVQ CLGFVDGG
Sbjct: 321 APFGLCFDATKMTATQIGPEVANIDLVLHNRNNVWRIVGANSMVQPRPGVWCLGFVDGGS 380
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
NP+ I++G+ QLE+NLLQFDLA S+LGFS SLLF C
Sbjct: 381 NPKAPIILGSYQLEDNLLQFDLARSKLGFSSSLLFRGTHC 420
>gi|255552257|ref|XP_002517173.1| pepsin A, putative [Ricinus communis]
gi|223543808|gb|EEF45336.1| pepsin A, putative [Ricinus communis]
Length = 449
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 200/442 (45%), Positives = 275/442 (62%), Gaps = 35/442 (7%)
Query: 7 FLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGG 66
FLL L+ LS AQ S +PKAL+L V KD TLQYVT + TPL V LGG
Sbjct: 9 FLLFSFLVSLPLS--IAQASSKPKALILPVNKDATTLQYVTHLNIGTPLAKKDFVVDLGG 66
Query: 67 NILWVDCEKG-YVSSTNKTARCGSAQCNLANAKACGGGICG---AGPDNP----ISNTGT 118
LW+DC+ G YVSST + + CGSA C++A A GG + G +G N +S
Sbjct: 67 AHLWMDCDDGSYVSSTFRQSLCGSAPCSVAKATCTGGCVPGHHKSGCSNETCYVLSTNTI 126
Query: 119 HGDIRI-----DVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSK 173
G + + D++++QSTDG G V++P++IF C + + L+ LASG G+ GLGR +
Sbjct: 127 QGRLEVGVVSRDIIALQSTDGAKSGSLVSIPDYIFACANAWDLKSLASGAKGMLGLGREQ 186
Query: 174 VALPSQLAAAF--SLKRKFALCLSPFD---DGAIVFGDGPY-----YDLNN-FDVSKNLK 222
+ALP QL+++F S +RKFA+CL P D +G + FGD PY Y+ + DVS K
Sbjct: 187 IALPKQLSSSFGGSFRRKFAICL-PSDSKSNGVMFFGDSPYVFYPSYNTSKAIDVSSRFK 245
Query: 223 YTPLFINKVNTASGFL--GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
+T L+IN V T S + G PS EYF+ VTS+ VN K +P+N+ L G GGTKIST
Sbjct: 246 HTKLYINTVFTGSSTVIRGPPSPEYFVRVTSILVNRKPIPINRAFLEFHANGTGGTKIST 305
Query: 281 VNPYTVLETSIYKALVQAFASAMP--KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL 338
V PYT LE++IYKA+V+AF + V++VAPVAPF C+ L ++G T +G VP I
Sbjct: 306 VEPYTQLESTIYKAVVEAFDEEISVWNVSKVAPVAPFKDCYSLGNMGITGLGISVPDIAF 365
Query: 339 VLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP--RTSIVIGARQLENNLLQFDLA 395
+ NKN+ W ++G N+MV++ D +CL F+D G P T IVIGA QL++NLLQFDL
Sbjct: 366 EFENNKNLNWGMYGANTMVEVSRDVVCLAFLDRGEMPLITTPIVIGAHQLQDNLLQFDLH 425
Query: 396 TSRLGFSDSLLFERATCT-FNF 416
++RL F+++LL+E C+ F F
Sbjct: 426 SNRLAFTETLLWEEVECSNFKF 447
>gi|359487782|ref|XP_002280966.2| PREDICTED: basic 7S globulin-like [Vitis vinifera]
Length = 620
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 189/400 (47%), Positives = 235/400 (58%), Gaps = 27/400 (6%)
Query: 26 SFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTA 85
S RP ALVL V+K+ AT +V I++RTPL PV L + + G LWVDCE Y+SST
Sbjct: 216 SSRPNALVLLVSKNEATNLHVVDIQKRTPLKPVPLVLDVNGRSLWVDCESNYLSSTYNAP 275
Query: 86 RCGSAQCNLANAK-----------ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
+C S QC+ AN C CG NPIS G++ DVLSI STDG
Sbjct: 276 QCHSTQCSRANLHDCRTCSAQTRPGCHNNTCGLNAANPISGETAFGELAQDVLSIPSTDG 335
Query: 135 RNPGRAVTVPNFIFLCG-SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+ G+ VT+P F+F C S +G V G+ GLG + +ALP+QLA+ F ++KFALC
Sbjct: 336 SSLGQLVTIPQFLFACAPSSLAQKGFPPAVQGVVGLGHTSIALPTQLASHFGFQQKFALC 395
Query: 194 L-SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
L SP + G + G+ PY DVS L TPL I++ EYFI VTS+
Sbjct: 396 LTSPLNHGVLFLGEAPYRLHPGIDVSHPLGSTPLSISREG-----------EYFIQVTSI 444
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
+N + VP+N LL N G T IST PYTVLE SIY+ Q +A+ M RV P+
Sbjct: 445 RINERVVPVNPALL---NRRPGSTLISTTTPYTVLEHSIYQTFTQFYANQMSWAPRVQPI 501
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV 372
APFG CF + T+IGP V IDLVL N+N VW I G NSMVQ CLGFVDGG
Sbjct: 502 APFGLCFDATKMTATQIGPEVANIDLVLHNRNNVWRIVGANSMVQPRPGVWCLGFVDGGS 561
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
NP+ I++G+ QLE+NLLQFDLA S+LGFS SLLF C
Sbjct: 562 NPKAPIILGSYQLEDNLLQFDLARSKLGFSSSLLFRGTHC 601
>gi|359806276|ref|NP_001241217.1| uncharacterized protein LOC100818868 precursor [Glycine max]
gi|255644718|gb|ACU22861.1| unknown [Glycine max]
Length = 450
Score = 327 bits (838), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 193/441 (43%), Positives = 255/441 (57%), Gaps = 31/441 (7%)
Query: 2 SVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLT 61
S+ F L + L+ L ++ ++ +P+A +L + KD TLQY T I TP + + L
Sbjct: 7 SIIIHFFLFSAPLLSVLWSCSSVSALKPRAFILPIEKDPTTLQYSTSIDMGTPPLTLDLV 66
Query: 62 VHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKAC------------GGGICGAGP 109
+ + LW +C Y SST RCG+ +C A AC CG P
Sbjct: 67 IDIRERFLWFECGNDYNSSTYYPVRCGTKKCKKAKGTACITCTNHPLKTGCTNNTCGVDP 126
Query: 110 DNPISNTGTHGDIRIDVLS-IQSTDGRNPGRAVTVPNFIFLC------GSEFVLQGLASG 162
NP GD+ D+LS + ST G + VP F+ C G E LQGLA G
Sbjct: 127 FNPFGEFFVSGDVGEDILSSLHSTSGARAPSTLHVPRFVSTCVYPDKFGVEGFLQGLAKG 186
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCL---SPFDD-GAIVFGDGPYYDLNNFDVS 218
G+ GL R+ ++LP+QLAA ++L+ KFALCL S ++ G + G GPYY L D S
Sbjct: 187 KKGVLGLARTAISLPTQLAAKYNLEPKFALCLPSTSKYNKLGDLFVGGGPYY-LPPHDAS 245
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
K L YTP+ N +T F +PS EYFI V S+ ++GK V +N +LLSID +G GG K+
Sbjct: 246 KFLSYTPILTNPQSTGPIFDADPSSEYFIDVKSIKLDGKIVNVNTSLLSIDRQGNGGCKL 305
Query: 279 STVNPYTVLETSIYKALVQAFA--SAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQI 336
STV PYT TSIY+ LV F +A+ K+ RV VAPFGACF + IG T GP VP I
Sbjct: 306 STVVPYTKFHTSIYQPLVNDFVKQAALRKIKRVTSVAPFGACFDSRTIGKTVTGPNVPTI 365
Query: 337 DLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP----RTSIVIGARQLENNLLQF 392
DLVL+ V W I+G NSMV++ + LCLGFVDGG+ P TSIVIG Q+E+NLL+F
Sbjct: 366 DLVLKG-GVQWRIYGANSMVKVSKNVLCLGFVDGGLEPGSPIATSIVIGGYQMEDNLLEF 424
Query: 393 DLATSRLGFSDSLLFERATCT 413
DL +S+LGFS SLL A+C+
Sbjct: 425 DLVSSKLGFSSSLLLHMASCS 445
>gi|222631538|gb|EEE63670.1| hypothetical protein OsJ_18488 [Oryza sativa Japonica Group]
Length = 419
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 190/403 (47%), Positives = 239/403 (59%), Gaps = 18/403 (4%)
Query: 17 SLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG 76
S+SP A + P A++L V+KD AT QYVT +QRTP P+K + L G LWVDCE G
Sbjct: 22 SVSPCRAASGGDPSAVLLPVSKDDATQQYVTMFRQRTPQAPLKAVLDLAGATLWVDCEAG 81
Query: 77 YVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNT-----GTHGDIRIDVLSIQS 131
YVSS+ CGS QC LA AC GA +++T G HG R
Sbjct: 82 YVSSSYARVPCGSKQCRLAKTNACATSCDGAPSPACLNDTCGGFPGEHGHARQHQRQRHH 141
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFA 191
VP G A+G+V L R++ A P+QLAA F RKFA
Sbjct: 142 --------RRAVPAHHLPPGPGAAFAAGATGMVS---LSRARFAFPTQLAATFRFSRKFA 190
Query: 192 LCLSPFDD-GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTAS-GFLGEPSVEYFIGV 249
LCL P G ++FGD PY D+SK+L YTPL +N V+T G+ S EYF+G+
Sbjct: 191 LCLPPAAAAGVVIFGDAPYVFQPGVDLSKSLIYTPLLVNPVSTGGVSTKGDKSTEYFVGL 250
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
T + VNG+AVPLN TLL+I+ +GVGGTK+STV PYTVLETSI+KA+ AFA+ + RV
Sbjct: 251 TRIKVNGRAVPLNTTLLAINKKGVGGTKLSTVTPYTVLETSIHKAVTDAFAAETSMIPRV 310
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
VAPF C+ + TR+GP VP ++LV Q++ W + G NSMV G ALCLG VD
Sbjct: 311 PAVAPFKLCYDGSKVAGTRVGPAVPTVELVFQSEATSWVVFGANSMVATKGGALCLGVVD 370
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
GGV TS+VIG +E+NLL+FDL SRLGFS SLLF + TC
Sbjct: 371 GGVASETSVVIGGHMMEDNLLEFDLVGSRLGFSSSLLFRQTTC 413
>gi|255552259|ref|XP_002517174.1| conserved hypothetical protein [Ricinus communis]
gi|223543809|gb|EEF45337.1| conserved hypothetical protein [Ricinus communis]
Length = 445
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 194/442 (43%), Positives = 260/442 (58%), Gaps = 36/442 (8%)
Query: 8 LLLCSLLIFSLSPITAQT----SFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVH 63
L + SL I S S + AQT S P+ALVL + KD+ TLQY+T++ TP VP L V
Sbjct: 7 LYILSLFI-SFSLVRAQTPSKSSSNPRALVLPITKDSYTLQYLTRLNLGTPPVPRNLVVD 65
Query: 64 LGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAK-----------ACGGGICGAGPDNP 112
LGG LW+DC+ GY SST + CGSA C+LA A C C N
Sbjct: 66 LGGQHLWIDCDTGYQSSTYRPGYCGSASCSLAKAACVSICPNPRGPGCNNNTCKVLARNS 125
Query: 113 ISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRS 172
+ G ++ +DV+S+QSTDG G V+V +FIF C + + L LA+ G+ GLG+
Sbjct: 126 VFGGGIFPEVSLDVISLQSTDGSEAGPPVSVSDFIFGCANTWDLIDLANAANGMIGLGKE 185
Query: 173 KVALPSQLAAAF--SLKRKFALCL---SPFDDGAIVFGDGPYYDLNNF------DVSKNL 221
+VA PSQL++ F S +RKFA+CL S F +G + FGD PY+ + D+S
Sbjct: 186 RVAFPSQLSSVFGGSFRRKFAICLPSNSKF-NGVLFFGDSPYHFYPGYNTSKLIDISSRF 244
Query: 222 KYTPLFINKVNTASGFL-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
YT L N TAS L G EYF+ +TSV VN K +P+N TLL G+GG++IST
Sbjct: 245 TYTKLHTNYERTASPRLQGAQVPEYFVKITSVLVNDKPIPINTTLLDFHRTGIGGSRIST 304
Query: 281 VNPYTVLETSIYKALVQAFAS--AMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL 338
V PYT+LE SIY +LV+AF A KV + A V PF C+ + T +G VP I
Sbjct: 305 VKPYTILEGSIYDSLVKAFDKEIATWKVKKAAAVTPFKDCYSKGHLAMTPLGLTVPDISF 364
Query: 339 VLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP---RTSIVIGARQLENNLLQFDLA 395
V +NK+V W+I+G NSMV+I D +CLGF+ GVN TSI +GA Q+++N LQFDLA
Sbjct: 365 VFENKHVRWNIYGANSMVEISNDVVCLGFLR-GVNETWTTTSIDMGAHQMQDNFLQFDLA 423
Query: 396 TSRLGFSDSLLFERATCT-FNF 416
S++ F+++LL E C+ F F
Sbjct: 424 ASKMAFTNTLLLEDVECSNFKF 445
>gi|255552261|ref|XP_002517175.1| pepsin A, putative [Ricinus communis]
gi|223543810|gb|EEF45338.1| pepsin A, putative [Ricinus communis]
Length = 445
Score = 325 bits (833), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 184/421 (43%), Positives = 253/421 (60%), Gaps = 27/421 (6%)
Query: 23 AQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTN 82
+++S +PKALVL + KD++TLQY+T++ TPLV L V LGG LW+DC+ Y S T
Sbjct: 25 SKSSLKPKALVLPITKDSSTLQYLTRLNLGTPLVLKNLVVDLGGQHLWIDCDTDYHSLTY 84
Query: 83 KTARCGSAQCNLANA-----------KACGGGICGAGPDNPISNTGTHGDIRIDVLSIQS 131
+ CGSA C+ A A C C N + GT + +D +S QS
Sbjct: 85 RPGHCGSATCSQAKAFCVSICLNPPGPNCNNNSCIVDAQNSVFGAGTTAHVSLDTISFQS 144
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAF--SLKRK 189
TDG G V+ NFIF C F L LASG G+ GLG+ ++A PSQL++ F SL+RK
Sbjct: 145 TDGSKAGPPVSFSNFIFGCAHNFSLTSLASGANGMIGLGKERLAFPSQLSSLFGGSLRRK 204
Query: 190 FALCL--SPFDDGAIVFGDGPY-----YDLNN-FDVSKNLKYTPLFINKVNTASGFL-GE 240
FA+CL + +G + GD PY Y+ + DVS L YT L N TA+ L G
Sbjct: 205 FAICLPSTSKSNGVLFLGDSPYQFYPGYNTSKAIDVSSRLTYTKLHTNYKRTATPRLQGA 264
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFA 300
EYF+ +TS+ VN K +P+N TLL G+GG++I+TV PYT+LE+SIY +LV+AF
Sbjct: 265 QVPEYFVKITSILVNRKPIPINTTLLDFHRTGIGGSRITTVKPYTILESSIYDSLVKAFD 324
Query: 301 S--AMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI 358
+ A KV +VA V PF C+ ++ + +G VP I V +NK+V W ++G NSMV+I
Sbjct: 325 TEIATWKVKKVAAVEPFRDCYSKGNLAMSPLGLAVPDITFVFENKDVSWDMYGANSMVEI 384
Query: 359 GGDALCLGFVDG--GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FN 415
D +CLGF+ G + TSI IGA QL++NL+QFDLA SR+ F+++LL E C+ FN
Sbjct: 385 SNDVVCLGFLRGVTEIWTTTSIDIGAYQLQDNLVQFDLAASRMAFTNTLLLEEVECSNFN 444
Query: 416 F 416
F
Sbjct: 445 F 445
>gi|381148024|gb|AFF60302.1| xyloglucanase-specific endoglucanase inhibitor [Solanum tuberosum]
Length = 438
Score = 324 bits (830), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 185/407 (45%), Positives = 242/407 (59%), Gaps = 32/407 (7%)
Query: 29 PKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCG 88
P L+L V KD++TLQY+T I QRTPL+PV LTVHLGG L VDCE GY SST K ARC
Sbjct: 34 PTTLILPVTKDSSTLQYITVIGQRTPLIPVNLTVHLGGESLVVDCESGYTSSTYKPARCK 93
Query: 89 SAQCNLANAK--------------ACGGGICGAGPDNPISNTGTHG-DIRIDVLSIQSTD 133
S QC+ A K C C NP+ T T G ++ DVL+I
Sbjct: 94 SKQCSFAKVKFDACGDYCLTKPKPGCNNNTCHTLVGNPVITTYTFGAELAEDVLAI---- 149
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGR-SKVALPSQLAAAFS-LKRKFA 191
G +P V+ P FIF C ++++ LA GV GIAG G S +++P+QLA+ S RKF
Sbjct: 150 GTSPIVLVSQPKFIFTCVESYIMKRLAKGVTGIAGFGHNSTISIPNQLASLDSKFTRKFG 209
Query: 192 LCLSPF--DDGAIVFGDGPYYDLNNF-DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIG 248
+CLS G I G PYY N D+SKNL YTPL N ++ + +EY +
Sbjct: 210 ICLSSSTRSSGVIFIGSSPYYVYNPMIDISKNLIYTPLVGNPMDWLT------PMEYHVN 263
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
V+S+ + GK VPLNKTLLSI+++G GGT+IST P+T+L TSIY+ + AF +A+PK
Sbjct: 264 VSSIRIAGKDVPLNKTLLSINDQGHGGTRISTTIPFTILHTSIYEVVKTAFINALPKNVT 323
Query: 309 VA--PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
+ P+ FGACF ++I T +GP VP ID V K+ W I+G NS+VQ+ D +CL
Sbjct: 324 MVDPPMKRFGACFSSKNIRITNVGPDVPVIDFVFHKKSAFWRIYGANSVVQVSKDIMCLA 383
Query: 367 FVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
FV SIVIG QLE NLL FDL ++GFS SL ++ +C+
Sbjct: 384 FVGRDQTWEPSIVIGGYQLEENLLVFDLPHKKIGFSSSLKLQQTSCS 430
>gi|222631540|gb|EEE63672.1| hypothetical protein OsJ_18490 [Oryza sativa Japonica Group]
Length = 400
Score = 320 bits (821), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 180/395 (45%), Positives = 228/395 (57%), Gaps = 27/395 (6%)
Query: 19 SPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV 78
SP A + P A++L V KD AT QYVT QRTP VPVK V L G +LWVDCE GY
Sbjct: 26 SPCRAASGSSPSAVLLPVDKDGATQQYVTMFWQRTPSVPVKAVVDLAGAMLWVDCESGYE 85
Query: 79 SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPG 138
SS+ A + C C P+ I+ T G+I D LS+ +T P
Sbjct: 86 SSSAPPV---PPAAPGAASPGCLNDTCTGFPEYTITRVSTGGNIITDKLSLYTTCRPMPV 142
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
T P F+F CG+ + +GL + G+ L R++ ALP+Q+A+ F RKFALCL+P +
Sbjct: 143 PRATAPGFLFTCGATSLTKGLGAAATGMMSLSRARFALPTQVASIFRFSRKFALCLAPAE 202
Query: 199 D-GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G +VFGD PY D+SK+L YTPL +N VN G+
Sbjct: 203 SSGVVVFGDAPYEFQPVMDLSKSLIYTPLLVNPVN-----------------------GR 239
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
AVPLN TLL+I GVGGTK+S ++PYTVLETSIYKA+ AFA+ + RV VAPF
Sbjct: 240 AVPLNATLLAIAKSGVGGTKLSMLSPYTVLETSIYKAVTDAFAAETAMIPRVPAVAPFKL 299
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS 377
C+ +G TR GP VP ++LVLQ+K V W + G NSMV ALC G VDGGV P TS
Sbjct: 300 CYDGTMVGSTRAGPAVPTVELVLQSKAVSWVVFGANSMVATKDGALCFGVVDGGVAPETS 359
Query: 378 IVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
+VIG +E+NLL+FDL SRLGF+ L + TC
Sbjct: 360 VVIGGHMMEDNLLEFDLEGSRLGFTSYLPLLQTTC 394
>gi|242087871|ref|XP_002439768.1| hypothetical protein SORBIDRAFT_09g019770 [Sorghum bicolor]
gi|241945053|gb|EES18198.1| hypothetical protein SORBIDRAFT_09g019770 [Sorghum bicolor]
Length = 450
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 203/405 (50%), Positives = 252/405 (62%), Gaps = 24/405 (5%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSA 90
A+VL V+KD AT QYVT +QRTPLVPVK + L G LWVDC+ GY SST CGS
Sbjct: 41 AVVLPVSKDDATQQYVTGFRQRTPLVPVKAVLDLAGATLWVDCD-GYASSTYTRVPCGST 99
Query: 91 QC-----NLANAKACGGG--------ICGAGPDNPISNTGTHGDIRIDVLSIQST--DGR 135
C + A A C G CG P+N ++ T G++ DVL++ +T
Sbjct: 100 LCRGLSRSPACATTCSGAPSPSCLNDTCGGFPENTVTRLSTGGNVITDVLALPTTFRPAP 159
Query: 136 NPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS 195
PG T P F+F CGS + +GLA+G G+A L R++ ALP+QLA+ F RKFALCL
Sbjct: 160 APGPLATAPAFLFACGSTSLTRGLAAGAAGMASLSRARFALPTQLASTFRFSRKFALCLP 219
Query: 196 PFDD-GAIVFGDGPYYDLNNFDV--SKNLKYTPLFINKVNTAS-GFLGEPSVEYFIGVTS 251
P G +VFGD P Y + +L YT L +N V+TA G+ S EYF+GVT
Sbjct: 220 PAAAAGVVVFGDAPAYAFQPGVALSATDLTYTRLLVNPVSTAGVSARGDKSDEYFVGVTG 279
Query: 252 VHVNGKAVPLNKTLLSIDNE--GVGGTKISTVNPYTVLETSIYKALVQAFA--SAMPKVA 307
+ VNG+AVPLN TLL+ID + GVGGTK+STV PYTVLE+SIYKA+ AFA +AM A
Sbjct: 280 IKVNGRAVPLNATLLAIDRKRGGVGGTKLSTVAPYTVLESSIYKAVTDAFAAETAMIPRA 339
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF 367
PV PF C+ +G TR+GP VP I+LVL ++ W + G NSMV G ALCLG
Sbjct: 340 PAPPVPPFKLCYDGSKVGSTRVGPAVPTIELVLGDEATSWVVFGANSMVATQGGALCLGV 399
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
VDGG PRTS+VIG +E+NLLQFDL SRLGFS SLLF + C
Sbjct: 400 VDGGKAPRTSVVIGGHMMEDNLLQFDLEASRLGFSSSLLFRQTNC 444
>gi|316927702|gb|ADU58604.1| xyloglucan-specific endoglucanase inhibitor 8 [Solanum tuberosum]
Length = 437
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 186/405 (45%), Positives = 244/405 (60%), Gaps = 33/405 (8%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSA 90
L L + KD +TLQY+T++ QRTPL+P+KL VHLGG LWVDC+KGY SST K A C +
Sbjct: 36 TLYLPITKDASTLQYITEVGQRTPLIPIKLLVHLGGRSLWVDCDKGYKSSTYKPAVCNAT 95
Query: 91 QCNLANAKACGGGI-------------CGAGPDNPISNTGT-HGDIRIDVLSIQSTDGRN 136
C+ A + ACG I C +NP+ N+ +I D+L+I ST
Sbjct: 96 LCSFAKSHACGDCIFKPQLQPGCNNNTCYIWGENPLINSYMDRAEIAEDILAIGST---- 151
Query: 137 PGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGR-SKVALPSQLAAAFSLKRKFALCLS 195
PG +T FIF C ++ + LA+GV GIAG G S +++P+QLA +L +KF LCLS
Sbjct: 152 PGVRITWQRFIFTCVESYLSRRLANGVTGIAGFGHESPLSIPNQLALDPTLNKKFGLCLS 211
Query: 196 P--FDDGAIVFGDGPYYDLN--NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTS 251
G I G GPYY N ++SK+L YT KV T GFL S EY+I V+S
Sbjct: 212 SSTRSRGVIFIGSGPYYVYNPKKINISKDLVYT-----KVITNRGFLL--SEEYYIQVSS 264
Query: 252 VHVNGKAVPLNKTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV- 309
+ + G+ VPLN+TLLSI+ N GVGGTKIS+ P+T+L TSIY A+ AF A+PK A +
Sbjct: 265 IRIAGQDVPLNRTLLSINKNNGVGGTKISSTIPFTILHTSIYDAVKIAFIKALPKNATLI 324
Query: 310 -APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV 368
P+ FG CF ++I T IGP VP ID VL + W I+G NS+VQ+ D +CL FV
Sbjct: 325 EPPMKRFGVCFSSKNIRHTNIGPDVPVIDFVLHKPSAFWRIYGVNSVVQVKKDVMCLAFV 384
Query: 369 DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
SIVIG QLE NLL FDL ++GFS SL ++ +C+
Sbjct: 385 GRDQTWEPSIVIGGYQLEENLLVFDLPRKKIGFSSSLKLQQTSCS 429
>gi|384111000|gb|AFH67006.1| xyloglucan-specific endo-beta-1,4-glucanase inhibitor [Capsicum
annuum]
Length = 430
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 189/407 (46%), Positives = 242/407 (59%), Gaps = 32/407 (7%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
+ L + V KDT TL+Y+T++ QRTPLVP+KL VHLGG L VDC+KGY SST K+A C S
Sbjct: 25 EVLYIPVTKDTTTLRYITEVGQRTPLVPIKLLVHLGGRSLLVDCDKGYKSSTYKSAVCNS 84
Query: 90 AQCNLANAKACGGGI-------------CGAGPDNPISNT-GTHGDIRIDVLSIQSTDGR 135
QC+ A + ACG I C +NP+ N+ +I DVL+I ST
Sbjct: 85 TQCSFAKSHACGDCIFKSQLQPGCNNNTCYIWGENPLINSFHDRAEIAEDVLTIGST--- 141
Query: 136 NPGRAVTVPNFIFLC-GSEFVLQGLASGVVGIAGLGR-SKVALPSQLAAAFSLKRKFALC 193
PG VT FIF C + +++ LA GV GIAG GR S ++LP+QLA RKF LC
Sbjct: 142 -PGVHVTWSRFIFTCLLDQDMMRLLAKGVTGIAGFGRESPISLPNQLALDPRFTRKFGLC 200
Query: 194 LSP--FDDGAIVFGDGPY--YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
LS G I G GPY Y+ D+SK+L YT L NK GF+ S EY+I V
Sbjct: 201 LSSSTRSRGVIFIGSGPYNIYNPKKIDISKDLVYTKLIANK---RGGFVA--SEEYYIQV 255
Query: 250 TSVHVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
+S+ V GK VPLNKTLLSI+ + GVGGT+IST P+T+L TSIY A AF A+PK
Sbjct: 256 SSIRVAGKDVPLNKTLLSINKKNGVGGTRISTATPFTILHTSIYDAFKTAFIKALPKNVT 315
Query: 309 VA--PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
+ P+ FG CF ++I T GP +P ID+VL + W I+G NS+VQ+ D +CL
Sbjct: 316 LVDPPIKQFGVCFSSKNIKSTNTGPDLPVIDVVLHKPSAFWRIYGTNSVVQVNKDVMCLA 375
Query: 367 FVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
FV SIVIG Q+E NLL FDL +GFS SL ++ +C+
Sbjct: 376 FVGQDQTWEPSIVIGGHQMEENLLVFDLPGKNIGFSSSLKLQQTSCS 422
>gi|334262925|gb|AEG74550.1| xyloglucan specific endoglucanase inhibitor [Solanum tuberosum]
Length = 435
Score = 310 bits (795), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 184/408 (45%), Positives = 236/408 (57%), Gaps = 35/408 (8%)
Query: 29 PKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCG 88
P L+L V KD++TLQY+T I QRTPL+PV TVHLG L VDCE GY SST K C
Sbjct: 34 PTTLILPVTKDSSTLQYITVIGQRTPLIPVNFTVHLGSASLLVDCESGYNSSTYKPVLCE 93
Query: 89 SAQCNLANAK--ACGGGICGAGPD-------------NPISNTGTHGDIRIDVLSIQSTD 133
S QC+ A A+ ACG +C + P NPI +T T ++ DVL+I
Sbjct: 94 SKQCSFAKAEYNACGS-LCLSKPRPGCNNNTCHTLDGNPIIDTYTGAELAEDVLAI---- 148
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGR-SKVALPSQLAAAFS-LKRKFA 191
G P V+ P FIF C F++ LA GV GIAG G S +++P+QLA+ S KF
Sbjct: 149 GSKPVVLVSQPKFIFTCIRSFIMTHLAKGVTGIAGFGHNSTISIPNQLASLDSRFTNKFG 208
Query: 192 LCLSPF---DDGAIVFGDGPYYDLNNF-DVSKNLKYTPLFINKVNTASGFLGEPSVEYFI 247
+CLS G I G PYY N D+SKNL YTPL N F E EY +
Sbjct: 209 ICLSSSTTRSSGVIFIGSTPYYVYNPMIDISKNLVYTPLVKNT------FTTEKFSEYHV 262
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
V+S+ + GK VPLNKTLLSI +G+GGT+IST P+T+L T+IY A+ AF +A+PK
Sbjct: 263 KVSSIRIAGKNVPLNKTLLSIK-QGLGGTRISTTTPFTILHTTIYDAVKTAFINALPKNV 321
Query: 308 RVA--PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
+ P FG CF ++I T +GP VP ID+V K+ W I+G NS+VQ+ D +CL
Sbjct: 322 TIVEPPTKQFGLCFSSKNIRNTNVGPDVPVIDIVFHKKSAFWRIYGTNSVVQVNKDVMCL 381
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
FV SI IG QLE NLL FDL ++GFS SL ++ +C+
Sbjct: 382 AFVGQDQTRAPSIEIGGYQLEENLLLFDLIEKKIGFSSSLKLQQTSCS 429
>gi|331271603|gb|AED02502.1| xyloglucan-specific endoglucanase inhibitor [Solanum tuberosum]
Length = 435
Score = 310 bits (794), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 184/408 (45%), Positives = 236/408 (57%), Gaps = 35/408 (8%)
Query: 29 PKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCG 88
P L+L V KD++TLQY+T I QRTPL+PV TVHLG L VDCE GY SST K C
Sbjct: 34 PTTLILPVTKDSSTLQYITVIGQRTPLIPVNFTVHLGSASLLVDCESGYNSSTCKPVLCE 93
Query: 89 SAQCNLANAK--ACGGGICGAGPD-------------NPISNTGTHGDIRIDVLSIQSTD 133
S QC+ A A+ ACG +C + P NPI +T T ++ DVL+I
Sbjct: 94 SKQCSFAKAEYNACGS-LCLSKPRPGCNNNTCHTLDGNPIIDTYTGAELAEDVLAI---- 148
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGR-SKVALPSQLAAAFS-LKRKFA 191
G P V+ P FIF C F++ LA GV GIAG G S +++P+QLA+ S KF
Sbjct: 149 GSKPVVLVSQPKFIFTCIRSFIMTHLAKGVTGIAGFGHNSTISIPNQLASLDSRFTNKFG 208
Query: 192 LCLSPF---DDGAIVFGDGPYYDLNNF-DVSKNLKYTPLFINKVNTASGFLGEPSVEYFI 247
+CLS G I G PYY N D+SKNL YTPL N F E EY +
Sbjct: 209 ICLSSSTTRSSGVIFIGSTPYYVYNPMIDISKNLVYTPLVKNT------FTTEKFSEYHV 262
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
V+S+ + GK VPLNKTLLSI +G+GGT+IST P+T+L T+IY A+ AF +A+PK
Sbjct: 263 KVSSIRIAGKNVPLNKTLLSIK-QGLGGTRISTTTPFTILHTTIYDAVKTAFINALPKNV 321
Query: 308 RVA--PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
+ P FG CF ++I T +GP VP ID+V K+ W I+G NS+VQ+ D +CL
Sbjct: 322 TIVEPPTKQFGLCFSSKNIRNTNVGPDVPVIDIVFHKKSAFWRIYGTNSVVQVNKDVMCL 381
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
FV SI IG QLE NLL FDL ++GFS SL ++ +C+
Sbjct: 382 AFVGQDQTRAPSIEIGGYQLEENLLLFDLIEKKIGFSSSLKLQQTSCS 429
>gi|316927700|gb|ADU58603.1| xyloglucan-specific endoglucanase inhibitor 1 [Solanum tuberosum]
Length = 430
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 179/407 (43%), Positives = 240/407 (58%), Gaps = 32/407 (7%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
+ L + V KD +TL+Y+ ++ QRTPL+P+KL ++LGG LWVDC+KGY SST K A C S
Sbjct: 25 EVLYIPVTKDASTLEYIIEVGQRTPLIPIKLLINLGGRSLWVDCDKGYKSSTYKPAVCNS 84
Query: 90 AQCNLANAKACGGGI-------------CGAGPDNPISNT-GTHGDIRIDVLSIQSTDGR 135
QC A + ACG I C +NP+ N+ +I DVL+I ST
Sbjct: 85 TQCTFAKSHACGDCIFKPQVQPGCSNNTCYIWGENPLINSFHDRAEIAEDVLAIGST--- 141
Query: 136 NPGRAVTVPNFIFLCG-SEFVLQGLASGVVGIAGLGR-SKVALPSQLAAAFSLKRKFALC 193
PG VT P FIF C + +++ A+GV G+AG GR S V++P+QLA +KF +C
Sbjct: 142 -PGVRVTWPRFIFSCLLDQDMMRQFANGVTGVAGFGRESPVSIPNQLALDSRFTKKFGIC 200
Query: 194 LSPFDD--GAIVFGDGPYYDLN--NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
LS G I G GPYY N D+S ++ YT L NT GF+ S EY+I V
Sbjct: 201 LSSSTQSRGVIFIGSGPYYVYNPKKIDISNDILYTKLI---ANTRGGFVT--SEEYYIQV 255
Query: 250 TSVHVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
+S+ + G+ VPLNKTLLSI+ + GV GT+IST P+T+L T+IY A AF A+PK
Sbjct: 256 SSIRIAGQDVPLNKTLLSINKKNGVAGTRISTATPFTILHTTIYDAFKTAFIKALPKNVT 315
Query: 309 VA--PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
+ P+ FG CF ++I T +GP VP ID VL + W I+G NS+VQ+ D +CL
Sbjct: 316 IVEPPMKQFGLCFSSKNIKSTNVGPDVPVIDFVLHKPSAFWRIYGTNSVVQVNKDVMCLA 375
Query: 367 FVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
FV SIVIG Q+E NLL FDL +GFS SL ++A+C+
Sbjct: 376 FVGRDQTWEPSIVIGGHQMEENLLVFDLVRRNIGFSSSLKLQQASCS 422
>gi|323435816|gb|ADX66725.1| xyloglucan-specific endoglucanase inhibitor protein 2 [Solanum
tuberosum]
Length = 429
Score = 305 bits (780), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 178/407 (43%), Positives = 239/407 (58%), Gaps = 32/407 (7%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
+ L + V KD +TL+Y+ ++ QRTPL+P+KL ++LGG LWVDC+KGY SST K A C S
Sbjct: 24 EVLYIPVTKDASTLEYIIEVGQRTPLIPIKLLINLGGRSLWVDCDKGYKSSTYKPAVCNS 83
Query: 90 AQCNLANAKACGGGI-------------CGAGPDNPISNT-GTHGDIRIDVLSIQSTDGR 135
QC A + ACG I C +NP+ N+ +I DVL+I ST
Sbjct: 84 TQCTFAKSHACGDCIFKPQVQPGCSNNTCYIWGENPLINSFHDRAEIAEDVLAIGST--- 140
Query: 136 NPGRAVTVPNFIFLCG-SEFVLQGLASGVVGIAGLGR-SKVALPSQLAAAFSLKRKFALC 193
PG VT P FIF C + +++ A+GV G+AG GR S ++P+QLA +KF +C
Sbjct: 141 -PGVRVTWPRFIFSCLLDQDMVRQFANGVTGVAGFGRESPGSIPNQLALDSRFTKKFGIC 199
Query: 194 LSPFDD--GAIVFGDGPYYDLN--NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
LS G I G GPYY N D+S ++ YT L NT GF+ S EY+I V
Sbjct: 200 LSSSTQSRGVIFIGSGPYYVYNPKKIDISNDILYTKLI---ANTRGGFVT--SEEYYIQV 254
Query: 250 TSVHVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
+S+ + G+ VPLNKTLLSI+ + GV GT+IST P+T+L T+IY A AF A+PK
Sbjct: 255 SSIRIAGQDVPLNKTLLSINKKNGVAGTRISTATPFTILHTTIYDAFKTAFIKALPKNVT 314
Query: 309 VA--PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
+ P+ FG CF ++I T +GP VP ID VL + W I+G NS+VQ+ D +CL
Sbjct: 315 IVEPPMKQFGLCFSSKNIKSTNVGPDVPVIDFVLHKPSAFWRIYGTNSVVQVNKDVMCLA 374
Query: 367 FVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
FV SIVIG Q+E NLL FDL +GFS SL ++A+C+
Sbjct: 375 FVGRDQTWEPSIVIGGHQMEENLLVFDLVRRNIGFSSSLKLQQASCS 421
>gi|388493426|gb|AFK34779.1| unknown [Medicago truncatula]
Length = 454
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 183/419 (43%), Positives = 236/419 (56%), Gaps = 35/419 (8%)
Query: 28 RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARC 87
+P VL +AKD TLQY T IK TP VP+ L + + LW +C+ Y S+T +C
Sbjct: 33 KPNTFVLPIAKDPKTLQYSTSIKLGTPAVPLDLVIDIRERFLWFECDDSYNSTTYNPIQC 92
Query: 88 GSAQC------------NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQ--STD 133
G+ +C N + C CG P NP GD+ D+LS ++D
Sbjct: 93 GTKKCKQARGTGCIDCTNHPSKTGCTNNTCGVEPFNPFGGFFVSGDVGEDILSFPRVTSD 152
Query: 134 GRNPGRAVTVPNFIFLC------GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
GR V VP FI C G + L+GL+ G G+ GL R+ ++LP+Q+A F L
Sbjct: 153 GRRVTN-VRVPRFISSCVYPDKFGVQGFLEGLSKGKKGVLGLARTLISLPTQIATRFKLD 211
Query: 188 RKFALCLSP------FDDGAIVFGDGPY-YDLNNFDVSKNLKYTPLFINKVNTASGFLGE 240
RKF LCL G++ G GPY N D SK LKYTPL N+ +T F
Sbjct: 212 RKFTLCLPSTSQKNGLGPGSLFVGGGPYNLGSNKDDASKFLKYTPLITNRRSTGPIFDNF 271
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF- 299
PS EYFI V S+ V+ V N TLLSI+ G GGTK+STV P+T L TSIY L+ AF
Sbjct: 272 PSTEYFIKVKSIKVDNNVVNFNTTLLSINKLGEGGTKLSTVIPHTTLHTSIYNPLLNAFV 331
Query: 300 -ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI 358
+ + K+ RV VAPFGACF + I + GP VP IDLVL+ V W I G NSMV++
Sbjct: 332 KKAEIRKIKRVKAVAPFGACFDSRTISKSVNGPNVPTIDLVLKG-GVEWRIFGANSMVKV 390
Query: 359 GGDALCLGFVDGG---VNPR-TSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
+ LCLGFVD G V P TSI+IG QLE+NL++FDL +S+LGFS SLL +A+C+
Sbjct: 391 NENVLCLGFVDAGSEEVGPSATSIIIGGHQLEDNLVEFDLVSSKLGFSSSLLLNKASCS 449
>gi|358347314|ref|XP_003637703.1| Basic 7S globulin [Medicago truncatula]
gi|355503638|gb|AES84841.1| Basic 7S globulin [Medicago truncatula]
Length = 454
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 183/419 (43%), Positives = 235/419 (56%), Gaps = 35/419 (8%)
Query: 28 RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARC 87
+P VL +AKD TLQY T IK TP VP+ L + + LW +C+ Y S+T +C
Sbjct: 33 KPNTFVLPIAKDPKTLQYSTSIKLGTPAVPLDLVIDIRERFLWFECDDSYNSTTYNPIQC 92
Query: 88 GSAQCNLANAKAC------------GGGICGAGPDNPISNTGTHGDIRIDVLSIQ--STD 133
G+ +C A C CG P NP GD+ D+LS ++D
Sbjct: 93 GTKKCKQARGTGCIDCTNHPFKTGCTNNTCGVEPFNPFGGFFVSGDVGEDILSFPRVTSD 152
Query: 134 GRNPGRAVTVPNFIFLC------GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
GR V VP FI C G + L+GL+ G G+ GL R+ ++LP+Q+A F L
Sbjct: 153 GRRVTN-VRVPRFISSCVYPDKFGVQGFLEGLSKGKKGVLGLARTLISLPTQIATRFKLD 211
Query: 188 RKFALCLSP------FDDGAIVFGDGPY-YDLNNFDVSKNLKYTPLFINKVNTASGFLGE 240
RKF LCL G++ G GPY N D SK LKYTPL N+ +T F
Sbjct: 212 RKFTLCLPSTSQKNGLGPGSLFVGGGPYNLGSNKDDASKFLKYTPLITNRRSTGPIFDNF 271
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF- 299
PS EYFI V S+ V+ V N TLLSI+ G GGTK+STV P+T L TSIY L+ AF
Sbjct: 272 PSTEYFIKVKSIKVDNNVVNFNTTLLSINKLGEGGTKLSTVIPHTTLHTSIYNPLLNAFV 331
Query: 300 -ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI 358
+ + K+ RV VAPFGACF + I + GP VP IDLVL+ V W I G NSMV++
Sbjct: 332 KKAEIRKIKRVKAVAPFGACFDSRTISKSVNGPNVPTIDLVLKG-GVEWRIFGANSMVKV 390
Query: 359 GGDALCLGFVDGG---VNPR-TSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
+ LCLGFVD G V P TSI+IG QLE+NL++FDL +S+LGFS SLL +A+C+
Sbjct: 391 NENVLCLGFVDAGSEEVGPSATSIIIGGHQLEDNLVEFDLVSSKLGFSSSLLLNKASCS 449
>gi|255552235|ref|XP_002517162.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543797|gb|EEF45325.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 411
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 177/402 (44%), Positives = 240/402 (59%), Gaps = 23/402 (5%)
Query: 21 ITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG-YVS 79
+ AQ S P + L V KD +TLQY+T+I +P L + L G LW+DC VS
Sbjct: 18 VHAQISSSPDSFHLPVTKDLSTLQYITRINHGALQIPTNLVIDLDGAHLWLDCASSEQVS 77
Query: 80 STNKTARCGSAQCNLAN--AKACGG-GICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRN 136
S+ + S QC++A K+C C N I G++ DVL+I S DG N
Sbjct: 78 SSLRLIPSCSIQCSMAKPGHKSCNHHSSCDIFTQNGIIQLVKTGELVEDVLAIPSVDGSN 137
Query: 137 PGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP 196
G + NFI C +L GLASG G+ GLGRSK+AL SQLAA F RKFA CLS
Sbjct: 138 SGTNFEIENFILACAPATLLDGLASGAQGMLGLGRSKIALQSQLAARFDFHRKFATCLSS 197
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
+G I+FG+ +++ ++ ++L Y+PL V G S+EYFI V S+ +NG
Sbjct: 198 -SNGVILFGNVGSDSISDPEILRSLSYSPL----VTKPDG----SSLEYFIEVRSIKING 248
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV--ARVAPVAP 314
K + L + EG+G TKIST+ PYT LE+SIY+ ++A+ A + RVA VAP
Sbjct: 249 KKLALGQ-------EGIGFTKISTIVPYTTLESSIYETFIKAYLKAANSMNLIRVASVAP 301
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
FG CF + I + +GP VP IDLVLQ++ V W +HG NSMV++ +A+CLGF+DGG++P
Sbjct: 302 FGLCFSSKGIERSILGPNVPAIDLVLQSEMVKWRLHGGNSMVEVNDEAMCLGFLDGGLDP 361
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNF 416
+ SIVIG QLE+ LL+FDL TS LGFS LL + +C+ NF
Sbjct: 362 KNSIVIGGLQLEDTLLEFDLGTSMLGFSLPLLQRQTSCS-NF 402
>gi|225432540|ref|XP_002280508.1| PREDICTED: basic 7S globulin-like [Vitis vinifera]
Length = 388
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 169/389 (43%), Positives = 234/389 (60%), Gaps = 40/389 (10%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSA 90
+L+L V KD ATLQYVTQI TPLVP+KL + LG LW+DC G+VSS+N CGS
Sbjct: 22 SLLLPVTKDAATLQYVTQIHHGTPLVPIKLVLDLGAPFLWLDCSSGHVSSSNTPILCGSI 81
Query: 91 QCNLANAKACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
QC A G G C P N I+ G++ D+++++ ++ + F
Sbjct: 82 QCLTAKTSDSGHGGGTSTCRLSPKNTITGLAEAGELAEDMVAVEGSEMGS--------RF 133
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGD 206
+F C + +L+GLASG VG+ GLGR+++ALPSQLAA+ L RKFA+CLS +G + +
Sbjct: 134 LFSCAPKPLLKGLASGTVGMLGLGRTRIALPSQLAASVGLHRKFAVCLSS-SEGTVFLEN 192
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
++ DVSK+L YTPL + + G YFI V S+ +NG+ V L
Sbjct: 193 ----EIAGTDVSKSLMYTPLLPGQDPNSEG--------YFISVKSIRINGRGVSLGTI-- 238
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKVARVAPVAPFGACFRLQDI 324
GGT++STV PYT ++ S+Y +A+ A+A + RV +APFG CFR +
Sbjct: 239 ------TGGTRLSTVVPYTTMKRSVYDIFTKAYIKAAASMNITRVESMAPFGVCFRSESS 292
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQ 384
P VP IDLVLQ++ V W I G+NSMV++ +CLGF+DGGV+P T+IVIG Q
Sbjct: 293 E-----PAVPTIDLVLQSEMVKWRILGRNSMVRVSDKVMCLGFLDGGVDPGTAIVIGGHQ 347
Query: 385 LENNLLQFDLATSRLGFSDSLLFERATCT 413
LE+NLL+FDL+TS LGFS SL ++C+
Sbjct: 348 LEDNLLEFDLSTSMLGFSSSLSTRESSCS 376
>gi|224100331|ref|XP_002311834.1| predicted protein [Populus trichocarpa]
gi|222851654|gb|EEE89201.1| predicted protein [Populus trichocarpa]
Length = 437
Score = 288 bits (738), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 180/436 (41%), Positives = 257/436 (58%), Gaps = 37/436 (8%)
Query: 1 MSVAQSFLLLCSLL-IFSLSPITAQTSFRP-KALVLRVAKDTATLQYVTQIKQRTPLVPV 58
M+ Q LL CS + IF+++P AQ P ++LVL V KD AT Q++T I T P+
Sbjct: 2 MACIQFLLLFCSFMYIFNVNPSCAQIPNSPIRSLVLPVTKDPATFQFLTTIYHGTSREPI 61
Query: 59 KLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA--NAKACGGGI---------CGA 107
++ + LG LW+DC G +SS+ + S QC A N +C CG
Sbjct: 62 RVVLDLGCPSLWLDCSSGRLSSSRRLIPSCSIQCAAAKPNNMSCAFSAAMPTRKRTACGL 121
Query: 108 GPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIA 167
+N I+ + T G++ D+L+++S DG G TV +F+F C F+L LA G G+
Sbjct: 122 STENSIARSATRGELVEDILTVESVDGSKAGPVTTVDHFLFSCAPRFLLNRLARGAQGML 181
Query: 168 GLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYD-LNNFDVSKNLKYTPL 226
GLG+S++ALPSQLA+ F L+RKFA CLS DG I+FG P YD + ++S++L YTPL
Sbjct: 182 GLGKSRIALPSQLASKFGLQRKFATCLSS-SDGLILFGHEPGYDSIFGTEISRSLMYTPL 240
Query: 227 FINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGV-GGTKISTVNPYT 285
+ + S S +Y I V S+ +NGK LS+ +G+ GGTKIST PYT
Sbjct: 241 VTSPDGSGS------SQDYSINVKSIKINGKR-------LSLRQKGIGGGTKISTTVPYT 287
Query: 286 VLETSIYKALVQAFASAMP-----KVARVAPVAPFGACFRLQDIGFTR-IGPVVPQIDLV 339
LE+SIY ++A+ + + VAPVAPFG CF +++ + +GP+VP IDLV
Sbjct: 288 TLESSIYSTFIKAYKESATNNYFLNMTVVAPVAPFGLCFSSKEVPSSMLLGPMVPVIDLV 347
Query: 340 LQNKNVVWSIHGQNSMVQIGGDALCLGFVD--GGVNPRTSIVIGARQLENNLLQFDLATS 397
LQ++ V W +HG+N+MV + + +CLGF+D +SIVIG QLE+NLL+F+L TS
Sbjct: 348 LQSEMVKWRVHGRNAMVPVLDEVMCLGFLDGGSKSKTSSSIVIGGFQLEDNLLEFNLGTS 407
Query: 398 RLGFSDSLLFERATCT 413
LGFS SLL +C+
Sbjct: 408 MLGFSSSLLTRHTSCS 423
>gi|356503531|ref|XP_003520561.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 427
Score = 287 bits (734), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 161/392 (41%), Positives = 231/392 (58%), Gaps = 14/392 (3%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ 91
L+ ++KD T Y + +TPL P KL +HLG ++ WV C+ Y SS++ C +
Sbjct: 28 LIAPISKDDTTQLYTLSVFLKTPLQPTKLHLHLGSSLSWVLCDSTYTSSSSHHIPCNTPL 87
Query: 92 CNLANAKACG--GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
CN + AC +C P+NP++ ID L++ + D + V + +FIF
Sbjct: 88 CNSFPSNACSNNSSLCALFPENPVTRNTLLDTALIDSLALPTYDASS--SLVLISDFIFS 145
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDGAIVFGDG 207
C + +LQGLA+ +G+A LGRS +LP+Q++ + + R F LCL S + GA +F
Sbjct: 146 CATAHLLQGLAANALGLASLGRSNYSLPAQISTSLTSPRSFTLCLPASSANTGAAIFAST 205
Query: 208 PYYDLNNFDVSKNLKYTPLFINKV-NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
L F +L YT L +N V +T +PS EYFI +TS+ +NGK + +N ++L
Sbjct: 206 ASSFL--FSSKIDLTYTQLIVNPVADTVVTDNPQPSDEYFINLTSIKINGKPLYINSSIL 263
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFA--SAMPKVARVAPVAPFGACFRLQDI 324
++D G GGTKIST PYTVLETSIY+ VQ F S+ + V PFG C+ D+
Sbjct: 264 TVDQTGFGGTKISTAEPYTVLETSIYRLFVQRFVNESSAFNLTVTEAVEPFGVCYPAGDL 323
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---GGDALCLGFVDGGVNPRTSIVIG 381
TR+GP VP +DLV+ +++V W I G NSMV++ G D CLGFVDGG RT IVIG
Sbjct: 324 TETRVGPAVPTVDLVMHSEDVFWRIFGGNSMVRVAKGGVDVWCLGFVDGGTRGRTPIVIG 383
Query: 382 ARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
QLE+NL+QFDL ++R GF+ +LL + A C+
Sbjct: 384 GHQLEDNLMQFDLDSNRFGFTSTLLLQDAKCS 415
>gi|148907857|gb|ABR17052.1| unknown [Picea sitchensis]
Length = 422
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 181/433 (41%), Positives = 242/433 (55%), Gaps = 26/433 (6%)
Query: 3 VAQSFLLLCSLLIFSLSPITAQTS-FRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLT 61
+A L+L + +I + A S RP+ALV ++ +D A+ +Y +I+QRTPL +L
Sbjct: 1 MASLHLVLITAVIICFCHLPANASQRRPRALVTQITQDPASQRYTVEIRQRTPLRIQRLV 60
Query: 62 VHLGGNILWVDCE-KGYVSSTNKTARCGSAQCNLANAKACGG--GICGAGPDNPISNTGT 118
+ + + +WV C+ K Y+SST C + C CG G G G +N
Sbjct: 61 LDIEEDYMWVRCDNKSYISSTYSPLGCSAQLCKSYQYSGCGTCYGSRGPGCNNNTCVVAV 120
Query: 119 HG----DIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKV 174
G ++ DVL + S+DG NPG P F C + + SG VG+AG+ S +
Sbjct: 121 QGSRSVELAQDVLVLPSSDGSNPGPLARFPQLAFAC--DLSSNRVISGTVGVAGMTSSTL 178
Query: 175 ALPSQLAAAFSLKRKFALCL-SPFDDGAIVFGDGPYYDLN--NFDVSKNLKYTPLFINKV 231
ALPSQL+AA RKFA+CL S GA+ FGD P L D+S + TPL N V
Sbjct: 179 ALPSQLSAAEGFSRKFAMCLPSGNAPGALFFGDEPLVFLPPPGRDLSSQIIRTPLIKNSV 238
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
T +++GV + V G V ++ L D +G GGTK+STV YT L + I
Sbjct: 239 YTDV---------FYLGVQRIEVGGVNVAIDAEKLRFDKDGRGGTKLSTVVRYTQLASPI 289
Query: 292 YKALVQAFASAMPK--VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWS 348
Y +L F S K + RVA V+PFGACF +G TR+GP VP ID+VLQ N W
Sbjct: 290 YNSLEGVFTSVAKKMNITRVASVSPFGACFDSSGVGSTRVGPAVPTIDIVLQGNSTTTWR 349
Query: 349 IHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFE 408
I G NSMV++ LCLGFVDGG N + SIVIG Q+++NLLQFDLATS LGFS SLLF
Sbjct: 350 IFGANSMVRVNNKVLCLGFVDGGDNLQQSIVIGTYQMQDNLLQFDLATSTLGFSSSLLFG 409
Query: 409 RATCT-FNFTSIA 420
+ TC+ FNFT+ +
Sbjct: 410 QTTCSNFNFTATS 422
>gi|354508535|gb|AER26945.1| xyloglucan-specific endoglucanase inhibitor 9 [Solanum tuberosum]
Length = 438
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 166/413 (40%), Positives = 234/413 (56%), Gaps = 39/413 (9%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTN--KTARC 87
+ L + V KD +TLQY+ ++ Q+TPL+P KL +HLGG LWVDC +++ K+A C
Sbjct: 24 EVLYIPVTKDASTLQYIIEVGQKTPLIPTKLLLHLGGKSLWVDCTNSTTHTSSTYKSAVC 83
Query: 88 GSAQCNLANAKACG-------------GGICGAGPDNPISNTGTHG-DIRIDVLSIQSTD 133
S +C++A + CG C +NPI G ++ DVL+I
Sbjct: 84 NSTECSMAKSYGCGDCKFRSELQPGCNNNTCYIWSENPIKKMYHDGSEVAEDVLTI---- 139
Query: 134 GRNPGRAVTVPNFIFLCGSE-FVLQGLASGVVGIAGLGRSK-VALPSQLAAAFSLKRKFA 191
G +PG VT P FIF C + ++L+ LA+GV G+AG G++ + +P+QL + RKF
Sbjct: 140 GSSPGVLVTSPRFIFTCLIDPYMLEKLANGVTGVAGFGQTTPITIPNQLGSDPRFSRKFG 199
Query: 192 LCLSP--FDDGAIVFGDGPYYDLN--NFDVS--KNLKYTPLFINKVNTASGFLGEPSVEY 245
+CLS G I G PYY N D+S K+L YT L +NK GFL + EY
Sbjct: 200 MCLSSSTTSRGVIFIGPTPYYVYNPKKIDISNSKDLAYTKLLVNK----RGFLL--TDEY 253
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNE--GVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
+ ++S+ V G+ PLNKTLL I+ + G GT IST PYT+L T+ Y ++ AF +A+
Sbjct: 254 YFQMSSIRVAGQDAPLNKTLLIINKKRHGTDGTSISTAIPYTILHTTFYDSVKTAFTNAL 313
Query: 304 PK---VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
PK + PV+PF CF ++I T +GP VP ID+V +V W I G NSM+Q+
Sbjct: 314 PKNVTIVEPPPVSPFATCFSSENIKNTNVGPDVPPIDIVFYKPSVFWRISGANSMIQVSK 373
Query: 361 DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
D +CL FV SIVIG QLE NLL FDL ++GFS SL ++ +C+
Sbjct: 374 DVMCLAFVRQDQTWLPSIVIGGYQLEENLLVFDLPGRKIGFSSSLKLKQTSCS 426
>gi|118487589|gb|ABK95620.1| unknown [Populus trichocarpa]
Length = 450
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 193/441 (43%), Positives = 239/441 (54%), Gaps = 40/441 (9%)
Query: 13 LLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVD 72
+L+F S I++Q + L + KD +T QYV +TPL P KL + LG WV+
Sbjct: 12 ILLFYSSFISSQAAL---PLQTPIQKDHSTSQYVITAYLQTPLKPTKLLLDLGATYTWVN 68
Query: 73 CEKGYVSSTNKTARCGSAQCNLANAKAC-------GGGICGAG-----PDNPISNTGTHG 120
C+ GY SST + C S+ NL A AC G CG PDNPI
Sbjct: 69 CD-GYTSSTYQHVPCNSSIANLLGAYACLDLCDGPPGPNCGNNSFLLFPDNPIKPVDYKK 127
Query: 121 DIRIDVLSIQS-TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQ 179
I++ I S G + NFIF C L+GLA GV G+A LGRS V++P Q
Sbjct: 128 VKGINIALIDSFALSTTQGSLTLINNFIFSCARTGFLKGLAKGVAGLAALGRSNVSIPVQ 187
Query: 180 LAAAFSLKRK-FALCLS--PFDDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTAS 235
FS FA+CLS G +FG GPY L D+SK+L YTPL N S
Sbjct: 188 FNRFFSSSPNCFAICLSGSKSQPGVALFGSKGPYDFLPGIDLSKSLLYTPLISNPFGKDS 247
Query: 236 GFLGEP--SVEYFIGVTSVHVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIY 292
+P S EY+IG+ S+ VNGK V LNK+LL+ID E G GGT ISTV PYT L+ SIY
Sbjct: 248 D-PDKPRSSPEYYIGLNSIKVNGKMVALNKSLLAIDGETGPGGTTISTVVPYTKLQRSIY 306
Query: 293 KALVQAF----ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWS 348
K + AF AS + PV PFG C+ + T++GP VP IDLVL ++VVW
Sbjct: 307 KTFILAFLKEAASPAFNLTATKPVKPFGVCYPASAVKNTQMGPAVPIIDLVLDRQDVVWK 366
Query: 349 IHGQNSMVQIGG---DALCLGFVDGGVNPRT-------SIVIGARQLENNLLQFDLATSR 398
I G NSMV+I D CLGFVD GVNP SIVIG QLE+N+LQFDL + R
Sbjct: 367 IFGSNSMVRITKKSVDLWCLGFVDAGVNPMVASWIGGPSIVIGGHQLEDNMLQFDLQSKR 426
Query: 399 LGFSDSLLFERATC-TFNFTS 418
LGFS SLL + C F F++
Sbjct: 427 LGFSSSLLSKGTNCANFKFST 447
>gi|357512051|ref|XP_003626314.1| Basic 7S globulin [Medicago truncatula]
gi|87240526|gb|ABD32384.1| Peptidase A1, pepsin [Medicago truncatula]
gi|355501329|gb|AES82532.1| Basic 7S globulin [Medicago truncatula]
Length = 437
Score = 278 bits (711), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 175/443 (39%), Positives = 244/443 (55%), Gaps = 54/443 (12%)
Query: 9 LLCS-LLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGN 67
L CS LL+ S ++ +P LVL V +D +T + I +RTPL+ V + + L G
Sbjct: 14 LFCSFLLVSSRHQQQPNSNPKPNLLVLPVQQDASTGLHWANIHKRTPLMQVPVLLDLNGK 73
Query: 68 ILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGPDNPISNT 116
LWV+CE+ Y SST + C S QC+ ANA C CG NP++
Sbjct: 74 HLWVNCEQHYASSTYQAPYCHSTQCSRANAHTCHTCVSSFRPGCHNNTCGLMSANPVTQQ 133
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQ-GLASGVVGIAGLGRSKVA 175
G++ DVL+I + +G PG VT+P F+F C F+ Q GL + V G+ GL S ++
Sbjct: 134 TAMGELAQDVLAIYAINGPKPGPMVTIPQFLFSCAPSFLAQKGLPNNVQGVVGLAHSPIS 193
Query: 176 LPSQLAAAFSLKRKFALCLS--PFDDGAIVFGDGPYYDLNNFDVSK-------------- 219
L +QL++ F LKR+F +CLS P +GAI+FGD P NN +
Sbjct: 194 LQNQLSSHFGLKRQFTMCLSRHPNSNGAILFGDAP----NNMHFGQGNNYNNKNNPNLFN 249
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV-PLNKTLLSIDNEGV-GGTK 277
NL YTPL I T G EY I VTS+ +N V P++ +LS EGV GGT
Sbjct: 250 NLVYTPLTI----TQQG-------EYRIHVTSIRLNQHTVVPVSAPMLSSYPEGVMGGTL 298
Query: 278 ISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQID 337
IST PYT+L+ S+++A Q FA P+ A+V V PFG CF + RI + ++
Sbjct: 299 ISTSIPYTILQHSLFEAFTQVFAKQYPRQAQVNAVGPFGMCFDSK-----RINQAL-SVE 352
Query: 338 LVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATS 397
V+ +VVW I G+N MVQ CL FV+GG++P+ +I IG+RQLE N++ FDLA S
Sbjct: 353 FVMDRPDVVWRISGENLMVQPRNGVSCLAFVNGGLHPKAAITIGSRQLEENMMMFDLARS 412
Query: 398 RLGFSDSLLFERATCT--FNFTS 418
RLGF++SL C+ F+FT+
Sbjct: 413 RLGFTNSLNSHGMKCSDLFDFTN 435
>gi|218202185|gb|EEC84612.1| hypothetical protein OsI_31447 [Oryza sativa Indica Group]
Length = 598
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 169/408 (41%), Positives = 225/408 (55%), Gaps = 45/408 (11%)
Query: 28 RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHL-GGNILWVDCEK--GYVSSTNKT 84
RPKA+ + V +D AT QYV +QRTP V +K V L GG LWVDC+ GY SS+
Sbjct: 38 RPKAVAMPVGRDGATRQYVATFQQRTPRVAMKAVVDLSGGATLWVDCDAAAGYASSSYAG 97
Query: 85 ARCGSAQCNLANAKACG---------------GGICGAGPDNPISNTGTHGDIRIDVLSI 129
CGS C L + +C C +N ++++ G++ DVLS+
Sbjct: 98 VPCGSKPCRLVESPSCSYIASCLGSPPSPACLNRTCTGHAENTVTSSVGRGNVVTDVLSL 157
Query: 130 QSTDGRNPGRA---VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
+T P R T P F+F CG + QGLA+G G+A L R+++ALP+QLA F
Sbjct: 158 PTTFPSAPVRQGPLATAPAFLFTCGPTSLTQGLAAGATGMASLSRARLALPAQLAGTFRF 217
Query: 187 KRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
RKFALCL D G +VFGD Y + D S +L YTPL + +S EYF
Sbjct: 218 SRKFALCLPSVDAGVVVFGDARYV-FDGMDHSNSLLYTPLITRTTDRSS--------EYF 268
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK- 305
I + V V+ +AVPLN TLL + GTK+STV+PYTVLETSI++A+ +AFA++M
Sbjct: 269 ISLKRVVVDDRAVPLNATLLDV------GTKLSTVSPYTVLETSIHEAVTRAFAASMATA 322
Query: 306 -VARVAPVAPFGACFRLQDIGFTRI--GPVVPQI-DLVLQ----NKNVVWSIHGQNSMVQ 357
+ RV VAPF C+ + + I P VP + +L +Q +K W + G N M +
Sbjct: 323 GIPRVPAVAPFELCYDGSKVESSAITGEPAVPVVFELYVQSEARSKVAPWMVSGANLMAR 382
Query: 358 IGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSL 405
G ALCL VDGG P T +VIG +E LL FDL SRLGFS +L
Sbjct: 383 ADGGALCLAVVDGGAAPETPVVIGGHMMEEILLVFDLEKSRLGFSPNL 430
>gi|1401240|gb|AAB03390.1| 7S seed globulin precursor [Glycine max]
Length = 427
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 177/441 (40%), Positives = 233/441 (52%), Gaps = 45/441 (10%)
Query: 2 SVAQSFLLL---CSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPV 58
S+ FL L CS L F +T +VL V D +T + +++RTPL+ V
Sbjct: 3 SILHYFLALSLSCSFLFFLSDSVTPTKPI--NLVVLPVQNDGSTGLHSANLQKRTPLMQV 60
Query: 59 KLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAK-----------ACGGGICGA 107
+ V L GN LWV+CE+ Y S T + C S QC+ AN C CG
Sbjct: 61 PVLVDLNGNHLWVNCEQQYSSKTYQAPFCHSTQCSRANTHQCLSCPAASRPGCHKNTCGL 120
Query: 108 GPDNPISNTGTHGDIRIDVLSIQSTDG--RNPGRAVTVPNFIFLCGSEFVLQ-GLASGVV 164
NPI+ G++ DVL+I +T G + G VTVP F+F C F++Q GL
Sbjct: 121 MSTNPITQQTGLGELGEDVLAIHATQGSTQQLGPLVTVPQFLFSCAPSFLVQKGLPRNTQ 180
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLS--PFDDGAIVFGDGP--YYDLNNFDVSKN 220
G+AGLG + ++LP+QLA+ F L+R+F CLS P GAI+FGD P N D+ +
Sbjct: 181 GVAGLGHAPISLPNQLASHFGLQRQFTTCLSRYPTSKGAIIFGDAPNNMRQFQNQDIFHD 240
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV-PLNKTLLSIDNEGVGGTKIS 279
L +TPL I T G EY + V S+ +N +V PLNK +I GGT IS
Sbjct: 241 LAFTPLTI----TLQG-------EYNVRVNSIRINQHSVFPLNKISSTIVGSTSGGTMIS 289
Query: 280 TVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLV 339
T P+ VL+ S+Y+A Q FA +PK A+V VAPFG CF I P +DLV
Sbjct: 290 TSTPHMVLQQSVYQAFTQVFAQQLPKQAQVKSVAPFGLCFNSNKIN------AYPSVDLV 343
Query: 340 LQNKN-VVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSR 398
+ N VW I G++ MVQ CLG ++GG+ PR I +GARQLE NL+ FDLA SR
Sbjct: 344 MDKPNGPVWRISGEDLMVQAQPGVTCLGVMNGGMQPRAEITLGARQLEENLVVFDLARSR 403
Query: 399 LGFSDSLLFERAT-CT--FNF 416
+GFS S L C FNF
Sbjct: 404 VGFSTSSLHSHGVKCADLFNF 424
>gi|356505878|ref|XP_003521716.1| PREDICTED: basic 7S globulin [Glycine max]
gi|14549156|sp|P13917.2|7SB1_SOYBN RecName: Full=Basic 7S globulin; AltName: Full=SBg7S; Short=Bg;
Contains: RecName: Full=Basic 7S globulin high kDa
subunit; Contains: RecName: Full=Basic 7S globulin low
kDa subunit; Flags: Precursor
gi|434061|dbj|BAA03681.1| basic 7S globulin [Glycine max]
Length = 427
Score = 274 bits (700), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 177/441 (40%), Positives = 233/441 (52%), Gaps = 45/441 (10%)
Query: 2 SVAQSFLLL---CSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPV 58
S+ FL L CS L F +T +VL V D +T + +++RTPL+ V
Sbjct: 3 SILHYFLALSLSCSFLFFLSDSVTPTKPI--NLVVLPVQNDGSTGLHWANLQKRTPLMQV 60
Query: 59 KLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAK-----------ACGGGICGA 107
+ V L GN LWV+CE+ Y S T + C S QC+ AN C CG
Sbjct: 61 PVLVDLNGNHLWVNCEQQYSSKTYQAPFCHSTQCSRANTHQCLSCPAASRPGCHKNTCGL 120
Query: 108 GPDNPISNTGTHGDIRIDVLSIQSTDG--RNPGRAVTVPNFIFLCGSEFVLQ-GLASGVV 164
NPI+ G++ DVL+I +T G + G VTVP F+F C F++Q GL
Sbjct: 121 MSTNPITQQTGLGELGEDVLAIHATQGSTQQLGPLVTVPQFLFSCAPSFLVQKGLPRNTQ 180
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLS--PFDDGAIVFGDGP--YYDLNNFDVSKN 220
G+AGLG + ++LP+QLA+ F L+R+F CLS P GAI+FGD P N D+ +
Sbjct: 181 GVAGLGHAPISLPNQLASHFGLQRQFTTCLSRYPTSKGAIIFGDAPNNMRQFQNQDIFHD 240
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV-PLNKTLLSIDNEGVGGTKIS 279
L +TPL I T G EY + V S+ +N +V PLNK +I GGT IS
Sbjct: 241 LAFTPLTI----TLQG-------EYNVRVNSIRINQHSVFPLNKISSTIVGSTSGGTMIS 289
Query: 280 TVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLV 339
T P+ VL+ S+Y+A Q FA +PK A+V VAPFG CF I P +DLV
Sbjct: 290 TSTPHMVLQQSVYQAFTQVFAQQLPKQAQVKSVAPFGLCFNSNKIN------AYPSVDLV 343
Query: 340 LQNKN-VVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSR 398
+ N VW I G++ MVQ CLG ++GG+ PR I +GARQLE NL+ FDLA SR
Sbjct: 344 MDKPNGPVWRISGEDLMVQAQPGVTCLGVMNGGMQPRAEITLGARQLEENLVVFDLARSR 403
Query: 399 LGFSDSLLFERAT-CT--FNF 416
+GFS S L C FNF
Sbjct: 404 VGFSTSSLHSHGVKCADLFNF 424
>gi|330689364|pdb|3AUP|A Chain A, Crystal Structure Of Basic 7s Globulin From Soybean
gi|330689365|pdb|3AUP|B Chain B, Crystal Structure Of Basic 7s Globulin From Soybean
gi|330689366|pdb|3AUP|C Chain C, Crystal Structure Of Basic 7s Globulin From Soybean
gi|330689367|pdb|3AUP|D Chain D, Crystal Structure Of Basic 7s Globulin From Soybean
Length = 403
Score = 271 bits (692), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 168/408 (41%), Positives = 222/408 (54%), Gaps = 40/408 (9%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ 91
+VL V D +T + +++RTPL+ V + V L GN LWV+CE+ Y S T + C S Q
Sbjct: 10 VVLPVQNDGSTGLHWANLQKRTPLMQVPVLVDLNGNHLWVNCEQQYSSKTYQAPFCHSTQ 69
Query: 92 CNLANAK-----------ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG--RNPG 138
C+ AN C CG NPI+ G++ DVL+I +T G + G
Sbjct: 70 CSRANTHQCLSCPAASRPGCHKNTCGLMSTNPITQQTGLGELGEDVLAIHATQGSTQQLG 129
Query: 139 RAVTVPNFIFLCGSEFVLQ-GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS-- 195
VTVP F+F C F++Q GL G+AGLG + ++LP+QLA+ F L+R+F CLS
Sbjct: 130 PLVTVPQFLFSCAPSFLVQKGLPRNTQGVAGLGHAPISLPNQLASHFGLQRQFTTCLSRY 189
Query: 196 PFDDGAIVFGDGP--YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
P GAI+FGD P N D+ +L +TPL I T G EY + V S+
Sbjct: 190 PTSKGAIIFGDAPNNMRQFQNQDIFHDLAFTPLTI----TLQG-------EYNVRVNSIR 238
Query: 254 VNGKAV-PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
+N +V PLNK +I GGT IST P+ VL+ S+Y+A Q FA +PK A+V V
Sbjct: 239 INQHSVFPLNKISSTIVGSTSGGTMISTSTPHMVLQQSVYQAFTQVFAQQLPKQAQVKSV 298
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNKN-VVWSIHGQNSMVQIGGDALCLGFVDGG 371
APFG CF I P +DLV+ N VW I G++ MVQ CLG ++GG
Sbjct: 299 APFGLCFNSNKIN------AYPSVDLVMDKPNGPVWRISGEDLMVQAQPGVTCLGVMNGG 352
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERAT-CT--FNF 416
+ PR I +GARQLE NL+ FDLA SR+GFS S L C FNF
Sbjct: 353 MQPRAEITLGARQLEENLVVFDLARSRVGFSTSSLHSHGVKCADLFNF 400
>gi|18543|emb|CAA34489.1| unnamed protein product [Glycine max]
Length = 427
Score = 267 bits (683), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 175/441 (39%), Positives = 231/441 (52%), Gaps = 45/441 (10%)
Query: 2 SVAQSFLLL---CSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPV 58
S+ FL L CS L F +T +VL V D +T + +++RTPL+ V
Sbjct: 3 SILHYFLALSLSCSFLFFLSDSVTPTKPI--NLVVLPVQNDGSTGLHWANLQKRTPLMQV 60
Query: 59 KLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAK-----------ACGGGICGA 107
+ V L GN LWV+CE+ Y S T + C S QC+ AN C CG
Sbjct: 61 PVLVDLNGNHLWVNCEQQYSSKTYQAPFCHSTQCSRANTHQCLSCPAASRPGCHKNTCGL 120
Query: 108 GPDNPISNTGTHGDIRIDVLSIQSTDG--RNPGRAVTVPNFIFLCGSEFVLQ-GLASGVV 164
NPI+ G++ DVL+I +T G + G VTVP F+F C F++Q GL
Sbjct: 121 MSTNPITQQTGLGELGEDVLAIHATQGSTQQLGPLVTVPQFLFSCAPSFLVQKGLPRNTQ 180
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLS--PFDDGAIVFGDGP--YYDLNNFDVSKN 220
G+AGLG + ++LP+QLA+ F L+R+F CLS P GAI+FGD P N D+ +
Sbjct: 181 GVAGLGHAPISLPNQLASHFGLQRQFTTCLSRYPTSKGAIIFGDAPNNMRQFQNQDIFHD 240
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV-PLNKTLLSIDNEGVGGTKIS 279
L +TPL I T G EY + V S+ + +V PLNK +I GGT IS
Sbjct: 241 LAFTPLTI----TLQG-------EYNVRVNSIRITQHSVFPLNKISSTIVGSTSGGTMIS 289
Query: 280 TVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLV 339
T P+ VL+ S+Y+A Q A +PK A+V VAPFG CF I P +DLV
Sbjct: 290 TSTPHMVLQQSVYQACTQVCAQQLPKQAQVKSVAPFGLCFNSNKIN------AYPSVDLV 343
Query: 340 LQNKN-VVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSR 398
+ N VW I G++ MVQ CLG ++GG+ PR I +GARQLE NL+ FDLA SR
Sbjct: 344 MDKPNGPVWRISGEDLMVQAQPGVTCLGVMNGGMQPRAEITLGARQLEENLVVFDLARSR 403
Query: 399 LGFSDSLLFERAT-CT--FNF 416
+GFS S L C FNF
Sbjct: 404 VGFSTSSLHSHGVKCADLFNF 424
>gi|11191819|emb|CAC16394.1| conglutin gamma [Lupinus albus]
Length = 452
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 160/420 (38%), Positives = 224/420 (53%), Gaps = 47/420 (11%)
Query: 25 TSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKT 84
+S +P LVL + +D +T + I +RTPL+ V + + L G LWV C + Y SST +
Sbjct: 41 SSSKPNLLVLPIQQDASTKLHWGNILKRTPLMQVPVLLDLNGKHLWVTCSQHYSSSTYQA 100
Query: 85 ARCGSAQCNLANAK-------------ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQS 131
C S QC+ AN C CG NP++ G++ DVL++ S
Sbjct: 101 PFCHSTQCSRANTHQCFTCTDSTTSRPGCHNNTCGLISSNPVTQESGLGELAQDVLALHS 160
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQ-GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
T G G V +P F+F C F+ Q GL + V G GLG + ++LP+QL + F LKR+F
Sbjct: 161 THGSKLGSLVKIPQFLFSCAPTFLTQKGLPNNVQGALGLGHAPISLPNQLFSHFGLKRQF 220
Query: 191 ALCLS--PFDDGAIVFGD-----GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV 243
+CLS P +GAI+FGD Y N+ DV ++ YTPL I+K
Sbjct: 221 TMCLSSYPTSNGAILFGDINDPNNNNYIHNSLDVLHDMVYTPLTISKQG----------- 269
Query: 244 EYFIGVTSVHVNGKAVPLNKT---------LLSIDNEGVGGTKISTVNPYTVLETSIYKA 294
EYFI V+++ VN V K ++ +GG I+T NPYTVL SI++
Sbjct: 270 EYFIQVSAIRVNKHMVIPTKNPSMFPSSSSSSYHESSEIGGAMITTTNPYTVLRHSIFEV 329
Query: 295 LVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNS 354
Q FA+ +PK A+V V PFG C+ + +I VP +DL++ +VVW I G+N
Sbjct: 330 FTQVFANNVPKQAQVKAVGPFGLCYDTK-----KISGGVPSVDLIMDKSDVVWRISGENL 384
Query: 355 MVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF-SDSLLFERATCT 413
MVQ CLGFVDGGV+ R I +G QLE NL+ FDLA SR+GF ++SL +C+
Sbjct: 385 MVQAQDGVSCLGFVDGGVHTRAGIALGTHQLEENLVVFDLARSRVGFNTNSLKSHGKSCS 444
>gi|224127985|ref|XP_002329226.1| predicted protein [Populus trichocarpa]
gi|222871007|gb|EEF08138.1| predicted protein [Populus trichocarpa]
Length = 442
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 164/406 (40%), Positives = 227/406 (55%), Gaps = 36/406 (8%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
+ KD +T QY+ +TPL+P KL + LG WV+C+ Y+SST + C S+ N
Sbjct: 34 IQKDHSTSQYIITAYLKTPLMPTKLVLDLGATYSWVNCDD-YISSTYQHVPCNSSISNSL 92
Query: 96 NAKACGGGICGAGP-------------DNPIS-----NTGTHGDIRIDVLSIQSTDGRNP 137
+A C IC P D P+ + D +D L++ + N
Sbjct: 93 SAYGCED-ICDGPPGPNCANNSFLFLLDKPLETVDYKKVNSLNDALVDYLALLN----NL 147
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK-FALCLSP 196
G ++ NFIF C L+GLA GV G+A LG S +++P Q+ AFS FA+CLS
Sbjct: 148 GSLSSIDNFIFSCARTGFLKGLAKGVTGLASLGNSNLSIPVQINKAFSSSPNCFAMCLSG 207
Query: 197 F--DDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
G +FG GPY L+ D+SK+L YTPL N S EY++G+TS+
Sbjct: 208 SISQPGVALFGSKGPYNFLHGIDLSKSLLYTPLIFNPFGKDFDPYSHRSPEYYVGLTSIK 267
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF----ASAMPKVARV 309
VNG+ V NK LL+ ++ G GGT+ISTV PYT L++SIYKA AF AS+ +
Sbjct: 268 VNGEMVAFNKALLAFNDRGYGGTRISTVVPYTKLQSSIYKAFTLAFLKEAASSAFNLTTT 327
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG---DALCLG 366
PV PF C+ + + T++GP VP I+LVL ++VVW + G NSMV++ D CLG
Sbjct: 328 KPVKPFRVCYPARAVKTTQMGPAVPIIELVLDRQDVVWKMFGSNSMVRVTKKSVDVWCLG 387
Query: 367 FVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
FVDGG++ SI+IG QLE+NLLQFDL + +LGFS S+L + C
Sbjct: 388 FVDGGID-GPSIMIGGLQLEDNLLQFDLQSQKLGFSSSILSKGTNC 432
>gi|449527745|ref|XP_004170870.1| PREDICTED: basic 7S globulin 2-like [Cucumis sativus]
Length = 451
Score = 264 bits (674), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 161/422 (38%), Positives = 228/422 (54%), Gaps = 34/422 (8%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSA 90
AL+ + K +L Y + +TPL P L + LGG W+DC + Y SS+ K C +
Sbjct: 24 ALIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTP 83
Query: 91 QCNLANAKACGG------GICG-------AGPDNP----------ISNTGTHGDIRIDVL 127
N N CG IC A P+NP ++ DVL
Sbjct: 84 LSNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVL 143
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
++ +TDG +P F F C L+ +A V+G+A LGRS +++PS ++A F+
Sbjct: 144 ALSTTDGSTSAPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSP 203
Query: 188 RKFALCLSPFDDGAIV--FGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE 244
+ FA+CLS G V FG GPY N D+SK+L YTPL N V+ + PS E
Sbjct: 204 KYFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSYE 263
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDN-EGVGGTKISTVNPYTVLETSIYKALVQAF--AS 301
Y++G++++ +NGK VP N +LLS + G GG KIST Y +L +SIY+A F +
Sbjct: 264 YYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEA 323
Query: 302 AMPKVARVAPVAPFGACFRLQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQI-- 358
+ + V PFG C+ + +G T G P +DLV++ + VVW + G+N+MV+I
Sbjct: 324 VVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKK 383
Query: 359 -GGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNF 416
G DA CLGF++GG PRT IVIG Q+E++LLQFDL R GFS S L E +C+ F+F
Sbjct: 384 KGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKFDF 443
Query: 417 TS 418
TS
Sbjct: 444 TS 445
>gi|662366|gb|AAB53771.1| conglutin gamma [Lupinus angustifolius]
gi|666056|emb|CAA46552.1| conglutin gamma [Lupinus angustifolius]
gi|328684579|gb|AEB33719.1| conglutin gamma 1 [Lupinus angustifolius]
Length = 449
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 164/418 (39%), Positives = 221/418 (52%), Gaps = 45/418 (10%)
Query: 25 TSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKT 84
TS +P LVL V +D +T + I +RTPL+ V L + L G LWV C + Y SST +
Sbjct: 40 TSSKPNLLVLPVQEDASTGLHWANIHKRTPLMQVPLLLDLNGKHLWVTCSQHYSSSTYQA 99
Query: 85 ARCGSAQCNLANAK-------------ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQS 131
C S QC+ AN C CG NP++ G++ DVL+I S
Sbjct: 100 PFCHSTQCSRANTHQCFTCTDSTTTRPGCHNNTCGLLSSNPVTQESGLGELAQDVLAIHS 159
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQ-GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
T G G V VP F+F C F+ Q GL + V G GLG++ ++L +QL + F LKR+F
Sbjct: 160 THGSKLGPMVKVPQFLFSCAPSFLAQKGLPNNVQGALGLGQAPISLQNQLFSHFGLKRQF 219
Query: 191 ALCLSPFD--DGAIVFGD-----GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV 243
++CLS + +GAI+FGD Y N+ DV +L YTPL I+K
Sbjct: 220 SVCLSRYSTSNGAILFGDINDPNNNNYIHNSLDVLHDLVYTPLTISKQG----------- 268
Query: 244 EYFIGVTSVHVNGKAV-PLNKTLLSIDNEG------VGGTKISTVNPYTVLETSIYKALV 296
EYFI V ++ VN V P +S + +GG I+T +PYTVL SI++
Sbjct: 269 EYFIQVNAIRVNKHLVIPTKNPFISPSSTSYHGSGEIGGALITTTHPYTVLSHSIFEVFT 328
Query: 297 QAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV 356
Q FA+ MPK A+V V PFG C+ + +I P +DL+L + VW I +N MV
Sbjct: 329 QVFANNMPKQAQVKAVGPFGLCYDSR-----KISGGAPSVDLILDKNDAVWRISSENFMV 383
Query: 357 QIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF-SDSLLFERATCT 413
Q CLGFVDGGV+ R I +GA LE NL+ FDL SR+GF S+SL TC+
Sbjct: 384 QAQDGVSCLGFVDGGVHARAGIALGAHHLEENLVVFDLERSRVGFNSNSLKSYGKTCS 441
>gi|224127977|ref|XP_002329224.1| predicted protein [Populus trichocarpa]
gi|222871005|gb|EEF08136.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 167/408 (40%), Positives = 230/408 (56%), Gaps = 38/408 (9%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
+ KD +T QY+ +TPL+P KL + LG WV+C+ Y+SST + C S+ N
Sbjct: 34 IQKDHSTSQYIITAYLKTPLMPTKLVLDLGATYSWVNCDD-YISSTYQHVPCNSSIANSL 92
Query: 96 NAKACGGGICGAGP-------------DNPIS-----NTGTHGDIRIDVLSIQSTDGRNP 137
+A C IC P D P+ + D +D L++ + N
Sbjct: 93 SAYGCED-ICDGPPGPNCANNSFLFLLDKPLETVDYKKVNSLNDALVDYLALLN----NL 147
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK-FALCLSP 196
G ++ NFIF C L+GLA GV G+A LG S +++P Q+ AFS FA+CLS
Sbjct: 148 GSLSSIDNFIFSCARTGFLKGLAKGVTGLASLGNSNLSIPVQINKAFSSSPNCFAMCLSG 207
Query: 197 F--DDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGF-LGEPSVEYFIGVTSV 252
G +FG GPY L+ D+SK+L YTPL N + S S EY++G+T++
Sbjct: 208 SISQPGVALFGSKGPYNFLHGIDLSKSLLYTPLIFNPLGRDSDSNTHRLSPEYYVGLTAI 267
Query: 253 HVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAF----ASAMPKVA 307
VNGK V NK LL+ID++ G GGT+ISTV PYT L++SIYKA AF AS+ +
Sbjct: 268 KVNGKMVAFNKALLAIDDQSGSGGTRISTVVPYTKLQSSIYKAFTLAFLKEAASSAFNLT 327
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG---DALC 364
PV PF C+ + T++GP VP I+LVL ++VVW + G NSMV++ D C
Sbjct: 328 TTKPVKPFRVCYPADAVKTTQMGPAVPIIELVLDRQDVVWKMFGSNSMVRVTKKSVDLWC 387
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
LGFVDGG++ SI+IG QLE+NLLQFDL + +LGFS S+L + C
Sbjct: 388 LGFVDGGID-GPSIMIGGLQLEDNLLQFDLQSQKLGFSSSILSKGTNC 434
>gi|224127969|ref|XP_002329222.1| predicted protein [Populus trichocarpa]
gi|222871003|gb|EEF08134.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 169/407 (41%), Positives = 227/407 (55%), Gaps = 35/407 (8%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
+ KD +T QY+ +TPL+P KL + LG WV+C+ Y+SST + C S+ N
Sbjct: 34 IQKDHSTSQYIITAYLKTPLMPTKLLLDLGATYSWVNCDD-YISSTYQHVPCNSSIANSL 92
Query: 96 NAKAC-------GGGICGAG-----PDNPISNTGTH-----GDIRIDVLSIQSTDGRNPG 138
+ C G C PDNPI D +D L++ +T G
Sbjct: 93 GSYGCVDICDGPPGPNCANNSFLFLPDNPIKPVDYKKVNGLNDALVDYLALLNTLGS--- 149
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK-FALCLSPF 197
++ NFIF C L+GLA GV G+A LG S +++P Q+ AFS FA+CLS
Sbjct: 150 -LSSIDNFIFSCARTGFLKGLAKGVTGLASLGNSNLSIPVQINKAFSSSPNCFAMCLSGS 208
Query: 198 --DDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKV-NTASGFLGEPSVEYFIGVTSVH 253
G +FG GPY L+ D+SK+L YTPL N + A S EY++G+T++
Sbjct: 209 ISQPGVALFGSKGPYNFLHGIDLSKSLLYTPLIFNPLGRDAVPNTHTLSPEYYVGLTAIK 268
Query: 254 VNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAF----ASAMPKVAR 308
VNGK V NKTLL+ID + G GGT+ISTV PYT L++SIYKA AF AS+ +
Sbjct: 269 VNGKMVAFNKTLLAIDGQSGSGGTRISTVVPYTKLQSSIYKAFTLAFLREAASSAFNLTT 328
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG---DALCL 365
PV PF C+ + T++GP VP I+LVL ++VVW + G NSMV++ D CL
Sbjct: 329 TKPVKPFSVCYPAGAVKTTQMGPAVPIIELVLDRQDVVWKMFGSNSMVRVTKKSVDVWCL 388
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
GFVDGG SI+IG QLE+NLLQFDL + +LGFS S+L + C
Sbjct: 389 GFVDGGAIDGPSIMIGGLQLEDNLLQFDLQSKKLGFSSSILSKGTNC 435
>gi|255552243|ref|XP_002517166.1| ATP binding protein, putative [Ricinus communis]
gi|223543801|gb|EEF45329.1| ATP binding protein, putative [Ricinus communis]
Length = 324
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 210/369 (56%), Gaps = 65/369 (17%)
Query: 69 LWVDCEKGYVSSTNKTARCGSAQCNLANAKACG------------GGICGAGPDNPISNT 116
+WV+CE+GYVSS+ + C S C LAN+ AC C P NP+ +
Sbjct: 1 MWVNCEEGYVSSSYRPVSCDSVLCTLANSHACDTECYSTPKPDCHNNTCAHSPGNPVIHL 60
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T G I D++S+QS +G+ P R V+VPNF F+C S +
Sbjct: 61 STGGQIGQDIVSLQSFNGKTPDRIVSVPNFPFVCSSTLI--------------------- 99
Query: 177 PSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVSKNLK-YTPLFINKVNT 233
KF++CLS +GAI+FGDGP V K+L YTPL N V+T
Sbjct: 100 ------------KFSICLSSSTKPNGAILFGDGP-----RSIVPKDLLIYTPLIKNPVST 142
Query: 234 ---ASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETS 290
+ + S +YFI V S+ V GK + +NKTLLSI+N+G GGT+IST+ PYT+L TS
Sbjct: 143 LGPENNVVPTTSSDYFISVNSIRVGGKDIKVNKTLLSINNKGKGGTRISTIKPYTMLHTS 202
Query: 291 IYKALVQAFASAMPKVARVAPVAPFGACF-RLQDIGFTRIGPVVPQIDLVLQNK-NVVWS 348
+YKALV AF A + V P PFGACF D +GP VP IDLVL+ + +V W
Sbjct: 203 LYKALVTAFVRAYGVIPHVEP--PFGACFPSFSD----ELGPKVPFIDLVLEGQGSVYWR 256
Query: 349 IHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFE 408
I NS+V+I CLGFVDGG +P TSIVIG QLE+NLLQFDLA+SRLGFS SLL
Sbjct: 257 ISSANSLVKISSIVTCLGFVDGGPDPFTSIVIGGCQLEDNLLQFDLASSRLGFSSSLLAR 316
Query: 409 RATCTFNFT 417
+C+ NFT
Sbjct: 317 NTSCS-NFT 324
>gi|351727625|ref|NP_001237167.1| basic 7S globulin 2 precursor [Glycine max]
gi|51316037|sp|Q8RVH5.1|7SBG2_SOYBN RecName: Full=Basic 7S globulin 2; AltName: Full=SBg7S; Short=Bg;
Contains: RecName: Full=Basic 7S globulin 2 high kDa
subunit; Contains: RecName: Full=Basic 7S globulin 2 low
kDa subunit; Flags: Precursor
gi|20302594|dbj|BAB91077.1| basic 7S globulin isoform [Glycine max]
Length = 433
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 161/408 (39%), Positives = 219/408 (53%), Gaps = 41/408 (10%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ 91
LVL V D +T + +++RTPL+ V + V L GN LWV+CE+ Y S T + C S Q
Sbjct: 41 LVLPVQNDASTGLHWANLQKRTPLMQVPVLVDLNGNHLWVNCEQHYSSKTYQAPFCHSTQ 100
Query: 92 CNLANAK-----------ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG--RNPG 138
C+ AN C CG NPI+ G++ DVL+I +T G + G
Sbjct: 101 CSRANTHQCLSCPAASRPGCHKNTCGLMSTNPITQQTGLGELGQDVLAIHATQGSTQQLG 160
Query: 139 RAVTVPNFIFLCGSEFVLQ-GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS-- 195
VTVP F+F C F+LQ GL + G+AGLG + ++LP+QLA+ F L+ +F CLS
Sbjct: 161 PLVTVPQFLFSCAPSFLLQKGLPRNIQGVAGLGHAPISLPNQLASHFGLQHQFTTCLSRY 220
Query: 196 PFDDGAIVFGDGP--YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
P GA++FGD P +N D+ +L +TPL + P EY + V+S+
Sbjct: 221 PTSKGALIFGDAPNNMQQFHNQDIFHDLAFTPLTVT-----------PQGEYNVRVSSIR 269
Query: 254 VNGKAV-PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
+N +V P NK +I GGT IST P+ VL+ S+Y+A Q FA + K A+V V
Sbjct: 270 INQHSVFPPNKISSTIVGSS-GGTMISTSTPHMVLQQSLYQAFTQVFAQQLEKQAQVKSV 328
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNKN-VVWSIHGQNSMVQIGGDALCLGFVDGG 371
APFG CF I P +DLV+ N VW I G++ MVQ CLG ++GG
Sbjct: 329 APFGLCFNSNKIN------AYPSVDLVMDKPNGPVWRISGEDLMVQAQPGVTCLGVMNGG 382
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERAT-C--TFNF 416
+ PR + +G RQLE L+ FDLA SR+GFS S L C FNF
Sbjct: 383 MQPRAEVTLGTRQLEEKLMVFDLARSRVGFSTSSLHSHGVKCGDLFNF 430
>gi|356563517|ref|XP_003550008.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 425
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 166/438 (37%), Positives = 240/438 (54%), Gaps = 42/438 (9%)
Query: 3 VAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTV 62
+A SF ++ S L+ +S T S P ++ V KD +TLQY+T + TPL+P KL +
Sbjct: 1 MAYSFAMIISFLLCLMS--TLSHSLSPVWFLIPVTKDASTLQYITTLSYGTPLLPTKLVL 58
Query: 63 HLGGNILWVDC-EKGYVSSTNKTARCGSAQCNLANAKACGGGI-------------CGAG 108
LGG LW+ C + SS++ T S QC A C
Sbjct: 59 DLGGPFLWLHCASRNTPSSSSLTTPHRSLQCFTAKTHKSTNSFLSSPVDEVHQYHPCQVF 118
Query: 109 PDNPISNT-GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIA 167
P+N I+ T + G++ D++++QS G+ V + +F C +L GLA G G+
Sbjct: 119 PENSITGTVASEGELVEDLMALQSPQEEEGGQLVEHQS-LFTCSPTTLLNGLARGARGML 177
Query: 168 GLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLF 227
GLGRS+ + PSQ+ FS RK LCLS G ++ G+ Y+ +V K+L +TPL
Sbjct: 178 GLGRSRSSFPSQVFDNFSTHRKLTLCLSS-SKGVVLLGNVATYE---SEVLKSLTFTPLI 233
Query: 228 INKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG----TKISTVNP 283
+ P EY I V+SV +NG + L+ + NE G T +ST+ P
Sbjct: 234 TSF----------PRQEYIINVSSVKINGNRLSLDTSSSESSNEQDGSVGALTLLSTILP 283
Query: 284 YTVLETSIYKALVQAF--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQ 341
YT +++SIY + +F A+ + RVA VAPF CF + + GP VP I+LVLQ
Sbjct: 284 YTTMQSSIYNSFKTSFEDAAVAMNMTRVASVAPFELCFSSRG---EQAGPSVPVIELVLQ 340
Query: 342 NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
++ V W+IHG+NSMV++ + +CLGF+DGGVNPR SIVIG QLE+ ++QFDLATS +GF
Sbjct: 341 SEMVKWTIHGRNSMVRVSDEVVCLGFLDGGVNPRNSIVIGGYQLEDVVVQFDLATSMVGF 400
Query: 402 SDSLLFERATCT-FNFTS 418
S SL+ + C+ F F S
Sbjct: 401 SSSLVAKNTKCSDFKFAS 418
>gi|449462344|ref|XP_004148901.1| PREDICTED: basic 7S globulin 2-like [Cucumis sativus]
Length = 451
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 160/422 (37%), Positives = 226/422 (53%), Gaps = 34/422 (8%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSA 90
AL+ + K +L Y + +TPL P L + LGG W+ C + Y SS+ K C +
Sbjct: 24 ALIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIHCYQNYNSSSYKFVLCNTP 83
Query: 91 QCNLANAKACGG------GICG-------AGPDNP----------ISNTGTHGDIRIDVL 127
N N CG IC A P+NP ++ DVL
Sbjct: 84 LSNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVL 143
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
++ +T G +P F F C L+ +A V+G+A LGRS +++PS ++A FS
Sbjct: 144 ALSTTGGSTSAPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFSSP 203
Query: 188 RKFALCLSPFDDGAIV--FGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE 244
+ FA+CLS G V FG GPY N D+SK+L YTPL N V+ + PS E
Sbjct: 204 KYFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSYE 263
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDN-EGVGGTKISTVNPYTVLETSIYKALVQAF--AS 301
Y++G++++ +NGK VP N +LLS + G GG KIST Y +L +SIY+A F +
Sbjct: 264 YYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEA 323
Query: 302 AMPKVARVAPVAPFGACFRLQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQI-- 358
+ + V PFG C+ + +G T G P +DLV++ + VVW + G+N+MV+I
Sbjct: 324 VVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKK 383
Query: 359 -GGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNF 416
G DA CLGF++GG PRT IVIG Q+E++LLQFDL R GFS S L E +C+ F+F
Sbjct: 384 KGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALKEGTSCSKFDF 443
Query: 417 TS 418
TS
Sbjct: 444 TS 445
>gi|50726102|dbj|BAD33624.1| putative dermal glycoprotein precursor, extracellular [Oryza sativa
Japonica Group]
gi|50726491|dbj|BAD34099.1| putative dermal glycoprotein precursor, extracellular [Oryza sativa
Japonica Group]
Length = 444
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 168/412 (40%), Positives = 223/412 (54%), Gaps = 45/412 (10%)
Query: 24 QTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHL-GGNILWVDCEK--GYVSS 80
Q RPKA+ + V +D AT QYV +QRTP V VK V L GG LWVDC+ GY SS
Sbjct: 35 QRPTRPKAVAMPVVRDGATRQYVATFQQRTPRVAVKAVVDLSGGATLWVDCDAAAGYASS 94
Query: 81 TNKTARCGSAQCNLANAKACG---------------GGICGAGPDNPISNTGTHGDIRID 125
+ CGS C L + +C C +N ++++ G++ D
Sbjct: 95 SYAGVPCGSKPCRLVESPSCSYIASCLGSPPSPACLNRTCTGHAENTVTSSVGRGNVVTD 154
Query: 126 VLSIQSTDGRNP---GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA 182
VLS+ +T P G T P F+F CG + QGLA+G G+A L R+++ALP+QLA
Sbjct: 155 VLSLPTTFPSAPVRQGPLATAPAFLFTCGPTSLTQGLAAGAAGMASLSRARLALPAQLAG 214
Query: 183 AFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPS 242
F RKFALCL D G +VFGD Y + D S +L YTPL + +S
Sbjct: 215 TFRFSRKFALCLPSVDAGVVVFGDARYV-FDGMDHSNSLLYTPLITRTTDRSS------- 266
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
EYFI + V V+ +AVPLN TLL + GTK+STV+PYTVLETSI++A+ +AFA++
Sbjct: 267 -EYFISLKRVVVDDRAVPLNATLLDV------GTKLSTVSPYTVLETSIHEAVTRAFAAS 319
Query: 303 MPK--VARVAPVAPFGACFRLQDIGFTRI--GPVVP-----QIDLVLQNKNVVWSIHGQN 353
M + RV VAPF C+ + + I P VP + +++K W + G N
Sbjct: 320 MATAGIPRVPAVAPFELCYDGSKVESSAITGEPAVPVVFELHVQSEVRSKVAPWMVSGAN 379
Query: 354 SMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSL 405
M + G ALCL VDGG P +VIG +E LL FDL SRLGFS +L
Sbjct: 380 LMARADGGALCLAVVDGGAAPEAPVVIGGHMMEEILLVFDLEKSRLGFSPNL 431
>gi|224127981|ref|XP_002329225.1| predicted protein [Populus trichocarpa]
gi|222871006|gb|EEF08137.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 166/408 (40%), Positives = 229/408 (56%), Gaps = 38/408 (9%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
+ KD +T QY+ +TPL+P KL + LG WV+C+ Y+SST + C S+
Sbjct: 34 IQKDHSTSQYIITAYLKTPLMPTKLVLDLGATYSWVNCDD-YISSTYQHVPCNSSIFYSL 92
Query: 96 NAKACGGGICGAGP-------------DNPIS-----NTGTHGDIRIDVLSIQSTDGRNP 137
+A C IC P D P+ + D +D L++ + N
Sbjct: 93 SAYGCED-ICDGPPGPNCANNSFLFLLDKPLETVDYKKVNSLNDALVDYLALLN----NL 147
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK-FALCLSP 196
G ++ NFIF C L+GLA GV G+A LG S +++P Q+ AFS FA+CLS
Sbjct: 148 GSLSSIDNFIFSCARTGFLKGLAKGVTGLASLGNSNLSIPVQINKAFSSSPNCFAMCLSG 207
Query: 197 F--DDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGF-LGEPSVEYFIGVTSV 252
G +FG GPY L+ D+SK+L YTPL N + S S EY++G+T++
Sbjct: 208 SISQPGVALFGSKGPYNFLHGIDLSKSLLYTPLIFNPLGRDSDSNTHRLSPEYYVGLTAI 267
Query: 253 HVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAF----ASAMPKVA 307
VNGK V NK LL+ID++ G GGT+ISTV PYT L++SIYKA AF AS+ +
Sbjct: 268 KVNGKMVAFNKALLAIDDQSGSGGTRISTVVPYTKLQSSIYKAFTLAFLKEAASSAFNLT 327
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG---DALC 364
PV PF C+ + T++GP VP I+LVL ++VVW + G NSMV++ D C
Sbjct: 328 TTKPVKPFRVCYPADAVKTTQMGPAVPIIELVLDRQDVVWKMFGSNSMVRVTKKSVDLWC 387
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
LGFVDGG++ SI+IG QLE+NLLQFDL + +LGFS S+L + C
Sbjct: 388 LGFVDGGID-GPSIMIGGLQLEDNLLQFDLQSQKLGFSSSILSKGTNC 434
>gi|358249022|ref|NP_001239980.1| uncharacterized protein LOC100806719 precursor [Glycine max]
gi|255646101|gb|ACU23537.1| unknown [Glycine max]
Length = 414
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 163/410 (39%), Positives = 233/410 (56%), Gaps = 47/410 (11%)
Query: 29 PKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-EKGYVSSTNKTARC 87
P + ++ V KD +TLQY+T + TPLVP L + LGG LW+ C + SS++ T
Sbjct: 25 PASFLIPVTKDASTLQYITTLSYGTPLVPTPLVLDLGGPFLWLHCASRNTPSSSSLTTPH 84
Query: 88 GSAQCNLANAKACGGGI-------------CGAGPDNPISNT-GTHGDIRIDVLSIQSTD 133
S QC A C P+N I+ T G++ D++++QS
Sbjct: 85 RSLQCFTAKTHKSTNSFLSSPVDEVDQYQPCQVFPENSITGTIAAEGELVEDLMALQS-- 142
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+ G+ V + F C +L GLA G G+ GLGRS+ +LPSQ+ FS RK LC
Sbjct: 143 AKEKGQLVEHQSR-FTCSPTTLLHGLAKGARGMVGLGRSRSSLPSQVFDNFSTHRKLTLC 201
Query: 194 LSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
LS G ++ G+ Y+ +V K+L +TPL + F P+ EYFI V SV
Sbjct: 202 LSS-SKGVVLLGNVATYE---SEVLKSLTFTPL-------VTSF---PTQEYFINVNSVK 247
Query: 254 VNGKAVPLNKTLLSIDNEGVGG--TKISTVNPYTVLETSIYKALVQAF--ASAMPKVARV 309
+NGK LS ++EG GG T +ST+ PYT +++SIY + +F A+ + RV
Sbjct: 248 INGKR-------LSNEHEGGGGVLTLLSTIVPYTTMQSSIYNSFKTSFEDAAVAMNITRV 300
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
A VAPF CF + +++GP +P I+LVLQ++ V W+IHG+NSMV++ + LCLGF+D
Sbjct: 301 ASVAPFELCFSSRG---SQVGPSMPVIELVLQSEMVKWTIHGRNSMVRVSDEVLCLGFLD 357
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTS 418
GGVNPR SIVIG QLE+ ++QFDLATS +GFS SL+ + C+ F + S
Sbjct: 358 GGVNPRNSIVIGGYQLEDVIVQFDLATSMVGFSSSLVAKNTKCSDFKYAS 407
>gi|224146829|ref|XP_002336347.1| predicted protein [Populus trichocarpa]
gi|222834772|gb|EEE73235.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 258 bits (659), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 167/407 (41%), Positives = 226/407 (55%), Gaps = 35/407 (8%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
+ KD +T QY+ +TPL+P KL + LG WV+C+ Y+SST + C S+ N
Sbjct: 34 IQKDHSTSQYIITAYLKTPLMPTKLLLDLGATYSWVNCDD-YISSTYQHVPCNSSIANSL 92
Query: 96 NAKAC-------GGGICGAG-----PDNPISNTGTH-----GDIRIDVLSIQSTDGRNPG 138
+ C G C PDNPI D +D L++ +T G
Sbjct: 93 GSYGCVDICDGPPGPNCANNSFLFLPDNPIKPVDYKKVNGLNDALVDYLALLNTLGS--- 149
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK-FALCLSPF 197
++ NFIF C L+GLA GV G+A LG S +++P Q+ AFS FA+CLS
Sbjct: 150 -LSSIDNFIFSCARTGFLKGLAKGVTGLASLGNSNLSIPVQINKAFSSSPNCFAMCLSGS 208
Query: 198 --DDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKV-NTASGFLGEPSVEYFIGVTSVH 253
G +FG GPY L+ D+SK+L YTPL N + A S EY++G+T++
Sbjct: 209 ISQPGVALFGSKGPYNFLHGIDLSKSLLYTPLIFNPLGRDAVPNTHTLSPEYYVGLTAIK 268
Query: 254 VNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAF----ASAMPKVAR 308
VNGK V NKTLL+ID + G GGT+ISTV PYT L++SIYKA AF AS+ +
Sbjct: 269 VNGKMVTFNKTLLAIDAQSGSGGTRISTVVPYTKLQSSIYKAFTLAFLREAASSAFNLTT 328
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG---DALCL 365
PV PF C+ + T++GP VP I+LVL ++VVW + G NSM+++ D CL
Sbjct: 329 TKPVKPFSVCYPASAVKTTQMGPAVPIIELVLDRQDVVWKMFGSNSMMRVTKKSVDLWCL 388
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
G VDGG SI+IG QLE+NLLQFDL + +LGFS S+L + C
Sbjct: 389 GVVDGGAIDGPSIMIGGLQLEDNLLQFDLQSKKLGFSSSILSKGTNC 435
>gi|50878435|gb|AAT85209.1| unknown protein [Oryza sativa Japonica Group]
Length = 255
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 133/249 (53%), Positives = 169/249 (67%), Gaps = 2/249 (0%)
Query: 166 IAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNLKYT 224
+ L R++ A P+QLAA F RKFALCL P G ++FGD PY D+SK+L YT
Sbjct: 1 MVSLSRARFAFPTQLAATFRFSRKFALCLPPAAAAGVVIFGDAPYVFQPGVDLSKSLIYT 60
Query: 225 PLFINKVNTAS-GFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNP 283
PL +N V+T G+ S EYF+G+T + VNG+AVPLN TLL+I+ +GVGGTK+STV P
Sbjct: 61 PLLVNPVSTGGVSTKGDKSTEYFVGLTRIKVNGRAVPLNTTLLAINKKGVGGTKLSTVTP 120
Query: 284 YTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNK 343
YTVLETSI+KA+ AFA+ + RV VAPF C+ + TR+GP VP ++LV Q++
Sbjct: 121 YTVLETSIHKAVTDAFAAETSMIPRVPAVAPFKLCYDGSKVAGTRVGPAVPTVELVFQSE 180
Query: 344 NVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
W + G NSMV G ALCLG VDGGV TS+VIG +E+NLL+FDL SRLGFS
Sbjct: 181 ATSWVVFGANSMVATKGGALCLGVVDGGVASETSVVIGGHMMEDNLLEFDLVGSRLGFSS 240
Query: 404 SLLFERATC 412
SLLF + TC
Sbjct: 241 SLLFRQTTC 249
>gi|328684581|gb|AEB33720.1| conglutin gamma 2 [Lupinus angustifolius]
Length = 431
Score = 250 bits (639), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 155/415 (37%), Positives = 217/415 (52%), Gaps = 45/415 (10%)
Query: 25 TSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKT 84
+S +P LVL + +D +T + I +RTPL+ V + + L G LWV C Y SST +
Sbjct: 26 SSSKPSLLVLPIQQDASTGLHWANIHKRTPLMQVPVLLDLNGKHLWVTCSYHYSSSTYQA 85
Query: 85 ARCGSAQCNLANAK-------------ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQS 131
C S QC+ AN+ C C NP++ G++ DVL I S
Sbjct: 86 PFCHSTQCSRANSHQCFTCTDSATTRPGCHNNTCALMTSNPVTQEAGFGELAQDVLPIHS 145
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQ-GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
T G G V V F+F C F+ Q GL + + G GLG + ++LP+QL + F L+R+F
Sbjct: 146 THGSKLGPMVKVLQFLFSCAPSFLAQKGLPNNIQGALGLGHAPISLPNQLFSHFGLRRQF 205
Query: 191 ALCLS--PFDDGAIVFGDGPYYDLNN------FDVSKNLKYTPLFINKVNTASGFLGEPS 242
+CLS P +GAI+FGD YD NN +V ++ YTPL I+
Sbjct: 206 TMCLSRYPTSNGAILFGD--IYDPNNNYIDNSVEVLLDMVYTPLGISLQG---------- 253
Query: 243 VEYFIGVTSVHVNGK-AVPLNKTLLSIDNEG---VGGTKISTVNPYTVLETSIYKALVQA 298
EY + V+++ VN VP + N G +GG I+T NPYT+L SIY+ Q
Sbjct: 254 -EYLMQVSAIRVNKHIVVPTKNPSMLSSNHGDSRIGGVMITTTNPYTILHHSIYEVFTQV 312
Query: 299 FASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI 358
FA+ +PK A+V V PFG CF + +I +P ++ V+ + + VW I +N MVQ
Sbjct: 313 FANNIPKQAQVEAVGPFGLCFDSK-----KISGGIPNVEFVMDSPDDVWRISEENLMVQA 367
Query: 359 GGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF-SDSLLFERATC 412
CLGFVDGG++ RT I +GA QLE NL+ FD A SR+ F S+ L TC
Sbjct: 368 QNGVSCLGFVDGGMHTRTEIALGAHQLEENLVVFDFAKSRVEFNSNPLKSHGKTC 422
>gi|67966634|emb|CAC17729.2| conglutin gamma [Lupinus albus]
Length = 448
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 154/408 (37%), Positives = 219/408 (53%), Gaps = 45/408 (11%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ 91
LVL + +D +T + I +RTPL+ V + + L G LWV C Y SST + C S Q
Sbjct: 50 LVLPIQQDASTGLHWANIHKRTPLMQVPVLLDLNGKHLWVTCSYHYSSSTYQAPFCHSTQ 109
Query: 92 CNLANAK-------------ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPG 138
C+ AN+ C C NP++ G++ DVL+I ST G G
Sbjct: 110 CSRANSHHCFTCTDSATSRPGCHNNTCALMSSNPVTQEAGFGELAQDVLAIHSTHGSKLG 169
Query: 139 RAVTVPNFIFLCGSEFVLQ-GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS-- 195
V V ++F C F+ Q GL + V G GLG + ++L +QL + F LKR+FA+CLS
Sbjct: 170 PMVRVLQYLFSCAPSFLAQKGLPNNVQGPLGLGHAPISLQNQLFSHFGLKRQFAMCLSRY 229
Query: 196 PFDDGAIVFGDGPYYDLNN------FDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
P +GAI+FGD YDL+N DV ++ YTPL I++ EYF+ V
Sbjct: 230 PTSNGAILFGD--IYDLDNNYIHNSIDVLIDMVYTPLRISQQG-----------EYFMQV 276
Query: 250 TSVHVNGKAV--PLNKTLLS--IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
++ VN V N ++LS + +GG I+T NPYT+L SI++ Q FA+ MPK
Sbjct: 277 NAIRVNKHMVVPTKNPSMLSSYHGDSRIGGAMITTTNPYTILHHSIFEVFTQVFANNMPK 336
Query: 306 VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
A+V V PFG C+ + ++ +P ++ V+ + + VW I +N MVQ CL
Sbjct: 337 EAQVESVGPFGLCYDSR-----KLSGGIPSVEFVMDSHDDVWRISDENLMVQAQNGVSCL 391
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF-SDSLLFERATC 412
GFVDGG++ RT IV+G QLE N++ FDL SR+ F S+SL TC
Sbjct: 392 GFVDGGMHTRTEIVLGTHQLEENMVVFDLERSRVEFNSNSLKSHGKTC 439
>gi|224127973|ref|XP_002329223.1| predicted protein [Populus trichocarpa]
gi|222871004|gb|EEF08135.1| predicted protein [Populus trichocarpa]
Length = 389
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 157/386 (40%), Positives = 216/386 (55%), Gaps = 36/386 (9%)
Query: 56 VPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGP------ 109
+P KL + LG WV+C+ Y+SST + C S+ +A C IC P
Sbjct: 1 MPTKLVLDLGATYSWVNCDD-YISSTYQHVPCNSSIFYSLSAYGCED-ICDGPPGPNCAN 58
Query: 110 -------DNPIS-----NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQ 157
D P+ + D +D L++ + N G ++ NFIF C L+
Sbjct: 59 NSFIFLLDGPLETVDYKKVNSLNDALVDYLALLN----NLGSLSSIDNFIFSCARTGFLK 114
Query: 158 GLASGVVGIAGLGRSKVALPSQLAAAFSLKRK-FALCLSPF--DDGAIVFGD-GPYYDLN 213
GLA GV G+A LG S +++P Q+ AFS FA+CLS G +FG GPY L+
Sbjct: 115 GLAKGVTGLASLGNSNLSIPVQINKAFSSSPNCFAMCLSGSISQPGVALFGSKGPYNFLH 174
Query: 214 NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGV 273
D+SK+L YTPL N S + S EY++G+TS+ VNGK V NK LL+ ++ G
Sbjct: 175 GIDLSKSLLYTPLIFNPFGRDSDPYTQRSPEYYVGLTSIKVNGKMVAFNKALLAFNDRGY 234
Query: 274 GGTKISTVNPYTVLETSIYKALVQAF----ASAMPKVARVAPVAPFGACFRLQDIGFTRI 329
GGT+IST+ PYT L++SIYKA AF AS+ + PV PF C+ + + T++
Sbjct: 235 GGTRISTLVPYTKLQSSIYKAFTLAFLKEAASSAFNLTTTKPVKPFRVCYPARAVKTTQM 294
Query: 330 GPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG---DALCLGFVDGGVNPRTSIVIGARQLE 386
GP VP I+LVL ++VVW I G NSMV++ D CLGFVDGG++ SI+IG QLE
Sbjct: 295 GPAVPIIELVLDRQDVVWKIFGSNSMVRVTKKSVDLWCLGFVDGGID-GPSIMIGGLQLE 353
Query: 387 NNLLQFDLATSRLGFSDSLLFERATC 412
+NLLQFDL + +LGFS S+L + C
Sbjct: 354 DNLLQFDLQSQKLGFSSSILSKGTNC 379
>gi|255559492|ref|XP_002520766.1| conserved hypothetical protein [Ricinus communis]
gi|223540151|gb|EEF41728.1| conserved hypothetical protein [Ricinus communis]
Length = 273
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 133/267 (49%), Positives = 170/267 (63%), Gaps = 30/267 (11%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKL 60
M+V+ F L SLL+ +SP AQ SFRP+ALV+ V+KD +TLQYVTQ++QRTPLVP+ L
Sbjct: 1 MAVSVHFFLASSLLLIFVSPSIAQQSFRPRALVVPVSKDASTLQYVTQVEQRTPLVPINL 60
Query: 61 TVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG-----------GGICGAGP 109
VHLGG LW+DCE+ YVSST + ARCGSA C+L + CG CG P
Sbjct: 61 VVHLGGKFLWIDCEQNYVSSTYRPARCGSALCSLGGSDGCGDCFSGPRPGCNNNTCGVSP 120
Query: 110 DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGL 169
DNP +NT T G++ DV+S+ ST+G NPGRAVTVP F+F C F+LQGLA+G VGIAGL
Sbjct: 121 DNPFTNTATGGELATDVVSVNSTNGSNPGRAVTVPRFLFACAPTFLLQGLATGAVGIAGL 180
Query: 170 GRSKVALPSQLAAAFSLKRKFALCLSPF---DD----GAIVFGDGPY-------YDLNN- 214
GR++ A PSQ A+AFSL RKFA+CL DD G I G P Y + N
Sbjct: 181 GRNRAAFPSQFASAFSLHRKFAICLGSVQVSDDVLCLGLIDGGSNPRTSIVIGGYQVENN 240
Query: 215 ---FDV-SKNLKYTPLFINKVNTASGF 237
FD+ + L ++ L ++ T + F
Sbjct: 241 LLQFDLATSRLGFSSLLFGRMTTCANF 267
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 49/67 (73%), Positives = 53/67 (79%), Gaps = 3/67 (4%)
Query: 356 VQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERAT--CT 413
VQ+ D LCLG +DGG NPRTSIVIG Q+ENNLLQFDLATSRLGFS SLLF R T
Sbjct: 208 VQVSDDVLCLGLIDGGSNPRTSIVIGGYQVENNLLQFDLATSRLGFS-SLLFGRMTTCAN 266
Query: 414 FNFTSIA 420
FNFTS+A
Sbjct: 267 FNFTSVA 273
>gi|255647537|gb|ACU24232.1| unknown [Glycine max]
Length = 403
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 157/403 (38%), Positives = 220/403 (54%), Gaps = 50/403 (12%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST------NKT 84
+L L V KD +T QY+T + TP+ K + LGG+ILW DC S+ +++
Sbjct: 27 SLTLPVTKDDSTHQYLTTLSYGTPVESAKFVLDLGGSILWADCASRTTPSSTLAPIFHRS 86
Query: 85 ARCGSAQ----------CNLANAKACGGGICGAGPDNPISNTG-THGDIRIDVLSIQSTD 133
RC +A+ +LAN C +N IS T G++ D++ +S +
Sbjct: 87 IRCLTAKGPEIETHRWLSSLANPID-QDQPCQIPAENSISGKRVTEGELVEDLVINRSHE 145
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+F C +L GLA+G G+ GL RS+ + SQ+ + +RK LC
Sbjct: 146 ------------LLFTCSPTLLLNGLATGAKGMVGLDRSRTSFSSQVFHSLGTQRKITLC 193
Query: 194 LSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
LS G + FG+ + ++ ++L +TPL N+ T + PS I V SV
Sbjct: 194 LSS-SSGIVQFGNVAHESQPGSEIFRSLTFTPLVANQDQTQT----HPS----INVNSVK 244
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF---ASAMPKVARVA 310
+NGK V S D+ GG ++STV PYT L+TSIY A+ AS+M + RV
Sbjct: 245 INGKKV-------SFDSPLGGGAQLSTVVPYTTLQTSIYANFESAYLKAASSM-SMKRVD 296
Query: 311 PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG 370
PV+PFG CF +G +++GP VP IDLVLQ++ V WSIHG+NSMVQ+ D +CLGFVDG
Sbjct: 297 PVSPFGLCFESNGVGSSQVGPNVPVIDLVLQSEMVKWSIHGRNSMVQVNDDVMCLGFVDG 356
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
G NPR IVIG QLE+ L+Q D TS +GFS SLL + ATC+
Sbjct: 357 GENPRNPIVIGGYQLEDVLVQIDFDTSMVGFSPSLLTKHATCS 399
>gi|356548995|ref|XP_003542884.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 403
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 157/403 (38%), Positives = 219/403 (54%), Gaps = 50/403 (12%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST------NKT 84
+L L V KD +T QY+T + TP+ K + LGG+ILW DC S+ +++
Sbjct: 27 SLTLPVTKDDSTHQYLTTLSYGTPVESAKFVLDLGGSILWADCASRTTPSSTLAPIFHRS 86
Query: 85 ARCGSAQ----------CNLANAKACGGGICGAGPDNPISNTG-THGDIRIDVLSIQSTD 133
RC +A+ +LAN C +N IS T G++ D++ +S +
Sbjct: 87 IRCLTAKGPEIETHRWLSSLANPID-QDQPCQIPAENSISGKRVTEGELVEDLVINRSHE 145
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+F C +L GLA+G G+ GL RS+ + SQ+ + +RK LC
Sbjct: 146 ------------LLFTCSPTLLLNGLATGAKGMVGLDRSRTSFSSQVFHSLGTQRKITLC 193
Query: 194 LSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
LS G + FG+ + ++ ++L +TPL N+ T + PS I V SV
Sbjct: 194 LSS-SSGIVQFGNVAHESQPGSEIFRSLTFTPLVANQDQTQT----HPS----INVNSVK 244
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF---ASAMPKVARVA 310
+NGK V S D GG ++STV PYT L+TSIY A+ AS+M + RV
Sbjct: 245 INGKKV-------SFDTPLGGGAQLSTVVPYTTLQTSIYANFESAYLKAASSM-SMKRVD 296
Query: 311 PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG 370
PV+PFG CF +G +++GP VP IDLVLQ++ V WSIHG+NSMVQ+ D +CLGFVDG
Sbjct: 297 PVSPFGLCFESNGVGSSQVGPNVPVIDLVLQSEMVKWSIHGRNSMVQVNDDVMCLGFVDG 356
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
G NPR IVIG QLE+ L+Q D TS +GFS SLL + ATC+
Sbjct: 357 GENPRNPIVIGGYQLEDVLVQIDFDTSMVGFSPSLLTKHATCS 399
>gi|449466574|ref|XP_004151001.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 414
Score = 241 bits (616), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 148/390 (37%), Positives = 221/390 (56%), Gaps = 19/390 (4%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSA 90
+LV+ + KD+ T QYV + +P+ PV L V LGG LW+ C S + +
Sbjct: 25 SLVIPLTKDSLTNQYVATVFHGSPIKPVHLAVDLGGQSLWMACGGSSSSRSIPSRSIQCI 84
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
GG C NP + + D ++++S D V + + C
Sbjct: 85 AATGGGRSGSVGGACDVIAGNPFGDLEGKAILVEDTVAVRSLDRSTAAVIVALHS----C 140
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
F+LQGLA V G+ GLGR++++LP+Q+A R+F+LCLS +VF D
Sbjct: 141 APRFLLQGLAKSVKGVLGLGRNQISLPAQIATELGSHRRFSLCLS--STNGVVFPDSGSQ 198
Query: 211 D-LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID 269
D + ++S +L YTP+ K++ S EYFI V ++ V+G + LNK+LL ++
Sbjct: 199 DSVYGSEISSSLTYTPILTKKIDALQ------SPEYFINVKAIKVDGNRLDLNKSLLDLE 252
Query: 270 NEGVGG----TKISTVNPYTVLETSIYKALVQAFASAMP--KVARVAPVAPFGACFRLQD 323
G G T++STV PYTVLE+SI+ +L AF +A + VAPVAPF CF ++
Sbjct: 253 GVGDGEGGGGTRLSTVVPYTVLESSIFNSLTAAFRAAAAAMNMKEVAPVAPFEVCFESEN 312
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGAR 383
+ T GP VP+I+L+LQ++ V W I+G+NSMV++ +A CLGFVDGG+ PR +IV+G
Sbjct: 313 MEMTAAGPKVPEIELILQSEMVGWKIYGRNSMVKVNDEAYCLGFVDGGLKPRNAIVLGGY 372
Query: 384 QLENNLLQFDLATSRLGFSDSLLFERATCT 413
Q+E+ +L FD+ TS LGFS SLL + +C+
Sbjct: 373 QMEDIVLDFDMGTSMLGFSSSLLQRKRSCS 402
>gi|449526822|ref|XP_004170412.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 414
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 148/390 (37%), Positives = 220/390 (56%), Gaps = 19/390 (4%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSA 90
+LV+ + KD+ T QYV + +P+ PV L V LGG LW+ C S + +
Sbjct: 25 SLVIPLTKDSLTNQYVATVFHGSPIKPVHLAVDLGGQSLWMACGGSSSSRSIPSRSIQCI 84
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
GG C NP + + D ++++S D V + + C
Sbjct: 85 AATGGGRSGSVGGACDVIAGNPFGDLEGKAILVEDTVAVRSLDRSTAAVIVALHS----C 140
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
F+LQGLA V G+ GLGR++++LP+Q+A R+F+LCLS +VF D
Sbjct: 141 APRFLLQGLAKSVKGVLGLGRNQISLPAQIATELGSHRRFSLCLS--STNGVVFPDSGSQ 198
Query: 211 D-LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID 269
D + ++S +L YTP+ K++ S EYFI V ++ V+G + LNK+LL ++
Sbjct: 199 DSVYGSEISSSLTYTPILTKKIDALQ------SPEYFINVKAIKVDGNRLDLNKSLLDLE 252
Query: 270 NEGVGG----TKISTVNPYTVLETSIYKALVQAFASAMP--KVARVAPVAPFGACFRLQD 323
G G T++STV PYTVLE+SI+ +L AF +A + VAPVAPF CF ++
Sbjct: 253 GVGDGEGGGGTRLSTVVPYTVLESSIFNSLTAAFRAAAAAMNMKEVAPVAPFEVCFESEN 312
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGAR 383
+ T GP VP+I+L+LQ++ V W I+G+NSMV++ +A CLGFVDGG+ PR +IV+G
Sbjct: 313 MEMTAAGPKVPEIELILQSEMVGWKIYGRNSMVKVNDEAYCLGFVDGGLKPRNAIVLGGY 372
Query: 384 QLENNLLQFDLATSRLGFSDSLLFERATCT 413
Q+E+ +L FD+ TS LGFS SLL + C+
Sbjct: 373 QMEDIVLDFDMGTSMLGFSSSLLQRKRFCS 402
>gi|217073766|gb|ACJ85243.1| unknown [Medicago truncatula]
Length = 232
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 134/229 (58%), Positives = 160/229 (69%), Gaps = 14/229 (6%)
Query: 204 FGDGPYYDLNN--------FDVSKNLKYTPLFINKVNTASGFL-GEPSVEYFIGVTSVHV 254
FGDGPY L + FD SK+L YTPL IN V+TAS FL GE SVEYFIGV ++ +
Sbjct: 6 FGDGPYSFLADNPSLPNVVFD-SKSLTYTPLLINHVSTASAFLQGESSVEYFIGVKTIKI 64
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKVARVAPV 312
+GK V LN +LLSIDN+GVGGTKISTV+PYTVLE SIYKA+ AF AS +
Sbjct: 65 DGKVVSLNSSLLSIDNKGVGGTKISTVDPYTVLEASIYKAVTDAFVKASVARNITTEDSS 124
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV 372
PF C+ ++ T +G VP I+L+LQN NV+WS+ G NSMV I + LCLGFV+GGV
Sbjct: 125 PPFEFCYSFDNLPGTPLGASVPTIELLLQN-NVIWSMFGANSMVNINDEVLCLGFVNGGV 183
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFTSIA 420
N RTSIVIG QLENNLLQFDLA SRLGFS+++ + C FNFTS A
Sbjct: 184 NLRTSIVIGGYQLENNLLQFDLAASRLGFSNTIFAHQTDCFRFNFTSTA 232
>gi|356555630|ref|XP_003546133.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 403
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 152/403 (37%), Positives = 220/403 (54%), Gaps = 50/403 (12%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST------NKT 84
+L L V KD +T QY+T + TP+ K + LGG++LW DC S+ +++
Sbjct: 27 SLTLPVTKDHSTHQYLTILSYGTPVESAKFVLDLGGSLLWADCASRTTPSSTLAPIFHRS 86
Query: 85 ARCGSAQ----------CNLANAKACGGGICGAGPDNPISNTG-THGDIRIDVLSIQSTD 133
RC +A+ +LAN C +N I+ T G++ D++ +S +
Sbjct: 87 IRCLTAKGPEIETHRWLSSLANPID-QDQPCQITAENSITGKRVTEGELVEDLVIHRSHE 145
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+F C F+L GLA+ GI GL +S+++ SQ+ + ++RK LC
Sbjct: 146 ------------LLFTCSPTFLLNGLATDAKGIIGLDKSRISFSSQVFHSLKIQRKITLC 193
Query: 194 LSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
LS G I FG + ++ + L +TPL N+ T + + I V SV
Sbjct: 194 LS-HTSGVIQFGKMTHKSQTESEIFRYLTFTPLVANQDPTQT--------QSSINVNSVK 244
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF---ASAMPKVARVA 310
+NGK V + L GG ++STV PYT L+TSIY A+ AS+M + RV
Sbjct: 245 INGKKVAFDTPL-------GGGAQLSTVVPYTTLQTSIYDNFESAYLKAASSM-DMKRVD 296
Query: 311 PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG 370
PV+PFG CF +G +++GP VP IDLVLQ++ V WSI+G+NSMVQ+ D +CLGFVDG
Sbjct: 297 PVSPFGLCFESNGVGSSQVGPNVPIIDLVLQSEMVKWSIYGRNSMVQVSDDVMCLGFVDG 356
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
G NPR SIVIG QLE+ L+Q D TS +GFS SLL ++A+C+
Sbjct: 357 GENPRNSIVIGGFQLEDVLVQIDFDTSMVGFSPSLLTKQASCS 399
>gi|115442107|ref|NP_001045333.1| Os01g0937200 [Oryza sativa Japonica Group]
gi|20160768|dbj|BAB89709.1| putative xylanase inhibitor [Oryza sativa Japonica Group]
gi|113534864|dbj|BAF07247.1| Os01g0937200 [Oryza sativa Japonica Group]
Length = 402
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 159/424 (37%), Positives = 221/424 (52%), Gaps = 38/424 (8%)
Query: 3 VAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTV 62
++Q+F L +++ F L + A S +ALV + KDT T + I + L+
Sbjct: 4 ISQAFFL--AIIFFILVQLQASPSPAIQALVAPITKDTKTGLHTLSISNKNYLL------ 55
Query: 63 HLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANA--KAC--GGGICGAGPDNPISNTGT 118
L G +LW C S ++ T C S +C A+ K+C GG C A P NP++
Sbjct: 56 DLSGQLLWSPC-----SPSHPTVPCSSGECAAASGAHKSCNNGGRACTARPTNPVTGERA 110
Query: 119 HGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPS 178
GD+ + + +TDG+ VTV + C +L+ L + G AGLGR V+LP+
Sbjct: 111 VGDLTLADIVANATDGKTLTSEVTVRGVVSSCAPGSLLRSLPAMAAGDAGLGRGGVSLPT 170
Query: 179 QLAAAFSLKRKFALCL--SPFDDGAIVFGDGPYYDL--NNFDVSKNLKYTPLFINKVNTA 234
QL + SLKR+FA+CL + G FG GPY + FD S L YT L + N +
Sbjct: 171 QLYSKLSLKRQFAVCLPSTAAAPGVAFFGGGPYNLMPPTLFDASTVLSYTDLARSPTNPS 230
Query: 235 SGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKA 294
+ Y I + + +N +AV L LS GG + T PYTVL +Y+
Sbjct: 231 A---------YSIKLRGIAMNQEAVHLPPGALSRG----GGVTLDTAAPYTVLRRDVYRP 277
Query: 295 LVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQN-KNVVWSIHGQN 353
V AFA A ++ R+ VAPF CF +GFTR+G V IDLV +N W++ G N
Sbjct: 278 FVAAFAKATARITRMPSVAPFELCFNSSALGFTRVGYAVAPIDLVTSGGRN--WTVFGSN 335
Query: 354 SMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC- 412
S+ Q+ GD CL FVDGG R+++ +GA Q+ENN L FD A SRLGFS +L F R TC
Sbjct: 336 SLAQVAGDTACLAFVDGGRAARSAVTVGAFQMENNFLLFDEAASRLGFSGTLFFIRTTCG 395
Query: 413 TFNF 416
FNF
Sbjct: 396 NFNF 399
>gi|218189696|gb|EEC72123.1| hypothetical protein OsI_05112 [Oryza sativa Indica Group]
Length = 534
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 159/422 (37%), Positives = 216/422 (51%), Gaps = 36/422 (8%)
Query: 5 QSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHL 64
Q L+L L IF + A S +ALV + KDT T + I + L+ L
Sbjct: 136 QQILVLNGLAIFPFDYLQASPSPAIQALVAPITKDTKTGLHTLSISNKNYLL------DL 189
Query: 65 GGNILWVDCEKGYVSSTNKTARCGSAQCNLANA--KAC--GGGICGAGPDNPISNTGTHG 120
G +LW C S ++ T C S +C A+ K+C GG C A P NP++ G
Sbjct: 190 SGQLLWSPC-----SPSHPTVPCSSGECAAASGAHKSCNNGGRACTARPTNPVTGERAVG 244
Query: 121 DIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQL 180
D+ + + +TDG+ VTV + C +L+ L + G AGLGR V+LP+QL
Sbjct: 245 DLTLADIVANATDGKTLTSEVTVRGVVSSCAPGSLLRSLPAMAAGDAGLGRGGVSLPTQL 304
Query: 181 AAAFSLKRKFALCL--SPFDDGAIVFGDGPYYDL--NNFDVSKNLKYTPLFINKVNTASG 236
+ SLKR+FA+CL + G FG GPY + FD S L YT L + N ++
Sbjct: 305 YSKLSLKRQFAVCLPSTAAAPGVAFFGGGPYNLMPPTLFDASAVLSYTDLARSPTNPSA- 363
Query: 237 FLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALV 296
Y I + + +N +AV L LS GG + T PYTVL +Y+ V
Sbjct: 364 --------YSIKLRGIAMNQEAVHLPPGALSRG----GGVTLDTAAPYTVLRRDVYRPFV 411
Query: 297 QAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQN-KNVVWSIHGQNSM 355
AFA A ++ R+ VAPF CF +GFTR+G V IDLV +N W++ G NS+
Sbjct: 412 AAFAKATARIPRMPSVAPFELCFNSSALGFTRVGYAVAPIDLVTSGGRN--WTVFGSNSL 469
Query: 356 VQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TF 414
Q+ D CL FVDGG R+++ +GA Q+ENN L FD A SRLGFS +L F R TC F
Sbjct: 470 AQVASDTACLAFVDGGRAARSAVTVGAFQMENNFLLFDEAASRLGFSGTLFFIRTTCGNF 529
Query: 415 NF 416
NF
Sbjct: 530 NF 531
>gi|255577645|ref|XP_002529699.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223530801|gb|EEF32665.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 407
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 148/430 (34%), Positives = 220/430 (51%), Gaps = 34/430 (7%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKL 60
M + FLLL L +F+ ++AQ K LV + KD+ T Y +++ +
Sbjct: 1 MHILAIFLLLV-LALFTSFEVSAQPY---KTLVTSINKDSKTPLYSIELQG-------QY 49
Query: 61 TVHLGGNILWVDCEKGYVSSTN--KTARCGSAQCNL----ANAKACGGGICGAGPDNPIS 114
+ + LW C+ + + C + + NL N + G +C P NP++
Sbjct: 50 VIDINAPFLWYTCQGQWFIYPMGCSSLECINGRTNLFCPSDNIYSDGQCLCTVTPVNPVT 109
Query: 115 NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKV 174
++ + + + + T GRNP ++ N C +LQ L G G+AGL + +
Sbjct: 110 SSCSSAQLTYKSIIVAWTAGRNPTVSINFNNIYVSCAPTSLLQSLPEGSSGVAGLSWNPL 169
Query: 175 ALPSQLA-AAFSLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKV 231
+L Q L FA+CL +G I FG GPY+ L+ +VS L YTPL ++
Sbjct: 170 SLAMQFTYPHLELTHMFAMCLPSTSGANGVIFFGQGPYF-LHQVEVSSVLAYTPLL--RL 226
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
N S EYFIGV+ + +NG+ + + D G GG +IST+ PYT L + I
Sbjct: 227 NN--------SEEYFIGVSGISINGEKIKFQSSTFEFDQLGNGGVQISTIVPYTTLRSDI 278
Query: 292 YKALVQAFASAMPKVARVAPVA-PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIH 350
YK ++ F+ A + R V PF C + G+ +G VP+IDL L + +W I+
Sbjct: 279 YKEFLKEFSKATKGIPRAQKVVHPFDLCLVTSENGWRHVGLSVPEIDLELGD-GAIWRIY 337
Query: 351 GQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERA 410
G NS+ Q+ D CL F+DGG + + + VIG+ Q+ENNLLQFDLA SRLGFS SLLF
Sbjct: 338 GANSLKQVEDDVACLAFIDGGKSAKRAAVIGSYQMENNLLQFDLAASRLGFSSSLLFYNI 397
Query: 411 TCT-FNFTSI 419
TC+ FNFT+I
Sbjct: 398 TCSNFNFTTI 407
>gi|224145466|ref|XP_002336232.1| predicted protein [Populus trichocarpa]
gi|222832781|gb|EEE71258.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 189/317 (59%), Gaps = 18/317 (5%)
Query: 109 PDNPISNTGTHG--DIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGI 166
P P+ +G D +D L++ + N G ++ NFIF C L+GLA GV G+
Sbjct: 41 PIKPVDYKKVNGLNDALVDYLALLN----NLGSLSSIDNFIFSCARTGFLKGLAKGVTGL 96
Query: 167 AGLGRSKVALPSQLAAAFSLKRK-FALCLSPF--DDGAIVFGD-GPYYDLNNFDVSKNLK 222
A LG S +++P Q++ AFS FA+CLS G +FG GPY L+ D+SK+L
Sbjct: 97 ASLGNSNLSIPVQISKAFSSSPNCFAMCLSGSISQPGVALFGSKGPYNFLHGIDLSKSLL 156
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
YTPL N S EY++G+TS+ VNG+ V NK LL+ ++ G GGT+IST+
Sbjct: 157 YTPLIFNPFGKDFDPYSHRSPEYYVGLTSIKVNGEMVAFNKALLAFNDRGYGGTRISTLV 216
Query: 283 PYTVLETSIYKALVQAF----ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL 338
PYT L++SIYKA AF AS+ + PV PF C+ + + T++GP VP I+L
Sbjct: 217 PYTKLQSSIYKAFTLAFLKEAASSAFNLTTTKPVKPFRVCYPARAVKTTQMGPAVPIIEL 276
Query: 339 VLQNKNVVWSIHGQNSMVQIGG---DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLA 395
VL ++VVW I G NSMV++ D CLGFVDGG++ SI+IG QLE+NLLQFDL
Sbjct: 277 VLDRQDVVWKIFGSNSMVRVTKKSVDLWCLGFVDGGID-GPSIMIGGLQLEDNLLQFDLQ 335
Query: 396 TSRLGFSDSLLFERATC 412
+ +LGFS S+L + C
Sbjct: 336 SQKLGFSSSILSKGTNC 352
>gi|356548993|ref|XP_003542883.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 473
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 165/431 (38%), Positives = 230/431 (53%), Gaps = 20/431 (4%)
Query: 2 SVAQSFLLLCSLLIFSLSPITAQTSFRPKA--LVLRVAKDTATLQYVTQIKQRTPLVPVK 59
S + F LL S+ +FS+ + A + K+ +L + KD AT Y T + TP
Sbjct: 42 SFSIHFFLLLSIALFSVCCLAASQAPTTKSHPYILPIKKDPATNLYYTSVGIGTPRHNFD 101
Query: 60 LTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGG--------GICGAGPDN 111
L + L G LW DC+ Y SS+ + CGS QC C G C A N
Sbjct: 102 LVIDLSGENLWYDCDTHYNSSSYRPIACGSKQCPEIGCVGCNGPFKPGCTNNTCPANVIN 161
Query: 112 PISNTGTHGDIRIDVLSI-QSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLG 170
++ G + D + I Q+ + F E L GL + GI GL
Sbjct: 162 QLAKFIYSGGLGEDFIFIRQNKVSGLLSSCIDTDAFPSFSDDELPLFGLPNNTKGIIGLS 221
Query: 171 RSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINK 230
+S++ALP QLA+A + KF+LCL ++ + + +SK LK TPL +N
Sbjct: 222 KSQLALPIQLASANKVPSKFSLCLPSLNNQGFTNLLVRAGEEHPQGISKFLKTTPLIVNN 281
Query: 231 VNT-ASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLET 289
V+T A G PS EYFI V +V ++G V L +LL+IDN+G GGTK+ST++P+T L+T
Sbjct: 282 VSTGAISVEGVPSKEYFIDVKAVQIDGNVVNLKPSLLAIDNKGNGGTKLSTMSPFTELQT 341
Query: 290 SIYKALVQAF--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVW 347
++YK ++ F ++ ++ RVA VAPF AC+ I + G VVP IDLVL+ V W
Sbjct: 342 TVYKTFIRDFIKKASDRRLKRVASVAPFEACYDSTSIRNSSTGLVVPTIDLVLRG-GVQW 400
Query: 348 SIHGQNSMVQIGGDALCLGFVDGGVNPR-----TSIVIGARQLENNLLQFDLATSRLGFS 402
+I+G NSMV + CL VDGG PR SIVIG QLE+NLL+FD+A+S+L FS
Sbjct: 401 TIYGANSMVMAKKNVACLAIVDGGTEPRMSFVKASIVIGGYQLEDNLLEFDVASSKLSFS 460
Query: 403 DSLLFERATCT 413
SLL ATC+
Sbjct: 461 SSLLLHNATCS 471
>gi|125605769|gb|EAZ44805.1| hypothetical protein OsJ_29439 [Oryza sativa Japonica Group]
Length = 453
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 143/367 (38%), Positives = 195/367 (53%), Gaps = 45/367 (12%)
Query: 24 QTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHL-GGNILWVDCEK--GYVSS 80
Q RPKA+ + V +D AT QYV +QRTP V VK V L GG LWVDC+ GY SS
Sbjct: 35 QRPTRPKAVAMPVVRDGATRQYVATFQQRTPRVAVKAVVDLSGGATLWVDCDAAAGYASS 94
Query: 81 TNKTARCGSAQCNLANAKACG---------------GGICGAGPDNPISNTGTHGDIRID 125
+ CGS C L + +C C +N ++++ G++ D
Sbjct: 95 SYAGVPCGSKPCRLVESPSCSYIASCLGSPPSPACLNRTCTGHAENTVTSSVGRGNVVTD 154
Query: 126 VLSIQSTDGRNP---GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA 182
VLS+ +T P G T P F+F CG + QGLA+G G+A L R+++ALP+QLA
Sbjct: 155 VLSLPTTFPSAPVRQGPLATAPAFLFTCGPTSLTQGLAAGAAGMASLSRARLALPAQLAG 214
Query: 183 AFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPS 242
F RKFALCL D G +VFGD Y + D S +L YTPL + +S
Sbjct: 215 TFRFSRKFALCLPSVDAGVVVFGDA-RYVFDGMDHSNSLLYTPLITRTTDRSS------- 266
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
EYFI + V V+ +AVPLN TLL + GTK+STV+PYTVLETSI++A+ +AFA++
Sbjct: 267 -EYFISLKRVVVDDRAVPLNATLLDV------GTKLSTVSPYTVLETSIHEAVTRAFAAS 319
Query: 303 MPK--VARVAPVAPFGACFRLQDIGFTRI--GPVVP-----QIDLVLQNKNVVWSIHGQN 353
M + RV VAPF C+ + + I P VP + +++K W + G N
Sbjct: 320 MATAGIPRVPAVAPFELCYDGSKVESSAITGEPAVPVVFELHVQSEVRSKVAPWMVSGAN 379
Query: 354 SMVQIGG 360
M + G
Sbjct: 380 LMARADG 386
>gi|356518052|ref|XP_003527698.1| PREDICTED: basic 7S globulin 2-like [Glycine max]
Length = 447
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 167/452 (36%), Positives = 228/452 (50%), Gaps = 56/452 (12%)
Query: 2 SVAQSFLLLCSLLIFSLS--PITAQTSFRPKALV-LRVAKDTATLQYVTQIKQRTPLVPV 58
S A +L S+ +FS+S P+ +++ +PK + L + D AT + T I TP
Sbjct: 10 SAATIHFILLSISLFSVSSLPLPDESTSKPKKIFFLPIKIDAATNMFYTTIGIGTPQHST 69
Query: 59 KLTVHLGGNILWVDCE-KGYVSSTNKTARCGSAQCNLANAKACGGGI-----------CG 106
L + LGG LW DC + Y SS+ + C S +C A G I C
Sbjct: 70 NLVIDLGGENLWHDCSNRRYNSSSKRKIVCKSKKCPEGAACVSTGCIGPYKPGCAISDCT 129
Query: 107 AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCG------SEFVLQGLA 160
NP++ + + D + + T +P F+ C S LQGL
Sbjct: 130 ITVSNPLAQFSSSYTMVEDTIFLSHT---------YIPGFLAGCVDLDDGLSGNALQGLP 180
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD----GAIVFGDG---PYYDLN 213
GI G S++ALPSQL + L KF+LC ++ G I G G P +
Sbjct: 181 RTSKGIIGFSHSELALPSQLVLSNKLIPKFSLCFPSSNNLKGFGNIFIGAGGGHPQVE-- 238
Query: 214 NFDVSKNLKYTPLFINKVNT-ASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEG 272
SK L+ TPL +N V T A G PS+EYFI V ++ ++G + LN +LLSID +G
Sbjct: 239 ----SKFLQTTPLVVNPVATGAVSIYGAPSIEYFIDVKAIKIDGHVLNLNSSLLSIDKKG 294
Query: 273 VGGTKISTVNPYTVLETSIYKALVQAFASAMP--KVARVAPVAPFGACFRLQDIGFTRIG 330
GGTKIST+ P+T L +S+YK VQ F + ++ RVAPV PF ACF I + G
Sbjct: 295 NGGTKISTMTPWTELHSSLYKPFVQEFINKAEGRRMKRVAPVPPFDACFDTSTIRNSITG 354
Query: 331 PVVPQIDLVLQNKNVVWSIHGQNSM-VQIGGDALCLGFVDGGVNPR--------TSIVIG 381
VP IDLVL W+I+G NSM V + CL FVDGG+ P+ S+VIG
Sbjct: 355 LAVPSIDLVLPG-GAQWTIYGANSMTVMTSKNVACLAFVDGGMKPKEMHSIQLEASVVIG 413
Query: 382 ARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
QLE+NLL D+A+S+L FS SLL ATC+
Sbjct: 414 GHQLEDNLLVIDMASSKLSFSSSLLLRNATCS 445
>gi|297736988|emb|CBI26189.3| unnamed protein product [Vitis vinifera]
Length = 283
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 107/200 (53%), Positives = 134/200 (67%), Gaps = 13/200 (6%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY 77
SP AQTSFRPKALVL V+KD A+LQY+T I QRT LV + LT+ LGG LWVDC++GY
Sbjct: 18 FSPSLAQTSFRPKALVLPVSKDAASLQYITHINQRTHLVSIPLTLDLGGQFLWVDCDQGY 77
Query: 78 VSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNP 137
VSS+ + K C C PDN ++ T T G++ D +SIQSTDG NP
Sbjct: 78 VSSSYRPV-----------LKGCNYSTCVLSPDNTVTGTATSGEVGEDAVSIQSTDGSNP 126
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
GR V+V +F CGS F+L+GLAS V G+AGLGRS+VALPSQ ++AFS RKF++CLS
Sbjct: 127 GRVVSVRRLLFTCGSTFLLEGLASRVKGMAGLGRSRVALPSQFSSAFSFNRKFSICLSSS 186
Query: 198 --DDGAIVFGDGPYYDLNNF 215
G + FGDGPY+ + F
Sbjct: 187 TKSTGVVFFGDGPYHCIGLF 206
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/65 (75%), Positives = 55/65 (84%), Gaps = 1/65 (1%)
Query: 357 QIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FN 415
++ + LCLGFVDGGVNPRTSIVIG RQLE+NLLQFDLATSRLGFS SLL + TC+ FN
Sbjct: 219 KVSDNVLCLGFVDGGVNPRTSIVIGGRQLEDNLLQFDLATSRLGFSSSLLSRQTTCSNFN 278
Query: 416 FTSIA 420
FTS A
Sbjct: 279 FTSNA 283
>gi|15239656|ref|NP_197413.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15010798|gb|AAK74058.1| AT5g19120/T24G5_20 [Arabidopsis thaliana]
gi|15810069|gb|AAL06960.1| AT5g19120/T24G5_20 [Arabidopsis thaliana]
gi|332005272|gb|AED92655.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 386
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 144/406 (35%), Positives = 199/406 (49%), Gaps = 52/406 (12%)
Query: 7 FLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGG 66
F S LI S S I+ + +V V KD T QY+ QI+ PVKL V L G
Sbjct: 11 FFSFLSALIISKSQISDSVN----GVVFPVVKDLPTGQYLAQIRLGDSPDPVKLVVDLAG 66
Query: 67 NILWVDCEKGYVSSTNKTARCGSAQCNLANA-----------KACGGGICGAGPDNPISN 115
+ILW DC +VSS+ S+ C A + C N
Sbjct: 67 SILWFDCSSRHVSSSRNLISGSSSGCLKAKVGNERVSSSSSSRKDQNADCELLVKNDAFG 126
Query: 116 TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVA 175
G++ DV+S+ S +PG + +F C ++L+GLASG G+ GLGR++++
Sbjct: 127 ITARGELFSDVMSVGSVT--SPGTV----DLLFACTPPWLLRGLASGAQGVMGLGRAQIS 180
Query: 176 LPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTAS 235
LPSQLAA + +R+ + LSP + V ++ S++L YTPL
Sbjct: 181 LPSQLAAETNERRRLTVYLSPLNG---VVSTSSVEEVFGVAASRSLVYTPLLTGS----- 232
Query: 236 GFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKAL 295
S Y I V S+ VNG+ + + EG ++STV PYT+LE+SIYK
Sbjct: 233 ------SGNYVINVKSIRVNGEKLSV---------EGPLAVELSTVVPYTILESSIYKVF 277
Query: 296 VQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSM 355
+A+A A + V PVAPFG CF D+ F P +DL LQ++ V W IHG+N M
Sbjct: 278 AEAYAKAAGEATSVPPVAPFGLCFT-SDVDF-------PAVDLALQSEMVRWRIHGKNLM 329
Query: 356 VQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
V +GG C G VDGG + IV+G QLE +L FDL S +GF
Sbjct: 330 VDVGGGVRCSGIVDGGSSRVNPIVMGGLQLEGFILDFDLGNSMMGF 375
>gi|297807959|ref|XP_002871863.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297317700|gb|EFH48122.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 377
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 141/395 (35%), Positives = 197/395 (49%), Gaps = 50/395 (12%)
Query: 17 SLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG 76
+LS +Q S +V V KD T QY+ QI PVKL V L G+I W DC
Sbjct: 15 ALSISKSQISDSLNGVVFSVVKDLPTGQYIAQIHLGDSPEPVKLVVDLAGSIPWFDCSSR 74
Query: 77 YVSSTNK----------TARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDV 126
+VSS+ A+ G+ + + ++ C N G++ DV
Sbjct: 75 HVSSSRNLISGSSSGCLKAKVGNDRVSSSSRGDHQNADCELLVRNGAVGITARGELFSDV 134
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
+S S PG + +F C ++L+GLASG G+ GL R++++LPSQLAA +
Sbjct: 135 MSFGS-----PGTV----DLLFACTPPWLLRGLASGAQGVMGLARAQISLPSQLAAETNE 185
Query: 187 KRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
+R+ + LSP + V ++ VS++L YTPL L + S Y
Sbjct: 186 RRRLTVFLSPLNG---VVSTSSVEEVFGVAVSRSLVYTPL-----------LTDSSGNYV 231
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
I V S+ VNGK + + EG ++STV PYT+LE+SIY +A+A A +
Sbjct: 232 INVKSIRVNGKKLSV---------EGPLAVELSTVVPYTMLESSIYAVFAEAYAKAASEA 282
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
VAPVAPFG CF D+ F P +DL LQ++ V W I G+N MV +GG CLG
Sbjct: 283 TSVAPVAPFGLCFT-SDVDF-------PAVDLALQSEMVRWRIQGKNLMVDVGGGVRCLG 334
Query: 367 FVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
VDGG + IV+G QLE +L FDL S +GF
Sbjct: 335 IVDGGSSRVNPIVMGGLQLEGLILDFDLGNSMMGF 369
>gi|110742808|dbj|BAE99306.1| conglutin gamma - like protein [Arabidopsis thaliana]
Length = 386
Score = 204 bits (518), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 143/406 (35%), Positives = 198/406 (48%), Gaps = 52/406 (12%)
Query: 7 FLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGG 66
F S LI S S I+ + +V V KD T QY+ QI+ PVKL V L G
Sbjct: 11 FFSFLSALIISKSQISDSVN----GVVFPVVKDLPTGQYLAQIRLGDSPDPVKLVVDLAG 66
Query: 67 NILWVDCEKGYVSSTNKTARCGSAQCNLANA-----------KACGGGICGAGPDNPISN 115
+ILW DC +VSS+ S+ C A + C N
Sbjct: 67 SILWFDCSSRHVSSSRNLISGSSSGCLKAKVGNERVSSSSSSRKDQNADCELLVKNDAFG 126
Query: 116 TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVA 175
G++ DV+S+ S +PG + +F C ++L+GLASG G+ GLGR++++
Sbjct: 127 ITARGELFSDVMSVGSVT--SPGTV----DLLFACTPPWLLRGLASGAQGVMGLGRAQIS 180
Query: 176 LPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTAS 235
LPSQLAA + +R+ + LSP + V ++ S++L YTPL
Sbjct: 181 LPSQLAAETNERRRLTVYLSPLNG---VVSTSSVEEVFGVAASRSLVYTPLLTGS----- 232
Query: 236 GFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKAL 295
S Y I V S+ VNG+ + + EG ++STV PYT+LE+SIYK
Sbjct: 233 ------SGNYVINVKSIRVNGEKLSV---------EGPLAVELSTVVPYTILESSIYKVF 277
Query: 296 VQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSM 355
+A+A A + V PVAPFG CF D+ F P +DL LQ++ V W IHG+N M
Sbjct: 278 AEAYAKAAGEATSVPPVAPFGLCFT-SDVDF-------PAVDLALQSEMVRWRIHGKNLM 329
Query: 356 VQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
V +GG C G V GG + IV+G QLE +L FDL S +GF
Sbjct: 330 VDVGGGVRCSGIVGGGSSRVNPIVMGGLQLEGFILDFDLGNSMMGF 375
>gi|10334495|emb|CAC10209.1| putative extracellular dermal glycoprotein [Cicer arietinum]
Length = 369
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 149/378 (39%), Positives = 199/378 (52%), Gaps = 36/378 (9%)
Query: 60 LTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGG--------GICGAGPDN 111
L + L G LW DC+ Y SS+ CGS +C C G C A N
Sbjct: 4 LAIDLAGENLWYDCDTHYNSSSYIPIECGSKKCPDVACIGCNGPFKPGCTNNTCAANTIN 63
Query: 112 PISNTGTHGDIRIDVLSI--QSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGL 169
++N G + D + I Q G + F G++ L GL GI GL
Sbjct: 64 TLANFIFGGGLGQDFIFISQQKVSGL-LSSCIDTDGFPSFTGNDSPLNGLPKITKGIIGL 122
Query: 170 GRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV------FGDGPYYDLNNFDVSKNLKY 223
RS ++LP+QLA L KF+LCL + G P+ +L F ++
Sbjct: 123 ARSNLSLPTQLALKNELPPKFSLCLPSSNKQGFTNLLVGSIGKDPFQELYKF-----VQT 177
Query: 224 TPLFINKVNT-ASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
TPL +N V+T A G PS+EYFI V ++ ++GK V L +L SIDN+G GGTKIST++
Sbjct: 178 TPLIVNPVSTGAVSVQGVPSIEYFIDVKAIKIDGKVVNLKPSLWSIDNKGNGGTKISTMS 237
Query: 283 PYTVLETSIYKALVQAF--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVL 340
P+T L+ S+YK ++ F ++ K+ +V VAPF ACF T I +P+IDLVL
Sbjct: 238 PFTELQRSVYKPFIRDFLKKASDRKLKKVESVAPFEACFES-----TNIENSLPRIDLVL 292
Query: 341 QNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR-----TSIVIGARQLENNLLQFDLA 395
Q V WSI+G N MV + + CLGFVDGG PR SIVIG QLE+NLL FDL
Sbjct: 293 QG-GVQWSIYGNNLMVNVKKNVACLGFVDGGTEPRMSFAKASIVIGGHQLEDNLLVFDLN 351
Query: 396 TSRLGFSDSLLFERATCT 413
+S+L FS SLL A+C+
Sbjct: 352 SSKLSFSSSLLVHNASCS 369
>gi|297736987|emb|CBI26188.3| unnamed protein product [Vitis vinifera]
Length = 400
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 173/346 (50%), Gaps = 82/346 (23%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSA 90
+L+L V KD ATLQYVTQI TPLVP+KL + LG LW+DC G+VSS+N CGS
Sbjct: 102 SLLLPVTKDAATLQYVTQIHHGTPLVPIKLVLDLGAPFLWLDCSSGHVSSSNTPILCGSI 161
Query: 91 QCNLANAKACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
QC A G G C P N I+ G++ D+++++ ++ + F
Sbjct: 162 QCLTAKTSDSGHGGGTSTCRLSPKNTITGLAEAGELAEDMVAVEGSEMGS--------RF 213
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGD 206
+F C + +L+GLASG VG+ GLGR+++ALPSQLAA+ P +G +
Sbjct: 214 LFSCAPKPLLKGLASGTVGMLGLGRTRIALPSQLAAS-----------DPNSEGYFI--- 259
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
V S+ +NG+ V L
Sbjct: 260 -----------------------------------------SVKSIRINGRGVSLGTI-- 276
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKVARVAPVAPFGACFRLQDI 324
GGT++STV PYT ++ S+Y +A+ A+A + RV +APFG CFR +
Sbjct: 277 ------TGGTRLSTVVPYTTMKRSVYDIFTKAYIKAAASMNITRVESMAPFGVCFRSES- 329
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG 370
P VP IDLVLQ++ V W I G+NSMV++ +CLGF+DG
Sbjct: 330 ----SEPAVPTIDLVLQSEMVKWRILGRNSMVRVSDKVMCLGFLDG 371
>gi|222619835|gb|EEE55967.1| hypothetical protein OsJ_04693 [Oryza sativa Japonica Group]
Length = 432
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 135/414 (32%), Positives = 207/414 (50%), Gaps = 42/414 (10%)
Query: 29 PKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCG 88
P+ +++R+ KDT+T Y I+ + +L + LGG +LW C + + ++ C
Sbjct: 32 PRPILVRITKDTSTSLYTMSIRTGS-----RLVLDLGGPLLWSTCLAAHSTVPCRSDVCA 86
Query: 89 SA--QCNLANAKACGGGI------------CGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
+A Q N N + G C A P NP++ GD+ + TDG
Sbjct: 87 AAAVQDNPWNCSSSTDGRGSDGGGGRGLCACSAYPYNPLNGQCARGDVTTTPMLANVTDG 146
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
NP V P C +L L SG VG+AGL + ++LPSQ+AA+ ++RKFALCL
Sbjct: 147 VNPLYPVAFPVHA-ACAPGALLGSLPSGAVGVAGLSGAPLSLPSQVAASLKVERKFALCL 205
Query: 195 -SPFDDGAIVFGDGPYYDL---NNFD-VSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
GA +FG GP++ L F VS L Y N N +++ V
Sbjct: 206 PGGGGTGAAIFGGGPFHLLVVPEEFGMVSNGLSYISYLRNPKNGG----------FYLDV 255
Query: 250 TSVHVNGKAVPLNKTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
+ VN + + L++D G GG +STV PYT L IY+A+++A + + +AR
Sbjct: 256 VGIAVNHRGADVPPDSLALDAGTGHGGVMLSTVAPYTALRPDIYRAVIEAIDAELRLIAR 315
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV 368
P PF C++ + +TR+GP + +DL+L++ W+ G N +VQ+ + LC V
Sbjct: 316 APPSWPFERCYQRSAMWWTRVGPPLATVDLMLRSGG-NWTFFGSNMIVQVNEETLCFAIV 374
Query: 369 DGGVNP----RTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFT 417
+ G P +++IG QLE+NLL FDL RLG + L + R TC+ FNF+
Sbjct: 375 EMGPTPAMDESPAVIIGGFQLEDNLLVFDLEKGRLGSTGLLYWIRTTCSNFNFS 428
>gi|57899195|dbj|BAD87305.1| putative xylanase inhibitor [Oryza sativa Japonica Group]
Length = 428
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 132/410 (32%), Positives = 203/410 (49%), Gaps = 38/410 (9%)
Query: 29 PKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCG 88
P+ +++R+ KDT+T Y I+ + +L + LGG +LW C + + ++ C
Sbjct: 32 PRPILVRITKDTSTSLYTMSIRTGS-----RLVLDLGGPLLWSTCLAAHSTVPCRSDVCA 86
Query: 89 SA--QCNLANAKACGGGI------------CGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
+A Q N N + G C A P NP++ GD+ + TDG
Sbjct: 87 AAAVQDNPWNCSSSTDGRGSDGGGGRGLCACSAYPYNPLNGQCARGDVTTTPMLANVTDG 146
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
NP V P C +L L SG VG+AGL + ++LPSQ+AA+ ++RKFALCL
Sbjct: 147 VNPLYPVAFPVHA-ACAPGALLGSLPSGAVGVAGLSGAPLSLPSQVAASLKVERKFALCL 205
Query: 195 -SPFDDGAIVFGDGPYYDL---NNFD-VSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
GA +FG GP++ L F VS L Y N N +++ V
Sbjct: 206 PGGGGTGAAIFGGGPFHLLVVPEEFGMVSNGLSYISYLRNPKNGG----------FYLDV 255
Query: 250 TSVHVNGKAVPLNKTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
+ VN + + L++D G GG +STV PYT L IY+A+++A + + +AR
Sbjct: 256 VGIAVNHRGADVPPDSLALDAGTGHGGVMLSTVAPYTALRPDIYRAVIEAIDAELRLIAR 315
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV 368
P PF C++ + +TRIGP +DL+L W+I G +++V++ +A C FV
Sbjct: 316 APPSWPFERCYQRSAMWWTRIGPYTASVDLMLAGGQ-NWTIVGASAVVEVSQEAACFAFV 374
Query: 369 DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFT 417
D G +++IG Q+E+NL+ FDL + GFS LL C F+F+
Sbjct: 375 DMGAAAAPAVIIGGHQMEDNLVVFDLEKWQFGFSGLLLGTMTRCGNFDFS 424
>gi|15239655|ref|NP_197412.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|332005271|gb|AED92654.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 405
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 130/364 (35%), Positives = 187/364 (51%), Gaps = 23/364 (6%)
Query: 57 PVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPI-SN 115
PV L + LG N+ W+DC K S+ + C S+ C C G C NP+ N
Sbjct: 52 PVNLLLDLGTNLTWLDCRKLKSLSSLRLVTCQSSTCKSIPGNGCAGKSCLYKQPNPLGQN 111
Query: 116 TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVA 175
G + D S+ +TDG V+V +F F C E LQGL V G+ L +
Sbjct: 112 PVVTGRVVQDRASLYTTDGGKFLSQVSVRHFTFSCAGEKALQGLPPPVDGVLALSPGSSS 171
Query: 176 LPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTAS 235
Q+ +AF++ KF+LCL G G +Y + F+ S N P+ T +
Sbjct: 172 FTKQVTSAFNVIPKFSLCLPSSGTGHFYIA-GIHYFIPPFNSSDN----PI----PRTLT 222
Query: 236 GFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKAL 295
G S +Y I V S++V G A+ LN LL+ GG K+STV YTVL+T IY AL
Sbjct: 223 PIKGTDSGDYLITVKSIYVGGTALKLNPDLLT------GGAKLSTVVHYTVLQTDIYNAL 276
Query: 296 VQAF---ASAMPKVARVAPVAPFGACFRLQDIGFT-RIGPVVPQIDLVLQNK--NVVWSI 349
Q+F A AM +A+V VAPF CF + G GP VP I++ L + V W
Sbjct: 277 AQSFTLKAKAM-GIAKVPSVAPFKHCFDSRTAGKNLTAGPNVPVIEIGLPGRIGEVKWGF 335
Query: 350 HGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFER 409
+G N++V++ +CL F+DGG P+ +VIG QL++++L+FD + + L FS+SLL
Sbjct: 336 YGANTVVKVKETVMCLAFIDGGKTPKDLMVIGTHQLQDHMLEFDFSGTVLAFSESLLLHN 395
Query: 410 ATCT 413
+C+
Sbjct: 396 TSCS 399
>gi|110737364|dbj|BAF00627.1| dermal glycoprotein - like [Arabidopsis thaliana]
Length = 397
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 130/364 (35%), Positives = 187/364 (51%), Gaps = 23/364 (6%)
Query: 57 PVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPI-SN 115
PV L + LG N+ W+DC K S+ + C S+ C C G C NP+ N
Sbjct: 44 PVNLLLDLGTNLTWLDCRKLKSLSSLRLVTCQSSTCKSIPGNGCAGKSCLYKQPNPLGQN 103
Query: 116 TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVA 175
G + D S+ +TDG V+V +F F C E LQGL V G+ L +
Sbjct: 104 PVVTGRVVQDRASLYTTDGGKFLSQVSVRHFTFSCAGEKALQGLPPPVDGVLALSPGSSS 163
Query: 176 LPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTAS 235
Q+ +AF++ KF+LCL G G +Y + F+ S N P+ T +
Sbjct: 164 FTKQVTSAFNVIPKFSLCLPSSGTGHFYIA-GIHYFIPPFNSSDN----PI----PRTLT 214
Query: 236 GFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKAL 295
G S +Y I V S++V G A+ LN LL+ GG K+STV YTVL+T IY AL
Sbjct: 215 PIKGTDSGDYLITVKSIYVGGTALKLNPDLLT------GGAKLSTVVHYTVLQTDIYNAL 268
Query: 296 VQAF---ASAMPKVARVAPVAPFGACFRLQDIGFT-RIGPVVPQIDLVLQNK--NVVWSI 349
Q+F A AM +A+V VAPF CF + G GP VP I++ L + V W
Sbjct: 269 AQSFTLKAKAM-GIAKVPSVAPFKHCFDSRTAGKNLTAGPNVPVIEIGLPGRIGEVKWGF 327
Query: 350 HGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFER 409
+G N++V++ +CL F+DGG P+ +VIG QL++++L+FD + + L FS+SLL
Sbjct: 328 YGANTVVKVKETVMCLAFIDGGKTPKDLMVIGTHQLQDHMLEFDFSGTVLAFSESLLLHN 387
Query: 410 ATCT 413
+C+
Sbjct: 388 TSCS 391
>gi|297812095|ref|XP_002873931.1| hypothetical protein ARALYDRAFT_351013 [Arabidopsis lyrata subsp.
lyrata]
gi|297319768|gb|EFH50190.1| hypothetical protein ARALYDRAFT_351013 [Arabidopsis lyrata subsp.
lyrata]
Length = 403
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 134/413 (32%), Positives = 200/413 (48%), Gaps = 28/413 (6%)
Query: 10 LCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPL-VPVKLTVHLGGNI 68
L L++F LS + A T + + K T Q+ T +P PV L + LG N+
Sbjct: 4 LTRLIVF-LSILAAITRKSNSQYLFPITKHEPTNQFYTTFNIGSPTKSPVNLLLDLGTNL 62
Query: 69 LWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPI-SNTGTHGDIRIDVL 127
W++C K S+ + C S+ C C G C NP+ N G + D+
Sbjct: 63 TWLNCRKLKSLSSLRLVTCQSSTCKFIPGNGCDGKSCLYKQPNPLGQNPIVTGRVVQDIA 122
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
SI +TDG V+VP F F C E L+GL V G+ L + Q+ +AF++
Sbjct: 123 SISTTDGGKFLSQVSVPRFTFSCAGEKTLEGLPPPVAGVLALSPGSSSFTKQVTSAFNVI 182
Query: 188 RKFALCLSPFDDGAIVFGDGPYY--DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
KF+LCL G Y+ N+ S + TP+ G S +Y
Sbjct: 183 PKFSLCLPSSGTGRFYIAGIHYFIPPFNDSSSSIPMTLTPI-----------RGTDSGDY 231
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF---ASA 302
+ V +++V G + LN LL+ GG K+STV YTVL+T IY AL Q+F A
Sbjct: 232 LLLVLNIYVGGSPLKLNPDLLT------GGAKLSTVVHYTVLQTDIYNALAQSFTLEAKT 285
Query: 303 MPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNK--NVVWSIHGQNSMVQIGG 360
M + +V VAPF CF + G GP V I++ L + V W +G N++V++
Sbjct: 286 M-GIFKVPSVAPFKHCFDARTAGKNLRGPNVSVIEIGLPGRIGEVKWGFYGANTVVKVKE 344
Query: 361 DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
+CL F+DGG P +VIG+ QL++++L+FD + + L FS+SLL +C+
Sbjct: 345 TVMCLAFIDGGKKPENLMVIGSHQLQDHMLEFDFSGTVLAFSESLLLHNTSCS 397
>gi|326500850|dbj|BAJ95091.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 438
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 139/419 (33%), Positives = 197/419 (47%), Gaps = 47/419 (11%)
Query: 25 TSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKT 84
+S+ P LV RV KD +T Y IK VP+ L + L G ++W+ ++
Sbjct: 37 SSWSP--LVARVNKDASTSLYTIAIKDGG--VPL-LLLDLAGPMIWI----ANCPCRHRA 87
Query: 85 ARCGSAQC----NLANAKACGGG----------ICGAGPDNPISNTGTHGDIRIDVLSIQ 130
CGS C N+ C G IC A P NP+ ++
Sbjct: 88 IECGSNDCLGISNMFAPDICAGAEWPVQVQGRCICTAMPYNPVDGRCVAAQATTISVAAN 147
Query: 131 STDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
+TDGRNP V+ P + C +L L +GV G+AGL R +LP Q+A F LK++F
Sbjct: 148 ATDGRNPLFPVSFP-VVGSCAPGELLASLPAGVAGVAGLARLPNSLPLQVANWFRLKQEF 206
Query: 191 ALCLSPFDDGAIVFGDGPYYDLNN---FDVSKNLKYTPL--FINKVNTASGFLGEPSVEY 245
ALCL DG +FG GP+ L +++ NL+ PL N N A Y
Sbjct: 207 ALCLPRGGDGVAIFGGGPFQLLAAPTVEELADNLRKNPLPFLFNPKNRA----------Y 256
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
+ +T + VN + VP +D G GG STV PYT L IY L AF +A
Sbjct: 257 YFTITGIAVNQQRVPTPSGAFGMDWRGQGGAAFSTVTPYTALRWDIYWPLRNAFDAATSG 316
Query: 306 VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
+AR VAPF C++ ++ TR+G V IDL+L W++ G +S+VQ+ +C
Sbjct: 317 IARADKVAPFDMCYQASELTMTRVGYAVASIDLMLDGGQ-NWTLPGASSLVQVNDQTVCF 375
Query: 366 GFVDGGVN------PRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFT 417
FV + ++++G QLE+NLL FDL FS LL TC+ F+F+
Sbjct: 376 AFVQTAASSAPAHAESPAVILGGHQLEDNLLLFDLDKDTFAFSGLLLGIGTTCSNFDFS 434
>gi|15238970|ref|NP_199654.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8777373|dbj|BAA96963.1| dermal glycoprotein precursor, extracellular-like [Arabidopsis
thaliana]
gi|62320322|dbj|BAD94668.1| dermal glycoprotein precursor [Arabidopsis thaliana]
gi|66792680|gb|AAY56442.1| At5g48430 [Arabidopsis thaliana]
gi|133778812|gb|ABO38746.1| At5g48430 [Arabidopsis thaliana]
gi|332008286|gb|AED95669.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 406
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 143/434 (32%), Positives = 206/434 (47%), Gaps = 59/434 (13%)
Query: 3 VAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTV 62
+ +S L+LC +L F+ S ++A + PKALV V+K+T + + + +
Sbjct: 1 MEKSLLVLCLILFFTYSYVSANY-YPPKALVSTVSKNTILPIFTFTLNTNQ-----EFFI 54
Query: 63 HLGGNILWVDCEKGYVSSTNKTARCGSA-----------QCNLANAKACGG-GICGAGPD 110
H+GG L C G CGS QC+L + K G C A
Sbjct: 55 HIGGPYLVRKCNDGL---PRPIVPCGSPVCALTRRFTPHQCSLPSNKIINGVCACQATAF 111
Query: 111 NPI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVG 165
P S+ T+GD LSI S +P +VT+ N +LC + L GV G
Sbjct: 112 EPFQRICNSDQFTYGD-----LSISSLKPISP--SVTINNVYYLCIPQPFLVDFPPGVFG 164
Query: 166 IAGLGRSKVALPSQLAAA-FSLKRKFALCL----SPFDDGAIVFGDGPYYDLNNFDVSKN 220
+AGL + +A +QL L++KFALCL +P GAI FG GPY L N D
Sbjct: 165 LAGLAPTALATWNQLTRPRLGLEKKFALCLPSDENPLKKGAIYFGGGPY-KLRNIDARSM 223
Query: 221 LKYTPLFIN--KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
L YT L N K+N YF+G+ + VNG + + D G GG +
Sbjct: 224 LSYTRLITNPRKLN-----------NYFLGLKGISVNGNRILFAPNAFAFDRNGDGGVTL 272
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL 338
ST+ P+T+L + IY+ ++AF+ A + RV+ PF C T VP+IDL
Sbjct: 273 STIFPFTMLRSDIYRVFIEAFSQATSGIPRVSSTTPFEFCLS------TTTNFQVPRIDL 326
Query: 339 VLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSR 398
L N V+W + N+M ++ D CL FV+GG +++IG Q+EN L++FD+ S
Sbjct: 327 ELAN-GVIWKLSPANAMKKVSDDVACLAFVNGGDAAAQAVMIGIHQMENTLVEFDVGRSA 385
Query: 399 LGFSDSLLFERATC 412
GFS SL A+C
Sbjct: 386 FGFSSSLGLVSASC 399
>gi|297795499|ref|XP_002865634.1| hypothetical protein ARALYDRAFT_494897 [Arabidopsis lyrata subsp.
lyrata]
gi|297311469|gb|EFH41893.1| hypothetical protein ARALYDRAFT_494897 [Arabidopsis lyrata subsp.
lyrata]
Length = 406
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 145/434 (33%), Positives = 201/434 (46%), Gaps = 59/434 (13%)
Query: 3 VAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTV 62
+ +S ++LC LL+FS + ++ PK LV V+K+T + + + + +
Sbjct: 1 MEKSLVVLC-LLLFSAYSYVSAHNYSPKTLVSTVSKNTILPIFTFTLNKNQ-----EFFI 54
Query: 63 HLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA------------NAKACGGGICGAGPD 110
H+GG L C G C S C L N G C A
Sbjct: 55 HIGGPYLVRKCNDGL---PRPIVPCDSPVCALTRGVSPHQCPLPTNTVINGVCACQATAF 111
Query: 111 NPI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVG 165
P S+ T+GD LSI S + +P +VTV N +LC + L GV G
Sbjct: 112 EPFQRLCNSDQFTYGD-----LSISSLNPISP--SVTVNNVYYLCIPKPFLVDFPPGVFG 164
Query: 166 IAGLGRSKVALPSQLAAA-FSLKRKFALCL----SPFDDGAIVFGDGPYYDLNNFDVSKN 220
+AGL + +A +QL L++KFALCL SP + GAI FG GPY L N D
Sbjct: 165 LAGLAPTALATWNQLTRPRLGLEKKFALCLPSDESPLNKGAIYFGGGPY-KLRNIDARSM 223
Query: 221 LKYTPLFIN--KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
L YT L N K+N YF+G+ + VNGK + L D G GG +
Sbjct: 224 LSYTRLIRNPRKLN-----------NYFLGLKGISVNGKRILLAPNAFDFDRNGDGGVTL 272
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL 338
STV P+T L + IYK ++AFA A + RV P C + F VP+IDL
Sbjct: 273 STVFPFTTLRSDIYKVFIEAFAKATSDIPRVISTTPLEFCLK-STTNFQ-----VPRIDL 326
Query: 339 VLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSR 398
L V+W + N+M ++ D CL FV+GG ++VIG Q+EN L++FD+ S
Sbjct: 327 ELA-AGVIWKVSPANAMKKVSDDVACLAFVNGGDAAAQAVVIGLHQMENTLVEFDVGRSA 385
Query: 399 LGFSDSLLFERATC 412
GFS SL A+C
Sbjct: 386 FGFSCSLGLVNASC 399
>gi|297812091|ref|XP_002873929.1| hypothetical protein ARALYDRAFT_909934 [Arabidopsis lyrata subsp.
lyrata]
gi|297319766|gb|EFH50188.1| hypothetical protein ARALYDRAFT_909934 [Arabidopsis lyrata subsp.
lyrata]
Length = 407
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 140/425 (32%), Positives = 209/425 (49%), Gaps = 51/425 (12%)
Query: 9 LLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIK---------QRTPLVPVK 59
LC L+ F+L+ T+ + + +A + + KD AT Y + P + V
Sbjct: 9 FLC-LIFFTLAN-TSHSLEKIQAFLQPIRKDDATNLYYCPLSIGMSKSNGVNHLP-ISVN 65
Query: 60 LTVHLGGNI-LWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGT 118
L + LGG+ + C S + +CGS++C A + P+N
Sbjct: 66 LAIDLGGSAPFLLTCAAAVKSISYHPIKCGSSRCTYAKPDLL------SCPNNSKKRATC 119
Query: 119 HGDIRID--VLSIQSTDGRNPGRAVTVPNFIF-LCGSEFVLQGLASGVVGIAGLGRSKVA 175
H V I+S R+ + N + + +++ S V G GL ++ V+
Sbjct: 120 HKSFSTSFTVHPIKSRLFRDTVSLLYTQNACTDMWNVDPLIKPYLSVVNGTLGLAKTHVS 179
Query: 176 LPSQLAAAFSLKRKFALCLSPFDDG-----AIVFGDGPYY--DLNNFDVSKNLKYTPLFI 228
LPSQL +++ + K ALCL A+ G GPY+ N DVSK TPL
Sbjct: 180 LPSQLVSSYKVPLKVALCLPSSYGSPSGSGALYVGGGPYFFAPYPN-DVSKFFASTPLLA 238
Query: 229 NKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLE 288
N + EYFI V S+ + GKA+ + K GTKI T+ PYTVL
Sbjct: 239 NDQSPG---------EYFIDVKSIQIGGKAIVIAKK----------GTKICTLAPYTVLH 279
Query: 289 TSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWS 348
+SIYKALV FA K+ + V PFG+CF + +G T +G VP I+LVL W
Sbjct: 280 SSIYKALVLTFAGKA-KMVKAPAVKPFGSCFSSKGLGKTMMGSGVPVIELVLSG-GAKWK 337
Query: 349 IHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFE 408
I+G NS+V++ D +CLGF+DGGVN + ++VIG Q+E+NL++FD+ S+ F+ SLL
Sbjct: 338 IYGWNSLVKVSKDVVCLGFLDGGVNLKEAMVIGGFQMEDNLVEFDIKASKFSFTSSLLLR 397
Query: 409 RATCT 413
A+C+
Sbjct: 398 NASCS 402
>gi|115442101|ref|NP_001045330.1| Os01g0936900 [Oryza sativa Japonica Group]
gi|113534861|dbj|BAF07244.1| Os01g0936900 [Oryza sativa Japonica Group]
Length = 379
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 124/380 (32%), Positives = 187/380 (49%), Gaps = 33/380 (8%)
Query: 59 KLTVHLGGNILWVDCEKGYVSSTNKTARCGSA--QCNLANAKACGGGI------------ 104
+L + LGG +LW C + + ++ C +A Q N N + G
Sbjct: 8 RLVLDLGGPLLWSTCLAAHSTVPCRSDVCAAAAVQDNPWNCSSSTDGRGSDGGGGRGLCA 67
Query: 105 CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV 164
C A P NP++ GD+ + TDG NP V P C +L L SG V
Sbjct: 68 CSAYPYNPLNGQCARGDVTTTPMLANVTDGVNPLYPVAFPVHA-ACAPGALLGSLPSGAV 126
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFDDGAIVFGDGPYYDL---NNFD-VSK 219
G+AGL + ++LPSQ+AA+ ++RKFALCL GA +FG GP++ L F VS
Sbjct: 127 GVAGLSGAPLSLPSQVAASLKVERKFALCLPGGGGTGAAIFGGGPFHLLVVPEEFGMVSN 186
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID-NEGVGGTKI 278
L Y N N +++ V + VN + + L++D G GG +
Sbjct: 187 GLSYISYLRNPKNGG----------FYLDVVGIAVNHRGADVPPDSLALDAGTGHGGVML 236
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL 338
STV PYT L IY+A+++A + + +AR P PF C++ + +TRIGP +DL
Sbjct: 237 STVAPYTALRPDIYRAVIEAIDAELRLIARAPPSWPFERCYQRSAMWWTRIGPYTASVDL 296
Query: 339 VLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSR 398
+L W+I G +++V++ +A C FVD G +++IG Q+E+NL+ FDL +
Sbjct: 297 MLAGGQ-NWTIVGASAVVEVSQEAACFAFVDMGAAAAPAVIIGGHQMEDNLVVFDLEKWQ 355
Query: 399 LGFSDSLLFERATC-TFNFT 417
GFS LL C F+F+
Sbjct: 356 FGFSGLLLGTMTRCGNFDFS 375
>gi|297720741|ref|NP_001172732.1| Os01g0937050 [Oryza sativa Japonica Group]
gi|20160766|dbj|BAB89707.1| putative xylanase inhibitor [Oryza sativa Japonica Group]
gi|255674045|dbj|BAH91462.1| Os01g0937050 [Oryza sativa Japonica Group]
Length = 424
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 146/419 (34%), Positives = 199/419 (47%), Gaps = 61/419 (14%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
K LV + KD AT Y IK PLV + L G ++W+ C ++ + T C
Sbjct: 33 KPLVAAITKDAATSLYTVPIKDGRPLV-----LDLAGALVWMSC-----AAAHPTLECHH 82
Query: 90 AQCNLANAKACGGG----------------ICGAGPDNPISNTGTHGDIRIDVLSIQSTD 133
C A++ G C A P NP S D+ LS +TD
Sbjct: 83 HFCMHAHSYHPPGCPHNGYGRADVEDPFRCKCTAHPYNPFSGESATADLTRTRLSANATD 142
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
G+NP V+ + C + +L L +G VG+AGL R+++AL +Q+A + + KFALC
Sbjct: 143 GKNPLYPVSF-AAVTSCAPDSLLAKLPAGAVGVAGLARTRLALQAQVARSQKVANKFALC 201
Query: 194 LSP--FDDGAIVFGDGPYYDL--NNFDVSKNLK-YTPLFINKVNTASGFLGEPSVEYFIG 248
L DG +FG GP + L DV+ L TPL NK YFI
Sbjct: 202 LPSGGGGDGVAIFGGGPLFLLPPGRPDVAATLAGETPLHRNKDLPG----------YFIS 251
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
T + VN + V L + ++ T PYT L +Y+A+V AFA A R
Sbjct: 252 ATKIAVNQEQVQLYTQEPLV-------VELCTRIPYTALRPDVYRAVVDAFARATAGRKR 304
Query: 309 V----APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDAL 363
V P APF C+ +D+G TR+G VPQIDLVL+ KN W++ G NSM Q+ +
Sbjct: 305 VTPPPPPAAPFELCYDSRDLGSTRLGYAVPQIDLVLEGGKN--WTVFGGNSMAQVSDNTA 362
Query: 364 CLGFV----DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFT 417
CL V + G P + +IG Q+ENNL+ FD RLGFS L + TC+ FNFT
Sbjct: 363 CLAVVKVKGEKGSPPPPAAIIGGFQMENNLVVFDEEKQRLGFSGLLWGRQTTCSNFNFT 421
>gi|356557887|ref|XP_003547241.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 678
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 135/403 (33%), Positives = 191/403 (47%), Gaps = 50/403 (12%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ 91
+ L + D+ T QY T + T + L + L GN LW +C+ Y SS+ C S
Sbjct: 300 ITLPINIDSTTPQYFTSVCIGTQRHNMNLAIDLSGNYLWYECDSHYNSSSYNPVTCVSPH 359
Query: 92 CNLAN----------AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
C + C CG NP S++ GD+ D L + +
Sbjct: 360 CPQGSPCLGCDGSPRKPGCTNDTCGFDVVNPFSDSTFIGDMGHDFLFLPQ---------I 410
Query: 142 TVPN-FIFLCG--SEF----VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
+P F++ C S F +L GLA G+ GI GL R+ LP Q++++F++ KF LCL
Sbjct: 411 KLPQTFVYGCAETSRFSSIPILSGLAKGIKGILGLARTPHTLPFQISSSFNVPPKFTLCL 470
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
G + G P + + + +GF G S EYFI V S+ +
Sbjct: 471 PSSGKGKLFIGGRPSSSIISL-----------------SQTGFGGFSSTEYFIHVNSITI 513
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF--ASAMPKVARVAPV 312
N K V + L D G GG+ IST++PYTVL SIYK V+ F A+ + RV V
Sbjct: 514 NDKPVKFGASFLFRDENGNGGSVISTMSPYTVLHHSIYKPFVRDFVEAATAKNIKRVKSV 573
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNK--NVVWSIHGQNSMVQIGGDALCLGFVDG 370
PFG CF I + G VP I L + + V + I NS+V++ LCL FVDG
Sbjct: 574 HPFGECFDANTI---KDGKAVPDIKLAMDGRFRKVSYGICAHNSLVEVRKGVLCLAFVDG 630
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
G T +V+ QL + +L+FDL+TS L FS SLL + TC+
Sbjct: 631 GEFAVTGVVLDGHQLRDRVLEFDLSTSVLSFSSSLLLQNKTCS 673
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 64/220 (29%), Positives = 102/220 (46%), Gaps = 28/220 (12%)
Query: 172 SKVALPSQLAAAFSLKRKFALCL-SPFDDGAIVF-GDGPYYDLNNFDVSKNLKYTPLFIN 229
S + + ++++F++ KF LCL S G +F G GP +S +L
Sbjct: 85 SGLPFTAHISSSFNVPPKFTLCLPSSGKKGHHLFIGGGPTL------ISTSLS------- 131
Query: 230 KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS-IDNEGVGGTKISTVNPYTVLE 288
T G + EY + S+++N K V N + + +D G G IST+ PYTVL
Sbjct: 132 --QTGFGDGNFSNYEYAFHLNSININHKPVKFNTSDIRFLDGNGNAGAIISTIQPYTVLH 189
Query: 289 TSIYKALVQAF--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNK--N 344
S+Y+ V+ F A + RV V PFG C+ I VP I+LVL+++
Sbjct: 190 RSVYQPFVKVFVKAEKAKNMKRVKKVHPFGTCYDANTIA------DVPAINLVLESRIGK 243
Query: 345 VVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQ 384
+ I G +S+V++ +CL F DG +++G
Sbjct: 244 GNYDISGHDSLVEVRKGVMCLAFADGAKQAFCGVLLGGHN 283
>gi|125529032|gb|EAY77146.1| hypothetical protein OsI_05111 [Oryza sativa Indica Group]
Length = 424
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 144/419 (34%), Positives = 197/419 (47%), Gaps = 61/419 (14%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
K LV + KD AT Y IK PLV + L G ++W C ++ + T C
Sbjct: 33 KPLVAAITKDAATSLYTVPIKDGRPLV-----LDLAGALVWTSC-----AAAHPTLECHH 82
Query: 90 AQCNLANAKACGGG----------------ICGAGPDNPISNTGTHGDIRIDVLSIQSTD 133
C A++ G C A P NP S D+ LS +TD
Sbjct: 83 HFCMHAHSYHPPGCPHNGYGRADVEDPFRCKCTAHPYNPFSGESATADLTRTRLSANATD 142
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
G+NP V+ + C + +L L +G VG+AGL R+++AL +Q+A + + KFALC
Sbjct: 143 GKNPLYPVSF-AAVTSCAPDSLLAKLPAGAVGVAGLARTRLALQAQVARSQKVANKFALC 201
Query: 194 LSP--FDDGAIVFGDGPYYDL--NNFDVSKNLK-YTPLFINKVNTASGFLGEPSVEYFIG 248
L DG +FG GP + L DV+ L TPL NK YFI
Sbjct: 202 LPSGGGGDGVAIFGGGPLFLLPPGRPDVAATLAGETPLHRNKDLPG----------YFIS 251
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
T + VN + V L + ++ T PYT L +Y+A+V AFA A R
Sbjct: 252 ATKIAVNQEQVQLYTQEPLV-------VELCTRIPYTALRPDVYRAVVDAFARATAGRKR 304
Query: 309 V----APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDAL 363
V A PF C+ +++G TR+G VPQIDLVL+ KN W++ G NSM Q+ +
Sbjct: 305 VTPPAAAAPPFELCYDSRELGSTRLGYAVPQIDLVLEGGKN--WTVFGGNSMAQVSDNTA 362
Query: 364 CLGFV----DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFT 417
CL V + G P + +IG Q+ENNL+ FD RLGFS L + TC+ FNFT
Sbjct: 363 CLAVVKVKGEKGSPPPPAAIIGGFQMENNLVVFDEEKQRLGFSGLLWGRQTTCSNFNFT 421
>gi|20160764|dbj|BAB89705.1| putative xylanase inhibitor [Oryza sativa Japonica Group]
Length = 422
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 136/412 (33%), Positives = 196/412 (47%), Gaps = 46/412 (11%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
K LV V KD AT Y +K PL + L G ++W C+ + + C
Sbjct: 36 KPLVTAVTKDGATKLYTIAVKDGHPLA-----LDLSGELVWSTCDASHSTVLPYERECVE 90
Query: 90 AQ------CNLANAKACG----GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
A C + A G G C A P N ++ GD+ L+ +T+G NP
Sbjct: 91 ANHYTPPSCWMQYGGAGGDYRYGNKCTAHPYNGVTGRCAPGDLTRTALAADATNGSNPLY 150
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
VT P + C +L L +G VG+AGLGRS +AL +Q+AA ++ +KFALCL
Sbjct: 151 PVTFPA-VASCAPGSLLASLPAGAVGVAGLGRSDLALHAQVAATQNVAKKFALCLP---- 205
Query: 200 GAIVFGDGPY---YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
VFG GP+ + + D+ + L YT L + G Y+I S+ VN
Sbjct: 206 SVAVFGGGPFVLIFPYSRPDIMQKLSYTAL-----RRSPELAGGNGGGYYITAKSIEVNH 260
Query: 257 KAVPLNKTLLSIDNEGVG-GTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA-PVAP 314
VPL N G ++S++ PYT L +Y V+A+ + +VA PVAP
Sbjct: 261 HQVPL-------PNHGAPLVVQLSSMVPYTELRPDVYGPFVKAWDEILQWPKKVAPPVAP 313
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
F C+ + IG R+G VP I++ L++ W I G NS+VQ+ C FV+ + P
Sbjct: 314 FELCYESRTIGSNRLGYAVPDININLED-GAAWYIFGGNSLVQVDDATACFAFVE--MRP 370
Query: 375 RT-----SIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTSIA 420
++VIG Q+E+NL+ FD +LGFS L + TC+ FNFT A
Sbjct: 371 EKVGYGPAVVIGGHQMEHNLVVFDEEKQQLGFSGLLFGLQTTCSNFNFTVAA 422
>gi|125529031|gb|EAY77145.1| hypothetical protein OsI_05110 [Oryza sativa Indica Group]
Length = 422
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 136/412 (33%), Positives = 196/412 (47%), Gaps = 46/412 (11%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
K LV V KD AT Y +K PL + L G ++W C+ + + C
Sbjct: 36 KPLVTAVTKDGATKLYTIAVKDGHPLA-----LDLSGELVWSTCDASHSTVLPYERECVE 90
Query: 90 AQ------CNLANAKACG----GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
A C + A G G C A P N ++ GD+ L+ +T+G NP
Sbjct: 91 ANRYTPPSCWMQYGGAGGDYRYGNKCTAHPYNGVTGRCAPGDLTRTALAADATNGSNPLY 150
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
VT P + C +L L +G VG+AGLGRS +AL +Q+AA ++ +KFALCL
Sbjct: 151 PVTFPA-VASCAPGSLLASLPAGAVGVAGLGRSDLALHAQVAATQNVAKKFALCLP---- 205
Query: 200 GAIVFGDGPY---YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
VFG GP+ + + D+ + L YT L + G Y+I S+ VN
Sbjct: 206 SVAVFGGGPFVLIFPYSRPDIMQKLSYTAL-----RRSPELAGGNGGGYYITAKSIEVNH 260
Query: 257 KAVPLNKTLLSIDNEGVG-GTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA-PVAP 314
VPL N G ++S++ PYT L +Y V+A+ + +VA PVAP
Sbjct: 261 HQVPL-------PNHGAPLVVQLSSMVPYTELRPDVYGPFVKAWDEILQWPKKVAPPVAP 313
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
F C+ + IG R+G VP I++ L++ W I G NS+VQ+ C FV+ + P
Sbjct: 314 FELCYESRTIGSNRLGYAVPDININLED-GAAWYIFGGNSLVQVDDATACFAFVE--MRP 370
Query: 375 RT-----SIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTSIA 420
++VIG Q+E+NL+ FD +LGFS L + TC+ FNFT A
Sbjct: 371 EKVGYGPAVVIGGHQMEHNLVVFDEEKQQLGFSGLLFGLQTTCSNFNFTVAA 422
>gi|115442103|ref|NP_001045331.1| Os01g0937000 [Oryza sativa Japonica Group]
gi|113534862|dbj|BAF07245.1| Os01g0937000, partial [Oryza sativa Japonica Group]
Length = 395
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 136/412 (33%), Positives = 196/412 (47%), Gaps = 46/412 (11%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
K LV V KD AT Y +K PL + L G ++W C+ + + C
Sbjct: 9 KPLVTAVTKDGATKLYTIAVKDGHPLA-----LDLSGELVWSTCDASHSTVLPYERECVE 63
Query: 90 AQ------CNLANAKACG----GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
A C + A G G C A P N ++ GD+ L+ +T+G NP
Sbjct: 64 ANHYTPPSCWMQYGGAGGDYRYGNKCTAHPYNGVTGRCAPGDLTRTALAADATNGSNPLY 123
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
VT P + C +L L +G VG+AGLGRS +AL +Q+AA ++ +KFALCL
Sbjct: 124 PVTFPA-VASCAPGSLLASLPAGAVGVAGLGRSDLALHAQVAATQNVAKKFALCLP---- 178
Query: 200 GAIVFGDGPY---YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
VFG GP+ + + D+ + L YT L + G Y+I S+ VN
Sbjct: 179 SVAVFGGGPFVLIFPYSRPDIMQKLSYTAL-----RRSPELAGGNGGGYYITAKSIEVNH 233
Query: 257 KAVPLNKTLLSIDNEGVG-GTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA-PVAP 314
VPL N G ++S++ PYT L +Y V+A+ + +VA PVAP
Sbjct: 234 HQVPL-------PNHGAPLVVQLSSMVPYTELRPDVYGPFVKAWDEILQWPKKVAPPVAP 286
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
F C+ + IG R+G VP I++ L++ W I G NS+VQ+ C FV+ + P
Sbjct: 287 FELCYESRTIGSNRLGYAVPDININLED-GAAWYIFGGNSLVQVDDATACFAFVE--MRP 343
Query: 375 RT-----SIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTSIA 420
++VIG Q+E+NL+ FD +LGFS L + TC+ FNFT A
Sbjct: 344 EKVGYGPAVVIGGHQMEHNLVVFDEEKQQLGFSGLLFGLQTTCSNFNFTVAA 395
>gi|195658759|gb|ACG48847.1| xylanase inhibitor TAXI-IV [Zea mays]
Length = 426
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 139/421 (33%), Positives = 197/421 (46%), Gaps = 62/421 (14%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTA---- 85
K LV + +D AT Y +K PLV + L G +LW C + S A
Sbjct: 32 KPLVTAITRDAATKLYTAPLKDELPLV-----LDLSGPLLWATCAAPHPSYECHHAACAH 86
Query: 86 ----------RCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGR 135
R G + + C C A P NP + GD+ ++ +TDG
Sbjct: 87 AHAHHPPGCPRTGHGVADEFDPFRC---RCRAHPYNPFARRAGSGDLTRARVTANTTDGA 143
Query: 136 NPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS 195
NP A + C +L GL +G VG+AGL RS++ALP+Q+A + R+FALCL
Sbjct: 144 NP-LAAASFTAVAACAPPTLLAGLPAGAVGVAGLARSRLALPAQVARKQKVARRFALCL- 201
Query: 196 PFDDGAI---VFGDGPYYDL--NNFDVSKNLK-YTPLFINKVNTASGFLGEPSVEYFIGV 249
P + G + +FG GP + L DV+ +L TPL N G P YF+
Sbjct: 202 PGEGGGMGVAIFGGGPLFLLPPGRPDVTASLAGTTPLRRNP--------GVPG--YFVSA 251
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
T + V N + + +G + + PYTVL +Y V+AF M R
Sbjct: 252 TGIAV-------NHVQVQVQQQGPLTVALCSRVPYTVLRPDVYAPFVRAF-EVMAMAGRK 303
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
P PF C+ +++G TR+G VPQ+DL+L++ W++ G NSMVQ+ D C F++
Sbjct: 304 PPTPPFELCYDSRELGSTRLGYAVPQVDLMLES-GANWTVFGGNSMVQVSDDTACFAFLE 362
Query: 370 ------------GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNF 416
GG P ++VIG Q+ENNLL FD +LGFS L + TC+ FNF
Sbjct: 363 MKEEKQQGGHGYGGGAPAPAVVIGGFQMENNLLVFDEENGQLGFSGLLFGRQTTCSNFNF 422
Query: 417 T 417
T
Sbjct: 423 T 423
>gi|326489434|dbj|BAK01698.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 429
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 132/411 (32%), Positives = 191/411 (46%), Gaps = 40/411 (9%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILW----------VDCEKGYVS 79
+ +V RV+KD +T Y IK + V L + L G +LW V C
Sbjct: 31 QPIVARVSKDASTSLYNIAIK----VGGVPLLLDLAGPMLWLANCPSPHRIVPCVSPVCD 86
Query: 80 STNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
+ T R C + G C A P NP+ D L+ +TDG+NP
Sbjct: 87 EVSTTYR--PPGCPKPGLRGEGQCACPAYPRNPVDGRCRSDDATTITLAASTTDGQNPIF 144
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFD 198
VT + C +L+ L +G G+AG R ++LP+Q A+ + +FALCL S
Sbjct: 145 PVTF-RAVGSCAPGELLESLPAGAAGVAGFSRLPLSLPTQFASLLKVANEFALCLPSGGS 203
Query: 199 DGAIVFGDGPYYDLNN--FDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
DG VFG GP+ L +++ L+ PL + K G Y+ +T + VN
Sbjct: 204 DGVAVFGGGPFQLLAAPPVELAGRLRENPLPLLKHPYNGG--------YYFNITGIAVNQ 255
Query: 257 KAVPLNKTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
+ VP + +D + G GG STV PYT L IY L AF +A +AR V PF
Sbjct: 256 QLVPTPPGVFDLDASSGTGGAVFSTVTPYTALRWDIYWPLRNAFDAATSGIARADKVEPF 315
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
C++ + TR+G V I+L+L +N W++ G +S+VQ+ +C FV +
Sbjct: 316 DLCYQASALTVTRVGYGVANIELMLDGGRN--WTLPGASSLVQVNNQTVCFAFVQMASSS 373
Query: 375 RT-------SIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFT 417
++++G Q+ENNLL FDL FS LL R TC+ FNFT
Sbjct: 374 SMPAALDSPAVILGGHQMENNLLMFDLVKETFAFSGLLLGIRTTCSNFNFT 424
>gi|226510522|ref|NP_001142024.1| xylanase inhibitor TAXI-IV precursor [Zea mays]
gi|194706824|gb|ACF87496.1| unknown [Zea mays]
gi|414878790|tpg|DAA55921.1| TPA: xylanase inhibitor TAXI-IV [Zea mays]
Length = 429
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 141/423 (33%), Positives = 200/423 (47%), Gaps = 63/423 (14%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTA---- 85
K LV + +D AT Y +K PLV + L G +LW C + S A
Sbjct: 32 KPLVTAITRDAATKLYTAPLKDELPLV-----LDLSGPLLWATCAAPHPSYECHHAACAH 86
Query: 86 ----------RCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGR 135
R G + + C C A P NP + GD+ ++ +TDG
Sbjct: 87 AHAHHPPGCPRTGHGVADEFDPFRC---RCRAHPYNPFARRAGSGDLTRARVTANTTDGA 143
Query: 136 NPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS 195
NP A + C +L GL +G VG+AGL RS++ALP+Q+A + R+FALCL
Sbjct: 144 NP-LAAASFTAVAACAPPTLLAGLPAGAVGVAGLARSRLALPAQVARKQKVARRFALCL- 201
Query: 196 PFDDGAI---VFGDGPYYDL--NNFDVSKNLK-YTPLFINKVNTASGFLGEPSVEYFIGV 249
P + G + +FG GP + L DV+ +L TPL N G P YF+
Sbjct: 202 PGEGGGMGVAIFGGGPLFLLPPGRPDVTASLAGTTPLRRNP--------GVPG--YFVSA 251
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF-ASAMPKVAR 308
T + V N + + +G + + PYTVL +Y V+AF A AM R
Sbjct: 252 TGIAV-------NHVQVQVQQQGPLTVALCSRVPYTVLRPDVYAPFVRAFEAMAMAGRKR 304
Query: 309 VA-PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF 367
+ P PF C+ +++G TR+G VPQ+DL+L++ W++ G NSMVQ+ D C F
Sbjct: 305 MTPPTPPFELCYDSRELGSTRLGYAVPQVDLMLES-GTNWTVFGGNSMVQVSDDTACFAF 363
Query: 368 VD------------GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-F 414
++ GG P ++VIG Q+ENNLL FD +LGFS L + TC+ F
Sbjct: 364 LEMKEEKQQGGHGYGGGAPAPTVVIGGFQMENNLLVFDEENGQLGFSGLLFGRQTTCSNF 423
Query: 415 NFT 417
NFT
Sbjct: 424 NFT 426
>gi|242059841|ref|XP_002459066.1| hypothetical protein SORBIDRAFT_03g045270 [Sorghum bicolor]
gi|241931041|gb|EES04186.1| hypothetical protein SORBIDRAFT_03g045270 [Sorghum bicolor]
Length = 417
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 139/414 (33%), Positives = 197/414 (47%), Gaps = 56/414 (13%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ 91
++L VAKD AT Y + V + L G +LW C+ C
Sbjct: 28 VLLPVAKDAATSLYTIPTRDGAHHV-----IDLAGPLLWSTCDH-----IPAKISCRDPV 77
Query: 92 CNLANA-KACGGGICGAG----------PDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA 140
C LANA +A GI GAG P NPI+ ++ L +TDG+NP
Sbjct: 78 CKLANAYRAPSCGIAGAGQQCSKRCKAYPYNPITGRCAAAELVHTRLIANTTDGKNPLSQ 137
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP---F 197
V+VP + C S +L+ L V G+AGL + +ALP+Q+AA+ + + F LCL
Sbjct: 138 VSVPA-VAACASATLLEKLPRDVTGVAGLSAAGLALPAQVAASQRVAKTFLLCLPRSGGR 196
Query: 198 DDGAIVFG-DGPYY-------DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
DG VFG GP+Y + ++ D+++ L++ PL N Y+I V
Sbjct: 197 GDGVAVFGTRGPFYLKLFLTGEPSSGDLTQTLQFAPLRSRPGNPL----------YYIPV 246
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV-AR 308
T+V V VPL LS GG + T PYT L +Y+ +V+AF + + R
Sbjct: 247 TNVSVGRVPVPLPPHALS-----AGGVVLCTRVPYTALRPDVYRPVVEAFDRGLIRSDMR 301
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV 368
VA V PF C+ + TRIG VP+I VL+ W+ G +SMV + CL FV
Sbjct: 302 VAAVPPFEFCYNRTLLPPTRIGYGVPEITFVLEGGK-EWTFVGSSSMVDVNAKTACLAFV 360
Query: 369 D-----GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNFT 417
+ G +IV+G Q+E++LLQFDL +LGF+ + F A FNFT
Sbjct: 361 EMKGVKAGDPAAAAIVVGGFQMEDHLLQFDLEKKQLGFA-KVPFISACSNFNFT 413
>gi|242059843|ref|XP_002459067.1| hypothetical protein SORBIDRAFT_03g045280 [Sorghum bicolor]
gi|241931042|gb|EES04187.1| hypothetical protein SORBIDRAFT_03g045280 [Sorghum bicolor]
Length = 414
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 137/407 (33%), Positives = 196/407 (48%), Gaps = 46/407 (11%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ 91
++L VAKD AT Y ++ V + L G +LW C ++ + C
Sbjct: 32 VLLPVAKDPATSLYTIPVRDGANHV-----MDLAGPLLWSTCAADHLPAK---VSCRDPV 83
Query: 92 CNLANA----------KACGGGI-CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA 140
C LANA CG C A P NP++ + L +TDGRNP
Sbjct: 84 CKLANAYRAPSCRIAGHPCGAKRRCKAYPYNPVTGRCAAASLVHTRLIANTTDGRNPLSQ 143
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
V+V + C +L L +G G+AGL + +ALP+Q+AA+ + +F LCL +G
Sbjct: 144 VSV-RAVAACAPRTLLPRLPAGAAGVAGLADAGLALPAQVAASQRVANRFLLCLPRRGEG 202
Query: 201 AIVFGDGPYY---DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
VFG GP + D D++ L +T L + N Y+I V V VN
Sbjct: 203 VAVFGGGPLFLIPDSAVGDLTSTLAFTALRRRRGNPL----------YYIPVQGVAVNQA 252
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK-VARVAPVAPFG 316
VPL+ + L+ GG + T PYT L +Y+ +VQAF A+ + A+V VAPF
Sbjct: 253 RVPLSASALA-----TGGVVLCTRVPYTELRPDVYRPVVQAFDRALARNDAKVPGVAPFE 307
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD-----GG 371
C+R +G TR+G VP I LVL++ W+ G ++MV + G CL FV+ G
Sbjct: 308 LCYRSSMLGNTRLGYAVPDIALVLEDGK-SWTFVGSSTMVDVNGQTACLAFVEMKGVKAG 366
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNFTS 418
++V+G Q+EN+LLQFDL +LGF+ F A FNFT
Sbjct: 367 DPAAAAVVVGGFQMENHLLQFDLEKKQLGFAKVPFFT-ACSNFNFTK 412
>gi|115442105|ref|NP_001045332.1| Os01g0937100 [Oryza sativa Japonica Group]
gi|20160767|dbj|BAB89708.1| putative xylanase inhibitor [Oryza sativa Japonica Group]
gi|113534863|dbj|BAF07246.1| Os01g0937100 [Oryza sativa Japonica Group]
gi|215740721|dbj|BAG97377.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 419
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 129/416 (31%), Positives = 198/416 (47%), Gaps = 47/416 (11%)
Query: 22 TAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST 81
+A R +V+ V +D AT Y ++ LV V L G ++W C ++ ++
Sbjct: 28 SAAAPRRRDPVVVPVTRDPATSLYTIPVRYYDNLV-----VDLAGPLVWSTCAADHLPAS 82
Query: 82 NKTARCGSAQCNLANA------KACGGG------ICGAGPDNPISNTGTHGDIRIDVLSI 129
C C +ANA K GGG +C A P NP++ G++
Sbjct: 83 ---LSCQDPTCVVANAYRAPTCKVTGGGGDCSKNVCTAYPYNPVTGQCAAGNLAHTRFIA 139
Query: 130 QSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK 189
+TDG+NP V+V + C + +L L G G+AGL S +ALP+Q+A++ + +
Sbjct: 140 NTTDGKNPLIQVSV-KAVAACAPKRLLARLPRGATGVAGLAASGLALPAQVASSQGVAGR 198
Query: 190 FALCLSP--FDDGAIVFGDGPYYDLNNF-DVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
F LCL + G +FG GP Y D + L YTPL + N Y+
Sbjct: 199 FLLCLPRLGYGQGVAIFGGGPIYLGEGLPDFTTTLDYTPLVAKRDNPG----------YY 248
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
+ ++ ++ +PL L+ GG + T P+ L +++ V+ F + +
Sbjct: 249 VTANAIALDDARLPLPSGALA-----AGGVALRTAVPFGQLRPDVFRPFVREFEKGLNRS 303
Query: 307 -ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALC 364
A+VA VAPF C+R +G TRIG VP + L+L KN +++ G NSMV + G C
Sbjct: 304 DAKVAAVAPFPLCYRASMLGNTRIGYFVPAVRLMLAGGKN--YTMTGTNSMVDVKGGKAC 361
Query: 365 LGFVD---GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNFT 417
L FV+ G ++++G Q+EN LLQFD RLGF+ L F + FNFT
Sbjct: 362 LAFVEMKSGDAASSPAVILGGFQMENMLLQFDSEKKRLGFA-RLPFYTSCSNFNFT 416
>gi|326504674|dbj|BAK06628.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 416
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 142/441 (32%), Positives = 201/441 (45%), Gaps = 66/441 (14%)
Query: 10 LCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNIL 69
L LL+ ++S + S +P +++ V KD ATL Y LV V G ++
Sbjct: 5 LLPLLVLAVS-LAWPASCKPLPVLVPVTKDPATLLYTIPFHYGNDLV-----VDTAGPLV 58
Query: 70 WVDCEKGYVSSTNKTARCGSAQCNLANA--------KACG-----GGICGAGPDNPISNT 116
W C+ G++ + C S C ANA CG G C A P NP++
Sbjct: 59 WSTCQPGHLPAEFP---CNSDTCRKANAFHVPGCHAPGCGRDGRKGSTCTAYPYNPVTGA 115
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
GD+ L +TDG +P V+V I C +L+ L G G+AGL S +AL
Sbjct: 116 CAAGDLVHTRLVANTTDGVHPVSRVSV-RAIAACAPSSLLKSLPRGASGVAGLAGSDLAL 174
Query: 177 PSQLAAAFSLKRKFALCLS----PFDDGAIVFGDGPYY--DLNNFDVSKNLKYTPLFINK 230
P+Q+A+A ++ KF LCL D G +FG G + D ++ L YTPL +
Sbjct: 175 PAQVASAQNVSNKFLLCLPRGGFSGDTGVAIFGGGQFQVTAQPGRDFTQELLYTPLVTKQ 234
Query: 231 VNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETS 290
G P Y + + S+ V V GG + T P+T+L
Sbjct: 235 --------GMPPAHY-VSIQSIAVENTRV-----------RATGGAVVCTKVPFTLLRPD 274
Query: 291 IYKALVQAFASAM--------PKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVL-Q 341
+Y+ V AFA A+ P RV PV PF C+ + + TRIG +VP + L L
Sbjct: 275 VYRPFVYAFARALTAQGAQGGPVARRVKPVPPFERCYDARSLANTRIGYLVPGVTLTLGG 334
Query: 342 NKNVVWSIHGQNSMVQIGGDALCLGFVD-GGVNPR----TSIVIGARQLENNLLQFDLAT 396
KN W+++G +SMV I CL F GV R +++IG Q+EN LL+FD+A
Sbjct: 335 GKN--WTMNGLSSMVDIKPGTACLAFARMEGVKGRDLAAPAVLIGGFQMENTLLEFDMAK 392
Query: 397 SRLGFSDSLLFERATCTFNFT 417
RLGF F + FNFT
Sbjct: 393 KRLGFVRLPFFTQCG-HFNFT 412
>gi|156186249|gb|ABU55395.1| xylanase inhibitor 602OS [Triticum aestivum]
Length = 416
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 137/440 (31%), Positives = 200/440 (45%), Gaps = 63/440 (14%)
Query: 10 LCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNIL 69
L LL+ ++S + S +P +++ V KD ATL Y LV V G ++
Sbjct: 4 LLPLLVLAVS-LAWPASCKPLPVLVPVTKDPATLLYTIPFHYGADLV-----VDTAGPLV 57
Query: 70 WVDCEKGYVSSTNKTARCGSAQCNLANA--------KACGGGI-----CGAGPDNPISNT 116
W C++G++ + C S C LANA + CG C A P NP++
Sbjct: 58 WSTCQRGHLPAEFP---CNSPTCRLANAFHAPSCRARGCGRDTRKDRTCTAYPYNPVTGA 114
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
GD+ +TDG +P V+V + C +L+ L G G+AGL S +AL
Sbjct: 115 CAAGDLVHTRFVANTTDGIHPVSQVSV-RPLAACAPSRLLKSLTRGXSGVAGLAGSGLAL 173
Query: 177 PSQLAAAFSLKRKFALCL----SPFDDGAIVFGDGPYY--DLNNFDVSKNLKYTPLFINK 230
P+Q+A+A S+ KF LCL S G +FG GP D ++ L YTPL K
Sbjct: 174 PAQVASAQSVPNKFLLCLPRGGSSGSTGVAIFGGGPXQVSXQPGRDFTQELVYTPLVAAK 233
Query: 231 VNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETS 290
G P Y + + S+ + VP G G + T P+T+L
Sbjct: 234 K-------GMPPAHY-VSLESIAMENTRVP-----------GAGAAVVCTKVPFTLLRPD 274
Query: 291 IYKALVQAFASAM--------PKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQN 342
+Y+ V+AFA A+ P V PV PF C+ Q + TRIG +VP + L L
Sbjct: 275 VYRPFVEAFARALKAQGAQGGPVARPVKPVPPFELCYDTQSLANTRIGYLVPGVTLTL-G 333
Query: 343 KNVVWSIHGQNSMVQIGGDALCLGF-----VDGGVNPRTSIVIGARQLENNLLQFDLATS 397
W+++G +SMV + CL F V G ++++G Q+EN +L+FD+A
Sbjct: 334 GGTNWTMNGLSSMVDLRPGTACLAFARMEGVKAGDRSAPAVLVGGFQMENTVLEFDVAKK 393
Query: 398 RLGFSDSLLFERATCTFNFT 417
RLGF F + FNFT
Sbjct: 394 RLGFVRLPFFTQCG-HFNFT 412
>gi|255544316|ref|XP_002513220.1| conserved hypothetical protein [Ricinus communis]
gi|223547718|gb|EEF49211.1| conserved hypothetical protein [Ricinus communis]
Length = 174
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 94/172 (54%), Positives = 116/172 (67%), Gaps = 8/172 (4%)
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
V S+ V G+ + NKTLLSI+NEG GGT+IST+ PYT+L TSI++ALV+AF A +
Sbjct: 8 VKSIRVGGEDIKANKTLLSINNEGKGGTRISTIKPYTILHTSIFQALVKAFVKAY-DIKL 66
Query: 309 VAPVA--PFGACFRLQDIGFTRIGPVVPQIDLVLQNK-NVVWSIHGQNSMVQIGGDALCL 365
+ PV PFGACF G GP VP IDLVL+ + +V W I NS+V+I CL
Sbjct: 67 IPPVVEPPFGACFPSFSEG---SGPEVPLIDLVLEGQGSVYWRIWAANSLVKISSTLTCL 123
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNFT 417
GFVDGG +P TSIVIG Q+E+NLLQFDL +SR GFS SL TC+ NFT
Sbjct: 124 GFVDGGADPFTSIVIGGHQIEDNLLQFDLDSSRFGFSSSLFRRNTTCS-NFT 174
>gi|326492147|dbj|BAJ98298.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 419
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 132/419 (31%), Positives = 192/419 (45%), Gaps = 55/419 (13%)
Query: 28 RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARC 87
R ++++ V KD AT Y LV V + G ++W C ++ + C
Sbjct: 24 RVRSVLAPVTKDPATRLYTMPFHYGANLV-----VDIAGPLVWSTCAPDHLPAAFP---C 75
Query: 88 GSAQCNLANA------------KACGGG--ICGAGPDNPISNTGTHGDIRIDVLSIQSTD 133
SA C LAN K C +C A P NP++ GD+ +TD
Sbjct: 76 KSATCRLANKYHIPGCTESAADKLCDSSHKVCRAFPYNPVTGACAAGDLIHTRFVANTTD 135
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
G+NP V V C +L+ L G G+AGL S +ALP+Q+A+A + KF LC
Sbjct: 136 GKNPASQVNVRGDA-ACAPSKLLESLPQGASGVAGLAGSDLALPAQVASAQKVPNKFLLC 194
Query: 194 LS---PFDDGAIVFGDGPYYDLNN--FDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIG 248
L D G VFG GP + + D K L YTPL K N A +FI
Sbjct: 195 LPRGLSSDPGVAVFGGGPLHFMAQPGRDYGKELAYTPLVAQKGNPA----------HFIS 244
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK--- 305
+ S+ V+ VP L+ GG + T P+T+L + ++ ++ AF A+ K
Sbjct: 245 IKSIAVDNARVPFPAGALT-----TGGAVLCTRVPFTMLRSDVFLPVLDAFTKALAKQGG 299
Query: 306 -VAR-VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL 363
VA+ V P APF C+ + + TR G +VP + L L W+ G +SMV +
Sbjct: 300 PVAKAVKPYAPFQQCYDTRTLAITRNGYLVPDVTLTLGGGK-KWTWDGLSSMVDMAPRTA 358
Query: 364 CLGFVD-----GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNFT 417
CL FV GG N +++IG Q+EN +++FD+ R GF+ F + + FNFT
Sbjct: 359 CLAFVQMEGVKGGDNSAPAVLIGGFQMENTVVEFDMKKKRFGFARLPSFTQCS-HFNFT 416
>gi|356555628|ref|XP_003546132.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 421
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 137/444 (30%), Positives = 209/444 (47%), Gaps = 61/444 (13%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKA--LVLRVAKDTATLQYVTQIKQRTPLVPV 58
M+ L C +L+F +SP + ++ PK + L + D T Q+ T I TP +
Sbjct: 1 MATPTCVLYFC-VLVFFVSPSLSASNEFPKTGYISLPINIDPTTHQHFTSIGIGTPRHNM 59
Query: 59 KLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCN-----LANAKA-------CGGGICG 106
L + + G+ LW DC Y SS+ S QC +N A C C
Sbjct: 60 NLAIDISGSYLWYDCGGNYNSSSYNPVLWDSPQCPGPEPFQSNCDAGFPFKPGCTNNTCN 119
Query: 107 AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEF-------VLQGL 159
DNP ++ G GD+ D L + +P F SE +L GL
Sbjct: 120 VALDNPFADFGFGGDLGHDFLFTPQ---------IKLPQTFFSVCSESSRFPQLPILVGL 170
Query: 160 ASGVVGIAGLGR-SKVALPSQLAAAFS-LKRKFALCL-SPFDDGAIVFGDGPYYDLNNFD 216
G G GL R S L SQ++++F+ + KF LCL S G + G P +
Sbjct: 171 PKGTKGSLGLARQSPFTLQSQISSSFNNVPPKFTLCLPSSGKKGHLFIGGRPTFSTPLSQ 230
Query: 217 VSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGT 276
+ + +Y+ + +YF + S+H+N K V N + LS+D GT
Sbjct: 231 IGFDSRYS-----------------NYDYFFHLNSIHINHKPVQFNTSGLSVDLNDNVGT 273
Query: 277 KISTVNPYTVLETSIYKALVQAF--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVP 334
KIST++P+TVL +Y+ V+AF A+ + RV V PFG C+ +G R VP
Sbjct: 274 KISTLHPFTVLHPQVYQPFVKAFVKAAKTKNMKRVKKVHPFGTCYDATTVGDHR--EAVP 331
Query: 335 QIDLVLQNKN------VVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENN 388
IDLVL+ + V + I+G +S+V++ LCL FV+GG+ ++++GA QL++
Sbjct: 332 AIDLVLEAEELGRFGKVSYEIYGHDSLVEVKKGVLCLAFVNGGIRALDAVLLGAHQLKDR 391
Query: 389 LLQFDLATSRLGFSDSLLFERATC 412
+L FD +TS + FS SL+ + TC
Sbjct: 392 ILVFDESTSIISFSSSLVHQNKTC 415
>gi|242059839|ref|XP_002459065.1| hypothetical protein SORBIDRAFT_03g045260 [Sorghum bicolor]
gi|241931040|gb|EES04185.1| hypothetical protein SORBIDRAFT_03g045260 [Sorghum bicolor]
Length = 431
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 140/433 (32%), Positives = 201/433 (46%), Gaps = 65/433 (15%)
Query: 23 AQTSFRP--KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSS 80
A TS P K LV + +D AT Y +K PLV + L G +LW C + S
Sbjct: 23 APTSTTPGGKPLVTAITRDAATKLYTAPLKDALPLV-----LDLSGTLLWSTCAAAHPSY 77
Query: 81 TNKTA--------------RCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDV 126
A R G + + C C A P NP + GD+
Sbjct: 78 ECHHAACAHAHAHHPPGCPRTGHGVADEDDPFRC---RCRAHPYNPFARRAASGDLTRAR 134
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
++ +TDG NP V+ + C +L GL +G VG+AGL RS +ALP+Q+A +
Sbjct: 135 VTANATDGANPLAPVSF-TAVAACAPPTLLAGLPAGAVGVAGLARSWLALPAQVARKQKV 193
Query: 187 KRKFALCLSPFDD--GAIVFGDGPYYDL--NNFDVSKNLKYTPLFINKVNTASGFLGEPS 242
RKFALCL + G +FG GP + L DV+ +L T + G+P
Sbjct: 194 ARKFALCLPGAGNGQGVAIFGGGPLFLLPPGRPDVTASLAGT----------TPLRGKPR 243
Query: 243 VE-YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS 301
V YF V+ K + +N+ + + G + + PYTVL +Y V+AF +
Sbjct: 244 VPGYF-------VSAKGIAVNQAQVQVQQLGPLVVALCSRIPYTVLRPDVYAPFVRAFDA 296
Query: 302 AMPKVARVA-PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
A RV P PF C+ +++G TR+G VPQ+DL+L++ W++ G NSMVQ+
Sbjct: 297 ATAGRKRVTPPTPPFELCYDSRELGSTRLGYAVPQVDLMLES-GANWTVFGGNSMVQVSD 355
Query: 361 DALCLGFVD---------------GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSL 405
D C F++ GG +++IG Q+ENNLL FD +LGFS L
Sbjct: 356 DTACFAFLEMKEEKHEGGHGYGHGGGAGTAPAVIIGGFQMENNLLVFDEEKRQLGFSGLL 415
Query: 406 LFERATCT-FNFT 417
+ TC+ FNFT
Sbjct: 416 FGRQTTCSNFNFT 428
>gi|357131652|ref|XP_003567450.1| PREDICTED: basic 7S globulin-like [Brachypodium distachyon]
Length = 455
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 149/441 (33%), Positives = 210/441 (47%), Gaps = 74/441 (16%)
Query: 30 KALVLRVAKDTATLQY-VTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCG 88
K LV + KD +T Y IK +PLV + L G I+W C SS + T C
Sbjct: 38 KPLVTAITKDGSTRLYSFPVIKNGSPLV-----LDLSGPIIWSTCPD---SSAHDTIDCN 89
Query: 89 SAQCNLA------NAKACGGG-----------ICGAGPDNPI-SNTG-----THGDIRID 125
S C A N G G C A P NP+ SN+G + D+
Sbjct: 90 SPACMRAHRYHPPNCPHTGYGQPDAPRNPYRCKCTAHPHNPLGSNSGGSTPQSGQDLTRV 149
Query: 126 VLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFS 185
LS +TDG NP + C E +L+GL G VG+AGLGRS ++LP+Q+ A
Sbjct: 150 ALSANATDGNNPLSPPVAFTAVASCAPESLLEGLPEGSVGVAGLGRSALSLPAQVGKAQG 209
Query: 186 LKRKFALCL----SPFDDGAIVFGDGP--YYDLNNFDVSKNLK-YTPLFINKVNTASGFL 238
+ KFALCL + + G +FG GP + D++ +L TPL K G
Sbjct: 210 VCNKFALCLPSGSASGNLGVAIFGGGPLSLLPMVGTDLTASLAGETPLV--KYKECPG-- 265
Query: 239 GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN--EGVGGTKI--STVNPYTVLETSIYKA 294
Y++ T+ G AV + +L +D+ +G G + ST PYT L + +Y+A
Sbjct: 266 ------YYVKATA----GIAVNQAQVVLPLDDGKDGCGPLVVGFSTTAPYTELRSDVYRA 315
Query: 295 LVQAFASAMPKVARV-APVA--PFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIH 350
++AF +A + R+ +P + F C+ +G TR+G VPQ+D++L KN W++
Sbjct: 316 FIKAFDAATSGIPRLPSPTSGPKFELCYESAKLGSTRLGYAVPQVDVMLDGGKN--WTVF 373
Query: 351 GQNSMVQIGGDALCLGFVDGGVNPRT----------SIVIGARQLENNLLQFDLATSRLG 400
G NSM Q+ CL FV+ T ++VIG Q+ENNL+ FD RLG
Sbjct: 374 GGNSMAQVDDRTACLAFVEMAEGKATYGGGGEAAAPAVVIGGFQMENNLVVFDEEEQRLG 433
Query: 401 FSDSLLFERATCT-FNFTSIA 420
FS L R TC+ FNFT A
Sbjct: 434 FSGLLWGRRTTCSNFNFTMAA 454
>gi|156186245|gb|ABU55393.1| xylanase inhibitor 725ACCN [Triticum aestivum]
Length = 403
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 120/392 (30%), Positives = 177/392 (45%), Gaps = 52/392 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KD AT Y LV + + G ++W CE G + C S C LA
Sbjct: 28 VTKDPATSLYTIPFHDGASLV-----LDVAGPLVWSTCEGGQPPAEIP---CSSPTCLLA 79
Query: 96 NA--------KACGGGI----CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP++ G + + +TDG P V V
Sbjct: 80 NAYPAPGCPAPSCGSDTHDKPCTAYPYNPVTGACAAGSLFHTRFAANTTDGSKPVSKVNV 139
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALP+Q+A+A + ++F LCL G +
Sbjct: 140 -GVLAACAPSKLLASLPRGSTGVAGLADSGLALPAQVASAQKVAKRFLLCLPTGGPGVAI 198
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP L ++++ YTPL G P+ ++I + V VP+++
Sbjct: 199 FGGGP---LPWPQFTQSMPYTPLVTKG--------GSPA--HYISARFIEVGDTRVPVSE 245
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPF 315
L+ GG +ST PY VL +Y+ LV AF A+ P V PVAPF
Sbjct: 246 GALA-----TGGVMLSTRLPYAVLRRDVYRPLVDAFTKALAAQHANGAPVARAVEPVAPF 300
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF-----VDG 370
G C+ + +G G VP + L L + W++ G+NSMV + C+ F V+
Sbjct: 301 GVCYDTKTLGNNLGGYSVPNVQLALDGGSDTWTMTGKNSMVDVKPGTACVAFVEMKGVEA 360
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
G ++++G Q+E+ +L FD+ RLGFS
Sbjct: 361 GDGRAPAVILGGAQMEDFVLDFDMEKKRLGFS 392
>gi|15239644|ref|NP_197411.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|91806880|gb|ABE66167.1| extracellular dermal glycoprotein-like protein/EDGP-like
[Arabidopsis thaliana]
gi|332005270|gb|AED92653.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 391
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 121/343 (35%), Positives = 165/343 (48%), Gaps = 53/343 (15%)
Query: 65 GGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRI 124
G L +C S+T RCGS +C AN P+ P N +
Sbjct: 63 GAAPLLQNCPTAAKSTTYHPIRCGSTRCKYAN------------PNFPCPNNVIAKKRTV 110
Query: 125 DVLSIQSTDGRNPGRAVTVPNFIFLCGSEF-------VLQGLASGVVGIAGLGRSKVALP 177
+ S S R+ + N ++ SE G + GL + +++P
Sbjct: 111 CLSSDNSRLFRDTVPLLYTFNGVYTRDSEMSSSLTLTCTDGAPALKQRTIGLANTHLSIP 170
Query: 178 SQLAAAFSLKRKFALCL-----SPFDDGAIVFGDGPYYDL-NNFDVSKNLKYTPLFINKV 231
SQL + + L K ALCL S +G + G G YY L + DVSK TPL N
Sbjct: 171 SQLISMYQLPHKIALCLPSTERSQSHNGDLWIGKGEYYYLPYDKDVSKIFASTPLIGN-- 228
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
G+ S EY I V S+ + K VP+ G TKIST+ PYTV +TS+
Sbjct: 229 -------GK-SGEYLIDVKSIQIGAKTVPI----------PYGATKISTLAPYTVFQTSL 270
Query: 292 YKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHG 351
YKAL+ AF + K+A+ V PFGACF ++ G VP IDLVL W I+G
Sbjct: 271 YKALLTAFTENI-KIAKAPAVKPFGACF------YSNGGRGVPVIDLVLSG-GAKWRIYG 322
Query: 352 QNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDL 394
NS+V++ + +CLGFVDGGV P+ IVIG Q+E+NL++FDL
Sbjct: 323 SNSLVKVNKNVVCLGFVDGGVKPKYPIVIGGFQMEDNLVEFDL 365
>gi|116831501|gb|ABK28703.1| unknown [Arabidopsis thaliana]
Length = 392
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 121/343 (35%), Positives = 165/343 (48%), Gaps = 53/343 (15%)
Query: 65 GGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRI 124
G L +C S+T RCGS +C AN P+ P N +
Sbjct: 63 GAAPLLQNCPTAAKSTTYHPIRCGSTRCKYAN------------PNFPCPNNVIAKKRTV 110
Query: 125 DVLSIQSTDGRNPGRAVTVPNFIFLCGSEF-------VLQGLASGVVGIAGLGRSKVALP 177
+ S S R+ + N ++ SE G + GL + +++P
Sbjct: 111 CLSSDNSRLFRDTVPLLYTFNGVYTRDSEMSSSLTLTCTDGAPALKQRTIGLANTHLSIP 170
Query: 178 SQLAAAFSLKRKFALCL-----SPFDDGAIVFGDGPYYDL-NNFDVSKNLKYTPLFINKV 231
SQL + + L K ALCL S +G + G G YY L + DVSK TPL N
Sbjct: 171 SQLISMYQLPHKIALCLPSTERSQSHNGDLWIGKGEYYYLPYDKDVSKIFASTPLIGN-- 228
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
G+ S EY I V S+ + K VP+ G TKIST+ PYTV +TS+
Sbjct: 229 -------GK-SGEYLIDVKSIQIGAKTVPI----------PYGATKISTLAPYTVFQTSL 270
Query: 292 YKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHG 351
YKAL+ AF + K+A+ V PFGACF ++ G VP IDLVL W I+G
Sbjct: 271 YKALLTAFTENI-KIAKAPAVKPFGACF------YSNGGRGVPVIDLVLSG-GAKWRIYG 322
Query: 352 QNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDL 394
NS+V++ + +CLGFVDGGV P+ IVIG Q+E+NL++FDL
Sbjct: 323 SNSLVKVNKNVVCLGFVDGGVKPKYPIVIGGFQMEDNLVEFDL 365
>gi|20160773|dbj|BAB89714.1| putative xylanase inhibitor [Oryza sativa Japonica Group]
gi|125529039|gb|EAY77153.1| hypothetical protein OsI_05119 [Oryza sativa Indica Group]
gi|125573260|gb|EAZ14775.1| hypothetical protein OsJ_04703 [Oryza sativa Japonica Group]
Length = 434
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 135/428 (31%), Positives = 193/428 (45%), Gaps = 70/428 (16%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ 91
L++ + +DT T Y IK+ L V L G ++W C SST+ T C S
Sbjct: 34 LIVPLIRDTNTSLYTIAIKKDD----APLVVDLAGALVWSTCR----SSTHATVSCLSGA 85
Query: 92 CNLAN---AKAC----GGGI-----------CGAGPDNPISNTGTHGDIRIDVLSIQSTD 133
C AN + C GG C A P NP++ + GD+ +S +T
Sbjct: 86 CGAANQQQPRRCRYVDGGWFWSGREAGSRCACTAHPFNPVTGECSTGDLTSFAMSANTTS 145
Query: 134 GRNPGRAVTVPN---FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
G + P + C E +L L +G G+AG R ++LPSQLAA S KF
Sbjct: 146 S---GTKLLCPEAFATVGACAPERLLASLPAGATGVAGFSRRPLSLPSQLAAQRSFGNKF 202
Query: 191 ALCLSPFDDGAIVFGDGPYY----DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
ALCL G FGD P Y L + +++L YTPL N N Y+
Sbjct: 203 ALCLP----GFAAFGDTPVYIGTESLGIVNYTESLPYTPLLTNPRNPG----------YY 248
Query: 247 IGVTSVHVN--GKAVPLNKTLLSIDNE---GVGGTKISTVNPYTVLETSIYKALVQAFAS 301
+ V + V+ G+ VP + ++D + G GG +ST PY V+ +++A +AF +
Sbjct: 249 LPVKGITVSWYGRDVPASLPAGALDMDARTGRGGVVLSTTTPYAVMRPDVFRAFAEAFDA 308
Query: 302 AM-----PKVARVAPVAPFGACFRLQDIGFTRIGPV--VPQIDLVLQNKNVVWSIHGQNS 354
A+ KV RV V PF C+ F + P VP IDL L +W + +N
Sbjct: 309 AIRGTDYAKVVRVPAVEPFKLCYD-GAFPFRKRPPTWDVPTIDLELAGATGIWRLFTENY 367
Query: 355 MVQIGGDALCLGFVD---GG---VNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFE 408
MVQ +C+G ++ GG V+ ++V+G +QL+ NLL FDL L FS L F
Sbjct: 368 MVQT-PRGMCVGILEMEAGGGMPVDGEPAMVLGLKQLDTNLLVFDLDKMLLWFSGELSFR 426
Query: 409 RATCTFNF 416
C F
Sbjct: 427 LTGCVSPF 434
>gi|125552284|gb|EAY97993.1| hypothetical protein OsI_19910 [Oryza sativa Indica Group]
Length = 237
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 113/205 (55%), Gaps = 15/205 (7%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKL 60
++VA LLL L S+SP A P ++VL V+KD AT QYVT +QRTP VPVK
Sbjct: 7 LAVASGALLL---LFVSVSPCRAAAGGGPSSVVLPVSKDDATQQYVTMFRQRTPQVPVKA 63
Query: 61 TVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNL------------ANAKACGGGICGAG 108
+ L G +LWVDC+ GYVSS+ RCG+ C L A + C C
Sbjct: 64 VLDLAGTMLWVDCDAGYVSSSYAGVRCGAKPCRLLKNAGCAITCLDAPSAGCLNDTCSEF 123
Query: 109 PDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAG 168
P N ++ T G+I DVLS+ +T PG T P F+F C F+ QGLA G G+
Sbjct: 124 PKNTATSVSTAGNIITDVLSLPTTFRPAPGPLATAPAFLFTCAHTFLTQGLADGATGMVS 183
Query: 169 LGRSKVALPSQLAAAFSLKRKFALC 193
L R++ ALP+QLA F RKFALC
Sbjct: 184 LSRARFALPTQLADTFGFSRKFALC 208
>gi|297720745|ref|NP_001172734.1| Os01g0937800 [Oryza sativa Japonica Group]
gi|255674047|dbj|BAH91464.1| Os01g0937800 [Oryza sativa Japonica Group]
Length = 472
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 135/428 (31%), Positives = 193/428 (45%), Gaps = 70/428 (16%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ 91
L++ + +DT T Y IK+ L V L G ++W C SST+ T C S
Sbjct: 72 LIVPLIRDTNTSLYTIAIKKDD----APLVVDLAGALVWSTCR----SSTHATVSCLSGA 123
Query: 92 CNLAN---AKAC----GGGI-----------CGAGPDNPISNTGTHGDIRIDVLSIQSTD 133
C AN + C GG C A P NP++ + GD+ +S +T
Sbjct: 124 CGAANQQQPRRCRYVDGGWFWSGREAGSRCACTAHPFNPVTGECSTGDLTSFAMSANTTS 183
Query: 134 GRNPGRAVTVPN---FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
G + P + C E +L L +G G+AG R ++LPSQLAA S KF
Sbjct: 184 S---GTKLLCPEAFATVGACAPERLLASLPAGATGVAGFSRRPLSLPSQLAAQRSFGNKF 240
Query: 191 ALCLSPFDDGAIVFGDGPYY----DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
ALCL F FGD P Y L + +++L YTPL N N Y+
Sbjct: 241 ALCLPGF----AAFGDTPVYIGTESLGIVNYTESLPYTPLLTNPRNPG----------YY 286
Query: 247 IGVTSVHVN--GKAVPLNKTLLSIDNE---GVGGTKISTVNPYTVLETSIYKALVQAFAS 301
+ V + V+ G+ VP + ++D + G GG +ST PY V+ +++A +AF +
Sbjct: 287 LPVKGITVSWYGRDVPASLPAGALDMDARTGRGGVVLSTTTPYAVMRPDVFRAFAEAFDA 346
Query: 302 AM-----PKVARVAPVAPFGACFRLQDIGFTRIGPV--VPQIDLVLQNKNVVWSIHGQNS 354
A+ KV RV V PF C+ F + P VP IDL L +W + +N
Sbjct: 347 AIRGTDYAKVVRVPAVEPFKLCYD-GAFPFRKRPPTWDVPTIDLELAGATGIWRLFTENY 405
Query: 355 MVQIGGDALCLGFVD---GG---VNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFE 408
MVQ +C+G ++ GG V+ ++V+G +QL+ NLL FDL L FS L F
Sbjct: 406 MVQT-PRGMCVGILEMEAGGGMPVDGEPAMVLGLKQLDTNLLVFDLDKMLLWFSGELSFR 464
Query: 409 RATCTFNF 416
C F
Sbjct: 465 LTGCVSPF 472
>gi|115442115|ref|NP_001045337.1| Os01g0937600 [Oryza sativa Japonica Group]
gi|20160771|dbj|BAB89712.1| putative xylanase inhibitor [Oryza sativa Japonica Group]
gi|113534868|dbj|BAF07251.1| Os01g0937600 [Oryza sativa Japonica Group]
gi|125573258|gb|EAZ14773.1| hypothetical protein OsJ_04702 [Oryza sativa Japonica Group]
gi|215693801|dbj|BAG89000.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 442
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 135/436 (30%), Positives = 195/436 (44%), Gaps = 73/436 (16%)
Query: 24 QTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNK 83
Q ++P LV+ + +D+ T Y IK PLV V L G ++W C ST+
Sbjct: 31 QAPYKP--LVVPLVRDSDTSFYTIPIKNGAPLV-----VDLAGTLVWSTCP-----STHT 78
Query: 84 TARCGSAQCNLAN---AKAC----GGGI-----------CGAGPDNPISNTGTHGDIRID 125
T C S C AN + C GG C A P NP++ + GD+
Sbjct: 79 TVSCLSGTCGAANQQQPRRCRYVDGGWFWSGREAGSRCACTAHPFNPVTGECSTGDLTTF 138
Query: 126 VLSIQSTDGRNPGRAVTVPNFIFL--CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAA 183
+S ST N R + F + C + +L L +G G+AG R ++LPSQLAA
Sbjct: 139 AMSANST--VNGTRTLHPEEFAAVGSCAPQRLLASLPAGATGVAGFSRRPLSLPSQLAAQ 196
Query: 184 FSLKRKFALCLSPFDDGAIVFGDGPYY---DLNNF-DVSKNLKYTPLFINKVNTASGFLG 239
+ KFALC+S F FGD P Y + F D + L YTPL N
Sbjct: 197 RNFGNKFALCMSQF----ATFGDAPVYLGMEGRGFVDYREILPYTPLLTN---------- 242
Query: 240 EPSVE-YFIGVTSVHVNGKAVPLNKTL----LSID-NEGVGGTKISTVNPYTVLETSIYK 293
P + Y++ V + V+ +L L +D G GG +ST PYTV+ +++
Sbjct: 243 -PRIPGYYLPVKGISVSWSVPETPASLPAGALDLDARTGRGGVVLSTTTPYTVMRPDVFR 301
Query: 294 ALVQAFASAMPK--------VARVAPVAPFGACFRLQDIGFTRIGPV-VPQIDLVLQNKN 344
A +AF +A+ + V R PV PF C+ R + +P I L L
Sbjct: 302 AFAEAFDTAIIRRSKYTYSNVTRHPPVGPFKLCYNGAFPMLKRPASMDIPTIHLELDGAT 361
Query: 345 VVWSIHGQNSMVQIGGDALCLGFVDGG-----VNPRTSIVIGARQLENNLLQFDLATSRL 399
WS N +V G ALC+G ++ G V+ ++V+G +QL+ NLL FDL +
Sbjct: 362 GTWSWFNDNYLVFAPGAALCVGVLEMGPGGMPVDGEPAMVVGVKQLDWNLLVFDLDKMLM 421
Query: 400 GFSDSLLFERATCTFN 415
FS L F A C+++
Sbjct: 422 WFSGDLAFRLAGCSYS 437
>gi|125529037|gb|EAY77151.1| hypothetical protein OsI_05117 [Oryza sativa Indica Group]
Length = 442
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 135/436 (30%), Positives = 195/436 (44%), Gaps = 73/436 (16%)
Query: 24 QTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNK 83
Q ++P LV+ + +D+ T Y IK PLV V L G ++W C ST+
Sbjct: 31 QAPYKP--LVVPLVRDSDTSFYTIPIKNGAPLV-----VDLAGTLVWSTCP-----STHT 78
Query: 84 TARCGSAQCNLAN---AKAC----GGGI-----------CGAGPDNPISNTGTHGDIRID 125
T C S C AN + C GG C A P NP++ + GD+
Sbjct: 79 TVSCLSGTCGAANQQQPRRCRYVDGGWFWSGREAGSRCACTAHPFNPVTGECSTGDLTAF 138
Query: 126 VLSIQSTDGRNPGRAVTVPNFIFL--CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAA 183
+S ST N R + F + C + +L L +G G+AG R ++LPSQLAA
Sbjct: 139 AMSANST--VNGTRTLHPEEFAAVGSCAPQRLLASLPAGATGVAGFSRRPLSLPSQLAAQ 196
Query: 184 FSLKRKFALCLSPFDDGAIVFGDGPYY---DLNNF-DVSKNLKYTPLFINKVNTASGFLG 239
+ KFALC+S F FGD P Y + F D + L YTPL N
Sbjct: 197 RNFGNKFALCMSQF----ATFGDAPVYLGMEGRGFVDYREILPYTPLLTN---------- 242
Query: 240 EPSVE-YFIGVTSVHVNGKAVPLNKTL----LSID-NEGVGGTKISTVNPYTVLETSIYK 293
P + Y++ V + V+ +L L +D G GG +ST PYTV+ +++
Sbjct: 243 -PRIPGYYLPVKGISVSWSVPETPASLPAGALDLDARTGRGGVVLSTTTPYTVMRPDVFR 301
Query: 294 ALVQAFASAMPK--------VARVAPVAPFGACFRLQDIGFTRIGPV-VPQIDLVLQNKN 344
A +AF +A+ + V R PV PF C+ R + +P I L L
Sbjct: 302 AFAEAFDTAIIRRSKYTYSNVTRHPPVGPFKLCYNGAFPMLKRPASMDIPTIHLELDGAT 361
Query: 345 VVWSIHGQNSMVQIGGDALCLGFVDGG-----VNPRTSIVIGARQLENNLLQFDLATSRL 399
WS N +V G ALC+G ++ G V+ ++V+G +QL+ NLL FDL +
Sbjct: 362 GTWSWFNDNYLVFAPGAALCVGVLEMGPGGMPVDGEPAMVVGVKQLDWNLLVFDLDKMLM 421
Query: 400 GFSDSLLFERATCTFN 415
FS L F A C+++
Sbjct: 422 WFSGDLAFRLAGCSYS 437
>gi|156186247|gb|ABU55394.1| xylanase inhibitor 725OS [Triticum aestivum]
Length = 428
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 138/454 (30%), Positives = 202/454 (44%), Gaps = 66/454 (14%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKL 60
M + +L+ L S + RP LV V +D AT Y +K +PLV
Sbjct: 1 MWQTKDLILVAVSLCVLASTVAGDGGGRP--LVTAVTRDAATSLYTIPVKSGSPLV---- 54
Query: 61 TVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA------NAKACGGGI---------- 104
+ L G ++W C+ G +++ T C S C A N + G G+
Sbjct: 55 -LDLSGPMVWSTCDDG---ASHDTLECNSIDCMRAHRFHPPNCQHTGYGMPDPGNPYRCK 110
Query: 105 CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV 164
C A P NP+S D+ LS +TDGRNP V+ + C + +L GL +G V
Sbjct: 111 CTAHPHNPVSGGTASADMTRVTLSANATDGRNPLGPVSF-TAVTSCAPDSLLAGLPAGAV 169
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDVSKNL-K 222
G+AGL RS +A P+Q+A + FALCL + DG +FG GP + N +++ L
Sbjct: 170 GVAGLARSGLAFPAQVARTQGVANSFALCLGNRERDGVAIFGGGPLFAANGRSITEMLGG 229
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK-TLLSIDNEGVGGTKISTV 281
TPL + GE Y V V+G VPL+ L+I ST
Sbjct: 230 DTPLRKH---------GESPGYYVSASRGVFVDGVKVPLDTYAPLTI--------GFSTT 272
Query: 282 NPYTVLETSIYKALVQAFASAMPKV------ARVAPVA--PFGACFRLQDIGFTRIGPVV 333
PY ++ +Y+ L+ AF AM + ARV A PF C+ + TR G V
Sbjct: 273 TPYALVRRDVYRPLIDAFDQAMERDGAITAGARVPSPAGSPFELCYNSSRLSLTRFGYFV 332
Query: 334 PQIDLVLQNKNVVWSIHGQNSM-VQIGGDALCLGFVDG--------GVNPRTSIVIGARQ 384
P + L+ + W++ G NSM + IG C GFV+ G P ++V+G Q
Sbjct: 333 PTVGFGLEGGS-GWAVQGINSMALVIGRPTACFGFVEMKEGDKAGYGGGPAPAVVLGGLQ 391
Query: 385 LENNLLQFDLATSRLGFSDSLLFERATCT-FNFT 417
+E NL+ F+ + F+ + C+ FNFT
Sbjct: 392 MEENLVVFNEEKQTMAFTGQINGRGLFCSNFNFT 425
>gi|242556632|pdb|3HD8|A Chain A, Crystal Structure Of The Triticum Aestivum Xylanase
Inhibitor-Iia In Complex With Bacillus Subtilis Xylanase
gi|242556634|pdb|3HD8|C Chain C, Crystal Structure Of The Triticum Aestivum Xylanase
Inhibitor-Iia In Complex With Bacillus Subtilis Xylanase
Length = 389
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 126/406 (31%), Positives = 182/406 (44%), Gaps = 54/406 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KDTAT Y LV + + G ++W CE G + C S C LA
Sbjct: 9 VTKDTATSLYTIPFHDGASLV-----LDVAGLLVWSTCEGG---QSPAEIACSSPTCLLA 60
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP++ G + + +TDG P V V
Sbjct: 61 NAYPAPGCPAPSCGSDRHDKPCTAYPSNPVTGACAAGSLFHTRFAANTTDGNKPVSEVNV 120
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALPSQ+A+A + KF LCL G +
Sbjct: 121 -RVLAACAPSKLLASLPRGSTGVAGLAGSGLALPSQVASAQKVPNKFLLCLPTGGPGVAI 179
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP L ++++ YTPL G P+ ++I S+ V VP+++
Sbjct: 180 FGGGP---LPWPQFTQSMDYTPLVAKG--------GSPA--HYISARSIKVENTRVPISE 226
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPF 315
L+ GG +ST PY +L +Y+ LV AF A+ P V PVAPF
Sbjct: 227 RALA-----TGGVMLSTRLPYVLLRRDVYRPLVDAFTKALAAQPANGAPVARAVKPVAPF 281
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF-----VDG 370
C+ + +G G VP + L L + W++ G+NSMV + C+ F VD
Sbjct: 282 ELCYDTKTLGNNPGGYWVPNVLLELDGGS-DWAMTGKNSMVDVKPGTACVAFVEMKGVDA 340
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNF 416
G ++++G Q+E+ +L FD+ RLGF F + +FNF
Sbjct: 341 GDGSAPAVILGGAQMEDFVLDFDMEKKRLGFLRLPHFTGCS-SFNF 385
>gi|62996368|emb|CAG26970.1| xylanase inhibitor precursor [Triticum aestivum]
Length = 389
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 126/406 (31%), Positives = 182/406 (44%), Gaps = 54/406 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KDTAT Y LV + + G ++W CE G + C S C LA
Sbjct: 9 VTKDTATSLYTIPFHDGASLV-----LDVAGLLVWSTCEGG---QSPAEIACSSPTCLLA 60
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP++ G + + +TDG P V V
Sbjct: 61 NAYPAPGCPAPSCGSDRHDKPCTAYPSNPVTGACAAGSLFHTRFAANTTDGNKPVSEVNV 120
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALPSQ+A+A + KF LCL G +
Sbjct: 121 -RVLAACAPSKLLASLPRGSTGVAGLAGSGLALPSQVASAQKVANKFLLCLPTGGPGVAI 179
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP L ++++ YTPL G P+ ++I S+ V VP+++
Sbjct: 180 FGGGP---LPWPQFTQSMDYTPLVAKG--------GSPA--HYISARSIKVENTRVPISE 226
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPF 315
L+ GG +ST PY +L +Y+ LV AF A+ P V PVAPF
Sbjct: 227 RALA-----TGGVMLSTRLPYVLLRRDVYRPLVDAFTKALAAQPANGAPVARAVKPVAPF 281
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF-----VDG 370
C+ + +G G VP + L L + W++ G+NSMV + C+ F VD
Sbjct: 282 ELCYDTKTLGNNPGGYWVPNVLLELDGGS-DWALTGKNSMVDVKPGTACVAFVEMKGVDA 340
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNF 416
G ++++G Q+E+ +L FD+ RLGF F + +FNF
Sbjct: 341 GDGSAPAVILGGAQMEDFVLDFDMEKKRLGFLRLPHFTGCS-SFNF 385
>gi|413951363|gb|AFW84012.1| hypothetical protein ZEAMMB73_776056 [Zea mays]
Length = 434
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 130/425 (30%), Positives = 195/425 (45%), Gaps = 55/425 (12%)
Query: 23 AQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTN 82
A + P ++ VAKD AT Y ++ V + L G +LW C+ ++ +
Sbjct: 31 ASCTADPVPVLFPVAKDPATSLYTIPVRDGASHV-----IDLAGPLLWSTCDDDHLPANI 85
Query: 83 KTARCGSAQCNLANA---KACGGGI--------CGAGPDNPISNTGTHGDIRIDVLSIQS 131
C C LANA +CGGG+ C A P NP++ D+ L +
Sbjct: 86 S---CRDRLCKLANAYRAPSCGGGVAGHPCSKRCKAYPYNPVTGRCAAADLVHTRLVANT 142
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVL-QGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK- 189
TDGRNP V V + C +L L G+AGL + +ALP+Q+A + +
Sbjct: 143 TDGRNPLSQVPV-RAVAACAPRTLLDHRLPRDATGVAGLSAAGLALPAQVATSQRVANAN 201
Query: 190 -FALCL--SPFDDGAIVFG-DGPYY-------DLNNFDVSKNLKYTPLFINKVNTASGFL 238
F LCL S DG VFG GP++ + ++ D+++ L++ PL N
Sbjct: 202 AFLLCLPRSGSGDGVAVFGGRGPFFLKLFVTGEPSSGDLTRTLQFAPLRSRPGNPL---- 257
Query: 239 GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQA 298
Y++ V+ V V VPL L+ GG + T PYT L +Y+ +V+A
Sbjct: 258 ------YYVPVSGVAVGRAPVPLPPRALA-----AGGVVLCTRVPYTALRPDVYRPVVEA 306
Query: 299 FASAMPKV-ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ 357
F + + RVA V PF C+ + TR+G VP+I L+L+ W+ G +SMV
Sbjct: 307 FDRGLVRSDMRVAAVPPFEFCYNRTLLPPTRLGYGVPEIALLLEGGKQEWTFVGSSSMVD 366
Query: 358 IGGDALCLGF-----VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
+ CL V G ++V+G Q+E++LLQFDL +LGF+ + A
Sbjct: 367 VDARTACLALLEMKGVKAGDPSAAAVVVGGFQMEDHLLQFDLDKKQLGFA-RVPIPSACS 425
Query: 413 TFNFT 417
FNFT
Sbjct: 426 NFNFT 430
>gi|125573252|gb|EAZ14767.1| hypothetical protein OsJ_04694 [Oryza sativa Japonica Group]
Length = 395
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 126/411 (30%), Positives = 184/411 (44%), Gaps = 71/411 (17%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
K LV V KD AT Y +K PL + L G ++W C+ + + C
Sbjct: 36 KPLVTAVTKDGATKLYTIAVKDGHPLA-----LDLSGELVWSTCDASHSTVLPYERECVE 90
Query: 90 AQ------CNLANAKACG----GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
A C + A G G C A P N ++ GD+ L+ +T+G NP
Sbjct: 91 ANHYTPPSCWMQYGGAGGDYRYGNKCTAHPYNGVTGRCAPGDLTRTALAADATNGSNPLY 150
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
VT P + C +L L +G V +AGLGRS +AL +Q+AA ++ +KFALCL
Sbjct: 151 PVTFPA-VASCAPGSLLASLPAGAVCVAGLGRSDLALHAQVAATQNVAKKFALCLP---- 205
Query: 200 GAIVFGDGPY---YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
VFG GP+ + + D+ + L YT L
Sbjct: 206 SVAVFGGGPFVLIFPYSRPDIMQKLSYTAL------------------------------ 235
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP-VAPF 315
S + G ++S++ PYT L +Y V+A+ + +VAP VAPF
Sbjct: 236 --------RRSPELAGGQWRRLSSMVPYTELRPDVYGPFVKAWDEILQWPKKVAPPVAPF 287
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
C+ + IG R+G VP I++ L++ W I G NS+VQ+ C FV+ + P
Sbjct: 288 ELCYESRTIGSNRLGYAVPDININLED-GAAWYIFGGNSLVQVDDATACFAFVE--MRPE 344
Query: 376 T-----SIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTSIA 420
++VIG Q+E+NL+ FD +LGFS L + TC+ FNFT A
Sbjct: 345 KVGYGPAVVIGGHQMEHNLVVFDEEKQQLGFSGLLFGLQTTCSNFNFTVAA 395
>gi|47824820|emb|CAE46333.1| xylanase inhibitor [Secale cereale]
Length = 396
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 120/389 (30%), Positives = 174/389 (44%), Gaps = 53/389 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KD AT Y LV + G ++W CE G + CGS C LA
Sbjct: 28 VTKDPATSLYTIPFHDGASLV-----LDAAGPLVWSTCEAGQPPAG---IPCGSPTCLLA 79
Query: 96 NA---KACGGGICG------AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
NA C CG A P NP++ G + +TDG P V V
Sbjct: 80 NAYPAPGCPAPTCGSDKPCTAFPSNPVTGACAAGSLFHTSFVANTTDGTKPVSEVKV-GV 138
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGD 206
+ C +L L G G+AGL S +ALP+Q+A+A + +F LCL G +FG
Sbjct: 139 LAACAPSKLLASLPRGSTGVAGLANSGLALPAQVASAQKVANRFFLCLPTGGAGVAIFGG 198
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
GP L ++++ YTPL G P+ ++I + S+ V+ VP+
Sbjct: 199 GP---LPWPQFTQSMPYTPLVTKG--------GSPA--HYISLKSIKVDNTRVPV----- 240
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPFGAC 318
+E GG +ST PY +L +Y+ LV AF A+ P V PVAPFG C
Sbjct: 241 ---SEATGGVMLSTRLPYALLRRDVYRPLVDAFTKALAAQPANGAPVARAVQPVAPFGVC 297
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF-----VDGGVN 373
+ + +G G VP + L L W++ G+NSMV + C+ F V+ G
Sbjct: 298 YDTKTLGNNLGGYAVPNVLLALDGGG-EWAMTGKNSMVDVKPGTACVAFVEMKGVEAGDG 356
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFS 402
++++G Q+E+ +L FD+ RLGF+
Sbjct: 357 RAPAVILGGAQMEDFVLDFDMEKKRLGFT 385
>gi|156186243|gb|ABU55392.1| xylanase inhibitor 725ACC [Triticum aestivum]
Length = 403
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 121/392 (30%), Positives = 179/392 (45%), Gaps = 52/392 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KDTAT Y LV + + G ++W C+ G + C S C LA
Sbjct: 28 VTKDTATSLYTIPFHDGASLV-----LDVAGPLVWSTCDGGQPPAE---IPCSSPTCLLA 79
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP++ G + + +TDG P V V
Sbjct: 80 NAYPAPGCPAPSCGSDKHDKPCTAYPYNPVTGACAAGSLFHTRFAANTTDGSKPVSKVNV 139
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALP+Q+A+A + +F LCL G +
Sbjct: 140 -GVLAACPPSKLLASLPRGSTGVAGLADSGLALPAQVASAQKVANRFLLCLPTGGPGVAI 198
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP + ++++ YTPL G P+ ++I + V VP+++
Sbjct: 199 FGGGP---VPWPQFTQSMPYTPLVTKG--------GSPA--HYISARFIEVGDTRVPVSE 245
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP-------KVARVA-PVAPF 315
L+ GG +ST PY VL +Y+ LV AF A+ VAR A PVAPF
Sbjct: 246 GALA-----TGGVMLSTRLPYAVLRRDVYRPLVDAFTKALAAQHANGAPVARAAEPVAPF 300
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF-----VDG 370
G C+ + +G G VP + L L + W++ G+NSMV + C+ F V+
Sbjct: 301 GVCYDTKTLGNNLGGYSVPNVQLGLDGGSDTWTMTGKNSMVDVKPGTACVAFVEMKGVEA 360
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
G ++++G Q+E+ +L FD+ RLGFS
Sbjct: 361 GDGRAPAVILGGAQMEDFVLDFDMEKKRLGFS 392
>gi|255552263|ref|XP_002517176.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543811|gb|EEF45339.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 230
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 102/233 (43%), Positives = 134/233 (57%), Gaps = 32/233 (13%)
Query: 120 GDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQ 179
G+I DV+S+QS GRN V+VPN F+C S+F L+ LA G+ G+A LGRS ++LP
Sbjct: 8 GEIGQDVVSLQSISGRN----VSVPNIPFVCASKFPLENLADGITGMAALGRSNISLPVY 63
Query: 180 LAAAFSLKRKFALCLSPFDD--GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF 237
++AF + R A+CLS + G I FGDGPY + S L YTPL N V+TA +
Sbjct: 64 FSSAFGIPRISAVCLSSLTNSSGVIFFGDGPYSIIP----SNLLIYTPLIRNPVSTAGSY 119
Query: 238 L-GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALV 296
+ GEPS +YFIGV S+ V D E GT+ TV+P+TVL T+IYK V
Sbjct: 120 VEGEPSTDYFIGVKSIRV--------------DREDNVGTRNGTVHPHTVLHTAIYKPFV 165
Query: 297 QAFASAMPKV--ARVAP--VAPFGACFRLQDIGF--TRIGPVVPQIDLVLQNK 343
+AF M + +V P FG CF+L D G+ GPVVP IDL + K
Sbjct: 166 KAFVKQMRAIFMTQVEPPIAVSFGPCFQLID-GYNSNEYGPVVPFIDLYWRAK 217
>gi|326487890|dbj|BAJ89784.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 403
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 122/391 (31%), Positives = 177/391 (45%), Gaps = 54/391 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KD AT Y T LV + + G ++W C+ G C S C LA
Sbjct: 28 VTKDAATSLYKIPFHDGTNLV-----LDVAGPLVWSTCDGGQPPPAADIT-CSSPTCLLA 81
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP++ G + L +TDG P AVTV
Sbjct: 82 NAYPAAGCPAPSCGSDRHDKPCTAYPSNPVTGACAAGSLFRARLVANTTDGNRPVSAVTV 141
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALP+Q+A+A + +F LCL G +
Sbjct: 142 -GVLAACAPTKLLASLPRGSTGVAGLAGSGLALPAQVASAQKVAHRFLLCLPTGGAGVAI 200
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
G GP L ++++ YTPL + G P+ +++ TS+ V VP+
Sbjct: 201 LGGGP---LPWPQFTQSMAYTPLVAKQ--------GSPA--HYVSGTSIRVEDTRVPVPD 247
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPF 315
L+ GG +ST PY +L +Y+ +V AF A+ P V PVAPF
Sbjct: 248 RALA-----TGGVMLSTRLPYVLLRRDVYRPVVDAFTKALAAQHANGAPAARAVDPVAPF 302
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV-----DG 370
G C+ + +G G VP + L L W++ G+NSMV + C+ FV DG
Sbjct: 303 GLCYDAKTLGNNLGGYSVPNVVLALDGGG-EWAMTGKNSMVDVKPGTACVAFVEMEAGDG 361
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G ++++G Q+E+ +L FD+ RLGF
Sbjct: 362 GA---PAVILGGAQMEDFVLDFDMEKKRLGF 389
>gi|326489137|dbj|BAK01552.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 134/447 (29%), Positives = 196/447 (43%), Gaps = 63/447 (14%)
Query: 7 FLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGG 66
FL+ S+ IF+ + + + LV +AKD AT Y K PLV + L G
Sbjct: 8 FLVAVSICIFACATAGDRGG---QPLVTAIAKDAATSLYTIPAKSGRPLV-----LDLSG 59
Query: 67 NILWVDCEKGYVSSTNKTARCGSAQCNLA------NAKACGGGI----------CGAGPD 110
I+W C+ +++ T C C A N G G+ C A P
Sbjct: 60 PIVWSTCDDD--GASHDTLECNDMDCMRAHRFHPPNCPHNGNGMPDAQNDHRCKCTAHPH 117
Query: 111 NPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLG 170
NP+S GD+ LS +TDG+NP V+ C + +L GL +G VG+AGL
Sbjct: 118 NPVSGDTASGDMVRVTLSANATDGKNPLHEVSF-TAAASCAPDSLLAGLPAGAVGVAGLA 176
Query: 171 RSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNL-KYTPLFI 228
RS +A P+Q+A + FALCL + G +FG GP + N ++ L TPL
Sbjct: 177 RSGLAFPAQVARTQGVANSFALCLGNRERAGVAIFGGGPLFAANGRSITDMLGGDTPLRK 236
Query: 229 NKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLE 288
+ GE Y V V+G VPL+ D+ G ST PY ++
Sbjct: 237 H---------GESPGYYVTASRGVFVDGARVPLD------DSYGPLAIGFSTTTPYALVR 281
Query: 289 TSIYKALVQAFASAMPKV------ARVAPVA--PFGACFRLQDIGFTRIGPVVPQIDLVL 340
+Y+ L+ AF AM + AR+ A PF C+ + TR G VP + L
Sbjct: 282 RDVYRPLIDAFDRAMERDGAITAGARIPSPAGSPFELCYNSSRLSLTRFGYFVPTVGFGL 341
Query: 341 QNKNVVWSIHGQNSM-VQIGGDALCLGFVDGGVNPR--------TSIVIGARQLENNLLQ 391
+ W++ G NSM + IG C GFV+ + ++V+G Q+E NL+
Sbjct: 342 EG-GASWAVQGINSMALVIGRRMACFGFVEMKEGDKAGYGGGAAPAVVLGGLQMEENLVV 400
Query: 392 FDLATSRLGFSDSLLFERATCT-FNFT 417
FD + F+ + +C+ FNFT
Sbjct: 401 FDEEKRTMAFTGQINGRGLSCSNFNFT 427
>gi|47824814|emb|CAE46330.1| xylanase inhibitor [Hordeum vulgare]
Length = 403
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 125/403 (31%), Positives = 182/403 (45%), Gaps = 54/403 (13%)
Query: 24 QTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNK 83
Q S + ++ V KD AT Y LV + + G ++W C+ G +
Sbjct: 16 QASSKALPVLAPVTKDAATSLYTIPFHDGANLV-----LDVAGPLVWSTCDGGQRPPPAE 70
Query: 84 TARCGSAQCNLANA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQS 131
C S C LANA +CG C A P NP++ G + L
Sbjct: 71 IT-CSSPTCLLANAYPAPGCPAPSCGSDRHDKPCTAYPSNPVTGACAAGSLFRARLVANI 129
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFA 191
TDG P AVTV + C +L L G G+AGL S +ALP+Q+A+A + +F
Sbjct: 130 TDGNRPVSAVTV-GVLAACAPTKLLASLPRGSTGVAGLAGSGLALPAQVASAQKVSHRFL 188
Query: 192 LCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTS 251
LCL G + G GP L ++++ YTPL + G P+ +++ TS
Sbjct: 189 LCLPTGGAGVAILGGGP---LPWPQFTQSMAYTPLVAKQ--------GSPA--HYVSGTS 235
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK------ 305
+ V VP+ L+ GG +ST PY +L +Y+ V AFA A+
Sbjct: 236 IRVEDTRVPVPDRALA-----TGGVMLSTRLPYVLLRRDVYRPFVDAFAKALAAQHANGA 290
Query: 306 -VAR-VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL 363
AR V PVAPFG C+ + +G G VP + L L W++ G+NSMV +
Sbjct: 291 LAARGVNPVAPFGLCYDAKTLGNNLGGYSVPNVVLALDGGG-EWAMTGKNSMVDVKPGTA 349
Query: 364 CLGFV-----DGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
C+ FV DGG ++++G Q+E+ +L FD+ RLGF
Sbjct: 350 CVAFVEMEAGDGGA---PAVILGGAQMEDFVLDFDMEKKRLGF 389
>gi|156186253|gb|ABU55397.1| xylanase inhibitor 801NEW [Triticum aestivum]
Length = 404
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 125/411 (30%), Positives = 183/411 (44%), Gaps = 60/411 (14%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KD AT Y Q LV + + G ++W C++G + + C S C LA
Sbjct: 28 VTKDPATSLYTIPFHQGASLV-----LDIAGPLVWSTCQRGDLPTDIP---CSSPTCLLA 79
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP++ G + L +T+G P V V
Sbjct: 80 NAYPAPGCPASSCGSDRHHKPCKAYPYNPVTGACAAGSLARTTLVASTTNGNYPVSEVNV 139
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGLG S +ALP+Q+A+ + KF LCL G +
Sbjct: 140 -RVLAACAPRKLLASLPRGSTGVAGLGGSGLALPAQVASTQKVDNKFLLCLPSGGPGVAI 198
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP L +++++ YTPL G P+ ++I V ++ V V +++
Sbjct: 199 FGGGP---LPWPQLTRSMPYTPLVTKG--------GSPA--HYISVKAIQVEDTRVSVSE 245
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPF 315
L + +ST PY +L +Y+ LV AF A+ P V PVAPF
Sbjct: 246 RALVM---------LSTRLPYAMLRRDVYRPLVDAFTKALAAQPANGAPVARAVKPVAPF 296
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF-----VDG 370
C+ + +G G VP + L L + W + G+N MV + C+GF VD
Sbjct: 297 ELCYDTKSLGNNPGGYWVPNVGLALDGGSDWW-MTGKNFMVDVKPGTACVGFVEMKGVDA 355
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTSIA 420
G ++++G QLE +L FD+ RLGF L C+ FNFT A
Sbjct: 356 GAGRAPAVILGGAQLEELVLDFDMEKKRLGFLR--LPHYMDCSRFNFTRSA 404
>gi|326488955|dbj|BAJ98089.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 453
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 133/428 (31%), Positives = 187/428 (43%), Gaps = 65/428 (15%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
+ L++ V+KD AT Y IK P+V + L G I+W C+ +++ T C
Sbjct: 48 RPLMMAVSKDAATSLYTVPIKSGRPMV-----LDLSGPIIWSTCDDD--GASHDTLECND 100
Query: 90 AQCNLA------NAKACGGGI----------CGAGPDNPISNTGTHGDIRIDVLSIQSTD 133
C A N G G+ C A P NP+S GD+ LS +TD
Sbjct: 101 MDCMRAHRFHPPNCPHNGNGMPDAGNTHRCKCTAHPHNPVSGDTASGDMTRVTLSANATD 160
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
GRNP V + C + +L GL G VG+AGLGRS +A P+Q+A + + FALC
Sbjct: 161 GRNPLGPVAF-TAVTSCAPDSLLAGLPVGAVGVAGLGRSGIAFPAQVARTQGVPKSFALC 219
Query: 194 L-SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFL--GEPSVEYFIGVT 250
L S G +FG GP + F S+ P +++ + GE Y
Sbjct: 220 LGSRQTTGVAIFGGGPLF---LFPASR-----PSLTELLSSGTPLRKHGESPGYYVSASR 271
Query: 251 SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA 310
V V+G VPL + + VG ST Y L +Y+ L+ AF AM + A A
Sbjct: 272 GVFVDGAQVPLEDSYAPLT---VG---FSTTTAYAQLRRDVYRPLIDAFEQAMEEQAAGA 325
Query: 311 PV-------APFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDA 362
V APF C+ +G TR G VP + L+ W + G NSM+ + GG
Sbjct: 326 RVPSSSPAAAPFELCYNSSKLGQTRSGFPVPTVSFRLEG-GTSWLVQGVNSMLVVNGGAT 384
Query: 363 LCLGFVD------------GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERA 410
C FV+ GG P ++V+G Q+E NL+ FD + F+ +
Sbjct: 385 ACFAFVEMKEGDKAGYATGGGSAP--AVVLGGLQMEENLVVFDEEKQTMAFTGQINGRGF 442
Query: 411 TC-TFNFT 417
C FNFT
Sbjct: 443 FCNNFNFT 450
>gi|156186251|gb|ABU55396.1| xylanase inhibitor 801OS [Triticum aestivum]
Length = 433
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 129/412 (31%), Positives = 185/412 (44%), Gaps = 64/412 (15%)
Query: 38 KDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA-- 95
K T Y +K PL L + L G I+W C+ G +++ T C + C A
Sbjct: 18 KGCRTSLYTIPVKSSRPL---PLVLDLSGPIVWSTCDGG---ASHDTLECNNMDCMRAHR 71
Query: 96 ----NAKACGGGI----------CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
N + G G+ C A P NP+S GD+ LS +TDGRNP V
Sbjct: 72 FHPPNCQHTGYGMPDAHNPYRCKCTAHPHNPVSGDTASGDMTRVTLSANATDGRNPLGPV 131
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFDDG 200
+ + C + +L GL G VG+AGL RS +A P+Q++ + FALCL S +G
Sbjct: 132 SF-TAVTSCALDSLLAGLPVGAVGVAGLARSGLAFPAQVSRTQGVANSFALCLPSGQGNG 190
Query: 201 AIVFGDGPYYDLNNFDVSKNL-KYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
+FG GP + N +++ L TPL + GE Y + V+G V
Sbjct: 191 VAIFGGGPLFAANGRSITELLGSRTPLRKH---------GESPGYYISASRGIAVDGARV 241
Query: 260 PLNKTL-LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV------ARV--A 310
PL+ L+I ST PY L +Y+ L+ AF AM + ARV
Sbjct: 242 PLDSYAPLTIG--------FSTTIPYAELRHDVYRPLINAFDQAMERQGAITTGARVPSP 293
Query: 311 PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVD 369
APF C+ + TR G VP ++L+L+ +N W++ G NSM Q+ C FV+
Sbjct: 294 AAAPFELCYNSSRLSPTRFGYFVPTVELMLEGGRN--WTVFGINSMAQVNRATACFAFVE 351
Query: 370 ---------GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
GGV ++V+G Q++ NLL FD LGF+ L +C
Sbjct: 352 MKAGDKSWYGGVA-APAVVLGGFQMQQNLLMFDEEKQTLGFTGQLTGRGLSC 402
>gi|62996370|emb|CAG26971.1| xylanase inhibitor precursor [Triticum aestivum]
Length = 389
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 123/406 (30%), Positives = 184/406 (45%), Gaps = 54/406 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KDTAT Y LV + + G ++W C+ G + C S C LA
Sbjct: 9 VTKDTATSLYTIPFHDGANLV-----LDVAGPLVWSTCDGGQPPAE---IPCSSPTCLLA 60
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP++ G + +TDG P V V
Sbjct: 61 NAYPAPGCPAPSCGSDRHDKPCTAYPYNPVTGACAAGSLFHTKFVANTTDGNKPVSKVNV 120
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALP+Q+A+A + +F LCL G +
Sbjct: 121 -GVVAACAPSKLLASLPRGSTGVAGLADSGLALPAQVASAQKVANRFLLCLPTGGLGVAI 179
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP L ++++ YTPL A G G P+ ++I + S+ V VP+++
Sbjct: 180 FGGGP---LPWPQFTQSMDYTPL------VAKG--GSPA--HYISLKSIKVENTRVPVSE 226
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPF 315
L+ GG +ST PY +L +Y+ V AF A+ P V PVAPF
Sbjct: 227 RALA-----TGGVMLSTRLPYVLLRRDVYRPFVDAFTKALAAQPANGAPVARAVKPVAPF 281
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF-----VDG 370
C+ + +G G VP + L + + W++ G+NSMV + C+ F V+
Sbjct: 282 ELCYDTKSLGNNLGGYWVPNVGLAVDGGS-DWAMTGKNSMVDVKPGTACVAFVEMKGVEA 340
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNF 416
G ++++G Q+E+ +L FD+ RLGFS F + +FNF
Sbjct: 341 GDGRAPAVILGGAQMEDFVLDFDMEKKRLGFSRLPQFTGCS-SFNF 385
>gi|56201272|dbj|BAD72882.1| xylanase inhibitor TAXI-IV [Triticum aestivum]
Length = 408
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 123/406 (30%), Positives = 184/406 (45%), Gaps = 54/406 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KDTAT Y LV + + G ++W C+ G + C S C LA
Sbjct: 28 VTKDTATSLYTIPFHDGANLV-----LDVAGPLVWSTCDGGQPPAE---IPCSSPTCLLA 79
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP++ G + +TDG P V V
Sbjct: 80 NAYPAPGCPAPSCGSDRHDKPCTAYPYNPVTGACAAGSLFHTKFVANTTDGNKPVSKVNV 139
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALP+Q+A+A + +F LCL G +
Sbjct: 140 -GVVAACAPSKLLASLPRGSTGVAGLADSGLALPAQVASAQKVANRFLLCLPTGGLGVAI 198
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP L ++++ YTPL A G G P+ ++I + S+ V VP+++
Sbjct: 199 FGGGP---LPWPQFTQSMDYTPL------VAKG--GSPA--HYISLKSIKVENTRVPVSE 245
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPF 315
L+ GG +ST PY +L +Y+ V AF A+ P V PVAPF
Sbjct: 246 RALA-----TGGVMLSTRLPYVLLRRDVYRPFVDAFTKALAAQPANGAPVARAVKPVAPF 300
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF-----VDG 370
C+ + +G G VP + L + + W++ G+NSMV + C+ F V+
Sbjct: 301 ELCYDTKSLGNNLGGYWVPNVGLAVDGGS-DWAMTGKNSMVDVKPGTACVAFVEMKGVEA 359
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNF 416
G ++++G Q+E+ +L FD+ RLGFS F + +FNF
Sbjct: 360 GDGRAPAVILGGAQMEDFVLDFDMEKKRLGFSRLPQFTGCS-SFNF 404
>gi|242059837|ref|XP_002459064.1| hypothetical protein SORBIDRAFT_03g045250 [Sorghum bicolor]
gi|241931039|gb|EES04184.1| hypothetical protein SORBIDRAFT_03g045250 [Sorghum bicolor]
Length = 448
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 137/448 (30%), Positives = 201/448 (44%), Gaps = 77/448 (17%)
Query: 22 TAQTSFRPKALVLRVAKDTATLQYVTQIKQR--TPLVPVKLTVHLGGNILWVDCEKGYVS 79
+A +P +V RV+KD++T Y+ ++ P+V + L G +LW C
Sbjct: 19 SAAARSKPLPVVARVSKDSSTGLYLISVRNYDANPVV-----LDLAGPLLWWPCSG--RQ 71
Query: 80 STNKTARCGSAQCNLANAKA------CGGG---------ICGAGPDNPISNTGTHGDIRI 124
C S C++AN GG C A P NP++ G +
Sbjct: 72 EQEHPILCSSGTCHVANRNHPPNCPYIDGGRPGSPEPSCHCTAYPYNPVNGKCGSGVLTW 131
Query: 125 DVLSIQSTDGRNP-----GRAVT------VPNFIFLCGSEFVLQGLASG-----VVGIAG 168
+ LS +TDG+ P RAV +P F + +QG G+AG
Sbjct: 132 EWLSANTTDGQRPLYPVSFRAVASCAPDDLPFSSFPELFPWQVQGRQPPSSRYPYPGVAG 191
Query: 169 LGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFD----VSKNLKYT 224
L RS ++LPSQ+AA + KFALCL +FG GP + + D V+ +L T
Sbjct: 192 LSRSPLSLPSQVAAELKVSSKFALCLP----HVAIFGGGPVHIPGSADDVETVTDHLSRT 247
Query: 225 PLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPY 284
L N N+A Y+I V + VNG V L L++D G GG +STV PY
Sbjct: 248 RLLRNPRNSA----------YYIDVAGIAVNGARVALPDGALTLDATGQGGVALSTVTPY 297
Query: 285 TVLETSIYKALVQAFASAMPKVARV--APVAPFGACFRL---QDIGFTRIGPVVPQIDLV 339
T L IY+A++ AF + RV AP P CF L +G T G + +DL+
Sbjct: 298 TALRPDIYRAVLAAFDAVTAGFPRVSEAPNKPLERCFNLTVMNQMG-TWTGSLPVSVDLM 356
Query: 340 LQN-KNVVWSIHGQNSMVQIGGDALCLGFVDGG--------VNPRTSIVIGARQLENNLL 390
L + KN W+ ++ ++ LC FV+ G V ++V+G Q+ENNL+
Sbjct: 357 LADGKN--WTFTSLSATDEVVPQTLCFAFVEMGAGTAAAYAVPDSPAVVVGGHQMENNLM 414
Query: 391 QFDLATSRLGFSDSLLFER--ATCTFNF 416
+FDL LG++ + +R A FNF
Sbjct: 415 EFDLKKGVLGYTGLMFDQRMGACSNFNF 442
>gi|56201270|dbj|BAD72881.1| xylanase inhibitor TAXI-III [Triticum aestivum]
gi|56201352|dbj|BAD72883.1| xylanase inhibitor TAXI-III [Triticum aestivum]
Length = 401
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 174/391 (44%), Gaps = 53/391 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KDTAT Y LV + + G ++W CE S C S C L+
Sbjct: 28 VTKDTATSLYTIPFHDGASLV-----LDVAGPLVWSTCEG---SQPPAEIPCSSPTCLLS 79
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP++ G + + +TDG P V V
Sbjct: 80 NAYPAPGCPAPSCGSDRHDKPCTAYPSNPVTGACAAGSLFHTKFAANTTDGNKPVSEVNV 139
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALP+Q+A+ + +F LCL G +
Sbjct: 140 -GVLAACAPSKLLASLPRGSTGVAGLANSGLALPAQVASTQKVANRFLLCLPTGGLGVAI 198
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP L ++++ YTPL G P+ ++I + S+ V VP+++
Sbjct: 199 FGGGP---LPWPQFTQSMDYTPLVAKG--------GSPA--HYISLKSIKVENTRVPVSE 245
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPF 315
L+ GG +ST PY +L +Y+ V AF A+ P V PVAPF
Sbjct: 246 RALA-----TGGVMLSTRLPYVLLRRDVYRPFVGAFTKALAAQPANGAPVARAVKPVAPF 300
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF-----VDG 370
C+ + +G G VP + L + + W++ G+NSMV + C+ F V+
Sbjct: 301 ELCYDTKSLGNNLGGYWVPNVGLAVDGGS-DWAMTGKNSMVDVKPGTACVAFVEMKGVEA 359
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G ++++G Q+E+ +L FD+ RLGF
Sbjct: 360 GDGRAPAVILGGAQMEDFVLDFDMEKKRLGF 390
>gi|62996372|emb|CAG26972.1| xylanase inhibitor precursor [Triticum aestivum]
Length = 401
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 174/391 (44%), Gaps = 53/391 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KDTAT Y LV + + G ++W CE S C S C L+
Sbjct: 28 VTKDTATSLYTIPFHDGASLV-----LDVAGPLVWSTCEG---SQPPAEIPCSSPTCLLS 79
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP++ G + + +TDG P V V
Sbjct: 80 NAYPAPGCPAPSCGSDRHDKPCTAYPSNPVTGACAAGSLFHTKFAANTTDGNKPVSEVNV 139
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALP+Q+A+ + +F LCL G +
Sbjct: 140 -GVLAACAPSKLLASLPRGSTGVAGLANSGLALPAQVASTQKVANRFLLCLPTGGLGVAI 198
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP L ++++ YTPL G P+ ++I + S+ V VP+++
Sbjct: 199 FGGGP---LPWPQFTQSMDYTPLVAKG--------GSPA--HYISLKSIKVENTRVPVSE 245
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPF 315
L+ GG +ST PY +L +Y+ V AF A+ P V PVAPF
Sbjct: 246 RALA-----TGGVMLSTRLPYVLLRRDVYRPFVGAFTKALAAQPANGAPVARAVKPVAPF 300
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF-----VDG 370
C+ + +G G VP + L + + W++ G+NSMV + C+ F V+
Sbjct: 301 ELCYDTKSLGNNLGGYWVPNVGLAVDGGS-DWAMTGKNSMVDVKPGTACVAFVEMKGVEA 359
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G ++++G Q+E+ +L FD+ RLGF
Sbjct: 360 GDGRAPAVILGGAQMEDFVLDFDMEKKRLGF 390
>gi|23954367|emb|CAD27730.1| xylanase inhibitor [Triticum aestivum]
gi|56201268|dbj|BAD72880.1| xylanase inhibitor TAXI-I [Triticum aestivum]
Length = 402
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 117/392 (29%), Positives = 174/392 (44%), Gaps = 53/392 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KD AT Y LV + + G ++W C+ G + C S C LA
Sbjct: 28 VTKDPATSLYTIPFHDGASLV-----LDVAGPLVWSTCDGGQPPAE---IPCSSPTCLLA 79
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP+S G + +TDG P V V
Sbjct: 80 NAYPAPGCPAPSCGSDKHDKPCTAYPYNPVSGACAAGSLSHTRFVANTTDGSKPVSKVNV 139
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALP+Q+A+A + +F LCL G +
Sbjct: 140 -GVLAACAPSKLLASLPRGSTGVAGLANSGLALPAQVASAQKVANRFLLCLPTGGPGVAI 198
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP + ++++ YTPL G P+ ++I S+ V VP+ +
Sbjct: 199 FGGGP---VPWPQFTQSMPYTPLVTKG--------GSPA--HYISARSIVVGDTRVPVPE 245
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPF 315
L+ GG +ST PY +L +Y+ L+ AF A+ P V VAPF
Sbjct: 246 GALA-----TGGVMLSTRLPYVLLRPDVYRPLMDAFTKALAAQHANGAPVARAVEAVAPF 300
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD-----G 370
G C+ + +G G VP + L L + W++ G+NSMV + C+ FV+
Sbjct: 301 GVCYDTKTLGNNLGGYAVPNVQLGLDGGS-DWTMTGKNSMVDVKQGTACVAFVEMKGVAA 359
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
G ++++G Q+E+ +L FD+ RLGFS
Sbjct: 360 GDGRAPAVILGGAQMEDFVLDFDMEKKRLGFS 391
>gi|55669876|pdb|1T6E|X Chain X, Crystal Structure Of The Triticum Aestivum Xylanase
Inhibitor I
gi|55669877|pdb|1T6G|A Chain A, Crystal Structure Of The Triticum Aestivum Xylanase
Inhibitor-i In Complex With Aspergillus Niger Xylanase-i
gi|55669878|pdb|1T6G|B Chain B, Crystal Structure Of The Triticum Aestivum Xylanase
Inhibitor-i In Complex With Aspergillus Niger Xylanase-i
Length = 381
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 117/392 (29%), Positives = 174/392 (44%), Gaps = 53/392 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KD AT Y LV + + G ++W C+ G + C S C LA
Sbjct: 7 VTKDPATSLYTIPFHDGASLV-----LDVAGPLVWSTCDGGQPPAE---IPCSSPTCLLA 58
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP+S G + +TDG P V V
Sbjct: 59 NAYPAPGCPAPSCGSDKHDKPCTAYPYNPVSGACAAGSLSHTRFVANTTDGSKPVSKVNV 118
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALP+Q+A+A + +F LCL G +
Sbjct: 119 -GVLAACAPSKLLASLPRGSTGVAGLANSGLALPAQVASAQKVANRFLLCLPTGGPGVAI 177
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP + ++++ YTPL G P+ ++I S+ V VP+ +
Sbjct: 178 FGGGP---VPWPQFTQSMPYTPLVTKG--------GSPA--HYISARSIVVGDTRVPVPE 224
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPF 315
L+ GG +ST PY +L +Y+ L+ AF A+ P V VAPF
Sbjct: 225 GALA-----TGGVMLSTRLPYVLLRPDVYRPLMDAFTKALAAQHANGAPVARAVEAVAPF 279
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD-----G 370
G C+ + +G G VP + L L + W++ G+NSMV + C+ FV+
Sbjct: 280 GVCYDTKTLGNNLGGYAVPNVQLGLDGGS-DWTMTGKNSMVDVKQGTACVAFVEMKGVAA 338
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
G ++++G Q+E+ +L FD+ RLGFS
Sbjct: 339 GDGRAPAVILGGAQMEDFVLDFDMEKKRLGFS 370
>gi|297812093|ref|XP_002873930.1| hypothetical protein ARALYDRAFT_488794 [Arabidopsis lyrata subsp.
lyrata]
gi|297319767|gb|EFH50189.1| hypothetical protein ARALYDRAFT_488794 [Arabidopsis lyrata subsp.
lyrata]
Length = 365
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 123/386 (31%), Positives = 176/386 (45%), Gaps = 86/386 (22%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY 77
L PI S ++ L + T++ ++V + PL+ +C
Sbjct: 31 LHPIYKDKSTNIYSIPLSIGSTTSSEEFVLDLNGAAPLLQ--------------NCATAA 76
Query: 78 VSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNP 137
S+T +CGS +CN AN P+ P N +V++ + T R+
Sbjct: 77 KSTTYHPIKCGSTRCNYAN------------PNFPCPN---------NVITKKRTVCRSS 115
Query: 138 GRAV----TVP-----NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKR 188
A TVP N ++ SE + G L + V L + F LKR
Sbjct: 116 DNARLFRDTVPLLYTFNGVYTMDSEKSSSLTLTCSDGAPTLKQRTVGLAN---THFFLKR 172
Query: 189 KFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIG 248
+CL P I+ G + TPL + + S EY I
Sbjct: 173 WLFVCLPPKGQRLILVTFGSIF-----------ASTPLIAS----------DKSGEYLID 211
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
V S+ + GK VP+ G TKIST+ PYTVL+TSIYKAL+ AFA + K+A+
Sbjct: 212 VKSIQIGGKTVPILH----------GTTKISTLAPYTVLQTSIYKALLTAFAGSA-KIAK 260
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV 368
V PFGACFR + G VP IDL+++ W I+G NS+V++ + +CLGFV
Sbjct: 261 APAVKPFGACFR------SNGGRGVPVIDLLVRG-GAKWRIYGSNSLVKVNKNVVCLGFV 313
Query: 369 DGGVNPRTSIVIGARQLENNLLQFDL 394
DGGVNP+ IVIG Q+E+NL++FDL
Sbjct: 314 DGGVNPKNPIVIGGLQMEDNLVEFDL 339
>gi|388493468|gb|AFK34800.1| unknown [Lotus japonicus]
Length = 145
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 82/147 (55%), Positives = 93/147 (63%), Gaps = 19/147 (12%)
Query: 287 LETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVV 346
+ET+IYKA+ AF ++ V+PVAPFG CF +DI F+RIGP VP IDLVLQN V
Sbjct: 1 METTIYKAVADAFVKSL-GAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQN-GVE 58
Query: 347 WSIHGQNSMVQIGGDALCLGFVDGGVNPR--------------TSIVIGARQLENNLLQF 392
W I G NSMVQ D +CLGFVD G NP+ TSI IGA QLENNLL+F
Sbjct: 59 WPIIGANSMVQFD-DVICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKF 117
Query: 393 DLATSRLGFSDSLLFERATC-TFNFTS 418
DLA SRLGF SL E C F FTS
Sbjct: 118 DLAASRLGFR-SLFLEHDNCQNFRFTS 143
>gi|302760219|ref|XP_002963532.1| hypothetical protein SELMODRAFT_438360 [Selaginella moellendorffii]
gi|300168800|gb|EFJ35403.1| hypothetical protein SELMODRAFT_438360 [Selaginella moellendorffii]
Length = 344
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 104/294 (35%), Positives = 141/294 (47%), Gaps = 39/294 (13%)
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
+S+ +TDG NP VT+P F CGS +A LGR ALP++L+
Sbjct: 75 VSLNATDGSNPTGPVTIPGVPFKCGSP------------VAALGRGSQALPARLSP--RN 120
Query: 187 KRKFALCLSPFDDGAIVFGDGPY-YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
K+ CLS I FG + N +S L+YTPL + Y
Sbjct: 121 KKIVTYCLSQQGSSPIFFGAQDINFMPNKRPISPLLQYTPLVSPPARHS----------Y 170
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
I V SV VNG+ +P K +S+ PYT L T Y A+ AF +
Sbjct: 171 AIRVNSVRVNGQRLPAVKP---------AAWALSSTVPYTRLVTPAYVAIRDAFRNL--T 219
Query: 306 VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL-C 364
V RVAPVAPF CF +G TR+GP VP ++L L+ N W++ G N+MV + + C
Sbjct: 220 VPRVAPVAPFDTCFNASGLGSTRVGPPVPPVELQLEG-NATWTLFGANTMVFLKDSTVAC 278
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFT 417
L FVD G + V+G Q +NL++ DL R GF+ L F + TC+ FN T
Sbjct: 279 LAFVDAGSSSPGLSVVGTFQQMHNLVRLDLEKQRFGFTGILFFYQTTCSNFNTT 332
>gi|357126720|ref|XP_003565035.1| PREDICTED: basic 7S globulin-like [Brachypodium distachyon]
Length = 420
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 120/419 (28%), Positives = 186/419 (44%), Gaps = 57/419 (13%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ 91
+++ V KD T Y LV + G ++W C+ + + C S
Sbjct: 23 VLVPVTKDPKTSLYTIPFHDGAALV-----LDTAGPLVWTTCQPDHHPAV---LACTSPT 74
Query: 92 CNLANA---------------KACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRN 136
C LAN C P NP++ + GD+ +TDGRN
Sbjct: 75 CKLANGFPFPGCRSSSSGSSCPPNSHNKCTVYPYNPVTGSCAPGDLSHTRFVANTTDGRN 134
Query: 137 PGRAVTVPNF---IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
P V+V + L ++ +L+ L G G+AGL + +ALP+Q+A + L +KF LC
Sbjct: 135 PVSQVSVKAIAACVSLGDNKKLLEKLPLGSAGVAGLAGTGLALPAQVAGSQRLPKKFLLC 194
Query: 194 LSP---FDDGAIVFGD-GPYYDL-NNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIG 248
LS + G VFG GP + L + + +++L YTPL + K G P+ Y++
Sbjct: 195 LSRGGVYGPGVAVFGSGGPLFLLRDQPEYTQSLTYTPLVVTKK-------GSPA--YYVS 245
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
V S+ + V L K L+ GG + T PYT+L +Y+ + AF A+ K
Sbjct: 246 VKSILLENTPVRLPKKALA-----TGGAVLCTRTPYTLLRRDVYRPFLAAFEKALAKQIP 300
Query: 309 VA-----PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL 363
A PV C+ + G +VP + L ++ W++ G +SMV +
Sbjct: 301 WAKKARSPVKQLKLCYEANTLPNGLSGYLVPSVALAMEGGG-SWTMTGSSSMVDVKPGTA 359
Query: 364 CLGFVD-----GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNFT 417
CL FV+ G ++++G Q+EN +LQFDL RLGF +F + FNFT
Sbjct: 360 CLAFVEMEGVKEGDASAPAVLVGGFQMENFVLQFDLEKKRLGFFRLPVFTQCG-HFNFT 417
>gi|357126718|ref|XP_003565034.1| PREDICTED: uncharacterized protein LOC100822007 [Brachypodium
distachyon]
Length = 432
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 129/425 (30%), Positives = 189/425 (44%), Gaps = 66/425 (15%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ 91
+V+ V KDT T Y LV + G ++W C+ ++ + C S
Sbjct: 32 VVVPVTKDTQTSLYTIPFHDGATLV-----LDTAGPLVWTTCQPDHIPAA---LACTSPT 83
Query: 92 CNLANA-------KACGGGICGAG--------PDNPISNTGTHGDIRIDVLSIQSTDGRN 136
C LANA + G C A P NP++ GD+ +TDGRN
Sbjct: 84 CKLANAFPFPGCRASSSGSSCPANSHDKCTVYPCNPVTVACAPGDLSHTRFVANTTDGRN 143
Query: 137 PGRAVTVPNFIFLCGS----EFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL---KRK 189
P R V+V + C S + +L+ L G G+AGL + +ALP+Q+AA+ L K K
Sbjct: 144 PVRQVSV-KALAACISPRDDKMLLEKLPVGSAGMAGLAGTGLALPAQVAASQGLPADKAK 202
Query: 190 FALCLSPFDDG----AIVFGDGPYYDL--NNFDVSKNLKYTPLFINKVNTASGFLGEPSV 243
F LCL G AI+ GP Y L D +++L+YTPL + + + S
Sbjct: 203 FLLCLPRGSAGGPGVAILGSGGPLYLLAGQPEDYTRSLQYTPLVVTRKDHPS-------- 254
Query: 244 EYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
Y++ V S+ V+ AVP K L G + T PYT+L +Y+ AF +A+
Sbjct: 255 -YYVSVKSIAVDNAAVP-EKAL------ATGRVVLCTRTPYTLLRRDVYRPFAAAFEAAL 306
Query: 304 PK-VARVAPVAP-----FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ 357
K + R F C+ + T G +VP + L ++ W++ G NSMV
Sbjct: 307 AKQIPRAKKTKKPPVKPFTLCYEAASLANTLSGYLVPTVTLAMEGGGK-WALAGSNSMVD 365
Query: 358 IGGDALCLGFVD-----GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
+ CL FV+ G ++++G Q+EN +LQFDL RLGF L
Sbjct: 366 VKPGTACLAFVEMPGVKAGDGSAPAVIVGGFQMENFVLQFDLEKKRLGFF-RLPVSTQCS 424
Query: 413 TFNFT 417
FNFT
Sbjct: 425 RFNFT 429
>gi|116666775|pdb|2B42|A Chain A, Crystal Structure Of The Triticum Xylanse Inhibitor-I In
Complex With Bacillus Subtilis Xylanase
Length = 381
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 116/392 (29%), Positives = 173/392 (44%), Gaps = 53/392 (13%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA 95
V KD AT Y LV + + G ++W C+ G + C S C LA
Sbjct: 7 VTKDPATSLYTIPFHDGASLV-----LDVAGPLVWSTCKGGQPPAE---IPCSSPTCLLA 58
Query: 96 NA--------KACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
NA +CG C A P NP+S G + +TDG P V V
Sbjct: 59 NAYPAPGCPAPSCGSDKHDKPCTAYPYNPVSGACAAGSLSHTRFVANTTDGSKPVSKVNV 118
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
+ C +L L G G+AGL S +ALP+Q+A+A + +F LCL G +
Sbjct: 119 -GVLAACAPSKLLASLPRGSTGVAGLANSGLALPAQVASAQKVANRFLLCLPTGGPGVAI 177
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG GP + ++++ YTPL G P+ ++I S+ V VP+ +
Sbjct: 178 FGGGP---VPWPQFTQSMPYTPLVTKG--------GSPA--HYISARSIVVGDTRVPVPE 224
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK--------VARVAPVAPF 315
L+ GG +ST PY +L +Y+ L+ AF A+ V VAPF
Sbjct: 225 GALA-----TGGVMLSTRLPYVLLRPDVYRPLMDAFTKALAAQHANGAPVARAVVAVAPF 279
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD-----G 370
G C+ + +G G VP + L L + W++ G+NSMV + C+ FV+
Sbjct: 280 GVCYDTKTLGNNLGGYAVPNVQLGLDGGS-DWTMTGKNSMVDVKQGTACVAFVEMKGVAA 338
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
G ++++G Q+E+ +L FD+ RLGFS
Sbjct: 339 GDGRAPAVILGGAQMEDFVLDFDMEKKRLGFS 370
>gi|302813128|ref|XP_002988250.1| hypothetical protein SELMODRAFT_427034 [Selaginella moellendorffii]
gi|300143982|gb|EFJ10669.1| hypothetical protein SELMODRAFT_427034 [Selaginella moellendorffii]
Length = 377
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/364 (30%), Positives = 171/364 (46%), Gaps = 39/364 (10%)
Query: 60 LTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTH 119
L V L W+ C K K++ C+ + AC G H
Sbjct: 45 LAVDLNAETSWLPCGKNSSFEPGKSSTFSPLPCS---SNACSG----------------H 85
Query: 120 GDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSE-FVLQGLASG-VVGIAGLGRSKVALP 177
+L I S P T+ F G++ + G A+G VG+A L ++ +ALP
Sbjct: 86 CSTNKCLLPI-SPKTSVPAFQETLTGFTAPAGAKGTAIFGCAAGKSVGVAALSKNSLALP 144
Query: 178 SQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF 237
Q+A++FS+ RKFALCLSP ++ FGD + ++S + + P N V +
Sbjct: 145 LQIASSFSVPRKFALCLSPDSPSSLFFGDDSSIIIGGINISSLVSFVPFVSNPVFPS--- 201
Query: 238 LGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALV 296
Y++ + ++ + + L+ +L SI+ + G+GG +S+ N YT + T +Y A+
Sbjct: 202 ------RYYLDLRTIQTDFSDLKLDPSLFSINPKTGIGGLTLSSTNRYTKVPTPVYAAIA 255
Query: 297 QAF---ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQN 353
Q+F A+A A PF CF + F R+GPV P I L+ +N N+ W++ G
Sbjct: 256 QSFKKYATAFNISIVPAQNLPFDLCFNASGMNFNRLGPVFPAIQLIFRN-NIPWNLVGSR 314
Query: 354 SMVQIGGDAL-CLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD-SLLFERAT 411
+ G+A+ CL G P TS IG +NLL FDLA +R GF D SL R +
Sbjct: 315 VIEFFRGNAIGCLAIQSAGDPPATS-SIGLFHQFDNLLYFDLAQTRFGFVDRSLHILRQS 373
Query: 412 CTFN 415
C F+
Sbjct: 374 CKFS 377
>gi|115442113|ref|NP_001045336.1| Os01g0937500 [Oryza sativa Japonica Group]
gi|20160770|dbj|BAB89711.1| putative xylanase inhibitor [Oryza sativa Japonica Group]
gi|113534867|dbj|BAF07250.1| Os01g0937500 [Oryza sativa Japonica Group]
gi|125573257|gb|EAZ14772.1| hypothetical protein OsJ_04701 [Oryza sativa Japonica Group]
gi|215766348|dbj|BAG98576.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 443
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 134/457 (29%), Positives = 195/457 (42%), Gaps = 82/457 (17%)
Query: 8 LLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGN 67
LLL ++ + TAQ + K LV R+AKD T Y +K P L V L G
Sbjct: 11 LLLAAIAVQVFVRCTAQAASDQKPLVSRLAKDYNTSLYTISVKNGAP----PLVVDLAGA 66
Query: 68 ILWVDCEKGYVSSTNKTARCGSAQCNLAN---AKAC----GGGI-----------CGAGP 109
++W C ST+ T C SA C+ N + C GG C A P
Sbjct: 67 LVWSTCP-----STHSTVPCQSAACDAVNRQQPRRCRYVDGGWFWAGREPGSRCACTAHP 121
Query: 110 DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN---FIFLCGSEFVLQG--LASGVV 164
NP++ + GD+ +S +T+G + + P + C E +L L
Sbjct: 122 FNPVTGECSTGDLTTFTMSANTTNGTD----LLYPESFTAVGACAPERLLASPSLPQAAA 177
Query: 165 GIAGL-GRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLN-----NFDVS 218
G+AG G + ++LPSQLAA FALCL F FGD P Y N FD +
Sbjct: 178 GVAGFSGTTPLSLPSQLAAQRRFGSTFALCLPVF----ATFGDTPVYLPNYNPYGPFDYT 233
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA---VPLNKTLLSID---NEG 272
K L+ TP N Y++ V + V+ + VP++ ++D G
Sbjct: 234 KMLRRTPFLTNPRRNGG---------YYLPVKRISVSWRGPGDVPVSLPAGALDLNARTG 284
Query: 273 VGGTKISTVNPYTVLETSIYKALVQAFASAM-----PKVARVAPVAPFGACF------RL 321
GG +ST PY ++ T +++A +AF + + ++ARVA F C+ L
Sbjct: 285 RGGVVLSTTTPYAIMRTDVFRAFGKAFDTVVTRGTESRMARVARQKQFELCYGGAGDTML 344
Query: 322 QDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGG-----VNPR 375
R G P I L L W+I N +V+ C+G V+ G V+
Sbjct: 345 SFPMMKRTGFDAPAITLELDAGATGNWTILNGNYLVR----ETCVGVVEMGPEGMPVDGE 400
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
++V+G QLEN L+ FDL LGFS L ++ C
Sbjct: 401 PAVVLGGMQLENILMVFDLDKRTLGFSRLLEWDLTNC 437
>gi|218189700|gb|EEC72127.1| hypothetical protein OsI_05116 [Oryza sativa Indica Group]
Length = 443
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 134/457 (29%), Positives = 195/457 (42%), Gaps = 82/457 (17%)
Query: 8 LLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGN 67
LLL ++ + TAQ + K LV R+AKD T Y +K P L V L G
Sbjct: 11 LLLAAIAVQVFVRCTAQAASDQKPLVSRLAKDYNTSLYTISVKNGAP----PLVVDLAGA 66
Query: 68 ILWVDCEKGYVSSTNKTARCGSAQCNLAN---AKAC----GGGI-----------CGAGP 109
++W C ST+ T C SA C+ N + C GG C A P
Sbjct: 67 LVWSTCP-----STHSTVPCQSAACDAVNRQQPRRCRYVDGGWFWAGREPGSRCACTAHP 121
Query: 110 DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN---FIFLCGSEFVLQG--LASGVV 164
NP++ + GD+ +S +T+G + + P + C E +L L
Sbjct: 122 FNPVTGECSTGDLTTFAMSANTTNGTD----LLYPESFTAVGACAPERLLASPSLPQAAA 177
Query: 165 GIAGL-GRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLN-----NFDVS 218
G+AG G + ++LPSQLAA FALCL F FGD P Y N FD +
Sbjct: 178 GVAGFSGTTPLSLPSQLAAQRRFGSTFALCLPVF----ATFGDTPVYLPNYNPYGPFDYT 233
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA---VPLNKTLLSID---NEG 272
K L+ TP N Y++ V + V+ + VP++ ++D G
Sbjct: 234 KMLRRTPFLTNPRRNGG---------YYLPVKRISVSWRGPGDVPVSLPAGALDLNARTG 284
Query: 273 VGGTKISTVNPYTVLETSIYKALVQAFASAM-----PKVARVAPVAPFGACF------RL 321
GG +ST PY ++ T +++A +AF + + ++ARVA F C+ L
Sbjct: 285 RGGVVLSTTTPYAIMRTDVFRAFGKAFDTVVTRGTESRMARVARQKQFELCYGGAGDTML 344
Query: 322 QDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGG-----VNPR 375
R G P I L L W+I N +V+ C+G V+ G V+
Sbjct: 345 SFPMMKRTGFDAPAITLELDAGATGNWTILNGNYLVR----ETCVGVVEMGPEGMPVDGE 400
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
++V+G QLEN L+ FDL LGFS L ++ C
Sbjct: 401 PAVVLGGMQLENILMVFDLDKRTLGFSRLLEWDLTNC 437
>gi|242072051|ref|XP_002451302.1| hypothetical protein SORBIDRAFT_05g027350 [Sorghum bicolor]
gi|241937145|gb|EES10290.1| hypothetical protein SORBIDRAFT_05g027350 [Sorghum bicolor]
Length = 370
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 108/321 (33%), Positives = 152/321 (47%), Gaps = 43/321 (13%)
Query: 116 TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRS-KV 174
T T + + LS +TDG+NP V+ + + C + + +G VG+AGL S
Sbjct: 71 TCTSKNGTVTTLSANATDGQNPLFPVSF-SAVASCTPQ---AKVPAGAVGVAGLAPSTSQ 126
Query: 175 ALPSQLAAAFSLKRKFALCLSPFDD--------GAIVFGDGPYYDLNNFDVSKNLKYTPL 226
+ P+Q+A + K ALCL P D G +FG GP + D + L T
Sbjct: 127 SFPAQVARTQKVANKIALCL-PSDGKSTTGDSVGVAIFGGGPLVFPDRGDFTTMLAGT-- 183
Query: 227 FINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI----STVN 282
+ GF G P YF+ T + V V GG+ + S+
Sbjct: 184 -----ASLQGFNGSPG--YFVSATGIAVEQSRV---------GTSSAGGSSLVVALSSTT 227
Query: 283 PYTVLETSIYKALVQAFASA-----MPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQID 337
PYT L +Y V AF +A P ++RVA VAPF C+ + TR+G VPQID
Sbjct: 228 PYTSLRPDVYVPFVNAFDAAATGPNFPWMSRVAAVAPFERCYDSTKLPPTRLGYAVPQID 287
Query: 338 LVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATS 397
++LQ +S+ G NSMVQ+ G+ CLGFV + VIG QLEN+LL D+
Sbjct: 288 VMLQGGQN-YSVLGGNSMVQVNGNTACLGFVKAAAGQAPAAVIGGFQLENHLLVLDVEKK 346
Query: 398 RLGFSDSLLFERATCT-FNFT 417
+LGF+ L +C+ FNFT
Sbjct: 347 QLGFTTFLNAVGLSCSNFNFT 367
>gi|125573254|gb|EAZ14769.1| hypothetical protein OsJ_04696 [Oryza sativa Japonica Group]
Length = 389
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 117/417 (28%), Positives = 178/417 (42%), Gaps = 79/417 (18%)
Query: 22 TAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST 81
+A R +V+ V +D AT Y ++ LV V L G ++W C ++ ++
Sbjct: 28 SAAAPRRRDPVVVPVTRDPATSLYTIPVRYYDNLV-----VDLAGPLVWSTCAADHLPAS 82
Query: 82 NKTARCGSAQCNLANA------KACGGG------ICGAGPDNPISNTGTHGDIRIDVLSI 129
C C +ANA K GGG +C A P NP++ G++
Sbjct: 83 ---LSCQDPTCVVANAYRAPTCKVTGGGGDCSKNVCTAYPYNPVTGQCAAGNLAHTRFIA 139
Query: 130 QSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK 189
+TDG+NP V+V + C + +L L G G+AGL S +ALP+Q+A++ + +
Sbjct: 140 NTTDGKNPLIQVSV-KAVAACAPKRLLARLPRGATGVAGLAASGLALPAQVASSQGVAGR 198
Query: 190 FALCLSP--FDDGAIVFGDGPYYDLNNF-DVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
F LCL + G +FG GP Y D + L YTPL + N Y+
Sbjct: 199 FLLCLPRLGYGQGVAIFGGGPIYLGEGLPDFTTTLDYTPLVAKRDNPG----------YY 248
Query: 247 IGVTSVHV-NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
+ ++ + + +A P + ++ P
Sbjct: 249 VTANAIALDDARATPPERR------------------------------------ASPPA 272
Query: 306 VARVAPVAPFGACFRLQD-IGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDAL 363
R AP R +G TRIG VP + L+L KN +++ G NSMV + G
Sbjct: 273 AWRCAPPCRSANSGRTASMLGNTRIGYFVPAVRLMLAGGKN--YTMTGTNSMVDVKGGKA 330
Query: 364 CLGFVD---GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNFT 417
CL FV+ G ++++G Q+EN LLQFD RLGF+ L F + FNFT
Sbjct: 331 CLAFVEMKSGDAASSPAVILGGFQMENMLLQFDSEKKRLGFA-RLPFYTSCSNFNFT 386
>gi|223005|prf||0402194A conglutin gamma smaller subunit
Length = 154
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 65/142 (45%), Positives = 85/142 (59%), Gaps = 6/142 (4%)
Query: 273 VGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPV 332
+GG I+T +PYTVL SI++ Q FA+ MPK A+V V PFG C+ + +I
Sbjct: 10 IGGALITTTHPYTVLSHSIFEVFTQVFANNMPKQAQVKAVGPFGLCYDSR-----KISGG 64
Query: 333 VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQF 392
P +DL+L + VW I +N MVQ CLGFVDGGV+ R I +GA LE NL+ F
Sbjct: 65 APSVDLILDKNDAVWRISSENFMVQAQDGVSCLGFVDGGVHARAGIALGAHHLEENLVVF 124
Query: 393 DLATSRLGF-SDSLLFERATCT 413
DL SR+GF S+SL TC+
Sbjct: 125 DLERSRVGFNSNSLKSYGKTCS 146
>gi|357443045|ref|XP_003591800.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
gi|355480848|gb|AES62051.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
Length = 177
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 218 SKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTK 277
S L +TP N V+T S FLGE +VEY IG+ S+ V+ K V L+ TLLSI G GGTK
Sbjct: 6 SNVLIFTPFLTNPVSTTSYFLGEKTVEYIIGMKSIRVSDKNVKLSTTLLSIHKNGFGGTK 65
Query: 278 ISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQID 337
IS NPYT++ETSI KA+ AF A V+ V PVAPFG CF +DI
Sbjct: 66 ISPFNPYTIMETSINKAVSCAFVIAF-GVSDVQPVAPFGTCFATKDI------------- 111
Query: 338 LVLQNKNVVWSIHGQNSMVQIG-GDALCLGFVDGG 371
N+ V W+I G SMV IG D +CL F+D G
Sbjct: 112 ----NETVKWNIIGDKSMVSIGNNDVICLVFLDVG 142
>gi|301642667|gb|ADK87894.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642669|gb|ADK87895.1| AtV9-like protein, partial [Arabidopsis halleri]
Length = 149
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 74/157 (47%), Positives = 92/157 (58%), Gaps = 17/157 (10%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y I V S+ VNG NK LS+ EG K+STV PYT+LE+SIY +A+A A
Sbjct: 8 YVINVKSIRVNG-----NK--LSV--EGPLAAKLSTVVPYTMLESSIYAVFAEAYAKAAS 58
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALC 364
+ VAPVAPFG CF D+ F P +DL LQ++ V W I G+N MV +GG C
Sbjct: 59 EATSVAPVAPFGLCFT-SDVEF-------PAVDLALQSEMVRWRIQGKNLMVDVGGGVRC 110
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
LG VDGG + IV+G QLE +L FDL S +GF
Sbjct: 111 LGIVDGGSSRVNPIVMGGLQLEGLILDFDLGNSMMGF 147
>gi|326487133|dbj|BAJ97919.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 347
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 98/308 (31%), Positives = 142/308 (46%), Gaps = 21/308 (6%)
Query: 112 PISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGR 171
P T G LS STDG NP V+ + SG VG++GL R
Sbjct: 56 PSITTPCSGQTTTVTLSANSTDGSNPLSPVSF-------AATATCAAAPSGSVGVSGLAR 108
Query: 172 SKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKV 231
S + P+Q+++ + FALCL GA +FG GP++ D T L + V
Sbjct: 109 SGSSFPAQVSSTQKVANSFALCLPSDGVGAAIFGGGPFFLAPPAD---RPAITTLLSDGV 165
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
F G P YF+ T NG A+ + T +++ G +S+ PY L + +
Sbjct: 166 PLRQPFAGNPG--YFVSAT----NGIAI--DGTRVAVSGSGALVVGLSSTTPYAQLRSDV 217
Query: 292 YKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHG 351
Y+ + AF AM A+VA VAPF C+ + T G VPQ+D++L+ +++ G
Sbjct: 218 YRPFITAFDRAMGPSAKVAAVAPFELCYDSSKLSPTLSGYSVPQVDVMLEG-GTNFTVVG 276
Query: 352 QNSMVQIGGDALCLGFVDGGVNPRT-SIVIGARQLENNLLQFDLATSRLGFSDSLLFERA 410
NSM Q+ C FV G T ++VIG Q+EN L+ D L F+ L
Sbjct: 277 GNSMAQVNSGTACFAFVQSGSTGATPAVVIGGFQMENKLVVLDNDKKTLSFTPYLPARGF 336
Query: 411 TCT-FNFT 417
+C+ FNFT
Sbjct: 337 SCSNFNFT 344
>gi|125573249|gb|EAZ14764.1| hypothetical protein OsJ_04691 [Oryza sativa Japonica Group]
Length = 346
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 110/205 (53%), Gaps = 13/205 (6%)
Query: 215 FDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID-NEGV 273
F VS L+YTPL N NTA Y+IGV V VN VPL LS+ +G
Sbjct: 149 FQVSDRLRYTPLLKNPKNTA----------YYIGVIGVAVNSVQVPLPPGALSLSARQGT 198
Query: 274 GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVV 333
GG +ST PYT L + IY+ + AFA+A +AR PF C++ + TRIGP
Sbjct: 199 GGVAVSTATPYTALRSDIYRPVRDAFAAATAGLARAPAAGPFDLCYQKSALPPTRIGPYT 258
Query: 334 PQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFD 393
+DL+L W+I G +++V++ +A C FVD G +++IG Q+E+NL+ FD
Sbjct: 259 ASVDLMLAGGQ-NWTIVGASAVVEVSQEAACFAFVDMGAAAAPAVIIGGHQMEDNLVVFD 317
Query: 394 LATSRLGFSDSLLFERATC-TFNFT 417
L + GFS LL C F+F+
Sbjct: 318 LEKWQFGFSGLLLGTMTRCGNFDFS 342
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 55/128 (42%), Gaps = 26/128 (20%)
Query: 27 FRP--KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKT 84
RP K ++ R+AKD +T Y +K +L + L G ++W C ++T
Sbjct: 29 HRPPSKPILTRLAKDPSTSLYTASVKNGG----GQLVLDLAGPLVWSTCP-----GKHRT 79
Query: 85 ARCGSAQCNLAN----------AKACGGG-----ICGAGPDNPISNTGTHGDIRIDVLSI 129
C + C +AN A GG C A NP S +GD+ LS
Sbjct: 80 IPCCAGVCAVANRNHRPNCPYTAAGSNGGDGGRCACSATAYNPASGQCGYGDLTTVQLSA 139
Query: 130 QSTDGRNP 137
++TDG+NP
Sbjct: 140 RATDGKNP 147
>gi|301642631|gb|ADK87876.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642633|gb|ADK87877.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642635|gb|ADK87878.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642641|gb|ADK87881.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642651|gb|ADK87886.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642659|gb|ADK87890.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642661|gb|ADK87891.1| AtV9-like protein, partial [Arabidopsis halleri]
Length = 149
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/157 (46%), Positives = 92/157 (58%), Gaps = 17/157 (10%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y I V S+ VNG NK LS+ EG ++STV PYT+LE+SIY +A+A A
Sbjct: 8 YVINVKSIRVNG-----NK--LSV--EGPLAAELSTVVPYTMLESSIYAVFAEAYAKAAS 58
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALC 364
+ VAPVAPFG CF D+ F P +DL LQ++ V W I G+N MV +GG C
Sbjct: 59 EATSVAPVAPFGLCFT-SDVEF-------PAVDLALQSEMVRWRIQGKNLMVDVGGGVRC 110
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
LG VDGG + IV+G QLE +L FDL S +GF
Sbjct: 111 LGIVDGGSSRVNPIVMGGLQLEGLILDFDLGNSMMGF 147
>gi|125529030|gb|EAY77144.1| hypothetical protein OsI_05109 [Oryza sativa Indica Group]
Length = 348
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 114/384 (29%), Positives = 167/384 (43%), Gaps = 72/384 (18%)
Query: 59 KLTVHLGGNILWVDCEKGYVSSTNKTARCGSA--QCNLANAKACGGGI------------ 104
+L + LGG +LW C + + ++ C +A Q N N + G
Sbjct: 8 RLVLDLGGPLLWSTCLAAHSTVPCRSDVCAAAAVQDNPWNCSSSTDGRGSDGGGGRGLCA 67
Query: 105 CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV 164
C A P NP++ GD+ + TDG NP V P C +L L SG V
Sbjct: 68 CSAYPYNPLNGQCARGDVTTTPMLANVTDGVNPLYPVAFPVHA-ACAPGALLGSLPSGAV 126
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFDDGAIVFGDGPYYDL---NNFD-VSK 219
G+AG + ++LPSQ+AA+ ++RKFALCL GA +FG GP++ L F VS
Sbjct: 127 GVAGGSGAPLSLPSQVAASLKVERKFALCLPGGGGTGAAIFGGGPFHLLVVPEEFGMVSN 186
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID-NEGVGGTKI 278
L Y N N +++ V + VN + + L++D G GG +
Sbjct: 187 GLSYISYLRNPKNGG----------FYLDVVGIAVNHRGADVPPDSLALDAGTGHGGVML 236
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL 338
STV PYT L IY+A+++A + + +AR P PF C
Sbjct: 237 STVAPYTALRPDIYRAVIEAIDAELRLIARAPPSWPFERC-------------------- 276
Query: 339 VLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP----RTSIVIGARQLENNLLQFDL 394
+ ++ LC V+ G P +++IG QLE+NLL FDL
Sbjct: 277 ----------------LPEVNEGTLCFAIVEMGPTPAMDESPAVIIGGFQLEDNLLVFDL 320
Query: 395 ATSRLGFSDSLLFERATCT-FNFT 417
RLG + L + R TC+ FNF+
Sbjct: 321 EKGRLGSTGLLYWIRTTCSNFNFS 344
>gi|255542576|ref|XP_002512351.1| basic 7S globulin, putative [Ricinus communis]
gi|223548312|gb|EEF49803.1| basic 7S globulin, putative [Ricinus communis]
Length = 252
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 64/162 (39%), Positives = 89/162 (54%), Gaps = 11/162 (6%)
Query: 7 FLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGG 66
FLL+ + F + +QT F+P L+L KD A +V +I +RTP P+ V L G
Sbjct: 27 FLLIILGISFVVFVSDSQTLFKPNNLMLLTHKDGAANLHVARISKRTPQTPLYFAVDLNG 86
Query: 67 NILWVDCEKGYVSSTNKTARCGSAQCNLANAK-----------ACGGGICGAGPDNPISN 115
LWV+CEK YVSST + RC S QC+ AN++ C CG NP+++
Sbjct: 87 RFLWVNCEKNYVSSTYRAPRCHSTQCSRANSQYCHKCSSKARPGCHNNTCGLMSANPVTH 146
Query: 116 TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQ 157
G++ D+LSIQST G NPG V +P F+F+C +LQ
Sbjct: 147 QNAMGEVAQDMLSIQSTRGSNPGPVVMIPQFLFVCAPSRLLQ 188
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 39/64 (60%), Positives = 50/64 (78%), Gaps = 1/64 (1%)
Query: 355 MVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-T 413
++Q+G GFVDGG++PR S+V+GA QLE+NL+QFDLA SRLGFS SLL +R +C
Sbjct: 186 LLQLGIPIYVQGFVDGGLHPRASVVVGAHQLEDNLVQFDLARSRLGFSSSLLSQRTSCAN 245
Query: 414 FNFT 417
FNFT
Sbjct: 246 FNFT 249
>gi|301642637|gb|ADK87879.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642647|gb|ADK87884.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642649|gb|ADK87885.1| AtV9-like protein, partial [Arabidopsis halleri]
Length = 149
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 73/157 (46%), Positives = 92/157 (58%), Gaps = 17/157 (10%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y I V S+ VNG NK LS+ EG ++STV PYT+LE+SIY +A+A A
Sbjct: 8 YVINVKSIRVNG-----NK--LSV--EGPLAAELSTVVPYTMLESSIYAVFAEAYAIAAS 58
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALC 364
+ VAPVAPFG CF D+ F P +DL LQ++ V W I G+N MV +GG C
Sbjct: 59 EATSVAPVAPFGLCFT-SDVEF-------PAVDLALQSEMVRWRIQGKNLMVDVGGGVRC 110
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
LG VDGG + IV+G QLE +L FDL S +GF
Sbjct: 111 LGIVDGGSSRVNPIVMGGLQLEGLILDFDLGNSMMGF 147
>gi|301642645|gb|ADK87883.1| AtV9-like protein, partial [Arabidopsis halleri]
Length = 149
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 73/157 (46%), Positives = 91/157 (57%), Gaps = 17/157 (10%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y I V S+ VNG NK LS+ EG K+STV PYT+LE+SIY +A+A A
Sbjct: 8 YVINVKSIRVNG-----NK--LSV--EGPLAAKLSTVVPYTMLESSIYAVFAEAYAKAAS 58
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALC 364
+ VAPVAPFG CF D+ F P +DL LQ++ V W I G+N MV +GG C
Sbjct: 59 EATSVAPVAPFGLCFT-SDVEF-------PAVDLALQSEMVRWRIQGKNLMVDVGGGVRC 110
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
LG VDGG + IV+G QLE +L F L S +GF
Sbjct: 111 LGIVDGGSSRVNPIVMGGLQLEGLILDFHLGNSMMGF 147
>gi|301642639|gb|ADK87880.1| AtV9-like protein, partial [Arabidopsis halleri]
Length = 149
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 92/157 (58%), Gaps = 17/157 (10%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y I V S+ VNG NK LS+ EG ++STV PYT+LE+SIY +A+A A
Sbjct: 8 YIINVKSIRVNG-----NK--LSV--EGPLAAELSTVVPYTMLESSIYAVFAEAYAKAAS 58
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALC 364
+ VAPVAPFG CF D+ F P +DL LQ++ V W I G+N +V +GG C
Sbjct: 59 EATSVAPVAPFGLCFT-SDVEF-------PAVDLALQSEMVRWRIQGKNLVVDVGGGVRC 110
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
LG VDGG + IV+G QLE +L FDL S +GF
Sbjct: 111 LGIVDGGSSRVNPIVMGGLQLEGLILDFDLGNSMMGF 147
>gi|301642643|gb|ADK87882.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642653|gb|ADK87887.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642655|gb|ADK87888.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642657|gb|ADK87889.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642663|gb|ADK87892.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642665|gb|ADK87893.1| AtV9-like protein, partial [Arabidopsis halleri]
Length = 149
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 91/157 (57%), Gaps = 17/157 (10%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y I V S+ VNG NK LS+ EG ++STV PYT+LE+SIY +A+A A
Sbjct: 8 YVINVKSIRVNG-----NK--LSV--EGPLAAELSTVVPYTMLESSIYAVFAEAYAKAAS 58
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALC 364
+ VAPVAPFG CF D+ F P +DL LQ++ V W I G+N MV +GG C
Sbjct: 59 EATSVAPVAPFGLCFT-SDVEF-------PAVDLALQSEMVRWRIQGKNLMVDVGGGVRC 110
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
LG VDGG + IV+G QLE +L F L S +GF
Sbjct: 111 LGIVDGGSSRVNPIVMGGLQLEGLILDFHLGNSMMGF 147
>gi|301642671|gb|ADK87896.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642673|gb|ADK87897.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642675|gb|ADK87898.1| AtV9-like protein, partial [Arabidopsis halleri]
gi|301642677|gb|ADK87899.1| AtV9-like protein, partial [Arabidopsis halleri]
Length = 149
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 91/157 (57%), Gaps = 17/157 (10%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y I V S+ VNG NK LS+ EG ++STV PYT+LE+SIY +A+A A
Sbjct: 8 YVINVKSIRVNG-----NK--LSV--EGPLAAELSTVVPYTMLESSIYAVFAEAYAIAAS 58
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALC 364
+ VAPVAPFG CF D+ F P +DL LQ++ V W I G+N MV +GG C
Sbjct: 59 EATSVAPVAPFGLCFT-SDVEF-------PAVDLALQSEMVRWRIQGKNLMVDVGGGVRC 110
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
LG VDGG + IV+G QLE +L F L S +GF
Sbjct: 111 LGIVDGGSSRVNPIVMGGLQLEGLILDFHLGNSMMGF 147
>gi|361066165|gb|AEW07394.1| Pinus taeda anonymous locus 0_248_01 genomic sequence
gi|383173654|gb|AFG70243.1| Pinus taeda anonymous locus 0_248_01 genomic sequence
gi|383173655|gb|AFG70244.1| Pinus taeda anonymous locus 0_248_01 genomic sequence
gi|383173656|gb|AFG70245.1| Pinus taeda anonymous locus 0_248_01 genomic sequence
gi|383173657|gb|AFG70246.1| Pinus taeda anonymous locus 0_248_01 genomic sequence
gi|383173658|gb|AFG70247.1| Pinus taeda anonymous locus 0_248_01 genomic sequence
gi|383173659|gb|AFG70248.1| Pinus taeda anonymous locus 0_248_01 genomic sequence
gi|383173660|gb|AFG70249.1| Pinus taeda anonymous locus 0_248_01 genomic sequence
gi|383173661|gb|AFG70250.1| Pinus taeda anonymous locus 0_248_01 genomic sequence
gi|383173662|gb|AFG70251.1| Pinus taeda anonymous locus 0_248_01 genomic sequence
Length = 139
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/145 (42%), Positives = 86/145 (59%), Gaps = 12/145 (8%)
Query: 125 DVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAF 184
DV++ STDG+NPG VT P F F C F++QGLA G G+AGL R+++A P+QL A
Sbjct: 3 DVVAAYSTDGKNPGPKVTAPGFAFSCAPSFLMQGLAKGASGMAGLSRARLAPPTQLFGAS 62
Query: 185 SLKRKFALCLSPFDD---GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEP 241
+ RKFALCL G + FG+GPY+ L D S+ L YTPL N
Sbjct: 63 ASNRKFALCLPSTGSNTPGVLFFGNGPYFFLPGIDASQRLSYTPLLNNP---------RY 113
Query: 242 SVEYFIGVTSVHVNGKAVPLNKTLL 266
+YFIGVT++ ++GK++ ++ L
Sbjct: 114 KNQYFIGVTAIQIDGKSIAVDSARL 138
>gi|357132834|ref|XP_003568033.1| PREDICTED: uncharacterized protein LOC100837784 [Brachypodium
distachyon]
Length = 350
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/312 (30%), Positives = 143/312 (45%), Gaps = 54/312 (17%)
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
LS +TDG P V+ P +S +AGLGRS ++ P+Q+A+ +
Sbjct: 73 LSANATDGSRPLSPVSFP----------AAASCSSTGASVAGLGRSTLSFPAQVASTQKV 122
Query: 187 KRKFALCLSPFDDGAIVFG----------DGPYYDLNNFDVSKNLKYTPLFINKVNTASG 236
FALCL DG F GP++ D P ++
Sbjct: 123 SNSFALCLP--SDGKTGFSGTGFGAAIFGGGPFFLAPPAD-------RPSITTLLSAGVP 173
Query: 237 FLGEPSVE----YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
+ P+ Y++ T + V+G V TL +ST PYT L + +Y
Sbjct: 174 LVRRPATRNPAAYYVAGTGIAVDGLRVQGELTL-----------GLSTKIPYTALRSDVY 222
Query: 293 KALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQ 352
+AL+ AF AM + A+VA VAPF C+ + +R+G +VPQ+DLVL ++ V W++ G
Sbjct: 223 RALINAFDRAMGRAAKVAAVAPFELCYDSSKLSPSRLGYLVPQVDLVL-DRGVNWTVVGG 281
Query: 353 NSMVQIGGDALCLGFVD-----GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLF 407
NSM Q+ C FV+ GG ++V+G Q+EN L+ D L F+ L
Sbjct: 282 NSMAQVNSGTACFAFVEEKESFGGA---PAVVVGGFQMENKLVVLDEEKQTLSFTGYLPA 338
Query: 408 ERATCT-FNFTS 418
+C+ FNFT+
Sbjct: 339 MGFSCSNFNFTT 350
>gi|302799581|ref|XP_002981549.1| hypothetical protein SELMODRAFT_114882 [Selaginella moellendorffii]
gi|300150715|gb|EFJ17364.1| hypothetical protein SELMODRAFT_114882 [Selaginella moellendorffii]
Length = 199
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 77/208 (37%), Positives = 105/208 (50%), Gaps = 26/208 (12%)
Query: 213 NNFDVSKNLKYTPLFINKVNTASGFLGEPSVE-YFIGVTSVHVNGKAVPLNKTLLSIDNE 271
N +S L+YTPL + P+ Y I V SV VNG+ +P K
Sbjct: 3 NKRPISPLLQYTPL-----------VSPPARHSYAIRVNSVRVNGQRLPAVKP------- 44
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGP 331
+S+ PYT L T Y A+ AF + V RVAPVAPF CF +G TR+GP
Sbjct: 45 --AAWALSSTEPYTRLVTPAYVAIRDAFRNL--TVPRVAPVAPFDTCFNASGLGSTRVGP 100
Query: 332 VVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSIVIGARQLENNLL 390
VP ++L L+ N W++ G N+MV + + CL FVD G + V+G Q +NL+
Sbjct: 101 PVPPVELQLEG-NATWTLFGANTMVFLKDSTVACLAFVDAGPSSPGLSVVGTFQQMHNLV 159
Query: 391 QFDLATSRLGFSDSLLFERATCT-FNFT 417
+ DL R GF+ L F + TC+ FN T
Sbjct: 160 RLDLEKQRFGFTGILFFYQTTCSNFNTT 187
>gi|125573253|gb|EAZ14768.1| hypothetical protein OsJ_04695 [Oryza sativa Japonica Group]
Length = 374
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 74/183 (40%), Positives = 98/183 (53%), Gaps = 19/183 (10%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
YFI T + VN + V L + ++ T PYT L +Y+A+V AFA A
Sbjct: 198 YFISATKIAVNQEQVQLYTQEPLV-------VELCTRIPYTALRPDVYRAVVDAFARATA 250
Query: 305 KVARV----APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIG 359
RV P APF C+ +D+G TR+G VPQIDLVL+ KN W++ G NSM Q+
Sbjct: 251 GRKRVTPPPPPAAPFELCYDSRDLGSTRLGYAVPQIDLVLEGGKN--WTVFGGNSMAQVS 308
Query: 360 GDALCLGFV----DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT-F 414
+ CL V + G P + +IG Q+ENNL+ FD RLGFS L + TC+ F
Sbjct: 309 DNTACLAVVKVKGEKGSPPPPAAIIGGFQMENNLVVFDEEKQRLGFSGLLWGRQTTCSNF 368
Query: 415 NFT 417
NFT
Sbjct: 369 NFT 371
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/168 (30%), Positives = 75/168 (44%), Gaps = 27/168 (16%)
Query: 30 KALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
K LV + KD AT Y IK PLV + L G ++W+ C ++ + T C
Sbjct: 33 KPLVAAITKDAATSLYTVPIKDGRPLV-----LDLAGALVWMSC-----AAAHPTLECHH 82
Query: 90 AQCNLANAKACGGG----------------ICGAGPDNPISNTGTHGDIRIDVLSIQSTD 133
C A++ G C A P NP S D+ LS +TD
Sbjct: 83 HFCMHAHSYHPPGCPHNGYGRADVEDPFRCKCTAHPYNPFSGESATADLTRTRLSANATD 142
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLA 181
G+NP V+ + C + +L L +G VG+AGL R+++AL +Q+A
Sbjct: 143 GKNPLYPVSFAA-VTSCAPDSLLAKLPAGAVGVAGLARTRLALQAQVA 189
>gi|357131654|ref|XP_003567451.1| PREDICTED: basic 7S globulin 2-like [Brachypodium distachyon]
Length = 449
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 128/438 (29%), Positives = 181/438 (41%), Gaps = 86/438 (19%)
Query: 20 PITAQTSFRPKALVLRVAKDTATLQYVTQIK-QRTPLVPVKLTVHLGGNILWVDCEKGYV 78
P TA T L+ R+AKD T Y IK R+PLV V L G+++W C
Sbjct: 45 PPTAPTK---PPLISRLAKDPETSLYTISIKADRSPLV-----VDLAGSLVWSTCPPPLT 96
Query: 79 SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPI----SNTGTHGDIRIDVLSIQSTDG 134
G+ C+ C G G P NP+ S++G + +S +TDG
Sbjct: 97 PH-------GTVPCH-----KCTGA--GDEPFNPVTRECSSSGPGNILTSFPMSANATDG 142
Query: 135 RNPGRAVTVPNFIFL----CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
+ P F C +L+ + G+AG R ++LPSQLAA KF
Sbjct: 143 V---MELYPPEESFAVTGKCAPRRLLRSFPAAATGVAGFSRRPLSLPSQLAARRLFGNKF 199
Query: 191 ALCLSPFDDGAIVFGDGPYY-------DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV 243
+LCL F FGD P + ++ ++ YTPL N
Sbjct: 200 SLCLPFF----ATFGDTPVFLSTPDPRGFIDYTAPTSIPYTPLLTNAAGGG--------- 246
Query: 244 EYFIGVTSV------HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQ 297
Y+I + ++ V+ A+P L + N GG +ST Y + ++ A
Sbjct: 247 -YYIPIKAISVSWHGEVSRAAIPAGALDLDLANNH-GGVVLSTATQYGHMRRDVFDAFAA 304
Query: 298 AFAS-------AMPKVARVAPV--APFGACFRLQDIGFTRIGPV-VPQIDLVLQNKNV-V 346
AF M V RVAP PF C+R R + VP+IDL L +
Sbjct: 305 AFDDAITRGKIPMTTVERVAPAKGEPFELCYRGGFPMLKRPAVLDVPRIDLELGDGATGN 364
Query: 347 WSIHGQNSMVQIGGDALCLGFVD----------GG--VNPRTSIVIGARQLENNLLQFDL 394
W++ N MVQ + LC+G + GG V ++V+G +QLENNLL FDL
Sbjct: 365 WTLFNGNYMVQTE-NGLCVGILPMDDDAAAGRRGGMHVEGEPAVVLGGKQLENNLLVFDL 423
Query: 395 ATSRLGFSDSLLFERATC 412
+ LGFS L F + C
Sbjct: 424 EKNVLGFSMLLDFRLSGC 441
>gi|33772275|gb|AAQ54572.1| dermal glycoprotein precursor [Malus x domestica]
Length = 101
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/100 (52%), Positives = 63/100 (63%), Gaps = 2/100 (2%)
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP--KVARVAP 311
VN K V +N TLLSID EGVGGTKISTVNPYTVLE SI+KA+ F S + +
Sbjct: 1 VNEKPVAINTTLLSIDGEGVGGTKISTVNPYTVLEASIFKAVTDMFISEAKARNITQTDS 60
Query: 312 VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHG 351
PF CF +++ TR+GP VP ID V QN + W + G
Sbjct: 61 TGPFEVCFSTENVLSTRVGPSVPSIDFVFQNNSTFWRVFG 100
>gi|222619836|gb|EEE55968.1| hypothetical protein OsJ_04697 [Oryza sativa Japonica Group]
Length = 1710
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 86/263 (32%), Positives = 124/263 (47%), Gaps = 28/263 (10%)
Query: 5 QSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHL 64
Q L+L L IF + A S +ALV + KDT T + I + L+ L
Sbjct: 1303 QQILVLNGLAIFPFDYLQASPSPAIQALVAPITKDTKTGLHTLSISNKNYLL------DL 1356
Query: 65 GGNILWVDCEKGYVSSTNKTARCGSAQCNLANA--KAC--GGGICGAGPDNPISNTGTHG 120
G +LW C S ++ T C S +C A+ K+C GG C A P NP++ G
Sbjct: 1357 SGQLLWSPC-----SPSHPTVPCSSGECAAASGAHKSCNNGGRACTARPTNPVTGERAVG 1411
Query: 121 DIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQL 180
D+ + + +TDG+ VTV + C +L+ L + G AGLGR V+LP+QL
Sbjct: 1412 DLTLADIVANATDGKTLTSEVTVRGVVSSCAPGSLLRSLPAMAAGDAGLGRGGVSLPTQL 1471
Query: 181 AAAFSLKRKFALCL--SPFDDGAIVFGDGPYYDL--NNFDVSKNLKYTPLFINKVNTASG 236
+ SLKR+FA+CL + G FG GPY + FD S L YT L + N ++
Sbjct: 1472 YSKLSLKRQFAVCLPSTAAAPGVAFFGGGPYNLMPPTLFDASTVLSYTDLARSPTNPSA- 1530
Query: 237 FLGEPSVEYFIGVTSVHVNGKAV 259
Y I + + +N +AV
Sbjct: 1531 --------YSIKLRGIAMNQEAV 1545
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 101/364 (27%), Positives = 164/364 (45%), Gaps = 67/364 (18%)
Query: 60 LTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQCNLANA--KACGGGI 104
L + G +I W+ C+ + S+T K C S C + +C
Sbjct: 3 LLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNSS 62
Query: 105 CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV 164
C +T T GD ++ L+++S D V+VPNF F CG +GL +G
Sbjct: 63 CNYMVSYGDKST-TRGDFALETLTLRSDDTI----LVSVPNFAFGCGH--ANKGLFNGAA 115
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFDDGAIVFGDGPYYDLNNFDVSKN 220
G+ GLG+S + P+Q + AF + F+ CL S G + FG+ D +DV
Sbjct: 116 GLMGLGKSSIGFPAQTSVAFG--KVFSYCLPSVSSTIPSGILHFGEAAMLD---YDV--- 167
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
++TPL V+++SG PS +YF+ +T ++V + +P++ T++ +D+ GT IS
Sbjct: 168 -RFTPL----VDSSSG----PS-QYFVSMTGINVGDELLPISATVM-VDS----GTVISR 212
Query: 281 VNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVL 340
E S Y+ L AF +P + VAPF CFR+ + I P I L
Sbjct: 213 ------FEQSAYERLRDAFTQILPGLQTAVSVAPFDTCFRVSTVDDINI----PLITLHF 262
Query: 341 QNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSR 398
++ + + + + + +C F P +S V+G Q +N +D+ SR
Sbjct: 263 RD-DAELRLSPVHILYPVDDGVMCFAFA-----PSSSGRSVLGNFQQQNLRFVYDIPKSR 316
Query: 399 LGFS 402
LG S
Sbjct: 317 LGIS 320
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 103/400 (25%), Positives = 155/400 (38%), Gaps = 77/400 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--------------GYVSSTNKTARCGS 89
QY ++ TP + L G +++WV C S+T C S
Sbjct: 85 QYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARHSTTYSAIHCYS 144
Query: 90 AQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQS------------TDGRNP 137
QC L NP + T H R S T +
Sbjct: 145 PQCQLVPHPH----------PNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTST 194
Query: 138 GRAVTVPNFIFLCGSEFVLQGLA------SGVVGIAGLGRSKVALPSQLAAAFSLKRKFA 191
G+ + F CG F + G + G G+ GLGR+ ++ SQL F K +
Sbjct: 195 GKVKKLNGLSFGCG--FRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYC 252
Query: 192 L---CLSPFDDGAIVFGDGPYYDLNNFDVSKN--LKYTPLFINKVNTASGFLGEPSVEYF 246
L LSP + G N VSK + +TPL IN ++ Y+
Sbjct: 253 LMDYTLSPPPTSFLTIGGA-----QNVAVSKKGIMSFTPLLINPLS---------PTFYY 298
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
I + V+VNG +P+N ++ SID+ G GGT I + T + Y +++AF +
Sbjct: 299 IAIKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLP 358
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
+ P F C + G TR P +P++ L + V+S +N ++ G CL
Sbjct: 359 SPAEPTPGFDLCMNVS--GVTR--PALPRMSFNLAGGS-VFSPPPRNYFIETGDQIKCLA 413
Query: 367 F----VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
DGG + V+G + LL+FD SRLGF+
Sbjct: 414 VQPVSQDGGFS-----VLGNLMQQGFLLEFDRDKSRLGFT 448
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 165/378 (43%), Gaps = 54/378 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
+Y+ + +P + V G ++ WV C V S + + A C
Sbjct: 38 EYLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDN 97
Query: 91 QCNLANA--KACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
CN++ KAC +C + T+GD+ + +S+ + G +VPNF F
Sbjct: 98 LCNVSALPLKACAANVCQYQYTYG-DQSNTNGDLAFETISLNNGAGTQ-----SVPNFAF 151
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFG 205
CG++ + G +G G+ GLG+ ++L SQL+ F+ KF+ CL + + FG
Sbjct: 152 GCGTQNL--GTFAGAAGLVGLGQGPLSLNSQLSHTFA--NKFSYCLVSLNSLSASPLTFG 207
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
+ + N++YT + +N + Y++ + S+ V G+ + L ++
Sbjct: 208 --------SIAAAANIQYTSIVVNARH---------PTYYYVQLNSIEVGGQPLNLAPSV 250
Query: 266 LSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
+ID G GGT I + T+L Y A+++A+ S + CF + +
Sbjct: 251 FAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFNIAGV 310
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQ 384
P VP D+V + + + + G+N V + A L GG + +IG Q
Sbjct: 311 S----NPSVP--DMVFKFQGADFQMRGENLFVLVDTSATTLCLAMGGSQGFS--IIGNIQ 362
Query: 385 LENNLLQFDLATSRLGFS 402
+N+L+ +DL ++GF+
Sbjct: 363 QQNHLVVYDLEAKKIGFA 380
>gi|47824816|emb|CAE46331.1| xylanase inhibitor [Secale cereale]
Length = 192
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 99/198 (50%), Gaps = 29/198 (14%)
Query: 218 SKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTK 277
+++++YTPL G P+ ++I + S+ V+ V L+++ L+ GG
Sbjct: 4 TQSMQYTPLVTKG--------GSPA--HYISLKSIKVDNTGVTLSQSALA-----TGGVM 48
Query: 278 ISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPFGACFRLQDIGFTRI 329
+ST PY +L + +Y+ LV AF A+ P V PV PFG C+ + +G
Sbjct: 49 LSTRLPYALLRSDVYRPLVDAFTKALAAQPVNGAPVARAVKPVEPFGVCYDTKTLGNNLG 108
Query: 330 GPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF-----VDGGVNPRTSIVIGARQ 384
G VP + L L W++ G+NSMV + C+ F V+ G ++++G Q
Sbjct: 109 GYAVPNVLLALDGGG-DWAMTGKNSMVDVKPGTACVAFVEMKGVEAGDGRAPAVILGGAQ 167
Query: 385 LENNLLQFDLATSRLGFS 402
+E+ +L FD+ RLGF+
Sbjct: 168 MEDFVLDFDMEKKRLGFT 185
>gi|326493694|dbj|BAJ85308.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 330
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 77/239 (32%), Positives = 104/239 (43%), Gaps = 28/239 (11%)
Query: 112 PISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGR 171
P T G LS STDG NP V+ + SG VG+AGL R
Sbjct: 56 PAITTPCSGQTTTVTLSANSTDGSNPLSPVSF-------AATATCAAAPSGAVGVAGLAR 108
Query: 172 SKVALPSQLAAAFSLKRKFALCLSPFDD---------GAIVFGDGPYYDLNNFDVSKNLK 222
S + P+Q+A + FALCL P D GA +FG GP++ D
Sbjct: 109 SSASFPAQVAKTQKVANSFALCL-PSDGRTGFTGNGMGAAIFGGGPFFLAPPAD---RPS 164
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
T L + V F G P YF+ T NG AV +++ G +ST
Sbjct: 165 ITTLLSDGVPLRQPFAGNPG--YFVSAT----NGIAV--GGARVAVSGSGALVVGLSTTI 216
Query: 283 PYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQ 341
PY L +Y+ + AF AM A+VA VAPF C+ + TR G +VP +D++L+
Sbjct: 217 PYAQLRGDVYRPFISAFDRAMGSSAKVAAVAPFELCYNSSKLFLTRFGYLVPDVDVMLE 275
>gi|226508498|ref|NP_001140805.1| uncharacterized protein LOC100272880 precursor [Zea mays]
gi|194701170|gb|ACF84669.1| unknown [Zea mays]
Length = 380
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 95/170 (55%), Gaps = 16/170 (9%)
Query: 261 LNKTLLSIDNEGVGGTK------ISTVNPYTVLETSIYKALVQAFASA-----MPKVARV 309
++ T ++++ VGG + +S+ PYT L +Y V+AF +A P ++RV
Sbjct: 211 VSSTGIAVEQARVGGPRGALVVALSSTVPYTALRPDVYAPFVKAFDAAAAGPNFPWMSRV 270
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
A VAPF C+ + + +G VPQID++L+ +++ G NSMVQ+ + CLGFV
Sbjct: 271 AAVAPFDRCYDSTKLPQSLLGYAVPQIDVMLEGGQ-NFTVLGGNSMVQVNANTACLGFVQ 329
Query: 370 G-GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFT 417
G P + VIG QLEN+LL D+ +LGF+ L +C +FNFT
Sbjct: 330 APGQAP--AAVIGGFQLENHLLLLDVDKKQLGFTTFLNAIGLSCSSFNFT 377
>gi|414591869|tpg|DAA42440.1| TPA: hypothetical protein ZEAMMB73_410724 [Zea mays]
Length = 384
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 95/170 (55%), Gaps = 16/170 (9%)
Query: 261 LNKTLLSIDNEGVGGTK------ISTVNPYTVLETSIYKALVQAFASA-----MPKVARV 309
++ T ++++ VGG + +S+ PYT L +Y V+AF +A P ++RV
Sbjct: 215 VSSTGIAVEQARVGGPRGALVVALSSTVPYTALRPDVYAPFVKAFDAAAAGPNFPWMSRV 274
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
A VAPF C+ + + +G VPQID++L+ +++ G NSMVQ+ + CLGFV
Sbjct: 275 AAVAPFDRCYDSTKLPQSLLGYAVPQIDVMLEGGQ-NFTVLGGNSMVQVNANTACLGFVQ 333
Query: 370 G-GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC-TFNFT 417
G P + VIG QLEN+LL D+ +LGF+ L +C +FNFT
Sbjct: 334 APGQAP--AAVIGGFQLENHLLLLDVDKKQLGFTTFLNAIGLSCSSFNFT 381
>gi|47824818|emb|CAE46332.1| xylanase inhibitor [Secale cereale]
Length = 196
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/198 (29%), Positives = 98/198 (49%), Gaps = 29/198 (14%)
Query: 218 SKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTK 277
+++++YTPL G P+ ++I + S+ V+ V ++++ + GG
Sbjct: 4 TQSMQYTPLVTKG--------GSPA--HYISLKSIKVDNTGVTVSQSAFA-----TGGVM 48
Query: 278 ISTVNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPFGACFRLQDIGFTRI 329
+ST PY +L +Y+ LV AF A+ P V PVAPFG C+ + +G
Sbjct: 49 LSTRLPYALLRRDVYRPLVDAFTKALAAQPANGAPVARAVQPVAPFGVCYDTKTLGNNLG 108
Query: 330 GPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV-----DGGVNPRTSIVIGARQ 384
G VP + L L W++ G+NSMV + C+ FV + G ++++G Q
Sbjct: 109 GYAVPNVLLALDGGG-EWAMTGKNSMVDVRPGTACVAFVEMKGAEAGDGRAPAVILGGAQ 167
Query: 385 LENNLLQFDLATSRLGFS 402
+E+ +L FD+ RLGF+
Sbjct: 168 MEDFVLDFDMEKKRLGFT 185
>gi|326506604|dbj|BAJ91343.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 189
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/194 (31%), Positives = 93/194 (47%), Gaps = 32/194 (16%)
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
+ YTPL + G P+ +++ TS+ V VP+ L+ GG +ST
Sbjct: 1 MAYTPLVAKQ--------GSPA--HYVSGTSIRVEDTRVPVPDRALA-----TGGVMLST 45
Query: 281 VNPYTVLETSIYKALVQAFASAM--------PKVARVAPVAPFGACFRLQDIGFTRIGPV 332
PY +L +Y+ +V AF A+ P V PVAPFG C+ + +G G
Sbjct: 46 RLPYVLLRRDVYRPVVDAFTKALAAQHANGAPAARAVDPVAPFGLCYDAKTLGNNLGGYS 105
Query: 333 VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV-----DGGVNPRTSIVIGARQLEN 387
VP + L L W++ G+NSMV + C+ FV DGG ++++G Q+E+
Sbjct: 106 VPNVVLALDGGG-EWAMTGKNSMVDVKPGTACVAFVEMEAGDGGA---PAVILGGAQMED 161
Query: 388 NLLQFDLATSRLGF 401
+L FD+ RLGF
Sbjct: 162 FVLDFDMEKKRLGF 175
>gi|194707592|gb|ACF87880.1| unknown [Zea mays]
Length = 178
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 58/147 (39%), Positives = 84/147 (57%), Gaps = 10/147 (6%)
Query: 278 ISTVNPYTVLETSIYKALVQAFASA-----MPKVARVAPVAPFGACFRLQDIGFTRIGPV 332
+S+ PYT L +Y V+AF +A P ++RVA VAPF C+ + + +G
Sbjct: 32 LSSTVPYTALRPDVYAPFVKAFDAAAAGPNFPWMSRVAAVAPFDRCYDSTKLPQSLLGYA 91
Query: 333 VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG-GVNPRTSIVIGARQLENNLLQ 391
VPQID++L+ +++ G NSMVQ+ + CLGFV G P + VIG QLEN+LL
Sbjct: 92 VPQIDVMLEGGQ-NFTVLGGNSMVQVNANTACLGFVQAPGQAP--AAVIGGFQLENHLLL 148
Query: 392 FDLATSRLGFSDSLLFERATC-TFNFT 417
D+ +LGF+ L +C +FNFT
Sbjct: 149 LDVDKKQLGFTTFLNAIGLSCSSFNFT 175
>gi|217069992|gb|ACJ83356.1| unknown [Medicago truncatula]
Length = 247
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 76/251 (30%), Positives = 114/251 (45%), Gaps = 30/251 (11%)
Query: 2 SVAQSFLLLCSLLIFSLSPITAQ---TSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPV 58
SV F L+ SL +FS+S ++A T+ +P +L + KD +T + T + TP
Sbjct: 5 SVIMHFFLI-SLALFSVSSLSATQPPTTTKPHPFILPIRKDPSTNLFYTSVGIGTPRTNF 63
Query: 59 KLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGG--------GICGAGPD 110
L + L G LW DC+ Y SS+ +CGS +C C G C A
Sbjct: 64 NLAIDLAGENLWYDCDTHYNSSSYTPIQCGSTRCTDTACVGCNGPFKPGCTNNTCAASAT 123
Query: 111 NPISNTGTHGDIRIDVLSI--QSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAG 168
N ++ G + D + I Q G + + F + L GL GI G
Sbjct: 124 NSLAKFIFGGGLGEDFIFISQQKVSGLL-SSCIDIDGFSSTAEDDSPLNGLPKNTKGIFG 182
Query: 169 LGRSKVALPSQLAAAFSLKRKFALCLSPFDD--------GAIVFGDGPYYDLNNFDVSKN 220
L RS ++LP+QLA L+ KF+LCL + G+I GD P+++L+ F
Sbjct: 183 LARSNLSLPTQLALKNKLQPKFSLCLPSSNKQRFTNLLVGSIA-GD-PFHELSKF----- 235
Query: 221 LKYTPLFINKV 231
++ TPL +N V
Sbjct: 236 VQTTPLIVNPV 246
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 97/397 (24%), Positives = 168/397 (42%), Gaps = 62/397 (15%)
Query: 41 ATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYV-----------SSTNKTARC 87
A L+Y ++ TP V V L + G ++ W+ C K V SS+ C
Sbjct: 135 AGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPC 194
Query: 88 GSAQCNLANAKACGGGICG-AGPDNPISNTGTHGDIRIDVLSIQSTDGRNP----GRAVT 142
S+ C N C +G S G + +L++++ G P G V
Sbjct: 195 ASSTCT--NVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVK 252
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--- 199
+ N C ++ +GL +G G+ G+ R ++ PSQL++ ++ RKF+ C F D
Sbjct: 253 LSNITLGC-ADIDREGLPTGASGLLGMDRRPISFPSQLSSRYA--RKFSHC---FPDKIA 306
Query: 200 -----GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
G + FG+ D+ +S L+YTPL N ++ Y++G+ + V
Sbjct: 307 HLNSSGLVFFGES---DI----ISPYLRYTPLVQNPAVPSASL-----DYYYVGLVGISV 354
Query: 255 NGKAVPLNKTLLSIDN-EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
+ +PL+ ID G GGT I + +T L+ ++A+ + F + +A+V +
Sbjct: 355 DESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNS 414
Query: 314 PFGACFRLQDIGFTRIGPVVPQI--------DLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
F C+ + ++P I D+VL +++ + LCL
Sbjct: 415 GFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQT-----TLCL 469
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
F+ G P +IG Q +N +++DL RLG +
Sbjct: 470 AFLMSGDIPFN--IIGNYQQQNLWVEYDLEKLRLGIA 504
>gi|297744239|emb|CBI37209.3| unnamed protein product [Vitis vinifera]
Length = 220
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/55 (69%), Positives = 46/55 (83%)
Query: 23 AQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY 77
AQ+SFRP ALV+ V+KD++TLQYVT I QRTPLVP++L V LGG LWVDCE+ Y
Sbjct: 22 AQSSFRPHALVIPVSKDSSTLQYVTSINQRTPLVPLQLVVDLGGQFLWVDCEQNY 76
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/397 (24%), Positives = 167/397 (42%), Gaps = 62/397 (15%)
Query: 41 ATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYV-----------SSTNKTARC 87
A L+Y ++ TP V V L + G ++ W+ C K V SS+ C
Sbjct: 134 AGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPC 193
Query: 88 GSAQCNLANAKACGGGICG-AGPDNPISNTGTHGDIRIDVLSIQSTDGRNP----GRAVT 142
S+ C N C +G S G + +L++++ G P G V
Sbjct: 194 ASSTCT--NVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVK 251
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--- 199
+ N C ++ +GL +G G+ G+ R ++ PSQL++ ++ RKF+ C F D
Sbjct: 252 LSNITLGC-ADIDREGLPTGASGLLGMDRRPISFPSQLSSRYA--RKFSHC---FPDKIA 305
Query: 200 -----GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
G + FG+ D+ +S L+YTPL N ++ Y++G+ + V
Sbjct: 306 HLNSSGLVFFGES---DI----ISPYLRYTPLVQNPAVPSASL-----DYYYVGLVGISV 353
Query: 255 NGKAVPLNKTLLSIDN-EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
+ +PL+ ID G GGT I + +T L+ ++A+ + F + +A+V +
Sbjct: 354 DESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNS 413
Query: 314 PFGACFRLQDIGFTRIGPVVPQI--------DLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
F C+ + ++P I D+VL +++ + LCL
Sbjct: 414 GFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQT-----TLCL 468
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
F G P +IG Q +N +++DL RLG +
Sbjct: 469 AFQMSGDIPFN--IIGNYQQQNLWVEYDLEKLRLGIA 503
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 104/401 (25%), Positives = 162/401 (40%), Gaps = 56/401 (13%)
Query: 21 ITAQTSFRPKALVLRVAK--DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC---EK 75
+ + + + P+ LV V D + +Y ++ +P L V G +++WV C E+
Sbjct: 104 VASTSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ 163
Query: 76 GYV----------SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPIS---NTGTHGDI 122
Y SS+ CGSA C + CGGG D ++ + T G++
Sbjct: 164 CYAQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGEL 223
Query: 123 RIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA 182
++ L++ T V CG GL G G+ GLG ++L QL
Sbjct: 224 ALETLTLGGT---------AVQGVAIGCGHRN--SGLFVGAAGLLGLGWGAMSLVGQLGG 272
Query: 183 AFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPS 242
A F+ CL+ G G G V + PL N N AS F
Sbjct: 273 A--AGGVFSYCLASRGAG----GAGSLVLGRTEAVPVGAVWVPLVRN--NQASSF----- 319
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
Y++G+T + V G+ +PL +L + +G GG + T T L Y AL AF A
Sbjct: 320 --YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGA 377
Query: 303 MPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA 362
M + R V+ C+ L R VP + ++ V ++ +N +V++GG
Sbjct: 378 MGALPRSPAVSLLDTCYDLSGYASVR----VPTVSFYF-DQGAVLTLPARNLLVEVGGAV 432
Query: 363 LCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGF 401
CL F P +S ++G Q E + D A +GF
Sbjct: 433 FCLAFA-----PSSSGISILGNIQQEGIQITVDSANGYVGF 468
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 157/376 (41%), Gaps = 58/376 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGS 89
YVT++ TP P + V G ++ W+ C V SS+ C S
Sbjct: 116 NYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSS 175
Query: 90 AQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
QC+ + +C P N ++GD V + S D + G A +VPNF +
Sbjct: 176 PQCDGLSTATLNPAVC--SPSNVCIYQASYGDSSFSVGYL-SKDTVSFG-ANSVPNFYYG 231
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFDDGAIVFGDGP 208
CG + +GL G+ GL R+K++L QLA +L F+ CL S G + G
Sbjct: 232 CGQDN--EGLFGRSAGLMGLARNKLSLLYQLAP--TLGYSFSYCLPSTSSSGYLSIGS-- 285
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT-LLS 267
Y+ + YTP+ N ++ + YFI ++ + V GK + ++ + S
Sbjct: 286 -YNPGGY------SYTPMVSNTLDDS---------LYFISLSGMTVAGKPLAVSSSEYTS 329
Query: 268 IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP-KVARVAPVAPFGACFRLQDIGF 326
+ GT I T L TS+Y AL +A A+AM R A + CF Q
Sbjct: 330 LPTIIDSGTVI------TRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKL 383
Query: 327 TRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLE 386
VP + + + N +V + G CL F R++ +IG Q +
Sbjct: 384 R----AVPAVSMAFSG-GATLKLSAGNLLVDVDGATTCLAFAPA----RSAAIIGNTQQQ 434
Query: 387 NNLLQFDLATSRLGFS 402
+ +D+ ++R+GF+
Sbjct: 435 TFSVVYDVKSNRIGFA 450
>gi|255640308|gb|ACU20443.1| unknown [Glycine max]
Length = 247
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 102/217 (47%), Gaps = 31/217 (14%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST------NKT 84
+L L V KD +T QY+T + TP+ K + LGG++LW DC S+ +++
Sbjct: 27 SLTLPVTKDHSTHQYLTILSYGTPVESAKFVLDLGGSLLWADCASRTTPSSTLAPIFHRS 86
Query: 85 ARCGSAQ----------CNLANAKACGGGICGAGPDNPISNTG-THGDIRIDVLSIQSTD 133
RC +A+ +LAN C +N I+ T G++ D++ +S +
Sbjct: 87 IRCLTAKGPEIETHRWLSSLANPID-QDQPCQITAENSITGKRVTEGELVEDLVIHRSHE 145
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+F C F+L GLA+ GI GL +S+++ SQ+ + ++RK LC
Sbjct: 146 ------------LLFTCSPTFLLNGLATDAKGIIGLDKSRISFSSQVFHSLKIQRKITLC 193
Query: 194 LSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINK 230
LS G I FG + ++ + L +TPL N+
Sbjct: 194 LS-HTSGVIQFGKMTHKSQTESEIFRYLTFTPLVANQ 229
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 104/401 (25%), Positives = 161/401 (40%), Gaps = 56/401 (13%)
Query: 21 ITAQTSFRPKALVLRVAK--DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC---EK 75
+ + + + P+ LV V D + +Y ++ +P L V G +++WV C E+
Sbjct: 104 VASTSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ 163
Query: 76 GYV----------SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPIS---NTGTHGDI 122
Y SS+ CGSA C + CGGG D ++ + T G++
Sbjct: 164 CYAQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGEL 223
Query: 123 RIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA 182
++ L++ T V CG GL G G+ GLG ++L QL
Sbjct: 224 ALETLTLGGT---------AVQGVAIGCGHRN--SGLFVGAAGLLGLGWGAMSLIGQLGG 272
Query: 183 AFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPS 242
A F+ CL+ G G G V + PL N N AS F
Sbjct: 273 A--AGGVFSYCLASRGAG----GAGSLVLGRTEAVPVGAVWVPLVRN--NQASSF----- 319
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
Y++G+T + V G+ +PL L + +G GG + T T L Y AL AF A
Sbjct: 320 --YYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGA 377
Query: 303 MPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA 362
M + R V+ C+ L R VP + ++ V ++ +N +V++GG
Sbjct: 378 MGALPRSPAVSLLDTCYDLSGYASVR----VPTVSFYF-DQGAVLTLPARNLLVEVGGAV 432
Query: 363 LCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGF 401
CL F P +S ++G Q E + D A +GF
Sbjct: 433 FCLAFA-----PSSSGISILGNIQQEGIQITVDSANGYVGF 468
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 102/424 (24%), Positives = 176/424 (41%), Gaps = 75/424 (17%)
Query: 18 LSPITAQTSFRPKALVLRV-------AKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILW 70
LS + + SFR ++ + ++ + Y T+IK +P + V G +ILW
Sbjct: 43 LSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILW 102
Query: 71 VDCE------------------KGYVSSTNKTARCGSAQCN-LANAKACGGG-------I 104
V+C SST+K C A C+ + ++ CG +
Sbjct: 103 VNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHVV 162
Query: 105 CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFV--LQGLASG 162
G G + + GD D +++ G N A +F CG L S
Sbjct: 163 YGDG-------STSDGDFVKDNITLDQVTG-NLRTAPLAQEVVFGCGKNQSGQLGQTESA 214
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLK 222
V GI G G+S ++ SQLAA S+KR F+ CL + G I F G S +K
Sbjct: 215 VDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGGGI-FAIGEVE-------SPVVK 266
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
TPL N+V+ Y + + + V+G+ + L +L S + G GGT I +
Sbjct: 267 TTPLVPNQVH------------YNVILKGMDVDGEPIDLPPSLASTN--GDGGTIIDSGT 312
Query: 283 PYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQN 342
L ++Y +L++ + + ++ V ACF + P ++L ++
Sbjct: 313 TLAYLPQNLYNSLIEKITAK--QQVKLHMVQETFACFSFT----SNTDKAFPVVNLHFED 366
Query: 343 KNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS---IVIGARQLENNLLQFDLATSRL 399
++ S++ + + + D C G+ GG+ + I++G L N L+ +DL +
Sbjct: 367 -SLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVI 425
Query: 400 GFSD 403
G++D
Sbjct: 426 GWAD 429
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 151/358 (42%), Gaps = 46/358 (12%)
Query: 60 LTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSAQCNLANAKACGGGIC 105
+ + G ++ W+ C+ VS T K C S +C+ A +C
Sbjct: 1 MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 106 GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVG 165
N T ++GD + + S D + T+P F + CG + QGL G
Sbjct: 61 ETD-SNACLYTASYGDTSFSIGYL-SQDLLTLTSSQTLPQFTYGCGQDN--QGLFGRAAG 116
Query: 166 IAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTP 225
I GL R K+++ +QL+ + F+ CL + G+ G ++ + K+TP
Sbjct: 117 IIGLARDKLSMLAQLSTKYG--HAFSYCLPTANSGSSGGGFLSIGSIS----PTSYKFTP 170
Query: 226 LFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYT 285
+ + N PS+ YF+ +T++ V+G+ + L + + GT I T
Sbjct: 171 MLTDSKN--------PSL-YFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVI------T 215
Query: 286 VLETSIYKALVQAFASAMP-KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKN 344
L S+Y AL QAF M K A+ + CF+ G + VP+I ++ Q
Sbjct: 216 RLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFK----GSLKSISAVPEIKMIFQG-G 270
Query: 345 VVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
++ + +++ CL F G +IG RQ + + +D++TSR+GF+
Sbjct: 271 ADLTLRAPSILIEADKGITCLAFA-GSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFA 327
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 156/373 (41%), Gaps = 46/373 (12%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSA 90
Y ++ TP + + G ++ W+ C+ VS T K C S
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASV 184
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
+C+ A +C N T ++GD + + S D + T+P F + C
Sbjct: 185 ECSRLKAATLNDPLCETD-SNACLYTASYGDTSFSIGYL-SQDLLTLTSSQTLPQFTYGC 242
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + QGL GI GL R K+++ +QL+ + F+ CL + G+ G
Sbjct: 243 GQDN--QGLFGRAAGIIGLARDKLSMLAQLSTKYG--HAFSYCLPTANSGSSGGGFLSIG 298
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN 270
++ + K+TP+ + N PS+ YF+ +T++ V+G+ + L + +
Sbjct: 299 SIS----PTSYKFTPMLTDSKN--------PSL-YFLRLTAITVSGRPLDLAAAMYRVPT 345
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMP-KVARVAPVAPFGACFRLQDIGFTRI 329
GT I T L S+Y AL QAF M K A+ + CF+ G +
Sbjct: 346 LIDSGTVI------TRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFK----GSLKS 395
Query: 330 GPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNL 389
VP+I ++ Q ++ + +++ CL F G +IG RQ +
Sbjct: 396 ISAVPEIKMIFQG-GADLTLRAPSILIEADKGITCLAFA-GSSGTNQIAIIGNRQQQTYN 453
Query: 390 LQFDLATSRLGFS 402
+ +D++TSR+GF+
Sbjct: 454 IAYDVSTSRIGFA 466
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 119/258 (46%), Gaps = 32/258 (12%)
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDVSK 219
S G+ G+ R ++ +Q+ KF+ C+S D G ++FGD + L
Sbjct: 193 SKTTGLMGMNRGSLSFVTQMGFP-----KFSYCISGKDASGVLLFGDATFKWLGP----- 242
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
LKYTPL K+NT + V Y + + + V K + + K + + D+ G G T +
Sbjct: 243 -LKYTPLV--KMNTPLPYFDR--VAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMVD 297
Query: 280 TVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF---GA---CFRLQDIGFTRIGPVV 333
+ +T L S+Y AL F + V + F GA CFR++ G + P V
Sbjct: 298 SGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGG---VVPAV 354
Query: 334 PQIDLVLQN-------KNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLE 386
P + +V + + +++ + G + + GD CL F + + + VIG +
Sbjct: 355 PAVTMVFEGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHHHQQ 414
Query: 387 NNLLQFDLATSRLGFSDS 404
N ++FDL SR+GF+D+
Sbjct: 415 NVWMEFDLVNSRVGFADT 432
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/309 (29%), Positives = 138/309 (44%), Gaps = 43/309 (13%)
Query: 120 GDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLAS--GVVGIAGLGRSKVALP 177
GD DV+ + ST+ + G+AV + F C QG G +GI G R ++LP
Sbjct: 190 GDFSQDVIFLNSTN--SSGQAVQFRDVAFGCAHS--PQGFLVDLGSLGIVGFNRGNLSLP 245
Query: 178 SQLAAAFSLKRKFALCL-----SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVN 232
SQL KF+ C P G I GD + SK + YTPL N V
Sbjct: 246 SQLKDRLG-GSKFSYCFPSQPWQPRATGVIFLGD------SGLSKSK-VGYTPLLDNPVT 297
Query: 233 TASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID-NEGVGGTKISTVNPYTVLETSI 291
A L Y++G+TS+ V+GK + + ++ +D + G GGT + + +T +
Sbjct: 298 PARSQL------YYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDA 351
Query: 292 YKALVQAFASAMPKVAR--VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSI 349
Y A AFA++ R V A F C+ +I P VP++ L LQN NV +
Sbjct: 352 YTAFRNAFAASNRSGLRKKVGAAAGFDDCY---NISAGSSLPGVPEVRLSLQN-NVRLEL 407
Query: 350 HGQNSMVQIGGDA----LCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFSDS 404
++ V + +CL + + I V+G Q N L+++D SR+G
Sbjct: 408 RFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVG---- 463
Query: 405 LLFERATCT 413
FERA C+
Sbjct: 464 --FERADCS 470
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 96/401 (23%), Positives = 156/401 (38%), Gaps = 57/401 (14%)
Query: 18 LSPITAQ-TSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-- 74
LSP Q T F + D + +Y ++ +P L V G +++WV C+
Sbjct: 99 LSPAAYQPTGFSGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC 158
Query: 75 -----------KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPIS---NTGTHG 120
S+T CGSA C CG G D +S + T G
Sbjct: 159 LECYAQADPLFDPATSATFSAVPCGSAVCRTLRTSGCGDS---GGCDYEVSYGDGSYTKG 215
Query: 121 DIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQL 180
+ ++ L++ T V CG +GL G G+ GLG ++L QL
Sbjct: 216 ALALETLTLGGT---------AVEGVAIGCGHRN--RGLFVGAAGLLGLGWGPMSLVGQL 264
Query: 181 AAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGE 240
A F+ CL+ G++V G + V + + PL N
Sbjct: 265 GGA--AGGAFSYCLASRGAGSLVLG-------RSEAVPEGAVWVPLVRNPQ--------A 307
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFA 300
PS Y++G++ + V + +PL + L + +G GG + T T L Y AL AF
Sbjct: 308 PSF-YYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFV 366
Query: 301 SAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
+A+ + R V+ C+ L G+T + VP + + ++ +N ++++ G
Sbjct: 367 AAVGALPRAPGVSLLDTCYDLS--GYTSV--RVPTVSFYFDGAATL-TLPARNLLLEVDG 421
Query: 361 DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
CL F P ++G Q E + D A +GF
Sbjct: 422 GIYCLAFAPSSSGPS---ILGNIQQEGIQITVDSANGYIGF 459
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 150/384 (39%), Gaps = 60/384 (15%)
Query: 35 RVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVS--------------S 80
R+ T YV + TP + G N+ W+ C+ VS S
Sbjct: 6 RIGLYIGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSS 65
Query: 81 TNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA 140
T + C SA C +++ C G C G T+GD + +T+
Sbjct: 66 TYRNISCTSAACTGLSSRGCSGSTCVYGV--------TYGDGS-STVGFLATETFTLAAG 116
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
NFIF CG QGL +G G+ GLGRS +L SQLA SL F+ CL P
Sbjct: 117 NVFNNFIFGCGQNN--QGLFTGAAGLIGLGRSPYSLNSQLAT--SLGNIFSYCL-PSTSS 171
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
A + LN + + YT + N YFI + + V G +
Sbjct: 172 ATGY-------LNIGNPLRTPGYTAMLTNS---------RAPTLYFIDLIGISVGGTRLA 215
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
L+ T+ + V GT I + T L + Y AL AF +AM + R A + C+
Sbjct: 216 LSSTVF----QSV-GTIIDSGTVITRLPPTAYGALRTAFRAAMTQYTRAAAASILDTCYD 270
Query: 321 LQDIGFTRIGPVV-PQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI- 378
F+R V P I L +V +I G I +CL F G + T I
Sbjct: 271 -----FSRTTTVTFPTIKLHYTGLDV--TIPGAGVFYVISSSQVCLAFA--GNSDSTQIG 321
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
+IG Q + +D A R+GF+
Sbjct: 322 IIGNVQQRTMEVTYDNALKRIGFA 345
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 154/377 (40%), Gaps = 63/377 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGSA 90
YVTQ+ TP + V G ++ W+ C V SST + RC ++
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCSAS 193
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ A C A N ++GD V S+ STD + G + P+F + C
Sbjct: 194 QCDELQAATLNPSACSA--SNVCIYQASYGDSSFSVGSL-STDTVSFG-STRYPSFYYGC 249
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL G+ GL R+K++L QLA SL F+ CL P GPY
Sbjct: 250 GQDN--EGLFGRSAGLIGLARNKLSLLYQLAP--SLGYSFSYCL-PTAASTGYLSIGPY- 303
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS--- 267
+ YTP+ + ++ + YFI ++ + V G + ++ + S
Sbjct: 304 -----NTGHYYSYTPMASSSLDAS---------LYFITLSGMSVGGSPLAVSPSEYSSLP 349
Query: 268 --IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
ID+ GT I T L T+++ AL +A A AM R + CF Q
Sbjct: 350 TIIDS----GTVI------TRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ 399
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQL 385
VP + + + +N ++ + CL F ++ +IG Q
Sbjct: 400 LR-----VPTVAMAFAG-GASMKLTTRNVLIDVDDSTTCLAFAPT----DSTAIIGNTQQ 449
Query: 386 ENNLLQFDLATSRLGFS 402
+ + +D+A SR+GFS
Sbjct: 450 QTFSVIYDVAQSRIGFS 466
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 98/397 (24%), Positives = 159/397 (40%), Gaps = 63/397 (15%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--------------GYVSSTNKTA 85
T + QY ++ TP + L G +++WV C S+T
Sbjct: 84 TGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPN 143
Query: 86 RCGSAQCNLA---NAKACGGGICGAGPDNPISNTGTHGD--IRIDVLSIQSTD-GRNPGR 139
C + C L C A +P ++GD S ++T + GR
Sbjct: 144 HCYDSACQLVPLPKHHRCNH----ARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGR 199
Query: 140 AVTVPNFIFLCGSEFVLQGLA------SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+ F C F + G + +G G+ GLGR ++L SQL F KF+ C
Sbjct: 200 EAKLKGIAFGCA--FRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFG--NKFSYC 255
Query: 194 L-----SPFDDGAIVFGDGPYYDLNNFDVS---KNLKYTPLFINKVNTASGFLGEPSVEY 245
L SP ++ G DV+ + +++TPL IN ++ P+ Y
Sbjct: 256 LMDHDISPSPTSYLLIGS------TQNDVAPGKRRMRFTPLHINPLS--------PTF-Y 300
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
+IG+ SV V+G +P+N ++ ++D G GGT + + T L Y ++ +
Sbjct: 301 YIGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRL 360
Query: 306 VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
+ P F C + +I R+ P++ L + V+S +N V D CL
Sbjct: 361 PSPAEPTPGFDLCVNVSEIEHPRL----PKLSFKL-GGDSVFSPPPRNYFVDTDEDVKCL 415
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ + P VIG + LL+FD +RLGFS
Sbjct: 416 A-LQAVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFS 451
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 136/308 (44%), Gaps = 39/308 (12%)
Query: 118 THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALP 177
+ GD DV+ + ST+ + +AV + F C + G +GI G R ++LP
Sbjct: 87 STGDFSQDVIFLNSTNSSS--QAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLP 144
Query: 178 SQLAAAFSLKRKFALCL-----SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVN 232
SQL KF+ C P G I GD + SK + YTPL N V
Sbjct: 145 SQLKDRLG-GSKFSYCFPSQPWQPRATGVIFLGD------SGLSKSK-VSYTPLLDNPVT 196
Query: 233 TASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID-NEGVGGTKISTVNPYTVLETSI 291
A L Y++G+TS+ V+GK + + ++ +D + G GGT + + +T +
Sbjct: 197 PARSQL------YYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDA 250
Query: 292 YKALVQAFASAMPKVAR--VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSI 349
Y A AFA++ R V A F C+ +I P VP++ L LQN NV +
Sbjct: 251 YTAFRNAFAASNRSGLRKKVGAAAGFDDCY---NISAGSSLPGVPEVRLSLQN-NVRLEL 306
Query: 350 HGQNSMVQIGGDA----LCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFSDS 404
++ V + +CL + + I V+G Q N L+++D SR+G
Sbjct: 307 RFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVG---- 362
Query: 405 LLFERATC 412
FERA C
Sbjct: 363 --FERADC 368
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 100/424 (23%), Positives = 174/424 (41%), Gaps = 75/424 (17%)
Query: 18 LSPITAQTSFRPKALVLRV-------AKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILW 70
LS + + SFR ++ + ++ + Y T+IK +P + V G +ILW
Sbjct: 40 LSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILW 99
Query: 71 VDCE------------------KGYVSSTNKTARCGSAQCN-LANAKACGGG-------I 104
V+C SST+K C C+ + ++ CG +
Sbjct: 100 VNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVV 159
Query: 105 CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFV--LQGLASG 162
G G + + GD D ++++ G N A +F CG L S
Sbjct: 160 YGDG-------STSDGDFIKDNITLEQVTG-NLRTAPLAQEVVFGCGKNQSGQLGQTDSA 211
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLK 222
V GI G G+S ++ SQLAA S KR F+ CL + G I F G S +K
Sbjct: 212 VDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGI-FAVGEVE-------SPVVK 263
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
TP+ N+V+ Y + + + V+G + L +L S + G GGT I +
Sbjct: 264 TTPIVPNQVH------------YNVILKGMDVDGDPIDLPPSLASTN--GDGGTIIDSGT 309
Query: 283 PYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQN 342
L ++Y +L++ + + ++ V ACF + P ++L ++
Sbjct: 310 TLAYLPQNLYNSLIEKITAK--QQVKLHMVQETFACFSFT----SNTDKAFPVVNLHFED 363
Query: 343 KNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS---IVIGARQLENNLLQFDLATSRL 399
++ S++ + + + D C G+ GG+ + I++G L N L+ +DL +
Sbjct: 364 -SLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVI 422
Query: 400 GFSD 403
G++D
Sbjct: 423 GWAD 426
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 149/382 (39%), Gaps = 72/382 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSA 90
YV + TP L+ G ++ W CE S++ K C S
Sbjct: 140 YVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKNVSCSSE 199
Query: 91 QCNLAN-----AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
C L A+ C C G + T G + + L+I S+D N
Sbjct: 200 FCKLIAEGNYPAQDCISNTCLYGIQ--YGSGYTIGFLATETLAIASSD--------VFKN 249
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDGAIV 203
F+F C E +G +G G+ GLGRS +ALPSQ + K F+ CL SP G +
Sbjct: 250 FLFGCSEE--SRGTFNGTTGLLGLGRSPIALPSQTTNKY--KNLFSYCLPASPSSTGHLS 305
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV--HVNGKAVPL 261
FG +VS+ K TP+ P ++ G+ +V V G+ +P+
Sbjct: 306 FG---------VEVSQAAKSTPI-------------SPKLKQLYGLNTVGISVRGRELPI 343
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
N + + T I + +T L + Y AL AF M + F C+
Sbjct: 344 NGS--------ISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDF 395
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDGGVNPRTSIVI 380
+IG + +P I + + V I M+ + G +CL F D G + +I
Sbjct: 396 SNIGNGTL--TIPGISIFFEG-GVEVEIDVSGIMIPVNGLKEVCLAFADTGSDSDFAI-F 451
Query: 381 GARQLENNLLQFDLATSRLGFS 402
G Q + + +D+A +GF+
Sbjct: 452 GNYQQKTYEVIYDVAKGMVGFA 473
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 100/424 (23%), Positives = 174/424 (41%), Gaps = 75/424 (17%)
Query: 18 LSPITAQTSFRPKALVLRV-------AKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILW 70
LS + + SFR ++ + ++ + Y T+IK +P + V G +ILW
Sbjct: 44 LSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILW 103
Query: 71 VDCE------------------KGYVSSTNKTARCGSAQCN-LANAKACGGG-------I 104
V+C SST+K C C+ + ++ CG +
Sbjct: 104 VNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVV 163
Query: 105 CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFV--LQGLASG 162
G G + + GD D ++++ G N A +F CG L S
Sbjct: 164 YGDG-------STSDGDFIKDNITLEQVTG-NLRTAPLAQEVVFGCGKNQSGQLGQTDSA 215
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLK 222
V GI G G+S ++ SQLAA S KR F+ CL + G I F G S +K
Sbjct: 216 VDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGI-FAVGEVE-------SPVVK 267
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
TP+ N+V+ Y + + + V+G + L +L S + G GGT I +
Sbjct: 268 TTPIVPNQVH------------YNVILKGMDVDGDPIDLPPSLASTN--GDGGTIIDSGT 313
Query: 283 PYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQN 342
L ++Y +L++ + + ++ V ACF + P ++L ++
Sbjct: 314 TLAYLPQNLYNSLIEKITAK--QQVKLHMVQETFACFSFT----SNTDKAFPVVNLHFED 367
Query: 343 KNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS---IVIGARQLENNLLQFDLATSRL 399
++ S++ + + + D C G+ GG+ + I++G L N L+ +DL +
Sbjct: 368 -SLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVI 426
Query: 400 GFSD 403
G++D
Sbjct: 427 GWAD 430
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 97/393 (24%), Positives = 155/393 (39%), Gaps = 57/393 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYVSSTNKTARCGSAQCNLANAKACG 101
QY I+ +P + L G ++ WV C K S + + + C
Sbjct: 82 QYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCF 141
Query: 102 GGICGAGPD---NPISNTGTHGDIRIDVL------------SIQSTDGRNPGRAVTVPNF 146
+C P NP ++T H R + + +T + GR + + +
Sbjct: 142 SSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLKSI 201
Query: 147 IFLCGSEFVLQGLA------SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-----S 195
F CG F G + +G G+ GLGR ++ SQL F R F+ CL S
Sbjct: 202 AFGCG--FHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFG--RSFSYCLLDYTLS 257
Query: 196 PFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
P ++ GD +N + + +TPL IN E Y+I + V V+
Sbjct: 258 PPPTSYLMIGDVVSTKKDNKSM---MSFTPLLINP---------EAPTFYYISIKGVFVD 305
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
G + ++ ++ S+D G GGT I + T L Y+ ++ AF + ++ P
Sbjct: 306 GVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFK----REVKLPSPTPG 361
Query: 316 GACFR------LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
GA R + G +R P P++ L L ++ ++S +N + I CL
Sbjct: 362 GASTRSGFDLCVNVTGVSR--PRFPRLSLELGGES-LYSPPPRNYFIDISEGIKCLAIQP 418
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
VIG + LL+FD SRLGFS
Sbjct: 419 VEAESGRFSVIGNLMQQGFLLEFDRGKSRLGFS 451
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 96/377 (25%), Positives = 154/377 (40%), Gaps = 63/377 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGSA 90
YVTQ+ TP + V G ++ W+ C V SST + RC ++
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCSAS 193
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ A C A N ++GD V + STD + G + + P+F + C
Sbjct: 194 QCDELQAATLNPSACSA--SNVCIYQASYGDSSFSVGYL-STDTVSFG-STSYPSFYYGC 249
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL G+ GL R+K++L QLA SL F+ CL P GPY
Sbjct: 250 GQDN--EGLFGRSAGLIGLARNKLSLLYQLAP--SLGYSFSYCL-PTAASTGYLSIGPY- 303
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS--- 267
+ YTP+ + ++ + YFI ++ + V G + ++ + S
Sbjct: 304 -----NTGHYYSYTPMASSSLDAS---------LYFITLSGMSVGGSPLAVSPSEYSSLP 349
Query: 268 --IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
ID+ GT I T L T+++ AL +A A AM R + CF Q
Sbjct: 350 TIIDS----GTVI------TRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ 399
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQL 385
VP + + + +N ++ + CL F ++ +IG Q
Sbjct: 400 LR-----VPTVVMAFAG-GASMKLTTRNVLIDVDDSTTCLAFAPT----DSTAIIGNTQQ 449
Query: 386 ENNLLQFDLATSRLGFS 402
+ + +D+A SR+GFS
Sbjct: 450 QTFSVIYDVAQSRIGFS 466
>gi|125573250|gb|EAZ14765.1| hypothetical protein OsJ_04692 [Oryza sativa Japonica Group]
Length = 195
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 75/152 (49%), Gaps = 22/152 (14%)
Query: 59 KLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLAN----AKACG------------G 102
L V L G +LW C + +T C S+ C +AN +C G
Sbjct: 3 PLVVDLAGPLLWSTCPPAH-----RTVPCSSSVCKVANWYRSPASCPYSDGGRPGSGDRG 57
Query: 103 GICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASG 162
C A P NP+S GD+ L+ +TDG+NP V+ F C +L L SG
Sbjct: 58 CACAAYPYNPVSGQCGRGDVAAVPLAANATDGKNPLFPVSFSAFAS-CAPSGLLASLPSG 116
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
V G+AG+ R ++LPSQ+A++ ++R+FALCL
Sbjct: 117 VAGVAGMSRLPLSLPSQVASSLKVERQFALCL 148
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 100/397 (25%), Positives = 154/397 (38%), Gaps = 79/397 (19%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYV----------------------SSTNKTARCGSA 90
TP + + G +I+W C Y+ SS++K C +
Sbjct: 75 TPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSKLLGCKNP 134
Query: 91 QCNLANA-----------KACGGGICGAGPDNPISNTGTHGDIRI-DVLSIQSTDGRNPG 138
+C+ + K+C C P +GT G + + + L + S
Sbjct: 135 KCSWIHHSNINCDQDCSIKSCLNQTCP--PYMIFYGSGTTGGVALSETLHLHS------- 185
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SP 196
++ PNF+ C S F A GIAG GR +LPSQL KF+ CL
Sbjct: 186 --LSKPNFLVGC-SVFSSHQPA----GIAGFGRGLSSLPSQLGLG-----KFSYCLLSHR 233
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKN---LKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSV 252
FDD D+ D K L YTP N KV+ S F SV Y++G+ +
Sbjct: 234 FDDDTKK-SSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSF----SVYYYLGLRRI 288
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
V G V + LS +G GG I + +T + ++ L F + RV +
Sbjct: 289 TVGGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEI 348
Query: 313 AP---FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV- 368
CF + D P++ L + V ++ +N +GG+ CL V
Sbjct: 349 EDAIGLRPCFNVSDAKTVSF----PELRLYFKGGADV-ALPVENYFAFVGGEVACLTVVT 403
Query: 369 DGGVNPRT----SIVIGARQLENNLLQFDLATSRLGF 401
DG P +++G Q++N +++DL RLGF
Sbjct: 404 DGVAGPERVGGPGMILGNFQMQNFYVEYDLRNERLGF 440
>gi|217071718|gb|ACJ84219.1| unknown [Medicago truncatula]
Length = 241
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 63/221 (28%), Positives = 93/221 (42%), Gaps = 22/221 (9%)
Query: 28 RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARC 87
+P + +L + KD +T + T + TP L + L G LW DC Y SS+ C
Sbjct: 27 QPHSFILPIKKDPSTNLFYTSVGIGTPRTNFNLAIDLAGENLWYDCNTHYNSSSYIPIAC 86
Query: 88 GSAQCNLANAKACGG--------GICGAGPDNPISNTGTHGDIRIDVLSI--QSTDGRNP 137
GS +C+ C G C A N ++ GD+ D + I Q G
Sbjct: 87 GSERCSDVACIGCNGPFKPGCTNNTCPATATNSLAKFIFGGDLGEDFIFISQQKVSGLL- 145
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
+ + G + L GL GI GL RS ++LP+QLA L KF+LCL
Sbjct: 146 SSCIDIDRLPSFTGEDSPLNGLPKITKGIIGLSRSNLSLPTQLALKNKLPHKFSLCLPSS 205
Query: 198 DDGAIV------FGDGPYYDLNNFDVSKNLKYTPLFINKVN 232
+ G P+ +L+ F ++ TPL +N V+
Sbjct: 206 NKQGFTNLLVGSIGGDPFKELSKF-----VQTTPLIVNPVS 241
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 155/378 (41%), Gaps = 66/378 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGSA 90
YVT++ TP + V G ++ W+ C V SST T C ++
Sbjct: 134 YVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSAS 193
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ A C N ++GD V + S D + G + PNF + C
Sbjct: 194 QCDELQAATLNPSACSV--RNVCIYQASYGDSSFSVGYL-SRDTVSFGSG-SYPNFYYGC 249
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFDDGAIVFGDGPY 209
G + +GL G+ GL R+K++L QLA SL F+ CL +P G + GPY
Sbjct: 250 GQDN--EGLFGRSAGLIGLARNKLSLLYQLAP--SLGYSFSYCLPTPASTGYLSI--GPY 303
Query: 210 YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS-- 267
S + YTP+ + ++ + YF+ ++ + V G + ++ S
Sbjct: 304 -------TSGHYSYTPMASSSLDAS---------LYFVTLSGMSVGGSPLAVSPAEYSSL 347
Query: 268 ---IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
ID+ GT I T L T++Y AL +A A+AM V + CF+ Q
Sbjct: 348 PTIIDS----GTVI------TRLPTAVYTALSKAVAAAMVGVQSAPAFSILDTCFQGQAS 397
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQ 384
VP + + + QN ++ + CL F ++ +IG Q
Sbjct: 398 QLR-----VPAVAMAFAG-GATLKLATQNVLIDVDDSTTCLAFAPT----DSTTIIGNTQ 447
Query: 385 LENNLLQFDLATSRLGFS 402
+ + +D+A SR+GF+
Sbjct: 448 QQTFSVVYDVAQSRIGFA 465
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 97/386 (25%), Positives = 160/386 (41%), Gaps = 69/386 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---KGY----------VSSTNKTARCGSA 90
+Y+ ++ TP + V G ++ WV C K Y S++ CGSA
Sbjct: 12 EYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFTKLACGSA 71
Query: 91 QCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
CN C C G G + T GD D +++ DG N G+ VP
Sbjct: 72 LCNGLPFPMCNQTTCVYWYSYGDG-------SLTTGDFVYDTITM---DGIN-GQKQQVP 120
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-----SPFDD 199
NF F CG + +G +G GI GLG+ ++ SQL + ++ KF+ CL P
Sbjct: 121 NFAFGCGHD--NEGSFAGADGILGLGQGPLSFHSQLKSVYN--GKFSYCLVDWLAPPTQT 176
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
++FGD + ++KY P+ N KV T Y++ + + V
Sbjct: 177 SPLLFGDAA------VPILPDVKYLPILANPKVPT----------YYYVKLNGISVGDNL 220
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF-ASAMPKVARVAPVAPFGA 317
+ ++ T+ ID+ G GT + T L + YK ++ A AS M ++ ++
Sbjct: 221 LNISSTVFDIDSVGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLDL 280
Query: 318 CFRLQDIGFTRIG-PVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPR 375
C GF + P VP + + ++V + N + + + C +P
Sbjct: 281 CLS----GFPKDQLPTVPAMTFHFEGGDMV--LPPSNYFIYLESSQSYCFAMTS---SPD 331
Query: 376 TSIVIGARQLENNLLQFDLATSRLGF 401
+I IG+ Q +N + +D A +LGF
Sbjct: 332 VNI-IGSVQQQNFQVYYDTAGRKLGF 356
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 96/393 (24%), Positives = 156/393 (39%), Gaps = 71/393 (18%)
Query: 37 AKDTATL---QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------- 74
AKD +TL Y+ + TP + L G ++ W C+
Sbjct: 122 AKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKS 181
Query: 75 KGYVSSTNKTARCGSAQCNLANAKACGGGICGAG---PDNPISNTGTHGDIRIDVLSIQS 131
Y + + +A CGS NA +C C G D S G + D ++ S
Sbjct: 182 TSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFS----VGFLAKDKFTLTS 237
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFA 191
+D F CG QGL +GV G+ GLGR K++ PSQ A A++ + F+
Sbjct: 238 SD--------VFDGVYFGCGENN--QGLFTGVAGLLGLGRDKLSFPSQTATAYN--KIFS 285
Query: 192 LCL--SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
CL S G + FG +S+++K+TP I+ + + F G + +
Sbjct: 286 YCLPSSASYTGHLTFGSA--------GISRSVKFTP--ISTITDGTSFYG-------LNI 328
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
++ V G+ +P+ T+ S G I + T L Y AL +F + M K
Sbjct: 329 VAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTT 383
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
+ V+ CF L GF + +P++ VV + + +CL F
Sbjct: 384 SGVSILDTCFDLS--GFKTV--TIPKVAFSFSGGAVV-ELGSKGIFYAFKISQVCLAFA- 437
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
G + + + G Q + + +D A R+GF+
Sbjct: 438 GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFA 470
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 97/390 (24%), Positives = 147/390 (37%), Gaps = 60/390 (15%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYV 78
L L+ T Y+ TP L + G ++ W+ C+ +
Sbjct: 125 LPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQ 184
Query: 79 SSTNKTARCGSAQC-NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNP 137
SS+ K C S+ C L C G C N + + GD + L++ S
Sbjct: 185 SSSYKHLSCLSSACTELTTMNHCRLGGC-VYEINYGDGSRSQGDFSQETLTLGSD----- 238
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
+ P+F F CG GL G G+ GLGR+ ++ PSQ + + +F+ CL F
Sbjct: 239 ----SFPSFAFGCGHTNT--GLFKGSAGLLGLGRTALSFPSQTKSKYG--GQFSYCLPDF 290
Query: 198 ----DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
G+ G G + + PL N PS YF+G+ +
Sbjct: 291 VSSTSTGSFSVGQG--------SIPATATFVPLVSNS--------NYPSF-YFVGLNGIS 333
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
V G+ + + +L G GGT + + T L Y AL +F S + P +
Sbjct: 334 VGGERLSIPPAVL-----GRGGTIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFS 388
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQNK-NVVWSIHGQNSMVQIGGDALCLGFVDGGV 372
C+ L RI P I QN +V S G +Q G +CL F
Sbjct: 389 ILDTCYDLSSYSQVRI----PTITFHFQNNADVAVSAVGILFTIQSDGSQVCLAFASASQ 444
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ T+I IG Q + + FD R+GF+
Sbjct: 445 SISTNI-IGNFQQQRMRVAFDTGAGRIGFA 473
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 100/404 (24%), Positives = 161/404 (39%), Gaps = 71/404 (17%)
Query: 21 ITAQTSFRPKALVLRVAK--DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC---EK 75
+ + + + P+ LV V D + +Y ++ +P L V G +++WV C E+
Sbjct: 104 VASTSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ 163
Query: 76 GYV----------SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPIS---NTGTHGDI 122
Y SS+ CGSA C + CGGG D ++ + T G++
Sbjct: 164 CYAQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGEL 223
Query: 123 RIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA 182
++ L++ T V CG GL G G+ GLG ++L QL
Sbjct: 224 ALETLTLGGT---------AVQGVAIGCGHRN--SGLFVGAAGLLGLGWGAMSLVGQLGG 272
Query: 183 AFSLKRKFALCLSPFDDG---AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLG 239
A F+ CL+ G ++V G + P + AS F
Sbjct: 273 A--AGGVFSYCLASRGAGGAGSLVLG--------------RTEAVP----RGRRASSF-- 310
Query: 240 EPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF 299
Y++G+T + V G+ +PL +L + +G GG + T T L Y AL AF
Sbjct: 311 -----YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAF 365
Query: 300 ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG 359
AM + R V+ C+ L R VP + ++ V ++ +N +V++G
Sbjct: 366 DGAMGALPRSPAVSLLDTCYDLSGYASVR----VPTVSFYF-DQGAVLTLPARNLLVEVG 420
Query: 360 GDALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGF 401
G CL F P +S ++G Q E + D A +GF
Sbjct: 421 GAVFCLAFA-----PSSSGISILGNIQQEGIQITVDSANGYVGF 459
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 151/384 (39%), Gaps = 70/384 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSA 90
Y + TP L G ++ W CE S++ K C SA
Sbjct: 133 YAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSA 192
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C L + + GG C + P G I + T+ + NF+F C
Sbjct: 193 FCKLLDTE--GGESCSS-PTCLYQVQYGDGSYSIGFFA---TETLTLSSSNVFKNFLFGC 246
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDGAIVFGDGP 208
G + GL G G+ GLGR+K++LPSQ A + K+ F+ CL S G + FG
Sbjct: 247 GQQ--NSGLFRGAAGLLGLGRTKLSLPSQTAQKY--KKLFSYCLPASSSSKGYLSFGG-- 300
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
VSK +K+TPL + F P Y + +T + V G + ++ ++ S
Sbjct: 301 -------QVSKTVKFTPL-------SEDFKSTPF--YGLDITELSVGGNKLSIDASIFS- 343
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF--------R 320
GT I + T L ++ Y AL AF M + F C+ +
Sbjct: 344 ----TSGTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIK 399
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVI 380
+ +G + G V ID+ +++ ++G +CL F G + + +I
Sbjct: 400 IPKVGVSFKGGVEMDIDV----SGILYPVNGLK--------KVCLAFAGNGDDVKAAI-F 446
Query: 381 GARQLENNLLQFDLATSRLGFSDS 404
G Q + + +D A R+GF+ S
Sbjct: 447 GNTQQKTYQVVYDDAKGRVGFAPS 470
>gi|383140376|gb|AFG51471.1| Pinus taeda anonymous locus CL29Contig1_01 genomic sequence
gi|383140378|gb|AFG51472.1| Pinus taeda anonymous locus CL29Contig1_01 genomic sequence
gi|383140380|gb|AFG51473.1| Pinus taeda anonymous locus CL29Contig1_01 genomic sequence
gi|383140382|gb|AFG51474.1| Pinus taeda anonymous locus CL29Contig1_01 genomic sequence
gi|383140384|gb|AFG51475.1| Pinus taeda anonymous locus CL29Contig1_01 genomic sequence
Length = 87
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/87 (49%), Positives = 54/87 (62%), Gaps = 1/87 (1%)
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+ V G + ++ L I +G GGTK+STV PYT L T IY ++V AFA + RVA
Sbjct: 2 IDVGGVPLVIDAAKLRIGTQGRGGTKLSTVVPYTQLATPIYNSIVAAFAK-QKNLRRVAS 60
Query: 312 VAPFGACFRLQDIGFTRIGPVVPQIDL 338
VAPF ACF +G TR+GP VP IDL
Sbjct: 61 VAPFDACFNSSAVGVTRVGPAVPFIDL 87
>gi|168008086|ref|XP_001756738.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691976|gb|EDQ78335.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 174
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 93/187 (49%), Gaps = 24/187 (12%)
Query: 220 NLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTK 277
+L++TPL L P VE YF+ + +V VNG +P++ +L +++EG GG
Sbjct: 1 HLEFTPL-----------LKHPLVETFYFVNLVAVAVNGAKLPISSKVLKMNSEGNGGAI 49
Query: 278 ISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQID 337
+ +T S + LV+A + + V P F C+ + G ++P +
Sbjct: 50 LDMSTRFTRFPNSAFDHLVKALKALIRLPTMVVP--RFQLCYSTVNTGTL----IIPTVT 103
Query: 338 LVLQNKNVVWSIHGQNSMVQI--GGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLA 395
L+ +N V + +N+ V + GD +CL V G NP T+ VIG+ Q +N L+ D
Sbjct: 104 LIFEN-GVRMRLPMENTFVSVTEQGDVMCLAMVPG--NPGTATVIGSAQQQNFLIVIDRE 160
Query: 396 TSRLGFS 402
SRLGF+
Sbjct: 161 ASRLGFA 167
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 105/408 (25%), Positives = 158/408 (38%), Gaps = 94/408 (23%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------------KGYVSSTNKTARC 87
T YV + + TP + + + + WV C SST + RC
Sbjct: 97 TPSYVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRC 156
Query: 88 GSAQCNLANAKACGGGICGAGP------DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
G+ QC C AGP + +++ H + D LS+ ++G
Sbjct: 157 GAPQCAQVPPATPS---CPAGPGASCAFNLSYASSTLHAVLGQDALSLSDSNG------A 207
Query: 142 TVPNFIFLCGSEFVLQGLASGVV--GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
VP+ + G V+ G V G+ G GR ++ SQ A + F+ CL
Sbjct: 208 AVPDDHYTFGCLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYG--SIFSYCL----- 260
Query: 200 GAIVFGDGPYYDLNNFDVS---------KNLKYTPLFINKVNTASGFLGEPSVEYFIGVT 250
P Y +NF + + +K TPL N PS+ Y++ +
Sbjct: 261 --------PSYKSSNFSGTLRLGPAGQPRRIKTTPLLSNP--------HRPSL-YYVAMV 303
Query: 251 SVHVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
V VNGKAVP+ + L++D G GGT + +T L Y AL AF +
Sbjct: 304 GVRVNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRNAFRRGV-----S 358
Query: 310 APVAP----FGACFRLQDIGFTRIGPVVPQI-----DLVLQNKNVVWSIHGQNSMVQIGG 360
AP AP F C+ + T+ P V + + L +NVV S G
Sbjct: 359 APAAPALGGFDTCYYVNG---TKSVPAVAFVFAGGARVTLPEENVVIS--------STSG 407
Query: 361 DALCLGFVDG---GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSL 405
CL G GVN + V+ + Q +N+ + FD+ R+GFS L
Sbjct: 408 GVACLAMAAGPSDGVNAGLN-VLASMQQQNHRVVFDVGNGRVGFSREL 454
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 100/407 (24%), Positives = 161/407 (39%), Gaps = 71/407 (17%)
Query: 22 TAQTSFRPKALVLRVA--KDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE----- 74
TA +S +PK + L+V K T Y T ++ TP + + + G + W+ C+
Sbjct: 109 TAASSSKPKGVPLQVGWGKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDC 168
Query: 75 --------KGYVSSTNKTARCGSAQC---------NLANAKACGGGICGAGPDNPISNTG 117
SST C S +C N ++ K C I A ++
Sbjct: 169 YEQHEALFDPSKSSTYSDITCSSRECQELGSSHKHNCSSDKKCPYEITYA------DDSY 222
Query: 118 THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALP 177
T G++ D L++ TD VP F+F CG G + G+ GLGR K +L
Sbjct: 223 TVGNLARDTLTLSPTDA--------VPGFVFGCGHNNA--GSFGEIDGLLGLGRGKASLS 272
Query: 178 SQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF 237
SQ+AA + + L SP G + F N ++T + +
Sbjct: 273 SQVAARYGAGFSYCLPSSPSATGYLSFSG------AAAAAPTNAQFTEMVAGQ------- 319
Query: 238 LGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQ 297
PS Y++ +T + V G+A+ + ++ + GT I + ++ L S Y AL
Sbjct: 320 --HPSF-YYLNLTGITVAGRAIKVPPSVFAT----AAGTIIDSGTAFSCLPPSAYAALRS 372
Query: 298 AFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ 357
+ SAM + R F C+ L RI P + LV + V +H +
Sbjct: 373 SVRSAMGRYKRAPSSTIFDTCYDLTGHETVRI----PSVALVFADGATV-HLHPSGVLYT 427
Query: 358 IGG-DALCLGFVDGGVNP-RTSI-VIGARQLENNLLQFDLATSRLGF 401
CL F+ NP TS+ V+G Q + +D+ ++GF
Sbjct: 428 WSNVSQTCLAFLP---NPDDTSLGVLGNTQQRTLAVIYDVDNQKVGF 471
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 151/382 (39%), Gaps = 53/382 (13%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-----------------VSSTNKTARC 87
Y +I TP + V G +ILWV+C SST K+ C
Sbjct: 85 YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSVSC 144
Query: 88 GSAQCNLANAKA-CGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
C+ N ++ C G + T+G + DV+ + G N T
Sbjct: 145 SDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTG-NRQTGSTNGTI 203
Query: 147 IFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
IF CGS+ L + V GI G G+S + SQLA+ +KR FA CL ++G +F
Sbjct: 204 IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDN-NNGGGIF 262
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
G VS +K TP+ S Y + + ++ V + L+
Sbjct: 263 AIGEV-------VSPKVKTTPML------------SKSAHYSVNLNAIEVGNSVLELSSN 303
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
+ D+ G I + L ++Y L+ ++ P++ F CF D
Sbjct: 304 --AFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESF-TCFHYTD- 359
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR---TSIVIG 381
R V Q D K+V +++ + + Q+ D C G+ +GG+ + + ++G
Sbjct: 360 KLDRFPTVTFQFD-----KSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILG 414
Query: 382 ARQLENNLLQFDLATSRLGFSD 403
L N L+ +D+ +G+++
Sbjct: 415 DMALSNKLVVYDIENQVIGWTN 436
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 157/386 (40%), Gaps = 64/386 (16%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCG 88
T +Y+ + TP PV+LT+ G +++W C+ SST C
Sbjct: 79 TTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCD 138
Query: 89 SAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
S C +CG P+ T ++GD + ++ G +VP F
Sbjct: 139 STLCQGLPVASCGSP--KFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAF 196
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGD 206
CG F S GIAG GR ++LPSQL F+ C + + + V D
Sbjct: 197 GCGL-FNNGVFKSNETGIAGFGRGPLSLPSQLKVG-----NFSHCFTAVNGLKPSTVLLD 250
Query: 207 GPYYDLNNFDVSKN----LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
P D+ K+ ++ TPL N N P+ Y++ + + V +P+
Sbjct: 251 LPA------DLYKSGRGAVQSTPLIQNPAN--------PTF-YYLSLKGITVGSTRLPVP 295
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKVA--RVAPVAPFGAC 318
++ ++ N G GGT I + T L T +Y+ + AFA+ +P V+ P A
Sbjct: 296 ESEFTLKN-GTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAP 354
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---GGDALCLGFVDGGVNPR 375
R + P VP+ LVL + + +N + ++ G LCL ++GG
Sbjct: 355 LRAK--------PYVPK--LVLHFEGATMDLPRENYVFEVEDAGSSILCLAIIEGG---- 400
Query: 376 TSIVIGARQLENNLLQFDLATSRLGF 401
IG Q +N + +DL S+L F
Sbjct: 401 EVTTIGNFQQQNMHVLYDLQNSKLSF 426
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/280 (28%), Positives = 125/280 (44%), Gaps = 42/280 (15%)
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFD 198
+TVP+F+ C Q GIAG GR V+LPSQ+ +LKR F+ CL FD
Sbjct: 208 LTVPDFVVGCSIISTRQ-----PAGIAGFGRGPVSLPSQM----NLKR-FSHCLVSRRFD 257
Query: 199 DGAIVF------GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
D + G G N+ + L YTP N + FL Y++ + +
Sbjct: 258 DTNVTTDLDLDTGSG----HNSGSKTPGLTYTPFRKNPNVSNKAFL----EYYYLNLRRI 309
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
+V K V + L+ G GG+ + + + +T +E +++ + + FAS M R +
Sbjct: 310 YVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDL 369
Query: 313 AP---FGACFRLQDIGFTRIGPVVPQIDLVLQNK-NVVWSIHGQNSMVQIGG-DALCLGF 367
G CF + G VP+ L+ + K + N +G D +CL
Sbjct: 370 EKETGLGPCFNISGKGDV----TVPE--LIFEFKGGAKLELPLSNYFTFVGNTDTVCLTV 423
Query: 368 V-DGGVNPR----TSIVIGARQLENNLLQFDLATSRLGFS 402
V D VNP +I++G+ Q +N L+++DL R GF+
Sbjct: 424 VSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFA 463
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/280 (28%), Positives = 125/280 (44%), Gaps = 42/280 (15%)
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFD 198
+TVP+F+ C Q GIAG GR V+LPSQ+ +LKR F+ CL FD
Sbjct: 208 LTVPDFVVGCSIISTRQ-----PAGIAGFGRGPVSLPSQM----NLKR-FSHCLVSRRFD 257
Query: 199 DGAIVF------GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
D + G G N+ + L YTP N + FL Y++ + +
Sbjct: 258 DTNVTTDLDLDTGSG----HNSGSKTPGLTYTPFRKNPNVSNKAFL----EYYYLNLRRI 309
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
+V K V + L+ G GG+ + + + +T +E +++ + + FAS M R +
Sbjct: 310 YVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDL 369
Query: 313 AP---FGACFRLQDIGFTRIGPVVPQIDLVLQNK-NVVWSIHGQNSMVQIGG-DALCLGF 367
G CF + G VP+ L+ + K + N +G D +CL
Sbjct: 370 EKETGLGPCFNISGKGDV----TVPE--LIFEFKGGAKLELPLSNYFTFVGNTDTVCLTV 423
Query: 368 V-DGGVNPR----TSIVIGARQLENNLLQFDLATSRLGFS 402
V D VNP +I++G+ Q +N L+++DL R GF+
Sbjct: 424 VSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFA 463
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 68/252 (26%), Positives = 117/252 (46%), Gaps = 32/252 (12%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNLKY 223
G+ G+ R ++ +Q+ RKF+ C+S D G ++ G+ Y L K L Y
Sbjct: 194 GLMGMNRGSLSFVNQMGF-----RKFSYCISGLDSTGFLLLGEARYSWL------KPLNY 242
Query: 224 TPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNP 283
TPL +++T + V Y + + + VN K +PL K++ D+ G G T + +
Sbjct: 243 TPLV--QISTPLPYFDR--VAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGTQ 298
Query: 284 YTVLETSIYKALVQAFASAMPKVARV--APVAPF-GA---CFRLQDIGFTRIGPVVPQID 337
+T L +Y AL + F V RV P F GA C+ + T P +P +
Sbjct: 299 FTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTL--PNLPVVK 356
Query: 338 LVLQNKNVVWSIHGQNSMVQIGGD------ALCLGFVDGGVNPRTSIVIGARQLENNLLQ 391
L+ + + S+ GQ + ++ G+ C F + +S +IG Q +N ++
Sbjct: 357 LMFRGAEM--SVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFLIGHHQQQNVWME 414
Query: 392 FDLATSRLGFSD 403
+DL SR+GF++
Sbjct: 415 YDLENSRIGFAE 426
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 99/401 (24%), Positives = 157/401 (39%), Gaps = 78/401 (19%)
Query: 21 ITAQTSFRPKALVLRVAK--DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC---EK 75
+ + + + P+ LV V D + +Y ++ +P L V G +++WV C E+
Sbjct: 104 VASTSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ 163
Query: 76 GYV----------SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPIS---NTGTHGDI 122
Y SS+ CGSA C + CGGG D ++ + T G++
Sbjct: 164 CYAQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGEL 223
Query: 123 RIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA 182
++ L++ T V CG GL G G+ GLG ++L QL
Sbjct: 224 ALETLTLGGT---------AVQGVAIGCGHRN--SGLFVGAAGLLGLGWGAMSLVGQLGG 272
Query: 183 AFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPS 242
A F+ CL+ G G G + AS F
Sbjct: 273 A--AGGVFSYCLASRGAG----GAG------------------------SLASSF----- 297
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
Y++G+T + V G+ +PL +L + +G GG + T T L Y AL AF A
Sbjct: 298 --YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGA 355
Query: 303 MPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA 362
M + R V+ C+ L R VP + ++ V ++ +N +V++GG
Sbjct: 356 MGALPRSPAVSLLDTCYDLSGYASVR----VPTVSFYF-DQGAVLTLPARNLLVEVGGAV 410
Query: 363 LCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGF 401
CL F P +S ++G Q E + D A +GF
Sbjct: 411 FCLAFA-----PSSSGISILGNIQQEGIQITVDSANGYVGF 446
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 157/386 (40%), Gaps = 64/386 (16%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCG 88
T +Y+ + TP PV+LT+ G +++W C+ SST C
Sbjct: 79 TTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCD 138
Query: 89 SAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
S C +CG P+ T ++GD + ++ G +VP F
Sbjct: 139 STLCQGLPVASCGSP--KFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAF 196
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGD 206
CG F S GIAG GR ++LPSQL F+ C + + + V D
Sbjct: 197 GCGL-FNNGVFKSNETGIAGFGRGPLSLPSQLKVG-----NFSHCFTAVNGLKPSTVLLD 250
Query: 207 GPYYDLNNFDVSKN----LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
P D+ K+ ++ TPL N N P+ Y++ + + V +P+
Sbjct: 251 LPA------DLYKSGRGAVQSTPLIQNPAN--------PTF-YYLSLKGITVGSTRLPVP 295
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKVA--RVAPVAPFGAC 318
++ ++ N G GGT I + T L T +Y+ + AFA+ +P V+ P A
Sbjct: 296 ESEFALKN-GTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAP 354
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---GGDALCLGFVDGGVNPR 375
R + P VP+ LVL + + +N + ++ G LCL ++GG
Sbjct: 355 LRAK--------PYVPK--LVLHFEGATMDLPRENYVFEVEDAGSSILCLAIIEGG---- 400
Query: 376 TSIVIGARQLENNLLQFDLATSRLGF 401
IG Q +N + +DL S+L F
Sbjct: 401 EVTTIGNFQQQNMHVLYDLQNSKLSF 426
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 153/385 (39%), Gaps = 57/385 (14%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCG 88
T +Y+ ++ TP PV LT+ G +++W C SST CG
Sbjct: 81 TNEYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCG 140
Query: 89 SAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQST-----DGRNPGRAVTV 143
+A+C +CG + G +GD + V I + D G ++
Sbjct: 141 AARCRALPFTSCG--VRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHT 198
Query: 144 PNFIFLCG--SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFDDG 200
F CG ++ V Q S GIAG GR + +LPSQL F+ C S F+
Sbjct: 199 RRLTFGCGHLNKGVFQ---SNETGIAGFGRGRWSLPSQLNVT-----SFSYCFTSMFESK 250
Query: 201 A--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
+ + G P L + S ++ TP+ N +PS+ YF+ + K
Sbjct: 251 SSLVTLGGSP-AALYSHAHSGEVRTTPILKNP--------SQPSL-YFLSL-------KG 293
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
+ + KT L + T I + T L +Y+A+ FA+ + + C
Sbjct: 294 ISVGKTRLPVPETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLC 353
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALCLGFVDGGVNPRTS 377
F L R P VP + L L+ + W + N + + +G +C+ +D P
Sbjct: 354 FALPVTALWR-RPAVPSLTLHLEGAD--WELPRSNYVFEDLGARVMCI-VLDAA--PGEQ 407
Query: 378 IVIGARQLENNLLQFDLATSRLGFS 402
VIG Q +N + +DL RL F+
Sbjct: 408 TVIGNFQQQNTHVVYDLENDRLSFA 432
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 93/390 (23%), Positives = 153/390 (39%), Gaps = 65/390 (16%)
Query: 37 AKDTATL---QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------- 74
AKD +TL Y+ + TP + L G ++ W C+
Sbjct: 93 AKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKS 152
Query: 75 KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
Y + + +A CGS NA +C C G +GD V + +
Sbjct: 153 TSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ--------YGDQSFSV-GFLAKEK 203
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
+ F CG QGL +GV G+ GLGR K++ PSQ A A++ + F+ CL
Sbjct: 204 FTLTNSDVFDGVYFGCGEN--NQGLFTGVAGLLGLGRDKLSFPSQTATAYN--KIFSYCL 259
Query: 195 --SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
S G + FG +S+++K+TP I+ + + F G + + ++
Sbjct: 260 PSSASYTGHLTFGSA--------GISRSVKFTP--ISTITDGTSFYG-------LNIVAI 302
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
V G+ +P+ T+ S G I + T L Y AL +F + M K + V
Sbjct: 303 TVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGV 357
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV 372
+ CF L GF + +P++ VV + + +CL F G
Sbjct: 358 SILDTCFDLS--GFKTV--TIPKVAFSFSGGAVV-ELGSKGIFYVFKISQVCLAFA-GNS 411
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ + + G Q + + +D A R+GF+
Sbjct: 412 DDSNAAIFGNVQQQTLEVVYDGAGGRVGFA 441
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 95/396 (23%), Positives = 159/396 (40%), Gaps = 71/396 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSA 90
+Y + TP L + G ++ W+ C Y S++ K C
Sbjct: 159 EYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDP 218
Query: 91 QCNLANAKACGGGICGAGPDNPI----------------SNTGTHGDIRIDVLSIQSTDG 134
+C+L ++ PD P+ + T GD ++ ++ T
Sbjct: 219 RCSLISS-----------PDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTT 267
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
V N +F CG +GL SG G+ GLGR ++ SQL + + F+ CL
Sbjct: 268 EGGSSEYKVGNMMFGCGH--WNRGLFSGASGLLGLGRGPLSFSSQLQSLYG--HSFSYCL 323
Query: 195 SPFDDGA-----IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
+ ++FG+ DL N NL +T K N+ F Y+I +
Sbjct: 324 VDRNSNTNVSSKLIFGEDK--DLLN---HTNLNFTSFVNGKENSVETF-------YYIQI 371
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK---V 306
S+ V GKA+ + + +I ++G GGT I + + Y+ + FA M + +
Sbjct: 372 KSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPI 431
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
R PV CF + I I +P++ + + VW+ +NS + + D +CL
Sbjct: 432 FRDFPV--LDPCFNVSGIEENNIH--LPELGIAFVD-GTVWNFPAENSFIWLSEDLVCLA 486
Query: 367 FVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ G T +IG Q +N + +D SRLGF+
Sbjct: 487 IL--GTPKSTFSIIGNYQQQNFHILYDTKRSRLGFT 520
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 109/416 (26%), Positives = 171/416 (41%), Gaps = 81/416 (19%)
Query: 17 SLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG 76
+L+P A T+ R +L +A D +Y+ ++ TP + G +++W C
Sbjct: 71 TLAPGDAITAAR----ILVLASDG---EYLMEMGIGTPARFYSAILDTGSDLIWTQCAPC 123
Query: 77 YV-------------SSTNKTARCGSAQCNLANAKACGGGIC------GAGPDNPISNTG 117
+ SST ++ C + CN C C G +
Sbjct: 124 LLCVDQPTPYFDPANSSTYRSLGCSAPACNALYYPLCYQKTCVYQYFYG-------DSAS 176
Query: 118 THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALP 177
T G + + + + D R VT+P F CG+ LA+G G+ G GR ++L
Sbjct: 177 TAGVLANETFTFGTNDTR-----VTLPRISFGCGN-LNAGSLANGS-GMVGFGRGSLSLV 229
Query: 178 SQLAAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTA 234
SQL + +F+ CL+ F + FG Y LN+ + S ++ TP IN
Sbjct: 230 SQLGSP-----RFSYCLTSFLSPVRSRLYFG--AYATLNSTNAST-VQSTPFIINP---- 277
Query: 235 SGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI-DNEGVGGTKISTVNPYTVLETSIYK 293
P++ YF+ +T + V G +P++ +L+I D +G GGT I + T L Y
Sbjct: 278 ----ALPTM-YFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYLAEPAYY 332
Query: 294 ALVQAFA----SAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSI 349
A+ +AF S +P + V + CF+ R +PQ LVL W +
Sbjct: 333 AVREAFVLYLNSTLP-LLDVTETSVLDTCFQWPPP--PRQSVTLPQ--LVLHFDGADWEL 387
Query: 350 HGQNSM-VQIGGDALCLGFV---DGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
QN M V LCL DG +IG+ Q +N + +DL S L F
Sbjct: 388 PLQNYMLVDPSTGGLCLAMATSSDGS-------IIGSYQHQNFNVLYDLENSLLSF 436
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 95/380 (25%), Positives = 156/380 (41%), Gaps = 60/380 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y ++ +P L + G ++ W+ C SS+ + C +
Sbjct: 13 EYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTP 72
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC L + KAC A DN ++GD V + S D + R T P +F C
Sbjct: 73 QCKLLDVKAC------ASTDNRCLYQVSYGDGSFTVGDLAS-DSFSVSRGRTSP-VVFGC 124
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG-----AIVFG 205
G + +GL G G+ GLG K++ PSQL++ RKF+ CL D+G A++FG
Sbjct: 125 GHD--NEGLFVGAAGLLGLGAGKLSFPSQLSS-----RKFSYCLVSRDNGVRASSALLFG 177
Query: 206 DGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
D + S + YT L N K++T Y+ G++ + + G + + T
Sbjct: 178 D------SALPTSASFAYTQLLKNPKLDTF----------YYAGLSGISIGGTLLSIPST 221
Query: 265 LLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
+ + G GG I + T L T Y + AF SA K+ R A + F C+
Sbjct: 222 AFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSA 281
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG-GDALCLGFVDGGVNPRTSIVIGA 382
+ +P + + V + N +V + C F ++ +IG
Sbjct: 282 L----TSVTIPTVSFHFEGGASV-QLPPSNYLVPVDTSGTFCFAFSKTSLDLS---IIGN 333
Query: 383 RQLENNLLQFDLATSRLGFS 402
Q + + DL +SR+GF+
Sbjct: 334 IQQQTMRVAIDLDSSRVGFA 353
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 91/377 (24%), Positives = 159/377 (42%), Gaps = 59/377 (15%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGSA 90
Y ++ +P + V G ++ W+ C+ V S T K+ C S+
Sbjct: 13 YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 72
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ +C N T ++GD + S D + T+P F++ C
Sbjct: 73 QCSSLVDATLNNPLCETS-SNVCVYTASYGDSSYS-MGYLSQDLLTLAPSQTLPGFVYGC 130
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL GI GLGR+K+++ Q+++ F F+ CL G G +
Sbjct: 131 GQD--SEGLFGRAAGILGLGRNKLSMLGQVSSKFGY--AFSYCLP-------TRGGGGFL 179
Query: 211 DLNNFDVSKN-LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID 269
+ ++ + K+TP+ + G PS+ YF+ +T++ V G+A+ + +
Sbjct: 180 SIGKASLAGSAYKFTPMTTDP--------GNPSL-YFLRLTAITVGGRALGVAAAQYRVP 230
Query: 270 NEGVGGTKISTVNPYTVLETSIYKALVQAFASAM-PKVARVAPVAPFGACFR--LQDIGF 326
GT I T L S+Y QAF M K AR + CF+ L+D+
Sbjct: 231 TIIDSGTVI------TRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDM-- 282
Query: 327 TRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV-DGGVNPRTSIVIGARQL 385
VP++ L+ Q ++ N ++Q+ CL F + GV +IG Q
Sbjct: 283 ----QSVPEVRLIFQG-GADLNLRPVNVLLQVDEGLTCLAFAGNNGV-----AIIGNHQQ 332
Query: 386 ENNLLQFDLATSRLGFS 402
+ + D++T+R+GF+
Sbjct: 333 QTFKVAHDISTARIGFA 349
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 66/255 (25%), Positives = 117/255 (45%), Gaps = 32/255 (12%)
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--GAIVFGDGPYYDLNNFDVSKN 220
G+ G+ R ++ SQ+ KF+ C+S DD G ++ GD +NF
Sbjct: 201 TTGLLGMNRGSLSFISQMGFP-----KFSYCISGTDDFPGFLLLGD------SNFTWLTP 249
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
L YTPL +++T + V Y + +T + VNGK +P+ K++L D+ G G T + +
Sbjct: 250 LNYTPLI--RISTPLPYFDR--VAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDS 305
Query: 281 VNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF------GACFRLQDIGF-TRIGPVV 333
+T L +Y AL F + + V F C+R+ + + I +
Sbjct: 306 GTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRL 365
Query: 334 PQIDLVLQNKNVVWSIHGQNSMVQ-----IGGDAL-CLGFVDGGVNPRTSIVIGARQLEN 387
P + LV + + ++ GQ + + +G D++ C F + + + VIG +N
Sbjct: 366 PTVSLVFEGAEI--AVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQN 423
Query: 388 NLLQFDLATSRLGFS 402
++FDL SR+G +
Sbjct: 424 MWIEFDLQRSRIGLA 438
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 67/255 (26%), Positives = 116/255 (45%), Gaps = 32/255 (12%)
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--GAIVFGDGPYYDLNNFDVSKN 220
G+ G+ R ++ SQ+ KF+ C+S DD G ++ GD +NF
Sbjct: 201 TTGLLGMNRGSLSFISQMGFP-----KFSYCISGTDDFPGFLLLGD------SNFTWLTP 249
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
L YTPL +++T + V Y + +T + VNGK +P+ K++L D+ G G T + +
Sbjct: 250 LNYTPLI--RISTPLPYFDR--VAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDS 305
Query: 281 VNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF------GACFRLQDIGF-TRIGPVV 333
+T L +Y AL F + + V F C+R+ T I +
Sbjct: 306 GTQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRL 365
Query: 334 PQIDLVLQNKNVVWSIHGQNSMVQI-----GGDAL-CLGFVDGGVNPRTSIVIGARQLEN 387
P + LV + + ++ GQ + ++ G D++ C F + + + VIG +N
Sbjct: 366 PTVSLVFEGAEI--AVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQN 423
Query: 388 NLLQFDLATSRLGFS 402
++FDL SR+G +
Sbjct: 424 MWIEFDLQRSRIGLA 438
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 94/388 (24%), Positives = 154/388 (39%), Gaps = 73/388 (18%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKG---------YVSSTNKTARCGSAQCNLAN------- 96
TPL + + + G + W+ C+K S T C S C
Sbjct: 75 TPLQNITMVLDTGSELSWLHCKKEPNFNSIFNPLASKTYTKIPCSSPTCETRTRDLPLPV 134
Query: 97 ----AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGS 152
AK C I A + G++ + + S VT P +F C
Sbjct: 135 SCDPAKLCHFIISYA------DASSVEGNLAFETFRVGS---------VTGPATVFGCMD 179
Query: 153 EFVLQGLASG------VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFG 205
G +S G+ G+ R ++ +Q+ RKF+ C+S D G ++ G
Sbjct: 180 S----GFSSNSEEDAKTTGLMGMNRGSLSFVNQMGF-----RKFSYCISDRDSSGVLLLG 230
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
+ +F K L YTPL +++T + V Y + + + V+ K + L K++
Sbjct: 231 EA------SFSWLKPLNYTPLV--EMSTPLPYFDR--VAYSVQLEGIRVSDKVLSLPKSV 280
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV--APVAPF-GACFRLQ 322
D+ G G T + + +T L +Y AL Q F V RV P F GA
Sbjct: 281 FVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCY 340
Query: 323 DIGFTRIG-PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD------ALCLGFVDGGVNPR 375
I TR P +P ++L+ + + S+ GQ + ++ G+ C F +
Sbjct: 341 LIEPTRAALPNLPVVNLMFRGAEM--SVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGI 398
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFSD 403
S VIG Q +N +++DL SR+GF++
Sbjct: 399 ESFVIGHHQQQNVWMEYDLEKSRIGFAE 426
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 93/385 (24%), Positives = 155/385 (40%), Gaps = 49/385 (12%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSA 90
+Y + TP L + G ++ W+ C Y S++ K C
Sbjct: 161 EYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDP 220
Query: 91 QCNLANAKACGGGICGAGPDNPI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
+C+L ++ P + T GD ++ ++ T V N
Sbjct: 221 RCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVEN 280
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-----G 200
+F CG +GL SG G+ GLGR ++ SQL + + F+ CL +
Sbjct: 281 MMFGCGH--WNRGLFSGASGLLGLGRGPLSFSSQLQSLYG--HSFSYCLVDRNSDTNVSS 336
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
++FG+ DL N NL +T K N+ F Y+I + S+ V G+A+
Sbjct: 337 KLIFGEDK--DLLN---HTNLNFTSFVNGKENSVETF-------YYIQIKSILVGGEALD 384
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK---VARVAPVAPFGA 317
+ + +I +G GGT I + + Y+ + FA M + V R PV
Sbjct: 385 IPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPV--LDP 442
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS 377
CF + I I +P++ + + VW+ +NS + + D +CL + G T
Sbjct: 443 CFNVSGIEENNIH--LPELGIAFAD-GAVWNFPAENSFIWLSEDLVCLAIL--GTPKSTF 497
Query: 378 IVIGARQLENNLLQFDLATSRLGFS 402
+IG Q +N + +D SRLGF+
Sbjct: 498 SIIGNYQQQNFHILYDTKMSRLGFT 522
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 93/390 (23%), Positives = 153/390 (39%), Gaps = 65/390 (16%)
Query: 37 AKDTATL---QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------- 74
AKD +TL Y+ + TP + L G ++ W C+
Sbjct: 121 AKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKS 180
Query: 75 KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
Y + + +A CGS NA +C C G +GD V + +
Sbjct: 181 TSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ--------YGDQSFSV-GFLAKEK 231
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
+ F CG QGL +GV G+ GLGR K++ PSQ A A++ + F+ CL
Sbjct: 232 FTLTNSDVFDGVYFGCGEN--NQGLFTGVAGLLGLGRDKLSFPSQTATAYN--KIFSYCL 287
Query: 195 --SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
S G + FG +S+++K+TP I+ + + F G + + ++
Sbjct: 288 PSSASYTGHLTFGSA--------GISRSVKFTP--ISTITDGTSFYG-------LNIVAI 330
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
V G+ +P+ T+ S G I + T L Y AL +F + M K + V
Sbjct: 331 TVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGV 385
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV 372
+ CF L GF + +P++ VV + + +CL F G
Sbjct: 386 SILDTCFDLS--GFKTV--TIPKVAFSFSGGAVV-ELGSKGIFYVFKISQVCLAFA-GNS 439
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ + + G Q + + +D A R+GF+
Sbjct: 440 DDSNAAIFGNVQQQTLEVVYDGAGGRVGFA 469
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 89/382 (23%), Positives = 156/382 (40%), Gaps = 57/382 (14%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWV---DCEKGY----------VSSTNKTARCGSAQ 91
Y+ ++ TP + + G N++W+ +C+ + SST + A C S Q
Sbjct: 98 YLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLASSTYQDAPCDSYQ 157
Query: 92 CNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCG 151
C ++ +C D +G I +D +++ S+D GR +P F+CG
Sbjct: 158 CETTSSSCQSDNVCLYSCDEKHQLNCPNGRIAVDTMTLTSSD----GRPFPLPYSDFVCG 213
Query: 152 SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKR-KFALCLSPF---DDGAIVFGDG 207
+ + + A VG+ GLGR ++L S+L + L KF+ CL+ + I FG
Sbjct: 214 NS-IYKTFAG--VGVIGLGRGALSLTSKL---YHLSDGKFSYCLADYYSKQPSKINFG-- 265
Query: 208 PYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS 267
L +F +L+ V+T G S Y++ + + V K + L
Sbjct: 266 ----LQSFISDDDLEV-------VSTTLGH-HRHSGNYYVTLEGISVGEK----RQDLYY 309
Query: 268 IDN---EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA---PFGA--CF 319
+D+ VG I + +T+L Y L + A+P+ + P PF
Sbjct: 310 VDDPFAPPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTL 369
Query: 320 RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIV 379
+L + P+I + + +V + NS +++ D +C F P S V
Sbjct: 370 KLSPCFWYYPELKFPKITIHFTDADV--ELSDDNSFIRVAEDVVCFAF--AATQPGQSTV 425
Query: 380 IGARQLENNLLQFDLATSRLGF 401
G+ Q N +L +DL + F
Sbjct: 426 YGSWQQMNFILGYDLKRGTVSF 447
>gi|226427704|gb|ACO55041.1| xylanase inhibitor [Triticum aestivum]
gi|226427706|gb|ACO55042.1| xylanase inhibitor [Triticum aestivum]
Length = 134
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 65/128 (50%), Gaps = 2/128 (1%)
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
+ ++ T +++ G +ST Y L +Y+ + AF AM A+VA VAPF C
Sbjct: 8 IAIDGTRVAVSGTGALVVGLSTTISYAQLRADVYRPFITAFDRAMGSSAKVAAVAPFELC 67
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT-S 377
+ + TR G +VP +DL+L+ W++ G NSM Q+ C FV G T +
Sbjct: 68 YDSSKLAPTRFGYLVPNVDLMLEG-GTNWTVVGGNSMAQVNSGTACFAFVRSGSTDATPA 126
Query: 378 IVIGARQL 385
+VIG Q+
Sbjct: 127 LVIGGFQM 134
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 99/379 (26%), Positives = 156/379 (41%), Gaps = 65/379 (17%)
Query: 58 VKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQCNLANAKACGGGI 104
+ + + G ++ WV CE K SS+ ++ C S+ C G
Sbjct: 76 MTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGA 135
Query: 105 CGAGPD--NPISNTG----THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQG 158
CG+ P N + N G T+G++ ++ LS V+V +F+F CG +G
Sbjct: 136 CGSNPSTCNYVVNYGDGSYTNGELGVEQLSFG---------GVSVSDFVFGCGRNN--KG 184
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVS 218
L GV G+ GLGRS ++L SQ A F F+ CL + GA G N V
Sbjct: 185 LFGGVSGLMGLGRSYLSLVSQTNATFG--GVFSYCLPTTESGA----SGSLVMGNESSVF 238
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGT 276
KN+ TP+ T + L P + Y + +T + V+G A L + + G GG
Sbjct: 239 KNV--TPI------TYTRMLPNPQLSNFYILNLTGIDVDGVA-------LQVPSFGNGGV 283
Query: 277 KISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQI 336
I + T L +S+YKAL F + CF L G+ + +P I
Sbjct: 284 LIDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNL--TGYDEVS--IPTI 339
Query: 337 DLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLA 395
+ + N + G +V+ +CL T+I IG Q N + +D
Sbjct: 340 SMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAI-IGNYQQRNQRVIYDTK 398
Query: 396 TSRLGFSDSLLFERATCTF 414
S++GF++ +C+F
Sbjct: 399 QSKVGFAEE------SCSF 411
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 155/388 (39%), Gaps = 74/388 (19%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYV------------SSTNKTARC 87
TL++V + TP + G ++ W+ C G+ S+T C
Sbjct: 132 TLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPC 191
Query: 88 GSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
G QC A+ C G C G G + + G + + LS+ ST RA+
Sbjct: 192 GHPQCAAADGSKCSNGTCLYKVEYGDG-------SSSAGVLSHETLSLTST------RAL 238
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
P F F CG + G V G+ GLGR +++L SQ AA+F F+ CL +
Sbjct: 239 --PGFAFGCGQTNL--GDFGDVDGLIGLGRGQLSLSSQAAASFG--GTFSYCLPSDNTTH 292
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
GP +N DV +YT + + K + S YF+ + S+ + G +P+
Sbjct: 293 GYLTIGPTTPASNDDV----QYTAM-VQKQDYPS--------FYFVELVSIDIGGYILPV 339
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
TL + D GT + + T L Y AL F M + PF C+
Sbjct: 340 PPTLFTDD-----GTFLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDF 394
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVV-WSIHG-----QNSMVQIGGDALCLGFV-DGGVNP 374
G + I +P + + +V S G ++ IG CLGFV P
Sbjct: 395 --TGQSAI--FIPAVSFKFSDGSVFDLSFFGILIFPDDTAPAIG----CLGFVARPSAMP 446
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFS 402
T ++G Q N + +D+A ++GF+
Sbjct: 447 FT--IVGNMQQRNTEVIYDVAAEKIGFA 472
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 96/384 (25%), Positives = 153/384 (39%), Gaps = 78/384 (20%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGSA 90
YVT++ TP P + V G ++ W+ C V SS+ C +
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTP 196
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDV--LSIQSTD-GRNPGRAVTVPNFI 147
QCN + C + D I ++GD V LS + G N +VPNF
Sbjct: 197 QCNDLSTATLNPAACSSS-DVCIYQ-ASYGDSSFSVGYLSKDTVSFGSN-----SVPNFY 249
Query: 148 FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFDDGAIV 203
+ CG + +GL G+ GL R+K++L QLA +L F+ CL S
Sbjct: 250 YGCGQDN--EGLFGRSAGLMGLARNKLSLLYQLAP--TLGYSFSYCLPSSSSSGYLSIGS 305
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
+ G Y YTP+ + ++ + YFI ++ + V GK + ++
Sbjct: 306 YNPGQY------------SYTPMVSSTLDDS---------LYFIKLSGMTVAGKPLAVSS 344
Query: 264 TLLS-----IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
+ S ID+ GT I T L T++Y AL +A A AM R + C
Sbjct: 345 SEYSSLPTIIDS----GTVI------TRLPTTVYDALSKAVAGAMKGTKRADAYSILDTC 394
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI 378
F Q VP + + + QN +V + CL F R++
Sbjct: 395 FVGQASSLR-----VPAVSMAFSG-GAALKLSAQNLLVDVDSSTTCLAFAPA----RSAA 444
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
+IG Q + + +D+ ++R+GF+
Sbjct: 445 IIGNTQQQTFSVVYDVKSNRIGFA 468
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 96/400 (24%), Positives = 162/400 (40%), Gaps = 45/400 (11%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPL-VPVKLTVHLGGNILWVDCEKG 76
L P + T R A V + +Y+ TP V L V G +++W C
Sbjct: 65 LCPSRSGTPVRVTAPVASGSHVVGYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPC 124
Query: 77 YVSSTNKTARCGSAQCNLANAKACGGGICGA-GPD-----------NPISNTGTHGDIRI 124
+ T R ++ + + C IC A P N N+ T G +
Sbjct: 125 FDCFTQPLPRFDTSASDTVHGVLCTDPICRALRPHACFLGGCTYQVNYGDNSVTIGQLAK 184
Query: 125 DVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAF 184
D + DG+ G+ VTVP+ +F CG ++ S GIAG GR ++LP QL +
Sbjct: 185 DSFTF---DGKGGGK-VTVPDLVFGCG-QYNTGNFHSNETGIAGFGRGPLSLPRQLGVS- 238
Query: 185 SLKRKFALCLSP-FDDGAI-VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPS 242
F+ C + F+ + VF G D + + TP N
Sbjct: 239 ----SFSYCFTTIFESKSTPVFLGGAPADGLRAHATGPILSTPFLPNHPEY--------- 285
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS- 301
Y++ + + V + + ++ + +G GGT I + T +++++L +AF +
Sbjct: 286 --YYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAFVAQ 343
Query: 302 -AMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-G 359
+P + P CF + + P VP++ L L+ + W + +N M +
Sbjct: 344 VPLPHTSYNDTGEPTLQCFSTESVPDASKVP-VPKMTLHLEGAD--WELPRENYMAEYPD 400
Query: 360 GDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRL 399
D LC+ V G + RT +IG Q +N + DLA ++L
Sbjct: 401 SDQLCV-VVLAGDDDRT--MIGNFQQQNMHIVHDLAGNKL 437
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 156/373 (41%), Gaps = 44/373 (11%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+++ ++ +P + G +++W C+ T Q + +C
Sbjct: 110 EFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSE 169
Query: 104 ICGAGPDNPISNTG-----THGDIRID--VLSIQS-TDGRNPGRAVTVPNFIFLCGSEFV 155
+CGA P + S+ G T+GD VL+ ++ T G + +++P F CG++
Sbjct: 170 LCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNN 229
Query: 156 LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG---AIVFGDGPYYDL 212
G + G G+ GLGR ++L SQL ++KFA CL+ DD +++ G ++
Sbjct: 230 GDGFSQGA-GLVGLGRGPLSLVSQLK-----EQKFAYCLTAIDDSKPSSLLLGS--LANI 281
Query: 213 NNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEG 272
+K TPL N +PS Y++ + + V G + + K+ + ++G
Sbjct: 282 TPKTSKDEMKTTPLIKNP--------SQPSF-YYLSLQGISVGGTQLSIPKSTFELHDDG 332
Query: 273 VGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPV 332
GG I + T +E S + +L F + M + CF L G ++
Sbjct: 333 SGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLP-AGTNQV--E 389
Query: 333 VPQIDLVLQNKNVVWSIHGQNSMVQIGGDA----LCLGFVDGGVNPRTSIVIGARQLENN 388
VP+ L K + G+N M+ GD+ LCL + R + G Q +N
Sbjct: 390 VPK--LTFHFKGADLELPGENYMI---GDSKAGLLCLAI----GSSRGMSIFGNLQQQNF 440
Query: 389 LLQFDLATSRLGF 401
++ DL L F
Sbjct: 441 MVVHDLQEETLSF 453
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 100/413 (24%), Positives = 160/413 (38%), Gaps = 53/413 (12%)
Query: 17 SLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG 76
S +P+ ++F A+ L T T QY Q + TP P L G ++ WV C
Sbjct: 85 STAPMPEASAF---AMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGR 141
Query: 77 YVSSTNKTARCGSAQCNLANAKA-----CGGGIC-------------GAGPDNPI----- 113
SS + + AN+K+ C C G P P
Sbjct: 142 RASSPDASPLASPRVFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYR 201
Query: 114 --SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGR 171
+ G + D +I + G R + + C + + Q S G+ LG
Sbjct: 202 YKDKSSARGVVGTDAATI-ALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSS-DGVLSLGN 259
Query: 172 SKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKV 231
S ++ S+ AA F +F+ CL D Y + + TPL ++
Sbjct: 260 SNISFASRAAARFG--GRFSYCLV---DHLAPRNATSYLTFGPVGAAHSPSRTPLLLD-- 312
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
+ + Y + V +V V GKA LN D + GG + + T+L T
Sbjct: 313 -------AQVAPFYAVTVDAVSVAGKA--LNIPAEVWDVKKNGGAILDSGTSLTILATPA 363
Query: 292 YKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHG 351
YKA+V A + + +V RV + PF C+ + TR P VP++++ +
Sbjct: 364 YKAVVAALSKQLARVPRVT-MDPFEYCY---NWTATRRPPAVPRLEVRFAGSARL-RPPT 418
Query: 352 QNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
++ ++ C+G + GV P S VIG + +L +FDLA L F +S
Sbjct: 419 KSYVIDAAPGVKCIGLQE-GVWPGVS-VIGNILQQEHLWEFDLANRWLRFQES 469
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 156/378 (41%), Gaps = 53/378 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILW------VDCEKGYV-------SSTNKTARCGSA 90
+++ + TP + V G +++W VDC K SST T C SA
Sbjct: 73 EFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSA 132
Query: 91 QC-NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
C +L +K CG T T+GD + +T+ ++ +P +F
Sbjct: 133 SCSDLPTSKCTSASKCG--------YTYTYGDSS-STQGVLATETFTLAKS-KLPGVVFG 182
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFGD 206
CG G + G G+ GLGR ++L SQL KF+ CL+ DD ++ G
Sbjct: 183 CGDTNEGDGFSQGA-GLVGLGRGPLSLVSQLGL-----DKFSYCLTSLDDTNNSPLLLGS 236
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
+ + +++ TPL N +PS Y++ + ++ V + L +
Sbjct: 237 LAGISEASA-AASSVQTTPLIKNP--------SQPSF-YYVSLKAITVGSTRISLPSSAF 286
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGF 326
++ ++G GG + + T LE Y+AL +AFA+ M A CFR G
Sbjct: 287 AVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGV 346
Query: 327 TRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNPRTSIVIGARQ 384
++ VP+ LV + + +N MV GG ALCL + R +IG Q
Sbjct: 347 DQV--EVPR--LVFHFDGGADLDLPAENYMVLDGGSGALCLTV----MGSRGLSIIGNFQ 398
Query: 385 LENNLLQFDLATSRLGFS 402
+N +D+ L F+
Sbjct: 399 QQNFQFVYDVGHDTLSFA 416
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 97/394 (24%), Positives = 152/394 (38%), Gaps = 72/394 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDC--------------------EKGYVSSTNKT 84
Y T+I+ TP P + V G +ILWV+C KG SS+
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKG--SSSGSA 144
Query: 85 ARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNP 137
C + C A G + G P +GD D L G
Sbjct: 145 VSCDNKFC--AATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQ 202
Query: 138 GRAVTVPNFIFLCGSEF--VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS 195
R N IF CG++ L+ + GI G G+S + SQLA+A +K+ F+ CL
Sbjct: 203 TRHAKA-NVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLD 261
Query: 196 PFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
G I F G V +K TPL N + Y + + S+ V
Sbjct: 262 TIKGGGI-FAIGEV-------VQPKVKSTPLLPNMSH------------YNVNLQSIDVA 301
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
G A+ L + + GT I + T L +YK ++ A + + F
Sbjct: 302 GNALQLPPHIFETSEK--RGTIIDSGTTLTYLPELVYKDILAAVFQKHQDIT-FRTIQGF 358
Query: 316 GACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV 372
CF D GF P+I ++ ++ +++ + Q G + CLGF +GG
Sbjct: 359 -LCFEYSESVDDGF-------PKITFHFED-DLGLNVYPHDYFFQNGDNLYCLGFQNGGF 409
Query: 373 NPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
P+ + +++G L N ++ +DL +G++D
Sbjct: 410 QPKDAKDMVLLGDLVLSNKVVVYDLEKQVIGWTD 443
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 95/380 (25%), Positives = 155/380 (40%), Gaps = 60/380 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y ++ +P L + G ++ W+ C SS+ + C +
Sbjct: 13 EYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTP 72
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC L + KAC A DN ++GD V + S D R T P +F C
Sbjct: 73 QCKLLDVKAC------ASTDNRCLYQVSYGDGSFTVGDLAS-DSFLVSRGRTSP-VVFGC 124
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG-----AIVFG 205
G + +GL G G+ GLG K++ PSQL++ RKF+ CL D+G A++FG
Sbjct: 125 GHD--NEGLFVGAAGLLGLGAGKLSFPSQLSS-----RKFSYCLVSRDNGVRASSALLFG 177
Query: 206 DGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
D + S + YT L N K++T Y+ G++ + + G + + T
Sbjct: 178 D------SALPTSASFAYTQLLKNPKLDTF----------YYAGLSGISIGGTLLSIPST 221
Query: 265 LLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
+ + G GG I + T L T Y + AF SA K+ R A + F C+
Sbjct: 222 AFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSA 281
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG-GDALCLGFVDGGVNPRTSIVIGA 382
+ +P + + V + N +V + C F ++ +IG
Sbjct: 282 L----TSVTIPTVSFHFEGGASV-QLPPSNYLVPVDTSGTFCFAFSKTSLDLS---IIGN 333
Query: 383 RQLENNLLQFDLATSRLGFS 402
Q + + DL +SR+GF+
Sbjct: 334 IQQQTMRVAIDLDSSRVGFA 353
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 163/392 (41%), Gaps = 83/392 (21%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
+Y+ + TP PV+LT+ G +++W C+ V SST C S
Sbjct: 90 EYLLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDST 149
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIR-----IDVLSIQSTDGRNPGRAVTVPN 145
QC L + +C + + ++GD +DV ++ G +VP
Sbjct: 150 QCKLDPSVT----MCVNQTVQTCAFSYSYGDKSATIGFLDVETVSFVAG------ASVPG 199
Query: 146 FIFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
+F CG G+ S GIAG GR ++LPSQL F+ C + V
Sbjct: 200 VVFGCGLNNT--GIFRSNETGIAGFGRGPLSLPSQLKVG-----NFSHCFT------AVS 246
Query: 205 GDGP---YYDLNNFDVSKN----LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G P +DL D+ KN ++ TPL N + P+ Y++ + + V
Sbjct: 247 GRKPSTVLFDLPA-DLYKNGRGTVQTTPLIKNPAH--------PTF-YYLSLKGITVGST 296
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+P+ ++ ++ N G GGT I + +T L +Y+ + FA+ + PV P
Sbjct: 297 RLPVPESAFALKN-GTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV-----KLPVVPSNE 350
Query: 318 -----CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---GGDALCLGFVD 369
CF +G P VP+ LVL + + +N + + G ++CL ++
Sbjct: 351 TGPLLCFSAPPLG---KAPHVPK--LVLHFEGATMHLPRENYVFEAKDGGNCSICLAIIE 405
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G + +IG Q +N + +DL S+L F
Sbjct: 406 GEMT-----IIGNFQQQNMHVLYDLKNSKLSF 432
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 159/388 (40%), Gaps = 53/388 (13%)
Query: 34 LRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILW------VDCEKGYV-------SS 80
L+V +++ + TP + V G +++W VDC K SS
Sbjct: 84 LQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSS 143
Query: 81 TNKTARCGSAQC-NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
T T C SA C +L +K CG T T+GD + +T+ +
Sbjct: 144 TYATVPCSSASCSDLPTSKCTSASKCG--------YTYTYGDSS-STQGVLATETFTLAK 194
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
+ +P +F CG G + G G+ GLGR ++L SQL KF+ CL+ DD
Sbjct: 195 S-KLPGVVFGCGDTNEGDGFSQGA-GLVGLGRGPLSLVSQLGL-----DKFSYCLTSLDD 247
Query: 200 ---GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
++ G + + +++ TPL N +PS Y++ + ++ V
Sbjct: 248 TNNSPLLLGSLAGISEASA-AASSVQTTPLIKNP--------SQPSF-YYVSLKAITVGS 297
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
+ L + ++ ++G GG + + T LE Y+AL +AFA+ M A
Sbjct: 298 TRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLD 357
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNP 374
CFR G ++ VP+ LV + + +N MV GG ALCL +
Sbjct: 358 LCFRAPAKGVDQV--EVPR--LVFHFDGGADLDLPAENYMVLDGGSGALCLTV----MGS 409
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFS 402
R +IG Q +N +D+ L F+
Sbjct: 410 RGLSIIGNFQQQNFQFVYDVGHDTLSFA 437
>gi|388522823|gb|AFK49473.1| unknown [Medicago truncatula]
Length = 254
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/211 (32%), Positives = 89/211 (42%), Gaps = 37/211 (17%)
Query: 5 QSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRT-PLVPVKLTVH 63
+ +L +LL FS S +Q F P V KD T + T + T P +
Sbjct: 9 STLILTIALLSFSCS---SQQFFSP------VEKDPITNLFSTLLWVGTEPTHEFNFVID 59
Query: 64 LGGNILWVDCEKGYVSSTNKTARCGSAQC-NLANAKACGG--------GICGAGPDNPIS 114
+GG ILW DC K Y SST C S C N A +C G CGA NP+
Sbjct: 60 IGGPILWYDCNKAYNSSTYNPISCESKHCTNDAGCTSCNGPFKPGCSNNTCGANIINPLV 119
Query: 115 NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC----------GSEFVLQGLASGVV 164
+ GD D L I P + V +FI C S+F L+ L
Sbjct: 120 DAIFSGDTGSDALFI-------PKSKIKVSDFISGCTDSNAFADSADSDFPLKNLPKTSK 172
Query: 165 GIAGLGRSKVALPSQLA-AAFSLKRKFALCL 194
GI GL R+ ++LP QL+ A + KF LCL
Sbjct: 173 GILGLARTPLSLPKQLSLAPQKILNKFVLCL 203
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 159/388 (40%), Gaps = 53/388 (13%)
Query: 34 LRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILW------VDCEKGYV-------SS 80
L+V +++ + TP + V G +++W VDC K SS
Sbjct: 94 LQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSS 153
Query: 81 TNKTARCGSAQC-NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
T T C SA C +L +K CG T T+GD + +T+ +
Sbjct: 154 TYATVPCSSASCSDLPTSKCTSASKCG--------YTYTYGDSS-STQGVLATETFTLAK 204
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
+ +P +F CG G + G G+ GLGR ++L SQL KF+ CL+ DD
Sbjct: 205 S-KLPGVVFGCGDTNEGDGFSQGA-GLVGLGRGPLSLVSQLGL-----DKFSYCLTSLDD 257
Query: 200 ---GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
++ G + + +++ TPL N +PS Y++ + ++ V
Sbjct: 258 TNNSPLLLGSLAGISEASA-AASSVQTTPLIKNP--------SQPSF-YYVSLKAITVGS 307
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
+ L + ++ ++G GG + + T LE Y+AL +AFA+ M A
Sbjct: 308 TRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLD 367
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNP 374
CFR G ++ VP+ LV + + +N MV GG ALCL +
Sbjct: 368 LCFRAPAKGVDQV--EVPR--LVFHFDGGADLDLPAENYMVLDGGSGALCLTV----MGS 419
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFS 402
R +IG Q +N +D+ L F+
Sbjct: 420 RGLSIIGNFQQQNFQFVYDVGHDTLSFA 447
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 92/389 (23%), Positives = 154/389 (39%), Gaps = 67/389 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-----------------KGYVSSTNKTARC 87
Y +I TP + V G +ILWV+C SST K+ C
Sbjct: 85 YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKSVSC 144
Query: 88 GSAQCNLANAKA-CGGG-------ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
C+ N ++ C G + G G + T+G + DV+ + G N
Sbjct: 145 SDNFCSYVNQRSECHSGSTCQYVILYGDG-------SSTNGYLVRDVVHLDLVTG-NRQT 196
Query: 140 AVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
T IF CGS+ L + V GI G G+S + SQLA+ +KR FA CL
Sbjct: 197 GSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDN- 255
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
++G +F G VS +K TP+ S Y + + ++ V
Sbjct: 256 NNGGGIFAIGEV-------VSPKVKTTPML------------SKSAHYSVNLNAIEVGNS 296
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+ L+ + D+ G I + L ++Y L+ ++ ++ + V
Sbjct: 297 VLQLSSD--AFDSGDDKGVIIDSGTTLVYLPDAVYNPLMNQILASHQEL-NLHTVQDSFT 353
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR-- 375
CF D R V Q D K+V +++ Q + Q+ D C G+ +GG+ +
Sbjct: 354 CFHYID-RLDRFPTVTFQFD-----KSVSLAVYPQEYLFQVREDTWCFGWQNGGLQTKGG 407
Query: 376 -TSIVIGARQLENNLLQFDLATSRLGFSD 403
+ ++G L N L+ +D+ +G+++
Sbjct: 408 ASLTILGDMALSNKLVVYDIENQVIGWTN 436
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 103/397 (25%), Positives = 159/397 (40%), Gaps = 71/397 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-------EKGY------VSSTNKTARCG 88
T +Y+ + TP PV LT+ G +++W C +G SST CG
Sbjct: 89 TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCG 148
Query: 89 SAQCNLANAKACGGG---ICGAGPDNPISNTGTHGDIRIDVLSIQST-----------DG 134
+ +C +CGGG G G + + +GD + V I + D
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNG-NRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDS 207
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
R P R +T F CG F S GIAG GR + +LPSQL F+ C
Sbjct: 208 RLPTRRLT-----FGCG-HFNKGVFQSNETGIAGFGRGRWSLPSQLNVT-----TFSYCF 256
Query: 195 -SPFDDGA--IVFGDGPYYDL---NNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIG 248
S F+ + + G P L + +S ++ TPL N +PS+ YF+
Sbjct: 257 TSMFESKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNP--------SQPSL-YFLS 307
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKV 306
+ K + + KT L++ + T I + T L ++Y+A+ FA+ +P
Sbjct: 308 L-------KGISVGKTRLAVPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPPT 360
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALCL 365
V A CF L R P VP + L L + W + N + + + +C+
Sbjct: 361 GVVEGSA-LDLCFALPVTALWR-RPPVPSLTLHLDGAD--WELPRGNYVFEDLAARVMCV 416
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+D P VIG Q +N + +DL L F+
Sbjct: 417 -VLDAA--PGDQTVIGNFQQQNTHVVYDLENDWLSFA 450
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 86/377 (22%), Positives = 151/377 (40%), Gaps = 53/377 (14%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYV---------SSTNKTARCGSAQC-----NLANAK 98
TP V + + G + W+ C K S++ +T C S C +
Sbjct: 39 TPPQNVSMVIDTGSELSWLHCNKTLSYPTTFDPTRSTSYQTIPCSSPTCTNRTQDFPIPA 98
Query: 99 ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQG 158
+C + + + G++ DV I S+D + +F C
Sbjct: 99 SCDSNNLCHATLSYADASSSDGNLASDVFHIGSSD---------ISGLVFGCMDSVFSSN 149
Query: 159 --LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNF 215
S G+ G+ R ++ SQL KF+ C+S D G ++ G+ +N
Sbjct: 150 SDEDSKSTGLMGMNRGSLSFVSQLGFP-----KFSYCISGTDFSGLLLLGE------SNL 198
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
S L YTPL +++T + V Y + + + V K +P+ K+ D+ G G
Sbjct: 199 TWSVPLNYTPLI--QISTPLPYFDR--VAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQ 254
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF---GACFRLQDIGFT-RIGP 331
T + + +T L +Y AL AF + V RV F GA + + R+ P
Sbjct: 255 TMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLP 314
Query: 332 VVPQIDLVLQNKNVVWSIHGQNSMVQIGGD------ALCLGFVDGGVNPRTSIVIGARQL 385
++P + LV + + ++ G + ++ G+ CL F + + + VIG
Sbjct: 315 LLPTVTLVFRGAEM--TVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEAYVIGHHHQ 372
Query: 386 ENNLLQFDLATSRLGFS 402
+N ++FDL SR+G +
Sbjct: 373 QNVWMEFDLEKSRIGLA 389
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/276 (27%), Positives = 119/276 (43%), Gaps = 36/276 (13%)
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SP 196
+ VPNF+ C S F + A GIAG GR +LPSQL KF+ CL
Sbjct: 189 HGLIVPNFLVGC-SVFSSRQPA----GIAGFGRGPSSLPSQLGLT-----KFSYCLLSHK 238
Query: 197 FDD----GAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTS 251
FDD ++V D + L YTPL N KV F SV Y++ +
Sbjct: 239 FDDTQESSSLVLDSQSDSD----KKTAALMYTPLVKNPKVQDKPAF----SVYYYVSLRR 290
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR--- 308
+ + G++V + LS D +G GGT I + +T + T ++ L F S + R
Sbjct: 291 ISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALM 350
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-DALCLGF 367
V ++ CF + + PQ+ L + V + +N +G + C
Sbjct: 351 VEALSGLKPCFNVSGAKELEL----PQLRLHFKGGADV-ELPLENYFAFLGSREVACFTV 405
Query: 368 VDGGVNPRT--SIVIGARQLENNLLQFDLATSRLGF 401
V G + +++G Q++N +++DL RLGF
Sbjct: 406 VTDGAEKASGPGMILGNFQMQNFYVEYDLQNERLGF 441
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 97/400 (24%), Positives = 166/400 (41%), Gaps = 70/400 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC-----------------EKGYVSSTNKTAR 86
QY+ + TP V L G +++W+ C +V+S + T
Sbjct: 53 QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 112
Query: 87 ---CGSAQCNLANAKACGGGICGAGPDNPI---------SNTGTHGDIRIDVLSIQSTDG 134
C +AQC L A G C P P+ + T G + D +I ++G
Sbjct: 113 VVPCSAAQCLLVPAPRGHGPSCS--PAAPVPCGYAYDYADGSSTTGFLARDTATI--SNG 168
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
+ G AV F CG+ G SG G+ GLG+ +++ P+Q + F+ + F+ CL
Sbjct: 169 TSGGAAVR--GVAFGCGTR-NQGGSFSGTGGVIGLGQGQLSFPAQSGSLFA--QTFSYCL 223
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
+ G + L + YTPL N + P+ Y++GV ++ V
Sbjct: 224 LDLEGGRRGRSSS-FLFLGRPERRAAFAYTPLVSNPL--------APTF-YYVGVVAIRV 273
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
+ +P+ + +ID G GGT I + + T L Y LV AFA+++ + R+ A
Sbjct: 274 GNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV-HLPRIPSSAT 332
Query: 315 F----GACFRLQ--------DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA 362
F C+ + + GF R+ ID + + + N +V + D
Sbjct: 333 FFQGLELCYNVSSSSSLAPANGGFPRL-----TIDFA---QGLSLELPTGNYLVDVADDV 384
Query: 363 LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
CL + ++P V+G + ++FD A++R+GF+
Sbjct: 385 KCLA-IRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFA 423
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 85/379 (22%), Positives = 151/379 (39%), Gaps = 57/379 (15%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYV---------SSTNKTARCGSAQC-----NLANAK 98
TP V + + G + W+ C K SS+ C S C +
Sbjct: 93 TPPQNVSMVLDTGSELSWLRCNKTQTFQTTFDPNRSSSYSPVPCSSLTCTDRTRDFPIPA 152
Query: 99 ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC-GSEFVLQ 157
+C + + + G++ D I ++D +P IF C S F
Sbjct: 153 SCDSNQLCHAILSYADASSSEGNLASDTFYIGNSD---------MPGTIFGCMDSSFSTN 203
Query: 158 GLA-SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNF 215
S G+ G+ R ++ SQ+ KF+ C+S D G ++ GD NF
Sbjct: 204 TEEDSKNTGLMGMNRGSLSFVSQMDFP-----KFSYCISDSDFSGVLLLGDA------NF 252
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
L YTPL +++T + V Y + + + V+ K +PL K++ D+ G G
Sbjct: 253 SWLMPLNYTPLI--QISTPLPYFDR--VAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQ 308
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF------GACFRLQDIGFTRI 329
T + + +T L +Y AL F + ++ RV + C+R+ + T +
Sbjct: 309 TMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVP-LSQTSL 367
Query: 330 GPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD------ALCLGFVDGGVNPRTSIVIGAR 383
P +P + L+ + + + G + ++ G+ C F + + + VIG
Sbjct: 368 -PWLPTVSLMFRGAEM--KVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVIGHH 424
Query: 384 QLENNLLQFDLATSRLGFS 402
+N ++FDL SR+GF+
Sbjct: 425 HQQNVWMEFDLEKSRIGFA 443
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 156/373 (41%), Gaps = 44/373 (11%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+++ ++ +P + G +++W C+ T Q + +C
Sbjct: 365 EFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSE 424
Query: 104 ICGAGPDNPISNTG-----THGDIRID--VLSIQS-TDGRNPGRAVTVPNFIFLCGSEFV 155
+CGA P + S+ G T+GD VL+ ++ T G + +++P F CG++
Sbjct: 425 LCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNN 484
Query: 156 LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG---AIVFGDGPYYDL 212
G + G G+ GLGR ++L SQL ++KFA CL+ DD +++ G ++
Sbjct: 485 GDGFSQGA-GLVGLGRGPLSLVSQLK-----EQKFAYCLTAIDDSKPSSLLLGS--LANI 536
Query: 213 NNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEG 272
+K TPL N +PS Y++ + + V G + + K+ + ++G
Sbjct: 537 TPKTSKDEMKTTPLIKNP--------SQPSF-YYLSLQGISVGGTQLSIPKSTFELHDDG 587
Query: 273 VGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPV 332
GG I + T +E S + +L F + M + CF L G ++
Sbjct: 588 SGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLP-AGTNQV--E 644
Query: 333 VPQIDLVLQNKNVVWSIHGQNSMVQIGGDA----LCLGFVDGGVNPRTSIVIGARQLENN 388
VP+ L K + G+N M+ GD+ LCL + R + G Q +N
Sbjct: 645 VPK--LTFHFKGADLELPGENYMI---GDSKAGLLCLAI----GSSRGMSIFGNLQQQNF 695
Query: 389 LLQFDLATSRLGF 401
++ DL L F
Sbjct: 696 MVVHDLQEETLSF 708
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 97/402 (24%), Positives = 167/402 (41%), Gaps = 70/402 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC-----------------EKGYVSSTNKT-- 84
QY+ + TP V L G +++W+ C +V+S + T
Sbjct: 52 QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 111
Query: 85 -ARCGSAQCNLANAKACGGGICGAGPDNPI---------SNTGTHGDIRIDVLSIQSTDG 134
C +AQC L A G C P P+ + T G + D +I ++G
Sbjct: 112 VVPCSAAQCLLVPAPRGHGPACS--PAAPVPCGYAYDYADGSSTTGFLARDTATI--SNG 167
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
+ G AV F CG+ G SG G+ GLG+ +++ P+Q + F+ + F+ CL
Sbjct: 168 TSGGAAVR--GVAFGCGTR-NQGGSFSGTGGVIGLGQGQLSFPAQSGSLFA--QTFSYCL 222
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
+ G + L + YTPL N + P+ Y++GV ++ V
Sbjct: 223 LDLEGGRRGRSSS-FLFLGRPERRAAFAYTPLVSNPL--------APTF-YYVGVVAIRV 272
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
+ +P+ + +ID G GGT I + + T L Y LV AFA+++ + R+ A
Sbjct: 273 GNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV-HLPRIPSSAT 331
Query: 315 F----GACFRL--------QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA 362
F C+ + + GF R+ ID + + + N +V + D
Sbjct: 332 FFQGLELCYNVSSSSSSAPANGGFPRL-----TIDFA---QGLSLELPTGNYLVDVADDV 383
Query: 363 LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
CL + ++P V+G + ++FD A++R+GF+ +
Sbjct: 384 KCLA-IRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFART 424
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 97/381 (25%), Positives = 164/381 (43%), Gaps = 47/381 (12%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y+ + TP ++ + G ++ W+ C ++G V SS+ + CG
Sbjct: 148 EYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQ 207
Query: 91 QCNL----ANAKACGGGICGAGPDNPI--SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
+C L +AC + P + T GD+ ++ ++ T PG + V
Sbjct: 208 RCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLT---APGASRRVD 264
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA-IV 203
+F CG +GL G G+ GLGR ++ SQL A + + L D G+ +V
Sbjct: 265 GVVFGCGHRN--RGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVEHGSDAGSKVV 322
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG+ Y L + LKYT F + A F Y++ + V V G + ++
Sbjct: 323 FGED-YLVLAH----PQLKYT-AFAPTSSPADTF-------YYVKLKGVLVGGDLLNISS 369
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP-FGACFRLQ 322
+ +G GGT I + + Y+ + QAF M ++ + P P C+ +
Sbjct: 370 DTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLNPCYNVS 429
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI-VIG 381
G R P VP++ L+ + VW +N V++ D + V G PRT + +IG
Sbjct: 430 --GVER--PEVPELSLLFAD-GAVWDFPAENYFVRLDPDGIMCLAVRG--TPRTGMSIIG 482
Query: 382 ARQLENNLLQFDLATSRLGFS 402
Q +N + +DL +RLGF+
Sbjct: 483 NFQQQNFHVVYDLQNNRLGFA 503
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/368 (22%), Positives = 145/368 (39%), Gaps = 39/368 (10%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+Y+ I +P + V G +++W C + + + + + +C
Sbjct: 79 EYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSCASN 138
Query: 104 ICGAGPDNPISNTGTHGDIRIDVLSIQS--TDGRNPGRAVTVPNFIFLCGSEFVLQGLAS 161
C + P + + + + D S + T+PN F CG + G +
Sbjct: 139 FCSSLPFQSCTTSCKYDYMYGDGSSTSGALSTETVTVGTGTIPNVAFGCGHTNL--GSFA 196
Query: 162 GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF---DDGAIVFGDGPYYDLNNFDVS 218
G GI GLG+ ++L SQ ++ S +KF+ CL P ++ GD +
Sbjct: 197 GAAGIVGLGQGPLSLISQASSITS--KKFSYCLVPLGSTKTSPMLIGDSA--------AA 246
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
+ YT L N N P+ Y+ +T + V+GKAV SID G GG +
Sbjct: 247 GGVAYTALLTNTAN--------PTF-YYADLTGISVSGKAVTYPVGTFSIDASGQGGFIL 297
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL 338
+ T LET + ALV A + +P + CF + P P +
Sbjct: 298 DSGTTLTYLETGAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVA----NPTYPT--M 351
Query: 339 VLQNKNVVWSIHGQNSMVQIG-GDALCLGF-VDGGVNPRTSIVIGARQLENNLLQFDLAT 396
K + + +N V + G ++CL G + ++G Q +N+L+ DL
Sbjct: 352 TFHFKGADYELPPENVFVALDTGGSICLAMAASTGFS-----IMGNIQQQNHLIVHDLVN 406
Query: 397 SRLGFSDS 404
R+GF ++
Sbjct: 407 QRVGFKEA 414
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 97/395 (24%), Positives = 158/395 (40%), Gaps = 77/395 (19%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYV--------------------SSTNKTARCGSAQC 92
TP + L + G +++W C YV SS++K C + +C
Sbjct: 98 TPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKVLGCVNPKC 157
Query: 93 NLANAKACGGGICGAGPDNP-----------ISNTGTHGDIRI-DVLSIQSTDGRNPGRA 140
+ P +P +G G I + + L + PG+
Sbjct: 158 GWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDL-------PGKG 210
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFD 198
V PNFI C VL S GI+G GR +LPSQL +KF+ CL +D
Sbjct: 211 V--PNFIVGCS---VLS--TSQPAGISGFGRGPPSLPSQLGL-----KKFSYCLLSRRYD 258
Query: 199 D---GAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHV 254
D + + DG + ++ + + L YTP N KV F SV Y++G+ + V
Sbjct: 259 DTTESSSLVLDG---ESDSGEKTAGLSYTPFVQNPKVAGKHAF----SVYYYLGLRHITV 311
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--PKVARVAPV 312
GK V + L +G GGT I + +T ++ I++ + F + + V +
Sbjct: 312 GGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGI 371
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDGG 371
CF + + P P++ L + + N + +GG D +CL V G
Sbjct: 372 TGLRPCFNISGLN----TPSFPELTLKFRG-GAEMELPLANYVAFLGGDDVVCLTIVTDG 426
Query: 372 VNPRT-----SIVIGARQLENNLLQFDLATSRLGF 401
+ +I++G Q +N +++DL RLGF
Sbjct: 427 AAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGF 461
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 108/247 (43%), Gaps = 32/247 (12%)
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF-----DDGAIVFGDGPYYDLNNF 215
+G GI GLG+ V++PSQL + L KF+ CL + + + FGD
Sbjct: 146 TGGEGILGLGQGPVSMPSQLGSV--LGNKFSYCLVDWLSAGSETSTMYFGDAAV------ 197
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
S ++YTP+ N + Y+I V + V G + +++++ ID+ G GG
Sbjct: 198 -PSGEVQYTPIVPNA---------DHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGG 247
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQ 335
T I + T L+ ++ ALV A+ S + + CF + G PV P
Sbjct: 248 TIIDSGTTITYLQQEVFNALVAAYTSQV-RYPTTTSATGLDLCFNTRGTG----SPVFPA 302
Query: 336 IDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLA 395
+ + L V + N+ + + + +CL F P + G Q +N + +DL
Sbjct: 303 MTIHL--DGVHLELPTANTFISLETNIICLAFASALDFPIA--IFGNIQQQNFDIVYDLD 358
Query: 396 TSRLGFS 402
R+GF+
Sbjct: 359 NMRIGFA 365
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 99/396 (25%), Positives = 158/396 (39%), Gaps = 95/396 (23%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS---------AQCNL 94
+YV QI TP V G ++ WV C ARC A +
Sbjct: 7 EYVLQISLGTPPQQFSAIVDTGSDLCWVQCAP--------CARCFEQPDPLFIPLASSSY 58
Query: 95 ANAKACGGGICGAGPDNPIS-------------NTGTHGDIRIDVLSIQSTDGRNPGRAV 141
+NA +C +C A P S + T GD + +++ +
Sbjct: 59 SNA-SCTDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGS--------- 108
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL------- 194
T+ F CG +G +G G+ GLG+ ++LPSQL ++F+ F+ CL
Sbjct: 109 TLARIGFGCGHN--QEGTFAGADGLIGLGQGPLSLPSQLNSSFT--HIFSYCLVDQSTTG 164
Query: 195 --SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
SP I FG N + +TPL N+ N PS Y++GV S+
Sbjct: 165 TFSP-----ITFG--------NAAENSRASFTPLLQNEDN--------PSY-YYVGVESI 202
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR---- 308
V + VP + ID GVGG + + T+I + AF + ++ R
Sbjct: 203 SVGNRRVPTPPSAFRIDANGVGGVILDS-------GTTITYWRLAAFIPILAELRRQISY 255
Query: 309 -VAPVAPFG--ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
A P+G C+ + + + + +P + + L N + + +V G+ +C
Sbjct: 256 PEADPTPYGLNLCYDISSVSASSL--TLPSMTVHLTNVDFEIPVSNLWVLVDNFGETVCT 313
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+ + SI IG Q +NNL+ D+A SR+GF
Sbjct: 314 AM---STSDQFSI-IGNVQQQNNLIVTDVANSRVGF 345
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/393 (22%), Positives = 161/393 (40%), Gaps = 71/393 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
YVT I TP + G +++W+ C+ SS+ T CG
Sbjct: 39 DYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDT 98
Query: 91 QCNLANAKACGGGICGAGPDNPIS-----NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
C+ K+C PD S +GT G + + +++ ST G + N
Sbjct: 99 LCDSLPRKSC-------SPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEK----LAAKN 147
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG----- 200
F CG + +G + G+ GLGR ++ SQL F KF+ CL P+ D
Sbjct: 148 IAFGCG--HLNRGSFNDASGLVGLGRGNLSFVSQLGDLFG--HKFSYCLVPWRDAPSKTS 203
Query: 201 AIVFGDGPYYDLNNFDVSKNLKY--TPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNG 256
+ FGD + ++ K L Y TP+ N P++E Y++ + + + G
Sbjct: 204 PMFFGD----ESSSHSSGKKLHYAFTPMIHN-----------PAMESFYYVKLKDISIAG 248
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKVARVAPVAP 314
+A+ + I +G GG + T+L + Y+ +++A S + PK+ + A
Sbjct: 249 RALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSS--AG 306
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI--GGDALCLGFVDGGV 372
C+ D+ ++ + +V + + + +N + G +CL V +
Sbjct: 307 LDLCY---DVSGSKASYKMKIPAMVFHFEGADYQLPVENYFIAANDAGTIVCLAMVSSNM 363
Query: 373 NPRTSIVIGARQLENNL-LQFDLATSRLGFSDS 404
+ I I ++ N + +D+ +S++G++ S
Sbjct: 364 D----IGIYGNMMQQNFRVMYDIGSSKIGWAPS 392
>gi|302757463|ref|XP_002962155.1| hypothetical protein SELMODRAFT_403740 [Selaginella moellendorffii]
gi|300170814|gb|EFJ37415.1| hypothetical protein SELMODRAFT_403740 [Selaginella moellendorffii]
Length = 336
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 114/259 (44%), Gaps = 36/259 (13%)
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVS 218
+A V IAGLG ++ AL +QLA A L F+ CL GA+ FG Y F S
Sbjct: 100 VACAPVRIAGLGPAESALHAQLARAAGLPLTFSYCLPSAGYGALFFGATSY----RFGAS 155
Query: 219 -KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTK 277
+ K L ++++ +SGF Y V S+ + G + L++ L
Sbjct: 156 GRGFKILKLGLSRLRASSGF-------YSARVASIELGGVRIALDRDAL----------- 197
Query: 278 ISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA---CFRLQDIGFTRIGPVVP 334
T YT L + Y+AL A V+R A FGA C+R+ +P
Sbjct: 198 FGTHRRYTALPDASYRALRDRLV-AQSNVSRAN--ARFGALDLCYRIDSAAAME---SLP 251
Query: 335 QIDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSIVIGARQLENNLLQFD 393
I L VW I N +V L C+G V+GG + ++ IG Q ++++L+F+
Sbjct: 252 TIRLAFAG-GFVWEIGAANYLVPTREPGLFCVGIVNGGED--STPAIGTFQQQDHVLEFN 308
Query: 394 LATSRLGFSDSLLFERATC 412
LA LG S SL+ C
Sbjct: 309 LAKKTLGISKSLVGMGGNC 327
>gi|302763287|ref|XP_002965065.1| hypothetical protein SELMODRAFT_406191 [Selaginella moellendorffii]
gi|300167298|gb|EFJ33903.1| hypothetical protein SELMODRAFT_406191 [Selaginella moellendorffii]
Length = 336
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 114/259 (44%), Gaps = 36/259 (13%)
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVS 218
+A V IAGLG ++ AL +QLA A L F+ CL GA+ FG Y F S
Sbjct: 100 VACAPVRIAGLGPAESALHAQLARAAGLPLTFSYCLPSAGYGALFFGATSY----RFGAS 155
Query: 219 -KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTK 277
+ K L ++++ +SGF Y V S+ + G + L++ L
Sbjct: 156 GRGFKILKLGLSRLRASSGF-------YSARVASIELGGVRIALDRDAL----------- 197
Query: 278 ISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA---CFRLQDIGFTRIGPVVP 334
T YT L + Y+AL A V+R A FGA C+R+ +P
Sbjct: 198 FGTHRRYTALPGASYRALRDRLV-AQSNVSRAN--ARFGALDLCYRIDSAAAME---SLP 251
Query: 335 QIDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSIVIGARQLENNLLQFD 393
I L VW I N +V L C+G V+GG + ++ IG Q ++++L+F+
Sbjct: 252 TIRLAFAG-GFVWEIGAANYLVPTREPGLFCVGIVNGGED--STPAIGTFQQQDHVLEFN 308
Query: 394 LATSRLGFSDSLLFERATC 412
LA LG S SL+ C
Sbjct: 309 LAKKTLGISKSLVGMGGNC 327
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 111/252 (44%), Gaps = 29/252 (11%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSP--FDD----GAIVFGDGPYYDLNNFDVS 218
GIAG GR +++PSQL K +FA CL FD+ +V GD NN
Sbjct: 126 GIAGFGRGALSMPSQLGEHIG-KDRFAYCLQSHRFDEENKKSLMVLGDKAL--PNNIP-- 180
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP-LNKTLLSIDNEGVGGTK 277
L YTP N S G V Y+IG+ V + GK + L LL D +G GGT
Sbjct: 181 --LNYTPFLTNSRAPPSSQYG---VYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGGTI 235
Query: 278 ISTVNPYTVLETSIYKALVQAFASAM--PKVARVAPVAPFGACFRLQDIGFTRIGPVVPQ 335
I + +TV I+K + FAS + + V G C+ + G I V+P+
Sbjct: 236 IDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGLCYDV--TGLENI--VLPE 291
Query: 336 IDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDG----GVNPRTSIVIGARQLENNLL 390
+ ++V + S D++CL + V+ ++++G Q ++ L
Sbjct: 292 FAFHFKGGSDMVLPVANYFSYFS-SFDSICLTMISSRGLLEVDSGPAVILGNDQQQDFYL 350
Query: 391 QFDLATSRLGFS 402
+D +RLGF+
Sbjct: 351 LYDREKNRLGFT 362
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 88/368 (23%), Positives = 157/368 (42%), Gaps = 39/368 (10%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+Y+ ++ TP V + G +++W C+ T + + + +CG
Sbjct: 107 EYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSS 166
Query: 104 ICGAGPDNPISN----TGTHGDIRID--VLSIQS-TDGRNPGRAVTVPNFIFLCGSEFVL 156
+C A P + S+ ++GD + VL+ ++ T G++ + V+V N F CG +
Sbjct: 167 LCSALPSSTCSDGCEYVYSYGDYSMTQGVLATETFTFGKSKNK-VSVHNIGFGCGEDNEG 225
Query: 157 QGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLN 213
G G+ GLGR ++L SQL +++F+ CL+P DD ++ G L
Sbjct: 226 DGFEQ-ASGLVGLGRGPLSLVSQLK-----EQRFSYCLTPIDDTKESVLLLGS-----LG 274
Query: 214 NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGV 273
+K + TPL N + +PS Y++ + ++ V + + K+ + ++G
Sbjct: 275 KVKDAKEVVTTPLLKNPL--------QPSF-YYLSLEAISVGDTRLSIEKSTFEVGDDGN 325
Query: 274 GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVV 333
GG I + T ++ Y+AL + F S CF L G T++ +
Sbjct: 326 GGVIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLDLCFSLPS-GSTQV--EI 382
Query: 334 PQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFD 393
P+ LV K + +N M IG L + + G + SI G Q +N L+ D
Sbjct: 383 PK--LVFHFKGGDLELPAENYM--IGDSNLGVACLAMGASSGMSI-FGNVQQQNILVNHD 437
Query: 394 LATSRLGF 401
L + F
Sbjct: 438 LEKETISF 445
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 95/397 (23%), Positives = 158/397 (39%), Gaps = 59/397 (14%)
Query: 24 QTSFRPKALVLRVAKDTA--TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------- 74
QT +P+ L V+ T+ + +Y T++ P + + G +I W+ C+
Sbjct: 136 QTEIQPQDLSTPVSSGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQ 195
Query: 75 ------KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLS 128
SS+ C S QCN +C G C N + T GD + +S
Sbjct: 196 QSDPIFTPAASSSYSPLTCDSQQCNSLQMSSCRNGQCRYQV-NYGDGSFTFGDFVTETMS 254
Query: 129 IQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKR 188
G + TV + CG + +GL G G+ GLG ++L SQL A
Sbjct: 255 F--------GGSGTVNSIALGCGHDN--EGLFVGAAGLLGLGGGPLSLTSQLKAT----- 299
Query: 189 KFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIG 248
F+ CL D A D N+ V ++ L +K++T Y++G
Sbjct: 300 SFSYCLVNRDSAA-----SSTLDFNSAPVGDSVIAPLLKSSKIDTF----------YYVG 344
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
++ + V G+ + + + + +D+ G GG + T L++ Y +L +F S +
Sbjct: 345 LSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAITRLQSEAYNSLRDSFVSMSRHLRS 404
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGF 367
+ VA F C+ L ++ V D W + N ++ + C F
Sbjct: 405 TSGVALFDTCYDLSGQSSVKVPTVSFHFD-----GGKSWDLPAANYLIPVDSAGTYCFAF 459
Query: 368 VDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
P TS +IG Q + + FDLA +R+GFS
Sbjct: 460 A-----PTTSSLSIIGNVQQQGTRVSFDLANNRVGFS 491
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 90/390 (23%), Positives = 149/390 (38%), Gaps = 58/390 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--------------GYVSSTNKTARCGS 89
QY ++ P + L G +++WV C SST A C
Sbjct: 82 QYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYD 141
Query: 90 AQCNL----ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGR-NPGRAVTVP 144
C L A C + P G + + + ++T + + G+ +
Sbjct: 142 PVCRLVPKPGRAPRCNHTRIHS--TCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLK 199
Query: 145 NFIFLCGSEFVLQGLA----SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-----S 195
+ F CG Q ++ +G G+ GLGR ++ SQL F KF+ CL S
Sbjct: 200 SVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFG--NKFSYCLMDYTLS 257
Query: 196 PFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
P ++ GDG D L +TPL N ++ Y++ + SV VN
Sbjct: 258 PPPTSYLIIGDG-------GDAVSKLFFTPLLTNPLS---------PTFYYVKLKSVFVN 301
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKVARVAPVA 313
G + ++ ++ ID+ G GGT + + L Y+ ++ A +P + P
Sbjct: 302 GAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPG- 360
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN 373
F C + G T+ ++P++ V+ +N ++ CL V+
Sbjct: 361 -FDLCVNVS--GVTKPEKILPRLKFEFSG-GAVFVPPPRNYFIETEEQIQCLAIQS--VD 414
Query: 374 PRTSI-VIGARQLENNLLQFDLATSRLGFS 402
P+ VIG + L +FD SRLGFS
Sbjct: 415 PKVGFSVIGNLMQQGFLFEFDRDRSRLGFS 444
>gi|226427708|gb|ACO55043.1| xylanase inhibitor [Triticum aestivum]
Length = 136
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 68/132 (51%), Gaps = 8/132 (6%)
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
+ ++ T +++ G +ST PY L + +Y+ + AF AM A+VA VAPF C
Sbjct: 8 IAIDGTRVAVSGSGALIVGLSTTIPYAQLRSDVYRPFITAFDRAMGSSAKVAAVAPFELC 67
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV-----DGGVN 373
+ + TR G +VP +DL+L+ +++ G NSM Q+ C FV GG
Sbjct: 68 YDSSKLSPTRFGYLVPNVDLMLEG-GTNFTVVGGNSMAQVNSGTACFAFVRSGGSTGGAT 126
Query: 374 PRTSIVIGARQL 385
P ++VIG Q+
Sbjct: 127 P--ALVIGGFQM 136
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 97/405 (23%), Positives = 163/405 (40%), Gaps = 75/405 (18%)
Query: 39 DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--------------GYVSSTNKT 84
T + QY I+ TP + L G +++WV C SS+
Sbjct: 82 STGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSP 141
Query: 85 ARCGSAQCNLANAKACGGGICGAGPDNPISNTGTH-----------GDIRIDVLSIQSTD 133
C C L P + ++T H G + S ++T
Sbjct: 142 FHCFDPHCRLLPH----------APHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTT 191
Query: 134 GRN-PGRAVTVPNFIFLCGSEFVLQGLA------SGVVGIAGLGRSKVALPSQLAAAFSL 186
++ G + + F CG F + G + +G G+ GLGR ++ SQL F
Sbjct: 192 LKSLSGSEIHLKGLSFGCG--FRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFG- 248
Query: 187 KRKFALCL-----SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEP 241
KF+ CL SP ++ G G + L + +K + YTPL IN ++
Sbjct: 249 -NKFSYCLMDYTLSPPPTSFLMIGGG-LHSLPLTNATK-ISYTPLQINPLS--------- 296
Query: 242 SVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS 301
Y+I + S+ ++G +P+N + ID +G GGT + + T L + Y+ ++++
Sbjct: 297 PTFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRR 356
Query: 302 --AMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG 359
+P A + P F C + G +R P +P++ L V++ +N ++
Sbjct: 357 RVKLPNAAELTPG--FDLC--VNASGESR-RPSLPRLRFRL-GGGAVFAPPPRNYFLETE 410
Query: 360 GDALCLGF--VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+CL V+ G VIG + LL+FD SRLGF+
Sbjct: 411 EGVMCLAIRAVESG---NGFSVIGNLMQQGFLLEFDKEESRLGFT 452
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 103/397 (25%), Positives = 159/397 (40%), Gaps = 70/397 (17%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGY----------VSST 81
T T Y T+I TP + V G +ILWV+C + G SST
Sbjct: 84 TDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSST 143
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDG 134
C C A GG + G P + T+GD D+L G
Sbjct: 144 GSKVSCDQGFC----AATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSG 199
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASG-----VVGIAGLGRSKVALPSQLAAAFSLKRK 189
R F CGS+ QG G + GI G G+S ++ SQL+AA +K+
Sbjct: 200 DGQTRPAN-STVTFGCGSQ---QGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKI 255
Query: 190 FALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
FA CL + G I + + N V +K TPL N + Y + +
Sbjct: 256 FAHCLDTINGGGI-------FAIGNV-VQPKVKTTPLVPNMPH------------YNVNL 295
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
S+ V G A+ L + D GT I + T L +YK ++ A A K
Sbjct: 296 KSIDVGGTALKLPSHMF--DTGEKKGTIIDSGTTLTYLPEIVYKEIMLA-VFAKHKDITF 352
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
V F CF Q +G R+ P+I +N ++ +++ + + G + C+GF +
Sbjct: 353 HNVQEF-LCF--QYVG--RVDDDFPKITFHFEN-DLPLNVYPHDYFFENGDNLYCVGFQN 406
Query: 370 GGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
GG+ + +++G L N L+ +DL +G+++
Sbjct: 407 GGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTE 443
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 151/392 (38%), Gaps = 59/392 (15%)
Query: 39 DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------KGYVSSTNKTAR 86
D T QY T+I+ TP ++ V G + WV+C + S + KT
Sbjct: 78 DYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVG 137
Query: 87 CGSAQC-----NLANAKACGGGICGAGPDNPISNTGTHGD--IRIDVLSIQS-TDGRNPG 138
C + C NL + C P P S + D V + ++ T G G
Sbjct: 138 CLTQTCKVDLMNLFSLTTC------PTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNG 191
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
R +P + C S F Q G G+ GL S + S + + KF+ CL
Sbjct: 192 RMARLPGHLIGCSSSFTGQSF-QGADGVLGLAFSDFSFTSTATSLYGA--KFSYCLVD-- 246
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
L+N +VS L +F + +T + F ++ +N
Sbjct: 247 ------------HLSNKNVSNYL----IFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIG 290
Query: 259 VPLNKTLLSI-----DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
+ L +L I D GGT + + T+L + YK +V A + ++ RV P
Sbjct: 291 ISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEG 350
Query: 314 -PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV 372
P CF GF +PQ+ L+ + H ++ +V CLGFV G
Sbjct: 351 VPIEYCFSFTS-GFNV--SKLPQLTFHLKG-GARFEPHRKSYLVDAAPGVKCLGFVSAG- 405
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
P T+ VIG +N L +FDL S L F+ S
Sbjct: 406 TPATN-VIGNIMQQNYLWEFDLMASTLSFAPS 436
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 162/392 (41%), Gaps = 83/392 (21%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
+Y+ + TP PV+LT+ G ++W C+ V SST C S
Sbjct: 90 EYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDST 149
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIR-----IDVLSIQSTDGRNPGRAVTVPN 145
QC L + +C + + ++GD +DV ++ G +VP
Sbjct: 150 QCKLDPSVT----MCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAG------ASVPG 199
Query: 146 FIFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
+F CG G+ S GIAG GR ++LPSQL F+ C + V
Sbjct: 200 VVFGCGLNNT--GIFRSNETGIAGFGRGPLSLPSQLKVG-----NFSHCFT------AVS 246
Query: 205 GDGP---YYDLNNFDVSKN----LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G P +DL D+ KN ++ TPL N + P+ Y++ + + V
Sbjct: 247 GRKPSTVLFDLPA-DLYKNGRGTVQTTPLIKNPAH--------PTF-YYLSLKGITVGST 296
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+P+ ++ ++ N G GGT I + +T L +Y+ + FA+ + PV P
Sbjct: 297 RLPVPESAFALKN-GTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV-----KLPVVPSNE 350
Query: 318 -----CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---GGDALCLGFVD 369
CF +G P VP+ LVL + + +N + + G ++CL ++
Sbjct: 351 TGPLLCFSAPPLG---KAPHVPK--LVLHFEGATMHLPRENYVFEAKDGGNCSICLAIIE 405
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G + +IG Q +N + +DL S+L F
Sbjct: 406 GEMT-----IIGNFQQQNMHVLYDLKNSKLSF 432
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 97/391 (24%), Positives = 160/391 (40%), Gaps = 81/391 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
+Y+ + TP PV+LT+ G ++W C+ V SST C S
Sbjct: 34 EYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDST 93
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIR-----IDVLSIQSTDGRNPGRAVTVPN 145
QC L + +C + + ++GD +DV ++ G +VP
Sbjct: 94 QCKLDPSVT----MCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAG------ASVPG 143
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
+F CG S GIAG GR ++LPSQL F+ C + V G
Sbjct: 144 VVFGCGLNNT-GIFRSNETGIAGFGRGPLSLPSQLKVG-----NFSHCFT------AVSG 191
Query: 206 DGP---YYDLNNFDVSKN----LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
P +DL D+ KN ++ TPL N + P+ Y++ + + V
Sbjct: 192 RKPSTVLFDLPA-DLYKNGRGTVQTTPLIKNPAH--------PTF-YYLSLKGITVGSTR 241
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA- 317
+P+ ++ ++ N G GGT I + +T L +Y+ + FA+ + PV P
Sbjct: 242 LPVPESAFALKN-GTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV-----KLPVVPSNET 295
Query: 318 ----CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---GGDALCLGFVDG 370
CF +G P VP+ LVL + + +N + + G ++CL ++G
Sbjct: 296 GPLLCFSAPPLG---KAPHVPK--LVLHFEGATMHLPRENYVFEAKDGGNCSICLAIIEG 350
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+ +IG Q +N + +DL S+L F
Sbjct: 351 EMT-----IIGNFQQQNMHVLYDLKNSKLSF 376
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 97/411 (23%), Positives = 157/411 (38%), Gaps = 69/411 (16%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--- 74
LSP T F + D + +Y ++ +P L V G +++WV C+
Sbjct: 98 LSPAYQPTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCL 157
Query: 75 KGYV----------SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPIS---NTGTHGD 121
+ Y S+T CGSA C CG G + +S + T G
Sbjct: 158 ECYAQADPLFDPASSATFSAVSCGSAICRTLRTSGCGDS---GGCEYEVSYGDGSYTKGT 214
Query: 122 IRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLA 181
+ ++ L++ T V CG +GL G G+ GLG ++L QL
Sbjct: 215 LALETLTLGGT---------AVEGVAIGCGHRN--RGLFVGAAGLLGLGWGPMSLVGQLG 263
Query: 182 AAFSLKRKFALCLSPFDD---------GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVN 232
A F+ CL+ G++V G + V + + PL N
Sbjct: 264 GA--AGGAFSYCLASRGGSGSGAADAAGSLVLG-------RSEAVPEGAVWVPLVRNPQ- 313
Query: 233 TASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
PS Y++GV+ + V + +PL L + +G GG + T T L Y
Sbjct: 314 -------APSF-YYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAY 365
Query: 293 KALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQ 352
AL AF A+ + R V+ C+ L G+T + VP + + ++ +
Sbjct: 366 AALRDAFVGAVGALPRAPGVSLLDTCYDLS--GYTSV--RVPTVSFYFDGAATL-TLPAR 420
Query: 353 NSMVQIGGDALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGF 401
N ++++ G CL F P +S ++G Q E + D A +GF
Sbjct: 421 NLLLEVDGGIYCLAFA-----PSSSGLSILGNIQQEGIQITVDSANGYIGF 466
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 101/392 (25%), Positives = 157/392 (40%), Gaps = 70/392 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGY----------VSSTNKTAR 86
Y T+I TP + V G +ILWV+C + G SST
Sbjct: 4 YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 63
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNPGR 139
C C A GG + G P + T+GD D+L G R
Sbjct: 64 CDQGFC----AATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTR 119
Query: 140 AVTVPNFIFLCGSEFVLQGLASG-----VVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
F CGS+ QG G + GI G G+S ++ SQL+AA +K+ FA CL
Sbjct: 120 PAN-STVTFGCGSQ---QGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL 175
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
+ G I + + N V +K TPL N + Y + + S+ V
Sbjct: 176 DTINGGGI-------FAIGNV-VQPKVKTTPLVPNMPH------------YNVNLKSIDV 215
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
G A+ L + D GT I + T L +YK ++ A A K V
Sbjct: 216 GGTALKLPSHMF--DTGEKKGTIIDSGTTLTYLPEIVYKEIMLA-VFAKHKDITFHNVQE 272
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
F CF Q +G R+ P+I +N ++ +++ + + G + C+GF +GG+
Sbjct: 273 F-LCF--QYVG--RVDDDFPKITFHFEN-DLPLNVYPHDYFFENGDNLYCVGFQNGGLQS 326
Query: 375 RTS---IVIGARQLENNLLQFDLATSRLGFSD 403
+ +++G L N L+ +DL +G+++
Sbjct: 327 KDGKGMVLLGDLVLSNKLVVYDLENQVIGWTE 358
>gi|255552251|ref|XP_002517170.1| conserved hypothetical protein [Ricinus communis]
gi|223543805|gb|EEF45333.1| conserved hypothetical protein [Ricinus communis]
Length = 61
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 33/59 (55%), Positives = 39/59 (66%)
Query: 355 MVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
MV + +CL FVDGG PRT I+IG QLE+NLL FD A SR GFS +LL TC+
Sbjct: 1 MVAVNSYKMCLAFVDGGSQPRTPIIIGGHQLEDNLLHFDRANSRFGFSSNLLARSTTCS 59
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 149/371 (40%), Gaps = 60/371 (16%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACG 101
T +Y+ + TP PV+LT+ G +++W C+ + + + +C
Sbjct: 86 TTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCD 145
Query: 102 GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRN-PGRAVTVPNFIFLCGSEFVLQGLA 160
+C P V S+ +D G +VP F CG F
Sbjct: 146 STLCQGLP----------------VASLPRSDKFTFVGAGASVPGVAFGCG-LFNNGVFK 188
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI---VFGDGPYYDLNNFDV 217
S GIAG GR ++LPSQL F+ C + GAI V D P +N
Sbjct: 189 SNETGIAGFGRGPLSLPSQLKVG-----NFSHCFTTI-TGAIPSTVLLDLPADLFSNGQG 242
Query: 218 SKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTK 277
+ ++ TPL N N Y++ + + V +P+ ++ ++ N G GGT
Sbjct: 243 A--VQTTPLIQNPAN---------PTFYYLSLKGITVGSTRLPVPESEFALKN-GTGGTI 290
Query: 278 ISTVNPYTVLETSIYKALVQAFAS--AMPKVA--RVAPVAPFGACFRLQDIGFTRIGPVV 333
I + T L T +Y+ + AFA+ +P V+ P A R + P V
Sbjct: 291 IDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAK--------PYV 342
Query: 334 PQIDLVLQNKNVVWSIHGQNSMVQI---GGDALCLGFVDGGVNPRTSIVIGARQLENNLL 390
P+ LVL + + +N + ++ G LCL ++GG IG Q +N +
Sbjct: 343 PK--LVLHFEGATMDLPRENYVFEVEDAGSSILCLAIIEGG----EVTTIGNFQQQNMHV 396
Query: 391 QFDLATSRLGF 401
+DL S+L F
Sbjct: 397 LYDLQNSKLSF 407
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 151/392 (38%), Gaps = 59/392 (15%)
Query: 39 DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------KGYVSSTNKTAR 86
D T QY T+I+ TP ++ V G + WV+C + S + KT
Sbjct: 100 DYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVG 159
Query: 87 CGSAQC-----NLANAKACGGGICGAGPDNPISNTGTHGD--IRIDVLSIQS-TDGRNPG 138
C + C NL + C P P S + D V + ++ T G G
Sbjct: 160 CLTQTCKVDLMNLFSLTTC------PTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNG 213
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
R +P + C S F Q G G+ GL S + S + + KF+ CL
Sbjct: 214 RMARLPGHLIGCSSSFTGQSF-QGADGVLGLAFSDFSFTSTATSLYGA--KFSYCLVD-- 268
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
L+N +VS L +F + +T + F ++ +N
Sbjct: 269 ------------HLSNKNVSNYL----IFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIG 312
Query: 259 VPLNKTLLSI-----DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
+ L +L I D GGT + + T+L + YK +V A + ++ RV P
Sbjct: 313 ISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEG 372
Query: 314 -PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV 372
P CF GF +PQ+ L+ + H ++ +V CLGFV G
Sbjct: 373 VPIEYCFSFTS-GFNV--SKLPQLTFHLKG-GARFEPHRKSYLVDAAPGVKCLGFVSAGT 428
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
P T+ VIG +N L +FDL S L F+ S
Sbjct: 429 -PATN-VIGNIMQQNYLWEFDLMASTLSFAPS 458
>gi|147834028|emb|CAN71000.1| hypothetical protein VITISV_023637 [Vitis vinifera]
Length = 456
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 55/94 (58%), Gaps = 5/94 (5%)
Query: 287 LETSIYKALVQAF--ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKN 344
+ETSIY A +AF A+A + RVA VAPF F +++ TR VVP IDLVLQN +
Sbjct: 1 METSIYSAFTKAFISATASMNIIRVAIVAPFNXYFNSKNVYXTRGRAVVPTIDLVLQNNS 60
Query: 345 VVWSIHGQNSMVQIG---GDALCLGFVDGGVNPR 375
VVW I G NSMV+ + L G +NPR
Sbjct: 61 VVWRIFGANSMVRXSIWDFNLLTAGSSKNPINPR 94
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 103/397 (25%), Positives = 159/397 (40%), Gaps = 69/397 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTV--HLGGNILWVDCE---KGY----------VSSTNKTAR 86
TL Y+ ++ L K+TV G ++ WV C+ + Y S + +T
Sbjct: 132 TLNYIVTVE----LGGRKMTVIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVL 187
Query: 87 CGSAQCNLANAKACGGGICGAGPD--NPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
C S C + G+CG+ P N + N G R ++ T+ + G + V
Sbjct: 188 CSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGEL----GTEHLDLGNSTAVN 243
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIV 203
NFIF CG QGL G G+ GLGRS ++L SQ +A F + L ++ + G++V
Sbjct: 244 NFIFGCGRNN--QGLFGGASGLVGLGRSSLSLISQTSAMFGGVFSYCLPITETEASGSLV 301
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE-YFIGVTSVHVNGKAV--- 259
G N V KN TP+ ++ + P + YF+ +T + V AV
Sbjct: 302 MG-------GNSSVYKNT--TPISYTRM------IPNPQLPFYFLNLTGITVGSVAVQAP 346
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
K + ID+ GT I+ + P SIY+AL F CF
Sbjct: 347 SFGKDGMMIDS----GTVITRLPP------SIYQALKDEFVKQFSGFPSAPAFMILDTCF 396
Query: 320 RLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI 378
L G+ + +P I + + N + + G V+ +CL I
Sbjct: 397 NLS--GYQEV--EIPNIKMHFEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVGI 452
Query: 379 VIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFN 415
IG Q +N + +D S LGF+ CTF+
Sbjct: 453 -IGNYQQKNQRVIYDTKGSMLGFA------AEACTFD 482
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 84/389 (21%), Positives = 151/389 (38%), Gaps = 70/389 (17%)
Query: 58 VKLTVHLGGNILWVDCE----------------KGYVSSTNKTARCGSAQCNLANAKACG 101
V + + G + W+ C G SST A C S +C
Sbjct: 75 VTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPECQWRGRDLPV 134
Query: 102 GGICGAGPDNPISNTGTHGDIRIDVLSIQSTD-----GRNPGRAVTVPNFIFLCGSEF-- 154
C P N + ++ D I + D G P RA +F C + +
Sbjct: 135 PPFCAGPPSNSCRVSLSYADAS-SADGILAADTFLLGGAPPVRA------LFGCVTSYSS 187
Query: 155 ---VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYY 210
+ G+ G+ R ++ +Q A +FA C++P D G +V G
Sbjct: 188 ATATNSSDSEAATGLLGMNRGSLSFVTQTA-----TLRFAYCIAPGDGPGLLVLGG---- 238
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN 270
+ ++ L YTPL +++ + V Y + + + V +P+ K++L+ D+
Sbjct: 239 --DGAALAPQLNYTPLI--QISRPLPYFDR--VAYSVQLEGIRVGAALLPIPKSVLAPDH 292
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP--------FGACFRLQ 322
G G T + + +T L Y L F + A +AP+ F ACFR
Sbjct: 293 TGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQ--TSALLAPLGESDFVFQGAFDACFRAS 350
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD---------ALCLGFVDGGVN 373
+ ++P++ LVL+ V ++ G+ + ++ G+ CL F + +
Sbjct: 351 EARVAAASQMLPEVGLVLRGAEV--AVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMA 408
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFS 402
++ VIG +N +++DL R+GF+
Sbjct: 409 GMSAYVIGHHHQQNVWVEYDLQNGRVGFA 437
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 94/400 (23%), Positives = 163/400 (40%), Gaps = 66/400 (16%)
Query: 39 DTATLQYVTQIKQRTPLVPVKLTVHL--GGNILWVDCE------------KGYVSSTNKT 84
D + +Y+ + TP P ++ +HL G +++W C + VS T
Sbjct: 88 DVGSSEYLIHLGIGTPR-PQRVVLHLDTGSDLVWTQCACTVCFDQPVPVFRASVSHTFSR 146
Query: 85 ARCGSAQCNLANAKACGGGICGAGPDNPISNTG------THGDIRIDVLSIQSTDGRNPG 138
C C A G C A + G T G + D + ++ D +
Sbjct: 147 VPCSDPLCGHAVYLPLSG--CAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRAD-- 202
Query: 139 RAVTVPNFIFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
A VPN F CG + GL GIAG G ++LPSQL R+F+ C +
Sbjct: 203 TAAAVPNIRFGCG--MMNYGLFTPNQSGIAGFGTGPLSLPSQLKV-----RRFSYCFTAM 255
Query: 198 DDG---AIVFGDGPYYDLNNFDVSKN--LKYTPLFINKVNTASGFLGEP---SVEYFIGV 249
++ ++ G P N + ++ TP A G G P YF+ +
Sbjct: 256 EESRVSPVILGGEP----ENIEAHATGPIQSTPF-------APGPAGAPVGSQPFYFLSL 304
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP-KVAR 308
V V +P N + ++ +G GGT I + T +++++L +AF + +P VA+
Sbjct: 305 RGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPLPVAK 364
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA------ 362
CF + + P VP++ L L+ + W + +N ++ D
Sbjct: 365 GYTDPDNLLCF---SVPAKKKAPAVPKLILHLEGAD--WELPRENYVLDNDDDGSGAGRK 419
Query: 363 LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
LC+ + G + T +IG Q +N + +DL ++++ F+
Sbjct: 420 LCVVILSAGNSNGT--IIGNFQQQNMHIVYDLESNKMVFA 457
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 98/404 (24%), Positives = 158/404 (39%), Gaps = 62/404 (15%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTA--TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE- 74
L P+ +T +P+ L V T+ + +Y T++ P + + G +I W+ C+
Sbjct: 134 LKPL--ETEIKPEDLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQP 191
Query: 75 ------------KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDI 122
SST C S QC+ +C G C N + T GD
Sbjct: 192 CTDCYQQTDPIFDPTASSTYAPVTCQSQQCSSLEMSSCRSGQCLYQV-NYGDGSYTFGDF 250
Query: 123 RIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA 182
+ +S G + +V N CG + +GL G G+ GLG ++L +QL A
Sbjct: 251 ATESVSF--------GNSGSVKNVALGCGHDN--EGLFVGAAGLLGLGGGPLSLTNQLKA 300
Query: 183 AFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEP 241
F+ CL D D N+ + + PL N K++T
Sbjct: 301 T-----SFSYCLVNRDSAG-----SSTLDFNSAQLGVDSVTAPLMKNRKIDTF------- 343
Query: 242 SVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS 301
Y++G++ + V G+ V + ++ +D G GG + T L+T Y L AF
Sbjct: 344 ---YYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFVR 400
Query: 302 AMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GG 360
+ + VA F C+ L R VP + + W++ N ++ +
Sbjct: 401 MTQNLKLTSAVALFDTCYDLSGQASVR----VPTVSFHFADGK-SWNLPAANYLIPVDSA 455
Query: 361 DALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
C F P TS +IG Q + + FDLA +R+GFS
Sbjct: 456 GTYCFAFA-----PTTSSLSIIGNVQQQGTRVTFDLANNRMGFS 494
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 92/377 (24%), Positives = 155/377 (41%), Gaps = 63/377 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTA--------------RCGSA 90
YVT++ TP + V G ++ W+ C VS ++ C +
Sbjct: 121 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAP 180
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ C N ++GD V + S D + G + +VPNF + C
Sbjct: 181 QCDALTTATLNPSTCST--SNVCIYQASYGDSSFSVGYL-SKDTVSFG-STSVPNFYYGC 236
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL G+ GL R+K++L QLA S+ F+ CL P + + Y
Sbjct: 237 GQDN--EGLFGQSAGLIGLARNKLSLLYQLAP--SMGYSFSYCL-PTSSSSSGYLSIGSY 291
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS--- 267
+ + YTP+ + ++ + YFI +T + V GK + ++ + S
Sbjct: 292 NPGQY------SYTPMAKSSLDDS---------LYFIKMTGITVAGKPLSVSASAYSSLP 336
Query: 268 --IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
ID+ GT I T L T +Y AL +A A AM R + + CF+ Q
Sbjct: 337 TIIDS----GTVI------TRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-- 384
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQL 385
+R+ VPQ+ + + N +V + CL F R++ +IG Q
Sbjct: 385 -SRL--RVPQVSMAFAG-GAALKLKATNLLVDVDSATTCLAFAPA----RSAAIIGNTQQ 436
Query: 386 ENNLLQFDLATSRLGFS 402
+ + +D+ S++GF+
Sbjct: 437 QTFSVVYDVKNSKIGFA 453
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 103/409 (25%), Positives = 163/409 (39%), Gaps = 76/409 (18%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYV-------------------SSTNKTARCGSAQCN 93
TP P+ + + G ++ WV C Y SS+++ C + C
Sbjct: 107 TPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQ 166
Query: 94 LANAKA-----CGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG-------RNPGRAV 141
++ A C C G N + V ST G R PGRA
Sbjct: 167 WVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRA- 225
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDD 199
VP F+ C V Q + G+AG GR ++P+QL KF+ CL FDD
Sbjct: 226 -VPGFVLGCSLVSVHQPPS----GLAGFGRGAPSVPAQLGLP-----KFSYCLLSRRFDD 275
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
A V G L + ++Y PL V +A+G V Y++ + V V GKAV
Sbjct: 276 NAAVSGS---LVLGGTGGGEGMQYVPL----VKSAAGDKLPYGVYYYLALRGVTVGGKAV 328
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG--- 316
L + + G GGT + + +T L+ ++++ + A +A+ + + A G
Sbjct: 329 RLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDGLGL 388
Query: 317 -ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG---DALCLGFVD--- 369
CF L + +P++ + V + +N V G +A+CL V
Sbjct: 389 HPCFALPQGARSM---ALPELSFHFEG-GAVMQLPVENYFVVAGRGAVEAICLAVVTDFG 444
Query: 370 -----GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
G +I++G+ Q +N L+++DL RLG F R +CT
Sbjct: 445 GGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLG------FRRQSCT 487
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 89/417 (21%), Positives = 164/417 (39%), Gaps = 43/417 (10%)
Query: 5 QSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHL 64
QSF + + S SP++ Q L + +Y + TP L +
Sbjct: 160 QSFKPVFAPAASSTSPVSGQL-----VATLESGVSLGSGEYFMDVFVGTPPKHFSLILDT 214
Query: 65 GGNILWVDC--------EKGYV-----SSTNKTARCGSAQCNLANAKACGGGICGAGPDN 111
G ++ W+ C + G SS+ + C +C L ++
Sbjct: 215 GSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLVSSPDPPNPCKAENQSC 274
Query: 112 PI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGI 166
P + T GD ++ ++ T V N +F CG +GL G G+
Sbjct: 275 PYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENVMFGCG--HWNRGLFHGAAGL 332
Query: 167 AGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPL 226
GLG+ ++ SQ+ + + + F+ CL + A V + + NL +T
Sbjct: 333 LGLGKGPLSFASQMQSLYG--QSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSF 390
Query: 227 FINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTV 286
K + F Y++ + SV V+ + + + + + +EG GGT I + T
Sbjct: 391 GGGKDGSVDTF-------YYVQINSVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTY 443
Query: 287 LETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVV 346
Y+ + +AF + V + P C+ + I + P ++ + V
Sbjct: 444 FAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMEL----PDFGILFAD-GAV 498
Query: 347 WSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
W+ +N +QI D +CL + NPR+++ +IG Q +N + +D+ SRLG++
Sbjct: 499 WNFPVENYFIQIDPDVVCLAILG---NPRSALSIIGNYQQQNFHILYDMKKSRLGYA 552
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 73/280 (26%), Positives = 124/280 (44%), Gaps = 42/280 (15%)
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFD 198
+TVP+F+ C V+ GIAG GR +LPSQ+ + F+ CL FD
Sbjct: 208 LTVPDFVVGCS---VISTRTP--AGIAGFGRGPESLPSQMKL-----KSFSHCLVSRRFD 257
Query: 199 DGAIVFGDGPYYDLNNFDVSKN--LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
D + G + SK L YTP N + + FL Y++ + ++V
Sbjct: 258 DTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFL----EYYYLNLRRIYVGS 313
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR---VAPVA 313
K V + L+ G GG+ + + + +T +E +++ + + FA+ M R + V+
Sbjct: 314 KHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVS 373
Query: 314 PFGACFRLQDIGFTRIGPVV------PQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF 367
CF + G + ++ +++L L N +S G D +CL
Sbjct: 374 GIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSN---YFSFVGN-------ADTVCLTV 423
Query: 368 V-DGGVNP----RTSIVIGARQLENNLLQFDLATSRLGFS 402
V D VNP +I++G+ Q +N L+++DL R GF+
Sbjct: 424 VSDNTVNPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFA 463
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 151/369 (40%), Gaps = 53/369 (14%)
Query: 53 TPLVPVKLTVHLGGNILW------VDCEKGYV-------SSTNKTARCGSAQC-NLANAK 98
TP + V G +++W VDC K SST T C SA C +L +K
Sbjct: 175 TPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSK 234
Query: 99 ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQG 158
CG S++ T G + + ++ + +P +F CG G
Sbjct: 235 CTSASKCGYTYTYGDSSS-TQGVLATETFTLAKS---------KLPGVVFGCGDTNEGDG 284
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLNNF 215
+ G G+ GLGR ++L SQL KF+ CL+ DD ++ G +
Sbjct: 285 FSQGA-GLVGLGRGPLSLVSQLGL-----DKFSYCLTSLDDTNNSPLLLGSLAGISEASA 338
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
+ +++ TPL N +PS Y++ + ++ V + L + ++ ++G GG
Sbjct: 339 -AASSVQTTPLIKNP--------SQPSF-YYVSLKAITVGSTRISLPSSAFAVQDDGTGG 388
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQ 335
+ + T LE Y+AL +AFA+ M A CFR G ++ VP+
Sbjct: 389 VIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQV--EVPR 446
Query: 336 IDLVLQ-NKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNPRTSIVIGARQLENNLLQFD 393
LV + + +N MV GG ALCL + R +IG Q +N +D
Sbjct: 447 --LVFHFDGGADLDLPAENYMVLDGGSGALCLTV----MGSRGLSIIGNFQQQNFQFVYD 500
Query: 394 LATSRLGFS 402
+ L F+
Sbjct: 501 VGHDTLSFA 509
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 89/383 (23%), Positives = 150/383 (39%), Gaps = 63/383 (16%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------------KGYVSSTN 82
T YV + TP + G ++ W CE Y + +
Sbjct: 135 TGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISC 194
Query: 83 KTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
+ C + N+ +C C G + + G D L++ STD N
Sbjct: 195 SSPTCDELKSGTGNSPSCSASTCVYGIQYG-DQSYSVGFFAQDKLALTSTDVFN------ 247
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDG 200
NF+F CG +GL GV G+ GLGR+ ++L SQ A + + F+ CL + G
Sbjct: 248 --NFLFGCGQNN--RGLFVGVAGLIGLGRNALSLVSQTAQKYG--KLFSYCLPSTSSSTG 301
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
+ FG G SK +K+TP +N PS YF+ + ++ V G+ +
Sbjct: 302 YLTFGSGG-------GTSKAVKFTPSLVNSQG--------PSF-YFLNLIAISVGGRKLS 345
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
+ ++ S GT I + + L + Y L +F M K + AP + C+
Sbjct: 346 TSASVFS-----TAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYD 400
Query: 321 LQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIV 379
F++ V VP+I+L + + + +CL F G + +
Sbjct: 401 -----FSQYDTVDVPKINLYFSD-GAEMDLDPSGIFYILNISQVCLAFA-GNSDATDIAI 453
Query: 380 IGARQLENNLLQFDLATSRLGFS 402
+G Q + + +D+A R+GF+
Sbjct: 454 LGNVQQKTFDVVYDVAGGRIGFA 476
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 99/403 (24%), Positives = 165/403 (40%), Gaps = 62/403 (15%)
Query: 25 TSFRPK---ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------- 74
+S +PK +L+ K +T YV ++ TP + + + G + WV C+
Sbjct: 116 SSNKPKGGVSLLANWGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYE 175
Query: 75 ------KGYVSSTNKTARCGSAQC-NLANAKACGGGICGAGPDNPIS-----NTGTHGDI 122
SST CG+ +C LA++ + + P ++ T GD+
Sbjct: 176 QRDPVFDPTASSTYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDL 235
Query: 123 RIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA 182
D L++ + + A TVP F+F CG G V G+ GLG K +LPSQ+AA
Sbjct: 236 ARDTLTLSPSPSPS--PADTVPGFVFGCGHSNA--GTFGEVDGLLGLGLGKASLPSQVAA 291
Query: 183 AFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPS 242
+ + L SP G + FG N ++T + G+
Sbjct: 292 RYGAAFSYCLPSSPSAAGYLSFGGAA--------ARANAQFTEMVT----------GQDP 333
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
Y++ +T + V G+A+ + + + GT I + ++ L S Y AL +F SA
Sbjct: 334 TSYYLNLTGIVVAGRAIKVPASAFAT----AAGTIIDSGTAFSRLPPSAYAALRSSFRSA 389
Query: 303 MPKVA-RVAPVAP-FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
M + + AP +P F C+ RI P ++LV + V +H +
Sbjct: 390 MGRYRYKRAPSSPIFDTCYDFTGHETVRI----PAVELVFADGATV-HLHPSGVLYTWND 444
Query: 361 DA-LCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGF 401
A CL FV P + ++G Q + +D+ + R+GF
Sbjct: 445 VAQTCLAFV-----PNHDLGILGNTQQRTLAVIYDVGSQRIGF 482
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 119/283 (42%), Gaps = 45/283 (15%)
Query: 137 PGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP 196
PG+ TVP F+ C + Q GIAG GR + +LPSQ+ +LKR F+ CL
Sbjct: 210 PGK--TVPQFLVGCSILSIRQ-----PSGIAGFGRGQESLPSQM----NLKR-FSYCLV- 256
Query: 197 FDDGAIVFGDGPYY-DL-----NNFDVSKN-LKYTPLFINKVNTASGFLGEPSVEYFIGV 249
+ F D P DL + D N L YTP N F Y++ +
Sbjct: 257 ----SHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAF----KEYYYLTL 308
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
V V GK V + T L ++G GGT + + + +T +E +Y + Q F + K
Sbjct: 309 RKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSR 368
Query: 310 APVAP----FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA--L 363
A A CF + + V +L + K + + GDA +
Sbjct: 369 AEDAETQSGLSPCFNISGVK------TVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVV 422
Query: 364 CLGFV-DGGVNPRTS----IVIGARQLENNLLQFDLATSRLGF 401
CL V DGG P + I++G Q +N +++DL R GF
Sbjct: 423 CLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGF 465
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 102/409 (24%), Positives = 162/409 (39%), Gaps = 76/409 (18%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYV-------------------SSTNKTARCGSAQCN 93
TP P+ + + G ++ WV C Y SS+++ C + C
Sbjct: 75 TPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQ 134
Query: 94 LANAKA-----CGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG-------RNPGRAV 141
++ A C C G N + V ST G R PGRA
Sbjct: 135 WVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRA- 193
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDD 199
VP F+ C V Q + G+AG GR ++P+QL KF+ CL FDD
Sbjct: 194 -VPGFVLGCSLVSVHQPPS----GLAGFGRGAPSVPAQLGLP-----KFSYCLLSRRFDD 243
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
A V G L + ++Y PL V +A+G V Y++ + V V GKAV
Sbjct: 244 NAAVSGS---LVLGGTGGGEGMQYVPL----VKSAAGDKLPYGVYYYLALRGVTVGGKAV 296
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----F 315
L + + G GGT + + +T L+ ++++ + A +A+ + + A
Sbjct: 297 RLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGL 356
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG---DALCLGFVD--- 369
CF L + +P++ + V + +N V G +A+CL V
Sbjct: 357 HPCFALPQGARSM---ALPELSFHFEG-GAVMQLPVENYFVVAGRGAVEAICLAVVTDFS 412
Query: 370 -----GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
G +I++G+ Q +N L+++DL RLG F R +CT
Sbjct: 413 GGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLG------FRRQSCT 455
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 152/376 (40%), Gaps = 63/376 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-----------KGYVSSTNKTARCGSAQCN 93
YVTQ++ TP + V ++ WV CE SST K CGSA CN
Sbjct: 126 YVTQVQLGTPAKTHNVLVDTASSLSWVGCEPCINACLIPTFNPNASSTYKVVGCGSALCN 185
Query: 94 LANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSE 153
+ C A P S ++ D + V + S+D G + FIF C +
Sbjct: 186 AVPSATMARKSCMA-PTEGCSYRQSYHDYSLSV-GVVSSDTLTYG--LGSQKFIFGCCNL 241
Query: 154 FVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFDDGAIVFGDGPYYDL 212
F +G+ GI G+ +K +L SQ+ R + C P + G + FG
Sbjct: 242 F--RGVGGRYSGILGMSVNKFSLFSQMTVGHRY-RAMSYCFPHPRNQGFLQFG------- 291
Query: 213 NNFDVSKN-LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE 271
+D K+ L++TPL+I+ N YF+ V++V V +S+D +
Sbjct: 292 -RYDEHKSLLRFTPLYIDGNN------------YFVHVSNVMVE---------TMSLDVQ 329
Query: 272 GVGGTKI----STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFT 327
G + T PYT+L S++ +L + + RV + CF+ D +
Sbjct: 330 SSGNQTMRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEGYYRVG-ASTGQTCFQ-ADGNWI 387
Query: 328 RIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV--DGGVNPRTSIVIGARQL 385
+P + + QN + +++ ++ M + CL F DGG IV+G+R L
Sbjct: 388 EGDLYMPTVKIEFQNGARI-TLNSEDLMFMEEPNVFCLAFKMNDGG-----DIVLGSRHL 441
Query: 386 ENNLLQFDLATSRLGF 401
DL +G
Sbjct: 442 MGVHTVVDLEMMTMGL 457
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 107/428 (25%), Positives = 166/428 (38%), Gaps = 90/428 (21%)
Query: 13 LLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVD 72
L I +++ T + S + L TL Y+ ++ + L V G ++ WV
Sbjct: 106 LRIKAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGGK--NMSLIVDTGSDLTWVQ 163
Query: 73 CE-------------KGYVSSTNKTARCGSAQCN-----LANAKACGG--GICGAGPDNP 112
C+ VSS+ KT C S+ C N+ CGG G+ +
Sbjct: 164 CQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYV 223
Query: 113 IS---NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGL 169
+S + T GD+ + + + T + N +F CG +GL G G+ GL
Sbjct: 224 VSYGDGSYTRGDLASESIVLGDTK---------LENLVFGCGRNN--KGLFGGASGLMGL 272
Query: 170 GRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---IVFGDGPYYDLNNFDVSKN---LKY 223
GRS V+L SQ F+ F+ CL +DGA + FG N+F V KN + Y
Sbjct: 273 GRSSVSLVSQTLKTFN--GVFSYCLPSLEDGASGTLSFG-------NDFSVYKNSTSVFY 323
Query: 224 TPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNG---KAVPLNKTLLSIDNEGVGGTKI 278
TPL N P + Y + +T + G K + + +L ID+ GT I
Sbjct: 324 TPLVQN-----------PQLRSFYILNLTGASIGGVELKTLSFGRGIL-IDS----GTVI 367
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL---QDIGFTRIGPVVPQ 335
+ + P SIYKA+ F + CF L +DI +P
Sbjct: 368 TRLPP------SIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYEDIS-------IPT 414
Query: 336 IDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDL 394
I ++ + N + + G V+ +CL I IG Q +N + +D
Sbjct: 415 IKMIFEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGI-IGNYQQKNQRVIYDT 473
Query: 395 ATSRLGFS 402
RLG +
Sbjct: 474 TQERLGIA 481
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 107/450 (23%), Positives = 178/450 (39%), Gaps = 85/450 (18%)
Query: 2 SVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATL-----QYVTQIKQRTPLV 56
S AQ L S IF Q + P AL T TL +Y T IK +P
Sbjct: 62 SAAQKSLFPYSAHIFQ------QHTKNPAAL----RSSTTTLGRKFGEYYTSIKLGSPGQ 111
Query: 57 PVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSAQCNLANAKACGGG 103
L V G + W+ C V S + K C ++Q +N+
Sbjct: 112 EAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNSQL-CSNSSQGTYA 170
Query: 104 ICGAGPDNPIS-----NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQG 158
C G + + ++G + D L +++ G G+ VTV +F F C ++ L+
Sbjct: 171 YCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVG---GKPVTVQDFAFGC-AQGDLEL 226
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVS 218
+ +G GI GL K+ALP QL F K F+ C D + + G + N
Sbjct: 227 VPTGASGILGLNAGKMALPMQLGQRFGWK--FSHCFP--DRSSHLNSTGVVFFGNAELPH 282
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL--NKTLLSIDNEGVGGT 276
+ ++YT + + F Y + + V +N + L +++ +D+ G +
Sbjct: 283 EQVQYTSVALTNSELQRKF-------YHVALKGVSINSHELVLLPRGSVVILDS---GSS 332
Query: 277 KISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG---ACFRLQDIGFTRIGPVV 333
S V P+ + L +AF P + FG CF++ + + +
Sbjct: 333 FSSFVRPF-------HSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTL 385
Query: 334 PQIDLVLQNK--------NVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQL 385
P + LV ++ V+ + + V+ +C F DGG NP VIG Q
Sbjct: 386 PSLSLVFEDGVTIGIPSIGVLLPVARYQNHVK-----MCFAFEDGGPNPVN--VIGNYQQ 438
Query: 386 ENNLLQFDLATSRLGFSDSLLFERATCTFN 415
+N +++D+ SR+GF+ RA+C +
Sbjct: 439 QNLWVEYDIQRSRVGFA------RASCVID 462
>gi|297740344|emb|CBI30526.3| unnamed protein product [Vitis vinifera]
Length = 379
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 112/251 (44%), Gaps = 32/251 (12%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDVSKNLKY 223
G+ G+ R ++ SQ+ KF+ C+S D G ++ GD NF L Y
Sbjct: 133 GLMGMNRGSLSFVSQMDFP-----KFSYCISDSDFSGVLLLGDA------NFSWLMPLNY 181
Query: 224 TPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNP 283
TPL +++T + V Y + + + V+ K +PL K++ D+ G G T + +
Sbjct: 182 TPLI--QISTPLPYFDR--VAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQ 237
Query: 284 YTVLETSIYKALVQAFASAMPKVARVAPVAPF------GACFRLQDIGFTRIGPVVPQID 337
+T L +Y AL F + ++ RV + C+R+ + T + P +P +
Sbjct: 238 FTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRV-PLSQTSL-PWLPTVS 295
Query: 338 LVLQNKNVVWSIHGQNSMVQIGGD------ALCLGFVDGGVNPRTSIVIGARQLENNLLQ 391
L+ + + + G + ++ G+ C F + + + VIG +N ++
Sbjct: 296 LMFRGAEM--KVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVIGHHHQQNVWME 353
Query: 392 FDLATSRLGFS 402
FDL SR+GF+
Sbjct: 354 FDLEKSRIGFA 364
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 96/398 (24%), Positives = 154/398 (38%), Gaps = 72/398 (18%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------------------EKGYVS 79
T T Y T+I+ TP + V G +ILWV+C KG S
Sbjct: 78 TDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKG--S 135
Query: 80 STNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQST 132
S+ T C C A GG + G + P + +GD D L
Sbjct: 136 SSGSTVSCDQKFC----AATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQV 191
Query: 133 DGRNPGRAVTVPNFIFLCGSEF--VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
G R + IF CG++ L + GI G G+S ++ SQLAAA +K+ F
Sbjct: 192 SGDGQTRHANA-SVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIF 250
Query: 191 ALCLSPFDDGAI-VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV-EYFIG 248
+ CL G I GD V +K TPL P + Y +
Sbjct: 251 SHCLDTIKGGGIFAIGDV---------VQPKVKSTPLV-------------PDMPHYNVN 288
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
+ S++V G + L + + GT I + T L +YK ++ A + P
Sbjct: 289 LESINVGGTTLQLPSHMFETGEK--KGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTT- 345
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV 368
V F C + F + P+I ++ ++ +++ + Q G + C GF
Sbjct: 346 FHSVQDF-LCIQY----FQSVDDGFPKITFHFED-DLGLNVYPHDYFFQNGDNLYCFGFQ 399
Query: 369 DGGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
+GG+ + +++G L N ++ +DL +G++D
Sbjct: 400 NGGLQSKDGKDMVLLGDLVLSNKVVVYDLENQVVGWTD 437
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 96/374 (25%), Positives = 154/374 (41%), Gaps = 70/374 (18%)
Query: 58 VKLTVHLGGNILWVDCE---KGY----------VSSTNKTARCGSAQCNLANAKACGGGI 104
+ + V G ++ WV C+ + Y S + +T C S C G+
Sbjct: 77 MTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGNSGV 136
Query: 105 CGAGPD--NPISNTG----THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQG 158
CG+ P N + N G T G++ ++ L++ +T TV NFIF CG + QG
Sbjct: 137 CGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNT---------TVNNFIFGCGRKN--QG 185
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD---DGAIVFGDGPYYDLNNF 215
L G G+ GLGR+ ++L SQ++ F F+ CL + G++V G N
Sbjct: 186 LFGGASGLVGLGRTDLSLISQISPMFG--GVFSYCLPTTEAEASGSLVMG-------GNS 236
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVE-YFIGVTSVHVNG---KAVPLNKTLLSIDNE 271
V KN TP+ ++ + P + YF+ +T + V G +A K + ID+
Sbjct: 237 SVYKNT--TPISYTRM------IHNPLLPFYFLNLTGITVGGVEVQAPSFGKDRMIIDS- 287
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGP 331
GT IS + P SIY+AL F +CF L G+ +
Sbjct: 288 ---GTVISRLPP------SIYQALKAEFVKQFSGYPSAPSFMILDSCFNLS--GYQEVK- 335
Query: 332 VVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLL 390
+P I + + + + + G V+ +CL I IG Q +N +
Sbjct: 336 -IPDIKMYFEGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVGI-IGNYQQKNQRI 393
Query: 391 QFDLATSRLGFSDS 404
+D S LGF++
Sbjct: 394 IYDTKGSMLGFAEE 407
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 90/385 (23%), Positives = 156/385 (40%), Gaps = 55/385 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
QY TP L V G ++ +V C + G + SST C SA
Sbjct: 33 QYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSSTFTPVPCDSA 92
Query: 91 QCNLANAK---ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFI 147
+C L A C + P S +GD + + + + G + V +
Sbjct: 93 ECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGD-NSSTVGVFAYETATVG-GIRVNHVA 150
Query: 148 FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-----GAI 202
F CG+ QG G+ GLG+ ++ SQ A ++ + KFA CL+ + ++
Sbjct: 151 FGCGNR--NQGSFVSAGGVLGLGQGALSFTSQ--AGYAFENKFAYCLTSYLSPTSVFSSL 206
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
+FGD + +L++TPL N +N PSV Y++ + + G+ + +
Sbjct: 207 IFGD------DMMSTIHDLQFTPLVSNPLN--------PSV-YYVQIVRICFGGETLLIP 251
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+ ID+ G GGT + T Y ++ AF ++P AP +P G +
Sbjct: 252 DSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPR--APPSPQGLPLCVN 309
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG---GVNPRTSIV 379
G P+ P + ++ + + N +++ + CL ++ G N V
Sbjct: 310 VSGIDH--PIYPSFTIEF-DQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGFN-----V 361
Query: 380 IGARQLENNLLQFDLATSRLGFSDS 404
IG +N L+Q+D R+GF+ +
Sbjct: 362 IGNIIQQNYLVQYDREEHRIGFAHA 386
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 96/392 (24%), Positives = 158/392 (40%), Gaps = 58/392 (14%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVD---CEKGYVSS-------------TNK 83
TAT Y T+I+ +P + V G +ILWV+ C+ S +
Sbjct: 80 TATGLYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT 139
Query: 84 TARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTD----GRNPGR 139
T C C +AN+ A G +P T+GD TD + G
Sbjct: 140 TVGCEQEFC-VANSAASGVPPACPSAASPCQFRITYGDGS-STTGFYVTDFVQYNQVSGN 197
Query: 140 AVTVPNFI---FLCGSEF--VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
T P+ + F CG++ L + + GI G G+S ++ SQLAAA +++ FA CL
Sbjct: 198 GQTTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCL 257
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
G I + + N +K TPL N + Y + + + V
Sbjct: 258 DTVRGGGI-------FAIGNVVQPPIVKTTPLVPN------------ATHYNVNLQGISV 298
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
G + L + + D+ GT I + L +Y+ L+ A P +A V
Sbjct: 299 GGATLQLPTS--TFDSGDSKGTIIDSGTTLAYLPREVYRTLLTAVFDKHPDLA-VRNYED 355
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
F CF+ + P I + ++ +++ + + Q G D C+GF+DGGV
Sbjct: 356 F-ICFQFSG----SLDEEFPVITFSFEG-DLTLNVYPHDYLFQNGNDLYCMGFLDGGVQT 409
Query: 375 RTS---IVIGARQLENNLLQFDLATSRLGFSD 403
+ +++G L N L+ +DL +G++D
Sbjct: 410 KDGKDMVLLGDLVLSNKLVVYDLEKQVIGWTD 441
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 88/368 (23%), Positives = 155/368 (42%), Gaps = 39/368 (10%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+Y+ ++ TP V + G +++W C+ T + + + +CG
Sbjct: 107 EYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSS 166
Query: 104 ICGAGPDNPISN----TGTHGDIRID--VLSIQS-TDGRNPGRAVTVPNFIFLCGSEFVL 156
+C A P + S+ ++GD + VL+ ++ T G++ + V+V N F CG +
Sbjct: 167 LCSAVPSSTCSDGCEYVYSYGDYSMTQGVLATETFTFGKSKNK-VSVHNIGFGCGEDNEG 225
Query: 157 QGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLN 213
G G+ GLGR ++L SQL + +F+ CL+P DD ++ G L
Sbjct: 226 DGFEQ-ASGLVGLGRGPLSLVSQLK-----EPRFSYCLTPMDDTKESILLLGS-----LG 274
Query: 214 NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGV 273
+K + TPL N + +PS Y++ + + V + + K+ + ++G
Sbjct: 275 KVKDAKEVVTTPLLKNPL--------QPSF-YYLSLEGISVGDTRLSIEKSTFEVGDDGN 325
Query: 274 GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVV 333
GG I + T +E ++AL + F S CF L G T++ +
Sbjct: 326 GGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLDKTSSTGLDLCFSLPS-GSTQV--EI 382
Query: 334 PQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFD 393
P+I V K + +N M IG L + + G + SI G Q +N L+ D
Sbjct: 383 PKI--VFHFKGGDLELPAENYM--IGDSNLGVACLAMGASSGMSI-FGNVQQQNILVNHD 437
Query: 394 LATSRLGF 401
L + F
Sbjct: 438 LEKETISF 445
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 95/394 (24%), Positives = 153/394 (38%), Gaps = 60/394 (15%)
Query: 28 RPKALVLRVAKDTA--TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE----------- 74
+P+ L V T+ + +Y T++ P + + G +I W+ C+
Sbjct: 1 KPEDLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDP 60
Query: 75 --KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQST 132
SST C S QC+ +C G C N + T GD + +S
Sbjct: 61 IFDPTASSTYAPVTCQSQQCSSLEMSSCRSGQC-LYQVNYGDGSYTFGDFATESVSF--- 116
Query: 133 DGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
G + +V N CG + +GL G G+ GLG ++L +QL A F+
Sbjct: 117 -----GNSGSVKNVALGCGHD--NEGLFVGAAGLLGLGGGPLSLTNQLKAT-----SFSY 164
Query: 193 CLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTS 251
CL D D N+ + + PL N K++T Y++G++
Sbjct: 165 CLVNRDSAG-----SSTLDFNSAQLGVDSVTAPLMKNRKIDTF----------YYVGLSG 209
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+ V G+ V + ++ +D G GG + T L+T Y L AF + +
Sbjct: 210 MSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSA 269
Query: 312 VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDG 370
VA F C+ L R VP + + W++ N ++ + C F
Sbjct: 270 VALFDTCYDLSGQASVR----VPTVSFHFADGK-SWNLPAANYLIPVDSAGTYCFAFA-- 322
Query: 371 GVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
P TS +IG Q + + FDLA +R+GFS
Sbjct: 323 ---PTTSSLSIIGNVQQQGTRVTFDLANNRMGFS 353
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 153/381 (40%), Gaps = 59/381 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK-GYVSSTNKTARCGSAQCNLANAKACGG 102
+Y+ ++ TP + V G ++ WV C G S N + + + ACG
Sbjct: 2 EYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKL-ACGT 60
Query: 103 GICGAGPDNPISNTGT-------------HGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
+C P P+ N T GD D +++ DG N G+ VPNF F
Sbjct: 61 ELCNGLP-YPMCNQTTCVYWYSYGDGSLSTGDFVYDTITM---DGIN-GQKQQVPNFAFG 115
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-----SPFDDGAIVF 204
CG + +G +G GI GLG+ ++ PSQL F+ KF+ CL P ++F
Sbjct: 116 CGHD--NEGSFAGADGILGLGQGPLSFPSQLKTVFN--GKFSYCLVDWLAPPTQTSPLLF 171
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
GD +KY L N KV T Y++ + + V GK + ++
Sbjct: 172 GDAA------VPTFPGVKYISLLTNPKVPTY----------YYVKLNGISVGGKLLNISS 215
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF-ASAMPKVARVAPVAPFGACFRLQ 322
T ID+ G GT + T L +++ ++ A AS M + + C
Sbjct: 216 TAFDIDSVGRAGTIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLG-- 273
Query: 323 DIGFTRIG-PVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRTSIVI 380
GF P VP + + ++ + N + + + C V +P +I I
Sbjct: 274 --GFAEGQLPTVPSMTFHFEGGDM--ELPPSNYFIFLESSQSYCFSMVS---SPDVTI-I 325
Query: 381 GARQLENNLLQFDLATSRLGF 401
G+ Q +N + +D ++GF
Sbjct: 326 GSIQQQNFQVYYDTVGRKIGF 346
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 63/264 (23%), Positives = 118/264 (44%), Gaps = 42/264 (15%)
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDV 217
++ G+ G+ R ++ +Q R+FA C++P + G ++ GD + V
Sbjct: 191 VSEAATGLLGMNRGTLSFVTQTG-----TRRFAYCIAPGEGPGVLLLGD-------DGGV 238
Query: 218 SKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTK 277
+ L YTPL +++ + V Y + + + V +P+ K++L+ D+ G G T
Sbjct: 239 APPLNYTPLI--EISQPLPYFDR--VAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTM 294
Query: 278 ISTVNPYTVLETSIYKALVQAFASAMPKVARV--APVAP--------FGACFRLQDIGFT 327
+ + +T L Y AL F S AR+ AP+ F ACFR +
Sbjct: 295 VDSGTQFTFLLADAYAALKAEFTSQ----ARLLLAPLGEPGFVFQGAFDACFRGPEARVA 350
Query: 328 RIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD---------ALCLGFVDGGVNPRTSI 378
++P++ LVL+ V ++ G+ + + G+ CL F + + ++
Sbjct: 351 AASGLLPEVGLVLRGAEV--AVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAY 408
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
VIG +N +++DL R+GF+
Sbjct: 409 VIGHHHQQNVWVEYDLQNGRVGFA 432
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 147/385 (38%), Gaps = 62/385 (16%)
Query: 41 ATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------------KGYVSST 81
T Y+ + TP + L G ++ W C+ K Y + +
Sbjct: 150 GTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNIS 209
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
+A C S + N+ C C G S+ T G D L++ D
Sbjct: 210 CTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSF-TIGFFAKDKLTLTQND-------- 260
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
F+F CG +GL G+ GLGR +++ Q A F + L S +G
Sbjct: 261 VFDGFMFGCGQN--NKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGH 318
Query: 202 IVFGDGPYYDLNNFDVSKNLK----YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
+ FG+G N SK +K +TP F + TA YFI V + V GK
Sbjct: 319 LTFGNG-----NGVKASKAVKNGITFTP-FASSQGTA---------YYFIDVLGISVGGK 363
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
A+ ++ L GT I + T L ++ Y +L AF M K ++
Sbjct: 364 ALSISPMLFQ-----NAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDT 418
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT 376
C+ L + +T I +P+I N NV +G ++ G +CL F G +
Sbjct: 419 CYDLSN--YTSIS--IPKISFNFNGNANVELDPNG--ILITNGASQVCLAFAGNGDDDSI 472
Query: 377 SIVIGARQLENNLLQFDLATSRLGF 401
I G Q + + +D+A +LGF
Sbjct: 473 GI-FGNIQQQTLEVVYDVAGGQLGF 496
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 108/447 (24%), Positives = 178/447 (39%), Gaps = 79/447 (17%)
Query: 2 SVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATL-----QYVTQIKQRTPLV 56
S AQ L S IF Q + P AL T TL +Y T IK +P
Sbjct: 62 SAAQKSLFPYSAHIFQ------QHTKNPAAL----RSSTTTLGRKFGEYYTSIKLGSPGQ 111
Query: 57 PVKLTVHLGGNILWVDCEKGYVSSTN-----KTARCGSAQ---CNLANAKACGG------ 102
L V G + W+ C V + + AR S + CN N++ C
Sbjct: 112 EAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCN--NSQLCSNSSQGTY 169
Query: 103 GICGAGPDNPIS-----NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQ 157
C G + + ++G + D L +++ G G+ VTV +F F C ++ L+
Sbjct: 170 AYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVG---GKPVTVQDFAFGC-AQGDLE 225
Query: 158 GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDV 217
+ +G GI GL K+ALP QL F K F+ C D + + G + N
Sbjct: 226 LVPTGASGILGLNAGKMALPMQLGQRFGWK--FSHCFP--DRSSHLNSTGVVFFGNAELP 281
Query: 218 SKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL--NKTLLSIDNEGVGG 275
+ ++YT + + F Y + + V +N + +++ +D+ G
Sbjct: 282 HEQVQYTSVALTNSELQRKF-------YHVALKGVSINSHELVFLPRGSVVILDS---GS 331
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG---ACFRLQDIGFTRIGPV 332
+ S V P+ + L +AF P + FG CF++ + +
Sbjct: 332 SFSSFVRPF-------HSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRT 384
Query: 333 VPQIDLVLQNKNVVWSIHGQNSMVQIG----GDALCLGFVDGGVNPRTSIVIGARQLENN 388
+P + LV ++ V I ++ + +C F DGG NP VIG Q +N
Sbjct: 385 LPSLSLVFED-GVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPNPVN--VIGNYQQQNL 441
Query: 389 LLQFDLATSRLGFSDSLLFERATCTFN 415
+++D+ SR+GF+ RA+C +
Sbjct: 442 WVEYDIQRSRVGFA------RASCVID 462
>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
Length = 431
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 63/264 (23%), Positives = 118/264 (44%), Gaps = 42/264 (15%)
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDV 217
++ G+ G+ R ++ +Q R+FA C++P + G ++ GD + V
Sbjct: 175 VSEAATGLLGMNRGTLSFVTQTG-----TRRFAYCIAPGEGPGVLLLGD-------DGGV 222
Query: 218 SKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTK 277
+ L YTPL +++ + V Y + + + V +P+ K++L+ D+ G G T
Sbjct: 223 APPLNYTPLI--EISQPLPYFDR--VAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTM 278
Query: 278 ISTVNPYTVLETSIYKALVQAFASAMPKVARV--APVAP--------FGACFRLQDIGFT 327
+ + +T L Y AL F S AR+ AP+ F ACFR +
Sbjct: 279 VDSGTQFTFLLADAYAALKAEFTSQ----ARLLLAPLGEPGFVFQGAFDACFRGPEARVA 334
Query: 328 RIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD---------ALCLGFVDGGVNPRTSI 378
++P++ LVL+ V ++ G+ + + G+ CL F + + ++
Sbjct: 335 AASGLLPEVGLVLRGAEV--AVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAY 392
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
VIG +N +++DL R+GF+
Sbjct: 393 VIGHHHQQNVWVEYDLQNGRVGFA 416
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 144/387 (37%), Gaps = 75/387 (19%)
Query: 41 ATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC---------EKGYV-----SSTNKTAR 86
+T YV + TP + G + WV C +KG + SST
Sbjct: 159 STGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVS 218
Query: 87 CGSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA 140
C + C + C GG C G G + T G D L+I
Sbjct: 219 CTDSACADLDTNGCTGGHCLYAVQYGDG-------SYTVGFFAQDTLTIAHD-------- 263
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+ F F CG + GL G+ GLGR K +L Q + FA CL
Sbjct: 264 -AIKGFRFGCGEKN--NGLFGKTAGLMGLGRGKTSLTVQAYNKYG--GAFAYCLP----- 313
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
A+ G G Y D N + TP+ +K T Y++G+T + V G+ VP
Sbjct: 314 ALTTGTG-YLDFGPGSAGNNARLTPMLTDKGQTF----------YYVGMTGIRVGGQQVP 362
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----FG 316
+ +++ S GT + + T L + Y AL AF M +AR AP
Sbjct: 363 VAESVFS-----TAGTLVDSGTVITRLPATAYTALSSAFDKVM--LARGYKKAPGYSILD 415
Query: 317 ACFRLQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
C+ FT + V +P + LV Q + + I +CL F G +
Sbjct: 416 TCYD-----FTGLSDVELPTVSLVFQG-GACLDVDVSGIVYAISEAQVCLAFASNGDDES 469
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFS 402
+IV +Q +L +DL +GF+
Sbjct: 470 VAIVGNTQQKTYGVL-YDLGKKTVGFA 495
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 138/348 (39%), Gaps = 72/348 (20%)
Query: 79 SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGT-HGDIRIDVLSIQSTDGRNP 137
SST + RCG+ QC+ A A +C GG+ + N T + D L++
Sbjct: 152 SSTYRPVRCGAPQCSQAPAPSCPGGLGSSCAFNLSYAASTFQALLGQDALALHDD----- 206
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
V + F C V+ G + G+ G GR ++ PSQ + F+ CL
Sbjct: 207 --VDAVAAYTFGC--LHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYG--SVFSYCL--- 257
Query: 198 DDGAIVFGDGPYYDLNNFDVS---------KNLKYTPLFINKVNTASGFLGEPSVEYFIG 248
P Y +NF + K +K TPL N PS+ Y++
Sbjct: 258 ----------PSYKSSNFSGTLRLGPAGQPKRIKTTPLLSNP--------HRPSL-YYVN 298
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
+ + V G+ VP+ + L+ D GT + +T L +Y A+ F S + +
Sbjct: 299 MVGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRV-RAPV 357
Query: 309 VAPVAPFGACFR----LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALC 364
P+ F C+ + + F+ G ++ + L +NVV + G C
Sbjct: 358 AGPLGGFDTCYNVTISVPTVTFSFDG----RVSVTLPEENVV--------IRSSSGGIAC 405
Query: 365 LGF-------VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSL 405
L VD +N V+ + Q +N+ + FD+A R+GFS L
Sbjct: 406 LAMAAGPPDGVDAALN-----VLASMQQQNHRVLFDVANGRVGFSREL 448
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 92/376 (24%), Positives = 152/376 (40%), Gaps = 62/376 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVS--------------STNKTARCGSA 90
YVT++ TP + V G ++ W+ C VS ST + C +
Sbjct: 122 YVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQ 181
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ + C + N ++GD V + S D + G + ++PNF + C
Sbjct: 182 QCSDLPSATLNPSACSS--SNVCIYQASYGDSSFSVGYL-SKDTVSFG-STSLPNFYYGC 237
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFDDGAIVFGD 206
G + +GL G+ GL R+K++L QLA SL F CL S +
Sbjct: 238 GQDN--EGLFGRSAGLIGLARNKLSLLYQLAP--SLGYSFTYCLPSSSSSGYLSLGSYNP 293
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
G Y YTP+ + ++ + YFI ++ + V G N +
Sbjct: 294 GQY------------SYTPMVSSSLDDS---------LYFIKLSGMTVAG-----NPLSV 327
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGF 326
S T I + T L TS+Y AL +A A+AM +R + + CF+ Q
Sbjct: 328 SSSAYSSLPTIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQA--- 384
Query: 327 TRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLE 386
+R+ P + + + QN +V + CL F R++ +IG Q +
Sbjct: 385 SRVS--APAVTMSFAG-GAALKLSAQNLLVDVDDSTTCLAFAPA----RSAAIIGNTQQQ 437
Query: 387 NNLLQFDLATSRLGFS 402
+ +D+ +SR+GF+
Sbjct: 438 TFSVVYDVKSSRIGFA 453
>gi|388503026|gb|AFK39579.1| unknown [Lotus japonicus]
Length = 79
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/79 (54%), Positives = 46/79 (58%), Gaps = 17/79 (21%)
Query: 355 MVQIGGDALCLGFVDGGVNPR--------------TSIVIGARQLENNLLQFDLATSRLG 400
MVQ D +CLGFVD G NP+ TSI IGA QLENNLL+FDLA SRLG
Sbjct: 1 MVQFD-DVICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLG 59
Query: 401 FSDSLLFERATC-TFNFTS 418
F SL E C F FTS
Sbjct: 60 FR-SLFLEHDNCQNFRFTS 77
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 71/291 (24%), Positives = 112/291 (38%), Gaps = 60/291 (20%)
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA-AFSLKRKFALCLSPFDDGAIV 203
NF F C S G+AG GR ++LP+QLA + L +F+ CL
Sbjct: 203 NFTFGCA-----HTTFSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVS------- 250
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE---------------YFIG 248
++F + K +PL + + N G+ VE Y +G
Sbjct: 251 ---------HSFRSERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSYFYTVG 301
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
+ + V K VP K L ++ +G GG + + +T+L Y ++V+ F K R
Sbjct: 302 LKGISVGKKTVPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNR 361
Query: 309 VAPVAP----FGACFRLQDIGFTRIGPVVPQIDL---------VLQNKNVVWSIHGQNSM 355
AP C+ L +VP + L VL KN +
Sbjct: 362 RAPEIEQKTGLSPCYYLNTAA------IVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDG 415
Query: 356 VQIGGDALCLGFVDGGVNPRTS----IVIGARQLENNLLQFDLATSRLGFS 402
V+ CL F++GG S V+G Q + +++DL R+GF+
Sbjct: 416 VRRKERVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFA 466
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 97/385 (25%), Positives = 155/385 (40%), Gaps = 58/385 (15%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCG 88
T +Y+ + TP PV+LT+ G +++W C+ SST C
Sbjct: 32 TTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCD 91
Query: 89 SAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
S C +CG P+ T ++GD + ++ G +VP F
Sbjct: 92 STLCQGLPVASCGSP--KFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAF 149
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI---VFG 205
CG F S GIAG GR ++LPSQL F+ C + GAI V
Sbjct: 150 GCG-LFNNGVFKSNETGIAGFGRGPLSLPSQLKVG-----NFSHCFTTI-TGAIPSTVLL 202
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
D P +N + ++ TPL N A+ L Y++ + + V +P+ ++
Sbjct: 203 DLPADLFSNGQGA--VQTTPLIQYAKNEANPTL------YYLSLKGITVGSTRLPVPESA 254
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA-----CFR 320
++ N G GGT I + T L +Y+ + FA+ + PV P A CF
Sbjct: 255 FALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI-----KLPVVPGNATGHYTCFS 308
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA----LCLGFVDGGVNPRT 376
++ P VP+ LVL + + +N + ++ DA +CL G
Sbjct: 309 AP----SQAKPDVPK--LVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKG----DE 358
Query: 377 SIVIGARQLENNLLQFDLATSRLGF 401
+ +IG Q +N + +DL + L F
Sbjct: 359 TTIIGNFQQQNMHVLYDLQNNMLSF 383
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 118/256 (46%), Gaps = 32/256 (12%)
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSK 219
S G+ G+ R ++ +Q+ +KF+ C+S D G ++FG+ ++F K
Sbjct: 186 SKTTGLIGMNRGSLSFVTQMGL-----QKFSYCISGQDSSGILLFGE------SSFSWLK 234
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
LKYTPL +++T + V Y + + + V + L K++ + D+ G G T +
Sbjct: 235 ALKYTPLV--QISTPLPYFDR--VAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVD 290
Query: 280 TVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF---GA---CFRLQDIGFTRIGPVV 333
+ +T L +Y AL F +V F GA C+R+ R P +
Sbjct: 291 SGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLT--RRTLPPL 348
Query: 334 PQIDLVLQNKNVVWSIHGQNSMVQI-----GGDAL-CLGFVDGGVNPRTSIVIGARQLEN 387
P + L+ + + S+ + M ++ G D++ C F + + S +IG +N
Sbjct: 349 PTVTLMFRGAEM--SVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQN 406
Query: 388 NLLQFDLATSRLGFSD 403
++FDLA SR+GF++
Sbjct: 407 VWMEFDLAKSRVGFAE 422
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 146/382 (38%), Gaps = 66/382 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV + TP + G + WV C+ V SST C
Sbjct: 176 TGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSC 235
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPN 145
+ C+ + + C GG C G G++ G +D L++ S D V
Sbjct: 236 AAPACSDLDTRGCSGGHCLYGVQY---GDGSYSIGFFAMDTLTLSSYD--------AVKG 284
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
F F CG +GL G+ GLGR K +LP Q + FA CL G
Sbjct: 285 FRFGCGERN--EGLFGEAAGLLGLGRGKTSLPVQTYDKYG--GVFAHCLPARSTGT---- 336
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
Y D + L TP+ ++ T Y++G+T + V G+ + + +++
Sbjct: 337 --GYLDFGAGSPAARLTTTPMLVDNGPTF----------YYVGLTGIRVGGRLLYIPQSV 384
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM-PKVARVAP-VAPFGACFRLQD 323
+ GT + + T L + Y +L AFA+AM + + AP V+ C+
Sbjct: 385 FA-----TAGTIVDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDF-- 437
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV---DGGVNPRTSIVI 380
G +++ +P + L+ Q + M +CL F DGG ++
Sbjct: 438 AGMSQV--AIPTVSLLFQG-GARLDVDASGIMYAASASQVCLAFAANEDGG----DVGIV 490
Query: 381 GARQLENNLLQFDLATSRLGFS 402
G QL+ + +D+ + FS
Sbjct: 491 GNTQLKTFGVAYDIGKKVVSFS 512
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 141/387 (36%), Gaps = 75/387 (19%)
Query: 41 ATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTAR 86
+T YV + TP + G + WV C V SST
Sbjct: 159 STGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVS 218
Query: 87 CGSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA 140
C + C + C GG C G G + T G D L+I
Sbjct: 219 CTDSACADLDTNGCTGGHCLYAVQYGDG-------SYTVGFFAQDTLTIAHD-------- 263
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+ F F CG + GL G+ GLGR K +L Q + FA CL
Sbjct: 264 -AIKGFRFGCGEKN--NGLFGKTAGLMGLGRGKTSLTVQAYNKYG--GAFAYCLP----- 313
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
A+ G G Y D N + TP+ +K T Y++G+T + V G+ VP
Sbjct: 314 ALTTGTG-YLDFGPGSAGNNARLTPMLTDKGQTF----------YYVGMTGIRVGGQQVP 362
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----FG 316
+ +++ S GT + + T L + Y AL AF M +AR AP
Sbjct: 363 VAESVFS-----TAGTLVDSGTVITRLPATAYTALSSAFDKVM--LARGYKKAPGYSILD 415
Query: 317 ACFRLQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
C+ FT + V +P + LV Q + + I +CL F G +
Sbjct: 416 TCYD-----FTGLSDVELPTVSLVFQG-GACLDVDVSGIVYAISEAQVCLAFASNGDDES 469
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFS 402
+IV +Q +L +DL +GF+
Sbjct: 470 VAIVGNTQQKTYGVL-YDLGKKTVGFA 495
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 118/256 (46%), Gaps = 32/256 (12%)
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSK 219
S G+ G+ R ++ +Q+ +KF+ C+S D G ++FG+ ++F K
Sbjct: 179 SKTTGLIGMNRGSLSFVTQMGL-----QKFSYCISGQDSSGILLFGE------SSFSWLK 227
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
LKYTPL +++T + V Y + + + V + L K++ + D+ G G T +
Sbjct: 228 ALKYTPLV--QISTPLPYFDR--VAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVD 283
Query: 280 TVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF---GA---CFRLQDIGFTRIGPVV 333
+ +T L +Y AL F +V F GA C+R+ R P +
Sbjct: 284 SGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLT--RRTLPPL 341
Query: 334 PQIDLVLQNKNVVWSIHGQNSMVQI-----GGDAL-CLGFVDGGVNPRTSIVIGARQLEN 387
P + L+ + + S+ + M ++ G D++ C F + + S +IG +N
Sbjct: 342 PTVTLMFRGAEM--SVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQN 399
Query: 388 NLLQFDLATSRLGFSD 403
++FDLA SR+GF++
Sbjct: 400 VWMEFDLAKSRVGFAE 415
>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
Length = 609
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 71/260 (27%), Positives = 106/260 (40%), Gaps = 34/260 (13%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDG----AIVFGDGPYYDLNNFDVS 218
GIAG GR +LP Q+ +KF+ CL FDD + GP D + D +
Sbjct: 229 GIAGFGRGPSSLPKQMGL-----KKFSYCLLSHRFDDSPKSSKMTLYVGP--DSKD-DKT 280
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
L YTP N V++ S F Y++ + + V K V + + ++G GGT +
Sbjct: 281 GGLSYTPFRKNPVSSNSAF----KEYYYVTLRHIIVGDKRVKXPYSFMVAGSDGNGGTIV 336
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA---CFRLQDIGFTRIGPVVPQ 335
+ + +T +E +++A+ F M R A V CF L +G V
Sbjct: 337 DSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLSGVG------SVAL 390
Query: 336 IDLVLQNKNVVWSIHGQNSMVQIGGD--ALCLGFVDGGVNPRT-----SIVIGARQLENN 388
LV Q K + + GD LCL V T SI++G Q +N
Sbjct: 391 PSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQNF 450
Query: 389 LLQFDLATSRLGFSDSLLFE 408
++DL R GF F+
Sbjct: 451 YTEYDLENERFGFRRQRCFQ 470
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 64/228 (28%), Positives = 103/228 (45%), Gaps = 48/228 (21%)
Query: 115 NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKV 174
++ T GD ++ L+++S+ G + PNF F CG + G G GI GLG+ K+
Sbjct: 88 SSSTQGDFALETLTLRSSGGSSKA----FPNFQFGCGR--LNSGSFGGAAGIVGLGQGKI 141
Query: 175 ALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPL-FINKVNT 233
+L +QL +A + KF+ CL FDD D SK +PL F + +T
Sbjct: 142 SLSTQLGSA--INNKFSYCLVDFDD----------------DSSKT---SPLIFGSSAST 180
Query: 234 ASGFLGEP-------SVEYFIGVTSVHVNGKAVPLNK---TLLSIDNE----------GV 273
SG + P S YF+G+ + V GK + L LS+ ++
Sbjct: 181 GSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVNS 240
Query: 274 GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
GGT + T+L+ ++Y + AFAS++ A + F C+ +
Sbjct: 241 GGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDV 288
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 120/267 (44%), Gaps = 32/267 (11%)
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGP 208
C S S G+ G+ R ++ +Q+ +KF+ C+S D G ++FG+
Sbjct: 426 CTSPTCRTRTHSKTTGLIGMNRGSLSFVTQMGL-----QKFSYCISGQDSSGILLFGE-- 478
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
++F K LKYTPL +++T + V Y + + + V + L K++ +
Sbjct: 479 ----SSFSWLKALKYTPLV--QISTPLPYFDR--VAYTVQLEGIKVANSMLQLPKSVYAP 530
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF---GA---CFRLQ 322
D+ G G T + + +T L +Y AL F +V F GA C+R+
Sbjct: 531 DHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVP 590
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-----GGDAL-CLGFVDGGVNPRT 376
R P +P + L+ + + S+ + M ++ G D++ C F + +
Sbjct: 591 LT--RRTLPPLPTVTLMFRGAEM--SVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVE 646
Query: 377 SIVIGARQLENNLLQFDLATSRLGFSD 403
S +IG +N ++FDLA SR+GF++
Sbjct: 647 SYIIGHHHQQNVWMEFDLAKSRVGFAE 673
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 90/390 (23%), Positives = 151/390 (38%), Gaps = 58/390 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--------------GYVSSTNKTARCGS 89
QY ++ P + L G +++WV C SST A C
Sbjct: 83 QYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYD 142
Query: 90 AQCNLANAKACGGGICGAGPDNPISNTGTH------GDIRIDVLSIQSTDGR-NPGRAVT 142
C L K IC I +T + G + + + ++T + + G+
Sbjct: 143 PVCRLV-PKPDRAPICN---HTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEAR 198
Query: 143 VPNFIFLCGSEFVLQGLA----SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL---- 194
+ + F CG Q ++ +G G+ GLGR ++ SQL F KF+ CL
Sbjct: 199 LKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFG--NKFSYCLMDYT 256
Query: 195 -SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
SP ++ G+G D L +TPL N ++ Y++ + SV
Sbjct: 257 LSPPPTSYLIIGNG-------GDGISKLFFTPLLTNPLSP---------TFYYVKLKSVF 300
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
VNG + ++ ++ ID+ G GGT + + L Y++++ A + A
Sbjct: 301 VNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTP 360
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN 373
F C + G T+ ++P++ V+ +N ++ CL V+
Sbjct: 361 GFDLCVNVS--GVTKPEKILPRLKFEFSG-GAVFVPPPRNYFIETEEQIQCLAIQS--VD 415
Query: 374 PRTSI-VIGARQLENNLLQFDLATSRLGFS 402
P+ VIG + L +FD SRLGFS
Sbjct: 416 PKVGFSVIGNLMQQGFLFEFDRDRSRLGFS 445
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 93/393 (23%), Positives = 164/393 (41%), Gaps = 58/393 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y+ + TP ++ + G ++ W+ C ++G V SS+ + CG
Sbjct: 150 EYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDH 209
Query: 91 QCN----LANAKACGGGICGAGPDNPI-------SNTGTHGDIRIDVLSIQSTDGRNPGR 139
+C +A C ++P + T GD+ ++ ++ T PG
Sbjct: 210 RCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLT---APGA 266
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
+ V +F CG +GL G G+ GLGR ++ SQL A + F+ CL D
Sbjct: 267 SRRVDGVVFGCGHRN--RGLFHGAAGLLGLGRGPLSFASQLRAVYG--HTFSYCL--VDH 320
Query: 200 GA-----IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
G+ +VFG+ D LKYT +++ Y++ + V V
Sbjct: 321 GSDVGSKVVFGEDD--DALALAAHPQLKYTAFAPASSSSSPA-----DTFYYVKLKGVLV 373
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
G+ + ++ + +G GGT I + + Y+ + AF M + + P P
Sbjct: 374 GGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFP 433
Query: 315 -FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---GGDALCLGFVDG 370
C+ + G R P VP++ L+ + VW +N +++ GG +CL +
Sbjct: 434 VLSPCYNVS--GVER--PEVPELSLLFAD-GAVWDFPAENYFIRLDPDGGSIMCLAVLG- 487
Query: 371 GVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
PRT + +IG Q +N + +DL +RLGF+
Sbjct: 488 --TPRTGMSIIGNFQQQNFHVVYDLQNNRLGFA 518
>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
Length = 452
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 153/364 (42%), Gaps = 63/364 (17%)
Query: 70 WVDCEKGYVSSTNKTARCGSAQCN--LANAKACGGGICGAGPDNPISNTG-THGDIRIDV 126
WV S+ N +C A C+ AN A +C P + +G T G + D
Sbjct: 128 WVH------SAANLATKCRRAPCSPGAANCPAAASNVCP--PYAVVYGSGSTAGLLIADT 179
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
L R PGRAV P F+ C V Q + G+AG GR ++P+QL
Sbjct: 180 L-------RAPGRAV--PGFVLGCSLVSVHQPPS----GLAGFGRGAPSVPAQLGLP--- 223
Query: 187 KRKFALCL--SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE 244
KF+ CL FDD A V G L + ++Y PL V +A+G V
Sbjct: 224 --KFSYCLLSRRFDDNAAVSGS---LVLGGTGGGEGMQYVPL----VKSAAGDKLPYGVY 274
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y++ + V V GKAV L + + G GGT + + +T L+ ++++ + A +A+
Sbjct: 275 YYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVG 334
Query: 305 KVARVAPVAP----FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
+ + A CF L + +P++ + V + +N V G
Sbjct: 335 GRYKRSKDAEDELGLHPCFALPQGARSM---ALPELSFHFEG-GAVMQLPVENYFVVAGR 390
Query: 361 ---DALCLGFVD--------GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFER 409
+A+CL V G +I++G+ Q +N L+++DL RLG F R
Sbjct: 391 GAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLG------FRR 444
Query: 410 ATCT 413
+CT
Sbjct: 445 QSCT 448
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 103/426 (24%), Positives = 167/426 (39%), Gaps = 77/426 (18%)
Query: 15 IFSLSPITAQTSFRPKALVLRVAKDTATLQYVT--QIKQRTPLVPVKLTVHLGGNILWVD 72
IFS + I A S P + +R+ TL Y+ +I R + + V G ++ WV
Sbjct: 41 IFSGNNIDALDSQIPLSSGVRLQ----TLNYIVTVEIGGRN----MTVIVDTGSDLTWVQ 92
Query: 73 CEKGYV-------------SSTNKTARCGSAQCNLANAKACGGGICGAGPD--NPISNTG 117
C+ + S + +T C S+ C G+CG+ N + N G
Sbjct: 93 CQPCRLCYNQQDPLFNPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYG 152
Query: 118 ----THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSK 173
T GD+ ++ L++ +T V NFIF CG +GL G G+ GLG+S
Sbjct: 153 DGSYTRGDLGMEQLNLGTTH---------VSNFIFGCGRNN--KGLFGGASGLMGLGKSD 201
Query: 174 VALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVN 232
++L SQ +A F + L + D G+++ G N V KN TP+ ++
Sbjct: 202 LSLVSQTSAIFEGVFSYCLPTTAADASGSLILG-------GNSSVYKNT--TPISYTRM- 251
Query: 233 TASGFLGEPSVE--YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETS 290
+ P + YF+ +T + + G A L N G I + T L
Sbjct: 252 -----IANPQLPTFYFLNLTGISIGGVA-------LQAPNYRQSGILIDSGTVITRLPPP 299
Query: 291 IYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSI 349
+Y+ L F P + CF L G+ + +P I + + N + +
Sbjct: 300 VYRDLKAEFLKQFSGFPSAPPFSILDTCFNLN--GYDEVD--IPTIRMQFEGNAELTVDV 355
Query: 350 HGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFER 409
G V+ +CL + I IG Q N + ++ S+LGF+
Sbjct: 356 TGIFYFVKTDASQVCLALASLSFDDEIPI-IGNYQQRNQRVIYNTKESKLGFA------A 408
Query: 410 ATCTFN 415
C+FN
Sbjct: 409 EACSFN 414
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 102/421 (24%), Positives = 160/421 (38%), Gaps = 67/421 (15%)
Query: 10 LCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNIL 69
+ + L+FS S A P A +Y+ + TP PV+L + G +++
Sbjct: 384 MAARLLFSASGRAASARVDPGPY----ANGVPDTEYLVHLAIGTPPQPVQLILDTGSDLV 439
Query: 70 WVDCEKGYV-------------SSTNKTARCGSAQCNLANAKACGGGICG--------AG 108
W C V SST C S C+ +CG G A
Sbjct: 440 WTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAY 499
Query: 109 PDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAG 168
D I T G + + + + DG G+A TVP+ F CG F S GIAG
Sbjct: 500 ADGSI----TTGHLDAETFTFAAADGT--GQA-TVPDLAFGCGL-FNNGIFTSNETGIAG 551
Query: 169 LGRSKVALPSQLAAAFSLKRKFALCLSPF---DDGAIVFGDGPYYDLNNF-DVSKNLKYT 224
GR ++LPSQL F+ C + + +++ G N + D ++ T
Sbjct: 552 FGRGALSLPSQLKV-----DNFSHCFTAITGSEPSSVLLG----LPANLYSDADGAVQST 602
Query: 225 PLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPY 284
PL N + + Y++ + + V +P+ ++ ++ +G GGT I +
Sbjct: 603 PLVQNFSSLRA---------YYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGM 653
Query: 285 TVLETSIYKALVQAFASAMPKVARVAPVAPFGA-CFRLQDIGFTRIGPVVPQIDLVLQNK 343
T L YK + AF + + A + CF R P VP+ LVL +
Sbjct: 654 TTLPQDAYKLVHDAFTAQVRLPVDNATSSSLSRLCFSFSVP--RRAKPDVPK--LVLHFE 709
Query: 344 NVVWSIHGQNSMVQI---GGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLG 400
+ +N M + GG CL G +IG Q +N + +DL + L
Sbjct: 710 GATLDLPRENYMFEFEDAGGSVTCLAINAGD----DLTIIGNYQQQNLHVLYDLVRNMLS 765
Query: 401 F 401
F
Sbjct: 766 F 766
>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 469
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 70/253 (27%), Positives = 105/253 (41%), Gaps = 34/253 (13%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDG----AIVFGDGPYYDLNNFDVS 218
GIAG GR +LP Q+ +KF+ CL FDD + GP D + D +
Sbjct: 229 GIAGFGRGPSSLPKQMGL-----KKFSYCLLSHRFDDSPKSSKMTLYVGP--DSKD-DKT 280
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
L YTP N V++ S F Y++ + + V K V + + + ++G GGT +
Sbjct: 281 GGLSYTPFRKNPVSSNSAF----KEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIV 336
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA---CFRLQDIGFTRIGPVVPQ 335
+ + +T +E +++A+ F M R A V CF L +G V
Sbjct: 337 DSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLSGVG------SVAL 390
Query: 336 IDLVLQNKNVVWSIHGQNSMVQIGGD--ALCLGFVDGGVNPRT-----SIVIGARQLENN 388
LV Q K + + GD LCL V T SI++G Q +N
Sbjct: 391 PSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQNF 450
Query: 389 LLQFDLATSRLGF 401
++DL R GF
Sbjct: 451 YTEYDLENERFGF 463
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 114/258 (44%), Gaps = 43/258 (16%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNLKY 223
G+ G+ R ++ +QL KF+ C+S D G ++FGD ++ NL Y
Sbjct: 167 GLMGMNRGSLSFVTQLGLP-----KFSYCISGRDSSGVLLFGD------SHLSWLGNLTY 215
Query: 224 TPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNP 283
TPL +++T + V Y + + + V K +PL K++ + D+ G G T + +
Sbjct: 216 TPLV--QISTPLPYFDR--VAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQ 271
Query: 284 YTVLETSIYKALVQAFASAMPKVARVAPVAPFG-----------ACFRLQDIGFTRIGPV 332
+T L +Y AL F V +AP G C+R+ G P
Sbjct: 272 FTFLLGPVYTALRNEFLEQTKGV-----LAPLGDPNFVFQGAMDLCYRVPAGGKL---PE 323
Query: 333 VPQIDLVLQNKNVVWSIHGQNSMVQIGGD------ALCLGFVDGGVNPRTSIVIGARQLE 386
+P + L+ + +V + G+ + ++ G CL F + + + VIG +
Sbjct: 324 LPAVSLMFRGAEMV--VGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVIGHHHQQ 381
Query: 387 NNLLQFDLATSRLGFSDS 404
N ++FDL SR+GF ++
Sbjct: 382 NVWMEFDLVKSRVGFVET 399
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 105/409 (25%), Positives = 171/409 (41%), Gaps = 66/409 (16%)
Query: 17 SLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG 76
+L+P A T+ R +L +A D +Y+ ++ TP + G +++W C
Sbjct: 69 ALAPGDAITAAR----ILVLASDG---EYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC 121
Query: 77 YV-------------SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDI- 122
+ S+T ++ C S CN C +C +GD
Sbjct: 122 LLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQKVC--------VYQYFYGDSA 173
Query: 123 -RIDVLSIQS-TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQL 180
VL+ ++ T G N R V++P F CG+ + GL + G+ G GR ++L SQL
Sbjct: 174 STAGVLANETFTFGTNETR-VSLPGISFGCGN--LNAGLLANGSGMVGFGRGSLSLVSQL 230
Query: 181 AAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNTASG 236
+ +F+ CL+ F + FG Y LN+ + S ++ TP +N
Sbjct: 231 GSP-----RFSYCLTSFLSPVPSRLYFGV--YATLNSTNASSEPVQSTPFVVNP------ 277
Query: 237 FLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI-DNEGVGGTKISTVNPYTVLETSIYKAL 295
P++ YF+ +T + V G +P++ + +I D +G GGT I + T L Y A+
Sbjct: 278 --ALPTM-YFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAV 334
Query: 296 VQAFASAMP-KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNS 354
AFAS + + V + CF+ R +PQ LVL W + QN
Sbjct: 335 RAAFASQITLPLLNVTDASVLDTCFQWPPP--PRQSVTLPQ--LVLHFDGADWELPLQNY 390
Query: 355 MV--QIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
M+ G LCL +IG+ Q +N + +DL S + F
Sbjct: 391 MLVDPSTGGGLCLAMASS----SDGSIIGSYQHQNFNVLYDLENSLMSF 435
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 148/374 (39%), Gaps = 59/374 (15%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG----------YVSSTNKTARCGSAQ 91
T Y+ I +P + L G ++ W C Y + + T C S
Sbjct: 131 TGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAETFDPTKSTSYANVSCSTPLCSSVI 190
Query: 92 CNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
N C C G G++ G + + L+I STD N NF F
Sbjct: 191 SATGNPSRCAASTCVYGIQY---GDGSYSIGFLGKERLTIGSTDIFN--------NFYFG 239
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFDDGAIVFGDGP 208
CG + + GL G+ GLGR K+++ SQ A ++ + F+ CL S G + FG
Sbjct: 240 CGQD--VDGLFGKAAGLLGLGRDKLSVVSQTAPKYN--QLFSYCLPSSSSTGFLSFGSSQ 295
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
SK+ K+TPL +SG PS Y + +T + V G+ + + ++ S
Sbjct: 296 ---------SKSAKFTPL-------SSG----PSSFYNLDLTGITVGGQKLAIPLSVFS- 334
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTR 328
GT I + T L + Y AL AF AM P++ C+ +
Sbjct: 335 ----TAGTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIK 390
Query: 329 IGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENN 388
VP+I ++ + V + V G +CL F G R + + G Q N
Sbjct: 391 ----VPKI-VISFSGGVDVDVDQAGIFVANGLKQVCLAFA-GNTGARDTAIFGNTQQRNF 444
Query: 389 LLQFDLATSRLGFS 402
+ +D++ ++GF+
Sbjct: 445 EVVYDVSGGKVGFA 458
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 106/389 (27%), Positives = 163/389 (41%), Gaps = 79/389 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC---EKGYVSST---NKTAR-------CGSA 90
+Y T++ TP V + + G +++W+ C +K Y + N T CGS
Sbjct: 146 EYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSP 205
Query: 91 QCNLANAKACG--GGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C ++ C IC G G + T+G+ + L+ R
Sbjct: 206 LCRRLDSPGCSTKKHICLYQVSYGDG-------SFTYGEFSTETLTF---------RGTR 249
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA- 201
V CG + +GL G G+ GLGR +++ PSQ+ FS RKF+ CL D A
Sbjct: 250 VGRVALGCGHD--NEGLFIGAAGLLGLGRGRLSFPSQIGRRFS--RKFSYCL--VDRSAS 303
Query: 202 -----IVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVN 255
+VFGD +S+ ++TPL N K++T Y++ + V V
Sbjct: 304 SKPSYMVFGDSA--------ISRTARFTPLVSNPKLDTF----------YYVELLGVSVG 345
Query: 256 GKAVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
G VP + +L +D+ G GG I + T L Y AL AF + R +
Sbjct: 346 GTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSL 405
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVN 373
F CF L G T + VP + L + +V S+ N ++ + + C F G
Sbjct: 406 FDTCFDLS--GKTEVK--VPTVVLHFRGADV--SLPASNYLIPVDNSGSFCFAFA--GTM 457
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFS 402
SIV G Q + + +DLA SR+GF+
Sbjct: 458 SGLSIV-GNIQQQGFRVVYDLAASRVGFA 485
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 86/375 (22%), Positives = 154/375 (41%), Gaps = 56/375 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y +++ +P + + + G ++ WV C+ +S++ + C +
Sbjct: 166 EYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNP 225
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
+C+ +A AC GA G++ GD + L++ G + V +
Sbjct: 226 RCHDLDAAACRNST-GACLYEVAYGDGSYTVGDFATETLTL--------GDSAPVSSVAI 276
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGP 208
CG + +GL G G+ LG ++ PSQ++A F+ CL V D P
Sbjct: 277 GCGHDN--EGLFVGAAGLLALGGGPLSFPSQISAT-----TFSYCL--------VDRDSP 321
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
F + + + T I T S Y++G++ + V G+ + + + ++
Sbjct: 322 SSSTLQFGDAADAEVTAPLIRSPRT--------STFYYVGLSGLSVGGQILSIPPSAFAM 373
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTR 328
D+ G GG + + T L++S Y AL AF + R + V+ F C+ L D R
Sbjct: 374 DSTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSD----R 429
Query: 329 IGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRTSIVIGARQLEN 387
VP + L + + +N ++ + G CL F N SI IG Q +
Sbjct: 430 TSVEVPAVSLRFAGGGELR-LPAKNYLIPVDGAGTYCLAFAP--TNAAVSI-IGNVQQQG 485
Query: 388 NLLQFDLATSRLGFS 402
+ FD A S +GF+
Sbjct: 486 TRVSFDTAKSTVGFT 500
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 95/399 (23%), Positives = 152/399 (38%), Gaps = 61/399 (15%)
Query: 26 SFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE----------- 74
S + A L+V +++ + TP V + G +++W C+
Sbjct: 83 SSKAVAPALQVPVHAGNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTP 142
Query: 75 --KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQST 132
SST C S C+ + C CG T T+GD ST
Sbjct: 143 VFDPSSSSTYAALPCSSTLCSDLPSSKCTSAKCG--------YTYTYGDSS-------ST 187
Query: 133 DGRNPGRAVTV-----PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
G T+ P+ F CG G G G+ GLGR ++L SQL
Sbjct: 188 QGVLAAETFTLAKTKLPDVAFGCGDTNEGDGFTQGA-GLVGLGRGPLSLVSQLGL----- 241
Query: 188 RKFALCLSPFDDGA---IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE 244
KF+ CL+ DD + ++ G + + +++ TPL N +PS
Sbjct: 242 NKFSYCLTSLDDTSKSPLLLGSLATISESAA-AASSVQTTPLIRNP--------SQPSF- 291
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y++ + + V + L + ++ ++G GG + + T LE Y+AL +AFA+ M
Sbjct: 292 YYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQMK 351
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV-QIGGDAL 363
A CF G ++ VP+ LV + +N MV G AL
Sbjct: 352 LPAADGSGIGLDTCFEAPASGVDQV--EVPK--LVFHLDGADLDLPAENYMVLDSGSGAL 407
Query: 364 CLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
CL + R +IG Q +N +D+ + L F+
Sbjct: 408 CLTV----MGSRGLSIIGNFQQQNIQFVYDVGENTLSFA 442
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 84/373 (22%), Positives = 150/373 (40%), Gaps = 52/373 (13%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGSA 90
Y ++ +P + + G ++ W+ C+ V S+T + C S+
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSS 179
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
+C+L A +C A T ++GD + S D + T+P+F + C
Sbjct: 180 ECSLLKAATLNDPLCTA--SGVCVYTASYGDASYS-MGYLSRDLLTLTPSQTLPSFTYGC 236
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL GI GL R K+++ +QL+ + F+ CL G +
Sbjct: 237 GQDN--EGLFGKAAGIVGLARDKLSMLAQLSPKYGY--AFSYCLPTSTSSG-----GGFL 287
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN 270
+ S + K+TP+ N N PS+ YF+ + ++ V G+ V + +
Sbjct: 288 SIGKISPS-SYKFTPMIRNSQN--------PSL-YFLRLAAITVAGRPVGVAAAGYQVPT 337
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV-APFGACFRLQDIGFTRI 329
GT + T L SIY AL +AF M + AP + CF+ G +
Sbjct: 338 IIDSGTVV------TRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCFK----GSLKS 387
Query: 330 GPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNL 389
P+I ++ Q S+ N +++ CL F +IG Q +
Sbjct: 388 MSGAPEIRMIFQG-GADLSLRAPNILIEADKGIACLAFASS----NQIAIIGNHQQQTYN 442
Query: 390 LQFDLATSRLGFS 402
+ +D++ S++GF+
Sbjct: 443 IAYDVSASKIGFA 455
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 141/378 (37%), Gaps = 58/378 (15%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV I TP + G + WV CE V SST C
Sbjct: 158 TGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTYANISC 217
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPN 145
+ C+ K C GG C G G++ G +D L++ S D +
Sbjct: 218 AAPACSDLYIKGCSGGHCLYGVQY---GDGSYSIGFFAMDTLTLSSYD--------AIKG 266
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
F F CG +GL G+ GLGR K +LP Q + FA C G
Sbjct: 267 FRFGCGERN--EGLYGEAAGLLGLGRGKTSLPVQAYDKYG--GVFAHCFPARSSGTGYLD 322
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
GP + TP+ ++ T Y++G+T + V GK + + +++
Sbjct: 323 FGP----GSLPAVSAKLTTPMLVDNGPTF----------YYVGLTGIRVGGKLLSIPQSV 368
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA-RVAP-VAPFGACFRLQD 323
+ GT + + T L + Y +L AFASAM + + AP ++ C+
Sbjct: 369 FTTS-----GTIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDF-- 421
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGAR 383
G + + +P + L+ Q +H + CLGF + IV G
Sbjct: 422 TGMSEV--AIPTVSLLFQG-GASLDVHASGIIYAASVSQACLGFAGNKEDDDVGIV-GNT 477
Query: 384 QLENNLLQFDLATSRLGF 401
QL+ + +D+ +GF
Sbjct: 478 QLKTFGVVYDIGKKVVGF 495
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 60/261 (22%), Positives = 115/261 (44%), Gaps = 37/261 (14%)
Query: 160 ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDVS 218
+ G+ G+ R ++ +Q A +FA C++P D G +V G + ++
Sbjct: 194 SEAATGLLGMNRGSLSFVTQTA-----TLRFAYCIAPGDGPGLLVLGG------DGAALA 242
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
L YTPL +++ + V Y + + + V +P+ K++L+ D+ G G T +
Sbjct: 243 PQLNYTPLI--QISRPLPYFDR--VAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMV 298
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP--------FGACFRLQDIGFTRIG 330
+ +T L Y L F + A +AP+ F ACFR +
Sbjct: 299 DSGTQFTFLLADAYAPLKGEFLNQ--TSALLAPLGESDFVFQGAFDACFRASEARVAAAS 356
Query: 331 PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD---------ALCLGFVDGGVNPRTSIVIG 381
++P++ LVL+ V ++ G+ + ++ G+ CL F + + ++ VIG
Sbjct: 357 XMLPEVGLVLRGAEV--AVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIG 414
Query: 382 ARQLENNLLQFDLATSRLGFS 402
+N +++DL R+GF+
Sbjct: 415 HHHQQNVWVEYDLQNGRVGFA 435
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 96/389 (24%), Positives = 159/389 (40%), Gaps = 74/389 (19%)
Query: 53 TPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQC---NLAN 96
TP V + + G + W+ C + SST C SAQC +L +
Sbjct: 93 TPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCRSRDLPS 152
Query: 97 AKACGGGICGAGPDNPISN-TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGS--- 152
AC G ++ + + G + DV ++ S P RA F C S
Sbjct: 153 PPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGP---PLRAA------FGCMSSAF 203
Query: 153 EFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYD 211
+ G+AS G+ G+ R ++ SQ + R+F+ C+S DD G ++ G + D
Sbjct: 204 DSSPDGVAS--AGLLGMNRGALSFVSQAS-----TRRFSYCISDRDDAGVLLLG---HSD 253
Query: 212 LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE 271
L F L YTP++ A V Y + + + V GK +P+ ++L+ D+
Sbjct: 254 LPTF---LPLNYTPMY----QPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHT 306
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP------------FGACF 319
G G T + + +T L Y AL F + AR P+ P F CF
Sbjct: 307 GAGQTMVDSGTQFTFLLGDAYSALKAEFT----RQAR--PLLPALDDPSFAFQEAFDTCF 360
Query: 320 RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-----GGDAL-CLGFVDGGVN 373
R+ G + +P + L+ + ++ G + ++ GGD + CL F + +
Sbjct: 361 RVPQ-GRSPPTARLPGVTLLFNGAEM--AVAGDRLLYKVPGERRGGDGVWCLTFGNADMV 417
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFS 402
P + VIG N +++DL R+G +
Sbjct: 418 PIMAYVIGHHHQMNVWVEYDLERGRVGLA 446
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 118/294 (40%), Gaps = 47/294 (15%)
Query: 28 RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-------EKGY--- 77
R +A ++ A AT +Y+ + TP PV LT+ G +++W C ++G
Sbjct: 69 RVRAGLVAAAGGIATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLL 128
Query: 78 ---VSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTG----THGDIRIDVLSIQ 130
SST CG+ +C +CGG C + + G T G I D +
Sbjct: 129 DPAASSTYAALPCGAPRCRALPFTSCGGRSCVY-----VYHYGDKSVTVGKIATDRFTFG 183
Query: 131 STDGRN-PGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK 189
RN G F CG F S GIAG GR + +LPSQL A
Sbjct: 184 DNGRRNGDGSLPATRRLTFGCG-HFNKGVFQSNETGIAGFGRGRWSLPSQLNAT-----S 237
Query: 190 FALCLSPFDD--GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFI 247
F+ C + D +IV G L + S ++ TPLF N +PS+ YF+
Sbjct: 238 FSYCFTSMFDSKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNP--------SQPSL-YFL 288
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS 301
+ K + + KT L + T I + T L +Y+A+ FA+
Sbjct: 289 SL-------KGISVGKTRLPVPETKFRSTIIDSGASITTLPEEVYEAVKAEFAA 335
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 142/387 (36%), Gaps = 74/387 (19%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV I TP + G + WV C+ V SST C
Sbjct: 179 TGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSC 238
Query: 88 GSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
+ C+ + C GG C G G + + G +D L++ S D
Sbjct: 239 AAPACSDLYTRGCSGGHCLYSVQYGDG-------SYSIGFFAMDTLTLSSYD-------- 283
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
V F F CG +GL G+ GLGR K +LP Q + FA CL G
Sbjct: 284 AVKGFRFGCGERN--EGLFGEAAGLLGLGRGKTSLPVQTYDKYG--GVFAHCLPARSSGT 339
Query: 202 --IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
+ FG G +P + T Y++G+T + V G+ +
Sbjct: 340 GYLDFGPG----------------SPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGGQLL 383
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----F 315
+ +++ S GT + + T L + Y +L AFASAM AR AP
Sbjct: 384 SIPQSVFS-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAM--AARGYKKAPALSLL 436
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
C+ G + + +P++ L+ Q ++ M +CLGF +
Sbjct: 437 DTCYDF--TGMSEV--AIPKVSLLFQG-GAYLDVNASGIMYAASLSQVCLGFAANEDDDD 491
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFS 402
IV G QL+ + +D+ +GFS
Sbjct: 492 VGIV-GNTQLKTFGVVYDIGKKTVGFS 517
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 95/392 (24%), Positives = 157/392 (40%), Gaps = 60/392 (15%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAK- 98
TAT Y TQI+ +P + V G +ILWV+C + T Q + A +
Sbjct: 80 TATGLYYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT 139
Query: 99 --ACGGGICGAGPDNPIS----NTGTHGDIRIDVLSIQSTDG----------RNPGRAVT 142
C C A N + +T + RI ST G + G T
Sbjct: 140 TVGCDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQT 199
Query: 143 VPN---FIFLCGSEF--VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
P+ F CG++ L + + GI G G++ ++ SQLAAA +++ FA CL
Sbjct: 200 TPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTV 259
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G I + + N V +K TPL N + Y + + + V G
Sbjct: 260 HGGGI-------FAIGNV-VQPKVKTTPLVQNVTH------------YNVNLQGISVGGA 299
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+ L + + D+ GT I + L +Y+ L+ A +A + F
Sbjct: 300 TLQLPSS--TFDSGDSKGTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLA-LHNYQDF-V 355
Query: 318 CFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
CF+ D GF P + + + + +++ + + Q D C+GF+DGGV
Sbjct: 356 CFQFSGSIDDGF-------PVVTFSFEGE-ITLNVYPHDYLFQNENDLYCMGFLDGGVQT 407
Query: 375 RTS---IVIGARQLENNLLQFDLATSRLGFSD 403
+ +++G L N L+ +DL +G++D
Sbjct: 408 KDGKDMVLLGDLVLSNKLVVYDLEKQVIGWAD 439
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 63/264 (23%), Positives = 117/264 (44%), Gaps = 42/264 (15%)
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDV 217
++ G+ G+ R ++ +Q R+FA C++P + G ++ GD + V
Sbjct: 191 VSEAATGLLGMNRGTLSFVTQTG-----TRRFAYCIAPGEGPGVLLLGD-------DGGV 238
Query: 218 SKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTK 277
+ L YTPL +++ + V Y + + + V +P+ K++L+ D+ G G T
Sbjct: 239 APPLNYTPLI--EISQPLPYFDR--VAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTM 294
Query: 278 ISTVNPYTVLETSIYKALVQAFASAMPKVARV--APVAP--------FGACFRLQDIGFT 327
+ + +T L Y AL F S AR+ AP+ F ACFR +
Sbjct: 295 VDSGTQFTFLLADAYAALKAEFTSQ----ARLLLAPLGEPGFVFQGAFDACFRGPEARVA 350
Query: 328 RIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD---------ALCLGFVDGGVNPRTSI 378
++P + LVL+ V ++ G+ + + G+ CL F + + ++
Sbjct: 351 AASGLLPVVGLVLRGAEV--AVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAY 408
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
VIG +N +++DL R+GF+
Sbjct: 409 VIGHHHQQNVWVEYDLQNGRVGFA 432
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 85/383 (22%), Positives = 152/383 (39%), Gaps = 60/383 (15%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGY------------VSSTNKTARCGSAQC-----NLA 95
TP V + + G + W+ C SS+ C S+ C +
Sbjct: 81 TPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCSSSTCTDQTRDFP 140
Query: 96 NAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFV 155
+C + + + G++ D I S+ +PN +F C
Sbjct: 141 IRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG---------IPNVVFGCMDSIF 191
Query: 156 LQGLA--SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDL 212
S G+ G+ R ++ SQ+ KF+ C+S +D G ++ GD
Sbjct: 192 SSNSEEDSKNTGLMGMNRGSLSFVSQMGFP-----KFSYCISEYDFSGLLLLGDA----- 241
Query: 213 NNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEG 272
NF L YTPL +++T + V Y + + + V K +P+ +++ D+ G
Sbjct: 242 -NFSWLAPLNYTPLI--EMSTPLPYFDR--VAYTVQLEGIKVAHKLLPIPESVFEPDHTG 296
Query: 273 VGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF---GA---CFRLQDIGF 326
G T + + +T L Y AL F + RV + F GA C+R+
Sbjct: 297 AGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVP-TNQ 355
Query: 327 TRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-----GGDAL-CLGFVDGGVNPRTSIVI 380
TR+ P+ P + LV + + ++ G + ++ G D++ C F + + + VI
Sbjct: 356 TRLPPL-PSVTLVFRGAEM--TVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVI 412
Query: 381 GARQLENNLLQFDLATSRLGFSD 403
G +N ++FDL SR+G ++
Sbjct: 413 GHLHQQNVWMEFDLKKSRIGLAE 435
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 86/375 (22%), Positives = 153/375 (40%), Gaps = 56/375 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y +++ +P + + + G ++ WV C+ +S++ + C +
Sbjct: 162 EYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNP 221
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
+C+ +A AC GA G++ GD + L++ G + V +
Sbjct: 222 RCHDLDAAACRNST-GACLYEVAYGDGSYTVGDFATETLTL--------GDSAPVSSVAI 272
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGP 208
CG + +GL G G+ LG ++ PSQ++A F+ CL V D P
Sbjct: 273 GCGHDN--EGLFVGAAGLLALGGGPLSFPSQISAT-----TFSYCL--------VDRDSP 317
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
F + + + T I T S Y++G++ + V G+ + + + ++
Sbjct: 318 SSSTLQFGDAADAEVTAPLIRSPRT--------STFYYVGLSGISVGGQILSIPPSAFAM 369
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTR 328
D G GG + + T L++S Y AL AF + R + V+ F C+ L D R
Sbjct: 370 DGTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSD----R 425
Query: 329 IGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRTSIVIGARQLEN 387
VP + L + + +N ++ + G CL F N SI IG Q +
Sbjct: 426 TSVEVPAVSLRFAGGGELR-LPAKNYLIPVDGAGTYCLAFAP--TNAAVSI-IGNVQQQG 481
Query: 388 NLLQFDLATSRLGFS 402
+ FD A S +GF+
Sbjct: 482 TRVSFDTAKSTVGFT 496
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 153/370 (41%), Gaps = 61/370 (16%)
Query: 58 VKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQCNLANAKACGGGI 104
+ + + G ++ WV CE K SS+ ++ C S+ C G
Sbjct: 76 MTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGA 135
Query: 105 CGAG-PD--NPISNTG----THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQ 157
CG+ P N + N G T+G++ ++ LS V+V +F+F CG +
Sbjct: 136 CGSSNPSTCNYVVNYGDGSYTNGELGVEALSFG---------GVSVSDFVFGCGRNN--K 184
Query: 158 GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD---DGAIVFGDGPYYDLNN 214
GL GV G+ GLGRS ++L SQ A F F+ CL + G++V G+ + +
Sbjct: 185 GLFGGVSGLMGLGRSYLSLVSQTNATFG--GVFSYCLPTTEAGSSGSLVMGN----ESSV 238
Query: 215 FDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVG 274
F + + YT + N + S Y + +T + V G A+ K LS N G
Sbjct: 239 FKNANPITYTRMLSNP---------QLSNFYILNLTGIDVGGVAL---KAPLSFGN---G 283
Query: 275 GTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVP 334
G I + T L +S+YKAL F + CF L G+ + +P
Sbjct: 284 GILIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNL--TGYDEVS--IP 339
Query: 335 QIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFD 393
I L + N + G +V+ +CL T+I IG Q N + +D
Sbjct: 340 TISLRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAI-IGNYQQRNQRVIYD 398
Query: 394 LATSRLGFSD 403
S++GF++
Sbjct: 399 TKQSKVGFAE 408
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 83/391 (21%), Positives = 154/391 (39%), Gaps = 67/391 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
YVT I TP + G +++W+ C+ SS+ T CG
Sbjct: 39 DYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDT 98
Query: 91 QCNLANAKACGGGICGAGPDNPIS-----NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
C+ K+C P+ S +GT G + + +++ ST G + N
Sbjct: 99 LCDSLPRKSC-------SPNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEK----LAAKN 147
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG----- 200
F CG + +G + G+ GLGR ++ SQL F KF+ CL P+ D
Sbjct: 148 IAFGCG--HLNRGSFNDASGLVGLGRGNLSFVSQLGDLFG--HKFSYCLVPWRDAPSKTS 203
Query: 201 AIVFGDGPYYDLNNFDVSKNLKY--TPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNG 256
+ FGD + ++ K L Y TP+ N P++E Y++ + + + G
Sbjct: 204 PMFFGD----ESSSHSSGKKLHYAFTPMIHN-----------PAMESFYYVKLKDISIAG 248
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
+A+ + I +G GG + T+L + Y+ +++A S + A
Sbjct: 249 RALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLD 308
Query: 317 ACFRLQDIGFTRIG--PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
C+ D+ ++ +P + + + + G +CL V ++
Sbjct: 309 LCY---DVSGSKASYKKKIPAMVFHFEGADHQLPVENYFIAANDAGTIVCLAMVSSNMD- 364
Query: 375 RTSIVIGARQLENNL-LQFDLATSRLGFSDS 404
I I ++ N + +D+ +S++G++ S
Sbjct: 365 ---IGIYGNMMQQNFRVMYDIGSSKIGWAPS 392
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 80/378 (21%), Positives = 151/378 (39%), Gaps = 38/378 (10%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y + TP L + G ++ W+ C + G SS+ + C
Sbjct: 196 EYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDP 255
Query: 91 QCNLANAKACGGGICGAGPDNPI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
+C L +A P + T GD ++ ++ T V N
Sbjct: 256 RCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVEN 315
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
+F CG +GL G G+ GLG+ ++ SQ+ + + + F+ CL + A V
Sbjct: 316 VMFGCGH--WNRGLFHGAAGLLGLGKGPLSFASQMQSLYG--QSFSYCLVDRNSNASVSS 371
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
+ + NL +T K + F Y++ + SV V+ + + + +
Sbjct: 372 KLIFGEDKELLSHPNLNFTSFGGGKDGSVDTF-------YYVQIKSVMVDDEVLKIPEET 424
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
+ +EG GGT I + T Y+ + +AF + V + P C+ + I
Sbjct: 425 WHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIE 484
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI-VIGARQ 384
+ P ++ ++ VW+ +N + I + +CL + NPR+++ +IG Q
Sbjct: 485 KMEL----PDFGILFADE-AVWNFPVENYFIWIDPEVVCLAILG---NPRSALSIIGNYQ 536
Query: 385 LENNLLQFDLATSRLGFS 402
+N + +D+ SRLG++
Sbjct: 537 QQNFHILYDMKKSRLGYA 554
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 92/377 (24%), Positives = 145/377 (38%), Gaps = 60/377 (15%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSA 90
Y + TP L G ++ W CE S++ K C S
Sbjct: 132 YAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSE 191
Query: 91 QC-NLANAKACGGGICGAGPDNPISNTG-THGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
C ++ A G + TG T G + + L+I +D NF+
Sbjct: 192 PCKSIGKESAQGCSSSNSCLYGVKYGTGYTVGFLATETLTITPSD--------VFENFVI 243
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDGAIVFGD 206
CG G SG G+ GLGRS VALPSQ ++ + K F+ CL S G + FG
Sbjct: 244 GCGER--NGGRFSGTAGLLGLGRSPVALPSQTSSTY--KNLFSYCLPASSSSTGHLSFGG 299
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
G VS+ K+TP+ +K+ P + Y + V+ + V G+ +P++ ++
Sbjct: 300 G---------VSQAAKFTPI-TSKI---------PEL-YGLDVSGISVGGRKLPIDPSVF 339
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGF 326
GT I + T L ++ + AL AF M + C+
Sbjct: 340 R-----TAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHAN 394
Query: 327 TRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDGGVNPRTSIVIGARQL 385
I +PQI + + V I + G + +CL F D G N + G Q
Sbjct: 395 DNI--TIPQISIFFEG-GVEVDIDDSGIFIAANGLEEVCLAFKDNG-NDTDVAIFGNVQQ 450
Query: 386 ENNLLQFDLATSRLGFS 402
+ + +D+A +GF+
Sbjct: 451 KTYEVVYDVAKGMVGFA 467
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 100/407 (24%), Positives = 157/407 (38%), Gaps = 63/407 (15%)
Query: 17 SLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLV-PVKLTVHLGGNILWVDCEK 75
+L P + T+ RP + A +Y+ + P PV LT+ G +++W CE
Sbjct: 65 NLCPYSGATA-RPATAPVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEP 123
Query: 76 GYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGT-----HGDIRIDV---- 126
T R +A N + AC +C A ++ G +GD +
Sbjct: 124 CAECFTQPLPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFGHFL 183
Query: 127 -LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFS 185
S DG+ G+ VTVP+ F CG + L + GIAG GR ++LPSQL
Sbjct: 184 RDSFTFDDGKGGGK-VTVPDIGFGCGMYNAGRFLQT-ETGIAGFGRGPLSLPSQLKV--- 238
Query: 186 LKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-----KVNTASGFLGE 240
R+F+ C F K +P+F+ K + L
Sbjct: 239 --RQFSYC---------------------FTTRFEAKSSPVFLGGAGDLKAHATGPILST 275
Query: 241 PSVEYF-IGVTSVH--VNGKAVPLNKTLL---SIDNEGVGGTKISTVNPYTVLETSIYKA 294
P V G + H ++ K V + KT L I +G G T I + T ++++
Sbjct: 276 PFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGSGATFIDSGTDITTFPDAVFRQ 335
Query: 295 LVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQI-DLVLQNKNVVWSIHGQN 353
L AF +A+ A P DI F+ G + LV + W + +N
Sbjct: 336 LKSAF------IAQAA--LPVNKTADEDDICFSWDGKKTAAMPKLVFHLEGADWDLPREN 387
Query: 354 SMVQIGGDA-LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRL 399
+ + +C+ G RT +IG Q +N + +DLA +L
Sbjct: 388 YVTEDRESGQVCVAVSTSGQMDRT--LIGNFQQQNTHIVYDLAAGKL 432
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 162/387 (41%), Gaps = 58/387 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y+ + TP ++ + G ++ W+ C ++G V S + + CG
Sbjct: 151 EYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVTCGDP 210
Query: 91 QCNLANAKACGGGICGAGPDNPI-------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
+C L A C +P + T GD+ ++ ++ T PG + V
Sbjct: 211 RCGLV-APPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLT---APGASRRV 266
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA-- 201
+ +F CG +GL G G+ GLGR ++ SQL A + F+ CL D G+
Sbjct: 267 DDVVFGCGHSN--RGLFHGAAGLLGLGRGALSFASQLRAVYG--HAFSYCL--VDHGSSV 320
Query: 202 ---IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
IVFGD + L YT + A F Y++ + V V G+
Sbjct: 321 GSKIVFGDD-----DALLGHPRLNYTAFAPSAAAAADTF-------YYVQLKGVLVGGEK 368
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV-ARVAPVAPFGA 317
+ ++ + + +G GGT I + + Y+ + +AF M K VA
Sbjct: 369 LNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSP 428
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRT 376
C+ + G R+ VP+ L+ + VW +N V++ D +CL + PR+
Sbjct: 429 CYNVS--GVERV--EVPEFSLLFAD-GAVWDFPAENYFVRLDPDGIMCLAVLG---TPRS 480
Query: 377 SI-VIGARQLENNLLQFDLATSRLGFS 402
++ +IG Q +N + +DL +RLGF+
Sbjct: 481 AMSIIGNFQQQNFHVLYDLQNNRLGFA 507
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 162/387 (41%), Gaps = 58/387 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y+ + TP ++ + G ++ W+ C ++G V S + + CG
Sbjct: 151 EYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVTCGDP 210
Query: 91 QCNLANAKACGGGICGAGPDNPI-------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
+C L A C +P + T GD+ ++ ++ T PG + V
Sbjct: 211 RCGLV-APPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLT---APGASRRV 266
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA-- 201
+ +F CG +GL G G+ GLGR ++ SQL A + F+ CL D G+
Sbjct: 267 DDVVFGCGHSN--RGLFHGAAGLLGLGRGALSFASQLRAVYG--HAFSYCL--VDHGSSV 320
Query: 202 ---IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
IVFGD + L YT + A F Y++ + V V G+
Sbjct: 321 GSKIVFGDD-----DALLGHPRLNYTAFAPSAAAAADTF-------YYVQLKGVLVGGEK 368
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV-ARVAPVAPFGA 317
+ ++ + + +G GGT I + + Y+ + +AF M K VA
Sbjct: 369 LNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSP 428
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRT 376
C+ + G R+ VP+ L+ + VW +N V++ D +CL + PR+
Sbjct: 429 CYNVS--GVERV--EVPEFSLLFAD-GAVWDFPAENYFVRLDPDGIMCLAVLG---TPRS 480
Query: 377 SI-VIGARQLENNLLQFDLATSRLGFS 402
++ +IG Q +N + +DL +RLGF+
Sbjct: 481 AMSIIGNFQQQNFHVLYDLQNNRLGFA 507
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 97/420 (23%), Positives = 160/420 (38%), Gaps = 73/420 (17%)
Query: 14 LIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC 73
L LSP F + D + +Y+ ++ +P L V G +++WV C
Sbjct: 140 LATRLSPAYQPPGFSGSESKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQC 199
Query: 74 E---KGYV----------SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPIS---NTG 117
+ + YV S+T CGSA C + ACG G G G + +S +
Sbjct: 200 KPCLECYVQADPLFDPATSATFSGVSCGSAICRILPTSACGDGELG-GCEYEVSYADGSY 258
Query: 118 THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALP 177
T G + ++ L++ T V + CG +GL G G+ GLG ++L
Sbjct: 259 TKGALALETLTLGGT---------AVEGVVIGCGHRN--RGLFVGAAGLMGLGWGPMSLV 307
Query: 178 SQLAAAFSLKRKFALCLSPF----------DDGAIVFGDGPYYDLNNFDVSKNLKYTPLF 227
QL + F+ CL+ D G +V G + V + + PL
Sbjct: 308 GQLGG--EVGGAFSYCLASRGGYGSGAADDDAGWLVLG-------RSEAVPEGAVWVPLV 358
Query: 228 INKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVL 287
N PS Y++G++ + V + +PL L + +G G + T T L
Sbjct: 359 RNPR--------APSF-YYVGLSGIEVGDERLPLQAGLFQLTEDGAGDVVMDTGTTVTRL 409
Query: 288 ETSIYKALVQAF----ASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNK 343
Y AL AF A A+P+ V+ + C+ L R VP + +
Sbjct: 410 PQEAYAALRDAFVGALAGAVPRAQGVSS-SVLDTCYDLSGYASVR----VPTVSFCF-DG 463
Query: 344 NVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGF 401
+ + +N ++++ CL F P +S ++G Q + D A +GF
Sbjct: 464 DARLILAARNVLLEVDMGIYCLAFA-----PSSSGLSIMGNTQQAGIQITVDSANGYIGF 518
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 123/289 (42%), Gaps = 20/289 (6%)
Query: 115 NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKV 174
++ T GD ++ ++ T V N +F CG +GL G G+ GLGR +
Sbjct: 280 SSNTTGDFALETFTVNLTTPNGKSEQKHVENVMFGCGH--WNRGLFHGAAGLLGLGRGPL 337
Query: 175 ALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTA 234
+ SQL + + F+ CL + V + + NL +T + N+
Sbjct: 338 SFASQLQSIYG--HSFSYCLVDRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSV 395
Query: 235 SGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKA 294
F Y++G+ S+ V+G+ + + + + EG GGT I + T Y+
Sbjct: 396 DTF-------YYVGIKSIMVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEI 448
Query: 295 LVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNS 354
+ +AF + V P C+ + I + P ++ + +W +N
Sbjct: 449 IKEAFMKKIKGYELVEGFPPLKPCYNVSGIEKMEL----PDFGILFSD-GAMWDFPVENY 503
Query: 355 MVQIGGDALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
+QI D +CL + P++++ +IG Q +N + +D+ SRLG++
Sbjct: 504 FIQIEPDLVCLAILG---TPKSALSIIGNYQQQNFHILYDMKKSRLGYA 549
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 155/384 (40%), Gaps = 63/384 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
+Y+ + +P + G +++W C + S++ + C SA
Sbjct: 84 EYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSA 143
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDI--RIDVLSIQS-TDGRNPGRAVTVPNFI 147
CN + C C +GD VL+ ++ T G N R V VP
Sbjct: 144 MCNALYSPLCFQNAC--------VYQAFYGDSASSAGVLANETFTFGTNSTR-VAVPRVS 194
Query: 148 FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---IVF 204
F CG+ + G G+ G GR ++L SQL + +F+ CL+ F A + F
Sbjct: 195 FGCGN--MNAGTLFNGSGMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSRLYF 247
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
G + N S ++ TP +N P++ YF+ +T + V G +P++ +
Sbjct: 248 GAYATLNSTNTSSSGPVQSTPFIVNPA--------LPTM-YFLNMTGISVAGDLLPIDPS 298
Query: 265 LLSI-DNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKVARVAPVAPFGACFRL 321
+ +I + +G GG I + T L Y + AF + +P+ A P F CF+
Sbjct: 299 VFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPR-ANATPSDTFDTCFKW 357
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFV---DGGVNPRTS 377
R +P+ +VL + +N MV GG LCL + DG
Sbjct: 358 PPP--PRRMVTLPE--MVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGS------ 407
Query: 378 IVIGARQLENNLLQFDLATSRLGF 401
+IG+ Q +N + +DL S L F
Sbjct: 408 -IIGSFQHQNFHMLYDLENSLLSF 430
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 145/384 (37%), Gaps = 68/384 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV + TP + G + WV C+ V SST C
Sbjct: 177 TGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANISC 236
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPN 145
+ C+ + + C GG C G G++ G +D L++ S D V
Sbjct: 237 AAPACSDLDTRGCSGGNCLYGVQY---GDGSYSIGFFAMDTLTLSSYD--------AVKG 285
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
F F CG +GL G+ GLGR K +LP Q + FA CL G
Sbjct: 286 FRFGCGERN--EGLFGEAAGLLGLGRGKTSLPVQTYDKYG--GVFAHCLPARSSGTGYLD 341
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
GP + + TP+ + T Y++G+T + V G+ + + +++
Sbjct: 342 FGP----GSPAAAGARLTTPMLTDNGPTF----------YYVGMTGIRVGGQLLSIPQSV 387
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----FGACFRL 321
+ GT + + T L + Y +L AFASAM AR AP C+
Sbjct: 388 FT-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAM--AARGYKKAPAVSLLDTCYDF 440
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV---DGGVNPRTSI 378
G +++ +P + L+ Q + M +CLGF DGG
Sbjct: 441 --TGMSQV--AIPTVSLLFQG-GARLDVDASGIMYAASVSQVCLGFAANEDGG----DVG 491
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
++G QL+ + +D+ +GFS
Sbjct: 492 IVGNTQLKTFGVAYDIGKKVVGFS 515
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 155/384 (40%), Gaps = 63/384 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
+Y+ + +P + G +++W C + S++ + C SA
Sbjct: 87 EYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSA 146
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDI--RIDVLSIQS-TDGRNPGRAVTVPNFI 147
CN + C C +GD VL+ ++ T G N R V VP
Sbjct: 147 MCNALYSPLCFQNAC--------VYQAFYGDSASSAGVLANETFTFGTNSTR-VAVPRVS 197
Query: 148 FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---IVF 204
F CG+ + G G+ G GR ++L SQL + +F+ CL+ F A + F
Sbjct: 198 FGCGN--MNAGTLFNGSGMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSRLYF 250
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
G + N S ++ TP +N P++ YF+ +T + V G +P++ +
Sbjct: 251 GAYATLNSTNTSSSGPVQSTPFIVNPA--------LPTM-YFLNMTGISVAGDLLPIDPS 301
Query: 265 LLSI-DNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKVARVAPVAPFGACFRL 321
+ +I + +G GG I + T L Y + AF + +P+ A P F CF+
Sbjct: 302 VFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPR-ANATPSDTFDTCFKW 360
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFV---DGGVNPRTS 377
R +P+ +VL + +N MV GG LCL + DG
Sbjct: 361 PPP--PRRMVTLPE--MVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGS------ 410
Query: 378 IVIGARQLENNLLQFDLATSRLGF 401
+IG+ Q +N + +DL S L F
Sbjct: 411 -IIGSFQHQNFHMLYDLENSLLSF 433
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 85/397 (21%), Positives = 165/397 (41%), Gaps = 64/397 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------KGYVSSTNKTAR 86
Y T+IK +P + V G +ILW++C+ SST+K
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVG 133
Query: 87 CGSAQCN-LANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP- 144
C C+ ++ + +C + + + + G D+L+++ G T P
Sbjct: 134 CDDDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQV----TGDLKTGPL 189
Query: 145 --NFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+F CGS+ L S V G+ G G+S ++ SQLAA KR F+ CL G
Sbjct: 190 GQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG 249
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
I + + D S +K TP+ N+++ Y + + + V+G ++
Sbjct: 250 GI-------FAVGVVD-SPKVKTTPMVPNQMH------------YNVMLMGMDVDGTSLD 289
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
L ++++ GGT + + +Y +L++ + P ++ V CF
Sbjct: 290 LPRSIVR-----NGGTIVDSGTTLAYFPKVLYDSLIETILARQP--VKLHIVEETFQCFS 342
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV--NPRTSI 378
T + P + ++ +V +++ + + + + C G+ GG+ + R+ +
Sbjct: 343 FS----TNVDEAFPPVSFEFED-SVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEV 397
Query: 379 V-IGARQLENNLLQFDLATSRLGFSD-SLLFERATCT 413
+ +G L N L+ +DL +G++D + F R+ T
Sbjct: 398 ILLGDLVLSNKLVVYDLDNEVIGWADHNFFFYRSYTT 434
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 78/309 (25%), Positives = 135/309 (43%), Gaps = 46/309 (14%)
Query: 113 ISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRS 172
+ + +G I ++ S+QS DG A T+ + IF C S+ LQ G GL R
Sbjct: 85 LDGSEAYGVIAREIFSLQSWDGA----ASTLGDVIFGCASK-DLQRPVDFSSGTLGLNRG 139
Query: 173 KVALPSQLAA--AFSLKRKFALCLSPFDD-----GAIVFGDG--PYYDLNNFDVSKNLKY 223
+ P+Q+ + L +F+ C + G I+FGD P + +L+
Sbjct: 140 SFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGDSGIPAHHFQYL----SLEQ 195
Query: 224 TPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNP 283
P + V+ Y++G+ + V G+ + + ++ ID G GGT +
Sbjct: 196 EPPIASIVDF-----------YYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFDSGTT 244
Query: 284 YTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ---DI--GFTRIGPVVPQIDL 338
+ L + ALV+AF + + R + G+ F + D+ G R+ P P + L
Sbjct: 245 VSFLVEPAHTALVEAFGRRVLHLNRTS-----GSDFTKELCYDVAAGDARL-PTAPLVTL 298
Query: 339 VLQNKNVVWSIHGQNSMVQIGGD----ALCLGFVDGGVNPRTSI-VIGARQLENNLLQFD 393
+N NV + + V + +CL FV+ G + + VIG Q ++ L++ D
Sbjct: 299 HFKN-NVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHD 357
Query: 394 LATSRLGFS 402
L SR+GF+
Sbjct: 358 LERSRIGFA 366
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/385 (23%), Positives = 149/385 (38%), Gaps = 68/385 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ ++ TP P+ G +I+W CE S+T + C S
Sbjct: 84 EYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKVSCSSP 143
Query: 91 QCNLANAKACGGGICGAGPDNPIS-----NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
C+ C PD S N+ + GD +D L++ ST GR V P
Sbjct: 144 VCSFTGEDNS----CSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTS----GRVVAFPR 195
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF--DDGA-- 201
CG + + V GI GLG +L Q+ +A + KF+ CL+P DDG
Sbjct: 196 TAIGCGHDNA-GSFDANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSN 252
Query: 202 -IVFGDGPYYDLNNFDVS-KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
+ FG +N +VS TP++I+ + F S+ + +V
Sbjct: 253 KLNFG-------SNANVSGSGAVSTPIYIS--------------DKFKSFYSLKLKAVSV 291
Query: 260 PLNKTLLSIDNEGVGGTK---ISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
N T S N +GG I + T+L +Y +A ++++ P
Sbjct: 292 GRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLE 351
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT 376
CF + VP I + + N+ + +N ++++ + +CL F G +
Sbjct: 352 YCFETTTDDYK-----VPFIAMHFEGANL--RLQRENVLIRVSDNVICLAFA-GAQDNDI 403
Query: 377 SIVIGARQLENNLLQFDLATSRLGF 401
SI Q+ N L+ +D+ L F
Sbjct: 404 SIYGNIAQI-NFLVGYDVTNMSLSF 427
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 84/389 (21%), Positives = 155/389 (39%), Gaps = 72/389 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE----------------------KGYVSSTN 82
+ + I TP V + + G ++LW+ CE +SST
Sbjct: 111 HYSYIDIGTPNVQFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTA 170
Query: 83 KTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
K C C +++ C + +NT T G + D + G NP V
Sbjct: 171 KPVLCSDPLCEMSSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYFMRESGGNP---VK 227
Query: 143 VPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
+P ++ CG + +L+G A G+ GLG + +++P++LA+ L F+LC+SP
Sbjct: 228 LPVYLG-CGKVQTGSLLKGAAPN--GLMGLGTTDISVPNKLASTGQLADSFSLCISPGGS 284
Query: 200 GAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
G + FGD GP + K++ +I ++++ + +G T++ + A
Sbjct: 285 GTLTFGDEGPAAQRTTPIIPKSVSMLDTYIVEIDSIT-----------VGNTNLLMASHA 333
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP-VAPFGA 317
+ T +T L ++Y VQA+ + M P + +
Sbjct: 334 L------------------FDTGTSFTYLSKTVYPQFVQAYDAQMSLPKWNDPRFSKWDL 375
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD--ALCLGFVDGGVNPR 375
C++ + F VP + L L N + + G S+V A+C+ +D G
Sbjct: 376 CYQTSNTNFQ-----VPVVSLALSGGNSLDVVSGLKSIVDDNNAMIAVCVTVMDSGAGLS 430
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFSDS 404
+IG + N + ++ A +G++ S
Sbjct: 431 ---IIGQNFMTNYSITYNRAKMTIGWTPS 456
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 93/400 (23%), Positives = 155/400 (38%), Gaps = 81/400 (20%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWV------DCEK-GYV-----SSTNKTARCGSAQC 92
Y+ +I TP + + G++ W+ DC K G+ SST +A C S QC
Sbjct: 97 YLIKISVGTPPAEILALADITGDLTWLPCKTCQDCTKDGFTFFPSESSTYTSAACESYQC 156
Query: 93 NLANAKACGGG----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
+ N C +CG P S T G + +D +S S+ G+A++ PN F
Sbjct: 157 QITNGAVCQTKMCIYLCGPLPQQRSSCTN-KGLVAMDTISFHSSS----GQALSYPNTNF 211
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGP 208
+CG+ F+ G GI GLGR ++ SQ+ + F+ CL P+
Sbjct: 212 ICGT-FIDNWHYIG-AGIVGLGRGLFSMTSQMKHL--INGTFSQCLVPYS---------- 257
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEP------SVEYFIGVTSVHVNGKAVPLN 262
SK + V + G + P S YF+ + ++ V G V
Sbjct: 258 ---------SKQSSKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRVA-- 306
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV-----APFGA 317
N K + + TS+ + + + K + P+
Sbjct: 307 -------NNFYSAPKSNIYIDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKLSL 359
Query: 318 CFRLQ-DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR- 375
C++ + D F P I + N +V + N+ V++ + +C F+DG N
Sbjct: 360 CYKSESDHDFD-----APPITMHFTNADV--QLSPLNTFVRMDWNVVCFAFLDGTFNATK 412
Query: 376 --TSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
T V G+ Q N ++ +DL +S + F++A CT
Sbjct: 413 RITHAVYGSWQQMNFIVGYDLKSSTVS------FKQADCT 446
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 149/374 (39%), Gaps = 41/374 (10%)
Query: 41 ATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG-----------YVSSTNKTARCGS 89
+T Y+ I TP +P+ + G +++W C+ Y + + T S
Sbjct: 88 STATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVS 147
Query: 90 AQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
+ + A C + PD + ++GD + +T+ G V F
Sbjct: 148 CRSPMCQALQSPWSRC-SPPDTGCAYYFSYGD-GTSTDGVLATETFTLGSDTAVRGVAFG 205
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPY 209
CG+E + G G+ G+GR ++L SQL +F+ C +PF+ A P
Sbjct: 206 CGTENL--GSTDNSSGLVGMGRGPLSLVSQLGVT-----RFSYCFTPFNATAA----SPL 254
Query: 210 YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID 269
+ ++ +S K TP V + SG S Y++ + + V +P++ + +
Sbjct: 255 FLGSSARLSSAAKTTPF----VPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLT 310
Query: 270 NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRI 329
G GG I + +T LE S + AL +A AS RV GA L + F
Sbjct: 311 PMGDGGVIIDSGTTFTALEESAFVALARALAS------RVRLPLASGAHLGLS-LCFAAA 363
Query: 330 GPVVPQI-DLVLQNKNVVWSIHGQNSMVQ-IGGDALCLGFVDGGVNPRTSIVIGARQLEN 387
P ++ LVL + ++ +V+ CLG V R V+G+ Q +N
Sbjct: 364 SPEAVEVPRLVLHFDGADMELRRESYVVEDRSAGVACLGMVSA----RGMSVLGSMQQQN 419
Query: 388 NLLQFDLATSRLGF 401
+ +DL L F
Sbjct: 420 THILYDLERGILSF 433
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 154/387 (39%), Gaps = 70/387 (18%)
Query: 53 TPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTARCGSAQC---NL 94
TP V + + G + W+ C + S T + CGSAQC +L
Sbjct: 73 TPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQCRSRDL 132
Query: 95 ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQS-TDGRNPG-RAVTVPNFIFLC-G 151
+ AC G A +S + G L+ + T G+ P RA F C
Sbjct: 133 PSPPACDG----ASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAA------FGCMA 182
Query: 152 SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF---GDGP 208
+ F G+ G+ R ++ SQ + R+F+ C+S DD ++ D P
Sbjct: 183 TAFDTSPDGVATAGLLGMNRGALSFVSQAS-----TRRFSYCISDRDDAGVLLLGHSDLP 237
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
+ LN YTPL+ A V Y + + + V GK +P+ ++L+
Sbjct: 238 FLPLN---------YTPLY----QPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAP 284
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASA----MPKV--ARVAPVAPFGACFRLQ 322
D+ G G T + + +T L Y AL F+ +P + A F CFR+
Sbjct: 285 DHTGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVP 344
Query: 323 DIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQI-----GGDAL-CLGFVDGGVNPR 375
R P +P + L+ + ++ G + ++ GGD + CL F + + P
Sbjct: 345 Q---GRAPPARLPAVTLLFNGAQM--TVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPI 399
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFS 402
T+ VIG N +++DL R+G +
Sbjct: 400 TAYVIGHHHQMNVWVEYDLERGRVGLA 426
>gi|297744230|emb|CBI37200.3| unnamed protein product [Vitis vinifera]
Length = 168
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/89 (44%), Positives = 51/89 (57%), Gaps = 5/89 (5%)
Query: 173 KVALPSQLAAAFSLKRKFALCLSP--FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINK 230
+ALPSQ A+AF+ RKF++CLS DG I GDGPY L N D S+ L YTPL +N
Sbjct: 2 HIALPSQFASAFNFHRKFSICLSSSTIVDGIIFLGDGPYELLLNVDASQLLIYTPLILNP 61
Query: 231 VNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
V+ S + E F + V VNG +
Sbjct: 62 VSIVSTY---SQGEIFRANSMVFVNGDVL 87
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 17/28 (60%), Positives = 22/28 (78%)
Query: 353 NSMVQIGGDALCLGFVDGGVNPRTSIVI 380
NSMV + GD LCLGFVDGG NP+ +++
Sbjct: 77 NSMVFVNGDVLCLGFVDGGENPKLQLLL 104
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 80/383 (20%), Positives = 152/383 (39%), Gaps = 62/383 (16%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYV------------SSTNKTARCGSAQC-----NLA 95
TP V + + G + W+ C K S + + C S+ C + +
Sbjct: 39 TPPQNVSMVIDTGSELSWLYCNKTTTTTSYPTTFNQTRSISYRPIPCSSSTCTNQTRDFS 98
Query: 96 NAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFV 155
+C + + + G++ D + ++D +P +F C
Sbjct: 99 IPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASD---------IPGMVFGCMDSVF 149
Query: 156 LQG--LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDL 212
S G+ G+ R ++ SQ+ KF+ C+S D G ++ G+
Sbjct: 150 SSNSDEDSKNTGLMGMNRGSLSFVSQMGFP-----KFSYCISGTDFSGMLLLGE------ 198
Query: 213 NNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEG 272
+NF + L YTPL +++T + + Y + + + V+ + +P+ K++ D+ G
Sbjct: 199 SNFTWAVPLNYTPLV--QISTPLPYFDR--IAYTVQLEGIKVSDRLLPIPKSVFEPDHTG 254
Query: 273 VGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF---GA---CFRLQDIGF 326
G T + + +T L Y AL F + RV F GA C+R+
Sbjct: 255 AGQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPIS-- 312
Query: 327 TRIGPVVPQIDLVLQ-------NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIV 379
R+ P +P + LV ++ V++ + G+ ++ CL F + + + V
Sbjct: 313 QRVLPRLPTVSLVFNGAEMTVADERVLYRVPGE---IRGNDSVHCLSFGNSDLLGVEAYV 369
Query: 380 IGARQLENNLLQFDLATSRLGFS 402
IG +N ++FDL SR+G +
Sbjct: 370 IGHHHQQNVWMEFDLERSRIGLA 392
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 100/400 (25%), Positives = 160/400 (40%), Gaps = 74/400 (18%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGY----------VSST 81
T T Y TQI TP + V G +ILWV+C + G S++
Sbjct: 84 TDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASAS 143
Query: 82 NKTARCGSAQCNLANAKACGGGI---CGAGPDNPISNTGTHGD-------IRIDVLSIQS 131
+KT CG C A A GG+ C A ++P + T+GD D L
Sbjct: 144 SKTVTCGQEFC----ATATNGGVPPSCAA--NSPCQYSITYGDGSSTTGFFVADFLQYDQ 197
Query: 132 TDGRNPGRAVTVPNFIFLCGSEF--VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK 189
G + F CG++ L + GI G G++ ++ SQL +A + +
Sbjct: 198 VSGDGQTNLANA-SVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKI 256
Query: 190 FALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
F+ CL + G I + + N V +K TPL G P Y + +
Sbjct: 257 FSHCLDTVNGGGI-------FAIGNV-VQPKVKTTPLVP----------GMP--HYNVVL 296
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
++ V G + L + I G GT I + L +YKA++ A S P V +
Sbjct: 297 KTIDVGGSTLQLPTNIFDIGG-GSRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVT-L 354
Query: 310 APVAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
V F CF+ D GF P++ + ++ ++ + + Q D C+G
Sbjct: 355 KNVQDF-LCFQYSGSVDNGF-------PEVTFHF-DGDLPLVVYPHDYLFQNTEDVYCVG 405
Query: 367 FVDGGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
F GGV + +++G L N L+ +DL +G+++
Sbjct: 406 FQSGGVQSKDGKDMVLLGDLALSNKLVVYDLENQVIGWTN 445
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 93/409 (22%), Positives = 159/409 (38%), Gaps = 74/409 (18%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTA--TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE- 74
L P+ + FR + L + T+ + +Y +++ P PV + + G ++ W+ C
Sbjct: 115 LKPLDTDSQFRAEDLQGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAP 174
Query: 75 ------------KGYVSSTNKTARCGSAQCNLANAKACGGGIC------GAGPDNPISNT 116
+ S++ C + QC + C C G G +
Sbjct: 175 CADCYHQADPIFEPASSTSYSPLSCDTKQCQSLDVSECRNNTCLYEVSYGDG-------S 227
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T GD + +++ S +V N CG +GL G G+ GLG K++
Sbjct: 228 YTVGDFVTETITLGS---------ASVDNVAIGCGHNN--EGLFIGAAGLLGLGGGKLSF 276
Query: 177 PSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASG 236
PSQ+ A+ F+ CL D D + L++ + TA
Sbjct: 277 PSQINAS-----SFSYCLVDRDS----------------DSASTLEFNSALLPHAITAPL 315
Query: 237 FLG-EPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKAL 295
E Y++G+T + V G+ + + +++ +D G GG I + T L+T+ Y AL
Sbjct: 316 LRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNAL 375
Query: 296 VQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSM 355
AF + + VA F C+ L + VP + L V+ + N +
Sbjct: 376 RDAFVKGTKDLPVTSEVALFDTCYDLS----RKTSVEVPTVTFHLAGGKVL-PLPATNYL 430
Query: 356 VQIGGD-ALCLGFVDGGVNPRTSI--VIGARQLENNLLQFDLATSRLGF 401
+ + D C F P +S +IG Q + + FDLA S +GF
Sbjct: 431 IPVDSDGTFCFAFA-----PTSSALSIIGNVQQQGTRVGFDLANSLVGF 474
>gi|255648351|gb|ACU24627.1| unknown [Glycine max]
Length = 208
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 91/217 (41%), Gaps = 33/217 (15%)
Query: 1 MSVAQSFLLLCSLLIFSLSPITAQTSFRPKA--LVLRVAKDTATLQYVTQIKQRTPLVPV 58
M+ L C +L+F +SP + ++ PK + L + D T Q+ T I TP +
Sbjct: 1 MATPTCVLYFC-VLVFFVSPSLSASNEFPKTGYISLPINIDPTTHQHFTSIGIGTPRHNM 59
Query: 59 KLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCN-----LANAKA-------CGGGICG 106
L + + G+ LW DC Y SS+ S QC +N A C C
Sbjct: 60 NLAIDISGSYLWYDCGGNYNSSSYNPVLWDSPQCPGPEPFQSNCDAGFPFKPGCTNNTCN 119
Query: 107 AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEF-------VLQGL 159
DNP ++ G GD+ D L + +P F SE +L GL
Sbjct: 120 VALDNPFADFGFGGDLGHDFLFTPQ---------IKLPQTFFSVCSESSRFPQLPILVGL 170
Query: 160 ASGVVGIAGLGR-SKVALPSQLAAAF-SLKRKFALCL 194
G G GL R S L SQ++++F ++ KF LCL
Sbjct: 171 PKGTKGSLGLARQSPFTLQSQISSSFNNVPPKFTLCL 207
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/286 (25%), Positives = 122/286 (42%), Gaps = 48/286 (16%)
Query: 140 AVTVPNFIFLCGSEFVLQGLASGV------VGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+VT P +F C + GL+S G+ G+ R ++ +QL + KF+ C
Sbjct: 162 SVTRPGTLFGC----MDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFS-----KFSYC 212
Query: 194 LSPFDD-GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
+S D G ++ GD Y L ++YTPL + T + V Y + + +
Sbjct: 213 ISGSDSSGILLLGDASYSWLGP------IQYTPLVLQ--TTPLPYFDR--VAYTVQLEGI 262
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
V K + L K++ D+ G G T + + +T L +Y AL F + V R+
Sbjct: 263 RVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDD 322
Query: 313 APF------GACFRLQDI---GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG--- 360
F C+R+ FT + P I L+ + + S+ GQ + ++ G
Sbjct: 323 PNFVFQGTMDLCYRVGSSTRPNFTGL----PVISLMFRGAEM--SVSGQKLLYRVNGAGS 376
Query: 361 ----DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ C F + + + VIG +N ++FDLA SR+GF+
Sbjct: 377 EGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFA 422
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/275 (27%), Positives = 120/275 (43%), Gaps = 52/275 (18%)
Query: 144 PNFIFLCGS----EFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
P F F CG EF G ASGV+G+A + +L SQ A+ F K+KF+ C P +
Sbjct: 219 PKFQFGCGDSGGGEF---GTASGVLGLAK--GEQYSLISQTASKF--KKKFSYCFPPKEH 271
Query: 200 --GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G+++FG+ S +LK+T L +N SG + YF+ + + V K
Sbjct: 272 TLGSLLFGE------KAISASPSLKFTQL----LNPPSG------LGYFVELIGISVAKK 315
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP--- 314
+ ++ +L + GT I + T L T+ Y+AL AF M ++P
Sbjct: 316 RLNVSSSLFASP-----GTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKL 370
Query: 315 FGACFRLQDIGFTRIG------PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV 368
C+ L+ G I V ++D+ L ++W+ + Q CL F
Sbjct: 371 LDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWA---NGDLTQA-----CLAFA 422
Query: 369 DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
NP +IG RQ + + +D+ RLGF +
Sbjct: 423 RKS-NPSHVTIIGNRQQVSLKVVYDIEGGRLGFGN 456
>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
Length = 454
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/250 (26%), Positives = 111/250 (44%), Gaps = 31/250 (12%)
Query: 166 IAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDD---GAIVFGDGPYYDLNNFDVSKN 220
I+G GR +LPSQL +KF+ CL +DD + + DG + ++ + +
Sbjct: 216 ISGFGRGPPSLPSQLGL-----KKFSYCLLSRRYDDTTESSSLVLDG---ESDSGEKTAG 267
Query: 221 LKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
L YTP N KV F SV Y++G+ + V GK V + L +G GGT I
Sbjct: 268 LSYTPFVQNPKVAGKHAF----SVYYYLGLRHITVGGKHVKIPYKYLIPGADGDGGTIID 323
Query: 280 TVNPYTVLETSIYKALVQAFASAM--PKVARVAPVAPFGACFRLQDIGFTRIGPVVPQID 337
+ +T ++ I++ + F + + V + CF + + P P++
Sbjct: 324 SGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFNISGLN----TPSFPELT 379
Query: 338 LVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDGGVNPRT-----SIVIGARQLENNLLQ 391
L + + N + +GG D +CL V G + +I++G Q +N ++
Sbjct: 380 LKFRG-GAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFYVE 438
Query: 392 FDLATSRLGF 401
+DL RLGF
Sbjct: 439 YDLRNERLGF 448
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 106/409 (25%), Positives = 171/409 (41%), Gaps = 66/409 (16%)
Query: 17 SLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG 76
+L+P A T+ R +L +A D +Y+ ++ TP + G +++W C
Sbjct: 69 ALAPGDAITAAR----ILVLASDG---EYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC 121
Query: 77 YV-------------SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDI- 122
+ S+T ++ C S CN C +C +GD
Sbjct: 122 LLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQKVC--------VYQYFYGDSA 173
Query: 123 -RIDVLSIQS-TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQL 180
VL+ ++ T G N R V++P F CG+ LA+G G+ G GR ++L SQL
Sbjct: 174 STAGVLANETFTFGTNETR-VSLPGISFGCGN-LNAGSLANGS-GMVGFGRGSLSLVSQL 230
Query: 181 AAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNTASG 236
+ +F+ CL+ F + FG Y LN+ + S ++ TP +N
Sbjct: 231 GSP-----RFSYCLTSFLSPVPSRLYFGV--YATLNSTNASSEPVQSTPFVVNP------ 277
Query: 237 FLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI-DNEGVGGTKISTVNPYTVLETSIYKAL 295
P++ YF+ +T + V G +P++ + +I D +G GGT I + T L Y A+
Sbjct: 278 --ALPTM-YFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAV 334
Query: 296 VQAFASAMP-KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNS 354
AFAS + + V + CF+ R +PQ LVL W + QN
Sbjct: 335 RAAFASQITLPLLNVTDASVLDTCFQWPPP--PRQSVTLPQ--LVLHFDGADWELPLQNY 390
Query: 355 MV--QIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
M+ G LCL +IG+ Q +N + +DL S + F
Sbjct: 391 MLVDPSTGGGLCLAMASS----SDGSIIGSYQHQNFNVLYDLENSLMSF 435
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 150/372 (40%), Gaps = 68/372 (18%)
Query: 54 PLVPVKLTVHLGGNILWVDC-----EKG-----------YVSSTNKTARCGSAQCNLANA 97
P P + G ++ W+ C + G +SS+ C S QC L +
Sbjct: 6 PQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQLLDE 65
Query: 98 KACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCG 151
C C G G + T G++ + L+ ++ ++PN CG
Sbjct: 66 AGCNVNSCIYKVEYGDG-------SFTIGELATETLTFVHSN--------SIPNISIGCG 110
Query: 152 SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYD 211
+ +GL G G+ GLG +++ SQL A+ F+ CL D + D
Sbjct: 111 HDN--EGLFVGADGLIGLGGGAISISSQLKAS-----SFSYCLVDIDSPSFSTLDF---- 159
Query: 212 LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE 271
N D + +PL N PS Y + V + V GK +P++ + ID
Sbjct: 160 --NTDPPSDSLISPLVKND--------RFPSFRY-VKVIGMSVGGKPLPISSSRFEIDES 208
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGP 331
G+GG + + T L + +Y+ L +AF + ++PF C+ L ++
Sbjct: 209 GLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTNLPPAPEISPFDTCYDLS----SQSNV 264
Query: 332 VVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRTSIVIGARQLENNLL 390
VP I +L +N + + +N ++Q+ CL FV P + +IG Q + +
Sbjct: 265 EVPTIAFILPGENSL-QLPAKNCLIQVDSAGTFCLAFVSATF-PLS--IIGNFQQQGIRV 320
Query: 391 QFDLATSRLGFS 402
+DL S +GFS
Sbjct: 321 SYDLTNSLVGFS 332
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 102/393 (25%), Positives = 159/393 (40%), Gaps = 60/393 (15%)
Query: 34 LRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILW------VDCEKGYV-------SS 80
L+V +++ + TP + V G +++W VDC K SS
Sbjct: 89 LQVPVHAGNGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSS 148
Query: 81 TNKTARCGSAQC-NLANAKACGGGICGAGPDNPISNTGTHGDIRID--VLSIQS-TDGRN 136
T T C SA C +L + CG T T+GD VL+ ++ T G+
Sbjct: 149 TYATVPCSSALCSDLPTSTCTSASKCG--------YTYTYGDASSTQGVLASETFTLGKE 200
Query: 137 PGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP 196
+ +P F CG G G G+ GLGR ++L SQL KF+ CL+
Sbjct: 201 KKK---LPGVAFGCGDTNEGDGFTQGA-GLVGLGRGPLSLVSQLGL-----DKFSYCLTS 251
Query: 197 FDDG----AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
DDG ++ G G ++ + ++ TPL N +PS Y++ +T +
Sbjct: 252 LDDGDGKSPLLLG-GSAAAISESAATAPVQTTPLVKNP--------SQPSF-YYVSLTGL 301
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
V + L + +I ++G GG + + T LE Y+AL +AF + M
Sbjct: 302 TVGSTRITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGSE 361
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMV-QIGGDALCLGFVDG 370
CF+ G + VP+ LVL + + +N MV ALCL
Sbjct: 362 IGLDLCFQGPAKGVDEV--QVPK--LVLHFDGGADLDLPAENYMVLDSASGALCL----- 412
Query: 371 GVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
V P + +IG Q +N +D+A L F+
Sbjct: 413 TVAPSRGLSIIGNFQQQNFQFVYDVAGDTLSFA 445
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 82/386 (21%), Positives = 160/386 (41%), Gaps = 63/386 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------KGYVSSTNKTAR 86
Y T+IK +P + V G +ILW++C+ SST+K
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVG 133
Query: 87 CGSAQCN-LANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP- 144
C C+ ++ + +C + + + + G D+L+++ G T P
Sbjct: 134 CDDDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQV----TGDLKTGPL 189
Query: 145 --NFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+F CGS+ L S V G+ G G+S ++ SQLAA KR F+ CL G
Sbjct: 190 GQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG 249
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
I + + D S +K TP+ N+++ Y + + + V+G ++
Sbjct: 250 GI-------FAVGVVD-SPKVKTTPMVPNQMH------------YNVMLMGMDVDGTSLD 289
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
L ++++ GGT + + +Y +L++ + P ++ V CF
Sbjct: 290 LPRSIVR-----NGGTIVDSGTTLAYFPKVLYDSLIETILARQP--VKLHIVEETFQCFS 342
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV--NPRTSI 378
T + P + ++ +V +++ + + + + C G+ GG+ + R+ +
Sbjct: 343 FS----TNVDEAFPPVSFEFED-SVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEV 397
Query: 379 V-IGARQLENNLLQFDLATSRLGFSD 403
+ +G L N L+ +DL +G++D
Sbjct: 398 ILLGDLVLSNKLVVYDLDNEVIGWAD 423
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 142/380 (37%), Gaps = 61/380 (16%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--GYV------------SSTNKTARC 87
T YV ++ TP + G + WV C+ Y S+T C
Sbjct: 93 TGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISC 152
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFI 147
S+ C+ C GG C G + T G D L++ T+ NF
Sbjct: 153 SSSYCSDLYVSGCSGGHCLYGIQYG-DGSYTIGFYAQDTLTLAYD---------TIKNFR 202
Query: 148 FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDG 207
F CG + +GL G+ GLGR K +LP Q + FA CL A G G
Sbjct: 203 FGCGEKN--RGLFGRAAGLLGLGRGKTSLPVQAYDKYG--GVFAYCLP-----ATSAGTG 253
Query: 208 PYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS 267
+ DL + N + TP+ +++ T Y++G+T + V G +P+ ++ S
Sbjct: 254 -FLDLGPGAPAANARLTPMLVDRGPTF----------YYVGMTGIKVGGHVLPIPGSVFS 302
Query: 268 IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFT 327
GT + + T L S Y L AF+ AM + A A F + D +
Sbjct: 303 -----TAGTLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPA-----FSILDTCYD 352
Query: 328 RIGP-----VVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGA 382
G +P + LV Q + + CL F + +IV
Sbjct: 353 LTGHKGGSIALPAVSLVFQG-GACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNT 411
Query: 383 RQLENNLLQFDLATSRLGFS 402
+Q + +L +D+ +GF+
Sbjct: 412 QQKTHGVL-YDIGKKIVGFA 430
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 88/384 (22%), Positives = 143/384 (37%), Gaps = 60/384 (15%)
Query: 41 ATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------------KGYVSST 81
T Y+ + TP + L G ++ W C+ K Y + +
Sbjct: 150 GTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNIS 209
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
+ C + N+ C C G S+ T G D L++ D
Sbjct: 210 CTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSF-TVGFFAKDTLTLTQND-------- 260
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
F+F CG +GL G+ GLGR +++ Q A F + L S +G
Sbjct: 261 VFDGFMFGCGQN--NRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGH 318
Query: 202 IVFGDGPYYDLNNFDVSKNLK----YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
+ FG+G N SK +K +TP ++ T YFI V + V GK
Sbjct: 319 LTFGNG-----NGVKTSKAVKNGITFTPFASSQGATF----------YFIDVLGISVGGK 363
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
A+ ++ L GT I + T L +++Y +L F M K ++
Sbjct: 364 ALSISPMLFQ-----NAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDT 418
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS 377
C+ L + +T I +P+I N N + ++ G +CL F G + T
Sbjct: 419 CYDLSN--YTSIS--IPKISFNF-NGNANVDLEPNGILITNGASQVCLAFAGNG-DDDTI 472
Query: 378 IVIGARQLENNLLQFDLATSRLGF 401
+ G Q + + +D+A +LGF
Sbjct: 473 GIFGNIQQQTLEVVYDVAGGQLGF 496
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 61/264 (23%), Positives = 115/264 (43%), Gaps = 37/264 (14%)
Query: 160 ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF--GDGPYYDLNNFDV 217
+ G+ G+ R ++ +Q +FA C++P D ++ GDG D
Sbjct: 205 SEAATGLLGMNRGSLSFVTQTG-----TLRFAYCIAPGDGPGLLVLGGDG---DGAALSA 256
Query: 218 SKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTK 277
+ L YTPL +++ + V Y + + + V +P+ K++L+ D+ G G T
Sbjct: 257 APQLNYTPLI--EMSQPLPYFDR--VAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTM 312
Query: 278 ISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP--------FGACFRLQD--IGFT 327
+ + +T L Y L F + A +AP+ F ACFR + +
Sbjct: 313 VDSGTQFTFLLADAYAPLKGEFLNQ--TSALLAPLGEPDFVFQGAFDACFRASEARVAAA 370
Query: 328 RIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD---------ALCLGFVDGGVNPRTSI 378
++P++ LVL+ V ++ G+ + + G+ CL F + + ++
Sbjct: 371 TASQLLPEVGLVLRGAEV--AVGGEKLLYMVPGERRGEGGSEAVWCLTFGNSDMAGMSAY 428
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
VIG +N +++DL SR+GF+
Sbjct: 429 VIGHHHQQNVWVEYDLQNSRVGFA 452
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 144/381 (37%), Gaps = 62/381 (16%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV I TP + G + WV CE V SST+ C
Sbjct: 183 TGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISC 242
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPN 145
+ C+ K C GG C G G++ G +D L++ S D +
Sbjct: 243 AAPACSDLYTKGCSGGHCLYGVQY---GDGSYSIGFFAMDTLTLSSYD--------AIKG 291
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
F F CG +GL G+ GLGR K +LP Q + FA C G
Sbjct: 292 FRFGCGERN--EGLFGEAAGLLGLGRGKTSLPVQAYDKYG--GVFAHCFPARSSGTGYLD 347
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
GP ++ VS L TP+ ++ T Y++G+T + V GK + + ++
Sbjct: 348 FGP---GSSPAVSTKLT-TPMLVDNGLTF----------YYVGLTGIRVGGKLLSIPPSV 393
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----FGACFRL 321
+ GT + + T L + Y +L AFASA+ AR AP C+
Sbjct: 394 FT-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAI--AARGYKKAPALSLLDTCYDF 446
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIG 381
G +++ +P + L+ Q + + CLGF + IV G
Sbjct: 447 --TGMSQV--AIPTVSLLFQG-GASLDVDASGIIYAASVSQACLGFAANEEDDDVGIV-G 500
Query: 382 ARQLENNLLQFDLATSRLGFS 402
QL+ + +D+ +GFS
Sbjct: 501 NTQLKTFGVVYDIGKKVVGFS 521
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 85/387 (21%), Positives = 160/387 (41%), Gaps = 65/387 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------KGYVSSTNKTAR 86
Y T+IK +P + V G +ILWV+C+ SST+K
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVG 133
Query: 87 CGSAQCN-LANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGR---NP-GRAV 141
C C+ ++ + +C + + + + G+ D L+++ G P G+ V
Sbjct: 134 CDDDFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQEV 193
Query: 142 TVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
+F CGS+ L S V G+ G G+S ++ SQLAA KR F+ CL
Sbjct: 194 -----VFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKG 248
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
G I + + D S +K TP+ N+++ Y + + + V+G A+
Sbjct: 249 GGI-------FAVGVVD-SPKVKTTPMVPNQMH------------YNVMLMGMDVDGTAL 288
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
L +++ GGT + + +Y +L++ + P ++ V CF
Sbjct: 289 DLPPSIMR-----NGGTIVDSGTTLAYFPKVLYDSLIETILARQP--VKLHIVEDTFQCF 341
Query: 320 RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN--PRTS 377
+ + P + ++ +V +++ + + + + C G+ GG+ RT
Sbjct: 342 SFSE----NVDVAFPPVSFEFED-SVKLTVYPHDYLFTLEKELYCFGWQAGGLTTGERTE 396
Query: 378 IV-IGARQLENNLLQFDLATSRLGFSD 403
++ +G L N L+ +DL +G++D
Sbjct: 397 VILLGDLVLSNKLVVYDLENEVIGWAD 423
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 146/361 (40%), Gaps = 54/361 (14%)
Query: 59 KLTVHLGGNILWVDCEK---GYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISN 115
K L ++ V C ++ N +RC CN + ++ C G G +
Sbjct: 181 KFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRC--RNCN-SKSRKCSDSCPGYGLQ--YGS 235
Query: 116 TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVA 175
T G + + L +++ VP+F+ C V Q GIAG GR +
Sbjct: 236 GATAGILLSETLDLENK---------RVPDFLVGCSVMSVHQP-----AGIAGFGRGPES 281
Query: 176 LPSQLAAAFSLKRKFALCLSP--FDDGAI----VFGDGPYYDLNNFDVSKNLKYTPLFIN 229
LPSQ+ LKR F+ CL FDD + V G D + +K+ Y P N
Sbjct: 282 LPSQM----RLKR-FSHCLVSRGFDDSPVSSPLVLDSGSESDESK---TKSFIYAPFREN 333
Query: 230 KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLET 289
+ + F Y++ + + + GK V L D+ G GG I + + +T L+
Sbjct: 334 PSVSNAAF----REYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDK 389
Query: 290 SIYKALVQAFASAMPKVARVAPVAP---FGACFRLQDIGFTRIGPVVPQIDLVLQNK-NV 345
I++A+ + K R V CF + + P D+VL+ K
Sbjct: 390 PIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFNIPKEEESAEFP-----DVVLKFKGGG 444
Query: 346 VWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRT----SIVIGARQLENNLLQFDLATSRLG 400
S+ +N + + + +CL + +I++GA Q +N L+++DLA R+G
Sbjct: 445 KLSLAAENYLAMVTDEGVVCLTMMTDEAVVGGGGGPAIILGAFQQQNVLVEYDLAKQRIG 504
Query: 401 F 401
F
Sbjct: 505 F 505
>gi|224108307|ref|XP_002314798.1| predicted protein [Populus trichocarpa]
gi|222863838|gb|EEF00969.1| predicted protein [Populus trichocarpa]
Length = 98
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 58/99 (58%), Gaps = 8/99 (8%)
Query: 160 ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYD-LNNFDVS 218
A G G GLG+ ++A+PSQLA+ L+RK A CL+ +G + G+ P YD + ++S
Sbjct: 4 ARGAQGTLGLGKIRIAVPSQLASNSGLERKSATCLAS-SNGLTLLGNEPSYDSVLGTEIS 62
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
++L YTPL + S S EYFI V S+ +NGK
Sbjct: 63 RSLIYTPLVTSPDARGS------SQEYFINVKSIKINGK 95
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 67/287 (23%), Positives = 123/287 (42%), Gaps = 42/287 (14%)
Query: 137 PGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA-AFSLKRKFALCLS 195
P + V NF F C + + VG+AG GR +++PSQLA + L +F+ CL
Sbjct: 200 PSPPINVRNFTFGCAHTTLGE-----PVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLV 254
Query: 196 PFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTA------SGFLGEPSVEYF--I 247
++F + + +PL + + T + L P YF +
Sbjct: 255 S----------------HSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSV 298
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
G+ + V +P + L +D G GG + + +T+L +Y+++V F + KVA
Sbjct: 299 GLAGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVA 358
Query: 308 ----RVAPVAPFGACFRLQD-IGFTRI--GPVVPQIDLVLQNKNVVWS-IHGQNSMVQIG 359
R+ C+ ++ +G R+ V + ++VL KN + + G + +V
Sbjct: 359 NRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRK 418
Query: 360 GDALCLGFVDGGVNPRTS----IVIGARQLENNLLQFDLATSRLGFS 402
CL ++GG + +G Q + + +DL +R+GF+
Sbjct: 419 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFA 465
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 93/396 (23%), Positives = 162/396 (40%), Gaps = 65/396 (16%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG---------------YVSSTNKT 84
T T Y T+I +P + V G +ILWV+C + Y +KT
Sbjct: 64 TVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKT 123
Query: 85 ARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNP 137
+ S + N ++ G I G +NP + ++GD D L+ +G NP
Sbjct: 124 SEFVSCEHNFCSSTY-EGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNG-NP 181
Query: 138 GRAVTVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
A + IF CG S + GI G G++ ++ SQLAA+ +K+ F+ CL
Sbjct: 182 HTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL 241
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
+ G +F G V +K TPL N + Y + + ++ V
Sbjct: 242 DT-NVGGGIFSIGEV-------VEPKVKTTPLVPNMAH------------YNVILKNIEV 281
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
+G + L + D+E GT I + L +Y L+ + P++ +V V
Sbjct: 282 DGDILQLPSD--TFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRL-KVYLVEE 338
Query: 315 FGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDG 370
+CF+ D GF P + L ++ ++ +++ + + GD+ C+G+
Sbjct: 339 QYSCFQYTGNVDSGF-------PIVKLHFED-SLSLTVYPHDYLFNYKGDSYWCIGWQKS 390
Query: 371 GV---NPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
N + ++G L N L+ +DL +G++D
Sbjct: 391 ASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTD 426
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 94/395 (23%), Positives = 158/395 (40%), Gaps = 59/395 (14%)
Query: 39 DTATLQYVTQIKQRTPLVPVK--LTVHLGGNILWVDCE-----------------KGYVS 79
D+ QY I+ TP P K L G ++ W++CE + S
Sbjct: 113 DSGQSQYFVSIRIGTPR-PQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDS 171
Query: 80 STNKTARCGSAQCNLANAKACGGGICGAGPDNP--ISNTGTHGDIRIDVLSIQS-TDGRN 136
S+ +T C S C + C P+ P +G I V + ++ T G N
Sbjct: 172 SSFRTIPCSSDDCKIELQDYFSLTEC-PNPNAPCLFDYRYLNGPRAIGVFANETVTVGLN 230
Query: 137 PGRAVTVPNFIFLCGSEF-VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL- 194
+ + + + + C F G GV+G LG K +L +LA F KF+ CL
Sbjct: 231 DHKKIRLFDVLIGCTESFNETNGFPDGVMG---LGYRKHSLALRLAEIFG--NKFSYCLV 285
Query: 195 ----SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVT 250
S + FGD P L +++T L + +N Y + V+
Sbjct: 286 DHLSSSNHKNFLSFGDIPEMKL------PKMQHTELLLGYINAF----------YPVNVS 329
Query: 251 SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA 310
+ V G + ++ + ++ GVGG + + T+L Y +V A K +V
Sbjct: 330 GISVGGSMLSISSDIWNV--TGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVV 387
Query: 311 PVA-PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
P+ P F +D GF R VP++ L+ ++ ++ ++ + CLG +
Sbjct: 388 PIELPELNNFCFEDKGFDRAA--VPRL-LIHFADGAIFKPPVKSYIIDVAEGIKCLGIIK 444
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
P +SI +G +N+L ++DL +LGF S
Sbjct: 445 ADF-PGSSI-LGNVMQQNHLWEYDLGRGKLGFGPS 477
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 149/384 (38%), Gaps = 61/384 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---KGYV----------SSTNKTARCGSA 90
QY TP L V G ++LWV C + Y SST C S
Sbjct: 64 QYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSP 123
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRID--VLSIQSTDGRNPGRAVTVPNFIF 148
+C L A G C + + D + V + +S + V + F
Sbjct: 124 ECLLIPATE--GFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDD----VRIDKVAF 177
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-----GAIV 203
CG + QG + G+ GLG+ ++ SQ+ A+ KFA CL + D ++
Sbjct: 178 GCGRD--NQGSFAAAGGVLGLGQGPLSFGSQVGYAYG--NKFAYCLVNYLDPTSVSSWLI 233
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FGD +L++TP+ N N P++ Y++ + V V G+++P++
Sbjct: 234 FGD------ELISTIHDLQFTPIVSNSRN--------PTL-YYVQIEKVMVGGESLPISH 278
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
+ S+D G GG+ + T Y+ ++ AF + + R A V C +
Sbjct: 279 SAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNV-RYPRAASVQGLDLCVDVTG 337
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD-----GGVNPRTSI 378
+ P P +VL V G N V + + CL GG N
Sbjct: 338 VD----QPSFPSFTIVLGGGAVFQPQQG-NYFVDVAPNVQCLAMAGLPSSVGGFN----- 387
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
IG +N L+Q+D +R+GF+
Sbjct: 388 TIGNLLQQNFLVQYDREENRIGFA 411
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 142/373 (38%), Gaps = 52/373 (13%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCG 88
T YV + TP + G ++ WV C+ +SST CG
Sbjct: 146 TGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACG 205
Query: 89 SAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
+ +C +A C + T G++ D L++ ++D T+P F+F
Sbjct: 206 APECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD--------TLPGFVF 257
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGP 208
CG + GL V G+ GLGR KV+LPSQ A ++ + L S G + G P
Sbjct: 258 GCGDQNA--GLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLGGAP 315
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
N ++T L A G PS Y+I + + V G+A+ + T +
Sbjct: 316 ---------PANAQFTAL-------ADG--ATPSF-YYIDLVGIKVGGRAIRIPATAFAA 356
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTR 328
V I + T L Y L AFA +M + + ++ C+ +
Sbjct: 357 AGGTV----IDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQ 412
Query: 329 IGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENN 388
I P ++L V S+ + CL F + +I +G Q +
Sbjct: 413 I----PTVELAFAGGATV-SLDFTGVLYVSKVSQACLAFAPNADDSSIAI-LGNTQQKTF 466
Query: 389 LLQFDLATSRLGF 401
+ +D+A R+GF
Sbjct: 467 AVTYDVANQRIGF 479
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 142/380 (37%), Gaps = 61/380 (16%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--GYV------------SSTNKTARC 87
T YV ++ TP + G + WV C+ Y S+T C
Sbjct: 158 TGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISC 217
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFI 147
S+ C+ C GG C G + T G D L++ T+ NF
Sbjct: 218 SSSYCSDLYVSGCSGGHCLYGIQY-GDGSYTIGFYAQDTLTLAYD---------TIKNFR 267
Query: 148 FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDG 207
F CG + +GL G+ GLGR K +LP Q + FA CL A G G
Sbjct: 268 FGCGEKN--RGLFGRAAGLLGLGRGKTSLPVQAYDKYG--GVFAYCLP-----ATSAGTG 318
Query: 208 PYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS 267
+ DL + N + TP+ +++ T Y++G+T + V G +P+ ++ S
Sbjct: 319 -FLDLGPGAPAANARLTPMLVDRGPTF----------YYVGMTGIKVGGHVLPIPGSVFS 367
Query: 268 IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFT 327
GT + + T L S Y L AF+ AM + A A F + D +
Sbjct: 368 -----TAGTLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPA-----FSILDTCYD 417
Query: 328 RIGP-----VVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGA 382
G +P + LV Q + + CL F + +IV
Sbjct: 418 LTGHKGGSIALPAVSLVFQG-GACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNT 476
Query: 383 RQLENNLLQFDLATSRLGFS 402
+Q + +L +D+ +GF+
Sbjct: 477 QQKTHGVL-YDIGKKIVGFA 495
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/294 (24%), Positives = 126/294 (42%), Gaps = 37/294 (12%)
Query: 116 TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQ-GLASGVV-GIAGLGRSK 173
+ T+G + DV+ + G N T IF CGS+ Q G + V GI G G+S
Sbjct: 6 SSTNGYLVKDVVHLDLVTG-NRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSN 64
Query: 174 VALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNT 233
+ SQLA+ +KR FA CL ++G +F G VS +K TP+
Sbjct: 65 SSFISQLASQGKVKRSFAHCLDN-NNGGGIFAIGEV-------VSPKVKTTPML------ 110
Query: 234 ASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL-SIDNEGVGGTKISTVNPYTVLETSIY 292
S Y + + ++ V + L+ S D++GV I + L ++Y
Sbjct: 111 ------SKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGV---IIDSGTTLVYLPDAVY 161
Query: 293 KALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQ 352
L+ ++ P++ F CF D R V Q D K+V +++ +
Sbjct: 162 NPLLNEILASHPELTLHTVQESF-TCFHYTD-KLDRFPTVTFQFD-----KSVSLAVYPR 214
Query: 353 NSMVQIGGDALCLGFVDGGVNPR---TSIVIGARQLENNLLQFDLATSRLGFSD 403
+ Q+ D C G+ +GG+ + + ++G L N L+ +D+ +G+++
Sbjct: 215 EYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTN 268
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 81/379 (21%), Positives = 148/379 (39%), Gaps = 39/379 (10%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
+Y + TP L + G ++ W+ C Y SS+ K C
Sbjct: 194 EYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSFKNITCHDP 253
Query: 91 QCNLANAKACGGGICGAGPDNPI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
+C L ++ G P ++ T GD ++ ++ T V N
Sbjct: 254 RCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIVEN 313
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
+F CG +GL G G+ GLGR ++ +QL + + F+ CL + + V
Sbjct: 314 VMFGCGH--WNRGLFHGAAGLLGLGRGPLSFATQLQSLYG--HSFSYCLVDRNSNSSVSS 369
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
+ + NL +T K N F Y++ + S+ V G+ + + +
Sbjct: 370 KLIFGEDKELLSHPNLNFTSFVGGKENPVDTF-------YYVLIKSIMVGGEVLKIPEET 422
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
+ +G GGT I + T Y+ + +AF + V P C+ + +
Sbjct: 423 WHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVE 482
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG-GDALCLGFVDGGVNPRTSI-VIGAR 383
+ P+ ++ + +W +N +QI D +CL + PR+++ +IG
Sbjct: 483 KMEL----PEFAILFAD-GAMWDFPVENYFIQIEPEDVVCLAILG---TPRSALSIIGNY 534
Query: 384 QLENNLLQFDLATSRLGFS 402
Q +N + +DL SRLG++
Sbjct: 535 QQQNFHILYDLKKSRLGYA 553
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 92/384 (23%), Positives = 143/384 (37%), Gaps = 68/384 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV + TP + G + WV C+ V SST C
Sbjct: 176 TGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSC 235
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPN 145
+ C + + C GG C G G++ G +D L++ S D V
Sbjct: 236 AAPACFDLDTRGCSGGHCLYGVQY---GDGSYSIGFFAMDTLTLSSYD--------AVKG 284
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
F F CG +GL G+ GLGR K +LP Q + FA CL G
Sbjct: 285 FRFGCGERN--EGLFGEAAGLLGLGRGKTSLPVQTYDKYG--GVFAHCLPARSSGTGYLD 340
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
GP + + TP+ + T Y++G+T + V G+ + + +++
Sbjct: 341 FGP----GSPAAAGARLTTPMLTDNGPTF----------YYVGMTGIRVGGQLLSIPQSV 386
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----FGACFRL 321
+ GT + + T L Y +L AF SAM AR AP C+
Sbjct: 387 FA-----TAGTIVDSGTVITRLPPPAYSSLRSAFVSAM--AARGYKKAPAVSLLDTCYDF 439
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV---DGGVNPRTSI 378
G +++ +P + L+ Q + + M +CLGF DGG
Sbjct: 440 --TGMSQV--AIPTVSLLFQG-GAILDVDASGIMYAASVSQVCLGFAANEDGG----DVG 490
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
++G QL+ + +D+ +GFS
Sbjct: 491 IVGNTQLKTFGVAYDIGKKVVGFS 514
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 98/385 (25%), Positives = 148/385 (38%), Gaps = 76/385 (19%)
Query: 58 VKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQCNLANAKACGGGI 104
+ + V G ++ WV CE K S + + C S C ACG
Sbjct: 133 MSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQSLELGACGSDP 192
Query: 105 CGAGPDNPISNTG----THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLA 160
+ + + N G T G++ I+ L ++V NF+F CG +GL
Sbjct: 193 STSATCDYVVNYGDGSYTSGELGIEKLGF---------GGISVSNFVFGCGRNN--KGLF 241
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD----GAIVFGDGPYYDLNNFD 216
G G+ GLGRS++++ SQ A F F+ CL D G++V G N
Sbjct: 242 GGASGLMGLGRSELSMISQTNATFG--GVFSYCLPSTDQAGASGSLVMG-------NQSG 292
Query: 217 VSKN---LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGV 273
V KN + YT + N + S Y + +T + V G V L+ S N GV
Sbjct: 293 VFKNVTPIAYTRMLPNL---------QLSNFYILNLTGIDVGG--VSLHVQASSFGNGGV 341
Query: 274 ---GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIG 330
GT IS + P S+YKAL F + CF L G+ ++
Sbjct: 342 ILDSGTVISRLAP------SVYKALKAKFLEQFSGFPSAPGFSILDTCFNL--TGYDQVN 393
Query: 331 PVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNL 389
+P I + + N + G +V+ +CL I IG Q N
Sbjct: 394 --IPTISMYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGI-IGNYQQRNQR 450
Query: 390 LQFDLATSRLGFSDSLLFERATCTF 414
+ +D S++GF+ + CTF
Sbjct: 451 VLYDAKLSQVGFA------KEPCTF 469
>gi|383167635|gb|AFG66875.1| Pinus taeda anonymous locus 2_9056_02 genomic sequence
gi|383167637|gb|AFG66876.1| Pinus taeda anonymous locus 2_9056_02 genomic sequence
gi|383167639|gb|AFG66877.1| Pinus taeda anonymous locus 2_9056_02 genomic sequence
Length = 78
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 35/61 (57%), Positives = 41/61 (67%), Gaps = 1/61 (1%)
Query: 347 WSIHGQNSMVQIG-GDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSL 405
W I G NSM + +ALCL FVD G +P SIVIGA QL+ LLQFD+ S LGFS +L
Sbjct: 1 WRIVGANSMERAYVENALCLAFVDAGEDPEVSIVIGAYQLQEILLQFDIGRSTLGFSSNL 60
Query: 406 L 406
L
Sbjct: 61 L 61
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 142/373 (38%), Gaps = 52/373 (13%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCG 88
T YV + TP + G ++ WV C+ +SST CG
Sbjct: 146 TGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACG 205
Query: 89 SAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
+ +C +A C + T G++ D L++ ++D T+P F+F
Sbjct: 206 APECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD--------TLPGFVF 257
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGP 208
CG + GL V G+ GLGR KV+LPSQ A ++ + L S G + G P
Sbjct: 258 GCGDQNA--GLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLGGAP 315
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
N ++T L A G PS Y+I + + V G+A+ + T +
Sbjct: 316 ---------PANAQFTAL-------ADG--ATPSF-YYIDLVGIKVGGRAIRIPATAFAA 356
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTR 328
V I + T L Y L AFA +M + + ++ C+ +
Sbjct: 357 AGGTV----IDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQ 412
Query: 329 IGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENN 388
I P ++L V S+ + CL F + +I +G Q +
Sbjct: 413 I----PTVELAFAGGATV-SLDFTGVLYVSKVSQACLAFAPNADDSSIAI-LGNTQQKTF 466
Query: 389 LLQFDLATSRLGF 401
+ +D+A R+GF
Sbjct: 467 AVAYDVANQRIGF 479
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 68/283 (24%), Positives = 121/283 (42%), Gaps = 42/283 (14%)
Query: 140 AVTVPNFIFLCGSEFVLQGLASGV------VGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+VT P +F C + GL+S G+ G+ R ++ +QL + KF+ C
Sbjct: 166 SVTRPGTLFGC----MDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS-----KFSYC 216
Query: 194 LSPFDD-GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
+S D G ++ GD Y L ++YTPL + +T + V Y + + +
Sbjct: 217 ISGSDSSGFLLLGDASYSWLGP------IQYTPLVLQ--STPLPYFDR--VAYTVQLEGI 266
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
V K + L K++ D+ G G T + + +T L +Y AL F + V R+
Sbjct: 267 RVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDD 326
Query: 313 APF------GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG------ 360
F C+++ + P + L+ + + S+ GQ + ++ G
Sbjct: 327 PDFVFQGTMDLCYKVGSTTRPNFSGL-PMVSLMFRGAEM--SVSGQKLLYRVNGAGSEGK 383
Query: 361 -DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ C F + + + VIG +N ++FDLA SR+GF+
Sbjct: 384 EEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFA 426
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 91/393 (23%), Positives = 146/393 (37%), Gaps = 72/393 (18%)
Query: 37 AKDTATL---QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------KGYV-----S 79
AK AT+ Y+ + TP + L G ++ W C+ K V S
Sbjct: 120 AKSGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQS 179
Query: 80 STNKTARCGSAQCN-----------LANAKACGGGICGAGPDNPISNTGTHGDIRIDVLS 128
+T C S C+ + A+AC GI D S G + L+
Sbjct: 180 TTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYG--DQSFSV----GYFAKETLT 233
Query: 129 IQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKR 188
+ STD + NF+F CG +GL G+ GLG+ K+++ Q A + +
Sbjct: 234 LTSTD--------VIENFLFGCGQNN--RGLFGSAAGLIGLGQDKISIVKQTAQKYG--Q 281
Query: 189 KFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIG 248
F+ CL Y LKYTP I K + + F G +
Sbjct: 282 VFSYCLPKTSSST------GYLTFGGGGGGGALKYTP--ITKAHGVANFYG-------VD 326
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
+ + V G +P++ ++ S G I + T L Y AL AF M K +
Sbjct: 327 IVGMKVGGTQIPISSSVFSTS-----GAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPK 381
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV 368
++ C+ L +I P++ V + + + G M +CL F
Sbjct: 382 APELSILDTCYDLSKYSTIQI----PKVGFVFKGGEEL-DLDGIGIMYGASTSQVCLAFA 436
Query: 369 DGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G +P T +IG Q + + +D+ ++GF
Sbjct: 437 -GNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGF 468
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 87/348 (25%), Positives = 140/348 (40%), Gaps = 67/348 (19%)
Query: 79 SSTNKTARCGSAQCNL--------ANAKACG--GGICGAGPDNPISNTGTHGDIRIDVLS 128
SST ++ C S +CN + K C G G G T G + DVL
Sbjct: 132 SSTVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLG--------STTGQLVSDVLG 183
Query: 129 IQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKR 188
+ + +P+F+F C Q GIAG GR ++P+QL
Sbjct: 184 LSKLN--------RIPDFLFGCSLVSNRQ-----PEGIAGFGRGLASIPAQLGLT----- 225
Query: 189 KFALCL--SPFDD----GAIVFGDGPYYDLNNFDVSKN-LKYTPLFINKVNTASGFLGEP 241
KF+ CL FDD G +V G + D + N + Y P T S L
Sbjct: 226 KFSYCLVSHRFDDTPQSGDLVLHRG----RRHADAAANGVAYAPF------TKSPALSPY 275
Query: 242 SVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS 301
S Y+I ++ + V GK VP+ L EG GG + + + +T +E I+ + +
Sbjct: 276 SEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEK 335
Query: 302 AMPKVARVAPVAP---FGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQ 357
M K R + G C+ + G + + VP++ + N+ + S+V
Sbjct: 336 HMTKYKRAKEIEDSSGLGPCYNI--TGQSEVD--VPKLTFSFKGGANMDLPLTDYFSLVT 391
Query: 358 IGGDALCLGFVDGGVNPRTS----IVIGARQLENNLLQFDLATSRLGF 401
G +C+ + P ++ I++G Q +N +++DL R GF
Sbjct: 392 DG--VVCMTVLTDPDEPGSTTGPAIILGNYQQQNFYIEYDLKKQRFGF 437
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 147/368 (39%), Gaps = 62/368 (16%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYVS--------------STNKTARCGSAQCNLANAK 98
TP + V G ++ W+ C VS ST + C + QC+ +
Sbjct: 5 TPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSDLPSA 64
Query: 99 ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQG 158
C + N ++GD V + S D + G + ++PNF + CG + +G
Sbjct: 65 TLNPSACSS--SNVCIYQASYGDSSFSVGYL-SKDTVSFG-STSLPNFYYGCGQDN--EG 118
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFDDGAIVFGDGPYYDLNN 214
L G+ GL R+K++L QLA SL F CL S + G Y
Sbjct: 119 LFGRSAGLIGLARNKLSLLYQLAP--SLGYSFTYCLPSSSSSGYLSLGSYNPGQY----- 171
Query: 215 FDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVG 274
YTP+ + ++ + YFI ++ + V G N +S
Sbjct: 172 -------SYTPMVSSSLDDS---------LYFIKLSGMTVAG-----NPLSVSSSAYSSL 210
Query: 275 GTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVP 334
T I + T L TS+Y AL +A A+AM +R + + CF+ Q +R+ P
Sbjct: 211 PTIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQA---SRVS--AP 265
Query: 335 QIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDL 394
+ + + QN +V + CL F R++ +IG Q + + +D+
Sbjct: 266 AVTMSFAG-GAALKLSAQNLLVDVDDSTTCLAFAPA----RSAAIIGNTQQQTFSVVYDV 320
Query: 395 ATSRLGFS 402
+SR+GF+
Sbjct: 321 KSSRIGFA 328
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 87/374 (23%), Positives = 148/374 (39%), Gaps = 41/374 (10%)
Query: 41 ATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG-----------YVSSTNKTARCGS 89
+T Y+ I TP +P+ + G +++W C+ Y + + T S
Sbjct: 88 STATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVS 147
Query: 90 AQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
+ + A C + PD + ++GD + +T+ G V F
Sbjct: 148 CRSPMCQALQSPWSRC-SPPDTGCAYYFSYGD-GTSTDGVLATETFTLGSDTAVRGVAFG 205
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPY 209
CG+E + G G+ G+GR ++L SQL +F+ C +PF+ A P
Sbjct: 206 CGTENL--GSTDNSSGLVGMGRGPLSLVSQLGVT-----RFSYCFTPFNATAA----SPL 254
Query: 210 YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID 269
+ ++ +S K TP V + SG S Y++ + + V +P++ + +
Sbjct: 255 FLGSSARLSSAAKTTPF----VPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLT 310
Query: 270 NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRI 329
G GG I + +T LE + AL +A AS RV GA L + F
Sbjct: 311 PMGDGGVIIDSGTTFTALEERAFVALARALAS------RVRLPLASGAHLGLS-LCFAAA 363
Query: 330 GPVVPQI-DLVLQNKNVVWSIHGQNSMVQ-IGGDALCLGFVDGGVNPRTSIVIGARQLEN 387
P ++ LVL + ++ +V+ CLG V R V+G+ Q +N
Sbjct: 364 SPEAVEVPRLVLHFDGADMELRRESYVVEDRSAGVACLGMVSA----RGMSVLGSMQQQN 419
Query: 388 NLLQFDLATSRLGF 401
+ +DL L F
Sbjct: 420 THILYDLERGILSF 433
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 67/265 (25%), Positives = 110/265 (41%), Gaps = 42/265 (15%)
Query: 164 VGIAGLGRSKVALPSQLA-AAFSLKRKFALCL--SPFDD------GAIVFGDGPYYDLNN 214
+G+AG G ++LP+QLA + L +F+ CL FD ++ G D
Sbjct: 237 IGVAGFGFGSLSLPAQLANLSPDLGNQFSYCLVSHSFDSTKLHHPSPLILGKVKERD--- 293
Query: 215 FDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVT--SVHVNGKAVPLNKTLLSIDNEG 272
FD YTP+ N P YF V+ ++ V V L+ ID +G
Sbjct: 294 FDEITQFVYTPMLDN-----------PKHPYFYSVSMEAISVGSSRVRAPNALIRIDRDG 342
Query: 273 VGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----FGACFRLQDIGFTR 328
GG + + YT+L T Y ++ + +V + A C+ L+ G R
Sbjct: 343 NGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKTGLSPCYYLEGNGVER 402
Query: 329 IGPVVPQI--------DLVLQNKNVVWS-IHGQNSMVQIGGDALCLGFVDGGVNPR--TS 377
+G VVP++ +VL +N + + G++ + G CL +DGG
Sbjct: 403 LGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDE--KKGRKVGCLMLMDGGDESEGGPG 460
Query: 378 IVIGARQLENNLLQFDLATSRLGFS 402
+G Q + + +DL R+GF+
Sbjct: 461 ATLGNYQQQGFQVVYDLEERRVGFA 485
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 89/390 (22%), Positives = 155/390 (39%), Gaps = 84/390 (21%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST----------------NKTARC 87
+Y+ QI P+ L G ++ W+ C+ +T C
Sbjct: 147 EYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSC 206
Query: 88 GSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
S QC L + C C G G + T G++ + LS G +
Sbjct: 207 NSQQCKLLDKANCNSDTCIYQVHYGDG-------SFTTGELATETLSF--------GNSN 251
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
++PN CG + +GL +G G+ GLG ++L SQL A+ F+ CL D
Sbjct: 252 SIPNLPIGCGHDN--EGLFAGGAGLIGLGGGAISLSSQLKAS-----SFSYCLVNLDS-- 302
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE-------YFIGVTSVHV 254
D S L++ N S L P V+ ++ V + V
Sbjct: 303 --------------DSSSTLEF------NSNMPSDSLTSPLVKNDRFHSYRYVKVVGISV 342
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
GK +P++ T ID G+GG + + + L + +Y++L +AF ++ ++
Sbjct: 343 GGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISV 402
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG-GDALCLGFVDGGVN 373
F C+ G + + VP I VL ++ + +N ++ + CL F +
Sbjct: 403 FDTCYNFS--GQSNVE--VPTIAFVL-SEGTSLRLPARNYLIMLDTAGTYCLAF----IK 453
Query: 374 PRTSI-VIGARQLENNLLQFDLATSRLGFS 402
++S+ +IG+ Q + + +DL S +GFS
Sbjct: 454 TKSSLSIIGSFQQQGIRVSYDLTNSLVGFS 483
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 89/376 (23%), Positives = 141/376 (37%), Gaps = 56/376 (14%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCG 88
T YV + TP + + G ++ WV C +K + SST C
Sbjct: 143 TGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCA 202
Query: 89 SAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
S +C ++++C + T G + D L++ +D +P F+F
Sbjct: 203 SPECQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSD--------VLPGFVF 254
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGP 208
CG + GL G+ GLGR KV+L SQ A+ + + L SP G + G GP
Sbjct: 255 GCGEQDT--GLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPSSPSAAGYLSLG-GP 311
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
F + +P F Y++ + V V G+ V ++ + S
Sbjct: 312 APANARFTAMETRHDSPSF-----------------YYVRLVGVKVAGRTVRVSPIVFS- 353
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA--RVAPVAPFGACFRLQDIGF 326
GT I + T L +Y AL AFA +M + R ++ C+
Sbjct: 354 ----AAGTVIDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTT 409
Query: 327 TRIGPVVPQIDLVLQNKNVV-WSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQL 385
RI P + LV V G + ++ CL F G I IG Q
Sbjct: 410 VRI----PSVALVFAGGAAVGLDFSGVLYVAKV--SQACLAFAPNGDGADAGI-IGNTQQ 462
Query: 386 ENNLLQFDLATSRLGF 401
+ + +D+A ++GF
Sbjct: 463 KTLAVVYDVARQKIGF 478
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 91/385 (23%), Positives = 150/385 (38%), Gaps = 68/385 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILW---VDCEKGY----------VSSTNKTARCGSA 90
+Y+ ++ TP P+ G +I+W V C Y S+T + C S
Sbjct: 84 EYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKVSCSSP 143
Query: 91 QCNLANAKACGGGICGAGPDNPIS-----NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
C+ C PD S N+ + GD +D L++ ST GR V P
Sbjct: 144 VCSFTGEDNS----CSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGR----VVAFPR 195
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF--DDGA-- 201
CG + + V GI GLG +L Q+ +A + KF+ CL+P DDG
Sbjct: 196 TAIGCGHDNA-GSFDANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSN 252
Query: 202 -IVFGDGPYYDLNNFDVS-KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
+ FG +N +VS TP++I+ + F S+ + +V
Sbjct: 253 KLNFG-------SNANVSGSGAVSTPIYIS--------------DKFKSFYSLKLKAVSV 291
Query: 260 PLNKTLLSIDNEGVGGTK---ISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
N T S N +GG I + T+L +Y +A ++++ P
Sbjct: 292 GRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLE 351
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT 376
CF + VP I + + N+ + +N ++++ + +CL F G +
Sbjct: 352 YCFETTTDDYK-----VPFIAMHFEGANL--RLQRENVLIRVSDNVICLAFA-GAQDNDI 403
Query: 377 SIVIGARQLENNLLQFDLATSRLGF 401
SI Q+ N L+ +D+ L F
Sbjct: 404 SIYGNIAQI-NFLVGYDVTNMSLSF 427
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 159/387 (41%), Gaps = 75/387 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVD---CEKGYV----------SSTNKTARCGSA 90
+Y T+I TP V + + G +++W+ C K Y S T CG+
Sbjct: 128 EYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAP 187
Query: 91 QCNLANAKACG--GGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C ++ C +C G G + T GD + L+ + T
Sbjct: 188 LCRRLDSPGCNNKNKVCQYQVSYGDG-------SFTFGDFSTETLTFRRT---------R 231
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFD 198
V CG + +GL G G+ GLGR +++ P Q F+ +KF+ CL +
Sbjct: 232 VTRVALGCGHD--NEGLFIGAAGLLGLGRGRLSFPVQTGRRFN--QKFSYCLVDRSASAK 287
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGK 257
++VFGD VS+ ++TPL N K++T Y++ + + V G
Sbjct: 288 PSSVVFGDSA--------VSRTARFTPLIKNPKLDTF----------YYLELLGISVGGS 329
Query: 258 AVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
V L+ +L +D G GG I + T L Y AL AF + R A + F
Sbjct: 330 PVRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFD 389
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNPR 375
CF L G T + VP + L + +V S+ N ++ + + C F G
Sbjct: 390 TCFDLS--GLTEVK--VPTVVLHFRGADV--SLPATNYLIPVDNSGSFCFAFA--GTMSG 441
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFS 402
SI IG Q + + FDLA SR+GF+
Sbjct: 442 LSI-IGNIQQQGFRVSFDLAGSRVGFA 467
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 88/384 (22%), Positives = 151/384 (39%), Gaps = 74/384 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ + TP P + G +++W C+ SS+ T C S
Sbjct: 94 EYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 153
Query: 91 QCNLANAKACGGGIC----GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
C + C C G G + T G + + L+ S V++PN
Sbjct: 154 LCQALQSPTCSNNSCQYTYGYG-----DGSETQGSMGTETLTFGS---------VSIPNI 199
Query: 147 IFLCGSEFVLQGLASGV-VGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF---DDGAI 202
F CG QG G G+ G+GR ++LPSQL KF+ C++P + +
Sbjct: 200 TFGCGENN--QGFGQGNGAGLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSNSSTL 252
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
+ G L N + + T + +++ T Y+I + + V +P++
Sbjct: 253 LLGS-----LANSVTAGSPNTTLIQSSQIPTF----------YYITLNGLSVGSTPLPID 297
Query: 263 KTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
++ ++ N G GG I + T + Y+A+ QAF S M + F CF++
Sbjct: 298 PSVFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQM 357
Query: 322 -QDIGFTRIGPVVPQI---DLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS 377
D +I V DLVL ++N + I N ++ L +G G++
Sbjct: 358 PSDQSNLQIPTFVMHFDGGDLVLPSEN--YFISPSNGLI-----CLAMGSSSQGMS---- 406
Query: 378 IVIGARQLENNLLQFDLATSRLGF 401
+ G Q +N L+ +D S + F
Sbjct: 407 -IFGNIQQQNLLVVYDTGNSVVSF 429
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 146/371 (39%), Gaps = 51/371 (13%)
Query: 43 LQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYVSSTNKTAR--CGSAQCN-LANA 97
+Y+ + TP V + G +++W+ C+ + +++ AR C + C L +A
Sbjct: 74 FEYLMALDVSTPPVRMLALADTGSSLVWLKCKLPAAHTPASSSYARLPCDAFACKALGDA 133
Query: 98 KACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEF--- 154
+C TG+ +I V DG VTV F F +F
Sbjct: 134 ASCRA-------------TGSGNNI--CVYRYAFADGSCTAGPVTVDAFTFSTRLDFGCA 178
Query: 155 -VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLN 213
+GL+ G+ GL ++L SQL+A KF+ CL P+ V +
Sbjct: 179 TRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSSSLNFGSHA 238
Query: 214 NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL--LSIDNE 271
S TPL G Y I + S+ V GK VPL T L +D+
Sbjct: 239 IVSSSPGAATTPLVA----------GRNKSFYTIALDSIKVAGKPVPLQTTTTKLIVDS- 287
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV-APVAPFGACFRLQDIGFTRIG 330
GT + T L ++ LV A +A+ K+ RV +P + C+ ++ +G
Sbjct: 288 ---GTML------TYLPKAVLDPLVAALTAAI-KLPRVKSPETLYAVCYDVRRRAPEDVG 337
Query: 331 PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLL 390
+P + LVL V G +V+ G +CL V+ + ++G +N +
Sbjct: 338 KSIPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHL---PEFILGNVAQQNLHV 394
Query: 391 QFDLATSRLGF 401
FDL + F
Sbjct: 395 GFDLERRTVSF 405
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 147/386 (38%), Gaps = 65/386 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG----------YV---SSTNKTARCGSA 90
QY TP L V G ++LWV C YV SST C S+
Sbjct: 63 QYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSNSSTFSPVPCLSS 122
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRID--VLSIQST--DGRNPGRAVTVPNF 146
C L A G C + + D V + +S DG V +
Sbjct: 123 DCLLIPATE--GFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVDG------VRIDKV 174
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-----GA 201
F CGS+ QG + G+ GLG+ ++ SQ+ A+ KFA CL + D +
Sbjct: 175 AFGCGSD--NQGSFAAAGGVLGLGQGPLSFGSQVGYAYG--NKFAYCLVNYLDPTSVSSS 230
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
++FGD +++YTP+ N + P++ Y++ + V V GK++P+
Sbjct: 231 LIFGD------ELISTIHDMQYTPIVSNPKS--------PTL-YYVQIEKVTVGGKSLPI 275
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
+ + ID G GG+ + T S Y ++ AF S + R V C L
Sbjct: 276 SDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGV-HYPRAESVQGLDLCVEL 334
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD-----GGVNPRT 376
+ P P + + V+ +N V + + CL GG N
Sbjct: 335 TGVD----QPSFPSFTIEF-DDGAVFQPEAENYFVDVAPNVRCLAMAGLASPLGGFN--- 386
Query: 377 SIVIGARQLENNLLQFDLATSRLGFS 402
IG +N +Q+D + +GF+
Sbjct: 387 --TIGNLLQQNFFVQYDREENLIGFA 410
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 88/384 (22%), Positives = 150/384 (39%), Gaps = 74/384 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ + TP P + G +++W C+ SS+ T C S
Sbjct: 94 EYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 153
Query: 91 QCNLANAKACGGGIC----GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
C + C C G G + T G + + L+ S V++PN
Sbjct: 154 LCQALQSPTCSNNSCQYTYGYG-----DGSETQGSMGTETLTFGS---------VSIPNI 199
Query: 147 IFLCGSEFVLQGLASGV-VGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF---DDGAI 202
F CG QG G G+ G+GR ++LPSQL KF+ C++P +
Sbjct: 200 TFGCGENN--QGFGQGNGAGLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSTSSTL 252
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
+ G L N + + T + +++ T Y+I + + V +P++
Sbjct: 253 LLG-----SLANSVTAGSPNTTLIESSQIPTF----------YYITLNGLSVGSTPLPID 297
Query: 263 KTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
++ ++ N G GG I + T + Y+A+ QAF S M + F CF++
Sbjct: 298 PSVFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQM 357
Query: 322 -QDIGFTRIGPVVPQI---DLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS 377
D +I V DLVL ++N + I N ++ L +G G++
Sbjct: 358 PSDQSNLQIPTFVMHFDGGDLVLPSEN--YFISPSNGLI-----CLAMGSSSQGMS---- 406
Query: 378 IVIGARQLENNLLQFDLATSRLGF 401
+ G Q +N L+ +D S + F
Sbjct: 407 -IFGNIQQQNLLVVYDTGNSVVSF 429
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 87/383 (22%), Positives = 157/383 (40%), Gaps = 70/383 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST----------------NKTARC 87
+Y+ QI P+ L G ++ W+ C+ +T C
Sbjct: 147 EYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSC 206
Query: 88 GSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
S QC L + C C G G + T G++ + LS G +
Sbjct: 207 NSQQCKLLDKANCNSDTCIYQVHYGDG-------SFTTGELATETLSF--------GNSN 251
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
++PN CG + +GL +G G+ GLG ++L SQL A+ F+ CL D +
Sbjct: 252 SIPNLPIGCGHDN--EGLFAGGAGLIGLGGGAISLSSQLKAS-----SFSYCLVNLDSDS 304
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
+ N++ S +L +PL N + ++ V + V GK +P+
Sbjct: 305 -----SSTLEFNSYMPSDSLT-SPLVKNDRFHS---------YRYVKVVGISVGGKTLPI 349
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
+ T ID G+GG + + + L + +Y++L +AF ++ ++ F C+
Sbjct: 350 SPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNF 409
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG-GDALCLGFVDGGVNPRTSI-V 379
G + + VP I VL ++ + +N ++ + CL F + ++S+ +
Sbjct: 410 S--GQSNVE--VPTIAFVL-SEGTSLRLPARNYLIMLDTAGTYCLAF----IKTKSSLSI 460
Query: 380 IGARQLENNLLQFDLATSRLGFS 402
IG+ Q + + +DL S +GFS
Sbjct: 461 IGSFQQQGIRVSYDLTNSIVGFS 483
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 112/266 (42%), Gaps = 34/266 (12%)
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
V++PN F CG + G G G+ GLGR ++L SQL A KF+ CL+ DD
Sbjct: 198 VSIPNVGFGCGEDNEGDGFTQGS-GLVGLGRGPLSLVSQLKEA-----KFSYCLTSIDDT 251
Query: 201 AI-VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
G +N S ++ TPL N + +PS Y++ + + V G +
Sbjct: 252 KTSTLLMGSLASVNG--TSAAIRTTPLIQNPL--------QPSF-YYLSLEGISVGGTRL 300
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
P+ ++ + ++G GG I + T LE S + + + F S M + C+
Sbjct: 301 PIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCY 360
Query: 320 RL-QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV---QIGGDALCLGFVDGGVNPR 375
L D VP+ LVL + G+N M+ +G L +G GG++
Sbjct: 361 NLPSDTSELE----VPK--LVLHFTGADLELPGENYMIADSSMGVICLAMG-SSGGMS-- 411
Query: 376 TSIVIGARQLENNLLQFDLATSRLGF 401
+ G Q +N + DL L F
Sbjct: 412 ---IFGNVQQQNMFVSHDLEKETLSF 434
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 99/403 (24%), Positives = 153/403 (37%), Gaps = 82/403 (20%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC---EK---------------GYVSST 81
T T Y T+IK TP + V G +ILWV+C EK SS+
Sbjct: 79 TDTGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSS 138
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDG 134
T C C A GG + G + P + +GD D L G
Sbjct: 139 GSTVSCDQGFC----AATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTG 194
Query: 135 R---NPGRAVTVPNFIFLCGSEFVLQGLASG-----VVGIAGLGRSKVALPSQLAAAFSL 186
PG A F CG++ QG G + GI G G++ ++ SQLAAA +
Sbjct: 195 DGQTQPGNATVT----FGCGAQ---QGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKV 247
Query: 187 KRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
K+ FA CL G I + + N V +K TPL + + Y
Sbjct: 248 KKIFAHCLDTIKGGGI-------FAIGNV-VQPKVKTTPLVADMPH------------YN 287
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
+ + S+ V G + L + GT I + T L ++K ++ A + +
Sbjct: 288 VNLKSIDVGGTTLQLPAHVFETGER--KGTIIDSGTTLTYLPELVFKEVMAAIFNKHQDI 345
Query: 307 ARVAPVAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL 363
V F CF+ D GF P I ++ ++ ++ G D
Sbjct: 346 V-FHNVQDF-MCFQYPGSVDDGF-------PTITFHFED-DLALHVYPHEYFFPNGNDMY 395
Query: 364 CLGFVDGGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
C+GF +G + + +++G L N L+ +DL +G++D
Sbjct: 396 CVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGWTD 438
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 93/395 (23%), Positives = 164/395 (41%), Gaps = 74/395 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNL---------- 94
Y T+++ TP + V G ++LWV C +S N + Q L
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSC-----ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 95 -----ANAKACGGGICGAGPDNPISN------------TGTHGDIRIDVLSIQSTDGRNP 137
+ + C GI + + N +GT G DVL G +
Sbjct: 136 ASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 138 GRAVTVPNFIFLCGSEFVLQGLAS--GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS 195
T P +F C + + S V GI G G+ +++ SQLA+ R F+ CL
Sbjct: 196 VPNSTAP-VVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLK 254
Query: 196 PFDDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
+ G +V G+ V N+ +TPL ++ + Y + + S+
Sbjct: 255 GENGGGGILVLGE---------IVEPNMVFTPLVPSQPH------------YNVNLLSIS 293
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
VNG+A+P+N ++ S N GT I T L + Y V+A +A+ + R PV
Sbjct: 294 VNGQALPINPSVFSTSNG--QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR--PVV 349
Query: 314 PFG-ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGFV 368
G C+ + T +G + P + L ++ ++ Q+ ++Q +GG A+ C+GF
Sbjct: 350 SKGNQCYVIT----TSVGDIFPPVSLNFAGGASMF-LNPQDYLIQQNNVGGTAVWCIGFQ 404
Query: 369 DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
+ + ++G L++ + +DL R+G+++
Sbjct: 405 R--IQNQGITILGDLVLKDKIFVYDLVGQRIGWAN 437
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 83/389 (21%), Positives = 145/389 (37%), Gaps = 69/389 (17%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGY-------------------VSSTNKTARCGSAQC- 92
TP V + + G + W+ C G S+T CGS QC
Sbjct: 71 TPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGSTQCS 130
Query: 93 --NLANAKACGGGI--CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
+L +C G C + + G + DV ++ G A + +
Sbjct: 131 SRDLPAPPSCDGASRQCHVSLSY-ADGSASDGALATDVFAV--------GEAPPLRSAFG 181
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF---G 205
+ + G+ G+ R ++ +Q + R+F+ C+S DD ++
Sbjct: 182 CMSTAYDSSPDGVATAGLLGMNRGTLSFVTQAS-----TRRFSYCISDRDDAGVLLLGHS 236
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
D P+ LN YTPL+ + + V Y + + + V GKA+P+ ++
Sbjct: 237 DLPFLPLN---------YTPLYQPTLPLP--YFDR--VAYSVQLLGIRVGGKALPIPASV 283
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG------ACF 319
L+ D+ G G T + + +T L Y AL F + R F CF
Sbjct: 284 LAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCF 343
Query: 320 RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD------ALCLGFVDGGVN 373
R+ G +P + L+ + S+ G + ++ G+ CL F + +
Sbjct: 344 RVP-AGRPPPSARLPPVTLLFNGAEM--SVAGDRLLYKVPGEHRGADGVWCLTFGNADMV 400
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFS 402
P T+ VIG N +++DL R+G +
Sbjct: 401 PLTAYVIGHHHQMNLWVEYDLERGRVGLA 429
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 152/379 (40%), Gaps = 57/379 (15%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------KGYVSSTNKTARCGSAQC 92
Y+ ++ TP + G NI W+ C + SST C S QC
Sbjct: 124 YIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQPFEPSKSSTYNYLTCASQQC 183
Query: 93 NLANAKACGGGICGAGPDNPI--SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
L +C DN + S T +GD + +V I S++ + G + V NF+F C
Sbjct: 184 QLLR-------VCTKS-DNSVNCSLTQRYGD-QSEVDEILSSETLSVG-SQQVENFVFGC 233
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFDD---GAIVFGD 206
+ +GL + G GR+ ++ SQ A + F+ CL S F G+++ G
Sbjct: 234 SN--AARGLIQRTPSLVGFGRNPLSFVSQTATLY--DSTFSYCLPSLFSSAFTGSLLLGK 289
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
++ LK+TPL N PS Y++G+ + V + V + L
Sbjct: 290 EAL-------SAQGLKFTPLLSNSR--------YPSF-YYVGLNGISVGEELVSIPAGTL 333
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ--DI 324
S+D GT I + T L Y A+ +F S + + +P F C+ D+
Sbjct: 334 SLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNRPSGDV 393
Query: 325 GFTRIG-PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGAR 383
F I +DL L N+++ + S++ L G GG + S G
Sbjct: 394 EFPLITLHFDDNLDLTLPLDNILYPGNDDGSVL-----CLAFGLPPGGGDDVLS-TFGNY 447
Query: 384 QLENNLLQFDLATSRLGFS 402
Q + + D+A SRLG +
Sbjct: 448 QQQKLRIVHDVAESRLGIA 466
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 64/260 (24%), Positives = 111/260 (42%), Gaps = 45/260 (17%)
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSK 219
S G+ G+ R ++L +Q++ KF+ C+S D G ++ GDG D
Sbjct: 185 SKTTGLMGMNRGSLSLVTQMSLP-----KFSYCISGEDALGVLLLGDGT-------DAPS 232
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
L+YTPL T+S + V Y + + + V+ K + L K++ D+ G G T +
Sbjct: 233 PLQYTPLVT--ATTSSPYFNR--VAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVD 288
Query: 280 TVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPV------- 332
+ +T L S+Y +L F G R++D F G +
Sbjct: 289 SGTQFTFLLGSVYSSLKDEFLEQTK-----------GVLTRIEDPNFVFEGAMDLCYHAP 337
Query: 333 -----VPQIDLVLQNKNVVWSIHGQNSMVQI--GGDAL-CLGFVDGGVNPRTSIVIGARQ 384
VP + LV + + G+ + ++ G D + C F + + + VIG
Sbjct: 338 ASFAAVPAVTLVFSGAEM--RVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHH 395
Query: 385 LENNLLQFDLATSRLGFSDS 404
+N ++FDL SR+GF+ +
Sbjct: 396 QQNVWMEFDLLKSRVGFTQT 415
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 93/395 (23%), Positives = 164/395 (41%), Gaps = 74/395 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNL---------- 94
Y T+++ TP + V G ++LWV C +S N + Q L
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSC-----ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 95 -----ANAKACGGGICGAGPDNPISN------------TGTHGDIRIDVLSIQSTDGRNP 137
+ + C GI + + N +GT G DVL G +
Sbjct: 136 ASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 138 GRAVTVPNFIFLCGSEFVLQGLAS--GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS 195
T P +F C + + S V GI G G+ +++ SQLA+ R F+ CL
Sbjct: 196 VPNSTAP-VVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLK 254
Query: 196 PFDDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
+ G +V G+ V N+ +TPL ++ + Y + + S+
Sbjct: 255 GENGGGGILVLGE---------IVEPNMVFTPLVPSQPH------------YNVNLLSIS 293
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
VNG+A+P+N ++ S N GT I T L + Y V+A +A+ + R PV
Sbjct: 294 VNGQALPINPSVFSTSNG--QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR--PVV 349
Query: 314 PFG-ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGFV 368
G C+ + T +G + P + L ++ ++ Q+ ++Q +GG A+ C+GF
Sbjct: 350 SKGNQCYVIT----TSVGDIFPPVSLNFAGGASMF-LNPQDYLIQQNNVGGTAVWCIGFQ 404
Query: 369 DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
+ + ++G L++ + +DL R+G+++
Sbjct: 405 R--IQNQGITILGDLVLKDKIFVYDLVGQRIGWAN 437
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 152/381 (39%), Gaps = 78/381 (20%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTAR 86
TLQYV + TP V L V G ++ WV C+ SS+
Sbjct: 139 TLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVP 198
Query: 87 CGSAQCN-LA-NAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
C +A C+ LA + C GG CG +T T G D L++ ++ +
Sbjct: 199 CAAASCSQLALYSNGCSGGQCGYVVSYGDGST-TTGVYSSDTLTLTGSN--------ALK 249
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
F+F CG QGL +GV G+ GLGR +L SQ ++ + F+ CL P +
Sbjct: 250 GFLFGCG--HAQQGLFAGVDGLLGLGRQGQSLVSQASSTY--GGVFSYCLPPTQNSVG-- 303
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
Y L + TPL TAS +P+ Y + + + V G+ + ++ +
Sbjct: 304 ----YISLGGPSSTAGFSTTPLL-----TAS---NDPTY-YIVMLAGISVGGQPLSIDAS 350
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----FGACFR 320
+ + G + T T L + Y AL AF +AM P AP C+
Sbjct: 351 VFA------SGAVVDTGTVVTRLPPTAYSALRSAFRAAMAPYGY--PSAPATGILDTCYD 402
Query: 321 LQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL----CLGFVDGGVNPR 375
FTR G V +P I + G + + +G + CL F G + +
Sbjct: 403 -----FTRYGTVTLPTISIAF----------GGGAAMDLGTSGILTSGCLAFAPTGGDSQ 447
Query: 376 TSIVIGARQLENNLLQFDLAT 396
SI +G Q + ++FD +T
Sbjct: 448 ASI-LGNVQQRSFEVRFDGST 467
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 82/393 (20%), Positives = 150/393 (38%), Gaps = 84/393 (21%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGY------------VSSTNKTARCGSAQCNL------ 94
TP V + + G + W+ C G S+T CGSA+C+
Sbjct: 69 TPPQNVTMVLDTGSELSWLLCATGRAAAAAADSFRPRASATFAAVPCGSARCSSRDLPAP 128
Query: 95 ----ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
A ++ C + A + + G + DV ++ G A + +
Sbjct: 129 PSCDAASRRCRVSLSYA------DGSASDGALATDVFAV--------GDAPPLRSAFGCM 174
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF---GDG 207
+ + A G+ G+ R ++ +Q + R+F+ C+S DD ++ D
Sbjct: 175 SAAYDSSPDAVATAGLLGMNRGALSFVTQAST-----RRFSYCISDRDDAGVLLLGHSDL 229
Query: 208 PYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS 267
P+ LN YTPL+ + + V Y + + + V GK +P+ ++L+
Sbjct: 230 PFLPLN---------YTPLY--QPTPPLPYFDR--VAYSVQLLGIRVGGKPLPIPPSVLA 276
Query: 268 IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP------------F 315
D+ G G T + + +T L Y A+ F + + P+ P F
Sbjct: 277 PDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEF------LKQTKPLLPALEDPSFAFQEAF 330
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD------ALCLGFVD 369
CFR+ G +P + L+ + S+ G + ++ G+ CL F +
Sbjct: 331 DTCFRVPK-GRPPPSARLPPVTLLFNGAQM--SVAGDRLLYKVPGERRGADGVWCLTFGN 387
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ P T+ VIG N +++DL R+G +
Sbjct: 388 ADMVPLTAYVIGHHHQMNLWVEYDLERGRVGLA 420
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 94/388 (24%), Positives = 146/388 (37%), Gaps = 77/388 (19%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
YV TP + V G + +W C+ SST K RC S
Sbjct: 90 YVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIRCSSPI 149
Query: 92 CNLANAKACGGG---ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
C C C + +G+ GDI D L++ S D G ++ P +
Sbjct: 150 CKRGEKTRCSSNRKRKCEY-EITYLDRSGSQGDISKDTLTLNSND----GSPISFPKIVI 204
Query: 149 LCGSEFVL--QGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP---------- 196
CG + L +GLASG++ G GR ++ SQL + S+ KF+ CL+
Sbjct: 205 GCGHKNSLTTEGLASGII---GFGRGNFSIVSQLGS--SIGGKFSYCLASLFSKANISSK 259
Query: 197 --FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
F D A+V G G + TPL S ++G YF + + V
Sbjct: 260 LYFGDMAVVSGHG-------------VVSTPLI------QSFYVG----NYFTNLEAFSV 296
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA-PVA 313
+ L + L DNE G I + + T L +Y L A S M K+ RV P
Sbjct: 297 GDHIIKLKDSSLIPDNE--GNAVIDSGSTITQLPNDVYSQLETAVIS-MVKLKRVKDPTQ 353
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN 373
C++ + VP I + +V ++ N+ +Q+ + +C F
Sbjct: 354 QLSLCYKTTLKKYE-----VPIITAHFRGADV--KLNAFNTFIQMNHEVMCFAFNSSAF- 405
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGF 401
+V G +N L+ +D + + F
Sbjct: 406 --PWVVYGNIAQQNFLVGYDTLKNIISF 431
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 110/256 (42%), Gaps = 39/256 (15%)
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF---GDGPYYDLNNFDVSK 219
G+ G+ R ++ SQ + R+F+ C+S DD ++ D P+ LN
Sbjct: 195 TAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVLLLGHSDLPFLPLN------ 243
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
YTPL+ A V Y + + + V GK +P+ ++L+ D+ G G T +
Sbjct: 244 ---YTPLY----QPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVD 296
Query: 280 TVNPYTVLETSIYKALVQAFASA----MPKV--ARVAPVAPFGACFRLQDIGFTRIGPV- 332
+ +T L Y AL F+ +P + A F CFR+ R P
Sbjct: 297 SGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQ---GRAPPAR 353
Query: 333 VPQIDLVLQNKNVVWSIHGQNSMVQI-----GGDAL-CLGFVDGGVNPRTSIVIGARQLE 386
+P + L+ + ++ G + ++ GGD + CL F + + P T+ VIG
Sbjct: 354 LPAVTLLFNGAQM--TVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQM 411
Query: 387 NNLLQFDLATSRLGFS 402
N +++DL R+G +
Sbjct: 412 NVWVEYDLERGRVGLA 427
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 90/386 (23%), Positives = 150/386 (38%), Gaps = 75/386 (19%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
Y+ TP + G +I+W+ CE SS+ K C S
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKL 146
Query: 92 CNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCG 151
C+ +C + ++ + GD+ +D LS++ST G V+ P + CG
Sbjct: 147 CHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTS----GSPVSFPKIVIGCG 202
Query: 152 SEF--VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP------------- 196
++ G +SG+V GLG V+L +QL + S+ KF+ CL P
Sbjct: 203 TDNAGTFGGASSGIV---GLGGGPVSLITQLGS--SIGGKFSYCLVPLLNKESNASSILS 257
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
F D A+V GDG + TPL + + V YF+ + + V
Sbjct: 258 FGDAAVVSGDG-------------VVSTPL-----------IKKDPVFYFLTLQAFSVGN 293
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV-APVAPF 315
K V + D+E G I + T++ + +Y L A + K+ RV P F
Sbjct: 294 KRVEFGGSSEGGDDE--GNIIIDSGTTLTLIPSDVYTNLESAVVDLV-KLDRVDDPNQQF 350
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
C+ L+ + P I + + +V +H ++ V I +C F +P+
Sbjct: 351 SLCYSLKSNEYD-----FPIITVHFKGADV--ELHSISTFVPITDGIVCFAFQP---SPQ 400
Query: 376 TSIVIGARQLENNLLQFDLATSRLGF 401
+ G +N L+ +DL + F
Sbjct: 401 LGSIFGNLAQQNLLVGYDLQQKTVSF 426
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 98/411 (23%), Positives = 157/411 (38%), Gaps = 87/411 (21%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-------EKGYV-------SSTNKTARC 87
T +Y+ + TP PV LT+ G +++W C ++G + SST+ RC
Sbjct: 91 TNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRC 150
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTD-----GRNP-GRAV 141
+ C +CG G G + +GD I V + S G N G V
Sbjct: 151 DAPVCRALPFTSCGRGGSSWG-ERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGV 209
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
+ F CG F + GIAG GR + +LPSQL F+ C + +
Sbjct: 210 SERRLTFGCG-HFNKGIFQANETGIAGFGRGRWSLPSQLGVT-----SFSYCFTSMFEST 263
Query: 202 ---IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
+ G P ++ ++ TPL + +PS+ YF+ + ++ V
Sbjct: 264 SSLVTLGVAP----AELHLTGQVQSTPLLRDP--------SQPSL-YFLSLKAITVGATR 310
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS----------------- 301
+P+ + + I + T L +Y+A+ F +
Sbjct: 311 IPIPERRQRLRE---ASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLC 367
Query: 302 -AMPKVARVAPVAPFGACFRLQDIGFTRIGPV-VPQI--------DLVLQNKNVVWSIHG 351
A+P A AP + FG +R G R PV VP++ D L +N V+ +G
Sbjct: 368 FALPSAA--APKSAFGWRWR----GRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYG 421
Query: 352 QNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
M L L GG + ++VIG Q +N + +DL L F+
Sbjct: 422 ARVMC------LVLDAATGGGD--QTVVIGNYQQQNTHVVYDLENDVLSFA 464
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 89/392 (22%), Positives = 161/392 (41%), Gaps = 68/392 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY------------------VSSTNKTAR 86
Y T+++ TP V + + G ++LWV C SST+
Sbjct: 78 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMIA 137
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNT-------GTHGDIRIDVLSIQSTDGRNPGR 139
C +CN N K C + +N S T GT G D++ + + +
Sbjct: 138 CSDQRCN--NGKQSSDATC-SSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTT 194
Query: 140 AVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
T P +F C ++ L V GI G G+ ++++ SQL++ R F+ CL
Sbjct: 195 NSTAP-VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGD 253
Query: 198 DDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
G +V G+ V N+ YT L +P Y + + S+ VN
Sbjct: 254 SSGGGILVLGE---------IVEPNIVYTSL----------VPAQP--HYNLNLQSISVN 292
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
G+ + ++ ++ + N GT + + L Y V A +A+P+ R V+
Sbjct: 293 GQTLQIDSSVFATSNS--RGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTV-VSRG 349
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGFVDGG 371
C+ + + + V PQ+ L + + Q+ ++Q IGG A+ C+GF
Sbjct: 350 NQCYLIT----SSVTDVFPQVSLNFAGGASMI-LRPQDYLIQQNSIGGAAVWCIGFQK-- 402
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
+ + ++G L++ ++ +DLA R+G+++
Sbjct: 403 IQGQGITILGDLVLKDKIVVYDLAGQRIGWAN 434
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 98/383 (25%), Positives = 155/383 (40%), Gaps = 67/383 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+Y T+I TP V + + G +I+W+ C + + AC
Sbjct: 125 EYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSP 184
Query: 104 ICGAGPDNPISNTG---------------THGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
+C D+P NT T GD + L+ + T V
Sbjct: 185 LCHR-LDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRT---------RVARVAL 234
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFDDGAIVF 204
CG + +GL G G+ GLGR +++ PSQ F+ KF+ CL + ++VF
Sbjct: 235 GCGHD--NEGLFVGAAGLLGLGRGRLSFPSQTGRRFN--HKFSYCLVDRSASSKPSSMVF 290
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKAVP-LN 262
GD VS+ ++TPL N K++T Y++ + + V G VP +
Sbjct: 291 GDSA--------VSRTARFTPLVSNPKLDTF----------YYVELLGISVGGTRVPGIT 332
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+L +D G GG I + T L Y A AF + + R + F CF L
Sbjct: 333 ASLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLS 392
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVD--GGVNPRTSIV 379
G T + VP + L + +V S+ N ++ + CL F GG++ +
Sbjct: 393 --GKTEVK--VPTVVLHFRGADV--SLPASNYLIPVDTSGNFCLAFAGTMGGLS-----I 441
Query: 380 IGARQLENNLLQFDLATSRLGFS 402
IG Q + + +DLA SR+GF+
Sbjct: 442 IGNIQQQGFRVVYDLAGSRVGFA 464
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 89/389 (22%), Positives = 158/389 (40%), Gaps = 56/389 (14%)
Query: 35 RVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SST 81
RV ++ +Y+ + TP + + G +++W C + S+T
Sbjct: 79 RVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDVKRSAT 138
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDI--RIDVLSIQS-TDGRNPG 138
+ C S++C ++ +C +C +GD VL+ ++ T G
Sbjct: 139 YRALPCRSSRCAALSSPSCFKKMC--------VYQYYYGDTASTAGVLANETFTFGAASS 190
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF- 197
V N F CGS + G + G+ G GR ++L SQL + +F+ CL+ +
Sbjct: 191 TKVRAANISFGCGS--LNAGELANSSGMVGFGRGPLSLVSQLGPS-----RFSYCLTSYL 243
Query: 198 --DDGAIVFGDGPYYDLNNFDVSKN--LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
+ FG + +LN+ + S ++ TP IN P++ YF+ V +
Sbjct: 244 SPTPSRLYFGV--FANLNSTNTSSGSPVQSTPFVINP--------ALPNM-YFLSVKGIS 292
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
+ K +P++ + +I+++G GG I + T L+ Y+A+ + AS +P A
Sbjct: 293 LGTKRLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDTDI 352
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGV 372
CF+ + VP D V ++ +N M+ LCL V
Sbjct: 353 GLDTCFQWPPPPNVTV--TVP--DFVFHFDGANMTLPPENYMLIASTTGYLCLAMAPTSV 408
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGF 401
+IG Q +N L +D+A S L F
Sbjct: 409 G----TIIGNYQQQNLHLLYDIANSFLSF 433
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 130/305 (42%), Gaps = 52/305 (17%)
Query: 39 DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------KGYVSSTNKTAR 86
DT + +++ TP + + G ++ W+ CE +SST+KT
Sbjct: 115 DTYEYLHYAEVEVGTPSSKFLVALDTGSDLFWLPCECKLCAKNGSTMYSPSLSSTSKTVP 174
Query: 87 CGSAQCNLANAKACGGGICGAGP---DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
CG C +A A G + P +NTG+ G + DVL + G G+AV
Sbjct: 175 CGHPLCERPDACATAGKSSSSCPYEVKYVSANTGSSGVLVEDVLHLVDGGGGGGGKAVQA 234
Query: 144 PNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAA-FSLKRKFALCLSPFDD 199
P +F CG + L+G A+G G+ GLG KV++PS LA++ F++C S
Sbjct: 235 P-IVFGCGQVQTGAFLRGAAAG--GLMGLGLDKVSVPSALASSGLVASDSFSMCFSRDGV 291
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
G I FGD S + TPL A+G L +PS Y I V ++ V+ KA+
Sbjct: 292 GRINFGDA---------GSPDQAETPLI------AAGSL-QPSY-YNISVGAITVDSKAM 334
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV--APVAPFGA 317
+ T + + + +T L+ Y L F S + + + + F
Sbjct: 335 AVEFTAV-----------VDSGTSFTYLDDPAYTFLTTNFNSRVSEASETYGSGYEKFEF 383
Query: 318 CFRLQ 322
C+RL
Sbjct: 384 CYRLS 388
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 110/265 (41%), Gaps = 50/265 (18%)
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSK 219
S G+ G+ R ++ +Q+ KF+ C+S FD G ++ G+ +F K
Sbjct: 192 SKTTGLIGMNRGSLSFVNQMGYP-----KFSYCISGFDSAGVLLLGNA------SFPWLK 240
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
L YTPL +++T + V Y + + + V K + L K++ D+ G G T +
Sbjct: 241 PLSYTPLV--QISTPLPYFDR--VAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVD 296
Query: 280 TVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPV------- 332
+ +T L +Y AL F S + +V L D F G +
Sbjct: 297 SGTQFTFLLGPVYTALKNEFLSQTRGILKV-----------LNDDNFVFQGAMDLCYLLD 345
Query: 333 --------VPQIDLVLQNKNVVWSIHGQNSMVQIGGDA------LCLGFVDGGVNPRTSI 378
+P + L+ Q + S+ G+ + ++ G+ C F + + +
Sbjct: 346 SSRPNLQNLPVVSLMFQGAEM--SVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAF 403
Query: 379 VIGARQLENNLLQFDLATSRLGFSD 403
VIG +N ++FDL SR+G +D
Sbjct: 404 VIGHHHQQNVWMEFDLEKSRIGLAD 428
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 144/382 (37%), Gaps = 60/382 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+++ + TP + V G +++W C+ SST T C S+
Sbjct: 117 EFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSS 176
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV-----PN 145
C+ C G T T+GD ST G T+ P
Sbjct: 177 LCSDLPTSTCTSAAKDCG------YTYTYGDA-------SSTQGVLAAETFTLAKTKLPG 223
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---I 202
F CG G G G+ GLGR ++L SQL KF+ CL+ DD + +
Sbjct: 224 VAFGCGDTNEGDGFTQGA-GLVGLGRGPLSLVSQLGLG-----KFSYCLTSLDDTSKSPL 277
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
+ G + + ++ TPL N +PS Y++ + ++ V +PL
Sbjct: 278 LLGSLAAISTDTASAAA-IQTTPLIKNP--------SQPSF-YYVTLKALTVGSTRIPLP 327
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+ ++ ++G GG + + T LE Y+ L +AFA+ M CF+
Sbjct: 328 GSAFAVQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAP 387
Query: 323 DIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMV-QIGGDALCLGFVDGGVNPRTSIVI 380
G + VP+ LVL + + +N MV ALCL + R +I
Sbjct: 388 ASGVDDV--EVPK--LVLHFDGGADLDLPAENYMVLDSASGALCLTV----MGSRGLSII 439
Query: 381 GARQLENNLLQFDLATSRLGFS 402
G Q +N +D+ L F+
Sbjct: 440 GNFQQQNIQFVYDVDKDTLSFA 461
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/283 (23%), Positives = 120/283 (42%), Gaps = 42/283 (14%)
Query: 140 AVTVPNFIFLCGSEFVLQGLASGV------VGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+VT P +F C + GL+S G+ G+ R ++ +QL + KF+ C
Sbjct: 166 SVTRPGTLFGC----MDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS-----KFSYC 216
Query: 194 LSPFDDGA-IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
+S D ++ GD Y L ++YTPL + +T + V Y + + +
Sbjct: 217 ISGSDSSVFLLLGDASYSWLGP------IQYTPLVLQ--STPLPYFDR--VAYTVQLEGI 266
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
V K + L K++ D+ G G T + + +T L +Y AL F + V R+
Sbjct: 267 RVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDD 326
Query: 313 APF------GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG------ 360
F C+++ + P + L+ + + S+ GQ + ++ G
Sbjct: 327 PDFVFQGTMDLCYKVGSTTRPNFSGL-PMVSLMFRGAEM--SVSGQKLLYRVNGAGSEGK 383
Query: 361 -DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ C F + + + VIG +N ++FDLA SR+GF+
Sbjct: 384 EEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFA 426
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 95/396 (23%), Positives = 155/396 (39%), Gaps = 81/396 (20%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCG 88
T +Y+ + TP PV+LT+ G +++W C+ SSTN C
Sbjct: 32 TTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCE 91
Query: 89 SAQCNL-----------ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNP 137
S QC L + C + ++GD + + + + D
Sbjct: 92 STQCKLDPTVTVCVKLNQTVQTC-------------AYYTSYGDNSV-TIGLLAADKFTF 137
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
++P F CG S GIAG GR ++LPSQL F+ C +
Sbjct: 138 VAGTSLPGVTFGCGLNNT-GVFNSNETGIAGFGRGPLSLPSQLKVG-----NFSHCFTTI 191
Query: 198 DDGAI---VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
GAI V D P +N + ++ TPL N A+ L Y++ + + V
Sbjct: 192 -TGAIPSTVLLDLPADLFSNGQGA--VQTTPLIQYAKNEANPTL------YYLSLKGITV 242
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
+P+ ++ ++ N G GGT I + T L +Y+ + FA+ + PV P
Sbjct: 243 GSTRLPVPESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI-----KLPVVP 296
Query: 315 FGA-----CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA----LCL 365
A CF ++ P VP+ LVL + + +N + ++ DA +CL
Sbjct: 297 GNATGHYTCFSAP----SQAKPDVPK--LVLHFEGATMDLPRENYVFEVPDDAGNSIICL 350
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G + +IG Q +N + +DL + L F
Sbjct: 351 AINKG----DETTIIGNFQQQNMHVLYDLQNNMLSF 382
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 152/381 (39%), Gaps = 78/381 (20%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTAR 86
TLQYV + TP V L V G ++ WV C+ SS+
Sbjct: 128 TLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVP 187
Query: 87 CGSAQCN-LA-NAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
C +A C+ LA + C GG CG +T T G D L++ ++ +
Sbjct: 188 CAAASCSQLALYSNGCSGGQCGYVVSYGDGST-TTGVYSSDTLTLTGSN--------ALK 238
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
F+F CG QGL +GV G+ GLGR +L SQ ++ + F+ CL P +
Sbjct: 239 GFLFGCG--HAQQGLFAGVDGLLGLGRQGQSLVSQASSTYG--GVFSYCLPPTQNSVG-- 292
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
Y L + TPL TAS +P+ Y + + + V G+ + ++ +
Sbjct: 293 ----YISLGGPSSTAGFSTTPLL-----TAS---NDPTY-YIVMLAGISVGGQPLSIDAS 339
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----FGACFR 320
+ + G + T T L + Y AL AF +AM P AP C+
Sbjct: 340 VFA------SGAVVDTGTVVTRLPPTAYSALRSAFRAAMAPYGY--PSAPATGILDTCYD 391
Query: 321 LQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL----CLGFVDGGVNPR 375
FTR G V +P I + G + + +G + CL F G + +
Sbjct: 392 -----FTRYGTVTLPTISIAF----------GGGAAMDLGTSGILTSGCLAFAPTGGDSQ 436
Query: 376 TSIVIGARQLENNLLQFDLAT 396
SI +G Q + ++FD +T
Sbjct: 437 ASI-LGNVQQRSFEVRFDGST 456
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 88/376 (23%), Positives = 148/376 (39%), Gaps = 67/376 (17%)
Query: 54 PLVPVKLTVHLGGNILWVDCEK-------------------GYVSSTNKTARCGSAQCNL 94
P V L + G +++W C+ +SST ++ C S+ C+
Sbjct: 92 PPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACSA 151
Query: 95 ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQST--DGRNPGR------------- 139
A++ +C D P+ + T S DG R
Sbjct: 152 AHSNLPTSDLCAIA-DCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATP 210
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA-AFSLKRKFALCL--SP 196
++++ NF F C + + VG+AG GR ++LP+QLA+ A L +F+ CL
Sbjct: 211 SLSLHNFTFGCAHTALAEP-----VGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSHS 265
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTA---SGFLGEPSVEYF--IGVTS 251
F+ + P ++ D K +NK + + L P YF +G+
Sbjct: 266 FNSDRLRL-PSPLILGHSDDKEKR-------VNKDDVQFVYTSMLDNPKHPYFYCVGLEG 317
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+ + K +P + L +D EG GG + + +T+L S+Y ++V F + +V RV
Sbjct: 318 ISIGKKKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDN---RVGRVYE 374
Query: 312 VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGG 371
A ++D T +GP D V+ ++V G S V + F+DGG
Sbjct: 375 RAK-----EVEDK--TGLGPCY-YYDTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGG 426
Query: 372 VNPRTSIVIGARQLEN 387
R +G L N
Sbjct: 427 DGVRRKRRVGCLMLMN 442
>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
Length = 429
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 90/356 (25%), Positives = 150/356 (42%), Gaps = 68/356 (19%)
Query: 79 SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLS--IQSTDGRN 136
SS N C + C + + + G+C P P S T G + + L+ I + G
Sbjct: 100 SSDNSHDPCAAVGCAIPSFMS---GLC-TRPCPPFSYTYGGGALVLGSLAKDIVTLHGSI 155
Query: 137 PGRAV--TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
G A+ VP F F C + + +GIAG G+ ++LPSQL L + F+ C
Sbjct: 156 FGIAILLDVPGFCFGCVGSSIRE-----PIGIAGFGKGILSLPSQLGF---LDKGFSHCF 207
Query: 195 -------SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFI 247
+P +++ GD + +TP+ + N P+ Y+I
Sbjct: 208 LGFRFARNPNFTSSLIMGD------LALSAKDDFLFTPMLKSITN--------PNF-YYI 252
Query: 248 GVTSVHV-NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--P 304
G+ V + +G A+ +L SID+EG GG + T YT L Y A++ + AS +
Sbjct: 253 GLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSLASVILYE 312
Query: 305 KVARVAPVAPFGACFRL--------QD----IGFTRIGPVVPQIDLVLQNKNVVWSIHGQ 352
+ + F CF++ QD I F +G + L L + +++
Sbjct: 313 RSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLG----DVKLTLPKDSCYYAVTAP 368
Query: 353 NSMVQIGGDALCLGF--VD-----GGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+ V + CL F +D GG N V+G+ Q++N + +D+ R+GF
Sbjct: 369 KNSVVVK----CLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGF 420
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 103/390 (26%), Positives = 152/390 (38%), Gaps = 60/390 (15%)
Query: 42 TLQYVTQI-----KQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNK 83
TL YVT I +P + + V G ++ WV C+ S+T
Sbjct: 182 TLNYVTTIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYA 241
Query: 84 TARCGSAQCNLANAKACGG--GICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
RC ++ C A+ KA G G CG G + +GD + +TD G A
Sbjct: 242 AVRCNASAC-AASLKAATGTPGSCGGGNERCYYAL-AYGDGSFS-RGVLATDTVALGGA- 297
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
++ F+F CG +GL G G+ GLGR++++L SQ A + F+ CL A
Sbjct: 298 SLDGFVFGCG--LSNRGLFGGTAGLMGLGRTELSLVSQTALRYG--GVFSYCLP-----A 348
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKAV 259
GD D S TP+ ++ + +P+ YF+ VT V G
Sbjct: 349 TTSGDASGSLSLGGDASSYRNTTPVAYTRM------IADPAQPPFYFLNVTGAAVGG--- 399
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----F 315
T L+ G I + T L S+Y+ + F A P AP
Sbjct: 400 ----TALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQF--AAAGYPTAPGFSIL 453
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
C+ L G + VP + L L+ V G +V+ G +CL
Sbjct: 454 DTCYDL--TGHDEV--KVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYED 509
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFSDS 404
+T I IG Q +N + +D SRLGF+D
Sbjct: 510 QTPI-IGNYQQKNKRVVYDTVGSRLGFADE 538
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 135/323 (41%), Gaps = 52/323 (16%)
Query: 87 CGSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA 140
C S QC+L + AC C G G + T G++ + S + ++
Sbjct: 242 CDSEQCHLLDEAACDANSCIYEVEYGDG-------SFTVGELATETFSFRHSN------- 287
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
++PN CG + +GL G G+ GLG ++L SQL A F+ CL D
Sbjct: 288 -SIPNLPIGCGHDN--EGLFVGADGLIGLGGGAISLSSQLEAT-----SFSYCLVDLDSE 339
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
+ D N S +L +PL N P+ Y + V + V GK +P
Sbjct: 340 S-----SSTLDFNADQPSDSLT-SPLVKNDR--------FPTFRY-VKVIGMSVGGKPLP 384
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
++ + ID G GG + + T + + +Y L AF + V+PF C+
Sbjct: 385 ISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYD 444
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRTSIV 379
L ++ VP I +L +N + + +N ++Q+ CL F+ P + +
Sbjct: 445 LS----SQSNVEVPTIAFILPGENSLQ-LPAKNCLIQVDSAGTFCLAFLPSTF-PLS--I 496
Query: 380 IGARQLENNLLQFDLATSRLGFS 402
IG Q + + +DLA S +GFS
Sbjct: 497 IGNVQQQGIRVSYDLANSLVGFS 519
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 86/423 (20%), Positives = 158/423 (37%), Gaps = 108/423 (25%)
Query: 54 PLVPVKLTVHLGGNILW--------VDCEKGY----------------VSSTNKTARCGS 89
P P+ L + G +++W + CE Y S + K+ C +
Sbjct: 83 PPQPISLYMDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPPNITSSASVSCKSPACSA 142
Query: 90 AQCNLANAKACGGGICGAGPDNPISNTG-----------THGDIRIDVLSIQSTDGRNPG 138
A +L+++ C C P I + +GD + + +
Sbjct: 143 AHTSLSSSDLCAMARC---PLELIETSDCSSFSCPPFYYAYGDGSLVARLYRDSLSMPAS 199
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFS--LKRKFALCLSP 196
+ + NF F C + + VG+AG GR ++LP+QLA+ FS L +F+ CL
Sbjct: 200 SPLVLHNFTFGCAHTALGE-----PVGVAGFGRGVLSLPAQLAS-FSPHLGNQFSYCLVS 253
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVN----------------TASGFLGE 240
++FD + + +PL + + + + L
Sbjct: 254 ----------------HSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDN 297
Query: 241 PSVEYF--IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQA 298
P YF +G+ + V + +P+ + L +D G GG + + +T+L +Y++LV
Sbjct: 298 PKHPYFYCVGLEGITVGNRKIPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTE 357
Query: 299 FASAMPKVAR----VAPVAPFGACFRLQDIGFTRIGPVVPQIDL--------VLQNKNVV 346
F M +V + + G C+ D VP + L +L N
Sbjct: 358 FNHRMGRVYKRATQIEERTGLGPCYYSDDSAAK-----VPAVALHFVGNSTVILPRNNYY 412
Query: 347 WSI----HGQNSMVQIGGDALCLGFVDGGVNPRT---SIVIGARQLENNLLQFDLATSRL 399
+ GQ ++G CL ++GG + + +G Q + + +DL R+
Sbjct: 413 YEFFDGRDGQKKKRKVG----CLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRV 468
Query: 400 GFS 402
GF+
Sbjct: 469 GFA 471
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 150/372 (40%), Gaps = 52/372 (13%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTA--------------RCGSA 90
YVT++ TP + V G ++ W+ C VS ++ C +
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ C N ++GD V + S D + G + +VPNF + C
Sbjct: 189 QCSDLTTATLNPASCST--SNVCIYQASYGDSSFSVGYL-SKDTVSFG-STSVPNFYYGC 244
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL G+ GL R+K++L QLA S+ F+ CL + + Y
Sbjct: 245 GQDN--EGLFGQSAGLIGLARNKLSLLYQLAP--SMGYSFSYCLPTSSSSSSGYLSIGSY 300
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN 270
+ + YTP+ + ++ + YFI +T + V GK PL+ + + +
Sbjct: 301 NPGQY------SYTPMASSSLDDS---------LYFIKMTGIKVAGK--PLSVSSSAYSS 343
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIG 330
T I + T L T +Y AL +A A AM R + + CF+ Q
Sbjct: 344 LP---TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLR--- 397
Query: 331 PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLL 390
VP++ + + +N +V + CL F R++ +IG Q + +
Sbjct: 398 --VPEVTMAFAGGAALKLAA-RNLLVDVDSATTCLAFAPA----RSAAIIGNTQQQTFSV 450
Query: 391 QFDLATSRLGFS 402
+D+ S++GF+
Sbjct: 451 VYDVKNSKIGFA 462
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 150/372 (40%), Gaps = 52/372 (13%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTA--------------RCGSA 90
YVT++ TP + V G ++ W+ C VS ++ C +
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ C N ++GD V + S D + G + +VPNF + C
Sbjct: 189 QCSDLTTATLSPASCST--SNVCIYQASYGDSSFSVGYL-SKDTVSFG-STSVPNFYYGC 244
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL G+ GL R+K++L QLA S+ F+ CL + + Y
Sbjct: 245 GQDN--EGLFGQSAGLIGLARNKLSLLYQLAP--SMGYSFSYCLPTSSSSSSGYLSIGSY 300
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN 270
+ + YTP+ + ++ + YFI +T + V GK PL+ + + +
Sbjct: 301 NPGQY------SYTPMASSSLDDS---------LYFIKMTGIKVAGK--PLSVSSSAYSS 343
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIG 330
T I + T L T +Y AL +A A AM R + + CF+ Q
Sbjct: 344 LP---TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLR--- 397
Query: 331 PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLL 390
VP++ + + +N +V + CL F R++ +IG Q + +
Sbjct: 398 --VPEVTMAFAGGAALKLAA-RNLLVDVDSATTCLAFAPA----RSAAIIGNTQQQTFSV 450
Query: 391 QFDLATSRLGFS 402
+D+ S++GF+
Sbjct: 451 VYDVKNSKIGFA 462
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 87/381 (22%), Positives = 154/381 (40%), Gaps = 70/381 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y +++ P PV + + G ++ WV C + S++ + C +
Sbjct: 150 EYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFTSLSCETE 209
Query: 91 QCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
QC + C G C G G + T GD + +++ ST ++
Sbjct: 210 QCKSLDVSECRNGTCLYEVSYGDG-------SYTVGDFVTETVTLGST---------SLG 253
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
N CG +GL G G+ GLG ++ PSQL A+ F+ CL D +
Sbjct: 254 NIAIGCGHNN--EGLFIGAAGLLGLGGGSLSFPSQLNAS-----SFSYCLVDRDSDSTST 306
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKAVPLN 262
D N ++ + PL N P+++ +++G+T + V G +P+
Sbjct: 307 LDF------NSPITPDAVTAPLHRN-----------PNLDTFFYLGLTGMSVGGAVLPIP 349
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+T + +G GG + + T L+T++Y L AF + + VA F C+ L
Sbjct: 350 ETSFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLS 409
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNPRTSIVIG 381
+R+ VP + N N + + +N ++ + + C F T ++G
Sbjct: 410 SK--SRVE--VPTVSFHFANGNEL-PLPAKNYLIPVDSEGTFCFAF---APTDSTLSILG 461
Query: 382 ARQLENNLLQFDLATSRLGFS 402
Q + + FDLA S +GFS
Sbjct: 462 NAQQQGTRVGFDLANSLVGFS 482
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 97/394 (24%), Positives = 148/394 (37%), Gaps = 64/394 (16%)
Query: 32 LVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYV 78
L L+ T Y+ TP L + G ++ W+ C+ +
Sbjct: 124 LPLQSGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQ 183
Query: 79 SSTNKTARCGSAQC-----NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTD 133
SS+ KT C SA C + +N C G C N + + GD + L++ S
Sbjct: 184 SSSYKTLPCLSATCTELITSESNPTPCLLGGC-VYEINYGDGSSSQGDFSQETLTLGSD- 241
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
+ NF F CG GL G G+ GLG++ ++ PSQ + + +FA C
Sbjct: 242 --------SFQNFAFGCGHTNT--GLFKGSSGLLGLGQNSLSFPSQSKSKYG--GQFAYC 289
Query: 194 LSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
L F + + + + +TPL N + P+ YF+G+ +
Sbjct: 290 LPDFGSSTSTG----SFSVGKGSIPASAVFTPLVSNFM--------YPTF-YFVGLNGIS 336
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
V G + + +L G G T + + T L Y AL +F S + P +
Sbjct: 337 VGGDRLSIPPAVL-----GRGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFS 391
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQNK-NVVWSIHGQNSMVQIGGDALCLGFVDG-- 370
C+ L RI P I QN +V S G VQ GG +CL F
Sbjct: 392 ILDTCYDLSRHSQVRI----PTITFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQ 447
Query: 371 --GVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
G N +IG Q + + FD R+GF+
Sbjct: 448 MDGFN-----IIGNFQQQRMRVAFDTGAGRIGFA 476
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 87/381 (22%), Positives = 154/381 (40%), Gaps = 70/381 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y +++ P PV + + G ++ WV C + S++ + C +
Sbjct: 150 EYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFTSLSCETE 209
Query: 91 QCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
QC + C G C G G + T GD + +++ ST ++
Sbjct: 210 QCKSLDVSECRNGTCLYEVSYGDG-------SYTVGDFVTETVTLGST---------SLG 253
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
N CG +GL G G+ GLG ++ PSQL A+ F+ CL D +
Sbjct: 254 NIAIGCGHNN--EGLFIGAAGLLGLGGGSLSFPSQLNAS-----SFSYCLVDRDSDSTST 306
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKAVPLN 262
D N ++ + PL N P+++ +++G+T + V G +P+
Sbjct: 307 LDF------NSPITPDAVTAPLHRN-----------PNLDTFFYLGLTGMSVGGAVLPIP 349
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+T + +G GG + + T L+T++Y L AF + + VA F C+ L
Sbjct: 350 ETSFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLS 409
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNPRTSIVIG 381
+R+ VP + N N + + +N ++ + + C F T ++G
Sbjct: 410 SK--SRVE--VPTVSFHFANGNEL-PLPAKNYLIPVDSEGTFCFAF---APTDSTLSILG 461
Query: 382 ARQLENNLLQFDLATSRLGFS 402
Q + + FDLA S +GFS
Sbjct: 462 NAQQQGTRVGFDLANSLVGFS 482
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 100/400 (25%), Positives = 153/400 (38%), Gaps = 60/400 (15%)
Query: 22 TAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYVS 79
T P VL++A YV ++K TP + + + + WV C G+ S
Sbjct: 81 TTAVPIAPGQQVLKIAN------YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGFSS 134
Query: 80 ST---NKTARCGSAQCNLANAKACGGGICGA-GPDNPISNTGTHGDIRIDVLSIQSTDGR 135
+T N + GS C+ A G C A G + N GD + +Q D
Sbjct: 135 TTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATLVQ--DAI 192
Query: 136 NPGRAVTVPNFIFLCGSEFVLQGLASGVV---GIAGLGRSKVALPSQLAAAFSLKRKFAL 192
V +P F F C + ++ G + G+ GLGR ++L SQ A +S F+
Sbjct: 193 TLANDV-IPGFTFGC-----INAVSGGSIPPQGLLGLGRGPISLISQAGAMYS--GVFSY 244
Query: 193 CLSPFDDGAIVFGDGPYY-----DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFI 247
CL F YY L K+++ TPL N PS+ Y++
Sbjct: 245 CLPSFKS---------YYFSGSLKLGPVGQPKSIRTTPLLRNP--------HRPSL-YYV 286
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
+T V V VP+ L D GT I + T +Y A+ F +
Sbjct: 287 NLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NG 344
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLG 366
++ + F CF + P I L + N+V + +NS++ +L CL
Sbjct: 345 PISSLGAFDTCFAATNEA------EAPAITLHFEGLNLVLPM--ENSLIHSSSGSLACLS 396
Query: 367 FVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFSDSL 405
N + + VI Q +N + FD SRLG + L
Sbjct: 397 MAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIAREL 436
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 86/379 (22%), Positives = 153/379 (40%), Gaps = 61/379 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y +++ +P + + + G ++ WV C+ +S++ C S
Sbjct: 165 EYFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQ 224
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
+C + AC GA G++ GD + L++ G + V N
Sbjct: 225 RCRDLDTAACRNAT-GACLYEVAYGDGSYTVGDFATETLTL--------GDSTPVGNVAI 275
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---IVFG 205
CG + +GL G G+ LG ++ PSQ++A+ F+ CL D A + FG
Sbjct: 276 GCGHDN--EGLFVGAAGLLALGGGPLSFPSQISAS-----TFSYCLVDRDSPAASTLQFG 328
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
DG ++ T + T S Y++ ++ + V G+ + + +
Sbjct: 329 DG---------AAEAGTVTAPLVRSPRT--------STFYYVALSGISVGGQPLSIPASA 371
Query: 266 LSID-NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
++D G GG + + T L+++ Y AL AF P + R + V+ F C+ L D
Sbjct: 372 FAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSD- 430
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRTSIVIGAR 383
R VP + L + + + +N ++ + G CL F N SI IG
Sbjct: 431 ---RTSVEVPAVSLRFEGGGALR-LPAKNYLIPVDGAGTYCLAFAP--TNAAVSI-IGNV 483
Query: 384 QLENNLLQFDLATSRLGFS 402
Q + + FD A +GF+
Sbjct: 484 QQQGTRVSFDTARGAVGFT 502
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 82/392 (20%), Positives = 152/392 (38%), Gaps = 65/392 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSA 90
+Y + +P L + G ++ W+ C Y S++ K C
Sbjct: 169 EYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQ 228
Query: 91 QCNLANAKACGGGICGAGPDNPI----------------SNTGTHGDIRIDVLSIQSTDG 134
+CNL ++ PD P+ ++ T GD ++ ++ T
Sbjct: 229 RCNLVSS-----------PDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTN 277
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
V N +F CG +GL G G+ GLGR ++ SQL + + F+ CL
Sbjct: 278 GGSSELYNVENMMFGCGH--WNRGLFHGAAGLLGLGRGPLSFSSQLQSLYG--HSFSYCL 333
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
+ V + + + NL +T K N F Y++ + S+ V
Sbjct: 334 VDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTF-------YYVQIKSILV 386
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP---KVARVAP 311
G+ + + + +I ++G GGT I + + Y+ + A V R P
Sbjct: 387 AGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFP 446
Query: 312 VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGG 371
+ CF + I ++ P++ + + VW+ +NS + + D +CL +
Sbjct: 447 I--LDPCFNVSGIHNVQL----PELGIAFAD-GAVWNFPTENSFIWLNEDLVCLAMLG-- 497
Query: 372 VNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
P+++ +IG Q +N + +D SRLG++
Sbjct: 498 -TPKSAFSIIGNYQQQNFHILYDTKRSRLGYA 528
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 155/386 (40%), Gaps = 75/386 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y +++ +P + + + G ++ WV C+ +S++ C S
Sbjct: 168 EYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSP 227
Query: 91 QCNLANAKAC--GGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
+C + AC G C G G + T GD + L++ G +
Sbjct: 228 RCRDLDTAACRNATGACLYEVAYGDG-------SYTVGDFATETLTL--------GDSTP 272
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA- 201
V N CG + +GL G G+ LG ++ PSQ++A+ F+ CL D A
Sbjct: 273 VTNVAIGCGHDN--EGLFVGAAGLLALGGGPLSFPSQISAS-----TFSYCLVDRDSPAA 325
Query: 202 --IVFG-DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
+ FG DG D V+ L +P Y++ ++ + V G+A
Sbjct: 326 STLQFGADGAEADT----VTAPLVRSP--------------RTGTFYYVALSGISVGGQA 367
Query: 259 VPLNKTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+ + + ++D G GG + + T L++S Y AL AF P + R + V+ F
Sbjct: 368 LSIPSSAFAMDATSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDT 427
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRT 376
C+ L D R VP + L + + + +N ++ + G CL F N
Sbjct: 428 CYDLSD----RTSVEVPAVSLRFEGGGALR-LPAKNYLIPVDGAGTYCLAFAP--TNAAV 480
Query: 377 SIVIGARQLENNLLQFDLATSRLGFS 402
SI IG Q + + FD A +GF+
Sbjct: 481 SI-IGNVQQQGTRVSFDTAKGVVGFT 505
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/308 (25%), Positives = 126/308 (40%), Gaps = 43/308 (13%)
Query: 98 KACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQ 157
+ C GG C G + T G +D L++ S D + F F CG +
Sbjct: 14 RGCSGGHCLYGVQYG-DGSYTIGFFAMDTLTLSSHD--------AIKGFRFGCGER--NE 62
Query: 158 GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA--IVFGDGPYYDLNNF 215
GL G+ GLGR K +LP Q + FA C G + FG G ++
Sbjct: 63 GLFGEAAGLLGLGRGKTSLPVQTYDKYG--GVFAHCFPARSSGTGYLEFGPG-----SSP 115
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
VS L TP+ I+ T Y++G+T + V GK +P+ +++ + G
Sbjct: 116 AVSAKLSTTPMLIDTGPT----------FYYVGMTGIRVGGKLLPIPQSVFA-----AAG 160
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPK--VARVAPVAPFGACFRLQDIGFTRIGPVV 333
T + + T L + Y +L AFA++M R ++ C+ L G + + +
Sbjct: 161 TIVDSGTVITRLPPAAYSSLRSAFAASMAARGYKRAPALSLLDTCYDLT--GASEV--AI 216
Query: 334 PQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFD 393
P + L+ Q V + + CLGF +IV G QL+ + +D
Sbjct: 217 PTVSLLFQG-GVSLDVDASGIIYAASVSQACLGFAGNEAADDVAIV-GNTQLKTFGVVYD 274
Query: 394 LATSRLGF 401
+A+ +GF
Sbjct: 275 IASKVVGF 282
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 89/389 (22%), Positives = 161/389 (41%), Gaps = 56/389 (14%)
Query: 35 RVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SST 81
RV ++ +Y+ + TP + + G +++W C + S+T
Sbjct: 79 RVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSAT 138
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDI--RIDVLSIQS-TDGRNPG 138
+ C S++C ++ +C +C +GD VL+ ++ T G
Sbjct: 139 YRALPCRSSRCASLSSPSCFKKMC--------VYQYYYGDTASTAGVLANETFTFGAANS 190
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
V N F CGS + G + G+ G GR ++L SQL + +F+ CL+ +
Sbjct: 191 TKVRATNIAFGCGS--LNAGDLANSSGMVGFGRGPLSLVSQLGPS-----RFSYCLTSYL 243
Query: 199 DGA---IVFGDGPYYDLNNFDVSKN--LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
+ FG Y +L++ + S ++ TP IN P++ YF+ + ++
Sbjct: 244 SATPSRLYFGV--YANLSSTNTSSGSPVQSTPFVINP--------ALPNM-YFLSLKAIS 292
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
+ K +P++ + +I+++G GG I + T L+ Y+A+ + SA+P A
Sbjct: 293 LGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDI 352
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGV 372
CF+ + VP DLV + ++ +N M+ LCL GV
Sbjct: 353 GLDTCFQWPPPPNVTV--TVP--DLVFHFDSANMTLLPENYMLIASTTGYLCLVMAPTGV 408
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGF 401
+IG Q +N L +D+ S L F
Sbjct: 409 G----TIIGNYQQQNLHLLYDIGNSFLSF 433
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 91/377 (24%), Positives = 144/377 (38%), Gaps = 66/377 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGI 104
Y + I +P L + G ++ WV C+ ++ R S N A C
Sbjct: 3 YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLAS---NTYKALTCADDY 59
Query: 105 CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV 164
D T GD+ +D L + G P F+F CGS +L+GL SG V
Sbjct: 60 SYGYGDGSF----TQGDLSVDTLKMA---GAASDELEEFPGFVFGCGS--LLKGLISGEV 110
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCL------SPFDDGAIVFGDGPYYDLNNFDVS 218
GI L ++ PSQ+ + KF+ CL + +VFG+ +L
Sbjct: 111 GILALSPGSLSFPSQIGEKYG--NKFSYCLLRQTAQNSLKKSPMVFGEAA-VELKEPGSG 167
Query: 219 K--NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGT 276
K L+YTP +GE S+ Y + + + V + + L+ + + N T
Sbjct: 168 KLQELQYTP------------IGESSIYYTVRLDGISVGNQRLDLSPS--AFLNGQDKPT 213
Query: 277 KISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR--------LQDIGFTR 328
+ T+L + ++ Q+ AS + VA + ACFR L DI F
Sbjct: 214 IFDSGTTLTMLPPGVCDSIKQSLASMVSGAEFVA-IKGLDACFRVPPSSGQGLPDITFHF 272
Query: 329 IGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI-VIGARQLEN 387
G D V + N V + G CL FV P + + G Q ++
Sbjct: 273 NGGA----DFVTRPSNYVIDL----------GSLQCLIFV-----PTNEVSIFGNLQQQD 313
Query: 388 NLLQFDLATSRLGFSDS 404
+ D+ R+GF ++
Sbjct: 314 FFVLHDMDNRRIGFKET 330
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 101/387 (26%), Positives = 160/387 (41%), Gaps = 75/387 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---KGYV----------SSTNKTARCGSA 90
+Y T++ TP V + + G +I+W+ C K Y S + CGS
Sbjct: 144 EYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSP 203
Query: 91 QCNLANAKACGGG--IC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C + C IC G G + T G+ + L+ R
Sbjct: 204 LCRRLDYPGCSTKKQICLYQVSYGDG-------SFTVGEFSTETLTF---------RGTR 247
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFD 198
V + CG + +GL G G+ GLGR +++ PSQ+ F+ KF+ CL +
Sbjct: 248 VGRVVLGCGHD--NEGLFVGAAGLLGLGRGRLSFPSQIGRRFN--SKFSYCLGDRSASSR 303
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGK 257
+IVFGD +S+ ++TPL N K++T Y++ + + V G
Sbjct: 304 PSSIVFGDSA--------ISRTTRFTPLLSNPKLDTF----------YYVELLGISVGGT 345
Query: 258 AVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
V ++ +L +D+ G GG I + T L + Y AL AF + R + F
Sbjct: 346 RVSGISASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFD 405
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNPR 375
CF L G T + VP + L + +V + N ++ + + C F G
Sbjct: 406 TCFDLS--GKTEVK--VPTVVLHFRGADV--PLPASNYLIPVDNSGSFCFAFA--GTASG 457
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFS 402
SI IG Q + + +DLATSR+GF+
Sbjct: 458 LSI-IGNIQQQGFRVVYDLATSRVGFA 483
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 164/389 (42%), Gaps = 79/389 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVD---CEKGYV----------SSTNKTARCGSA 90
+Y T++ TP V + + G +I+W+ C + Y S T T C S
Sbjct: 141 EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSP 200
Query: 91 QCNLANAKACGG--GIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C ++ C C G G + T GD + L+ + RN + V
Sbjct: 201 HCRRLDSAGCNTRRKTCLYQVSYGDG-------SFTVGDFSTETLTFR----RNRVKGVA 249
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFD 198
+ CG + +GL G G+ GLG+ K++ P Q F+ +KF+ CL +
Sbjct: 250 LG-----CGHD--NEGLFVGAAGLLGLGKGKLSFPGQTGHRFN--QKFSYCLVDRSASSK 300
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGK 257
++VFG N VS+ ++TPL N K++T Y++G+ + V G
Sbjct: 301 PSSVVFG--------NAAVSRIARFTPLLSNPKLDTF----------YYVGLLGISVGGT 342
Query: 258 AVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
VP + +L +D G GG I + T L Y A+ AF + R + F
Sbjct: 343 RVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFD 402
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVD--GGVN 373
CF L ++ + VP + L + +V S+ N ++ + + C F GG++
Sbjct: 403 TCFDLSNMNEVK----VPTVVLHFRGADV--SLPATNYLIPVDTNGKFCFAFAGTMGGLS 456
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFS 402
+IG Q + + +DLA+SR+GF+
Sbjct: 457 -----IIGNIQQQGFRVVYDLASSRVGFA 480
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 102/392 (26%), Positives = 155/392 (39%), Gaps = 81/392 (20%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYV------------SSTNKTARC 87
TL++V + +P L++ G ++ W+ C G+ S+T C
Sbjct: 158 TLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPC 217
Query: 88 GSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
G QC A K G C G G + T G + + LS+ ST R+
Sbjct: 218 GHPQCAAAGGKCSNSGTCLYKVTYGDG-------SSTAGVLSHETLSLSST--RD----- 263
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--D 199
+P F F CG + G GV G+ GLGR ++LPSQ AA F F+ CL +D
Sbjct: 264 -LPGFAFGCGQTNL--GEFGGVDGLVGLGRGALSLPSQAAATFG--ATFSYCLPSYDTTH 318
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
G + G N+ D +++YT + I K + PS+ YF+ V S+ + G +
Sbjct: 319 GYLTMGSTTPAASNDDD---DVQYTAM-IQKED-------YPSL-YFVEVVSIDIGGYIL 366
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
P+ T+ + D GT + T L Y +L F M + PF C+
Sbjct: 367 PVPPTVFTRD-----GTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCY 421
Query: 320 RLQDIGFTRIGPVVPQIDL------VLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN 373
G I +P + V V I+ ++ G CL FV
Sbjct: 422 DF--TGHNAI--FMPAVAFKFSDGAVFDLSPVAILIYPDDTAPATG----CLAFV----- 468
Query: 374 PRTSI----VIGARQLENNLLQFDLATSRLGF 401
PR S +IG Q + +D+A ++GF
Sbjct: 469 PRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGF 500
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 150/372 (40%), Gaps = 52/372 (13%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTA--------------RCGSA 90
YVT++ TP + V G ++ W+ C VS ++ C +
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ C N ++GD V + S D + G + +VPNF + C
Sbjct: 187 QCSDLTTATLNPASCST--SNVCIYQASYGDSSFSVGYL-SKDTVSFG-STSVPNFYYGC 242
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL G+ GL R+K++L QLA S+ F+ CL + + Y
Sbjct: 243 GQDN--EGLFGQSAGLIGLARNKLSLLYQLAP--SMGYSFSYCLPTSSSSSSGYLSIGSY 298
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN 270
+ + YTP+ + ++ + YFI +T + V GK PL+ + + +
Sbjct: 299 NPGQY------SYTPMASSSLDDS---------LYFIKMTGIKVAGK--PLSVSSSAYSS 341
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIG 330
T I + T L T +Y AL +A A AM R + + CF+ Q
Sbjct: 342 LP---TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLR--- 395
Query: 331 PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLL 390
VP++ + + +N +V + CL F R++ +IG Q + +
Sbjct: 396 --VPEVTMAFAGGAALKLAA-RNLLVDVDSATTCLAFAPA----RSAAIIGNTQQQTFSV 448
Query: 391 QFDLATSRLGFS 402
+D+ S++GF+
Sbjct: 449 VYDVKNSKIGFA 460
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 164/389 (42%), Gaps = 79/389 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVD---CEKGYV----------SSTNKTARCGSA 90
+Y T++ TP V + + G +I+W+ C + Y S T T C S
Sbjct: 141 EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSP 200
Query: 91 QCNLANAKACGG--GIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C ++ C C G G + T GD + L+ + RN + V
Sbjct: 201 HCRRLDSAGCNTRRKTCLYQVSYGDG-------SFTVGDFSTETLTFR----RNRVKGVA 249
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFD 198
+ CG + +GL G G+ GLG+ K++ P Q F+ +KF+ CL +
Sbjct: 250 LG-----CGHD--NEGLFVGAAGLLGLGKGKLSFPGQTGHRFN--QKFSYCLVDRSASSK 300
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGK 257
++VFG N VS+ ++TPL N K++T Y++G+ + V G
Sbjct: 301 PSSVVFG--------NAAVSRIARFTPLLSNPKLDTF----------YYVGLLGISVGGT 342
Query: 258 AVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
VP + +L +D G GG I + T L Y A+ AF + R + F
Sbjct: 343 RVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFD 402
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVD--GGVN 373
CF L ++ + VP + L + +V S+ N ++ + + C F GG++
Sbjct: 403 TCFDLSNMNEVK----VPTVVLHFRRADV--SLPATNYLIPVDTNGKFCFAFAGTMGGLS 456
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFS 402
+IG Q + + +DLA+SR+GF+
Sbjct: 457 -----IIGNIQQQGFRVVYDLASSRVGFA 480
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 150/372 (40%), Gaps = 52/372 (13%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTA--------------RCGSA 90
YVT++ TP + V G ++ W+ C VS ++ C +
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ C N ++GD V + S D + G + +VPNF + C
Sbjct: 187 QCSDLTTATLNPASCST--SNVCIYQASYGDSSFSVGYL-SKDTVSFG-STSVPNFYYGC 242
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL G+ GL R+K++L QLA S+ F+ CL + + Y
Sbjct: 243 GQDN--EGLFGQSAGLIGLARNKLSLLYQLAP--SMGYSFSYCLPTSSSSSSGYLSIGSY 298
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN 270
+ + YTP+ + ++ + YFI +T + V GK PL+ + + +
Sbjct: 299 NPGQY------SYTPMASSSLDDS---------LYFIKMTGIKVAGK--PLSVSSSAYSS 341
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIG 330
T I + T L T +Y AL +A A AM R + + CF+ Q
Sbjct: 342 LP---TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLR--- 395
Query: 331 PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLL 390
VP++ + + +N +V + CL F R++ +IG Q + +
Sbjct: 396 --VPEVTMAFAGGAALKLAA-RNLLVDVDSATTCLAFAPA----RSAAIIGNTQQQTFSV 448
Query: 391 QFDLATSRLGFS 402
+D+ S++GF+
Sbjct: 449 VYDVKNSKIGFA 460
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 93/399 (23%), Positives = 164/399 (41%), Gaps = 49/399 (12%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSA 90
A+ L T T QY +++ TP P L G ++ WV C SS++ A
Sbjct: 90 AMPLTSGAYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQR 149
Query: 91 QCNLANAKA-----CGGGICG----------AGPDNPIS-------NTGTHGDIRIDVLS 128
A +K+ C C + P +P S N+ G + +D +
Sbjct: 150 VFRPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSAT 209
Query: 129 IQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKR 188
+ S G + R + + C + + Q S G+ LG S ++ S+ A+ F +
Sbjct: 210 V-SLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSS-DGVLSLGNSNISFASRAASRFGGRF 267
Query: 189 KFALC--LSPFDDGA-IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
+ L L+P + + + FG+G ++ + + TPL + + F Y
Sbjct: 268 SYCLVDHLAPRNATSFLTFGNG----DSSPGDDSSSRRTPLVLLEDARTRPF-------Y 316
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
F+ V +V V G+ + + + D GG + + T+L T Y A+V+A +
Sbjct: 317 FVSVDAVTVAGERLEILPDVW--DFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAG 374
Query: 306 VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
V RV + PF C+ +T + +P+++L + + G++ ++ C+
Sbjct: 375 VPRVN-MDPFEYCY-----NWTGVSAEIPRMELRFAGAATL-APPGKSYVIDTAPGVKCI 427
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
G V+G P S VIG + +L +FDLA L F S
Sbjct: 428 GVVEGAW-PGVS-VIGNILQQEHLWEFDLANRWLRFKQS 464
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 90/388 (23%), Positives = 160/388 (41%), Gaps = 62/388 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y+ ++ TP ++ + G ++ W+ C ++G V S++ + CG
Sbjct: 149 EYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDT 208
Query: 91 QCNLANAKACGGGICGAGPDNPI-------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
+C L + A C + +P + T GD+ ++ ++ T + V
Sbjct: 209 RCGLVSPPA-APRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLT----ASSSRRV 263
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA-- 201
+ CG +GL G G+ GLGR ++ SQL A + F+ CL D G+
Sbjct: 264 DGVVLGCGHRN--RGLFHGAAGLLGLGRGPLSFASQLRAVYG--HAFSYCL--VDHGSAV 317
Query: 202 ---IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
IVFGD N L YT + + Y++ + + V G+
Sbjct: 318 GSKIVFGDD-----NVLLSHPQLNYTAFAPSAAE---------NTFYYVQLKGILVGGEM 363
Query: 259 VPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV-ARVAPVAPFG 316
+ + + E G GGT I + + YKA+ QAF M K +A
Sbjct: 364 LDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLS 423
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPR 375
C+ + G R+ VP+ L+ + VW +N +++ + +CL + PR
Sbjct: 424 PCYNVS--GVERV--EVPEFSLLFAD-GAVWDFPAENYFIRLDTEGIMCLAVLG---TPR 475
Query: 376 TSI-VIGARQLENNLLQFDLATSRLGFS 402
+++ +IG Q +N + +DL +RLGF+
Sbjct: 476 SAMSIIGNYQQQNFHVLYDLHHNRLGFA 503
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 134/323 (41%), Gaps = 52/323 (16%)
Query: 87 CGSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA 140
C S QC+L + AC C G G + T G++ + S + ++
Sbjct: 242 CDSEQCHLLDEAACDANSCIYEVEYGDG-------SFTVGELATETFSFRHSN------- 287
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
++PN CG + +GL G G+ GLG ++L SQL A F+ CL D
Sbjct: 288 -SIPNLPIGCGHDN--EGLFVGAAGLIGLGGGAISLSSQLEAT-----SFSYCLVDLDSE 339
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
+ D N S +L +PL N P+ Y + V + V GK +P
Sbjct: 340 S-----SSTLDFNADQPSDSLT-SPLVKNDR--------FPTFRY-VKVIGMSVGGKPLP 384
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
++ + ID G GG + + T + + +Y L AF + V+PF C+
Sbjct: 385 ISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYD 444
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRTSIV 379
L ++ VP I +L +N + + +N + Q+ CL F+ P + +
Sbjct: 445 LS----SQSNVEVPTIAFILPGENSLQ-LPAKNCLFQVDSAGTFCLAFLPSTF-PLS--I 496
Query: 380 IGARQLENNLLQFDLATSRLGFS 402
IG Q + + +DLA S +GFS
Sbjct: 497 IGNVQQQGIRVSYDLANSLVGFS 519
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 103/395 (26%), Positives = 159/395 (40%), Gaps = 90/395 (22%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSA 90
+Y+ + +P V V G +++W+ C + SST K A C S
Sbjct: 88 EYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQ 147
Query: 91 QCNL--ANAKACGG-GICGAG---PDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
C L + + CG G C G D S G + + LS ST G + V+ P
Sbjct: 148 PCTLLQPSQRDCGKLGQCIYGIMYGDKSFS----VGILGTETLSFGSTGG---AQTVSFP 200
Query: 145 NFIFLCG--SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA- 201
N IF CG + F + ++ V+GIAGLG ++L SQL A + KF+ CL P+D +
Sbjct: 201 NTIFGCGVDNNFTIY-TSNKVMGIAGLGAGPLSLVSQLGA--QIGHKFSYCLLPYDSTST 257
Query: 202 --IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGK 257
+ FG N + TPL I +PS+ YF+ + +V + K
Sbjct: 258 SKLKFGSEAIITTN------GVVSTPLII-----------KPSLPTYYFLNLEAVTIGQK 300
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM-PKVARVAPVAPFG 316
V +T G I + P T LE + Y V + + K+ + P +P
Sbjct: 301 VVSTGQT--------DGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLP-SPLK 351
Query: 317 ACF------RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG 370
CF + DI F G V L+ KNV+ + N LCL V
Sbjct: 352 TCFPNRANLAIPDIAFQFTGASV-----ALRPKNVLIPLTDSN--------ILCLAVV-- 396
Query: 371 GVNPRTSI---VIGARQLENNLLQFDLATSRLGFS 402
P + I + G+ + +++DL ++ F+
Sbjct: 397 ---PSSGIGISLFGSIAQYDFQVEYDLEGKKVSFA 428
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 133/366 (36%), Gaps = 58/366 (15%)
Query: 62 VHLGGNILWVDCE---------------KGYVSSTNKTARCGSAQCNLANAKACGG-GIC 105
V G ++ WV CE S T CGS C + A G G C
Sbjct: 198 VDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCGSPACAASLKDATGAPGSC 257
Query: 106 GAGPDNPISNTG---THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASG 162
N ++GD + + D G + F+F CG +GL G
Sbjct: 258 ARSAGNSEQRCYYALSYGDGSFS-RGVLAQDTLGLGTTTKLDGFVFGCG--LSNRGLFGG 314
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLK 222
G+ GLGR+ ++L SQ AA F + L + G++ G GP N+
Sbjct: 315 TAGLMGLGRTDLSLVSQTAARFGGVFSYCLPATTTSTGSLSLGPGPSSSF------PNMA 368
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
YT + + +P YFI +T V L+ G G + +
Sbjct: 369 YTRMIADPT--------QPPF-YFINITGAAVG------GGAALTAPGFGAGNVLVDSGT 413
Query: 283 PYTVLETSIYKALVQAFASAMPKVARVAPVAP----FGACFRLQDIGFTRIGPVVPQIDL 338
T L S+YKA+ FA P AP AC+ L G + VP + L
Sbjct: 414 VITRLAPSVYKAVRAEFARRFEY-----PAAPGFSILDACYDL--TGRDEVN--VPLLTL 464
Query: 339 VLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATS 397
L+ V G +V+ G +CL +T I IG Q N + +D S
Sbjct: 465 TLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPI-IGNYQQRNKRVVYDTVGS 523
Query: 398 RLGFSD 403
RLGF+D
Sbjct: 524 RLGFAD 529
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 62/258 (24%), Positives = 109/258 (42%), Gaps = 45/258 (17%)
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNL 221
G+ G+ R ++L +Q+ KF+ C+S D G ++ GDGP L
Sbjct: 186 TTGLMGMNRGSLSLVTQMVLP-----KFSYCISGEDAFGVLLLGDGP-------SAPSPL 233
Query: 222 KYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTV 281
+YTPL T+S + V Y + + + V+ K + L K++ D+ G G T + +
Sbjct: 234 QYTPLVT--ATTSSPYFDR--VAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSG 289
Query: 282 NPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPV--------- 332
+T L +Y +L F G R++D F G +
Sbjct: 290 TQFTFLLGPVYNSLKDEFLEQTK-----------GVLTRIEDPNFVFEGAMDLCYHAPAS 338
Query: 333 ---VPQIDLVLQNKNVVWSIHGQNSMVQI--GGDAL-CLGFVDGGVNPRTSIVIGARQLE 386
VP + LV + + G+ + ++ G D + C F + + + VIG +
Sbjct: 339 LAAVPAVTLVFSGAEM--RVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQ 396
Query: 387 NNLLQFDLATSRLGFSDS 404
N ++FDL SR+GF+++
Sbjct: 397 NVWMEFDLVKSRVGFTET 414
>gi|302799870|ref|XP_002981693.1| hypothetical protein SELMODRAFT_421198 [Selaginella moellendorffii]
gi|300150525|gb|EFJ17175.1| hypothetical protein SELMODRAFT_421198 [Selaginella moellendorffii]
Length = 374
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 69/257 (26%), Positives = 109/257 (42%), Gaps = 45/257 (17%)
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFGD-GPYYDLNNFDVS 218
V+G+A G S +LP Q++ + L +F CL+ G + G GPY +N D+
Sbjct: 117 VIGLAASGSS--SLPLQVSRSAKLAHRFTYCLASSSGRGLGELYIGQQGPYRVFHNTDIL 174
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL-NKTLLSIDNEGVGGTK 277
+ L+ ++SG S H+ +V L +KT ++I +G
Sbjct: 175 NSTSLPMLYFPLTVSSSG--------------SYHLKLDSVSLGSKTTVTITMVEIG--- 217
Query: 278 ISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA---CFRL---QDIGFTRIGP 331
T YT L + Y+ L F + + + FG C+++ Q F+ +
Sbjct: 218 --TSFRYTRLPQAAYQMLRDGFLREVGEKKLGRDSSSFGELDLCYKMSVEQRTTFSNVTM 275
Query: 332 VVPQIDLVLQNKNVVWSIHGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGARQLENNL 389
VV I W + G N +V G + C FV G + R+ VIG Q ENN
Sbjct: 276 VVSGIP---------WMVSGDNYLVTKPGIRNVACFAFVSAGKDGRS--VIGTAQQENNF 324
Query: 390 LQFDLATSRLGFSDSLL 406
++FD+ +LG S SL
Sbjct: 325 VEFDVDAKKLGVSGSLF 341
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 143/378 (37%), Gaps = 57/378 (15%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTN-------------KTARCGSAQ 91
YV + K TP + + + + +W+ C G +N T C +AQ
Sbjct: 30 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCS-GCSGCSNASTSFNTNSSSTYSTVSCSTAQ 88
Query: 92 CNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC- 150
C A C P N GD +Q T P +PNF F C
Sbjct: 89 CTQARGLTCPSS--SPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPD---VIPNFSFGCI 143
Query: 151 ----GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGD 206
G+ QGL GLGR ++L SQ + +S F+ CL F F
Sbjct: 144 NSASGNSLPPQGLM-------GLGRGPMSLVSQTTSLYS--GVFSYCLPSFRS----FYF 190
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
L K+++YTPL N PS+ Y++ +T V V VP++ L
Sbjct: 191 SGSLKLGLLGQPKSIRYTPLLRNP--------RRPSL-YYVNLTGVSVGSVQVPVDPVYL 241
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGF 326
+ D GT I + T +Y+A+ F + V+ + + F CF +
Sbjct: 242 TFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV-NVSSFSTLGAFDTCFSADNEN- 299
Query: 327 TRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSI-VIGARQ 384
V P+I L + + ++ + +N+++ L CL N + VI Q
Sbjct: 300 -----VAPKITLHMTSLDLKLPM--ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQ 352
Query: 385 LENNLLQFDLATSRLGFS 402
+N + FD+ SR+G +
Sbjct: 353 QQNLRILFDVPNSRIGIA 370
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 101/467 (21%), Positives = 173/467 (37%), Gaps = 110/467 (23%)
Query: 14 LIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC 73
L+ S S ++A+ R +L L D TL + + R P+ L + G +++W C
Sbjct: 22 LLKSTSTLSAKRFRRQLSLPLSPGSDY-TLSF--NLGPRAQAQPITLYMDTGSDLVWFPC 78
Query: 74 E---------KGYVSSTNKTAR-----CGSAQCNLANAKACGGGICGAG--PDNPISNTG 117
K S T R C S C+ A+ A +C A P I +
Sbjct: 79 APFKCILCEGKPNASPPVNTTRSVAVSCKSPACSAAHNLASPSDLCAAARCPLESIETSD 138
Query: 118 -----------THGD------IRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLA 160
+GD + D LS+ S + + NF F C + +
Sbjct: 139 CANFKCPPFYYAYGDGSLIARLYRDTLSLSS---------LFLRNFTFGCAYTTLAE--- 186
Query: 161 SGVVGIAGLGRSKVALPSQLAA-AFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSK 219
G+AG GR ++LP+QLA + L +F+ CL ++FD +
Sbjct: 187 --PTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVS----------------HSFDSER 228
Query: 220 NLKYTPLFINKVNTA----------SGFLGEPSVE-------YFIGVTSVHVNGKAVPLN 262
K +PL + + + F+ P +E Y +G+ + V + VP
Sbjct: 229 VRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGISVGKRIVPAP 288
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV----ARVAPVAPFGAC 318
+ L ++N G GG + + +T+L Y ++V F + +V ++ C
Sbjct: 289 EMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKIEEKTGLAPC 348
Query: 319 FRLQDIGFTRIGPVVPQIDL---------VLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
+ L + VP + L VL KN + + CL ++
Sbjct: 349 YYLNSVA------EVPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRRVGCLMLMN 402
Query: 370 GGVNPRTS----IVIGARQLENNLLQFDLATSRLGFSD---SLLFER 409
GG S +G Q + +++DL R+GF+ + L+ER
Sbjct: 403 GGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCASLWER 449
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 100/423 (23%), Positives = 161/423 (38%), Gaps = 59/423 (13%)
Query: 2 SVAQSFLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATL--QYVTQIKQRTPLVPVK 59
SV + L + S L+ +++ + +PK + VA YV + K TP +
Sbjct: 59 SVIDTVLHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMF 118
Query: 60 LTVHLGGNILWVDCEKGYVSSTN-------------KTARCGSAQCNLANAKACGGGICG 106
+ + + +W+ C G +N T C +AQC A C
Sbjct: 119 MVLDTSNDAVWLPCS-GCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSS--S 175
Query: 107 AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC-----GSEFVLQGLAS 161
P N GD +Q T P +PNF F C G+ QGL
Sbjct: 176 PQPSVCSFNQSYGGDSSFSASLVQDTLTLAPD---VIPNFSFGCINSASGNSLPPQGLM- 231
Query: 162 GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNL 221
GLGR ++L SQ + +S F+ CL F F L K++
Sbjct: 232 ------GLGRGPMSLVSQTTSLYS--GVFSYCLPSFRS----FYFSGSLKLGLLGQPKSI 279
Query: 222 KYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTV 281
+YTPL N PS+ Y++ +T V V VP++ L+ D GT I +
Sbjct: 280 RYTPLLRNP--------RRPSL-YYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSG 330
Query: 282 NPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQ 341
T +Y+A+ F + V+ + + F CF + V P+I L +
Sbjct: 331 TVITRFAQPVYEAIRDEFRKQV-NVSSFSTLGAFDTCFSADNEN------VAPKITLHMT 383
Query: 342 NKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRL 399
+ ++ + +N+++ L CL N + VI Q +N + FD+ SR+
Sbjct: 384 SLDLKLPM--ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRI 441
Query: 400 GFS 402
G +
Sbjct: 442 GIA 444
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 90/395 (22%), Positives = 150/395 (37%), Gaps = 70/395 (17%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK-------------GYVSSTNKTAR 86
T + YV TP P + L G ++W C++ S+T +
Sbjct: 46 TQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEP 105
Query: 87 CGSAQCNL--ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
CG+ C ++++ C G +C +TG G + D ++ G A
Sbjct: 106 CGTPLCESIPSDSRNCSGNVCAYQASTNAGDTG--GKVGTDTFAV--------GTAKASL 155
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG---A 201
F + S+ G SG+VG LGR+ +L +Q A F+ CL+P D G A
Sbjct: 156 AFGCVVASDIDTMGGPSGIVG---LGRTPWSLVTQTGVA-----AFSYCLAPHDAGRNSA 207
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
+ G TP F+N SG + S Y + + + +PL
Sbjct: 208 LFLGSS-----AKLAGGGKAASTP-FVN----ISGNGNDLSNYYKVQLEGLKAGDAMIPL 257
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
+ ++ + T +P + L Y+A+ +A +A+ PV PF
Sbjct: 258 PPSGSTV--------LLDTFSPISFLVDGAYQAVKKAVTAAVGAPPMATPVEPF------ 303
Query: 322 QDIGFTRIGPVVPQIDLVLQNK-NVVWSIHGQNSMVQIGGDALCLGFVDGG-VNPRTSI- 378
D+ F + G DLV + ++ N ++ +CL + +N T +
Sbjct: 304 -DLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELS 362
Query: 379 VIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
++G+ Q EN FDL ++L FE A CT
Sbjct: 363 LLGSLQQENIHFLFDLD------KETLSFEPADCT 391
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 67/294 (22%), Positives = 116/294 (39%), Gaps = 72/294 (24%)
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA-AFSLKRKFALCLSPFDDGAIV 203
NF F C + + G+AG GR ++LP+QLA + +L +F+ CL
Sbjct: 144 NFTFGCAHTALAE-----PTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLVS------- 191
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASG---------FLGEPSVEYF--IGVTSV 252
++FD + K +PL + + S L P YF +G+T +
Sbjct: 192 ---------HSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGI 242
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR---- 308
V + + + L +D G GG + + +T+L S+Y ++V F + +V +
Sbjct: 243 SVGKRTILAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASE 302
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---------- 358
V G C+ L+ + ++ V W G NS V +
Sbjct: 303 VEEKTGLGPCYFLEGL---------------VEVPTVTWHFLGNNSNVMLPRMNYFYEFL 347
Query: 359 -GGDAL-----CLGFVDGGVNPRTS----IVIGARQLENNLLQFDLATSRLGFS 402
G D CL ++GG + S ++G Q + + +DL R+GF+
Sbjct: 348 DGEDEARRKVGCLMLMNGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFA 401
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 89/387 (22%), Positives = 157/387 (40%), Gaps = 69/387 (17%)
Query: 43 LQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGS 89
++Y+ ++ TP VP G ++ W C+ + SST C S
Sbjct: 64 VEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSS 123
Query: 90 AQC-NLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNPGRAV 141
A C ++ C + P +P ++ D + + L+I S+ PG+ V
Sbjct: 124 ATCLPTWRSRNC------SNPSSPCRYIYSYSDGAYSVGILGTETLTIGSS---VPGQTV 174
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
+V + F CG++ L S G GLGR ++L +QL KF+ CL+ F +
Sbjct: 175 SVGSVAFGCGTDNGGDSLNS--TGTVGLGRGTLSLLAQLGVG-----KFSYCLTDFFNST 227
Query: 202 IVFGDGPYYDLNNFDVSK---NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
+ D P++ +++ ++ TPL + +N + YF+ + + +
Sbjct: 228 M---DSPFFLGTLAELAPGPGTVQSTPLLQSPLNPS---------RYFVNLQGISLGDVR 275
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--PKVARVAPVAPFG 316
+P+ + +G GG + + +T+L S ++ +V A + P V + +P
Sbjct: 276 LPIPNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSLDSP-- 333
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNP 374
CF D P +P DLVL +H N M D + CL V +P
Sbjct: 334 -CFPSPD-----GEPFMP--DLVLHFAGGADMRLHRDNYMSYNEDDSSFCLNIVG---SP 382
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGF 401
T +G Q +N + FD+ +L F
Sbjct: 383 STWSRLGNFQQQNIQMLFDMTVGQLSF 409
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 89/353 (25%), Positives = 137/353 (38%), Gaps = 68/353 (19%)
Query: 79 SSTNKTARCGSAQCNLAN------AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQST 132
SS N C A C+L+ A+ C G ++ T T +R+
Sbjct: 85 SSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLTRDTLRVH------- 137
Query: 133 DGRNPGRAVT-VPNFIFLC-GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
P R +P F F C GS + +GIAG R ++ PSQL LK+ F
Sbjct: 138 --EGPARVTKDIPKFCFGCVGSTY------HEPIGIAGFVRGTLSFPSQLGL---LKKGF 186
Query: 191 ALCLSPFD-------DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV 243
+ C F +V GD +N + LK +P++ N
Sbjct: 187 SHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLK-SPMYPNY------------- 232
Query: 244 EYFIGVTSV---HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFA 300
Y+IG+ ++ +V+ VPLN L D++G GG I + YT L Y L+ F
Sbjct: 233 -YYIGLEAITVGNVSATTVPLN--LREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFK 289
Query: 301 SAM--PKVARVAPVAPFGACFRL--QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV 356
+ + P+ V A F C+++ + T + P I N NV + + N
Sbjct: 290 AIITYPRATEVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLN-NVSFVLPQGNHFY 348
Query: 357 QIGGDA-----LCLGF---VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+ + CL F D P + V G+ Q +N + +DL R+GF
Sbjct: 349 AMSAPSNSTVVKCLLFQSMADSDYGP--AGVFGSFQQQNVQIVYDLEKERIGF 399
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 65/250 (26%), Positives = 104/250 (41%), Gaps = 27/250 (10%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDGAIVFGDGPYYDLNNFDVSKN-L 221
GIAG GR + +LPSQ+ +F+ CL FDD A + + ++ D N +
Sbjct: 222 GIAGFGRGEESLPSQMNLT-----RFSYCLLSHQFDDSATITSNLVLETASSRDGKTNGV 276
Query: 222 KYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTV 281
YTP N + G Y+I + + V K V + + LL + +G GG + +
Sbjct: 277 SYTPFLKNPTTKKNPAFG---AYYYITLKRIVVGEKRVRVPRRLLEPNVDGDGGFIVDSG 333
Query: 282 NPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG--ACFRLQDIGFTRIGPVVPQIDLV 339
+ +T +E I+ + Q FA + FG CF L T P++
Sbjct: 334 STFTFMERPIFDLVAQEFAKQVSYTRAREAEKQFGLSPCFVLAGGAETA---SFPELRFE 390
Query: 340 LQNKNVVWSIHGQNSMVQIG-GDALCLGFVD-------GGVNPRTSIVIGARQLENNLLQ 391
+ + N +G GD CL V G V P ++++G Q +N ++
Sbjct: 391 FRG-GAKMRLPVANYFSLVGKGDVACLTIVSDDVAGSGGTVGP--AVILGNYQQQNFYVE 447
Query: 392 FDLATSRLGF 401
+DL R GF
Sbjct: 448 YDLENERFGF 457
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 63/291 (21%), Positives = 119/291 (40%), Gaps = 49/291 (16%)
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA-AFSLKRKFALCLSPFD 198
++ + NF F C + + G+AG GR ++LP+QL+ + L +F+ CL
Sbjct: 196 SLHLQNFTFGCAHTALAE-----PTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVS-- 248
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVN---TASG-----------FLGEPSVE 244
++FD + + +PL + + N T +G L P
Sbjct: 249 --------------HSFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPKHP 294
Query: 245 YF--IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
Y+ +G+ + V + VP + L +D +G GG + + +T+L S Y A+V F
Sbjct: 295 YYYCVGLAGISVGKRTVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKR 354
Query: 303 M----PKVARVAPVAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSM 355
+ + + + G C+ L I ++ V D+VL KN +
Sbjct: 355 VNRFHKRASEIETKTGLGPCYYLNGLSQIPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDG 414
Query: 356 VQIGGDALCLGFVDG----GVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
++ G C+ ++G ++ +G Q + + +DL R+GF+
Sbjct: 415 IRRKGKVGCMMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFA 465
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 152/383 (39%), Gaps = 75/383 (19%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTAR 86
+L+YV ++ TP VP + + G ++ W+ C+ SST
Sbjct: 110 SLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVP 169
Query: 87 CGSAQCNLANAKACGGGICGAGPD-----NPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
C S C A A G G C +G + T T G D L++ PG
Sbjct: 170 CASDVCKKLAADAYGSG-CTSGKQCGFAISYADGTSTVGAYSQDKLTLA------PG--A 220
Query: 142 TVPNFIFLCGS-EFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD- 199
V NF F CG + ++GL GV+ GLGR + +L ++ FS CL
Sbjct: 221 IVQNFYFGCGHGKHAVRGLFDGVL---GLGRLRESLGARYGGVFS------YCLPSVSSK 271
Query: 200 -GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
G + G G +TP+ G+P+ + + ++V GK
Sbjct: 272 PGFLALGAG--------KNPSGFVFTPM--------GTVPGQPTFST-VTLAGINVGGKK 314
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
+ L + S GG + + T L+++ Y+AL AF AM + R+ P C
Sbjct: 315 LDLRPSAFS------GGMIVDSGTVITGLQSTAYRALRSAFRKAM-EAYRLLPNGDLDTC 367
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI 378
+ L G+ + VVP+I L + ++ N ++ G CL F + G + +
Sbjct: 368 YNL--TGYKNV--VVPKIALTFTGGATI-NLDVPNGILVNG----CLAFAESGPDGSAGV 418
Query: 379 VIGARQLENNLLQFDLATSRLGF 401
+ Q +L FD +TS+ GF
Sbjct: 419 LGNVNQRAFEVL-FDTSTSKFGF 440
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 92/395 (23%), Positives = 163/395 (41%), Gaps = 74/395 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNL---------- 94
Y T+I+ +P + V G ++LWV C +S N + Q L
Sbjct: 81 YYTKIRLGSPPRDFYVQVDTGSDVLWVSC-----ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 95 -----ANAKACGGGICGAGPDNPISN------------TGTHGDIRIDVLSIQSTDGRNP 137
+ + C GI + + N +GT G DVL G +
Sbjct: 136 ATPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 138 GRAVTVPNFIFLCGSEFVLQGLAS--GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS 195
T P +F C + + S V GI G G+ +++ SQLA+ R F+ CL
Sbjct: 196 VPNSTAP-VVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLK 254
Query: 196 PFDDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
+ G +V G+ V N+ +TPL ++ + Y + + S+
Sbjct: 255 GENGGGGILVLGE---------IVEPNMVFTPLVPSQPH------------YNVNLLSIS 293
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
VNG+A+P+N ++ S N GT I T L + Y V+A +A+ + R PV
Sbjct: 294 VNGQALPINPSVFSTSNG--QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR--PVV 349
Query: 314 PFG-ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGFV 368
G C+ + T + + P + L ++ ++ Q+ ++Q +GG A+ C+GF
Sbjct: 350 SKGNQCYVIA----TSVADIFPPVSLNFAGGASMF-LNPQDYLIQQNNVGGTAVWCIGFQ 404
Query: 369 DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
+ + ++G L++ + +DL R+G+++
Sbjct: 405 R--IQNQGITILGDLVLKDKIFVYDLVGQRIGWAN 437
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 155/366 (42%), Gaps = 38/366 (10%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+++ ++ TP + G +++W C+ T + + + +C
Sbjct: 96 EFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQ 155
Query: 104 ICGAGPDNPISN----TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGL 159
+C A P + +N ++GD I +++ G+A +VPN F CG++ G
Sbjct: 156 LCEALPQSSCNNGCEYLYSYGDYS-STQGILASETLTFGKA-SVPNVAFGCGADNEGSGF 213
Query: 160 ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLNNFD 216
+ G G+ GLGR ++L SQL + KF+ CL+ DD ++ G L + +
Sbjct: 214 SQGA-GLVGLGRGPLSLVSQLK-----EPKFSYCLTTVDDTKTSTLLMGS-----LASVN 262
Query: 217 VSKN-LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
S + +K TPL + + PS Y++ + + V +P+ K+ S+ ++G GG
Sbjct: 263 ASSSAIKTTPLIHSPAH--------PSF-YYLSLEGISVGDTRLPIKKSTFSLQDDGSGG 313
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQ 335
I + T LE S + + + F + + + CF L G T I VP+
Sbjct: 314 LIIDSGTTITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPS-GSTNI--EVPK 370
Query: 336 IDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLA 395
LV + +N M IG ++ + + G + SI G Q +N L+ DL
Sbjct: 371 --LVFHFDGADLELPAENYM--IGDSSMGVACLAMGSSSGMSI-FGNVQQQNMLVLHDLE 425
Query: 396 TSRLGF 401
L F
Sbjct: 426 KETLSF 431
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 92/348 (26%), Positives = 139/348 (39%), Gaps = 58/348 (16%)
Query: 79 SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQS--TDGRN 136
SS N C A C+L+ G C P + T G + I L+ + T G +
Sbjct: 85 SSDNSYDPCAVAGCSLSTLVK---GTC-PRPCPSFAYTYGAGGVVIGTLTRDTLTTHGSS 140
Query: 137 PGRAVTVPNFIFLC-GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL- 194
P VPNF F C GS + +GIAG GR ++LPSQL L++ F+ C
Sbjct: 141 PSFTREVPNFCFGCVGSTY------REPIGIAGFGRGVLSLPSQLGF---LQKGFSHCFL 191
Query: 195 ------SPFDDGAIVFGDGPYYDLNNFDVSKN--LKYTPLFINKVNTASGFLGEPSVEYF 246
+P +V GD +S N L++T L N + P+ Y+
Sbjct: 192 GFKFANNPNISSPLVIGD--------LAISSNDHLQFTSLLKNPM--------YPNY-YY 234
Query: 247 IGVTSVHV-NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM-- 303
IG+ ++ V N A+ + +L D+ G GG I + YT L Y L+ S +
Sbjct: 235 IGLEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITY 294
Query: 304 PKVARVAPVAPFGACFRL--QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD 361
P+ F C+R+ + T ++P I N NV + N +G
Sbjct: 295 PRAQEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSN-NVSLVLPQGNHFYAMGAP 353
Query: 362 A--------LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+ L D P + V G+ Q +N + +DL R+GF
Sbjct: 354 SNSTVVKCLLLQNMDDSDSGP--AGVFGSFQQQNVKVVYDLEKERIGF 399
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 82/378 (21%), Positives = 147/378 (38%), Gaps = 37/378 (9%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
+Y + TP L + G ++ W+ C Y SS+ + C +
Sbjct: 180 EYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSYRNIGCHDS 239
Query: 91 QCNLANAKACGGGICGAGPDNPI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
+C+L ++ P ++ T GD ++ ++ T V N
Sbjct: 240 RCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVEN 299
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
+F CG +GL G G+ GLGR ++ SQL + + F+ CL + A V
Sbjct: 300 VMFGCGH--WNRGLFHGAAGLLGLGRGPLSFSSQLQSLYG--HSFSYCLVDRNSDANVSS 355
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
+ + + L +T L K N F Y++ + S+ V G+ V + +
Sbjct: 356 KLIFGEDKDLLSHPELNFTTLVAGKENPVDTF-------YYVQIKSIVVGGEVVNIPEEK 408
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
I +G GGT I + + Y+ + +AF + + V C+ + +
Sbjct: 409 WQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVE 468
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG-GDALCLGFVDGGVNPRTSIVIGARQ 384
P +P +V + VW+ +N ++I + +CL + G P +IG Q
Sbjct: 469 ----QPDLPDFGIVFSD-GAVWNFPVENYFIEIEPREVVCLAIL--GTPPSALSIIGNYQ 521
Query: 385 LENNLLQFDLATSRLGFS 402
+N + +D SRLGF+
Sbjct: 522 QQNFHILYDTKKSRLGFA 539
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 89/346 (25%), Positives = 136/346 (39%), Gaps = 54/346 (15%)
Query: 79 SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQS--TDGRN 136
SS N C A C+L+ G C P + T G + I L+ + T G +
Sbjct: 102 SSDNSYDPCAVAGCSLSTLVK---GTC-PRPCPSFAYTYGAGGVVIGTLTRDTLTTHGSS 157
Query: 137 PGRAVTVPNFIFLC-GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL- 194
P VPNF F C GS + +GIAG GR ++LPSQL L++ F+ C
Sbjct: 158 PSFTREVPNFCFGCVGSTY------REPIGIAGFGRGVLSLPSQLGF---LQKGFSHCFL 208
Query: 195 ------SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIG 248
+P +V GD ++ + LK P++ N Y+IG
Sbjct: 209 GFKFANNPNISSPLVIGDLAISSNDHLQFTSLLK-NPMYPNY--------------YYIG 253
Query: 249 VTSVHV-NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--PK 305
+ ++ V N A+ + +L D+ G GG I + YT L Y L+ S + P+
Sbjct: 254 LEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPR 313
Query: 306 VARVAPVAPFGACFRL--QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA- 362
F C+R+ + T ++P I N NV + N +G +
Sbjct: 314 AQEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSN-NVSLVLPQGNHFYAMGAPSN 372
Query: 363 -------LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
L D P + V G+ Q +N + +DL R+GF
Sbjct: 373 STVVKCLLLQNMDDSDSGP--AGVFGSFQQQNVKVVYDLEKERIGF 416
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 90/381 (23%), Positives = 155/381 (40%), Gaps = 59/381 (15%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGSA 90
Y ++ +P + V G + W+ C+ + S T KT C S+
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSS 162
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ + C + N ++GD L S D + T+ +F++ C
Sbjct: 163 QCSSLKSATLNEPTC-SKQSNACVYKASYGDSSFS-LGYLSQDVLTLTPSQTLSSFVYGC 220
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--------SPFDDGAI 202
G + QGL GI GL +++++ SQL+ + F+ CL SP +G +
Sbjct: 221 GQDN--QGLFGRTDGIIGLANNELSMLSQLSGKYG--NAFSYCLPTSFSTPNSP-KEGFL 275
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
G ++ S + K+TPL N N PS+ YFI + S+ V G+ + +
Sbjct: 276 SIGT------SSLTPSSSYKFTPLLKNPNN--------PSL-YFIDLESITVAGRPLGVA 320
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP-VAPFGACFRL 321
+ + GT I T L T +Y L A+ + + K + AP ++ CF+
Sbjct: 321 ASSYKVPTIIDSGTVI------TRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKG 374
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIG 381
G I V P I ++ + + G NS+V++ CL + +IG
Sbjct: 375 SLAG---ISEVAPDIRIIFKG-GADLQLKGHNSLVELETGITCLAMAGS----SSIAIIG 426
Query: 382 ARQLENNLLQFDLATSRLGFS 402
Q + + +D+ SR+GF+
Sbjct: 427 NYQQQTVKVAYDVGNSRVGFA 447
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 152/381 (39%), Gaps = 71/381 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTAR 86
+L+YV ++ TP VP + + G ++ W+ C+ SST
Sbjct: 76 SLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVP 135
Query: 87 CGSAQCNLANAKACGGGICGAGPD-----NPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
C S C A A G G C +G + T T G D L++ PG
Sbjct: 136 CASDVCKKLAADAYGSG-CTSGKQCGFAISYADGTSTVGAYSQDKLTLA------PG--A 186
Query: 142 TVPNFIFLCGS-EFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
V NF F CG + ++GL GV+ GLGR + +L ++ FS + L G
Sbjct: 187 IVQNFYFGCGHGKHAVRGLFDGVL---GLGRLRESLGARYGGVFS----YCLPSVSSKPG 239
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
+ G G +TP+ G+P+ + + ++V GK +
Sbjct: 240 FLALGAG--------KNPSGFVFTPM--------GTVPGQPTFST-VTLAGINVGGKKLD 282
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
L + S GG + + T L+++ Y+AL AF AM + R+ P C+
Sbjct: 283 LRPSAFS------GGMIVDSGTVITGLQSTAYRALRSAFRKAM-EAYRLLPNGDLDTCYN 335
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVI 380
L G+ + VVP+I L + ++ N ++ G CL F + G + ++
Sbjct: 336 L--TGYKNV--VVPKIALTFTGGATI-NLDVPNGILVNG----CLAFAESGPDGSAGVLG 386
Query: 381 GARQLENNLLQFDLATSRLGF 401
Q +L FD +TS+ GF
Sbjct: 387 NVNQRAFEVL-FDTSTSKFGF 406
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 90/381 (23%), Positives = 155/381 (40%), Gaps = 59/381 (15%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGSA 90
Y ++ +P + V G + W+ C+ + S T KT C S+
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSS 162
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ + C + N ++GD L S D + T+ +F++ C
Sbjct: 163 QCSSLKSATLNEPTC-SKQSNACVYKASYGDSSFS-LGYLSQDVLTLTPSQTLSSFVYGC 220
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--------SPFDDGAI 202
G + QGL GI GL +++++ SQL+ + F+ CL SP +G +
Sbjct: 221 GQDN--QGLFGRTDGIIGLANNELSMLSQLSGKYG--NAFSYCLPTSFSTPNSP-KEGFL 275
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
G ++ S + K+TPL N N PS+ YFI + S+ V G+ + +
Sbjct: 276 SIGT------SSLTPSSSYKFTPLLKNPNN--------PSL-YFIDLESITVAGRPLGVA 320
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP-VAPFGACFRL 321
+ + GT I T L T +Y L A+ + + K + AP ++ CF+
Sbjct: 321 ASSYKVPTIIDSGTVI------TRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKG 374
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIG 381
G I V P I ++ + + G NS+V++ CL + +IG
Sbjct: 375 SLAG---ISEVAPDIRIIFKG-GADLQLKGHNSLVELETGITCLAMAGS----SSIAIIG 426
Query: 382 ARQLENNLLQFDLATSRLGFS 402
Q + + +D+ SR+GF+
Sbjct: 427 NYQQQTVKVAYDVGNSRVGFA 447
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 88/337 (26%), Positives = 135/337 (40%), Gaps = 64/337 (18%)
Query: 38 KDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------------KGYV---SS 80
+ +L Y ++ TP + + G ++ WV C+ K Y SS
Sbjct: 77 RHAGSLHYA-KVALGTPNATFVVALDTGSDLFWVPCDCKRCAPIANTSELLKPYSPRQSS 135
Query: 81 TNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNP--- 137
T+K C + C+ NA G G C +NT + G + DVL + +
Sbjct: 136 TSKPVTCSHSLCDRPNACGNGNGSCPYTVKYVSANTSSSGVLVEDVLYMTRQSSSSRSGN 195
Query: 138 ----GRAVTVPNFIFLCGSEFV---LQGLASGVVGIAGLGRSKVALPSQLAAAFSL-KRK 189
G AV +F CG E L G A + G+ GLG +V++PS LAAA +
Sbjct: 196 GGNVGEAVGA-RVVFGCGQEQTGAFLDGAA--MEGLLGLGMDRVSVPSLLAAAGLVGSDS 252
Query: 190 FALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
F++C SP +G I FG+ N TP ++K Y I V
Sbjct: 253 FSMCFSPDGNGRINFGEPSDAGAQN--------ETPFIVSKTRPT----------YNISV 294
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM-PKVAR 308
T+V+V GK + +D+ GT +T L Y L +F S + K A
Sbjct: 295 TAVNVKGKGAMAAEFAAVVDS----GTS------FTYLNDPAYSLLATSFNSQVREKRAN 344
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNV 345
++ PF C+ L G T + ++P++ L + V
Sbjct: 345 LSASIPFEYCYALSR-GQTEV--LMPEVSLTTRGGAV 378
>gi|255552249|ref|XP_002517169.1| conserved hypothetical protein [Ricinus communis]
gi|223543804|gb|EEF45332.1| conserved hypothetical protein [Ricinus communis]
Length = 98
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 56/91 (61%), Gaps = 8/91 (8%)
Query: 166 IAGLGRSKVALPSQLAAAFSLKRKFALCLSPF--DDGAIVFGDGPYYDLNNFDVSKNLKY 223
+AGLGR ++LP+ ++A ++FA+CLS +G + FG GPY + N D+ L Y
Sbjct: 1 MAGLGRRNISLPAYFSSALGFPKQFAVCLSSSTKSNGVMFFGAGPYSIIPN-DL---LIY 56
Query: 224 TPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
TPL +N + F+GE + +Y+IGV S+ V
Sbjct: 57 TPLILN--SPVYKFIGESAADYYIGVKSIRV 85
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 98/398 (24%), Positives = 149/398 (37%), Gaps = 76/398 (19%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------------KGYV---S 79
+L Y ++ TP + + G ++ WV C+ + Y S
Sbjct: 105 SLHYA-EVAVGTPNATFLVALDTGSDLFWVPCDCKQCAPIANASDLRGGPDLRPYSPGKS 163
Query: 80 STNKTARCGSAQCNLANAKACGGGICGAGPDNP---ISNTGTHGDIRIDVLSIQSTDGRN 136
ST+K C A C NA A G + P +NT + G + DVL +
Sbjct: 164 STSKAVTCEHALCERPNACAAAGNSSTSCPYTVRYVSANTSSSGVLVEDVLHLSREAAGG 223
Query: 137 PGRAVTVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQL-AAAFSLKRKFAL 192
AVT P + CG + L G A V G+ GLG KV++PS L AA F++
Sbjct: 224 ASTAVTAP-VVLGCGQVQTGAFLDGAA--VDGLLGLGMDKVSVPSVLHAAGLVASDSFSM 280
Query: 193 CLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
C SP G I FGD + TP + + Y I VT++
Sbjct: 281 CFSPDGFGRINFGD---------SGRRGQAETPFTVRNTHPT----------YNISVTAM 321
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM-PKVARVAP 311
V+GK V + + + +T L Y L F S + + A ++
Sbjct: 322 SVSGKEVAAEFAAI-----------VDSGTSFTYLNDPAYTELATGFNSEVRERRANLSA 370
Query: 312 VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNV------VWSIHGQNSMVQIGGDALCL 365
PF C+ L G + VP++ L + V + I+G+ S +I CL
Sbjct: 371 SIPFEYCYEL---GRGQTELFVPEVSLTTRGGAVFPVTRPIVVIYGETSDGRIVAAGYCL 427
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
+ N T +IG + + FD S LG+ +
Sbjct: 428 AVLK---NDITIDIIGQNFMTGLKVVFDRERSVLGWHE 462
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 100/404 (24%), Positives = 155/404 (38%), Gaps = 84/404 (20%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGY----------VSST 81
T T Y T+IK TP + V G +ILWV+C + G SST
Sbjct: 81 TDTGLYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASST 140
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQ--ST 132
C A C A GG + G + P + T+GD D L +
Sbjct: 141 GSMVMCDQAFC----AATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTR 196
Query: 133 DGR-NPGRAVTVPNFIFLCGSEFVLQGLASG-----VVGIAGLGRSKVALPSQLAAAFSL 186
DG+ P A + IF CG++ QG G + GI G G + ++ SQL A +
Sbjct: 197 DGQTQPANA----SVIFGCGAQ---QGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKV 249
Query: 187 KRKFALCLSPFDDGAIV-FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
K+ FA CL G I GD V +K TPL +K + Y
Sbjct: 250 KKIFAHCLDTIKGGGIFSIGD---------VVQPKVKTTPLVADKPH------------Y 288
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
+ + ++ V G + L + + GT I + T L ++K ++ A +
Sbjct: 289 NVNLKTIDVGGTTLQLPAHIFEPGEK--KGTIIDSGTTLTYLPELVFKEVMLAVFNKHQD 346
Query: 306 VARVAPVAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA 362
+ V F CF+ D GF P I ++ ++ ++ G D
Sbjct: 347 IT-FHDVQGF-LCFQYPGSVDDGF-------PTITFHFED-DLALHVYPHEYFFANGNDV 396
Query: 363 LCLGFVDGGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
C+GF +G + +++G L N L+ +DL +G++D
Sbjct: 397 YCVGFQNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTD 440
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 109/265 (41%), Gaps = 32/265 (12%)
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD- 199
V+VP F CG + G + G G+ GLGR ++L SQL + KF+ CL+ DD
Sbjct: 195 VSVPEVAFGCGEDNEGSGFSQGS-GLVGLGRGPLSLVSQLK-----EPKFSYCLTSVDDT 248
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
A G + D +K TPL N +PS Y++ + + V ++
Sbjct: 249 KASTLLMGSLASVKASD--SEIKTTPLIQNSA--------QPSF-YYLSLEGISVGDTSL 297
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
P+ K+ S+ +G GG I + T LE S + + + F S + + CF
Sbjct: 298 PIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCF 357
Query: 320 RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV---QIGGDALCLGFVDGGVNPRT 376
L G T I VP+ LV + +N M+ +G L +G G
Sbjct: 358 TLPS-GSTDI--EVPK--LVFHFDGADLELPAENYMIADASMGVACLAMGSSSG------ 406
Query: 377 SIVIGARQLENNLLQFDLATSRLGF 401
+ G Q +N L+ DL L F
Sbjct: 407 MSIFGNIQQQNMLVLHDLEKETLSF 431
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 93/392 (23%), Positives = 148/392 (37%), Gaps = 64/392 (16%)
Query: 37 AKDTATL---QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--GY------------VS 79
+K +TL YV + +P + G ++ W CE GY S
Sbjct: 136 SKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTS 195
Query: 80 STNKTARCGSAQCN-----LANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQST 132
+ C S C N+ C C G G++ G + LS+ ST
Sbjct: 196 LSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRY---GDGSYSIGFFAREKLSLTST 252
Query: 133 DGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
D N NF F CG +GL G G+ GL R+ ++L SQ A + + L
Sbjct: 253 DVFN--------NFQFGCGQNN--RGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCL 302
Query: 193 CLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
S G + FG G SK +K+TP +N PS YF+ + +
Sbjct: 303 PSSSSSTGYLSFGSGD-------GDSKAVKFTPSEVNS--------DYPSF-YFLDMVGI 346
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
V + +P+ K++ S GT I + + L ++Y ++ + F M RV V
Sbjct: 347 SVGERKLPIPKSVFS-----TAGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGV 401
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV 372
+ C+ L + VP+I L + + + + +CL F G
Sbjct: 402 SILDTCYDLSKYKTVK----VPKIILYFSG-GAEMDLAPEGIIYVLKVSQVCLAFA-GNS 455
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
+ +IG Q + + +D A R+GF+ S
Sbjct: 456 DDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPS 487
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 101/406 (24%), Positives = 154/406 (37%), Gaps = 72/406 (17%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGY----------VSST 81
T T Y T+IK TP + V G +ILWV+C + G SS+
Sbjct: 82 TDTGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSS 141
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDG 134
T C C A GG + G + P + +GD D L G
Sbjct: 142 GSTVSCDQGFC----AATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTG 197
Query: 135 R---NPGRAVTVPNFIFLCGSEFVLQGLASG-----VVGIAGLGRSKVALPSQLAAAFSL 186
PG A F CG++ QG G + GI G G++ ++ SQLAAA
Sbjct: 198 DGQTQPGNATIT----FGCGAQ---QGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKA 250
Query: 187 KRKFALCLSPFDDGAIVFGDGPYYDLNNFDV---SKNLKYTPLFINKVNTASGFLGEPSV 243
K+ FA CL G I F G + V + L PLF+ L P
Sbjct: 251 KKIFAHCLDTIKGGGI-FAIGNVVQPKCYFVFFFAHGLLNIPLFL----LVMILLSRP-- 303
Query: 244 EYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
Y + + S+ V G + L + + GT I + T L ++K ++ S
Sbjct: 304 HYNVNLKSIDVGGTTLQLPAHVFETGEK--KGTIIDSGTTLTYLPELVFKQVMDVVFSKH 361
Query: 304 PKVARVAPVAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
+A + F CF+ D GF P I ++ ++ ++ G
Sbjct: 362 RDIA-FHNLQDF-LCFQYSGSVDDGF-------PTITFHFED-DLALHVYPHEYFFPNGN 411
Query: 361 DALCLGFVDGGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
D C+GF +G + + +++G L N L+ +DL +G++D
Sbjct: 412 DIYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENQVIGWTD 457
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 86/382 (22%), Positives = 153/382 (40%), Gaps = 56/382 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+Y+ + TP + V G +++W C + + T A+ C
Sbjct: 91 EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 104 ICGAGPDNPISNTGT------HGD--IRIDVLSIQS-TDGRNPGRAVTVPNFIFLCGSEF 154
+C A P +GD VL+ ++ T G V V + F CG+
Sbjct: 151 LCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGN-- 208
Query: 155 VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF---DDGAIVFGDGPYYD 211
+ G + G+ GLGR ++L SQL + +F+ CL+ F + + FG +
Sbjct: 209 INSGQLANSSGMVGLGRGPLSLVSQLGPS-----RFSYCLTSFLSPEPSRLNFGV--FAT 261
Query: 212 LNNFDVSKN---LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
LN + S + ++ TPL +N PS+ YF+ + + + K +P++ + +I
Sbjct: 262 LNGTNASSSGSPVQSTPLVVNA--------ALPSL-YFMSLKGISLGQKRLPIDPLVFAI 312
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTR 328
+++G GG I + T L+ Y A+ + S + P+ P +IG
Sbjct: 313 NDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVL------RPLPPTND----TEIGLET 362
Query: 329 IGPVVPQIDLVLQNKNVVWSIHG--------QNSMVQIGGDA-LCLGFVDGGVNPRTSIV 379
P P + + ++ G +N M+ G LCL + G + +
Sbjct: 363 CFPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSG----DATI 418
Query: 380 IGARQLENNLLQFDLATSRLGF 401
IG Q +N + +D+A S L F
Sbjct: 419 IGNYQQQNMHILYDIANSLLSF 440
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 71/290 (24%), Positives = 108/290 (37%), Gaps = 63/290 (21%)
Query: 57 PVKLTVHLGGNILWVDC-------------EKGYVSSTN--------KTARCGSAQCNLA 95
PV L + G +++W C G +S+N + C S C+ A
Sbjct: 97 PVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNNNSSNPLPPPTDSRRIPCASPFCSAA 156
Query: 96 NAKACGGGICGAG--PDNPISNTGTHGDIRIDVLSIQSTDGR-----NPGR-----AVTV 143
++ A +C A P + I L DG GR +V V
Sbjct: 157 HSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLVARLRRGRVGIAASVAV 216
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--------S 195
NF F C + + VG+AG GR ++LP+QLA A +L +F+ CL
Sbjct: 217 ENFTFACAHTALGE-----PVGVAGFGRGPLSLPAQLAPA-ALSGRFSYCLVAHSFRADR 270
Query: 196 PFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF--IGVTSVH 253
P ++ G P D + + YTPL N P YF + + +V
Sbjct: 271 PIRPSPLILGRSPGEDPAS---ETGIVYTPLLHN-----------PKHPYFYSVALEAVS 316
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
V G +P L + G GG + + +T+L Y + + F AM
Sbjct: 317 VGGTRIPARPELGRVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAM 366
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 86/382 (22%), Positives = 152/382 (39%), Gaps = 56/382 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+Y+ + TP + V G +++W C + + T A+ C
Sbjct: 91 EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 104 ICGAGPDNPISNTGT------HGD--IRIDVLSIQS-TDGRNPGRAVTVPNFIFLCGSEF 154
+C A P +GD VL+ ++ T G V V + F CG+
Sbjct: 151 LCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGN-- 208
Query: 155 VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF---DDGAIVFGDGPYYD 211
+ G + G+ GLGR ++L SQL + +F+ CL+ F + + FG +
Sbjct: 209 INSGQLANSSGMVGLGRGPLSLVSQLGPS-----RFSYCLTSFLSPEPSRLNFGV--FAT 261
Query: 212 LNNFDVSKN---LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
LN + S + ++ TPL +N PS+ YF+ + + + K +P++ + +I
Sbjct: 262 LNGTNASSSGSPVQSTPLVVNA--------ALPSL-YFMSLKGISLGQKRLPIDPLVFAI 312
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTR 328
+++G GG I + T L+ Y A+ S + P+ P +IG
Sbjct: 313 NDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVL------RPLPPTND----TEIGLET 362
Query: 329 IGPVVPQIDLVLQNKNVVWSIHG--------QNSMVQIGGDA-LCLGFVDGGVNPRTSIV 379
P P + + ++ G +N M+ G LCL + G + +
Sbjct: 363 CFPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSG----DATI 418
Query: 380 IGARQLENNLLQFDLATSRLGF 401
IG Q +N + +D+A S L F
Sbjct: 419 IGNYQQQNMHILYDIANSLLSF 440
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 83/367 (22%), Positives = 146/367 (39%), Gaps = 46/367 (12%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGP--- 109
TP P K+ + LG ++LW C ++ +A+ + + C +C AG
Sbjct: 115 TPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAGTFTN 174
Query: 110 ----DNPISNTGTHGDIR-IDVLSIQS-TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGV 163
D + +G + VL+ ++ T G + G + N F CG + G +
Sbjct: 175 KTCTDRKCAYENDYGIMTATGVLATETFTFGAHHGVSA---NLTFGCGK--LANGTIAEA 229
Query: 164 VGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLNNFDVSKN 220
GI GL +++ QLA KF+ CL+PF D ++FG DL + +
Sbjct: 230 SGILGLSPGPLSMLKQLAIT-----KFSYCLTPFADRKTSPVMFG--AMADLGKYKTTGK 282
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
++ PL N V + Y++ + + V K + + + L+I +G GGT + +
Sbjct: 283 VQTIPLLKNPVE---------DIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDS 333
Query: 281 VNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVL 340
L + L +A + V + CF L G + G VP LVL
Sbjct: 334 ATTLAYLVEPAFTELKKAVMEGIKLPVANRSVDDYPVCFELPR-GMSMEGVQVPP--LVL 390
Query: 341 Q-NKNVVWSIHGQNSMVQIGGDALCLGFV----DGGVNPRTSIVIGARQLENNLLQFDLA 395
+ + S+ N + +CL + +G N VIG Q +N + +D+
Sbjct: 391 HFDGDAEMSLPRDNYFQEPSPGMMCLAVMQAPFEGAPN-----VIGNVQQQNMHVLYDVG 445
Query: 396 TSRLGFS 402
+ ++
Sbjct: 446 NRKFSYA 452
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 94/403 (23%), Positives = 149/403 (36%), Gaps = 66/403 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV---------------------SSTNK 83
Y + TP VKL + G +++W C YV SS++K
Sbjct: 84 YSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSK 143
Query: 84 TARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
C + +C + P N I + ST G +
Sbjct: 144 LIGCKNPKCAWVFGSSVQSKCHNCNPQ--AQNCTQACPPYIIQYGLGSTAGLLLSETINF 201
Query: 144 PNFI---FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFD 198
PN FL G + G IAG GRS+ +LP QL +KF+ CL FD
Sbjct: 202 PNKTISDFLAGCSLLSTRQPEG---IAGFGRSQESLPLQLGL-----KKFSYCLVSRRFD 253
Query: 199 DGAI----VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
D + + GP + + L YTP N + ++ E Y++ + + V
Sbjct: 254 DSPVSSDLILDMGPSTSDSK---TTGLSYTPFQKNLASQSNPAFQE---YYYVMLRKIIV 307
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK---VARVAP 311
V + + L ++G GGT + + + +T +E +++ L + F M V
Sbjct: 308 GKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQK 367
Query: 312 VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKN-VVWSIHGQNSMVQIGGDALCLGFV-- 368
+ CF DI + V+P DL Q K + N + +CL V
Sbjct: 368 LTGLRPCF---DISGEK-SVVIP--DLTFQFKGGAKMQLPLSNYFAFVDMGVVCLTIVSD 421
Query: 369 -------DGGVNPR-TSIVIGARQLENNLLQFDLATSRLGFSD 403
DGGV +I++G Q +N +++DL R GF +
Sbjct: 422 NAAALGGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKE 464
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 141/380 (37%), Gaps = 62/380 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSA 90
YV + TP + L G ++ W CE SS+ C S+
Sbjct: 46 YVVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSS 105
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDV-------LSIQSTDGRNPGRAVTV 143
C + C + D +GD V L+I +TD V
Sbjct: 106 LCTQLTSDGIKSE-CSSSTDASCIYDAKYGDNSTSVGFLSQERLTITATD--------IV 156
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--GA 201
+F+F CG + +GL +G G+ GLGR +++ Q ++ + + F+ CL G
Sbjct: 157 DDFLFGCGQDN--EGLFNGSAGLMGLGRHPISIVQQTSSNY--NKIFSYCLPATSSSLGH 212
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
+ FG + +L YTPL S G+ S Y + + S+ V G +P
Sbjct: 213 LTFGASAA-------TNASLIYTPL--------STISGDNSF-YGLDIVSISVGGTKLP- 255
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
+S GG+ I + T L ++Y AL AF M K C+ L
Sbjct: 256 ---AVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDL 312
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIG 381
G+ I VP+ID V H V+ +CL F G + + V G
Sbjct: 313 S--GYKEIS--VPRIDFEFSGGVTVELXHRGILXVE-SEQQVCLAFAANGSDNDIT-VFG 366
Query: 382 ARQLENNLLQFDLATSRLGF 401
Q + + +D+ R+GF
Sbjct: 367 NVQQKTLEVVYDVKGGRIGF 386
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 110/271 (40%), Gaps = 34/271 (12%)
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
+PNF+ C + Q GIAG GR +LPSQ+ +KFA CL+ +
Sbjct: 203 IPNFVVGCSFLSIHQP-----SGIAGFGRGSESLPSQMGL-----KKFAYCLA-----SR 247
Query: 203 VFGDGPYYDLNNFDV----SKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
F D P+ D S L YTP N + + + Y++ + + V +A
Sbjct: 248 KFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAY----KEYYYLNIRKIIVGNQA 303
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
V + L +G GG+ I + + +T ++ + + + + F + R V
Sbjct: 304 VKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGL 363
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKN-VVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRT 376
DI + V +L+ Q K W++ N + + CL V +
Sbjct: 364 RPCFDISKEK---SVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420
Query: 377 ------SIVIGARQLENNLLQFDLATSRLGF 401
S+++GA Q +N +++DL RLGF
Sbjct: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGF 451
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 110/271 (40%), Gaps = 34/271 (12%)
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
+PNF+ C + Q GIAG GR +LPSQ+ +KFA CL+ +
Sbjct: 203 IPNFVVGCSFLSIHQP-----SGIAGFGRGSESLPSQMGL-----KKFAYCLA-----SR 247
Query: 203 VFGDGPYYDLNNFDV----SKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
F D P+ D S L YTP N + + + Y++ + + V +A
Sbjct: 248 KFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAY----KEYYYLNIRKIIVGNQA 303
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
V + L +G GG+ I + + +T ++ + + + + F + R V
Sbjct: 304 VKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGL 363
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKN-VVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRT 376
DI + V +L+ Q K W++ N + + CL V +
Sbjct: 364 RPCFDISKEK---SVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420
Query: 377 ------SIVIGARQLENNLLQFDLATSRLGF 401
S+++GA Q +N +++DL RLGF
Sbjct: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGF 451
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 99/393 (25%), Positives = 151/393 (38%), Gaps = 60/393 (15%)
Query: 29 PKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYVSST---NK 83
P VL++A YV ++K TP + + + + WV C G S+T N
Sbjct: 88 PGQQVLKIAN------YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTTFLPNA 141
Query: 84 TARCGSAQCNLANAKACGGGICGA-GPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
+ GS C+ A G C A G + N GD + +Q D V
Sbjct: 142 STTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATLVQ--DAITLANDV- 198
Query: 143 VPNFIFLCGSEFVLQGLASGVV---GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
+P F F C + ++ G + G+ GLGR ++L SQ A +S F+ CL F
Sbjct: 199 IPGFTFGC-----INAVSGGSIPPQGLLGLGRGPISLISQAGAMYS--GVFSYCLPSFKS 251
Query: 200 GAIVFGDGPYY-----DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
YY L K+++ TPL N PS+ Y++ +T V V
Sbjct: 252 ---------YYFSGSLKLGPVGQPKSIRTTPLLRNP--------HRPSL-YYVNLTGVSV 293
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
VP+ L D GT I + T +Y A+ F + ++ +
Sbjct: 294 GRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NGPISSLGA 351
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVN 373
F CF + P I L + N+V + +NS++ +L CL N
Sbjct: 352 FDTCFAATNEA------EAPAITLHFEGLNLVLPM--ENSLIHSSSGSLACLSMAAAPNN 403
Query: 374 PRTSI-VIGARQLENNLLQFDLATSRLGFSDSL 405
+ + VI Q +N + FD SRLG + L
Sbjct: 404 VNSVLNVIANLQQQNLRIMFDTTNSRLGIAREL 436
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 91/409 (22%), Positives = 159/409 (38%), Gaps = 68/409 (16%)
Query: 18 LSPITAQ-TSFRPKALVLRVAKDTA--TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE 74
L P+ + T ++P+AL V + + +Y ++I TP + L + G ++ W+ CE
Sbjct: 132 LKPVNNEDTRYQPEALTTPVVSGVSQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE 191
Query: 75 -------------KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD 121
SST K+ C + QC+L AC C ++GD
Sbjct: 192 PCSDCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQV--------SYGD 243
Query: 122 IRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLA 181
V + +TD G + + + CG + +GL +G G+ GLG +++ +Q+
Sbjct: 244 GSFTVGEL-ATDTVTFGNSGKINDVALGCGHDN--EGLFTGAAGLLGLGGGALSITNQMK 300
Query: 182 AAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGE 240
A F+ CL D G D N+ + PL N K++T
Sbjct: 301 AT-----SFSYCLVDRDSGK-----SSSLDFNSVQLGSGDATAPLLRNQKIDTF------ 344
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFA 300
Y++G++ V G+ V + + +D G GG + T L+T Y +L AF
Sbjct: 345 ----YYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFL 400
Query: 301 SAMPKVAR-VAPVAPFGACFRLQDIGFTRIGPVVPQI----DLVLQNKNVVWSIHGQNSM 355
+ + + ++ F C+ + ++ V L L KN + + +
Sbjct: 401 KLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLDLPAKNYLIPVDDNGT- 459
Query: 356 VQIGGDALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
C F P +S +IG Q + + +DLA +G S
Sbjct: 460 -------FCFAFA-----PTSSSLSIIGNVQQQGTRITYDLANKIIGLS 496
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 89/389 (22%), Positives = 155/389 (39%), Gaps = 64/389 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
Y T++ TP L V G + +V C + +SST + +C +
Sbjct: 81 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKC-TLD 139
Query: 92 CNLANAK-ACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
CN N + C + +++G G+ DV+S + P RAV F C
Sbjct: 140 CNCDNDRMQC---VYERQYAEMSTSSGVLGE---DVVSFGNQSELAPQRAV------FGC 187
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGD-G 207
+ + GI GLGR +++ QL + F+LC D GA+V G
Sbjct: 188 ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGIS 247
Query: 208 PYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS 267
P D+ +F S + Y I + +HV GK +PLN ++
Sbjct: 248 PPSDM-------------VFAQSDPVRSPY-------YNIDLKEIHVAGKRLPLNPSVF- 286
Query: 268 IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP--FGACFRLQDIG 325
+G G+ + + Y L + A +A + ++++ P CF I
Sbjct: 287 ---DGKHGSVLDSGTTYAYLPEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGID 343
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGAR 383
+++ P +D++ N + +S+ +N M + A CLG G +P T ++G
Sbjct: 344 VSQLSKTFPVVDMIFGNGH-KYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTT--LLGGI 400
Query: 384 QLENNLLQFDLATSRLGFSDS---LLFER 409
+ N L+ +D +++GF + L+ER
Sbjct: 401 VVRNTLVLYDREQTKIGFWKTNCAELWER 429
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 85/339 (25%), Positives = 134/339 (39%), Gaps = 62/339 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCG 88
T +Y+ + TP PV+LT+ G +++W C+ SST C
Sbjct: 79 TTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCD 138
Query: 89 SAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
S C +CG P+ T ++GD + ++ G +VP F
Sbjct: 139 STLCQGLPVASCGSP--KFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAF 196
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGD 206
CG F S GIAG GR ++LPSQL F+ C + + + V D
Sbjct: 197 GCG-LFNNGVFKSNETGIAGFGRGPLSLPSQLKVG-----NFSHCFTAVNGLKPSTVLLD 250
Query: 207 GPYYDLNNFDVSKN----LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
P D+ K+ ++ TPL N N P+ Y++ + + V +P+
Sbjct: 251 LPA------DLYKSGRGAVQSTPLIQNPAN--------PTF-YYLSLKGITVGSTRLPVP 295
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKVA--RVAPVAPFGAC 318
++ ++ N G GGT I + T L T +Y+ + AFA+ +P V+ P A
Sbjct: 296 ESEFALKN-GTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAP 354
Query: 319 FRLQDIGFTRIGPVVPQIDLV-------LQNKNVVWSIH 350
R + P VP++ L L +N VW H
Sbjct: 355 LRAK--------PYVPKLVLHFEGATMDLPRENYVWLKH 385
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 97/391 (24%), Positives = 148/391 (37%), Gaps = 85/391 (21%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVD---CEKGY----------VSSTNKTARCGSA 90
+Y+ I TP VP+ G +++W CE Y SST + C S+
Sbjct: 85 EYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSS 144
Query: 91 QCNLANAKACGGGICGAGPDNPIS-------NTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
QC +C + +N S N+ T GD+ +D +++ S+ GR P V++
Sbjct: 145 QCRALEDASC------STDENTCSYTITYGDNSYTKGDVAVDTVTMGSS-GRRP---VSL 194
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---- 199
N I CG E GI GLG +L SQL S+ KF+ CL PF
Sbjct: 195 RNMIIGCGHENT-GTFDPAGSGIIGLGGGSTSLVSQLRK--SINGKFSYCLVPFTSETGL 251
Query: 200 --------GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTS 251
IV GDG V + S +P+ YF+ + +
Sbjct: 252 TSKINFGTNGIVSGDG-----------------------VVSTSMVKKDPATYYFLNLEA 288
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA- 310
+ V K + T+ G G I + T+L ++ Y L AS + K RV
Sbjct: 289 ISVGSKKIQFTSTIFGT---GEGNIVIDSGTTLTLLPSNFYYELESVVASTI-KAERVQD 344
Query: 311 PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG 370
P C+R F VP I + + +V + N+ V + D C F
Sbjct: 345 PDGILSLCYR-DSSSFK-----VPDITVHFKGGDV--KLGNLNTFVAVSEDVSCFAF--- 393
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
N + +I Q+ N L+ +D + + F
Sbjct: 394 AANEQLTIFGNLAQM-NFLVGYDTVSGTVSF 423
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 92/391 (23%), Positives = 152/391 (38%), Gaps = 67/391 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKA----- 99
Y +I +P + V G +ILWV+C S+ K + G L N K+
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVG--CSNCPKKSDIG-VDLQLYNPKSSSTST 129
Query: 100 ---CGGGICGAGPDNPI----------------SNTGTHGDIRIDVLSIQSTDGRNPGRA 140
C C A D PI + T G D + +Q G N +
Sbjct: 130 LITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVG-NHKTS 188
Query: 141 VTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
T + +F CG++ L + + GI G G++ ++ SQLAA +K+ FA CL
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSIS 248
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
G I F G V LK TP+ N+ + Y + + V V A
Sbjct: 249 GGGI-FAIGEV-------VEPKLKTTPVVPNQAH------------YNVVLNGVKVGDTA 288
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
+ L L + G I + L SIY L++ A P + ++ V C
Sbjct: 289 LDLPLGLFETSYK--RGAIIDSGTTLAYLPDSIYLPLMEKILGAQPDL-KLRTVDDQFTC 345
Query: 319 FRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
F D GF P + + ++++ +I+ + QI D C+G+ + G +
Sbjct: 346 FVFDKNVDDGF-------PTVTFKFE-ESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSK 397
Query: 376 TS---IVIGARQLENNLLQFDLATSRLGFSD 403
++G L+N L+ ++L +G+++
Sbjct: 398 DGNEVTLLGDLVLQNKLVYYNLENQTIGWTE 428
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 90/395 (22%), Positives = 149/395 (37%), Gaps = 70/395 (17%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK-------------GYVSSTNKTAR 86
T + YV TP P + L G ++W C++ S+T +
Sbjct: 46 TQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEP 105
Query: 87 CGSAQCNL--ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
CG+ C ++++ C G +C +TG G + D ++ G A
Sbjct: 106 CGTPLCESIPSDSRNCSGNVCAYQASTNAGDTG--GKVGTDTFAV--------GTAKASL 155
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG---A 201
F + S+ G SG+V GLGR+ +L +Q A F+ CL+P D G A
Sbjct: 156 AFGCVVASDIDTMGGPSGIV---GLGRTPWSLVTQTGVA-----AFSYCLAPHDAGKNSA 207
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
+ G TP F+N SG + S Y + + + +PL
Sbjct: 208 LFLGSS-----AKLAGGGKAASTP-FVN----ISGNGNDLSNYYKVQLEGLKAGDAMIPL 257
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
+ ++ + T +P + L Y+A+ +A A+ PV PF
Sbjct: 258 PPSGSTV--------LLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPF------ 303
Query: 322 QDIGFTRIGPVVPQIDLVLQNK-NVVWSIHGQNSMVQIGGDALCLGFVDGG-VNPRTSI- 378
D+ F + G DLV + ++ N ++ +CL + +N T +
Sbjct: 304 -DLCFPKSGASGAAPDLVFTFRGGAAMTVAASNYLLDYKNGTVCLAMLSSARLNSTTELS 362
Query: 379 VIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
++G+ Q EN FDL ++L FE A CT
Sbjct: 363 LLGSLQQENIHFLFDLD------KETLSFEPADCT 391
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 92/395 (23%), Positives = 149/395 (37%), Gaps = 70/395 (17%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-------EKG------YVSSTNKTAR 86
T + YV TP P + L G ++W C E+G S+T +
Sbjct: 46 TQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEP 105
Query: 87 CGSAQCNL--ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
CG+ C ++ + C G +C +TG G + D ++ G A
Sbjct: 106 CGTPLCESIPSDVRNCSGNVCAYEASTNAGDTG--GKVGTDTFAV--------GTAKASL 155
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG---A 201
F + S+ G SG+V GLGR+ +L +Q A F+ CL+P D G A
Sbjct: 156 AFGCVVASDIDTMGGPSGIV---GLGRTPWSLVTQTGVA-----AFSYCLAPHDAGKNSA 207
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
+ G TP F+N SG + S Y + + + +PL
Sbjct: 208 LFLGSS-----AKLAGGGKAASTP-FVN----ISGNGNDLSNYYKVQLEGLKAGDAMIPL 257
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
+ ++ + T +P + L Y+A+ +A A+ PV PF
Sbjct: 258 PPSGSTV--------LLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPF------ 303
Query: 322 QDIGFTRIGPVVPQIDLVLQNK-NVVWSIHGQNSMVQIGGDALCLGFVDGG-VNPRTSI- 378
D+ F + G DLV + ++ N ++ +CL + +N T +
Sbjct: 304 -DLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELS 362
Query: 379 VIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
++G+ Q EN FDL ++L FE A CT
Sbjct: 363 LLGSLQQENIHFLFDLD------KETLSFEPADCT 391
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 97/388 (25%), Positives = 148/388 (38%), Gaps = 78/388 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ I TP P+ G ++LW C+ SST K C S+
Sbjct: 93 EYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSS 152
Query: 91 QCN-LANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNPGRAVT 142
QC L N +C + DN S + ++GD I +D L++ STD R V
Sbjct: 153 QCTALENQASC------STEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTD----TRPVQ 202
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
+ N I CG GI GLG V+L +QL S+ KF+ CL P
Sbjct: 203 LKNIIIGCGHNNA-GTFNKKGSGIVGLGGGAVSLITQLGD--SIDGKFSYCLVPL----- 254
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFI--NKVNTASGFLGEPSVE------YFIGVTSVHV 254
S+N + + + N V + +G + P + Y++ + S+ V
Sbjct: 255 --------------TSENDRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISV 300
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
K V S G G I + T+L T Y L A AS++ + P
Sbjct: 301 GSKEVQYPG---SDSGSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTG 357
Query: 315 FGACFRLQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN 373
C ++ G + VP I + +V ++ N VQI D +C F +
Sbjct: 358 LSLC-------YSATGDLKVPAITMHFDGADV--NLKPSNCFVQISEDLVCFAFRG---S 405
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGF 401
P SI Q+ N L+ +D + + F
Sbjct: 406 PSFSIYGNVAQM-NFLVGYDTVSKTVSF 432
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 92/398 (23%), Positives = 150/398 (37%), Gaps = 72/398 (18%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGY----------VSST 81
T T Y T+++ TP + V G +ILWV+C + G SST
Sbjct: 83 TDTGLYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASST 142
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDG 134
T C C A GG + + P + T+GD D L G
Sbjct: 143 GSTVMCDQGFC----ADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTG 198
Query: 135 RNPGRAVTVPNFIFLCGSEF--VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
+ + IF CG++ L + + GI G G + ++ SQLA A +K+ FA
Sbjct: 199 DGQTQPANA-SVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAH 257
Query: 193 CLSPFDDGAI-VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTS 251
CL G I GD V +K TPL +K + Y + + +
Sbjct: 258 CLDTIKGGGIFAIGDV---------VQPKVKTTPLVADKPH------------YNVNLKT 296
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+ V G + L + + GT I + T L ++K ++ A + +
Sbjct: 297 IDVGGTTLELPADIFKPGEK--RGTIIDSGTTLTYLPELVFKKVMLAVFNKHQDIT-FHD 353
Query: 312 VAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV 368
V F CF D GF P + ++ ++ ++ G D C+GF
Sbjct: 354 VQDF-LCFEYSGSVDDGF-------PTLTFHFED-DLALHVYPHEYFFPNGNDVYCVGFQ 404
Query: 369 DGGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
+G + + +++G L N L+ +DL +G++D
Sbjct: 405 NGALQSKDGKDIVLMGDLVLSNKLVVYDLENRVIGWTD 442
>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
Length = 432
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 72/293 (24%), Positives = 122/293 (41%), Gaps = 63/293 (21%)
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL------ 194
+ VP F F C + + +GIAG G+ ++LPSQL L + F+ C
Sbjct: 162 LDVPGFCFGCVGSSIRE-----PIGIAGFGKGILSLPSQLGF---LDKGFSHCFLGFRFA 213
Query: 195 -SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
+P +++ GD + +TP+ + N P+ Y+IG+ V
Sbjct: 214 RNPNFTSSLIMGD------LALSAKDDFLFTPMLKSITN--------PNF-YYIGLEGVS 258
Query: 254 V-NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--PKVARVA 310
+ +G A+ +L SID+EG GG + T YT L Y A++ + AS + + +
Sbjct: 259 IGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSLASVILYERSYDLE 318
Query: 311 PVAPFGACFRL--------QD----IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI 358
F CF++ QD I F +G + L L + +++ + V +
Sbjct: 319 MRTGFDLCFKIPCTHTPCTQDELPLINFHFLG----DVKLTLPKDSCYYAVTAPKNSVVV 374
Query: 359 GGDALCLGFVD----------GGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
CL F GG N V+G+ Q++N + +D+ R+GF
Sbjct: 375 K----CLLFQRMDNDDDDDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGF 423
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/349 (24%), Positives = 140/349 (40%), Gaps = 51/349 (14%)
Query: 78 VSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTG-THGDIRIDVLSIQSTDGRN 136
V++++ G CN N+K C P + TG + GD ++ L+
Sbjct: 156 VNTSSPDVHLGCPPCN-GNSKNCSHA---CPPYSLQYGTGASSGDFLLENLNF------- 204
Query: 137 PGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP 196
PG+ T+ F+ C + V + ++ +AG GRS +LP Q+ +KFA CL+
Sbjct: 205 PGK--TIHEFLVGCTTSAVGEVTSAA---LAGFGRSMFSLPMQMGV-----KKFAYCLNS 254
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
D D ++ + +K L Y P N + + Y++GV + +
Sbjct: 255 HDYDDTRNSSKLILDYSDGE-TKGLSYAPFLKNPPDFP--------IYYYLGVKDIKIGN 305
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR-VAPVAPF 315
K + + L+ ++G GG I + Y + ++K + M K R + A
Sbjct: 306 KLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAEAEI 365
Query: 316 G--ACFRLQDIGFTRIGPVVPQIDLVLQNK-NVVWSIHGQNSMVQIGGDAL-CLGF-VDG 370
G C+ +I DL+ Q + + G+N V I +L C D
Sbjct: 366 GVTPCYNFTGQKSIKIP------DLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDA 419
Query: 371 GVN-----PRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTF 414
G N P SI++G Q + ++FDL RLGF +CTF
Sbjct: 420 GTNTLEFTPGPSIILGNSQHVDYYVEFDLKNERLGFRQQTC---QSCTF 465
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 92/384 (23%), Positives = 143/384 (37%), Gaps = 71/384 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV + TP + G + WV C+ V SST C
Sbjct: 177 TGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSC 236
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPN 145
+ C+ + C GG C G G++ G +D L++ S D V
Sbjct: 237 AAPACSDLDVSGCSGGHCLYGVQY---GDGSYSIGFFAMDTLTLSSYD--------AVKG 285
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA--IV 203
F F CG GL G+ GLGR K +LP Q + FA CL P G +
Sbjct: 286 FRFGCGERN--DGLFGEAAGLLGLGRGKTSLPVQTYGKYG--GVFAHCLPPRSTGTGYLD 341
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG G TP+ G Y++G+T + V G+ +P+
Sbjct: 342 FGAGS---------PPATTTTPMLT----------GNGPTFYYVGMTGIRVGGRLLPIAP 382
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA--RVAPVAPFGACFRL 321
++ + GT + + T L + Y +L AFA+AM + A V+ C+
Sbjct: 383 SVFA-----AAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDF 437
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV---DGGVNPRTSI 378
G +++ +P + L+ Q + M + +CL F DGG
Sbjct: 438 --TGMSQV--AIPTVSLLFQG-GAALDVDASGIMYTVSASQVCLAFAGNEDGG----DVG 488
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
++G QL+ + +D+ +GFS
Sbjct: 489 IVGNTQLKTFGVAYDIGKKVVGFS 512
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 91/395 (23%), Positives = 151/395 (38%), Gaps = 66/395 (16%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYVSSTNKTARCGSAQ 91
T T Y T+I+ +P + V G +ILWV+C G + GS
Sbjct: 79 TDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT 138
Query: 92 CNLANAKACGGGICGAGP------DNPISNTGTHGD-------IRIDVLSIQSTDGRNPG 138
+ C G P +P T+GD D + G N
Sbjct: 139 TVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSG-NGQ 197
Query: 139 RAVTVPNFIFLCGSEF--VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP 196
+ + F CG++ L + GI G G+S ++ SQLAAA +++ FA CL
Sbjct: 198 TTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDT 257
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
G I + + N V +K TPL N + Y + + + V G
Sbjct: 258 VRGGGI-------FAIGNV-VQPKVKTTPLVPNVTH------------YNVNLQGISVGG 297
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
+ L + + D+ GT I + L +Y+ L+ A + P+ +
Sbjct: 298 ATLQLPTS--TFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDL----PLHNYQ 351
Query: 317 --ACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGG 371
CF+ D GF P I + ++ +++ + + Q D C+GF+DGG
Sbjct: 352 DFVCFQFSGSIDDGF-------PVITFSFEG-DLTLNVYPDDYLFQNRNDLYCMGFLDGG 403
Query: 372 VNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
V + +++G L N L+ +DL +G++D
Sbjct: 404 VQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIGWTD 438
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/381 (22%), Positives = 152/381 (39%), Gaps = 68/381 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y +++ P + + + G ++ W+ C+ VS++ T C S
Sbjct: 162 EYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVSTSYATVGCDSP 221
Query: 91 QCNLANAKAC--GGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
+C +A AC G C G G + T GD + L++ G +
Sbjct: 222 RCRDLDAAACRNSTGSCLYEVAYGDG-------SYTVGDFATETLTL--------GDSAP 266
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
V N CG + +GL G G+ LG ++ PSQ++A F+ CL
Sbjct: 267 VSNVAIGCGHDN--EGLFVGAAGLLALGGGPLSFPSQISAT-----TFSYCL-------- 311
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
V D P F S+ T I T + Y++ ++ + V G+A+ +
Sbjct: 312 VDRDSPSSSTLQFGDSEQPAVTAPLIRSPRT--------NTFYYVALSGISVGGEALSIP 363
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+ ++D+ G GG + + T L++ Y AL +AF + R + V+ F C+ L
Sbjct: 364 SSAFAMDDAGSGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLA 423
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRTSIVIG 381
R VP + L + + + +N ++ + CL F G P + +IG
Sbjct: 424 G----RSSVQVPAVALWFEGGGEL-KLPAKNYLIPVDAAGTYCLAFA-GTSGPVS--IIG 475
Query: 382 ARQLENNLLQFDLATSRLGFS 402
Q + + FD A + +GF+
Sbjct: 476 NVQQQGVRVSFDTAKNTVGFT 496
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 89/386 (23%), Positives = 149/386 (38%), Gaps = 75/386 (19%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
Y+ TP + G +I+W+ CE SS+ K C S
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKL 146
Query: 92 CNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCG 151
C+ +C + ++ + GD+ +D LS++ST G V+ P + CG
Sbjct: 147 CHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTS----GSPVSFPKTVIGCG 202
Query: 152 SEF--VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP------------- 196
++ G +SG+V GLG V+L +QL + S+ KF+ CL P
Sbjct: 203 TDNAGTFGGASSGIV---GLGGGPVSLITQLGS--SIGGKFSYCLVPLLNKESNASSILS 257
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
F D A+V GDG + TPL + + V YF+ + + V
Sbjct: 258 FGDAAVVSGDG-------------VVSTPL-----------IKKDPVFYFLTLQAFSVGN 293
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV-APVAPF 315
K V + D+E G I + T++ + +Y L A + K+ RV P F
Sbjct: 294 KRVEFGGSSEGGDDE--GNIIIDSGTTLTLIPSDVYTNLESAVVDLV-KLDRVDDPNQQF 350
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
C+ L+ + P I + ++ +H ++ V I +C F +P+
Sbjct: 351 SLCYSLKSNEYD-----FPIITAHFKGADI--ELHSISTFVPITDGIVCFAFQP---SPQ 400
Query: 376 TSIVIGARQLENNLLQFDLATSRLGF 401
+ G +N L+ +DL + F
Sbjct: 401 LGSIFGNLAQQNLLVGYDLQQKTVSF 426
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 60/257 (23%), Positives = 105/257 (40%), Gaps = 31/257 (12%)
Query: 156 LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNF 215
L A + GI G G S SQLAAA K+ F+ CL + G I F G
Sbjct: 187 LNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGI-FAIGEV------ 239
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
V +K TP+ N V + + + S++V G + L + G
Sbjct: 240 -VEPKVKTTPIVKNN-----------EVYHLVNLKSINVAGTTLQLPANIFGTTK--TKG 285
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFT-RIGPVVP 334
T I + + L IY L+ A + P + GA + Q F + P
Sbjct: 286 TFIDSGSTLVYLPEIIYSELILAVFAKHPDIT-------MGAMYNFQCFHFLGSVDDKFP 338
Query: 335 QIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP-RTSIVIGARQLENNLLQFD 393
+I +N ++ ++ + +++ G+ C GF D G++ + I++G + N ++ +D
Sbjct: 339 KITFHFEN-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYD 397
Query: 394 LATSRLGFSDSLLFERA 410
+ +G+++ E A
Sbjct: 398 MEKQAIGWTEHNSVEEA 414
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 87/399 (21%), Positives = 154/399 (38%), Gaps = 91/399 (22%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ TP + V G +I+W+ CE SS+ K C S
Sbjct: 86 EYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTPMFNPSKSSSYKNIPCPSK 145
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C +C N+ + GD+ +D L+++ST+ G V+ PN + C
Sbjct: 146 LCQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTN----GLTVSFPNIVIGC 201
Query: 151 GSEFVL--QGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP------------ 196
G+ +L +G +SG+V G G + +QL + S KF+ CL+P
Sbjct: 202 GTNNILSYEGASSGIV---GFGSGPASFITQLGS--STGGKFSYCLTPLFSVTNIQSNAT 256
Query: 197 ----FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
F D A V GDG + TP+ +P Y++ + +
Sbjct: 257 SKLNFGDAATVSGDG-------------VVTTPILKK----------DPETFYYLTLEAF 293
Query: 253 HVNGKAVPLNKTLLSIDNEG----VGGTKIS--TVNPYTVLETSIYKALVQAFASAMPKV 306
V + V + + + DNEG GT ++ T + Y+ LE+++ + K+
Sbjct: 294 SVGNRRVEIGG-VPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLV---------KL 343
Query: 307 ARV-APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
RV P C+ ++ G+ P I + + +V +H ++ V + CL
Sbjct: 344 ERVDDPTQTLNLCYSVKAEGYD-----FPIITMHFKGADV--DLHPISTFVSVADGVFCL 396
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
F + + + G +N ++ +DL + F S
Sbjct: 397 AF----ESSQDHAIFGNLAQQNLMVGYDLQQKIVSFKPS 431
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 93/409 (22%), Positives = 161/409 (39%), Gaps = 62/409 (15%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--- 74
LS + A+ S P A ++ + Y+ + K TP + L + + W+ C
Sbjct: 72 LSSLVARKSVVPIASGRQIVQSPT---YIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCV 128
Query: 75 -------KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVL 127
S+T KT C + QC CGG C + ++ ++ DV+
Sbjct: 129 GCSSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAF--NMTYGSSSIAANLSQDVV 186
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
++ +TD ++P++ F C +E G + G+ GLGR ++L SQ + +
Sbjct: 187 TL-ATD--------SIPSYTFGCLTEAT--GSSIPPQGLLGLGRGPMSLLSQTQNLY--Q 233
Query: 188 RKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFI 247
F+ CL F ++ F L K +K TPL N S Y++
Sbjct: 234 STFSYCLPSFR--SLNFSGS--LRLGPVGQPKRIKTTPLLKNP---------RRSSLYYV 280
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
+ ++ V + V + + L+ + GT + +T L Y A+ AF + A
Sbjct: 281 NLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGN-A 339
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVV-PQIDLVLQNKNVVWSIHGQNSMVQIGGDAL-CL 365
V + F C+ P+V P I + NV ++ N ++ ++ CL
Sbjct: 340 TVTSLGGFDTCY---------TSPIVAPTITFMFSGMNV--TLPPDNLLIHSTASSITCL 388
Query: 366 GFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
N + + VI Q +N+ + FD+ SRLG + R CT
Sbjct: 389 AMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVA------REPCT 431
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 98/393 (24%), Positives = 155/393 (39%), Gaps = 91/393 (23%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSA 90
YV + TP L G +I W CE S++ K C SA
Sbjct: 119 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 178
Query: 91 QCNLANA-----KACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
C L + ++C C G G + + G + L++ S++
Sbjct: 179 LCKLVASGKKFSQSCSSSTCLYQVQYGDG-------SYSIGFFATETLTLSSSN------ 225
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPF 197
NF+F CG + GL G G+ GLGR+K+ALPSQ A + K+ F+ CL S
Sbjct: 226 --VFKNFLFGCGQQ--NNGLFGGAAGLLGLGRTKLALPSQTAKTY--KKLFSYCLPASSS 279
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G + G VSK++K+TPL ++ F P Y + +T + V G+
Sbjct: 280 SKGYLSLGG---------QVSKSVKFTPL-------SADFDSTPF--YGLDITGLSVGGR 321
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+ ++++ S GT I + T L + Y L AF + M + + F
Sbjct: 322 KLSIDESAFS------AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDT 375
Query: 318 CF--------RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
C+ R+ +G T G V ID+ +++ ++G +CL F
Sbjct: 376 CYDFSKYDTVRIPKVGVTFKGGVEMDIDV----SGILYPVNGLK--------KVCLAFAG 423
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ TSI G Q + +D A R+GF+
Sbjct: 424 NDDDSDTSI-FGNVQQRTYQVVYDGAKGRVGFA 455
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 148/384 (38%), Gaps = 76/384 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y T++ P V + + G ++ W+ C + SS+ + C +
Sbjct: 147 EYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTP 206
Query: 91 QCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
QCN C C G G + T GD + L+I ST V
Sbjct: 207 QCNALEVSECRNATCLYEVSYGDG-------SYTVGDFATETLTIGST---------LVQ 250
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA--- 201
N CG +GL G G+ GLG +ALPSQL F+ CL D +
Sbjct: 251 NVAVGCGHSN--EGLFVGAAGLLGLGGGLLALPSQLNTT-----SFSYCLVDRDSDSAST 303
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
+ FG +S + PL N + Y++G+T + V G+ + +
Sbjct: 304 VDFGTS---------LSPDAVVAPLLRNH---------QLDTFYYLGLTGISVGGELLQI 345
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
++ +D G GG I + T L+T IY +L +F + + A VA F C+ L
Sbjct: 346 PQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNL 405
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDGGVNPRTS--I 378
+ VP + ++ ++ +N M+ + CL F P S
Sbjct: 406 S----AKTTVEVPTVAFHFPGGKML-ALPAKNYMIPVDSVGTFCLAFA-----PTASSLA 455
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
+IG Q + + FDLA S +GFS
Sbjct: 456 IIGNVQQQGTRVTFDLANSLIGFS 479
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 103/420 (24%), Positives = 162/420 (38%), Gaps = 75/420 (17%)
Query: 8 LLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTA--TLQYVTQIKQRTPLVPVKLTVHLG 65
L L SL L P + RP+ L V+ TA + +Y +++ P P + + G
Sbjct: 119 LALSSLNRSDLYPTETEL-LRPEDLSTPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTG 177
Query: 66 GNILWVDCE-------------KGYVSSTNKTARCGSAQCNLANAKACGGGIC------G 106
++ W+ C+ SS+ C + QC AC G C G
Sbjct: 178 SDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTCDAQQCQDLEMSACRNGKCLYQVSYG 237
Query: 107 AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGI 166
G + T G+ + +S A +V CG + +GL G G+
Sbjct: 238 DG-------SFTVGEYVTETVSFG---------AGSVNRVAIGCGHDN--EGLFVGSAGL 279
Query: 167 AGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPL 226
GLG ++L SQ+ A F+ CL D G + N+ ++ PL
Sbjct: 280 LGLGGGPLSLTSQIKAT-----SFSYCLVDRDSGK-----SSTLEFNSPRPGDSV-VAPL 328
Query: 227 FIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYT 285
N KVNT Y++ +T V V G+ V + ++D G GG + + T
Sbjct: 329 LKNQKVNTF----------YYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAIT 378
Query: 286 VLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNV 345
L T Y ++ AF + VA F C+ L + R VP + + +
Sbjct: 379 RLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVR----VPTVSFHF-SGDR 433
Query: 346 VWSIHGQNSMVQI-GGDALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
W++ +N ++ + G C F P TS +IG Q + + FDLA S +GFS
Sbjct: 434 AWALPAKNYLIPVDGAGTYCFAFA-----PTTSSMSIIGNVQQQGTRVSFDLANSLVGFS 488
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 91/395 (23%), Positives = 151/395 (38%), Gaps = 66/395 (16%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYVSSTNKTARCGSAQ 91
T T Y T+I+ +P + V G +ILWV+C G + GS
Sbjct: 79 TDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT 138
Query: 92 CNLANAKACGGGICGAGP------DNPISNTGTHGD-------IRIDVLSIQSTDGRNPG 138
+ C G P +P T+GD D + G N
Sbjct: 139 TVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSG-NGQ 197
Query: 139 RAVTVPNFIFLCGSEF--VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP 196
+ + F CG++ L + GI G G+S ++ SQLAAA +++ FA CL
Sbjct: 198 TTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDT 257
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
G I + + N V +K TPL N + Y + + + V G
Sbjct: 258 VRGGGI-------FAIGNV-VQPKVKTTPLVPNVTH------------YNVNLQGISVGG 297
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
+ L + + D+ GT I + L +Y+ L+ A + P+ +
Sbjct: 298 ATLQLPTS--TFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDL----PLHNYQ 351
Query: 317 --ACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGG 371
CF+ D GF P I + ++ +++ + + Q D C+GF+DGG
Sbjct: 352 DFVCFQFSGSIDDGF-------PVITFSFKG-DLTLNVYPDDYLFQNRNDLYCMGFLDGG 403
Query: 372 VNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
V + +++G L N L+ +DL +G++D
Sbjct: 404 VQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIGWTD 438
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 115/258 (44%), Gaps = 51/258 (19%)
Query: 160 ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLNNFD 216
ASGV+G+A + +L SQ A+ F K+KF+ C P ++ G+++FG+
Sbjct: 238 ASGVLGLAQ--GEQYSLISQTASKF--KKKFSYCF-PHNENTRGSLLFGE------KAIS 286
Query: 217 VSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGT 276
S +LK+T L +N +SG + YF+ + + V K + ++ +L + GT
Sbjct: 287 ASPSLKFTRL----LNPSSGSV------YFVELIGISVAKKRLNVSSSLFASP-----GT 331
Query: 277 KISTVNPYTVLETSIYKALVQAFASAMPKVARVAP---VAPFGACFRLQDIGFTRIG--- 330
I + T L T+ Y+AL AF M V+P P C+ L+ G I
Sbjct: 332 IIDSGTVITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDTCYNLKGCGGRNIKLPE 391
Query: 331 ---PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD--ALCLGFVDGGVNPRTSIVIGARQL 385
V ++D+ L ++W+ GD CL F +P +IG RQ
Sbjct: 392 IVLHFVGEVDVSLHPSGILWA----------NGDLTQACLAFARKS-HPSHVTIIGNRQQ 440
Query: 386 ENNLLQFDLATSRLGFSD 403
+ + +D+ RLGF +
Sbjct: 441 VSLKVVYDIEGGRLGFGN 458
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 152/369 (41%), Gaps = 81/369 (21%)
Query: 78 VSSTNKTARCGSAQCNLANAKACGGGICGAGPD------NPISNTGTHGDIRIDVLSIQS 131
SST C S +C +CG CG + + + G + DVL++
Sbjct: 126 ASSTASRISCTSPKC------SCGSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLALH- 178
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFA 191
DG PG + IF C + + G+ GLG S ++ +QL A + F+
Sbjct: 179 -DGL-PGAPI-----IFGCETRETGEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFS 231
Query: 192 LCLSPFD-DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVT 250
LC + DGA++ GD S +L+YTPL + + Y + +
Sbjct: 232 LCFGMVEGDGALLLGDA------EVPGSISLQYTPLLTSTTHP---------FYYNVKML 276
Query: 251 SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA--- 307
S+ V G+ +P++++L ++G GT + + +T + + ++K AFA A+ K A
Sbjct: 277 SLAVEGQLLPVSQSLF---DQGY-GTVLDSGTTFTYMPSPVFK----AFAGAVEKYALSH 328
Query: 308 ---RVAPVAP------FGACFRLQDIGFTRIGPVVPQID--------LVLQNKNVVWSIH 350
RV P FG D+ + V P ++ LVL N ++ +H
Sbjct: 329 GLKRVPGPDPQFDDICFGQAPSHDDL--EALSSVFPSMEVQFDQGTSLVLGPLNYLF-VH 385
Query: 351 GQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFE-- 408
NS CLG D G R ++G N L+++D A R+GF +L E
Sbjct: 386 TFNS------GKYCLGVFDNG---RAGTLLGGITFRNVLVRYDRANQRVGFGPALCKELG 436
Query: 409 ---RATCTF 414
R C+F
Sbjct: 437 EMQRPPCSF 445
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 87/363 (23%), Positives = 144/363 (39%), Gaps = 33/363 (9%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+++ ++ TP + G +++W C+ T + + + C
Sbjct: 96 EFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSD 155
Query: 104 ICGAGPDNPISNTGTHGDIRIDVLSIQ---STDGRNPGRAVTVPNFIFLCGSEFVLQGLA 160
+C A P + S+ + D S Q +T+ G A +V F CG + G +
Sbjct: 156 LCAALPISSCSDGCEYLYSYGDYSSTQGVLATETFAFGDA-SVSKIGFGCGEDNDGSGFS 214
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKN 220
G G+ GLGR ++L SQL + KF+ CL+ DD + + + KN
Sbjct: 215 QGA-GLVGLGRGPLSLISQLG-----EPKFSYCLTSMDDSKGI----SSLLVGSEATMKN 264
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
TPL N +PS Y++ + + V +P+ K+ SI N+G GG I +
Sbjct: 265 AITTPLIQNP--------SQPSF-YYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDS 315
Query: 281 VNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVL 340
T LE S + AL + F S + + CF L T VPQ LV
Sbjct: 316 GTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDASTV---DVPQ--LVF 370
Query: 341 QNKNVVWSIHGQNSMVQIGG-DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRL 399
+ + +N ++ G +CL G + SI G Q +N ++ DL +
Sbjct: 371 HFEGADLKLPAENYIIADSGLGVICLTM---GSSSGMSI-FGNFQQQNIVVLHDLEKETI 426
Query: 400 GFS 402
F+
Sbjct: 427 SFA 429
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 98/393 (24%), Positives = 155/393 (39%), Gaps = 91/393 (23%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSA 90
YV + TP L G +I W CE S++ K C SA
Sbjct: 131 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 190
Query: 91 QCNLANA-----KACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
C L + ++C C G G + + G + L++ S++
Sbjct: 191 LCKLVASGKKFSQSCSSSTCLYQVQYGDG-------SYSIGFFATETLTLSSSN------ 237
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPF 197
NF+F CG + GL G G+ GLGR+K+ALPSQ A + K+ F+ CL S
Sbjct: 238 --VFKNFLFGCGQQ--NNGLFGGAAGLLGLGRTKLALPSQTAKTY--KKLFSYCLPASSS 291
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G + G VSK++K+TPL ++ F P Y + +T + V G+
Sbjct: 292 SKGYLSLGG---------QVSKSVKFTPL-------SADFDSTPF--YGLDITGLSVGGR 333
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+ ++++ S GT I + T L + Y L AF + M + + F
Sbjct: 334 KLSIDESAFS------AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDT 387
Query: 318 CF--------RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
C+ R+ +G T G V ID+ +++ ++G +CL F
Sbjct: 388 CYDFSKYDTVRIPKVGVTFKGGVEMDIDV----SGILYPVNGLK--------KVCLAFAG 435
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ TSI G Q + +D A R+GF+
Sbjct: 436 NDDDSDTSI-FGNVQQRTYQVVYDGAKGRVGFA 467
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 82/375 (21%), Positives = 144/375 (38%), Gaps = 38/375 (10%)
Query: 43 LQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGG 102
L + + TP P L + G +++W C+ + A+ + A C G
Sbjct: 87 LHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDG 146
Query: 103 GICGAGPDNPISNTGTHGDIRIDVLSIQSTDGR--------NPGRAVTVPNFIFLCGSEF 154
+C G N N + I +T G R V+V + F CG
Sbjct: 147 RLCETGSFN-TKNCSRNKCIYTYNYGSATTKGELASETFTFGEHRRVSV-SLDFGCGK-- 202
Query: 155 VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD----GAIVFGDGPYY 210
+ G G GI G+ +++L SQL +F+ CL+PF D I FG
Sbjct: 203 LTSGSLPGASGILGISPDRLSLVSQLQIP-----RFSYCLTPFLDRNTTSHIFFG--AMA 255
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN 270
DL+ + + ++ T L N + + Y++ + + V K + + + +I
Sbjct: 256 DLSKYRTTGPIQTTSLVTNPDGS--------NYYYYVPLIGISVGTKRLNVPVSSFAIGR 307
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASA--MPKVARVAPVAPFGACFRLQDIGFTR 328
+G GGT + + + +L + + +AL +A A +P V + CF+L G
Sbjct: 308 DGSGGTFVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGA 367
Query: 329 IGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLEN 387
+ V LV + + + MV++ +CL G +IG Q +N
Sbjct: 368 VETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSAGRMCLVISSGA----RGAIIGNYQQQN 423
Query: 388 NLLQFDLATSRLGFS 402
+ FD+ F+
Sbjct: 424 MHVLFDVENHEFSFA 438
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/360 (23%), Positives = 130/360 (36%), Gaps = 74/360 (20%)
Query: 53 TPLVPVKLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSAQCNL---- 94
TP + L G ++ W CE S++ C S C
Sbjct: 153 TPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNITCTSTLCTQLSTA 212
Query: 95 --------ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
A+ KAC GI + G + LS+ +TD V NF
Sbjct: 213 TGNEPGCSASTKACIYGIQYGDSSFSV------GYFSRERLSVTATD--------IVDNF 258
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDGAIVF 204
+F CG QGL G G+ GLGR ++ Q AA + ++ F+ CL + G + F
Sbjct: 259 LFGCGQN--NQGLFGGSAGLIGLGRHPISFVQQTAAVY--RKIFSYCLPATSSSTGRLSF 314
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
G + +KYTP + ++ S F G + +T + V G +P++ +
Sbjct: 315 GT---------TTTSYVKYTPF--STISRGSSFYG-------LDITGISVGGAKLPVSSS 356
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
S GG I + T L + Y AL AF M K ++ C+ L
Sbjct: 357 TFS-----TGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLS-- 409
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQ 384
G+ +P+ID V + Q + +CL F G + +I +Q
Sbjct: 410 GYEVFS--IPKIDFSFAG-GVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQ 466
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 147/384 (38%), Gaps = 66/384 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---KGYV----------SSTNKTARCGSA 90
+Y+ TP + V + G +I+W+ C+ K Y S T KT C S
Sbjct: 88 EYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPSN 147
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C C + + + + GD+ ++ L++ ST+ G V P + C
Sbjct: 148 TCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTN----GSPVQFPGTVIGC 203
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP----------FDDG 200
G + G+ GI GLGR ++L +QL+ S KF+ CL P F +
Sbjct: 204 G-RYNAIGIEEKNSGIVGLGRGPMSLITQLSP--STGGKFSYCLVPGLSTASSKLNFGNA 260
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
A+V G G VS TPLF + +G V YF+ + + V +
Sbjct: 261 AVVSGRG--------TVS-----TPLF-----SKNGL-----VFYFLTLEAFSVGRNRIE 297
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
S + G G I + T L +Y L A A + P G C++
Sbjct: 298 FG----SPGSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYK 353
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVI 380
+ ++ VP I +V +++ N+ VQ+ D +C F T V
Sbjct: 354 VTP---DKLDASVPVITAHFSGADV--TLNAINTFVQVADDVVCFAFQP----TETGAVF 404
Query: 381 GARQLENNLLQFDLATSRLGFSDS 404
G +N L+ +DL + + F +
Sbjct: 405 GNLAQQNLLVGYDLQMNTVSFKHT 428
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 85/388 (21%), Positives = 151/388 (38%), Gaps = 80/388 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWV----------DCEKGYV----SSTNKTARCGS 89
++ +I P + + V G +++W+ +C+ + SST K C S
Sbjct: 97 DFLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLRFFDPMESSTYKNVPCDS 156
Query: 90 AQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
+C + NA C C D ++ GD+ +D L++ ST G++ +PN F+
Sbjct: 157 YRCQITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLNST----TGKSFMLPNTGFI 212
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP----------FDD 199
CG+ + G G VGI GLG ++L ++++ + KF+ C+ P F D
Sbjct: 213 CGNR--IGGDYPG-VGILGLGHGSLSLLNRISHL--IDGKFSHCIVPYSSNQTSKLSFGD 267
Query: 200 GAIVFGDGPY---YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
A+V G + D+ S L + + + + ++G +G + ++NG
Sbjct: 268 KAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIG----------SDYYMNG 317
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNP---YTVLETSIYKALVQAFASAMPKVARVAPVA 313
G+ + T P Y+ LE + A+ Q + P
Sbjct: 318 L--------------GMDSGTMFTYFPEYFYSQLEYDVRYAIQQ-------EPLYPDPTR 356
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN 373
C+R P P I + + +V + NS +++ D +CL F +
Sbjct: 357 RLRLCYRYS----PDFSP--PTITMHFEGGSV--ELSSSNSFIRMTEDIVCLAFATS--S 406
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGF 401
V G Q N L+ +DL L F
Sbjct: 407 SEQDAVFGYWQQTNLLIGYDLDAGFLSF 434
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 88/392 (22%), Positives = 156/392 (39%), Gaps = 67/392 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------KGYVSSTNKTAR 86
Y T++K +P + + G +ILWV C SST+
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNPGR 139
C +C A + +C ++P T T+GD D + S G N
Sbjct: 151 CSDDRCTAALQTS--EAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMG-NEQT 207
Query: 140 AVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
A + + +F C + L V GI G G+ ++++ SQL + + F+ CL
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGS 267
Query: 198 DDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
D+G +V G+ V L YTPL ++ + Y + + S+ VN
Sbjct: 268 DNGGGILVLGE---------IVEPGLVYTPLVPSQPH------------YNLNLESIVVN 306
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
G+ +P++ +L + N GT + + L Y V A +A+ R + V+
Sbjct: 307 GQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR-SLVSKG 363
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGFVDGG 371
CF + + + P + L V ++ +N ++Q I + L C+G+
Sbjct: 364 NQCF----VTSSSVDSSFPTVSLYFMG-GVAMTVKPENYLLQQASIDNNVLWCIGWQRNQ 418
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
T ++G L++ + +DLA R+G++D
Sbjct: 419 GQQIT--ILGDLVLKDKIFVYDLANMRMGWTD 448
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 92/390 (23%), Positives = 146/390 (37%), Gaps = 71/390 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE----------------------KGYVSSTN 82
+ T I TP VP + + +G ++LWV C+ +SST+
Sbjct: 103 HYTWIDLGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSANYYSVLDRDLSEYNPALSSTS 162
Query: 83 KTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
K CG C + C D NT T G + D L + S ++ ++
Sbjct: 163 KHLFCGHQLCAWSTTCKSANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSF-SKHGTHSLL 221
Query: 143 VPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
+ +F CG S L G A G+ GLG +++P+ LA ++ F+LC
Sbjct: 222 QASVVFGCGRKQSGSYLDGAAPD--GVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGS 279
Query: 200 GAIVFG-DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
G I+FG DGP + ++ PLF GE + YFIGV S V
Sbjct: 280 GRILFGDDGP-------ATQQTTQFLPLF-----------GEFAA-YFIGVESFCVGSSC 320
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR--VAPVAPFG 316
+ + +D + + +T L +YK +V F + A V P+
Sbjct: 321 LQRSGFQALVD----------SGSSFTYLPAEVYKKIVFEFDKQVKVNATRIVLRELPWN 370
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV--QIGGDALCLGFVDGGVNP 374
C+ + T + +P + LV + IH ++ G CL +
Sbjct: 371 YCYNIS----TLVSFNIPSMQLVFPLNQIF--IHDPVYVLPANQGYKVFCLTLEE---TD 421
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFSDS 404
VIG + + FD +LG+S S
Sbjct: 422 EDYGVIGQNLMVGYRMVFDRENLKLGWSKS 451
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/363 (23%), Positives = 145/363 (39%), Gaps = 61/363 (16%)
Query: 60 LTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQCNLANAKACGGGICG 106
+ + G ++ WV C+ +S++ C S +C + AC G
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNAT-G 59
Query: 107 AGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV 164
A G++ GD + L++ G + V N CG + +GL G
Sbjct: 60 ACLYEVAYGDGSYTVGDFATETLTL--------GDSTPVGNVAIGCGHDN--EGLFVGAA 109
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---IVFGDGPYYDLNNFDVSKNL 221
G+ LG ++ PSQ++A+ F+ CL D A + FGDG ++
Sbjct: 110 GLLALGGGPLSFPSQISAS-----TFSYCLVDRDSPAASTLQFGDG---------AAEAG 155
Query: 222 KYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID-NEGVGGTKIST 280
T + T S Y++ ++ + V G+ + + + ++D G GG + +
Sbjct: 156 TVTAPLVRSPRT--------STFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDS 207
Query: 281 VNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVL 340
T L+++ Y AL AF P + R + V+ F C+ L D R VP + L
Sbjct: 208 GTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSD----RTSVEVPAVSLRF 263
Query: 341 QNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRL 399
+ + + +N ++ + G CL F N SI IG Q + + FD A +
Sbjct: 264 EGGGAL-RLPAKNYLIPVDGAGTYCLAFAP--TNAAVSI-IGNVQQQGTRVSFDTARGAV 319
Query: 400 GFS 402
GF+
Sbjct: 320 GFT 322
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 98/393 (24%), Positives = 155/393 (39%), Gaps = 91/393 (23%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSA 90
YV + TP L G +I W CE S++ K C SA
Sbjct: 71 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSA 130
Query: 91 QCNLANA-----KACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
C L + ++C C G G + + G + L++ S++
Sbjct: 131 LCKLVASGKKFSQSCSSSTCLYQVQYGDG-------SYSIGFFATETLTLSSSN------ 177
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPF 197
NF+F CG + GL G G+ GLGR+K+ALPSQ A + K+ F+ CL S
Sbjct: 178 --VFKNFLFGCGQQ--NNGLFGGAAGLLGLGRTKLALPSQTAKTY--KKLFSYCLPASSS 231
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G + G VSK++K+TPL ++ F P Y + +T + V G+
Sbjct: 232 SKGYLSLGG---------QVSKSVKFTPL-------SADFDSTPF--YGLDITGLSVGGR 273
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+ ++++ S GT I + T L + Y L AF + M + + F
Sbjct: 274 QLSIDESAFS------AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDT 327
Query: 318 CF--------RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
C+ R+ +G T G V ID+ +++ ++G +CL F
Sbjct: 328 CYDFSKYDTVRIPKVGVTFKGGVEMDIDV----SGILYPVNGLK--------KVCLAFAG 375
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ TSI G Q + +D A R+GF+
Sbjct: 376 NDDDSDTSI-FGNVQQRTYQVVYDGAKGRVGFA 407
>gi|357131275|ref|XP_003567264.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like, partial [Brachypodium distachyon]
Length = 364
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 109/264 (41%), Gaps = 37/264 (14%)
Query: 157 QGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNF 215
G+AS G+ G+ R ++ SQ R+F+ C+S DD G ++ G + DL NF
Sbjct: 105 DGVAS--AGLLGMNRGALSFVSQAG-----TRRFSYCISDRDDAGVLLLG---HSDLPNF 154
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
L YTPL+ + + V Y + + + V K +P+ ++L+ D+ G G
Sbjct: 155 ---LPLNYTPLYQPSLPLP--YFDR--VAYSVQLLGILVGSKPLPIPASVLAPDHTGAGQ 207
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARV------APVAPFGACFRLQDIGFTRI 329
T + + +T L Y AL F R A F CFR+
Sbjct: 208 TMVDSGTQFTFLLGDAYAALKAEFYRQSTPFLRALDEPSFAFQGAFDTCFRVPRGMSPPP 267
Query: 330 GPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD-----------ALCLGFVDGGVNPRTSI 378
G ++P + L +V + G + ++ G+ CL F + + P +
Sbjct: 268 GRLLPSVTLRFNGAEMV--VGGDRLLYKVPGERRGGAGADDDAVWCLTFGNADMVPIMAY 325
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
VIG N +++DL R+G +
Sbjct: 326 VIGHHHQMNLWVEYDLERGRVGLA 349
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 60/263 (22%), Positives = 105/263 (39%), Gaps = 31/263 (11%)
Query: 156 LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNF 215
L A + GI G G S SQLAAA K+ F+ CL + G I F G
Sbjct: 187 LNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGI-FAIGEV------ 239
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
V +K TP+ N V + + + S++V G + L + G
Sbjct: 240 -VEPKVKTTPIVKNN-----------EVYHLVNLKSINVAGTTLQLPANIFGTTK--TKG 285
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFT-RIGPVVP 334
T I + + L IY L+ A + P + GA + Q F + P
Sbjct: 286 TFIDSGSTLVYLPEIIYSELILAVFAKHPDIT-------MGAMYNFQCFHFLGSVDDKFP 338
Query: 335 QIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN-PRTSIVIGARQLENNLLQFD 393
+I +N ++ ++ + +++ G+ C GF D G++ + I++G + N ++ +D
Sbjct: 339 KITFHFEN-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYD 397
Query: 394 LATSRLGFSDSLLFERATCTFNF 416
+ +G+++ R F
Sbjct: 398 MEKQAIGWTEHNSMARIVLRLQF 420
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 93/419 (22%), Positives = 152/419 (36%), Gaps = 67/419 (15%)
Query: 13 LLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVD 72
LL S TA S P VA A YV + +P + L + + W
Sbjct: 53 LLFLSSKAATAGVSSAP------VASGQAPPSYVVRAGLGSPSQQLLLALDTSADATWAH 106
Query: 73 CEKGYV-----------SSTNKTARCGSAQCNLANAKAC----GGGICGAGP-------- 109
C SS+ + C S+ C L +AC GGG P
Sbjct: 107 CSPCGTCPSSSLFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAF 166
Query: 110 DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGL 169
P ++ + D L + G++ +PN+ F C S G+ GL
Sbjct: 167 SKPFADASFQAALASDTLRL----GKD-----AIPNYTFGCVSSVTGPTTNMPRQGLLGL 217
Query: 170 GRSKVALPSQLAAAFSLKRKFALCL----SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTP 225
GR +AL SQ + + F+ CL S + G++ G G ++++YTP
Sbjct: 218 GRGPMALLSQAGSLY--NGVFSYCLPSYRSYYFSGSLRLGAG-------GGQPRSVRYTP 268
Query: 226 LFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYT 285
+ N S Y++ VT + V V + + D GT + + T
Sbjct: 269 MLRNPHR---------SSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVIT 319
Query: 286 VLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNV 345
+Y AL + F + + + F CF ++ V +D V
Sbjct: 320 RWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMD-----GGV 374
Query: 346 VWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
++ +N+++ L CL + N + + VI Q +N + FD+A SR+GF+
Sbjct: 375 DLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFA 433
>gi|302768809|ref|XP_002967824.1| hypothetical protein SELMODRAFT_408674 [Selaginella moellendorffii]
gi|300164562|gb|EFJ31171.1| hypothetical protein SELMODRAFT_408674 [Selaginella moellendorffii]
Length = 408
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 106/251 (42%), Gaps = 45/251 (17%)
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFGD-GPYYDLNNFDVS 218
V+G+A G S +LP Q++ + L +F CL+ G + G GPY +N D+
Sbjct: 117 VIGLAASGSS--SLPFQVSRSAKLAHRFTYCLASSSGRGLGELYIGQQGPYRVFHNTDIL 174
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL-NKTLLSIDNEGVGGTK 277
+ L+ ++SG S H+ +V L +KT ++I +G
Sbjct: 175 NSTSLPMLYFPLTVSSSG--------------SYHLKLDSVSLGSKTTVTITMVEIG--- 217
Query: 278 ISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA---CFRL---QDIGFTRIGP 331
T YT L + Y+ L F + + + FG C+++ Q F+ +
Sbjct: 218 --TSFRYTRLPQAAYQMLRDGFLREVGEKKLGRDSSSFGELDLCYKMSVEQRTTFSNVTM 275
Query: 332 VVPQIDLVLQNKNVVWSIHGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGARQLENNL 389
VV I W + G N +V G + C FV G + R+ VIG Q ENN
Sbjct: 276 VVSGIQ---------WMVSGDNYLVTKPGIRNVACFAFVSAGKDGRS--VIGTAQQENNF 324
Query: 390 LQFDLATSRLG 400
++FD+ +LG
Sbjct: 325 VEFDVDAKKLG 335
>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 480
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 89/427 (20%), Positives = 153/427 (35%), Gaps = 110/427 (25%)
Query: 57 PVKLTVHLGGNILWVDC-------------EKGYVSSTNKT----ARCGSAQCNLANAKA 99
P+ L + G +++W C E TN T C S C+ A+ A
Sbjct: 84 PITLYMDTGSDLVWFPCAPFKCILCEGKPNEPNASPPTNITQSVAVSCKSPACSAAHNLA 143
Query: 100 CGGGICGAG--PDNPISNTG-----------THGD------IRIDVLSIQSTDGRNPGRA 140
+C A P I + +GD + D LS+ S
Sbjct: 144 PPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIARLYRDTLSLSS--------- 194
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA-AFSLKRKFALCLSPFDD 199
+ + NF F C + + G+AG GR ++LP+QLA + L +F+ CL
Sbjct: 195 LFLRNFTFGCAHTTLAE-----PTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVS--- 246
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTA--------------SGFLGEPSVEY 245
++FD + K +PL + + + L P Y
Sbjct: 247 -------------HSFDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPY 293
Query: 246 FIGVT--SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
F V+ + V + +P + L ++N G GG + + +T+L Y ++V F +
Sbjct: 294 FYTVSLIGIAVGKRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRV 353
Query: 304 ----PKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL----------VLQNKNVVWSI 349
+ ++ C+ L + VP + L VL KN +
Sbjct: 354 GRDNKRARKIEEKTGLAPCYYLNSVA------DVPALTLRFAGGKNSSVVLPRKNYFYEF 407
Query: 350 HGQNSMVQIGGDALCLGFVDGGVNPRTS----IVIGARQLENNLLQFDLATSRLGFSD-- 403
+ + CL ++GG S +G Q + +++DL R+GF+
Sbjct: 408 SDGSDGAKGKRKVGCLMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQ 467
Query: 404 -SLLFER 409
+LL+ER
Sbjct: 468 CALLWER 474
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 93/419 (22%), Positives = 152/419 (36%), Gaps = 67/419 (15%)
Query: 13 LLIFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVD 72
LL S TA S P VA A YV + +P + L + + W
Sbjct: 55 LLFLSSKAATAGVSSAP------VASGQAPPSYVVRAGLGSPSQQLLLALDTSADATWAH 108
Query: 73 CEKGYV-----------SSTNKTARCGSAQCNLANAKAC----GGGICGAGP-------- 109
C SS+ + C S+ C L +AC GGG P
Sbjct: 109 CSPCGTCPSSSLFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAF 168
Query: 110 DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGL 169
P ++ + D L + G++ +PN+ F C S G+ GL
Sbjct: 169 SKPFADASFQAALASDTLRL----GKD-----AIPNYTFGCVSSVTGPTTNMPRQGLLGL 219
Query: 170 GRSKVALPSQLAAAFSLKRKFALCL----SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTP 225
GR +AL SQ + + F+ CL S + G++ G G ++++YTP
Sbjct: 220 GRGPMALLSQAGSLY--NGVFSYCLPSYRSYYFSGSLRLGAG-------GGQPRSVRYTP 270
Query: 226 LFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYT 285
+ N S Y++ VT + V V + + D GT + + T
Sbjct: 271 MLRNPHR---------SSLYYVNVTGLSVGRAWVKVPAGSFAFDAATGAGTVVDSGTVIT 321
Query: 286 VLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNV 345
+Y AL + F + + + F CF ++ V +D V
Sbjct: 322 RWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMD-----GGV 376
Query: 346 VWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
++ +N+++ L CL + N + + VI Q +N + FD+A SR+GF+
Sbjct: 377 DLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRIGFA 435
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 91/391 (23%), Positives = 155/391 (39%), Gaps = 74/391 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
+Y+ + TP + G +++W C + S + C S
Sbjct: 88 EYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSP 147
Query: 91 QCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
CN C +C G + T G + + + + D R VTVP
Sbjct: 148 MCNALYYPLCYRNVCVYQYFYG-------DSANTAGVLSNETFTFGTNDTR-----VTVP 195
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GA 201
F CG+ + G G+ G GR ++L SQL + +F+ CL+ F
Sbjct: 196 RIAFGCGN--LNAGSLFNGSGMVGFGRGPLSLVSQLGSP-----RFSYCLTSFMSPVPSR 248
Query: 202 IVFGDGPYYDLNNFDVS--KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
+ FG Y LN+ S + ++ TP +N G P++ Y++ +T + V G+ +
Sbjct: 249 LYFGA--YATLNSTSASTGEPVQSTPFIVNP--------GLPTM-YYLNMTGISVGGELL 297
Query: 260 PLNKTLLSI-DNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKVARVAPVAPFG 316
P++ ++ +I D +G GG I + + T L + Y + QAFA +P +
Sbjct: 298 PIDPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADVLD 357
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA--LCLGFV---DGG 371
CF R +P++ + N+ + +N M+ I GD LCL DG
Sbjct: 358 TCFVWPPP--PRKIVTMPELAFHFEGANMELPL--ENYML-IDGDTGNLCLAIAASDDGS 412
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+IG+ Q +N + +D S L F+
Sbjct: 413 -------IIGSFQHQNFHVLYDNENSLLSFT 436
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 115/279 (41%), Gaps = 46/279 (16%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSAQCNLANA-- 97
TP P+ T+ + WV C +S+++ CGS C+ +A
Sbjct: 7 TPPQPLNFTLAVDSGFSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPSCSAFSAVS 66
Query: 98 KACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEF--V 155
+CG + + +N + GD+ D+ ++ S R N CG + +
Sbjct: 67 TSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRK-----VAANLSLGCGRDSGGL 121
Query: 156 LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--GAIVFGDGPYYDLN 213
L+ L + G G + V+ QL+A + KF CL P D G +V G+ Y L
Sbjct: 122 LELLDTS--GFVGFDKGNVSFMGQLSA-LGYRSKFIYCL-PSDTFRGKLVIGN---YKLR 174
Query: 214 NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV--NGKAVPLNKTLLSIDNE 271
N +S ++ YTP+ N + + YFI ++++ + N VP+ L +
Sbjct: 175 NASISSSMAYTPMITNP---------QAAELYFINLSTISIDKNKFQVPIQGFL----SN 221
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA 310
G GGT I T + L + Y LVQA + + V+
Sbjct: 222 GTGGTVIDTTTFLSYLTSDFYTQLVQAIKNYTTNLVEVS 260
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 98/389 (25%), Positives = 163/389 (41%), Gaps = 79/389 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVD---CEKGYV----------SSTNKTARCGSA 90
+Y T++ TP V + + G +I+W+ C + Y S T T C S
Sbjct: 141 EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSP 200
Query: 91 QCNLANAKACGG--GIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C ++ C C G G + T GD + L+ + RN + V
Sbjct: 201 HCRRLDSAGCNTRRKTCLYQVSYGDG-------SFTVGDFSTETLTFR----RNRVKGVA 249
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFD 198
+ CG + +GL G G+ GLG+ K++ P Q F+ +KF+ CL +
Sbjct: 250 LG-----CGHD--NEGLFVGAAGLLGLGKGKLSFPGQTGHRFN--QKFSYCLVDRSASSK 300
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGK 257
++VFG N VS+ ++TPL N K++T Y++ + + V G
Sbjct: 301 PSSVVFG--------NAAVSRIARFTPLLSNPKLDTF----------YYVELLGISVGGT 342
Query: 258 AVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
VP + +L +D G GG I + T L Y A+ AF + R + F
Sbjct: 343 RVPGVAASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFD 402
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVD--GGVN 373
CF L ++ + VP + L + +V S+ N ++ + + C F GG++
Sbjct: 403 TCFDLSNMNEVK----VPTVVLHFRGADV--SLPATNYLIPVDTNGKFCFAFAGTMGGLS 456
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFS 402
+IG Q + + +DLA+SR+GF+
Sbjct: 457 -----IIGNIQQQGFRVVYDLASSRVGFA 480
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 103/250 (41%), Gaps = 31/250 (12%)
Query: 156 LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNF 215
L A + GI G G S SQLAAA K+ F+ CL + G I F G
Sbjct: 211 LNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGI-FAIGEV------ 263
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
V +K TP+ N V + + + S++V G + L + G
Sbjct: 264 -VEPKVKTTPIVKNN-----------EVYHLVNLKSINVAGTTLQLPANIFGTTK--TKG 309
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFT-RIGPVVP 334
T I + + L IY L+ A + P + GA + Q F + P
Sbjct: 310 TFIDSGSTLVYLPEIIYSELILAVFAKHPDIT-------MGAMYNFQCFHFLGSVDDKFP 362
Query: 335 QIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN-PRTSIVIGARQLENNLLQFD 393
+I +N ++ ++ + +++ G+ C GF D G++ + I++G + N ++ +D
Sbjct: 363 KITFHFEN-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYD 421
Query: 394 LATSRLGFSD 403
+ +G+++
Sbjct: 422 MEKQAIGWTE 431
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 87/347 (25%), Positives = 130/347 (37%), Gaps = 73/347 (21%)
Query: 53 TPLVPVKLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSAQCNLANAK 98
TP + L G ++ W CE S++ C SA C +
Sbjct: 154 TPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNITCTSALCTQLSTA 213
Query: 99 ACGGGICGAGPDNPISNTGTHGDIRIDV-------LSIQSTDGRNPGRAVTVPNFIFLCG 151
C A I +GD V L++ +TD V NF+F CG
Sbjct: 214 TGNDPGCSASTKACIYGI-QYGDSSFSVGYFSRERLTVTATD--------VVDNFLFGCG 264
Query: 152 SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDGAIVFGDGPY 209
QGL G G+ GLGR ++ Q AA + ++ F+ CL + G + FG
Sbjct: 265 QN--NQGLFGGSAGLIGLGRHPISFVQQTAAKY--RKIFSYCLPSTSSSTGHLSFGPAA- 319
Query: 210 YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID 269
+ LKYTP + ++ S F G + +T++ V G +P++ + S
Sbjct: 320 -------TGRYLKYTPF--STISRGSSFYG-------LDITAIAVGGVKLPVSSSTFS-- 361
Query: 270 NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ------- 322
GG I + T L + Y AL AF M K ++ C+ L
Sbjct: 362 ---TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSI 418
Query: 323 -DIGFTRIGPVV----PQ-IDLVLQNKNV--VWSIHGQNSMVQIGGD 361
I F+ G V PQ I V K V ++ +G +S V I G+
Sbjct: 419 PTIEFSFAGGVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGN 465
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 97/393 (24%), Positives = 158/393 (40%), Gaps = 72/393 (18%)
Query: 34 LRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVD---CEKGY----------VSS 80
LR + +Y + TP V + G ++LW+ C+ Y SS
Sbjct: 70 LRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSS 129
Query: 81 TNKTARCGSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
T ++ CGS+ C + C C G G + T G+ + LS G
Sbjct: 130 TFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDG-------SFTVGEFSTETLSF----G 178
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
N +V + CG QGL +G G+ GLG+ ++ PSQ+ + F+ CL
Sbjct: 179 SNAVNSVAIG-----CGHNN--QGLFTGAAGLLGLGKGLLSFPSQVGQLYG--SVFSYCL 229
Query: 195 SPFDDGA---IVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVT 250
+ ++FG N V+ N ++T L N K++T Y++ +
Sbjct: 230 PTRESTGSVPLIFG--------NQAVASNAQFTTLLTNPKLDTF----------YYVEMV 271
Query: 251 SVHVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
+ V G +V + LS+D+ G GG + + T L TS Y + AF + MP A++
Sbjct: 272 GIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKM 331
Query: 310 AP-VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGF 367
+ F C+ L R ++P + V N ++ QN MV + CL F
Sbjct: 332 TSGFSLFDTCYDLSG----RSSIMLPAVSFVF-NGGATMALPAQNIMVPVDNSGTYCLAF 386
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLG 400
N +IG Q ++ + FD +R+G
Sbjct: 387 AP---NSENFSIIGNIQQQSFRMSFDSTGNRVG 416
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 93/410 (22%), Positives = 153/410 (37%), Gaps = 77/410 (18%)
Query: 33 VLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------KGYVSSTNKT 84
+ + D L Y T I TP V + + G ++LWV C+ Y SS ++
Sbjct: 102 TMSLGDDFGWLHY-TWIDIGTPHVSFLVALDAGSDLLWVPCDCLQCAPLSASYYSSLDRD 160
Query: 85 ARCGS-AQCNLANAKACGGGICGAGP-------------DNPISNTGTHGDIRIDVLSIQ 130
S + + + +C +C GP D NT + G + D+L +
Sbjct: 161 LNEYSPSHSSTSKHLSCSHQLCELGPNCNSPKQPCPYSMDYYTENTSSSGLLVEDILHLA 220
Query: 131 STDGRNPGRAVTVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
S +V P + CG S L G+A G+ GLG +++++PS LA A ++
Sbjct: 221 SNGDNALSYSVRAP-VVIGCGMKQSGGYLDGVAPD--GLMGLGLAEISVPSFLAKAGLIR 277
Query: 188 RKFALCLSPFDDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
F++C D G I FGD GP + TP N + Y
Sbjct: 278 NSFSMCFDEDDSGRIFFGDQGP----------TTQQSTPFLTLDGNYTT---------YV 318
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
+GV V + +D T +T L +Y+ + + F +
Sbjct: 319 VGVEGFCVGSSCLKQTSFRALVD----------TGTSFTFLPNGVYERITEEFDRQVNAT 368
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLV--LQN----KNVVWSIHGQNSMVQIGG 360
P+ C++ T+ VP + L+ L N N V+ I+G + I G
Sbjct: 369 ISSFNGYPWKYCYKSSSNHLTK----VPSVKLIFPLNNSFVIHNPVFMIYG---IQGITG 421
Query: 361 DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERA 410
L + +G + IG + + FD +LG+S S +R+
Sbjct: 422 FCLAIQPTEGDIG-----TIGQNFMAGYRVVFDRENMKLGWSHSSCEDRS 466
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 87/392 (22%), Positives = 156/392 (39%), Gaps = 67/392 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------KGYVSSTNKTAR 86
Y T++K +P + + G +ILWV C SST+
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNPGR 139
C +C A + +C ++P T T+GD D + + G N
Sbjct: 177 CSDDRCTAALQTS--EAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMG-NEQT 233
Query: 140 AVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
A + + +F C + L V GI G G+ ++++ SQL + + F+ CL
Sbjct: 234 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGS 293
Query: 198 DDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
D+G +V G+ V L YTPL ++ + Y + + S+ VN
Sbjct: 294 DNGGGILVLGE---------IVEPGLVYTPLVPSQPH------------YNLNLESIVVN 332
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
G+ +P++ +L + N GT + + L Y V A +A+ R + V+
Sbjct: 333 GQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR-SLVSKG 389
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGFVDGG 371
CF + + + P + L V ++ +N ++Q I + L C+G+
Sbjct: 390 NQCF----VTSSSVDSSFPTVSLYFMG-GVAMTVKPENYLLQQASIDNNVLWCIGWQRNQ 444
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
T ++G L++ + +DLA R+G++D
Sbjct: 445 GQQIT--ILGDLVLKDKIFVYDLANMRMGWTD 474
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 155/381 (40%), Gaps = 63/381 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVD---CEKGYV----------SSTNKTARCGSA 90
+Y T+I TP V + + G +++W+ C K Y S T CG+
Sbjct: 117 EYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAP 176
Query: 91 QCNLANAKACG--GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
C ++ C +C + T GD + L+ + RN V
Sbjct: 177 LCRRLDSPGCSNKNKVCQYQVSYG-DGSFTFGDFSTETLTFR----RN-----RVTRVAL 226
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFDDGAIVF 204
CG + +GL +G G+ GLGR +++ P Q F+ KF+ CL + +++F
Sbjct: 227 GCGHD--NEGLFTGAAGLLGLGRGRLSFPVQTGRRFN--HKFSYCLVDRSASAKPSSVIF 282
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKAVP-LN 262
GD VS+ +TPL N K++T Y++ + + V G V L+
Sbjct: 283 GDSA--------VSRTAHFTPLIKNPKLDTF----------YYLELLGISVGGAPVRGLS 324
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+L +D G GG I + T L Y AL AF + R + F CF L
Sbjct: 325 ASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSLFDTCFDLS 384
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNPRTSIVIG 381
G T + VP + L + +V S+ N ++ + + C F G SI IG
Sbjct: 385 --GLTEVK--VPTVVLHFRGADV--SLPATNYLIPVDNSGSFCFAF--AGTMSGLSI-IG 435
Query: 382 ARQLENNLLQFDLATSRLGFS 402
Q + + +DL SR+GF+
Sbjct: 436 NIQQQGFRISYDLTGSRVGFA 456
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 87/392 (22%), Positives = 156/392 (39%), Gaps = 67/392 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------KGYVSSTNKTAR 86
Y T++K +P + + G +ILWV C SST+
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNPGR 139
C +C A + +C ++P T T+GD D + + G N
Sbjct: 151 CSDDRCTAALQTS--EAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMG-NEQT 207
Query: 140 AVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
A + + +F C + L V GI G G+ ++++ SQL + + F+ CL
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGS 267
Query: 198 DDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
D+G +V G+ V L YTPL ++ + Y + + S+ VN
Sbjct: 268 DNGGGILVLGE---------IVEPGLVYTPLVPSQPH------------YNLNLESIVVN 306
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
G+ +P++ +L + N GT + + L Y V A +A+ R + V+
Sbjct: 307 GQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR-SLVSKG 363
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGFVDGG 371
CF + + + P + L V ++ +N ++Q I + L C+G+
Sbjct: 364 NQCF----VTSSSVDSSFPTVSLYFMG-GVAMTVKPENYLLQQASIDNNVLWCIGWQRNQ 418
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
T ++G L++ + +DLA R+G++D
Sbjct: 419 GQQIT--ILGDLVLKDKIFVYDLANMRMGWTD 448
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 89/373 (23%), Positives = 158/373 (42%), Gaps = 58/373 (15%)
Query: 60 LTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSAQCNLANAKACGGGICG 106
L + G ++ W+ C + G V S++ K C +A C+L C
Sbjct: 102 LIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSK 161
Query: 107 AGPDNPI------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLA 160
P ++ T GD+ ++ LS+ +D +P ++ + + + CG +GL
Sbjct: 162 TSPKTCKYFYWYGDSSRTSGDLALESLSVSLSD--HPS-SLEIRDMVIGCGHS--NKGLF 216
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-----GAIVFGDGPYYDLNNF 215
G G+ GLG+ ++ PSQL ++ + + F+ CL + AI FG G + +F
Sbjct: 217 QGAGGLLGLGQGALSFPSQLRSS-PIGQSFSYCLVDRTNNLSVSSAISFGAG-FALSRHF 274
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
D +K+TP F+ N+ F Y++G+ + ++ + +P+ +I G GG
Sbjct: 275 D---QMKFTP-FVRTNNSVETF-------YYLGIQGIKIDQELLPIPAERFAIATNGSGG 323
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQ 335
T I + T L Y+A+ AF + + R P G C+ R P
Sbjct: 324 TIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYN----ATGRAAVPFPA 378
Query: 336 IDLVLQNKNVVWSIHGQNSMVQIGGDAL--CLGFV--DGGVNPRTSIVIGARQLENNLLQ 391
+ +V QN + +N +Q CL + DG +IG Q +N
Sbjct: 379 LSIVFQN-GAELDLPQENYFIQPDPQEAKHCLAILPTDGMS------IIGNFQQQNIHFL 431
Query: 392 FDLATSRLGFSDS 404
+D+ +RLGF+++
Sbjct: 432 YDVQHARLGFANT 444
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 116/291 (39%), Gaps = 33/291 (11%)
Query: 118 THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALP 177
T G I D + S +G V V F CG ++ S GIAG GR +LP
Sbjct: 45 TAGHIFKDTFTFMSPNGV----PVAVSELAFGCG-DYNTGLFVSNESGIAGFGRGPQSLP 99
Query: 178 SQLAAAFSLKRKFALCLSPFDDG---AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTA 234
SQL +F+ CL+ + ++ G P D + + TP+ N +
Sbjct: 100 SQLKVG-----RFSYCLTLVTESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPT 154
Query: 235 SGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKA 294
Y++ + + V +P +K++ ++ +G GGT I + T L ++++
Sbjct: 155 F---------YYLSLEGITVGKTRLPFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFEL 205
Query: 295 LVQAFASAMPKVARVAPVAPFGA--CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQ 352
L + + P + R G CFR + G VP L+L +
Sbjct: 206 LQEELVAQFP-LPRYDNTPEVGDRLCFRR-----PKGGKQVPVPKLILHLAGADMDLPRD 259
Query: 353 NSMV-QIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
N V + +CL G T ++IG Q +N + +D+ ++L F+
Sbjct: 260 NYFVEEPDSGVMCLQI--NGAEDTTMVLIGNFQQQNMHVVYDVENNKLLFA 308
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 91/368 (24%), Positives = 147/368 (39%), Gaps = 58/368 (15%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGI----CGA- 107
TP PVKL + G ++W + +N + C G+ CG+
Sbjct: 3 TPPNPVKLKLENGNELIW--------NHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSP 54
Query: 108 --GPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVG 165
P+ T ++GD + ++ G +VP F CG F S G
Sbjct: 55 KFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL-FNNGVFKSNETG 113
Query: 166 IAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI---VFGDGPYYDLNNFDVSKNLK 222
IAG GR ++LPSQL F+ C + GAI V D P +N + ++
Sbjct: 114 IAGFGRGPLSLPSQLKVG-----NFSHCFTTI-TGAIPSTVLLDLPADLFSNGQGA--VQ 165
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
TPL N A+ L Y++ + + V +P+ ++ ++ N G GGT I +
Sbjct: 166 TTPLIQYAKNEANPTL------YYLSLKGITVGSTRLPVPESAFALTN-GTGGTIIDSGT 218
Query: 283 PYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA-----CFRLQDIGFTRIGPVVPQID 337
T L +Y+ + FA+ + PV P A CF ++ P VP+
Sbjct: 219 SITSLPPQVYQVVRDEFAAQI-----KLPVVPGNATGHYTCFSAP----SQAKPDVPK-- 267
Query: 338 LVLQNKNVVWSIHGQNSMVQIGGDA----LCLGFVDGGVNPRTSIVIGARQLENNLLQFD 393
LVL + + +N + ++ DA +CL G + +IG Q +N + +D
Sbjct: 268 LVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKG----DETTIIGNFQQQNMHVLYD 323
Query: 394 LATSRLGF 401
L + L F
Sbjct: 324 LQNNMLSF 331
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 103/250 (41%), Gaps = 31/250 (12%)
Query: 156 LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNF 215
L A + GI G G S SQLAAA K+ F+ CL + G I F G
Sbjct: 211 LNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGI-FAIGEV------ 263
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
V +K TP+ N V + + + S++V G + L + G
Sbjct: 264 -VEPKVKTTPIVKNN-----------EVYHLVNLKSINVAGTTLQLPANIFGTTK--TKG 309
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFT-RIGPVVP 334
T I + + L IY L+ A + P + GA + Q F + P
Sbjct: 310 TFIDSGSTLVYLPEIIYSELILAVFAKHPDIT-------MGAMYNFQCFHFLGSVDDKFP 362
Query: 335 QIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN-PRTSIVIGARQLENNLLQFD 393
+I +N ++ ++ + +++ G+ C GF D G++ + I++G + N ++ +D
Sbjct: 363 KITFHFEN-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYD 421
Query: 394 LATSRLGFSD 403
+ +G+++
Sbjct: 422 MEKQAIGWTE 431
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 78/283 (27%), Positives = 119/283 (42%), Gaps = 46/283 (16%)
Query: 137 PGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP 196
PG+ TVP F+ C + Q GIAG GR + +LPSQ+ +LKR F+ CL
Sbjct: 207 PGK--TVPQFLVGCSILSIRQ-----PSGIAGFGRGQESLPSQM----NLKR-FSYCLV- 253
Query: 197 FDDGAIVFGDGPYY-DL-----NNFDVSKN-LKYTPLFINKVNTASGFLGEPSVEYFIGV 249
+ F D P DL + D N L YTP N N + E Y++ +
Sbjct: 254 ----SHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNS--VFRE---YYYVTL 304
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM-PKVAR 308
+ V G V + L ++G GGT + + + +T +E +Y + Q F + K +R
Sbjct: 305 RKLIVGGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSR 364
Query: 309 ---VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
V + CF + + + + Q K + GDA L
Sbjct: 365 EENVEAQSGLSPCFNISGVK------TISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVL 418
Query: 366 GFV---DGGV-NPRT---SIVIGARQLENNLLQFDLATSRLGF 401
F DGG P+T +I++G Q +N +++DL R GF
Sbjct: 419 CFTVVSDGGAGQPKTAGPAIILGNYQQQNFYVEYDLENERFGF 461
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 72/286 (25%), Positives = 113/286 (39%), Gaps = 43/286 (15%)
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
R VP F F C + +GIAG GR ++LPSQL L++ F+ C PF
Sbjct: 137 ARTRDVPRFSFGC-----VTSTYHEPIGIAGFGRGLLSLPSQLGF---LEKGFSHCFLPF 188
Query: 198 D-------DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVT 250
++ G + +++ +L++TP+ V S ++G S+ +T
Sbjct: 189 KFVNNPNISSPLILGASAL----SINLTDSLQFTPMLNTPVYPNSYYIGLESITIGTNIT 244
Query: 251 SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--PKVAR 308
VPL TL D++G GG + + YT L Y L+ S + P+
Sbjct: 245 PTQ-----VPL--TLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYPRATE 297
Query: 309 VAPVAPFGACFRL--QDIGFTRIGP----VVPQIDLVLQNKNVVWSIHGQN----SMVQI 358
F C+++ + T + V P I N + G + S
Sbjct: 298 TESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMSAPSD 357
Query: 359 GGDALCLGF---VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G CL F DG P + V G+ Q +N + +DL R+GF
Sbjct: 358 GSVVQCLLFQNMEDGNYGP--AGVFGSFQQQNVKVVYDLEKERIGF 401
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 77/293 (26%), Positives = 130/293 (44%), Gaps = 52/293 (17%)
Query: 120 GDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQ 179
G + DVL++ + + G F++ CG + QGL GI GL K+++ Q
Sbjct: 202 GYLSQDVLTLTPSAAPSSG-------FVYGCGQDN--QGLFGRSAGIIGLANDKLSMLGQ 252
Query: 180 LAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNF-------DVSKNLKYTPLFIN-KV 231
L+ + F+ CL F P ++ F S K+TPL N K+
Sbjct: 253 LSNKYG--NAFSYCLPS------SFSAQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKI 304
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
PS+ YF+G+T++ V GK + ++ + ++ GT I T L +I
Sbjct: 305 ---------PSL-YFLGLTTITVAGKPLGVSASSYNVPTIIDSGTVI------TRLPVAI 348
Query: 292 YKALVQAFASAMPKVARVAP-VAPFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSI 349
Y AL ++F M K AP + CF+ G + VP+I ++ + + +
Sbjct: 349 YNALKKSFVMIMSKKYAQAPGFSILDTCFK----GSVKEMSTVPEIRIIFRGGAGLELKV 404
Query: 350 HGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
H NS+V+I CL + NP + +IG Q + + +D+A S++GF+
Sbjct: 405 H--NSLVEIEKGTTCLA-IAASSNPIS--IIGNYQQQTFTVAYDVANSKIGFA 452
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 84/364 (23%), Positives = 149/364 (40%), Gaps = 66/364 (18%)
Query: 65 GGNILWVDCEKGYV-------------SSTNKTARCGSAQCNLANAKACGGGICGAGPDN 111
G +++W C + S+T + C S++C ++ +C +C
Sbjct: 4 GSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMC------ 57
Query: 112 PISNTGTHGDI--RIDVLSIQS-TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAG 168
+GD VL+ ++ T G V N F CGS + G + G+ G
Sbjct: 58 --VYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGS--LNAGDLANSSGMVG 113
Query: 169 LGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---IVFGDGPYYDLNNFDVSKN--LKY 223
GR ++L SQL + +F+ CL+ + + FG Y +L++ + S ++
Sbjct: 114 FGRGPLSLVSQLGPS-----RFSYCLTSYLSATPSRLYFGV--YANLSSTNTSSGSPVQS 166
Query: 224 TPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNP 283
TP IN P++ YF+ + ++ + K +P++ + +I+++G GG I +
Sbjct: 167 TPFVINP--------ALPNM-YFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTS 217
Query: 284 YTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQI-----DL 338
T L+ Y+A+ + SA+P A CF+ P P + DL
Sbjct: 218 ITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQW---------PPPPNVTVTVPDL 268
Query: 339 VLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATS 397
V + ++ +N M+ LCL GV +IG Q +N L +D+ S
Sbjct: 269 VFHFDSANMTLLPENYMLIASTTGYLCLVMAPTGVG----TIIGNYQQQNLHLLYDIGNS 324
Query: 398 RLGF 401
L F
Sbjct: 325 FLSF 328
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 97/393 (24%), Positives = 158/393 (40%), Gaps = 72/393 (18%)
Query: 34 LRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVD---CEKGY----------VSS 80
LR + +Y + TP V + G ++LW+ C+ Y SS
Sbjct: 70 LRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSS 129
Query: 81 TNKTARCGSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
T ++ CGS+ C + C C G G + T G+ + LS G
Sbjct: 130 TFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDG-------SFTVGEFSTETLSF----G 178
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
N +V + CG QGL +G G+ GLG+ ++ PSQ+ + F+ CL
Sbjct: 179 SNAVNSVAIG-----CGHNN--QGLFTGAAGLLGLGKGLLSFPSQVGQLYG--SVFSYCL 229
Query: 195 SPFDDGA---IVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVT 250
+ ++FG N V+ N ++T L N K++T Y++ +
Sbjct: 230 PTRESTGSVPLIFG--------NQAVASNAQFTTLLTNPKLDTF----------YYVEMV 271
Query: 251 SVHVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
+ V G +V + LS+D+ G GG + + T L TS Y + AF + MP A++
Sbjct: 272 GIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKM 331
Query: 310 AP-VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGF 367
+ F C+ L R ++P + V N ++ QN MV + CL F
Sbjct: 332 TSGFSLFDTCYDLSG----RSSIMLPAVSFVF-NGGATMALPAQNIMVPVDNSGTYCLAF 386
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLG 400
N +IG Q ++ + FD +R+G
Sbjct: 387 AP---NSENFSIIGNIQQQSFRMSFDSTGNRVG 416
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 89/378 (23%), Positives = 153/378 (40%), Gaps = 54/378 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y ++ TP + L + G +ILW+ C Y SST T C S
Sbjct: 36 EYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYKSSTYSTLGCNSR 95
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC + C G C D + T G+ D +S+ ST G G V + C
Sbjct: 96 QCLNLDVGGCVGNKCLYQVDYGDGSFST-GEFATDAVSLNSTSG---GGQVVLNKIPLGC 151
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-----GAIVFG 205
G + +G G G+ GLG+ ++ P+Q+ + +F+ CL+ D +++FG
Sbjct: 152 GHDN--EGYFVGAAGLLGLGKGPLSFPNQINSENG--GRFSYCLTGRDTDSTERSSLIFG 207
Query: 206 DGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
D +++TP N +V+T Y++ +T + V G + + +
Sbjct: 208 DAAV-------PPAGVRFTPQASNLRVSTF----------YYLKMTGISVGGSILTIPTS 250
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
+D+ G GG I + T L+ + Y +L +AF + + + F C+ L D+
Sbjct: 251 AFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDL 310
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRTSIVIGAR 383
VP + L Q + N +V + + CL F G P +IG
Sbjct: 311 SSVD----VPTVTLHFQG-GADLKLPASNYLVPVDNSSTFCLAFA-GTTGPS---IIGNI 361
Query: 384 QLENNLLQFDLATSRLGF 401
Q + + +D +++GF
Sbjct: 362 QQQGFRVIYDNLHNQVGF 379
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 89/392 (22%), Positives = 153/392 (39%), Gaps = 71/392 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA--------- 95
Y T++K TP + + G ++LWV C +S N + Q L+
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSC-----TSCNGCPKTSELQIQLSFFDPGVSSS 138
Query: 96 ------NAKACGGGI---CGAGPDNPIS-------NTGTHGDIRIDVLSIQSTDGRNPGR 139
+ + C G P+N S +GT G D +S +
Sbjct: 139 ASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAI 198
Query: 140 AVTVPNFIFLCGS--EFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
+ P F+F C + LQ V GI GLG+ +++ SQLA R F+ CL
Sbjct: 199 NSSAP-FVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGD 257
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G + G + YTPL ++ + Y + + S+ VNG+
Sbjct: 258 KSGGGIMVLGQIK-------RPDTVYTPLVPSQPH------------YNVNLQSIAVNGQ 298
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+P++ ++ +I GT I T L Y +QA A+A+ + R +
Sbjct: 299 ILPIDPSVFTIATG--DGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESY-Q 355
Query: 318 CFRLQ--DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI----GGDALCLGFVDGG 371
CF + D+ V PQ+ L + + G + +QI G C+GF
Sbjct: 356 CFEITAGDVD------VFPQVSLSFAGGASM--VLGPRAYLQIFSSSGSSIWCIGFQR-- 405
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
++ R ++G L++ ++ +DL R+G+++
Sbjct: 406 MSHRRITILGDLVLKDKVVVYDLVRQRIGWAE 437
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 91/391 (23%), Positives = 151/391 (38%), Gaps = 67/391 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKA----- 99
Y +I +P + V G +ILWV+C S+ K + G L N K+
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVG--CSNCPKKSDIG-VDLQLYNPKSSSTST 129
Query: 100 ---CGGGICGAGPDNPI----------------SNTGTHGDIRIDVLSIQSTDGRNPGRA 140
C C A D PI + T G D + +Q G N +
Sbjct: 130 LITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVG-NHKTS 188
Query: 141 VTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
T + +F CG++ L + + GI G G++ ++ SQLAA +K+ FA CL
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSIS 248
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
G I F G V L TP+ N+ + Y + + V V A
Sbjct: 249 GGGI-FAIGEV-------VEPKLXNTPVVPNQAH------------YNVVLNGVKVGDTA 288
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
+ L L + G I + L SIY L++ A P + ++ V C
Sbjct: 289 LDLPLGLFETSYK--RGAIIDSGTTLAYLPESIYLPLMEKILGAQPDL-KLRTVDDQFTC 345
Query: 319 FRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
F D GF P + + ++++ +I+ + QI D C+G+ + G +
Sbjct: 346 FVFDKNVDDGF-------PTVTFKFE-ESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSK 397
Query: 376 TS---IVIGARQLENNLLQFDLATSRLGFSD 403
++G L+N L+ ++L +G+++
Sbjct: 398 DGNEVTLLGDLVLQNKLVYYNLENQTIGWTE 428
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 85/398 (21%), Positives = 157/398 (39%), Gaps = 68/398 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTAR---C 87
QY +++ TP L V G ++ W+ C Y S++ + R C
Sbjct: 58 QYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPC 117
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTD------G 134
+C A G C +P T + D + + +S++S G
Sbjct: 118 TDDECQFLPAPI--GSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAG 175
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
+ R + + N C E V G G+ GLG+ ++L +Q +L F+ CL
Sbjct: 176 NHKTRRIRIKNVALGCSRESVGASFL-GASGVLGLGQGPISLATQTRHT-ALGGIFSYCL 233
Query: 195 SPFDDGA-----IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
+ G+ +V G + + L +TP+ N A F Y++ V
Sbjct: 234 VDYLRGSNASSFLVMGRTHW---------RKLAHTPIVRNPA--AQSF-------YYVNV 275
Query: 250 TSVHVNGKAVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA--MPKV 306
T V V+GK V + + ID +G GT + + L Y ++ A ++ +P+
Sbjct: 276 TGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRA 335
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
+ F C+ + TR+ +P++ + Q V+ + N MV + + C+
Sbjct: 336 QEIP--EGFELCYNV-----TRMEKGMPKLGVEFQGGAVM-ELPWNNYMVLVAENVQCVA 387
Query: 367 FVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
+ S ++G +++ +++DLA +R+GF S
Sbjct: 388 -LQKVTTTNGSNILGNLLQQDHHIEYDLAKARIGFKWS 424
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 80/381 (20%), Positives = 148/381 (38%), Gaps = 43/381 (11%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSA 90
+Y + +P L + G ++ W+ C + S++ K C
Sbjct: 154 EYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCNDP 213
Query: 91 QCNLANAKACGGGICGAGPDNPI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
+CNL + P ++ T GD ++ ++ T V N
Sbjct: 214 RCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVEN 273
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
+F CG +GL G G+ GLGR ++ SQL + + F+ CL + V
Sbjct: 274 MMFGCGH--WNRGLFHGAAGLLGLGRGPLSFSSQLQSLYG--HSFSYCLVDRNSDTNVSS 329
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
+ + + NL +T K N F Y++ + S+ V G+ + + +
Sbjct: 330 KLIFGEDKDLLSHPNLNFTSFVARKENLVDTF-------YYVQIKSIIVAGEVLNIPEET 382
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIY---KALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+I ++G GGT I + + Y K + A V R P+ CF +
Sbjct: 383 WNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPI--LDPCFNVS 440
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI-VIG 381
I ++ P++ + + VW+ +NS + + D +CL + P+++ +IG
Sbjct: 441 GIDSIQL----PELGIAFAD-GAVWNFPTENSFIWLNEDLVCLAILG---TPKSAFSIIG 492
Query: 382 ARQLENNLLQFDLATSRLGFS 402
Q +N + +D SRLG++
Sbjct: 493 NYQQQNFHILYDTKRSRLGYA 513
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 90/394 (22%), Positives = 157/394 (39%), Gaps = 69/394 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTA---- 85
Y T++K P + + G +ILWV C + + ++ TA
Sbjct: 89 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 148
Query: 86 ----RC------GSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGR 135
RC G A C +N+++ G D +GT G D + ++ G
Sbjct: 149 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGD----GSGTSGYYVSDTMFFETVMG- 203
Query: 136 NPGRAVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
N A + + +F C + L V GI G G+ ++++ SQL + + F+ C
Sbjct: 204 NEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC 263
Query: 194 LSPFDDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTS 251
L D+G +V G+ V L YTPL ++ + Y + + S
Sbjct: 264 LKGSDNGGGILVLGE---------IVEPGLVYTPLVPSQPH------------YNLNLES 302
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+ VNG+ +P++ +L + N GT + + L Y V A A+A+ R +
Sbjct: 303 IAVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVR-SL 359
Query: 312 VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDG 370
V+ CF I + + P + L V S+ +N ++Q D L +
Sbjct: 360 VSKGSQCF----ITSSSVDSSFPTVTLYFMG-GVAMSVKPENYLLQQASVDNSVLWCIGW 414
Query: 371 GVNPRTSI-VIGARQLENNLLQFDLATSRLGFSD 403
N I ++G L++ + +DLA R+G++D
Sbjct: 415 QRNQGQEITILGDLVLKDKIFVYDLANMRMGWAD 448
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 90/394 (22%), Positives = 157/394 (39%), Gaps = 69/394 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTA---- 85
Y T++K P + + G +ILWV C + + ++ TA
Sbjct: 91 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 150
Query: 86 ----RC------GSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGR 135
RC G A C +N+++ G D +GT G D + ++ G
Sbjct: 151 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGD----GSGTSGYYVSDTMFFETVMG- 205
Query: 136 NPGRAVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
N A + + +F C + L V GI G G+ ++++ SQL + + F+ C
Sbjct: 206 NEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC 265
Query: 194 LSPFDDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTS 251
L D+G +V G+ V L YTPL ++ + Y + + S
Sbjct: 266 LKGSDNGGGILVLGE---------IVEPGLVYTPLVPSQPH------------YNLNLES 304
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+ VNG+ +P++ +L + N GT + + L Y V A A+A+ R +
Sbjct: 305 IAVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVR-SL 361
Query: 312 VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDG 370
V+ CF I + + P + L V S+ +N ++Q D L +
Sbjct: 362 VSKGSQCF----ITSSSVDSSFPTVTLYFMG-GVAMSVKPENYLLQQASVDNSVLWCIGW 416
Query: 371 GVNPRTSI-VIGARQLENNLLQFDLATSRLGFSD 403
N I ++G L++ + +DLA R+G++D
Sbjct: 417 QRNQGQEITILGDLVLKDKIFVYDLANMRMGWAD 450
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 90/396 (22%), Positives = 159/396 (40%), Gaps = 73/396 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTA---- 85
Y T++K P + + G +ILWV C + + ++ TA
Sbjct: 5 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 64
Query: 86 ----RC------GSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGR 135
RC G A C +N+++ G D +GT G D + ++ G
Sbjct: 65 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGD----GSGTSGYYVSDTMFFETVMG- 119
Query: 136 NPGRAVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
N A + + +F C + L V GI G G+ ++++ SQL + + F+ C
Sbjct: 120 NEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC 179
Query: 194 LSPFDDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTS 251
L D+G +V G+ V L YTPL ++ + Y + + S
Sbjct: 180 LKGSDNGGGILVLGE---------IVEPGLVYTPLVPSQPH------------YNLNLES 218
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+ VNG+ +P++ +L + N GT + + L Y V A A+A+ R +
Sbjct: 219 IAVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVR-SL 275
Query: 312 VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGF 367
V+ CF I + + P + L V S+ +N ++Q + L C+G+
Sbjct: 276 VSKGSQCF----ITSSSVDSSFPTVTLYFMG-GVAMSVKPENYLLQQASVDNSVLWCIGW 330
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
T ++G L++ + +DLA R+G++D
Sbjct: 331 QRNQGQEIT--ILGDLVLKDKIFVYDLANMRMGWAD 364
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 101/394 (25%), Positives = 154/394 (39%), Gaps = 65/394 (16%)
Query: 42 TLQYVTQIK----QRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKT 84
TL YVT I +P + + V G ++ WV C+ S+T
Sbjct: 141 TLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAA 200
Query: 85 ARCGSAQCNLANAKACGG----GICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA 140
RC ++ C + A G G GAG + +GD + +TD G A
Sbjct: 201 VRCNASACADSLRAATGTPGSCGSTGAGSEKCYYAL-AYGDGSFS-RGVLATDTVALGGA 258
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SP 196
++ F+F CG +GL G G+ GLGR++++L SQ A+ + F+ CL S
Sbjct: 259 -SLGGFVFGCG--LSNRGLFGGTAGLMGLGRTELSLVSQTASRYG--GVFSYCLPAATSG 313
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHV 254
G++ G G + S TP+ ++ + +P+ YF+ VT V
Sbjct: 314 DASGSLSLGGG------DDAASSYRNTTPVAYTRM------IADPAQPPFYFLNVTGAAV 361
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
G T L+ G I + T L S+Y+A+ F P AP
Sbjct: 362 GG-------TALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGY--PAAP 412
Query: 315 ----FGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVD 369
C+ L G + VP + L L+ +V G +V+ G +CL
Sbjct: 413 GFSILDTCYDL--TGHDEV--KVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMAS 468
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
T I IG Q +N + +D SRLGF+D
Sbjct: 469 LSYEDETPI-IGNYQQKNKRVVYDTLGSRLGFAD 501
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 154/379 (40%), Gaps = 60/379 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVD---CEKGYV----------SSTNKTARCGSA 90
+Y T++ TP V + + G +++W+ C K Y S + + C S
Sbjct: 146 EYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSP 205
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C ++ C + T G+ + L+ R VP C
Sbjct: 206 LCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTF---------RGTRVPKVALGC 256
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFDDGAIVFGD 206
G + +GL G G+ GLGR +++ P+Q F RKF+ CL + ++VFG
Sbjct: 257 GHD--NEGLFVGAAGLLGLGRGRLSFPTQTGLRFG--RKFSYCLVDRSASSKPSSVVFGQ 312
Query: 207 GPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKAVP-LNKT 264
VS+ +TPL N K++T Y++ +T + V G V + +
Sbjct: 313 SA--------VSRTAVFTPLITNPKLDTF----------YYLELTGISVGGARVAGITAS 354
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
L +D G GG I + T L Y +L AF + + R + F CF L
Sbjct: 355 LFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCFDLS-- 412
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSIVIGAR 383
G T + VP + + + +V S+ N ++ + + + C F G SI IG
Sbjct: 413 GKTEVK--VPTVVMHFRGADV--SLPATNYLIPVDTNGVFCFAFA--GTMSGLSI-IGNI 465
Query: 384 QLENNLLQFDLATSRLGFS 402
Q + + FD+A SR+GF+
Sbjct: 466 QQQGFRVVFDVAASRIGFA 484
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 97/402 (24%), Positives = 145/402 (36%), Gaps = 86/402 (21%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSAQ 91
YV TP P V + G ++W C SST K CG+A
Sbjct: 45 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 104
Query: 92 CNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCG 151
C ++C G +C S G +R + +TD G A F +
Sbjct: 105 CESIPTRSCSGDVC--------SYKGPPTQLRGNTSGFAATDTFAIGTATVRLAFGCVVA 156
Query: 152 SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYD 211
S+ G G GLGR+ P L A L R F+ CLSP + G
Sbjct: 157 SDI---DTMDGPSGFIGLGRT----PWSLVAQMKLTR-FSYCLSPRNTG----------- 197
Query: 212 LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE 271
K + LF+ +G + + TS +G N LLS+D
Sbjct: 198 ----------KSSRLFLGSSAKLAGSESTSTAPFI--KTSPDDDGS----NYYLLSLDAI 241
Query: 272 GVGGTKIS-----------TVNPYTVLETSIYKALVQAFASAM---PKVARVAPVAPFGA 317
G T I+ TV+P+++L S YKA +A A+ P PF
Sbjct: 242 RAGNTTIATAQSGGILVMHTVSPFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDL 301
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG--GDALCLGFVDGGVNPR 375
CF+ + GF+R P + Q + ++ ++ +G D C + R
Sbjct: 302 CFK-KAAGFSRA--TAPDLVFTFQGAAAL-TVPPAKYLIDVGEEKDTACAAILSMAWLNR 357
Query: 376 TSI----VIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
T + V+G+ Q E+ +DL ++L FE A C+
Sbjct: 358 TGLEGVSVLGSLQQEDVHFLYDLK------KETLSFEPADCS 393
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 147/384 (38%), Gaps = 76/384 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y T++ P V + + G ++ W+ C + SS+ + C +
Sbjct: 150 EYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTP 209
Query: 91 QCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
QCN C C G G + T GD + L+I ST V
Sbjct: 210 QCNALEVSECRNATCLYEVSYGDG-------SYTVGDFATETLTIGST---------LVQ 253
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA--- 201
N CG +GL G G+ GLG +ALPSQL F+ CL D +
Sbjct: 254 NVAVGCGHSN--EGLFVGAAGLLGLGGGLLALPSQLNTT-----SFSYCLVDRDSDSAST 306
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
+ FG + + PL N + Y++G+T + V G+ + +
Sbjct: 307 VEFGTS---------LPPDAVVAPLLRNH---------QLDTFYYLGLTGISVGGELLQI 348
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
++ +D G GG I + T L+T IY +L +F + + A VA F C+ L
Sbjct: 349 PQSSFEMDESGSGGIIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNL 408
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDGGVNPRTS--I 378
+ VP + ++ ++ +N M+ + CL F P S
Sbjct: 409 S----AKTTIEVPTVAFHFPGGKML-ALPAKNYMIPVDSVGTFCLAFA-----PTASSLA 458
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
+IG Q + + FDLA S +GFS
Sbjct: 459 IIGNVQQQGTRVTFDLANSLIGFS 482
>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
Length = 360
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 68/289 (23%), Positives = 120/289 (41%), Gaps = 19/289 (6%)
Query: 115 NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKV 174
++ T GD ++ ++ T V N +F CG +GL G G+ GLGR +
Sbjct: 83 SSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCGH--WNRGLFHGAAGLLGLGRGPL 140
Query: 175 ALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTA 234
+ SQL + + F+ CL + A V + + + L +T L K N
Sbjct: 141 SFSSQLQSLYG--HSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPV 198
Query: 235 SGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKA 294
F Y++ + S+ V G+ V + + I +G GGT I + + Y+
Sbjct: 199 DTF-------YYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQV 251
Query: 295 LVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNS 354
+ +AF + + V C+ + + P +P +V + VW+ +N
Sbjct: 252 IKEAFMAKVKGYPVVKDFPVLEPCYNVTGVE----QPDLPDFGIVFSD-GAVWNFPVENY 306
Query: 355 MVQIG-GDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
++I + +CL + G P +IG Q +N + +D SRLGF+
Sbjct: 307 FIEIEPREVVCLAIL--GTPPSALSIIGNYQQQNFHILYDTKKSRLGFA 353
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 88/386 (22%), Positives = 148/386 (38%), Gaps = 69/386 (17%)
Query: 58 VKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQCNLANA---KACG 101
V LT+ G +++W C VSST + C C ++ AC
Sbjct: 101 VALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRAVACPDPICRPSSGLSVSACA 160
Query: 102 GG------ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFV 155
+C G D I T G I D + S +G V V F CG ++
Sbjct: 161 LKTFRCFYLCSYG-DKSI----TAGYIFKDTFTFMSPNGEG-APPVAVSGLAFGCG-DYN 213
Query: 156 LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD------GAIVFGDGPY 209
AS GIAG GR ++LPSQL +F+ CL+ D+ A+ G P
Sbjct: 214 TGVFASNESGIAGFGRGPLSLPSQLRVG-----RFSYCLTSHDETESNKTSAVFLGTPP- 267
Query: 210 YDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKAVPLNKTLLS 267
+ S + TP+ + PS Y++ + + V +P++ ++ +
Sbjct: 268 -NGLRAHSSGPFRSTPI-----------IHSPSFPTFYYLSLEGITVGKTRLPVDSSVFA 315
Query: 268 IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFT 327
+ +G GGT I + T ++++ L F + +P + R + G Q
Sbjct: 316 LKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLP-LPRYDNTSEVGNLLCFQR---P 371
Query: 328 RIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALCLGFVDGGVNPRTSIVIGARQLE 386
+ G VP L+ + + +N + + +CL V+ ++IG Q +
Sbjct: 372 KGGKQVPVPKLIFHLASADMDLPRENYIPEDTDSGVMCLMINGAEVD---MVLIGNFQQQ 428
Query: 387 NNLLQFDLATSRLGFSDSLLFERATC 412
N + +D+ S+ LLF A C
Sbjct: 429 NMHIVYDVENSK------LLFASAQC 448
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 141/366 (38%), Gaps = 39/366 (10%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+++ + TP + G +++W C+ V T + + + C
Sbjct: 96 EFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSD 155
Query: 104 ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT-----VPNFIFLCGSEFVLQG 158
+C A PIS+ + R ST G T V F CG + +
Sbjct: 156 LCVA---LPISSCSDGCEYRYSYGDHSSTQGVLATETFTFGDASVSKIGFGCGEDNRGRA 212
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVS 218
+ G G+ GLGR ++L SQL KF+ CL+ DD + + +
Sbjct: 213 YSQGA-GLVGLGRGPLSLISQLGVP-----KFSYCLTSIDDSKGI----STLLVGSEATV 262
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
K+ TPL N PS Y++ + + V +P+ K+ SI ++G GG I
Sbjct: 263 KSAIPTPLIQNP--------SRPSF-YYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLII 313
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPV-VPQID 337
+ T L+ S + AL + F S M + CF L G PV VPQ
Sbjct: 314 DSGTTITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDG----SPVDVPQ-- 367
Query: 338 LVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLAT 396
LV + V + +N +++ +CL G + SI G Q +N ++ DL
Sbjct: 368 LVFHFEGVDLKLPKENYIIEDSALRVICLTM---GSSSGMSI-FGNFQQQNIVVLHDLEK 423
Query: 397 SRLGFS 402
+ F+
Sbjct: 424 ETISFA 429
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 86/392 (21%), Positives = 159/392 (40%), Gaps = 68/392 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY------------------VSSTNKTAR 86
Y T+++ TP V + + G ++LWV C SST+
Sbjct: 75 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIA 134
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNT-------GTHGDIRIDVLSIQSTDGRNPGR 139
C +CN N C + +N S T GT G D++ + + +
Sbjct: 135 CSDQRCN--NGIQSSDATC-SSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTT 191
Query: 140 AVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
T P +F C ++ L V GI G G+ ++++ SQL++ R F+ CL
Sbjct: 192 NSTAP-VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGD 250
Query: 198 DDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
G +V G+ V N+ YT L +P Y + + S+ VN
Sbjct: 251 SSGGGILVLGE---------IVEPNIVYTSLVP----------AQP--HYNLNLQSIAVN 289
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
G+ + ++ ++ + N GT + + L Y V A +++P+ V+
Sbjct: 290 GQTLQIDSSVFATSNS--RGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTV-VSRG 346
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGFVDGG 371
C+ + + + V PQ+ L + + Q+ ++Q IGG A+ C+GF
Sbjct: 347 NQCYLIT----SSVTEVFPQVSLNFAGGASMI-LRPQDYLIQQNSIGGAAVWCIGFQK-- 399
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
+ + ++G L++ ++ +DLA R+G+++
Sbjct: 400 IQGQGITILGDLVLKDKIVVYDLAGQRIGWAN 431
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 88/377 (23%), Positives = 143/377 (37%), Gaps = 73/377 (19%)
Query: 53 TPLVPVKLTVHLGGNILWVDCE-------------------KGYVSSTNKTARCG----- 88
TP + L G ++ W CE Y++ T ++ C
Sbjct: 144 TPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLCTQLTSA 203
Query: 89 --SAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
++C+ ++ AC GI + G + + L+I +TD V +F
Sbjct: 204 GIKSRCS-SSTTACIYGIQYGDKSTSV------GFLSQERLTITATD--------IVDDF 248
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--GAIVF 204
+F CG + +GL SG G+ GLGR ++ Q ++ ++ + F+ CL G + F
Sbjct: 249 LFGCGQDN--EGLFSGSAGLIGLGRHPISFVQQTSSIYN--KIFSYCLPSTSSSLGHLTF 304
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
G + NLKYTPL +T SG + Y + + + V G +P
Sbjct: 305 GASA-------ATNANLKYTPL-----STISG----DNTFYGLDIVGISVGGTKLP---- 344
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
+S GG+ I + T L + Y AL AF M K F C+
Sbjct: 345 AVSSSTFSAGGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFS-- 402
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQ 384
G+ I VP+ID V + ++ +CL F G N + G Q
Sbjct: 403 GYKEIS--VPKIDFEFAG-GVTVELPLVGILIGRSAQQVCLAFAANG-NDNDITIFGNVQ 458
Query: 385 LENNLLQFDLATSRLGF 401
+ + +D+ R+GF
Sbjct: 459 QKTLEVVYDVEGGRIGF 475
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 94/388 (24%), Positives = 151/388 (38%), Gaps = 58/388 (14%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-------EKGYV-------SSTNKTARC 87
T +Y+ + TP PV LT+ G +++W C E+G SST+ C
Sbjct: 87 TNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALPC 146
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTD-----GRNPGRAVT 142
+ C +CGG G D +GD + V + +TD G + +
Sbjct: 147 DAPLCRALPFTSCGGRSWG---DRSCVYVYHYGDRSLTVGQL-ATDSFTFGGDDNAGGLA 202
Query: 143 VPNFIFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-- 199
F CG + +G+ + GIAG GR + +LPSQL F+ C + D
Sbjct: 203 ARRVTFGCG--HINKGIFQANETGIAGFGRGRWSLPSQLNVT-----SFSYCFTSMFDTK 255
Query: 200 --GAIVFGDGPYYDLNNFDVSK--NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
+ G L+ + +++ T L N +PS+ YF+ + + V
Sbjct: 256 SSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNP--------SQPSL-YFVPLRGISVG 306
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
G V + ++ L T I + T L +Y+A+ F S + A A A
Sbjct: 307 GARVAVPESRLR------SSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAAL 360
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALCLGFVDGGVNP 374
CF L R P VP + L L + W + N + + LC+ +D
Sbjct: 361 DLCFALPVAALWR-RPAVPALTLHL-DGGADWELPRGNYVFEDYAARVLCV-VLDAAAGE 417
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFS 402
+ +VIG Q +N + +DL L F+
Sbjct: 418 Q--VVIGNYQQQNTHVVYDLENDVLSFA 443
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 91/387 (23%), Positives = 150/387 (38%), Gaps = 59/387 (15%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV---------------SSTNKTAR 86
T YV + TP + + G ++ WV C SST R
Sbjct: 151 TGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVR 210
Query: 87 CGSAQCNLANAKACGGGICGAGPDN-PIS-----NTGTHGDIRIDVLSIQSTDGRNPGRA 140
CG+ +C ++CGG G D P + T G + D L++ + N
Sbjct: 211 CGARECRA--RQSCGG---SPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAE 265
Query: 141 --VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
+P F+F CG GL G+ GLGR KV+L SQ A F F+ CL
Sbjct: 266 NDNKLPGFVFGCGENNT--GLFGQADGLFGLGRGKVSLSSQAAGKFG--EGFSYCLPSSS 321
Query: 199 DGAIVFGDGPYYDLNN-FDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
A Y L + ++TP+ +N+ T S Y++ + + V G+
Sbjct: 322 SSAP-----GYLSLGTPVPAPAHAQFTPM-LNRTTTPS--------FYYVKLVGIRVAGR 367
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA--RVAPVAPF 315
A+ ++ +++ GT I+ + P Y+AL AF SAM K R ++
Sbjct: 368 AIRVSSPRVALPLIVDSGTVITRLAPRA------YRALRAAFLSAMGKYGYKRAPRLSIL 421
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
C+ + +P + LV + S+ + CL F G + R
Sbjct: 422 DTCYDFTAHANATVS--IPAVALVFAGGATI-SVDFSGVLYVAKVAQACLAFAPNG-DGR 477
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFS 402
++ ++G Q + +D+A ++GF+
Sbjct: 478 SAGILGNTQQRTLAVVYDVARQKIGFA 504
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 96/388 (24%), Positives = 147/388 (37%), Gaps = 63/388 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+++ + TP +P V G +++W C+ SST C SA
Sbjct: 115 EFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPCSSA 174
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C C + +P T T+GD + +T+ R VP F C
Sbjct: 175 LCADLPTSTCASSSSSSSASSPCGYTYTYGDAS-STQGVLATETFTLARQ-KVPGVAFGC 232
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G G G G+ GLGR ++L SQL +F+ CL+ DD A G P
Sbjct: 233 GDTNEGDGFTQGA-GLVGLGRGPLSLVSQLGI-----DRFSYCLTSLDDAA---GRSPLL 283
Query: 211 -----DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
++ + + TPL N +PS Y++ +T + V + L +
Sbjct: 284 LGSAAGISASAATAPAQTTPLVKNP--------SQPSF-YYVSLTGLTVGSTRLALPSSA 334
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
+I ++G GG + + T LE Y+AL +AF + M A CF+
Sbjct: 335 FAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQ----- 389
Query: 326 FTRIGP--VVPQIDLVLQNKNVVWSIHG--------QNSMV-QIGGDALCLGFVDGGVNP 374
GP V Q D+ +Q +V G +N MV ALCL +
Sbjct: 390 ----GPAGAVDQ-DVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMAS---- 440
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFS 402
R +IG Q +N +D+A L F+
Sbjct: 441 RGLSIIGNFQQQNFQFVYDVAGDTLSFA 468
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 85/341 (24%), Positives = 143/341 (41%), Gaps = 65/341 (19%)
Query: 79 SSTNKTARCGSAQC-NLANA-------KACGGGICGAGPDNPISNTGTHGDIRIDVLSIQ 130
SST KT C S +C N+ N K C G + GD+ ID L++
Sbjct: 136 SSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGG------EAYSQGDLSIDTLTLN 189
Query: 131 STDGRNPGRAVTVPNFIFLCG--SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKR 188
S N ++ N + CG ++ L+G SG + GLGR ++ SQL + S+
Sbjct: 190 S----NNDTPISFKNIVIGCGHRNKGPLEGYVSGNI---GLGRGPLSFISQLNS--SIGG 240
Query: 189 KFALCLSP-FDD----GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV 243
KF+ CL P F + G + FGD ++ V T S + +
Sbjct: 241 KFSYCLVPLFSNEGISGKLHFGDKS------------------VVSGVGTVSTPITAGEI 282
Query: 244 EYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
Y + ++ V + + + N+ +G T I + T+L ++Y L ++ ++M
Sbjct: 283 GYSTTLNALSVGDHIIKFENS--TSKNDNLGNTIIDSGTTLTILPENVYSRL-ESIVTSM 339
Query: 304 PKVARV-APVAPFGACFR--LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
K+ R +P F C++ L+++ VP I +V ++ N+ I
Sbjct: 340 VKLERAKSPNQQFKLCYKATLKNLD-------VPIITAHFNGADV--HLNSLNTFYPIDH 390
Query: 361 DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+ +C FV G P T +IG +N L+ FDL + + F
Sbjct: 391 EVVCFAFVSVGNFPGT--IIGNIAQQNFLVGFDLQKNIISF 429
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 140/378 (37%), Gaps = 58/378 (15%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTN-------------KTARCGSAQ 91
YV + + TP + + + + +W+ C G +N T C + Q
Sbjct: 105 YVVRARLGTPPQLMFMVLDTSNDAVWLPCS-GCSGCSNASTSFNTNSSSTYSTVSCSTTQ 163
Query: 92 CNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC- 150
C A C P N GD +Q T +P +PNF F C
Sbjct: 164 CTQARGLTCPSST--PQPSICSFNQSYGGDSSFSANLVQDTLTLSPD---VIPNFSFGCI 218
Query: 151 ----GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGD 206
G+ QGL GLGR ++L SQ + +S F+ CL F F
Sbjct: 219 NSASGNSLPPQGLM-------GLGRGPMSLVSQTTSLYS--GVFSYCLPSFRS----FYF 265
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
L K+++YTPL N PS+ Y++ +T V V VP++ L
Sbjct: 266 SGSLKLGLLGQPKSIRYTPLLRNP--------RRPSL-YYVNLTGVSVGSVQVPVDPVYL 316
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGF 326
+ D+ GT I + T +Y+A+ F + + + F CF +
Sbjct: 317 TFDSNSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV--NGSFSTLGAFDTCFSADNENV 374
Query: 327 T-RIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI-VIGARQ 384
T +I + +DL L +N + IH G CL N + VI Q
Sbjct: 375 TPKITLHMTSLDLKLPMENTL--IHSS------AGTLTCLSMAGIRQNANAVLNVIANLQ 426
Query: 385 LENNLLQFDLATSRLGFS 402
+N + FD+ SR+G +
Sbjct: 427 QQNLRILFDVPNSRIGIA 444
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 90/402 (22%), Positives = 157/402 (39%), Gaps = 67/402 (16%)
Query: 24 QTSFRPKALVLRVAKDTA--TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC---EKGY- 77
+T F+P+ L V T+ + +Y ++I TP + + + G ++ W+ C + Y
Sbjct: 141 ETRFQPEDLTTPVVSGTSQGSGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQ 200
Query: 78 ---------VSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLS 128
SST K+ C +C + AC C ++GD V +
Sbjct: 201 QSDPIFDPTSSSTFKSLTCSDPKCASLDVSACRSNKCLYQV--------SYGDGSFTVGN 252
Query: 129 IQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKR 188
+TD G + V + CG + +GL +G G+ GLG +++ +Q+ A +
Sbjct: 253 Y-ATDTVTFGESGKVNDVALGCGHDN--EGLFTGAAGLLGLGGGALSMTNQIKA-----K 304
Query: 189 KFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFI 247
F+ CL D D N+ + PL N K++T Y++
Sbjct: 305 SFSYCLVDRDSAK-----SSSLDFNSVQIGAGDATAPLLRNSKMDTF----------YYV 349
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
G++ V G+ V + +L +D G GG + T L+T Y +L AF
Sbjct: 350 GLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFK 409
Query: 308 R-VAPVAPFGACFRLQDIGFTRIGPVVPQI----DLVLQNKNVVWSIHGQNSMVQIGGDA 362
+ +P++ F C+ + ++ V L L KN + I +
Sbjct: 410 KGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGT-------- 461
Query: 363 LCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
C F P +S +IG Q + + +DLA + +G S
Sbjct: 462 FCFAFA-----PTSSSLSIIGNVQQQGTRITYDLANNLIGLS 498
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 89/388 (22%), Positives = 139/388 (35%), Gaps = 70/388 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYVSS---------------T 81
+ T I TP V + + G ++LW+ C Y SS T
Sbjct: 97 HYTWIDIGTPSVSFLVALDSGSDLLWIPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTT 156
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
+K C C A A C NT + G + DVL + + +
Sbjct: 157 SKVFPCSHKLCESAPACESPKEQCPYTVTYASENTSSSGLLVEDVLHLAYSANASSSVKA 216
Query: 142 TVPNFIFLCGS----EFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
V + CG EF L+G+A G+ GLG ++++PS LA A ++ F++C
Sbjct: 217 RV---VVGCGEKQSGEF-LKGIAPD--GVMGLGPGEISVPSFLAKAGLMRNSFSMCFDEE 270
Query: 198 DDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
D G I FGD GP ++ ++ P + N+ V YF+GV V
Sbjct: 271 DSGRIYFGDVGP-------STQQSTRFLP-YKNEF-----------VAYFVGVEVCCVGN 311
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
+ + T I + +T L IY+ + S + + P+
Sbjct: 312 SCL----------KQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKKIEGGPWE 361
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT 376
C+ T P VP I L + N + IH ++Q + T
Sbjct: 362 YCYE------TSFEPKVPAIKLKFSSNN-TFVIHKPLFVLQRSEGLVQFCLPISASEEGT 414
Query: 377 SIVIGARQLENNLLQFDLATSRLGFSDS 404
VIG + + FD +LG+S S
Sbjct: 415 GGVIGQNYMAGYRIVFDRENMKLGWSAS 442
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 96/402 (23%), Positives = 157/402 (39%), Gaps = 68/402 (16%)
Query: 27 FRPKALVL-----RVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------K 75
FRP A+ R+ +Y+ + TP VPV V G ++ W C K
Sbjct: 71 FRPTAMTSDGIQSRIVPSAG--EYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK 128
Query: 76 GYV-------SSTNKTARCGSAQC-NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVL 127
V SST + + CG++ C L ++C + + T G++ + L
Sbjct: 129 QVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETL 188
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCG--SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFS 185
++ ST G+ V+ P F F CG S + +SG+VG+ G +++L SQL + +
Sbjct: 189 TVDST----AGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGG---GELSLISQLKS--T 239
Query: 186 LKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLK-----YTPLFINKVNTASGFLGE 240
+ F+ CL P V D NF S + TPL + K
Sbjct: 240 INGLFSYCLLP------VSTDSSISSRINFGASGRVSGYGTVSTPL-VQK---------S 283
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFA 300
P Y++ + + V K +P + E G + + YT L Y L ++ A
Sbjct: 284 PDTFYYLTLEGISVGKKRLPYKGYSKKTEVE-EGNIIVDSGTTYTFLPQEFYSKLEKSVA 342
Query: 301 SAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
+++ P F C+ T P I ++ NV + N+ +++
Sbjct: 343 NSIKGKRVRDPNGIFSLCYN------TTAEINAPIITAHFKDANV--ELQPLNTFMRMQE 394
Query: 361 DALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGF 401
D +C V P + I V+G N L+ FDL R+ F
Sbjct: 395 DLVCF-----TVAPTSDIGVLGNLAQVNFLVGFDLRKKRVSF 431
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 81/378 (21%), Positives = 144/378 (38%), Gaps = 48/378 (12%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG-----------YVSSTNKTARCGSA 90
T Y + P P L V G ++ W+ C+ Y + NK C ++
Sbjct: 54 TGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKLVPCANS 113
Query: 91 QCNLANAKACGGGICGAGP--DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
C ++ + C D I T + + V+ S RN ++ P+ F
Sbjct: 114 ICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRN--KSNVRPSLSF 171
Query: 149 LCGSEFVL--QGLASGVV-GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
CG + + G A G+ GLGR V+L SQL K CLS G + FG
Sbjct: 172 GCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTSGGGFLFFG 231
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
D D+ + T +++ V + SG P G +++ + +++
Sbjct: 232 D---------DMVPTSRVT--WVSMVRSTSGNYYSP------GSATLYFDRRSLSTKPME 274
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
+ D+ G T YT Y+A + A ++ K + C++ Q
Sbjct: 275 VVFDS---GST-------YTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKGQK-A 323
Query: 326 FTRIGPVVPQIDLV--LQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGAR 383
F + V + + KN V I +N ++ +CLG +DG + +IG
Sbjct: 324 FKSVSDVKKDFKSLQFIFGKNAVMDIPPENYLIITKNGNVCLGILDGSAAKLSFSIIGDI 383
Query: 384 QLENNLLQFDLATSRLGF 401
+++ ++ +D ++LG+
Sbjct: 384 TMQDQMVIYDNEKAQLGW 401
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 96/399 (24%), Positives = 159/399 (39%), Gaps = 74/399 (18%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-------EKGYV-----------SST 81
T T Y T+I TP + V G +ILWV+C K + S +
Sbjct: 85 TETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQS 144
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQ--ST 132
+ C C +AN GG + +P + ++GD D L S
Sbjct: 145 GELVTCDQQFC-VANY---GGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSG 200
Query: 133 DGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV--GIAGLGRSKVALPSQLAAAFSLKRKF 190
DG+ +V F CG++ +S + GI G G+S ++ SQLAAA +++ F
Sbjct: 201 DGQTTPANASVS---FGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMF 257
Query: 191 ALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVT 250
A CL + G I + + N V +K TPL + + Y + +
Sbjct: 258 AHCLDTVNGGGI-------FAIGNV-VQPKVKTTPLVSDMPH------------YNVILK 297
Query: 251 SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA 310
+ V G A+ L + N GT I + + +YKAL ++ V
Sbjct: 298 GIDVGGTALGLPTNIFDSGNS--KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDIS-VQ 354
Query: 311 PVAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF 367
+ F +CF+ D GF P++ + +V + + + Q G + C+GF
Sbjct: 355 TLQDF-SCFQYSGSVDDGF-------PEVTFHFEG-DVSLIVSPHDYLFQNGKNLYCMGF 405
Query: 368 VDGGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
+GGV + +++G L N L+ +DL +G++D
Sbjct: 406 QNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQAIGWAD 444
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 70/286 (24%), Positives = 114/286 (39%), Gaps = 43/286 (15%)
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
R VP F F C + + +GIAG GR ++LPSQL L++ F+ C PF
Sbjct: 209 ARTRDVPRFSFGCVTSTYREP-----IGIAGFGRGLLSLPSQLGF---LEKGFSHCFLPF 260
Query: 198 D-------DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVT 250
++ G + +++ +L++TP+ + S ++G S+ +T
Sbjct: 261 KFVNNPNISSPLILGASAL----SINLTDSLQFTPMLNTPMYPNSYYIGLESITIGTNIT 316
Query: 251 SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--PKVAR 308
VPL TL D++G GG + + YT L Y L+ S + P+
Sbjct: 317 PTQ-----VPL--TLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITYPRATE 369
Query: 309 VAPVAPFGACFRL--QDIGFTRIGP----VVPQIDLVLQNKNVVWSIHGQN----SMVQI 358
F C+++ + T + + P I N + G + S
Sbjct: 370 TESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSD 429
Query: 359 GGDALCLGF---VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G CL F DG P + V G+ Q +N + +DL R+GF
Sbjct: 430 GSVVQCLLFQNMEDGDYGP--AGVFGSFQQQNVKVVYDLEKERIGF 473
>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
Length = 445
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 76/170 (44%), Gaps = 21/170 (12%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDDG----AIVFGDGPYYDLNNFDVS 218
GIAG GR +LP Q+ +KF+ CL FDD + GP D + D +
Sbjct: 235 GIAGFGRGPSSLPKQMGL-----KKFSYCLLSHRFDDSPKSSKMTLYVGP--DSKD-DKT 286
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
L YTP N V++ S F Y++ + + V K V + + + ++G GGT +
Sbjct: 287 GGLSYTPFRKNPVSSNSAF----KEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIV 342
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA---CFRLQDIG 325
+ + +T +E +++A+ F M R A V CF L +G
Sbjct: 343 DSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLSGVG 392
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 81/384 (21%), Positives = 145/384 (37%), Gaps = 68/384 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ + TP P + G +++W C+ SS+ T C S
Sbjct: 94 EYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 153
Query: 91 QCNLANAKACGGGIC----GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
C ++ C C G G + T G + + L+ S V++PN
Sbjct: 154 LCQALSSPTCSNNFCQYTYGYG-----DGSETQGSMGTETLTFGS---------VSIPNI 199
Query: 147 IFLCGSEFVLQGLASGV-VGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---I 202
F CG QG G G+ G+GR ++LPSQL KF+ C++P +
Sbjct: 200 TFGCGENN--QGFGQGNGAGLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSTPSNL 252
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
+ G L N + + T + +++ T Y+I + + V +P++
Sbjct: 253 LLG-----SLANSVTAGSPNTTLIQSSQIPTF----------YYITLNGLSVGSTRLPID 297
Query: 263 KTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
+ +++ N G GG I + T + Y+++ Q F S + + F CF+
Sbjct: 298 PSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQ- 356
Query: 322 QDIGFTRIGPVVPQI-DLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVI 380
T P QI V+ + +N + +CL G + + +
Sbjct: 357 -----TPSDPSNLQIPTFVMHFDGGDLELPSENYFISPSNGLICLAM---GSSSQGMSIF 408
Query: 381 GARQLENNLLQFDLATSRLGFSDS 404
G Q +N L+ +D S + F+ +
Sbjct: 409 GNIQQQNMLVVYDTGNSVVSFASA 432
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 126/305 (41%), Gaps = 44/305 (14%)
Query: 114 SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSK 173
S++G G+ D++S + P RAV F C + + GI GLGR +
Sbjct: 167 SSSGVLGE---DIVSFGTESELKPQRAV------FGCENSETGDLFSQHADGIMGLGRGQ 217
Query: 174 VALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKV 231
+++ QL + F++C D GA+V G P + YT N V
Sbjct: 218 LSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMP--------APPGMIYT--HSNAV 267
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
S Y I + +HV GKA+ ++ + +G GT + + Y L
Sbjct: 268 R---------SPYYNIELKEMHVAGKALRVDPRIF----DGKHGTVLDSGTTYAYLPEQA 314
Query: 292 YKALVQAFASAMPKVARVAPVAP--FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSI 349
+ A A +S + + ++ P CF +++ V P++D+V N + S+
Sbjct: 315 FVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKL-SL 373
Query: 350 HGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD---S 404
+N + + A CLG G +P T ++G + N L+ +D ++GF S
Sbjct: 374 SPENYLFRHSKVEGAYCLGVFQNGKDPTT--LLGGIVVRNTLVTYDRHNEKIGFWKTNCS 431
Query: 405 LLFER 409
L+ER
Sbjct: 432 ELWER 436
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 97/400 (24%), Positives = 159/400 (39%), Gaps = 76/400 (19%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-------EKGYV-----------SST 81
T T Y T+I TP + V G +ILWV+C K + S +
Sbjct: 85 TETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQS 144
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQ--ST 132
+ C C +AN GG + +P + ++GD D L S
Sbjct: 145 GELVTCDQQFC-VANY---GGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSG 200
Query: 133 DGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV--GIAGLGRSKVALPSQLAAAFSLKRKF 190
DG+ +V F CG++ +S + GI G G+S ++ SQLAAA +++ F
Sbjct: 201 DGQTTPANASVS---FGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMF 257
Query: 191 ALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV-EYFIGV 249
A CL + G I + + N V +K TPL P + Y + +
Sbjct: 258 AHCLDTVNGGGI-------FAIGNV-VQPKVKTTPLV-------------PDMPHYNVIL 296
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
+ V G A+ L + N GT I + + +YKAL ++ V
Sbjct: 297 KGIDVGGTALGLPTNIFDSGNS--KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDIS-V 353
Query: 310 APVAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
+ F +CF+ D GF P++ + +V + + + Q G + C+G
Sbjct: 354 QTLQDF-SCFQYSGSVDDGF-------PEVTFHFEG-DVSLIVSPHDYLFQNGKNLYCMG 404
Query: 367 FVDGGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
F +GGV + +++G L N L+ +DL +G++D
Sbjct: 405 FQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQAIGWAD 444
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 92/389 (23%), Positives = 150/389 (38%), Gaps = 64/389 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
Y T++ TP L V G + +V C + S T + +C + Q
Sbjct: 93 YTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKC-TWQ 151
Query: 92 CNLAN-AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
CN + K C A +++G G+ DV+S + +P RA IF C
Sbjct: 152 CNCDDDRKQCTYERRYAEMS---TSSGVLGE---DVVSFGNQSELSPQRA------IFGC 199
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF---GDG 207
++ GI GLGR +++ QL + F+LC G G
Sbjct: 200 ENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGIS 259
Query: 208 PYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS 267
P D+ V T S + P Y I + +HV GK + LN +
Sbjct: 260 PPADM------------------VFTHSDPVRSPY--YNIDLKEIHVAGKRLHLNPKVF- 298
Query: 268 IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF--GACFRLQDIG 325
+G GT + + Y L S + A A + R++ P CF +I
Sbjct: 299 ---DGKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEIN 355
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGAR 383
+++ P +++V N + + S+ +N + + A CLG G +P T ++G
Sbjct: 356 VSQLSKSFPVVEMVFGNGHKL-SLSPENYLFRHSKVRGAYCLGVFSNGNDPTT--LLGGI 412
Query: 384 QLENNLLQFDLATSRLGFSD---SLLFER 409
+ N L+ +D S++GF S L+ER
Sbjct: 413 VVRNTLVMYDREHSKIGFWKTNCSELWER 441
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 95/417 (22%), Positives = 155/417 (37%), Gaps = 82/417 (19%)
Query: 15 IFSLSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC- 73
+ PI A S PKA V + +Y+ + TP + G +++W C
Sbjct: 68 VHHFDPIAA-ASVSPKAAESDVTSNRG--EYLMSLSLGTPPFKIMGIADTGSDLIWTQCK 124
Query: 74 --EKGY----------VSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD 121
E+ Y S T + C + QC+L + C G IC + T G+
Sbjct: 125 PCERCYKQVDPLFDPKSSKTYRDFSCDARQCSLLDQSTCSGNICQYQYSYG-DRSYTMGN 183
Query: 122 IRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSE----FVLQGLASGVVGIAGLGRSKVALP 177
+ D +++ ST G V+ P + CG E F +G GI GLG ++L
Sbjct: 184 VASDTITLDST----TGSPVSFPKTVIGCGHENDGTFSDKG-----SGIVGLGAGPLSLI 234
Query: 178 SQLAAAFSLKRKFALCLSP------------FDDGAIVFGDGPYYDLNNFDVSKNLKYTP 225
SQ+ + S+ KF+ CL P F A+V G G ++ TP
Sbjct: 235 SQMGS--SVGGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGPG-------------VQSTP 279
Query: 226 LFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYT 285
L ++ T S F YF+ + ++ V + + + L G G I + T
Sbjct: 280 LLSSE--TMSSF-------YFLTLEAMSVGNERIKFGDSSL---GTGEGNIIIDSGTTLT 327
Query: 286 VLETSIYKALVQAFASAMPKVARVAPVAPFGACFR-LQDIGFTRIGPVVPQIDLVLQNKN 344
++ + L A + + P C+ D+ VP I +
Sbjct: 328 IVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSDLK-------VPAITAHFTGAD 380
Query: 345 VVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
V + N+ VQ+ D +CL F SI Q+ N L+++++ L F
Sbjct: 381 V--KLKPINTFVQVSDDVVCLAFA--STTSGISIYGNVAQM-NFLVEYNIQGKSLSF 432
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 96/387 (24%), Positives = 158/387 (40%), Gaps = 75/387 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC----------EKGYVSSTNKT---ARCGSA 90
+Y T++ TP + + + G +++W+ C ++ + S +K+ C S
Sbjct: 129 EYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSP 188
Query: 91 QCNLANAKACG--GGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C ++ C +C G G + T GD + L+ R
Sbjct: 189 LCRRLDSPGCSLKNNLCQYQVSYGDG-------SFTFGDFSTETLTF---------RRAA 232
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA- 201
VP CG + +GL G G+ GLGR ++ P+Q F+ KF+ CL+ A
Sbjct: 233 VPRVAIGCGHD--NEGLFVGAAGLLGLGRGGLSFPTQTGTRFN--NKFSYCLTDRTASAK 288
Query: 202 ---IVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGK 257
IVFGD VS+ ++TPL N K++T Y++ + + V G
Sbjct: 289 PSSIVFGDSA--------VSRTARFTPLVKNPKLDTF----------YYVELLGISVGGA 330
Query: 258 AVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
V ++ + +D+ G GG I + T L Y +L AF + R + F
Sbjct: 331 PVRGISASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFD 390
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNPR 375
C+ L + + VP + L + +V S+ N +V + + C F G
Sbjct: 391 TCYDLSGLSEVK----VPTVVLHFRGADV--SLPAANYLVPVDNSGSFCFAF--AGTMSG 442
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFS 402
SI IG Q + + FDLA SR+GF+
Sbjct: 443 LSI-IGNIQQQGFRVVFDLAGSRVGFA 468
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 136/372 (36%), Gaps = 37/372 (9%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+Y ++ +P L V G +++W+ C A A C G
Sbjct: 132 EYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVPCDSG 191
Query: 104 ICGAGPDNPISNTGTHGDIRIDV--------LSIQSTDGRNPGRAVTVPNFIFLCGSEFV 155
+C P S G R V + + + G + V CG
Sbjct: 192 VCRTLPGGS-SGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDSTPVQGVAIGCGHRN- 249
Query: 156 LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA--IVFGDGPYYDLN 213
+GL G G+ GLG ++L QL A + L D GA +VFG
Sbjct: 250 -RGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGAGSLVFG-------R 301
Query: 214 NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGV 273
+ + + PL N +PS Y++G+T + V G+ +PL L + +G
Sbjct: 302 DDAMPVGAVWVPLLRNAQ--------QPSF-YYVGLTGLGVGGERLPLQDGLFDLTEDGG 352
Query: 274 GGTKISTVNPYTVLETSIYKALVQAFASAM-PKVARVAPVAPFGACFRLQDIGFTRIGPV 332
GG + T T L Y AL AFAS + + R V+ C+ L R
Sbjct: 353 GGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVR---- 408
Query: 333 VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQF 392
VP + L ++ +N +V++GG CL F + ++G Q + +
Sbjct: 409 VPTVALYFGRDGAALTLPARNLLVEMGGGVYCLAFA---ASASGLSILGNIQQQGIQITV 465
Query: 393 DLATSRLGFSDS 404
D A +GF S
Sbjct: 466 DSANGYVGFGPS 477
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 100/403 (24%), Positives = 149/403 (36%), Gaps = 83/403 (20%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-------EKGYV-----------SST 81
T T Y TQI TP + V G +ILWV+C K + SS+
Sbjct: 76 TETGLYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSS 135
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDG 134
CG C + GG I P P + ++GD D L G
Sbjct: 136 GTGVTCGQDFCVATH----GGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSG 191
Query: 135 RNPGRAVTVPNFIFLCGSEF--VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
N + + F CG++ L + + GI G G+S ++ SQLAAA +++ FA
Sbjct: 192 -NSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAH 250
Query: 193 CLSPFDDGAI-VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTS 251
CL + G I GD V + TPL G P Y + + +
Sbjct: 251 CLDTINGGGIFAIGD---------VVQPKVSTTPLVP----------GMP--HYNVNLEA 289
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+ V G + L + I GT I + L +Y A+ M KV
Sbjct: 290 IDVGGVKLQLPTNIFDIGES--KGTIIDSGTTLAYLPGVVYNAI-------MSKVFAQYG 340
Query: 312 VAPFG-----ACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL 363
P CFR D GF P I + + +IH + + Q G+
Sbjct: 341 DMPLKNDQDFQCFRYSGSVDDGF-------PIITFHFEG-GLPLNIHPHDYLFQ-NGELY 391
Query: 364 CLGFVDGGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
C+GF GG+ + +++G N L+ +DL +G++D
Sbjct: 392 CMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTD 434
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/379 (22%), Positives = 144/379 (37%), Gaps = 63/379 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y ++I TP + L + G ++ W+ CE SST K+ C +
Sbjct: 161 EYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAP 220
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+L AC C ++GD V + +TD G + + N C
Sbjct: 221 QCSLLETSACRSNKCLYQV--------SYGDGSFTVGEL-ATDTVTFGNSGKINNVALGC 271
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL +G G+ GLG +++ +Q+ A F+ CL D G
Sbjct: 272 GHDN--EGLFTGAAGLLGLGGGVLSITNQMKAT-----SFSYCLVDRDSGK-----SSSL 319
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN 270
D N+ + PL NK + Y++G++ V G+ V L + +D
Sbjct: 320 DFNSVQLGGGDATAPLLRNK---------KIDTFYYVGLSGFSVGGEKVVLPDAIFDVDA 370
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR-VAPVAPFGACFRLQDIGFTRI 329
G GG + T L+T Y +L AF + + + ++ F C+ + ++
Sbjct: 371 SGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKV 430
Query: 330 GPVVPQI----DLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS--IVIGAR 383
V L L KN + + + C F P +S +IG
Sbjct: 431 PTVAFHFTGGKSLDLPAKNYLIPVDDSGT--------FCFAFA-----PTSSSLSIIGNV 477
Query: 384 QLENNLLQFDLATSRLGFS 402
Q + + +DL+ + +G S
Sbjct: 478 QQQGTRITYDLSKNVIGLS 496
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 103/249 (41%), Gaps = 31/249 (12%)
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--GAIVFGDGPYYDLNNFDVS 218
S G+ GLGR ++L SQL KF+ CL+P+ D GP LN+ +
Sbjct: 214 SSASGLVGLGRGSLSLVSQLGVP-----KFSYCLTPYQDTNSTSTLLLGPSASLND---T 265
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
+ TP + + S Y++ +T + + A+ + T LS+ +G GG I
Sbjct: 266 GGVSSTPFVASPSD------APMSTYYYLNLTGISLGTTALSIPTTALSLKADGTGGFII 319
Query: 279 STVNPYTVLETSIYKALVQAFAS--AMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQI 336
+ T+L + Y+ + A S +P + CF L T P +P +
Sbjct: 320 DSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGGSAATGLDLCFELPS--STSAPPTMPSM 377
Query: 337 DLVLQNKNVVWSIHGQNSMVQIGGDALCLGF---VDGGVNPRTSIVIGARQLENNLLQFD 393
L ++V +S + + + CL DGGV+ ++G Q +N + +D
Sbjct: 378 TLHFDGADMVLP---ADSYMMLDSNLWCLAMQNQTDGGVS-----ILGNYQQQNMHILYD 429
Query: 394 LATSRLGFS 402
+ L F+
Sbjct: 430 VGQETLTFA 438
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 158/373 (42%), Gaps = 58/373 (15%)
Query: 60 LTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSAQCNLANAKACGGGICG 106
L + G ++ W+ C + G V S++ K C +A C+L C
Sbjct: 186 LIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSK 245
Query: 107 AGPDNPI------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLA 160
P ++ T GD+ ++ LS+ +D +P ++ + + + CG +GL
Sbjct: 246 TSPKTCKYFYWYGDSSRTSGDLALESLSVSLSD--HPS-SLEIRDMVIGCGHSN--KGLF 300
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-----GAIVFGDGPYYDLNNF 215
G G+ GLG+ ++ PSQL ++ + + F+ CL + AI FG G + +F
Sbjct: 301 QGAGGLLGLGQGALSFPSQLRSS-PIGQSFSYCLVDRTNNLSVSSAISFGAG-FALSRHF 358
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
D +++TP F+ N+ F Y++G+ + ++ + +P+ +I G GG
Sbjct: 359 D---QMRFTP-FVRTNNSVETF-------YYLGIQGIKIDQELLPIPAERFAIAPNGSGG 407
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQ 335
T I + T L Y+A+ AF + + R P G C+ R P
Sbjct: 408 TIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYN----ATGRTAVPFPT 462
Query: 336 IDLVLQNKNVVWSIHGQNSMVQIGGDAL--CLGFV--DGGVNPRTSIVIGARQLENNLLQ 391
+ +V QN + +N +Q CL + DG +IG Q +N
Sbjct: 463 LSIVFQN-GAELDLPQENYFIQPDPQEAKHCLAILPTDGMS------IIGNFQQQNIHFL 515
Query: 392 FDLATSRLGFSDS 404
+D+ +RLGF+++
Sbjct: 516 YDVQHARLGFANT 528
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 144/383 (37%), Gaps = 60/383 (15%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV---------------SSTNKTAR 86
+L+YV + TP V + + G ++ WV C+ SST T
Sbjct: 122 SLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIP 181
Query: 87 CGSAQCNL----ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C S C C G P + +G I + ST+ G +
Sbjct: 182 CASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAI---TEGVYSTETLALGSSAV 238
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
V +F F CGS+ G G+ GLG + +L SQ A+ + F+ CL P + GA
Sbjct: 239 VKSFRFGCGSD--QHGPYDKFDGLLGLGGAPESLVSQTASVYG--GAFSYCLPPLNSGAG 294
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPL--FINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
G NN + +TP+ F K+ T Y + +T + V GKA+
Sbjct: 295 FLTLGAPNSTNNSN--SGFVFTPMHAFSPKIATF----------YVVTLTGISVGGKALD 342
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV-APFGACF 319
+ + + N GT I T + T+ YKAL AF SAM + + P + C+
Sbjct: 343 IPPAVFAKGNIVDSGTVI------TGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCY 396
Query: 320 RLQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI 378
FT G V VP++ L V +V+ CL F D G +
Sbjct: 397 N-----FTGHGTVTVPKVALTFVGGATVDLDVPSGVLVED-----CLAFADAGDG--SFG 444
Query: 379 VIGARQLENNLLQFDLATSRLGF 401
+IG + +D LGF
Sbjct: 445 IIGNVNTRTIEVLYDSGKGHLGF 467
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 128/305 (41%), Gaps = 44/305 (14%)
Query: 114 SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSK 173
S++G G+ D++S + P RAV F C + + GI GLGR +
Sbjct: 167 SSSGVLGE---DIVSFGTESELKPQRAV------FGCENSETGDLFSQHADGIMGLGRGQ 217
Query: 174 VALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKV 231
+++ QL + F++C D GA+V G P + YT N V
Sbjct: 218 LSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMP--------APPGMIYT--HSNAV 267
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
S Y I + +HV GKA+ ++ + +G GT + + Y L
Sbjct: 268 R---------SPYYNIELKEMHVAGKALRVDPRIF----DGKHGTVLDSGTTYAYLPEQA 314
Query: 292 YKALVQAFASAMPKVARV-APVAPF-GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSI 349
+ A A +S + + ++ P + + CF +++ V P++D+V N + S+
Sbjct: 315 FVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKL-SL 373
Query: 350 HGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD---S 404
+N + + A CLG G +P T ++G + N L+ +D ++GF S
Sbjct: 374 SPENYLFRHSKVEGAYCLGVFQNGKDPTT--LLGGIVVRNTLVTYDRHNEKIGFWKTNCS 431
Query: 405 LLFER 409
L+ER
Sbjct: 432 ELWER 436
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 93/397 (23%), Positives = 155/397 (39%), Gaps = 71/397 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------KGYV-------SSTNKTARCGSA 90
+Y+ + TP VPV V G ++ W C K V SST + + CG++
Sbjct: 91 EYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCGTS 150
Query: 91 QC-NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
C L N ++C G + + T G++ ++ L++ ST G+ V+ P F F
Sbjct: 151 FCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVAST----AGKPVSFPGFAFG 206
Query: 150 C--GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDG 207
C S + +SG+V GLG +++++ SQL + ++ +F+ CL P VF D
Sbjct: 207 CVHRSGGIFDEHSSGIV---GLGVAELSMISQLKS--TINGRFSYCLLP------VFTDS 255
Query: 208 PYYDLNNFDVSKNLK-----YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
NF S + TPL + G + Y I + V K +
Sbjct: 256 SMSSRINFGRSGIVSGAGTVSTPLVMK---------GPDTYYYLITLEGFSVGKKRLSYK 306
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+ E G + + YT L Y L ++ A ++ P C+
Sbjct: 307 GFSKKAEVEE-GNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNT- 364
Query: 323 DIGFTRIGPVVPQID---LVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI- 378
V QID + K+ + N+ +++ D +C V P + I
Sbjct: 365 ---------TVDQIDAPIITAHFKDANVELQPWNTFLRMQEDLVCF-----TVLPTSDIG 410
Query: 379 VIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFN 415
++G N L+ FDL R+ F+ A CT +
Sbjct: 411 ILGNLAQVNFLVGFDLRKKRVS------FKAADCTLH 441
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/390 (22%), Positives = 153/390 (39%), Gaps = 67/390 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLA--------- 95
Y T++K TP + + G ++LWV C +S N + Q L+
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSC-----TSCNGCPKTSELQIQLSFFDPGVSSS 138
Query: 96 ------NAKACGGGI---CGAGPDNPIS-------NTGTHGDIRIDVLSIQSTDGRNPGR 139
+ + C G P+N S +GT G D +S +
Sbjct: 139 ASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAI 198
Query: 140 AVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
+ P F+F C + LQ V GI GLG+ +++ SQLA R F+ CL
Sbjct: 199 NSSAP-FVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGD 257
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G + G + YTPL ++ + Y + + S+ VNG+
Sbjct: 258 KSGGGIMVLGQIK-------RPDTVYTPLVPSQPH------------YNVNLQSIAVNGQ 298
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+P++ ++ +I GT I T L Y +QA A+A+ + R +
Sbjct: 299 ILPIDPSVFTIATG--DGTIIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPITYESY-Q 355
Query: 318 CFRLQ--DIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVN 373
CF + D+ V P++ L ++V H + G ++ C+GF ++
Sbjct: 356 CFEITAGDVD------VFPEVSLSFAGGASMVLRPHAYLQIFSSSGSSIWCIGFQR--MS 407
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFSD 403
R ++G L++ ++ +DL R+G+++
Sbjct: 408 HRRITILGDLVLKDKVVVYDLVRQRIGWAE 437
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 88/398 (22%), Positives = 150/398 (37%), Gaps = 58/398 (14%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG-----------YVSSTNKTARCGSA 90
T Y + P P L + G ++ W+ C+ Y + NK C ++
Sbjct: 49 TGHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKPTKNKLVPCAAS 108
Query: 91 QCNLANAKACGGGICGAGP--DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
C ++ C D I T + + + V + RN + P+F F
Sbjct: 109 ICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRN--SSSVRPSFTF 166
Query: 149 LCGSEFVLQGLASGVV-----GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
CG + Q +GVV G+ GLG+ V+L SQL K CLS G +
Sbjct: 167 GCG--YDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLSTNGGGFLF 224
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FGD N + + P+ V + SG P G +++ + +++ +
Sbjct: 225 FGD-------NVVPTSRATWVPM----VRSTSGNYYSP------GSGTLYFDRRSLGVKP 267
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
+ D+ G T YT Y+A V A + + K + C++ Q
Sbjct: 268 MEVVFDS---GST-------YTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPLCWKGQK 317
Query: 324 IGFTRIGPVVPQIDLVLQN--KNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIG 381
+ F + V + + KN V I +N ++ CLG +DG T +IG
Sbjct: 318 V-FKSVSDVKNDFKSLFLSFVKNSVLEIPPENYLIVTKNGNACLGILDGSAAKLTFNIIG 376
Query: 382 ARQLENNLLQFDLATSRLGFSDSLLFERATCTFNFTSI 419
+++ L+ +D +LG+ R +C+ + SI
Sbjct: 377 DITMQDQLIIYDNERGQLGWI------RGSCSRSTKSI 408
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 142/384 (36%), Gaps = 71/384 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV + TP + G + WV C+ V SST C
Sbjct: 176 TGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSC 235
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPN 145
+ C+ + C GG C G G++ G +D L++ S D V
Sbjct: 236 AAPACSDLDVSGCSGGHCLYGVQY---GDGSYSIGFFAMDTLTLSSYD--------AVKG 284
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA--IV 203
F F CG GL G+ GLGR K +LP Q + FA CL G +
Sbjct: 285 FRFGCGERN--DGLFGEAAGLLGLGRGKTSLPVQTYGKYG--GVFAHCLPARSTGTGYLD 340
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG G TP+ G Y++G+T + V G+ +P+
Sbjct: 341 FGAGS---------PPATTTTPMLT----------GNGPTFYYVGMTGIRVGGRLLPIAP 381
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA--RVAPVAPFGACFRL 321
++ + GT + + T L + Y +L AFA+AM + A V+ C+
Sbjct: 382 SVFA-----AAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDF 436
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV---DGGVNPRTSI 378
G +++ +P + L+ Q + M + +CL F DGG
Sbjct: 437 --TGMSQV--AIPTVSLLFQG-GAALDVDASGIMYTVSASQVCLAFAGNEDGG----DVG 487
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
++G QL+ + +D+ +GFS
Sbjct: 488 IVGNTQLKTFGVAYDIGKKVVGFS 511
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/379 (22%), Positives = 144/379 (37%), Gaps = 63/379 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y ++I TP + L + G ++ W+ CE SST K+ C +
Sbjct: 161 EYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAP 220
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+L AC C ++GD V + +TD G + + N C
Sbjct: 221 QCSLLETSACRSNKCLYQV--------SYGDGSFTVGEL-ATDTVTFGNSGKINNVALGC 271
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL +G G+ GLG +++ +Q+ A F+ CL D G
Sbjct: 272 GHDN--EGLFTGAAGLLGLGGGVLSITNQMKAT-----SFSYCLVDRDSGK-----SSSL 319
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN 270
D N+ + PL NK + Y++G++ V G+ V L + +D
Sbjct: 320 DFNSVQLGGGDATAPLLRNK---------KIDTFYYVGLSGFSVGGEKVVLPDAIFDVDA 370
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR-VAPVAPFGACFRLQDIGFTRI 329
G GG + T L+T Y +L AF + + + ++ F C+ + ++
Sbjct: 371 SGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKV 430
Query: 330 GPVVPQI----DLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS--IVIGAR 383
V L L KN + + + C F P +S +IG
Sbjct: 431 PTVAFHFTGGKSLDLPAKNYLIPVDDSGT--------FCFAFA-----PTSSSLSIIGNV 477
Query: 384 QLENNLLQFDLATSRLGFS 402
Q + + +DL+ + +G S
Sbjct: 478 QQQGTRITYDLSKNVIGLS 496
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 142/384 (36%), Gaps = 71/384 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV + TP + G + WV C+ V SST C
Sbjct: 180 TGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSC 239
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPN 145
+ C+ + C GG C G G++ G +D L++ S D V
Sbjct: 240 AAPACSDLDVSGCSGGHCLYGVQY---GDGSYSIGFFAMDTLTLSSYD--------AVKG 288
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA--IV 203
F F CG GL G+ GLGR K +LP Q + FA CL G +
Sbjct: 289 FRFGCGERN--DGLFGEAAGLLGLGRGKTSLPVQTYGKYG--GVFAHCLPARSTGTGYLD 344
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG G TP+ G Y++G+T + V G+ +P+
Sbjct: 345 FGAGS---------PPATTTTPMLT----------GNGPTFYYVGMTGIRVGGRLLPIAP 385
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA--RVAPVAPFGACFRL 321
++ + GT + + T L + Y +L AFA+AM + A V+ C+
Sbjct: 386 SVFA-----AAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDF 440
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV---DGGVNPRTSI 378
G +++ +P + L+ Q + M + +CL F DGG
Sbjct: 441 --TGMSQV--AIPTVSLLFQG-GAALDVDASGIMYTVSASQVCLAFAGNEDGG----DVG 491
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
++G QL+ + +D+ +GFS
Sbjct: 492 IVGNTQLKTFGVAYDIGKKVVGFS 515
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 91/402 (22%), Positives = 148/402 (36%), Gaps = 49/402 (12%)
Query: 12 SLLIFSLSPITAQTSFRPKALVLRVAK--DTATLQYVTQIKQRTPLVPVKLTVHLGGNIL 69
S+ + P TA S K + L + T Y+ + TP + + G ++
Sbjct: 153 SIHRMTAGPWTAGQSSASKGVSLPAHRGLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLS 212
Query: 70 WVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVL-- 127
WV C+ +Q +A CG C S T + G R +V+
Sbjct: 213 WVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQEC------LDSGTCSSGKCRYEVVYG 266
Query: 128 SIQSTDGRNPGRAVTV-------PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQL 180
+ TDG +T+ F+F CG + GL G+ GLGR +V+L SQ
Sbjct: 267 DMSQTDGNLARDTLTLGPSSDQLQGFVFGCGDDDT--GLFGRADGLFGLGRDRVSLASQA 324
Query: 181 AAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGE 240
AA + + L S +G + G F TP F
Sbjct: 325 AARYGAGFSYCLPSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSF------------- 371
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFA 300
Y++ + + V G+ V + + GT I + T L + Y AL +FA
Sbjct: 372 ----YYLDLVGIKVAGRTVRVAPAVFKAP-----GTVIDSGTVITRLPSRAYSALRSSFA 422
Query: 301 SAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
M + R ++ C+ G T++ +P + L+ + G V
Sbjct: 423 GFMRRYKRAPALSILDTCYDF--TGRTKV--QIPSVALLFDGGATLNLGFGGVLYVANRS 478
Query: 361 DALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGF 401
A CL F G + TS+ ++G Q + + +DLA ++GF
Sbjct: 479 QA-CLAFASNGDD--TSVGILGNMQQKTFAVVYDLANQKIGF 517
>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/281 (22%), Positives = 111/281 (39%), Gaps = 36/281 (12%)
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA-AFSLKRKFALCL--SPFDDGA 201
NF F C + + +G+AG GR ++LP+QLA + L +F+ CL FD
Sbjct: 215 NFTFGCAHTTLAEP-----IGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDR 269
Query: 202 I------VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF--IGVTSVH 253
+ + G YD + + N P F+ + L P YF +G+ +
Sbjct: 270 VRRPSPLILGR---YDHDEKERRVNGVKKPSFV-----YTSMLDNPRHPYFYCVGLEGIS 321
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
+ K +P L +D +G GG + + +T+L S+Y +V F + + +V A V
Sbjct: 322 IGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVGRVNERASVI 381
Query: 314 P----FGACFRLQDIGFTRIGPVVPQI----DLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
C+ + V+ + +VL +N + CL
Sbjct: 382 EENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVVLPRRNYFYEFLDGGHGKGKKRKVGCL 441
Query: 366 GFVDGGVNPRTS----IVIGARQLENNLLQFDLATSRLGFS 402
++GG S +G Q + + +DL R+GF+
Sbjct: 442 MLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFA 482
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 83/380 (21%), Positives = 149/380 (39%), Gaps = 52/380 (13%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSAQCNLANA-- 97
TP V L + G ++ W+ C+ Y SST + C +C L ++
Sbjct: 179 TPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPRCQLVSSSD 238
Query: 98 --KACGGG--ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSE 153
+ C C D + T GD + ++ T + V + +F CG
Sbjct: 239 PLQHCKAENQTCPYFYDYA-DGSNTTGDFASETFTVNLTWPNGKEKFKQVVDVMFGCGH- 296
Query: 154 FVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP-FDDGAI----VFGDGP 208
+G G G+ GLGR ++ PSQ+ + + F+ CL+ F + ++ +FG+
Sbjct: 297 -WNKGFFYGASGLLGLGRGPISFPSQIQSIYG--HSFSYCLTDLFSNTSVSSKLIFGEDK 353
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
LNN NL +T L + F Y++ + S+ V G+ + +++
Sbjct: 354 EL-LNN----HNLNFTTLLAGEETPDETF-------YYLQIKSIMVGGEVLDISEQTWHW 401
Query: 269 DNE-----GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
+E GGT I + + T S Y + +AF + A C+ +
Sbjct: 402 SSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCY---N 458
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRTSIVIGA 382
+ + +P + + VW+ +N Q D +CL + + +I IG
Sbjct: 459 VSGAMMQVELPDFGIHFADGG-VWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTI-IGN 516
Query: 383 RQLENNLLQFDLATSRLGFS 402
+N + +D+ SRLG+S
Sbjct: 517 LLQQNFHILYDVKRSRLGYS 536
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/398 (21%), Positives = 157/398 (39%), Gaps = 68/398 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTAR---C 87
QY +++ TP L + G ++ W+ C Y S++ + R C
Sbjct: 26 QYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPC 85
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTD------G 134
+C A G C +P T + D + + +S++S G
Sbjct: 86 TDDECLFLPAPI--GSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAG 143
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
+ R + + N C E V G G+ GLG+ ++L +Q +L F+ CL
Sbjct: 144 NHKTRTIRIKNVALGCSRESVGASFL-GASGVLGLGQGPISLATQTRHT-ALGGIFSYCL 201
Query: 195 SPFDDGA-----IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
+ G+ +V G + + L +TP+ N A F Y++ V
Sbjct: 202 VDYLRGSNASSFLVMGRTRW---------RKLAHTPIVRNPA--AQSF-------YYVNV 243
Query: 250 TSVHVNGKAVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA--MPKV 306
T V V+GK V + + ID +G GT + + L Y ++ A ++ +P+
Sbjct: 244 TGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRA 303
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
+ F C+ + TR+ +P++ + Q V+ + N MV + + C+
Sbjct: 304 QEIP--EGFELCYNV-----TRMEKGMPKLGVEFQGGAVM-ELPWNNYMVLVAENVQCVA 355
Query: 367 FVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
+ S ++G +++ +++DLA +R+GF S
Sbjct: 356 -LQKVTTTNGSNILGNLLQQDHHIEYDLAKARIGFKWS 392
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 118/295 (40%), Gaps = 43/295 (14%)
Query: 125 DVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAF 184
D++S P RAV F C + + GI GLGR ++++ QL
Sbjct: 176 DIVSFGRESELKPQRAV------FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKG 229
Query: 185 SLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPS 242
+ F+LC D GA+V G P F S L+ S
Sbjct: 230 VISDSFSLCYGGMDIGGGAMVLGGVPAPSDMVFSHSDPLR-------------------S 270
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
Y I + +HV GKA+ ++ + + + GT + + Y L + A A S
Sbjct: 271 PYYNIELKEIHVAGKALRVDSRVFNSKH----GTVLDSGTTYAYLPEQAFVAFKDAVTSK 326
Query: 303 MPKVARVAPVAP--FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV---Q 357
+ + ++ P CF +++ V P +D+V N + S+ +N + +
Sbjct: 327 VHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKL-SLTPENYLFRHSK 385
Query: 358 IGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD---SLLFER 409
+ G A CLG G +P T ++G + N L+ +D ++GF S L+ER
Sbjct: 386 VDG-AYCLGVFQNGKDPTT--LLGGIIVRNTLVTYDRHNEKIGFWKTNCSELWER 437
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 82/370 (22%), Positives = 141/370 (38%), Gaps = 34/370 (9%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+++ ++ P V V G +++W C+ T + + + C G
Sbjct: 106 EFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSG 165
Query: 104 ICGAGPDNPISNTG-------THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVL 156
+C A P + + T+GD + +T+ ++ F CG E
Sbjct: 166 LCNALPRSNCNEDKDACEYLYTYGDYS-STRGLLATETFTFEDENSISGIGFGCGVENEG 224
Query: 157 QGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG----AIVFGDGPYYDL 212
G + G G+ GLGR ++L SQL + KF+ CL+ +D ++ G +
Sbjct: 225 DGFSQGS-GLVGLGRGPLSLISQLK-----ETKFSYCLTSIEDSEASSSLFIGSLASGIV 278
Query: 213 NNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEG 272
N S + + T N +PS Y++ + + V K + + K+ + +G
Sbjct: 279 NKTGASLDGEVTKTMSLLRNPD-----QPSF-YYLELQGITVGAKRLSVEKSTFELAEDG 332
Query: 273 VGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPV 332
GG I + T LE + +K L + F S M + CF+L D
Sbjct: 333 TGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAA---KNIA 389
Query: 333 VPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRTSIVIGARQLENNLLQ 391
VP+ ++ K + G+N MV LCL G + SI +Q N+L
Sbjct: 390 VPK--MIFHFKGADLELPGENYMVADSSTGVLCLAM---GSSNGMSIFGNVQQQNFNVLH 444
Query: 392 FDLATSRLGF 401
DL + F
Sbjct: 445 -DLEKETVSF 453
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/259 (22%), Positives = 112/259 (43%), Gaps = 30/259 (11%)
Query: 164 VGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDVSKNLK 222
G+ G+ R ++ +Q A R+FA C++ G ++ G + L
Sbjct: 186 TGLLGMNRGGLSFVTQTA-----TRRFAYCIAAGQGPGILLLGGNDTETPLTSPPQQQLN 240
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
YTPL +++ + Y + + + V + + K LL+ D+ G G T + +
Sbjct: 241 YTPLV--EISQPLPYFDR--AAYTVQLEGIRVGSALLAIPKHLLTPDHTGAGQTMVDSGT 296
Query: 283 PYTVLETSIYKALVQAFASAMPKV--ARVAPVAP--------FGACFRLQD--IGFTRIG 330
+T L Y AL FA+ + + +AP+ F ACFR + + G
Sbjct: 297 RFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACFRGTEARVSAAAAG 356
Query: 331 PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD-------ALCLGFVDGGVNPRTSIVIGAR 383
++P++ LVL+ VV + + + ++ G+ CL F + ++ VIG
Sbjct: 357 GLLPEVGLVLRGAEVVVA-GAEKLLYRVPGERRGEGEGVWCLTFGSSDMAGVSAYVIGHH 415
Query: 384 QLENNLLQFDLATSRLGFS 402
++ +++DL +RLGF+
Sbjct: 416 HQQDVWVEYDLRNARLGFA 434
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 92/395 (23%), Positives = 149/395 (37%), Gaps = 72/395 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTAR 86
T YV + TP + + G ++ WV C SST R
Sbjct: 82 TGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVR 141
Query: 87 CGSAQCNLANAKACGGGICGAGPDN---PIS-----NTGTHGDIRIDVLSIQSTDGRNPG 138
CG +C A C + P + P + T G + D L++ +T N
Sbjct: 142 CGEPECPRARQS------CSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNAS 195
Query: 139 R--AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP 196
+ +P F+F CG GL G+ GLGR KV+L SQ A + F+ CL
Sbjct: 196 ENNSNKLPGFVFGCGENNT--GLFGKADGLFGLGRGKVSLSSQAAGKYG--EGFSYCLPS 251
Query: 197 FDDGAIVFGDGPYYDLNN-FDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
A Y L + ++TP+ +N+ NT S Y++ + + V
Sbjct: 252 SSSNAHG-----YLSLGTPAPAPAHARFTPM-LNRSNTPS--------FYYVKLVGIRVA 297
Query: 256 GKAVPLNKTL------LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA-- 307
G+A+ ++ L +D+ GT I+ + P Y AL AF SAM K
Sbjct: 298 GRAIKVSSRPALWPAGLIVDS----GTVITRLAPRA------YSALRTAFLSAMGKYGYK 347
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF 367
R ++ C+ + +P + LV + S+ + CL F
Sbjct: 348 RAPRLSILDTCYDFTAHANATVS--IPAVALVFAGGATI-SVDFSGVLYVAKVAQACLAF 404
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
G N R++ ++G Q + +D+ ++GF+
Sbjct: 405 APNG-NGRSAGILGNTQQRTVAVVYDVGRQKIGFA 438
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 88/380 (23%), Positives = 158/380 (41%), Gaps = 56/380 (14%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG-----------YVSSTNKTAR---CGSA 90
Y +I TP + V G ++ W+ C+ + ST+KT + C S+
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSS 172
Query: 91 QCNLA-----NAKACGGGICGAGPDNPISNTG-THGDIRIDVLSIQSTDGRNPGRAVTVP 144
QC+ NA C +T + G + DVL++ ++ + G
Sbjct: 173 QCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAPSSG------ 226
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
F++ CG + QGL GI GL K+++ QL+ + F+ CL
Sbjct: 227 -FVYGCGQDN--QGLFGRSSGIIGLANDKISMLGQLSKKYG--NAFSYCLPSSFSAPNSS 281
Query: 205 GDGPYYDLNNFDVSKN-LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
+ + ++ + K+TPL N+ PS+ YF+ +T++ V GK + ++
Sbjct: 282 SLSGFLSIGASSLTSSPYKFTPLVKNQ--------KIPSL-YFLDLTTITVAGKPLGVSA 332
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP-VAPFGACFRLQ 322
+ ++ GT I T L ++Y AL ++F M K AP + CF+
Sbjct: 333 SSYNVPTIIDSGTVI------TRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFK-- 384
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGA 382
G + VP+I ++ + + NS+V+I CL + NP + +IG
Sbjct: 385 --GSVKEMSTVPEIQIIFRG-GAGLELKAHNSLVEIEKGTTCLA-IAASSNPIS--IIGN 438
Query: 383 RQLENNLLQFDLATSRLGFS 402
Q + + +D+A ++GF+
Sbjct: 439 YQQQTFKVAYDVANFKIGFA 458
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 141/366 (38%), Gaps = 39/366 (10%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+++ + TP + G +++W C+ V T + + + C
Sbjct: 96 EFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSD 155
Query: 104 ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT-----VPNFIFLCGSEFVLQG 158
+C A PIS+ + R ST G T V F CG + +
Sbjct: 156 LCVA---LPISSCSDGCEYRYSYGDHSSTQGVLATETFTFGDASVSKIGFGCGEDNRGRA 212
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVS 218
+ G G+ GLGR ++L SQL KF+ CL+ DD + + +
Sbjct: 213 YSQGA-GLVGLGRGPLSLISQLGVP-----KFSYCLTSIDDSKGI----STLLVGSEATV 262
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
K+ TPL N PS Y++ + + V +P+ K+ SI ++G GG I
Sbjct: 263 KSAIPTPLIQNP--------SRPSF-YYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLII 313
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPV-VPQID 337
+ T L+ + + AL + F S M + CF L G PV VPQ
Sbjct: 314 DSGTTITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDG----SPVEVPQ-- 367
Query: 338 LVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLAT 396
LV + V + +N +++ +CL G + SI G Q +N ++ DL
Sbjct: 368 LVFHFEGVDLKLPKENYIIEDSALRVICLTM---GSSSGMSI-FGNFQQQNIVVLHDLEK 423
Query: 397 SRLGFS 402
+ F+
Sbjct: 424 ETISFA 429
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 89/405 (21%), Positives = 154/405 (38%), Gaps = 64/405 (15%)
Query: 34 LRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------- 74
+ A D QY K TP L G ++ W+ C+
Sbjct: 1 MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 60
Query: 75 ---KGYVSSTNKTARCGSAQC-----NLANAKACGGGICGAGPDNPISNTGTH-GDIRID 125
+SS+ KT C + C +L + C + G D S+ T G +
Sbjct: 61 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 120
Query: 126 VLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFS 185
++++ +GR + + N + C F Q + G+ GLG SK + + A F
Sbjct: 121 TVTVELKEGRK----MKLHNVLIGCSESFQGQSFQAAD-GVMGLGYSKYSFAIKAAEKFG 175
Query: 186 LKRKFALCLSPF-----DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGE 240
KF+ CL + FG + + N+ YT L + VN+
Sbjct: 176 --GKFSYCLVDHLSHKNVSNYLTFGSSRSKEA----LLNNMTYTELVLGMVNSF------ 223
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFA 300
Y + + + + G + + + D +G GGT + + + T L Y+ ++ A
Sbjct: 224 ----YAVNMMGISIGGAMLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALR 277
Query: 301 SAMPKVARVA-PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG 359
++ K +V + P CF GF +VP++ + + ++ ++
Sbjct: 278 VSLLKFRKVEMDIGPLEYCF--NSTGFEE--SLVPRLVFHFAD-GAEFEPPVKSYVISAA 332
Query: 360 GDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
CLGFV P TS+V Q +N+L +FDL +LGF+ S
Sbjct: 333 DGVRCLGFVSVAW-PGTSVVGNIMQ-QNHLWEFDLGLKKLGFAPS 375
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/265 (22%), Positives = 108/265 (40%), Gaps = 32/265 (12%)
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GA 201
N F CG + G +G GI G+ +++ QL+ KF+ CL+PF D
Sbjct: 23 NLTFGCGK--LTNGTIAGASGIMGVSPGPLSVLKQLSIT-----KFSYCLTPFTDHKTSP 75
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
++FG DL + + ++ PL N V + Y++ + + + K + +
Sbjct: 76 VMFGA--MADLGKYKTTGKVQTIPLLKNPVE---------DIYYYVPMVGISIGSKRLDV 124
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
+ +L++ +G GGT + + L +K L +A M A + + CF L
Sbjct: 125 PEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAANRSIDDYPVCFEL 184
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV----DGGVNPRTS 377
G + G VP + L + S+ + + +CL + +G N
Sbjct: 185 PR-GMSMEGVQVPPLVLHFAG-DAEMSLPRDSYFQEPSPGMMCLAVMQAPFEGAPN---- 238
Query: 378 IVIGARQLENNLLQFDLATSRLGFS 402
VIG Q +N + +DL + ++
Sbjct: 239 -VIGNVQQQNMHVLYDLGNRKFSYA 262
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 94/399 (23%), Positives = 150/399 (37%), Gaps = 87/399 (21%)
Query: 53 TPLVPVKLTVHLGGNILWVDCE-----------KGYV---------SSTNKTARCGSAQC 92
TP V + + G N+LW+ C+ G V SST++ C S C
Sbjct: 70 TPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTVDLNIYSPNTSSTSEKVPCNSTLC 129
Query: 93 NLANAKACGGGICGAGPDNPI---SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
+ C P + + T T G I D+L + S D ++ +AV F
Sbjct: 130 SQTQRDRCPSDQSNC-PYQVVYLSNGTSTTGYIVQDLLHLISDDSQS--KAVDA-KITFG 185
Query: 150 CGSEFVLQGLASGVV-GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGP 208
CG L G G+ GLG S +++PS LA F++C SP G I FGD
Sbjct: 186 CGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFSPNGIGRISFGD-- 243
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFL----GEP-SVEYFIGVTSVHVNGKAVPLNK 263
K +T G G+P S Y I +T + G+A L
Sbjct: 244 ---------------------KGSTGQGETSFNQGQPRSSLYNISITQTSIGGQASDL-- 280
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
+I + G +T L Y + ++F + + R + PF C+ ++
Sbjct: 281 VYSAIFDSGTS---------FTYLNDPAYTLIAESFNKLVKETRRSSTQVPFDYCYDIRS 331
Query: 324 IGFTRI-----------GPVVPQIDLVLQNKNVVWSIHGQNSMVQI--GGDALCLGFVDG 370
+I P +P + LV+ + +++ +VQ+ G CLG +
Sbjct: 332 FISAQILPFSCAYANQTEPTIPAVTLVMSGGD-YFNVTDPIVLVQLADGSAVYCLGMIKS 390
Query: 371 G-VNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFE 408
G VN +IG + + + FD LG+ S ++
Sbjct: 391 GDVN-----IIGQNFMTGHRIVFDRERMILGWKPSNCYD 424
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 66/291 (22%), Positives = 122/291 (41%), Gaps = 25/291 (8%)
Query: 116 TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVA 175
+ T GD ++ ++ T V N +F CG +GL G G+ GLGR ++
Sbjct: 223 SNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGH--WNRGLFHGAAGLLGLGRGPLS 280
Query: 176 LPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTAS 235
SQL + + F+ CL + V + + + NL +T K N
Sbjct: 281 FSSQLQSLYG--HSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVD 338
Query: 236 GFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKAL 295
F Y++ + S+ V G+ + + + +I ++G GGT I + + Y+ +
Sbjct: 339 TF-------YYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFI 391
Query: 296 VQAFASAMP---KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQ 352
A V R P+ CF + I ++ P++ + + VW+ +
Sbjct: 392 KNKIAEKAKGKYPVYRDFPI--LDPCFNVSGIHNVQL----PELGIAFAD-GAVWNFPTE 444
Query: 353 NSMVQIGGDALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
NS + + D +CL + P+++ +IG Q +N + +D SRLG++
Sbjct: 445 NSFIWLNEDLVCLAMLG---TPKSAFSIIGNYQQQNFHILYDTKRSRLGYA 492
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/240 (24%), Positives = 105/240 (43%), Gaps = 27/240 (11%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNLKY 223
G+ G+ R ++ +QL KF+ C+S D G ++FGD + NL Y
Sbjct: 1127 GLMGMNRGSLSFVTQLGLP-----KFSYCISGRDSSGVLLFGD------LHLSWLGNLTY 1175
Query: 224 TPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNP 283
TPL +++T + V Y + + + V K +PL K++ + D+ G G T + +
Sbjct: 1176 TPLV--QISTPLPYFDR--VAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQ 1231
Query: 284 YTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR-LQDIGFTRIG----PVVPQIDL 338
+T L +Y AL F V +AP+ F+ D+ ++ P +P + L
Sbjct: 1232 FTFLLGPVYTALRNEFLEQTKGV--LAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVSL 1289
Query: 339 VLQNKNVVWS----IHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDL 394
+ + +V ++ M++ CL F + + + VIG +N ++FDL
Sbjct: 1290 MFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDL 1349
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/328 (22%), Positives = 120/328 (36%), Gaps = 56/328 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE----------------KGYVSSTNKTA 85
TL+YV + +P + ++ + G ++ WV CE SST
Sbjct: 132 TLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAF 191
Query: 86 RCGSAQC-NLANAKACGGGICGAGPDNPI-----SNTGTHGDIRIDVLSIQSTDGRNPGR 139
C +A C L ++ G C A + T G DVL++ +D
Sbjct: 192 NCSAAACAQLGDSGEANG--CDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSD------ 243
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
V F F C + G+ G+ GLG +L SQ AA + + L +P
Sbjct: 244 --VVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPATPASS 301
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
G + + TP+ +K + YF + + V GK +
Sbjct: 302 GFLTL---GAPASGGGGGASRFATTPMLRSK---------KVPTYYFAALEDIAVGGKKL 349
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
L+ ++ + + GT I+ + P + Y AL AF + M + AR P+ CF
Sbjct: 350 GLSPSVFAAGSLVDSGTVITRLPP------AAYAALSSAFRAGMTRYARAEPLGILDTCF 403
Query: 320 RLQDIGFTRIGPV-VPQIDLVLQNKNVV 346
FT + V +P + LV VV
Sbjct: 404 N-----FTGLDKVSIPTVALVFAGGAVV 426
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 98/409 (23%), Positives = 151/409 (36%), Gaps = 65/409 (15%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTATLQ---YVTQIKQRTPLVPVKLTVHLGGNILWVDCE 74
LS +TAQ K + +A L YV +++ TP + + + + W C
Sbjct: 70 LSSLTAQ-----KTVAAPIASGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCS 124
Query: 75 -----------KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIR 123
SST T C +C A +C G + + N GD
Sbjct: 125 GCIGCSSTTTFSAQNSSTFATLDCSKPECTQARGLSCPT----TGNVDCLFNQTYGGDST 180
Query: 124 IDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAA 183
+Q + P +PNF F C S G + G+ GLGR ++L SQ +
Sbjct: 181 FSATLVQDSLHLGPN---VIPNFSFGCISS--ASGSSIPPQGLMGLGRGPLSLISQSGSL 235
Query: 184 FSLKRKFALCLSPFDDGAIVFGDGPYY-----DLNNFDVSKNLKYTPLFINKVNTASGFL 238
+S F+ CL F YY L K ++ TPL N
Sbjct: 236 YS--GLFSYCLPSFKS---------YYFSGSLKLGPVGQPKAIRTTPLLHNP-------- 276
Query: 239 GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQA 298
PS+ Y++ +T + V VP++ LL+ D GT I + T +IY A+
Sbjct: 277 HRPSL-YYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIYTAVRDE 335
Query: 299 FASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI 358
F + +P+ F CF T P I L L ++ + +NS++
Sbjct: 336 FRKQV--GGSFSPLGAFDTCFA------TNNEVSAPAITLHLSGLDLKLPM--ENSLIHS 385
Query: 359 GGDAL-CLGFVDG-GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSL 405
+L CL VI Q +N+ + FD+ S+LG + L
Sbjct: 386 SAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIAREL 434
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 93/397 (23%), Positives = 143/397 (36%), Gaps = 90/397 (22%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSAQCNLANAKA 99
TP P + + G ++W C + SST + CG+ C
Sbjct: 51 TPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPTSN 110
Query: 100 CGGGICGAGPDNPISNTGTHGDIRID---VLSIQSTDGRNPGRAVTVPNFIFLCGSEFVL 156
C G +C + +IR+D L I T+ G A F + S+
Sbjct: 111 CSGDVC---------TYESTTNIRLDRHTTLGIVGTETFAIGTATASLAFGCVVASDI-- 159
Query: 157 QGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFD 216
G G GLGR+ P L A L KF+ CLSP G
Sbjct: 160 -DTMDGTSGFIGLGRT----PRSLVAQMKLT-KFSYCLSPRGTG---------------- 197
Query: 217 VSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGT 276
K + LF+ +G GE + TS + + LLS+D G T
Sbjct: 198 -----KSSRLFLGSSAKLAG--GESTSTAPFIKTSPDDDSH----HYYLLSLDAIRAGNT 246
Query: 277 KIS-----------TVNPYTVLETSIYKALVQAFASAMPKVAR---VAPVAPFGACFRLQ 322
I+ TV+P+++L S Y+A +A A+ A P PF CF+ +
Sbjct: 247 TIATAQSGGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFK-K 305
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG--GDALCLGFVDGGVNPRTSI-- 378
GF+R P + Q + ++ ++ +G D C + RT +
Sbjct: 306 AAGFSRA--TAPDLVFTFQGAAAL-TVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEG 362
Query: 379 --VIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
V+G+ Q E+ +DL ++L FE A C+
Sbjct: 363 VSVLGSLQQEDVHFLYDLK------KETLSFEPADCS 393
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 85/400 (21%), Positives = 154/400 (38%), Gaps = 54/400 (13%)
Query: 34 LRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------- 74
+ A D QY K TP L G ++ W+ C+
Sbjct: 72 MHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 131
Query: 75 ---KGYVSSTNKTARCGSAQC-----NLANAKACGGGICGAGPDNPISNTGTH-GDIRID 125
+SS+ KT C + C +L + C + G D S+ T G +
Sbjct: 132 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 191
Query: 126 VLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFS 185
++++ +GR + + N + C F Q + G+ GLG SK + + A F
Sbjct: 192 TVTVELKEGRK----MKLHNVLIGCSESFQGQSFQAAD-GVMGLGYSKYSFAIKAAEKFG 246
Query: 186 LKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
KF+ CL + +++N+ + + +N + LG + Y
Sbjct: 247 --GKFSYCL---------VDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFY 295
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
+ + + + G + + + D +G GGT + + + T L Y+ ++ A ++ K
Sbjct: 296 AVNMMGISIGGAMLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLK 353
Query: 306 VARVA-PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALC 364
+V + P CF GF +VP++ + + ++ ++ C
Sbjct: 354 FRKVEMDIGPLEYCF--NSTGFEE--SLVPRLVFHFAD-GAEFEPPVKSYVISAADGVRC 408
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
LGFV P TS+V Q +N+L +FDL +LGF+ S
Sbjct: 409 LGFVSVAW-PGTSVVGNIMQ-QNHLWEFDLGLKKLGFAPS 446
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 109/255 (42%), Gaps = 37/255 (14%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGD-GPYYDLNNFDVSKNL 221
GI GLGR +++ QL + F+LC D GA+V G P D+ F S +
Sbjct: 205 GIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVLGGISPPSDMA-FAYSDPV 263
Query: 222 KYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTV 281
+ S Y I + +HV GK +PLN + +G GT + +
Sbjct: 264 R-------------------SPYYNIDLKEIHVAGKRLPLNANVF----DGKHGTVLDSG 300
Query: 282 NPYTVLETSIYKALVQAFASAMPKVARVAPVAP--FGACFRLQDIGFTRIGPVVPQIDLV 339
Y L + + A A + + +++ P CF I +++ P +D+V
Sbjct: 301 TTYAYLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMV 360
Query: 340 LQNKNVVWSIHGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATS 397
+N +++ +N M + A CLG G N +T++ +G + N L+ +D +
Sbjct: 361 FENGQ-KYTLSPENYMFRHSKVRGAYCLGVFQNG-NDQTTL-LGGIIVRNTLVVYDREQT 417
Query: 398 RLGFSDS---LLFER 409
++GF + L+ER
Sbjct: 418 KIGFWKTNCAELWER 432
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 70/304 (23%), Positives = 125/304 (41%), Gaps = 55/304 (18%)
Query: 23 AQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------- 74
A+ R + +R + D L+YV + TP P+ + G +++W C+
Sbjct: 79 AREREREPGMAVRASGD---LEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQ 135
Query: 75 -----KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSI 129
+SS+ + RC C +C PD + ++GD L
Sbjct: 136 PDPLFSPRMSSSYEPMRCAGQLCGDILHHSC------VRPDT-CTYRYSYGD-GTTTLGY 187
Query: 130 QSTD----GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFS 185
+T+ + G +VP F CG+ V G + GI G GR ++L SQL+
Sbjct: 188 YATERFTFASSSGETQSVP-LGFGCGTMNV--GSLNNASGIVGFGRDPLSLVSQLSI--- 241
Query: 186 LKRKFALCLSPF---DDGAIVFG---DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLG 239
R+F+ CL+P+ + FG D YD D + ++ TP+ + N
Sbjct: 242 --RRFSYCLTPYASSRKSTLQFGSLADVGLYD----DATGPVQTTPILQSAQN------- 288
Query: 240 EPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF 299
P+ Y++ T V V + + + + ++ +G GG I + T+ ++ +V+AF
Sbjct: 289 -PTF-YYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAF 346
Query: 300 ASAM 303
S +
Sbjct: 347 RSQL 350
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 83/349 (23%), Positives = 142/349 (40%), Gaps = 61/349 (17%)
Query: 79 SSTNKTARCGSAQCNLAN-AKACGGGIC-------GAGPDNPISNTGTHGDIRIDVLSIQ 130
SS N C A C+L+ KA C GAG ++ T T +R+
Sbjct: 153 SSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAG--GVVTGTLTRDTLRVH----- 205
Query: 131 STDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
GRN G +P F F C + + +GIAG GR ++LPSQL L++ F
Sbjct: 206 ---GRNLGVTQEIPRFCFGCVASSYREP-----IGIAGFGRGALSLPSQLGF---LRKGF 254
Query: 191 ALCLSPFD-------DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV 243
+ C F ++ GD ++ + LK +P++ N
Sbjct: 255 SHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLK-SPMYPNY------------- 300
Query: 244 EYFIGVTSVHV-NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
Y++G+ ++ V N A + +L D+ G GG + + YT L Y ++ S
Sbjct: 301 -YYVGLEAITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSI 359
Query: 303 M--PKVARVAPVAPFGACFRLQDIGFTRI-GPVVPQIDL-VLQNKNVVWSIHGQNSMVQI 358
+ P+ + F C+++ + + G ++P I L N ++V S +
Sbjct: 360 INYPRATDMEMRTGFDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSA 419
Query: 359 GGDAL---CLGFV---DGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
++ CL F DG P + V+G+ Q ++ + +D+ R+GF
Sbjct: 420 PSNSTVVKCLLFQSMDDGDYGP--AGVLGSFQQQDVEVVYDMEKERIGF 466
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/384 (20%), Positives = 151/384 (39%), Gaps = 49/384 (12%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSA 90
+Y + TP L + G ++ W+ C Y SS+ K C
Sbjct: 191 EYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDP 250
Query: 91 QCNLANAKACGGGICGAGPDNPI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
+C+L ++ P ++ T GD ++ ++ T V N
Sbjct: 251 RCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVEN 310
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-----G 200
+F CG +GL G G+ GLGR ++ SQL + + F+ CL +
Sbjct: 311 VMFGCGH--WNRGLFHGAAGLLGLGRGPLSFSSQLQSLYG--HSFSYCLVDRNSDTNVSS 366
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
++FG+ DL N + +T L K N F Y++ + S+ V G+ +
Sbjct: 367 KLIFGEDK--DLLN---HPEVNFTSLVAGKENPVDTF-------YYVQIKSIMVGGEVLK 414
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
+ + + EG GGT + + + Y+ + AF + + C+
Sbjct: 415 IPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPCYN 474
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSI- 378
+ + + P+ ++ ++ VW+ +N +++ + + CL + PR+++
Sbjct: 475 VSGVEKMEL----PEFRILFED-GAVWNFPVENYFIKLEPEEIVCLAILG---TPRSALS 526
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
+IG Q +N + +D SRLG++
Sbjct: 527 IIGNYQQQNFHILYDTKKSRLGYA 550
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 159/391 (40%), Gaps = 84/391 (21%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---KGY----------VSSTNKTARCGSA 90
+Y T+I TP+ + + G +++W+ CE K Y +S++ T C SA
Sbjct: 196 EYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSA 255
Query: 91 QCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
C+ +A C GG C G G + T G ++L+ +T +V
Sbjct: 256 VCSYLDAYNCHGGGCLYKVSYGDG-------SYTIGSFATEMLTFGTT---------SVR 299
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
N CG + GL G G+ GLG ++ PSQL R F+ CL
Sbjct: 300 NVAIGCGHDNA--GLFVGAAGLLGLGAGLLSFPSQLGT--QTGRAFSYCL---------- 345
Query: 205 GDGPYYDLNNF-DVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNG---KA 258
++ F + S L++ P + + + L PS+ Y++ + S+ V G +
Sbjct: 346 -------VDRFSESSGTLEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDS 398
Query: 259 VPLNKTLLSIDN-EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
VP + + ID G GG + + T L+T +Y A+ AF + ++ + V+ F
Sbjct: 399 VPPD--VFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDT 456
Query: 318 CFRLQDIGFTRIGPVVPQI----DLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN 373
C+ L + + VV L+L KN + + + C F
Sbjct: 457 CYDLSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGT--------FCFAFA----- 503
Query: 374 PRTS--IVIGARQLENNLLQFDLATSRLGFS 402
P TS ++G Q + + FD A S +GF+
Sbjct: 504 PATSDLSIMGNIQQQGIRVSFDTANSLVGFA 534
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 80/378 (21%), Positives = 142/378 (37%), Gaps = 48/378 (12%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG-----------YVSSTNKTARCGSA 90
T Y + P P L V G ++ W+ C+ Y + NK C ++
Sbjct: 54 TGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKLVPCANS 113
Query: 91 QCNLANAKACGGGICGAGP--DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
C ++ + C D I T + + V S RN ++ P+ F
Sbjct: 114 ICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRN--KSNVRPSLSF 171
Query: 149 LCGSEFVL--QGLASGVV-GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
CG + + G A G+ GLGR V+L SQL K CLS G + FG
Sbjct: 172 GCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTSGGGFLFFG 231
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
D + + + + P+ V + SG P G +++ + +++
Sbjct: 232 D-------DMVPTSRVTWVPM----VRSTSGNYYSP------GSATLYFDRRSLSTKPME 274
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
+ D+ G T YT Y+A + A ++ K + C++ Q
Sbjct: 275 VVFDS---GST-------YTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKGQK-A 323
Query: 326 FTRIGPVVPQIDLV--LQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGAR 383
F + V + + KN V I +N ++ +CLG +DG + +IG
Sbjct: 324 FKSVSDVKKDFKSLQFIFGKNAVMEIPPENYLIVTKNGNVCLGILDGSAAKLSFSIIGDI 383
Query: 384 QLENNLLQFDLATSRLGF 401
+++ ++ +D ++LG+
Sbjct: 384 TMQDQMVIYDNEKAQLGW 401
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 70/284 (24%), Positives = 111/284 (39%), Gaps = 44/284 (15%)
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
T G P VP F C + G AS G+ GLGR +++L SQL KF
Sbjct: 133 TFGSTPAGHARVPGIAFGCST--ASSGFNASSASGLVGLGRGRLSLVSQLGVP-----KF 185
Query: 191 ALCLSPFDD--GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE---- 244
+ CL+P+ D GP LN + + TP F+ PS
Sbjct: 186 SYCLTPYQDTNSTSTLLLGPSASLNG---TAGVSSTP-----------FVASPSTAPMNT 231
Query: 245 -YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS-- 301
Y++ +T + + A+ + S++ +G GG I + T+L + Y+ + A S
Sbjct: 232 FYYLNLTGISLGTTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV 291
Query: 302 AMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD 361
+P A CF L T P +P + L ++V + + M+
Sbjct: 292 TLPTTDGSADTG-LDLCFMLPSS--TSAPPAMPSMTLHFNGADMV--LPADSYMMSDDSG 346
Query: 362 ALCLGF---VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
CL DG VN ++G Q +N + +D+ L F+
Sbjct: 347 LWCLAMQNQTDGEVN-----ILGNYQQQNMHILYDIGQETLSFA 385
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 70/304 (23%), Positives = 125/304 (41%), Gaps = 55/304 (18%)
Query: 23 AQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------- 74
A+ R + +R + D L+YV + TP P+ + G +++W C+
Sbjct: 79 AREREREPGMAVRASGD---LEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQ 135
Query: 75 -----KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSI 129
+SS+ + RC C +C PD + ++GD L
Sbjct: 136 PDPLFSPRMSSSYEPMRCAGQLCGDILHHSC------VRPDT-CTYRYSYGD-GTTTLGY 187
Query: 130 QSTD----GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFS 185
+T+ + G +VP F CG+ V G + GI G GR ++L SQL+
Sbjct: 188 YATERFTFASSSGETQSVP-LGFGCGTMNV--GSLNNASGIVGFGRDPLSLVSQLSI--- 241
Query: 186 LKRKFALCLSPF---DDGAIVFG---DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLG 239
R+F+ CL+P+ + FG D YD D + ++ TP+ + N
Sbjct: 242 --RRFSYCLTPYASSRKSTLQFGSLADVGLYD----DATGPVQTTPILQSAQN------- 288
Query: 240 EPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF 299
P+ Y++ T V V + + + + ++ +G GG I + T+ ++ +V+AF
Sbjct: 289 -PTF-YYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAF 346
Query: 300 ASAM 303
S +
Sbjct: 347 RSQL 350
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 70/284 (24%), Positives = 111/284 (39%), Gaps = 44/284 (15%)
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
T G P VP F C + G AS G+ GLGR +++L SQL KF
Sbjct: 193 TFGSTPAGHARVPGIAFGCST--ASSGFNASSASGLVGLGRGRLSLVSQLGVP-----KF 245
Query: 191 ALCLSPFDD--GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE---- 244
+ CL+P+ D GP LN + + TP F+ PS
Sbjct: 246 SYCLTPYQDTNSTSTLLLGPSASLNG---TAGVSSTP-----------FVASPSTAPMNT 291
Query: 245 -YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS-- 301
Y++ +T + + A+ + S++ +G GG I + T+L + Y+ + A S
Sbjct: 292 FYYLNLTGISLGTTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV 351
Query: 302 AMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD 361
+P A CF L T P +P + L ++V + + M+
Sbjct: 352 TLPTTDGSADTG-LDLCFMLPSS--TSAPPAMPSMTLHFNGADMV--LPADSYMMSDDSG 406
Query: 362 ALCLGF---VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
CL DG VN ++G Q +N + +D+ L F+
Sbjct: 407 LWCLAMQNQTDGEVN-----ILGNYQQQNMHILYDIGQETLSFA 445
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 73/278 (26%), Positives = 109/278 (39%), Gaps = 38/278 (13%)
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
++V NF+F CG +GL GV GI GLGRS +++ SQ F F+ CL D
Sbjct: 237 GISVSNFVFGCGRNN--KGLFGGVSGIMGLGRSNLSMISQTNTTFG--GVFSYCLPTTDS 292
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGK 257
GA G N + KNL TP+ + + P + Y + +T + V G
Sbjct: 293 GA----SGSLVIGNESSLFKNL--TPI------AYTSMVSNPQLSNFYVLNLTGIDVGGV 340
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
A+ G GG I + T L S+Y AL F ++
Sbjct: 341 AIQDTSF-------GNGGILIDSGTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDT 393
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALCLGFVDGGVNPRT 376
CF L I I P + + +N NV ++ + G +CL +
Sbjct: 394 CFNLTGIEEVSI----PTLSMHFEN-NVDLNVDAVGILYMPKDGSQVCLALASLS-DEND 447
Query: 377 SIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTF 414
+IG Q N + +D S++GF+ R C+F
Sbjct: 448 MAIIGNYQQRNQRVIYDAKQSKIGFA------REDCSF 479
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 97/431 (22%), Positives = 163/431 (37%), Gaps = 84/431 (19%)
Query: 10 LCSLLIFSLSPITAQTSFRPKALV---LRVAKDTATLQ---YVTQIKQRTPLVPVKLTVH 63
+ S+L P+T + +PK + +A L Y+ + TP + + +
Sbjct: 61 IVSMLTSGAGPLTTRAKPKPKNRANPPVPIAPGRQILSIPNYIARAGLGTPAQTLLVAID 120
Query: 64 LGGNILWVDCE------------KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDN 111
+ WV C SST +T CGS QC + +C G+ + N
Sbjct: 121 PSNDAAWVPCSACAGCAASSPSFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFN 180
Query: 112 PISNTGT-HGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLG 170
T + D L++++ V ++ F C V+ G + G+ G G
Sbjct: 181 LTYAASTFQAVLGQDSLALENN---------VVVSYTFGC--LRVVSGNSVPPQGLIGFG 229
Query: 171 RSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVS---------KNL 221
R ++ SQ + F+ CL P Y +NF + K +
Sbjct: 230 RGPLSFLSQTKDTYG--SVFSYCL-------------PNYRSSNFSGTLKLGPIGQPKRI 274
Query: 222 KYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTV 281
K TPL N PS+ Y++ + + V K V + ++ L+ + GT I
Sbjct: 275 KTTPLLYNP--------HRPSL-YYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAG 325
Query: 282 NPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ----DIGFTRIGPVVPQID 337
+T L +Y A+ AF + + P+ F C+ + + F G +
Sbjct: 326 TMFTRLAAPVYAAVRDAFRGRV-RTPVAPPLGGFDTCYNVTVSVPTVTFMFAG----AVA 380
Query: 338 LVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG---GVNPRTSIVIGARQLENNLLQFDL 394
+ L +NV+ IH + G CL G GVN + V+ + Q +N + FD+
Sbjct: 381 VTLPEENVM--IHSSS------GGVACLAMAAGPSDGVNAALN-VLASMQQQNQRVLFDV 431
Query: 395 ATSRLGFSDSL 405
A R+GFS L
Sbjct: 432 ANGRVGFSREL 442
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 145/374 (38%), Gaps = 74/374 (19%)
Query: 58 VKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQCNLANAKACGGGI 104
+ L V G ++ WV C+ VSS+ KT C S+ C A G
Sbjct: 98 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGP 157
Query: 105 CGAGP---DNPISNTGTHGD---IRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQG 158
CG P ++GD R D+ S G + NF+F CG +G
Sbjct: 158 CGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGD-----TKLENFVFGCGRNN--KG 210
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---IVFGDGPYYDLNNF 215
L G G+ GLGRS V+L SQ F+ F+ CL +DGA + FG+ D + +
Sbjct: 211 LFGGSSGLMGLGRSSVSLVSQTLKTFN--GVFSYCLPSLEDGASGSLSFGN----DSSVY 264
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNG---KAVPLNKTLLSIDN 270
S ++ YTPL N P + Y + +T + G K+ + +L ID+
Sbjct: 265 TNSTSVSYTPLVQN-----------PQLRSFYILNLTGASIGGVELKSSSFGRGIL-IDS 312
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL---QDIGFT 327
GT I+ + P SIYKA+ F + CF L +DI
Sbjct: 313 ----GTVITRLPP------SIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDIS-- 360
Query: 328 RIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLE 386
+P I ++ Q N + + G V+ +CL I IG Q +
Sbjct: 361 -----IPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGI-IGNYQQK 414
Query: 387 NNLLQFDLATSRLG 400
N + +D RLG
Sbjct: 415 NQRVIYDTTQERLG 428
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 145/374 (38%), Gaps = 74/374 (19%)
Query: 58 VKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQCNLANAKACGGGI 104
+ L V G ++ WV C+ VSS+ KT C S+ C A G
Sbjct: 146 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGP 205
Query: 105 CGAGP---DNPISNTGTHGD---IRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQG 158
CG P ++GD R D+ S G + NF+F CG +G
Sbjct: 206 CGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTK-----LENFVFGCGRNN--KG 258
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---IVFGDGPYYDLNNF 215
L G G+ GLGRS V+L SQ F+ F+ CL +DGA + FG+ D + +
Sbjct: 259 LFGGSSGLMGLGRSSVSLVSQTLKTFN--GVFSYCLPSLEDGASGSLSFGN----DSSVY 312
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNG---KAVPLNKTLLSIDN 270
S ++ YTPL N P + Y + +T + G K+ + +L ID+
Sbjct: 313 TNSTSVSYTPLVQN-----------PQLRSFYILNLTGASIGGVELKSSSFGRGIL-IDS 360
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL---QDIGFT 327
GT I+ + P SIYKA+ F + CF L +DI
Sbjct: 361 ----GTVITRLPP------SIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDIS-- 408
Query: 328 RIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLE 386
+P I ++ Q N + + G V+ +CL I IG Q +
Sbjct: 409 -----IPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGI-IGNYQQK 462
Query: 387 NNLLQFDLATSRLG 400
N + +D RLG
Sbjct: 463 NQRVIYDTTQERLG 476
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 98/431 (22%), Positives = 163/431 (37%), Gaps = 84/431 (19%)
Query: 10 LCSLLIFSLSPITAQTSFRPKALV---LRVAKDTATLQ---YVTQIKQRTPLVPVKLTVH 63
+ S+L P+T + +PK + +A L Y+ + TP + + +
Sbjct: 42 IVSMLTSGAGPLTTRAKPKPKNRANPPVPIAPGRQILSIPNYIARAGLGTPAQTLLVAID 101
Query: 64 LGGNILWVDCE------------KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDN 111
+ WV C SST +T CGS QC + +C G+ + N
Sbjct: 102 PSNDAAWVPCSACAGCAASSPSFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFN 161
Query: 112 PISNTGT-HGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLG 170
T + D L++++ V ++ F C V+ G + G+ G G
Sbjct: 162 LTYAASTFQAVLGQDSLALENN---------VVVSYTFGC--LRVVSGNSVPPQGLIGFG 210
Query: 171 RSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVS---------KNL 221
R ++ SQ + F+ CL P Y +NF + K +
Sbjct: 211 RGPLSFLSQTKDTYG--SVFSYCL-------------PNYRSSNFSGTLKLGPIGQPKRI 255
Query: 222 KYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTV 281
K TPL N PS+ Y++ + + V K V + ++ L+ + GT I
Sbjct: 256 KTTPLLYNP--------HRPSL-YYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAG 306
Query: 282 NPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ----DIGFTRIGPVVPQID 337
+T L +Y A+ AF + + P+ F C+ + + F G V
Sbjct: 307 TMFTRLAAPVYAAVRDAFRGRV-RTPVAPPLGGFDTCYNVTVSVPTVTFMFAGAVA---- 361
Query: 338 LVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG---GVNPRTSIVIGARQLENNLLQFDL 394
+ L +NV+ IH + G CL G GVN + V+ + Q +N + FD+
Sbjct: 362 VTLPEENVM--IHSSS------GGVACLAMAAGPSDGVNAALN-VLASMQQQNQRVLFDV 412
Query: 395 ATSRLGFSDSL 405
A R+GFS L
Sbjct: 413 ANGRVGFSREL 423
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 89/405 (21%), Positives = 154/405 (38%), Gaps = 64/405 (15%)
Query: 34 LRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------- 74
+ A D QY K TP L G ++ W+ C+
Sbjct: 72 MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 131
Query: 75 ---KGYVSSTNKTARCGSAQC-----NLANAKACGGGICGAGPDNPISNTGTH-GDIRID 125
+SS+ KT C + C +L + C + G D S+ T G +
Sbjct: 132 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 191
Query: 126 VLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFS 185
++++ +GR + + N + C F Q + G+ GLG SK + + A F
Sbjct: 192 TVTVELKEGRK----MKLHNVLIGCSESFQGQSFQAAD-GVMGLGYSKYSFAIKAAEKFG 246
Query: 186 LKRKFALCLSPFDDGA-----IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGE 240
KF+ CL + FG + + N+ YT L + VN+
Sbjct: 247 --GKFSYCLVDHLSHKNVSNYLTFGSSRSKEA----LLNNMTYTELVLGMVNSF------ 294
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFA 300
Y + + + + G + + + D +G GGT + + + T L Y+ ++ A
Sbjct: 295 ----YAVNMMGISIGGAMLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALR 348
Query: 301 SAMPKVARVA-PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG 359
++ K +V + P CF GF +VP++ + + ++ ++
Sbjct: 349 VSLLKFRKVEMDIGPLEYCF--NSTGFEE--SLVPRLVFHFAD-GAEFEPPVKSYVISAA 403
Query: 360 GDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
CLGFV P TS+V Q +N+L +FDL +LGF+ S
Sbjct: 404 DGVRCLGFVSVAW-PGTSVVGNIMQ-QNHLWEFDLGLKKLGFAPS 446
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 103/417 (24%), Positives = 159/417 (38%), Gaps = 70/417 (16%)
Query: 24 QTSFRPKALVLRV-----AKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KG 76
T R + +L V T+ Y QI P+ + V G +ILW C+ +G
Sbjct: 62 HTRARSRRFLLEVDLMLNGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQG 121
Query: 77 YVSSTNKTARCGSAQCN------------LANAKACGGGICGAGPDNPISNTGTHGDIRI 124
SS C S A+ C +C G +N DI
Sbjct: 122 -CSSKKNVIVCSSIIMQGPITLYDPELSITASPATCSDPLCSEGGSCRGNNNSCAYDISY 180
Query: 125 DVLS----IQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQL 180
+ S I D + G ++ +FL G + GL V GI G GRSKV++P+QL
Sbjct: 181 EDTSSSTGIYFRDVVHLGHKASLNTTMFL-GCATSISGLWP-VDGIMGFGRSKVSVPNQL 238
Query: 181 AAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGE 240
AA F CLS +G G L D + YTP+ N
Sbjct: 239 AAQAGSYNIFYHCLSGEKEG------GGILVLGKNDEFPEMVYTPMLAND---------- 282
Query: 241 PSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAF 299
+ Y + + S+ VN KA+P+ + + G GGT I + + KAL F
Sbjct: 283 --IVYNVKLVSLSVNSKALPIEASEFEYNATVGNGGTIIDSGTSSATFPS---KALA-LF 336
Query: 300 ASAMPKVARVAPVAPFGA----CF-RLQDIGFTRIG-PVV-------PQIDLVLQN--KN 344
A+ K P AP + CF + D + P V ++L N +
Sbjct: 337 VKAVSKFTTAIPTAPLESSGSPCFISISDRNSVEVDFPNVTLKFDGGATMELTAHNYLEA 396
Query: 345 VVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
VV +++ Q G +C+ + G S ++G L++ ++ +D+ SR+G+
Sbjct: 397 VVSRKLSESTHFQ-GVRLVCISWSVG-----NSTILGDAILKDKVVVYDMEKSRIGW 447
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 93/394 (23%), Positives = 152/394 (38%), Gaps = 81/394 (20%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV------------SSTNKTARCGSAQC 92
Y+ + TP P+ + + + W+ C G V SS+++T +C + QC
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCS-GCVGCSSSVLFDPSKSSSSRTLQCEAPQC 146
Query: 93 NLAN------AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
A +K+CG + G + + D L++ S +PN+
Sbjct: 147 KQAPNPSCTVSKSCGFNMTYGG-------STIEAYLTQDTLTLASD---------VIPNY 190
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGD 206
F C ++ G + G+ GLGR ++L SQ + + F+ CL
Sbjct: 191 TFGCINK--ASGTSLPAQGLMGLGRGPLSLISQSQNLY--QSTFSYCL------------ 234
Query: 207 GPYYDLNNFDVSKNL--KYTPLFINKVNTASGFLGEP--SVEYFIGVTSVHVNGKAVPLN 262
P +NF S L K P+ I + L P S Y++ + + V K V +
Sbjct: 235 -PNSKSSNFSGSLRLGPKNQPIRIK----TTPLLKNPRRSSLYYVNLVGIRVGNKIVDIP 289
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+ L+ D GT + YT L Y A+ F + K A + F C+
Sbjct: 290 TSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRV-KNANATSLGGFDTCYS-- 346
Query: 323 DIGFTRIGPVV-PQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALCLGFVDGGVNPRTSI-V 379
G VV P + + NV ++ N ++ G+ CL VN + + V
Sbjct: 347 -------GSVVFPSVTFMFAGMNV--TLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNV 397
Query: 380 IGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
I + Q +N+ + D+ SRLG S R TCT
Sbjct: 398 IASMQQQNHRVLIDVPNSRLGIS------RETCT 425
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 99/439 (22%), Positives = 155/439 (35%), Gaps = 83/439 (18%)
Query: 9 LLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQ--------------YVTQIKQRTP 54
L+ +LL+ L P+T R L +A D T + YV + TP
Sbjct: 4 LVSTLLLMCLIPLTRAHELRRG---LELADDATTARPGGVTVPVHFSQAFYVVNLTIGTP 60
Query: 55 LVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKA--------CGGGICG 106
PV + +GG ++W C + RC L + A CG +C
Sbjct: 61 PQPVSAIIDIGGELVWTQCAQ-------HCRRCFKQDLPLFDTNASSTFRPEPCGAAVCE 113
Query: 107 AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV-----PNFIFLCGSEFVLQGLAS 161
+ P + G S T GR AV + F C + +
Sbjct: 114 SIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATARLAFGCAVASEMDTMW- 172
Query: 162 GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG---AIVFGDGPYYDLNNFDVS 218
G G GLGR+ ++L +Q+ A F+ CL+P D G A+ G
Sbjct: 173 GSSGSVGLGRTNLSLAAQMNAT-----AFSYCLAPPDTGKSSALFLG----ASAKLAGAG 223
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
K TP SG S Y + + ++ + + ++ +I +
Sbjct: 224 KGAGTTPFVKTSTPPNSGL----SRSYLLRLEAIRAGNATIAMPQSGNTI--------TV 271
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL 338
ST P T L S+Y+ L +A A A+ PV + CF P + L
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGG-----APDLVL 326
Query: 339 VLQNKNVVWSIHGQNSMVQIGGDALCLGFVD----GGVNPRTSIVIGARQLENNLLQFDL 394
Q ++ + + G D C+ + GGV+ ++G+ Q N L FDL
Sbjct: 327 AFQG-GAEMTVPVSSYLFDAGNDTACVAILGSPALGGVS-----ILGSLQQVNIHLLFDL 380
Query: 395 ATSRLGFSDSLLFERATCT 413
++L FE A C+
Sbjct: 381 D------KETLSFEPADCS 393
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 92/386 (23%), Positives = 153/386 (39%), Gaps = 59/386 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYVSSTNKTARCGSAQCNLA 95
++ I TP + V G ++ WV C E G + K++ S C+
Sbjct: 84 EFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSR 143
Query: 96 NAKACGGGICGAGPDNPI--------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFI 147
N +A G N I + + GD+ + +SI S G V+ P +
Sbjct: 144 NCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSAS----GSPVSFPGTV 199
Query: 148 FLCGSEFVLQG--LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL---SPFDDGAI 202
F CG G GI GLG ++L SQL + S+ +KF+ CL S +G
Sbjct: 200 FGCGYN---NGGTFDETGSGIIGLGGGHLSLISQLGS--SISKKFSYCLSHKSATTNGTS 254
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
V G ++ + TPL ++K EP Y++ + ++ V K +P
Sbjct: 255 VINLGTNSIPSSLSKDSGVVSTPL-VDK---------EPLTYYYLTLEAISVGKKKIPYT 304
Query: 263 KTLLSIDNEGV-----GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA-PVAPFG 316
+ + +++G+ G I + T+LE + A ++ RV+ P
Sbjct: 305 GSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLS 364
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT 376
CF+ G IG +P+I + +V S N+ V++ D +CL V P T
Sbjct: 365 HCFK---SGSAEIG--LPEITVHFTGADVRLS--PINAFVKLSEDMVCLSMV-----PTT 412
Query: 377 SIVI-GARQLENNLLQFDLATSRLGF 401
+ I G + L+ +DL T + F
Sbjct: 413 EVAIYGNFAQMDFLVGYDLETRTVSF 438
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 96/386 (24%), Positives = 153/386 (39%), Gaps = 71/386 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE----KGYV------------SSTNKTA 85
TL++V + TP P L G ++ WV C+ G+ SST
Sbjct: 146 TLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAV 205
Query: 86 RCGSAQCNLANAKACGGGICGAGPDNPI--------SNTGTHGDIRIDVLSIQSTDGRNP 137
CG QC A GG+C DN + T G + D L++ S+
Sbjct: 206 HCGEPQC------AAAGGLCSE--DNTTCLYLVHYGDGSSTTGVLSRDTLALTSS----- 252
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
RA+ F F CG+ + G V G+ GLGR +++LPSQ AA+F + L S
Sbjct: 253 -RALA--GFPFGCGTRNL--GDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS 307
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G + G P D + L+ P F PS YF+ + S+ + G
Sbjct: 308 TTGYLTIGATPATDTGAAQYTAMLR-KPQF-------------PSF-YFVELVSIDIGGY 352
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+P+ + + GGT + + T L Y+ L F M + P A
Sbjct: 353 ILPVPPAVFT-----RGGTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDA 407
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF--VDGGVNPR 375
C+ G + + +VP + + V+ + M+ + + CL F +D G P
Sbjct: 408 CYDFA--GESEV--IVPAVSFRFGD-GAVFELDFFGVMIFLDENVGCLAFAAMDAGGLPL 462
Query: 376 TSIVIGARQLENNLLQFDLATSRLGF 401
+ +IG Q + + +D+A ++GF
Sbjct: 463 S--IIGNTQQRSAEVIYDVAAEKIGF 486
>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 55/258 (21%), Positives = 110/258 (42%), Gaps = 32/258 (12%)
Query: 160 ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSK 219
+ G+ G+ R ++ +Q A +FA C++P I+ + +
Sbjct: 195 SEAATGLLGMNRGSLSFVTQTA-----TLRFAYCIAPGQGPGIL------LLGGDGGAAP 243
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
L YTPL +++ + V Y + + + V + + K++L+ D+ G G T +
Sbjct: 244 PLNYTPLI--EISQPLPYFDR--VAYSVQLEGIRVGSALLQIPKSVLTPDHTGAGQTMVD 299
Query: 280 TVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP--------FGACFRLQDIGFTRIGP 331
+ +T L Y AL F + + +AP+ F ACFR + +
Sbjct: 300 SGTQFTFLLADAYAALKAEFLNQARSL--LAPLGEPGFVFQGAFDACFRGPEERVSAASR 357
Query: 332 VVPQIDLVLQNKNV-------VWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQ 384
++P++ LVL+ V ++S+ G+ + CL F + + ++ VIG
Sbjct: 358 LLPEVGLVLRGAEVAVAGEKLLYSVPGERRGEEGAEAVWCLTFGNSDMAGMSAYVIGHHH 417
Query: 385 LENNLLQFDLATSRLGFS 402
++ +++DL R+GF+
Sbjct: 418 QQDVWVEYDLQNGRVGFA 435
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/399 (21%), Positives = 156/399 (39%), Gaps = 79/399 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSA 90
+Y + TP L + G ++ W+ C + SS+ + C
Sbjct: 89 EYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDP 148
Query: 91 QCNLANAKACGGGICGAGPDNPI----------------SNTGTHGDIRIDVLSIQSTDG 134
+C+L ++ PD P+ ++ T GD + ++ T
Sbjct: 149 RCHLVSS-----------PDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSP 197
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
V N +F CG +GL G G+ GLGR ++ SQL + + F+ CL
Sbjct: 198 TGKSEFKRVENVMFGCGH--WNRGLFHGASGLLGLGRGPLSFSSQLQSLYG--HSFSYCL 253
Query: 195 SPFDD-----GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
+ ++FG+ DL N L +T L K N F Y++ +
Sbjct: 254 VDRNSDTNVSSKLIFGEDK--DLLN---HPELNFTTLVGGKENPVDTF-------YYVQI 301
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
S+ V G+ + + ++ ++ ++GVGGT + + + Y+ + AF + V
Sbjct: 302 KSIMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIV 361
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDL----VLQNKNVVWSIHGQNSMVQIG-GDALC 364
C+ + V +IDL +L VW+ +N +++ + +C
Sbjct: 362 QDFPILDPCYNVSG---------VEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVC 412
Query: 365 LGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
L + PR+++ +IG Q +N + +D SRLG++
Sbjct: 413 LAILG---TPRSALSIIGNYQQQNFHVLYDTKKSRLGYA 448
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/392 (23%), Positives = 152/392 (38%), Gaps = 77/392 (19%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV------------SSTNKTARCGSAQC 92
Y+ + TP P+ + + + W+ C G V SS+++T +C + QC
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCS-GCVGCSSSVLFDPSKSSSSRTLQCEAPQC 146
Query: 93 NLAN------AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
A +K+CG + G + + D L++ S +PN+
Sbjct: 147 KQAPNPSCTVSKSCGFNMTYGG-------STIEAYLTQDTLTLASD---------VIPNY 190
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGD 206
F C ++ G + G+ GLGR ++L SQ + + F+ CL
Sbjct: 191 TFGCINK--ASGTSLPAQGLMGLGRGPLSLISQSQNLY--QSTFSYCL------------ 234
Query: 207 GPYYDLNNFDVSKNL--KYTPLFINKVNTASGFLGEP--SVEYFIGVTSVHVNGKAVPLN 262
P +NF S L K P+ I + L P S Y++ + + V K V +
Sbjct: 235 -PNSKSSNFSGSLRLGPKNQPIRIK----TTPLLKNPRRSSLYYVNLVGIRVGNKIVDIP 289
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+ L+ D GT + YT L Y A+ F + K A + F C+
Sbjct: 290 TSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRV-KNANATSLGGFDTCYS-G 347
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI-VIG 381
+ F + + +++ L N++ IH G+ CL VN + + VI
Sbjct: 348 SVVFPSVTFMFAGMNVTLPPDNLL--IHSS------AGNLSCLAMAAAPVNVNSVLNVIA 399
Query: 382 ARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
+ Q +N+ + D+ SRLG S R TCT
Sbjct: 400 SMQQQNHRVLIDVPNSRLGIS------RETCT 425
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 90/392 (22%), Positives = 144/392 (36%), Gaps = 72/392 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
Y+ + TP P +HL G +W C SST + CG+A
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTAL 87
Query: 92 CNLANAKACGG-GICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C A C G G+C + +T I TD G A + F C
Sbjct: 88 CESVPASTCSGDGVCSYEVETMFGDTS----------GIGGTDTFAIGTATA--SLAFGC 135
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF----DDGAIVFGD 206
+ ++ L G G+ GLGR+ +L Q+ A F+ CL+P A++ G
Sbjct: 136 AMDSNIKQLL-GASGVVGLGRTPWSLVGQMNA-----TAFSYCLAPHGAAGKKSALLLGA 189
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK--AVPLNKT 264
K+ TPL VNT+ + S +Y I + + A P N +
Sbjct: 190 S-----AKLAGGKSAATTPL----VNTS-----DDSSDYMIHLEGIKFGDVIIAPPPNGS 235
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
++ +D T+ + L + ++A+ +A A+ P PF CF +
Sbjct: 236 VVLVD----------TIFGVSFLVDAAFQAIKKAVTVAVGAAPMATPTKPFDLCFP-KAA 284
Query: 325 GFTRIGPVVPQIDLVLQNKN-VVWSIHGQNSMVQIGGDALCLGFVDGGV-NPRTSI-VIG 381
+P D+VL + ++ M G +CL + + N T + ++G
Sbjct: 285 AAAGANSSLPLPDVVLTFQGAAALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILG 344
Query: 382 ARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
EN FDL ++L FE A C+
Sbjct: 345 RLHQENIHFLFDLD------KETLSFEPADCS 370
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 82/360 (22%), Positives = 136/360 (37%), Gaps = 34/360 (9%)
Query: 54 PLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPI 113
P V V G +++W C+ T + + + C G+C A P +
Sbjct: 8 PAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNC 67
Query: 114 SNTG-------THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGI 166
+ T+GD + +T+ ++ F CG E G + G G+
Sbjct: 68 NEDKDACEYLYTYGDYS-STRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGS-GL 125
Query: 167 AGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG----AIVFGDGPYYDLNNFDVSKNLK 222
GLGR ++L SQL + KF+ CL+ +D ++ G +N S + +
Sbjct: 126 VGLGRGPLSLISQLK-----ETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGE 180
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
T N +PS Y++ + + V K + + K+ + +G GG I +
Sbjct: 181 VTKTMSLLRNPD-----QPSF-YYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGT 234
Query: 283 PYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQN 342
T LE + +K L + F S M + CF+L D VP+ ++
Sbjct: 235 TITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAA---KNIAVPK--MIFHF 289
Query: 343 KNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
K + G+N MV LCL G + SI +Q N+L DL + F
Sbjct: 290 KGADLELPGENYMVADSSTGVLCLAM---GSSNGMSIFGNVQQQNFNVLH-DLEKETVSF 345
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 69/159 (43%), Gaps = 32/159 (20%)
Query: 62 VHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQCNLANAKACGGGICGAG 108
+ G ++ WV CE K SS+ ++ C S+ C G C +
Sbjct: 160 IDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESN 219
Query: 109 PDN--PISNTG----THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASG 162
P N N G T+G++ + LS ++V NF+F CG +GL G
Sbjct: 220 PSNCSYAVNYGDGSYTNGELGAEHLSF---------GGISVSNFVFGCGKNN--KGLFGG 268
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
V G+ GLGRS ++L SQ + F F+ CL P D GA
Sbjct: 269 VSGLMGLGRSNLSLISQTNSTFG--GVFSYCLPPTDAGA 305
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 85/188 (45%), Gaps = 35/188 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
+Y+ TP V + G NI+W+ C+ SS+ K C S+
Sbjct: 88 EYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCTSS 147
Query: 91 QCNLANAK--AC--GGGICGAGPDNPISNTG---THGDIRIDVLSIQSTDGRNPGRAVTV 143
C N +C GG +C + I+ G + GD+ D L++ ST G +V
Sbjct: 148 TCKDTNDTHISCSNGGDVC----EYSITYGGDAKSQGDLSNDSLTLDSTS----GSSVLF 199
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA-- 201
PN + CG VLQ S G+ G+GR ++L Q+ ++ S+ KF+ CL P++ +
Sbjct: 200 PNIVIGCGHINVLQD-NSQSSGVVGMGRGPMSLIKQVGSS-SVGSKFSYCLIPYNSDSNS 257
Query: 202 ---IVFGD 206
++FG+
Sbjct: 258 SSKLIFGE 265
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 99/439 (22%), Positives = 155/439 (35%), Gaps = 83/439 (18%)
Query: 9 LLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQ--------------YVTQIKQRTP 54
L+ +LL+ L P+T R L +A D T + YV + TP
Sbjct: 4 LVSTLLLMCLIPLTRAHELRRG---LELADDATTARPGGVTVPVHFSQAFYVVNLTIGTP 60
Query: 55 LVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKA--------CGGGICG 106
PV + +GG ++W C + RC L + A CG +C
Sbjct: 61 PQPVSAIIDIGGELVWTQCAQ-------HCRRCFKQDLPLFDTNASSTFRPEPCGAAVCE 113
Query: 107 AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV-----PNFIFLCGSEFVLQGLAS 161
+ P + G S T GR AV + F C + +
Sbjct: 114 SIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATARLAFGCAVASEMDTMW- 172
Query: 162 GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG---AIVFGDGPYYDLNNFDVS 218
G G GLGR+ ++L +Q+ A F+ CL+P D G A+ G
Sbjct: 173 GSSGSVGLGRTNLSLAAQMNAT-----AFSYCLAPPDTGKSSALFLG----ASAKLAGAG 223
Query: 219 KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
K TP SG S Y + + ++ + + ++ +I +
Sbjct: 224 KGAGTTPFVKTSTPPHSGL----SRSYLLRLEAIRAGNATIAMPQSGNTI--------MV 271
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL 338
ST P T L S+Y+ L +A A A+ PV + CF P + L
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGG-----APDLVL 326
Query: 339 VLQNKNVVWSIHGQNSMVQIGGDALCLGFVD----GGVNPRTSIVIGARQLENNLLQFDL 394
Q ++ + + G D C+ + GGV+ ++G+ Q N L FDL
Sbjct: 327 AFQG-GAEMTVPVSSYLFDAGNDTACVAILGSPALGGVS-----ILGSLQQVNIHLLFDL 380
Query: 395 ATSRLGFSDSLLFERATCT 413
++L FE A C+
Sbjct: 381 D------KETLSFEPADCS 393
>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 86/409 (21%), Positives = 155/409 (37%), Gaps = 81/409 (19%)
Query: 57 PVKLTVHLGGNILWVDCE-------KGYVSSTN------------------KTARCGSAQ 91
P+ L + G +++W C+ +G +T+ K++ C +A
Sbjct: 92 PIFLYLDTGSDLVWFPCQPFECILCEGKAENTSLASTPPPKLSKTATPVSCKSSACSAAH 151
Query: 92 CNLANAKACGGGICGAGPDNPISNTG-----------THGD------IRIDVLSIQSTDG 134
NL ++ C C P I + +GD + D +S+ ++
Sbjct: 152 SNLPSSDLCAISNC---PLESIETSDCQKHSCPQFYYAYGDGSLIARLYRDSISLPLSNP 208
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA-AFSLKRKFALC 193
N + V NF F C + + +G+AG GR ++LP+QLA + L +F+ C
Sbjct: 209 TN----LIVNNFTFGCAHTALAE-----PIGVAGFGRGVLSLPAQLATLSPQLGNQFSYC 259
Query: 194 L--SPFDD------GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
L FD ++ G YD + + N P F+ + L P Y
Sbjct: 260 LVSHSFDSDRLRRPSPLILGR---YDHDEKERRVNGVNKPRFV--YTSMLDNLEHPYF-Y 313
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
+G+ + + K +P L +D EG GG + + +T+L S+Y ++V F + + +
Sbjct: 314 CVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGR 373
Query: 306 VARVAPVAP----FGACFRLQDIGFTRIGPVVPQI----DLVLQNKNVVWSIHGQNSMVQ 357
V A V C+ + V+ + +VL +N +
Sbjct: 374 VNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKG 433
Query: 358 IGGDALCLGFVDGGVNPRTS----IVIGARQLENNLLQFDLATSRLGFS 402
CL ++GG S +G Q + + +DL R+GF+
Sbjct: 434 KKRKVGCLMLMNGGEEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFA 482
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 96/381 (25%), Positives = 146/381 (38%), Gaps = 56/381 (14%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTAR 86
+LQYV + TP VP L + G ++ WV C+ SST
Sbjct: 119 SLQYVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVP 178
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNTGTH---GDIRIDVLSIQSTDGRNPGRAVTV 143
CGS C + + G + + G GD + V S + T +P A V
Sbjct: 179 CGSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTE-TLTLSPEAATVV 237
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
NF F CG V +G+ G+ GLG + +L SQ + F+ CL + A
Sbjct: 238 NNFSFGCG--LVQKGVFDLFDGLLGLGGAPESLVSQTTGTYG--GAFSYCLPAGNSTAGF 293
Query: 204 FGDG-PYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
G P NN + ++TPL + + + Y + +T + V GK + +
Sbjct: 294 LALGAPATGGNN---TAGFQFTPLQVVE-----------TTFYLVKLTGISVGGKQLDIE 339
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP--VAPFGACFR 320
T+ + GG I + T L + Y AL AF SAM + P C+
Sbjct: 340 PTVFA------GGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYD 393
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVI 380
G T + VP + L + V + + ++ G CL FV G + T I+
Sbjct: 394 F--TGNTNV--TVPTVALTFEG-GVTIDLDVPSGVLLDG----CLAFVAGASDGDTGIIG 444
Query: 381 GARQLENNLLQFDLATSRLGF 401
Q +L +D A +GF
Sbjct: 445 NVNQRTFEVL-YDSARGHVGF 464
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 101/408 (24%), Positives = 146/408 (35%), Gaps = 81/408 (19%)
Query: 29 PKALVLRVAKDTATLQ---YVTQIKQRTPLVPVKLTVHLGGNILWVDCE----------K 75
PKA + +A L YV ++K TP + + + + WV C
Sbjct: 80 PKATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCSSPTFS 139
Query: 76 GYVSSTNKTARCGSAQCNLANAKAC---GGGICGAGPDNPISNTGTHGDIRIDVLSIQST 132
SST + +C QC +C G C N GD + Q +
Sbjct: 140 PNTSSTYASLQCSVPQCTQVRGLSCPTTGTAAC-------FFNQTYGGDSSFSAMLSQDS 192
Query: 133 DGRNPGRAVTVPNFIFLC-----GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
G T+P++ F C GS QGL GLGR ++L SQ + +S
Sbjct: 193 LGL---AVDTLPSYSFGCVNAVSGSTLPPQGLL-------GLGRGPMSLLSQSGSLYS-- 240
Query: 188 RKFALCLSPFDDGAIVFGDGPYY-----DLNNFDVSKNLKYTPLFINKVNTASGFLGEPS 242
F+ C F YY L KN++ TPL N P+
Sbjct: 241 GVFSYCFPSFKS---------YYFSGSLRLGPLGQPKNIRTTPLLRNP--------HRPT 283
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
+ Y++ +T V V VP+ LL+ D GT I + T +Y A+ F
Sbjct: 284 L-YYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQ 342
Query: 303 MPKVARVAPVAPFGACFRLQDIGFTRIGPVV----PQIDLVLQNKNVVWSIHGQNSMVQI 358
+ A + F CF + I P V +DL L +N + IH
Sbjct: 343 VK--GPFATIGAFDTCFAATN---EDIAPPVTFHFTGMDLKLPLENTL--IHSS------ 389
Query: 359 GGDALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFSDSL 405
G CL N + + VI Q +N + FD+ SRLG + L
Sbjct: 390 AGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIAREL 437
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 91/399 (22%), Positives = 146/399 (36%), Gaps = 83/399 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ + TP PV+L + G ++ W C S T C
Sbjct: 84 EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLR 143
Query: 91 QCNLANAKACG-----GGIC---GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C +CG GIC A D+ I T G + D S S D G +
Sbjct: 144 ICRDLTWSSCGEQSWGNGICVYAYAYADHSI----TTGHLDSDTFSFASADHAIGG--AS 197
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
VP+ F CG F S GIAG R +++P+QL F+ C +
Sbjct: 198 VPDLTFGCG-LFNNGIFVSNETGIAGFSRGALSMPAQLKV-----DNFSYCFT------A 245
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFI----NKVNTASG-------------FLGEPSVEY 245
+ G P +P+F+ N + A+G + Y
Sbjct: 246 ITGSEP---------------SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAY 290
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
+I + V V +P+ +++ ++ +G GGT + + T+L ++Y + AF +
Sbjct: 291 YISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKL 350
Query: 306 VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---GGDA 362
+ + CF + P VP LVL + + +N M +I GG
Sbjct: 351 TVHNSTSSLSQLCFSVP----PGAKPDVPA--LVLHFEGATLDLPRENYMFEIEEAGGIR 404
Query: 363 LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
L ++ G + VIG Q +N + +DLA L F
Sbjct: 405 LTCLAINAGEDLS---VIGNFQQQNMHVLYDLANDMLSF 440
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 83/342 (24%), Positives = 134/342 (39%), Gaps = 51/342 (14%)
Query: 79 SSTNKTARCGSAQCNLAN-AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNP 137
S T + +C + QCN N K C A + + G + DV+S + +P
Sbjct: 140 SETYQPVKC-TWQCNCDNDRKQCTYERRYA------EMSTSSGALGEDVVSFGNQTELSP 192
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
RA IF C ++ GI GLGR +++ QL + F+LC
Sbjct: 193 QRA------IFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGM 246
Query: 198 DDGAIVF---GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
G G P D+ V T S + P Y I + +HV
Sbjct: 247 GVGGGAMVLGGISPPADM------------------VFTRSDPVRSPY--YNIDLKEIHV 286
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
GK + LN + +G GT + + Y L S + A A + R++ P
Sbjct: 287 AGKRLHLNPKVF----DGKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDP 342
Query: 315 F--GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG--DALCLGFVDG 370
CF +I ++I P +++V N + + S+ +N + + A CLG
Sbjct: 343 RYNDICFSGAEIDVSQISKSFPVVEMVFGNGHKL-SLSPENYLFRHSKVRGAYCLGVFSN 401
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSD---SLLFER 409
G +P T ++G + N L+ +D +++GF S L+ER
Sbjct: 402 GNDPTT--LLGGIVVRNTLVMYDREHTKIGFWKTNCSELWER 441
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 85/383 (22%), Positives = 144/383 (37%), Gaps = 88/383 (22%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---KGY----------VSSTNKTARCGSA 90
+Y+ TP V V G +++W+ CE + Y +SS+ + C S
Sbjct: 87 EYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSSSYQNIPCLSD 146
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C+ +C G + ++ L++ ST G +V+ P + C
Sbjct: 147 TCHSMRTTSCD----------------VRGYLSVETLTLDSTTGY----SVSFPKTMIGC 186
Query: 151 G--SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP----------FD 198
G + G +SG+VG LG ++LPSQL S+ KF+ CL P F
Sbjct: 187 GYRNTGTFHGPSSGIVG---LGSGPMSLPSQLGT--SIGGKFSYCLGPWLPNSTSKLNFG 241
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
D AIV+GDG TP I K + SG Y++ + + V K
Sbjct: 242 DAAIVYGDG-------------AMTTP--IVKKDAQSG--------YYLTLEAFSVGNKL 278
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
+ NE G I + +T L +Y A A + P F C
Sbjct: 279 IEFGGPTYG-GNE--GNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLC 335
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI 378
+ + GF P I + ++ ++ ++ +++ CL F+ P +
Sbjct: 336 YNVAYHGFE-----APLITAHFKGADI--KLYYISTFIKVSDGIACLAFI-----PSQTA 383
Query: 379 VIGARQLENNLLQFDLATSRLGF 401
+ G +N L+ ++L + + F
Sbjct: 384 IFGNVAQQNLLVGYNLVQNTVTF 406
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 92/390 (23%), Positives = 151/390 (38%), Gaps = 63/390 (16%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSS---TNKTAR------CGSA 90
+ T QY +++ TP+ L G ++ WV C KT+R C S
Sbjct: 111 SGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGRVFRPKTSRSWAPIPCSSD 170
Query: 91 QCN------LANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRN----PGRA 140
C LAN + P +P + + + I T+ G+
Sbjct: 171 TCKLDVPFTLANCSS---------PASPCTYDYRYKEGSAGARGIVGTESATIALPGGKV 221
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC--LSPFD 198
+ + + C S Q S G+ LG +K++ +Q AA F + L L+P +
Sbjct: 222 AQLKDVVLGCSSSHDGQSFRS-ADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRN 280
Query: 199 -DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G + FG G + TP K+ FL Y + V ++HV GK
Sbjct: 281 ATGYLAFGPG------------QVPRTPATQTKL-----FLDPEMPFYGVKVDAIHVAGK 323
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
A+ + + + GG + + N TVL YKA+V A + + V +V+ PF
Sbjct: 324 ALDIPAEVWDAKS---GGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKVS-FPPFEH 379
Query: 318 CFRLQDIGFTRIGPVVPQI--DLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
C+ +T P P+I L +Q + ++ ++ + C+G V G P
Sbjct: 380 CYN-----WTARRPGAPEIIPKLAVQFAGSARLEPPAKSYVIDVKPGVKCIG-VQEGEWP 433
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFSDS 404
S VIG + +L +FDL ++ F S
Sbjct: 434 GLS-VIGNIMQQEHLWEFDLKNMQVRFKQS 462
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 145/374 (38%), Gaps = 74/374 (19%)
Query: 58 VKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQCNLANAKACGGGI 104
+ L V G ++ WV C+ VSS+ KT C S+ C A G
Sbjct: 146 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGP 205
Query: 105 CGAGP---DNPISNTGTHGD---IRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQG 158
CG P ++GD R D+ S G + NF+F CG +G
Sbjct: 206 CGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGD-----TKLENFVFGCGRNN--KG 258
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---IVFGDGPYYDLNNF 215
L G G+ GLGRS V+L SQ F+ F+ CL +DGA + FG+ D + +
Sbjct: 259 LFGGSSGLMGLGRSSVSLVSQTLKTFN--GVFSYCLPSLEDGASGSLSFGN----DSSVY 312
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNG---KAVPLNKTLLSIDN 270
S ++ YTPL N P + Y + +T + G K+ + +L ID+
Sbjct: 313 TNSTSVSYTPLVQN-----------PQLRSFYILNLTGASIGGVELKSSSFGRGIL-IDS 360
Query: 271 EGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL---QDIGFT 327
GT I+ + P SIYKA+ F + CF L +DI
Sbjct: 361 ----GTVITRLPP------SIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDIS-- 408
Query: 328 RIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLE 386
+P I ++ Q N + + G V+ +CL I IG Q +
Sbjct: 409 -----IPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGI-IGNYQQK 462
Query: 387 NNLLQFDLATSRLG 400
N + +D RLG
Sbjct: 463 NQRVIYDSTQERLG 476
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 92/417 (22%), Positives = 155/417 (37%), Gaps = 87/417 (20%)
Query: 33 VLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------KGYVSSTNK 83
+ + D L Y T I TP V + + G ++ WV C+ Y S
Sbjct: 87 TMSLGNDFGWLHY-TWIDIGTPHVSFMVALDSGSDLFWVPCDCVQCAPLSASHYSSLDRD 145
Query: 84 TARCGSAQCNLANAKACGGGICGAGPD--NP-----------ISNTGTHGDIRIDVLSIQ 130
+ +Q + + +C +C GP+ NP +T + G + D++ +
Sbjct: 146 LSEYSPSQSSTSKQLSCSHRLCDMGPNCKNPKQSCPYSINYYTESTSSSGLLVEDIIHLA 205
Query: 131 STDGRNPGRAVTVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
S +V P I CG S L G+A G+ GLG ++++PS LA A ++
Sbjct: 206 SGGDDTLNTSVKAP-VIIGCGMKQSGGYLDGVAPD--GLLGLGLQEISVPSFLAKAGLIQ 262
Query: 188 RKFALCLSPFDDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
F++C + D G I FGD GP + + N YT Y
Sbjct: 263 NSFSMCFNEDDSGRIFFGDQGPATQQSAPFLKLNGNYT-------------------TYI 303
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
+GV V + + +D+ GT +T L +++ + + F + +
Sbjct: 304 VGVEVCCVGTSCLKQSSFSALVDS----GTS------FTFLPDDVFEMIAEEFDTQVNAS 353
Query: 307 ARVAPVAPFGACFRL--QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALC 364
+ C++ QD+ P +P + L+ N S QN + I G
Sbjct: 354 RSSFEGYSWKYCYKTSSQDL------PKIPSLRLIFPQNN---SFMVQNPVFMIYGIQGV 404
Query: 365 LGF------VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFN 415
+GF DG + IG + + FD +LG+S R+ C F+
Sbjct: 405 IGFCLAIQPADGDIG-----TIGQNFMMGYRVVFDRENLKLGWS------RSNCEFS 450
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 121/282 (42%), Gaps = 48/282 (17%)
Query: 143 VPNFIFLCGSEFVLQGLA-SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-- 199
+ F+F CG+ QG G G+ GLGRS V+L SQ F F+ CL P +
Sbjct: 230 IEGFVFGCGTSN--QGAPFGGTSGLMGLGRSHVSLVSQTMDQFG--GVFSYCL-PMRESG 284
Query: 200 --GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G++V GD D S TP+ + + SG L P YF+ +T + V G+
Sbjct: 285 SSGSLVLGD---------DSSAYRNSTPIVYTAMVSDSGPLQGPF--YFLNLTGITVGGQ 333
Query: 258 AVP---LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
V + + ID+ GT I+T+ P S+Y A+ F S + + + +
Sbjct: 334 EVESPWFSAGRVIIDS----GTIITTLVP------SVYNAVRAEFLSQLAEYPQAPAFSI 383
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA--LCLGFVDGGV 372
CF L + + VP + V + +V + + + + DA +CL
Sbjct: 384 LDTCFNLTGLKEVQ----VPSLKFVFEG-SVEVEVDSKGVLYFVSSDASQVCLALASLKS 438
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTF 414
TSI IG Q +N + FD S++GF+ + TC +
Sbjct: 439 EYDTSI-IGNYQQKNLRVIFDTLGSQIGFA------QETCDY 473
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 85/375 (22%), Positives = 152/375 (40%), Gaps = 48/375 (12%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSAQCNLANA-- 97
TP V L + G ++ W+ C+ Y SS+ + C +C L ++
Sbjct: 178 TPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPRCQLVSSPD 237
Query: 98 --KACG--GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSE 153
+ C C D + T GD ++ ++ T + V + +F CG
Sbjct: 238 PLQHCKTENQTCPYFYDY-ADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVMFGCGH- 295
Query: 154 FVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP-FDDGAI----VFGDGP 208
+G G G+ GLGR ++ PSQL + + F+ CL+ F + ++ +FG+
Sbjct: 296 -WNKGFFHGAGGLLGLGRGPLSFPSQLQSIYG--HSFSYCLTDLFSNTSVSSKLIFGEDK 352
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
LN+ NL +T L + F Y++ + S+ V G+ + + +
Sbjct: 353 EL-LNH----HNLNFTKLLAGEETPDDTF-------YYLQIKSIVVGGEVLDIPEKTWHW 400
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTR 328
+EGVGGT I + + T S Y + +AF + A C+ + G +
Sbjct: 401 SSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVS--GAMQ 458
Query: 329 IGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRTSIVIGARQLEN 387
+ +P + + VW+ +N Q D +CL + + +I IG +N
Sbjct: 459 VE--LPDYGIHFAD-GAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTI-IGNLLQQN 514
Query: 388 NLLQFDLATSRLGFS 402
+ +D+ SRLG+S
Sbjct: 515 FHILYDVKRSRLGYS 529
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 94/406 (23%), Positives = 151/406 (37%), Gaps = 80/406 (19%)
Query: 33 VLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------ 74
+ D L Y T I TP + + G ++LWV C+
Sbjct: 89 TMSFGNDFGWLHY-TWIDIGTPSTSFLVALDAGSDLLWVPCDCIHCAPLSASFYSNLDRD 147
Query: 75 -KGYVSS---TNKTARCGSAQCNL-ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSI 129
Y S ++K C C++ +N K C + NT + G + D+ +
Sbjct: 148 LNEYSPSRSLSSKHLSCSHRLCDMGSNCKTSKQQQCPYTINYLSDNTSSSGLLVEDIFHL 207
Query: 130 QSTDGRNPGRAVTVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
QS DG +V P + CG S L G A G+ GLG + ++PS LA + +
Sbjct: 208 QSGDGSTSNSSVQAP-VVVGCGMKQSGGYLDGTAPD--GLIGLGPGESSVPSFLAKSGLI 264
Query: 187 KRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
+ F+LC + D G + FGD S + TP + G Y
Sbjct: 265 RDSFSLCFNEDDSGRLFFGD---------QGSTVQQSTPFLL-----VDGMFS----TYI 306
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
+GV + + G + P + S + + GT +T L Y A+ + F +
Sbjct: 307 VGVETCCI-GNSCP---KVTSFNAQFDSGTS------FTFLPGHAYGAIAEEFDKQVNAT 356
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKN--VVWSIHGQN----SMVQIGG 360
+P+ C+ + ++ P +P + L+ Q N VV+ N S + G
Sbjct: 357 RSTFQGSPWEYCY----VPSSQQLPKIPTLTLMFQQNNSFVVY-----NPVFVSYNEQGV 407
Query: 361 DALCLGF--VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
D CL +GG+ IG + L FD +L +S S
Sbjct: 408 DGFCLAIQPTEGGMG-----TIGQNFMTGYRLVFDRENKKLAWSHS 448
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 84/382 (21%), Positives = 155/382 (40%), Gaps = 71/382 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y +++ +P V + V G ++ WV C + SS+ C +
Sbjct: 154 EYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETH 213
Query: 91 QCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
QC + C C G G + T GD + +++ DG + ++
Sbjct: 214 QCKSLDVSECRNDSCLYEVSYGDG-------SYTVGDFATETITL---DG-----SASLN 258
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIV 203
N CG + +GL G G+ GLG ++ PSQ+ A+ F+ CL D D A
Sbjct: 259 NVAIGCGHDN--EGLFVGAAGLLGLGGGSLSFPSQINAS-----SFSYCLVNRDTDSAST 311
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
+ N+ S ++ L N+++T Y++G+T + V G+ + + +
Sbjct: 312 L------EFNSPIPSHSVTAPLLRNNQLDTF----------YYLGMTGIGVGGQMLSIPR 355
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
+ +D G GG + + T L++ +Y +L +F + + VA F C+ L
Sbjct: 356 SSFEVDESGNGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLS- 414
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRTSI--VI 380
+R VP + + + ++ +N ++ + C F P TS +I
Sbjct: 415 ---SRSSVEVPTVSFHFPDGKYL-ALPAKNYLIPVDSAGTFCFAFA-----PTTSALSII 465
Query: 381 GARQLENNLLQFDLATSRLGFS 402
G Q + + +DL+ S +GFS
Sbjct: 466 GNVQQQGTRVSYDLSNSLVGFS 487
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 112/302 (37%), Gaps = 55/302 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y+ +I TP V G +++W C +K + S++ K C S
Sbjct: 90 EYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQ 149
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNPGRAVTV 143
QC L + +C + P + +GD I + L++ S N G+ ++
Sbjct: 150 QCRLLDTVSC------SQPQKLCDFSYGYGDGSLAQGVIATETLTLNS----NSGQPTSI 199
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD----- 198
N +F CG +G+ G G ++L SQ+ + RKF+ CL PF
Sbjct: 200 LNIVFGCGHNNS-GTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSI 258
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
I+F GP +++ DV TPL T YF+ + + V K
Sbjct: 259 TSKIIF--GPEAEVSGSDVVS----TPLVTKDDPTY----------YFVTLDGISVGDKL 302
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
P + S G I P T+L Y LVQ A+P P C
Sbjct: 303 FPFSS---SSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLC 359
Query: 319 FR 320
+R
Sbjct: 360 YR 361
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 91/399 (22%), Positives = 146/399 (36%), Gaps = 83/399 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ + TP PV+L + G ++ W C S T C
Sbjct: 110 EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169
Query: 91 QCNLANAKACG-----GGIC---GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C +CG GIC A D+ I T G + D S S D G +
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSI----TTGHLDSDTFSFASADHAIGG--AS 223
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
VP+ F CG F S GIAG R +++P+QL F+ C +
Sbjct: 224 VPDLTFGCG-LFNNGIFVSNETGIAGFSRGALSMPAQLKV-----DNFSYCFT------A 271
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFI----NKVNTASG-------------FLGEPSVEY 245
+ G P +P+F+ N + A+G + Y
Sbjct: 272 ITGSEP---------------SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAY 316
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
+I + V V +P+ +++ ++ +G GGT + + T+L ++Y + AF +
Sbjct: 317 YISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKL 376
Query: 306 VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---GGDA 362
+ + CF + P VP LVL + + +N M +I GG
Sbjct: 377 TVHNSTSSLSQLCFSVP----PGAKPDVPA--LVLHFEGATLDLPRENYMFEIEEAGGIR 430
Query: 363 LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
L ++ G + VIG Q +N + +DLA L F
Sbjct: 431 LTCLAINAGEDLS---VIGNFQQQNMHVLYDLANDMLSF 466
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 71/305 (23%), Positives = 121/305 (39%), Gaps = 44/305 (14%)
Query: 114 SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSK 173
S++G G+ D++S P RAV F C + + GI GLGR +
Sbjct: 170 SSSGVLGE---DIMSFGKESELKPQRAV------FGCENTETGDLFSQHADGIMGLGRGQ 220
Query: 174 VALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKV 231
+++ QL + F+LC D G +V G P F S ++
Sbjct: 221 LSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPAPPDMVFSHSNPVR--------- 271
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
S Y I + +HV GKA+ L+ + + + GT + + Y L
Sbjct: 272 ----------SPYYNIELKEIHVAGKALRLDPKIFNSKH----GTVLDSGTTYAYLPEQA 317
Query: 292 YKALVQAFASAMPKVARVAPVAP--FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSI 349
+ A A + + + ++ P CF +++ V P +D+V N + S+
Sbjct: 318 FVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQKL-SL 376
Query: 350 HGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD---S 404
+N + + A CLG G +P T ++G + N L+ +D ++GF S
Sbjct: 377 SPENYLFRHSKVEGAYCLGVFQNGKDPTT--LLGGIVVRNTLVTYDRHNEKIGFWKTNCS 434
Query: 405 LLFER 409
L+ER
Sbjct: 435 ELWER 439
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 91/399 (22%), Positives = 146/399 (36%), Gaps = 83/399 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ + TP PV+L + G ++ W C S T C
Sbjct: 110 EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169
Query: 91 QCNLANAKACG-----GGIC---GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C +CG GIC A D+ I T G + D S S D G +
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSI----TTGHLDSDTFSFASADHAIGG--AS 223
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
VP+ F CG F S GIAG R +++P+QL F+ C +
Sbjct: 224 VPDLTFGCG-LFNNGIFVSNETGIAGFSRGALSMPAQLKV-----DNFSYCFT------A 271
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFI----NKVNTASG-------------FLGEPSVEY 245
+ G P +P+F+ N + A+G + Y
Sbjct: 272 ITGSEP---------------SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAY 316
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
+I + V V +P+ +++ ++ +G GGT + + T+L ++Y + AF +
Sbjct: 317 YISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKL 376
Query: 306 VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI---GGDA 362
+ + CF + P VP LVL + + +N M +I GG
Sbjct: 377 TVHNSTSSLSQLCFSVP----PGAKPDVPA--LVLHFEGATLDLPRENYMFEIEEAGGIR 430
Query: 363 LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
L ++ G + VIG Q +N + +DLA L F
Sbjct: 431 LTCLAINAGEDLS---VIGNFQQQNMHVLYDLANDMLSF 466
>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 86/409 (21%), Positives = 155/409 (37%), Gaps = 81/409 (19%)
Query: 57 PVKLTVHLGGNILWVDCE-------KGYVSSTN------------------KTARCGSAQ 91
P+ L + G +++W C+ +G +T+ K++ C +A
Sbjct: 92 PIFLYLDTGSDLVWFPCQPFECILCEGKAENTSLASTPPPKLSKTATPVSCKSSACSAAH 151
Query: 92 CNLANAKACGGGICGAGPDNPISNTG-----------THGD------IRIDVLSIQSTDG 134
NL ++ C C P I + +GD + D +S+ ++
Sbjct: 152 SNLPSSDLCAISNC---PLESIETSDCQKHSCPQFYYAYGDGSLIARLYRDSISLPLSNP 208
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA-AFSLKRKFALC 193
N + V NF F C + + +G+AG GR ++LP+QLA + L +F+ C
Sbjct: 209 TN----LIVNNFTFGCAHTALAE-----PIGVAGFGRGVLSLPAQLATLSPQLGNQFSYC 259
Query: 194 L--SPFDD------GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
L FD ++ G YD + + N P F+ + L P Y
Sbjct: 260 LVSHSFDSDRLRRPSPLILGR---YDHDEKERRVNGVNKPRFV--YTSMLDNLEHPYF-Y 313
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
+G+ + + K +P L +D EG GG + + +T+L S+Y ++V F + + +
Sbjct: 314 CVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGR 373
Query: 306 VARVAPVAP----FGACFRLQDIGFTRIGPVVPQI----DLVLQNKNVVWSIHGQNSMVQ 357
V A V C+ + V+ + +VL +N +
Sbjct: 374 VNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKG 433
Query: 358 IGGDALCLGFVDGGVNPRTS----IVIGARQLENNLLQFDLATSRLGFS 402
CL ++GG S +G Q + + +DL R+GF+
Sbjct: 434 KKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFA 482
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 71/305 (23%), Positives = 121/305 (39%), Gaps = 44/305 (14%)
Query: 114 SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSK 173
S++G G+ D++S P RAV F C + + GI GLGR +
Sbjct: 180 SSSGVLGE---DIMSFGKESELKPQRAV------FGCENTETGDLFSQHADGIMGLGRGQ 230
Query: 174 VALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKV 231
+++ QL + F+LC D G +V G P F S ++
Sbjct: 231 LSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPAPPDMVFSHSNPVR--------- 281
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
S Y I + +HV GKA+ L+ + + + GT + + Y L
Sbjct: 282 ----------SPYYNIELKEIHVAGKALRLDPKIFNSKH----GTVLDSGTTYAYLPEQA 327
Query: 292 YKALVQAFASAMPKVARVAPVAP--FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSI 349
+ A A + + + ++ P CF +++ V P +D+V N + S+
Sbjct: 328 FVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQKL-SL 386
Query: 350 HGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD---S 404
+N + + A CLG G +P T ++G + N L+ +D ++GF S
Sbjct: 387 SPENYLFRHSKVEGAYCLGVFQNGKDPTT--LLGGIVVRNTLVTYDRHNEKIGFWKTNCS 444
Query: 405 LLFER 409
L+ER
Sbjct: 445 ELWER 449
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 71/305 (23%), Positives = 121/305 (39%), Gaps = 44/305 (14%)
Query: 114 SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSK 173
S++G G+ D++S P RAV F C + + GI GLGR +
Sbjct: 181 SSSGVLGE---DIMSFGKESELKPQRAV------FGCENTETGDLFSQHADGIMGLGRGQ 231
Query: 174 VALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKV 231
+++ QL + F+LC D G +V G P F S ++
Sbjct: 232 LSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPAPPDMVFSHSNPVR--------- 282
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
S Y I + +HV GKA+ L+ + + + GT + + Y L
Sbjct: 283 ----------SPYYNIELKEIHVAGKALRLDPKIFNSKH----GTVLDSGTTYAYLPEQA 328
Query: 292 YKALVQAFASAMPKVARVAPVAP--FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSI 349
+ A A + + + ++ P CF +++ V P +D+V N + S+
Sbjct: 329 FVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQKL-SL 387
Query: 350 HGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD---S 404
+N + + A CLG G +P T ++G + N L+ +D ++GF S
Sbjct: 388 SPENYLFRHSKVEGAYCLGVFQNGKDPTT--LLGGIVVRNTLVTYDRHNEKIGFWKTNCS 445
Query: 405 LLFER 409
L+ER
Sbjct: 446 ELWER 450
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 96/402 (23%), Positives = 145/402 (36%), Gaps = 86/402 (21%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSAQ 91
YV TP P V + G ++W C SST K CG+A
Sbjct: 62 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 121
Query: 92 CNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCG 151
C ++C G +C S G +R + +TD G A F +
Sbjct: 122 CESIPTRSCSGDVC--------SYKGPPTQLRGNTSGFAATDTFAIGTATVRLAFGCVVA 173
Query: 152 SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYD 211
S+ G G GLGR+ P L A L R F+ CLSP + G
Sbjct: 174 SDI---DTMDGPSGFIGLGRT----PWSLVAQMKLTR-FSYCLSPRNTG----------- 214
Query: 212 LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE 271
K + LF+ +G GE + TS + + LLS+D
Sbjct: 215 ----------KSSRLFLGSSAKLAG--GESTSTAPFIKTSPDDDSH----HYYLLSLDAI 258
Query: 272 GVGGTKIS-----------TVNPYTVLETSIYKALVQAFASAM---PKVARVAPVAPFGA 317
G T I+ TV+P+++L S Y+A +A A+ P PF
Sbjct: 259 RAGNTTIATAQSGGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDL 318
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG--GDALCLGFVDGGVNPR 375
CF+ + GF+R P + Q + ++ ++ +G D C + R
Sbjct: 319 CFK-KAAGFSRA--TAPDLVFTFQGAAAL-TVPPAKYLIDVGEEKDTACAAILSMAWLNR 374
Query: 376 TSI----VIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
T + V+G+ Q E+ +DL ++L FE A C+
Sbjct: 375 TGLEGVSVLGSLQQEDVHFLYDLK------KETLSFEPADCS 410
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 82/389 (21%), Positives = 151/389 (38%), Gaps = 63/389 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKT------ARCGSAQCNLANAK 98
Y +I TP+ + V G +ILWV+C G + K+ + + + +N
Sbjct: 74 YFAKIGLGTPVQDYYVQVDTGSDILWVNC-AGCTNCPKKSDLGIELSLYSPSSSSTSNRV 132
Query: 99 ACGGGICGAGPDNPI----------------SNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C C + D PI + T G D + + G N T
Sbjct: 133 TCNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTG-NFQTTST 191
Query: 143 VPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+ +F CG++ L ++ + GI G G++ ++ SQLA++ +KR FA CL + G
Sbjct: 192 NGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDNINGG 251
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
I F G V ++ TPL + + Y + + ++ V+ + +
Sbjct: 252 GI-FAIGEV-------VQPKVRTTPLVPQQAH------------YNVFMKAIEVDNEVLN 291
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
L + D GT I + IY+ L+ A ++ V CF
Sbjct: 292 LPTDVFDTDLR--KGTIIDSGTTLAYFPDVIYEPLISKIF-ARQSTLKLHTVEEQFTCFE 348
Query: 321 LQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS 377
D GF P + ++ ++ +++ + I + C+G+ + G R
Sbjct: 349 YDGNVDDGF-------PTVTFHFED-SLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDG 400
Query: 378 ---IVIGARQLENNLLQFDLATSRLGFSD 403
I++G L+N L+ +DL +G+++
Sbjct: 401 KDMILLGDLVLQNRLVMYDLENQTIGWTE 429
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 88/396 (22%), Positives = 156/396 (39%), Gaps = 54/396 (13%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--- 74
LS + A+ SF P A ++ + +V + K TP + L + + W+ C
Sbjct: 2 LSSLVARRSFVPIASARQLIQSPT---FVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCI 58
Query: 75 ---KGYVSSTNKTAR-----CGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDV 126
V S++K++ C S QCN +C G CG + ++ D+ D
Sbjct: 59 GCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGF--NLTYGSSTVAADLVQDN 116
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
L++ +TD +VP++ F C + + S V LG + L +
Sbjct: 117 LTL-ATD--------SVPSYTFGC----IRKATGSSVPPQGLLGLGRGPLSLLGQSQSLY 163
Query: 187 KRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
+ F+ CL F ++ F L +KYTPL N ++ Y+
Sbjct: 164 QSTFSYCLPSFK--SVNFSGS--LRLGPVAQPIRIKYTPLLRNPRRSS---------LYY 210
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
+ + S+ V K V + + L+ ++ GT I + +T L Y A+ F + +
Sbjct: 211 VNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRN 270
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALCL 365
V+ + F C+ + I P I + NV ++ N ++ G CL
Sbjct: 271 VTVSSLGGFDTCYTVPIIS--------PTITFMFAGMNV--TLPPDNFLIHSTSGSTTCL 320
Query: 366 GFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLG 400
N + + VI + Q +N+ + FD+ SR+G
Sbjct: 321 AMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVG 356
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 93/377 (24%), Positives = 158/377 (41%), Gaps = 62/377 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGSA 90
YVT++ TP + V G ++ W+ C V S T +C S+
Sbjct: 131 YVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCSSS 190
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
+C A C N ++GD V + S D + G + P F + C
Sbjct: 191 ECGELQAATLNPSACSV--SNVCIYQASYGDSSYSVGYL-SKDTVSFGSG-SFPGFYYGC 246
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + +GL G+ GL ++K++L QLA SL F+ CL P A Y
Sbjct: 247 GQDN--EGLFGRSAGLIGLAKNKLSLLYQLAP--SLGYAFSYCL-PTSSAA-----AGYL 296
Query: 211 DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK--AVPLN--KTLL 266
+ +++ + YTP+ + ++ + YF+ ++ + V G AVP + ++L
Sbjct: 297 SIGSYNPGQ-YSYTPMASSSLDAS---------LYFVTLSGISVAGAPLAVPPSEYRSLP 346
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV-APFGACFRLQDIG 325
+I + GT I+ + P ++Y AL +A A+AM A AP + CFR G
Sbjct: 347 TIIDS---GTVITRLPP------NVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSAAG 397
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQL 385
VP++D+ ++ N ++ + CL F G + +IG Q
Sbjct: 398 LR-----VPRVDMAFAG-GATLALSPGNVLIDVDDSTTCLAFAPTG----GTAIIGNTQQ 447
Query: 386 ENNLLQFDLATSRLGFS 402
+ + +D+A SR+GF+
Sbjct: 448 QTFSVVYDVAQSRIGFA 464
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 99/405 (24%), Positives = 149/405 (36%), Gaps = 94/405 (23%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILW---VDCEKGY----------VSSTNKTARCGSA 90
+Y I P + + G +++W V C Y SST++ C S
Sbjct: 87 EYFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCASP 146
Query: 91 QCN----LANAKACGGG-----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
+C A GG + G G + + GD+ D L
Sbjct: 147 RCRDVLRYPGCDARTGGCVYMVVYGDG-------SASSGDLATDRLVFPDD--------T 191
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC----LSPF 197
V N CG + V GL G+ G+GR +++ P+QLA A+ F+ C LS
Sbjct: 192 HVHNVTLGCGHDNV--GLLESAAGLLGVGRGQLSFPTQLAPAYG--HVFSYCLGDRLSRA 247
Query: 198 DDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
+G+ +VFG P + +TPL N PS+ Y++ + V
Sbjct: 248 QNGSSYLVFGRTPE--------PPSTAFTPLRTNPR--------RPSL-YYVDMVGFSVG 290
Query: 256 GKAVP--LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF------ASAMPKVA 307
G+ V N +L G GG + + + Y A+ AF A M K+A
Sbjct: 291 GERVTGFSNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLA 350
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQI--------DLVLQNKNVVWSIHGQNSMVQIG 359
V F AC+ L+ G VP I D+ L N + + G +
Sbjct: 351 TKFSV--FDACYDLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTY-- 406
Query: 360 GDALCLGF--VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
CLG D G+N V+G Q + L FD+ R+GF+
Sbjct: 407 ---FCLGLQAADDGLN-----VLGNVQQQGFGLVFDVERGRIGFT 443
>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
Length = 439
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 146/376 (38%), Gaps = 63/376 (16%)
Query: 61 TVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHG 120
T L G+ VD SS N+ C +A C + A GG C P P S T G
Sbjct: 83 TRDLCGSRFCVDVH----SSDNRFDPCAAAGCAI---PAFTGGQC-PRPCPPFSYTYGGG 134
Query: 121 DIRIDVLSIQS------TDGRNPGRA---VTVPNFIFLCGSEFVLQGLASGVVGIAGLGR 171
+ + LS S T G G V P F F C + + L GIAG GR
Sbjct: 135 ALVLGSLSRDSVTLHGSTHGSGAGAGPLPVAFPGFGFGCVGSSIREPL-----GIAGFGR 189
Query: 172 SKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKV 231
++LPSQL L + F+ C F F P NF + L L ++
Sbjct: 190 GALSLPSQLGF---LGKGFSHCFLGFR-----FARNP-----NF--TSPLVMGDLALSSA 234
Query: 232 NTASGFLGEPSVE-------YFIGVTSVHVN----GKAVPLNKTLLSIDNEGVGGTKIST 280
+T GF+ P + Y++G+ V + G A+ +L ID +G GG + T
Sbjct: 235 STDGGFVFTPMLTSATYPNFYYVGLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDT 294
Query: 281 VNPYTVLETSIYKALVQAFASAMPKVARVAPVAP---FGACFRLQDIGFTRIGPVVPQID 337
YT L Y +++ + SA P R + F CF++ +P I
Sbjct: 295 GTTYTQLPDPFYASVLASLISAAPPYERSRDLEARTGFDLCFKVPCARAPCADDELPPIT 354
Query: 338 LVLQN--KNVVWSIHGQNSMVQIGGDAL--CLGFVDGGVNPRT--------SIVIGARQL 385
L L + + + + I + CL F + + V+G+ Q+
Sbjct: 355 LHLAGGARLALPKLSSYYPVTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQM 414
Query: 386 ENNLLQFDLATSRLGF 401
+N + +DLA R+GF
Sbjct: 415 QNVEVVYDLAAGRVGF 430
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 87/404 (21%), Positives = 150/404 (37%), Gaps = 69/404 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG-----------YVSSTNKTARCGSA 90
T Y + P P L V G ++ W+ C+ Y + N+ C +A
Sbjct: 50 TGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANA 109
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG-------RNPGRAVTV 143
C ++ G G +N + D +I S+ G P R+ +
Sbjct: 110 LCTALHS--------GQGSNNKCPSP-KQCDYQIKYTDSASSQGVLINDSFSLPMRSSNI 160
Query: 144 -PNFIFLCGSEFVLQ---GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
P F CG + + + + + G+ GLGR V+L SQL K CLS
Sbjct: 161 RPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGG 220
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
G + FGD DV + + T ++ SG P G +++ + +++
Sbjct: 221 GFLFFGD---------DVVPSSRVT--WVPMAQRTSGNYYSP------GSGTLYFDRRSL 263
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
+ + D+ G T YT Y+A+V A + K + C+
Sbjct: 264 GVKPMEVVFDS---GST-------YTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCW 313
Query: 320 RLQDIGFTRIGPVVPQIDLVL----QNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
+ Q F + V + + KN I +N ++ +CLG +DG
Sbjct: 314 KGQK-AFKSVFDVKNEFKSMFLSFSSAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKL 372
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNFTSI 419
+ VIG +++ ++ +D S+LG++ R CT + SI
Sbjct: 373 SFNVIGDITMQDQMVIYDNEKSQLGWA------RGACTRSAKSI 410
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 85/380 (22%), Positives = 141/380 (37%), Gaps = 54/380 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+++ ++ P V V G +++W C+ T + + + C G
Sbjct: 107 EFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSG 166
Query: 104 ICGAGPDNPISNTG-------THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVL 156
+C A P + + T+GD + +T+ ++ F CG E
Sbjct: 167 LCNALPRSNCNEDKDSCEYLYTYGDYS-STRGLLATETFTFEDENSISGIGFGCGVENEG 225
Query: 157 QGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFD 216
G + G G+ GLGR ++L SQL + KF+ CL+ +D +
Sbjct: 226 DGFSQGS-GLVGLGRGPLSLISQLK-----ETKFSYCLTSIEDS---------------E 264
Query: 217 VSKNLKYTPLFINKVN-TASGFLGE-------------PSVEYFIGVTSVHVNGKAVPLN 262
S +L L VN T + GE PS Y++ + + V K + +
Sbjct: 265 ASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSF-YYLELQGITVGAKRLSVE 323
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
K+ + +G GG I + T LE + +K L + F S M + CF+L
Sbjct: 324 KSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLP 383
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRTSIVIG 381
+ VP+ L+ K + G+N MV LCL G + SI
Sbjct: 384 NAA---KNIAVPK--LIFHFKGADLELPGENYMVADSSTGVLCLAM---GSSNGMSIFGN 435
Query: 382 ARQLENNLLQFDLATSRLGF 401
+Q N+L DL + F
Sbjct: 436 VQQQNFNVLH-DLEKETVTF 454
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 95/406 (23%), Positives = 157/406 (38%), Gaps = 64/406 (15%)
Query: 28 RPKALVLRVAKDTATL-QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------ 74
RP A +R+ D T Y T++ TP L V G + +V C
Sbjct: 68 RPSAR-MRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPR 126
Query: 75 -KGYVSSTNKTARCGSAQCNLANAKA-CGGGICGAGPDNPISNTGTHGDIRIDVLSIQST 132
+ +SST +C SA C + K+ C A S++G G+ D++S +
Sbjct: 127 FQPDLSSTYSPVKC-SADCTCDSDKSQCTYERQYA---EMSSSSGVLGE---DIVSFGTE 179
Query: 133 DGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
P RAV F C + + GI GLGR ++++ QL + F++
Sbjct: 180 SELKPQRAV------FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSM 233
Query: 193 CLSPFD--DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVT 250
C D GA+V G P F S ++ S Y I +
Sbjct: 234 CYGGMDIGGGAMVLGAMPAPPDMVFSRSDPVR-------------------SPYYNIELK 274
Query: 251 SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA 310
+HV GKA+ L+ + + GT + + Y L + A A S + + ++
Sbjct: 275 EIHVAGKALRLDPRIFDSKH----GTVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIR 330
Query: 311 PVAP--FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG--DALCLG 366
P CF +++ P +D+V + + S+ +N + + A CLG
Sbjct: 331 GPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFGDGQKL-SLSPENYLFRHSKVEGAYCLG 389
Query: 367 FVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD---SLLFER 409
G +P T ++G + N L+ +D ++GF S L+ER
Sbjct: 390 VFQNGKDPTT--LLGGIVVRNTLVTYDRHNEKIGFWKTNCSELWER 433
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 65/311 (20%), Positives = 119/311 (38%), Gaps = 57/311 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE----------------------KGYVSSTN 82
+ T I TP V + + G ++LWV C+ + +S+T+
Sbjct: 105 HYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNTS 164
Query: 83 KTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
+ CG C++ + C +NT + G + D L + S DG++ +
Sbjct: 165 RHLPCGHKLCDVHSFCKGSKDPCPYEVQYASANTSSSGYVFEDKLHLTS-DGKHAEQNSV 223
Query: 143 VPNFIFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
+ I CG + L +G G+ GLG +++PS LA A ++ F++CL + G
Sbjct: 224 QASIILGCGRKQTGDYLHGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSICLDENESGR 283
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
I+FGD + ++ TP P + Y +GV S V +
Sbjct: 284 IIFGDQGHVTQHS---------TPFL-------------PIIAYMVGVESFCVGSLCLKE 321
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
+ ID + + +T L +Y+ +V F + +R+ + + C+
Sbjct: 322 TRFQALID----------SGSSFTFLPNEVYQKVVTEFDKQV-NASRIVLQSSWEYCYNA 370
Query: 322 QDIGFTRIGPV 332
I P+
Sbjct: 371 SSQELVNIPPL 381
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 87/404 (21%), Positives = 150/404 (37%), Gaps = 69/404 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG-----------YVSSTNKTARCGSA 90
T Y + P P L V G ++ W+ C+ Y + N+ C +A
Sbjct: 50 TGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANA 109
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG-------RNPGRAVTV 143
C ++ G G +N + D +I S+ G P R+ +
Sbjct: 110 LCTALHS--------GQGSNNKCPSP-KQCDYQIKYTDSASSQGVLINDSFSLPMRSSNI 160
Query: 144 -PNFIFLCGSEFVLQ---GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
P F CG + + + + + G+ GLGR V+L SQL K CLS
Sbjct: 161 RPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGG 220
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
G + FGD DV + + T ++ SG P G +++ + +++
Sbjct: 221 GFLFFGD---------DVVPSSRVT--WVPMAQRTSGNYYSP------GSGTLYFDRRSL 263
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
+ + D+ G T YT Y+A+V A + K + C+
Sbjct: 264 GVKPMEVVFDS---GST-------YTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCW 313
Query: 320 RLQDIGFTRIGPVVPQIDLVL----QNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
+ Q F + V + + KN I +N ++ +CLG +DG
Sbjct: 314 KGQK-AFKSVFDVKNEFKSMFLSFASAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKL 372
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFNFTSI 419
+ VIG +++ ++ +D S+LG++ R CT + SI
Sbjct: 373 SFNVIGDITMQDQMVIYDNEKSQLGWA------RGACTRSAKSI 410
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 88/396 (22%), Positives = 156/396 (39%), Gaps = 54/396 (13%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--- 74
LS + A+ SF P A ++ + +V + K TP + L + + W+ C
Sbjct: 79 LSSLVARRSFVPIASARQLIQSPT---FVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCI 135
Query: 75 ---KGYVSSTNKTAR-----CGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDV 126
V S++K++ C S QCN +C G CG + ++ D+ D
Sbjct: 136 GCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGF--NLTYGSSTVAADLVQDN 193
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
L++ +TD +VP++ F C + + S V LG + L +
Sbjct: 194 LTL-ATD--------SVPSYTFGC----IRKATGSSVPPQGLLGLGRGPLSLLGQSQSLY 240
Query: 187 KRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
+ F+ CL F ++ F L +KYTPL N ++ Y+
Sbjct: 241 QSTFSYCLPSFK--SVNFSGS--LRLGPVAQPIRIKYTPLLRNPRRSS---------LYY 287
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
+ + S+ V K V + + L+ ++ GT I + +T L Y A+ F + +
Sbjct: 288 VNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRN 347
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALCL 365
V+ + F C+ + I P I + NV ++ N ++ G CL
Sbjct: 348 VTVSSLGGFDTCYTVPIIS--------PTITFMFAGMNV--TLPPDNFLIHSTAGSTTCL 397
Query: 366 GFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLG 400
N + + VI + Q +N+ + FD+ SR+G
Sbjct: 398 AMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVG 433
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 72/277 (25%), Positives = 114/277 (41%), Gaps = 38/277 (13%)
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA- 201
+ F+F CG+ QG G G+ GLGRS+++L SQ F F+ CL P + G+
Sbjct: 220 IQGFVFGCGTSN--QGPFGGTSGLMGLGRSQLSLISQTMDQFG--GVFSYCLPPKESGSS 275
Query: 202 --IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
+V GD D + + S + YT + + + Y +T + V G+ V
Sbjct: 276 GSLVLGD----DASVYRNSTPIVYTAMVSDPLQGPF---------YLANLTGITVGGEDV 322
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
G G + + T L S+Y A+ F S + + + AP + CF
Sbjct: 323 QSP----GFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCF 378
Query: 320 RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA--LCLGFVDGGVNPRTS 377
L G + VP + LV V + + + + GDA +CL T
Sbjct: 379 DL--TGLREV--QVPSLKLVFDGGAEV-EVDSKGVLYVVTGDASQVCLALASLKSEYDTP 433
Query: 378 IVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTF 414
I IG Q +N + FD S++GF+ + TC +
Sbjct: 434 I-IGNYQQKNLRVIFDTVGSQIGFA------QETCDY 463
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 88/391 (22%), Positives = 152/391 (38%), Gaps = 58/391 (14%)
Query: 38 KDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKT 84
+ + L+Y+ + TP PV + G +++W C SS+ +
Sbjct: 97 RPSGDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEP 156
Query: 85 ARCGSAQCNLANAKACGGGICGAGPDNPI------SNTGTHGDIRIDVLSIQSTDGRNPG 138
RC CN +C PD T T G + + S+
Sbjct: 157 MRCAGELCNDILHHSC------QRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGET 210
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
++ P F CG+ + +G + GI G GR+ ++L SQLA R+F+ CL+P+
Sbjct: 211 TKLSAP-LGFGCGT--MNKGSLNNGSGIVGFGRAPLSLVSQLAI-----RRFSYCLTPYA 262
Query: 199 DG---AIVFGD--GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
G ++FG G YD + ++ T L ++ N P+ Y++ T V
Sbjct: 263 SGRKSTLLFGSLRGGVYDA----ATATVQTTRLLRSRQN--------PTF-YYVPFTGVT 309
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA--MPKVARVAP 311
V + + + + ++ +G GG + + T+ + +V+AF S +P A +
Sbjct: 310 VGARRLRIPISAFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSS 369
Query: 312 VAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGG 371
G CF R VVP++ LQ ++ Q G+ LCL D G
Sbjct: 370 GPDDGVCFAAAASRVPRPA-VVPRMVFHLQGADLDLPRRNYVLDDQRKGN-LCLLLADSG 427
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ IG ++ + +DL L F+
Sbjct: 428 ---DSGTTIGNFVQQDMRVLYDLEADTLSFA 455
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 66/294 (22%), Positives = 127/294 (43%), Gaps = 47/294 (15%)
Query: 35 RVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SST 81
RV ++ +Y+ + TP + + G +++W C + S+T
Sbjct: 79 RVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSAT 138
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDI--RIDVLSIQS-TDGRNPG 138
+ C S++C ++ +C +C +GD VL+ ++ T G
Sbjct: 139 YRALPCRSSRCASLSSPSCFKKMC--------VYQYYYGDTASTAGVLANETFTFGAANS 190
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF- 197
V N F CGS + G + G+ G GR ++L SQL + +F+ CL+ +
Sbjct: 191 TKVRATNIAFGCGS--LNAGDLANSSGMVGFGRGPLSLVSQLGPS-----RFSYCLTSYL 243
Query: 198 --DDGAIVFGDGPYYDLNNFDVSKN--LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
+ FG Y +L++ + S ++ TP IN P++ YF+ + ++
Sbjct: 244 SATPSRLYFGV--YANLSSTNTSSGSPVQSTPFVINP--------ALPNM-YFLSLKAIS 292
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
+ K +P++ + +I+++G GG I + T L+ Y+A+ + SA+P A
Sbjct: 293 LGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLTA 346
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 88/405 (21%), Positives = 155/405 (38%), Gaps = 71/405 (17%)
Query: 22 TAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST 81
+A + + K + V D + +Y+ Q+ TP + + + G +++W C ST
Sbjct: 19 SAVNTHQMKDIETPVTPDIGSGEYLIQMAIGTPALSLSAIMDTGSDLVWTKCNPCTDCST 78
Query: 82 NKTAR-----------CGSAQCNLANAKACGG-GICGAGPDNPISN-TGTHGDIRIDVLS 128
+ C S+ C + +C G C P + + T G + + S
Sbjct: 79 SSIYDPSSSSTYSKVLCQSSLCQPPSIFSCNNDGDCEYV--YPYGDRSSTSGILSDETFS 136
Query: 129 IQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKR 188
I S ++PN F CG + QG V G+ G GR ++L SQL S+
Sbjct: 137 ISSQ---------SLPNITFGCGHDN--QGFDK-VGGLVGFGRGSLSLVSQLGP--SMGN 182
Query: 189 KFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFI-NKVNTASGFLGE-PSVE-- 244
KF+ CL D + K +PLFI N + + +G P V+
Sbjct: 183 KFSYCLVSRTDSS--------------------KTSPLFIGNTASLEATTVGSTPLVQSS 222
Query: 245 ----YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFA 300
Y++ + + V G+++ + I ++G GG I + T L+ + Y A+ +A
Sbjct: 223 STNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMV 282
Query: 301 SA--MPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV-Q 357
S+ +P+ CF Q P P + K + + +N +
Sbjct: 283 SSINLPQAD-----GQLDLCFNQQG----SSNPGFPS--MTFHFKGADYDVPKENYLFPD 331
Query: 358 IGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
D +CL + N + G Q +N + +D + L F+
Sbjct: 332 STSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFA 376
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 151/379 (39%), Gaps = 60/379 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYVSSTN-----------KTARCGSA 90
+Y T+I TP V + + G +I+W+ C K S T+ C +
Sbjct: 128 EYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTP 187
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C + C + + T G+ + L+ + T V C
Sbjct: 188 LCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTK---------VEQVALGC 238
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFDDGAIVFGD 206
G + +GL G G+ GLGR ++ PSQ F+ +KF+ CL + ++VFG
Sbjct: 239 GHD--NEGLFVGAAGLLGLGRGGLSFPSQAGRTFN--QKFSYCLVDRSASSKPSSVVFG- 293
Query: 207 GPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKAVP-LNKT 264
N VS+ ++TPL N +++T Y++ + + V G V + +
Sbjct: 294 -------NSAVSRTARFTPLLTNPRLDTF----------YYVELLGISVGGTPVSGITAS 336
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
+D G GG I T L Y AL AF + + + F C+ L
Sbjct: 337 HFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLS-- 394
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRTSIVIGAR 383
G T + VP + L + +V S+ N ++ + G C F G SI IG
Sbjct: 395 GKTTVK--VPTVVLHFRGADV--SLPASNYLIPVDGSGRFCFAF--AGTTSGLSI-IGNI 447
Query: 384 QLENNLLQFDLATSRLGFS 402
Q + + +DLA+SR+GFS
Sbjct: 448 QQQGFRVVYDLASSRVGFS 466
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 150/389 (38%), Gaps = 64/389 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
Y T++ TP L V G + +V C + SST + +C +
Sbjct: 112 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC-TID 170
Query: 92 CNLANAKACGGGICGAGPDNPISNTGTH-GDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
CN C G + + T G + DV+S + P RAV F C
Sbjct: 171 CN------CDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAV------FGC 218
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGD-G 207
+ + GI GLGR +++ QL + F+LC D GA+V G
Sbjct: 219 ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVLGGIS 278
Query: 208 PYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS 267
P D+ F S + S Y I + +HV GK +PLN +
Sbjct: 279 PPSDMT-FAYSD-------------------PDRSPYYNIDLKEMHVAGKRLPLNANVF- 317
Query: 268 IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP--FGACFRLQDIG 325
+G GT + + Y L + + A A + + +++ P CF
Sbjct: 318 ---DGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGND 374
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGAR 383
+++ P +D+V N + +S+ +N M + A CLG G N +T++ +G
Sbjct: 375 VSQLSKSFPVVDMVFGNGH-KYSLSPENYMFRHSKVRGAYCLGIFQNG-NDQTTL-LGGI 431
Query: 384 QLENNLLQFDLATSRLGFSDS---LLFER 409
+ N L+ +D +++GF + L+ER
Sbjct: 432 IVRNTLVMYDREQTKIGFWKTNCAELWER 460
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 85/341 (24%), Positives = 131/341 (38%), Gaps = 60/341 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG--------YVSSTNKTARCGSAQCN 93
T++YV + +P V + + G ++ WV C S+T C SA C
Sbjct: 126 TMEYVITVGIGSPAVTQTMMIDTGSDVSWVRCNSTDGLTLFDPSKSTTYAPFSCSSAAC- 184
Query: 94 LANAKACGGGICGAGPDNPI---SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
A G G +G + + T G D L++ ++D TV +F F C
Sbjct: 185 -AQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLALSASD--------TVTDFHFGC 235
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGP 208
S + G+ GLG +L SQ AA + + F+ CL P + G + FG P
Sbjct: 236 -SHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYG--KSFSYCLPPTNRTSGFLTFG-AP 291
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
F + L++ K T G L + + V G + + ++LS
Sbjct: 292 NGTSGGFVTTPMLRWP-----KAPTLYGVL----------LQDISVGGTPLGIQPSVLSN 336
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV--ARVAPVAPFGACFRLQDIGF 326
+ GT I T L Y AL AF S+M ++ R AP+ C+ +
Sbjct: 337 GSVMDSGTVI------TWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGL-- 388
Query: 327 TRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF 367
+ +P + LVL VV + G M+Q CL F
Sbjct: 389 --VNVSIPAVSLVLDGGAVV-DLDGNGIMIQD-----CLAF 421
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 110/289 (38%), Gaps = 60/289 (20%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGY----------VSST 81
TAT Y T+I TP + V G +ILWV+C + G SST
Sbjct: 28 TATRLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSST 87
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDG 134
C C A GG + G P + T+GD D+L G
Sbjct: 88 GSKVSCDQGFC----AATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSG 143
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASG-----VVGIAGLGRSKVALPSQLAAAFSLKRK 189
R F CGS+ QG G + GI G G+S ++ SQL+AA +K+
Sbjct: 144 DGQTRPAN-STVTFGCGSQ---QGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKI 199
Query: 190 FALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
FA CL + G I + + N V +K TPL N + Y + +
Sbjct: 200 FAHCLDTINGGGI-------FAIGNV-VQPKVKTTPLVPNMPH------------YNVNL 239
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQA 298
S+ V G A+ L + D GT I + T L +YK ++ A
Sbjct: 240 KSIDVGGTALKLPSHMF--DTGEKKGTIIDSGTTLTYLPEIVYKEIMLA 286
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 65/250 (26%), Positives = 103/250 (41%), Gaps = 35/250 (14%)
Query: 162 GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--GAIVFGDGPYYDLNNFDVSK 219
G G+ GLGR ++L SQL A +F+ CL+PF D GP LN V
Sbjct: 239 GSAGLVGLGRGSLSLVSQLGAG-----RFSYCLTPFQDTNSTSTLLLGPSAALNGTGV-- 291
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
+ TP AS S Y++ +T + + KA+P++ S+ +G GG I
Sbjct: 292 --RSTPFV------ASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIID 343
Query: 280 TVNPYTVLETSIY---KALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGP-VVPQ 335
+ T L + Y +A V++ + +P V CF L T P V+P
Sbjct: 344 SGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTG-LDLCFALP--APTSAPPAVLPS 400
Query: 336 IDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF---VDGGVNPRTSIVIGARQLENNLLQF 392
+ L ++V + + M+ G CL DG ++ G Q +N + +
Sbjct: 401 MTLHFDGADMV--LPADSYMIS-GSGVWCLAMRNQTDGAMS-----TFGNYQQQNMHILY 452
Query: 393 DLATSRLGFS 402
D+ L F+
Sbjct: 453 DVREETLSFA 462
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 111/262 (42%), Gaps = 33/262 (12%)
Query: 147 IFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
IF CG++ L + + GI G G++ ++ SQLAAA +KR FA CL G I F
Sbjct: 204 IFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKGGGI-F 262
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
G VS + TP+ N+ + Y + + + V G + L
Sbjct: 263 AIGEV-------VSPKVNTTPMVPNQPH------------YNVVMKEIEVGGNVLELPTD 303
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
+ D GT I + L +Y++++ S P + ++ V CF Q
Sbjct: 304 IF--DTGDRRGTIIDSGTTLAYLPEVVYESMMTKIVSEQPGL-KLHTVEEQFTCF--QYT 358
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV---NPRTSIVIG 381
G G V + N ++ +++ + + QI + C G+ + G+ + R ++G
Sbjct: 359 GNVNEGFPVVKFHF---NGSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLG 415
Query: 382 ARQLENNLLQFDLATSRLGFSD 403
L N L+ +DL +G++D
Sbjct: 416 DLVLSNKLVLYDLENQAIGWTD 437
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 83/382 (21%), Positives = 152/382 (39%), Gaps = 63/382 (16%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCG 88
T Y+ + TP + + G ++ WV C+ S+T CG
Sbjct: 135 TANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCG 194
Query: 89 SAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
+ +C ++ +C G C G + T G++ D L++ + + +
Sbjct: 195 AQECRRLDSGSCSSGKCRYEVVYG-------DMSQTDGNLARDTLTLGPSSSSS--SSDQ 245
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
+ F+F CG + GL G+ GLGR +V+L SQ AA + + L S +G +
Sbjct: 246 LQEFVFGCGDDDT--GLFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSSSTAEGYL 303
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
G N ++T + + + +T PS Y++ + + V G+ V ++
Sbjct: 304 SLGSA---------APPNARFTAM-VTRSDT-------PSF-YYLNLVGIKVAGRTVRVS 345
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA--RVAPVAPFGACFR 320
+ GT I + T L + Y AL +FA M + + R ++ C+
Sbjct: 346 PAVFRTP-----GTVIDSGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYD 400
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI-V 379
G ++ +P + L+ + G+ V A CL F G + TSI +
Sbjct: 401 F--TGRNKV--QIPSVALLFDGGATLNLGFGEVLYVANKSQA-CLAFASNGDD--TSIAI 453
Query: 380 IGARQLENNLLQFDLATSRLGF 401
+G Q + + +D+A ++GF
Sbjct: 454 LGNMQQKTFAVVYDVANQKIGF 475
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 85/392 (21%), Positives = 147/392 (37%), Gaps = 69/392 (17%)
Query: 54 PLVPVKLTVHLGGNILWVDCEKG-----------YVSSTNKTARCGSAQCNLANAKACGG 102
P P L V G ++ W+ C+ Y + N+ C +A C ++
Sbjct: 4 PAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTALHS----- 58
Query: 103 GICGAGPDNPISNTGTHGDIRIDVLSIQSTDG-------RNPGRAVTV-PNFIFLCGSEF 154
G G +N + D +I S+ G P R+ + P F CG +
Sbjct: 59 ---GQGSNNKCPSP-KQCDYQIKYTDSASSQGVLINDSFSLPMRSSNIRPGLTFGCGYDQ 114
Query: 155 VLQ---GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYD 211
+ + + + G+ GLGR V+L SQL K CLS G + FGD
Sbjct: 115 QVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGGFLFFGD----- 169
Query: 212 LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE 271
DV + + T ++ SG P G +++ + +++ + + D+
Sbjct: 170 ----DVVPSSRVT--WVPMAQRTSGNYYSP------GSGTLYFDRRSLGVKPMEVVFDS- 216
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGP 331
G T YT Y+A+V A + K + C++ Q F +
Sbjct: 217 --GST-------YTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKGQK-AFKSVFD 266
Query: 332 VVPQIDLVL----QNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLEN 387
V + + KN I +N ++ +CLG +DG + VIG +++
Sbjct: 267 VKNEFKSMFLSFASAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQD 326
Query: 388 NLLQFDLATSRLGFSDSLLFERATCTFNFTSI 419
++ +D S+LG++ R CT + SI
Sbjct: 327 QMVIYDNEKSQLGWA------RGACTRSAKSI 352
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/254 (26%), Positives = 109/254 (42%), Gaps = 39/254 (15%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDVSKNLKY 223
GI GLG S+V+L +QLA + + FALC + DGA++ GD D +DV+ L+Y
Sbjct: 188 GILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEGDGALMLGD---VDAAEYDVA--LQY 242
Query: 224 TPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNP 283
T L + L P Y + + ++ V G+ +P+ EG GT + +
Sbjct: 243 TALLSS--------LAHPHY-YSVQLEALWVGGQQLPVKPERY---EEGY-GTVLDSGTT 289
Query: 284 YTVLETS---IYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGF-----------TRI 329
+T L + ++K V A+A + P + + DI F +++
Sbjct: 290 FTYLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGGAPHAGHADQSKL 349
Query: 330 GPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD--ALCLGFVDGGVNPRTSIVIGARQLEN 387
V P +L + V N + G+ A CLG D G + ++G N
Sbjct: 350 EKVFPVFELQFAD-GVRLRTGPLNYLFMHTGEMGAYCLGVFDNGA---SGTLLGGISFRN 405
Query: 388 NLLQFDLATSRLGF 401
L+Q+D R+GF
Sbjct: 406 ILVQYDRRNRRVGF 419
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 89/392 (22%), Positives = 140/392 (35%), Gaps = 102/392 (26%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK---------------GYVSSTNKTARCG 88
+Y+ + TP V+LT+ G +I W C++ SS+ + C
Sbjct: 87 EYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCS 146
Query: 89 SAQCNLANAKACGGG-ICGAGPDNPISNTG----THGDIRIDVLSIQSTDGRNPGRAVTV 143
S C CGGG + P N + G + G+I +V + S G G + V
Sbjct: 147 SPACE--TTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGE--GSSAAV 202
Query: 144 PNFIFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
P +F CG +G+ S GIAG GR ++LPSQL F+ C +
Sbjct: 203 PGLVFGCG--HANRGVFTSNETGIAGFGRGSLSLPSQLKVG-----NFSHCFT------- 248
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
I T++ LG P V A PL
Sbjct: 249 ------------------------TITGSKTSAVLLGLPGV----------APPSASPLG 274
Query: 263 K-----TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+ S GT I+++ P T Y+A+ + FA+ + PV P A
Sbjct: 275 RRRGSYRCRSTPRSSNSGTSITSLPPRT------YRAVREEFAAQV-----KLPVVPGNA 323
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA--------LCLGFVD 369
GP + L + + +N + ++ D +CL ++
Sbjct: 324 TDPFTCFSAPLRGPKPDVPTMALHFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAVIE 383
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
GG I++G Q +N + +DL S+L F
Sbjct: 384 GG-----EIILGNIQQQNMHVLYDLQNSKLSF 410
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 94/380 (24%), Positives = 150/380 (39%), Gaps = 62/380 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYVSSTN-----------KTARCGSA 90
+Y T+I TP V + + G +I+W+ C K S T+ C +
Sbjct: 41 EYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTP 100
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C + C + + T G+ + L+ + T V C
Sbjct: 101 LCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTK---------VEQVALGC 151
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFDDGAIVFGD 206
G + +GL G G+ GLGR ++ PSQ F+ +KF+ CL + ++VFG
Sbjct: 152 GHD--NEGLFVGAAGLLGLGRGGLSFPSQAGRTFN--QKFSYCLVDRSASSKPSSVVFG- 206
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKAVP-LNK 263
N VS+ ++TPL N P ++ Y++ + + V G V +
Sbjct: 207 -------NSAVSRTARFTPLLTN-----------PRLDTFYYVELLGISVGGTPVSGITA 248
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
+ +D G GG I T L Y AL AF + + + F C+ L
Sbjct: 249 SHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLS- 307
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVDGGVNPRTSIVIGA 382
G T + VP + L + +V S+ N ++ + G C F G SI IG
Sbjct: 308 -GKTTVK--VPTVVLHFRGADV--SLPASNYLIPVDGSGRFCFAFA--GTTSGLSI-IGN 359
Query: 383 RQLENNLLQFDLATSRLGFS 402
Q + + +DLA+SR+GFS
Sbjct: 360 IQQQGFRVVYDLASSRVGFS 379
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 141/378 (37%), Gaps = 61/378 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST----------NKTARCGSAQCN 93
+YV + TP VP L + G ++ WV C+ S N ++ C+
Sbjct: 128 EYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCD 187
Query: 94 LANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSI----QSTDGRNPGRAVTVPNFIFL 149
+A GI G G + + G +I + STD G V F F
Sbjct: 188 SQECRALAAGIDGDGCTS-DGDWGCAYEIHYGSGATPAGEYSTDALTLGPGAIVKRFHFG 246
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK---FALCLSPFDDGAIVFGD 206
CG +G G+ GLGR LP LA S +R F+ CL P
Sbjct: 247 CG-HHQQRGKFDMADGVLGLGR----LPQSLAWQASARRGGGVFSHCLPPTGVSTGFLAL 301
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
G +D + F +TPL + + Y + T++ V G+ + + +
Sbjct: 302 GAPHDTSAF------VFTPLLT---------MDDQPWFYQLMPTAISVAGQLLDIPPAVF 346
Query: 267 SIDNEGV---GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
EGV GT +S L+ + Y AL AF SAM + PV CF
Sbjct: 347 ---REGVITDSGTVLS------ALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNF-- 395
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGAR 383
G+ + VP + L + V + + ++ G CL F G + T ++
Sbjct: 396 TGYDNV--TVPTVSLTFRGGATV-HLDASSGVLMDG----CLAFWSSG-DEYTGLIGSVS 447
Query: 384 QLENNLLQFDLATSRLGF 401
Q +L +D+ ++GF
Sbjct: 448 QRTIEVL-YDMPGRKVGF 464
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 87/383 (22%), Positives = 142/383 (37%), Gaps = 68/383 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV + TP + G + WV C+ V SST C
Sbjct: 177 TGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSC 236
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPN 145
+ C+ N C GG C G G++ G +D L++ S D V
Sbjct: 237 AAPACSDLNIHGCSGGHCLYGVQY---GDGSYSIGFFAMDTLTLSSYD--------AVKG 285
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA--IV 203
F F CG +GL G+ GLGR K +LP Q + FA CL G +
Sbjct: 286 FRFGCGERN--EGLFGEAAGLLGLGRGKTSLPVQTYDKYG--GVFAHCLPARSTGTGYLD 341
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FG G + ++ TP+ T Y++G+T + V G+ + + +
Sbjct: 342 FGAG------SLAAARARLTTPMLTENGPTF----------YYVGMTGIRVGGQLLSIPQ 385
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKAL-VQAFASAMPKVARVAP-VAPFGACFRL 321
++ + GT + + T L + Y +L A+ + + AP V+ C+
Sbjct: 386 SVFA-----TAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDF 440
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV---DGGVNPRTSI 378
G +++ +P + L+ Q + M +CL F DGG
Sbjct: 441 --TGMSQV--AIPTVSLLFQG-GARLDVDASGIMYAASASQVCLAFAANEDGG----DVG 491
Query: 379 VIGARQLENNLLQFDLATSRLGF 401
++G QL+ + +D+ +GF
Sbjct: 492 IVGNTQLKTFGVAYDIGKKVVGF 514
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 107/279 (38%), Gaps = 63/279 (22%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG-----YVSSTNKTARCGSAQCNLANAKA 99
YV TP V + + +++W C S+T C C +
Sbjct: 100 YVFSYGIGTPPQQVSGALDISSDLVWTACGATAPFNPVRSTTVADVPCTDDACQQFAPQT 159
Query: 100 CGGG--------ICGAGPDNPISNTGTH----GDIRIDVLSIQSTDGRNPGRAVTVPNFI 147
CG G + G G N GT GD RID +
Sbjct: 160 CGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDTRID-------------------GVV 200
Query: 148 FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD----DGAIV 203
F CG + V G SGV G+ GLGR ++L SQL +F+ +P D I+
Sbjct: 201 FGCGLKNV--GDFSGVSGVIGLGRGNLSLVSQLQV-----DRFSYHFAPDDSVDTQSFIL 253
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
FGD + + + T L + N PS+ Y++ + + V+GK + +
Sbjct: 254 FGD------DATPQTSHTLSTRLLASDAN--------PSL-YYVELAGIQVDGKDLAIPS 298
Query: 264 TLLSIDN-EGVGGTKISTVNPYTVLETSIYKALVQAFAS 301
+ N +G GG +S + TVLE + YK L QA AS
Sbjct: 299 GTFDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVAS 337
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 77/336 (22%), Positives = 122/336 (36%), Gaps = 61/336 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE----------------KGYVSSTNKTA 85
TL+YV + +P V ++ + G ++ WV CE SST
Sbjct: 105 TLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAF 164
Query: 86 RCGSAQC---------NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRN 136
C +A C N +AK+ I G + T G DVL++ +D
Sbjct: 165 NCSAAACAQLGDSGEANGCDAKSRCQYIVKYG-----DGSNTTGTYSSDVLTLSGSD--- 216
Query: 137 PGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP 196
V F F C + G+ G+ GLG + SQ AA + + L +P
Sbjct: 217 -----VVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPATP 271
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
G + G + TP+ +K + YF + + V G
Sbjct: 272 ASSGFLTLG---APASGGGGGASRFATTPMLRSK---------KVPTYYFAALEDIAVGG 319
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
K + L+ ++ + + GT I+ + P + Y AL AF + M + AR P+
Sbjct: 320 KKLGLSPSVFAAGSLVDSGTVITRLPP------AAYAALSSAFRAGMTRYARAEPLGILD 373
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVV-WSIHG 351
CF G ++ +P + LV VV HG
Sbjct: 374 TCFNF--TGLDKVS--IPTVALVFAGGAVVDLDAHG 405
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/186 (26%), Positives = 77/186 (41%), Gaps = 30/186 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------------KGYVSSTNK 83
Y T + TP + + G ++ WV C+ K S+T++
Sbjct: 102 YYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSR 161
Query: 84 TARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
C C+ A+ C D NT + G + D+L + S +G P A
Sbjct: 162 HLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAPVNA--- 218
Query: 144 PNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+ I CG S L+G+A G+ GLG + +++PS LA A ++ F++C D G
Sbjct: 219 -SVIIGCGKKQSGSYLEGIAPD--GLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSG 275
Query: 201 AIVFGD 206
I FGD
Sbjct: 276 RIFFGD 281
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/186 (26%), Positives = 77/186 (41%), Gaps = 30/186 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------------KGYVSSTNK 83
Y T + TP + + G ++ WV C+ K S+T++
Sbjct: 102 YYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSR 161
Query: 84 TARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
C C+ A+ C D NT + G + D+L + S +G P A
Sbjct: 162 HLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAPVNA--- 218
Query: 144 PNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+ I CG S L+G+A G+ GLG + +++PS LA A ++ F++C D G
Sbjct: 219 -SVIIGCGKKQSGSYLEGIAPD--GLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSG 275
Query: 201 AIVFGD 206
I FGD
Sbjct: 276 RIFFGD 281
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 92/404 (22%), Positives = 155/404 (38%), Gaps = 78/404 (19%)
Query: 33 VLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------KGYVSSTNKT 84
+ + D L Y T I TP + + G ++LW+ C+ Y S+ ++
Sbjct: 85 TMSLGNDFGWLHY-TWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAPLSSSYYSNLDRD 143
Query: 85 ARCGSAQCNLANAK-ACGGGICGAGPDNPIS-------------NTGTHGDIRIDVLSIQ 130
S +L++ +C +C G + S NT + G + D+L +Q
Sbjct: 144 LNEYSPSRSLSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQ 203
Query: 131 STDGRNPGRAVTVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
S G +V P + CG S L G+A G+ GLG + ++PS LA + +
Sbjct: 204 S-GGSLSNSSVQAP-VVLGCGMKQSGGYLDGVAPD--GLLGLGPGESSVPSFLAKSGLIH 259
Query: 188 RKFALCLSPFDDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
F+LC + D G I FGD GP + ++ + PL G Y
Sbjct: 260 DSFSLCFNEDDSGRIFFGDQGP-------TIQQSTSFLPL--------DGLYS----TYI 300
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
IGV S V + + + +D+ GT +T L +Y A+ + F +
Sbjct: 301 IGVESCCVGNSCLKMTSFKVQVDS----GTS------FTFLPGHVYGAIAEEFDQQVNGS 350
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNV------VWSIHGQNSMVQIGG 360
+P+ C+ + ++ P VP + L Q N V+ +G ++ G
Sbjct: 351 RSSFEGSPWEYCY----VPSSQELPKVPSLTLTFQQNNSFVVYDPVFVFYGNEGVI---G 403
Query: 361 DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
L + +G + IG + L FD +L +S S
Sbjct: 404 FCLAIQPTEGDMG-----TIGQNFMTGYRLVFDRGNKKLAWSRS 442
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 97/386 (25%), Positives = 152/386 (39%), Gaps = 71/386 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE----KGYV------------SSTNKTA 85
TL++V + TP P L G ++ WV C+ G+ SST
Sbjct: 141 TLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAV 200
Query: 86 RCGSAQCNLANAKACGGGICGAGPDNPI--------SNTGTHGDIRIDVLSIQSTDGRNP 137
CG QC A G +C DN + T G + D L++ S+
Sbjct: 201 HCGEPQC------AAAGDLCSE--DNTTCLYLVRYGDGSSTTGVLSRDTLALTSS----- 247
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
RA+T F F CG+ + G V G+ GLGR +++LPSQ AA+F + L S
Sbjct: 248 -RALT--GFPFGCGTRNL--GDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS 302
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G + G P D + L+ P F PS YF+ + S+ + G
Sbjct: 303 TTGYLTIGATPATDTGAAQYTAMLR-KPQF-------------PSF-YFVELVSIDIGGY 347
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+P+ + + GGT + + T L Y L F M + P A
Sbjct: 348 VLPVPPAVFT-----RGGTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDA 402
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF--VDGGVNPR 375
C+ G + + VVP + + V+ + M+ + + CL F +D G P
Sbjct: 403 CYDFA--GESEV--VVPAVSFRFGD-GAVFELDFFGVMIFLDENVGCLAFAAMDTGGLPL 457
Query: 376 TSIVIGARQLENNLLQFDLATSRLGF 401
+ +IG Q + + +D+A ++GF
Sbjct: 458 S--IIGNTQQRSAEVIYDVAAEKIGF 481
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 101/388 (26%), Positives = 146/388 (37%), Gaps = 82/388 (21%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC------------EKGY---VSSTNKTAR 86
T +YV + TP V +++ G ++ WV C +K + +S+T
Sbjct: 126 TTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFS 185
Query: 87 CGSAQCNL--ANAKACGGGIC------GAGPDNPISNT-GTHGDIRIDVLSIQSTDGRNP 137
CGSAQC C C G G SNT GT+G D LS+ S+D
Sbjct: 186 CGSAQCAQLGDEGNGCLKSQCQYIVKYGDG-----SNTAGTYGS---DTLSLTSSDA--- 234
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
V +F F C G + G+ GLG +L SQ AA + + F+ CL P
Sbjct: 235 -----VKSFQFGCSHR--AAGFVGELDGLMGLGGDTESLVSQTAATYG--KAFSYCLPPP 285
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKY--TPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
G + L + + +Y TP+ V T G + G+T V
Sbjct: 286 SSSG-----GGFLTLGAAGGASSSRYSHTPMVRFSVPTFYGV-------FLQGIT---VA 330
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
G + + ++ S + GT I+ + P + Y+AL AF M APV
Sbjct: 331 GTMLNVPASVFSGASVVDSGTVITQLPP------TAYQALRTAFKKEMKAYPSAAPVGSL 384
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG--DALCLGFVDGGVN 373
CF GF I VP + L G + I G A CL F +
Sbjct: 385 DTCFDFS--GFNTI--TVPTVTLTFS--------RGAAMDLDISGILYAGCLAFTATAHD 432
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGF 401
T I+ +Q +L FD+ +GF
Sbjct: 433 GDTGILGNVQQRTFEML-FDVGGRTIGF 459
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 69/164 (42%), Gaps = 19/164 (11%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y++G+T + V G+ + + L +D+ G GG + + T L+T Y L AF S P
Sbjct: 337 YYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTP 396
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQI----DLVLQNKNVVWSIHGQNSMVQIGG 360
+ + A F C+ L I V + L L KN + + +G
Sbjct: 397 YLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDS------VG- 449
Query: 361 DALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
C F P TS +IG Q + + +DLA S +GFS
Sbjct: 450 -TFCFAFA-----PTTSSLSIIGNVQQQGTRVHYDLANSVVGFS 487
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 80/185 (43%), Gaps = 24/185 (12%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE----------------------KGYVSSTN 82
+ T I TP V + + G ++LWV C+ + +S+T+
Sbjct: 105 HYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNTS 164
Query: 83 KTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
+ CG C++ + C +NT + G + D L + S +G++ +
Sbjct: 165 RHLPCGHKLCDVHSVCKGSKDPCPYAVQYSSANTSSSGYVFEDKLHLTS-NGKHAEQNSV 223
Query: 143 VPNFIFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
+ I CG + + L +G G+ GLG +++PS LA A ++ F++C + G
Sbjct: 224 QASIILGCGRKQTGEYLRGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSICFEENESGR 283
Query: 202 IVFGD 206
I+FGD
Sbjct: 284 IIFGD 288
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 69/164 (42%), Gaps = 19/164 (11%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y++G+T + V G+ + + L +D+ G GG + + T L+T Y L AF S P
Sbjct: 337 YYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTP 396
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQI----DLVLQNKNVVWSIHGQNSMVQIGG 360
+ + A F C+ L I V + L L KN + + +G
Sbjct: 397 YLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDS------VG- 449
Query: 361 DALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
C F P TS +IG Q + + +DLA S +GFS
Sbjct: 450 -TFCFAFA-----PTTSSLSIIGNVQQQGTRVHYDLANSVVGFS 487
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/283 (25%), Positives = 111/283 (39%), Gaps = 46/283 (16%)
Query: 136 NPGRAVTVPNFIFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
P V VP F C + G AS G+ GLGR ++L SQL A KF+ CL
Sbjct: 180 TPADQVRVPGIAFGCSNAS--SGFNASSASGLVGLGRGSLSLVSQLGAP-----KFSYCL 232
Query: 195 SPFDD--GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPS-VEYFIGVTS 251
+P+ D GP LN+ V + TP F+ PS + Y++ +T
Sbjct: 233 TPYQDTNSTSTLLLGPSASLNDTGV---VSSTP-----------FVASPSSIYYYLNLTG 278
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKVARV 309
+ + A+P+ S+ +G GG I + T+L + Y+ + A S +P
Sbjct: 279 ISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQVRAAVLSLVTLPTTDGS 338
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV-----QIGGDALC 364
A CF L T P +P + L ++V + N M+ C
Sbjct: 339 AATG-LDLCFELPSS--TSAPPSMPSMTLHFDGADMV--LPADNYMMSLSDPDSDSSLWC 393
Query: 365 LGF-----VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
L DG V ++G Q +N + +D+ L F+
Sbjct: 394 LAMQNQTDTDGVV----VSILGNYQQQNMHILYDVGKETLSFA 432
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 96/395 (24%), Positives = 165/395 (41%), Gaps = 70/395 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y+ + TP ++ + G ++ W+ C + G V SS+ + CG
Sbjct: 150 EYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVTCGDQ 209
Query: 91 QCNLANAKACGGGICGAGPDN-PI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
+C L G D+ P + T GD+ ++ ++ T PG + V
Sbjct: 210 RCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLT---APGASRRVD 266
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA--- 201
+ +F CG +GL G G+ GLGR ++ SQL A + F+ CL D G+
Sbjct: 267 DVVFGCG--HWNRGLFHGAAGLLGLGRGPLSFASQLRAVYG--HTFSYCL--VDHGSDVA 320
Query: 202 --IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
+VFG+ L L YT F + A F Y++ + V V G+
Sbjct: 321 SKVVFGED--DALALAAAHPQLNYT-AFAPASSPADTF-------YYVKLKGVLVGGE-- 368
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSI---------YKALVQAFASAMPKVARVA 310
LL+I ++ G + + T++++ Y+ + QAF M + +
Sbjct: 369 -----LLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLI 423
Query: 311 PVAP-FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFV 368
P P C+ + G R P VP++ L+ + VW +N +++ D +CL +
Sbjct: 424 PDFPVLSPCYNVS--GVDR--PEVPELSLLFAD-GAVWDFPAENYFIRLDPDGIMCLAVL 478
Query: 369 DGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
PRT + +IG Q +N + +DL +RLGF+
Sbjct: 479 G---TPRTGMSIIGNFQQQNFHVVYDLKNNRLGFA 510
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 84/377 (22%), Positives = 141/377 (37%), Gaps = 46/377 (12%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC---------EKGYVSSTNKTARCGSAQCNL 94
+Y+ + TP + G +++W C + G + + + G CN
Sbjct: 87 EYIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCN- 145
Query: 95 ANAKACGGGICGAGPDNP-----ISNTGTHGDIRIDVLSIQS-TDGRNPGRAVTVPNFIF 148
++ C AGP P + N + S+++ T G P VP F
Sbjct: 146 SSVSMCAAL---AGPSPPPGCSCMYNQTYGTGWTAGIQSVETFTFGSTPADQTRVPGIAF 202
Query: 149 LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG--AIVFGD 206
C + +G G+ GLGR ++L SQL A F+ CL+PF D
Sbjct: 203 GCSN--ASSDDWNGSAGLVGLGRGSMSLVSQLGAGM-----FSYCLTPFQDANSTSTLLL 255
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
GP LN V T F+ + A S Y++ +T + + A+ +
Sbjct: 256 GPSAALNGTGV-----LTTPFVASPSKA-----PMSTYYYLNLTGISIGTTALSIPPNAF 305
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP-KVARVAPVAPFGACFRLQDIG 325
++ +G GG I + T L + Y+ + A S + VA + CF L
Sbjct: 306 ALRTDGTGGLIIDSGTTITSLVDAAYQQVRAAIESLVTLPVADGSDSTGLDLCFALTSE- 364
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQL 385
T P +P + ++V + N M+ +G CL + V ++ G Q
Sbjct: 365 -TSTPPSMPSMTFHFDGADMVLPV--DNYMI-LGSGVWCLAMRNQTVGAMST--FGNYQQ 418
Query: 386 ENNLLQFDLATSRLGFS 402
+N L +D+ L F+
Sbjct: 419 QNVHLLYDIHEETLSFA 435
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 87/392 (22%), Positives = 146/392 (37%), Gaps = 59/392 (15%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQ------CN 93
++T Y T++ +P + V G +ILWV+C G + K+
Sbjct: 67 SSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCA-GCTACPKKSGLGMDLTLYDPNGSK 125
Query: 94 LANAKACGGGICGAGPDNPISN----------------TGTHGDIRIDVLSIQSTDGRNP 137
+NA CG G C PIS + T G D L+ G N
Sbjct: 126 TSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSG-NL 184
Query: 138 GRAVTVPNFIFLCGSE---FVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
+ IF CG++ + + GI G G++ ++ SQLAA+ +KR F+ CL
Sbjct: 185 HTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCL 244
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
G I F + + ++ K NT L Y + + + V
Sbjct: 245 DSHHGGGI------------FSIGQVME------PKFNTTP--LVPRMAHYNVILKDMDV 284
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
+G+ P+ L D+ GT I + L SIY L+ P + +
Sbjct: 285 DGE--PILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQ 342
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
F CF D ++ P + + ++ ++H + + D C+G+
Sbjct: 343 F-TCFHYSD----KLDEGFPVVKFHFEGLSL--TVHPHDYLFLYKEDIYCIGWQKSSTQT 395
Query: 375 ---RTSIVIGARQLENNLLQFDLATSRLGFSD 403
R I+IG L N L+ +DL +G+++
Sbjct: 396 KEGRDLILIGDLVLSNKLVVYDLENMVIGWTN 427
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/278 (25%), Positives = 114/278 (41%), Gaps = 52/278 (18%)
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
G +VP F CG F S GIAG GR ++LPSQL F+ C +
Sbjct: 86 GAGASVPGVAFGCG-LFNNGVFKSNETGIAGFGRGPLSLPSQLKVG-----NFSHCFTAV 139
Query: 198 D--DGAIVFGDGPYYDLNNFDVSKN----LKYTPLFINKVNTASGFLGEPSVEYFIGVTS 251
+ + V D P D+ KN ++ TPL N N P+ Y++ +
Sbjct: 140 NGLKQSTVLLDLPA------DLYKNGRGAVQSTPLIQNSAN--------PTF-YYLSLKG 184
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+ V +P+ ++ ++ N G GGT I + T L +Y+ + FA+ + P
Sbjct: 185 ITVGSTRLPVPESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI-----KLP 238
Query: 312 VAPFGA-----CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA---- 362
V P A CF ++ P VP+ LVL + + +N + ++ DA
Sbjct: 239 VVPGNATGPYTCFSAP----SQAKPDVPK--LVLHFEGATMDLPRENYVFEVPDDAGNSI 292
Query: 363 LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLG 400
+CL G + +IG Q +N + +DL G
Sbjct: 293 ICLAINKG----DETTIIGNFQQQNMHVLYDLQNMHRG 326
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 69/284 (24%), Positives = 110/284 (38%), Gaps = 44/284 (15%)
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
T G P VP F C + G AS G+ GLGR +++L SQL KF
Sbjct: 191 TFGSTPAGQSRVPGIAFGCST--ASSGFNASSASGLVGLGRGRLSLVSQLGVP-----KF 243
Query: 191 ALCLSPFDD--GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE---- 244
+ CL+P+ D GP LN + + TP F+ PS
Sbjct: 244 SYCLTPYQDTNSTSTLLLGPSASLNG---TAGVSSTP-----------FVASPSTAPMNT 289
Query: 245 -YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS-- 301
Y++ +T + + A+ + ++ +G GG I + T+L + Y+ + A S
Sbjct: 290 FYYLNLTGISLGTTALSIPPDAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV 349
Query: 302 AMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD 361
+P A CF L T P +P + L ++V + + M+
Sbjct: 350 TLPTTDGSAATG-LDLCFMLPSS--TSAPPAMPSMTLHFNGADMV--LPADSYMMSDDSG 404
Query: 362 ALCLGF---VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
CL DG VN ++G Q +N + +D+ L F+
Sbjct: 405 LWCLAMQNQTDGEVN-----ILGNYQQQNMHILYDIGQETLSFA 443
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 70/267 (26%), Positives = 110/267 (41%), Gaps = 35/267 (13%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG-----YVSSTNKTARCGSAQCNLANAKA 99
YV TP V + + +++W C S+T C C +
Sbjct: 100 YVFSYGIGTPPQQVSGALDISSDLVWTACGATAPFNPVRSTTVADVPCTDDACQQFAPQT 159
Query: 100 CGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGL 159
CG G GAG + + T +G + + T+ G + +F CG + V G
Sbjct: 160 CGAG-AGAG-SSECAYTYMYGGGAANTTGLLGTEAFTFGD-TRIDGVVFGCGLQNV--GD 214
Query: 160 ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD----DGAIVFGDGPYYDLNNF 215
SGV G+ GLGR ++L SQL +F+ +P D I+FGD +
Sbjct: 215 FSGVSGVIGLGRGNLSLVSQLQV-----DRFSYHFAPDDSVDTQSFILFGD------DAT 263
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDN-EGVG 274
+ + T L + N PS+ Y++ + + V+GK + + + N +G G
Sbjct: 264 PQTSHTLSTRLLASDAN--------PSL-YYVELAGIQVDGKDLAIPSGTFDLRNKDGSG 314
Query: 275 GTKISTVNPYTVLETSIYKALVQAFAS 301
G +S + TVLE + YK L QA AS
Sbjct: 315 GVFLSITDLVTVLEEAAYKPLRQAVAS 341
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 87/368 (23%), Positives = 133/368 (36%), Gaps = 64/368 (17%)
Query: 39 DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSST---NKTARCGSAQCNLA 95
D TL YV TP V + V G ++ WV C+ + + K AQ +
Sbjct: 134 DIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSY 193
Query: 96 NAKACGGGIC-GAGPDNPISNTG-------THGDIRIDVLSIQSTDGRNPGRAVTVPNFI 147
A CGG +C G G + + ++GD + + S+D + V F
Sbjct: 194 AAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGS-NTTGVYSSDTLTLSASSAVQGFF 252
Query: 148 FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG-D 206
F CG GL +GV G+ GLGR + +L Q A + + L P G + G
Sbjct: 253 FGCG--HAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGLG 310
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKAVPLNKT 264
GP F ++ L P+ Y + +T + V G+ + + +
Sbjct: 311 GPSGAAPGFSTTQ-----------------LLPSPNAPTYYVVMLTGISVGGQQLSVPAS 353
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----FGACFR 320
+ GGT + T T L + Y AL AF S M P AP C+
Sbjct: 354 AFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGY--PTAPSNGILDTCYN 405
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL----CLGFVDGGVNPRT 376
G + P + L G + V +G D + CL F G +
Sbjct: 406 FAGYGTVTL----PNVALTF----------GSGATVMLGADGILSFGCLAFAPSGSDGGM 451
Query: 377 SIVIGARQ 384
+I+ +Q
Sbjct: 452 AILGNVQQ 459
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 143/386 (37%), Gaps = 73/386 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--GYV------------SSTNKTARC 87
T YV I+ TP + G + WV C+ Y S+T C
Sbjct: 162 TGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISC 221
Query: 88 GSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
S+ C+ + + C GG C G G + T G D L++ G +
Sbjct: 222 TSSYCSDLDTRGCSGGHCLYAVQYGDG-------SYTVGFYAQDTLTL----GYD----- 265
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
TV +F F CG + +GL G+ GLGR K ++P Q +S FA C+ A
Sbjct: 266 TVKDFRFGCGEKN--RGLFGKAAGLMGLGRGKTSVPVQAYDKYS--GVFAYCIP-----A 316
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
G G + N + TP+ ++ T Y++G+T + V G + +
Sbjct: 317 TSSGTGFLDFGPGAPAAANARLTPMLVDNGPTF----------YYVGMTGIKVGGHLLSI 366
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA-RVAPVAPFGACFR 320
T+ S G + + T L S Y+ L AFA M + + AP F
Sbjct: 367 PATVFSD-----AGALVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKTAPA------FS 415
Query: 321 LQDIGFTRIGP----VVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT 376
+ D + G +P + LV Q + + CL F +
Sbjct: 416 ILDTCYDLTGYQGSIALPAVSLVFQG-GACLDVDASGILYVADVSQACLAFAANDDDTDM 474
Query: 377 SIVIGARQLENNLLQFDLATSRLGFS 402
+IV +Q ++L +DL +GF+
Sbjct: 475 TIVGNTQQKTYSVL-YDLGKKVVGFA 499
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 128/326 (39%), Gaps = 67/326 (20%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNL---------- 94
Y T+++ TP + V G ++LWV C +S N + Q L
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSC-----ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 95 -----ANAKACGGGICGAGPDNPISN------------TGTHGDIRIDVLSIQSTDGRNP 137
+ + C GI + + N +GT G DVL G +
Sbjct: 136 ASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 138 GRAVTVPNFIFLCGSEFVLQGLAS--GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS 195
T P +F C + + S V GI G G+ +++ SQLA+ R F+ CL
Sbjct: 196 VPNSTAP-VVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLK 254
Query: 196 PFDDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
+ G +V G+ V N+ +TPL ++ + Y + + S+
Sbjct: 255 GENGGGGILVLGE---------IVEPNMVFTPLVPSQPH------------YNVNLLSIS 293
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
VNG+A+P+N ++ S N GT I T L + Y V+A +A+ + R PV
Sbjct: 294 VNGQALPINPSVFSTSNG--QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR--PVV 349
Query: 314 PFG-ACFRLQDIGFTRIGPVVPQIDL 338
G C+ + T +G + P + L
Sbjct: 350 SKGNQCYVIT----TSVGDIFPPVSL 371
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 94/411 (22%), Positives = 149/411 (36%), Gaps = 84/411 (20%)
Query: 37 AKDTATLQYVTQIKQRTPLVPVK-------------------------LTVHLGGNILWV 71
AKD A +QY + + R +VP+ L + + W+
Sbjct: 64 AKDQARMQYFSSLVARKSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWI 123
Query: 72 DCEKGYVSSTNK-----------TARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHG 120
C ST+K CGS C CGG C T+G
Sbjct: 124 PCSGCVGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFN--------FTYG 175
Query: 121 DIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQL 180
I +Q T A +P + F C ++ G ++ G+ GLGR ++L SQ
Sbjct: 176 SSSIAASVVQDTLTL---AADPIPGYTFGCVNK--TTGSSAPQQGLLGLGRGPLSLLSQS 230
Query: 181 AAAFSLKRKFALCLSPFDDGAIVFGD----GPYYDLNNFDVSKNLKYTPLFINKVNTASG 236
+ K F+ CL F +I F GP Y K +KYTPL N ++
Sbjct: 231 QNLY--KSTFSYCLPSFK--SINFSGSLRLGPVYQ------PKRIKYTPLLRNPRRSS-- 278
Query: 237 FLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALV 296
Y++ + ++ V K V + L+ + GT + +T L +Y A+
Sbjct: 279 -------LYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVR 331
Query: 297 QAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV 356
F + V + F C+ + VVP I + NV ++ N ++
Sbjct: 332 NEFRRRVGPKLPVTTLGGFDTCYNVPI--------VVPTITFLFSGMNV--ALPPDNIVI 381
Query: 357 Q-IGGDALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFSDSL 405
G CL N + + VI Q +N+ + FD+ SR+G + L
Sbjct: 382 HSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIAREL 432
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 66/273 (24%), Positives = 113/273 (41%), Gaps = 37/273 (13%)
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVF 204
IF C + + GI GLGR ++++ QL + F+LC D GA+V
Sbjct: 191 IFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVL 250
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
G L D+ +F N S + Y I + +HV GKA+ +
Sbjct: 251 GGM----LAPPDM--------IFSNSDPLRSPY-------YNIELKEIHVAGKALRVESR 291
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF--GACFRLQ 322
+ + + GT + + Y L + A +A S + + ++ P CF
Sbjct: 292 IFNSKH----GTVLDSGTTYAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYKDICFAGA 347
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV---QIGGDALCLGFVDGGVNPRTSIV 379
+++ V P +D+V N + S+ +N + ++ G A CLG G +P T +
Sbjct: 348 GRNVSKLHEVFPDVDMVFGNGQKL-SLTPENYLFRHSKVDG-AYCLGVFQNGKDPTT--L 403
Query: 380 IGARQLENNLLQFDLATSRLGFSD---SLLFER 409
+G + N L+ +D ++GF S L+ER
Sbjct: 404 LGGIIVRNTLVTYDRHNEKIGFWKTNCSELWER 436
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 139/372 (37%), Gaps = 60/372 (16%)
Query: 62 VHLGGNILWVDC--------EKGY----------VSSTNKTARCGSAQCNLANAKACGGG 103
V G + LWV+C + G +S T+K C C G
Sbjct: 91 VDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCDDEFCTSTYDGQISG- 149
Query: 104 ICGAGPDNPISNTGTHGDIRIDVLSIQSTD---GRNPGRAVTVPN---FIFLCGSE---F 154
C G P S T +GD S D R G TVP+ IF CGS+
Sbjct: 150 -CTKGMSCPYSIT--YGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGT 206
Query: 155 VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNN 214
+ + + GI G G++ ++ SQLAAA +KR F+ CL G I F G
Sbjct: 207 LSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSISGGGI-FAIGEV----- 260
Query: 215 FDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVG 274
V +K TPL + Y + + + V G + L +L D+
Sbjct: 261 --VQPKVKTTPLL------------QGMAHYNVVLKDIEVAGDPIQLPSDIL--DSSSGR 304
Query: 275 GTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVP 334
GT I + L SIY L++ A ++ V CF D + + P
Sbjct: 305 GTIIDSGTTLAYLPVSIYDQLLEKIL-AQRSGMKLYLVEDQFTCFHYSD--EESVDDLFP 361
Query: 335 QIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS---IVIGARQLENNLLQ 391
+ + + + + + ++ + D C+G+ + I++G L N L+
Sbjct: 362 TVKFTFE-EGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANKLVV 420
Query: 392 FDLATSRLGFSD 403
+DL +G++D
Sbjct: 421 YDLDNMAIGWAD 432
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 94/408 (23%), Positives = 157/408 (38%), Gaps = 68/408 (16%)
Query: 24 QTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNK 83
Q + P + +R + D L+YV + TP PV + G +++W C
Sbjct: 84 QRTTPPTGVSVRPSGD---LEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQP 140
Query: 84 TARCGSAQCNLANAKACGGGICG-------AGPD------NPISNTGTHGDIRIDVLSIQ 130
+ C G +C PD N T T G + +
Sbjct: 141 DPLFAPGESASYEPMRCAGQLCSDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFT 200
Query: 131 STDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
S+ G R +TVP F CGS V G + GI G GR+ ++L SQL+ R+F
Sbjct: 201 SSGG---DRLMTVP-LGFGCGSMNV--GSLNNGSGIVGFGRNPLSLVSQLSI-----RRF 249
Query: 191 ALCLSPFDDG---AIVFG--DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
+ CL+ + G ++FG G Y D + ++ TPL + L P+ Y
Sbjct: 250 SYCLTSYGSGRKSTLLFGSLSGGVYG----DATGPVQTTPLLQS--------LQNPTF-Y 296
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
++ + + V + + + ++ ++ +G GG + + T+L ++ +V+AF +
Sbjct: 297 YVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRL 356
Query: 306 VARVAPVAPFGACFRLQDIGFTRIGPV----VPQI-------DLVLQNKNVVWSIHGQNS 354
G CF L + R VP++ DL L +N V H +
Sbjct: 357 PFANGGNPEDGVCF-LVPAAWRRSSSTSQVPVPRMVFHFQDADLDLPRRNYVLDDHRKGR 415
Query: 355 MVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
LCL D G + T IG ++ + +DL L F+
Sbjct: 416 --------LCLLLADSGDDGST---IGNLVQQDMRVLYDLEAETLSFA 452
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 149/382 (39%), Gaps = 69/382 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC----------EKGYV-----SSTNKTAR 86
TL+YV + TP V +T+ G ++ WV C + G + SST +
Sbjct: 124 TLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVS 183
Query: 87 CGSAQCNLANAKACGGGI----CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C +A+C + G G C G +T T+G D L++ G +
Sbjct: 184 CAAAECAQLEQQGNGCGATNYECQYGVQYGDGST-TNGTYSRDTLTLS-------GASDA 235
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
V F F C V G + G+ GLG +L SQ AAA+ F+ CL P +
Sbjct: 236 VKGFQFGC--SHVESGFSDQTDGLMGLGGGAQSLVSQTAAAYG--NSFSYCLPPTSGSSG 291
Query: 203 VFGDGPYYDLNNFDVSKNL--KYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
G ++ F ++ L + P F Y + + V GK +
Sbjct: 292 FLTLGGGGGVSGFVTTRMLRSRQIPTF-----------------YGARLQDIAVGGKQLG 334
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP-FGACF 319
L+ ++ + + GT I+ + P + Y AL AF + M K R AP CF
Sbjct: 335 LSPSVFAAGSVVDSGTIITRLPP------TAYSALSSAFKAGM-KQYRSAPARSILDTCF 387
Query: 320 RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIV 379
G T+I +P + LV +I + + G CL F G + T I+
Sbjct: 388 DF--AGQTQIS--IPTVALVFSGG---AAIDLDPNGIMYGN---CLAFAATGDDGTTGII 437
Query: 380 IGARQLENNLLQFDLATSRLGF 401
+Q +L +D+ +S LGF
Sbjct: 438 GNVQQRTFEVL-YDVGSSTLGF 458
>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 598
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 57/253 (22%), Positives = 103/253 (40%), Gaps = 46/253 (18%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVS------ 218
G+ G G ++ PSQ + F+ CL P Y +NF +
Sbjct: 377 GLVGFGCGPLSFPSQNKDVYGFV--FSYCL-------------PSYKSSNFSSTLRLGPA 421
Query: 219 ---KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
K +K TPL N PS+ Y++ + +HV G+ + + + L+ D G
Sbjct: 422 GQPKRIKMTPLLSNP--------HRPSL-YYVNMVGIHVGGRPMLVPASALAFDPASGRG 472
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQ 335
T + +T L +Y A+ F S + + P+ F C+ + VP
Sbjct: 473 TIVDAGTMFTRLSAPVYAAVRDVFRSRV-RAPVTGPLGGFDTCY--------NVTISVPT 523
Query: 336 IDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSI--VIGARQLENNLLQF 392
+ + V ++ +N +++ D + CL G + ++ V+ + Q +N+ + F
Sbjct: 524 VTFSFDGRVSV-TLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLF 582
Query: 393 DLATSRLGFSDSL 405
D+A R+GFS L
Sbjct: 583 DVANGRVGFSREL 595
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 92/383 (24%), Positives = 139/383 (36%), Gaps = 71/383 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC----------EKGYVSSTNKTARCGSAQ 91
T +YV + TP V +++ G ++ WV C +K + K+A +
Sbjct: 127 TPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFS 186
Query: 92 CNLANAKACGGGICGAGPDNP--------ISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
C+ A GG G G N + ++ T G D L + ++D V
Sbjct: 187 CSSAQCAQLGGE--GNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSDA--------V 236
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
NF F C G + G+ GLG +L SQ AA + + F+ CL P A
Sbjct: 237 KNFQFGCSHR--ANGFVGQLDGLMGLGGDTESLVSQTAATYG--KAFSYCLPPSSSSA-- 290
Query: 204 FGDGPYYDLN---NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
G + L S TPL V T Y + + ++ V G +
Sbjct: 291 ---GGFLTLGAAAGGTSSSRYSRTPLVRFNVPTF----------YGVFLQAITVAGTKLN 337
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
+ ++ S G + + + T L + Y+AL AF M APV CF
Sbjct: 338 VPASVFS------GASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFD 391
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG--DALCLGFVDGGVNPRTSI 378
I R VP + L V+ + + G A CL F + T I
Sbjct: 392 FSGIKTVR----VPVVTLTFSRGAVM--------DLDVSGIFYAGCLAFTATAQDGDTGI 439
Query: 379 VIGARQLENNLLQFDLATSRLGF 401
+ +Q +L FD+ S LGF
Sbjct: 440 LGNVQQRTFEML-FDVGGSTLGF 461
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 64/275 (23%), Positives = 108/275 (39%), Gaps = 41/275 (14%)
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDD 199
T+P+F+ C + Q GIAG GRS +LPSQL +KF+ CL FDD
Sbjct: 205 TIPDFLVGCSIFSIKQP-----EGIAGFGRSPESLPSQLGL-----KKFSYCLVSHAFDD 254
Query: 200 ----GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
+V G + + L +TP N + Y++ + ++ +
Sbjct: 255 TPTSSDLVLDTG---SGSGVTKTAGLSHTPFLKNPTTAFRDY-------YYVLLRNIVIG 304
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK---VARVAPV 312
V + L +G GGT + + +T +E +Y+ + + F M + +
Sbjct: 305 DTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNL 364
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNK-NVVWSIHGQNSMVQIGGDALCLGFVDGG 371
C+ + G + VP DL+ Q K ++ N + +CL V
Sbjct: 365 TGLRPCYNIS--GEKSLS--VP--DLIFQFKGGAKMALPLSNYFSIVDSGVICLTIVSDN 418
Query: 372 V-----NPRTSIVIGARQLENNLLQFDLATSRLGF 401
V +I++G Q N ++FDL + GF
Sbjct: 419 VAGPGLGGGPAIILGNYQQRNFYVEFDLENEKFGF 453
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 88/392 (22%), Positives = 157/392 (40%), Gaps = 68/392 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDC-------------------EKGYVSSTNKTA 85
Y T+++ TP + + G ++LWV C + G S T
Sbjct: 52 YYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPG-SSPTASLI 110
Query: 86 RCGSAQCNL---ANAKACGG--GICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA 140
C +C+L ++ C +CG +GT G D+L + G +
Sbjct: 111 SCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYG-DGSGTSGYYVSDLLHFDTVLGGSVMNN 169
Query: 141 VTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
+ P +F C + L V GI G G+ +++ SQLA+ R F+ CL D
Sbjct: 170 SSAP-IVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDD 228
Query: 199 DGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
G +V G+ V N+ YTPL ++ + Y + + S+ VNG
Sbjct: 229 SGGGILVLGE---------IVEPNIVYTPLVPSQPH------------YNLNMQSISVNG 267
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
+ + ++ ++ + GT I + L + Y + A S + R P G
Sbjct: 268 QTLAIDPSVFGTSSS--QGTIIDSGTTLAYLAEAAYDPFISAITSIVSPSVR--PYLSKG 323
Query: 317 -ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGFVDGG 371
C+ + + I + PQ+ L + I Q+ ++Q IGG AL C+GF
Sbjct: 324 NHCYLIS----SSINDIFPQVSLNFAGGASMILIP-QDYLIQQSSIGGAALWCIGFQK-- 376
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
+ + ++G L++ + +D+A R+G+++
Sbjct: 377 IQGQGITILGDLVLKDKIFVYDIANQRIGWAN 408
>gi|413922180|gb|AFW62112.1| putative aspartic protease family protein [Zea mays]
Length = 222
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/227 (22%), Positives = 93/227 (40%), Gaps = 43/227 (18%)
Query: 189 KFALCLSPFDDGAIVF---GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
+F+ C+S DD ++ D P+ LN YTPL+ + + V Y
Sbjct: 16 RFSYCISDRDDAGVLLLGNSDLPFLPLN---------YTPLY--QPTPPLPYFDR--VAY 62
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
+ + + V GK +P+ ++L+ D+ G G T + + +T L Y A+ F
Sbjct: 63 SVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEF------ 116
Query: 306 VARVAPVAP------------FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQN 353
+ + P+ P F CFR+ G +P + L+ + S+ G
Sbjct: 117 LKQTKPLLPALEDPSFAFQEAFDTCFRVPK-GRPPPSARLPPVTLLFNGAQM--SVAGDR 173
Query: 354 SMVQIGGD------ALCLGFVDGGVNPRTSIVIGARQLENNLLQFDL 394
+ ++ G+ CL F + + P T+ VIG N +++DL
Sbjct: 174 LLYKVPGERRGAEGVWCLTFGNADMVPLTAYVIGHHHQMNLWVEYDL 220
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 133/342 (38%), Gaps = 58/342 (16%)
Query: 79 SSTNKTARCGSAQCNL-ANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNP 137
SST+K C S+ C + GG C + NT T G + DVL + + +
Sbjct: 168 SSTSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITDNDDQT 227
Query: 138 GRAVTVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
A P F CG + L G A G+ GLG S V++PS LA F++C
Sbjct: 228 QHAN--PLITFGCGQVQTGAFLDGAAPN--GLFGLGMSDVSVPSILAKQGLTSNSFSMCF 283
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
+ G I FGD + S + TP I ++ Y I VT + V
Sbjct: 284 AADGLGRITFGDN--------NSSLDQGKTPFNIRPSHST----------YNITVTQIIV 325
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV----A 310
G + L + T +T L YK + Q+F S + K+ R +
Sbjct: 326 GGNSADLEFNAI-----------FDTGTSFTYLNNPAYKQITQSFDSKI-KLQRHSFSNS 373
Query: 311 PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG---DALCLGF 367
PF C+ L+ T VP I+L ++ + + + + ++ GG LCL
Sbjct: 374 DDLPFEYCYDLR----TNQTIEVPNINLTMKGGDNYFVM---DPIITSGGGNNGVLCLAV 426
Query: 368 V-DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFE 408
+ VN +IG + + FD LG+ +S ++
Sbjct: 427 LKSNNVN-----IIGQNFMTGYRIVFDRENMTLGWKESNCYD 463
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 96/402 (23%), Positives = 143/402 (35%), Gaps = 99/402 (24%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSAQCNLANAKA 99
TP P + + G ++W C + SST + CG+ C
Sbjct: 51 TPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPTSN 110
Query: 100 CGGGICGAGPDNPISNTGTHGDIRID---VLSIQSTDGRNPGRAVTVPNFIFLCGSEFVL 156
C G +C + +IR+D L I T+ G A F + S+
Sbjct: 111 CSGDVC---------TYESTTNIRLDRHTTLGIVGTETFAIGTATASLAFGCVVASDI-- 159
Query: 157 QGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFD 216
G G GLGR+ P L A L KF+ CLSP G
Sbjct: 160 -DTMDGTSGFIGLGRT----PRSLVAQMKLT-KFSYCLSPRGTG---------------- 197
Query: 217 VSKNLKYTPLFINKVNTASGFLGEP-SVEYFIGVTSVHVNGKAVPLNKT----LLSIDNE 271
K + LF+ +G GE S FI K P + + LLS+D
Sbjct: 198 -----KSSRLFLGSSAKLAG--GESTSTAPFI---------KTSPDDDSHHYYLLSLDAI 241
Query: 272 GVGGTKIST-----------VNPYTVLETSIYKALVQAFASAM---PKVARVAPVAPFGA 317
G T I+T V+P+++L S Y+A +A A+ P PF
Sbjct: 242 RAGNTTIATAQSGGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDL 301
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG--GDALCLGFVDGGVNPR 375
CF+ + GF+R P + Q ++ ++ +G D C + R
Sbjct: 302 CFK-KAAGFSRA--TAPDLVFTFQGGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNR 358
Query: 376 TSI----VIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
T + V+G+ Q EN +DL ++L FE A C+
Sbjct: 359 TGLEGVSVLGSLQQENVHFLYDLK------KETLSFEPADCS 394
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 95/401 (23%), Positives = 153/401 (38%), Gaps = 77/401 (19%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG----------YVSSTNKTARCGSAQ 91
T QY ++ TP L G + WV C G S + C S
Sbjct: 88 TGQYFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGLVFRPEASKSWAPVPCSSDT 147
Query: 92 C------NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRN----PGRAV 141
C +LAN + +P S + + L + TD G+
Sbjct: 148 CKLDVPFSLANCSSSA---------SPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVA 198
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC--LSPFD- 198
+ + + C S Q S V G+ LG +K++ S+ AA F + L L+P +
Sbjct: 199 QLQDVVLGCSSTHDGQSFKS-VDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNA 257
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF-IGVTSVHVNGK 257
G + FG G + TP K+ FL +P++ ++ + V +VHV G+
Sbjct: 258 TGYLAFGPG------------QVPRTPATQTKL-----FL-DPAMPFYGVKVDAVHVAGQ 299
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
A+ + + + GG + + TVL T YKA+V A + V +V PF
Sbjct: 300 ALDIPAEVWDPKS---GGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVD-FPPFEH 355
Query: 318 CFRLQDIGFTRIGPVVPQI-DLVLQNKNVV-WSIHGQNSMVQIGGDALCLGFVDG---GV 372
C+ +T P P+I L +Q ++ ++ + C+G +G GV
Sbjct: 356 CYN-----WTAPRPGAPEIPKLAVQFTGCARLEPPAKSYVIDVKPGVKCIGLQEGEWPGV 410
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
+ VIG + +L +FDL + F S TCT
Sbjct: 411 S-----VIGNIMQQEHLWEFDLKNMEVRFMPS------TCT 440
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 90/387 (23%), Positives = 147/387 (37%), Gaps = 75/387 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC---EKGYV----------SSTNKTARCGSA 90
+Y+ + TP P+ G ++LW C + Y SST K C S+
Sbjct: 89 EYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148
Query: 91 QCN-LANAKACGGGICGAGPDNPIS-------NTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
QC L N +C DN S N+ T G+I +D L++ S+D R +
Sbjct: 149 QCTALENQASCSTN------DNTCSYSLSYGDNSYTKGNIAVDTLTLGSSD----TRPMQ 198
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
+ N I CG GI GLG V+L QL S+ KF+ CL P
Sbjct: 199 LKNIIIGCGHNNA-GTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPL----- 250
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE-------YFIGVTSVHVN 255
+ D + + + N + + SG + P + Y++ + S+ V
Sbjct: 251 ---------TSKKDQTSKINFG---TNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVG 298
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
K + + + + I + T+L T Y L A AS++ + P +
Sbjct: 299 SKQIQYSGSDSESSEGNI---IIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGL 355
Query: 316 GACFRLQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
C ++ G + VP I + +V + N+ VQ+ D +C F +P
Sbjct: 356 SLC-------YSATGDLKVPVITMHFDGADV--KLDSSNAFVQVSEDLVCFAFRG---SP 403
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGF 401
SI Q+ N L+ +D + + F
Sbjct: 404 SFSIYGNVAQM-NFLVGYDTVSKTVSF 429
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 98/414 (23%), Positives = 154/414 (37%), Gaps = 65/414 (15%)
Query: 31 ALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTAR---- 86
A+ L T T QY + + TP P L G ++ WV C +G S ++ TA
Sbjct: 96 AMPLSSGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKC-RGAASPSHATATASPA 154
Query: 87 -------------------------CGSAQC------NLANAKACGGGICGAGPDNPIS- 114
C S C +LAN C D +
Sbjct: 155 AAPSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLAN---CSSSTAACSYDYRYND 211
Query: 115 NTGTHGDIRID----VLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLG 170
N+ G + D LS G R + + C + QG + G+ LG
Sbjct: 212 NSAARGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASD-GVLSLG 270
Query: 171 RSKVALPSQLAAAFSLKRKFALC--LSPFDDGA-IVFGDGPYYDLNNFDVSKNLKYTPLF 227
S ++ S+ A+ F + + L L+P + + + FG GP D + TPL
Sbjct: 271 YSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGP--DAASSSAPAPGSRTPLL 328
Query: 228 INKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVL 287
++ Y + V SV V+G A+ + + + + G GT I + TVL
Sbjct: 329 LD---------ARVRPFYAVAVDSVSVDGVALDIPAEVWDVGSNG--GTIIDSGTSLTVL 377
Query: 288 ETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVW 347
T YKA+V A + + + RVA + PF C+ G VP++ + +
Sbjct: 378 ATPAYKAVVAALSEQLAGLPRVA-MDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARL- 435
Query: 348 SIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
++ ++ C+G V G P S VIG + +L +FDL L F
Sbjct: 436 EPPAKSYVIDAAPGVKCIG-VQEGAWPGVS-VIGNILQQEHLWEFDLNNRWLRF 487
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 67/296 (22%), Positives = 125/296 (42%), Gaps = 32/296 (10%)
Query: 115 NTGTHGDIRIDVLSIQ---STDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGR 171
++ T GD ++ ++ ST G++ R V N +F CG +GL G G+ GLGR
Sbjct: 284 SSNTTGDFALETFTVNLTSSTTGKSEFRRVE--NVMFGCGH--WNRGLFHGAAGLLGLGR 339
Query: 172 SKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKV 231
++ SQL + + F+ CL D V + + + L +T L K
Sbjct: 340 GPLSFSSQLQSLYG--HSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKE 397
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
N F Y++ + S+ V G+ + + + ++ +G GGT I + +
Sbjct: 398 NPVDTF-------YYLQIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPA 450
Query: 292 YKALVQAFASAMPKVARVAPVAPFGACFRL---QDIGFTRIGPVVPQIDLVLQNKNVVWS 348
Y+ + +AF + V C+ + ++ F L+ VW+
Sbjct: 451 YRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPEF--------LIQFADGAVWN 502
Query: 349 IHGQNSMVQIGG-DALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
+N ++I D +CL + P++++ +IG Q +N + +D SRLG++
Sbjct: 503 FPVENYFIRIQQLDIVCLAMLG---TPKSALSIIGNYQQQNFHILYDTKNSRLGYA 555
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 71/302 (23%), Positives = 111/302 (36%), Gaps = 55/302 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y+ +I TP V G +++W C +K + S++ K C S
Sbjct: 90 EYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQ 149
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNPGRAVTV 143
QC L + +C + P + +GD I + L++ S N G+ ++
Sbjct: 150 QCRLLDTVSC------SQPQKLCDFSYGYGDGSLAQGVIATETLTLNS----NSGQPXSI 199
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD----- 198
N +F CG +G+ G G ++L SQ+ + RKF+ CL PF
Sbjct: 200 XNIVFGCGHNNS-GTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSI 258
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
I+F GP +++ V TPL T YF+ + + V K
Sbjct: 259 TSKIIF--GPEAEVSGSXVVS----TPLVTKDDPTY----------YFVTLDGISVGDKL 302
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
P + S G I P T+L Y LVQ A+P P C
Sbjct: 303 FPFSS---SSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLC 359
Query: 319 FR 320
+R
Sbjct: 360 YR 361
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 69/287 (24%), Positives = 113/287 (39%), Gaps = 50/287 (17%)
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--- 198
T+ NF+ C + + L+S +AG GRS +LP Q+ +KFA CL+ D
Sbjct: 198 TIRNFLLGCTTS-AARELSSDA--LAGFGRSMFSLPIQMGV-----KKFAYCLNSHDYDD 249
Query: 199 ---DGAIV--FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
G ++ + DG +K L YTP K AS F Y +GV +
Sbjct: 250 TRNSGKLILDYRDG---------KTKGLSYTPFL--KSPPASAFY------YHLGVKDIK 292
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKI-STVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
+ K + + L+ ++G G I S + ++K + M K R
Sbjct: 293 IGNKLLRIPSKYLAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEA 352
Query: 313 AP---FGACFRLQDIGFTRIGPVVPQI----DLVLQNKNVVWSIHGQNSMVQIGGDALCL 365
C+ +I P++ Q ++V+ KN + I Q S+ D
Sbjct: 353 ETQTGLTPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNY-FGISPQESLACFLMDTNGT 411
Query: 366 GFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
++ + P SI++G Q + +++DL R G F R TC
Sbjct: 412 NALE--ITPDPSIILGNSQHVDYYVEYDLKNDRFG------FRRQTC 450
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 112/298 (37%), Gaps = 48/298 (16%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---KGY----------VSSTNKTAR 86
T T ++ QI P + L + W+ C+ K Y SS+
Sbjct: 182 TGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYTLLS 241
Query: 87 CGSAQCNLANAKACGG-GICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
C + CNL +C G C T T G + + +S +S+ V
Sbjct: 242 CETKHCNLLPNSSCSDDGYCRYNITYK-DGTNTEGVLINETVSFESSGW--------VDR 292
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
C ++ QG G G GLGR ++ PS++ A+ + CL DG
Sbjct: 293 VSLGCSNK--NQGPFVGSDGTFGLGRGSLSFPSRINAS-----SMSYCLVESKDGY---- 341
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKAVPLNK 263
+ N+ S ++K + L P E Y++G+ + V G+ + +
Sbjct: 342 SSSTLEFNSPPCSGSVK------------AKLLQNPKAENLYYVGLKGIKVGGEKIDVPN 389
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
+ +ID G GG +S+ + T+LE Y + AF + + R+ F C+ L
Sbjct: 390 STFTIDPYGNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQFDTCYNL 447
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 112/302 (37%), Gaps = 80/302 (26%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV---------------SSTNKTAR 86
T QYV + TP V + V G ++ WV C+ SST
Sbjct: 140 TFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVP 199
Query: 87 CGSAQCNLAN--AKACGGGICG--AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
CG+ C+ C G CG + + TG +G D L++ PG T
Sbjct: 200 CGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGS---DTLALA------PGN--T 248
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
V F+F CG G+ +G+ G+ LGR ++L SQ A A+ F+ CL
Sbjct: 249 VGTFLFGCG--HAQAGMFAGIDGLLALGRQSMSLKSQAAGAYG--GVFSYCLP------- 297
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF--------LGEPSVEYFIGVTSVHV 254
SK L + +ASGF P+ Y + +T + V
Sbjct: 298 ---------------SKQSAAGYLTLGGPTSASGFATTGLLTAWAAPTF-YMVMLTGISV 341
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
G+ V + + + GGT + T T L + Y AL AF A +AP
Sbjct: 342 GGQQVAVPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRGA---------IAP 386
Query: 315 FG 316
+G
Sbjct: 387 YG 388
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/389 (22%), Positives = 141/389 (36%), Gaps = 63/389 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV------------------SSTNKTAR 86
Y +I TP + V G +I+WV+C + S T K
Sbjct: 78 YYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLVP 137
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
C C N G C A P + T G DV+ G A
Sbjct: 138 CDQEFCYEINGGQLPG--CTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAA 195
Query: 142 TVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
+ IF CG S + + GI G G+S ++ SQLA +K+ FA CL +
Sbjct: 196 N-GSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCLDGTN 254
Query: 199 DGAI-VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G I V G V + TPL N+ + Y + +T+V V +
Sbjct: 255 GGGIFVIG---------HVVQPKVNMTPLIPNQPH------------YNVNMTAVQVGHE 293
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+ L + + G I + L +YK LV S P + +V V
Sbjct: 294 FLSLPTDVFEAGDR--KGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDL-KVHTVRDEYT 350
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGV---NP 374
CF+ D + P + +N +V+ ++ + G C+G+ + GV +
Sbjct: 351 CFQYSD----SLDDGFPNVTFHFEN-SVILKVYPHEYLFPFEG-LWCIGWQNSGVQSRDR 404
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFSD 403
R ++G L N L+ +DL +G+++
Sbjct: 405 RNMTLLGDLVLSNKLVLYDLENQAIGWTE 433
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 81/382 (21%), Positives = 142/382 (37%), Gaps = 70/382 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ + TP + G +++W CE SS+ T C S
Sbjct: 95 EYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQ 154
Query: 91 QCNLANAKACGGGIC----GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
C ++ C C G G + T G + + + +++ +VPN
Sbjct: 155 YCQDLPSETCNNNECQYTYGYG-----DGSTTQGYMATETFTFETS---------SVPNI 200
Query: 147 IFLCGSEFVLQGLASGV-VGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---I 202
F CG + QG G G+ G+G ++LPSQL +F+ C++ + + +
Sbjct: 201 AFGCGEDN--QGFGQGNGAGLIGMGWGPLSLPSQLGVG-----QFSYCMTSYGSSSPSTL 253
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
G V + T L + +N Y+I + + V G + +
Sbjct: 254 ALGSAAS------GVPEGSPSTTLIHSSLN---------PTYYYITLQGITVGGDNLGIP 298
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA--MPKVARVAPVAPFGACFR 320
+ + ++G GG I + T L Y A+ QAF +P V + + CF+
Sbjct: 299 SSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESS--SGLSTCFQ 356
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVI 380
G T VP+I +Q V ++ QN ++ +CL G + + I I
Sbjct: 357 QPSDGSTV---QVPEIS--MQFDGGVLNLGEQNILISPAEGVICLAM---GSSSQLGISI 408
Query: 381 -GARQLENNLLQFDLATSRLGF 401
G Q + + +DL + F
Sbjct: 409 FGNIQQQETQVLYDLQNLAVSF 430
>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 537
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 57/253 (22%), Positives = 103/253 (40%), Gaps = 46/253 (18%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVS------ 218
G+ G G ++ PSQ + F+ CL P Y +NF +
Sbjct: 316 GLVGFGCGPLSFPSQNKDVYGFV--FSYCL-------------PSYKSSNFSSTLRLGPA 360
Query: 219 ---KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
K +K TPL N PS+ Y++ + +HV G+ + + + L+ D G
Sbjct: 361 GQPKRIKMTPLLSNP--------HRPSL-YYVNMVGIHVGGRPMLVPASALAFDPASGRG 411
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQ 335
T + +T L +Y A+ F S + + P+ F C+ + VP
Sbjct: 412 TIVDAGTMFTRLSAPVYAAVRDVFRSRV-RAPVTGPLGGFDTCY--------NVTISVPT 462
Query: 336 IDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSI--VIGARQLENNLLQF 392
+ + V ++ +N +++ D + CL G + ++ V+ + Q +N+ + F
Sbjct: 463 VTFSFDGRVSV-TLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLF 521
Query: 393 DLATSRLGFSDSL 405
D+A R+GFS L
Sbjct: 522 DVANGRVGFSREL 534
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 90/387 (23%), Positives = 147/387 (37%), Gaps = 75/387 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC---EKGYV----------SSTNKTARCGSA 90
+Y+ + TP P+ G ++LW C + Y SST K C S+
Sbjct: 89 EYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148
Query: 91 QCN-LANAKACGGGICGAGPDNPIS-------NTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
QC L N +C DN S N+ T G+I +D L++ S+D R +
Sbjct: 149 QCTALENQASCSTN------DNTCSYSLSYGDNSYTKGNIAVDTLTLGSSD----TRPMQ 198
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
+ N I CG GI GLG V+L QL S+ KF+ CL P
Sbjct: 199 LKNIIIGCGHNNA-GTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPL----- 250
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE-------YFIGVTSVHVN 255
+ D + + + N + + SG + P + Y++ + S+ V
Sbjct: 251 ---------TSKKDQTSKINFG---TNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVG 298
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
K + + + + I + T+L T Y L A AS++ + P +
Sbjct: 299 SKQIQYSGSDSESSEGNI---IIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGL 355
Query: 316 GACFRLQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
C ++ G + VP I + +V + N+ VQ+ D +C F +P
Sbjct: 356 SLC-------YSATGDLKVPVITMHFDGADV--KLDSSNAFVQVSEDLVCFAFRG---SP 403
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGF 401
SI Q+ N L+ +D + + F
Sbjct: 404 SFSIYGNVAQM-NFLVGYDTVSKTVSF 429
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 85/386 (22%), Positives = 149/386 (38%), Gaps = 71/386 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y ++I +P + + + G ++ W+ C +SS+ T C S
Sbjct: 195 EYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSSSYATVPCDSP 254
Query: 91 QCNLANAKAC------GGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPG 138
C +A AC G C G G + T GD + L++ DG
Sbjct: 255 HCRALDASACHNNAANGNSSCVYEVAYGDG-------SYTVGDFATETLTLGG-DG---- 302
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
+ V + CG + +GL G G+ LG ++ PSQ++A +F+ CL
Sbjct: 303 -SAAVHDVAIGCGHDN--EGLFVGAAGLLALGGGPLSFPSQISAT-----EFSYCL---- 350
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
V D P F S + T + + + Y++ + + V G+
Sbjct: 351 ----VDRDSPSASTLQFGASDSSTVTAPLMRSPRS--------NTFYYVALNGISVGGET 398
Query: 259 VP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+ + ++D +G GG + + T L++S Y AL AF + R + V+ F
Sbjct: 399 LSDIPPAAFAMDEQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDT 458
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDALCLGFVDGGVNPRT 376
C+ L R VP + L + + + +N ++ + G CL F G
Sbjct: 459 CYDLAG----RSSVQVPAVSLRFEGGGEL-KLPAKNYLIPVDGAGTYCLAFAATG---GA 510
Query: 377 SIVIGARQLENNLLQFDLATSRLGFS 402
++G Q + + FD A + +GFS
Sbjct: 511 VSIVGNVQQQGIRVSFDTAKNTVGFS 536
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 111/285 (38%), Gaps = 55/285 (19%)
Query: 44 QYVTQIKQRTPLVPVKLT--VHLGGNILWVDCE-------------KGYVSSTNKTARCG 88
Q++ +I TP P+K+T V G +++W+ C SST C
Sbjct: 67 QHLMEIYIGTP--PIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCD 124
Query: 89 SAQCNLANAKACGGGICGAGPDNPISNTGTHGD--IRIDVLSIQS-TDGRNPGRAVTVPN 145
S C+ + C P+ + T +GD + VL+ + T N G+ V++
Sbjct: 125 SPLCHKLDTGVC-------SPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSR 177
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
F+F CG G +G+ GLG +L SQ+ F +KF+ CL PF
Sbjct: 178 FLFGCGHNNT-GGFNDHEMGLIGLGGGPTSLISQIGPLFG-GKKFSQCLVPF-------- 227
Query: 206 DGPYYDLNNFDVSKNLKY---TPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
L + +S + + + + N V T E YF+ + + V P+N
Sbjct: 228 ------LTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMN 281
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
T+ G + + P +L +Y + FA KVA
Sbjct: 282 STI------GKANMLVDSGTPPILLPQQLYD---KVFAEVRNKVA 317
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 64/267 (23%), Positives = 110/267 (41%), Gaps = 24/267 (8%)
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
R+V+ P I C +E G++G+ LGR A S++ KF+ C+ P
Sbjct: 175 RSVSTPPLILGCATE---STDPRGILGM-NLGRLSFAKQSKIT-------KFSYCVPPRQ 223
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
G +Y L N SK KY + + F +P + Y I + + + GK
Sbjct: 224 TRPGFTPTGSFY-LGNNPSSKGFKYVGMMTSSRQRMPNF--DP-LAYTIPMVGIRIAGKK 279
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY-KALVQAFASAMPKVAR---VAPVAP 314
+ ++ + D G G T I + + +T L + Y K Q + P++ + VA
Sbjct: 280 LNISPAVFRADAGGSGQTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVA- 338
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
CF + IG ++ ++ + + V I + + +GG C+G
Sbjct: 339 -DMCF--DSVKAVEIGRLIGEMVFEFE-RGVEVVIPKERVLADVGGGVHCVGIGSSDKLG 394
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGF 401
S +IG +N ++FDL R+GF
Sbjct: 395 AASNIIGNFHQQNLWVEFDLVRRRVGF 421
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 91/392 (23%), Positives = 155/392 (39%), Gaps = 88/392 (22%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVS--------------STNKTARCGSA 90
Y+T++ TP + V G ++ W+ C VS ST C +
Sbjct: 108 YITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAP 167
Query: 91 QCN-----LANAKACGG-GICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNP 137
QC N +C G G+C ++GD + D +S+ S+
Sbjct: 168 QCAELQAATLNPSSCSGSGVC--------QYQASYGDGSFSFGYLSKDTVSLSSSG---- 215
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--- 194
+ P F + CG + V GL G+ GL R+K++L SQLA S+ FA CL
Sbjct: 216 ----SFPGFYYGCGQDNV--GLFGRAAGLIGLARNKLSLLSQLAP--SVGNSFAYCLPTS 267
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
+ G + FG + +N + K YT + + ++ + YF+ + + V
Sbjct: 268 AAASAGYLSFGS----NSDNKNPGK-YSYTSMVSSSLDAS---------LYFVSLAGMSV 313
Query: 255 NGK--AVPLNK--TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA 310
G AVP ++ +L +I + G T+ L T +Y AL +A +A+
Sbjct: 314 AGSPLAVPSSEYGSLPTIIDSGTVITR---------LPTPVYTALSKAVGAAL-AAPSAP 363
Query: 311 PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG 370
+ CF+ Q VP +++ + G N +V + CL F
Sbjct: 364 AYSILQTCFKGQVAKLP-----VPAVNMAFAGGATLRLTPG-NVLVDVNETTTCLAFAPT 417
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
++ +IG Q + + +D+ SR+GF+
Sbjct: 418 ----DSTAIIGNTQQQTFSVVYDVKGSRIGFA 445
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 94/413 (22%), Positives = 150/413 (36%), Gaps = 88/413 (21%)
Query: 37 AKDTATLQYVTQIKQRTPLVPVK-------------------------LTVHLGGNILWV 71
AKD A +QY + + R +VP+ L + + W+
Sbjct: 64 AKDQARMQYFSSLVARKSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWI 123
Query: 72 DCEKGYVSSTNK-----------TARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHG 120
C ST+K CGS C CGG C T+G
Sbjct: 124 PCSGCVGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFN--------FTYG 175
Query: 121 DIRIDVLSIQSTD--GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPS 178
I +Q T +P +P + F C ++ G ++ G+ GLGR ++L S
Sbjct: 176 SSSIAASVVQDTLTLATDP-----IPGYTFGCVNK--TTGSSAPQQGLLGLGRGPLSLLS 228
Query: 179 QLAAAFSLKRKFALCLSPFDDGAIVFGD----GPYYDLNNFDVSKNLKYTPLFINKVNTA 234
Q + K F+ CL F +I F GP Y K +KYTPL N ++
Sbjct: 229 QSQNLY--KSTFSYCLPSFK--SINFSGSLRLGPVYQ------PKRIKYTPLLRNPRRSS 278
Query: 235 SGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKA 294
Y++ + ++ V K V + L+ + GT + +T L +Y A
Sbjct: 279 ---------LYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYTA 329
Query: 295 LVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNS 354
+ F + V + F C+ + VVP I + NV ++ N
Sbjct: 330 VRNEFRRRVGPKLPVTTLGGFDTCYNVPI--------VVPTITFLFSGMNV--TLPPDNI 379
Query: 355 MVQ-IGGDALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFSDSL 405
++ G CL N + + VI Q +N+ + FD+ SR+G + L
Sbjct: 380 VIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIAREL 432
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 74/289 (25%), Positives = 109/289 (37%), Gaps = 71/289 (24%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV---------------SSTNKTAR 86
T QYV + TP V + V G ++ WV C+ SST
Sbjct: 140 TFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVP 199
Query: 87 CGSAQCNLAN--AKACGGGICG--AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
CG+ C+ C G CG + + TG +G D L++ PG T
Sbjct: 200 CGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGS---DTLALA------PGN--T 248
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
V F+F CG G+ +G+ G+ LGR ++L SQ A A+ F+ CL
Sbjct: 249 VGTFLFGCG--HAQAGMFAGIDGLLALGRQSMSLKSQAAGAYG--GVFSYCLP------- 297
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF--------LGEPSVEYFIGVTSVHV 254
SK L + ++ASGF P+ Y + +T + V
Sbjct: 298 ---------------SKQSAAGYLTLGGPSSASGFATTGLLTAWAAPTF-YMVMLTGISV 341
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
G+ V + + + GGT + T T L + Y AL AF A+
Sbjct: 342 GGQQVAVPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRGAI 384
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 77/183 (42%), Gaps = 24/183 (13%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-----------KGYV---------SSTNKT 84
Y ++ TP VP + + G ++ W+ C+ +G V SST+K
Sbjct: 130 YYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKE 189
Query: 85 ARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
+C S+ C+ + + C NT + G + D+L + + D ++ +
Sbjct: 190 VQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARI- 248
Query: 145 NFIFLCGSEFVLQGLASGVV-GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
CG + L+S G+ GLG V++PS LA A + F+LC P G I
Sbjct: 249 --TLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIE 306
Query: 204 FGD 206
FGD
Sbjct: 307 FGD 309
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 92/394 (23%), Positives = 152/394 (38%), Gaps = 81/394 (20%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV------------SSTNKTARCGSAQC 92
Y+ + TP + + + + W+ C G V SS+++T +C + QC
Sbjct: 88 YIVRANIGTPAQAMLVALDTSNDAAWIPCS-GCVGCSSSVLFDPSKSSSSRTLQCEAPQC 146
Query: 93 NLAN------AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
A +K+CG + G + + D L++ +TD +PN+
Sbjct: 147 KQAPNPSCTVSKSCGFNMTYGG-------SAIEAYLTQDTLTL-ATD--------VIPNY 190
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGD 206
F C ++ G + G+ GLGR ++L SQ + + F+ CL
Sbjct: 191 TFGCINK--ASGTSLPAQGLMGLGRGPLSLISQSQNLY--QSTFSYCL------------ 234
Query: 207 GPYYDLNNFDVSKNL--KYTPLFINKVNTASGFLGEP--SVEYFIGVTSVHVNGKAVPLN 262
P +NF S L K P+ I + L P S Y++ + + V K V +
Sbjct: 235 -PNSKSSNFSGSLRLGPKNQPIRIK----TTPLLKNPRRSSLYYVNLVGIRVGNKIVDIP 289
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+ L+ D GT + YT L Y A+ F + K A + F C+
Sbjct: 290 TSALAFDPATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRV-KNANATSLGGFDTCYS-- 346
Query: 323 DIGFTRIGPVV-PQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALCLGFVDGGVNPRTSI-V 379
G VV P + + NV ++ N ++ G+ CL N + + V
Sbjct: 347 -------GSVVFPSVTFMFAGMNV--TLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNV 397
Query: 380 IGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
I + Q +N+ + D+ SRLG S R TCT
Sbjct: 398 IASMQQQNHRVLIDVPNSRLGIS------RETCT 425
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 77/183 (42%), Gaps = 24/183 (13%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-----------KGYV---------SSTNKT 84
Y ++ TP VP + + G ++ W+ C+ +G V SST+K
Sbjct: 107 YYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKE 166
Query: 85 ARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
+C S+ C+ + + C NT + G + D+L + + D ++ +
Sbjct: 167 VQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARI- 225
Query: 145 NFIFLCGSEFVLQGLASGVV-GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV 203
CG + L+S G+ GLG V++PS LA A + F+LC P G I
Sbjct: 226 --TLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIE 283
Query: 204 FGD 206
FGD
Sbjct: 284 FGD 286
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 67/296 (22%), Positives = 125/296 (42%), Gaps = 32/296 (10%)
Query: 115 NTGTHGDIRIDVLSIQ---STDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGR 171
++ T GD ++ ++ ST G++ R V N +F CG +GL G G+ GLGR
Sbjct: 284 SSNTTGDFALETFTVNLTSSTTGKSEFRRVE--NVMFGCGH--WNRGLFHGAAGLLGLGR 339
Query: 172 SKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKV 231
++ SQL + + F+ CL D V + + + L +T L K
Sbjct: 340 GPLSFSSQLQSLYG--HSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKE 397
Query: 232 NTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSI 291
N F Y++ + S+ V G+ + + + ++ +G GGT I + +
Sbjct: 398 NPVDTF-------YYLQIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPA 450
Query: 292 YKALVQAFASAMPKVARVAPVAPFGACFRL---QDIGFTRIGPVVPQIDLVLQNKNVVWS 348
Y+ + +AF + V C+ + ++ F L+ VW+
Sbjct: 451 YRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPEF--------LIQFADGAVWN 502
Query: 349 IHGQNSMVQIGG-DALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
+N ++I D +CL + P++++ +IG Q +N + +D SRLG++
Sbjct: 503 FPVENYFIRIQQLDIVCLAMLG---TPKSALSIIGNYQQQNFHILYDTKNSRLGYA 555
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 87/393 (22%), Positives = 159/393 (40%), Gaps = 70/393 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY------------------VSSTNKTAR 86
Y T+++ +P + + G ++LWV C S T
Sbjct: 90 YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNT-------GTHGDIRIDVLSIQSTDGRNPGR 139
C +C+L + +C A +N T GT G D+L + G + +
Sbjct: 150 CSDQRCSLGLQSS--DSVCAA-QNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMK 206
Query: 140 AVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
+ P +F C + L V GI G G+ +++ SQLA+ R F+ CL
Sbjct: 207 NSSAP-IVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGD 265
Query: 198 DDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
D G +V G+ V N+ YTPL ++ + Y + + S++VN
Sbjct: 266 DSGGGILVLGE---------IVEPNIVYTPLVPSQPH------------YNLNLQSIYVN 304
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP-VAP 314
G+ + ++ ++ + + GT I + L + Y + A S V+P V+P
Sbjct: 305 GQTLAIDPSVFATSSN--QGTIIDSGTTLAYLTEAAYDPFISAITST------VSPSVSP 356
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGFVDG 370
+ + + + I V PQ+ L + I Q+ ++Q I G AL C+GF
Sbjct: 357 YLSKGNQCYLTSSSINDVFPQVSLNFAGGTSMILIP-QDYLIQQSSINGAALWCVGFQK- 414
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
+ + ++G L++ + +D+A R+G+++
Sbjct: 415 -IQGQEITILGDLVLKDKIFVYDIAGQRIGWAN 446
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 85/357 (23%), Positives = 143/357 (40%), Gaps = 76/357 (21%)
Query: 79 SSTNKTARCGSAQCNLAN------AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQST 132
SS N C A C+L+ ++ C G ++ T +R++
Sbjct: 155 SSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTRDTLRVN------- 207
Query: 133 DGRNPGRAVTVPNFIFLC-GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFA 191
G +PG A +P F F C GS + +GIAG GR +++ SQL L++ F+
Sbjct: 208 -GSSPGVAKEIPKFCFGCVGSAY------REPIGIAGFGRGTLSMVSQLGF---LQKGFS 257
Query: 192 LCLSPFD-------DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE 244
C F +V GD L + D ++++TP+ L P
Sbjct: 258 HCFLAFKYANNPNISSPLVVGD---IALTSKD---DMQFTPM-----------LNSPMYP 300
Query: 245 --YFIGVTSVHV-NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS 301
Y++G+ ++ V N A + +L D+ G GG KI + YT L Y ++ S
Sbjct: 301 NFYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQS 360
Query: 302 AM--PKVARVAPVAPFGACFRL----------QDIGFTRIGPVVPQIDLVLQNKNVVWSI 349
+ P+ + F C+++ D+ + + + LVL N + +
Sbjct: 361 TINYPRDTGMEMQTGFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPV 420
Query: 350 H--GQNSMVQIGGDALCLGFV---DGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G ++V+ CL F DG P + V G+ Q +N + +DL R+GF
Sbjct: 421 SAPGNPAVVK------CLMFQSTDDGDDGP--AGVFGSFQQQNVEVVYDLEKERIGF 469
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 96/394 (24%), Positives = 147/394 (37%), Gaps = 81/394 (20%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV------------SSTNKTARCGSAQC 92
Y+ + TP P+ + + + WV C G V SS+++ +C + QC
Sbjct: 91 YIVRANIGTPAQPMLVALDTSNDAAWVPCS-GCVGCASSVLFDPSKSSSSRNLQCDAPQC 149
Query: 93 NLANAKAC-GGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN-----F 146
A C G CG T+G I+ Q T +T+ N +
Sbjct: 150 KQAPNPTCTAGKSCGFN--------MTYGGSTIEASLTQDT--------LTLANDVIKSY 193
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGD 206
F C S+ G + G+ GLGR ++L SQ + F+ CL
Sbjct: 194 TFGCISKAT--GTSLPAQGLMGLGRGPLSLISQTQNLY--MSTFSYCL------------ 237
Query: 207 GPYYDLNNFDVSKNL--KYTPLFINKVNTASGFLGEP--SVEYFIGVTSVHVNGKAVPLN 262
P +NF S L KY P+ I + L P S Y++ + + V K V +
Sbjct: 238 -PNSKSSNFSGSLRLGPKYQPVRIK----TTPLLKNPRRSSLYYVNLVGIRVGNKIVDIP 292
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ 322
+ L+ D GT + +T L Y A+ F + K A + F C+
Sbjct: 293 TSALAFDASTGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRI-KNANATSLGGFDTCYS-- 349
Query: 323 DIGFTRIGPVV-PQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALCLGFVDGGVNPRTSI-V 379
G VV P + + NV ++ N ++ G CL N + + V
Sbjct: 350 -------GSVVYPSVTFMFAGMNV--TLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNV 400
Query: 380 IGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
I + Q +N+ + DL SRLG S R TCT
Sbjct: 401 IASMQQQNHRVLIDLPNSRLGIS------RETCT 428
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 148/383 (38%), Gaps = 71/383 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC----------EKGYV-----SSTNKTAR 86
TL+YV + TP V +T+ G ++ WV C + G + SST +
Sbjct: 124 TLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVS 183
Query: 87 CGSAQCNLANAKACGGGI----CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C +A+C + G G C G +T T+G D L++ G +
Sbjct: 184 CAAAECAQLEQQGNGCGATNYECQYGVQYGDGST-TNGTYSRDTLTLS-------GASDA 235
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAI 202
V F F C + G + G+ GLG +L SQ AAA+ F+ CL P G+
Sbjct: 236 VKGFQFGC--SHLESGFSDQTDGLMGLGGGAQSLVSQTAAAYG--NSFSYCLPP-TSGSS 290
Query: 203 VFGDGPYYDLNNFDVSKNL---KYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
F + V+ + K P F Y + + V GK +
Sbjct: 291 GFLTLGGGGGASGFVTTRMLRSKQIPTF-----------------YGARLQDIAVGGKQL 333
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP-FGAC 318
L+ ++ + + GT I+ + P + Y AL AF + M K R AP C
Sbjct: 334 GLSPSVFAAGSVVDSGTIITRLPP------TAYSALSSAFKAGM-KQYRSAPARSILDTC 386
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI 378
F G T+I +P + LV +I + + G CL F G + T I
Sbjct: 387 FDF--AGQTQIS--IPTVALVFSGG---AAIDLDPNGIMYGN---CLAFAATGDDGTTGI 436
Query: 379 VIGARQLENNLLQFDLATSRLGF 401
+ +Q +L +D+ +S LGF
Sbjct: 437 IGNVQQRTFEVL-YDVGSSTLGF 458
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 113/278 (40%), Gaps = 56/278 (20%)
Query: 144 PNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL------SPF 197
P F+F CGS +L+GL SG VGI L ++ PSQ+ + KF+ CL +
Sbjct: 217 PGFVFGCGS--LLKGLISGEVGILALSPGSLSFPSQIGEKYG--NKFSYCLLRQTAQNSL 272
Query: 198 DDGAIVFGDGPYYDLNNFDVSK--NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
+VFG+ +L K L+YTP +GE S+ Y + + + V
Sbjct: 273 KKSPMVFGEAA-VELKEPGSGKPQELQYTP------------IGESSIYYTVRLDGISVG 319
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
+ + L+ + + N T + T+L + + ++ Q+ AS M A +
Sbjct: 320 NQRLDLSPS--TFLNGQDKPTIFDSGTTLTMLPSGVCDSIKQSLAS-MVSGAEFVAIKGL 376
Query: 316 GACFR--------LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF 367
ACFR L DI F G D V + N V + G CL F
Sbjct: 377 DACFRVPPSSGQGLPDITFHFNGGA----DFVTRPSNYVIDL----------GSLQCLIF 422
Query: 368 VDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFSDS 404
V P + + G Q ++ + D+ R+GF ++
Sbjct: 423 V-----PTNEVSIFGNLQQQDFFVLHDMDNRRIGFKET 455
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 92/397 (23%), Positives = 151/397 (38%), Gaps = 76/397 (19%)
Query: 30 KALVLRVAKDTATLQ---YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------- 78
K V+ +A +Q Y+ + K TP + + + + W+ C KG V
Sbjct: 17 KKSVVPIASGRGVIQSPSYIVKAKVGTPPQTLLMALDNSYDAAWIPC-KGCVGCSSTVFN 75
Query: 79 ---SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGR 135
S+T KT CG+ QC CGG C + ++ ++ D +++ S D
Sbjct: 76 TVKSTTFKTLGCGAPQCKQVPNPICGGSTCTW--NTTYGSSTILSNLTRDTIAL-SMD-- 130
Query: 136 NPGRAVTVPNFIFLC-----GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
VP + F C GS QGL G GR ++ SQ + K F
Sbjct: 131 ------PVPYYAFGCIQKATGSSVPPQGL-------LGFGRGPLSFLSQTQNLY--KSTF 175
Query: 191 ALCLSPFD----DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
+ CL F G++ G +K TPL N S Y+
Sbjct: 176 SYCLPSFRTLNFSGSLRLGP--------VGQPPRIKTTPLLKNP---------RRSSLYY 218
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
+ + + V K V + ++ L+ + GT + +T L Y A+ F +
Sbjct: 219 VKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRVGN- 277
Query: 307 ARVAPVAPFGACFRLQDIGFTRIGPVV-PQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALC 364
A V+ + F C+ + P+V P I + NV ++ +N ++ G C
Sbjct: 278 ATVSSLGGFDTCYSV---------PIVPPTITFMFSGMNV--TMPPENLLIHSTAGVTSC 326
Query: 365 LGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLG 400
L N + + VI + Q +N+ + FD+ SRLG
Sbjct: 327 LAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLG 363
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 88/400 (22%), Positives = 158/400 (39%), Gaps = 68/400 (17%)
Query: 34 LRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCN 93
+ +D A +Y ++K +P L V G W++C K + + T C S +C
Sbjct: 102 MHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNCSKSFEAVT-----CASRKCK 156
Query: 94 LANAKACGGGICGAGPDNPI-------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
+ ++ +C P +P + G D +++ T+G+ + N
Sbjct: 157 VDLSELFSLSVC-PKPSDPCLYDISYADGSSAKGFFGTDSITVGLTNGKQG----KLNNL 211
Query: 147 IFLCGSEFVLQGL--ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
C ++ +L G+ GI GLG +K + + A + KF+ CL
Sbjct: 212 TIGC-TKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGA--KFSYCLVD-------- 260
Query: 205 GDGPYYDLNNFDVSKNL----KYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
L++ VS NL + + ++ L P Y + V + + G+ +
Sbjct: 261 ------HLSHRSVSSNLTIGGHHNAKLLGEIRRTELILFPPF--YGVNVVGISIGGQMLK 312
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
+ + + E GGT I + T L Y+A+ +A ++ KV RV G F
Sbjct: 313 IPPQVWDFNAE--GGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVT-----GEDFD 365
Query: 321 LQDIGFTRIG---PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD----GGVN 373
+ F G VVP++ + ++ ++ + C+G V GG +
Sbjct: 366 ALEFCFDAEGFDDSVVPRLVFHFAG-GARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGAS 424
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
VIG +N+L +FDL+T+ +GF+ S TCT
Sbjct: 425 -----VIGNIMQQNHLWEFDLSTNTVGFAPS------TCT 453
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 85/383 (22%), Positives = 139/383 (36%), Gaps = 71/383 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC---EKGY----------VSSTNKTARCGSA 90
Y+ TP PV V +I+WV C E Y S T K C S
Sbjct: 87 DYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSST 146
Query: 91 QCNLANAKACGGG---ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFI 147
C +C IC N + + GD+ ++ +++ S + +P V P +
Sbjct: 147 TCKSVQGTSCSSDERKICEHTV-NYKDGSHSQGDLIVETVTLGSYN--DP--FVHFPRTV 201
Query: 148 FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP---------FD 198
C ++ +GI GLG V+L QL++ S+ +KF+ CL+P F
Sbjct: 202 IGCIRN---TNVSFDSIGIVGLGGGPVSLVPQLSS--SISKKFSYCLAPISDRSSKLKFG 256
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
D A+V GDG F K Y L V G +E+ +
Sbjct: 257 DAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSV-------GNNRIEFRSSSSRS------ 303
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
G G I + +TVL +Y L A A + P+ F C
Sbjct: 304 ------------SGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLC 351
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI 378
++ + ++ VP I +V ++ N+ + +CL F ++ ++
Sbjct: 352 YKST---YDKVD--VPVITAHFSGADV--KLNALNTFIVASHRVVCLAF----LSSQSGA 400
Query: 379 VIGARQLENNLLQFDLATSRLGF 401
+ G +N L+ +DL + F
Sbjct: 401 IFGNLAQQNFLVGYDLQRKIVSF 423
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 91/405 (22%), Positives = 148/405 (36%), Gaps = 92/405 (22%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------KG----YVSSTNKTARCGSAQC 92
QY T I P P L V G ++ W+ C+ KG Y + K C
Sbjct: 193 QYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPPRDLLC 252
Query: 93 N--------LANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
A K C I A + + G + D + + +T+G GR
Sbjct: 253 QELQGDQNYCATCKQCDYEIEYA------DRSSSMGVLAKDDMHMIATNG---GREKL-- 301
Query: 145 NFIFLCGSEFVLQGLAS--GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS--PFDDG 200
+F+F C + Q L S GI GL + ++LPSQLA+ + F C++ P G
Sbjct: 302 DFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEPNGGG 361
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
+ GD Y P + T + G P Y H + V
Sbjct: 362 YMFLGD---------------DYVPRW---GMTWAPIRGGPDNLY-------HTEAQKVN 396
Query: 261 LNKTLLSIDNEGVGGTKISTV----NPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
L + G G+ I + + YT L IYK LV A P +
Sbjct: 397 YGDQQLRM--HGQAGSSIQVIFDSGSSYTYLPDEIYKKLVTAIKYDYPSFVQDTSDTTLP 454
Query: 317 ACFR-------LQDIG---------FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
C++ L+D+ F V+P+ +L + ++ S G
Sbjct: 455 LCWKADFDVRYLEDVKQFFKPLNLHFGNRWFVIPRTFTILPDDYLIISDKGN-------- 506
Query: 361 DALCLGFVDGG-VNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
+CLG ++G ++ +++++G L L+ +D ++G++DS
Sbjct: 507 --VCLGLLNGAEIDHASTLIVGDVSLRGKLVVYDNERRQIGWADS 549
>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
Length = 565
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 57/253 (22%), Positives = 103/253 (40%), Gaps = 46/253 (18%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVS------ 218
G+ G R ++ PSQ + F+ CL P Y +NF +
Sbjct: 344 GLVGFNRGPLSFPSQNKNVYG--SVFSYCL-------------PSYKSSNFSGTLRLGPA 388
Query: 219 ---KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
K +K TPL N PS+ Y++ + + V G+ V + + L+ D G
Sbjct: 389 GQPKRIKTTPLLSNP--------HRPSL-YYVNMVGIRVGGRPVAVPASALAFDPASGHG 439
Query: 276 TKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQ 335
T + +T L +Y A+ F S + + P+ F C+ + VP
Sbjct: 440 TIVDAGTMFTRLSAPVYAAVCDVFRSRV-RAPVAGPLGGFDTCYN--------VTISVPT 490
Query: 336 IDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSI--VIGARQLENNLLQF 392
+ + + V ++ +N +++ D + CL G + ++ V+ + Q +N+ + F
Sbjct: 491 VTFLFDGRVSV-TLPEENVVIRSSLDGIACLAMAAGPSDSVDAVLNVMASMQQQNHRVLF 549
Query: 393 DLATSRLGFSDSL 405
D+A R+GFS L
Sbjct: 550 DVANGRVGFSREL 562
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 92/409 (22%), Positives = 156/409 (38%), Gaps = 72/409 (17%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTA--TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-- 73
L P ++ F AL V T+ + +Y ++ P + + G ++ W+ C
Sbjct: 120 LHPAESKAEFESNALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAP 179
Query: 74 -EKGY----------VSSTNKTARCGSAQCNLANAKACGGGIC------GAGPDNPISNT 116
+ Y S++ RC QC + C G C G G +
Sbjct: 180 CSECYQQSDPIFDPISSNSYSPIRCDEPQCKSLDLSECRNGTCLYEVSYGDG-------S 232
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T G+ + +++ S V N CG +GL G G+ GLG K++
Sbjct: 233 YTVGEFATETVTLGS---------AAVENVAIGCGHNN--EGLFVGAAGLLGLGGGKLSF 281
Query: 177 PSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASG 236
P+Q+ A F+ CL D A+ + N + +N PL N
Sbjct: 282 PAQVNAT-----SFSYCLVNRDSDAVSTLEF------NSPLPRNAATAPLMRNP------ 324
Query: 237 FLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALV 296
E Y++G+ + V G+A+P+ ++ +D G GG I + T L + +Y AL
Sbjct: 325 ---ELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVYDALR 381
Query: 297 QAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV 356
AF + + V+ F C+ L +R +P + + + +N ++
Sbjct: 382 DAFVKGAKGIPKANGVSLFDTCYDLS----SRESVEIPTVSFRFPEGREL-PLPARNYLI 436
Query: 357 QIGG-DALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
+ C F P TS +IG Q + + FD+A S +GFS
Sbjct: 437 PVDSVGTFCFAFA-----PTTSSLSIIGNVQQQGTRVGFDIANSLVGFS 480
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 71/167 (42%), Gaps = 37/167 (22%)
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
++ V NF F C + + VG+AG GR ++LP+QLA SL +F+ CL
Sbjct: 215 SMAVENFTFACAHTALAE-----PVGVAGFGRGPLSLPAQLAP--SLSGRFSYCLV---- 263
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASG------------FLGEPSVEYF- 246
++F + ++ +PL + + A+ L P YF
Sbjct: 264 ------------AHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFY 311
Query: 247 -IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
+ + +V V GK + L +D +G GG + + +T+L + +
Sbjct: 312 SVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTF 358
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 90/408 (22%), Positives = 157/408 (38%), Gaps = 92/408 (22%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCG 88
+ +Y+ + TP ++ + G ++ W+ C ++G V SS+ + CG
Sbjct: 143 SAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNLTCG 202
Query: 89 SAQC-NLANAKACGGGICGAGPDNPI-------SNTGTHGDIRIDVLSIQSTDGRNPGRA 140
+C ++A +A C ++P + + GD+ ++ ++
Sbjct: 203 DPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVN---------- 252
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIA-GLGRSKVAL-----------------PSQLAA 182
+T P G +S V G+ G G L SQL A
Sbjct: 253 LTAP-------------GASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRA 299
Query: 183 AFSLKRKFALCLSPFDDGA-----IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGF 237
+ F+ CL D G+ +VFG+ D LKYT F + A F
Sbjct: 300 VYG-GHTFSYCL--VDHGSDVASKVVFGED---DALALAAHPRLKYT-AFAPASSPADTF 352
Query: 238 LGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQ 297
Y++ +T V V G+ + ++ G GGT I + + Y+ + +
Sbjct: 353 -------YYVRLTGVLVGGELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRR 405
Query: 298 AFASAMPKVARVAPVAP-FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV 356
AF M P P C+ + G R P VP++ L+ + VW +N +
Sbjct: 406 AFIDRMSGSYPPVPDFPVLSPCYNVS--GVER--PEVPELSLLFAD-GAVWDFPAENYFI 460
Query: 357 QIGGDA-LCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
++ D +CL + PRT + +IG Q +N + +DL +RLGF+
Sbjct: 461 RLDPDGIMCLAVLG---TPRTGMSIIGNFQQQNFHVAYDLHNNRLGFA 505
>gi|218189694|gb|EEC72121.1| hypothetical protein OsI_05107 [Oryza sativa Indica Group]
Length = 89
Score = 51.6 bits (122), Expect = 7e-04, Method: Composition-based stats.
Identities = 28/78 (35%), Positives = 45/78 (57%), Gaps = 7/78 (8%)
Query: 347 WSIHGQNSMVQIGGDALCLGFVDGG------VNPRTSIVIGARQLENNLLQFDLATSRLG 400
W+I G +++V++ +A C FVD G V+ +++IG Q+E+NL+ FDL + G
Sbjct: 8 WTIVGASAVVEVSQEAACFAFVDMGAAAAPAVDHSPAVIIGGHQMEDNLVVFDLEKWQFG 67
Query: 401 FSDSLLFERATC-TFNFT 417
FS LL C F+F+
Sbjct: 68 FSGLLLGTMTRCGNFDFS 85
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 72/309 (23%), Positives = 117/309 (37%), Gaps = 55/309 (17%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--------GYVSSTNKTARC 87
V K +Y+ Q P + + V G +++WV C + ++
Sbjct: 78 VTKSQKGGKYIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSS 137
Query: 88 GSAQCNLANAKACGGG--ICGAGPDNP--------ISNTGTHGDIRIDVLSIQSTDGRNP 137
G C+ +A G G I D+P ++G H + T+
Sbjct: 138 GKLPCSSQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDH-----STQGVLGTETFTF 192
Query: 138 GRAVTVPNFIF-----LCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
G N F + GS+F G G+ GLGR ++L SQL A +FA
Sbjct: 193 GDGYVANNVSFGRSDTIDGSQF------GGTAGLVGLGRGHLSLVSQLGAG-----RFAY 241
Query: 193 CLS--PFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVT 250
CL+ P I+FG D + DVS TPL N + Y++ +
Sbjct: 242 CLAADPNVYSTILFGSLAALDTSAGDVSS----TPLVTNPKP-------DRDTHYYVNLQ 290
Query: 251 SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA 310
+ V G +P+ +I+++G GG + T L+ + Y+ + QA S ++ R+
Sbjct: 291 GISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITS---EIQRLG 347
Query: 311 PVAPFGACF 319
A CF
Sbjct: 348 YDAGDDTCF 356
>gi|302799212|ref|XP_002981365.1| hypothetical protein SELMODRAFT_420972 [Selaginella moellendorffii]
gi|300150905|gb|EFJ17553.1| hypothetical protein SELMODRAFT_420972 [Selaginella moellendorffii]
Length = 347
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 60/253 (23%), Positives = 105/253 (41%), Gaps = 41/253 (16%)
Query: 166 IAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTP 225
IA LG ALP+Q++ + L +FA L D A +F F + ++YT
Sbjct: 118 IAALGSKNTALPAQISRSLGLPLRFAYTLR--DTSASIF----------FGKTAWIQYTQ 165
Query: 226 LFINKVNTASGFLGEP-----SVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
+ + V F+ P + Y + +T + + K L+ + V +IS
Sbjct: 166 I-VPPVTVPVEFMQIPLKLDGAASYMVKMTGIGI--------KAFLTGQEDNV---EISV 213
Query: 281 VNPYTVLETSIYKALVQAF--ASAMPKVARVAPVA---PFGACFRLQDIGFTRIGPVVPQ 335
+T L IY +V F ++ K+ R + A G C++++ TR
Sbjct: 214 TQRFTTLPPKIYGFVVAQFQQEASERKIKRASTSAYNGKLGLCYQMRSSDVTRFR----N 269
Query: 336 IDLVLQNKNVVWSIHGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGARQLENNLLQFD 393
+ +V +K WS+ +V G + CL +++ + VIG Q E+ ++F+
Sbjct: 270 VTMVFSSK-FRWSVPADKYLVPKPGTSNVFCLAYLELAAGNGSHGVIGTLQQEDRAMEFN 328
Query: 394 LATSRLGFSDSLL 406
L LG S L+
Sbjct: 329 LERKSLGVSSPLI 341
>gi|302783204|ref|XP_002973375.1| hypothetical protein SELMODRAFT_413680 [Selaginella moellendorffii]
gi|300159128|gb|EFJ25749.1| hypothetical protein SELMODRAFT_413680 [Selaginella moellendorffii]
Length = 407
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 74/258 (28%), Positives = 112/258 (43%), Gaps = 53/258 (20%)
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVS 218
SG+VG A +S + QLA KF C +P D G IVFG N+ +S
Sbjct: 143 SGLVGFAKTNKSFIG---QLAE-MDYTGKFIYC-APSDTFSGKIVFG--------NYKIS 189
Query: 219 KN--LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGT 276
N L YTP+ +N ++TA Y+IG+ S+ +N L + +L+ +G GGT
Sbjct: 190 SNSSLSYTPMIVNPISTA---------LYYIGLRSISINDMLTFLVQGILA---DGTGGT 237
Query: 277 KISTVNPYTVLETSIYKALVQAFA---SAMPKVARVAPVAPFG--ACFRLQDIGFTRIGP 331
I + ++ Y LVQA S + KV+ A G C+ + G T P
Sbjct: 238 IIDSTFAFSYFTPDSYTPLVQAIQNLNSNLTKVSSNKTAALLGNDICYNVSVNGDT---P 294
Query: 332 VVPQIDLVLQNKNVV----WSI---HGQNSMVQIG-GDALCLGFVDGGVNPRTSIVIGAR 383
+ +N V W + +N+ V + GD+ +GF +N VIG
Sbjct: 295 PPQTLTYHFENGTQVEFRTWFLLDDDAENATVCLAVGDSQKVGF---SLN-----VIGTY 346
Query: 384 QLENNLLQFDLATSRLGF 401
Q + ++FDL +GF
Sbjct: 347 QQLDVAVEFDLEKQEIGF 364
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 71/167 (42%), Gaps = 37/167 (22%)
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
++ V NF F C + + VG+AG GR ++LP+QLA SL +F+ CL
Sbjct: 215 SMAVENFTFACAHTALAE-----PVGVAGFGRGPLSLPAQLAP--SLSGRFSYCLV---- 263
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASG------------FLGEPSVEYF- 246
++F + ++ +PL + + A+ L P YF
Sbjct: 264 ------------AHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFY 311
Query: 247 -IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
+ + +V V GK + L +D +G GG + + +T+L + +
Sbjct: 312 SVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTF 358
>gi|361068027|gb|AEW08325.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165459|gb|AFG65601.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165460|gb|AFG65602.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165461|gb|AFG65603.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165462|gb|AFG65604.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165463|gb|AFG65605.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165465|gb|AFG65607.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165466|gb|AFG65608.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165467|gb|AFG65609.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165468|gb|AFG65610.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165469|gb|AFG65611.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165472|gb|AFG65614.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165473|gb|AFG65615.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165474|gb|AFG65616.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165475|gb|AFG65617.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165476|gb|AFG65618.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 55/110 (50%), Gaps = 12/110 (10%)
Query: 99 ACGGGICGAGPDNPISNTG------THGD--IRIDVLSIQS-TDGRNPGRAVTVPNFIFL 149
+C +C A PD +T T+GD I + +LS ++ T G +PNF F
Sbjct: 19 SCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETLTLTSKSGAEQLIPNFAFG 78
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
CG G G GI GLGR ++L SQL+A S+ +KF+ CL DD
Sbjct: 79 CGQNNEGNGFDQGA-GIVGLGRGPLSLISQLSA--SMPKKFSYCLMTIDD 125
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 99/418 (23%), Positives = 159/418 (38%), Gaps = 90/418 (21%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV---------------------SSTNK 83
Y + TP P+ + + G ++ WV C Y SS+++
Sbjct: 91 YAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSR 150
Query: 84 TARCGSAQCNLANAKA---CG-------GGICGAGPDNPISNTG-THGDIRIDVLSIQST 132
C + C ++K+ CG G +C P + +G T G + D L + +
Sbjct: 151 LVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCP--PYLVVYGSGSTSGLLISDTLRLSPS 208
Query: 133 DGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
+ NF C V Q + G+AG GR ++PSQL KF+
Sbjct: 209 SSSS--APAPFRNFAIGCSIVSVHQPPS----GLAGFGRGAPSVPSQLKVP-----KFSY 257
Query: 193 CL--SPFDD-----GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
CL FDD G +V GD + ++Y PL N + SV Y
Sbjct: 258 CLLSRRFDDNSAVSGELVLGDA---MVPAGKKKTTMQYVPLLNNAASKPPY-----SVYY 309
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP- 304
++ +T + V GK P+N + GG I + +T L+ +++K + A SA+
Sbjct: 310 YLALTGISVGGK--PVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGG 367
Query: 305 KVARVAPVAP---FGACFRLQDIGFTRIGP----VVPQIDLVLQNKNVV-------WSIH 350
+ R PV CF L GP +P ++L + V+ +
Sbjct: 368 RYNRSRPVEDALGLRPCFALPP------GPGGAMELPDLELKFKGGAVMRLPVENYFVAA 421
Query: 351 GQNSMVQIGGDALCLGFVDG-------GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G G A+CL V G +I++G+ Q +N +++DL RLGF
Sbjct: 422 GPAGGPAAGPVAICLAVVSDLPASGGDGAAAGPAIILGSFQQQNYHIEYDLGKERLGF 479
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 90/387 (23%), Positives = 155/387 (40%), Gaps = 75/387 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y T++ TP + + + G +++W+ C Y S + C S
Sbjct: 109 EYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSP 168
Query: 91 QCNLANAKACGG--GIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C ++ C C G G + T GD + L+ R
Sbjct: 169 LCRRLDSSGCSTRRHTCLYQVSYGDG-------SFTTGDFATETLTF---------RGNK 212
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL----SPFD 198
+ CG +GL G G+ GLGR +++ PSQ F+ KF+ CL +
Sbjct: 213 IAKVALGCGHH--NEGLFVGAAGLLGLGRGRLSFPSQTGIRFN--HKFSYCLVDRSASSK 268
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGK 257
++VFGD +S+ ++TPL N K++T Y++G+ + V G
Sbjct: 269 PSSMVFGDAA--------ISRLARFTPLIRNPKLDTF----------YYVGLIGISVGGV 310
Query: 258 AVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
V ++ +L +D+ G GG I + T L Y AL AF + R + F
Sbjct: 311 RVRGVSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFD 370
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNPR 375
C+ L ++ P V VL + ++ N ++ + + + C F G ++
Sbjct: 371 TCYDLSGQSSVKV-PTV-----VLHFRGADMALPATNYLIPVDENGSFCFAFA-GTISGL 423
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFS 402
+ +IG Q + + +DLA SR+GF+
Sbjct: 424 S--IIGNIQQQGFRVVYDLAGSRIGFA 448
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 96/391 (24%), Positives = 143/391 (36%), Gaps = 81/391 (20%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYV------------SSTNKTARC 87
TL++V + TP L G ++ W+ C G+ S+T C
Sbjct: 117 TLEFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPC 176
Query: 88 GSAQCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
G QC A K G C G G + T G + + LS+ S A
Sbjct: 177 GHPQCAAAGGKCSSNGTCLYKVQYGDG-------SSTAGVLSHETLSLTS--------AR 221
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
+P F F CG + G V G+ GLGR +++L SQ AA+F + L G
Sbjct: 222 ALPGFAFGCGETNL--GDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGY 279
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPL 261
+ G S ++YT + I K + S YF+ + S+ V G +P+
Sbjct: 280 LTIG-----TTTPASGSDGVRYTAM-IQKQDYPS--------FYFVDLVSIVVGGFVLPV 325
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRL 321
L + D GT + + T L Y AL F M + PF C+
Sbjct: 326 PPILFTRD-----GTLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDF 380
Query: 322 --QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL-----CLGFVDGGVNP 374
Q+ F +P + + + S V I D CL FV P
Sbjct: 381 AGQNAIF------MPLVSFKFSDGS---SFDLSPFGVLIFPDDTAPATGCLAFV-----P 426
Query: 375 RTS----IVIGARQLENNLLQFDLATSRLGF 401
R S ++G Q N + +D+A ++GF
Sbjct: 427 RPSTMPFTIVGNTQQRNTEMIYDVAAEKIGF 457
>gi|361067987|gb|AEW08305.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125859|gb|AFG43520.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125865|gb|AFG43523.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125875|gb|AFG43528.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 47/101 (46%), Gaps = 5/101 (4%)
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
L YTP N S G V Y+IG+ +V + GK + L LL D +G GGT I +
Sbjct: 24 LNYTPFLTNYRAPPSSQYG---VYYYIGLRAVSIGGKRMKLPSKLLRFDTKGNGGTIIDS 80
Query: 281 VNPYTVLETSIYKALVQAFASAMP--KVARVAPVAPFGACF 319
+TV I+K + FAS + + V + G C+
Sbjct: 81 GTTFTVFHDEIFKHIAAGFASQIEYRRAVDVEALTGMGLCY 121
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 86/383 (22%), Positives = 141/383 (36%), Gaps = 68/383 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV + TP+ + G + WV C+ V SST C
Sbjct: 177 TGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSC 236
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPN 145
+ C+ N C GG C G G++ G +D L++ S D V
Sbjct: 237 AAPACSDLNIHGCSGGHCLYGVQY---GDGSYSIGFFAMDTLTLSSYD--------AVKG 285
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
F F CG +GL G+ GLGR K +LP Q + FA CL G
Sbjct: 286 FRFGCGERN--EGLFGEAAGLLGLGRGKTSLPVQTYDKYG--GVFAHCLPARSTGT---- 337
Query: 206 DGPYYDLNNFDVSKNLKY--TPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
Y D ++ TP+ + T Y++G+T + V G+ + + +
Sbjct: 338 --GYLDFGAGSLAAASARLTTPMLTDNGPTF----------YYVGMTGIRVGGQLLSIPQ 385
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKAL-VQAFASAMPKVARVAP-VAPFGACFRL 321
++ + GT + + T L + Y +L A+ + + AP V+ C+
Sbjct: 386 SVFA-----TAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDF 440
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV---DGGVNPRTSI 378
G +++ +P + L+ Q + M +CL F DGG
Sbjct: 441 --TGMSQV--AIPTVSLLFQG-GARLDVDASGIMYAASASQVCLAFAANEDGG----DVG 491
Query: 379 VIGARQLENNLLQFDLATSRLGF 401
++G QL+ + +D+ +GF
Sbjct: 492 IVGNTQLKTFGVAYDIGKKVVGF 514
>gi|152206086|gb|ABS30428.1| xyloglucan-specific endo-beta-1,4-glucanase inhibitor protein
[Nicotiana benthamiana]
Length = 78
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 36/64 (56%)
Query: 97 AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVL 156
+ C C P NP T T G++ D++S+QSTDG NPG ++ P +F C + +L
Sbjct: 14 SPGCNNNTCSHIPYNPFIRTSTGGELAEDIVSLQSTDGSNPGNFISKPGVVFDCAPKSLL 73
Query: 157 QGLA 160
+ LA
Sbjct: 74 EKLA 77
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 99/419 (23%), Positives = 167/419 (39%), Gaps = 87/419 (20%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------------SSTNKT 84
Y + TP P+ + + G ++ WV C Y SS+++
Sbjct: 89 YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148
Query: 85 ARCGSAQC-------NLANAKA---CGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
C + C +L++ +A C G C P N +N + V ST G
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANC--TPRN--ANANNVCPPYLVVYGSGSTAG 204
Query: 135 -------RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
R PGRAV NF+ C V Q + G+AG GR ++PSQL
Sbjct: 205 LLISDTLRTPGRAVR--NFVIGCSLASVHQPPS----GLAGFGRGAPSVPSQLGLT---- 254
Query: 188 RKFALCL--SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
KF+ CL FDD A V G+ D ++Y PL +AS SV Y
Sbjct: 255 -KFSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPL----ARSASARPPY-SVYY 308
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
++ +T++ V GK+V L + + GG + + ++ + ++++ + A +A+
Sbjct: 309 YLALTAITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGG 367
Query: 306 VARVAPVAPFG----ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG-- 359
+ V G CF + T +P++ L + +V+ ++ +N V G
Sbjct: 368 RYSRSKVVEEGLGLSPCFAMPPGTKTM---ELPEMSLHFKGGSVM-NLPVENYFVVAGPA 423
Query: 360 --------GDALCLGFVDG---------GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+A+CL V + +I++G+ Q +N +++DL RLGF
Sbjct: 424 PSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGF 482
>gi|383130044|gb|AFG45742.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 44/84 (52%), Gaps = 4/84 (4%)
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
+L YTP IN ++SG+ + Y+I + V + K + L L S DN+G GGT I
Sbjct: 14 SLNYTPFLINTKASSSGY----NTFYYIDLRGVSIGRKRLNLPSKLFSFDNKGNGGTIID 69
Query: 280 TVNPYTVLETSIYKALVQAFASAM 303
+ +T+ YK + AFAS +
Sbjct: 70 SGTTFTIFNEEFYKNITAAFASQI 93
>gi|383125857|gb|AFG43519.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125863|gb|AFG43522.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125867|gb|AFG43524.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125869|gb|AFG43525.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125871|gb|AFG43526.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125873|gb|AFG43527.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125877|gb|AFG43529.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 47/101 (46%), Gaps = 5/101 (4%)
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
L YTP N S G V Y+IG+ +V + GK + L LL D +G GGT I +
Sbjct: 24 LNYTPFLTNYRAPPSSQYG---VYYYIGLRAVSIGGKRMKLPSKLLRFDTKGNGGTIIDS 80
Query: 281 VNPYTVLETSIYKALVQAFASAMP--KVARVAPVAPFGACF 319
+TV I+K + FAS + + V + G C+
Sbjct: 81 GTTFTVFHDEIFKHIAAGFASQIEYRRAVDVEALTGMGLCY 121
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 94/385 (24%), Positives = 151/385 (39%), Gaps = 72/385 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---KGYV----------SSTNKTARCGSA 90
+Y T+I TP + + G +++W+ CE + Y S + T C SA
Sbjct: 7 EYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSA 66
Query: 91 QCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
C+ +A C GG C G G + T G + L+ +T ++
Sbjct: 67 VCSQLDANDCHGGGCLYEVSYGDG-------SYTVGSYATETLTFGTT---------SIQ 110
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GA 201
N CG + V GL G G+ GLG ++ P+QL R F+ CL D G
Sbjct: 111 NVAIGCGHDNV--GLFVGAAGLLGLGAGSLSFPAQLGT--QTGRAFSYCLVDRDSESSGT 166
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG---KA 258
+ FG V +TPL N FL P+ Y++ + ++ V G +
Sbjct: 167 LEFGPE--------SVPIGSIFTPLVANP------FL--PTF-YYLSMVAISVGGVILDS 209
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
VP ++ + G GG I + T L+TS Y AL AF + + R ++ F C
Sbjct: 210 VP-SEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTC 268
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDGGVNPRTS 377
+ L + I P + N + + +N ++ + C F N
Sbjct: 269 YDLSALQSVSI----PAVGFHFSN-GAGFILPAKNCLIPMDSMGTFCFAFAPADSNLS-- 321
Query: 378 IVIGARQLENNLLQFDLATSRLGFS 402
++G Q + + FD A S +GF+
Sbjct: 322 -IMGNIQQQGIRVSFDSANSLVGFA 345
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 85/382 (22%), Positives = 152/382 (39%), Gaps = 56/382 (14%)
Query: 43 LQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGS 89
++Y+ ++ TP VP G ++ W C+ + SST C S
Sbjct: 75 VEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSS 134
Query: 90 AQC-NLANAKACG--GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
A C + ++ C +C G + G + + L++ S+ PG+AV+V +
Sbjct: 135 ATCLPVLRSRNCSTPSSLCRYGYSYS-DGAYSAGILGTETLTLGSS---VPGQAVSVSDV 190
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGD 206
F CG++ L S G GLGR ++L +QL KF+ CL+ F + + D
Sbjct: 191 AFGCGTDNGGDSLNS--TGTVGLGRGTLSLLAQLGVG-----KFSYCLTDFFNSTL---D 240
Query: 207 GPYYDLNNFDVSKN---LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
P+ +++ ++ TPL + +N + Y + + + + +P+
Sbjct: 241 SPFLLGTLAELAPGPGAVQSTPLLQSPLNPS---------RYVVSLQGITLGDVRLPIPN 291
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--PKVARVAPVAPFGACFRL 321
+ GG + + +++L S ++ +V A + P V + +P CF
Sbjct: 292 KTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSP---CFPA 348
Query: 322 QDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNPRTSIV 379
R P +P DLVL +H N M D + CL V T +
Sbjct: 349 P--AGERQLPFMP--DLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIVG---TTSTWSM 401
Query: 380 IGARQLENNLLQFDLATSRLGF 401
+G Q +N + FD+ +L F
Sbjct: 402 LGNFQQQNIQMLFDMTVGQLSF 423
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 83/372 (22%), Positives = 140/372 (37%), Gaps = 64/372 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ +I TP + G +++W C+ S+T K C S
Sbjct: 82 EYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACSSP 141
Query: 91 QCNLANAKACGGGICGAGPDNPIS-----NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
C+ + G C + S ++ + G++ +D +++QST GR V P
Sbjct: 142 VCSYSG----DGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTS----GRPVAFPR 193
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
+ CG + + V GI GLGR +L +QL A KF+ CL P G
Sbjct: 194 TVIGCGHDNA-GTFNANVSGIVGLGRGPASLVTQLGPA--TGGKFSYCLIP-------IG 243
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
G D + N + SG + P T + +AV + T
Sbjct: 244 TGSTNDSTKLNFGSNANV---------SGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTK 294
Query: 266 LSIDNEG---VGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA----PVAPFGAC 318
+ EG +GG ++ T L T + AL+ +F SA+ + + P C
Sbjct: 295 FNFP-EGASKLGGESNIIIDSGTTL-TYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYC 352
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI 378
F + +P + + + +V + +N V++ D +CL F G P +I
Sbjct: 353 FATTTDDYE-----MPPVTMHFEGADV--PLQRENLFVRLSDDTICLAF---GSFPDDNI 402
Query: 379 VIGARQLENNLL 390
I ++N L
Sbjct: 403 FIYGNIAQSNFL 414
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 65/285 (22%), Positives = 97/285 (34%), Gaps = 54/285 (18%)
Query: 7 FLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQ--------YVTQIKQRTPLVPV 58
FLLL S+ S S P ++ ++ T++ Y I P P
Sbjct: 16 FLLLSSIFPHHFSAANKNNSIPPTSIHSLISSLVYTIKGNVYPDGIYTVSINIGNPPNPY 75
Query: 59 KLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSAQCNLANAKACGGGI 104
+L + G ++ WV C+ K Y + N+ +C C G
Sbjct: 76 ELDIDTGSDLTWVQCDGPDAPCKGCTLPKDKLYKPNGNQLVKCSDPICAAVQPPFSTFGQ 135
Query: 105 CGAGPDNPI-------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSE--FV 155
A P P N + G + D + I S G N VP +F CG E F
Sbjct: 136 KCAKPIPPCVYKVEYADNAESTGALARDYMHIGSPSGSN------VPLVVFGCGYEQKFS 189
Query: 156 LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNF 215
G+ GLG K+++ SQL + + CLS G + GD F
Sbjct: 190 GPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVLGHCLSAEGGGYLFLGD-------KF 242
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
S + +TP+ + + Y G + NGK P
Sbjct: 243 IPSSGIFWTPIIQSSLEK----------HYSTGPVDLFFNGKPTP 277
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 99/402 (24%), Positives = 154/402 (38%), Gaps = 61/402 (15%)
Query: 39 DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-----KGYV----------SSTNK 83
D T QY T+++ TP ++ V G + WV+C KG V S + K
Sbjct: 82 DYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFK 141
Query: 84 TARCGSAQC-----NLANAKACGGGICGAGPDNPISNTGTHGD--IRIDVLSIQS-TDGR 135
T C + C NL + C P P S + D V + ++ T G
Sbjct: 142 TVGCFTQTCKVDLMNLFSLSTC------PTPSTPCSYDYRYADGSAAQGVFAKETITVGL 195
Query: 136 NPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC-- 193
GR + + C S F Q G+ GL S + S + F K + L
Sbjct: 196 TNGRKARLRGLLVGCSSSFSGQSFQG-ADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDH 254
Query: 194 LSPFD-DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
LS + ++FG Y + + + TPL + + P Y I + +
Sbjct: 255 LSNKNISNYLIFG---YSSSSTSTKTAPGRTTPLDLTLI---------PPF-YAINIIGI 301
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
+ + + + D GGT + + T+L + YK +V A + ++ RV P
Sbjct: 302 SIGDDMLDIPTQVW--DATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPE 359
Query: 313 A-PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGG 371
P CF GF +PQ+ L+ + H ++ +V CLGF+ G
Sbjct: 360 GIPIEYCFS-STSGFNE--SKLPQLTFHLKG-GARFEPHRKSYLVDAAPGVKCLGFMSAG 415
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
P T++V Q +N L +FDL S L F+ S TCT
Sbjct: 416 T-PATNVVGNIMQ-QNYLWEFDLMASTLSFAPS------TCT 449
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 86/306 (28%), Positives = 121/306 (39%), Gaps = 57/306 (18%)
Query: 115 NTGTHGDIRIDVLSIQSTD-GRNPGRAVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGR 171
+T T G + DVL + + D G P +A N CG LQ A+ V G+ GLG
Sbjct: 195 DTFTTGTLFEDVLHLVTEDEGLEPVKA----NITLGCGKNQTGFLQSSAA-VNGLLGLGL 249
Query: 172 SKVALPSQLAAAFSLKRKFALCLSPFDD--GAIVFGDGPYYDLNNFDVSKNLKYTPLFIN 229
++PS LA A F++C D G I FGD Y D TPL
Sbjct: 250 KDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQME---------TPLLPT 300
Query: 230 KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLET 289
EPS Y + VT V V G AV + LL++ + G +T L
Sbjct: 301 ----------EPSPTYAVSVTEVSVGGDAVGVQ--LLALFDTGTS---------FTHLLE 339
Query: 290 SIYKALVQAFASAMPKVAR-VAPVAPFGACFRLQDIGFTRIGPVVPQI-----DLVLQNK 343
Y + +AF + R + P PF C+ L T + P V + L+N
Sbjct: 340 PEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLRNP 399
Query: 344 -NVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+VW+ ++M +G L VD +N +IG + + FD LG+
Sbjct: 400 LFIVWN-EDNSAMYCLG----ILKSVDFKIN-----IIGQNFMSGYRIVFDRERMILGWK 449
Query: 403 DSLLFE 408
S FE
Sbjct: 450 RSDCFE 455
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 96/387 (24%), Positives = 148/387 (38%), Gaps = 71/387 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGSA 90
+Y ++ TP + + + G +++W+ C V S T T CGS
Sbjct: 135 EYFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSR 194
Query: 91 QC-NLANAKACGGGICGAGPDNPISNTG--THGDIRIDVLSIQSTDGRNPGRAVTVPNFI 147
C L ++ C A G T GD + L+ V +
Sbjct: 195 LCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFH---------GARVDHVA 245
Query: 148 FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--------SPFDD 199
CG + +GL G G+ GLGR ++ PSQ ++ KF+ CL S
Sbjct: 246 LGCGHDN--EGLFVGAAGLLGLGRGGLSFPSQTKNRYN--GKFSYCLVDRTSSGSSSKPP 301
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
IVFG+G V K +TPL N K++T Y++ + + V G
Sbjct: 302 STIVFGNGA--------VPKTAVFTPLLTNPKLDTF----------YYLQLLGISVGGSR 343
Query: 259 VP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
VP ++++ +D G GG I + T L S Y AL AF ++ R + F
Sbjct: 344 VPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYSLFDT 403
Query: 318 CFRLQDIGFTRIGPVVPQI---DLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
CF L + ++ VV ++ L N + ++ Q C F G
Sbjct: 404 CFDLSGMTTVKVPTVVFHFTGGEVSLPASNYLIPVNNQGR--------FCFAF--AGTMG 453
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGF 401
SI IG Q + + +DL SR+GF
Sbjct: 454 SLSI-IGNIQQQGFRVAYDLVGSRVGF 479
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 94/392 (23%), Positives = 144/392 (36%), Gaps = 67/392 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-----------------------SST 81
Y + TP + + G ++ W+ C G S+T
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTT 161
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGR-NPGRA 140
+ + RC +C + + IC S TGT G + DVL + + D P +A
Sbjct: 162 SSSIRCSDKRCFGSKKCSSPSSICPYQISYSNS-TGTKGTLLQDVLHLATEDENLTPVKA 220
Query: 141 VTVPNFIFLCGSEFV-LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF-- 197
N CG + L + V G+ GLG ++PS LA A F++C
Sbjct: 221 ----NVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIG 276
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
+ G I FGD Y D + TP FI+ PS Y + ++ V V G
Sbjct: 277 NVGRISFGDRGYTD---------QEETP-FISVA---------PSTAYGVNISGVSVAGD 317
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR-VAPVAPFG 316
P++ L + K T + +T L Y L ++F + R V P PF
Sbjct: 318 --PVDIRLFA---------KFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFE 366
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT 376
C+ L T P+V + ++ + + Q G CLG + V +
Sbjct: 367 FCYDLSPNATTIQFPLVEMT--FIGGSKIILNNPFFTARTQEGNVMYCLGVLK-SVGLKI 423
Query: 377 SIVIGARQLENNLLQFDLATSRLGFSDSLLFE 408
+ VIG + + FD LG+ SL FE
Sbjct: 424 N-VIGQNFVAGYRIVFDRERMILGWKQSLCFE 454
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 91/396 (22%), Positives = 147/396 (37%), Gaps = 77/396 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y + TP L + G +++W+ C ++G V SST + C S
Sbjct: 85 EYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSP 144
Query: 91 QCNLANAKAC-GGGICGAGPDNPIS---NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
QC C GG G G ++ + + GD+ D L+ + V N
Sbjct: 145 QCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFAND--------TYVNNV 196
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA----- 201
CG + +GL G+ G+GR K+++ +Q+A A+ F CL +
Sbjct: 197 TLGCGRD--NEGLFDSAAGLLGVGRGKISISTQVAPAYG--SVFEYCLGDRTSRSTRSSY 252
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP- 260
+VFG P + +T L N PS+ Y++ + V G+ V
Sbjct: 253 LVFGRTP--------EPPSTAFTALLSNPR--------RPSL-YYVDMAGFSVGGERVTG 295
Query: 261 LNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP---FG 316
+ L++D G GG + + + Y AL AF + F
Sbjct: 296 FSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFD 355
Query: 317 ACFRLQDIGFTRIGPVVPQI--------DLVLQNKNVVWSIHGQNSMVQIGGDALCLGF- 367
AC+ L+ R P I D+ L +N + G + CLGF
Sbjct: 356 ACYDLRG----RPAASAPLIVLHFAGGADMALPPENYFLPVDGGRR--RAASYRRCLGFE 409
Query: 368 -VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
D G++ VIG Q + + FD+ R+GF+
Sbjct: 410 AADDGLS-----VIGNVQQQGFRVVFDVEKERIGFA 440
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 79/341 (23%), Positives = 124/341 (36%), Gaps = 62/341 (18%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY 77
LS I A +F R++ L Y T ++ TP V + + G ++ WV C+
Sbjct: 75 LSQIDAGLAFSDGNSTFRISS-LGFLHYTT-VQIGTPGVKFMVALDTGSDLFWVPCDCTR 132
Query: 78 VSSTNKTARCGSAQCNLANAK--------ACGGGIC-------GAGPDNPI------SNT 116
++++ TA N+ N C +C G + P + T
Sbjct: 133 CAASDSTAFASDFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFSNCPYMVSYVSAET 192
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV---GIAGLGRSK 173
T G + DVL + D + + N IF CG + G V G+ GLG K
Sbjct: 193 STSGILVEDVLHLTQEDNHHD---LVEANVIFGCGQ--IQSGSFLDVAAPNGLFGLGMEK 247
Query: 174 VALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNT 233
+++PS L+ F++C G I FGD +D + TP +N
Sbjct: 248 ISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDE---------TPFNLNP--- 295
Query: 234 ASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYK 293
P+ Y I VT V V + + T L + +T L Y
Sbjct: 296 -----SHPT--YNITVTQVRVGTTVIDVEFTAL-----------FDSGTSFTYLVDPTYT 337
Query: 294 ALVQAFASAM-PKVARVAPVAPFGACFRLQDIGFTRIGPVV 333
L ++F S + + R PF C+ + T + P V
Sbjct: 338 RLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSV 378
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 74/301 (24%), Positives = 122/301 (40%), Gaps = 55/301 (18%)
Query: 125 DVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAF 184
DV+S + P RAV F C + + GI GLG ++L QL
Sbjct: 170 DVISFGNQSELIPQRAV------FGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKG 223
Query: 185 SLKRKFALCLSPFD--DGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEP 241
++ F+LC D GA+V G P D+ + T S + P
Sbjct: 224 AINDSFSLCYGGMDIGGGAMVLGGISPPSDM------------------IFTYSDPVRSP 265
Query: 242 SVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS 301
Y + + +HV GK +PL+ + +G G + + Y L + A A
Sbjct: 266 Y--YNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGTTYAYLPAEAFSAFKDAIMD 319
Query: 302 AMPKVARVAPVAPFGACFRLQDIGFTRIGP-------VVPQIDLVLQNKNVVWSIHGQNS 354
+ + ++ P +DI F+ G P +D+V +N + S+ +N
Sbjct: 320 EIHSLKKIDGPDP-----NFKDICFSGAGSDAAELSNKFPTVDMVFENGQKL-SLTPENY 373
Query: 355 MV---QIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD---SLLFE 408
++ G A CLG + G N +T++ +G + N L+ +D A S++GF S L+E
Sbjct: 374 FFRHSKVHG-AYCLGIFENG-NDQTTL-LGGIVVRNTLVMYDRANSKIGFWKTNCSELWE 430
Query: 409 R 409
R
Sbjct: 431 R 431
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 89/408 (21%), Positives = 152/408 (37%), Gaps = 72/408 (17%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTA--TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE- 74
L P+ + F+P+ L + T+ + +Y +++ P L + G ++ WV C
Sbjct: 120 LKPLETDSEFKPEDLQSPIISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAP 179
Query: 75 ------------KGYVSSTNKTARCGSAQCNLANAKACGGGIC------GAGPDNPISNT 116
+ S++ T C + QC + C C G G +
Sbjct: 180 CADCYQQADPIFEPASSASFSTLSCNTRQCRSLDVSECRNDTCLYEVSYGDG-------S 232
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T GD + +++ G P V N CG +GL G G+ GLG ++
Sbjct: 233 YTVGDFVTETITL----GSAP-----VDNVAIGCGHNN--EGLFVGAAGLLGLGGGSLSF 281
Query: 177 PSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASG 236
PSQ+ A F+ CL D + + N+ + N PL N
Sbjct: 282 PSQINAT-----SFSYCLVDRDSES-----ASTLEFNS-TLPPNAVSAPLLRNH------ 324
Query: 237 FLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALV 296
Y++G+T + V G+ V + ++ ID G GG + + T L+T +Y +L
Sbjct: 325 ---HLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQTDVYNSLR 381
Query: 297 QAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV 356
AF + +A F C+ L G VP + + + + +N +V
Sbjct: 382 DAFVKRTRDLPSTNGIALFDTCYDLSSKGNVE----VPTVSFHFPDGKEL-PLPAKNYLV 436
Query: 357 QIGGD-ALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGF 401
+ + C F P S +IG Q + + +DL +GF
Sbjct: 437 PLDSEGTFCFAFA-----PTASSLSIIGNVQQQGTRVVYDLVNHLVGF 479
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 74/301 (24%), Positives = 122/301 (40%), Gaps = 55/301 (18%)
Query: 125 DVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAF 184
DV+S + P RAV F C + + GI GLG ++L QL
Sbjct: 170 DVISFGNQSELIPQRAV------FGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKG 223
Query: 185 SLKRKFALCLSPFD--DGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEP 241
++ F+LC D GA+V G P D+ + T S + P
Sbjct: 224 AINDSFSLCYGGMDIGGGAMVLGGISPPSDM------------------IFTYSDPVRSP 265
Query: 242 SVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS 301
Y + + +HV GK +PL+ + +G G + + Y L + A A
Sbjct: 266 Y--YNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGTTYAYLPAEAFSAFKDAIMD 319
Query: 302 AMPKVARVAPVAPFGACFRLQDIGFTRIGP-------VVPQIDLVLQNKNVVWSIHGQNS 354
+ + ++ P +DI F+ G P +D+V +N + S+ +N
Sbjct: 320 EIHSLKKIDGPDP-----NFKDICFSGAGSDAAELSNKFPTVDMVFENGQKL-SLTPENY 373
Query: 355 MV---QIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD---SLLFE 408
++ G A CLG + G N +T++ +G + N L+ +D A S++GF S L+E
Sbjct: 374 FFRHSKVHG-AYCLGIFENG-NDQTTL-LGGIVVRNTLVMYDRANSKIGFWKTNCSELWE 430
Query: 409 R 409
R
Sbjct: 431 R 431
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 83/390 (21%), Positives = 151/390 (38%), Gaps = 63/390 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------KGYVSSTNKTAR 86
Y T++K P + + G +ILWV C SST+
Sbjct: 89 YFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIP 148
Query: 87 CGSAQCNLANAKACGGGICGA--GPDNPISNTGTHGD-------IRIDVLSIQSTDGRNP 137
C +C A G +C + P +P T T+GD D + + G N
Sbjct: 149 CSDDRCTA--ALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMG-NE 205
Query: 138 GRAVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS 195
A + + +F C + L V GI G G+ ++++ SQL + + F+ CL
Sbjct: 206 QTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLK 265
Query: 196 PFDDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
D+G +V G+ V L +TPL ++ + Y + + S+
Sbjct: 266 GSDNGGGILVLGE---------IVEPGLVFTPLVPSQPH------------YNLNLESIA 304
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
V+G+ +P++ +L + N GT + + L Y + A A+A+ R
Sbjct: 305 VSGQKLPIDSSLFATSN--TQGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSK 362
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVN 373
CF + + + P L + V ++ +N ++Q G + + G
Sbjct: 363 GI-QCF----VTTSSVDSSFPTATLYFKG-GVSMTVKPENYLLQQGSVDNNVLWCIGWQR 416
Query: 374 PRTSIVIGARQLENNLLQFDLATSRLGFSD 403
+ ++G L++ + +DLA R+G++D
Sbjct: 417 SQGITILGDLVLKDKIFVYDLANMRMGWAD 446
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 90/381 (23%), Positives = 145/381 (38%), Gaps = 70/381 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV---------------SSTNKTAR 86
+L+YV + TP VP + + G ++ W+ C+ SST
Sbjct: 109 SLEYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVP 168
Query: 87 CGSAQCNLANAKACGGGICGAGPD----NPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
C S +C A A G G P + + T T G D L++ PG
Sbjct: 169 CASGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTLA------PG--AI 220
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--G 200
V +F F CG S + G+ L L A + F+ CL + G
Sbjct: 221 VKDFYFGCGHS------KSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNSKPG 274
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
+ FG G +TP+ +V G+P+ + + + V GK +
Sbjct: 275 FLAFGAG--------RNPSGFVFTPM--GRVP------GQPTFST-VTLAGITVGGKKLD 317
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
L + S GG + + TVL++++Y+AL AF AM K R+ C+
Sbjct: 318 LRPSAFS------GGMIVDSGTVVTVLQSTVYRALRAAFREAM-KAYRLVH-GDLDTCYD 369
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVI 380
L G+ + VVP+I L + ++ N ++ G CL F + G + ++
Sbjct: 370 L--TGYKNV--VVPKIALTFSGGATI-NLDVPNGILVNG----CLAFAETGKDGTAGVLG 420
Query: 381 GARQLENNLLQFDLATSRLGF 401
Q +L FD + S+ GF
Sbjct: 421 NVNQRTFEVL-FDTSASKFGF 440
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 101/392 (25%), Positives = 152/392 (38%), Gaps = 81/392 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCG-- 88
+Y ++ TP V + + G +++W+ C S T T CG
Sbjct: 137 EYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSR 196
Query: 89 -------SAQCNLANAKACGGGIC-GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA 140
S++C +K C + G G + T GD + L+ R
Sbjct: 197 LCRRLDDSSECVTRRSKTCLYQVSYGDG-------SFTEGDFSTETLTFHG------ARV 243
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL------ 194
VP CG + +GL G G+ GLGR ++ PSQ + ++ KF+ CL
Sbjct: 244 DHVP---LGCGHD--NEGLFVGAAGLLGLGRGGLSFPSQTKSRYN--GKFSYCLVDRTSS 296
Query: 195 --SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTS 251
S IVFG N V K +TPL N K++T Y++ +
Sbjct: 297 GSSSKPPSTIVFG--------NDAVPKTSVFTPLLTNPKLDTF----------YYLQLLG 338
Query: 252 VHVNGKAVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA 310
+ V G VP ++++ +D G GG I + T L S Y AL AF K+ R
Sbjct: 339 ISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAP 398
Query: 311 PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVD 369
+ F CF L G T + VP + V S+ N ++ + + C F
Sbjct: 399 SYSLFDTCFDLS--GMTTVK--VPTVVFHFGGGEV--SLPASNYLIPVNTEGRFCFAF-- 450
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G SI IG Q + + +DL SR+GF
Sbjct: 451 AGTMGSLSI-IGNIQQQGFRVAYDLVGSRVGF 481
>gi|383125861|gb|AFG43521.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 47/101 (46%), Gaps = 5/101 (4%)
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
L YTP N S G V Y+IG+ +V + GK + L LL D +G GGT I +
Sbjct: 24 LNYTPFLTNYRAPPSSQYG---VYYYIGLRAVSIGGKRMKLPSKLLRFDAKGNGGTIIDS 80
Query: 281 VNPYTVLETSIYKALVQAFASAMP--KVARVAPVAPFGACF 319
+TV I+K + FAS + + V + G C+
Sbjct: 81 GTTFTVFHDEIFKHIAAGFASQIEYRRAVDVEALTGMGLCY 121
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 64/144 (44%), Gaps = 17/144 (11%)
Query: 162 GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--GAIVFGDGPYYDLNNFDVSK 219
G G+ GLGR ++L SQL A +F+ CL+PF D GP LN V
Sbjct: 241 GSAGLVGLGRGSLSLVSQLGAG-----RFSYCLTPFQDTNSTSTLLLGPSAALNGTGV-- 293
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
+ TP AS S Y++ +T + + KA+P++ S+ +G GG I
Sbjct: 294 --RSTPFV------ASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIID 345
Query: 280 TVNPYTVLETSIYKALVQAFASAM 303
+ T L + Y+ + A S +
Sbjct: 346 SGTTITSLANAAYQQVRAAVKSQL 369
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 84/380 (22%), Positives = 144/380 (37%), Gaps = 57/380 (15%)
Query: 43 LQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCGS 89
++Y+ ++ P VP G ++ W C+ + SST C S
Sbjct: 69 VEYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSS 128
Query: 90 AQCNLANAKACGGGICGAGPDNPISNTGTHGD--IRIDVLSIQSTDGRNPGRAVTVPNFI 147
A C ++ C P + +GD +L ++ V+V
Sbjct: 129 ATCLPIWSRNCT-------PSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVA 181
Query: 148 FLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDG 207
F CG++ L S G GLGR ++L +QL KF+ CL+ F + A+ D
Sbjct: 182 FGCGTDNGGDSLNS--TGTVGLGRGTLSLLAQLGVG-----KFSYCLTDFFNSAL---DS 231
Query: 208 PYYDLNNFDVS---KNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
P+ +++ ++ TPL + N + YF+ + + + +P+
Sbjct: 232 PFLLGTLAELAPGPSTVQSTPLLQSPQNPS---------RYFVSLQGISLGDVRLPIPNG 282
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
+ +G GG + + +T+L S ++ +V +VARV P A L
Sbjct: 283 TFDLRGDGTGGMIVDSGTTFTILAESGFREVVG-------RVARVLGQPPVNAS-SLDAP 334
Query: 325 GFTRIGPVVPQI-DLVLQ-NKNVVWSIHGQNSMVQIGGD-ALCLGFVDGGVNPRTSIVIG 381
F P + DLVL ++ N M D + CL G P ++ V+G
Sbjct: 335 CFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIA--GTTPESTSVLG 392
Query: 382 ARQLENNLLQFDLATSRLGF 401
Q +N + FD +L F
Sbjct: 393 NFQQQNIQMLFDTTVGQLSF 412
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 75/317 (23%), Positives = 116/317 (36%), Gaps = 48/317 (15%)
Query: 57 PVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNT 116
P LT G W++ SST+ C S+ C LAN + C N+
Sbjct: 138 PTYLTKRDNGK-FWLNHYSSNASSTSIRVPCSSSLCELANQCSSNKSSCPYQTHYLSENS 196
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV---GIAGLGRSK 173
+ G + D+L + + D + V V CG V G S V G+ GLG K
Sbjct: 197 SSAGYLVQDILHMATDDSQLKPVDVKV---TLGCGK--VQTGKFSNVTAPNGLIGLGMGK 251
Query: 174 VALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNT 233
V++PS LA+ F++C + G I FGD I V
Sbjct: 252 VSVPSFLASQGLTTDSFSMCFGYYGYGRIDFGD---------------------IGPVGQ 290
Query: 234 ASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYK 293
S+ Y + + + V + P N L +I + G +T L Y
Sbjct: 291 RETPFNPASLSYNVTILQIIVTNR--PTNVHLTAIIDSGAS---------FTYLTDPFYS 339
Query: 294 ALVQAFASAMPKVARVAPVA--PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHG 351
+ + +AM ++ R+ + PF C+RL + P ++ ++ I
Sbjct: 340 IITENMDAAM-ELERIKSDSDFPFEYCYRLSLATIFQ----QPNLNFTMEGGRKFDVITS 394
Query: 352 QNSMVQIGGDALCLGFV 368
S+ G ALCL V
Sbjct: 395 YVSVDTDDGPALCLAIV 411
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 93/409 (22%), Positives = 160/409 (39%), Gaps = 72/409 (17%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTA--TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-- 73
L P + F AL V T+ + +Y ++ P + + G ++ W+ C
Sbjct: 120 LHPAESNAEFEANALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAP 179
Query: 74 -EKGY---------VSSTNKTA-RCGSAQCNLANAKACGGGIC------GAGPDNPISNT 116
+ Y VSS + + RC + QC + C G C G G +
Sbjct: 180 CSECYQQSDPIFDPVSSNSYSPIRCDAPQCKSLDLSECRNGTCLYEVSYGDG-------S 232
Query: 117 GTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVAL 176
T G+ + +++ + V N CG +GL G G+ GLG K++
Sbjct: 233 YTVGEFATETVTLGT---------AAVENVAIGCGHNN--EGLFVGAAGLLGLGGGKLSF 281
Query: 177 PSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASG 236
P+Q+ A F+ CL D A+ + N + +N+ PL N
Sbjct: 282 PAQVNAT-----SFSYCLVNRDSDAVSTLEF------NSPLPRNVVTAPLRRNP------ 324
Query: 237 FLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALV 296
E Y++G+ + V G+A+P+ +++ +D G GG I + T L + +Y AL
Sbjct: 325 ---ELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDALR 381
Query: 297 QAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV 356
AF + + V+ F C+ L +R VP + + + +N ++
Sbjct: 382 DAFVKGAKGIPKANGVSLFDTCYDLS----SRESVQVPTVSFHFPEGREL-PLPARNYLI 436
Query: 357 QIGG-DALCLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
+ C F P TS ++G Q + + FD+A S +GFS
Sbjct: 437 PVDSVGTFCFAFA-----PTTSSLSIMGNVQQQGTRVGFDIANSLVGFS 480
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 85/357 (23%), Positives = 145/357 (40%), Gaps = 51/357 (14%)
Query: 57 PVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNT 116
P KL V G +++W C+ SST AR GS + G + T
Sbjct: 52 PRKLIVDTGSDLIWTQCK--LSSSTAAAARHGSPPLSRTAPARTGA----------FTRT 99
Query: 117 GTHGDIRIDVLSIQS-TDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVA 175
T + VL+ ++ T G RAV++ F CG+ + G G GI GL ++
Sbjct: 100 CTASAAAVGVLASETFTFGAR--RAVSL-RLGFGCGA--LSAGSLIGATGILGLSPESLS 154
Query: 176 LPSQLAAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVN 232
L +QL ++F+ CL+PF D ++FG DL+ ++ ++ T + N V
Sbjct: 155 LITQLKI-----QRFSYCLTPFADKKTSPLLFG--AMADLSRHKTTRPIQTTAIVSNPVE 207
Query: 233 TASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
T V Y++ + + + K + + L++ +G GGT + + + L + +
Sbjct: 208 T---------VYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAF 258
Query: 293 KALVQAFASA--MPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSI 349
+A+ +A +P R V + CF L V LVL + +
Sbjct: 259 EAVKEAVMDVVRLPVANRT--VEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVL 316
Query: 350 HGQNSMVQIGGDALCLGF---VDG-GVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
N + +CL DG GV+ +IG Q +N + FD+ + F+
Sbjct: 317 PRDNYFQEPRAGLMCLAVGKTTDGSGVS-----IIGNVQQQNMHVLFDVQHHKFSFA 368
>gi|32488713|emb|CAE03456.1| OSJNBa0088H09.14 [Oryza sativa Japonica Group]
Length = 490
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 150/359 (41%), Gaps = 59/359 (16%)
Query: 69 LWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRI-DVL 127
LW+ ++S + C A C NA A +C P + +G+ + I D L
Sbjct: 156 LWIH-SPDHLSDCRAASSCPGANCTPRNANA--NNVCP--PYLVVYGSGSTAGLLISDTL 210
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
R PGRAV NF+ C V Q + G+AG GR ++PSQL
Sbjct: 211 -------RTPGRAVR--NFVIGCSLASVHQPPS----GLAGFGRGAPSVPSQLGLT---- 253
Query: 188 RKFALCL--SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
KF+ CL FDD A V G+ D ++Y PL +AS SV Y
Sbjct: 254 -KFSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPL----ARSASARPPY-SVYY 307
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
++ +T++ V GK+V L + + GG + + ++ + ++++ + A +A+
Sbjct: 308 YLALTAITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGG 366
Query: 306 VARVAPVAPFG----ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG-- 359
+ V G CF + T +P++ L + +V+ ++ +N V G
Sbjct: 367 RYSRSKVVEEGLGLSPCFAMPPGTKTM---ELPEMSLHFKGGSVM-NLPVENYFVVAGPA 422
Query: 360 --------GDALCLGFVDG---------GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+A+CL V + +I++G+ Q +N +++DL RLGF
Sbjct: 423 PSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGF 481
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 146/376 (38%), Gaps = 52/376 (13%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-----KGYVSSTNKTARCGSAQCNLANAKA 99
YV ++K TP + + + + WV C STN ++ GS C++A
Sbjct: 97 YVVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTFSTNTSSTYGSLDCSMAQCTQ 156
Query: 100 CGGGICGA-GPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVT--VPNFIFLCGSEFVL 156
G C A G + + N GD S +T + R V +PNF F C +
Sbjct: 157 VRGFSCPATGSSSCVFNQSYGGDS-----SFSATLVEDSLRLVNDVIPNFAFGC-----I 206
Query: 157 QGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK-FALCLSPFDD----GAIVFGDGPYYD 211
++ G V GL S +A + SL F+ CL F G++ G
Sbjct: 207 NSISGGSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPA---- 262
Query: 212 LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE 271
K+++YTPL N PS+ Y++ +T V V VP+ LL+ +
Sbjct: 263 ----GQPKSIRYTPLLRNP--------HRPSL-YYVNLTGVSVGRTLVPIAPELLAFNPN 309
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGP 331
GT I + T IY A+ F + + + F CF +
Sbjct: 310 TGAGTIIDSGTVITRFVQPIYTAIRDEFRKQV--AGPFSSLGAFDTCFAATNEA------ 361
Query: 332 VVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGFVDGGVNPRTSI-VIGARQLENNL 389
V P + L N+V + +NS++ +L CL N + + VI Q +N
Sbjct: 362 VAPAVTLHFTGLNLVLPM--ENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLR 419
Query: 390 LQFDLATSRLGFSDSL 405
L FD+ SRLG + L
Sbjct: 420 LLFDVPNSRLGIAREL 435
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 75/322 (23%), Positives = 118/322 (36%), Gaps = 63/322 (19%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY---------------------VSSTNK 83
+ T +K TP + + + G ++ WV C+ G VS+TNK
Sbjct: 107 HYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNK 166
Query: 84 TARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
C ++ C N C + T T G + DV+ + +T+ +NP R
Sbjct: 167 KVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHL-TTEDKNPERVEAY 225
Query: 144 PNFIFLCGSEFVLQGLASGVV---GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
F CG V G + G+ GLG K+++PS LA + F++C G
Sbjct: 226 --VTFGCGQ--VQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVG 281
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
I FGD S + + TP +N P+ Y I VT V V +
Sbjct: 282 RISFGDKG---------SSDQEETPFNLNP--------SHPN--YNITVTRVRVGTTLID 322
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS-AMPKVARVAPVAPFGACF 319
T L T +T L +Y + ++F S A K PF C+
Sbjct: 323 DEFTAL-----------FDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCY 371
Query: 320 RLQDIGFTRIGPVVPQIDLVLQ 341
D+ ++P + L ++
Sbjct: 372 ---DMSNDANASLIPSLSLTMK 390
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/267 (26%), Positives = 111/267 (41%), Gaps = 35/267 (13%)
Query: 143 VPNFIFLCGSEFVLQGLA-SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-- 199
+ F+F CG+ QG G G+ GLGRS+++L SQ F + L LS D
Sbjct: 255 IDGFVFGCGTSN--QGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDAS 312
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
G++V GD P N+ V ++ + V+ + L P Y + +T + V G+ V
Sbjct: 313 GSLVLGDDPSAYRNSTPV--------VYTSMVSNSDPLLQGPF--YLVNLTGITVGGQEV 362
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLET---SIYKALVQAFASAMPKVARVAPVAPFG 316
E G + + V+ TV+ + S+Y A+ F S + + + +
Sbjct: 363 -----------ESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILD 411
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQN-KNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
CF + G + VP + LV V G V +CL
Sbjct: 412 TCFNM--TGLKEV--QVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDE 467
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFS 402
TSI IG Q +N + FD + S++GF+
Sbjct: 468 TSI-IGNYQQKNLRVVFDTSASQVGFA 493
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 80/356 (22%), Positives = 140/356 (39%), Gaps = 66/356 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY------------------VSSTNKTAR 86
Y T+++ TP V + + G ++LWV C SST+
Sbjct: 25 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIA 84
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNT-------GTHGDIRIDVLSIQSTDGRNPGR 139
C +CN N C + +N S T GT G D++ + + +
Sbjct: 85 CSDQRCN--NGIQSSDATC-SSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTT 141
Query: 140 AVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
T P +F C ++ L V GI G G+ ++++ SQL++ R F+ CL
Sbjct: 142 NSTAP-VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGD 200
Query: 198 DDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
G +V G+ V N+ YT L +P Y + + S+ VN
Sbjct: 201 SSGGGILVLGE---------IVEPNIVYTSL----------VPAQP--HYNLNLQSIAVN 239
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF 315
G+ + ++ ++ + N GT + + L Y V A +++P+ A V+
Sbjct: 240 GQTLQIDSSVFATSNS--RGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTA-VSRG 296
Query: 316 GACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ---IGGDAL-CLGF 367
C+ + + + V PQ+ L + + Q+ ++Q IGG A+ C+GF
Sbjct: 297 NQCYLIT----SSVTEVFPQVSLNFAGGASMI-LRPQDYLIQQNSIGGAAVWCIGF 347
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 93/385 (24%), Positives = 149/385 (38%), Gaps = 72/385 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y T+I TP + + G +++W+ CE S + T C SA
Sbjct: 153 EYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSA 212
Query: 91 QCNLANAKACGGGIC------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
C+ +A C GG C G G + T G + L+ +T ++
Sbjct: 213 VCSQLDANDCHGGGCLYEVSYGDG-------SYTVGSYATETLTFGTT---------SIQ 256
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GA 201
N CG + V GL G G+ GLG ++ P+QL R F+ CL D G
Sbjct: 257 NVAIGCGHDNV--GLFVGAAGLLGLGAGSLSFPAQLGT--QTGRAFSYCLVDRDSESSGT 312
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG---KA 258
+ FG V +TPL N FL P+ Y++ + ++ V G +
Sbjct: 313 LEFGPE--------SVPIGSIFTPLVANP------FL--PTF-YYLSMVAISVGGVILDS 355
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
VP ++ + G GG I + T L+TS Y AL AF + + R ++ F C
Sbjct: 356 VP-SEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTC 414
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-DALCLGFVDGGVNPRTS 377
+ L + I P + N + + +N ++ + C F N
Sbjct: 415 YDLSALQSVSI----PAVGFHFSN-GAGFILPAKNCLIPMDSMGTFCFAFAPADSNLS-- 467
Query: 378 IVIGARQLENNLLQFDLATSRLGFS 402
++G Q + + FD A S +GF+
Sbjct: 468 -IMGNIQQQGIRVSFDSANSLVGFA 491
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 85/388 (21%), Positives = 144/388 (37%), Gaps = 66/388 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------KGYVSSTNKTAR 86
Y +I TP + V G +ILWV+C S+T+
Sbjct: 155 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 214
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPIS-----NTGTHGDIRIDVLSIQSTDGRNPGRAV 141
C C+L + G C G S + T G D + G N
Sbjct: 215 CDDNFCSLYDGPLPG---CKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISG-NFQTTP 270
Query: 142 TVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
T +F CG++ L + + GI G G++ ++ SQLA++ +K+ F+ CL D
Sbjct: 271 TNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDG 330
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
G I F G V + TPL N+ + Y + + + V G
Sbjct: 331 GGI-FAIGEV-------VEPKVNITPLVQNQAH------------YNVVMKEIEVGGD-- 368
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
PL+ + ++ GT I + +Y L++ S P + R+ V CF
Sbjct: 369 PLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCF 427
Query: 320 RLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT 376
D GF P + L +K++ +++ + Q+ C+G+ + G +
Sbjct: 428 DYTGNVDDGF-------PTVTLHF-DKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKD 479
Query: 377 S---IVIGARQLENNLLQFDLATSRLGF 401
++G L N L+ +DL +G+
Sbjct: 480 GKDLTLLGDLVLSNKLVVYDLEKQGIGW 507
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 117/279 (41%), Gaps = 51/279 (18%)
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
G +VP F CG F S GIAG GR ++LPSQL F+ C +
Sbjct: 238 GAGASVPGVAFGCGL-FNNGVFKSNETGIAGFGRGPLSLPSQLKVG-----NFSHCFTAV 291
Query: 198 D--DGAIVFGDGPYYDLNNFDVSKN----LKYTPLFINKVNTASGFLGEPSVEYFIGVTS 251
+ + V DL D+ KN ++ TPL N N P++ Y++ +
Sbjct: 292 NGLKQSTVL-----LDLLA-DLYKNGRGAVQSTPLIQNSAN--------PTL-YYLSLKG 336
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+ V +P+ ++ ++ N G GGT I + T L +Y+ + FA+ + P
Sbjct: 337 ITVGSTRLPVPESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-----LP 390
Query: 312 VAPFGA-----CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA---- 362
V P A CF ++ P VP+ LVL + + +N + ++ DA
Sbjct: 391 VVPGNATGPYTCFSAP----SQAKPDVPK--LVLHFEGATMDLPRENYVFEVPDDAGNSM 444
Query: 363 LCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+CL + G T IG Q +N + +DL + L F
Sbjct: 445 ICLAINELGDERAT---IGNFQQQNMHVLYDLQNNMLSF 480
>gi|125572775|gb|EAZ14290.1| hypothetical protein OsJ_04214 [Oryza sativa Japonica Group]
Length = 465
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 82/180 (45%), Gaps = 39/180 (21%)
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD----DGAI 202
+F CG + V G SGV G+ GLGR ++L SQL +F+ +P D I
Sbjct: 163 VFGCGLKNV--GDFSGVSGVIGLGRGNLSLVSQLQV-----DRFSYHFAPDDSVDTQSFI 215
Query: 203 VFGDGPYYDLNNFDVSKNLKYT---PLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
+FGD D + +T L + N PS+ Y++ + + V+GK +
Sbjct: 216 LFGD---------DATPQTSHTLSTRLLASDAN--------PSL-YYVELAGIQVDGKDL 257
Query: 260 PLNKTLLSIDN-EGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKV----ARVAPV 312
+ + N +G GG +S + TVLE + YK L QA AS +P V AR PV
Sbjct: 258 AIPSGTFDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGLGARARPV 317
>gi|56542455|gb|AAV92892.1| Avr9/Cf-9 rapidly elicited protein 36, partial [Nicotiana tabacum]
Length = 191
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/185 (22%), Positives = 82/185 (44%), Gaps = 17/185 (9%)
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
NL +T L K N F Y++ + SV V G+ + + + ++ EGVGGT I
Sbjct: 15 NLNFTSLVGGKENHLETF-------YYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIID 67
Query: 280 TVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLV 339
+ + Y+ + QAF + + + + C+ + + + P +V
Sbjct: 68 SGTTLSYFAEPAYEIIKQAFVNKVKRYPILDDFPILKPCYNVSGVEKLEL----PSFGIV 123
Query: 340 LQNKNVVWSIHGQNSMVQIG-GDALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATS 397
+ +W+ +N +++ D +CL + P +++ +IG Q +N + +D S
Sbjct: 124 FGD-GAIWTFPVENYFIKLEPEDIVCLAILG---TPHSAMSIIGNYQQQNFHILYDTKRS 179
Query: 398 RLGFS 402
RLGF+
Sbjct: 180 RLGFA 184
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 84/334 (25%), Positives = 129/334 (38%), Gaps = 57/334 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ + TP V +++WV C + + SST C S
Sbjct: 89 EYLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSCDSQ 148
Query: 91 QCNLANAKACG--GGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
C +N C G +C +NT G VL +S + + VT P IF
Sbjct: 149 PCTSSNIYYCPLVGNLCL------YTNTYGDGSSTKGVLCTESIHFGS--QTVTFPKTIF 200
Query: 149 LCGS--EFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV--- 203
CGS +F+ Q +++ V GI GLG ++L SQL + KF+ CL PF + +
Sbjct: 201 GCGSNNDFMHQ-ISNKVTGIVGLGAGPLSLVSQLGD--QIGHKFSYCLLPFTSTSTIKLK 257
Query: 204 FGDGPYYDLNNFDVSKN-LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
FG N+ ++ N + TPL I+ PS YF+ + + + K + +
Sbjct: 258 FG-------NDTTITGNGVVSTPLIIDP--------HYPSY-YFLHLVGITIGQKMLQVR 301
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM--PKVARVAPVAPFGACFR 320
T + G I T LE + Y V A+ + P PF CF
Sbjct: 302 TT-----DHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDDIPY-PFDFCFP 355
Query: 321 LQ-DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQN 353
Q +I F +I + L KN+ + N
Sbjct: 356 NQANITFPKIVFQFTGAKVFLSPKNLFFRFDDLN 389
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 87/383 (22%), Positives = 138/383 (36%), Gaps = 68/383 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARC 87
T YV + TP + G + WV C+ V SST C
Sbjct: 175 TGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSC 234
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTH--GDIRIDVLSIQSTDGRNPGRAVTVPN 145
+ C+ N C GG C G G++ G +D L++ S D V
Sbjct: 235 AAPACSDLNIHGCSGGHCLYGVQY---GDGSYSIGFFAMDTLTLSSYD--------AVKG 283
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
F F CG +GL G+ GLGR K +LP Q + FA CL G
Sbjct: 284 FRFGCGERN--EGLFGEAAGLLGLGRGKTSLPVQTYDKYG--GVFAHCLPARSTGT---- 335
Query: 206 DGPYYDLNNFDVSKNLKY--TPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
Y D + TP+ + T Y+IG+T + V G+ + + +
Sbjct: 336 --GYLDFGAGSPAAASARLTTPMLTDNGPTF----------YYIGMTGIRVGGQLLSIPQ 383
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKAL-VQAFASAMPKVARVAP-VAPFGACFRL 321
++ + GT + + T L Y +L A+ + + AP V+ C+
Sbjct: 384 SVFA-----TAGTIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDF 438
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV---DGGVNPRTSI 378
G +++ +P + L+ Q + M +CL F DGG
Sbjct: 439 --TGMSQV--AIPTVSLLFQG-GARLDVDASGIMYAASASQVCLAFAANEDGG----DVG 489
Query: 379 VIGARQLENNLLQFDLATSRLGF 401
++G QL+ + +D+ +GF
Sbjct: 490 IVGNTQLKTFGVAYDIGKKVVGF 512
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 82/395 (20%), Positives = 162/395 (41%), Gaps = 74/395 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCG---------SAQCNLA 95
Y T++K +P + + G +ILW++C ++ +N G +A + A
Sbjct: 83 YFTKVKLGSPAKEFYVQIDTGSDILWINC----ITCSNCPHSSGLGIELDFFDTAGSSTA 138
Query: 96 NAKACGGGICGAGPDNPISN-----------------TGTHGDIRIDVLSIQSTDGRNPG 138
+CG IC S +GT G D + +
Sbjct: 139 ALVSCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSV 198
Query: 139 RAVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSP 196
A + IF C + L V GI G G +++ SQL++ + F+ CL
Sbjct: 199 VANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKG 258
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
++G V G + ++ Y+PL ++ + Y + + S+ VNG
Sbjct: 259 GENGGGVLVLGEI-------LEPSIVYSPLVPSQPH------------YNLNLQSIAVNG 299
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG 316
+ +P++ + + N GT + + L Y V+A +A+ + ++ P+ G
Sbjct: 300 QLLPIDSNVFATTNN--QGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSK--PIISKG 355
Query: 317 -ACFRLQDIGFTRIGPVVPQIDL-VLQNKNVVWSIHGQNSMVQIG---GDAL-CLGF--V 368
C+ + + +G + PQ+ L + ++V ++ ++ ++ G G A+ C+GF V
Sbjct: 356 NQCYLVSN----SVGDIFPQVSLNFMGGASMV--LNPEHYLMHYGFLDGAAMWCIGFQKV 409
Query: 369 DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
+ G ++G L++ + +DLA R+G++D
Sbjct: 410 EQGFT-----ILGDLVLKDKIFVYDLANQRIGWAD 439
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 105/274 (38%), Gaps = 39/274 (14%)
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SPFDD 199
T+P F+ C + Q GIAG GRS +LPSQL +KF+ CL FDD
Sbjct: 214 TIPGFLVGCSLFSIRQ-----PEGIAGFGRSPESLPSQLGL-----KKFSYCLVSHAFDD 263
Query: 200 ----GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
+V G D + L YTP N + Y++ + ++ +
Sbjct: 264 TPASSDLVLDTGSGSDDTK---TPGLSYTPFQKNPTAAFRDY-------YYVLLRNIVIG 313
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV--- 312
V + L ++G GGT + + +T +E +Y+ + + F + V
Sbjct: 314 DTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQ 373
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD--- 369
CF + G + VP+ + ++ N + +CL V
Sbjct: 374 TGLRPCFNIS--GEKSVS--VPEFIFHFKG-GAKMALPLANYFSFVDSGVICLTIVSDNM 428
Query: 370 --GGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G+ +I++G Q N ++FDL R GF
Sbjct: 429 SGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGF 462
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/269 (23%), Positives = 109/269 (40%), Gaps = 43/269 (15%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGSA 90
Y ++ +P + V G ++ W+ C+ V S T K+ C S+
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC+ +C N T ++GD + S D + T+P F++ C
Sbjct: 178 QCSSLVDATLNNPLCETS-SNVCVYTASYGDSSYS-MGYLSQDLLTLAPSQTLPGFVYGC 235
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYY 210
G + GL GI GLGR+K+++ Q+++ F F+ CL G G +
Sbjct: 236 GQD--SDGLFGRAAGILGLGRNKLSMLGQVSSKFGY--AFSYCLP-------TRGGGGFL 284
Query: 211 DLNNFDVSKN-LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSID 269
+ ++ + K+TP+ + G PS+ YF+ +T++ V G+A+ + +
Sbjct: 285 SIGKASLAGSAYKFTPMTTDP--------GNPSL-YFLRLTAITVGGRALGVAAAQYRVP 335
Query: 270 NEGVGGTKISTVNPYTVLETSIYKALVQA 298
GT I T L S+Y QA
Sbjct: 336 TIIDSGTVI------TRLPMSVYTPFQQA 358
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 145/382 (37%), Gaps = 72/382 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY----VSSTNKTARCGSAQCNLANA 97
TL+YV + +P V + + G ++ W+ C+ SST C + C
Sbjct: 128 TLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCKSRLYDPGTSSTYAPFSCSAPACAQLGR 187
Query: 98 KACGGGICGAGPDNPIS--------NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
+ G C +G S TGT+G D L++ T + F F
Sbjct: 188 RGTG---CSSGSTCVYSVKYGDGSNTTGTYGS---DTLTLAGTS------EPLISGFQFG 235
Query: 150 CGSEFVLQGLAS-GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF--DDGAIVFGD 206
C + V G G+ GLG + SQ AA + F+ CL P G + G
Sbjct: 236 CSA--VEHGFEEDNTDGLMGLGGDAQSFVSQTAATYG--SAFSYCLPPTWNSSGFLTLG- 290
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLL 266
+ S TP+ +K + + Y + + + V GK + + ++
Sbjct: 291 -----APSSSTSAAFSTTPMLRSK---------QAATFYGLLLRGISVGGKTLEIPSSVF 336
Query: 267 SIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFG---ACFRLQD 323
S + GT I+ + P + Y AL AF M + + P AP G CF
Sbjct: 337 SAGSIVDSGTVITRLPP------TAYGALSAAFRDGMARY-QYQPAAPRGLLDTCFDFTG 389
Query: 324 IG----FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIV 379
G FT VP + LVL VV +H N +VQ G CL F + RT I+
Sbjct: 390 HGEGNNFT-----VPSVALVLDGGAVV-DLH-PNGIVQDG----CLAFAATDDDGRTGII 438
Query: 380 IGARQLENNLLQFDLATSRLGF 401
+Q +L +D+ S GF
Sbjct: 439 GNVQQRTFEVL-YDVGQSVFGF 459
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 79/340 (23%), Positives = 130/340 (38%), Gaps = 68/340 (20%)
Query: 33 VLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------KGYVSSTNKT 84
+ + D L Y T I TP + + G ++LW+ C+ Y S+ ++
Sbjct: 86 TMSLGNDFGWLHY-TWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAPLSSSYYSNLDRD 144
Query: 85 ARCGSAQCNLANAK-ACGGGICGAGPDNPIS-------------NTGTHGDIRIDVLSIQ 130
S +L++ +C +C G + S NT + G + D+L +Q
Sbjct: 145 LNEYSPSRSLSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQ 204
Query: 131 STDGRNPGRAVTVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
S G +V P + CG S L G+A G+ GLG + ++PS LA + +
Sbjct: 205 S-GGTLSNSSVQAP-VVLGCGMKQSGGYLDGVAPD--GLLGLGPGESSVPSFLAKSGLIH 260
Query: 188 RKFALCLSPFDDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
F+LC + D G + FGD GP ++ + PL G Y
Sbjct: 261 YSFSLCFNEDDSGRMFFGDQGP-------TSQQSTSFLPL--------DGLYS----TYI 301
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKV 306
IGV S + + + +D+ GT +T L +Y A+ + F +
Sbjct: 302 IGVESCCIGNSCLKMTSFKAQVDS----GTS------FTFLPGHVYGAITEEFDQQVNGS 351
Query: 307 ARVAPVAPFGACF--RLQDIGFTRIGPVVPQIDLVLQNKN 344
+P+ C+ QD+ P VP L+ Q N
Sbjct: 352 RSSFEGSPWEYCYVPSSQDL------PKVPSFTLMFQRNN 385
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 88/215 (40%), Gaps = 35/215 (16%)
Query: 120 GDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQ 179
G + DVL N +P F F C + +GIAG GR ++LP Q
Sbjct: 139 GSLTRDVLFTHGNYNNNNNNNKQIPRFCFGC-----VGATYREPIGIAGFGRGLLSLPFQ 193
Query: 180 LAAAFSLKRKFALCLSPFDDGAIVFGDGPYYD----LNNFDVS---KNLKYTPLFINKVN 232
L + + F+ C PF F + P + L N +S +NL++TPL + +
Sbjct: 194 LGFS---HKGFSHCFLPFK-----FSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPM- 244
Query: 233 TASGFLGEPSVEYFIGVTSVHV----NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLE 288
P+ Y+IG+ S+ + N ++ L ID +G GG I + YT L
Sbjct: 245 -------YPNY-YYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLP 296
Query: 289 TSIYKALVQ--AFASAMPKVARVAPVAPFGACFRL 321
+Y L+ P+ +V F C+++
Sbjct: 297 EPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKV 331
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 85/388 (21%), Positives = 144/388 (37%), Gaps = 66/388 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------KGYVSSTNKTAR 86
Y +I TP + V G +ILWV+C S+T+
Sbjct: 74 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 133
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPIS-----NTGTHGDIRIDVLSIQSTDGRNPGRAV 141
C C+L + G C G S + T G D + G N
Sbjct: 134 CDDNFCSLYDGPLPG---CKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISG-NFQTTP 189
Query: 142 TVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
T +F CG++ L + + GI G G++ ++ SQLA++ +K+ F+ CL D
Sbjct: 190 TNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDG 249
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
G I F G V + TPL N+ + Y + + + V G
Sbjct: 250 GGI-FAIGEV-------VEPKVNITPLVQNQAH------------YNVVMKEIEVGGD-- 287
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
PL+ + ++ GT I + +Y L++ S P + R+ V CF
Sbjct: 288 PLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCF 346
Query: 320 RLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT 376
D GF P + L +K++ +++ + Q+ C+G+ + G +
Sbjct: 347 DYTGNVDDGF-------PTVTLHF-DKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKD 398
Query: 377 S---IVIGARQLENNLLQFDLATSRLGF 401
++G L N L+ +DL +G+
Sbjct: 399 GKDLTLLGDLVLSNKLVVYDLEKQGIGW 426
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 88/388 (22%), Positives = 137/388 (35%), Gaps = 75/388 (19%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGY-------------VSSTNKTARCGSAQCNLANAKA 99
TP P + + G ++W C SST + CG+ C
Sbjct: 75 TPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIPTSN 134
Query: 100 CGGGICG-AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQG 158
C +C G N S G H L I +TD G A F + S G
Sbjct: 135 CSSNMCTYEGTIN--SKLGGH------TLGIVATDTFAIGTATASLGFGCVVASGIDTMG 186
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA---IVFGDGPYYDLNNF 215
SG++G LGR+ +L SQ+ KF+ CL+P D G ++ G
Sbjct: 187 GPSGLIG---LGRAPSSLVSQMNIT-----KFSYCLTPHDSGKNSRLLLGSS-----AKL 233
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV---PLNKTLLSIDNEG 272
N TP V T+ G + S Y I + + A+ P T+L
Sbjct: 234 AGGGNSTTTPF----VKTSPG--DDMSQYYPIQLDGIKAGDAAIALPPSGNTVL------ 281
Query: 273 VGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPV 332
+ T+ P + L S Y+AL + A+ P+ PF CF G +
Sbjct: 282 -----VQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCF--PKAGLSNAS-- 332
Query: 333 VPQIDLVLQNKNVVWSIHGQNSMVQIGGD--ALCLGFVDGGVNPRTSI-----VIGARQL 385
P + Q ++ ++ +G + +C+ + T++ ++G+ Q
Sbjct: 333 APDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQ 392
Query: 386 ENNLLQFDLATSRLGFSDSLLFERATCT 413
EN DL +L FE A C+
Sbjct: 393 ENTHFLLDLE------KKTLSFEPADCS 414
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 82/396 (20%), Positives = 156/396 (39%), Gaps = 76/396 (19%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDC--------------EKGYV----SSTNKTAR 86
Y T++K +P + + G +ILW++C E + SST
Sbjct: 83 YFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVS 142
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD------------IRIDVLSIQSTDG 134
C C+ A A G C + N S T +GD + D + + +
Sbjct: 143 CADPICSYAVQTATSG--CSSQA-NQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMV 199
Query: 135 RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL 194
N + + G L V GI G G +++ SQL++ + F+ CL
Sbjct: 200 ANSSSTIVFGCSTYQSGD---LTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL 256
Query: 195 SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV-EYFIGVTSVH 253
++G V G + ++ Y+PL PS+ Y + + S+
Sbjct: 257 KGGENGGGVLVLGEI-------LEPSIVYSPLV-------------PSLPHYNLNLQSIA 296
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
VNG+ +P++ + + N GT + + L Y V A +A+ + ++ P+
Sbjct: 297 VNGQLLPIDSNVFATTNN--QGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSK--PII 352
Query: 314 PFG-ACFRLQDIGFTRIGPVVPQIDL-VLQNKNVVWSIHGQNSMVQIG----GDALCLGF 367
G C+ + + +G + PQ+ L + ++V ++ ++ ++ G C+GF
Sbjct: 353 SKGNQCYLVSN----SVGDIFPQVSLNFMGGASMV--LNPEHYLMHYGFLDSAAMWCIGF 406
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
R ++G L++ + +DLA R+G++D
Sbjct: 407 QK---VERGFTILGDLVLKDKIFVYDLANQRIGWAD 439
>gi|413952261|gb|AFW84910.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 298
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 108/247 (43%), Gaps = 37/247 (14%)
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA--IVFGDGPYYDLNNFDVSKN 220
V GI G G+ ++++ SQL + + F+ CL D+G +V G+ V
Sbjct: 24 VDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGE---------IVEPG 74
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
L YTPL + +P Y + + S+ VNG+ +P++ +L + N GT + +
Sbjct: 75 LVYTPLVPS----------QP--HYNLNLESIAVNGQKLPIDSSLFTTSN--TQGTIVDS 120
Query: 281 VNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVL 340
L Y V A A+A+ R + V+ CF I + + P + L
Sbjct: 121 GTTLAYLADGAYDPFVSAIAAAVSPSVR-SLVSKGSQCF----ITSSSVDSSFPTVTLYF 175
Query: 341 QNKNVVWSIHGQNSMVQ---IGGDAL-CLGFVDGGVNPRTSIVIGARQLENNLLQFDLAT 396
V S+ +N ++Q + L C+G+ T ++G L++ + +DLA
Sbjct: 176 MG-GVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEIT--ILGDLVLKDKIFVYDLAN 232
Query: 397 SRLGFSD 403
R+G++D
Sbjct: 233 MRMGWAD 239
>gi|413952262|gb|AFW84911.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 312
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 108/247 (43%), Gaps = 37/247 (14%)
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA--IVFGDGPYYDLNNFDVSKN 220
V GI G G+ ++++ SQL + + F+ CL D+G +V G+ V
Sbjct: 38 VDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGE---------IVEPG 88
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
L YTPL + +P Y + + S+ VNG+ +P++ +L + N GT + +
Sbjct: 89 LVYTPLVPS----------QP--HYNLNLESIAVNGQKLPIDSSLFTTSN--TQGTIVDS 134
Query: 281 VNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVL 340
L Y V A A+A+ R + V+ CF I + + P + L
Sbjct: 135 GTTLAYLADGAYDPFVSAIAAAVSPSVR-SLVSKGSQCF----ITSSSVDSSFPTVTLYF 189
Query: 341 QNKNVVWSIHGQNSMVQ---IGGDAL-CLGFVDGGVNPRTSIVIGARQLENNLLQFDLAT 396
V S+ +N ++Q + L C+G+ T ++G L++ + +DLA
Sbjct: 190 MG-GVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEIT--ILGDLVLKDKIFVYDLAN 246
Query: 397 SRLGFSD 403
R+G++D
Sbjct: 247 MRMGWAD 253
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 93/403 (23%), Positives = 146/403 (36%), Gaps = 80/403 (19%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAK- 98
T T Y T++ +P + V G +ILWV+C K S + + G L + K
Sbjct: 65 TETGLYFTKLGLGSPPKDYYVQVDTGSDILWVNCVK--CSRCPRKSDLG-IDLTLYDPKG 121
Query: 99 -------ACGGGICGAGPDNPI----------------SNTGTHGDIRIDVLSIQSTDGR 135
+C C A D PI + T G D L+ +
Sbjct: 122 SETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVND- 180
Query: 136 NPGRAVTVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
N A + IF CG S + + GI G G+S ++ SQLAA+ +K+ F+
Sbjct: 181 NLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSH 240
Query: 193 CLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
CL G I F G V + TPL + Y + + S+
Sbjct: 241 CLDNIRGGGI-FAIGEV-------VEPKVSTTPLVPRMAH------------YNVVLKSI 280
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
V+ + L + D+ GT I + L +Y L+ + P++
Sbjct: 281 EVDTDILQLPSDIF--DSGNGKGTIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLVE 338
Query: 313 APFGACFRLQ---DIGFTRIG---------PVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
F +CF+ D GF + V P D + Q K+ +W I Q S+ Q
Sbjct: 339 QQF-SCFQYTGNVDRGFPVVKLHFEDSLSLTVYPH-DYLFQFKDGIWCIGWQKSVAQTK- 395
Query: 361 DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
N + ++G L N L+ +DL +G++D
Sbjct: 396 ------------NGKDMTLLGDLVLSNKLVIYDLENMAIGWTD 426
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 76/324 (23%), Positives = 119/324 (36%), Gaps = 64/324 (19%)
Query: 43 LQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY---------------------VSST 81
L Y T +K TP + + + G ++ WV C+ G +S+T
Sbjct: 104 LHYTT-VKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKISTT 162
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
NK C ++ C N C + T T G + DV+ + +T+ +NP R
Sbjct: 163 NKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHL-TTEDKNPERVE 221
Query: 142 TVPNFIFLCGSEFVLQGLASGVV---GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
F CG V G + G+ GLG K+++PS LA + F++C
Sbjct: 222 AY--VTFGCGQ--VQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDG 277
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
G I FGD S + + TP +N P+ Y I VT V V
Sbjct: 278 VGRISFGDKG---------SSDQEETPFNLNP--------SHPN--YNITVTRVRVGTTL 318
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS-AMPKVARVAPVAPFGA 317
+ T L T +T L +Y + ++F S A K PF
Sbjct: 319 IDDEFTAL-----------FDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEY 367
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQ 341
C+ D+ ++P + L ++
Sbjct: 368 CY---DMSNDANASLIPSLSLTMK 388
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 83/385 (21%), Positives = 139/385 (36%), Gaps = 66/385 (17%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------------KGYVSSTNK 83
Y T + TP + + G ++ WV C+ K S+T++
Sbjct: 143 YYTWVDVGTPNTSFMVALDTGSDLFWVPCDCIECAPLAGYRETLDRDLGIYKPAESTTSR 202
Query: 84 TARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
C C + + C D NT + G + D+L + S + P +A
Sbjct: 203 HLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRESHAPVKA--- 259
Query: 144 PNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+ + CG S L G+A G+ GLG + +++PS LA A ++ F++C D G
Sbjct: 260 -SVVIGCGRKQSGSYLDGIAPD--GLLGLGMADISVPSFLARAGLVRNSFSMCFKE-DSG 315
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
I FGD + ++ + PL+ K T Y + V V K
Sbjct: 316 RIFFGD------QGVSIQQSTPFVPLY-GKYQT-----------YAVNVDKSCVGHKCFE 357
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
+D+ GT +T L ++YKA+ F + A F C+
Sbjct: 358 ATSFEALVDS----GTS------FTALPLNVYKAVAVEFDKQVHAPRITQEDASFEYCYS 407
Query: 321 LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG-GVNPRTSIV 379
+ P VP + L +++ ++V G+ GF +P +
Sbjct: 408 ASPLKM----PDVPTVTLTFAANKSFQAVN--PTIVLKDGEGSVAGFCLALQKSPEPIGI 461
Query: 380 IGARQLENNLLQFDLATSRLGFSDS 404
IG L + FD +LG+ S
Sbjct: 462 IGQNFLTGYHIVFDKENMKLGWYRS 486
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 96/408 (23%), Positives = 145/408 (35%), Gaps = 92/408 (22%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTARCGS 89
YV TP V V L G ++W C S+T + +CGS
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121
Query: 90 AQCNLANAKACGG-GICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
C + C G G CG + P T G D ++I + +GR F
Sbjct: 122 PLCKSIPTRNCSGDGECGY--EAPSMFGDTFGIASTDAIAIGNAEGR----------LAF 169
Query: 149 LC--GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG---AIV 203
C S+ + G G G GLGR+ +L Q S F+ CL+P G A+
Sbjct: 170 GCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQ-----SNVTAFSYCLAPHGPGKKSALF 224
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG-KAVPLN 262
G K+ TPL + S +P Y+ +V + G KA +
Sbjct: 225 LG----ASAKLAGAGKSNPPTPLLGQHASNTSDDGSDP---YY----TVQLEGIKAGDVA 273
Query: 263 KTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR-- 320
S + ++ T P + L + Y+AL + +A+ + P PF CF+
Sbjct: 274 VAAASSGGGAITILQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNA 333
Query: 321 ----LQDIGFTRIGPV---VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV----- 368
+ D+ FT G P +L + N G +CL +
Sbjct: 334 AVSGVPDLVFTFQGGATLTAPPSKYLLGDGN--------------GNGTVCLSILSSTRL 379
Query: 369 ---DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
D GV+ ++G+ EN FDL ++L FE A C+
Sbjct: 380 DSADDGVS-----ILGSLLQENVHFLFDLE------KETLSFEPADCS 416
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 89/395 (22%), Positives = 153/395 (38%), Gaps = 79/395 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y+ + TP ++ + G ++ W+ C + G + S + + CG
Sbjct: 148 EYLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVTCGDD 207
Query: 91 QCNLANAKA-CGGGICGAGPDNPI-------SNTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
+C L + A C +P + T GD+ ++ ++ T
Sbjct: 208 RCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGT----RR 263
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA- 201
V F CG +GL G G+ GLGR ++ SQL + F+ CL A
Sbjct: 264 VDGVAFGCGHRN--RGLFHGAAGLLGLGRGPLSFASQLRGVYG-GHAFSYCLVEHGSAAG 320
Query: 202 --IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
I+FG + L YT A F Y++ + S+ V G+AV
Sbjct: 321 SKIIFGHD-----DALLAHPQLNYTAF--APTTDADTF-------YYLQLKSILVGGEAV 366
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
++ LS GGT I + + Y+A+ QAF + R++P P
Sbjct: 367 NISSDTLS-----AGGTIIDSGTTLSYFPEPAYQAIRQAF------IDRMSPSYPL---- 411
Query: 320 RLQDIGFTRIGPV----------VPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFV 368
+GF + P VP++ LV + W +N +++ + +CL +
Sbjct: 412 ---ILGFPVLSPCYNVSGAEKVEVPELSLVFAD-GAAWEFPAENYFIRLEPEGIMCLAVL 467
Query: 369 DGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
PR+ + +IG Q +N + +DL +RLGF+
Sbjct: 468 G---TPRSGMSIIGNYQQQNFHVLYDLEHNRLGFA 499
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 89/387 (22%), Positives = 152/387 (39%), Gaps = 49/387 (12%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYVSSTNKTARC 87
V++ + +Y+ +I TP V L + ++ W+ C + G V +
Sbjct: 129 VSRAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSY 188
Query: 88 GSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLS-IQSTDGRNPGRAVTVPNF 146
N A+ +A G G T +GD V I+ T G V +P
Sbjct: 189 REMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAGG--VRLPRI 246
Query: 147 IFLCGSEFVLQGL-ASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
CG + +GL + GI GLGR ++ P+Q+ F+ CL F G
Sbjct: 247 SIGCGHDN--KGLFGAPAAGILGLGRGLMSFPNQI----DHNGTFSYCLVDFLSGPGSLS 300
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP-LNKT 264
+ D S + +TP +N L P+ Y++ +T + V G VP + +
Sbjct: 301 STLTFGAGAVDTSPPVSFTPTVLN--------LNMPTF-YYVRLTGISVGGVRVPGVTER 351
Query: 265 LLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA---PVAPFGACFR 320
L +D G GG + + T L Y A AF + + +V+ P F C+
Sbjct: 352 DLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYT 411
Query: 321 LQDIGFTRIGPVVP-----QIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPR 375
+ G ++ P V +++ LQ KN + + +SM +C F G +
Sbjct: 412 VGGRGMKKV-PTVSMHFAGSVEVKLQPKNYLIPV---DSM-----GTVCFAFAATGDH-- 460
Query: 376 TSIVIGARQLENNLLQFDLATSRLGFS 402
+ +IG Q + + +D+ R+GF+
Sbjct: 461 SVSIIGNIQQQGFRIVYDIG-GRVGFA 486
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 91/395 (23%), Positives = 145/395 (36%), Gaps = 86/395 (21%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------KGYVSSTNKT-ARCGSAQCNLAN 96
Y T + TP + + G ++ W+ C+ GY S ++ A+ +
Sbjct: 208 YYTWVDVGTPNTSFMVALDTGSDLFWIPCDCIECAPLSGYHGSLDRDLGIYKPAESTTSR 267
Query: 97 AKACGGGICGAGPD-----NPI--------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
C +C G D P NT + G + D+L + S + P +A
Sbjct: 268 HLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAPVKA--- 324
Query: 144 PNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+ I CG S L G+A G+ GLG + +++PS LA A ++ F++C + D G
Sbjct: 325 -SVIIGCGRKQSGSYLDGIAPD--GLLGLGMADISVPSFLARAGLVRNSFSMCFTK-DSG 380
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
I FGD ++ + PL+ K+ T Y + V V K
Sbjct: 381 RIFFGD------QGVSTQQSTPFVPLY-GKLQT-----------YTVNVDKSCVGHKCFE 422
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF-----ASAMPKVARVAPVAPF 315
+D+ GT +T L IYKA+ F AS +P+ A F
Sbjct: 423 STSFQAIVDS----GTS------FTALPLDIYKAVAIEFDKQVNASRLPQEA-----TSF 467
Query: 316 GACFRLQDIGFTRIGPVVPQIDLV------LQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
C+ + + P VP + L Q N + +H + V CL V
Sbjct: 468 DYCYSASPL----VMPDVPTVTLTFAGNKSFQPVNPTFLLHDEEGAVA----GFCLAVVQ 519
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
+P +I L + FD +LG+ S
Sbjct: 520 ---SPEPIGIIAQNFLLGYHVVFDRENMKLGWYRS 551
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 85/395 (21%), Positives = 148/395 (37%), Gaps = 82/395 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------KG----YVSSTNKTARCGSAQC 92
QY T I P P L V G ++ W+ C+ KG Y + K + C
Sbjct: 190 QYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPPRDSLC 249
Query: 93 N--------LANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
K C I A + + G + D + + +T+G GR
Sbjct: 250 QELQGDQNYCETCKQCDYEIEYA------DRSSSMGVLAKDDMHLIATNG---GREKL-- 298
Query: 145 NFIFLCGSEFVLQGLAS--GVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA- 201
+F+F C + Q L+S GI GL + ++LPSQLA+ + F C++ +G
Sbjct: 299 DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRETNGGG 358
Query: 202 -IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
+ GD Y P + T + G P Y V+ + +
Sbjct: 359 YMFLGD---------------DYVPRW---GMTWAPIRGGPDNLYHTEAQKVNYGDQELH 400
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC-- 318
++ I + G + YT L +YK L+ A P + + C
Sbjct: 401 AGNSVQVIFDSG---------SSYTYLPEEMYKNLIDAIKEDSPSFVQDSSDTTLPLCWK 451
Query: 319 --------FRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG 370
F+ ++ F R VVP+ ++ + ++ S G +CLG ++G
Sbjct: 452 ADFSVRSFFKPLNLHFGRRWFVVPKTFTIVPDDYLIISDKGN----------VCLGLLNG 501
Query: 371 -GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
+N ++I++G L L+ +D ++G+++S
Sbjct: 502 TEINHGSTIIVGDVSLRGKLVVYDNERRQIGWANS 536
>gi|383165471|gb|AFG65613.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 54/110 (49%), Gaps = 12/110 (10%)
Query: 99 ACGGGICGAGPDNPISNTG------THGD--IRIDVLSIQS-TDGRNPGRAVTVPNFIFL 149
+C +C A PD + T+GD I + +LS ++ T G +PNF F
Sbjct: 19 SCKSLLCNALPDFECKSAAGCEYQYTYGDFSITVGILSYETLTLTSKSGAEQLIPNFAFG 78
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
CG G G GI GLGR ++L SQL+A S+ +KF+ CL DD
Sbjct: 79 CGQNNEGNGFDQGA-GIVGLGRGPLSLISQLSA--SMPKKFSYCLMTIDD 125
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 74/302 (24%), Positives = 114/302 (37%), Gaps = 52/302 (17%)
Query: 28 RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------- 74
R +A D L Y + TP V + + G ++ WV C+
Sbjct: 60 RRRAATRDRLNDFGFLHYAV-VALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYG 118
Query: 75 -------KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVL 127
S+T++ C S C+L NA C NT + G + DVL
Sbjct: 119 SLKFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVL 178
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV-GIAGLGRSKVALPSQLAAAFSL 186
+ S ++ + VT P +F CG L S G+ GLG ++PS LA+
Sbjct: 179 YLTSDSAQS--KIVTAP-IMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLA 235
Query: 187 KRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
F++C G I FGD S + K TPL + K N Y
Sbjct: 236 ANSFSMCFGDDGHGRINFGD---------TGSSDQKETPLNVYKQNP----------YYN 276
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQA----FASA 302
I +T + V K++ + +I + G T +S +P TS + A +++ S+
Sbjct: 277 ITITGITVGSKSI--STEFSAIVDSGTSFTALS--DPMYTQITSSFDAQIRSSRNMLDSS 332
Query: 303 MP 304
MP
Sbjct: 333 MP 334
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 94/443 (21%), Positives = 159/443 (35%), Gaps = 86/443 (19%)
Query: 7 FLLLCSLLIFSLSPITAQTSFRPKALVLRVAKDTATLQ--------YVTQIKQRTPLVPV 58
FLLL S+ S S P ++ ++ T++ Y I P P
Sbjct: 16 FLLLSSIFPHHFSAANKNNSIPPTSIHSLISSLVYTIKGNVYPDGLYTVSINIGNPPKPY 75
Query: 59 KLTVHLGGNILWVDCE--------------KGYVSSTNKTARCGSAQCNLANAKACGGGI 104
+L + G ++ WV C+ K Y + + +C C + G I
Sbjct: 76 ELDIDTGSDLTWVQCDGPDAPCKGCTMPKDKLYKPNGKQVVKCSDPICVATQSTHVLGQI 135
Query: 105 CGAGPDNPISN------TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQG 158
C + N T G + D + I S P + P F CG E G
Sbjct: 136 CSKQSPPCVYNVQYADHASTLGVLVRDYMHIGS-----PSSSTKDPLVAFGCGYEQKFSG 190
Query: 159 LA---SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNF 215
S GI GLG K ++ SQL + + CLS G + GD F
Sbjct: 191 PTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGHCLSAEGGGYLFLGD-------KF 243
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
S + +TP+ + + Y G + NGK P K L I + G
Sbjct: 244 VPSSGIVWTPIIQSSLEK----------HYNTGPVDLFFNGKPTPA-KGLQIIFDSGSSY 292
Query: 276 TKISTVNPYTVLETSIY-----KALVQAFASAMPKVARVAPVAPFGAC------FRLQDI 324
T S+ YT++ + K L + ++P + V PF + F+ +
Sbjct: 293 TYFSSPV-YTIVANMVNNDLKGKPLSRVKDPSLPICWKG--VKPFKSLNEVNNYFKPLTL 349
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDG---GVNPRTSIVIG 381
FT+ +KN+ + + ++ +CLG ++G G+ R V+G
Sbjct: 350 SFTK-------------SKNLQFQLPPVAYLIITKYGNVCLGILNGNEAGLGNRN--VVG 394
Query: 382 ARQLENNLLQFDLATSRLGFSDS 404
L++ ++ +D ++G++ +
Sbjct: 395 DISLQDKVVVYDNEKQQIGWASA 417
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 79/396 (19%), Positives = 139/396 (35%), Gaps = 65/396 (16%)
Query: 36 VAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--------GYVSSTNKTAR- 86
VA + YV + +P P+ L + + W C + N T+
Sbjct: 68 VASGQSPPSYVVRAGLGSPAQPILLALDTSADATWAHCSPCGTCPSSGSLFAPANSTSYA 127
Query: 87 ---CGSAQCNLANAKACGG----------GICGAGPDNPISNTGTHGDIRIDVLSIQSTD 133
C S C + + C +C P ++ + D L +
Sbjct: 128 PLPCSSTMCTVLQGQPCPAQDPYDSSAPLPMCAF--TKPFADASFQASLASDWLHL---- 181
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
G++ +PN+ F C S G+ GLGR +AL SQ+ + F+ C
Sbjct: 182 GKD-----AIPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMY--NGVFSYC 234
Query: 194 LSPFDDGAIVFGDGPYY-----DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIG 248
L + YY L + ++YTP+ N S Y++
Sbjct: 235 LPSYKS---------YYFSGSLRLGAAGQPRGVRYTPMLKNPNR---------SSLYYVN 276
Query: 249 VTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
VT + V V + + D GT + + T +Y AL + F + +
Sbjct: 277 VTGLSVGRAPVKVPAGSFAFDPATGAGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSG 336
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDAL-CLGF 367
+ F CF ++ + P V V + + ++ +N+++ L CL
Sbjct: 337 YTSLGAFDTCFNTDEVA-AGVAPAV----TVHMDGGLDLALPMENTLIHSSATPLACLAM 391
Query: 368 VDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFS 402
+ N + V+ Q +N + FD+A SR+GF+
Sbjct: 392 AEAPQNVNAVVNVLANLQQQNLRVVFDVANSRVGFA 427
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 92/401 (22%), Positives = 155/401 (38%), Gaps = 65/401 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYVSSTNKTARCGSAQCNLA 95
++ I TP + V G ++ WV C E G + K++ S C+
Sbjct: 84 EFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSR 143
Query: 96 NAKACGGGICGAGPDNPI--------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFI 147
N A G + + + GD+ + +SI S G V+ P +
Sbjct: 144 NCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSAS----GSPVSFPGTV 199
Query: 148 FLCGSEFVLQG--LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL---SPFDDGAI 202
F CG G GI GLG ++L SQL + S+ +KF+ CL S +G
Sbjct: 200 FGCGYN---NGGTFDETGSGIIGLGGGHLSLISQLGS--SISKKFSYCLSHKSATTNGTS 254
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLN 262
V G ++ + TPL ++K EP Y++ + ++ V K +P
Sbjct: 255 VINLGTNSIPSSLSKDSGVISTPL-VDK---------EPRTYYYLTLEAISVGKKKIPYT 304
Query: 263 KTLLSIDNEGV-----GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA-PVAPFG 316
+ + ++ G+ G I + T+L++ + A + RV+ P
Sbjct: 305 GSSYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLS 364
Query: 317 ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT 376
CF+ G IG +P+I + +V S N+ V++ D +CL V P T
Sbjct: 365 HCFK---SGSAEIG--LPEITVHFTGADVRLS--PINAFVKVSEDMVCLSMV-----PTT 412
Query: 377 SIVI-GARQLENNLLQFDLATSRLGFSDSLLFERATCTFNF 416
+ I G + L+ +DL T + F+R C+ N
Sbjct: 413 EVAIYGNFAQMDFLVGYDLETRTVS------FQRMDCSANL 447
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 96/400 (24%), Positives = 158/400 (39%), Gaps = 76/400 (19%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDC-------EKGYV-----------SST 81
T T Y T+I TP + V G +ILWV+C K + S +
Sbjct: 85 TETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQS 144
Query: 82 NKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQ--ST 132
+ C C +AN GG + +P + ++GD D L S
Sbjct: 145 GELVTCDQQFC-VANY---GGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSG 200
Query: 133 DGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV--GIAGLGRSKVALPSQLAAAFSLKRKF 190
DG+ +V F CG++ +S + GI G G+S ++ SQLAAA +++ F
Sbjct: 201 DGQTTPANASVS---FGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMF 257
Query: 191 ALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV-EYFIGV 249
A CL + G I + + N V +K TPL P + Y + +
Sbjct: 258 AHCLDTVNGGGI-------FAIGNV-VQPKVKTTPLV-------------PDMPHYNVIL 296
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
+ V G A+ L + N GT I + + +YKAL ++ V
Sbjct: 297 KGIDVGGTALGLPTNIFDSGNS--KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDIS-V 353
Query: 310 APVAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLG 366
+ F +CF+ D GF P++ + +V + + + Q G + C+G
Sbjct: 354 QTLQDF-SCFQYSGSVDDGF-------PEVTFHFEG-DVSLIVSPHDYLFQNGKNLYCMG 404
Query: 367 FVDGG---VNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
F +GG + + ++G L N L+ +DL +G++D
Sbjct: 405 FQNGGGKTKDGKDLGLLGDLVLSNKLVLYDLENQAIGWAD 444
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 85/384 (22%), Positives = 142/384 (36%), Gaps = 58/384 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC----------EKGYVSSTNKTARCGSAQCN 93
+Y+ + TP + G +++W C + + + + G CN
Sbjct: 91 EYLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCN 150
Query: 94 LANAKACGGGICGAGPD-------NPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
++ C G + G P N TG ++ S T G VP
Sbjct: 151 -SSLSMCAGVLAGKAPPPGCACMYNQTYGTGWTAGVQG---SETFTFGSAAADQARVPGI 206
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--GAIVF 204
F C + +G G+ GLGR ++L SQL A +F+ CL+PF D
Sbjct: 207 AFGCSNASSSDW--NGSAGLVGLGRGSLSLVSQLGAG-----RFSYCLTPFQDTNSTSTL 259
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
GP LN V + TP AS S Y++ +T + + KA+ ++
Sbjct: 260 LLGPSAALNGTGV----RSTPFV------ASPAKAPMSTYYYLNLTGISLGAKALSISPD 309
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIY---KALVQAFASAMPKVARVAPVAPFGACFRL 321
S+ +G GG I + T L + Y +A VQ+ + +P + C+ L
Sbjct: 310 AFSLKADGTGGLIIDSGTTITSLVNAAYQQVRAAVQSLVT-LPAIDGSDSTG-LDLCYAL 367
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGF---VDGGVNPRTSI 378
T P +P + L ++V + + M+ G CL DG ++
Sbjct: 368 PTP--TSAPPAMPSMTLHFDGADMV--LPADSYMIS-GSGVWCLAMRNQTDGAMS----- 417
Query: 379 VIGARQLENNLLQFDLATSRLGFS 402
G Q +N + +D+ L F+
Sbjct: 418 TFGNYQQQNMHILYDVRNEMLSFA 441
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 85/372 (22%), Positives = 138/372 (37%), Gaps = 60/372 (16%)
Query: 62 VHLGGNILWVDC--------EKG-------YVSSTNKTAR---CGSAQCNLANAKACGGG 103
V G + LWV+C + G Y +++KT++ C C G
Sbjct: 92 VDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCDDEFC----TSTYDGP 147
Query: 104 ICGAGPDNPISNTGTHGDIRIDVLSIQSTD---GRNPGRAVTVPN---FIFLCGSE---F 154
I G D + T+GD S D R G TVP+ IF CGS+
Sbjct: 148 ISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGT 207
Query: 155 VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNN 214
+ + + GI G G++ ++ SQLAAA +KR F+ CL + G I F G
Sbjct: 208 LSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGGGI-FAIGEV----- 261
Query: 215 FDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVG 274
V +K TPL + Y + + + V G + L + D+
Sbjct: 262 --VQPKVKTTPLVPRMAH------------YNVVLKDIEVAGDPIQLPTDIF--DSTSGR 305
Query: 275 GTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVP 334
GT I + L SIY L++ + + F CF D + P
Sbjct: 306 GTIIDSGTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQF-TCFHYSDE--KSLDDAFP 362
Query: 335 QIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS---IVIGARQLENNLLQ 391
+ + + + + + + + D C+G+ + I++G L N L
Sbjct: 363 TVKFTFE-EGLTLTAYPHDYLFPFKEDMWCIGWQKSTAQTKDGKDLILLGDLVLTNKLFI 421
Query: 392 FDLATSRLGFSD 403
+DL +G++D
Sbjct: 422 YDLDNMSIGWTD 433
>gi|383165464|gb|AFG65606.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165470|gb|AFG65612.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 54/110 (49%), Gaps = 12/110 (10%)
Query: 99 ACGGGICGAGPDNPISNTG------THGD--IRIDVLSIQS-TDGRNPGRAVTVPNFIFL 149
+C +C A PD +T T+GD I + +LS ++ T G +P F F
Sbjct: 19 SCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETLTLTSKSGAEQLIPKFAFG 78
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
CG G G GI GLGR ++L SQL+A S+ +KF+ CL DD
Sbjct: 79 CGQNNEGNGFDQGA-GIVGLGRGPLSLISQLSA--SMPKKFSYCLMTIDD 125
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 70/314 (22%), Positives = 113/314 (35%), Gaps = 60/314 (19%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAK------ 98
+ T ++ TP V + + G ++ WV C+ ++T+ +A N+ N
Sbjct: 96 HYTTVQIGTPGVKFMVALDTGSDLFWVPCDCTRCAATDSSAFASDFDLNVYNPNGSSTSK 155
Query: 99 --ACGGGIC-------GAGPDNPI------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
C +C G + P + T T G + DVL + D + +
Sbjct: 156 KVTCNNSLCMHRSQCLGTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHD---LVE 212
Query: 144 PNFIFLCGSEFVLQGLASGVV---GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
N IF CG + G V G+ GLG K+++PS L+ F++C G
Sbjct: 213 ANVIFGCGQ--IQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIG 270
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
I FGD +D + TP +N + Y I VT V V +
Sbjct: 271 RISFGDKGSFDQDE---------TPFNLNPSHPT----------YNITVTQVRVGTTLID 311
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM-PKVARVAPVAPFGACF 319
+ T L + +T L Y L ++F S + + R PF C+
Sbjct: 312 VEFTAL-----------FDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCY 360
Query: 320 RLQDIGFTRIGPVV 333
+ T + P V
Sbjct: 361 DMSPDANTSLIPSV 374
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 86/386 (22%), Positives = 144/386 (37%), Gaps = 73/386 (18%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQC--------- 92
+L+Y+ + TP VP L + G ++ WV C A C S +C
Sbjct: 122 SLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQC-----------APCNSTECYPQKDPLFD 170
Query: 93 ----NLANAKACGGGIC---GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR------ 139
+ ACG C G N ++ GT R++ ST G
Sbjct: 171 PSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFAP 230
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
+TV +F F CG + +G + G+ GLG + +L Q A+ + F+ CL +
Sbjct: 231 GITVKDFHFGCGHD--QRGPSDKFDGLLGLGGAPESLVVQTASVYG--GAFSYCLPALNS 286
Query: 200 --GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G + G P N + +TP++ ++ S Y + +T + V GK
Sbjct: 287 EAGFLALGVRPSAATN----TSAFVFTPMWHLPMDATS---------YMVNMTGISVGGK 333
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+ + ++ GG I + T L + Y AL A A VA F
Sbjct: 334 PLDIPRSAFR------GGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMVA-SEDFDT 386
Query: 318 CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS 377
C+ G++ + VP++ L + +V+ CL F + G +
Sbjct: 387 CYNF--TGYSNV--TVPRVALTFSGGATIDLDVPNGILVKD-----CLAFRESGPDVGLG 437
Query: 378 IV--IGARQLENNLLQFDLATSRLGF 401
I+ + R LE + +D ++GF
Sbjct: 438 IIGNVNQRTLE---VLYDAGHGKVGF 460
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 114/274 (41%), Gaps = 42/274 (15%)
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+V F+F CG +GL G G+ GLGR++++L SQ A F F+ CL
Sbjct: 279 ASVDGFVFGCG--LSNRGLFGGTAGLMGLGRTELSLVSQTAPRFG--GVFSYCLP----- 329
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKA 258
A GD D S TP+ ++ + +P+ YF+ VT V G A
Sbjct: 330 AATSGDAAGSLSLGGDTSSYRNATPVSYTRM------IADPAQPPFYFMNVTGASVGGAA 383
Query: 259 VPLNKTLLS---IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP- 314
V + +D+ GT I+ + P S+Y+A+ FA A P AP
Sbjct: 384 VAAAGLGAANVLLDS----GTVITRLAP------SVYRAVRAEFARQFG--AERYPAAPP 431
Query: 315 ---FGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDG 370
AC+ L G + VP + L L+ ++ G M + G +CL
Sbjct: 432 FSLLDACYNL--TGHDEVK--VPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASL 487
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
+T I IG Q +N + +D SRLGF+D
Sbjct: 488 SFEDQTPI-IGNYQQKNKRVVYDTVGSRLGFADE 520
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 99/419 (23%), Positives = 167/419 (39%), Gaps = 87/419 (20%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------------SSTNKT 84
Y + TP P+ + + G ++ WV C Y SS+++
Sbjct: 89 YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148
Query: 85 ARCGSAQC-------NLANAKA---CGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
C + C +L++ +A C G C P N +N + V ST G
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANC--TPRN--ANANNVCPPYLVVYGSGSTAG 204
Query: 135 -------RNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK 187
R PGRAV NF+ C V Q + G+AG GR ++PSQL
Sbjct: 205 LLISDTLRTPGRAVR--NFVIGCSLASVHQPPS----GLAGFGRGAPSVPSQLGLT---- 254
Query: 188 RKFALCL--SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEY 245
KF+ CL FDD A V G+ D ++Y PL +AS SV Y
Sbjct: 255 -KFSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPL----ARSASARPPY-SVYY 308
Query: 246 FIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPK 305
++ +T++ V GK+V L + + GG + + ++ + ++++ + A +A+
Sbjct: 309 YLALTAITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGG 367
Query: 306 VARVAPVAPFG----ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG-- 359
+ V G CF + T +P++ L + +V+ ++ +N V G
Sbjct: 368 RYSRSKVVEEGLGLSPCFAMPPGTKTM---ELPEMSLHFKGGSVM-NLPVENYFVVAGPA 423
Query: 360 --------GDALCLGFVDG---------GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+A+CL V + +I++G+ Q +N +++DL RLGF
Sbjct: 424 PSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGF 482
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 114/274 (41%), Gaps = 42/274 (15%)
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+V F+F CG +GL G G+ GLGR++++L SQ A F F+ CL
Sbjct: 278 ASVDGFVFGCG--LSNRGLFGGTAGLMGLGRTELSLVSQTAPRFG--GVFSYCLP----- 328
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKA 258
A GD D S TP+ ++ + +P+ YF+ VT V G A
Sbjct: 329 AATSGDAAGSLSLGGDTSSYRNATPVSYTRM------IADPAQPPFYFMNVTGASVGGAA 382
Query: 259 VPLNKTLLS---IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP- 314
V + +D+ GT I+ + P S+Y+A+ FA A P AP
Sbjct: 383 VAAAGLGAANVLLDS----GTVITRLAP------SVYRAVRAEFARQFG--AERYPAAPP 430
Query: 315 ---FGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDG 370
AC+ L G + VP + L L+ ++ G M + G +CL
Sbjct: 431 FSLLDACYNL--TGHDEVK--VPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASL 486
Query: 371 GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
+T I IG Q +N + +D SRLGF+D
Sbjct: 487 SFEDQTPI-IGNYQQKNKRVVYDTVGSRLGFADE 519
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 84/363 (23%), Positives = 128/363 (35%), Gaps = 64/363 (17%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSS--TNKTARCGSAQCNLANAKA 99
TL YV + TP V L V G ++ WV C + + K AQ + A
Sbjct: 137 TLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSSYAAVP 196
Query: 100 CGGGICG----------AGPDNPISNTG----THGDIRIDVLSIQSTDGRNPGRAVTVPN 145
CGG +CG A + + G T G D L++ D V
Sbjct: 197 CGGPVCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPND--------AVRG 248
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
F F CG Q +G G+ GLGR + +L Q A + + L P G + G
Sbjct: 249 FFFGCGHA---QSGFTGNDGLLGLGREEASLVEQTAGTYGGVFSYCLPTRPSTTGYLTLG 305
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
F ++ ++ N A+ Y + +T + V G+ + + ++
Sbjct: 306 GPSGAAPPGFSTTQ-------LLSSPNAAT--------YYVVMLTGISVGGQQLSVPSSV 350
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP----FGACFRL 321
+ GGT + T T L + Y AL AF S M P AP C+
Sbjct: 351 FA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGY--PSAPATGILDTCYNF 402
Query: 322 QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIG 381
G +P + L V G + ++ G CL F G + +I+
Sbjct: 403 SGYGTV----TLPNVALTFSGGATV--TLGADGILSFG----CLAFAPSGSDGGMAILGN 452
Query: 382 ARQ 384
+Q
Sbjct: 453 VQQ 455
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 92/375 (24%), Positives = 135/375 (36%), Gaps = 59/375 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV--------------SSTNKTARCGS 89
YV + TP + G ++ W+ C+ V SST + C
Sbjct: 15 NYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVSCTE 74
Query: 90 AQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
C + + C C G + T G + +D + A NFIF
Sbjct: 75 PACVGLSTRGCSSSTCLYGVFYG-DGSSTIGFLAMDTFMLTP--------AQKFKNFIFG 125
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKV-ALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGP 208
CG GL G G+ GLGRS +L SQ+A SL F+ CL P A +
Sbjct: 126 CGQNNT--GLFQGTAGLVGLGRSSTYSLNSQVAP--SLGNVFSYCL-PSTSSATGY---- 176
Query: 209 YYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLS 267
LN + YT + + +V T YFI + + V G + L+ T+
Sbjct: 177 ---LNIGNPQNTPGYTAMLTDTRVPTL----------YFIDLIGISVGGTRLSLSSTVF- 222
Query: 268 IDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFT 327
+ VG T I + T L + Y AL A +AM + V C+ F+
Sbjct: 223 ---QSVG-TIIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYD-----FS 273
Query: 328 RIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLEN 387
R VV + +VL + I +CL F G + +IG Q
Sbjct: 274 RTTSVVYPV-IVLHFAGLDVRIPATGVFFVFNSSQVCLAFA-GNTDSTMIGIIGNVQQLT 331
Query: 388 NLLQFDLATSRLGFS 402
+ +D R+GFS
Sbjct: 332 MEVTYDNELKRIGFS 346
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 87/387 (22%), Positives = 143/387 (36%), Gaps = 79/387 (20%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
Y T+I TP L V G + +V C + SST + +C S +
Sbjct: 92 YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-SME 150
Query: 92 CNLANAKACGGGICGAGPDNPISN-TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C C + D + + + G + D++S P R V F C
Sbjct: 151 CT------CDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTV------FGC 198
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGP 208
+ + GI GLGR +++ QL + F+LC D GA+V G
Sbjct: 199 ENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLG--- 255
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFL---GEP--SVEYFIGVTSVHVNGKAVPLNK 263
++ +G + +P S Y I + +H+ GK +P+N
Sbjct: 256 ---------------------GISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINP 294
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
+ +G GT + + Y L +KA A + + + G D
Sbjct: 295 MVF----DGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQ-----GPDRNYND 345
Query: 324 IGFTRIGPVV-------PQIDLVLQNKNVVWSIHGQNSMVQI--GGDALCLGFVDGGVNP 374
I F+ +G V P +DLV N N + S+ +N + Q A CLG N
Sbjct: 346 ICFSGVGSDVSQLSKTFPAVDLVFSNGNRL-SLSPENYLFQHSKAHGAYCLGIFQNE-ND 403
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGF 401
+T++ +G + N L+ +D ++GF
Sbjct: 404 QTTL-LGGIIVRNTLVMYDREHLKIGF 429
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 87/398 (21%), Positives = 152/398 (38%), Gaps = 78/398 (19%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--------------------GYVSST 81
T+ Y T++K +P + + G +ILWV C G +++
Sbjct: 102 TMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAG 161
Query: 82 NKTAR---CGS------AQCNLANAKACGGGI-CGAGPDNPISNTGTHGDIRIDVLSIQS 131
+ T C S AQC+ N CG G G +GT G D +
Sbjct: 162 SVTCSDPICSSVFQTTAAQCSENNQ--CGYSFRYGDG-------SGTSGYYMTDTFYFDA 212
Query: 132 TDGRNPGRAVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRK 189
G + + P +F C + L V GI G G+ K+++ SQL++
Sbjct: 213 ILGESLVANSSAP-IVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPV 271
Query: 190 FALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
F+ CL G VF G + + Y+PL ++ + Y + +
Sbjct: 272 FSHCLKGDGSGGGVFVLGEI-------LVPGMVYSPLVPSQPH------------YNLNL 312
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
S+ VNG+ +PL+ + N GT + T T L Y + A ++++ ++ V
Sbjct: 313 LSIGVNGQMLPLDAAVFEASN--TRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQL--V 368
Query: 310 APVAPFGA-CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI----GGDALC 364
P+ G C+ + T I + P + L + + Q+ + G C
Sbjct: 369 TPIISNGEQCYLVS----TSISDMFPSVSLNFAGGASMM-LRPQDYLFHYGIYDGASMWC 423
Query: 365 LGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+GF P ++G L++ + +DLA R+G++
Sbjct: 424 IGFQKA---PEEQTILGDLVLKDKVFVYDLARQRIGWA 458
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 118/278 (42%), Gaps = 34/278 (12%)
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
G TV + F CG++ + G G+ G+GR ++L SQL KF+ C +PF
Sbjct: 211 GAGTTVHDLAFGCGTDNL--GGTDNSSGLVGMGRGPLSLVSQLGVT-----KFSYCFTPF 263
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
+D P + ++ +S K TP V + SG S Y++ + + V
Sbjct: 264 NDTTT---SSPLFLGSSASLSPAAKSTPF----VPSPSG--PRRSSYYYLSLEGITVGDT 314
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+P++ + + G GG I + +T LE + L +A A+ +
Sbjct: 315 LLPIDPAVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSV 374
Query: 318 CFRL-QDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ--IGGDALCLGFVDGGVNP 374
CF Q G + VP+ LVL + +++V+ + G A CLG V+
Sbjct: 375 CFAAPQGRGPEAVD--VPR--LVLHFDGADMELPRSSAVVEDRVAGVA-CLGI----VSA 425
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATC 412
R V+G+ Q +N +++D+ D L FE A C
Sbjct: 426 RGMSVLGSMQQQNMHVRYDVG------RDVLSFEPANC 457
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 98/245 (40%), Gaps = 38/245 (15%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYT 224
GI G G+S ++ SQLAA +K+ FA CL + G I F G V + T
Sbjct: 229 GILGFGKSNSSMISQLAATRKVKKIFAHCLDGINGGGI-FAIGHV-------VQPKVNMT 280
Query: 225 PLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPY 284
PL N+ + Y + +T+V V + L + G I +
Sbjct: 281 PLIPNQPH------------YNVNMTAVQVGEDFLHLPTEEFEAGDR--KGAIIDSGTTL 326
Query: 285 TVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQ 341
L +Y+ LV S P + +V V CF+ D GF P + +
Sbjct: 327 AYLPEIVYEPLVSKIISQQPDL-KVHIVRDEYTCFQYSGSVDDGF-------PNVTFHFE 378
Query: 342 NKNVVWSIHGQNSMVQIGGDALCLGFVDGGV---NPRTSIVIGARQLENNLLQFDLATSR 398
N +V +H + G C+G+ + G+ + R ++G L N L+ +DL
Sbjct: 379 N-SVFLKVHPHEYLFPFEG-LWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYDLENQA 436
Query: 399 LGFSD 403
+G+++
Sbjct: 437 IGWTE 441
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 85/377 (22%), Positives = 140/377 (37%), Gaps = 58/377 (15%)
Query: 41 ATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARC 87
+TL+YV + +P V +++ G ++ WV C+ SST C
Sbjct: 127 STLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSC 186
Query: 88 GSAQCNLANAKACGGGICGAGPDNPIS---NTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
SA C + G G + +S + T G D L++ S +
Sbjct: 187 SSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTLGSN---------AIK 237
Query: 145 NFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVF 204
F F C S+ G + G+ GLG +L SQ A F + F+ CL P
Sbjct: 238 GFQFGC-SQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFG--KAFSYCLPP------TP 288
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
G + L S +K L ++ T G L E ++ V G+ + + +
Sbjct: 289 GSSGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLE----------AIRVGGQQLNIPTS 338
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI 324
+ S + GT I+ + P + Y AL AF + M K P CF
Sbjct: 339 VFSAGSVMDSGTVITRLPP------TAYSALSSAFKAGMKKYPPAQPSGILDTCFDFS-- 390
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQ 384
G + + +P + LV VV ++ M+++ D CL F + + +Q
Sbjct: 391 GQSSVS--IPSVALVFSGGAVV-NLDFNGIMLEL--DNWCLAFAANSDDSSLGFIGNVQQ 445
Query: 385 LENNLLQFDLATSRLGF 401
+L +D+ +GF
Sbjct: 446 RTFEVL-YDVGGGAVGF 461
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 88/395 (22%), Positives = 144/395 (36%), Gaps = 69/395 (17%)
Query: 39 DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE----------------------KG 76
D L Y T I TP V + + G ++ WV C+ +
Sbjct: 97 DLDWLHY-TWIDIGTPNVSFLVALDAGSDLSWVPCDCIQCAPLSASLYKPLDRDLSEYRP 155
Query: 77 YVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQS-TDGR 135
+S+T++ C C L + C D NT + G + D+L + S +D
Sbjct: 156 SLSTTSRHLSCNHQLCELGSHCKNLKDPCPYIADYADPNTSSSGFLVEDILHLASVSDDS 215
Query: 136 NPGRAVTVPNFIFLCGSEFV---LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
N + + I CG + L G A G+ GLG +++PS LA A +++ F+L
Sbjct: 216 NSTQKRVQASVILGCGRKQTGGYLDGAAPD--GVMGLGPGSISVPSLLAKAGLIRKSFSL 273
Query: 193 CLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
C G I+FGD + + K TPL + N + Y I V S
Sbjct: 274 CFDVNGSGTILFGDQGH---------TSQKSTPLLPTQGNYDA---------YLIEVESY 315
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
V G + ++ + G +T L +Y +V F + +
Sbjct: 316 CV-GNSCLKQSGFKALVDSGAS---------FTYLPIDVYNKIVLEFDKQVNAQRISSQG 365
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDL-VLQNKNVVWSIHGQNSMVQIGGD--ALCLGFVD 369
P+ C+ VP + L L N++++ IH V + CL
Sbjct: 366 GPWNYCYNTSSKQLDN----VPAMRLSFLMNQSLL--IHNSTYYVPQNQEFAVFCLTLQP 419
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
+N +IG + + FD+ +LG+S S
Sbjct: 420 TDLNYG---IIGQNYMTGYRVVFDMENLKLGWSSS 451
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 69/277 (24%), Positives = 107/277 (38%), Gaps = 51/277 (18%)
Query: 53 TPLVPVKLTVHLGGNILWVDCE--------------------KGYVSSTNKTARCGSAQC 92
TP V + + G ++ WV C+ S+T++ C S C
Sbjct: 70 TPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCSSNLC 129
Query: 93 NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGS 152
+L NA C NT + G + DVL + S ++ + VT P +F CG
Sbjct: 130 DLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQS--KIVTAP-IMFGCGQ 186
Query: 153 EFVLQGLASGVV-GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYD 211
L S G+ GLG ++PS LA+ F++C G I FGD
Sbjct: 187 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGD----- 241
Query: 212 LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE 271
S + K TPL + K N Y I +T + V K++ + +I +
Sbjct: 242 ----TGSSDQKETPLNVYKQNP----------YYNITITGITVGSKSI--STEFSAIVDS 285
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQA----FASAMP 304
G T +S +P TS + A +++ S+MP
Sbjct: 286 GTSFTALS--DPMYTQITSSFDAQIRSSRNMLDSSMP 320
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 69/277 (24%), Positives = 107/277 (38%), Gaps = 51/277 (18%)
Query: 53 TPLVPVKLTVHLGGNILWVDCE--------------------KGYVSSTNKTARCGSAQC 92
TP V + + G ++ WV C+ S+T++ C S C
Sbjct: 107 TPNVTFLVALDTGSDLFWVPCDCLKCAPLQSPNYGSLKFDVYSPAQSTTSRKVPCSSNLC 166
Query: 93 NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGS 152
+L NA C NT + G + DVL + S ++ + VT P +F CG
Sbjct: 167 DLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQS--KIVTAP-IMFGCGQ 223
Query: 153 EFVLQGLASGVV-GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYD 211
L S G+ GLG ++PS LA+ F++C G I FGD
Sbjct: 224 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGD----- 278
Query: 212 LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE 271
S + K TPL + K N Y I +T + V K++ + +I +
Sbjct: 279 ----TGSSDQKETPLNVYKQNP----------YYNITITGITVGSKSI--STEFSAIVDS 322
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQA----FASAMP 304
G T +S +P TS + A +++ S+MP
Sbjct: 323 GTSFTALS--DPMYTQITSSFDAQIRSSRNMLDSSMP 357
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 63/280 (22%), Positives = 93/280 (33%), Gaps = 59/280 (21%)
Query: 56 VPVKLTVHLGGNILWVDC---------------EKGYVSSTNKTAR--CGSAQCNLANAK 98
PV L + G +++W C G + + R C S C+ A+A
Sbjct: 103 APVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRSGPLPPPPDSRRIPCASPLCSAAHAS 162
Query: 99 ACGGGICGAG--PDNPISNTGTHGDIRIDVLSIQSTDGR-----NPGR-----------A 140
A +C A P I L DG GR A
Sbjct: 163 APPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGRVALGAGARASVA 222
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-- 198
V V NF F C + + VG+AG GR ++LP QL+ S + + L F
Sbjct: 223 VAVDNFTFACAHTALGE-----PVGVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRAD 277
Query: 199 ----DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF--IGVTSV 252
++ G P + YTPL N P YF + + +V
Sbjct: 278 RLIRPSPLILGRSPDDADAAAAETDGFVYTPLLHN-----------PKHPYFYSVALEAV 326
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
V + L +D G GG + + +T+L +Y
Sbjct: 327 SVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMY 366
>gi|383130042|gb|AFG45741.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 4/84 (4%)
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
+L YTP IN ++SG+ Y+I + V + K + L L S D++G GGT I
Sbjct: 14 SLNYTPFLINTKASSSGY----HTFYYIDLRGVSIGRKRLNLPSKLFSFDSKGNGGTIID 69
Query: 280 TVNPYTVLETSIYKALVQAFASAM 303
+ +T+ YK + AFAS +
Sbjct: 70 SGTTFTIFNEEFYKNITAAFASQI 93
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 87/387 (22%), Positives = 143/387 (36%), Gaps = 79/387 (20%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
Y T+I TP L V G + +V C + SST + +C S +
Sbjct: 92 YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-SME 150
Query: 92 CNLANAKACGGGICGAGPDNPISN-TGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C C + D + + + G + D++S P R V F C
Sbjct: 151 CT------CDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTV------FGC 198
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGP 208
+ + GI GLGR +++ QL + F+LC D GA+V G
Sbjct: 199 ENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLG--- 255
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFL---GEP--SVEYFIGVTSVHVNGKAVPLNK 263
++ +G + +P S Y I + +H+ GK +P+N
Sbjct: 256 ---------------------GISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINP 294
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
+ +G GT + + Y L +KA A + + + G D
Sbjct: 295 MVF----DGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQ-----GPDRNYND 345
Query: 324 IGFTRIGPVV-------PQIDLVLQNKNVVWSIHGQNSMVQI--GGDALCLGFVDGGVNP 374
I F+ +G V P +DLV N N + S+ +N + Q A CLG N
Sbjct: 346 ICFSGVGSDVSQLSKTFPAVDLVFSNGNRL-SLSPENYLFQHSKAHGAYCLGIFQNE-ND 403
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGF 401
+T++ +G + N L+ +D ++GF
Sbjct: 404 QTTL-LGGIIVRNTLVMYDREHLKIGF 429
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 78/181 (43%), Gaps = 28/181 (15%)
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFDDG 200
T P F C +E + + GI GLGR ++L SQLA +F+ CL S DG
Sbjct: 197 TFPKVAFGCSTENGVDNSS----GIVGLGRGPLSLVSQLAVG-----RFSYCLRSDMADG 247
Query: 201 A---IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
I+FG L ++ TPL N +L + S Y++ +T + V+
Sbjct: 248 GASPILFGS-----LAKLTERSVVQSTPLLKNP------YL-QRSTHYYVNLTGIAVDST 295
Query: 258 AVPLNKTLLSIDNEGV-GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV--AP 314
+P+ + G+ GGT + + T L Y + QAF S M + + P AP
Sbjct: 296 ELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAP 355
Query: 315 F 315
+
Sbjct: 356 Y 356
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 66/293 (22%), Positives = 112/293 (38%), Gaps = 55/293 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE------------------KGYVSSTNKTAR 86
Y T++K +P + + G +ILWV C SST+
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNTGTHGD-------IRIDVLSIQSTDGRNPGR 139
C +C A + +C ++P T T+GD D + + G N
Sbjct: 151 CSDDRCTAALQTS--EAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMG-NEQT 207
Query: 140 AVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
A + + +F C + L V GI G G+ ++++ SQL + + F+ CL
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGS 267
Query: 198 DDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
D+G +V G+ V L YTPL ++ + Y + + S+ VN
Sbjct: 268 DNGGGILVLGE---------IVEPGLVYTPLVPSQPH------------YNLNLESIVVN 306
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVAR 308
G+ +P++ +L + N GT + + L Y V A +A+ R
Sbjct: 307 GQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR 357
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 131/350 (37%), Gaps = 58/350 (16%)
Query: 22 TAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYVS 79
T P VL++A YV ++K TP + + + + WV C G S
Sbjct: 28 TTAVPIAPGQQVLKIAN------YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS 81
Query: 80 ST---NKTARCGSAQCNLANAKACGGGICGA-GPDNPISNTGTHGDIRIDVLSIQSTDGR 135
+T N + GS C+ A G C A G + N GD + +Q D
Sbjct: 82 TTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQ--DAI 139
Query: 136 NPGRAVTVPNFIFLCGSEFVLQGLASGVV---GIAGLGRSKVALPSQLAAAFSLKRKFAL 192
V +P F F C + ++ G + G+ GLGR ++L SQ A +S F+
Sbjct: 140 TLANDV-IPGFTFGC-----INAVSGGSIPPQGLLGLGRGPISLISQAGAMYS--GVFSY 191
Query: 193 CLSPFDDGAIVFGDGPYY-----DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFI 247
CL F YY L K+++ TPL N PS+ Y++
Sbjct: 192 CLPSFKS---------YYFSGSLKLGPVGQPKSIRTTPLLRNP--------HRPSL-YYV 233
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
+T V V VP+ L D GT I + T +Y A+ F +
Sbjct: 234 NLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NG 291
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ 357
++ + F CF + P + L + N+V + +NS++
Sbjct: 292 PISSLGAFDTCFAATNEA------EAPAVTLHFEGLNLVLPM--ENSLIH 333
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 88/382 (23%), Positives = 138/382 (36%), Gaps = 65/382 (17%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILW---VDCEKGYV----------SSTNKTARCGSA 90
Y+ ++ TP + G ++ W V C Y S+T + C S
Sbjct: 71 HYLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSK 130
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
C+ + C S T G + + +++ ST G+ +V + +F C
Sbjct: 131 LCHKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGK----SVPLKGIVFGC 186
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIV-----FG 205
G G +GI GLG V+L SQ+ ++F KR F+ CL PF V FG
Sbjct: 187 GHNNT-GGFNDHEMGIIGLGGGPVSLISQMGSSFGGKR-FSQCLVPFHTDVSVSSKMSFG 244
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
G K + TPL + T YF+ + + V + N +
Sbjct: 245 KGSKVS------GKGVVSTPLVAKQDKTP----------YFVTLLGISVENTYLHFNGSS 288
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS--AMPKVARVAPVAPFGACFRLQD 323
+++ G + + P T+L T +Y +V S AM V + P C+R ++
Sbjct: 289 QNVEK---GNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGP-QLCYRTKN 344
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFV----DGGVNPRTSIV 379
GPV L + + + + CLGF DGG V
Sbjct: 345 ---NLRGPV-----LTAHFEGADVKLSPTQTFISPKDGVFCLGFTNTSSDGG-------V 389
Query: 380 IGARQLENNLLQFDLATSRLGF 401
G N L+ FDL + F
Sbjct: 390 YGNFAQSNYLIGFDLDRQVVSF 411
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 78/181 (43%), Gaps = 28/181 (15%)
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL-SPFDDG 200
T P F C +E + + GI GLGR ++L SQLA +F+ CL S DG
Sbjct: 197 TFPKVAFGCSTENGVDNSS----GIVGLGRGPLSLVSQLAVG-----RFSYCLRSDMADG 247
Query: 201 A---IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
I+FG L ++ TPL N +L + S Y++ +T + V+
Sbjct: 248 GASPILFGS-----LAKLTEGSVVQSTPLLKNP------YL-QRSTHYYVNLTGIAVDST 295
Query: 258 AVPLNKTLLSIDNEGV-GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV--AP 314
+P+ + G+ GGT + + T L Y + QAF S M + + P AP
Sbjct: 296 ELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAP 355
Query: 315 F 315
+
Sbjct: 356 Y 356
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 83/201 (41%), Gaps = 33/201 (16%)
Query: 33 VLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------KGYVSST--- 81
+ D L Y T I TP + + + G ++LW+ C+ Y S
Sbjct: 89 TMSFGNDYGWLHY-TWIDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRD 147
Query: 82 -NKTARCGSAQCNLANAKACGGGICGAGP--DNPI-----------SNTGTHGDIRIDVL 127
N+ + GS+ + +C +C + P D+P NT + G + D+L
Sbjct: 148 LNQYSPSGSST---SKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDIL 204
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV--GIAGLGRSKVALPSQLAAAFS 185
+ S +V P I CG G GV G+ GLG ++++PS L+ A
Sbjct: 205 HLTSGIDDASNSSVRAP-VIIGCGMR-QTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGL 262
Query: 186 LKRKFALCLSPFDDGAIVFGD 206
+K F+LC + D G I FGD
Sbjct: 263 VKNSFSLCFNDDDSGRIFFGD 283
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 83/201 (41%), Gaps = 33/201 (16%)
Query: 33 VLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--------KGYVSST--- 81
+ D L Y T I TP + + + G ++LW+ C+ Y S
Sbjct: 70 TMSFGNDYGWLHY-TWIDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRD 128
Query: 82 -NKTARCGSAQCNLANAKACGGGICGAGP--DNPI-----------SNTGTHGDIRIDVL 127
N+ + GS+ + +C +C + P D+P NT + G + D+L
Sbjct: 129 LNQYSPSGSST---SKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDIL 185
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV--GIAGLGRSKVALPSQLAAAFS 185
+ S +V P I CG G GV G+ GLG ++++PS L+ A
Sbjct: 186 HLTSGIDDASNSSVRAP-VIIGCGMR-QTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGL 243
Query: 186 LKRKFALCLSPFDDGAIVFGD 206
+K F+LC + D G I FGD
Sbjct: 244 VKNSFSLCFNDDDSGRIFFGD 264
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 68/161 (42%), Gaps = 13/161 (8%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y++G+T + V G+ + + ++ +D G GG + T L+T Y AL F
Sbjct: 342 YYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQTQAYNALRDTFVKLTK 401
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDAL 363
+ + A F C+ L +R VP + + + + N ++ +
Sbjct: 402 DLPSTSGFALFDTCYNLS----SRTSVRVPTVAFLFDGGKSL-PLPPSNYLIPVDSAGTF 456
Query: 364 CLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
CL F P T+ +IG Q + + +DLA S++ FS
Sbjct: 457 CLAFA-----PTTASLSIIGNVQQQGTRVTYDLANSQVSFS 492
>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
Length = 518
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 90/353 (25%), Positives = 134/353 (37%), Gaps = 74/353 (20%)
Query: 79 SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPI-------SNTGTHGDIRIDVLSIQS 131
S+T+ + RC +C G G C + P++ SNT T G + DVL + +
Sbjct: 148 STTSSSIRCSDKRC-------FGSGKCSS-PESICPYQIALSSNTVTTGTLLQDVLHLVT 199
Query: 132 TDGR-NPGRAVTVPNFIFLCGSEFVLQGLAS-GVVGIAGLGRSKVALPSQLAAAFSLKRK 189
D P A N CG V G+ GL + ++PS LA A
Sbjct: 200 EDEDLKPVNA----NVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANS 255
Query: 190 FALCLSPFDD--GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFI 247
F++C G I FGD Y D + TPL + +TA G +
Sbjct: 256 FSMCFGRIISVVGRISFGDKGYTD---------QEETPLVSLETSTAYG----------V 296
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
VT V V G VP++ L ++ + G + +T+L S Y +AF M
Sbjct: 297 NVTGVSVGG--VPVDVPLFALFDTG---------SSFTLLLESAYGVFTKAFDDLMEDKR 345
Query: 308 R-VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNK-------NVVWSIHGQN----SM 355
R V P PF C+ L++ P+ +Q+K + W I + S
Sbjct: 346 RPVDPDFPFEFCYDLREEHLN--SDARPR---HMQSKCYNPCRDDFRWRIQNDSQESVSY 400
Query: 356 VQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFE 408
G CLG + +N +IG + + + FD LG+ S FE
Sbjct: 401 SNEGTKMYCLGILK-SINLN---IIGQNLMSGHRIVFDRERMILGWKQSNCFE 449
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 91/396 (22%), Positives = 158/396 (39%), Gaps = 64/396 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKA---- 99
+Y+ + TP + + G +++W C + T+ +C L N +
Sbjct: 86 EYIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTF 145
Query: 100 ----CGG--GICGA--GPDNP---------ISNTG-THGDIRIDVLSIQSTDGRNPGRAV 141
C +C A GP P TG T G ++ + S+ P AV
Sbjct: 146 GVLPCNSPLSMCAAMAGPSPPPGCACMYNQTYGTGWTAGVQSVETFTFGSSS--TP-PAV 202
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG- 200
VPN F C + +G G+ GLGR ++L SQL A F+ CL+PF D
Sbjct: 203 RVPNIAFGCSN--ASSNDWNGSAGLVGLGRGSMSLVSQLGAG-----AFSYCLTPFQDAN 255
Query: 201 ---AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
++ G L + ++ TP F+ + A S Y++ +T + V
Sbjct: 256 STSTLLLGPSAAAALKG---TGPVRSTP-FVAGPSKA-----PMSTYYYLNLTGISVGET 306
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP--- 314
A+ + S+ +G GG I + T L S Y+ + A S + V R+ P+A
Sbjct: 307 ALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLL--VTRL-PLAHGPD 363
Query: 315 ----FGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVD 369
CF L+ + P +P + L + ++V + +N M+ +G CL +
Sbjct: 364 HSTGLDLCFALKA---STPPPAMPSMTLHFEGGADMVLPV--ENYMI-LGSGVWCLAMRN 417
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSL 405
V + ++G Q +N + +D+ L F+ ++
Sbjct: 418 QTVGAMS--MVGNYQQQNIHVLYDVRKETLSFAPAV 451
>gi|77702557|gb|ABB01163.1| putative xylanase inhibitor [Triticum aestivum]
Length = 100
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRT---SIVI 380
I ++R +VP + ++L+ W++ G NSM Q+ C FV G + + ++VI
Sbjct: 1 IAWSRPRYLVPNMGVMLEG-GTNWTVVGGNSMAQVNSGTACFAFVRSGGSTGSATPAVVI 59
Query: 381 GARQLENNLLQFDLATSRLGFSDSLLFERATCT-FNFTSIA 420
G Q+EN L+ D + L F+ +L +C+ FNFT A
Sbjct: 60 GGFQMENKLVVLDNSKKTLSFTQNLPGMGFSCSNFNFTKAA 100
>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 530
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 90/353 (25%), Positives = 134/353 (37%), Gaps = 74/353 (20%)
Query: 79 SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPI-------SNTGTHGDIRIDVLSIQS 131
S+T+ + RC +C G G C + P++ SNT T G + DVL + +
Sbjct: 160 STTSSSIRCSDKRC-------FGSGKCSS-PESICPYQIALSSNTVTTGTLLQDVLHLVT 211
Query: 132 TDGR-NPGRAVTVPNFIFLCGSEFVLQGLAS-GVVGIAGLGRSKVALPSQLAAAFSLKRK 189
D P A N CG V G+ GL + ++PS LA A
Sbjct: 212 EDEDLKPVNA----NVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANS 267
Query: 190 FALCLSPFDD--GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFI 247
F++C G I FGD Y D + TPL + +TA G +
Sbjct: 268 FSMCFGRIISVVGRISFGDKGYTD---------QEETPLVSLETSTAYG----------V 308
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
VT V V G VP++ L ++ + G + +T+L S Y +AF M
Sbjct: 309 NVTGVSVGG--VPVDVPLFALFDTG---------SSFTLLLESAYGVFTKAFDDLMEDKR 357
Query: 308 R-VAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNK-------NVVWSIHGQN----SM 355
R V P PF C+ L++ P+ +Q+K + W I + S
Sbjct: 358 RPVDPDFPFEFCYDLREEHLN--SDARPR---HMQSKCYNPCRDDFRWRIQNDSQESVSY 412
Query: 356 VQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFE 408
G CLG + +N +IG + + + FD LG+ S FE
Sbjct: 413 SNEGTKMYCLGILK-SINLN---IIGQNLMSGHRIVFDRERMILGWKQSNCFE 461
>gi|383130040|gb|AFG45740.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
+L YTP IN ++SG+ Y+I + V + K + L L S D +G GGT I
Sbjct: 14 SLNYTPFLINTKASSSGY----HTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIID 69
Query: 280 TVNPYTVLETSIYKALVQAFASAM 303
+ +T+ YK + AFAS +
Sbjct: 70 SGTTFTIFNEEFYKNITAAFASQI 93
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 63/263 (23%), Positives = 111/263 (42%), Gaps = 38/263 (14%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKA------CGGGICG 106
TP P KL V G +++W C+ SST AR GS + C +C
Sbjct: 99 TPPQPRKLIVDTGSDLIWTQCK--LSSSTAVAARHGSPPVYDPGESSTFAFLPCSDRLCQ 156
Query: 107 AGPDNPISNTGTHGDIRIDVLSIQSTDGR--------NPGRAVTVPNFIFLCGSEFVLQG 158
G + + T + + DV + G RAV++ F CG+ + G
Sbjct: 157 EGQFSFKNCTSKNRCVYEDVYGSAAAVGVLASETFTFGARRAVSL-RLGFGCGA--LSAG 213
Query: 159 LASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLNNF 215
G GI GL ++L +QL ++F+ CL+PF D ++FG DL+
Sbjct: 214 SLIGATGILGLSPESLSLITQLKI-----QRFSYCLTPFADKKTSPLLFG--AMADLSRH 266
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGG 275
++ ++ T + N V T V Y++ + + + K + + L++ +G GG
Sbjct: 267 KTTRPIQTTAIVSNPVKT---------VYYYVPLVGISLGHKRLAVPAASLAMRPDGGGG 317
Query: 276 TKISTVNPYTVLETSIYKALVQA 298
T + + + L + ++A+ +A
Sbjct: 318 TIVDSGSTVAYLVEAAFEAVKEA 340
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 88/392 (22%), Positives = 145/392 (36%), Gaps = 66/392 (16%)
Query: 34 LRVAKDTATL-QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVS 79
+R+ D T Y T++ +P L V G + +V C + +S
Sbjct: 77 MRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELS 136
Query: 80 STNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
ST + +C +A CN G+ + + G + DV+S P R
Sbjct: 137 STYQPVKC-NADCNCDE-----NGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQR 190
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD- 198
AV F C + GI GLGR +++ QL + F+LC D
Sbjct: 191 AV------FGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDV 244
Query: 199 -DGAIVFG---DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
GA+V G P ++ D S+ S Y I + +HV
Sbjct: 245 GGGAMVLGGISSPPGMVFSHSDPSR----------------------SPYYNIELKEIHV 282
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
GK + LN +G G + + Y Y A A + + +++ P
Sbjct: 283 AGKPLKLNPRTF----DGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDP 338
Query: 315 --FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSM---VQIGGDALCLGFVD 369
CF T + V P++D+V N + S+ +N + ++ G A CLG
Sbjct: 339 NFKDICFSGAGRDVTELPKVFPEVDMVFANGQKI-SLSPENYLFRHTKVSG-AYCLGIFK 396
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G N +T++ +G + N L+ ++ S +GF
Sbjct: 397 NG-NDQTTL-LGGIIVRNTLVTYNRENSTIGF 426
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 88/390 (22%), Positives = 148/390 (37%), Gaps = 72/390 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYVSSTNKTARCGSAQCNLA 95
+Y T+I TP+ P + + G +++W+ C + G + + G+ C
Sbjct: 146 EYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAP 205
Query: 96 NAKACGGGICGAGPDNPISNTG------THGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
+ G C + T GD + L+ S VP
Sbjct: 206 LCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAS--------GARVPRVALG 257
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL---------SPFDDG 200
CG + +GL G+ GLGR ++ PSQ++ F R F+ CL +
Sbjct: 258 CGHDN--EGLFVAAAGLLGLGRGSLSFPSQISRRFG--RSFSYCLVDRTSSSASATSRSS 313
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKA 258
+ FG G S +TP+ N P +E Y++ + + V G
Sbjct: 314 TVTFGSG------AVGPSAAASFTPMVKN-----------PRMETFYYVQLMGISVGGAR 356
Query: 259 VP-LNKTLLSID-NEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP--VAP 314
VP + + L +D + G GG + + T L Y AL AF +A + R++P +
Sbjct: 357 VPGVAVSDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGL-RLSPGGFSL 415
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFV--DGG 371
F C+ L + + VP + + ++ +N ++ + C F DGG
Sbjct: 416 FDTCYDLSGLKVVK----VPTVSMHFAG-GAEAALPPENYLIPVDSRGTFCFAFAGTDGG 470
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGF 401
V+ +IG Q + + FD RLGF
Sbjct: 471 VS-----IIGNIQQQGFRVVFDGDGQRLGF 495
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 69/277 (24%), Positives = 107/277 (38%), Gaps = 51/277 (18%)
Query: 53 TPLVPVKLTVHLGGNILWVDCE--------------------KGYVSSTNKTARCGSAQC 92
TP V + + G ++ WV C+ S+T++ C S C
Sbjct: 43 TPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCSSNLC 102
Query: 93 NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGS 152
+L NA C NT + G + DVL + S ++ + VT P +F CG
Sbjct: 103 DLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQS--KIVTAP-IMFGCGQ 159
Query: 153 EFVLQGLASGVV-GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYD 211
L S G+ GLG ++PS LA+ F++C G I FGD
Sbjct: 160 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGD----- 214
Query: 212 LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE 271
S + K TPL + K N Y I +T + V K++ + +I +
Sbjct: 215 ----TGSSDQKETPLNVYKQNPY----------YNITITGITVGSKSI--STEFSAIVDS 258
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQA----FASAMP 304
G T +S +P TS + A +++ S+MP
Sbjct: 259 GTSFTALS--DPMYTQITSSFDAQIRSSRNMLDSSMP 293
>gi|361067845|gb|AEW08234.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130032|gb|AFG45736.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130034|gb|AFG45737.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130036|gb|AFG45738.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130046|gb|AFG45743.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130048|gb|AFG45744.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130050|gb|AFG45745.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130054|gb|AFG45747.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130056|gb|AFG45748.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
+L YTP IN ++SG+ Y+I + V + K + L L S D +G GGT I
Sbjct: 14 SLNYTPFLINTKASSSGY----HTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIID 69
Query: 280 TVNPYTVLETSIYKALVQAFASAM 303
+ +T+ YK + AFAS +
Sbjct: 70 SGTTFTIFNEEFYKNITAAFASQI 93
>gi|383130038|gb|AFG45739.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 154
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
+L YTP IN ++SG+ Y+I + V + K + L L S D +G GGT I
Sbjct: 14 SLNYTPFLINTKASSSGY----HTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIID 69
Query: 280 TVNPYTVLETSIYKALVQAFASAM 303
+ +T+ YK + AFAS +
Sbjct: 70 SGTTFTIFNEEFYKNITAAFASQI 93
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 76/378 (20%), Positives = 138/378 (36%), Gaps = 63/378 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ + TP + + G +++W CE SS+ T C S
Sbjct: 95 EYLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQ 154
Query: 91 QCNLANAKACGGG---ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFI 147
C +++C G G + T G + + + +++ +VPN
Sbjct: 155 YCQDLPSESCYNDCQYTYGYG-----DGSSTQGYMATETFTFETS---------SVPNIA 200
Query: 148 FLCGSEFVLQGLASGV-VGIAGLGRSKVALPSQLAAAFSLKRKFALCLS---PFDDGAIV 203
F CG + QG G G+ G+G ++LPSQL +F+ C++ +
Sbjct: 201 FGCGEDN--QGFGQGNGAGLIGMGWGPLSLPSQLGVG-----QFSYCMTSSGSSSPSTLA 253
Query: 204 FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNK 263
G V + T L + +N Y+I + + V G + +
Sbjct: 254 LGSAAS------GVPEGSPSTTLIHSSLN---------PTYYYITLQGITVGGDNLGIPS 298
Query: 264 TLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQD 323
+ + ++G GG I + T L Y A+ QAF + + CF+L
Sbjct: 299 STFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPS 358
Query: 324 IGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGAR 383
G T VP+I +Q V ++ +N ++ +CL G + + + G
Sbjct: 359 DGSTV---QVPEIS--MQFDGGVLNLGEENVLISPAEGVICLAM--GSSSQQGISIFGNI 411
Query: 384 QLENNLLQFDLATSRLGF 401
Q + + +DL + F
Sbjct: 412 QQQETQVLYDLQNLAVSF 429
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 88/392 (22%), Positives = 145/392 (36%), Gaps = 66/392 (16%)
Query: 34 LRVAKDTATL-QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVS 79
+R+ D T Y T++ +P L V G + +V C + +S
Sbjct: 77 MRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELS 136
Query: 80 STNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGR 139
ST + +C +A CN G+ + + G + DV+S P R
Sbjct: 137 STYQPVKC-NADCNCDE-----NGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQR 190
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD- 198
AV F C + GI GLGR +++ QL + F+LC D
Sbjct: 191 AV------FGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDV 244
Query: 199 -DGAIVFG---DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHV 254
GA+V G P ++ D S+ S Y I + +HV
Sbjct: 245 GGGAMVLGGISSPPGMVFSHSDPSR----------------------SPYYNIELKEIHV 282
Query: 255 NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP 314
GK + LN +G G + + Y Y A A + + +++ P
Sbjct: 283 AGKPLKLNPRTF----DGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDP 338
Query: 315 --FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSM---VQIGGDALCLGFVD 369
CF T + V P++D+V N + S+ +N + ++ G A CLG
Sbjct: 339 NFKDICFSGAGRDVTELPKVFPEVDMVFANGQKI-SLSPENYLFRHTKVSG-AYCLGIFK 396
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G N +T++ +G + N L+ ++ S +GF
Sbjct: 397 NG-NDQTTL-LGGIIVRNTLVTYNRENSTIGF 426
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 48/186 (25%), Positives = 74/186 (39%), Gaps = 29/186 (15%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY---------------------VSSTNK 83
+ T +K TP + + + G ++ WV C+ G VS+TNK
Sbjct: 107 HYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNK 166
Query: 84 TARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
C ++ C N C + T T G + DV+ + +T+ +NP R
Sbjct: 167 KVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHL-TTEDKNPERVEAY 225
Query: 144 PNFIFLCGSEFVLQGLASGVV---GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
F CG V G + G+ GLG K+++PS LA + F++C G
Sbjct: 226 --VTFGCGQ--VQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVG 281
Query: 201 AIVFGD 206
I FGD
Sbjct: 282 RISFGD 287
>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 68/309 (22%), Positives = 119/309 (38%), Gaps = 73/309 (23%)
Query: 140 AVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA-AFSLKRKFALCLSPFD 198
+V+V NF F C + + +G+AG GR +++LP+QL+ + L F+ CL
Sbjct: 204 SVSVANFTFGCAHTTLAEP-----IGVAGFGRGRLSLPAQLSVHSPHLGNSFSYCLVS-- 256
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPL----FINKVNTA-------------------- 234
++FD + + +PL F++K
Sbjct: 257 --------------HSFDSDRVRRPSPLILGRFVDKKEKRVATTDDDDDGDETKKKKNEF 302
Query: 235 --SGFLGEPSVEYFIGVT--SVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETS 290
+ L P YF V+ + + + +P L ID G GG + + +T+L
Sbjct: 303 VFTEMLVNPKHPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAK 362
Query: 291 IYKALVQAFASAMPKVA----RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVV 346
Y ++V+ F S + +V RV P + C+ L VP + L
Sbjct: 363 FYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLNQT------VKVPALVLHFAGNGST 416
Query: 347 WSIHGQNSMVQI--GGDAL-------CLGFVDGGVNPR----TSIVIGARQLENNLLQFD 393
++ +N + GGD CL ++GG T ++G Q + + +D
Sbjct: 417 VTLPRRNYFYEFMDGGDGKEEKRKVGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYD 476
Query: 394 LATSRLGFS 402
L R+GF+
Sbjct: 477 LLNRRVGFA 485
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 131/350 (37%), Gaps = 58/350 (16%)
Query: 22 TAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--KGYVS 79
T P VL++A YV ++K TP + + + + WV C G S
Sbjct: 28 TTAVPIAPGQQVLKIAN------YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS 81
Query: 80 ST---NKTARCGSAQCNLANAKACGGGICGA-GPDNPISNTGTHGDIRIDVLSIQSTDGR 135
+T N + GS C+ A G C A G + N GD + +Q D
Sbjct: 82 TTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQ--DAI 139
Query: 136 NPGRAVTVPNFIFLCGSEFVLQGLASGVV---GIAGLGRSKVALPSQLAAAFSLKRKFAL 192
V +P F F C + ++ G + G+ GLGR ++L SQ A +S F+
Sbjct: 140 TLANDV-IPGFTFGC-----INAVSGGSIPPQGLLGLGRGPISLISQAGAMYS--GVFSY 191
Query: 193 CLSPFDDGAIVFGDGPYY-----DLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFI 247
CL F YY L K+++ TPL N PS+ Y++
Sbjct: 192 CLPSFKS---------YYFSGSLKLGPVGQPKSIRTTPLLRNP--------HRPSL-YYV 233
Query: 248 GVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA 307
+T V V VP+ L D GT I + T +Y A+ F +
Sbjct: 234 NLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV--NG 291
Query: 308 RVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ 357
++ + F CF + P + L + N+V + +NS++
Sbjct: 292 PISSLGAFDTCFAETNEA------EAPAVTLHFEGLNLVLPM--ENSLIH 333
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 150/392 (38%), Gaps = 81/392 (20%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV-------------SSTNKTARCG-- 88
+Y ++ TP V + + G +++W+ C S T T CG
Sbjct: 134 EYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSR 193
Query: 89 -------SAQCNLANAKACGGGIC-GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRA 140
S++C +K C + G G + T GD + L+ R
Sbjct: 194 LCRRLDDSSECVTRRSKTCLYQVSYGDG-------SFTEGDFSTETLTFHG------ARV 240
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL------ 194
VP CG + +GL G G+ GLGR ++ PSQ ++ KF+ CL
Sbjct: 241 DHVP---LGCGHD--NEGLFVGAAGLLGLGRGGLSFPSQTKNRYN--GKFSYCLVDRTSS 293
Query: 195 --SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFIN-KVNTASGFLGEPSVEYFIGVTS 251
S IVFG N V K +TPL N K++T Y++ +
Sbjct: 294 GSSSKPPSTIVFG--------NAAVPKTSVFTPLLTNPKLDTF----------YYLQLLG 335
Query: 252 VHVNGKAVP-LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVA 310
+ V G VP ++++ +D G GG I + T L Y AL AF K+ R
Sbjct: 336 ISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAP 395
Query: 311 PVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA-LCLGFVD 369
+ F CF L G T + VP + V S+ N ++ + + C F
Sbjct: 396 SYSLFDTCFDLS--GMTTVK--VPTVVFHFGGGEV--SLPASNYLIPVNTEGRFCFAF-- 447
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G SI IG Q + + +DL SR+GF
Sbjct: 448 AGTMGSLSI-IGNIQQQGFRVAYDLVGSRVGF 478
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 69/284 (24%), Positives = 118/284 (41%), Gaps = 56/284 (19%)
Query: 145 NFIFLCGSEFVLQGLASGVV----GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+ +F CG + QGL + GI GL R+K++LPSQLA+ + CL+
Sbjct: 310 DIVFGCG--YDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLA----- 362
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
+ + G+G + ++ S + + P+ L P +E + + G A
Sbjct: 363 SDLNGEGYIFMGSDLVPSHGMTWVPM-----------LHHPHLEVYQMQVTKMSYGNA-- 409
Query: 261 LNKTLLSIDNEG--VGGTKISTVNPYTVLETSIYKALVQAFA-------------SAMPK 305
+LS+D E VG T + YT Y LV + A+P
Sbjct: 410 ----MLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDEALPI 465
Query: 306 VARVAPVAPFGACFRLQDI-GFTRIGPVVPQID---LVLQNKNVVWSIHGQNSMVQIGGD 361
R +P + L D+ F R P+ QI L++ K + I ++ ++
Sbjct: 466 CWRAKTNSPISS---LSDVKKFFR--PITLQIGSKWLIISKKLL---IQPEDYLIISNKG 517
Query: 362 ALCLGFVDG-GVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
+CLG +DG V+ ++I+IG + L+ +D R+G+ S
Sbjct: 518 NVCLGILDGSNVHDGSTIIIGDISMRGRLIVYDNVKQRIGWMKS 561
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 69/277 (24%), Positives = 107/277 (38%), Gaps = 51/277 (18%)
Query: 53 TPLVPVKLTVHLGGNILWVDCE--------------------KGYVSSTNKTARCGSAQC 92
TP V + + G ++ WV C+ S+T++ C S C
Sbjct: 107 TPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCSSNLC 166
Query: 93 NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGS 152
+L NA C NT + G + DVL + S ++ + VT P +F CG
Sbjct: 167 DLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQS--KIVTAP-IMFGCGQ 223
Query: 153 EFVLQGLASGVV-GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYD 211
L S G+ GLG ++PS LA+ F++C G I FGD
Sbjct: 224 VQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGD----- 278
Query: 212 LNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNE 271
S + K TPL + K N Y I +T + V K++ + +I +
Sbjct: 279 ----TGSSDQKETPLNVYKQNPY----------YNITITGITVGSKSI--STEFSAIVDS 322
Query: 272 GVGGTKISTVNPYTVLETSIYKALVQA----FASAMP 304
G T +S +P TS + A +++ S+MP
Sbjct: 323 GTSFTALS--DPMYTQITSSFDAQIRSSRNMLDSSMP 357
>gi|225878|prf||1402246A basic 7S globulin LII
Length = 135
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 52/110 (47%), Gaps = 19/110 (17%)
Query: 274 GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVV 333
GGT IST P VL+ S+Y+A FA+A V PFG C G T +GP+
Sbjct: 9 GGTMISTSTPXMVLQQSVYQAFXN-FANA-------XIVGPFGLCPNQN--GVTSLGPMX 58
Query: 334 PQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGAR 383
+Q + G N MVQ LG ++GG+ PR I +GAR
Sbjct: 59 -----XMQPARQL----GLNLMVQAQPGVTXLGVMNGGMQPRAEITLGAR 99
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 99/437 (22%), Positives = 163/437 (37%), Gaps = 76/437 (17%)
Query: 7 FLLLCSLLIFSLSPITAQTSFRPKALVLRV-------AKDTATLQYVTQIKQRTPLVPVK 59
F + C +LS + A R +L+ V + A Y +I TP
Sbjct: 40 FNVKCKYQDRTLSALKAHDYRRQLSLLAGVDLPLGGSGRPDAVGLYYAKIGIGTPPKNYY 99
Query: 60 LTVHLGGNILWVDCEKGYVSSTN------------KTARCGS-AQCNLANAKACGGGI-- 104
L V G +I+WV+C + T K + G C+ K GG+
Sbjct: 100 LQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFVPCDQEFCKEINGGLLT 159
Query: 105 -CGAGPDNPI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCG---SEFV 155
C A P + T G D++ G + + +F CG S +
Sbjct: 160 GCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSAN-GSIVFGCGARQSGDL 218
Query: 156 LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNF 215
+ GI G G++ ++ SQLA++ +K+ FA CL+ + G I F G
Sbjct: 219 SSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGGI-FAIGHV------ 271
Query: 216 DVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVG- 274
V + TPL ++ + Y + +T+V V + LS D G
Sbjct: 272 -VQPKVNMTPLLPDQPH------------YSVNMTAVQVGHAFLS-----LSTDTSTQGD 313
Query: 275 --GTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ---DIGFTRI 329
GT I + L IY+ LV S P + +V + CF+ D GF
Sbjct: 314 RKGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDL-KVRTLHDEYTCFQYSESVDDGF--- 369
Query: 330 GPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS---IVIGARQLE 386
P + +N + ++ + + GD C+G+ + G R S ++G L
Sbjct: 370 ----PAVTFYFEN-GLSLKVYPHDYLFP-SGDFWCIGWQNSGTQSRDSKNMTLLGDLVLS 423
Query: 387 NNLLQFDLATSRLGFSD 403
N L+ +DL +G+++
Sbjct: 424 NKLVFYDLENQVIGWTE 440
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 85/395 (21%), Positives = 156/395 (39%), Gaps = 73/395 (18%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDC--------------EKGYV----SSTNKTAR 86
Y T++K TP + + G +ILW++C E + SST
Sbjct: 84 YTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVP 143
Query: 87 CGSAQCNLANAKACGGGICGAGPD-NPISNT-------GTHGDIRIDVLSIQSTDGRN-P 137
C C A A G P N S T GT G D + G++ P
Sbjct: 144 CSDPMC----ASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTP 199
Query: 138 GRAVTVPNFIFLCGS--EFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS 195
+ +F C + L V GI G G ++++ SQL++ + F+ CL
Sbjct: 200 ANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLK 259
Query: 196 PFDDGA--IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
+G +V G+ + ++ Y+PL ++ + Y + + S+
Sbjct: 260 GDGNGGGILVLGE---------ILEPSIVYSPLVPSQPH------------YNLNLQSIA 298
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA 313
VNG+ + +N + + ++ GT I + + L Y LV A +A+ + A + ++
Sbjct: 299 VNGQVLSINPAVFATSDK--RGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFA-TSFIS 355
Query: 314 PFGACFRLQDIGFTRIGPVVPQIDLVLQ---NKNVVWSIHGQNSMVQIGGDALCLGF--V 368
C+ + T I P + + + ++ S + N Q G C+GF V
Sbjct: 356 KGSQCYLV----LTSIDDSFPTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKV 411
Query: 369 DGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
GV ++G L++ ++ +DLA ++G+++
Sbjct: 412 QEGVT-----ILGDLVLKDKIVVYDLARQQIGWTN 441
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 82/392 (20%), Positives = 144/392 (36%), Gaps = 66/392 (16%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------KG----YVSSTNKTAR-----C 87
QY T I P P L + G + W+ C+ KG Y + K C
Sbjct: 15 QYYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVHPRDPLC 74
Query: 88 GSAQCN---LANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVP 144
Q N K C I A + + G + D + + + DG +
Sbjct: 75 EELQGNQNYCETCKQCDYEITYA------DRSSSKGVLARDNMQLTTADGE-----MKNV 123
Query: 145 NFIFLCG--SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS--PFDDG 200
+F+F C + L + GI GL ++L +QLA + + F C++ P G
Sbjct: 124 DFVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPSSGG 183
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
+ GD ++ + + P+ N S + P V Y G +++ G+A
Sbjct: 184 YMFLGD-------DYVPRWGMTWVPIRNGPGNVYSTEV--PKVNY--GAQELNLRGQAGK 232
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
L + + + + YT IY L+ A P R C +
Sbjct: 233 LTQVIF------------DSGSSYTYFPHEIYTNLIALLEDASPGFVRDESDQTLPFCMK 280
Query: 321 LQDIGFTRIGPVVPQID-LVLQNKN------VVWSIHGQNSMVQIGGDALCLGFVDG-GV 372
++ +G V + L+LQ + ++I +N ++ +CLG +DG +
Sbjct: 281 -PNVPVRSVGDVEQLFNPLILQLRKRWFVIPTTFAISPENYLIISDKGNVCLGVLDGTEI 339
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
++I+IG L + +D +R+G+ S
Sbjct: 340 GHSSTIIIGDASLRGKFVVYDNDENRIGWVQS 371
>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
Length = 414
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 80/378 (21%), Positives = 139/378 (36%), Gaps = 85/378 (22%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGGI 104
Y+ +++ P +P+ L G ++W V++ N +C + +C+ G
Sbjct: 91 YLVKVRIGNPGIPLYLVPDTGSALIWT------VNNQN-IFQCRNNKCSYTRRYDDGSIT 143
Query: 105 CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV 164
G + + + G+ RI S D +N S F G + GV+
Sbjct: 144 TGVAAQDILQSEGSE---RIPFYFGCSRDNQN--------------FSVFEHTGKSGGVM 186
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNF--DVSK--- 219
G L S V+L QL+ +R+F+ CL+P+ G+ + P L F D+ K
Sbjct: 187 G---LNTSPVSLLQQLSHI--TQRRFSYCLNPYQHGS----EPPPSSLLRFGNDIRKGRR 237
Query: 220 NLKYTPLFINKVNTASGFLGEPSV-EYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
+ TPL + P YF+ + + V G+ + L ++ +G GGT I
Sbjct: 238 RFQSTPL-----------MSSPDRPNYFLNLLDMTVAGQRLHLPPGTFALRQDGTGGTII 286
Query: 279 STVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDL 338
+ T + + Y L+ AF + GF R+ +P+ DL
Sbjct: 287 DSGTGLTFITQTAYPRLISAFQNY------------------FDHRGFQRVH--IPEFDL 326
Query: 339 VLQ-------NKNVVWSIHGQNSMVQIGGDALCLGFVDGG--------VNPRTSIVIGAR 383
+ + + H + + + D + L D P+ VIGA
Sbjct: 327 CYSFRGNHTFHDHASMTFHFERADFTVQADYVYLPMEDDNAFCVALQPTPPQQRTVIGAI 386
Query: 384 QLENNLLQFDLATSRLGF 401
N +D A +L F
Sbjct: 387 NQGNTRFIYDAAAHQLLF 404
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 70/160 (43%), Gaps = 13/160 (8%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y+IG+ + V G VP+++ + + G GG + T T L T Y+A AF +
Sbjct: 325 YYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTA 384
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDAL 363
+ R VA F C+ L +GF + VP + ++ ++ +N ++ +
Sbjct: 385 NLPRATGVAIFDTCYDL--LGFVSV--RVPTVSFYFSGGPIL-TLPARNFLIPMDDAGTF 439
Query: 364 CLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGF 401
C F P TS ++G Q E + FD A +GF
Sbjct: 440 CFAFA-----PSTSGLSILGNIQQEGIQISFDGANGYVGF 474
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 76/383 (19%), Positives = 147/383 (38%), Gaps = 58/383 (15%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGY------------VSSTNKTARCGSAQCNLAN---- 96
TP + + + G + W+ C +SS+ C S C
Sbjct: 74 TPPQNMSMVIDTGSELSWLHCNTNTTATIPYPFFNPNISSSYTPISCSSPTCTTRTRDFP 133
Query: 97 --AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEF 154
A +C A +++ + G++ D S+ NPG N + SE
Sbjct: 134 IPASCDSNNLCHATLSYADASS-SEGNLASDTFGFGSS--FNPGIVFGCMNSSYSTNSE- 189
Query: 155 VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLN 213
S G+ G+ ++L SQL KF+ C+S D G ++ G+ +
Sbjct: 190 ----SDSNTTGLMGMNLGSLSLVSQLKIP-----KFSYCISGSDFSGILLLGE------S 234
Query: 214 NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGV 273
NF +L YTPL +++T + + Y + + + ++ K + ++ L D+ G
Sbjct: 235 NFSWGGSLNYTPLV--QISTPLPYFDRSA--YTVRLEGIKISDKLLNISGNLFVPDHTGA 290
Query: 274 GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF------GACFRLQDIGFT 327
G T ++ L +Y AL F + R F C+R+ + +
Sbjct: 291 GQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVP-VNQS 349
Query: 328 RIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-----GGDAL-CLGFVDGGVNPRTSIVIG 381
+ P +P + LV + + + G + ++ G D++ C F + + + +IG
Sbjct: 350 EL-PELPSVSLVFEGAEM--RVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFIIG 406
Query: 382 ARQLENNLLQFDLATSRLGFSDS 404
++ ++FDL R+G + +
Sbjct: 407 HHHQQSMWMEFDLVEHRVGLAHA 429
>gi|115461432|ref|NP_001054316.1| Os04g0685200 [Oryza sativa Japonica Group]
gi|113565887|dbj|BAF16230.1| Os04g0685200, partial [Oryza sativa Japonica Group]
Length = 330
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 88/353 (24%), Positives = 148/353 (41%), Gaps = 62/353 (17%)
Query: 77 YVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRI-DVLSIQSTDGR 135
++S + C A C NA A +C P + +G+ + I D L R
Sbjct: 3 HLSDCRAASSCPGANCTPRNANA--NNVCP--PYLVVYGSGSTAGLLISDTL-------R 51
Query: 136 NPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL- 194
PGRAV NF+ C V Q + G+AG GR ++PSQL KF+ CL
Sbjct: 52 TPGRAVR--NFVIGCSLASVHQPPS----GLAGFGRGAPSVPSQLGLT-----KFSYCLL 100
Query: 195 -SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEP--SVEYFIGVTS 251
FDD A V G+ D ++Y PL +AS P SV Y++ +T+
Sbjct: 101 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPL----ARSASA---RPPYSVYYYLALTA 153
Query: 252 VHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+ V GK+V L + + GG + + ++ + ++++ + A +A+ +
Sbjct: 154 ITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSK 212
Query: 312 VAPFG----ACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG-------- 359
V G CF + T +P++ L + +V+ ++ +N V G
Sbjct: 213 VVEEGLGLSPCFAMPPGTKTM---ELPEMSLHFKGGSVM-NLPVENYFVVAGPAPSGGAP 268
Query: 360 --GDALCLGFVDG---------GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+A+CL V + +I++G+ Q +N +++DL RLGF
Sbjct: 269 AMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGF 321
>gi|357128791|ref|XP_003566053.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 441
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 69/294 (23%), Positives = 124/294 (42%), Gaps = 58/294 (19%)
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
+ P F F C + + +GIAG G+ K++LPSQL L + F+ C F
Sbjct: 162 IEFPGFCFGCVGSSIRE-----PIGIAGFGKGKLSLPSQLGF---LDKGFSHCFLGF--- 210
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKV--NTASGFLGEPSVE-------YFIGVTS 251
F + N+ +P+ I + + GFL P ++ Y+IG+
Sbjct: 211 -------------WFARNPNIT-SPMVIGDLALSVKDGFLFTPMLKSLTYPNFYYIGLEG 256
Query: 252 VHV-NGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP--KVAR 308
V + + A+P +L ID+EG GG + T YT L Y +++ + +S +P +
Sbjct: 257 VTIGDNAAIPAPPSLSGIDSEGNGGVIVDTGTTYTHLSDPFYASVLSSLSSTVPYNRSYE 316
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQI--------DLVLQNKNVVWSIHG-QNSMV--- 356
+ F C ++ + +P I L L ++ +++ +NS+V
Sbjct: 317 LEIRTGFDLCLKVPCMHAPCNDDELPPITVHLGGDVTLALPKESCYYAVTAPRNSVVIKC 376
Query: 357 ----QIGGDALCLGFVDGGVNPRTS-----IVIGARQLENNLLQFDLATSRLGF 401
+ D + D G + S V+G+ Q++N + +DL + R+GF
Sbjct: 377 LLFQRKDDDGVFSADNDDGEDASFSAGGPAAVLGSFQMQNVEVVYDLESGRVGF 430
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 91/390 (23%), Positives = 150/390 (38%), Gaps = 59/390 (15%)
Query: 33 VLRVAKDTATLQ---YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV----------- 78
V+ +A LQ Y+ ++ TP P+ L + ++ W+ C G V
Sbjct: 84 VVPIASGRQMLQSTTYIVKVLIGTPAQPLLLAMDTSSDVAWIPCS-GCVGCPSNTAFSPA 142
Query: 79 -SSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTD--GR 135
S++ K C + QC ACG C S T+G I Q T
Sbjct: 143 KSTSFKNVSCSAPQCKQVPNPACGARAC--------SFNLTYGSSSIAANLSQDTIRLAA 194
Query: 136 NPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS 195
+P +A F F C ++ G G+ GLGR ++L SQ + + K F+ CL
Sbjct: 195 DPIKA-----FTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVY--KSTFSYCLP 247
Query: 196 PFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
F ++ F L + +KYT L N ++ Y++ + ++ V
Sbjct: 248 SFR--SLTFSGS--LRLGPTSQPQRVKYTQLLRNPRRSS---------LYYVNLVAIRVG 294
Query: 256 GKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM-PKVARVAPVAP 314
K V L ++ + GT + YT L +Y+A+ F + P A V +
Sbjct: 295 RKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGG 354
Query: 315 FGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ-IGGDALCLGFVDGGVN 373
F C+ Q VP I + + N+ ++ N M+ G CL N
Sbjct: 355 FDTCYSGQV--------KVPTITFMFKGVNM--TMPADNLMLHSTAGSTSCLAMASAPEN 404
Query: 374 PRTSI-VIGARQLENNLLQFDLATSRLGFS 402
+ + VI + Q +N+ + D+ RLG +
Sbjct: 405 VNSVVNVIASMQQQNHRVLIDVPNGRLGLA 434
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 92/397 (23%), Positives = 143/397 (36%), Gaps = 71/397 (17%)
Query: 39 DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-----------------------K 75
D L Y T I TP V + + G N+LW+ C
Sbjct: 95 DFGWLHY-TWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYN 153
Query: 76 GYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQ-STDG 134
SST+K C C+ A+ C + NT + G + D+L + +T+
Sbjct: 154 PSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNN 213
Query: 135 R-NPGRAVTVPNFIFLCGSEF---VLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKF 190
R G + + CG + L G+A G+ GLG +++++PS L+ A ++ F
Sbjct: 214 RLMNGSSSVKARVVIGCGKKQSGDYLDGVAPD--GLMGLGPAEISVPSFLSKAGLMRNSF 271
Query: 191 ALCLSPFDDGAIVFGD-GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGV 249
+LC D G I FGD GP + TP N SG Y +GV
Sbjct: 272 SLCFDEEDSGRIYFGDMGPSIQ----------QSTPFLQLDNNKYSG--------YIVGV 313
Query: 250 TSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARV 309
+ + L +T + T I + +T L IY+ + + ++
Sbjct: 314 EACCIGNSC--LKQTSFT--------TFIDSGQSFTYLPEEIYRKVALEIDRHINATSKN 363
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQ--IGGDALCLGF 367
+ C+ + P VP I L + N + IH + Q G CL
Sbjct: 364 FEGVSWEYCYE------SSAEPKVPAIKLKFSHNN-TFVIHKPLFVFQQSQGLVQFCLPI 416
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
G S IG + + FD +LG+S S
Sbjct: 417 SPSGQEGIGS--IGQNYMRGYRMVFDRENMKLGWSPS 451
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 90/406 (22%), Positives = 154/406 (37%), Gaps = 75/406 (18%)
Query: 28 RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARC 87
RP+A+ L AK I TP + V G +I+WV+C + + K +
Sbjct: 91 RPEAVGLYYAK----------IGIGTPARDYYVQVDTGSDIMWVNCIQ--CNECPKKSSL 138
Query: 88 GSAQCNLANAK--------ACGGGIC----GAGPDNPISNTG-THGDIRID--------V 126
G + L + K +C C G P I+N ++ +I D V
Sbjct: 139 G-MELTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFV 197
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLAS---GVVGIAGLGRSKVALPSQLAAA 183
I D + T N + G G S + GI G G+S ++ SQLA++
Sbjct: 198 RDIVQYDQVSGDLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASS 257
Query: 184 FSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV 243
+++ FA CL + G I F G V + TPL N+ +
Sbjct: 258 GKVRKMFAHCLDGLNGGGI-FAIGHI-------VQPKVNTTPLVPNQTH----------- 298
Query: 244 EYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
Y + + +V V G + L + + ++ GT I + L +Y L+ S
Sbjct: 299 -YNVNMKAVEVGGYFLNLPTDVFDVGDK--KGTIIDSGTTLAYLPEVVYDQLLSKIFSWQ 355
Query: 304 PKVARVAPVAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
+ +V + CF+ D GF P + +N ++ +H + G
Sbjct: 356 SDL-KVHTIHDQFTCFQYSESLDDGF-------PAVTFHFEN-SLYLKVHPHEYLFSYDG 406
Query: 361 DALCLGFVDGGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
C+G+ + G+ R ++G L N L+ +DL +G+++
Sbjct: 407 -LWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTE 451
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 70/306 (22%), Positives = 105/306 (34%), Gaps = 53/306 (17%)
Query: 39 DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYV---------------SSTNK 83
D T YV TP + L V G ++ WV C+ SS+
Sbjct: 131 DIGTSNYVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYA 190
Query: 84 TARCGSAQCNLAN--AKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
CG + C A AC CG ++GD + + S+D
Sbjct: 191 AVPCGRSACAGLGIYASACSAAQCG--------YVVSYGDGS-NTTGVYSSDTLTLAANA 241
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
TV F+F CG GL +G+ G+ G GR + +L Q A A+ + L G
Sbjct: 242 TVQGFLFGCG-HAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTKSSTTGY 300
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVNGKAV 259
+ G GP F ++ L P+ Y + +T + V G+ +
Sbjct: 301 LTLG-GPSGVAPGFSTTQ-----------------LLPSPNAPTYYVVMLTGISVGGQPL 342
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
+ + + GT + T T L + Y AL AF S M P+ C+
Sbjct: 343 SVPASAFA------AGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCY 396
Query: 320 RLQDIG 325
G
Sbjct: 397 SFAGYG 402
>gi|302772671|ref|XP_002969753.1| hypothetical protein SELMODRAFT_410693 [Selaginella moellendorffii]
gi|300162264|gb|EFJ28877.1| hypothetical protein SELMODRAFT_410693 [Selaginella moellendorffii]
Length = 347
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 105/255 (41%), Gaps = 45/255 (17%)
Query: 166 IAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKN--LKY 223
IA LG ALP+Q++ + L +FA L D A +F V K ++Y
Sbjct: 118 IAALGSKNTALPAQISRSLGLPLRFAYVLR--DTSASIF------------VGKTGWIQY 163
Query: 224 TPLFINKVNTASGFLGEP-----SVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKI 278
T + + V F+ P + Y + +T + + K L+ + V +I
Sbjct: 164 TQI-VPPVTVPVEFVQIPLKLDGAASYMVKMTGIGI--------KAFLTGQEDNV---EI 211
Query: 279 STVNPYTVLETSIYKALVQAF--ASAMPKVARVAPVA---PFGACFRLQDIGFTRIGPVV 333
S +T L IY +V F ++ K+ R + A G C++++ TR
Sbjct: 212 SVTQRFTTLPPKIYGFVVAQFQQEASERKIKRASTSAYNGKLGLCYQMRSSDVTRFR--- 268
Query: 334 PQIDLVLQNKNVVWSIHGQNSMVQIGG--DALCLGFVDGGVNPRTSIVIGARQLENNLLQ 391
+ +V +K WS+ +V G + CL +++ + VIG Q E+ ++
Sbjct: 269 -NVTMVFSSK-FRWSVPADKYLVPKPGTSNVFCLAYLELAAGNGSHGVIGTLQQEDRAME 326
Query: 392 FDLATSRLGFSDSLL 406
F+L LG S L+
Sbjct: 327 FNLERKSLGVSSPLI 341
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 70/160 (43%), Gaps = 13/160 (8%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y+IG+ + V G VP+++ + + G GG + T T L T Y+A AF +
Sbjct: 367 YYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTA 426
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI-GGDAL 363
+ R VA F C+ L +GF + VP + ++ ++ +N ++ +
Sbjct: 427 NLPRATGVAIFDTCYDL--LGFVSV--RVPTVSFYFSGGPIL-TLPARNFLIPMDDAGTF 481
Query: 364 CLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGF 401
C F P TS ++G Q E + FD A +GF
Sbjct: 482 CFAFA-----PSTSGLSILGNIQQEGIQISFDGANGYVGF 516
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 78/378 (20%), Positives = 138/378 (36%), Gaps = 56/378 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK-----------GYVSSTNKTARCGSAQC 92
Y + P P L V G ++ W+ C+ Y + NK C ++ C
Sbjct: 72 HYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWYKPTKNKIVPCAASLC 131
Query: 93 -NLANAKACGGGICGAGP---DNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIF 148
+L K C A P D I T + + + + RN + N F
Sbjct: 132 TSLTPNKKC------AVPQQCDYQIKYTDKASSLGVLIADNFTLSLRN--SSTVRANLTF 183
Query: 149 LCGSEFVLQ---GLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
CG + + + + G+ GLG+ V+L SQL K C S G + FG
Sbjct: 184 GCGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNGGGFLFFG 243
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
D D+ + T ++ T SG P G +++ + +++ +
Sbjct: 244 D---------DIVPTSRVT--WVPMARTTSGNYYSP------GSGTLYFDRRSLGMKPME 286
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
+ D+ G T Y Y+A V A + + K + C++ Q +
Sbjct: 287 VVFDS---GST-------YAYFAAEPYQATVSALKAGLSKSLKEVSDVSLPLCWKGQKV- 335
Query: 326 FTRIGPVVPQIDLVLQN--KNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGAR 383
F + V + + KN V I +N ++ +CLG +DG +IG
Sbjct: 336 FKSVSEVKNDFKSLFLSFGKNSVMEIPPENYLIVTKYGNVCLGILDGTTAKLKFNIIGDI 395
Query: 384 QLENNLLQFDLATSRLGF 401
+++ ++ +D +LG+
Sbjct: 396 TMQDQMIIYDNEKGQLGW 413
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 111/290 (38%), Gaps = 61/290 (21%)
Query: 40 TATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEK------------------GYVSST 81
T T YV TP V + + + +W+ C ++SST
Sbjct: 92 TNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSST 151
Query: 82 NKTARCGSAQCNL-------ANAKACGGG-ICGAGPDNPISNTGTHGDIRIDVLSIQSTD 133
+ RC + C A+ CG + G G N T G + +D + +
Sbjct: 152 IREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANT-----TAGLLAVDAFAFAT-- 204
Query: 134 GRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALC 193
V IF C +G GV+G LGR +++L SQL +F+
Sbjct: 205 -------VRADGVIFGCA--VATEGDIGGVIG---LGRGELSLVSQLQIG-----RFSYY 247
Query: 194 LSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
L+P DD V + D S+ + TPL N+ + + Y++ + +
Sbjct: 248 LAP-DDAVDVGSFILFLDDAKPRTSRAVS-TPLVANRASRS---------LYYVELAGIR 296
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
V+G+ + + + + +G GG +S P T L+ YK + QA AS +
Sbjct: 297 VDGEDLAIPRGTFDLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKI 346
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 93/406 (22%), Positives = 150/406 (36%), Gaps = 59/406 (14%)
Query: 21 ITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKG---- 76
IT + F P+A LR+++D Y+ ++ +P VP+ L G + W CE
Sbjct: 69 ITTSSGFSPEAFRLRISQDDTC--YLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRF 126
Query: 77 ------YVSSTNKTAR---CGSAQC-NLANAKACGGGIC-----GAGPDNPISNTGTHGD 121
+ S+ ++T R C C N N C C AG + T G
Sbjct: 127 RQLPPIFNSTASRTYRDLPCQHQFCTNNQNVFQCRDDKCVYRIAYAG------GSATAGV 180
Query: 122 IRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSE---FVLQGLASGVVGIAGLGRSKVALPS 178
D+L D R P F F C + F + GI GL S V+L
Sbjct: 181 AAQDILQSAEND-RIP--------FYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQ 231
Query: 179 QLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFL 238
Q+ K +F+ CL+ FD + N+ S+ + F++
Sbjct: 232 QMNHI--TKNRFSYCLNLFDLSSPSHATSLLRFGNDIRKSRRKYLSTPFVSP-------R 282
Query: 239 GEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQA 298
G P+ YF+ + V V G + + ++ +G GGT I + T + + Y ++ A
Sbjct: 283 GMPN--YFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITA 340
Query: 299 FASAMPK--VARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV 356
F + + RV C++ Q F P + Q + V
Sbjct: 341 FKNYFDQHGFQRVNIQLSGYICYKQQGHTFHN----YPSMAFHFQGADFFVEPEYVYLTV 396
Query: 357 QIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
Q G A C+ ++P+ +IGA N +D A +L F+
Sbjct: 397 QDRG-AFCVAL--QPISPQQRTIIGALNQANTQFIYDAANRQLLFT 439
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 104/256 (40%), Gaps = 34/256 (13%)
Query: 85 ARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSI---QSTDGRNPGRAV 141
C S C+ NA A C ++NT + G++ DVL + + G AV
Sbjct: 169 VTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAAGAAV 228
Query: 142 TVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK-RKFALCLSPF 197
P +F CG + L G A+ G+ GLG KV++PS LA+ +K F++C S
Sbjct: 229 RTP-VVFGCGQVQTGSFLDGAAAD--GLMGLGMEKVSVPSILASTGVVKSNSFSMCFSKD 285
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G I FGD S + TP + ++ Y I +TS+ V K
Sbjct: 286 GLGRINFGD---------TGSADQSETPFIVKSTHS----------YYNISITSMSVGDK 326
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+PL +I + G T ++ +P T+ + A + + R P PF
Sbjct: 327 NLPLG--FYAIADSGTSFTYLN--DPAYTAYTTNFNAQISERRANFSGSTRSGPF-PFEY 381
Query: 318 CFRLQDIGFTRIGPVV 333
C+ L T PVV
Sbjct: 382 CYSLSPDQTTVELPVV 397
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 90/406 (22%), Positives = 154/406 (37%), Gaps = 75/406 (18%)
Query: 28 RPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARC 87
RP+A+ L AK I TP + V G +I+WV+C + + K +
Sbjct: 91 RPEAVGLYYAK----------IGIGTPARDYYVQVDTGSDIMWVNCIQ--CNECPKKSSL 138
Query: 88 GSAQCNLANAK--------ACGGGIC----GAGPDNPISNTG-THGDIRID--------V 126
G + L + K +C C G P I+N ++ +I D V
Sbjct: 139 G-MELTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFV 197
Query: 127 LSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLAS---GVVGIAGLGRSKVALPSQLAAA 183
I D + T N + G G S + GI G G+S ++ SQLA++
Sbjct: 198 RDIVQYDQVSGDLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASS 257
Query: 184 FSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSV 243
+++ FA CL + G I F G V + TPL N+ +
Sbjct: 258 GKVRKMFAHCLDGLNGGGI-FAIGHI-------VQPKVNTTPLVPNQTH----------- 298
Query: 244 EYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM 303
Y + + +V V G + L + + ++ GT I + L +Y L+ S
Sbjct: 299 -YNVNMKAVEVGGYFLNLPTDVFDVGDK--KGTIIDSGTTLAYLPEVVYDQLLSKIFSWQ 355
Query: 304 PKVARVAPVAPFGACFRLQ---DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG 360
+ +V + CF+ D GF P + +N ++ +H + G
Sbjct: 356 SDL-KVHTIHDQFTCFQYSESLDDGF-------PAVTFHFEN-SLYLKVHPHEYLFSYDG 406
Query: 361 DALCLGFVDGGVNPRTS---IVIGARQLENNLLQFDLATSRLGFSD 403
C+G+ + G+ R ++G L N L+ +DL +G+++
Sbjct: 407 -LWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTE 451
>gi|255635082|gb|ACU17899.1| unknown [Glycine max]
Length = 92
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 47/92 (51%), Gaps = 7/92 (7%)
Query: 2 SVAQSFLLL---CSLLIFSLSPITAQTSFRPKALV-LRVAKDTATLQYVTQIKQRTPLVP 57
S+ FL L CS L F +T +P LV L V D +T + +++RTPL+
Sbjct: 3 SILHYFLALSLSCSFLFFLSDSVTPT---KPINLVVLPVQNDGSTGLHWANLQKRTPLMQ 59
Query: 58 VKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
V + V L GN LWV+C + Y S T + C S
Sbjct: 60 VPVLVDLNGNHLWVNCVQQYSSKTYQAPFCHS 91
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 143/372 (38%), Gaps = 54/372 (14%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAKACGGG 103
+Y+ Q+ TP + + G ++ W+ C+ + C S AK+
Sbjct: 114 EYIIQVDFGTPKQSMYTLIDTGSDVAWIPCK--------QCQGCHSTAPIFDPAKSSSYK 165
Query: 104 --ICGAGPDNPIS-NTGTHGDIRIDVLSIQST--DGRNPGRAVT-----VPNFIFLCGSE 153
C + P IS N G + + +VL T DG A+T +PNF F C
Sbjct: 166 PFACDSQPCQEISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLGSQYLPNFSFGCAES 225
Query: 154 FVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLN 213
+S + G G + + A F + L S G++V G
Sbjct: 226 LSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVS-- 283
Query: 214 NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGV 273
S +LK+T L + +PS F VT KA+ + T +S+ +
Sbjct: 284 ----SSSLKFTTL-----------IKDPSFPTFYFVTL-----KAISVGNTRISVPATNI 323
Query: 274 ---GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIG 330
GGT I + T L S YK L AF + + + PV C+ L
Sbjct: 324 ASGGGTIIDSGTTITYLVPSAYKDLRDAFRQQLSSL-QPTPVEDMDTCYDLSSSSVD--- 379
Query: 331 PVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLL 390
VP I L L ++NV + +N ++ CL F + R+ +IG Q +N +
Sbjct: 380 --VPTITLHL-DRNVDLVLPKENILITQESGLSCLAF--SSTDSRS--IIGNVQQQNWRI 432
Query: 391 QFDLATSRLGFS 402
FD+ S++GF+
Sbjct: 433 VFDVPNSQVGFA 444
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 94/399 (23%), Positives = 146/399 (36%), Gaps = 72/399 (18%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTARCG 88
YV TP V V L G ++W C S+T + +CG
Sbjct: 61 HYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCG 120
Query: 89 SAQCNLANAKACGG-GICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFI 147
S C + C G G CG + P T G D ++I + +GR
Sbjct: 121 SPLCKSIPTRNCSGDGECGY--EAPSMFGDTFGIASTDAIAIGNAEGR----------LA 168
Query: 148 FLC--GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS---PFDDGAI 202
F C S+ + G G G GLGR+ +L Q S F+ CL+ P A+
Sbjct: 169 FGCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQ-----SNVTAFSYCLALHGPGKKSAL 223
Query: 203 VFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG-KAVPL 261
G K+ TPL + S +P Y+ +V + G KA +
Sbjct: 224 FLG----ASAKLAGAGKSNPPTPLLGQHASNTSDDGSDP---YY----TVQLEGIKAGDV 272
Query: 262 NKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR- 320
S + ++ T P + L + Y+AL + +A+ + P PF CF+
Sbjct: 273 AVAAASSGGGAITVLQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQN 332
Query: 321 -----LQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
+ D+ FT G L Q +K ++ +G ++ + L D GV+
Sbjct: 333 AAVSGVPDLVFTFQGGAT----LTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVS- 387
Query: 375 RTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCT 413
++G+ EN FDL ++L FE A C+
Sbjct: 388 ----ILGSLLQENVHFLFDLE------KETLSFEPADCS 416
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 63/280 (22%), Positives = 116/280 (41%), Gaps = 40/280 (14%)
Query: 38 KDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANA 97
+ + L+YV + TP PV + G +++W C + Q
Sbjct: 89 RPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEP 148
Query: 98 KACGGGICG----AGPDNPISNTGTH----GDIRIDVLSIQ----STDGRNPGRAVTVPN 145
C G +C + P + T + G + + V + + ++ G TVP
Sbjct: 149 MRCAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVP- 207
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF---DDGAI 202
F CGS V G + GI G GR+ ++L SQL+ R+F+ CL+ + +
Sbjct: 208 LGFGCGSVNV--GSLNNGSGIVGFGRNPLSLVSQLSI-----RRFSYCLTSYASRRQSTL 260
Query: 203 VFG---DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
+FG DG Y D + ++ TPL + N P+ Y++ T + V + +
Sbjct: 261 LFGSLSDGVY-----GDATGRVQTTPLLQSPQN--------PTF-YYVHFTGLTVGARRL 306
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF 299
+ ++ ++ +G GG + + T+L ++ +V+AF
Sbjct: 307 RIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAF 346
>gi|383130052|gb|AFG45746.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
+L YTP IN ++SG+ Y+I + V + K + L L S D +G GGT I
Sbjct: 14 SLNYTPFLINTKASSSGY----HTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIID 69
Query: 280 TVNPYTVLETSIYKALVQAFASAM 303
+ +T+ YK + AF+S +
Sbjct: 70 SGTTFTIFNEEFYKNITAAFSSQI 93
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 93/411 (22%), Positives = 149/411 (36%), Gaps = 84/411 (20%)
Query: 31 ALVLRVAKDTATLQYVTQIKQ-RTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS 89
++VL ++ + L Y + + Q P + + G +I WV C+ A C
Sbjct: 39 SVVLLLSGNVFPLGYYSVLLQIGNPPKAFEFDIDTGSDITWVQCD----------APC-- 86
Query: 90 AQCNL---------ANAKACGGGICGA--GPDNP-ISNTGTHGDIRIDVLSIQSTDG--- 134
CNL N C IC A P+NP N D ++ S+ G
Sbjct: 87 TGCNLPPKLQYKPKGNTVPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALV 146
Query: 135 --RNPGR----AVTVPNFIFLCGSE--FVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL 186
+ P + + P F CG + + G+ GLGR K+ L +QL +A
Sbjct: 147 IDQFPFKLLNGSAMQPRLAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLT 206
Query: 187 KRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYF 246
+ CLS G + FGD S + +TPL P Y
Sbjct: 207 RNVVGHCLSSKGGGYLFFGD-------TLIPSLGVAWTPLL------------PPDNHYT 247
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP-- 304
G + NGK L L D T + YT + Y+ +V + +
Sbjct: 248 TGPAELLFNGKPTGLKGLKLIFD----------TGSSYTYFNSKTYQTIVNLIGNDLKVS 297
Query: 305 --KVA---RVAPVA-----PFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNS 354
KVA + P+ PF + +++ F I I+ +N I ++
Sbjct: 298 PLKVAKEDKTLPICWKGAKPFKSVLEVKNF-FKTI-----TINFTNARRNTQLQIPPESY 351
Query: 355 MVQIGGDALCLGFVDGG-VNPRTSIVIGARQLENNLLQFDLATSRLGFSDS 404
++ CLG ++G V + S VIG ++ L+ +D +LG+ S
Sbjct: 352 LIISKTGNACLGLLNGSEVGLQNSNVIGDISMQGLLIIYDNEKQQLGWVSS 402
>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
Length = 508
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 68/166 (40%), Gaps = 34/166 (20%)
Query: 138 GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPF 197
G +V V NF F C + + VG+AG GR ++LP QLA L +F+ CL
Sbjct: 225 GASVAVDNFTFACAHTALGE-----PVGVAGFGRGPLSLPGQLAP--QLSGRFSYCLV-- 275
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTAS----GF-----LGEPSVEYF-- 246
++F + ++ +PL + + A+ GF L P YF
Sbjct: 276 --------------SHSFRADRLIRPSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYS 321
Query: 247 IGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIY 292
+ + +V V + L +D G GG + + +T+L Y
Sbjct: 322 VALEAVSVGATRIQARPELARVDRAGNGGMVVDSGTTFTMLPNETY 367
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 56/258 (21%), Positives = 98/258 (37%), Gaps = 51/258 (19%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD------DGAIV--FGDGPYYDLNNFD 216
+AG GR+ +LP Q+ +KFA CL+ D G ++ + DG
Sbjct: 226 ALAGFGRTMFSLPMQMGV-----KKFAYCLNSHDYDDTRNSGKLILDYSDG--------- 271
Query: 217 VSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGT 276
++ L Y P N + + Y++GV + + K + + L+ ++ GG
Sbjct: 272 ETQGLSYAPFXKNPPDYP--------IYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGV 323
Query: 277 KISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP---FGACFRLQDIGFTRIGPVV 333
I + Y+ + ++K + M K R + C+ +I
Sbjct: 324 VIDSGFAYSYMTLPVFKIVTNELKKQMSKYRRSLELEAQTGVTPCYNFTGHKSIKIP--- 380
Query: 334 PQIDLVLQ---NKNVVWSIHGQNSMVQIGGDAL-CLGFVDGG------VNPRTSIVIGAR 383
DL+ Q N+V + G N + +L C P SI++G
Sbjct: 381 ---DLIYQFTGGANMV--VPGMNYFLLFSEASLGCFPVTTDSPTSNLEFTPGPSIILGNY 435
Query: 384 QLENNLLQFDLATSRLGF 401
Q ++ ++FDL RLGF
Sbjct: 436 QQVDHYVEFDLKNERLGF 453
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 87/377 (23%), Positives = 137/377 (36%), Gaps = 45/377 (11%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-----KGYVSS----------TNKTAR 86
+L+YV + TP V + + G ++ WV C+ + Y + +
Sbjct: 88 SLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVP 147
Query: 87 CGSAQCNLANAKACGGGICG-AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
C S C A A G G G +G + G R + ST+ V V +
Sbjct: 148 CDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKPGVVVAD 207
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
F F CG G G+ GLG + +L SQ ++ F F+ CL P GA
Sbjct: 208 FGFGCGDH--QHGPYEKFDGLLGLGGAPESLVSQTSSQFG--GPFSYCLPPTSGGAGFLT 263
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
G + ++ + L +TP+ ++ + F Y + +T + V G + + +
Sbjct: 264 LGAPPNSSSSTAASGLSFTPM--RRLPSVPTF-------YIVTLTGISVGGAPLAIPPSA 314
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDI- 324
S G I + T L + Y AL AF SAM + R+ P + G D
Sbjct: 315 FS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEY-RLLPPSNGGVLDTCYDFT 367
Query: 325 GFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQ 384
G + VP I L + +V CL F G + I+ Q
Sbjct: 368 GHANV--TVPTISLTFSGGATIDLAAPAGVLVD-----GCLAFAGAGTDNAIGIIGNVNQ 420
Query: 385 LENNLLQFDLATSRLGF 401
+L +D +GF
Sbjct: 421 RTFEVL-YDSGKGTVGF 436
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 40/172 (23%), Positives = 76/172 (44%), Gaps = 14/172 (8%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y + + ++ V+G+ + + + + +D G GG + + TVL Y+A+V A +
Sbjct: 321 YDVSIKAISVDGELLKIPRDVWEVD--GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLA 378
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALC 364
+ RVA + PF C+ G +P++ + + ++ ++ C
Sbjct: 379 RFPRVA-MDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARL-EPPSKSYVIDAAPGVKC 436
Query: 365 LGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFN 415
+G +G P I VIG + +L +FDL R L F+R+ CT +
Sbjct: 437 IGVQEG---PWPGISVIGNILQQEHLWEFDLKNRR------LRFKRSRCTHS 479
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 65/244 (26%), Positives = 100/244 (40%), Gaps = 34/244 (13%)
Query: 85 ARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSI---QSTDGRNPGRAV 141
C S C+ NA A C ++NT + G++ DVL + + G AV
Sbjct: 169 VTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAAGAAV 228
Query: 142 TVPNFIFLCG---SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLK-RKFALCLSPF 197
P +F CG + L G A+ G+ GLG KV++PS LA+ +K F++C S
Sbjct: 229 RTP-VVFGCGQVQTGSFLDGAAAD--GLMGLGMEKVSVPSILASTGVVKSNSFSMCFSKD 285
Query: 198 DDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGK 257
G I FGD S + TP + ++ Y I +TS+ V K
Sbjct: 286 GLGRINFGD---------TGSADQSETPFIVKSTHS----------YYNISITSMSVGDK 326
Query: 258 AVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA 317
+PL +I + G T ++ +P T+ + A + + R P PF
Sbjct: 327 NLPLG--FYAIADSGTSFTYLN--DPAYTAYTTNFNAQISERRANFSGSTRSGPF-PFEY 381
Query: 318 CFRL 321
C+ L
Sbjct: 382 CYSL 385
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 85/370 (22%), Positives = 147/370 (39%), Gaps = 52/370 (14%)
Query: 57 PVKLTVHLGGNILWVDCEKGYVSSTNKTARCGS------AQCNLANAKACGGGICGAGPD 110
P KL V G +++W C+ SST AR GS + + C +C G
Sbjct: 25 PRKLIVDTGSDLIWTQCKLS--SSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQF 82
Query: 111 NPISNTGTHGDIRIDVLSIQSTDGR--------NPGRAVTVPNFIFLCGSEFVLQGLASG 162
+ + T + + DV + G RAV++ F CG+ + G G
Sbjct: 83 SFKNCTSKNRCVYEDVYGSAAAVGVLASETFTFGARRAVSL-RLGFGCGA--LSAGSLIG 139
Query: 163 VVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD---GAIVFGDGPYYDLNNFDVSK 219
GI GL ++L +QL ++F+ CL+PF D ++FG DL+ ++
Sbjct: 140 ATGILGLSPESLSLITQLKI-----QRFSYCLTPFADKKTSPLLFG--AMADLSRHKTTR 192
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
++ T + N V T V Y++ + + + K + + L++ +G GGT +
Sbjct: 193 PIQTTAIVSNPVET---------VYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVD 243
Query: 280 TVNPYTVLETSIYKALVQAFASA--MPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQID 337
+ + L + ++A+ +A +P R V + CF L V
Sbjct: 244 SGSTVAYLVEAAFEAVKEAVMDVVRLPVANRT--VEDYELCFVLPRRTAAAAMEAVQVPP 301
Query: 338 LVLQ-NKNVVWSIHGQNSMVQIGGDALCLGF---VDG-GVNPRTSIVIGARQLENNLLQF 392
LVL + + N + +CL DG GV+ +IG Q +N + F
Sbjct: 302 LVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVS-----IIGNVQQQNMHVLF 356
Query: 393 DLATSRLGFS 402
D+ + F+
Sbjct: 357 DVQHHKFSFA 366
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 86/395 (21%), Positives = 150/395 (37%), Gaps = 78/395 (19%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--------------------GYVSSTNKT 84
Y T++K +P + + G +ILWV C G +++ + T
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 85 AR---CGS------AQCNLANAKACGGGI-CGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
C S AQC+ N CG G G +GT G D + G
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQ--CGYSFRYGDG-------SGTSGYYMTDTFYFDAILG 210
Query: 135 RNPGRAVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
+ + P +F C + L V GI G G+ K+++ SQL++ F+
Sbjct: 211 ESLVANSSAP-IVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSH 269
Query: 193 CLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
CL G VF G + + Y+PL ++ + Y + + S+
Sbjct: 270 CLKGDGSGGGVFVLGEI-------LVPGMVYSPLVPSQPH------------YNLNLLSI 310
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
VNG+ +PL+ + N GT + T T L Y + A ++++ ++ V P+
Sbjct: 311 GVNGQMLPLDAAVFEASN--TRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQL--VTPI 366
Query: 313 APFGA-CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI----GGDALCLGF 367
G C+ + T I + P + L + + Q+ + G C+GF
Sbjct: 367 ISNGEQCYLVS----TSISDMFPSVSLNFAGGASMM-LRPQDYLFHYGIYDGASMWCIGF 421
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
P ++G L++ + +DLA R+G++
Sbjct: 422 QKA---PEEQTILGDLVLKDKVFVYDLARQRIGWA 453
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 111/298 (37%), Gaps = 58/298 (19%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
Y I TPL+ + G +++W C + SST C S+
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 92 CN-LANA-KACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
C L N+ + C C + + T G + + L + + P+ F
Sbjct: 146 CQFLPNSIRTCNATGCVY--NYKYGSGYTAGYLATETLKVGD---------ASFPSVAFG 194
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL---SPFDDGAIVFGD 206
C +E G+ + GIAGLGR ++L QL +F+ CL S I+FG
Sbjct: 195 CSTE---NGVGNSTSGIAGLGRGALSLIPQLGVG-----RFSYCLRSGSAAGASPILFG- 245
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE---YFIGVTSVHVNGKAVPLNK 263
L N N++ TP F+ P+V Y++ +T + V +P+
Sbjct: 246 ----SLANL-TDGNVQSTP-----------FVNNPAVHPSYYYVNLTGITVGETDLPVTT 289
Query: 264 TLLSIDNEGV-GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
+ G+ GGT + + T L Y+ + QAF S V V CF+
Sbjct: 290 STFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFK 347
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 94/429 (21%), Positives = 160/429 (37%), Gaps = 94/429 (21%)
Query: 27 FRPK-------ALVLRVAKDTATLQYVTQIKQRTPLVPVK-------------------- 59
FRPK +++ AKD A LQ+++ + R +VP+
Sbjct: 51 FRPKEPLSWEESVLQMQAKDKARLQFLSSLVARKSVVPIASGRQIVQNPTYIVRAKIGTP 110
Query: 60 -----LTVHLGGNILWVDCE----------KGYVSSTNKTARCGSAQCNLANAKACGGGI 104
+ + ++ W+ C S+T K+ C +AQC CGGG+
Sbjct: 111 AQTMLMAMDTSSDVAWIPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGV 170
Query: 105 CGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVV 164
C + + ++ D +++ +TD VP + F C +Q G +
Sbjct: 171 CSF--NLTYGGSSLAANLSQDTITL-ATDA--------VPGYSFGC-----IQKATGGSL 214
Query: 165 GIAGLGRSKVALPSQLAAAFSL-KRKFALCLSPFD----DGAIVFGDGPYYDLNNFDVSK 219
GL S L+ +L + F+ CL F G++ G K
Sbjct: 215 PAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGP--------VGQPK 266
Query: 220 NLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIS 279
+KYTPL N PS+ YF+ + +V V + V + + + GT
Sbjct: 267 RIKYTPLLKNP--------RRPSL-YFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFD 317
Query: 280 TVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPV-VPQIDL 338
+ +T L T Y A+ AF + + + V + F C+ + P+ P I
Sbjct: 318 SGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYTV---------PIAAPTITF 368
Query: 339 VLQNKNVVWSIHGQNSMVQ-IGGDALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLAT 396
+ NV ++ N ++ G CL N + + VI Q +N+ L +D+
Sbjct: 369 MFTGMNV--TLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPN 426
Query: 397 SRLGFSDSL 405
SRLG + L
Sbjct: 427 SRLGVAREL 435
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 86/395 (21%), Positives = 150/395 (37%), Gaps = 78/395 (19%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEK--------------------GYVSSTNKT 84
Y T++K +P + + G +ILWV C G +++ + T
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 85 AR---CGS------AQCNLANAKACGGGI-CGAGPDNPISNTGTHGDIRIDVLSIQSTDG 134
C S AQC+ N CG G G +GT G D + G
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQ--CGYSFRYGDG-------SGTSGYYMTDTFYFDAILG 210
Query: 135 RNPGRAVTVPNFIFLCGSEFV--LQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFAL 192
+ + P +F C + L V GI G G+ K+++ SQL++ F+
Sbjct: 211 ESLVANSSAP-IVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSH 269
Query: 193 CLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSV 252
CL G VF G + + Y+PL ++ + Y + + S+
Sbjct: 270 CLKGDGSGGGVFVLGEI-------LVPGMVYSPLVPSQPH------------YNLNLLSI 310
Query: 253 HVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
VNG+ +PL+ + N GT + T T L Y + A ++++ ++ V P+
Sbjct: 311 GVNGQMLPLDAAVFEASN--TRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQL--VTPI 366
Query: 313 APFGA-CFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI----GGDALCLGF 367
G C+ + T I + P + L + + Q+ + G C+GF
Sbjct: 367 ISNGEQCYLVS----TSISDMFPSVSLNFAGGASMM-LRPQDYLFHYGIYDGASMWCIGF 421
Query: 368 VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
P ++G L++ + +DLA R+G++
Sbjct: 422 QKA---PEEQTILGDLVLKDKVFVYDLARQRIGWA 453
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 89/396 (22%), Positives = 146/396 (36%), Gaps = 77/396 (19%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC--------EKGYV-----SSTNKTARCGSA 90
+Y + TP L + G +++W+ C ++G V SST + C S
Sbjct: 85 EYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSP 144
Query: 91 QCNLANAKAC-GGGICGAGPDNPIS---NTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
QC C GG G G ++ + + G++ D L+ + V N
Sbjct: 145 QCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFAND--------TYVNNV 196
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA----- 201
CG + +GL G+ G+ R K+++ +Q+A A+ F CL +
Sbjct: 197 TLGCGRD--NEGLFDSAAGLLGVARGKISISTQVAPAYG--SVFEYCLGDRTSRSTRSSY 252
Query: 202 IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP- 260
+VFG P + +T L N PS+ Y++ + V G+ V
Sbjct: 253 LVFGRTP--------EPPSTAFTALLSNPR--------RPSL-YYVDMAGFSVGGERVTG 295
Query: 261 LNKTLLSIDNE-GVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP---FG 316
+ L++D G GG + + + Y AL AF + F
Sbjct: 296 FSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFD 355
Query: 317 ACFRLQDIGFTRIGPVVPQI--------DLVLQNKNVVWSIHGQNSMVQIGGDALCLGF- 367
AC+ L+ R P I D+ L +N + G + CLGF
Sbjct: 356 ACYDLRG----RPAASAPLIVLHFAGGADMALPPENYFLPVDGGRR--RAASYRRCLGFE 409
Query: 368 -VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
D G++ VIG Q + + FD+ R+GF+
Sbjct: 410 AADDGLS-----VIGNVQQQGFRVVFDVEKERIGFA 440
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 69/161 (42%), Gaps = 13/161 (8%)
Query: 245 YFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMP 304
Y++G+ + V G+ + + +T +D+ G GG + + T L++ +Y + AF
Sbjct: 11 YYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNVVRDAFVKGTK 70
Query: 305 KVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG-DAL 363
+ V+ F C+ L ++ VP + V+ + +N +V +
Sbjct: 71 DLLATNEVSLFDTCYDLS----SKTSVEVPTVAFHFGEGKVLV-LPAKNYLVPVDSVGTF 125
Query: 364 CLGFVDGGVNPRTS--IVIGARQLENNLLQFDLATSRLGFS 402
C F P S +IG Q + + FDLA S +GFS
Sbjct: 126 CFAFA-----PTMSSLSIIGNIQQQGTRVSFDLANSLVGFS 161
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 99/409 (24%), Positives = 159/409 (38%), Gaps = 91/409 (22%)
Query: 53 TPLVPVKLTVHLGGNILWVDCEKGYV-------------------SSTNKTARCGSAQC- 92
TP P+ + + G + WV C Y SS+++ C + C
Sbjct: 111 TPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFAAAVPVFHPKNSSSSRLVGCRNPSCL 170
Query: 93 --NLANAKACGGGICGAGPD-NPISNT-----------GTHGDIRIDVLSIQSTDGRNPG 138
+ A A C G + P SN T G + D L R PG
Sbjct: 171 WVHSAEHVAKCRAPCSRGANCTPASNVCPPYAVVYGSGSTAGLLIADTL-------RAPG 223
Query: 139 RAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL--SP 196
RAV+ F+ C V Q + G+AG GR ++P+QL + KF+ CL
Sbjct: 224 RAVS--GFVLGCSLVSVHQPPS----GLAGFGRGAPSVPAQLGLS-----KFSYCLLSRR 272
Query: 197 FDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNG 256
FDD A V G L + ++Y PL V +A+G +V Y++ ++ V V G
Sbjct: 273 FDDNAAVSGS-----LVLGGDNDGMQYVPL----VKSAAGDKQPYAVYYYLALSGVTVGG 323
Query: 257 KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA----MPKVARVAPV 312
KAV L + + G GG + + +T L+ ++++ + A +A + V
Sbjct: 324 KAVRLPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDVEEG 383
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIG------------- 359
CF L + +P++ L + V + +N V G
Sbjct: 384 LGLHPCFALPQGAKSM---ALPELSLHFKG-GAVMQLPLENYFVVAGRAPVPGAGAGAGA 439
Query: 360 GDALCLGFVDG-------GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
+A+CL V +I++G+ Q +N L+++DL RLGF
Sbjct: 440 AEAICLAVVTDFGGSGAGDEGGGPAIILGSFQQQNYLVEYDLEKERLGF 488
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 83/311 (26%), Positives = 125/311 (40%), Gaps = 72/311 (23%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILW---VDCEKGY----------VSSTNKTARCGSAQ 91
Y+ ++ TP V V G +++W + C Y SST GS
Sbjct: 59 YLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSES 118
Query: 92 CNLANAKACGGGICGAGPD-NPISNTGTHGDIRI-------DVLSIQSTDGRNPGRAVTV 143
C+ + +C PD N + T ++ D I + L++ ST G+ V +
Sbjct: 119 CSKLYSTSCS-------PDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGK----PVAL 167
Query: 144 PNFIFLCGSEFVLQGLASGV-----VGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
IF CG +GV +GI GLGR ++L SQ+ ++F K F+ CL PF
Sbjct: 168 KGVIFGCGHN------NNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKM-FSQCLVPFH 220
Query: 199 DGAIV-----FGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
+ FG G L N VS TPL ++K NT F YF+ + +
Sbjct: 221 TNPSITSPMSFGKGSEV-LGNGVVS-----TPL-VSK-NTHQAF-------YFVTLLGIS 265
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVA----RV 309
V +P N S++ G I + P T+L Y LV+ + KVA +
Sbjct: 266 VEDINLPFNDG-SSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRN---KVALDPIPI 321
Query: 310 APVAPFGACFR 320
P + C+R
Sbjct: 322 DPTLGYQLCYR 332
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 85/203 (41%), Gaps = 35/203 (17%)
Query: 175 ALPSQLAAAFSLKRKFALCL----SPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFINK 230
+L SQL+A + + FA C +P G +VFGD Y N D++ + ++N
Sbjct: 233 SLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFGDATYL---NGDMTPMVIAEFYYVNL 289
Query: 231 VNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETS 290
+ G +GEP ++ +N + +G GG I + + +V
Sbjct: 290 LGIGLG-VGEPRLD----------------INSSSFERKPDGSGGVIIDSGSTLSVFPPE 332
Query: 291 IYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVV---W 347
+Y+ + A + K ++P+ CF + R P+ P + L L++ ++ W
Sbjct: 333 VYEVVRNAVVDKLKKGYNISPLTSSPDCFEGK---IERDLPLFPTLVLYLESTGILNDRW 389
Query: 348 SIHGQNSMVQIGGDALCLGFVDG 370
SI +Q + CLGF G
Sbjct: 390 SI-----FLQRYDELFCLGFTSG 407
>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 521
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 101/261 (38%), Gaps = 54/261 (20%)
Query: 161 SGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKN 220
S G+ G+ ++ +Q+ KF+ C+S D ++ + ++ N
Sbjct: 142 SKTTGLMGMDLGSLSFSNQMRLP-----KFSYCISNKDSTGVLVLE----NIANPPRLGP 192
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
L YTPL + K + YF N K+ D+ G G T + +
Sbjct: 193 LHYTPL-VKKTT---------PLPYF--------NRNCCLFQKSAFLPDHTGAGQTMVDS 234
Query: 281 VNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGA-----------CFRLQDIGFTRI 329
+T L +Y AL FA + + P G CFR+ IG T
Sbjct: 235 ATQFTFLRQPVYTALKNEFAIQTKNI-----LTPLGDPKFVFQGVMDLCFRV-PIGSTL- 287
Query: 330 GPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDA------LCLGFVDGGVNPRTSIVIGAR 383
PV+P + L+ + + G+ + ++ A C F + + + +IG
Sbjct: 288 -PVLPVVTLMFDGAEL--RVTGERLLYKVSNVAKSNSWIYCFTFGNSDLLGIEAFIIGHH 344
Query: 384 QLENNLLQFDLATSRLGFSDS 404
N +++DLA SR+GFSD+
Sbjct: 345 HQRNVWMEYDLANSRIGFSDT 365
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 96/424 (22%), Positives = 160/424 (37%), Gaps = 92/424 (21%)
Query: 10 LCSLLIFSLSPITAQTSFRPKALV---LRVAKDTATLQ---YVTQIKQRTPLVPVKLTVH 63
+ S+L P+T + +PK + +A L Y+ + TP + + +
Sbjct: 61 IVSMLTSGAGPLTTRAKPKPKNRANPPVPIAPGRQILSIPNYIARAGLGTPAQTLLVAID 120
Query: 64 LGGNILWVDCE------------KGYVSSTNKTARCGSAQCNLANAKACGGGI---CGAG 108
+ WV C SST +T CGS QC + +C G+ CG
Sbjct: 121 PSNDAAWVPCSACAGCAASSPSFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGF- 179
Query: 109 PDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAG 168
++ + Q+ G++ ++ + N + + + L+ VV
Sbjct: 180 ------------NLTYAASTFQAVLGQD---SLALENNVVVSYTFGCLR-----VVN--- 216
Query: 169 LGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLNNFDVSKNLKYTPLFI 228
G S+ A A A L+ + AL L D G L K +K TPL
Sbjct: 217 -GNSRAA-----AGAHRLRPRAALLLV-ADQG----------HLGPIGQPKRIKTTPLLY 259
Query: 229 NKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLE 288
N PS+ Y++ + + V K V + ++ L+ + GT I +T L
Sbjct: 260 NP--------HRPSL-YYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLA 310
Query: 289 TSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ----DIGFTRIGPVVPQIDLVLQNKN 344
+Y A+ AF + + P+ F C+ + + F G V + L +N
Sbjct: 311 APVYAAVRDAFRGRV-RTPVAPPLGGFDTCYNVTVSVPTVTFMFAGAVA----VTLPEEN 365
Query: 345 VVWSIHGQNSMVQIGGDALCLGFVDG---GVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
V+ IH + G CL G GVN + V+ + Q +N + FD+A R+GF
Sbjct: 366 VM--IHSSS------GGVACLAMAAGPSDGVNAALN-VLASMQQQNQRVLFDVANGRVGF 416
Query: 402 SDSL 405
S L
Sbjct: 417 SREL 420
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 111/298 (37%), Gaps = 58/298 (19%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
Y I TPL+ + G +++W C + SST C S+
Sbjct: 86 YNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSF 145
Query: 92 CN-LANA-KACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFL 149
C L N+ + C C + + T G + + L + + P+ F
Sbjct: 146 CQFLPNSIRTCNATGCVY--NYKYGSGYTAGYLATETLKVGD---------ASFPSVAFG 194
Query: 150 CGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCL---SPFDDGAIVFGD 206
C +E G+ + GIAGLGR ++L QL +F+ CL S I+FG
Sbjct: 195 CSTE---NGVGNSTSGIAGLGRGALSLIPQLGVG-----RFSYCLRSGSAAGASPILFG- 245
Query: 207 GPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE---YFIGVTSVHVNGKAVPLNK 263
L N N++ TP F+ P+V Y++ +T + V +P+
Sbjct: 246 ----SLANL-TDGNVQSTP-----------FVNNPAVHPSYYYVNLTGITVGETDLPVTT 289
Query: 264 TLLSIDNEGV-GGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFR 320
+ G+ GGT + + T L Y+ + QAF S V V CF+
Sbjct: 290 STFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGLDLCFK 347
>gi|222623568|gb|EEE57700.1| hypothetical protein OsJ_08178 [Oryza sativa Japonica Group]
Length = 441
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 56/118 (47%), Gaps = 10/118 (8%)
Query: 285 TVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKN 344
T L TS+Y AL +A A+AM +R + + CF+ Q +R+ P + +
Sbjct: 329 TRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQ---ASRVS--APAVTMSFAG-G 382
Query: 345 VVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
+ QN +V + CL F R++ +IG Q + + +D+ +SR+GF+
Sbjct: 383 AALKLSAQNLLVDVDDSTTCLAFAPA----RSAAIIGNTQQQTFSVVYDVKSSRIGFA 436
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 92/237 (38%), Gaps = 32/237 (13%)
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVF 204
+F C + + GI GLGR ++++ QL + F+LC D GA+V
Sbjct: 192 VFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVL 251
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
G P F S L+ S Y I + +HV GKA+ ++
Sbjct: 252 GGVPTPSDMVFSRSDPLR-------------------SPYYNIELKEIHVAGKALRVDSR 292
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPF--GACFRLQ 322
+ + GT + + Y L + A A S + + ++ P CF
Sbjct: 293 IFDSKH----GTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICFAGA 348
Query: 323 DIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMV---QIGGDALCLGFVDGGVNPRT 376
+++ V P +D+V N + S+ +N + ++ G A CLG G +P T
Sbjct: 349 RRNVSKLHEVFPDVDMVFGNGQKL-SLTPENYLFRHSKVDG-AYCLGVFQNGKDPTT 403
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 81/196 (41%), Gaps = 31/196 (15%)
Query: 22 TAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILW---VDCEKGY- 77
++ S++P + V+ +Y+ ++ TP + + G +++W + C K Y
Sbjct: 39 SSHDSYKPSTIQSPVSA--YDCEYLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYK 96
Query: 78 ---------VSSTNKTARCGSAQCNLANAKACGGG--ICG---AGPDNPISNTGTHGDIR 123
SS+ CG+ CN ++ C C + DN I T G +
Sbjct: 97 QQNPMFDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYADNSI----TQGVLA 152
Query: 124 IDVLSIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAA 183
+ L++ ST G V IF CG G +G+ GLGR ++L SQ+ ++
Sbjct: 153 QETLTLTSTTGE----PVAFQGIIFGCGHN--NSGFNDREMGLIGLGRGPLSLISQIGSS 206
Query: 184 FSL-KRKFALCLSPFD 198
F+ CL PF+
Sbjct: 207 LGAGGNMFSQCLVPFN 222
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 109/286 (38%), Gaps = 35/286 (12%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTAR 86
+L+YV + TP V + + G ++ WV C+ SS+ +
Sbjct: 168 SLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVP 227
Query: 87 CGSAQCNLANAKACGGGICG-AGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPN 145
C S C A A G G G +G + G R + ST+ V V +
Sbjct: 228 CDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKPGVVVAD 287
Query: 146 FIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFG 205
F F CG G G+ GLG + +L SQ ++ F F+ CL P GA
Sbjct: 288 FGFGCGDH--QHGPYEKFDGLLGLGGAPESLVSQTSSQFG--GPFSYCLPPTSGGAGFLT 343
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
G + ++ + L +TP+ ++ + F Y + +T + V G + + +
Sbjct: 344 LGAPPNSSSSTAASGLSFTPM--RRLPSVPTF-------YIVTLTGISVGGAPLAIPPSA 394
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
S G I + T L + Y AL AF SAM + + P
Sbjct: 395 FS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPP 434
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 92/401 (22%), Positives = 149/401 (37%), Gaps = 46/401 (11%)
Query: 14 LIFSLSPITAQTSFRPKALVLRVAK--DTATLQYVTQIKQRTPLVPVKLTVHLGGNILWV 71
LI LSP A +S+ + V + + +Y +I +P + + G +I+WV
Sbjct: 109 LIRRLSPRDATSSYSVEEFGAEVVSGMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWV 168
Query: 72 DCEKGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTH-GDIRIDVLSIQ 130
C+ A C +C I N G H G R +V+
Sbjct: 169 QCQPCTQCYHQTDPVFDPADSASFMGVPCSSSVC-----ERIENAGCHAGGCRYEVMYGD 223
Query: 131 STDGRNP--------GRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAA 182
+ + GR V V N CG +G+ G G+ GLG ++L QL
Sbjct: 224 GSYTKGTLALETLTFGRTV-VRNVAIGCGHR--NRGMFVGAAGLLGLGGGSMSLVGQLGG 280
Query: 183 AFSLKRKFALCLSPFDD-GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEP 241
+ L D G++ FG G + + PL N P
Sbjct: 281 QTGGAFSYCLVSRGTDSAGSLEFGRGA--------MPVGAAWIPLIRNPR--------AP 324
Query: 242 SVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFAS 301
S Y+I ++ V V G VP+++ + ++ G GG + T T + T Y A AF
Sbjct: 325 SF-YYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVAYVAFRDAFIG 383
Query: 302 AMPKVARVAPVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGG- 360
+ R + V+ F C+ L GF + VP + ++ ++ +N ++ +
Sbjct: 384 QTGNLPRASGVSIFDTCYNLN--GFVSV--RVPTVSFYFAGGPIL-TLPARNFLIPVDDV 438
Query: 361 DALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
C F +P +IG Q E + FD A +GF
Sbjct: 439 GTFCFAFA---ASPSGLSIIGNIQQEGIQISFDGANGFVGF 476
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 110/287 (38%), Gaps = 42/287 (14%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE---------------KGYVSSTNKTAR 86
+L+YV + TP V + + G ++ WV C+ SST
Sbjct: 124 SLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYAPVP 183
Query: 87 CGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNF 146
C S C A G C + G R + + ST+ V+V +F
Sbjct: 184 CDSKACKDLVPDAYDHG-CTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTLSPQVSVKDF 242
Query: 147 IFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--GAIVF 204
F CG V QG G+ GLG + +L SQ A + F+ CL P + G +
Sbjct: 243 GFGCG--LVQQGTFDLFDGLLGLGGAPESLVSQTAETYG--GAFSYCLPPGNSTTGFLAL 298
Query: 205 GDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKT 264
G NN D + L +TPL L E + Y + +T V V GK + + T
Sbjct: 299 G----APTNNNDTAGFL-FTPLH---------SLPEQATFYLVNLTGVSVGGKPLDIPPT 344
Query: 265 LLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAP 311
+LS GG I + T L + Y AL AF +AM + P
Sbjct: 345 VLS------GGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPP 385
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 92/392 (23%), Positives = 148/392 (37%), Gaps = 91/392 (23%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y+ + TP P+ G N++W C+ SST K C S+
Sbjct: 93 EYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSS 152
Query: 91 QCN-LANAKACGGGICGAGPDNPIS-------NTGTHGDIRIDVLSIQSTDGRNPGRAVT 142
QC L N +C + D S + T G +D L++ STD R V
Sbjct: 153 QCTALENQASC------STEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTD----NRPVQ 202
Query: 143 VPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD--- 199
+ N I CG + + G+ GLG V+L QL S+ KF+ CL P +D
Sbjct: 203 LKNIIIGCGQNNAVT-FRNKSSGVVGLGGGAVSLIKQLGD--SIDGKFSYCLVPENDQTS 259
Query: 200 ------GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVH 253
A+V G G VS TPL + +T Y++ + S+
Sbjct: 260 KINFGTNAVVSGPGT--------VS-----TPLVVKSRDTF----------YYLTLKSIS 296
Query: 254 VNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAM----PKVARV 309
V K + + D+ G I + T+L Y + A AS + K R+
Sbjct: 297 VGS------KNMQTPDSNIKGNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERI 350
Query: 310 APVAPFGACFRLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVD 369
+ A L +P I + + +V ++ NS ++ D +CL F
Sbjct: 351 GSSLCYNATADLN----------IPVITMHFEGADV--KLYPYNSFFKVTEDLVCLAF-- 396
Query: 370 GGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
G++ + + G +N L+ +D A+ + F
Sbjct: 397 -GMSFYRNGIYGNVAQKNFLVGYDTASKTMSF 427
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 84/191 (43%), Gaps = 32/191 (16%)
Query: 27 FRPKALVL-----RVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE------K 75
FRP A+ R+ +Y+ + TP VPV V G ++ W C K
Sbjct: 71 FRPTAMTSDGIQSRIVPSAG--EYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK 128
Query: 76 GYV-------SSTNKTARCGSAQC-NLANAKACGGGICGAGPDNPISNTGTHGDIRIDVL 127
V SST + + CG++ C L ++C + + T G++ + L
Sbjct: 129 QVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETL 188
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCG--SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFS 185
++ ST G+ V+ P F F CG S + +SG+VG+ G +++L SQL + +
Sbjct: 189 TVDST----AGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGG---GELSLISQLKS--T 239
Query: 186 LKRKFALCLSP 196
+ F+ CL P
Sbjct: 240 INGLFSYCLLP 250
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 118/284 (41%), Gaps = 49/284 (17%)
Query: 141 VTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD-- 198
+ V +FIF CG +GL GV G+ GLGRS ++L SQ + F F+ CL +
Sbjct: 159 ILVKDFIFGCGRNN--KGLFGGVSGLMGLGRSDLSLISQTSGIFG--GVFSYCLPSTERK 214
Query: 199 -DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVE--YFIGVTSVHVN 255
G+++ G N V +N +P+ K+ + P + YFI +T + +
Sbjct: 215 GSGSLILG-------GNSSVYRN--SSPISYAKM------IENPQLYNFYFINLTGISIG 259
Query: 256 G---KAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPV 312
G +A + + + +D+ GT I+ + P +IYKAL F
Sbjct: 260 GVALQAPSVGPSRILVDS----GTVITRLPP------TIYKALKAEFLKQFTGFPPAPAF 309
Query: 313 APFGACFRLQDIGFTRIGPVVPQIDLVLQ-NKNVVWSIHGQNSMVQIGGDALCLGFVDGG 371
+ CF L + + +P I + + N + + G V+ +CL
Sbjct: 310 SILDTCFNLS--AYQEVD--IPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALASLE 365
Query: 372 VNPRTSIVIGARQLENNLLQFDLATSRLGFSDSLLFERATCTFN 415
+I +G Q +N + +D +++GF+ TC+F
Sbjct: 366 YQDEVAI-LGNYQQKNLRVIYDTKETKVGFA------LETCSFE 402
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 140/383 (36%), Gaps = 74/383 (19%)
Query: 42 TLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGY-----------VSSTNKTARCGSA 90
TL YV + TP + + + G ++ WV C SST C SA
Sbjct: 122 TLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGAGSSLFFDPGKSSTYTPFSCSSA 181
Query: 91 QCNLANAKACGGGI---C------GAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
C + G + C G G + TGT+G D L++ ST+
Sbjct: 182 ACTRLEGRDNGCSLNSTCQYTVRYGDGSN----TTGTYGS---DTLALNSTE-------- 226
Query: 142 TVPNFIFLCG-SEFVLQGLASGVV-GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD 199
V NF F C + +GL G+ GLG +L SQ AA + F+ CL
Sbjct: 227 KVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYG--SAFSYCLP---- 280
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
A G + L + TP+F ++ YF+ + ++V G V
Sbjct: 281 -ATTRSSG-FLTLGASTGTSGFVTTPMFRSR---------RAPTFYFVILQGINVGGDPV 329
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
++ T+ + + GT I+ + P Y AL AF + M + R + CF
Sbjct: 330 AISPTVFAAGSIMDSGTIITRLPPRA------YSALSAAFRAGMRRYPRARAFSILDTCF 383
Query: 320 RLQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSI 378
FT V +P ++LV VV + M G L GG+
Sbjct: 384 D-----FTGQDNVSIPAVELVFSGGAVV-DLDADGIMY---GSCLAFAPATGGIGS---- 430
Query: 379 VIGARQLENNLLQFDLATSRLGF 401
+IG Q + D+ S LGF
Sbjct: 431 IIGNVQQRTFEVLHDVGQSVLGF 453
>gi|367068400|gb|AEX13224.1| hypothetical protein CL3308Contig1_01 [Pinus taeda]
Length = 80
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 40/65 (61%), Gaps = 6/65 (9%)
Query: 221 LKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKIST 280
L +TPLFIN++N ++ S Y++G+ +V + GK + L +LLS D+ G GGT I +
Sbjct: 19 LTWTPLFINRINPSA------STFYYLGLQAVSIGGKRLTLPSSLLSFDSHGNGGTIIDS 72
Query: 281 VNPYT 285
+T
Sbjct: 73 GTSFT 77
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 87/391 (22%), Positives = 155/391 (39%), Gaps = 66/391 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCG-------SAQCNLANA 97
Y T++K TP + + G +ILWV+C S+ ++++ G + + A
Sbjct: 78 YYTKVKMGTPPKEFNVQIDTGSDILWVNCNT--CSNCPQSSQLGIELNFFDTVGSSTAAL 135
Query: 98 KACGGGIC------GAGPDNPISN-----------TGTHGDIRIDVLSIQSTDGRNPGRA 140
C IC A +P N +GT G D + G+ P
Sbjct: 136 IPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVN 195
Query: 141 VTVPNFIFLCG--SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD 198
+ +F C L V GI G G +++ SQL++ + F+ CL
Sbjct: 196 SSA-TIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDG 254
Query: 199 DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
DG V G + ++ Y+PL ++ + Y + + S+ VNG+
Sbjct: 255 DGGGVLVLGEILE-------PSIVYSPLVPSQPH------------YNLNLQSIAVNGQL 295
Query: 259 VPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGAC 318
+P+N + SI N GGT + L Y LV A +A+ + AR + C
Sbjct: 296 LPINPAVFSISNNR-GGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTN-SKGNQC 353
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQN------KNVVWSIHGQNSMVQIGGDALCLGFVDGGV 372
+ + T IG + P + L + K + +H N + G + C+GF
Sbjct: 354 YLVS----TSIGDIFPSVSLNFEGGASMVLKPEQYLMH--NGYLD-GAEMWCIGFQK--- 403
Query: 373 NPRTSIVIGARQLENNLLQFDLATSRLGFSD 403
+ ++G L++ ++ +D+A R+G+++
Sbjct: 404 FQEGASILGDLVLKDKIVVYDIAQQRIGWAN 434
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 119/281 (42%), Gaps = 67/281 (23%)
Query: 165 GIAGLGRSKVA-LPSQLAAAFSLKRKFALCLSPFD-DGAIVFGDGPYYDLNNFDVSKNLK 222
G+ GLG ++ A +P+QLA L R F+LC F+ GA+ FG L + L
Sbjct: 246 GLIGLGNNQFASIPNQLADTHGLPRVFSLCFGSFEGGGALSFG-----RLPATPHTPPLV 300
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
YT + +N+ + A Y++ T+ G + L++ GT + +
Sbjct: 301 YTDMRVNEAHPA----------YYVVSTAAMKIGDVAVATPSDLAVGY----GTVMDSGT 346
Query: 283 PYTVLETSIYKALVQAF-------ASAMPKVARVA---PVAPFGACFRLQDIGFTRIGPV 332
+T + T ++ A A A K+A+V P P CF+ + G T I P+
Sbjct: 347 TFTYVPTKVFHATAAALDAAVTTNAKPEKKLAKVPGPDPSYPDDVCFQRE--GATEIEPI 404
Query: 333 VPQID------------------LVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNP 374
V + LVL N ++ +HG+ A CLG +D N
Sbjct: 405 VTMANLGEYYPPLTIAFDGEGASLVLPPSNYLF-VHGKKP------GAFCLGVMD---NK 454
Query: 375 RTSIVIGARQLENNLLQFD--LATSRLGFS----DSLLFER 409
+ +IG + + L+++D + R+GF+ D+LL ++
Sbjct: 455 QQGTLIGGISVRDVLVEYDKTVGGGRIGFAATDCDALLRDQ 495
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 96/400 (24%), Positives = 151/400 (37%), Gaps = 70/400 (17%)
Query: 44 QYVTQIKQRTPLVP-VKLTVHLGGNILWVDCEKGYV-SSTNKTARCGSAQCNLANAKACG 101
+Y+ + TP V LT+ G +++W C + T ++Q LA C
Sbjct: 99 EYLIHLSIGTPRPQRVALTLDTGSDLVWTQCACHVCFAQPFPTFDALASQTTLA--VPCS 156
Query: 102 GGICGAGPDNPIS-----------------NTGTHGDIRIDVLSIQSTDGRNPGRA---V 141
IC +G P+S + T G I D + +S G N +A V
Sbjct: 157 DPICTSG-KYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGV 215
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG- 200
VPN F CG ++ S GIAG R ++LPSQL A +F+ C + D
Sbjct: 216 AVPNVRFGCG-QYNKGIFKSNESGIAGFSRGPMSLPSQLKVA-----RFSHCFTAIADAR 269
Query: 201 --AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKA 258
+ G P D + ++ TP F N ++G L Y++ + + V
Sbjct: 270 TSPVFLGGAPGPDNLGAHATGPVQSTP-FAN----SNGSL------YYLTLKGITVGKTR 318
Query: 259 VPLNKTLLSIDNEGV--------GGTKISTVNPYTVLETSIYKALVQAFAS--AMPKVAR 308
+PLN + G GT I T L +Y++L AF + +P
Sbjct: 319 LPLNALAFAGKGTGSGSGGTIIDSGTGIRT------LPGPMYRSLRAAFVARVKLPVANE 372
Query: 309 VAPVAPFGACFR-LQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQI------GGD 361
A A CF + P +VL W + ++ ++ + G
Sbjct: 373 SAADAESTLCFEAARSASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGS 432
Query: 362 ALCLGFVDGGVNPRTSIVIGARQLENNLLQFDLATSRLGF 401
LCL G + T +IG Q +N + +DL ++L F
Sbjct: 433 GLCLVMNSAGDSDLT--IIGNFQQQNMHVAYDLEKNKLVF 470
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 147/380 (38%), Gaps = 58/380 (15%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSA 90
+Y +I TP + L + G +ILW+ C Y SST T C +
Sbjct: 57 EYFIRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTR 116
Query: 91 QCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
QC + C C D + T G+ D +S+ ST G V + C
Sbjct: 117 QCLNLDIGTCQANKCLYQVDYG-DGSFTTGEFGTDDVSLNSTSGVG---QVVLNKIPLGC 172
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLS-----PFDDGAIVFG 205
G + +G G G+ GLG+ ++ P+Q+ +F+ CL+ + ++VFG
Sbjct: 173 GHDN--EGYFVGAAGLLGLGKGPLSFPNQVDPQNG--GRFSYCLTDRETDSTEGSSLVFG 228
Query: 206 DGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTL 265
+ ++TP N + P+ Y++ +T + V G + + +
Sbjct: 229 EAAV-------PPAGARFTPQDSN--------MRVPTF-YYLKMTGISVGGTILTIPTSA 272
Query: 266 LSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQDIG 325
+D+ G GG I + T L+ + Y +L AF + +A A + F C+ L +
Sbjct: 273 FQLDSLGNGGVIIDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLA 332
Query: 326 FTRIGPVVPQI----DLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIVIG 381
+ V DL L N + + N+ CL F G P +IG
Sbjct: 333 SVDVPTVTLHFQGGTDLKLPASNYLIPVDNSNT--------FCLAFA-GTTGPS---IIG 380
Query: 382 ARQLENNLLQFDLATSRLGF 401
Q + + +D +++GF
Sbjct: 381 NIQQQGFRVIYDNLHNQVGF 400
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 91/406 (22%), Positives = 157/406 (38%), Gaps = 65/406 (16%)
Query: 18 LSPITAQTSFRPKALVLRVAKDTATLQYVTQIKQRTPLVPVKLTVHLGGNILWVDCE--- 74
LS + A+ S P A ++ ++ Y+ + K TP + + + ++ W+ C
Sbjct: 12 LSSLVARKSVVPIASGRQIVQNPT---YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCL 68
Query: 75 -------KGYVSSTNKTARCGSAQCNLANAKACGGGICGAGPDNPISNTGTHGDIRIDVL 127
S+T K+ C +AQC CGGG+C + + ++ D +
Sbjct: 69 GCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSF--NLTYGGSSLAANLSQDTI 126
Query: 128 SIQSTDGRNPGRAVTVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSL- 186
++ +TD VP + F C +Q G + GL S L+ +L
Sbjct: 127 TL-ATDA--------VPGYSFGC-----IQKATGGSLPAQGLLGLGRGPLSLLSQTQNLY 172
Query: 187 KRKFALCLSPFD----DGAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPS 242
+ F+ CL F G++ G K +KYTPL N PS
Sbjct: 173 QSTFSYCLPSFKSLNFSGSLRLGP--------VGQPKRIKYTPLLKNP--------RRPS 216
Query: 243 VEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASA 302
+ YF+ + +V V + V + + + GT + +T L T Y A+ AF +
Sbjct: 217 L-YFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNR 275
Query: 303 MPKVARVAPVAPFGACFRLQDIGFTRIGPV-VPQIDLVLQNKNVVWSIHGQNSMVQ-IGG 360
+ + V + F C+ + P+ P I + NV ++ N ++ G
Sbjct: 276 VGRNLTVTSLGGFDTCYTV---------PIAAPTITFMFTGMNV--TLPPDNLLIHSTAG 324
Query: 361 DALCLGFVDGGVNPRTSI-VIGARQLENNLLQFDLATSRLGFSDSL 405
CL N + + VI Q +N+ L +D+ SRLG + L
Sbjct: 325 STTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVAREL 370
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 46.6 bits (109), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 99/441 (22%), Positives = 160/441 (36%), Gaps = 84/441 (19%)
Query: 7 FLLLCSLLIFSLSPITAQTSFRPKALVLRV-------AKDTATLQYVTQIKQRTPLVPVK 59
F + C SLS + A R +L+ V + A Y +I TP
Sbjct: 38 FNVKCKYQDRSLSALKAHDYRRQLSLLAGVDLPLGGSGRPDAVGLYYAKIGIGTPPKNYY 97
Query: 60 LTVHLGGNILWVDCEKGYV------------------SSTNKTARCGSAQCNLANAKACG 101
L V G +I+WV+C + SS+ K C C N
Sbjct: 98 LQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLVPCDQEFCKEINGGLLT 157
Query: 102 GGICGAGPDNPI-----SNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLCG---SE 153
G C A P + T G D++ G + + +F CG S
Sbjct: 158 G--CTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSAN-GSIVFGCGARQSG 214
Query: 154 FVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGAIVFGDGPYYDLN 213
+ + GI G G++ ++ SQLA++ +K+ FA CL+ + G I F G
Sbjct: 215 DLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGGI-FAIGHV---- 269
Query: 214 NFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI--DNE 271
V + TPL ++ + + VN AV + T LS+ D
Sbjct: 270 ---VQPKVNMTPLLPDQPHYS-------------------VNMTAVQVGHTFLSLSTDTS 307
Query: 272 GVG---GTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACFRLQ---DIG 325
G GT I + L IY+ LV S P + +V + CF+ D G
Sbjct: 308 AQGDRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDL-KVQTLHDEYTCFQYSESVDDG 366
Query: 326 FTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTS---IVIGA 382
F P + +N + ++ + + + C+G+ + G R S ++G
Sbjct: 367 F-------PAVTFFFEN-GLSLKVYPHDYLFP-SVNFWCIGWQNSGTQSRDSKNMTLLGD 417
Query: 383 RQLENNLLQFDLATSRLGFSD 403
L N L+ +DL +G+++
Sbjct: 418 LVLSNKLVFYDLENQAIGWAE 438
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 46.6 bits (109), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 83/378 (21%), Positives = 143/378 (37%), Gaps = 61/378 (16%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCE-------------KGYVSSTNKTARCGSAQ 91
Y T++ TP L V G + +V C + +SST + +C +
Sbjct: 77 YTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCNPS- 135
Query: 92 CNLANA-KACGGGICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAVTVPNFIFLC 150
CN + K C A + + G I DV+S + P RAV F C
Sbjct: 136 CNCDDEGKQCTYERRYA------EMSSSSGVIAEDVVSFGNESELKPQRAV------FGC 183
Query: 151 GSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGP 208
+ + GI GLGR ++++ QL + F+LC D GA+V G
Sbjct: 184 ENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLG--- 240
Query: 209 YYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSI 268
+ P N V + S P Y I + +HV GK + L +
Sbjct: 241 -----------QISPPP---NMVFSHSNPYRSPY--YNIELKELHVAGKPLKLKPKVFDE 284
Query: 269 DNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAP--FGACFRLQDIGF 326
+ GT + + Y + + AL A + + ++ P CF
Sbjct: 285 KH----GTVLDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREV 340
Query: 327 TRIGPVVPQIDLVLQNKNVVWSIHGQNSM---VQIGGDALCLGFVDGGVNPRTSIVIGAR 383
+ + V P++++V + + S+ +N + ++ G A CLG G + + ++G
Sbjct: 341 SHLSKVFPEVNMVFGSGQKL-SLSPENYLFRHTKVSG-AYCLGIFQNGND--LTTLLGGI 396
Query: 384 QLENNLLQFDLATSRLGF 401
+ N L+ +D ++GF
Sbjct: 397 VVRNTLVTYDRENDKIGF 414
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 46.6 bits (109), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 96/407 (23%), Positives = 150/407 (36%), Gaps = 100/407 (24%)
Query: 44 QYVTQIKQRTPLVPVKLTVHLGGNILWVDC---EKGYV----------SSTNKTARCGSA 90
+Y I P + + G +++W+ C + Y S T++ C S
Sbjct: 91 EYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNSKTHRRIPCASP 150
Query: 91 QCN----LANAKACGGG-----ICGAGPDNPISNTGTHGDIRIDVLSIQSTDGRNPGRAV 141
QC A GG + G G + + GD+ D L + D R
Sbjct: 151 QCRGVLRYPGCDARTGGCVYMVVYGDG-------SASSGDLATDTL-VLPDDTR------ 196
Query: 142 TVPNFIFLCGSEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDGA 201
V N CG + +GL + G+ G GR +++ P+QLA A+ F+ CL A
Sbjct: 197 -VHNVTLGCGHDN--EGLLASAAGLLGAGRGQLSFPTQLAPAYG--HVFSYCLGDRMSRA 251
Query: 202 ------IVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVN 255
+VFG P + +TPL N PS+ Y++ + V
Sbjct: 252 RNSSSYLVFGRTPEL--------PSTAFTPLRTNPR--------RPSL-YYVDMVGFSVG 294
Query: 256 GKAVP--LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAF-----ASAMPKVAR 308
G+ V N +L G GG + + + Y A+ AF A+ M ++
Sbjct: 295 GERVAGFSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRN 354
Query: 309 VAPVAPFGACFRLQDIGFTRIGPVVPQI--------DLVLQNKNVVWSIHGQNSMVQIGG 360
V F C+ + G G VP I D+ L N + + +GG
Sbjct: 355 KFSV--FDTCYDVHGNG-PGTGVRVPSIVLHFAAAADMALPQANYLIPV--------VGG 403
Query: 361 DA---LCLGF--VDGGVNPRTSIVIGARQLENNLLQFDLATSRLGFS 402
D CLG D G+N V+G Q + + FD+ R+GF+
Sbjct: 404 DRRTYFCLGLQAADDGLN-----VLGNVQQQGFGVVFDVERGRIGFT 445
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 72/323 (22%), Positives = 120/323 (37%), Gaps = 65/323 (20%)
Query: 45 YVTQIKQRTPLVPVKLTVHLGGNILWVDCEKGYVSSTNKTARCGSAQCNLANAK------ 98
+ T + TP + + G ++ WV C+ + T T + ++ N K
Sbjct: 103 HYTTVSLGTPGKKFLVALDTGSDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSR 162
Query: 99 --ACGGGIC-------GAGPDNPI------SNTGTHGDIRIDVLSIQSTDGRNPGRAVTV 143
C +C G + P + T T G + DVL + + D R V
Sbjct: 163 KVTCDNSLCAHRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFVEAYV 222
Query: 144 PNFIFLCGSEFVLQGLASGVV---GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDDG 200
F CG V G + G+ GLG K+++PS L+ F++C P G
Sbjct: 223 ---TFGCGQ--VQTGSFLDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIG 277
Query: 201 AIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVP 260
I FGD S + + TP +N ++ Y I VT V V +
Sbjct: 278 RISFGD---------KGSPDQEETPFNLNALHPT----------YNITVTQVRVGTTLID 318
Query: 261 LNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVA--PFGAC 318
L+ T L + +T L IY ++++F S + +R P + PF C
Sbjct: 319 LDFTAL-----------FDSGTSFTYLVDPIYTNVLKSFHSQA-QDSRRPPDSRIPFEFC 366
Query: 319 FRLQDIGFTRIGPVVPQIDLVLQ 341
+ D+ ++P + L ++
Sbjct: 367 Y---DMSPGENTSLIPSMSLTMK 386
>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 242
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 87/219 (39%), Gaps = 32/219 (14%)
Query: 165 GIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFD--DGAIVFGDGPYYDLNNFDVSKNLK 222
GI GLGR ++++ QL + F+LC D GA+V G P F S L+
Sbjct: 45 GIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLR 104
Query: 223 YTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAVPLNKTLLSIDNEGVGGTKISTVN 282
S Y I + +HV GKA+ ++ + + GT + +
Sbjct: 105 -------------------SPYYNIELKEIHVAGKALRVDSRIFDSKH----GTVLDSGT 141
Query: 283 PYTVLETSIYKALVQAFASAMPKVARVAPVAPF--GACFRLQDIGFTRIGPVVPQIDLVL 340
Y L + A A S + + ++ P CF +++ V P +D+V
Sbjct: 142 TYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDVDMVF 201
Query: 341 QNKNVVWSIHGQNSMV---QIGGDALCLGFVDGGVNPRT 376
N + S+ +N + ++ G A CLG G +P T
Sbjct: 202 GNGQKL-SLTPENYLFRHSKVDG-AYCLGVFQNGKDPTT 238
>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 289
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 109/262 (41%), Gaps = 46/262 (17%)
Query: 143 VPNFIFLCG-SEFVLQGLASGVVGIAGLGRSKVALPSQLAAAFSLKRKFALCLSPFDD-- 199
V NF F CG + ++GL GV+G LGR + +L ++ FS CL
Sbjct: 66 VQNFYFGCGHGKHAVRGLFDGVLG---LGRLRESLGARYGGVFSY------CLPSVSSKP 116
Query: 200 GAIVFGDGPYYDLNNFDVSKNLKYTPLFINKVNTASGFLGEPSVEYFIGVTSVHVNGKAV 259
G + G G +TP+ G+P+ + + ++V GK +
Sbjct: 117 GFLALGAG--------KNPSGFVFTPM--------GTVPGQPTFST-VTLAGINVGGKKL 159
Query: 260 PLNKTLLSIDNEGVGGTKISTVNPYTVLETSIYKALVQAFASAMPKVARVAPVAPFGACF 319
L + S GG + + T L+++ Y+AL AF AM + R+ P C+
Sbjct: 160 DLRPSAFS------GGMIVDSGTVITGLQSTAYRALRSAFRKAM-EAYRLLPNGDLDTCY 212
Query: 320 RLQDIGFTRIGPVVPQIDLVLQNKNVVWSIHGQNSMVQIGGDALCLGFVDGGVNPRTSIV 379
L G+ + VVP+I L + ++ N ++ G CL F + G + ++
Sbjct: 213 NL--TGYKNV--VVPKIALTFTGGATI-NLDVPNGILVNG----CLAFAESGPDGSAGVL 263
Query: 380 IGARQLENNLLQFDLATSRLGF 401
Q +L FD +TS+ GF
Sbjct: 264 GNVNQRAFEVL-FDTSTSKFGF 284
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.138 0.409
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,520,986,839
Number of Sequences: 23463169
Number of extensions: 278533972
Number of successful extensions: 622498
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 350
Number of HSP's successfully gapped in prelim test: 1107
Number of HSP's that attempted gapping in prelim test: 619300
Number of HSP's gapped (non-prelim): 1662
length of query: 420
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 275
effective length of database: 8,957,035,862
effective search space: 2463184862050
effective search space used: 2463184862050
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)